Practical Assessment 2 Data Wrangling

Practical Assessment 2 Data Wrangling

Practical Assessment 2 Data Wrangling

Assessment type:Written report (PDF document) using R Markdown
Due date:26th May 2022, 5 pm Melbourne time
Weighting:30%
Word limit:Maximum 25 pages
Feedback mode:Feedback will be provided using Canvas marking tool and general text comments.

Group assessment

You will work on this assessment in a team of up to three students. You will select your own team member. Although you will work on the assessment together as a team, one of the team members will submit the report (PDF document) using R Markdown (otherwise there will be mixed up). Please write the details of your team members at the beginning of the report. If you prefer to work individually, that is also fine.

Purpose

The purpose of this assessment is to put to work the tools and knowledge that you gain throughout this course. This provides you with multiple benefits.

  • It will provide you with more experience using data preprocessing tools on real life data sets.
  • It helps you to self-direct your learning and interests to find unique and creative ways to wrangle your data.
  • It starts to build your data analytics portfolio. Portfolios (or e-portfolios) are a great way to show potential employers what you are capable of.

Overview

This assessment requires you to find some open data, and use your knowledge, skills gained during the course to preprocess the data. You will create a report using R Markdown to explain the steps taken by you in order to perform the data preprocessing tasks.

Assessment criteria and weighting

Please see the marking rubric to know the assessment criteria and weightage. Course learning outcomes This assessment is linked to the following course learning outcomes:

Course Learning outcomes

This assessment is linked to the following course learning outcomes:

  • Accurately, logically and ethically combine data from multiple sources to make suitable for statistical analysis and draw valid interpretations.
    • Articulate how data meets the best practice standards (e.g. tidy data principles).
    • Select, perform and justify data validation processes for raw datasets.
    • Use leading open source software (e.g. R) for reproducible, automated data processing.

Assignment data sources

Assessment 2 is open-ended however you are required to find suitable datasets that fulfil the minimum requirements given below. All the datasets that you use in this Assessment must be open and ideally have a Creative Commons Licence. This will ensure you can share your work with anyone provided you make proper attribution. If you’re not sure if data is Open, contact the provider, read the documentation or post on the discussion board and I will investigate. Some open data sources are provided below, but I encourage you to find others:

Minimum requirements for the data sets

Considering this is a data preprocessing class, I do expect your data set to have certain requirements so that you can demonstrate your knowledge of data preprocessing. The following are the minimum requirements for the data sets that I will look for:

  1. At least two data sets should be merged to create your assessment data (for example you can take crime statistics for the cities/states in Australia and merge this data set with cities/states’ per capita income data).
  • Your data set should include multiple data types (numeric, character, factor, etc).
  • Your data set should include variables suitable for data type conversions so that you should be able to apply the required data type conversions (e.g., character -> factor, character -> date, numeric -> factor, etc. conversions).
  • Your data set should include at least one factor variable that needs to be labelled and/or ordered.
  • At least one of the data sets that you use should be Untidy. You need to explain why the data set or data sets you used is/are Untidy. Then you need to apply the required steps to reshape your data into a tidy format.
  • At least one variable needs to be created/mutated from the existing ones (e.g., the data may contain income and expense variables and you may create a savings variable out of the income and expense variables).
  • You are expected to scan all variables for missing values, special values, and obvious errors (i.e., inconsistencies). If there are missing values, use any of the suitable techniques outlined in Module 5 to deal with them, reason and document your approach properly. If there are no missing values in the data, then scan all variables for any special values and obvious errors, use any of the suitable techniques outlined in Module 5 to deal with them, reason and document your approach properly.
  • You are expected to scan all numeric variables for outliers. If there are outliers, use any of the suitable techniques outlined in Module 6 to deal with them, reason and document your approach properly.
  • You are expected to apply data transformations on at least one of the variables. The purpose of this transformation should be one of the following reasons: i) to change the scale for better understanding of the variable, ii) to convert a non-linear relation into linear one, or

iii) to decrease the skewness and convert the distribution into a normal distribution.

  1. The packages/functions readr, xlsx, readxl, foreign, gdata, rvest, dplyr, tidyr, deductive, deducorrect, editrules, validate, Hmisc, forecast, stringr, lubridate, car, outliers, MVN, infotheo, MASS, caret, MLR, ggplot2, knitr and base R will be useful. You can also use your own functions. This will show your accumulated knowledge that you gained throughout the semester in this course.

Optional things that you can do to preprocess data:

You can subset your data by selecting variables and/or filtering in (or out) cases. Please don’t

forget to put an explanation in your report if you do so.

  • Your data set can include date or string information or both. If this is the case, I expect you to apply required date conversions for dates and string manipulations for strings as required.
  • Depending on your level of knowledge gained in other courses (i.e., Applied Analytics

and/or Machine Learning, etc) you may apply data normalisation, feature selection and

feature extraction. Note that, this is an optional task, and you don’t have to apply any of these techniques if you don’t know the theory and the fundamentals.

Important Note:

Note that sometimes the order of the tasks may be different than the order given here. For example, you may need to tidy the data sets first to be able to create the common key to merge. Therefore, for such cases you may have a different ordering of the sections. Any further or optional pre-processing tasks can be added to the template using an additional section in the R Markdown file. Make sure your code is visible (within the margin of the page).

Create the report using R Markdown

The assessment 2 report must be completed using the R Markdown template. Note that this is an R Markdown notebook template. Information for using the R Markdown package can be found Here. The R Markdown template must be updated with your name(s) and student number(s). You must use the headings and chunks provided in the template. You can add more chunks if required. Your report will be composed of the following sections. In the report, all R chunks and outputs need to be visible. Failure to do so will result in a loss of marks.

Sections of the report:

  1. Students’ details [YAML input]: Add students’ full names, numbers, and the percentage of contributions in table “Group information”. Add the leader’s information (the one that submits assessment report) in “author” entries in the YAML header (located at the top of the R Markdown Template). If you work individually, then add only your own details in the table and write 100 % in percentage of contribution.
  2. Required packages [R code]: Provide the packages required to reproduce the report.
  3. Executive Summary [Plain text]: In your own words, provide a summary of the preprocessing. Explain the steps that you have taken to preprocess your data. Write this section last after you have performed all data preprocessing. (Word count Max: 300 words).
  4. Data [Plain text & R code & Output]: A clear description of data sets, their sources, and variable descriptions should be provided. In this section, you must also provide the R codes with outputs (e.g., head of data sets) that you used to import/read/scrape the data set. You need to fulfil the minimum requirement #1 and merge at least two data sets to create the one you are going to work on. In addition to the R codes and outputs, you need to explain the steps that you have taken.
  5. Understand [Plain text & R code & Output]: Summarise the types of variables and data structures, check the attributes in the data and apply proper data type conversions. In addition to the R codes and outputs, briefly explain the steps that you have taken. In this section, show that you have fulfilled minimum requirements 2-4.
  • Tidy & Manipulate Data I [Plain text & R code & Output]: Explain why your data (or one of the data sets) doesn’t conform the tidy data principles (minimum requirement #5). Apply the required steps to reshape the data into a tidy format. In addition to the R codes and outputs, explain everything that you do in this step.
  • Tidy & Manipulate Data II [Plain text & R code & Output]: Create/mutate at least one variable from the existing variables (minimum requirement #6). In addition to the R codes and outputs, explain everything that you do in this step.
  • Scan I [Plain text & R code & Output]: Scan the data for missing values, special values, and obvious errors (i.e., inconsistencies). In this step, you should fulfil the minimum requirement #7. In addition to the R codes and outputs, explain your methodology (i.e., explain why you have chosen that methodology and the actions that you have taken to handle these values) and communicate your results clearly.
  • Scan II [Plain text & R code & Output]: Scan the numeric data for outliers. In this step, you should fulfil the minimum requirement #8. In addition to the R codes and outputs, explain your methodology (i.e., explain why you have chosen that methodology and the actions that you have taken to handle these values) and communicate your results clearly.
  • Transform [Plain text & R code & Output]: Apply an appropriate transformation for at least one of the variables. In addition to the R codes and outputs, explain everything that you do in this step. In this step, you should fulfil the minimum requirement #9.

Submission Format

  • Upload the report as one single file (PDF) via the assessment 2 page in CANVAS.
  • The easiest way to produce a PDF file from the RMarkdown is to Run all R chunks, then Preview your notebook in HTML (by clicking Preview) → Open in Browser (Chrome) → Right- click on the report in Chrome → Click Print and Select the Destination Option to Save as PDF.
  • After creating your PDF file make sure and check that your codes and outputs are visible.

Referencing guidelines

You must acknowledge all the sources of information you have used in your assessments. Refer to the RMIT Easy Cite Referencing Tool to see examples and tips on how to reference in the appropriate style. You can also refer to the Library Referencing Page for more tools such as

EndNote, referencing tutorials and referencing guides for printing. Use the RMIT Harvard referencing method for this assessment.

Collaboration

You are permitted to discuss and collaborate on the assessment with other groups. However, the write-up of the report must be with your own allocated group effort. Assignments will be

submitted through Turnitin, so if you’ve copied from other groups, it will be detected. It is your responsibility to ensure you do not copy or do not allow another group to copy your work. If plagiarism is detected, both groups will be responsible. It is good practice to never share assessment files with others. You should ensure you understand your responsibilities by reading the RMIT University website on Academic Integrity. Ignorance is no excuse.

Academic integrity and plagiarism

Academic integrity is about the honest presentation of your academic work. It means acknowledging the work of others while developing your own insights, knowledge, and ideas. You should take extreme care that you have:

  • Acknowledged words, data, diagrams, models, frameworks, and/or ideas of others you have quoted (i.e., directly copied), summarised, paraphrased, discussed, or mentioned in your assessment through the appropriate referencing methods.
  • Provided a reference list of the publication details so your reader can locate the source if

necessary. This includes material taken from internet sites.

If you do not acknowledge the sources of your material, you may be accused of plagiarism because you have passed off the work and ideas of another person, without appropriate referencing, as if they were your own.

RMIT University treats plagiarism as a very serious offense constituting misconduct. Plagiarism covers a variety of inappropriate behaviours, including:

  • Failure to properly document a source.
  • Copyright material from the internet or databases.
  • Collusion between students. For further information on our policies and procedures, please refer to the University Website.

Assessment declaration

When you submit work electronically, you agree to the Assessment Declaration.

Extensions and special consideration

This course follows the RMIT University Assessment policy for extensions and special consideration. Information is available Here. Ensure you understand these guidelines before applying.

Extensions will only be granted in accordance with the RMIT University Extension and Special Consideration Policy. No exceptions. Assignments submitted late will be penalised (see below for further details).

Late submission of assessment

Late submissions, without an approved extension or special consideration, will incur a penalty of 10% of the total mark per day for up to 5 days late (so the maximum late penalty is 50%). Submissions more than 5 days late are not accepted.

Penalty for exceeding maximum number of 25 Pages

A penalty of 5% of the total mark will be applied per each extra page.

Assessment2 marking rubric

CriteriaTo meet all requirements and get the full point, you must complete the following criteria for each part
Executive Summary (5)  A complete summary of the data preprocessing tasks was provided
Data (10)Complete & clear description ofdata sets,their sources,variable descriptions were provided.Data met the minimum requirement #1.Merging data was correct (you may want to merge your data sets after tidying data set(s)).R codes with outputs (head of data) were providedBrief explanations of steps were given.
Understand (15)Complete inspection ofdata structure,variables types, were done.Attributes were checked & proper data type conversions were applied.Inspection met the minimum requirements #2-4.R codes with outputs were provided.Brief explanations of steps were given.
Tidy & Manipulate 1(15)Able to reflect on the tidy data principles.Clear explanation was provided.Complete set of tasks were provided to tidy and manipulate the data properly (you may want to tidy your data set(s) before merging them).R codes with outputs were provided.Brief explanations of steps were given.
Tidy & Manipulate 2(5)Able to create/mutate at least one variable from the existing variables fulfilling the (minimum requirement #6).R codes with outputs were provided.Brief explanations of steps were given.
Scan I (20)Complete set of tasks were provided to scan the data for missing values, special values and obvious errors (minimum requirement #7).Safe and suitable methodology was followed to scan and deal with missing values, special values and obvious errors.Methodology taken was explained thoroughly.R codes were provided.Results and outputs were presented clearly.
Scan II (20)Complete sets of tasks were provided to scan the data for outliers.Safe and suitable methodology was followed to scan and deal with outliers.Methodology taken was explained thoroughly.R codes were provided.Results and outputs were presented clearly.
Transform (5)Complete set of tasks were provided to apply the transformation properly, fulfilling requirement #9.R codes with outputs were provided.Brief explanations of steps were given.
Succinct (5)The report was written succinctly and clearly.
Order Now

Get expert help for Practical Assessment 2 Data Wrangling and many more. 24X7 help, plag free solution. Order online now!

Universal Assignment (July 26, 2024) Practical Assessment 2 Data Wrangling. Retrieved from https://universalassignment.com/practical-assessment-2-data-wrangling/.
"Practical Assessment 2 Data Wrangling." Universal Assignment - July 26, 2024, https://universalassignment.com/practical-assessment-2-data-wrangling/
Universal Assignment May 22, 2023 Practical Assessment 2 Data Wrangling., viewed July 26, 2024,<https://universalassignment.com/practical-assessment-2-data-wrangling/>
Universal Assignment - Practical Assessment 2 Data Wrangling. [Internet]. [Accessed July 26, 2024]. Available from: https://universalassignment.com/practical-assessment-2-data-wrangling/
"Practical Assessment 2 Data Wrangling." Universal Assignment - Accessed July 26, 2024. https://universalassignment.com/practical-assessment-2-data-wrangling/
"Practical Assessment 2 Data Wrangling." Universal Assignment [Online]. Available: https://universalassignment.com/practical-assessment-2-data-wrangling/. [Accessed: July 26, 2024]

Please note along with our service, we will provide you with the following deliverables:

Please do not hesitate to put forward any queries regarding the service provision.

We look forward to having you on board with us.

Categories

Get 90%* Discount on Assignment Help

Most Frequent Questions & Answers

Universal Assignment Services is the best place to get help in your all kind of assignment help. We have 172+ experts available, who can help you to get HD+ grades. We also provide Free Plag report, Free Revisions,Best Price in the industry guaranteed.

We provide all kinds of assignmednt help, Report writing, Essay Writing, Dissertations, Thesis writing, Research Proposal, Research Report, Home work help, Question Answers help, Case studies, mathematical and Statistical tasks, Website development, Android application, Resume/CV writing, SOP(Statement of Purpose) Writing, Blog/Article, Poster making and so on.

We are available round the clock, 24X7, 365 days. You can appach us to our Whatsapp number +1 (613)778 8542 or email to info@universalassignment.com . We provide Free revision policy, if you need and revisions to be done on the task, we will do the same for you as soon as possible.

We provide services mainly to all major institutes and Universities in Australia, Canada, China, Malaysia, India, South Africa, New Zealand, Singapore, the United Arab Emirates, the United Kingdom, and the United States.

We provide lucrative discounts from 28% to 70% as per the wordcount, Technicality, Deadline and the number of your previous assignments done with us.

After your assignment request our team will check and update you the best suitable service for you alongwith the charges for the task. After confirmation and payment team will start the work and provide the task as per the deadline.

Yes, we will provide Plagirism free task and a free turnitin report along with the task without any extra cost.

No, if the main requirement is same, you don’t have to pay any additional amount. But it there is a additional requirement, then you have to pay the balance amount in order to get the revised solution.

The Fees are as minimum as $10 per page(1 page=250 words) and in case of a big task, we provide huge discounts.

We accept all the major Credit and Debit Cards for the payment. We do accept Paypal also.

Popular Assignments

Assessment Task 2 – NAPLAN Exercise

Assessment Task 2 (35%) – Evaluation and discussion of test items Assessment Task 2 (35%) – Evaluation and discussion of test items AITSL Standards: This assessmeAITSL Standards: This assessment provides the opportunity to develop evidence that demonstrates these Standards: 1.2        Understand how students learn 1.5        Differentiate teaching to meet with

Read More »

EBY014 Degree Tutor Group 2 Assignment

  Assignment Brief Module Degree Tutor Group 2 Module Code EBY014 Programme BA (Hons) Business and Management with   Foundation Year Academic Year 2024/2025 Issue Date 6th May 2024 Semester Component Magnitude Weighting Deadline Learning outcomes assessed 2 1 2000 words Capstone Assessment 100% 26th July, 2024 1/2/3/4 Module Curriculum

Read More »

NTW 600 Computer Network and Security

Assessment 2 Information and Rubric Subject Code  NTW 600 Subject Name Computer Network and Security Assessment Number and Title Assessment 2: Cyber Security Threats to IT Infrastructure of a real-world Organisation Assessment Type Group Assessment Length / Duration  1500 words Weighting %  30% Project Report: 20% Presentation :10% (Recorded) Total

Read More »

LAW500 Business Law Assessment 2 – Group Project

Assessment Information and Rubric Subject Code LAW500 Subject Name Business Law Assessment Number and Title Assessment 2 – Group Project Assessment Type Group Length / Duration 3000 words maximum, no ±10%, and excluding references Weighting % 30% Total Marks 100 Submission Online Submission via TurnitIn for the written report Due

Read More »

Population Nutrition Case Study Analysis

HSN702 – Lifespan Nutrition Assessment Task: 1 Assignment title: Population Nutrition Case Study Analysis Assignment task type: Short Written Report and Literature Search Strategy Task details The primary focus of this assignment is on population nutrition. Nutritionists play an important role in promoting population health through optimal nutritional intake. In

Read More »

Applied Quantitative Economics Assignment

Goldsmiths College, University of London Applied Quantitative Economics Project ** You must attempt only one project, and you must complete it either in R or in Excel ** General Background Key Stage 4 (KS4) is a legal term for the last two years of secondary school education in England leading

Read More »

Score good marks in your Master of Pharmacy (M.Pharm)

Master Your Knowledge: A Guide to a Fulfilling Career in Pharmacy with an M.Pharm The Master of Pharmacy (M.Pharm) program equips you to become a medication management specialist, delving into the science behind drugs and their impact on the human body. You’ll explore advanced pharmaceutical topics, from drug discovery and

Read More »

Score good marks in your Master of Science (M.Sc) in Microbiology

Unveiling the Microscopic Marvels: A Guide to Your Master of Science (M.Sc.) in Microbiology The Master of Science (M.Sc.) in Microbiology program equips you to delve into the fascinating world of microorganisms, from life-saving bacteria to infectious pathogens. You’ll explore their role in health, disease, the environment, and even industrial

Read More »

Score good marks in your Master of Science (M.Sc) in Microbiology

Master Your Craft: A Guide to a Thriving Career in Biotechnology with an M.Sc. The Master of Science (M.Sc.) in Biotechnology program equips you to be at the forefront of scientific discovery, innovation, and problem-solving in the exciting field of biotechnology. From developing life-saving drugs to creating sustainable biofuels, the

Read More »

Score good marks in your Master of Technology (M.Tech) in Information Technology

The Master of Technology (M.Tech) in Information Technology program propels you to the forefront of the ever-evolving IT landscape. You’ll delve into advanced computing concepts, cutting-edge technologies, and specialized areas of IT expertise. But navigating complex algorithms, intricate software systems, and in-depth research projects can feel overwhelming. Universal Assignment Solutions

Read More »

Score good marks in your MBA in Human Resource Management

The Master of Business Administration (MBA) in Human Resource Management (HRM) equips you to become a strategic HR professional, shaping the future of workplaces by attracting, developing, and retaining top talent. However, navigating complex workforce management issues, crafting effective HR policies, and staying abreast of evolving labor laws can feel

Read More »

A Guide to Your Master of Computer Applications (MCA) Journey

The Master of Computer Applications (MCA) program equips you with the skills to become a sought-after software developer or IT professional. However, navigating complex programming languages, intricate algorithms, and advanced software development methodologies can feel overwhelming. Universal Assignment Solutions can be your guiding light! We offer comprehensive assignment help designed

Read More »

Expert Assignment Help for Master of Climate Science Students

Embark on your Master of Climate Science journey with confidence! As you delve into the complexities of climate systems, atmospheric dynamics, and the pressing challenges of climate change, feeling overwhelmed by demanding coursework is natural. Universal Assignment Solutions can be your guide! We offer comprehensive assignment help designed to empower

Read More »

Expert Assignment Help for Master of Information Systems Students

Mastering the complexities of information systems (IS) in today’s ever-evolving digital landscape requires a blend of technical expertise and strategic thinking. Enrolled in a Master of Information Systems (MSIS) program, you’re poised to become a leader in designing, implementing, and managing the information systems that power our world. But feeling

Read More »

Expert Assignment Help for Your Master of Science in Statistics

Embarking on your Master of Science in Statistics (MS Statistics) program is an exciting step towards a rewarding career. As you delve into the intricacies of statistical theory, data analysis techniques, and advanced modeling, feeling overwhelmed by challenging coursework is natural. Universal Assignment Solutions can be your guide! We offer

Read More »

Expert Assignment Help for Master of Urban Design Students

Master of Urban Design (MUD) programs equip you to transform cities into vibrant, sustainable, and equitable spaces. But navigating complex urban design theories, crafting master plans, and tackling real-world design challenges can feel overwhelming. Universal Assignment Solutions can be your design compass! We offer comprehensive assignment help designed to empower

Read More »

Expert Assignment Help for Master of Public History Students

Embark on your Master of Public History journey with confidence! As you delve into the complexities of museums, archives, historic preservation, and interpreting the past for diverse audiences, feeling overwhelmed by demanding coursework is natural. Universal Assignment Solutions can be your trusted partner! We offer comprehensive assignment help designed to

Read More »

Expert Assignment Help for Master of Conservation Biology Students

Are you passionate about protecting our planet’s incredible biodiversity but feeling overwhelmed by the complexities of conservation biology? Struggling with research proposals, population modeling, or navigating the intricacies of habitat restoration in your Master’s program? Universal Assignment Solutions can be your scientific compass! We offer comprehensive assignment help designed to

Read More »

Expert Assignment Help for Master of International Education Students

Embarking on your Master of International Education (MIE) journey is a noble pursuit. As you delve into the complexities of intercultural learning, global citizenship education, and preparing students for a globally interconnected world, feeling overwhelmed by demanding coursework is natural. Universal Assignment Solutions can be your trusted guide! We offer

Read More »

Expert Assignment Help for Master of Public Art Studies Students

Embarking on your Master of Public Art Studies program is an exciting venture. As you delve into the world of public art theory, community engagement, and artistic interventions in the urban landscape, feeling overwhelmed by demanding coursework is natural. Universal Assignment Solutions can be your trusted guide! We offer comprehensive

Read More »

Expert Assignment Help for Master of Real Estate Finance Students

Feeling lost in the labyrinth of loan-to-value ratios, cap rates, and complex financial modeling for real estate projects? Drowning in the sea of market analysis and feasibility studies in your Master of Real Estate Finance program? Universal Assignment Solutions can be your compass, navigating you towards becoming a real estate

Read More »

Expert Assignment Help for Master of Corporate Finance Students

Feeling lost in the labyrinth of financial models, complex valuation techniques, and demanding coursework in your Master of Corporate Finance program? Universal Assignment Solutions can be your compass to navigating the exciting world of corporate finance! We offer comprehensive assignment help designed to empower you to become a financial whiz

Read More »

Expert Assignment Help for Master of Arts in Teaching Students

Embarking on your Master of Arts in Teaching (MAT) journey is a noble pursuit. As you delve into the complexities of pedagogy, curriculum development, and educational leadership, feeling overwhelmed by demanding coursework is natural. Universal Assignment Solutions can be your trusted guide! We offer comprehensive assignment help designed to empower

Read More »

Expert Assignment Help for Master of Food Science Students

Feeling overwhelmed by the intricate dance of chemistry, biology, and engineering in your food? Drowning in complex food processing techniques and demanding coursework in your Master of Food Science program? Universal Assignment Solutions can be your culinary compass! We offer comprehensive assignment help designed to empower you to become a

Read More »

Expert Assignment Help for Master of Educational Leadership Students

Feeling overwhelmed by the complexities of educational leadership, the weight of educational policy, and demanding coursework in your Master of Educational Leadership program? Universal Assignment Solutions can be your guiding light! We offer comprehensive assignment help designed to empower you to become a visionary leader who transforms schools. Why Choose

Read More »

Can't Find Your Assignment?

Open chat
1
Free Assistance
Universal Assignment
Hello 👋
How can we help you?