# Using aggregation functions for data analysis

Using aggregation functions for data analysis

Total Marks 100, Weighting 20%

The provided zip file contains the data file [RedWine.txt] and the R code [AggWaFit718.R] to use with the following tasks, include these in your R working directory. You can use the R script [template.R] to organise your code.

Clarification and related resources are provided

## Red wine quality Dataset

The given dataset, “RedWine.txt”, is used to model wine quality based on physicochemical tests. The dataset provides the 1,599 red wine samples from the north of Portugal. It is a modified version of the data used in the study [1]. This dataset includes 5 variables, denoted as X1, X2, X3, X4, X5, and Y, described as follows:

X1 – citric acid X2 – chlorides

X3 – total sulfur dioxide X4 – pH

X5 – alcohol

Y – quality (score between 0 and 10)

[1] P. Cortez, A. Cerdeira, F. Almeida, T. Matos and J. Reis. Modeling wine preferences by data mining from physicochemical properties. In Decision Support Systems, Elsevier, 47(4):547-553, 2009.

• Understand the data
1. Import the txt file (RedWine.txt) and save it to your R working directory.
1. Assign the data to a matrix, e.g. using the.data <- as.matrix(read.table(“RedWine.txt “))
• The variable of interest is quality (Y). To investigate Y, generate a subset of 440 data, e.g. using:

my.data <- the.data[sample(1:1599,440),c(1:6)]

[The following tasks are based on the 440 sample data]

• Using scatter plots and histograms to understand the relationship between each of the variables X1, X2, X3, X4, X5 and the variable of interest Y.
• Transform the data

Choose any four from the five variables (X1, X2, …, X5). Make appropriate transformations to the chosen four variables and the variable of interest Y individually, so that the values can be aggregated in order to predict the variable of interest. Assign your transformed data along with your transformed variable of interest to an array.

[All the following tasks are based on the saved transformed data]

• Build models and investigate the importance of each variable
• Import AggWaFit718.R file to your working directory and load into the R workspace using, source(“AggWaFit718.R”)
• Evaluating the following fitting functions on the transformed data:
• A weighted arithmetic mean (WAM)
• Weighted power means (WPM) with P=2
• An ordered weighted averaging function (OWA)
• Use your model for prediction

Using your best fitting model based on Q3, predict the wine quality for the input: X1=1; X2= 0.075; X3=41; X4=3.53; X5=9.3.

[Apply the same pre-process as Q2 for the new input]

• Summarising your data analysis procedures in up to 20 slides for a 5-minutes presentation. The slides should include the following contents:
• What kinds of the data distribution you have identified in the raw data.
• Explain the transformations applied for the selected four variables and the variable of interest.
• Include two tables – one with the error measures and correlation coefficients, and one summarising the weights/parameters and any other useful information learned for your data.
• Explain the importance of each of the variables (the four variables that you have selected).
• Which fitting function is the best fitting model on your selected data.
• Give your prediction result and comment on whether you think it is reasonable.
• Discuss the best conditions (in terms of your chosen four variables) under which a higher quality wine will occur.
• Comment the implications and the limitations of the fitting model you used for prediction.

The 5-minutes presentation can be using a simple and accessible platform such as YouTube or PowerPoint Audio.

## Submission requirements

Submit to the SIT718 CloudDeakin Dropbox. Your final submission must include the following TWO files:

2. The R code file (that you have written to produce your results) named “name- code.R” (where “name” is replaced with your surname or first name).

Your assignment will not be assessed if the code is missing, or the outputs of the code are inconsistent with the slides.

Following Harvard style for code citation and reference in your R script with comments

You must cite all the datasets and packages you used for this assessment. You will lose some scores for inappropriate citations/references.

Get expert help for Using aggregation functions for data analysis and many more. 24X7 help, plag free solution. Order online now!

Universal Assignment (June 2, 2023) Using aggregation functions for data analysis. Retrieved from https://universalassignment.com/using-aggregation-functions-for-data-analysis/.
"Using aggregation functions for data analysis." Universal Assignment - June 2, 2023, https://universalassignment.com/using-aggregation-functions-for-data-analysis/
Universal Assignment May 22, 2023 Using aggregation functions for data analysis., viewed June 2, 2023,<https://universalassignment.com/using-aggregation-functions-for-data-analysis/>
Universal Assignment - Using aggregation functions for data analysis. [Internet]. [Accessed June 2, 2023]. Available from: https://universalassignment.com/using-aggregation-functions-for-data-analysis/
"Using aggregation functions for data analysis." Universal Assignment - Accessed June 2, 2023. https://universalassignment.com/using-aggregation-functions-for-data-analysis/
"Using aggregation functions for data analysis." Universal Assignment [Online]. Available: https://universalassignment.com/using-aggregation-functions-for-data-analysis/. [Accessed: June 2, 2023]

## Please note along with our service, we will provide you with the following deliverables:

Please do not hesitate to put forward any queries regarding the service provision.

We look forward to having you on board with us.

# Get 90%* Discount on Assignment Help

### Most Frequent Questions & Answers

Universal Assignment Services is the best place to get help in your all kind of assignment help. We have 172+ experts available, who can help you to get HD+ grades. We also provide Free Plag report, Free Revisions,Best Price in the industry guaranteed.

We provide all kinds of assignmednt help, Report writing, Essay Writing, Dissertations, Thesis writing, Research Proposal, Research Report, Home work help, Question Answers help, Case studies, mathematical and Statistical tasks, Website development, Android application, Resume/CV writing, SOP(Statement of Purpose) Writing, Blog/Article, Poster making and so on.

We are available round the clock, 24X7, 365 days. You can appach us to our Whatsapp number +1 (613)778 8542 or email to info@universalassignment.com . We provide Free revision policy, if you need and revisions to be done on the task, we will do the same for you as soon as possible.

We provide services mainly to all major institutes and Universities in Australia, Canada, China, Malaysia, India, South Africa, New Zealand, Singapore, the United Arab Emirates, the United Kingdom, and the United States.

We provide lucrative discounts from 28% to 70% as per the wordcount, Technicality, Deadline and the number of your previous assignments done with us.

After your assignment request our team will check and update you the best suitable service for you alongwith the charges for the task. After confirmation and payment team will start the work and provide the task as per the deadline.

Yes, we will provide Plagirism free task and a free turnitin report along with the task without any extra cost.

No, if the main requirement is same, you don’t have to pay any additional amount. But it there is a additional requirement, then you have to pay the balance amount in order to get the revised solution.

The Fees are as minimum as \$10 per page(1 page=250 words) and in case of a big task, we provide huge discounts.

We accept all the major Credit and Debit Cards for the payment. We do accept Paypal also.

### Popular Assignments

NURBN3034 Assessment Task 1a Global Health Issues Group ePoster Presentation Weighting: 30% Due date: Thursday August 18th 13.59pm each group Purpose: The purpose of this learning task is to choose one of the following global health issues that under the right conditions has or is likely to have a signiﬁcant

School of Business, IT and Logistics — ISYS3375 Business Analytics Assessment 2: Case Study Assessment Type: Individual report                                               Word limit: 2000-3000 (+/– 10%) Each Table or Figure is counted as Due date: Sunday of Week 5 23:59 (Melbourne time) Weighting: 35% 50 words + the number of words in its

### NURBN3032 Task 2: Managing a Transition to Practice Issue

NURBN3032 Task 2: Managing a Transition to Practice Issue Weight: 60% Due: Thursday 18th May (Week 11) In this task, students are required to demonstrate knowledge relating to understanding and addressing a transitional issue that can affect new graduate nurses. Using evidence from current scholarly literature (i.e., less than seven

### ATS2561 Sex and the Media

Assessment Guide – Research Essay Due: Friday Week 12, submit on Moodle Weighting: 40% Length: 2000 words Write an essay responding to one of the following questions/topics: ‘objectifying’ images might be different, or that they should be understood as the same, for men and women. Your argument should address why

### Shoulder Range of Motion: A Key Component of Upper Body Functionality

Title: Shoulder Range of Motion: A Key Component of Upper Body Functionality Short Descriptiont: Shoulder range of motion is crucial for optimal upper body functionality as it involves a complex interplay of muscles, bones, and joints. Limitations in shoulder mobility can impact daily activities, athletic performance, as well as overall

### ITC597 Digital Forensics

ITC597 Digital Forensics – SAMPLE EXAM ONLY This paper is for Distance Education (Distance), Port Macquarie, Study Centre Sydney and Study Centre Melbourne students. EXAM CONDITIONS: NO REFERENCE MATERIALS PERMITTED No calculator is permitted No dictionary permitted WRITING TIME:                     2 hours plus 10 minutes reading time Writing is permitted during

### Simulation Project- Computer Lab Project

Model and analyse the communication tower at the Casuarina campus. Apply dead, live and wind load as per in AS 1170 or other relevant standards in SAP2000. You should measure size of the elements as far as you can from or make reasonable assumptions about the dimensions. Reasonable assumptions should

### COM621 UX Strategy

Solent University Coursework Assessment Submission Module Name:    UX Strategy Module Code:    COM621 Module Leader: Assessment Submission Date: Student Number: UX Strategy Contents Part 1 – Introduction to System (1K words) 2 1.0 Introduction. 2 1.1       Current SUAA UX Design and Business Model 2 1.2       Academic and Market Research. 3 1.3

### MIT302 Internet of Things

Group Presentation and Video (part 2) Unit:             MIT302 Internet of Things Due Date:       09/06/2023 Total Marks:    This assessment is worth 10% of the full marks in the unit. Instructions: 1.        Students are required to cover all stated requirements. 3.        Please save the document as: MIT302_Firstname_Surname_StudentNumber[assessment1].ppt Requirements: Write a PowerPoint of

### MBA600 Capstone: Strategy

Assessment 1 Information Subject Code: Subject Name: Assessment Title: Assessment Type: Length: Weighting: Total Marks: Submission: Due Date: MBA600 Capstone: Strategy Competitive Advantage Video Project Individual video recording 5 minutes (no more) 25% 100 Online Week 5 Your task Individually, you are required  to record a 5-minute video, in which

### WHY SHOULD ALL NURSES LEARN ABOUT END OF LIFE CARE?

Background You are a newly graduated nurse in the emergency department. Tom has been admitted with left abdominal pain radiating through to the back exacerbated by eating and drinking. The pain has significantly increased over the past two days and he currently rates it as 9/10. He has been unable

### Strategic Management Assignment

Assignment: Prepare a Comprehensive Strategic Management Analysis Report of Infosys Task: You need to develop a max 3,000-word Comprehensive Strategic Management Analysis Report addressing the four specific tasks set out in the strategic management assignment brief. The 3,000 words, exclude the Title, Abstract, Table of contents, Bibliography and Appendices. The company

### MID-PLACEMENT PRESENTATION ASSIGNMENT

Progress: What is going on well CHALLENGES (WORKING REMOTELY) AREAS FOR DEVELOPMENT STRENGTHS NEXT PART OF PLACEMENT MULLER, 2014 The power of story Song line and dreaming tracks Defining knowledge, theories, and purposes: The most significant part was likened to a snake whose tail was cut off, and it had

### QUATTRO-CANNA HOLDINGS RESEARCH PROJECT

Background Local hemp products development company Quattro-Canna Holdings has signed a licence agreement with hemp processing equipment developer, Canadian Greenfield Technologies, to manufacture the HempTrain decorticator plant, designed for mass-processing of hemp straw- bales into bast fibre, hurd and green microfiber (GMF), in South Africa. The HempTrain will be manufactured

### Information Booklet Scaffold

An information booklet contains-relevant information on a topic for a particular target audience. The format of an information booklet can vary, however, there are common elements, including: The following process can support you in developing an information booklet on a topic for a particular audience. There are three stages to

### CHCCSM004 Coordinate Complex Case Requirements

Assessment Task 1: Written Questions b. List and describe eight of a coordinators responsibilities. a. Explain how information about external service providers might be sourced. b. List three circumstances it might be necessary for a coordinator to use external service providers to ensure that a consumer’s care plan meets their needs and

A: Assessment Details Module Title Leadership in Action Module Code BU7401 Module Leader Component Number 1 Assessment Type, Word Count & Weighting Individual written assignment 4000 words 100% of module grade Submission Deadline 21/10/2022 Submission Instructions Online submission using Turn It In Feedback Return Date 4 weeks after submission B:

### Management Research Perspectives

SBS – DBA Assignment – 2023 UNIT TITLE:                                            NAME (in Full):                                                               GENERAL INSTRUCTIONS converted to 90 marks. Total Marks                      / 90 PLAGIARISM Plagiarism is a form of cheating, by representing someone else’s work as your own or using someone else’s work (another student or author) without acknowledging it

### MARKETING PLAN ASSIGNMENT HELP

I.        Executive Summary           The executive summary is a synopsis of the overall marketing plan and easier to write last, after the entire marketing plan has been written. II.       Environmental Analysis Micro Analysis:                     Competitive forces (Five Force Analysis)                               Who are our major competitors?  What are their characteristics (size,

### Bunnyland and Otherland: One Year Later – Exploring Food, Art, Leadership, Music, Psychology, and Self-Improvement

Word count – 2000 words Total Marks – 65 we return to Bunnyland and Otherland one year later! When we last saw them things had come to a tentative conclusion but substantial challenges remained. Could people from both lands manage to work together to solve their food problem? Would tensions

### MQBS7030 Final Assessment Data Analysis and Report

ASSIGNMENT TASK: For this assignment, you need to refer to “Fringe” dataset. Fringe is concerned with the factors that contribute to the fringe benefits of employees. The dataset includes a range of different variables, which allows for a range of different tests to be performed. You should note that our

### MIS770 Foundation Skills in Business Analysis

MIS770 Foundation Skills in Business Analysis Department of Information Systems and Business Analytics Deakin Business School Faculty of Business and Law, DeakinUniversity Assignment Two Analysis of Click Sales Data Particulars Assurance of Learning This assignment assesses the following Graduate Learning Outcomes and related Unit Learning Outcomes: Graduate Learning Outcome (GLO)

### Myopia and Later Physical Activity in Adolescence: A Prospective Study

Question 1 ( Read the paper Deere K, Williams C, Leary S, et al (2009). Myopia and later physical activity in adolescence: a prospective study.  British Journal of Sports Medicine, 43,542–544. Critically appraise of the statistical material in this paper against items 10, 12-17 of the STROBE checklist. Present your

### ITECH7407 – Real Time Analytics

Assessment Task – Data Analytics Assignment Overview For this assessment task, you will work in a group to analyse a selected data set, and provide recommendations to the leadership of the company based on your findings. Timelines and Expectations Percentage Value of Task: 25% Due: Week 11, Sunday 5pm Minimum

### BSB123 Data Analysis

BSB123 Data Analysis Research Report Assessment Semester 1, 2021 Due Date: 11:59 30th May The data for the Assignment can be found in the file Research Report Assessment (2021-01).xlsx on Blackboard The Problem FringeTech is an information technology / electrical engineering company that employs thousands of people Australia wide. Recently

### Final Analysis Assignment Help

Refer to the attached excel file, answer the questions below. Use graph if required. The file that can be accessed through the link below contains data on 100 employees in a particular occupation. Suppose that interest centres on investigating the factors that explain salary differences. The data set contains the following

### VETS6103 Data Analysis Assignment

Factors influencing milk production in Australian dairy cattle Assignment overview: This assignment involves analysing a dataset, interpreting results, and drawing conclusions based on the analyses. The dataset can be found in the file “practical_assignment_2021.xls” which is on Canvas under the Assignments folder. It is a group task worth 50% of