The objective of the project is to use the dataset ‘Factor-Hair-Revised.csv’ to build an optimum regression model to predict satisfaction. You are expected to
- Perform exploratory data analysis on the dataset. Showcase some charts, graphs. Check for outliers and missing values (8 marks)
- Is there evidence of multicollinearity ? Showcase your analysis(6 marks)
- Perform simple linear regression for the dependent variable with every independent variable (6 marks)
- Perform PCA/Factor analysis by extracting 4 factors. Interpret the output and name the Factors (20 marks)
- Perform Multiple linear regression with customer satisfaction as dependent variables and the four factors as independent variables. Comment on the Model output and validity. Your remarks should make it meaningful for everybody
Please note the following:
- You have to submit 2 files :
- Business Report: In this you need to submit all the answers to all the questions in a sequential manner. Your answer should include detailed explanations & inferences to all the questions. Your report should not be filled with codes. You will be evaluated based on the business report. It should include the detailed explanation of approach used, insights, inferences, all outputs of codes like graphs, tables etc.
- R code file : This is a must and will be used for reference while evaluating
- You must give the sources of data presented. Do not refer to blogs; Wikipedia etc.
- Any assignment found copied/ plagiarized with other group(s) will not be graded and marked as zero.