Practical Data Science (with Python)

Project Assignment 3, Semester 2, 2022

Marks : This assignment is worth 30% of the overall assessment for this course.

Due Date : Wed, 26 October 2022, 11:59PM (Week 14), via Canvas. Late penalties apply: 10% of the total project score will be deducted per day late. No submissions will be accepted more than 5 days after the due date.

Objective

The key objective of this assignment is to learn how to compare and contrast several recommendation system algorithms. The assignment has three major components: a completed Jupyter notebook used to run your experiments, a written report, and a short video presentation in which you describe what you did and your key findings.

The dataset you will use will be a sample of the Netflix Prize data. The problem is movie recommendation, and the data is already split into a training and validation set that you can use to run all of your experiments.

Provided files

The following template files are provided:

  • SXXXXX-A3.ipynb : The primer Jupyter notebook file you should use to stage and run all of your experiments.
  • netflix-5k.movie-titles.feather : The movie title dataframe that can be used to map a movieID to a title, as well as a list of genres.
  • netflix-5k.train.feather : The training tuples for 5,000 users, where each tuple is ⟨userID,movieID,rating⟩.
  • netflix-5k.validation.feather : A predefined set of validation tuples for the same users that can be used by you to benchmark the performance of various algorithms.
  • A3.pdf : This specification file.

Creating Your Workspace

Once again, you should rename the file SXXXXX-A3.ipynb appropriately based on your student ID.

Creating Your Anaconda Environment

To create your Anaconda environment for this project, run the following commands in a terminal shell:

conda create -n PDSA3 python=3.8
conda activate PDSA3
pip install jupyterhub notebook numpy pandas
pip install matplotlib scikit-learn seaborn
pip install kneed scikit-surprise

Note that both kneed and scikit-surprise can be finicky on some systems. For example, on my machine (MacBook Pro M1), an error was thrown while compiling scikit-surprise, but the install still worked when it tried a fallback install method. So if you find that pip really fails for you, then and only then resort to conda. This would break requirements.txt, but it should work reliably for everyone, albeit less reproducibly. The magic commands would be:

conda install -c conda-forge kneed

conda install -c conda-forge scikit-surprise

You can type “pip freeze” to see a list of the packages that are correctly installed in your environment. You can also install scikit-learn-intelex and/or psutil if you want to use the Intel-based optimisations or debug memory management as shown in the sample Jupyter notebook.

If you also wish the timing and other jupyter extensions to be enabled in your notebook (optional but may be useful depending on how you decide to present your results), you need to run the following additional commands:

pip install jupyter_contrib_nbextensions
pip install jupyter_nbextensions_configurator
jupyter contrib nbextension install --user
jupyter nbextensions_configurator enable --user

Now you just need to type “jupyter notebook” to start up Jupyter with access to the libraries you just installed. Also, recall that if you ever stop working in your environment and come back later, you must open a terminal, run “conda activate PDSA3” and then “jupyter notebook”; otherwise you will not be working in the virtual environment you created above, and most things you try to do will probably start failing.

You should not use any other libraries to complete your assignment beyond the ones shown above without written permission from the course coordinator (Shane).

1   The Jupyter Notebook Primer (5 marks)

I have included the jupyter notebook I walked through at the end of the Week 10 Lectorial. This notebook will provide you with everything you need to correctly load the dataframe files from feather, and also includes an example of how to do both a grid search and randomised search for parameter tuning on one of the recommendation system algorithms included in Surprise. You should spend some time reading the API documentation and tutorial for this library provided at https://surpriselib.com. This will be critical information you should use to stage your experiments.
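As a rough illustration of how the two tuning strategies mentioned above differ, the sketch below enumerates candidate settings in plain Python. The parameter names (`n_factors`, `lr_all`, `reg_all`) mirror options of the SVD algorithm in Surprise, but the grid values and the `score` function are made-up placeholders, not results. This is not the Surprise API; in the notebook you would use Surprise's own `GridSearchCV` and `RandomizedSearchCV` classes.

```python
import itertools
import random

# Hypothetical search space; the values are placeholders, not recommendations.
param_grid = {
    "n_factors": [50, 100, 150],
    "lr_all": [0.002, 0.005],
    "reg_all": [0.02, 0.1],
}


def grid_candidates(grid):
    """Yield every combination of parameter values (exhaustive grid search)."""
    names = sorted(grid)
    for values in itertools.product(*(grid[name] for name in names)):
        yield dict(zip(names, values))


def random_candidates(grid, n_iter, seed=0):
    """Sample n_iter combinations without replacement (a simplified randomised search)."""
    rng = random.Random(seed)
    all_candidates = list(grid_candidates(grid))
    return rng.sample(all_candidates, min(n_iter, len(all_candidates)))


def score(params):
    """Synthetic stand-in for 'train the model and return validation RMSE'."""
    return 0.9 + params["reg_all"] - params["n_factors"] / 1000 + params["lr_all"]


grid_list = list(grid_candidates(param_grid))
best_grid = min(grid_list, key=score)  # grid search tries all 12 candidates
best_random = min(random_candidates(param_grid, n_iter=4), key=score)  # only 4 tried
print(len(grid_list), best_grid, best_random)
```

Grid search is guaranteed to find the best setting in the grid, at the cost of evaluating every combination; randomised search evaluates far fewer candidates and may miss the best one.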

2   The Report (15 marks)

The main component of your assignment will be to carefully write up your key findings. Your report should not be more than 5 pages using 11pt font. You may also have one additional Appendix page containing additional graphs or tables. Your final report must be submitted as a PDF file. You can use Microsoft Word, but I would strongly encourage you to consider writing up your report using LaTeX (https://overleaf.com).

Writing in LaTeX may seem daunting at first, but Overleaf provides plenty of tutorials and examples, and it is pretty easy once you get the hang of it. This spec file was written using LaTeX. Microsoft Word is not a good tool for writing technical documents, and in fact most Computer Science conferences require papers to be written in LaTeX. The quality of the presentation layer is easily discernible by most people in Computer Science. Write a paragraph or two with a graph or diagram in Overleaf and then in MS Word and compare the two – I bet you’ll see the difference immediately!

Regardless of which tool you use to generate your final PDF, the format of the report should be:

  1. Your name and student number at the top of page 1.
  2. Introduction (usually no more than 1/2 page).
  3. Methodology (usually about 1 page) – This would contain a clear description of each recommendation system algorithm you are using and a rationale as to why it is being used.
  4. Experiments (about 3 pages) – This is the main component of your report. Here you should document all of the parameters and algorithms used, include the Tables, Figures, or Graphs that you create in order to compare and contrast all of the algorithms you have benchmarked, and discuss what you have discovered. There is no reason to include images of code snippets, as you are already submitting your Jupyter notebook; you should include images of graphs you create, assuming they are important to the story you are telling.
  5. Conclusion (1/2 page) – A clear summary of your key findings.
  6. References (Separate page) – You can include as many references as you see fit and need in your report, and this is not counted against the 5 page limit.
  7. Appendix (Separate page) – Any additional graphs or tables that you think are important but that you could not include into the main document because of space constraints.

Summary – a report that has a 5 page body, 1+ pages of citations, and 1 optional page at the end as an Appendix.

You must compare at least four different algorithms from the surprise library in your report. Note that one algorithm (e.g. SVD) with two different parameter settings does not count as two different algorithms. That is one algorithm with two different parameter settings, i.e. one algorithm. You can certainly include results for multiple parameter settings for each of the four algorithms in your experiments, just make sure you are using 4 different algorithms in total in your shootout.
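The counting rule above can be checked mechanically. A minimal sketch, assuming each experiment is logged as an (algorithm name, parameter settings) record: the record layout, the function names, and the parameter values are hypothetical, although the algorithm names are real classes in Surprise.

```python
# Hypothetical experiment log: (algorithm name, parameter settings).
# The parameter values are illustrative placeholders only.
runs = [
    ("SVD", {"n_factors": 50}),
    ("SVD", {"n_factors": 100}),  # same algorithm, different settings
    ("KNNBaseline", {"k": 40}),
    ("SlopeOne", {}),
]


def distinct_algorithms(runs):
    """Return the set of algorithm names, ignoring parameter settings."""
    return {name for name, _params in runs}


def meets_requirement(runs, minimum=4):
    """True when at least `minimum` different algorithms were benchmarked."""
    return len(distinct_algorithms(runs)) >= minimum


print(sorted(distinct_algorithms(runs)), meets_requirement(runs))
```

Here the two SVD runs count once, so the log holds only three distinct algorithms and one more would be needed.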

In Week 8, I provided several evaluation classes you can use to compute a wide variety of evaluation measures, and we encourage you to explore as many as you can, as doing so will provide additional evidence that your “winning” algorithm is really a winner. The “official” metric will be RMSE, as this is the metric used in the original competition, but you should compare the 4 algorithms using a minimum of three different evaluation metrics. You should aim to have at least one algorithm that can achieve an RMSE score ≤ 0.800. This should be achievable with a little parameter tuning if you choose good algorithms from surprise, and you may even decide that the algorithm that got the best RMSE score is not necessarily the “best” algorithm overall based on the experiments you have run.
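For reference, RMSE and a second error measure, MAE, can be computed over (true rating, predicted rating) pairs as in the sketch below. The pairs are toy values chosen for illustration and the function names are illustrative; in practice the Week 8 evaluation classes or Surprise's `accuracy` module would be used.

```python
import math


def rmse(pairs):
    """Root mean squared error over (true, predicted) rating pairs."""
    return math.sqrt(sum((t - p) ** 2 for t, p in pairs) / len(pairs))


def mae(pairs):
    """Mean absolute error over (true, predicted) rating pairs."""
    return sum(abs(t - p) for t, p in pairs) / len(pairs)


# Toy (true, predicted) pairs for illustration only; not real results.
pairs = [(4.0, 3.5), (2.0, 2.5), (5.0, 4.0), (3.0, 3.0)]

TARGET_RMSE = 0.800  # the target stated in the assignment
print(f"RMSE={rmse(pairs):.3f} MAE={mae(pairs):.3f}")
print("meets target:", rmse(pairs) <= TARGET_RMSE)
```

Because RMSE squares each error before averaging, it penalises a few large misses more heavily than MAE does, which is one reason two algorithms can rank differently under the two measures.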

Hint: Think carefully about what “good” movie recommendations really mean to you. We all know what it is, so what do you think is the best way to prove or disprove that recommendations from Algorithm A are clearly better than the ones you get from Algorithm B? We covered a wide variety of evaluation measures in the Week 8 Lectorial, so go look at what was in that notebook and think about it.
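One way to make “good recommendations” concrete is a ranking measure such as precision at k: of the k movies an algorithm ranks highest for a user, what fraction did that user actually rate highly? The sketch below is one possible formulation, assuming a true rating of 4 or more counts as relevant. That threshold, the function name, and the toy data are assumptions for illustration, not part of the assignment.

```python
def precision_at_k(predictions, k=3, threshold=4.0):
    """Precision at k for one user.

    predictions: list of (predicted rating, true rating) pairs.
    Returns the fraction of the top-k items by predicted rating whose
    true rating is at or above the relevance threshold.
    """
    top_k = sorted(predictions, key=lambda pair: pair[0], reverse=True)[:k]
    if not top_k:
        return 0.0
    relevant = sum(1 for _pred, true in top_k if true >= threshold)
    return relevant / len(top_k)


# Toy data for one user: (predicted rating, true rating). Illustrative only.
user_predictions = [(4.8, 5.0), (4.5, 3.0), (4.1, 4.0), (3.0, 5.0), (2.2, 1.0)]
print(precision_at_k(user_predictions, k=3))
```

Averaging this value over all users gives a single score per algorithm that reflects ranking quality rather than rating error, so it can disagree with RMSE.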

Other key hints – (1) If you have a Figure, Table, or Diagram in the report, it must have a caption and you must reference it in your report and discuss it. By that I mean “Table 1 contains a table of RMSE results for Algorithms A-E. We can see that …” (2) If you use ideas or code from somewhere else, you must include it in your references. A typical “bibtex” citation for something taken from the web would look something like:

@misc{StackA,
  title = {{Stackoverflow Discussion on X}},
  howpublished = {\url{http://www.example.com}},
  note = {Accessed: 2022-10-05}
}

The preferred referencing style for Computer Science is usually APA. See https://libguides.murdoch.edu.au/APA/sample for an example. There are lots of tutorials available online on APA referencing, so if you have never seen it, just search for tutorials on APA referencing and you’ll find more than you could ever read/watch. (3) If it isn’t clear, you must include graphs, tables, and/or diagrams in your experimental section, which are to be used as evidence to back any claims about algorithm performance that you make.
