MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS

ASSESSMENT 1 – MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS

Due: Week 2, Wednesday at 11.59 pm Weighting: 20%

Purpose

The purpose of this assessment is to evaluate your learning and knowledge of using the MapReduce technique mentioned in Week 1, finding similar items and mining data streams.

Your Task

Your task is to complete the following exercises:

Exercise 1 – Friend Recommendation System (Stanford) (40 points)

Write a MapReduce program in Spark (see Overview Module for download instructions) that implements a simple “People You Might Know” social network friendship recommendation algorithm.

The key idea is that if two people have a lot of mutual friends, then the system should recommend that they connect with each other.

Input:

· Download the input file from the link: http://snap.stanford.edu/class/ cs246- data/hw1q1.zip. The input file contains the adjacency list and has multiple lines in the following format: <User><TAB><Friends>
· Here, <User> is a unique integer ID corresponding to a unique user, and <Friends> is a comma separated list of unique IDs corresponding to the friends of the user with the unique ID <User>.
· Note that the friendships are mutual (i.e., edges are undirected): if A is friends with B then B is also friends with A. Algorithm: Let us use a simple algorithm such that, for each user U, the algorithm recommends N = 10 users who are not already friends with U, but have the greatest number of mutual friends in common with U.

Output:

The output should contain one line per user in the following format:

<User><TAB><Recommendations> where <User> is a unique ID corresponding to a user and <Recommendations> is a comma separated list of unique IDs corresponding to the algorithm’s recommendation of people that <User> might know, ordered in decreasing number of mutual friends.

Even if a user has less than 10 second-degree friends, output all of them in decreasing order of the number of mutual friends.

If there are recommended users with the same number of mutual friends, then output those user IDs in numerically ascending order.

Also, please provide a description of how you are going to use MapReduce jobs to solve this problem. Do not write more than 3 to 4 sentences for this: only a very high- level description of your strategy to tackle this problem.

For your submission

Include your source code

Include in your writeup a short paragraph describing your algorithm to tackle this problem.

Include in your writeup the recommendations for the users with following user IDs: 924, 8941, 8942, 9019, 9020, 9021, 9022, 9990, 9992, 9993.

Exercise 2 S-curve (exercise 3.4.1 in Leskovec, Rajaraman and Ullman) (7+7+7 points)

Evaluate the S-curve 1 − (1 − sr)b for s = 0.1, 0.2, . . ., 0.9, for the following values of r and b;

r=3 and b=10.
r=6 and b=20.
r=5 and b=50.

Exercise 3 Filtering Streams (similar to Exercises of 4.3 in Leskovec, Rajaraman and Ullman) (10 + 10 points)

For the situation of the running example of Section 4.3.1 in Leskovec, Rajaraman and Ullman with changed conditions (10 billion bits, 2 billion members of the set S).

Calculate the false-positive rate when using three hash functions. Do the same for four hash functions.

As a function of n, the number of bits and m the number of members in the set S, what number of hash functions minimizes the false-positive rate?

Reference:

Leskovec, J., Rajaraman, A. and Ullman, J.D., 2020. Mining of massive data sets. Cambridge university press.

Outcomes

This task addresses the following course learning outcomes.

Course Learning Outcomes

CLO1

Explain algorithms for big data sets and methodologies in the context of data mining.

CLO3 Develop and integrate algorithms as a part of software development for mining big data.

CLO5 Utilise contemporary technologies and practices to effectively handle big datasets.

Requirements

You must submit your assessment using the relevant portal in MyUni/Canvas.
All written assessments (excluding quizzes) must be submitted using a text document e.g., doc, docx, pdf via the link at the top of the page.
Consult the assessment rubric when preparing your submission.
Questions can be posted to the relevant assessment Discussion Board.

Academic Integrity

Please ensure that you have read the Academic Integrity Policy.

Grading Criteria

This assessment is worth 20% of your overall grade. Refer to the attached rubric for detailed information on the grading criteria for this assessment.

Rubric title: Assessment 1 — MapReduce, Similar Items and Data Streams
Criteria	Ratings	Points
Exercise 1:	Points: 40.0 Name: Full points	Points: 30.0 Name: Partial points	Points: 20.0 Name: Partial points	Points: 0.0 Name: No points	40.0 pts
MapReduce implementation works and the output meets all the requirements.	MapReduce implementation works but meets only some requirements and includes small mistakes.	Shows understanding of how to solve the task, but the MapReduce implementation only partially works.	No working implementation of MapReduce.
Exercise 2:	Points: 21.0 Name: Full points	Points: 14.0 Name: Partial points	Points: 7.0 Name: Partial points	Points: 0.0 Name: No points	21.0 pts
All results are correct.	2/3 of the results are correct.	1/3 of the results are correct.	No correct results.
Exercise 3:	Points: 20.0 Name: Full points	Points: 15.0 Name: Partial points	Points: 10.0 Name: Partial points	Points: 0.0 Name: No points	20.0 pts
Both parts are completely correct.	One of the parts is completely correct, the other has small mistakes.	One of the parts is partially correct; or, both are not completely correct but follow the correct approach.	Both parts are incorrect and the approach to solve them is incorrect.
Total:	81 pts

Get expert help for SIMILAR ITEMS AND DATA STREAMS and many more. 24X7 help, plag free solution. Order online now!

APAMLAHarvardVancouverChicagoIEEE

Universal Assignment (July 1, 2025) MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS. Retrieved from https://universalassignment.com/mapreduce-similar-items-and-data-streams/.

"MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS." Universal Assignment - July 1, 2025, https://universalassignment.com/mapreduce-similar-items-and-data-streams/

Universal Assignment March 12, 2023 MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS., viewed July 1, 2025,<https://universalassignment.com/mapreduce-similar-items-and-data-streams/>

Universal Assignment - MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS. [Internet]. [Accessed July 1, 2025]. Available from: https://universalassignment.com/mapreduce-similar-items-and-data-streams/

"MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS." Universal Assignment - Accessed July 1, 2025. https://universalassignment.com/mapreduce-similar-items-and-data-streams/

"MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS." Universal Assignment [Online]. Available: https://universalassignment.com/mapreduce-similar-items-and-data-streams/. [Accessed: July 1, 2025]

Please note along with our service, we will provide you with the following deliverables:

A premium expert will be assigned to complete your assignment.
Quality Control team will check the assignment on a regular basis before the delivery.
Plagiarism-free assignment will be provided to you with the Turnitin report.
Free revision policy will be provided in case you need any changes or amendments upto 15 days of final submission.
We strictly follow the assignment's guideline.
Your assignment will be delivered before the deadline provided.
We are here to help you 24X7 around the clock, 365 days a year.

Please do not hesitate to put forward any queries regarding the service provision.

We look forward to having you on board with us.

Recent Assignments

Get 90%* Discount on Assignment Help

Most Frequent Questions & Answers

Why Choose Universal Assignment?

Universal Assignment Services is the best place to get help in your all kind of assignment help. We have 172+ experts available, who can help you to get HD+ grades. We also provide Free Plag report, Free Revisions,Best Price in the industry guaranteed.

What services do you offer in my Assignment?

We provide all kinds of assignmednt help, Report writing, Essay Writing, Dissertations, Thesis writing, Research Proposal, Research Report, Home work help, Question Answers help, Case studies, mathematical and Statistical tasks, Website development, Android application, Resume/CV writing, SOP(Statement of Purpose) Writing, Blog/Article, Poster making and so on.

If I need and urgent revision or support, how can I approach you?

We are available round the clock, 24X7, 365 days. You can appach us to our Whatsapp number +1 (613)778 8542 or email to info@universalassignment.com . We provide Free revision policy, if you need and revisions to be done on the task, we will do the same for you as soon as possible.

Do you Familier with my University rules?

We provide services mainly to all major institutes and Universities in Australia, Canada, China, Malaysia, India, South Africa, New Zealand, Singapore, the United Arab Emirates, the United Kingdom, and the United States.

How much discount do you provide for the task?

We provide lucrative discounts from 28% to 70% as per the wordcount, Technicality, Deadline and the number of your previous assignments done with us.

How does this works?

After your assignment request our team will check and update you the best suitable service for you alongwith the charges for the task. After confirmation and payment team will start the work and provide the task as per the deadline.

Will you provide Plag report for the task?

Yes, we will provide Plagirism free task and a free turnitin report along with the task without any extra cost.

Do I have to pay any additional amount for the revisions or changes?

No, if the main requirement is same, you don’t have to pay any additional amount. But it there is a additional requirement, then you have to pay the balance amount in order to get the revised solution.

What is the minimum charge for the service.

The Fees are as minimum as $10 per page(1 page=250 words) and in case of a big task, we provide huge discounts.

How will I pay for the service?

We accept all the major Credit and Debit Cards for the payment. We do accept Paypal also.

Popular Assignments

Nursing Ethics and Law – Henry Pearson Case Study

Nursing Ethics and Law – Henry Pearson Case Study Course Code & NameNUR1103 |Context of Professional PracticeAssessment Item and NameAssessment THREE | Case StudyAssessment Item TypeEssay/ Case studyDue Date & TimeWeek 10 | 15th March 23:59 hrsLengthEssay is 1200 words + or – 10%Marks and WeightingOverall mark is out of

NUR3397 – Complex Care Case Study Presentation

Course Code & NameNUR3397 |Complex Care AAssessment Item and NameAssessment TWO | PresentationAssessment Item TypeIndividual oral presentationDue Date & TimeWeek 10 | 22nd April 23:59 hrsResults data will be returned to you three weeks after your submission dateLength12-15 minute oral presentation recorded to ZOOM cloud + or – 10%Marks and

AI in Recruitment: Legal and Ethical Implications for Harmony Haven

PurposeThis assessment helps you demonstrate report-writing skills essential for HR and other professional roles. It develops your research abilities, including sourcing, reviewing, and synthesizing academic and non-academic literature. Strong report-writing skills support informed business decisions, enhancing your ability to assist managers and advance your career. AI in Recruitment: Legal and

Youth Justice Crisis: Indigenous Incarceration in Australia

issues During Impact Root cause Youth justice crisis ongoing Disproportionate indigenous youth incarcerations reports of abuse eg Don Dale Low age of criminal responsibility (10) – Systemic racism and overpolicing – Lack of diversion and rehabilitation pathways Word: 1000 Topic selected: Youth Justic Crisis, Assessment 1: Conflict Analysis Exercise –

PPMP20008 Assessment 3 Assignment 3: Project Plan for Tarneit Community Centre

ASSESSMENT#3 – TERM 1 2025 WRITTEN ASSIGNMENT – DESCRIPTION Assessment type: Group work – Project plan Word limit: Part A: Presentation Equivalent to 500 words |Part B: Project plan 4000 words ± 5% Due date: Week 11 Friday 11:45 pm AEST Late submission: Mark deduction of 10% per

PV System Design and Energy Analysis for Residential Use

Executive Summary Provide a brief summary of the key methods and key results, max 500 words. 1. Introduction (aims and objectives and brief description of the system studied and methods of the next sections) approximately half a page 2. Solar irradiation analysis Provide location and data used. Provide hourly GHI,

Assignment 3: Statistical Analysis and Recommendations for Enhancing HDI

Student Name: Your full name Student ID: Your Student ID Make sure to delete the instructions!! Introduction: Include a succinct introduction at the start of your report. You may write a few sentences about purpose of this report, the type of analysis, or any other relevant information (about 50 words).

Brian Old Age Case study Assignment

Assessment 1 – Written AssessmentAssessment TypePurposeDescriptionWritten AssignmentThe purpose of this assessment is to broaden each student’s understanding of the modulecontent using a case study and assessment toolsCase Study: Brian is an 84-year-old retired farmer in a rural area in Northern Territory. Hewas recently assessed following a minor motor vehicle accident

Assessment name: Portfolio of planning cycle

Assessment name: Portfolio of planning cycleDue Date: Friday 13 June 11:59pmWeighting: 50%Length: 2000 wordsTask Description: This Portfolio is comprised of two tasks. You must submit your assessment as onedocument. Task 1: Anecdotal record and learning experienceAnecdotal recordView the video of pre-schoolers provided under the link “Video for Assessment 2” andcomplete

NUR5327 Assessment 3 Assignment Help

Name NUR5327 Assessment 3 (Essay)Purpose The purpose of this assessment is to demonstrate your understanding of therolesof leadership and management in healthcare by identifying and analysinga change you have actively participated in, and how it relates to key topicssuch as interprofessional communication, evidence-based practice, and staffdevelopment.LearningOutcomes NUR5327 Assessment 3 Assignment

Mathematics Investigation and Reflection Assignment Help

Submission: Mathematics Investigation and Reflection Assignment Help TurnitinFormat:Individual written document.Uses the current APA referencing style correctly.Length:2,000 wordsThreshold Detail:For this assessment task you must obtain at least 50% of the overall result (i.e. 25 points). If the total result for this unit is at least 50 points but you scored less

FASS Research Proposal Template Assignment

FASS Research Proposal Template Word length2000 to 3000 wordsTitleUse a concise and descriptive title that accurately reflects the content of the proposal.Background context and significanceThis section should explain the background and context of the proposed research work,indicating the main contribution to knowledge you wish to make.Aims and objectivesInclude a clear

Evidence to Inform Nursing Practice Assignment Help

Unit Code: NURS12165 Unit Title: Evidence to Inform Nursing Practice Assessment Three Type: Written Assessment Due date: Week 11: Wednesday, 28 May 2025 at 1600 (AEST) Extensions: Available as per policy Return date: Results for this assessment will be made available on Wednesday, 18 June 2025 Weighting: 50% Length:

NUR1120 | Burden of Disease and Health Equity

Assessment Item Task SheetCourse code andnameNUR1120 | Burden of Disease and Health Equity Assessment itemand nameAssessment Three | ReportDue date and time Week 11 | 22/04/2025 at 2359 hours AESTLength 1400 words (+/- 10% in each section) – includes in-text references, but not reference list.Marks out of:Weighting:80 Marks50%Assessed CourseLearning Outcomes(CLO)CLO1,

PSY1040 Portfolio: Cultural Responsiveness & Self-Awareness

Course Code and NamePSY1040: An Introduction to Cultural Safety in PracticeAssessment Item Number and NameAssessment 2: PortfolioAssessment Item TypePortfolio PSY1040 Portfolio: Cultural Responsiveness & Self-AwarenessDue Date & TimeTuesday, 29 April 2025 (Week 12), 11:59pmLength2000 words – an average of 400 words per task.Marks and WeightingMarked out of: 100Weighting: 50%Assessed Course

Innovative Digital App Development Report

OVERALL DESCRIPTION OF TYPE OF ASSIGNMENT Assessment 1- Type of Assignment Individual Written Report Details Individual Written Report 3,000 words (500 words of the Report is Contextualisation) Weighting of Assessment : 70% INDIVIDUAL MARK Learning outcomes assessed by Assessment: 1, 2, 3 and 4 – See Module Listings of Learning

SOM7001A – The Sports Business Environment

Assessment Brief – Assignment Two (Individual Report) SOM7001A – The Sports Business Environment

MATH1316: Practical package utilisation and report writing on control charting

Overview of Assignment and Assessment Criteria MATH1316: Practical package utilisation and report writing on control charting Learning Outcomes Feedback and grades Feedback on your assignment and your grade will be released via the Grades item in the left menu. (a) Analyse these data using Individual, Moving Range and Cumulative Control charts. What

Tourism Trends and Investment Decisions: A Comparative Study

Assignment TaskYou are a strategist working for a major hospitality group based in Australia. The company is planninginternational expansion, and the board has asked you to compile a report to identify the most suitablelocation for the project. The board has shortlisted two international locations (which will be allocatedto you by

EC502 Language and Literacy in the Early Years

EC502 Language and Literacy in the Early Years Unit Code/Description EC502 Language and Literacy in the Early Years Course/Subject Bachelor of Early Childhood Education Semester March 2025 Assessment Overview Unit Learning Outcomes Addressed 1, 2, 3 Assessment Objective Assessment 1: Poster Including an Invigilated stage in Week 3. Due

EC501 Early Childhood Learning and Development

Unit Code/Description EC501 Early Childhood Learning and Development Course/Subject Graduate Diploma in Education (early childhood) Semester S 1, 2025 Assessment Overview Unit Learning Outcomes Addressed 1, 2, 3 Assessment Objective In this assessment, student are required to select one of the case studies provided and critically analyze the child’s

JSB172: Professional Academic Skills

JSB172: Professional Academic SkillsAssessment: Workplace Report and Presentation Weight: 50%Due date: Friday 30th May 11:59pm Length: 1,750 words (+/- 10 %) / 5minutesPurpose/Learning Objectives:This assessment relates to Learning Outcomes 1, 2, 3, and 4: Task:Your task is to write a Workplace Report identifying how to address the topic/issue chosen or

2015PSY Developmental Psychology Assignment

2015PSY Developmental Psychology Assignment 2025 2015PSY Developmental Psychology Assignment Assignment MaterialsAssignment Information Sheet & Marking Criteria.pdf (this document)Assignment Template.docx (template)Example Assignment.pdf (HD exemplar)Due Date: Friday 16 May, 11:59PM (Week 10)Weighting: Marked out of 100 (worth 30% of course grade)Word Count: 1,500 words maximum(inclusive of main text, headings, in-text citations; excluding

Principles of Economics Federal Budget

Principles of Economics Short-answer Assignment V1 (20% of final mark) The assignment consists of four questions. You should allocate at least half a page (or 250 words) to each answer or 1000 words for all four answers depending on the nature of and/or marks allocated for the question/s. You may

LML6003 – AUSTRALIA’S VISA SYSTEM 1 (FAMILY AND OTHERVISAS)

Graduate Diploma in Migration Law LML6003 – AUSTRALIA’S VISA SYSTEM 1 (FAMILY AND OTHER VISAS) Assessment Task 2 – Semester 1, 2025 LML6003 – AUSTRALIA’S VISA SYSTEM 1 (FAMILY AND OTHERVISAS) Instructions: 1. Students must answer all questions as indicated. Make certain all answers are clearly labelled. 2. Make certain

Construction Cadetships in the Australian Construction Industry

REPORT TOPICPrepare an Academic Report on the following:‘Construction Cadetships in the Australian Construction Industry’.The report should encompass the following: Your personal evaluation and critique of the key findings in your report including your evaluation of construction cadetships, yourfindings in relation to potential issues/problems with cadetships and your recommendations to improve

Assessing Corporate Governance and its Significance

Assessing Corporate Governance and its Significance: A Case Study Analysis Overview: Accounting irregularities have cost investors millions of dollars and, most importantly, adversely impacted their confidence in the financial system. While there have been remarkable improvements in regulatory supervision, auditing framework and reporting transparency, young graduates must assess major corporate

Master of Professional Accounting and Accounting Advanced

Assessment 2 – Business Case (CVP) AnalysisUnit Code/Description ACC901 Accounting for Managerial DecisionsCourse/Subject Master of Professional Accounting and Master of Professional Accounting AdvancedSemester S1 2025 Assessment Overview Unit Learning OutcomesAddressed1,2,3,4 and 5Assessment Objective The primary objective of this assessment is to assess the students’ ability to apply CVPanalysis and relevant

Urban Design Theory Essay writing

Essays are a major form of assessment at university. Through essays, you develop your understanding of discipline-specific content, strengthen your critical thinking, and develop your ability to translate that thinking into a persuasive written form. This assignment assesses your understanding of the following Unit Learning Outcomes: 1) understand the historic

Statutory Interpretation of Disability Discrimination in NSW Law

Foundations of Law 70102 – Assessment Task 3 – Autumn 2025Statutory Interpretation and Research ExerciseDue: Thursday 22 May 2025 by 23.59Length: 2000 words (excluding the headings Part A, Part B and Part C, footnotes andbibliography. Any additional headings that you decide to use will be included in the wordcount)Weighting: 40%Task

MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS

Purpose

Your Task

Exercise 1 – Friend Recommendation System (Stanford) (40 points)

Input:

Output:

For your submission

Exercise 2 S-curve (exercise 3.4.1 in Leskovec, Rajaraman and Ullman) (7+7+7 points)

Exercise 3 Filtering Streams (similar to Exercises of 4.3 in Leskovec, Rajaraman and Ullman) (10 + 10 points)

Outcomes

Requirements

Academic Integrity

Grading Criteria

Please note along with our service, we will provide you with the following deliverables:

Recent Assignments

Categories

Get 90%* Discount on Assignment Help

Most Frequent Questions & Answers

Popular Assignments

Can't Find Your Assignment?

Universal Assignment

Quick links

Services

Legal

Follow Us

Best Assignment Help services available in