MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS

SIMILAR ITEMS AND DATA STREAMS

ASSESSMENT 1 – MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS

Due: Week 2, Wednesday at 11.59 pm Weighting: 20%

Purpose

The purpose of this assessment is to evaluate your learning and knowledge of using the MapReduce technique mentioned in Week 1, finding similar items and mining data streams.

Your Task

Your task is to complete the following exercises:

Exercise 1 – Friend Recommendation System (Stanford) (40 points)

  • Write a MapReduce program in Spark (see Overview Module for download instructions) that implements a simple “People You Might Know” social network friendship recommendation algorithm.

The key idea is that if two people have a lot of mutual friends, then the system should recommend that they connect with each other.

Input:

  • ·       Download the input file from the link: http://snap.stanford.edu/class/ cs246- data/hw1q1.zip. The input file contains the adjacency list and has multiple lines in the following format: <User><TAB><Friends>
  • ·       Here, <User> is a unique integer ID corresponding to a unique user, and <Friends> is a comma separated list of unique IDs corresponding to the friends of the user with the unique ID <User>.
  • ·       Note that the friendships are mutual (i.e., edges are undirected): if A is friends with B then B is also friends with A. Algorithm: Let us use a simple algorithm such that, for each user U, the algorithm recommends N = 10 users who are not already friends with U, but have the greatest number of mutual friends in common with U.

Output:

  • The output should contain one line per user in the following format:

<User><TAB><Recommendations> where <User> is a unique ID corresponding to a user and <Recommendations> is a comma separated list of unique IDs corresponding to the algorithm’s recommendation of people that <User> might know, ordered in decreasing number of mutual friends.

  • Even if a user has less than 10 second-degree friends, output all of them in decreasing order of the number of mutual friends.
  • If there are recommended users with the same number of mutual friends, then output those user IDs in numerically ascending order.
  • Also, please provide a description of how you are going to use MapReduce jobs to solve this problem. Do not write more than 3 to 4 sentences for this: only a very high- level description of your strategy to tackle this problem.

For your submission

  • Include your source code
  • Include in your writeup a short paragraph describing your algorithm to tackle this problem.
  • Include in your writeup the recommendations for the users with following user IDs: 924, 8941, 8942, 9019, 9020, 9021, 9022, 9990, 9992, 9993.

Exercise 2 S-curve (exercise 3.4.1 in Leskovec, Rajaraman and Ullman) (7+7+7 points)

Evaluate the S-curve 1 − (1 − sr)b for s = 0.1, 0.2, . . ., 0.9, for the following values of r and b;

  • r=3 and b=10.
  • r=6 and b=20.
  • r=5 and b=50.

Exercise 3 Filtering Streams (similar to Exercises of 4.3 in Leskovec, Rajaraman and Ullman) (10 + 10 points)

  1. For the situation of the running example of Section 4.3.1 in Leskovec, Rajaraman and Ullman with changed conditions (10 billion bits, 2 billion members of the set S).

Calculate the false-positive rate when using three hash functions. Do the same for four hash functions.

  • As a function of n, the number of bits and m the number of members in the set S, what number of hash functions minimizes the false-positive rate?

Reference:

Leskovec, J., Rajaraman, A. and Ullman, J.D., 2020. Mining of massive data sets. Cambridge university press.

Outcomes

This task addresses the following course learning outcomes.

Course Learning Outcomes
CLO1
Explain algorithms for big data sets and methodologies in the context of data mining.
CLO3   Develop and integrate algorithms as a part of software development for mining big data.
CLO5   Utilise contemporary technologies and practices to effectively handle big datasets.

Requirements

  • You must submit your assessment using the relevant portal in MyUni/Canvas.
  • All written assessments (excluding quizzes) must be submitted using a text document e.g., doc, docx, pdf via the link at the top of the page.
  • Consult the assessment rubric when preparing your submission.
  • Questions can be posted to the relevant assessment Discussion Board.

Academic Integrity

Please ensure that you have read the Academic Integrity Policy.

Grading Criteria

This assessment is worth 20% of your overall grade. Refer to the attached rubric for detailed information on the grading criteria for this assessment.

Rubric title: Assessment 1 — MapReduce, Similar Items and Data Streams
CriteriaRatingsPoints
Exercise 1:Points: 40.0 Name: Full pointsPoints: 30.0 Name: Partial pointsPoints: 20.0 Name: Partial pointsPoints: 0.0 Name: No points40.0 pts
MapReduce implementation works and the output meets all the requirements.MapReduce implementation works but meets only some requirements and includes small mistakes.Shows understanding of how to solve the task, but the MapReduce implementation only partially works.No working implementation of MapReduce.
Exercise 2:Points: 21.0 Name: Full pointsPoints: 14.0 Name: Partial pointsPoints: 7.0 Name: Partial pointsPoints: 0.0 Name: No points21.0 pts
All results are correct.2/3 of the results are correct.1/3 of the results are correct.No correct results.
Exercise 3:Points: 20.0 Name: Full pointsPoints: 15.0 Name: Partial pointsPoints: 10.0 Name: Partial pointsPoints: 0.0 Name: No points20.0 pts
Both parts are completely correct.One of the parts is completely correct, the other has small mistakes.One of the parts is partially correct; or, both are not completely correct but follow the correct approach.Both parts are incorrect and the approach to solve them is incorrect.
Total:81 pts
Order Now

Get expert help for SIMILAR ITEMS AND DATA STREAMS and many more. 24X7 help, plag free solution. Order online now!

Universal Assignment (August 31, 2025) MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS. Retrieved from https://universalassignment.com/mapreduce-similar-items-and-data-streams/.
"MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS." Universal Assignment - August 31, 2025, https://universalassignment.com/mapreduce-similar-items-and-data-streams/
Universal Assignment March 12, 2023 MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS., viewed August 31, 2025,<https://universalassignment.com/mapreduce-similar-items-and-data-streams/>
Universal Assignment - MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS. [Internet]. [Accessed August 31, 2025]. Available from: https://universalassignment.com/mapreduce-similar-items-and-data-streams/
"MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS." Universal Assignment - Accessed August 31, 2025. https://universalassignment.com/mapreduce-similar-items-and-data-streams/
"MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS." Universal Assignment [Online]. Available: https://universalassignment.com/mapreduce-similar-items-and-data-streams/. [Accessed: August 31, 2025]

Please note along with our service, we will provide you with the following deliverables:

Please do not hesitate to put forward any queries regarding the service provision.

We look forward to having you on board with us.

Most Frequent Questions & Answers

Universal Assignment Services is the best place to get help in your all kind of assignment help. We have 172+ experts available, who can help you to get HD+ grades. We also provide Free Plag report, Free Revisions,Best Price in the industry guaranteed.

We provide all kinds of assignmednt help, Report writing, Essay Writing, Dissertations, Thesis writing, Research Proposal, Research Report, Home work help, Question Answers help, Case studies, mathematical and Statistical tasks, Website development, Android application, Resume/CV writing, SOP(Statement of Purpose) Writing, Blog/Article, Poster making and so on.

We are available round the clock, 24X7, 365 days. You can appach us to our Whatsapp number +1 (613)778 8542 or email to info@universalassignment.com . We provide Free revision policy, if you need and revisions to be done on the task, we will do the same for you as soon as possible.

We provide services mainly to all major institutes and Universities in Australia, Canada, China, Malaysia, India, South Africa, New Zealand, Singapore, the United Arab Emirates, the United Kingdom, and the United States.

We provide lucrative discounts from 28% to 70% as per the wordcount, Technicality, Deadline and the number of your previous assignments done with us.

After your assignment request our team will check and update you the best suitable service for you alongwith the charges for the task. After confirmation and payment team will start the work and provide the task as per the deadline.

Yes, we will provide Plagirism free task and a free turnitin report along with the task without any extra cost.

No, if the main requirement is same, you don’t have to pay any additional amount. But it there is a additional requirement, then you have to pay the balance amount in order to get the revised solution.

The Fees are as minimum as $10 per page(1 page=250 words) and in case of a big task, we provide huge discounts.

We accept all the major Credit and Debit Cards for the payment. We do accept Paypal also.

Popular Assignments

Assignment Help in Rowville, Melbourne

Introduction Rowville is a well-established suburb located around 27 km south-east of Melbourne’s CBD. Known for its family-friendly vibe, spacious parks like Stud Park Reserve, and excellent community facilities, Rowville offers students a balanced mix of suburban comfort and easy access to education hubs. With major roads such as the

Read More »

Assignment Help in Rosanna, Melbourne

Introduction Rosanna is a leafy and family-friendly suburb located about 12 km north-east of Melbourne’s CBD. Known for its green streets, Rosanna Parklands, and strong community atmosphere, it provides students with a calm environment while staying close to the city. The suburb is well connected by Rosanna Station and major

Read More »

Assignment Help in Ripponlea, Melbourne

Introduction Ripponlea is a charming inner-city suburb located just 8 km south-east of Melbourne’s CBD. Famous for the heritage-listed Rippon Lea Estate and its leafy residential streets, Ripponlea offers students a mix of cultural richness and convenient city living. With easy tram and train connections, students can travel effortlessly to

Read More »

Assignment Help in Ringwood North, Melbourne

Introduction Ringwood North is a leafy and family-friendly suburb located about 28 km east of Melbourne’s CBD. Known for its green streets, parks, and welcoming community, Ringwood North offers a peaceful residential lifestyle while remaining close to Melbourne’s educational and business hubs. With strong transport connections via nearby Ringwood Station

Read More »

Assignment Help in Ringwood East, Melbourne

Introduction Ringwood East is a peaceful residential suburb located around 25 km east of Melbourne’s CBD. Known for its leafy streets, friendly community, and excellent access to parks and schools, Ringwood East is ideal for students seeking a quieter lifestyle while still being close to Melbourne’s educational hubs. With its

Read More »

Assignment Help in Ringwood, Melbourne

Introduction Ringwood is a bustling suburb located around 25 km east of Melbourne’s CBD. Known for its leafy residential streets, modern shopping centres like Eastland, and easy access to the scenic Dandenong Ranges, Ringwood offers a great lifestyle for students and professionals. The suburb is well connected via Ringwood Station

Read More »

Assignment Help in Richmond, Melbourne

Introduction Richmond is one of Melbourne’s most iconic inner-city suburbs, located just 3 km east of the CBD. Famous for its lively café culture, bustling shopping streets like Bridge Road, and its thriving arts and sports scene, Richmond blends lifestyle and convenience perfectly. With excellent tram and train connections, students

Read More »

Assignment Help in Reservoir, Melbourne

Assignment Help in Reservoir, MelbourneIntroductionReservoir is a thriving suburb located just 12 kilometres north of Melbourne’s CBD. Known for its multicultural community, leafy parks, and welcoming neighbourhoods, Reservoir strikes the perfect balance between suburban comfort and city convenience. It is well-connected through trains, buses, and trams, making travel easy for

Read More »

Assignment Help in Parkville – Universal Assignment

Parkville, located just 3 km north of Melbourne CBD, is one of Melbourne’s most prestigious academic hubs. It is home to The University of Melbourne, Monash University (Pharmacy campus), and major research institutes like the Walter and Eliza Hall Institute and the Royal Melbourne Hospital precinct. With such a concentration

Read More »

Assignment Help in Oakleigh East – Universal Assignment

Oakleigh East, located about 17 km south-east of Melbourne CBD, is a peaceful and residential suburb in the City of Monash, known for its family-friendly environment and convenient access to shopping and educational facilities. With nearby institutions like Monash University, Holmesglen Institute, and TAFE Victoria, many students living in Oakleigh

Read More »

Assignment Help in Preston – Universal Assignment

Preston, located about 9 km north of Melbourne’s CBD, is a thriving multicultural suburb that attracts many university and TAFE students. With the presence of Melbourne Polytechnic (Preston Campus) and close proximity to La Trobe University (Bundoora campus) and RMIT University, Preston has a large student population. While the suburb

Read More »

Assignment Help in Point Cook – Universal Assignment

Point Cook, located around 25 km southwest of Melbourne’s CBD, is a fast-growing residential suburb in the City of Wyndham. Known for its modern housing estates, coastal beauty, and cultural diversity, Point Cook is also home to many university and TAFE students. With Victoria University, RMIT, Deakin University, and the

Read More »

Assignment Help in Pascoe Vale South – Universal Assignment

Pascoe Vale South, situated about 9 km north of Melbourne’s CBD, is a quiet, family-friendly suburb within the City of Merri-bek. Known for its leafy streets, community parks, and excellent schools, the suburb is also home to many university students due to its proximity to RMIT University, La Trobe University,

Read More »

Assignment Help in Pascoe Vale – Universal Assignment

Pascoe Vale, located just 10 km north of Melbourne’s CBD, is a thriving suburb in the City of Merri-bek. It is a popular residential area for students due to its proximity to leading universities such as RMIT University, La Trobe University, and the University of Melbourne. With excellent transport links

Read More »

Assignment Help in Prahran – Universal Assignment

Prahran, located just 5 km southeast of Melbourne’s CBD, is one of the city’s most vibrant inner suburbs. Known for its trendy cafes, boutique shopping on Chapel Street, and buzzing nightlife, Prahran also has a large student population. Its close proximity to Monash University (Caulfield campus), Swinburne University of Technology,

Read More »

Assignment Help in Parkdale – Universal Assignment

Parkdale, located around 23 km south-east of Melbourne CBD, is a beautiful beachside suburb in the City of Kingston. Known for its relaxed lifestyle, coastal charm, and proximity to Monash University, Holmesglen Institute, and Deakin University, Parkdale is home to many students balancing academic studies with personal and professional commitments.

Read More »

Assignment Help in Oakleigh South – Universal Assignment

Oakleigh South, located about 20 km south-east of Melbourne CBD, is a suburban area in the City of Kingston, known for its family-friendly environment, parks, and convenient access to educational facilities. With nearby institutions like Monash University, Holmesglen Institute, and TAFE campuses, students in Oakleigh South often require professional assignment

Read More »

Assignment Help in Ormond – Universal Assignment

Ormond, located about 12 km south-east of Melbourne CBD, is a well-connected residential suburb in the City of Glen Eira. Known for its proximity to Monash University Caulfield Campus, Holmesglen Institute, and local schools, Ormond attracts both local and international students. With increasing academic demands, many students search for assignment

Read More »

Assignment Help in Oakleigh – Universal Assignment

Oakleigh, located about 14 km south-east of Melbourne CBD, is a lively suburb in the City of Monash, known for its multicultural community, shopping precincts, and proximity to educational institutions. With access to Monash University, Holmesglen Institute, and nearby TAFE campuses, many students in Oakleigh seek professional assignment help in

Read More »

Assignment Help in Oak Park – Universal Assignment

Oak Park, located about 12 km north of Melbourne CBD, is a suburban area in the City of Moreland, known for its peaceful residential streets, schools, and local amenities. With access to nearby institutions such as RMIT University, University of Melbourne, and TAFE campuses, many students in Oak Park seek

Read More »

Assignment Help in Nunawading – Universal Assignment

Nunawading, located about 20 km east of Melbourne CBD, is a thriving suburb in the City of Whitehorse, known for its leafy streets, shopping centres, and proximity to educational institutions. With access to Deakin University, Box Hill Institute, and nearby TAFE campuses, students in Nunawading often seek professional assignment help

Read More »

Assignment Help in Notting Hill – Universal Assignment

Notting Hill, located about 22 km south-east of Melbourne CBD, is a residential suburb in the City of Monash, known for its peaceful environment and proximity to shopping centres, schools, and educational institutes. With access to nearby campuses like Monash University, Holmesglen Institute, and TAFE Victoria, many students in Notting

Read More »

Assignment Help in Northcote – Universal Assignment

Northcote, located about 7 km north-east of Melbourne CBD, is a vibrant suburb known for its multicultural community, trendy cafes, and lively student population. With easy access to RMIT University, University of Melbourne, and nearby TAFE institutes, Northcote has become a popular area for students pursuing higher education. Many students

Read More »

Assignment Help in North Melbourne – Universal Assignment

North Melbourne, located just 2 km north-west of Melbourne CBD, is a bustling inner-city suburb known for its historic architecture, multicultural community, and proximity to major educational institutions. With easy access to RMIT University, University of Melbourne, and nearby TAFE campuses, students in North Melbourne often require professional assignment help

Read More »

Assignment Help in Noble Park – Universal Assignment

Noble Park, located about 25 km south-east of Melbourne CBD, is a thriving suburb in the City of Greater Dandenong. Known for its multicultural community, shopping centres, and schools, Noble Park is home to a growing student population. With access to nearby institutions like Monash University, Chisholm Institute, and TAFE

Read More »

Assignment Help in Niddrie – Universal Assignment

Niddrie, located about 10 km north-west of Melbourne CBD, is a bustling suburb known for its residential communities, shopping centres, and schools. With easy access to Victoria University, RMIT, and nearby TAFE institutes, Niddrie is home to many students who often seek assignment help in Niddrie to manage academic workloads

Read More »

Assignment Help in Narre Warren South – Universal Assignment

Narre Warren South, located 38 km south-east of Melbourne CBD, is a growing suburb in the City of Casey. Known for its residential communities, schools, and green spaces, Narre Warren South is home to many students who pursue higher education at nearby institutions such as Monash University Clayton Campus, Federation

Read More »

Assignment Help in Narre Warren North – Universal Assignment

Narre Warren North, located about 36 km south-east of Melbourne CBD, is a fast-growing suburb in the City of Casey. Known for its family-friendly community, excellent schools, and green spaces, the suburb also attracts students pursuing higher education in Melbourne. With access to nearby institutions like Monash University Clayton Campus,

Read More »

Assignment Help in Narre Warren – Universal Assignment

Narre Warren, located about 38 km south-east of Melbourne CBD, is one of the fastest-growing suburbs in the City of Casey. Known for Fountain Gate Shopping Centre, family-friendly communities, and excellent schools, Narre Warren also attracts a large number of students pursuing higher education in Melbourne and surrounding areas. With

Read More »

Assignment Help in Mulgrave – Universal Assignment

Mulgrave, located 21 km south-east of Melbourne CBD, is a vibrant suburb with a mix of residential, business, and educational opportunities. With close access to Monash University Clayton Campus, Holmesglen Institute, and nearby TAFE colleges, Mulgrave attracts a large number of students. Many students here face challenges with assignments and

Read More »

Can't Find Your Assignment?