MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS

SIMILAR ITEMS AND DATA STREAMS

ASSESSMENT 1 – MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS

Due: Week 2, Wednesday at 11.59 pm Weighting: 20%

Purpose

The purpose of this assessment is to evaluate your learning and knowledge of using the MapReduce technique mentioned in Week 1, finding similar items and mining data streams.

Your Task

Your task is to complete the following exercises:

Exercise 1 – Friend Recommendation System (Stanford) (40 points)

  • Write a MapReduce program in Spark (see Overview Module for download instructions) that implements a simple “People You Might Know” social network friendship recommendation algorithm.

The key idea is that if two people have a lot of mutual friends, then the system should recommend that they connect with each other.

Input:

  • ·       Download the input file from the link: http://snap.stanford.edu/class/ cs246- data/hw1q1.zip. The input file contains the adjacency list and has multiple lines in the following format: <User><TAB><Friends>
  • ·       Here, <User> is a unique integer ID corresponding to a unique user, and <Friends> is a comma separated list of unique IDs corresponding to the friends of the user with the unique ID <User>.
  • ·       Note that the friendships are mutual (i.e., edges are undirected): if A is friends with B then B is also friends with A. Algorithm: Let us use a simple algorithm such that, for each user U, the algorithm recommends N = 10 users who are not already friends with U, but have the greatest number of mutual friends in common with U.

Output:

  • The output should contain one line per user in the following format:

<User><TAB><Recommendations> where <User> is a unique ID corresponding to a user and <Recommendations> is a comma separated list of unique IDs corresponding to the algorithm’s recommendation of people that <User> might know, ordered in decreasing number of mutual friends.

  • Even if a user has less than 10 second-degree friends, output all of them in decreasing order of the number of mutual friends.
  • If there are recommended users with the same number of mutual friends, then output those user IDs in numerically ascending order.
  • Also, please provide a description of how you are going to use MapReduce jobs to solve this problem. Do not write more than 3 to 4 sentences for this: only a very high- level description of your strategy to tackle this problem.

For your submission

  • Include your source code
  • Include in your writeup a short paragraph describing your algorithm to tackle this problem.
  • Include in your writeup the recommendations for the users with following user IDs: 924, 8941, 8942, 9019, 9020, 9021, 9022, 9990, 9992, 9993.

Exercise 2 S-curve (exercise 3.4.1 in Leskovec, Rajaraman and Ullman) (7+7+7 points)

Evaluate the S-curve 1 − (1 − sr)b for s = 0.1, 0.2, . . ., 0.9, for the following values of r and b;

  • r=3 and b=10.
  • r=6 and b=20.
  • r=5 and b=50.

Exercise 3 Filtering Streams (similar to Exercises of 4.3 in Leskovec, Rajaraman and Ullman) (10 + 10 points)

  1. For the situation of the running example of Section 4.3.1 in Leskovec, Rajaraman and Ullman with changed conditions (10 billion bits, 2 billion members of the set S).

Calculate the false-positive rate when using three hash functions. Do the same for four hash functions.

  • As a function of n, the number of bits and m the number of members in the set S, what number of hash functions minimizes the false-positive rate?

Reference:

Leskovec, J., Rajaraman, A. and Ullman, J.D., 2020. Mining of massive data sets. Cambridge university press.

Outcomes

This task addresses the following course learning outcomes.

Course Learning Outcomes
CLO1
Explain algorithms for big data sets and methodologies in the context of data mining.
CLO3   Develop and integrate algorithms as a part of software development for mining big data.
CLO5   Utilise contemporary technologies and practices to effectively handle big datasets.

Requirements

  • You must submit your assessment using the relevant portal in MyUni/Canvas.
  • All written assessments (excluding quizzes) must be submitted using a text document e.g., doc, docx, pdf via the link at the top of the page.
  • Consult the assessment rubric when preparing your submission.
  • Questions can be posted to the relevant assessment Discussion Board.

Academic Integrity

Please ensure that you have read the Academic Integrity Policy.

Grading Criteria

This assessment is worth 20% of your overall grade. Refer to the attached rubric for detailed information on the grading criteria for this assessment.

Rubric title: Assessment 1 — MapReduce, Similar Items and Data Streams
CriteriaRatingsPoints
Exercise 1:Points: 40.0 Name: Full pointsPoints: 30.0 Name: Partial pointsPoints: 20.0 Name: Partial pointsPoints: 0.0 Name: No points40.0 pts
MapReduce implementation works and the output meets all the requirements.MapReduce implementation works but meets only some requirements and includes small mistakes.Shows understanding of how to solve the task, but the MapReduce implementation only partially works.No working implementation of MapReduce.
Exercise 2:Points: 21.0 Name: Full pointsPoints: 14.0 Name: Partial pointsPoints: 7.0 Name: Partial pointsPoints: 0.0 Name: No points21.0 pts
All results are correct.2/3 of the results are correct.1/3 of the results are correct.No correct results.
Exercise 3:Points: 20.0 Name: Full pointsPoints: 15.0 Name: Partial pointsPoints: 10.0 Name: Partial pointsPoints: 0.0 Name: No points20.0 pts
Both parts are completely correct.One of the parts is completely correct, the other has small mistakes.One of the parts is partially correct; or, both are not completely correct but follow the correct approach.Both parts are incorrect and the approach to solve them is incorrect.
Total:81 pts
Order Now

Get expert help for SIMILAR ITEMS AND DATA STREAMS and many more. 24X7 help, plag free solution. Order online now!

Universal Assignment (February 12, 2026) MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS. Retrieved from https://universalassignment.com/mapreduce-similar-items-and-data-streams/.
"MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS." Universal Assignment - February 12, 2026, https://universalassignment.com/mapreduce-similar-items-and-data-streams/
Universal Assignment March 12, 2023 MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS., viewed February 12, 2026,<https://universalassignment.com/mapreduce-similar-items-and-data-streams/>
Universal Assignment - MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS. [Internet]. [Accessed February 12, 2026]. Available from: https://universalassignment.com/mapreduce-similar-items-and-data-streams/
"MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS." Universal Assignment - Accessed February 12, 2026. https://universalassignment.com/mapreduce-similar-items-and-data-streams/
"MAPREDUCE, SIMILAR ITEMS AND DATA STREAMS." Universal Assignment [Online]. Available: https://universalassignment.com/mapreduce-similar-items-and-data-streams/. [Accessed: February 12, 2026]

Please note along with our service, we will provide you with the following deliverables:

Please do not hesitate to put forward any queries regarding the service provision.

We look forward to having you on board with us.

Most Frequent Questions & Answers

Universal Assignment Services is the best place to get help in your all kind of assignment help. We have 172+ experts available, who can help you to get HD+ grades. We also provide Free Plag report, Free Revisions,Best Price in the industry guaranteed.

We provide all kinds of assignmednt help, Report writing, Essay Writing, Dissertations, Thesis writing, Research Proposal, Research Report, Home work help, Question Answers help, Case studies, mathematical and Statistical tasks, Website development, Android application, Resume/CV writing, SOP(Statement of Purpose) Writing, Blog/Article, Poster making and so on.

We are available round the clock, 24X7, 365 days. You can appach us to our Whatsapp number +1 (613)778 8542 or email to info@universalassignment.com . We provide Free revision policy, if you need and revisions to be done on the task, we will do the same for you as soon as possible.

We provide services mainly to all major institutes and Universities in Australia, Canada, China, Malaysia, India, South Africa, New Zealand, Singapore, the United Arab Emirates, the United Kingdom, and the United States.

We provide lucrative discounts from 28% to 70% as per the wordcount, Technicality, Deadline and the number of your previous assignments done with us.

After your assignment request our team will check and update you the best suitable service for you alongwith the charges for the task. After confirmation and payment team will start the work and provide the task as per the deadline.

Yes, we will provide Plagirism free task and a free turnitin report along with the task without any extra cost.

No, if the main requirement is same, you don’t have to pay any additional amount. But it there is a additional requirement, then you have to pay the balance amount in order to get the revised solution.

The Fees are as minimum as $10 per page(1 page=250 words) and in case of a big task, we provide huge discounts.

We accept all the major Credit and Debit Cards for the payment. We do accept Paypal also.

Popular Assignments

LCRM301 Researching Criminology

LCRM301 Researching CriminologyTopic: Introduction to theoryand the research processand undertaking acriminological literaturereviewAcknowledgement ofcountryPART ONE: Preparingcriminological research(Weeks 1-5)In the first part of the unit, we focus on preparing criminologicalresearch and decision-making processes. This includes deciding onthe topic you want to focus on, the kinds of methods to use and thetypes of

Read More »

LCRM301 Researching criminology

LCRM301 Researching criminologyWorksheet 1This worksheet will be disseminated to students in Week 3 and will assist them in the planning and development of the second assessment task: literature review. PART 1: Refining your topicThe topic I am interested in is: I am interested in this topic because: This is an

Read More »

LCRM301 Researching Criminology

LCRM301 Researching CriminologyTopic: Planningcriminological research andformulating effectiveresearch questionsAcknowledgement ofcountryLecture structure Example?How do police perceptions of their own organization influence their views of the public,specifically their trust of people in the areas they patrol?How do women with insecure migration status experience and seek help for family violenceand what are their experiences

Read More »

ASSESSMENT NO.2 – COURT APPLICATION FACT PATTERN

FACULTY OF LAW AND BUSINESSThomas More Law SchoolLAWS201: CIVIL PROCEDURE & ADRSEMESTER TWO, 2025 – NATIONAL –ASSESSMENT NO.2 – COURT APPLICATION FACT PATTERN(AND THE FACT PATTERN FOR THE PLEADINGS WORKSHOP IN WEEK 5)BackgroundRachel Richardson is a printer by trade although she currently works in art design fora fashion magazine based

Read More »

Assessment 2: Court Application

02/09/2025, 20:3430 Points PossibleAssessment 2: Court ApplicationAssessment 2: Court ApplicationDetailsAssessment task 2: Court Application (worth 30% of the final total mark for this unit)Add commentA simulated court application will be run in weeks 6 and 7 of semester on selected topics from the first five weeks of the unit. Your

Read More »

APPENDIX B: ASSESSMENT TASK 2 – COURT APPLICATION

APPENDIX B: ASSESSMENT TASK 2 – COURT APPLICATION (30% OF FINAL MARK)General informationThis Assessment task is worth 30 marks of your final mark.The task is either making (Applicant) or opposing (Respondent) an application before theSupreme Court in your respective state based on a fact scenario, which will be uploaded on

Read More »

Assessment task Assessment 1 – ASSIGNMENT

Assessment Task 1 (30% of the final mark)This Week30 Points PossibleIn ProgressNEXT UP: Submit assignmentUnlimited Attempts Allowed11/08/2025Attempt 1 Add commentDetails Assessment task Assessment 1 – ASSIGNMENTPurpose To give students the opportunity to produce a well-written piece of formal analysis on a topic inland law. Graduate capabilities GC1,3,7-11 are covered by

Read More »

Assessment Brief- Assessment 3- Map-Reduce Programming Challenge

Assessment Brief- Assessment 3- Map-Reduce Programming Challenge Unit Code/Description ICT313 Big Data for Software DevelopmentCourse/Subject Bachelor of Information TechnologySemester S1 – 2025Unit Learning OutcomesAddressedULO3: Critically assess and implement advanced data pre-processing andanalytics strategies in a software development context, focusing on tasks like datacleansing, transformation, and feature selection.ULO4: Design, develop, and

Read More »

Assessment 2 Infographic and Reflection

Assessment 2 Infographic and Reflection Part 1 Infographic and Part 2 Reflection using the Gibbs CycleThis assessment is worth 35% of your final grade (Infographic 18% + Reflection 17%). Assessment InstructionsPart 1: InfographicCreate an A3 size infographic (portrait layout).Submit as a .pdfTOPIC:Create your infographic as a poster (A3 paper size)

Read More »

CHCCSM013 Facilitate and review case management

ContentsAbout this document 4Student details section 4The community services work environment 5Workplace hours 6Responsibilities 6Evidence collection 7Section 1: Determine response for case management 9Section 2: Conduct case management meetings and develop case management plan 18Section 3: Monitor and review case work activities and processes 24Section 4: Supervisor report 28 About

Read More »

BMM6302 Entrepreneurship and Creativity Assignment help

Assignment Brief – Cohort 07 BMM6302 Entrepreneurship and Creativity  Component number Assignment 1 Assignment type 01 Business Idea (Video Presentation – Panopto) Learning outcomes for this assessment (Please see module Handbook for all learning outcomes) Compare and contrast a variety of strategic models to develop and critically evaluate business start-up

Read More »

BMM6582: eBusiness and eMarketing 02 Analytical Review Assignment Help

Assignment Brief BMM6582: eBusiness and eMarketing Component number Assignment 2 Assignment type 02 Analytical Review Learning outcomes for this assessment (Please see module Handbook for all learning outcomes) Compare, contrast, and analyse approaches used to evaluate a company’s current positioning in the traditional and electronic marketplaces. Appraise various options for

Read More »

BMM6582: eBusiness and eMarketing Assignment 2 Assignment Help

Assignment Brief BMM6582: eBusiness and eMarketing Component number Assignment 2 Assignment type 02 Analytical Review Learning outcomes for this assessment (Please see module Handbook for all learning outcomes) Compare, contrast, and analyse approaches used to evaluate a company’s current positioning in the traditional and electronic marketplaces. Appraise various options for

Read More »

BMM6582: eBusiness and eMarketing Assignment Help

Assignment Brief BMM6582: eBusiness and eMarketing Component number Assignment 1 Assignment type 01 Video presentation (Panopto) Learning outcomes for this assessment (Please see module Handbook for all learning outcomes) Compare, contrast, and analyse approaches used to evaluate a company’s current positioning in the traditional and electronic marketplaces. Appraise various options

Read More »

Business and Management Strategy Assignment Help

Business and Management Strategy: Assignment Guidelines and Suggested Structure (3,000 Words) 1. Introduction (approx. 250–300 words) 2. Industry Analysis and Competitive Environment (400–450 words) 3. Competition and Customer Requirements (550–600 words) 4. Resources and Strategy Implementation (500–550 words) 5. Strategic Capabilities & Sustainable Advantage (400–450 words) 6. Development of Resources

Read More »

BMM6452 – Professional Learning Through Work Assignment Help

Assignment Brief  BMM6452 – Professional Learning Through Work Assignment type  Final Assessment (Assessment 3)-To be Negotiated Learning outcomes   (Please see module Handbook for all learning outcomes)  By the end of this module, the student should be able to:  Demonstrate enhanced graduate employability skills, knowledge, behaviours and attitudes developed through working

Read More »

CSE2/4DBF (2025) – Assignment 1

CSE2/4DBF (2025) – Assignment 1 Page 1/3 CSE2/4DBF 2025Assignment 1 (20%)Due date: this week AIMS AND OBJECTIVES:✓ to represent a problem description given in natural language as an (Enhanced) EntityRelationship modelThis is an individual Assignment. You are not permitted to work as a group when writingthis assignment. The use of

Read More »

Can I Pay Someone to Do My Assignment? A Complete Student Guide

If you’re a student juggling multiple deadlines, exams, part-time work, or personal responsibilities, you may have found yourself asking: can I pay someone to do my assignment? You’re not alone. This is one of the most searched academic questions today, especially among university and college students worldwide. In this blog,

Read More »

Project Development and Analysis in Emerging Technologies

Assessment Brief- Assessment 2 Unit Code/Description ICT305 – Topics in IT Course/Subject BIT Semester 2024- S1 Unit Learning Outcomes Addressed ULO 1, 2, and 3. Assessment Objective The primary objective of this assessment is to provide students with hands-on experience in designing, implementing, and analysing a project in one of

Read More »

EDUC1006 Interdisciplinary Studies: Crossing the line

ASSESSMENT 2: Report Summary Title Assessment 2 Type Report Due Date Thursday 17 April, 11.59 pm (end of Week 6) Length 1500 words or equivalent Weighting 50% Academic Integrity The use of GenAI is allowed but limited for this assessment task. Submission Word document or PDF submitted to Turnitin Unit

Read More »

Writing in Community Development

Assessment Overview Overview Length or Duration Worth Due This essay should demonstrate a coherent argument, which is backed up by evidence from relevant journal articles, books and websites. You are expected to make two direct quotations only; and the rest should be paraphrases. You should also list at least eight sources.   If you are unsure of

Read More »

Counselling Theory and Practice in Schools

Assignment 1 Requirements Word limit 2500 words; excluding references Referencing You’re required to follow APA Academic Integrity Please refer to the Guidelines Task Purpose 🎯 This assessment task is designed to develop and assess students’ critical thinking and reflective skills, essential for counselling professionals in educational contexts. By engaging in a literature

Read More »

PSY1040 Cultural Responsiveness Self-Assessment

PSY1040 Cultural Responsiveness Self-Assessment The below self-assessment tool has been adapted from the following resource: Bennett, B., & Morse, C. (2023). The Continuous Improvement Cultural Responsiveness Tools (CICRT): Creating more culturally responsive social workers. Australian Social Work, 76(3), 315–329. Bennett’s collection of Cultural Responsiveness Self-Assessment Tools is designed for social workers

Read More »

TEAC7094 Assessment 2 Report: Analysis of a Student Work Sample

TEAC7094 Assessment 2 Report: Analysis of a Student Work SampleRemember to include a completed Cover Sheet for this task. CONTEXT PROBLEM AND SOLUTION (approx. 600 – 800 words) RECOMMENDATIONS (approx. 400 words) CONCLUSION REFERENCES Appendix One: Annotated and coded interview transcript from working with the child Appendix Two: Annotated and

Read More »

Psychological Data Analysis Report

Written Assignment This page outlines the major written assignment and the steps involved in preparing for submission. This assignment will allow you to develop essential skills in analysing and interpreting a data set to address a psychological issue and report the results in APA style. Note that separate documents are

Read More »

Principles of Economics

Principles of Economics Short-answer Assignment V1 (20% of final mark) The assignment consists of four questions.  You should allocate at least half a page (or 250 words) to each answer or 1000 words for all four answers depending on the nature of and/or marks allocated for the question/s. You may

Read More »

MRTY 5134 Laboratory Report Assignment

MRTY 5134 Laboratory Report Assignment Semester 1 2025Due 18th May 2025Answer TemplateEnter your name and student number below.Name:SID:Use this document to record your answers to the tasks described in the laboratoryreport assignment. When completed submit this document for marking via theassignment portal in Canvas.Things to note:

Read More »

Mind Map – Personal Philosophy

Mind Map – Personal Philosophy Assessment 2  Assessment Overview Overview Length or Duration Worth Due Part A – Annotated mind-map (equivalent to 350 words). Part B – 350 word personal reflection about your history, identity and values and link it with concepts explored in the unit. Part A – 350 words equivalent

Read More »

Can't Find Your Assignment?