Information retrieval

Information retrieval

Take-home exam

Course: Information retrieval 1, 7,5 hp (C3LIR1)

Publication date: 2022-10-18

Please note that the exam consists of 13 questions (with subquestions). The maximum number of points is 45. To obtain one of the grades A–E, you need to get at least the following number of points: grade E ≥ 31 points, grade D ≥ 34 points, grade C ≥ 37 points, grade B ≥ 40 points, and grade A ≥ 43 points. This exam is to be performed individually and should be submitted in Canvas at the latest by 2022-10-30.

  1. Many queries used on the web consist of a few search terms (maybe only one term) and lack operators (e.g., Boolean operators). This makes it in many ways problematic for a search en- gine to retrieve relevant information for the user. Please characterize, using different concepts from the IR theory, two of the problems that the search engine is faced with when dealing with such short, operator-less queries, and indicate for each problem a method or approach that can be used to handle that problem in the design and construction of the search engine. [4 p]
  2. Assume that we, in an IR system based on the Boolean model, formulate a query q with the following structure:

q = renewable AND (wind OR (NOT oil))

  • With regard to the presence / absence of the three query terms – which documents will be retrieved as (system) relevant? [2 p]
    • Which one of the three terms should the system first examine (with regard to the possible presence of the term in the document) to ascertain as quickly as possible whether a given document should be retrieved or not? Why this term in particular? [2 p]
  • Term weighting. Let k1 and k2 be two terms (if you want to, you can replace these variables with two concrete terms in the answer), D a document collection, and d a document in D. Assume that the term k1 occurs 4 times in the document d and that the term k2 also occurs 4 times in d. Finally, assume that the term k1 occurs in 10 documents in D and that the term k2 term is found in 15 documents in D. Use reasoning * to find out whether the term weight of k1 in d will be greater than, less than or equal to the term weight of k2 in d, given that the following IR models and/or term weighting schemes are used:
    • the vector space model with tf-idf weighting. [1 p]
    • the classical probabilistic model (binary independence model). [1 p]

*You should not have to use a calculator to obtain the answer.

  • A commonly used similarity measure in the vector space model (VSM) is the cosine measure. Please explain the general reasoning behind the design and use of this measure. You do not need to use any mathematical formulas in your answer – it is sufficient to explain with words only. [2 p]
  • A model of how users go about when searching for information is called the berrypicking model (formulated by Marcia Bates). Briefly explain this model and how it differs from the traditional understanding of the search process. [2 p]
  • One of the basic components of every IR model is a ranking function sim that in the context of a query q assigns to each document a score that indicates the ”similarity” between the document and the query, according to their representations in the specific IR model. We say that two documents are separable in a given IR model if it is possible to formulate at least one query that results in different values of the ranking function sim for the two documents. Consider the (short) documents:

d1 = to go or not to go

and

d2 = to go or not

Are these documents separable in

  • the Boolean model, [2 p]
    • the vector space model with tf-idf weighting and the cosine similarity measure? [2 p] Please substantiate your answers.
  • Lexical analysis and term weighting.
  • What is meant by lexical analysis? [1 p]
    • Present a disadvantage of using tf weighting instead of tf-idf weighting for document representation? [1 p]
    • What is meant by document length normalization and why may it be important to perform when generating document representations? [2 p]
  • Text processing.
  • Briefly present a potential advantage of using stemming in an IR system. [1 p]
    • Also indicate a potential disadvantage of using stemming in an IR system. [1 p]
    • What is the Levenshtein distance between the terms string and song? [1 p]
  • Relevance feedback.
  • What is meant by relevance feedback and how does this work in principle? [2 p]
    • Explain how relevance feedback works, by exemplifying with the Rocchio method or the classical probabilistic model. Please note that it is sufficient that you explain only one of these methods, and that you do not need to use mathematical formalism in your answer (though it may facilitate your presentation). [2 p]
    • Compare relevance feedback with query expansion. What similarities and differences regarding purpose and approach can you find? [2 p]
    • What is the difference between local and global analysis in the context of implicit feed- back? [2 p]
  • We perform a search in a reference collection on a topic with 12 known relevant documents. The current IR system is based on the vector space model. The returned documents are rel- evance assessed up until DCV = 20 (where DCV, i.e. document cutoff value is equal to the position up until which the evaluation measurements are calculated), whereby the following list is obtained. We let R represent a relevant document and 0 a non-relevant document.

R R 0 R 0 0 R R 0 0 R 0 0 0 0 R 0 0 0 0

For the search result above and DCV = 20, please calculate

  • recall [1 p]
    • precision [1 p]
    • R-precision [1 p]
    • the F-measure (F1) [1 p]
  • Evaluation.
  • Why is recall an inappropriate measure when evaluating web searches? [1 p]
    • A common observation when evaluating search results is that the recall tends to be higher for larger DCV values (higher document positions), while the precision tends to decrease for larger DCV values. Present an explanation of this phenomenon based on the defini- tions of recall and precision. [2 p]
  • Compare the algorithm HITS and PageRank for similarities and differences in how they rank web pages. [2 p]
  • For this exam question we use the simple definition of PageRank, which is formulated as fol- lows. Let w and v be web pages. By the notation ∀v : v ↣  w we denote ”for all pages v such that v links to page w”. We also let r(w) denote the PageRank value for page w and let g(v) denote the number of links from page v. PageRank is then defined:
∑  

r(w) =

v:v w

r(v)

g(v)

Consider the link structure in the figure below. The nodes (circles) represent web pages and the arrows represent hypertext links. Assume that the value of r(A), that is, the PageRank value for page A, is known and happens to be independent of the structure in the figure. Furthermore, assume that the PageRank values can be fully calculated based on the structure in the figure. Which of the following statements is correct and how did you come up with the answer?

  • r(B) > r(C) [r(B) is greater than r(C)].
    • r(B) < r(C) [r(B) is less than r(C)].
    • r(B) = r(C) [r(B) equals r(C)].

The answer is not dependent on the exact value of r(A), but you can assume, for example, that

r(A) = 18.0. [3 p]

Order Now

Get expert help for Information retrieval and many more. 24X7 help, plag-free solution. Order online now!

Universal Assignment (September 8, 2024) Information retrieval. Retrieved from https://universalassignment.com/information-retrieval/.
"Information retrieval." Universal Assignment - September 8, 2024, https://universalassignment.com/information-retrieval/
Universal Assignment December 5, 2022 Information retrieval., viewed September 8, 2024,<https://universalassignment.com/information-retrieval/>
Universal Assignment - Information retrieval. [Internet]. [Accessed September 8, 2024]. Available from: https://universalassignment.com/information-retrieval/
"Information retrieval." Universal Assignment - Accessed September 8, 2024. https://universalassignment.com/information-retrieval/
"Information retrieval." Universal Assignment [Online]. Available: https://universalassignment.com/information-retrieval/. [Accessed: September 8, 2024]

Please note along with our service, we will provide you with the following deliverables:

Please do not hesitate to put forward any queries regarding the service provision.

We look forward to having you on board with us.

Categories

Get 90%* Discount on Assignment Help

Most Frequent Questions & Answers

Universal Assignment Services is the best place to get help in your all kind of assignment help. We have 172+ experts available, who can help you to get HD+ grades. We also provide Free Plag report, Free Revisions,Best Price in the industry guaranteed.

We provide all kinds of assignmednt help, Report writing, Essay Writing, Dissertations, Thesis writing, Research Proposal, Research Report, Home work help, Question Answers help, Case studies, mathematical and Statistical tasks, Website development, Android application, Resume/CV writing, SOP(Statement of Purpose) Writing, Blog/Article, Poster making and so on.

We are available round the clock, 24X7, 365 days. You can appach us to our Whatsapp number +1 (613)778 8542 or email to info@universalassignment.com . We provide Free revision policy, if you need and revisions to be done on the task, we will do the same for you as soon as possible.

We provide services mainly to all major institutes and Universities in Australia, Canada, China, Malaysia, India, South Africa, New Zealand, Singapore, the United Arab Emirates, the United Kingdom, and the United States.

We provide lucrative discounts from 28% to 70% as per the wordcount, Technicality, Deadline and the number of your previous assignments done with us.

After your assignment request our team will check and update you the best suitable service for you alongwith the charges for the task. After confirmation and payment team will start the work and provide the task as per the deadline.

Yes, we will provide Plagirism free task and a free turnitin report along with the task without any extra cost.

No, if the main requirement is same, you don’t have to pay any additional amount. But it there is a additional requirement, then you have to pay the balance amount in order to get the revised solution.

The Fees are as minimum as $10 per page(1 page=250 words) and in case of a big task, we provide huge discounts.

We accept all the major Credit and Debit Cards for the payment. We do accept Paypal also.

Popular Assignments

FPC006 Taxation for Financial Planning

Assignment 2 Instructions Assignment marks: 95 | Referencing and presentation: 5 Total marks: 100 Total word limit: 3,000 words Weighting: 40% Download and use the Assignment 2 Answer Template provided in KapLearn to complete your assignment. Your assignment should be loaded into KapLearn by 11.30 pm AEST/AEDT on the wdue

Read More »

TCHR5001 Assessment Brief 1

TCHR5001 Assessment Brief 1 Assessment Details Item Assessment 1: Pitch your pedagogy Type Digital Presentation (Recorded) Due Monday, 16th September 2024, 11:59 pm AEST (start of Week 4) Group type Individual Length 10 minutes (equivalent to 1500 words) Weight 50% Gen AI use Permitted, restrictions apply Aligned ULOS ULO1, ULO2,

Read More »

HSH725 Assessment Task 2

turquoise By changing the Heading 3 above with the following teal, turquoise, orange or pink you can change the colour theme of your CloudFirst CloudDeakin template page. When this page is published the Heading 3 above will be removed, but it will still be here in edit mode if you wish to change the colour theme.

Read More »

Evidence in Health Assessment 2: Evidence Selection

Evidence in Health Assessment 2: Evidence Selection Student name:                                                                    Student ID: Section 1: PICO and search strategy Evidence Question: Insert evidence question from chosen scenario here including all key PICO terms.       PICO Search Terms                                                                                                                                                                                                          Complete the following table.   Subject headings Keywords Synonyms Population  

Read More »

Assessment 1 – Lesson Plan and annotation

ASSESSMENT TASK INFORMATION: XNB390 Assessment 1 – Lesson Plan and annotation This document provides you with information about the requirements for your assessment. Detailed instructions and resources are included for completing the task. The Criterion Reference Assessment (CRA) Marking Matrix that XNB390 markers will use to grade the assessment task

Read More »

XNB390 Task 1 – Professional Lesson Plan

XNB390 Template for Task 1 – Professional Lesson Plan CONTEXT FOR LESSON: SOCIAL JUSTICE CONSIDERATIONS: Equity Diversity Supportive Environment UNIT TITLE:    TERM WEEK DAY TIME 1   5           YEAR/CLASS STUDENT NUMBERS/CONTEXT LOCATION LESSON DURATION         28 Children (chl): 16 boys; 12

Read More »

A2 Critical Review Assignment

YouthSolutions Summary The summary should summarise the key points of the critical review. It should state the aims/purpose of the program and give an overview of the program or strategy you have chosen. This should be 200 words – included in the word count. Critical analysis and evaluation Your critical

Read More »

PUN364 – Workplace activity Assignment

Assessment 1 – DetailsOverviewFor those of you attending the on-campus workshop, you will prepare a report on the simulated simulated inspection below. For those of you who are not attending, you will be required to carry out your own food business inspection under the supervision of a suitably qualified Environmental

Read More »

FPC006 Taxation for Financial Planning

Assignment 1 Instructions Assignment marks: 95 | Referencing and presentation: 5 Total marks: 100 Total word limit: 3,600 words Weighting: 40% Download and use the Assignment 1 Answer Template provided in KapLearn to complete your assignment. Your assignment should be loaded into KapLearn by 11.30 pm AEST/AEDT on the due

Read More »

Mental health Nursing assignment

Due Aug 31 This is based on a Mental health Nursing assignment Used Microsoft word The family genogram is a useful tool for the assessment of individuals, couples, and families.  It can yield significant data and lead to important, new patient understandings and insights as multigenerational patterns take shape and

Read More »

Assessment 2: Research and Policy Review

Length: 2000 words +/- 10% (excluding references)For this assessment, you must choose eight sources (academic readings and policy documents) as the basis of your Research and Policy Review. You must choose your set of sources from the ‘REFERENCES MENU’ on the moodle site, noting the minimum number of sources required

Read More »

HSN702 – Lifespan Nutrition

Assessment Task: 2 Assignment title: Population Nutrition Report and Reflection Assignment task type: Written report, reflection, and short oral presentation Task details The primary focus of this assignment is on population nutrition. Nutritionists play an important role in promoting population health through optimal nutritional intake. You will be asked to

Read More »

Written Assessment 1: Case Study

Billy a 32-year-old male was admitted to the intensive care unit (ICU) with a suspected overdose of tricyclic antidepressants. He is obese (weight 160kg, height 172cm) and has a history of depression and chronic back pain for which he takes oxycodone. On admission to the emergency department, Paramedics were maintaining

Read More »

Assessment Task 8 – Plan and prepare to assess competence

Assessment Task 8 – Plan and prepare to assess competence Assessment Task 8 consists of the following sections: Section 1:      Short answer questions Section 2:      Analyse an assessment tool Section 3:      Determine reasonable adjustment and customisation of assessment process Section 4:      Develop an assessment plan Student Instructions To complete this

Read More »

Nutrition Reviews Assignment 2 – Part A and Part B

This assignment provides you with the opportunity to determine an important research question that is crucial to address based on your reading of one of the two systematic reviews below (Part A). You will then develop a research proposal outlining the study design and methodology needed to answer that question

Read More »

NUR332 – TASK 3 – WRITTEN ASSIGNMENT

NUR332 – TASK 3 – WRITTEN ASSIGNMENT for S2 2024. DESCRIPTION (For this Task 3, the word ‘Indigenous Australians’, refers to the Aboriginal and Torres Strait Islander Peoples of Australia) NUR332 Task 3 – Written Assignment – Due – WEEK 12 – via CANVAS on Wednesday, Midday (1200hrs) 16/10/2024. The

Read More »

NUR100 Task 3 – Case study

NUR100 Task 3 – Case study To identify a key child health issue and discuss this issue in the Australian context. You will demonstrate understanding of contemporary families in Australia. You will discuss the role of the family and reflect on how the family can influence the overall health outcomes

Read More »

NUR 100 Task 2 Health Promotion Poster

NUR 100 Task 2 Health Promotion Poster The weighting for this assessment is 40%. Task instructions You are not permitted to use generative AI tools in this task. Use of AI in this task constitutes student misconduct and is considered contract cheating. This assessment requires you to develop scholarship and

Read More »

BMS 291 Pathophysiology and Pharmacology CASE STUDY

BMS 291 Pathophysiology and Pharmacology CASE STUDY Assessment No: 1 Weighting: 40% Due date Part A: midnight Friday 2nd August 2024 Due date Part B: midnight Sunday 29th September 2024 General information In this assessment, you will develop your skills for analysing, integrating and presenting information for effective evidence-based communication.

Read More »

Assessment Task: Health service delivery

Assessment Task Health service delivery is inherently unpredictable. This unpredictability can arise from, for example, the assortment of patient presentations, environmental factors, changing technologies, shifts in health policy and changes in division leadership. It can also arise from changes in policy within an organisation and/or associated health services that impact

Read More »

LNDN08002 Business Cultures Resit Assessment

LNDN08002 Business Cultures Resit Assessment Briefing 2023–2024 (Resit for Term 1) Contents Before starting this resit, please: 1 Assessment Element 1: Individual Report 1 Case Report Marking Criteria. 3 Assessment Element 2: Continuing Personal Development (CPD) 4 Guidance for Assessment 2: Reflection and Reflective Practice. 5 Student Marking Criteria –

Read More »

Assessment Task 2 – NAPLAN Exercise

Assessment Task 2 (35%) – Evaluation and discussion of test items Assessment Task 2 (35%) – Evaluation and discussion of test items AITSL Standards: This assessmeAITSL Standards: This assessment provides the opportunity to develop evidence that demonstrates these Standards: 1.2        Understand how students learn 1.5        Differentiate teaching to meet with

Read More »

EBY014 Degree Tutor Group 2 Assignment

  Assignment Brief Module Degree Tutor Group 2 Module Code EBY014 Programme BA (Hons) Business and Management with   Foundation Year Academic Year 2024/2025 Issue Date 6th May 2024 Semester Component Magnitude Weighting Deadline Learning outcomes assessed 2 1 2000 words Capstone Assessment 100% 26th July, 2024 1/2/3/4 Module Curriculum

Read More »

NTW 600 Computer Network and Security

Assessment 2 Information and Rubric Subject Code  NTW 600 Subject Name Computer Network and Security Assessment Number and Title Assessment 2: Cyber Security Threats to IT Infrastructure of a real-world Organisation Assessment Type Group Assessment Length / Duration  1500 words Weighting %  30% Project Report: 20% Presentation :10% (Recorded) Total

Read More »

LAW500 Business Law Assessment 2 – Group Project

Assessment Information and Rubric Subject Code LAW500 Subject Name Business Law Assessment Number and Title Assessment 2 – Group Project Assessment Type Group Length / Duration 3000 words maximum, no ±10%, and excluding references Weighting % 30% Total Marks 100 Submission Online Submission via TurnitIn for the written report Due

Read More »

Population Nutrition Case Study Analysis

HSN702 – Lifespan Nutrition Assessment Task: 1 Assignment title: Population Nutrition Case Study Analysis Assignment task type: Short Written Report and Literature Search Strategy Task details The primary focus of this assignment is on population nutrition. Nutritionists play an important role in promoting population health through optimal nutritional intake. In

Read More »

Applied Quantitative Economics Assignment

Goldsmiths College, University of London Applied Quantitative Economics Project ** You must attempt only one project, and you must complete it either in R or in Excel ** General Background Key Stage 4 (KS4) is a legal term for the last two years of secondary school education in England leading

Read More »

Can't Find Your Assignment?

Open chat
1
Free Assistance
Universal Assignment
Hello 👋
How can we help you?