Information Retrieval 2074

Question Paper Details
Tribhuwan University
Institute of Science and Technology
2074
Bachelor Level / Seventh Semester / Science
Computer Science and Information Technology ( CSC413 )
( Information Retrieval )
Full Marks: 60
Pass Marks: 24
Time: 3 hours
Candidates are required to give their answers in their own words as far as practicable.
The figures in the margin indicate full marks.

Attempt any ten questions.

Official Answer
AI Generated Answer

AI is thinking...

1. How IR in web search is different from other IR systems? Discuss IR architecture with suitable example.

2+4 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

2. Assume that document space is defined by four terms: Network, CSIT, Nepal, TU and Graduate. And we have three documents containing the following terms:

Doc1: CSIT Nepal

Doc2: TU CSIT

Doc3: CSIT TU Nepal

If the query is "CSIT NEPAL", find top 2 documents retrieved by Boolean space model

6 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

3. What is meant by stop word removal? Explain text normalization with suitable example. 

1+5 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

4. Suppose the table given below lists all the documents retrieved by an algorithm. If total number of relevant documents is 6, calculate the value of recall, precision, and F-score.


sn
Doc ID
relevant
1
D1no
2D2no
3D3yes
4D4no
5D5yes
6D6yes
7D7no
8D8no
9D9yes


6 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

5. Why query expansion is important? Discuss query expansion techniques with examples.

1+5 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

6. Why Hits algorithm is used? Discuss its working with example.

2+4 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

7. How Bots are different from spiders? Describe simple and multithreded spidering algorithm.

1+5 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

8. How text categorization is different from clustering? Explain nearest neighbor categorization algorithm.

1+5 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

9. Differentiate collaborative filtering from content based filtering? Discuss content based recommender system with its strengths and drawbacks

2+4 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

10. Why TF-IDF weighting is important in information retrieval? Explain with suitable example.

6 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

11. How information extraction differs from information retrieval? Discuss role of XML in information extraction.

6 marks
Details
Official Answer
AI Generated Answer

AI is thinking...