Data Warehousing and Data Mining 2074

Question Paper Details
Tribhuvan University
Institute of Science and Technology
2074
Bachelor Level / Seventh Semester / Science
Computer Science and Information Technology (CSC410)
(Data Warehousing and Data Mining)
Full Marks: 60
Pass Marks: 24
Time: 3 hours
Candidates are required to give their answers in their own words as far as practicable.
The figures in the margin indicate full marks.

                                    Group - A

Attempt any two Questions                     (10 x 2 = 20)

1.  You are given the transaction data shown below from a fast food restaurant. There are 9 distinct transactions (order 1 to order 9), involving a total of 5 meal items (M1 to M5).

Order       List of item IDs (meal items)
order 1     M1, M2, M5
order 2     M2, M4
order 3     M2, M3
order 4     M1, M2, M4
order 5     M1, M3
order 6     M2, M3
order 7     M1, M3
order 8     M1, M2, M3, M5
order 9     M1, M2, M3

Minimum support = 2, minimum confidence = 0.7

Apply the Apriori algorithm to the database to identify the frequent k-itemsets and find all strong association rules.

10 marks
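
For checking a hand-worked answer to Question 1, the level-wise procedure can be sketched in Python as below. This is a minimal illustration and not an official solution: it assumes the minimum support of 2 is an absolute transaction count and reads the minimum confidence as 0.7. The names `support_count`, `apriori`, and `strong_rules` are hypothetical helpers introduced only for this sketch.

```python
from itertools import combinations

# The nine orders from Question 1, each as a set of meal item IDs.
transactions = [
    {"M1", "M2", "M5"}, {"M2", "M4"}, {"M2", "M3"},
    {"M1", "M2", "M4"}, {"M1", "M3"}, {"M2", "M3"},
    {"M1", "M3"}, {"M1", "M2", "M3", "M5"}, {"M1", "M2", "M3"},
]
MIN_SUPPORT = 2        # assumption: absolute count, not a fraction
MIN_CONFIDENCE = 0.7   # assumption: "0,7" in the paper means 0.7


def support_count(itemset):
    """Number of transactions that contain every item of itemset."""
    return sum(1 for t in transactions if itemset <= t)


def apriori():
    """Return {frequent itemset: support count}, built level by level."""
    items = {i for t in transactions for i in t}
    current = {frozenset([i]) for i in items
               if support_count(frozenset([i])) >= MIN_SUPPORT}
    frequent = {}
    k = 1
    while current:
        for s in current:
            frequent[s] = support_count(s)
        k += 1
        # Join step: merge frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a in current for b in current
                      if len(a | b) == k}
        # Prune step: every (k-1)-subset must be frequent, then count support.
        current = {c for c in candidates
                   if all(frozenset(sub) in frequent
                          for sub in combinations(c, k - 1))
                   and support_count(c) >= MIN_SUPPORT}
    return frequent


def strong_rules(frequent):
    """Return (antecedent, consequent, confidence) for every strong rule."""
    rules = []
    for itemset, count in frequent.items():
        for r in range(1, len(itemset)):
            for lhs in combinations(sorted(itemset), r):
                lhs = frozenset(lhs)
                confidence = count / frequent[lhs]
                if confidence >= MIN_CONFIDENCE:
                    rules.append((lhs, itemset - lhs, confidence))
    return rules


frequent = apriori()
rules = strong_rules(frequent)
for lhs, rhs, confidence in rules:
    print(sorted(lhs), "->", sorted(rhs), round(confidence, 2))
```

Under these assumptions the sketch recomputes each support count by rescanning all nine orders, which is acceptable at this size; a larger database would normally call for a single counting pass per level.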

2.  Why do we need to preprocess data before running a data mining algorithm? What are the steps involved in preprocessing? Explain. Give some examples of noise that must be removed from the data when extracting patterns.

10 marks

3.  List the two steps used in the classification approach, along with their issues. Is it always the right decision to use a neural network as a classifier? Give your opinion. Discuss the working mechanism of the backpropagation classification algorithm.

10 marks

                                     Group - B

Attempt any eight Questions                     (8 x 5 = 40)

4. List and describe the five primitives for specifying a data mining task.

5 marks

5.  Describe the types of data used in data mining.

5 marks

6.  Explain the similarities and dissimilarities between an operational database and a data warehouse.

5 marks

7.  List the types of OLAP operations with examples.

5 marks

8.  Illustrate the strengths and weaknesses of the k-means algorithm in comparison with the k-medoids algorithm.

5 marks

9.  Why is data cube computation an essential task in data mining? Describe the general strategies for data cube computation.

5 marks

10.  Describe the different components of a data warehouse.

5 marks

11.  Define dimension table and fact table. Why is a multidimensional data model necessary?

5 marks

12.  Discuss the approach behind Bayesian classification. Why is a smoothing technique necessary in Bayesian classification?

5 marks

13.  Write short notes on (any two):

            a)  Concept hierarchy

            b)  Data Mining Query Language

            c)  Text mining

            d)  ROLAP vs MOLAP

5 marks