Data Warehousing and Data Mining Model

Question Paper Details
Tribhuwan University
Institute of Science and Technology
Model
Bachelor Level / Seventh Semester / Science
Computer Science and Information Technology ( CSC410 )
( Data Warehousing and Data Mining )
Full Marks: 60
Pass Marks: 24
Time: 3 hours
Candidates are required to give their answers in their own words as far as practicable.
The figures in the margin indicate full marks.

Group-A

Official Answer
AI Generated Answer

AI is thinking...

Long Answer Questions (Attempt any Two questions) [2x10=20]

Official Answer
AI Generated Answer

AI is thinking...

1. Suppose that a data warehouse for Big University consists of the following four dimensions: student, course, semester, and instructor, and two measures count and avg-grade. When at the lowest conceptual level (e.g., for a given student, course, semester, and instructor combination), the avg-grade measure stores the actual course grade of the student. At higher conceptual levels, avg-grade stores the average grade for the given combination.

a) Draw a snowflake schema diagram for the data warehouse.

b) Starting with the base cuboid [student, course, semester, instructor], what specific OLAP operations (e.g., roll-up from semester to year) should one perform in order to list the average grade of CS courses for each Big University Student.

c) If each dimension has five levels (including all), such as “student < major < status < university < all”, how many cuboids will this cube contain (including the base and apex cuboids)?

10 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

2. A= {A1, A2, A3, A4, A5, A6}, Assume σ = 35%. Use Apriori algorithm to get the desired solution.


A1A2A3A4A5A6
000111
011100
100111
110100
101011
011101
000110
010101
100100
111111


10 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

3. What kind of data preprocessing do we need before applying data mining algorithm to any data set. Explain minning method to handle noisy data with example.

10 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

Group- B

Official Answer
AI Generated Answer

AI is thinking...

Short Answer Questions (Attempt any Eight questions) [8x5=40]

Question number 13 is compulsory.

Official Answer
AI Generated Answer

AI is thinking...

4. Explain the use of frequent item set generation process.

5 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

5. Differentiate between data marts and data cubes.

5 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

6. Explain OLAP operations with example?

5 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

7. List the drawbacks of ID3 algorithm with over-fitting and its remedy techniques

5 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

8. Write the algorithm for K-means clustering. Compare it with k-nearest neighbor algorithm.

5 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

9. What is text mining? Explain the text indexing techniques.

5 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

10. Describe genetic algorithm using as problem solving technique in data mining.

5 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

11. What do you mean by WWW mining? Explain WWW mining techniques.

5 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

12. What is DMQL? How do you define Star Schema using DMQL?

5 marks
Details
Official Answer
AI Generated Answer

AI is thinking...

13. Write short notes (Any Two)

a) Text Database Mining

b) Back propagation Algorithm

c) Regression

d) HOLAP

5 marks
Details
Official Answer
AI Generated Answer

AI is thinking...