Question Paper: Data Warehouse & Mining Question Paper - May 17 - Computer Engineering (Semester 8) - Mumbai University (MU)
0

Data Warehouse & Mining - May 17

MU Computer Engineering (Semester 8)

Total marks: --
Total time: --
INSTRUCTIONS
(1) Assume appropriate data and state your reasons
(2) Marks are given to the right of every question
(3) Draw neat diagrams wherever necessary

1(A) What is dimensional modeling? Design the data warehouse for wholesale furniture Company. The data warehouse has to allow analyzing the company's situation at least with respect to the Furniture , Customer and Time. More ever, the company needs to analyses: The furniture with respect to its type category and material. The customer with respect to their spatial location, by considering at least cities, regions and states. The company is interested in learning the quantity, income and discount of its sales.
10 marks 12784

1(B) Discuss the different steps involved in Data Pre-processing.
10 marks 5927

2(A) The college wants to record the Marks for the courses completed by students using the dimensions : i) Course, ii) Student, iii) Time & measure Aggregate marks Create a Cube and describe following OLAP operations.
i)Slice ii) Dice iii) Roll up iv) Drill down v) Pivot
10 marks 12785

2(B) Apply the Naive Bayes classifier algorithm for buys computer classification and classify the tuple = X(age "young". Income = "medium", student = "yes" and credit - rating = "fair")
 ID Age Income Student Credit-rating buys computer 1 young high no fair no 2 young high no good no 3 middle high no fair yes 4 old medium no fair yes 5 old medium no fair yes 6 old low yes good no 7 middle low yes good yes 8 young medium no fair yes 9 young low yes fair yes 10 old medium yes fair yes 11 young medium yes fair yes 12 middle medium no good yes 13 middle high yes fair yes 14 old medium no good no
10 marks 12786

3(A) Explain ETL of data warehousing in details?
10 marks 5912

3(B) Explain types of attributes and data visualization for data exploration
10 marks 12787

4(A) Illustrate the architecture of Data Warehouse system. Differentiate Data warehouse and Data Mart.
10 marks 5390

4(B) Explain K-means clustering algorithm? Apply K-Means algorithm for the following Data Set = { 15, 15, 16, 19, 20, 21, 22, 28, 35, 40, 41, 42, 43, 44, 60, 61, 65}
10 marks 12788

5(A) Explain Updates to dimensions table in detail.
10 marks 5967

5(B) A database has ten transactions. Let minmum support = 30% and minimum Cofidence = 70%
i) Find all frequent patterns using AprioriAlgorithm.
ii) List strong association rules.
 Transaction_Id Items 01 A, B, C, D 02 A, B, C, D, E, G 03 A, C, G, H, K 04 B,C, D, E, K 05 D, E, F, H, L 06 A, B, C, D, L 07 B, I, E, K, L 08 A, B, D, E, K 09 A, E, F, H, L 010 B, C, D, F
10 marks 12789

Write short note any four question from Q.6(a, b, c, d, e,)

6(a) Major-issues in Data Mining
10 marks 5920

6(b) Metadata in Data Warehouse
10 marks 5392

6(c) FP Tree
10 marks 5965

6(d) DBSCAN
10 marks 5942

6(e) Hierarchical Clustering
10 marks 5938