Big Data Analytics Question Paper - May 18 - Computer Engineering (Semester 7) - Mumbai University (MU)

## Big Data Analytics - May 18

### Computer Engineering (Semester 7)

Total marks: 80

Total time: 3 Hours

INSTRUCTIONS

(1) Question 1 is compulsory.

(2) Attempt any **three** from the remaining questions.

(3) Draw neat diagrams wherever necessary.

**1.a** Explain the role and effect of the damping factor (teleportation) in PageRank computation.

**1.b** Agility is a NoSQL business driver. Justify.

**1.c** Give the updating-buckets approach of the DGIM algorithm.

**1.d** Find the cosine distance between the vectors d1 and d2:

Index | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
---|---|---|---|---|---|---|---|---|---|---|
d1 | 5 | 2 | 1 | 0 | 0 | 0 | 0 | 1 | 3 | 7 |
d2 | 5 | 2 | 1 | 0 | 0 | 1 | 2 | 2 | 0 | 2 |
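The arithmetic for 1.d can be checked with a minimal Python sketch. It assumes that "cosine distance" means the angle between the two vectors, which is one common textbook definition; some sources instead use 1 minus the cosine similarity, so the sketch prints both. The function name `cosine_similarity` is illustrative and not part of the question paper.

```python
import math

# Vectors copied from the table in question 1.d
d1 = [5, 2, 1, 0, 0, 0, 0, 1, 3, 7]
d2 = [5, 2, 1, 0, 0, 1, 2, 2, 0, 2]


def cosine_similarity(u, v):
    """Return the dot product of u and v divided by the product of their lengths."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


similarity = cosine_similarity(d1, d2)

# Assumption: cosine distance taken as the angle between the vectors
angle_degrees = math.degrees(math.acos(similarity))

# Alternative convention used by some sources
one_minus_cos = 1 - similarity

print(f"cosine similarity = {similarity:.4f}")
print(f"angle (degrees)   = {angle_degrees:.2f}")
print(f"1 - cosine        = {one_minus_cos:.4f}")
```

By hand, the dot product is 46 and the squared lengths are 89 and 43, so the cosine is 46 / sqrt(89 × 43), about 0.744, which corresponds to an angle of roughly 42 degrees.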

**2.a** List the different NoSQL data stores. Explain any two with diagrams.

**2.b** Write the steps of the Girvan-Newman algorithm. Explain clustering of social-network graphs using the GN algorithm with an example.

**3.a** Explain the Flajolet-Martin algorithm with an example.

**3.b** Distinguish between the following:

i) DBMS and DSMS ii) PCY, Multistage and Multihash

**4.a** List the relational-algebra operations. Explain any two using MapReduce.

**4.b** Compute Efficient PageRank with the damping factor d=0.8 for web.

**5.a** What are the different recommender systems? Explain any one with an example.

**5.b** Define Hub and Authority. Compute Hub and Authority scores for web.

**6 Answer the following:**

**6.a** Core Hadoop Components

**6.b** CURE Algorithm

**6.c** SON Algorithm and MapReduce

**6.d** Matrix-Vector Multiplication by MapReduce
