WMR Exemplar: Flixster network data

Jeffrey Lyman (edited by Libby Shoop), Macalester College

Summary

This module teaches students how to analyze networks and datasets distributed over multiple files using the Hadoop framework with the WebMapReduce (WMR) interface. It assumes that students are already familiar with the basics of Hadoop and CSInParallel's WMR interface.
The exercises in this module use a network of friendships on the social movie recommendation site Flixster. Students will use it to learn how to analyze networks and chain map-reduce jobs. The module provides example code to try, followed by suggestions for further analyses that students can perform.


Learning Goals

Given an example of a map-reduce computation that computes a clustering coefficient, students will be able to complete their own analyses of the data to answer different questions.
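To give a feel for the kind of computation involved, here is a minimal sketch of a local clustering coefficient computed with three chained map-reduce passes. It is not the module's own code and does not use the WMR API: it is plain Python that simulates map, group-by-key, and reduce in memory. It assumes the input is a list of undirected friendship edges given as pairs of user IDs with no self-loops, and it uses the common definition C(v) = (links among v's neighbors) / (k(k-1)/2) for a node of degree k. All function names (run_job, edge_mapper, and so on) are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from itertools import combinations


def run_job(records, mapper, reducer):
    """Simulate one map-reduce pass in memory: map, group by key, reduce."""
    groups = defaultdict(list)
    for key, value in records:
        for k, v in mapper(key, value):
            groups[k].append(v)
    output = []
    for k in sorted(groups):
        output.extend(reducer(k, groups[k]))
    return output


# Job 1: turn an edge list into (node, sorted neighbor list) records.
def edge_mapper(a, b):
    yield a, b
    yield b, a


def adjacency_reducer(node, neighbors):
    yield node, sorted(set(neighbors))


# Job 2: find pairs of a node's neighbors that are themselves friends.
EDGE = "EDGE"  # marker meaning "this pair is a real edge" (assumes no user ID equals it)


def wedge_mapper(node, neighbors):
    for n in neighbors:
        if node < n:  # emit each undirected edge once, smaller ID first
            yield (node, n), EDGE
    for u, w in combinations(neighbors, 2):  # u < w because neighbors is sorted
        yield (u, w), node  # node is the center of the wedge u - node - w


def triangle_reducer(pair, values):
    if EDGE in values:  # the wedge is closed by an actual edge
        for center in values:
            if center != EDGE:
                yield center, 1


# Job 3: count the closed wedges per center node.
def identity_mapper(node, one):
    yield node, one


def sum_reducer(node, ones):
    yield node, sum(ones)


def clustering_coefficients(edges):
    """Chain the three jobs and return {node: local clustering coefficient}."""
    adjacency = run_job(edges, edge_mapper, adjacency_reducer)
    closed = run_job(adjacency, wedge_mapper, triangle_reducer)
    links = dict(run_job(closed, identity_mapper, sum_reducer))
    result = {}
    for node, neighbors in adjacency:
        k = len(neighbors)
        possible = k * (k - 1) / 2
        result[node] = links.get(node, 0) / possible if possible else 0.0
    return result


if __name__ == "__main__":
    sample = [(1, 2), (1, 3), (2, 3), (3, 4)]
    for node, coeff in sorted(clustering_coefficients(sample).items()):
        print(node, round(coeff, 3))
```

Each pass consumes the output of the previous one, which is the job-chaining pattern in miniature. On a real cluster each pass would run as a separate job, and the final division would need the node degrees carried along as part of the data.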

Context for Use

This example can be used after students have gained some familiarity with the map-reduce concept and have used WebMapReduce (WMR), such as completing the introductory WMR module from this collection.

Description and Teaching Materials

You can visit the module in your browser:
Hadoop Network Analysis

Or you can download the module in PDF, LaTeX, or Word format.
PDF Format: HadoopNetworkAnalysis.pdf.
LaTeX Format: latex.tar.gz.
Word Format: HadoopNetworkAnalysis.docx.

Teaching Notes and Tips


This example involves some math: students work with a computation of a clustering coefficient for the network.

Assessment

Assessment instrument not available.

References and Resources

Please see the WebMapReduce home page for how to obtain and install WMR.

