An Example Data Analysis Task Using Pig

The goal is to find the top 10 most-visited pages in each category.

Given data:

The Data Flow Graph for the task is shown below:

Pig Latin code for the task:

Last updated

Was this helpful?