Design Amazon's sales rank by category feature
Created by: myaser
We'll use a multi-step MapReduce:
Step 1 - Transform the data to (category, product_id), sum(quantity)
Step 2 - Perform a distributed sort
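The two steps above can be sketched in plain Python as a single-process simulation. This is only an illustration under assumptions: the record layout (category, product_id, quantity), the sample data, and every function name here are hypothetical, and a local sort stands in for the shuffle-and-sort that a real MapReduce framework would distribute across machines.

```python
from collections import defaultdict

# Hypothetical input records: (category, product_id, quantity).
SALES = [
    ("books", "b1", 2),
    ("books", "b2", 5),
    ("books", "b1", 4),
    ("toys", "t1", 1),
    ("toys", "t2", 3),
    ("toys", "t1", 1),
]


def step1_map(record):
    """Emit ((category, product_id), quantity) for one sales record."""
    category, product_id, quantity = record
    yield (category, product_id), quantity


def step1_reduce(key, quantities):
    """Sum all quantities seen for one (category, product_id) key."""
    yield key, sum(quantities)


def step2_map(key, total):
    """Re-key so that sorting by key orders rows by category, then by
    total descending (the total is negated to get descending order)."""
    category, product_id = key
    yield (category, -total), product_id


def run_pipeline(records):
    # Step 1: group by (category, product_id) and sum the quantities.
    groups = defaultdict(list)
    for record in records:
        for key, quantity in step1_map(record):
            groups[key].append(quantity)
    totals = [
        pair
        for key, values in groups.items()
        for pair in step1_reduce(key, values)
    ]

    # Step 2: sort by the composite key. A local sort is used here as a
    # stand-in for the framework's distributed sort.
    keyed = [pair for key, total in totals for pair in step2_map(key, total)]
    keyed.sort(key=lambda kv: kv[0])

    return [
        (category, product_id, -neg_total)
        for (category, neg_total), product_id in keyed
    ]


if __name__ == "__main__":
    for row in run_pipeline(SALES):
        print(row)
```

Negating the total inside the sort key is one simple way to get "highest sales first" out of an ascending sort; a real job could instead configure a descending comparator.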
Why perform a distributed sort if the results are then stored in a SQL database? SQL tables don't preserve order, so a query still needs an ORDER BY clause to return rows ranked.
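To make the concern concrete, here is a minimal sketch using Python's built-in sqlite3 module with an in-memory database. The table name sales_rank, its columns, and the helper ranked_products are assumptions made for illustration, not part of the original design. Rows are inserted in arbitrary order, and the ranking is recovered at read time with ORDER BY.

```python
import sqlite3


def ranked_products(rows, category):
    """Store (category, product_id, total_sold) rows, then read one
    category back in rank order. Names here are hypothetical."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "CREATE TABLE sales_rank ("
            "category TEXT, product_id TEXT, total_sold INTEGER)"
        )
        # Insertion order carries no guarantee for later reads.
        conn.executemany("INSERT INTO sales_rank VALUES (?, ?, ?)", rows)
        # An index on (category, total_sold) can help the ordered lookup.
        conn.execute(
            "CREATE INDEX idx_category_total "
            "ON sales_rank (category, total_sold)"
        )
        # The ranking comes from ORDER BY, not from how rows were stored.
        cursor = conn.execute(
            "SELECT product_id, total_sold FROM sales_rank "
            "WHERE category = ? ORDER BY total_sold DESC",
            (category,),
        )
        return cursor.fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    unsorted_rows = [
        ("books", "b2", 5),
        ("toys", "t1", 2),
        ("books", "b1", 6),
    ]
    print(ranked_products(unsorted_rows, "books"))
```

Whether the upstream sort is still worth doing when the database can order rows at query time is exactly the question this card raises; the sketch only shows the read-side half.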