This is a project for music recommendations with Spark.
- Accelerated parallel breadth-first search with Spark 10x faster than that with MapReduce in the self-deployed cluster after compressing the dataset with a 2% ratio by Apache Avro.
- Employed the PageRank algorithm within ego-nets to generate divers.