using apachespark, Download large data set and define problem and provide a solution - 01/05/2018 17:24 EDT
$30-250 USD
Paid on delivery
Search online to download reasonably large dataset. Define your own problem based on the dataset and provide a solution to it with your knowledge of Apache PySpark platform. You may obtain some idea for defining your own problem by referring to research papers. Include the reference in this case.
Project ID: #16854033
About the project
2 freelancers are bidding on average $211 for this job
I am skilled in Hadoop ecosystem components -- Hadoop, MapReduce, Hive, Spark, Kafka, Flume, Sqoop, Oozie and experience in Python and Java. I developed real-time spark streaming applications for different clients.