Using Apache Spark: download a large dataset, define a problem, and provide a solution - 01/05/2018 17:24 EDT

Cancelled Posted 5 years ago Paid on delivery

Search online for a reasonably large dataset and download it. Define your own problem based on the dataset and solve it using your knowledge of Apache Spark (PySpark). You may get ideas for defining the problem from research papers; if you do, include the reference.

Apache Hadoop Python Scala

Project ID: #16854033

About the project

2 proposals, Remote project, Active 5 years ago

2 freelancers are bidding on average $211 for this job

madhavsagar09

I am skilled in Hadoop ecosystem components (Hadoop, MapReduce, Hive, Spark, Kafka, Flume, Sqoop, Oozie) and have experience in Python and Java. I have developed real-time Spark Streaming applications for different clients.

$222 USD in 5 days
(2 Reviews)
3.4