Create Web Scraping Program To Gather Data From 9 Diamond Me
$30-250 USD
Paid on delivery
I am an economics professor who is interested in hiring someone to write a program to scrape diamond data from 9 different diamond sites. I'm doing a follow-up to an academic study I did earlier ([login to view URL]). For each of the sites below, the program would extract the data on diamond prices and characteristics into either an excel spreadsheet, a text file, or a comma delimited file. The program would gather data for all diamond shapes (round, radiant, princess, pear, etc.). Note that different diamond characteristics are available based on the diamond's shape, so the program would need to take that into account.
The 9 sites are:
[login to view URL]
variables: shape,carats,color,clarity,report,cut,price,diamond id (which is obtained from the URL when you click "view")
[login to view URL]
variables: id,size,color,clarity,cut,wire price,price,polish/symmetry,certificate
[login to view URL]
variables: shape,ID No.,carat,color,clarity,depth,table,cut grade,report,price
[login to view URL]
variables: shape,carat,color,clarity,cut,certificate,price
[login to view URL]
variables: shape,carat weight,color,clarity,lab,inscribed,depth,table,fluor,pol/sym,cut,price,stock id,measurements
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
etc.
variables: shape,carat,color,clarity,cut,polish/symmetry,certificate,price,product id
[login to view URL]
variables: shape,carat,cut,color,clarity,report,polish,symmetry,price,stock number,measurements,depth,table
[login to view URL];filter_id=0
variables: shape,carat,cut,color,clarity,polish/symmetry,report,price,stock number
[login to view URL];productGroupID=loose%5Fdiamonds
variables: shape,carat,cut,color,clarity,price
I've written web scraping programs myself in the past (using Lencom's Visual Web Task), but I'm hoping to get someone to write a more efficient program for gathering the data. I've set this listing to run for 15 days, but I'm willing to award this sooner if someone gives a very competitive proposal.
Project ID: #473404