Scrape a website & insert into database & perform some tasks with the information

$250-750 CAD

Closed
Posted about 8 years ago

Paid on delivery
I need someone to write software that archives every listing posted on a particular website and uses that information as described in the Features section of this post.

Basic logic of program:
1. Send a request to a website that returns listings in XML format.
2. Check each listing against a MySQL database.
3. Send a web request to each new listing individually to get all the information.
4. Run features 1, 2 and 3 (explained in detail below).
5. Upload images from the listings to Amazon S3.
6. Add the information for each listing to a MySQL database.
7. Sleep before looping back to step 1 (see feature 4).

Limitations:
The website returns at most 20 listings per request (step 1). If every listing on a page is new, keep requesting the next page of listings until previously archived listings are found, so that no listings are missed. (During peak times it is possible for more than 20 listings to be posted within the minimum sleep period of 2 minutes.)

Features:
1. Create a table that tracks listings that are from the same user (identified by two values found in the listing). Keep a tally of how many listings that user has posted and a tally of how many of those listings are unique. (I suggest this is done on a separate thread so as not to slow down the scraping.)
2. If enabled, check each new listing's price against comparable listings on another website (web request to an API), and calculate the average value for comparable listings using the archive of listings in my database. Use some math calculations to decide if the listing is undervalued by a configurable amount/percent and send an alert (Amazon SNS and database entry). (This must be done on a separate thread so as not to slow down the scraping.)
3. Check each listing against search criteria, which can be configured by adding rows of criteria to a MySQL database, and send an alert (Amazon SNS and database entry) if a new listing satisfies those criteria. (These will be simple criteria, such as whether the listing's price is >100, or whether the listing is a specific model, etc.) (This must be done on a separate thread so as not to slow down the scraping.)
4. Adjust the sleep time automatically so as to minimize the number of pages requested before finding previous listings (explained in Limitations). The sleep time before looping has a minimum of 2 minutes, a maximum of 15 minutes from 7AM-11PM, and a maximum of 2 hours from 11PM-7AM.
5. Once daily, check each active listing in the database against the website to see if the listing has been updated or deleted. If it has been updated, save the changes to the database as a new row. If it has been deleted, change the status in the database so the listing will not be checked again. (I suggest this be a separate script run by a cron job.)

Requirements:
1. Must run on a Linux server.
2. Error handling (website down, website responds with unexpected data, etc.).
3. Log activity/errors in a text file. Send an alert if errors occur (Amazon SNS and entry into database).

The program can be coded in any language that can run on a Linux VPS and take advantage of the multiple IP addresses the server has. PHP would be preferred.
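The paging rule from the limitations and the sleep bounds from feature 4 could be sketched roughly as follows. This is only an illustration in Python, not the client's preferred PHP: the names `collect_new_listings`, `next_sleep`, `fetch_page` and `is_known` are hypothetical, the caller is assumed to supply the real XML request and MySQL lookup, and the halve/grow rule is one possible heuristic that the post does not specify.

```python
from datetime import datetime, time

MIN_SLEEP = 2 * 60             # minimum sleep: 2 minutes
MAX_SLEEP_DAY = 15 * 60        # maximum from 7AM to 11PM: 15 minutes
MAX_SLEEP_NIGHT = 2 * 60 * 60  # maximum from 11PM to 7AM: 2 hours


def collect_new_listings(fetch_page, is_known, max_pages=50):
    """Request pages until one contains an already archived listing.

    fetch_page(n) is assumed to return the listing IDs on page n (newest
    first, at most 20); is_known(id) is assumed to check the database.
    Returns (new_ids, pages_fetched).
    """
    new_ids, seen, pages = [], set(), 0
    for page in range(1, max_pages + 1):
        ids = fetch_page(page)
        pages += 1
        fresh = [i for i in ids if not is_known(i)]
        for i in fresh:
            if i not in seen:  # listings can shift between pages while paging
                seen.add(i)
                new_ids.append(i)
        # Stop on an empty page or once a page holds a known listing.
        if not ids or len(fresh) < len(ids):
            break
    return new_ids, pages


def max_sleep_at(now):
    """Upper bound on the sleep time for the given local time."""
    is_day = time(7, 0) <= now.time() < time(23, 0)
    return MAX_SLEEP_DAY if is_day else MAX_SLEEP_NIGHT


def next_sleep(current, pages_fetched, now):
    """Assumed heuristic: halve the sleep if more than one page was needed,
    otherwise grow it by 25%, then clamp to the bounds from feature 4."""
    proposed = current // 2 if pages_fetched > 1 else int(current * 1.25)
    return max(MIN_SLEEP, min(proposed, max_sleep_at(now)))
```

In a real loop, the second value returned by `collect_new_listings` would be passed to `next_sleep` together with `datetime.now()` before sleeping, so that busy periods shorten the interval and quiet periods lengthen it.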
Project ID: 10165405

About the project

26 proposals
Remote project
Active 8 yrs ago

26 freelancers are bidding on average $631 CAD for this job
Hi sir, I am a scraping expert and I have done many similar projects; please check my feedback and you will see. Can you tell me more details? Then I will provide demo data for you. Thanks, Kimi
$699 CAD in 6 days
5.0 (495 reviews)
8.1
Hi. I am an expert in developing automated tools and scraping scripts, and you will find tons of similar bots with the requirements you need here (IP switching, user-agent switching, storing images to Amazon S3 buckets, etc.). Please send me more details regarding the work. Thank you
$526 CAD in 10 days
5.0 (243 reviews)
8.2
I'm one of the best web scraping experts here; that's why I'm sure you'll be impressed with my work. I can create such a scraper in less than 5 days and I can offer you the best price here. You have a pretty good project description, so all I need to start is the website URL to check everything. Thanks. Roman
$631 CAD in 5 days
4.9 (655 reviews)
8.0
Hi, I work towards providing reliable, relevant and robust IT solutions at the most competitive prices to my customers. I ensure 100% customer satisfaction, so let's start. Thanks
$670 CAD in 18 days
4.8 (471 reviews)
8.3
Hello, what is the website?
$722 CAD in 10 days
5.0 (448 reviews)
7.6
Experienced TEAM HERE to work on your project. Let's discuss more and finalize the project and cost. We have our own workplace with 7+ working PCs and laptops, with ~15 Mbps internet connections. We also have a backup internet connection with 10 Mbps speed. And we never outsource the projects that we have taken from our clients :-) Feel free to ask me questions, if any. I look forward to working with you. You can also contact me through Skype. Have a good day and stay fine :-) Sincere regards, Jubair eSolutions
$800 CAD in 15 days
4.9 (342 reviews)
7.9
Hello Sir, I am from vSol CORP (TEAM: 19 employees). I have checked your project requirements. If you would like to have a look at some sample data, then please let us know. Kindly interview us to review our competencies and hire us for the best results. Thanks
$250 CAD in 4 days
4.8 (261 reviews)
7.8
Dear Sir, I'm delighted to let you know that I have done data scraping with PHP-cURL, PhantomJS, Node.js and Selenium from many sites. I scraped the data from each website and then wrote it to a MySQL database or an Excel, CSV or XML file. I have worked on many similar projects and have extensive experience in data mining projects. I have written hundreds of web scrapers which scrape millions of pages each day. I'm ready to fulfill your requirements. I can finish this task in a short time, with the best quality. I can assure 100% accuracy. Please give me the opportunity to do the work. With Kind Regards, Debdulal Roy Proshanta
$400 CAD in 10 days
4.9 (97 reviews)
7.5
Hope you are doing great. I am interested to provide you my services. I have more than 5 years experience in providing professional website development services and worked with almost every type of project. So this is an easy one for me as well. What I need is to finalize clear requirements (that play a big role in success of every project) and that needs your few minutes to discuss over the project with me. Let me know your availability and let’s get this project done!
$1,111 CAD in 14 days
5.0 (67 reviews)
6.8
Amazon MWS API Experts. NOTE: Most of your project scope has already been completed by us and we have a demo for you as well. We are Amazon MWS API experts and have completed many projects using its API. I have ready-to-use APIs for: 1) Amazon Orders of seller 2) Amazon Product API 3) Amazon Price API 4) Amazon Repricer 5) Amazon SES API 6) Amazon SQS API 7) Amazon Product Advertising API. I have done many complex projects based on the Amazon MWS API and I am sure your project would be very easy for me. I have demos ready, so ping me and I can share the demo links with you.
$773 CAD in 20 days
4.8 (54 reviews)
7.1
Hello! I'm a web scraping expert and I can do your project. I use the Python language and the Scrapy framework. My scripts work on Windows, Mac or Linux, but Linux is preferable. I can schedule scripts on a server if required. I have more than 200 finished projects (Google scraping, Facebook scraping, Yellow Pages, LinkedIn, Amazon, webshops and other sites with lists of any items). I can scrape secured and protected sites; if a site blocks my IP I can use a proxy or TOR, and I can also try to avoid captchas. I can export data into JSON, XML, CSV (Excel), or any database (MySQL, MongoDB, MSSQL, etc). Message me if you have any questions!
$599 CAD in 10 days
4.9 (107 reviews)
6.6
Hello, I am very interested. I am very familiar with this kind of job and have done many of them. I have long-term relationships with some clients for this kind of job and have done almost a thousand of them. Looking forward to working with you. Thanks and all the best, Steve
$600 CAD in 10 days
4.8 (33 reviews)
6.1
I'm Povilas and I would be perfect for your task. I specialize in data mining and I can provide you with top-quality web crawlers. I have experience in this field, from easy data scraping to complex scraping using Selenium. My most recent project was for an African market price comparison website: creating ~30 e-commerce website crawlers that scrape data each week, looking for new products and, after finding them, storing them in a database for future use. On this project I encountered and successfully solved a lot of difficult challenges, gaining priceless experience. Now, for your project, as a professional I strongly recommend using Python instead of PHP. Python has many performance and other advantages over PHP. That is why it is so loved by big data scientists and data analysts. If in the future you want to add more functionality for manipulating your data, it will be more scalable. My last client and I shared the very same view, and that is why we completed a successful project together. Now I think that YOU also have the same view as we did. Go to my profile and under my name check one phrase that reflects that view. I have some ideas and some questions about your project, so please contact me now and we can discuss them. Regards, Povilas Kudriavcevas
$666 CAD in 14 days
5.0 (15 reviews)
5.2
Hi. I have a lot of experience in scraping. I will do this project in Java (with the advantage of multithreading). Can we talk in detail about this project? Hope to work with you. Thanks, Tale
$777 CAD in 10 days
4.9 (25 reviews)
5.0
Dear Sir, I am happy to be here to contact you. Sometimes a chance may change a whole life, and this may be that great chance of success for your business and life. "No pains, no gains", but with me here you can gain success with no pain, as I have rich and profound experience in web development. I have worked on domestic and global development with WordPress and CodeIgniter for 4 years. So your project is of no great difficulty for me; it is well within my skill and experience. High quality, short development time, and fast and flexible communication will also give you enjoyment in business. Life is beautiful, so let's make business beautiful. Best regards!
$555 CAD in 10 days
4.3 (4 reviews)
4.7
Hi there, I am a Toronto-based developer with a good amount of experience in this area, and I have worked on many successful projects. I would like to know more about your scraping project. Let's discuss it further.
$611 CAD in 7 days
4.9 (5 reviews)
2.9
Hi, I will provide a script written in PHP. If you want to hire me, contact me via PM. Thanks in advance. Regards.
$666 CAD in 7 days
5.0 (3 reviews)
1.4
I have gone through your requirements. We have done a similar kind of job before. Looking forward to your earliest reply for a project discussion.
$555 CAD in 10 days
0.0 (0 reviews)
0.0

About the client

Regina, Canada
5.0
3
Payment method verified
Member since Mar 31, 2015

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)