Web Crawler with simple interface

Completed Posted Feb 18, 2004 Paid on delivery
Completed Paid on delivery

Need a program that will crawl our local online classified listings and put them in a database. Also need a web-based interface to work with the records. This DOES NOT need to be written from scratch, I have a copy of this program nearly completed already that you can work with, it only needs a few bugs fixed and a few small features added. The crawler needs to get each of the rental categories listed at [url removed, login to view] and download each record within each category and insert the advertiser,ad and date into mysql database. The script will run nightly in crontab, so the script must not insert duplcate records. Upon completion of the script, i'd like a daily summary & error report emailed to me. You can see the current version of the interface at [url removed, login to view] The main user-interface shows the list of categories, for each category it shows the count of records, it should also show how many were most recently inserted. Each category is a link to the list of records in that category. The list of records is already grouped and ordered by phone number, the phone number and number of bedrooms are set to bold. Each group of phone numbers has a button to clear the ad. Since we cannot download duplicate records, the ads may need to be marked 'inactive' rather than deleted, when the 'Clear' button is pressed for a given phone number, and it will need to reload the same page. Ads older than a few days should be flagged in the list. Lastly, each of these phone numbers needs to be compared against phone numbers in an existing database and flagged accordingly. ALL code must be thoroughly well commented.

## Deliverables

1) Complete and fully-functional working program(s) in executable form as well as complete, well-documented source code of all work done. 2) Installation package that will install the software (in ready-to-run condition) on the platform(s) specified in this bid request. 3) Complete ownership and distribution copyrights to all work purchased.

## Platform

Linux 2.4, RedHat 9, PHP Version 4.3.1, MySQL Version 3.23.58.

Engineering Linux MySQL PHP Software Architecture Software Testing

Project ID: #3104368

About the project

2 proposals Remote project Active Feb 20, 2004

Awarded to:

marcodvw

See private message.

$21.25 USD in 7 days
(21 Reviews)
4.3

2 freelancers are bidding on average $53 for this job

superpedro

See private message.

$85 USD in 7 days
(6 Reviews)
3.7