I require a web-scraping application that does not require additional dependencies (i.e., nothing other than the base Java/JVM/JRE install).
*** READ ENTIRE PROJECT REQUIREMENT BEFORE BIDDING ***
Scraper must log to an Excel document (see sample in files section).
Sample URL for testing:
[login to view URL]|&bedrooms=0&bathrooms=1&Accessible=False&pictures=False&pets=False&ac=False&AgeRestricted=False&smoking=False&coveredParking=False&MaxSqFt=5000&MinSqFt=0&keyword=&sortBy=LastUpdate
Scraper requirements:
- GUI must match design of sample in files section (Scraper [login to view URL])
- accept direct input of URL to begin scraping
- allow adding/editing/deleting of saved URLs
- - a saved URL item will contain a title and the URL. Only the title will be displayed in the GUI list
- clicking an entry in the list will load the URL into the URL text box above the list
- select an output file / option (... button) to type or browse to target file location
- - browse dialog must filter for .xlsx by default
- - scraper must log data into an excel document
- - if file exists, data will be appended to the existing file
- - - set all entries in the "Active" column to "No"
- - if file does not exist, scraper must create the target file
- - - First row must contain the following headings: Active, Last Active, Landlord Name, Phone, Contacted, Notes
- "Scrape" button will begin scraping the URL in the URL text box.
- - MUST use the text in the box, as it might be manually edited before running
- connect to website - determine # of listings/pages returned
- visit each listing's page
- save each landlord's name and telephone number (on the right side of the page)
- - if excel file selected earlier exists, scan the file to see if landlord information is already in the file (excluding '
- - if already in the file
- - - update record's "Active" field to "Yes"
- - - update record's "Last Active" field to current date (YYYY/MM/DD format)
- - - go to next property
- - if not already in file
- - - add a record to the file using field structure below
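The append/update rules in the list above can be sketched in plain Java with no external libraries. This is only a minimal in-memory sketch: the class `LandlordLedger`, its `Row` type, and the method names are hypothetical, reading and writing the actual .xlsx file is left out, and matching on name plus phone is an assumption because the matching rule in the posting is cut off. It also assumes a new record starts with Active set to "Yes" and Last Active set to the run date.

```java
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical in-memory model of the spreadsheet rows (not the .xlsx I/O itself).
class LandlordLedger {
    static final DateTimeFormatter DATE = DateTimeFormatter.ofPattern("yyyy/MM/dd");

    static class Row {
        String active, lastActive, name, phone, contacted, notes;

        Row(String active, String lastActive, String name,
            String phone, String contacted, String notes) {
            this.active = active;
            this.lastActive = lastActive;
            this.name = name;
            this.phone = phone;
            this.contacted = contacted;
            this.notes = notes;
        }
    }

    // Insertion order is kept so rows come back out in file order.
    private final Map<String, Row> rows = new LinkedHashMap<>();

    // Assumption: a landlord is identified by case-insensitive name plus phone.
    static String key(String name, String phone) {
        return name.trim().toLowerCase(Locale.ROOT) + "|" + phone.trim();
    }

    // Register a row that was read from an existing file.
    void load(Row row) {
        rows.put(key(row.name, row.phone), row);
    }

    // Existing file: set every "Active" entry to "No" before scraping.
    void beginRun() {
        for (Row r : rows.values()) {
            r.active = "No";
        }
    }

    // Returns true when a new row was added, false when an existing one was updated.
    boolean record(String name, String phone, LocalDate today) {
        Row existing = rows.get(key(name, phone));
        if (existing != null) {
            existing.active = "Yes";
            existing.lastActive = today.format(DATE);
            return false; // Contacted and Notes are left untouched
        }
        rows.put(key(name, phone),
                 new Row("Yes", today.format(DATE), name, phone, "No", ""));
        return true;
    }

    Row find(String name, String phone) {
        return rows.get(key(name, phone));
    }

    int size() {
        return rows.size();
    }

    public static void main(String[] args) {
        LandlordLedger ledger = new LandlordLedger();
        ledger.load(new Row("Yes", "2023/12/01", "Jane Doe", "555-123-4567", "Yes", "left voicemail"));
        ledger.beginRun();
        LocalDate today = LocalDate.now();
        ledger.record("Jane Doe", "555-123-4567", today);  // existing: updated
        ledger.record("John Roe", "555-987-6543", today);  // new: appended
        for (Row r : ledger.rows.values()) {
            System.out.println(String.join(", ", r.active, r.lastActive, r.name,
                                           r.phone, r.contacted, r.notes));
        }
    }
}
```

Keeping the row logic separate from the file format means the rules can be checked without touching Excel at all; the sample names and numbers in `main` are made up for illustration.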
Scraper must go through each page of returned results to get all data. Links for each page are at the bottom of the page. Visual progress should be displayed as scraper runs, and stored in a log file named "[login to view URL]" representing the date and time the scraper was executed. See "console and log [login to view URL]" for example of what both should look like.
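As a rough illustration of the logging requirement, the sketch below echoes each progress message to the console and to a log file named after the run's start time. The exact file name pattern is not shown in this posting, so the `scrape_yyyy-MM-dd_HH-mm-ss.log` pattern here is purely a placeholder assumption, as are the class name `RunLog` and its methods.

```java
import java.io.IOException;
import java.io.PrintWriter;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;

// Hypothetical helper: writes every message to both the console and a log file.
class RunLog implements AutoCloseable {
    // Placeholder pattern; the real log file name is not visible in the posting.
    static final DateTimeFormatter STAMP =
            DateTimeFormatter.ofPattern("yyyy-MM-dd_HH-mm-ss");

    static String fileNameFor(LocalDateTime startedAt) {
        return "scrape_" + startedAt.format(STAMP) + ".log";
    }

    private final PrintWriter file;

    RunLog(Path directory, LocalDateTime startedAt) throws IOException {
        Path target = directory.resolve(fileNameFor(startedAt));
        file = new PrintWriter(Files.newBufferedWriter(target, StandardCharsets.UTF_8));
    }

    void info(String message) {
        System.out.println(message); // visual progress
        file.println(message);       // same line stored in the log
        file.flush();                // keep the file current if the run is interrupted
    }

    @Override
    public void close() {
        file.close();
    }

    public static void main(String[] args) throws IOException {
        try (RunLog log = new RunLog(Paths.get("."), LocalDateTime.now())) {
            log.info("Page 1 of 3: 20 listings found"); // made-up sample message
        }
    }
}
```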
Fields:
- Active (Yes/No)
- Last Active (YYYY/MM/DD format)
- Landlord Name (as it appears)
- Phone (###-###-#### format, no ( ) around first digits)
- Contacted ("No" by default for all new contacts; do not alter for existing contacts)
- Notes (leave blank, do not edit)
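The two formatted fields above (Phone and Last Active) could be produced along these lines. This is a sketch only: the class `FieldFormat` and its methods are hypothetical, and it assumes 10-digit North American numbers, optionally prefixed with a leading 1, since the posting does not say how phone numbers appear on the listing pages.

```java
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;

// Hypothetical formatting helpers for the Phone and Last Active columns.
class FieldFormat {
    private static final DateTimeFormatter DATE = DateTimeFormatter.ofPattern("yyyy/MM/dd");

    // Turns inputs such as "(555) 123-4567" into "555-123-4567".
    static String phone(String raw) {
        String digits = raw.replaceAll("\\D", ""); // keep digits only
        if (digits.length() == 11 && digits.startsWith("1")) {
            digits = digits.substring(1);          // assumption: drop a leading country code
        }
        if (digits.length() != 10) {
            throw new IllegalArgumentException("Expected 10 digits: " + raw);
        }
        return digits.substring(0, 3) + "-" + digits.substring(3, 6) + "-" + digits.substring(6);
    }

    // Formats a date as YYYY/MM/DD with zero-padded month and day.
    static String lastActive(LocalDate date) {
        return date.format(DATE);
    }

    public static void main(String[] args) {
        System.out.println(phone("(555) 123-4567"));
        System.out.println(lastActive(LocalDate.now()));
    }
}
```

Rejecting anything that is not 10 digits, rather than guessing, keeps malformed numbers out of the spreadsheet; whether to skip or log such listings is a separate decision.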
The end state is to have an Excel document I can use to keep updating and adding new contacts based on the scrape of the [login to view URL] website.
Deliverable includes all source code files.
To be considered for this project, you MUST:
- Have a bid proposal within the posted project budget
- Include the phrase "Java is more than just coffee" as the first line in your bid proposal
- State when you will be able to begin actively working on the project, and whether you are working on any other projects at this time
- Visit the sample URL, select the first property, and confirm you can see the name and phone number on the page
*** Your bid will not be considered if you do not conduct the steps above ***
Hello Sir,
How are you?
I understand your job and am very excited to offer my services for it. Please feel free to contact me directly to discuss this position further. I am always online on Skype and Gtalk/Google Hangouts. We can discuss the details there quickly, if you are interested.
Best Regards,
Shafaqat Ali
Dear Sir, I have read your job description very carefully. I am ready to start this project. I can show a sample. If you are interested, please discuss the project with me over live chat.
Thank you,
Arham IT
Hello, I am experienced and specialized in C/C++ and Java programming, and I have experience in Android. I have taken part in many competitions, including a national competition in which I won 2nd prize. I am currently a Computer Science student at National University of Singapore. I always work with responsibility and quality. Please contact me for details and the time required; I will finish the job within the time required. Thank you!
Java is more than just coffee
I have a lot of experience with custom web scraping; you can check my previous scraping projects.
Also, I am good at web design, so I am good at designing GUIs, or it can be a command-line tool as needed.
However, I do not think your budget is suitable for this task, as it undervalues the programming effort that will be put into it. Plus, you have a lot of requests.
I will not go below my bid.