Scraping web content with Selenium in Java

In Progress Posted 6 years ago Paid on delivery
In Progress Paid on delivery

I want to download the content of a website but it's is a single page site (like instagram) and I cant just download an html file. We have to programmatically access the website through a browser (preferably in a headless environment like a Linux machine running in aws) and then use xpath to find the information we need and save this information somewhere (text files would be ok as long as the data is consistent).

Assuming we continue with our Instagram example, the objective is to take a snapshot of all the information in a public account. This can be split into several steps:

* Start by going to the main page of an Instagram user and get the number of posts, followers, following and bio/description

* Get the full list of followers (instagram ids). Note this requires a sub-window and scrolling down.

* Get the full list of following (instagram ids). Note this requires a sub-window and scrolling down.

* Then for each post get a link to the post, a link and maybe a thumbnail of the image, the number of likes, the number of comments, the list of hastags, the list of people who commented, the list of people mentioned and possibly the text in the post plus all the comments. Note that since this has to be done for all posts we will need to scroll down the page all the way to the end.

This is a side project for me and I will have to maintain the code so very simple and easy to read source code is preferred. With plenty of comments explaining why things are done. Please take that into account when making an offer. Unit tests would be a very nice addition, it would be great to include them (starting from the very obvious to more complicated cases).

A payment schedule will be established for each one of the steps, starting from the most basic of just getting the number of posts, number of followers, number of following and bio/description.

Please only apply if you are very familiar with this kind of work. I am software engineer myself and although I am not familiar with the details of web automation I do a very decent job reading java code.

Java JUnit Selenium Webdriver Web Scraping xpath

Project ID: #16630199

About the project

18 proposals Remote project Active 6 years ago

18 freelancers are bidding on average €188 for this job

seaanddream

Hi, my name is Sevinc. My bid covers all the phases you explained above... I read your "Scraping web content with Selenium in Java" project descriptions carefully before bidding. I checked your requirements as well... More

€500 EUR in 7 days
(258 Reviews)
7.7
zekovicm

Hi there,I am Miljan,Web Scraping expert from Bosnia & Herzegovina,Europe. I have carefully gone through with your requirements and I would like to help you with this job ! I can start immediately and finish it within More

€222 EUR in 3 days
(79 Reviews)
7.0
phpXpertbd

Dear Sir, I'm very much delighted to let you know that i did data scraping with PHP-cURL, PhantomJS, Node.js, Selenium from many sites. I just scraped the data from web site and then wrote the data in mysql database More

€100 EUR in 3 days
(66 Reviews)
7.2
abedin94

hi , I have scrapped 100+ websites . Amazon , adidias and many more r among them . knk me so that we can discuss . I can do this using python not java , if language is not a problem for me then knk me up .

€166 EUR in 3 days
(116 Reviews)
6.4
schoudhary1553

Hi there..... Warm Greetings We came along with your request for Scraping web content with Selenium in Java and we reviewed your project description. We'd like to help you with confidence and satisfying results.. More

€300 EUR in 5 days
(26 Reviews)
6.0
olegivanoviam

Hi. My name is Oleg and I am senior C#/.NET software developer from Ukraine. Please contact to discuss details regarding the project.

€250 EUR in 3 days
(3 Reviews)
4.5
fertroya

Hi, I'm interested in this job also as a side job. In my current job wehave selenium with chrome web driver in a headless environment running in ec2 already setup. We use this environment to run our end to end tests. More

€166 EUR in 3 days
(15 Reviews)
4.4
DarkKnight2206

I am a python developer. I have great experience in web scraping and I am an expert in it. I have all necessary skills to scrape almost any website. I have even scraped sites like google, whatsapp web, instagram which More

€125 EUR in 2 days
(14 Reviews)
4.8
sonaligupta334

A proposal has not yet been provided

€155 EUR in 3 days
(2 Reviews)
2.2
WebStar426

I am experienced SELENIUM Expert. Hi, I’ve carefully gone through your job posting. I am Web Scraping Expert.... I will get all data you want. I can scrapy even Ebay site. I can work with you anytime you want. More

€222 EUR in 3 days
(2 Reviews)
1.6
VirtualBrainInc

Hello, I have briefly read the description on Scraping web content with Selenium in Java development, and I can deliver as per the requirements however I need us to discuss for more clarity on the details, deadline More

€155 EUR in 3 days
(2 Reviews)
1.2
sergsyuzev

Hello. I spend a lot of time with python+selenium and I can do it for you. I need more details about your task. Best regards, Serge

€166 EUR in 5 days
(0 Reviews)
0.0
pradeepta02

I have more than 10 years of experience in automation testing using selenium/webdriver, Java, Python, Appium etc. My specialities include building automation frameworks from scratch, writing automated test scripts and More

€155 EUR in 3 days
(0 Reviews)
0.0