Find Jobs
Hire Freelancers

Scrapy code for scraping several e-commerce website

$250-750 USD

Closed
Posted about 6 years ago

$250-750 USD

Paid on delivery
I want the product name/title, unique product code, current price, availability, discount (if there is any), size and fit information, list of product codes with alternative colors, and all original image urls associated with the product (typically 4-6 image urls per product). Original image urls meaning that it is not the thumbnail version since that is too small to view. I describe where they are in the first picture attached, which is a clothing: 1) product name/title: in this case, Ophelia Wide Leg Flare Jeans 2) For this website, unique product code is farfetch id, which is at the center bottom. 3) Discount and price info is at top right 4) there are four images associated with this dress, which you could click to check them all in the top left. Somehow this sight does not have thumbnail version, which is good. 5) list of product codes with alternative colors: does not apply for clothes usually, therefore an empty list [] 6) Fit and Size information is a bit complicated. There are two parts and I want both. FIrst is in the section Size & Fit, here for clothes, it is generally the measurement of the model and which size the model is wearing. That is fine, I just need the entire text. The second part is the dropdown menu on the top right: "Select Size". I need to get all the available sizes, in this case ["23 WAIST", "24 WAIST", ....] AND there availability and price for each size. Availability are very often different for different sizes, as we can see here 23 and 24 are "Last 1 left" while the rest shows nothing, which means "in stock" in general. Also different sizes sometimes have different prices, I also want to get that information. So to sum up, for the first part, I want a big text string containing whatever is in Size & Fit tab. On the second part, I want in the select size dropdown menu, a list of tuple (size, price, avilability) per size option. Can use NA if nothing is there. Point 3) above is mainly relevant for bags, for example, in the attached second picture, which is a brown handbag: on the top right just under "Add to bag" there is a button for a red bag where you could click to get into another bag with exactly the same style but just different colors. I want that red bag's product ID to be stored here with the current brown bag as list of product codes with alternative colors (and by symmetry, current product code should also be stored as alternative color product code list of the red bag). I need to a list here since sometimes there are 4 different colors for a bag. For clothes, this could just be an empty list []. I want it done for two categories: women clothing and women bags, on U.S. store of the following websites: [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] Some notes: 1. I did some easy check, and found that one website [login to view URL] needs to get around reCaptcha. I have no idea whether it is possible to do or not and how hard it is. If it takes too much time, then forget it since I just want to get some data and it is OK to lose one website.. 2. The approach I have in mind is to scrape start from, say for handbag, page shop all bags. And scrape all bags on the first page and move on to the next, etc.. For Neyman marcus, the page is: [login to view URL] There we could see the importance of doing step 3) above. If we just scrape everything page by page from [login to view URL] and do not check the alternative colors, we will lose those bags, since if a bag has 4 colors, only one will show up on all handbags page and in order to get everything it is crucial to check for alternative colors.
Project ID: 16196906

About the project

7 proposals
Remote project
Active 6 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
7 freelancers are bidding on average $1,016 USD for this job
User Avatar
Hi, I am interested in your project related to scrape some websites for women clothes and bags categories. Please send me a message so we can discuss all the details.
$705 USD in 8 days
5.0 (188 reviews)
7.3
7.3
User Avatar
Hi, I am expert in doing such tasks and ready to do it as per according to your needs. I assure you to provide you best quality results without errors. Lets discuss further and get started. You can check my profile for work experience. Thanks
$647 USD in 10 days
4.9 (165 reviews)
6.4
6.4
User Avatar
Hi there, As one of the best web scraping experts here on freelancers I can scrape those 5 sites for you. Check out my profile, portfolio, and former clients feedback - that'll let you know everything about me. There is a lot of experience behind me, so it will be a huge satisfaction for me to do this job. Feel free to contact me so we can discuss further details. Thank you for taking the time to read my proposal.I am looking forward to hearing from you. Kind regards
$500 USD in 10 days
5.0 (28 reviews)
5.7
5.7
User Avatar
Hi, I have great experience on web scraping , Data Mining, Research, Web search and automation. Will write a Python script to extract the requested data . Thanks
$750 USD in 10 days
4.9 (81 reviews)
5.9
5.9

About the client

Flag of UNITED STATES
New York, United States
5.0
1
Payment method verified
Member since Jan 31, 2018

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.