Scrape room rental listings from websites and provide JSON files with the data
£140 (approx. $175)
- Posted:
- Proposals: 2
- Remote
- #544529
- Awarded
Description
Experience Level: Expert
General information for the business: Student housing website
Description of requirements/functionality: I need you to write a script that will extract data about room rental listings from three websites (Gumtree, Kangaroom and Easyroommate).
I will provide a few URLs with search criteria, listing rooms in a particular region (e.g. http://www.gumtree.com/search?current_distance=3.0&seller_type=&property_type=&min_property_number_beds=&max_property_number_beds=&min_price=&max_price=&photos_filter=Y&q=&search_location=Paisley&category=flats-and-houses-for-rent-offered&search_scope=title)
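As a rough illustration of how such a search URL could be handled, the sketch below uses only Python's standard library to read the search criteria out of the example URL and to build a variant for another region. The helper names `search_params` and `with_location` are hypothetical, and it is only an assumption that the site accepts a changed `search_location` value in this way.

```python
from urllib.parse import parse_qs, urlencode, urlsplit, urlunsplit

# The example search URL from the description above.
SEARCH_URL = (
    "http://www.gumtree.com/search?current_distance=3.0&seller_type=&property_type="
    "&min_property_number_beds=&max_property_number_beds=&min_price=&max_price="
    "&photos_filter=Y&q=&search_location=Paisley"
    "&category=flats-and-houses-for-rent-offered&search_scope=title"
)


def search_params(url):
    """Return the non-empty query parameters of a search URL as a dict."""
    query = urlsplit(url).query
    # parse_qs drops blank values by default, so unused filters disappear.
    return {key: values[0] for key, values in parse_qs(query).items()}


def with_location(url, location):
    """Return a copy of the URL with search_location replaced (hypothetical helper)."""
    parts = urlsplit(url)
    # Keep blank values here so the rebuilt URL has the same parameters as the original.
    params = parse_qs(parts.query, keep_blank_values=True)
    params["search_location"] = [location]
    return urlunsplit(parts._replace(query=urlencode(params, doseq=True)))


criteria = search_params(SEARCH_URL)
glasgow_url = with_location(SEARCH_URL, "Glasgow")
```

Here `criteria` holds only the filters that were actually set, which may be useful for recording which search produced each listing.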
Your script must crawl the room details page for each of the search results, extract the data for each room, and produce a JSON list of dictionaries, where each dictionary represents one room. I will tell you which fields need to be extracted (e.g. title, description, source URL, price, etc.) and what the names of the JSON fields should be. Any photos of a room must also be included, as URLs to the images on the source website. The location coordinates must be extracted as well (from the Google map).
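The extraction step described above might look roughly like the following sketch, which parses one details page into a dictionary and serialises a list of such dictionaries as JSON. Everything specific here is an assumption for illustration: the sample HTML, the `price` class, the field names, the `example.com` URL, and the idea that the coordinates appear in a map image URL as `center=lat,lng`. The real markup of each site will differ and the final field names are to be supplied by the client.

```python
import json
import re
from html.parser import HTMLParser
from urllib.parse import urljoin

# Hypothetical details page; real pages on each site will be structured differently.
SAMPLE_HTML = """
<html><body>
<h1>Double room near campus</h1>
<span class="price">£350</span>
<img src="/images/room1.jpg">
<img src="/images/room2.jpg">
<img class="map" src="http://maps.google.com/maps/api/staticmap?center=55.8456,-4.4239&zoom=14">
</body></html>
"""

# Assumed pattern: a "lat,lng" pair in a center, q, or ll query parameter of a map URL.
COORDS = re.compile(r"[?&](?:center|q|ll)=(-?\d+(?:\.\d+)?),(-?\d+(?:\.\d+)?)")


class RoomParser(HTMLParser):
    """Collects title, price, photo URLs, and coordinates for one room."""

    def __init__(self, source_url):
        super().__init__()
        self.room = {"source_url": source_url, "title": "", "price": "",
                     "photos": [], "latitude": None, "longitude": None}
        self._field = None  # name of the text field currently being read

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "h1":
            self._field = "title"
        elif tag == "span" and attrs.get("class") == "price":
            self._field = "price"
        elif tag == "img" and attrs.get("src"):
            match = COORDS.search(attrs["src"])
            if match:  # a map image: take the coordinates, not a photo
                self.room["latitude"] = float(match.group(1))
                self.room["longitude"] = float(match.group(2))
            else:  # an ordinary photo: store it as an absolute URL
                self.room["photos"].append(urljoin(self.room["source_url"], attrs["src"]))

    def handle_endtag(self, tag):
        if tag in ("h1", "span"):
            self._field = None

    def handle_data(self, data):
        if self._field:
            self.room[self._field] += data.strip()


def parse_room(html, source_url):
    parser = RoomParser(source_url)
    parser.feed(html)
    return parser.room


# One dictionary per room, collected into a list and written out as JSON.
rooms = [parse_room(SAMPLE_HTML, "http://example.com/rooms/123")]
output = json.dumps(rooms, ensure_ascii=False, indent=2)
```

In a full script, `parse_room` would be called once per details page fetched from the search results, with a separate parser (or set of selectors) for each of the three sites.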
I need this done quickly (i.e. within 2-3 days at most).
You are free to use any language or technology you are comfortable with. I care more about the end results than the script itself.
Extra notes:
Jordan D.
- 100% (58)
- Projects Completed: 75
- Freelancers worked with: 66
- Projects awarded: 48%
- Last project: 6 Mar 2017
- United Kingdom