Scraper for Tripadvisor using scrapy framework
- or -
Post a project like this$$
- Posted:
- Proposals: 1
- Remote
- #1115319
- Expired
Description
Experience Level: Intermediate
Hi, our company (we communicate mainly in French) is looking for a top-notch developer to create a software for scraping www.tripadvisor.fr
Importants points :
- the software should be able to retrieve many fields, but above all email and website fields !
- the footprints used to retrieve all the fields should be easily accessed for modifiying, in case Tripadvisor change the page structure of the site.
- the software should be able to scrape the french version of Tripadvisor.
- the software should be able to handle large quantities of data, scraping thousands of pages at a time, multitasking should be considered.
- if needed, the software should be able to handle proxy rotating and to limit the number of requests/second.
- the software should let the user specify the location and the service looked for (hotel, restaurant, camping, etc...)
- the software will have to run on a desktop PC (manjaro Linux)
- export will be done in CSV format
- the software will be delivered with editable source code
==> Using Scrapy and Scrapoxy tools will be a major trump
WARNING ! To qualify for that project, you must=
- 2 years+ experience in web scraping
- Good background & portfolio are required
- Being reactive & easily reachable
- Attention to details
Any application which doesn't fit to one or more from criterias above, will be ignored.
Note that, we're looking for someone we can work with on a regular/long-term basis then we rely on your offer/bid to be reasonable.
IMPORTANT: In a few words, thanks to tell us more about your background/experience and a link to your portfolio : )
Thanks for your attention & interest
Annexed screen shots are included, indicating the fields to be scraped and their names.
Note 1: Excellence and Revendication fields can be boolean (yes/no - 0/1)
Note 2: revendication field can be reversed (yes/1 if not displayed, no/0 if displayed)
Note 3: The URL of the page will be included to the scraped fields, which makes a total of 12 fields to be scraped.
Importants points :
- the software should be able to retrieve many fields, but above all email and website fields !
- the footprints used to retrieve all the fields should be easily accessed for modifiying, in case Tripadvisor change the page structure of the site.
- the software should be able to scrape the french version of Tripadvisor.
- the software should be able to handle large quantities of data, scraping thousands of pages at a time, multitasking should be considered.
- if needed, the software should be able to handle proxy rotating and to limit the number of requests/second.
- the software should let the user specify the location and the service looked for (hotel, restaurant, camping, etc...)
- the software will have to run on a desktop PC (manjaro Linux)
- export will be done in CSV format
- the software will be delivered with editable source code
==> Using Scrapy and Scrapoxy tools will be a major trump
WARNING ! To qualify for that project, you must=
- 2 years+ experience in web scraping
- Good background & portfolio are required
- Being reactive & easily reachable
- Attention to details
Any application which doesn't fit to one or more from criterias above, will be ignored.
Note that, we're looking for someone we can work with on a regular/long-term basis then we rely on your offer/bid to be reasonable.
IMPORTANT: In a few words, thanks to tell us more about your background/experience and a link to your portfolio : )
Thanks for your attention & interest
Annexed screen shots are included, indicating the fields to be scraped and their names.
Note 1: Excellence and Revendication fields can be boolean (yes/no - 0/1)
Note 2: revendication field can be reversed (yes/1 if not displayed, no/0 if displayed)
Note 3: The URL of the page will be included to the scraped fields, which makes a total of 12 fields to be scraped.
Patrick R.
0% (0)Projects Completed
-
Freelancers worked with
-
Projects awarded
0%
Last project
13 Dec 2024
France
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies