Data Scraping (From Directories)
- or -
Post a project like this2338
£200(approx. $251)
- Posted:
- Proposals: 11
- Remote
- #1785320
- OPPORTUNITY
- Completed
TOP CERT Virtual Assistant, Data Analyst, Proofreader, Excel, Word and PowerPoint Trainer
Swindon
scrapingsolution.com, IT Consultant, Python Developer, Process Automation, Desktop Applications, Web Scraping, Web Crawling
Newport Pagnell
E-mail scrap, Web Research, Data Entry, Image Editor, Powerpoint presentation Designer
San Mateo
Data Mining, Lead Generation, Searching, Data Entry, Website Design, Research & Virtual Analyst, Business Process & all Others relates to ITEs
Udaipur
75264929684184666610162521142294123542312935741493238184834318994901915918
Description
Experience Level: Intermediate
Local Business Directory Data Scraping
We are looking to build a local community comprising of all local businesses in Fife (Scotland, UK). In order to get this under way we are launching a "Local Business Directory". The "seed" listings will come from scraped data as below.
The following is an overview of the work that we would like undertaken:
Scraping - 1st Wave (Milestone 1)
The following data from the initial scrape is a mandatory requirement:
○ Business name, Address, Postcode
○ Landline telephone number, mobile as well if possible
○ Business category as per the directory (.e hairdresser, taxi etc)
○ Website address
When obtaining the first scrape it is possible that you may also obtain other data i.e facebook page, reviews etc. If these cannot be obtained in the first scrape, see "Scraping - 2nd Wave" (Below)
These are the directories to be scrapped:
○ business.yell.com
○ http://www.thomsonlocal.com/
○ http://www.scoot.co.uk/
○ http://findit.fifetoday.co.uk/
○ www.freeindex.co.uk
○ http://www.118118.com
○ www.192.com
This data will then be provided to us, by directory, in CSV/Excel format. Ideally, we would also like the business logo's in the CSV file if this is technically possible.
We will then run perform some cleansing of the file and provide a cleaned up version back to you, ready for "Wave 2 Scraping".
Scraping - 2nd Wave (Milestone 2)
This activity will focus on trying to obtain missing data from wave 1. Typically this may be telephone numbers and facebook business page.
To achieve this, when we cleanse the file we will also prepare a version for you where business telephone numbers and/or facebook business page listings are missing.
You will then load these website URL's up into your scraper and have the websites scraped for this missing data (And perhaps any other data obtainable).
Timescales
We need this work completed within 7 days of being given the assignment.
We are looking to build a local community comprising of all local businesses in Fife (Scotland, UK). In order to get this under way we are launching a "Local Business Directory". The "seed" listings will come from scraped data as below.
The following is an overview of the work that we would like undertaken:
Scraping - 1st Wave (Milestone 1)
The following data from the initial scrape is a mandatory requirement:
○ Business name, Address, Postcode
○ Landline telephone number, mobile as well if possible
○ Business category as per the directory (.e hairdresser, taxi etc)
○ Website address
When obtaining the first scrape it is possible that you may also obtain other data i.e facebook page, reviews etc. If these cannot be obtained in the first scrape, see "Scraping - 2nd Wave" (Below)
These are the directories to be scrapped:
○ business.yell.com
○ http://www.thomsonlocal.com/
○ http://www.scoot.co.uk/
○ http://findit.fifetoday.co.uk/
○ www.freeindex.co.uk
○ http://www.118118.com
○ www.192.com
This data will then be provided to us, by directory, in CSV/Excel format. Ideally, we would also like the business logo's in the CSV file if this is technically possible.
We will then run perform some cleansing of the file and provide a cleaned up version back to you, ready for "Wave 2 Scraping".
Scraping - 2nd Wave (Milestone 2)
This activity will focus on trying to obtain missing data from wave 1. Typically this may be telephone numbers and facebook business page.
To achieve this, when we cleanse the file we will also prepare a version for you where business telephone numbers and/or facebook business page listings are missing.
You will then load these website URL's up into your scraper and have the websites scraped for this missing data (And perhaps any other data obtainable).
Timescales
We need this work completed within 7 days of being given the assignment.
Jason N.
0% (0)Projects Completed
3
Freelancers worked with
3
Projects awarded
56%
Last project
7 Feb 2019
United Kingdom
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
Do you have another data scrape project?
538365
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies