
SITE DATA SCRAPING
- or -
Post a project like this1292
£330(approx. $450)
- Posted:
- Proposals: 23
- Remote
- #3613751
- OPPORTUNITY
- Awarded
Virtual Assistant(VA), SEO expert, Google Ads expert, WordPress Expert, Content Creator.
Full Stack Developer/React.JS/Node.JS/ReactNative/Hybrid Apps /PHP/Wordpress/Shopify/Laravel/WebRTC/Angular/Digital Marketing
Business Leads, LinkedIn/YouTube/Facebook optimization, Digital Marketing, Content Writing

Researcher, Data base building, Lead generation, E Mail database, Data Entry, Research base writing, Market Research,

Digital Marketing Expert | LinkedIn Specialist | MACHINE LEARNING | ADVANCE EXCEL| POWER BI EXPERT| B2B Lead Generation | Social Media Strategy | Competitor Analysis| GENERATIVE AI | PYTHON

Digital Marketing, Email Marketing, SMTP Server, WordPress Expert and SEO & Lead Generation Expert

Virtual Assistant, Web Scraping, Data Mining, Python Bot creation, Data Entry, Photoshop
72257145623148306362111853407044313837804375545055612512114947349163970444097225393
Description
Experience Level: Expert
We want someone who has expert knowledge of scraping websites to collect all data from the following sites
FACEBOOK (UK BUSINESSES ONLY)
GOOGLE BUSINESS DIRECTORY (UK BUSINESSES ONLY)
YELP.CO.UK
RAW files to be delivered first. Description below:
A RAW file is all of the data on each site. From the very first entry to a specified date i.e., 14th June 2022. So, if the first entry was 1st February 2004, every entry from that date to 14th June 2022.
I will require confirmation that all of the brief can be accomplished and the timeframe to deliver RAW files. The estimated Raw amounts are as follows:
FACEBOOK: 4 million entries to date
Google: 4 million entries to date
Yelp: 1 million entries to date
So to be bale to deliver the Raw files within 4 weeks, 200000 records per day will need to be collected. The weekly scrape will obviously be a lot less in numbers.
Weekly Scrape
The weekly scrape will be from 15th June 2022 to 22nd June 2022, continuing weekly, 23rd June to 30th June etc.
The information we require from each site is as follows, in this order:
company name
business category
address 1
address 2
address 3
town
postcode
landline telephone
mobile telephone
website
email
owner
date joined
URL
If any of the above are not present on the scraping sites, especially mobile telephone & email address, we would like this information collected from their own website.
I have attached a sample file from a site we are currently scraping, so you can view the layout and information we require. As you will see there is an extra column "mobile extra" for numbers grabbed from business websites. You will also notice that there are 2 tabs, 1 without mobile telephone numbers and the other with. We would like to copy this format also.
FACEBOOK (UK BUSINESSES ONLY)
GOOGLE BUSINESS DIRECTORY (UK BUSINESSES ONLY)
YELP.CO.UK
RAW files to be delivered first. Description below:
A RAW file is all of the data on each site. From the very first entry to a specified date i.e., 14th June 2022. So, if the first entry was 1st February 2004, every entry from that date to 14th June 2022.
I will require confirmation that all of the brief can be accomplished and the timeframe to deliver RAW files. The estimated Raw amounts are as follows:
FACEBOOK: 4 million entries to date
Google: 4 million entries to date
Yelp: 1 million entries to date
So to be bale to deliver the Raw files within 4 weeks, 200000 records per day will need to be collected. The weekly scrape will obviously be a lot less in numbers.
Weekly Scrape
The weekly scrape will be from 15th June 2022 to 22nd June 2022, continuing weekly, 23rd June to 30th June etc.
The information we require from each site is as follows, in this order:
company name
business category
address 1
address 2
address 3
town
postcode
landline telephone
mobile telephone
website
owner
date joined
URL
If any of the above are not present on the scraping sites, especially mobile telephone & email address, we would like this information collected from their own website.
I have attached a sample file from a site we are currently scraping, so you can view the layout and information we require. As you will see there is an extra column "mobile extra" for numbers grabbed from business websites. You will also notice that there are 2 tabs, 1 without mobile telephone numbers and the other with. We would like to copy this format also.
Andrew B.
100% (1)Projects Completed
1
Freelancers worked with
1
Projects awarded
80%
Last project
15 Jun 2022
United Kingdom
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies