Webscraping EPALE/Erasmus+/Etwinning
- or -
Post a project like this1445
€200(approx. $213)
- Posted:
- Proposals: 6
- Remote
- #2784597
- OPPORTUNITY
- Awarded
Web Scraping | Web Design & Development |Mobile App Design & Development | Wordpress Website Development | Salesforce
Oakland
WordPress Expert✮Shopify Expert✮Graphic Designer✮AutoCAD 2D & 3D✮CV Writer & Designer✮Fullstack developer
Rawalpindi
Excel Expert |WordPress| Google Sheets |VBA & Google App Script | Woocommerce
Jhelum
LinkedIn Lead Expert, Web Scraping, Data Entry, Data mining, Web Crawling, Data extraction and Email Lists
London
34762936365732782977128591912354232749520
Description
Experience Level: Expert
Webscrape the following sources.
#EPALE - 2100 records
For each page https://epale.ec.europa.eu/en/organisations?alphabet_1=All&field_organization_country_tid_i18n=All&field_organization_type_tid_i18n=All&combine=&page=1
With page id going from 1 to 208
Select each of the 10 records per page
Extract the organization name and the country (use the flag caption), and email (if present)
If the organization description does not include an email, go on the organization website and search for the email
The final Excel file must be in the:
Country
Organization Name
Organization website(if present on page)
Email (either if present on page or found by crawling collected "Organization website" above.
URL of the detailed record (from epale website)
#ETWINNING – 11k records
For each of the following countries in the map, please select the country and click on Browse Schools.
https://www.etwinning.net/en/pub/community/countries.cfm
Turkey
Spain
France
Germany
Poland
Belgium
Sweden
Romania
Austria
Portugal
Netherlands
Step 1 Scraping fields from http://www.etwinning.net
- School Name
- City
- Region
- Website (If Present)
- URL of the detailed record (from etwinning website)
Step 2 Crawling Emails where Website available from step 1
#ERASMUS+ - 15k records
File attached.
Crawl all 15K sites and collect emails from those where email exist and can be collected automatically(i.e not hidden in image/ some jumbled words make email and site not opening can not be collect)
This will include automatic collection only not the manual searching of organization and collecting data.
Add the email column to the attached file.
DEADLINE
The job must be completed before Fri 10th April 14.00 - please, send daily updates with correct data so I can provide feedback
#EPALE - 2100 records
For each page https://epale.ec.europa.eu/en/organisations?alphabet_1=All&field_organization_country_tid_i18n=All&field_organization_type_tid_i18n=All&combine=&page=1
With page id going from 1 to 208
Select each of the 10 records per page
Extract the organization name and the country (use the flag caption), and email (if present)
If the organization description does not include an email, go on the organization website and search for the email
The final Excel file must be in the:
Country
Organization Name
Organization website(if present on page)
Email (either if present on page or found by crawling collected "Organization website" above.
URL of the detailed record (from epale website)
#ETWINNING – 11k records
For each of the following countries in the map, please select the country and click on Browse Schools.
https://www.etwinning.net/en/pub/community/countries.cfm
Turkey
Spain
France
Germany
Poland
Belgium
Sweden
Romania
Austria
Portugal
Netherlands
Step 1 Scraping fields from http://www.etwinning.net
- School Name
- City
- Region
- Website (If Present)
- URL of the detailed record (from etwinning website)
Step 2 Crawling Emails where Website available from step 1
#ERASMUS+ - 15k records
File attached.
Crawl all 15K sites and collect emails from those where email exist and can be collected automatically(i.e not hidden in image/ some jumbled words make email and site not opening can not be collect)
This will include automatic collection only not the manual searching of organization and collecting data.
Add the email column to the attached file.
DEADLINE
The job must be completed before Fri 10th April 14.00 - please, send daily updates with correct data so I can provide feedback
Francesco C.
99% (70)Projects Completed
70
Freelancers worked with
22
Projects awarded
62%
Last project
26 Jan 2024
Italy
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies