Web crawler to find mentions of phrase within website body text
- or -
Post a project like this1663
$250
- Posted:
- Proposals: 8
- Remote
- #2542191
- OPPORTUNITY
- Awarded
Excel VBA, MS PRoject Expert, Web Scraper, Arena, Simulation, Spreadsheet, Wordpress customisation
Istanbul
Python | Data Analyst | Pandas | Seaborn | Matplotlib | Web Scraping | Automation
Ghandinagar
14184486085130044914297132419330274675227920673048737
Description
Experience Level: Expert
I need a script that is capable of doing the following..
From a .csv list of keywords and phrases the script searches an entire defined web domain and finds every exact mention of that phrase (ideally only within the rendered body next, not menus footers etc.. xpath?) and exports a resulting list with each imported keyword/phrase mapped against each URL on the domain which mentions that keyword.
So an example import file would be similar to..
Domain: domain.com
Keyword 1
Keyword 2
Keyword 3
Keyword 4
Keyword 5
Keyword 6 etc etc
The exported csv file mail look like,
Keyword 1 URL1.html URL1.html
Keyword 2 URL2.html URL7.html URL5.html
Keyword 3 URL1.html
Keyword 4 URL4.html URL4.html
Keyword 5
Keyword 6 URL3.html URL2.html URL4.html
Import files may contain upwards if 600 words/phrases
From a .csv list of keywords and phrases the script searches an entire defined web domain and finds every exact mention of that phrase (ideally only within the rendered body next, not menus footers etc.. xpath?) and exports a resulting list with each imported keyword/phrase mapped against each URL on the domain which mentions that keyword.
So an example import file would be similar to..
Domain: domain.com
Keyword 1
Keyword 2
Keyword 3
Keyword 4
Keyword 5
Keyword 6 etc etc
The exported csv file mail look like,
Keyword 1 URL1.html URL1.html
Keyword 2 URL2.html URL7.html URL5.html
Keyword 3 URL1.html
Keyword 4 URL4.html URL4.html
Keyword 5
Keyword 6 URL3.html URL2.html URL4.html
Import files may contain upwards if 600 words/phrases
Matt S.
95% (36)Projects Completed
29
Freelancers worked with
40
Projects awarded
53%
Last project
30 Sep 2019
United Kingdom
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies