Regex and webscraping engine
- or -
Post a project like this$$$
- Posted:
- Proposals: 2
- Remote
- #1043676
- Expired
Description
Experience Level: Expert
General information for the business: Machine learning to identify specific content in websites and monitor that content for change
Kind of development: New program from scratch
Description of requirements/functionality: Our dev team are building our frontends and we now need to find that extraordinary team that can build our backend engine in record time.
The backend engine will read a URL from our MongoDB.
The URL is then to be processed by the engine and based on a number of predefined rules extract a specific content from the website.
The rules most probably will combine regex and web scraping methods to create a number of rules that can be combined in one way on one website and another way on another website to create the perfect combination of rules for each website (It is not possible to create site specific rules for the engine as the websites are unknown as the engine will operate on any website that any of our users will save to the DB.
When the engine identify the data based on these rules the engine save it to the DB and will monitor the website for that data and write to a new field in the DB if the data changes at any point.
If you are interested in working in this really fun project then first of all go to this URL.
To Sign the NDA:https://www.docracy.com/linksign/0bfdmhf8x3f
Secondly contact me on skype so that we can talk more about the project details after you signed the NDA. My Skype ID is klillrud
Specific technologies required: machine learning
Extra notes:
Kind of development: New program from scratch
Description of requirements/functionality: Our dev team are building our frontends and we now need to find that extraordinary team that can build our backend engine in record time.
The backend engine will read a URL from our MongoDB.
The URL is then to be processed by the engine and based on a number of predefined rules extract a specific content from the website.
The rules most probably will combine regex and web scraping methods to create a number of rules that can be combined in one way on one website and another way on another website to create the perfect combination of rules for each website (It is not possible to create site specific rules for the engine as the websites are unknown as the engine will operate on any website that any of our users will save to the DB.
When the engine identify the data based on these rules the engine save it to the DB and will monitor the website for that data and write to a new field in the DB if the data changes at any point.
If you are interested in working in this really fun project then first of all go to this URL.
To Sign the NDA:https://www.docracy.com/linksign/0bfdmhf8x3f
Secondly contact me on skype so that we can talk more about the project details after you signed the NDA. My Skype ID is klillrud
Specific technologies required: machine learning
Extra notes:
Karl L.
100% (1)Projects Completed
2
Freelancers worked with
2
Projects awarded
6%
Last project
4 Apr 2020
Sweden
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies