Regex and webscraping engine


Ended at: 08/03/2016

Fixed Price

$$$

Posted: 8 years ago
Proposals: 2
Remote
#1043676
Expired


Description

Experience Level: Expert

General information for the business: Machine learning to identify specific content on websites and monitor that content for changes
Kind of development: New program from scratch
Description of requirements/functionality: Our dev team is building our frontends, and we now need an extraordinary team that can build our backend engine in record time.

The backend engine will read a URL from our MongoDB.
The engine then processes the URL and, based on a number of predefined rules, extracts specific content from the website.

The rules will most probably combine regex and web scraping methods. They can be combined one way on one website and another way on another, to create the best combination of rules for each website. It is not possible to create site-specific rules for the engine, because the websites are unknown: the engine will operate on any website that any of our users saves to the DB.
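To illustrate what combining regex and scraping rules could look like, here is a minimal sketch in Python using only the standard library. Everything in it is an assumption made for illustration, not part of the project specification: the `Rule` class, the `TextByTag` parser, the `extract` function, and the two sample rules are hypothetical names. Each rule pairs an optional HTML tag filter (the scraping part) with a regex (the matching part), and the generic rule list is tried in order on whatever page it is given.

```python
import re
from dataclasses import dataclass
from html.parser import HTMLParser
from typing import Optional

# Tags that never get a closing tag, so they must not go on the open-tag stack.
VOID_TAGS = {"br", "img", "hr", "meta", "link", "input"}


class TextByTag(HTMLParser):
    """Collects the text found directly inside each tag name."""

    def __init__(self):
        super().__init__()
        self.stack = []        # currently open tags
        self.text_by_tag = {}  # tag name -> list of text chunks

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        if tag in self.stack:
            # Pop up to and including the matching open tag.
            while self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        text = data.strip()
        if text and self.stack:
            self.text_by_tag.setdefault(self.stack[-1], []).append(text)


@dataclass
class Rule:
    name: str
    pattern: str               # regex with exactly one capture group
    tag: Optional[str] = None  # only search text inside this tag; None = all text


def extract(html, rules):
    """Return (rule name, captured value) for the first rule that matches, else None."""
    parser = TextByTag()
    parser.feed(html)
    all_text = [t for chunks in parser.text_by_tag.values() for t in chunks]
    for rule in rules:
        candidates = parser.text_by_tag.get(rule.tag, []) if rule.tag else all_text
        for text in candidates:
            match = re.search(rule.pattern, text)
            if match:
                return rule.name, match.group(1)
    return None


# Hypothetical generic rules: not tied to any particular website.
RULES = [
    Rule("price", r"\$(\d+\.\d{2})", tag="span"),
    Rule("phone", r"(\d{3}-\d{4})"),
]
```

Because the rules are not bound to a site, the same list can yield a different winning rule on each page, which is one simple reading of "combined one way on one website and another way on another". A real engine would likely need scoring or learned rule selection rather than first-match ordering.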

When the engine identifies the data based on these rules, it saves the data to the DB, then monitors the website for that data and writes to a new field in the DB if the data changes at any point.
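The save-then-monitor step described above could be sketched roughly as follows. This is a hedged illustration only: a plain Python dict stands in for the MongoDB document, and the field names (`content`, `content_hash`, `first_seen`, `changes`) and the functions `fingerprint` and `record_observation` are hypothetical, not taken from the project.

```python
import hashlib
from datetime import datetime, timezone


def fingerprint(content):
    """Hash the extracted content so comparisons are cheap and whitespace-tolerant."""
    return hashlib.sha256(content.strip().encode("utf-8")).hexdigest()


def record_observation(doc, content, now=None):
    """Update `doc` in place; return True only when a change was recorded.

    `doc` is a dict standing in for the stored database document (assumption).
    """
    now = now or datetime.now(timezone.utc)
    new_hash = fingerprint(content)

    # First time this URL is processed: save the baseline, nothing has changed yet.
    if "content_hash" not in doc:
        doc["content"] = content
        doc["content_hash"] = new_hash
        doc["first_seen"] = now
        return False

    # Same content as the last check: nothing to write.
    if new_hash == doc["content_hash"]:
        return False

    # Content differs: log the change in a separate field, then update the baseline.
    doc.setdefault("changes", []).append(
        {"at": now, "old": doc["content"], "new": content}
    )
    doc["content"] = content
    doc["content_hash"] = new_hash
    return True
```

A scheduler would call `record_observation` each time the engine re-extracts the data for a URL. Keeping the change log in its own field matches the "write to a new field" requirement, while the hash avoids comparing large extracted values directly.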

If you are interested in working on this really fun project, first go to this URL to sign the NDA: https://www.docracy.com/linksign/0bfdmhf8x3f

Second, contact me on Skype after you have signed the NDA so that we can talk more about the project details. My Skype ID is klillrud.
Specific technologies required: machine learning

Karl L.