Expired Domain Grabber
- or -
Post a project like this£7/hr(approx. $9/hr)
- Posted:
- Proposals: 3
- Remote
- #421847
- Expired
Description
Experience Level: Entry
General information for the business: Domains
Kind of development: New program from scratch
Description of requirements/functionality: Expired Domain Grabber
Extra notes: Expired Domain Grabber - Project Description
General Mode of Operation
There is a need for a script that scans the Internet for domains which are not being payed for by the owners after the end of a fixed time, and therefore, are not being extended by registrars. Such domains are called "expired domains."
The script is to start from a certain starting point (for example, a popular domain such as "www.bbc.com") and then, like a crawler, pursue all external links that are found on this popular site. The URLs that are found are to be written into a database. Simultaneously and on a permanent basis, this database is to be scanned by the script for expired domains. If the script finds an expired domain, then the URL to such domain is to be indicated for further manual examination via a web interface. At the same time, the script continues to pursue external links and incorporate newly found URLs into the database.
The goal is the incorporation into the database of as many as possible domains and their URLs. The more domains there are in the database, the more URLs can be scanned. The more URLs are scanned, the higher is the potential yield of expired domains.
Special Mode of Operation
1. Blacklist
So that, from the outset, only URLs that are as useful as possible are saved in the database, an automatic pre-selection is necessary. For this purpose, the script must follow all links that are found (in order to find additional URLs), but must not incorporate every URL into the database. This particularly applies to domains that have certain words in their URLs, such as, for example, words in the field of pornography ("sex," "fuck," etc.). A blacklist must be created for such domain names. If the script finds a domain, the domain name of which contains a word from the blacklist, then the script is to follow this link (in order to find other potential domains), but not writing into the database the relevant URL with the word from the blacklist.
See full document for brief
Kind of development: New program from scratch
Description of requirements/functionality: Expired Domain Grabber
Extra notes: Expired Domain Grabber - Project Description
General Mode of Operation
There is a need for a script that scans the Internet for domains which are not being payed for by the owners after the end of a fixed time, and therefore, are not being extended by registrars. Such domains are called "expired domains."
The script is to start from a certain starting point (for example, a popular domain such as "www.bbc.com") and then, like a crawler, pursue all external links that are found on this popular site. The URLs that are found are to be written into a database. Simultaneously and on a permanent basis, this database is to be scanned by the script for expired domains. If the script finds an expired domain, then the URL to such domain is to be indicated for further manual examination via a web interface. At the same time, the script continues to pursue external links and incorporate newly found URLs into the database.
The goal is the incorporation into the database of as many as possible domains and their URLs. The more domains there are in the database, the more URLs can be scanned. The more URLs are scanned, the higher is the potential yield of expired domains.
Special Mode of Operation
1. Blacklist
So that, from the outset, only URLs that are as useful as possible are saved in the database, an automatic pre-selection is necessary. For this purpose, the script must follow all links that are found (in order to find additional URLs), but must not incorporate every URL into the database. This particularly applies to domains that have certain words in their URLs, such as, for example, words in the field of pornography ("sex," "fuck," etc.). A blacklist must be created for such domain names. If the script finds a domain, the domain name of which contains a word from the blacklist, then the script is to follow this link (in order to find other potential domains), but not writing into the database the relevant URL with the word from the blacklist.
See full document for brief
Kevin W.
100% (4)Projects Completed
9
Freelancers worked with
9
Projects awarded
12%
Last project
7 Dec 2019
United Kingdom
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies