Online Web Directory Data Mining Project

  • Posted
  • Proposals 7
  • Remote
  • #1322
  • Expired

Description

Experience Level: Expert
Online Web Directory Data Mining Project (Open to ideas like Web Crawler, Bot, Web Program and applications)


We require a good to ingenious programmer able to develop an application similar to a web bot/crawler.


(We are absolutely open-minded to good ideas, so if you can formulate a program that can download/retrieve all search results from the on-line directory without the process described below, we are very much interested.)


The chosen developer will be working closely with the data acquisition consultant (that's me!), who will guide the work to get the application developed to spec. We would love you to really enjoy what you do. Our approaches are based on innovative ideas and we like to get the right people on board.


Brief Description


An application to run specific searches on an on-line directory (search criteria are taken from a database) and return the results into a database or a delimited format.
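
As a rough illustration of the delimited output, here is a minimal Python sketch using the standard csv module. The function name to_delimited, the Name/Tel header row and the sample record are our own assumptions for illustration, not a required design.

```python
import csv
import io

def to_delimited(records, delimiter=","):
    """Render (name, tel) records as delimited text with a header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter=delimiter)
    writer.writerow(["Name", "Tel"])  # assumed column names
    writer.writerows(records)
    return buffer.getvalue()

# Sample record is made up for the demo.
print(to_delimited([("Joe's Barbers", "020 7946 0000")], delimiter="|"))
```

The delimiter is a parameter, so the same routine could produce comma, tab or pipe separated files.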


The on-line directory to be worked on will be similar to www.yell.com (please do check it out!).


The directory has 3 search fields; however, 2 will be primarily used, e.g. a search for barbers in London (criteria: Barber and London).



Search (Field 1): Barber

Or Name (Field 2):

Location (Field 3): London
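
To show how the three fields might map onto a request, here is a small Python sketch. The endpoint https://directory.example/search and the parameter names keywords, name, location and page are placeholders we made up; the real directory will use its own.

```python
from urllib.parse import urlencode

# Hypothetical endpoint and parameter names; the real directory's will differ.
BASE_URL = "https://directory.example/search"

def build_search_url(search="", name="", location="", page=1):
    """Build a query URL from the three search fields, skipping empty ones."""
    fields = {"keywords": search, "name": name, "location": location}
    params = {key: value for key, value in fields.items() if value}
    params["page"] = page
    return BASE_URL + "?" + urlencode(params)

print(build_search_url(search="Barber", location="London"))
```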



The specific search criteria are preferably taken automatically from a list in one or two databases; however, we can negotiate some kind of semi-manual operation.
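
One possible way to read the criteria automatically is sketched below in Python with the built-in sqlite3 module. The table name criteria and its search and location columns are assumptions for the example only; the actual database layout is still to be agreed.

```python
import sqlite3

def load_criteria(conn):
    """Return (search, location) pairs from a hypothetical `criteria` table."""
    rows = conn.execute(
        "SELECT search, location FROM criteria ORDER BY rowid"
    )
    return [(search, location) for search, location in rows]

# In-memory database with made-up rows, standing in for the real one.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE criteria (search TEXT, location TEXT)")
conn.executemany(
    "INSERT INTO criteria VALUES (?, ?)",
    [("Barber", "London"), ("Plumber", "Leeds")],
)
print(load_criteria(conn))
```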


It is imperative that the application retrieves specific data (primarily Name and Tel.) from page to page, i.e. 160 results at 20 per page = 8 pages, so the application must retrieve all 160 by collecting the 20 from each page and then perform the next search.
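
The page arithmetic and the page-by-page collection could look roughly like the Python sketch below. The fetch_page callable is hypothetical (it stands in for whatever downloads and parses one results page), and fake_fetch only generates dummy records for the demo.

```python
import math

RESULTS_PER_PAGE = 20  # per the brief

def page_count(total_results, per_page=RESULTS_PER_PAGE):
    """Number of pages needed to show total_results."""
    return math.ceil(total_results / per_page)

def collect_all(fetch_page, total_results):
    """Call fetch_page(page) for every page and merge the records.

    fetch_page is a hypothetical callable returning a list of (name, tel).
    """
    records = []
    for page in range(1, page_count(total_results) + 1):
        records.extend(fetch_page(page))
    return records

def fake_fetch(page):
    """Stand-in fetcher that returns 20 dummy records per page."""
    start = (page - 1) * RESULTS_PER_PAGE
    return [(f"Business {i}", f"tel-{i:04d}") for i in range(start + 1, start + 21)]

print(page_count(160))                    # 8
print(len(collect_all(fake_fetch, 160)))  # 160
```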


We can provide some general information to guide the development, such as the fact that the primary data are in specific font formats and are always displayed in specific field locations (to be tested).
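
If the fixed formatting turns out to correspond to consistent markup, extraction could be as simple as the Python sketch below. The span elements with class "name" and "tel" are purely an assumption for illustration; the real page structure has to be inspected first.

```python
from html.parser import HTMLParser

class ListingParser(HTMLParser):
    """Collect text from <span class="name"> and <span class="tel"> (assumed markup)."""

    WANTED = {"name", "tel"}

    def __init__(self):
        super().__init__()
        self.records = []   # completed {"name": ..., "tel": ...} dicts
        self._field = None  # field currently being read, if any
        self._current = {}  # partially filled record

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if tag == "span" and css_class in self.WANTED:
            self._field = css_class

    def handle_data(self, data):
        if self._field:
            self._current[self._field] = data.strip()

    def handle_endtag(self, tag):
        if tag == "span" and self._field:
            self._field = None
            # Emit the record once both fields have been seen.
            if self.WANTED <= set(self._current):
                self.records.append(self._current)
                self._current = {}

def parse_listings(html):
    parser = ListingParser()
    parser.feed(html)
    return parser.records

sample = '<div><span class="name">Joe\'s Barbers</span><span class="tel">020 7946 0000</span></div>'
print(parse_listings(sample))
```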


There are some specific constraints, such as a maximum of 20 results per page and a maximum of 10 pages per search (thus at most 200 results per search). We can work around these and will discuss them further with the respective candidates.
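
One workaround we could discuss is splitting any search that exceeds the cap into narrower ones. The Python sketch below assumes splitting by sub-location, which is only our suggestion; the function names and the sample sub-locations are illustrative.

```python
MAX_PER_PAGE = 20
MAX_PAGES = 10
MAX_RETRIEVABLE = MAX_PER_PAGE * MAX_PAGES  # 200 results per search

def needs_split(total_results):
    """True when a search reports more results than can be paged through."""
    return total_results > MAX_RETRIEVABLE

def plan_searches(search, location, total_results, sub_locations):
    """Return the searches to run: the original one, or one per sub-location."""
    if not needs_split(total_results):
        return [(search, location)]
    return [(search, sub) for sub in sub_locations]

print(plan_searches("Barber", "London", 160, ["Camden", "Hackney"]))
print(plan_searches("Barber", "London", 450, ["Camden", "Hackney"]))
```

A sub-location search may itself exceed the cap, in which case the same check would need to be applied again.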


We love great ideas and we are open to approaches that might be different. Special consideration will be given to candidates with directory programming and web crawling experience.

Further work will follow after successful completion.
