Bookie Website Odds Comparison Excel VBA Scraper
- or -
Post a project like this- Posted:
- Proposals: 4
- Remote
- #1591060
- Awarded
Description
Description of requirements/functionality: I am looking for some software that will automatically scrape golf odds from a large selection of UK bookies that can then be exported to excel. I list a series of websites below and all odds under the golf section of that website should be scraped. The software should be able to do this on demand. It should also have the functionality to scrape all or a selection of the bookies listed.
I want this to come straight from the bookies, not odds comparison websites. I currently scrape prices from oddschecker but I'm finding that they don't re-map for new markets often enough, miss out a few markets and exclude a few websites that I would like to see.
There are many different types of markets to import but I want this to include every single golf price a bookie offers. An example of the markets that are likely to be found are:
• Winner
• Top 5
• Top 10
• Top 20
• Top Continental Euro
• Top Englishman
• 3 way 18 holers
• 2 way 18 holers
• 2 way 72 hole matches
• 72 Hole Groups
• Round X Leader
Etc….
The most important data to scrape from these websites will be:
• Tournament Name – For example, The Irish Open, The Open Championship, The Masters etc.
• Market Name – For example, tournament winner, top European, 72 hole match bet etc.
• Bookie – For example, Ladbrokes, bet365, William Hill etc.
• Odds – For example, 10/11, 16/1 etc. These should be expressed in what is known as decimal odds. This is just a different way of recording these fractions. Put simply, if a website records their odds in fractions then 10/11 becomes 1.909009, 11/8 becomes 2.375, 16/1 becomes 17.0. It is basically the fractional odds +1. Some bookies such as pinnacles and betfair already record their odds in decimal.
• Each way terms – this is split into two factors. Each way places and each way fraction. For example, on a golf win market you will typically see ¼ 1,2,3,4,5. This means that the number of each way places is 5, and the fraction is ¼.
The data collated should then be rearranged to create an odds comparison grid. This will probably require some manual lookups to define how each bookie writes a market or player name. For example Ladbrokes may call one market “Tournament Winner” and bet365 may call the same market “to win outright”. In addition to market name differences, player name differences will also happen. For example, Rafa Cabrera Bello v Rafael Cabrera-Bello. Similarly, tournament name will require lookups. For example, The Masters v US Masters.
The current list of bookies I would like to scrape is below. I can provide urls to the data but the successful applicant will have to find the data feeds themselves:
• 188bet.co.uk
• 10bet.co.uk
• 888sport.com
• betpack.com
• bet365.com
• betbright.com
• betdaq.com
• betfair exchange - betfair.com/exchange
• betfair sportsbook - betfair.com/sport
• betfred.com
• betstars.uk
• betvictor.com
• betway.com
• blacktype.bet
• boylesports.com
• bwin.com
• sports.coral.co.uk
• en.expekt.com
• geoff-banks.com
• jenningsbet.com
• ladbrokes.com
• marathonbet.co.uk
• matchbook.com
• paddypower.com
• pinnacle.com
• sbobet.com
• skybet.com
• sportingbet.com
• spreadex.com
• stanjames.com
• sunbets.co.uk/
• totesport.com
• unibet.co.uk
• vernons.com
• williamhill.com
• sports.winner.co.uk
I understand that the websites that I will be scraping will constantly be updating. Therefore, the successful applicant will probably get more work in the future as the feeds break down or as new betting websites are created.
The code in this project should be openly accessible to me, i.e. no passwords or similar should be used to restrict access. Unfortunately the last person to attempt this project did not complete it but some of the code that was written you can find the in the attached spreadsheets. I don't know whether it will be of use.
I would prefer for this to be written in VBA. I understand this is not the best code for scraping but I would like i to be fully contained within excel as I'm more comfortable with that software.
Please look through the attached spreadsheets before applying for this job. They show the basic process I would like. At its most simple, I would like all data to be imported into something like the "Golf Odds Scraper Example". This data should be cleansed at that point before the data for a specific tournament can be exported in to a grid comparison in a spreadsheet like "Tournament Export Example". Basically, the end product should look something like www.oddschecker.com/golf/the-memorial-tournament/winner and all the other markets but I don't want to be reliant on them mapping markets so I want software to do it directly with the bookie websites mentioned above.
If you need any more info then please don't hesitate to contact me
Steven H.
100% (67)New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
Hi Steven,
Thanks for the posting.
Could you please send me reference website, your budget and time-frame.
Regards
-
Hi Steven
Are you serious about this? From a rough calculation you have 27 sites feeding 41 sheets. Each of those combinations may require their own scraping code. Then on top of that you want the data "normalised/harmonised" so different data representations are turned into comparable values. As you mention some of the sites will from time to time change their page layout potential requiring an underlying change. Also some sites may not be hospitable to being scraped!
Whats your budget as this on the face of it will be a long and costly job.
Best Regards
Dave Gregory