Scraping information from Soccerway and turning into database format
- or -
Post a project like this- Posted:
- Proposals: 5
- Remote
- #1177037
- Completed
Description
Kind of development: New program from scratch
Description of requirements/functionality: I would like to scrape specific details from matches on the soccerway website, regarding players. My database has one line per player per match. It records details such as their time on the pitch, how many goals were scored by the team during that time, and how many by the player. It will also sum up the all of the minutes in which the said player scored (TG), how many of those goals were penalty's, if any of the goals were the first goal in the match (FGS), the last goal in the match (LGS), if they scored two or more in the match (Brace) and if they scored 3 or more in the match (Trick). I have attached my database (Data) which unfortunately i failed to maintain since late last year due to time restraints, hence I am wishing to do this now. In the past I would copy and paste the a match per sheet in excel and then run a previously made tool that would convert the data. I have also included an example of the pasted data from the website, as if it would be easier to create a tool that would just simply take the data to a spreadsheet and then I can run my own tool to convert it, that would suffice.
OS requirements: Windows
Daniel D.
0% (0)New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
Hi Daniel, Could you please confirm that you would rather have the pasted to an Excel sheet? And also important as well how would you like to be able to specify what matches data are you after? Cheers, Tom
Daniel D.03 Jun 2016Ultimately I would prefer a tool that would scrape and convert all data to database format, but when having my previous tool built there were some issues where some matches do not have shirt numbers for players, more than one manager etc which changed the formatting slightly. The tool was built around all of these potential hiccups in the end and it can now manage the data regardless, so just having it pasted and ran through my tall seems much easier. I think perhaps selecting a date range and list of League names would make most sense. There are however occasionally matches that have no data on the players at all, so this would need to be able to ignore those matches.
Thanks for your questions