I need a script to scrape Google Popular Times

  • Posted:
  • Proposals: 2
  • Remote
  • #1530877
  • Expired
Yavor D.Tushar G. have already sent a proposal.
  • 2

Description

Experience Level: Intermediate
General information for the business: N/A
Description of requirements/functionality: I'm doing a data analysis project, and I would like a script to scrape the 'Popular Times' section from a google places search.

The final script should be called from the command line(terminal). The script should take as an input argument a URL and return the following items in JSON form:

'Popular Times'
Monday - the numeric height of the bar (as given in the source) for each every available hour.
Tuesday - the numeric height of the bar (as given in the source) for each every available hour.
Wednesday - the numeric height of the bar (as given in the source) for each every available hour.
Thursday - the numeric height of the bar (as given in the source) for each every available hour.
Friday - the numeric height of the bar (as given in the source) for each every available hour.
Saturday - the numeric height of the bar (as given in the source) for each every available hour.
Sunday - the numeric height of the bar (as given in the source) for each every available hour.

Live Popularity - the numeric height of the pink bar

Opening Times:
Monday - Friday: the opening times for everyday of the week as listed

Address: the address as shown in the attached image.

General Information: description etc etc, as shown in the attached image.

Timestamp: the time and date of the script execution

Missing data: I understand that not all google places data contains each of the following fields. The script should be able to handle missing information. If a place is encountered with missing information, the missing fields should be tagged in the 'Missing Data' variable. If no fields are missing, please return NULL or NA for this field.

Preferably the script will be written in Python (because I can code in this language), but any other languages could also be accepted. Any packages or libraries used should be documented.

I've tried to write the requirements as clearly as possible, but I understand that there might be things that I've missed or additional requirements that I have not thought of. If you feel like there is a better/alternative method, please contact me and we can discuss the changes. Please comment the code as is standard practice and provide documentation for each function.

Example URL:
https://www.google.de/webhp?sourceid=chrome-instant&ion=1&espv=2&ie=UTF-8#q=fernsehturm+berlin
Specific technologies required: Python, Javascript
OS requirements: Mac OS, Linux
Extra notes: I'm doing a data analysis project, and I would like a script to scrape the 'Popular Times' section from a google places search.

The final script should be called from the command line(terminal). The script should take as an input argument a URL and return the following items in JSON form:

'Popular Times'
Monday - the numeric height of the bar (as given in the source) for each every available hour.
Tuesday - the numeric height of the bar (as given in the source) for each every available hour.
Wednesday - the numeric height of the bar (as given in the source) for each every available hour.
Thursday - the numeric height of the bar (as given in the source) for each every available hour.
Friday - the numeric height of the bar (as given in the source) for each every available hour.
Saturday - the numeric height of the bar (as given in the source) for each every available hour.
Sunday - the numeric height of the bar (as given in the source) for each every available hour.

Live Popularity - the numeric height of the pink bar

Opening Times:
Monday - Friday: the opening times for everyday of the week as listed

Address: the address as shown in the attached image.

General Information: description etc etc, as shown in the attached image.

Timestamp: the time and date of the script execution

Missing data: I understand that not all google places data contains each of the following fields. The script should be able to handle missing information. If a place is encountered with missing information, the missing fields should be tagged in the 'Missing Data' variable. If no fields are missing, please return NULL or NA for this field.

Preferably the script will be written in Python (because I can code in this language), but any other languages could also be accepted. Any packages or libraries used should be documented.

I've tried to write the requirements as clearly as possible, but I understand that there might be things that I've missed or additional requirements that I have not thought of. If you feel like there is a better/alternative method, please contact me and we can discuss the changes. Please comment the code as is standard practice and provide documentation for each function.

Example URL:
https://www.google.de/webhp?sourceid=chrome-instant&ion=1&espv=2&ie=UTF-8#q=fernsehturm+berlin



New Proposal

Create an account now and send a proposal now to get this project.

Sign up

Clarification Board Ask a Question

    There are no clarification messages.