Scrape and Parse a Lyrics Website

  • Posted:
  • Proposals: 9
  • Remote
  • #663805
  • Expired
Alexey S.Mandip S.Advanced W.SUMIT A.Daluyar H. + 4 others have already sent a proposal.
  • 2

Description

Experience Level: Expert
General information for the website: Scraping and Parsing
Description of requirements/features: IMPORTANT! I will not respond to any form letters or lists of projects that you've done in the past. What I specifically want to see from any proposals is your approach to the problem. Anything other than that will be ignored.

I'm looking for someone to write a script to scrape a lyrics website and parse it into the following three tables:

SONGS TABLE
id
Artist - Artist whose song it is
Song Title - Title of the song
Featured Artists (comma-separated) - All the featured artists on the song
Number of Verses - Number of Verses in the song
Verse IDs - ids of the verses from the verse table in order of appearance on the song
Album - Name of the Album the song Appears on

VERSE TABLE
id
artist - Artist name
verse - Text of the verse
order - Order number of the verse in the song

ALBUMS TABLE
id
artist
song IDs - list of song ids in order that appear on the album

The site in question is a combination of text files and web pages. It has formatting, to delineate these data points, but it can be inconsistent. Your deliverables will be:
- The database in MySQL format
- A script that crawls and scrapes the site and updates the database whenever new lyrics are added. It should be smart enough to crawl the new lyrics page and identify what hasn't been downloaded in the past.

IN YOUR PROPOSAL - Let me know how you might approach this project and I'll respond with the website in question. The person with the best approach will be chosen.
Extra notes:

New Proposal

Create an account now and send a proposal now to get this job.

Sign up

Clarification Board Ask a Question

    There are no clarification messages.