Post Project
  • Search
    • Buyers can
    • Search offers to buy now
    • Search freelancers to request a proposal
    • Freelancers can
    • Search projects to quote on
  • How it works
  • Log in
  • Sign up
  • Freelancer?
Browse by Category
    Technology & ProgrammingWriting & TranslationDesignDigital MarketingVideo, Photo & ImageBusinessMusic & AudioMarketing, Branding & SalesSocial Media

    Scrapy webscrape spider

    - or -

    Post a project like this
    1205
    $175
    • Posted: 3 years ago
    • Proposals: 8
    • Remote
    • #2829654
    • PRE-FUNDED
    • Awarded
    Marwen B.
    Marwen B.
    Software and web engineer
    Germany Dresden
    Hasan M.
    Hasan M.
    Top Cert WordPress|Shopify|Wix|PHP|React JS|Full Stack| Developer | Mobile Apps
    Top Endorsed
    Romania Bucharest
    SARAVANAKUMAR K.
    SARAVANAKUMAR K.
    Virtual Assistant, Web Scraping, Data Mining, Python Bot creation, Data Entry, Photoshop
    India Salem
    Kbizsoft Solutions
    Kbizsoft Solutions
    UK No. 1 Web and Mobile App Programming Experts
    Top Endorsed
    United States Bay Minette
    Hoang D.
    Hoang D.
    Web Scraping, Data Extracting, Data Analysis with Python
    Top Endorsed
    Singapore Singapore
    Amit U.
    Amit U.
    Web Scraping/Extraction and Automation Engineer
    Top Endorsed
    Canada London
    Oleksandr H.
    Oleksandr H.
    Experienced WordPress / Shopify / Vue / Python Developer
    Top Endorsed
    Russian Federation Michurinsk
    Pavel G.
    Pavel G.
    Web Developer
    Russian Federation Yekaterinburg
    7423832062946211494724990062746752379045340821504192410
    Marwen B.
    Hasan M.SARAVANAKUMAR K.Kbizsoft SolutionsHoang D. + 3 others have already sent a proposal.
    • 3
    • 3

    Description

    Experience Level: Intermediate
    ** Do not send me a generic, automated response and I will automatically decline it. ** Best response is to send me a couple relevant projects where you have used scrapy.

    Develop a python script using scrapy 2.1 to crawl and scrape

    for fiscal years beginning in 2015 in the Annual Index to the right of the page.


    Click through that link and you see a Fast Facts tab and a Highlights tab, capture that text (don't need the images). Always capture the unique links associated with reports so we can easily get to the original website url needed. Some have podcasts which we don't need to download but capture the URL to the podcast, if one exists.


    Similarly, this same process for each of the months listed on the page. Be sure to note any duplicates (links in months that are duplicated in the years). The numbers should be your key like GAO-19-539.



    For reports that have a recommendation (indicated by a Y in the above index file), there is a second csv file of this nature:
    sequence number (key to the index file above), report number, recommendation number, priority flag, recommendation, agency affected, status, comments

    Some recommendations are "priority recommendations".
    The priority flag I mention is set to Y if it's a priority recommendation. Using this example, I'd see something like this in the file


    I'm going to leave the budget open. But I do not expect this to be very expensive. I would like to see initial results in two days, a test run with just a couple of records, for me to evaluate and comment on. I'll likely award within 2-3 days.
    Herschel C.
    Herschel C.
    100% (57)
    Projects Completed
    53
    Freelancers worked with
    45
    Projects awarded
    43%
    Last project
    12 Apr 2022
    United States

    New Proposal

    Login to your account and send a proposal now to get this project.

    Log in

    Clarification Board Ask a Question

      There are no clarification messages.
    1205
    $175

    - or -

    Post a project like this
    Herschel C.
    Herschel C.
    100% (57)
    Projects Completed
    53
    Freelancers worked with
    45
    Projects awarded
    43%
    Last project
    12 Apr 2022
    United States

    Related project Searches


    automation comma separated values(csv) development Python programming language scraping web

    Product

    • About
    • Team
    • Careers

    Support

    • How it works
    • Trust & Safety
    • Help Centre

    Discover

    • GuidesStoriesNews

    Resources

    • Customer Stories
    • Business Cost Calculator
    • Startup Cities

    Browse

    • Freelance Services
    • Freelance Services By Country
    • Freelance Skills
    • Terms
    • Privacy
    • Sitemap
    • Company Details
    • © 2023 People Per Hour Ltd
    We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
    Cookie Settings
    Accept All Cookies