
Scraping Projects
Looking for freelance Scraping jobs and project work? PeoplePerHour has you covered.
Data Scraping for Tax Data Hub Website
I need someone to scrape data from a specific website for 9 counties. Your task is to extract all data for each county, filter municipalities, and save it as a separate CSV file per county. Important: Do not use the Excel download option as it corrupts the data. The data must be scraped directly in raw CSV format. Requirements: - Experience with web scraping and data extraction. - Ability to deliver raw CSV files without formatting issues.
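As a rough illustration of the "separate CSV file per county" deliverable, the sketch below groups rows that have already been scraped and renders one CSV text per county with Python's standard `csv` module. The function name `rows_to_csv_per_county` and the field names (`county`, `municipality`, `parcel_id`) are assumptions made for the example; the real columns depend on the target website.

```python
import csv
import io
from collections import defaultdict


def rows_to_csv_per_county(rows, county_field="county"):
    """Group already-scraped rows by county and render one CSV text per county."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row[county_field]].append(row)

    csv_by_county = {}
    for county, county_rows in grouped.items():
        buffer = io.StringIO()
        writer = csv.DictWriter(
            buffer,
            fieldnames=list(county_rows[0].keys()),
            lineterminator="\n",
        )
        writer.writeheader()
        writer.writerows(county_rows)
        csv_by_county[county] = buffer.getvalue()
    return csv_by_county


# Hypothetical sample rows; values stay as strings so leading zeros survive.
sample_rows = [
    {"county": "Adams", "municipality": "Northtown", "parcel_id": "001"},
    {"county": "Baker", "municipality": "Southville", "parcel_id": "002"},
    {"county": "Adams", "municipality": "Eastfield", "parcel_id": "003"},
]
csv_files = rows_to_csv_per_county(sample_rows)
```

Each value in `csv_files` could then be written to its own file, for example `Adams.csv`. Keeping every value as text is one way to avoid the kind of reformatting the post attributes to the Excel download.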
a day ago | 45 proposals | Remote
Scraping email addresses
The Excel sheet lists 1,158 companies (not people) on sheet 3, 'Key companies'. If you can manipulate Excel, you can also get the names of the people at each of these companies. How much would it cost to get emails for this list of 1,158? Obviously it will be much better if we have john.smith@xxxx rather than info@xxxx.
10 days ago | 55 proposals | Remote
Products Data Scraping
An experienced data engineer is sought to build a product data scraping application. The web scraper will harvest key details like product names, descriptions, images and pricing from various e-commerce websites. The scraped structured data needs to be normalized and stored in a database. Tools like Python and libraries such as BeautifulSoup and Scrapy should be used to automate the scraping process. The candidate must have strong knowledge of web scraping techniques and of building robust data pipelines. Prior experience developing similar commercial-grade web scrapers and handling large scraped datasets is essential. Companies looking to leverage competitor pricing and listings online will benefit from this data harvesting solution.
4 days ago | 29 proposals | Remote
Scraping Tool
*MUST QUOTE: I WANT THIS JOB AT TOP OF THE APPLICATION*
Hi, I am looking for a developer who can create a scraping tool for https://landlordregistrationscotland.gov.uk/. I want to be able to put in a postcode and then get all the landlords and relevant information in a downloadable CSV. Thanks.
*Holding budget
19 days ago | 32 proposals | Remote
opportunity
Python Script for Email Scraping
I'm looking for a Python expert to create a web scraping script for me. The script needs to extract emails, currency, and language from the site https://www.merchantgenius.io/. Requirements: - The extracted data should be saved into an Excel file. - The script does not need to handle website login or authentication. - Checking for duplicate emails is not necessary. If this project goes well, I have around 50 more similar websites and would be interested in hiring you for a more permanent role.
22 days ago | 84 proposals | Remote
Scrape me 600 general email addresses from domain list I provide
I need around 600 email addresses from the attached list. The list provides the domain names. You visit each domain and look for info@ or service@ addresses (no personal email addresses). The price of 1 Euro is symbolic; we don't work with budgets, so please just make a bid.
11 days ago | 80 proposals | Remote
Octoparse Web Scraping Expert for Real Estate Data
I am looking for a highly experienced freelancer who can build a robust and repeatable web scraping framework using Octoparse (or a similar platform) to help us collect property data from three major UK listing platforms:
- Rightmove.co.uk
- Zoopla.co.uk
- OpenRent.co.uk
The scraping setup should follow specific parameters and export new listings daily to a Google Sheet, appending new data without replacing existing entries.
For each property that meets the criteria, the following data should be scraped:
- Website link to the listing
- Name of the lettings agent
- Property address as on the website
- Marketing price
- Agent’s email address (by searching the agent's website)
Filtering Criteria (to be built into the scrape):
- Number of Bedrooms: Multiple searches based on presets (e.g. 2-bed, 3-bed, 4-bed, 5+ bed), each tied to its own maximum budget cap.
- Budget Ranges: Configurable per bedroom range (to be input by us via preset criteria).
- Proximity to Train/Overground Stations: Properties must be within 1 mile of a train or overground station (use of geolocation required, not just keyword mention).
- Distance from Central London: Must fall within a maximum 20-mile radius from Central London (this should be applied strictly using each platform’s area filters or via Octoparse logic).
Output Details:
- Data should be exported to a Google Sheet, which appends new listings daily without overwriting or duplicating existing entries.
- Manual export is fine for now, but automated export (if possible within Octoparse free version) is preferred.
Deliverables:
- Fully configured scraping tasks in Octoparse for all 3 platforms
- Setup of multiple preset searches (based on bedroom/budget combinations)
- Tutorial (video or written guide) showing how we can adjust budgets, bedroom filters, and radius in the future
13 days ago | 31 proposals | Remote
Looking for a scraping service to upload products to Woocommerce
I am looking for a professional or team specializing in scraping to transfer the products from a website to my online store, which runs on the Woocommerce platform. The work includes:
- Extracting titles, descriptions, variations, prices and photos of the products from a specified website.
- Preparing and uploading the products directly to Woocommerce, ready for sale.
- Organizing the products into appropriate categories as needed.
- Ensuring that all images and data are uploaded correctly and without errors.
It is important that the professional has prior experience with scraping and with Woocommerce, and can guarantee the accuracy and quality of the uploaded content. Please state estimated delivery times and the cost of the service in your proposal.
15 days ago | 26 proposals | Remote
Want to get an email scraping job done.
We have approximately 12 thousand plus company names. We want authentic emails and company URLs extracted. Only those who are ready to extract the complete set of emails should contact us. Emails must be authentic.
a month ago | 33 proposals | Remote
opportunity
Expert Web Scraper & WordPress Developer
We seek an experienced developer to create a daily/on-demand web scraper for a car listing website and automate data import into a WordPress site. This is a long-term project with potential for permanent collaboration.

Key Responsibilities
1. Web Scraping
- Build a robust scraper to extract:
  - Listing Data:
    - Make
    - Model
    - Mileage
    - Colour
    - Specification (e.g., engine type, transmission, fuel type)
    - Registration year
    - Price
    - Location (city/town)
    - Seller type (private/dealer)
    - Contact details (if available)
    - Vehicle condition (new/used)
    - Features (e.g., navigation system, heated seats, parking sensors)
  - Images: High-resolution photos of the vehicle(s)
- Implement daily/on-demand scraping with:
  - Duplicate detection
  - Removal of orphaned listings (if source listing is deleted)
2. WordPress Integration
- Automatically import scraped data into WordPress:
  - Assign listings to user/dealer accounts (1:many relationships)
  - Handle user roles (standard users vs. multi-listing dealers)
- Ensure seamless synchronization between source and target sites
3. Maintenance & Optimization
- Address anti-scraping measures (e.g., CAPTCHA, IP rotation)
- Optimize performance for large datasets

Requirements
- Expertise in:
  - Web scraping (Python/Scrapy, BeautifulSoup, or similar)
  - WordPress development (custom plugins, REST API, user role management)
  - Database integration (MySQL)
- Proven experience with:
  - Handling dynamic/content-heavy websites (e.g., pagination, AJAX)
  - Automated data synchronization
- Familiarity with version control (Git)

Include in your proposal:
1. Examples of past scraping + WordPress projects
2. Brief outline of your approach for this project
3. Reference CS26MIL (applications without this will be rejected)

Why Join Us?
- Long-term collaboration with a growing UK tech company
- Remote work flexibility (team spans 5 locations)
21 days ago | 51 proposals | Remote
UK Cold Callers Wanted
My company is offering small businesses FREE website previews, no strings attached, and we need enthusiastic cold callers to help us reach companies who don’t yet have a website.
What You’ll Do
- Use our database to handle simple data entry: you'll have the list of companies we've scraped that don't have websites, with easy status management (e.g. called, converted, no answer).
- Pitch Our Offer: Explain that we have created a FREE website preview for them and simply need their email address to send over this personalised preview.
- Hand Over: Once you’ve secured the email, move on. That's it! No hard sales needed.
What I'm looking for:
- Excellent spoken & written English
- Confident, upbeat phone manner and a can-do attitude
- Self-motivated
- Basic computer skills (to log calls & capture emails)
5 days ago | 19 proposals | Remote
opportunity
Develop an AI Agent for Lead Gen
We are looking for a developer to build an AI agent that can be integrated with LinkedIn to support lead generation. The agent should be able to: Automatically connect with selected LinkedIn profiles based on defined filters. Send tailored connection messages. Follow up with short, predefined message sequences. Extract lead details and store them in a structured format. Once leads are collected, the agent must: Sync the data with Microsoft 365, including storing lead details in Outlook Contacts or SharePoint (depending on what's most efficient). Maintain a log of all interactions for review. Requirements: Strong experience with LinkedIn API or relevant scraping tools. Experience with AI agents Proficiency in integrating systems with Microsoft 365. Solid understanding of data privacy and LinkedIn’s usage policies. Ability to build a simple UI/dashboard to monitor and control the agent's activity. Deliverables: A working AI agent with LinkedIn integration. Connection to Microsoft 365 for lead import. Documentation for setup and use. We would also like to be able to edit/use this for different people, so an easy way to add this to various profiles ourselves. IP to be exclusive and owned by us. Please include examples of similar projects you’ve completed.
2 days ago | 30 proposals | Remote
Need PHP Edits on my Simple WordPress Plugin
Hello, I hired a developer some months ago to develop a WordPress plugin for me. She made the delivery and left. Now there are some problems with the plugin, and I am looking for someone who can help me fix them.
How the plugin works: it creates articles from given keywords and an RSS source using OpenAI ChatGPT, generating them in parts and assembling a long article with images at the end.
The problems:
- The developer filtered out all the necessary HTML tags (bold, italic, tables, charts, lists and other basic features) while writing articles, so the result is simple text with no rich elements.
- The developer used only GPT-4 Turbo, which is expensive, and I want to be able to use others too.
- The developer forgot to add Google Images, Bing Images and Yandex Images to search keywords and scrape related images.
- The developer built the plugin in such a way that, when we run it, it generates "/////////" codes in the prompts.
You will fix these problems. My budget is 20 USD.
13 days ago | 14 proposals | Remote
opportunity
AI Agent for Real Estate Agency
I need a developer to build an AI agent for my Dubai real estate agency startup. All trainable on my style.
Focused on:
• Off-plan developments and investments
• Secondary property sales and acquisitions
Tasks include:
• Posting daily on Instagram, Facebook, and LinkedIn in my style
• Engaging with comments and messages
• Sorting landlord lists into spreadsheets
• Uploading the lists to Pipedrive
• Sending emails and WhatsApp messages to contacts
• Using Sales Navigator to find and message leads
• Scraping online forums for prospects
It will offer landlords and investors valuations or listings for their existing property, and it will offer investors and potential investors a call with me to discuss opportunities, arranged using Calendly.
23 days ago | 67 proposals | Remote
Data Scraper to Collect Contact Details from Online Directories
We’re looking for a reliable data scraper to help us gather email addresses and contact details from several online directories we will provide. The collected data will be used to grow our mailing list for business outreach and marketing purposes. Project Scope: - Scrape contact details (name, email address, phone number if available, and any relevant business info) - We will provide a list of online directories for data collection - Goal: 3,000 valid and accurate contact details to start - Data to be organised and delivered in a clean, structured Excel or Google Sheets format To Apply: Please share: - Examples of similar work you've completed - Your estimated turnaround time for collecting 3,000 contacts - Any questions you may have about the project This could turn into an ongoing opportunity if the initial task is completed successfully.
19 days ago | 73 proposals | Remote
Product Professionals - Database
Looking to gather up 10,000 lines of data.
+ UK Based (Mandarin Speaking a bonus)
- Manchester (Mandarin Speaking a bonus)
- North West UK
- North UK
- England
- UK Wide
+ Job Titles
- Product Manager
- Lead Product Manager
- Chief Product Officer
- VP of Product
- DevOps Engineer (Must Speak Mandarin)
- Senior DevOps Engineer (Must Speak Mandarin)
- QA Tester (Must Speak Mandarin)
- LinkedIn URL
- Email Address if Available (not important)
Will only pay $0.01 per line as it is just LinkedIn scraping.
a month ago | 22 proposals | Remote
opportunity
Automating a shopify item creation upload file
I am looking to automate the process of uploading new item information into my Shopify store from supplier-provided files. Currently, I receive a CSV or Excel file from my various suppliers containing the item names, descriptions, affiliated picture URLs and other relevant product details. However, this format cannot be directly imported into Shopify for creating new item listings. I need a freelancer well-versed in web scraping and Shopify APIs to programmatically parse the supplied files and transform the item data within into the appropriate JSON format specified by the Shopify item creation API endpoint. The transformed data should then be uploaded in bulk to my store via the API to generate all the corresponding new product listings. Experience with Shopify and familiarity with common product data formats is essential. The goal is to develop a seamless process that can automatically ingest new item uploads from my suppliers and populate my online store in a structured and organized way without any manual data reentry.
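The transformation step described above could look roughly like the following sketch, which parses a supplier CSV and builds one JSON-ready payload per row. The supplier column names (`name`, `description`, `image_urls`) and the `|` separator for picture URLs are assumptions made for the example. The payload keys (`product`, `title`, `body_html`, `images` with `src`) are meant to follow the product resource of Shopify's REST Admin API and should be checked against the API version actually in use. No upload call is made here.

```python
import csv
import io
import json


def supplier_row_to_product(row):
    """Map one supplier row (hypothetical columns) to a product payload dict."""
    raw_urls = row.get("image_urls", "")
    image_urls = [url.strip() for url in raw_urls.split("|") if url.strip()]
    return {
        "product": {
            "title": row["name"].strip(),
            "body_html": row["description"].strip(),
            "images": [{"src": url} for url in image_urls],
        }
    }


def parse_supplier_csv(text):
    """Parse supplier CSV text and return one payload per data row."""
    reader = csv.DictReader(io.StringIO(text))
    return [supplier_row_to_product(row) for row in reader]


# Hypothetical supplier file content.
sample_csv = (
    "name,description,image_urls\n"
    "Blue Mug,Ceramic mug,https://example.com/a.jpg|https://example.com/b.jpg\n"
)
payloads = parse_supplier_csv(sample_csv)
first_payload_json = json.dumps(payloads[0])
```

Each payload would then be sent to the store's product creation endpoint with an authenticated request. Excel files from suppliers would need a separate reader before reaching `supplier_row_to_product`.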
23 days ago | 45 proposals | Remote
AI expert to create some scraping and automation.
Specifications: Automation by AI
I am looking for an AI automation expert to help the agency automate the price benchmarking of a training course against the competition.
• Objective: Competitive comparison of the different training courses.
• Process:
  • Create a user interface based on Python.
  • Use a web crawler and an AI agent for web scraping of the sites in question.
  • Compare about 47 competitors (to be defined beforehand).
  • Source new competitors (difficult to set up in this project alone).
  • Automate creation of the Excel or Matplotlib document to display the graphs in Excel.
• Necessary data: Comparison variables and desired formatting.
• Output: Excel file (.xlsx format).
We will give you the list of competitors. Each competitor's website has training pages, and you need to scrape the following data as specified in the prompt:
• Link to the training
• Training title
• Duration
• Inter-company price
• Intra-company price
a month ago | 22 proposals | Remote
opportunity
AI Engineer
I'm looking to develop an AI assistant that can construct digital versions of people. These avatars should be realistic, mimicking real humans in appearance and behaviour. The AI needs to enable these avatars to: - Change outfits: Users should be able to select from a variety of clothing options. - Walk around: The avatars should have the capability to walk within the chosen background. - Talk naturally: The avatars need to have a custom voice and be able to communicate in a realistic manner. - Act as brand ambassadors: The avatars should be able to showcase products in a convincing manner. The system should be user-friendly, allowing individuals to simply upload a photo or provide a description of their desired avatar. Users can choose different styles (with the primary focus being a realistic style), select outfits, backgrounds, and create a custom voice. The AI should also have the capacity to remember user preferences and previous avatars. The output of the system should be versatile, generating content in the form of: - Videos - GIFs - Images All of which can be used for presentations, social media, or marketing. The central concept of this project is to create adaptable digital humans that appear authentic and can perform various actions on command. This would essentially empower anyone to create their own virtual spokesperson or character without needing advanced technical knowledge. Ideal skills and experience for the job include: - Proficient knowledge of AI and machine learning - Experience in creating realistic digital avatars - Understanding of user-friendly interface design - Capability in developing versatile output formats - Expertise in voice modulation technology. To clarify: -I’m aiming for ultra-realistic human-like avatars, similar to MetaHuman or Synthesia quality — with facial expressions, body motion, and physics-aware interaction. 
-Users should be able to generate avatars using: Selfies or facial photos (the system should complete the rest of the body using traits like age, gender, skin tone, etc.) -Text prompts or even inspirational reference images for pose/style guidance For now, let’s go with a web-based platform that works smoothly on mobile browsers. We can consider an app version later. Business overview: This project envisions a groundbreaking AI platform that enables users — from creators and marketers to everyday storytellers — to generate high-quality, original digital content by intelligently analyzing existing media and transforming it into fresh, personalized output. At the core of the system is a next-gen AI avatar engine that creates ultra-realistic digital humans from user inputs. These avatars are not generic — they’re deeply personalized, generated through text prompts, selfies, or photo uploads, and enhanced with advanced facial recognition to extract identity features. When a full-body image isn’t provided, the system intelligently maps the user's face onto contextually appropriate body models based on parameters like age, ethnicity, gender, height, and skin tone. The platform also allows users to upload inspirational images, which the AI uses to influence styling, posture, clothing, and motion aesthetics — making the avatar truly reflective of the user’s vision or brand. Every avatar is 360-degree, physics-aware, and capable of natural gestures, motion, and lip-sync across multiple languages and emotional tones. Users can choose from a wide range of AI-generated voices, regional accents, or even cloned voices. In parallel, the system features a media intelligence layer that scrapes and deconstructs any type of input — videos, reels, images, or articles — analyzing structure, tone, gestures, content themes, and model behaviours. 
Instead of copying, the AI recreates new scripts and visual sequences inspired by the source — ensuring originality while retaining the creative intent. The result is a seamless, highly realistic video or media asset — fully customizable with dynamic backgrounds, lighting, and formats — ready for publishing on Instagram, YouTube Shorts, advertisements, presentations, or digital platforms. This AI system transforms the way content is conceived and created, offering a powerful tool to bridge the gap between inspiration and production. With no need for cameras, actors, studios, or complex editing software, anyone can bring their ideas to life — faster, smarter, and more beautifully than ever before.
6 days ago | 22 proposals | Remote
Past "Scraping" Projects
I need to scrape web page data
I need someone to scrape some web page data.