
Data Extraction Projects
Looking for freelance Data Extraction jobs and project work? PeoplePerHour has you covered.
Building database of owners by web scraping
I have been working with DeepSeek to extract data from the website tuscasasrurales.com. The data I need is shown in the attached file Granada.png. Some of this data requires a link to be clicked, as shown in the uploaded file Tuscasasrurales.png. Email addresses have to be obtained by visiting the owner's website (if there is one). Where no data is available, leave the field blank. I have been trying to extract the data province by Spanish province. To get a list by province, enter the site and type the name of the province in the search box, e.g. Granada, which returns 304 entries. Although DeepSeek was unable to get all the info I wanted, it has given me a Python script which will do the job; I have uploaded this. It does not include the fields Bedrooms and Bathrooms, which I would also like included. Can you do this work, and how much would you charge? There are initially 10 provinces with an average of +/- 200 entries in each. Thanks - Allan
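For bidders estimating the work: the core pattern is to fetch each province's result pages, pull the listing links, visit each listing (and its clicked-through detail view) for the extra fields, then visit the owner's website for the email. A minimal standard-library sketch of the link-extraction step follows; the `listing-title` class name and the sample markup are assumptions for illustration, not taken from tuscasasrurales.com, and a real scraper should check the site's actual markup and terms of use.

```python
from html.parser import HTMLParser

class ListingLinkParser(HTMLParser):
    """Collect (title, href) pairs from anchors carrying an assumed listing class."""

    def __init__(self, listing_class="listing-title"):
        super().__init__()
        self.listing_class = listing_class
        self.results = []
        self._href = None   # set while inside a matching <a> tag
        self._text = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and self.listing_class in attrs.get("class", "").split():
            self._href = attrs.get("href", "")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.results.append(("".join(self._text).strip(), self._href))
            self._href = None

# Hypothetical markup for one search-result card:
sample = '<div class="card"><a class="listing-title" href="/casa/1">Casa Rural El Molino</a></div>'
parser = ListingLinkParser()
parser.feed(sample)
# parser.results → [('Casa Rural El Molino', '/casa/1')]
```

Each collected href would then be fetched in a second pass to fill the per-property fields, including the Bedrooms and Bathrooms the existing script is missing.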
4 hours ago · 37 proposals · Remote
Set up a connection between Hoowla and Power BI
Integrate Hoowla with Power BI to enable full data extraction and reporting. Use the provided Hoowla API documentation to authenticate, retrieve all available endpoints and datasets, and implement efficient data ingestion into Power BI. Deliver a reusable, documented solution using Power Query (M), REST connectors, or a custom data connector as appropriate. Ensure incremental refresh capability, error handling, data transformation, and clear mapping of Hoowla fields to report-ready tables. Provide deployment and brief usage instructions.
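Whichever connector style is chosen, the ingestion side usually reduces to a pagination loop that keeps requesting until the API signals the last page. The sketch below shows the shape; the `{"items": ..., "has_more": ...}` envelope is an assumed response format for illustration — the real field names must come from the Hoowla API documentation, and in Power Query (M) the same loop is typically written with `List.Generate`.

```python
from typing import Callable, Iterator

def fetch_all_pages(fetch_page: Callable[[int], dict]) -> Iterator[dict]:
    """Yield every item from a page-numbered REST API.

    `fetch_page(page)` must return a dict like {"items": [...], "has_more": bool};
    that envelope is an assumption, not Hoowla's documented format.
    """
    page = 1
    while True:
        payload = fetch_page(page)
        yield from payload.get("items", [])
        if not payload.get("has_more"):
            break
        page += 1

# Exercising the loop with a stand-in for the real HTTP call:
def fake_fetch(page):
    pages = {
        1: {"items": [{"id": 1}, {"id": 2}], "has_more": True},
        2: {"items": [{"id": 3}], "has_more": False},
    }
    return pages[page]

rows = list(fetch_all_pages(fake_fetch))  # three items across two pages
```

Incremental refresh would then filter each request by a modified-since parameter (if the API offers one) rather than reloading the full history on every refresh.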
7 days ago · 17 proposals · Remote
Integrate 10 APIs into 3 different categories on WordPress
Integrate 10 APIs across 3 different categories into my WordPress website, without the data slowing down the website or going into the wrong category. Please confirm how much you will charge for the entire integration and whether you will be using a plugin or fully coding. Thank you for your interest in this project. I need someone who can help integrate APIs.
1. The APIs, when integrated, should pull the data and store it on my web platform so that it is easy for visitors to access without having to request it every time.
2. The extracted data would be displayed via the WordPress Directorist plugin. Whether you are using code or a plugin, please let me know. Importantly, the data pulled should display:
I. The title
II. The overview (the initial part of the job information)
III. More information (continuation of the extracted information)
IV. A link to the job post to apply on
V. The logo, if available; if not, a default from file should be loaded
VI. Only remote jobs in some twenty-something categories should be pulled
3. When visitors follow a pulled listing to the originating website, it should not require a login to view or apply, nor lead to a collection of other jobs, but to the job post alone.
What is your best rate for the 10 APIs? Here are some I would like you to start with:
https://rapidapi.com/Pat92/api/jobs-api14
https://rapidapi.com/fantastic-jobs-fantastic-jobs-default/api/active-jobs-db
Online learning portals, Greenhouse, and other remote job boards: please let me know which you have successfully integrated and can do with the Directorist plugin on a WordPress website. Can you also work with XML feeds? If yes, please let me know your cost. If you can create a bot scraper, how much? How many days will you need to get this done? Thanks.
5 days ago · 17 proposals · Remote
Extract blood test data from PDF documents that have been OCR'd
The objective is to build a structured blood test database that allows pathology results to be viewed, edited, filtered, and exported to Excel via a web-based HTML interface. The system stores results in a clean, standardised format so trends can be analysed accurately over time.

Using AI-assisted OCR, I have built a local Python extraction pipeline that converts PDF pathology reports into machine-readable text and inserts structured data into a SQLite database. The majority of blood tests extract correctly, including canonical test name, result value, unit, and reference range. However, I have hit a specific technical issue with three markers:
• CRP (C-reactive protein)
• ESR
• GLU (Glucose)
The OCR output clearly contains the correct lines, and debug logs confirm they are processed. Yet no rows are inserted for these markers. The failure appears to occur somewhere between canonical matching, numeric extraction, and validation logic.

Current System Architecture
The system runs locally and consists of:
• extraction_core_2.py (main engine)
• Supporting modules for OCR preprocessing, lab dictionary building, regex matching, and validation
• SQLite backend
• Schema-driven canonical lab dictionary
• Controlled fuzzy fallback logic
• HTML viewer for results display and Excel export

Pipeline flow:
1. Convert PDF to image (pdf2image)
2. Preprocess
3. Run Tesseract OCR
4. Clean and normalise text
5. Match against canonical lab dictionary
6. Extract: canonical test name, numeric result, unit, reference range
7. Validate
8. Insert into SQLite
The engine is deterministic and rule-based.

The Specific Problem
Example OCR line: CRP H 5.2 mg/L 0-5
OCR text is correct. NUMBER_PATTERN matches. The canonical dictionary contains the test. Yet:
Inserted 0 rows from 0126251OrderReport_23B00006604_CRP.pdf
Likely failure points include:
• Canonical containment match failing due to normalisation
• Flag tokens ("H", "L") interfering with numeric capture
• Numeric extraction anchored incorrectly
• Validation rejecting due to strict range formatting
• Unit pattern mismatch (e.g. mmol/L)
• Dictionary indexing issue
• Match overridden by another lab name
• Guard conditions too strict
If validation fails, the row is rejected silently. All other panels extract correctly; the issue appears isolated.

What Is Required
This is not a rebuild. We do not want:
• Re-architecture
• Experimental AI guessing logic
• Large-scale changes
• Expanded fuzzy matching
We need:
1. Precise diagnosis: identify exactly where CRP, ESR, and GLU are failing insertion and which rule is causing the rejection.
2. Minimal safe fix: a targeted correction that adjusts canonical matching if required, anchors numeric extraction correctly, allows flag tokens without blocking capture, relaxes only the necessary validation checks, and preserves deterministic behaviour.
3. Zero regression: no impact on currently working panels, no performance degradation, no uncontrolled fuzzy expansion.
4. Modular implementation: if appropriate, implement as a small isolated module or cleanly adjust the matching block. The existing architecture should remain intact.

Constraints
The system is designed to be deterministic, schema-driven, reproducible, and forensic-grade. We cannot introduce probabilistic or unpredictable behaviour.

Longer-Term Goal
After stabilising extraction: migrate to web deployment, enable structured uploads, add trend analysis, and later incorporate AI-assisted interpretation. The immediate priority is to stabilise deterministic extraction for CRP, ESR, and GLU without breaking the existing engine.

Materials Provided
Uploaded:
• Full extraction_core_2.py (text format)
• Screenshot of HTML viewer
• Sample PDF files
• Export showing required output
Additional materials available on request:
• Sample OCR blocks
• Canonical dictionary entries
• Regex patterns
• Validation logic
• Database schema
• Debug logs
This is a focused debugging and refinement request. I have spent many hours attempting to isolate the issue and now require an experienced developer to identify the blocking condition and implement a practical fix. I have been advised this should take 1–2 hours for a senior developer. Looking for a swift turnaround.
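No remote diagnosis is possible without extraction_core_2.py, but the "flag tokens interfering with numeric capture" bullet is a common culprit: a pattern that expects the number immediately after the test name fails on "CRP H 5.2 mg/L 0-5". Purely as an illustration of the minimal, deterministic kind of fix being requested — the pattern below is a hypothetical sketch, not the project's real NUMBER_PATTERN or field layout:

```python
import re

# An optional (?:[HL]\s+)? group lets an H/L flag sit between the test name
# and the result without loosening anything else in the pattern.
LINE_PATTERN = re.compile(
    r"^(?P<name>[A-Za-z][A-Za-z \-]*?)\s+"             # canonical test name
    r"(?:(?P<flag>[HL])\s+)?"                          # optional H/L flag token
    r"(?P<value>\d+(?:\.\d+)?)\s+"                     # numeric result
    r"(?P<unit>[A-Za-z0-9%/^]+)\s+"                    # unit, e.g. mg/L, mmol/L
    r"(?P<range>\d+(?:\.\d+)?\s*-\s*\d+(?:\.\d+)?)$"   # reference range
)

def parse_line(line):
    """Return (name, value, unit, range) for one OCR result line, or None."""
    m = LINE_PATTERN.match(line.strip())
    if not m:
        return None
    return (m.group("name").strip(), float(m.group("value")),
            m.group("unit"), m.group("range").replace(" ", ""))

parse_line("CRP H 5.2 mg/L 0-5")       # → ('CRP', 5.2, 'mg/L', '0-5')
parse_line("GLU 5.4 mmol/L 3.9-5.8")   # → ('GLU', 5.4, 'mmol/L', '3.9-5.8')
```

Because the flag group is optional and captured separately, lines without a flag still match, which is the "zero regression" property the brief asks for.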
22 days ago · 21 proposals · Remote
Data engineer
We are seeking an experienced Data Engineer to help organize, clean, and structure complex real estate and regulatory compliance data across multiple sources. This role focuses on transforming inconsistent datasets related to leases, occupancy, tenants, and rent information into a reliable and scalable data foundation. The ideal candidate will review existing data, identify quality issues such as duplication and missing fields, and design standardized schemas and relationships. You will build transformation workflows to clean and normalize data from spreadsheets, databases, and system exports. In this role, you will create master datasets for properties, units, households, leases, and compliance tracking while implementing validation rules and exception reporting. You will also document data definitions, mapping logic, and business rules to support transparency and long-term maintainability, while collaborating with stakeholders to translate operational requirements into structured data models. Strong proficiency in SQL and Python is required, along with hands-on experience in ETL/ELT workflows and relational data modeling. Experience working with messy, Excel-heavy datasets and building data quality checks is essential, and familiarity with tools like dbt, Airflow, or cloud platforms such as Snowflake or BigQuery is highly preferred. Success in this role means delivering a clear, consistent source of truth for lease and occupancy data, reducing inconsistencies, and preparing the data environment for reporting, automation, and future product development.
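The duplicate-and-missing-field review described above reduces to a small, testable routine regardless of which stack (dbt, Airflow, plain Python) is chosen. A standard-library sketch — the column names are assumed for illustration, not taken from the client's actual schema:

```python
from collections import Counter

REQUIRED = ("property_id", "unit", "tenant", "lease_start")  # assumed columns

def quality_report(rows):
    """Flag duplicate (property, unit) keys and blank required fields."""
    key_counts = Counter((r.get("property_id"), r.get("unit")) for r in rows)
    duplicates = sorted(k for k, n in key_counts.items() if n > 1)
    missing = [(i, field) for i, r in enumerate(rows)
               for field in REQUIRED if not (r.get(field) or "").strip()]
    return {"duplicates": duplicates, "missing": missing}

rows = [
    {"property_id": "P1", "unit": "A", "tenant": "Smith", "lease_start": "2024-01-01"},
    {"property_id": "P1", "unit": "A", "tenant": "", "lease_start": "2024-02-01"},
]
report = quality_report(rows)
# report["duplicates"] → [('P1', 'A')]; report["missing"] → [(1, 'tenant')]
```

In a dbt setup the same checks would typically be expressed as `unique` and `not_null` tests on the master datasets, with the exception rows routed to a report.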
5 days ago · 14 proposals · Remote
Research and data population
I'm looking for someone who can build me a targeted list of supporter groups, Facebook pages, TikTok accounts, etc. for a football team. This would involve collating name, bio, email, phone number, total followers, and social page links into a spreadsheet I'd provide (organisations and individual influencers linked to my particular team). I'm not sure exactly how many entries there will be; it's more of a research and population task, but I can provide full details on application. It needs to be thorough and well-researched. Please quote your hourly rate, and only apply if you can deliver this by Friday 27 March (AM). Maybe around 4 hours' work, but difficult to say; potentially more or less depending on how quick you are. Thanks, Harvey
4 hours ago · 17 proposals · Remote
Data Scraping
I am seeking an adept data scraper to extract comprehensive information from a specified Shopify blog and compile it into a structured spreadsheet for seamless import into a WordPress platform. The required data includes the original blog URL, publication date, URL of the article's image, article title, and the full content of each article. The task involves approximately 174 articles, necessitating meticulous attention to detail and accuracy. Your expertise in data extraction and formatting will be invaluable for this project. The URL is: https://kingsnqueens.com/blogs/news. Thank you for your interest!
24 days ago · 65 proposals · Remote
Need a plugin for WordPress
I need a plugin that can export orders from WooCommerce so that we can change quantities in Excel and import the file back, decreasing/increasing quantities and splitting orders if needed.
5 days ago · 59 proposals · Remote
PDF Scraping
Hi, I need to scrape Item# & HT# from the first table, and "Item and Piece# together" and HT# from the second table. I also need the Spool# given in the bottom right corner. A sample PDF is attached. There are around 600 pages. This is not a manual data entry job; I need coders to extract the data programmatically within a few hours.
a month ago · 22 proposals · Remote
AI / Machine Learning Engineer (LLM & Applied AI) – Remote (EU)
Responsibilities

AI / Machine Learning
• Build and deploy AI-powered applications using existing Large Language Models (LLMs).
• Design systems that ingest data, extract structured insights, and generate accurate outputs.
• Develop RAG pipelines, chunking strategies, and LLM orchestration workflows.
• Build tools for model training, evaluation, inference serving, monitoring, and alerting.
• Experiment with modern ML frameworks and open-source AI tools.

Software Engineering
• Develop scalable microservices that integrate AI models with production systems.
• Build APIs and backend services to process and manage AI-generated data.
• Work with modern programming languages such as Python, JavaScript, Go, or Rust.

Data Engineering
• Design pipelines to extract, transform, and load data from multiple sources.
• Clean, normalize, and validate datasets for model usage.
• Optimize data pipelines for reliability and performance.

Database & Infrastructure
• Design database schemas and optimize queries.
• Manage performance and scalability of data storage systems.
• Ensure AI infrastructure is production-ready and scalable.

Collaboration
• Work closely with product managers, engineers, and subject matter experts.
• Communicate technical challenges and solutions clearly.
• Help define best practices for AI system architecture and development.

Requirements
• Based in the European Union
• 8+ years of software engineering experience
• Strong experience with Python or JavaScript
• Hands-on experience with LLM APIs (OpenAI, Anthropic, or similar)

AI / LLM Experience
• Experience building RAG systems
• Knowledge of chunking strategies for LLM optimization
• Experience with LangChain, LangGraph, or similar orchestration tools
• Familiarity with AI monitoring, observability, and evaluation frameworks
• Experience building agent-based workflows or AI automation

Engineering Experience
• Experience building microservices and scalable systems
• Strong knowledge of data pipelines and ETL processes
• Experience designing and optimizing databases and data models

Additional Skills
• Strong understanding of ML concepts and NLP techniques
• Ability to work with ambiguous problems and rapidly evolving AI tools
• Experience with modern software development practices (Git, testing, CI/CD, code reviews)

Engagement Details
• Location: Remote (EU-based freelancers only)
• Contract: Freelance
• Availability: Part-time or Full-time
• Duration: Long-term collaboration possible

Nice to Have
• Experience building AI agents or multi-agent workflows
• Experience with evaluation frameworks for LLMs
• Experience deploying AI infrastructure in production environments
9 days ago · 17 proposals · Remote
Freelance Data Entry Clerk
Project Description: I am seeking a detail-oriented Freelance Data Entry Clerk. The ideal candidate will accurately input, update, and manage data across spreadsheets, databases, and CRM systems to ensure records are complete, organised, and error-free.
Key Responsibilities:
- Transferring information from physical records, PDFs, or audio files into spreadsheets, CRM systems, or databases.
- Updating existing records to ensure data is current and accurate.
- Meticulously reviewing data to identify and correct inaccuracies, missing information, or inconsistencies.
- Compiling and entering data from internal documents, reports, and provided source materials.
- Sorting and organising digital or physical documents for easy retrieval.
This is a freelance hiring opportunity, not an offer for permanent employment or outsourcing projects. If you are organised, detail-oriented, and experienced in data entry, I look forward to hearing from you!
5 days ago · 46 proposals · Remote
Marketplace Data Collection
We are looking for a freelancer who can quickly collect listings from several online marketplaces.
Task:
- Find 3,000 listings on the marketplaces listed below that meet our criteria.
- Add all listings to an Excel spreadsheet.
- You must be registered on the platforms to open contact details and include them in the Excel file.
Marketplaces:
- Osta.ee
- Skelbiu.lt
- eMAG
- Bazar
- Allegro
- OLX
Requirements:
- Attention to detail
- Ability to work quickly and accurately
- Registered accounts on the platforms to access contact details
Deadline: The work must be completed within 3 days.
Payment: Negotiable.
Important: We need someone who can start working immediately.
13 days ago · 25 proposals · Remote
Reformatting and cleaning data from an old CRM
I have several Excel spreadsheets with excess data which need cleaning and updating, then putting into a workable format.
9 days ago · 84 proposals · Remote
Aged Payables Summary page in Power BI
We provide financial reports to our clients at period end and are currently looking for help with a new report page to display an Aged Payables Summary in Power BI, similar to the format available in Xero. The report needs to be dynamic and respond to the period selected within the Power BI report. The data source is Xero: we use an ETL service to extract the data from Xero and load it into a SQL database, which then serves as the data source for the Power BI report. The report currently utilises the following Xero tables: Journal, Invoices, Accounts, Organisation, and Tracking Categories. We are looking for someone who has already worked with data from Xero or has created a dynamic Aged Payables Summary in Power BI. We would be happy to arrange a meeting to discuss the project requirements in more detail.
a month ago · 24 proposals · Remote
UK Crypto Tax reconciliation & data analysis
Description: We are a UK-based accountancy firm specialising in Crypto tax, and we are looking for an experienced Crypto Tax Data Analyst to support ongoing client work. This role is focused heavily on data analysis rather than traditional accounting.
Scope of work includes:
- Reviewing wallet and exchange data (CSV/API exports)
- Line-by-line transaction analysis
- Identifying and categorising taxable events under UK (HMRC) rules
- Reconciling discrepancies across wallets, exchanges, and DeFi activity
- Cleaning and structuring datasets for tax reporting
- Supporting preparation of outputs for final tax review
Typical clients include:
- High-volume traders
- DeFi users (staking, liquidity pools, bridging, etc.)
- NFT traders
- Individuals with complex multi-wallet activity
Requirements:
- Strong understanding of UK Crypto tax treatment (HMRC guidance essential)
- Proven experience using tools such as Koinly, Recap, CoinTracking or similar
- Ability to handle large datasets accurately and efficiently
- Strong analytical mindset and attention to detail
- Experience identifying errors, duplicates, missing cost basis, and incorrect classifications
Nice to have:
- Experience working with UK accountancy firms
- Familiarity with DeFi protocols and on-chain activity
- Basic Excel / data manipulation skills
Engagement:
- Ongoing work available for the right candidate
- Initially project-based, with potential for long-term collaboration
Trial Task (Important): Shortlisted candidates will be asked to complete a paid trial task. This will involve reviewing a sample dataset and:
- Identifying key issues (e.g. missing cost basis, incorrect classifications, duplicates)
- Providing a brief explanation of how you would resolve them
- Demonstrating your approach to structuring clean, usable data
This is a critical part of our selection process to ensure candidates can handle real-world Crypto data complexity.
To apply, please include:
- Examples of similar Crypto tax work you've completed
- Which software/tools you've used
- Your approach to handling messy or incomplete datasets
5 days ago · 5 proposals · Remote
UK Business Data Supplier
I am looking for a business data supplier. The data will be independent businesses: owner's name, business name, address, email, WhatsApp, and postcode. Please quote your price per 1,000, 10,000 and 100,000 records, plus turnaround time. If you can scrape any other information for direct marketing, please let us know, including LinkedIn and plastic card companies. Regards, Proactiv
15 days ago · 25 proposals · Remote
Design & Organization of Waste Collection Data Using Star Schema
Design and implementation of a Data Warehouse using a Star Schema to analyze waste collection operations. The project focuses on transforming raw operational data into structured dimension and fact tables to support data-driven insights and decision-making.
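As a concrete sketch of what "dimension and fact tables" means here, the following builds a tiny in-memory star schema and runs a typical aggregate across it. All table and column names (dim_date, dim_route, fact_collection, tonnes) are illustrative assumptions, not taken from the project brief:

```python
import sqlite3

# Star schema: two dimension tables describing when and where, and one fact
# table holding the measures, keyed by the dimensions.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dim_date  (date_id INTEGER PRIMARY KEY, day TEXT, month TEXT);
    CREATE TABLE dim_route (route_id INTEGER PRIMARY KEY, route_name TEXT, zone TEXT);
    CREATE TABLE fact_collection (
        date_id  INTEGER REFERENCES dim_date(date_id),
        route_id INTEGER REFERENCES dim_route(route_id),
        tonnes   REAL,
        stops    INTEGER
    );
""")
conn.executemany("INSERT INTO dim_date VALUES (?, ?, ?)",
                 [(1, "2024-03-01", "2024-03"), (2, "2024-03-02", "2024-03")])
conn.executemany("INSERT INTO dim_route VALUES (?, ?, ?)",
                 [(10, "North loop", "N"), (11, "South loop", "S")])
conn.executemany("INSERT INTO fact_collection VALUES (?, ?, ?, ?)",
                 [(1, 10, 4.2, 120), (1, 11, 3.1, 95), (2, 10, 4.0, 118)])

# Typical star-schema query: aggregate the fact table by a dimension attribute.
tonnes_by_zone = conn.execute("""
    SELECT r.zone, ROUND(SUM(f.tonnes), 1)
    FROM fact_collection f JOIN dim_route r USING (route_id)
    GROUP BY r.zone ORDER BY r.zone
""").fetchall()
# tonnes_by_zone → [('N', 8.2), ('S', 3.1)]
```

The ETL step that precedes this (loading raw collection logs into the dimensions and fact table) is where most of the project's transformation work would sit.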
9 days ago · 8 proposals · Remote
Property Research Assistant (UK Planning Data – Land & Barns)
Project Description: This role involves research only. No sales, negotiation, or brokerage activity is required. Research UK property listings to identify land, barns, or smallholdings with granted and active planning permission for holiday, glamping, or agritourism use.
Tasks:
- Search Rightmove, Zoopla, Plotfinder
- Verify planning via UK council portals
- Record accurate data in a spreadsheet
Output (per entry):
- Listing link
- Price
- Planning reference + confirmed active status
- Planning use
- Agent contact
- Brief notes
Requirements:
- UK planning portal familiarity preferred
- Only include properties with confirmed planning permission
Volume: 10–30 entries initially (ongoing possible)
Payment: Per valid entry or fixed batch (agreed)
Start: Immediate
5 days ago · 11 proposals · Remote
Form production, which can be via a Microsoft app or similar
We are a construction company based in the UK. I would like to design the following form types under one system/software:
- An induction form for when first entering a construction site
- Signing-in forms for daily contractors to use when they turn up at and leave each site; these to record per site and also feed a main database of the records
- Site audit forms
- Snagging forms
If possible, all the above should work off the same application, with a management hierarchy so that key personnel can update, middle management can request data/records, and site staff can set up and manage at a local level. I would also like the facility to add additional forms.
I would also like to produce dashboards that show financial KPIs in the office. We currently record most of these in Excel, but I want a visual update that is easy to integrate and update, if possible with live information or by requesting data and updating.
3 days ago · 47 proposals · Remote