
Data Mining Projects
Looking for freelance Data Mining jobs and project work? PeoplePerHour has you covered.
Physical Visit to Saint Petersburg Mining University
We are from Integrity Asia, a background screening company operating across Indonesia, Thailand, and Malaysia. We are currently looking for a local freelancer in Russia who can assist us with a physical visit to Saint Petersburg Mining University (2, 21st Line, St Petersburg 199106, Russia) to conduct a degree verification on behalf of one of our candidates as part of our pre-employment background screening services. The purpose of this verification is to confirm the candidate's educational background, including attendance records, graduation status, and the authenticity of their qualification. We previously attempted to contact the university via e-mail, but have not received a response. Therefore, we are seeking assistance from someone who can conduct an in-person visit to the university on our behalf. Please let me know if this is something you would be comfortable assisting with. If so, I would be happy to provide further details regarding the assignment. I look forward to hearing from you. Thank you.
13 days ago2 proposalsRemoteopportunity
Data Scraper
I need Desicion maker emails, phone numbers business name, all checked that are working of restaurants in Alicante, Valencia cities in Spain and the small towns around them.
8 days ago47 proposalsRemoteopportunity
Yorkshire Data Scraper
We are looking to purchase or commission an up-to-date, high-quality B2B marketing database covering Yorkshire and surrounding areas The data will be used to market Material Handling Equipment, Forklift Trucks, Warehouse Equipment, Scrubber Dryers and Industrial Sweepers, so the data must be relevant to companies with factories, warehouses, depots, production sites, industrial premises or operational facilities We are not looking for a basic scraped company list. We require verified, marketing-ready data with named contacts and valid emails wherever possible Include companies located in the following postcode areas: BB, BD, BL, HG, LS, HX, HD, WF, DN, S, SK The address should relate to the actual trading, manufacturing, warehouse, depot or operational site, not just a registered office address where possible. Include companies in the following sectors: 10 – Manufacture of food products 11 – Manufacture of beverages 13 – Manufacture of textiles 14 – Manufacture of wearing apparel 15 – Manufacture of leather and related products 16 – Manufacture of wood and products of wood and cork, except furniture 17 – Manufacture of paper and paper products 18 – Printing and reproduction of recorded media 19 – Manufacture of coke and refined petroleum products 20 – Manufacture of chemicals and chemical products 21 – Manufacture of pharmaceutical products and preparations 22 – Manufacture of rubber and plastic products 23 – Manufacture of other non-metallic mineral products 24 – Manufacture of basic metals 25 – Manufacture of fabricated metal products, except machinery and equipment 26 – Manufacture of computer, electronic and optical products 27 – Manufacture of electrical equipment 28 – Manufacture of machinery and equipment n.e.c. 29 – Manufacture of motor vehicles, trailers and semi-trailers 30 – Manufacture of other transport equipment 31 – Manufacture of furniture 38 – Waste collection, treatment and disposal activities; materials recovery 41 – Construction of buildings 52 – Warehousing and support activities for transportation We want to focus mainly on companies with 1–199 employees Please include employee banding, for example: 1–10 11–25 26–50 51–100 101–199 200+, only if highly relevant For every company, we require the following where available: Company type/industry Company name SIC code Website Employee banding Site address split into address line 1, line 2, line 3, town/city, county and postcode Postcode area Company telephone number Company LinkedIn page Date verified Contact Requirements We require named decision-maker contacts wherever possible. Priority contacts include: Operations Manager Warehouse Manager Purchasing Manager Procurement Manager Facilities Manager Site Manager Production Manager Logistics Manager Managing Director Director General Manager For each contact, please provide: Contact name Job title Direct email address Generic email address, only where no direct email is available Email type: Direct or Generic Email validation status Landline number Direct dial, where available Mobile number, where available Contact LinkedIn profile Date contact was last verified Email Quality Requirements Valid, up-to-date emails are essential. Direct business emails are strongly preferred. Generic emails should only be used where a verified named contact email cannot be found. Please separate direct and generic emails into separate columns. Do not mix them in the same field. We do not want large volumes of guessed, invalid, risky or unverified email addresses. Please confirm what email validation method or tool has been used and include the validation result in the data. Acceptable generic emails, only where direct contacts are unavailable, may include: info@ sales@ enquiries@ purchasing@ procurement@ operations@ Please do not include personal email addresses such as Gmail, Outlook, Hotmail or Yahoo. The data must be: Recently verified Relevant to the requested postcode areas Relevant to the requested industries Deduplicated Suitable for B2B marketing use Free from dormant or dissolved companies Free from irrelevant businesses Focused on genuine operating companies Accurate and cleanly formatted Please do not apply if you are only able to provide basic scraped Companies House data without named contacts and verified emails. Before placing the full order, please provide a sample of 50–100 records. The sample should include a mix of postcode areas, sectors, employee sizes, direct contacts and generic contacts where direct emails are unavailable. The sample will be checked for relevance, accuracy, direct email quality, duplicate records, address quality and suitability for marketing material handling and floor care equipment. We expect there to be over **2,500 companies** matching the requested criteria. Please provide an estimated breakdown by: Postcode area Industry / SIC code Employee banding Number of direct contacts Number of generic contacts Number of validated emails
5 days ago41 proposalsRemoteCold Call Insurance Data
I am looking for an experienced telemarketer to dial B2C data that will be provided. I would like to trial 2000 records to see how they can perform. Data capture will need to include; Insured, Uninsured, Insured through work then insurance provider i.e AXA, Aviva etc then renewal date and interested not interested. I am a broker in Private health insurance so will all need to be compliant etc. Of course i'd love everyone to be ready to discuss, transferred or appointment to be booked but as long as we can cleanse the data that is my main focus at this stage. if successful this will be an ongoing project for the foreseeable. This project is for real humans only with clear English as this is a very sceptical industry and I do not feel it would be very successful otherwise. Please offer a fair price on the service and happy for 2 prices, one for a trial and one for an ongoing contract I look forward to hearing from you Ben
6 days ago12 proposalsRemoteData Scraping - Competitor feedback
I need someone who can look at competitor recommendation websites for certain businesses and then work out who their customers are from people who have left comments of that business. This role is best suited to someone in the UK or at least very familiar with what names are common / uncommon in the UK. There will be names that are too generic and you cannot find the exact person for the business on companies house (like John Smith, Adam Cox, Joe Frost) , so they can be skipped. I need you to find exact businesses and be 100% sure that the company you find belongs to the person who left the comment. This task will take accuracy and care, rather than quickly gathering as much data that wont be needed. The company on Companies House needs to show it is ACTIVE, not anything else. Otherwise move on to the next record. EXAMPLE For example there is a comment from someone called Louise Lamberti (which isnt a very common name) Then i searched the name on Companies House and find the company she has: https://find-and-update.company-information.service.gov.uk/officers/0jr3CzJOVyJKznJUNuQQgwaAZH8/appointments There is no one else with this name as an Officer on Companies House so i am pretty sure this is the company it relates to. Then i need someone to gather as much information as possible on the business which the attached spreadsheet. I need named business email addresses and contact numbers - no personal email addresses! Tab 1 holds information for the business that we need Tab 2 are the different company names that you can search their feedback If you need more clarification please let me know. Looking for someone to work on this for an ongoing basis - will pay £10 per hour and expect to get around 40 records per hour. PLEASE COULD YOU SEND ME AN EXAMPLE ON THE ATTACHED SPREADSHEET OF 2 RECORDS THAT YOU FIND SO I CAN TEST THE QUALITY OF YOUR WORK BEFORE I ACCEPT. Happy to go through a video call to explain in more detail if this is required. £10 per 100 records - ongoing project
6 days ago13 proposalsRemoteAUDIENCE DATA BUY
I am looking to acquire a database of approximately 50,000 individuals interested in transitioning their careers to the IT industry. My target audience is people originally from Nigeria who are currently living in the USA or Canada. If you can provide access to this type of audience data, please let me know the available options, data quality, and pricing.
13 days ago14 proposalsRemoteData leak from laptop
A critical data leak originating from a laptop requires urgent remediation. I need an experienced cybersecurity specialist to investigate and remediate information leakage, identify root cause, secure vulnerable endpoints, recover compromised data where possible, and implement preventive controls and policies. Deliverables include forensic analysis, vulnerability patching, secure configuration, encryption and backup recommendations, and a concise incident report with actionable steps to prevent recurrence.
14 days ago12 proposalsRemoteSecondary school contact data gap-fill
I have a spreadsheet of 67 secondary UK schools where I need to identify, by name, the people who sit in six specific roles at each school — and find their verified work email address. You will be filling in a pre-built spreadsheet I provide. Most of the work is done — the school list, websites, addresses, phone numbers and which roles still need finding are all already mapped. Your job is to look up the names and verify the emails. The 6 roles per school Head Teacher Cover Manager (the day-to-day supply booker) School Business Manager (SBM) Head Teacher's PA / Office Manager Deputy or Assistant Head (with a staffing or cover portfolio if possible) SENCO (Special Educational Needs Coordinator) What you'll deliver The same spreadsheet I send you, with every red "FIND" cell filled in using one of three formats: Verified name | verified email — for example: Sample Postholder | s.postholder@exampleschool.org.uk NAME FOUND — EMAIL NOT VERIFIED | [name] — if you can identify the postholder but cannot get a verified email NOT FOUND — if no current postholder can be confirmed after a proper search There are approximately 240 individual cells to fill across the 67 schools. Not every school needs all 6 — some roles are already covered and marked HAVE in green; just skip those. You will need to have access to a range of software and resources to support you as these email addresses are not simply on the school website - therefore research and effort will be required to find and source correct validated emails.
8 days ago45 proposalsRemoteDocumentation & Data Entry Assistant
We are looking for a reliable and detail-oriented Documentation & Data Entry Assistant to support our team with simple documentation, content organization, and data entry tasks. This is a remote support role. No advanced technical skills are required, but the ideal person should be responsible, responsive, organized, and willing to learn. The work will mainly involve entering information accurately, formatting documents, updating internal records, and helping keep our project information organized. Responsibilities: Enter and update information accurately in internal documents and spreadsheets. Organize notes, records, and simple project information. Format written content clearly and consistently. Review information for basic spelling, structure, and accuracy. Follow clear instructions and complete assigned tasks on time. Communicate progress and ask questions when clarification is needed. Requirements: Good written English. Strong attention to detail. Ability to follow instructions carefully. Reliable communication and quick response time. Basic computer skills, including Google Docs, Google Sheets, Microsoft Word, or similar tools. Interest in learning and supporting documentation-related tasks. Must be available to overlap at least 4 hours per day during US business hours. Important Notes: This role is only for internal documentation, data entry, and administrative support. The selected freelancer will not be asked to post listings on other platforms, create accounts, manage third-party accounts, or perform any activity that violates another website’s policies. Work Arrangement: Remote, part-time or flexible schedule. Daily communication and at least 4 hours of overlap with US business hours are required. Ideal Candidate: Someone who is reliable, honest, responsive, detail-oriented, and passionate about doing accurate work.
14 days ago56 proposalsRemoteopportunity
Structured Data Verifier / Research Assistant
Seeking a Structured Data Verifier / Research Assistant to compile accurate source URLs, required documents, processing times, and basic eligibility criteria for Canadian relocation pathways. Candidate will populate a provided template with verified, up-to-date information across multiple programs, ensuring consistency, proper citation formatting, and clear, concise entries. Attention to detail, reliable research skills, and familiarity with immigration resources are essential. Timely delivery and data integrity required.
13 days ago29 proposalsRemoteData extraction from web
i need intelligentbodymassage.com all blogs and pages on word if someone can do send a proposal
a month ago23 proposalsRemoteLogin Testing & Data Extraction Automation
Deadline is 23.00. 2026-06-05. We are looking for an experienced automation developer to build a solution that can process a list of authorized test accounts, verify login credentials, and extract specific account information for reporting purposes. The project is intended for legitimate QA, testing, or account-management workflows and must comply with all applicable laws, platform terms of service, and authorization requirements. The tool should accept a CSV or TXT file containing usernames or email addresses along with passwords. It should attempt to log into each account, determine whether the login was successful or failed, and record the result. For successfully authenticated accounts, the tool should collect available information including shipment totals, country counts (for example, GB: 2, US: 9), account type (Individual or Business), account status (Active or Inactive), account country, account address, company name, and whether an active payment method is attached to the account. The solution must not collect, display, store, export, or process any sensitive payment information. This includes credit card numbers, expiration dates, CVV codes, bank account details, or payment credentials. The only payment-related information that should be returned is a simple Yes or No indicator showing whether an active payment method exists. The final output should be provided as a CSV report containing the username, login result, shipment total, country count, account type, account status, account country, account address, company name, payment method present (Yes/No), and any notes or error messages encountered during processing. OB2 is the preferred technology, The solution should include proper error handling, retry logic where appropriate, clear documentation, and complete source code delivery. The final deliverables should include the full source code, setup instructions, documentation, an example output CSV file, and a demonstration using authorized test accounts.
18 days ago26 proposalsRemoteCreate automated planning application spreadsheet
I am looking for someone to create an automated spreadsheet/report that pulls the previous week’s UK planning applications from the UK PlanIt data/API. The purpose of the report is to help identify potential trade opportunities for builders, loft conversion companies, kitchen/renovation firms, roofers and other home improvement trades. I need the spreadsheet to be refreshable each week, so I can update it with the latest planning applications and then export or share the data with ChatGPT for analysis. What I need is a spreadsheet system, ideally in Google Sheets or Excel, that can: 1. Pull planning application data from UK PlanIt. 2. Refresh the report weekly to show the previous 7 days of applications. 3. Cover all UK counties / planning areas, not just one local authority. 4. Handle multiple planning authorities within each county where needed. 5. Pull clean, structured data into a spreadsheet. 6. Allow me to filter by county, planning authority, date, application type and likely trade opportunity. 7. Create summary tabs that make the data easy to review. 8. Be simple enough for a non-technical user to refresh each week. **Important functionality required:** The spreadsheet should include: * A refresh button or clear refresh process. * A date range selector, ideally defaulting to the last 7 days. * County / area filtering. * Planning authority filtering. * Keyword filtering for relevant trade opportunities. * Automatic categorisation where possible, for example: * Extensions * Loft conversions * Garage conversions * Renovations/refurbishments * Kitchens * Roofing * Outbuildings * Commercial fit-outs * Other building works * A clean export tab that can be copied into ChatGPT for analysis. * Basic error handling if the API limit is reached or if a request fails. * A simple instruction tab explaining how I refresh and use the report. **Suggested spreadsheet tabs:** 1. **Instructions** Simple user guide explaining how to refresh the data and use the spreadsheet. 2. **Settings / Control Panel** Date range, counties/areas to include, keywords, refresh controls and any API settings. 3. **Raw Planning Applications** The unedited data pulled from PlanIt. 4. **Cleaned Applications** Clean version of the data with standardised columns. 5. **Trade Categorisation** Applications categorised by likely trade relevance. 6. **County Summary** Number of opportunities by county and trade type. 7. **Planning Authority Summary** Number of opportunities by local authority. 8. **ChatGPT Export** A clean tab designed specifically so I can copy/export the data and ask ChatGPT to analyse it. **Required data fields:** Where available from PlanIt, I would like the spreadsheet to include: * Application name/reference * Planning authority * County / area * Application start date * Address * Description * Application type * Development type * Status * Decision, if available * Applicant / agent details, if available * Link to planning application * Latitude / longitude, if available * Last scraped / last changed date * Suggested trade category * Opportunity score, if possible **Trade opportunity scoring:** Ideally, I would like a simple scoring system to highlight the best opportunities. For example: * High relevance: extension, loft conversion, conversion, major renovation * Medium relevance: alterations, outbuildings, roof works, garage conversion * Low relevance: tree works, signage, minor admin applications, discharge of conditions I am happy for the freelancer to suggest the best scoring approach. **Technical requirements:** The freelancer should be comfortable working with: * APIs * Google Sheets Apps Script and/or Excel Power Query * CSV/JSON data imports * Pagination * Rate limits * Data cleaning * Building refreshable dashboards/reports The PlanIt API has paging and request limits, so the system must be built responsibly and should not rely on one huge request. **End goal:** Each week I want to be able to refresh the spreadsheet, see the latest planning applications across all counties, identify the best trade opportunities, and then ask ChatGPT to analyse the data by county, trade type and opportunity quality. **Deliverables:** 1. A working Google Sheet or Excel workbook. 2. Automated or semi-automated weekly refresh process. 3. All required tabs and filters. 4. Clean data structure ready for ChatGPT analysis. 5. Simple instructions for use. 6. A short handover call or written walkthrough. 7. Notes on any limitations of the PlanIt API or recommended future improvements. **Please include in your response:** * Whether you recommend Google Sheets or Excel for this. * Examples of similar API/spreadsheet automation work. * How you would handle all counties and multiple planning authorities. * How you would manage API limits and pagination. * Estimated delivery time. * Fixed price quote.
15 hours ago31 proposalsRemotePowerBi Project
Seeking an experienced Power BI developer to implement a series of reports sourced via API from a MySQL database. We are a membership organisation using Paid Memberships Pro since October 2025; customer data is available and report layouts are designed on paper. Tasks: review data model, create optimized datasets, build interactive Power BI reports, and validate against designs. Project kickoff in the next weeks with completion by end of July. Opportunity for ongoing occasional support across other Power BI datasets.
8 days ago54 proposalsRemoteApp based KPI Dashboard
Hi I am developing a company KPI scorecard for our property investment company and we are currently inputting data into an excel spreadsheet and I am hoping this can then be developed into a dashboard for board meetings. My intial though is to use Microsoft Power Apps as we are on MS 365 / Teams etc. Hoping someone can help Thanks
5 days ago49 proposalsRemoteTelesales 12 hours per week
I’m looking for a telesales person to work around 12 hours per week, calling local businesses in Surrey and offering SEO and copywriting services. We have the data. It is cold calling. My company is Vanilla Circus, and we’ve been in business since 2009. Many thanks, Benedict Sykes Vanilla Circus
6 days ago30 proposalsRemoteBookkeeping - Xero
Need Xero experienced bookkeeper based in Jaipur only please. Data entry, full reconciliation of 20 banks and accuracy required e-commerce skills necessary
12 days ago15 proposalsRemoteopportunity
I need a Excel Dashboard creating
I am looking for an Excel/Power Query/VBA specialist to build a planning application lead-generation workbook using the UK PlanIt data https://www.planit.org.uk (https://www.planit.org.uk/api/) The workbook needs to let me click a button each week to pull the last 7 weeks of planning applications from all councils/planning authorities in Great Britain via the UK PlanIt API. Core requirements: 1. Data Import * Connect to the UK PlanIt API. * Pull applications from all active planning authorities/councils in Great Britain. * Use paging properly so results are not missed. * Respect API rate limits. * Pull new, changed and decided applications where possible. * Store the raw imported data in a master table. 2. Refresh Button * Add a simple “Refresh Planning Data” button. * When clicked, it should update the master data table. * Avoid duplicating applications already imported. * Keep a log of refresh date, number of records pulled, errors and skipped records. 3. Data Enrichment Columns Add calculated/enriched fields such as: * Project category * Builder opportunity: Yes/No * Roofer opportunity: Yes/No * Electrician opportunity: Yes/No * Plumber opportunity: Yes/No * Landscaper opportunity: Yes/No * Architect opportunity: Yes/No * Estimated project value range * Lead rating: Hot/Warm/Cold * Opportunity score 1–10 * Planning stage * Contact timing * Recommended action 4. Filters / Subscriber Views I need a way to filter the master data so different subscribers can only see opportunities relevant to: * Their trade type, e.g. builder, roofer, architect, electrician * Their subscribed area/council/postcode radius * Their lead rating, e.g. Hot only * Their project type, e.g. extensions, loft conversions, new builds Ideally, I would like a controlled output sheet where I can select a subscriber name and the workbook generates only the records they are allowed to see. 5. Data Protection / Access Control I need advice on the best way to protect the wider data so subscribers cannot access the full master database. Please advise whether this can be safely done in Excel, or whether separate exported workbooks/reports should be generated per subscriber. 6. Output Reports The workbook should be able to create/export: * A full internal master report * A filtered subscriber report * Trade-specific sheets * Area-specific sheets * Weekly opportunity summary 7. Dashboard Create a dashboard showing: * Total applications imported * Hot leads * Warm leads * Cold leads * Applications by council * Applications by trade opportunity * Estimated project value * New/changed/decided applications * Refresh status 8. Future Proofing Please build this in a way that can later connect to AI/ChatGPT enrichment, where application descriptions can be summarised and scored automatically. Important: The workbook must be reliable, repeatable and easy for a non-technical user to operate each week. Please also advise whether Excel is the right tool for this, or whether the data import and subscriber filtering should eventually sit in a database or web app.
13 days ago72 proposalsRemoteOpenClaw/OutlierAI Specialist-Python, Rubrics & Agent Trace
Overview: We are looking for a remote AI developer/evaluator with hands-on experience in OpenClaw and AI task platforms such as Outlier. The ideal candidate can support both OpenClaw Atlas-style tasks and authentic OpenClaw session trace submission work. Responsibilities: - Work on OpenClaw/Outlier-style AI evaluation tasks - Build or review agent workflows using OpenClaw - Create task-specific rubrics, validation checks, and unit tests - Evaluate AI model trajectories, outputs, and common LLM errors - Use Python for coding-related tasks when needed - Export and prepare eligible completed OpenClaw sessions with 150+ turns - Redact PII, credentials, confidential data, and sensitive identifiers - Package traces and related artifacts according to submission guidelines - Follow all platform rules, rights, privacy, and compliance requirements Requirements: - Proven experience with OpenClaw - Experience with Outlier, RLHF, LLM evaluation, or AI training platforms - Strong Python/coding knowledge - Ability to write clear rubrics and evaluation criteria - Understanding of AI agents, tool use, trajectories, and prompt quality - Experience with long-horizon agentic sessions preferred - Must only use legitimate personal work/data with proper rights to share - Strong English reading and writing skills - Detail-oriented, reliable, and able to work remotely Nice to Have: - Existing real OpenClaw sessions with 150+ turns - Experience with OpenClaw Atlas - Experience in data redaction, DevOps, cybersecurity, or API workflows - Prior work on rubrics, unit tests, or model evaluation tasks Important: - No fabricated sessions - No account sharing - No confidential, customer, or employer data - No policy violations - All work must comply with platform terms, privacy rules, and data rights requirements
12 days ago30 proposalsRemoteNeed Senior Web3 Engineer for DAO Platform Development
We’re hiring some Senior Full-Stack Web3 Engineer to help develop a DAO platform using Next.js, Node.js, and Web3 technologies. Features end-to-end: - proposal/voting workflows - wallet transaction UX - backend API reliability/security - on-chain integration + data sync Stack: React, Next.js, Node.js, Blockchain, Web3, API integration, Smart Contract Can you share your experience participating in a similar project previously?
3 days ago28 proposalsRemote