Extraction Projects
Looking for freelance Extraction jobs and project work? PeoplePerHour has you covered.
opportunity
Data extraction and ChatGPT integration
The goal is to integrate ChatGPT into an app designed to assist athletes by answering questions and providing personalised explosive workout plans, sourced from PDFs of public domain books.
Step 1: Data Extraction from PDFs
Handle Non-text Content: If the PDFs contain important non-text elements (like images of exercises), consider using an OCR (Optical Character Recognition) tool like Tesseract to convert these images to text.
Step 2: Data Cleaning and Preparation Process
Clean Extracted Data: Use Python to clean the data, including removing unwanted characters, standardizing terminology, and correcting OCR errors.
Structure Data: Convert cleaned data into a structured format like JSON or CSV, categorizing content by topics such as "speed training", "strength workouts", etc.
Natural Language Processing: Apply NLP techniques to refine the text, extract keywords, and prepare it for easy querying.
Configure API calls to send user queries to ChatGPT and receive responses. Store structured data from PDFs in Bubble’s database for quick reference by the AI.
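The cleaning and structuring steps in this brief could look roughly like the sketch below. It is only an illustration under stated assumptions: the function names, the keyword lists in `TOPIC_KEYWORDS` and the record layout are hypothetical, the text is assumed to be English and already extracted from the PDFs, and simple keyword matching stands in for the NLP step.

```python
import json
import re

# Hypothetical keyword lists; the brief only names the topics themselves.
TOPIC_KEYWORDS = {
    "speed training": ["sprint", "speed", "acceleration"],
    "strength workouts": ["squat", "deadlift", "strength"],
}


def clean_text(raw: str) -> str:
    """Remove common extraction debris from a passage of English text."""
    # Rejoin words that were hyphenated across a line break.
    text = re.sub(r"-\n(?=\w)", "", raw)
    # Replace control and non-ASCII characters with spaces (assumes ASCII English).
    text = re.sub(r"[^\x20-\x7E\n]", " ", text)
    # Collapse runs of whitespace into single spaces.
    text = re.sub(r"\s+", " ", text)
    return text.strip()


def categorize(passage: str) -> list[str]:
    """Return every topic whose keywords appear in the passage."""
    lowered = passage.lower()
    topics = [
        topic
        for topic, keywords in TOPIC_KEYWORDS.items()
        if any(keyword in lowered for keyword in keywords)
    ]
    return topics or ["uncategorized"]


def to_records(passages: list[str]) -> list[dict]:
    """Clean each passage and attach its topics, ready for JSON export."""
    records = []
    for raw in passages:
        text = clean_text(raw)
        records.append({"text": text, "topics": categorize(text)})
    return records


if __name__ == "__main__":
    sample = ["Sprint  drills\x0c improve accel-\neration."]
    print(json.dumps(to_records(sample), indent=2))
```

The resulting list of records could then be serialised to JSON or CSV before being loaded into the app's database.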
4 days ago, 38 proposals, Remote
opportunity
Data Extraction and Input
Hi all, I am looking for help to pull some price modelling data from a website and place it into an Excel file that replicates how it is shown on the web page. There is a lot of data and I do not think it can be exported, so it needs to be done manually. I do not think it is viable to scrape the data. Once we have pulled the data we then need to build charts and data filtering to display the data correctly. My feeling is that the freelancer who does the job will need to be an expert in MS Excel. It would be easier to do a Zoom call to show the freelancer the system and explain fully what is required, so the price below is a placeholder. Thanks!
21 days ago, 80 proposals, Remote
opportunity, pre-funded
Email addresses from LinkedIn profiles
I have a few thousand LinkedIn profiles and want someone to extract the names, job titles, employers and email addresses. Please quote me the cost and the time it would take (define any batch size you want). The cost of this project is just a placeholder as I don't know how much you charge. Say what you think is fair and we will agree before I accept the proposal. Thanks
3 hours ago, 57 proposals, Remote
Bank Statement Conversion into Excel
I'm looking for a professional freelancer to convert a 300-line PDF bank statement into an Excel report for reporting purposes. The specific bank statement details that need to be included are Date and Description as well as Debit and Credit amounts.
Key Requirements:
- Extract date and description details from the bank statement
- Extract Debit and Credit amounts from the bank statement
- Must have a clear understanding of date formatting in Excel
Ideal Skills/Experience:
- Must have extensive experience in data entry and Excel
- Should have a good understanding of banking terms
- Attention to detail is critical
- Previous experience in bank statement conversion will be advantageous.
17 days ago, 96 proposals, Remote
Meta-analysis on epidemiological scientific literature
- Must have experience reviewing scientific literature
- Review published academic literature on the risk factors of transmission of a virus
- Separation of literature into those which meet inclusion criteria and those which don't
- Extraction of data from those which meet inclusion criteria
19 days ago, 15 proposals, Remote
Expert photo editor
I am looking for a professional photo editor who can extract layers from images, both foreground and background, and enhance photos. The end result is for use in an app, so it has to be really good.
a month ago, 20 proposals, Remote
Web scraping
I am looking for web scraping software that, based on input, produces a spreadsheet of URLs, email addresses and phone numbers extracted from contact-us pages, using legitimate methods. A URL scraper will also need to be built into the application to facilitate the above process.
24 days ago, 29 proposals, Remote
opportunity
I need email addresses for my linkedin contacts database
We have a database of approximately 6,500 LinkedIn contacts, and unfortunately, the email addresses of many contacts are missing. Our goal is to find a freelancer who can help us update the database with valid email addresses. The freelancer should have experience in data mining, web scraping, or any other method that can help them extract email addresses from LinkedIn profiles. The project requires the freelancer to work remotely and should be completed within a reasonable timeframe. We are willing to provide additional information and support to the freelancer throughout the project. If you are interested in this project, please submit your proposal, including your estimated cost and timeline. We look forward to hearing from you and discussing the details further.
9 days ago, 72 proposals, Remote
Web Scraping
I have a series of projects in which we would like to take the contact data of travel agents from membership directory websites. One example is this site: https://www.abta.com/abta-member-search/results?search=London Another example is this site: https://www.etoa.org/member-search/?term=&type=&country=United+Kingdom&v_s_m=&d_s=&p_s=&c_r_l=&y_e=&c_a_p_s=&c= We need the directory data extracted, ideally into a spreadsheet with name, email address, telephone number, company name etc. We have 11 websites we would like the data from, and multiple other projects too. Kind regards, Nicola
11 days ago, 74 proposals, Remote
Daily MLS Listings Update and Maintenance
I am seeking a skilled freelancer to manage my MLS listings. Your responsibilities will include:
Propli task: New Development upload
Requirements: We are looking for a data entry team to upload and update the details of new development construction projects on the Costa del Sol (southern Spain). As a starter project, we will provide you with access details to one developer’s website (Nvoga), where they have 11 different projects listed. Your “intro” job will be to extract the information from the “Salvia” development (No. 3 on the page) on that site and insert it into our site. We have also provided a few videos showing a complete walk-through of the process for 2 of the projects, which you should be able to use as a reference.
Once we have checked that the information is correct and you have understood the full job, we will set up the milestone on freelancer and the work can begin on the remaining projects with this developer (and of course later we will be providing details of other developer websites with their own different projects). It's important to understand that each developer will display the information in a different way, so you need to know what information to look for. That's the main purpose of this first exercise.
Before bidding please look at these 2 videos showing the import of the first project “The Bliss Village”: [login to view URL] [login to view URL]
We have just under 5 weeks to get around 200 developments done.
2 days ago, 7 proposals, Remote
Seeking Web Scraper for Adjustments to Existing Real Est. Script
I am a Python enthusiast in need of a freelance web scraper for a personal project: making adjustments to an existing Python script that extracts data from Idealista.com. The goal is to refine the script to handle the website’s anti-scraping measures effectively.
Project Overview:
Script Adjustment: Update and optimize an existing WORKING script to improve data extraction reliability and efficiency. The script currently runs into issues with the website's anti-scraping defences.
Anti-Scraping Strategy: Implement simple and cost-effective strategies to overcome scraping blocks.
Requirements:
Proven experience in modifying existing web scraping scripts.
Familiarity with Python and scraping libraries like BeautifulSoup, or others.
Understanding of techniques to bypass or handle anti-scraping technologies.
Ability to work within a tight budget and deliver quick solutions.
Deliverables:
An updated, fully functional scraping script.
Brief documentation on the changes made and how to handle potential future issues.
Project Duration: Quick turnaround expected.
Budget: Since this is an adjustment to an existing script, I am looking for a cost-effective solution. Please provide your fixed-rate offer for this adjustment. I can provide my existing code so you can evaluate the changes needed.
If you are interested, please respond with any initial thoughts or specific techniques you think could be effective. I am looking to start as soon as possible and appreciate your swift response. Thank you!
19 days ago, 21 proposals, Remote
Create a new packaging using current logo, colours and template
A creative design professional is sought to generate fresh product packaging utilizing an existing branding guideline. The objective is to develop new packaging options that align with an established brand identity while breathing new life into product presentation. Six distinctive color schemes and complementary logos have been defined, along with a template formatting desired elements. Additionally, CAD technical drawings and samples of the current packaging showcase demonstrate visual style and written components to retain. This project offers a designer flexibility within set parameters to showcase their skill crafting aesthetically pleasing packaging concepts. Examples of the logo, color and template assets will be provided to extract guidance for harmonizing new designs. Efficiency is valued as all graphic assets and specifications are prepared in advance, minimizing requirement for supplemental direction. The focus lies in designing captivating packages that uphold brand consistency while exploring design possibilities within the predefined brand framework. Solid knowledge of print production specifications would prove advantageous in submitting designs achievable through standard printing methods.
8 hours ago, 22 proposals, Remote
opportunity
Parsing email headers
We require a skilled freelancer to develop a solution that seamlessly saves emails from Outlook to SharePoint (365). The primary objective is to extract specific email header information, including the sender, recipient, subject, and received date, and populate it into metadata within the email. This metadata will then be accessible and usable in SharePoint for further processing and organization. The solution should be designed to run automatically upon saving an email into a designated document library, ensuring a smooth and efficient process. The freelancer should possess a deep understanding of SharePoint and Outlook APIs, as well as experience with email parsing and metadata manipulation. A working prototype or demonstration of the solution is expected, along with comprehensive documentation and support to facilitate its implementation and maintenance.
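The header-extraction part of this brief can be sketched with Python's standard `email` package. This is a minimal illustration, not the requested Outlook/SharePoint solution: the function name and the metadata keys are hypothetical, the message is assumed to be available as raw RFC 5322 text, and the `Date` header is used as a stand-in for the received date (the actual receipt time lives in the `Received` headers).

```python
from email import message_from_string
from email.utils import getaddresses, parseaddr, parsedate_to_datetime

# Hypothetical sample message used only for demonstration.
SAMPLE = (
    "From: Alice Example <alice@example.com>\n"
    "To: Bob <bob@example.com>, carol@example.com\n"
    "Subject: Quarterly report\n"
    "Date: Mon, 03 Jun 2024 09:15:00 +0000\n"
    "\n"
    "Body text.\n"
)


def extract_header_metadata(raw_email: str) -> dict:
    """Pull sender, recipients, subject and date out of a raw email."""
    msg = message_from_string(raw_email)
    # parseaddr splits "Name <addr>" into (name, addr); keep the address only.
    _, sender = parseaddr(msg.get("From", ""))
    # getaddresses handles several comma-separated recipients per header.
    recipients = [addr for _, addr in getaddresses(msg.get_all("To", []))]
    date_header = msg.get("Date")
    date_iso = parsedate_to_datetime(date_header).isoformat() if date_header else None
    return {
        "sender": sender,
        "recipients": recipients,
        "subject": msg.get("Subject", ""),
        "date": date_iso,
    }


if __name__ == "__main__":
    print(extract_header_metadata(SAMPLE))
```

A dictionary like this could then be mapped onto document-library columns; how that mapping is done in SharePoint is outside the scope of this sketch.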
14 days ago, 21 proposals, Remote
Data Scraping Virtual Assistant
Dear Prospective Virtual Assistants,
Domin8 Digital is a dynamic digital marketing investment company looking for reliable and proficient virtual assistants to handle our data scraping needs. As we expand our operations, we recognise the value of outsourcing this crucial task to dedicated professionals like you.
Scope of Work:
- Efficiently scrape targeted data from specified sources.
- Organise and format scraped data according to provided guidelines.
- Ensure accuracy and quality of extracted data.
- Adhere to deadlines and project requirements.
Requirements:
- Proficient in data scraping tools and techniques.
- Attention to detail and ability to maintain data integrity.
- Strong organisational and time management skills.
- Previous experience in similar roles preferred.
If you possess the necessary skills and commitment to deliver high-quality results, we invite you to submit your proposal detailing your relevant experience, availability, and rates. Thank you for considering this opportunity to collaborate with us. We look forward to receiving your proposals.
Best Wishes,
Domin8 Digital
15 days ago, 37 proposals, Remote
Seeking No-Code Architect for Automation Project
We are seeking a talented no-code architect to assist us with a project focused on web scraping, content filtering, and process automation. The ideal candidate will have expertise in identifying and configuring the best no-code tools and platforms for our requirements, enabling us to automate processes and extract valuable insights without traditional coding. Responsibilities: Evaluate project requirements and objectives for web scraping, content filtering, and automation. Research and assess available no-code platforms, tools, and integrations suitable for the project's needs. Recommend and select the most suitable no-code solutions for web scraping, data filtering, and automation tasks. Configure and set up integrations between selected no-code tools and platforms to achieve seamless automation. Provide guidance and best practices for leveraging no-code solutions effectively to meet project goals. Requirements: Proven experience as a no-code architect or consultant, with a strong track record of implementing successful no-code solutions. Comprehensive knowledge of various no-code platforms, tools, and integrations, particularly those suitable for web scraping and automation. Familiarity with web scraping techniques, data extraction methods, and content filtering processes. Strong problem-solving skills and the ability to assess project requirements to recommend suitable no-code solutions. Excellent communication skills and the ability to collaborate effectively with project stakeholders. If you're passionate about harnessing the power of no-code tools to automate processes and drive efficiency, we'd love to hear from you! Please provide details of your relevant experience and examples of previous projects involving no-code architecture and automation. Budget: Flexible, depending on experience and scope of work. Duration: Short-term project with potential for ongoing collaboration. 
Deadline for Applications: [Insert deadline or state "Open until filled"] We look forward to reviewing your proposals and discussing how we can work together to achieve our project goals. Thank you for considering this opportunity!
22 days ago, 13 proposals, Remote
WhatsApp integration
This project requires developing a customized automated solution for large scale WhatsApp communication. The client needs to send personalized text messages to more than 200,000 recipients simultaneously through the popular messaging platform WhatsApp. The proposed system should be able to connect to the client's user database to extract requisite contact details like mobile numbers and fetch personalized message templates assigned to different user segments. It is imperative that all communication is sent while adhering to WhatsApp's platform policies and terms of use. The automated messaging mechanism developed should be scalable to handle such high volumes of concurrent outbound deliveries without encountering errors or delays. It must employ intelligent throttling techniques to stagger message dispatch over an extended period if required by WhatsApp APIs. Detailed logging and reporting functionalities are important to track message transmission status for each contact. The interface for operators should be intuitive and allow easy template management, recipient filtering, scheduling message runs. Security and privacy of user data being accessed and transmitted needs to be of utmost priority. Advanced technical and WhatsApp integration skills are necessary to take up this challenging project. Knowledge of application development, automation, APIs and messaging protocols is a pre-requisite. Creating a robust, policy-compliant and high performance solution within the stipulated timeline will be the key objective.
22 days ago, 15 proposals, Remote
Dimension Elevations of CAD Drawing Required
Dimensioned elevations of CAD drawings are required for a recently completed architectural project. Detailed measurements need to be extracted and annotated directly onto relevant views within the CAD files. The elevations must be dimensioned to current industry standards and formatting conventions with all linear measurements clearly labelled. Additional notations for calculated areas, materials, finishes and other requisite specifications should also be included as needed. The applicant should have extensive experience in generating technical drawings directly from 3D CAD models using specialized software such as AutoCAD, Revit or similar platforms. Strong understanding of architectural dimensioning practices and annotation techniques is essential. The ability to thoroughly inspect drawings for completeness and accuracy is important as the dimensions will be relied upon for construction and fabrication purposes. Positive communication and timely delivery are important as tight project deadlines must be met. The freelancer will be expected to work independently with minimal guidance required. Proficiency in dimensioning a variety of building elements such as walls, floors, roofs, fixtures and other architectural components from multiple CAD views is necessary. Only applicants with proven portfolio of dimensioned technical drawings generated from 3D models need apply for this prestigious project. Reference checks will be conducted to validate experience and quality of past work.
a month ago, 18 proposals, Remote
Research and Summary on APIs for DMCA Takedown Tool
We are seeking a skilled researcher to conduct a thorough analysis of APIs relevant to building a tool for automating DMCA takedown notices. The ideal candidate will have experience in API research and summarization, with a focus on identifying APIs suitable for DMCA takedown requests and copyright enforcement.
Responsibilities:
- Conduct research to identify APIs related to DMCA takedown notices, content analysis, reverse image search, text analysis, and copyright monitoring.
- Evaluate the features, capabilities, and pricing of each API to assess its suitability for automating DMCA takedown requests.
- Provide a comprehensive summary of key findings, including a comparison of APIs based on factors such as accuracy, coverage, ease of integration, and cost-effectiveness.
- Present recommendations for the most suitable APIs to use in building the DMCA takedown tool, considering the project's goals and requirements for copyright enforcement.
Requirements:
- Strong research skills and experience in gathering information from various sources, including API documentation, developer resources, and online reviews.
- Familiarity with APIs related to DMCA takedown notices, content analysis, reverse image search, text analysis, and copyright monitoring, as well as their respective use cases and limitations.
- Excellent communication skills and the ability to articulate findings and recommendations clearly and concisely.
- Attention to detail and the ability to analyze complex information to extract key insights relevant to building a DMCA takedown tool.
If you have a passion for API research and expertise in identifying and evaluating APIs for copyright enforcement purposes, we invite you to apply. Please provide details of your relevant experience and examples of previous research or analysis work. We look forward to receiving your proposals and collaborating with you to identify the best APIs for building our DMCA takedown tool. Thank you for considering this opportunity!
19 days ago, 11 proposals, Remote
Past "Extraction" Projects
Link extraction
Budget is a placeholder only; I need a firm quote. Looking for someone to build a small program that can take an upload of a PDF, extract all links in that document and enter them into a CSV or Excel file, and to advise whether this program can be installed on a laptop or requires a server. It is probably only an hour's work; from looking at online suggestions, Python seems a good way of doing it.
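As a rough illustration of the link-to-CSV part of this request, the sketch below finds URLs in text and writes them out as CSV using only the Python standard library. It assumes the PDF's text has already been extracted by a separate PDF library (not shown), and it would miss links stored only as clickable annotations rather than visible text. The function names and the single `link` column are hypothetical.

```python
import csv
import io
import re

# Matches http/https URLs up to the next whitespace or bracket character.
URL_PATTERN = re.compile(r"https?://[^\s<>()\[\]]+")


def extract_links(text: str) -> list[str]:
    """Return unique URLs in order of first appearance."""
    links = []
    for match in URL_PATTERN.findall(text):
        # Trailing punctuation usually belongs to the sentence, not the URL.
        url = match.rstrip(".,;:")
        if url not in links:
            links.append(url)
    return links


def links_to_csv(links: list[str]) -> str:
    """Serialise the links as CSV text with a one-column header."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["link"])
    for link in links:
        writer.writerow([link])
    return buffer.getvalue()


if __name__ == "__main__":
    sample = "See https://example.com/a, and http://example.org/b. Again https://example.com/a"
    print(links_to_csv(extract_links(sample)))
```

A script of this kind runs locally with a standard Python installation, so on its own it would not need a server.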
Design a brochure
Good day, I need a brochure designed to hand out at a conference next week: one in A5 size, printed front and back, and a small openable brochure as well. The designer should pick and extract all the information, data and design from an existing website, www.bisbaghdad.com. This is an urgent 24-hour job.