
Data extraction and ChatGPT integration
- or -
Post a project like this415
£396(approx. $533)
- Posted:
- Proposals: 31
- Remote
- #4204353
- OPPORTUNITY
- Awarded
Software Engineer | IT Professional | AI developer | Mobile App Developer and more !
⭐ TOP CERT Graphic Designer ⭐| Expert 2D/3D Render | Video Animator | Web Developer |Logo Designer |Graphic Animations | Video Editor ||Illustration.

Ranked #1 In WordPress Development on PPH | SEO Specialist | Web Design & Development

Web and App Development | Database Expert | Database Analysis| Python Developer


WordPress Expert | Web & App Developer | SEO Specialist | Content Writer | Blockchain | Python | OpenAI | Machine Learning

♛ PPH No. #1 ♛ 21Years of Experience in Web Development , Web Designing, Magento , Shopify, WordPress , API Integration, Full-Stack Ruby on Rails Developer,AngularJS / Node.js


Python | Django | | OpenAI | GenerativeAI | ML | AI| Face recognition|ChatGPT|GoLang|React | Mobile App | Graphic Design
⚡ Data Scientist & Data Extraction Expert | Python, Excel Dashboards, Tableau, VA Support |PowerPoint Presentation

71021469487324108897169977324102179691122079731510051170005280013283605989315815





Description
Experience Level: Expert
To include ChatGPT in an app created on an app designed to assist athletes by answering questions and providing personalised explosive workout plans, sourced from PDFs of public domain books
Step 1: Data Extraction from PDFs
Handle Non-text Content: If the PDFs contain important non-text elements (like images of exercises), consider using an OCR (Optical Character Recognition) tool like Tesseract to convert these images to text.
Step 2: Data Cleaning and Preparation
Process:
Clean Extracted Data: Use Python to clean the data, including removing unwanted characters, standardizing terminology, and correcting OCR errors.
Structure Data: Convert cleaned data into a structured format like JSON or CSV, categorizing content by topics such as "speed training", "strength workouts", etc.
Natural Language Processing: Apply NLP techniques to refine the text, extract keywords, and prepare it for easy querying.
Configure API calls to send user queries to ChatGPT and receive responses.
Store structured data from PDFs in Bubble’s database for quick reference by the AI.
Step 1: Data Extraction from PDFs
Handle Non-text Content: If the PDFs contain important non-text elements (like images of exercises), consider using an OCR (Optical Character Recognition) tool like Tesseract to convert these images to text.
Step 2: Data Cleaning and Preparation
Process:
Clean Extracted Data: Use Python to clean the data, including removing unwanted characters, standardizing terminology, and correcting OCR errors.
Structure Data: Convert cleaned data into a structured format like JSON or CSV, categorizing content by topics such as "speed training", "strength workouts", etc.
Natural Language Processing: Apply NLP techniques to refine the text, extract keywords, and prepare it for easy querying.
Configure API calls to send user queries to ChatGPT and receive responses.
Store structured data from PDFs in Bubble’s database for quick reference by the AI.

Ukdawgz A.
100% (2)Projects Completed
2
Freelancers worked with
2
Projects awarded
13%
Last project
25 Jun 2025
United Kingdom
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
What about the use of Javascript nodejs ? will you allow me to use expressjs file reader to extract meaningful information from your pdf?
-
Will you make use of application programming interface and appi key to just make a request to open ai endpoint and get all response needed and include in your frontend ?
10952161095214
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies