
Data extraction and ChatGPT integration
- or -
Post a project like this650
£396(approx. $526)
- Posted:
- Proposals: 31
- Remote
- #4204353
- OPPORTUNITY
- Awarded
AI-ML, application development, Mobile APP, Web Application Development Service Provider

52047431130412148645583597491119304267481812444828637457
Description
Experience Level: Expert
To include ChatGPT in an app created on an app designed to assist athletes by answering questions and providing personalised explosive workout plans, sourced from PDFs of public domain books
Step 1: Data Extraction from PDFs
Handle Non-text Content: If the PDFs contain important non-text elements (like images of exercises), consider using an OCR (Optical Character Recognition) tool like Tesseract to convert these images to text.
Step 2: Data Cleaning and Preparation
Process:
Clean Extracted Data: Use Python to clean the data, including removing unwanted characters, standardizing terminology, and correcting OCR errors.
Structure Data: Convert cleaned data into a structured format like JSON or CSV, categorizing content by topics such as "speed training", "strength workouts", etc.
Natural Language Processing: Apply NLP techniques to refine the text, extract keywords, and prepare it for easy querying.
Configure API calls to send user queries to ChatGPT and receive responses.
Store structured data from PDFs in Bubble’s database for quick reference by the AI.
Step 1: Data Extraction from PDFs
Handle Non-text Content: If the PDFs contain important non-text elements (like images of exercises), consider using an OCR (Optical Character Recognition) tool like Tesseract to convert these images to text.
Step 2: Data Cleaning and Preparation
Process:
Clean Extracted Data: Use Python to clean the data, including removing unwanted characters, standardizing terminology, and correcting OCR errors.
Structure Data: Convert cleaned data into a structured format like JSON or CSV, categorizing content by topics such as "speed training", "strength workouts", etc.
Natural Language Processing: Apply NLP techniques to refine the text, extract keywords, and prepare it for easy querying.
Configure API calls to send user queries to ChatGPT and receive responses.
Store structured data from PDFs in Bubble’s database for quick reference by the AI.
Ukdawgz A.
100% (2)Projects Completed
2
Freelancers worked with
2
Projects awarded
10%
Last project
6 Mar 2026
United Kingdom
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-

What about the use of Javascript nodejs ? will you allow me to use expressjs file reader to extract meaningful information from your pdf?
-

Will you make use of application programming interface and appi key to just make a request to open ai endpoint and get all response needed and include in your frontend ?
10952161095214
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies