Data extraction and ChatGPT integration

- or -

Post a project like this

Ends in (days)

786

Fixed Price

£396(approx. $530)

Posted: 2 years ago
Proposals: 31
Remote
#4204353
OPPORTUNITY
Awarded

+ have already sent a proposal.

Description

Experience Level: Expert

To include ChatGPT in an app created on an app designed to assist athletes by answering questions and providing personalised explosive workout plans, sourced from PDFs of public domain books

Step 1: Data Extraction from PDFs

Handle Non-text Content: If the PDFs contain important non-text elements (like images of exercises), consider using an OCR (Optical Character Recognition) tool like Tesseract to convert these images to text.

Step 2: Data Cleaning and Preparation

Process:

Clean Extracted Data: Use Python to clean the data, including removing unwanted characters, standardizing terminology, and correcting OCR errors.

Structure Data: Convert cleaned data into a structured format like JSON or CSV, categorizing content by topics such as "speed training", "strength workouts", etc.

Natural Language Processing: Apply NLP techniques to refine the text, extract keywords, and prepare it for easy querying.

Configure API calls to send user queries to ChatGPT and receive responses.

Store structured data from PDFs in Bubble’s database for quick reference by the AI.

New Proposal

Clarification Board Ask a Question

06 May 2024

What about the use of Javascript nodejs ? will you allow me to use expressjs file reader to extract meaningful information from your pdf?
06 May 2024

Will you make use of application programming interface and appi key to just make a request to open ai endpoint and get all response needed and include in your frontend ?

Description

Ukdawgz A.

New Proposal

Clarification Board Ask a Question