
Python: Regex, ML, vision, Text extraction for template creation
- or -
Post a project like this1821
$1.5k
- Posted:
- Proposals: 6
- Remote
- #3168267
- OPPORTUNITY
- Awarded
Graphic Designer |Experienced Web Designer | Video/Audio Editor | PowerPoint/Keynote | Content Writer |

Full-Stack developer(C++, C#, Python, JavaScript, PHP, React, Django, Node.js, Laravel, Flutter, React-Native)

250936134314163491639487252933651485168111
Description
Experience Level: Expert
1. Description
We need to create a template based flow of documents which automates the templates.
Process is:
1. read the file and extract all text (you)
2. find values in table and compare with text extracted (you)
3. if not found send to template creation setting a vlue to 0 instead of 1(you)
5. manual labeling stores coordinates for every label
6. Based on the coordinates you extract text strings inside of the boxes with regex for example and store values to template table(you)
7. read the document again and extract based on coordinate and compare with media_template table and store the results in media table(you)
2+ time document arrives
1. upload document (already done)
2. Detect if there is a template already. Extract text strings with regex by using coordinates and. Store strings in media table
You work is to quickly do the above work.
2. Skills
Python
MySQL
5+ years
You must have done something similar previously and you know what regex and tesseract is and have used it several times.
You have worked with vision, ML, DL or NN
3. Time
3-4 days is what this script takes to do.
Price is a price holder
We need to create a template based flow of documents which automates the templates.
Process is:
1. read the file and extract all text (you)
2. find values in table and compare with text extracted (you)
3. if not found send to template creation setting a vlue to 0 instead of 1(you)
5. manual labeling stores coordinates for every label
6. Based on the coordinates you extract text strings inside of the boxes with regex for example and store values to template table(you)
7. read the document again and extract based on coordinate and compare with media_template table and store the results in media table(you)
2+ time document arrives
1. upload document (already done)
2. Detect if there is a template already. Extract text strings with regex by using coordinates and. Store strings in media table
You work is to quickly do the above work.
2. Skills
Python
MySQL
5+ years
You must have done something similar previously and you know what regex and tesseract is and have used it several times.
You have worked with vision, ML, DL or NN
3. Time
3-4 days is what this script takes to do.
Price is a price holder
Robert W.
100% (3)Projects Completed
3
Freelancers worked with
3
Projects awarded
17%
Last project
10 Jul 2021
United Kingdom
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies