Create OCR centered proof of concept
- or -
Post a project like this2426
£300(approx. $409)
- Posted:
- Proposals: 5
- Remote
- #474044
- Awarded
Description
Experience Level: Expert
Estimated project duration: 1 - 2 weeks
General information for the business: validating a business idea
Kind of development: New program from scratch
Description of requirements/functionality: Ultimately I want to create an online app that will take bills, and other unstructured scanned paperwork / documents and populate a database with the data.
For the proof of concept, I want to show that we can take documents and dump relevant content into a table displayed on a website in real time.
The scope could be limited to some predetermined templates
I see two ways of doing this,
1. OCR with vectorisation, i..e the page is split into a grid and we tell the software what each element in the grid aligns with.
2. OCR with text matching, i.e. we look for keywords on a page and associated nearby content with the content we are looking for
3. any ideas you can come up with
CMS and Admin requirements: Ability to add a file
Purge the database after each file upload
display the file content in a table
Specific technologies required: does not matter for this but should be able to run on a unix/linux web host
OS requirements: Linux
Extra notes: attached is an example water bill which would need to be converted. output in a table could be:
1. name of supplier
2. payment period
3. payment amount
4. payment type
5. payment due date
etc
Kind of development: New program from scratch
Description of requirements/functionality: Ultimately I want to create an online app that will take bills, and other unstructured scanned paperwork / documents and populate a database with the data.
For the proof of concept, I want to show that we can take documents and dump relevant content into a table displayed on a website in real time.
The scope could be limited to some predetermined templates
I see two ways of doing this,
1. OCR with vectorisation, i..e the page is split into a grid and we tell the software what each element in the grid aligns with.
2. OCR with text matching, i.e. we look for keywords on a page and associated nearby content with the content we are looking for
3. any ideas you can come up with
CMS and Admin requirements: Ability to add a file
Purge the database after each file upload
display the file content in a table
Specific technologies required: does not matter for this but should be able to run on a unix/linux web host
OS requirements: Linux
Extra notes: attached is an example water bill which would need to be converted. output in a table could be:
1. name of supplier
2. payment period
3. payment amount
4. payment type
5. payment due date
etc

Dan J.
100% (4)Projects Completed
6
Freelancers worked with
6
Projects awarded
50%
Last project
18 Dec 2014
United Kingdom
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We use cookies to improve your experience and our services. By using PeoplePerHour, you agree to ourCookie Policy