Checking pdf box values using Python PdFminer
- or -
Post a project like this$$
- Posted:
- Proposals: 5
- Remote
- #2043968
- Expired
Description
Experience Level: Intermediate
The purpose of this script is to validate if specific fields/boxes in a pdf contain specific information.
I want a python script using PDFMiner https://euske.github.io/pdfminer/
This will be a script that can be run from the command line.
It will ask for 3 inputs, 1. pdf file 2. Field to check 3. text in field to validate
The output will say if the value is present or not.
For example in GSA300, Field 3. Contract number contains 47PM1118D1133,
The user should then be able to run the script on the inputs of File: GSA300, Field: 3. Contract number, value: 47PM1118D1133. The script should then output that the field does contain this value.
There should also be a section of the code, where the user enter the input values as variables, instead of having to input it in the command line. This section should be able to be commented out, incase user does not want to use this option. For ex. It would have variables X = File: GSA300, Y = Field: 3. Contract number, Z = value: 47PM1118D1133. then output that the field does contain this value, same as above.
Also would like a print out of pdf in xml format.
Please provide comments explaining codebase
See attached pdfs, for use cases of that will use this script.
I want a python script using PDFMiner https://euske.github.io/pdfminer/
This will be a script that can be run from the command line.
It will ask for 3 inputs, 1. pdf file 2. Field to check 3. text in field to validate
The output will say if the value is present or not.
For example in GSA300, Field 3. Contract number contains 47PM1118D1133,
The user should then be able to run the script on the inputs of File: GSA300, Field: 3. Contract number, value: 47PM1118D1133. The script should then output that the field does contain this value.
There should also be a section of the code, where the user enter the input values as variables, instead of having to input it in the command line. This section should be able to be commented out, incase user does not want to use this option. For ex. It would have variables X = File: GSA300, Y = Field: 3. Contract number, Z = value: 47PM1118D1133. then output that the field does contain this value, same as above.
Also would like a print out of pdf in xml format.
Please provide comments explaining codebase
See attached pdfs, for use cases of that will use this script.
Sayed C.
100% (2)Projects Completed
2
Freelancers worked with
2
Projects awarded
25%
Last project
31 Jul 2021
United States
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies