Compile OCRmyPDF for AWS Lambda function
- or -
Post a project like this2107
$500
- Posted:
- Proposals: 4
- Remote
- #2067499
- Awarded
Description
Experience Level: Expert
Estimated project duration: less than 1 week
Tesseract works very well on AWS Lambda as per (https://stackoverflow.com/questions/33588262/tesseract-ocr-on-aws-lambda-via-virtualenv/35724894).
OCRmyPDF (https://github.com/jbarlow83/OCRmyPDF) takes in PDF input and generates PF output. It uses tesseract as underlying core functionality. I need OCRmyPDF to be compiled for AWS Lambda. The use case is when a image-pdf file gets uploaded on S3 bucket, Lambda gets triggered and will apply OCRmyPDF to the image-pdf file. The output searchable-pdf will be stored back into S3 bucket.
OCRmyPDF (https://github.com/jbarlow83/OCRmyPDF) takes in PDF input and generates PF output. It uses tesseract as underlying core functionality. I need OCRmyPDF to be compiled for AWS Lambda. The use case is when a image-pdf file gets uploaded on S3 bucket, Lambda gets triggered and will apply OCRmyPDF to the image-pdf file. The output searchable-pdf will be stored back into S3 bucket.
Apoorva B.
100% (1)Projects Completed
1
Freelancers worked with
1
Projects awarded
50%
Last project
19 Jul 2018
United States
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
Do you have all IM_User privileges enabled to access s3 through Lambda function?
Apoorva B.07 Jul 2018Yes. And Tesseract just works fine for me.
Jakhode M.07 Jul 2018Ok so the main issue is to make a copy of pdf into image and pushing it on other bucket over s3?
670983
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies