Compile OCRmyPDF for AWS Lambda function

- or -

Post a project like this

Ends in (days)

2107

Fixed Price

$500

Posted: 6 years ago
Proposals: 4
Remote
#2067499
Awarded

have already sent a proposal.

Description

Experience Level: Expert

Estimated project duration: less than 1 week

Tesseract works very well on AWS Lambda as per (https://stackoverflow.com/questions/33588262/tesseract-ocr-on-aws-lambda-via-virtualenv/35724894).
OCRmyPDF (https://github.com/jbarlow83/OCRmyPDF) takes in PDF input and generates PF output. It uses tesseract as underlying core functionality. I need OCRmyPDF to be compiled for AWS Lambda. The use case is when a image-pdf file gets uploaded on S3 bucket, Lambda gets triggered and will apply OCRmyPDF to the image-pdf file. The output searchable-pdf will be stored back into S3 bucket.

New Proposal

Clarification Board Ask a Question

07 Jul 2018

Do you have all IM_User privileges enabled to access s3 through Lambda function?

Apoorva B.07 Jul 2018
Yes. And Tesseract just works fine for me.

Jakhode M.07 Jul 2018
Ok so the main issue is to make a copy of pdf into image and pushing it on other bucket over s3?
- Show more messages

Description

Apoorva B.

New Proposal

Clarification Board Ask a Question