
Urgent OCR
£200(approx. $266)
- Posted:
- Proposals: 20
- Remote
- #4481707
- OPPORTUNITY
- Open for Proposals
Description
Experience Level: Entry
This Dropbox folder contains three folders called Bishopsgate archives, Newham archives and special collections, and RIBA collections: https://www.dropbox.com/scl/fo/ixwakdbgdhodiq9sqlbk6/AAuJLgrYLb2WB1a--E_b76A?rlkey=y0f9poa3t1rpalmemh2qljs2n&st=dfvttlf8&dl=0. Ignore the folder called riba objects called scanning.
Each of those three folders has many subfolders. For each of those subfolders, follow these instructions.
OCR INSTRUCTIONS
1. One file per folder
For each archive folder (e.g. TBUK1, NEWHAM2), create ONE plain text file (.txt).
Put all documents from that folder into that one file.
2. Clear document separation and stable IDs
Every time a new document starts, write:
==============================
ARCHIVE_FOLDER: TBUK1
DOCUMENT_ID: TBUK1_01
DATE: (write date exactly as shown, or Unknown)
PLACE: (write place exactly as shown, or Not stated)
------------------------------
Then paste the full OCR text of that document.
For the next document:
==============================
ARCHIVE_FOLDER: TBUK1
DOCUMENT_ID: TBUK1_02
DATE:
PLACE:
------------------------------
Continue sequentially: TBUK1_03, TBUK1_04, etc.
For a different folder (e.g. NEWHAM2):
ARCHIVE_FOLDER: NEWHAM2
DOCUMENT_ID: NEWHAM2_01
Always include the folder prefix in the DOCUMENT_ID. Numbering restarts at 01 only under a new folder prefix.
If date or place is not visible:
DATE: Unknown
PLACE: Not stated
Do NOT guess.
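The header layout above could be generated with a small helper so that separators and IDs stay consistent. This is only a sketch: the function name `document_header` and its parameters are illustrative, the two-digit zero padding is inferred from the examples (TBUK1_01, TBUK1_02), and the separator lines are assumed to be 30 characters long.

```python
from typing import Optional


def document_header(folder: str, index: int,
                    date: Optional[str] = None,
                    place: Optional[str] = None) -> str:
    """Build the separator block that precedes each document's OCR text.

    `date` and `place` should be passed exactly as shown on the document.
    When they are not visible, pass None and the fallback values from the
    instructions are written instead (nothing is guessed).
    """
    lines = [
        "=" * 30,
        f"ARCHIVE_FOLDER: {folder}",
        f"DOCUMENT_ID: {folder}_{index:02d}",  # folder prefix is always kept
        f"DATE: {date if date else 'Unknown'}",
        f"PLACE: {place if place else 'Not stated'}",
        "-" * 30,
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    print(document_header("TBUK1", 1))
    print(document_header("NEWHAM2", 1, date="3rd March 1954", place="London"))
```

The date and place in the usage lines are made-up placeholders, not values from the archive.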
3. OCR rules
* Copy text exactly as written.
* Do NOT correct spelling or grammar.
* Do NOT rewrite sentences.
* Do NOT summarise.
* Keep paragraph breaks.
* Remove page numbers.
* If a word cannot be read, write: [illegible]
* Do not insert commentary.
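Because the text must otherwise be copied exactly, page numbers are probably safer to flag for a manual check than to delete automatically. The sketch below lists lines that look like bare page numbers. The pattern is a heuristic assumption (a standalone number of up to four digits, optionally written as "Page 4" or "- 3 -"), and the name `page_number_candidates` is illustrative. A line holding only a year would also be flagged, which is why nothing is removed by the code itself.

```python
import re
from typing import List

# Heuristic only: a line holding nothing but a short number,
# optionally prefixed by "Page" or wrapped in dashes.
PAGE_NUMBER = re.compile(r"^\s*(?:page\s+)?-?\s*\d{1,4}\s*-?\s*$", re.IGNORECASE)


def page_number_candidates(text: str) -> List[int]:
    """Return 1-based line numbers that look like standalone page numbers."""
    return [
        number
        for number, line in enumerate(text.splitlines(), start=1)
        if PAGE_NUMBER.match(line)
    ]


if __name__ == "__main__":
    sample = "Dear Sir,\n12\nWe received 12 crates.\n- 3 -\nPage 4"
    print(page_number_candidates(sample))
```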
4. Hand-drawn diagrams
Do NOT attempt full OCR of technical drawings.
Instead include:
==============================
ARCHIVE_FOLDER: TBUK1
DOCUMENT_ID: TBUK1_XX
DATE:
PLACE:
------------------------------
HAND-DRAWN ENGINEERING DRAWING
Title: (if visible)
Location: (if visible)
Company: (if visible)
If no readable text at all:
HAND-DRAWN ENGINEERING DRAWING (no readable text)
Do NOT copy measurements or technical numbers from diagrams.
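The drawing entry above can be sketched as a helper that writes only the fields that are visible and falls back to the "no readable text" line when none are. The function name `drawing_body` is illustrative, and it produces only the part below the dashed separator. Omitting a field that is not visible is one reading of "(if visible)".

```python
from typing import Optional


def drawing_body(title: Optional[str] = None,
                 location: Optional[str] = None,
                 company: Optional[str] = None) -> str:
    """Build the body of an entry for a hand-drawn engineering drawing.

    Measurements and technical numbers are deliberately not accepted as input.
    """
    fields = [("Title", title), ("Location", location), ("Company", company)]
    visible = [f"{label}: {value}" for label, value in fields if value]
    if not visible:
        return "HAND-DRAWN ENGINEERING DRAWING (no readable text)"
    return "\n".join(["HAND-DRAWN ENGINEERING DRAWING"] + visible)


if __name__ == "__main__":
    print(drawing_body(title="Example title"))  # placeholder value
    print(drawing_body())
```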
5. Save format
* Save as .txt
* Use UTF-8 encoding
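A minimal sketch of the save step, assuming each folder's entries have already been assembled as strings in document order. The output name `<folder>.txt` and the function name `save_folder_text` are assumptions, since the instructions do not specify a file name.

```python
from pathlib import Path
from typing import List


def save_folder_text(folder: str, entries: List[str], out_dir: Path) -> Path:
    """Write all entries for one archive folder into a single UTF-8 .txt file."""
    out_dir.mkdir(parents=True, exist_ok=True)
    path = out_dir / f"{folder}.txt"  # assumed naming: one file per folder
    # Passing the encoding explicitly avoids the platform default
    # (which is not always UTF-8).
    path.write_text("\n".join(entries) + "\n", encoding="utf-8")
    return path


if __name__ == "__main__":
    saved = save_folder_text("TBUK1", ["first entry", "second entry"], Path("ocr_output"))
    print(saved)
```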
6. For each subfolder, also export one PDF for me to refer to easily. Each PDF must be under 30MB. You can use these lower-res versions of the subfolders for the PDFs: https://www.dropbox.com/scl/fo/hjed8pk08njhzfns6hhvb/ALP8N6HCGRHAuYwiJR_tEg4?rlkey=t4p5y3q3954monibbm2tktz90&st=lb5eo4lp&dl=0
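Before delivery, the size limit can be checked with a short script. This is a sketch under two assumptions: the exported PDFs sit together in one directory, and 30MB is read as 30,000,000 bytes (the stricter of the common interpretations). The name `oversized_pdfs` is illustrative.

```python
from pathlib import Path
from typing import List

MAX_PDF_BYTES = 30 * 1000 * 1000  # assumption: 30MB taken as 30,000,000 bytes


def oversized_pdfs(pdf_dir: Path, limit: int = MAX_PDF_BYTES) -> List[str]:
    """Return the names of PDFs that are not strictly under the size limit."""
    return sorted(
        pdf.name for pdf in pdf_dir.glob("*.pdf") if pdf.stat().st_size >= limit
    )


if __name__ == "__main__":
    too_big = oversized_pdfs(Path("pdf_exports"))
    print("All PDFs under limit" if not too_big else f"Too large: {too_big}")
```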
Sasha W.
100% (16)
Projects Completed: 20
Freelancers worked with: 18
Projects awarded: 44%
Last project: 13 Mar 2026
United Kingdom