OCR Technical Manuals

  • Posted
  • Proposals 1
  • Remote
  • #4228
  • Expired

Description

Experience Level: Intermediate
We require OCR, followed by accurate proofreading and correction, of about 100 technical documents (vehicle workshop manuals and parts manuals). The number may rise, the project may be split, or only a sample batch may be undertaken. There is no translation element involved: some of the material contains Italian text, but the majority is in English.
The manuals have already been scanned and will be supplied as PDF documents or as OmniPage Pro v15/v16 files.

A. Parts Manuals (between 100 and 300 pages, with a varied amount of content per page; a sample of the two page types is enclosed)
These consist of just the parts listing pages that relate to exploded diagrams of the various systems on a motor vehicle (the diagrams are not being OCR'd and will not be supplied) and the index pages. Three parts are to be captured:
1. The main body to be output to an Excel file in a table format under the following headings:
Page Number; Position Number; Part Number; Quantity; Description (English); Description (Italian){only if listed}
2. There is an index, which needs to be captured to Excel as three columns:
Part Number; Page Number; Position
3. A summary of the pages is also required in Excel as two columns:
Page Number; Page Title
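
Because the index (part 2) repeats the part number, page and position already captured in the main body (part 1), the two tables can be checked against each other after proofing. The following is only a minimal Python sketch of that idea, assuming the index and the listings are meant to agree. The names BodyRow, IndexRow and cross_check, and the part numbers shown, are hypothetical and are not taken from the manuals.

```python
from typing import NamedTuple


class BodyRow(NamedTuple):
    """One row of the main parts listing (part 1 of the specification)."""
    page: str
    position: str
    part_number: str
    quantity: str
    description_en: str
    description_it: str = ""  # Italian description, only if listed


class IndexRow(NamedTuple):
    """One row of the index (part 2 of the specification)."""
    part_number: str
    page: str
    position: str


def cross_check(body, index):
    """Report (part number, page, position) entries found in only one table."""
    body_keys = {(r.part_number, r.page, r.position) for r in body}
    index_keys = {(r.part_number, r.page, r.position) for r in index}
    return {
        "index_only": sorted(index_keys - body_keys),
        "body_only": sorted(body_keys - index_keys),
    }


# Hypothetical sample data, for illustration only.
body = [
    BodyRow("12", "1", "4403221", "2", "Bolt", "Vite"),
    BodyRow("12", "2", "4403280", "1", "Washer"),
]
index = [
    IndexRow("4403221", "12", "1"),
    IndexRow("4403230", "12", "2"),  # differs from the body row by one digit
]

result = cross_check(body, index)
```

Any entry reported in result is a candidate for a second look at the scanned page. Part numbers are kept as text rather than converted to numbers so that leading zeros and separators survive unchanged.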

B. Workshop Manuals (between 150 and 500 pages)
Mixed text and graphics/illustrations/tables. The text is often in four languages, one column per language (we are only interested in OCR'ing the English).
Manuals are supplied in PDF/OmniPage Pro format, and the output is to be in the same format, along with the final OmniPage Pro file.

OmniPage Pro v16 is the software we have used in the past for such work, although this is not set in stone and we are open to alternatives. If you have done this sort of work before, you'll be aware that the first couple of manuals will take longer while the OCR software builds its dictionary of technical words and abbreviations.

The pricing structure is flexible. A cost per page is probably the simplest, with one rate for parts pages and one for workshop manuals, but we are open to offers. Accuracy of the part numbers is CRITICAL.

A brief résumé of similar completed projects would be useful, and you may wish to return the enclosed sample pages, processed as per the specification, to show your understanding of the requirement.
