Convert Documents to Images

  • Proposals 0
  • Remote
  • #22086
  • Expired

Description

Experience Level: Expert
Write a program that uses the open-source Virtual Image Printer driver (http://sourceforge.net/projects/imageprinter/) to convert documents into JPEG images.

1) Documents to be converted will be placed in the "C:\input" folder.
2) The program should monitor that folder for new files.
3) As soon as a new file appears, send it to the Virtual Image Printer driver to print it as an image.
4) Place the output images in a subfolder named after the document under the "C:\output" folder, one image per page. For example, if the document is "hello.doc" and contains 3 pages, there will be 3 output images, namely 1.jpg, 2.jpg and 3.jpg, placed under the folder "C:\output\hello.doc\".
5) If the output directory already exists, its existing contents should be deleted and replaced with the new output images.
6) Delete the original document from the "C:\input" folder after conversion.
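
The steps above could be sketched roughly as follows in Python. This is only an illustration of the folder handling, not a finished implementation: the `render` callback is a hypothetical placeholder for whatever code actually sends a document to the Virtual Image Printer driver, and the function names and the polling approach are assumptions, not part of the requirements.

```python
import shutil
import tempfile
import time
from pathlib import Path
from typing import Callable, List

# Hypothetical callback: renders one document into page images inside the
# given scratch folder and returns them in page order. How it drives the
# Virtual Image Printer driver is deliberately left open here.
Renderer = Callable[[Path, Path], List[Path]]


def prepare_output_dir(output_root: Path, document: Path) -> Path:
    """Create output_root/<document name>, discarding earlier contents (step 5)."""
    target = output_root / document.name
    if target.exists():
        shutil.rmtree(target)
    target.mkdir(parents=True)
    return target


def process_document(document: Path, output_root: Path, render: Renderer) -> List[Path]:
    """Convert one document and store its pages as 1.jpg, 2.jpg, ... (steps 3 to 6)."""
    target = prepare_output_dir(output_root, document)
    results = []
    with tempfile.TemporaryDirectory() as scratch:
        pages = render(document, Path(scratch))
        for number, page in enumerate(pages, start=1):
            destination = target / f"{number}.jpg"
            shutil.move(str(page), str(destination))
            results.append(destination)
    document.unlink()  # step 6: remove the original after conversion
    return results


def poll_once(input_dir: Path, output_root: Path, render: Renderer) -> List[Path]:
    """Process every file currently in input_dir and return the documents handled."""
    handled = []
    for document in sorted(p for p in input_dir.iterdir() if p.is_file()):
        process_document(document, output_root, render)
        handled.append(document)
    return handled


def watch(input_dir: Path, output_root: Path, render: Renderer, interval: float = 1.0) -> None:
    """Poll the input folder forever (step 2); the interval is an arbitrary choice."""
    while True:
        poll_once(input_dir, output_root, render)
        time.sleep(interval)
```

A caller would run something like `watch(Path(r"C:\input"), Path(r"C:\output"), render)` with a real renderer. The sketch does not handle files that are still being copied into the input folder when a poll happens, which a real implementation would need to address.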

Must support the following document formats:
1) MS Office (.doc, .docx, .xls, .xlsx, .ppt, .pptx)
2) OpenOffice (.odc, .ods, .odt, .odp)
3) Other text formats (.txt, .rtf)
4) PDF (.pdf)
5) HTML (.htm)
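
One way to apply this list is a simple extension check before a file is sent to the printer driver. The sketch below is an assumption about how that could look: the names `SUPPORTED_EXTENSIONS` and `is_supported` are made up for illustration, the rich text extension is taken to be .rtf, and matching is case-insensitive by choice. Only the extensions named in the list are included, so .html is not accepted.

```python
from pathlib import Path

# Extensions taken from the format list above, grouped the same way.
SUPPORTED_EXTENSIONS = frozenset({
    ".doc", ".docx", ".xls", ".xlsx", ".ppt", ".pptx",  # MS Office
    ".odc", ".ods", ".odt", ".odp",                     # OpenOffice
    ".txt", ".rtf",                                     # other text formats
    ".pdf",                                             # PDF
    ".htm",                                             # HTML
})


def is_supported(path: Path) -> bool:
    """Return True if the file extension is one of the required formats."""
    return path.suffix.lower() in SUPPORTED_EXTENSIONS
```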
