AccuSoft ImageGear Professional

Download ImageGear

PDF Module for HP-UX and Linux(PPC)

The ImageGear PDF module lets you read and rasterize Adobe PDF, PostScript (PS) and Encapsulated PostScript (EPS) files.

The resulting images can be used with any ImageGear function or module and written out in any format supported by ImageGear to create applications of unsurpassed power and flexibility!

The ImageGear PDF module also allows you to read the original PDF document and extract textual information from it without resorting to OCR. The resulting text can be dropped into any document for editing, filing or transmittal.

Highlights

 

The module also provides the capability to write images out as PDF files. For example, combine this with ImageGear's ISIS module and you can create a system for scanning documents directly to PDF.

The PDF module works through the core ImageGear API offering extra control flexibility by using image control parameters.

Read Support



Write Support



PDF Text Extraction



PDF Text extraction has been added to the PDF module.  ImageGear can read the original PDF file and extract the textual content at just about 100% accuracy (English text is approximated at close to 100%, but some symbols in the extracted text (i.e.,  hyphens) may be approximated incorrectly).  This is not OCR output; rather the software scans the original source document (a PDF file), ignores all the format characters and notation, and outputs all the words as a text file.

Exceptions



PDF support is compatible with PDF 1.0, 1.1, 1.2 and 1.3 as defined in the Portable Document Format Reference Manual Version 1.3 of March 11, 1999, distributed by Adobe Systems Incorporated, except as noted with the following exceptions:

PS/EPS support is compatible with the Level 3 PostScript (TM) language with the following exceptions: