AccuSoft ImageGear Professional
PDF Module for HP-UX and Linux (PPC)
The ImageGear PDF module lets you read and rasterize Adobe PDF,
PostScript (PS) and Encapsulated PostScript (EPS) files.
The resulting images can be used with any ImageGear function or module
and written out in any format supported by ImageGear to create
applications of unsurpassed power and flexibility!
The ImageGear PDF module also allows you to read the original PDF
document and extract textual information from it without resorting to
OCR. The resulting text can be dropped into any document for editing,
filing or transmittal.
Highlights
- Convert PDFs for use with your existing document management systems
- Display and print PDFs without any additional components
- Convert PDFs for use in Web pages (without requiring your readers to install
file viewers)
The PDF module works through the core ImageGear API and offers additional control and flexibility through image control parameters.
Read Support
- PDF 1.4 standard used by Acrobat 5 (and any previous versions)
- PostScript (PS) level 3
- Encapsulated PostScript (EPS)
- Multi-page file support (PDF and PS only); one-page-per-image
- Rasterizes at any resolution up to the limits of system memory
- Text and graphic anti-aliasing
- Application-controllable resolution, color depth, anti-aliasing color depth, and text encoding
- Included set of 35 PDF fonts for imaging documents
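Because rasterization is limited only by system memory, it can help to estimate the size of an uncompressed page raster before choosing a resolution and color depth. The sketch below is a rough calculation in Python and is not part of the ImageGear API: the helper name `raster_bytes` and the US Letter page size are assumptions made for illustration, and rows are assumed to be padded only to whole bytes.

```python
import math

def raster_bytes(width_in, height_in, dpi, bits_per_pixel):
    """Estimate the size in bytes of an uncompressed page raster.

    Hypothetical helper for illustration only. Assumes each row is
    padded to a whole number of bytes and ignores per-image overhead.
    """
    width_px = math.ceil(width_in * dpi)
    height_px = math.ceil(height_in * dpi)
    row_bytes = math.ceil(width_px * bits_per_pixel / 8)
    return row_bytes * height_px

# Assumed example: a US Letter page (8.5 x 11 inches) at 300 dpi
for bpp in (1, 8, 24):
    size = raster_bytes(8.5, 11, 300, bpp)
    print(f"{bpp:>2} bpp: {size / 2**20:.1f} MiB")
```

Doubling the resolution quadruples the pixel count, so by this estimate the same page at 600 dpi and 24 bits per pixel needs about 96 MiB.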
Write Support
- Creates files conforming to PDF 1.4 standard used by Acrobat 5
- Uncompressed 1-, 4-, 8-, and 24-bit-per-pixel images
- Compressed CCITT G3 and G4 1-bit-per-pixel images
- JPEG-compressed 8- and 24-bit-per-pixel images
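The write options above depend on the bit depth of the image, so an application may want to check the combination before saving. The following Python sketch simply transcribes the list into a lookup table. It does not call the ImageGear API, and the names `WRITE_OPTIONS` and `write_options` are hypothetical.

```python
# Storage options per bit depth, transcribed from the Write Support list.
# Hypothetical table for illustration; not an ImageGear data structure.
WRITE_OPTIONS = {
    1: ["uncompressed", "CCITT G3", "CCITT G4"],
    4: ["uncompressed"],
    8: ["uncompressed", "JPEG"],
    24: ["uncompressed", "JPEG"],
}

def write_options(bits_per_pixel):
    """Return the storage options listed for a given bit depth.

    Raises ValueError for bit depths that the list does not mention.
    """
    try:
        return WRITE_OPTIONS[bits_per_pixel]
    except KeyError:
        raise ValueError(
            f"{bits_per_pixel}-bit images are not listed for PDF write"
        ) from None

print(write_options(1))
print(write_options(24))
```

Under this reading, an image at an unlisted depth would need to be converted to one of the listed depths before it is written to PDF.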
PDF Text Extraction
The PDF module now includes text extraction. ImageGear reads the original PDF file and extracts its textual content with close to 100% accuracy for English text, although some symbols, such as hyphens, may be extracted incorrectly. This is not OCR output; rather, the software scans the original source document (a PDF file), ignores all formatting characters and notation, and outputs the words as a text file.
Exceptions
PDF support is compatible with PDF 1.0, 1.1, 1.2 and 1.3 as defined in the Portable Document Format Reference Manual Version 1.3 of March 11, 1999, distributed by Adobe Systems Incorporated, with the following exceptions:
- Masked images
- Type 2 (smooth shading) Pattern color spaces, although the operator for smooth-shaded fills is implemented
- Native Separation and DeviceN color spaces - the alternate space is always used
- Native ICCBased color spaces - the alternate space is always used
- In-RIP trapping
- Settable ProcessColorModel for page devices, except for a very few special devices
- IODevices other than %stdin, %stdout, %stderr, %lineedit, %statementedit, %os%, and (if configured) %pipe%
