Document Imaging Library Features

The ImageGear document and imaging library includes a wide variety of features to help you build your application quickly. The entire document imaging lifecycle, from scanning and document creation through printing, viewing and archiving, can be found in the expansive features set of the ImageGear library.

File Format Support

Load and save over 100 image, graphic, and document formats, including raster and vector images, PDF, JPEG, JPEG 2000, GIF, TIFF, DICOM, CAD, ABIC Check Imaging, HD Photo, camera raw, and more (see complete list here).

All formats also have robust save options for complete format control. The ImageGear library easily converts images from one format to another. Load and display Microsoft Word, Excel, and PowerPoint documents. You may also load raw image files without headers.

  • Read and write metadata without decoding or recompressing an image.
  • Work with a wide variety of file formats across bit depths.
  • Create batch conversion processes and eliminates compatibility issues.
  • Access all pages of any multi-page file.
  • Deliver fast TIFF, JPEG, GIF, MO:DCA, and Group IV processing.
Need DICOM format? See ImageGear Medical.

Raster Formats

  • Easily convert to and from various raster image formats such as TIFF, JPEG, and PDF.
  • Quickly convert various images into one standard format for consistent viewing.
  • Build applications to convert to and from bitmap images with various bit depths, palettes, compression, and encoding options.
  • It includes hundreds of digital camera raw formats. (See the list of raw formats).

Vector Formats

This feature enables an app to read, render, and write vector files such as PDF, CAD, SVG, and XPS in their native formats.

  • Read, write, and render vector images.
  • Read and write DWG and DXF file formats.
  • Render 3D wireframe objects.
  • Render solid 3D objects with adjustable lighting for shading purposes.
  • Read and write in the SVG file format.

Image Viewing & Display

Build Windows-based viewers using ActiveX controls or DLLs and for .NET, WinForms, ASP.NET, and WPF. Use shared libraries for Linux and Unix applications or frameworks for Mac OS X applications. The advanced viewing features in this document imaging library can be added rapidly into your applications including GUI features such as common dialogs for image loading, saving, processing, and more. High-speed display provides complete control over how your application displays images. Shorten development efforts by using GUI functions that simplify the implementation of several sophisticated GUI features. Provide a better UX with thumbnail creation for easy navigation of folders and multi-page files. The product includes common dialogs for image loading, saving, processing, and more.

Manipulate and Manage PDFs

ImageGear brings a variety of features to the table including the ability to:

  • Create, edit, view, annotate, and print PDFs to your application;
  • Automatically conform to the PDF Language Standard;
  • Open and process PS and EPS files;
  • Convert Microsoft Office documents to PDF and existing PDF files to PDF/A;
  • Create PDF/A files from raster images;
  • Read, write, display, and edit Portable Document Format (PDF), PostScript (PS), and Encapsulated PostScript (EPS);
  • Comprehend PDF API for native PDF annotations, searchable text, and more.

PDF Annotations

Add annotations to a PDF document. Annotation marks include text, line, freehand polyline, rectangle, ellipse, polygon, polyline, audio, image, ruler, protractor, encryption, button, hot spot, and rich text. This version also includes the ability to import and export these marks to XML files. The application developer has the flexibility to embed a subset of these annotation types into the PDF file as a true Adobe annotation or to save them in a separate file.

PDF Highlights

  • Ability to convert Microsoft Office documents to PDF
  • Support Adobe® PDF v1.7 and PS v3
  • Easily read and write to and from PDF, PostScript, and Encapsulated PostScript files
  • Quickly add a wide variety of annotations to a PDF document
  • Preserve vector data when saving PDF files
  • Work with 3D PDF content in Universal 3D (U3D) format
  • Provide more control of displaying individual CMYK channels and Pantone spot colors in a PDF
  • Convert scanned pages into PDF searchable text using the ImageGear OCR functionality
  • Create PDF/A files from raster image files and scanned images
  • Verify PDF documents for reliable graphic content exchange through PDF/X (PDF/X-1a, PDF/X-3, and PDF/X-4) and PDF/A (PDF/A-1a and PDF/A-1b) compliance
  • PDF Text Extraction to allow for the extraction of words from PDF documents or specified pages, including word enumerating and sorting and obtaining word layouts, styles, and characters
  • Double byte support so that information can be shared across multiple languages including Japanese, Chinese, Korean, Arabic, and Hebrew
  • Native PDF document printing renders document content directly to the printer, providing more speed and reducing memory requirements
  • Embed and subset fonts into PDFs to ensure the document is viewed exactly as it was created
  • Provides listing of available host system fonts and finds system/PDF font matches, as well as font creation from system fonts and for the editing of font information

Full-Page OCR

ImageGear provides OCR for over 100 languages, including Asian languages, and automatic redaction of regular expression search results. OCR can be purchased as an add-on to provide a complete document imaging library for your application development.

OCR Languages

  • Includes over 100 different languages
  • Asian Languages (Chinese, Korean, and Japanese)
  • Recognizes characters from multiple languages within a single image
  • For a complete list of languages, please see our product documentation

OCR Zone Based Processing

Auto-Zoning (Segmentation):

  • Automatically segments page into individual zones for processing.
  • Assigns a type to located zones based on expected content: flow, table, graphic.
  • Improves recognition results and performance by removing image areas from page prior to OCR operation.
  • Advanced table detection improves data result reconstruction.

User Defined Zones:

  • Process an entire image or individual region of the page.
  • Zones can be defined on the fly by a user, loaded from a file, or detected automatically by the engine.
  • Flexible API gives developers the ability to define areas of images to be processed and the type of content located in that defined area.
  • Apply advanced data checking zone by zone.
  • If you have specialized content, you can define the appropriate recognition module for your content.

OCR Editions

OCR Language Options
The ImageGear functionality works on two sets of languages: Western and Asian. These language options are licensed separately for development and deployment. The Asian language offering has some basic support for western characters but will not utilize any dictionaries to improve results. If you need both sets of languages, please contact us.

OCR Deployment Options
Languages for the ImageGear Professional DLL functionality is available in three different editions for distribution, standard, plus, and Asian. The primary difference between these versions is the list of output formats created by the OCR engine and the addition of Asian languages.

Standard Edition
The PDF formatting in the Standard Edition is accomplished using text output reported by the OCR engine and ImageGear’s internal PDF engine.

Standard Edition Output Formats:

  • Searchable text PDF files
  • Text documents

Plus Edition
Formatted output is created by using all of the recognition information (font detail, located image areas, and recognized table structure information) to reconstruct a representation of the original document. The Plus Edition leverages the power of the OCR engine to create the robust formatted output.

Plus Edition Output Formats:

  • Searchable text PDF files
  • Text documents
  • Word
  • Excel
  • HTML
  • Etc.

Asian Edition
Formatted output is created by using all of the recognition information (font detail, located image areas, and recognized table structure information) to reconstruct a representation of the original document. The Asian Edition leverages the power of the OCR engine to create the robust formatted output of document images with Asian language.

Asian Edition Output Formats:

  • Text documents
  • Word
  • Excel
  • HTML
  • Etc.

OCR Image Pre-Processing

  • Advanced image processing methods are available to improve OCR accuracy.
  • Auto inversion functionality detects if the image needs to be inverted for highest accuracy.
  • Automatic image orientation detects and adjusts images so they are properly oriented.
  • Deskew methods detect image misalignment and automatically correct it, improving segmentation and recognition accuracy.
  • Despeckling methods remove minor dots and imperfections in the image capture process.
  • Resolution enhancement improves the quality of the low resolution images.

OCR Data Checking

  • A complete checking subsystem improves recognition accuracy.
  • ImageGear uses advanced spell check for 17 different languages, each in a specific dictionary. Each of the 17 dictionaries contain between 100,000 to 200,000 entries.
  • Vertical dictionaries improve spell checking and OCR accuracy for medical and legal industries.
  • Customize validation by defining user dictionaries with values specific to your needs.
  • Validate results using regular expressions.

OCR Result Processing

Recognition Details

  • Each character is returned with an accuracy confidence value.
  • Separate word confidence values provide additional accuracy indication.
  • Advanced font information and location information allows ImageGear to create text representations of the original with a similar layout.

Language Control

  • The ImageGear OCR engine processes all data in a Unicode format. The data output can be formatted for a specific code page.

Multiple Output Format Options

  • Image over PDF
  • Text based PDF
  • Microsoft Office 2007
  • Microsoft Office 97 (Word, Excel, and Powerpoint)
  • RTF
  • HTML
  • XML

Annotation

Give your users the ability to add, edit, and burn-in XML-based annotations and define your own annotation types. Users can mark up any image with text note, redactions, line, arrow, rectangle, ellipse, highlight, protractor, ruler, and more.

Developers can build collaboration workflows allowing each person to attach comments to images as they are viewed and burn-in annotations, such as for permanent redaction of sensitive information.

You can merge annotations with another image or keep the annotations in a separate file. The separate file can act as an overlay to display the changes. In this way, the original image is never directly altered. The Accusoft Redlining Toolkit™ (ART) component is a flexible and powerful annotation library. It provides a convenient way to add annotations, drawings, hyperlinks, and more to your images.

This feature works with WinForms, WPF, and ASP.NET and offers easy to implement UI elements with XML-based annotation for images and medical data.

Process and Edit Document Images

Empower your applications to perform a broad range of image cleanup, correction, and transformation functions.

Easy-to-use ImageClean™ includes:

  • Hole punch removal
  • Line removal
  • Dotted line removal
  • Clean borders
  • Negate
  • Auto-crop
  • Image dilation
  • Erosion
  • Etc.

Additional cleanup features are available with ScanFix Xpress.

Easy to use ImageClean™

Correct scanning and faxing issues using despeckle and deskew.

Image Correction, such as Despeckle and Deskew

  • Image Maintenance includes cropping, resizing, thumbnail creation, encryption, and decryption.
  • Mathematical Morphology includes edge detection, noise removal, image enhancement, image segmentation, opening, closing, and more.
  • Image Transformation rotates the file.
  • Image Correction acts as a tool to despeckle and deskew the file.

Photo and Color Processing

Easily manipulate photos and color images to get enhanced image quality and compressed image file sizes. Region of Interest (ROI) permits specification of a shape, such as an ellipse, polygon, freehand, or 1-bit mask, for identifying pixels to include/exclude from image processing algorithms. Powerful color reduction methods are available for maximum quality and minimum size, using dithering or halftone.

  • Area detection and processing uses a predefined or custom pixel checking method.
  • ICC color profile allows for accurate color display.
  • Pantone channels enables true image representation.
  • Blending and combining images combines data from two or more images, such as Alpha Blend.
  • Blending and Combining Images

  • Image Maintenance includes cropping, resizing, thumbnail creation, encryption, and decryption.
  • Image Transformation is the rotation of the file.
  • Advanced Image Processing methods adjust brightness and contrast, reduce or promote bit depth, and sepia.
  • Advanced Image Processing methods

Printing

Maximize image print quality and control while minimizing the time and effort required to integrate print functions. Overcome inherent print driver limitations to ensure the highest possible print quality and size and color correct. You can also perform single or multi-page printing. Functions control your sizes, color correction, pagination, and the ability to build custom printing dialogs.

Scanning

Comprehensive scanning features allow for actionable functions like:

  • Programmatically build your own user interface
  • Multi-page and single page scanning with configurable parameters
  • Indicating the brightness and contrast during acquisition
  • Defining the resolution for acquisition
  • Offering a single call function to initiate the scanning process
  • Using events to control the scanning process
  • Easily interfacing with the Automatic Document Feeder (ADF) for multi-page scanning capabilities

It also includes high-level functions that provide quick implementation of superior TWAIN support without a steep learning curve.

ISIS Scanning
Build custom interfaces for high-speed ISIS® scanners. Capture data from paper documents and deliver it to custom applications or ECM systems. ISIS delivers interface consistency, high speed, and reliability for scanning applications.

TWAIN
ImageGear provides comprehensive support for TWAIN devices including scanners, digital cameras, and video capture boards. Experience the control of TWAIN transfer modes. ImageGear interfaces with TWAIN data source listing.

Scanning From a Web App
Hook web apps into scanner functions to deliver high-quality scanning directly to cloud storage repositories. This feature enables web applications to communicate with document scanners without requiring special servers or ActiveX plug-ins.
It leverages the power of ISIS to control scanner features and provides connectivity to TWAIN-based devices. It is available in zero-footprint capture for all devices that support Cloud Capture natively or a one-time download of the Cloud Capture web services component. Support for popular Windows browsers like, Internet explorer, Firefox, Safari, and Chrome is offered.