Providing the full spectrum of document, content, & imaging solutions
Prizm Text Extraction

Extract text from more than 300 file formats at a high speed

Prizm Text Extraction functionality enables you to extract text from MS Word, MS Excel, PDF and over 300 other file formats, to create a text data stream that can be processed by content aggregation tools and used for storing, publishing, archiving or searching. The output can be in form of a data stream or a text file, and can be automated to directly import the output into almost any database, repository, or file system.

With text extraction enabled at speeds of up to 15,000+ words per second, search indexing is a breeze. And because Prizm Text Extraction ensures that text and formatting are maintained when converting document formats, keyword searches result in fully preserved content being displayed.

Business Benefits

Save. Prizm Text Extraction is not a virtual printer-based conversion server, which means no installation of a virtual printer on the server is ever required. The Prizm Text Extraction software is based on a Multiple Tenancy Concept, which yields faster conversion rates and performance, as multiple users can access the server at the same time and convert various documents simultaneously. Additionally, no additional file format specific software is required to render or extract text from documents. Save time and money by eliminating the virtual printer queue and reducing your software footprint.

Organize. Prizm Text Extraction enables you to extract text from MS Word, MS Excel, PDF and over 300 other file formats, so whatever documents your business uses can quickly be fully indexed for search, independently of your file system.

Preserve. Prizm Text Extraction not only retains content text accurately but also retains spacing, paragraphs and appearance of the document. Whether you are rendering a MS Word, MS Excel, MS PowerPoint or a PDF, document formatting is fully preserved.