ABBYY unlocks images to boost Google Search Appliance

ABBYY is promoting its new Recognition Server for Google Search Appliance is a solution for organisations looking to ensure that all valuable organizational knowledge is electronically accessible 24x7.

Recognition Server works as a background optical character recognition (OCR) service, enabling Google Search Appliance to index full text content from documents in image formats to make them easily discoverable.

The addition of ABBYY Recognition Server extends Google Search Appliance's indexing capabilities beyond text-based formats such as HTML, DOC, XLS and TXT to support documents in TIFF, JPEG, image-based PDF and other image formats to make them discoverable.

Google Search Appliance is an enterprise search system designed to index searchable and editable file formats on an organization's file servers, Web servers, content management systems, and other resources. Google Search Appliance makes these documents available for the user to search using an easy to use Google interface.

"OCR technology is a valuable tool for organizations of any size that need their documents and information readily available to them, without the cumbersome process of sorting through scattered information," said Dean Tang, CEO of ABBYY USA.

"ABBYY Recognition Server for Google Search Appliance gives enterprise organizations a cost-effective and efficient way to access knowledge that was previously un-retrievable in search results, reducing the organizational cost that results from lost documents."

ABBYY Recognition Server for Google Search Appliance automatically crawls specified network archives, retrieves image files and performs OCR on the document. It then submits the recognized text of the document to Google Search Appliance in an XML feed, accompanied with a link to the original image. The feed is then indexed by Google Search Appliance and the image becomes fully searchable, based on all text elements within.