DocsCorp (www.docscorp.com), a leader in PDF integration and workflow technology, today announced the launch of pdfDocs Content Crawler OCR, a module in its new integrated analysis, reporting and processing framework, pdfDocs Content Crawler. The release will integrate with the document management systems Autonomy iManage and Opentext eDOCS.
Documents often get profiled in the Document Management System (DMS) through a variety of workflow loopholes – fax, scanner and users profiling email attachments. These image-based document workflows bypass the OCR processing that would make them text-searchable. Once in the DMS, these documents become completely “invisible” to the search engines.
“Businesses have made considerable investments in Document Management and Search technologies, but it is estimated that 10-20% of documents in a DMS are non-searchable. This figure represents a significant risk to any business. Its reputation and financial well-being could be impacted simply by failing to produce a specific document on demand,” says David Woolstencroft, DocsCorp President Marketing, Sales and Strategy.
pdfDocs Content Crawler provides a framework for searching an entire DMS database or a subset of documents based on specific DMS queries. The Content Crawler OCR module identifies non-searchable content in image files and PDF files, and even looks inside email attachments. The files are converted to text-searchable PDFs using DocsCorp’s OCR technology and saved back into the DMS. Content Crawler can search and convert backlogs of legacy documents as well as actively monitor newly profiled documents.
Woolstencroft adds, “If you don’t know the extent of the problem, or you are not sure if you have a problem, DocsCorp invites you to use Content Crawler (trial version mode) to provide an audit report of your DMS documents.”
The current release integrates with Autonomy iManage 8.2 or higher and Opentext eDOCS DM 5.1.05 or higher. Further DMS and Content Repository integrations will follow.
www.docscorp.com/contentcrawler