NetDocuments, a leading cloud-based document and email management service, now integrates with the new cloud-based version of contentCrawler, our integrated analysis, processing and reporting framework. The contentCrawler cloud version runs on the Microsoft Azure platform and is available from the Microsoft Azure Marketplace.
Cloud-to-cloud processing makes OCR quicker and easier than ever
Because contentCrawler (via Azure) and NetDocuments are both cloud-based, files are processed without being downloaded to local machines, which makes processing faster and more secure. Operating in the cloud also means no on-premises infrastructure is necessary, and the software is provisioned within minutes. contentCrawler running in Microsoft Azure comes preconfigured and ready to run in Audit mode, which reports how much non-searchable content exists in the firm’s NetDocuments cabinets.
DocsCorp CTO and Co-Founder Shane Barnett explains, “Now available in the cloud via Microsoft Azure, contentCrawler integration adds an important dimension to NetDocuments, ensuring all cabinets are completely searchable so there are no missing documents and cabinet content is transparent to users. Also, since both solutions are cloud-based, the speed of deployment and security of the contentCrawler/NetDocuments integration is optimal. Our clients will enjoy a new degree of flexibility since contentCrawler desktop and cloud editions now both integrate with NetDocuments.”
“We’re thrilled to see DocsCorp embracing a cloud-first strategy by leveraging the Microsoft Azure platform for the contentCrawler integration with NetDocuments,” Alvin Tedjamulia, CTO at NetDocuments stated. “The rate of innovation in the cloud continues to eclipse that of on-premises or hosted on-premises solutions, and smart companies like DocsCorp are taking full advantage of this by responding to market demand and offering contentCrawler as a cloud-to-cloud integrated solution.”
How will the cloud-to-cloud integration work?
The NetDocuments cloud-based DMS gives firms document and email workflows that are more productive, efficient and secure. However, firms with image-based files and scanned documents cannot take full advantage of NetDocuments' enterprise search engine.
contentCrawler changes that by converting all image-based documents to text-searchable PDFs, which can then be indexed by NetDocuments for improved searchability across a matter, cabinet, or repository.
contentCrawler finds non-searchable documents such as TIFF and scanned PDF files in NetDocuments, even when they are attached to email messages, so that all documents become searchable and retrievable. It then OCRs these files and automatically profiles the resulting searchable documents directly into NetDocuments with minimal intervention.
contentCrawler currently supports two services: OCR and Compression. The Compression module identifies documents where a certain level of compression is achievable, so that storage and upload/download costs can be reduced. IT administrators can combine contentCrawler modules into a single, multi-process service for even greater efficiency and productivity. For example, a combined OCR and Compression service would locate all the non-searchable image-based documents in NetDocuments, OCR them, and convert them to smaller, text-searchable PDFs.
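For readers who think in code, the combined OCR and Compression service described above can be pictured as a small pipeline. The Python sketch below is purely illustrative and does not use the contentCrawler or NetDocuments APIs. The Document class, ocr_stage, make_compression_stage, run_service, and the ratio and threshold values are all hypothetical names and numbers chosen for this example, and the OCR and compression steps are stand-ins that only update metadata.

```python
from dataclasses import dataclass, replace
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class Document:
    """Hypothetical record for a file stored in a cabinet."""
    name: str
    size_bytes: int
    has_text_layer: bool  # False for TIFFs and scanned PDFs


# A stage takes a document and returns the processed document.
Stage = Callable[[Document], Document]


def ocr_stage(doc: Document) -> Document:
    """Stand-in for OCR: marks an image-only file as a text-searchable PDF."""
    if doc.has_text_layer:
        return doc
    stem = doc.name.rsplit(".", 1)[0]
    return replace(doc, name=stem + ".pdf", has_text_layer=True)


def make_compression_stage(estimated_ratio: float, min_saving: float) -> Stage:
    """Stand-in for compression: applied only when the saving meets the threshold."""
    def stage(doc: Document) -> Document:
        saving = 1.0 - estimated_ratio
        if saving < min_saving:
            return doc  # not worth compressing
        return replace(doc, size_bytes=int(doc.size_bytes * estimated_ratio))
    return stage


def run_service(docs: Iterable[Document], stages: List[Stage]) -> List[Document]:
    """Select non-searchable documents and pass each through every stage in order."""
    results = []
    for doc in docs:
        if doc.has_text_layer:
            continue  # already searchable, so it is left untouched
        for stage in stages:
            doc = stage(doc)
        results.append(doc)
    return results


# Assumed sample data: two image-only files and one searchable file.
cabinet = [
    Document("contract.tiff", 4_000_000, False),
    Document("memo.pdf", 200_000, True),
    Document("scan.pdf", 1_000_000, False),
]
combined = [ocr_stage, make_compression_stage(estimated_ratio=0.5, min_saving=0.2)]
processed = run_service(cabinet, combined)
```

In this sketch only the two image-only files are selected, and each passes through OCR and then compression in a single run, which mirrors the idea of one multi-process service rather than two separate passes over the repository.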