As more and more of your files move to the cloud, so should your OCR tool. Searchability is just as essential in an online library as it is on your hard drive.
Our latest integration means contentCrawler can search Microsoft SharePoint Online libraries for image files such as TIFFs and scanned PDFs - even within email attachments. It then adds a text layer to these documents and puts the now-searchable documents back into SharePoint.
A cloud-to-cloud solution
contentCrawler and SharePoint Online are cloud-based solutions, which means processing is faster and more secure as files are never downloaded to local machines. Operating in the cloud also means on-premise infrastructure is unnecessary. contentCrawler running in Microsoft Azure comes preconfigured and ready to run in Audit mode so you can see what percentage of your files are hidden.
contentCrawler currently supports two services, OCR and compression. In the case of the Compression module, contentCrawler will identify documents where a certain level of compression is achievable to free up space for other documents to be added. For example, a combined OCR and Compression service would locate all the image-based documents in SharePoint Online, OCR and convert them to smaller, text-searchable PDFs.