As more and more of your files move to the cloud, so should your OCR tool. Searchability is just as essential in an online library as it is on your hard drive.
Our latest integration means contentCrawler can search Microsoft SharePoint Online libraries for image files such as TIFFs and scanned PDFs - even within email attachments. It then adds a text layer to these documents and puts the now-searchable documents back into SharePoint.
A cloud-to-cloud solution
contentCrawler is offered as an on-prem solution which can be integrated with any SharePoint Online environment. However to take advantage of faster processing and elevating the need for files to be downloaded to a local on-prem machine. contentCrawler on-prem can be installed on a Cloud hosted VM such as Microsoft Azure VMs providing a ‘cloud-to-cloud’ solution. Operating in the cloud also means on-premise infrastructure is unnecessary. contentCrawler can be installed and ready to run in Audit mode so you can see what percentage of your files are hidden.
contentCrawler currently supports two services, OCR and compression. In the case of the Compression module, contentCrawler will identify documents where a certain level of compression is achievable to free up space for other documents to be added. For example, a combined OCR and Compression service would locate all the image-based documents in SharePoint Online, OCR and convert them to smaller, text-searchable PDFs.
Get the contentCrawler SharePoint cloud app. Learn more about DocsCorp SharePoint Integration here.