contentCrawler finds the 30% of documents in content repositories your search technology cannot

The invisible files lurking in your file management system

Up to 30% of files could be hidden

 

File indexing and search are often the Achilles' heel of document and enterprise content management systems. Image-based files are easily lost: our research indicates that up to 30% of all stored files are invisible to search. If you can't find a document in your repository, you aren't getting the full return on investment from these systems. However, knowing you have a problem is the first step on the road to recovery.

What are the file types invisible to search?

The culprits are usually image-based documents: JPGs, TIFFs, PNGs, and image-only PDFs. Often they are scanned invoices or client IDs, email attachments, or documents bulk-imported as part of a merger or acquisition. Unless OCR (optical character recognition) is applied to them, these documents remain image files with no text layer, so they are never indexed and stay invisible to search.
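As a rough illustration of the file types described above, the following Python sketch sorts file names into likely image-only files, PDFs that would need closer inspection, and everything else. It is a simplification based on file extensions only, not how contentCrawler itself assesses documents; the function names and sample file names are hypothetical.

```python
from pathlib import PurePath

# Image formats named in the article; files of these types carry no text layer.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".tif", ".tiff", ".png"}


def classify(filename: str) -> str:
    """Return 'image', 'pdf-check' or 'other' using the file extension only."""
    ext = PurePath(filename).suffix.lower()
    if ext in IMAGE_EXTENSIONS:
        return "image"
    if ext == ".pdf":
        # A PDF may or may not have a text layer; the extension cannot tell,
        # so it is only flagged for closer inspection.
        return "pdf-check"
    return "other"


def summarize(filenames):
    """Count how many file names fall into each category."""
    counts = {"image": 0, "pdf-check": 0, "other": 0}
    for name in filenames:
        counts[classify(name)] += 1
    return counts


# Hypothetical sample of repository file names.
sample = [
    "invoice_0412.tif",
    "passport_scan.JPG",
    "engagement_letter.docx",
    "contract_signed.pdf",
]
print(summarize(sample))  # {'image': 2, 'pdf-check': 1, 'other': 1}
```

An extension check only finds candidates. Deciding whether a given PDF actually lacks a text layer requires opening the file, which this sketch deliberately leaves out.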

Mobile technology, document ingestion, and staff workarounds have punched huge holes in OCR'ing processes and workflows. This poses significant risks to all kinds of businesses, though perhaps especially so in the legal industry.

Make your files 100% searchable

OCR software assesses each document to determine whether it is image-based and needs to be processed for text. contentCrawler is one such application: it adds the all-important text layer to image-based documents so that they can be indexed and found by search engines.

Knowing where in the workflow to apply OCR is crucial to its success. contentCrawler runs as a backend process rather than a frontend one, which delivers benefits in efficiency, searchability, and cost savings. A backend approach to OCR'ing ensures that every document is made searchable once it is saved into the content repository, irrespective of its entry point.

contentCrawler works in two modes: one monitors newly profiled documents so that they are OCR'ed and made available for indexing immediately; the other OCRs all the legacy documents already in the system.
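The two modes can be pictured as two queues drawn from the same repository. The Python sketch below is only an illustration under stated assumptions: the `Document` record, its fields, and the `last_run` cutoff are hypothetical and do not reflect contentCrawler's actual data model or API.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Document:
    # Hypothetical record; a real DMS or ECM profile holds far more fields.
    doc_id: str
    profiled_at: datetime
    has_text_layer: bool


def needs_ocr(doc: Document) -> bool:
    """A document needs OCR when it has no text layer to index."""
    return not doc.has_text_layer


def new_document_queue(documents, last_run):
    """Mode 1: documents profiled since the last run that still need OCR."""
    return [d.doc_id for d in documents
            if d.profiled_at > last_run and needs_ocr(d)]


def legacy_queue(documents, last_run):
    """Mode 2: older documents already in the system that still need OCR."""
    return [d.doc_id for d in documents
            if d.profiled_at <= last_run and needs_ocr(d)]


# Hypothetical repository contents and cutoff.
last_run = datetime(2018, 3, 1)
documents = [
    Document("A", datetime(2018, 1, 10), has_text_layer=False),
    Document("B", datetime(2018, 3, 2), has_text_layer=False),
    Document("C", datetime(2018, 3, 5), has_text_layer=True),
]

print(new_document_queue(documents, last_run))  # ['B']
print(legacy_queue(documents, last_run))        # ['A']
```

Keeping the two queues separate means the legacy backlog can be worked through at its own pace without delaying documents that have just been saved.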

Request a free audit today to see how many invisible files are lurking in your DMS or ECM. 

© Copyright DocsCorp 2018 - All rights reserved.