logo

Free Trial
  • About
    • About Us
    • Industry Guides
    • Watch Our Story
    • Customer Success Stories
    • Contact Us
  • Solutions
    • Redaction
    • Finding Documents
    • Recipient Checking
    • Manage Metadata
    • Document Comparison
    • Document Bundling
    • OCR for Dropbox
    • Legal Software
    • Accounting Software
    • Mimecast
  • Products
      • veroDocs
      • cleanDocs
      • cleanDocs Server
      • compareDocs
      • compareDocs Cloud
      • pdfDocs
      • pdfDocs Binder
      • contentCrawler
      • contentCrawler Cloud
  • Developers
    • compareDocs SDK
    • compareDocs Cloud API
  • Integrations
    • iManage
    • NetDocuments
    • OpenText
    • SharePoint
    • Worldox
    • Other Integrations
  • News
    • Press Releases
    • Events and Webinars
    • Our Blog
    • Infographics
    • Customer Success Stories
    • Industry Guides
    DocsCorp compareDocs Virtual User Group Meeting - Americas DocsCorp compareDocs User Group Meeting - Americas Weathering the Storm | DocsCorp CEO Dean Sappey Reflects on a Year Like No Other Weathering the Storm | DocsCorp CEO Dean Sappey Reflects on a Year Like No Other
  • Support
    • Customer Support
    • Client Portal
    • myDocsCorp
    • Credit Card Payments
    • eLearning
    • Training Partners
    • Quick Training Guides
    • Product FAQs
  • Partners
    • Become a Partner
    • Find a Partner
    • Training Partners
    • Partner Portal
  • Buy
    • cleanDocs
    • compareDocs
    • pdfDocs
DOWNLOAD THE PDF

How Stibbe used contentCrawler to index 28 million documents and emails for its enterprise search engine

Business goals

  • Automatically find and convert image-based files to searchable PDFs
  • Maximize the value of the enterprise search engine
  • Comply with GDPR data return, erase, and portability requirements
  • Avoid impacting staff workflows with new OCR or scanning requirements


About Stibbe

Stibbe is an internationally-orientated Benelux law firm with over 375 lawyers. From its main offices in Amsterdam, Brussels, and Luxembourg, together with its branch office in Dubai, London, and New York, Stibbe handles complex legal challenges for its clients both locally and cross-border. As a specialist firm, Stibbe’s lawyers work in multidisciplinary teams and deliver pragmatic advice. They build close business relationships with their clients that range from local and multinational corporations to financial institutions, government organizations, and public authorities. Stibbe’s understanding of its clients’ commercial objectives, their position in the market, and their sector or industry allows the firm to always provide clients with timely, effective and appropriate advice on their complex local and cross-border legal challenges.

Using enterprise search for GDPR compliance

The ability to search for and find 100% of documents is required to meet data return, erase, and portability requirements under the GDPR.

“We invested in an enterprise search engine to be future-proof before the GDPR came into force in May 2018,” explained Olivier Van Eesbeecq, Head of ICT & Facilities at Stibbe Belgium. “Several products we were using – including our document management system – came with their own search engines, but we found them to be lacking. So, we decided to invest in enterprise search technology.”

The problem with non-searchable files

For it to work effectively enterprise search relies on the existence of a text layer in every file in your system. But scanned files, TIFFs, JPEGs, and image-based PDFs (of which Stibbe Brussels had many) – don’t have that layer.

Full-text search in your documents is important because a) people don’t always remember the name of a file so it’s essential that on-page content can be searched, and b) under the GDPR, you need to be able to search for and find every document that contains a name, email address, bank account number, or other personal data.

To get the maximum benefit of its enterprise search investment, Stibbe needed a solution that could find non-searchable files that were not indexed for searching and could process them, so it had the necessary text layer to be indexed for searching.

Bulk conversion into searchable PDFs 

Search and assess technologies using OCR software can find non-searchable content and automatically convert them into text-searchable PDFs. Stibbe required a solution that could work “in the background,” so it wouldn’t impact staff workflows or processes.

“We were already using the DocsCorp desktop productivity solutions,” said Olivier, “so when we learned there was an automated OCR solution as well, choosing it was a no-brainer for us.”

contentCrawler is configured at Stibbe to be a set-and-forget solution. Staff continue to upload documents into the document management system, for example, without worrying about their need to be OCRed. “If our lawyers photocopy or scan a file they simply add it to the document management system, and it’s automatically made searchable. That’s a big advantage,” Olivier commented.

“contentCrawler connected to all our document sources – like file servers, email servers, the document management system, SharePoint – and converted all the content into searchable PDFs,” Olivier continued. “Once contentCrawler processed the files the search engine picked it up and indexed it within minutes.”

“We now have more than 28 million documents and emails indexed by our enterprise search engine. All that content is now searchable thanks to contentCrawler.”

Have staff noticed a difference?

“Absolutely. Our staff have certainly noticed a difference since having contentCrawler,” said Olivier. “Although it’s a background process, they really see the value because they trust that their documents will be automatically indexed and made searchable. It also saves them time since they no longer need to use desktop scanners to manually OCR files.”

Summary

Stibbe used contentCrawler to unlock the benefits of its enterprise search engine since non-searchable documents were impacting its performance. Now, the firm has a solution that works silently behind the scenes, automatically catching every new document added to its file systems and adding a text layer when needed. Staff are able to search for and find content across 28 million documents and emails, and the firm can comply with GDPR requirements for data storage and handling.

DOWNLOAD THE PDF

Related

image

Case study

Save hours of work

How DBL Law converts image files to searchable PDFs using batch OCR processing

image

Case study

Find more content, more easily

How contentCrawler improves the user experience of searching in Microsoft SharePoint

image

Case study

Meet Court filing requirements

How a top U.S. firm automated OCR, resulting in more litigation items being filed in less time

  • How MacRoberts simplified software management with DocsCorp
  • Madgwicks Lawyers increases productivity using a PDF editor with iManage integration
  • How DocsCorp and SeeUnity delivered a joint solution that cleans documents of metadata as they sync between systems
  • DocsCorp and Morae’s Phoenix Business Solutions: a partnership based on teamwork and trust
  • How Delphi uses technology from DocsCorp to minimize human error
  • How award-winning Benelux law firm Stibbe uses DocsCorp solutions for core legal workflows
  • How Stibbe used contentCrawler to index 28 million documents and emails for its enterprise search engine
  • Automating electronic binder production boosted productivity and morale at this leading Australian law firm
  • How U.S. law firm Taft eliminated licensing and performance issues when it switched to compareDocs for document comparison
  • How contentCrawler improves the user experience of searching in Microsoft SharePoint
  • An IT Infrastructure Manager explains how simple it was to manage compareDocs during a merger and Office 365 update
  • How confidence was restored in core applications like metadata cleaning and document comparison at UK firm Hempsons
  • How Top U.S. firm Shook, Hardy & Bacon automated OCR, resulting in more litigation items being filed in less time
  • How UK Top 100 Firm Winckworth Sherwood strengthened data protection for GDPR compliance
  • How DBL Law converts image files to searchable PDFs using batch OCR processing
  • How this insurance broker slashed paper usage with document comparison software
  • How Simpson Grierson found a better alternative for document comparison
  • How a PDF binder solution gave Simpson Grierson a competitive edge on billable hours, and their clients an even better experience
  • Preparing for changes to data privacy regulations: How Simpson Grierson improved and streamlined its metadata cleaning
  • A regulatory body in Australia’s legal industry reduces production costs for mandatory paperless Court Briefs by 84%
  • How an ISO 27001 certified law firm manages the risk of data breaches
  • Four solutions, one firm: How ByrneWallace uses the DocsCorp productivity suite
  • How Mobile Helix used compareDocs SDK to provide accurate document comparison in the LINK App for Lawyers
  • Seddons uses cleanDocs and contentCrawler to support their GDPR compliance goals
  • cleanDocs helps DBL Law protect against accidental data leaks
  • contentCrawler helps minimize the number of hidden files in Makinson d'Apice DMS
  • contentCrawler changed Becker & Poliakoff's OCR workflow from four steps to one

NEWS IN YOUR INBOX

Home
  • About DocsCorp
  • Disclaimer
  • Privacy Policy
  • GDPR Policy
  • Data Security
  • Accreditations
  • Service Level Agreement
  • Human Rights Policy
  • Anti-Slavery and Human Trafficking Policy
  • Anti-Bribery and Corruption Policy
  • COVID-19 Statement
Products
  • pdfDocs
  • pdfDocs Binder
  • compareDocs
  • compareDocs Cloud
  • compareDocs SDK
  • cleanDocs
  • cleanDocs Server
  • contentCrawler
  • contentCrawler Cloud
News
  • Press Releases
  • Events/Webinars
  • Industry Guides
  • Case Studies
  • Blog Posts
  • Infographics
  • Watch Our Story
myDocsCorp
  • Support login
  • Pay2Go
  • myDocsCorp
  • Training directory
  • Customer Support
  • Product FAQs
  • Find a partner
  • Contact us
  • blog
  • linkedin
  • twitter
  • facebook
logo

© Copyright DocsCorp 2021 - All rights reserved.