Welcome!

News Feed Item

Catalyst Files Patent for Next-Generation Technology Assisted Review Based on 'Reinforcement Learning'

New Research Validates That Continuous Learning Methods Improve Savings and Results in Technology Assisted Review

DENVER CO -- (Marketwired) -- 06/11/14 -- Catalyst Repository Systems -- a pioneer in developing secure, cloud-based software to help corporations and their law firms take control of e-discovery, compliance and regulatory matters -- today announced it has applied for a patent on the type of continuous learning capability it invented for its next-generation technology assisted review (TAR 2.0) platform, Insight Predict.

Described in the patent application as "reinforcement learning based document coding," Catalyst's TAR technology is able to continuously learn from actions taken by the review team throughout the review process. With reinforcement learning, certain actions -- such as coding a document as responsive or not or adding additional documents -- enable the system to continue to grow "smarter" in its ability to select relevant documents.

What is Reinforcement Learning?

Reinforcement learning differs from older TAR 1.0 systems which require training by a high-level attorney. This expensive and time-consuming approach requires the senior attorney to first review and code an initial training set of randomly selected documents. With Catalyst's reinforcement learning technology, the full review team can begin right away. As reviewers' judgments are fed back into the system and new documents added, the system's selection and ranking of relevant documents continuously improves.

A new, peer-reviewed study by two leading experts in e-discovery validates the effectiveness of continuous learning technologies in e-discovery. In a paper they will present at the Association of Computing Machinery Special Interest Group on Information Retrieval (SIGIR) international conference in July 2014, "Evaluation of Machine-Learning Protocols for Technology-Assisted Review in Electronic Discovery," Gordon V. Cormack and Maura R. Grossman conclude that non-random training methods using continuous active learning "require substantially and significantly less human review effort" and yield "generally superior results."

Why is Catalyst's Approach Unique?

Even among continuous learning systems, Catalyst's method is unique for its use of reinforcement learning rather than active learning. Active learning systems are geared towards optimizing the quality of the classifier, the algorithm that labels documents as relevant or not. By contrast, reinforcement learning is designed to optimize for the goal the user seeks to achieve, which is generally to find as many relevant documents as possible. In this way, reinforcement learning helps users reach that goal more quickly.

"In contrast to the 'one bite of the apple' approach of earlier TAR engines, Insight Predict is able to use judgmental seeds and relevance feedback to continuously learn and rank throughout the review process, while avoiding the problems of bias and incomplete coverage through its use of contextual diversity," said John Tredennick, Catalyst's founder and CEO. "This is a major benefit to our clients because it eliminates the need for subject-matter experts for training, allows the review to get started sooner, accommodates rolling uploads, and ultimately delivers savings in time and costs."

Catalyst's unique reinforcement learning system was developed by Dr. Jeremy Pickens, Catalyst's senior research scientist, and Bruce Kiefer, Catalyst's vice president, platform. Pickens, one of the world's leading search scientists and a pioneer in the field of collaborative exploratory search, has a number of patents and patents pending in the field of information retrieval.

Overcoming the Five Myths of TAR

Catalyst's technology upends a number of common misconceptions about TAR -- that training is finite based on an initial seed set, that documents for training must be selected at random, that subject matter experts are required to train the system, that training cannot start until all documents on hand, and that it does not work for non-English documents.

To read more about the myths surrounding TAR and how advanced systems disprove them, see John Tredennick's Law Technology News article, Five Myths About Technology Assisted Review.

About Catalyst Repository Systems

A pioneer in cloud-based litigation technology, Catalyst provides global corporations and their counsel with secure, hosted document repositories to manage discovery, regulatory inquiries and other complex legal matters. Clients use Insight, Catalyst's "Big Discovery" platform, and Insight Predict, our advanced technology assisted review engine, to reduce discovery costs and associated risks. Corporations gain greater control and predictability over the discovery process and greater visibility across all their legal matters.

For more information, visit Catalyst at www.catalystsecure.com or follow us on Twitter at: http://twitter.com/catalystsecure.

For more information, press only
Shana Graham
Plat4orm PR
Email Contact
206.661.6336

More Stories By Marketwired .

Copyright © 2009 Marketwired. All rights reserved. All the news releases provided by Marketwired are copyrighted. Any forms of copying other than an individual user's personal reference without express written permission is prohibited. Further distribution of these materials is strictly forbidden, including but not limited to, posting, emailing, faxing, archiving in a public database, redistributing via a computer network or in a printed form.

Latest Stories
With more than 30 Kubernetes solutions in the marketplace, it's tempting to think Kubernetes and the vendor ecosystem has solved the problem of operationalizing containers at scale or of automatically managing the elasticity of the underlying infrastructure that these solutions need to be truly scalable. Far from it. There are at least six major pain points that companies experience when they try to deploy and run Kubernetes in their complex environments. In this presentation, the speaker will d...
While DevOps most critically and famously fosters collaboration, communication, and integration through cultural change, culture is more of an output than an input. In order to actively drive cultural evolution, organizations must make substantial organizational and process changes, and adopt new technologies, to encourage a DevOps culture. Moderated by Andi Mann, panelists discussed how to balance these three pillars of DevOps, where to focus attention (and resources), where organizations might...
The deluge of IoT sensor data collected from connected devices and the powerful AI required to make that data actionable are giving rise to a hybrid ecosystem in which cloud, on-prem and edge processes become interweaved. Attendees will learn how emerging composable infrastructure solutions deliver the adaptive architecture needed to manage this new data reality. Machine learning algorithms can better anticipate data storms and automate resources to support surges, including fully scalable GPU-c...
When building large, cloud-based applications that operate at a high scale, it's important to maintain a high availability and resilience to failures. In order to do that, you must be tolerant of failures, even in light of failures in other areas of your application. "Fly two mistakes high" is an old adage in the radio control airplane hobby. It means, fly high enough so that if you make a mistake, you can continue flying with room to still make mistakes. In his session at 18th Cloud Expo, Le...
Machine learning has taken residence at our cities' cores and now we can finally have "smart cities." Cities are a collection of buildings made to provide the structure and safety necessary for people to function, create and survive. Buildings are a pool of ever-changing performance data from large automated systems such as heating and cooling to the people that live and work within them. Through machine learning, buildings can optimize performance, reduce costs, and improve occupant comfort by ...
As Cybric's Chief Technology Officer, Mike D. Kail is responsible for the strategic vision and technical direction of the platform. Prior to founding Cybric, Mike was Yahoo's CIO and SVP of Infrastructure, where he led the IT and Data Center functions for the company. He has more than 24 years of IT Operations experience with a focus on highly-scalable architectures.
CI/CD is conceptually straightforward, yet often technically intricate to implement since it requires time and opportunities to develop intimate understanding on not only DevOps processes and operations, but likely product integrations with multiple platforms. This session intends to bridge the gap by offering an intense learning experience while witnessing the processes and operations to build from zero to a simple, yet functional CI/CD pipeline integrated with Jenkins, Github, Docker and Azure...
The explosion of new web/cloud/IoT-based applications and the data they generate are transforming our world right before our eyes. In this rush to adopt these new technologies, organizations are often ignoring fundamental questions concerning who owns the data and failing to ask for permission to conduct invasive surveillance of their customers. Organizations that are not transparent about how their systems gather data telemetry without offering shared data ownership risk product rejection, regu...
René Bostic is the Technical VP of the IBM Cloud Unit in North America. Enjoying her career with IBM during the modern millennial technological era, she is an expert in cloud computing, DevOps and emerging cloud technologies such as Blockchain. Her strengths and core competencies include a proven record of accomplishments in consensus building at all levels to assess, plan, and implement enterprise and cloud computing solutions. René is a member of the Society of Women Engineers (SWE) and a m...
Dhiraj Sehgal works in Delphix's product and solution organization. His focus has been DevOps, DataOps, private cloud and datacenters customers, technologies and products. He has wealth of experience in cloud focused and virtualized technologies ranging from compute, networking to storage. He has spoken at Cloud Expo for last 3 years now in New York and Santa Clara.
Enterprises are striving to become digital businesses for differentiated innovation and customer-centricity. Traditionally, they focused on digitizing processes and paper workflow. To be a disruptor and compete against new players, they need to gain insight into business data and innovate at scale. Cloud and cognitive technologies can help them leverage hidden data in SAP/ERP systems to fuel their businesses to accelerate digital transformation success.
Containers and Kubernetes allow for code portability across on-premise VMs, bare metal, or multiple cloud provider environments. Yet, despite this portability promise, developers may include configuration and application definitions that constrain or even eliminate application portability. In this session we'll describe best practices for "configuration as code" in a Kubernetes environment. We will demonstrate how a properly constructed containerized app can be deployed to both Amazon and Azure ...
Poor data quality and analytics drive down business value. In fact, Gartner estimated that the average financial impact of poor data quality on organizations is $9.7 million per year. But bad data is much more than a cost center. By eroding trust in information, analytics and the business decisions based on these, it is a serious impediment to digital transformation.
Digital Transformation: Preparing Cloud & IoT Security for the Age of Artificial Intelligence. As automation and artificial intelligence (AI) power solution development and delivery, many businesses need to build backend cloud capabilities. Well-poised organizations, marketing smart devices with AI and BlockChain capabilities prepare to refine compliance and regulatory capabilities in 2018. Volumes of health, financial, technical and privacy data, along with tightening compliance requirements by...
Predicting the future has never been more challenging - not because of the lack of data but because of the flood of ungoverned and risk laden information. Microsoft states that 2.5 exabytes of data are created every day. Expectations and reliance on data are being pushed to the limits, as demands around hybrid options continue to grow.