Welcome!

News Feed Item

The International Computer Science Institute (ICSI) Leads Team Researching Ways to Build Speech Recognition Systems for New Languages Under Severe Data and Time Constraints

The International Computer Science Institute (ICSI) is leading a research team under the IARPA Babel Program that is focused on building speech recognition solutions with self-imposed time and data limitations for a variety of languages. The work aims to better understand fundamental challenges and discover new methods for development of speech models for languages that could emerge as important in the future.

“The goal of the Babel program is to rapidly build speech recognition systems to support effective keyword search for new languages using limited amounts of transcribed speech recorded in real-world conditions,” said Mary Harper, the IARPA Program Manager in charge of the Babel program.

Using only a fraction of the training data usually required, the team aims to build speech recognition systems for several languages in just one week by the end of the program.

“ICSI excels at intellectual challenges and unique approaches to research. This is an intriguing project that puts significant constraints on our researchers as a means to discover better ways to develop automatic speech recognition systems,” said Roberto Pieraccini, director and president of ICSI.

By working on a variety of languages with time and data restrictions, the team will research basic principles of speech technology rather than incremental improvements to existing technology. In addition, this research will be useful in enabling keyword-search systems for those languages that do not have large amounts of transcribed audio.

“The speech recognition systems we’ve built in the past have the curse of being reasonably good, particularly for a few languages and speech recorded in good acoustic conditions, which has often reduced the impetus to significantly change the technology,” said Professor Nelson Morgan, deputy director and leader of the Speech Group at ICSI. “This project strongly pushes us to solve fundamental problems in speech recognition to address the Babel challenge."

In each of the four periods of the project, the team will be given a set of languages and will be tasked with developing methods to quickly build a system. Speech recognition systems are typically trained on thousands of hours of transcribed audio. In this project, the team was initially given only 80 hours of conversational speech for each language, and in each succeeding period a smaller fraction of the audio is transcribed. At the end of each period, the team will be given a new language to build a system – initially in four weeks, but by the end of the program down to just one week.

In addition to Morgan, the leaders of the team are Steven Wegmann of ICSI, Professor Mari Ostendorf of the University of Washington, Professor Janet Pierrehumbert of Northwestern University, Professor Eric Fosler-Lussier of The Ohio State University, and Professor Dan Ellis of Columbia University. Morgan says an important element of the project is that these team leaders have had strong previous research ties with one another in research topics that are essential to the Babel problem.

The project is funded by the Intelligence Advanced Research Projects Activity (IARPA), a research arm of the Office of the Director of National Intelligence, which invests in high-risk/high-payoff research programs.

About ICSI

The International Computer Science Institute (ICSI) is a leading center for research in computer science and one of the few independent, nonprofit research institutes in the United States. With its unique focus on international collaboration and its affiliation with the University of California at Berkeley, ICSI brings together the most influential U.S. scientists and experts from around the world in areas such as computer networking and security, speech and language processing, algorithms, bioinformatics, computer architecture, computer vision, and artificial intelligence. For more information, check ICSI out on the Web:

www.ICSI.berkeley.EDU | http://twitter.com/ICSIatBerkeley | http://blog.ICSI.berkeley.EDU

www.facebook.com/ICSIatBerkeley | www.youtube.com/ICSIatBerkeley

More Stories By Business Wire

Copyright © 2009 Business Wire. All rights reserved. Republication or redistribution of Business Wire content is expressly prohibited without the prior written consent of Business Wire. Business Wire shall not be liable for any errors or delays in the content, or for any actions taken in reliance thereon.

Latest Stories
SYS-CON Events announced today that Interoute, owner-operator of one of Europe's largest networks and a global cloud services platform, has been named “Bronze Sponsor” of SYS-CON's 20th Cloud Expo, which will take place on June 6-8, 2017 at the Javits Center in New York, New York. Interoute is the owner-operator of one of Europe's largest networks and a global cloud services platform which encompasses 12 data centers, 14 virtual data centers and 31 colocation centers, with connections to 195 add...
Historically, some banking activities such as trading have been relying heavily on analytics and cutting edge algorithmic tools. The coming of age of powerful data analytics solutions combined with the development of intelligent algorithms have created new opportunities for financial institutions. In his session at 20th Cloud Expo, Sebastien Meunier, Head of Digital for North America at Chappuis Halder & Co., will discuss how these tools can be leveraged to develop a lasting competitive advanta...
TechTarget storage websites are the best online information resource for news, tips and expert advice for the storage, backup and disaster recovery markets. By creating abundant, high-quality editorial content across more than 140 highly targeted technology-specific websites, TechTarget attracts and nurtures communities of technology buyers researching their companies' information technology needs. By understanding these buyers' content consumption behaviors, TechTarget creates the purchase inte...
With the introduction of IoT and Smart Living in every aspect of our lives, one question has become relevant: What are the security implications? To answer this, first we have to look and explore the security models of the technologies that IoT is founded upon. In his session at @ThingsExpo, Nevi Kaja, a Research Engineer at Ford Motor Company, will discuss some of the security challenges of the IoT infrastructure and relate how these aspects impact Smart Living. The material will be delivered i...
In his session at @ThingsExpo, Eric Lachapelle, CEO of the Professional Evaluation and Certification Board (PECB), will provide an overview of various initiatives to certifiy the security of connected devices and future trends in ensuring public trust of IoT. Eric Lachapelle is the Chief Executive Officer of the Professional Evaluation and Certification Board (PECB), an international certification body. His role is to help companies and individuals to achieve professional, accredited and worldw...
My team embarked on building a data lake for our sales and marketing data to better understand customer journeys. This required building a hybrid data pipeline to connect our cloud CRM with the new Hadoop Data Lake. One challenge is that IT was not in a position to provide support until we proved value and marketing did not have the experience, so we embarked on the journey ourselves within the product marketing team for our line of business within Progress. In his session at @BigDataExpo, Sum...
Your homes and cars can be automated and self-serviced. Why can't your storage? From simply asking questions to analyze and troubleshoot your infrastructure, to provisioning storage with snapshots, recovery and replication, your wildest sci-fi dream has come true. In his session at @DevOpsSummit at 20th Cloud Expo, Dan Florea, Director of Product Management at Tintri, will provide a ChatOps demo where you can talk to your storage and manage it from anywhere, through Slack and similar services ...
SYS-CON Events announced today that SoftLayer, an IBM Company, has been named “Gold Sponsor” of SYS-CON's 18th Cloud Expo, which will take place on June 7-9, 2016, at the Javits Center in New York, New York. SoftLayer, an IBM Company, provides cloud infrastructure as a service from a growing number of data centers and network points of presence around the world. SoftLayer’s customers range from Web startups to global enterprises.
SYS-CON Events announced today that Ocean9will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. Ocean9 provides cloud services for Backup, Disaster Recovery (DRaaS) and instant Innovation, and redefines enterprise infrastructure with its cloud native subscription offerings for mission critical SAP workloads.
SYS-CON Events announced today that Juniper Networks (NYSE: JNPR), an industry leader in automated, scalable and secure networks, will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. Juniper Networks challenges the status quo with products, solutions and services that transform the economics of networking. The company co-innovates with customers and partners to deliver automated, scalable and secure network...
Have you ever noticed how some IT people seem to lead successful, rewarding, and satisfying lives and careers, while others struggle? IT author and speaker Don Crawley uncovered the five principles that successful IT people use to build satisfying lives and careers and he shares them in this fast-paced, thought-provoking webinar. You'll learn the importance of striking a balance with technical skills and people skills, challenge your pre-existing ideas about IT customer service, and gain new in...
Interoute has announced the integration of its Global Cloud Infrastructure platform with Rancher Labs’ container management platform, Rancher. This approach enables enterprises to accelerate their digital transformation and infrastructure investments. Matthew Finnie, Interoute CTO commented “Enterprises developing and building apps in the cloud and those on a path to Digital Transformation need Digital ICT Infrastructure that allows them to build, test and deploy faster than ever before. The int...
VeriStor Systems has announced that CRN has named VeriStor to its 2017 Managed Service Provider (MSP) 500 list in the Elite 150 category. This annual list recognizes North American solution providers with cutting-edge approaches to delivering managed services. Their offerings help companies navigate the complex and ever-changing landscape of IT, improve operational efficiencies, and maximize their return on IT investments. In today’s fast-paced business environments, MSPs play an important role...
DevOps is often described as a combination of technology and culture. Without both, DevOps isn't complete. However, applying the culture to outdated technology is a recipe for disaster; as response times grow and connections between teams are delayed by technology, the culture will die. A Nutanix Enterprise Cloud has many benefits that provide the needed base for a true DevOps paradigm. In his Day 3 Keynote at 20th Cloud Expo, Chris Brown, a Solutions Marketing Manager at Nutanix, will explore t...
What if you could build a web application that could support true web-scale traffic without having to ever provision or manage a single server? Sounds magical, and it is! In his session at 20th Cloud Expo, Chris Munns, Senior Developer Advocate for Serverless Applications at Amazon Web Services, will show how to build a serverless website that scales automatically using services like AWS Lambda, Amazon API Gateway, and Amazon S3. We will review several frameworks that can help you build serverle...