Science of Media Information

Exhibition Program 19

Listening to languages around the world simultaneously

Multilingual speech identification & recognition technology

Abstract

We are developing technology for listening to languages from around the world simultaneously. It consists of three components:
1. Intelligent Microphone:
Separates speech from people talking at the same time, handling up to 7 simultaneous speakers.
2. Language Identification:
Accurately identifies the language being spoken, using an LSTM (long short-term memory) framework.
3. Multilingual Speech Recognition:
Supports 10 languages of the Asia-Pacific region, using a DNN (deep neural network) structure that is robust in noisy environments.
We plan to increase the number of supported languages and to apply the technology to natural speech interfaces for foreign visitors.
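
As a rough illustration of how the three components could be chained, here is a minimal Python sketch. It is not the presenters' implementation: the function `listen`, the `Transcript` type, and the three callables passed in (`separate`, `identify_language`, `recognizers`) are hypothetical names introduced only for this example, and the real separation, LSTM and DNN models are represented by plain function arguments. Only the limit of 7 speakers comes from the abstract.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence

# Upper limit on separable speakers, as stated in the abstract.
MAX_SPEAKERS = 7

# Placeholder type for one stream of audio samples (an assumption for this sketch).
Audio = Sequence[float]


@dataclass
class Transcript:
    speaker: int    # index of the separated stream
    language: str   # language label returned by the identifier
    text: str       # recognized text


def listen(
    mixture: Audio,
    separate: Callable[[Audio], List[Audio]],
    identify_language: Callable[[Audio], str],
    recognizers: Dict[str, Callable[[Audio], str]],
) -> List[Transcript]:
    """Chain separation, language identification and recognition.

    All three stages are supplied by the caller; this function only
    shows the order in which they are applied.
    """
    # Stage 1: split the mixed recording into one stream per speaker.
    streams = separate(mixture)
    if len(streams) > MAX_SPEAKERS:
        raise ValueError(f"more than {MAX_SPEAKERS} speakers are not supported")

    results: List[Transcript] = []
    for speaker, stream in enumerate(streams):
        # Stage 2: decide which language this speaker is using.
        language = identify_language(stream)
        # Stage 3: pass the stream to the recognizer for that language.
        recognizer = recognizers.get(language)
        if recognizer is None:
            continue  # language not supported: skip this stream
        results.append(Transcript(speaker, language, recognizer(stream)))
    return results
```

Identifying the language per separated stream, rather than once for the whole recording, is what would let speakers of different languages be recognized at the same time. Streams in an unsupported language are simply skipped in this sketch.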

Presenters

Hirokazu Masataki
Media Intelligence Laboratories
Yoshikazu Yamaguchi
Media Intelligence Laboratories
Taichi Asami
Media Intelligence Laboratories