Interactive Systems Labs (ISL)

Automatic Speech Recognition (ASR)

Automatic Speech Recognition (ASR) is the science of automatically transforming spoken language into written text. At our laboratory, we develop ASR systems mainly for use in speech translation systems, such as the simultaneous lecture translation system.

We conduct research in all areas relevant to ASR:
  • acoustic pre-processing and feature extraction
  • acoustic modeling
  • language modeling
  • search
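
These four areas correspond to the components of the standard statistical formulation of speech recognition. The sketch below is the general textbook formulation, not necessarily the exact one implemented in our toolkit; the symbols X and W are introduced here for illustration only.

```latex
% X = x_1, ..., x_T : sequence of feature vectors (acoustic pre-processing and feature extraction)
% W = w_1, ..., w_N : candidate word sequence
\hat{W} \;=\; \operatorname*{arg\,max}_{W} P(W \mid X)
        \;=\; \operatorname*{arg\,max}_{W} \; p(X \mid W) \, P(W)
% p(X | W) : acoustic model
% P(W)     : language model
% arg max  : search over all word sequences
```

The second equality follows from Bayes' rule, since the term p(X) does not depend on W and can be dropped from the maximization.
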
We offer both a class in ASR and a laboratory course that teaches how to build an ASR system with our in-house speech recognition toolkit, the Janus Recognition Toolkit (JRTk).

Applications for this technology are manifold. The original idea was to create an automatic typewriter for dictation; today, speech recognition software is found in many applications that call for a natural interface:

  • Dictation software
  • Speech Translation Systems
  • Smart Rooms
  • Human-Robot Communication
  • Telephone help lines
  • Machine control
  • Car navigation and entertainment systems
  • Pick-to-voice systems
  • Appliances
  • Medical systems in operating rooms