A complete KALDI recipe for building Arabic speech recognition systems

Ali, Ahmed and Zhang, Yifan and Cardinal, Patrick and Dehak, Najim and Vogel, Stephan and Glass, James

2014 IEEE Workshop on Spoken Language Technology (SLT 2014), Proceedings, 2014

Abstract: In this paper we present a recipe and language resources for training and testing Arabic speech recognition systems using the KALDI toolkit. We built a prototype broadcast news system using 200 hours of GALE data that is publicly available through LDC. We describe in detail the decisions made in building the system: using the MADA toolkit for text normalization and vowelization; why we use 36 phonemes; how we generate pronunciations; and how we build the language model. We report results using state-of-the-art modeling and decoding techniques. The scripts are released through KALDI, and the resources are made available on QCRI’s language resources web portal. This is the first effort to share reproducible, sizable training and testing results on a Modern Standard Arabic (MSA) system.