Garnishing a phonetic dictionary for ASR intake

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

20 Downloads (Pure)

Abstract

We present a new method for preparing a lexical-phonetic database as a resource for acoustic model training. The research is an offshoot of the ongoing Project Ravnur (Speech Recognition for Faroese), but the method is language-independent. At NODALIDA 2019 we demonstrate the method (called SHARP) online, showing how a traditional lexical-phonetic dictionary (with a very rich phone inventory) is transformed into an ASR-friendly database (with reduced phonetics, preventing data sparseness). The mapping procedure is informed by a corpus of speech transcripts. We conclude with a discussion on the benefits of a well-thought-out BLARK design (Basic Language Resource Kit), making tools like SHARP possible.
Original languageEnglish
Title of host publicationProceedings of the 22nd Nordic Conference on Computational Linguistics
Place of PublicationTurku
PublisherLinköping University Electronic Press
Pages395-399
Number of pages5
Volume2019
EditionSeptember–October
Publication statusPublished - 2019

Keywords

  • Phonetics
  • Databases

Fingerprint

Dive into the research topics of 'Garnishing a phonetic dictionary for ASR intake'. Together they form a unique fingerprint.

Cite this