Standardising Pronunciation for a Grapheme-to-Phoneme Converter for Faroese

Sandra Saxov Lamhauge, Iben Nyholm Debess, Carlos Daniel Hernández Mena, Annika Simonsen, Jón Gudnason

Research output: Contribution to conferencePaperpeer-review

19 Downloads (Pure)

Abstract

Pronunciation dictionaries allow computational modelling of the pronunciation
of words in a certain language and are widely used in speech technologies, especially in the fields of speech recognition and synthesis. On the other hand, a grapheme-to-phoneme tool is a generalization of a pronunciation dictionary that is not limited to a given and finite vocabulary. In this paper, we present a set of standardized phonological rules for the Faroese language; we introduce FARSAMPA, a machine-readable character set suitable for phonetic transcription of Faroese, and we present a set of grapheme-to-phoneme models for Faroese, which are publicly available and shared under a creative commons license. We
present the G2P converter and evaluate the performance. The evaluation shows reliable results that demonstrate the quality of the data.
Original languageEnglish
Pages308-317
Number of pages10
Publication statusPublished - May 2023
EventNodalida 2023: Nordic Conference on Computational Linguistics - Tórshavn, Faroe Islands
Duration: 22 May 202324 May 2023
Conference number: 24
http://nodalida2023.fo

Conference

ConferenceNodalida 2023
Abbreviated titleNodalida
Country/TerritoryFaroe Islands
CityTórshavn
Period22/05/2324/05/23
Internet address

Keywords

  • pronunciation dictionaries
  • computational modeling
  • speech technologies
  • grapheme-to-phoneme tool
  • machine-readable character set
  • Faroese language

Fingerprint

Dive into the research topics of 'Standardising Pronunciation for a Grapheme-to-Phoneme Converter for Faroese'. Together they form a unique fingerprint.

Cite this