Faroese Language Models with Pronunciations: A set of n-gram language models in ARPA format along with pronunciation dictionaries

Carlos Daniel Hernández Mena, Sandra Saxov Lamhauge, Iben Nyholm Debess, Annika Simonsen

Research output: Other contribution

Abstract

In the context of Automatic Speech Recognition (ASR), a n-gram language model
is a plain-text file containing the probabilities of word sequences with
distict lengths or "n-grams" (for example, a sequence of one word is a 1-gram,
a sequence of two words is a 2-gram and so on). Acoording to this, the "Faroese
Language Models with Pronunciations" is a set of n-gram language models in ARPA
format along with pronunciation dictionaries containing the words that are
present in such language models.
Original languageEnglish
TypeFaroese Language Models with Pronunciations
Media of outputData files
PublisherCLARIN
Publication statusPublished - 2022

Publication series

NameCLARIN-IS

Keywords

  • ASR
  • n-gram
  • language model
  • Faroese
  • pronunciation models
  • pronunciation dictionaries

Fingerprint

Dive into the research topics of 'Faroese Language Models with Pronunciations: A set of n-gram language models in ARPA format along with pronunciation dictionaries'. Together they form a unique fingerprint.

Cite this