Abstract
This study evaluates GPT-4's English-to-Faroese translation capabilities, comparing it with multilingual models on FLORES-200 and Sprotin datasets. We propose a prompt optimization strategy using Semantic Textual Similarity (STS) to improve translation quality. Human evaluation confirms the effectiveness of STS-based few-shot example selection, though automated metrics fail to capture these improvements. Our findings advance LLM applications for low-resource language translation while highlighting the need for better evaluation methods in this context.
| Original language | English |
|---|---|
| Pages | 622-633 |
| Number of pages | 12 |
| Publication status | Published - 2025 |
| Event | Nodalida/Baltic-HLT 2025: The Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies - Tallinn, Estonia Duration: 2 Mar 2025 → 5 Mar 2025 Conference number: 25 https://www.nodalida-bhlt2025.eu/conference |
Conference
| Conference | Nodalida/Baltic-HLT 2025 |
|---|---|
| Country/Territory | Estonia |
| City | Tallinn |
| Period | 2/03/25 → 5/03/25 |
| Internet address |
Keywords
- Machine learning
- translations
- GPT-4