Abstract
This study evaluates GPT-4's English-to-Faroese translation capabilities, comparing it with multilingual models on FLORES-200 and Sprotin datasets. We propose a prompt optimization strategy using Semantic Textual Similarity (STS) to improve translation quality. Human evaluation confirms the effectiveness of STS-based few-shot example selection, though automated metrics fail to capture these improvements. Our findings advance LLM applications for low-resource language translation while highlighting the need for better evaluation methods in this context.
Original language | English |
---|---|
Pages | 622-633 |
Number of pages | 12 |
Publication status | Published - 2025 |
Event | Nodalida/Baltic-HLT 2025: The Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies - Tallinn, Estonia Duration: 2 Mar 2025 → 5 Mar 2025 Conference number: 25 https://www.nodalida-bhlt2025.eu/conference |
Conference
Conference | Nodalida/Baltic-HLT 2025 |
---|---|
Country/Territory | Estonia |
City | Tallinn |
Period | 2/03/25 → 5/03/25 |
Internet address |
Keywords
- Machine learning
- translations
- GPT-4