- A Bayesian approach to translators' reliability assessment
- Adding Chocolate to Mint Mitigating Metric Interference in Machine Translation
- Analyzing Context Contributions in LLM-based Machine Translation
- Automatic post-editing for machine translation a look at the future (post on the FBK blog from 2019-03-07 by Matteo Negri)
- BOUQuET dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation
- Building Machine Translation Systems for the Next Thousand Languages
- Continuous Learning from Human Post-Edits for Neural Machine Translation
- Evaluating the Cross-Lingual Effectiveness of Massively Multilingual Neural Machine Translation
- Exploring Massively Multilingual, Massive Neural Machine Translation and references therein
- Google's Multilingual Neural Machine Translation System Enabling Zero-Shot Translation
- Google's Neural Machine Translation System Bridging the Gap between Human and Machine Translation
- How Effective are State Space Models for Machine Translation
- Improving Neural Machine Translation Models with Monolingual Data
- Improving Zero-Shot Translation by Disentangling Positional Information
- Investigating Multilingual NMT Representations at Scale
- Massively Multilingual Neural Machine Translation in the Wild Findings and Challenges
- Massively Multilingual Neural Machine Translation
- No Language Left Behind Scaling Human-Centered Machine Translation
- On Instruction-Finetuning Neural Machine Translation Models
- Quantifying the Plausibility of Context Reliance in Neural Machine Translation
- Seamless Multilingual Expressive and Streaming Speech Translation
- SeamlessM4T Massively Multilingual & Multimodal Machine Translation
- Searching for Needles in a Haystack On the Role of Incidental Bilingualism in PaLM's Translation Capability
- Sequence-Level Knowledge Distillation
- State Spaces Aren't Enough Machine Translation Needs Attention
- Unsupervised Neural Machine Translation
- WikiMatrix Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia
- xTower A Multilingual LLM for Explaining and Correcting Translation Errors
- Did Translation Models Get More Robust Without Anyone Even Noticing
- Looking for a Needle in a Haystack A Comprehensive Study of Hallucinations in Neural Machine Translation
The above includes some speech-to-text and speech-to-speech translation literature.
Evaluation
- sacrebleu: A Call for Clarity in Reporting BLEU Scores
- COMET A Neural Framework for MT Evaluation
- xCOMET Transparent Machine Translation Evaluation through Fine-grained Error Detection
- MetricX-24 The Google Submission to the WMT 2024 Metrics Shared Task
- MQM (Multidimensional Quality Metrics) - The place to go to learn about MQM (https://themqm.org/)
Implementation
- MarianNMT - Fast Neural Machine Translation in C++
See Also / Related
- Bergamot - Machine Translation done locally in your browser
- Publications of the Bergamot Project (the list stops in 2021, at the end of the EU Horizon grant period)