Machine Translation

Machine Translation is an excellent example of how cutting-edge research and world-class infrastructure come together at Google. We focus our research efforts on developing statistical translation techniques that improve with more data and generalize well to new languages. Our large scale computing infrastructure allows us to rapidly experiment with new models trained on web-scale data to significantly improve translation quality. This research backs the translations served at translate.google.com, allowing our users to translate text, web pages and even speech. Deployed within a wide range of Google services like GMail, Books, Android and web search, Google Translate is a high-impact, research-driven product that bridges language barriers and makes it possible to explore the multilingual web in 90 languages. Exciting research challenges abound as we pursue human quality translation and develop machine translation systems for new languages.

Recent Publications

Connecting Language Technologies with Rich, Diverse Data Sources Covering Thousands of Languages

Daan van Esch

Sandy Ritchie

Sebastian Ruder

Julia Kreutzer

Clara Rivera

Ishank Saxena

Isaac Caswell

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Prompting PaLM for Translation: Assessing Strategies and Performance

David Vilar Torres

Markus Freitag

Colin Cherry

Jiaming Luo

Viresh Ratnakar

George Foster

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Association for Computational Linguistics, Toronto, Canada (2023), 15406–15427

INSTRUCTSCORE: Towards Explainable Text Generation Evaluation with Automatic Feedback

Wenda Xu

Danqing Wang

Liangming Pan

Zhenqiao Song

Markus Freitag

William Wang

Lei Li

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Singapore, pp. 5967-5994

Epsilon Sampling Rocks: Investigating Sampling Strategies for Minimum Bayes Risk Decoding for Machine Translation

Markus Freitag

Behrooz Ghorbani

Patrick Fernandes

Findings of the Association for Computational Linguistics: EMNLP 2023, Association for Computational Linguistics, Singapore, pp. 9198-9209

BiLex Rx: Lexical Data Augmentation for Massively Multilingual Machine Translation

Alex Jones

Isaac Caswell

Orhan Firat

ArXiv (2023)

Mu2SLAM: Multitask, Multilingual Speech and Language Models

Yong Cheng

Yu Zhang

Melvin Johnson

Wolfgang Macherey

Ankur Bapna

Submission to ACL 2023

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Machine Translation

Recent Publications

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Machine Translation

Recent Publications

Join us

AI/ML Foundations  & Capabilities