``You Sound Just Like Your Father'' Commercial Machine Translation Systems Include Stylistic Biases
Dirk Hovy, Federico Bianchi, Tommaso Fornaciari
Machine Translation Short Paper
Session 2B: Jul 6
(09:00-10:00 GMT)
Session 3A: Jul 6
(12:00-13:00 GMT)
Abstract:
The main goal of machine translation has been to convey the correct content. Stylistic considerations have been at best secondary. We show that as a consequence, the output of three commercial machine translation systems (Bing, DeepL, Google) make demographically diverse samples from five languages ``sound'' older and more male than the original. Our findings suggest that translation models reflect demographic bias in the training data. This opens up interesting new research avenues in machine translation to take stylistic considerations into account.
You can open the
pre-recorded video
in a separate window.
NOTE: The SlidesLive video may display a random order of the authors.
The correct author list is shown at the top of this webpage.
Similar Papers
Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview
Deven Santosh Shah, H. Andrew Schwartz, Dirk Hovy,

Reducing Gender Bias in Neural Machine Translation as a Domain Adaptation Problem
Danielle Saunders, Bill Byrne,

That is a Known Lie: Detecting Previously Fact-Checked Claims
Shaden Shaar, Nikolay Babulkov, Giovanni Da San Martino, Preslav Nakov,
