Sprachkontrolle im Spiegel der Maschinellen Übersetzung: Untersuchung zur Wechselwirkung ausgewählter Regeln der Kontrollierten Sprache mit verschiedenen Ansätzen der Maschinellen Übersetzung

Shaimaa Marzouk  

Synopsis

Examining the general impact of the Controlled Languages rules in the context of Machine Translation has been an area of research for many years. The present study focuses on the following question: How do the Controlled Language (CL) rules impact the Machine Translation (MT) output individually? Analyzing a German corpus-based test suite of technical texts that have been translated into English by different MT systems, the study endeavors to answer this question at different levels: the general impact of CL rules (rule- and system-independent), their impact at rule level (system-independent), their impact at system level (rule-independent), and at rule and system level. The results of five MT systems (a rule-based system, a statistical system, two differently constructed hybrid systems, and a neural system) are analyzed and contrasted. For this, a mixed-methods triangulation approach that includes error annotation, human evaluation, and automatic evaluation was applied. The data were analyzed both qualitatively and quantitatively based on the following parameters: number and type of MT errors, style and content quality, and scores from two automatic evaluation metrics. In line with many studies, the results show a general positive impact of the applied CL rules on the MT output. However, at rule level, only four rules proved to have positive effects on all parameters; three rules had negative effects on the parameters; and two rules did not show any significant impact. At rule and system level, the rules affected the MT systems differently, as expected. Some rules that had a positive impact on earlier MT approaches did not show the same impact on the neural MT approach. Furthermore, the neural MT delivered distinctly better results than earlier MT approaches, namely the highest error-free, style and content quality rates both before and after the rules application, which indicates that the neural MT offers a promising solution that no longer requires CL rules for improving the MT output, what in turn allows for a more natural style.

Der Preis der gebundenen Ausgabe ist in Deutschland auf 100,00€ festgesetzt.

Statistics

Author Biography

Shaimaa Marzouk

Shaimaa Marzouk obtained her PhD degree in Translation Studies from the Johannes Gutenberg University in Mainz, Germany after completing two Master’s degrees – majoring in Specialized Translation at the Johannes Gutenberg University and in IT Management at the Humboldt University in Berlin. Her Bachelor degree was in Banking from the Sadat Academy for Management Sciences, Cairo, Egypt. Moreover, she has worked as an IT consultant in the field of ERP systems at an international consulting company on projects in Spain and England.

She is particularly interested in interdisciplinary topics that combine IT and translation research, which is why her research work focusses on the analysis of linguistic and functional usability using eye tracking and other techniques as well as the analysis of machine translation in the context of controlled language conducting corpus-based studies.

In order to put her research into practice and at the same time gain valuable input for further research, Marzouk has recently established AUTHENTIC TRANSLATION (www.authentic-translation.de), a translation company that she herself manages.

book cover

Published

August 29, 2022
LaTeX source on GitHub
Cite as
Marzouk, Shaimaa. 2022. Sprachkontrolle im Spiegel der Maschinellen Übersetzung: Untersuchung zur Wechselwirkung ausgewählter Regeln der Kontrollierten Sprache mit verschiedenen Ansätzen der Maschinellen Übersetzung. (Translation and Multilingual Natural Language Processing 20). Berlin: Language Science Press. DOI: 10.5281/zenodo.7031898

License

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Details about the available publication format: PDF

PDF

ISBN-13 (15)

978-3-96110-394-2

doi

10.5281/zenodo.7031898

Details about the available publication format: Hardcover

Hardcover

ISBN-13 (15)

978-3-98554-052-5

Physical Dimensions

180mm x 245mm