Interactive machine translation for the language modernization and spelling normalization of historical documents
Pattern Analysis and Applications (IF 3.7), Pub Date: 2023-04-24, DOI: 10.1007/s10044-023-01164-w
Miguel Domingo, Francisco Casacuberta

Historical documents are an important part of our cultural heritage. Among other tasks related to their processing, it is important to modernize their language to make them accessible to a broader audience, and to achieve orthographic consistency to reduce the linguistic variation inherent in them. Language modernization and spelling normalization pursue these two goals, respectively. However, both still have a long way to go. Thus, to help scholars generate error-free modernizations and normalizations when quality is essential, we propose a framework based on interactive machine translation. In this work, we applied two different interactive protocols to these tasks. We evaluated our proposal under simulated environments, observing significant reductions in human effort.



