The fourth annual ELG conference



Together Towards Digital Language Equality

Brussels, Belgium
Hybrid conference

Berlin Skyline
© Adobe Stock – Sergij Figurnyi

Project Expo

Project Profile

Project abbreviation: OPUS-MT

Project name: Open Translation Models, Tools and Services

Project coordinator: Jörg Tiedemann

Project consortium: University of Helsinki

Funding: ELG (European Language Grid) Pilot Projects Open Call 1

Project duration: August 2020 – August 2021

Main key words: machine translation (MT), neural machine translation (NMT), computer-aided translation (CAT), translation services, low resource languages

Background of the research topic: Machine translation is an important application that connects people with different backgrounds and makes information accessible across language boundaries. Automatic translation is also one of the most prominent research topics in language technology. However, most models and tools are unavailable for research and development and translation services are dominated by commercial providers. It is important to release translation models to reduce dependence on commercial products, increase transparency and drastically increase reuse, replicability and sustainability of machine translation approaches.

Goal of the project: The project aims at the development of open models and services for machine translation to be used in research and development. The main focus is set on Nordic European languages and European minority languages. We also want to support the workflow of professional translators with a seamless integration of MT in their production environment.

Project abstract: OPUS-MT will produce state-of-the-art neural machine translation models that can freely be shared, re-used and integrated in open web services and professional translation workflows. The project will focus on European minority languages and their improved support through multilingual NMT models and transfer learning. Furthermore, OPUS-MT will deliver easily deployable translation services and tools for quick domain-adaptation and on-demand personalisation. We will emphasise open resources that can freely be distributed and used in research and professional applications. For the latter we want to offer local solutions that are independent of on-line services to avoid security risks with open data transfer.