National Competence Centre Finland

The Languages of Finland

Finnish and Swedish are defined as the official languages of Finland. Approximately 90% of the Finnish population (as of 2015) speaks Finnish as their native language. In Sweden, Finish is declared as an official minority language. In the past, many different languages were spoken in Finland, like Sámi languages, Romany, Karelian languages or sign languages. Finnish is part of the Finno-Ugric language group and belongs to the Baltic-Finnish branch like Estonian. The dialects are classified into two categories: east and west. They differ mainly in their pronunciation, some word forms and vocabularies.

Features of Finnish:

  • There is no grammatical genus or articles, instead, words can be inflected by 15 cases.
  • Finnish has a rich inflectional system. Because of the affixes, which mark the syntactical role of the words, the speakers can choose a relatively free word order.
  • Every noun can have up to 2000 word forms and every verb up to 12.000. The word forms are built with several affixes, which can be stacked.
  • New words are build by derivation and composition. In Finnish, the basic words make up just 10-15% of the vocabulary, derivatives are 20-30% and compounds are the majority of 60-70%.
  • Morphophonological features of Finnish are vowel harmony and vocal mutation between stems and endings.

NCC Lead Finland

Dr. Krister Lindén is the Research Director of Language Technology at the Department of Digital Humanities and the Deputy Team Supervisor of the Centre of Excellence in Ancient Near Eastern Empires (ANEE). He received his PhD in Language Technology in 2005 at the University of Helsinki. His research interests focus on language technology application, language resources in research infrastructures and digital humanities applied to ancient near eastern empires. Since 2015, he is the National Coordinator of FIN-CLARIN, the Finnish part of the CLARIN initiative. In addition, he has led the research activities of the Language Bank of Finland since 2010.

NCC lead Finland

Current National Initiatives

  • The government has opened resources and databases produced by government-funded activities.
  • In late 2019, the Ministry of Finance issued a “Development and implementation plan for AuroraAI 2019–2023”, which includes the goal to identify service needs which the citizen expresses in natural language, written or spoken. The reports assume that LT is available for the languages used in Finland, so from 2019, the state-owned development company VAKE has included support for LT development in its strategy for digitalisation.

Wikipedia contributors. (2020, May 15). Finnish language. In Wikipedia, The Free Encyclopedia. Retrieved 17:30, June 15, 2020, https://en.wikipedia.org/wiki/Finnish_language.

Events

2020
4th Regional ELG Workshop: Finland - symbol of elg in colour
- Slides
Regional workshop (online) Helsinki, Finland December 15

META-NET White Paper on Finnish

Kimmo Koskenniemi, Krister Lindén, Lauri Carlson, Martti Vainio, Antti Arppe, Mietta Lennes, Hanna Westerlund, Mirka Hyvärinen, Imre Bartis, Pirkko Nuolijärvi, and Aino Piehl. Suomen kieli digitaalisella aikakaudella - The Finnish Language in the Digital Age. META-NET White Paper Series: Europe's Languages in the Digital Age. Springer, Heidelberg, New York, Dordrecht, London, 9 2012. Georg Rehm and Hans Uszkoreit (series editors).
Full text of this META-NET White Paper (PDF)
Additional information on this META-NET White Paper

Cover of Finnish whitepaper

Availability of Tools and Resources for Finnish (as of 2012)

The following table illustrates the support of the Estonian language through speech technologies, machine translation, text analytics and language resources.

Speech technologies Excellent
support
Good
support
Moderate
support
Fragmentary
support
Weak/no
support
Machine translation Excellent
support
Good
support
Moderate
support
Fragmentary
support
Weak/no
support
Text analytics Excellent
support
Good
support
Moderate
support
Fragmentary
support
Weak/no
support
Language resources Excellent
support
Good
support
Moderate
support
Fragmentary
support
Weak/no
support