Research and technology

Since its inception, Miðeind has prioritized research in language technology and AI alongside software development. Our team has authored numerous peer-reviewed academic papers; a full list is available at the bottom of this page.

Miðeind also contributes to the developer community by releasing various open-source software packages, which can be found on GitHub.

Miðeind cares deeply about the status of Icelandic in AI technology and maintains a well-regarded leaderboard on Hugging Face that evaluates the Icelandic proficiency of Large Language Models. The leaderboard is largely based on our own benchmarks; you can read more about it here.

Grants and awards

As an implementation partner for the government's Language Technology Programme I (2019–2022), Miðeind helped develop essential solutions in machine translation, grammar checking, and NLP infrastructure.

Our work is supported by grants from major competitive funds, including the Technology Development Fund, the Strategic Programme in Language and Technology, Horizon Europe, Fléttan, and Skerfur.

In recognition of our impact, Miðeind received the prestigious Ský IT Award in 2024 for outstanding contributions to the Icelandic IT sector.

Collaboration

Are you a student looking for a thesis topic or a summer internship? We are constantly generating new ideas, often more than we have the capacity to execute ourselves. We strongly believe in academic collaboration and welcome the opportunity to advise students ready to tackle an interesting challenge.

Research papers

Svanhvít Lilja Ingólfsdóttir, Haukur Jónsson, Kári Steinn Aðalsteinsson, Róbert Fjölnir Birkisson, Sveinbjörn Þórðarson, and Þorvaldur Páll Helgason. (2025). Miðeind at WMT25 General Machine Translation Task. Proceedings of the Tenth Conference on Machine Translation, pages 577–582. Suzhou, China.


Þórunn Arnardóttir, Elías Bjartur Einarsson, Garðar Ingvarsson Juto, Þorvaldur Páll Helgason, and Hafsteinn Einarsson. (2025). WikiQA-IS: Assisted Benchmark Generation and Automated Evaluation of Icelandic Cultural Knowledge in LLMs. In Proceedings of the Third Workshop on Resources and Representations for Under-Resourced Languages and Domains (RESOURCEFUL-2025), pages 64–73, Tallinn, Estonia. University of Tartu Library, Estonia.


Þórunn Arnardóttir, Svanhvít Lilja Ingólfsdóttir, Haukur Barri Símonarson, Hafsteinn Einarsson, Anton Karl Ingason, and Vilhjálmur Þorsteinsson. (2024). Beyond Error Categories: A Contextual Approach of Evaluating Emerging Spell and Grammar Checkers. Proceedings of the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages@LREC-COLING 24, pages 45–52.


Svanhvít Lilja Ingólfsdóttir, Pétur Orri Ragnarsson, Haukur Páll Jónsson, Haukur Barri Símonarson, Vilhjálmur Þorsteinsson, and Vésteinn Snæbjarnarson. (2023). Byte-Level Grammatical Error Correction Using Synthetic and Curated Corpora. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics Volume 1: Long Papers, pages 7299–7316.


Hulda Óladóttir, Þórunn Arnardóttir, Anton Karl Ingason, and Vilhjálmur Þorsteinsson. (2022). Developing a Spell and Grammar Checker for Icelandic Using an Error Corpus. Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 4644–4653.


Vésteinn Snæbjarnarson and Hafsteinn Einarsson. (2022). Cross-Lingual QA as a Stepping Stone for Monolingual Open QA in Icelandic. arXiv preprint arXiv:2207.01918.


Vésteinn Snæbjarnarson and Hafsteinn Einarsson. (2022). Natural Questions in Icelandic. Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 4488–4496.


Vésteinn Snæbjarnarson, Haukur Barri Símonarson, Pétur Orri Ragnarsson, Svanhvít Lilja Ingólfsdóttir, Haukur Páll Jónsson, Vilhjálmur Þorsteinsson, and Hafsteinn Einarsson. (2022). A Warm Start and a Clean Crawled Corpus - A Recipe for Good Language Models. arXiv preprint arXiv:2201.05601.


Anna Björk Nikulásdóttir, Þórunn Arnardóttir, Jón Guðnason, Þorsteinn Daði Gunnarsson, Anton Karl Ingason, Haukur Páll Jónsson, Hrafn Loftsson, Hulda Óladóttir, Einar Freyr Sigurðsson, Atli Þór Sigurgeirsson, Vésteinn Snæbjarnarson, and Steinþór Steingrímsson. (2021). Help yourself from the Buffet: National Language Technology Infrastructure Initiative on CLARIN-IS. CLARIN Annual Conference 2021 (online event).


Haukur Barri Símonarson, Vésteinn Snæbjarnarson, Pétur Orri Ragnarsson, Haukur Páll Jónsson, and Vilhjálmur Þorsteinsson. (2021). Miðeind's WMT 2021 submission. arXiv preprint arXiv:2109.07343.


Haukur Barri Símonarson and Vésteinn Snæbjarnarson. (2021). Icelandic Parallel Abstracts Corpus. arXiv preprint arXiv:2108.05289.


Vilhjálmur Þorsteinsson, Hulda Óladóttir, and Hrafn Loftsson. (2019). A Wide-Coverage Context-Free Grammar for Icelandic and an Accompanying Parsing System. In Proceedings of Recent Advances in Natural Language Processing (RANLP 2019). Varna, Bulgaria.


Haukur Páll Jónsson, Haukur Barri Símonarson, Vésteinn Snæbjarnarson, Steinþór Steingrímsson, and Hrafn Loftsson. (2020). Experimenting with Different Machine Translation Models in Medium-Resource Settings. 23rd International Conference on Text, Speech and Dialogue (TSD 2020). Brno, Czech Republic.