About us

Home / Research / Work Package 5

Norwegian Language Technologies

Home / Research / Work Package 5

About us

Home / Research / Work Package 5

/ Introduction

Language technologies are at the core of media technologies. This work package aims to provide datasets and models for Norwegian (Bokmål/Nynorsk) that support the automated understanding as well as the automated production of media texts in this language. 

Objective: WP5 adopts theoretical approaches and methodologies primarily based on linguistic data science, including neural learning. Based on language data in the media from the user partners and data and tools at the research partners, large corpora will be annotated. The labelled examples in these corpora will be used for training and evaluating supervised models that demonstrate advanced approaches in areas such as robust deep language analysis, adaptive language generation, event identification and extraction, and analyzing opinions. The partners will cooperate to explore the use of such models for innovative purposes.

/ Introduction

Language technologies are at the core of media technologies. This work package aims to provide datasets and models for Norwegian (Bokmål/Nynorsk) that support the automated understanding as well as the automated production of media texts in this language. 

Objective: WP5 adopts theoretical approaches and methodologies primarily based on linguistic data science, including neural learning. Based on language data in the media from the user partners and data and tools at the research partners, large corpora will be annotated. The labelled examples in these corpora will be used for training and evaluating supervised models that demonstrate advanced approaches in areas such as robust deep language analysis, adaptive language generation, event identification and extraction, and analyzing opinions. The partners will cooperate to explore the use of such models for innovative purposes.

/ Introduction

Language technologies are at the core of media technologies. This work package aims to provide datasets and models for Norwegian (Bokmål/Nynorsk) that support the automated understanding as well as the automated production of media texts in this language. 

Objective: WP5 adopts theoretical approaches and methodologies primarily based on linguistic data science, including neural learning. Based on language data in the media from the user partners and data and tools at the research partners, large corpora will be annotated. The labelled examples in these corpora will be used for training and evaluating supervised models that demonstrate advanced approaches in areas such as robust deep language analysis, adaptive language generation, event identification and extraction, and analyzing opinions. The partners will cooperate to explore the use of such models for innovative purposes.

/ People

Lilja Øvrelid

Lilja Øvrelid

Work Package Leader

University of Oslo 

Read more
Koenraad De Smedt

Koenraad De Smedt

Work Package Co-Leader

Erik Velldal

Erik Velldal

Key Researcher and Task Leader

University of Oslo 

Read more
Samia Touileb

Samia Touileb

Key Researcher

University of Oslo 

Read more

/ Publications

2020

Touileb, Samia; Øvrelid, Lilja; Velldal, Erik

Gender and sentiment, critics and authors: a dataset of Norwegian book reviews Journal Article

Gender Bias in Natural Language Processing. Association for Computational Linguistics, 2020, (Pre SFI).

Abstract | Links | BibTeX

Barnes, J; Velldal, Erik; Øvrelid, Lilja

Improving sentiment analysis with multi-task learning of negation Journal Article

2020, (Pre SFI).

Links | BibTeX

Barnes, J; Øvrelid, Lilja; Velldal, Erik

Sentiment analysis is not solved! Assessing and probing sentiment classification Proceeding

2020, (Pre SFI).

Links | BibTeX

Adouane, Wafia; Touileb, Samia; Bernardy, Jean-Philippe

Identifying Sentiments in Algerian Code-switched User-generated Comments Conference

2020, (Pre SFI).

Abstract | Links | BibTeX

Meurer, P; Rosén, V; Smedt, Koenraad De

Interactive Visualizations in INESS Book Chapter

Butt, M; Hautli-Janisz, A; (Eds.), Lyding V (Ed.): 2020, (Pre SFI).

Links | BibTeX

Øvrelid, Lilja; Mæhlum, P; Barnes, J; Velldal, Erik

A Fine-Grained Sentiment Dataset for Norwegian Proceeding

2020, (Pre SFI).

Links | BibTeX

Jørgensen, F; Aasmoe, T; Husevåg, ASR; Øvrelid, Lilja; Velldal, Erik (Ed.)

NorNE: Annotating Named Entities for Norwegian Proceeding

2020, (Pre SFI).

Links | BibTeX

Lison, Pierre; Hubin, Aliaksandr; Barnes, Jeremy; Touileb, Samia

Named Entity Recognition without Labelled Data: A Weak Supervision Approach Journal Article

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 1518–1533, 2020, (Pre SFI).

Abstract | Links | BibTeX

de Smedt, Koenraad; Koureas, D; Wittenberg, P

FAIR Digital Objects for Science: From Data Pieces to Actionable Knowledge Units Journal Article

2020, (Pre SFI).

Links | BibTeX

2019

Barnes, Jeremy; Touileb, Samia; Øvrelid, Lilja; Velldal, Erik

Lexicon information in neural sentiment analysis: a multi-task learning approach Conference

Linköping University Electronic Press, 2019, (Pre SFI).

Abstract | Links | BibTeX

2018

Kutuzov, A; Øvrelid, Lilja; Szymanski, T; Velldal, Erik

Diachronic word embeddings and semantic shifts: a survey Proceeding

2018, (Pre SFI).

Links | BibTeX

Velldal, Erik; Øvrelid, Lilja; Bergem, Eivind Alexander; Stadsnes, Cathrine; Touileb, Samia; Jørgensen, Fredrik

NoReC: The Norwegian Review Corpus Proceeding

2018, (Pre SFI).

Abstract | BibTeX

2017

Touileb, Samia; Pedersen, Truls; Sjøvaag, Helle

Automatic identification of unknown names with specific roles Journal Article

Proceedings of the Second Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pp. 150-158, 2017, (Pre SFI).

Abstract | Links | BibTeX

Fares, M; Kutuzov, A; Oepen, S; Velldal, Erik

Word vectors, reuse, and replicability: Towards a community repository of large-text resources Proceeding

2017, (Pre SFI).

Links | BibTeX

2016

Rosén, V; Thunes, M; Haugereid, P; Losnegaard, GS; Dyvik, H; Meurer, P; Lyse, G; Smedt, Koenraad De

The enrichment of lexical resources through incremental parsebanking Journal Article

2016, (Pre SFI).

Links | BibTeX

Dyvik, H; Meurer, P; Rosén, V; Smedt, Koenraad De; Haugereid, P; Losnegaard, GS; Lyse, G; Thunes, M

NorGramBank: A 'Deep' Treebank for Norwegian.Proceedings of LREC Proceeding

2016, (Pre SFI).

Links | BibTeX

Rosén, V; Smedt, Koenraad De; Losnegaard, GS; Bejcek, E; Savary, A; Osenova, P

MWEs in Treebanks: From Survey to Guidelines Proceeding

2016, (Pre SFI).

Links | BibTeX

Øvrelid, Lilja; Hohle, P

Universal dependencies for Norwegian Proceeding

2016, (Pre SFI).

Links | BibTeX

2012

Lapponi, E; Read, J; Øvrelid, Lilja

Representing and resolving negation for sentiment analysis Proceeding

2012, (Pre SFI).

Links | BibTeX

Velldal, Erik; Øvrelid, Lilja; Read, J; Oepen, S

Speculation and negation: Rules, rankers, and the role of syntax Journal Article

2012, (Pre SFI).

Links | BibTeX

/ Publications

2020

Touileb, Samia; Øvrelid, Lilja; Velldal, Erik

Gender and sentiment, critics and authors: a dataset of Norwegian book reviews Journal Article

Gender Bias in Natural Language Processing. Association for Computational Linguistics, 2020, (Pre SFI).

Abstract | Links | BibTeX

Barnes, J; Velldal, Erik; Øvrelid, Lilja

Improving sentiment analysis with multi-task learning of negation Journal Article

2020, (Pre SFI).

Links | BibTeX

Barnes, J; Øvrelid, Lilja; Velldal, Erik

Sentiment analysis is not solved! Assessing and probing sentiment classification Proceeding

2020, (Pre SFI).

Links | BibTeX

Adouane, Wafia; Touileb, Samia; Bernardy, Jean-Philippe

Identifying Sentiments in Algerian Code-switched User-generated Comments Conference

2020, (Pre SFI).

Abstract | Links | BibTeX

Meurer, P; Rosén, V; Smedt, Koenraad De

Interactive Visualizations in INESS Book Chapter

Butt, M; Hautli-Janisz, A; (Eds.), Lyding V (Ed.): 2020, (Pre SFI).

Links | BibTeX

Øvrelid, Lilja; Mæhlum, P; Barnes, J; Velldal, Erik

A Fine-Grained Sentiment Dataset for Norwegian Proceeding

2020, (Pre SFI).

Links | BibTeX

Jørgensen, F; Aasmoe, T; Husevåg, ASR; Øvrelid, Lilja; Velldal, Erik (Ed.)

NorNE: Annotating Named Entities for Norwegian Proceeding

2020, (Pre SFI).

Links | BibTeX

Lison, Pierre; Hubin, Aliaksandr; Barnes, Jeremy; Touileb, Samia

Named Entity Recognition without Labelled Data: A Weak Supervision Approach Journal Article

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 1518–1533, 2020, (Pre SFI).

Abstract | Links | BibTeX

de Smedt, Koenraad; Koureas, D; Wittenberg, P

FAIR Digital Objects for Science: From Data Pieces to Actionable Knowledge Units Journal Article

2020, (Pre SFI).

Links | BibTeX

2019

Barnes, Jeremy; Touileb, Samia; Øvrelid, Lilja; Velldal, Erik

Lexicon information in neural sentiment analysis: a multi-task learning approach Conference

Linköping University Electronic Press, 2019, (Pre SFI).

Abstract | Links | BibTeX

2018

Kutuzov, A; Øvrelid, Lilja; Szymanski, T; Velldal, Erik

Diachronic word embeddings and semantic shifts: a survey Proceeding

2018, (Pre SFI).

Links | BibTeX

Velldal, Erik; Øvrelid, Lilja; Bergem, Eivind Alexander; Stadsnes, Cathrine; Touileb, Samia; Jørgensen, Fredrik

NoReC: The Norwegian Review Corpus Proceeding

2018, (Pre SFI).

Abstract | BibTeX

2017

Touileb, Samia; Pedersen, Truls; Sjøvaag, Helle

Automatic identification of unknown names with specific roles Journal Article

Proceedings of the Second Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pp. 150-158, 2017, (Pre SFI).

Abstract | Links | BibTeX

Fares, M; Kutuzov, A; Oepen, S; Velldal, Erik

Word vectors, reuse, and replicability: Towards a community repository of large-text resources Proceeding

2017, (Pre SFI).

Links | BibTeX

2016

Rosén, V; Thunes, M; Haugereid, P; Losnegaard, GS; Dyvik, H; Meurer, P; Lyse, G; Smedt, Koenraad De

The enrichment of lexical resources through incremental parsebanking Journal Article

2016, (Pre SFI).

Links | BibTeX

Dyvik, H; Meurer, P; Rosén, V; Smedt, Koenraad De; Haugereid, P; Losnegaard, GS; Lyse, G; Thunes, M

NorGramBank: A 'Deep' Treebank for Norwegian.Proceedings of LREC Proceeding

2016, (Pre SFI).

Links | BibTeX

Rosén, V; Smedt, Koenraad De; Losnegaard, GS; Bejcek, E; Savary, A; Osenova, P

MWEs in Treebanks: From Survey to Guidelines Proceeding

2016, (Pre SFI).

Links | BibTeX

Øvrelid, Lilja; Hohle, P

Universal dependencies for Norwegian Proceeding

2016, (Pre SFI).

Links | BibTeX

2012

Lapponi, E; Read, J; Øvrelid, Lilja

Representing and resolving negation for sentiment analysis Proceeding

2012, (Pre SFI).

Links | BibTeX

Velldal, Erik; Øvrelid, Lilja; Read, J; Oepen, S

Speculation and negation: Rules, rankers, and the role of syntax Journal Article

2012, (Pre SFI).

Links | BibTeX

/ Publications

2020

Touileb, Samia; Øvrelid, Lilja; Velldal, Erik

Gender and sentiment, critics and authors: a dataset of Norwegian book reviews Journal Article

Gender Bias in Natural Language Processing. Association for Computational Linguistics, 2020, (Pre SFI).

Abstract | Links | BibTeX

Barnes, J; Velldal, Erik; Øvrelid, Lilja

Improving sentiment analysis with multi-task learning of negation Journal Article

2020, (Pre SFI).

Links | BibTeX

Barnes, J; Øvrelid, Lilja; Velldal, Erik

Sentiment analysis is not solved! Assessing and probing sentiment classification Proceeding

2020, (Pre SFI).

Links | BibTeX

Adouane, Wafia; Touileb, Samia; Bernardy, Jean-Philippe

Identifying Sentiments in Algerian Code-switched User-generated Comments Conference

2020, (Pre SFI).

Abstract | Links | BibTeX

Meurer, P; Rosén, V; Smedt, Koenraad De

Interactive Visualizations in INESS Book Chapter

Butt, M; Hautli-Janisz, A; (Eds.), Lyding V (Ed.): 2020, (Pre SFI).

Links | BibTeX

Øvrelid, Lilja; Mæhlum, P; Barnes, J; Velldal, Erik

A Fine-Grained Sentiment Dataset for Norwegian Proceeding

2020, (Pre SFI).

Links | BibTeX

Jørgensen, F; Aasmoe, T; Husevåg, ASR; Øvrelid, Lilja; Velldal, Erik (Ed.)

NorNE: Annotating Named Entities for Norwegian Proceeding

2020, (Pre SFI).

Links | BibTeX

Lison, Pierre; Hubin, Aliaksandr; Barnes, Jeremy; Touileb, Samia

Named Entity Recognition without Labelled Data: A Weak Supervision Approach Journal Article

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 1518–1533, 2020, (Pre SFI).

Abstract | Links | BibTeX

de Smedt, Koenraad; Koureas, D; Wittenberg, P

FAIR Digital Objects for Science: From Data Pieces to Actionable Knowledge Units Journal Article

2020, (Pre SFI).

Links | BibTeX

2019

Barnes, Jeremy; Touileb, Samia; Øvrelid, Lilja; Velldal, Erik

Lexicon information in neural sentiment analysis: a multi-task learning approach Conference

Linköping University Electronic Press, 2019, (Pre SFI).

Abstract | Links | BibTeX

2018

Kutuzov, A; Øvrelid, Lilja; Szymanski, T; Velldal, Erik

Diachronic word embeddings and semantic shifts: a survey Proceeding

2018, (Pre SFI).

Links | BibTeX

Velldal, Erik; Øvrelid, Lilja; Bergem, Eivind Alexander; Stadsnes, Cathrine; Touileb, Samia; Jørgensen, Fredrik

NoReC: The Norwegian Review Corpus Proceeding

2018, (Pre SFI).

Abstract | BibTeX

2017

Touileb, Samia; Pedersen, Truls; Sjøvaag, Helle

Automatic identification of unknown names with specific roles Journal Article

Proceedings of the Second Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pp. 150-158, 2017, (Pre SFI).

Abstract | Links | BibTeX

Fares, M; Kutuzov, A; Oepen, S; Velldal, Erik

Word vectors, reuse, and replicability: Towards a community repository of large-text resources Proceeding

2017, (Pre SFI).

Links | BibTeX

2016

Rosén, V; Thunes, M; Haugereid, P; Losnegaard, GS; Dyvik, H; Meurer, P; Lyse, G; Smedt, Koenraad De

The enrichment of lexical resources through incremental parsebanking Journal Article

2016, (Pre SFI).

Links | BibTeX

Dyvik, H; Meurer, P; Rosén, V; Smedt, Koenraad De; Haugereid, P; Losnegaard, GS; Lyse, G; Thunes, M

NorGramBank: A 'Deep' Treebank for Norwegian.Proceedings of LREC Proceeding

2016, (Pre SFI).

Links | BibTeX

Rosén, V; Smedt, Koenraad De; Losnegaard, GS; Bejcek, E; Savary, A; Osenova, P

MWEs in Treebanks: From Survey to Guidelines Proceeding

2016, (Pre SFI).

Links | BibTeX

Øvrelid, Lilja; Hohle, P

Universal dependencies for Norwegian Proceeding

2016, (Pre SFI).

Links | BibTeX

2012

Lapponi, E; Read, J; Øvrelid, Lilja

Representing and resolving negation for sentiment analysis Proceeding

2012, (Pre SFI).

Links | BibTeX

Velldal, Erik; Øvrelid, Lilja; Read, J; Oepen, S

Speculation and negation: Rules, rankers, and the role of syntax Journal Article

2012, (Pre SFI).

Links | BibTeX