Norwegian Language Technologies

/ Introduction

Language technologies are at the core of media technologies. This work package aims to provide datasets and models for Norwegian (Bokmål/Nynorsk) that support the automated understanding as well as the automated production of media texts in this language. 

Objective: WP5 adopts theoretical approaches and methodologies based primarily on linguistic data science, including neural learning. Large corpora will be annotated, drawing on media language data from the user partners and on data and tools from the research partners. The labelled examples in these corpora will be used to train and evaluate supervised models that demonstrate advanced approaches in areas such as robust deep language analysis, adaptive language generation, event identification and extraction, and opinion analysis. The partners will cooperate to explore the use of such models for innovative purposes.

/ People

Lilja Øvrelid

Work Package Leader

University of Oslo 

Koenraad De Smedt

Work Package Co-Leader

Erik Velldal

Key Researcher and Task Leader

University of Oslo 

Samia Touileb

Researcher

/ Publications

2020

Gender and sentiment, critics and authors: a dataset of Norwegian book reviews Journal Article

Touileb, Samia; Øvrelid, Lilja; Velldal, Erik

Gender Bias in Natural Language Processing. Association for Computational Linguistics, 2020, (Pre SFI).

Improving sentiment analysis with multi-task learning of negation Journal Article

Barnes, J; Velldal, Erik; Øvrelid, Lilja

2020, (Pre SFI).

Sentiment analysis is not solved! Assessing and probing sentiment classification Proceeding

Barnes, J; Øvrelid, Lilja; Velldal, Erik

2020, (Pre SFI).

Identifying Sentiments in Algerian Code-switched User-generated Comments Conference

Adouane, Wafia; Touileb, Samia; Bernardy, Jean-Philippe

2020, (Pre SFI).

Interactive Visualizations in INESS Book Chapter

Meurer, P; Rosén, V; De Smedt, Koenraad

Butt, M; Hautli-Janisz, A; Lyding, V (Eds.): 2020, (Pre SFI).

A Fine-Grained Sentiment Dataset for Norwegian Proceeding

Øvrelid, Lilja; Mæhlum, P; Barnes, J; Velldal, Erik

2020, (Pre SFI).

NorNE: Annotating Named Entities for Norwegian Proceeding

Jørgensen, F; Aasmoe, T; Husevåg, ASR; Øvrelid, Lilja; Velldal, Erik

2020, (Pre SFI).

Named Entity Recognition without Labelled Data: A Weak Supervision Approach Journal Article

Lison, Pierre; Hubin, Aliaksandr; Barnes, Jeremy; Touileb, Samia

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 1518–1533, 2020, (Pre SFI).

FAIR Digital Objects for Science: From Data Pieces to Actionable Knowledge Units Journal Article

De Smedt, Koenraad; Koureas, D; Wittenberg, P

2020, (Pre SFI).

2019

Lexicon information in neural sentiment analysis: a multi-task learning approach Conference

Barnes, Jeremy; Touileb, Samia; Øvrelid, Lilja; Velldal, Erik

Linköping University Electronic Press, 2019, (Pre SFI).

2018

Diachronic word embeddings and semantic shifts: a survey Proceeding

Kutuzov, A; Øvrelid, Lilja; Szymanski, T; Velldal, Erik

2018, (Pre SFI).

NoReC: The Norwegian Review Corpus Proceeding

Velldal, Erik; Øvrelid, Lilja; Bergem, Eivind Alexander; Stadsnes, Cathrine; Touileb, Samia; Jørgensen, Fredrik

2018, (Pre SFI).

2017

Automatic identification of unknown names with specific roles Journal Article

Touileb, Samia; Pedersen, Truls; Sjøvaag, Helle

Proceedings of the Second Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pp. 150-158, 2017, (Pre SFI).

Word vectors, reuse, and replicability: Towards a community repository of large-text resources Proceeding

Fares, M; Kutuzov, A; Oepen, S; Velldal, Erik

2017, (Pre SFI).

2016

The enrichment of lexical resources through incremental parsebanking Journal Article

Rosén, V; Thunes, M; Haugereid, P; Losnegaard, GS; Dyvik, H; Meurer, P; Lyse, G; De Smedt, Koenraad

2016, (Pre SFI).

NorGramBank: A 'Deep' Treebank for Norwegian. Proceedings of LREC Proceeding

Dyvik, H; Meurer, P; Rosén, V; De Smedt, Koenraad; Haugereid, P; Losnegaard, GS; Lyse, G; Thunes, M

2016, (Pre SFI).

MWEs in Treebanks: From Survey to Guidelines Proceeding

Rosén, V; De Smedt, Koenraad; Losnegaard, GS; Bejcek, E; Savary, A; Osenova, P

2016, (Pre SFI).

Universal dependencies for Norwegian Proceeding

Øvrelid, Lilja; Hohle, P

2016, (Pre SFI).

2012

Representing and resolving negation for sentiment analysis Proceeding

Lapponi, E; Read, J; Øvrelid, Lilja

2012, (Pre SFI).

Speculation and negation: Rules, rankers, and the role of syntax Journal Article

Velldal, Erik; Øvrelid, Lilja; Read, J; Oepen, S

2012, (Pre SFI).

