Norsk aviskorpus annotert (2001-2009)

This is a subpart of the Norwegian Newspaper Corpus for Bokmål, automatically annotated with each word's lemma, part of speech (word class) and morphological analysis. Like the full Norwegian Newspaper Corpus, this annotated subpart is freely accessible and represents modern written Norwegian Bokmål. It comprises 35 692 210 tokens and covers the period 2001-2009.
Through the search interface Corpuscle, you can search for any running word in the text (token) and also search by the following attributes: lemma, pos (part of speech), morphology, source (newspaper name), year, date, gender of the author, author, and language (Norwegian Bokmål and Nynorsk).
