Skip to content

Norwegian Anaphora Resolution Corpus

Norwegian-BokmaalNARC and Norwegian-NynorskNARC are conversions of the Bokmål and Nynorsk parts of the Norwegian Anaphora Resolution Corpus (NARC), respectively. This is the first publicly available corpus annotated with anaphoric relations between noun phrases for Norwegian.

The annotation is made on top of and enriches the existing annotation in the Norwegian Dependency Treebank (NDT). The resulting corpus contains a total of 15,742 sentences and 245,515 tokens for Norwegian Bokmål, and 12,481 sentences and 206,660 tokens for Norwegian Nynorsk.

The accompanying paper by Mæhlum et al. (from CRAC 2022) describes the annotation effort in more detail.

Norwegian-BokmaalNARC and Norwegian-NynorskNARC are conversions of the Bokmål and Nynorsk parts of the Norwegian Anaphora Resolution Corpus (NARC), respectively. This is the first publicly available corpus annotated with anaphoric relations between noun phrases for Norwegian.

The annotation is made on top of and enriches the existing annotation in the Norwegian Dependency Treebank (NDT). The resulting corpus contains a total of 15,742 sentences and 245,515 tokens for Norwegian Bokmål, and 12,481 sentences and 206,660 tokens for Norwegian Nynorsk.

The accompanying paper by Mæhlum et al. (from CRAC 2022) describes the annotation effort in more detail.

Extended metadata

Download resources

Download metadata