Norwegian-BokmaalNARC and Norwegian-NynorskNARC are conversions of the Bokmål and Nynorsk parts of the Norwegian Anaphora Resolution Corpus (NARC), respectively. This is the first publicly available corpus annotated with anaphoric relations between noun phrases for Norwegian.
The annotation is made on top of and enriches the existing annotation in the Norwegian Dependency Treebank (NDT). The resulting corpus contains a total of 15,742 sentences and 245,515 tokens for Norwegian Bokmål, and 12,481 sentences and 206,660 tokens for Norwegian Nynorsk.
The accompanying paper by Mæhlum et al. (from CRAC 2022) describes the annotation effort in more detail.
Norwegian-BokmaalNARC and Norwegian-NynorskNARC are conversions of the Bokmål and Nynorsk parts of the Norwegian Anaphora Resolution Corpus (NARC), respectively. This is the first publicly available corpus annotated with anaphoric relations between noun phrases for Norwegian.
The annotation is made on top of and enriches the existing annotation in the Norwegian Dependency Treebank (NDT). The resulting corpus contains a total of 15,742 sentences and 245,515 tokens for Norwegian Bokmål, and 12,481 sentences and 206,660 tokens for Norwegian Nynorsk.
The accompanying paper by Mæhlum et al. (from CRAC 2022) describes the annotation effort in more detail.
Extended metadata
resource Common Info:
resource Type: corpus
identification Info:
resource Name: Norwegian Anaphora Resolution Corpus
resource Name: Norsk koreferansekorpus
description: Norwegian-BokmaalNARC and Norwegian-NynorskNARC are conversions of the Bokmål and Nynorsk parts of the Norwegian Anaphora Resolution Corpus (NARC), respectively. This is the first publicly available corpus annotated with anaphoric relations between noun phrases for Norwegian.
The annotation is made on top of and enriches the existing annotation in the Norwegian Dependency Treebank (NDT). The resulting corpus contains a total of 15,742 sentences and 245,515 tokens for Norwegian Bokmål, and 12,481 sentences and 206,660 tokens for Norwegian Nynorsk.
The accompanying paper by Mæhlum et al. (from CRAC 2022) describes the annotation effort in more detail.
description: Norwegian-BokmaalNARC og Norwegian-NynorskNARC er konverteringar av høvesvis bokmåls- og nynorskdelen av Norwegian Anaphora Resolution Corpus (NARC), det første offentleg tilgjengelege korpuset annotert med anaforiske relasjonar mellom substantivfrasar for norsk.
Annoteringa er gjort på toppen av og berikar den eksisterande annoteringa til Norsk dependenstrebank (NDT). Korpuset inneheld totalt 15.742 setningar og 245.515 "tokens" for bokmål, og 12.481 setningar og 206.660 "tokens" for nynorsk.
Den vedlagde artikkelen av Mæhlum et al. (frå CRAC 2022) skildrar annoteringsarbeidet i detalj.
modality Type Details: Blog text, news text, parliament proceedings, government white papers
annotation Info:
annotation Type: discourseAnnotation-coreference
dc:type
corpus
dc:title
Norwegian Anaphora Resolution Corpus
dc:identifier
oai:nb.no:sbr-82
dc:description
Norwegian-BokmaalNARC and Norwegian-NynorskNARC are conversions of the Bokmål and Nynorsk parts of the Norwegian Anaphora Resolution Corpus (NARC), respectively. This is the first publicly available corpus annotated with anaphoric relations between noun phrases for Norwegian.
The annotation is made on top of and enriches the existing annotation in the Norwegian Dependency Treebank (NDT). The resulting corpus contains a total of 15,742 sentences and 245,515 tokens for Norwegian Bokmål, and 12,481 sentences and 206,660 tokens for Norwegian Nynorsk.
The accompanying paper by Mæhlum et al. (from CRAC 2022) describes the annotation effort in more detail.