Skip to content

Norsk Ordbank – Norwegian Nynorsk 2012

Norsk Ordbank – Nynorsk is a lexical database reflecting the official spelling reform that took effect on 1 August 2012, and later adjustments to the spelling for Norwegian Nynorsk.

The database consists of a basic vocabulary (lemmas) and a set of inflectional patterns. Each lemma has one or more inflectional patterns connected with it. Each inflectional pattern contains a number of lines that spells out every inflected form of the lemma. Each line contains a transformation pattern and information about word class and morphological features. The pattern shows how the base word can be expanded into an inflected form.

The data is stored in seven tables. The table “lemma” contains all entries in Nynorskordboka (an official Nynorsk dictionary) with specification of article number. The list of full forms contains all possible inflected forms of all entries, in accord with current official spelling. (Note that this table also contains forms that are a result of overgeneration, e.g. the putative plural form ‘snøar’ (‘snows’) of ‘snø’ (‘snow’).

The tables “lemma_paradigme”, “paradigme”, “paradigme_boying”, “boyingsgruppe” and “boying” contain information that is necessary to generate the full forms based on the basic vocabulary (“lemma”). In other words, they contain the link between the lemmas and inflectional patterns, rules and categorial information.

The table “leddanalyse” contains information on the decomposition of compund words. In Nynorskordboka, decomposition is indicated with a vertical line, e.g. ‘post|boks’ (‘P.O.|box’).

The fullform list contains information about argument structure for some verbs. The argument structure codes used are explained in the file “norsk_ordbank_argstr.txt”.

Please note that this is a dump of the database athe state it was in on 1 February 2022. The latest version (1 February 2022) contains 117,445 lemmas.

Norsk Ordbank – Nynorsk is a lexical database reflecting the official spelling reform that took effect on 1 August 2012, and later adjustments to the spelling for Norwegian Nynorsk.

The database consists of a basic vocabulary (lemmas) and a set of inflectional patterns. Each lemma has one or more inflectional patterns connected with it. Each inflectional pattern contains a number of lines that spells out every inflected form of the lemma. Each line contains a transformation pattern and information about word class and morphological features. The pattern shows how the base word can be expanded into an inflected form.

The data is stored in seven tables. The table “lemma” contains all entries in Nynorskordboka (an official Nynorsk dictionary) with specification of article number. The list of full forms contains all possible inflected forms of all entries, in accord with current official spelling. (Note that this table also contains forms that are a result of overgeneration, e.g. the putative plural form ‘snøar’ (‘snows’) of ‘snø’ (‘snow’).

The tables “lemma_paradigme”, “paradigme”, “paradigme_boying”, “boyingsgruppe” and “boying” contain information that is necessary to generate the full forms based on the basic vocabulary (“lemma”). In other words, they contain the link between the lemmas and inflectional patterns, rules and categorial information.

The table “leddanalyse” contains information on the decomposition of compund words. In Nynorskordboka, decomposition is indicated with a vertical line, e.g. ‘post|boks’ (‘P.O.|box’).

The fullform list contains information about argument structure for some verbs. The argument structure codes used are explained in the file “norsk_ordbank_argstr.txt”.

Please note that this is a dump of the database athe state it was in on 1 February 2022. The latest version (1 February 2022) contains 117,445 lemmas.

Extended metadata

Download resources

Download metadata