Normkorpuset
Extended metadata
- resource Common Info:
- resource Type: corpus
- identification Info:
- resource Name: Normkorpuset
- resource Name: The NORM Corpus
- description: More than 5000 texts written by pupils in Norwegian primary schools (ages 8–13). The corpus has three versions of each text: a scanned original in PDF format and two transcribed versions in txt format, one transcribed as written with the pupil's errors and one in which the errors are corrected. All versions are linked, and both transcribed versions are searchable. The corpus is one of the outcomes of the project "Developing national standards for the assessment of writing – a tool for teaching and learning" (The NORM Project).
- description: Mer enn 5000 tekster skrevet av elever i norske barneskoler (alder fra 8 – 13 år). Korpuset har tre ulike versjoner av hver elevtekst: en skannet original i pdf-format og to transkriberte i txt-format, den ene versjonen med feil. I den andre versjonen er feilene rettet. Versjonene er lenket til hverandre, og det er mulig å søke i begge de transkriberte versjonene. Korpuset er et av resultatene fra prosjektet "Developing national standards for the assessment of writing – a tool for teaching and learning" (The NORM Project).
- resource Short Name: NORM
- url: http://norm.skrivesenteret.no/
- url: http://www.hf.uio.no/iln/english/about/organization/text-laboratory/projects/norm/index.html
- PID: http://hdl.handle.net/11538/0000-000B-C021-6
- distribution Info:
- licence Info:
- user Category: Academic
- distribution Access Medium: accessibleThroughInterface
- execution Location: https://tekstlab.uio.no/glossa3/norm
- licence:
- licence Family: CLARIN
- licence Name: CLARIN_ACA-NC-LOC-ND
- licence Url: https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&LOC=1&NORED=1&ND=1
- conditions Of Use: BY
- conditions Of Use: ID
- conditions Of Use: LOC
- conditions Of Use: NC
- conditions Of Use: ND
- conditions Of Use: NORED
- non Standard Conditions Of Use: Due to agreements with the text contributors, the texts are available only through Glossa, a search and post-processing tool developed by the Text Laboratory. The texts should be used with caution and respect, because they were written by children in authentic educational situations.
- licensor:
- actor Info:
- actor Type: organization
- organization Info:
- organization Name: Norges teknisk-naturvitenskapelige universitet
- organization Name: Norwegian University of Science and Technology
- organization Short Name: NTNU
- organization Short Name: NTNU
- department Name: Department for Teacher Education
- department Name: Institutt for lærerutdanning
- communication Info:
- email: synnove.matre@ntnu.no
- email: randi.solheim@ntnu.no
- url: https://www.ntnu.no/ilu
- distribution Rights Holder:
- actor Info:
- actor Type: organization
- organization Info:
- organization Name: University of Oslo
- organization Name: Universitetet i Oslo
- organization Short Name: UiO
- organization Short Name: UoO
- department Name: Department of Linguistics and Scandinavian Studies
- department Name: Institutt for lingvistiske og nordiske studier (ILN)
- communication Info:
- email: tekstlab-post@iln.uio.no
- url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
- address: Box 1102 Blindern
- zip Code: 0317
- city: OSLO
- country: Norway
- actor Info:
- actor Type: person
- person Info:
- surname: Matre
- given Name: Synnøve
- position: Professor
- affiliation:
- organization Info:
- organization Name: Norges teknisk-naturvitenskapelige universitet
- organization Name: Norwegian University of Science and Technology
- organization Short Name: NTNU
- organization Short Name: NTNU
- department Name: Department for Teacher Education
- department Name: Institutt for lærerutdanning
- actor Info:
- actor Type: organization
- organization Info:
- organization Name: The Text Laboratory
- organization Short Name: Textlab
- department Name: Department of Linguistics and Scandinavian Studies, University of Oslo
- actor Info:
- actor Type: person
- person Info:
- surname: Hagen
- given Name: Kristin
- actor Info:
- actor Type: organization
- organization Info:
- organization Name: Developing national standards for the assessment of writing – a tool for teaching and learning (Normprosjektet, 2012–2016)
- organization Name: Developing national standards for the assessment of writing – a tool for teaching and learning (The NORM Project, 2012–2016)
- organization Short Name: Normprosjektet
- organization Short Name: The NORM Project
- corpus Info:
- corpus Type: Written Corpus
- corpus Part Info:
- media Type: text
- corpus Text Info:
- text Format Info:
- mime Type: txt
- character Encoding Info:
- character Encoding: utf-8
- size Per Character Encoding:
- size Info:
- size: 1250756
- size Unit: tokens
- corpus Part General Info:
- source Work Info:
- work Description: Texts written by pupils in Norwegian primary schools. The texts differ in length, genre and type.
- language Info:
- language Id: Nb
- language Name: Norwegian Bokmål
- size Per Language:
- size Info:
- size: 4247
- size Unit: texts
- language Info:
- language Id: Nn
- language Name: Norwegian Nynorsk
- size Per Language:
- size Info:
- size: 949
- size Unit: texts
- modality Info:
- modality Type: writtenLanguage
- size Info:
- size: 1250756
- size Unit: tokens
- annotation Info:
- annotation Type: lemmatization
- annotation Type: morphosyntacticAnnotation-posTagging
- segmentation Level: word
- tagset: The Oslo-Bergen Tagger tagset: http://tekstlab.uio.no/obt-ny/english/index.html
- tagset Language Id: Nb
- tagset Language Name: Norwegian Bokmål
- theoretic Model: Constraint Grammar
- annotation Mode: automatic
- annotation Manual Unstructured:
- role: annotationManual
- document Unstructured: http://www.tekstlab.uio.no/obt-ny/english/index.html
- annotation Tool:
- target Resource Name URI: The Oslo-Bergen Tagger: http://tekstlab.uio.no/obt-ny/english/index.html
- classification Info:
- genre Info:
- genre Type: textGenre
- genre: unstandardised
- unstandardised Genre: Texts written by pupils in Norwegian primary schools. The texts are available in three different versions: one scanned original in pdf format and two transcribed versions in txt format: one original transcription with errors and one version where the errors are corrected. All versions are linked and it is possible to search in both transcribed versions.
- time Coverage Info:
- time Coverage: The texts were mostly written in 2012-2014
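
The size figures in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are made up for the example, and the per-text average is a derived figure that the record itself does not state.

```python
# Figures copied from the size fields of this record.
TEXTS_PER_LANGUAGE = {
    "Norwegian Bokmål": 4247,
    "Norwegian Nynorsk": 949,
}
TOTAL_TOKENS = 1_250_756

# Total number of texts across both written standards.
total_texts = sum(TEXTS_PER_LANGUAGE.values())

# Derived values (not stated in the record): mean text length and Nynorsk share.
mean_tokens_per_text = TOTAL_TOKENS / total_texts
nynorsk_share = TEXTS_PER_LANGUAGE["Norwegian Nynorsk"] / total_texts

print(f"texts in total: {total_texts}")
print(f"mean tokens per text: {mean_tokens_per_text:.1f}")
print(f"Nynorsk share: {nynorsk_share:.1%}")
```

The total is consistent with the description's "more than 5000 texts".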
dc:type | corpus |
dc:title | Normkorpuset |
dc:identifier | oai:tekstlab.uio.no:norm |
dc:description | More than 5000 texts written by pupils in Norwegian primary schools (ages 8–13). The corpus has three versions of each pupil text: a scanned original in PDF format and two transcribed versions in txt format, one of which retains the errors. In the other version the errors are corrected. The versions are linked to each other, and both transcribed versions can be searched. The corpus is one of the results of the project "Developing national standards for the assessment of writing – a tool for teaching and learning" (The NORM Project). |
dc:publisher | |
dc:format | accessibleThroughInterface |
dc:date | 2013-01-01 |
dc:date | 2016-04-01 |
dc:rights | Academic |
dc:rights | CLARIN |
dc:rights | CLARIN_ACA-NC-LOC-ND |
dc:rights | https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&LOC=1&NORED=1&ND=1 |
dc:creator | Developing national standards for the assessment of writing – a tool for teaching and learning (Normprosjektet, 2012–2016) |
dc:creator | The Text Laboratory |
dc:creator | Nasjonalt senter for skriveopplæring og skriveforsking |
dc:lang | nb |
dc:lang | nn |
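
The licence URL in this record carries its terms as query parameters, and these appear to mirror the listed conditions of use. The following is a minimal sketch of that comparison, assuming that a parameter value of `1` means the condition applies. That reading is an assumption, not something the record documents, and `enabled_flags` is a hypothetical helper name.

```python
from urllib.parse import parse_qs, urlsplit

# Licence URL as given in this record.
LICENCE_URL = (
    "https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca"
    "?ID=1&AFFIL=EDU&BY=1&NC=1&LOC=1&NORED=1&ND=1"
)

# Conditions of use as listed in this record.
LISTED_CONDITIONS = {"BY", "ID", "LOC", "NC", "ND", "NORED"}


def enabled_flags(url: str) -> set[str]:
    """Return the query parameters whose value is exactly "1".

    Assumption: "1" marks a condition as applying; other values
    (such as AFFIL=EDU) are treated as non-flag parameters.
    """
    query = parse_qs(urlsplit(url).query)
    return {name for name, values in query.items() if values == ["1"]}


flags = enabled_flags(LICENCE_URL)
print(sorted(flags))
print("matches listed conditions:", flags == LISTED_CONDITIONS)
```

Under that assumption, the remaining parameter `AFFIL=EDU` would correspond to the Academic user category, which is again an interpretation and not a documented mapping.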