ASK Treebank – The Norwegian Second Language Corpus
Extended metadata
- resource Common Info
- resource Type: corpus
- identification Info
- resource Name: ASK Treebank – The Norwegian Second Language Corpus
- description: "ASK Treebank – The Norwegian Second Language Corpus" gives sentence analyses of a subset of the texts available in the ASK corpus (https://hdl.handle.net/11495/DA23-DEB6-9EE5-2). ASK is an electronic, searchable text corpus of Norwegian as a second language, with links between linguistic data and personal data, created by the Norwegian Second Language Corpus project (ASK). The corpus contains written texts produced by language learners from ten different language backgrounds: German, Dutch, English, Spanish, Russian, Polish, Bosnian-Croatian-Serbian, Albanian, Vietnamese and Somali. The size of the corpus and the flexible query system make it possible to develop a new methodological approach to the study of transfer when the L2 is Norwegian. The selection of texts is primarily based on the native language of the test takers, and the typological distribution of these languages is taken into consideration. A corpus of Norwegian as a second language makes it possible to use quantitative methods in second language research, and provides a basis for pedagogical developments. ACCESS: Four texts from the material have been uploaded and parsed as a small treebank in INESS. The full ASK corpus is available in searchable form via the corpus search engine Corpuscle (see links in metadata). One can enter the ASK corpus directly via the Corpuscle main page, or via a direct link to this specific corpus on the ASK project page.
- resource Short Name: ASK
- url: http://clarino.uib.no/iness/landing-page?resource=nob-ask&view=short
- url: http://clarino.uib.no/iness/landing-page?identifier=nob-ask
- url: https://hdl.handle.net/11495/DA23-DEB6-9EE5-2
- url: http://www.uib.no/fg/askeladden/
- url: http://clarino.uib.no/ask/
- PID: hdl:11495/DA66-1C52-ACB5-1
- identifier: nob-ask
- distribution Info
- licence Info
- user Category: Restricted
- distribution Access Medium: accessibleThroughInterface
- execution Location: http://hdl.handle.net/11495/DA66-1C52-ACB5-1
- attribution Text: Tenfjord, Kari; Meurer, Paul; Hofland, Knut. The ASK Corpus — A Language Learner Corpus of Norwegian as a Second Language. Proceedings from 5th International Conference on Language Resources and Evaluation (LREC), Genova 2006. URL http://www.lrec-conf.org/proceedings/lrec2006/pdf/573_pdf
- licence
- licence Family: CLARIN
- licence Name: CLARIN_RES-PRIV
- licence Url: https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaRes?ID=1&PERM=1&PLAN=1&BY=1&PRIV=1&NORED=1
- conditions Of Use: BY
- conditions Of Use: ID
- conditions Of Use: NORED
- conditions Of Use: PERM
- conditions Of Use: PLAN
- conditions Of Use: PRIV
- ipr Holder
- actor Info
- actor Type: organization
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- department Name: Institutt for lingvistiske, litterære og estetiske studier (LLE)
- actor Info
- licence Info
- contact
- actor Info
- actor Type: organization
- organization Info
- organization Name: CLARINO Bergen Centre
- communication Info
- email: clarin@uib.no
- url: https://repo.clarino.uib.no/
- url: https://clarin.b.uib.no
- city: Bergen
- country: Norway
- actor Info
- actor Type: person
- person Info
- surname: Rosén
- given Name: Victoria
- sex: female
- position: Associate Professor
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- communication Info
- email: iness@uib.no
- actor Info
- metadata Info
- metadata Creation Date: 10.02.2016
- metadata Last Date Updated: 10.06.2016
- metadata Creator
- actor Info
- actor Type: person
- person Info
- surname: Lyse
- given Name: Gunn Inger
- sex: female
- position: Researcher (Ph.D.)
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- communication Info
- email: clarin@uib.no
- actor Info
- documentation Unstructured
- role: documentation
- document Unstructured: http://clarino.uib.no/ask/page?page-id=Publikasjoner
- resource Creator
- actor Info
- actor Type: person
- role: Project leader
- person Info
- surname: Tenfjord
- given Name: Kari
- sex: female
- position: Professor
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- department Name: Institutt for lingvistiske, litterære og estetiske studier (LLE)
- actor Info
- project Name: The ASKeladden project (ASKeladden)
- funding Type: nationalFunds
- funder: The Research Council of Norway
- funding Country: Norway
- project Start Date: 2003
- corpus Info
- corpus Type: Treebank
- corpus Part Info
- media Type: text
- corpus Part General Info
- source Work Info
- work Description: The corpus contains person information (anonymized) about, and texts written by, candidates who have completed two different tests in Norwegian: "Språkprøven i norsk for voksne innvandrere" [the language test for adult immigrants] and "Test i norsk – høyere nivå" [Test in Norwegian – higher level]
- linguality Info
- linguality Type: monolingual
- language Info
- language Id: nb
- language Name: Norwegian bokmål
- language Info
- language Id: no
- language Name: Norwegian
- modality Info
- modality Type: writtenLanguage
- size Info
- size: 137
- size Unit: sentences
- size Info
- size: 1864
- size Unit: words
- size Info
- size: 4
- size Unit: texts
- annotation Info
- annotation Type: syntacticAnnotation-treebanks
- annotation Standoff: false
- segmentation Level: sentence
- annotation Format: XLE (Packed c- and f-structures in Prolog)
- tagset: http://prosjekt.digital.uni.no/projects/inesspublic/wiki/NorGram_Lexical_Categories_(Preterminals); http://prosjekt.digital.uni.no/projects/inesspublic/wiki/NorGram_Phrase_Structure_Categories; http://prosjekt.digital.uni.no/projects/inesspublic/wiki/NorGram_F-structure_Features
- theoretic Model: Lexical Functional Grammar (LFG)
- annotation Mode: mixed
- annotation Mode Details: Automatic parsing, manual disambiguation using discriminants.
- annotation Manual Unstructured
- role: annotationManual
- document Unstructured: http://clarino.uib.no/iness/page?page-id=_NorGram_annotator_guidelines_
- annotation Tool
- target Resource Name URI: LFG Parsebanker
- source Work Info
dc:type | corpus |
dc:title | ASK Treebank – The Norwegian Second Language Corpus |
dc:identifier | oai:clarino.uib.no:nob-ask |
dc:description | "ASK Treebank – The Norwegian Second Language Corpus" gives sentence analyses of a subset of the texts available in the ASK corpus (https://hdl.handle.net/11495/DA23-DEB6-9EE5-2). ASK is an electronic, searchable text corpus of Norwegian as a second language, with links between linguistic data and personal data, created by the Norwegian Second Language Corpus project (ASK). The corpus contains written texts produced by language learners from ten different language backgrounds: German, Dutch, English, Spanish, Russian, Polish, Bosnian-Croatian-Serbian, Albanian, Vietnamese and Somali. The size of the corpus and the flexible query system make it possible to develop a new methodological approach to the study of transfer when the L2 is Norwegian. The selection of texts is primarily based on the native language of the test takers, and the typological distribution of these languages is taken into consideration. A corpus of Norwegian as a second language makes it possible to use quantitative methods in second language research, and provides a basis for pedagogical developments. ACCESS: Four texts from the material have been uploaded and parsed as a small treebank in INESS. The full ASK corpus is available in searchable form via the corpus search engine Corpuscle (see links in metadata). One can enter the ASK corpus directly via the Corpuscle main page, or via a direct link to this specific corpus on the ASK project page. |
dc:publisher | |
dc:format | accessibleThroughInterface |
dc:date | |
dc:rights | Restricted |
dc:rights | CLARIN |
dc:rights | CLARIN_RES-PRIV |
dc:rights | https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaRes?ID=1&PERM=1&PLAN=1&BY=1&PRIV=1&NORED=1 |
dc:creator | Kari Tenfjord |
dc:lang | Norwegian bokmål |
dc:lang | Norwegian |