ASK Treebank – The Norwegian Second Language Corpus

“ASK Treebank – The Norwegian Second Language Corpus” gives sentence analyses of a subset of the texts available in the ASK corpus (https://hdl.handle.net/11495/DA23-DEB6-9EE5-2). ASK is an electronic, searchable text corpus of Norwegian as a second language, with links between linguistic data and personal data, created by the Norwegian Second Language Corpus project (ASK).

The corpus contains written texts produced by language learners from ten different language backgrounds: German, Dutch, English, Spanish, Russian, Polish, Bosnian-Croatian-Serbian, Albanian, Vietnamese and Somali. The size of the corpus and the flexible query system make it possible to develop a new methodological approach to the study of transfer when the L2 is Norwegian.

The selection of texts is primarily based on the native language of the test takers, and the typological distribution of these languages is taken into consideration. A corpus of Norwegian as a second language makes it possible to use quantitative methods in second language research, and provides a basis for pedagogical developments.

ACCESS: Four texts of the material have been uploaded and parsed as a small treebank in INESS. The full ASK corpus is available in searchable form via the corpus search engine Corpuscle (see links in metadata). One can enter the ASK corpus directly via the Corpuscle main page, or using a direct link to this specific corpus via the ASK project page.

Extended metadata

resource Common Info
- resource Type: corpus
- identification Info
  - resource Name: ASK Treebank – The Norwegian Second Language Corpus
  - description: "ASK Treebank – The Norwegian Second Language Corpus" gives sentence analyses of a subset of the texts available in the ASK corpus (https://hdl.handle.net/11495/DA23-DEB6-9EE5-2). ASK is an electronic, searchable text corpus of Norwegian as a second language, with links between linguistic data and personal data, created by the Norwegian Second Language Corpus project (ASK). The corpus contains written texts produced by language learners from ten different language backgrounds: German, Dutch, English, Spanish, Russian, Polish, Bosnian-Croatian-Serbian, Albanian, Vietnamese and Somali. The size of the corpus and the flexible query system make it possible to develop a new methodological approach to the study of transfer when the L2 is Norwegian. The selection of texts is primarily based on the native language of the test takers, and the typological distribution of these languages is taken into consideration. A corpus of Norwegian as a second language makes it possible to use quantitative methods in second language research, and provides a basis for pedagogical developments. ACCESS: Four texts of the material have been uploaded and parsed as a small treebank in INESS. The full ASK corpus is available in searchable form via the corpus search engine Corpuscle (see links in metadata). One can enter the ASK corpus directly via the Corpuscle main page, or using a direct link to this specific corpus via the ASK project page.
  - resource Short Name: ASK
  - url: http://clarino.uib.no/iness/landing-page?resource=nob-ask&view=short
  - url: http://clarino.uib.no/iness/landing-page?identifier=nob-ask
  - url: https://hdl.handle.net/11495/DA23-DEB6-9EE5-2
  - url: http://www.uib.no/fg/askeladden/
  - url: http://clarino.uib.no/ask/
  - P I D: hdl:11495/DA66-1C52-ACB5-1
  - identifier: nob-ask
- distribution Info
  - licence Info
    - user Category: Restricted
    - distribution Access Medium: accessibleThroughInterface
    - execution Location: http://hdl.handle.net/11495/DA66-1C52-ACB5-1
    - attribution Text: Tenfjord, Kari; Meurer, Paul; Hofland, Knut. The ASK Corpus — A Language Learner Corpus of Norwegian as a Second Language. Proceedings from 5th International Conference on Language Resources and Evaluation (LREC), Genova 2006. URL http://www.lrec- conf.org/proceedings/lrec2006/pdf/573_pdf
    - licence
      - licence Family: CLARIN
      - licence Name: CLARIN_RES-PRIV
      - licence Url: https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaRes?ID=1&PERM=1&PLAN=1&BY=1&PRIV=1&NORED=1
      - conditions Of Use: BY
      - conditions Of Use: ID
      - conditions Of Use: NORED
      - conditions Of Use: PERM
      - conditions Of Use: PLAN
      - conditions Of Use: PRIV
  - ipr Holder
    - actor Info
      - actor Type: organization
      - organization Info
        organization Name: University of Bergen
        organization Name: Universitetet i Bergen
        organization Short Name: UiB
        organization Short Name: UoB
        department Name: Department of Linguistic, Literary and Aesthetic Studies
        department Name: Institutt for lingvistiske, litterære og estetiske studier (LLE)
- contact
  - actor Info
    - actor Type: organization
    - organization Info
      - organization Name: CLARINO Bergen Centre
    - communication Info
      - email: clarin@uib.no
      - url: https://repo.clarino.uib.no/
      - url: https://clarin.b.uib.no
      - city: Bergen
      - country: Norway
  - actor Info
    - actor Type: person
    - person Info
      - surname: Rosén
      - given Name: Victoria
      - sex: female
      - position: Associate Professor
      - affiliation:
      - organization Info
        organization Name: University of Bergen
        organization Short Name: UiB
        organization Short Name: UoB
        department Name: Department of Linguistic, Literary and Aesthetic Studies
    - communication Info
      - email: iness@uib.no
- metadata Info
  - metadata Creation Date: 10.02.2016
  - metadata Last Date Updated: 10.06.2016
  - metadata Creator
    - actor Info
      - actor Type: person
      - person Info
        surname: Lyse
        given Name: Gunn Inger
        sex: female
        position: Researcher (Ph.D)
        affiliation:
        organization Info
        organization Name: University of Bergen
        organization Name: Universitetet i Bergen
        organization Short Name: UiB
        organization Short Name: UoB
        department Name: Department of Linguistic, Literary and Aesthetic Studies
      - communication Info
        email: clarin@uib.no
- resource Documentation Info
  - documentation Unstructured
    - role: documentation
    - document Unstructured: http://clarino.uib.no/ask/page?page-id=Publikasjoner
- resource Creation Info
  - resource Creator
    - actor Info
      - actor Type: person
      - role: Project leader
      - person Info
        surname: Tenfjord
        given Name: Kari
        sex: female
        position: Professor
        affiliation:
        organization Info
        organization Name: University of Bergen
        organization Name: Universitetet i Bergen
        organization Short Name: UiB
        organization Short Name: UoB
        department Name: Department of Linguistic, Literary and Aesthetic Studies
        department Name: Institutt for lingvistiske, litterære og estetiske studier (LLE)
  - funding Project:
  - project Info
    - project Name: The ASKeladden project (ASKeladden)
    - funding Type: nationalFunds
    - funder: The Research Council of Norway
    - funding Country: Norway
    - project Start Date: 2003

Download metadata

Download metadata

Go to resource page

Go to resource page http://hdl.handle.net/11495/DA66-1C52-ACB5-1

dc:type	corpus
dc:title	ASK Treebank – The Norwegian Second Language Corpus
dc:identifier	oai:clarino.uib.no:nob-ask
dc:description	"ASK Treebank – The Norwegian Second Language Corpus" gives sentence analyses of a subset of the texts available in the ASK corpus (https://hdl.handle.net/11495/DA23-DEB6-9EE5-2). ASK is an electronic, searchable text corpus of Norwegian as a second language, with links between linguistic data and personal data, created by the Norwegian Second Language Corpus project (ASK). The corpus contains written texts produced by language learners from ten different language backgrounds: German, Dutch, English, Spanish, Russian, Polish, Bosnian-Croatian-Serbian, Albanian, Vietnamese and Somali. The size of the corpus and the flexible query system make it possible to develop a new methodological approach to the study of transfer when the L2 is Norwegian. The selection of texts is primarily based on the native language of the test takers, and the typological distribution of these languages is taken into consideration. A corpus of Norwegian as a second language makes it possible to use quantitative methods in second language research, and provides a basis for pedagogical developments. ACCESS: Four texts of the material have been uploaded and parsed as a small treebank in INESS. The full ASK corpus is available in searchable form via the corpus search engine Corpuscle (see links in metadata). One can enter the ASK corpus directly via the Corpuscle main page, or using a direct link to this specific corpus via the ASK project page.
dc:publisher
dc:format	accessibleThroughInterface
dc:date
dc:date
dc:rights	Restricted
dc:rights	CLARIN
dc:rights	CLARIN_RES-PRIV
dc:rights	https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaRes?ID=1&PERM=1&PLAN=1&BY=1&PRIV=1&NORED=1
dc:creator	Kari Tenfjord
dc:lang	Norwegian bokmål
dc:lang	Norwegian

ASK Treebank – The Norwegian Second Language Corpus

Extended metadata

Resource Common Info

Corpus Info

Dublin Core (DC)

Download metadata

Go to resource page