TAUS – downloadable transcriptions

TAUS (The spoken language investigation in Oslo) v.3 is a speech corpus with 86 speakers and 387 551 tokens. The downloadable version of the corpus contains the transcriptions, approx. 387 500 tokens, all of them orthographically transcribed. Some of the interviews are also transcribed phonetically.

The TAUS material is based on informal interviews with people from Oslo, conducted in 1971-73. The informants come mainly from two eastern districts (Vålerenga and Kampen) and one western district (Frogner), and have a social background that can be considered representative with respect to education, occupation and the place where they grew up. The informants fall into three age groups: youth (15 – 17 years), young adults (20 – 30) and adults (34 – 75).

The interviews cover experiences and descriptions from childhood and adolescence. They were conducted at home in a relaxed, informal tone, so the linguistic style can be described as informal vernacular.

In 2006 – 2007 the TAUS tapes from the A and B series were digitized, and all the interviews were transcribed orthographically and linked to the digital audio files. The transcriptions are now searchable via the Glossa search interface.

In 2014 – 2019 the tapes from the B series were digitized and transcribed as part of the LIA project.

In January 2020 TAUS v.3 was published with all available material from the A, B and C series.

Download resources

Extended metadata

Go to resource page http://www.tekstlab.uio.no/nota/taus/index.html

dc:type	corpus
dc:title	TAUS – downloadable transcriptions
dc:identifier	oai:tekstlab.uio.no:taus-transcriptions
dc:description	TAUS (The spoken language investigation in Oslo) v.3 is a speech corpus with 86 speakers and 387 551 tokens. The downloadable version of the corpus contains the transcriptions, approx. 387 500 tokens, all of them orthographically transcribed. Some of the interviews are also transcribed phonetically. The TAUS material is based on informal interviews with people from Oslo, conducted in 1971-73. The informants come mainly from two eastern districts (Vålerenga and Kampen) and one western district (Frogner), and have a social background that can be considered representative with respect to education, occupation and the place where they grew up. The informants fall into three age groups: youth (15 – 17 years), young adults (20 – 30) and adults (34 – 75). The interviews cover experiences and descriptions from childhood and adolescence. They were conducted at home in a relaxed, informal tone, so the linguistic style can be described as informal vernacular. In 2006 – 2007 the TAUS tapes from the A and B series were digitized, and all the interviews were transcribed orthographically and linked to the digital audio files. The transcriptions are now searchable via the Glossa search interface. In 2014 – 2019 the tapes from the B series were digitized and transcribed as part of the LIA project. In January 2020 TAUS v.3 was published with all available material from the A, B and C series.
dc:publisher
dc:format	downloadable
dc:date	1970-01-01
dc:date	2020-01-15
dc:rights	Public
dc:rights	Creative Commons (CC)
dc:rights	Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
dc:rights	http://creativecommons.org/licenses/by-nc-sa/4.0/
dc:creator	Prosjektet Talemålsundersøkelsen i Oslo (1971-1976)
dc:creator	The Text Laboratory
dc:lang	Norwegian
dc:lang	Norwegian Bokmål
