Corpus of Doctor-Patient Consultations from Ahus
Extended metadata
- resource Common Info
- resource Type: corpus
- identification Info
- resource Name: Lege-pasient-korpuset fra Ahus
- resource Name: Corpus of Doctor-Patient Consultations from Ahus
- description: The Corpus of Doctor-Patient Consultations from Ahus is a unique corpus of transcribed dialogue between doctors and patients in different types of consultations at Akershus University Hospital (Ahus). The audio files are not available in the corpus due to their sensitive nature. Regional Ethics Committee for Medical and Health Research has approved permanent storage of the transcriptions for research purposes. Version 2 of the corpus (June 2015) has 220 consultations, well over 950 000 words.
- description: Lege-pasient-korpuset er et unikt korpus med transkripsjoner av samtaler mellom leger og pasienter i forskjellige typer konsultasjoner på Akershus universitetssykehus (Ahus). Fordi materialet er sensitivt, er ikke lydfilene tilgjengelige i korpuset. Transkripsjonene i korpuset bygger på videoopptak av samtaler mellom lege og pasient/pårørende ved Ahus i 2007 og 2008. Materialet ble samlet inn i forbindelse med en studie der formålet var å studere effekten av et kurs i kommunikasjon for sykehusleger. Legene representerer alle ikke-psykiatriske kliniske fagområder (indremedisin, kirurgi, ortopedi, gynekologi, pediatri, nevrologi, øre-nese-hals, anestesiologi) ved sykehuset. Det ble gjort inntil 8 opptak av hver lege med ulike pasienter, i alt 497 opptak. Versjon 2 av korpuset (juni 2015) inneholder 220 samtaler og drøye 950 000 ord. Regional etisk komité for medisinsk forskning har godkjent varig lagring av transkripsjonene for forskning.
- resource Short Name: Lege-pasient
- url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/prosjekter/lege-pasient/
- url: http://www.hf.uio.no/iln/english/about/organization/text-laboratory/projects/doctor-patient/index.html
- P I D: http://hdl.handle.net/11538/0000-000B-C020-7
- distribution Info
- licence Info
- user Category: Academic
- distribution Access Medium: accessibleThroughInterface
- execution Location: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/prosjekter/lege-pasient/
- execution Location: http://www.hf.uio.no/iln/english/about/organization/text-laboratory/projects/doctor-patient/index.html
- licence
- licence Family: CLARIN
- licence Name: CLARIN_ACA-NC-LOC-PRIV-ND-*
- licence Url: https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&LOC=1&PRIV=1&NORED=1&ND=1
- conditions Of Use: *
- conditions Of Use: BY
- conditions Of Use: ID
- conditions Of Use: LOC
- conditions Of Use: NC
- conditions Of Use: ND
- conditions Of Use: NORED
- conditions Of Use: PRIV
- non Standard Conditions Of Use: The corpus contains transcripts of sensitive hospital consultations. In agreement with Regional Ethics Committee for Medical and Health Research and the Ipr holders of the material, the corpus is accessible only through Glossa, a search and post-processing tool developed by the Text Laboratory. The Ipr holders want the users of the corpus to send a message to them that the corpus is used, and in what kind of research. Where it is natural to draw medical or psychological expertise into the research, the Ipr holders should be asked whether they wish to participate, before eventually seeking expertise elsewhere. Contact: pal.gulbrandsen by medisin.uio.no
- licensor:
- actor Info
- actor Type: organization
- organization Info
- organization Name: University of Oslo
- organization Name: Universitetet i Oslo
- organization Short Name: UiO
- organization Short Name: UoO
- department Name: Institute of Clinical Medicine
- department Name: Institutt for klinisk medisin
- communication Info
- email: pal.gulbrandsen@medisin.uio.no
- url: http://www.med.uio.no/klinmed/personer/vit/paalgul/index.html
- address: Akershus universitetssykehus
- zip Code: 1478
- city: LØRENSKOG
- country: Norway
- distribution Rights Holder
- actor Info
- actor Type: organization
- organization Info
- organization Name: University of Oslo
- organization Name: Universitetet i Oslo
- organization Short Name: UiO
- organization Short Name: UoO
- department Name: Department of Linguistics and Scandinavian Studies
- department Name: Institutt for lingvistiske og nordiske studier (ILN)
- communication Info
- email: tekstlab-post@iln.uio.no
- url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
- address: Box 1102 Blindern
- zip Code: 0317
- city: OSLO
- country: Norway
- actor Info
- licence Info
- ipr Holder
- actor Info
- actor Type: person
- person Info
- surname: Gulbrandsen
- given Name: Pål
- sex: male
- affiliation:
- organization Info
- organization Name: University of Oslo
- organization Name: Universitetet i Oslo
- organization Short Name: UoO
- organization Short Name: UiO
- department Name: Institute of Clinical Medicine
- department Name: Institutt for klinisk medisin
- communication Info
- email: pal.gulbrandsen@medisin.uio.no
- url: http://www.med.uio.no/klinmed/personer/vit/paalgul/index.html
- address: Akershus universitetssykehus
- zip Code: 1478
- city: LØRENSKOG
- country: Norway
- actor Info
- actor Info
- actor Type: person
- person Info
- surname: Finset
- given Name: Arnstein
- sex: male
- affiliation:
- organization Info
- organization Name: University of Oslo
- organization Name: Universitetet i Oslo
- organization Short Name: UoO
- organization Short Name: UiO
- department Name: Institute of Basic Medical Sciences
- department Name: Institutt for medisinske basalfag
- communication Info
- email: arnstein.finset @medisin.uio.no
- url: http://www.med.uio.no/imb/english/people/aca/arnsten/index.html
- address: Postboks 1111 Blindern
- zip Code: 0317
- city: Oslo
- actor Info
- actor Type: person
- person Info
- surname: Jensen
- given Name: Bård Fossli
- sex: male
- actor Info
- actor Type: organization
- organization Info
- organization Name: The Text Laboratory
- organization Short Name: Textlab
- department Name: Department of Linguistics and Scandinavian Studies, University of Oslo
- communication Info
- email: tekstlab-post@iln.uio.no
- url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
- address: Box 1102 Blindern
- zip Code: 0317
- city: OSLO
- country: Norway
- actor Info
- actor Type: person
- person Info
- surname: Gulbrandsen
- given Name: Pål
- sex: male
- affiliation:
- organization Info
- organization Name: University of Oslo
- organization Name: Universitetet i Oslo
- organization Short Name: UoO
- organization Short Name: UiO
- department Name: Institute of Clinical Medicine
- department Name: Institutt for klinisk medisin
- communication Info
- email: pal.gulbrandsen@medisin.uio.no
- url: http://www.med.uio.no/klinmed/personer/vit/paalgul/index.html
- address: Akershus universitetssykehus
- zip Code: 1478
- city: LØRENSKOG
- country: Norway
- metadata Creation Date: 06.04.2017
- metadata Last Date Updated: 05.06.2018
- metadata Creator
- actor Info
- actor Type: person
- person Info
- surname: Hagen
- given Name: Kristin
- organization Info
- organization Name: The Text Laboratory
- organization Short Name: Textlab
- department Name: Department of Linguistics and Scandinavian Studies, University of Oslo
- communication Info
- email: kristin.hagen@iln.uio.no
- url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
- address: Box 1102 Blindern
- zip Code: 0317
- city: OSLO
- country: Norway
- actor Info
- version: Version 2
- last Date Updated: 01.06.2015
- validated: true
- validation Type: content
- validation Mode: manual
- validation Mode Details: The transcriptions are proof read against the audio files.
- validation Extent: full
- validator:
- actor Info
- actor Type: organization
- organization Info
- organization Name: The Text Laboratory
- organization Short Name: Textlab
- department Name: Department of Linguistics and Scandinavian Studies, University of Oslo
- communication Info
- email: tekstlab-post@iln.uio.no
- url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
- address: Box 1102 Blindern
- zip Code: 0317
- city: OSLO
- country: Norway
- documentation Unstructured
- role: documentation
- document Unstructured: In Norwegian only: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/prosjekter/lege-pasient/lege-pasient-transkripsjonsveil7.pdf
- creation Start Date: 01.01.2007
- creation End Date: 01.06.2015
- corpus Info
- corpus Type: Multimodal Corpus
- corpus Part Info
- media Type: text
- corpus Text Info
- text Format Info
- mime Type: txt
- size Per Text Format
- size Info
- size: 958 830
- size Unit: words
- size Info
- character Encoding Info
- character Encoding: Latin1
- text Format Info
- corpus Part Info
- media Type: video
- corpus Video Info
- video Content Info
- type Of Video Content: Conversations between doctors and patients in different types of consultations
- video Format Info
- mime Type: The videos are not available in the corpus due to the sensitiveness of the conversations
- video Content Info
- corpus Part General Info
- person Source Set Info
- number Of Persons: 220
- age Of Persons: teenager
- age Of Persons: adult
- age Of Persons: elderly
- age Range Start: 1
- age Range End: 100
- sex Of Persons: mixed
- origin Of Persons: mixed
- dialect Accent Of Persons: Some of the patients and doctors speak Norwegian as a second language.
- linguality Info
- linguality Type: monolingual
- language Info
- language Id: No
- language Name: Norwegian
- language Info
- language Id: Nb
- language Name: Norwegian Bokmål
- modality Info
- modality Type: spokenLanguage
- modality Type Details: Orthographic transcription of 220 patients (some together with their next of kin) their doctors and other health personnel. There are many descriptions/stage directions of the consultant situation to make up for the missing videos.
- size Info
- size: 958 830
- size Unit: words
- annotation Info
- annotation Type: morphosyntacticAnnotation-posTagging
- annotated Elements: other
- segmentation Level: word
- tagset: POS tagset created for the statistical NoTa-tagger – based on the tagset of the Oslo Bergen Tagger.
- tagset Language Id: Nb
- tagset Language Name: Norwegian Bokmål
- theoretic Model: TreeTagger
- annotation Mode: automatic
- annotation Manual Structured
- role: annotationManual
- document Info
- document Type: article
- title: Tagging a Norwegian Speech Corpus
- author: Anders Nøklestad and Åshild Søfteland
- editor: Joakim Nivre,Heiki-Jaan Kaalep,Kadri Muischnek, Mare Koit
- year: 2007
- book Title: Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007
- pages: 245–248
- conference: Nodalida 2007
- document Language Name: English
- document Language Id: en
- annotation Manual Structured
- role: annotationManual
- document Info
- document Type: article
- title: Manuell morfologisk tagging av NoTa-materialet med støtte fra en statistisk tagger.
- author: Åshild Søfteland og Anders Nøklestad
- editor: Janne Bondi Johannessen og Kristin Hagen
- year: 2008
- publisher: Novus forlag
- book Title: Språk i Oslo. Ny forskning omkring talespråk
- pages: 226–234.
- I S B N: 978-82-7099-471-7
- document Language Name: Norwegian
- document Language Id: nb
- annotation Manual Structured
- role: annotationManual
- document Info
- document Type: manual
- title: NoTa-taggeren: TAGGEVEILEDNING
- author: Åshild Søfteland
- year: 2007
- url: http://www.tekstlab.uio.no/nota/oslo/Taggeveiledning2.pdf
- document Language Name: Norwegian bokmål
- document Language Id: nb
- annotation Info
- annotation Type: speechAnnotation-orthographicTranscription
- annotation Manual Unstructured
- role: annotationManual
- document Unstructured: Orthographic transcription,cf Bokmålsordboka (Wangensteen 2004)
- annotation Manual Structured
- role: annotationManual
- document Info
- document Type: manual
- title: Transkripsjonsveiledning for NoTa-Oslo
- author: Kristin Hagen
- year: 2008
- url: http://www.tekstlab.uio.no/nota/oslo/transkripsjon/NoTa-transkripsjonsveil22.pdf
- annotation Tool
- target Resource Name U R I: Transcriber (http://trans.sourceforge.net/en/presentation.php )
- classification Info
- genre Info
- genre Type: speechGenre
- genre: informal
- unstandardised Genre: patient conversations
- genre Info
- time Coverage Info
- time Coverage: 2007 – 2008
- person Source Set Info
dc:type | corpus |
dc:title | Corpus of Doctor-Patient Consultations from Ahus |
dc:identifier | oai:tekstlab.uio.no:lege-pasient |
dc:description | The Corpus of Doctor-Patient Consultations from Ahus is a unique corpus of transcribed dialogue between doctors and patients in different types of consultations at Akershus University Hospital (Ahus). The audio files are not available in the corpus due to their sensitive nature. Regional Ethics Committee for Medical and Health Research has approved permanent storage of the transcriptions for research purposes. Version 2 of the corpus (June 2015) has 220 consultations, well over 950 000 words. |
dc:publisher | |
dc:format | accessibleThroughInterface |
dc:date | 2007-01-01 |
dc:date | 2015-06-01 |
dc:rights | Academic |
dc:rights | CLARIN |
dc:rights | CLARIN_ACA-NC-LOC-PRIV-ND-* |
dc:rights | https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&LOC=1&PRIV=1&NORED=1&ND=1 |
dc:lang | Norwegian |
dc:lang | Norwegian Bokmål |