Corpus of Doctor-Patient Consultations from Ahus
Extended metadata
- resource Common Info:
- resource Type: corpus
- identification Info:
- resource Name: Lege-pasient-korpuset fra Ahus
- resource Name: Corpus of Doctor-Patient Consultations from Ahus
- description: The Corpus of Doctor-Patient Consultations from Ahus is a unique corpus of transcribed dialogue between doctors and patients in different types of consultations at Akershus University Hospital (Ahus). The audio files are not available in the corpus due to their sensitive nature. Regional Ethics Committee for Medical and Health Research has approved permanent storage of the transcriptions for research purposes. Version 2 of the corpus (June 2015) has 220 consultations, well over 950 000 words.
- description: Lege-pasient-korpuset er et unikt korpus med transkripsjoner av samtaler mellom leger og pasienter i forskjellige typer konsultasjoner på Akershus universitetssykehus (Ahus). Fordi materialet er sensitivt, er ikke lydfilene tilgjengelige i korpuset. Transkripsjonene i korpuset bygger på videoopptak av samtaler mellom lege og pasient/pårørende ved Ahus i 2007 og 2008. Materialet ble samlet inn i forbindelse med en studie der formålet var å studere effekten av et kurs i kommunikasjon for sykehusleger. Legene representerer alle ikke-psykiatriske kliniske fagområder (indremedisin, kirurgi, ortopedi, gynekologi, pediatri, nevrologi, øre-nese-hals, anestesiologi) ved sykehuset. Det ble gjort inntil 8 opptak av hver lege med ulike pasienter, i alt 497 opptak. Versjon 2 av korpuset (juni 2015) inneholder 220 samtaler og drøye 950 000 ord. Regional etisk komité for medisinsk forskning har godkjent varig lagring av transkripsjonene for forskning.
- resource Short Name: Lege-pasient
- url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/prosjekter/lege-pasient/
- url: http://www.hf.uio.no/iln/english/about/organization/text-laboratory/projects/doctor-patient/index.html
- P I D: http://hdl.handle.net/11538/0000-000B-C020-7
- distribution Info:
- licence Info:
- user Category: Academic
- distribution Access Medium: accessibleThroughInterface
- execution Location: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/prosjekter/lege-pasient/
- execution Location: http://www.hf.uio.no/iln/english/about/organization/text-laboratory/projects/doctor-patient/index.html
- licence:
- licence Family: CLARIN
- licence Name: CLARIN_ACA-NC-LOC-PRIV-ND-*
- licence Url: https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&LOC=1&PRIV=1&NORED=1&ND=1
- conditions Of Use: *
- conditions Of Use: BY
- conditions Of Use: ID
- conditions Of Use: LOC
- conditions Of Use: NC
- conditions Of Use: ND
- conditions Of Use: NORED
- conditions Of Use: PRIV
- non Standard Conditions Of Use: The corpus contains transcripts of sensitive hospital consultations. In agreement with Regional Ethics Committee for Medical and Health Research and the Ipr holders of the material, the corpus is accessible only through Glossa, a search and post-processing tool developed by the Text Laboratory. The Ipr holders want the users of the corpus to send a message to them that the corpus is used, and in what kind of research. Where it is natural to draw medical or psychological expertise into the research, the Ipr holders should be asked whether they wish to participate, before eventually seeking expertise elsewhere. Contact: pal.gulbrandsen by medisin.uio.no
- licensor:
- actor Info:
- actor Type: organization
- organization Info:
- organization Name: University of Oslo
- organization Name: Universitetet i Oslo
- organization Short Name: UiO
- organization Short Name: UoO
- department Name: Institute of Clinical Medicine
- department Name: Institutt for klinisk medisin
- communication Info:
- email: pal.gulbrandsen@medisin.uio.no
- url: http://www.med.uio.no/klinmed/personer/vit/paalgul/index.html
- address: Akershus universitetssykehus
- zip Code: 1478
- city: LØRENSKOG
- country: Norway
- distribution Rights Holder
- actor Info:
- actor Type: organization
- organization Info:
- organization Name: University of Oslo
- organization Name: Universitetet i Oslo
- organization Short Name: UiO
- organization Short Name: UoO
- department Name: Department of Linguistics and Scandinavian Studies
- department Name: Institutt for lingvistiske og nordiske studier (ILN)
- communication Info:
- email: tekstlab-post@iln.uio.no
- url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
- address: Box 1102 Blindern
- zip Code: 0317
- city: OSLO
- country: Norway
- actor Info:
- actor Type: person
- person Info:
- surname: Gulbrandsen
- given Name: Pål
- sex: male
- affiliation:
- organization Info:
- organization Name: University of Oslo
- organization Name: Universitetet i Oslo
- organization Short Name: UoO
- organization Short Name: UiO
- department Name: Institute of Clinical Medicine
- department Name: Institutt for klinisk medisin
- actor Info:
- actor Type: organization
- organization Info:
- organization Name: The Text Laboratory
- organization Short Name: Textlab
- department Name: Department of Linguistics and Scandinavian Studies, University of Oslo
- actor Info:
- actor Type: person
- person Info:
- surname: Hagen
- given Name: Kristin
- corpus Info:
- corpus Type: Multimodal Corpus
- corpus Part Info:
- media Type: text
- corpus Text Info:
- text Format Info:
- mime Type: txt
- size Per Text Format:
- size Info:
- size: 958 830
- size Unit: words
- character Encoding Info:
- character Encoding: Latin1
- corpus Part Info:
- media Type: video
- corpus Video Info:
- video Content Info:
- type Of Video Content: Conversations between doctors and patients in different types of consultations
- video Format Info:
- mime Type: The videos are not available in the corpus due to the sensitiveness of the conversations
- corpus Part General Info:
- person Source Set Info:
- number Of Persons: 220
- age Of Persons: teenager
- age Of Persons: adult
- age Of Persons: elderly
- age Range Start: 1
- age Range End: 100
- sex Of Persons: mixed
- origin Of Persons: mixed
- dialect Accent Of Persons: Some of the patients and doctors speak Norwegian as a second language.
- linguality Info:
- linguality Type: monolingual
- language Info:
- language Id: No
- language Name: Norwegian
- language Info:
- language Id: Nb
- language Name: Norwegian Bokmål
- modality Info:
- modality Type: spokenLanguage
- modality Type Details: Orthographic transcription of 220 patients (some together with their next of kin) their doctors and other health personnel. There are many descriptions/stage directions of the consultant situation to make up for the missing videos.
- size Info:
- size: 958 830
- size Unit: words
- annotation Info:
- annotation Type: morphosyntacticAnnotation-posTagging
- annotated Elements: other
- segmentation Level: word
- tagset: POS tagset created for the statistical NoTa-tagger – based on the tagset of the Oslo Bergen Tagger.
- tagset Language Id: Nb
- tagset Language Name: Norwegian Bokmål
- theoretic Model: TreeTagger
- annotation Mode: automatic
- annotation Manual Structured:
- role: annotationManual
- document Info:
- document Type: article
- title: Tagging a Norwegian Speech Corpus
- author: Anders Nøklestad and Åshild Søfteland
- editor: Joakim Nivre,Heiki-Jaan Kaalep,Kadri Muischnek, Mare Koit
- year: 2007
- book Title: Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007
- pages: 245–248
- conference: Nodalida 2007
- document Language Name: English
- document Language Id: en
- annotation Manual Structured:
- role: annotationManual
- document Info:
- document Type: article
- title: Manuell morfologisk tagging av NoTa-materialet med støtte fra en statistisk tagger.
- author: Åshild Søfteland og Anders Nøklestad
- editor: Janne Bondi Johannessen og Kristin Hagen
- year: 2008
- publisher: Novus forlag
- book Title: Språk i Oslo. Ny forskning omkring talespråk
- pages: 226–234.
- I S B N: 978-82-7099-471-7
- document Language Name: Norwegian
- document Language Id: nb
- annotation Manual Structured:
- role: annotationManual
- document Info:
- document Type: manual
- title: NoTa-taggeren: TAGGEVEILEDNING
- author: Åshild Søfteland
- year: 2007
- url: http://www.tekstlab.uio.no/nota/oslo/Taggeveiledning2.pdf
- document Language Name: Norwegian bokmål
- document Language Id: nb
- annotation Info:
- annotation Type: speechAnnotation-orthographicTranscription
- annotation Manual Unstructured:
- role: annotationManual
- document Unstructured: Orthographic transcription,cf Bokmålsordboka (Wangensteen 2004)
- annotation Manual Structured:
- role: annotationManual
- document Info:
- document Type: manual
- title: Transkripsjonsveiledning for NoTa-Oslo
- author: Kristin Hagen
- year: 2008
- url: http://www.tekstlab.uio.no/nota/oslo/transkripsjon/NoTa-transkripsjonsveil22.pdf
- annotation Tool:
- target Resource Name U R I: Transcriber (http://trans.sourceforge.net/en/presentation.php )
- classification Info:
- genre Info:
- genre Type: speechGenre
- genre: informal
- unstandardised Genre: patient conversations
- time Coverage Info:
- time Coverage: 2007 – 2008
dc:type | corpus |
dc:title | Corpus of Doctor-Patient Consultations from Ahus |
dc:identifier | oai:tekstlab.uio.no:lege-pasient |
dc:description | The Corpus of Doctor-Patient Consultations from Ahus is a unique corpus of transcribed dialogue between doctors and patients in different types of consultations at Akershus University Hospital (Ahus). The audio files are not available in the corpus due to their sensitive nature. Regional Ethics Committee for Medical and Health Research has approved permanent storage of the transcriptions for research purposes. Version 2 of the corpus (June 2015) has 220 consultations, well over 950 000 words. |
dc:publisher | |
dc:format | accessibleThroughInterface |
dc:date | 2007-01-01 |
dc:date | 2015-06-01 |
dc:rights | Academic |
dc:rights | CLARIN |
dc:rights | CLARIN_ACA-NC-LOC-PRIV-ND-* |
dc:rights | https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&LOC=1&PRIV=1&NORED=1&ND=1 |
dc:lang | Norwegian |
dc:lang | Norwegian Bokmål |