Omsetjingsminne frå Nynorsk pressekontor 2021
Utvidet metadata
- resource Common Info:
- resource Type: corpus
- identification Info:
- resource Name: Translation memoriy from Nynorsk News Press Agency 2021
- resource Name: Omsetjingsminne frå Nynorsk pressekontor 2021
- description: This translation memory contains translations of news text from Norwegian Bokmål to Norwegian Nynorsk. The texts are produced by the Norwegian News Agency NTB (https://www.ntb.no/about-ntb), and translated by Nynorsk News Press Agency (https://www.npk.no/). The texts have first| been translated automatically (using the so called Nynorsk Robot), then the translations have been proofread and corrected manually before publication. The material in this corpus was produced in the period from February 2011 to September 2021. For copyright reasons, the translation units (TUs, in most cases corresponding to sentence pairs) have been randomized. The material is divided into three parts, and there are two different file formats, tmx and tsv. In total, the material consists of approximately 700,000 translation pairs/sentence pairs. See the documentation file for more information.
- description: Dette korpuset inneheld omsetjingar frå bokmål til nynorsk av nyhendetekst frå Norsk telegrambyrå (NTB). Tekstene er omsette av Nynorsk pressekontor (NPK), som nyttar den såkalla Nynorskroboten til automatisk omsetjing av tekster frå bokmål til nynorsk, og korrigerer feila roboten gjer manuelt før publisering. Les meir om dette hjå Nynorsk pressekontor (https://www.npk.no/nynorskroboten). Materialet skriv seg frå perioden februar 2011 til september 2021. Av opphavsrettslege grunnar er omsetjingseiningane randomiserte. Ei omsetjingseining svarar stort sett til eit setningspar. Materialer delt opp i tre delar, og det finst to ulike filformat, tmx og tsv. Totalt inneheld korpuset omlag 700.000 omsetjingspar/setningspar. Sjå dokumentasjonsfila for meir informasjon.
- url: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-47/
- P I D: hdl:21.11146/47
- identifier: sbr-47
- distribution Info:
- licence Info:
- user Category: Public
- distribution Access Medium: downloadable
- download Location: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-47/
- licence:
- licence Family: Creative Commons (CC)
- licence Name: Creative_Commons-ZERO (CC-ZERO)
- licence Url: https://creativecommons.org/publicdomain/zero/1.0/
- licensor:
- actor Info:
- actor Type: organization
- role: Licensor
- organization Info:
- organization Name: Nynorsk News Press Agency
- organization Name: Nynorsk pressekontor
- organization Short Name: NPK
- actor Info:
- actor Type: organization
- role: Licensor
- organization Info:
- organization Name: The Norwegian News Agency
- organization Name: Norsk Telegrambyrå
- organization Short Name: NTB
- organization Short Name: NTB
- distribution Rights Holder
- actor Info:
- actor Type: organization
- role: Distribution Rights Holder
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- communication Info:
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- address: P.O. Box 2674 Solli
- zip Code: 0203
- city: Oslo
- region: Oslo
- country: Norway
- actor Info:
- actor Type: organization
- role: IPR Holder
- organization Info:
- organization Name: Nynorsk News Press Agency
- organization Name: Nynorsk pressekontor
- organization Short Name: NPK
- actor Info:
- actor Type: organization
- role: Contact
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- actor Info:
- actor Type: person
- role: Metadata Creator
- person Info:
- surname: Lindstad
- given Name: Arne Martinus
- affiliation:
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- actor Info:
- actor Type: organization
- role: Resource Creator
- organization Info:
- organization Name: The Norwegian News Agency
- organization Name: Norsk Telegrambyrå
- organization Short Name: NTB
- organization Short Name: NTB
- corpus Info:
- corpus Type: Multilingual Corpus
- corpus Part Info:
- media Type: text
- corpus Text Info:
- text Format Info:
- mime Type: application/x-tmx+xml
- size Per Text Format:
- size Info:
- size: 339000
- size Unit: units
- size Info:
- size: 25000
- size Unit: texts
- size Info:
- size: 101,6
- size Unit: mb
- text Format Info:
- mime Type: text/tab-separated-values
- size Per Text Format:
- size Info:
- size: 359595
- size Unit: units
- size Info:
- size: 2
- size Unit: files
- size Info:
- size: 90,3
- size Unit: mb
- character Encoding Info:
- character Encoding: UTF-8
- corpus Part General Info:
- linguality Info:
- linguality Type: multilingual
- multilinguality Type: parallel
- multilinguality Type Details: Translation memories
- language Info:
- language Id: nb
- language Name: Norwegian Bokmål
- size Per Language:
- size Info:
- size: 698595
- size Unit: units
- language Variety Info:
- language Variety Type: other
- language Variety Name: news text
- language Info:
- language Id: nn
- language Name: Norwegian Nynorsk
- size Per Language:
- size Info:
- size: 698595
- size Unit: units
- language Variety Info:
- language Variety Type: other
- language Variety Name: news text
- modality Info:
- modality Type: writtenLanguage
- modality Type Details: news text
- size Info:
- size: 698595
- size Unit: units
- size Info:
- size: 3
- size Unit: files
- size Info:
- size: 191,9
- size Unit: mb
- annotation Info:
- annotation Type: alignment
- segmentation Level: sentence
- annotation Format: tmx/xml, tsv
- annotation Mode: automatic
- time Coverage Info:
- time Coverage: 2011-2021
dc:type | corpus |
dc:title | Omsetjingsminne frå Nynorsk pressekontor 2021 |
dc:identifier | oai:nb.no:sbr-47 |
dc:description | Dette korpuset inneheld omsetjingar frå bokmål til nynorsk av nyhendetekst frå Norsk telegrambyrå (NTB). Tekstene er omsette av Nynorsk pressekontor (NPK), som nyttar den såkalla Nynorskroboten til automatisk omsetjing av tekster frå bokmål til nynorsk, og korrigerer feila roboten gjer manuelt før publisering. Les meir om dette hjå Nynorsk pressekontor (https://www.npk.no/nynorskroboten). Materialet skriv seg frå perioden februar 2011 til september 2021. Av opphavsrettslege grunnar er omsetjingseiningane randomiserte. Ei omsetjingseining svarar stort sett til eit setningspar. Materialer delt opp i tre delar, og det finst to ulike filformat, tmx og tsv. Totalt inneheld korpuset omlag 700.000 omsetjingspar/setningspar. Sjå dokumentasjonsfila for meir informasjon. |
dc:publisher | |
dc:format | downloadable |
dc:date | 2019-02-26 |
dc:date | 2021-11-02 |
dc:rights | Public |
dc:rights | Creative Commons (CC) |
dc:rights | Creative_Commons-ZERO (CC-ZERO) |
dc:rights | https://creativecommons.org/publicdomain/zero/1.0/ |
dc:creator | The Norwegian News Agency |
dc:creator | Nynorsk News Press Agency |
dc:creator | Vitec MV |
dc:lang | bokmål |
dc:lang | nynorsk |
Last ned ressurser
-
2011_2019_tm_npk_ntb_vitecmv.zip
-
2011_2019_tm_npk_ntb_vitecmv.pdf
-
2011_2019_tm_npk_ntb.tar.gz
-
2020_2021_tm_npk_ntb.zip
-
2019_tm_npk_ntb.zip
-
2011-2021_tm_npk.pdf