ONOMASTICA Pronunciation Lexicon

ONOMASTICA is a database containing original data from the Norwegian part of the ONOMASTICA project, a European research project aiming at producing pronunciation lexica of proper names for various European languages. The data include first names, family names, company names, street names, place names and foreign names.

The database contains a total of 556,499 transcribed names. The data was automatically transcribed, but partially checked manually by trained phoneticians. The material is transcribed using SAMPA. See the documentation files for details.

The database is published with permission from Telenor, and may be used freely and distributed without compensation. Telenor must be credited when the database is used or distributed.

Note that an updated and more user-frendly version of the database in csv format has been published by the Language Bank. Type “sbr-67” in the search bar to find the updated version.

The database is published with permission from Telenor, and may be used freely and distributed without compensation. Telenor must be credited when the database is used or distributed.

Note that an updated and more user-frendly version of the database in csv format has been published by the Language Bank. Type “sbr-67” in the search bar to find the updated version.

Extended metadata

resource Common Info:
resource Type: lexicalConceptualResource
identification Info:
resource Name: ONOMASTICA Pronunciation Lexicon
resource Name: ONOMASTICA uttaleleksikon
resource Name: ONOMASTICA uttaleleksikon
description: ONOMASTICA is a database containing original data from the Norwegian part of the ONOMASTICA project, a European research project aiming at producing pronunciation lexica of proper names for various European languages. The data include first names, family names, company names, street names, place names and foreign names. The database contains a total of 556,499 transcribed names. The data was automatically transcribed, but partially checked manually by trained phoneticians. The material is transcribed using SAMPA. See the documentation files for details. The database is published with permission from Telenor, and may be used freely and distributed without compensation. Telenor must be credited when the database is used or distributed. Note that an updated and more user-frendly version of the database in csv format has been published by the Language Bank. Type "sbr-67" in the search bar to find the updated version.
description: Databasen ONOMASTICA er eit uttaleleksikon med data frå den norske delen av ONOMASTICA-prosjektet, eit europeisk forskingsprosjekt som hadde som mål å lage uttaleleksika for egennamn for fleire europeiske språk. Databasen inneheld fonetiske transkripsjonar av fornamn, etternamn, namn på selskap, gatenamn, stadnamn og utanlandske namn. Totalt inneheld databasen 556.499 namn. Namna er automatisk transkriberte, men delar av materialet er manuelt sjekka av fonetikarar. Materialet er transkribert i SAMPA. Sjå dokumentasjonsfilene for fleire detaljar. Databasen vert publisert med løyve frå Telenor, og kan nyttast fritt og utan vederlag. Telenor skal krediterast når databasen vert brukt eller distribuert. Legg merkje til at Språkbanken har publisert ein oppdatert og meir brukarvenleg versjon av ONOMASTICA i csv-format. Denne versjonen finn du ved å skrive "sbr-67" i søkjefeltet.
description: Databasen ONOMASTICA er et uttaleleksikon med data fra den norske delen av ONOMASTICA-prosjektet, et europeisk forskningsprosjekt som hadde som mål å lage uttaleleksika for egennavn for flere europeiske språk. Databasen inneholder fonetiske transkripsjoner av fornavn, etternavn, navn på selskaper, gatenavn, stedsnavn og utenlandske navn. Totalt inneholder databasen 556.499 navn. Navnene er automatisk transkribert, men deler av materialet er manuelt sjekket av fonetikere. Materialet er transkribert i SAMPA. Se dokumentasjonsfilene for flere detaljer. Databasen publiseres med tillatelse fra Telenor, og kan benyttes fritt og uten vederlag. Telenor skal krediteres ved bruk og når databasen blir videredistribuert. Legg merke til at Språkbanken har publisert en oppdatert og mer brukervennlig versjon av ONOMASTICA i csv-format. Denne versjonen finner du ved å skrive "sbr-67" i søkefeltet. samisk
resource Short Name: ONOMASTICA
resource Short Name: ONOMASTICA
url: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-38/
P I D: hdl:21.11146/38
identifier: sbr-38
distribution Info:
licence Info:
user Category: Public
distribution Access Medium: downloadable
download Location: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-38/
licence:
licence Family: Creative Commons (CC)
licence Name: Creative_Commons-BY (CC-BY)
licence Url: https://creativecommons.org/licenses/by/4.0/
conditions Of Use: BY
licensor:
actor Info:
actor Type: organization
role: Licensor
organization Info:
organization Name: Telenor
organization Name: Telenor
distribution Rights Holder
- actor Info:
- actor Type: organization
- role: Distribution Rights Holder
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- communication Info:
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- address: P.O. Box 2674 Solli
- zip Code: 0203
- city: Oslo
- region: Oslo
- country: Norway
ipr Holder
- actor Info:
- actor Type: organization
- role: IPR Holder
- organization Info:
- organization Name: Telenor
- organization Name: Telenor
contact
- actor Info:
- actor Type: organization
- role: Contact
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- communication Info:
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- address: P.O. Box 2674 Solli
- zip Code: 0203
- city: Oslo
- region: Oslo
- country: Norway
metadata Info:
metadata Creation Date: 29.01.2016
metadata Language Name: English
metadata Language Id: en
metadata Last Date Updated: 17.04.2024
metadata Creator
- actor Info:
- actor Type: person
- role: Metadata Creator
- person Info:
- surname: Lindstad
- given Name: Arne Martinus
- affiliation:
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- communication Info:
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- address: P.O. Box 2674 Solli
- zip Code: 0203
- city: Oslo
- region: Oslo
- country: Norway
version Info:
version: 1999
last Date Updated: 31.12.1999
validation Info:
validated: true
validation Type: content
validation Mode: manual
validation Mode Details: Manual validation of automatic phonetic transcriptions by phoneticians.
validation Extent: partial
validator:
actor Info:
actor Type: organization
role: Resource Validator
organization Info:
organization Name: Telenor
organization Name: Telenor
resource Documentation Info:
documentation Unstructured:
role: documentation
document Unstructured: https://www.nb.no/sbfil/dok/onomastica_readme.txt
documentation Unstructured:
role: documentation
document Unstructured: https://www.nb.no/sbfil/dok/onomastica_lesmeg.txt
documentation Unstructured:
role: documentation
document Unstructured: https://www.nb.no/sbfil/dok/onomastica.pdf
resource Creation Info:
creation End Date: 31.12.1999
resource Creator
- actor Info:
- actor Type: organization
- role: Resource Creator
- organization Info:
- organization Name: Telenor
- organization Name: Telenor

Download resources

Download metadata

Download metadata https://www.nb.no/sprakbanken/oai?verb=GetRecord&identifier=oai:nb.no:sbr-38&metadataPrefix=cmdi

dc:type	lexicalConceptualResource
dc:title	ONOMASTICA Pronunciation Lexicon
dc:identifier	oai:nb.no:sbr-38
dc:description	ONOMASTICA is a database containing original data from the Norwegian part of the ONOMASTICA project, a European research project aiming at producing pronunciation lexica of proper names for various European languages. The data include first names, family names, company names, street names, place names and foreign names. The database contains a total of 556,499 transcribed names. The data was automatically transcribed, but partially checked manually by trained phoneticians. The material is transcribed using SAMPA. See the documentation files for details. The database is published with permission from Telenor, and may be used freely and distributed without compensation. Telenor must be credited when the database is used or distributed. Note that an updated and more user-frendly version of the database in csv format has been published by the Language Bank. Type "sbr-67" in the search bar to find the updated version.
dc:publisher
dc:format	downloadable
dc:date
dc:date	1999-12-31
dc:rights	Public
dc:rights	Creative Commons (CC)
dc:rights	Creative_Commons-BY (CC-BY)
dc:rights	https://creativecommons.org/licenses/by/4.0/
dc:creator	Telenor
dc:lang	Norwegian

ONOMASTICA Pronunciation Lexicon

Extended metadata

Resource Common Info

Lexical Conceptual Resource Info

Dublin Core (DC)

Download resources

Download metadata