OCR Models for Sámi Languages
Extended metadata
- resource Common Info
- resource Type: toolService
- identification Info
- resource Name: OCR-modeller for samiske språk
- resource Name: OCR Models for Sámi Languages
- description: Dette er en samling av modeller for OCR (optical character recognition) av samiske språk. Disse kan brukes til å gjenkjenne tekst i bilder av trykt tekst (scannede bøker, magasiner, o.l) på nordsamisk, sørsamisk, lulesamisk og inaresamisk. Mer detaljert informasjon om trening og evaluering av modellene kan du lese i artikkelen 'Comparative analysis of optical character recognition methods for Sámi texts from the National Library of Norway', se https://arxiv.org/abs/2501.07300. Samlingen består tre forskjellige typer modeller: Transkribus-modeller, Tesseract-modeller og TrOCR-modeller. Se dokumentasjonsfilen for mer informasjon.
- description: This is a collection of models for OCR (optical character recognition) of Sámi languages. These can be used to recognize text in images of printed text (scanned books, magazines, etc.) in North Sámi, South Sámi, Lule Sámi, and Inari Sámi. You can read more detailed information about the training and evaluation of the models in the article 'Comparative analysis of optical character recognition methods for Sámi texts from the National Library of Norway', see https://arxiv.org/abs/2501.07300. The collection consists of three different types of models: Transkribus models, Tesseract models, and TrOCR models. See the documentation file for more information.
- url: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-100/
- P I D: hdl:21.11146/100
- identifier: sbr-100
- distribution Info
- licence Info
- user Category: Public
- distribution Access Medium: downloadable
- download Location: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-100/
- attribution Text: 'Comparative analysis of optical character recognition methods for Sámi texts from the National Library of Norway', https://arxiv.org/abs/2501.07300
- licence
- licence Family: Creative Commons (CC)
- licence Name: Creative_Commons-BY (CC-BY)
- licence Url: https://creativecommons.org/licenses/by/4.0/
- conditions Of Use: BY
- licensor:
- actor Info
- actor Type: organization
- role: Licensor
- organization Info
- organization Name: Nasjonalbiblioteket
- organization Name: National Library of Norway
- organization Short Name: NB
- organization Short Name: NLN
- department Name: Språkbanken
- department Name: The Language Bank
- communication Info
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- licence Info
- contact
- actor Info
- actor Type: organization
- role: Contact
- organization Info
- organization Name: Nasjonalbiblioteket
- organization Name: National Library of Norway
- organization Short Name: NB
- organization Short Name: NLN
- department Name: Språkbanken
- department Name: The Language Bank
- communication Info
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- actor Info
- metadata Info
- metadata Creation Date: 22.01.2025
- metadata Language Name: Norwegian Bokmål
- metadata Language Name: English
- metadata Language Id: nb
- metadata Language Id: en
- metadata Last Date Updated: 22.01.2025
- metadata Creator
- actor Info
- actor Type: organization
- role: Metadata Creator
- organization Info
- organization Name: Nasjonalbiblioteket
- organization Name: National Library of Norway
- organization Short Name: NB
- organization Short Name: NLN
- department Name: Språkbanken
- department Name: The Language Bank
- communication Info
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- actor Info
- resource Creation Info
- creation Start Date: 01.08.2024
- creation End Date: 22.01.2025
- resource Creator
- actor Info
- actor Type: organization
- role: Resource Creator
- organization Info
- organization Name: Nasjonalbiblioteket
- organization Name: National Library of Norway
- organization Short Name: NB
- organization Short Name: NLN
- department Name: Språkbanken
- department Name: The Language Bank
- communication Info
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- actor Info
- tool Info
- description: OCR Models for Sámi Languages
- input Info
- media Type: image
- output Info
- media Type: text
- Service
- Name: OCR Models for Sámi Languages
- Service Description Location:
- Location: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-100/
- Operations:
- Operation
- Name: OCR
- Output:
- Parameter Group
- Name: text
- Parameters:
- Parameter
- Name: OCR
dc:type | toolService |
dc:title | OCR Models for Sámi Languages |
dc:identifier | oai:nb.no:sbr-100 |
dc:description | This is a collection of models for OCR (optical character recognition) of Sámi languages. These can be used to recognize text in images of printed text (scanned books, magazines, etc.) in North Sámi, South Sámi, Lule Sámi, and Inari Sámi. You can read more detailed information about the training and evaluation of the models in the article 'Comparative analysis of optical character recognition methods for Sámi texts from the National Library of Norway', see https://arxiv.org/abs/2501.07300. The collection consists of three different types of models: Transkribus models, Tesseract models, and TrOCR models. See the documentation file for more information. |
dc:publisher | |
dc:format | downloadable |
dc:date | 2024-08-01 |
dc:date | 2025-01-22 |
dc:rights | Public |
dc:rights | Creative Commons (CC) |
dc:rights | Creative_Commons-BY (CC-BY) |
dc:rights | https://creativecommons.org/licenses/by/4.0/ |
dc:creator | National Library of Norway |
dc:lang |