Corpus with Book Reviews from Bokelskere.no
Extended metadata
- resource Common Info:
- resource Type: corpus
- identification Info:
- resource Name: Corpus with Book Reviews from Bokelskere.no
- resource Name: Korpus med bokomtalar frå Bokelskere.no
- description: This corpus is a dump of user-generated book reviews and discussions from Bokelskere.no (meaning "book lovers"), a web community where users review and discuss new and old literature, both fiction and non-fiction. The corpus is structured as a JSON array in which each object corresponds to a review, or a comment on a review, on Bokelskere.no. Each object has the following fields: – "post_id": unique identifier for the review – "date": date when the review was posted – "user_id": unique identifier for the user – "isbn13": ISBN of the reviewed book – "post_title": title of the review – "text": the review text – "score": evaluation (from 1-6, where 6 is the best) – "main_title": title of the reviewed book – "author": author of the reviewed book – "parent_id": identifier of the review that has been commented upon. The corpus contains approximately 219,000 posts/objects and 1.5 million word tokens (in the "text" field).
- description: This corpus contains a dump of user-generated book reviews and discussions from Bokelskere.no, a website where users write reviews of and discuss new and older books, both fiction and non-fiction. The corpus is in JSON format, where each object corresponds to a review, or a comment on a review, on Bokelskere.no. Each object contains the following fields: – "post_id": unique identifier for the review – "date": date when the review was posted – "user_id": unique identifier for the user – "isbn13": ISBN of the book in question – "post_title": title of the review – "text": the review – "score": rating (die roll, 1-6) – "main_title": title of the book – "author": author of the book – "parent_id": identifier of a review that has been commented on. The corpus contains approximately 219,000 posts/objects and 1.5 million words (in the "text" field).
- url: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-53/
- PID: hdl:21.11146/53
- identifier: sbr-53
- distribution Info:
- licence Info:
- user Category: Public
- distribution Access Medium: downloadable
- download Location: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-53/
- licence:
- licence Family: Creative Commons (CC)
- licence Name: Creative_Commons-ZERO (CC-ZERO)
- licence Url: https://creativecommons.org/publicdomain/zero/1.0/
- licensor:
- actor Info:
- actor Type: organization
- role: Licensor
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- communication Info:
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- address: P.O. Box 2674 Solli
- zip Code: 0203
- city: Oslo
- region: Oslo
- country: Norway
- distribution Rights Holder:
- actor Info:
- actor Type: organization
- role: Distribution Rights Holder
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- communication Info:
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- address: P.O. Box 2674 Solli
- zip Code: 0203
- city: Oslo
- region: Oslo
- country: Norway
- actor Info:
- actor Type: organization
- role: Contact
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- actor Info:
- actor Type: person
- role: Metadata Creator
- person Info:
- surname: Lindstad
- given Name: Arne Martinus
- affiliation:
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- actor Info:
- actor Type: person
- role: Resource Creator
- person Info:
- surname: Kåsen
- given Name: Andre
- affiliation:
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- corpus Info:
- corpus Type: Written Corpus
- corpus Part Info:
- media Type: text
- corpus Text Info:
- text Format Info:
- mime Type: application/json
- size Per Text Format:
- size Info:
- size: 218937
- size Unit: entries
- size Info:
- size: 136
- size Unit: mb
- character Encoding Info:
- character Encoding: UTF-8
- corpus Part General Info:
- linguality Info:
- linguality Type: monolingual
- language Info:
- language Id: nb
- language Name: Norwegian Bokmål
- language Info:
- language Id: nn
- language Name: Norwegian Nynorsk
- modality Info:
- modality Type: writtenLanguage
- size Info:
- size: 218937
- size Unit: entries
- size Info:
- size: 136
- size Unit: mb
- annotation Info:
- annotation Type: other
- segmentation Level: paragraph
- annotation Mode: automatic
- time Coverage Info:
- time Coverage: 2009-2019
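The description above says the corpus is a JSON array of objects, with a "parent_id" field that links a comment to the review it answers and a "score" field ranging from 1 to 6. As a rough sketch of how such a file might be read, the Python below parses a tiny invented sample, separates reviews from comments, and tallies scores. Several things here are assumptions rather than documented facts: that a top-level review has a null, empty, or missing "parent_id", that "score" may be absent or stored as a string, and all sample values (IDs, ISBN, names, texts).

```python
import json
from collections import Counter

# Invented two-object sample that mimics the documented field names.
# None of these values come from the real corpus.
SAMPLE = """[
  {"post_id": 1, "date": "2015-03-01", "user_id": 10, "isbn13": "9788200000000",
   "post_title": "Fin bok", "text": "En veldig fin bok.", "score": 5,
   "main_title": "Eksempel", "author": "Ola Nordmann", "parent_id": null},
  {"post_id": 2, "date": "2015-03-02", "user_id": 11, "isbn13": "9788200000000",
   "post_title": "", "text": "Helt enig!", "score": null,
   "main_title": "Eksempel", "author": "Ola Nordmann", "parent_id": 1}
]"""


def split_posts(posts):
    """Split posts into (reviews, comments).

    Assumption: a comment carries a non-empty "parent_id"; anything else
    is treated as a top-level review.
    """
    reviews = [p for p in posts if not p.get("parent_id")]
    comments = [p for p in posts if p.get("parent_id")]
    return reviews, comments


def score_distribution(posts):
    """Count posts per score, skipping missing or out-of-range values."""
    counts = Counter()
    for post in posts:
        try:
            score = int(post.get("score"))
        except (TypeError, ValueError):
            continue  # score missing or not numeric
        if 1 <= score <= 6:
            counts[score] += 1
    return counts


posts = json.loads(SAMPLE)
reviews, comments = split_posts(posts)
print(len(reviews), "review(s),", len(comments), "comment(s)")
print(dict(score_distribution(posts)))
```

For the downloaded corpus, the same functions could be applied to the result of `json.load` on the file opened with `encoding="utf-8"` (the record lists UTF-8 as the character encoding). The file name depends on the download and is not given in this record.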
dc:type | corpus |
dc:title | Corpus with Book Reviews from Bokelskere.no |
dc:identifier | oai:nb.no:sbr-53 |
dc:description | This corpus is a dump of user-generated book reviews and discussions from Bokelskere.no (meaning "book lovers"), a web community where users review and discuss new and old literature, both fiction and non-fiction. The corpus is structured as a JSON array in which each object corresponds to a review, or a comment on a review, on Bokelskere.no. Each object has the following fields: – "post_id": unique identifier for the review – "date": date when the review was posted – "user_id": unique identifier for the user – "isbn13": ISBN of the reviewed book – "post_title": title of the review – "text": the review text – "score": evaluation (from 1-6, where 6 is the best) – "main_title": title of the reviewed book – "author": author of the reviewed book – "parent_id": identifier of the review that has been commented upon. The corpus contains approximately 219,000 posts/objects and 1.5 million word tokens (in the "text" field). |
dc:publisher | |
dc:format | downloadable |
dc:date | 2020-02-24 |
dc:date | 2020-02-25 |
dc:rights | Public |
dc:rights | Creative Commons (CC) |
dc:rights | Creative_Commons-ZERO (CC-ZERO) |
dc:rights | https://creativecommons.org/publicdomain/zero/1.0/ |
dc:creator | Andre Kåsen |
dc:lang | Norwegian Bokmål |
dc:lang | Norwegian Nynorsk |