This corpus is a dump of discussion threads from the Norwegian Wikipedia, where authors discuss various issues regarding specific Wikipedia articles.
The material is split into two files, one each for Bokmål (nb.wikipedia.json) and Nynorsk (nn.wikipedia.json). Each file is a structured JSON array. Each discussion corresponds to one element of the array, a single-level object containing the text and its metadata. There are eight key/value pairs per discussion:
- title: title of article under discussion
- pageid: text identifier
- revid: audit information
- wikidata: other data
- contentcategories: metadata
- hiddencategories: metadata
- text: discussion text
- bytelength: length of text in number of bytes
An example of a discussion element can be found in the PDF file (2019_wikidisc.pdf).
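The structure described above can be read with a few lines of Python. This is a minimal sketch, not part of the corpus distribution: the helper names (`load_discussions`, `missing_keys`, `utf8_length`) are hypothetical, and it assumes the files are UTF-8 encoded. Whether `bytelength` equals the UTF-8 byte count of `text` is also an assumption that should be checked against the data.

```python
import json
from pathlib import Path

# The eight keys listed in the corpus description.
EXPECTED_KEYS = {
    "title", "pageid", "revid", "wikidata",
    "contentcategories", "hiddencategories", "text", "bytelength",
}


def load_discussions(path):
    """Load one corpus file and return its discussions as a list of dicts.

    Assumes the file is UTF-8 encoded (not stated in the description).
    """
    with Path(path).open(encoding="utf-8") as handle:
        data = json.load(handle)
    if not isinstance(data, list):
        raise ValueError("expected a top-level JSON array")
    return data


def missing_keys(discussion):
    """Return the documented keys that are absent from one discussion."""
    return EXPECTED_KEYS - set(discussion)


def utf8_length(discussion):
    """Length of the discussion text in UTF-8 bytes.

    Comparing this with the stored 'bytelength' value is a sanity check
    only; the encoding behind 'bytelength' is an assumption.
    """
    return len(discussion["text"].encode("utf-8"))


if __name__ == "__main__":
    corpus_file = Path("nb.wikipedia.json")  # Bokmål file named in the description
    if corpus_file.exists():
        discussions = load_discussions(corpus_file)
        print(len(discussions), "discussions loaded")
        for discussion in discussions[:3]:
            print(discussion["title"], utf8_length(discussion))
```

The same functions apply unchanged to nn.wikipedia.json, since both files are described as having the same layout.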
FAQ
Some basic information about API Store®.
Operation and development of the APIs are currently fully funded by the company Apitalks, and their use is free of charge.
Yes, you can.
All important information, such as the time of the last update and the license, is included in the response of each API call.
In the case of a major update that is not compatible with the previous version of an API, we keep both versions available for 30 days so that you have enough time to migrate to the new version. We will inform you about the changes in advance by e-mail.