Please, provide a detailed description of the issue.
Corpus, settings, query and your username are sent automatically.

Interface language
This action may take several minutes for large corpora, please wait.

Corpus Norwegian Web 2015 (Nynorsk) – statistics and info

Norwegian web corpus crawled by SpiderLing in Febreuary 2015. Encoded in UTF-8, cleaned, deduplicated. Tagged by Oslo-Bergen Tagger.

Counts
Tokens63828239
Words54511854
Sentences3517938
Paragraphs1269924
Documents214379
General info
Corpus description Document
LanguageNorwegian
EncodingUTF-8
Compiled05/04/2017 15:21:49
Tagset Description
Lexicon sizes
word1444673
bm_score623
nn_score585
da_score623
sv_score575
lc1261843

Structures and attributes

hide detail