Please, provide a detailed description of the issue.
Corpus, settings, query and your username are sent automatically.

Interface language
This action may take several minutes for large corpora, please wait.

Corpus Norwegian Web 2015 (Bokmål) – statistics and info

Norwegian web corpus crawled by SpiderLing in Febreuary 2015. Encoded in UTF-8, cleaned, deduplicated. Tagged by Oslo-Bergen Tagger.

Counts
Tokens1364503936
Words1178357993
Sentences73185546
Paragraphs25794358
Documents3443807
General info
Corpus description Document
LanguageNorwegian
EncodingUTF-8
Compiled10/13/2017 01:00:26
Tagset Description
Word sketch grammar Definition
Lexicon sizes
word10671830
tag258
lempos_tc9170130
tag_attrs31943
bm_score625
nn_score593
da_score625
sv_score580
lc9271815
lempos8543951
lemma8208048
lemma_lc8208048
Tags legend (tagset)
adjectiveadj
adverbadv
conjunctionkonj
interjectioninterj
nounsubst
pronounpron
verbverb
Lempos suffixes
adjective-j
adverb-a
conjunction-c
interjection-i
noun-n
preposition-p
pronoun-d
verb-v
other-x

Structures and attributes

hide detail