Please, provide a detailed description of the issue.
Corpus, settings, query and your username are sent automatically.

Interface language
This action may take several minutes for large corpora, please wait.

Corpus Oromo WaC [2016] – statistics and info

Oromo web corpus. Crawled by SpiderLing in January 2016. Encoded in UTF-8, cleaned, deduplicated.

Counts
Tokens5091696
Words4249953
Sentences250432
Paragraphs76115
Documents8851
General info
Corpus description Document
LanguageOromo
EncodingUTF-8
Compiled06/02/2017 13:45:04
Word sketch grammar Definition
Lexicon sizes
word273056
tag12
lc242958

Structures and attributes

hide detail