Keywords
Keywords are individual words (%[token|tokens]%) which appear more frequently in the focus corpus than in the reference corpus. Any %[token]% can qualify as a keyword if it is used more frequently in the focus corpus than in the reference corpus. In practice, the result consists mainly of nouns and adjectives because the frequencies of other parts of speech tend to be similar in all texts.
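The comparison described above can be illustrated with a minimal sketch. It assumes that frequencies are normalized per million tokens and compared as a smoothed ratio. The function names, the `smoothing` parameter and the threshold of 1.0 are illustrative assumptions, not the scoring actually used by the tool.

```python
from collections import Counter


def keyword_scores(focus_tokens, reference_tokens, smoothing=1.0):
    """Score each focus-corpus token against the reference corpus.

    Assumption: the score is a smoothed ratio of per-million frequencies.
    A score above 1.0 means the token is relatively more frequent in the
    focus corpus than in the reference corpus.
    """
    if not focus_tokens or not reference_tokens:
        raise ValueError("both corpora must contain at least one token")

    focus_counts = Counter(focus_tokens)
    reference_counts = Counter(reference_tokens)

    scores = {}
    for token, count in focus_counts.items():
        # Normalize raw counts so corpora of different sizes are comparable.
        focus_fpm = count / len(focus_tokens) * 1_000_000
        reference_fpm = reference_counts[token] / len(reference_tokens) * 1_000_000
        # Smoothing avoids division by zero for tokens absent from the reference.
        scores[token] = (focus_fpm + smoothing) / (reference_fpm + smoothing)
    return scores


def keywords(focus_tokens, reference_tokens, smoothing=1.0):
    """Return tokens that are more frequent in the focus corpus, best first."""
    scores = keyword_scores(focus_tokens, reference_tokens, smoothing)
    candidates = [token for token, score in scores.items() if score > 1.0]
    return sorted(candidates, key=lambda token: scores[token], reverse=True)


focus = "the corpus contains the token corpus".split()
reference = "the cat sat on the mat".split()
result = keywords(focus, reference)
```

In this toy input, "the" has the same relative frequency in both corpora, so it does not qualify, which mirrors why function words rarely appear among keywords.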
Terms
Terms are multi-word expressions which appear more frequently in the focus corpus than in the reference corpus and, additionally, match the typical format of terminology in the language. The format is defined in the term grammar.
The result of term extraction is displayed as %[lemma|lemmas]%.
Gender lemmas are used for languages where the word form of an adjective has to match the gender of the noun.
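The two conditions for a term (matching a terminology pattern and being more frequent in the focus corpus) can be sketched as follows. This is only an illustration: the pattern "adjectives or nouns followed by a noun" is a hypothetical stand-in for a real term grammar, and the tag names `ADJ` and `NOUN`, the function names and the smoothed ratio are all assumptions. The input is a list of `(lemma, part_of_speech)` pairs, so the candidates come out as lemmas.

```python
from collections import Counter

# Hypothetical tag names; a real tagset and term grammar will differ.
MODIFIER_TAGS = {"ADJ", "NOUN"}
HEAD_TAG = "NOUN"


def term_candidates(tagged_tokens, max_len=3):
    """Count multi-word sequences of the form (ADJ|NOUN)+ NOUN."""
    counts = Counter()
    for start in range(len(tagged_tokens)):
        for length in range(2, max_len + 1):
            window = tagged_tokens[start:start + length]
            if len(window) < length:
                break  # ran past the end of the corpus
            *modifiers, head = window
            if head[1] == HEAD_TAG and all(tag in MODIFIER_TAGS for _, tag in modifiers):
                counts[" ".join(lemma for lemma, _ in window)] += 1
    return counts


def terms(focus_tagged, reference_tagged, smoothing=1.0):
    """Return pattern-matching candidates that are more frequent in the focus corpus."""
    if not focus_tagged or not reference_tagged:
        raise ValueError("both corpora must contain at least one token")

    focus_counts = term_candidates(focus_tagged)
    reference_counts = term_candidates(reference_tagged)

    scores = {}
    for term, count in focus_counts.items():
        # Per-million normalization, as in a simple frequency comparison.
        focus_fpm = count / len(focus_tagged) * 1_000_000
        reference_fpm = reference_counts[term] / len(reference_tagged) * 1_000_000
        scores[term] = (focus_fpm + smoothing) / (reference_fpm + smoothing)

    candidates = [term for term, score in scores.items() if score > 1.0]
    return sorted(candidates, key=lambda term: scores[term], reverse=True)


focus_tagged = [
    ("neural", "ADJ"), ("network", "NOUN"), ("learn", "VERB"),
    ("neural", "ADJ"), ("network", "NOUN"),
]
reference_tagged = [("big", "ADJ"), ("dog", "NOUN"), ("run", "VERB")]
found = terms(focus_tagged, reference_tagged)
```

Sequences containing the verb are rejected by the pattern, so only the adjective and noun combination survives as a candidate.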