Frequentielijsten corpora - INT Taalmaterialen

De 5000 meest voorkomende woorden uit de Miljoenencorpora, het PAROLE-corpus 2004, het Corpus Gesproken Nederlands, het Algemeen Nederlands Woordenboekcorpus, het Eindhoven-corpus, het D-Coi-corpus en het SoNaR-corpus. Voor vrijwel elk van deze producten is er zowel een lemmafrequentielijst als een typefrequentielijst beschikbaar.

Voor commercieel gebruik zie de commerciële productpagina.

The 5000 most frequent words from the Million Corpora, the 2004 PAROLE corpus, the Corpus Gesproken Nederlands, the Algemeen Nederlands Woordenboek corpus, the Eindhoven corpus, the D-Coi corpus and the SoNaR corpus. For almost each of these products, both a lemma frequency list and a type frequency list are available.

For commercial use, see the commercial product page.

Productdetails

Dataformaat	txt
Jaar	2014
Opdrachtgever	INT
Refereren	Frequentielijsten corpora (Version 4.0.1) (2014) [Data set]. Available at the Dutch Language Institute: http://hdl.handle.net/10032/tm-a2-f8
Talen	Nederlands
Toepassing	Referentiemateriaal voor bijvoorbeeld onderwijs: tekstschrijvers kunnen nagaan of bepaalde woorden moeilijk (infrequent) zijn.
Versie	4.0.1

Downloaddetails

Bestand
Frequentielijsten_corpora_4.0.1.zip

Aantal bestanden 1
Aantal downloads 357
Bestandsgrootte 958.42 KB
Datum plaatsing 03/09/2020
Laatst bijgewerkt 23/01/2026
Versie 4.0.1