Fonologie

Corpus Gesproken Nederlands (CGN)

Een verzameling van ongeveer 900 uur gesproken Standaardnederlands afkomstig van Vlamingen en Nederlanders.
A collection of about 900 hours spoken standard Dutch from Flanders and the Netherlands.

Corpus Gesproken Nederlands (CGN) Commercieel

Een verzameling van ongeveer 900 uur gesproken Standaardnederlands afkomstig van Vlamingen en Nederlanders.
A collection of about 900 hours spoken standard Dutch from Flanders and the Netherlands.

Children’s Oral Reading Corpus (CHOREC)

Een verzameling van 130 uur voorgelezen kinderspraak.
A collection of 130 hours of speech by children (reading loud).

Frequentielijsten corpora Commercieel

De 5000 meest voorkomende woorden uit de Miljoenencorpora, het PAROLE-corpus 2004, het CGN, het ANW-corpus, het Eindhoven-corpus, het D-Coi-corpus en het SoNaR-corpus.
The 5000 most frequent words from the Millions Corpora, the PAROLE 2004 Corpus, the Spoken Dutch Corpus, the ANW Corpus, the Eindhoven Corpus, the D-Coi Corpus and the SoNaR corpus.

e-Lex Commercieel

Lexicon met ruim 200.000 lemma’s en ruim 640.000 woordvormen voorzien van o.a. POS-tag, complementatiepatroon, semantisch type en uitspraakinformatie.
A lexical database consisting of over 200,000 entries and over 640,000 word forms, enriched with part of speech, complementation type, semantic type, and phonological information.

« Vorige