Lemma's

IFA Corpus

Een database voor fonetisch onderzoek die bestaat uit Nederlandse spraakdata van 8 personen; 4 mannelijk en 4 vrouwelijk.
A corpus for phonetic research consisting of speech data of 4 male and 4 female persons.

Dutch Parallel Corpus (DPC)

Een parallel corpus van 10 miljoen woorden voor de taalparen Nederlands-Engels en Nederlands-Frans.
A parallel corpus of 10 million words for the language pairs Dutch-English and Dutch-French.

Dupira

Parser voor het Nederlands voor toepassingen in information retrieval.
Parser for Dutch for applications in information retrieval.

D-TUNA-corpus

Het D-TUNA-corpus bestaat uit 2400 geschreven en (getranscribeerde) gesproken referentiële expressies.
The D-TUNA Corpus consists of 2400 written and (transcribed) spoken referential expressions.

CombiLex

CombiLex is een lijst van lemma’s en woordvormen zonder toegevoegde taalkundige informatie.
Combilex is a list of Dutch lemmas and word forms without further annotation.

« Vorige

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.