D-TUNA-corpus - INT Taalmaterialen

The D-TUNA corpus consists of 2400 written and (transcribed) spoken referential expressions. All expressions are semantically annotated (XML format), which makes the corpus usable as input for language generation systems. The composition of the D-TUNA corpus is inspired by the English TUNA Corpus.

Product details

Data format	Annotations (XML)
Documentation	Documentation
Owner	Tilburg University
Year	2009
Citation	D-TUNA-corpus (Version 1.0) (2009) [Data set]. Available at the Dutch Language Institute: http://hdl.handle.net/10032/tm-a2-k5
Languages	Dutch
Application	Input for language generation systems.
Version	1.0

Download details

File
D-TUNA-corpus_1.0p1.zip

Number of files 1
Number of downloads 16
File size 2.44 MB
Date posted 03/09/2020
Last updated 23/01/2026
Version 1.0