COREA-coreferentiecorpus Commercieel

The COREA coreference corpus (approximately 150,000 words) consists of Dutch texts in which coreference relations are systematically annotated. The texts comprise newspaper articles (D-Coi), transcribed speech (CGN), and entries from the Spectrum (Winkler Prins) Medical Encyclopedia.

This product is free of charge, but a signed license agreement is required. The download contains the license and further instructions for placing an order.

Product details

Data format	XML, MMAX2
Demo	Examples of annotated corpus texts
Documentation	LREC 2008 paper
Owner	Taalunie
Funder	NTU|STEVIN
Year	2014
Commissioned by	NTU|STEVIN
Project	COREA
Citation	COREA-coreferentiecorpus Commercieel (Version 1.0.1) (2014) [Data set]. Available at the Dutch Language Institute: http://hdl.handle.net/10032/tm-a2-e9
Languages	Dutch
Application	Automatic text analysis, automatic summarization.
Version	1.0.1

Download details

File
BP_COREA-coreferentiecorpus_C.zip

Number of files 1
Number of downloads 15
File size 51.68 KB
Date posted 02/09/2020
Last updated 15/12/2025
Version 1.0.1