A multimodal parallel corpus of Dutch and Sign Language of the Netherlands (NGT). A total of 297 hotel reviews written in Dutch were translated into NGT videos by 6 professional deaf translators. Each review was translated by a single translator. The Dutch reviews range in length from around 15 to 400 words; the NGT videos range in duration from around 10 seconds to around 4 minutes. In total, the corpus contains 22,197 Dutch words and about 3 hours and 50 minutes of NGT video.
Version 1.2 contains 14 additional videos compared to the previous versions.
Product details
Data format | mp4, mov, txt |
Year | 2024 |
Commissioned by | |
Project | NGT-HoReCo |
Funder | European Language Equality 2 (ELE2) |
Documentation | Project Report |
Citation | Hotel Review Corpus Nederlandse Gebarentaal (NGT-HoReCo) (Version 1.2) (2024) [Data set]. Available at the Dutch Language Institute: https://hdl.handle.net/10032/tm-a2-x7. |
Languages | Dutch, Sign Language of the Netherlands (NGT) |
Size | approx. 4 hours, 22,197 words |
Application | Machine translation, language model development |
Version | 1.2 |
License | Creative Commons Attribution-NonCommercial 3.0 Unported License. |
Download details
File | |
---|---|
ngt-horeco1.2.zip | |
- Number of files 195
- Number of downloads 5
- File size 146.62 MB
- Date published 12/02/2024
- Last updated 26/07/2024
- Version 1.1