• Door naar de hoofd inhoud
  • Skip to secondary menu
  • Spring naar de eerste sidebar

INT Taalmaterialen

Bronnen, data en tools voor
taalkundig onderzoek binnen het
Nederlandse taalgebied.

U bent ingelogd.

MENUMENU
  • Nieuw
  • Alle taalmaterialen
  • Over deze website
  • Mijn taalmaterialen
  • Registreren
  • Inloggen
  • Zoeken

Corpus Ondertitelde UVN-Colleges (COUC)

This corpus contains 57 (2020-07-16) subtitled lectures from the Universiteit van Nederland (UVN). Subtitles were added to existing video recordings of lectures of the UVN.

Unlike common subtitles, the subtitles generated in this project are a nearly 100% literal representation of the speech as spoken by the people in the recordings. They contain exact orthographic transcriptions of subsequent words and thus show the peculiarities of the spoken language modality, lacking grammatical coherence typical for written texts.
On the other hand, the transcriptions do not contain speaker noises (such as lip smacks or coughs) nor hesitation sounds as "ehm". For the sake of readability punctuation markers were included.

The purpose of the subtitles is to add support for language learners of Dutch.
The videos are selected to reflect the language variety of spoken Dutch in an educational setting covering a large diversity of lecture topics at a popular level such as linguistics, physics and history. The videos include speakers of Northern Dutch as spoken in the Netherlands and of South Dutch as spoken in Flanders (Belgium). Moreover, some speakers have an audible different "language" background such as English or Moroccan.

Productdetails

Aantal uren spraak meer dan 14 uur
Dataformaat Video: mp4, geluid: wav, transcripties: txt
Documentatie Ondertitelen-UvN-Final.pdf.
Eigenaar Nederlandse Taalunie
Financier Nederlandse Taalunie
Jaar 2020
Refereren Corpus Ondertitelde UVN-Colleges - COUC (Version 1.0) (2020) [Data set]. Available at the Dutch Language Institute: http://hdl.handle.net/10032/tm-a2-s3
Talen Nederlands, Vlaams
Toepassing Onderzoek, testen van spraakherkenners
Versie 1.0

Downloaddetails


Bestand
COUC1.0.zip
  • Aantal bestanden 1
  • Aantal downloads 73
  • Bestandsgrootte 21,935.82 MB
  • Datum plaatsing 04/12/2020
  • Laatst bijgewerkt 20/05/2021
  • Versie 1.0
Log in om te downloaden

Primaire Sidebar

Zoek op naam / tags

  • Disclaimer
  • Privacy Policy

© 2022 — Instituut voor de Nederlandse Taal — Contact: taalmaterialen@ivdnt.org

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Op deze website maken wij gebruik van cookies. Lees meerIk ga akkoord
Privacy & Cookies Policy

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these cookies, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may have an effect on your browsing experience.
Necessary
Altijd ingeschakeld
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Non-necessary
Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.
OPSLAAN & ACCEPTEREN
Skip to content
Open toolbar

Toegankelijkheid

  • Vergroot tekst
  • Verklein tekst
  • Grijstinten
  • Hoog contrast
  • Negatief contrast
  • Lichte achtergrond
  • Links onderstreept
  • Leesbaar font
  • Reset