English and Afrikaans parallel corpora aligned on sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from the SA government domain.
Productdetails
Aantal woorden | Text: 421 319 sentences (tokens) |
Annotaties | UTF8, Aligned, Sentence segmented |
Dataformaat | text |
Documentatie | Readme contained in download |
Eigenaar | North-West University, Centre for Text Technology (CTexT) |
Financier | Department of Arts and Culture |
Licentiesoort | Creative Commons Attribution-NonCommercial-ShareAlike 2.5 South Africa |
Opdrachtgever | Department of Arts and Culture |
Talen | Afrikaans, English |
Versie | 1.0 |
Downloaddetails
Bestand | |
---|---|
20150804_Autshumato_English-Afrikaans_Parallel_Corpora_1.0.zip |
- Aantal bestanden 1
- Aantal downloads
- Bestandsgrootte 5.80 MB
- Datum plaatsing 02/09/2020
- Laatst bijgewerkt 17/09/2020
- Versie