English and Afrikaans parallel corpora aligned on sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from the SA government domain.
Productdetails
| Aantal woorden | Text: 421 319 sentences (tokens) |
| Annotaties | UTF8, Aligned, Sentence segmented |
| Dataformaat | text |
| Documentatie | Readme contained in download |
| Eigenaar | North-West University, Centre for Text Technology (CTexT) |
| Financier | Department of Arts and Culture |
| Licentiesoort | Creative Commons Attribution-NonCommercial-ShareAlike 2.5 South Africa |
| Opdrachtgever | Department of Arts and Culture |
| Talen | Afrikaans, English |
| Versie | 1.0 |
Downloaddetails
| Bestand | |
|---|---|
| 20150804_Autshumato_English-Afrikaans_Parallel_Corpora_1.0.zip |
- Aantal bestanden 1
- Aantal downloads 2
- Bestandsgrootte 5.80 MB
- Datum plaatsing 02/09/2020
- Laatst bijgewerkt 17/09/2020
- Versie