SoNaR-corpus Het SoNaR-corpus bevat ruim 500 miljoen woorden afkomstig uit (standaard) Nederlandstalige teksten van na 1954. The SoNaR Corpus contains more than 500 million words from texts in standard Dutch later than 1954. Lees meer
Siswati Genre Classification Corpus Contains training and testing data for genre classification for Siswati. Lees meer
Setswana Genre Classification Corpus Contains training and testing data for genre classification for Setswana. Lees meer
Sesotho sa Leboa Genre Classification Corpus Contains training and testing data for genre classification for Sesotho sa Leboa. Lees meer
Sesotho Genre Classification Corpus Contains training and testing data for genre classification for Sesotho. Lees meer