Filter by:
German (6)
Romanian (6)
English (5)
Croatian (3)
Estonian (3)
Greek (3)
Latvian (3)
Lithuanian (3)
Slovenian (3)
Italian (2)
Russian (2)
Turkish (2)
Albanian (1)
Amharic (1)
Arabic (1)
Aymara (1)
Bengali (1)
Bulgarian (1)
Burmese (1)
Central Khmer (1)
Chech (1)
Chinese (1)
Czech (1)
Danish (1)
Dutch (1)
Dutch; Flemish (1)
Esperanto (1)
Finnish (1)
French (1)
Hebrew (1)
Hindi (1)
Hungarian (1)
Indonesian (1)
Japanese (1)
Korean (1)
Macedonian (1)
Malagasy (1)
Oriya (1)
Persian (1)
Polish (1)
Portuguese (1)
Serbian (1)
Swahili (1)
Swedish (1)
Urdu (1)
Text (6)
CC - BY (6)
Nlp Applications (6)
Corpus (6)
Attribution (1)
True (1)
Text Mining (1)
Multilingual (4)
Bilingual (2)
Parallel (4)
Comparable (2)
Written Language (3)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
6 Language Resources
Order by:
ACCURAT balanced test corpus for under resourced languages
0
279
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
0
261
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
0
275
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
Parallel Global Voices
0
167
- Albanian
- Amharic
- Arabic
- Aymara
- Bengali
- Bulgarian
- Burmese
- Catalan; Valencian
- Central Khmer
- Chinese
- Czech
- Danish
- Dutch; Flemish
- English
- Esperanto
- Filipino; Pilipino
- French
- German
- Greek, Modern (1453-)
- Hebrew
- Hindi
- Hungarian
- Indonesian
- Italian
- Japanese
- Korean
- Macedonian
- Malagasy
- Oriya
- Persian
- Polish
- Portuguese
- Romanian
- Russian
- Serbian
- Spanish; Castilian
- Swahili
- Swedish
- Turkish
- Urdu