Filter by:
German (12)
Romanian (12)
English (11)
Estonian (6)
Finnish (6)
Italian (6)
Latvian (6)
Lithuanian (6)
Czech (5)
Russian (5)
Slovenian (5)
Bulgarian (4)
Croatian (4)
Danish (4)
Dutch; Flemish (4)
French (4)
Greek (4)
Hungarian (4)
Polish (4)
Portuguese (4)
Swedish (4)
Turkish (4)
Slovak (3)
Spanish (3)
Albanian (2)
Arabic (2)
Chech (2)
Chinese (2)
Dutch (2)
Hebrew (2)
Japanese (2)
Macedonian (2)
Maltese (2)
Serbian (2)
Amharic (1)
Armenian (1)
Aymara (1)
Azerbaijani (1)
Basque (1)
Bengali (1)
Bosnian (1)
Burmese (1)
Central Khmer (1)
Esperanto (1)
Galician (1)
Georgian (1)
Hindi (1)
Icelandic (1)
Indonesian (1)
Irish (1)
Kazakh (1)
Kirghiz; Kyrgyz (1)
Korean (1)
Malagasy (1)
Mongolian (1)
Norwegian (1)
Oriya (1)
Persian (1)
Slovene (1)
Swahili (1)
Tajik (1)
Turkmen (1)
Ukrainian (1)
Urdu (1)
Corpus (12)
Text (12)
Nlp Applications (12)
Human Use (1)
True (2)
Text Mining (1)
Written Language (5)
1996-2011 (1)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
12 Language Resources
Order by:
ACCURAT balanced test corpus for under resourced languages
0
279
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
0
261
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
0
275
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
Bulgarian-X language Parallel Corpus
0
269
- Albanian
- Arabic
- Armenian
- Azerbaijani
- Basque
- Bosnian
- Bulgarian
- Catalan; Valencian
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- Galician
- Georgian
- German
- Greek
- Hebrew
- Hungarian
- Icelandic
- Irish
- Italian
- Japanese
- Kazakh
- Kirghiz; Kyrgyz
- Latvian
- Lithuanian
- Macedonian
- Maltese
- Mongolian
- Norwegian
- Polish
- Portuguese
- Romanian
- Russian
- Serbian
- Slovak
- Slovene
- Spanish
- Swedish
- Tajik
- Turkish
- Turkmen
- Ukrainian
Europarl Parallel Corpus
0
190
- Bulgarian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Latvian
- Lithuanian
- Polish
- Portuguese
- Romanian
- Slovak
- Slovenian
- Spanish
- Swedish
JRC-Acquis Multilingual Parallel Corpus
0
174
- Bulgarian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Latvian
- Lithuanian
- Maltese
- Polish
- Portuguese
- Romanian
- Slovak
- Slovenian
- Spanish
- Swedish
Parallel Global Voices
0
167
- Albanian
- Amharic
- Arabic
- Aymara
- Bengali
- Bulgarian
- Burmese
- Catalan; Valencian
- Central Khmer
- Chinese
- Czech
- Danish
- Dutch; Flemish
- English
- Esperanto
- Filipino; Pilipino
- French
- German
- Greek, Modern (1453-)
- Hebrew
- Hindi
- Hungarian
- Indonesian
- Italian
- Japanese
- Korean
- Macedonian
- Malagasy
- Oriya
- Persian
- Polish
- Portuguese
- Romanian
- Russian
- Serbian
- Spanish; Castilian
- Swahili
- Swedish
- Turkish
- Urdu