Filter by:
English (17)
German (9)
French (8)
Latvian (8)
Lithuanian (8)
Romanian (8)
Estonian (7)
Slovenian (7)
Bulgarian (6)
Croatian (6)
Hungarian (5)
Italian (5)
Polish (5)
Portuguese (5)
Czech (4)
Danish (4)
Greek (4)
Serbian (4)
Slovak (4)
Spanish (4)
Swedish (4)
Albanian (3)
Chinese (3)
Dutch; Flemish (3)
Finnish (3)
Macedonian (3)
Arabic (2)
Dutch (2)
Hebrew (2)
Japanese (2)
Maltese (2)
Russian (2)
Turkish (2)
Amharic (1)
Armenian (1)
Aymara (1)
Azerbaijani (1)
Basque (1)
Bengali (1)
Bosnian (1)
Burmese (1)
Central Khmer (1)
Esperanto (1)
Galician (1)
Georgian (1)
Hindi (1)
Icelandic (1)
Indonesian (1)
Irish (1)
Kazakh (1)
Kirghiz; Kyrgyz (1)
Korean (1)
Malagasy (1)
Mongolian (1)
Norwegian (1)
Oriya (1)
Persian (1)
Slovene (1)
Swahili (1)
Tajik (1)
Turkmen (1)
Ukrainian (1)
Urdu (1)
Corpus (17)
Nlp Applications (17)
Human Use (2)
Multilingual (17)
Monolingual (2)
True (6)
Machine Translation (10)
Text Mining (1)
Parallel (12)
Comparable (5)
News (1)
Racial discourse (1)
Renewable energy (1)
Wikipedia (1)
Education (1)
Environment (1)
Health (1)
Law (1)
Literature (1)
Politics (1)
Tourism (1)
Travel (1)
1996-2011 (1)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
17 Language Resources
Order by:
ACCURAT balanced test corpus for under resourced languages
0
277
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
0
259
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
0
271
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
Bulgarian-X language Parallel Corpus
0
263
- Albanian
- Arabic
- Armenian
- Azerbaijani
- Basque
- Bosnian
- Bulgarian
- Catalan; Valencian
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- Galician
- Georgian
- German
- Greek
- Hebrew
- Hungarian
- Icelandic
- Irish
- Italian
- Japanese
- Kazakh
- Kirghiz; Kyrgyz
- Latvian
- Lithuanian
- Macedonian
- Maltese
- Mongolian
- Norwegian
- Polish
- Portuguese
- Romanian
- Russian
- Serbian
- Slovak
- Slovene
- Spanish
- Swedish
- Tajik
- Turkish
- Turkmen
- Ukrainian
DICTA-SIGN corpus
0
205
- British Sign Language
- English
- French
- French Sign Language
- German
- German Sign Language
- Greek Sign Language
- Greek, Modern (1453-)
English-Estonian cross-linked collection of comparable sentences from Wikipedia
0
99
- English
- Estonian
English-Lithuanian cross-linked collection of comparable sentences from Wikipedia
0
94
- English
- Lithuanian
Europarl Parallel Corpus
0
188
- Bulgarian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Latvian
- Lithuanian
- Polish
- Portuguese
- Romanian
- Slovak
- Slovenian
- Spanish
- Swedish
JRC-Acquis Multilingual Parallel Corpus
0
172
- Bulgarian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Latvian
- Lithuanian
- Maltese
- Polish
- Portuguese
- Romanian
- Slovak
- Slovenian
- Spanish
- Swedish
Multilingual aligned corpus of Subtitles annotated for sentiment
0
202
- English
- Greek, Modern (1453-)
- Spanish; Castilian
Multilingual Edition of Verne's Novel "Around the World in 80 Days"
0
214
- Albanian
- Bulgarian
- Chinese
- Croatian
- Dutch
- English
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Macedonian
- Polish
- Portuguese
- Serbian
- Slovak
- Slovenian
- Spanish
Parallel Global Voices
0
165
- Albanian
- Amharic
- Arabic
- Aymara
- Bengali
- Bulgarian
- Burmese
- Catalan; Valencian
- Central Khmer
- Chinese
- Czech
- Danish
- Dutch; Flemish
- English
- Esperanto
- Filipino; Pilipino
- French
- German
- Greek, Modern (1453-)
- Hebrew
- Hindi
- Hungarian
- Indonesian
- Italian
- Japanese
- Korean
- Macedonian
- Malagasy
- Oriya
- Persian
- Polish
- Portuguese
- Romanian
- Russian
- Serbian
- Spanish; Castilian
- Swahili
- Swedish
- Turkish
- Urdu