Filter by:
Croatian (10)
German (10)
English (9)
French (7)
Greek (7)
Polish (7)
Portuguese (7)
Estonian (6)
Italian (6)
Latvian (6)
Lithuanian (6)
Romanian (6)
Slovenian (6)
Spanish (6)
Swedish (6)
Arabic (5)
Chinese (5)
Czech (5)
Danish (5)
Dutch (5)
Finnish (5)
Norwegian (5)
Russian (5)
Turkish (5)
Bulgarian (4)
Hungarian (4)
Japanese (4)
Slovak (4)
Icelandic (3)
Korean (3)
Maltese (3)
Thai (3)
Ukrainian (3)
Vietnamese (3)
Albanian (2)
Irish (2)
Macedonian (2)
Serbian (2)
Armenian (1)
Azerbaijani (1)
Basque (1)
Belarussian (1)
Bosnian (1)
Dutch; Flemish (1)
Galician (1)
Georgian (1)
Hausa (1)
Hebrew (1)
Hindi (1)
Kazakh (1)
Kirghiz; Kyrgyz (1)
Mongolian (1)
Persian (1)
Slovene (1)
Swahili (1)
Tajik (1)
Tamil (1)
Turkmen (1)
True (4)
Nlp Applications (5)
Human Use (2)
Text Mining (1)
Multilingual (7)
Monolingual (3)
Parallel (5)
Comparable (2)
Written Language (4)
Text/xml (1)
European Union (1)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
10 Language Resources
Order by:
ACCURAT balanced test corpus for under resourced languages
0
277
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
0
259
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
0
271
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
Bulgarian-X language Parallel Corpus
0
263
- Albanian
- Arabic
- Armenian
- Azerbaijani
- Basque
- Bosnian
- Bulgarian
- Catalan; Valencian
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- Galician
- Georgian
- German
- Greek
- Hebrew
- Hungarian
- Icelandic
- Irish
- Italian
- Japanese
- Kazakh
- Kirghiz; Kyrgyz
- Latvian
- Lithuanian
- Macedonian
- Maltese
- Mongolian
- Norwegian
- Polish
- Portuguese
- Romanian
- Russian
- Serbian
- Slovak
- Slovene
- Spanish
- Swedish
- Tajik
- Turkish
- Turkmen
- Ukrainian
Collins Multilingual database (MLD) – PhraseBank with audio files
0
192
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Finnish
- French
- German
- Greek
- Hindi
- Italian
- Japanese
- Korean
- Norwegian
- Persian
- Polish
- Portuguese
- Russian
- Spanish
- Swedish
- Thai
- Turkish
- Vietnamese
Collins Multilingual database (MLD) – WordBank with audio files
0
169
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Finnish
- French
- German
- Greek
- Italian
- Japanese
- Korean
- Norwegian
- Polish
- Portuguese
- Russian
- Spanish
- Swedish
- Thai
- Turkish
- Vietnamese
GlobalPhone 2000 Speaker Package
0
85
- Arabic
- Bulgarian
- Chinese
- Croatian
- Czech
- French
- German
- Hausa
- Japanese
- Korean
- Polish
- Portuguese
- Russian
- Spanish
- Swahili
- Swedish
- Tamil
- Thai
- Turkish
- Ukrainian
- Vietnamese
Multilingual Edition of Verne's Novel "Around the World in 80 Days"
0
214
- Albanian
- Bulgarian
- Chinese
- Croatian
- Dutch
- English
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Macedonian
- Polish
- Portuguese
- Serbian
- Slovak
- Slovenian
- Spanish
PELCRA mutlilingual parallel corpora (CC-BY)
0
236
- Arabic
- Belarussian
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- German
- Greek
- Hungarian
- Icelandic
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Norwegian
- Polish
- Portuguese
- Romanian
- Russian
- Slovak
- Slovenian
- Spanish
- Swedish
- Turkish
- Ukrainian
Tilde MODEL - Multilingual Open Data for EU Languages
0
85
- Croatian
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Norwegian
- Polish
- Portuguese
- Romanian
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish