Filter by:
Croatian (11)
English (11)
Greek (11)
German (10)
Romanian (9)
Estonian (8)
Latvian (8)
Lithuanian (8)
Czech (7)
Danish (7)
Dutch (7)
Finnish (7)
French (7)
Italian (7)
Polish (7)
Portuguese (7)
Slovenian (7)
Spanish (7)
Swedish (7)
Russian (6)
Bulgarian (5)
Hungarian (5)
Maltese (5)
Slovak (5)
Turkish (5)
Arabic (4)
Norwegian (4)
Basque (3)
Chinese (3)
Japanese (3)
Albanian (2)
Bosnian (2)
Icelandic (2)
Irish (2)
Korean (2)
Macedonian (2)
Serbian (2)
Thai (2)
Ukrainian (2)
Vietnamese (2)
Armenian (1)
Azerbaijani (1)
Belarusian (1)
Galician (1)
Georgian (1)
Hebrew (1)
Hindi (1)
Kazakh (1)
Kirghiz; Kyrgyz (1)
Latin (1)
Mongolian (1)
Norvegian (1)
Persian (1)
Slovene (1)
Tajik (1)
Turkmen (1)
True (3)
NLP Applications (7)
Human Use (4)
Text Mining (1)
Multilingual (9)
Monolingual (2)
Parallel (3)
Comparable (2)
Written Language (7)
Text/xml (1)
News (1)
Wikipedia (1)
Accounting (1)
Animal product (1)
Land transport (1)
Law_politics (1)
Prices (1)
Science (1)
Transport policy (1)
European Union (1)
Brazil (2)
Modern (1453-) (2)
11 Language Resources
ACCURAT balanced test corpus for under resourced languages
0
277
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
0
259
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
0
271
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
Bulgarian-X language Parallel Corpus
0
263
- Albanian
- Arabic
- Armenian
- Azerbaijani
- Basque
- Bosnian
- Bulgarian
- Catalan; Valencian
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- Galician
- Georgian
- German
- Greek
- Hebrew
- Hungarian
- Icelandic
- Irish
- Italian
- Japanese
- Kazakh
- Kirghiz; Kyrgyz
- Latvian
- Lithuanian
- Macedonian
- Maltese
- Mongolian
- Norwegian
- Polish
- Portuguese
- Romanian
- Russian
- Serbian
- Slovak
- Slovene
- Spanish
- Swedish
- Tajik
- Turkish
- Turkmen
- Ukrainian
Collins Multilingual database (MLD) – PhraseBank with audio files
0
192
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Finnish
- French
- German
- Greek
- Hindi
- Italian
- Japanese
- Korean
- Norwegian
- Persian
- Polish
- Portuguese
- Russian
- Spanish
- Swedish
- Thai
- Turkish
- Vietnamese
Collins Multilingual database (MLD) – WordBank with audio files
0
169
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Finnish
- French
- German
- Greek
- Italian
- Japanese
- Korean
- Norwegian
- Polish
- Portuguese
- Russian
- Spanish
- Swedish
- Thai
- Turkish
- Vietnamese
EuroTermBank
0
137
- Basque
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- German
- Greek
- Hungarian
- Italian
- Latin
- Latvian
- Lithuanian
- Maltese
- Norwegian
- Polish
- Portuguese
- Romanian
- Russian
- Slovak
- Slovenian
- Spanish
- Swedish
EUROVOC thesaurus (v4.2)
0
167
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- German
- Greek
- Hungarian
- Italian
- Latvian
- Lithuanian
- Maltese
- Polish
- Portuguese
- Romanian
- Slovak
- Slovenian
- Spanish
- Swedish
Microsoft Terminology Collection
0
194
- Basque
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- German
- Greek
- Hungarian
- Italian
- Latvian
- Lithuanian
- Maltese
- Polish
- Portuguese
- Romanian
- Russian
- Slovak
- Slovenian
- Spanish
- Swedish
PELCRA multilingual parallel corpora (CC-BY)
0
236
- Arabic
- Belarusian
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- German
- Greek
- Hungarian
- Icelandic
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Norwegian
- Polish
- Portuguese
- Romanian
- Russian
- Slovak
- Slovenian
- Spanish
- Swedish
- Turkish
- Ukrainian
South-East European Parallel Corpus
0
183
- Albanian
- Bosnian
- Bulgarian
- Croatian
- English
- Greek
- Macedonian
- Romanian
- Serbian
- Turkish