17 Language Resources
ACCURAT balanced test corpus for under resourced languages
0
277
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
0
259
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
0
271
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
English-Estonian cross-linked collection of comparable sentences from Wikipedia
0
99
- English
- Estonian
English-Lithuanian cross-linked collection of comparable sentences from Wikipedia
0
94
- English
- Lithuanian
Fenno-ugrica, Kielipankki Version
0
107
- Eastern Mari
- Erzya
- Hill Mari
- Ingrian
- Khanty
- Mansi
- Moksha
- Selkup
- Tundra Nenets
- Veps
Latvian-Lithuanian cross-linked collection of comparable sentences from Wikipedia
0
93
- Latvian
- Lithuanian
Opus, Helsinki Korp Version
0
128
- Czech
- Danish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
- Turkish
PELCRA multilingual parallel corpora (CC-BY)
0
236
- Arabic
- Belarusian
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- German
- Greek
- Hungarian
- Icelandic
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Norwegian
- Polish
- Portuguese
- Romanian
- Russian
- Slovak
- Slovenian
- Spanish
- Swedish
- Turkish
- Ukrainian
The Helsinki Korp Europarl Bilingual Corpora
0
89
- English
- Estonian
- Finnish
- French
- German
- Spanish; Castilian
- Swedish
The Helsinki Korp JRC-Acquis Bilingual Parallel Corpora
0
62
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Italian
- Polish
- Spanish; Castilian
- Swedish