Filter by:
Corpus (48)
Written Language (48)
Spoken Language (1)
Finnish (26)
English (15)
Estonian (10)
German (9)
Swedish (8)
French (7)
Latvian (6)
Hungarian (5)
Lithuanian (5)
Romanian (5)
Croatian (4)
Italian (4)
Polish (4)
Serbian (4)
Slovenian (4)
Czech (3)
Danish (3)
Greek (3)
Portuguese (2)
Russian (2)
Bulgarian (1)
Dutch; Flemish (1)
Eastern Mari (1)
Erzya (1)
Finland Swedish (1)
Hill Mari (1)
Ingrian (1)
Khanty (1)
Kildin Sami (1)
Mansi (1)
Moksha (1)
Selkup (1)
Slovak (1)
Spanish (1)
Swahili (1)
Ter Sami (1)
Tundra Nenets (1)
Turkish (1)
Veps (1)
True (5)
Nlp Applications (11)
Human Use (5)
Text/plain (1)
Text/xml (1)
Literature (2)
News (1)
Renewable energy (1)
Wikipedia (1)
General (1)
Law_politics (1)
News (1)
Science (1)
Finland (3)
Ekavian (4)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
48 Language Resources (Page 1 of 3)
« Previous | Next »Order by:
ACCURAT balanced test corpus for under resourced languages
0
277
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
0
259
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
0
271
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
Europarl Parallel Corpus
0
188
- Bulgarian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Latvian
- Lithuanian
- Polish
- Portuguese
- Romanian
- Slovak
- Slovenian
- Spanish
- Swedish
Fenno-ugrica, Kielipankki Version
0
107
- Eastern Mari
- Erzya
- Hill Mari
- Ingrian
- Khanty
- Mansi
- Moksha
- Selkup
- Tundra Nenets
- Veps
« Previous | Next »