222 Language Resources (Page 1 of 12)
ACCURAT balanced test corpus for under resourced languages
0
276
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
0
258
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
0
271
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ARCADE II Evaluation Package
0
283
- Arabic
- Chinese
- English
- French
- German
- Greek, Modern (1453-)
- Italian
- Japanese
- Persian
- Russian
- Spanish
Art Lexicon: Painting, Sculpture, Graphics, Architecture and Industrial Artist in Estonian, English, French, German and Swedish
0
197
- English
- Estonian
- French
- German
Bilingual term pairs extracted from comparable news feeds resources using the TaaS Bilingual Term Extraction System.
0
100
- English
- German
- Latvian
Bilingual term pairs extracted from comparable Web resources using the TaaS Bilingual Term Extraction System
0
181
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Latvian
- Lithuanian
- Polish
- Portuguese
- Romanian
- Russian
- Slovak
- Slovenian
- Spanish
- Swedish
Bilingual term pairs extracted from Wikipedia using the TaaS Bilingual Term Extraction System
0
136
- Bulgarian
- Croatian
- Danish
- English
- Estonian
- Greek, Modern (1453-)
- Irish
- Latvian
- Lithuanian
- Maltese
- Romanian
- Slovak
- Slovenian
Bulgarian-X language Parallel Corpus
0
263
- Albanian
- Arabic
- Armenian
- Azerbaijani
- Basque
- Bosnian
- Bulgarian
- Catalan; Valencian
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- Galician
- Georgian
- German
- Greek
- Hebrew
- Hungarian
- Icelandic
- Irish
- Italian
- Japanese
- Kazakh
- Kirghiz; Kyrgyz
- Latvian
- Lithuanian
- Macedonian
- Maltese
- Mongolian
- Norwegian
- Polish
- Portuguese
- Romanian
- Russian
- Serbian
- Slovak
- Slovene
- Spanish
- Swedish
- Tajik
- Turkish
- Turkmen
- Ukrainian
CESAR Aligned Wikipedia Headwords List
0
202
- Bulgarian
- Croatian
- English
- Hungarian
- Polish
- Serbian
- Slovakian
CLEF AdHoc-News Test Suites (2004-2008) – Evaluation Package
0
232
- Bulgarian
- Czech
- Dutch
- English
- Finnish
- French
- German
- Hungarian
- Italian
- Persian
- Portuguese
- Russian
- Spanish
- Swedish