Filter by:
Corpus (1485)
Text/xml (141)
Text/plain (14)
Praat Text Grid (8)
Gr AF (4)
SAMPA (4)
Text/csv (4)
Plain txt (3)
Tmx (3)
XML (2)
Orthography (2)
Othography (2)
Plain text / xml (2)
Text (2)
Txt/xml (2)
.mlf (1)
.trs (1)
Co NLL (1)
ELAN (.eaf) (1)
PRAAT/ Text Grid (1)
SAM V4.1 (1)
TIPSTER (1)
TMX (1)
Text (1)
Aligned (1)
Eaf (1)
Other (1)
Parallel corpora (1)
Txt / cqpweb (1)
English (371)
German (164)
Finnish (153)
French (140)
Swedish (132)
Spanish (102)
Portuguese (95)
Estonian (85)
Chinese (83)
Italian (72)
Russian (55)
Arabic (54)
Hungarian (54)
Polish (43)
Bulgarian (40)
Danish (40)
Spanish; Castilian (40)
Czech (39)
Japanese (34)
Romanian (34)
Turkish (31)
Basque (30)
Slovak (30)
Finland Swedish (25)
Croatian (24)
Latvian (24)
Maltese (22)
Dutch; Flemish (21)
Korean (20)
Serbian (18)
Dutch (17)
Catalan (14)
Icelandic (14)
Slovenian (13)
Lithuanian (12)
Thai (12)
Norwegian (10)
Persian (9)
Erzya (8)
Greek (8)
Hindi (8)
Vietnamese (8)
Northern Sami (7)
Pushto (7)
Swahili (7)
Chech (6)
Galician (6)
Latin (6)
Moksha (6)
Sign Languages (6)
Albanian (5)
Tamil (5)
Bengali (4)
Hebrew (4)
Ingrian (4)
Khanty (4)
Macedonian (4)
Tundra Nenets (4)
Ukrainian (4)
Urdu (4)
Irish (3)
Karelian (3)
Kildin Sami (3)
Komi Zyrian (3)
Kurdish (3)
Ludian (3)
Nepali (3)
Panjabi (3)
Slovene (3)
Tajik (3)
Udmurt (3)
Uzbek (3)
Amharic (2)
Avaric (2)
Bosnian (2)
Chukchi (2)
Chuvash (2)
Eastern Mari (2)
Even (2)
Evenki (2)
Gujarati (2)
Hausa (2)
Hill Mari (2)
Inari Sami (2)
Indonesian (2)
Kalmyk; Oirat (2)
Koryak (2)
Lak (2)
Malayalam (2)
Mansi (2)
Mongolian (2)
No Language (2)
Available - Restricted Use (1222)
Under Negotiation (38)
ELRA_END_USER (546)
ELRA_VAR (470)
CC - BY (214)
Other (163)
MS - NC - No Re D (86)
Under Negotiation (77)
CC - BY - NC - SA (57)
CLARIN_RES (56)
CC - BY - SA (54)
ELRA_EVALUATION (46)
CC - BY - NC (39)
CLARIN_ACA - NC (36)
Proprietary (34)
MS - C - No Re D (21)
CLARIN_ACA (18)
GPL (13)
CC - BY - NC - ND (10)
CC - BY - ND (5)
GFDL (5)
BSD - Style (4)
CC - ZERO (4)
CLARIN_PUB (2)
MS Commons - BY (2)
AGPL (1)
LGPL (1)
Commercial Use (499)
Attribution (331)
Other (160)
No Redistribution (130)
Share Alike (67)
Evaluation Use (46)
No Derivatives (30)
Inform Licensor (29)
Redeposit (15)
Only M Smembers (4)
Nlp Applications (189)
Human Use (90)
Machine Translation (67)
Linguistic Research (47)
Speech Analysis (15)
Speech Recognition (13)
Pos Tagging (12)
Parsing (10)
Speech Synthesis (9)
Lexicon Access (7)
Other (7)
Lemmatization (5)
Text Mining (5)
Annotation (4)
Event Extraction (3)
Face Recognition (1)
Semantic Web (1)
Summarisation (1)
Text Generation (1)
Web Services (1)
Written Language (274)
Spoken Language (95)
Voice (45)
Body Gesture (34)
Facial Expression (31)
Sign Language (20)
Other (9)
Text/xml (61)
Plain text (29)
Text/plain (29)
Wave/audio (16)
Audio/wav (10)
Text (10)
Wav (8)
Xml (6)
Audio (3)
Audio/ PCMA (3)
Audio/x-wav (3)
Video/mpeg (3)
Video/x-msvideo (3)
WAV (2)
Audio/mp3 (2)
Video/mp4 (2)
Audio/wav (1)
MS Word (1)
US- ASCII (1)
XML (1)
Application/pdf (1)
Audio/flac (1)
Audio/speex (1)
Audio/vorbis (1)
Mp3 (1)
Plain/text (1)
Sgml (1)
Text/csv (1)
Text/txt (1)
Txt (1)
Txt/xml (1)
Video/mp2t (1)
TEI (27)
XCES (25)
TEI_P5 (21)
Other (16)
TMX (7)
EAGLES (3)
MULTEXT (2)
Prague Treebank (2)
EML (1)
MUMIN (1)
Penn Tree Bank (1)
Time ML (1)
Law (26)
Environment (20)
General (19)
Education (18)
Health (16)
Law_politics (14)
News (13)
Science (13)
Literature (8)
Politics (8)
Tourism (8)
Novels (7)
Test (7)
Medicine (6)
Finance (5)
Laptop reviews (5)
Computer science (4)
Economy (4)
Pharma (3)
Fiction (3)
History (3)
Movies (3)
News (3)
Society (3)
Travel (3)
General (2)
Political (2)
Blog (2)
Business (2)
Camera (2)
Entertainment (2)
Forum (2)
Geography (2)
Government (2)
Humanities (2)
Informative (2)
Leisure (2)
Periodicals (2)
Religion (2)
Technology (2)
Unknown (2)
Weather report (2)
Automotive (1)
Environment (1)
Europarl (1)
Fiction (1)
General language (1)
IT (1)
Legal news (1)
Medical History (1)
Politics (1)
Racial discourse (1)
Renewable energy (1)
Science (1)
Wikipedia (1)
Agriculture (1)
Construction (1)
Economics (1)
Everyday scenes (1)
Laws (1)
Legal (1)
Nanotechnology (1)
Physics (1)
Portugal (16)
Iceland (7)
Finland (4)
Poland (4)
Helsinki (3)
Europe, Asia (2)
European Union (2)
Is (2)
Brasil (1)
Brazil (1)
Espoo, Finland (1)
Europe (1)
Greece (1)
IS (1)
Karelia (1)
Mozambique (1)
Scotland (1)
Thrace (1)
UK (1)
Vantaa, Finland (1)
English (1)
Portuguese (1)
Pt (1)
1996-2011 (9)
2003-2012 (3)
1970 - 2002 (2)
2000-2008 (2)
2001-2015 (2)
2003 (2)
2012-2014 (2)
Early 1990s (2)
1410-1681 (1)
1540-1750 (1)
1543-1810 (1)
1564-1939 (1)
16.-18. century (1)
1726-1912 (1)
1770-1949 (1)
1770-2011 (1)
1785 (1)
1800-2000 (1)
1809-1899 (1)
1810-1940 (1)
1820-2000 (1)
1840 - 2013 (1)
1844-2000 (1)
1855-1871 (1)
1880-1949 (1)
1895-1909 (1)
1918-2011 (1)
1920-1939 (1)
1934-1935 (1)
1935–2007 (1)
1958-2006 (1)
1967-2008 (1)
1970 to 2001 (1)
1970-1974 (1)
1970-1975 (1)
1970-1989 (1)
1970-2001 (1)
1970-2002 (1)
1972-2013 (1)
1978-2000 (1)
1980-1990 (1)
1981-1990 (1)
1986 (1)
1986 - 1987 (1)
1986-1994 (1)
1987-2000 (1)
1989-1998 (1)
1989-2007 (1)
1990-2015 (1)
1993-2012 (1)
1995-2003 (1)
1996-1997 (1)
2000-2010 (1)
2001-2005 (1)
2001-2014 (1)
2002-2003 (1)
2003-2011 (1)
2003-2015 (1)
2004-2005 (1)
2004-2009 (1)
2004-2011 (1)
2004-2012 (1)
2005, 2010 (1)
2005-2010 (1)
2005-2012 (1)
2006-2015 (1)
2006-2016 (1)
2007-2009 (1)
2008-2010 (1)
2008-2014 (1)
2009-2012 (1)
2011-2012 (1)
2011-2014 (1)
2013 (1)
2013- (1)
2013-2014 (1)
771 - 1884 (1)
Years 2010-2011 (1)
After 1990 (1)
Ca. 730–1710 (1)
Until 2006 (1)
Castilian (71)
Flemish (10)
Valencian (10)
Legalese (9)
Ekavian (5)
Brazil (3)
Punjabi (3)
Mexico (2)
Modern (1453-) (2)
Venezuela (2)
Newspaper (2)
American English (1)
American Finnish (1)
Australian (1)
Costa Rica (1)
Finland Swedish (1)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
1484 Language Resources (Page 1 of 75)
« Previous | Next »Order by:
2006 CoNLL Shared Task - Ten Languages
0
333
- Bulgarian
- Danish
- Dutch
- German
- Japanese
- Portuguese
- Slovenian
- Spanish
- Swedish
- Turkish
ACCURAT balanced test corpus for under resourced languages
0
276
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
0
258
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
0
271
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
« Previous | Next »