958 Language Resources (Page 1 of 48)
2006 CoNLL Shared Task - Ten Languages
0
334
- Bulgarian
- Danish
- Dutch
- German
- Japanese
- Portuguese
- Slovenian
- Spanish
- Swedish
- Turkish
ACCURAT balanced test corpus for under resourced languages
0
277
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
0
259
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
0
271
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian