177 Language Resources (Page 1 of 9)
ACCURAT balanced test corpus for under resourced languages
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
Bulgarian-X language Parallel Corpus
- Albanian
- Arabic
- Armenian
- Azerbaijani
- Basque
- Bosnian
- Bulgarian
- Catalan; Valencian
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- Galician
- Georgian
- German
- Greek
- Hebrew
- Hungarian
- Icelandic
- Irish
- Italian
- Japanese
- Kazakh
- Kirghiz; Kyrgyz
- Latvian
- Lithuanian
- Macedonian
- Maltese
- Mongolian
- Norwegian
- Polish
- Portuguese
- Romanian
- Russian
- Serbian
- Slovak
- Slovene
- Spanish
- Swedish
- Tajik
- Turkish
- Turkmen
- Ukrainian