INTERA Corpus - the Slovene SVEZ ACQUIS POS annotated part of the EN-SL SVEZ ACQUIS Corpus

121 Last view: 2026-06-11

INTERA Corpus - the Slovene SVEZ ACQUIS POS annotated part of the EN-SL SVEZ ACQUIS Corpus

The Slovene SVEZ ACQUIS POS annotated part of the INTERA corpus; written, domain specific (law); (2 MWs); XCES ANA format.

You don’t have the permission to edit this resource.

DistributionAvailability

Available - Restricted Use

Licence

CC - BY - NC

Restrictions: Academic - Non Commercial Use, Attribution

Distribution Access/Medium: Downloadable

Attribution Details: The INTERA Corpus - the Slovene SVEZ ACQUIS POS annotated part of the ILSP/RC Athena licensed under CC-BY-NC as accessed via META-SHARE

Contact Person

Maria Gavrilidou

text

Monolingual text corpusLanguages

Slovene (2,000,000 Words)

Linguality

Linguality type: Monolingual

Text Format

application/x-xces+xml

Size

2,000,000 Words

Character encoding

UTF - 8

Domains

law

Modalities

Written Language

AnnotationMorphosyntactic Annotation - Pos Tagging

StandOff: False

Segmentation level: Word

Format: application/x-xces+xml

Standard practices conformance: XCES

Segmentation

Segmentation level: Sentence

Creation

Creation mode details: web crawling; manual selection; semi-automatic conversion to the desired formats

Creation mode: Mixed

Original Sources

various texts found mainly over the internet

Resource Creation

Creation lasted: 01/01/2003 - 12/31/2004

Funding Project

Integrated European language data Repository Area (INTERA - e-content EDC-22076 INTERA / 27924)

URL: http://www.elda.org/...

Funding Type: Eu Funds

Funder: eContent

Project duration: 01/01/2003 - 12/31/2004

Metadata

Created: 02/02/2012

Last Updated: 01/08/2016

Usage

Foreseen UseNlp Applications

Use NLP Specific: Machine Translation

Actual Use - Nlp Applications

Use NLP Specific: Terminology Extraction

Relation

Related Resource: INTERA corpus

Relation Type: isPartOf

Documentation

Document Type: In Proceedings

Maria Gavrilidou and Penny Labropoulou and Elina Desipri and Voula Giouli et al, Building parallel corpora for eContent professionals, , COLING 2004 , 2004

Book Title: Proceedings of COLING 2004

Document Type: In Proceedings

Maria Gavrilidou and Penny Labropoulou and Stelios Piperidis et al, Language resources production models: the case of INTERA multilingual corpus and terminology, , 5th International Conference on Language Resources and Evaluation (LREC-2006) , 2006

Book Title: Porceedings of the 5th International Conference on Language Resources and Evaluation (LREC-2006)

Document Type: In Proceedings

Maria Gavrilidou and Penny Labropoulou and Monica Monachini and Stelios Piperidis and Claudia Soria, Building Multilingual Terminological Resources, , RANLP 2005 International Workshop on Language and Speech Infrastructure for Information Access in the Balkan Countries , 2005

Book Title: Proceedings of the RANLP 2005 International Workshop on Language and Speech Infrastructure for Information Access in the Balkan Countries

Document Type: Tech Report

Maria Gavrilidou and Voula Giouli and Elina Desipri and Penny Labropoulou and Monica Monachini et al, D5.2 - Report on the multilingual resources production, http://www.elda.org/... , 2004

People who looked at this resource also viewed the following:

Resources from the same project