COST232

Last update: 2013-06-26

http://catalog.elra.info/product_info.php?products_id=39

ID: ELRA-S0009

The COST232 consortium collected a "Multi-English" speech database over the telephone in Europe. The original plan was to collect data only at FUB (Fondazione Ugo Bordoni) in Rome, but a second collection at BT Labs in the UK also proved possible. A total of 797 "successful" calls were collected.
Calls were received in two countries, Italy and the UK, on different types of collection equipment: FUB in Rome used analog lines and BT in the UK used digital ones. Every speaker repeated the same vocabulary, the "TI (Texas Instruments) words", which makes this database unique in many respects.
The vocabulary comprised the name of the speaker's laboratory, the digits ("oh", zero, one, two, three, four, five, six, seven, eight and nine) and the words "yes, no, erase, rubout, stop, start, help, enter, repeat, go". The data was collected from the following countries: Belgium, Czechoslovakia, Denmark, England, Germany, Italy, Norway, Portugal, Slovenia, Spain, Sweden and Switzerland. Each country provided 8 speakers, each of whom made 2 calls from a fixed set and 2 calls from a mobile to each of the Italian and UK collection systems (i.e. a total of 8 calls per speaker). Although the database was intended to aid speech recognition, it is also balanced and can therefore be used for speaker recognition training and testing.
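The collection design described above can be checked with a short sketch. It enumerates the nominal call grid (country, speaker, handset, collection site, repetition) and the fixed part of the vocabulary. All identifiers below are illustrative assumptions, not names used in the database itself, and the nominal count is what the stated design implies, not the actual corpus size: the description reports 797 "successful" calls.

```python
from itertools import product

# Values taken from the resource description; variable names are hypothetical.
COUNTRIES = [
    "Belgium", "Czechoslovakia", "Denmark", "England", "Germany", "Italy",
    "Norway", "Portugal", "Slovenia", "Spain", "Sweden", "Switzerland",
]
SPEAKERS_PER_COUNTRY = 8
HANDSETS = ["fixed", "mobile"]
SITES = ["FUB", "BT"]          # Rome (analog lines) and UK (digital lines)
CALLS_PER_CONDITION = 2        # 2 calls per handset per site

DIGITS = ["oh", "zero", "one", "two", "three", "four",
          "five", "six", "seven", "eight", "nine"]
COMMANDS = ["yes", "no", "erase", "rubout", "stop",
            "start", "help", "enter", "repeat", "go"]

# One tuple per nominal call: (country, speaker index, handset, site, repetition).
calls = list(product(
    COUNTRIES,
    range(SPEAKERS_PER_COUNTRY),
    HANDSETS,
    SITES,
    range(CALLS_PER_CONDITION),
))

calls_per_speaker = len(HANDSETS) * len(SITES) * CALLS_PER_CONDITION
# Fixed vocabulary items, plus one speaker-specific item (the laboratory name).
vocabulary_size = len(DIGITS) + len(COMMANDS) + 1

print(calls_per_speaker)  # 8
print(len(calls))         # 768
print(vocabulary_size)    # 22
```

Keeping handset and site as separate axes reflects the balance noted in the description: every speaker appears under each handset and line-type combination.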

The COST232 consortium recorded a telephone speech database known across Europe as the "Multi-English" database.

The recordings were made on platforms installed in Great Britain (BT Labs) and in Italy (at the Fondazione Ugo Bordoni, Rome). Calls came from more than 12 European countries (France, Belgium, Denmark, Czechoslovakia, Spain, England, ...). Each speaker pronounced the well-known TI (Texas Instruments) vocabulary, which consists of about twenty words (digits plus common words such as yes, no, stop, go, rubout, ...). 8 speakers from each country each made two calls from a fixed telephone and two from a mobile telephone to each of the two platforms (8 calls per speaker in total).

This database is suitable for training and evaluating both speech recognition systems and speaker identification systems.

Distribution

Availability

Available - Restricted Use

Start date: 05/13/1997

Licence

ELRA END USER; Restrictions: Academic - Non Commercial Use; For Non Members of ELRA; User Nature: Commercial

ELRA VAR; Restrictions: Commercial Use; For Members of ELRA; User Nature: Commercial

ELRA END USER; Restrictions: Academic - Non Commercial Use; For Members of ELRA; User Nature: Commercial

ELRA VAR; Restrictions: Commercial Use; For Members of ELRA; User Nature: Academic

ELRA END USER; Restrictions: Academic - Non Commercial Use; For Members of ELRA; User Nature: Academic

ELRA VAR; Restrictions: Commercial Use; For Non Members of ELRA; User Nature: Commercial

ELRA VAR; Restrictions: Commercial Use; For Non Members of ELRA; User Nature: Academic

ELRA END USER; Restrictions: Academic - Non Commercial Use; For Non Members of ELRA; User Nature: Academic

Contact Person

Mapelli Valérie

audio

Monolingual audio corpus

Languages

English

Linguality

Linguality type: Monolingual

Size

no size available

Resource Creation

Funding Project

COST232

Funding Type: EU Funds

Metadata

Created: 05/12/2005

Version

Version: 1.0

Last Updated: 05/23/2012
