Hungarian MTBA

171 Last view: 2026-04-25

hu-MTBA

http://alpha.tmit.bme.hu/speech/hdbMTBA.php

ID:

207 Hungarian MTBA is issued from a project for the creation of the fixed line and mobil telephone voices based Hungarian speech database.
The goal of the project was collecting speech telephone database, in which some major dialectal variants are represented. This database provided a realistic base both for the training and testing of the present-day teleservices, and - because of the phonetically richness - the training of real speaker independent speech recognizers. The database contains records based on the definition in SpeechDatE for the dialectical, age and sex balance and vocabulary. Important and different from the SpeechDatE database is, that the phonetically rich sentences and words have been segmented and labelled at phoneme level. Thus the database gives possibility to train phoneme based recognizers. During planning the corpus, we took into consideration not only the variety of the dialectical aspects, but the special characteristics of Hungarian language too. Since the Hungarian is an agglutinative language, we needed to create a larger vocabulary in some categories, than it was mandatory. We tried to pay an extra attention to the topic 'phonetically rich sentences and words', to create a phonetically well balanced speech database for text independent speech recognizers. A detailed statistical analysis was prepared to examine the statistics of phonemes, diphones, triphones and syllables.

You don’t have the permission to edit this resource.

DistributionAvailability

Available - Restricted Use

Start date: 07/02/2012

Licence

MS - C - No ReD - ND - FF

Restrictions: No Redistribution

Fee: 6,500 EUR

Distribution Access/Medium: CD - ROM

Attribution Details: In case of interest, please contact the IPR-holder specified below.

IPR Holder

Klára Vicsi

Contact Person

Klára Vicsi

text
audio

Monolingual text corpusLanguages

Hungarian

Linguality

Linguality type: Monolingual

Size

5 Hours

Creation

Creation mode: Manual

Original Sources

research

Monolingual audio corpusLanguages

Hungarian (5 Hours)

Linguality

Linguality type: Monolingual

Size

5 Hours

AnnotationSegmentation

Annotated elements: Speaker Noise

Segmentation level: Phoneme

Annotation Mode: Manual (annotation based on listening)

Annotation Tools:

Self developed annotator tool

Start date: 01/01/2002

End date: 12/31/2003

Content

Speech items: Isolated Words, Natural Numbers, Phonetically Rich Sentences

Noise Level: Medium

Audio Formatswave/audio (5 Hours)

Compression: False

Recording quality: High

Quantization: 16

Number of tracks: 1

Sampling rate: 8000

Signal encoding: LinearPCM

CapturePerson SourceSet

Origin of persons: Native

Age of persons: Adult

Sex of persons: Mixed

Number of persons: 500

Dialect accent of persons: varied, balanced

Age range end: 99

Hearing impairment of persons: No

Number of trained speakers: 0

Age range start: 18

Speaking impairment of persons: No

CreationOriginal Sources

corpora (phonetically rich + numbers)

Resource Creation

Resource Creator

Klára Vicsi

Creation lasted: 01/01/2001 - 12/31/2003

Metadata

Created: 07/02/2012

Last Updated: 07/02/2012

Source: CESAR

Metadata Creator

György Szaszák

Version

Version: 1.0

Last Updated: 12/31/2003

Usage

Foreseen UseNlp Applications

Use NLP Specific: Person Recognition, Speech Analysis, Speech Recognition, Spoken Dialogue Systems

Human UseActual Use - Human Use

Use NLP Specific: Spoken Dialogue Systems

People who looked at this resource also viewed the following:

Resources from the same creators