Experiments were carried out to validate our
algorithm by evaluating the accuracy of detection in comparison
with well-known cepstrum.
A database of every speech file and reference
contour are
constructed from the ATR Speech Database.
All signals were digitized at
kHz sampling rate and
analyzed with Hamming window where frame length and shift were 64 ms and
10 ms, respectively.
The initial number of the tied-GMMs was set to
and the frequency range was
from
Hz to
Hz, and
was assigned to
.
Speech files begin with `myi-' and `fym-' stand for speech signals of a
male and a female speakers.
Deviations over
from the references were deemed as gross errors.
Every accuracy shown in table 1, 2 and 3 is a percentage of frames at which
s are correctly detected.