- Open Access
- Total Downloads : 266
- Authors : Manasi R.Dixit, Shaila D. Apte
- Paper ID : IJERTV2IS50781
- Volume & Issue : Volume 02, Issue 05 (May 2013)
- Published (First Online): 22-05-2013
- ISSN (Online) : 2278-0181
- Publisher Name : IJERT
- License: This work is licensed under a Creative Commons Attribution 4.0 International License
Speech Evaluation of Cleft Palate – Velopharyngeal Dysfunction Affected
Manasi R.Dixit1
1 Department of Electronics K.I.T.s College of Engineering, Kolhapur,416008
Maharashtra,India
Abstract
Communication disorders are very complicated in individuals with cleft lip and/ or palate. In this paper , the pathological cases of speech disabled children affected with cleft type characteristics (CTCs) and Velopharyngeal dysfunction (VPD) are analyzed. The speech signal samples of children of age between three to eight years are considered for the present study. These speech signals are digitized and are used to determine the pathologic speech characteristics. This analysis is conducted on speech data samples which are concerned with both place of articulation and manner of articulation. The speech disability of pathological subjects was estimated using results of above analysis.
Keywords
CTC,VPD, Hypernasality ,Hyponasality ,Audible nasal turbulence ,Consonant production errors Speech intelligibility, nasality severity index –NSI, Speech- recognition, spectrograms, pitch-formant analysis
-
Introduction
Children with a cleft lip or palate have difficulties controlling the muscles of the soft palate leading to impaired speech and language development. The disorders of the Velopharyngeal valve are known as Velopharyngeal dysfunction (VPD). It includes three sub terms. Velopharyngeal insufficiency occurs in children with a history of cleft palate or sub mucous cleft, who have short or otherwise abnormal vela. Velopharyngeal inadequacy is defective closure of the Velopharyngeal valve due to its lack of speed and
Shaila D. Apte2 2Department of Electronics and Communication Engineering,
Rajarshi Shahu College of Engineering, Pune
Maharashtra,India
precision. It is caused by a neurologic disorder or injury (e.g. cerebral palsy or traumatic brain injury).
Velopharyngeal mislearning is observed in normal children . If the child has never learnt how to use the valve correctly then it leads to hyper nasal speech. The child cannot produce oral sounds (vowels and consonants) correctly. Only the nasal sounds can be be correctly produced. Speech Intelligibility requires the ability to close the nasal cavity, as all English sounds, except "m", "n", and "ng", have airflow only through the oral cavity. The nasality severity index (NSI) is an objective measurement of hyper nasality based on a multiparameter approach. The multiparameter approach consists of the nasalance, the nasality, and aerodynamic capacities. A new method based on pitch correlation, formant analysis and NSI evaluation is developed for the analysis of perceptual speech. In this present work, the speech data utterances by children of age group of 3 to 8 years were recorded and digitized. The digitized signal was further processed by using a MATLAB platform. The speech data was analysed to check the disorders due to pharyngeal articulation, glottal articulation, active nasal fricatives, double articulation
,weak and or nasalized consonants, nasal realization of plosives, gliding of fricatives/affricates. The acoustic analysis (e.g., spectrography, the oral-nasal acoustic ratio) is conducted on speech data. Perceptual assessment of speech is done so as to get important information regarding articulation, resonance, voice and speech intelligibility. Perceptual assessment provides important information regarding Misarticulation, resonance, voice and speech intelligibility, In this present work ,it is observed that the speech disabled children produce Compensatory or Misarticulation leading to nasal air emission or hypernasality that is phoneme-specific.
-
Methodology
The present work is based on study of children with Marathi mother tongue. The speech data of normal subjects/children and pathological subjects/children of the same age group between 5 to 8 years is collected. The children were trained to utter similar words before recording. The speech data consists of isolated words, connected words , fast uttered sentences and songs for eg. Prarthana-School-Prayer, National anthem and Pledge ,Nursery Rhymes
,famous film songs etc. The speech data was recorded using Sony Intelligent Portable Ocular Device (IPOD) in digital form. The recording was carried out in a pleasant atmosphere and maintaining the children in tension-stress free environment. The recorded signal is transformed into
.wav file by using GOLDWAVE software. The data was collected at Chetana Vikas Mandir, a special school established to educate Mentally Retarded children as well as children with various disorders. It is located at Kolhapur, India.
The Formant analysis was carried out for particular isolated words. The utterances made by 20 normal subjects was analyzed and reference /threshold level was considered for each phoneme. Various Misarticulation cases were analyzed in case of pathological subjects. The spectrograms were studied for Formant analysis. Fast uttered words or continuous sentences exhibit greater complexities with respect to speech intelligibility.
The nasality severity index (NSI) is also calculated in case of pathological speech data. The isolated word speech data emphasised in the present work is described below. It indicates case studies of different types of Articulation errors with respect to both place of articulation and manner of articulation.
The isolated word speech data is given in table-1.
Table-1. List of specific place of articulation and manner of articulation
alveolar
fricative
s,z
alveolar
approximant
r/l
5
post- alveolar
fricative
sh,zh
post- alveolar
affricate
ch,j
post- alveolar
approximant
y
6
velar
plosive
kg
velar
nasal
ng
7
glottal
fricative
h
-
DETERMINATION OF FUNDAMENTAL FREQUENCY F0 AND VARIATIONS OF F0
TABLE-2. F0 and variations of f0 for different speakers
Sr. No
.
SUBJE CT
F0
VARIATI ONS OF F0
REMARKS
1
Speake r 1
255
Hz
Min 204-
318max Hz
Female-Normal
2
Speake r 2
195
Hz
Min 150-
320max Hz
Male-Pathologic- VPD, Hyper nasality
3
Speake r 3
180
Hz
Min 104-
300max Hz
Male -Pathologic-
,hyper nasality, weak consonants
4
Speake r 4
280
Hz
Min 210-
325max Hz
Female – Pathologic – gliding of fricatives
5
Speake r 5
290
Hz
Min 204-
352max Hz
Female – Pathologic -active nasal fricatives
6
Speake r 6
185
Hz
Min 100-
312max Hz
Male -Pathologic- hype nasality
7
Speake r 7
198
Hz
Min 104-
328max Hz
Male-Pathologic- only nasal consonants
8
Speake r 8
170
Hz
Min 140-
290max Hz
Male-Normal with nasality shown
9
Speake r 9
205
Hz
Min 170-
338max Hz
Male -Pathologic
No alveolar approximant (r/l)
Sr. No
.
SUBJE CT
F0
VARIATI ONS OF F0
REMARKS
1
Speake r 1
255
Hz
Min 204-
318max Hz
Female-Normal
2
Speake r 2
195
Hz
Min 150-
320max Hz
Male-Pathologic- VPD, Hyper nasality
3
Speake r 3
180
Hz
Min 104-
300max Hz
Male -Pathologic-
,hyper nasality, weak consonants
4
Speake r 4
280
Hz
Min 210-
325max Hz
Female – Pathologic – gliding of fricatives
5
Speake r 5
290
Hz
Min 204-
352max Hz
Female – Pathologic -active nasal fricatives
6
Speake r 6
185
Hz
Min 100-
312max Hz
Male -Pathologic- hyper nasality
7
Speake r 7
198
Hz
Min 104-
328max Hz
Male-Pathologic- only nasal consonants
8
Speake r 8
170
Hz
Min 140-
290max Hz
Male-Normal with nasality shown
9
Speake r 9
205
Hz
Min 170-
338max Hz
Male -Pathologic
No alveolar approximant (r/l)
Sr.
No.
Place of articulation
Manner of articulation
English Letters
1
bilabial
plosive
p,b
bilabial
nasal
m
bilabial
approximant
w
2
labio- dental
fricative
f,v
3
dental
fricative
th,th
4
alveolar
plosive
t,d
alveolar
nasal
n
or alveolar plosives(t/d)
10
Speake r 10
175
Hz
Min 120-
320max Hz
Male-Normal with nasality shown
411max Hz
Hz
652max Hz
or alveolar plosives(t/d)
10
Speake r 10
175
Hz
Min 120-
320max Hz
Male-Normal with nasality shown
411max Hz
Hz
652max Hz
Table-3. F1 AND F2 FOR DIFFERENT SUBJECTS
-
Conclusion
The pathological subjects affected with CTC-VPD exhibit very weak consonant production.
Sl No
.
SUBJE CT
f1
VARIA TIONS OF f2
f2
VARIATIO NS OF f2
1
Speaker 1
515
Hz
Min 440-
570max Hz
778
Hz
Min 700-
842max Hz
2
Speaker 2
400
Hz
Min 340-
490max Hz
590
Hz
Min 505-
670max Hz
3
Speaker 3
368
Hz
Min 310-
420max Hz
560
Hz
Min 496-
642max Hz
4
Speaker 4
562
Hz
Min 500-
630max Hz
844
Hz
Min 740-
910max Hz
5
Speaker 5
582
Hz
Min 460-
673max Hz
872
Hz
Min 740-
922max Hz
6
Speaker 6
372
Hz
Min 290-
470max Hz
558
Hz
Min 495-
652max Hz
7
Speaker 7
398
Hz
Min 295-
493max Hz
595
Hz
Min 500-
684max Hz
8
Speaker 8
342
Hz
Min 300-
395max Hz
512
Hz
Min 453-
585max Hz
9
Speaker 9
414
Hz
Min 368-
497max Hz
622
Hz
Min 560-
714max Hz
10
Speaker 10
354
Hz
Min 278-
528
Min 470-
Sl No
.
SUBJE CT
f1
VARIA TIONS OF f2
f2
VARIATIO NS OF f2
1
Speaker 1
515
Hz
Min 440-
570max Hz
778
Hz
Min 700-
842max Hz
2
Speaker 2
400
Hz
Min 340-
490max Hz
590
Hz
Min 505-
670max Hz
3
Speaker 3
368
Hz
Min 310-
420max Hz
560
Hz
Min 496-
642max Hz
4
Speaker 4
562
Hz
Min 500-
630max Hz
844
Hz
Min 740-
910max Hz
5
Speaker 5
582
Hz
Min 460-
673max Hz
872
Hz
Min 740-
922max Hz
6
Speaker 6
372
Hz
Min 290-
470max Hz
558
Hz
Min 495-
652max Hz
7
Speaker 7
398
Hz
Min 295-
493max Hz
595
Hz
Min 500-
684max Hz
8
Speaker 8
342
Hz
Min 300-
395max Hz
512
Hz
Min 453-
585max Hz
9
Speaker 9
414
Hz
Min 368-
497max Hz
622
Hz
Min 560-
714max Hz
10
Speaker 10
354
Hz
Min 278-
528
Min 470-
Various types of misartculation errors occur in different subjects. Very strong nasal consonants follow almost every phoneme uttered. Utterances of some of the plosives and alveolar approximants were not possible in case of some of the subjects. The Formants were seen to be widely spread in pathological subjects.
-
References
-
Marcelo de Oliveira Rosa*, José Carlos Pereira, and Marcos Grellet Adaptive Estimation of Residue Signal for Voice Pathology Diagnosis IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 47, NO. 1,
JANUARY 2000
-
Darush Mehta Aspiration Noise During Phonation: Synthesis, Analysis, And Pitch -Scale Modification MASSACHUSETTS INSTITUTE OF TECHNOLOGY February 2006
-
Lingyun Gu, John G. Harris,et.al. Disordered Speech Assessment Using Automatic Methods Based on Quantitative Measures EURASIP Journal on Applied Signal Processing 2005:9, 14001409 c_ 2005 Hindawi Publishing Corporation
-
Oscar Saz,1 Javier Sim´ on,2 W.-Ricardo Rodr´guez,1 Eduardo Lleida,1 and Carlos Vaquero1 Research Article Analysis of Acoustic Features in Speakers with Cognitive Disorders and Speech Impairments Hindawi Publishing Corporation EURASIP Journal on Advances in Signal Processing Volume 2009, Article ID 159234, 11 pages doi:10.1155/2009/159234
-
Juan Ignacio Godino-Llorente,1 Pedro G´omez-Vilda (EURASIPMember),2 and Tan Lee3 Editorial
Analysis and Signal Processing of Oesophageal and Pathological Voices Hindawi Publishing Corporation EURASIP Journal on Advances in Signal Processing Volume 2009, Article ID 283504, 4 pages doi:10.1155/2009/283504
-
CatherineMiddag,1 Jean-PierreMartens,1 Gwen Van Nuffelen,2 andMarc De Bodt2 Research ArticleAutomated Intelligibility Assessment of Pathological
Speech Using Phonological Features Hindawi Publishing Corporation EURASIP Journal on Advances in Signal Processing Volume 2009, Article ID 629030, 9 pages doi:10.1155/2009/629030