Speech intelligibility tests for the Turkish language

Speech intelligibility tests for the Turkish language Melanie A. Zokoll1, Sabine Hochmuth1, Dilek Fidan2, Kirsten C. Wagener3, İclâl Ergenç4, Birger K...
Author: Asher Dixon
25 downloads 0 Views 371KB Size
Speech intelligibility tests for the Turkish language Melanie A. Zokoll1, Sabine Hochmuth1, Dilek Fidan2, Kirsten C. Wagener3, İclâl Ergenç4, Birger Kollmeier,1,3,5 1

Medical Physics, Institut of Physics, Carl von Ossietzky University Oldenburg, Oldenburg, Germany

2

Department of Turkish Language Teaching, Kocaeli University, Kocaeli, Turkey

3

Hörzentrum Oldenburg GmbH, Oldenburg, Germany

4

Department of Linguistics, Ankara University, Ankara, Turkey

5

HörTech gGmbH, Oldenburg, Germany

Key words: hearing screening, speech-in-noise test, digit triplet test, Matrix test, Turkish Introduction Professional hearing diagnostics and verification of technical hearing aids often fails because of speech barriers: speech intelligibility tests can usually only be used if both, testing person and patient have the same language competence and results are not comparable internationally. If additionally the tests are measured in noise to obtain results in situations close to reality, non-native listeners perform significant worse in comparison to native listeners with the same hearing ability (e.g., Warzybok et al., 2010, van Wijngaard et al., 2002). Multilingual versions of tests provide the possibility to test non-native listeners more exactly. Thus, two projects (HearCom and its partial follow-up HurDig) are concerned with the selection and if necessary the development, optimization and evaluation of internationally comparable speech intelligibility tests for several languages. A main focus is laid on the internationalization of the digit triplet test (Smits et al., 2004, Wagener et al., 2005, 2006) and the Oldenburg sentence test (OLSA, Wagener et al., 1999 a,b,c, see also Hagerman, 1984) as a matrix test. The digit triplet test was developed for hearing screening via telephone by Smits and his colleagues (2004, see also Buschermöhle et al., 2009), but can also be administered via smartphone or Internet. The test comprises the announcement “the digits” and digit triplets (e.g., 1-3-6) presented in a background of noise. The subject’s task is to select the digits heard on a given number pad. During measurement, the signal-to-noise ratio (SNR) between spoken digit triplets and noisy background is adaptively changed according to the performance of the subject and the speech reception threshold (SRT, i.e., the SNR that yields 50 % speech intelligibility) is estimated. The Matrix test is for speech intelligibility diagnostics in noise and also obtains the SRT. It consists of syntactically correct, but semantically unpredictable sentences (e.g., “Kathy ordered nineteen dark desks”). Its strengths are a steep discrimination function, a high number of test lists - which are repeatable, a mean speech tempo, as well as that it can be conducted in noise or quiet and in the open or closed format. In the closed format (word understood are selected from visually depicted response alternatives), the Matrix test allows for measuring a patient’s speech intelligibility in his or her respective mother tongue, even if the executing personal is not competent in that language. Meanwhile both tests are available in several languages (British English, Danish, German, French, Polish, Swedish, Spanish and soon also in Russian and American English). This contribution reports the development and optimization of the Turkish digit triplet and Matrix test as an extension of the existing test battery. For Germany this language is very important since about 2 Million Turkish native speakers live in Germany (status 2009).

Speech material The speech material for the Turkish digit triplet test as well as the Turkish Matrix test was developed by linguists of the University of Ankara and Kocaeli which are also co-authors of this paper. For the digit triplet test, all Turkish digits between zero and nine were selected [sıfır (0), bir (1), iki (2), üç (3), dört (4), beş (5), altı (6), yedi (7), sekiz (8), dokuz (9)]. Six different test lists were prepared. Each list contained 30 digit triplets. In each list a certain digit appeared three times at each position in a triplet (1st, 2nd, and 3rd position). In accordance with the procedure applied for the German digit triplet test, the noise background was generated by 30-fold superimposition of the speech material (i.e., the digits and the announcement “sayılar”). For the Matrix test, a basis list of ten sentences à five words with the same sentence structure was prepared (Tab.1).

The words were selected according to their frequency in the Turkish language. By pseudo-randomly selecting words within the single word groups of this basis list new sentences were composed and combined in 30 test lists à ten sentences each. For the optimization measurements described below, succeeding test lists were combined to ten test lists à 30 sentences. The test-specific noise was generated like for the digit triplet test (see Wagener et al., 1999a, 2003 for more details).

Tab. 1. Basis list of the Turkish Matrix test. 

Optimization of the speech material Testing in noise requires higher precision of the SRT estimation than testing in quiet, since the SRT of normal hearing listeners is raised by the masking noise level and differences in speech intelligibility between normal hearing and hearing impaired become smaller. The speech intelligibility function of a sentence test can be described by the following logistic model function: s50

(1)

SI is the speech intelligibility and s50 the slope of the intelligibility function at the SRT. Homogeneous wordspecific speech intelligibility is supposed to result in a steep intelligibility function of the overall speech material and therefore in an exact estimation of the SRT (Kollmeier 1990). Thus, the optimization of the speech test material targets to homogenize the speech intelligibility of the single words by adequate selection of the material and level adjustments. To obtain the word-specific intelligibility, optimization measurements with twelve normal hearing Turkish native speakers (between 21 and 36 years of age, mean age 28 years) were conducted. All subjects grew up and visited school in Turkey. At time of testing, their residence in Germany was less than five years. Word-specific intelligibility was measured presenting the test lists (six test lists à 30 digit triplets for the digit triplet test, and ten test lists à 30 sentences for the Matrix test, respectively) over headphones (Sennheiser HDA200) at fixed SNRs [range from –2 to –16 dB SNR (step size 2 dB SNR) for the digit triplet test, and 5.5 to –22 dB SNR (step size 2.5 dB SNR) for the Matrix test]. The level of the noise was 65 dB SPL. Responses were gathered with word scoring, i.e., every word was counted separately. The mean word-specific SRT was calculated and the intelligibility of all words was adjusted to this value (either by increasing or decreasing the level). The level adjustments were limited to 3 dB for both tests. For the Matrix test, this was predominantly done to preserve the natural sound impression of the sentences. Matrix test words with a very shallow slope of the word-specific speech intelligibility function were excluded.

Results Turkish digit triplet test The mean digit-specific speech intelligibility function before optimization has a SRT of -10.0 ± 2.2 dB SNR and a slope of 21.9 %/dB. The resulting slope of the overall speech material, estimated according to the probabilistic model of Kollmeier (1990), is 13.9 %/dB. Fig. 1 shows the expected shift in SRT for the different digits independent of their position in the triplet. The optimization procedure resulted in a computed mean SRT of -10.1 ± 0.5 dB SNR, whereas the slope stays the same. According to the probabilistic model, decreasing the standard deviation of the SRTs from 2.2 to 0.5 dB SNR increases the slope of the overall material (s50test) from 13.9 to 21.0 %/dB.

Fig. 1. Intelligibility functions for the different digits independent of their position in the triplet before (left graph) and after  optimization (right graph). 

Turkish Matrix test Fig. 2 shows the distribution of the word-specific SRTs of the speech material before (left graph) and after optimization (right graph). The mean SRT of all words is -9.4 ± 2.6 dB SNR before and -9.3 ± 0.8 dB SNR after optimization. The median slope of the word-specific speech intelligibility function is 17.8 %/dB. The resulting slope of the overall speech material (s50test) could be increased from 11.6 to 16.8 %/dB due to optimization.

Fig. 2. Distribution of the word‐specific SRTs for the speech material of the Matrix test before (left graph) and after  optimization (right graph).  

Discussion The goal of the study was to develop Turkish speech in noise tests that are internationally comparable and multilingual applicable. For both tests, suitable speech material was generated and optimized, i.e., homogenized in intelligibility. This is shown by the decreasing distribution of the SRT values (Fig. 1 and 2) and the steeper slope of the overall material (s50test) in comparison to the non-optimized test material. In comparison to the test-specific speech intelligibility functions obtained for other languages with tests of the same format in headphone measurements (e.g., Wagener et al., 2009, see Fig. 3), the Turkish tests show similar test-specific SRTs and slopes.

Fig. 3. Test‐specific speech intelligibility functions for digit triplet and Matrix tests in several languages. Speech intelligibility  functions of the Turkish Matrix test base on optimization data. For the British English Matrix test, data were obtained with  sentences scoring (i.e., complete sentences were counted as either correct or incorrect). 

Acknowledgements This work has been done in the context of the European Regional Development Fund (ERDF) funded project “Network for multilingual hearing and speech intelligibility diagnostics” (HurDig). Special thanks to Müge Kaya who helped in judging the sound impression of the sentences.

References Buschermöhle  M,  Berg  D,  Pelz  C,  Wagener  KC,  Kollmeier  B  (2009)  Der  Hörtest  per  Telefon  in  Deutschland  –  Zwischenbilanz  nach  einem  halben  Jahr.  DGA  12.  Jahrestagung  Innsbruck  2009,  Tagungs‐CD  ISBN  3‐ 9809869‐9‐3  Hagerman B (1982) Sentences for testing speech intelligibility in noise. Scand Audiol 11, 79‐87  Kollmeier  B  (1990)  Meßmethodik,  Modellierung  und  Verbesserung  der  Verständlichkeit  von  Sprache.  (Phd  thesis) Georg‐August‐Universität Göttingen  Smits C, Kepteyn T S, Houtgast T (2004) Development and validation of an automatic speech‐in‐noise screening  test by telephone. Int J Audiol 43, 15‐28  van Wijngaarden, SJ, Steeneken, HJM, Houtgast, T (2002) Quantifying the intelligibility of speech in noise for  non‐native listeners. J Acoust Soc Am 112(6), 1906‐1916  Wagener  KC,  Bräcker  T,  Brand  T,  Kollmeier  B  (2006)  Evaluation  des  Ziffern‐Tripel‐Test  über  Kopfhörer  und  Telefon. DGA 9. Jahrestagung Köln 2006, Tagungs‐CD, ISBN 3‐9809869‐5‐0   Wagener  KC,  Brand  T,  Kollmeier,  B  (1999b)  Entwicklung  und  Evaluation  eines  Satztests  in  deutscher  Sprache  Teil II: Optimierung des Oldenburger Satztests  Z Audiol 38, 44‐56  Wagener KC, Brand T, Kollmeier, B (1999c) Entwicklung und Evaluation eines Satztests für die deutsche Sprache  Teil III: Evaluation des Oldenburger Satztests  Z Audiol 38, 86–95  Wagener KC, Eeenboom F, Brand T, Kollmeier B (2005) Ziffern‐Tripel‐Test: Sprachverständlichkeitstest über das  Telefon. DGA 8. Jahrestagung Kiel 2005, Tagungs‐CD ISBN 3‐9809869‐4‐2   Wagener, KC, Josvassen, J L, Ardenkjaer, R (2003) Design, optimization and evaluation of a Danish sentence test  in noise. Int J Audiol 42, 10‐17  Wagener KC, Kühnel V, Kollmeier, B (1999a) Entwicklung und Evaluation eines Satztests in deutscher Sprache    I: Design des Oldenburger Satztests Z Audiol 38, 4–15  Wagener KC, Zokoll MA, Kollmeier B (2009) Vergleichbarkeit von Sprachtests für sechs europäische Sprachen.  DGA 12. Jahrestagung Innsbruck 2009, Tagungs‐CD ISBN 3‐9809869‐9‐3 

Warzybok A, Wagener KC, Brand T, Kollmeier, B (2010)Influence of linguistic complexity in relation to speech  material on non‐native soeech perception in noise. DAGA 2010, Berlin   

Suggest Documents