Issue StoriesSpeech or Noise? Using Signal Detection and Noise Reductionby Nancy Tellier, MSc, Horst Arndt, PhD, and Henry Luo, PhD A technical review of a new multidimensional signal detection and noise reduction system, along with background on various digital strategies for enhancing speech in noise. Listening in the presence of background noise has traditionally posed the greatest challenge for hearing aid wearers and is also a primary source of dissatisfaction with hearing aids.1,2 Ideally, an intelligent hearing instrument would distinguish between speech and noise, and then amplify the desirable speech while suppressing undesirable noise in the signal. Recent developments3 in digital signal processingincluding intelligent signal detection and noise reduction and switchable directional microphoneshave helped to make this wish a reality. Classifying Noise However, the incoming signal cannot be properly classified as speech or noise based only on a single dimension; sound events cannot be accurately categorized solely on the basis of their amplitudes. Furthermore, a single dimension does not take into account changing conditions in which noise is more or less prevalent at different times. For example, when listening to the radio while riding in a car, the amplitude of the noise from the car rises and falls proportionately as the car changes speed; yet the speech from the car radio remains constant. The result is that the more intense noise events, such as engine noise when the car speeds up, will not be categorized as noise, and many non-noise sound events, such as the car radio, will be categorized as noise.3 The modulation index is another dimension that can be used to categorize the incoming signal as either speech or noise.4-9 The modulation index is defined as the rate of change of the signal amplitude in each band. Speech exhibits rapid and frequent amplitude changes and, therefore, has a high modulation index. Steady state noise, on the other hand, has a low modulation index. If noise is detected in a single band, the gain of that band is reduced relative to the other non-noise bands. Thus, bands with noise are suppressed in favor of bands without noise. Classifying noise solely on the basis of its modulation index would assume that the frequency spectrums for all noise demonstrate little or no variation over time. Such a system would be good at detecting stationary or slowly changing pseudo-stationary noise, but these characteristics do not constitute the majority of listening environments. Consequently, just as you cannot classify noise based solely on its amplitude, it may not be appropriate to classify the incoming signal as speech or noise based solely on its modulation index. Multidimensional Signal Detection In a new noise reduction algorithm, ClearPath technology used in Unitron Hearings Conversa, incoming signals are analyzed in 16 separate bands along 3 different dimensions rather than using a unidimensional approach. Amplification or suppression is then applied differently, and on a weighted basis to the signal based on the categorization. Signal Analysis and Classification
The differing characteristics of these four signal categories are illustrated in Figure 1.
Using these three dimensions, the four signal types (three types of noise plus desirable signals) can be placed on the continuum scale found in Table 1.
An analysis of intensity change, modulation frequency and time is done simultaneously in each narrow frequency band to determine subindices for each of these three dimensions. These are combined to produce a signal index on a three-dimensional continuum within each frequency band. The signal in each band is classified as one of the four signal types by its three dimensional index (Figure 2). This signal index determines how much the output of that band is amplified or suppressed. Consequently, the most gain is applied to bands containing desirable signals, and less or no gain is applied to bands containing noise. This is designed to result in a reliable, accurate, adaptive signal detection system to drive the noise reduction algorithm.10
Optimization of Noise Reduction
Attack and Release Times: The attack and release times need to be optimized to minimize impact on speech sounds when the noise reduction is activated. Time constants are a crucial factor in a noise reduction algorithm. If the algorithm reacts too quickly to changes in the signal index, a sudden transient burst from a brief phoneme (eg, /d/ or /t/) could be suppressed as if it were a noise and become inaudible. Conversely, when the time constants are too long, pseudo-stationary noises and transient noises would not be detected quickly enough. The gain in the frequency channels containing noise would not be reduced in time and there would be no benefit from the noise reduction algorithm. Furthermore, as the signal changes from noise to speech, the time required for the algorithm to increase the gain in each band would be too long. Therefore, important speech information may be lost.
Figure 4 shows the effects of different time constants on a signal consisting of speech in the presence of fan noise. Note that with slower time constants (top diagram), noise reduction is less effective in removing the noise from the speech than with faster time constants(bottom diagram). Adjustable Noise Reduction: Providing fitter adjustment for the degree of noise reduction adds further flexibility to a noise reduction system. Choices of off, moderate, or maximum for each listening program allow different settings according to the wearers needs or preferences in different listening environments. For example, switching from moderate to maximum noise reduction increases the amount of available gain reduction equally in each frequency band. This means that the amount of absolute gain reduction can be changed, but the spectral weighting bias will be unaffected (Figure 5).
Summary
Individuals fit with this technology consistently comment on its comfort and effectiveness in noisy settings. The multidimensional signal detection, adjustable noise reduction, and optimized attack and release times are designed to reduce unwanted background noise with minimal impact on important speech cues. When coupled with directional microphone technology, this noise reduction system can provide an effective solution for listening to speech in noise, increasing overall user satisfaction with their hearing aids.
References |
|
|
|
ADDITIONAL ONLINE RESOURCES |
Featured Employer
|