A list,
Speech enhancement refers to the technology of extracting useful speech signals from the noise background to suppress and reduce the noise interference when the speech signal is disturbed or even submerged by various noises. In a word, extract the original speech as pure as possible from the noisy speech. The actual speech interference can be divided into the following categories: (1) periodic noise, such as electrical interference, interference caused by engine rotating part, such as some discrete narrow frequency peaks; ② Impulse noise, such as some electric spark, discharge noise interference; ③ Wideband noise, which refers to gaussian noise or white noise, which is characterized by wide frequency band, almost covering the whole speech band; (4) voice interference, such as picking up other people’s speech in the microphone, or transmission encountered crosstalk caused by speech. To deal with the different types of noise mentioned above, the enhancement techniques are also different. One of the main goals of speech enhancement is to extract the original speech as pure as possible from the noisy speech signal. However, since the interference is usually random, it is almost impossible to extract completely pure speech from noisy speech. In this case, the purpose of voice enhancement is mainly two: one is to improve voice quality, eliminate background noise, so that the listener is happy to accept, do not feel tired, which is a subjective measure; Second, it’s an objective measure. These two purposes are often incompatible. At present, there are some speech enhancement methods for low SNR noisy speech, which can significantly reduce the background noise and improve the speech quality, but can not improve the speech intelligibility, even slightly decreased. Speech enhancement is not only related to the theory of speech signal digital processing, but also involves human auditory perception and phonetics. In addition, noise comes from many sources and their characteristics vary from one application to another. Therefore, different speech enhancement strategies must be adopted for different noises. Some speech enhancement algorithms have been proved to be effective in practical applications. They can be roughly divided into four categories: noise elimination method, harmonic enhancement method, speech reconstruction method based on parameter estimation and speech enhancement algorithm based on short-time spectrum estimation.
Ii. Source code
[y,fs] = Audioread (' myName.wav '); % y is the sampled data; Fs is the sampling frequency sound(y,fs); % Play language signal myname N = length(y); % signal length t = (0: n-1)/fs; % Time domain figure(1); subplot(211); plot(t,y); Title (' Audio time domain map '); % xlabel('Time'); ylabel('Amplitude'); Y = fft(y,N); % FFT Analysis spectrum df = fs/length(Y); % Calculate the line interval f = 0:df:(fs/2-df); % Spectrum range, cut the first half (sampling frequency is higher than 2 times the maximum frequency) Yf = ABS (Y); % amplitude response Yf = Yf(1:length(Yf)/2); % Since the amplitude response is even function, cut half % to draw figure(1); subplot(212); The axis ([0, fs / 5,0,5000]); plot(f,Yf); Title (' Audio spectrum map '); The time domain and frequency domain graphs of %% P2_2_2 with noise y = y(:,1); y = y'; nt = randn(1,length(y)); % Generate Gaussian noise % design BPF to get 4KHZ to 5KHZ noise FP1 = 4000; fp2 = 5000; % BPF WP1 = 2* PI * fP1 /fs; WP2 = 2*pi*fp2/fs; % Convert analog index to digital index Wn1 = [WP1 WP2]; b = fir1(34,wn1,'bandpass'); yn = filter(b,1,nt); % noise signal s = y+yn; % mix band limited noise with original audio %sound(s,fs); S = fft(s,N); Sf = abs(S); % amplitude DF = fs/length(S); % Calculate the line interval f = 0:DF:(fs/ 2-df); % Spectrum range, cut the first half (sampling frequency is higher than 2 times the maximum frequency) Sf = Sf(1:length(Sf)/2); % Since the amplitude response is even function, cut half % to draw figure(2); subplot(211); plot(t,s); Title (' Audio time domain map with noise '); xlabel('Time'); ylabel('Amplitude'); grid on; subplot(212); The axis ([0, fs / 5,0,10000]); plot(f,Sf); Title (' Noise-added audio spectrum '); xlabel('Frequency'); ylabel('Amplitude'); grid on; %% P2_2_3 design filter, and filter % design band stop filter to filter noise FP1 = 3500; FS1 = 3600; FP2 = 5600; FS2 = 5500; rp2 = 1; rs2 = 40; FP = [FP1 FP2]; FS = [FS1 FS2]; wp = 2*pi*FP/fs; % Convert analog index to digital index ws = 2* PI *FS/ FS; % Chebyshev IIR filter [N,wn] = Cheb1ORD (WP, WS, RP2,rs2); % the minimum order of filter is n, wn is system frequency band [bz,az] = cheby1(n,rp2, WP,'stop'); sound(yfilt,fs); % Yfil = abs(FFT (yfilt,N)); Yf = Yfil(1:length(Yfil)/2);Copy the code
3. Operation results
Fourth, note
Complete code or write to add QQ1575304183