Research of the impact of noise reduction methods on the quality of audio signal recovery

Автор(и)

DOI:

https://doi.org/10.18664/ikszt.v29i3.313606

Ключові слова:

noise suppression, filtering, audio, noise, SNR, PSNR, spectral subtraction, frequency filters, wavelet transform, experiment

Анотація

The subject of the study is the analysis  of various filtering algorithms for the  quality of the resulting audio files. The  importance  of audio line filtering has  grown significantly in recent years due  to its key role in a variety of  applications such as speech  reduction and artificial intelligence. Taking into account the growing demand for  solving problems related to speech  recognition, the processing of audio series becomes important for  determining the accuracy and  efficiency of the obtained  solution.
The purpose of the work is to study the impact of noise suppression methods on the quality of restoration of an audio signal, which was alternately noisy with one of five types of noise - white, pink, brown, impulse, Gaussian with different power. To achieve the goal, the following tasks were solved: an analysis of the types of noise was carried out and analysis of noise reduction and filtering methods. A generalized model of noise reduction and filtering was developed, and an experiment was planned depending on the type and power of noise.
Simulation of the experiment was  performed by comparing the  parameters of the signal-to-noise ratio before and after the experiment and  the peak signal-to-noise ratio in the  processed file. The following methods  are used: spectral subtraction, filtering  based on frequency filters and wavelet transformation.
The following results were obtained:  depending on the selected noises and  algorithms, it was possible to achieve  the lowest value of the peak signal-to-noise ratio of 21.52db, and the signal-to-noise ratio increased, which allowed further work with these audio files. The practical significance of this work is the increase in the number of available  audio files for further work.
Conclusions: the analysis of the  obtained results showed that filtering based on frequency filters only  worsened the output signal, that is, not only noise, but also useful information is filtered. In all runs, the SNR deteriorates to - 18dB. which is worse than no filtering. Algorithms of spectral subtraction and wavelet transformation improved SNR parameters and output audio files noisy with the most powerful noises in the range of 20dB, which can be considered acceptable for further processing. The results highlight the importance of using denoising and filtering for complex audio processing tasks, particularly neural  network  training tasks.

Біографії авторів

Олеся Юріївна Барковська, Kharkiv National University of Radio Electronics

Ph.D (Engineering Sciences), Docent, Associate Professor Department of Electronic Computers

Антон Олегович Гаврашенко, Kharkiv National University of Radio Electronics

Professor Assistant at Department of Electronic Computer

Посилання

MALIK, Mishaim, et al. Automatic speech recognition: a survey. Multimedia Tools and Applications, 2021, 80: 9411-9457.

KUMAR, Yogesh; KOUL, Apeksha; SINGH, Chamkaur. A deep learning approaches in text-to-speech system: a systematic review and recent research perspective. Multimedia Tools and Applications, 2023, 82.10: 15171-15197.

HAN, Wei, et al. Contextnet: Improving convolutional neural networks for automatic speech recognition with global context. arXiv preprint arXiv:2005.03191, 2020.

Barkovska О., Havrashenko А. Analysis of the influence of selected audio pre-processing stages on accuracy of speaker language recognition // Innovative technologies and scientific solutions for industries. – 2023. – №. 4 (26). – С. 16-23. https://doi.org/10.30837/ITSSI.2023.26.016

Havrashenko A., Barkovska O. Analysis of text augmentation algorithms in artificial language machine translation systems //Advanced Information Systems. – 2023. – Т. 7. – №. 1. – С. 47-53. https://doi.org/10.20998/2522-9052.2023.1.08

Barkovska О., Havrashenko А. Analysis of word search algorithms in the dictionaries of machine translation systems for artificial languages //Computer systems and information technologies. – 2023. – №. 2. – С. 17-24. https://doi.org/10.31891/csit-2023-2-2

DUBEY, Harishchandra, et al. Icassp 2023 deep noise suppression challenge. IEEE Open Journal of Signal Processing, 2024.

PATIDAR, Pawan, et al. Image de-noising by various filters for different noise. International journal of computer applications, 2010, 9.4: 45-50.

KARAM, Marc, et al. Noise removal in speech processing using spectral subtraction. Journal of Signal and Information Processing, 2014, 2014.

KUMAR, Ashish, et al. Stationary wavelet transform based ECG signal denoising method. ISA transactions, 2021, 114: 251-262.

DARRAS, Kevin FA, et al. High microphone signal-to-noise ratio enhances acoustic sampling of wildlife. PeerJ, 2020, 8: e9955.

##submission.downloads##

Опубліковано

2024-10-25