Methodics and tools of cough sound processing on basic of neural net

Vishniakou, U. А.; Shaya, B. H.

doi:10.21122/2309-4923-2023-1-35-41

dc.contributor.author	Vishniakou, U. А.
dc.contributor.author	Shaya, B. H.
dc.coverage.spatial	Minsk	ru
dc.date.accessioned	2023-07-19T12:26:54Z
dc.date.available	2023-07-19T12:26:54Z
dc.date.issued	2023
dc.identifier.citation	Vishniakou, U. А. Methodics and tools of cough sound processing on basic of neural net = Методы и средства обработки звуков кашля на базе нейронной сети / U. А. Vishniakou, B. H. Shaya // Системный анализ и прикладная информатика. – 2023. – № 1. – С. 35-41.	ru
dc.identifier.uri	https://rep.bntu.by/handle/data/131742
dc.description.abstract	The article analyzes methods and tools for processing cough sounds to detect lung diseases and describes a system, built on a deep neural network, for classifying and detecting cough sounds. Four types of machine learning and the use of a convolutional neural network (CNN) are considered. The hyperparameters of the CNN are given. Varieties of CNN-based machine learning are discussed. Published work on CNN-based cough sound processing is reviewed, noting the tools used and the recognition accuracy achieved. Details of machine learning with the Environmental Sound Classification (ESC-50) dataset are discussed. For recognizing COVID-19 cough, a classifier that uses a CNN as its machine learning model is analyzed. The proposed CNN system classifies and detects cough sounds based on ESC-50. Once the sound classification dataset is selected, four stages are described: feature extraction from the audio files, labeling, training, and testing. The ESC-50 dataset used in the study was downloaded from the Kaggle website. The project was implemented with Python libraries and modules for deep learning and data science: NumPy, Librosa, Matplotlib, Hickle, scikit-learn, and Keras. The implemented network used a stochastic gradient algorithm. Several volunteers recorded their coughs on smartphones, deliberately in public environments so that the recordings contained background noise; these recordings were supplemented with audio files downloaded from the internet. The results showed an average accuracy of 85.37 %, a precision of 78.8 %, and a recall of 91.9 %.	ru
dc.language.iso	en	ru
dc.publisher	БНТУ	ru
dc.title	Methodics and tools of cough sound processing on basic of neural net	ru
dc.title.alternative	Методы и средства обработки звуков кашля на базе нейронной сети	ru
dc.type	Article	ru
dc.identifier.doi	10.21122/2309-4923-2023-1-35-41
local.description.annotation	The aim of the article is to analyze the methodology and tools for processing cough sounds to detect lung diseases, and to describe the system developed for classifying and detecting cough sounds on the basis of a deep neural network. Four types of machine learning and the use of a convolutional neural network (CNN) are considered. The hyperparameters of the CNN are given. Varieties of CNN-based machine learning are discussed. Works on the methodology and tools for CNN-based cough sound processing are analyzed, with the tools used and the recognition accuracy reported. Details of machine learning with the Environmental Sound Classification dataset (ESC-50) are discussed. For recognizing COVID-19 cough, a classifier using a CNN as the machine learning model is analyzed. The proposed CNN system is intended for classifying and detecting cough sounds based on ESC-50. After the sound classification dataset is selected, four stages are described: feature extraction from audio files, labeling, training, and testing. The ESC-50 dataset used for the study was downloaded from the Kaggle website. Python libraries and modules related to deep learning and data science were used to implement the project: NumPy, Librosa, Matplotlib, Hickle, scikit-learn, and Keras. The implemented network used a stochastic gradient algorithm. Several volunteers recorded their coughs with their smartphones, making the recordings in public places so that noise was introduced into the sounds; some audio files downloaded online were used in addition. The results showed an average accuracy of 85.37 %, a precision of 78.8 %, and a recall of 91.9 %.	ru
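
Note on the reported metrics: the abstract gives accuracy, precision, and recall for cough detection. As a minimal sketch of how these three quantities relate to the counts in a binary confusion matrix (cough vs. non-cough), the Python snippet below may help. The function name and the counts are hypothetical and chosen only for illustration; they are not taken from the article and do not reproduce its results.

```python
def classification_metrics(tp, fp, fn, tn):
    """Return (accuracy, precision, recall) from binary confusion-matrix counts.

    tp: coughs detected as coughs      fp: other sounds detected as coughs
    fn: coughs that were missed        tn: other sounds correctly rejected
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return accuracy, precision, recall


# Hypothetical counts for illustration only (not the article's data).
acc, prec, rec = classification_metrics(tp=9, fp=3, fn=1, tn=7)
print(f"accuracy={acc:.2%} precision={prec:.2%} recall={rec:.2%}")
```

A recall higher than precision, as in the abstract, means the classifier misses few coughs but raises some false alarms on other sounds.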

Files in this item

Name: 35-41.pdf
Size: 329.9Kb
Format: PDF

This item appears in the following Collection(s)

№ 1[8]
