Please use this identifier to cite or link to this item: https://hdl.handle.net/10316/113751
Title: Automatic wheeze segmentation using harmonic-percussive source separation and empirical mode decomposition
Authors: Rocha, Bruno M. 
Pessoa, Diogo 
Marques, Alda
Carvalho, Paulo de 
Paiva, Rui Pedro 
Keywords: Respiratory sound analysis; expert systems; harmonic-percussive source separation; empirical mode decomposition; sound event detection
Issue Date: 23-Feb-2023
Publisher: IEEE
Project: UIDB/00326/2020 
UIDP/00326/2020 
SFRH/BD/135686/2018 
DFA/BD/4927/2020 
POCI- 01-0145-FEDER-007628 
UIDB/04501/2020 
Serial title, monograph or event: IEEE Journal of Biomedical and Health Informatics
Volume: 27
Issue: 4
Abstract: Wheezes are adventitious respiratory sounds commonly present in patients with respiratory conditions. The presence of wheezes and their time location are relevant for clinical reasons, such as understanding the degree of bronchial obstruction. Conventional auscultation is usually employed to analyze wheezes, but remote monitoring has become a pressing need during recent years. Automatic respiratory sound analysis is required to reliably perform remote auscultation. In this work we propose a method for wheeze segmentation. Our method starts by decomposing a given audio excerpt into intrinsic mode frequencies using empirical mode decomposition. Then, we apply harmonic-percussive source separation to the resulting audio tracks and get harmonic-enhanced spectrograms, which are processed to obtain harmonic masks. Subsequently, a series of empirically derived rules are applied to find wheeze candidates. Finally, the candidates stemming from the different audio tracks are merged and median filtered. In the evaluation stage, we compare our method to three baselines on the ICBHI 2017 Respiratory Sound Database, a challenging dataset containing various noise sources and background sounds. Using the full dataset, our method outperforms the baselines, achieving an F1 of 41.9%. Our method's performance is also better than the baselines across several stratified results focusing on five variables: recording equipment, age, sex, body-mass index, and diagnosis. We conclude that, contrary to what has been reported in the literature, wheeze segmentation has not been solved for real life scenario applications. Adaptation of existing systems to demographic characteristics might be a promising step in the direction of algorithm personalization, which would make automatic wheeze segmentation methods clinically viable.
URI: https://hdl.handle.net/10316/113751
DOI: 10.1109/JBHI.2023.3248265
Rights: openAccess
Appears in Collections:I&D CISUC - Artigos em Revistas Internacionais

Show full item record

Page view(s)

46
checked on Jun 26, 2024

Download(s)

17
checked on Jun 26, 2024

Google ScholarTM

Check

Altmetric

Altmetric


This item is licensed under a Creative Commons License Creative Commons