LungNeXt: A novel lightweight network utilizing enhanced mel-spectrogram for lung sound classification


Journal article


Fan Wang, Xiaochen Yuan, Yue Liu, Chan-Tong Lam
Journal of King Saud University - Computer and Information Sciences, vol. 36, 2024, p. 102200


Link
Cite

Cite

APA   Click to copy
Wang, F., Yuan, X., Liu, Y., & Lam, C.-T. (2024). LungNeXt: A novel lightweight network utilizing enhanced mel-spectrogram for lung sound classification. Journal of King Saud University - Computer and Information Sciences, 36, 102200. https://doi.org/10.1016/j.jksuci.2024.102200


Chicago/Turabian   Click to copy
Wang, Fan, Xiaochen Yuan, Yue Liu, and Chan-Tong Lam. “LungNeXt: A Novel Lightweight Network Utilizing Enhanced Mel-Spectrogram for Lung Sound Classification.” Journal of King Saud University - Computer and Information Sciences 36 (2024): 102200.


MLA   Click to copy
Wang, Fan, et al. “LungNeXt: A Novel Lightweight Network Utilizing Enhanced Mel-Spectrogram for Lung Sound Classification.” Journal of King Saud University - Computer and Information Sciences, vol. 36, 2024, p. 102200, doi:10.1016/j.jksuci.2024.102200.


BibTeX   Click to copy

@article{wang2024a,
  title = {LungNeXt: A novel lightweight network utilizing enhanced mel-spectrogram for lung sound classification},
  year = {2024},
  journal = {Journal of King Saud University - Computer and Information Sciences},
  pages = {102200},
  volume = {36},
  doi = {10.1016/j.jksuci.2024.102200},
  author = {Wang, Fan and Yuan, Xiaochen and Liu, Yue and Lam, Chan-Tong}
}

Flowchart of the proposed lung sound classification using LungNeXt
Abstract: Lung auscultation is essential for early lung condition detection. Categorizing adventitious lung sounds requires expert discrimination by medical specialists. This paper details the features of LungNeXt, a novel classification model specifically designed for lung sound analysis. Furthermore, we propose two auxiliary methods: RandClipMix (RCM) for data augmentation and Enhanced Mel-Spectrogram for Feature Extraction (EMFE). RCM addresses the issue of data imbalance by randomly mixing clips within the same category to create new adventitious lung sounds. EMFE augments specific frequency bands in spectrograms to highlight adventitious features. These contributions enable LungNeXt to achieve outstanding performance. LungNeXt optimally integrates an appropriate number of NeXtblocks, ensuring superior performance and a lightweight model architecture. The proposed RCM and EMFE methods, along with the LungNeXt classification network, have been evaluated on the SPRSound dataset. Experimental results revealed a commendable score of 0.5699 for the lung sound five-category task on SPRSound. Specifically, the LungNeXt model is characterized by its efficiency, with only 3.804M parameters and a computational complexity of 0.659G FLOPS. This lightweight and efficient model is particularly well-suited for applications in electronic stethoscope back-end processing equipment, providing efficient diagnostic advice to physicians and patients.


Tools
Translate to