Multi-Scale Self-Attention Network for Denoising Medical Images

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Deep learning-based image denoising plays a critical role in medical imaging, especially when dealing with rapid fluorescence and ultrasound captures where traditional noise mitigation strategies are limited, such as increasing pixel dwell time or frame averaging. Although numerous denoising techniques based on deep learning have exhibited commendable results across biomedical domains, further optimization is pivotal, particularly for precise real-time tracking of molecular kinetics in cellular settings. This is vital for decoding the intricate dynamics of biological processes. In this context, we propose the Multi-Scale Self-Attention Network (MSAN), an innovative architecture tailored for optimal denoising of fluorescence and ultrasound images. MSAN integrates three main modules: a feature extraction layer adept at discerning high and low-frequency attributes, a multi-scale self-attention mechanism that predicts residuals using original and downsampled feature maps, and a decoder that produces a residual image. When offset from the original image, the residual output yields the denoised result. Benchmarking shows MSAN outperforms state-of-the-art models such as RIDNet and DnCNN, achieving peak signal-to-noise ratio improvements of 0.17 dB, 0.23 dB, and 1.77dB on the FMD, W2S datasets, and ultrasound dataset, respectively, thus showcasing its superior denoising capability for fluorescence and ultrasound imagery.

Original languageEnglish
Pages (from-to)1-26
Number of pages26
JournalAPSIPA Transactions on Signal and Information Processing
Volume12
Issue number5
DOIs
StatePublished - 22 Jan 2024

Bibliographical note

Publisher Copyright:
© 2024 K. Lee et al.

Keywords

  • Blind source separation
  • Circular microphone array
  • Online-independent vector analysis
  • Sound field interpolation

Fingerprint

Dive into the research topics of 'Multi-Scale Self-Attention Network for Denoising Medical Images'. Together they form a unique fingerprint.

Cite this