ESTMST-ST: An End-to-End Soft Threshold and Multiloss Self-Distillation Based Swin Transformer for Underwater Acoustic Signal Recognition

Wu Fan, Yao Haiyang*, Zhao Zhongda, Zhao Xiaobo, Zang Yuzhang, Wang Haiyan

*Corresponding author af dette arbejde

Publikation: Bidrag til tidsskrift/Konferencebidrag i tidsskrift /Bidrag til avisTidsskriftartikelForskningpeer review

Abstract

Underwater acoustic signal recognition (UASR) is significant for marine life and ecological environment protection. However, 2-D fixed-parameter inputs are inadequate for adapting to the variable underwater acoustic environment, and learnable-parameter inputs with inductive bias priors in transformers lead to difficulties in model convergence. Additionally, the differing optimization objectives between noise reduction and recognition methods can cause signal distortion, hindering recognition accuracy. To address these issues, this article proposes an innovative end-to-end soft threshold Swin Transformer model based on a multiloss self-distillation training strategy (ESTMST-ST) for robust recognition of weak underwater acoustic targets. Building on the previously proposed time-frequency Swin Transformer (TFST), we design a learnable dual filter module (LDFM) that decomposes underwater acoustic signals in the frequency direction, with parameters obtained through model training. To improve the model's antinoise performance, we incorporate a soft threshold strategy within TFST to reduce nonstationary interference in underwater acoustic signals. For enhanced robustness and training efficiency, we introduce a self-distillation training strategy with four specific loss functions in selected stage in TFST. Using publicly available datasets, ShipsEar and DeepShip, we conduct three experiments: fixed signal-to-noise ratio (SNR) UASR, multi-SNR UASR, and model generalization ability tests. The experimental results demonstrate that ESTMST-ST achieves superior performance (at least a 1.6 improvement in F scores and a 2.2 improvement in kappa coefficients) compared to five state-of-the-art methods across two open-source datasets.

OriginalsprogEngelsk
Artikelnummer4200813
TidsskriftIEEE Transactions on Geoscience and Remote Sensing
Vol/bind63
Antal sider13
ISSN0196-2892
DOI
StatusUdgivet - 2025

Fingeraftryk

Dyk ned i forskningsemnerne om 'ESTMST-ST: An End-to-End Soft Threshold and Multiloss Self-Distillation Based Swin Transformer for Underwater Acoustic Signal Recognition'. Sammen danner de et unikt fingeraftryk.

Citationsformater