Logo image
Gate‐Align‐SED: Semi‐Supervised Sound Event Detection via Adaptive Feature Gating and Cross‐Task Alignment in Situation Awareness
Journal article   Open access   Peer reviewed

Gate‐Align‐SED: Semi‐Supervised Sound Event Detection via Adaptive Feature Gating and Cross‐Task Alignment in Situation Awareness

Jieli Chen, Li‐Minn Ang, Chee Shen Lim, Kah Phooi Seng and Jeremy Smith
Advanced Intelligent Systems, Vol.Advanced access
16-Apr-2026
pdf
Advanced Intelligent Systems - 2026 - Chen - Gate‐Align‐SED Semi‐Supervised Sound Event Detection via Adaptive Feature643.96 kBDownloadView
Published Version (Advanced Access) Open Access CC BY V4.0

Abstract

disaster monitoring representation learning sem-supervised learning situation awareness sound event detection
In complex real‐world environments such as disaster monitoring, effective sound event detection (SED) is often hindered by the presence of noise and limited labeled data. This article presents Gate‐Align‐SED, a unified semi‐supervised framework designed to bridge the gap between clip‐level and frame‐level acoustic modeling for disaster‐related audio understanding. The proposed method integrates adaptive feature fusion, mutual attention mechanisms, and a novel label alignment strategy that introduces a learnable correlation matrix to align heterogeneous label granularities. Furthermore, we incorporate a consistency learning paradigm grounded in the Mean‐Teacher framework, promoting robust representation learning across both temporal scales and annotation levels. Experiments demonstrate that the proposed approach enhances both the flexibility and stability of SED systems, particularly under label‐sparse or noisy conditions. Our work offers a scalable and generalizable solution for leveraging both weakly labeled and unlabeled data in critical acoustic event recognition scenarios.

Details

Metrics

1 Record Views
Logo image