摘要
Most online action detection methods focus on solving a (K + 1) classification problem, where the additional category represents the ‘background’ class. However, training on the ‘background’ class and managing data imbalance are common challenges in online action detection. To address these issues, we propose a framework for online action detection by incorporating an additional pathway between the feature extractor and online action detection model. Specifically, we present one configuration that retains feature distinctions for fusion with the final decision from the Long Short-Term Transformer (LSTR), enhancing its performance in the (K + 1) classification. Experimental results show that the proposed method achieves an accuracy of 71.2% in mean Average Precision (mAP) on the Thumos14 dataset, outperforming the 69.5% achieved by the original LSTR method.
原文 | 英語 |
---|---|
文章編號 | 4110 |
期刊 | Electronics (Switzerland) |
卷 | 13 |
發行號 | 20 |
DOIs | |
出版狀態 | 已發佈 - 2024 10月 |
ASJC Scopus subject areas
- 控制與系統工程
- 訊號處理
- 硬體和架構
- 電腦網路與通信
- 電氣與電子工程