A Transformer-Based Framework for Tiny Object Detection

Yi Kai Liao, Gong Si Lin, Mei Chen Yeh

研究成果: 書貢獻/報告類型會議論文篇章

摘要

This paper proposes a fully transformer-based method for building an end-to-end model dedicated to tiny object detection. Our approach eliminates the components which are difficult to be designed in detecting tiny objects, such as anchor generation and non-maximum suppression. Additionally, we address the issue of receptive fields for tiny objects in convolutional neural networks through self-attention. The model named Swin-Deformable DEtection TRansformer (SD DETR) integrates Swin Transformer [1] and Deformable DETR [2]. Furthermore, we have introduced architectural enhancements and optimized the loss function to improve the model's ability in detecting tiny objects. Experimental results on the AI-TOD [3] dataset demonstrate that SD DETR achieves 10.9 AP for very tiny objects with only 2 to 4 pixels, showcasing a significant improvement of +1.2 AP compared to the current state-of-the-art model. The code is available at https://github.com/kai271828/SDDERT.

原文英語
主出版物標題2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023
發行者Institute of Electrical and Electronics Engineers Inc.
頁面373-377
頁數5
ISBN(電子)9798350300673
DOIs
出版狀態已發佈 - 2023
事件2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023 - Taipei, 臺灣
持續時間: 2023 10月 312023 11月 3

出版系列

名字2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023

會議

會議2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023
國家/地區臺灣
城市Taipei
期間2023/10/312023/11/03

ASJC Scopus subject areas

  • 硬體和架構
  • 訊號處理
  • 人工智慧
  • 電腦科學應用

指紋

深入研究「A Transformer-Based Framework for Tiny Object Detection」主題。共同形成了獨特的指紋。

引用此