Abstract
In this paper, we present offline-to-online knowledge distillation (OOKD) for video instance segmentation (VIS), which transfers a wealth of video knowledge from an offline model to an online model for consistent prediction. Unlike previous methods that adopt either an online or an offline model, our single online model takes advantage of both by distilling offline knowledge. To transfer knowledge correctly, we propose query filtering and association (QFA), which filters out queries that are irrelevant to the exact instances. Our KD with QFA increases the robustness of feature matching by encoding object-centric features from a single frame, supplemented by long-range global information. We also propose a simple data augmentation scheme for knowledge distillation in the VIS task that fairly transfers the knowledge of all classes into the online model. Extensive experiments show that our method significantly improves video instance segmentation performance, especially on challenging datasets that include long, dynamic sequences. Our method also achieves state-of-the-art performance on the YTVIS-21, YTVIS-22, and OVIS datasets, with mAP scores of 46.1%, 43.6%, and 31.1%, respectively.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 158-167 |
| Number of pages | 10 |
| ISBN (Electronic) | 9798350318920 |
| DOIs | |
| State | Published - 3 Jan 2024 |
| Event | 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024 - Waikoloa, United States. Duration: 4 Jan 2024 → 8 Jan 2024 |
Publication series
| Name | Proceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024 |
|---|---|
Conference
| Conference | 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024 |
|---|---|
| Country/Territory | United States |
| City | Waikoloa |
| Period | 4/01/24 → 8/01/24 |
Bibliographical note
Publisher Copyright: © 2024 IEEE.
Keywords
- Algorithms
- Image recognition and understanding
- Video recognition and understanding
Fingerprint
Dive into the research topics of 'Offline-to-Online Knowledge Distillation for Video Instance Segmentation'. Together they form a unique fingerprint.