Abstract
In this paper, we present deep residual network (ResNet)-based vehicle classification and localization methods using real traffic surveillance recordings. We utilize a MIOvision traffic dataset, which comprises 11 categories including a variety of vehicles, such as bicycle, bus, car, motorcycle, and so on. To improve the classification performance, we exploit a technique called joint fine-tuning (JF). In addition, we propose a dropping CNN (DropCNN) method to create a synergy effect with the JF. For the localization, we implement basic concepts of state-of-the-art region based detector combined with a backbone convolutional feature extractor using 50 and 101 layers of residual networks and ensemble them into a single model. Finally, we achieved the highest accuracy in both classification and localization tasks using the dataset among several state-of-the-art methods, including VGG16, AlexNet, and ResNet50 for the classification, and YOLO Faster R-CNN, and SSD for the localization reported on the website.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2017 |
| Publisher | IEEE Computer Society |
| Pages | 934-940 |
| Number of pages | 7 |
| ISBN (Electronic) | 9781538607336 |
| DOIs | |
| State | Published - 22 Aug 2017 |
| Event | 30th IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2017 - Honolulu, United States Duration: 21 Jul 2017 → 26 Jul 2017 |
Publication series
| Name | IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops |
|---|---|
| Volume | 2017-July |
| ISSN (Print) | 2160-7508 |
| ISSN (Electronic) | 2160-7516 |
Conference
| Conference | 30th IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2017 |
|---|---|
| Country/Territory | United States |
| City | Honolulu |
| Period | 21/07/17 → 26/07/17 |
Bibliographical note
Publisher Copyright:© 2017 IEEE.