The PASCAL Visual Object Classes Challenge: A Retrospective. The PASCAL Visual Object Classes (VOC) challenge consists of two components: (i) a publicly available dataset of images together with ground truth annotation and standardised evaluation software; and (ii) an annual competition and workshop. There are five challenges: classification, detection, segmentation, action classification, and person layout. In this paper we provide a review of the challenge from 2008–2012. The paper is intended for two audiences: algorithm designers, researchers who want to see what the state of the art is, as measured by performance on the VOC datasets, along with the limitations and weak points of the current generation of algorithms; and, challenge designers, who want to see what we as organisers have learnt from the process and our recommendations for the organisation of future challenges. To analyse the performance of submitted algorithms on the VOC datasets we introduce a number of novel evaluation methods: a bootstrapping method for determining whether differences in the performance of two algorithms are significant or not; a normalised average precision so that performance can be compared across classes with different proportions of positive instances; a clustering method for visualising the performance across multiple algorithms so that the hard and easy images can be identified; and the use of a joint classifier over the submitted algorithms in order to measure their complementarity and combined performance. We also analyse the community’s progress through time using the methods of Hoiem et al. (Proceedings of European Conference on Computer Vision, 2012) to identify the types of occurring errors. We conclude the paper with an appraisal of the aspects of the challenge that worked well, and those that could be improved in future challenges.

References in zbMATH (referenced in 139 articles )

Showing results 1 to 20 of 139.
Sorted by year (citations)

1 2 3 ... 5 6 7 next

  1. Cauchois, Maxime; Gupta, Suyash; Duchi, John C.: Knowing what you know: valid and validated confidence sets in multiclass and multilabel prediction (2021)
  2. Chen, Zhe; Zhang, Jing; Tao, Dacheng: Recursive context routing for object detection (2021)
  3. Ignatiev, V. Yu.; Matveev, I. A.; Murynin, A. B.; Usmanova, A. A.; Tsurkov, V. I.: Increasing the spatial resolution of panchromatic satellite images based on generative neural networks (2021)
  4. Ma, Jiayi; Jiang, Xingyu; Fan, Aoxiang; Jiang, Junjun; Yan, Junchi: Image matching from handcrafted to deep features: a survey (2021)
  5. Mortier, Thomas; Wydmuch, Marek; Dembczyński, Krzysztof; Hüllermeier, Eyke; Waegeman, Willem: Efficient set-valued prediction in multi-class classification (2021)
  6. Wang, Zhengyang; Ji, Shuiwang: Smoothed dilated convolutions for improved dense prediction (2021)
  7. Wu, Zhenni; Chen, Hengxin; Fang, Bin; Li, Zihao; Chen, Xinrun: Building pose estimation from the perspective of UAVs based on CNNs (2021)
  8. Zhang, Jianjia; Wang, Lei; Zhou, Luping; Li, Wanqing: Beyond covariance: SICE and kernel based visual feature representation (2021)
  9. Chen, Yi-Wen; Tsai, Yi-Hsuan; Lin, Yen-Yu; Yang, Ming-Hsuan: VOSTR: video object segmentation via transferable representations (2020)
  10. Khattar, Sahil; Rama Krishna, C.: Adversarial attack to fool object detector (2020)
  11. Liu, Li; Ouyang, Wanli; Wang, Xiaogang; Fieguth, Paul; Chen, Jie; Liu, Xinwang; Pietikäinen, Matti: Deep learning for generic object detection: a survey (2020)
  12. Ma, Fan; Meng, Deyu; Dong, Xuanyi; Yang, Yi: Self-paced multi-view co-training (2020)
  13. Sharma, Vipul; Mir, Roohie Naaz: A comprehensive and systematic look up into deep learning based object detection techniques: a review (2020)
  14. Sun, Xiao; Lian, Zhouhui: EasyMesh: an efficient method to reconstruct 3D mesh from a single image (2020)
  15. Valada, Abhinav; Mohan, Rohit; Burgard, Wolfram: Self-supervised model adaptation for multimodal semantic segmentation (2020)
  16. Wang, Xiang; Liu, Sifei; Ma, Huimin; Yang, Ming-Hsuan: Weakly-supervised semantic segmentation by iterative affinity learning (2020)
  17. Zhao, Jiaojiao; Han, Jungong; Shao, Ling; Snoek, Cees G. M.: Pixelated semantic colorization (2020)
  18. Zhao, Yue; Xiong, Yuanjun; Wang, Limin; Wu, Zhirong; Tang, Xiaoou; Lin, Dahua: Temporal action detection with structured segment networks (2020)
  19. Daneshmand, Amir; Sun, Ying; Scutari, Gesualdo; Facchinei, Francisco; Sadler, Brian M.: Decentralized dictionary learning over time-varying digraphs (2019)
  20. Jens Behley, Martin Garbade, Andres Milioto, Jan Quenzel, Sven Behnke, Cyrill Stachniss, Juergen Gall: SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences (2019) arXiv

1 2 3 ... 5 6 7 next