TY - JOUR
T1 - Skeleton Ground Truth Extraction
T2 - Methodology, Annotation Tool and Benchmarks
AU - Yang, Cong
AU - Indurkhya, Bipin
AU - See, John
AU - Gao, Bo
AU - Ke, Yan
AU - Boukhers, Zeyd
AU - Yang, Zhenyu
AU - Grzegorzek, Marcin
N1 - Funding Information:
Research activities leading to this work have been supported by the Natural Science Foundation of the Jiangsu Higher Education Institutions of China (Grant Number: 22KJB520008) and the Research Fund of Clobotics (Grant Number: KB1801ZW201609-03). We would like to thank Zixuan Chen from Darmstadt University of Technology (Germany) for his help in assembling the first version of SkeView.
Publisher Copyright:
© 2023, The Author(s).
PY - 2023/11/1
Y1 - 2023/11/1
N2 - Skeleton Ground Truth (GT) is critical to the success of supervised skeleton extraction methods, especially with the popularity of deep learning techniques. Furthermore, we see skeleton GTs used not only for training skeleton detectors with Convolutional Neural Networks (CNN), but also for evaluating skeleton-related pruning and matching algorithms. However, most existing shape and image datasets suffer from the lack of skeleton GT and inconsistency of GT standards. As a result, it is difficult to evaluate and reproduce CNN-based skeleton detectors and algorithms on a fair basis. In this paper, we present a heuristic strategy for object skeleton GT extraction in binary shapes and natural images. Our strategy is built on an extended theory of diagnosticity hypothesis, which enables encoding human-in-the-loop GT extraction based on clues from the target’s context, simplicity, and completeness. Using this strategy, we developed a tool, SkeView, to generate skeleton GT of 17 existing shape and image datasets. The GTs are then structurally evaluated with representative methods to build viable baselines for fair comparisons. Experiments demonstrate that GTs generated by our strategy yield promising quality with respect to standard consistency, and also provide a balance between simplicity and completeness.
AB - Skeleton Ground Truth (GT) is critical to the success of supervised skeleton extraction methods, especially with the popularity of deep learning techniques. Furthermore, we see skeleton GTs used not only for training skeleton detectors with Convolutional Neural Networks (CNN), but also for evaluating skeleton-related pruning and matching algorithms. However, most existing shape and image datasets suffer from the lack of skeleton GT and inconsistency of GT standards. As a result, it is difficult to evaluate and reproduce CNN-based skeleton detectors and algorithms on a fair basis. In this paper, we present a heuristic strategy for object skeleton GT extraction in binary shapes and natural images. Our strategy is built on an extended theory of diagnosticity hypothesis, which enables encoding human-in-the-loop GT extraction based on clues from the target’s context, simplicity, and completeness. Using this strategy, we developed a tool, SkeView, to generate skeleton GT of 17 existing shape and image datasets. The GTs are then structurally evaluated with representative methods to build viable baselines for fair comparisons. Experiments demonstrate that GTs generated by our strategy yield promising quality with respect to standard consistency, and also provide a balance between simplicity and completeness.
UR - http://www.scopus.com/inward/record.url?scp=85175372155&partnerID=8YFLogxK
U2 - 10.1007/s11263-023-01926-3
DO - 10.1007/s11263-023-01926-3
M3 - Article
AN - SCOPUS:85175372155
SN - 0920-5691
JO - International Journal of Computer Vision
JF - International Journal of Computer Vision
ER -