Introduction
The “CASIA-Tencent Road Scene dataset” (RS10K) was built by the State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS), Institute of Automation of Chinese Academy of Sciences (CASIA), and T Lab, Tencent Map, Tencent Technology (Beijing) Co., Ltd. The images in the dataset were taken by onboard cameras of various vehicles in 31 cities in China. In each city, we randomly chose sections of high-level roads (roads above national standard level 6) to collect images. RS10K is a comprehensive dataset with annotations of various elements and relations, and it can be used for road and lane segmentation, traffic sign detection, traffic sign understanding, and visual traffic knowledge graph generation. We hope RS10K can support the development of the community. Because some images involved sensitive information and have been removed, the publicly available dataset differs slightly from the statistics published in the paper [1].
Annotations
There are 10066 images in RS10K, of which 7041 are for training and 3000 are for testing. The element annotations include roads and lanes annotated by masks, and signs and components annotated by quadrilateral bounding boxes. The relation annotations include S-S relations between signs, C-C relations between components, and A-T relations between an arrow element and a traffic element. The statistics are shown in Table 1.
Table 1. Statistics of elements and relations.
All the defined symbols of the components are shown in Table 2.
Table 2. Categories of symbols in components.
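To make the annotation types described above concrete, the following is a minimal Python sketch of how signs, components, and their relations could be held in memory. It is not the dataset's actual file format, which is not described here: the class names (QuadElement, Relation, ImageAnnotation), the field names, and the file name in the usage example are all hypothetical. Road and lane masks are left out for brevity. The area method uses the standard shoelace formula for a polygon given by its ordered corners.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

Point = Tuple[float, float]


@dataclass
class QuadElement:
    """A sign or component annotated by a quadrilateral (hypothetical layout)."""
    element_id: int
    kind: str              # assumed labels: "sign" or "component"
    corners: List[Point]   # four (x, y) corners, ordered around the quadrilateral

    def area(self) -> float:
        # Shoelace formula over the ordered corners.
        total = 0.0
        n = len(self.corners)
        for i in range(n):
            x1, y1 = self.corners[i]
            x2, y2 = self.corners[(i + 1) % n]
            total += x1 * y2 - x2 * y1
        return abs(total) / 2.0


@dataclass
class Relation:
    """A directed link between two elements."""
    kind: str        # "S-S", "C-C", or "A-T", as named in the text
    source_id: int
    target_id: int


@dataclass
class ImageAnnotation:
    """All quadrilateral elements and relations for one image (assumed grouping)."""
    image_name: str
    elements: List[QuadElement] = field(default_factory=list)
    relations: List[Relation] = field(default_factory=list)

    def relations_of_kind(self, kind: str) -> List[Relation]:
        # Filter relations by their type tag.
        return [r for r in self.relations if r.kind == kind]


if __name__ == "__main__":
    # Made-up coordinates, for illustration only.
    sign = QuadElement(1, "sign", [(0.0, 0.0), (4.0, 0.0), (4.0, 2.0), (0.0, 2.0)])
    arrow = QuadElement(2, "component", [(1.0, 0.5), (2.0, 0.5), (2.0, 1.5), (1.0, 1.5)])
    text = QuadElement(3, "component", [(2.5, 0.5), (3.5, 0.5), (3.5, 1.5), (2.5, 1.5)])
    ann = ImageAnnotation(
        "example.jpg",
        elements=[sign, arrow, text],
        relations=[Relation("A-T", 2, 3)],
    )
    print(sign.area())                        # 8.0
    print(len(ann.relations_of_kind("A-T")))  # 1
```

Keeping relations as separate records that refer to elements by identifier, rather than nesting elements inside each other, lets the same element take part in several relation types.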
The visual traffic knowledge graphs are composed of one or several knowledge trees, each of which is organized as shown in Figure 1.
Figure 1. The structure of a knowledge tree.
Condition of use
Reference
RS10K was first used in the following research work:
[1] Yunfei Guo, Fei Yin, Xiao-Hui Li, Xudong Yan, Tao Xue, Shuqi Mei and Cheng-Lin Liu. Visual Traffic Knowledge Graph Generation from Scene Images[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). 2023: 21604-21613.
Contact
Cheng-Lin Liu (liucl@nlpr.ia.ac.cn), Fei Yin ()
National Laboratory of Pattern Recognition (NLPR)
Institute of Automation of Chinese Academy of Sciences
95 Zhongguancun East Road, Beijing 100190, P.R. China
Haidian | Beijing | China
Phone: (+86-10) 8254-4797
Fax: (+86-10) 8254-4594
Email: liucl@nlpr.ia.ac.cn
Website: www.nlpr.ia.ac.cn/pal/