TY - JOUR
T1 - R2S100K
T2 - Road-Region Segmentation Dataset for Semi-supervised Autonomous Driving in the Wild
AU - Butt, Muhammad Atif
AU - Ali, Hassan
AU - Qayyum, Adnan
AU - Sultani, Waqas
AU - Al-Fuqaha, Ala
AU - Qadir, Junaid
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/8/23
Y1 - 2024/8/23
N2 - Semantic understanding of roadways is a key enabling factor for safe autonomous driving. However, existing autonomous driving datasets provide well-structured urban roads while ignoring unstructured roadways containing distress, potholes, water puddles, and various kinds of road patches i.e., earthen, gravel etc. To this end, we introduce Road Region Segmentation dataset (R2S100K)—a large-scale dataset and benchmark for training and evaluation of road segmentation in aforementioned challenging unstructured roadways. R2S100K comprises 100K images extracted from a large and diverse set of video sequences covering more than 1000 km of roadways. Out of these 100K privacy respecting images, 14,000 images have fine pixel-labeling of road regions, with 86,000 unlabeled images that can be leveraged through semi-supervised learning methods. Alongside, we present an Efficient Data Sampling based self-training framework to improve learning by leveraging unlabeled data. Our experimental results demonstrate that the proposed method significantly improves learning methods in generalizability and reduces the labeling cost for semantic segmentation tasks. Our benchmark will be publicly available to facilitate future research at https://r2s100k.github.io/.
AB - Semantic understanding of roadways is a key enabling factor for safe autonomous driving. However, existing autonomous driving datasets provide well-structured urban roads while ignoring unstructured roadways containing distress, potholes, water puddles, and various kinds of road patches i.e., earthen, gravel etc. To this end, we introduce Road Region Segmentation dataset (R2S100K)—a large-scale dataset and benchmark for training and evaluation of road segmentation in aforementioned challenging unstructured roadways. R2S100K comprises 100K images extracted from a large and diverse set of video sequences covering more than 1000 km of roadways. Out of these 100K privacy respecting images, 14,000 images have fine pixel-labeling of road regions, with 86,000 unlabeled images that can be leveraged through semi-supervised learning methods. Alongside, we present an Efficient Data Sampling based self-training framework to improve learning by leveraging unlabeled data. Our experimental results demonstrate that the proposed method significantly improves learning methods in generalizability and reduces the labeling cost for semantic segmentation tasks. Our benchmark will be publicly available to facilitate future research at https://r2s100k.github.io/.
KW - Autonomous driving
KW - Semantic segmentation
KW - Semi-supervised learning
UR - http://www.scopus.com/inward/record.url?scp=85201968717&partnerID=8YFLogxK
U2 - 10.1007/s11263-024-02207-3
DO - 10.1007/s11263-024-02207-3
M3 - Article
AN - SCOPUS:85201968717
SN - 0920-5691
JO - International Journal of Computer Vision
JF - International Journal of Computer Vision
ER -