TY - JOUR
T1 - Active Learning with Noisy Labelers for Improving Classification Accuracy of Connected Vehicles
AU - Abdellatif, Alaa Awad
AU - Chiasserini, Carla Fabiana
AU - Malandrino, Francesco
AU - Mohamed, Amr
AU - Erbad, Aiman
N1 - Publisher Copyright:
© 1967-2012 IEEE.
PY - 2021/4
Y1 - 2021/4
N2 - Machine learning has emerged as a promising paradigm for enabling connected, automated vehicles to autonomously cruise the streets and react to unexpected situations. Reacting to such situations requires accurate classification for uncommon events, which in turn depends on the selection of large, diverse, and high-quality training data. In fact, the data available at a vehicle (e.g., photos of road signs) may be affected by errors or have different levels of resolution and freshness. To tackle this challenge, we propose an active learning framework that, leveraging the information collected through onboard sensors as well as received from other vehicles, effectively deals with scarce and noisy data. Given the information received from neighboring vehicles, our solution: (i) selects which vehicles can reliably generate high-quality training data, and (ii) obtains a reliable subset of data to add to the training set by trading off between two essential features, i.e., quality and diversity. The results, obtained with different real-world datasets, demonstrate that our framework significantly outperforms state-of-the-art solutions, providing high classification accuracy with a limited bandwidth requirement for the data exchange between vehicles.
AB - Machine learning has emerged as a promising paradigm for enabling connected, automated vehicles to autonomously cruise the streets and react to unexpected situations. Reacting to such situations requires accurate classification for uncommon events, which in turn depends on the selection of large, diverse, and high-quality training data. In fact, the data available at a vehicle (e.g., photos of road signs) may be affected by errors or have different levels of resolution and freshness. To tackle this challenge, we propose an active learning framework that, leveraging the information collected through onboard sensors as well as received from other vehicles, effectively deals with scarce and noisy data. Given the information received from neighboring vehicles, our solution: (i) selects which vehicles can reliably generate high-quality training data, and (ii) obtains a reliable subset of data to add to the training set by trading off between two essential features, i.e., quality and diversity. The results, obtained with different real-world datasets, demonstrate that our framework significantly outperforms state-of-the-art solutions, providing high classification accuracy with a limited bandwidth requirement for the data exchange between vehicles.
KW - Connected automated vehicles
KW - data selection
KW - labelers selection
KW - labeling quality
KW - online learning
UR - http://www.scopus.com/inward/record.url?scp=85103037533&partnerID=8YFLogxK
U2 - 10.1109/TVT.2021.3066210
DO - 10.1109/TVT.2021.3066210
M3 - Article
AN - SCOPUS:85103037533
SN - 0018-9545
VL - 70
SP - 3059
EP - 3070
JO - IEEE Transactions on Vehicular Technology
JF - IEEE Transactions on Vehicular Technology
IS - 4
M1 - 9380524
ER -