TY - GEN
T1 - Dynamic Pruning for Distributed Inference via Explainable AI
T2 - IEEE International Conference on Communications (IEEE ICC)
AU - Erbad, Aiman
AU - Mohamed, Amr
AU - Hamdi, Mounir
AU - Guizani, Mohsen
AU - Baccour, Emna
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - The healthcare sector has undergone a significant transformation with the widespread adoption of Deep Neural Networks (DNN). However, due to privacy constraints and stringent latency requirements, online remote inference is not a viable option in healthcare scenarios. Many efforts have been conducted to enable local computation, such as network compression using pruning or DNN distribution among multiple resource-constrained devices. Yet, it is still challenging to conduct distributed inference due to the latency and energy overheads resulting from intermediate shared data. On the other hand, given that realistic healthcare systems use pre-trained models, local pruning and fine-tuning relying only on the scarce and biased data is not possible. Even pre-pruned DNNs are not efficient as they are not customized to the local load of data and the dynamics of devices. The dynamic and online pruning of DNN without fine-tuning is a promising solution; however, it was not considered in the literature as most well-known techniques do not perform well without adjustment. In this paper, driven by the data restrictions in healthcare sector, we propose a novel pruning strategy based on Explainable AI (XAI), with a target to enhance the pruned DNN performance without fine-tuning. Moreover, to maintain the highest possible accuracy, we propose to combine distribution and pruning techniques to perform online distributed inference assisted by dynamic pruning only when needed. Our experiments show the performance of our pruning criterion compared to other reference techniques, in addition to its ability to assist the distribution by reducing the shared data, while keeping high accuracy.
AB - The healthcare sector has undergone a significant transformation with the widespread adoption of Deep Neural Networks (DNN). However, due to privacy constraints and stringent latency requirements, online remote inference is not a viable option in healthcare scenarios. Many efforts have been conducted to enable local computation, such as network compression using pruning or DNN distribution among multiple resource-constrained devices. Yet, it is still challenging to conduct distributed inference due to the latency and energy overheads resulting from intermediate shared data. On the other hand, given that realistic healthcare systems use pre-trained models, local pruning and fine-tuning relying only on the scarce and biased data is not possible. Even pre-pruned DNNs are not efficient as they are not customized to the local load of data and the dynamics of devices. The dynamic and online pruning of DNN without fine-tuning is a promising solution; however, it was not considered in the literature as most well-known techniques do not perform well without adjustment. In this paper, driven by the data restrictions in healthcare sector, we propose a novel pruning strategy based on Explainable AI (XAI), with a target to enhance the pruned DNN performance without fine-tuning. Moreover, to maintain the highest possible accuracy, we propose to combine distribution and pruning techniques to perform online distributed inference assisted by dynamic pruning only when needed. Our experiments show the performance of our pruning criterion compared to other reference techniques, in addition to its ability to assist the distribution by reducing the shared data, while keeping high accuracy.
KW - Healthcare
KW - Xai
KW - Distributed inference
KW - Pruning
KW - Resource constraints
KW - Scarce data
UR - https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=hbku_researchportal&SrcAuth=WosAPI&KeyUT=WOS:001094862603084&DestLinkType=FullRecord&DestApp=WOS_CPL
U2 - 10.1109/ICC45041.2023.10279608
DO - 10.1109/ICC45041.2023.10279608
M3 - Conference contribution
T3 - Ieee International Conference On Communications
SP - 3394
EP - 3400
BT - Icc 2023-ieee International Conference On Communications
A2 - Zorzi, Michele
A2 - Tao, Meixia
A2 - Saad, Walid
PB - IEEE
Y2 - 28 May 2023 through 1 June 2023
ER -