Throughput Maximization of Delay-Aware DNN Inference in Edge Computing by Exploring DNN Model Partitioning and Inference Parallelism

Publication
IEEE Transactions on Mobile Computing (TMC) (CCF-A)