RTSS'17 | GPU Scheduling on the NVIDIA TX2: Hidden Details Revealed
This paper configured multiple experiments to explore the rules of the GPU kernel-level scheduling.
This paper won the Best Paper Award at MobiSys 2021. The paper proposes nn-Meter, a model inference time prediction system that can efficiently and accurately predict the inference latency of DNN models on different edge devices. The key idea is to divide the entire model into kernels (execution units on the device), and then perform kernel-level prediction.
As more and more deep neural networks emerge, systems for predicting network inference performance need to be generalizable to adapt.
Habitat is a Python library that can predict performance on GPUs with the help of a GPU that the user already has.
The approaches for the DL performance analysis today include
Hence, habitat based on the observations: