CPU-GPU Layer-Switched Low Latency CNN Inference

Publication
Proc. of the 25th Euromicro Conference on Digital System Design