Kubism: Disassembling and Reassembling K-Means Clustering for Mobile Heterogeneous Platforms
Kubism disassembles and reassembles the K-means algorithm to better utilize CPU and GPU resources on mobile heterogeneous platforms, achieving up to 2.65× speedup per iteration and an average 1.23× end-to-end improvement on NVIDIA Jetson Orin AGX.