Modern accelerators use hierarchical parallel programming models that enable massive multithreading within a processing element (PE), …