linux.git - Linux kernel source tree

author	Mikko Perttunen <mperttunen@nvidia.com>	2025-07-08 20:25:08 +0900
committer	Thierry Reding <treding@nvidia.com>	2025-09-11 18:56:35 +0200
commit	63d47cc6eeb27fa0f5b2d9e2e9b950d728b6ca24 (patch)
tree	c6280de4608404ae80c416920a35bb68f4213ac5 /tools/perf/scripts/python/stackcollapse.py
parent	c7d393267c497502fa737607f435f05dfe6e3d9b (diff)

gpu: host1x: Wait prefences outside MLOCK

The current submission opcode sequence first takes the engine MLOCK, and then switches to HOST1X class to wait prefences. This is fine while we only use a single channel per engine and there is no virtualization, since jobs are serialized on that one channel anyway. However, when that assumption doesn't hold, we are keeping the engine locked while not running anything on it while waiting for prefences to complete.

To resolve this, execute wait commands in the beginning of the job outside the engine MLOCK. We still take the HOST1X MLOCK because recent hardware requires register opcodes to be executed within some MLOCK, but the hardware also allows unlimited channels to take the HOST1X MLOCK at the same time.

Signed-off-by: Mikko Perttunen <mperttunen@nvidia.com>
Signed-off-by: Thierry Reding <treding@nvidia.com>
Link: https://lore.kernel.org/r/20250708-host1x-wait-prefences-outside-mlock-v1-1-13e98044e35a@nvidia.com
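
The reordering described above can be illustrated with a small model. This is only a sketch, not the driver code: the step names are hypothetical labels rather than real host1x opcode encodings, and the explicit release of the HOST1X MLOCK before the engine MLOCK is taken is an assumption made for the illustration. The checker reports whether a sequence waits on prefences while the engine MLOCK is held.

```c
#include <stddef.h>

/* Hypothetical step labels for illustration; not real host1x opcodes. */
enum step {
	ACQUIRE_ENGINE_MLOCK,
	RELEASE_ENGINE_MLOCK,
	ACQUIRE_HOST1X_MLOCK,
	RELEASE_HOST1X_MLOCK,	/* assumed step, not stated in the commit message */
	SETCLASS_HOST1X,
	SETCLASS_ENGINE,
	WAIT_PREFENCES,
	RUN_JOB,
};

#define SEQ_LEN(a) (sizeof(a) / sizeof((a)[0]))

/* Old ordering: the engine is locked before the prefence waits. */
static const enum step old_seq[] = {
	ACQUIRE_ENGINE_MLOCK,
	SETCLASS_HOST1X,
	WAIT_PREFENCES,
	SETCLASS_ENGINE,
	RUN_JOB,
	RELEASE_ENGINE_MLOCK,
};

/* New ordering: waits run under the HOST1X MLOCK, before the engine MLOCK. */
static const enum step new_seq[] = {
	ACQUIRE_HOST1X_MLOCK,
	SETCLASS_HOST1X,
	WAIT_PREFENCES,
	RELEASE_HOST1X_MLOCK,
	ACQUIRE_ENGINE_MLOCK,
	SETCLASS_ENGINE,
	RUN_JOB,
	RELEASE_ENGINE_MLOCK,
};

/* Return 1 if any prefence wait occurs while the engine MLOCK is held. */
static int waits_under_engine_mlock(const enum step *seq, size_t n)
{
	int engine_locked = 0;
	size_t i;

	for (i = 0; i < n; i++) {
		if (seq[i] == ACQUIRE_ENGINE_MLOCK)
			engine_locked = 1;
		else if (seq[i] == RELEASE_ENGINE_MLOCK)
			engine_locked = 0;
		else if (seq[i] == WAIT_PREFENCES && engine_locked)
			return 1;
	}
	return 0;
}
```

In this model, `waits_under_engine_mlock(old_seq, SEQ_LEN(old_seq))` flags the old ordering, while the new ordering passes, which is the property the change is after: the engine is never held idle while a job waits for its prefences.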

Diffstat (limited to 'tools/perf/scripts/python/stackcollapse.py')

0 files changed, 0 insertions, 0 deletions

