diff options
author | Lizhi Hou <lizhi.hou@amd.com> | 2024-11-18 09:29:41 -0800 |
---|---|---|
committer | Jeffrey Hugo <quic_jhugo@quicinc.com> | 2024-11-22 11:44:47 -0700 |
commit | 4fd4ca984b833a41f36bf7b2eaa9025377e310d0 (patch) | |
tree | 8cc115047f720efcc39aa601a8493179201727b8 /drivers/infiniband/ulp | |
parent | bed4c73e59e8e32a3dd68a5ea755601ab000bf7b (diff) |
accel/amdxdna: Add error handling
When there is a hardware error, the NPU firmware notifies the host through
a mailbox message. The message includes details of the error, such as the
tile and column indexes where the error occurred.
The driver starts a thread to handle the NPU error message. The thread
stops the clients which are using the column where error occurred. Then
the driver resets that column.
Co-developed-by: Min Ma <min.ma@amd.com>
Signed-off-by: Min Ma <min.ma@amd.com>
Reviewed-by: Jeffrey Hugo <quic_jhugo@quicinc.com>
Signed-off-by: Lizhi Hou <lizhi.hou@amd.com>
Signed-off-by: Jeffrey Hugo <quic_jhugo@quicinc.com>
Link: https://patchwork.freedesktop.org/patch/msgid/20241118172942.2014541-10-lizhi.hou@amd.com
Diffstat (limited to 'drivers/infiniband/ulp')
0 files changed, 0 insertions, 0 deletions