path: root/include/rdma/ib_mad.h
author Lizhi Hou <lizhi.hou@amd.com> 2024-11-18 09:29:41 -0800
committer Jeffrey Hugo <quic_jhugo@quicinc.com> 2024-11-22 11:44:47 -0700
commit 4fd4ca984b833a41f36bf7b2eaa9025377e310d0
tree 8cc115047f720efcc39aa601a8493179201727b8 /include/rdma/ib_mad.h
parent bed4c73e59e8e32a3dd68a5ea755601ab000bf7b
accel/amdxdna: Add error handling
When there is a hardware error, the NPU firmware notifies the host through a mailbox message. The message includes details of the error, such as the tile and column indexes where the error occurred. The driver starts a thread to handle the NPU error message. The thread stops the clients that are using the column where the error occurred. Then the driver resets that column.

Co-developed-by: Min Ma <min.ma@amd.com>
Signed-off-by: Min Ma <min.ma@amd.com>
Reviewed-by: Jeffrey Hugo <quic_jhugo@quicinc.com>
Signed-off-by: Lizhi Hou <lizhi.hou@amd.com>
Signed-off-by: Jeffrey Hugo <quic_jhugo@quicinc.com>
Link: https://patchwork.freedesktop.org/patch/msgid/20241118172942.2014541-10-lizhi.hou@amd.com
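
The flow described in the commit message (error report arrives, clients on the affected column are stopped, the column is reset) can be sketched in plain user-space C. This is an illustration only, not the amdxdna driver code: every name here (npu_err_msg, npu_client, npu_dev, handle_npu_error, reset_column, NUM_COLUMNS) is hypothetical, the mailbox message is modeled as a simple struct, and the handler runs synchronously rather than in a separate thread.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define NUM_COLUMNS 8u /* hypothetical column count, not taken from the driver */

/* Hypothetical stand-in for the firmware mailbox error message. */
struct npu_err_msg {
	uint32_t tile;   /* tile index where the error occurred */
	uint32_t column; /* column index where the error occurred */
};

/* Hypothetical client: one bit in col_mask per column the client uses. */
struct npu_client {
	uint32_t col_mask;
	bool running;
};

/* Hypothetical device state: the client table and a per-column reset counter. */
struct npu_dev {
	struct npu_client *clients;
	size_t num_clients;
	unsigned int reset_count[NUM_COLUMNS];
};

/* Placeholder for the hardware reset; here it only counts resets per column. */
static void reset_column(struct npu_dev *dev, uint32_t col)
{
	assert(col < NUM_COLUMNS); /* caller validates the index first */
	dev->reset_count[col]++;
}

/*
 * Handle one error report: stop every running client that uses the
 * failing column, then reset that column.
 * Returns the number of clients stopped, or -1 if the column index is invalid.
 */
static int handle_npu_error(struct npu_dev *dev, const struct npu_err_msg *msg)
{
	int stopped = 0;

	if (msg->column >= NUM_COLUMNS)
		return -1;

	for (size_t i = 0; i < dev->num_clients; i++) {
		struct npu_client *c = &dev->clients[i];

		if (c->running && (c->col_mask & (1u << msg->column))) {
			c->running = false;
			stopped++;
		}
	}

	reset_column(dev, msg->column);
	return stopped;
}
```

Stopping the clients before the reset mirrors the order given in the commit message; clients that do not use the failing column are left running.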
Diffstat (limited to 'include/rdma/ib_mad.h')
0 files changed, 0 insertions, 0 deletions