I am seeing sporadic crashes (possibly of the GPU firmware?) on a Lenovo ThinkPad T14s. The kernel itself does not crash, but the system becomes unusable, and I have to reboot via a non-graphical console.
uname -r: 7.0.0-rc7-00666-gbc7c0351d6a1
Kernel log (of the crash):
platform 3d6a000.gmu: [drm:a6xx_gmu_resume [msm]] ERROR GMU firmware initialization timed out
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: hangcheck detected gpu lockup rb 0!
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: completed fence: 14113
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: submitted fence: 14117
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: hangcheck recover!
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: offending task: sway (sway)
adreno 3d00000.gpu: [drm:a6xx_recover [msm]] ERROR cx gdsc didn't collapse
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: hangcheck detected gpu lockup rb 0!
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: completed fence: 14114
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: submitted fence: 14117
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: hangcheck recover!
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: offending task: sway (sway)
hw recovery is not complete for ctl:2
[drm:dpu_encoder_phys_vid_prepare_for_kickoff:569] [dpu error]enc40 intf5 ctl 2 reset failure: -22
[drm:dpu_encoder_frame_done_timeout:2731] [dpu error]enc40 frame done timeout
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 21300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
adreno 3d00000.gpu: [drm:a6xx_recover [msm]] ERROR cx gdsc didn't collapse
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: hangcheck detected gpu lockup rb 0!
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: completed fence: 14115
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: submitted fence: 14117
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: hangcheck recover!
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: offending task: sway (sway)
adreno 3d00000.gpu: [drm:a6xx_recover [msm]] ERROR cx gdsc didn't collapse
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: hangcheck detected gpu lockup rb 0!
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: completed fence: 14116
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: submitted fence: 14117
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: hangcheck recover!
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: offending task: sway (sway)
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 21300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
hw recovery is not complete for ctl:2
[drm:dpu_encoder_phys_vid_prepare_for_kickoff:569] [dpu error]enc40 intf5 ctl 2 reset failure: -22
[drm:dpu_encoder_frame_done_timeout:2731] [dpu error]enc40 frame done timeout
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 21300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: hangcheck detected gpu lockup rb 0!
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: completed fence: 14117
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: submitted fence: 14119
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: hangcheck recover!
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: offending task: sway ()
adreno 3d00000.gpu: [drm:a6xx_recover [msm]] ERROR cx gdsc didn't collapse
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: hangcheck detected gpu lockup rb 0!
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: completed fence: 14118
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: submitted fence: 14119
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: hangcheck recover!
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: offending task: sway ()
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 21300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
hw recovery is not complete for ctl:2
[drm:dpu_encoder_phys_vid_prepare_for_kickoff:569] [dpu error]enc40 intf5 ctl 2 reset failure: -22
[drm:dpu_encoder_frame_done_timeout:2731] [dpu error]enc40 frame done timeout
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 21300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
hw recovery is not complete for ctl:2
[drm:dpu_encoder_phys_vid_prepare_for_kickoff:569] [dpu error]enc36 intf4 ctl 2 reset failure: -22
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 80821300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
[drm:dpu_encoder_frame_done_timeout:2731] [dpu error]enc36 frame done timeout
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 80821300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
hw recovery is not complete for ctl:2
[drm:dpu_encoder_phys_vid_prepare_for_kickoff:569] [dpu error]enc36 intf4 ctl 2 reset failure: -22
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 80821300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
[drm:dpu_encoder_frame_done_timeout:2731] [dpu error]enc36 frame done timeout
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 80821300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
hw recovery is not complete for ctl:2
[drm:dpu_encoder_phys_vid_prepare_for_kickoff:569] [dpu error]enc36 intf4 ctl 2 reset failure: -22
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 80821300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
[drm:dpu_encoder_frame_done_timeout:2731] [dpu error]enc36 frame done timeout
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 80821300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
hw recovery is not complete for ctl:2
[drm:dpu_encoder_phys_vid_prepare_for_kickoff:569] [dpu error]enc36 intf4 ctl 2 reset failure: -22
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 80821300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
[drm:dpu_encoder_frame_done_timeout:2731] [dpu error]enc36 frame done timeout