hamoa: sporadic GPU crashes on 7.0.0-rc7 #503

@derSteFfi

I get sporadic crashes (of the GPU firmware?) on a Lenovo ThinkPad T14s. The kernel itself does not crash, but the system is unusable afterwards (I have to reboot from a non-graphical console).

uname -r: 7.0.0-rc7-00666-gbc7c0351d6a1

Kernel log (of the crash):

platform 3d6a000.gmu: [drm:a6xx_gmu_resume [msm]] ERROR GMU firmware initialization timed out
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: hangcheck detected gpu lockup rb 0!
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: completed fence: 14113
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: submitted fence: 14117
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: hangcheck recover!
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: offending task: sway (sway)
adreno 3d00000.gpu: [drm:a6xx_recover [msm]] ERROR cx gdsc didn't collapse
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: hangcheck detected gpu lockup rb 0!
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: completed fence: 14114
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: submitted fence: 14117
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: hangcheck recover!
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: offending task: sway (sway)
hw recovery is not complete for ctl:2
[drm:dpu_encoder_phys_vid_prepare_for_kickoff:569] [dpu error]enc40 intf5 ctl 2 reset failure: -22
[drm:dpu_encoder_frame_done_timeout:2731] [dpu error]enc40 frame done timeout
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 21300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
adreno 3d00000.gpu: [drm:a6xx_recover [msm]] ERROR cx gdsc didn't collapse
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: hangcheck detected gpu lockup rb 0!
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: completed fence: 14115
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: submitted fence: 14117
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: hangcheck recover!
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: offending task: sway (sway)
adreno 3d00000.gpu: [drm:a6xx_recover [msm]] ERROR cx gdsc didn't collapse
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: hangcheck detected gpu lockup rb 0!
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: completed fence: 14116
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: submitted fence: 14117
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: hangcheck recover!
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: offending task: sway (sway)
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 21300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
hw recovery is not complete for ctl:2
[drm:dpu_encoder_phys_vid_prepare_for_kickoff:569] [dpu error]enc40 intf5 ctl 2 reset failure: -22
[drm:dpu_encoder_frame_done_timeout:2731] [dpu error]enc40 frame done timeout
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 21300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: hangcheck detected gpu lockup rb 0!
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: completed fence: 14117
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: submitted fence: 14119
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: hangcheck recover!
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: offending task: sway ()
adreno 3d00000.gpu: [drm:a6xx_recover [msm]] ERROR cx gdsc didn't collapse
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: hangcheck detected gpu lockup rb 0!
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: completed fence: 14118
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: submitted fence: 14119
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: hangcheck recover!
msm_dpu ae01000.display-controller: [drm:recover_worker [msm]] ERROR 67.5.12.1: offending task: sway ()
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 21300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
hw recovery is not complete for ctl:2
[drm:dpu_encoder_phys_vid_prepare_for_kickoff:569] [dpu error]enc40 intf5 ctl 2 reset failure: -22
[drm:dpu_encoder_frame_done_timeout:2731] [dpu error]enc40 frame done timeout
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 21300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
hw recovery is not complete for ctl:2
[drm:dpu_encoder_phys_vid_prepare_for_kickoff:569] [dpu error]enc36 intf4 ctl 2 reset failure: -22
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 80821300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
[drm:dpu_encoder_frame_done_timeout:2731] [dpu error]enc36 frame done timeout
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 80821300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
hw recovery is not complete for ctl:2
[drm:dpu_encoder_phys_vid_prepare_for_kickoff:569] [dpu error]enc36 intf4 ctl 2 reset failure: -22
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 80821300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
[drm:dpu_encoder_frame_done_timeout:2731] [dpu error]enc36 frame done timeout
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 80821300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
hw recovery is not complete for ctl:2
[drm:dpu_encoder_phys_vid_prepare_for_kickoff:569] [dpu error]enc36 intf4 ctl 2 reset failure: -22
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 80821300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
[drm:dpu_encoder_frame_done_timeout:2731] [dpu error]enc36 frame done timeout
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 80821300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
hw recovery is not complete for ctl:2
[drm:dpu_encoder_phys_vid_prepare_for_kickoff:569] [dpu error]enc36 intf4 ctl 2 reset failure: -22
[drm:dpu_encoder_phys_vid_wait_for_commit_done:543] [dpu error]vblank timeout: 80821300
[drm:dpu_kms_wait_for_commit_done:525] [dpu error]wait for commit done returned -110
[drm:dpu_encoder_frame_done_timeout:2731] [dpu error]enc36 frame done timeout
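
To make logs like this easier to compare between crashes, here is a minimal triage sketch. It is an illustration only, not part of the msm driver or any existing tool: the names `summarize`, `FENCE_RE`, `FUNC_RE` and `SAMPLE` are made up for this example, and the regular expressions assume the exact message format quoted above. It counts messages per reporting function and pairs each "completed fence" value with the "submitted fence" value that follows it.

```python
import re
from collections import Counter

# Assumption: messages look like the ones quoted in this report, e.g.
# "... [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: completed fence: 14113"
FENCE_RE = re.compile(
    r"hangcheck_handler \[msm\]\] ERROR \S+ (completed|submitted) fence: (\d+)"
)
# Captures the function name that follows "[drm:" in each message.
FUNC_RE = re.compile(r"\[drm:([A-Za-z0-9_]+)")

# Three lines copied from the log above, used as a small self-test input.
SAMPLE = """\
platform 3d6a000.gmu: [drm:a6xx_gmu_set_oob [msm]] ERROR Timeout waiting for GMU OOB set GPU_SET: 0x0
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: completed fence: 14113
msm_dpu ae01000.display-controller: [drm:hangcheck_handler [msm]] ERROR 67.5.12.1: submitted fence: 14117
"""


def summarize(log_text):
    """Return (message counts per function, list of (completed, submitted) fences)."""
    counts = Counter()
    fences = []        # (completed, submitted) pairs in log order
    completed = None   # last "completed fence" value still waiting for its pair
    for line in log_text.splitlines():
        func = FUNC_RE.search(line)
        if func:
            counts[func.group(1)] += 1
        fence = FENCE_RE.search(line)
        if fence:
            kind, value = fence.group(1), int(fence.group(2))
            if kind == "completed":
                completed = value
            elif completed is not None:
                fences.append((completed, value))
                completed = None
    return counts, fences


counts, fences = summarize(SAMPLE)
for name, n in counts.most_common():
    print(f"{n:4d}  {name}")
for done, submitted in fences:
    print(f"completed={done} submitted={submitted} pending={submitted - done}")
```

Replacing `SAMPLE` with the full log text (for instance read from a saved `dmesg` capture) gives the same summary for the whole crash. In the log above, the completed fence advances by one per recovery cycle (14113 to 14116) while the submitted fence stays at 14117, which is the kind of pattern this is meant to surface.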
