Nvidia 560.28.03-1 throwing kernel stack trace with linux kernels from 6.10.3 up to 6.10.9 or newer #705
Comments
Due to this, it seems that X11/Xwayland has stopped working on my end. I cannot even launch any Proton games on Xwayland. |
Same here on the 6.11 kernel when I try to enter sleep mode on Wayland (Ubuntu 24.10 beta):
nvidia-smi:
|
This has been known for months: see #662. No need to create dupes. Though actually it may help NVIDIA prioritize fixing this bug because it's annoying as hell. I get 40KB worth of backtraces on every suspend, and now I simply power off the PC entirely, since I got fed up with this. |
Same here. 560.35.03 and earlier. |
Good to know, following #662 :) |
It's important to keep issues distinct and avoid mislabeling them as duplicates without clear evidence. This bug report has a totally different stack trace signature than #662, and the original report didn't mention any suspend-related issues. |
I don't know if #662 is related, but on the closed-source side this error is already known to NVIDIA (see the NVIDIA forum). I have the same problem on Arch Linux, and my workaround was to temporarily use the LTS kernel, 6.6.52-1-lts. The problem has been posted on the Arch forums since July (see the Arch forum). I hope they fix this soon, because 6.10 basically cannot run the NVIDIA drivers, open or not, without errors. |
I can confirm this issue is indeed resolved with the API changes in kernel 6.12.
|
NVIDIA Open GPU Kernel Modules Version
560.35.03
Please confirm this issue does not happen with the proprietary driver (of the same version). This issue tracker is only for bugs specific to the open kernel driver.
Operating System and Version
Debian GNU/Linux trixie/sid
Kernel Release
6.10.9
Please confirm you are running a stable release kernel (e.g. not a -rc). We do not accept bug reports for unreleased kernels.
Hardware: GPU
NVIDIA GeForce RTX 4090 Laptop GPU
Describe the bug
I am getting lots of errors and a tainted kernel with stack traces in dmesg
with the latest NVIDIA driver 560.28.03-1 and Linux kernel 6.10.3 (for the
full log, see nvidia-bug-report.log.gz attached to this report) on a Debian GNU/Linux setup.
Short summary:
follow_pte+0x1de/0x200.
nv_revoke_gpu_mappings+0x67/0xb0 [nvidia]
RmHandleIdleSustained+0x39/0x130 [nvidia]
rm_execute_work_item+0xe0/0x150 [nvidia]
3. The module list shows NVIDIA modules loaded:
nvidia_uvm(OE)
nvidia_drm(OE)
nvidia_modeset(OE)
nvidia(OE)
The (OE) suffix likely indicates these are out-of-tree (externally
built) modules and NVIDIA is the only OE module I have.
is likely an NVIDIA driver thread.
suggests there might be an issue with how the NVIDIA driver is
handling read-write semaphores in the kernel.
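For anyone comparing their own trace against this one, here is a minimal Python sketch that splits the frames and module entries quoted above into fields. The helper names (`parse_frame`, `out_of_tree_modules`) are made up for illustration and are not part of any NVIDIA or kernel tooling. It assumes the usual `symbol+0xoffset/0xsize [module]` frame layout and the kernel's taint letters, where `O` marks an out-of-tree module and `E` an unsigned one.

```python
import re

# Matches a kernel stack frame such as "nv_revoke_gpu_mappings+0x67/0xb0 [nvidia]".
# The trailing "[module]" part is optional; frames without it belong to the core kernel.
FRAME_RE = re.compile(
    r"(?P<symbol>[A-Za-z_][\w.]*)\+0x(?P<offset>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
    r"(?:\s+\[(?P<module>[\w-]+)\])?"
)

# Matches a module-list entry with taint flags, such as "nvidia_uvm(OE)".
MODULE_RE = re.compile(r"(?P<name>[\w-]+)\((?P<flags>[A-Z]+)\)")


def parse_frame(line):
    """Split one stack-frame line into symbol, offset, size and module (hypothetical helper)."""
    match = FRAME_RE.search(line)
    if match is None:
        return None
    return {
        "symbol": match.group("symbol"),
        "offset": int(match.group("offset"), 16),
        "size": int(match.group("size"), 16),
        # Assumption: a frame with no "[module]" suffix comes from the kernel itself.
        "module": match.group("module") or "kernel",
    }


def out_of_tree_modules(module_list):
    """Return the names of modules whose taint flags include 'O' (out-of-tree)."""
    return [
        match.group("name")
        for match in MODULE_RE.finditer(module_list)
        if "O" in match.group("flags")
    ]


if __name__ == "__main__":
    for frame in ("follow_pte+0x1de/0x200.", "nv_revoke_gpu_mappings+0x67/0xb0 [nvidia]"):
        print(parse_frame(frame))
    print(out_of_tree_modules("nvidia_uvm(OE) nvidia_drm(OE) nvidia_modeset(OE) nvidia(OE)"))
```

Run against the summary above, this attributes `follow_pte` to the core kernel and the remaining three frames to the `nvidia` module, which is the quickest way to see where the driver hands off to the kernel.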
To Reproduce
Boot the 6.10.9 kernel with the latest official NVIDIA driver and check the dmesg logs.
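To check whether a given dmesg capture shows the same trace rather than a different NVIDIA failure (for example the suspend traces discussed in #662), something like the following sketch may help. It only looks for the four symbols from the short summary in the order they are listed there; the function name `has_signature` and the in-order matching rule are my own assumptions, not an official diagnostic.

```python
# Symbols from the short summary of this report, in the order they are listed there.
SIGNATURE = (
    "follow_pte",
    "nv_revoke_gpu_mappings",
    "RmHandleIdleSustained",
    "rm_execute_work_item",
)


def has_signature(log_text, signature=SIGNATURE):
    """Return True if every symbol appears as a stack frame, in order, in log_text."""
    position = 0
    for symbol in signature:
        # Stack frames are printed as "symbol+0xoffset/0xsize", so search for "symbol+0x".
        index = log_text.find(symbol + "+0x", position)
        if index == -1:
            return False
        position = index + len(symbol)
    return True


if __name__ == "__main__":
    import subprocess

    # Reading the kernel log may require root, depending on kernel.dmesg_restrict.
    result = subprocess.run(["dmesg"], capture_output=True, text=True)
    print("signature found" if has_signature(result.stdout) else "signature not found")
```

A match only says the same frames are present. It does not prove the root cause is the same, so attaching the full nvidia-bug-report.log.gz is still worthwhile.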
Bug Incidence
Always
nvidia-bug-report.log.gz
The nvidia-bug-report.log.gz above includes this, but I am also pasting it here for convenience
More Info
No response