367/370.xx + 980m w/4k screen = lock up at boot (Ubuntu 16.10)

UPDATED: Attached nvidia-bug-report to this topic.

Here’s the scenario:

Starting my laptop (Sager NP8658-S / CLEVO 650RG) with the 367.18 drivers results in a crash that can only be exited via REISUB. At startup I see it try to switch modes, but it ends up hanging with a cursor in the top left. I’ve tested this with the 4.4 and 4.6 (mainline) kernels, returning the same results. Rolling back to the 364.19 drivers restores the system to normal operation.

Os: Ubuntu 16.10 pre-alpha
Displays: Built-in = 4k 3840x2160, External HDMI = 2560x1080

May 21 21:20:53 sager kernel: [   77.238821] nvidia-modeset: ERROR: GPU:0: Idling display engine timed out: 0x0000957d:0:0:0x00000040

May 21 21:21:21 sager kernel: [  104.529532] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [Xorg:2319]
May 21 21:21:21 sager kernel: [  104.529535] Modules linked in: drbg ansi_cprng ctr ccm pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) rfcomm bnep vboxdrv(OE) binfmt_misc nls_iso8859_1 snd_usb_audio snd_usbmidi_lib uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_core videodev media btusb btrtl mxm_wmi snd_hda_codec_hdmi arc4 snd_hda_codec_realtek snd_hda_codec_generic intel_rapl x86_pkg_temp_thermal intel_powerclamp iwlmvm kvm_intel snd_hda_intel snd_hda_codec mac80211 kvm snd_hda_core snd_hwdep irqbypass snd_pcm input_leds snd_seq_midi joydev snd_seq_midi_event snd_rawmidi iwlwifi serio_raw snd_seq snd_seq_device cfg80211 snd_timer nvidia_uvm(POE) snd rtsx_pci_ms memstick soundcore mei_me mei shpchp hci_uart btbcm btqca btintel bluetooth wmi intel_lpss_acpi intel_lpss mac_hid acpi_pad tpm_crb coretemp parport_pc ppdev lp parport autofs4 btrfs xor raid6_pq algif_skcipher af_alg dm_crypt hid_generic hid_plantronics usbhid uas usb_storage mmc_block rtsx_pci_sdmmc nvidia_drm(POE) nvidia_modeset(POE) crct10dif_pclmul drm_kms_helper crc32_pclmul ghash_clmulni_intel syscopyarea sysfillrect sysimgblt fb_sys_fops aesni_intel drm aes_x86_64 lrw gf128mul glue_helper ablk_helper cryptd psmouse nvidia(POE) ahci r8169 rtsx_pci libahci mii pinctrl_sunrisepoint i2c_hid video pinctrl_intel hid fjes
May 21 21:21:21 sager kernel: [  104.529617] CPU: 7 PID: 2319 Comm: Xorg Tainted: P           OE   4.6.0-040600-generic #201605151930
May 21 21:21:21 sager kernel: [  104.529618] Hardware name: Notebook                         P65_P67RGRERA/P65_P67RGRERA, BIOS 1.05.13RLS1 02/02/2016
May 21 21:21:21 sager kernel: [  104.529619] task: ffff88007ee95e00 ti: ffff88084fe10000 task.ti: ffff88084fe10000
May 21 21:21:21 sager kernel: [  104.529620] RIP: 0010:[<ffffffffc19f18b5>]  [<ffffffffc19f18b5>] _nv001865kms+0xa5/0x120 [nvidia_modeset]
May 21 21:21:21 sager kernel: [  104.529630] RSP: 0018:ffff88084fe13718  EFLAGS: 00000202
May 21 21:21:21 sager kernel: [  104.529631] RAX: 0000000000000000 RBX: ffff88084ba4b808 RCX: 0000000000000000
May 21 21:21:21 sager kernel: [  104.529632] RDX: 0000000000000000 RSI: 0000000000635619 RDI: ffffffff81e248c0
May 21 21:21:21 sager kernel: [  104.529632] RBP: ffff88007ed45000 R08: 0000000000000001 R09: 0000000000000004
May 21 21:21:21 sager kernel: [  104.529633] R10: 0000000000000020 R11: 0000000000000000 R12: 00053364f958d01f
May 21 21:21:21 sager kernel: [  104.529634] R13: 0000000000000001 R14: ffff88084f70fc08 R15: 0000000000000001
May 21 21:21:21 sager kernel: [  104.529635] FS:  00007f5c58ad3a00(0000) GS:ffff8808765c0000(0000) knlGS:0000000000000000
May 21 21:21:21 sager kernel: [  104.529636] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 21 21:21:21 sager kernel: [  104.529637] CR2: 000056289b0cc3b0 CR3: 000000084c057000 CR4: 00000000003406e0
May 21 21:21:21 sager kernel: [  104.529638] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
May 21 21:21:21 sager kernel: [  104.529638] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
May 21 21:21:21 sager kernel: [  104.529639] Stack:
May 21 21:21:21 sager kernel: [  104.529640]  ffff88084ba4b808 0000000000000000 ffff88084ba4c008 00000000c19e4d25
May 21 21:21:21 sager kernel: [  104.529654]  0000000000000000 0000000000000001 ffff88084ba4c008 ffff88084ba4b808
May 21 21:21:21 sager kernel: [  104.529655]  ffff88084ba4c008 0000000000000001 00000000ffffffff ffffffffc19e8d83
May 21 21:21:21 sager kernel: [  104.529656] Call Trace:
May 21 21:21:21 sager kernel: [  104.529663]  [<ffffffffc19e8d83>] ? _nv001762kms+0xa3/0x110 [nvidia_modeset]
May 21 21:21:21 sager kernel: [  104.529671]  [<ffffffffc1a03d08>] ? _nv001994kms+0x1ca8/0x2300 [nvidia_modeset]
May 21 21:21:21 sager kernel: [  104.529676]  [<ffffffffc19d03e0>] ? nvkms_alloc+0x50/0x60 [nvidia_modeset]
May 21 21:21:21 sager kernel: [  104.529680]  [<ffffffffc19d1520>] ? _nv000325kms+0x30/0x30 [nvidia_modeset]
May 21 21:21:21 sager kernel: [  104.529684]  [<ffffffffc19d154e>] ? _nv000334kms+0x2e/0x40 [nvidia_modeset]
May 21 21:21:21 sager kernel: [  104.529689]  [<ffffffffc19d20e1>] ? nvKmsIoctl+0x161/0x1e0 [nvidia_modeset]
May 21 21:21:21 sager kernel: [  104.529693]  [<ffffffffc19d0d85>] ? nvkms_ioctl_common+0x45/0x80 [nvidia_modeset]
May 21 21:21:21 sager kernel: [  104.529698]  [<ffffffffc19d0e31>] ? nvkms_ioctl+0x71/0xa0 <a target='_blank' rel='noopener noreferrer' href=''></a>[nvidia_modeset]
May 21 21:21:21 sager kernel: [  104.529746]  [<ffffffffc008b080>] ? nvidia_frontend_unlocked_ioctl+0x40/0x50 [nvidia]
May 21 21:21:21 sager kernel: [  104.529748]  [<ffffffff812372c1>] ? do_vfs_ioctl+0xa1/0x5b0
May 21 21:21:21 sager kernel: [  104.529750]  [<ffffffff81225ba1>] ? __sb_end_write+0x21/0x30
May 21 21:21:21 sager kernel: [  104.529751]  [<ffffffff812231cd>] ? vfs_write+0x15d/0x1a0
May 21 21:21:21 sager kernel: [  104.529752]  [<ffffffff81237849>] ? SyS_ioctl+0x79/0x90
May 21 21:21:21 sager kernel: [  104.529755]  [<ffffffff818421b6>] ? entry_SYSCALL_64_fastpath+0x1e/0xa8
May 21 21:21:21 sager kernel: [  104.529755] Code: d5 48 8b 6c 24 20 48 03 93 90 02 00 00 48 03 6c c2 20 e8 df ec fd ff 48 63 54 24 1c 4c 8d a0 c0 c6 2d 00 48 89 54 24 08 8b 45 00 <44> 21 f8 41 39 c5 74 45 e8 be ec fd ff 44 8b 9b 88 04 00 00 45

nvidia-bug-report.log.gz (213 KB)

3vi1, Is there any reproduction steps to hit this issue? What desktop env you are running KDE, GNOME or else? Did you observed this issue on any other system?

Is there any reproduction steps to hit this issue?

This happens consistently at boot with the 367.18 drivers installed; there are no steps needed to trigger it other than having the 367.18 drivers installed and rebooting. I’ve gone back and forth to the 364 drivers (which always work fine) 4 times, and with 367.18 the issue always occurs.

What desktop env you are running

I have KDE, GNOME, and Unity installed, but the hard crash happens right before LightDM should comes up - so I never get to the point of starting a desktop env when this issue occurs.

Did you observed this issue on any other system?

I don’t have any other systems with the same CPU/GPU, so I don’t have a good platform for comparison. I installed using the 367.18 package from the official Ubuntu graphics-drivers PPA (the same place I get the 364 drivers), which obviously must work for some people. There were no visible installation errors when installing that package via a console.

I retested this a few moments ago without the external monitor attached to verify that it has the same problem. It still fails in the same manner:

May 23 14:12:10 sager nvidia-persistenced: Started (557)
May 23 14:12:10 sager nvidia-persistenced: Failed to query NVIDIA devices. Please ensure that the NVIDIA device files (/dev/nvidia*) exist, and that user 123 has read and write permissions for those files.
May 23 14:12:10 sager nvidia-persistenced: The daemon no longer has permission to remove its runtime data directory /var/run/nvidia-persistenced
May 23 14:12:10 sager nvidia-persistenced: Shutdown (557)
May 23 14:12:10 sager nvidia-persistenced: Started (761)
May 23 14:12:10 sager nvidia-persistenced: Shutdown (761)
May 23 14:12:10 sager kernel: [    1.418089] nvidia: module license 'NVIDIA' taints kernel.
May 23 14:12:10 sager kernel: [    1.421691] nvidia: module verification failed: signature and/or required key missing - tainting kernel
May 23 14:12:10 sager kernel: [    1.432635] nvidia-nvlink: Nvlink Core is being initialized, major device number 247
May 23 14:12:10 sager kernel: [    1.482775] nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms  367.18  Mon May 16 17:36:40 PDT 2016
May 23 14:12:10 sager kernel: [    1.486525] [drm] [nvidia-drm] [GPU ID 0x00000100] Loading driver
May 23 14:12:10 sager kernel: [   22.341147] nvidia-uvm: Loaded the UVM driver in 8 mode, major device number 245
May 23 14:12:10 sager nvidia-persistenced: Started (2376)
May 23 14:12:11 sager kernel: [   37.101342] nvidia-modeset: Allocated GPU:0 (GPU-e2f980da-ea7e-4335-6ae3-41ae731aed6d) @ PCI:0000:01:00.0
May 23 14:12:18 sager kernel: [   44.187395] nvidia-modeset: WARNING: GPU:0: Lost display notification; continuing.
May 23 14:12:21 sager kernel: [   46.939167] nvidia-modeset: ERROR: GPU:0: Idling display engine timed out: 0x0000957d:0:0:0x00000040

I have the same issue with Ubuntu 16.04 (kernels 4.4.0-23 to 4.6) (nvidia 361, 364 and 367.18)

I have GTX 980 and I7-5820k. I do not have the issue with driver 355.11. But then this driver does not have modesetting.

I am looking forward for any workaround that can found.

In my case it is triggered when A monitor wakes up. In particular if the monitor (not system) was suspended. I have 3 4k monitor connected to the display port (It takes longer than average to wake up these monitors) . My temporary solution is to disable screensaver and force dpms off.

I’m hitting the same issue. I have a Macbook Pro (2012 edition), and am running Ubuntu 16.04. I tried the 361, 364, and 367 drivers and all have the same issue. About 50% of the time when it boots up the screen is blank, and it seems to be correlated with this message showing up in the syslog:
“[ 15.126301] nvidia-modeset: WARNING: GPU:0: Lost display notification; continuing.”

What does that message mean, and is there a workaround? When I was running 14.04 everything worked fine, although that was with an older driver.

I just tested with the 367.27 drivers (from the Ubuntu graphics-drivers ppa), and am still seeing the same problem.

Backleveling to 364.19, as usual, gets things working again.

Something I noticed today while retesting with the latest 4.4.0-25 Ubuntu yakkety kernel. The system still hangs in the same manner, this is just an observation of a difference:

Xorg-0.log - 364.19 [Working driver]

[    56.177] (--) NVIDIA(0): DPI set to (286, 288); computed from "UseEdidDpi" X config
[    56.177] (--) NVIDIA(0):     option
[    56.177] (II) UnloadModule: "nouveau"
[    56.177] (II) Unloading nouveau
[    56.177] (II) UnloadModule: "modesetting"
[    56.177] (II) Unloading modesetting
[    56.177] (II) UnloadModule: "fbdev"
[    56.177] (II) Unloading fbdev
[    56.177] (II) UnloadSubModule: "fbdevhw"
[    56.177] (II) Unloading fbdevhw
[    56.177] (II) UnloadModule: "vesa"
[    56.177] (II) Unloading vesa
[    56.177] (--) Depth 24 pixmap format is 32 bpp
[    56.177] (II) NVIDIA: Using 3072.00 MB of virtual memory for indirect memory
[    56.177] (II) NVIDIA:     access.
[    56.212] (II) NVIDIA(0): Setting mode "DFP-1:nvidia-auto-select"
[    57.251] (==) NVIDIA(0): Disabling shared memory pixmaps
[    57.251] (==) NVIDIA(0): Backing store enabled
[    57.251] (==) NVIDIA(0): Silken mouse enabled
[    57.252] (==) NVIDIA(0): DPMS enabled
...

Xorg-0.log - 367.27 [Broken]

[    33.177] (--) NVIDIA(0): DPI set to (286, 288); computed from "UseEdidDpi" X config
[    33.177] (--) NVIDIA(0):     option
[    33.177] (II) UnloadModule: "nouveau"
[    33.177] (II) Unloading nouveau
[    33.177] (II) UnloadModule: "modesetting"
[    33.177] (II) Unloading modesetting
[    33.177] (II) UnloadModule: "fbdev"
[    33.177] (II) Unloading fbdev
[    33.177] (II) UnloadSubModule: "fbdevhw"
[    33.177] (II) Unloading fbdevhw
[    33.177] (II) UnloadModule: "vesa"
[    33.177] (II) Unloading vesa
[    33.177] (--) Depth 24 pixmap format is 32 bpp
[    33.178] (II) NVIDIA: Using 12288.00 MB of virtual memory for indirect memory
[    33.178] (II) NVIDIA:     access.
[    37.202] (II) NVIDIA(0): Setting mode "DFP-1:nvidia-auto-select"
---LOG ENDS HERE when system hangs---

The amount of virtual memory used is very (4x) different between the two driver version. This laptop has 32GB of ram, but I think the 980M itself is only 8GB. I’m not sure why the newer driver would want 12GB for indirect access.

Is this by design?

I’m still unable to generate an nvidia-bug-report after the driver attempts to load. As soon as I make a remote SSH connection to the machine, that session hangs and never presents a MOTD/shell prompt.

odror and cberner, 3vi1 , Please provide nvidia bug report and repro steps of any.

odror, What laptop you using ?

Are you all exact same issue described by 3vi1 in first comment ?

My issue is exactly the same, except that I don’t even see the mouse cursor. For me, the following steps reproduce it:

  1. install Ubuntu 16.04
  2. enable Nvidia drivers via the Additional Drivers program
  3. reboot several times (blank screen happens ~30% of the time)

Also, to note, I have a Dell U3011 monitor that I use with my MacBook Pro, but the issues happens even when the external monitor is not connected.

Here’s the Dropbox link to my Nvidia log: Dropbox - File Deleted

This was captured when my computer worked fine, as I can’t log into it when the issue occurs.

One other data point. I previously had 14.04 installed and there I did not have this issue.

Let me know if there’s additional information that would be useful!

Different topic, but apparently the same problem:
[url]https://devtalk.nvidia.com/default/topic/941096/linux/ubuntu-16-nvidia-36x-drivers-black-screen/?offset=11#4909836[/url]

Ubuntu 14 → works
Ubuntu 16 → not working

Is this a bug in the NVIDIA driver? If so, is there a way to make NVIDIA aware of it? It’s really annoying, I unable to use my computer for weeks now :-(

This problem persists with the 367.35 driver. Backleveling to 364.19 works fine, as usual:

Jul 16 17:09:25 sager systemd[1]: Startup finished in 17.226s (kernel) + 22.919s (userspace) = 40.146s.
Jul 16 17:09:27 sager kernel: [   42.497180] nvidia-modeset: ERROR: GPU:0: Idling display engine timed out: 0x0000957d:0:0:0x00000040
Jul 16 17:09:53 sager kernel: [   68.107441] NMI watchdog: BUG: soft lockup - CPU#2 stuck for 22s! [Xorg:4942]
Jul 16 17:09:53 sager kernel: [   68.107444] Modules linked in: pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) rfcomm bnep binfmt_misc arc4 mxm_wmi snd_hda_codec_realtek snd_hda_codec_generic intel_rapl snd_usb_audio btusb x86_pkg_temp_thermal nls_iso8859_1 snd_hda_codec_hdmi snd_usbmidi_lib btrtl intel_powerclamp kvm_intel i915_bpo iwlmvm kvm mac80211 irqbypass intel_ips i2c_algo_bit snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep nvidia_uvm(POE) snd_seq_midi snd_seq_midi_event snd_pcm iwlwifi snd_rawmidi input_leds joydev snd_seq serio_raw cfg80211 snd_seq_device snd_timer rtsx_pci_ms memstick snd soundcore hci_uart btbcm mei_me btqca mei btintel bluetooth shpchp wmi intel_lpss_acpi intel_lpss mac_hid tpm_crb acpi_pad coretemp parport_pc ppdev lp parport autofs4 btrfs xor raid6_pq drbg ansi_cprng algif_skcipher af_alg hid_generic dm_crypt mmc_block hid_plantronics usbhid rtsx_pci_sdmmc nvidia_drm(POE) nvidia_modeset(POE) drm_kms_helper crct10dif_pclmul crc32_pclmul syscopyarea sysfillrect sysimgblt fb_sys_fops drm aesni_intel aes_x86_64 lrw gf128mul glue_helper ablk_helper cryptd psmouse nvidia(POE) r8169 rtsx_pci ahci mii libahci i2c_hid hid pinctrl_sunrisepoint video pinctrl_intel fjes
Jul 16 17:09:53 sager kernel: [   68.107541] CPU: 2 PID: 4942 Comm: Xorg Tainted: P           OE   4.4.0-30-generic #49-Ubuntu
Jul 16 17:09:53 sager kernel: [   68.107542] Hardware name: Notebook                         P65_P67RGRERA/P65_P67RGRERA, BIOS 1.05.13RLS1 02/02/2016
Jul 16 17:09:53 sager kernel: [   68.107543] task: ffff880845b4e740 ti: ffff88084fa08000 task.ti: ffff88084fa08000
Jul 16 17:09:53 sager kernel: [   68.107544] RIP: 0010:[<ffffffffc195e9d0>]  [<ffffffffc195e9d0>] _nv001865kms+0xd0/0x120 [nvidia_modeset]
Jul 16 17:09:53 sager kernel: [   68.107553] RSP: 0018:ffff88084fa0b728  EFLAGS: 00000293
Jul 16 17:09:53 sager kernel: [   68.107554] RAX: 00000000000000dc RBX: ffff88084fb06008 RCX: 0000000000000000
Jul 16 17:09:53 sager kernel: [   68.107555] RDX: 0000000000000000 RSI: 000000000062c443 RDI: ffffffff81e27b80
Jul 16 17:09:53 sager kernel: [   68.107555] RBP: ffff88084de81000 R08: 0000000000000001 R09: 0000000000000004
Jul 16 17:09:53 sager kernel: [   68.107556] R10: 0000000000000020 R11: 0000000000000000 R12: 000537c7fd4281f4
Jul 16 17:09:53 sager kernel: [   68.107557] R13: 0000000000000001 R14: ffff880035486d08 R15: 0000000000000001
Jul 16 17:09:53 sager kernel: [   68.107558] FS:  00007f4c26a6da00(0000) GS:ffff880876480000(0000) knlGS:0000000000000000
Jul 16 17:09:53 sager kernel: [   68.107558] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 16 17:09:53 sager kernel: [   68.107559] CR2: 000056066147ab18 CR3: 0000000846de4000 CR4: 00000000003406e0
Jul 16 17:09:53 sager kernel: [   68.107560] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Jul 16 17:09:53 sager kernel: [   68.107561] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Jul 16 17:09:53 sager kernel: [   68.107561] Stack:
Jul 16 17:09:53 sager kernel: [   68.107562]  ffff88084fb06008 0000000000000000 ffff88084fb06808 00000000c1951d45
Jul 16 17:09:53 sager kernel: [   68.107563]  0000000000000000 0000000000000001 ffff88084fb06808 ffff88084fb06008
Jul 16 17:09:53 sager kernel: [   68.107564]  ffff88084fb06808 0000000000000001 00000000ffffffff ffffffffc1955da3
Jul 16 17:09:53 sager kernel: [   68.107565] Call Trace:
Jul 16 17:09:53 sager kernel: [   68.107572]  [<ffffffffc1955da3>] ? _nv001762kms+0xa3/0x110 [nvidia_modeset]
Jul 16 17:09:53 sager kernel: [   68.107580]  [<ffffffffc1970e18>] ? _nv001994kms+0x1ca8/0x2300 [nvidia_modeset]
Jul 16 17:09:53 sager kernel: [   68.107585]  [<ffffffffc193d3e0>] ? nvkms_alloc+0x50/0x60 [nvidia_modeset]
Jul 16 17:09:53 sager kernel: [   68.107589]  [<ffffffffc193e540>] ? _nv000325kms+0x30/0x30 [nvidia_modeset]
Jul 16 17:09:53 sager kernel: [   68.107593]  [<ffffffffc193e56e>] ? _nv000334kms+0x2e/0x40 [nvidia_modeset]
Jul 16 17:09:53 sager kernel: [   68.107597]  [<ffffffffc193f101>] ? nvKmsIoctl+0x161/0x1e0 [nvidia_modeset]
Jul 16 17:09:53 sager kernel: [   68.107602]  [<ffffffffc193dda5>] ? nvkms_ioctl_common+0x45/0x80 [nvidia_modeset]
Jul 16 17:09:53 sager kernel: [   68.107606]  [<ffffffffc193de51>] ? nvkms_ioctl+0x71/0xa0 [nvidia_modeset]
Jul 16 17:09:53 sager kernel: [   68.107655]  [<ffffffffc0073080>] ? nvidia_frontend_unlocked_ioctl+0x40/0x50 [nvidia]
Jul 16 17:09:53 sager kernel: [   68.107658]  [<ffffffff81220c0f>] ? do_vfs_ioctl+0x29f/0x490
Jul 16 17:09:53 sager kernel: [   68.107660]  [<ffffffff8120f7e1>] ? __sb_end_write+0x21/0x30
Jul 16 17:09:53 sager kernel: [   68.107661]  [<ffffffff8120d3ed>] ? vfs_write+0x15d/0x1a0
Jul 16 17:09:53 sager kernel: [   68.107662]  [<ffffffff81220e79>] ? SyS_ioctl+0x79/0x90
Jul 16 17:09:53 sager kernel: [   68.107664]  [<ffffffff8182db32>] ? entry_SYSCALL_64_fastpath+0x16/0x71
Jul 16 17:09:53 sager kernel: [   68.107665] Code: 44 21 f8 41 39 c5 74 45 e8 ce eb fd ff 44 8b 9b 88 04 00 00 45 85 db 75 e4 49 39 c4 73 df 48 8b 4c 24 08 49 8b 44 ce 60 8b 40 04 <41> 39 86 b4 00 00 00 75 c9 48 8b 7c 24 10 48 c7 c2 18 59 9b c1

Decided to test with the latest 4.7 mainline kernel today, since this laptop is Skylake. But it unfortunately seems to have the same type of issue with 367.35:

Jul 26 10:14:17 sager kernel: [   81.590322] INFO: rcu_sched self-detected stall on CPU
Jul 26 10:14:17 sager kernel: [   81.590326] 	5-...: (5249 ticks this GP) idle=c97/140000000000001/0 softirq=4914/4914 fqs=5249 
Jul 26 10:14:17 sager kernel: [   81.590327] 	 (t=5250 jiffies g=2856 c=2855 q=13368)
Jul 26 10:14:17 sager kernel: [   81.590329] Task dump for CPU 5:
Jul 26 10:14:17 sager kernel: [   81.590330] Xorg            R  running task        0  9801   9772 0x00400008
Jul 26 10:14:17 sager kernel: [   81.590332]  ffffffff81a4f000 000000000a4e3ce4 ffffffff811755bc ffff880876557b00
Jul 26 10:14:17 sager kernel: [   81.590333]  ffffffff81a4f000 0000000000000000 ffff880849445040 ffffffff810dbcf3
Jul 26 10:14:17 sager kernel: [   81.590335]  ffffffff810e7fee 003b9aca00000000 ffff880876543ee8 0000000000000046
Jul 26 10:14:17 sager kernel: [   81.590336] Call Trace:
Jul 26 10:14:17 sager kernel: [   81.590360]  <IRQ>  [<ffffffff811755bc>] ? rcu_dump_cpu_stacks+0x6e/0x87
Jul 26 10:14:17 sager kernel: [   81.590367]  [<ffffffff810dbcf3>] ? rcu_check_callbacks+0x773/0x810
Jul 26 10:14:17 sager kernel: [   81.590368]  [<ffffffff810e7fee>] ? timekeeping_update+0xee/0x150
Jul 26 10:14:17 sager kernel: [   81.590369]  [<ffffffff810e9875>] ? update_wall_time+0x485/0x790
Jul 26 10:14:17 sager kernel: [   81.590371]  [<ffffffff810f1100>] ? tick_sched_handle.isra.13+0x50/0x50
Jul 26 10:14:17 sager kernel: [   81.590372]  [<ffffffff810e20f2>] ? update_process_times+0x32/0x60
Jul 26 10:14:17 sager kernel: [   81.590374]  [<ffffffff810f10d0>] ? tick_sched_handle.isra.13+0x20/0x50
Jul 26 10:14:17 sager kernel: [   81.590376]  [<ffffffff810f1138>] ? tick_sched_timer+0x38/0x70
Jul 26 10:14:17 sager kernel: [   81.590377]  [<ffffffff810e299a>] ? __hrtimer_run_queues+0xea/0x280
Jul 26 10:14:17 sager kernel: [   81.590394]  [<ffffffff810e3119>] ? hrtimer_interrupt+0x99/0x190
Jul 26 10:14:17 sager kernel: [   81.590396]  [<ffffffff815ee4f9>] ? smp_apic_timer_interrupt+0x39/0x50
Jul 26 10:14:17 sager kernel: [   81.590397]  [<ffffffff815ec822>] ? apic_timer_interrupt+0x82/0x90
Jul 26 10:14:17 sager kernel: [   81.590398]  <EOI>  [<ffffffffc1b76870>] ? _nv001865kms+0xd0/0x120 [nvidia_modeset]
Jul 26 10:14:17 sager kernel: [   81.590440]  [<ffffffffc1b6dc43>] ? _nv001762kms+0xa3/0x110 [nvidia_modeset]
Jul 26 10:14:17 sager kernel: [   81.590449]  [<ffffffffc1b88cb8>] ? _nv001994kms+0x1ca8/0x2300 [nvidia_modeset]
Jul 26 10:14:17 sager kernel: [   81.590454]  [<ffffffffc1b563e0>] ? _nv000325kms+0x30/0x30 [nvidia_modeset]
Jul 26 10:14:17 sager kernel: [   81.590458]  [<ffffffffc1b5640e>] ? _nv000334kms+0x2e/0x40 [nvidia_modeset]
Jul 26 10:14:17 sager kernel: [   81.590462]  [<ffffffffc1b56fa1>] ? nvKmsIoctl+0x161/0x1e0 [nvidia_modeset]
Jul 26 10:14:17 sager kernel: [   81.590467]  [<ffffffffc1b55c5e>] ? nvkms_ioctl_common+0x3e/0x80 [nvidia_modeset]
Jul 26 10:14:17 sager kernel: [   81.590471]  [<ffffffffc1b55d0f>] ? nvkms_ioctl+0x6f/0xa0 [nvidia_modeset]
Jul 26 10:14:17 sager kernel: [   81.590520]  [<ffffffffc006a06c>] ? nvidia_frontend_unlocked_ioctl+0x3c/0x40 [nvidia]
Jul 26 10:14:17 sager kernel: [   81.590522]  [<ffffffff81209cfd>] ? do_vfs_ioctl+0x9d/0x5c0
Jul 26 10:14:17 sager kernel: [   81.590524]  [<ffffffff811f6943>] ? vfs_write+0x163/0x1a0
Jul 26 10:14:17 sager kernel: [   81.590525]  [<ffffffff8120a294>] ? SyS_ioctl+0x74/0x80
Jul 26 10:14:17 sager kernel: [   81.590528]  [<ffffffff815ebc36>] ? entry_SYSCALL_64_fastpath+0x1e/0xa8
Jul 26 10:14:19 sager kernel: [   82.938396] INFO: rcu_sched detected expedited stalls on CPUs/tasks: { 5-... } 5250 jiffies s: 33 root: 0x20/.
Jul 26 10:14:19 sager kernel: [   82.938400] blocking rcu_node structures:
Jul 26 10:14:19 sager kernel: [   82.938401] Task dump for CPU 5:
Jul 26 10:14:19 sager kernel: [   82.938402] Xorg            R  running task        0  9801   9772 0x00400008
Jul 26 10:14:19 sager kernel: [   82.938421]  ffff88082686ab08 0000000000000000 0000000000000000 ffffffffc023e856
Jul 26 10:14:19 sager kernel: [   82.938423]  ffff8808350fa7c8 0000000000000010 ffffc90003b964f0 ffff8808350f8008
Jul 26 10:14:19 sager kernel: [   82.938424]  0000000000000010 ffffffffc0415e94 ffff88082686af88 0000000000000003
Jul 26 10:14:19 sager kernel: [   82.938425] Call Trace:
Jul 26 10:14:19 sager kernel: [   82.938550]  [<ffffffffc023e856>] ? _nv014256rm+0xf6/0x1d0 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.938668]  [<ffffffffc0415e94>] ? _nv017187rm+0x24/0x70 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.938785]  [<ffffffffc03f92fd>] ? _nv017014rm+0x8d/0x230 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.938893]  [<ffffffffc033e5fd>] ? _nv011093rm+0xfd/0x170 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.938999]  [<ffffffffc033e5fd>] ? _nv011093rm+0xfd/0x170 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.939037]  [<ffffffffc0075c5e>] ? os_acquire_spinlock+0xe/0x20 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.939142]  [<ffffffffc033de0b>] ? _nv011359rm+0x37b/0x560 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.939244]  [<ffffffffc0337a8c>] ? _nv011422rm+0xac/0x190 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.939349]  [<ffffffffc0337b28>] ? _nv011422rm+0x148/0x190 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.939454]  [<ffffffffc0337953>] ? _nv011419rm+0xd3/0x150 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.939521]  [<ffffffffc0552688>] ? _nv011533rm+0x8/0x40 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.939588]  [<ffffffffc055300d>] ? _nv011576rm+0xd/0x40 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.939655]  [<ffffffffc055317c>] ? _nv011602rm+0x2c/0xd0 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.939720]  [<ffffffffc05864c9>] ? _nv018778rm+0x619/0x700 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.939794]  [<ffffffffc020085f>] ? _nv006395rm+0xdf/0xf0 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.939796]  [<ffffffff810bdee2>] ? up+0x12/0x60
Jul 26 10:14:19 sager kernel: [   82.939861]  [<ffffffffc0585620>] ? _nv018729rm+0x10/0x30 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.939926]  [<ffffffffc05855d0>] ? _nv018730rm+0x90/0xd0 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.939928]  [<ffffffff81033fc5>] ? read_tsc+0x5/0x10
Jul 26 10:14:19 sager kernel: [   82.939930]  [<ffffffff810e8bb7>] ? __getnstimeofday64+0x37/0xc0
Jul 26 10:14:19 sager kernel: [   82.939931]  [<ffffffff810e8c95>] ? do_gettimeofday+0x25/0x90
Jul 26 10:14:19 sager kernel: [   82.939938]  [<ffffffffc1b55511>] ? nvkms_get_usec+0x21/0x50 [nvidia_modeset]
Jul 26 10:14:19 sager kernel: [   82.939943]  [<ffffffffc1b76852>] ? _nv001865kms+0xb2/0x120 [nvidia_modeset]
Jul 26 10:14:19 sager kernel: [   82.939949]  [<ffffffffc1b6dc43>] ? _nv001762kms+0xa3/0x110 [nvidia_modeset]
Jul 26 10:14:19 sager kernel: [   82.939957]  [<ffffffffc1b88cb8>] ? _nv001994kms+0x1ca8/0x2300 [nvidia_modeset]
Jul 26 10:14:19 sager kernel: [   82.939962]  [<ffffffffc1b563e0>] ? _nv000325kms+0x30/0x30 [nvidia_modeset]
Jul 26 10:14:19 sager kernel: [   82.939966]  [<ffffffffc1b5640e>] ? _nv000334kms+0x2e/0x40 [nvidia_modeset]
Jul 26 10:14:19 sager kernel: [   82.939971]  [<ffffffffc1b56fa1>] ? nvKmsIoctl+0x161/0x1e0 [nvidia_modeset]
Jul 26 10:14:19 sager kernel: [   82.939975]  [<ffffffffc1b55c5e>] ? nvkms_ioctl_common+0x3e/0x80 [nvidia_modeset]
Jul 26 10:14:19 sager kernel: [   82.939980]  [<ffffffffc1b55d0f>] ? nvkms_ioctl+0x6f/0xa0 [nvidia_modeset]
Jul 26 10:14:19 sager kernel: [   82.940016]  [<ffffffffc006a06c>] ? nvidia_frontend_unlocked_ioctl+0x3c/0x40 [nvidia]
Jul 26 10:14:19 sager kernel: [   82.940018]  [<ffffffff81209cfd>] ? do_vfs_ioctl+0x9d/0x5c0
Jul 26 10:14:19 sager kernel: [   82.940020]  [<ffffffff811f6943>] ? vfs_write+0x163/0x1a0
Jul 26 10:14:19 sager kernel: [   82.940021]  [<ffffffff8120a294>] ? SyS_ioctl+0x74/0x80
Jul 26 10:14:19 sager kernel: [   82.940024]  [<ffffffff815ebc36>] ? entry_SYSCALL_64_fastpath+0x1e/0xa8

Backleveling to 364 fixes the problem as usual, but I had to also revert back to the released yakkety 4.4 kernel since the 364 drivers won’t build against the 4.7 headers.

Exact same symptoms when upgrading to the 370.23 drivers. :\

My problem was “mostly” solved when I switched to kernel 4.6.6 from the Mainline. After You switch to this kernel. Do not reboot. Shutdown and start again. If you reboot for some unknown reason the system load jumps.

Also If you disable modesetting for Nvidia it may help. The solution above works without disabling modesetting.

I have also tried Kernel 4.7. This kernel have other issues. 4.6.6 seems to be the best. You need to shutdown and start. Do not reboot. Can you report if this solves your issue.

I use Nvidia 367.35 driver. I have a 3 4k monitor system. One of them does not wake up from sleep (sometimes) which triggers the issue. It is not the only thing that triggers the issue. Again with kernel 4.6.6 It does not happen (I have been testing it for more than a week).

No joy. Mainline kernel 4.6.6 + 370.35 crashes the same way, even when starting in the way you suggested:

Aug 17 15:51:53 sager kernel: [   72.336865] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [Xorg:6202]
Aug 17 15:51:53 sager kernel: [   72.336867] Modules linked in: pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) rfcomm cmac bnep vboxdrv(OE) binfmt_misc snd_hda_codec_hdmi mxm_wmi nls_iso8859_1 arc4 snd_usb_audio snd_usbmidi_lib uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_core videodev media btusb btrtl snd_hda_codec_realtek snd_hda_codec_generic intel_rapl x86_pkg_temp_thermal intel_powerclamp kvm_intel snd_hda_intel snd_hda_codec kvm snd_hda_core snd_hwdep irqbypass snd_pcm joydev iwlmvm input_leds snd_seq_midi snd_seq_midi_event nvidia_uvm(POE) mac80211 snd_rawmidi serio_raw snd_seq snd_seq_device iwlwifi snd_timer rtsx_pci_ms cfg80211 snd memstick soundcore mei_me mei hci_uart btbcm btqca btintel bluetooth intel_lpss_acpi wmi shpchp intel_lpss tpm_crb acpi_pad mac_hid coretemp parport_pc ppdev lp parport autofs4 btrfs xor raid6_pq algif_skcipher af_alg dm_crypt hid_generic usbhid mmc_block rtsx_pci_sdmmc nvidia_drm(POE) nvidia_modeset(POE) drm_kms_helper crct10dif_pclmul crc32_pclmul syscopyarea ghash_clmulni_intel sysfillrect sysimgblt aesni_intel fb_sys_fops aes_x86_64 lrw gf128mul glue_helper drm ablk_helper rtsx_pci cryptd r8169 mii psmouse nvidia(POE) ahci libahci i2c_hid pinctrl_sunrisepoint hid pinctrl_intel video fjes
Aug 17 15:51:53 sager kernel: [   72.336947] CPU: 3 PID: 6202 Comm: Xorg Tainted: P           OE   4.6.6-040606-generic #201608100733
Aug 17 15:51:53 sager kernel: [   72.336948] Hardware name: Notebook                         P65_P67RGRERA/P65_P67RGRERA, BIOS 1.05.13RLS1 02/02/2016
Aug 17 15:51:53 sager kernel: [   72.336949] task: ffff88084ec4af00 ti: ffff88084bfec000 task.ti: ffff88084bfec000
Aug 17 15:51:53 sager kernel: [   72.336950] RIP: 0010:[<ffffffffc19f3bb0>]  [<ffffffffc19f3bb0>] _nv001875kms+0xd0/0x120 [nvidia_modeset]
Aug 17 15:51:53 sager kernel: [   72.336960] RSP: 0018:ffff88084bfef700  EFLAGS: 00000287
Aug 17 15:51:53 sager kernel: [   72.336960] RAX: 00000000000000dc RBX: ffff880832880808 RCX: 0000000000000000
Aug 17 15:51:53 sager kernel: [   72.336961] RDX: 0000000000000000 RSI: 000000000123b487 RDI: 0030dfcd1a02294e
Aug 17 15:51:53 sager kernel: [   72.336962] RBP: ffff88084fcf6000 R08: 0000000000000001 R09: 0000000000000004
Aug 17 15:51:53 sager kernel: [   72.336963] R10: 0000000000000020 R11: 0000000000000000 R12: 00053a4aa145101d
Aug 17 15:51:53 sager kernel: [   72.336963] R13: 0000000000000001 R14: ffff88084715c008 R15: 0000000000000001
Aug 17 15:51:53 sager kernel: [   72.336964] FS:  00007f52e056aa40(0000) GS:ffff8808764c0000(0000) knlGS:0000000000000000
Aug 17 15:51:53 sager kernel: [   72.336965] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 17 15:51:53 sager kernel: [   72.336966] CR2: 00007fd2352e8f10 CR3: 000000084b5b6000 CR4: 00000000003406e0
Aug 17 15:51:53 sager kernel: [   72.336967] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Aug 17 15:51:53 sager kernel: [   72.336967] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Aug 17 15:51:53 sager kernel: [   72.336968] Stack:
Aug 17 15:51:53 sager kernel: [   72.336968]  ffff880832880808 0000000000000000 ffff880832881008 00000000c19e6ea5
Aug 17 15:51:53 sager kernel: [   72.336970]  0000000000000000 0000000000000001 ffff880832881008 ffff880832880808
Aug 17 15:51:53 sager kernel: [   72.336971]  ffff880832881008 0000000000000001 00000000ffffffff ffffffffc19ea713
Aug 17 15:51:53 sager kernel: [   72.336972] Call Trace:
Aug 17 15:51:53 sager kernel: [   72.336979]  [<ffffffffc19ea713>] ? _nv001769kms+0xa3/0x110 [nvidia_modeset]
Aug 17 15:51:53 sager kernel: [   72.336987]  [<ffffffffc1a05ff8>] ? _nv002008kms+0x1ca8/0x2300 [nvidia_modeset]
Aug 17 15:51:53 sager kernel: [   72.336992]  [<ffffffffc19d23f0>] ? nvkms_alloc+0x50/0x60 [nvidia_modeset]
Aug 17 15:51:53 sager kernel: [   72.336996]  [<ffffffffc19d31a0>] ? _nv000310kms+0x60/0x60 [nvidia_modeset]
Aug 17 15:51:53 sager kernel: [   72.337000]  [<ffffffffc19d31ce>] ? _nv000340kms+0x2e/0x40 [nvidia_modeset]
Aug 17 15:51:53 sager kernel: [   72.337004]  [<ffffffffc19d4101>] ? nvKmsIoctl+0x161/0x1e0 [nvidia_modeset]
Aug 17 15:51:53 sager kernel: [   72.337009]  [<ffffffffc19d2da5>] ? nvkms_ioctl_common+0x45/0x80 [nvidia_modeset]
Aug 17 15:51:53 sager kernel: [   72.337013]  [<ffffffffc19d2e51>] ? nvkms_ioctl+0x71/0xa0 [nvidia_modeset]
Aug 17 15:51:53 sager kernel: [   72.337065]  [<ffffffffc0071080>] ? nvidia_frontend_compat_ioctl+0x40/0x50 [nvidia]
Aug 17 15:51:53 sager kernel: [   72.337104]  [<ffffffffc007109e>] ? nvidia_frontend_unlocked_ioctl+0xe/0x10 [nvidia]
Aug 17 15:51:53 sager kernel: [   72.337106]  [<ffffffff81239c03>] ? do_vfs_ioctl+0xa3/0x5d0
Aug 17 15:51:53 sager kernel: [   72.337107]  [<ffffffff81228061>] ? __sb_end_write+0x21/0x30
Aug 17 15:51:53 sager kernel: [   72.337108]  [<ffffffff812256bc>] ? vfs_write+0x14c/0x190
Aug 17 15:51:53 sager kernel: [   72.337109]  [<ffffffff8123a1a9>] ? SyS_ioctl+0x79/0x90
Aug 17 15:51:53 sager kernel: [   72.337111]  [<ffffffff8184cf36>] ? entry_SYSCALL_64_fastpath+0x1e/0xa8
Aug 17 15:51:53 sager kernel: [   72.337112] Code: 44 21 f8 41 39 c5 74 45 e8 ee e9 fd ff 44 8b 9b 80 04 00 00 45 85 db 75 e4 49 39 c4 73 df 48 8b 4c 24 08 49 8b 44 ce 60 8b 40 04 <41> 39 86 b4 00 00 00 75 c9 48 8b 7c 24 10 48 c7 c2 28 a9 a4 c1

364.19 continues to work as a fallback.

Did you try disabling the nvidia modesetting as a kernel parameter.

It appears to make no difference; even with nomodeset, and the intel/nouveau nomodesetting for good measure, the nvidia-modeset executes and hangs at startup:

Aug 18 08:08:34 sager kernel: [    0.000000] Kernel command line: BOOT_IMAGE=/vmlinuz-4.6.6-040606-generic root=/dev/mapper/ubuntu--vg-root ro acpi_enforce_resources=lax nomodeset i915.modeset=0 nouveau.modeset=0
...
Aug 18 08:08:42 sager kernel: [   42.831120] nvidia-modeset: WARNING: GPU:0: Lost display notification; continuing.
Aug 18 08:08:45 sager kernel: [   45.586628] nvidia-modeset: ERROR: GPU:0: Idling display engine timed out: 0x0000957d:0:0:0x00000040
Aug 18 08:09:11 sager kernel: [   72.177195] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 23s! [Xorg:6165]
Aug 18 08:09:11 sager kernel: [   72.177197] Modules linked in: pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) rfcomm cmac bnep vboxdrv(OE) binfmt_misc mxm_wmi snd_hda_codec_hdmi snd_usb_audio snd_usbmidi_lib btusb nls_iso8859_1 btrtl uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_core videodev media intel_rapl x86_pkg_temp_thermal snd_hda_codec_realtek intel_powerclamp snd_hda_codec_generic iwlmvm kvm_intel snd_hda_intel kvm snd_hda_codec mac80211 snd_hda_core irqbypass snd_hwdep snd_pcm nvidia_uvm(POE) snd_seq_midi snd_seq_midi_event snd_rawmidi iwlwifi joydev snd_seq rtsx_pci_ms cfg80211 input_leds snd_seq_device memstick snd_timer serio_raw snd soundcore mei_me shpchp mei hci_uart btbcm btqca btintel bluetooth wmi intel_lpss_acpi intel_lpss acpi_pad mac_hid tpm_crb coretemp parport_pc ppdev lp parport autofs4 btrfs xor raid6_pq algif_skcipher af_alg dm_crypt hid_generic usbhid mmc_block rtsx_pci_sdmmc nvidia_drm(POE) nvidia_modeset(POE) drm_kms_helper syscopyarea crct10dif_pclmul crc32_pclmul ghash_clmulni_intel sysfillrect sysimgblt fb_sys_fops drm aesni_intel aes_x86_64 lrw gf128mul glue_helper ablk_helper cryptd rtsx_pci r8169 nvidia(POE) mii psmouse ahci libahci i2c_hid hid pinctrl_sunrisepoint video pinctrl_intel fjes
Aug 18 08:09:11 sager kernel: [   72.177293] CPU: 1 PID: 6165 Comm: Xorg Tainted: P           OE   4.6.6-040606-generic #201608100733
Aug 18 08:09:11 sager kernel: [   72.177294] Hardware name: Notebook                         P65_P67RGRERA/P65_P67RGRERA, BIOS 1.05.13RLS1 02/02/2016
Aug 18 08:09:11 sager kernel: [   72.177295] task: ffff880836478000 ti: ffff88084da28000 task.ti: ffff88084da28000
Aug 18 08:09:11 sager kernel: [   72.177296] RIP: 0010:[<ffffffffc0da0bb0>]  [<ffffffffc0da0bb0>] _nv001875kms+0xd0/0x120 [nvidia_modeset]
Aug 18 08:09:11 sager kernel: [   72.177305] RSP: 0018:ffff88084da2b700  EFLAGS: 00000293
Aug 18 08:09:11 sager kernel: [   72.177306] RAX: 00000000000000dc RBX: ffff88084881d008 RCX: 0000000000000000
Aug 18 08:09:11 sager kernel: [   72.177307] RDX: 0000000000000000 RSI: 0000000001324b84 RDI: 0027059dcf50bf0e
Aug 18 08:09:11 sager kernel: [   72.177308] RBP: ffff8808364ad000 R08: 0000000000000001 R09: 0000000000000004
Aug 18 08:09:11 sager kernel: [   72.177308] R10: 0000000000000020 R11: 0000000000000000 R12: 00053a58484cb325
Aug 18 08:09:11 sager kernel: [   72.177309] R13: 0000000000000001 R14: ffff88084d4a4c08 R15: 0000000000000001
Aug 18 08:09:11 sager kernel: [   72.177310] FS:  00007fd7e92d2a40(0000) GS:ffff880876440000(0000) knlGS:0000000000000000
Aug 18 08:09:11 sager kernel: [   72.177311] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 18 08:09:11 sager kernel: [   72.177311] CR2: 000055d08adc8d30 CR3: 000000084b6c2000 CR4: 00000000003406e0
Aug 18 08:09:11 sager kernel: [   72.177312] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Aug 18 08:09:11 sager kernel: [   72.177313] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Aug 18 08:09:11 sager kernel: [   72.177314] Stack:
Aug 18 08:09:11 sager kernel: [   72.177314]  ffff88084881d008 0000000000000000 ffff88084881c008 00000000c0d93ea5
Aug 18 08:09:11 sager kernel: [   72.177316]  0000000000000000 0000000000000001 ffff88084881c008 ffff88084881d008
Aug 18 08:09:11 sager kernel: [   72.177317]  ffff88084881c008 0000000000000001 00000000ffffffff ffffffffc0d97713
Aug 18 08:09:11 sager kernel: [   72.177318] Call Trace:
Aug 18 08:09:11 sager kernel: [   72.177324]  [<ffffffffc0d97713>] ? _nv001769kms+0xa3/0x110 [nvidia_modeset]
Aug 18 08:09:11 sager kernel: [   72.177332]  [<ffffffffc0db2ff8>] ? _nv002008kms+0x1ca8/0x2300 [nvidia_modeset]
Aug 18 08:09:11 sager kernel: [   72.177337]  [<ffffffffc0d7f3f0>] ? nvkms_alloc+0x50/0x60 [nvidia_modeset]
Aug 18 08:09:11 sager kernel: [   72.177342]  [<ffffffffc0d801a0>] ? _nv000310kms+0x60/0x60 [nvidia_modeset]
Aug 18 08:09:11 sager kernel: [   72.177346]  [<ffffffffc0d801ce>] ? _nv000340kms+0x2e/0x40 [nvidia_modeset]
Aug 18 08:09:11 sager kernel: [   72.177350]  [<ffffffffc0d81101>] ? nvKmsIoctl+0x161/0x1e0 [nvidia_modeset]
Aug 18 08:09:11 sager kernel: [   72.177354]  [<ffffffffc0d7fda5>] ? nvkms_ioctl_common+0x45/0x80 [nvidia_modeset]
Aug 18 08:09:11 sager kernel: [   72.177358]  [<ffffffffc0d7fe51>] ? nvkms_ioctl+0x71/0xa0 [nvidia_modeset]
Aug 18 08:09:11 sager kernel: [   72.177412]  [<ffffffffc0083080>] ? nvidia_frontend_compat_ioctl+0x40/0x50 [nvidia]
Aug 18 08:09:11 sager kernel: [   72.177453]  [<ffffffffc008309e>] ? nvidia_frontend_unlocked_ioctl+0xe/0x10 [nvidia]
Aug 18 08:09:11 sager kernel: [   72.177455]  [<ffffffff81239c03>] ? do_vfs_ioctl+0xa3/0x5d0
Aug 18 08:09:11 sager kernel: [   72.177456]  [<ffffffff81228061>] ? __sb_end_write+0x21/0x30
Aug 18 08:09:11 sager kernel: [   72.177458]  [<ffffffff812256bc>] ? vfs_write+0x14c/0x190
Aug 18 08:09:11 sager kernel: [   72.177459]  [<ffffffff8123a1a9>] ? SyS_ioctl+0x79/0x90
Aug 18 08:09:11 sager kernel: [   72.177460]  [<ffffffff8184cf36>] ? entry_SYSCALL_64_fastpath+0x1e/0xa8
Aug 18 08:09:11 sager kernel: [   72.177461] Code: 44 21 f8 41 39 c5 74 45 e8 ee e9 fd ff 44 8b 9b 80 04 00 00 45 85 db 75 e4 49 39 c4 73 df 48 8b 4c 24 08 49 8b 44 ce 60 8b 40 04 <41> 39 86 b4 00 00 00 75 c9 48 8b 7c 24 10 48 c7 c2 28 79 df c0

UPDATE: Going to add nvidia.modeset=0, which I notice I didn’t add, and try again.

Tried 370 again a few times with nvidia.modeset=0 added to boot options… same bag of errors:

Aug 18 08:43:22 sager kernel: [    0.000000] Command line: BOOT_IMAGE=/vmlinuz-4.6.6-040606-generic root=/dev/mapper/ubuntu--vg-root ro acpi_enforce_resources=lax nomodeset nvidia.modeset=0 i915.modeset=0 nouveau.modeset=0
...
Aug 18 08:43:23 sager kernel: [   38.801237] nvidia-modeset: Allocated GPU:0 (GPU-e2f980da-ea7e-4335-6ae3-41ae731aed6d) @ PCI:0000:01:00.0
---
Aug 18 08:43:30 sager kernel: [   45.882081] nvidia-modeset: WARNING: GPU:0: Lost display notification; continuing.
Aug 18 08:43:33 sager kernel: [   48.634811] nvidia-modeset: ERROR: GPU:0: Idling display engine timed out: 0x0000957d:0:0:0x00000040
...
Aug 18 08:44:00 sager kernel: [   76.257786] NMI watchdog: BUG: soft lockup - CPU#2 stuck for 23s! [Xorg:5801]
Aug 18 08:44:00 sager kernel: [   76.257788] Modules linked in: pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) cmac rfcomm bnep vboxdrv(OE) binfmt_misc nls_iso8859_1 uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_core snd_usb_audio videodev media snd_usbmidi_lib btusb btrtl snd_hda_codec_hdmi arc4 mxm_wmi snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm iwlmvm nvidia_uvm(POE) mac80211 intel_rapl snd_seq_midi x86_pkg_temp_thermal intel_powerclamp kvm_intel snd_seq_midi_event snd_rawmidi kvm irqbypass iwlwifi snd_seq input_leds rtsx_pci_ms cfg80211 snd_seq_device memstick joydev snd_timer serio_raw snd soundcore mei_me mei shpchp hci_uart btbcm btqca btintel bluetooth intel_lpss_acpi intel_lpss wmi acpi_pad tpm_crb mac_hid coretemp parport_pc ppdev lp parport autofs4 btrfs xor raid6_pq algif_skcipher af_alg dm_crypt hid_generic usbhid mmc_block rtsx_pci_sdmmc nvidia_drm(POE) nvidia_modeset(POE) crct10dif_pclmul crc32_pclmul ghash_clmulni_intel drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops aesni_intel drm aes_x86_64 lrw gf128mul glue_helper ablk_helper cryptd rtsx_pci psmouse nvidia(POE) r8169 ahci mii libahci i2c_hid pinctrl_sunrisepoint hid video pinctrl_intel fjes
Aug 18 08:44:00 sager kernel: [   76.257884] CPU: 2 PID: 5801 Comm: Xorg Tainted: P           OE   4.6.6-040606-generic #201608100733
Aug 18 08:44:00 sager kernel: [   76.257884] Hardware name: Notebook                         P65_P67RGRERA/P65_P67RGRERA, BIOS 1.05.13RLS1 02/02/2016
Aug 18 08:44:00 sager kernel: [   76.257885] task: ffff88084b7b5e00 ti: ffff88084b94c000 task.ti: ffff88084b94c000
Aug 18 08:44:00 sager kernel: [   76.257886] RIP: 0010:[<ffffffffc0e36bb0>]  [<ffffffffc0e36bb0>] _nv001875kms+0xd0/0x120 [nvidia_modeset]
Aug 18 08:44:00 sager kernel: [   76.257896] RSP: 0018:ffff88084b94f700  EFLAGS: 00000283
Aug 18 08:44:00 sager kernel: [   76.257897] RAX: 00000000000000dc RBX: ffff880832ea4008 RCX: 0000000000000000
Aug 18 08:44:00 sager kernel: [   76.257898] RDX: 0000000000000000 RSI: 0000000001247d88 RDI: 002c61ce4e403726
Aug 18 08:44:00 sager kernel: [   76.257898] RBP: ffff88084b5bf000 R08: 0000000000000001 R09: 0000000000000004
Aug 18 08:44:00 sager kernel: [   76.257899] R10: 0000000000000020 R11: 0000000000000000 R12: 00053a58c4c1ed2a
Aug 18 08:44:00 sager kernel: [   76.257900] R13: 0000000000000001 R14: ffff88084cdf8d08 R15: 0000000000000001
Aug 18 08:44:00 sager kernel: [   76.257901] FS:  00007f13385b3a40(0000) GS:ffff880876480000(0000) knlGS:0000000000000000
Aug 18 08:44:00 sager kernel: [   76.257901] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 18 08:44:00 sager kernel: [   76.257902] CR2: 00007f4a636cd750 CR3: 000000084cd99000 CR4: 00000000003406e0
Aug 18 08:44:00 sager kernel: [   76.257903] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Aug 18 08:44:00 sager kernel: [   76.257903] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Aug 18 08:44:00 sager kernel: [   76.257904] Stack:
Aug 18 08:44:00 sager kernel: [   76.257905]  ffff880832ea4008 0000000000000000 ffff880832ea4808 00000000c0e29ea5
Aug 18 08:44:00 sager kernel: [   76.257906]  0000000000000000 0000000000000001 ffff880832ea4808 ffff880832ea4008
Aug 18 08:44:00 sager kernel: [   76.257907]  ffff880832ea4808 0000000000000001 00000000ffffffff ffffffffc0e2d713
Aug 18 08:44:00 sager kernel: [   76.257909] Call Trace:
Aug 18 08:44:00 sager kernel: [   76.257915]  [<ffffffffc0e2d713>] ? _nv001769kms+0xa3/0x110 [nvidia_modeset]
Aug 18 08:44:00 sager kernel: [   76.257923]  [<ffffffffc0e48ff8>] ? _nv002008kms+0x1ca8/0x2300 [nvidia_modeset]
Aug 18 08:44:00 sager kernel: [   76.257928]  [<ffffffffc0e153f0>] ? nvkms_alloc+0x50/0x60 [nvidia_modeset]
Aug 18 08:44:00 sager kernel: [   76.257932]  [<ffffffffc0e161a0>] ? _nv000310kms+0x60/0x60 [nvidia_modeset]
Aug 18 08:44:00 sager kernel: [   76.257937]  [<ffffffffc0e161ce>] ? _nv000340kms+0x2e/0x40 [nvidia_modeset]
Aug 18 08:44:00 sager kernel: [   76.257941]  [<ffffffffc0e17101>] ? nvKmsIoctl+0x161/0x1e0 [nvidia_modeset]
Aug 18 08:44:00 sager kernel: [   76.257945]  [<ffffffffc0e15da5>] ? nvkms_ioctl_common+0x45/0x80 [nvidia_modeset]
Aug 18 08:44:00 sager kernel: [   76.257950]  [<ffffffffc0e15e51>] ? nvkms_ioctl+0x71/0xa0 [nvidia_modeset]
Aug 18 08:44:00 sager kernel: [   76.258001]  [<ffffffffc0095080>] ? nvidia_frontend_compat_ioctl+0x40/0x50 [nvidia]
Aug 18 08:44:00 sager kernel: [   76.258039]  [<ffffffffc009509e>] ? nvidia_frontend_unlocked_ioctl+0xe/0x10 [nvidia]
Aug 18 08:44:00 sager kernel: [   76.258041]  [<ffffffff81239c03>] ? do_vfs_ioctl+0xa3/0x5d0
Aug 18 08:44:00 sager kernel: [   76.258042]  [<ffffffff81228061>] ? __sb_end_write+0x21/0x30
Aug 18 08:44:00 sager kernel: [   76.258043]  [<ffffffff812256bc>] ? vfs_write+0x14c/0x190
Aug 18 08:44:00 sager kernel: [   76.258044]  [<ffffffff8123a1a9>] ? SyS_ioctl+0x79/0x90
Aug 18 08:44:00 sager kernel: [   76.258046]  [<ffffffff8184cf36>] ? entry_SYSCALL_64_fastpath+0x1e/0xa8
Aug 18 08:44:00 sager kernel: [   76.258047] Code: 44 21 f8 41 39 c5 74 45 e8 ee e9 fd ff 44 8b 9b 80 04 00 00 45 85 db 75 e4 49 39 c4 73 df 48 8b 4c 24 08 49 8b 44 ce 60 8b 40 04 <41> 39 86 b4 00 00 00 75 c9 48 8b 7c 24 10 48 c7 c2 28 d9 e8 c0

Back to running 364.19 again. :\