Hello
Fedora Linux guest is spitting errors after a migration to a new host. On the previous host (Centos 8.2 with linux 4.18) no such errors were noticed on the same guest. The new host is Manjaro running more recent Linux kernel (5.8) freshly installed.
The guest is headless; it is used as a Media server. iGPU passthrough provides HW accelerated video encoding/decoding using FFMpeg.
Any pointers on how to address this issue would be most welcome.
Thank you.
Host specs:
CPU is Intel 8700K
RAM 32GB
Host OS details:
Linux 5.8.18-1-MANJARO #1 SMP PREEMPT Sun Nov 1 14:10:04 UTC 2020 x86_64 GNU/Linux
Host proc/cmdline:
[...] rw quiet splash rhgb nmi_watchdog=0 loglevel=3 vt.global_cursor_default=0 i915.enable_gvt=1 i915.enable_guc=0 biosdevname=0 net.ifnames=0 blacklist=nouveau intel_iommu=on,igfx_off iommu=pt intel_iommu=on,igfx_off vfio-pci.ids=8086:1901,1b21:2142,10de:1b82,10de:10f0,1033:0194 vfio_iommu_type1.allow_unsafe_interrupts=1 transparent_hugepage=never isolcpus=2,8,3,9,4,10,5,11
Guest OS Details
Linux 5.9.8-100.fc32.x86_64 #1 SMP Tue Nov 10 22:39:06 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux
VM exerpt from xml:
<os>
<type arch="x86_64" machine="pc-q35-5.1">hvm</type>
<loader readonly="yes" type="pflash">/usr/share/edk2-ovmf/x64/OVMF_CODE.fd</loader>
<nvram>/var/lib/libvirt/qemu/nvram/spree_VARS.fd</nvram>
<boot dev="hd"/>
</os>
<features>
<acpi/>
<apic/>
<vmport state="off"/>
<smm state="on"/>
</features>
<cpu mode="custom" match="exact" check="none">
<model fallback="forbid">Skylake-Client-noTSX-IBRS</model>
</cpu>
<clock offset="utc">
<timer name="rtc" tickpolicy="catchup"/>
<timer name="pit" tickpolicy="delay"/>
<timer name="hpet" present="no"/>
</clock>
.....
<hostdev mode="subsystem" type="mdev" managed="no" model="vfio-pci" display="off">
<source>
<address uuid="3907b8ac-e1c2-46f5-b68f-0df7d0a6cd92"/>
</source>
<alias name="hostdev0"/>
<address type="pci" domain="0x0000" bus="0x06" slot="0x00" function="0x0"/>
</hostdev>
.....
<qemu:commandline>
<qemu:arg value="-set"/>
<qemu:arg value="device.hostdev0.x-igd-opregion=on"/>
<qemu:arg value="-set"/>
<qemu:arg value="device.hostdev0.romfile=/usr/local/share/vbios/vbios_gvt_uefi.rom"/>
</qemu:commandline>
Expected behavior: no such errors displayed in the guest logs:
Nov 17 16:06:58 guest.hostame kernel: pcieport 0000:00:02.2: pciehp: Failed to check link status
Nov 17 16:06:58 guest.hostame kernel: pcieport 0000:00:02.6: pciehp: Failed to check link status
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: [drm] *ERROR* [CRTC:51:pipe A] mismatch in infoframes.enable (expected 0x00000008, found 0x00000000)
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: [drm] *ERROR* mismatch in avi infoframe
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: [drm] *ERROR* expected:
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: HDMI infoframe: Auxiliary Video Information (AVI), version 2, length 13
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: colorspace: RGB
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: scan mode: No Data
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: colorimetry: No Data
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: picture aspect: No Data
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: active aspect: 14:9 Top
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: itc: No Data
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: extended colorimetry: xvYCC 601
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: quantization range: Default
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: nups: Unknown Non-uniform Scaling
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: video code: 0
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: ycc quantization range: Limited
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: hdmi content type: Graphics
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: pixel repeat: 0
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: bar top 0, bottom 0, left 0, right 0
Nov 17 16:06:58 guest.hostame kernel: i915 0000:06:00.0: [drm] *ERROR* found:
Nov 17 16:06:58 guest.hostame kernel: ------------[ cut here ]------------
Nov 17 16:06:58 guest.hostame kernel: pipe state doesn't match!
Nov 17 16:06:58 guest.hostame kernel: WARNING: CPU: 1 PID: 345 at drivers/gpu/drm/i915/display/intel_display.c:14311 intel_atomic_commit_tail+0x113d/0x1280 [i915]
Nov 17 16:06:58 guest.hostame kernel: Modules linked in: i915(+) video i2c_algo_bit drm_kms_helper cec crct10dif_pclmul drm crc32_pclmul crc32c_intel serio_raw ghash_clmulni_intel virt>
Nov 17 16:06:58 guest.hostame kernel: CPU: 1 PID: 345 Comm: systemd-udevd Tainted: G U 5.9.8-100.fc32.x86_64 #1
Nov 17 16:06:58 guest.hostame kernel: Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 0.0.0 02/06/2015
Nov 17 16:06:58 guest.hostame kernel: RIP: 0010:intel_atomic_commit_tail+0x113d/0x1280 [i915]
Nov 17 16:06:58 guest.hostame kernel: Code: 93 20 55 00 00 38 d0 0f 84 08 f5 ff ff e9 1b b3 09 00 41 83 bd 8c 0a 00 00 03 0f 85 c5 fa ff ff e9 cf fa ff ff e8 d3 3d 68 f3 <0f> 0b e9 5f >
Nov 17 16:06:58 guest.hostame kernel: RSP: 0000:ffffab270036ba38 EFLAGS: 00010292
Nov 17 16:06:58 guest.hostame kernel: RAX: 0000000000000019 RBX: ffff963dd7b202d0 RCX: ffff963dfc498d08
Nov 17 16:06:58 guest.hostame kernel: RDX: 00000000ffffffd8 RSI: 0000000000000027 RDI: ffff963dfc498d00
Nov 17 16:06:58 guest.hostame kernel: RBP: ffff963dd867b000 R08: 00000000000002a3 R09: 0000000000000003
Nov 17 16:06:58 guest.hostame kernel: R10: 0000000000000000 R11: 0000000000000001 R12: ffff963dd7b20000
Nov 17 16:06:58 guest.hostame kernel: R13: ffff963dd85f0000 R14: ffff963dd7b202d8 R15: ffff963dd85f3000
Nov 17 16:06:58 guest.hostame kernel: FS: 00007fc33bc9fb80(0000) GS:ffff963dfc480000(0000) knlGS:0000000000000000
Nov 17 16:06:58 guest.hostame kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Nov 17 16:06:58 guest.hostame kernel: CR2: 00007fc02f5f6000 CR3: 000000005819a001 CR4: 00000000003706e0
Nov 17 16:06:58 guest.hostame kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Nov 17 16:06:58 guest.hostame kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Nov 17 16:06:58 guest.hostame kernel: Call Trace:
Nov 17 16:06:58 guest.hostame kernel: intel_atomic_commit+0x36e/0x3f0 [i915]
Nov 17 16:06:58 guest.hostame kernel: intel_modeset_init+0x910/0x1e90 [i915]
Nov 17 16:06:58 guest.hostame kernel: i915_driver_probe+0x833/0xde0 [i915]
Nov 17 16:06:58 guest.hostame kernel: ? vga_switcheroo_client_probe_defer+0x1f/0x40
Nov 17 16:06:58 guest.hostame kernel: ? i915_pci_probe+0x31/0x110 [i915]
Nov 17 16:06:58 guest.hostame kernel: local_pci_probe+0x42/0x80
Nov 17 16:06:58 guest.hostame kernel: ? _cond_resched+0x16/0x40
Nov 17 16:06:58 guest.hostame kernel: pci_device_probe+0xd9/0x190
Nov 17 16:06:58 guest.hostame kernel: really_probe+0x205/0x460
Nov 17 16:06:58 guest.hostame kernel: driver_probe_device+0xe1/0x150
Nov 17 16:06:58 guest.hostame kernel: device_driver_attach+0xa1/0xb0
Nov 17 16:06:58 guest.hostame kernel: __driver_attach+0x8a/0x150
Nov 17 16:06:58 guest.hostame kernel: ? device_driver_attach+0xb0/0xb0
Nov 17 16:06:58 guest.hostame kernel: ? device_driver_attach+0xb0/0xb0
Nov 17 16:06:58 guest.hostame kernel: bus_for_each_dev+0x64/0x90
Nov 17 16:06:58 guest.hostame kernel: bus_add_driver+0x12b/0x1e0
Nov 17 16:06:58 guest.hostame kernel: driver_register+0x8b/0xe0
Nov 17 16:06:58 guest.hostame kernel: ? 0xffffffffc067d000
Nov 17 16:06:58 guest.hostame kernel: i915_init+0x5d/0x70 [i915]
Nov 17 16:06:58 guest.hostame kernel: do_one_initcall+0x46/0x200
Nov 17 16:06:58 guest.hostame kernel: ? do_init_module+0x23/0x250
Nov 17 16:06:58 guest.hostame kernel: ? kmem_cache_alloc_trace+0xe9/0x1f0
Nov 17 16:06:58 guest.hostame kernel: do_init_module+0x5c/0x250
Nov 17 16:06:58 guest.hostame kernel: __do_sys_init_module+0x121/0x150
Nov 17 16:06:58 guest.hostame kernel: do_syscall_64+0x33/0x40
Nov 17 16:06:58 guest.hostame kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9
Nov 17 16:06:58 guest.hostame kernel: RIP: 0033:0x7fc33cdc945e
Nov 17 16:06:58 guest.hostame kernel: Code: 48 8b 0d 3d 0a 0c 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca b8 af 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 0a>
Nov 17 16:06:58 guest.hostame kernel: RSP: 002b:00007ffe55af8b88 EFLAGS: 00000246 ORIG_RAX: 00000000000000af
Nov 17 16:06:58 guest.hostame kernel: RAX: ffffffffffffffda RBX: 0000555e7213ad60 RCX: 00007fc33cdc945e
Nov 17 16:06:58 guest.hostame kernel: RDX: 0000555e72148ef0 RSI: 00000000004f0996 RDI: 0000555e729f7300
Nov 17 16:06:58 guest.hostame kernel: RBP: 0000555e729f7300 R08: 0000555e72142620 R09: 00007ffe55af73c6
Nov 17 16:06:58 guest.hostame kernel: R10: 0000000000000007 R11: 0000000000000246 R12: 0000000000000000
Nov 17 16:06:58 guest.hostame kernel: R13: 0000555e72148ef0 R14: 0000555e721437e0 R15: 000000000000000d
Nov 17 16:06:58 guest.hostame kernel: ---[ end trace ec2694ff20c7c541 ]---
Guest OS Details
Linux 5.9.8-100.fc32.x86_64 #1 SMP Tue Nov 10 22:39:06 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux
Guest lspci -vvvs
06:00.0 VGA compatible controller: Intel Corporation UHD Graphics 630 (Desktop) (prog-if 00 [VGA controller])
Subsystem: Gigabyte Technology Co., Ltd Device d000
Physical Slot: 0-5
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
Latency: 0, Cache Line Size: 64 bytes
Interrupt: pin A routed to IRQ 42
Region 0: Memory at c0000000 (64-bit, non-prefetchable) [size=16M]
Region 2: Memory at 800000000 (64-bit, prefetchable) [size=256M]
Expansion ROM at <ignored> [disabled]
Capabilities: [40] Vendor Specific Information: Len=0c <?>
Capabilities: [70] Express (v2) Endpoint, MSI 00
DevCap: MaxPayload 128 bytes, PhantFunc 0, Latency L0s <64ns, L1 <1us
ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ FLReset+ SlotPowerLimit 0.000W
DevCtl: CorrErr- NonFatalErr- FatalErr- UnsupReq-
RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop- FLReset-
MaxPayload 128 bytes, MaxReadReq 128 bytes
DevSta: CorrErr- NonFatalErr- FatalErr- UnsupReq- AuxPwr- TransPend-
LnkCap: Port #0, Speed 2.5GT/s, Width x1, ASPM not supported
ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp-
LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- CommClk-
ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
LnkSta: Speed unknown (downgraded), Width x0 (downgraded)
TrErr- Train- SlotClk- DLActive- BWMgmt- ABWMgmt-
DevCap2: Completion Timeout: Not Supported, TimeoutDis-, NROPrPrP-, LTR-
10BitTagComp-, 10BitTagReq-, OBFF Not Supported, ExtFmt-, EETLPPrefix-
EmergencyPowerReduction Not Supported, EmergencyPowerReductionInit-
FRS-, TPHComp-, ExtTPHComp-
AtomicOpsCap: 32bit- 64bit- 128bitCAS-
DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR-, OBFF Disabled
AtomicOpsCtl: ReqEn-
LnkCtl2: Target Link Speed: 2.5GT/s, EnterCompliance- SpeedDis-
Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS-
Compliance De-emphasis: -6dB
LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete-, EqualizationPhase1-
EqualizationPhase2-, EqualizationPhase3-, LinkEqualizationRequest-
Capabilities: [ac] MSI: Enable+ Count=1/1 Maskable- 64bit-
Address: fee00000 Data: 4028
Capabilities: [d0] Power Management version 2
Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME+
Capabilities: [100 v1] Process Address Space ID (PASID)
PASIDCap: Exec- Priv-, Max PASID Width: 14
PASIDCtl: Enable- Exec- Priv-
Capabilities: [200 v1] Address Translation Service (ATS)
ATSCap: Invalidate Queue Depth: 00
ATSCtl: Enable-, Smallest Translation Unit: 00
Capabilities: [300 v1] Page Request Interface (PRI)
PRICtl: Enable- Reset-
PRISta: RF- UPRGI- Stopped+
Page Request Capacity: 00008000, Page Request Allocation: 00000000
Kernel driver in use: i915
Kernel modules: i915
vainfo
error: XDG_RUNTIME_DIR not set in the environment.
error: can't connect to X server!
libva info: VA-API version 1.7.0
libva info: User environment variable requested driver 'i965'
libva info: Trying to open /usr/lib64/dri/i965_drv_video.so
libva info: Found init function __vaDriverInit_1_7
libva info: va_openDriver() returns 0
vainfo: VA-API version: 1.7 (libva 2.7.0.pre1)
vainfo: Driver version: Intel i965 driver for Intel(R) Coffee Lake - 2.4.1
vainfo: Supported profile and entrypoints
VAProfileMPEG2Simple : VAEntrypointVLD
VAProfileMPEG2Simple : VAEntrypointEncSlice
VAProfileMPEG2Main : VAEntrypointVLD
VAProfileMPEG2Main : VAEntrypointEncSlice
VAProfileH264ConstrainedBaseline: VAEntrypointVLD
VAProfileH264ConstrainedBaseline: VAEntrypointEncSlice
VAProfileH264ConstrainedBaseline: VAEntrypointEncSliceLP
VAProfileH264Main : VAEntrypointVLD
VAProfileH264Main : VAEntrypointEncSlice
VAProfileH264Main : VAEntrypointEncSliceLP
VAProfileH264High : VAEntrypointVLD
VAProfileH264High : VAEntrypointEncSlice
VAProfileH264High : VAEntrypointEncSliceLP
VAProfileH264MultiviewHigh : VAEntrypointVLD
VAProfileH264MultiviewHigh : VAEntrypointEncSlice
VAProfileH264StereoHigh : VAEntrypointVLD
VAProfileH264StereoHigh : VAEntrypointEncSlice
VAProfileVC1Simple : VAEntrypointVLD
VAProfileVC1Main : VAEntrypointVLD
VAProfileVC1Advanced : VAEntrypointVLD
VAProfileNone : VAEntrypointVideoProc
VAProfileJPEGBaseline : VAEntrypointVLD
VAProfileJPEGBaseline : VAEntrypointEncPicture
VAProfileVP8Version0_3 : VAEntrypointVLD
VAProfileVP8Version0_3 : VAEntrypointEncSlice
VAProfileHEVCMain : VAEntrypointVLD
VAProfileHEVCMain : VAEntrypointEncSlice
VAProfileHEVCMain10 : VAEntrypointVLD
VAProfileHEVCMain10 : VAEntrypointEncSlice
VAProfileVP9Profile0 : VAEntrypointVLD
VAProfileVP9Profile0 : VAEntrypointEncSlice
VAProfileVP9Profile2 : VAEntrypointVLD
Hello
Fedora Linux guest is spitting errors after a migration to a new host. On the previous host (Centos 8.2 with linux 4.18) no such errors were noticed on the same guest. The new host is Manjaro running more recent Linux kernel (5.8) freshly installed.
The guest is headless; it is used as a Media server. iGPU passthrough provides HW accelerated video encoding/decoding using FFMpeg.
Any pointers on how to address this issue would be most welcome.
Thank you.
Host specs:
Host OS details:
Linux 5.8.18-1-MANJARO #1 SMP PREEMPT Sun Nov 1 14:10:04 UTC 2020 x86_64 GNU/LinuxHost proc/cmdline:
[...] rw quiet splash rhgb nmi_watchdog=0 loglevel=3 vt.global_cursor_default=0 i915.enable_gvt=1 i915.enable_guc=0 biosdevname=0 net.ifnames=0 blacklist=nouveau intel_iommu=on,igfx_off iommu=pt intel_iommu=on,igfx_off vfio-pci.ids=8086:1901,1b21:2142,10de:1b82,10de:10f0,1033:0194 vfio_iommu_type1.allow_unsafe_interrupts=1 transparent_hugepage=never isolcpus=2,8,3,9,4,10,5,11Guest OS Details
Linux 5.9.8-100.fc32.x86_64 #1 SMP Tue Nov 10 22:39:06 UTC 2020 x86_64 x86_64 x86_64 GNU/LinuxVM exerpt from xml:
Expected behavior: no such errors displayed in the guest logs:
Guest OS Details
Linux 5.9.8-100.fc32.x86_64 #1 SMP Tue Nov 10 22:39:06 UTC 2020 x86_64 x86_64 x86_64 GNU/LinuxGuest lspci -vvvs
vainfo