Ubuntu Version: Ubuntu Server 20.04.6 LTS
Desktop Environment (if applicable): byobu tmuxer, nothing else
Problem Description: Applications have been segfaulting all over the place, and I’m unable to pinpoint a rhyme or reason why.
Relevant System Information:
- Hardware
- HP Compaq Microtower dc5800
- Core 2 Duo E8400 CPU, 2 cores @3.0GHz (no tweaking, stock)
- BIOS v1.60, Oct. 26 2015
- 6GB system RAM
- Host: Windows Server 2012 R2 & Hyper-V (v. 6.3.9600)
- Ubuntu guest is a Gen. 2, Version 5.0 virtual machine
$ uname -a
Linux ubuntush 5.15.0-1089-azure #98~20.04.1-Ubuntu SMP- Docker is installed and running some applications (postgres, mariadb)
Screenshots or Error Messages:
A brief excerpt of journalctl -r | grep -e "traps:" -e "segfault"
May 24 12:01:46 ubuntush kernel: apport[386957]: segfault at 0 ip 00000000005a9075 sp 00007ffe01ca7e00 error 6 in python3.8[423000+295000]
May 24 10:34:04 ubuntush kernel: postgres[371158]: segfault at 56546f8320ae ip 00007fc2d2174c00 sp 00007fff5bd4fb68 error 4 in
libc-2.31.so[7fc2d205b000+15a000]
May 24 01:34:21 ubuntush kernel: php[338919]: segfault at 5622cb98339a ip 00007fc8352544dd sp 00007fff99b0fba8 error 4 in dom.so[7fc83524a000+1a000]
May 23 10:04:07 ubuntush kernel: php[138526]: segfault at 0 ip 00007f199c43da87 sp 00007ffe68553120 error 4 in libc.so.6[7f199c3cb000+155000]
May 22 16:15:50 ubuntush kernel: tmux: server[933]: segfault at 555f8518a6e0 ip 0000555f8518a6e0 sp 00007fff5ef6bd50 error 15
May 21 14:01:21 ubuntush kernel: mariadbd[284037]: segfault at 14d7817fa8 ip 00000014d7817fa8 sp 00007fffd7817cb0 error 14 in mariadbd[559b03e98000+640000]
May 21 13:36:59 ubuntush kernel: traps: php[37760] general protection fault ip:7f8ff57bc06f sp:7ffe73f2e430 error:0 in libc.so.
6[7f8ff574d000+155000]
May 21 13:30:58 ubuntush kernel: .NET TP Worker[25102]: segfault at 0 ip 00007f816b9f2fa5 sp 00007f40413fbd00 error 4 in libcoreclr.so[7f816b5d2000+4b7000]
May 21 13:27:09 ubuntush kernel: traps: pgrep[10378] general protection fault ip:5639d8e12763 sp:7ffe92648b80 error:0 in pgrep[5639d8e11000+3000]
May 21 13:17:09 ubuntush kernel: apport-retrace[298701]: segfault at 7f535d64ed ip 00000000005f62c0 sp 00007ffd99c3c880 error 6 in python3.8[423000+295000]
May 21 13:16:36 ubuntush kernel: apport-retrace[298637]: segfault at 18 ip 000000000050a722 sp 00007ffd2d4dde20 error 4 in python3.8[423000+295000]
Message:
This has been a fun little problem I’ve been dealing with for quite some time now. It’s gotten to the point (finally) where data is actually being corrupted, so I’m hoping for some direction in getting to the bottom of this and putting a bandage on the issues before I blow this VM away and move to a different platform.
It used to be that the init process would segfault/die and be unrecoverable, preventing me from shutting down or rebooting the machine. Somehow that stopped at some point, although I’m still bombarded with a nearly constant stream of segfaults as you can see from my paste above.
Things I’ve tried:
- Memtest86+ runs without issue in the same virtual machine, which makes me think that processing & memory aren’t the issue.
- Various fsck runs of the file systems come back clean, plus I’m not seeing any log indications of storage issues
debsums
command comes back clean for non-configuration files - hoping this speaks to system file integrity- Ubuntu is configured according to Microsoft guidelines, specifically, I’m running the linux-azure kernel image which seems to guarantee maximum possible Hyper-V compatibility
- Although, Hyper-V still complains about an outdated guest communication protocol, which seems to be the responsibility of the hv-kvp-daemon which I believe is the latest.
- The Ubuntu guide for Hyper-V is less detailed, but I believe I’m in compliance nonetheless.
apport
is installed and operating (most of the time, as you can see by its own crashes in the logs above).whoopsie
is also operating in the hopes that it may provide some useful information. They seem to miss a chunk of the segfaults, however. I find myself getting a little over my head as I follow instructions for opening up the.crash
files and attempting to read/understand the backtrace.- Here’s an example of an uploaded crash for apport-cli