systemctl reload snap.lxd.daemon “fixes” it but it keeps reapearring. I’m still running old RHEL8 and lxd 5.21 LTS and there hasn’t been any updates since forever, what is rthis and where does it come from, any ideas?
That “[nvidia-containe] <defunct>” is a leftover nvidia-container-cli helper that LXD launches when it probes GPUs. In the libnvidia-container version bundled with LXD 5.21.3 the child process exits but never gets wait()-ed, so it stays as a harmless zombie under the lxd daemon.
Fix it in two lines:
# pull the snap that has the patched libnvidia-container
sudo snap refresh lxd --channel=5.21/stable # or 6.0/stable if you prefer
# restart the daemon so it uses the new toolkit
sudo systemctl restart snap.lxd.daemon
No NVIDIA in your containers? You can also skip GPU probing altogether:
lxc config set core.nvidia_runtime false
Either way, the “nvidia-containe” zombies stop appearing.