I am not able to perform any action on my container. How can I remove the container or perform any action?
I will give you commands rather than UI steps, because I am not very familiar with the UI. In the worst case, if the container stops neither cleanly with lxc stop awi1 nor forcibly with lxc stop awi1 -f, you can kill the container process: get its PID with lxc info awi1 | grep PID: and then run sudo kill -9 <container_PID>. Then optionally remove it from the LXD database with lxc delete <container_name>.
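The escalation described above could be scripted roughly as follows. This is only a sketch: extract_pid and force_stop are hypothetical helper names, the 30-second timeout is an arbitrary choice, and the PID match assumes lxc info prints a line starting with "PID:" (or "Pid:" on older releases).

```shell
#!/bin/sh
# Sketch only: escalate from a clean stop to a forced kill for one container.
# extract_pid and force_stop are hypothetical helpers, not part of LXD.

# Read `lxc info <name>` output on stdin and print the reported PID.
# Matches both "PID:" and the older "Pid:" spelling.
extract_pid() {
    awk 'tolower($1) == "pid:" { print $2; exit }'
}

force_stop() {
    name="$1"
    # 1. Clean shutdown, giving up after 30 seconds (assumed value).
    lxc stop "$name" --timeout 30 && return 0
    # 2. Forced stop.
    lxc stop "$name" --force && return 0
    # 3. Last resort: kill the container process directly.
    pid="$(lxc info "$name" | extract_pid)"
    if [ -z "$pid" ]; then
        echo "no PID reported for $name; nothing to kill" >&2
        return 1
    fi
    sudo kill -9 "$pid"
}
```

After sourcing this, force_stop awi1 && lxc delete awi1 would stop the container and then remove it from the LXD database. Only delete it if you no longer need its data.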
Could you give more detail on what caused the container to enter this state and what you have tried? Specifically, what are you trying to run in the container? If you can still lxc shell into it, killing the running process from inside the container may also be an option.
I have a cluster of 3 nodes using the zfs storage driver, with a SAN mounted. While everything was working I created the container awi1, and after some days there was some activity that caused a zfs/SAN error. Now the specific node where my container was created does not allow me to create new containers. I think the issue is caused by the storage.
I think the issue is caused by the storage.
Indeed it seems that way, so just getting rid of this specific container will not be enough. At least this should help unblock you to keep debugging the problem and making use of your LXD in general.
So if we want to salvage this storage pool, we can perform a few steps (all within the problematic node):
1- Look closely at what the error is when trying to create containers in that pool
2- Perhaps the image was corrupted. Does creating VMs or custom volumes work on that storage pool?
3- Get the pool name with zpool list, create a volume in it with zfs create <pool_name>/volume, and snapshot it with zfs snapshot <pool_name>/volume@snap1. Were these operations successful?
4- Do other zfs storage pools work properly? What about pools using other drivers?
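The create-and-snapshot check from step 3 could be wrapped in a small script like the one below. It is a sketch under assumptions: probe_pool and the lxd-probe dataset name are made up for illustration, and it needs to run as root (or with ZFS permissions delegated) on the problematic node.

```shell
#!/bin/sh
# Sketch only: check that a pool accepts a new dataset and a snapshot.
# probe_pool and the "lxd-probe" dataset name are hypothetical.
probe_pool() {
    pool="$1"
    vol="$pool/lxd-probe"
    if ! zfs create "$vol"; then
        echo "create failed on $pool"
        return 1
    fi
    if zfs snapshot "$vol@snap1"; then
        echo "create and snapshot succeeded on $pool"
        rc=0
    else
        echo "snapshot failed on $pool"
        rc=1
    fi
    # Remove the probe dataset together with its snapshot.
    zfs destroy -r "$vol"
    return "$rc"
}
```

List the pool names with zpool list -H -o name, then run probe_pool <pool_name> for the pool backing LXD. Any error text printed by zfs itself is worth posting here.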