To Reproduce
When a volume backup job fails mid-execution (e.g. due to a network error preventing Docker from pulling the required image), the container that was stopped as part of the "Turn Off Container During Backup" flow is not automatically restarted. The container remains in a stopped state indefinitely until manually reloaded by the user.
- Create a service with a volume backup schedule
- Enable the "Turn Off Container During Backup" option on the backup configuration.
- Trigger a backup run (automatically via cron or manually).
- Simulate or encounter a network failure during the backup, in my specific case it was a TLS handshake timeout to registry-1.docker.io
- Observe that the backup job fails and exits.
- Check the container status; it remains stopped and does not recover automatically.
Current vs. Expected behavior
Current behavior: When the backup fails after the container has been stopped, the container is left in a stopped state. No automatic recovery or restart is attempted, leaving the service offline until a user manually intervenes and clicks "Reload".
Expected behavior: If the backup fails for any reason after the container was stopped as part of the backup process, Dokploy should automatically attempt a container restart anyway to restore service availability; regardless of where in the backup pipeline the failure occurred.
Provide environment information
Ubuntu 24.04
Dokploy v0.28.8
Which area(s) are affected? (Select all that apply)
Application
Are you deploying the applications where Dokploy is installed or on a remote server?
Same server where Dokploy is installed
Additional context
Failed backup log:
Initializing volume backup
Waiting for volume backup lock: /etc/dokploy/volume-backup-lock-[service-id]
Volume backup lock acquired
Stopping compose container
[container-id]
Volume name: [volume-name]
Backup file name: [volume-name]-[timestamp].tar
Turning off volume backup: Yes
Starting volume backup
Dir: /etc/dokploy/volume-backups/[backup-dir]
Unable to find image 'ubuntu:latest' locally
docker: Error response from daemon: failed to resolve reference
"docker.io/library/ubuntu:latest": failed to do request: Head
"https://registry-1.docker.io/v2/library/ubuntu/manifests/latest":
net/http: TLS handshake timeout
Additional context:
- The failure occurred during a scheduled (cron) automatic backup.
- The root cause of the failure was a transient network issue. A TLS handshake timeout when Docker attempted to pull ubuntu:latest from Docker Hub.
- Manually clicking "Reload" on the service successfully restarted the container with no issues, confirming the container itself was healthy.
Will you send a PR to fix it?
No
To Reproduce
When a volume backup job fails mid-execution (e.g. due to a network error preventing Docker from pulling the required image), the container that was stopped as part of the "Turn Off Container During Backup" flow is not automatically restarted. The container remains in a stopped state indefinitely until manually reloaded by the user.
Current vs. Expected behavior
Current behavior: When the backup fails after the container has been stopped, the container is left in a stopped state. No automatic recovery or restart is attempted, leaving the service offline until a user manually intervenes and clicks "Reload".
Expected behavior: If the backup fails for any reason after the container was stopped as part of the backup process, Dokploy should automatically attempt a container restart anyway to restore service availability; regardless of where in the backup pipeline the failure occurred.
Provide environment information
Which area(s) are affected? (Select all that apply)
Application
Are you deploying the applications where Dokploy is installed or on a remote server?
Same server where Dokploy is installed
Additional context
Failed backup log:
Additional context:
Will you send a PR to fix it?
No