# Job Resume Plugin (Enterprise)
Available in Rundeck Enterprise
This plugin allows Jobs to Resume execution at a failed workflow step.
When you enable this plugin for a Job, the plugin will record the internal Workflow State as the Execution progresses. When a step fails, the state from immediately before the step started is serialized and stored. You can then choose to "Resume" the execution at the failed step. The plugin will load the stored state contents, and start a new execution with the resume state, proceeding to execute the previously failed step with the same inputs.
# Requirements
WARNING
This plugin currently works only for Jobs which use the "Sequential" (aka "Step-first") Workflow Strategy.
- The Job must use the "Sequential" Workflow Strategy. With this strategy, we know that the internal state of the workflow engine before the failed step runs can be used to resume the same Job execution again.
- The Job's workflow sequence should not be modified before resuming the execution. The plugin will store a "snapshot" of the workflow sequence within the resume state, allowing it to compare that to the workflow sequence used to resume it. If the Job workflow sequence has been modified, the plugin will fail to start. This means that you should not reorder, add/remove, or replace steps in the Workflow before resuming the execution. However, modifying existing steps should be possible.
# Usage
To use this plugin, edit or create a Job, and under the "Execution Plugins" tab, enable the checkbox next to the "Resumable Job" plugin.
data:image/s3,"s3://crabby-images/2a020/2a02033a989c982a5825aa3ef178fb8ae406883c" alt="Execution Plugins"
When an execution fails at some step, a new menu option is available under the "Run Again…" drop down.
TIP
Note: The page might need to be refreshed to show the menu item.
You will see a confirmation page to begin the resumed execution.
data:image/s3,"s3://crabby-images/35f16/35f16165c9b3d9518fce52bf502468ffa2ab351a" alt="Confirm Resume Execution"
After confirming, the new execution will start at the step that previously failed.
data:image/s3,"s3://crabby-images/80c95/80c952d572a185b40483ce2726eccefe169a95e6" alt="Resumed Execution"
# Resume On Retry
When a Job uses the "Retry" feature, it normally will automatically start a fresh Execution when it fails, after an optional delay.
The Job Resume plugin can be configured to automatically use the Resume state when the execution is retried.
# Configuration
The configuration of this plugin is simple. Once added to a Job definition, it is enabled by default.
<joblist>
<job>
<!-- ... -->
<plugins>
<ExecutionLifecycle type='resume'/>
</plugins>
</job>
</joblist>
If you also want to enable it for automatic Retries, set onRetry
to true
.
<joblist>
<job>
<!-- ... -->
<plugins>
<ExecutionLifecycle type='resume'>
<configuration data='true'>
<map>
<string key='onRetry'>true</string>
</map>
</configuration>
</ExecutionLifecycle>
</plugins>
</job>
</joblist>
# Clustering
This plugin works with Clustering in Rundeck Enterprise. The serialized "Resume state" is stored alongside the execution log and execution state file in
your configured Log Storage system, with a file extension of .resume.json
.
You can initiate the Resume Execution from any Cluster member if the Resume state file has been copied to the Log Storage system. Before the execution starts, the file will be loaded from the Log Storage system.
# Limitations
- Currently only the top level workflow steps will be tracked for resume. Referenced jobs will not resume at their internally failed step.
- If the Rundeck server is interrupted, such as an application or server crash, before the workflow state is serialized and stored the execution will not be able to resume from the failed step.
- Currently, if you kill manually a job during its execution, this job cannot be resumed. The plugin does not allow this action. Only a job can be resumed when this fails by itself.