Cancellation in Hatchet Tasks
Hatchet provides a mechanism for canceling task executions gracefully, allowing you to signal to running tasks that they should stop running. Cancellation can be triggered on graceful termination of a worker or automatically through concurrency control strategies like CANCEL_IN_PROGRESS
, which cancels currently running task instances to free up slots for new instances when the concurrency limit is reached.
When a task is canceled, Hatchet sends a cancellation signal to the task. The task can then check for the cancellation signal and take appropriate action, such as cleaning up resources, aborting network requests, or gracefully terminating their execution.
Cancellation Mechanisms
Hatchet’s Python SDK provides a number of ways to handle cancellations, in addition to some important footguns to be aware of when running tasks.
For async tasks, Hatchet uses the cancel
method of asyncio’s task
to handle cancellation. This means that for any async work, the task will be cancelled at the next await
point via a CancelledError
exception.
For synchronous tasks, cancellation is more involved (and riskier) because of how Hatchet uses threads to run synchronous work without blocking the event loop. Threads cannot be cancelled the same was as async tasks can, which means that synchronous tasks, when running, cannot be cancelled easily. If you need to have tasks be cancellable, we highly recommend making them async. However, if they really need to be both synchronous and cancellable, Hatchet exposes a configuration option called enable_force_kill_sync_threads
, which can be set by setting the HATCHET_CLIENT_ENABLE_FORCE_KILL_SYNC_THREADS
to True
, which will forcibly kill the thread the task is running on and cause it to exit immediately.
It’s important to note that forcibly killing threads is an inherently dangerous operation, which can lead to data loss, data corruption, and so on. As mentioned above, a much preferred option for tasks that need to be cancellable is to make them asynchronous.
While your task is running, you can manage cancellation by:
- Checking for cancellation using the
Context.exit_flag
, which indicates whether a task has been cancelled. You can check this flag at any point in your task to determine whether or not to exit. For example:
- Using
Context.cancel
orContext.aio_cancel
to cancel the task. This method will set theexit_flag
toTrue
and will notify the engine that it should cancel the task. For example:
Cancellation Best Practices
When working with cancellation in Hatchet tasks, consider the following best practices:
-
Graceful Termination: When a task receives a cancellation signal, aim to terminate its execution gracefully. Clean up any resources, abort pending operations, and perform any necessary cleanup tasks before returning from the task function.
-
Cancellation Checks: Regularly check for cancellation signals within long-running tasks or loops. This allows the task to respond to cancellation in a timely manner and avoid unnecessary processing.
-
Asynchronous Operations: If a task performs asynchronous operations, such as network requests or file I/O, consider passing the cancellation signal to those operations. Many libraries and APIs support cancellation through the
AbortSignal
interface. -
Error Handling: Handle cancellation errors appropriately. Distinguish between cancellation errors and other types of errors to provide meaningful error messages and take appropriate actions.
-
Cancellation Propagation: If a task invokes other functions or libraries, consider propagating the cancellation signal to those dependencies. This ensures that cancellation is handled consistently throughout the task.
Additional Features
In addition to the methods of cancellation listed here, Hatchet also supports bulk cancellation, which allows you to cancel many tasks in bulk using either their IDs or a set of filters, which is often the easiest way to cancel many things at once.
Conclusion
Cancellation is a powerful feature in Hatchet that allows you to gracefully stop task executions when needed. Remember to follow best practices when implementing cancellation in your tasks, such as graceful termination, regular cancellation checks, handling asynchronous operations, proper error handling, and cancellation propagation.
By incorporating cancellation into your Hatchet tasks and workflows, you can build more resilient and responsive systems that can adapt to changing circumstances and user needs.