
Declarative Workflow Design (DAGs)

Hatchet workflows are designed in a Directed Acyclic Graph (DAG) format, where each task is a node in the graph, and the dependencies between tasks are the edges. This structure ensures that workflows are organized, predictable, and free from circular dependencies.

[Diagram: Task A starts the workflow and fans out to Tasks B and C, which depend on A and run in parallel; Task D then runs sequentially after they complete.]

How DAG Workflows Work

You declare the graph

Define tasks and their dependencies upfront. Hatchet knows the full shape of work before execution begins.

Hatchet executes in order

Tasks run as soon as their parents complete. Independent tasks run in parallel automatically. A worker slot is only assigned when a task is ready to execute, so tasks waiting on parents consume no resources. Each task has configurable retry policies and timeouts.

Results flow downstream

Task outputs are cached and passed to child tasks. If a failure occurs mid-workflow, completed tasks don’t re-run.

Everything is observable

Every task execution is tracked in the dashboard — inputs, outputs, durations, and errors. You can see exactly where a workflow succeeded or failed.
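The execution model described above can be illustrated with a minimal, self-contained sketch (plain Python, no Hatchet APIs — a real engine distributes ready tasks across workers in parallel):

```python
def run_dag(tasks, parents):
    """Run tasks in dependency order.

    tasks:   name -> function taking a dict of parent outputs
    parents: name -> set of parent task names

    Outputs are cached in `results`, so each task runs exactly once
    and children read their parents' cached outputs.
    """
    results = {}
    remaining = {name: set(deps) for name, deps in parents.items()}
    order = []
    while remaining:
        # A task is ready as soon as all of its parents have completed.
        ready = [t for t, deps in remaining.items() if deps.issubset(results)]
        if not ready:
            raise ValueError("cycle detected: not a DAG")
        # A real engine would dispatch all ready tasks to workers in parallel.
        for t in ready:
            results[t] = tasks[t]({p: results[p] for p in parents[t]})
            order.append(t)
            del remaining[t]
    return results, order

tasks = {
    "A": lambda deps: 1,
    "B": lambda deps: deps["A"] + 1,          # depends on A
    "C": lambda deps: deps["A"] + 2,          # depends on A, parallel with B
    "D": lambda deps: deps["B"] + deps["C"],  # fan-in: waits for B and C
}
parents = {"A": set(), "B": {"A"}, "C": {"A"}, "D": {"B", "C"}}
results, order = run_dag(tasks, parents)
# results["D"] == 5; order starts with "A" and ends with "D"
```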

Defining a Workflow

Start by declaring a workflow with a name. The workflow object can also accept additional workflow-level configuration options, which we’ll cover later.

The returned object is an instance of the Workflow class, which is the primary interface for interacting with the workflow (i.e. running, enqueuing, scheduling, etc).
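In Python, declaring a workflow looks roughly like this (a sketch using the Hatchet Python SDK; `SimpleWorkflow` and `SimpleInput` are illustrative names, and exact option names may vary between SDK versions):

```python
from hatchet_sdk import Hatchet
from pydantic import BaseModel

hatchet = Hatchet()

class SimpleInput(BaseModel):
    message: str

# `name` identifies the workflow; workflow-level options such as an
# input validator can be passed here as well.
simple = hatchet.workflow(name="SimpleWorkflow", input_validator=SimpleInput)
```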

💡

The returned Workflow object can be interacted with in the same way as a task; however, it accepts only a subset of the options that are applied at the task level.

Defining a Task

Now that we have a workflow, we can define a task to be executed as part of the workflow. Tasks are defined by calling the task method on the workflow object.

The task method takes a name and a function that defines the task’s behavior. The function will receive the workflow’s input and return the task’s output. Tasks also accept a number of other configuration options, which are covered elsewhere in our documentation.

In Python, the task method is a decorator, which is used like this to wrap a function:
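For example (a sketch with illustrative names):

```python
from hatchet_sdk import Context, EmptyModel, Hatchet

hatchet = Hatchet()
simple = hatchet.workflow(name="SimpleWorkflow")

# `task` registers the decorated function as a task on the workflow.
@simple.task()
def task_1(input: EmptyModel, ctx: Context) -> dict:
    return {"result": "task_1 output"}
```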

The function takes two arguments: input, which is a Pydantic model, and ctx, which is the Hatchet Context object. We’ll discuss both of these more later.

Internally, Hatchet calls the task with positional arguments, so you can name `input` and `ctx` whatever you like.

For instance, `def task_1(foo: EmptyModel, bar: Context) -> None:` is perfectly valid.

Building a DAG with Task Dependencies

The power of Hatchet’s workflow design comes from connecting tasks into a DAG structure. Tasks can specify dependencies (parents) which must complete successfully before the task can start.
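A sketch of a diamond-shaped DAG (illustrative names; `parents` takes the parent task objects themselves):

```python
from hatchet_sdk import Context, EmptyModel, Hatchet

hatchet = Hatchet()
dag = hatchet.workflow(name="DagWorkflow")

@dag.task()
def start(input: EmptyModel, ctx: Context) -> dict:
    return {"status": "started"}

# `left` and `right` share a single parent, so they run in parallel.
@dag.task(parents=[start])
def left(input: EmptyModel, ctx: Context) -> dict:
    return {"side": "left"}

@dag.task(parents=[start])
def right(input: EmptyModel, ctx: Context) -> dict:
    return {"side": "right"}

# `combine` starts only after both `left` and `right` succeed.
@dag.task(parents=[left, right])
def combine(input: EmptyModel, ctx: Context) -> dict:
    return {"status": "done"}
```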

Accessing Parent Task Outputs

As shown in the examples above, tasks can access outputs from their parent tasks using the context object:
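A sketch of a child task reading its parent's cached output via `ctx.task_output` (illustrative names; the returned value is the parent's output):

```python
from hatchet_sdk import Context, EmptyModel, Hatchet

hatchet = Hatchet()
wf = hatchet.workflow(name="OutputsWorkflow")

@wf.task()
def parent_task(input: EmptyModel, ctx: Context) -> dict:
    return {"value": 42}

@wf.task(parents=[parent_task])
def child_task(input: EmptyModel, ctx: Context) -> dict:
    # `task_output` returns the cached output of the given parent task.
    parent_result = ctx.task_output(parent_task)
    return {"doubled": parent_result["value"] * 2}
```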

Running a Workflow

You can run workflows directly or enqueue them for asynchronous execution. All the same methods for running a task are available for workflows!
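A sketch of both styles (illustrative names; method names are from the Hatchet Python SDK — check the docs for your SDK version):

```python
from hatchet_sdk import Hatchet
from pydantic import BaseModel

class SimpleInput(BaseModel):
    message: str

hatchet = Hatchet()
simple = hatchet.workflow(name="SimpleWorkflow", input_validator=SimpleInput)

# Run synchronously and wait for the workflow's result:
result = simple.run(SimpleInput(message="hello"))

# Or enqueue without blocking; returns a reference to the run:
ref = simple.run_no_wait(SimpleInput(message="hello"))

# Async variants are also available (e.g. `aio_run`, `aio_run_no_wait`).
```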

Pre-Determined Pipelines

DAGs naturally model fixed multi-stage pipelines where the sequence of tasks and their dependencies are known before execution. ETL workflows, document processing pipelines, and CI/CD workflows all follow this pattern: each stage depends on the previous, and the overall structure is visible and predictable in the dashboard.
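As a sketch, a minimal ETL-shaped workflow might look like this (illustrative names; each stage depends on the previous one):

```python
from hatchet_sdk import Context, EmptyModel, Hatchet

hatchet = Hatchet()
etl = hatchet.workflow(name="EtlPipeline")

@etl.task()
def extract(input: EmptyModel, ctx: Context) -> dict:
    # e.g. pull rows from a source system
    return {"rows": [1, 2, 3]}

@etl.task(parents=[extract])
def transform(input: EmptyModel, ctx: Context) -> dict:
    rows = ctx.task_output(extract)["rows"]
    return {"rows": [r * 2 for r in rows]}

@etl.task(parents=[transform])
def load(input: EmptyModel, ctx: Context) -> dict:
    # e.g. write the transformed rows to a warehouse
    return {"loaded": len(ctx.task_output(transform)["rows"])}
```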

[Diagram: a fixed pipeline of Tasks A through E, including a parallel stage.]