Oban Pro Releases

This release enhances workflows with sub-workflows and context sharing, overhauls queue partitioning for better performance, improves dynamic plugins, and adds various usability improvements.

See the v1.6 Upgrade Guide for complete upgrade steps and migration caveats.

🗂️ Sub-Workflows

Workflows gain powerful new capabilities for organizing complex job relationships with two major enhancements: add_workflow/4 and add_many/4.

Use add_workflow/4 to nest entire workflows within others to create hierarchical job dependencies. This makes complex workflows more maintainable by grouping related jobs together:

extr_flow =
  Workflow.new(workflow_name: "extract")
  |> Workflow.add(:extract, WorkerA.new(%{source: "database"}))
  |> Workflow.add(:transform, WorkerB.new(%{}), deps: :extract)

# Add sub-workflow as a dependency
Workflow.new()
|> Workflow.add(:setup, WorkerC.new(%{mode: "initialize"}))
|> Workflow.add_workflow(:extract, extr_flow, deps: :setup)
|> Workflow.add(:finalize, WorkerC.new(%{}), deps: :extract)

Workflows can depend on other workflows, and downstream deps will wait until the sub-workflow completes before executing.

Need to run similar jobs in parallel? Use add_many/4 to add multiple jobs with a single dependency name:

# Add multiple email jobs that can run in parallel
email_jobs = Enum.map(users, &EmailWorker.new(%{user_id: &1.id}))

workflow =
  Workflow.new()
  |> Workflow.add_many(:emails, email_jobs)
  |> Workflow.add(:report, ReportWorker.new(), deps: :emails)

The add_many/4 step creates a sub workflow from either a list or a map, and the full recorded results can be extracted with a single call:

def process(job) do
  map_of_results = Workflow.all_recorded(job, with_subs: true)
end

🌊 Cascading Workflows

The most significant workflow enhancement in v1.6 is the ability to build workflows with cascading functions. Using add_cascade/3, you can build workflows using regular Elixir functions, with results from each step automatically passed to subsequent steps:

Workflow.new()
|> Workflow.put_context(%{user_id: 123})
|> Workflow.add_cascade(:fetch, &MyApp.fetch_user/1)
|> Workflow.add_cascade(:process, &MyApp.process_user/1, deps: :fetch)
|> Workflow.add_cascade(:notify, &MyApp.send_notification/1, deps: :process)

Each function receives a context map containing the workflow's shared context and any results from its upstream dependencies. For example, the process_user/1 function automatically receives a map with the results of fetch_user/1 under the :fetch key.

Additionally, add_cascade/3 supports fan-out operations with {enumerable, function/2} tuples:

items = [1, 2, 3]

Workflow.new()
|> Workflow.add_cascade(:process_items, {items, &process_item/2})
|> Workflow.add_cascade(:finalize, &finalize/1, deps: :process_items)

The process_item/2 function is called for each item in the enumerable, with the first argument receiving the item and the second receiving the context.

🪴 Grafting Sub-Workflows

Grafting allows sub-workflows to be attached after a workflow has started. With grafting, you define placeholders that are expanded into full sub-workflows later when data becomes available. Downstream jobs that depend on the graft will automatically wait for the sub-workflow, not just the graft job itself.

Grafting solves the tricky problem of inserting multiple jobs into the middle of a workflow. For example, to wait until an unknown number of notifications are sent before summarizing the output:

Workflow.new()
|> Workflow.put_context(%{account_id: account_id})
|> Workflow.add_graft(:users, &expand_users/1)
|> Workflow.add_cascade(:summary, &send_summary/1, deps: :users)

The expand_users/1 function takes the account_id, fetches all of the users for the account, then expands the graft job into a sub-workflow that sends each user a notice:

def expand_users(%{account_id: account_id}) do
  user_ids = MyApp.Account.get_user_ids!(account_id)

  Workflow.apply_graft({user_ids, &send_notice/2})
end

🖼️ Context and Workflow Status

Workflows that rely on common data can now share data without duplicating arguments using put_context/3:

workflow =
  Workflow.new()
  |> Workflow.put_context(%{user_id: 123, app_version: "1.2.3"})
  |> Workflow.add(:job_a, WorkerA.new(%{}))
  |> Workflow.add(:job_b, WorkerB.new(%{}))

# Later in a worker:
def process(job) do
  context = Workflow.get_context(job)
  # Use context map...
end

It's now easier to check workflow progress with status/1, which provides execution stats:

%{
  total: 5,
  state: :executing,
  counts: %{completed: 3, executing: 2},
  duration: 15_620
} = Workflow.status(workflow_id)

🧰 Queue Partitioning Overhaul

Queue partitioning is completely redesigned for dramatic performance improvements. Jobs are now assigned partition keys on insert rather than at runtime, enabling more efficient querying and eliminating head-of-line blocking when one partition has many jobs.

The new design has none of the issues of the previous solution:

Job processing is completely fair. Jobs from a single partition can't block processing of other partitions after bulk insert. No priority or scheduling workarounds are necessary.
Querying in partitioned queues relies on a single, partial index
Partitioning uses a single, optimized query without any unions or dynamic sections. That allows ecto to prepare and cache a single plan for faster fetching and less memory usage.

In a benchmark of 10k jobs spread across 20 partitions (200k jobs), processing took 17s in v1.6, down from 360s in v1.5 (20x faster) with far less load on the database.

🧨 Global Burst Mode

Global partitioning gained an advanced feature called "burst mode" that allows you to maximize throughput by temporarily exceeding per-partition global limits when there are available resources.

Each global partition is typically restricted to the configured allowed value. However, with burst mode enabled, the system can intelligently allocate more jobs to each active partition, potentially exceeding the per-partition limit while still respecting the overall queue concurrency.

This is particularly useful when:

You have many potential partitions but only a few are active at any given time
You want to maximize throughput while maintaining some level of fairness between partitions
You need to ensure your queues aren't sitting idle when capacity is available

Here's an example of a queue that will 5 jobs from a single partition concurrently under load, but can burst up to 100 for a single partition when there is available capacity:

queues: [
  exports: [
    global_limit: [
      allowed: 5,
      burst: true,
      partition: [args: :tenant_id]
    ],
    local_limit: 100
  ]
]

🍋 Preserve Queue Updates

DynamicQueues now preserves queue changes made at runtime across application restarts. This brings two key improvements:

Runtime changes to queues (via Web or CLI) persist until explicitly changed in configuration
A new :automatic sync mode that can manage queue deletions based on configuration

# Automatic mode - Deletes queues missing from configuration
config :my_app, Oban,
  plugins: [{DynamicQueues, sync_mode: :automatic, queues: [...]}]

In automatic mode, any queue that exists in the database but isn't defined in the configuration will be automatically deleted during startup. This is useful when you want to ensure your runtime queue configuration exactly matches what's defined in your application config.

Now when you pause a queue through the dashboard or change its limits via API, those changes will persist across application restarts until you explicitly update those options in your configuration.

🎨 Decorator Enhancements

Decorated jobs gain a few new capabilities. You can now use current_job/0 to access the underlying job struct from within decorated functions, making it easier to work with job context or pass job details to other functions. Additionally, you can mark any decorated job as recorded at runtime with the recorded option, enabling workflow composition and return value access without separate modules.

defmodule Business do
  use Oban.Pro.Decorator

  @job queue: :default, recorded: true
  def process_account(account_id) do
    job = current_job()

    IO.puts("Processing account #{account_id} with job #{job.id}")

    {:ok, %{processed: true}}
  end
end

v1.6.2

July 17, 2025

Bug Fixes

[Workflow] Only atomize top level keys in recorded cascade

Cascade functions may record arbitrary terms, including maps with purposeful string keys and structs. Now only the top level of recorded maps are atomized and nested map keys are left as is.
[Worker] Pass changes from before_process in after_process

Changes made to jobs in a before_process hook are now passed through to after_process. Previously, only modified args were retained.
[Plugins] Add logging for unexpected messages to all processes

Add a catch all to log a warning when plugins, or other GenServers, receive unexpected messages.
[Refresher] Prevent simultaneous producer refresh deadlocks.

Deleting the same producer records from multiple nodes simultaneously could lead to a deadlock. Now, only the leader will perform deletions.

v1.6.1

July 8, 2025

Bug Fixes

[Workflow] Include grafted workflows in cascade context

The results from a graft are now passed down to the context for cascade jobs. In addition, nested map keys from grafted cascade jobs are atomized when possible.
[Workflow] Increase rescue limit to capture more potentially stuck workflows

The decreased limit failed to rescue stalled workflows in especially busy systems.
[Worker] Dump structs nested within maps in structured args

Structs within a map weren't dumped to a map the way the were at the top level. Now structs are uniformly escaped at the top level, within embeds, within lists, and within maps.
[Smart] Cast states to strings on unique violation

Handling unique violations from the available->executing transition correctly uses strings instead of atoms now.

v1.6.0

June 30, 2025

Rerun Index Migrations

The workflow indexes changed between v1.6.0-rc.5 and v1.6.0. If you're upgrading from a v1.6 release candidate, you should rerun index migrations:
Oban.Pro.Migration.down(only: :indexes)
Oban.Pro.Migration.up(only: :indexes)

Bug Fixes

[Workflow] Optimize workflow deps queries for index use

Revamped queries are able to fully utilize workflow indexes for all dependency checks. The resulting queries remain extremely fast even in systems with thousands of parallel workflows.
[Workflow] Correct add_cascade_opts type definition.

The type incorrectly combined Job.option and a list of add options, rather than forming a single list of types.
[Refresher] Ensure the refresher flushes continuously.

This is a critical bug that would cause producer records to accumulate indefinitely after the first 15 seconds of the refresher running.
[Smart] Block chain preparation during ack updates

Acking chains while inserting a new job in the chain was prone to a race condition that would lead to "stuck" chains. This augments the chain insertion query so that it blocks until other jobs in the chain are committed to prevent transactional races.
[Smart] Correct locking when flush handlers present

Nodes must acquire a lock to safely coordinate workflow, batch, and chain queries while acking. This corrects the logic used to check whether there are pending flushes during a transaction.

v1.6.0-rc.5

June 9, 2025

Enhancements

[Pro] Address deprecation warnings from Elixir v1.19.
[Refresher] Simplify refreshing logic for stale producers.

Now that producer tracking is centralized, the queries for purging stale records can be much simpler and purge records even sooner.

Bug Fixes

[Decorator] Ensure decorated modules are loaded on process.

Ensure the module is loaded before converting the function to prevent unknown atom issues.
[Migration] Refactor migration versioning to support separate schema/index tracking

Split version tracking to independently track schema and index migrations, enabling more granular migration control when using the :only option.
[Migration] Fix migration version check and warning logic when using split migrations.
[Workflow] Fix sub-workflow context merging for cascades

Scoping within a sub-workflow prevented correctly fetching recorded cascade values. Now context is merged between parent and child cascade jobs.
[Workflow] Prevent seq scan from workflow deps query

The workflow deps query checked for the presence of a 'workflow_id' on the wrong table reference, which prevented the correct usage.
[Workflow] Correctly resolve downstream graft dependencies

A race condition in deps checking could allow workflow jobs that dependeded on a graft job to execute too soon.

v1.6.0-rc.4

May 29, 2025

Enhancements

[Workflow] Introduce add_graft/3 and apply_graft/2.

Graft jobs serve as placeholders in a workflow. When a graft job executes, it must build and attach a new sub-workflow at that point. Any downstream jobs that depend on the grafter will wait for the entire grafted sub-workflow to complete.
[Plugins] Enable default logger output for all plugins.

All plugins now implement a callback that allows their output to be logged automatically by the default Oban.Telemetry logger.

Bug Fixes

[Workflow] Optimize workflow deps query to allow index usage.

The query now uses an index enabled operator rather than a function. The resulting is much uglier, but hundreds of times faster on larger tables.
[Smart] Safely decode legacy rate limit data from existing producers.
[Smart] Expire locally cached producers after a brief time to prevent memory leaks and ensure producers are cleaned up shortly after a crash.
[Migration] Record separate schema and index migration version

The recorded migration version now indicates if only schemas or indexes were migrated, which ensures an indexes migration will always run after a schemas migration.
[Migration] Make dropping the old workflow index concurrent when index-only migrations are enabled.
[Migration] Respect dynamic repo configuration during version checks.

v1.6.0-rc.3

April 18, 2025

Enhancements

[Migration] Add concurrent migration options.

The new only option makes it possible to split migrations into schema and index changes. That allows for concurrent index creation to prevent locking the jobs table during migrations for tables with a large number of retained jobs.

This removes the need to set concurrently manually, deriving instead from the only option itself. It also ensures all down migrations are using the concurrently option as well.

Bug Fixes

[Smart] Track partition keys on insert.

For queues without a backlog, but frequent inserts, partitioned jobs wouldn't be fetched immediately. Now partition keys are added to the available keys cache immediately on insert for the current node.
[Smart] Briefly retain tracked global keys.

Retaining tracked keys for several cycles beyond when they've had something to fetch helps compensate for partitions without a backlog of jobs
[Smart] Lower partition keys cache TTL.

The TTL is reduced to 5s to reduce the lag before new partitions are processed in most systems. This setting is now documented as well, in the Smart engine documentation.
[Workflow] Correct logic for sub-workflow dependency checks.

Jobs with dependencies on sub-workflows wouldn't be moved to the available state in some situations. This corrects the order of status checks, and merges staging with rescuing logic.
[Workflow] Ensure arg value is always set for cascade jobs.

The default nil value is stripped from the encoded args map. This changes the default to a non-nil value so it's retained for all cascade jobs.

v1.6.0-rc.2

April 11, 2025

Enhancements

[Workflow] Add cascading workflow mode with context-aware functions.

Cascade mode simplifies building workflows that rely on context or output from previous functions. Cascade jobs use a strictly defined function signature and conventions to make expressive workflows that "cascade" context between functions according to dependencies.
[Workflow] Create and render subgraphs for sub-workflows.

Graphs for sub-workflows now correctly track dependencies between jobs and subs, and sub-graphs are created for sub-workflows. That includes subgraph rendering for to_dot/1 and to_mermaid/1 rendered output.
[Workflow] Enhance status/1 output with the workflow id, atom statuses, and sub workflows.

Sub-workflows are now included in the output of the status/1 function, along with the workflow id. The workflow status is now reported as an atom rather than a string, and there is a full typespec for the return value.
[Worker] Increase the default recorded value to 64mb

The default record value of 32k was highly conservative and a poor fit for decorated jobs or cascading workflows. The default is now much larger than any value that should be stored as a return value.

Bug Fixes

[Workflow] Fix grouping of workflow and sup workflow in query.

The workflow query lacked the grouping parenthesis required to select the correct partial index.
[DynamicCron] Remove double wrapping of cron_opt keyword

The cron_opt type changed to a keyword list, but it was still wrapped in a list. This removes the double wrapping.

v1.6.0-rc.1

March 28, 2025

Bug Fixes

[Smart] Safely accept legacy rate limit windows.

The legacy windows format is a list of maps, not a single map. This adds a custom type to translate legacy values, and correctly handles those values in the limiter itself to prevent crashes.
[Smart] Safely handle legacy global tracking data.

During the initial upgrade, legacy producers will have a different shape of tracked data for partitioned and non-partitioned queues. This addds a translation step to prevent crashing new producers.

v1.6.0-rc.0

March 28, 2025

Enhancements

[Workflow] Add add_workflow/4 for creating nested sub-workflows.

Sub-workflows simplify organizing and managing complex job dependencies by grouping related jobs:
```
Workflow.new()
|> Workflow.add_workflow(:sub, MyApp.SubWorkflow.new())
|> Workflow.add(:final, FinalWorker.new(%{}), deps: :sub)
```
Downstream dependencies can reference the sub-workflow as a whole, including any jobs that may be dynamically added during runtime.

[Workflow] Add add_many/4 function for creating fan-out sub-workflows.

This helper enables adding multiple jobs to a workflow with a single name:

email_jobs = Enum.map(users, &EmailWorker.new(%{user_id: &1.id})

workflow =
  Workflow.new()
  |> Workflow.add_many(:emails, email_jobs))
  |> Workflow.add(:report, ReportWorker.new(), deps: :emails)

[Workflow] Add sub-workflow metadata and query support.

Support retrieving sub-workflow jobs with new with_subs option. This makes it possible to fetch all sub-workflow jobs, including all recordings, with a single call.
```
Workflow.all_recorded(job, with_subs: true)
```
[Workflow] Add put_context/3 for sharing values between workflow jobs.

The helper inserts a completed job with a recorded value for all other jobs to fetch. This simplifies context sharing and eliminates the need to add the same values as args to all jobs in a workflow.
```
workflow = Workflow.put_context(workflow, %{id: 123, name: "Alice"})

def process(job) do
  %{id: id, name: name} = Workflow.get_recorded(job, :context)
end
```
[Workflow] Add status/1 helper for getting workflow execution information.

The new helper simplifies gathering runtime information about a workflow, including the name, total jbs, overall state of the workflow, elapsed duration, state counts, and timestamps.
```
%{total: 5, counts: %{completed: 5}} = Workflow.status(workflow.id)
```
[Workflow] Add retry_jobs/2 for retrying workflow jobs.

The new helper function will retry jobs in the workflow and hold jobs with dependencies accordingly.
```
Workflow.retry_jobs(job)
```
[Workflow] Optimize workflow indexes and eliminate containment queries.

A new partial workflow index checks held jobs, workflow ID, and sub-workflow ID at once, eliminating the need for a GIN index on meta and args.

[Smart] Add burst mode for global partitioned queues.

Allows partitioned queues to exceed per-partition limits when capacity is available, while still respecting the overall concurrency limit:

config :my_app, Oban,
  queues: [
    exports: [
      global_limit: [
        allowed: 5,
        burst: true,
        partition: [args: :tenant_id]
      ],
      local_limit: 100
    ]
  ]

[Smart] Allow using :meta in partition configuration.

Now queue partitioning can use values from the job's metadata, including composition values such as workflow_id.
```
config :my_app, Oban,
  queues: [media: [global_limit: [allowed: 1, partition: [meta: :workflow_id]]]]
```
[Smart] Overhaul queue partitioning for significant performance improvements.

Adds a generated partition_key column with a partial index to oban_jobs. Jobs are pre-partitioned on insert based on queue configuration, which simplifies partition queries and provides ~20x faster performance.
[Smart] Centralize producer refresh and cleanup.

Producer refreshing and cleanup is now centralized to reduce database load. Rather than one transaction per queue per node every 30 seconds, there is now one transaction per node every 30 seconds. This offers significant query savings in systems running numerous queues.
[Smart] Provide alternate map hashing to avoid collision.

The use of phash2/1 could cause collisions between values such as uuids, which led to false positives for unique checks. This adds an alternate mechanism for hashing that avoids any such collisions, but it is opt-in for backward compatibility.

To switch, set the following compile time config:
```
config :oban_pro, Oban.Pro.Utils, safe_hash: true
```
[Decorator] Add current_job/0 to decorated modules.

The current_job/0 function allows decorated jobs to access the underlying job for inspection or to pass as context to other functions such as Backoff, or Workflow helpers.
[Decorator] Support recording as a runtime option.

Any job may now be marked as recorded at runtime, including decorated jobs:
```
@job queue: :processing, recorded: true
def process_account(account_id) do
  # ...
end
```
[DynamicPrioritizer] Expand options for finer control.

New :limit and :max_priority options add control for many jobs are prioritized and their maximum priority level.
[DynamicQueues] Preserve runtime updates without configuration changes.

Add automatic sync_mode to insert/update/delete queues based on configuration and prevent overwriting runtime updates to queues unless the configuration changes.
[DynamicCron] Use optimized cron expression calculation.

The new last_at/2 and next_at/2 cron expression functions are vastly faster and more efficient than the previous implementation. This improves cron job insertion performance in guaranteed mode.

Bug Fixes

[Migration] Drop tables entirely during initial down migration

The presence of a uniq_key column prevents dropping the partitioned table after a rollback. The specific sequence of migrating and backfilling is unlikely to be used in the real world.