Optimizers¶

Divi provides built-in support for optimizing quantum programs using three distinct methods, each suited to different problem types and user requirements.

All optimizers can be accessed through the divi.qprog.optimizers module. Scipy-based optimizers rely on the ScipyMethod enum to specify the optimizer used.

Monte Carlo Optimization¶

The Monte Carlo [1] method in Divi is a stochastic global optimization approach. It works by randomly sampling the parameter space and selecting configurations that minimize the target cost function. This method is particularly useful when:

The optimization landscape is rugged or non-convex.
Gradients are not available or are unreliable.
A rough global search is preferred before local refinement.

Monte Carlo optimization can help identify promising regions in high-dimensional parameter spaces before applying more refined methods.

Configure MonteCarloOptimizer by passing population_size (the number of parameter sets evaluated per iteration) and optionally n_best_sets (how many top-performing sets are carried to the next iteration) to its constructor. The read-only n_param_sets property then reflects the configured population size.

SciPy Optimizers¶

Divi provides several SciPy-based optimizers through the ScipyOptimizer class:

Nelder-Mead¶

Nelder-Mead [2] is a gradient-free, simplex-based optimization algorithm. It is ideal for local optimization in low to moderate dimensional spaces. The method iteratively refines a simplex (a geometrical figure defined by a set of parameter vectors) by evaluating cost function values and applying operations such as reflection, expansion, and contraction.

Use Nelder-Mead when:

Your problem is continuous but noisy.
Gradients are unavailable or expensive to compute.
You are tuning parameters in a relatively low-dimensional space.

from divi.qprog.optimizers import ScipyOptimizer, ScipyMethod

optimizer = ScipyOptimizer(method=ScipyMethod.NELDER_MEAD)

L-BFGS-B¶

L-BFGS-B (Limited-memory Broyden–Fletcher–Goldfarb–Shanno with Bound constraints) [3] is a quasi-Newton method that leverages gradient information to efficiently converge to a local minimum. In Divi, gradient calculation is performed using the parameter shift rule, a technique well-suited to quantum circuits that allows for analytical gradient computation by evaluating the function at shifted parameter values.

Divi computes these parameter shifts in parallel, significantly reducing wall-clock time for gradient evaluations.

Use L-BFGS-B when:

You require fast convergence to a local minimum.
Your cost function is smooth and differentiable.

optimizer = ScipyOptimizer(method=ScipyMethod.L_BFGS_B)

COBYLA¶

COBYLA (Constrained Optimization BY Linear Approximations) [4] is a gradient-free, local optimization method—like Nelder-Mead—that supports nonlinear inequality constraints. It constructs successive linear approximations of the objective function and constraints, iteratively refining the solution within a trust region.

Use COBYLA when:

Your optimization problem includes constraints.
Gradients are inaccessible or too noisy.
You seek a reliable optimizer for low to moderate-dimensional spaces.

COBYLA is also a good choice of optimizer when trying out QAOA for a new problem/experimenting, but your mileage may vary.

optimizer = ScipyOptimizer(method=ScipyMethod.COBYLA)

PyMOO Optimizers¶

Divi also supports evolutionary algorithms through PyMOO:

CMA-ES (Covariance Matrix Adaptation Evolution Strategy)¶

CMA-ES [5] is a stochastic, derivative-free method for numerical optimization of non-linear or non-convex continuous optimization problems.

from divi.qprog.optimizers import PymooOptimizer, PymooMethod

optimizer = PymooOptimizer(method=PymooMethod.CMAES)

Differential Evolution¶

Differential Evolution [6] is a method that optimizes a problem by iteratively trying to improve a candidate solution with regard to a given measure of quality.

optimizer = PymooOptimizer(method=PymooMethod.DE)

Grid Search¶

The GridSearchOptimizer performs an exhaustive evaluation of every point on a user-defined parameter grid and returns the best-performing combination. It is designed for low-dimensional parameter spaces (1–3 parameters) where you want full visibility into the loss landscape.

Use Grid Search when:

You have a small number of variational parameters (e.g. QAOA with 1 layer: γ and β).
You want to visualize the loss landscape.
You need a deterministic, reproducible sweep.
You want to warm-start a variational optimizer from the best grid point.

import numpy as np

from divi.qprog.optimizers import GridSearchOptimizer

# Auto-generate a 20×20 grid over [0, 2π] × [0, π]
optimizer = GridSearchOptimizer(
    param_ranges=[(0, 2 * 3.14159), (0, 3.14159)],
    grid_points=20,
)

# Or supply an explicit grid
optimizer = GridSearchOptimizer(
    param_grid=np.array([[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]])
)

The grid is evaluated in a single pass regardless of max_iterations. A warning is issued if max_iterations > 1 is supplied.

Note

Grid search scales as grid_points ** n_params, so it becomes impractical beyond ~3 parameters. For higher dimensions, use MonteCarloOptimizer or CMA-ES instead.

Choosing the Right Optimizer¶

For :class:`~divi.qprog.algorithms.VQE`:

L-BFGS-B: Best for smooth, differentiable landscapes with good initial parameters
Monte Carlo: Excellent for exploration and avoiding local minima
COBYLA: Good for constrained problems or when gradients are unreliable
Nelder-Mead: Robust choice for noisy or discontinuous landscapes

For :class:`~divi.qprog.algorithms.QAOA`:

Grid Search: Best for 1–2 layer QAOA where you want full landscape visibility
COBYLA: Often the best starting point for QAOA problems
Nelder-Mead: Good for noisy landscapes and parameter initialization
Monte Carlo: Excellent for global exploration and avoiding barren plateaus
L-BFGS-B: Use when you have good initial parameters and smooth landscapes

For PyMOO Optimizers:

CMA-ES: Excellent for high-dimensional parameter spaces and when you need robust global optimization. Particularly effective for VQE with many parameters.
Differential Evolution: Good for multimodal optimization landscapes and when you need to escape local minima. Works well for QAOA parameter optimization.

For Hyperparameter Sweeps:

Monte Carlo: Best for initial exploration across parameter ranges
L-BFGS-B: Use for fine-tuning after Monte Carlo exploration
Nelder-Mead: Robust fallback when other methods fail
CMA-ES: Excellent for high-dimensional sweeps with many parameters

Quantum-Specific Considerations:

Barren Plateaus: Use MonteCarloOptimizer or CMA-ES to avoid getting trapped in flat regions
Parameter Initialization: Start with small random values (typically [-0.1, 0.1]) for better convergence
Circuit Depth: Deeper circuits benefit from more robust optimizers like CMA-ES or MonteCarloOptimizer
Noise Resilience: Nelder-Mead and COBYLA are more robust to quantum noise than gradient-based methods

Early Stopping¶

Long-running optimizations can waste resources once convergence has effectively stalled. Divi’s EarlyStopping controller lets you terminate the loop automatically based on configurable criteria.

Pass an EarlyStopping instance to any variational algorithm:

from divi.qprog import VQE, EarlyStopping
from divi.qprog.optimizers import ScipyOptimizer, ScipyMethod

vqe = VQE(
    molecule=molecule,
    backend=backend,
    optimizer=ScipyOptimizer(method=ScipyMethod.COBYLA),
    max_iterations=200,
    early_stopping=EarlyStopping(
        patience=10,
        min_delta=1e-5,
    ),
)

vqe.run()
print(f"Stopped at iteration {vqe.current_iteration}")
print(f"Reason: {vqe.stop_reason}")        # e.g. "patience_exceeded"
print(f"Converged: {vqe.optimize_result.success}")  # False for early stop

Stopping Criteria¶

Three criteria are available and are evaluated in priority order after every iteration. The first one that fires stops the loop.

Patience (always active) — Stop when the cost has not improved by at least min_delta for patience consecutive iterations.
```
EarlyStopping(patience=10, min_delta=1e-4)
```
Gradient norm (optional) — Stop when the L2 norm of the gradient falls below grad_norm_threshold. Only effective with gradient-based optimizers such as ScipyOptimizer(method=ScipyMethod.L_BFGS_B).
```
EarlyStopping(patience=10, grad_norm_threshold=1e-6)
```
Cost variance (optional) — Stop when the rolling variance of the last variance_window cost values drops below variance_threshold. Useful for noisy landscapes where cost oscillates but no longer trends downward.
```
EarlyStopping(
    patience=10,
    variance_window=20,
    variance_threshold=1e-8,
)
```

All three criteria can be enabled simultaneously; the first one that triggers will stop the loop.

After the Run¶

After run() completes, use stop_reason to determine why optimization ended:

None — optimization ran to max_iterations without triggering early stopping
"patience_exceeded" — cost plateaued
"gradient_below_threshold" — gradient vanished
"cost_variance_settled" — cost variance settled

The optimize_result attribute is always populated and its message field includes the stop reason.

Inspecting Optimizer Results¶

After running a variational algorithm, you can inspect the raw result object returned by the underlying optimizer via the optimize_result property. This exposes optimizer-specific diagnostics such as:

nfev – number of cost-function evaluations
njev – number of Jacobian (gradient) evaluations (gradient-based optimizers)
nit – number of iterations completed
success – whether the optimizer converged
message – convergence or termination message

program.run()

result = program.optimize_result
if result is not None:
    print(f"Function evaluations: {result.nfev}")
    print(f"Converged: {result.success}")

Note

optimize_result is always populated after run() completes. When optimization converges normally, success is True. When early stopping or cancellation terminates the run, success is False and the message field describes the reason. The available attributes depend on the optimizer; see scipy.optimize.OptimizeResult for the full specification.

Next Steps¶

tutorials/ — runnable examples
Ground-State Energy Estimation with VQE and Combinatorial Optimization with QAOA and PCE — algorithm-specific guidance
Program Ensembles and Workflows — optimizers in large-scale sweeps and ensembles

Optimizers¶

Monte Carlo Optimization¶

SciPy Optimizers¶

Nelder-Mead¶

L-BFGS-B¶

COBYLA¶

PyMOO Optimizers¶

CMA-ES (Covariance Matrix Adaptation Evolution Strategy)¶

Differential Evolution¶

Grid Search¶

Choosing the Right Optimizer¶

Early Stopping¶

Stopping Criteria¶

After the Run¶

Inspecting Optimizer Results¶

Next Steps¶

References¶