slsqp_jax package ¶

Parameters:

hvp_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']])
g (Float[Array, 'n'])
A (Float[Array, 'm n'])
b (Float[Array, 'm'])
active_mask (Bool[Array, 'm'])
precond_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']] | None)
free_mask (Bool[Array, 'n'] | None)
d_fixed (Float[Array, 'n'] | None)
adaptive_tol (Float[Array, ''] | float | None)

Args:

hvp_fn: Hessian-vector product function v -> B @ v. g: Linear term (gradient of objective). A: Combined constraint matrix (m x n). b: Combined RHS vector (m,). active_mask: Boolean mask (m,) indicating active constraints. precond_fn: Optional preconditioner v -> M @ v where M ~ B^{-1}. free_mask: Optional boolean mask (n,). When provided, only

variables with free_mask[i] = True are optimized.

d_fixed: Values for fixed variables (n,). Required when: free_mask is provided.
adaptive_tol: Optional Eisenstat-Walker tolerance override.: When provided, overrides the solver’s default convergence tolerance for this call only.

Returns:

InnerSolveResult with the direction, multipliers, and convergence flag.

build_projection_context(hvp_fn, g, A, b, active_mask, precond_fn=None, free_mask=None, d_fixed=None)[source]¶

Build a reusable projector + multiplier-recovery context.

Composed strategies (e.g. HRInexactSTCG) call this on the underlying inner solver to obtain its null-space projector, particular solution and multiplier-recovery closure without running the projector’s own CG loop.

The default implementation raises NotImplementedError so full-KKT solvers (MinresQLPSolver) cleanly opt out — they have no separate projection step and therefore cannot supply the inexact-projector W̃_k that HR Algorithm 4.5 needs.

Return type:

Parameters:

hvp_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']])
g (Float[Array, 'n'])
A (Float[Array, 'm n'])
b (Float[Array, 'm'])
active_mask (Bool[Array, 'm'])
precond_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']] | None)
free_mask (Bool[Array, 'n'] | None)
d_fixed (Float[Array, 'n'] | None)

__init__()¶

Return type:: None

class slsqp_jax.HRInexactSTCG[source]¶

Heinkenschloss-Ridzal (2014) Algorithm 4.5 — STCG with inexact null-space projections.

Composes an existing null-space inner solver (ProjectedCGCholesky or ProjectedCGCraig) to obtain its projector W̃_k, particular solution d_p and multiplier-recovery closure, then runs a separate CG iteration on top whose three textbook three-term-recurrence cancellations are replaced by full H-conjugacy reorthogonalisation against every previous search direction.

See AGENTS.md (“Pluggable Inner QP Solvers” → HRInexactSTCG) for the full algorithmic discussion and references.

Attributes:

inner: Composed null-space inner solver supplying the: projector and multiplier-recovery infrastructure. Must implement build_projection_context; the saddle-point MinresQLPSolver does not and will raise on the first solve call.
max_cg_iter: Static upper bound on the number of inner CG: iterations. Determines the size of the reorth buffers.
cg_tol: Relative convergence tolerance for the projected: residual ‖z̃_i‖ ≤ tol · ‖r̃_0‖.
cg_regularization: Curvature-guard threshold δ² used by: the SNOPT-style scale-invariant short-circuit ⟨p̃, H p̃⟩ ≤ δ² ‖p̃‖². Defaults to 1e-6; set to 0.0 to disable.

inner: AbstractInnerSolver¶

max_cg_iter: int¶

cg_tol: Float[Array, ''] | float¶

cg_regularization: float = 1e-06¶

build_projection_context(hvp_fn, g, A, b, active_mask, precond_fn=None, free_mask=None, d_fixed=None)[source]¶

Build a reusable projector + multiplier-recovery context.

Return type:

Parameters:

hvp_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']])
g (Float[Array, 'n'])
A (Float[Array, 'm n'])
b (Float[Array, 'm'])
active_mask (Bool[Array, 'm'])
precond_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']] | None)
free_mask (Bool[Array, 'n'] | None)
d_fixed (Float[Array, 'n'] | None)

solve(hvp_fn, g, A, b, active_mask, precond_fn=None, free_mask=None, d_fixed=None, adaptive_tol=None)[source]¶

Solve the equality-constrained QP subproblem.

Solves:

minimize    (1/2) d^T B d + g^T d
subject to  A[active] d = b[active]
            d[i] = d_fixed[i]  for i where free_mask[i] is False

where B is given implicitly via hvp_fn(v) = B @ v.

Return type:

Parameters:

hvp_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']])
g (Float[Array, 'n'])
A (Float[Array, 'm n'])
b (Float[Array, 'm'])
active_mask (Bool[Array, 'm'])
precond_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']] | None)
free_mask (Bool[Array, 'n'] | None)
d_fixed (Float[Array, 'n'] | None)
adaptive_tol (Float[Array, ''] | float | None)

Args:

variables with free_mask[i] = True are optimized.

d_fixed: Values for fixed variables (n,). Required when: free_mask is provided.
adaptive_tol: Optional Eisenstat-Walker tolerance override.: When provided, overrides the solver’s default convergence tolerance for this call only.

Returns:

InnerSolveResult with the direction, multipliers, and convergence flag.

__init__(inner, max_cg_iter, cg_tol, cg_regularization=1e-06)¶

Parameters:

inner (AbstractInnerSolver)
max_cg_iter (int)
cg_tol (Float[Array, ''] | float)
cg_regularization (float)

Return type:

None

class slsqp_jax.InnerSolveResult[source]¶

Bases: NamedTuple

Result from an inner equality-constrained QP solve.

Attributes:

d: Search direction. multipliers: Lagrange multipliers (shape (m,); entries for

inactive constraints are zero).

converged: True when the inner Krylov / projection iteration: satisfied its tolerance.
proj_residual: Post-solve constraint residual ||A d - b||: (Euclidean norm, restricted to active rows). Always 0 for null-space solvers (CG / CRAIG) where feasibility is enforced structurally; non-zero for MinresQLPSolver where it reflects the floor of the M-metric range-space projection after iterative refinement.
n_proj_refinements: Number of M-metric projection refinement: rounds actually applied. Always 0 for null-space solvers. At most MinresQLPSolver.proj_refine_max_iter.
projected_grad_norm: Norm of the projected initial gradient: W̃_k g that the inner solver actually iterated against (HR 2014, Theorem 3.5). This is the noise-aware stationarity proxy: when the outer SQP enables use_inexact_stationarity, the run is allowed to converge once this value drops below rtol * max(mu_max, 1) (filterSQP eq. 6 with the shared denominator from slsqp_jax.slsqp.termination.compute_mu_max()). Defaults to inf so that solvers which do not produce this quantity (i.e. anything other than HRInexactSTCG) cannot accidentally satisfy a < rtol test even if the user toggles the flag — the inexact path silently degrades to “never converges this way”.

d: Float[Array, 'n']¶: Alias for field number 0

multipliers: Float[Array, 'm']¶: Alias for field number 1

converged: Bool[Array, '']¶: Alias for field number 2

proj_residual: Float[Array, '']¶: Alias for field number 3

n_proj_refinements: Int[Array, '']¶: Alias for field number 4

projected_grad_norm: Float[Array, '']¶: Alias for field number 5

class slsqp_jax.MinresQLPSolver[source]¶

Preconditioned MINRES-QLP on the full saddle-point KKT system.

Solves the KKT system directly:

[B    A^T] [d]       [-g]
[A    0  ] [lambda] = [b ]

using PMINRES-QLP (Choi, Paige & Saunders, SISC 2011, Table 3.5) with a block-diagonal SPD preconditioner:

M = [B_diag^{-1}    0      ]
    [0              S^{-1} ]

where B_diag = diag(B_0) (L-BFGS diagonal) and S = A B_diag^{-1} A^T is the Schur complement.

After PMINRES-QLP returns the iterate d, an M-metric range-space projection drives A d = b on the active rows. The single shot is followed by up to proj_refine_max_iter rounds of iterative refinement, each costing one matvec + one Schur back-solve (no refactorisation). Refinement squares the relative feasibility error per round. See HR (2014, Algorithm 4.18 step 1(a)) for the motivation.

max_iter: int = 200¶

tol: float = 1e-10¶

max_cg_iter: int = 50¶

proj_refine_max_iter: int = 3¶

proj_refine_rtol: float = 1e-10¶

proj_refine_atol: float = 1e-14¶

solve(hvp_fn, g, A, b, active_mask, precond_fn=None, free_mask=None, d_fixed=None, adaptive_tol=None)[source]¶

Solve the equality-constrained QP subproblem.

Solves:

minimize    (1/2) d^T B d + g^T d
subject to  A[active] d = b[active]
            d[i] = d_fixed[i]  for i where free_mask[i] is False

where B is given implicitly via hvp_fn(v) = B @ v.

Return type:

Parameters:

hvp_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']])
g (Float[Array, 'n'])
A (Float[Array, 'm n'])
b (Float[Array, 'm'])
active_mask (Bool[Array, 'm'])
precond_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']] | None)
free_mask (Bool[Array, 'n'] | None)
d_fixed (Float[Array, 'n'] | None)
adaptive_tol (Float[Array, ''] | float | None)

Args:

variables with free_mask[i] = True are optimized.

d_fixed: Values for fixed variables (n,). Required when: free_mask is provided.
adaptive_tol: Optional Eisenstat-Walker tolerance override.: When provided, overrides the solver’s default convergence tolerance for this call only.

Returns:

InnerSolveResult with the direction, multipliers, and convergence flag.

__init__(max_iter=200, tol=1e-10, max_cg_iter=50, proj_refine_max_iter=3, proj_refine_rtol=1e-10, proj_refine_atol=1e-14)¶

Parameters:

max_iter (int)
tol (float)
max_cg_iter (int)
proj_refine_max_iter (int)
proj_refine_rtol (float)
proj_refine_atol (float)

Return type:

None

class slsqp_jax.ProjectedCGCholesky[source]¶

Projected CG with Cholesky-based null-space projection.

This is the original implementation: Cholesky-factor A A^T (with regularization), use it for the null-space projector and particular solution, run CG in the null space, and recover multipliers via iterative refinement.

When use_constraint_preconditioner is True and a preconditioner is provided, the constraint preconditioner (Gould, Hribar & Nocedal, 2001) is used instead of the naive P(M(r)).

max_cg_iter: int¶

cg_tol: Float[Array, ''] | float¶

cg_regularization: float = 1e-06¶

use_constraint_preconditioner: bool = False¶

solve(hvp_fn, g, A, b, active_mask, precond_fn=None, free_mask=None, d_fixed=None, adaptive_tol=None)[source]¶

Solve the equality-constrained QP subproblem.

Solves:

minimize    (1/2) d^T B d + g^T d
subject to  A[active] d = b[active]
            d[i] = d_fixed[i]  for i where free_mask[i] is False

where B is given implicitly via hvp_fn(v) = B @ v.

Return type:

Parameters:

hvp_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']])
g (Float[Array, 'n'])
A (Float[Array, 'm n'])
b (Float[Array, 'm'])
active_mask (Bool[Array, 'm'])
precond_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']] | None)
free_mask (Bool[Array, 'n'] | None)
d_fixed (Float[Array, 'n'] | None)
adaptive_tol (Float[Array, ''] | float | None)

Args:

variables with free_mask[i] = True are optimized.

d_fixed: Values for fixed variables (n,). Required when: free_mask is provided.
adaptive_tol: Optional Eisenstat-Walker tolerance override.: When provided, overrides the solver’s default convergence tolerance for this call only.

Returns:

InnerSolveResult with the direction, multipliers, and convergence flag.

build_projection_context(hvp_fn, g, A, b, active_mask, precond_fn=None, free_mask=None, d_fixed=None)[source]¶

Build a reusable projector + multiplier-recovery context.

Return type:

Parameters:

hvp_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']])
g (Float[Array, 'n'])
A (Float[Array, 'm n'])
b (Float[Array, 'm'])
active_mask (Bool[Array, 'm'])
precond_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']] | None)
free_mask (Bool[Array, 'n'] | None)
d_fixed (Float[Array, 'n'] | None)

__init__(max_cg_iter, cg_tol, cg_regularization=1e-06, use_constraint_preconditioner=False)¶

Parameters:

max_cg_iter (int)
cg_tol (Float[Array, ''] | float)
cg_regularization (float)
use_constraint_preconditioner (bool)

Return type:

None

class slsqp_jax.ProjectedCGCraig[source]¶

Projected CG with CRAIG-based iterative null-space projection.

Replaces the Cholesky factorization of A A^T with iterative CRAIG solves (Golub-Kahan bidiagonalization). This eliminates the O(m^3) factorization cost and the 1e-8 diagonal regularization, at the cost of an iterative solve per projection.

For multiplier recovery (done once after the CG loop), CG on the normal equations A A^T y = rhs is used, reusing the existing solve_unconstrained_cg infrastructure.

max_cg_iter: int¶

cg_tol: Float[Array, ''] | float¶

cg_regularization: float = 1e-06¶

use_constraint_preconditioner: bool = False¶

craig_tol: float = 1e-10¶

craig_max_iter: int = 200¶

mult_recovery_tol: float = 1e-12¶

mult_recovery_max_iter: int = 200¶

solve(hvp_fn, g, A, b, active_mask, precond_fn=None, free_mask=None, d_fixed=None, adaptive_tol=None)[source]¶

Solve the equality-constrained QP subproblem.

Solves:

minimize    (1/2) d^T B d + g^T d
subject to  A[active] d = b[active]
            d[i] = d_fixed[i]  for i where free_mask[i] is False

where B is given implicitly via hvp_fn(v) = B @ v.

Return type:

Parameters:

hvp_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']])
g (Float[Array, 'n'])
A (Float[Array, 'm n'])
b (Float[Array, 'm'])
active_mask (Bool[Array, 'm'])
precond_fn (Callable[[Float[Array, 'n']], Float[Array, 'n']] | None)
free_mask (Bool[Array, 'n'] | None)
d_fixed (Float[Array, 'n'] | None)
adaptive_tol (Float[Array, ''] | float | None)

Args:

variables with free_mask[i] = True are optimized.

d_fixed: Values for fixed variables (n,). Required when: free_mask is provided.
adaptive_tol: Optional Eisenstat-Walker tolerance override.: When provided, overrides the solver’s default convergence tolerance for this call only.

Returns:

InnerSolveResult with the direction, multipliers, and convergence flag.

build_projection_context(hvp_fn, g, A, b, active_mask, precond_fn=None, free_mask=None, d_fixed=None)[source]¶

Build a reusable projector + multiplier-recovery context.

Return type: