HiTaB: Tighter AI Safety Certificates via

Q: Remainder scaling

Cubic (ε³) — The second-order Taylor remainder in HiTaB scales as ⅙ L∇²f ε³ — shrinking much faster than the quadratic ½ L∇f ε² of first-order methods as perturbation size decreases

Q: Verification hierarchy levels

3 — HiTaB unifies zeroth-, first-, and second-order bounds, always selecting the tightest certificate automatically

Q: Quadrotor demo tolerance

0.001 — Reachable sets for the drone navigation experiment were computed with a certified tolerance of 0.001

Q: Novel quantity bounded

L∇²f — The Lipschitz constant of the Hessian — quantifying how fast a network's curvature changes — had never before been bounded efficiently for deep networks

Q: Input perturbation models

2 — HiTaB supports both ℓ₂ (Euclidean ball) and ℓ∞ (box-shaped) perturbation sets, covering the two dominant models in AI robustness certification

Q: Layer composition rule

1st — Theorem 4.1 provides the first compositional algorithm for propagating Hessian Lipschitz bounds through the layers of a deep network

Key Facts

Cubic (ε³) Remainder scaling The second-order Taylor remainder in HiTaB scales as ⅙ L∇²f ε³ — shrinking much faster than the quadratic ½ L∇f ε² of first-order methods as perturbation size decreases

3 Verification hierarchy levels HiTaB unifies zeroth-, first-, and second-order bounds, always selecting the tightest certificate automatically

0.001 Quadrotor demo tolerance Reachable sets for the drone navigation experiment were computed with a certified tolerance of 0.001

L∇²f Novel quantity bounded The Lipschitz constant of the Hessian — quantifying how fast a network's curvature changes — had never before been bounded efficiently for deep networks

2 Input perturbation models HiTaB supports both ℓ₂ (Euclidean ball) and ℓ∞ (box-shaped) perturbation sets, covering the two dominant models in AI robustness certification

1st Layer composition rule Theorem 4.1 provides the first compositional algorithm for propagating Hessian Lipschitz bounds through the layers of a deep network

Imagine you are certifying that a drone controlled by a neural network will never hit an obstacle, no matter how its sensors flutter within their known error margins. You do not need to know the exact output of the network for every possible input — that would require solving an astronomically large search problem. You need something simpler and more powerful: a guaranteed upper bound on the worst the network could do. Get that bound tight enough, and you can sign off on the system. Leave it too loose, and you either reject a perfectly safe drone or, worse, accept a dangerous one under a false certificate.

This is the central problem of neural network reachability analysis: computing or bounding the set of outputs a network can produce over a given range of inputs. It sits at the foundation of AI safety for autonomous vehicles, medical diagnostics, and any other domain where a learning-enabled system must be certified before deployment. And it is, in general, computationally intractable to solve exactly — which is why the field has spent years building tractable overapproximations, bounds that are always safe (they never underestimate the danger) but are often far more conservative than they need to be.

A new framework called HiTaB (Hierarchical End-to-End Taylor Bounds), developed by Taha Entesari and Mahyar Fazlyab at Johns Hopkins University, takes a significant step toward closing that gap. The key insight: existing state-of-the-art methods for smooth networks exploit at most the network's second derivative — its curvature — but stop there. HiTaB goes one order higher, bounding how quickly that curvature itself changes. The result is a hierarchy of safety certificates that are provably tighter than anything currently available for smooth, differentiable neural networks (Entesari & Fazlyab, 2026).

The Science

The problem HiTaB addresses can be stated precisely. Given a neural network representing a function $f : R^{n} \to R$ and a ball of admissible inputs $B (x_{c}, ε)$ — all inputs within distance $ε$ of some nominal point $x_{c}$ — what is the largest output $f$ can produce? Formally:

$x \in B (x_{c}, ε) max f (x)$

Solving this exactly is nonconvex and NP-hard in general. The practical alternative is to find a majorizer $\overset{ˉ}{f}$ : a function that is always at least as large as $f$ , but whose maximum over the input ball can be computed cheaply. The tighter $\overset{ˉ}{f}$ is to $f$ , the more useful the resulting certificate.

Prior work has built these majorizers from Taylor expansions — the mathematical technique of approximating a function near a point using its value, slope, and curvature. A zeroth-order bound uses only the function value and a global Lipschitz constant (a measure of how fast the output can change at all). A first-order bound adds the local gradient. A second-order bound adds the Hessian — the matrix of second derivatives that captures local curvature. Each step tightens the approximation, but each also requires computing something harder about the network.

The bottleneck HiTaB solves is the third step. A second-order Taylor approximation of $f$ comes with a remainder term — the error between the approximation and reality — that scales with how quickly the Hessian itself changes. That rate of change is formalized as $L_{\nabla^{2} f}$ , the Lipschitz constant of the Hessian (informally: the maximum rate at which the network's curvature can shift as inputs move). If you can bound $L_{\nabla^{2} f}$ , you can certify that the remainder of your Taylor approximation never exceeds $\frac{1}{6} L_{\nabla^{2} f} ∥ δ ∥_{2}^{3}$ , where $δ$ is the size of the perturbation. That cubic scaling is the key: for small perturbations — precisely the regime that matters most in robustness certification — a cubic error term is much smaller than the quadratic error terms of first-order methods.

No previous tool had computed $L_{\nabla^{2} f}$ efficiently for deep networks. HiTaB provides the first practical algorithm to do so.

The work is grounded in a clean mathematical hierarchy. Three majorizers are derived, each indexed by the highest order of derivative information used:

$\overset{ˉ}{f}_{0}$ : uses only the function value and global Lipschitz constant $L_{f}$
$\overset{ˉ}{f}_{1}$ : adds the local gradient $\nabla f (x_{c})$ and gradient Lipschitz constant $L_{\nabla f}$
$\overset{ˉ}{f}_{2}$ : adds the local Hessian $\nabla^{2} f (x_{c})$ and Hessian Lipschitz constant $L_{\nabla^{2} f}$

The second-order majorizer takes the form:

$\overset{ˉ}{f}_{2} (x_{c}, δ) = f (x_{c}) + \nabla f (x_{c})^{⊤} δ + \frac{1}{2} δ^{⊤} \nabla^{2} f (x_{c}) δ + \frac{1}{6} L_{\nabla^{2} f} ∥ δ ∥_{2}^{3}$

The last term is the certified cubic remainder — the mathematical guarantee that the Taylor approximation never understates the true maximum by more than this amount. Because all three bounds can be computed once the required Lipschitz constants are in hand, the framework automatically selects the minimum of the three certificates at runtime, ensuring the tightest possible guarantee in every case.

What They Found

Hierarchy of Taylor Bound Orders: Remainder Scaling

How the error (remainder) term in each order of Taylor bound scales with perturbation size ε. Zeroth-order scales linearly, first-order scales quadratically (½ L∇f ε²), and second-order scales cubically (⅙ L∇²f ε³). At small ε, higher-order bounds are dramatically tighter.

Hierarchy of Taylor Bound Orders: Remainder Scaling
Label	Value
Zeroth-order (linear)	1 relative remainder at ε=1
First-order (quadratic)	0.5 relative remainder at ε=1
Second-order (cubic)	0.167 relative remainder at ε=1

The central theoretical result is a precise condition under which each higher-order bound beats the one below it. The second-order certificate $\overset{ˉ}{f}_{2}^{sb}$ is provably tighter than the first-order certificate $\overset{ˉ}{f}_{1}^{*}$ whenever:

$ε \leq \frac{3}{L _{\nabla^{2} f}} (L_{\nabla f} - (λ_{m a x} (\nabla^{2} f (x_{c})))_{+})$

Here $λ_{m a x} (\nabla^{2} f (x_{c}))$ is the largest eigenvalue of the Hessian at $x_{c}$ — a measure of the strongest local upward curvature — and $(a)_{+} = max (a, 0)$ . The threshold is always nonnegative, because $L_{\nabla f} \geq sup_{x} λ_{m a x} (\nabla^{2} f (x))$ by definition. This means the second-order bound never hurts: there always exists a range of perturbation sizes for which it wins, and when it doesn't, the framework falls back automatically to the first-order result (Entesari & Fazlyab, 2026).

An analogous condition characterizes when the first-order bound beats the zeroth-order bound:

$ε \leq \frac{2 ( L _{f} - ∥\nabla f ( x _{c} ) ∥ _{2} )}{L _{\nabla f}}$

The improvement is greatest when the local gradient is small relative to the global Lipschitz constant — when the function is flatter locally than it is globally. This is exactly what happens at saddle points, near local optima, or in well-trained networks where most inputs land in stable regions of the output space.

Remainder Advantage of Second-Order Bound at Small Perturbations

Ratio of first-order quadratic remainder to second-order cubic remainder at different perturbation sizes ε, illustrating how much tighter the second-order bound becomes as ε decreases.

Remainder Advantage of Second-Order Bound at Small Perturbations
Label	Value
ε = 1.0	3 ratio
ε = 0.5	6 ratio
ε = 0.25	12 ratio
ε = 0.1	30 ratio
ε = 0.01	300 ratio

The technical core of the paper is a layerwise compositional algorithm for bounding $L_{\nabla^{2} f}$ in feedforward networks. For a network with layers indexed $1$ through $L$ , the algorithm propagates curvature bounds forward through each layer. At layer $I$ , the Hessian Lipschitz constant of the $j$-th output neuron satisfies the bound:

$L_{\nabla^{2} a_{j}^{I}} \leq L_{\nabla^{2} F_{j}^{I}} L_{a^{I - 1}}^{3} + 2 L_{D a^{I - 1}} L_{a^{I - 1}} L_{\nabla F_{j}^{I}} + i = 1 \sum N_{I - 1} (L_{\partial_{i} F_{j}^{I}} L_{a^{I - 1}} L_{D a_{i}^{I - 1}} + L_{\nabla^{2} a_{i}^{I - 1}} x sup ∣ \partial_{i} F_{j}^{I} (x) ∣)$

Each quantity in this expression has a concrete meaning: $L_{a^{I - 1}}$ is the Lipschitz constant of the previous layer's output (its maximum rate of change), $L_{D a^{I - 1}}$ is the Lipschitz constant of its Jacobian (how fast the layer's slope changes), and $L_{\nabla^{2} F_{j}^{I}}$ depends only on the activation function and the weights of the current layer. All of these can be computed with existing certified tools — the new contribution is the composition rule that chains them into a network-level bound on $L_{\nabla^{2} f}$ .

For smooth activations like sigmoid or tanh, the algorithm exploits the fact that $σ^{''}$ (the second derivative of the activation) is itself Lipschitz with constant $L_{σ^{''}}$ . The bound on $L_{\nabla^{2} F_{j}^{I}}$ then becomes simply $L_{σ^{''}} ∥ W_{j, :}^{I} ∥_{2}^{3}$ — a product of the activation's smoothness and the cube of the weight row's norm. Crucially, these are quantities engineers already know for any trained network.

The framework applies to both $\ell_2$-bounded perturbations (Euclidean balls, relevant for adversarial robustness) and $\ell_\infty$-bounded perturbations (box constraints, which are the standard model in image classifier certification). For $\ell_\infty$ inputs, the bounds are reformulated using norm equivalences and matrix operator norms, preserving the same hierarchical structure.

Why This Changes Things

Figure 1: The quadrotor problem setup. The point clouds show trajectory samples from the system via numerical simulation. The obstacles are shown as two spheres. The reachable sets are calculated with a tolerance of 0.0010.001. Source: Taha Entesari, Mahyar Fazlyab

To see why this matters in practice, consider the quadrotor control problem demonstrated in the paper. A drone navigating through space is governed by nonlinear dynamics, and a neural network controller produces thrust commands from sensor readings. Certifying that the drone will not enter a collision zone requires bounding the reachable set of the closed-loop system — essentially answering the reachability question repeatedly, at each time step, for each possible state. Even a small tightening of the per-step bound compounds across time horizons: a looser bound at step one inflates the reachable set fed into step two, and so on. The drone's "certified safe zone" shrinks with every conservative approximation that compounds through the trajectory.

HiTaB's tighter per-step certificates directly translate to less conservative reachable sets over the full horizon. The paper demonstrates the framework on a quadrotor navigating around two spherical obstacles, computing trajectory reachable sets with a tolerance of $0.001$ (Entesari & Fazlyab, 2026). The key advantage is that the second-order bound's cubic remainder $\frac{1}{6} L_{\nabla^{2} f} ε^{3}$ shrinks much faster than the quadratic remainders of first-order methods as $ε$ decreases — and in branch-and-bound verification, where the input domain is recursively subdivided into smaller and smaller subregions, this faster decay is exactly what accelerates convergence.

The broader significance is that smooth networks — networks using sigmoid, tanh, SiLU, GELU, or other differentiable activations — are increasingly common in safety-critical applications. GELU activations dominate modern transformers; smooth activations are preferred in physics-informed networks and neural ordinary differential equations. Yet the verification literature has focused overwhelmingly on ReLU networks, which are piecewise linear and admit mixed-integer linear programming formulations. HiTaB opens a principled route to tighter verification for the smooth network architectures that real systems increasingly rely on.

The framework also has a structural elegance that matters for trust. The hierarchy is not a heuristic — it comes with provable monotonicity conditions that tell practitioners exactly when each higher-order bound will win. This is not "try the second-order method and see"; it is "the second-order method is guaranteed to be better whenever $ε$ is smaller than this computable threshold." That kind of explicitness is rare and valuable in safety engineering, where a practitioner needs to justify every design choice.

Information Exploited by Each Verification Tier

A qualitative comparison of what local network information each order of bound in the HiTaB hierarchy uses, illustrating the progressive enrichment from zeroth to second order.

Information Exploited by Each Verification Tier
Label	Value
Function value f(xc)	1
Global Lipschitz Lf	1
Local gradient ∇f	0
Gradient Lipschitz L∇f	0
Hessian ∇²f	0
Hessian Lipschitz L∇²f	0

What's Next

The most immediate extension flagged by the researchers is integration into complete branch-and-bound verification pipelines. Modern verifiers like $\alpha$-CROWN and BaB-based frameworks subdivide the input space recursively, solving a reachability problem at each node. HiTaB's bounds slot naturally into this paradigm: tighter bounds at each node mean the verifier can prune branches faster, reducing the total computation needed to certify or refute a property. The paper demonstrates the conceptual integration but leaves full-scale empirical comparison to future work.

There are real caveats. The layerwise bound on $L_{\nabla^{2} f}$ is an overapproximation — it is guaranteed never to underestimate the true Hessian Lipschitz constant, but it may overestimate it, especially in deep networks where multiplicative compounding across layers introduces conservatism. This is the same fundamental tension that afflicts all propagation-based Lipschitz estimates: the bound is provably safe, but it may be loose. Future work could sharpen the layer-level estimates using semidefinite programming relaxations (analogous to LipSDP for first-order constants) or by exploiting correlations between layers that the current elementwise bounds ignore.

The framework currently handles scalar-valued outputs — a single neuron or a single logit. Extensions to vector-valued networks, relevant for multi-output controllers or multi-class classifiers, would require bounding the Hessian Lipschitz constant of the full Jacobian, a technically richer problem. The paper notes this as an open direction.

There is also an interesting theoretical question lurking here: the hierarchy stops at second-order information, but the same compositional logic could in principle be extended to third-order bounds, using the Lipschitz constant of the third derivative to control a quartic remainder. Whether the additional complexity pays off — whether networks in practice have third derivatives that are meaningfully bounded by computable quantities — is an open empirical and theoretical question.

What HiTaB establishes, for the first time, is that the mathematics of neural network verification does not have to stop at curvature. The compositional structure of deep networks, which makes them hard to analyze globally, is also what makes them amenable to hierarchical smoothness analysis — each layer's contribution to higher-order behavior can be bounded independently and composed. That insight, once formalized, turns a seemingly intractable object (the Hessian Lipschitz constant of a 50-layer network) into something computable in a forward pass.

The dream of certifiably safe AI — not just probably safe, not just empirically robust, but provably safe in the mathematical sense — is still distant for large-scale systems. But it is built from exactly these kinds of incremental, principled advances: tighter bounds, sharper conditions, cleaner hierarchies. HiTaB adds a new rung to that ladder, and in safety-critical engineering, every rung counts.

Teaching Machines to Doubt Themselves: A New Framework That Makes AI Safety Certificates Tighter

The Science

What They Found

Why This Changes Things

What's Next

Source articles

Comments (0)