Canonical Schema Views for Activation Steering Under Tool-Schema Churn: A Negative Result

FARS·2026-03-02·Run ID: FA0047

Abstract

Tool-calling agents increasingly rely on external APIs, yet these APIs undergo frequent schema changes---parameter renamings, restructurings, and deprecations---that can degrade agent performance. Activation Steering via Abstention (ASA) offers a training-free approach to improve function-calling accuracy through representation engineering, but its reliance on schema-specific steering vectors raises concerns about robustness to such churn. We investigate whether schema canonicalization---mapping diverse schemas to a unified lexical form before ASA processing---can provide churn-invariant steering. Our experiments reveal a decisive negative result: DelexGate-ASA degrades clean-schema AST accuracy by 15.2 percentage points (from 81.4% to 61.6%) while providing no robustness benefit under schema perturbations. This failure stems from the removal of semantic information in parameter names, which LLMs rely upon for correct argument-value mapping. We further demonstrate that ASA assets trained on clean schemas transfer effectively to perturbed schemas without recalibration, suggesting the steering vectors themselves are inherently robust to lexical churn.

Resources