Cuong Dang, Jiahao Zhang, Hieu Ta Quang, Dung Le, Lu Cheng, Suhang Wang
While confidence calibration is essential for trustworthy decision-making in safety-critical applications, the robustness of calibrated GNNs to adversarial structural perturbations remains largely unexplored. However, studying calibration attacks on graphs presents unique technical challenges: (1) the discrete nature of graph structures complicates gradient-based optimization, (2) existing underconfidence objectives fail to drive predictions toward uniform distributions, and (3) GNNs are highly sensitive to edge perturbations, often causing unintended label changes that violate attack constraints. To address these challenges, we propose a \textbf{Unified Graph Calibration Attack (UGCA)} framework designed for \textbf{worst-case (white-box) analysis} of GNN calibration robustness. UGCA introduces a KL-divergence loss to encourage uniform predictive distributions, a reranking mechanism to reduce label flipping, a hybrid loss to recover labels when violations occur, and beam search to explore a broader adversarial search space. We further provide theoretical insights linking model generalization, dataset complexity, and calibration vulnerability, showing that models with higher accuracy or trained on datasets with more classes are more susceptible under this threat model. Extensive experiments demonstrate that UGCA substantially increases Expected Calibration Error while preserving classification accuracy. Our code is publicly available at https://github.com/CaptainCuong/Graph-Calibration-Attack.git.
This paper introduces UGCA (Unified Graph Calibration Attack), the first dedicated framework for adversarial calibration attacks on Graph Neural Networks. The core problem is compelling: can an adversary manipulate a GNN's confidence scores (making them overconfident or underconfident) without changing predicted labels, thereby silently undermining trustworthiness while the model appears healthy on standard accuracy metrics?
The framework addresses this through four technical components: (1) a KL-divergence loss that pushes predictions toward uniform distributions (underconfidence attack), (2) a reranking mechanism derived from first-order approximations to avoid label-flipping edges, (3) a hybrid loss that switches between attack and label-recovery modes, and (4) beam search to escape local optima in the discrete edge perturbation space. The paper also provides theoretical results connecting model accuracy, number of classes, and calibration vulnerability (Theorem 4.1 and Corollary 4.3).
Direct impact: This work opens a new attack surface for GNN security research. The insight that calibration can be independently attacked without affecting accuracy is practically important for safety-critical deployments (fraud detection, medical diagnosis). It forces the community to consider calibration robustness as a distinct security property.
Defensive implications: The finding that graph-aware calibration methods (DCGC) are more robust than classical post-hoc methods (TS, VS) provides actionable guidance for practitioners. Figure 4's analysis of calibration set size effects offers practical deployment recommendations.
Broader relevance: The work bridges adversarial ML, GNN calibration, and trustworthy AI. The framework could inspire similar analyses for other graph-based tasks (link prediction, graph classification) and other uncertainty quantification methods beyond calibration.
Limitations on impact: The attack requires white-box access including gradients, which is a strong assumption. The practical threat model (fraudster manipulating edges) is plausible but narrow. The ECE increases, while statistically significant, are sometimes modest (e.g., OGBN-Arxiv improvements are small in absolute terms), raising questions about real-world severity.
The paper addresses a timely gap at the intersection of two active research areas: GNN calibration and adversarial robustness. With increasing deployment of GNNs in safety-critical systems and growing emphasis on trustworthy AI, understanding calibration vulnerabilities is directly relevant. The fact that no prior work has studied calibration attacks for graphs makes this a genuine first-mover contribution. The KDD '26 venue is appropriate for the application-oriented nature of the work.
Generated Jun 9, 2026
Paper 2 addresses a novel and underexplored security vulnerability in GNN calibration, combining adversarial robustness with calibration—two critical topics in trustworthy AI. It provides theoretical insights linking generalization and calibration vulnerability, introduces a comprehensive framework (UGCA) with multiple technical innovations, and has broader implications for safety-critical AI deployment. Paper 1, while practically useful for ECG efficiency, is more incremental—combining existing techniques (Grad-CAM, progressive data dropout) in a narrower clinical domain. Paper 2's findings about fundamental model vulnerabilities have wider cross-domain relevance.
Paper 1 has higher potential impact: it introduces a novel, broadly applicable adversarial framework (UGCA) for a timely trustworthiness problem (calibration robustness) in GNNs, with clear methodological contributions (new objective, search strategy, constraint-handling) plus theory linking accuracy/dataset complexity to vulnerability, and extensive evaluation with released code. Its implications span security, safety-critical ML, and graph learning across many domains. Paper 2 is promising but is demonstrated on a single minimal testbed with modest improvements and narrower immediate breadth, so its near-term impact is likely more limited.
Paper 2 demonstrates higher scientific impact by addressing a critical vulnerability in trustworthy AI: calibration robustness in Graph Neural Networks. It provides strong theoretical insights linking model generalization to vulnerability and offers a novel attack framework with broad implications for safety-critical systems. Furthermore, it contributes public code to the community. While Paper 1 offers a valuable optimization framework for e-commerce marketing, its scientific breadth and theoretical contributions are narrower compared to the fundamental machine learning security and reliability focus of Paper 2.
Paper 2 addresses a novel and important problem at the intersection of adversarial robustness and calibration for GNNs, introducing a new attack framework (UGCA) with theoretical insights and practical implications for safety-critical applications. It opens a new research direction with broader impact across trustworthy AI, adversarial ML, and graph learning. Paper 1, while practically useful, is primarily an empirical study of architectural depth for a specific model (STGCN) in traffic prediction, offering incremental insights about over-parameterization without introducing new methods or broader theoretical contributions.
Paper 2 addresses the broadly important problem of differentially private synthetic data generation for tabular data, which has wide real-world applications across healthcare, finance, and government. It extends the Private Evolution framework to a new domain with practical improvements (28x faster, 10% accuracy gain), addressing a gap in handling high-order correlations. Paper 1, while technically sound, targets a niche problem (calibration attacks on GNNs) with a narrower audience. Paper 2's combination of privacy guarantees, practical scalability, and broad applicability across sensitive data domains gives it higher potential impact.
Paper 2 addresses a fundamental and broadly applicable issue in machine learning: the vulnerability of Graph Neural Networks to calibration attacks. Its theoretical insights into model generalization and dataset complexity, combined with a novel attack framework, offer significant implications for trustworthy AI across multiple domains. In contrast, while Paper 1 presents a rigorous benchmark for OLED molecular generation, its impact is more narrowly focused on computational chemistry and materials science.
Paper 1 likely has higher impact: it targets a timely, fast-growing area (trustworthiness and adversarial robustness of GNNs) with clear security/safety implications and broad relevance across ML, graph mining, and risk-sensitive deployments. The UGCA framework addresses concrete technical barriers (discrete structure, objective mismatch, label-flip constraints) and offers both empirical validation and theoretical insights, plus open-source code—supporting adoption and follow-on work. Paper 2 is a solid methodological contribution to NMF tri-factorization with heuristic solvers, but its novelty and breadth appear narrower and more incremental.
Paper 1 represents a foundational breakthrough in physics-informed machine learning by exactly embedding complex thermodynamic structures into neural operators. This advancement has broad, interdisciplinary impact across physics, engineering, and applied mathematics for solving PDEs. Paper 2, while important for GNN security and trustworthiness, addresses a more specialized vulnerability within adversarial machine learning, making its overall scientific footprint narrower compared to the fundamental theoretical and practical contributions of Paper 1.
Paper 2 addresses a critical real-world cybersecurity problem in power systems with a principled, physically-grounded solution. Its contribution—preserving geometric correspondence between physical and measurement-derived null spaces—is theoretically rigorous with formal proofs and demonstrates clear practical superiority over multiple baselines on standard IEEE benchmarks. The method has immediate applicability to critical infrastructure protection. Paper 1, while technically sound in exploring GNN calibration robustness, addresses a more niche problem with narrower real-world applicability and incremental contributions to the adversarial ML literature.
Paper 2 is more scientifically novel and timely: it introduces a new adversarial threat model (calibration attacks) tailored to graph structures and provides a unified attack framework with concrete technical innovations (KL objective, reranking, hybrid loss, beam search) plus theoretical links between calibration vulnerability, accuracy, and dataset complexity. Its implications span trustworthy ML, security, and GNN deployment in safety-critical settings, with rigorous evaluation and public code. Paper 1 is application-oriented and useful for accessibility, but appears more like an integration of existing AutoML/decision-support ideas with less methodological novelty.