HYDRA: Proactive Android Malware Drift Adaptation via Hierarchical Graph Contrastive Learning

May 20, 2026ยท
Han Chen
Hanchen Wang
Hanchen Wang
,
Hongmei Chen
,
Lu Qin
,
Wenjie Zhang
,
Ying Zhang
ยท 0 min read
Abstract
Concept drift, driven by the rapid evolution of Android malware, severely degrades the performance of machine learning detectors. Current adaptation strategies are often reactive, responding only after performance has dropped and imposing a significant manual annotation burden, or they are proactive but rely on unstable adversarial training and incomplete, single-level graph representations. To overcome these limitations, we propose HYDRA (Hybrid Drift Adaptation), a proactive adaptation framework that learns drift-invariant representations from hierarchically structured data. HYDRA first models applications using a hybrid graph structure, combining fine-grained Control Flow Graphs (CFGs) and coarse-grained Function Call Graphs (FCGs) to capture comprehensive behavioral patterns. It then introduces a novel cross-domain contrastive learning objective that aligns historical (source) and new (target) data distributions. By generating pseudo-labels for unlabeled target samples, our method pulls representations of semantically similar applications together, regardless of their domain, within a single, stable optimization process. This approach unifies feature learning and domain alignment, eliminating the need for complex adversarial objectives. Extensive experiments on large-scale, time-ordered malware datasets demonstrate that HYDRA achieves state-of-the-art performance with an F1-score of up to 96.9%, while reducing labeling effort by up to 87.5% compared to the best-performing baseline. Our work thus offers a robust and efficient solution to combat concept drift in security applications.
Type
Publication
ACM Conference on Computer and Communications Security (CCS) 2026