Researchers at Israeli security firm Air Security published a proof-of-concept showing how an apparently harmless AI agent skill can be weaponized after the fact. They published a “clean” skill, let it build a track record of legitimate use and trust, then performed a classic “rug pull” — quietly swapping its behavior to malicious — and used it to gain control over 26,000 agents that had installed it.
The technique highlights a structural gap in how agent skills are currently trusted: approval and reputation are typically evaluated once, at install time, with no guarantee the skill’s behavior stays the same afterward.