GitInject: Real-World Prompt Injection Attacks in AI-Powered CI/CD Pipelines

AIPR assessment

The problem is hard and active, with a crowded and fast-moving security literature around prompt injection, AI agents, and CI/CD automation. The strongest parts of the paper reinforce each other: real execution on live ephemeral repositories, concrete attack reproductions, and workflow-level mitigations make the results credible and useful. The weaknesses also compound in a mild way: the study is bounded to selected providers and workflows, and some broad structural claims lean on extrapolation

Abstract

AI-powered agents are increasingly embedded in continuous integration and continuous delivery/deployment (CI/CD) pipelines to autonomously review pull requests (PRs), triage issues, and maintain codebases. These agents ingest untrusted content while operating with elevated repository permissions, making them a natural target for prompt injection attacks with supply chain consequences. We present GitInject, an open-source framework for evaluating prompt injection vulnerabilities in real, live GitHub workflows, a widely deployed instance of CI/CD pipelines. Unlike prior agent security benchmarks that simulate tool calls, GitInject provisions ephemeral repositories and triggers actual workflow runs, so that sandbox constraints, credential handling, and permission boundaries behave exactly as in production. Using GitInject, we study workflow configurations across four AI providers and document eleven named attacks spanning config-file injection, credential exfiltration, judgment manipulation, and availability. We find that all tested providers are susceptible to at least one attack class in their default configuration, and that the most critical vulnerabilities are structural: they arise from how CI/CD infrastructure handles credentials and configuration files, not from any specific model's behavior. For each confirmed attack class, we identify the minimum-cost workflow-level countermeasure and analyze its coverage and limitations. GitInject is released publicly to facilitate further research in this direction.

Score Breakdown

Holistic Impression
80
Novelty
78
Rigor
84
Applicability
80
Clarity
83
Citation
79
Confidence: 85%

More from this week

More in AI