Data security & privacy review

Is system_prompts_leaks safe?

Agent  asgeirtj/system_prompts_leaks

aiai-agentsanthropicawesomechatbotchatgptclaudeclaude-code

What is system_prompts_leaks?

Extracted system prompts from Anthropic - Claude Fable 5, Opus 4.8, Claude Code, Claude Design. OpenAI - ChatGPT 5.5 Thinking, GPT 5.5 Instant, Codex. Google - Gemini 3.5 Flash, 3.1 Pro, Antigravity. xAI - Grok, Cursor, Copilot, VS Code, Perplexity, and more. Updated regularly.

Type: Agent License: CC0-1.0 Source: repository ↗

Data-security signals

Public, checkable facts about system_prompts_leaks — they show the risk surface, not what it does with your data at runtime.

  • Open-source — the CC0-1.0-licensed code is publicly auditable on its public repository.
  • High access surface — as an AI agent, it can run with your keys, files, environment and network.
  • Maintenance — actively published.
  • ?
    Independent exfiltration test — not yet independently tested by Oxavion.

Is system_prompts_leaks safe? The honest answer.

The signals above show what system_prompts_leaks can reach. But no public metadata reveals what it actually does with your data once it runs — that only shows up when you watch it in a sandbox. Oxavion runs system_prompts_leaks with planted canary secrets and watches every outbound channel, then emails you the evidence.

✓ Request received — we'll run the scan and email your report shortly.

We scan system_prompts_leaks in our sandbox and email your report. No install, no access to your systems.

How to tell if system_prompts_leaks is safe

Before you trust any AI tool with your environment, check:

  1. Is the source auditable? Yes — open-source, you can read it.
  2. Does it need your keys or credentials? Most agents do — so it holds them at runtime.
  3. Does it make outbound network calls, and where to? The repo hints at this; only a run confirms it.
  4. Has it been tested for data exfiltration? Not yet — this is the one you cannot verify from the outside.

The first three you can check from the repo yourself. The last — what it does with your data at runtime — needs a test. That is exactly what an Oxavion scan does →

Frequently asked

Is system_prompts_leaks safe to use?
It depends on what it does with your data at runtime — something a static look can't settle. Oxavion answers it empirically: we sandbox system_prompts_leaks, feed it canary secrets and data, and report exactly what (if anything) leaves. Request a free scan for a verdict on the version you run.
How does Oxavion test it?
An isolated gVisor micro-VM, a transparent egress gateway that captures HTTP/S, DNS and raw TCP, planted canary secrets/PII, and encoding-aware detection — aligned to OWASP LLM Top 10 and MITRE ATLAS, calibrated to zero false-negatives / zero false-positives.

Related Agents