scanning-for-hardcoded-secrets

Scan a source-code tree for hardcoded credentials embedded in source files: AWS access keys, GitHub tokens, Stripe keys, Slack tokens, Anthropic API keys, OpenAI keys, JWT signing secrets, generic base64-encoded passwords, RSA / SSH private keys, and high-entropy string literals that pattern-match common credential shapes. Use when: pre-commit gate before pushing a feature branch, audit before SOC2, post-incident scan after a leak, or inheriting a codebase you didn't write. Threshold: any source file contains a string that matches a canonical credential regex (AWS AKIA prefix, GitHub ghp_ prefix, etc.) OR a string with Shannon entropy above 4.5 in a field context (key=, token:, secret=). Trigger with: "scan secrets", "credential scan", "find hardcoded keys", "leak check".

v3.0.0-dev

Jeremy Longshore

MIT

Allowed Tools

ReadBash(python3:*)GlobGrep

Provided by Plugin

penetration-tester

25-skill pentest pack with engagement governance, network/code/dependency scans, OWASP Top 10 mapping, and exec-readable reporting. Heavy-hitter compliant; chain-of-custody attestable.

security v3.0.0

View Plugin

Installation

This skill is included in the penetration-tester plugin:

/plugin install penetration-tester@claude-code-plugins-plus

Click to copy

Instructions

Scanning for Hardcoded Secrets

Overview

The single most common cause of credential breach in 2026 remains

hardcoded secrets in source code. Engineers paste an API key into a

config file "just for testing," forget to remove it, commit the

file. The credential is now in the repository's history forever

(git rebase doesn't help if anyone cloned in between) and

extractable by anyone who reaches the repo: contractors,

ex-employees, attackers via .git/ directory exposure (see skill

six), GitHub bot scrapers crawling public repos.

The cost of detection-after-commit is near-zero (free tools exist:

gitleaks, trufflehog, this skill). The cost of detection-before-commit

is also near-zero (pre-commit hooks). The cost of remediation after

the fact is rotating every credential exposed + auditing for

exploitation + potentially notifying customers of breach. The

asymmetry is severe, the discipline is the only constraint.

This skill scans a filesystem tree, matching against a canonical

regex library covering the credential shapes attackers and bots

search for first.

When the skill produces findings

Finding	Severity	Threshold	Affected control
AWS access key (AKIA-prefix)	CRITICAL	Literal `AKIA[0-9A-Z]{16}` in any file	CWE-798
AWS secret access key	CRITICAL	40-char base64 in `awssecretaccess_key` field context	CWE-798
GitHub personal access token	CRITICAL	`ghp[A-Za-z0-9]{36}` or `gho`, `ghu`, `ghs`, `ghr_`	CWE-798
GitHub app installation token	CRITICAL	`ghs_[A-Za-z0-9]{36}`	CWE-798
Stripe live key	CRITICAL	`sklive[A-Za-z0-9]{24,}`	CWE-798
Stripe test key	MEDIUM	`sktest[A-Za-z0-9]{24,}`	CWE-798
Anthropic API key	CRITICAL	`sk-ant-api03-[A-Za-z0-9_-]{93}` or similar	CWE-798
OpenAI API key	CRITICAL	`sk-(proj-)?[A-Za-z0-9_-]{40,}`	CWE-798
Slack bot token	CRITICAL	`xoxb-[A-Za-z0-9-]+`	CWE-798
Slack user token	CRITICAL	`xoxp-[A-Za-z0-9-]+`	CWE-798
Google API key	HIGH	`AIza[A-Za-z0-9_-]{35}`	CWE-798
RSA / OpenSSH private key	CRITICAL	BEGIN PRIVATE KEY header (RSA, OPENSSH, EC, DSA variants)	CWE-321
JWT secret	HIGH	Long string in `jwtsecret`, `JWT``SECRET`, `signing_secret` field	CWE-321
Generic password literal	HIGH	`password = "..."` with non-placeholder value	CWE-798
High-entropy string in key/token field	MEDIUM	Shannon entropy ≥ 4.5 in `key:`/`token:` field context	CWE-798
`.env`-shaped KEY=VALUE in non-`.env` file	HIGH	Multiple `[A-Z_]+=` lines in `.py`/`.js`/`.md` files	CWE-200

Prerequisites

Python 3.9+
Target source-code tree on local filesystem

Instructions

Step 1 — Identify the scan target

This skill scans a filesystem path. No authorization gate (it

operates on local source code, not network targets).

Step 2 — Run the scanner


python3 ${CLAUDE_PLUGIN_ROOT}/skills/scanning-for-hardcoded-secrets/scripts/scan_secrets.py /path/to/repo

Options:


Usage: scan_secrets.py PATH [OPTIONS]

Options:
  --output FILE      Write findings to FILE (default: stdout)
  --format FMT       json | jsonl | markdown (default: markdown)
  --min-severity SEV (default: info)
  --include-tests    Include files under tests/, test/, __tests__/, spec/
                     (default: excluded to reduce false positives)
  --git-history N    Also scan the last N git commits' diffs (default: 0
                     = working tree only)
  --exclude GLOB     Skip files matching glob (repeatable)
  --entropy-only     Only flag entropy-based findings (skip regex)

The scanner walks the tree, applies the regex library to every

file's contents, and emits a Finding per match with file path, line

number, severity, and the redacted matched text.

Step 3 — Interpret findings

CRITICAL = the matched string is a real credential shape that

upstream tools auto-extract. Rotate the credential immediately.

Audit logs for any API call against that credential since the

commit landed.

HIGH = pattern strongly suggests credential but requires manual

verification (the literal might be a placeholder or test fixture).

MEDIUM / LOW = entropy-based heuristic that needs human review.

Step 4 — Remediation

For any confirmed real credential:

Rotate immediately. Don't wait to refactor; the leak window

is between when the commit landed and when you rotate.

Audit usage. Check provider's API logs for any unfamiliar

calls against that credential since the leak commit timestamp.

Remove from source. Move to environment variables, secrets

manager, or a runtime-provisioned secret. See

references/PLAYBOOK.md for per-language patterns.

Scrub history if reasonable. git filter-repo or `BFG

Repo-Cleaner` can purge the secret from history, but only if you

can force-push and coordinate with every clone-holder. For

public repos, history-scrub is often not worth the disruption

compared to just rotating.

Examples

Example 1 — Pre-commit gate


# .git/hooks/pre-commit (or via pre-commit framework)
python3 plugins/security/penetration-tester/skills/scanning-for-hardcoded-secrets/scripts/scan_secrets.py \
    --min-severity high --format json . | jq -e 'length == 0' \
    || { echo "Secrets detected. Fix before commit."; exit 1; }

Example 2 — CI scan on every push


- name: Hardcoded-secrets scan
  run: |
    python3 plugins/security/penetration-tester/skills/scanning-for-hardcoded-secrets/scripts/scan_secrets.py \
        . --min-severity high --format json --output secrets-scan.json
- run: |
    if jq 'length > 0' secrets-scan.json | grep -q true; then
      echo "::error::Hardcoded secret detected"
      exit 1
    fi

Example 3 — Audit inherited codebase


python3 ${CLAUDE_PLUGIN_ROOT}/skills/scanning-for-hardcoded-secrets/scripts/scan_secrets.py \
    /path/to/acquired-repo --include-tests --min-severity medium

--include-tests is important here because legacy test fixtures

often contain real credentials someone forgot to redact.

Output

JSON / JSONL / Markdown per lib/report.py. Exit codes: 0 clean, 1

high/critical, 2 error.

Matched strings are partially redacted in output (first 4 + last 4

chars visible, middle redacted) to avoid the scanner output itself

becoming a leak surface.

Error Handling

False positive on placeholder strings like KEYHERE> →

the scanner skips strings containing <, >, EXAMPLE,

PLACEHOLDER, YOUR_, XXXX (configurable).

Binary file in tree → skipped (the scanner reads only text

files by content-type sniffing).

Large file → files >5 MB are skipped (avoids scanning compiled

artifacts and lockfiles).

Resources

references/THEORY.md — Per-credential-family threat model, why

each provider's keys are extracted by bots first, history-scrub

decision framework, entropy-detection theory

references/PLAYBOOK.md — Per-language migration patterns

(Python dotenv, Node .env+dotenv, Ruby Rails credentials, Go

envconfig), provider rotation procedures, GitHub secret-scanning

integration

Allowed Tools

Provided by Plugin

penetration-tester

Installation

Instructions

Scanning for Hardcoded Secrets

Overview

When the skill produces findings

Prerequisites

Instructions

Step 1 — Identify the scan target

Step 2 — Run the scanner

Step 3 — Interpret findings

Step 4 — Remediation

Examples

Example 1 — Pre-commit gate

Example 2 — CI scan on every push

Example 3 — Audit inherited codebase

Output

Error Handling

Resources

Ready to use penetration-tester?

Related Skills

agent-safety-preflight

analyzing-dependencies

analyzing-security-headers

analyzing-tls-config

assisting-with-soc2-audit-preparation

auditing-access-control