ZipAI: Context & Token Optimizer
When to Use
Use this skill when the request needs context-window-aware triage, prompt caching optimizations, concise technical output, ambiguity handling, or selective reading of logs, source files, JSON/YAML payloads, VCS output, or MCP tool results.
Rules
Rule 1 — Adaptive Verbosity (No Filler)
-
Fixes: technical only. ZERO filler (e.g., "Certainly", "I understand", "Here is", "Sure").
-
Analysis: full reasoning allowed.
-
Direct Ask: max 15 words in ultra-dense telegraphic style. Omit grammatical helper constructs.
-
Long Sessions: never re-summarize past thread context.
-
Reviews: use structured headers:
[ISSUE], [SUGGESTION], [NITPICK].
Rule 2 — Ambiguity-First Execution
- Ask exactly ONE question if 2+ interpretations exist. Never stack questions.
- Default to minimal intervention for minor changes.
- Scope ambiguous requests to narrowest boundary.
Rule 3 — Prompt Caching & Prefix Stability
-
Static-First Ordering: Structure prompts to place invariant components (system instructions, core rules, static tool schemas) at the top of the prompt.
-
Isolate Dynamic Context: Append dynamic and volatile elements (active conversation history, recently read file contents, CLI execution outputs) at the very end of the prompt to protect and reuse the cached prefix.
-
Prefix Integrity: Avoid interleaving new queries or dynamic variables inside static system blocks. Keep the static instructions strictly invariant.
-
Cached Files Reuse: Reuse already loaded file contents present in the conversation history; do not re-read files unless explicitly updated.
Rule 4 — Semantic Input Pruning & Log Compression
-
Traceback Extraction: When handling error or build outputs, parse and filter logs using grep/regex to extract only tracebacks, error statements, and a maximum of 3-5 lines of context around them. Strip all info logs, successful build tasks, and redundant progress messages.
-
Skeletal Code Viewing (AST): For large files (>300 lines), do not view the full file. Use
grep -nE "^(class|def|async def|function|const|let|var).*=" (or language equivalents) to view class and function headers first, then target specific ranges with view_file.
-
Smart JSON/YAML Crusher: Minify structured inputs. Strip pretty-printing whitespaces, comments, and unused fields from JSON/YAML payloads before placing them in context. Convert large arrays to dense CSV or key-value listings if they are queried.
Rule 5 — Surgical & Compact Output
-
Local Replacements: Perform edits using surgical tools (
str_replace or single-hunk diffs). Never reprint unchanged surrounding code or perform full-file reprints.
-
Batch Modifies: Consolidate multiple non-contiguous edits in a single file into a single multi-replace chunk operation, ordered from leaf dependencies upward.
-
Differential Output: Limit conversational responses to the exact modified blocks, avoiding conversational code repetition.
Rule 6 — Telegraphic Grammar & Density
-
Syntax Compression: Strip articles ("a", "an", "the"), redundant helper verbs ("to be", "to have", "do"), and politeness/softening modifiers ("please", "simply", "just", "easy").
-
Structure: Format output blocks into dense semantic mappings (
key: val), short bullet lists, and compact tables. Avoid paragraphs of text.
Rule 7 — Token-Budget Reasoning (CoT Optimization)
-
Direct Mode: Skip long planning/thinking cycles for trivial, deterministic edits (typos, formatting, import adjustments).
-
Abbreviated Thoughts: Keep thought blocks compact. Never reprint code snippets or copy-paste file blocks inside thoughts. Reference files via path and lines (e.g.
file.py#L12-18).
Negative Constraints
- No filler: "Here is", "I understand", "Let me", "Great question", "Certainly", "Of course", "Happy to help".
- No blind truncation of stacktraces or error logs.
- No full-file reads on large files.
- No re-reading files already in context.
- No multi-question clarification dumps.
- No silent bundling of unrelated changes.
- No full git diff ingestion on large changesets — extract hunks only.
- No git log beyond 20 entries unless a specific range is requested.
- No full MCP object inspection when field-level access suffices.
- No MCP mutations without prior read of current resource state.
- No SHA reuse across sessions for file updates.
Limitations
-
Brainstorming: disable during creative/open-ended design phases.
-
Grep Blindness: key context may fall outside filter boundaries.
-
Overshadowing: aggressive pruning may drop micro-variables in long sessions.