Agency Agents ships 20+ specialists, RTK cuts tokens 80%, Anthropic's plugins go live

Wednesday, 20 May 2026

Agency Agents is finally available: 20+ meticulously crafted AI specialist personalities for everything from frontend wizardry to Reddit ninja tactics.

Anthropic's official plugins directory went live for Claude Code, whilst shimmy hit 1.9.0 with all GPU backends bundled for local.

🤖Free Claude Code proxy

GitHub

Routes Anthropic API calls from Claude Code to ten provider backends including NVIDIA NIM, Kimi, OpenRouter, DeepSeek, LM Studio, llama.cpp, Ollama, OpenCode Zen, Wafer, and Z.ai.

Deep Dive

PythonClaudeCost Optimisation

📊AI IaC adoption survey results

DevOps practitioners share real production usage of AI agents for Terraform, CloudFormation, and Bicep generation.

Hot TakeTerraformDevOpsAI Workflows

🐳Docker ships Gordon AI agent

docker

AI agent for entire container workflow that understands our environment and takes action across Docker Desktop.

Bigger Picture

Agent personalities get serious

Moving beyond generic prompt templates to battle-tested specialist personas is smart. Each agent ships with success metrics and proven deliverables rather than vague role descriptions. The Claude Code integration with ./scripts/install.sh shows they understand distribution.

Under The RadarDockerAgentsDevOps

📈ML infra career transition data

Discussion on taking pay cuts to move into ML infrastructure roles and whether the domain shift pays off.

Divisive

💬Anthropic widens AI conversation

Anthropic

Anthropic publishes stance on widening conversations around frontier AI development and safety.

Takeaway

Signals Anthropic's direction on open research and public discourse. This affects how they'll handle future model releases and API access policies.

AnthropicAI Safety

⚙️Engineering AI/ML systems insights

Experienced devs discuss the unique challenges and patterns when building production AI/ML infrastructure.

AI Workflows

🎭Agency Agents ships specialists

GitHub

Complete AI agency with a growing collection of specialised agent personalities: frontend wizards, Reddit ninjas, reality checkers.

Top VotedAgentsAI WorkflowsDev Tools

⚡RTK cuts tokens 80%

GitHub

CLI proxy that reduces LLM token consumption by approximately 80% on common dev commands like ls, cat, git status, with savings ranging from 70% to 92% across different operations.

Bigger Picture

Token efficiency finally shipping

We've seen compression tools before, but RTK's approach of filtering command output before it hits our LLM context is genuinely different. The 80% savings on git diff and 92% on commit operations address the exact pain points we hit daily. This could reshape how we design AI workflows.

Deep Dive

RustCLICost Optimisation

Learn/Multiple Mentions

What is MCP in AI development context?

MCP (Model Context Protocol) is a standardised way for AI tools to connect with external systems and data sources. Today's releases show its adoption: code review graphs, OmniRoute gateways, IDA Pro bridges, and MATLAB integration all use MCP to extend AI capabilities beyond basic chat.

IntegrationProtocols

🌉Automatic fallbacks with Bifrost

Dev.to

Deep dive into implementing automatic fallback patterns for distributed AI systems and API reliability.

Takeaway

The promise vs reality gap in AI deployments. Bifrost's approach to handling distributed failures and provider outages shows us patterns we should build into our own stacks.

AI WorkflowsDevOpsLLM Ops

Yesterday's Sentiment/Bullish

Productivity Tools Take Centre Stage

Devs are shipping the infrastructure we've been waiting for. RTK finally tackles the token waste problem whilst Agency Agents delivers production-ready AI specialists. Privacy-preserving patterns from MemPrivacy and self-healing systems like Claude Soul show the community maturing beyond basic wrappers.

✂️slopless catches prose slop

GitHub

Deterministic textlint rules for catching AI-generated prose patterns in Markdown documentation.

Trending

TypeScriptCLIAI Workflows

🔌Anthropic's plugins go live

GitHub

Official directory for Claude Code plugins with curated internal and external partner submissions.

Bigger Picture

Privacy meets agent memory

Local pseudonymization before cloud storage solves a real production problem. Replacing 160/110 with Health_Info_1 preserves semantic meaning whilst keeping raw values local. This architecture pattern will become standard for enterprise agent deployments.

Trending

PythonClaudeAnthropic

🔄LLMix wraps provider SDKs

GitHub

Production LLM layer for Python, TypeScript, Rust with cache, retries, circuit breakers, key rotation.

Trending

Python

TypeScript

Rust

🌐OmniRoute routes 160+ providers

GitHub

Free AI gateway with RTK compression, auto-fallback, MCP support, and Desktop/PWA for 160+ providers.

Trending

TypeScriptCost OptimisationMCP

📊Code review graph cuts tokens 6x

GitHub

Persistent codebase knowledge graph for Claude Code that builds a structural map with Tree-sitter, tracks changes incrementally, and provides precise context to avoid re-reading the entire codebase on every task.

Trending

PythonClaudeCode Gen

🔍iida-mcp bridges IDA Pro

GitHub

IDA Pro plugin exposing static analysis via HTTP MCP with 77 tools and optional Windows kernel access.

Trending

PythonMCPSecurity

🚀shimmy 1.9 bundles all GPU

GitHub

OpenAI-compatible inference server with all GPU backends included in single binary - no compilation needed.

Bigger Picture

Self-healing becomes the norm

Built for hostile sites where traditional scrapers break. Four healing strategies compete when selectors fail, and winners persist for next time. The TLS fingerprinting layer addresses bot detection at the protocol level where most tools fail. Production-grade scraping.

Trending

RustLocal AIOpenAI

🔍semble_rs speeds code search

GitHub

Rust code search with hybrid BM25 + semantic, Tree-sitter chunking, dependency analysis for AI agents.

Trending

RustCode GenAI Workflows

🔧NeuralInverse tackles legacy

GitHub

AI-native IDE for modernizing legacy systems, firmware development, and regulated codebase migrations.

Trending

TypeScriptAI WorkflowsDev Tools

🔄Gemini-API reverse engineers web

GitHub

Asynchronous Python wrapper for Google Gemini with persistent cookies, image generation, deep research workflow.

Trending

PythonGoogleGemini

🧠Claude Soul builds persistence

GitHub

Self-correcting learning engine that tracks corrections, builds cross-session memory, and develops judgment over time.

Trending

TypeScriptClaudeAI Workflows

🛡️MemPrivacy shields agent data

GitHub

Privacy-preserving memory framework that replaces sensitive spans with typed placeholders before cloud storage.

Trending

PythonPrivacyAgents

📐MATLAB ships official MCP

GitHub

Official MATLAB MCP Server from MathWorks for AI applications and coding agents integration.

Trending

GoMCP

🌐CDP Bridge controls browsers

GitHub

MCP server connecting clients to real browser sessions through CDP and Chrome extension for logged-in automation.

Trending

PythonMCP

📊Byaan learns your databases

GitHub

Open-source data analyst that auto-discovers schema, learns business rules, and compounds knowledge over time.

Trending

PythonMCP

🔍fff fast file search

GitHub

High-performance file search toolkit optimized for AI agents, Neovim, and development workflows.

Trending

RustAI Workflows

⚖️German legal skills for Claude

GitHub

Experimental Claude skills for German law practice with Arbeitsrecht, Datenschutz, and Insolvenzrecht workflows.

Trending

PythonClaude

🚪Claude Code gateway simplifies

GitHub

Desktop app for routing Claude Code through any provider - OpenRouter, DeepSeek, GitHub Copilot, local models.

Trending

TypeScriptClaude

📚LLM engineering course repo

GitHub

Repository accompanying comprehensive LLM engineering course with practical examples and production patterns.

Trending

PythonLLM Ops

🕷️Anansi self-heals selectors

GitHub

Web scraper that repairs CSS selectors automatically, switches to browser rendering when needed, evades bot detection.

Trending

PythonMCP

Learn/Core Concept

How does token compression actually work?

Token compression reduces LLM API costs by encoding common patterns more efficiently before sending requests. Instead of sending raw text, systems like RTK analyse command outputs and replace repetitive structures with compact representations that models can still understand. This cuts token usage by 70-92% on typical dev commands without losing meaning.

TokenisationInference

Read online