AirTranslate floats macOS captions, Dynamo serves inference 7x faster | recall.ac

The daily AI changelog for full-stack developers.


Saturday, 23 May 2026

AirTranslate captures Mac audio and floats live captions with translation overlay while GitNexus builds client-side knowledge graphs for any.

Microsoft's Agent Governance Toolkit enforces zero-trust policies on AI agents with sub-millisecond checks, moving from prompt-based safety's 26.67% violation rate to.

📊FreeCodeCamp covers LLM A/B

Guide on cluster randomisation for experimenting with collaborative AI features. Python-focused methodology for product testing.

Takeaway

If we're shipping AI features to users, we need proper experimental design rather than hoping engagement metrics tell the story. The cluster randomisation approach handles the collaborative nature of AI tools where individual

Python · Testing · Research
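To make the cluster idea concrete, here is a minimal sketch of cluster-level assignment. It assumes collaborators can be grouped into clusters such as teams or workspaces. The `assign_arm` helper, the experiment name, and the user-to-team mapping are all hypothetical and not taken from the guide.

```python
import hashlib

def assign_arm(cluster_id: str, experiment: str, arms=("control", "treatment")) -> str:
    """Deterministically map a whole cluster (e.g. a team) to one arm.

    Hashing the cluster id, not the user id, keeps every collaborator
    in the same cluster on the same variant.
    """
    digest = hashlib.sha256(f"{experiment}:{cluster_id}".encode()).hexdigest()
    return arms[int(digest, 16) % len(arms)]

# Hypothetical user -> team mapping, for illustration only.
user_team = {"ana": "team-1", "bo": "team-1", "cy": "team-2"}

# Each user inherits the arm of their team.
assignments = {
    user: assign_arm(team, "ai-summary-v1") for user, team in user_team.items()
}
```

Because assignment happens per cluster, the analysis should also treat the cluster as the unit, for example by aggregating metrics per team before comparing arms.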

📰DEV.to ships semantic feeds

Dev.to·2 min read

Uses Gemini Embeddings with pgvector to personalise content feed beyond clicks and recency. Ruby-based auditing for AI calls.

Takeaway

Shows how to build semantic recommendations without becoming a clickbait echo chamber. They're combining community signals (follows, reactions) with vector similarity in PostgreSQL. The Ai::Base wrapper pattern for auditing API

Ruby · Gemini · Embeddings
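One way to picture the blend of community signals and vector similarity is a weighted score. This is a sketch in Python, not DEV's Ruby implementation. The `feed_score` function, its weights, and the reaction cap of 1000 are assumptions chosen for illustration.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def feed_score(user_vec, article_vec, follows_author, reactions,
               w_sim=0.7, w_follow=0.2, w_react=0.1):
    """Blend semantic similarity with community signals.

    The weights are hypothetical; a real feed would tune them.
    """
    similarity = cosine_similarity(user_vec, article_vec)
    # Log-scale reactions so a viral post cannot drown out relevance.
    reaction_signal = min(math.log1p(reactions) / math.log1p(1000), 1.0)
    return w_sim * similarity + w_follow * float(follows_author) + w_react * reaction_signal
```

In a pgvector setup the similarity term would normally be computed inside PostgreSQL; the pure-Python version here only shows how the terms combine.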

📦kubernetes-sigs

Manages isolated, stateful, singleton workloads in Kubernetes. Designed for AI agent runtimes that need persistent state.

Trending

Go · Kubernetes · Agents

📺AirTranslate floats macOS captions

Captures Mac system audio, transcribes with Apple Speech, translates live, and floats captions over other apps. Optional GPT mode.

Top Voted

Swift · Audio · Apple

🕸️GitNexus builds knowledge graphs

Client-side codebase analyser. Drop in a repo, get interactive knowledge graph with dependencies, call chains, and MCP integration.

Deep Dive

TypeScript · MCP · Code Gen

💬feishu-claude-code-bridge links

Bridges Feishu/Lark messenger with local Claude Code CLI. Streaming cards, per-chat sessions, multiple workspaces.

Trending

TypeScript · Claude · CLI

🔴offensive-claude ships red team

25 specialised skills and 6 agents for Claude Code covering exploit development, AD attacks, EDR bypass, and mobile pentesting. Includes 47 vulnerability pattern files for the full offensive security lifecycle.

Deep Dive

Claude · Security · CLI

Yesterday's Sentiment/Energised

Infrastructure Scaling Meets Agent Governance

Dynamo's datacenter orchestration claims and Microsoft's zero-trust agent governance show the space maturing toward production-grade tooling. Strong engagement around GitNexus knowledge graphs and security-focused Claude skills suggests devs want architectural clarity and safety guardrails.

Learn/Multiple Mentions

What are AI agent skills exactly?

AI agent skills are discrete, reusable capabilities that agents can execute to perform specific tasks or access external systems. They're essentially function definitions that agents can invoke during conversations or workflows. Offensive-claude ships 25 specialised skills for security testing, while vbsec provides security scanning skills. Unlike hardcoded features, skills are modular and composable, letting devs build agent capabilities incrementally.

MCP · Workflows

🤖Routa coordinates multi-agent

Workspace-first platform with Kanban boards, MCP/ACP integration, and stacked review gates for software delivery workflows.

Trending

TypeScript · Agents · MCP

🛡️vbsec scans security

Claude Code skill that detects 20+ security flaws including SQL injection, JWT misuse, and hardcoded secrets across multiple languages.

Trending

Python · Claude · Security

🦙Ollama adds new model support

Updated with Kimi-K2.5, GLM-5, MiniMax, and other recent models. Integrates with Claude Code, OpenClaw, OpenCode, Codex, and Copilot.

Trending

Go · Local AI · Claude

⚡Dynamo serves datacenter inference

Orchestrates SGLang, TensorRT-LLM, and vLLM across multiple nodes with disaggregated serving, KV-aware routing, and SLA-based autoscaling.

Bigger Picture

Multi-node orchestration reality check

Dynamo's 7x throughput claims come from GB200 benchmarks, but the real value may be in making SGLang, vLLM, and TensorRT-LLM work together without custom glue code. Most teams hit the multi-GPU coordination wall before they reach individual engine limits.

Trending

Rust · Python · Inference

📱Lucarne syncs agents via mobile

Zero-intrusion agent notifications through WeChat/Telegram. No hooks, skills, or MCP changes. QR code setup.

Trending

Rust · Agents · Mobile

🔗LangChain grows agent platform

Framework evolution continues with Deep Agents package for built-in planning, subagents, and file system usage patterns. Integrates with LangGraph for controllable agent workflows and LangSmith for deployment and debugging.

Trending

Python · LangChain · Agents

🛡️Microsoft ships agent governance

Zero-trust policy enforcement for AI agents with sub-millisecond checks. Every tool call evaluated against policy before execution.

Bigger Picture

Microsoft's governance enforcement approach

The drop from a 26.67% policy-violation rate under prompt-based safety to 0% isn't just about better rules; it's about enforcement architecture. Microsoft's toolkit intercepts every tool call rather than trusting agent behaviour, which may become the standard for production deployments.
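The interception pattern can be sketched in a few lines. This is an illustration of the general idea only, not the Agent Governance Toolkit's API. `Policy`, `GovernedExecutor`, `PolicyViolation`, and the sample tools are hypothetical names.

```python
from dataclasses import dataclass, field

class PolicyViolation(Exception):
    """Raised when a tool call is rejected before execution."""

@dataclass
class Policy:
    allowed_tools: set = field(default_factory=set)
    blocked_substrings: tuple = ()

    def check(self, tool: str, args: dict) -> None:
        # Deny by default: a tool must be explicitly allowed.
        if tool not in self.allowed_tools:
            raise PolicyViolation(f"tool not allowed: {tool}")
        for value in args.values():
            if any(s in str(value) for s in self.blocked_substrings):
                raise PolicyViolation(f"blocked argument for {tool}: {value!r}")

class GovernedExecutor:
    """Runs tools only after the policy check passes."""

    def __init__(self, policy: Policy, tools: dict):
        self.policy = policy
        self.tools = tools

    def call(self, tool: str, /, **args):
        self.policy.check(tool, args)  # enforcement happens outside the agent
        return self.tools[tool](**args)

# Hypothetical tools for illustration.
tools = {
    "read_file": lambda path: f"contents of {path}",
    "delete_file": lambda path: f"deleted {path}",
}
policy = Policy(allowed_tools={"read_file"}, blocked_substrings=("/etc/",))
executor = GovernedExecutor(policy, tools)
```

The agent never calls a tool directly. Every request goes through `executor.call`, so a prompt that talks the model into a risky action still fails at the gate.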

Trending

Python · TypeScript · Microsoft

🚀Google Antigravity 2.0 migration

Dev.to·2 min read

Migration guide from 1.0 to 2.0. Key insight: customisations don't carry over, since 2.0 is no longer built on the VS Code foundation.

Takeaway

If we're using Google Antigravity, back up ~/.gemini/antigravity before upgrading. The 2.0 rewrite breaks compatibility with extensions and history. The guide covers recovering stranded conversations and agent brain entries that

Under The Radar

Google · VS Code · AI Workflows

🧠MemOS evolves agent memory

Self-evolving memory system with L1 trace, L2 policy, L3 world model layers. Claims 35.24% token savings and 43.70% accuracy boost.

Trending

TypeScript · Agents · Open Source

🌉Bifrost unifies 23+ AI providers

Enterprise AI gateway with automatic failover, load balancing, and semantic caching. Unifies access to 23+ providers through a single OpenAI-compatible API with zero-configuration deployment.

Bigger Picture

Security shift in agent tooling

Both vbsec and offensive-claude signal that security-aware AI coding is moving from afterthought to first-class concern. The timing aligns with more production deployments hitting real attack surfaces.

Trending

Go · OpenAI · Anthropic

💼AI's subtle career transformation

Personal reflection on how AI changes job satisfaction and career paths rather than just replacing roles. Focus on adaptation strategies.

Takeaway

The shift isn't about AI replacing us; it's about how it changes which parts of our work feel meaningful. Understanding which skills remain irreplaceable helps focus learning time on areas where human judgment still matters most.

AI Workflows

🔧OpenSRE builds incident agents

Open framework for AI SRE agents with 60+ tool integrations, synthetic incident simulations, and reinforcement learning environment.

Trending

Python · Agents · DevOps

🗣️Parallel agent deliberation tool

Reddit user built a system for AI agents to discuss in parallel terminals and communicate between sessions. Early prototype stage.

Agents · Open Source

🏗️AWS ships AI-DLC workflows

Adaptive development lifecycle rules for AI coding agents. Three-phase workflow with quality gates and intelligent steering.

Trending

Python · AWS · AI Workflows

Learn/Core Concept

How does disaggregated serving work?

Disaggregated serving splits the two phases of LLM inference across separate workers or nodes instead of running both on the same GPUs. The phases are prefill, which processes the prompt and builds the KV cache, and decode, which generates output tokens one at a time. Prefill is compute-bound while decode is limited by memory bandwidth, so each pool can be sized and scaled on its own. Dynamo pairs this with KV-aware routing, which sends a request to a worker that already holds the relevant cached key-value entries so the shared prefix isn't recomputed. This architecture improves resource utilisation and enables more flexible scaling patterns.
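The KV-aware routing idea can be sketched as a prefix-overlap score. This is a toy model under stated assumptions, not Dynamo's router: `pick_worker`, the per-worker lists of cached token sequences, and the load numbers are all hypothetical.

```python
def shared_prefix_len(a, b):
    """Number of leading tokens two sequences have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n

def pick_worker(prompt_tokens, worker_caches, worker_load):
    """Prefer the worker with the longest cached prefix; break ties by lowest load.

    worker_caches maps worker name -> list of cached token sequences.
    worker_load maps worker name -> number of in-flight requests.
    """
    def score(worker):
        best = max(
            (shared_prefix_len(prompt_tokens, cached) for cached in worker_caches[worker]),
            default=0,
        )
        return (best, -worker_load[worker])

    return max(worker_caches, key=score)

# Hypothetical cluster state for illustration.
caches = {"w0": [[1, 2, 3]], "w1": [[1, 2, 3, 4, 5]], "w2": []}
load = {"w0": 0, "w1": 2, "w2": 0}
chosen = pick_worker([1, 2, 3, 4, 9], caches, load)
```

Here `w1` wins despite its higher load because it already holds four of the five prompt tokens. A production router would weigh cache reuse against queue depth rather than always preferring the longest prefix.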

Orchestration · Autoscaling