BerriAI ships agent platform, Voicebox clones voices

Monday, 18 May 2026

BerriAI ships agent platform that runs Claude Code and Codex inside Kubernetes sandboxes with credential vaults. SANA with efficient diffusion transformers for high-res image synthesis.

The business side stayed relatively quiet, with most action focused on local AI infrastructure. DreamServer positions itself as "AI sovereignty" for personal devices whilst Voicebox clones voices locally across 23 languages.

🎨SANA ships diffusion transformer

GitHub

Efficiency-oriented codebase for high-resolution image and video generation from NVIDIA Labs. Provides complete training and inference pipelines for high-resolution synthesis.

Bigger Picture

Storage Claims Need Verification

LEANN's 97% storage reduction through on-demand embedding computation sounds revolutionary but needs independent benchmarking. If the quality claims hold up, this could reshape how we think about vector database costs.

Top Voted

PythonNvidiaDiffusion

🤖BerriAI ships agent platform

GitHub

Self-hosted platform that runs Claude Code and Codex in isolated Kubernetes pods with credential vault proxy. Agents see stub tokens whilst real keys get swapped on every outbound TLS connection.

Deep Dive

TypeScriptAgentsKubernetes

🛡️Agent Skills ships registry

GitHub

Curated skill library for AI coding agents with security validation. Claims 100% vulnerability-free skills compared to 13% of marketplace alternatives.

Trending

TypeScriptAgentsSecurity

🔍whichllm finds optimal models

GitHub

Tool that auto-detects our hardware and ranks the best local LLMs that actually fit and perform well. Uses real benchmarks instead of parameter count.

Trending

PythonLocal AITool Comparison

Yesterday's Sentiment/Energised

Infrastructure Over Intelligence

BerriAI's agent platform and LEANN's storage breakthrough signal the community prioritising production readiness over raw capabilities. Local AI infrastructure like Voicebox and DreamServer shows devs want control over their AI stack rather than depending on cloud subscriptions.

💱OKX Agent Trade Kit adds MCP

GitHub

AI-powered trading toolkit with 145 tools across spot, futures, and algo trading. Ships MCP server for Claude integration and read-only safety controls.

Trending

TypeScriptMCPAPI

🎙️Voicebox ships voice cloning

GitHub

Local-first voice studio that clones voices from seconds of audio and generates speech in 23 languages. Alternative to ElevenLabs that runs entirely on our machine.

Bigger Picture

Voice Sovereignty Movement

Voicebox's local voice cloning aligns with a broader pattern we're seeing around AI sovereignty. Teams want the capabilities of cloud AI without the vendor dependence or privacy trade-offs that come with it.

Trending

TypeScriptAudioLocal AI

🔄Langflow

GitHub

Visual workflow builder for AI agents and workflows gains fresh momentum. Ships with MCP server support and multi-agent orchestration capabilities.

Trending

PythonAgentsMCP

Learn/Multiple Mentions

What is local-first AI deployment?

Local-first AI runs models entirely on our hardware without cloud dependencies. Today's issue features Voicebox for voice cloning, DreamServer's complete AI stack, and whichllm's hardware detection. This approach prioritises privacy and eliminates subscription costs but requires careful hardware matching and model optimisation.

PrivacyHardware

🎯Polymarket AI trading system

GitHub

Paper-trading tools for prediction markets with mean-reversion strategies and optional OpenAI integration. Includes dashboard and Kelly-style position sizing.

Deep Dive

JavaScriptOpenAI

☁️Golem Cloud ships agent platform

GitHub

Platform for building AI agents and distributed applications with persistent state. Supports Rust, TypeScript, Scala, and MoonBit development.

Trending

RustAgentsWebAssembly

🏠DreamServer ships local AI stack

GitHub

Complete local-first AI platform covering inference, chat, voice, agents, workflows, and RAG. Positions itself as an alternative to cloud AI subscriptions.

Trending

PythonLocal AISelf-Hosting

📈Best-of Algorithmic Trading

GitHub

Curated list of 109 algorithmic trading projects ranked by quality metrics. Covers bots, frameworks, APIs, and educational resources.

Trending

TypeScriptOpen Source

🗜️LEANN ships 97% storage savings

GitHub

Graph-based vector database that computes embeddings on-demand instead of storing them. Claims to achieve 97% storage savings compared to traditional solutions through graph-based selective recomputation, enabling indexing of millions of documents on a personal laptop.

Trending

PythonRAGVector DB

Learn/Core Concept

How does on-demand computation work?

On-demand computation calculates values when needed rather than storing pre-computed results. LEANN's vector database exemplifies this by computing embeddings in real-time instead of storing them, achieving 97% storage savings. This trade-off exchanges memory for computation cycles, making it ideal for resource-constrained environments where storage costs exceed processing overhead.

EmbeddingsCaching

Read online