How does model distillation work?

Model distillation trains a smaller student model to mimic a larger teacher model's outputs, transferring knowledge without copying weights. Rather than learning only from the teacher's final predictions, the student learns from the teacher's full probability distributions ("soft targets"), which capture nuance such as how the teacher ranks the wrong answers. This is how models small enough to run locally on a laptop can approach the performance of far larger models like GPT-4.
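To make the mechanics concrete, here is a minimal sketch of a distillation loss in PyTorch (an assumed framework; the function name and the `temperature` and `alpha` hyperparameters are illustrative, not taken from this page). It blends a KL-divergence term that matches the student's temperature-softened distribution to the teacher's with a standard cross-entropy term on the ground-truth labels:

```python
# Illustrative sketch, not a definitive implementation.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    """Blend soft-target imitation of the teacher with hard-label training."""
    # Soften both distributions with the temperature, then match them with
    # KL divergence. Scaling by T^2 keeps gradient magnitudes comparable
    # across temperature settings.
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    soft_loss = F.kl_div(soft_student, soft_teacher,
                         reduction="batchmean") * temperature ** 2

    # Standard cross-entropy against the ground-truth labels.
    hard_loss = F.cross_entropy(student_logits, labels)

    # alpha balances imitating the teacher vs. fitting the labels.
    return alpha * soft_loss + (1 - alpha) * hard_loss
```

In a training loop, the teacher runs in inference mode and only the student is updated, e.g. `teacher_logits = teacher(batch)` under `torch.no_grad()`, then `loss = distillation_loss(student(batch), teacher_logits, labels)`. Raising the temperature spreads the teacher's probability mass over more classes, exposing more of its decision-making pattern to the student.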