Long-context inference makes the KV cache one of the main costs of serving LLMs. During autoregressive decoding, the cache grows […]
Step by Step Guide to Build and Compare FedAvg and FedProx Federated Learning on Non-IID CIFAR-10 with NVIDIA FLARE
In this tutorial, we build an advanced federated learning experiment with NVIDIA FLARE. We compare FedAvg and FedProx on a […]
Best Authentication Platforms for AI Agents and MCP Servers in 2026
The Model Context Protocol has moved from Anthropic’s internal experiment to a de facto industry standard at a speed few […]
WorkOS Releases auth.md: An Open Agent Registration Protocol Built on OAuth Standards
For years, authentication on the web followed one design assumption: a human sits behind a browser. Click a button. Fill […]
Build a Complete Langfuse Observability and Evaluation Pipeline for Tracing, Prompt Management, Scoring, and Experiments
In this tutorial, we implement a pipeline with Langfuse, an open-source LLM engineering platform, for tracing, prompt management, scoring, datasets, and […]
StepFun Releases StepAudio 2.5 Realtime: An End-to-End Voice Model with Roleplay-Specific RLHF and Paralinguistic Comprehension
StepFun, the Shanghai-based AI lab, released StepAudio 2.5 Realtime. It is an end-to-end real-time speech large language model with fully […]
Microsoft Research Releases Webwright: A Terminal-Native Web Agent Framework That Scores 60.1% on Odysseys, Up from Base GPT-5.4’s 33.5%
Most web agents today drive a browser one action at a time. The model receives the current page state — […]
NVIDIA AI Releases Gated DeltaNet-2: A Linear Attention Layer That Decouples Erase and Write in the Delta Rule
Linear attention replaces the unbounded KV cache of softmax attention with a fixed-size recurrent state. This cuts sequence mixing to […]
Tencent Open-Sources TencentDB Agent Memory: A 4-Tier Local Memory Pipeline for AI Agents
Tencent has released TencentDB Agent Memory, an open-source memory system for AI agents. The project ships under the MIT license. […]
Build a SuperClaude Framework Workflow with Commands, Agents, Modes, and Session Memory
In this tutorial, we build an advanced workflow using the SuperClaude Framework as a structured layer on top of the […]
