Meta released Astryx this week. It is an open-source design system, currently in Beta. The project grew inside Meta’s monorepo […]
Category: Staff
Building Supervised Fine-Tuning Data from NVIDIA Open-SWE-Traces: Trajectory Parsing, Patch Analysis, Token Budgets, and Tool-Use Metrics
In this tutorial, we explore the Open-SWE-Traces dataset as a practical resource for studying and preparing agentic software-engineering trajectories for […]
Cursor Study Finds Reward Hacking Inflates Coding-Agent Benchmark Scores on SWE-bench Pro
A new Cursor study reports that newer coding agents often retrieve known fixes instead of deriving them, inflating popular benchmark […]
Perplexity Launches Computer for Counsel: A Multi-Model Agentic Layer for Legal Workflows
Perplexity launched Computer for Counsel. It is an agentic AI system built for legal teams. The product extends Perplexity Computer, […]
OpenAI Previews GPT-5.6 With Sol, Terra, and Luna: Tiered Models, New Reasoning Modes, Limited Access
OpenAI has begun a limited preview of GPT-5.6, its next-generation model series. The lineup splits into three named tiers: Sol, […]
Meet container: Apple’s Open-Source Swift Tool for Running Linux Containers as Lightweight VMs on Apple Silicon
Apple research team recently released the container project. It is an open-source command-line tool written in Swift. It creates and […]
DeepReinforce Releases Ornith-1.0: An Open-Source Coding Model Family That Learns Its Own RL Scaffolds
DeepReinforce has released Ornith-1.0, an open-source model family built for agentic coding. The lineup spans four sizes, from a 9B […]
Baidu Releases Unlimited OCR, a 3B Model That Keeps the KV Cache Flat for Long-Document Parsing
Most end-to-end OCR models slow down as output grows. Each generated token adds to the KV cache. Memory rises and […]
Gradium Launches stt-translate and s2s-translate, Real-Time Speech Translation Models Beating gpt-realtime-translate on Accuracy and Latency
Gradium today released two real-time speech translation models: stt-translate and s2s-translate. Both run across five languages and stream results live […]
How to Design an OpenHarness Style Agent Runtime with Tools, Memory, Permissions, Skills, and Multi-Agent Coordination
In this tutorial, we build OpenHarness from scratch to better understand how a practical agent harness works. We recreate the […]
