How do you keep RAG systems accurate and efficient when every query tries to stuff thousands of tokens into the […]
Category: Open Source
StepFun AI Releases Step-Audio-R1: A New Audio LLM that Finally Benefits from Test Time Compute Scaling
Why do current audio AI models often perform worse when they generate longer reasoning instead of grounding their decisions in […]
NVIDIA AI Releases Orchestrator-8B: A Reinforcement Learning Trained Controller for Efficient Tool and Model Selection
How can an AI system learn to pick the right model or tool for each step of a task instead […]
DeepSeek AI Releases DeepSeekMath-V2: The Open Weights Maths Model That Scored 118/120 on Putnam 2024
How can an AI system prove complex olympiad-level math problems in clear natural language while also checking that its […]
OceanBase Releases seekdb: An Open Source AI Native Hybrid Search Database for Multi-model RAG and AI Agents
AI applications rarely deal with one clean table. They mix user profiles, chat logs, JSON metadata, embeddings, and sometimes spatial […]
Tencent Hunyuan Releases HunyuanOCR: a 1B Parameter End to End OCR Expert VLM
Tencent Hunyuan has released HunyuanOCR, a 1B-parameter vision-language model specialized for OCR and document understanding. The […]
KEV catalog missing 88 percent of exploits
New research from Miggo Security suggests that CISA’s Known Exploited Vulnerabilities (KEV) catalog now reflects only a small slice of […]
Black Forest Labs Releases FLUX.2: A 32B Flow Matching Transformer for Production Image Pipelines
Black Forest Labs has released FLUX.2, its second-generation image generation and editing system. FLUX.2 targets real-world creative workflows […]
Microsoft AI Releases Fara-7B: An Efficient Agentic Model for Computer Use
How do we safely let an AI agent handle real web tasks like booking, searching, and form filling directly on […]
NVIDIA AI Releases Nemotron-Elastic-12B: A Single AI Model that Gives You 6B/9B/12B Variants without Extra Training Cost
Why are AI dev teams still training and storing multiple large language models for different deployment needs when one elastic […]
