Staff – Page 2 – The TechBriefs

Polaris-4B and Polaris-7B: Post-Training Reinforcement Learning for Efficient Math and Logic Reasoning

The Rising Need for Scalable Reasoning Models in Machine Intelligence Advanced reasoning models are at the frontier of machine intelligence, […]

GURU: A Reinforcement Learning Framework that Bridges LLM Reasoning Across Six Domains

Limitations of Reinforcement Learning in Narrow Reasoning Domains Reinforcement Learning RL has demonstrated strong potential to enhance the reasoning capabilities […]

Google AI Releases Gemma 3n: A Compact Multimodal Model Built for Edge Deployment

Google has introduced Gemma 3n, a new addition to its family of open models, designed to bring large multimodal AI […]

Inception Labs Introduces Mercury: A Diffusion-Based Language Model for Ultra-Fast Code Generation

Generative AI and Its Challenges in Autoregressive Code Generation The field of generative artificial intelligence has significantly impacted software development […]

MIT and NUS Researchers Introduce MEM1: A Memory-Efficient Framework for Long-Horizon Language Agents

Modern language agents need to handle multi-turn conversations, retrieving and updating information as tasks evolve. However, most current systems simply […]

Google AI Releases Gemini CLI: An Open-Source AI Agent for Your Terminal

Google has unveiled Gemini CLI, an open-source command-line AI agent that integrates the Gemini 2.5 Pro model directly into the […]

New AI Research Reveals Privacy Risks in LLM Reasoning Traces

Introduction: Personal LLM Agents and Privacy Risks LLMs are deployed as personal assistants, gaining access to sensitive user data through […]

ETH and Stanford Researchers Introduce MIRIAD: A 5.8M Pair Dataset to Improve LLM Accuracy in Medical AI

Challenges of LLMs in Medical Decision-Making: Addressing Hallucinations via Knowledge Retrieval LLMs are set to revolutionize healthcare through intelligent decision […]

Google DeepMind Releases Gemini Robotics On-Device: Local AI Model for Real-Time Robotic Dexterity

Google DeepMind has unveiled Gemini Robotics On-Device, a compact, local version of its powerful vision-language-action (VLA) model, bringing advanced robotic […]

ByteDance Researchers Introduce Seed-Coder: A Model-Centric Code LLM Trained on 6 Trillion Tokens

Reframing Code LLM Training through Scalable, Automated Data Pipelines Code data plays a key role in training LLMs, benefiting not […]