Chroma 1.0 is a real time speech to speech dialogue model that takes audio as input and returns audio as […]
Category: AI Paper Summary
Salesforce AI Introduces FOFPred: A Language-Driven Future Optical Flow Prediction Framework that Enables Improved Robot Control and Video Generation
Salesforce AI research team present FOFPred, a language driven future optical flow prediction framework that connects large vision language models […]
Microsoft Research Releases OptiMind: A 20B Parameter Model that Turns Natural Language into Solver Ready Optimization Models
Microsoft Research has released OptiMind, an AI based system that converts natural language descriptions of complex decision problems into mathematical […]
Google AI Releases TranslateGemma: A New Family of Open Translation Models Built on Gemma 3 with Support for 55 Languages
Google AI has released TranslateGemma, a suite of open machine translation models built on Gemma 3 and targeted at 55 […]
NVIDIA AI Open-Sourced KVzap: A SOTA KV Cache Pruning Method that Delivers near-Lossless 2x-4x Compression
As context lengths move into tens and hundreds of thousands of tokens, the key value cache in transformer decoders becomes […]
DeepSeek AI Researchers Introduce Engram: A Conditional Memory Axis For Sparse LLMs
Transformers use attention and Mixture-of-Experts to scale computation, but they still lack a native way to perform knowledge lookup. They […]
How This Agentic Memory Research Unifies Long Term and Short Term Memory for LLM Agents
How do you design an LLM agent that decides for itself what to store in long term memory, what to […]
Meta and Harvard Researchers Introduce the Confucius Code Agent (CCA): A Software Engineering Agent that can Operate at Large-Scale Codebases
How far can a mid sized language model go if the real innovation moves from the backbone into the agent […]
Stanford Researchers Build SleepFM Clinical: A Multimodal Sleep Foundation AI Model for 130+ Disease Prediction
A team of Stanford Medicine researchers have introduced SleepFM Clinical, a multimodal sleep foundation model that learns from clinical polysomnography […]
Tencent Researchers Release Tencent HY-MT1.5: A New Translation Models Featuring 1.8B and 7B Models Designed for Seamless on-Device and Cloud Deployment
Tencent Hunyuan researchers have released HY-MT1.5, a multilingual machine translation family that targets both mobile devices and cloud systems with […]
