Can we render long texts as images and use a VLM to achieve 3–4× token compression, preserving accuracy while scaling […]
Category: New Releases
Meet ‘kvcached’: A Machine Learning Library to Enable Virtualized, Elastic KV Cache for LLM Serving on Shared GPUs
Large language model serving often wastes GPU memory because engines pre-reserve large static KV cache regions per model, even when […]
A New AI Research from Anthropic and Thinking Machines Lab Stress Tests Model Specs and Reveal Character Differences among Language Models
AI companies use model specifications to define target behaviors during training and evaluation. Do current specs state the intended behaviors […]
Liquid AI’s LFM2-VL-3B Brings a 3B Parameter Vision Language Model (VLM) to Edge-Class Devices
Liquid AI released LFM2-VL-3B, a 3B parameter vision language model for image text to text tasks. It extends the LFM2-VL […]
Salesforce AI Research Introduces WALT (Web Agents that Learn Tools): Enabling LLM agents to Automatically Discover Reusable Tools from Any Website
A team of Salesforce AI researchers introduced WALT (Web Agents that Learn Tools), a framework that reverse-engineers latent website functionality […]
Anthrogen Introduces Odyssey: A 102B Parameter Protein Language Model that Replaces Attention with Consensus and Trains with Discrete Diffusion
Anthrogen has introduced Odyssey, a family of protein language models for sequence and structure generation, protein editing, and conditional design. […]
PokeeResearch-7B: An Open 7B Deep-Research Agent Trained with Reinforcement Learning from AI Feedback (RLAIF) and a Robust Reasoning Scaffold
Pokee AI has open sourced PokeeResearch-7B, a 7B parameter deep research agent that executes full research loops, decomposes a query, […]
How to Design a Fully Functional Enterprise AI Assistant with Retrieval Augmentation and Policy Guardrails Using Open Source AI Models
In this tutorial, we explore how we can build a compact yet powerful Enterprise AI assistant that runs effortlessly on […]
OpenAI Introduces ChatGPT Atlas: A Chromium-based browser with a built-in AI agent
OpenAI just launched ChatGPT Atlas, a new AI browser that embeds ChatGPT at the core of navigation, search, and on-page […]
Google AI Research Releases DeepSomatic: A New AI Model that Identifies Cancer Cell Genetic Variants
A team of researchers from Google Research and UC Santa Cruz released DeepSomatic, an AI model that identifies cancer cell […]
