OpenAI has released a research preview of gpt-oss-safeguard, two open weight safety reasoning models that let developers apply custom safety […]
Category: Applications
Ant Group Releases Ling 2.0: A Reasoning-First MoE Language Model Series Built on the Principle that Each Activation Enhances Reasoning Capability
How do you build a language model that grows in capacity but keeps the computation for each token almost unchanged? The […]
Microsoft Releases Agent Lightning: A New AI Framework that Enables Reinforcement Learning (RL)-based Training of LLMs for Any AI Agent
How do you convert real agent traces into reinforcement learning RL transitions to improve policy LLMs without changing your existing […]
Liquid AI Releases LFM2-ColBERT-350M: A New Small Model that brings Late Interaction Retrieval to Multilingual and Cross-Lingual RAG
Can a compact late interaction retriever index once and deliver accurate cross lingual search with fast inference? Liquid AI released […]
MiniMax Releases MiniMax M2: A Mini Open Model Built for Max Coding and Agentic Workflows at 8% Claude Sonnet Price and ~2x Faster
Can an open source MoE truly power agentic coding workflows at a fraction of flagship model costs while sustaining long-horizon […]
Zhipu AI Releases ‘Glyph’: An AI Framework for Scaling the Context Length through Visual-Text Compression
Can we render long texts as images and use a VLM to achieve 3–4× token compression, preserving accuracy while scaling […]
Meet ‘kvcached’: A Machine Learning Library to Enable Virtualized, Elastic KV Cache for LLM Serving on Shared GPUs
Large language model serving often wastes GPU memory because engines pre-reserve large static KV cache regions per model, even when […]
5 Common LLM Parameters Explained with Examples
Large language models (LLMs) offer several parameters that let you fine-tune their behavior and control how they generate responses. If […]
Liquid AI’s LFM2-VL-3B Brings a 3B Parameter Vision Language Model (VLM) to Edge-Class Devices
Liquid AI released LFM2-VL-3B, a 3B parameter vision language model for image text to text tasks. It extends the LFM2-VL […]
An Implementation on Building Advanced Multi-Endpoint Machine Learning APIs with LitServe: Batching, Streaming, Caching, and Local Inference
In this tutorial, we explore LitServe, a lightweight and powerful serving framework that allows us to deploy machine learning models […]

 
			