Anthropic released Claude Haiku 4.5, a latency-optimized “small” model that delivers similar levels of coding performance to Claude Sonnet 4 […]
Category: Small Language Model
Alibaba’s Qwen AI Releases Compact Dense Qwen3-VL 4B/8B (Instruct & Thinking) With FP8 Checkpoints
Do you actually need a giant VLM when dense Qwen3-VL 4B/8B (Instruct/Thinking) with FP8 runs in low VRAM yet retains […]
Liquid AI Releases LFM2-8B-A1B: An On-Device Mixture-of-Experts with 8.3B Params and 1.5B Active Params per Token
How much capability can a sparse 8.3B-parameter MoE with a ~1.5B active path deliver on your phone without blowing latency […]
Large Language Models (LLMs) vs. Small Language Models (SLMs) for Financial Institutions: A 2025 Practical Enterprise AI Guide
Table of contents: 1. Regulatory and Risk Posture 2. Capability vs. Cost, Latency, and Footprint 3. Security and Compliance Trade-offs […]
Alibaba Qwen Unveils Qwen3-4B-Instruct-2507 and Qwen3-4B-Thinking-2507: Refreshing the Importance of Small Language Models
Smaller Models with Smarter Performance and 256K Context Support: Alibaba’s Qwen team has introduced two powerful additions to its small […]
OpenAI Just Released the Hottest Open-Weight LLMs: gpt-oss-120B (Runs on a High-End Laptop) and gpt-oss-20B (Runs on a Phone)
OpenAI has just sent seismic waves through the AI world: for the first time since GPT-2 hit the scene in […]
The Ultimate 2025 Guide to Coding LLM Benchmarks and Performance Metrics
Large language models (LLMs) specialized for coding are now integral to software development, driving productivity through code generation, bug fixing, […]
Mistral AI Releases Devstral 2507 for Code-Centric Language Modeling
Mistral AI, in collaboration with All Hands AI, has released updated versions of its developer-focused large language models under the […]
Hugging Face Releases SmolLM3: A 3B Long-Context, Multilingual Reasoning Model
Hugging Face just released SmolLM3, the latest version of its “Smol” language models, designed to deliver strong multilingual reasoning over […]
Enterprise AI Without GPU Burn: Salesforce’s xGen-small Optimizes for Context, Cost, and Privacy
Language processing in enterprise environments faces critical challenges as business workflows increasingly depend on synthesising information from diverse sources, including […]
