The Alibaba Qwen team has introduced Qwen-VLo, a new addition to its Qwen model family, designed to unify multimodal understanding […]
Category: New Releases
Unbabel Introduces TOWER+: A Unified Framework for High-Fidelity Translation and Instruction-Following in Multilingual LLMs
Large language models have driven progress in machine translation, leveraging massive training corpora to translate dozens of languages and dialects […]
Polaris-4B and Polaris-7B: Post-Training Reinforcement Learning for Efficient Math and Logic Reasoning
The Rising Need for Scalable Reasoning Models in Machine Intelligence Advanced reasoning models are at the frontier of machine intelligence, […]
Google AI Releases Gemma 3n: A Compact Multimodal Model Built for Edge Deployment
Google has introduced Gemma 3n, a new addition to its family of open models, designed to bring large multimodal AI […]
Inception Labs Introduces Mercury: A Diffusion-Based Language Model for Ultra-Fast Code Generation
Generative AI and Its Challenges in Autoregressive Code Generation The field of generative artificial intelligence has significantly impacted software development […]
Google DeepMind Releases AlphaGenome: A Deep Learning Model that can more Comprehensively Predict the Impact of Single Variants or Mutations in DNA
A Unified Deep Learning Model to Understand the Genome Google DeepMind has unveiled AlphaGenome, a new deep learning framework designed […]
Google AI Releases Gemini CLI: An Open-Source AI Agent for Your Terminal
Google has unveiled Gemini CLI, an open-source command-line AI agent that integrates the Gemini 2.5 Pro model directly into the […]
Google DeepMind Releases Gemini Robotics On-Device: Local AI Model for Real-Time Robotic Dexterity
Google DeepMind has unveiled Gemini Robotics On-Device, a compact, local version of its powerful vision-language-action (VLA) model, bringing advanced robotic […]
ByteDance Researchers Introduce Seed-Coder: A Model-Centric Code LLM Trained on 6 Trillion Tokens
Reframing Code LLM Training through Scalable, Automated Data Pipelines Code data plays a key role in training LLMs, benefiting not […]
ByteDance Researchers Introduce VGR: A Novel Reasoning Multimodal Large Language Model (MLLM) with Enhanced Fine-Grained Visual Perception Capabilities
Why Multimodal Reasoning Matters for Vision-Language Tasks Multimodal reasoning enables models to make informed decisions and answer questions by combining […]
