New Releases – Page 58 – The TechBriefs

ByteDance Researchers Introduce Seed-Coder: A Model-Centric Code LLM Trained on 6 Trillion Tokens

Reframing Code LLM Training through Scalable, Automated Data Pipelines Code data plays a key role in training LLMs, benefiting not […]

ByteDance Researchers Introduce VGR: A Novel Reasoning Multimodal Large Language Model (MLLM) with Enhanced Fine-Grained Visual Perception Capabilities

Why Multimodal Reasoning Matters for Vision-Language Tasks Multimodal reasoning enables models to make informed decisions and answer questions by combining […]

BAAI Launches OmniGen2: A Unified Diffusion and Transformer Model for Multimodal AI

Beijing Academy of Artificial Intelligence (BAAI) introduces OmniGen2, a next-generation, open-source multimodal generative model. Expanding on its predecessor OmniGen, the […]

New from Chinese Academy of Sciences: Stream-Omni, an LLM for Cross-Modal Real-Time AI

Understanding the Limitations of Current Omni-Modal Architectures Large multimodal models (LMMs) have shown outstanding omni-capabilities across text, vision, and speech […]

Moonshot AI Unveils Kimi-Researcher: An Reinforcement Learning RL-Trained Agent for Complex Reasoning and Web-Scale Search

The Challenge: Scaling Autonomous Agents with RL Autonomous AI agents have been at the forefront of taking computational abilities to […]

EmbodiedGen: A Scalable 3D World Generator for Realistic Embodied AI Simulations

The Challenge of Scaling 3D Environments in Embodied AI Creating realistic and accurately scaled 3D environments is essential for training […]

Google Researchers Release Magenta RealTime: An Open-Weight Model for Real-Time AI Music Generation

Google’s Magenta team has introduced Magenta RealTime (Magenta RT), an open-weight, real-time music generation model that brings unprecedented interactivity to […]