Staff – The TechBriefs

MDM-Prime: A generalized Masked Diffusion Models (MDMs) Framework that Enables Partially Unmasked Tokens during Sampling

Introduction to MDMs and Their Inefficiencies Masked Diffusion Models (MDMs) are powerful tools for generating discrete data, such as text […]

University of Michigan Researchers Propose G-ACT: A Scalable Machine Learning Framework to Steer Programming Language Bias in LLMs

LLMs and the Need for Scientific Code Control LLMs have rapidly evolved into complex natural language processors, enabling the development […]

A Coding Guide to Build a Functional Data Analysis Workflow Using Lilac for Transforming, Filtering, and Exporting Structured Insights

In this tutorial, we demonstrate a fully functional and modular data analysis pipeline using the Lilac library, without relying on […]

UC San Diego Researchers Introduced Dex1B: A Billion-Scale Dataset for Dexterous Hand Manipulation in Robotics

Challenges in Dexterous Hand Manipulation Data Collection Creating large-scale data for dexterous hand manipulation remains a major challenge in robotics. […]

Tencent Open Sources Hunyuan-A13B: A 13B Active Parameter MoE Model with Dual-Mode Reasoning and 256K Context

Tencent’s Hunyuan team has introduced Hunyuan-A13B, a new open-source large language model built on a sparse Mixture-of-Experts (MoE) architecture. While […]

Alibaba Qwen Team Releases Qwen-VLo: A Unified Multimodal Understanding and Generation Model

The Alibaba Qwen team has introduced Qwen-VLo, a new addition to its Qwen model family, designed to unify multimodal understanding […]

Getting Started with MLFlow for LLM Evaluation

MLflow is a powerful open-source platform for managing the machine learning lifecycle. While it’s traditionally used for tracking model experiments, […]

Unbabel Introduces TOWER+: A Unified Framework for High-Fidelity Translation and Instruction-Following in Multilingual LLMs

Large language models have driven progress in machine translation, leveraging massive training corpora to translate dozens of languages and dialects […]

Polaris-4B and Polaris-7B: Post-Training Reinforcement Learning for Efficient Math and Logic Reasoning

The Rising Need for Scalable Reasoning Models in Machine Intelligence Advanced reasoning models are at the frontier of machine intelligence, […]

GURU: A Reinforcement Learning Framework that Bridges LLM Reasoning Across Six Domains

Limitations of Reinforcement Learning in Narrow Reasoning Domains Reinforcement Learning RL has demonstrated strong potential to enhance the reasoning capabilities […]