Introduction to MDMs and Their Inefficiencies: Masked Diffusion Models (MDMs) are powerful tools for generating discrete data, such as text […]
University of Michigan Researchers Propose G-ACT: A Scalable Machine Learning Framework to Steer Programming Language Bias in LLMs
LLMs and the Need for Scientific Code Control: LLMs have rapidly evolved into complex natural language processors, enabling the development […]
A Coding Guide to Build a Functional Data Analysis Workflow Using Lilac for Transforming, Filtering, and Exporting Structured Insights
In this tutorial, we demonstrate a fully functional and modular data analysis pipeline using the Lilac library, without relying on […]
UC San Diego Researchers Introduced Dex1B: A Billion-Scale Dataset for Dexterous Hand Manipulation in Robotics
Challenges in Dexterous Hand Manipulation Data Collection: Creating large-scale data for dexterous hand manipulation remains a major challenge in robotics. […]
Tencent Open Sources Hunyuan-A13B: A 13B Active Parameter MoE Model with Dual-Mode Reasoning and 256K Context
Tencent’s Hunyuan team has introduced Hunyuan-A13B, a new open-source large language model built on a sparse Mixture-of-Experts (MoE) architecture. While […]
Alibaba Qwen Team Releases Qwen-VLo: A Unified Multimodal Understanding and Generation Model
The Alibaba Qwen team has introduced Qwen-VLo, a new addition to its Qwen model family, designed to unify multimodal understanding […]
Getting Started with MLflow for LLM Evaluation
MLflow is a powerful open-source platform for managing the machine learning lifecycle. While it’s traditionally used for tracking model experiments, […]
Unbabel Introduces TOWER+: A Unified Framework for High-Fidelity Translation and Instruction-Following in Multilingual LLMs
Large language models have driven progress in machine translation, leveraging massive training corpora to translate dozens of languages and dialects […]
Polaris-4B and Polaris-7B: Post-Training Reinforcement Learning for Efficient Math and Logic Reasoning
The Rising Need for Scalable Reasoning Models in Machine Intelligence: Advanced reasoning models are at the frontier of machine intelligence, […]
GURU: A Reinforcement Learning Framework that Bridges LLM Reasoning Across Six Domains
Limitations of Reinforcement Learning in Narrow Reasoning Domains: Reinforcement Learning (RL) has demonstrated strong potential to enhance the reasoning capabilities […]