Skip to content
Monday, February 2, 2026
The TechBriefs
  • Home
  • Technology
  • AI
  • Computers
  • Security
  • Internet
  • Press Releases
    • GlobeNewswire
    • PRNewswire
  • Contact

Category: Machine Learning

  • Home
  • Machine Learning
  • Page 27
Can We Improve Llama 3’s Reasoning Through Post-Training Alone? ASTRO Shows +16% to +20% Benchmark Gains
  • AI
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Editors Pick
  • Language Model
  • Large Language Model
  • Machine Learning
  • Staff
  • Tech News
  • Technology

Can We Improve Llama 3’s Reasoning Through Post-Training Alone? ASTRO Shows +16% to +20% Benchmark Gains

  • 0

Improving the reasoning capabilities of large language models (LLMs) without architectural changes is a core challenge in advancing AI alignment […]

Crome: Google DeepMind’s Causal Framework for Robust Reward Modeling in LLM Alignment
  • AI
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Editors Pick
  • Machine Learning
  • New Releases
  • Staff
  • Tech News
  • Technology

Crome: Google DeepMind’s Causal Framework for Robust Reward Modeling in LLM Alignment

  • 0

Reward models are fundamental components for aligning LLMs with human feedback, yet they face the challenge of reward hacking issues. […]

Thought Anchors: A Machine Learning Framework for Identifying and Measuring Key Reasoning Steps in Large Language Models with Precision
  • AI
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Editors Pick
  • Language Model
  • Large Language Model
  • Machine Learning
  • Staff
  • Tech News
  • Technology

Thought Anchors: A Machine Learning Framework for Identifying and Measuring Key Reasoning Steps in Large Language Models with Precision

  • 0

Understanding the Limits of Current Interpretability Tools in LLMs AI models, such as DeepSeek and GPT variants, rely on billions […]

DeepSeek R1T2 Chimera: 200% Faster Than R1-0528 With Improved Reasoning and Compact Output
  • AI
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Editors Pick
  • Language Model
  • Large Language Model
  • Machine Learning
  • New Releases
  • Open Source
  • Staff
  • Tech News
  • Technology

DeepSeek R1T2 Chimera: 200% Faster Than R1-0528 With Improved Reasoning and Compact Output

  • 0

TNG Technology Consulting has unveiled DeepSeek-TNG R1T2 Chimera, a new Assembly-of-Experts (AoE) model that blends intelligence and speed through an […]

Shanghai Jiao Tong Researchers Propose OctoThinker for Reinforcement Learning-Scalable LLM Development
  • AI
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Editors Pick
  • Language Model
  • Large Language Model
  • Machine Learning
  • New Releases
  • Staff
  • Tech News
  • Technology

Shanghai Jiao Tong Researchers Propose OctoThinker for Reinforcement Learning-Scalable LLM Development

  • 0

Introduction: Reinforcement Learning Progress through Chain-of-Thought Prompting LLMs have shown excellent progress in complex reasoning tasks through CoT prompting combined […]

ReasonFlux-PRM: A Trajectory-Aware Reward Model Enhancing Chain-of-Thought Reasoning in LLMs
  • AI
  • AI Shorts
  • Artificial Intelligence
  • Editors Pick
  • Language Model
  • Large Language Model
  • Machine Learning
  • New Releases
  • Staff
  • Tech News
  • Technology

ReasonFlux-PRM: A Trajectory-Aware Reward Model Enhancing Chain-of-Thought Reasoning in LLMs

  • 0

Understanding the Role of Chain-of-Thought in LLMs Large language models are increasingly being used to solve complex tasks such as […]

Baidu Open Sources ERNIE 4.5: LLM Series Scaling from 0.3B to 424B Parameters
  • AI
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Editors Pick
  • Language Model
  • Large Language Model
  • Machine Learning
  • New Releases
  • Open Source
  • Staff
  • Tech News
  • Technology

Baidu Open Sources ERNIE 4.5: LLM Series Scaling from 0.3B to 424B Parameters

  • 0

Baidu has officially open-sourced its latest ERNIE 4.5 series, a powerful family of foundation models designed for enhanced language understanding, […]

OMEGA: A Structured Math Benchmark to Probe the Reasoning Limits of LLMs
  • AI
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Editors Pick
  • Language Model
  • Large Language Model
  • Machine Learning
  • Staff
  • Tech News
  • Technology

OMEGA: A Structured Math Benchmark to Probe the Reasoning Limits of LLMs

  • 0

Introduction to Generalization in Mathematical Reasoning Large-scale language models with long CoT reasoning, such as DeepSeek-R1, have shown good results […]

TabArena: Benchmarking Tabular Machine Learning with Reproducibility and Ensembling at Scale
  • AI
  • Artificial Intelligence
  • Editors Pick
  • Machine Learning
  • Staff
  • Tech News
  • Technology

TabArena: Benchmarking Tabular Machine Learning with Reproducibility and Ensembling at Scale

  • 0

Understanding the Importance of Benchmarking in Tabular ML Machine learning on tabular data focuses on building models that learn patterns […]

MDM-Prime: A generalized Masked Diffusion Models (MDMs) Framework that Enables Partially Unmasked Tokens during Sampling
  • AI
  • Artificial Intelligence
  • Editors Pick
  • Machine Learning
  • Staff
  • Technology

MDM-Prime: A generalized Masked Diffusion Models (MDMs) Framework that Enables Partially Unmasked Tokens during Sampling

  • 0

Introduction to MDMs and Their Inefficiencies Masked Diffusion Models (MDMs) are powerful tools for generating discrete data, such as text […]

Posts pagination

Previous 1 … 26 27 28 … 95 Next
  • Privacy Policy
  • Terms of use
Theme: Terminal News By Adore Themes.