Skip to content
Wednesday, May 14, 2025
The TechBriefs
  • Home
  • Technology
  • AI
  • Computers
  • Security
  • Internet
  • Press Releases
    • GlobeNewswire
    • PRNewswire
  • Contact

Category: Large Language Model

  • Home
  • Large Language Model
This AI Paper Investigates Test-Time Scaling of English-Centric RLMs for Enhanced Multilingual Reasoning and Domain Generalization
  • AI
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Editors Pick
  • Language Model
  • Large Language Model
  • Staff
  • Tech News
  • Technology

This AI Paper Investigates Test-Time Scaling of English-Centric RLMs for Enhanced Multilingual Reasoning and Domain Generalization

  • 0

Reasoning language models, or RLMs, are increasingly used to simulate step-by-step problem-solving by generating long, structured reasoning chains. These models […]

Reinforcement Learning, Not Fine-Tuning: Nemotron-Tool-N1 Trains LLMs to Use Tools with Minimal Supervision and Maximum Generalization
  • AI
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Editors Pick
  • Language Model
  • Large Language Model
  • Machine Learning
  • Staff
  • Tech News
  • Technology

Reinforcement Learning, Not Fine-Tuning: Nemotron-Tool-N1 Trains LLMs to Use Tools with Minimal Supervision and Maximum Generalization

  • 0

Equipping LLMs with external tools or functions has become popular, showing great performance across diverse domains. Existing research depends on […]

OpenAI Releases HealthBench: An Open-Source Benchmark for Measuring the Performance and Safety of Large Language Models in Healthcare
  • AI
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Editors Pick
  • Language Model
  • Large Language Model
  • New Releases
  • Open Source
  • Staff
  • Tech News
  • Technology

OpenAI Releases HealthBench: An Open-Source Benchmark for Measuring the Performance and Safety of Large Language Models in Healthcare

  • 0

OpenAI has released HealthBench, an open-source evaluation framework designed to measure the performance and safety of large language models (LLMs) […]

PrimeIntellect Releases INTELLECT-2: A 32B Reasoning Model Trained via Distributed Asynchronous Reinforcement Learning
  • AI
  • AI Paper Summary
  • AI Shorts
  • Artificial Intelligence
  • Editors Pick
  • Language Model
  • Large Language Model
  • Machine Learning
  • New Releases
  • Open Source
  • Staff
  • Tech News
  • Technology

PrimeIntellect Releases INTELLECT-2: A 32B Reasoning Model Trained via Distributed Asynchronous Reinforcement Learning

  • 0

As language models scale in parameter count and reasoning complexity, traditional centralized training pipelines face increasing constraints. High-performance model training […]

This AI Paper Introduces Effective State-Size (ESS): A Metric to Quantify Memory Utilization in Sequence Models for Performance Optimization
  • AI
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Editors Pick
  • Language Model
  • Large Language Model
  • Machine Learning
  • Staff
  • Tech News
  • Technology

This AI Paper Introduces Effective State-Size (ESS): A Metric to Quantify Memory Utilization in Sequence Models for Performance Optimization

  • 0

In machine learning, sequence models are designed to process data with temporal structure, such as language, time series, or signals. […]

Huawei Introduces Pangu Ultra MoE: A 718B-Parameter Sparse Language Model Trained Efficiently on Ascend NPUs Using Simulation-Driven Architecture and System-Level Optimization
  • AI
  • AI infrastructure
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Editors Pick
  • Language Model
  • Large Language Model
  • Staff
  • Tech News
  • Technology

Huawei Introduces Pangu Ultra MoE: A 718B-Parameter Sparse Language Model Trained Efficiently on Ascend NPUs Using Simulation-Driven Architecture and System-Level Optimization

  • 0

Sparse large language models (LLMs) based on the Mixture of Experts (MoE) framework have gained traction for their ability to […]

ZeroSearch from Alibaba Uses Reinforcement Learning and Simulated Documents to Teach LLMs Retrieval Without Real-Time Search
  • AI
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Editors Pick
  • Language Model
  • Large Language Model
  • Machine Learning
  • New Releases
  • Staff
  • Tech News
  • Technology

ZeroSearch from Alibaba Uses Reinforcement Learning and Simulated Documents to Teach LLMs Retrieval Without Real-Time Search

  • 0

Large language models are now central to various applications, from coding to academic tutoring and automated assistants. However, a critical […]

Enterprise AI Without GPU Burn: Salesforce’s xGen-small Optimizes for Context, Cost, and Privacy
  • AI
  • Artificial Intelligence
  • Editors Pick
  • Enterprise AI
  • Language Model
  • Large Language Model
  • New Releases
  • Small Language Model
  • Staff
  • Technology

Enterprise AI Without GPU Burn: Salesforce’s xGen-small Optimizes for Context, Cost, and Privacy

  • 0

Language processing in enterprise environments faces critical challenges as business workflows increasingly depend on synthesising information from diverse sources, including […]

AI That Teaches Itself: Tsinghua University’s ‘Absolute Zero’ Trains LLMs With Zero External Data
  • AI
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Editors Pick
  • Large Language Model
  • Machine Learning
  • Staff
  • Tech News
  • Technology

AI That Teaches Itself: Tsinghua University’s ‘Absolute Zero’ Trains LLMs With Zero External Data

  • 0

LLMs have shown advancements in reasoning capabilities through Reinforcement Learning with Verifiable Rewards (RLVR), which relies on outcome-based feedback rather […]

ServiceNow AI Released Apriel-Nemotron-15b-Thinker: A Compact Yet Powerful Reasoning Model Optimized for Enterprise-Scale Deployment and Efficiency
  • AI
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Editors Pick
  • Language Model
  • Large Language Model
  • New Releases
  • Small Language Model
  • Staff
  • Tech News
  • Technology

ServiceNow AI Released Apriel-Nemotron-15b-Thinker: A Compact Yet Powerful Reasoning Model Optimized for Enterprise-Scale Deployment and Efficiency

  • 0

AI models today are expected to handle complex tasks such as solving mathematical problems, interpreting logical statements, and assisting with […]

Posts pagination

1 2 … 35 Next
  • Privacy Policy
  • Terms of use
Theme: Terminal News By Adore Themes.