Reasoning language models, or RLMs, are increasingly used to simulate step-by-step problem-solving by generating long, structured reasoning chains. These models […]
Reinforcement Learning, Not Fine-Tuning: Nemotron-Tool-N1 Trains LLMs to Use Tools with Minimal Supervision and Maximum Generalization
Equipping LLMs with external tools or functions has become popular, and tool-augmented models show strong performance across diverse domains. Existing research depends on […]
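To make the idea of "equipping an LLM with tools" concrete, here is a minimal, hedged sketch of a generic tool-calling loop: the model emits a structured call, the runtime executes it, and the result is fed back. The tool name, call format, and `fake_llm` stand-in are illustrative assumptions, not Nemotron-Tool-N1's actual training or inference setup.

```python
# Hypothetical tool-calling loop; names and format are illustrative only.
import json

def get_weather(city: str) -> str:
    """Toy external tool the model is allowed to call."""
    return f"Sunny in {city}"

TOOLS = {"get_weather": get_weather}

def fake_llm(prompt: str) -> str:
    """Stand-in for a model that emits a structured tool call as JSON."""
    return json.dumps({"tool": "get_weather", "arguments": {"city": "Paris"}})

def run_turn(prompt: str) -> str:
    call = json.loads(fake_llm(prompt))                 # model decides which tool to invoke
    result = TOOLS[call["tool"]](**call["arguments"])   # runtime executes the call
    return result                                       # result would be fed back into the next turn

print(run_turn("What is the weather in Paris?"))
```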
OpenAI Releases HealthBench: An Open-Source Benchmark for Measuring the Performance and Safety of Large Language Models in Healthcare
OpenAI has released HealthBench, an open-source evaluation framework designed to measure the performance and safety of large language models (LLMs) […]
PrimeIntellect Releases INTELLECT-2: A 32B Reasoning Model Trained via Distributed Asynchronous Reinforcement Learning
As language models scale in parameter count and reasoning complexity, traditional centralized training pipelines face increasing constraints. High-performance model training […]
This AI Paper Introduces Effective State-Size (ESS): A Metric to Quantify Memory Utilization in Sequence Models for Performance Optimization
In machine learning, sequence models are designed to process data with temporal structure, such as language, time series, or signals. […]
Huawei Introduces Pangu Ultra MoE: A 718B-Parameter Sparse Language Model Trained Efficiently on Ascend NPUs Using Simulation-Driven Architecture and System-Level Optimization
Sparse large language models (LLMs) based on the Mixture of Experts (MoE) framework have gained traction for their ability to […]
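For readers unfamiliar with why MoE models are called "sparse", the sketch below shows the core mechanism, top-k expert routing, where each token activates only a few experts per layer. It is a toy illustration with made-up sizes, not Pangu Ultra MoE's actual architecture or Ascend-specific implementation.

```python
# Toy top-k Mixture-of-Experts layer; sizes and structure are illustrative assumptions.
import torch
import torch.nn as nn

class ToyMoELayer(nn.Module):
    def __init__(self, d_model=64, n_experts=8, top_k=2):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)   # scores each expert per token
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                          nn.Linear(4 * d_model, d_model))
            for _ in range(n_experts)
        )
        self.top_k = top_k

    def forward(self, x):                              # x: (tokens, d_model)
        gates = self.router(x).softmax(dim=-1)
        weights, idx = gates.topk(self.top_k, dim=-1)  # keep only the top-k experts per token
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e in range(len(self.experts)):
                mask = idx[:, slot] == e               # tokens routed to expert e in this slot
                if mask.any():
                    out[mask] += weights[mask, slot:slot + 1] * self.experts[e](x[mask])
        return out

layer = ToyMoELayer()
print(layer(torch.randn(10, 64)).shape)  # each token used only 2 of the 8 experts
```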
ZeroSearch from Alibaba Uses Reinforcement Learning and Simulated Documents to Teach LLMs Retrieval Without Real-Time Search
Large language models are now central to various applications, from coding to academic tutoring and automated assistants. However, a critical […]
Enterprise AI Without GPU Burn: Salesforce’s xGen-small Optimizes for Context, Cost, and Privacy
Language processing in enterprise environments faces critical challenges as business workflows increasingly depend on synthesising information from diverse sources, including […]
AI That Teaches Itself: Tsinghua University’s ‘Absolute Zero’ Trains LLMs With Zero External Data
LLMs have shown advancements in reasoning capabilities through Reinforcement Learning with Verifiable Rewards (RLVR), which relies on outcome-based feedback rather […]
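Since RLVR rewards outcomes rather than reasoning traces, a verifiable reward can be as simple as checking the final answer against a reference. The sketch below assumes a \boxed{...} answer convention for illustration; it is not the Absolute Zero paper's specific reward design.

```python
# Hedged sketch of an outcome-based (verifiable) reward; the \boxed{} extraction
# convention is an illustrative assumption, not the paper's exact recipe.
import re

def verifiable_reward(model_output: str, ground_truth: str) -> float:
    """Return 1.0 if the extracted final answer matches the reference, else 0.0."""
    match = re.search(r"\\boxed\{([^}]*)\}", model_output)
    answer = match.group(1).strip() if match else ""
    return 1.0 if answer == ground_truth.strip() else 0.0

print(verifiable_reward(r"Step 1: 2 + 2 = 4, so the result is \boxed{4}", "4"))  # 1.0
print(verifiable_reward(r"... therefore \boxed{5}", "4"))                        # 0.0
```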
ServiceNow AI Released Apriel-Nemotron-15b-Thinker: A Compact Yet Powerful Reasoning Model Optimized for Enterprise-Scale Deployment and Efficiency
AI models today are expected to handle complex tasks such as solving mathematical problems, interpreting logical statements, and assisting with […]