Medical artificial intelligence (AI) is full of promise but comes with its own set of challenges. Unlike straightforward mathematical problems, […]
Category: Large Language Model
B-STAR: A Self-Taught AI Reasoning Framework for LLMs
A direct correlation exists between an LLM’s training corpus quality and its capabilities. Consequently, researchers have invested a great deal […]
This AI Paper Proposes TALE: An AI Framework that Reduces Token Redundancy in Chain-of-Thought (CoT) Reasoning by Incorporating Token Budget Awareness
Large Language Models (LLMs) have shown significant potential in reasoning tasks, using methods like Chain-of-Thought (CoT) to break down complex […]
YuLan-Mini: A 2.42B Parameter Open Data-efficient Language Model with Long-Context Capabilities and Advanced Training Techniques
Large language models (LLMs) built using transformer architectures heavily depend on pre-training with large-scale data to predict sequential tokens. This […]
Quasar-1: A Rigorous Mathematical Framework for Temperature-Guided Reasoning in Language Models
Large language models (LLMs) encounter significant difficulties in performing efficient and logically consistent reasoning. Existing methods, such as CoT prompting, […]
Meet SemiKong: The World’s First Open-Source Semiconductor-Focused LLM
The semiconductor industry enables advancements in consumer electronics, automotive systems, and cutting-edge computing technologies. The production of semiconductors involves sophisticated […]
Google DeepMind Introduces Differentiable Cache Augmentation: A Coprocessor-Enhanced Approach to Boost LLM Reasoning and Efficiency
Large language models (LLMs) are integral to solving complex problems across language processing, mathematics, and reasoning domains. Enhancements in computational […]
DeepSeek-AI Just Released DeepSeek-V3: A Strong Mixture-of-Experts (MoE) Language Model with 671B Total Parameters with 37B Activated for Each Token
The field of Natural Language Processing (NLP) has made significant strides with the development of large-scale language models (LLMs). However, […]
A Comprehensive Analytical Framework for Mathematical Reasoning in Multimodal Large Language Models
Mathematical reasoning has emerged as a critical frontier in artificial intelligence, particularly in developing Large Language Models (LLMs) capable of […]
Tsinghua University Researchers Just Open-Sourced CogAgent-9B-20241220: The Latest Version of CogAgent
Graphical User Interfaces (GUIs) are central to how users engage with software. However, building intelligent agents capable of effectively navigating […]
