The field of Natural Language Processing (NLP) has made significant strides with the development of large-scale language models (LLMs). However, […]
Category: Large Language Model
A Comprehensive Analytical Framework for Mathematical Reasoning in Multimodal Large Language Models
Mathematical reasoning has emerged as a critical frontier in artificial intelligence, particularly in developing Large Language Models (LLMs) capable of […]
Tsinghua University Researchers Just Open-Sourced CogAgent-9B-20241220: The Latest Version of CogAgent
Graphical User Interfaces (GUIs) are central to how users engage with software. However, building intelligent agents capable of effectively navigating […]
Qwen Team Releases QvQ: An Open-Weight Model for Multimodal Reasoning
Multimodal reasoning—the ability to process and integrate information from diverse data sources such as text, images, and video—remains a demanding […]
Salesforce AI Research Introduces AGUVIS: A Unified Pure Vision Framework Transforming Autonomous GUI Interaction Across Platforms
Graphical User Interfaces (GUIs) play a fundamental role in human-computer interaction, providing the medium through which users accomplish tasks across […]
Why Do Task Vectors Exist in Pretrained LLMs? This AI Research from MIT and Improbable AI Uncovers How Transformers Form Internal Abstractions and the Mechanisms Behind in-Context Learning (ICL)
Large Language Models (LLMs) have demonstrated remarkable similarities to human cognitive processes’ ability to form abstractions and adapt to new […]
ConfliBERT: A Domain-Specific Language Model for Political Violence Event Detection and Classification
The transformation of unstructured news texts into structured event data represents a critical challenge in social sciences, particularly in international […]
Microsoft Researchers Release AIOpsLab: An Open-Source Comprehensive AI Framework for AIOps Agents
The increasing complexity of cloud computing has brought both opportunities and challenges. Enterprises now depend heavily on intricate cloud-based infrastructures […]
TOMG-Bench: Text-based Open Molecule Generation Benchmark
Molecule discovery is important in various scientific research fields, particularly pharmaceuticals and materials science. While the emergence of Graph Neural […]
This AI Paper from aiXplain Introduces Bel Esprit: A Multi-Agent Framework for Building Accurate and Adaptive AI Model Pipelines
Artificial intelligence has progressed from handling atomic tasks to addressing intricate, real-world problems requiring the integration of multiple specialized models. […]