Artificial intelligence has come a long way, transforming the way we work, live, and interact. Yet, challenges remain. Many AI […]
Category: Editors Pick
Graph Generative Pre-trained Transformer (G2PT): An Auto-Regressive Model Designed to Learn Graph Structures through Next-Token Prediction
Graph generation is an important task across various fields, including molecular design and social network analysis, due to its ability […]
From Latent Spaces to State-of-the-Art: The Journey of LightningDiT
Latent diffusion models are advanced techniques for generating high-resolution images by compressing visual data into a latent space using visual […]
ScreenSpot-Pro: The First Benchmark Driving Multi-Modal LLMs into High-Resolution Professional GUI-Agent and Computer-Use Environments
GUI agents face three critical challenges in professional environments: (1) the greater complexity of professional applications compared to general-use software, […]
Enhancing Protein Docking with AlphaRED: A Balanced Approach to Protein Complex Prediction
Protein docking, the process of predicting the structure of protein-protein complexes, remains a complex challenge in computational biology. While advances […]
Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective
Achieving expert-level performance in complex reasoning tasks is a significant challenge in artificial intelligence (AI). Models like OpenAI’s o1 demonstrate […]
Researchers from NVIDIA, CMU and the University of Washington Released ‘FlashInfer’: A Kernel Library that Provides State-of-the-Art Kernel Implementations for LLM Inference and Serving
Large Language Models (LLMs) have become an integral part of modern AI applications, powering tools like chatbots and code generators. […]
PRIME: An Open-Source Solution for Online Reinforcement Learning with Process Rewards to Advance Reasoning Abilities of Language Models Beyond Imitation or Distillation
Large Language Models (LLMs) face significant scalability limitations in improving their reasoning capabilities through data-driven imitation, as better performance demands […]
FutureHouse Researchers Propose Aviary: An Extensible Open-Source Gymnasium for Language Agents
Artificial intelligence (AI) has made significant strides in developing language models capable of solving complex problems. However, applying these models […]
This AI Paper Introduces SWE-Gym: A Comprehensive Training Environment for Real-World Software Engineering Agents
Software engineering agents have become essential for managing complex coding tasks, particularly in large repositories. These agents employ advanced language […]