Large Language Model

Evola: An 80B-Parameter Multimodal Protein-Language Model for Decoding Protein Functions via Natural Language Dialogue

Proteins, essential molecular machines evolved over billions of years, perform critical life-sustaining functions encoded in their sequences and revealed through […]

This AI Paper Explores Quantization Techniques and Their Impact on Mathematical Reasoning in Large Language Models

Mathematical reasoning is a backbone of artificial intelligence and is central to arithmetic, geometric, and competition-level problems. Recently, […]

AMD Researchers Introduce Agent Laboratory: An Autonomous LLM-based Framework Capable of Completing the Entire Research Process

Scientific research is often constrained by resource limitations and time-intensive processes. Tasks such as hypothesis testing, data analysis, and report […]

From Contradictions to Coherence: Logical Alignment in AI Models

Large Language Models (LLMs) aim to align with human preferences, ensuring reliable and trustworthy decision-making. However, these models acquire biases, […]

Researchers from Princeton University Introduce Metadata Conditioning then Cooldown (MeCo) to Simplify and Optimize Language Model Pre-training

The pre-training of language models (LMs) plays a crucial role in enabling their ability to understand and generate text. However, […]

DeepMind Research Introduces The FACTS Grounding Leaderboard: Benchmarking LLMs’ Ability to Ground Responses to Long-Form Input

Large language models (LLMs) have revolutionized natural language processing, enabling applications that range from automated writing to complex decision-making aids. […]

Researchers from USC and Prime Intellect Released METAGENE-1: A 7B Parameter Autoregressive Transformer Model Trained on Over 1.5T DNA and RNA Base Pairs

In a time when global health faces persistent threats from emerging pandemics, the need for advanced biosurveillance and pathogen detection […]

Enhancing Clinical Diagnostics with LLMs: Challenges, Frameworks, and Recommendations for Real-World Applications

Using LLMs in clinical diagnostics offers a promising way to improve doctor-patient interactions. Patient history-taking is central to medical diagnosis. […]

Dolphin 3.0 Released (Llama 3.1 + 3.2 + Qwen 2.5): A Local-First, Steerable AI Model that Puts You in Control of Your AI Stack and Alignment

Artificial intelligence has come a long way, transforming the way we work, live, and interact. Yet, challenges remain. Many AI […]

Researchers from NVIDIA, CMU and the University of Washington Released ‘FlashInfer’: A Kernel Library that Provides State-of-the-Art Kernel Implementations for LLM Inference and Serving

Large Language Models (LLMs) have become an integral part of modern AI applications, powering tools like chatbots and code generators. […]