Language Model – Page 75

TOMG-Bench: Text-based Open Molecule Generation Benchmark

Molecule discovery is important in various scientific research fields, particularly pharmaceuticals and materials science. While the emergence of Graph Neural […]

Researchers at Stanford Use AI and Spatial Transcriptomics to Discover What Makes Some Cells Age Faster/Slower in the Brain

Aging is linked to a significant rise in neurodegenerative diseases like Alzheimer’s and cognitive decline. While brain aging involves complex […]

LightOn and Answer.ai Releases ModernBERT: A New Model Series that is a Pareto Improvement over BERT with both Speed and Accuracy

Since the release of BERT in 2018, encoder-only transformer models have been widely used in natural language processing (NLP) applications […]

Slim-Llama: An Energy-Efficient LLM ASIC Processor Supporting 3-Billion Parameters at Just 4.69mW

Large Language Models (LLMs) have become a cornerstone of artificial intelligence, driving advancements in natural language processing and decision-making tasks. […]

Google DeepMind Introduces FACTS Grounding: A New AI Benchmark for Evaluating Factuality in Long-Form LLM Response

Despite the transformative potential of large language models (LLMs), these models face significant challenges in generating contextually accurate responses faithful […]

Meet Moxin LLM 7B: A Fully Open-Source Language Model Developed in Accordance with the Model Openness Framework (MOF)

The rapid development of Large Language Models (LLMs) has transformed natural language processing (NLP). Proprietary models like GPT-4 and Claude […]

Scaling Language Model Evaluation: From Thousands to Millions of Tokens with BABILong

Large Language Models (LLMs) and neural architectures have significantly advanced capabilities, particularly in processing longer contexts. These improvements have profound […]

Patronus AI Open Sources Glider: A 3B State-of-the-Art Small Language Model (SLM) Judge

Large Language Models (LLMs) play a vital role in many AI applications, ranging from text summarization to conversational AI. However, […]

Meta AI Introduces ExploreToM: A Program-Guided Adversarial Data Generation Approach for Theory of Mind Reasoning

Theory of Mind (ToM) is a foundational element of human social intelligence, enabling individuals to interpret and predict the mental […]

Hugging Face Releases Picotron: A Tiny Framework that Solves LLM Training 4D Parallelization

The rise of large language models (LLMs) has transformed natural language processing, but training these models comes with significant challenges. […]