Since the release of BERT in 2018, encoder-only transformer models have been widely used in natural language processing (NLP) applications […]
Category: Large Language Model
Slim-Llama: An Energy-Efficient LLM ASIC Processor Supporting 3-Billion Parameters at Just 4.69 mW
Large Language Models (LLMs) have become a cornerstone of artificial intelligence, driving advancements in natural language processing and decision-making tasks. […]
Google DeepMind Introduces FACTS Grounding: A New AI Benchmark for Evaluating Factuality in Long-Form LLM Response
Despite the transformative potential of large language models (LLMs), these models face significant challenges in generating contextually accurate responses faithful […]
Meta AI Introduces ExploreToM: A Program-Guided Adversarial Data Generation Approach for Theory of Mind Reasoning
Theory of Mind (ToM) is a foundational element of human social intelligence, enabling individuals to interpret and predict the mental […]
Meet Genesis: An Open-Source Physics AI Engine Redefining Robotics with Ultra-Fast Simulations and Generative 4D Worlds
The robotics and embodied AI field has long struggled with accessibility and efficiency issues. Creating realistic physical simulations requires extensive […]
Google DeepMind Introduces ‘SALT’: A Machine Learning Approach to Efficiently Train High-Performing Large Language Models using SLMs
Large Language Models (LLMs) are the backbone of numerous applications, such as conversational agents, automated content creation, and natural language […]
Microsoft AI Research Open-Sources PromptWizard: A Feedback-Driven AI Framework for Efficient and Scalable LLM Prompt Optimization
One of the crucial factors in achieving high-quality outputs from large language models (LLMs) lies in the design of prompts—carefully crafted input […]
Microsoft AI Introduces SCBench: A Comprehensive Benchmark for Evaluating Long-Context Methods in Large Language Models
Long-context LLMs enable advanced applications such as repository-level code analysis, long-document question-answering, and many-shot in-context learning by supporting extended context […]
Infinigence AI Releases Megrez-3B-Omni: A 3B On-Device Open-Source Multimodal Large Language Model (MLLM)
The integration of artificial intelligence into everyday life faces notable hurdles, particularly in multimodal understanding—the ability to process and analyze […]
Technology Innovation Institute TII-UAE Just Released Falcon 3: A Family of Open-Source AI Models with 30 New Model Checkpoints from 1B to 10B
The advancements in large language models (LLMs) have created opportunities across industries, from automating content creation to improving scientific research. […]