The advancements in large language models (LLMs) have significantly enhanced natural language processing (NLP), enabling capabilities like contextual understanding, code […]
Category: Small Language Model
LLaSA-3B: A Llama 3.2B Fine-Tuned Text-to-Speech Model with Ultra-Realistic Audio, Emotional Expressiveness, and Multilingual Support
Text-to-speech (TTS) technology has emerged as a critical tool for bridging the gap between human and machine interaction. The demand […]
Kyutai Labs Releases Helium-1 Preview: A Lightweight Language Model with 2B Parameters, Targeting Edge and Mobile Devices
The growing reliance on AI models for edge and mobile devices has underscored significant challenges. Balancing computational efficiency, model size, […]
What are Small Language Models (SLMs)?
Large language models (LLMs) like GPT-4, PaLM, Bard, and Copilot have made a huge impact in natural language processing (NLP). […]
Microsoft AI Just Released Phi-4: A Small Language Model Available on Hugging Face Under the MIT License
Microsoft has released Phi-4, a compact and efficient small language model, on Hugging Face under the MIT license. This decision […]
Meet HuatuoGPT-o1: A Medical LLM Designed for Advanced Medical Reasoning
Medical artificial intelligence (AI) is full of promise but comes with its own set of challenges. Unlike straightforward mathematical problems, […]
Meet Moxin LLM 7B: A Fully Open-Source Language Model Developed in Accordance with the Model Openness Framework (MOF)
The rapid development of Large Language Models (LLMs) has transformed natural language processing (NLP). Proprietary models like GPT-4 and Claude […]
Patronus AI Open Sources Glider: A 3B State-of-the-Art Small Language Model (SLM) Judge
Large Language Models (LLMs) play a vital role in many AI applications, ranging from text summarization to conversational AI. However, […]
Google DeepMind Introduces ‘SALT’: A Machine Learning Approach to Efficiently Train High-Performing Large Language Models using SLMs
Large Language Models (LLMs) are the backbone of numerous applications, such as conversational agents, automated content creation, and natural language […]
Infinigence AI Releases Megrez-3B-Omni: A 3B On-Device Open-Source Multimodal Large Language Model MLLM
The integration of artificial intelligence into everyday life faces notable hurdles, particularly in multimodal understanding—the ability to process and analyze […]
