AI – Page 168 – The TechBriefs

Microsoft AI Research Introduces OLA-VLM: A Vision-Centric Approach to Optimizing Multimodal Large Language Models

Multimodal large language models (MLLMs) are advancing rapidly, enabling machines to interpret and reason about textual and visual data simultaneously. […]

Meta FAIR Releases Meta Motivo: A New Behavioral Foundation Model for Controlling Virtual Physics-based Humanoid Agents for a Wide Range of Complex Whole-Body Tasks

Foundation models, pre-trained on extensive unlabeled data, have emerged as a cutting-edge approach for developing versatile AI systems capable of […]

The dark side of AI: How automation is fueling identity theft

Automations empowered by artificial intelligence are reshaping the business landscape. They give companies the capability to connect with, guide, and […]

Nexa AI Releases OmniAudio-2.6B: A Fast Audio Language Model for Edge Deployment

Audio language models (ALMs) play a crucial role in various applications, from real-time transcription and translation to voice-controlled systems and […]

DeepSeek-AI Open Sourced DeepSeek-VL2 Series: Three Models of 3B, 16B, and 27B Parameters with Mixture-of-Experts (MoE) Architecture Redefining Vision-Language AI

Integrating vision and language capabilities in AI has led to breakthroughs in Vision-Language Models (VLMs). These models aim to process […]

BiMediX2: A Groundbreaking Bilingual Bio-Medical Large Multimodal Model integrating Text and Image Analysis for Advanced Medical Diagnostics

Recent advancements in healthcare AI, including medical LLMs and LMMs, show great potential for improving access to medical advice. However, […]

Category: AI

Microsoft AI Research Introduces OLA-VLM: A Vision-Centric Approach to Optimizing Multimodal Large Language Models

Meta FAIR Releases Meta Motivo: A New Behavioral Foundation Model for Controlling Virtual Physics-based Humanoid Agents for a Wide Range of Complex Whole-Body Tasks

The dark side of AI: How automation is fueling identity theft

Nexa AI Releases OmniAudio-2.6B: A Fast Audio Language Model for Edge Deployment

DeepSeek-AI Open Sourced DeepSeek-VL2 Series: Three Models of 3B, 16B, and 27B Parameters with Mixture-of-Experts (MoE) Architecture Redefining Vision-Language AI

BiMediX2: A Groundbreaking Bilingual Bio-Medical Large Multimodal Model integrating Text and Image Analysis for Advanced Medical Diagnostics

Meta AI Proposes Large Concept Models (LCMs): A Semantic Leap Beyond Token-based Language Modeling

From Theory to Practice: Compute-Optimal Inference Strategies for Language Model

This AI Paper Introduces SRDF: A Self-Refining Data Flywheel for High-Quality Vision-and-Language Navigation Datasets

Beyond the Mask: A Comprehensive Study of Discrete Diffusion Models