Generative Large Multimodal Models (LMMs), such as LLaVA and Qwen-VL, excel in vision-language (VL) tasks like image captioning and visual […]
Category: AI
MiniMax-Text-01 and MiniMax-VL-01 Released: Scalable Models with Lightning Attention, 456B Parameters, 4M Token Contexts, and State-of-the-Art Accuracy
Large Language Models (LLMs) and Vision-Language Models (VLMs) transform natural language understanding, multimodal integration, and complex reasoning tasks. Yet, one […]
MinMo: A Multimodal Large Language Model with Approximately 8B Parameters for Seamless Voice Interaction
Advances in large language and multimodal speech-text models have laid a foundation for seamless, real-time, natural, and human-like voice interactions. […]
This AI Study Saves Researchers from Metadata Chaos with a Comparative Analysis of Extraction Techniques for Scholarly Documents
Scientific metadata in research literature holds immense significance, as highlighted by flourishing research in scientometrics—a discipline dedicated to analyzing scholarly […]
Meta takes us a step closer to Star Trek’s universal translator
The computer science behind translating speech from 100 source languages. In 2023, AI researchers at Meta interviewed 34 native Spanish […]
Researchers use AI to design proteins that block snake venom toxins
It’s a good example of how computer developments can be used for practical problems. It has been […]
Microsoft increases its focus on artificial intelligence by creating a new CoreAI team
Microsoft continues to bet big on AI and the company has created a new artificial intelligence engineering division called CoreAI. […]
Redefining Single-Channel Speech Enhancement: The xLSTM-SENet Approach
Speech processing systems often struggle to deliver clear audio in noisy environments. This challenge impacts applications such as hearing aids, […]
Beyond Passwords: A Multimodal Approach to Biometric Authentication Using ECG and Iris Data
Biometric authentication has emerged as a promising solution to enhance security by offering a more robust defense against cyber threats. […]
Efficient Blockchain State Management with Quick Merkle Database (QMDB)
Blockchain systems face significant challenges in efficiently managing and updating state storage due to high write amplification (WA) and extensive […]
