Microsoft has released VibeVoice-ASR as part of the VibeVoice family of open source frontier voice AI models. VibeVoice-ASR is described […]
Category: Artificial Intelligence
Google begins offering free SAT practice tests powered by Gemini
It’s no secret that students worldwide use AI chatbots to do their homework and avoid learning things. On the flip […]
Google adds your Gmail and Photos to AI Mode to enable “Personal Intelligence”
Google believes AI is the future of search, and it’s not shy about saying it. After adding account-level personalization to […]
FlashLabs Researchers Release Chroma 1.0: A 4B Real Time Speech Dialogue Model With Personalized Voice Cloning
Chroma 1.0 is a real time speech to speech dialogue model that takes audio as input and returns audio as […]
Inworld AI Releases TTS-1.5 For Realtime, Production Grade Voice Agents
Inworld AI has introduced Inworld TTS-1.5, an upgrade to its TTS-1 family that targets realtime voice agents with strict constraints […]
Has Gemini surpassed ChatGPT? We put the AI models to the test.
Which is more “artificial”? Which is more “intelligent”? Did Apple make the right choice in partnering with Google for Siri’s […]
Salesforce AI Introduces FOFPred: A Language-Driven Future Optical Flow Prediction Framework that Enables Improved Robot Control and Video Generation
Salesforce AI research team present FOFPred, a language driven future optical flow prediction framework that connects large vision language models […]
How AutoGluon Enables Modern AutoML Pipelines for Production-Grade Tabular Models with Ensembling and Distillation
In this tutorial, we build a production-grade tabular machine learning pipeline using AutoGluon, taking a real-world mixed-type dataset from raw […]
Liquid AI Releases LFM2.5-1.2B-Thinking: a 1.2B Parameter Reasoning Model That Fits Under 1 GB On-Device
Liquid AI has released LFM2.5-1.2B-Thinking, a 1.2 billion parameter reasoning model that runs fully on device and fits in about […]
Zhipu AI Releases GLM-4.7-Flash: A 30B-A3B MoE Model for Efficient Local Coding and Agents
GLM-4.7-Flash is a new member of the GLM 4.7 family and targets developers who want strong coding and reasoning performance […]
