New Releases – Page 59 – The TechBriefs

DeepSeek-AI Releases DeepSeek-R1-Zero and DeepSeek-R1: First-Generation Reasoning Models that Incentivize Reasoning Capability in LLMs via Reinforcement Learning

Large Language Models (LLMs) have made significant progress in natural language processing, excelling in tasks like understanding, generation, and reasoning. […]

Salesforce AI Research Introduced CodeXEmbed (SFR-Embedding-Code): A Code Retrieval Model Family Achieving #1 Rank on CoIR Benchmark and Supporting 12 Programming Languages

Code retrieval has become essential for developers in modern software development, enabling efficient access to relevant code snippets and documentation. […]

Kyutai Labs Releases Helium-1 Preview: A Lightweight Language Model with 2B Parameters, Targeting Edge and Mobile Devices

The growing reliance on AI models for edge and mobile devices has underscored significant challenges. Balancing computational efficiency, model size, […]

MiniMax-Text-01 and MiniMax-VL-01 Released: Scalable Models with Lightning Attention, 456B Parameters, 4M Token Contexts, and State-of-the-Art Accuracy

Large Language Models (LLMs) and Vision-Language Models (VLMs) transform natural language understanding, multimodal integration, and complex reasoning tasks. Yet, one […]

Alibaba Qwen Team Just Released ‘Lessons of Developing Process Reward Models in Mathematical Reasoning’ Along with State-of-the-Art 7B and 72B PRMs

Mathematical reasoning has long been a significant challenge for Large Language Models (LLMs). Errors in intermediate reasoning steps can undermine […]

OpenBMB Just Released MiniCPM-o 2.6: A New 8B-Parameter, Any-to-Any Multimodal Model That Can Understand Vision, Speech, and Language and Run on Edge Devices

Artificial intelligence has made significant strides in recent years, but challenges remain in balancing computational efficiency and versatility. State-of-the-art multimodal […]

Mistral AI Unveils Codestral 25.01: A New SOTA Lightweight and Fast Coding AI Model

In today’s fast-paced world of software development, artificial intelligence plays a crucial role in simplifying workflows, speeding up coding tasks, […]

UC Berkeley Researchers Released Sky-T1-32B-Preview: An Open-Source Reasoning LLM Trained for Under $450 That Surpasses OpenAI-o1 on Benchmarks like Math500, AIME, and Livebench

The rapid advancements in artificial intelligence have opened new possibilities, but the associated costs often limit who can benefit from […]

Google AI Just Released TimesFM-2.0 (JAX and PyTorch) on Hugging Face with a Significant Boost in Accuracy and Maximum Context Length

Time-series forecasting plays a crucial role in various domains, including finance, healthcare, and climate science. However, achieving accurate predictions remains […]

Good Fire AI Open-Sources Sparse Autoencoders (SAEs) for Llama 3.1 8B and Llama 3.3 70B

Large language models (LLMs) like OpenAI’s GPT and Meta’s LLaMA have significantly advanced natural language understanding and text generation. However, […]