Zlab Princeton researchers have released LLM-Pruning Collection, a JAX based repository that consolidates major pruning algorithms for large language models […]
Category: Artificial Intelligence
Tencent Researchers Release Tencent HY-MT1.5: A New Translation Models Featuring 1.8B and 7B Models Designed for Seamless on-Device and Cloud Deployment
Tencent Hunyuan researchers have released HY-MT1.5, a multilingual machine translation family that targets both mobile devices and cloud systems with […]
DeepSeek Researchers Apply a 1967 Matrix Normalization Algorithm to Fix Instability in Hyper Connections
DeepSeek researchers are trying to solve a precise issue in large language model training. Residual connections made very deep networks […]
Recursive Language Models (RLMs): From MIT’s Blueprint to Prime Intellect’s RLMEnv for Long Horizon LLM Agents
Recursive Language Models aim to break the usual trade off between context length, accuracy and cost in large language models. […]
Generative AI: closing the developer gap and redefining the software moat [Q&A]
Generative AI (GenAI) is reshaping software development, closing the long‑standing gap between surging demand for new applications and the limited […]
Tencent Released Tencent HY-Motion 1.0: A Billion-Parameter Text-to-Motion Model Built on the Diffusion Transformer (DiT) Architecture and Flow Matching
Tencent Hunyuan’s 3D Digital Human team has released HY-Motion 1.0, an open weight text-to-3D human motion generation family that scales […]
Meet LLMRouter: An Intelligent Routing System designed to Optimize LLM Inference by Dynamically Selecting the most Suitable Model for Each Query
LLMRouter is an open source routing library from the U Lab at the University of Illinois Urbana Champaign that treats […]
China drafts world’s strictest rules to end AI-encouraged suicide, violence
China drafted landmark rules to stop AI chatbots from emotionally manipulating users, including what could become the strictest policy worldwide […]
From Gemma 3 270M to FunctionGemma, How Google AI Built a Compact Function Calling Specialist for Edge Workloads
Google has released FunctionGemma, a specialized version of the Gemma 3 270M model that is trained specifically for function calling […]
A Coding Implementation on Building Self-Organizing Zettelkasten Knowledge Graphs and Sleep-Consolidation Mechanisms
In this tutorial, we dive into the cutting edge of Agentic AI by building a “Zettelkasten” memory system, a “living” […]
