Enterprise AI in the U.S. has left the experimentation phase. CFOs expect clear ROI, boards expect evidence of risk oversight, […]
Category: Staff
A Full Code Implementation to Design a Graph-Structured AI Agent with Gemini for Task Planning, Retrieval, Computation, and Self-Critique
In this tutorial, we implement an advanced graph-based AI agent using the GraphAgent framework and the Gemini 1.5 Flash model. […]
GPZ: A Next-Generation GPU-Accelerated Lossy Compressor for Large-Scale Particle Data
Particle-based simulations and point-cloud applications are driving a massive expansion in the size and complexity of scientific and commercial datasets, […]
Prefix-RFT: A Unified Machine Learning Framework to blend Supervised Fine-Tuning (SFT) and Reinforcement Fine-Tuning (RFT)
Large language models are typically refined after pretraining using either supervised fine-tuning (SFT) or reinforcement fine-tuning (RFT), each with distinct […]
JSON Prompting for LLMs: A Practical Guide with Python Coding Examples
JSON Prompting is a technique for structuring instructions to AI models using the JavaScript Object Notation (JSON) format, making prompts […]
What is a Voice Agent in AI? Top 9 Voice Agent Platforms to Know (2025)
What is a Voice Agent? An AI voice agent is a software system that can hold two-way, real-time conversations over […]
Large Language Models LLMs vs. Small Language Models SLMs for Financial Institutions: A 2025 Practical Enterprise AI Guide
Table of contents 1. Regulatory and Risk Posture 2. Capability vs. Cost, Latency, and Footprint 3. Security and Compliance Trade-offs […]
Google AI Proposes Novel Machine Learning Algorithms for Differentially Private Partition Selection
Differential privacy (DP) stands as the gold standard for protecting user information in large-scale machine learning and data analytics. A […]
Native RAG vs. Agentic RAG: Which Approach Advances Enterprise AI Decision-Making?
Retrieval-Augmented Generation (RAG) has emerged as a cornerstone technique for enhancing Large Language Models (LLMs) with real-time, domain-specific knowledge. But […]
Huawei CloudMatrix: A Peer-to-Peer AI Datacenter Architecture for Scalable and Efficient LLM Serving
LLMs have rapidly advanced with soaring parameter counts, widespread use of mixture-of-experts (MoE) designs, and massive context lengths. Models like […]