Skip to content
Thursday, January 15, 2026
The TechBriefs
  • Home
  • Technology
  • AI
  • Computers
  • Security
  • Internet
  • Press Releases
    • GlobeNewswire
    • PRNewswire
  • Contact

Category: Computer Vision

  • Home
  • Computer Vision
Thinking Machines Lab Makes Tinker Generally Available: Adds Kimi K2 Thinking And Qwen3-VL Vision Input
  • agentic AI
  • AI
  • AI infrastructure
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Computer Vision
  • Editors Pick
  • Language Model
  • Large Language Model
  • Machine Learning
  • New Releases
  • Staff
  • Technology

Thinking Machines Lab Makes Tinker Generally Available: Adds Kimi K2 Thinking And Qwen3-VL Vision Input

  • 0

Thinking Machines Lab has moved its Tinker training API into general availability and added 3 major capabilities, support for the […]

Zhipu AI Releases GLM-4.6V: A 128K Context Vision Language Model with Native Tool Calling
  • AI
  • Artificial Intelligence
  • Computer Vision
  • Editors Pick
  • Language Model
  • New Releases
  • Open Source
  • Staff
  • Technology
  • Vision Language Model

Zhipu AI Releases GLM-4.6V: A 128K Context Vision Language Model with Native Tool Calling

  • 0

Zhipu AI has open sourced the GLM-4.6V series as a pair of vision language models that treat images, video and […]

Black Forest Labs Releases FLUX.2: A 32B Flow Matching Transformer for Production Image Pipelines
  • AI
  • Artificial Intelligence
  • Computer Vision
  • Editors Pick
  • New Releases
  • Open Source
  • Staff
  • Technology

Black Forest Labs Releases FLUX.2: A 32B Flow Matching Transformer for Production Image Pipelines

  • 0

Black Forest Labs has released FLUX.2, its second generation image generation and editing system. FLUX.2 targets real world creative workflows […]

Meta AI Releases Segment Anything Model 3 (SAM 3) for Promptable Concept Segmentation in Images and Videos
  • AI
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Computer Vision
  • Editors Pick
  • New Releases
  • Open Source
  • Staff
  • Tech News
  • Technology

Meta AI Releases Segment Anything Model 3 (SAM 3) for Promptable Concept Segmentation in Images and Videos

  • 0

How do you reliably find, segment and track every instance of any concept across large image and video collections using […]

Why Spatial Supersensing is Emerging as the Core Capability for Multimodal AI Systems?
  • AI
  • Artificial Intelligence
  • Computer Vision
  • Editors Pick
  • New Releases
  • Staff
  • Technology

Why Spatial Supersensing is Emerging as the Core Capability for Multimodal AI Systems?

  • 0

Even strong ‘long-context’ AI models fail badly when they must track objects and counts over long, messy video streams, so […]

Zhipu AI Releases ‘Glyph’: An AI Framework for Scaling the Context Length through Visual-Text Compression
  • AI
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Computer Vision
  • Editors Pick
  • Language Model
  • New Releases
  • Open Source
  • Staff
  • Tech News
  • Technology

Zhipu AI Releases ‘Glyph’: An AI Framework for Scaling the Context Length through Visual-Text Compression

  • 0

Can we render long texts as images and use a VLM to achieve 3–4× token compression, preserving accuracy while scaling […]

Salesforce AI Research Introduces WALT (Web Agents that Learn Tools): Enabling LLM agents to Automatically Discover Reusable Tools from Any Website
  • agentic AI
  • AI
  • AI Agents
  • Artificial Intelligence
  • Computer Vision
  • Editors Pick
  • New Releases
  • Technology

Salesforce AI Research Introduces WALT (Web Agents that Learn Tools): Enabling LLM agents to Automatically Discover Reusable Tools from Any Website

  • 0

A team of Salesforce AI researchers introduced WALT (Web Agents that Learn Tools), a framework that reverse-engineers latent website functionality […]

UltraCUA: A Foundation Computer-Use Agents Model that Bridges the Gap between General-Purpose GUI Agents and Specialized API-based Agents
  • agentic AI
  • AI
  • AI Agents
  • Artificial Intelligence
  • Computer Vision
  • Editors Pick
  • Staff
  • Technology

UltraCUA: A Foundation Computer-Use Agents Model that Bridges the Gap between General-Purpose GUI Agents and Specialized API-based Agents

  • 0

Computer-use agents have been limited to primitives. They click, they type, they scroll. Long action chains amplify grounding errors and […]

Google AI Introduces VISTA: A Test Time Self Improving Agent for Text to Video Generation
  • agentic AI
  • AI
  • AI Agents
  • AI Paper Summary
  • AI Shorts
  • Applications
  • Artificial Intelligence
  • Computer Vision
  • Editors Pick
  • Language Model
  • Staff
  • Tech News
  • Technology

Google AI Introduces VISTA: A Test Time Self Improving Agent for Text to Video Generation

  • 0

TLDR: VISTA is a multi agent framework that improves text to video generation during inference, it plans structured prompts as […]

NVIDIA AI Open-Sources ViPE (Video Pose Engine): A Powerful and Versatile 3D Video Annotation Tool for Spatial AI
  • AI
  • AI Paper Summary
  • Applications
  • Artificial Intelligence
  • Computer Vision
  • Editors Pick
  • New Releases
  • Open Source
  • Promote
  • Sponsored
  • Staff
  • Tech News
  • Technology

NVIDIA AI Open-Sources ViPE (Video Pose Engine): A Powerful and Versatile 3D Video Annotation Tool for Spatial AI

  • 0

How do you create 3D datasets to train AI for Robotics without expensive traditional approaches? A team of researchers from […]

Posts pagination

1 2 … 14 Next
  • Privacy Policy
  • Terms of use
Theme: Terminal News By Adore Themes.