
AI-Powered Thermal System Shields San Francisco Whales From Ship Strikes
A new detection network combining thermal imaging and machine learning aims to prevent collisions as climate-displaced gray whales increasingly stop in busy Bay Area waters.
Papers, breakthroughs, benchmarks, and the long-arc trends shaping artificial intelligence. arXiv highlights and lab announcements, distilled.

A new detection network combining thermal imaging and machine learning aims to prevent collisions as climate-displaced gray whales increasingly stop in busy Bay Area waters.

Most companies want to deploy autonomous AI systems but lack the structural readiness, forcing executives to rethink workflows from the ground up.

A new consolidation technique lets language models compress long conversations into persistent memory, solving a critical scaling bottleneck.

Researchers demonstrate AI can categorize code changes with 84% recall, enabling faster reviews without manual taxonomy engineering.

Researchers bridge vision-language models and segmentation AI to enable instruction-driven instance detection in a single pass.

Researchers combine traditional geometry with neural networks to solve long-standing reconstruction challenges across diverse imaging conditions.

Researchers propose entity-focused approach to keep multimodal models accurate across different video datasets.

Researchers develop a technique to compress powerful video generators into faster versions that work with incomplete information.

Researchers demonstrate that selectively repeating transformer layers in masked diffusion models cuts training costs by 70% while improving reasoning capabilities.

Researchers tackle a fundamental tradeoff in subject-driven synthesis by leveraging multimodal language models alongside specialized identity conditioning.

Researchers combine distillation and reinforcement learning to create faster text-to-image models that align better with human preferences.

New framework handles transparent materials and complex topology shifts that have stymied previous video-to-4D systems.

Researchers release Prism, a modular toolkit designed to streamline continuous learning in vision-language systems without requiring code rewrites.

New research argues that scaling AI agents depends as much on system design as foundation model improvements.

Researchers develop a controllable system that synthesizes complex traffic scenes from simple layouts, advancing autonomous vehicle safety testing.

Researchers release MobileGym, enabling faster training of AI systems that navigate smartphone interfaces through parallel simulation.

TriSplat eliminates expensive post-processing steps by directly generating simulation-compatible meshes from sparse camera views.