Anchored Temporal Gaussian Splatting for Long Volumetric Video Representation

¹Peking University  ²Pengcheng Laboratory
SIGGRAPH (TOG) 2026

Free-viewpoint rendering of dynamic 3D scenes with large and complex motions.

VRU Long Scene

DunHuang Scene

Figure 1: Overview of ATGS. Keyframes are extracted from each view to initialize time-conditioned anchors with independent anchor features. Anchors query spatial and temporal grids using their positions and timestamps, and the fused features are decoded into temporal Gaussians. During training, only anchors and temporal grids associated with the current timestamp are updated.
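
To make the dataflow in Figure 1 concrete, here is a minimal PyTorch sketch of the anchor-to-Gaussian pipeline. Everything in it is an illustrative assumption rather than the released implementation: the module name AnchorDecoder, the grid resolutions, the feature width, the number of Gaussians per anchor, and the 14-parameter Gaussian layout (offset, scale, rotation, opacity, color) are all hypothetical.

```python
# Minimal sketch of the Figure 1 dataflow (all names and sizes hypothetical).
import torch
import torch.nn as nn
import torch.nn.functional as F

class AnchorDecoder(nn.Module):
    """Fuse per-anchor features with spatial- and temporal-grid features,
    then decode the result into temporal Gaussian parameters."""

    def __init__(self, feat_dim=32, grid_res=64, time_res=128, k=8):
        super().__init__()
        self.k = k  # hypothetical number of Gaussians spawned per anchor
        # Learnable spatial (3D) and temporal (1D) feature grids.
        self.spatial_grid = nn.Parameter(
            0.01 * torch.randn(1, feat_dim, grid_res, grid_res, grid_res))
        self.temporal_grid = nn.Parameter(
            0.01 * torch.randn(1, feat_dim, time_res))
        # MLP decoding fused features into k Gaussians, assuming 14 params
        # each: 3 offset + 3 scale + 4 rotation + 1 opacity + 3 color.
        self.decoder = nn.Sequential(
            nn.Linear(3 * feat_dim, 128), nn.ReLU(),
            nn.Linear(128, k * 14))

    def forward(self, anchor_xyz, anchor_t, anchor_feat):
        # anchor_xyz: (N, 3) in [-1, 1]; anchor_t: (N,) in [-1, 1];
        # anchor_feat: (N, feat_dim) independent per-anchor features.
        n = anchor_xyz.shape[0]
        # Trilinear query of the spatial grid at the anchor positions.
        pts = anchor_xyz.view(1, n, 1, 1, 3)
        f_sp = F.grid_sample(self.spatial_grid, pts,
                             align_corners=True).view(-1, n).t()   # (N, C)
        # Linear query of the temporal grid at the anchor timestamps.
        tq = torch.stack([anchor_t, torch.zeros_like(anchor_t)],
                         dim=-1).view(1, n, 1, 2)
        f_tm = F.grid_sample(self.temporal_grid.unsqueeze(2), tq,
                             align_corners=True).view(-1, n).t()   # (N, C)
        fused = torch.cat([anchor_feat, f_sp, f_tm], dim=-1)       # (N, 3C)
        return self.decoder(fused).view(n, self.k, 14)             # Gaussians
```

Per the caption, only the anchors and temporal-grid entries associated with the current timestamp are updated during training; in a sketch like this, that would amount to selecting the relevant anchor subset before the forward pass so gradients touch only those rows.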

Abstract

Volumetric video enables immersive free-viewpoint rendering of dynamic real-world scenes, yet existing methods struggle with long sequences and complex motions, often leading to temporal instability and visual artifacts. To address these challenges, we propose ATGS, a Gaussian-splatting-based framework for volumetric video reconstruction. Our key insight is that explicitly tracking long-term complex motion with individual Gaussian primitives is inherently unstable. Instead, we organize Gaussians around time-conditioned anchors that localize their spatial and temporal support, thereby reducing long-range motion complexity. We further introduce a temporal windowing strategy to activate only anchors relevant to the queried time, which improves scalability and temporal coherence. In addition, to ensure spatial and temporal stability, we design a compact set of multi-level anchor features that encode global features, local spatial features, and local temporal features, jointly constraining Gaussian generation. Extensive experiments demonstrate that ATGS consistently outperforms prior methods on long-sequence volumetric videos with complex motions.
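
As a rough illustration of the temporal windowing idea (not the paper's actual activation rule), the sketch below selects only the anchors whose temporal support covers the queried timestamp; the fixed half_window width and the per-anchor center timestamps are hypothetical, and the paper may well use a learned or per-anchor window instead.

```python
# Hypothetical sketch of temporal windowing: activate only the anchors
# whose temporal support contains the queried time.
import torch

def active_anchor_mask(anchor_times, query_t, half_window=0.05):
    """anchor_times: (N,) anchor center timestamps, normalized to [0, 1].
    query_t: queried timestamp in [0, 1].
    half_window: assumed fixed half-width of each anchor's support
    (it could equally be a learned per-anchor parameter)."""
    return (anchor_times - query_t).abs() <= half_window

# Usage: decode and update only the active subset at each step.
anchor_times = torch.rand(10_000)
active_idx = active_anchor_mask(anchor_times, 0.42).nonzero(as_tuple=True)[0]
```

Under a scheme like this, per-step cost scales with the window size rather than the full sequence length, which is the scalability benefit the abstract refers to.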

TL;DR

We present ATGS, a novel framework for volumetric video reconstruction that effectively handles long sequences and complex motions. By utilizing time-conditioned anchors and a temporal windowing strategy, ATGS enhances temporal coherence and scalability. Our approach outperforms existing methods, delivering high-quality volumetric videos with improved stability and detail.

Long 360 Dataset (Large-Scale Motion, 1400 frames)

Our method demonstrates significantly better modeling quality than previous approaches in long-sequence, large-motion scenarios.

LocalDyGS (one clip)

LocalDyGS (140 clips, flickering)

4DGaussian (one clip)

Ours (one clip)

MeetRoom (only 11 views)

Our method performs strongly on the MeetRoom dataset, capturing intricate details and dynamics despite the sparse 11-view input.

4DGaussian

LocalDyGS

Ours

VRU Dataset GZ scene (Large-Scale Motion)

Our method achieves superior reconstruction fidelity for both dynamic objects (e.g., athletes) and static regions (e.g., floor textures) in scenes with large-scale motions.

3DGStream

SpaceTimeGS

4DGaussian

Ours

SelfCap dataset (Multi-View Long Video)

Multi-View Short Videos

MeetRoom dataset (discussion)

MeetRoom dataset (trimming)

MeetRoom dataset (VR)

VRU (Dongguan)

PKU-DyMVHumans dataset (Dunhuang fixed view)