Machine learning perspectives: from 'replay' to 'reconstruction' – SOTA analysis

23 Jun 2026

15:30 - 15:55

AVT Live Stage

Tuesday, June 23, AVT Live Zone – afternoon session

The fields of 3D reconstruction and generative video are moving at a staggering pace. Over 1,700 3D Gaussian splatting papers have been published since mid-2023, and video diffusion research has surpassed 2,300 papers in roughly the same window. The state of the art is changing month to month. Techniques that were considered standard, like COLMAP for structure from motion, are hitting fundamental limits in speed and robustness that new deep-learning-based approaches are rapidly overcoming. Parallel Domain has a dedicated team working at the frontier of these methods, and in this talk we share what we are seeing ahead: how 3D Gaussian splatting, feed-forward reconstruction models and proprietary pose estimation are converging to turn a single drive log into a fully controllable, photorealistic 4D simulation, bridging the gap between rigid replay and hallucinated generative video to enable software-augmented testing at scale.

How research in 3D Gaussian splatting and video diffusion has exploded, and why the state of the art in 3D reconstruction is shifting faster than ever
Why industry-standard tools like COLMAP struggle with speed and sparse-feature environments, and how new deep-learning approaches to structure from motion are overcoming these limits
How a single real-world drive log can be transformed into a controllable, sensor-realistic 4D simulation
What this shift means for engineering and program leaders evaluating simulation strategies: how software-augmented testing reduces reliance on costly real-world fleet miles while accelerating safety validation timelines

Speakers

Johan Vertens, head of machine learning - Parallel Domain