ShareVerse: Collaborative Video Generation for Shared World Modeling

ShareVerse empowers distributed agents to collaboratively synthesize a globally consistent virtual environment. We bridge isolated generative priors through two core mechanisms: (1) implicit cross-agent interaction, which resolves visual conflicts during concurrent exploration (red/blue vehicles); and (2) a global Spatiotemporal Memory Cache, which guarantees long-term environmental permanence during asynchronous revisitation (green vehicle).
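To make the idea of a shared memory cache concrete, here is a minimal sketch of one way such a structure could look. It is not the ShareVerse implementation: the class name, the pose quantization, the `cell_size` and `heading_bins` parameters, and the "newest observation wins" rule are all assumptions chosen for illustration. The sketch only shows how one agent's observation could be stored under a quantized pose and later retrieved when another agent revisits roughly the same place.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

# Hypothetical key: quantized (x cell, y cell, heading bin).
Key = Tuple[int, int, int]


@dataclass
class Entry:
    agent_id: str         # which agent wrote the observation
    timestep: int         # when it was written
    feature: List[float]  # stand-in for a cached frame or latent


@dataclass
class SpatiotemporalMemoryCache:
    """Illustrative shared cache; all parameters are assumptions."""
    cell_size: float = 1.0
    heading_bins: int = 8
    _store: Dict[Key, Entry] = field(default_factory=dict)

    def _key(self, x: float, y: float, heading_deg: float) -> Key:
        # Quantize position into grid cells and heading into angular bins.
        bin_width = 360.0 / self.heading_bins
        h = int((heading_deg % 360.0) // bin_width) % self.heading_bins
        return (int(x // self.cell_size), int(y // self.cell_size), h)

    def write(self, agent_id: str, timestep: int, x: float, y: float,
              heading_deg: float, feature: List[float]) -> None:
        key = self._key(x, y, heading_deg)
        old = self._store.get(key)
        # Assumed policy: keep the most recent observation for each cell.
        if old is None or timestep >= old.timestep:
            self._store[key] = Entry(agent_id, timestep, feature)

    def read(self, x: float, y: float,
             heading_deg: float) -> Optional[Entry]:
        # Return what any agent last stored near this pose, if anything.
        return self._store.get(self._key(x, y, heading_deg))


cache = SpatiotemporalMemoryCache(cell_size=2.0, heading_bins=4)
cache.write("green", 0, 3.1, 4.9, 10.0, [0.5])
revisit = cache.read(2.5, 5.5, 80.0)  # same cell and heading bin
```

In a real system the cached value would be a frame or latent that conditions the generator on revisitation; a plain list of floats stands in for it here.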

Framework Overview

Experimental Results

Result panels: (a) Both Straight (Opposing); (b) Both Turning (Parallel Opposing); (c) A1 Straight, A2 Turning (Lateral); (d) A1 Straight, A2 Turning (Longitudinal); (e) Both Turning (T-Pattern); (f) Both Turning (X-Pattern).