LiveSVG: Zero-Shot SVG Animation via Video Generation

LiveSVG turns a static SVG into a prompt-aligned animation, using video generation for motion guidance.

Matan Levy1,2 Ran Margolin1 Bar Cavia1 Dvir Samuel3 Yael Pritch1 Shmuel Peleg2 Alex Rav Acha1 Ariel Shamir1,4 Dani Lischinski1,2
1Google 2The Hebrew University of Jerusalem 3Bar-Ilan University 4Reichman University
Paper (PDF) arXiv View Results Cite
Rapping into a microphone
A couple dancing tango
A parrot flying
Gymnastic split
Overview

Animated motion while preserving editable SVG geometry

LiveSVG animates an existing SVG from a motion prompt without training on vector-animation data. Instead of optimizing against a noisy in-loop video prior, it first generates a concrete target video that can be previewed, then fits the original SVG paths to that target with differentiable rendering. The result is prompt-aligned motion that remains editable as SVG.
Method

How LiveSVG works

The pipeline separates motion generation from SVG optimization, so the target motion is explicit before vector fitting begins.

1

Group

An LLM groups related SVG elements so coherent parts can move together.

2

Recolor

Temporary sphere-packing colors separate paths and improve pixel-space correspondence.

3

Generate video

An image-to-video model creates a previewable motion target from the SVG and prompt.

4

Fit SVG

Group homographies and local Bézier offsets deform the original vector geometry over time.

LiveSVG pipeline diagram.

Pipeline overview. The target video provides the motion; differentiable rendering transfers it back to editable SVG paths.

Qualitative Comparison

LiveSVG vs. prior methods

LiveSVG (highlighted) produces large, topology-preserving deformations. SDS- and LLM-based baselines remain close to the static input or exhibit only limited motion.

"Full gymnastic split on the floor, both hands raised high."
LiveSVG animation preview
LiveSVG (Ours)
AniClipart animation preview
AniClipart
FlexiClip animation preview
FlexiClip
LiveSketch animation preview
LiveSketch
LINR-Bridge animation preview
LINR-Bridge
Vector Prism animation preview
Vector Prism
"A man sits on the floor."
LiveSVG animation preview
LiveSVG (Ours)
AniClipart animation preview
AniClipart
FlexiClip animation preview
FlexiClip
LiveSketch animation preview
LiveSketch
LINR-Bridge animation preview
LINR-Bridge
Vector Prism animation preview
Vector Prism
Diversity

Multiple plausible motions from a single prompt

Different random seeds produce distinct motions for the same input. Users preview each candidate before committing to SVG fitting — a key advantage of decoupling video generation from optimization.

Prompt — "A person rapping into a microphone"
Variant A animation preview
Variant A
Variant B animation preview
Variant B
Variant C animation preview
Variant C
Variant D animation preview
Variant D
Prompt — "A woman standing and waving her hand"
Variant A animation preview
Variant A
Variant B animation preview
Variant B
Variant C animation preview
Variant C
Variant D animation preview
Variant D
New Benchmark

Introducing ChallengeSVG

AniClipart covers simple single-subject clipart. ChallengeSVG adds 35 complex, multi-object SVGs from SVGX-Core-250k that expose failure modes beyond this narrow setting.

Existing benchmarkAniClipart — 43 examples

Single-subject clipart with clean backgrounds and roughly 20 paths per example, biased toward human and animal subjects.

  • Single-subject clipart
  • Clean white backgrounds
  • Skeleton-friendly subjects

New benchmarkChallengeSVG — 35 examples

Multi-object scenes with layered occlusions, non-empty backgrounds, dense path counts, and open-domain subjects.

  • Multi-object & multi-part scenes
  • Non-empty backgrounds
  • Not skeleton-centric
Example inputs from ChallengeSVG
climber
Climber
cyclist
Cyclist
rider
Rider
jellyfish
Jellyfish
surfer
Surfer
earth
Earth
juggler
Juggler
Comparison on a ChallengeSVG example
"The Earth spinning around its axis." ChallengeSVG
LiveSVG animation preview
LiveSVG (Ours)
AniClipart animation preview
AniClipart
FlexiClip animation preview
FlexiClip
LiveSketch animation preview
LiveSketch
LINR-Bridge animation preview
LINR-Bridge
Vector Prism animation preview
Vector Prism
"A moving jellyfish." ChallengeSVG
LiveSVG jellyfish animation preview
LiveSVG (Ours)
AniClipart jellyfish animation preview
AniClipart
FlexiClip jellyfish animation preview
FlexiClip
LiveSketch jellyfish animation preview
LiveSketch
LINR-Bridge jellyfish animation preview
LINR-Bridge
Vector Prism jellyfish animation preview
Vector Prism
Evaluation

Human preference and automatic metrics

LiveSVG wins human preference on both benchmarks and achieves the lowest runtime and GPU footprint among optimization-based baselines.

86.7%
AniClipart
human preference (Oracle)
84.8%
ChallengeSVG
human preference
5.2 min
Runtime per SVG
vs. up to 22.3 min
7.4 GB
GPU memory
vs. up to 40 GB
Human preference: LiveSVG wins vs. each baseline
LiveSVG preferred (> 50%) Baseline preferred 50% chance level
Show automatic metric tables
Automatic Metrics

Quantitative evaluation

Best prompt alignment (X-CLIP), best appearance preservation among optimization-based methods, and the lowest computational cost by a large margin.

AniClipart Benchmark

MethodX-CLIP↑LPIPS↓SSIM↑DOVER↑Time↓VMem↓
No animation0.2110.0001.0000.444
LLM-based
Vector Prism0.2110.0320.9730.4514.6m
Video SDS optimization
LiveSketch0.2060.1530.9100.49622.3m40.0G
AniClipart0.2140.1040.9370.4276.9m27.8G
FlexiClip0.2130.0920.9380.43114.5m28.1G
LINR-Bridge0.2150.1740.9250.43310.7m16.8G
Target-video fitting (ours)
LiveSVG (Veo 3.1)Best0.2160.0870.9420.4475.2m7.4G
LiveSVG (LTX 2.3)0.2150.1050.9400.4455.2m7.4G
LiveSVG (WAN 2.2)0.2140.1160.9380.4465.2m7.4G

ChallengeSVG Benchmark

MethodX-CLIP↑LPIPS↓SSIM↑DOVER↑Time↓VMem↓
No animation0.2140.0001.0000.470
LLM-based
Vector Prism0.2110.1390.8670.4615.2m
Video SDS optimization
LiveSketch0.1820.5030.6090.3975.6m40.4G
AniClipart0.2040.2740.7810.43326.1m28.1G
FlexiClip0.2010.2980.7730.42450.5m28.1G
LINR-Bridge0.2050.4910.7290.42615.7m17.2G
Target-video fitting (ours)
LiveSVG (WAN 2.2)Best0.2150.2080.8440.4764.7m7.4G
Citation

Cite this work

BibTeX coming soon. A citation entry will be made available once the paper is published.