Loading Notice: Due to the large number of videos, the initial page load may take some time. Thank you for your patience.
We use a script to help synchronize video playback, but slight delays may still occur.

SimpleGVR: A Simple Baseline for
Latent-Cascaded Generative Video Super-Resolution

Supplementary Material

Paper ID #5131

On this page, we first present the VSR comparison results, where the input is the low-resolution output of the Large T2V model (i.e., 384×672 resolution).

Subsequently, for the high-resolution T2V task, we present a comparison between Cascade (Large T2V + SimpleGVR) and End-to-End (Large T2V) results.

We recommend watching all comparisons in full screen. Click on the videos for seeing them in full scale.

Visual Comparison of different methods

"A close-up view of a cute little white chick wearing black-framed glasses perched on the end of its beak, sitting comfortably on a plush, brown, and worn-out sofa in a cozy living room. "

Input

SimpleGVR (Ours)

SeedVR1 (7B)

FlashVideo

STAR

VEnhancer

Upscale-A-Video

Lavie

RealBasicVSR

Visual Comparison of different methods

"A giant panda with soft, fluffy fur and a gentle demeanor is sitting on a wooden dock by the serene shores of a tranquil lake, strumming the strings of a guitar with its paws."

Input

SimpleGVR (Ours)

SeedVR1 (7B)

FlashVideo

STAR

VEnhancer

Upscale-A-Video

Lavie

RealBasicVSR

Visual Comparison of different methods

"Two majestic tigers, their vibrant orange and black stripes glistening in the sunlight, are sitting at a sleek, modern dining table, wearing trendy sunglasses with brightly colored frames."

Input

SimpleGVR (Ours)

SeedVR1 (7B)

FlashVideo

STAR

VEnhancer

Upscale-A-Video

Lavie

RealBasicVSR

Visual Comparison of different methods

"A stunning 18-year-old woman with a delicate melon-seed face, defined facial features, and a slender oval face shape stands confidently by a serene pool area, capturing the viewer's attention."

Input

SimpleGVR (Ours)

SeedVR1 (7B)

FlashVideo

STAR

VEnhancer

Upscale-A-Video

Lavie

RealBasicVSR

Visual Comparison of different methods

"A young lady wearing a sling deep V skirt eats roast chicken in front of the camera."

Input

SimpleGVR (Ours)

SeedVR1 (7B)

FlashVideo

STAR

VEnhancer

Upscale-A-Video

Lavie

RealBasicVSR

1080p T2V Comparison: End-to-End vs. Cascade

A giant panda with soft, fluffy fur and a gentle demeanor is sitting on a wooden dock by the serene shores of a tranquil lake, strumming the strings of a guitar with its paws.

End-to-End (Large T2V)

Cascade (Large T2V + SimpleGVR)

1080p T2V Comparison: End-to-End vs. Cascade

A furry kitten squats in front of a glass fish tank, staring intently at the orange fish swimming in the water. The bottom of the fish tank is covered with colorful stones.

End-to-End (Large T2V)

Cascade (Large T2V + SimpleGVR)

1080p T2V Comparison: End-to-End vs. Cascade

The mayonnaise is squeezed onto the sandwich, and the thick, yellow sauce slowly spreads across the surface of the bread.

End-to-End (Large T2V)

Cascade (Large T2V + SimpleGVR)