SCAIL-2: End-to-end In-Context Conditioning for Character Animation
SCAIL-2 feeds the raw driving video into the generation sequence instead of a pose skeleton, cutting FVD to 287 vs 305 for Wan-Animate on Studio-Bench, with one model covering animation and replacement.