Describe a product → GLM writes the script → FLUX paints each scene → Kokoro voices it → ffmpeg cuts a vertical (9:16) ad. The GPU runs on demand on Modal, so the first run after an idle period takes about 30–60 s (cold start).
AI video generates a real motion clip for each scene, so it is noticeably slower and uses more GPU.
Ready.