Deploying the System Design of a Text-to-Video Generation System
Understand the System Design of a text-to-video generation system.
In the previous lesson, we chose a model similar to
Let’s start with the model size estimation:
Text-to-video model size estimation
We are considering a similar model to Mochi 1, which has approximately 10 billion parameters. For FP32 floating-point precision, the model size becomes:
Get hands-on with 1300+ tech skills courses.