Combine AnimateDiff with the ST-MFNet frame interpolator to create smooth, realistic videos from a text prompt.
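
The two-stage shape of this pipeline (generate a short clip from the prompt, then synthesize in-between frames to smooth it) can be sketched roughly as follows. This is only a toy illustration under stated assumptions: `generate_keyframes`, `blend_midpoint`, `interpolate`, and `text_to_video` are hypothetical names, neither model is actually called, and the plain pixel average stands in for ST-MFNet, which is a neural network and does not simply average neighbouring frames.

```python
from typing import Callable, List

# A toy "frame": a flat list of pixel intensities (assumption for illustration).
Frame = List[float]


def generate_keyframes(prompt: str, num_frames: int = 4, size: int = 3) -> List[Frame]:
    """Hypothetical stand-in for the AnimateDiff stage.

    Returns deterministic toy frames derived from the prompt text;
    the real model would return generated images.
    """
    seed = sum(ord(ch) for ch in prompt) % 10
    return [[float(seed + i)] * size for i in range(num_frames)]


def blend_midpoint(a: Frame, b: Frame) -> Frame:
    """Hypothetical stand-in for the interpolator: average two neighbouring frames."""
    return [(x + y) / 2.0 for x, y in zip(a, b)]


def interpolate(
    frames: List[Frame],
    midpoint: Callable[[Frame, Frame], Frame] = blend_midpoint,
) -> List[Frame]:
    """Insert one synthesized frame between each pair of neighbours.

    N input frames become 2N - 1 output frames.
    """
    if not frames:
        return []
    out = [frames[0]]
    for prev, nxt in zip(frames, frames[1:]):
        out.append(midpoint(prev, nxt))  # in-between frame
        out.append(nxt)                  # original frame
    return out


def text_to_video(prompt: str) -> List[Frame]:
    """Stage 1: generate frames from text. Stage 2: smooth them by interpolation."""
    return interpolate(generate_keyframes(prompt))
```

Passing the interpolator in as the `midpoint` argument keeps the two stages decoupled, so in this sketch the averaging function could be swapped for a call to a real interpolation model without touching the generation step.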