Use Jina, Whisper and Stable Diffusion to build a cloud-native application for generating images using your speech.