Text and Image-to-Text Generation

We have gone through the text generation from text and image prompts individually and seen how Gemini can be creatively used in various applications. Now, it’s time to extend it further to multimodality. We’ll generate text through multiple input formats:

  • Image file: Visual data representing an image.

  • Text file: Structured text-based information.

  • Simple text: Unstructured text-based prompt.

Get hands-on with 1400+ tech skills courses.