Hands-on with Gemini: Interacting with multimodal AI

11 months ago
21

Gemini is our natively multimodal AI model capable of reasoning across text, images, audio, video and code. This video highlights some of our favorite interactions with Gemini. Learn more and try the model: https://deepmind.google/gemini

Explore our prompting approaches here: https://goo.gle/how-its-made-gemini

For the purposes of this demo, latency has been reduced and Gemini outputs have been shortened for brevity.

Loading comments...