Google's latest multimodal model, supports image and video[0] in text or chat prompts.
Optimized for language tasks including:
- Code generation
- Text generation
- Text editing
- Problem solving
- Recommendations
- Information extraction
- Data extraction or generation
- AI agents
Usage of Gemini is subject to Google's Gemini Terms of Use.
- [0]: Video input is not available through OpenRouter at this time.