← All courses

Multimodal AI

Course on multimodal AI models: GPT-4V, CLIP, Whisper, Stable Diffusion. Learn to work with images, audio and video via AI, build multimodal RAG pipelines and AI agents. Master Document AI, content generation and cost optimization.

This is a premium course

Purchase access to unlock all modules. One payment — one course of your choice.

300 USDT

Multimodal AI AI