Skip to content
Course Rockstar
TechnologyAdvanced

Multi-modal AI

Learn to build production applications by combining visual and textual inputs with AI coding tools. You will explore multi-modal programming where screenshots,...

By Alfredo Deza on Coursera

About This Course

Learn to build production applications by combining visual and textual inputs with AI coding tools. You will explore multi-modal programming where screenshots, images, and text serve as inputs for AI-assisted code generation, and set up development environments configured for visual AI workflows. The course covers prompt engineering with visual context to improve code generation accuracy, and hands-on development with GitHub Copilot in VS Code for inline suggestions and chat-based interactions. You will build a complete project using live reload and browser developer tools for rapid feedback between AI generation and visual output. The iterative development module teaches documentation-driven design where documentation guides AI toward desired outcomes, image-based iteration for refining generated code through visual comparison, and automated checks and validations that maintain quality through development cycles. You will learn to identify and overcome common iteration challenges including regression and context drift. The advanced module covers Model Context Protocol for connecting AI tools with external capabilities, Playwright for browser automation and visual testing, and Playwright MCP for AI-driven browser interactions that validate web applications directly. By completing this course, you will be able to convert screenshots into production code through iterative, automated, multi-modal AI workflows.

Topics Covered

Frequently Asked Questions

How much does Multi-modal AI cost?

Visit the Multi-modal AI course page for current pricing and available discounts.

Who teaches Multi-modal AI ?

Multi-modal AI is taught by Alfredo Deza, Pragmatic AI Labs.

What skill level is Multi-modal AI for?

This course is designed for advanced learners.

Similar Courses

Included with membership
Enroll Now
Students0
Duration5 hours
LevelAdvanced
Languageen
PlatformCoursera