Event Details:
Date: Friday, August 15, 2025
Time: 9:00am - 5:00pm PDT
Location: Stanford University Historic Campus
Open to: Affiliate Members
This workshop explores the evolution of computer vision from early classification models to modern generative systems powered by diffusion and large vision models. Through a mix of theory and practical insights, participants will learn how these models work and how they’re applied in real-world scenarios. Topics include:
- Overview of key milestones in computer vision, from CNNs to Vision Transformers.
- Introduction to multimodal learning with models like CLIP that connect vision and language.
- Deep dive into generative models: autoencoders, GANs, and diffusion models.
- Controllability and practical applications: inpainting, segmentation, text-to-image, and video generation.
By the end of the workshop, participants will have a strong conceptual understanding of how large vision models are designed, how they generate and edit images, and how they are shaping the future of generative modeling.