Skip to main content Skip to secondary navigation
Main content start

Diffusion & Large Vision Models Workshop

Event Details:

Friday, August 15, 2025
9:00am - 5:00pm PDT

Location

Stanford University Historic Campus

This event is open to:

Affiliate Members

This workshop explores the evolution of computer vision from early classification models to modern generative systems powered by diffusion and large vision models. Through a mix of theory and practical insights, learners will understand how these models work and how they’re applied in real-world scenarios.

  • Overview of key milestones in computer vision, from CNNs to Vision Transformers.
  • Introduction to multimodal learning with models like CLIP that connect vision and language.
  • Deep dive into generative models: autoencoders, GANs, and diffusion models.
  • Controllability and practical applications: inpainting, segmentation, text-to-image, and video generation.

By the end, participants will gain a strong conceptual understanding of how large vision models are designed, how they generate and edit images, and how they are shaping the future of generative models. 
 

Related Topics

Explore More Events