ML Institute Open Lectures

Location:

London

Start Date:

March 19, 2025 6:00 PM

End Date:

March 19, 2025 8:00 PM

Price:

Free

In this talk, Besart Shyti will explore the architectures behind multimodal models, from vision-language transformers to generative AI systems that create text from images and audio. We'll discuss how these models integrate diverse information, their current limitations, and the emerging breakthroughs. From improving search engines to enabling video understanding, multimodal AI is increasingly entering real-world applications.

Official Event Page