log in  |  register  |  feedback?  |  help  |  web accessibility
Challenges and Opportunities in Open Generative Models
Aaron Gokaslan
IRB 4105 or https://umd.zoom.us/s/2203990027
Wednesday, March 26, 2025, 2:30-3:30 pm
  • You are subscribed to this talk through .
  • You are watching this talk through .
  • You are subscribed to this talk. (unsubscribe, watch)
  • You are watching this talk. (unwatch, subscribe)
  • You are not subscribed to this talk. (watch, subscribe)
Abstract

Open source generative models like OpenGPT2, BLOOM, and others have been pivotal in advancing AI technology. These models leverage extensive text data to achieve advanced linguistic capabilities. However, the trend towards proprietary tools and closed large language models is growing, posing unique challenges in open-source AI development. This discussion will explore the challenges and opportunities training these foundation models, the hurdles in dataset governance, and downstream AI for science applications. We will also explore algorithmic improvements in training these models such as discrete diffusion, learned adaptive noise schedules, and modern GAN baselines. We will cover the challenges of generative models in several different modalities: text, image, and biological sequence data.

Bio
Aaron Gokaslan is a 4th year PhD Student at Cornell University advised by Volodymyr Kuleshov. Previously, he was a Facebook AI Resident advised by Dhruv Batra. Before that, he did his masters and undergrad at Brown University with James Tompkin. His work has been recognized by orals and invited talks at top conferences. He has received awards for his open source contributions from the Linux Foundation.
 
This talk is organized by Samuel Malede Zewdu