log in  |  register  |  feedback?  |  help  |  web accessibility
Logo
Teaching Machines to Read with Less Supervision
Alan Ritter
Virtual- https://umd.zoom.us/j/592194770
Friday, April 10, 2020, 11:00 am-12:00 pm Calendar
  • You are subscribed to this talk through .
  • You are watching this talk through .
  • You are subscribed to this talk. (unsubscribe, watch)
  • You are watching this talk. (unwatch, subscribe)
  • You are not subscribed to this talk. (watch, subscribe)
Abstract

In recent years, advances in speech recognition and machine translation (MT) have led to wide adoption, for example, by helping people issue voice commands to their phone and talk with people who do not speak the same language. These advances are possible due to the use of neural network methods on large, high-quality datasets. However, computers still struggle to understand the meaning of language. In this talk, I will present two efforts to scale up natural language understanding, drawing inspiration from recent successes in speech and MT. First, I will describe an effort to extract structured knowledge from text, without relying on human labels. Our approach combines the benefits of structured learning and neural networks, accurately predicting latent relation mentions given only indirect supervision from a knowledge base. In extensive experiments, we demonstrate that the combination of structured inference, missing data modeling, and end-to-end learned representations leads to state-of-the-art results on a minimally supervised relation extraction task. In the second part of the talk, I will discuss conversational agents that are learned from scratch in a purely data-driven way. To address the challenge of dull responses, which are common in neural dialogue, I will present several strategies that maximize the long-term success of a conversation.

Bio

Alan Ritter is an assistant professor in Computer Science and Engineering at Ohio State University. His research interests include natural language processing, information extraction, and machine learning. He completed his Ph.D. at the University of Washington and was a postdoctoral fellow in the Machine Learning Department at Carnegie Mellon. His research aims to solve challenging technical problems that can help machines learn to read vast quantities of text with minimal supervision. In a recent project, covered by WIRED (https://www.wired.com/story/machine-learning-tweets-critical-security-flaws/), his group built a system that reads millions of tweets for mentions of new software vulnerabilities. He is the recipient of an NSF CAREER award and an Amazon Research Award.

This talk is organized by Richa Mathur