log in  |  register  |  feedback?  |  help  |  web accessibility
Logo
Recent work on Interpretable Machine Learning
Cynthia Rudin - Duke University
Friday, December 1, 2017, 11:00 am-12:00 pm Calendar
  • You are subscribed to this talk through .
  • You are watching this talk through .
  • You are subscribed to this talk. (unsubscribe, watch)
  • You are watching this talk. (unwatch, subscribe)
  • You are not subscribed to this talk. (watch, subscribe)
Abstract

This issue of interpretability in predictive modeling is particularly important, given that the US government currently pays private companies for black box predictions that are used throughout the US Justice System. Do we really trust a black box model to make decisions on criminal justice? Propublica claimed that we should not. In particular, the black box predictions purchased by the US government are potentially biased. The US government could have tried to prove that no white box (interpretable) model exists that has the same accuracy, but they did not attempt that. For decisions of this gravity - for justice standards, healthcare, energy reliability or other critical infrastructure standards - we should consider interpretable models before resorting to a black box.


In this talk I will discuss algorithms for interpretable machine learning. Some of these algorithms are designed to create proofs of nearness to optimality. I will focus on some of our most recent work, including:

1.    work on optimal rule list models using customized bounds and data  structures (these are an  alternative to CART)

2.    work on optimal scoring systems (alternatives to logistic regression + rounding)

Since we have methods that can produce optimal or near-optimal models, we can use them to produce interesting new forms of interpretable models. These new forms were simply not possible before, since they are almost impossible to produce using traditional techniques (like greedy splitting and pruning).

In particular:

       (3)   Falling rule lists

       (4)   Causal falling rule lists

       (5)  Cost-effective treatment regimes

 

Work on (1) is joint with postdoc Elaine Angelino, students Nicholas Larus-Stone and Daniel Alabi, and colleague Margo Seltzer. Work on (2) is joint with student Berk Ustun. Work on (3) and (4) are joint with students Fulton Wang and Chaofan Chen, and (5) is joint with student Himabindu Lakkaraju.

Drafts are here for (1) and (2) (both papers are current work):

Certifiably Optimal Rule Lists

https://xxx.arxiv.org/pdf/1704.01701.pdf

Longer version of KDD 2017 paper (oral)

Learning Risk Scores from Large-Scale Datasets

http://web.mit.edu/ustunb/www/docs/OptimizedRiskScores.pdf

Longer version of KDD 2017 paper

(INFORMS 2017 ICS Best Student Paper Award)

Other papers for (3), (4), (5) are on my website:

https://users.cs.duke.edu/~cynthia/papers.html

Bio

Cynthia Rudin is an associate professor of computer science, electrical and computer engineering, and statistics at Duke University, and directs the Prediction Analysis Lab. Her interests are in machine learning, data mining, applied statistics, and knowledge discovery (Big Data). Her application areas are in energy grid reliability, healthcare, and computational criminology. Previously, Prof. Rudin held positions at MIT, Columbia, and NYU. She holds an undergraduate degree from the University at Buffalo, and a PhD in applied and computational mathematics from Princeton University. She is the recipient of the 2013 and 2016 INFORMS Innovative Applications in Analytics Awards, an NSF CAREER award, was named as one of the "Top 40 Under 40" by Poets and Quants in 2015, and was named by Businessinsider.com as one of the 12 most impressive professors at MIT in 2015. Work from her lab has won 10 best paper awards in the last 5 years. She is past chair of the INFORMS Data Mining Section, and is currently chair of the Statistical Learning and Data Science section of the American Statistical Association. She also serves on (or has served on) committees for DARPA, the National Institute of Justice, the National Academy of Sciences (for both statistics and criminology/law), and AAAI.

This talk is organized by Adelaide Findlay