log in  |  register  |  feedback?  |  help  |  web accessibility
Logo
Putting your database on autopilot: self-driving database management systems
Lin Ma
Virtual- https://umd.zoom.us/j/98095131895?pwd=bFRySUJZSytQcjFVVis0dFpuWU1TZz09
Monday, March 7, 2022, 11:00 am-12:00 pm Calendar
  • You are subscribed to this talk through .
  • You are watching this talk through .
  • You are subscribed to this talk. (unsubscribe, watch)
  • You are watching this talk. (unwatch, subscribe)
  • You are not subscribed to this talk. (watch, subscribe)
Abstract

Database management systems (DBMSs) are essential for modern data-driven applications. However, they are notoriously difficult to deploy and administer because they have many aspects that one can change that affect their performance, including database physical design and system configuration. There are existing methods that recommend how to change these aspects of databases for an application. But most of them still require humans to make final decisions on what changes to apply and when to apply them. Thus, DBMS administrations today remain onerous and costly.

In this talk, I present a self-driving DBMS architecture that enables automatic system management and removes the administration impediments. Our approach consists of three frameworks inspired by self-driving car architectures: (1) workload forecasting, (2) behavior modeling, and (3) action planning. The workload forecasting framework predicts the query arrival rates under varying database workload patterns using an ensemble of time-series forecasting models. The behavior modeling framework constructs fine-grained machine learning models that predict the runtime behavior of the DBMS. Lastly, the action planning framework generates a sequence of optimization actions based on these forecasted workload patterns and behavior model estimations. It uses receding horizon control and Monte Carlo tree search to effectively approximate the complex optimization problem.

Our forecasting-modeling-planning architecture enables an autonomous DBMS that proactively plans for optimization actions without expensive testing. It automatically applies the actions at proper times, holistically controls all system aspects, and provides explanations on its decisions.

Bio

Lin Ma is a postdoctoral researcher in the Computer Science Department at Carnegie Mellon University (CMU). His research focuses on autonomous database management systems, and he is the lead developer of the NoisePage self-driving database system. He completed his PhD at CMU under the supervision of Andy Pavlo in 2021.

This talk is organized by Richa Mathur