Over the last decade, information networks have become ubiquitous and widespread. These include social networks, communication networks, financial transaction networks, citation networks, disease transmission networks, and many more. There is a growing need for data management systems that can support real-time ingest, storage, and querying over information networks, and complex analysis over them. In this talk, I will discuss some of the key challenges in building such systems and discuss our work on building a distributed graph database system to support declarative analytics and continuous queries over very large, dynamic information networks.
I will also discuss a new research project that we have recently began, where the goal is to build a github-like collaborative data management system for enabling data science (described in more detail in a recent paper: http://arxiv.org/abs/1409.0798).