Lily did a fabulous job presenting her early work on this research. I’ve listed the abstract and video below.
Logging what matters: The Pythia just-in-time instrumentation framework (Lily Sturmann) (Slides): We will present our current work on Pythia, a just-in-time instrumentation framework for distributed systems that automatically enables instrumentation in the right areas to provide visibility into newly-observed problems in a running system. The talk will discuss key challenges involved in creating such a framework: (1) understanding where in the distributed system (e.g., which components) additional instrumentation is needed, (2) understanding what instrumentation (e.g., log statements or information contained in logs, such as function parameter values) is needed, (3) Understanding how to limit the overheads of enabling too much instrumentation. It will discuss how end-to-end tracing, combined with statistical measures and machine-learning techniques, provide a foundation to address these challenges. The talk will conclude with our current progress building Pythia and applying it to problems in OpenStack.