PyData Vermont 2024

Abhishek Murthy

Abhishek Murthy is currently a Senior ML/AI Architect at Schneider Electric (SE) in Boston, Massachusetts USA. He is passionate about sustainability, with a focus on climate change. To that end, he develops Machine Learning (ML) algorithms on sensor data that are critical for the sustainability commitments of the Industrial Automation and Energy Management businesses of SE. He is also a lecturer at Northeastern University and teaches machine learning algorithms for the Internet of Things.

Abhishek received his PhD in Computer Science from Stony Brook University, State University of New York and MS in Computer Science from University at Buffalo. His doctoral research, which was part of a National Science Foundation Expedition in Computing, entailed developing algorithms for automatically establishing the input-to-output stability of dynamical systems.

He led the Data Science Algorithms team at WHOOP before joining SE. He also worked at Signify, formerly called Philips Lighting, as a Senior Data Scientist and led research on IoT applications for smart buildings. Abhishek has served on several conference review committees and NSF panels. His research includes several publications and research articles with more than 190 citations. He has been awarded 15 patents and has more than 50 applications pending.

The speaker's profile picture

Sessions

07-29
11:15
30min
Backtesting Time Series Forecasting Algorithms in SKTime and SKForecast
Abhishek Murthy

Backtesting refers to the process of evaluating a time series forecasting algorithm on historical data by replicating the corresponding real-world scenario. Concurrently, parameters such as the model updating and retraining frequencies are also tuned based on the usecase and relevant computational constraints.

In this talk, we will review the backtesting of time series algorithms using sktime and skforecast. The two open-source machine learning libraries are popular options for developing and deploying forecasting models. Specifically, the following aspects will be covered.

• Comparing and contrasting backtesting-related features of the two libraries

• An overview of the different types of cross-validation schemes for time series forecasting, including expanding and fixed windows

• Model update and retraining for both direct and recursive multistep forecasts

The talk is geared toward data scientists that want to systematically evaluate time series forecasting models in varied settings. In addition to gaining an overview of the various aspects, the audience will also learn about the implementation options supported by the two libraries. No prior knowledge about machine learning algorithms for forecasting is needed to attend the talk.

Filmhouse