Effective Strategies for Monitoring Machine Learning Models in Businesses

May 2nd 2023 New Story

3min

by @huwfulcher

Huw Fulcher

@huwfulcher

Software/MLOps Engineer, ex-Data Science Lead. Writing on mlopsnow.com to help...

Read this story in a terminal

Print this story

Read this story w/o Javascript

Too Long; Didn't Read

Monitoring the performance of machine learning models is crucial for the success of businesses. Unlike monitoring software, model monitoring requires consideration of concept drift and prediction confirmation lag time. Here are three strategies for easier model monitoring: 1) monitor data quality, 2) monitor prediction distribution for deviations indicating drift, and 3) watch the news for real-world events that change data. By following these strategies, businesses can ensure that their models remain accurate and reliable over time, ensuring the sustainability of their operations.

People Mentioned

Machine Learning

@machinelearning2

Your browser does not support theaudio element.

Read by Dr. One (en-US)

Audio Presented by

@huwfulcher

Huw Fulcher

Software/MLOps Engineer, ex-Data Science Lead. Writing on...

Receive Stories from @huwfulcher

Model monitoring is critical to the operational success of Machine Learning within businesses. A well-trained model can take advantage of large and complex datasets to significantly impact decision-making and profitability. The last place you want to be is in front of your boss explaining why the forecasting model has royally screwed the business over.

Monitoring models is not the same as monitoring an online service. Monitoring of software is usually restricted to how fast it runs and whether it is encountering failures or not. For models, these things need to be considered as well as monitoring for concept drift where underlying patterns change and make models less accurate.

Another significant challenge is when the outcome of a prediction is not able to be confirmed for weeks or months. This can leave businesses with uncertainty about the efficacy of a model until it is too late and they are negatively impacted.

3 ways to make model monitoring easier

While each implementation of a model comes with unique monitoring challenges here are 3 strategies that can be used. These are particularly effective for models that have a significant lag time between prediction and confirmation.

1. Monitor Data Quality: Garbage In, Garbage Out

The reliable phrase used by Data Scientists naturally holds for models in production as well.

If the quality of data begins to degrade upstream of a model then it will hurt the performance of the model. This is particularly insidious when the data is present but incorrect. Missing data would cause the model to throw an error, an event that is much easier to detect. Data in, for example, a categorical variable being incorrectly categorized is a much more difficult problem to identify.

There are several ways to monitor data quality. Comparing distributions of data against those used to train/test the model can help flag potential anomalies. Another approach is to use data validation packages to confirm the quality of data before it is passed into a model.

2. Monitor prediction distribution: Deviations indicate drift

Similar to monitoring the distribution of data going in, monitoring prediction distribution is a good way to identify when a model is starting to drift. Producing a value that compares prediction distribution at training time and now is a great metric to add to a dashboard or early warning system.

Changes in distribution may not necessarily indicate poor performance but they can be a way to notify you of when to investigate to perform preventative maintenance rather than reactive.

3. Watch the news: Real-world events change data

Watching the news may not be what you think about when monitoring models but it can provide effective early warnings.

The COVID-19 pandemic saw customer behavior change overnight, sending many companies using ML reeling as their models could no longer make sense of their customers. While there isn’t much that can be done by the instantaneous change in customer behavior it provides an important lesson in why you should be informed of world events.

If you’re a FinTech and the economy begins to take a turn for the worse your customer behaviour will inevitably change. This provides you with an opportunity to collect data to retrain your model so that when it drifts too far it can quickly be corrected to the new status quo.

Conclusion

Model monitoring is not an easy task but a necessary one. Ensuring your models remain accurate and reliable over time will ensure the sustainability of your business. Monitoring data quality, prediction and the news are good ways to make monitoring your models easier.

Also published here.

Featured Photo by Pawel Czerwinski on Unsplash

by Huw Fulcher @huwfulcher.Software/MLOps Engineer, ex-Data Science Lead. Writing on mlopsnow.com to help you get into MLOps.

Learn How to Write Clean Code for Python

Comments

Customiz|

Effective Strategies for Monitoring Machine Learning Models in Businesses

Effective Strategies for Monitoring Machine Learning Models in Businesses

Huw Fulcher

@huwfulcher

Too Long; Didn't Read

People Mentioned

@huwfulcher

3 ways to make model monitoring easier

1. Monitor Data Quality: Garbage In, Garbage Out

2. Monitor prediction distribution: Deviations indicate drift

3. Watch the news: Real-world events change data

Conclusion

Comments

Recommend

GM lures buyers in China with new Wuling Bingo EV starting at under $10K with fu...

使用 Semantic Kernel 实现 Microsoft 365 Copilot 架构 - 张善友

Google Nest cameras and thermostats now work with the Abode home security system...

Fire-Boltt Phoenix Ultra智能手表发布具有蓝牙通话

不愿“困于人海”的年轻人，盯上徒步

How We Accidentally DoS-ed Ourselves with Kotlin Coroutines

华信永道获北交所IPO批文，拟募资约1.29亿元

China plans to begin building lunar base as early as this decade

SAP PaPM Cloud: Introduction of Query based Master Data and Hierarchy

Is infinity an odd or even number?

About Joyk