incremental_training/README.md

# Keeping your ML model in shape with Kafka, Airflow and MLFlow
### How to incrementally update your ML model in an automated way as new training data becomes available

Fitting and serving your machine learning (ML) model is one thing, but what about keeping it in shape over time?

Let's say we got a ML model that has been put in production and is actively serving predictions. Simultaneously, we got new training data that becomes available in a streaming way while users use the model. Incrementally updating the model with new data can improve the model, whilst it also might reduce model drift. However, it often comes with additional overhead. Luckily, there are tools that allow you to automate many parts of this process. 

This repository takes on the topic of incrementally updating a ML model as new data becomes available. It mainly leans on three nifty tools, being [Kafka](https://github.com/apache/kafka), [Airflow](https://github.com/apache/airflow), and [MLFlow](https://github.com/mlflow/mlflow). 

The corresponding [walkthrough/post](https://medium.com/vantageai/keeping-your-ml-model-in-shape-with-kafka-airflow-and-mlflow-143d20024ba6) on Medium lays out the workings of this repo step-by-step.
-												Update README.md
											
										
										
											2019-11-05 23:15:37 +08:00
+								# Keeping your ML model in shape with Kafka, Airflow and MLFlow
-												Update README.md
											
										
										
											2019-11-05 23:15:58 +08:00
+								### How to incrementally update your ML model in an automated way as new training data becomes available
-												Update README.md
											
										
										
											2019-11-05 23:15:37 +08:00
 								Fitting and serving your machine learning (ML) model is one thing, but what about keeping it in shape over time?
 								Let's say we got a ML model that has been put in production and is actively serving predictions. Simultaneously, we got new training data that becomes available in a streaming way while users use the model. Incrementally updating the model with new data can improve the model, whilst it also might reduce model drift. However, it often comes with additional overhead. Luckily, there are tools that allow you to automate many parts of this process.
-												Update README.md
											
										
										
											2019-11-05 23:26:26 +08:00
+								This repository takes on the topic of incrementally updating a ML model as new data becomes available. It mainly leans on three nifty tools, being [Kafka](https://github.com/apache/kafka), [Airflow](https://github.com/apache/airflow), and [MLFlow](https://github.com/mlflow/mlflow).
-												Update README.md
											
										
										
											2019-11-05 23:20:33 +08:00
-												Update README.md
											
										
										
											2019-11-06 04:19:41 +08:00
+								The corresponding [walkthrough/post](https://medium.com/vantageai/keeping-your-ml-model-in-shape-with-kafka-airflow-and-mlflow-143d20024ba6) on Medium lays out the workings of this repo step-by-step.