Automation

Updated May 13, 2023 ·

Overview

Automation improve efficiency and reduce human error in ML pipelines. The four main principles are:

Continuous Integration (CI): Regularly merge code changes into a shared repository.
Continuous Delivery (CD): Automatically build, test, and deploy code.
Continuous Training (CT): Automatically update models as new data arrives.
Continuous Monitoring (CM): Continuously track model performance.

Continuous Integration (CI) means frequently integrating code changes and testing them automatically.

On the other hand, Continuous Deployment (CD) automates the release of validated code after testing.

For more information, please see CICD Overview.

info

Tools like Git, AWS CodePipeline, Jenkins, and Travis CI are commonly used to implement CI/CD.

Continuous Training involves automatically retraining models as new data becomes available, keeping models accurate.

Continuous Monitoring is the practice of continuously monitoring performance, identifying issues early, and triggering retraining if necessary.

Here’s how automation works in a typical ML pipeline: