Training

Introduction

The third swimlane in Fig. 1 illustrates the Training work stream. In this work stream, the ML model coded in the Modeling work stream is trained and validated; the validated trained model is registered for deployment in the Inference work stream. The activities in this work stream can be done mannually, or automated whereever possible especially for recurrent training.

Activity Description Inputs Outputs
Train Model The pre-trained ML model coded in the Modeling work stream is trainined with a updated training set using the selected hyperparameters. Pre-trainined model & hyperparameters & training set Trained model
Validate Model The new trainied ML model is validated with a testing set on the inference accuracy. Only the model with an expected accuracy is used for deployment. Trained model & testing set Validated model
Register Model The validated model with an expected accuracy is registered to the registry for deployment to the Inference work stream. Validated model Trained model published to registry

Train Model

The coded ML model and the selected hyperparameters from the Modeling work stream are passed to this activity. The pre-trainined ML model is recurrently trained when a updated training set is generated. This activity can be done mannually. If the trained model needs to be updated regularly, the training process should be programmatically automated. Note that the training set should be representative of the current data pattern for inference.

Validate Model

The newly trained model is validated against performing inference on a testing set. A validation rule can be set to define the criteria on the acceptance of an updated trained model. For example, the precision and recall of the updated trained model must be above predefined thresholds. Note that the testing set should be representative of the current data pattern for inference.

Register Model

The trained model accepted after validation can be registered to a registry for deployment to the Inference work stream.