Inference

Inference is the runtime phase where a trained model produces outputs for real user requests.

Main Priorities

  • Latency — time from request arrival to response (or to first token)
  • Throughput — requests or tokens served per second
  • Reliability — consistent availability and correct outputs under load
  • Cost efficiency — compute spend per request served
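The tension between the first two priorities can be made concrete with a toy benchmark. The sketch below is illustrative only: `dummy_model` is a hypothetical stand-in for a real model call, and a sequential loop is used so that per-request latency and aggregate throughput can both be read off directly.

```python
import time

def dummy_model(prompt):
    # Hypothetical stand-in for a real model call.
    return prompt[::-1]

def benchmark(requests):
    # Measure per-request latency and overall throughput
    # for a simple sequential serving loop.
    latencies = []
    start = time.perf_counter()
    for r in requests:
        t0 = time.perf_counter()
        dummy_model(r)
        latencies.append(time.perf_counter() - t0)
    elapsed = time.perf_counter() - start
    return {
        "p50_latency_s": sorted(latencies)[len(latencies) // 2],
        "throughput_rps": len(requests) / elapsed,
    }

stats = benchmark(["hello world"] * 100)
```

In a real server, concurrent requests would overlap, so throughput can rise even as individual latencies grow — which is exactly the trade-off batching exploits.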

Common Levers

  • Quantization — lower-precision weights and activations to cut memory and compute
  • Caching — reuse of previously computed results (e.g. KV caches, response caches)
  • Batching — grouping concurrent requests to improve hardware utilization
  • Hardware selection — matching accelerators to the workload's memory and compute profile
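As an illustration of the first lever, the sketch below implements symmetric int8 quantization in plain Python. It is a minimal sketch under simplifying assumptions (per-tensor scaling, no zero point); function names are hypothetical, and real deployments would rely on a framework's quantization tooling.

```python
def quantize_int8(weights):
    # Symmetric per-tensor quantization: map floats into [-127, 127]
    # using a single scale derived from the largest magnitude.
    scale = max(abs(w) for w in weights) / 127 or 1.0
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q, scale):
    # Recover approximate float values from int8 codes.
    return [x * scale for x in q]

w = [0.5, -1.27, 0.0, 1.0]
q, scale = quantize_int8(w)
restored = dequantize(q, scale)
```

Storing `q` takes one byte per value instead of four (float32), at the cost of rounding error bounded by half the scale — the basic memory/accuracy trade quantization makes.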