Metrics Job

A batch job that evaluates a model's performance by comparing its predictions against labeled test data. Common outputs include F1 scores, confusion matrices, and fairness indicators.
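
As a rough illustration of what such a job might compute, the following is a minimal sketch in plain Python. It assumes the predictions and labels are already loaded into lists; the function names (`confusion_matrix`, `f1_score`, `selection_rate_by_group`, `run_metrics_job`) are hypothetical, and the per-group selection rate is only one possible fairness indicator, chosen here for simplicity.

```python
from collections import Counter, defaultdict


def confusion_matrix(y_true, y_pred, labels):
    """Rows are true labels, columns are predicted labels, in `labels` order."""
    counts = Counter(zip(y_true, y_pred))
    return [[counts[(t, p)] for p in labels] for t in labels]


def f1_score(y_true, y_pred, positive):
    """F1 for one class: 2*TP / (2*TP + FP + FN); 0.0 if the denominator is 0."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def selection_rate_by_group(y_pred, groups, positive):
    """Fraction of positive predictions per group (an assumed fairness indicator)."""
    totals = defaultdict(int)
    selected = defaultdict(int)
    for pred, group in zip(y_pred, groups):
        totals[group] += 1
        if pred == positive:
            selected[group] += 1
    return {g: selected[g] / totals[g] for g in totals}


def run_metrics_job(y_true, y_pred, groups, labels, positive):
    """Hypothetical job entry point: bundle all metrics into one report dict."""
    if not (len(y_true) == len(y_pred) == len(groups)):
        raise ValueError("y_true, y_pred, and groups must have the same length")
    return {
        "f1": f1_score(y_true, y_pred, positive),
        "confusion_matrix": confusion_matrix(y_true, y_pred, labels),
        "selection_rate": selection_rate_by_group(y_pred, groups, positive),
    }


if __name__ == "__main__":
    # Toy data for illustration only; a real job would read these from storage.
    report = run_metrics_job(
        y_true=[1, 0, 1, 1, 0, 0],
        y_pred=[1, 0, 0, 1, 1, 0],
        groups=["a", "a", "a", "b", "b", "b"],
        labels=[0, 1],
        positive=1,
    )
    print(report)
```

In practice a job like this would typically read its inputs from a dataset store and write the report somewhere durable; those steps are left out because they depend on the surrounding system.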