Evaluation Task
An Evaluation Task is the most basic evaluation workflow in Datumo Eval. It uses a Judge model to compare and evaluate the responses of a Target model, and it quantifies model performance over a Dataset.
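The Judge / Target / Dataset relationship can be illustrated with a minimal Python sketch. This is not the Datumo Eval API: `Example`, `run_evaluation_task`, `toy_target`, and `toy_judge` are hypothetical names introduced only for illustration, and the toy functions stand in for real model calls. It assumes the Judge returns a numeric score per response and that the task score is the mean over the Dataset.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable, List


@dataclass
class Example:
    """One dataset row: a prompt and a reference answer (hypothetical schema)."""
    prompt: str
    reference: str


def run_evaluation_task(
    dataset: List[Example],
    target: Callable[[str], str],
    judge: Callable[[str, str, str], float],
) -> float:
    """Query the target on every prompt, let the judge score each response,
    and return the mean score over the dataset."""
    scores = []
    for example in dataset:
        response = target(example.prompt)
        scores.append(judge(example.prompt, response, example.reference))
    return mean(scores)


# Toy stand-ins for real models (assumptions, for illustration only).
def toy_target(prompt: str) -> str:
    # Pretend "model": upper-cases the prompt.
    return prompt.upper()


def toy_judge(prompt: str, response: str, reference: str) -> float:
    # Pretend "judge": 1.0 if the response matches the reference, else 0.0.
    return 1.0 if response == reference else 0.0


dataset = [
    Example("hello", "HELLO"),
    Example("world", "WORLD"),
    Example("eval", "EVAL"),
    Example("foo", "bar"),
]

score = run_evaluation_task(dataset, toy_target, toy_judge)
print(f"Task score: {score:.2f}")
```

In a real task the Target and Judge would be model endpoints, and the Judge would typically apply a rubric rather than an exact-match check; the loop structure is the part this sketch is meant to convey.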