This example shows how to create a basic evaluation flow.
Tools used in this flow:
python
tool
Install promptflow sdk and other dependencies in this folder:
pip install -r requirements.txt
In this flow, you will learn
- how to compose a point based evaluation flow, where you can calculate point-wise metrics.
- the way to log metrics. use
from promptflow import log_metric
- see file aggregate.
Testing flow/node:
# test with default input value in flow.dag.yaml
pf flow test --flow .
# test with flow inputs
pf flow test --flow . --inputs groundtruth=ABC prediction=ABC
# test node with inputs
pf flow test --flow . --node line_process --inputs groundtruth=ABC prediction=ABC
There are two ways to evaluate an classification flow.
pf run create --flow . --data ./data.jsonl --stream