Skip to content

CI Usage

ECP is designed to fail builds when agent behavior regresses.

GitHub Actions

name: ECP evals

on:
  pull_request:
  push:
    branches: [main]

jobs:
  evals:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.11"
      - run: pip install "ecp-runtime==0.3.1" "ecp-sdk==0.3.1"
      - run: ecp validate examples/customer_support_demo/manifest.yaml
      - run: ecp run --manifest examples/customer_support_demo/manifest.yaml --json-out ecp-report.json
      - uses: actions/upload-artifact@v4
        if: always()
        with:
          name: ecp-report
          path: ecp-report.json

ecp run exits with code 2 when checks fail. This makes it suitable for pull request gates.

Useful Flags

ecp run --manifest evals/support.yaml --json
ecp run --manifest evals/support.yaml --json-out report.json
ecp run --manifest evals/support.yaml --report report.html
ecp run --manifest evals/support.yaml --no-fail-on-error

Local Preflight

ecp doctor
ecp validate evals/support.yaml
ecp conformance --target "python agent.py"