EvalDog vs Promptfoo vs LangSmith

All three help you evaluate LLM outputs. They’re built for different people. Here’s the honest breakdown.

Comparison reflects publicly available info as of June 2026 and may change. Promptfoo is open source (now part of OpenAI); LangSmith is by LangChain.

When to choose EvalDog

You want a hosted pass/fail report without standing up infrastructure.
You’re a QA engineer or small team, not a full-time ML engineer.
You need a zero-token quality gate in CI — and alerts when a model update breaks you.

Love the free Promptfoo CLI? Keep it. EvalDog adds the hosted, watching layer on top — for when “run it when I remember” isn’t enough.