quality drops, you instead write real tests and let DeepEval drive pytest. `assert_test(test_case, [metrics])` works like `assert`, but for LLM quality. "You can return ...
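To make the "like `assert`, but for LLM quality" idea concrete, here is a minimal standard-library sketch of the pattern. It is not DeepEval's implementation: `QualityCase`, `Metric`, `assert_quality`, and `keyword_overlap` are hypothetical names invented for illustration, and the scorer is a toy. The sketch only assumes that a metric yields a score between 0 and 1 and that the assertion fails when any score falls below its threshold.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class QualityCase:
    """Hypothetical stand-in for an LLM test case: a prompt and the model's answer."""
    input: str
    actual_output: str


@dataclass
class Metric:
    """Hypothetical metric: a scoring function (0.0 to 1.0) plus a pass threshold."""
    name: str
    score_fn: Callable[[QualityCase], float]
    threshold: float = 0.5


def assert_quality(case: QualityCase, metrics: List[Metric]) -> None:
    """Raise AssertionError if any metric scores below its threshold."""
    failures = []
    for metric in metrics:
        score = metric.score_fn(case)
        if score < metric.threshold:
            failures.append(f"{metric.name}: {score:.2f} < {metric.threshold:.2f}")
    if failures:
        raise AssertionError("quality check failed: " + "; ".join(failures))


def keyword_overlap(case: QualityCase) -> float:
    """Toy scorer: fraction of input words that reappear in the output."""
    wanted = set(case.input.lower().split())
    got = set(case.actual_output.lower().split())
    return len(wanted & got) / len(wanted) if wanted else 0.0


def test_refund_answer_mentions_the_question_terms():
    # pytest collects this like any other test; a low score fails the run.
    case = QualityCase(
        input="refund policy",
        actual_output="Our refund policy allows returns within 30 days.",
    )
    assert_quality(case, [Metric("keyword_overlap", keyword_overlap, threshold=0.5)])
```

Because the helper raises `AssertionError`, pytest reports a quality regression the same way it reports any failed `assert`, which is the property the text attributes to `assert_test`.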
It does NOT run end-to-end agent execution; snippets that call `agent.run()` are expected to fail at the network call, not at import or construction time.