📄️ Overview and basic usage
Streams are a way to sample real plans and plan runs from your Portia cloud account allowing you to monitor the performance of your agents in production.
📄️ Custom Stream evaluators
Evaluators are responsible for the calculation of metrics. To help you get started quickly, Steel Thread provides a built-in LLMJudgeEvaluator for stream based evaluation using LLM-as-Judge. This explained in the previous section on basic usage.
📄️ Visualise Stream results
Stream metrics are pushed to the Portia dashboard. Clicking on any stream will show the latest metrics for it grouped by the time of run to show you the performance of the stream over time.