Notebooks
0001 Agentic Evals Baseline Notebook
A Python notebook for prompt fixtures, scoring checks, and baseline observations for the first experiment.
Published Work
Jupyter notebooks tied to experiments and implementation work.
Notebooks
A Python notebook for prompt fixtures, scoring checks, and baseline observations for the first experiment.