Discussion about this post

User's avatar
H. Floyd's avatar

Grading trajectories instead of final answers is one of those things you only appreciate after an agent fails in a way that looked correct. I have seen the same pattern. Something I have been thinking about alongside this: a paper last week found that in one 5-agent setup, the system entropy roughly doubled every 150 rounds even under fixed conditions. That suggests even a good trajectory eval dataset might need periodic refresh, because the system's stability profile is shifting as it runs.

No posts

Ready for more?