Discussion about this post

User's avatar
Will Kiely's avatar

Even if 50% Success is going super-exponential now, wouldn't you expect 80% Success to need to go exponential before full automation of AI R&D happens?

Jacob Asmuth's avatar

Agreed with everything written in the comments to you so far. Lots of useful feedback for you to iterate on!

1. We already measure 80%, so simply stop reporting 50% as it is clearly becoming saturated and therefore less useful.

2. 80% seems to lag 50% by nearly a full year, so that buys you a LOT of time for your colleagues to build better tasks.

3. Clearly AI agents need much higher success rates than 80%. Investigate 95% success rate measurements in the future for further savings as your tasks begin to become saturated at 80%.

19 more comments...

No posts

Ready for more?