Can AI Follow the Money?

A new benchmark assesses language models’ ability to handle complex financial instructions, revealing surprising strengths in open-weight systems.