AI Financial Reasoning Benchmark vs Traditional LLM Evaluation
Castro Tennant · 30 Jun 2026 · 6 min read