There’s a Benchmark Test That Measures AI ‘Bullshit’—Most Models Fail BullshitBench tests whether AI models can detect nonsensical questions—or if they’ll confidently answer them anyway. The results are dire. Leave a Reply Cancel replyYour email address will not be published. Required fields are marked *Comment * Name * Email * Website Save my name, email, and website in this browser for the next time I comment. Filed under: Bitcoin - @ March 10, 2026 7:26 pm