Still think AI is ready to revolutionize the economy? A new experiment might change your mind.
In a bold test of the latest version of Anthropic’s Claude AI, The Wall Street Journal gave the large language model (LLM) a shot at running an office vending machine. The result was an unmitigated, if unintentionally comical, disaster, forcing the team in charge to pull the plug after three weeks.
The whole thing began as a test called Project Vend, devised by Anthropic’s stress testers, who are known colloquially as the “red team.” Together with the WSJ’s business journalists, they unleashed two AI agents, one named…

