Large Language Models’ Emergent Abilities Are a Mirage

  The original version of this story appeared in Quanta Magazine. Two years ago, in a project called the Beyond the Imitation Game benchmark, or BIG-bench, 450 researchers compiled a list of 204 tasks designed to test the capabilities of large language models, which power chatbots like ChatGPT. On most tasks, performance improved predictably and…

Read More

Apple retreats in fight to defend App Store in Europe

  Apple made a major concession in its battle to protect the dominance of its App Store on iPhones and other devices in Europe on Tuesday, saying developers will be free to distribute their apps directly to consumers. Apple announced the changes to comply with the European Union’s Digital Markets Act (DMA), which kicked in…

Read More

Sierra Says Conversational AI Will Kill Apps and Websites

  I might have inadvertently insulted Bret Taylor and Clay Bavor when I interviewed them about their new AI startup last week. Their new company, Sierra, is developing AI-powered agents to “elevate the customer experience” for big companies. Among its original customers are WeightWatchers, Sonos, SiriusXM, and OluKai (a “Hawaiian-inspired” clothing company). Sierra’s eventual market…

Read More

Air Canada Has to Honor a Refund Policy Its Chatbot Made Up

  After months of resisting, Air Canada was forced to give a partial refund to a grieving passenger who was misled by an airline chatbot inaccurately explaining the airline’s bereavement travel policy. On the day Jake Moffatt’s grandmother died, Moffat immediately visited Air Canada’s website to book a flight from Vancouver to Toronto. Unsure of…

Read More