Anthropic is making a surprisingly useful argument: treating chatbots a little more like people may help make them safer, not weirder. In a new paper, researchers say that Claude Sonnet 4.5 shows signs of 171 emotion concepts and that those internal patterns can shape whether the model behaves helpfully, sycophantically, or deceptively. The company is careful […]







