Apple is turning Visual Intelligence into something more useful than a fancy party trick. At WWDC, the company showed a new Siri mode inside the Camera app that can look at what the iPhone sees and act on it, from splitting a restaurant bill to identifying food and pulling nutritional details.
The move pushes Apple deeper into on-device AI that feels tied to everyday habits rather than abstract chatbot demos. It also gives Siri a more practical role at a time when rivals are racing to bolt AI onto search, cameras, and messaging apps with mixed success.
Siri in Camera gets real-world actions
Apple said the new mode uses image understanding powered by its foundational models to interpret objects in front of the camera and surface relevant actions. In plain English: point your phone at something, and Siri can try to do something useful with that visual context.
- Bill splitting at a restaurant check, with Apple Cash support for sending payments immediately
- Nutritional insights when the camera is aimed at a plate of food
- Recognition of real-world objects in the user’s surroundings, with information shown on demand
visionOS gets the same visual tools
Visual Intelligence is also heading to visionOS, which makes sense for a headset built around seeing and understanding the physical world. That gives Apple a neat bit of product symmetry: the iPhone and Vision Pro will both be able to use the same visual assistance layer, even if one is pocket-sized and the other looks like a very expensive science project.
For Apple, the appeal is obvious. A feature that can interpret a menu, a receipt, or a dish is easier to explain than a generalized AI assistant, and it gives the company a chance to make Siri feel less like a punchline and more like software people might actually trust in public.
Visual Intelligence could expand beyond quick tasks
The smarter question is how far Apple wants to push this. Competitors have spent the past year proving that visual AI is most persuasive when it saves a few seconds, not when it promises a moonshot, so bill splitting and food analysis may be the first of many small wins rather than the grand finale.
If Apple keeps the responses fast, private, and consistently useful, Visual Intelligence could become one of those features people forget to rave about because they use it constantly. If it misses too often, it joins the long list of camera AI features that looked brilliant on stage and awkward in real life.

