|
Microsoft is pitching Copilot Vision as a collaborative assistant that works along with you as you browse the Internet. Vision was part of a suite of new Copilot demonstrations that Microsoft showed off in October, and was easily the most underwhelming example of new Copilot technologies that I'd seen.
Copilot Vision isn't a multimodal AI capability that can interpret the world around you. Instead, it's a cross between Windows Recall and the current version of Copilot. Vision is basically a real-time version of Copilot that's only available in Microsoft's Edge browser. Instead of asking Copilot for insights, Vision apparently can "scan, analyze and offer insights" based on what your eyes, and its AI sensors, see on the page.
Where might Copilot Vision be useful? Dealing with an overwhelming array of information, possibly. Think of a densely packed shopping page, all with similar products; making sense of the mess is what Copilot Vision promises. In this scenario, asking Copilot Vision what items meet your preferences, or offer the most value based on the available features, might be of interest.
In the October demonstration Copilot didn't offer anything insightful, often basically reading aloud what it — and presumably you — saw on the page. Furthermore, Microsoft is taking a very hesitant approach to Vision, possibly as a reaction to the
|
|