xAI’s Grok chatbot has taken a significant step forward with the introduction of Grok Vision, a new feature that allows the AI to “see” and interpret the world through your smartphone’s camera. Similar to the real-time vision capabilities offered by Google’s Gemini and ChatGPT, Grok Vision enables users to point their phone at various objects—products, signs, documents—and ask questions about them.
Announced on Tuesday, Grok Vision is currently available through the Grok app for iOS, although Android users will have to wait for the feature to be rolled out. The camera access, integrated with Grok’s voice mode, allows users to simply ask, “What am I looking at?” and receive instant answers based on the real-world objects in view. You can see a demonstration of this feature through the tweet by Mario Nawfal on April 20, 2025.

Alongside Grok Vision, other new features have been introduced, including multilingual audio and real-time search in Grok’s voice mode. However, Android users will only be able to access these updates if they are subscribed to xAI’s.
Grok’s development has been rapid, with continuous feature additions such as a that allows the bot to remember and reference past conversations. Additionally, the introduction of a canvas-like tool for creating documents and apps further expands Grok’s capabilities, pushing it closer to a versatile AI assistant for everyday tasks.
Also Read : Bezos-Backed Startup Slate Auto Teases Shape-Shifting EV Ahead of Official Launch