Uh-oh, X’s Grok AI can now ‘understand’ images

Rebecka Geddes April 13, 2024

2 2 minutes read

Uh-oh, X’s Grok AI can now ‘understand’ images

New “Vision” Grok will be available to testers and select users.

A phone displaying the Grok xAI logo, which is a white box with a black slash across it.

X’s new Grok “Vision” can process complex visual information.
Credit: Jaap Arriens/NurPhoto via Getty Images

Elon Musk’s AI chatbot can now “understand” images, including information-riddled diagrams and charts. Sorry, doesn’t everyone use the platform once known as Twitter for multi-disciplinary research and optimizing their work flows??

Introduced as Grok-1.5V — Or Grok 1.5 “Vision,” the company’s “first-generation multimodal model” — the bot will be able to not only respond to your uploaded pictures and screenshots but also reason through complex documents, science diagrams, charts, screenshots, and photographs, the company says. Additionally, Grok-1.5V will gain “real-world spatial understanding” to better understand the physical world depicted in the images uploaded by its users.

“Advancing both our multimodal understanding and generation capabilities are important steps in building beneficial AGI that can understand the universe,” the company wrote in its’ announcement. “In the coming months, we anticipate to make significant improvements in both capabilities, across various modalities such as images, audio, and video.”

Example use cases include translating a diagram into Python code, turning a child’s drawing into a bedroom story, pinpointing the largest object among a group of many, and telling a driver if they have enough space to drive around an obstacle.

Grok-1.5V is released along with xAI’s RealWorldQA, an image and prompt dataset designed to test other GenAI models against Grok’s real world reasoning.

Tweet may have been deleted

Competition is the least of Grok’s worries, however. Despite xAI’s continued investment, Grok has yet to stick with early users and staff — a new report alleges its own developers struggle to use the slow xAI API. That same report, published by Fortune this week, highlighted X employee concerns about Musk suggesting Grok write paid user’s posts for them, despite warnings from developers and staff. Last week, Grok came under fire for generating fake news headlines from an alternate reality where Iran had assailed Tel Aviv with a military arsenal — not its first time.

While GenAI chatbots hallucinating realities and generating fake news is par for the course, Grok’s gaffe is indicative of yet another site wide issue. The bot, a par for the course response to ChatGPT from Musk, is integrating into a platform that has slowly whittled away at its defenses against AI gone bad. Combined with X’s all around poor reputation for moderation and the CEO’s own refusal to address misinformation in aid of the site’s “citizen journalists,” Grok occupies a precarious spot in the platform’s besieged information ecosystem.

Grok-1.5V will be available to early testers and select users soon.

Chase sits in front of a green framed window, wearing a cheetah print shirt and looking to her right. On the window's glass pane reads

Chase joined Mashable’s Social Good team in 2020, covering online stories about digital activism, climate justice, accessibility, and media representation. Her work also touches on how these conversations manifest in politics, popular culture, and fandom. Sometimes she’s very funny.

This newsletter may contain advertising, deals, or affiliate links. Subscribing to a newsletter indicates your consent to our Terms of Use and Privacy Policy. You may unsubscribe from the newsletters at any time.

Uh-oh, X’s Grok AI can now ‘understand’ images

Rebecka Geddes

HBO’s ‘The Last of Us’ Review: The Greatest Video Game Adaptation Ever Made

The Emergence of New Technologies: Blockchain and Cryptocurrency

The Potential of 5G Networks in Transforming Connectivity and Communication

The Growth of the Internet of Things (IoT) and its Impact on Daily Life

Reese Witherspoon, Her Mom, and Her Daughter Could Pass for Triplets in Latest Instagram Post

AMD finally compares Radeon RX 7900 XT and Radeon RX 7900 XTX with NVIDIA GeForce RTX 4080 and shares new gaming benchmarks

Rebecka Geddes

With Product You Purchase

Subscribe to our mailing list to get the new updates!

Ford looks to convert Tesla owners with 'Conquest Bonus Cash,' offering $1,500 rebates for F-150 Lightning and Mustang Mach-E electric vehicles

Calvin Kattar vs. Aljamain Sterling prediction, pick, start time, odds for UFC 300

Related Articles

The Best Deals Today: Death Stranding 2, Split Fiction, AirPods Pro 2, and More

HP Just Cut Almost $2000 from Its EliteBook Laptop Packing Intel Core Ultra 7, 32GB DDR5 RAM, and 1TB SSD

Is Charli XCX About to Drop a Taylor Swift Response Song? Fans Think So

Bad Bunny Used His Saturday Night Live Gig to Roast MAGA, During a Punchy 51st Season Premiere