Elon Musk’s AI startup, xAI, is taking braver steps to move beyond the traditional chatbots to creating AI systems that can perceive and speak to the real world.
The company recently poached two prominent AI researchers from NVIDIA, Zeeshan Patel and Ethan He, while continuing to accelerate development of what insiders refer to as “world models,” the potential piece of tech that can transform the nature of how AI systems operate.
These aren’t your run-of-the-mill big language models such as ChatGPT or xAI’s own Grok chatbot. World models mark a giant step ahead since these models are trained on video streams as well as robotics data, unlike text-centric models.
The idea is to develop AI systems that can perceive three-dimensional spaces, forecast object motion as well as interaction, and explore the real world with true understanding.
How xAI is Betting on Video and Robotics Data to Revolutionize AI?
By hiring talent from NVIDIA, the leading-edge company in AI hardware as well as research, xAI is gearing up to compete with big tech giants such as Google and Meta that have both been investing big time in the same technologies. The battle is on, and Musk seems to be determined to be on the front foot.
The key difference between world models and traditional AI lies in the training data. Where chatbots like ChatGPT can perceive language patterns across billions of textual records, they do not actually gain any understanding about the ways the real world behaves. They can describe the trajectory of a ball around the edge of a hill, but cannot predict the course or the physics.

World models change that equation. By training on video and robotics data, these systems develop an internal representation of how the world functions, how gravity works, how objects collide, and how humans move through spaces. This understanding could unlock entirely new applications for AI in gaming, robotics, autonomous vehicles, and virtual reality.
xAI Targets Gaming and Robotics with Advanced World Models and “Omni Team” Hiring
As the Financial Times reported, xAI is actually focusing on specific use cases in gaming and robot automation. For games, world models can potentially allow AI to produce rich, interactive 3D worlds or even full games from a blank page. They can guide robot automation to navigate unknown terrain as well as handle things with unprecedented accuracy.
True to form, Musk has not hesitated to announce his intentions. He recently reiterated his promise to develop “a great AI-written game sometime in the coming year.” Little is revealed about the project, but the deadline suggests that xAI is further along in developing world models than most could possibly think.
The timing of this announcement is noteworthy. Just one week earlier, xAI had launched the new image and video creation model with “monster upgrades” that can be had freely by the users. This seems to suggest that the company is building a suite of end-to-end AI that can read as well as produce content in a number of different formats text, image, video, and interactive 3D worlds, among others.
xAI’s recruitment drive reveals the scope of its ambitions. The company is actively hiring for what it calls its “omni team,” a group focused on creating AI systems capable of understanding and generating content across different formats and modalities.
Technical experts in image and video generation can expect salaries ranging from $180,000 to $440,000, compensation that reflects both the specialized nature of the work and the competitive market for AI talent.
xAI Hires Video Game Tutors to Advance AI into ‘World Models’ and Augment Game Creation
Most intriguingly, xAI has posted a job listing for a “video games tutor” to teach Grok how to craft video games as well as lead individuals to try out AI-powered game development.
The $45-$100 per hour job implies a probable, near-term application of the company’s tech that is realistic, unlike the fully autonomous creation of games. Rather than waiting for fully autonomous creation of games, xAI is apparently building tools to augment human innovation in making games.
The drive towards world models is a fundamental transformation in the development of AI. As much as chatbots have stirred the public imagination and shown commercial worth, their limitations are beginning to emerge. They are word-processing and word-generating tools, not for comprehending or interfacing with the world.
World models could possibly close that gap, enabling systems that talk about the world rather than actually knowing it. Success along those lines for xAI would be validation that the move to start a competitor to Open, the company with whom he co-founded OpenAI but later left due to disagreements over direction and control, was the correct one.
The recruitment of NVIDIA researchers is proof that xAI is intent on playing at the top level. With big wallets, aggressive timelines, and a desire to pay top dollar for the best minds, Musk’s AI company is making itself a force to be reckoned with in the battle to develop the next generation of artificial intelligence.




