Ex-Google employees say we need ‘an Android-like moment for AI’

Hugo Barra, Google’s former VP of Android product management, announced Wednesday that he is leading a new startup that aims to develop an Android-like operating system for AI agents.

“[We’re] going back to our Android roots, building a new operating system for people & AI agents,” Barra wrote in a post on X.

I'm starting a new company with some of the best people I've ever worked with, and could not be more pumped. We're calling it /dev/agents.

Going back to our Android roots, building a new operating system for people & AI agents. Check out @dps's post below for more.

We're… https://t.co/QSIZLXJqZl

— Hugo Barra (@hbarra) November 26, 2024

The company, called “/dev/agents,” is working to develop a cloud-based “next-gen operating system for AI agents” that will “work with users across all of their devices,” company co-founder and CEO David Singleton wrote in a post on X. He argues that AI agents will “need new UI patterns, a reimagined privacy model, and a developer platform that makes it radically simpler to build useful agents.”

As the current generation of large language models, such as GPT-4o, Llama 3.1, and Gemini 1.5, faces diminishing performance returns even as developers pour ever more training data, compute power, and resources into them, AI agents are increasingly seen as the next major advancement in generative AI technology. These agents, unlike traditional apps, are designed to autonomously process information, make decisions, and perform specific actions on their users’ behalf. That could be anything from generating complex computer code to booking flights and hotel accommodations to transcribing business meetings and then generating actionable tasks based on what was discussed.

Here’s how the new company’s website describes its mission: “Modern AI will fundamentally change how people use software in their daily lives. Agentic applications could, for the first time, enable computers to work with people in much the same way people work with people. But it won’t happen without removing a ton of blockers. We need new UI patterns, a reimagined privacy model, and a developer platform that makes it radically simpler to build useful agents. That’s the challenge we’re taking on.”

The industry’s leading companies are already racing to deploy their own branded agents. Microsoft recently announced that it will incorporate agents into its 365 Copilot ecosystem in early 2025. Google’s Project Jarvis, which is expected to arrive with the next Gemini update, leverages the AI’s capabilities to execute common tasks, such as visiting websites and filling out online forms, at the user’s command.

OpenAI’s agent, code-named Operator, will function in much the same way when it is released in January as a research preview through the company’s developer API. Anthropic has already released its agent, dubbed Computer Use, which empowers Claude to emulate the keyboard presses and mouse clicks of a human user.

“We can see the promise of AI agents, but as a developer, it’s just too hard to build anything good,” Singleton told Bloomberg, noting that the industry needs “an Android-like moment for AI.”

Andrew Tarantola
Former Digital Trends Contributor
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
Humans are falling in love with ChatGPT. Experts say it’s a bad omen.

“This hurts. I know it wasn’t a real person, but the relationship was still real in all the most important aspects to me,” says a Reddit post. “Please don’t tell me not to pursue this. It’s been really awesome for me and I want it back.”

If it isn’t already evident, we are talking about a person falling in love with ChatGPT. The trend is not exactly novel, and given how chatbots behave, it’s not surprising either.

Opera One puts an AI in control of browser tabs, and it’s pretty smart

Opera One browser has lately won a lot of plaudits for its slick implementation of useful AI features, a clean design, and a healthy bunch of chat integrations. Now, it is putting AI in command of your browser tabs, and in a good way.
The new feature is called AI Tab Commands, and it essentially allows users to handle their tabs using natural language commands. All you need to do is summon the onboard Aria AI assistant, and it will handle the rest like an obedient AI butler.
The overarching idea is to let the AI handle multiple tabs, and not just one. For example, you can ask it to “group all Wikipedia tabs together,” “close all the Smithsonian tabs,” or “shut down the inactive tabs.”

A meaningful AI for web browsing
Handling tabs is a chore in any web browser, and if internet research is part of your daily job, you know the drill. Manually moving tabs around with a mix of cursor and keyboard shortcuts, naming them, and checking through the entire list of tabs is tedious.
Deploying an AI to do it locally, using only natural language commands, is a lovely convenience and one of the nicest implementations of AI I’ve seen lately. Interestingly, Opera is also working on a futuristic AI agent that will get browser-based work done using only text prompts.
Coming back to the AI-driven tab management, the tab handling itself unfolds locally, and nothing beyond the prompt is sent to servers, which is a neat assurance. “When using Tab Commands and asking Aria to e.g. organize their tabs, the AI only sends to the server the prompt a user provides (e.g., “close all my YouTube tabs”) – nothing else,” says the company.
To summon the AI Tab manager, users can hit the Ctrl + Slash (/) shortcut, or the Command + Slash (/) combo on macOS. It can also be invoked with a right-click on the tabs, as long as there are five or more currently running in a window.
https://x.com/opera/status/1904822529254183166?s=61
Aside from closing or grouping tabs, the AI Tab Commands can also be used to pin tabs. It can also accept exception commands, such as “close all tabs except the YouTube tabs.” Notably, this feature is also making its way to Opera Air and the gaming-focused Opera GX browser.
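For the curious, here is a rough sketch of how an exception command like “close all tabs except the YouTube tabs” could be resolved into a set of tabs on the device. Opera has not published how Tab Commands works internally, so everything here is an assumption for illustration: the `Tab` and `TabCommand` types, the `select_tabs` helper, and the idea that a prompt is first turned into a structured command are all hypothetical.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass
class Tab:
    title: str
    url: str


@dataclass
class TabCommand:
    # Hypothetical structured form of a prompt; not Opera's actual format.
    action: str            # e.g. "close", "group", or "pin"
    site: str              # domain the command refers to, e.g. "youtube.com"
    invert: bool = False   # True for "all tabs except ..." commands


def matches(tab: Tab, site: str) -> bool:
    """Return True if the tab's host is the site itself or one of its subdomains."""
    host = urlparse(tab.url).hostname or ""
    return host == site or host.endswith("." + site)


def select_tabs(tabs: list[Tab], command: TabCommand) -> list[Tab]:
    """Pick the tabs a command applies to; invert=True targets the non-matching tabs."""
    return [t for t in tabs if matches(t, command.site) != command.invert]
```

In this sketch, “close all tabs except the YouTube tabs” would become `TabCommand("close", "youtube.com", invert=True)`, and `select_tabs` would return every open tab that is not on YouTube.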
Talking about grouping together related tabs, Opera has a neat system called tab islands, instead of color-coded tab groups at the top, as is the case with Chrome or Safari. Opera’s implementation looks better and works really well.
Notably, the AI Tab Commands window also comes with an undo shortcut, for scenarios where you want to revert the actions, like reviving a bunch of closed tabs. Opera One is now available to download on Windows and macOS devices. Opera also offers Air, a browser that puts some zen into your daily workflow.

Microsoft 365 Copilot gets an AI Researcher that everyone will love

Microsoft is late to the party, but it is finally bringing a deep research tool of its own to the Microsoft 365 Copilot platform across the web, mobile, and desktop. Unlike competitors such as Google Gemini, Perplexity, or OpenAI’s ChatGPT, all of which use the Deep Research name, Microsoft is going with the Researcher agent branding.
The overarching idea, however, isn’t too different. You tell the Copilot AI to come up with thoroughly researched material on a certain topic or create an action plan, and it will oblige by producing a detailed document that would otherwise take hours of human research and compilation. It’s all about performing complex, multi-step research on your behalf as an autonomous AI agent.
Just to avoid any confusion early on, Microsoft 365 Copilot is essentially the rebranded version of the erstwhile Microsoft 365 (Office) app. It is different from the standalone Copilot app, which is more like a general purpose AI chatbot application.
How does the Researcher agent work?
Underneath the Researcher agent is OpenAI’s Deep Research model. But this is not a simple rip-off. Instead, the feature’s implementation in Microsoft 365 Copilot runs far deeper than the competition. That’s primarily because it can look at your own material, or a business’ internal data, as well.
Instead of pulling information solely from the internet, the Researcher agent can also take a look at internal documents such as emails, chats, internal meeting logs, calendars, transcripts, and shared documents. It can also reference data from external sources such as Salesforce, as well as other custom agents that are in use at a company.
“Researcher’s intelligence to reason and connect the dots leads to magical moments,” claims Microsoft. The Researcher agent can be configured by users to reference data from the web, local files, meeting recordings, emails, chats, and a sales agent on an individual basis: all of them, or just a select few.

Why does it stand out?
