Tech Rivals kapløb om at lancere multimodale AI Wearables - Rapport

Tech Rivals kapløb om at lancere multimodale AI Wearables – Rapport

Major tech companies like Microsoft, Google, OpenAI, and others are racing to integrate multimodal AI to build smart glasses and other wearable devices with front-looking cameras.

Multimodal AI er en kraftfuld form for teknologien, der kombinerer mange datakilder for at gå ud over simple genererede tekstsvar. Den kan forstå tekst, billeder, lyd, video, tale og endda håndbevægelser.

As rapporteret by The Information, big tech companies are betting that multimodal systems can be a good fit for smarte briller with in-built cameras in front as well as other wearable technology.

Læs også: Meta’s Ray-Ban Glasses Now Have AI Capabilities for Sound and Sight

New battle for AI dominance

The vision is shaping up to become a key area of development and AI rivalry for Big Tech in 2024. Many of the companies have talked about this vision or worked on it for several years, the report said.

Now, they are confident they can sell smart glasses powered by AI. For example, OpenAI discussed “embedding” its object recognition software, GPT-4 with Vision, into Snapchat’s Spectacles wearables.

The deal with Snap, the parent company of Snapchat, could result in new features for the smart glasses, The Information wrote. The firm has struggled to turn the device into a mass-market product.

Tech Rivals kapløb om at lancere multimodale AI Wearables - Rapport

Tech Rivals kapløb om at lancere multimodale AI Wearables - Rapport

In February, Snap hinted at how it plans to integrate generative AI into its photo-and-video recording glasses, Spectacles. CEO Evan Spiegel said AI could be used to “improve the resolution and clarity of a Snap after the user captures it,” ifølge til branchemedier.

It could even be used for “more extreme transformations,” like editing images or creating Snaps based on text input, he added.

OpenAI and Microsoft are already working with AI startup Humane, which recently launched a device called the Hej Pin that uses a laser projection system to display text and images on a user’s hand.

The gadget is designed to be worn on clothing and can be tapped to talk to a virtual assistant powered by OpenAI’s GPT-4 technology and cloud computing power from microsoft.

Metas AI-drevne Ray-Ban-briller skaber røre på sociale medier

Metas AI-drevne Ray-Ban-briller skaber røre på sociale medier

Meta leads the industry push

The tech industry push comes as Meta last week revealed the latest version of its Ray-Ban smart glasses, which use AI to “see, hear, and identify things via a built-in camera and microphone.”

When activated, the Ray-Ban can respond to a voice command like, “Is this tea caffeine-free?” by taking a picture, analyzing it, and then providing a response, said Meta CEO Mark Zuckerberg.

But a test by CNET shows that the Ray-Bans hallucinate—the glasses saw things that weren’t really present and went on to give a description of the items. It is a fælles problem with generative AI.

As for Google, in 2013, the company started selling a prototype of its earliest smart glasses, known simply as Glass, for $1,500. The glasses did not catch on, and were criticized as a threat to privacy.

Eventually, Google stoppet producing Glass. The company is now adding multimodal artificial intelligence to ChatGPT rival Gemini and is also expected to incorporate the technology into its wearables.

The integration of multimodal AI into wearables like augmented reality smart glasses typically aims to enhance their functionality and offer users a more immersive experience.

It can also be used for a lot of practical applications, including translating languages, remote support for engineers, and real-time data sharing for soldiers in combat.

In 2022, the global wearables market was valued at about $61 billion, according to skøn. The sector is expected to grow by 15% every year until 2030—faster than the smartphone marked.

Tidsstempel:

Mere fra MetaNews