Qualcomm Exhibits Edge-based Generative AI Applications

Published by: Insights Desk Released: Jun 26, 2023 Source: DemandTalk

Highlights:

The company showed several generative AI applications that run entirely on mobile devices, including a new image generation model, a large language model-based fitness coach, and a 3D reconstruction tool for extended reality.
Qualcomm said ControlNet is powered by a full stack of AI optimizations spanning its model architecture, specialized AI software including the Qualcomm AI Stack and AI Engine, and neural hardware accelerators on the device.

Recently, Qualcomm Inc. took the stage at the annual IEEE/CVF Conference on Computer Vision and Pattern Recognition to announce its latest advancements in edge-based generative artificial intelligence.

The company demonstrated several new mobile-only generative AI applications, such as a new image generation model, a large language model-based fitness coach, and a 3D reconstruction tool for extended reality.

The highlight of Qualcomm’s presentation was ControlNet, a 1.5 billion parameter image-to-image model running on a typical midrange smartphone. The company explained that ControlNet belongs to a class of generative AI algorithms known as language-vision models, which enable more precise control over image generation by conditioning on both an input image and an input text description.

In an on-stage demonstration, Qualcomm showed how ControlNet could generate new images in less than 12 seconds simply by uploading a photo and describing in plain English how it should be edited. In one instance, it uploaded a simple sketch of a kitten with the prompt “yellow kitten, photorealistic, 4k.” Within seconds, the ControlNet-enabled mobile device had turned the sketch into a considerably more impressive image.

Qualcomm explained that ControlNet is powered by a full stack of AI optimizations across its model architecture, specialized AI software such as the Qualcomm AI Stack and AI Engine, and neural hardware accelerators on the device itself.

Qualcomm also showed how it used an LLM similar to OpenAI LP’s ChatGPT to build a digital fitness coach capable of natural, context-aware interactions in real time. The user simply films themselves while exercising, and an action recognition model processes the video on the device.

Then, based on the recognized actions, a stateful orchestrator translates them into prompts that are fed to the LLM, allowing the digital fitness coach to give the user feedback as the workout progresses. Qualcomm noted that this was made possible by three innovations: a vision model trained to identify fitness activities, a language model trained to generate language grounded in visual concepts, and an orchestrator that coordinates the interaction between these two modalities to provide live feedback.
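The recognize, orchestrate, and prompt loop described above can be sketched in a few lines of Python. This is only an illustrative sketch, not Qualcomm's implementation: the `WorkoutOrchestrator` class, its method names, the prompt wording, and the `echo_llm` stand-in for a real language model are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class WorkoutOrchestrator:
    """Hypothetical stateful orchestrator: turns recognized actions into LLM prompts."""

    llm: Callable[[str], str]  # assumed interface: any text-in, text-out model
    rep_counts: Dict[str, int] = field(default_factory=dict)
    history: List[str] = field(default_factory=list)

    def on_action(self, exercise: str, form_issue: Optional[str] = None) -> str:
        """Record one recognized repetition and ask the LLM for feedback."""
        self.rep_counts[exercise] = self.rep_counts.get(exercise, 0) + 1
        reply = self.llm(self.build_prompt(exercise, form_issue))
        self.history.append(reply)
        return reply

    def build_prompt(self, exercise: str, form_issue: Optional[str]) -> str:
        """Summarize the current workout state as a prompt."""
        lines = [
            "You are a fitness coach giving short, encouraging live feedback.",
            f"Current exercise: {exercise}",
            f"Repetitions so far: {self.rep_counts[exercise]}",
        ]
        if form_issue:
            lines.append(f"Observed form issue: {form_issue}")
        return "\n".join(lines)


def echo_llm(prompt: str) -> str:
    """Stand-in for a real LLM: echoes the last line of the prompt."""
    return "Coach saw -> " + prompt.splitlines()[-1]


# In a real system the action recognition model would supply these events.
coach = WorkoutOrchestrator(llm=echo_llm)
coach.on_action("squat")
feedback = coach.on_action("squat", form_issue="knees caving inward")
print(feedback)
```

Keeping the repetition counts and reply history inside the orchestrator is what makes it stateful: each prompt reflects the workout so far rather than a single video frame.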

The 3D reconstruction tool for XR, an umbrella term for augmented, virtual, and mixed reality, allows developers to create highly detailed 3D models of virtually any environment, with the tool running entirely on a mobile device. According to Qualcomm, depth maps are generated from individual images and combined to build 3D scene representations.
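The article does not describe how Qualcomm combines its depth maps, so the following is only a generic sketch of the idea: each depth map is back-projected into 3D points with a pinhole camera model, moved into a shared frame, and merged. The function names, the camera intrinsics (`fx`, `fy`, `cx`, `cy`), the translation-only camera poses, and the voxel-based de-duplication are all assumptions made for illustration.

```python
import math
from typing import Dict, List, Tuple

Point = Tuple[float, float, float]


def unproject_depth(depth: List[List[float]], fx: float, fy: float,
                    cx: float, cy: float) -> List[Point]:
    """Back-project a depth map into camera-space 3D points (pinhole model)."""
    points = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z <= 0:  # treat non-positive depth as "no measurement"
                continue
            points.append(((u - cx) * z / fx, (v - cy) * z / fy, z))
    return points


def translate(points: List[Point], offset: Point) -> List[Point]:
    """Move points into a shared frame (translation only; rotation omitted)."""
    ox, oy, oz = offset
    return [(x + ox, y + oy, z + oz) for x, y, z in points]


def fuse(clouds: List[List[Point]], voxel: float = 0.1) -> List[Point]:
    """Merge clouds, keeping one point per voxel to drop near-duplicates."""
    seen: Dict[Tuple[int, ...], Point] = {}
    for cloud in clouds:
        for p in cloud:
            key = tuple(math.floor(c / voxel) for c in p)
            seen.setdefault(key, p)
    return list(seen.values())


# Two tiny 2x2 depth maps; the second camera sits 1 unit to the right.
depth = [[1.0, 1.0],
         [0.0, 1.0]]
cloud_a = unproject_depth(depth, fx=1.0, fy=1.0, cx=0.5, cy=0.5)
cloud_b = translate(unproject_depth(depth, fx=1.0, fy=1.0, cx=0.5, cy=0.5),
                    (1.0, 0.0, 0.0))
scene = fuse([cloud_a, cloud_b])
print(len(cloud_a), len(cloud_b), len(scene))
```

In this toy case the two views share one point, so the fused scene holds five points rather than six. A production pipeline would also need per-view rotations and a way to estimate depth in the first place.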

Qualcomm stated that these precise 3D maps can be used in a range of AR and VR applications. As an example, it built an augmented reality (AR) scenario in which users fire virtual spheres at real objects, such as walls and furniture, and watch them rebound realistically based on accurate physics calculations.
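The physics behind such a rebound can be illustrated with the standard reflection formula v' = v - (1 + e)(v · n)n, where n is the unit surface normal and e is the coefficient of restitution. The sketch below is a textbook illustration, not Qualcomm's code; it assumes the surface normal is already available (for instance from the reconstructed 3D scene), and the function names and restitution values are hypothetical.

```python
from typing import Tuple

Vec3 = Tuple[float, float, float]


def dot(a: Vec3, b: Vec3) -> float:
    """Dot product of two 3D vectors."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def bounce(velocity: Vec3, normal: Vec3, restitution: float = 0.8) -> Vec3:
    """Reflect a velocity off a surface with the given unit normal.

    restitution = 1.0 is a perfectly elastic bounce; 0.0 removes the
    normal component of the velocity entirely.
    """
    vn = dot(velocity, normal)
    if vn >= 0:  # already moving away from the surface: no response needed
        return velocity
    k = (1.0 + restitution) * vn
    return (velocity[0] - k * normal[0],
            velocity[1] - k * normal[1],
            velocity[2] - k * normal[2])


# A sphere moving down and to the right hits a floor whose normal points up.
floor_normal = (0.0, 1.0, 0.0)
after = bounce((2.0, -4.0, 0.0), floor_normal, restitution=0.5)
print(after)  # (2.0, 2.0, 0.0)
```

The tangential component of the velocity is unchanged while the normal component is reversed and scaled, which is why the sphere keeps moving sideways after it leaves the floor.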

Qualcomm has also applied generative AI to the creation of facial avatars for XR environments. It demonstrated a model that can take one or more 2D photographs of a person’s face, apply a customized mesh and texture, and convert the result into a 3D face avatar.

The avatars can even depict the user’s actions in real-time through headset cameras that monitor the user’s eye and facial movements and recreate them within the avatar. Qualcomm explained that the purpose of this model is to enable users to construct digital human avatars for use in the metaverse and human-machine interfaces on its Snapdragon XR platform.

Finally, Qualcomm demonstrated how it incorporates AI into its driver monitoring technology. Here, it developed a computer vision model capable of detecting unsafe driving conditions and combined it with active infrared cameras that monitor the driver’s status in real time, including signs of distraction or fatigue. Qualcomm stated that the system, which runs on the Snapdragon Ride Flex system-on-chip, can alert the driver whenever it detects dangerous driving.
