Highlights:

  • Point-E’s text-to-image model generates a synthetic rendered object that’s fed to the image-to-3D model, which then generates a point cloud.
  • One caveat: the model can sometimes miss certain parts of objects, resulting in distorted or blocky shapes.

OpenAI LLC this week detailed Point-E, a new artificial intelligence (AI) system that generates 3D models from text prompts. The research group has made the code for Point-E available on GitHub.

There are numerous AI applications that can generate 2D images from user-supplied text descriptions. According to OpenAI, such applications can render an image in seconds or minutes when running on a single data center graphics card, whereas generating a 3D model on comparable hardware typically takes a few hours. Point-E is designed to speed up that process.

According to the research group, the new AI system can generate a 3D model in as little as one minute when running on an Nvidia V100 graphics card.

After receiving a user prompt describing an object, the AI system doesn’t generate a 3D model of it directly. Instead, it first creates a 2D rendering of the object. Point-E then turns that rendering into a 3D point cloud, a set of points in space that serves as a rough outline of the object’s shape.
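Conceptually, the pipeline chains the stages just described. The outline below is a hypothetical Python sketch, not Point-E’s actual API; the functions text_to_image, image_to_point_cloud and upsample_point_cloud are placeholder names standing in for the models this article describes.

    import numpy as np

    def generate_3d(prompt: str) -> np.ndarray:
        """Hypothetical outline of Point-E's two-stage pipeline."""
        # Stage 1: a text-to-image diffusion model renders one
        # synthetic 2D view of the described object.
        image = text_to_image(prompt)           # placeholder name

        # Stage 2: an image-conditioned model produces a coarse point
        # cloud, which an upsampler refines with additional points.
        coarse = image_to_point_cloud(image)    # placeholder name
        return upsample_point_cloud(coarse)     # placeholder name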

Different neural networks carry out each step of the process. The first step, turning the object description into a 2D rendering, is handled by a neural network dubbed GLIDE that OpenAI released last year. The GLIDE version used in Point-E has three billion parameters. Parameters are the configuration settings that define how a neural network processes data.
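As a concrete illustration of the term, a parameter count can be read straight off a model. The snippet below uses a toy two-layer PyTorch network as a stand-in; GLIDE’s three billion parameters are simply the learned weights and biases of many such layers.

    import torch.nn as nn

    # Toy stand-in for a large model: two fully connected layers.
    model = nn.Sequential(nn.Linear(512, 2048), nn.ReLU(), nn.Linear(2048, 512))

    # Every weight and bias counts toward the total.
    num_params = sum(p.numel() for p in model.parameters())
    print(f"{num_params:,} parameters")  # 2,099,712 for this toy network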

After Point-E generates a 2D rendering of an object, two additional neural networks turn the rendering into a point cloud. The first produces a basic, low-resolution point cloud made up of about 1,000 points. The second, an upsampler with the same architecture as the first, adds roughly 3,000 more points to increase the point cloud’s resolution.
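The Point-E repository on GitHub wires these two stages together with a single sampler object. The sketch below is based on the repository’s example notebooks; checkpoint names such as base40M-textvec and the exact constructor arguments should be verified against the published code.

    import torch
    from point_e.diffusion.configs import DIFFUSION_CONFIGS, diffusion_from_config
    from point_e.diffusion.sampler import PointCloudSampler
    from point_e.models.configs import MODEL_CONFIGS, model_from_config
    from point_e.models.download import load_checkpoint

    device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

    # Base model: generates the coarse ~1,000-point cloud from the prompt.
    base_model = model_from_config(MODEL_CONFIGS['base40M-textvec'], device)
    base_model.eval()
    base_model.load_state_dict(load_checkpoint('base40M-textvec', device))

    # Upsampler: same architecture, fills in ~3,000 additional points.
    up_model = model_from_config(MODEL_CONFIGS['upsample'], device)
    up_model.eval()
    up_model.load_state_dict(load_checkpoint('upsample', device))

    sampler = PointCloudSampler(
        device=device,
        models=[base_model, up_model],
        diffusions=[
            diffusion_from_config(DIFFUSION_CONFIGS['base40M-textvec']),
            diffusion_from_config(DIFFUSION_CONFIGS['upsample']),
        ],
        num_points=[1024, 4096 - 1024],  # coarse stage, then upsampling stage
        aux_channels=['R', 'G', 'B'],
        guidance_scale=[3.0, 0.0],
        model_kwargs_key_filter=('texts', ''),  # upsampler ignores the text
    )

    samples = None
    for x in sampler.sample_batch_progressive(
            batch_size=1, model_kwargs=dict(texts=['a red motorcycle'])):
        samples = x  # the final iteration holds the full 4,096-point cloud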

OpenAI’s scientists explained the approach in the Point-E research paper: “For image diffusion models, the best quality is typically achieved by using some form of hierarchy, where a low-resolution base model produces output which is then upsampled by another model. Our upsampler uses the same architecture as our base model.”

The neural networks mentioned above are based on diffusion, a machine learning method introduced in 2015. The same method also powers an image-generation AI that Google LLC launched earlier this year.

To build a diffusion model, engineers take images and corrupt them with a type of error known as Gaussian noise. They then train the model to remove the noise. After repeating the process many times, the neural network learns to generate images from scratch.
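A minimal PyTorch sketch of that training step is shown below. It assumes a generic model(x_t, t) that predicts the added noise, and it uses a deliberately simplified noise schedule; real diffusion models, including Point-E’s, use more carefully tuned schedules.

    import torch
    import torch.nn.functional as F

    def diffusion_training_step(model, x0, num_steps=1000):
        """One denoising-training step: corrupt clean data with Gaussian
        noise, then train the model to predict that noise.
        `model(x_t, t)` is an assumed signature, not Point-E's code."""
        # Pick a random noise level (timestep) for each sample in the batch.
        t = torch.randint(0, num_steps, (x0.shape[0],), device=x0.device)

        # Simplified linear schedule: fraction of clean signal that survives.
        alpha = 1.0 - (t.float() + 1) / num_steps
        alpha = alpha.view(-1, *([1] * (x0.dim() - 1)))  # broadcast over data dims

        # Forward process: mix the clean sample with Gaussian noise.
        noise = torch.randn_like(x0)
        x_t = alpha.sqrt() * x0 + (1 - alpha).sqrt() * noise

        # Train the model to predict the noise it needs to remove.
        return F.mse_loss(model(x_t, t), noise)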

Point-E creates a point cloud of an object and then uses Blender, an open-source 3D graphics application, to convert the point cloud into a 3D model. An automated script typically manages the model-creation process in Blender.
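As a rough illustration of that scripting step, the snippet below loads a point cloud into Blender as a vertices-only mesh using Blender’s built-in bpy API. It is a minimal sketch: turning the points into a watertight surface would take further steps (remeshing or surface reconstruction) that Point-E’s actual scripts handle differently, and the input file name is assumed.

    # Run inside Blender (e.g., blender --background --python this_script.py).
    import bpy
    import numpy as np

    # Assumed input: an (N, 3) array of XYZ coordinates saved beforehand.
    points = np.load("point_cloud.npy")

    # Build a mesh containing only vertices: one per point, no edges or faces.
    mesh = bpy.data.meshes.new("point_cloud")
    mesh.from_pydata(points.tolist(), [], [])
    mesh.update()

    # Wrap the mesh in an object and link it into the current scene.
    obj = bpy.data.objects.new("point_cloud", mesh)
    bpy.context.collection.objects.link(obj)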

OpenAI’s researchers added: “While our method performs worse on this evaluation than state-of-the-art techniques, it produces samples in a small fraction of the time. This could make it more practical for certain applications, or could allow for the discovery of higher-quality 3D objects by sampling many objects and selecting the best one.”