OctoML Introduces a Method for Mass-customizing AI Image Gen Model

Published by: Insights Desk Released: Nov 09, 2023 Source: DemandTalk

Highlights:

A new “Asset Orchestrator” at the heart of the release will enable developers to fine-tune their models using assets like Low-Rank Adaptations, or LoRAs.
The new service, OctoML claims, can produce art generations in an average of 2.8 seconds, significantly speeding up image production with the photo-realistic model Stable Diffusion XL.

Recently, OctoML Inc., a startup specializing in artificial intelligence optimization, announced the release of OctoAI Image Gen. This architecture enables developers to customize image generation on well-known models like Stable Diffusion and simultaneously apply changes to thousands of assets.

Luis Ceze, Chief Executive of OctoML, said, “Image generation applications have quickly gone from fad to real business, with many e-commerce, entertainment and creative organizations looking to differentiate their service with AI. But building these custom experiences with Stable Diffusion today is an extensive engineering effort that simply doesn’t scale.”

In June, OctoAI was introduced to assist developers in creating and expanding their artificial intelligence models. With the addition of this new offering, it can now offer an API endpoint and enable mass fine-tuning with its resources.

The new “Asset Orchestrator,” which is the centerpiece of the release, will enable developers to enhance their models with assets like Low-Rank Adaptations, or LoRAs. Using a LoRA, users can quickly train Stable Diffusion on various concepts, like a specific character or style. LoRAs are fine-tuning models.

Unlike standard image-generating models, which can be cumbersome due to their size, LoRAs generate small portable models, making them useful. Due to their reduced processing power requirements, they are also far faster and simpler to train.

A LoRA can enhance a Stable Diffusion model once it has been trained to produce an image with that particular character or style. Therefore, for fine-tuning a model, LoRAs represent a reasonable trade-off in terms of size, time, and computing power.

Users can prompt a Stable Diffusion model with text to create an image of a video game character or comic book character, for instance. This would probably lead to finicky and inconsistent results — and would probably require a lot of trying to get the model to manifest the image they want. This is called prompt engineering.

A LoRA trained on pictures of that particular character and styles the user desired, like from a certain video game or art style from a certain era of comic books, would more accurately align the model to match the intended results. A lot less engineering would be needed for the model to produce a reasonably good customized image.

Using OctoML’s photo-realistic model Stable Diffusion XL, the new service produces art generation on average in 2.8 seconds, according to OctoML. As part of the asset management feature, users can manage and pull models and data from popular sources like CivitAI, an open-source tool where users can share AI artwork from Stable Diffusion, and Hugging Face Inc.’s open-source AI model repository.

Numerous clients, such as Storytime AI, which creates an app that employs AI to create children’s stories, and NightCafe Studio Ply Ltd., which operates an AI art generator website and community, have already implemented the OctAI image generation solution in their business applications.

Brian Carlson, CEO of Storytime AI, said, “Our top priority is to deliver kid-safe, consistent, engaging images for our custom children’s stories. Previously, this process relied on heavy-handed prompt engineering. But OctoAI helped us stand up a whole new image gen architecture utilizing assets like LoRAs to create consistent visuals without the added complexity of prompt engineering.”

ai governance for the enterprise...

empower ai and real-time insights at the edge...

power ai and analytics workloads with performance,...

how to choose the right ai foundation model...

pros enterprise ai for the industrial industries (...

unlocking ai’s potential: challenges and opportu...

transforming procurement with ai: opportunities, c...

adobe acrobat ai assistant: reinventing productivi...

adobe acrobat ai assistant: reinventing productivi...

ai, automation, and the strategic cao...

an introduction to ai in customer service...

5 ways ai can transform your customer experience...

ciso guide to generative ai attacks...

10 reasons to hire a customer-led voice assistant...

10 reasons to hire a customer-led voice assistant...

the definitive buying guide for contact center her...

cfo's guide to ai...

discover the future of business innovation with ge...

preparing for the future of cx by harnessing the p...

tableau gpt: innovate for the future with generati...

profitable ai-powered data management solutions to...

business-centric cognitive architecture revolution...

ai use cases – innovations for business success...

the role of ai in software development...

ai in cybersecurity – your digital guardian...

how chatbot marketing supports today’s business ...

advanced adaptive ai bolsters business intelligenc...

the dynamic impact of ai in procurement...

ai in customer service – revealing common applic...

how to use dall-e for marketing success...

rpa vs ai: a comparative analysis for business aut...

maximizing business efficiency through ai integrat...

7 trendiest ai marketing campaigns igniting commer...

liquid neural network unveiling the fluid intellig...

the art of prompt engineering in general & marketi...

what is amazon bedrock?...

decode data like never before: chatgpt for data an...

workforce planning models –the power of ai skil...

black friday and the impact of ai in e-commerce...

how digital brain is a game changer for business s...

microsoft introduces bing generative search in lim...

cytoreason raises usd 80 m in the funding round in...

google unveils a suite of new features for ai apps...

kindo reels in usd 20.6 m and acquires whiterabbit...

microsoft’s spreadsheetllm enhances ai’s compr...

herculesai raises usd 26 m to develop and expand i...

intel capital leads usd 15 m investment in ai cons...

aws unveils app studio to accelerate app developme...

captions llc raises usd 60 m for generative video ...

enso technologies secures usd 6 m for smb-focused ...

hebbia raises usd 130 m to develop data search pla...

meta releases four open-source language models...

harvey is reportedly raising usd 100 m at usd 1.5 ...

cloudflare introduces a new no-code feature to pre...

redactive raises usd 7.5 m to expand headcount and...

rapid7 acquires noetic cyber to help businesses fi...

runway ai aims for usd 450 m amid ai startup inter...

gen ai coding assistant startup magic ai aims to r...

anthropic introduces new program to fund enhanced ...

meta to open-source meta llm compiler for code opt...

role of machine learning in networking...

OctoML Introduces a Method for Mass-customizing AI Image Gen Model

Highlights:

Insights Desk

Related posts

Microsoft Introduces Bing Generative Search in Lim...

CytoReason Raises USD 80 M in the Funding Round In...

Google Unveils a Suite of New Features for AI Apps...

Kindo Reels in USD 20.6 M and Acquires WhiteRabbit...

Microsoft’s SpreadsheetLLM Enhances AI’s Compr...

HerculesAI Raises USD 26 M to Develop and Expand i...

Intel Capital Leads USD 15 M Investment in AI Cons...

AWS Unveils App Studio to Accelerate App Developme...

Captions LLC Raises USD 60 M for Generative Video ...

Enso Technologies Secures USD 6 M for SMB-focused ...

Our Brands