Edit Content

-
-
Google released a neural network StyleDrop, it can create images in a precisely defined style

Google released a neural network StyleDrop, it can create images in a precisely defined style

Google_released_StyleDrop_neural network, it_can_create_images

StyleDrop learns the style of any image and helps a generative artificial intelligence model recreate it. Google's method is superior to others, such as Dreambooth, LoRA, or Textual Inversion.

Google's new method lets you synthesize images in a specific style using Muse's text-image model. StyleDrop conveys all the subtleties of custom styling, including color schemes, shading, design patterns, local and global effects. According to Google, only one image is required as input.

StyleDrop learns a new style by fine-tuning a small number of parameters of the training network, and then improves the quality of the model by iterative learning with human or automatic feedback.

StyleDrop learns quickly and on a small number of examples

Specifically, StyleDrop is trained on an input image and generates a set of images to play back that image. From these, the highest quality images are selected, either by CLIP score or by human feedback, and used for further training. An image is considered high quality if it reproduces not the content but the style of the original image.

According to the team, the entire process takes less than three minutes, even with human feedback. That's because StyleDrop requires less than a dozen images for iterative learning, they say.

StyleDrop is superior to other methods of transferring style from text to image, including Dreambooth, LoRAs, Textual Inversion in Imagen and Stable Diffusion, according to the team.

Google released a neural network StyleDrop, it can create images in a precisely defined style

StyleDrop for styles, Dreambooth for objects

"We see that StyleDrop is able to capture nuances of texture, shading and structure across a wide range of styles, much better than previous approaches, allowing much more control over style than was previously possible," the team said.

The team also combined StyleDrop with Dreambooth to explore and create a new object in different styles as an image and can use methods with Muse to create a custom object in a custom style.

Google sees StyleDrop as a versatile tool, one use of which is for designers or companies to practice with their brand's resources and quickly prototype new ideas in their style. More information can be found on the project page StyleDrop.

Google released a neural network StyleDrop, it can create images in a precisely defined style

More in the category

Google_presented_Project_Astra_-_an_innovative_II_assistant
As part of Google's annual I/O developer event, the head of DeepMind's artificial intelligence division, Demis Hassabis, provided a first look at what...
O_2024_announced_a_row_of_high-profile_novelties
The latest Google I/O 2024 event showcases significant innovations in artificial intelligence that deserve a special review. - The family of open...
Google_announces_updates_II_for_search_and_Gemini_for_the_new_S24
As part of the Samsung Galaxy Unpacked conference, Google announced two major search updates: Circle to Search and multisearch based on...
SoundStorm_Google_has_represented_the_revolutionary_instrument_of_the_art
Google has unveiled its latest breakthrough in artificial intelligence technology, SoundStorm, an advanced model for efficient and non-autoregressive audio generation....
Google_opened_free_access_to_users_to_create_music
Google announced the availability to the general public of its MusicLM neural network, which allows you to create music based on text descriptions. The system successfully...
How Siri, Alexa and Google Assistant lost out in the A.I. race
Virtual assistants had more than a decade to become indispensable. But they were hampered by clumsy design and calculation errors, which...
Google is one step closer to creating a 1,000-language artificial intelligence model
Google is developing all sorts of AI technologies, including a universal speech model, which is part of an attempt to create a model that can understand the 1000 most common...