Microsoft Built an AI That Draws What You Tell It To

Mehedi Hassan
Jan 18, 2018
4

Researchers at Microsoft are working on a new artificial intelligence system that’s capable of drawing images based on text descriptions. The new AI, simply called the drawing bot, is based on a technology called a Generative Adversarial Network (GAN), which consists of two different machine learning models.

The first one is used to actually generate the images from text descriptions — while the other, known as the discriminator, is used to score the authenticity of the generated image. These two models work together in order to achieve the best possible accuracy possible in the final drawing.

Microsoft’s drawing bot doesn’t actually use a basic GAN — instead, the company’s researcher designed a new system called the Attentional GAN, or AttnGAN which is capable of perfecting the drawing using the provided description. Microsoft says the regular GAN would not be able to draw pixel-perfect or sharp images based on descriptions where there are a variety of different colours, so the AttnGAN is being used in order to tackle the problem by effectively picking out the key variables from the provided description and matching them against the drawing.

Microsoft’s AttnGAN isn’t just only about improving the accuracy, and it also has the basic common sense of humans. Like every other AI systems, Microsoft used a lot of training data in order to train the models required by the AttnGAN, allowing the system to pick up important details that will be useful when drawing images. “Since many images of birds in the training data show birds sitting on tree branches, the AttnGAN usually draws birds sitting on branches unless the text specifies otherwise,” Microsoft said in a blog post.

Microsoft says the drawing bot could one day be used as sketch assistants to painters, or even help filmmakers save time and money by drawing animation scenes based on the screenplay. For now though, it’s still a work in progress.

Tagged with

About author

Mehedi Hassan

Mehedi has been covering all things Microsoft for years, including all the breaking news about Windows Phone, to all the developments to Windows 10 and other consumer-oriented products from Redmond. Mehedi has gained substantial experience as a developer building rich web-based applications and mobile applications while designing intuitive user experiences on the side.

View Articles

Currently on Forums
Visit the forums
- Drama
  Posted by Omegaman
  
  1
  comment
- [CLOSED] Ask Paul for this Friday, May 22
  Posted by Paul Thurrott
  
  6
  comments
- Microsoft surprises with its first server Linux distribution: Azure Linux 4.0
  Posted by shameer_mulji
  
  9
  comments
- Virtual Machines on Snaodrsgon
  Posted by lenh51
  
  8
  comments
Podcasts
Podcast Hub
- First Ring Daily 1967: Gritty Surfaces
  
  Aired on May 22, 2026 by Brad Sams with 1 Comment
- Windows Weekly 984: For Entertainment Purposes Only
  
  Aired on May 21, 2026 by Paul Thurrott with 1 Comment
- First Ring Daily 1966: Scrape of the Future
  
  Aired on May 21, 2026 by Brad Sams with 2 Comments
- First Ring Daily 1965: Small Break
  
  Aired on May 14, 2026 by Brad Sams with 0 Comments
Join the crowd where the love of tech is real - become a Thurrott Premium Member today!

Explore Premium Benefits

Tagged with

Share post