The technology can generate images of everything from ordinary pastoral scenes, such as grazing livestock, to the absurd, such as a floating double-decker bus, Microsoft said in a blog post.
Each image contains details that are absent from the text description, a sign that the artificial intelligence has a kind of artificial imagination, the company said.
The technology under development in Microsoft's research labs is programmed to pay close attention to individual words when generating images from caption-like text descriptions, the company said.
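The idea of paying attention to individual words can be illustrated with a small sketch. The article does not describe Microsoft's actual mechanism, so the code below assumes a generic dot-product attention step: one image region scores each word of the caption, and the scores are turned into weights that sum to one. The word vectors, the region query and the function names are made up for illustration.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(region_query, word_vectors):
    """Weight each word by how well it matches one image region (assumed dot-product scoring)."""
    scores = [sum(q * w for q, w in zip(region_query, vec)) for vec in word_vectors]
    weights = softmax(scores)
    dim = len(word_vectors[0])
    # The context vector is the weighted average of the word vectors.
    context = [sum(wt * vec[i] for wt, vec in zip(weights, word_vectors)) for i in range(dim)]
    return weights, context

# Hypothetical two-dimensional word vectors for the caption "a red bird".
words = ["a", "red", "bird"]
word_vectors = [[0.1, 0.0], [1.0, 0.2], [0.2, 1.0]]
region_query = [2.0, 0.0]  # hypothetical region that mostly responds to the first feature

weights, context = attend(region_query, word_vectors)
for word, weight in zip(words, weights):
    print(f"{word}: {weight:.2f}")
```

With these made-up numbers the region puts most of its weight on "red", which is the sense in which a generator could consult specific words while drawing specific parts of a picture.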
"If you go to Bing and you search for a bird, you get a bird picture. But here, the pictures are created by the computer, pixel by pixel, from scratch," said Xiaodong He, a principal researcher at Microsoft's research lab in Washington.
He and his colleagues started with technology that automatically writes photo captions, the CaptionBot, and then moved on to technology that answers questions people ask about images, such as the location or attributes of objects, which can be especially helpful for blind people.
Text-to-image generation technology could find practical applications acting as a sort of sketch assistant to painters and interior designers, or as a tool for voice-activated photo refinement, the researchers said.
At the core of Microsoft's drawing bot is a technology known as a Generative Adversarial Network, or GAN.
The network consists of two machine learning models, one that generates images from text descriptions and another, known as a discriminator, that uses text descriptions to judge the authenticity of generated images.
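The two-model arrangement described above can be sketched in a few lines. This is a toy illustration, not Microsoft's system: it assumes single-layer models with random, untrained weights, a made-up three-number text vector and a four-pixel "image", and it uses the standard GAN losses only to show how the two models pull against each other. All names and sizes are hypothetical.

```python
import math
import random

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def generator(text_vec, noise, weights):
    """Map a text vector plus random noise to pixel values between 0 and 1."""
    inputs = text_vec + noise
    return [sigmoid(sum(w * x for w, x in zip(row, inputs))) for row in weights]

def discriminator(image, text_vec, weights):
    """Estimate the probability that the image is real and matches the text."""
    inputs = image + text_vec
    return sigmoid(sum(w * x for w, x in zip(weights, inputs)))

def adversarial_losses(p_real, p_fake):
    """Standard GAN losses: the discriminator wants p_real high and p_fake low,
    while the generator wants p_fake high."""
    d_loss = -(math.log(p_real) + math.log(1.0 - p_fake))
    g_loss = -math.log(p_fake)
    return d_loss, g_loss

rng = random.Random(0)  # fixed seed so the sketch is repeatable
TEXT_DIM, NOISE_DIM, PIXELS = 3, 2, 4  # hypothetical sizes

text_vec = [0.9, 0.1, 0.4]  # stand-in for an encoded caption
noise = [rng.gauss(0, 1) for _ in range(NOISE_DIM)]
g_weights = [[rng.uniform(-1, 1) for _ in range(TEXT_DIM + NOISE_DIM)]
             for _ in range(PIXELS)]
d_weights = [rng.uniform(-1, 1) for _ in range(PIXELS + TEXT_DIM)]

fake_image = generator(text_vec, noise, g_weights)
real_image = [0.8, 0.2, 0.6, 0.4]  # stand-in for a real photo

p_fake = discriminator(fake_image, text_vec, d_weights)
p_real = discriminator(real_image, text_vec, d_weights)
d_loss, g_loss = adversarial_losses(p_real, p_fake)
print(f"discriminator loss: {d_loss:.3f}, generator loss: {g_loss:.3f}")
```

In real training the two sets of weights would be updated in alternation, each to reduce its own loss, so the generator's images gradually become harder for the discriminator to tell apart from real ones.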