Wednesday, December 17, 2025 | 11:56 AM ISTहिंदी में पढें

Home / India News / This Microsoft bot can sketch an image from caption-like descriptions

This Microsoft bot can sketch an image from caption-like descriptions

The core of this bot is a technology known as a 'Generative Adversarial Network' or GAN

IANS San Francisco

Last Updated : Jan 19 2018 | 12:54 PM IST

Microsoft is developing a bot that can draw what you want it to by leveraging Artificial Intelligence (AI) technology -- programmed to pay close attention to individual words when generating images from caption-like text descriptions.

The technology, which the researchers simply call the drawing bot, can generate images of everything from ordinary pastoral scenes -- such as grazing livestock -- to the absurd and a floating double-decker bus.

Each image contains details that are absent from the text descriptions, indicating that this AI contains an artificial imagination.

"If you go to Bing and you search for a bird, you get a bird picture. But here, the pictures are created by the computer, pixel by pixel, from scratch. These birds may not exist in the real world -- they are just an aspect of our computer's imagination of birds," Xiaodong He from Microsoft's research lab in a blog post late on Thursday.

According to results on an industry standard test, reported in a research paper posted on arXiv.org, the bot produced a nearly three-fold boost in image quality compared to the previous state-of-the-art technique for text-to-image generation.

Also Read

New Year 2018: Fintech seen shedding GST, note-ban blues with chatbots, AI

AI could help add $957 bn to Indian economy: Accenture

Skilling enterprises, start-up developers key to India's digital dream: IBM

Secretive Apple tries to open up on artificial intelligence

Cloud computing and AI are most used technology in BFSI sector in 2017

The core of this bot is a technology known as a "Generative Adversarial Network" or GAN.

The network consists of two Machine Learning models -- one that generates images from text descriptions and another, known as a discriminator, that uses text descriptions to judge the authenticity of generated images.

The researchers said that text-to-image generation technology could find practical applications acting as a sort of sketch assistant to painters and interior designers or as a tool for voice-activated photo refinement.

For now, the technology is imperfect.

"For AI and humans to live in the same world, they have to have a way to interact with each other. The language and vision are the two most important modalities for humans and machines to interact with each other," The blog post explained.

More From This Section

Islamic State recruit from Kerala's Kannur killed while fighting in Syria

Explore News

Stock Market LIVE Updates Stocks to Watch Today ICICI Prudential AMC IPO Allotment Gujarat Kidney IPO Parliament Winter Session LIVE Gold-Silver Price Today BGMI Redeem Codes BS-VI Rule in Delhi IPL 2026 Auction Personal Finance