In a breakthrough, researchers have developed an Artificial Intelligence algorithm capable of diagnosing 134 skin disorders and supporting specialists by augmenting the accuracy of diagnoses and predicting treatment options.
The deep learning-based artificial intelligence (AI) algorithm can accurately classify cutaneous skin disorders, predict malignancy, suggest primary treatment options, and serve as an ancillary tool to enhance the diagnostic accuracy of clinicians. With the assistance of this system, the diagnostic accuracy of dermatologists, as well as the general public, was significantly improved. This novel study is reported in the Journal of Investigative Dermatology.
Skin diseases are common, but it is not always easy to visit a dermatologist quickly or distinguish malignant from benign conditions. "Recently, there have been remarkable advances in the use of AI in medicine. For specific problems, such as distinguishing between melanoma and nevi, AI has shown results comparable to those of human dermatologists. However, for these systems to be practically useful, their performance needs to be tested in an environment similar to real practice, which requires not only classifying malignant versus benign lesion but also distinguishing skin cancer from numerous other skin disorders including inflammatory and infectious conditions," explained lead investigator Jung-Im Na, MD, PhD, Department of Dermatology, Seoul National University, Seoul, Korea.
Using a "convolutional neural network," a specialized AI algorithm, investigators developed an AI system capable of predicting malignancy, suggesting treatment options, and classifying skin disorders. Investigators collected 220,000 images of Asians and Caucasians with 174 skin diseases and trained neural networks to interpret those images. They found that the algorithm could diagnose 134 skin disorders and suggest primary treatment options, render multi-class classification among disorders, and enhance the performance of medical professionals through Augmented Intelligence. Most prior studies have been limited to specific binary tasks, such as differentiating melanoma from nevi.
The algorithm's performance was initially compared with the performance of 21 dermatologists, 26 dermatology residents, and 23 members of the general public. Its performance was similar to that of the dermatology residents but slightly below that of the dermatologists. After the initial test, the test participants were informed of the results of the algorithm and subsequently modified their answers. The sensitivity of the malignancy diagnosis of the 47 clinicians improved from 77.4 per cent to 86.8 per cent. Similarly, the sensitivity of the diagnosis of malignancy by the 23 members of the general public improved markedly from 47.6 per cent to 87.5 per cent. Notably, based on the initial result, half of the malignancies would have been missed by the general public without referral to specialists.
"Our results suggest that our algorithm may serve as an Augmented Intelligence that can empower medical professionals in diagnostic dermatology," noted Dr Na. "Rather than AI replacing humans, we expect AI to support humans as Augmented Intelligence to reach diagnoses faster and more accurately."
The researchers caution that AI cannot definitively interpret images that it is not trained to interpret even when the problem presented is straightforward. For example, an algorithm trained only to differentiate between melanoma and nevi cannot differentiate between an image of a nail hematoma and either a melanoma or a nevus. If the shape of the hematoma is irregular, the algorithm may diagnose it as melanoma. They also point out that the algorithm was trained and tested using high-quality images and its performance is generally suboptimal if the input images are of low quality.