"To our knowledge, this is the first example of a predictive model for sexual orientation based on molecular markers," said Tuck C Ngun, first author on the study and a postdoctoral researcher at the David Geffen School of Medicine of the University of California, Los Angeles.
Beyond the genetic information contained in DNA, the researchers examined patterns of DNA methylation - a molecular modification to DNA that affects when and how strongly a gene is expressed - across the genome in pairs of identical male twins.
Thus, by studying twins, the researchers could control for genetic differences and tease out the effect of methylation.
In all, the study involved 37 pairs of twins in which one twin was homosexual and the other was heterosexual, and 10 pairs in which both twins were homosexual.
More From This Section
"A challenge was that because we studied twins, their DNA methylation patterns were highly correlated," Ngun explained.
Even after some initial analysis, the researchers were left with over 400,000 data points to sort through.
To sort through this data set, Ngun and his colleagues devised a machine learning algorithm called FuzzyForest. They found that methylation patterns in nine small regions, scattered across the genome, could be used to predict study participants' sexual orientation with 70 per cent accuracy.
"Previous studies had identified broader regions of chromosomes that were involved in sexual orientation, but we were able to define these areas down to the base pair level with our approach," Ngun said.
The researchers are currently testing the algorithm's accuracy in a more general population of men.
"Sexual attraction is such a fundamental part of life, but it's not something we know a lot about at the genetic and molecular level," Ngun said.
The research was presented at the American Society of Human Genetics (ASHG) 2015 Annual Meeting in Baltimore.