Now this, from Nature:
How alike are you and me? About 99.5%
Nearly six years after the sequence of the human genome was sketched out, one might assume that researchers had worked out what all that DNA means. But a new investigation has left them wondering just how similar one person's genome is to another's.
Geneticists have generally assumed that your string of DNA 'letters' is 99.9% identical to that of your neighbour's, with differences in the odd individual letter. These differences make each person genetically unique — influencing everything from appearance and personality to susceptibility to disease.
But hold on, say the authors of a new study published in Nature [1]. They have identified surprisingly large chunks of the genome that can differ dramatically from one person to the next. "Everyone has a unique pattern," says one of the lead authors, Matthew Hurles at the Wellcome Trust Sanger Institute in Cambridge, UK.
The differences in question, made up of stretches of DNA that span tens to hundreds of thousands of chemical letters, are called 'copy-number variants', or CNVs. Within a given stretch of DNA, one person may carry one copy of a DNA segment, while another may have two, three or more. The region might be completely absent from a third person's genome. And sometimes the segments are shuffled up in different ways.
These variable regions received short shrift for many years. When the human genome sequence was pieced together, they were largely glossed over, both because researchers were focused on finding one overarching reference sequence and because the repetitive nature of the segments makes them hard to sequence. "It was swept under the rug," says Michael Wigler, who is also mapping CNVs at Cold Spring Harbor Laboratory, New York.
The team found nearly 1,500 such regions, taking up some 12% of the human genome. That doesn't mean that your DNA is 12% different from mine (or 88% similar), because any two people's DNA will differ at only a handful of these spots.
According to the team's back-of-the-envelope calculations, one person's DNA is probably 99.5% similar to their neighbour's. Or a bit less. "I've tried to do the calculation and it's very complicated," says Hurles. "It all depends on how you do the accounting."
The answer is also unclear because researchers think that there are many more variable blocks of sequence that are 10,000 or 1,000 letters long and were excluded from the current study. Because of the limits of their methods, the new map mainly identified variable chunks longer than 50,000 letters.
Many of these CNVs are thought to be important in our biology. The team found that 10% of human genes are spanned by these regions, meaning that they might be doubled, deleted or otherwise jumbled in a way that could help to determine whether and when we develop diseases.
You see, I belong to a mailing list, particularly interested in Gender and Genetics, that calls attention to articles like this. Funny, that... Anyway, it's another data point, and yet another illustration of just how much we don't know!