Shining a Light on the Genome's 'Dark Matter'
You know this is going to be good when 'Dark Matter' is included in the title. Plus, there are scare quote. Yippee!!
It used to seem so straightforward. DNA told the body how to build proteins. The instructions came in chapters called genes. Strands of DNA's chemical cousin RNA served as molecular messengers, carrying orders to the cells' protein factories and translating them into action.
Thanks for the introduction to molecular biology.
Between the genes lay long stretches of “junk DNA,” incoherent, useless, and inert.
OMG! Scare quotes! Junk DNA! We were so stupid back then, in the dark ages, when we thought that the genome was made up only of protein coding sequences and other crap that did nothing.
That was then.
Of course, back then. When? I don't know, but I hope Lizzy tells us. It was probably in like 1908 or something.
In fact, gene regulation has turned out to be a surprisingly complex process governed by various types of regulatory DNA...
Gosh darn, I'm so glad we figured out how complex gene regulation was. But when did this happen? Recently? Or in like 1985 or something (I just threw out a year, it doesn't mean anything)?
...which may lie deep in the wilderness of supposed “junk.”
More JUNK! In "Scare" "Quotes". Again.
Far from being humble messengers, RNAs of all shapes and sizes are actually powerful players in how genomes operate.
You mean like tRNAs, which we've known about for decades? Or rRNA which we've known about since 1634 (date is approximate).
Finally, there's been increasing recognition of the widespread role of chemical alterations called epigenetic factors that can influence the genome across generations without changing the DNA sequence itself.
Junk DNA and epigenetics in one article? Someone call the Association for the Regulation of Sensationalist Science Journalism (ARSSJ). I think we may have a violation of the "Maximum Number of Topics that Science Journalists Misunderstand in one Article" (MNTSJM1A, found in section 23Aii of the bylaws, rules, and regulations, right after the part about using Oxford commas) case on our hands.
The scope of this “dark genome” became apparent in 2001, when the human genome was first published. Scientists expected to find as many as 100,000 genes packed into the 3 billion bases of human DNA; they were startled to learn that there were fewer than 35,000. (The current count is 21,000.) Protein-coding regions accounted for just 1.5% of the genome. Could the rest of our DNA really just be junk?
Dark Genome! Junk! Less than 100,000 Genes! Who would have thought? Scientists that's who would have thought there would be less than 100,000 genes. And they thought it way before the human genome was sequenced.
The deciphering of the mouse genome in 2002 showed that there must be more to the story. Mice and people turned out to share not only many genes but also vast stretches of noncoding DNA. To have been “conserved” throughout the 75 million years since the mouse and human lineages diverged, those regions were likely to be crucial to the organisms' survival.
Vast amounts of conserved noncoding DNA! What could it be doing? Maybe regulating the expression of the coding DNA! That's why they looked for conservation in the noncoding DNA -- to find the important regulatory sequences. The hypothesis that there would be conserved noncoding functional sequences motivated the search for those sequences! We didn't need an a posteriori explanation. See here. That paper came out in the year 2000. Two years before 2002.
Edward Rubin and Len Pennacchio of the Joint Genome Institute in Walnut Creek, California, and colleagues figured out that some of this conserved DNA helps regulate genes, sometimes from afar, by testing it for function in transgenic mouse embryos. Studies by the group and others suggested that noncoding regions were littered with much more regulatory DNA than expected.
Further evidence that noncoding DNA is vital has come from studies of genetic risk factors for disease. In large-scale searches for single-base differences between diseased and healthy individuals, about 40% of the disease-related differences show up outside of genes.
40% of SNPs associated with disease risk are outside of genes. But, Lizzy, didn't you just say that 1.5% of the genome is protein coding? So, by my math that means >40x as many SNPs were found in genes than expected if the SNPs were scattered randomly around the genome (note: my math is really bad, and, yes, I do realize that there is an ascertainment bias because SNPs are more likely to be genotype in coding sequences, but still, c'mon, right?).
Genetic dark matter also loomed large when scientists surveyed exactly which DNA was being transcribed, or decoded, into RNA.
Wait, so now "decoded" means "transcribed"? I thought it meant sequenced, as in "J. Craig Venter decoded the genome of the mid-Atlantic braided dottleygart" or whatever the hell the dude has sequenced most recently.
Scientists thought that most RNA in a cell was messenger RNA generated by protein-coding genes, RNA in ribosomes, or a sprinkling of other RNA elsewhere. But surveys by Thomas Gingeras, now at Cold Spring Harbor Laboratory in New York, and Michael Snyder, now at Stanford University in Palo Alto, California, found a lot more RNA than expected, as did an analysis of mouse RNA by Yoshihide Hayashizaki of the RIKEN Omics Science Center in Japan and colleagues. Other researchers were skeptical, but confirmation soon came from Ewan Birney of the European Bioinformatics Institute and the Encyclopedia of DNA Elements project, which aims to determine the function of every base in the genome. The 2007 pilot results were eye-opening: Chromosomes harbored many previously unsuspected sites where various proteins bound—possible hotbeds of gene regulation or epigenetic effects. Strikingly, about 80% of the cell's DNA showed signs of being transcribed into RNA. What the RNA was doing was unclear.
Really? I mean, c'mon now, really? Trotting out Gingeras and the ENCODE data to argue that the entire freakin' genome is transcribed? You're better than that Liz. At least I hope you are. This is the best smackdown of the "dark matter" transcripts hypothesis. But I'm sure you can find others. You're a "journalist", right? Do some journalizing.
Other studies revealed that RNA plays a major role in gene regulation and other cellular functions. The story started to unfold in the late 1990s, when plant researchers and nematode biologists learned to use small RNA molecules to shut down genes. Called RNA interference (RNAi), the technique has become a standard way to control gene activity in a variety of species, earning a Nobel Prize in 2006.
To understand RNAi and RNA in general, researchers began isolating and studying RNA molecules just 21 to 30 bases long. It turned out that such “small RNAs” can interfere with messenger RNA, destabilizing it. Four papers in 2002 showed that small RNAs also affect chromatin, the complex of proteins and DNA that makes up chromosomes, in ways that might further control gene activity. In one study, yeast missing certain small RNAs failed to divide properly. Other studies have linked these tiny pieces of RNA to cancer and to development.
The surprises didn't stop at small RNAs. In 2007, a group led by Howard Chang of Stanford and John Rinn, now at Beth Israel Deaconess Medical Center in Boston, pinned down a gene-regulating function by so-called large intervening noncoding RNAs. Rinn and colleagues later determined that the genome contained about 1600 of these lincRNAs. They and other researchers think this type of RNA will prove as important as protein-coding genes in cell function.
Yes, there are functional non-coding sequences that are transcribed. No one is denying that. Many of them are essential to the organisms in whose genomes they reside. Others are really freakin' useful for those of us doing molecular genetics. All that said, a huge chunk of the human genome is just junk. Dead transposable elements. Dead viruses. Other crap that just sits there, taking advantage of our DNA replication machinery.
Many mysteries about the genome's dark matter are still under investigation. Even so, the overall picture is clear: 10 years ago, genes had the spotlight all to themselves. Now they have to share it with a large, and growing, ensemble.
Sure we (not me, actually) have found a bunch of functional elements in the genome that do not encode proteins. We knew about a lot of those before the human genome was sequenced. Others were subsequently discovered. But that doesn't take away from the fact that the genome is loaded with a bunch of junk. Useless crap that does nothing productive other than exist. Sort of like Elizabeth Pennisi.
It's not her first time but she obviously ignores the critique she recieved at Sandwlk and other places. And head ID-creationist Casey Luskin of the notorious Discovery Institute uses the opportunity to cite some Science piece to create the impression that ID-crationism is science:
ReplyDelete"The December 17, 2010 issue of Science has yet another article explaining why the concept of "junk"-DNA should no longer be given much credence [...]"
link:
http://www.evolutionnews.org/2010/12/subtle-but_important-functions041961.html