Three Billion Base Pairs vs. One Powerful Computer
The human genome comprises roughly three billion base pairs and around 20,000 protein-coding genes, according to recent estimates. That’s a lot of information crammed into the tiny nucleus of a cell, and it doesn’t even include the many genes that do not produce a protein or the fact that most genes come in multiple flavors that differ from person to person. Add to that the way the very same gene can be more active in one person than in another, and you can quickly end up with genomic datasets that would overload nearly any computer. Fortunately for IRP senior investigator Daniel Levy, M.D., the NIH IRP has one of the few computer systems in the world that can handle this mountain of information.