Assembly-Solving Really Big Puzzles
One of the primary duties of a Bioinformacian is to combine little pieces of DNA into bigger pieces. When scientists sequence the genome of a species, it doesn't spit out of a machine in one magical lump. Sequencing machines (that read DNA sequences) produce lots of little sequences of DNA (strings of A's, T's, G's, or C's) 50-700 base pairs (bps) long. They spit out millions of them. The challenge of bioinformatics is to assemble those millions of short reads into the full sequence of the genome. Imagine shredding a textbook and putting the pieces back together. This process is called (no surprise here) Assembly, since we're assembling pieces of DNA into a larger sequence. This process really made a splash in 2003 when the human genome was sequenced ...