For the first draft of the genome sequence, both teams were working to identify the number of human genes. Here, Ewan Birney, a "numbers man" from the public genome project, explains how genes can be recognized and the data from the genome project used.
We're writing computer programs that do that. In fact it's very similar to speech recognition software that people use in, in you know, other fields, say in the telephone industry. So it's very much like a recognition, we're trying to recognize now not parts of speech but parts of DNA that show you where the genes are. So genes start in a particular way and they end in a particular way, and we know some way about how they're put together in the middle. And so we kind of build a mathematical model of how a gene looks, using these techniques.
Keywords:
number of human genes,speech recognition software,human genome project,numbers man,birney,genome sequence,embl,interviewee,parts of speech,mathematical model,first draft,computer programs,dna
For the first draft of the genome sequence, both teams were working to identify the number of human genes. Here, Ewan Birney, a "numbers man" from the public genome project, explains how genes can be recognized and the data from the genome project used.
James Watson describes sequencing the human genome using markers and BACs, and Craig Venter explains using cDNA libraries, ESTs, and shotgun sequencing.
Craig Venter, the leader of the private genome effort, talks about the "whole genome shotgun" technique that was used by Celera Genomics to sequence the human genome.