Feb. 20, 1997

|
After much effort, I have pieced together 71 cytochrome P450 protein sequences
starting
from C. elegans genomic sequences.  Some have been in the GenBank
database for
some
time.  Many others are unfinished cosmid sequences in the ftp repositories at
the Sanger
Center and the Wash U. Genome Sequencing Center.  These later sequences were
downloaded and translated in three frames to identify where the beginning and
ends of the
sequence lie.  The region was fed through the Baylor College of Medicing
GeneFinder
website set for nematode sequences.  If this successfully assembled the gene, I
stopped
there.  If not, then I did it the hard way by inspection of the three translated
frames to find
the most probable protein sequence.  These sequences may not have the correct
intron exon
structures, but they should be pretty close.  I should mention that the Gene
Finder
program was not very successful in constructing P450 genes.  I had to do most of
them by
myself.  To see a list click here.  I will try to
post the
actual sequences soon if I get permission from the sequencers.


THIS REPRESENTS THE LARGEST COLLECTION OF P450S FROM A SINGLE
ORGANISM.  The genome of C. elegans is about 67% complete, so there might
be
as
many as 100 P450 genes in C. elegans, though there are a lot on
chromosome V
for some
reason.  This may skew the results, so there might be much less than 100.  The
odd
distribution is noticeable on CHR I also.  There is only one P450 on CHR I so
far.