Hydra P450s

A preliminary list of 14+ Hydra P450 ESTs taken from Genbank by 
searching for sequences that were from Hydra magnipapillata ESTs 
and then containing the word P450 (128 entries).  The reductase sequences 
have been removed and another false positive called TBP-like 
sequences (TATA binding Protein) that have become linked in 
old entries to a P450 by accident.  Also removed are hits to Trembl TR:O64410
This removed 114 entries/128.  Afew additional entries were added later
These sequences have been translated and sorted into 11 gene bins.  
Combining opposite ends of clones reduces this down to just 9 gene bins.
No genes are complete.  

The sequences are not very differentiated into the known Clans
The 2 clan, 3 clan and 4 clan are present.  No CYP51 is detected.
No mito clan is detected.  
The annotation from the Genbank entries is not necessarily correct.


D. Nelson June 25, 2004

Modified May 25, 2005

current FASTA sequences from hydra

current detailed sequences from hydra CYP1 like (only one seq) >CN775634 tae83e09.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to SW:CP11_OPSTA Q92095 CYTOCHROME P450 1A1 AGAGAGTGAAGAGCTACATAAAAGACTTTTAATGAAATCAAAAACTTCCGTAGATCTGAAAACTGAATTT GGTGCTGCAATTATAAACGTGATTTGTTTCATTGTTTTTGGGGAAAGATACCAGTATTCAAATTCAGAAT TTAAAGAAGTTCTTACAACAATAAACAATATAGTCGATGGGTTGTCAAATACAACTGCTGTCGGTTTTTT GCCGTGGTTGAGATTTTTACCGTTTTCTCCAATAAAAAAACTGAGTATTTCACTTTCAAAATATATTCGT TTTTTAAACGATAAGTTGACCAAACATAAGGAAACATTTAATGAAAACAAAATTCGAGATTCTACTGATT CTATTATAAAC 32% to CYP1C1 aa 173-297 2 ESEELHKRLLMKSKTSVDLKTEFGAAIINVICFIVFGERYQYSNSEFKEVLTTINNIV 175 176 DGLSNTTAVGFLPWLRFLPFSPIKKLSISLSKYIRFLNDKLTKHKETFNENKIRDS 343 344 TDSIIN 361 opposite end of clone = >CN774619 tae83e09.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to SW:CPT7_CHICK P12394 CYTOCHROME P450 17 TTTAAACTACATAAGATTGTTTACTCTTGGAATCAAGTATGCGCAAAATTGTTTTGGAGCATGAGTAATA TTGCATTTTCCAATTAAACTTGGAGGTGGACAACCAGGTGTGCACTCAAACTTAAAATCTCGAATTAATC TAGAAAAGAAGAAAAATAGTTCATTTTCAGCAACTGTTTTCCCTAAACAAACTCTTGTACCGGCTGAAAA AGGTAAGAAACTTGTTGCTTTACTTGGATCAAATTTCTTATCTTTGCCAATCCGCCGATAAGGAATTAAA TTCATTAGGATTTTCCCAGGAGTTAGTAACGTTATGAATCTGCCAATGATTAATCATGACAGTAGCAATT TTCAGGAATGCTATGTGACATGAGAACTCCCCCCCAGAGGT 38% to CYP17A2 Fugu aa 383-485 391 TSGGEFSCHIAFLKIATVMINHWQIHNVTNSWENPNEFN 275 273 PYRRIGKDKKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFKFECTPGCPP 94 93 PSLIGKCNITHAPKQFCAYLIPRVNNLM* 7 combined seq CYP2 clan member [Gene 1] 32% to CYP1C1 aa 173-297 29% to 17A2 2 ESEELHKRLLMKSKTSVDLKTEFGAAIINVICFIVFGERYQYSNSEFKEVLTTINNIV 175 176 DGLSNTTAVGFLPWLRFLPFSPIKKLSISLSKYIRFLNDKLTKHKETFNENKIRDS 343 344 TDSIIN 361 38% to CYP17A2 Fugu aa 383-485 391 TSGGEFSCHIAFLKIATVMINHWQIHNVTNSWENPNEFN 275 273 PYRRIGKDKKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFKFECTPGCPP 94 93 PSLIGKCNITHAPKQFCAYLIPRVNNLM* 7 CYP2 like (2 different sequences) >CN566581 taf98h10.x1 Hydra EST -Kiel 1 Hydra magnipapillata cDNA 3' similar to SW:CPC8_HUMAN P10632 CYTOCHROME CACGCGTCCGCTTTTTAGTCCCTGTGTAATAAGTATTTCTTAGACATAATTTTAAAATGTTTCTTGAAGT TATTGGCGCAGTCTTTATTCCACCTTTGATATGGACTATATGGGTTTACATTAAACATTTAATTGATTGT TTGCATTATCCAAGAGGACCAATACCACTACCATTTATTGGAAATGGTTATTTGATAAGAAAAGCTGAAC CATATAAAGAGTTGGTTAACTTAGGAAAAATATATGGCGATGTTTTTAGTTTTAGCGTTGGTTCAGTCAG ATATGTAATTGTCAACAGTTTAGAAGGAATTCAAGAAGTACTAGTTAAAAAAGGGTGGCAATTTGCTGGT CGTCCAAAAGGTCCAAGTTGGGATAGATCCATTCACGGTCTAATCCAACGTGATCCAAGTAAAAAATTTA AAATATTACGGAAGCTAGCAACATCATCTTTGAAAATCTTTGCTGATGGATTGGCAGGGATGGAAAGTAA AGCTATA 32% to 2X9 aa 26-146 57 MFLEVIGAVFIPPLIWTIWVYIKHLIDCL 144 HYPRGPIPLPFIGNGYLIRKAEPYKELVNLGKIYGDVFSFSVGSVRYVIVNSLEGIQEVL 323 324 VKKGWQFAGRPKGPSWDRSIHGLIQRDPSKKFKILRKLATSSLKIFADGLAGMESKAI 497 opposite end of clone = >CN566859 taf98h10.y1 Hydra EST -Kiel 1 Hydra magnipapillata cDNA 5' similar to SW:CPT7_SQUAC Q92113 CYTOCHROME P450 17 TAATCATAAATGTGTATTTAAAATAAAAATTAAAACTCTTTTTGATAGACAATAGTTAGCTAATAGAAAT TAGAAAAATTTTTAATTAACACATTTTGCCTAAAATTTAAAGGTTAAGCAAACATTTACTAATCGTTTCA TTTTCAAATTATTTACTATTTATCTAGTATCTAATTTTTCAATCAATATAGTTTATATAAATTTAAAAAA AAAAACATTAGCAAAAT TAAAG CAATAAATTTTTACT CCTTGGAACTATTTCAACTTTAAAGTCATAAGG AGTACAAGTGATGCCAAAACTACCTTCTAAGCGCGGTAATTCTTCTAAAATTGGTTTCACAAATCGGAAA TCATTTATCAATCGTGATATAAAAATAAACAACTCAGTTTTTGCCAATGTTTCTCCAATACAGCTACGAG GTCCACTAGAGAATGGTAAATAAGCGTTTCCTAGTTTAGAATTAAATTCGCCAGAATTTTCTAACCAACG TTCAGGGTAAAAACTCATTGCATTTTTCCAGTAACTTTCGTCATGGTGTATACTCCATAGGTTGGTTATT ATAAGAGCCCCTTT GGGGAATAGGTTTTCCACAAATGCTACCTTTCTCC 44% to 17A1 fugu aa 378-485 607 RKVAFVENLFPKGALIITNLWSIHHDESYWKNAMSFYPERWLENSGEFNSKLGNAYLPFSSGPRSCIGETL 395 394 AKTELFIFISRLINDFRFVKPILEELPRLEGSFGITCTPYDFKVEIVPRSKNLLV* 227 combined seq CYP2 clan member [gene 2] 32% to 2X9 aa 26-146 34% to CYP17A 57 MFLEVIGAVFIPPLIWTIWVYIKHLIDCL 144 HYPRGPIPLPFIGNGYLIRKAEPYKELVNLGKIYGDVFSFSVGSVRYVIVNSLEGIQEVL 323 324 VKKGWQFAGRPKGPSWDRSIHGLIQRDPSKKFKILRKLATSSLKIFADGLAGMESKAI 497 44% to 17A1 fugu aa 378-485 607 RKVAFVENLFPKGALIITNLWSIHHDESYWKNAMSFYPERWLENSGEFNSKLGNAYLPFSSGPRSCIGETL 395 394 AKTELFIFISRLINDFRFVKPILEELPRLEGSFGITCTPYDFKVEIVPRSKNLLV* 227 >CB271974 taa28h08.x3 Hydra EST -II Hydra magnipapillata cDNA 3' similar to SW:CPT7_ONCMY P30437 CYTOCHROME P450 17 TTCATAACTTTTTGTTTTCATAAACATAACTTTTTTATTAATGTAAAACAATTTTTTCCAAGAGATGTTT CAATTTCAGCTTTAAATTCCAACAAATTAAAATTAATTTTC T TACGCTTCTATTTTTTGGCTGATCAAGT TGTTTGATCGCGGTTTTAGAATAATGTATGACGTATCAGGAGTTAAGGTTACTCGCAAAACACCGTTTAA ACTTGGAGTAGATGCGCCAGGTTTCAAACAAAGATTATTTTCTCTTATCAATCGGGTAAAAAACAAAAAA ATGATGGATTCGGTTGTGTGATAACCTAAGCAGGCTCTTAATCCGGTTGAAAATGGAAGATAGTTTGGGT TT 40% to 2N12 36% to 2D6 C-term aa 435-498 351 NPNYLPFSTGLRACLGYHTTESIIFLFFTRLIRENNLCLKPGASTPSLNGVLRVTLTPDTSYIILKPRSNNLISQKIEA* 112 >CB073123 taa28f06.y1 Hydra EST -II Hydra magnipapillata cDNA 5' similar to SW:CPT7_ORYLA P70085 CYTOCHROME P450 17 AGAATATCTTTGACAAAAGCAAAAACCCAAACTATCTTCCATTTTCAACCGGATTAAGAGCCTGCTTAGG TTATCACACAACCGAATCCATCATTTTTTTGTTTTTTACCCGATTGATAAGAGATTTTAATCTTTGTTTG AAACCTGGCGCATCTACTCCAAGTTTAAACGGTGTTTTGCGAGTAACCTTAACTCCTGATACGTCATACA TTATTCTAAAACCGCGATCAAACAACTTGATCAGCCAAAAAATAGAAGCGTAAGAAAATTAATTTTAATT TGTTGGAATTTAAAGCTGAAATTGAAACATCTCTTGGAAAAAATTTTTTACATTAATAAAAAAGTTATGT TTATGAAAACAAAAAGTTATGAAATTTGTAATTAAATATAAGTTATAAAGTTGTAAAACTGAATGTCTTT AATGTTTTTTATTTTCAACATGATTTCTT Same seq as CB271974 same as CB271957 taa28f06.x3 same as BP505786 3 NIFDKSKNPNYLPFSTGLRACLGYHTTESIIFLFFTRLIRENNLCLKPGASTPSLNGVLR VTLTPDTSYIILKPRSNNLISQKIEA* >BP505786 NIDEQLSEDNMEMMLQDIFISGSETTISTLLWFAVYLVNWPKYQDDIYDETIKIVGNDRY PSLSDRPKLHLFESAMKETLRLSSVIPLGLPHRSLEETSIKKFKIPKNTNVMINLWQLHH DSKSWSDPHTFNPYRWLNDK NIFDKSKNPNYLPFSTGLRACLGYHTTESIIFLFFTRLIRDFNLCLKPGAS >Combined seqeunces BP505786 and CB073123 and CB271974 40% to CYP17A [gene 3] NIDEQLSEDNMEMMLQDIFISGSETTISTLLWFAVYLVNWPKYQDDIYDETIKIVGNDRY PSLSDRPKLHLFESAMKETLRLSSVIPLGLPHRSLEETSIKKFKIPKNTNVMINLWQLHH DSKSWSDPHTFNPYRWLNDKNIFDKSKNPNYLPFSTGLRACLGYHTTESIIFLFFTRLIRENNLCLKPGASTPSLNGVLR VTLTPDTSYIILKPRSNNLISQKIEA* CYP3 like (two different sequences) >CN567799 opposite end = CN567598 tag12b09.x1 tag12b09.y1 Hydra EST -Kiel 1 Hydra magnipapillata cDNA 5' similar to TR:Q9PVE8 Q9PVE8 CYTOCHROME P450 3A30 TTACTTTGTATTTAAAAATCATCAAAAGAAAACCCCAAACAATCATTATTATAAATGTTAGGACTTAAAG TTTTAAATAAAGTTTATTCTTCTGTTGAATATTATTCTGCAATAGGTTTTA CTCTAATTAGCAATGGTTC AACTGTTCTCATAGTTAACCAAATAATCTTTGTATAATTTATATGCATAGGGTCAATAGAAAATTTGAAT TGCAACAACGCTTTGACCAAGAATGTTTTAATTTCAAGCAAAGCAAAGTTCTTTCCAATACAGTTATAAA CACCGTGCCCAAAAGGAAGATAGAATGATGCTGGTATTTCCCCTGTCATAAACCTTTCAGGTTTAAATGA ATGAGGATCTCGATAAATAGTCTCATTCATGTGTAGATTGCTAACATGAGTTTGAAGTACTGTGTTTGCA GGAATTTCATATTCGCCAAATTTTATTGCATTAATGGTCTCTCTGCCTAAAACTGGTGCAAGTCCATGCA AGCGCAAGGTTTCTTTAACGACACAATCCAAATATTTTAAACTCGTAACTTCTTCAAAAGAAATATTATT TACATCCAATTTATTTTTCTGAATCTCTTCTCTAAGTTTCTCTTGAACCTCTAAGTTAATGGCTAGCATA TATAAACACAAAGTCAAAGTAGTAGAAATTGTTTCATAACCAGCTATGAAAAA Combined N and C-terms >CN567598 tag12b09.x1 [gene 4] N-term RVRFLLRYLLKRIFHPLRFLPSPKEQLITGHINHFQGRDHSSTYLSFNEKFKEESLCTLD TLHVPRFVYLIAPEFIKKIFADGKLFQRSKSIRTLAPLIGNSMVGSNYEHHHWQRKLFNG AFTSQQLKNYFPAFLKHTNLLMKLWSYTCDKESGTNLTVLDDLSNLSF 39% to 3A27 trout aa 307-472 58% to CN770283 [gene 4] 683 FFIAGYETISTTLTLCLYMLAINLEVQEKLREEIQKNKLDVNNISFEEVTSLKYLDCVVK 504 503 ETLRLHGLAPVLGRETINAIKFGEYEIPANTVLQTHVSNLHMNETIYRDPHSFKPERFMT 324 323 GEIPASFYLPFGHGVYNCIGKNFALLEIKTFLVKALLQFKFSIDPMHINYTK 168 167 IIWLTMRTVEPLLIRVKPIAE* 102 >CN770283 58% to CN567799 tad87b02.y2 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to SW:CP4C_BLADI P29981 CYTOCHROME P450 4C1 AAAGAATGTATTTGATATAAAAAGTGTTTCATATGAAGAAGTTTTGAGTATCAAGTACTTAGATTGTGTT GTTAAAGAAACATTGCGCATGCATACACCTGTAGCATTTATTGGTAGAATAAATAAGAATCAAACAAAGT TTGGTGACTTTGATGTACCTGCTGGCTCTTTTTTACGAATTCCTATTGACAGTGCACATATGAACGAGTC TGTTTATCATGATCCTCATTCATTTAGACCACAACGATTCTTGACAGGTGAAATACCACCATTATCCTTC CTTACATTTGGGCAAGGTACATATAATTGTATCGGAAAGAATTTTGCTTTGCTTGAAATCAAAACATTCT TGGTCAAAGCGTTGCTGCAATTCAAATTTTCAGTAGACCTTAAGCGTTTGGAAATTAACAAGCTGAAT 35% to 3A27 trout aa 343-472 2 KNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTKFGDFDVPAGSFLRI PIDSAHMNESVYHDPHSFRPQRFLTGEIPPLSFLTFGQGTYNCIGKNFALLEIKTFLVKA LLQFKFSVDLKRLEINKLN 418 >CN776982 taf28f06.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to TR:Q9VXY0 Q9VXY0 CG9081 PROTEIN. ;. Length = 316 Query: 79 RPQRFLTGEIPPLSFLTFGQGTYNCIGKNFALLEIKTFLVKALLQFKFSVDLKRLEINKL 138 R +RFLTGEIPPLSFLTFGQG YNCIGKNFALLEIKTFLVKALLQF+FSVDLK L KL Sbjct: 307 RQERFLTGEIPPLSFLTFGQGIYNCIGKNFALLEIKTFLVKALLQFEFSVDLKHLNYKKL 128 QRTRQERFLTGEIPPLSFLTFGQGIYNCIGKNFALLEIKTFLVKALLQFEFSVDLKHLNY KKLISITNKTVEPLWIRVKPI* Combined CN776982 and CN770283 [gene 5] 2 KNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTKFGDFDVPAGSFLRI PIDSAHMNESVYHDPHSFRPQRFLTGEIPPLSFLTFGQGIYNCIGKNFALLEIKTFLVKALLQFEFSVDLKHLNY KKLISITNKTVEPLWIRVKPI* CYP4 Like (only one seq) >CN627429 tae92b11.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to SW:CP51_CANGA P50859 CYTOCHROME P450 51 GATAATGGTACTAACATAATTGTTTTGGATGACTTATCTAATTTATCATTTGATATAATTGGGGATGTTG GTTTTGGCTATCAATTTAACACAATTACTTCTCATTCTGGTAATGAGTTTACAAAAGCGCTTCAGAGTTA TTGTCAACTACGATTTCAATTGAATGCCGTCCATAAAGCTCTACTAGCTTTCTTTCCATTTTTAATGCAT CTGTCATTTATGTATGGAAAACGTAAACGAGCTGAGCAAGTCATCTGTAATACTTTAAACATGCTTATTA ATAAACGCAAAAAAGAGATAGACCACCGAATAGCAGCTGATCAAAAAGATTTTTTAACAGTTGTTTTAAA AGATCAACAAAAAGAAGGCAACAAGATGACAAATGACTTGATTAAAAATAATCTGATGACGCCTTTAATT GCAGGTCACAAAACAACTTCCACTGACATGCCATGGTGTTTCAACGTGCTTGCGCCAAACCCAAGTGCTA CCAAACACATGCAAAGAAACAGAAAGAAA GAATACATCT CGACCACAAAA 31% to 4T5 aa 183-347 1 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNAVHKALLA 189 190 FFPFLMHLSFMYGKRKRAEQVICNTLNMLINKRKKEIDHRIAADQKDFLTVVLK 351 352 DQQKEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK 540 CN770090 taf75f08.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to TR:Q40411 Q40411 PUTATIVE CYTOCHROME P-450. ;. Length = 299 Score = 113 bits (283), Expect = 1e-26 Identities = 54/56 (96%), Positives = 55/56 (98%) Frame = -1 Query: 1 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNA 56 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQL + Sbjct: 170 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLKS 3 CN775805 tae77f11.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to TR:Q42700 Q42700 CYTOCHROME P450 ;. Length = 562 Score = 58.5 bits (140), Expect = 5e-10 Identities = 27/27 (100%), Positives = 27/27 (100%) Frame = +3 Query: 1 DNGTNIIVLDDLSNLSFDIIGDVGFGY 27 DNGTNIIVLDDLSNLSFDIIGDVGFGY Sbjct: 480 DNGTNIIVLDDLSNLSFDIIGDVGFGY 560 >Combined CN627429 CN775805 27% to 4T5 [gene 6] GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFK ERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNGIFVSNYEDHH WQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDK 1 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNAVHKALLA 189 190 FFPFLMHLSFMYGKRKRAEQVICNTLNMLINKRKKEIDHRIAADQKDFLTVVLK 351 352 DQQKEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK 540 CYP17 like (4 different sequences) >CN769570 taf31c10.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to SW:CPT7_CHICK P12394 CYTOCHROME P450 17 ACAGAAAATCTTACGATGAGAATAATTTACGTGACATAACCGATGCATTAATAAAAGTGTCTTTAGATTC AGAGATGGGTGAAGAATTAACTGAAAAGATTACTGATGATAATATTGAGTTTCTTTTAAACGATTTTATG ATTGCTGGATCCGAAACTTCATCAAGTACTATTCTTTGGTTTATTGTTTACATGTTACATTGGCCAGAAT ACCAAAATAAACTTTATGATGAAATTACTAAAGTAGCATCAGATAACCGTTATGTATCTTTAAAGGATCG ACCTATGCTTCATTTAATGCAAGCTACAATTCATGAAACACTTAGACTGTCATCGGTGGTACCTCTTGGT TTGGTTCATAAAGCAATGGAGAACAGTAGCATTTGTGGCAAGTTTGTTCCTAAGGGAGCTCTTATTTTAA CAAATTTATGGAGTATGCATCACGATGAAAGCTATTGGAAAAATGCAATGAGTTTTTACTCGGAACGTTG GCTGGAAAAATCTGGCGAGTTCCATTATAAATTGGGGTACGCATAATTACCGTTTTCTATAGGG 35% to CYP17A zebrafish aa 266-449 3 RKSYDENNLRDITDALIKVSLDSEMGEELTEKITDDNIEFLLNDFMIAGSETSSSTILWF IVYMLHWPEYQNKLYDEITKVASDNRYVSLKDRPMLHLMQATIHETLRLSSVVPLGLVHK AMENSSICGKFVPKGALILTNLWSMHHDESYWKNAMSFYSERWLEKSGEFHYKLGYA*LP FSIG 554 >CN769290 opposite end = CN769570 taf31c10.y1 taf31c10.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to SW:CPT7_RANDY O57525 CYTOCHROME P450 17 TTATCTTGCTTTTCACGTTTCCTTGGTTATTCCATATTTTTGAAATTTTTAAATATTATATAAAGCAAAA ATACAGAAAAGTGAAGCAAAAATAGTTAATTTCTTGGAATTATCACGACTTCAAAGTCATTAGGAGGGGA GGTGATTCCAGAACGACCATCTAAACAAGGTAACTCTTTTCCAGTTGGCATTTCAAATCGGTAATCTTTA AGTAATCGTGTAATAAACACAAACAACTCTGTTTTTGCCAATGTTTCTCCTAAACAACTACGAGGTCCAT TAGAAAACGGTAAATATGCGTACCCCAATTTATAATTGAACTCGCCAGATTTTTCCAGCCAACGTTCCGG GTAAAAACTCATTGCATTTTTCCAATAGCTTTCATCGTGATGCATACTCCATAAATTTGTTAAAATAAGA GCTCCCTTAGGAACAAACTTGCCACAAATGCTACTGTTCTCCATTGCTTTATGAACCAAACCAAGAGGTA CCACCGATGACAGTCTAAGTGTTTCATGAATTGTAGCTTGCATTAAATGAAGCATAGGTCGATCCTTTAA AGATACATAACGGTTATCTGATGCTACTTTAGTAATTTCATCATAAAGTTTATTTTGGTATTCTGGCCAA TGTAACATGTAAACAATAAACCAAAGAATAGTACTTGATGAAGTTTCGGATCCAGCAATCATAAAATCGT TTACAAGAAACTCAATATTATCCTCAGT 42% to CYP17A aa 299-503 728 TEDNIEFLVNDFMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLKDR PMLHLMQATIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDESYWK NAMSFYPERWLEKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKDYRFEM PTGKELPCLDGRSGITSPPNDFEVVIIPRN* >Combined seq from CN769290 and CN769570 39% to CYP17A [gene 7] RKSYDENNLRDITDALIKVSLDSEMGEELTEKITDDNIEFLLNDFMIAGSETSSSTILWF IVYMLHWPEYQNKLYDEITKVASDNRYVSLKDR PMLHLMQATIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDESYWK NAMSFYPERWLEKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKDYRFEM PTGKELPCLDGRSGITSPPNDFEVVIIPRN* >CN774619 tae83e09.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to SW:CPT7_CHICK P12394 CYTOCHROME P450 17 TTTAAACTACATAAGATTGTTTACTCTTGGAATCAAGTATGCGCAAAATTGTTTTGGAGCATGAGTAATA TTGCATTTTCCAATTAAACTTGGAGGTGGACAACCAGGTGTGCACTCAAACTTAAAATCTCGAATTAATC TAGAAAAGAAGAAAAATAGTTCATTTTCAGCAACTGTTTTCCCTAAACAAACTCTTGTACCGGCTGAAAA AGGTAAGAAACTTGTTGCTTTACTTGGATCAAATTTCTTATCTTTGCCAATCCGCCGATAAGGAATTAAA TTCATTAGGATTTTCCCAGGAGTTAGTAACGTTATGAATCTGCCAATGATTAATCATGACAGTAGCAATT TTCAGGAATGCTATGTGACATGAGAACTCCCCCCCAGAGGT 38% to CYP17A2 Fugu aa 383-485 391 TSGGEFSCHIAFLKIATVMINHWQIHNVTNSWENPNEFN 275 273 PYRRIGKDKKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFKFECTPGCPP 94 93 PSLIGKCNITHAPKQFCAYLIPRVNNLM* 7 >CN570733 same as CN570522 BP505786 tag42d11.y1 Hydra EST -Kiel 2 Hydra magnipapillata cDNA 5' similar to SW:CPT7_ORYLA P70085 CYTOCHROME P450 17 AGCTGGTTTCTTCAAGACTTCTGTGAGGTAAACCTAATGGAATGACAGACGACAAACGCAGAGTTTCTTT CATAGCACTTTCAAATAAATGAAGCTTTGGACGATCTGAAAGACTAGGATACCTATCATTACCGACTATT TTAATAGTTTCATCATAGATATCATCTTGATACTTTGGCCAGTTAACTAAATAAAC VYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFESAMKETLRLSSVIPLGLPHRSLEETS >CN570522 same as CN570733 tag42d11.x1 Hydra EST -Kiel 2 Hydra magnipapillata cDNA 3' similar to SW:CPT7_ORYLA P70085 CYTOCHROME P450 17 GGTGTTTATTTAGTTAACTGGCCAAAGTATCAAGATGATATCTATGATGAAACTATTAAAATAGTCGGTA ATGATAGGTATCCTAGTCTTTCAGATCGTCCAAAGCTTCATTTATTTGAAAGTGCTATGAAAGAAACTCT GCGTTTGTCGTCTGTCATTCCATTAGGTTTACCTCACAGAAGTCTTGAAGAAACCAGC 43% to CYP17 aa 326-389 same as BP505786 GVYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFESAMKETLRLSSVIPLGLPHRSLEETS >CN566859 opposite end = CN566581 taf98h10.x1 taf98h10.y1 Hydra EST -Kiel 1 Hydra magnipapillata cDNA 5' similar to SW:CPT7_SQUAC Q92113 CYTOCHROME P450 17 TAATCATAAATGTGTATTTAAAATAAAAATTAAAACTCTTTTTGATAGACAATAGTTAGCTAATAGAAAT TAGAAAAATTTTTAATTAACACATTTTGCCTAAAATTTAAAGGTTAAGCAAACATTTACTAATCGTTTCA TTTTCAAATTATTTACTATTTATCTAGTATCTAATTTTTCAATCAATATAGTTTATATAAATTTAAAAAA AAAAACATTAGCAAAAT TAAAG CAATAAATTTTTACT CCTTGGAACTATTTCAACTTTAAAGTCATAAGG AGTACAAGTGATGCCAAAACTACCTTCTAAGCGCGGTAATTCTTCTAAAATTGGTTTCACAAATCGGAAA TCATTTATCAATCGTGATATAAAAATAAACAACTCAGTTTTTGCCAATGTTTCTCCAATACAGCTACGAG GTCCACTAGAGAATGGTAAATAAGCGTTTCCTAGTTTAGAATTAAATTCGCCAGAATTTTCTAACCAACG TTCAGGGTAAAAACTCATTGCATTTTTCCAGTAACTTTCGTCATGGTGTATACTCCATAGGTTGGTTATT ATAAGAGCCCCTTT GGGGAATAGGTTTTCCACAAATGCTACCTTTCTCC Combined seq Opposite end MFLEVIGAVFIPPLIWTIWVYIKHLIDCLHYPRGPIPLPFIG NGYLIRKAEPYKELVNLGKIYGDVFSFSVGSVRYVIVNSLEGIQEVLVKKGWQFAGRPKG PSWDRSIHGLIQRDPSKKFKILRKLATSSLKIFADGLAGMESKAI 44% to 17A1 fugu aa 378-485 [gene 2] 607 RKVAFVENLFPKGALIITNLWSIHHDESYWKNAMSFYPERWLENSGEFNSKLGNAYLPFSSGPRSCIGETL 395 394 AKTELFIFISRLINDFRFVKPILEELPRLEGSFGITCTPYDFKVEIVPRSKNLLV* 227 CYP46 like (only one seq) >CN775805 tae77f11.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to TR:Q42700 Q42700 CYTOCHROME P450 TCGGCATAGCAGTATTAATTTTTTTGTGTTTTTCACTGTTTTTTGCTAATATTTTAAAACGTTTTTATCA TCCGCTTCGTAAGTTGCCATCACCTAAAGAAAATTTCTTTACTGCTCATTATGGCTACTTTAATGGCTAT GATCAAATAAATGCTGTAATAAATTTTGGAAAACAGTTTAAAGAGCGTGGCTTGTATACATTAGATACAT TAAATGGATTTAGATTTGTTAATCTTTTAATGCCAGAATTTATTAAAACAGTGTTTTCTGATGGAAACTC ATTCCAAAGATCGACCGCTACAAAAGTTATATTTCCTCTAGTTGGAAATGGTATTTTTGTGTCAAATTAT GAAGATCATCATTGGCAAAGAAAAGTGTTAAATGAAGCTTTTACTTTACAACAGCTAAAAAATTATTTTC CAGCTTTTACAGTGCACATTGATTTGCTAATGAAACTTTGGTCATATTCATGTGACAAGGATAATGGTAC TAACATAATTGTTTTGGATGACTTATCTAATTTATCATTTGATATAATTGGGGATGTTGGTTTTGGCTAT CA N-term 26% to CYP46a zebrafish aa 10-203 3 GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINF 167 168 GKQFKERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNG 332 333 IFVSNYEDHHWQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDKDNGTNII 500 501 VLDDLSNLSFDIIGDVGFGY 560 >gi|47138506|gb|CN627429.1|CN627429 tae92b11.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to SW:CP51_CANGA P50859 CYTOCHROME P450 51 ;. Length = 540 Score = 58.5 bits (140), Expect = 5e-10 Identities = 27/27 (100%), Positives = 27/27 (100%) Frame = +1 Query: 160 DNGTNIIVLDDLSNLSFDIIGDVGFGY 186 DNGTNIIVLDDLSNLSFDIIGDVGFGY Sbjct: 1 DNGTNIIVLDDLSNLSFDIIGDVGFGY 81 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNAVHKA LLAFFPFLMHLSFMYGKRKRAEQVICNTLNMLINKRKKEIDHRIAADQKDFLTVVLKDQQ KEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK Combined CN775805 and CN627429 [gene 8] 3 GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINF 167 168 GKQFKERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNG 332 333 IFVSNYEDHHWQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDKDNGTNII 500 501 VLDDLSNLSFDIIGDVGFGY 560 QFNTITSHSGNEFTKALQSYCQLRFQLNAVHKA LLAFFPFLMHLSFMYGKRKRAEQVICNTLNMLINKRKKEIDHRIAADQKDFLTVVLKDQQ KEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK BP514308 BP514308 Hydra magnipapillata cDNA library Hydra magnipapillata cDNA clone hydmg002bw_87. Length = 586 Score = 66.6 bits (161), Expect = 1e-12 Identities = 28/56 (50%), Positives = 37/56 (66%) Frame = +2 Query: 5 LIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFK 60 +I + F A KRFYH R LPSPKE+ T HY YF+ +D +N ++NFGK+FK Sbjct: 53 IIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFGKEFK 220 >BP514308 N-term 25% to 46a [gene 9] MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHD HVNTLLNFGKEFKDYGLYTINTLIGPRQVHLLLPHFIKTVIADGKFFQRSPVFKAVFPLV GNSMIVSNYEDHHWQRKLFNQAFTSQQLKRYFLAFTLHTDLLMKLWSCTCDKENGTNLNV WSDLSNLSFDIIGDV >BP508840 BP508840 Hydra magnipapillata cDNA library Hydra magnipapillata cDNA clone hmp_03437. Length = 452 Blast with CYP20 Fugu C-term Query: 88 VDQHLIPKESLVIYALGVILQDSDTWNAPYRFDPDRFEEESVKK----SFHLLGFSGSQT 143 VD HLIPK++ +I ALG + QD + P RFDPDRF ++ +++ +F GF+G + Sbjct: 18 VDGHLIPKKTPIILALGTVFQDETIFPEPDRFDPDRFSDKQIEERSALAFQPFGFAGKRK 197 Query: 144 CPELRFAYTVATVLLSVLVRQLKLHRLKDTLMEVRSELVSTPRDETWI 191 CP R AY +++ + +++ V+ P +E WI Sbjct: 198 CPGYRLAYAETLTYTFYIIKNFHISLFDKQSVKMHYGFVTKPSEEIWI 341 Note: this seqs. Best match in Fugu human and Ciona is CYP20 DYDIIVDGHLIPKKTPIILALGTVFQDETIFPEPDRFDPDRFSDKQIEERSALAFQPFGF AGKRKCPGYRLAYAETLTYTFYIIKNFHISLFDKQSVKMHYGFVTKPSEEIWIKVLRRKNI* Note: this sequence finds a Leech EST in searches as the best EST hit after itself Below are some ESTs and related sequences that may be CYP20 orthologs or close relatives. CN807321 HDAH05F11.F Leech Haementeria depressa library HDA Haementeria depressa cDNA 5' similar to cytochrome P450 monooxygenase. Length = 560 Score = 96.3 bits (238), Expect = 1e-19 Identities = 48/113 (42%), Positives = 63/113 (55%) Frame = +2 Best matches are to CYP20s Query: 3 DIIVDGHLIPKKTPIILALGTVFQDETIFPEPDRFDPDRFSDKQIEERSALAFQPFGFAG 62 DI++ IP TPII ALG V D IFPEPD F+P+RF K + + A Q FGFAG Sbjct: 221 DIVIKKDKIPANTPIIHALGVVSSDPAIFPEPDEFNPERFDSKN--KPNTFACQSFGFAG 394 Query: 63 KRKCPGYRLAYAETLTYTFYIIKNFHISLFDKQSVKMHYGFVTKPSEEIWIKV 115 KRKCP + + +++ F + L Q V H+G KP EEIW+ + Sbjct: 395 KRKCPASSTVASHAALFYSHLLDCFELKLAPDQIVSSHHGLNAKPKEEIWLSL 553 Best matches are to CYP20s AYLKYSSLLTWMGYFLALEPDVQHKLRKELGLVSVSDLSSVNINKFKYFQNTLKETLRC AKIVPYTARIH NTDIVIKKDKIPANTPIIHALGVVSSDPAIFPEPDEFNPERFDSKNKPN TFACQSFGFAGKRKCPASSTVASHAALFYSHLLDCFELKLAPDQIVSSHHGLNAKPKEEIWLSLEM The Hydra seq. Also matches a tunicate Query: 1 DYDIIVDGHLIPKKTPIILALGTVFQDETIFPEPDRFDPDRFSDKQIEERSALAFQPFGF 60 D DI V G+LI K TP+I A+GT DE F +P FDP RFS+ E + L + PFGF Sbjct: 75 DIDINVGGNLITKYTPVIEAIGTAMTDEGNFSDPSEFDPKRFSEDNDEVINPLDYIPFGF 254 Query: 61 AGKRKCPGYRLAYAETLTYTFYIIKNFHISLFDKQSVKM----HYGFVTKPS-EEIWIKV 115 G RK P L + +I+N + + S FV++PS EE+W+ V Sbjct: 255 GGSRKAPQNSLVFLLASIVASTLIRNLKVCIAGDSSQNQPDPDFSAFVSRPSKEELWVTV 434 Best matches are to CYP20s >AV385510 601 bp mRNA linear EST 27-OCT-1999 Halocynthia roretzi (tunicate) 51% to CYP20 like in Ciona KYMKMVLKESLRSGCVFSWAGRQQDIDINVGGNLITKYTPVIEAIGTAMTDEGNFSDPSE FDPKRFSEDNDEVINPLDYIPFGFGGSRKAPQNSLVFLLASIVASTLIRNLKVCIAGDSS QNQPDPDFSAFVSRPSKEELWVTVSALGKKKAETVG* The most ancient match is to a sponge Y17816.1|SDO17816 Suberites domuncula mRNA for cytochrome P450 Length = 1798 Score = 110 bits (274), Expect = 4e-23 Identities = 61/186 (32%), Positives = 99/186 (53%), Gaps = 4/186 (2%) Frame = +3 Query: 8 LLTWMGYFLALEPDVQHKLRKELGLVSVSD----LSSVNINKFKYFQNTLKETLRCAKIV 63 + TWM ++L+ P+ Q +LR E+ + + L ++ + + ET+R + + Sbjct: 975 MFTWMLWYLSSHPESQDRLRTEIERETGGERGDRLKEYSLRADTFLRQVQDETIRLSTLA 1154 Query: 64 PYTARIHDYDIIVDGHLIPKKTPIILALGTVFQDETIFPEPDRFDPDRFSDKQIEERSAL 123 P+ AR D + V G+ IP KTP+I ALG +++T++ D +DPDRFS R Sbjct: 1155 PWAARYSDKKVTVCGYTIPAKTPMIHALGVGLKNKTVWENTDSWDPDRFSP---NGRRGN 1325 Query: 124 AFQPFGFAGKRKCPGYRLAYAETLTYTFYIIKNFHISLFDKQSVKMHYGFVTKPSEEIWI 183 F PFG +RKCPGY +Y E + ++ F I + Q+V +G VT+P ++I I Sbjct: 1326 DFCPFGVHSRRKCPGYLFSYFEVGVFASILLSRFEIVPVEGQTVIQVHGLVTEPKDDIKI 1505 Query: 184 KVLRRK 189 + RK Sbjct: 1506 YIRSRK 1523 Best match CYP20 10-49 = CYP38A1 MLDFVIFAITAVAGLIGILLFFYFSRSTETKPVSSASPTSTIPR WSAPPADIEKGDLDVMMKKHGSLHQFLLHLHDNGKTPVTSFWWGKTHVVSFCSPQAFK ESAVFVNRPVELFVGFEPLITPFSIQYANDEDWVQRSKCLYHTLKGDDLKSYFHHFVQ IAQEEESLWSSYTSDKEVSLTKEVFPMTIKGIARTCFGDIFKDENELSKMAESYHVCW RTMEEGVPEAGSKRETEFLKHRRVLEDIIRRIIQERKEGEDLQELPFIDSMLQNYDSE DKIIADAISFMVGGFHTSGYMFTWMLWYLSSHPESQDRLRTEIERETGGERGDRLKEY SLRADTFLRQVQDETIRLSTLAPWAARYSDKKVTVCGYTIPAKTPMIHALGVGLKNKT VWENTDSWDPDRFSPNGRRGNDFCPFGVHSRRKCPGYLFSYFEVGVFASILLSRFEIV PVEGQTVIQVHGLVTEPKDDIKIYIRSRKED Also in a tick a 4V like sequence CB721393 jnn606F11.f op Ornithodoros porcinus porcinus cDNA 5'. Length = 608 Score = 75.5 bits (184), Expect = 2e-13 Identities = 42/116 (36%), Positives = 64/116 (55%), Gaps = 3/116 (2%) Frame = -3 Query: 3 DIIVDGHLIPKKTPIILALGTVFQDETIFPEPDRFDPDRFSDKQIEERSALAFQPFGFAG 62 D+ +G I T ++ + +DE +PEPDRFDP+RFS + + LA+Q FG G Sbjct: 567 DVEYNGLKIKAGTSVMAPTFQIHRDERYWPEPDRFDPERFSPENKAAQHPLAYQAFGM-G 391 Query: 63 KRKCPGYRLAYAETLTYTFYIIKNFHISLFDKQSVKMHYGF---VTKPSEEIWIKV 115 R C G RLA E++ ++ +F I L D Q +M G+ V+ P ++WIK+ Sbjct: 390 PRSCAGKRLAVLESMYTAARMLLDFRIELGDSQEGQMKMGYFAMVSVPGNDLWIKL 223 PPSLTFTTRRAAEDVEYNGLKIKAGTSVMAPTFQIHRDERYWPEPDRFDPERFSPENKAA QHPLAYQAFGMGPRSCAGKRLAVLESMYTAARMLLDFRIELGDSQEGQMKMGYFAMVSVPGNDLWIKLHKI* >A tardigrade 4V like sequence CF075781 Hd_mx17_60A05_T7 Hypsibius dujardini mixed stage (fraction 7) Hypsibius dujardini cDNA clone Hd_mx17_60A05 5' similar to Similar to C4C3_DROME (Q9VA27) Cytochrome P450 4c3 (EC 1.14.-.-) (CYPI. Length = 656 Score = 66.2 bits (160), Expect = 1e-10 Identities = 37/118 (31%), Positives = 66/118 (55%), Gaps = 1/118 (0%) Frame = +2 Query: 1 DYDIIVDGHLIPKKTPIILALGTVFQDETIFPEPDRFDPDRFSDKQIEERSALAFQPFGF 60 DY + +IP+ + + + + + +D +FPEPDRFDP+RFSD + + PF Sbjct: 305 DYKLPDSDIIIPEGSIVSIFIFALGRDPDMFPEPDRFDPERFSDTKSARHRPFDYVPFS- 481 Query: 61 AGKRKCPGYRLAYAETLTYTFYIIKNFHI-SLFDKQSVKMHYGFVTKPSEEIWIKVLR 117 AG R C G + A E T I++++ I S+ D +++++ +T+P + I +K+ R Sbjct: 482 AGPRNCIGQKFAVMEEKTILSTILRSYTIQSVEDPETLRIAGELITRPIDGISVKLFR 655 53% to 4V2 SAKTEDGKVLDNASIREEVDTFMFEGHDTTSVAAAFAVFLLGNHQDVQQKAYEELRQIF GDSDRAPTTEDLKNMRYLDCVLKESLRLYPSVPMITRKCNEDYKLPDSDIIIPEGSIVSI FIFALGRDPDMFPEPDRFDPERFSDTKSARHRPFDYVPFSAGPRNCIGQKFAVMEEKTIL STILRSYTIQSVEDPETLRIAGELITRPIDGISVKLFRX