ࡱ>   GbjbjҬҬ 2ZƜ_Ɯ_- """""6668nd6C"&NNN!!!!!!!$i$'!"NNNNN!""h!NNNN""!NN!NN tz!uJENF!!"0C"N!,'N'z!z!'"! NNNNNNNN!!NNNNC"NNNN'NNNNNNNNN X : >gnl|ti|647066038 1095898227332 34% to 17A1 35% to 2U1 fugu 33% to 2U1 human 74% to 1095901734433 1097567032902 1096761105127 1096123323522 1096088745900 Combined seq from CN769290 and CN769570 39% to CYP17A EST = CN769570.1 mate pair of 1096088745900 had partial match to N-TERM. WALKED UPSTREAM TO 1097672588127, N-term still missing, end of this exon seq not certain cannot walk upstream any further (1) AFNRNTNSLINSDPGPRFKILRKLASSSLKIYAEGLLGMERIAISEYCELSKKLQSIKEKPVSVHKIM (1) AGCATTCAACAGAAATACGAACAGCCTCATTAACAGTGATCCAGGCCCGCGTTTTAAAATTTTA CGAAAGTTAGCATCATCTTCTTTGAAAATTTACGCTGAGGGTTTATTGGGAATGGAAAGA ATAGCAATCAGTGAATATTGTGAACTGAGTAAAAAGTTACAATCAATAAAAGAAAAACCA GTATCGGTTCATAAAATAATGGGT (0) QSTLNIICTILFNHRYEDDNQEFQNIIKYSSLIVQTFNETSYVSS IPLLRYFPTATSRNIFEIIRLRDPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGE ELTEKITDDNIEFLLNDFMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRY VSLKDRPMLHLMQAAIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHH DESYWKNAMSFYPERWLEKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLK DYRFEMPTGKELPCLDGRSGITSPPNDFEVVIIPRN* AGCAAAGCACACTTAACATTA TTTGTACCATTCTTTTTAATCATCGCTACGAGGATGACAACCAGGAGTTTCAGAATATCA TAAAATACTCAAGTTTAATCGTTCAAACTTTTAATGAAACCAGTTACGTATCTTCCATTC CATTGCTGCGCTATTTCCCAACGGCAACGTCGCGAAATATTTTTGAAATCATAAGGCTTC GTGATCCGATTTTAAAACGAAAACTCCAAGAGCACAGAAAATCTTACGATAAGAATAATT TACGTGACATAACCGATGCATTAATAAAAGTGTCTTTAGATTCAGAGATGGGTGAAGAAT TAACTGAAAAGATTACTGATGATAATATTGAGTTTCTTTTAAACGATTTTATGATTGCTG GATCCGAAACTTCATCAAGTACTATTCTTTGGTTTATTGTTTACATGTTACATTGGCCAG AATACCAAAATAAACTTTATGATGAAATTACTAAAGTAGCATCAGATAACCGTTATGTAT CTTTAAAGGATCGACCTATGCTTCATTTAATGCAAGCTGCAATTCATGAAACACTTAGAC TGTCATCGGTGGTACCTCTTGGTTTGGTTCATAAAGCAATGGAGAACAGTAGCATTTGTG GCAAGTTTGTTCCTAAGGGAGCTCTTATTTTAACAAATTTATGGAGTATGCATCACGATG AAAGCTATTGGAAAAATGCAATGAGTTTTTACCCGGAACGTTGGCTGGAAAAATCTGGCG AGTTCAATTATAAATTGGGGTACGCATATTTACCGTTTTCTAATGGACCTCGTAGTTGTT TAGGAGAAACATTGGCAAAAACAGAGTTGTTTGTGTTTATTACACGATTACTTAAAGATT ACCGATTTGAAATGCCAACTGGAAAAGAGTTACCTTGTTTAGATGGTCGTTCTGGAATCA CCTCCCCTCCTAATGACTTTGAAGTCGTGATAATTCCAAGAAATTAA >complete combined seq CN566859 CN566581 CYP2 clan member [gene 2] 1097326058990 32% to 2X9 aa 26-146 34% to CYP17A MFLEVIGAVFIPPLIWTIWVYIKHLIDCLHYPRGPIPLPFIGNGYLIRKAEPYKELVNLGKIYG DVFSFSVGSVRYVIVNSLEGIQEVLVKKGWQFAGRPKGP () SWDRSIHGLIQRDPSKKFKILRKLATSSLKIFADGLAGMESKAIEESFQLNKKLLETNGKPFSMQEIT (1) 1097329249233 (1) TLCVLNIICSILFNHRYKEDDLEFQDIIKYSNICFKERGVNNYIISIPWLRY FPSASSRNLDEMIKIRDPLL KKKVQEHKRSYDEYNLRDLTDALIKASNSETGQDPDEKVTDDNIVFILN NFILAGSETSSNTILWFIVYILHWPEYQDKLYDEILKVTSGSRYPCLKDRPSLHLMQAAI YETLRLSSVAPFGLHHKAMEKSSICGKSI PKGALIITNLWSIHHDESYWKNAMSFYPERWLENSGEFNSKLGNAYLPFSSGPRSCIGETL 395 394 AKTELFIFISRLINDFRFVKPILEELPRLEGSFGITCTPYDFKVEIVPRSKNLLV* 227 AGATGCAACTGATAATAAATTCGCGATCCGCTATAAAGAAAAAA GTCCAAGAGCACAAAAGATCGTATGACGAATATAATTTACGCGATCTAACAGATGCTTTA ATAAAAGCATCAAACTCGGAGACGGGACAAGATCCGGATGAAAAAGTTACTGATGATAAT ATTGTATTTATCTTAAATAATTTTATACTCGCAGGATCAGAGACTTCATCAAATACGATT CTTTGGTTCATTGTTTATATTTTACATTGGCCGGAGTATCAAGATAAACTTTATGATGAA ATTTTAAAAGTAACATCAGGTAGCCGTTACCCTTGCTTAAAAGATCGCCCGTCACTACAT TTAATGCAAGCTGCAATTTATGAAACACTTAGGTTGTCATCGGTCGCACCTTTTGGTTTA CATCATAAAGCCATGGAGAAAAGTAGCATTTGTGGAAAATCTATCCCTAAAGGCGCTCTT ATAATAACCAATCTATGGAGTATACACCATGACGAAAGCTACTGGAAAAATGCAATGAGT TTTTACCCTGAACGTTGGTTGGAAAGTTCTGGCGAATTTAACTCTAAACTAGGAAATGCG TATTTACCATTCTCTAGTGGACCTCGTAGCTGTATTGGAGAAACATTAGCAAAAACTGAG >1097329039870 1096703377333 1097664213870 MLFKVIGTILIPPLIWVVWIYIKHLVDCLSYPQGPFPLPFIGNAHLIRNRESYKVF SEFQKIYGSVFGFSIGSTRYVVVNNLEGVQEVLIKKGSQFAGRPRRA (1) ATGCTCTTTAAAGTCATTGG TACAATCTTGGTTCCACCTTTAATATGGGTTGTATGGATTTATATCAAACATCTTGTTGA CTGCTTGTCCTATCCTCAAGGACCATTTCCTCTCCCATTTATAGGAAATGCTCATTTAAT AAGAAATAGGGAGTCTTATAAAGTGTTTTCTGAATTTCAGAAGATTTATGGCAGCGTTTT TGGATTTAGCATTGGCTCAACCAGATATGTGGTTGTAAATAACTTAGAAGGAGTTCAAGA GGTTTTGATCAAAAAAGGTTCACAGTTTGCAGGCCGCCCAAGACGAGCAAGT >1096703827379 1095896863976 MFPEIVGAIMLPPLIWAAWIYIKHLVDCLVYPRGPFPLPFVGNAYLFSKGKPYKEFVKLG 103 KTYGDVFGFSIGSIRYVVVNSLEGIKKXXXXXXXXXXXXXXXX ATGTTTCCTGAAATCGTTG GCGCAATTATGCTTCCTCCCTTGATATGGGCAGCGTGGATTTACATAAAACATCTTGTTG ACTGTTTAGTTTATCCCCGAGGACCATTTCCACTACCTTTTGTAGGAAATGCATATCTCT TCAGTAAAGGCAAACCTTATAAAGAATTTGTTAAACTTGGAAAAACTTACGGCGATGTAT TTGGCTTTAGCATTGGTTCAATACGATATGTAGTCGTGAACAGCTTGGAAGGTATCAAGA AGT >1095899160393 frameshifted MFFEVIRAFFTPPLVWIIMVYIKNLIDYLYYPREPIPLPFIGNGDLIRKAEPFKEL VNLEKKYGDVFSFRIGLVRFVVVSSLEVILEILVKKGWQANGRPKAP (1) ATGTTTTTTGAAGTTATTCGCGCCTTCTTTACTCCACCTTTGGTATGGATTATAATGGTTTATATAAAA AATTTAATCGATTATTTGTATTATCCACGAG AACCGATACCACTACCATTTATTGGAAATGGTGATTTGATAAGAAAAGCAGAACCGTTTA AAGAGTTGGTTAACCTGGAAAAAAAATATGGCGATGTTTTTAGTTTTAGGATTGGTTTAG TCAGATTTGTGGTTGTTTCA AGTTTAGAAGTAATTTTAGAAATACTAGTAAAAAAAGGGTG GCAGGCAAATGGTCGTCCAAAAGCTCCAAGT >1097329360095 4 aa diffs to CN566859 from PKG FYLNNFILAGSETSSNTILWFIVYILHWPEYQDKLYDEILKVTSGSRYPCLKDRPSLHLM QAAIYETLRLSSVAPFGLHHKAMEKSSICGKSIPKGALIITNLWSIHHDESYWKNAMSFY PERWLESSGEFNSKLGNAYLPFSSGPRSCIGETLAKTELFIFISRLINDFRFVKPISEEL PRLDGSFGITCTPYDFKVEIVPRSKNLLF* TTTTATCTTAATAATTTTATACTTGCAGGATCAGAGACTTCAT CAAATACGATTCTTTGGTTCATTGTTTATATTTTACATTGGCCGGAGTATCAAGATAAAC TTTATGATGAAATTTTAAAAGTAACATCAGGTAGCCGTTACCCTTGCTTAAAAGATCGCC CGTCACTACATTTAATGCAAGCTGCAATTTATGAAACACTTAGGTTGTCATCGGTCGCAC CTTTTGGTTTACATCATAAAGCCATGGAGAAAAGTAGCATTTGTGGAAAATCTATCCCTA AAGGCGCTCTTATAATAACCAATCTATGGAGTATACACCATGACGAAAGCTACTGGAAAA ATGCAATGAGTTTTTACCCTGAACGTTGGTTGGAAAGTTCTGGCGAATTTAACTCTAAAC TAGGAAATGCGTATTTACCATTCTCTAGTGGACCTCGTAGCTGTATTGGAGAAACATTAG CAAAAACTGAGTTGTTTATTTTTATATCCCGATTAATAAATGATTTCCGATTTGTAAAAC CGATATCAGAGGAATTACCGCGTTTAGATGGTAGTTTTGGCATCACTTGTACTCCTTATG ACTTTAAAGTTGAAATAGTTCCAAGGAGTAAAAATTTACTGTTTTAA >1097509039345 92% identical to 1096064108200, probably joins with 1095898835518 1096625274183 1095900033599 1095896933215 100% match so this similar seq is real 1097206379175 1097678021634 MFLEVAFGVVTPLFLYVIATYLDHLFKCRFYPPGPFPLPIIGNLHLIGKKPHEKFVEYSK 538 KYGEVFSLSFGMHRVVIVSGKDSIREVLVQKSNIFAGRPKNYIANIVSRGYKNIGYGDIG 718 PKWKILRKIAHSSLKNYGESTAHLETLVVRESEELHKNLYKKSNRSTKLEHKF (1) >gnl|ti|649400787 1095898835518 93% identical to 1096064108200, 39% to 17A1 fugu 35% to 2U1 gnl|ti|647175227 1095898288652 1096602038000 (1) GVAVLNVICSIVFGKRYEYENCEFKEILTYMNYVFTGVAGTNAISFIPWLRFLPLDGLR KLKKGLSIRDPVLRKQLLYHRETYNESNLRDYTDYVIQFSRDEAILKKFGEQLTDDY LELLLNDIFIAGTETALTTLLWSIIYLIHWPKFQDKIYNEIVSAIGKNRYPSMKDRNMLP LVNAALSETLRLSSVTPLGVPHKAMEDTTLLNDLKIPKGTTILTNLWQLHHNKNCWENPH EFNPYRWFTNDQTLDSIKSMNFLPFSAGTRVCLGKGIAEVELFLFYSRLVRDFKFEVKP GDSLPSLYGNCGLL* AGGTGTTGCGGTATTAAATGT CATTTGCTCTATTGTATTTGGAAAACGCTATGAGTACGAAAATTGTGAATTTAAAGAAAT CCTAACCTACATGAATTATGTTTTTACTGGTGTAGCTGGTACAAACGCAATTTCTTTTAT TCCGTGGCTTCGTTTCCTTCCATTAGATGGATTACGAAAATTAAAAAAAGGACTTTCAAT TAGAGATCCGGTTCTTCGGAAGCAGTTGTTATATCACAGAGAGACCTACAATGAAAGTAA CCTGCGTGACTATACAGACTATGTCATACAATTTTCAAGAGATGAGGCCATCTTGAAAAA GTTTGGAGAACAGCTAACTGATGACTACTTAGAGCTTTTACTTAATGATATATTTATAGC TGGAACTGAAACTGCATTGACAACTTTACTTTGGTCAATTATCTACCTTATTCACTGGCC AAAGTTTCAAGACAAAATTTACAATGAAATTGTTTCAGCTATTGGTAAAAATAGATATCC TTCTATGAAAGATCGTAATATGCTGCCTCTTGTTAACGCTGCGTTATCAGAAACATTGCG GTTATCTTCTGTTACTCCATTAGGAGTACCTCACAAAGCTATGGAAGATACAACTCTCTT GAATGATTTAAAGATTCCCAAAGGCACCACAATTTTAACGAACCTTTGGCAATTACATCA CAATAAAAACTGTTGGGAAAATCCACATGAGTTTAATCCATATAGATGGTTTACTAATGA TCAAACACTTGATTCTATAAAATCTATGAATTTTTTACCTTTTTCTGCTGGTACCAGAGT GTGTTTAGGAAAGGGTATTGCTGAAGTTGAACTTTTTCTTTTTTACTCAAGGCTGGTTCG TGATTTTAAGTTTGAAGTAAAACCCGGCGATAGTCTTCCAAGTTTATATGGAAATTGTGG ATTACTCTAA >gnl|ti|648017453 1095896110991 52 1e-05 35% to 17A1 fugu 34% to 2U1 fugu gnl|ti|647987527 1095895119635 1096703762277 used this seq to walk upstream past a repeat could not go futher 71% to 1095898227332 (1) ELTTLNIICTILFNQRYEQDDDEFQNIIKYSNLSFKAFSASNLLSSIPWLRYFPTTASKYIQ 707 706 EIERLRDPILKRKLQEHRKSYDENNLRDITDALIKASIHLNAEKDSLIKVTDDNIQFILN 527 526 DLILAGSETSSSTITWFIVYMLHYPEYQDKIFNEVIKVTSGNRYPCLNDRPLLHLLQATI 347 346 HETLRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERW 167 166 LNETGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDFYFEKSVEEDLPRLDSF 374 375 PGVTRSPYDFKVVVVSRS* >gnl|ti|647193621 1095899233960 1096082123583 1097696262164 1096620040714 1097206342731 Combined seqeunces BP505786 and CB073123 and CB271974 40% to CYP17A [gene 3] CN570733 same as CN570522 BP505786 50% to 1095898835518 37% to 17A1 (1) VTGVMNVLCGIVFGTQYEENDKELEKVISFKQLILDGVADTFAISFLPWLRFFPSNGLKKVRK GVLIRDKLLRFQLKKHRETYNPVQIRDYTDYVLKYSKEFETSRNIDEQLSEDNMEMM LQDIFISGSETTISTLLWFAVYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFES AMKETLRLSSVIPLGLPHRSLEETSIKKFKIPKNTNVMINLWQLHHDSKSWSDPHTFNPY RWLNDKNIFDKSKNPNYLPFSTGLRACLGYHTTESIIFLFFTRLIRDFNLCLKPGASTP SLNGVLRVTLTPDTSYIILKPRSNNLISQKIEA* AGTTACTGGAGTGATGAACGTTCTTTG TGGAATTGTTTTTGGTACACAATATGAAGAAAATGATAAAGAACTTGAAAAAGTCATATC TTTTAAACAGTTAATATTAGATGGAGTAGCAGATACATTCGCAATATCTTTTTTGCCGTG GTTAAGGTTTTTTCCTTCAAACGGATTAAAGAAAGTACGAAAAGGCGTGTTGATAAGAGA TAAACTACTTAGGTTTCAATTAAAAAAACATCGAGAAACATACAATCCAGTTCAAATAAG AGATTACACTGATTACGTACTTAAATACTCAAAAGAGTTCGAAACTTCAAGAAACATAGA TGAGCAGTTAAGTGAAGATAATATGGAAATGATGCTTCAGGATATTTTCATTAGTGGTAG CGAAACAACTATATCAACACTTCTTTGGTTTGCTGTTTATTTAGTTAACTGGCCAAAGTA TCAAGATGATATCTATGATGAAACTATTAAAATAGTCGGTAATGATAGGTATCCTAGTCT TTCAGATCGTCCAAAGCTTCATTTATTTGAAAGTGCTATGAAAGAAACTCTGCGTTTGTC GTCTGTCATTCCATTAGGTTTACCTCACAGAAGTCTTGAAGAAACCAGCATAAAAAAATT TAAAATTCCTAAAAATACAAACGTAATGATTAATCTGTGGCAGTTGCACCATGATAGTAA ATCTTGGAGTGATCCTCATACATTTAATCCATATAGATGGTTAAATGACAAGAATATCTT TGACAAAAGCAAAAACCCAAACTATCTTCCATTTTCAACCGGATTAAGAGCCTGCTTAGG TTATCACACAACCGAATCCATCATTTTTTTGTTTTTTACCCGATTGATAAGAGATTTTAA TCTTTGTTTGAAACCTGGCGCATCTACTCCAAGTTTAAACGGTGTTTTGCGAGTAACCTT AACTCCTGATACGTCATACATTATTCTAAC >gnl|ti|648033522 1095897342515 39% to 17A1 N-term 1095899118747 1095900033599 1096071090512 1096703396910 1096608233968 MFLEIAFGVTAPLLLYVIATYLDHLFKCRFYPP GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK ESEELHKRLFKNCNRSTELEDEF (1) ATGTTCTTAGAAATTGCTTTTGGAGTAACAGCTCCTCTGCTTTTGTATGTCATTGCAACTTATCTAG ATCATTTGTTTAAATGCAGATTTTACCCGCCAGGCCCTTTTCCTTTACCGATTATTGGGA ACTTACATTTGATTGGAAAAAAACCACATGAAAAGTTTGTAGAATATTCAAAAAAGTATG GAGAAGTATTCAGTCTAAGTTTTGGAATGCATCGTGTTGTTATTGTTTCAGGAAAAGATT CTATTAGAGAGGTTTTGGTTCAAAAATCAAACATTTTTGCAGGGCGTCCTAAAAACTACA TTGCTAATATTGTATCTCGTGGTTATAAAAATATTGGCTACGGAGATATTGGACCTAAAT GGAAAATTTTGAGGAAAATTGCTCACTCTTCTTTAAAAAACTATGGAGAGTCAACTAAAC ATTTGGAAACGCTTGTCGTAAAAGAAAGCGAAGAGCTACACAAAAGACTTTTTAAAAATT GTAACAGATCCACAGAGCTAGAAGATGAGTTTGGT 1096064108200 93% to 1095898835518 1097206931796(9 aa diffs) 1097206498632 walked up to 1096081234652 found mate pair 1096071090512 already known N-term seq matches 1095897342515 100% 1095897342515 38% to 17A1 fugu whole seq. MFLEIAFGVTAPLLLYVIATYLDHLFKCRFYPP GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK ESEELHKRLFKNCNRSTELEDEF (1) (1) GVAVLNVICFIVFAKRYENKDSEFKKILMYMNYVFSGVASTNFASFIPWLRFFPLDGLR KLKKGLSIRDPVLRKQLLYHRETYNESNLRDYTDYVIQFSRDEAILKKFGEQLTDDYLEL LLNDIFIAGTETALTTLLWSIIYLIHWPKFQDEIYNEIVSTIGKDRYPSMKDRNMLPLVN AALSETLRLSSVTPLGVPHKAMEDTTLLNDLKIPKGTTILTNLWQLHHNENCWENPHEFNPYRWF TNDQALDSIKSMNFLPFSAGTRVCLGKGIAEVELFLFYSRLVRDFKFEVKPGDSLPSLDG NYGITLTPRIFTTFVVARNDSLVAQNHSL* >gnl|ti|647182814 1095899213949 1095958075467 1095733042694 1097672545497 54% to 1095898835518, 36% to 17A1 36% to 2U1 walked upstream to 1097672406696 which mate pairs to exon 2 below (1) GVAVLNVICFIVFGERYQYSDPAFIEILTTINNIVSGLSNTTAVDFLPGLRYLQFSEIK 256 257 KLKSSLVIYFRLLNDQLKKHKKTFDENNIRDFTDSIIKFSKDETMENKFEEELTDEHLEH 436 437 VIGDMFIAGSETTLTSLLWLIIYMIHYPKYQEEIFEEITRVIGENRYPQLSDRDSLHLVK 616 617 ASIKECLRLSSIIPLGVPHKTMSDTTLIGYNIPKNTTVIINHWQIHNDTNHWKNPNEFNP 796 797 HRWIDDDSKFDATRATSYLPFSAGTRVCLGKTVAETELFFFFTRLIRDFKFE GVPGCPLPSLIGKCSITLAPEEFNVHVTPRINSLMFSKNVLPE* >combined seq CN774619 CN775634 CYP2 clan member [Gene 1] 32% to CYP1C1 aa 173-297 29% to 17A2 2 ESEELHKRLLMKSKTSVDLKTEFGAAIINVICFIVFGERYQYSNSEFKEVLTTINNIV 175 176 DGLSNTTAVGFLPWLRFLPFSPIKKLSISLSKYIRFLNDKLTKHKETFNENKIRDS 343 344 TDSIIN 361 >1096526199166 frame3_ORF1 7aa diffs to CN774619 may be same gene (1) GAAIINVICFIVFGERYQYSDSEFKEVLTTINDIVDGLSNTTAVGFLPWLRFLPFSPIKKLSIS LSKYVRFLNDKLKKHKETFDEKKIRDFTDSIINFSNNEAVKQKFKNVDEHLEPVIGDLFI TGSETTLTSLLWLILYMMHYPKYQQEIFKEITTVIGEDRYPCLNDRDSLHLVKAALKECL RLSSIVPLGLPHKTTKETVLMGHSIPGNATVMINHWQIHNDTNYWENPNEFNPYRWIGKD KKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFNFECIPGCPPPSLIGKCN ITHAPKQFCAYLTPRINNLM* AGGTGCTGCAATTATAAACGTGATTTGTTTCATTGTTTTTGGGGAAAGATACCAGTATTCAGATTCAGAAT TTAAAGAAGTTCTTACAACAATAAATGATATAGTCGATGGGTTGTCAAATACAACTGCTG TTGGATTTTTGCCGTGGTTGAGATTTTTACCGTTTTCTCCAATAAAAAAACTGAGTATTT CACTTTCAAAATATGTTCGTTTTTTAAACGATAAGTTGAAAAAACATAAGGAAACATTTG ATGAAAAGAAAATTCGAGATTTTACTGATTCTATTATAAATTTTTCTAATAACGAAGCTG TCAAACAAAAATTTAAAAACGTTGATGAACATTTAGAGCCTGTGATTGGGGATTTATTTA TAACGGGTAGTGAGACCACATTAACATCTTTATTGTGGTTAATTCTTTATATGATGCATT ATCCCAAATATCAACAAGAAATTTTTAAAGAAATTACAACGGTTATTGGTGAAGACCGGT ACCCATGTTTAAATGACCGTGATTCTTTGCATCTTGTTAAAGCCGCATTAAAAGAGTGTC TGCGTTTATCTTCAATTGTTCCTCTTGGATTACCACACAAAACAACCAAAGAAACAGTTC TTATGGGACATAGCATTCCTGGGAATGCAACAGTCATGATTAATCATTGGCAGATTCATA ACGATACTAACTACTGGGAAAATCCTAACGAATTTAATCCTTATCGGTGGATTGGTAAAG ATAAGAAATTTGATCCAAGTAAAGCAACAAGTTTTTTACCTTTTTCAGCCGGTACAAGAG TTTGTTTAGGGAAAACAGTTGCTGAAAATGAACTATTTTTCTTCTTTTCTAGATTAATTC GAGATTTTAACTTTGAGTGCATACCTGGTTGTCCACCTCCAAGTTTAATTGGTAAATGCA ATATTACTCATGCTCCAAAACAGTTTTGCGCATACTTGACTCCAAGAATAAACAATCTAATGTAA >whole gene 1095899272864 1096526199166 MWYEIICGLIISILLYIIGSYLMHLLECRKYPLGPFPIPIFGNLHLLGTEPHKILAAYS KKYGAVFSISLGLQRIVIISDITTTREALVQKASIFAGRPKSYLIQLISSGYKGIAFMDY GSFWKVLRKVSHSSLKIYGEGHERFEKILTKESEELHKRLLKKSNNSVELKSEF (1) GAAIINVICFIVFGERYQYSDSEFKEVLTTINDIVDGLSNTTAVGFLPWLRFLPFSPIKKLSIS LSKYVRFLNDKLKKHKETFDEKKIRDFTDSIINFSNNEAVKQKFKNVDEHLEPVIGDLFI TGSETTLTSLLWLILYMMHYPKYQQEIFKEITTVIGEDRYPCLNDRDSLHLVKAALKECL RLSSIVPLGLPHKTTKETVLMGHSIPGNATVMINHWQIHNDTNYWENPNEFNPYRWIGKD KKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFNFECIPGCPPPSLIGKCN ITHAPKQFCAYLTPRINNLM* >gnl|ti|655005893 1095958068757 44 0.002 43% to 4V5 fugu 36% to 4T5 gnl|ti|651153924 1095901025079 N-term gnl|ti|651153911 1095901025066 1097206604076 1097206339312 complete gene no introns ESTs = CV566433.1 CX054637.1 CV566166.1 MVSVFYILFSGLVFYVVSKILWKLWRNSYGLSSIVTPPNVPFFGTSLYLHSDA RKFFFQLYDYTRRYGDVFCIWLGPKPVICSSSVKFSEAVLSSQKVITKGFSYDFLHDWLK TGLLTSTGSKWKTRRRLLTPSFHFSILNNFIKIFEEQASILVDKLAVAADNKEVVDVQVP IGLATLDIICETSMGVKVNAQSHPDSEYVKAITVLNEEIQMRQKFPWLWFDAIYKLL 568 567 PCGKRFYKALDVAHKLSFDVINERMQMKIQESYCETASDEKKFFLDLLLDIYRKGKI 397 396 DTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQKKLHKEIDEIELNGGSLYDKVR 217 216 QSKYLEIILKESLRMHPPVPMYGRTVEEDMTIDGQFVPKGAQIVLLVLILHSNPDYWEN 40 39 PNDFIPERFEADSYEKRNPYSYVPFSAGPRNCIGQKFAMIEEK ILLYSIMKNFHLKSMQNENEVFGTLDIIHKSINGINIKFTRR* ATGGTATCAGTTTTTTATATATTATTTAGTGGACTT GTTTTCTATGTTGTTAGTAAGATATTGTGGAAGTTATGGAGAAATTCATATGGTTTATCA TCAATAGTTACACCTCCAAATGTACCATTTTTTGGAACATCTTTGTACTTGCATAGTGAT GCCCGCAAATTTTTTTTCCAACTATATGACTACACAAGAAGATATGGCGATGTGTTTTGC ATTTGGTTGGGGCCAAAACCAGTAATATGTTCTTCCTCTGTAAAATTCTCAGAAGCAGTA TTAAGTAGTCAGAAAGTTATCACCAAAGGATTTTCTTATGATTTTTTGCATGACTGGTTA AAAACTGGGTTACTTACAAGCACAGGATCAAAATGGAAAACACGTAGAAGGCTACTAACT CCAAGTTTTCATTTTTCTATACTCAATAACTTTATTAAAATATTCGAAGAGCAAGCATCC ATTCTGGTGGACAAACTAGCTGTAGCTGCTGACAACAAGGAAGTTGTAGATGTGCAAGTA CCTATTGGTTTGGCAACCTTGGATATAATCTGCGAAACTTCAATGGGTGTAAAAGTAAAT GCACAAAGTCATCCAGATTCTGAGTATGTTAAAGCT ATCACAGTTTTAAATGAAGAAATTCAAATGCGTCAAAAGTTTCCTTGGCTTTG GTTTGATGCCATTTACAAACTGTTGCCTTGTGGGAAAAGGTTTTATAAGGCTTTAGATGT TGCTCATAAGCTATCTTTTGATGTAATAAATGAACGCATGCAAATGAAAATTCAAGAATC TTATTGTGAGACTGCGTCAGATGAAAAGAAATTTTTTTTAGATTTATTGTTAGATATATA TCGCAAAGGTAAAATTGACACTGAAGGTATTCAAGAAGAAGTTGATACTTTTATGTTTGA AGGTCATGATACAACTTCAGCTGCACTAGGTTGGACTCTTTGGTTGTTAGGAAAAAATCC AGATGTTCAAAAAAAGCTGCACAAAGAAATTGATGAGATAGAGTTAAATGGAGGTTCACT TTATGATAAAGTCAGACAGTCTAAATACCTTGAAATTATTCTTAAAGAATCATTACGAAT GCATCCTCCTGTCCCTATGTATGGAAGAACAGTTGAGGAAGATATGACTATTGATGGTCA GTTTGTTCCCAAAGGAGCACAAATAGTTCTTTTAGTTTTAATCTTGCACTCAAACCCTGA TTATTGGGAAAACCCAAATGATTTTATACCTGAACGT TTTGAAGCTGATAGTTATGAAAAGCGCAACCC ATACAGTTATGTACCTTTTTCTGCTGGACCAAGGAATTGCATTGGCCAAAAATTTGCCAT GATTGAAGAGA AAATATTACTGTATAGCATAATGAAAAACTTCCATCTTAAGTCAATGCAGAATGAAAATG AGGTTTTTGGTACTCTTGATATAATTCATAAGTCAATTAATGGAATTAATATAAAGTTCA CAAGAAGATAA >1096064105622 very similar to 1095958068757 varies at N-term 86% 346 MIYASYLVLVGLFVFFVSKILWKLWKSSYGLETIATPPNIPVFGTSLYLHSDARKFFFQL 525 526 SEFTKKYGTVFCIWLGPKPMIISSSVKFSEAVLSSQKVITKGFSYDFLHDWLKTGLLTST 705 706 GSKWKTRRRLLTPSFHFSILNNFIKIFEEQASILVDKLAVAADNKEVVDVQVPIGLATLD 885 886 IICETSMGVKVNAQSHPDSEYVKAITVLNEEFVMRIKYPWXLWFDVIYKLLPCGKR 34 aa gap between these two seqs >CV564924.1 EST 93% to 1095958068757 EKKFFLDLLWDIYRKGEIDTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQRKLHKEIDEIE LNGGSLYDKVRQSKYLENILKESLRMHPPVPMYGRTVEEDMTIDDQFIPKGAQIILLVLM LHSNPEYWENPNDFMPERFEADSYEKRNPYSYVPFSAGPRNCIGQKFAMIEEKILLYSIM KNFHLKSMQDENEVFGTVDVIHKSINGINIMFTRR GAAAAGAAATTTTTTTTAGATTTGTTATGGGATATATATCGAAAAGGTGAAATTGACACTGAAGGTATTCAAGAAGAAGTTGATACTTTTATGTTTGAAGGTCATGATACTACTTCAGCTGCACTAGGTTGGACTCTTTGGTTGTTAGGAAAAAATCCAGACGTTCAAAGGAAGTTGCACAAAGAAATTGATGAAATAGAGTTAAATGGAGGTTCACTTTATGATAAAGTTAGACAGTCTAAATACCTTGAAAATATTCTTAAAGAATCATTACGAATGCATCCTCCTGTCCCTATGTATGGAAGAACAGTTGAGGAAGATATGACTATTGATGATCAGTTTATTCCCAAAGGAGCACAAATTATTCTTTTAGTCCTAATGTTGCATTCGAACCCAGAATATTGGGAAAATCCAAATGATTTCATGCCTGAACGTTTTGAAGCTGATAGTTATGAAAAGCGCAACCCATACAGTTATGTACCTTTTTCTGCTGGACCAAGGAATTGCATTGGCCAAAAATTTGCCATGATTGAAGAGAAAATATTACTGTACAGCATAATGAAAAACTTCCATCTTAAGTCAATGCAGGATGAAAATGAAGTATTTGGGACTGTTGATGTAATCCATAAATCAATTAATGGAATTAATATAATGTTCACCAGAAGAAAAGGAAAAACTTATCTTGTTTAGTTTAGTTCATTATTTATCAGTAATTTGAAATAAT >1096064105622 90% to 1095958068757 varies at N-term 1096071088011 joins CV564924.1 EST MIYASYLVLVGLFVFFVSKILWKLWKSSYGLETIATPPNIPVFGTSLYLHSDARKFFFQL SEFTKKYGTVFCIWLGPKPMIISSSVKFSEAVLSSQ KVITKGFSYDFLHDWLKTGLLTSTGSKWKTRRRLLTPSFHFSILNNFIKIFEEQASILV DKLAVAADNKEVVDVQVPIGLATLDIICETSMGVKVNAQSHPDSEYVKAITVLNEEFVMR IKYPWLWFDVIYKLLPCGKRFYKALDVAHKLSFDVINERMQMKIRESYCETASDEKKFFL DLLLDIYQKGEIDTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQRKLHKEIDEI ELNGGSLYDKVRQSKYLENILKESLRMHPPVPMYGRTVEEDMTIDNQFIPKGAQIILLVL MLHSNPEYWENPNDFMPDRFEADSYEKRNPYSYVPFSAGPRNCIGQKFAMIEEKILLYSIM KNFHLKSMQDENEVFGTVDVIHKSINGINIMFTRR >gnl|ti|655009968 1095963046224 42 0.010 46% to CYP20 35% to 27B1 419 DGGIHKFLVENHKRLGPMFSFYWGKELAVSLACPILFKEVATLFNRP 559 >gnl|ti|646849327 1095897329284 1097672251908 mate pair = 1097672200068 has exon 2 40% to 2X2 N-term MLLQITCGFLFPPLIWIVWTYIKHLYDCLSYPQGPIPLPFIGNAHLLRKGEPYKELVNLGKIYGDVFGFSIGSIRYVVVNNLEGIKEVLIKKGSQFAGRPRLKFTI (1) ATGCTTCTTGAAATTACTTGTGGGGTTCTGTTCCCACC GTTAATATGGATTGTCTGGACATATATTAAACATCTTTATGATTGTTTGAGTTATCCACA AGGACCAATACCACTGCCATTTATAGGAAATGCTCATCTTTTAAGAAAAGGTGAACCTTA CAAGGAATTAGTTAATCTTGGAAAGATATATGGTGATGTTTTTGGATTTAGTATTGGTTC AATTAGATATGTAGTTGTAAACAATTTAGAAGGTATTAAGGAAGTTTTGATTAAAAAAGG TTCACAGTTTGCTGGTCGTCCAAGGCTAAAGTTTACTATTAGT Exon 2 1097331043073 1097206900216 1097672200068 mate pair = 1097672251908 This mate pair has 2 aa diffs to 1095897329284 one nuc diff same aa seq 1096124035195 1096041114543 1097329360644 1096625189581 1095958061778 1095898207031 (1) ALSRGMNGLIMSDPSPHFRILRKLASSSLKIYAEGLDGMEKKAINEYSYLHKKLSTMNGKAVSLKRMI (1) AGCTTTGAGTAGGGGTATGAATGGCCTTATTATGAGTGATCCT TCACCACATTTTAGAATTTTACGAAAATTAGCATCATCTTCGTTAAAAATTTATGCTGAA GGATTAGACGGGATGGAAAAAAAAGCTATAAATGAGTACAGTTATTTGCATAAAAAATTA TCAACAATGAATGGAAAGGCTGTATCTTTAAAAAGAATGATAGGT >1096124019772 related exon 2 5 aa diffs to 1097331043073 1096123858905 1096123680637 (1) ALTRAMNGLIISDPSPHFKILRKLASSSLKLYAEGLDGMEKKAINEYSYLHKKLSTMNGKAVSLKRMI (1) >1097265020030 new N-term weak with frameshifts and a stop codon no exact matches exist so this may be poor quality sequence TCGCTYPQKIWNVL WTDIKHLSDSESYPQGPISLPI XXXAHIERKGETYREIDRLR*IYGDDIGMCIGTLRYVDVNNLEGIRDVLIYTGTQFL ACGTGTGGGTGTACGTACCCACAGAAAATATGGAATGTC TGGACAGATATAAAACATCTCTCAGATAGTGAGAGTTATCCACAAGGACCAATATCACTGCCAATT GCACATATAGAAAGAAAAGGTGAGACATACAGGGAGATAGATAGACTTAGATAG ATATATGGTGATGATATAGGTATGTGTATCGGTACACTTAGATATGT AGATGTAAACAATTTAGAAGGTATTAGGGACGTTTTGATTTACACAGGTACACAGTTTCT CTGGT >1096110062131 related exon 2 73% to 1095897329284 1097331675401 1097646001099 1096704247756 (1) AWSRALNGLVACDPGPRFKVLRKLASSSLKIYAEGLDGMEKKAADEYSHLNKKLQTMNGKPVSLQNMI (1) mate pair of 1097646001099 = 1097664041480, continues on 1096703402618 1097329754969 1097664053056 possible frameshift at NDRP_LHL (1) ELGTLNIICTILFNHRYEEDDKEFQDIIKYSNLTVKIFGGTSILSSIPWLRFLPSASSRSIYE IVRIRDPLLKKKLQEHKSSFDENNLRDVTDVLIKVSLGSDIAKGSEEKITDENIEFLLND FIIAGSETSSSTILWFIVYLLHWPEYQDKLYNEIIKVTSGKRYPCLNDRP ? LHLTQATIHETLRLSSVGPLAIVHKAMENSSICGKPVPKGAFILTNLWSTH HDESYWKNPMCFYPERWLEKSGEFNSKLGYAFLPFSGGPRSCLGEALARTELFVFFSRLV TDYRFEKPNGEELPRLNGRFGLTCSPFDFKSVVVPRC* AGAGTTAGGTACCCTCAACATCATTTGTACTATTTTGTTCAATCATCGATATGAAGAAGAT GATAAAGAATTTCAGGATATCATCAAATACTCAAATCTGACTGTTAAAATTTTTGGTGGA ACAAGCATTTTATCTTCTATTCCATGGCTGCGTTTTTTACCATCAGCTTCTTCAAGAAGC ATATATGAGATAGTAAGAATACGTGATCCACTTTTGAAAAAAAAGCTACAAGAGCACAAG AGCTCGTTTGATGAGAATAACTTACGTGATGTGACTGATGTATTAATTAAGGTTTCTTTG GGTTCAGATATTGCAAAAGGTTCCGAAGAAAAAATTACTGACGAAAACATAGAGTTTCTT TTAAACGATTTCATAATTGCCGGATCAGAAACTTCATCAAGTACAATTCTTTGGTTTATT GTTTATCTTTTACATTGGCCAGAATACCAAGATAAACTTTATAACGAAATTATAAAAGTT ACATCAGGTAAGCGTTACCCATGTTTAAACGATCGCCCc CTTCATTTAACGCAAGCCACAATTCATGAAACACTTCGATTGTCATCAGTAGGTCCTC TTGCTATAGTTCATAAAGCGATGGAAAACAGTTCCATATGTGGAAAACCAGTTCCCAAAG GAGCTTTTATACTAACAAATTTATGGAGTACACATCATGATGAAAGTTATTGGAAAAATC CAATGTGTTTTTATCCAGAACGTTGGTTAGAAAAATCTGGTGAGTTTAATTCTAAGTTAG GGTATGCATTTTTGCCGTTTTCAGGCGGACCTCGTAGCTGTTTAGGAGAAGCACTTGCAA GAACAGAGTTGTTTGTCTTTTTTTCACGATTAGTAACAGATTATCGGTTTGAAAAACCAA ATGGTGAGGAGTTACCGCGTTTGAATGGTCGTTTTGGTCTCACTTGCTCTCCTTTTGACT TTAAATCGGTGGTTGTTCCAAGATGTTAA >1097206642797 related exon 2 61% to 1095897329284 1096761288099 1096082164704 1097567110690 1097672343044 (1) DWSRTMNSLINNDLNATFKVLRKITSSSLKIYAEGLVGMEKRAIEEYTHLNKKLLSLKGQAVSIKNMI (1) AGATTGGAGTAGAACAATGAACAGCCTCATCAATAACGACTTAAATGCAACCT TTAAAGTTTTACGAAAAATAACATCCTCATCATTAAAGATTTATGCGGAAGGATTGGTGG GAATGGAAAAAAGAGCTATTGAGGAATACACCCACTTAAATAAAAAGCTTTTATCATTGA AAGGGCAAGCAGTATCTATTAAAAACATGATTGGT >1097206059080 5 aa diffs to 1095898809307 might be the same gene (1) GPCKPSHIICTILFNHRYDENDQEFQDIIKYSNLSVRASSATSLISSIPWLRFFPSTASR NIYEIIRLRDPILKRKLQEHRSSYDENNLRDVTDSLIKVSLDSALENNSHEKITDDNIEF LLNDFIIAGSETSSNTVLWFIVYMLHWPEYQDKLYNEILKITSGNRYPCLSDRPMLHLMQ AAIHETLRLSSVAPLGVGHKAMESSSICGKPVPKGAFILTNLWSIHHDETHWNNAMSFYP ERWLEKSGEFNLKLGEAYLPFSSGPRSCLGETLAKIELFVFISRLVKDYRFEKPTEEDLP NLKGESGITRTPSEFKVMAIPRN* AGGGCCGTGCAAACCGTCTCACATAATTTGCACAATACTTTTTAATCATCGATATGATG AAAATGATCAAGAATTTCAAGATATCATAAAATATTCAAATTTGTCTGTTAGAGCATCTA GTGCAACCAGTCTTATATCTTCTATTCCATGGTTACGGTTTTTTCCTTCAACTGCTTCAA GAAATATTTATGAAATAATAAGACTTCGTGATCCGATTTTGAAACGGAAACTTCAAGAAC ACCGAAGTTCTTATGATGAAAATAATTTACGCGATGTGACTGATTCCTTAATAAAAGTCT CTTTGGATTCAGCATTGGAAAACAATTCACATGAGAAAATCACAGATGATAACATTGAGT TTCTTTTAAACGATTTTATAATTGCTGGATCAGAAACGTCGTCAAACACTGTTCTTTGGT TTATTGTTTATATGTTGCATTGGCCAGAATATCAAGATAAACTTTATAATGAAATTTTAA AGATAACATCCGGAAATCGTTATCCATGTTTAAGCGATCGCCCTATGCTTCATTTGATGC AAGCTGCAATTCATGAAACACTTAGACTGTCGTCAGTAGCACCTTTGGGTGTAGGTCATA AAGCAATGGAAAGCAGTAGCATCTGTGGTAAACCTGTTCCAAAGGGTGCTTTTATATTAA CAAACTTGTGGAGCATACATCACGATGAGACTCATTGGAATAATGCCATGAGTTTTTATC CAGAACGTTGGCTGGAAAAATCTGGTGAGTTTAATTTGAAACTTGGTGAAGCGTACTTAC CATTTTCAAGTGGACCGCGTAGTTGTTTGGGAGAAACATTAGCTAAAATTGAATTGTTTG TATTTATATCACGGTTAGTAAAAGATTATCGGTTTGAAAAACCAACTGAAGAAGACTTAC CAAACTTAAAAGGTGAATCTGGCATAACTCGCACTCCTTCTGAATTTAAAGTTATGGCTA TTCCAAGAAATTAA >gnl|ti|649393684 1095898809307 45% to 17A1 C-term. No exact matches (1) VYLKLGEAYLPFSSGPRSCLGEALAKIELFIFISRLVKDYRFEKPTEEELPNLKGESGITRIPSEFKVMTIPRN* AGTTTATTTGAAACTTGGTGAAGCGTACTTACCATTTTC AAGTGGACCGCGTAGTTGTTTGGGAGAAGCATTAGCAAAAATAGAGTTGTTTATATTTAT ATCACGGTTAGTAAAAGATTATCGGTTTGAAAAACCAACTGAAGAAGAGTTACCAAACTT AAAAGGTGAATCTGGCATAACTCGCATTCCTTCTGAATTTAAAGTTATGACTATTCCAAGAAATTAA >gnl|ti|646968536 1095898162561 83% to 1095897329284 37% to 2X2 N-term 1096041100060 1097672010393 1096602125478 MILKVIGSIFFPPLIWFVYSYIKHLIECLYYPKGPVPLPFIGNTNLLRKKETCKEFVNLGKIYGDIFGFSIGSIRYVIVNNLEGIHEVLIKKGSQFSGRPRII (1) ATGATTCTTAAAGTCATTGGTAGCATTTTTTTC CCGCCTCTTATTTGGTTTGTCTACAGTTACATCAAACATCTTATAGAATGTTTGTACTAT CCGAAAGGACCAGTTCCTCTACCGTTCATAGGAAATACAAACTTATTAAGAAAAAAGGAA ACTTGTAAAGAGTTTGTTAATCTTGGGAAGATATATGGTGATATTTTTGGATTCAGCATT GGTTCTATTAGATATGTAATTGTTAACAACTTAGAAGGTATTCATGAAGTTTTAATTAAA AAAGGCTCACAATTTTCTGGTCGACCAAGGATTATATGT >1097509072583 new exon 3 boundary wrong (0) LWSYTCDKESGTNLTVLDDLSNLSFDIVGDVGFGYQFNTITSHSSNEFTSAVRNLTKMQI 694 NASVFSKVLITCFPFLVKFLLLFGKRRNLIQIVYKTLNK (2) AGCTTTGGTCATATACATGCGATAAAGA AAGTGGGACAAACCTAACTGTTCTGGATGATTTGTCTAATCTGTCATTCGATATAGTTGG TGATGTTGGTTTTGGGTACCAATTTAACACAATCACTTCTCATTCTAGTAATGAATTTAC TTCAGCTGTTCGGAATTTGACTAAAATGCAAATCAATGCTAGTGTGTTCTCAAAAGTTTT AATAACTTGTTTTCCATTTTTGGTCAAATTCTTGTTATTGTTTGGAAAGCGTAGAAATCT TATACAGATTGTTTATAAAACTTTGAACAAGT >gnl|ti|648014530 1095896049543 41% to CYP21 LKYLDCVVK PANTILRTHVSSIHMNETIYPDPHSFKHERFMTG AGTCCTGCAAATACAATCC TACGAACTCATGTTAGCAGTATACACATGAATGAAACTATTTATCCAGATCCTCATTCAT TTAAACATGAAAGGTTTATGACAGGT >1096082202706 probably the same as 1095896049543 which has errors 1097664076692 1095994179331 (0) NIEVQEKLREDIQKNILDVNNISFEEVMSLKYLDCVVKETLRLHGPAPLLGRRTISATKF GEYEVPANTILRTHVSSIHMNETIYPDPHSFKPERFMT (1) AGAACATAGAAGTTCAAGAGAAACTTAGAGAAGATATCCAGAA AAACATATTGGATGTAAATAATATTTCTTTTGAGGAAGTTATGAGTTTAAAATATTTGGA TTGTGTCGTTAAAGAAACCTTGCGCTTACATGGACCTGCACCACTTTTAGGCAGAAGGAC CATTAGTGCAACAAAATTTGGTGAATATGAAGTTCCTGCAAATACAATCCTACGAACTCA TGTTAGCAGTATACACATGAATGAAACTATTTATCCAGATCCTCATTCATTTAAACCTGA AAGGTTTATGACAGGT >1097309000937 1097206907008 1095901911044 MFLVCLALIVLFIGLFLLCYLLKRTFHPLRLLPSPKEQLITGHNRYFHGRDHTSTYLSFN 858 EKFKEEGLCTLDTLY (1) ATGTTTCTAGTATGTCTAGCACTCATAGTTTTATTTATTGGATTA TTTTTACTGTGTTATTTATTAAAACGTACCTTTCACCCTCTTCGACTTTTACCATCACCA AAAGAACAACTTATTACTGGTCATAATAGGTACTTTCACGGCCGCGACCATACTAGCACC TATTTGAGTTTCAACGAAAAGTTTAAAGAAGAAGGTTTATGTACGCTAGATACATTATATGGT >1096091465110 88% to 1097331817678 1096625274441 1096123742264 1097265046825 1095964362241 MFLICLALLILSIGLFFLRYLLKRIFHPLQLLPSPKEQLITGHISHFQGRDHSNTFLGF 701 NEKFKEEGLCTLDTLY (1) >1097331817678 1096526275245 1096124165677 1096110023112 1096761988512 1096701884902 walked down from end of 1096526275245 walked farther from end of 1097672563082 ran into a repeat region MFVICLALITLFIGLFFLRCLLKRIFHPLRLLPSPKEHLITGHISHFQGRDHSNTFLSFNEKFKEE GLCTLDTLY (1) ATGTTTGTGATATGTCTAGCACTCATAACTTTGTTTATTGGATTATTTTTCCTGC GTTGTTTATTAAAACGTATCTTTCACCCTCTTCGATTATTACCATCACCAAAAGAACATC TCATTACTGGTCATATTAGTCACTTTCAAGGCCGTGACCATTCTAACACCTTTTTGAGCT TCAACGAAAAATTTAAAGAAGAAGGTTTATGCACGCTAGATACATTATATGGT >1095899139433 1096703930092 1097509100606 1097675339850 new exon 2 VPRYVYLIAPEFIKKIFADGKLFQRPTTLKILAPLIGNSMLGSNYEDHHWQRKLFNGAFT 549 SQQLKNYFPAFLKHTNLLMK AGTGCCCAGGTATGTTTATCTAATTGCTCCAGAATTCATTAAAAAGATATT TGCAGATGGGAAACTTTTTCAAAGGCCTACTACATTAAAAATCTTGGCACCATTAATTGG AAACAGCATGCTTGGTTCAAATTACGAAGACCATCATTGGCAAAGAAAGTTATTCAATGG AGCATTTACTTCACAACAGCTGAAAAATTATTTTCCTGCATTTTTAAAGCATACTAATTT GCTAATGAAAGT >new exon 2 1095899339221 1097206043402 1097672369437 possible frameshift/insertion (1) GFKFIYLLMPEYIKTMVSNGKVFQKSTAMKVIFPLVGNGMLVSNYEHHHWQRKLFNEAFS AQQLKKYFPAFKEHT DLLIK (0) AGGGTTCAAATTTATTTACCTTTTAATGCCAGAATATATTAAAACAA TGGTTTCTAATGGCAAGGTTTTTCAAAAATCGACTGCAATGAAAGTTATATTTCCTCTAG TTGGCAACGGTATGCTTGTGTCAAATTATGAACATCACCATTGGCAAAGAAAATTATTTA ATGAAGCATTTTCTGCACAACAGTTAAAAAAATATTTTCCTGCATTTAAAGAGCATACTA ATAAAAGATTTACTAATAAAAGT >1095964240637 1097516021618 1096705343938 1095900018167 1096607016658 new exon 2 (1) GFRFVDLLLPEFIKTIFSDGKVFHRSNVLKVLFPLVGNGMIVSNYEDHHWQRKVLNEAFT 854 SQQLKNYFPAFTLHTDLLMK (0) AGGTTTCAGATTTGTTGATCTATTATTGCCAGAATTTATTAAAACAA TATTTTCTGATGGTAAAGTTTTTCACAGATCGAATGTTTTGAAAGTTTTGTTTCCTCTAG TTGGAAATGGTATGATTGTATCAAATTATGAAGATCATCATTGGCAAAGAAAAGTTTTAA ATGAAGCTTTTACCTCCCAACAGCTAAAGAATTATTTTCCTGCTTTTACATTGCATACTG ATTTGCTAATGAAAGT >1097675832709 new exon 1 with one possible frameshift or there is another exon MCMVYIAVLILLCLIVFF ANVLKRFYHPLRNFPSPQENLITGHYSYFYRYDHVKTLLNFGKQFEKNGLYTLDTLN (1) ATGTGTATGGTTTATATAGCAGTATTGATTTTAT TATGTTTAATAGTATTCTTTGCTAATGTTTTAAAGCGTTTTTATCATCCGCTTCGTAAT TTTCCCTCACCTCAAGAAAATTTAATTACAGGCCATTATAGCTATTTTTATCGTTATGAT CATGTCAAGACTTTGTTAAATTTTGGAAAGCAGTTTGAAAAGAATGGCTTATATACATTA GATACATTAAATGGT N-terminal EST sequences for hydra P450s >DN812964.1 ACAC-aac48b12.g1 Hydra EST UCI 7..same as DN812371.1 MYTIGIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFG 280 KQFKERGLYTLDTLN >DN810769.1 ACAC-aac19b13.g1 Hydra EST UCI 7.. same as DN812371.1 MYTIGIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFG 256 KQFKERGLYTLDTLN >DN816152.1 ACAC-aac24b14.g1 Hydra EST UCI 7.. same as DN812371.1 IAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFKE 199 RGLYTLDTLN >CN775805.1 tae77f11.x1 Hydra EST Darmstadt .. same as DN812371.1 IAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFKE 185 RGLYTLDTLN >BP514308.1 BP514308 Hydra magnipapillata c...have this one MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFG 208 KEFKDYGLYTINTL >BP514307.1 BP514307 Hydra magnipapillata c...same as BP514308.1 MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFG 208 KEFKDYGLYTINTL >BP505238.1 BP505238 Hydra magnipapillata c... same as BP514308.1 MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFG 209 KEFKDYGLYTINTL >CO509836.1 tai58f02.y1 Hydra EST UCI 5 ALP .. same as BP514308.1 IYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFGKEF 181 KDYGLYTINTL >DN813094.1 ACAC-aab89g09.g1 Hydra EST UCI 7..= 1097675463974 MFLICLALLILSIGLFFLRYLLKRIFHPLQLLPSPKEQLITGHISHFQGRDHSNTFLGFN 303 EKFKEEGLCTLDTL >DN603400.1 ACAC-aac10m18.g1 Hydra EST UCI 7..= 1097675463974 same as 1096091465110 DN813094.1 DN137655.1 MFLICLALLILSIGLFFLRYLLKRIFHPLQLLPSPKEQLITGHISHFQGRDHSNTFLGFN 283 EKFKEEGLCTLDTL >DN137655.1 ACAE-aaa07c04.g1 Hydra EST UCI 5.. ..= 1097675463974 same as 1096091465110 DN813094.1 DN137655.1 LICLALLILSIGLFFLRYLLKRIFHPLQLLPSPKEQLITGHISHFQGRDHSNTFLGFNEK 192 FKEEGLCTLDTL >CN567598.1 tag12b09.x1 Hydra EST -Kiel 1 Hy..we have this one LLKRIFHPLRFLPSPKEQLITGHINHFQGRDHSSTYLSFNEKFKEESLCTLDTLH >CX833403.1 ACAC-aaa40d06.g1 Hydra EST UCI 7.. 91% to 1095899139433 exon 2 IIFTVLFW*RTFHPLQLLPSPKEQLITGHNMYFHGRDHTSTYLSFNKQFKK*GLCTQHTLX VPRYVYLIAPQFITKIFAYGKLFQRPTTLKILAPLIGNSMLGSNYKDHHWQKKLFNGAFT 431 SQQLKNYFPAFLKHTT*LMKHWSYTCDKESGTNLTVLDDLSNLSFNIVGDVGFLGFGYQFTQ ITSHASNEYTS >1097675877620 new exon 1 with one possible frameshift or there is another exon MYMICIAAIVILCFL VLAVMLKRFYYPLCMLPSPKENLFTAHYRYFYGHDHINAFLNFQNQFKDYGLYTLDLLLG (1) ATGTATATGATTTGCATAGCAGCCATTGTTATATTATGTTTTCTC GTCCTTGCTGTTATGTTAAAACGTTTTTATTATCCGCTTTGTATGCTTCCATCACCCAAA GAAAATTTATTTACAGCTCATTATAGATATTTTTATGGTCATGATCATATCAACGCTTTT TTAAATTTTCAAAACCAGTTTAAAGACTATGGCTTGTATACATTAGATTTATTACTTGGT >1095901177607 new exon 1 only 5 aa diffs from 1097675877620 all three of these sequences seem to have a frameshift. So this is probably evidence for another upstream exon. They almost certainly dont have a frameshift MYMICIAAIVILCFL VLAVMLKRFYYPLCMLPSPKENLFTAHYRYIYGHDHINAFLYFQNQFKEYGLYTLDIFL ATGTATATGATTTGCATAGCAGCCATTGTTATATTATGTTTTCTC GTCCTTGCTGTTATGTTAAAACGTTTTTATTATCCGCT TTGTATGCTTCCATCACCCAAAGAAAATTTATTTACAGCTCATTATAGATATATTTATGG TCATGATCATATCAACGCTTTTTTATATTTTCAAAACCAGTTTAAAGAATATGGCTTGTA TACATTAGATATATTTCTTGGT >DN813094.1 ACAC-aab89g09.g1 Hydra EST UCI 7..= 1097675463974 MFLICLALLILSIGLFFLRYLLKRIFHPLQLLPSPKEQLITGHISHFQGRDHSNTFLGFN 303 EKFKEEGLCTLDTLY (1) ATGTTTCTGATTTGTCTAGCACTTTTAATTTTATCTATTGGATTATTTTTTTTGCGT TATTTATTAAAACGTATCTTTCACCCTCTTCAACTTTTACCATCACCAAAAGAACAACTC ATTACTGGTCATATTAGTCACTTTCAAGGCCGCGACCATTCCAACACCTTTTTGGGTTTC AACGAAAAGTTTAAAGAAGAAGGTTTATGTACGCTAGATACATTATATGTGCCCAGGTAT >1097675463974 mate pair to exon 2 1097675525814 also 1096123961892 1097329363407 1097509311387 1096064006710 1097509202292 1096703858851 1097675514139 1096607047135 walked up to 1096526789565 continued on 1097516017620 (1) VPRYVYLIAPEFIKKIFADGKLFQRATSLKVLAPIIGNSMLTSNYEDHHWQRKLFNGAFT 565 SQQLKNYFPAFLTHTDFLMK (0) (1) AGTGCCCAGGTATGTTTATCTAATTGCTCCAGAATT TATAAAAAAGATATTTGCAGATGGAAAATTTTTTCAAAGAGCTACTTCATTAAAGGTTTT GGCACCTATAATTGGAAATAGCATGCTTACTTCAAATTACGAAGACCATCATTGGCAAAG AAAGTTATTCAATGGAGCATTCACTTCACAACAGCTAAAAAACTACTTTCCTGCATTTTT AACGCATACTGATTTTCTAATGAAAGTAAGTTTTGATAATATTTAGTTATAGTTTTGTTG TTTTTATTATAATAATGCAAAACAAATTCTTTTAGCTTGTAAGTACATGGT >gnl|ti|647058148 1095898198167 I-helix mate pair = 1095898261914 1095898261914 1097206705284 1096761285028 1096761249195 1097206596155 1097675525814 1097675035030 1097329367631 1097206911388 1097206828844 1097460197276 LWSYTSDKESGTNLTVLDDLSNLSFDIIGDVGFGYQFNTINSHSGNEFT SAFRYLTELQHNASVFSKVLISCFPFLAQFLLLFGKRRKLIQVVHKTLNK (0) 1097664219266 1097664219266 1096123852196 1097206273255 these have 2 aa diffs 1097331817534 1097206329872 1096526011941 (0) LIEKRKKEIDDGISTEEKDIITIVLKDQQQESSKLTNDLIRDNLLLFLIAGHETTSTAMTWCLYMLGT (0) AGCTTTGGTCTTATACAAGTGATAAAGAAAGTGGGACAA ACTTAACTGTTTTGGATGATTTGTCTAATCTGTCCTTTGATATAATTGGTGATGTTGGTT TTGGCTACCAATTTAACACAATCAACTCTCATTCTGGTAATGAATTTACATCAGCTTTTA GATATTTGACTGAACTGCAACATAATGCTAGTGTGTTCTCAAAAGTTTTGATAAGTTGTT TTCCGTTTTTGGCGCAATTTTTGTTATTGTTTGGAAAACGTAGAAAACTTATACAAGTTG TCCATAAAACTTTGAATAAGGT 1096123183594 1097335028467 mate pair 1097335034435 = 100% match to 1095898198167 1097325031454 1097509311387 (0) NLEVQEKLREEIQKNILDKKNITFEEILSLKYLDCVVKETLRLHGPAPILGRRNINATKF 957 GEYEVPANTVLRTH VSSLHMNETIYPDPHSFKPERFMT (1) AGAACTTAGAAGTTCAAGAAAAACTTAGAGAAGAGATCCAGAAAAATATATTGGATAAAAAAAAT ATTACTTTTGAAGAAATCTTGAGTTTGAAATACTTAGATTGTGTCGTTAAAGAAACCTTG CGCTTGCATGGACCAGCACCAATTTTAGGCAGAAGAAACATTAATGCAACAAAATTTGGC GAATATGAAGTTCCTGCCAACACAGTACTACGAACTCAT EST = DN137322.1 extends to end of gene = 1096071008743 1097672372091 1097509003243 1097329517617 LLFLIAGHETTSTAMTWCLYMLGT NLEVQEKLREEIQKNILDKKNITFEEILSLKYLDCV 579 VKETLRLHGPAPILGRRNINATKFGEYEVPANTVLRTH VSSLHMNETIYPDPHSFKPERFMT 1096625230620 1096705372268 1097675934526 1097622179423 1097265071715 1097675205779 GEIPATFYLTFGHGIYNCIGKNFALLEIKTFLVKALLQFEFSVDPEHISYKKFIWLTTITAEPLSIRVKPIAD* GTTAGCAGTCTACACATGAATGAGACTATTTATCCAGATCCTCATTCGTT TAAACCTGAAAGGTTTATGACAGGCGAAATACCAGCAACATTCTATCTTACTTTTGGGCA CGGTATATATAACTGTATTGGAAAGAATTTTGCTTTGCTTGAAATCAAAACATTCTTGGT CAAAGCGTTGTTGCAATTCGAATTTTCTGTTGACCCTGAACATATCAGTTACAAAAAGTT TATTTGGTTAACTACGATAACAGCAGAACCATTGTCAATTAGAGTAAAACCTATTGCAGATTGA >1096703991752 1096123270489 1096526190394 probably same as 1096625230620 GEIPATFYLTFGHGIYNCIGKNFALLEIKTFLVKALLQFEFSVDPEHISYTKFVWLTTXXXXXXIRVNLIAD* AGGCGAAATACCAGCAACATTCTAT CTTACTTTTGGGCACGGTATATATAACTGCATTGGAAAGAATTTTGCTTTGCTTGAAATC AAAACATTCTTGGTCAAAGCGTTGTTGCAATTCGAATTtTCTGTTGACCCTGAACATATCA GTTACACGAAGTTTGTTTGGTTAACTACGXXXXXXXXXXXXXXXXXXGTCATTAGAGTAAACC TAaTTGCAGATTaa >these have 2 aa diffs from 1097675463974, 1097331817534 1097206329872 1096526011941 (0) LIEKRKKEIDDGISTEEKDIITIVLKDQQQESSKLTNDLIRDNLLLFLVAGHETTSNAMTWCLYMLGT (0) >1096071008743 84% to CN567799 1096602116307 1096123983311 1096124057195 886 (0) NLEVQDKLREEILKNILDVNNISFEEVMSLKYLDCVVKETLRLHGPAPILRRRTMNAIKF 707 706 GEYEVPANTVLQTHISSLHMNETIYADPHLFKPERFMT (1) 593 >gnl|ti|648470985 1095898761545 N-term mate pair = C-term 1095899295538 1097264057439 extends N-term down 1097325864056 joins N and C-terms 681 MFLVYSLLVVIFSYFLIKISWKLWIYSYGLSTVPTPPTIPFFGNCLQLESDSVKFNKQI 854 855 REWSKIYGNVFCVWIGLTPMIYSSSVNFSEAILSSQKVLKKASVYEFLYEWLQTGLLTSTGNK WKLRRRLLTPSFHFSILNNFLKIFEEQGACLVDKLRIYAKSGGNFDIQVPIGLATLDIICETSM GVKVNAQSHPDSEYAKAIGILSEEIPKRIKYPWLWPDIIYKHLACGKRYYKALDVAHKLSLDVI KERVKTLIQNKSEVTSNKNKK ATGTTTTTGGTGTACAGTCTATTGGTTGTTATTTTTTCATACT TTTTAATTAAAATATCTTGGAAACTTTGGATTTATTCTTATGGTCTCTCGACTGTTCCAA CACCTCCAACCATACCATTTTTTGGCAATTGTCTTCAGCTTGAAAGTGATTCTGTAAAGT TTAACAAACAAATACGCGAGTGGAGCAAAATATACGGAAATGTTTTCTGCGTTTGGATAG GCCTTACGCCAATGATATACTCATCTTCTGTAAATTTCTCGGAAGCAATCTTAAGCAGTC AAAAAGTCCTCAAAAAAGCATCTGTTTATGAATTTTTGTATGAATGGCTTCAAACCGGG TTACTGACAAGCACAGGAAATAAG TGGAAACTGCGTCGTCGACTTCTTACACCAAGCTTTCATTTTTCTATACTCAATAATT TTTTAAAAATTTTTGAAGAGCAAGGAGCTTGTTTAGTTGATAAATTACGTATTTATGCCA AAAGTGGTGGAAATTTCGATATCCAGGTACCTATTGGATTAGCAACTTTAGATATAATAT GCGAGACATCAATG GGAGTAAAAGTAAATGCACAGAGTCACCCAGACTCAGA GTATGCTAAAGCCATCGGTATATTAAGTGAAGAAATACCAAAAAGAATTAAGTACCCATG GTTATGGCCAGATATTATTTATAAACATCTTGCTTGTGGAAAAAGATATTATAAAGCTCT AGATGTTGCTCATAAATTATCTCTTGATGTAATAAAAGAAAGAGTTAAAACACTTATTCA AAATAAAAGCGAGGTTACATCAAATAAAAACAAAAAA GAATCAGGCTCTGAAAAAAAAAA ATTTTTTTTAGACTTATTGTTAGATATGCATAAAAAAGGTGAAATTGATACTGAAGGGAT TCAAGAAGAGGTTGATACTTTTATGTTTGAAGGTCATGATAGCACTTCATCAGCATTAAG CTGGATGCTGTGGTTGCTAGGAAGATATCCACAAGTTCAACAGAAACTGCATTCAGAAAT TGATGAAGTGgAA 1095899295538 1096703530556 1096705948493 1096526527227 1096625218937 1097191001062 ESGSEKKKFFLDLLLDMHKKGEIDTEGIQEEVDTFMFEGHDSTSSALSWMLWLLGRYPQVQQKLHSEIDEVE LTGGSLYEKVRNFKYLENVVKESMRIHPPVPLIGRHIEEDMVIDGQFVPKSSEIVLLVMM MQSSPEYWKDPYDFIPERFEQEDFVKRNPYIYIPFSAGPRNCIGQKFAMIEEKMLLYIIM KNFYVQSIQNENEILLALNIIHKSSNGIIMKFTER* TTAACTGGAGGTTCACTTTATGAAAAAGTAAGAAACT TTAAATATCTTGAAAACGTTGTAAAAGAAAGTATGCGAATTCACCCACCTGTTCCTTTAA TTGGCAGGCATATTGAAGAAGACATGGTAATTGATGGTCAGTTTGTTCCTAAAAGTTCTG AAATTGTTTTACTTGTAATGATGATGCAATCAAGTCCTGAATACTGGAAAGATCCATATG ATTTCATACCTGAAAGGTTTGAACAAGAAGATTTTGTTAAGCGCAATCCATATATCTATA TTCCATTTTCAGCAGGTCCAAGAAACTGTATTGGTCAGAAGTTCGCAATGATTGAAGAGA AAATGCTGTTATATATCATAATGAAAAACTTTTACGTCCAATCCATCCAGAATGAAAATG AAATACTTCTTGCTCTAAATATTATACATAAATCGAGTAATGGTATCATAATGAAATTCA CTGAAAGATGA >1096083942127 1097329109827 clearly best match to 4V sequences MAFILLIFFLLLITLFLIWIYWVRSYNLNFVPSPLRFPLFGCALFLKSESH ELFKQVRWFFSEFGSAFCLWIGPKPVLMTGNIDHIQTVLKSQKIITKSSSYTFLNE WLGTGLLTSTGAKWKSRRKVLTKAFHFSIINSYVDSFYQNSVSLSNHLENHSGVPINIQA LMSLFTLDIICETAMGFKLNSMKNLNCDYVNAVEEVKILLIERQKSPWLWNKFVYKLFSS GKKFYTQLQVLKSFTKKIVNKRIKNYSLSSNGCKSFLDLLIDAYNQGKIDLEGIYEEVDT FMFAGHDTTAAALSYIFLMLGTHPKVQKKLHEEIDTNVNINSYENLSEKIRKMEYLDCVI KESLRLHPPVSVFGRILEDDTIFSNHLVGKGADIVLCPETLHTDPLYWENHRSFIPERFS NVEFAFCQPYLYIPFSAGPRNCIGQKFALMEIKIAIFVVMSKFIVTAVEQCLSPM ATFIQRYENGVLMLFEDEKRFLYML* >1097329374310 no introns very similar to 1095899295538 seq 1096608398403 1096761840588 1097460256370 67% to 1095958068757 88% to 1095898761545 MFIAYSLLVVVSLYFVIKLFWK FWIYSYGLSTVPTPPTIPFFGNSLQLESDSVKFNKQLCEWSKIYGNVFCVWVGLR PTIFSSSVNFSEAILSSQEVLKKASIYEFLHDWLKTGLLTSTGNK WKLRRRLLTPSFHFSILNNFLKIFEEQGACLVDKLRTYAKSGENFDIQVPIGLATLDIICETSMGVKVNAQSHP DSAYVKAINILSEEIPRRFKYPWLWPDIIYKHLACGKRYYKALDVAHKLSLDVINERIETLFQNE NNVTTNKNKEVSSEKKKFFLDLLLDIHKKGEIDTEGIQEEVDTFMFEGHDTTSSALSWIL WLLGRYPQVQQKLHSEIDEVELTGGSLYEKVRNFKYLENIIKESLRIHPPVPLIGRHIEK DMVIDGQFIPKKSEIGVLVMMMHSSPEYWKDPYDFIPERFEQEDFVKRNPYIYIPFSAGPRNCIG QKFAMIEEKMLLYSIMKNFYVQSMQNENEILPSLDLIRKSVNGIILKLTER* ATGTTTATTGCGTACAGTTTGTTGGTTGTAGTTTCTTTATACTTTGTAATTAAATTATTTTGGAAG TTTTGGATTTATTCTTATGGTCTCTCGACTGTTCCAACACCTC CAACCATACCATTTTTTGGGAATTCTCTTCAACTTGAAAGTGATTCTGTTAAGTTTAATA AACAACTATGCGAGTGGAGCAAAATATACGGAAATGTGTTCTGTGTTTGGGTAGGCCTTA GGCCAACTATTTTCTCATCTTCTGTAAATTTCTCGGAAGCAATTTTAAGCAGTCAAGAAG TCCTTAAAAAAGCATCAATTTATGAATTTTTGCATGACTGGCTTAAAACTGGATTACTAA CAAGCACAGGAAATAAGTGGAAACTGCGTCGTCGACTC CTTACACCAAGCTTTCATTTTTCTATACTCAATAATTTTTTAAAAATT TTTGAAGAGCAAGGAGCTTGTTTAGTTGATAAACTACGTACTTATGCCAAAA GTGGTGAAAATTTTGATATCCAGGTACCTATTGGATTAGCAACTTTAGATATAATATGTG AGACATCAATGGGAGTAAAAGTAAATGCACAGAGTCACCCAGATTCAGCGTATGTTAAAG CCATTAATATTTTAAGTGAGGAAATACCAAGGAGATTTAAATACCCATGGTTGTGGCCAG ATATTATTTATAAACATCTTGCTTGTGGAAAGAGATATTATAAAGCACTAGATGTTGCTC ACAAATTGTCTCTAGATGTAATAAATGAAAGAATTGAAACACTTTTTCAAAATGAAAACA ATGTTACCACAAATAAGAACAAAGAAGTTAGCTCAGAAAAAAAAAAGTTTTTTTTAGACC TACTGTTAGATATACATAAAAAAGGTGAAATTGATACTGAAGGGATTCAAGAAGAGGTTG ATACTTTTATGTTTGAAGGTCATGATACCACCTCATCAGCATTAAGCTGGATACTTTGGT TGCTAGGAAGATATCCACAAGTTCAACAGAAACTGCATTCAGAAATTGATGAAGTTGAAT TAACCGGAGGTTCACTTTATGAAAAAGTAAGAAACTTTAAATATCTAGAAAACATCATAA AAGAAAGCCTGCGAATTCATCCGCCTGTTCCTTTAATTGGCAGACATATTGAAAAAGATA TGGTAATTGATGGTCAGTTTATTCCTAAAAAATCTGAAATTGGTGTTCTTGTCATGATGA TGCATTCAAGTCCTGAATATTGGAAAGATCCATATGATTTCATTCCTGAAAGGTTTGAAC AAGAAGATTTTGTTAAGCGCAATCCCTATATCTATATTCC ATTTTCTGCAGGTCCGAGAAATTGTATTGGTCAGAAGTTCGCAATGATTGAAGAGAAAAT GTTGTTATATAGCATAATGAAAAACTTTTACGTCCAATCCATGCAGAATGAAAATGAAAT ACTTCCTTCTCTAGATCTTATACGTAAGTCGGTTAATGGTATCATATTAAAACTTACTGA ACGATAA >1095964281471 1097672357643 1097675038710 1096526281478 1097675573844 MFSNIKMIYTLCIIICGFYFLIKILWMCWKYSYGLTSIATPPNTPFLGTSFYFLSDS RKSYFQLCNYTKQFGNVFCIWLGPKPMIVSSSVKFLKAVLSSEKITTKGFSYDWIHDWLK TGLLTSSGPKWKARRKLL TSSFHFSVFNRLKIIIEEQACILVDKISFAADNKKVVDVQTLIGLATLDVICETIMGVKINAQ 780 SYPDSEYVKAISVLHKEIVNRMKFPWLWFDVIYKLLPCGKRFYKALDVAHKFTFDIINKR 600 MEISVNESYIDTPLEEKSYFLDLLLNIHKKKEIDMEGIQEEVDTF IFAGHDTISVALSWTLWLLGKYSEIQRKLHKSIDEIELNGGSLFEKVRNFKYLENII KESMRIHPPVPMYGRTVEENMTIDGQFVPKGAQIILLVLMLHSDPNIWENPKEFIPERFE TDDWKIKNSYSYLPFSAGSRNCLGQKFAMIEAKMLLYSIM KKFSLKSMQDENEVYGTVDILHKSINGINILFTRR* >gnl|ti|648478468 1095898788708 N-term EST = CV564880.1 1097672125473 1097509103730 1096123847153 1097664004740 1097329293298 1096092407854 1097325278081 1097675392269 1097672158546 1097206129107 1095899351259 76% to 1097329374310 similar to 4V5 MFLTFMFLFLIYFLIKVFWKLWIYSYGLSTVSTPPTLPLFGNCLQIKSDPVKASKQL FEWSRVYGKVFCVWVGIRPTIFSSSVNFSEAILSSQKIIQKGFVYNFLHEWLKTGLLTST GNKWKLRLRLLTPSFHFSILNNFLKIFEEQGNCLIDKFRVLAQNGKYFDIQVPIGLATLD IICETSMGVKINAQYQPDSEYVTAINILSEEIVRRFKYPWLWPNIFYKHFSCGKRYFKAL DIAHKLSLNVIHERIQTSLQNESENVLINKLDNKSVLNNEEELGVRKKRFFLDLLLDMHK KGEIDVDGIQEEVDTFMFEGHDTTSS AMCWTLWLLGRYPQIQQKLHAEVDEVELTSGSLYEKVRNFKYLE NVLKESLRLHPPVPLISRYIEEDMMIDGQFIPKKSEIAILVMMIHLNPEYWKDPHSFIPE RFDQDDFVKRNPYTYIPFSAGPRNCIGQKFAMIEEKMLLYNIMKHFYVESMQNENEILRT QDLISKSANGIMMKFYER* ATGTTTTTAACTTTTATGTTTTTGTTTCTTATTTATTTTCTAATTA AAGTATTTTGGAAGCTTTGGATTTATTCTTATGGCCTGTCAACTGTTTCTACACCTCCCA CATTACCATTATTTGGCAATTGTCTTCAAATCAAAAGTGATCCTGTAAAAGCCAGCAAAC AACTATTCGAGTGGAGCAGAGTATACGGAAAAGTGTTTTGTGTTTGGGTTGGCATTCGGC CAACTATATTCTCATCTTCTGTTAATTTTTCCGAAGCAATTTTAAGCAGTCAAAAAATAA TTCAAAAAGGATTTGTGTACAATTTTTTGCATGAATGGCTTAAAACTGGTCTACTAACAA GTACGGGAAATAAGTGGAAATTGCGTCTTCGACTTCTAACGCCAAGCTTTCATTTTTCTA TACTCAATAACTTTTTAAAAATTTTTGAAGAGCAAGGAAATTGTTTAATTGATAAATTTC GCGTTCTTGCCCAAAATGGAAAATATTTTGATATTCAGGTGCCTATTGGGTTAGCTACAT TAGATATAATATGCGAGACGTCAATGGGAGTGAAAATAAACGCGCAGTATCAGCCAGATT CCGAATATGTTACTGCCATTAACATCTTAAGTGAGGAAATAGTTAGACGGTTTAAGTACC CGTGGTTGTGGCCAAATATTTTTTATAAGCATTTTTCTTGTGGAAAACGGTACTTTAAAG CATTAGACATTGCTCATAAACTGTCTCTTAATGTAATTCATGAAAGAATTCAAACTAGTT TACAAAACGAAAGTGAGAATGTGTTAATCAATAAACTTGACAATAAGAGCGTGTTGAACA ATGAAGAGGAACTCGGTGTACGTAAAAAGAGGTTTTTCTTAGATTTATTGTTAGACATGC ATAAAaAAGGTGAAATT GATGTTGATGGGATTCAAGAGGAGGTGGATACATTTATGTTTGAAGGTCACGACACCACC TCATCAGCAATGTGTTGGACATTATGGTTGCTGGGAAGATATCCACAAATTCAACAGAAA CTGCATGCTGAAGTTGATGAAGTTGAACTAACTTCGGGTTCACTATATGAAAAAGTACGA AACTTTAAATATCTTGAAAATGTTTTAAAAGAAAGCCTGAGACTTCATCCACCAGTTCCC TTAATCAGTAGGTATATTGAAGAAGATATGATGATTGATGGTCAGTTTATTCCTAAAAAA TCTGAAATCGCTATTCTTGTGATGATGATACACTTAAATCCTGAGTATTGGAAAGATCCT CACAGCTTTATACCTGAAAGATTTGATCAAGATGATTTTGTAAAGCGTAATCCATACACT TACATTCCATTCTCCGCTGGCCCTAGAAATTGCATTGGTCAAAAGTTTGCAATGATAGAA GAAAAAATGCTGTTATATAACATAATGAAACATTTTTATGTAGAATCCATGCAGAATGAA AATGAAATTTTAAGAACTCAAGATCTTATAAGTAAATCAGCTAATGGTATCATGATGAAGTTCTATGAAAGATGA >Combined CN627429 CN775805 27% to 4T5 [gene 6] GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFK ERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNGIFVSNYEDHH WQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDK DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNAVHKALLA FFPFLMHLSFMYGKRKRAEQVICNTLNM LINKRKKEIDHRIAADQKDFLTVVLK DQQKEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK >EST DN812371.1 joins with CN627429 CN775805 and 1095901729505 1097325001902 CN776982 and CN770283 1097206896815 1096110026952 1096110107596 MYTIGIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFKERGLYTLDTLN GFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNGIFVSNYEDHH WQRKVLNEAFTLQQLKNYFPAFTVHIDLLMK LWSYSCDKDNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQL RFQLNAVHKALLAFFPFLMHLSFMYGKRKRAEQVICNTLNM LINKRKKEIEDGIAADQKDFLTVVLKDQQKEGSKMTNDLIKDNLMTLLIAGHETTSTAMQWCLYMLGT (0) NLGVQNKLREEIKKNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTKFGDFDV PAGSFLRIPIDSAHMNESVYHDPYSFRPERFLTGEIPPLSFLTFGQGIYNCIGKNFALLE IKTFLVKALLQFEFSVDLKHLNYKKLISITNKTVEPLWIRVKPI* EST matches 1097325001902 1097265052814 for exon 1 1 tcggcatagc agtattaatt tttttgtgtt tttcactgtt ttttgctaat attttaaaac 61 gtttttatca tccgcttcgt aagttgccat cacctaaaga aaatttcttt actgctcatt 121 atggctactt taatggctat gatcaaataa atgctgtaat aaattttgga aaacagttta 181 aagagcgtgg cttgtataca ttagatacat taaat ggatt tagatttgtt aatcttttaa 241 tgccagaatt tattaaaaca gtgttttctg atggaaactc attccaaaga tcgaccgcta 301 caaaagttat atttcctcta gttggaaatg gtatttttgt gtcaaattat gaagatcatc 361 attggcaaag aaaagtgtta aatgaagctt ttactttaca acagctaaaa aattattttc 421 cagcttttac agtgcacatt gatttgctaa tgaaactttg gtcatattca tgtgacaagg 481 ataatggtac taacataatt gttttggatg acttatctaa tttatcattt gatataattg 541 gggatgttgg ttttggctat ca >1096526100337 74% to CN776982 1096526100337 1097206278072 1096123494736 (0) NLDVQNKLREEIKKNVFDIKSILREEVLSIKYLDCVVKETLRMHPPASFISRKNKTETKL 308 GDYDIPAGTFLRISINNVHMNESVYPDPYLFKPERFMT (1) 1095898850029 same as 1096520314506 = mate pair match to 1096526207508 1097206730806 DEIPPSSFLSFGQGIYNCIGKNFALLEIKTFLVKALLHFEVSVDPSHVNYTKQILLTLNTVEPIWIRVKSIEE* AGAATTTAGATGTTCAAAACAAACTAAGAGAAGAAATAAAGAAAAA TGTCTTTGATATAAAAAGTATTTTACGGGAAGAAGTTTTAAGCATCAAGTACTTGGATTG TGTAGTTAAAGAGACATTACGCATGCATCCACCTGCGTCATTTATATCTCGAAAAAACAA AACTGAAACAAAGTTGGGTGATTATGATATACCTGCTGGCACGTTTTTAAGAATTTCAAT TAACAACGTACATATGAATGAGTCTGTTTATCCTGATCCTTATTTATTTAAGCCGGAACG ATTTATGACAGGT AGATGAAATACCACCATCGTCTTTTCTCTCATTTGGGCAAGGTATTTATAATTGTATTGGAAAGAAT TTTGCTTTGCTTGAAATTAAAACGTTTTTGGTTAAAGCATTATTACATTTTGAAGTTTCT GTCGACCCAAGTCATGTGAATTATACAAAACAGATTTTGTTAACTTTAAATACCGTTGAA CCCATTTGGATAAGAGTGAAATCTATTGAAGAATAA >1097696222067 new exon 6 1097375001145 1097672638278 1096041191032 1097678083218 GEIQPYSYLTFGQGIFNCIGKNFALLEIKTFLVKALLQFEFSVDLEHMNYIKKIFISTKTVEPLWIRVKPI* AGGTGAAATACAACCAT ATTCCTACCTCACATTTGGGCAAGGTATTTTTAATTGTATTGGAAAGAATTTTGCTTTGC TTGAAATTAAAACATTCTTGGTCAAAGCGTTGTTGCAATTCGAATTTTCTGTTGACCTTG AGCATATGAATTATATAAAGAAAATTTTCATTTCTACTAAAACTGTTGAACCGTTATGGA TAAGAGTGAAACCTATATAA >1097331770349 new exon six with stop, no other exact matches DEIPSSSYLTFGYGIYNCIGKNFALLEIKTFLIKAL*QFEFLVDPEQLSYKKQISIST 330 KTAEPLWIRVKSI* AGATGAAATACCATCTTCATCCTAC CTTACATTTGGGTATGGTATTTATAACTGTATTGGAAAGAATTTTGCTTTGCTTGAAATT AAAACATTTTTGATTAAAGCGTTGTAACAATTTGAGTTTTTGGTTGACCCTGAGCAATTA AGTTATAAAAAGCAGATTTCAATTTCTACTAAAACAGCTGAACCGTTATGGATAAGAGTA AAGTCTATATAA >1097329444796 new exon 6 no other exact matches, most like CN567799 GEIPASFYLPFGHGVYNCIGKNFALLEIKTFLVKALLQFEFSVDPKNINYTKVIWLTTRTVEPLLIRVKPLQPV AGGCGAAATACCAGCATCGTTCTATCTTCCTT TTGGACACGGTGTTTATAACTGCATTGGAAAGAATTTTGCTTTGCTTGAAATTAAAACAT TTTTGGTCAAAGCGTTGTTGCAATTCGAATTTTCTGTCGATCCTAAGAATATAAATTATA CAAAGGTTATTTGGTTAACTACGAGAACAGTTGAACCATTGCTTATAAGAGTAAAGCCAT TACAGCCCGTAC >1097325113147 1097690001285 1097942838551 GEIPATFYLPFGHGVYNCIGKNFALLEIKTFLVKALLQFEFSVDPKHANYTKVIWLTAKT 290 TEPLSIRVKPIVD* AGGCGAAATAC CAGCAACATTCTATCTTCCTTTTGGGCATGGTGTTTATAACTGTATTGGAAAGAATTTTG CTTTGCTTGAAATCAAAACATTTTTGGTCAAAGCGTTGTTGCAATTCGAATTTTCTGTTG ACCCTAAGCATGCAAATTATACAAAGGTTATTTGGCTAACTGCAAAAACAACTGAACCAT TGTCAATCAGAGTAAAGCCTATTGTAGATTGA >1097206250175 1095899118096 GEVPPFSFLTFGRSNYNCIGKNFVLLDIKAFLVKALLQFKFSVDP 360 MHLNYKKPISITNKAVDPLWIRVKTI* AGGTGAAGTACCGCCATTTTCCTTTCTAACAT TTGGGCGAAGTAATTATAATTGTATTGGAAAGAATTTTGTTCTGCTTGACATCAAAGCAT TCTTGGTCAAAGCGTTATTGCAGTTTAAATTTTCAGTAGACCCTATGCATTTGAATTATA AGAAGCCGATTTCTATTACTAATAAAGCCGTTGATCCCTTATGGATTAGAGTAAAGACTA TATAA >1096123749751 boundary is not right, no other exact matches GETPASLYLPFGHGVYKVIGKNFSLLEIKTLSVKALLQLEKVVDPKNINYSKVIWLTSRT 211 VEPLFIRVKLIVD* GGTGAAACACCAGCATCGCTTTATCT TCCTTTTGGACACGGTGTTTATAAAGTCATTGGAAAGAATTTTTCTTTGCTTGAAATTAA AACATTGTCGGTCAAAGCATTGTTGCAATTAGAAAAGGTTGTCGATCCTAAGAATATAAA TTATTCAAAGGTTATTTGGTTAACTTCGAGAACAGTTGAACCATTGTTTATAAGAGTAAA GCTTATTGTAGATTAA >gnl|ti|654999901 1095901768752 87% to 1095901729505 1095901905311 mate pair links to 1095901795880 exon 5 1097331953492 1097664070304 1096761841205 1097675516783 1096602049536 1096761821875 (0) LINKRKKEIEDGIETGEKDFLTIVLKDQQKEGSKMTNDLIRNNLVTLLIAGHETTSVAMQWCLYILGT (0) AGCTTATCAACAAACGTAAAAAAGAAATAGAAGATGGAATAGAAACTGGTGAAAAAGATTTTTTAACA ATTGTTTTAAAAGATCAACAAAAAGAGGGCAGCAAGATGACAAATGATTTGATTAGAAAT AATCTAGTAACACTTTTAATTGCTGGTCATGAAACAACTTCTGTAGCAATGCAATGGTGC TTATACATTCTTGGCACAGT 1095901795880 1097491021716 1096123686039 1097672412446 (0) NSDVQNKLREDIKKNVFDIKSITCEEVLSIKYLDCVVKEVLRLHPPVSFIGRINTR QTNFGEYNVPAGSYLRVPINSAHMNESVYPDPYSFKPERFLT (1) AGAATTCAGATGTTCAAAACAAGCTACGAGAAGACATAAAGAAAAATGTCTTTGATATAA AAAGTATTACGTGTGAAGAAGTTTTAAGTATTAAGTATTTAGATTGTGTAGTTAAAGAAG TGTTGCGCTTGCATCCGCCTGTATCATTTATAGGTAGAATCAACACTAGACAAACAAACT TTGGTGAATATAATGTACCTGCTGGCTCTTATCTACGAGT >1097206350025 1097675534489 1096602217388 all with frameshift LIDKRKKEIEDGIATDEKDLLTIALKDQQKENSKMTX NLIRDNLMTFLIAAHETTSTGMQWCLYMLGT (0) AGCTTATCGACAAACG AAAAAAAGAAATAGAAGACGGAATAGCAACTGATGAAAAAGATTTATTAACAATCGCTTT AAAAGATCAGCAAAAAGAAAACAGCAAGATGACCTTNAATTTAATTAGAGATAATCTAATG ACATTTTTAATTGCTGCTCATGAAACAACTTCTACGGGAATGCAATGGTGTTTGTATATG CTTGGCACAGT >1097331459342 framshift and short 2 aa (pseudogene?) LIDKRKKEIEDGIATDEKDLLTIALKDQQKENSKMT NLIRDNLMTFLIAAHETTSTGMQWCLYML AGCTTATCGACAAACGAAAAAAAGAAATAGAAGACGGAATAGCAACTGATGAA AAAGATTTATTAACAATCGCTTTAAAAGATCAGCAAAAAGAAAACAGCAAGATGACCTTA ATTTAATTAGAGATAATCTAATGACATTTTTAATTGCTGCTCATGAAACAACTTCTACGG GAATGCAATGGTGTTTGTATATGCTG >1097263640455 new exon 5 (0) NLDVQEKLREGIKKNVSDIKNISYEEVLSNKYLDCVVKEALRIHPPRS AGAATTTAGACGTTCAAGAAAAACTAAGAGAAGGGATAAAGAAGAATGTA TCTGATATAAAGAATATTTCATATGAAGAGGTTTTAAGTAACAAGTACTTAGATTGTGTA GTTAAAGAAGCATTGCGCATCCATCCACCGCGCTCCAGCTA >1096526374787 no 100% matches to this seq, best match is 1095901795880 but intron boundaries do not match this may be a poor quality sequence or pseudogene (1)EKVINIKYLDCVVKEVLRLHPPVLFIGRINTRQTNLGKYIETAGSNQRVPINNAHMNESVYPDPYSFMPKRLLT (1) AGAAAAAGTTATAAATATTAAGTATTTAGATTGTGTAGTTAAAGAAGTGTTGCGCTTGCA TCCGCCTGTATTATTTATAGGTAGAATCAACACTAGACAAACAAACTTAGGTAAATATAT AGAAACTGCTGGCTCTAATCAACGAGTTCCTATTAACAATGCTCATATGAATGAGTCTGT TTATCCTGATCCTTATTCATTTATGCCAAAGAGGTTGCTGACAGGT >CN567598.1 tag12b09.x1 Hydra EST -Kiel 1 Hy.. LLKRIFHPLRFLPSPKEQLITGHINHFQGRDHSSTYLSFNEKFKEESLCTLDTLH >Combined N and C-terms CN567799 CN567598 tag12b09.x1 [gene 4] N-term N-term has an extension 1096761916754 probably 1095964418219 (poor quality seq) RVRFLLRYLLKRIFHPLRFLPSPKEQLITGHINHFQGRDHSSTYLSFNEKFKEESLCTLDTLH (1) VPRFVYLIAPEFIKKIFADGKLFQRSKSIRTLAPLIGNSMVGSNYEHHHWQRKLFNG AFTSQQLKNYFPAFLKHTNLLMK (0) LWSYTCDKESGTNLTVLDDLSNLSF CN567598 part of three exons 1 cacgcgtccg atttttactg cgttatctat taaagcgcat ctttcaccct cttcgatttt 61 taccatcacc aaaagaacaa ctcattactg gtcatattaa tcactttcaa ggccgcgacc 121 attctagcac ctatttgagt ttcaacgaaa agtttaaaga agaaagttta tgcacgctag 181 atacattaca t gtgcccagg tttgtttatc taattgctcc agagtttatt aaaaagatat 241 ttgcagatgg aaaacttttt caaaggtcaa aatcaataag aactttggcc cctttaattg 301 gaaacagcat ggttggttca aattacgaac accatcattg gcaaagaaag ttattcaatg 361 gagctttcac ttcacaacaa ctgaaaaatt attttccagc atttttaaaa catactaatt 421 tgcttatgaa g ctttggtca tatacatgtg ataaagaaag tgggacaaat ttaactgttt 481 tggatgattt gtctaatctg tcatttg DIVGDVGFGYHFNTITSHSGNEVTKAFQKY CQLRHSLHPFYKALFAYFPFLMRLSFMFGKHKKAEQVISYTXXX (0) AGCTTTGGTCATATACATGCGATAAAGAAAGTGGTACCAACATAATTGTTTT GGATGATTTGTCTAATCTATCATTTGATATAGTTGGTGATGTTGGTTTCGGCTATCATTT TAACACCATAACTTCTCATTCCGGTAATGAAGTTACAAAAGCCTTCCAAAAGTATTGTCA ACTACGACATAGCTTGCATCCCTTTTATAAAGCTTTATTTGCTTATTTTCCATTTTTAAT GCGTCTATCATTCATGTTTGGAAAACATAAAAAAGCTGAGCAAGTTATAAGTTATACT >1096081231152 new exon 3 IWSYTCDKENGTKIIVLDDLSNLSLDIIGDVGYGYQFNTLTSHSGNEFTKAFQSYCQLQY 135 NIKPIYKALSAFFPFLMGLSIMFGKRKKTEEILRNNLNM AGATTTGGTCATATACATGTGATAAAGAAAATGGTACCAAAATAATTGTTTT AGATGACTTGTCTAATTTATCACTTGATATAATTGGTGATGTTGGTTATGGCTATCAATT TAACACCTTAACTTCTCATTCTGGTAATGAATTTACAAAGGCTTTTCAAAGTTATTGTCA ACTACAATATAACATAAAGCCAATCTATAAAGCTCTATCAGCTTTTTTTCCTTTCCTAAT GGGGCTGTCAATCATGTTTGGAAAACGAAAGAAAACAGAGGAAATTTTACGTAATAATCT AAACATGGT >1095898814465 new exon 2 mate pair to 1095899110069 1095993318769 1095963238168 1096704141526 1095899349182 1097664110168 1097622218011 (1) VPRYVYLIAPEFIKKIFADGKLFQRTTSIRIMAPSIGNSMLSSNYEDHHWQRKLFNGAFT 471 SQQLKNYFPSFLTHTNLLMK (0) AGTGCCCAGGTATGTTTATCTAATTGCTCCAGAATTTATTAAAAAAATATTTGCTGATGGCAAACTTTTT CAAAGAACTACTTCAATTAGAATTATGGCACCTTCAATTGGAAACAGCATGCTTAGTTCA AATTACGAAGACCATCATTGGCAAAGAAAATTATTCAATGGAGCATTCACTTCACAACAG CTAAAAAACTATTTTCCTTCATTTTTAACGCATACTAATTTACTGATGAAAGT 1097567103129 1097675494277 1095899110069 (0) IWSYTCDKESGTNLTVLDDLSNLSFDIIGDVGFGYQFNTITSHSRNEFTSAIRYLAEIQL 657 NASVFLKVLISYFPFLIQLLVMFGKRRKFIQIVRKTLNK (0) AGATTTGGTCTTATACATGTGATAAAGAAAGTGGGACAAACT TAACTGTTTTGGATGATTTGTCTAATCTGTCATTTGATATAATCGGTGATGTTGGTTTTG GTTACCAATTTAACACAATTACATCTCATTCTCGTAATGAATTTACTTCAGCTATTCGGT ATTTGGCTGAAATTCAACTCAATGCTAGTGTGTTCTTAAAAGTTTTAATAAGTTATTTTC CATTTTTAATTCAATTGTTGGTAATGTTTGGGAAGCGTAGAAAATTTATACAGATTGTCC GTAAAACATTGAACAAGGT 39% to 3A27 trout aa 307-472 58% to CN770283 [gene 4] 683 FFIAGYETISTTLTLCLYMLAINLEVQEKLREEIQKNKLDVNNISFEEVTSLKYLDCVVK 504 503 ETLRLHGLAPVLGRETINAIKFGEYEIPANTVLQTHVSNLHMNETIYRDPHSFKPERFMT 324 323 GEIPASFYLPFGHGVYNCIGKNFALLEIKTFLVKALLQFKFSIDPMHINYTK 168 167 IIWLTMRTVEPLLIRVKPIAE* 102 TTTTTCATAGCTGGTTATGAAACAATTTCTACTACTTTGACTTTGTGTTTATATATGCTA GCCATTAACTTAGAGGTTCAAGAGAAACTTAGAGAAGAGATTCAGAAAAATAAATTGGAT GTAAATAATATTTCTTTTGAAGAAGTTACGAGTTTAAAATATTTGGATTGTGTCGTTAAA GAAACCTTGCGCTTGCATGGACTTGCACCAGTTTTAGGCAGAGAGACCATTAATGCAATA AAATTTGGCGAATATGAAATTCCTGCAAACACAGTACTTCAAACTCATGTTAGCAATCTA CACATGAATGAGACTATTTATCGAGATCCTCATTCATTTAAACCTGAAAGGTTTATGACA GGGGAAATACCAGCATCATTCTATCTTCCTTTTGGGCACGGTGTTTATAACTGTATTGGA AAGAACTTTGCTTTGCTTGAAATTAAAACATTCTTGGTCAAAGCGTTGTTGCAATTCAAA TTTTCTATTGACCCTATGCATATAAATTATACAAAGATTATTTGGTTAACTATGAGAACA GTTGAACCATTGCTAATTAGAGTAAAACCTATTGCAGAATAA >gnl|ti|648014530 1095896049543 41% to CYP21 LKYLDCVVKETLRLHGXXXXXXXXXXXXX KFGEYEVPANTILRTHVSSIHMNETIYPDPHSFKHERFMTG GTTTAAAATATTTGGATTGTGTCGTAAAG GAAACCTTGCGCTTACATGGA AAATTTGGTGAATATGAAGTCCCTGCAAATACAATCC TACGAACTCATGTTAGCAGTATACACATGAATGAAACTATTTATCCAGATCCTCATTCAT TTAAACATGAAAGGTTTATGACAGGT >1096082202706 probably the same as 1095896049543 which has errors 1095994179331 (0) NIEVQEKLREDIQKNILDVNNISFEEVMSLKYLDCVVKETLRLHGPAPLLGRRTISATKF GEYEVPANTILRTHVSSIHMNETIYPDPHSFKPERFMT (1) AGAACATAGAAGTTCAAGAGAAACTTAGAGAAGATATCCAGAA AAACATATTGGATGTAAATAATATTTCTTTTGAGGAAGTTATGAGTTTAAAATATTTGGA TTGTGTCGTTAAAGAAACCTTGCGCTTACATGGACCTGCACCACTTTTAGGCAGAAGGAC CATTAGTGCAACAAAATTTGGTGAATATGAAGTTCCTGCAAATACAATCCTACGAACTCA TGTTAGCAGTATACACATGAATGAAACTATTTATCCAGATCCTCATTCATTTAAACCTGA AAGGTTTATGACAGGT >1096526207508 with frameshifts probably same seq as 1096526100337 mate pair = 1096520314506 same as 1095898850029 (0) NLDVQNKLREEIKKNVFDIKSILR EEVLSIKYL DCVVKETLRMHPPASFISRKNKTETKLGDYDLPAGTFLRISINNVHMNESVLSWIPYLFKPER AGAATTTAGATGTTCAAAACAAACTAAGAGAAGAAATAAAGAAAAATGTCTTTGATATAAAAAGTATT TTACGG GAAGAAGTTTTAAGCATCAAGTACTT GATTGTGTAGTTAAAGAGACATT ACGCATGCATCCACCTGCGTCATTTATATCTCGAAAAAACAAAACTGAAACAAAGTTGGG TGATTATGATCTACCTGCTGGCACGTTTTTAAGAATTTCAATTAACAACGTACATATGAA TGAGTCTGTTTTATCCTGGATCCCTTATTTATTTAAGCCCGAACGAA >BP514308 N-term 25% to 46a [gene 9] 1096761991009 1096082187152 1096123591182 1095899045709 1097383004013 MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFGKEFKDYGLYTINTLI (1) ATGTATTCGATATACATAGCGATTATAATAGTTC CTTTAGTTTTTTTCGTTGCTGTTTTTTTTAAACGTTTTTATCATCAATTTCGTTTGTTGC CATCACCCAAAGAGAGTTTAATTACATGTCACTATAGTTATTTTGATGTTCATGACCATG TTAACACTCTGTTAAACTTTGGTAAAGAGTTTAAAGATTACGGATTATATACAATTAATA CTTTAATTGGT 1096124094276 1095964247544 1095901005745 1095899045709 GPRQVHLLLPHFIKTVIADGKFFQRSPVFKAVFPLVGNSMIVSNYEDHHWQRKLF NQAFTSQQLKRYFLAFTLHTDLLMK (0) AGGACCCAGACAAGTTCATCTTTTATTGC CACATTTCATTAAAACAGTAATTGCAGATGGAAAGTTTTTTCAAAGATCACCAGTTTTTA AAGCCGTATTTCCTCTTGTTGGAAACAGTATGATCGTTTCTAATTATGAAGATCATCATT GGCAAAGAAAATTATTTAATCAAGCCTTTACTTCGCAACAATTAAAAAGATATTTTTTAG CTTTTACTCTGCATACTGATTTGCTAATGAAGGT LWSCTCDKENGTNLNVWSDLSNLSFDIIGDVGFGYQFNTITSHSGNAFTKALRSY INLRFNSSVVHNVLIAYFPFLMRFLSKFGNLNKAEQ (0) 1095958061820 82% to 1095901729505 1095964290917 1096124019775 1096528662475 1096159548758 1097622041589, 1097672472615 1096064134288 mate pair = 1096041094868 exon 3, (0) LIDKRKKEIENGLVKEEKDFLSIVLKDQQQEKSKLTNDLIRDNLMTLLIAGHETTSTAMLWCLYTLGT (0) AGCTTATCGATAAGCGTAAAAAAGAAATAGAAAATGGATTAGTAAAAGAAGA GAAAGATTTTTTATCAATTGTTTTAAAAGATCAACAACAAGAAAAGAGCAAACTGACAAA TGATTTGATTAGAGATAATTTAATGACGCTTTTAATTGCTGGTCATGAAACTACTTCTAC TGCAATGCTGTGGTGTTTATACACATTAGGAACAGT run into seq gap downstream >gnl|ti|655009968 1095963046224 KYG region 46% to CYP20 35% to 27B1 (2) LGNLGSLTFDGGIHKFLVENHKRLGPMFSFYWGKELAVSLACPILFKEVATLFNRP (1) AGATTGGGAAATCTTGGCTCTCTAACATTTGA TGGTGGAATTCACAAGTTTCTTGTTGAAAACCATAAAAGGCTTGGTCCAATGTTCAGCTT TTATTGGGGCAAAGAACTGGCTGTTAGTCTAGCTTGTCCAATTCTTTTTAAGGAGGTT GCCACTCTATTTAATCGACCAGGT >1097263613070 mate pair to 1097206643989 I-helix/J helix boundary? 1097329235455 1097672289528 1096705876537 1096110072452 LTWLVYFLCKHPEVESKVYNEIKEFTEKDLDMELLTKFS (2) AGTATTGACATGGCTTGTTTATTTCTTATGTAAACATCCAGAAGTGGAATCTAAGGTATACA ATGAGATAAAAGAATTTACAGAAAAAGATCTAGATATGGAATTACTTACAAAATTTAGGT >BP508840 BP508840 Best match in Fugu, human and Ciona is CYP20 1096602177777 1097264059772 1096123966865 1097672368420 1097206643989 1096110028119 (2) YTKQVIDEVMRIAVLAPYAARYSDYDIIVDGHLIPKK (0) (0) TPIILALGTVFQDETIFPEPDR (2) (2) FDPDRFSDKQIEERSALAFQPFGFAGKRKCPGYRLAYAETLTYTFYIIKNFHISL FDKQSVKMHYGFVTKPSEEIWIKVLRRKNI* 1096111030955 AGTTACACAAAGCAAGTTATTGATGAAGTTATGAGAATTGCTGTACTTGCTCCATATG CAGCTAGATATTCGGATTATGATATAATCGTTGATGGACATCTAATACCAAAAAAGGT AGATTTGACCCTGACCGTTTTAGTGATAAACAAATTGAAGAACG TTCAGCGTTAGCTTTTCAGCCGTTTGGATTTGCGGGAAAAAGAAAGTGTCCTGGATATAG ATTAGCATATGCTGAAACATTGACGTACACATTTTATATCATCAAGAATTTTCATATTTC GCTATTCGATAAGCAATCTGTGAAAATGCATTATGGTTTTGTCACAAAACCGTCTGAAGA AATTTGGATTAAAGTGTTACGACGTAAAAATATCTAG >CYP20 amphioxus 39% to CYP20 Danio MLDYAIFAITFVVFLIATVLYLYP (0) (0) GANKITTIPGLEPSDPK (2) (2) DGNLGDVGRAGSLHEFLLKLHTEYGDIASFWWGQQLVVSLGAPELWKQH ERIFDRP (1) (1) ALLFKGFEPLIGAKSIQYANSVDGRTRRKLYDPSYGHNAMKHYYSIFQE (0) (0) LGQEMAKKWESMKGDQHIPLHAHIIALAMKAITRSSFGDAFKDEKECVQFGRNYDI (0) (0) CWNDMEERIKGSHPTEGSPREKKFKE (1) (1) ALGKLHATIARVAKYRRENPSPPQEQLFIDVLIEGNLPEEQ (0) (0) VLCDAMTFTVGGIHTSGN (1) (1) LLTWALYYIATHEEVEEKLHQELSDVLGKKGEVTPDNISQLV (2) (2) YLRQVLDESLRCAVIAPWGARYMDLDAEVGGHIVPAK (0) (0) QTPVIHAFGVVLQDERIWPEPNK (2) FDPDRFDAENSKGRHKLAFQPFGFAGGRKCP (1) (1) GYRFTYTWTSVFLSILCRQFKLHLVDGQVVKPCHGLVTRPVDEIWITVTKRD* 1096111030955 ATCTCCATCAGTTAATTGCATAGATGCGAGAATGCATGTTAAGTGCATGTAACTCTGAATATTAGCGACAAAGTTTAGTATCACTATCTATGATAGTATTTTTAGTATTTATATGATTCATATTTTTCAGCATAACTCTAATAATAAATATTAATCTAAATTTTAATGCTTTTTTTTTTAAATGATTGAATATTTTTAAAACATGTATTAAATAGTTTATTAACTAAATTTTTAAAGTATTTTTAAGTACTAATAAATTTAAAAAATAAAAAAAGATGTTTATGTTTCAAAGTTTATATTTCAGTTACACAAAGCAAGTTATTGATGAAGTTATGAGAATTGCTGTACTTGCTCCATATGCAGCTAGATATTCGGATTATGATATAATCGTTGATGGACATCTAATACCAAAAAAGGTTTATACTAATATTA >gnl|ti|646862798 1095898098005 41 0.018 35% to 17A1 34% to 2P4 gnl|ti|647168675 1095899196297 1097622027233 1096703988838 1097329365089 1097329154279 42% to 1095898227332 MLVFQQLIFAVLVPAFLYFVFSYLQHLWICSKYPKGLFPLPLLGNIH QLGKNSSQTFSSLTKIYGDIFSVSIGTQRLVILNSMESIHEALLTKGSTFGGRPTEF TSNVFTKGYKNLSHTDYGPNLKALRKVIHLSVQKYAGGLTRQEQMITFERDELCKKLFN TEKEIALRCEI (1) ATGCTTGTTTTTCAACAATTAATATTCGCCG TACTTGTTCCGGCTTTTTTATATTTTGTTTTTTCTTATTTGCAACATTTATGGATTTGTA GTAAGTACCCAAAGGGTCTGTTTCCATTACCGTTGTTAGGAAACATTCATCAATTAGGTA AAAACTCTTCTCAAACATTTTCATCTTTAACAAAAATTTATGGAGATATATTTAGTGTGA GTATTGGTACCCAGCGACTCGTTATACTCAATAGTATGGAAAGCATACATGAAGCTTTGTTAACCAAAGGTTCAACTT TTGGTGGTAGACCAACTGAgGTTACGTCAAATGTTTTTACAAAAGGATATAAAAACTTATC GCACACTGATTATGGACCGAATTTAAAAGCGTTGCGAAAAGTTATTCATCTTTCCGTTCA AAAATATGCTGGCGGACTAACGAGACAAGAACAGATGATAACTTTTGAAAGAGACGAACT TTGTAAAAAACTTTTTAATACTGAAAAGGAAATAGCTTTACGTTGTGAAATTGGT (1) DFCTVNVMSGYLFNERFLNQNSEFKDVVKSIQLLLDNSGITDKTTFIHWLRYLPLREWN 793 EIKQARLVLNPWVEKKVEDHWRKYNENEIINVTDSMIQHFLTKYDGLDTDFAKKYITLL 617 616 LIELLVAGTETTAITICWMVLYLIHNPEYQEEIYKEITLNIGCRYPTLAEKNLFPLL 446 445 QAFIQETLRITSVVPLNLAHKALKDTSICGKIIPKDAIVITNLWNLHHDNRYFKNPNEFD 266 265 PKRWINENGLFDSISQKYFKPFSAGARVCLGETLAKNQLFLIISGLIMNFIFTSAPGKD 89 88 LPSLEGQFGITFRPNSFKVL* 29 AGATTTTTGCACTGTAAATGTAATGTCGGGGTATTTATTCAATGAACGCTTTCTGAACCAAAATTCCG AGTTTAAAGATGTCGTAAAAAGTATTCAACTTTTGCTAGATAACTCTGGAATTACAGATA AAACCACGTTCATACATTGGCTTCGTTACTTGCCATTGCGGGAATGGAATGAAATAAAAC AAGCGAGACTTGTCTTAAACCCGTGGGTCGAAAAAAAGGTTGAAGATCATTGGAGAAAGT ATAATGAAAATGAAATCATTAATGTAACTGATAGCATGATTCAACATTTTTTAACAAAGT ACGATGGTTTAGACACTGATTTTGCAAAGAAATACATTACCTTATTATTGATCGAATTAC TTGTTGCCGGTACCGAAACGACAGCTATTACTATTTGCTGGATGGTTTTATATCTAATAC ATAACCCTGAGTATCAAGAAGAAATTTATAAAGAAATTACATTAAATATTGGTTGTAGAT TGCGAATAACATCTGTTGTGCCACTAAACTTGGCTCACAAAGCATTAAAAGATACCAGCA TTTGTGGAAAAATTATTCCTAAAGACGCTATAGTAATTACAAATTTATGGAATCTTCATC ACGACAACAGATACTTTAAAAATCCTAATGAATTTGATCCTAAACGCTGGATAAACGAAA ATGGTCTATTTGACTCAATTTCTCAAAAATATTTTAAACCTTTTTCGGCTGGAGCGAGAG TATGTCTTGGCGAGACATTAGCCAAAAATCAACTTTTTTTAATCATCTCCGGTCTAATTA TGAATTTTATTTTCACATCTGCACCAGGAAAAGACTTACCTAGTCTTGAAGGACAATTTG GAATCACATTCCGTCCCAATAGTTTTAAGGTTTTATAA >gnl|ti|651477674 1095901303788 39 0.11 39% to CYP21 39% to 2R1 40% to 2P4 49% to 1095898227332 1096703646566 mate pair = 1096703498438 = N-terminal exon 1097675091467 MFLFVVFEVVFGLIIPVLLYVI VVYIYHIWECQRYPPGPFPLPVIGNYNLLANDPVKALCDLEIIYGDVFSLSLGTVR VVVVSSHESIYDVLVGDGSNFSGRPREYSSLLFTGGFENLSHMDNNPLTKKIRKVFYSKL KTNGSILAHNENIVKHESELLHQRLLQNEGSVTNLRYEI (1) (1) DLCIVNSICSIIFGNRLSDTCEVHEILKATRLLLKNLSNIEIMHYLPWM RFFLLKKQNEISESRNICKFWIQTQLHKRKKSLKNENISDILLNLWDQQKQENP NEEQYRMILVELVMAGSETTAATITWLIFYLLHWPHYQSILYKEIKNVCGDQYPTFNDIKSMPIMQATILE TLRLSSVVPLSLSHKAVNNAKINKFTIPKDTIIITNLWGVHHNEKYWEKPFEFNPMRWLD KNGELSTAKRLGYFPFSAGPRGCIGESFARMQMFIICSRLIKDFSFELPQSGETPKLDGD IGITLTPLPYNAVAKQRT* ATGTTTCTTTTTGTAGTTTTTGAAGTTGTATTTGGGCTGATAATTCCCGTTTTACTTTACGTAATAGTT GTTTATATTTATCATATTTGGGAATGTCAAAGATACCCACCAGGT CCATTTCCTCTTCCGGTAATTGGAAACTACAACTTGTTAGCAAATGATCCTGTGAAGGCA TTGTGCGATCTAGAAATTATTTACGGAGATGTTTTCAGTTTAAGTTTAGGAACCGTTCGG GTGGTTGTTGTAAGCAGCCACGAGAGTATTTACGATGTTTTAGTTGGAGATGGATCAAAT TTTTCCGGAAGACCCAGGGAGTATTCATCTTTACTTTTTACTGGAGGTTTTGAAAACCTT TCCCATATGGATAACAACCCGTTGACTAAAAAAATCAGAAAAGTTTTTTATTCAAAACTT AAAACAAACGGAAGTATTTTAGCACACAATGAAAATATTGTCAAACATGAAAGTGAACTT TTACATCAAAGACTACTGCAAAACGAAGGAAGCGTCACCAATCTTCGTTATGAAATCGGT AGATCTTTGTATTGTTAACAGCA TATGCAGTATTATTTTTGGTAACCGGCTTAGTGATACTTGTGAAGTTCATGAAATTTTAA AAGCGACCAGGTTACTTCTAAAAAACTTGTCAAACATTGAAATTATGCATTATTTACCAT GGATGAGATTTTTTTTATTAAAAAAGCAAAACGAAATCAGCGAATCTAGAAACATTTGCA AATTTTGGATTCAAACCCAGTTGCATAAACGAAAAAAAAGTTTAAAAAACGAAAATATCT CAGATATTCTTTTGAACCTTTGGGACCAACAAAAACAAGAAAACCCTAATGAGGAACAAT ACAGAATGATTTTAGTTGAGTTAGTTATGGCTGGTTCCGAAACAACAGCCGCAACGATAAC TTGGCTAATCTTTTATCTTTTGCATTGGCCTCACTATCAAAGCATTCTTTACAAAGAAAT CAAAAATGTTTGTGGTGATCAGTACCCTACGTTTAATGATATTAAATCAATGCCTATAAT GCAAGCAACTATACTTGAAACTTTAAGGTTGTCTTCTGTCGTTCCTTTAAGCTTATCTCA CAAAGCCGTAAATAACGCGAAAATTAATAAATTCACAATCCCTAAAGATACAATAATAAT AACAAATTTATGGGGCGTACATCATAATGAAAAATACTGGGAAAAACCGTTTGAATTCAA TCCTATGCGTTGGCTTGATAAAAATGGCGAACTTTCAACAGCAAAGCGTTTAGGATATTT CCCTTTTTCAGCCGGCCCAAGAGGTTGCATTGGTGAGTCATTTGCAAGAATGCAAATGTT TATTATATGCTCTCGACTGATAAAAGATTTCTCCTTTGAGTTGCCTCAAAGCGGAGAAAC CCCAAAACTAGATGGTGATATTGGAATTACACTAACGCCCCTTCCTTATAATGCAGTAGC TAAACAGCGAACCTAA >gnl|ti|654998190 1095901734433 33% to CYP21 33% to CYP17 33% to 2U1 gnl|ti|651148169 1095901003210 gnl|ti|651162328 1095901096755 74% to 1095898227332 possible pseudogene 870 FQDIIKTHNET 837 SYISSIPWLRYFPTATSRNMIILNKNKYNIFEIIRLRDPILIRKLQEHKRTYD*SNLGDV 658 657 TDALIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSNMILWFIVYILHRPEYQDKLYDEIFKVFSG IGYPSLNDRPRFHLIQAIIHETLRLLSVAPLGLCHKALENGSICGKFVPKG (frameshift) LLILTNLWSIHHDERYWKNAINFFPERWLDNSGNFNYNLGYAYLPFS 147 146 GGPRSCLGETLAKTELFVFISRLVKDYRFEKPNGKDSLDGRSGVTCLPYEFEIVMIPRS* >gnl|ti|655009845 1095963045220 near C-helix region poor match gnl|ti|648592188 1095595897239 NOT A P450, MATCHED FISH AND DROSOPHILA SEQUENCES 637 STSNCVTMNQSDYLQNVLQKFGFDNCKQRSSPCKQVPYSYHNQESIKSNGSMIKL 473 472 YRQMVGSLLYAMTCTRPDLSYVITKLSQHLSKPNSGDWIMIKHVFRYIKHTLN 314 313 YCLTFRK 293 >gnl|ti|648047811 1095899057643 I-helix 4 aa diffs to 1095898198167 (0) LIEKRKKEIDDGISTKEKDIITIVLKDQQQESSKLTNDLIRDNLLLFLIAGHKTTSTTMTWCLYILGT (0) AGCTTATTGAAAAACGTAAAAAAGAAATCGACGATGGAATATCAACAAAAGAGA AGGATATTATCACAATTGTCTTAAAAGATCAACAGCAAGAAAGCAGCAAACTAACAAATG ATTTGATTAGAGATAATTTATTATTATTTCTCATAGCTGGTCATAAAACAACTTCTACTA CTATGACTTGGTGTTTATATATACTAGGCACTGT CYP1 like (only one seq) 1095899272864 5 aa diffs in N-term overlap region MWYEIICGLIISILLYIIGSYLMHLLECRKYPLGPFPIPIFGNLHLLGTEPHKILAAYS KKYGAVFSISLGLQRIVIISDITTTREALVQKASIFAGRPKSYLIQLISSGYKGIAFMDY GSFWKVLRKVSHSSLKIYGEGHERFEKILTKESEELHKRLLKKSNNSVELKSEF (1) >gnl|ti|648485307 1095899272864 57% to 1095897342515 ATGTGGTATGAAAT TATCTGCGGACTGATCATTTCGATTTTGCTATATATTATTGGTTCTTACTTGATGCACTT GCTGGAATGCAGGAAGTATCCTCTTGGACCTTTTCCAATACCAATCTTTGGTAACTTGCA TTTATTAGGAACAGAGCCACATAAAATACTTGCTGCATACTCAAAAAAGTATGGAGCAGT CTTTAGCATAAGTTTAGGATTGCAAAGAATTGTTATAATTTCTGACATTACTACAACTAG AGAAGCACTAGTTCAAAAAGCATCCATATTTGCAGGTAGACCAAAATCTTATTTAATTCA ATTAATTTCAAGTGGGTACAAAGGCATTGCATTTATGGACTATGGTTCCTTCTGGAAAGT TTTGCGTAAAGTTAGTCATTCTTCATTAAAAATATATGGAGAAGGACATGAACGTTTTGA AAAGATACTTACAAAAGAAAGTGAAGAGCTACATAAAAGACTTTTAAAGAAATCAAATAA TTCCGTAGAGCTGAAATCTGAATTTGGT >CN775634 tae83e09.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to SW:CP11_OPSTA Q92095 CYTOCHROME P450 1A1 AGAGAGTGAAGAGCTACATAAAAGACTTTTAATGAAATCAAAAACTTCCGTAGATCTGAAAACTGAATTT GGTGCTGCAATTATAAACGTGATTTGTTTCATTGTTTTTGGGGAAAGATACCAGTATTCAAATTCAGAAT TTAAAGAAGTTCTTACAACAATAAACAATATAGTCGATGGGTTGTCAAATACAACTGCTGTCGGTTTTTT GCCGTGGTTGAGATTTTTACCGTTTTCTCCAATAAAAAAACTGAGTATTTCACTTTCAAAATATATTCGT TTTTTAAACGATAAGTTGACCAAACATAAGGAAACATTTAATGAAAACAAAATTCGAGATTCTACTGATT CTATTATAAAC 32% to CYP1C1 aa 173-297 2 ESEELHKRLLMKSKTSVDLKTEF (1) GAAIINVICFIVFGERYQYSNSEFKEVLTTINNIV 175 176 DGLSNTTAVGFLPWLRFLPFSPIKKLSISLSKYIRFLNDKLTKHKETFNENKIRDS 343 344 TDSIIN 361 opposite end of clone = >CN774619 tae83e09.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to SW:CPT7_CHICK P12394 CYTOCHROME P450 17 TTTAAACTACATAAGATTGTTTACTCTTGGAATCAAGTATGCGCAAAATTGTTTTGGAGCATGAGTAATA TTGCATTTTCCAATTAAACTTGGAGGTGGACAACCAGGTGTGCACTCAAACTTAAAATCTCGAATTAATC TAGAAAAGAAGAAAAATAGTTCATTTTCAGCAACTGTTTTCCCTAAACAAACTCTTGTACCGGCTGAAAA AGGTAAGAAACTTGTTGCTTTACTTGGATCAAATTTCTTATCTTTGCCAATCCGCCGATAAGGAATTAAA TTCATTAGGATTTTCCCAGGAGTTAGTAACGTTATGAATCTGCCAATGATTAATCATGACAGTAGCAATT TTCAGGAATGCTATGTGACATGAGAACTCCCCCCCAGAGGT 38% to CYP17A2 Fugu aa 383-485 391 TSGGEFSCHIAFLKIATVMINHWQIHNVTNSWENPNEFN 275 273 PYRRIGKDKKFDPSKATSFLPFSAGTRVCL (1) GKTVAENELFFFFSRLIRDFKFECTPGCPP 94 93 PSLIGKCNITHAPKQFCAYLIPRVNNLM* 7 1097672474909 AGGGAAA GTTGCTGAAAATGAACTATTTTTC TTCTTTTCTAGATTAATTCGAGATTTTAAGTTTG AGTGCACACCT GGTTGTCCACCTCCAAGTTTAGTTGGAAAATGCAATATTACTCATGCT CCAAAACAATTTTGCGCATACTTGATTCCAAGAATAAACAATCTTATGTAG >1096526199166 frame3_ORF1 7aa diffs to CN774619 may be same gene (1) GAAIINVICFIVFGERYQYSDSEFKEVLTTINDIVDGLSNTTAVGFLPWLRFLPFSPIKKLSIS LSKYVRFLNDKLKKHKETFDEKKIRDFTDSIINFSNNEAVKQKFKNVDEHLEPVIGDLFI TGSETTLTSLLWLILYMMHYPKYQQEIFKEITTVIGEDRYPCLNDRDSLHLVKAALKECL RLSSIVPLGLPHKTTKETVLMGHSIPGNATVMINHWQIHNDTNYWENPNEFNPYRWIGKD KKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFNFECIPGCPPPSLIGKCN ITHAPKQFCAYLTPRINNLM* AGGTGCTGCAATTATAAACGTGATTTGTTTCATTGTTTTTGGGGAAAGATACCAGTATTCAGATTCAGAAT TTAAAGAAGTTCTTACAACAATAAATGATATAGTCGATGGGTTGTCAAATACAACTGCTG TTGGATTTTTGCCGTGGTTGAGATTTTTACCGTTTTCTCCAATAAAAAAACTGAGTATTT CACTTTCAAAATATGTTCGTTTTTTAAACGATAAGTTGAAAAAACATAAGGAAACATTTG ATGAAAAGAAAATTCGAGATTTTACTGATTCTATTATAAATTTTTCTAATAACGAAGCTG TCAAACAAAAATTTAAAAACGTTGATGAACATTTAGAGCCTGTGATTGGGGATTTATTTA TAACGGGTAGTGAGACCACATTAACATCTTTATTGTGGTTAATTCTTTATATGATGCATT ATCCCAAATATCAACAAGAAATTTTTAAAGAAATTACAACGGTTATTGGTGAAGACCGGT ACCCATGTTTAAATGACCGTGATTCTTTGCATCTTGTTAAAGCCGCATTAAAAGAGTGTC TGCGTTTATCTTCAATTGTTCCTCTTGGATTACCACACAAAACAACCAAAGAAACAGTTC TTATGGGACATAGCATTCCTGGGAATGCAACAGTCATGATTAATCATTGGCAGATTCATA ACGATACTAACTACTGGGAAAATCCTAACGAATTTAATCCTTATCGGTGGATTGGTAAAG ATAAGAAATTTGATCCAAGTAAAGCAACAAGTTTTTTACCTTTTTCAGCCGGTACAAGAG TTTGTTTAGGGAAAACAGTTGCTGAAAATGAACTATTTTTCTTCTTTTCTAGATTAATTC GAGATTTTAACTTTGAGTGCATACCTGGTTGTCCACCTCCAAGTTTAATTGGTAAATGCA ATATTACTCATGCTCCAAAACAGTTTTGCGCATACTTGACTCCAAGAATAAACAATCTAATGTAA >whole gene 1095899272864 1096526199166 MWYEIICGLIISILLYIIGSYLMHLLECRKYPLGPFPIPIFGNLHLLGTEPHKILAAYS KKYGAVFSISLGLQRIVIISDITTTREALVQKASIFAGRPKSYLIQLISSGYKGIAFMDY GSFWKVLRKVSHSSLKIYGEGHERFEKILTKESEELHKRLLKKSNNSVELKSEF (1) GAAIINVICFIVFGERYQYSDSEFKEVLTTINDIVDGLSNTTAVGFLPWLRFLPFSPIKKLSIS LSKYVRFLNDKLKKHKETFDEKKIRDFTDSIINFSNNEAVKQKFKNVDEHLEPVIGDLFI TGSETTLTSLLWLILYMMHYPKYQQEIFKEITTVIGEDRYPCLNDRDSLHLVKAALKECL RLSSIVPLGLPHKTTKETVLMGHSIPGNATVMINHWQIHNDTNYWENPNEFNPYRWIGKD KKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFNFECIPGCPPPSLIGKCN ITHAPKQFCAYLTPRINNLM* CYP2 like (2 different sequences) >CN566581 taf98h10.x1 Hydra EST -Kiel 1 Hydra magnipapillata cDNA 3' similar to SW:CPC8_HUMAN P10632 CYTOCHROME CACGCGTCCGCTTTTTAGTCCCTGTGTAATAAGTATTTCTTAGACATAATTTTAAAATGTTTCTTGAAGT TATTGGCGCAGTCTTTATTCCACCTTTGATATGGACTATATGGGTTTACATTAAACATTTAATTGATTGT TTGCATTATCCAAGAGGACCAATACCACTACCATTTATTGGAAATGGTTATTTGATAAGAAAAGCTGAAC CATATAAAGAGTTGGTTAACTTAGGAAAAATATATGGCGATGTTTTTAGTTTTAGCGTTGGTTCAGTCAG ATATGTAATTGTCAACAGTTTAGAAGGAATTCAAGAAGTACTAGTTAAAAAAGGGTGGCAATTTGCTGGT CGTCCAAAAGGTCCAAGTTGGGATAGATCCATTCACGGTCTAATCCAACGTGATCCAAGTAAAAAATTTA AAATATTACGGAAGCTAGCAACATCATCTTTGAAAATCTTTGCTGATGGATTGGCAGGGATGGAAAGTAA AGCTATA 32% to 2X9 aa 26-146 57 MFLEVIGAVFIPPLIWTIWVYIKHLIDCL 144 HYPRGPIPLPFIGNGYLIRKAEPYKELVNLGKIYGDVFSFSVGSVRYVIVNSLEGIQEVL 323 324 VKKGWQFAGRPKGPS WDRSIHGLIQRDPSKKFKILRKLATSSLKIFADGLAGMESKAI 497 EESFQLNKKLLETNGKPF opposite end of clone = >CN566859 taf98h10.y1 Hydra EST -Kiel 1 Hydra magnipapillata cDNA 5' similar to SW:CPT7_SQUAC Q92113 CYTOCHROME P450 17 TAATCATAAATGTGTATTTAAAATAAAAATTAAAACTCTTTTTGATAGACAATAGTTAGCTAATAGAAAT TAGAAAAATTTTTAATTAACACATTTTGCCTAAAATTTAAAGGTTAAGCAAACATTTACTAATCGTTTCA TTTTCAAATTATTTACTATTTATCTAGTATCTAATTTTTCAATCAATATAGTTTATATAAATTTAAAAAA AAAAACATTAGCAAAAT TAAAG CAATAAATTTTTACT CCTTGGAACTATTTCAACTTTAAAGTCATAAGG AGTACAAGTGATGCCAAAACTACCTTCTAAGCGCGGTAATTCTTCTAAAATTGGTTTCACAAATCGGAAA TCATTTATCAATCGTGATATAAAAATAAACAACTCAGTTTTTGCCAATGTTTCTCCAATACAGCTACGAG GTCCACTAGAGAATGGTAAATAAGCGTTTCCTAGTTTAGAATTAAATTCGCCAGAATTTTCTAACCAACG TTCAGGGTAAAAACTCATTGCATTTTTCCAGTAACTTTCGTCATGGTGTATACTCCATAGGTTGGTTATT ATAAGAGCCCCTTT GGGGAATAGGTTTTCCACAAATGCTACCTTTCTCC 44% to 17A1 fugu aa 378-485 607 RKVAFVENLFPKGALIITNLWSIHHDESYWKNAMSFYPERWLENSGEFNSKLGNAYLPFSSGPRSCIGETL 395 394 AKTELFIFISRLINDFRFVKPILEELPRLEGSFGITCTPYDFKVEIVPRSKNLLV* 227 CYP3 like (two different sequences) >CN567799 opposite end = CN567598 tag12b09.x1 tag12b09.y1 Hydra EST -Kiel 1 Hydra magnipapillata cDNA 5' similar to TR:Q9PVE8 Q9PVE8 CYTOCHROME P450 3A30 TTACTTTGTATTTAAAAATCATCAAAAGAAAACCCCAAACAATCATTATTATAAATGTTAGGACTTAAAG TTTTAAATAAAGTTTATTCTTCTGTTGAATATTATTCTGCAATAGGTTTTA CTCTAATTAGCAATGGTTC AACTGTTCTCATAGTTAACCAAATAATCTTTGTATAATTTATATGCATAGGGTCAATAGAAAATTTGAAT TGCAACAACGCTTTGACCAAGAATGTTTTAATTTCAAGCAAAGCAAAGTTCTTTCCAATACAGTTATAAA CACCGTGCCCAAAAGGAAGATAGAATGATGCTGGTATTTCCCCTGTCATAAACCTTTCAGGTTTAAATGA ATGAGGATCTCGATAAATAGTCTCATTCATGTGTAGATTGCTAACATGAGTTTGAAGTACTGTGTTTGCA GGAATTTCATATTCGCCAAATTTTATTGCATTAATGGTCTCTCTGCCTAAAACTGGTGCAAGTCCATGCA AGCGCAAGGTTTCTTTAACGACACAATCCAAATATTTTAAACTCGTAACTTCTTCAAAAGAAATATTATT TACATCCAATTTATTTTTCTGAATCTCTTCTCTAAGTTTCTCTTGAACCTCTAAGTT AATGGCTAGCATA TATAAACACAAAGTCAAAGTAGTAGAAATTGTTTCATAACCAGCTATGAAAAA Combined N and C-terms >CN567598 tag12b09.x1 [gene 4] N-term RVRFLLRYLLKRIFHPLRFLPSPKEQLITGHINHFQGRDHSSTYLSFNEKFKEESLCTLD TLHVPRFVYLIAPEFIKKIFADGKLFQRSKSIRTLAPLIGNSMVGSNYEHHHWQRKLFNG AFTSQQLKNYFPAFLKHTNLLMKLWSYTCDKESGTNLTVLDDLSNLSF 39% to 3A27 trout aa 307-472 58% to CN770283 [gene 4] 683 FFIAGYETISTTLTLCLYMLAI (0) NLEVQEKLREEIQKNKLDVNNISFEEVTSLKYLDCVVK 504 503 ETLRLHGLAPVLGRETINAIKFGEYEIPANTVLQTHVSNLHMNETIYRDPHSFKPERFMT (1) 323 GEIPASFYLPFGHGVYNCIGKNFALLEIKTFLVKALLQFKFSIDPMHINYTK 168 167 IIWLTMRTVEPLLIRVKPIAE* 102 >CN770283 58% to CN567799 tad87b02.y2 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to SW:CP4C_BLADI P29981 CYTOCHROME P450 4C1 AAAGAATGTATTTGATATAAAAAGTGTTTCATATGAAGAAGTTTTGAGTATCAAGTACTTAGATTGTGTT GTTAAAGAAACATTGCGCATGCATACACCTGTAGCATTTATTGGTAGAATAAATAAGAATCAAACAAAGT TTGGTGACTTTGATGTACCTGCTGGCTCTTTTTTACGAATTCCTATTGACAGTGCACATATGAACGAGTC TGTTTATCATGATCCTCATTCATTTAGACCACAACGATTCTTGACAGGTGAAATACCACCATTATCCTTC CTTACATTTGGGCAAGGTACATATAATTGTATCGGAAAGAATTTTGCTTTGCTTGAAATCAAAACATTCT TGGTCAAAGCGTTGCTGCAATTCAAATTTTCAGTAGACCTTAAGCGTTTGGAAATTAACAAGCTGAAT 35% to 3A27 trout aa 343-472 2 KNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTKFGDFDVPAGSFLRI PIDSAHMNESVYHDPHSFRPQRFLTGEIPPLSFLTFGQGTYNCIGKNFALLEIKTFLVKA LLQFKFSVDLKRLEINKLN 418 >CN776982 taf28f06.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to TR:Q9VXY0 Q9VXY0 CG9081 PROTEIN. ;. Length = 316 Same as 1096041191032 Query: 79 RPQRFLTGEIPPLSFLTFGQGTYNCIGKNFALLEIKTFLVKALLQFKFSVDLKRLEINKL 138 R +RFLTGEIPPLSFLTFGQG YNCIGKNFALLEIKTFLVKALLQF+FSVDLK L KL Sbjct: 307 RQERFLTGEIPPLSFLTFGQGIYNCIGKNFALLEIKTFLVKALLQFEFSVDLKHLNYKKL 128 QRTRQERFLT GEIPPLSFLTFGQGIYNCIGKNFALLEIKTFLVKALLQFEFSVDLKHLNY KKLISITNKTVEPLWIRVKPI* AGGTGAAATACCACCATTATCCTTCCTTACATTTGGGCAAGGTATATATAATTGTATCGGAAAGAAT TTTGCTTTGCTTGAAATCAAAACATTCTTGGTCAAAGCGTTGCTGCAATTCGAATTTTCA GTAGACCTTAAGCATTTGAATTATAAGAAGCTGATTTCGATTACTAATAAAACCGTTGAA CCGTTATGGATAAGAGTGAAGCCTATATAA Combined CN776982 and CN770283 [gene 5] (0) NLGVQNKLREEIKKNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTK FGDFDVPAGSFLRIPIDSAHMNESVYHDPHSFRPQRFLT () GEIPPLSFLTFGQGIYNCIGKNFALLEIKTFLVKALLQFEFSVDLKHLNY KKLISITNKTVEPLWIRVKPI* AGAACTTAGGTGTTCAAAACAAATTGAGAGAAGAGATAAAAAAGAATGT ATTTGATATAAAAAGTGTTTCATATGAAGAAGTTTTGAGTATCAAGTACTTAGATTGTGT TGTTAAAGAAACATTGCGCATGCATACACCTGTAGCATTTATTGGTAGAATAAATAAGAA TCAAACAAAGTTTGTTGACTTTGATGTACCTGCTGGCTCTTTT CYP4 Like >CN627429 tae92b11.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to SW:CP51_CANGA P50859 CYTOCHROME P450 51 GATAATGGTACTAACATAATTGTTTTGGATGACTTATCTAATTTATCATTTGATATAATTGGGGATGTTG GTTTTGGCTATCAATTTAACACAATTACTTCTCATTCTGGTAATGAGTTTACAAAAGCGCTTCAGAGTTA TTGTCAACTACGATTTCAATTGAATGCCGTCCATAAAGCTCTACTAGCTTTCTTTCCATTTTTAATGCAT CTGTCATTTATGTATGGAAAACGTAAACGAGCTGAGCAAGTCATCTGTAATACTTTAAACATGCTTATTA ATAAACGCAAAAAAGAGATAGACCACCGAATAGCAGCTGATCAAAAAGATTTTTTAACAGTTGTTTTAAA AGATCAACAAAAAGAAGGCAACAAGATGACAAATGACTTGATTAAAAATAATCTGATGACGCCTTTAATT GCAGGTCACAAAACAACTTCCACTGACATGCCATGGTGTTTCAACGTGCTTGCGCCAAACCCAAGTGCTA CCAAACACATGCAAAGAAACAGAAAGAAA GAATACATCT CGACCACAAAA 31% to 4T5 aa 183-347 1 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNAVHKALLA 189 190 FFPFLMHLSFMYGKRKRAEQVICNTLNM LINKRKKEIDHRIAADQKDFLTVVLK 351 352 DQQKEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK 540 CN770090 taf75f08.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to TR:Q40411 Q40411 PUTATIVE CYTOCHROME P-450. ;. Length = 299 Score = 113 bits (283), Expect = 1e-26 Identities = 54/56 (96%), Positives = 55/56 (98%) Frame = -1 Query: 1 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNA 56 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQL + Sbjct: 170 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLKS 3 CN775805 tae77f11.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to TR:Q42700 Q42700 CYTOCHROME P450 ;. Length = 562 Score = 58.5 bits (140), Expect = 5e-10 Identities = 27/27 (100%), Positives = 27/27 (100%) Frame = +3 Query: 1 DNGTNIIVLDDLSNLSFDIIGDVGFGY 27 DNGTNIIVLDDLSNLSFDIIGDVGFGY Sbjct: 480 DNGTNIIVLDDLSNLSFDIIGDVGFGY 560 >Combined CN627429 CN775805 27% to 4T5 [gene 6] GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFK ERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNGIFVSNYEDHH WQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDK 1 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQL RFQLNAVHKALLA 189 190 FFPFLMHLSFMYGKRKRAEQVICNTLNMLINKRKKEIDHRIAADQKDFLTVVLK 351 352 DQQKEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK 540 Combined CN776982 and CN770283 [gene 5] = DN812371.1 (0) NLGVQNKLREEIKKNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTK FGDFDVPAGSFLRIPIDSAHMNESVYHDPHSFRPQRFLT () GEIPPLSFLTFGQGIYNCIGKNFALLEIKTFLVKALL QFEFSVDLKHLNY KKLISITNKTVEPLWIRVKPI* AGAACTTAGGTGTTCAAAACAAATTGAGAGAAGAGATAAAAAAGAATGT ATTTGATATAAAAAGTGTTTCATATGAAGAAGTTTTGAGTATCAAGTACTTAGATTGTGT TGTTAAAGAAACATTGCGCATGCATACACCTGTAGCATTTATTGGTAGAATAAATAAGAA TCAAACAAAGTTTGTTGACTTTGATGTACCTGCTGGCTCTTTT EST DN812371.1 joins with CN627429 CN775805 and 1095901729505 GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFK ERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNGIFVSNYEDHH WQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDK DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQL RFQLNAVHKALLAFFPFLMHLSFMYGKRKRAEQVICNTLNM LINKRKKEIEDGIAADQKDFLTVVLKDQQKEGSKMTNDLIKDNLMTLLIAGHETTSTAMQWCLYMLGT NLGVQNKLREEIKKNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTKFGDFDV PAGSFLRIPIDSAHMNESVYHDPYSFRPERFLTGEIPPLSFLTFGQGIYNCIGKNFALLE IKTFLVKALLQFEFSVDLKHLNYKKLISITNKTVEPLWIRVKPI* 1095901729505 I-helix part of DN812371.1 1097675072038 1097672494604 1097567117390 (0) LINKRKKEIEDGIAADQKDFLTVVLKDQQKEGSKMTNDLIKDNLMTLLIAGHETTSTAMQWCLYMLGT (0) AGCTTATTAATAAACGCAAAAAAGAAATAGAAG ATGGAATAGCAGCTGATCAAAAAGATTTTTTAACAGTTGTTTTAAAAGATCAACAAAAAG AAGGCAGCAAGATGACAAATGACTTGATTAAAGATAATCTGATGACGCTTTTAATTGCTG GTCACGAAACAACTTCTACTGCAATGCAATGGTGTTTATACATGCTTGGCACAGT CYP17 like (4 different sequences) >CN769570 taf31c10.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to SW:CPT7_CHICK P12394 CYTOCHROME P450 17 ACAGAAAATCTTACGATGAGAATAATTTACGTGACATAACCGATGCATTAATAAAAGTGTCTTTAGATTC AGAGATGGGTGAAGAATTAACTGAAAAGATTACTGATGATAATATTGAGTTTCTTTTAAACGATTTTATG ATTGCTGGATCCGAAACTTCATCAAGTACTATTCTTTGGTTTATTGTTTACATGTTACATTGGCCAGAAT ACCAAAATAAACTTTATGATGAAATTACTAAAGTAGCATCAGATAACCGTTATGTATCTTTAAAGGATCG ACCTATGCTTCATTTAATGCAAGCTACAATTCATGAAACACTTAGACTGTCATCGGTGGTACCTCTTGGT TTGGTTCATAAAGCAATGGAGAACAGTAGCATTTGTGGCAAGTTTGTTCCTAAGGGAGCTCTTATTTTAA CAAATTTATGGAGTATGCATCACGATGAAAGCTATTGGAAAAATGCAATGAGTTTTTACTCGGAACGTTG GCTGGAAAAATCTGGCGAGTTCCATTATAAATTGGGGTACGCATAATTACCGTTTTCTATAGGG 35% to CYP17A zebrafish aa 266-449 3 RKSYDENNLRDITDALIKVSLDSEMGEELTEKITDDNIEFLLNDFMIAGSETSSSTILWF IVYMLHWPEYQNKLYDEITKVASDNRYVSLKDRPMLHLMQATIHETLRLSSVVPLGLVHK AMENSSICGKFVPKGALILTNLWSMHHDESYWKNAMSFYSERWLEKSGEFHYKLGYA*LP FSIG 554 >CN769290 opposite end = CN769570 taf31c10.y1 taf31c10.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to SW:CPT7_RANDY O57525 CYTOCHROME P450 17 TTATCTTGCTTTTCACGTTTCCTTGGTTATTCCATATTTTTGAAATTTTTAAATATTATATAAAGCAAAA ATACAGAAAAGTGAAGCAAAAATAGTTAATTTCTTGGAATTATCACGACTTCAAAGTCATTAGGAGGGGA GGTGATTCCAGAACGACCATCTAAACAAGGTAACTCTTTTCCAGTTGGCATTTCAAATCGGTAATCTTTA AGTAATCGTGTAATAAACACAAACAACTCTGTTTTTGCCAATGTTTCTCCTAAACAACTACGAGGTCCAT TAGAAAACGGTAAATATGCGTACCCCAATTTATAATTGAACTCGCCAGATTTTTCCAGCCAACGTTCCGG GTAAAAACTCATTGCATTTTTCCAATAGCTTTCATCGTGATGCATACTCCATAAATTTGTTAAAATAAGA GCTCCCTTAGGAACAAACTTGCCACAAATGCTACTGTTCTCCATTGCTTTATGAACCAAACCAAGAGGTA CCACCGATGACAGTCTAAGTGTTTCATGAATTGTAGCTTGCATTAAATGAAGCATAGGTCGATCCTTTAA AGATACATAACGGTTATCTGATGCTACTTTAGTAATTTCATCATAAAGTTTATTTTGGTATTCTGGCCAA TGTAACATGTAAACAATAAACCAAAGAATAGTACTTGATGAAGTTTCGGATCCAGCAATCATAAAATCGT TTACAAGAAACTCAATATTATCCTCAGT 42% to CYP17A aa 299-503 728 TEDNIEFLVNDFMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLKDR PMLHLMQATIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDESYWK NAMSFYPERWLEKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKDYRFEM PTGKELPCLDGRSGITSPPNDFEVVIIPRN* >Combined seq from CN769290 and CN769570 39% to CYP17A [gene 7] RKSYDENNLRDITDALIKVSLDSEMGEELTEKITDDNIEFLLNDFMIAGSETSSSTILWF IVYMLHWPEYQNKLYDEITKVASDNRYVSLKDR PMLHLMQATIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDESYWK NAMSFYPERWLEKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKDYRFEM PTGKELPCLDGRSGITSPPNDFEVVIIPRN* >CN774619 tae83e09.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to SW:CPT7_CHICK P12394 CYTOCHROME P450 17 TTTAAACTACATAAGATTGTTTACTCTTGGAATCAAGTATGCGCAAAATTGTTTTGGAGCATGAGTAATA TTGCATTTTCCAATTAAACTTGGAGGTGGACAACCAGGTGTGCACTCAAACTTAAAATCTCGAATTAATC TAGAAAAGAAGAAAAATAGTTCATTTTCAGCAACTGTTTTCCCTAAACAAACTCTTGTACCGGCTGAAAA AGGTAAGAAACTTGTTGCTTTACTTGGATCAAATTTCTTATCTTTGCCAATCCGCCGATAAGGAATTAAA TTCATTAGGATTTTCCCAGGAGTTAGTAACGTTATGAATCTGCCAATGATTAATCATGACAGTAGCAATT TTCAGGAATGCTATGTGACATGAGAACTCCCCCCCAGAGGT 38% to CYP17A2 Fugu aa 383-485 391 TSGGEFSCHIAFLKIATVMINHWQIHNVTNSWENPNEFN 275 273 PYRRIGKDKKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFKFECTPGCPP 94 93 PSLIGKCNITHAPKQFCAYLIPRVNNLM* 7 >CN570733 same as CN570522 BP505786 tag42d11.y1 Hydra EST -Kiel 2 Hydra magnipapillata cDNA 5' similar to SW:CPT7_ORYLA P70085 CYTOCHROME P450 17 AGCTGGTTTCTTCAAGACTTCTGTGAGGTAAACCTAATGGAATGACAGACGACAAACGCAGAGTTTCTTT CATAGCACTTTCAAATAAATGAAGCTTTGGACGATCTGAAAGACTAGGATACCTATCATTACCGACTATT TTAATAGTTTCATCATAGATATCATCTTGATACTTTGGCCAGTTAACTAAATAAAC VYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFESAMKETLRLSSVIPLGLPHRSLEETS >CN570522 same as CN570733 tag42d11.x1 Hydra EST -Kiel 2 Hydra magnipapillata cDNA 3' similar to SW:CPT7_ORYLA P70085 CYTOCHROME P450 17 GGTGTTTATTTAGTTAACTGGCCAAAGTATCAAGATGATATCTATGATGAAACTATTAAAATAGTCGGTA ATGATAGGTATCCTAGTCTTTCAGATCGTCCAAAGCTTCATTTATTTGAAAGTGCTATGAAAGAAACTCT GCGTTTGTCGTCTGTCATTCCATTAGGTTTACCTCACAGAAGTCTTGAAGAAACCAGC 43% to CYP17 aa 326-389 same as BP505786 GVYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFESAMKETLRLSSVIPLGLPHRSLEETS >CN566859 opposite end = CN566581 taf98h10.x1 taf98h10.y1 Hydra EST -Kiel 1 Hydra magnipapillata cDNA 5' similar to SW:CPT7_SQUAC Q92113 CYTOCHROME P450 17 TAATCATAAATGTGTATTTAAAATAAAAATTAAAACTCTTTTTGATAGACAATAGTTAGCTAATAGAAAT TAGAAAAATTTTTAATTAACACATTTTGCCTAAAATTTAAAGGTTAAGCAAACATTTACTAATCGTTTCA TTTTCAAATTATTTACTATTTATCTAGTATCTAATTTTTCAATCAATATAGTTTATATAAATTTAAAAAA AAAAACATTAGCAAAAT TAAAG CAATAAATTTTTACT CCTTGGAACTATTTCAACTTTAAAGTCATAAGG AGTACAAGTGATGCCAAAACTACCTTCTAAGCGCGGTAATTCTTCTAAAATTGGTTTCACAAATCGGAAA TCATTTATCAATCGTGATATAAAAATAAACAACTCAGTTTTTGCCAATGTTTCTCCAATACAGCTACGAG GTCCACTAGAGAATGGTAAATAAGCGTTTCCTAGTTTAGAATTAAATTCGCCAGAATTTTCTAACCAACG TTCAGGGTAAAAACTCATTGCATTTTTCCAGTAACTTTCGTCATGGTGTATACTCCATAGGTTGGTTATT ATAAGAGCCCCTTT GGGGAATAGGTTTTCCACAAATGCTACCTTTCTCC Combined seq Opposite end MFLEVIGAVFIPPLIWTIWVYIKHLIDCLHYPRGPIPLPFIG NGYLIRKAEPYKELVNLGKIYGDVFSFSVGSVRYVIVNSLEGIQEVLVKKGWQFAGRPKG PSWDRSIHGLIQRDPSKKFKILRKLATSSLKIFADGLAGMESKAI 44% to 17A1 fugu aa 378-485 [gene 2] 607 RKVAFVENLFPKGALIITNLWSIHHDESYWKNAMSFYPERWLENSGEFNSKLGNAYLPFSSGPRSCIGETL 395 394 AKTELFIFISRLINDFRFVKPILEELPRLEGSFGITCTPYDFKVEIVPRSKNLLV* 227 CYP46 like (only one seq) >CN775805 tae77f11.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to TR:Q42700 Q42700 CYTOCHROME P450 TCGGCATAGCAGTATTAATTTTTTTGTGTTTTTCACTGTTTTTTGCTAATATTTTAAAACGTTTTTATCA TCCGCTTCGTAAGTTGCCATCACCTAAAGAAAATTTCTTTACTGCTCATTATGGCTACTTTAATGGCTAT GATCAAATAAATGCTGTAATAAATTTTGGAAAACAGTTTAAAGAGCGTGGCTTGTATACATTAGATACAT TAAATGGATTTAGATTTGTTAATCTTTTAATGCCAGAATTTATTAAAACAGTGTTTTCTGATGGAAACTC ATTCCAAAGATCGACCGCTACAAAAGTTATATTTCCTCTAGTTGGAAATGGTATTTTTGTGTCAAATTAT GAAGATCATCATTGGCAAAGAAAAGTGTTAAATGAAGCTTTTACTTTACAACAGCTAAAAAATTATTTTC CAGCTTTTACAGTGCACATTGATTTGCTAATGAAACTTTGGTCATATTCATGTGACAAGGATAATGGTAC TAACATAATTGTTTTGGATGACTTATCTAATTTATCATTTGATATAATTGGGGATGTTGGTTTTGGCTAT CA N-term 26% to CYP46a zebrafish aa 10-203 3 GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINF 167 168 GKQFKERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNG 332 333 IFVSNYEDHHWQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDKDNGTNII 500 501 VLDDLSNLSFDIIGDVGFGY 560 >gi|47138506|gb|CN627429.1|CN627429 tae92b11.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to SW:CP51_CANGA P50859 CYTOCHROME P450 51 ;. Length = 540 Score = 58.5 bits (140), Expect = 5e-10 Identities = 27/27 (100%), Positives = 27/27 (100%) Frame = +1 Query: 160 DNGTNIIVLDDLSNLSFDIIGDVGFGY 186 DNGTNIIVLDDLSNLSFDIIGDVGFGY Sbjct: 1 DNGTNIIVLDDLSNLSFDIIGDVGFGY 81 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNAVHKA LLAFFPFLMHLSFMYGKRKRAEQVICNTLNMLINKRKKEIDHRIAADQKDFLTVVLKDQQ KEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK Combined CN775805 and CN627429 [gene 8] 3 GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINF 167 168 GKQFKERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNG 332 333 IFVSNYEDHHWQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDKDNGTNII 500 501 VLDDLSNLSFDIIGDVGFGY 560 QFNTITSHSGNEFTKALQSYCQLRFQLNAVHKA LLAFFPFLMHLSFMYGKRKRAEQVICNTLNMLINKRKKEIDHRIAADQKDFLTVVLKDQQ KEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK BP514308 BP514308 Hydra magnipapillata cDNA library Hydra magnipapillata cDNA clone hydmg002bw_87. Length = 586 Score = 66.6 bits (161), Expect = 1e-12 Identities = 28/56 (50%), Positives = 37/56 (66%) Frame = +2 Query: 5 LIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFK 60 +I + F A KRFYH R LPSPKE+ T HY YF+ +D +N ++NFGK+FK Sbjct: 53 IIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFGKEFK 220 >BP514308 N-term 25% to 46a [gene 9] 1096761991009 1096082187152 1096123591182 1095899045709 1097383004013 MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFGKEFKDYGLYTINTLI (1) GPRQVHLLLPHFIKTVIADGKFFQRSPVFKAVFPLVGNSMIVSNYEDHHWQRKLF NQAFTSQQLKRYFLAFTLHTDLLMK (0) LWSCTCDKENGTNLNVWSDLSNLSFDIIGDVGFGYQFNTITSHSGNAFTKALRSY INLRFNSSVVHNVLIAYFPFLMRFLSKFGNLNKAEQVIYNTLNM (0) ATGTATTCGATATACATAGCGATTATAATAGTTC CTTTAGTTTTTTTCGTTGCTGTTTTTTTTAAACGTTTTTATCATCAATTTCGTTTGTTGC CATCACCCAAAGAGAGTTTAATTACATGTCACTATAGTTATTTTGATGTTCATGACCATG TTAACACTCTGTTAAACTTTGGTAAAGAGTTTAAAGATTACGGATTATATACAATTAATA CTTTAATTGGT 1096041094868 1096013042315 1097383004013 1097675345181 1096602222013 AGCTCTGGTCATGCACATGTGATAAAGAAAATGGCACTAACTTAAACGTTTGGAGTGACTTGTCTAATCTTTC ATTTGATATAATTGGTGACGTTGGTTTTGGCTATCAATTCAACACTATTACATCTCATTC TGGAAATGCGTTTACAAAAGCACTTCGAAGTTATATTAACTTACGATTTAATTCTAGCGT AGTGCACAATGTTCTAATAGCTTATTTTCCATTCTTAATGCGTTTTTTATCAAAGTTTGG AAATCTTAATAAAGCTGAGCAAGTTATTTACAATACCCTGAACATGGT >BP508840 BP508840 Hydra magnipapillata cDNA library Hydra magnipapillata cDNA clone hmp_03437. Length = 452 Blast with CYP20 Fugu C-term Query: 88 VDQHLIPKESLVIYALGVILQDSDTWNAPYRFDPDRFEEESVKK----SFHLLGFSGSQT 143 VD HLIPK++ +I ALG + QD + P RFDPDRF ++ +++ +F GF+G + Sbjct: 18 VDGHLIPKKTPIILALGTVFQDETIFPEPDRFDPDRFSDKQIEERSALAFQPFGFAGKRK 197 Query: 144 CPELRFAYTVATVLLSVLVRQLKLHRLKDTLMEVRSELVSTPRDETWI 191 CP R AY +++ + +++ V+ P +E WI Sbjct: 198 CPGYRLAYAETLTYTFYIIKNFHISLFDKQSVKMHYGFVTKPSEEIWI 341 Note: this seqs. Best match in Fugu human and Ciona is CYP20 DYDIIVDGHLIPKKTPIILALGTVFQDETIFPEPDRFDPDRFSDKQIEERSALAFQPFGF AGKRKCPGYRLAYAETLTYTFYIIKNFHISLFDKQSVKMHYGFVTKPSEEIWIKVLRRKNI* >gnl|ti|647066038 1095898227332 Length = 1123 Score = 59.7 bits (143), Expect = 5e-08 Identities = 66/274 (24%), Positives = 119/274 (43%), Gaps = 21/274 (7%) Frame = -3 Query: 226 PEAGSKRETEFLKHRRVLEDIIRRIIQERKEGEDLQELPFI-DSMLQ-NYDSE------D 277 P A S+ E ++ R + I++R +QE ++ D L I D++++ + DSE + Sbjct: 866 PTATSRNIFEIIRLR---DPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGEELTE 696 Query: 278 KIIADAISF-----MVGGFHTSGYMFTWMLWYLSSHPESQDXXXXXXXXXXXXXXXXXXK 332 KI D I F M+ G TS W + Y+ PE Q+ K Sbjct: 695 KITDDNIEFLLNDFMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLK 516 Query: 333 EYSLRADTFLRQVQDETIRLSTLAPWA-ARYSDKKVTVCGYTIPAKTPMIHALGVGLKNK 391 + + ++ ET+RLS++ P + + ++CG +P ++ L ++ Sbjct: 515 DRPML--HLMQAAIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDE 342 Query: 392 TVWENTDSWDPDRFSP-----NGRRGNDFCPFGVHSRRKCPGYLFSYFEVGVFASILLS- 445 + W+N S+ P+R+ N + G + PF + R C G + E+ VF + LL Sbjct: 341 SYWKNAMSFYPERWLEKSGEFNYKLGYAYLPFS-NGPRSCLGETLAKTELFVFITRLLKD 165 Query: 446 -RFEIVPVEGQTVIQVHGLVTEPKDDIKIYIRSR 478 RFE+ + + +T P +D ++ I R Sbjct: 164 YRFEMPTGKELPCLDGRSGITSPPNDFEVVIIPR 63 37% to 2U1 fugu 866 PTATSRNIFEIIRLRDPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGEELTE 696 695 KITDDNIEFLLNDFMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLK 516 515 DRPMLHLMQAAIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDE 342 341 SYWKNAMSFYPERWLEKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKD 165 164 YRFEMPTGKELPCLDGRSGITSPPNDFEVVIIPR 63 >gnl|ti|648017453 1095896110991 Length = 1042 Score = 52.0 bits (123), Expect = 1e-05 Identities = 58/226 (25%), Positives = 95/226 (42%), Gaps = 19/226 (8%) Frame = -1 Query: 241 RVLEDIIRRIIQERKEGEDLQELPFIDSML--------QNYDSEDKIIADAISFMVG--- 289 R+ + I++R +QE ++ D L I L DS K+ D I F++ Sbjct: 697 RLRDPILKRKLQEHRKSYDENNLRDITDALIKASIHLNAEKDSLIKVTDDNIQFILNDLI 518 Query: 290 --GFHTSGYMFTWMLWYLSSHPESQDXXXXXXXXXXXXXXXXXXKEYSLRADTFLRQVQD 347 G TS TW + Y+ +PE QD + L L+ Sbjct: 517 LAGSETSSSTITWFIVYMLHYPEYQDKIFNEVIKVTSGNRYPCLNDRPLL--HLLQATIH 344 Query: 348 ETIRLSTLAPWAARY-SDKKVTVCGYTIPAKTPMIHALGVGLKNKTVWENTDSWDPDRF- 405 ET+RLS++AP R+ + + T+C + T +I L ++ W+N S+ P+R+ Sbjct: 343 ETLRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERWL 164 Query: 406 SPNG----RRGNDFCPFGVHSRRKCPGYLFSYFEVGVFASILLSRF 447 + G + GN + PF R C G + E+ V S L++ F Sbjct: 163 NETGEFDYKLGNAYIPFS-GGPRACLGETLAKTELFVIISRLVTDF 29 38% to 17A1 fugu 37% to 2U1 fugu 697 RLRDPILKRKLQEHRKSYDENNLRDITDALIKASIHLNAEKDSLIKVTDDNIQFILNDLI 518 517 LAGSETSSSTITWFIVYMLHYPEYQDKIFNEVIKVTSGNRYPCLNDRPLLHLLQATIH 344 343 ETLRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERWL 164 163 NETGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDF 29 >gnl|ti|655005893 1095958068757 Length = 952 Score = 44.3 bits (103), Expect = 0.002 Identities = 33/145 (22%), Positives = 59/145 (40%), Gaps = 5/145 (3%) Frame = -2 Query: 265 FIDSMLQNYDSEDKIIADAI-----SFMVGGFHTSGYMFTWMLWYLSSHPESQDXXXXXX 319 F+D +L Y + KI + I +FM G T+ W LW L +P+ Q Sbjct: 438 FLDLLLDIY-RKGKIDTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQKKLHKEI 262 Query: 320 XXXXXXXXXXXXKEYSLRADTFLRQVQDETIRLSTLAPWAARYSDKKVTVCGYTIPAKTP 379 K +R +L + E++R+ P R ++ +T+ G +P Sbjct: 261 DEIELNGGSLYDK---VRQSKYLEIILKESLRMHPPVPMYGRTVEEDMTIDGQFVPKGAQ 91 Query: 380 MIHALGVGLKNKTVWENTDSWDPDR 404 ++ + + N WEN + + P+R Sbjct: 90 IVLLVLILHSNPDYWENPNDFIPER 16 44% to 4V5 fugu 42% to 4T5 438 FLDLLLDIYRKGKIDTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQKKLHKEI 262 261 DEIELNGGSLYDKVRQSKYLEIILKESLRMHPPVPMYGRTVEEDMTIDGQFVPKGAQ 91 90 IVLLVLILHSNPDYWENPNDFIPER 16 >gnl|ti|655009968 1095963046224 Length = 1057 Score = 42.0 bits (97), Expect = 0.010 Identities = 21/47 (44%), Positives = 26/47 (55%) Frame = +2 46% to CYP20 35% to 27B1 Query: 65 GSLHQFLLHLHDNGKTPVTSFWWGKTHVVSFCSPQAFKESAVFVNRP 111 G +H+FL+ H P+ SF+WGK VS P FKE A NRP Sbjct: 422 GGIHKFLVENHKR-LGPMFSFYWGKELAVSLACPILFKEVATLFNRP 559 >gnl|ti|646862798 1095898098005 Length = 963 Score = 41.2 bits (95), Expect = 0.018 Identities = 56/246 (22%), Positives = 99/246 (40%), Gaps = 18/246 (7%) Frame = -3 Query: 235 EFLKHRRVLEDIIRRIIQE--RKEGEDLQELPFIDSMLQN----YDSEDKIIADA----- 283 E + R VL + + +++ RK E+ + + DSM+Q+ YD D A Sbjct: 793 EIKQARLVLNPWVEKKVEDHWRKYNEN-EIINVTDSMIQHFLTKYDGLDTDFAKKYITLL 617 Query: 284 -ISFMVGGFHTSGYMFTWMLWYLSSHPESQDXXXXXXXXXXXXXXXXXXKEYSLRADTFL 342 I +V G T+ WM+ YL +PE Q+ ++ L Sbjct: 616 LIELLVAGTETTAITICWMVLYLIHNPEYQEEIYKEITLNIGCRYPTLAEK---NLFPLL 446 Query: 343 RQVQDETIRLSTLAPW-AARYSDKKVTVCGYTIPAKTPMIHALGVGLKNKTVWENTDSWD 401 + ET+R++++ P A + K ++CG IP +I L + ++N + +D Sbjct: 445 QAFIQETLRITSVVPLNLAHKALKDTSICGKIIPKDAIVITNLWNLHHDNRYFKNPNEFD 266 Query: 402 PDRF-SPNGR----RGNDFCPFGVHSRRKCPGYLFSYFEVGVFASILLSRFEIVPVEGQT 456 P R+ + NG F PF R C G + ++ + S L+ F G+ Sbjct: 265 PKRWINENGLFDSISQKYFKPFSA-GARVCLGETLAKNQLFLIISGLIMNFIFTSAPGKD 89 Query: 457 VIQVHG 462 + + G Sbjct: 88 LPSLEG 71 35% to 17A1 34% to 2P4 793 EIKQARLVLNPWVEKKVEDHWRKYNEN-EIINVTDSMIQHFLTKYDGLDTDFAKKYITLL 617 616 LIELLVAGTETTAITICWMVLYLIHNPEYQEEIYKEITLNIGCRYPTLAEKNLFPLL 446 445 QAFIQETLRITSVVPLNLAHKALKDTSICGKIIPKDAIVITNLWNLHHDNRYFKNPNEFD 266 265 PKRWINENGLFDSISQKYFKPFSAGARVCLGETLAKNQLFLIISGLIMNFIFTSAPGKD 89 88 LPSLEG 71 >gnl|ti|651477674 1095901303788 Length = 819 Score = 38.5 bits (88), Expect = 0.11 Identities = 40/175 (22%), Positives = 69/175 (39%), Gaps = 7/175 (4%) Frame = +2 Query: 289 GGFHTSGYMFTWMLWYLSSHPESQDXXXXXXXXXXXXXXXXXXKEYSLRADTFLRQVQDE 348 G F T+ TW+++YL P Q +++ ++ E Sbjct: 29 GWFRTTAATITWLIFYLLHWPHYQSILYKEIKNVCGDQYPTFN---DIKSMPIMQATILE 199 Query: 349 TIRLSTLAPWAARYSD-KKVTVCGYTIPAKTPMIHALGVGLKNKTVWENTDSWDPDRF-S 406 T+RLS++ P + + + +TIP T +I L N+ WE ++P R+ Sbjct: 200 TLRLSSVVPLSLSHKAVNNAKINKFTIPKDTIIITNLWGVHHNEKYWEKPFEFNPMRWLD 379 Query: 407 PNGRRGN----DFCPFGVHSRRKCPGYLFSYFEVGVFASILLSRFEI-VPVEGQT 456 NG + PF R C G F+ ++ + S L+ F +P G+T Sbjct: 380 KNGELSTAKRLGYFPFSA-GPRGCIGESFARMQMFIICSRLIKDFSFELPQSGET 541 39% to CYP21 39% to 2R1 40% to 2P4 29 GWFRTTAATITWLIFYLLHWPHYQSILYKEIKNVCGDQYPTFNDIKSMPIMQATILE 199 200 TLRLSSVVPLSLSHKAVNNAKINKFTIPKDTIIITNLWGVHHNEKYWEKPFEFNPMRWLD 379 380 KNGELSTAKRLGYFPFSAGPRGCIGESFARMQMFIICSRLIKDFSFELPQSGET 541 >gnl|ti|654998190 1095901734433 Length = 1030 Score = 38.1 bits (87), Expect = 0.15 Identities = 48/200 (24%), Positives = 74/200 (37%), Gaps = 21/200 (10%) Frame = -2 Query: 197 FGDIFKDENELSKMAESYHVCWRTMEEGVPEAGSKRETEFLKHRRVLEDIIR-------R 249 F DI K NE S ++ + W P A S+ K++ + +IIR R Sbjct: 870 FQDIIKTHNETSYISS---IPWLRY---FPTATSRNMIILNKNKYNIFEIIRLRDPILIR 709 Query: 250 IIQERKEGEDLQELPFIDSML--------QNYDSEDKIIADAISF-----MVGGFHTSGY 296 +QE K D L + L DS +KI D F M+ G TS Sbjct: 708 KLQEHKRTYD*SNLGDVTDALIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSN 529 Query: 297 MFTWMLWYLSSHPESQDXXXXXXXXXXXXXXXXXXKEYSLRADTFLRQVQDETIRLSTLA 356 M W + Y+ PE QD + ++ + ET+RL ++A Sbjct: 528 MILWFIVYILHRPEYQDKLYDEIFKVFSGIGYPSLNDRP--RFHLIQAITHETLRLLSVA 355 Query: 357 PWA-ARYSDKKVTVCGYTIP 375 P + + ++CG +P Sbjct: 354 PLGLCHKAMENGSICGKFVP 295 30% to CYP21 25% to 1C2 870 FQDIIKTHNETSYISSIPWLRYFPTATSRNMIILNKNKYNIFEIIRLRDPILIR 709 708 KLQEHKRTYD*SNLGDVTDALIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSN 529 528 MILWFIVYILHRPEYQDKLYDEIFKVFSGIGYPSLNDRPRFHLIQAITHETLRLLSVA 355 354 PLGLCHKAMENGSICGKFVP 295 >gnl|ti|651148169 1095901003210 Length = 1130 Score = 37.4 bits (85), Expect = 0.25 Identities = 39/137 (28%), Positives = 54/137 (39%), Gaps = 20/137 (14%) Frame = -2 Query: 197 FGDIFKDENELSKMAESYHVCWRTMEEGVPEAGSKRETEFLKHRRVLEDIIR-------R 249 F DI K NE S ++ + W P A S+ K++ + +IIR R Sbjct: 544 FQDIIKTHNETSYISS---IPWLRY---FPTATSQNMIILNKNKYNIFEIIRLRDPILIR 383 Query: 250 IIQERKEGEDLQELPFIDSML--------QNYDSEDKIIADAISF-----MVGGFHTSGY 296 +QE K D L + +L DS +KI D F M+ G TS Sbjct: 382 KLQEHKRTYD*SNLGDVTDVLIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSN 203 Query: 297 MFTWMLWYLSSHPESQD 313 M W + Y+ PE QD Sbjct: 202 MILWFIVYILHRPEYQD 152 Same as above 544 FQDIIKTHNETSYISSIPWLRYFPTATSQNMIILNKNKYNIFEIIRLRDPILIR 383 382 KLQEHKRTYD*SNLGDVTDVLIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSN 203 202 MILWFIVYILHRPEYQD 152 Database: fasta.hydra_magnipapillata.001 Posted date: May 16, 2005 8:55 AM Number of letters in database: 513,442,738 Number of sequences in database: 500,000 gnl|ti|647066038 1095898227332 60 5e-08 gnl|ti|648017453 1095896110991 52 1e-05 gnl|ti|655005893 1095958068757 44 0.002 gnl|ti|655009968 1095963046224 42 0.010 gnl|ti|646862798 1095898098005 41 0.018 gnl|ti|651477674 1095901303788 39 0.11 gnl|ti|654998190 1095901734433 38 0.15 gnl|ti|651148169 1095901003210 37 0.25 CYP21danio search gnl|ti|647066038 1095898227332 177 2e-43 gnl|ti|649400787 1095898835518 153 3e-36 gnl|ti|648017453 1095896110991 150 2e-35 gnl|ti|647182814 1095899213949 142 8e-33 gnl|ti|646862798 1095898098005 141 1e-32 gnl|ti|651477674 1095901303788 141 1e-32 gnl|ti|647193621 1095899233960 133 3e-30 gnl|ti|647987527 1095895119635 97 4e-19 gnl|ti|651162328 1095901096755 96 9e-19 gnl|ti|654998190 1095901734433 91 2e-17 gnl|ti|655006784 1095958075467 81 2e-14 gnl|ti|651148169 1095901003210 74 3e-12 gnl|ti|648033522 1095897342515 72 8e-12 gnl|ti|647134594 1095899118747 72 8e-12 gnl|ti|648485307 1095899272864 71 2e-11 gnl|ti|648026854 1095896933215 70 4e-11 gnl|ti|651118815 1095900033599 70 4e-11 gnl|ti|655005893 1095958068757 60 5e-08 gnl|ti|647175227 1095898288652 45 5e-07 gnl|ti|648589386 1095733042694 56 8e-07 gnl|ti|649393684 1095898809307 54 2e-06 gnl|ti|646849327 1095897329284 51 2e-05 gnl|ti|646968536 1095898162561 49 9e-05 gnl|ti|647168675 1095899196297 48 2e-04 gnl|ti|649448444 1095899351259 33 0.079 gnl|ti|648014530 1095896049543 35 1.9 gnl|ti|655009845 1095963045220 34 3.2 gnl|ti|648592188 1095595897239 34 3.2 gnl|ti|653058100 1095949490108 34 3.2 >gnl|ti|647066038 1095898227332 Length = 1123 Score = 177 bits (449), Expect = 2e-43 Identities = 109/320 (34%), Positives = 168/320 (52%), Gaps = 8/320 (2%) Frame = -3 Query: 215 NVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFSRLMK 274 N+I T+ F+ Y+ + E Q + + + IV + S + S PLLR FP + + Sbjct: 1010 NIICTILFNHRYEDDNQEFQNIIKYSSLIVQTFNET--SYVSSIPLLRYFPTATSRNIFE 837 Query: 275 EVARRDELIGKHIEEFKKSEHKEG-GTLTSSLLKC-LEPQQGAANHXXXXXXXXXXXXXX 332 + RD ++ + ++E +KS K +T +L+K L+ + G Sbjct: 836 IIRLRDPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGEELTEKITDDNIEFLLND 657 Query: 333 XLIGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVL-DVRYPQYSDRHKLPYLCALIS 391 +I G+ET ++ + W + ++LH PE Q+K+Y+E+ V D RY DR L + A I Sbjct: 656 FMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLKDRPMLHLMQAAIH 477 Query: 392 EMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFL 451 E LRL V PL + H+A+ NSSI G F+PK +I+ NL+ HHD W + SF PER+L Sbjct: 476 ETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDESYWKNAMSFYPERWL 297 Query: 452 EGGGGSLRSL----IPFGGGARLCLGEAVAKMEMFLFTAYLLREFKF-LPASKEEPLPEL 506 E G L +PF G R CLGE +AK E+F+F LL++++F +P KE LP L Sbjct: 296 EKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKDYRFEMPTGKE--LPCL 123 Query: 507 RGVASVVLKVKPYTVIAHPR 526 G + + + V+ PR Sbjct: 122 DGRSGITSPPNDFEVVIIPR 63 34% to 17A1 35% to 2U1 fugu 33% to 2U1 human 1010 NIICTILFNHRYEDDNQEFQNIIKYSSLIVQTFNETSYVSSIPLLRYFPTATSRNIFE 837 836 IIRLRDPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGEELTEKITDDNIEFLLND 657 656 FMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLKDRPMLHLMQAAIH 477 476 ETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDESYWKNAMSFYPERWL 297 296 EKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKDYRFEMPTGKELPCL 123 122 DGRSGITSPPNDFEVVIIPR 63 >gnl|ti|649400787 1095898835518 Length = 1120 Score = 153 bits (387), Expect = 3e-36 Identities = 97/309 (31%), Positives = 161/309 (52%), Gaps = 11/309 (3%) Frame = +2 Query: 211 VASSNVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFS 270 VA NVI ++ F K Y+ + E +++ +N + + G +A+ P LR P Sbjct: 44 VAVLNVICSIVFGKRYEYENCEFKEILTYMNYVFT--GVAGTNAISFIPWLRFLPLDGLR 217 Query: 271 RLMKEVARRDELIGK----HIEEFKKSEHKEGGTLTSSLLKCLEPQQGAANHXXXXXXXX 326 +L K ++ RD ++ K H E + +S ++ T +++ + Sbjct: 218 KLKKGLSIRDPVLRKQLLYHRETYNESNLRD---YTDYVIQFSRDEAILKKFGEQLTDDY 388 Query: 327 XXXXXXXL-IGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDV-RYPQYSDRHKLP 384 + I GTET L W++ +L+H P+ QDK+Y E+ + RYP DR+ LP Sbjct: 389 LELLLNDIFIAGTETALTTLLWSIIYLIHWPKFQDKIYNEIVSAIGKNRYPSMKDRNMLP 568 Query: 385 YLCALISEMLRLRPVAPLAVPHRAIRNSSIAGHF-IPKNTIIIPNLYGAHHDPEVWDDPY 443 + A +SE LRL V PL VPH+A+ ++++ IPK T I+ NL+ HH+ W++P+ Sbjct: 569 LVNAALSETLRLSSVTPLGVPHKAMEDTTLLNDLKIPKGTTILTNLWQLHHNKNCWENPH 748 Query: 444 SFKPERFLEGGG--GSLRSL--IPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASK 499 F P R+ S++S+ +PF G R+CLG+ +A++E+FLF + L+R+FKF Sbjct: 749 EFNPYRWFTNDQTLDSIKSMNFLPFSAGTRVCLGKGIAEVELFLFYSRLVRDFKF-EVKP 925 Query: 500 EEPLPELRG 508 + LP L G Sbjct: 926 GDSLPSLYG 952 >gnl|ti|649400787 1095898835518 44% to 1095898227332, 39% to 17A1 fugu 35% to 2U1 44 VAVLNVICSIVFGKRYEYENCEFKEILTYMNYVFTGVAGTNAISFIPWLRFLPLDGLR 217 218 KLKKGLSIRDPVLRKQLLYHRETYNESNLRDYTDYVIQFSRDEAILKKFGEQLTDDY 388 389 LELLLNDIFIAGTETALTTLLWSIIYLIHWPKFQDKIYNEIVSAIGKNRYPSMKDRNMLP 568 569 LVNAALSETLRLSSVTPLGVPHKAMEDTTLLNDLKIPKGTTILTNLWQLHHNKNCWENPH 748 749 EFNPYRWFTNDQTLDSIKSMNFLPFSAGTRVCLGKGIAEVELFLFYSRLVRDFKFEVKP 925 926 GDSLPSLYG 952 >gnl|ti|648017453 1095896110991 Length = 1042 Score = 150 bits (380), Expect = 2e-35 Identities = 98/286 (34%), Positives = 150/286 (52%), Gaps = 8/286 (2%) Frame = -1 Query: 215 NVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFSRLMK 274 N+I T+ F++ Y++ E Q + + N + + + L S P LR FP S+ ++ Sbjct: 877 NIICTILFNQRYEQDDDEFQNIIKYSNLSFKAFSAS--NLLSSIPWLRYFPTTA-SKYIQ 707 Query: 275 EVAR-RDELIGKHIEEFKKSEHKEG-GTLTSSLLKCLEPQQGAANHXXXXXXXXXXXXXX 332 E+ R RD ++ + ++E +KS + +T +L+K + Sbjct: 706 EIERLRDPILKRKLQEHRKSYDENNLRDITDALIKASIHLNAEKDSLIKVTDDNIQFILN 527 Query: 333 XLI-GGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDV-RYPQYSDRHKLPYLCALI 390 LI G+ET ++ + W + ++LH PE QDK++ E+ V RYP +DR L L A I Sbjct: 526 DLILAGSETSSSTITWFIVYMLHYPEYQDKIFNEVIKVTSGNRYPCLNDRPLLHLLQATI 347 Query: 391 SEMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERF 450 E LRL VAPL + H+A+ NS+I + K T+II NL+ HHD W +P SF PER+ Sbjct: 346 HETLRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERW 167 Query: 451 LEGGGGSLRSL----IPFGGGARLCLGEAVAKMEMFLFTAYLLREF 492 L G L IPF GG R CLGE +AK E+F+ + L+ +F Sbjct: 166 LNETGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDF 29 877 NIICTILFNQRYEQDDDEFQNIIKYSNLSFKAFSASNLLSSIPWLRYFPTTASKYIQ 707 706 EIERLRDPILKRKLQEHRKSYDENNLRDITDALIKASIHLNAEKDSLIKVTDDNIQFILN 527 526 DLILAGSETSSSTITWFIVYMLHYPEYQDKIFNEVIKVTSGNRYPCLNDRPLLHLLQATI 347 346 HETLRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERW 167 166 LNETGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDF 29 >gnl|ti|647182814 1095899213949 Length = 1074 Score = 142 bits (357), Expect = 8e-33 Identities = 89/291 (30%), Positives = 145/291 (49%), Gaps = 7/291 (2%) Frame = +2 Query: 211 VASSNVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFS 270 VA NVI + F + Y S ++ +N IVS G +A+D P LR Sbjct: 83 VAVLNVICFIVFGERYQYSDPAFIEILTTINNIVS--GLSNTTAVDFLPGLRYLQFSEIK 256 Query: 271 RLMKEVARRDELIGKHIEEFKKS-EHKEGGTLTSSLLKCLEPQQGAANHXXXXXXXXXXX 329 +L + L+ +++ KK+ + T S++K + + Sbjct: 257 KLKSSLVIYFRLLNDQLKKHKKTFDENNIRDFTDSIIKFSKDETMENKFEEELTDEHLEH 436 Query: 330 XXXXL-IGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVL-DVRYPQYSDRHKLPYLC 387 + I G+ET L W + +++H P+ Q++++EE+ V+ + RYPQ SDR L + Sbjct: 437 VIGDMFIAGSETTLTSLLWLIIYMIHYPKYQEEIFEEITRVIGENRYPQLSDRDSLHLVK 616 Query: 388 ALISEMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKP 447 A I E LRL + PL VPH+ + ++++ G+ IPKNT +I N + H+D W +P F P Sbjct: 617 ASIKECLRLSSIIPLGVPHKTMSDTTLIGYNIPKNTTVIINHWQIHNDTNHWKNPNEFNP 796 Query: 448 ERFLEG----GGGSLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKF 494 R+++ S +PF G R+CLG+ VA+ E+F F L+R+FKF Sbjct: 797 HRWIDDDSKFDATRATSYLPFSAGTRVCLGKTVAETELFFFFTRLIRDFKF 949 >gnl|ti|647182814 1095899213949 54% to 1095898835518, 36% to 17A1 36% to 2U1 83 VAVLNVICFIVFGERYQYSDPAFIEILTTINNIVSGLSNTTAVDFLPGLRYLQFSEIK 256 257 KLKSSLVIYFRLLNDQLKKHKKTFDENNIRDFTDSIIKFSKDETMENKFEEELTDEHLEH 436 437 VIGDMFIAGSETTLTSLLWLIIYMIHYPKYQEEIFEEITRVIGENRYPQLSDRDSLHLVK 616 617 ASIKECLRLSSIIPLGVPHKTMSDTTLIGYNIPKNTTVIINHWQIHNDTNHWKNPNEFNP 796 797 HRWIDDDSKFDATRATSYLPFSAGTRVCLGKTVAETELFFFFTRLIRDFKF 949 >gnl|ti|646862798 1095898098005 Length = 963 Score = 141 bits (356), Expect = 1e-32 Identities = 71/193 (36%), Positives = 113/193 (58%), Gaps = 4/193 (2%) Frame = -3 Query: 334 LIGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDVRYPQYSDRHKLPYLCALISEM 393 L+ GTET A + W V +L+H PE Q+++Y+E+ + RYP ++++ P L A I E Sbjct: 604 LVAGTETTAITICWMVLYLIHNPEYQEEIYKEITLNIGCRYPTLAEKNLFPLLQAFIQET 425 Query: 394 LRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLEG 453 LR+ V PL + H+A++++SI G IPK+ I+I NL+ HHD + +P F P+R++ Sbjct: 424 LRITSVVPLNLAHKALKDTSICGKIIPKDAIVITNLWNLHHDNRYFKNPNEFDPKRWINE 245 Query: 454 GG----GSLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLPELRGV 509 G S + PF GAR+CLGE +AK ++FL + L+ F F A ++ LP L G Sbjct: 244 NGLFDSISQKYFKPFSAGARVCLGETLAKNQLFLIISGLIMNFIFTSAPGKD-LPSLEGQ 68 Query: 510 ASVVLKVKPYTVI 522 + + + V+ Sbjct: 67 FGITFRPNSFKVL 29 >gnl|ti|651477674 1095901303788 Length = 819 Score = 141 bits (355), Expect = 1e-32 Identities = 75/196 (38%), Positives = 111/196 (56%), Gaps = 5/196 (2%) Frame = +2 Query: 336 GGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDVRYPQYSDRHKLPYLCALISEMLR 395 G T AA + W + +LLH P Q +Y+E+ V +YP ++D +P + A I E LR Sbjct: 29 GWFRTTAATITWLIFYLLHWPHYQSILYKEIKNVCGDQYPTFNDIKSMPIMQATILETLR 208 Query: 396 LRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLEGGG 455 L V PL++ H+A+ N+ I IPK+TIII NL+G HH+ + W+ P+ F P R+L+ G Sbjct: 209 LSSVVPLSLSHKAVNNAKINKFTIPKDTIIITNLWGVHHNEKYWEKPFEFNPMRWLDKNG 388 Query: 456 ----GSLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKF-LPASKEEPLPELRGVA 510 PF G R C+GE+ A+M+MF+ + L+++F F LP S E P+L G Sbjct: 389 ELSTAKRLGYFPFSAGPRGCIGESFARMQMFIICSRLIKDFSFELPQSGE--TPKLDGDI 562 Query: 511 SVVLKVKPYTVIAHPR 526 + L PY +A R Sbjct: 563 GITLTPLPYNAVAKQR 610 29 GWFRTTAATITWLIFYLLHWPHYQSILYKEIKNVCGDQYPTFNDIKSMPIMQATILETLR 208 209 LSSVVPLSLSHKAVNNAKINKFTIPKDTIIITNLWGVHHNEKYWEKPFEFNPMRWLDKNG 388 389 ELSTAKRLGYFPFSAGPRGCIGESFARMQMFIICSRLIKDFSFELPQSGETPKLDGDI 562 563 GITLTPLPYNAVAKQR 610 >gnl|ti|647193621 1095899233960 Length = 1050 Score = 133 bits (335), Expect = 3e-30 Identities = 96/310 (30%), Positives = 149/310 (48%), Gaps = 10/310 (3%) Frame = +2 Query: 215 NVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFSRLMK 274 NV+ + F Y+++ EL+K+ I+ G A+ P LR FP+ ++ K Sbjct: 110 NVLCGIVFGTQYEENDKELEKVISFKQLILD--GVADTFAISFLPWLRFFPSNGLKKVRK 283 Query: 275 EVARRDELIG----KHIEEFKKSEHKEGGTLTSSLLKCLEPQQGAANHXXXXXXXXXXXX 330 V RD+L+ KH E + + ++ T +LK + + + N Sbjct: 284 GVLIRDKLLRFQLKKHRETYNPVQIRD---YTDYVLKYSKEFETSRNIDEQLSEDNMEMM 454 Query: 331 XXXL-IGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVL-DVRYPQYSDRHKLPYLCA 388 + I G+ET + L W +L++ P+ QD +Y+E ++ + RYP SDR KL + Sbjct: 455 LQDIFISGSETTISTLLWFAVYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFES 634 Query: 389 LISEMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPE 448 + E LRL V PL +PHR++ +SI IPKNT ++ NL+ HHD + W DP++F P Sbjct: 635 AMKETLRLSSVIPLGLPHRSLEETSIKKFKIPKNTNVMINLWQLHHDSKSWSDPHTFNPY 814 Query: 449 RFLEGGG----GSLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLP 504 R+L + +PF G R CLG + +FLF L+R+F L P Sbjct: 815 RWLNDKNIFDKSKNPNYLPFSTGLRACLGYHTTESIIFLFFTRLIRDFN-LCLKPGASTP 991 Query: 505 ELRGVASVVL 514 L GV V L Sbjct: 992 SLNGVLRVTL 1021 >gnl|ti|647193621 1095899233960 50% to 1095898835518 37% to 17A1 110 NVLCGIVFGTQYEENDKELEKVISFKQLILDGVADTFAISFLPWLRFFPSNGLKKVRK 283 284 GVLIRDKLLRFQLKKHRETYNPVQIRDYTDYVLKYSKEFETSRNIDEQLSEDNMEMM 454 455 LQDIFISGSETTISTLLWFAVYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFES 634 635 AMKETLRLSSVIPLGLPHRSLEETSIKKFKIPKNTNVMINLWQLHHDSKSWSDPHTFNPY 814 815 RWLNDKNIFDKSKNPNYLPFSTGLRACLGYHTTESIIFLFFTRLIRDFN-LCLKPGASTP 991 992 SLNGVLRVTL 1021 >gnl|ti|647987527 1095895119635 Length = 1003 Score = 96.7 bits (239), Expect = 4e-19 Identities = 57/137 (41%), Positives = 74/137 (54%), Gaps = 4/137 (2%) Frame = +3 Query: 394 LRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLEG 453 LRL VAPL + H+A+ NS+I + K T+II NL+ HHD W +P SF PER+L Sbjct: 18 LRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERWLNE 197 Query: 454 GGGSLRSL----IPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLPELRGV 509 G L IPF GG R CLGE +AK E+F+ + L+ +F F S EE LP L Sbjct: 198 TGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDFYF-EKSVEEDLPRLDSF 374 Query: 510 ASVVLKVKPYTVIAHPR 526 V + V+ R Sbjct: 375 PGVTRSPYDFKVVVVSR 425 >gnl|ti|647987527 1095895119635 Same as 1095896110991 18 LRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERWLNE 197 198 TGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDFYFEKSVEEDLPRLDSF 374 375 PGVTRSPYDFKVVVVSR 425 >gnl|ti|651162328 1095901096755 Length = 986 Score = 95.5 bits (236), Expect = 9e-19 Identities = 59/181 (32%), Positives = 94/181 (51%), Gaps = 5/181 (2%) Frame = -1 Query: 334 LIGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLD-VRYPQYSDRHKLPYLCALISE 392 +I G+ET + ++ W + ++LHRPE QDK+Y+E+ V + YP +DR + + A+I E Sbjct: 752 MIAGSETSSNMILWFIVYILHRPEYQDKLYDEIFKVFSGIGYPSLNDRPRFHLIQAIIHE 573 Query: 393 MLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLE 452 LRL VAPL + H+A+ N SI G F+PK L + + F Sbjct: 572 TLRLLSVAPLGLCHKALENGSICGKFVPKGASYSD*LMEYTS**ALLEKCNKFFS*ALAR 393 Query: 453 GGGG----SLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLPELRG 508 G + F GG R CLGE +AK E+ +F + L+++++F + ++ L G Sbjct: 392 *FGQF*L*FRLCIFVFSGGPRSCLGETLAKTELLVFISRLVKDYRFEKPNGKDSLDGRSG 213 Query: 509 V 509 V Sbjct: 212 V 210 >gnl|ti|651162328 1095901096755 2 aa diffs to 1095901734433 752 MIAGSETSSNMILWFIVYILHRPEYQDKLYDEIFKVFSGIGYPSLNDRPRFHLIQAIIHE 573 572 TLRLLSVAPLGLCHKALENGSICGKFVPKGASYSD*LMEYTS**ALLEKCNKFFS*ALAR 393 392 *FGQF*L*FRLCIFVFSGGPRSCLGETLAKTELLVFISRLVKDYRFEKPNGKDSLDGRSG 213 212 V 210 >gnl|ti|654998190 1095901734433 Length = 1030 Score = 90.9 bits (224), Expect = 2e-17 Identities = 57/182 (31%), Positives = 93/182 (51%), Gaps = 13/182 (7%) Frame = -2 Query: 253 SALDSFPLLRKFP----------NPPFSRLMKEVARRDELIGKHIEEFKKS-EHKEGGTL 301 S + S P LR FP N + + + RD ++ + ++E K++ + G + Sbjct: 837 SYISSIPWLRYFPTATSRNMIILNKNKYNIFEIIRLRDPILIRKLQEHKRTYD*SNLGDV 658 Query: 302 TSSLLKC-LEPQQGAANHXXXXXXXXXXXXXXXLIGGTETIAALLNWTVAFLLHRPEVQD 360 T +L+K LE +H +I G+ET + ++ W + ++LHRPE QD Sbjct: 657 TDALIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSNMILWFIVYILHRPEYQD 478 Query: 361 KVYEELCCVLD-VRYPQYSDRHKLPYLCALISEMLRLRPVAPLAVPHRAIRNSSIAGHFI 419 K+Y+E+ V + YP +DR + + A+ E LRL VAPL + H+A+ N SI G F+ Sbjct: 477 KLYDEIFKVFSGIGYPSLNDRPRFHLIQAITHETLRLLSVAPLGLCHKAMENGSICGKFV 298 Query: 420 PK 421 PK Sbjct: 297 PK 292 Score = 73.6 bits (179), Expect = 4e-12 Identities = 41/107 (38%), Positives = 61/107 (57%), Gaps = 5/107 (4%) Frame = -3 Query: 410 RNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLEGGGGSLRSL----IPFG 465 R G+ P+ +I+ NL HHD W + +F PER+L+ G +L +PF Sbjct: 326 RTVVFVGNLFPRELLILTNL*SIHHDECYWKNAINFFPERWLDNSGNFNYNLGYAYLPFS 147 Query: 466 GGARLCLGEAVAKMEMFLFTAYLLREFKF-LPASKEEPLPELRGVAS 511 GG R CLGE +AK E+F+F + L+++++F P KE LP L G +S Sbjct: 146 GGPRSCLGETLAKTELFVFISRLVKDYRFEKPNGKE--LPSLDGRSS 12 837 SYISSIPWLRYFPTATSRNMIILNKNKYNIFEIIRLRDPILIRKLQEHKRTYD*SNLGDV 658 657 TDALIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSNMILWFIVYILHRPEYQD 478 477 KLYDEIFKVFSGIGYPSLNDRPRFHLIQAITHETLRLLSVAPLGLCHKAMENGSICGKFV 298 297 PKELLILTNL*SIHHDECYWKNAINFFPERWLDNSGNFNYNLGYAYLPFS 147 146 GGPRSCLGETLAKTELFVFISRLVKDYRFEKPNGKELPSLDGRSS 12 >gnl|ti|655006784 1095958075467 Length = 931 Score = 80.9 bits (198), Expect = 2e-14 Identities = 46/141 (32%), Positives = 73/141 (51%), Gaps = 6/141 (4%) Frame = -2 Query: 392 EMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFL 451 E LRL + PL VPH+ + ++++ + + +I N + H+D W +P P R++ Sbjct: 744 ECLRLSSIIPLGVPHKTMSDTTLIAYTLNIYGTVIINHWQIHNDTNHWKNPNESTPHRWI 565 Query: 452 EGGGG----SLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKF--LPASKEEPLPE 505 + S +PF G R+CLG+ VA+ E+F F L+R+FKF +P PLP Sbjct: 564 DDDSKFDATRATSYLPFSAGTRVCLGKTVAETELFFFFTRLIRDFKFEGVPGC---PLPS 394 Query: 506 LRGVASVVLKVKPYTVIAHPR 526 L G S+ L + + V PR Sbjct: 393 LIGKCSITLAPEEFNVHVTPR 331 >gnl|ti|655006784 1095958075467 Same as 1095899213949 744 ECLRLSSIIPLGVPHKTMSDTTLIAYTLNIYGTVIINHWQIHNDTNHWKNPNESTPHRWI 565 564 DDDSKFDATRATSYLPFSAGTRVCLGKTVAETELFFFFTRLIRDFKFEGVPGCPLPS 394 393 LIGKCSITLAPEEFNVHVTPR 331 >gnl|ti|651148169 1095901003210 Length = 1130 Score = 73.9 bits (180), Expect = 3e-12 Identities = 50/170 (29%), Positives = 83/170 (48%), Gaps = 13/170 (7%) Frame = -2 Query: 253 SALDSFPLLRKFP----------NPPFSRLMKEVARRDELIGKHIEEFKKS-EHKEGGTL 301 S + S P LR FP N + + + RD ++ + ++E K++ + G + Sbjct: 511 SYISSIPWLRYFPTATSQNMIILNKNKYNIFEIIRLRDPILIRKLQEHKRTYD*SNLGDV 332 Query: 302 TSSLLKC-LEPQQGAANHXXXXXXXXXXXXXXXLIGGTETIAALLNWTVAFLLHRPEVQD 360 T L+K LE +H +I G+ET + ++ W + ++LHRPE QD Sbjct: 331 TDVLIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSNMILWFIVYILHRPEYQD 152 Query: 361 KVYEELCCVLD-VRYPQYSDRHKLPYLCALISEMLRLRPVAPLAVPHRAI 409 K+Y+E+ V + YP +DR + + A+I E LRL VAPL H+ + Sbjct: 151 KLYDEIFKVFSGIGYPSLNDRPRFHLIQAIIHETLRLLSVAPLG*SHKPV 2 >gnl|ti|648033522 1095897342515 Length = 1108 Score = 72.4 bits (176), Expect = 8e-12 Identities = 46/143 (32%), Positives = 80/143 (55%), Gaps = 2/143 (1%) Frame = +3 Query: 69 GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128 GP LP++GN L L +K YG ++ L+ G +V+++ + IRE LV+K Sbjct: 153 GPFPLPIIGN-LHLIGKKPHEKFVEYSKKYGEVFSLSFGM-HRVVIVSGKDSIREVLVQK 326 Query: 129 WSDFAGRPYSYTGXDIVSGGGRTISLGDFSEEWKAHRRVTHSALQRC--TTDSLHSVIEK 186 + FAGRP +Y +IVS G + I GD +WK R++ HS+L+ +T L +++ K Sbjct: 327 SNIFAGRPKNYIA-NIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK 503 Query: 187 QAQHLCQVLRDYSGKAVDLSEDF 209 +++ L + L ++ +L ++F Sbjct: 504 ESEELHKRLFKNCNRSTELEDEF 572 >gnl|ti|648033522 1095897342515 39% to 17A1 N-term 153 GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK 326 327 SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK 503 504 ESEELHKRLFKNCNRSTELEDEF 572 >gnl|ti|647134594 1095899118747 Length = 1050 Score = 72.4 bits (176), Expect = 8e-12 Identities = 46/143 (32%), Positives = 80/143 (55%), Gaps = 2/143 (1%) Frame = -1 Query: 69 GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128 GP LP++GN L L +K YG ++ L+ G +V+++ + IRE LV+K Sbjct: 612 GPFPLPIIGN-LHLIGKKPHEKFVEYSKKYGEVFSLSFGM-HRVVIVSGKDSIREVLVQK 439 Query: 129 WSDFAGRPYSYTGXDIVSGGGRTISLGDFSEEWKAHRRVTHSALQRC--TTDSLHSVIEK 186 + FAGRP +Y +IVS G + I GD +WK R++ HS+L+ +T L +++ K Sbjct: 438 SNIFAGRPKNYIA-NIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK 262 Query: 187 QAQHLCQVLRDYSGKAVDLSEDF 209 +++ L + L ++ +L ++F Sbjct: 261 ESEELHKRLFKNCNRSTELEDEF 193 612 GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK 439 438 SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK 262 261 ESEELHKRLFKNCNRSTELEDEF 193 >gnl|ti|648485307 1095899272864 Length = 944 Score = 70.9 bits (172), Expect = 2e-11 Identities = 46/143 (32%), Positives = 80/143 (55%), Gaps = 2/143 (1%) Frame = -2 Query: 69 GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128 GP +P+ GN+ L + I L A +K YG ++ ++ G +V++++ REALV+K Sbjct: 799 GPFPIPIFGNLHLLGTEPHKI-LAAYSKKYGAVFSISLG-LQRIVIISDITTTREALVQK 626 Query: 129 WSDFAGRPYSYTGXDIVSGGGRTISLGDFSEEWKAHRRVTHSALQRCTT--DSLHSVIEK 186 S FAGRP SY ++S G + I+ D+ WK R+V+HS+L+ + ++ K Sbjct: 625 ASIFAGRPKSYL-IQLISSGYKGIAFMDYGSFWKVLRKVSHSSLKIYGEGHERFEKILTK 449 Query: 187 QAQHLCQVLRDYSGKAVDLSEDF 209 +++ L + L S +V+L +F Sbjct: 448 ESEELHKRLLKKSNNSVELKSEF 380 >gnl|ti|648485307 1095899272864 57% to 1095897342515 799 GPFPIPIFGNLHLLGTEPHKILAAYSKKYGAVFSISLGLQRIVIISDITTTREALVQK 626 625 ASIFAGRPKSYLIQLISSGYKGIAFMDYGSFWKVLRKVSHSSLKIYGEGHERFEKILTK 449 448 ESEELHKRLLKKSNNSVELKSEF 380 >gnl|ti|648026854 1095896933215 Length = 1081 Score = 70.1 bits (170), Expect = 4e-11 Identities = 46/143 (32%), Positives = 78/143 (54%), Gaps = 2/143 (1%) Frame = +1 Query: 69 GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128 GP LP++GN L L +K YG ++ L+ G +V+++ + IRE LV+K Sbjct: 175 GPFPLPIIGN-LHLIGKKPHEKFVEYSKKYGEVFSLSFGM-HRVVIVSGKDSIREVLVQK 348 Query: 129 WSDFAGRPYSYTGXDIVSGGGRTISLGDFSEEWKAHRRVTHSALQRC--TTDSLHSVIEK 186 + FAGRP +Y +IVS G + I GD +WK R++ HS+L+ +T L +++ + Sbjct: 349 SNIFAGRPKNYIA-NIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTAHLETLVVR 525 Query: 187 QAQHLCQVLRDYSGKAVDLSEDF 209 +++ L + L S ++ L F Sbjct: 526 ESEELHKNLYKKSNRSTKLEHKF 594 175 GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK 348 349 SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTAHLETLVVR 525 526 ESEELHKNLYKKSNRSTKLEHKF 594 >gnl|ti|651118815 1095900033599 Length = 1071 Score = 70.1 bits (170), Expect = 4e-11 Identities = 46/143 (32%), Positives = 78/143 (54%), Gaps = 2/143 (1%) Frame = -1 Query: 69 GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128 GP LP++GN L L +K YG ++ L+ G +V+++ + IRE LV+K Sbjct: 516 GPFPLPIIGN-LHLIGKKPHEKFVEYSKKYGEVFSLSFGM-HRVVIVSGKDSIREVLVQK 343 Query: 129 WSDFAGRPYSYTGXDIVSGGGRTISLGDFSEEWKAHRRVTHSALQRC--TTDSLHSVIEK 186 + FAGRP +Y +IVS G + I GD +WK R++ HS+L+ +T L +++ + Sbjct: 342 SNIFAGRPKNYIA-NIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTAHLETLVVR 166 Query: 187 QAQHLCQVLRDYSGKAVDLSEDF 209 +++ L + L S ++ L F Sbjct: 165 ESEELHKNLYKKSNRSTKLEHKF 97 516 GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK 343 342 SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTAHLETLVVR 166 165 ESEELHKNLYKKSNRSTKLEHKF 97 >gnl|ti|655005893 1095958068757 Length = 952 Score = 59.7 bits (143), Expect = 5e-08 Identities = 70/308 (22%), Positives = 122/308 (39%), Gaps = 8/308 (2%) Frame = -2 Query: 150 RTISLGDFSEEWKAHRRVTHSALQRCTTDSLHSVIEKQAQHLCQVLRDYSG--KAVDLSE 207 +T L +WK RR+ + ++ + E+QA L L + + VD+ Sbjct: 921 KTGLLTSTGSKWKTRRRLLTPSFHFSILNNFIKIFEEQASILVDKLAVAADNKEVVDVQV 742 Query: 208 DFTVASSNVITTLTFS---KAYDKSSAELQKLQECLNEIVSLWGS-PWISALDSFPLLRK 263 +A+ ++I + A +E K LNE + + PW+ + LL Sbjct: 741 PIGLATLDIICETSMGVKVNAQSHPDSEYVKAITVLNEEIQMRQKFPWLWFDAIYKLL-- 568 Query: 264 FPNPPFSRLMKEVARRDELIGKHIEEFKKSEHKEGGTLTSSLLK--CLEPQQGAANHXXX 321 P R K + +L I E + + +E T+S K L+ Sbjct: 567 ---PCGKRFYKALDVAHKLSFDVINERMQMKIQESYCETASDEKKFFLDLLLDIYRKGKI 397 Query: 322 XXXXXXXXXXXXLIGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDVRYPQYSDRH 381 + G +T +A L WT+ L P+VQ K+++E+ + Y Sbjct: 396 DTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQKKLHKEIDEIELNGGSLYDKVR 217 Query: 382 KLPYLCALISEMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDD 441 + YL ++ E LR+ P P+ + +I G F+PK I+ + H +P+ W++ Sbjct: 216 QSKYLEIILKESLRMHPPVPM-YGRTVEEDMTIDGQFVPKGAQIVLLVLILHSNPDYWEN 40 Query: 442 PYSFKPER 449 P F PER Sbjct: 39 PNDFIPER 16 921 KTGLLTSTGSKWKTRRRLLTPSFHFSILNNFIKIFEEQASILVDKLAVAADNKEVVDVQV 742 741 PIGLATLDIICETSMGVKVNAQSHPDSEYVKAITVLNEEIQMRQKFPWLWFDAIYKLL 568 567 PCGKRFYKALDVAHKLSFDVINERMQMKIQESYCETASDEKKFFLDLLLDIYRKGKI 397 396 DTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQKKLHKEIDEIELNGGSLYDKVR 217 216 QSKYLEIILKESLRMHPPVPMYGRTVEEDMTIDGQFVPKGAQIVLLVLILHSNPDYWEN 40 39 PNDFIPER 16 >gnl|ti|647175227 1095898288652 Length = 1081 Score = 45.1 bits (105), Expect(2) = 5e-07 Identities = 21/54 (38%), Positives = 33/54 (61%) Frame = -1 Query: 461 LIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLPELRGVASVVL 514 + F G R+CLG+ +A++E+FLF + L+R+FKF + LP L G + L Sbjct: 961 IFTFSAGTRVCLGKGIAEVELFLFYSRLVRDFKF-EVKPGDSLPSLYGNCGITL 803 Score = 31.2 bits (69), Expect(2) = 5e-07 Identities = 10/26 (38%), Positives = 17/26 (65%) Frame = -3 Query: 425 IIPNLYGAHHDPEVWDDPYSFKPERF 450 I+ NL+ HH+ W++P+ F P R+ Sbjct: 1079 ILTNLWQLHHNKNCWENPHEFNPYRW 1002 ILTNLWQLHHNKNCWENPHEFNPYRWXXXXXXXXXX IFTFSAGTRVCLGKGIAEVELFLFYSRLVRDFKFEVKPGDSLPSLYGNCGITL >gnl|ti|648589386 1095733042694 Length = 1032 Score = 55.8 bits (133), Expect = 8e-07 Identities = 48/189 (25%), Positives = 84/189 (44%), Gaps = 6/189 (3%) Frame = +1 Query: 211 VASSNVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFS 270 VA NVI + F + Y S ++ +N IV+ G +A+D P LR Sbjct: 454 VAILNVICFIVFGERYQYSDPAFIEILTTINNIVA--GLSNTTAVDFLPGLRYLQFSEIK 627 Query: 271 RLMKEVARRDELIG----KHIEEFKKSEHKEGGTLTSSLLKCLEPQQGAANHXXXXXXXX 326 +L + L+ KH E F ++ ++ T S++K + + Sbjct: 628 KLKSSLVIYFRLLNDQLKKHKETFDENNIRD---FTDSIIKFSKDETMENKFEEELTDEH 798 Query: 327 XXXXXXXL-IGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVL-DVRYPQYSDRHKLP 384 + IGG+ET L W + +++H P+ Q++++EE+ V+ + RYP+ SDR L Sbjct: 799 LEHVIGDMFIGGSETTLTSLLWLIIYMIHYPKYQEEIFEEITRVIGENRYPKLSDRDSLH 978 Query: 385 YLCALISEM 393 + A I + Sbjct: 979 LVKASIKRV 1005 454 VAILNVICFIVFGERYQYSDPAFIEILTTINNIVAGLSNTTAVDFLPGLRYLQFSEIK 627 628 KLKSSLVIYFRLLNDQLKKHKETFDENNIRDFTDSIIKFSKDETMENKFEEELTDEH 798 799 LEHVIGDMFIGGSETTLTSLLWLIIYMIHYPKYQEEIFEEITRVIGENRYPKLSDRDSLH 978 979 LVKASIKRV 1005 >gnl|ti|649393684 1095898809307 Length = 1093 Score = 54.3 bits (129), Expect = 2e-06 Identities = 25/65 (38%), Positives = 43/65 (66%) Frame = +2 Query: 462 IPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLPELRGVASVVLKVKPYTV 521 +PF G R CLGEA+AK+E+F+F + L+++++F ++EE LP L+G + + + V Sbjct: 50 LPFSSGPRSCLGEALAKIELFIFISRLVKDYRFEKPTEEE-LPNLKGESGITRIPSEFKV 226 Query: 522 IAHPR 526 + PR Sbjct: 227 MTIPR 241 >gnl|ti|649393684 1095898809307 45% to 17A1 C-term LPFSSGPRSCLGEALAKIELFIFISRLVKDYRFEKPTEEELPNLKGESGITRIPSEFKVMTIPR >gnl|ti|646849327 1095897329284 Length = 980 Score = 51.2 bits (121), Expect = 2e-05 Identities = 30/68 (44%), Positives = 39/68 (57%) Frame = +2 Query: 69 GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128 GP LP +GN L + L L K YG+++ + GS VV+NN E I+E L+KK Sbjct: 302 GPIPLPFIGNAHLLRKGEPYKELVNLGKIYGDVFGFSIGSIR-YVVVNNLEGIKEVLIKK 478 Query: 129 WSDFAGRP 136 S FAGRP Sbjct: 479 GSQFAGRP 502 >gnl|ti|646849327 1095897329284 40% to 2X2 C-term GPIPLPFIGNAHLLRKGEPYKELVNLGKIYGDVFGFSIGSIRYVVVNNLEGIKEVLIKKGSQFAGRP >gnl|ti|646968536 1095898162561 Length = 1074 Score = 48.9 bits (115), Expect = 9e-05 Identities = 32/91 (35%), Positives = 44/91 (48%), Gaps = 2/91 (2%) Frame = +1 Query: 48 FPKLLHSLYKLFFSTVSPTI--SGPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLN 105 FP L+ +Y + GP LP +GN L + L K YG+I+ + Sbjct: 598 FPPLIWFVYSYIKHLIECLYYPKGPVPLPFIGNTNLLRKKETCKEFVNLGKIYGDIFGFS 777 Query: 106 CGSTSAMVVLNNSEIIREALVKKWSDFAGRP 136 GS V++NN E I E L+KK S F+GRP Sbjct: 778 IGSIR-YVIVNNLEGIHEVLIKKGSQFSGRP 867 >gnl|ti|646968536 1095898162561 83% to 1095897329284 FPPLIWFVYSYIKHLIECLYYPKGPVPLPFIGNTNLLRKKETCKEFVNLGKIYGDIFGFSIGSIRYVIVNNLEGIHEVLIKKGSQFSGRP >gnl|ti|647168675 1095899196297 Length = 998 Score = 48.1 bits (113), Expect = 2e-04 Identities = 18/48 (37%), Positives = 32/48 (66%) Frame = -2 Same as 1095898098005 Query: 334 LIGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDVRYPQYSDRH 381 L+ GTET A + W V +L+H PE Q+++Y+E+ + RYP ++++ Sbjct: 175 LVAGTETTAITICWMVLYLIHNPEYQEEIYKEITSNIGCRYPTLAEKN 32 >gnl|ti|649448444 1095899351259 Length = 1086 Score = 32.7 bits (73), Expect(2) = 0.079 Identities = 15/32 (46%), Positives = 19/32 (59%) Frame = -3 Query: 414 IAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSF 445 I G FIPK + I + H +PE W DP+SF Sbjct: 1039 IDGQFIPKKSEIAILVMMIHLNPEYWKDPHSF 944 Score = 25.4 bits (54), Expect(2) = 0.079 Identities = 10/31 (32%), Positives = 17/31 (54%) Frame = -2 Query: 462 IPFGGGARLCLGEAVAKMEMFLFTAYLLREF 492 IPF G R C+G+ A +E + +++ F Sbjct: 890 IPFSAGPRNCIGQKFAMIEEKMLLYNIMKHF 798 >gnl|ti|649448444 1095899351259 similar to 4V5 IDGQFIPKKSEIAILVMMIHLNPEYWKDPHSFIP IPFSAGPRNCIGQKFAMIEEKMLLYNIMKHF AGTAGGTATATTGAAGAAGATATGATGATTGATGGTCAGT TTATTCCTAAAAAATCCGAAATCGCTATTCTTGTGATGATGATACATTTAAATCCTGAGT ATTGGAAAGATCCTCACAGCTTTATAcCTGAAAGATTTGATCAAGATGATTTTGTAAAGCG TAATCCATACACTTACATTCCATTCTCCGCTGGCCCTAGAAATTGCATTGGTCAAAAGTT TGCAATGATAGAGGAAAAAATGCTGTTATATAACATAATGAAACATTTTTATGTAGAATC CATGCAGAATGAAAATGAAATTTTAAGAACTCAAGATCTTATAAGTAAATCAGCTAATGG TATCATGATGAAGTTCTATGAAAGATGA >gnl|ti|648014530 1095896049543 Length = 1075 Score = 34.7 bits (78), Expect = 1.9 Identities = 14/34 (41%), Positives = 22/34 (64%) Frame = -3 Query: 420 PKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLEG 453 P NTI+ ++ H + ++ DP+SFK ERF+ G Sbjct: 971 PANTILRTHVSSIHMNETIYPDPHSFKHERFMTG 870 >gnl|ti|648014530 1095896049543 41% to CYP21 PANTILRTHVSSIHMNETIYPDPHSFKHERFMTG >gnl|ti|655009845 1095963045220 Length = 1143 Score = 33.9 bits (76), Expect = 3.2 Identities = 35/127 (27%), Positives = 50/127 (39%), Gaps = 10/127 (7%) Frame = -3 Query: 108 STSAMVVLNNSEIIREALVKKWSDFAGR--------PYSYTGXDIVSGGGRTISLGDFSE 159 STS V +N S+ ++ L K D + PYSY + + G I L Sbjct: 637 STSNCVTMNQSDYLQNVLQKFGFDNCKQRSSPCKQVPYSYHNQESIKSNGSMIKL----- 473 Query: 160 EWKAHRRVTHSAL--QRCTTDSLHSVIEKQAQHLCQVLRDYSGKAVDLSEDFTVASSNVI 217 +R++ S L CT L VI K +QHL + SG + + F + Sbjct: 472 ----YRQMVGSLLYAMTCTRPDLSYVITKLSQHLS---KPNSGDWIMIKHVFRYIKHTLN 314 Query: 218 TTLTFSK 224 LTF K Sbjct: 313 YCLTFRK 293 >gnl|ti|655009845 1095963045220 near C-helix region poor match 637 STSNCVTMNQSDYLQNVLQKFGFDNCKQRSSPCKQVPYSYHNQESIKSNGSMIKL 473 472 YRQMVGSLLYAMTCTRPDLSYVITKLSQHLSKPNSGDWIMIKHVFRYIKHTLN 314 313 YCLTFRK 293 >gnl|ti|648592188 1095595897239 Length = 1036 Score = 33.9 bits (76), Expect = 3.2 Identities = 35/127 (27%), Positives = 50/127 (39%), Gaps = 10/127 (7%) Frame = -3 Query: 108 STSAMVVLNNSEIIREALVKKWSDFAGR--------PYSYTGXDIVSGGGRTISLGDFSE 159 STS V +N S+ ++ L K D + PYSY + + G I L Sbjct: 638 STSNCVTMNQSDYLQNVLQKFGFDNCKQRSSPCKQVPYSYHNQESIKSNGSMIKL----- 474 Query: 160 EWKAHRRVTHSAL--QRCTTDSLHSVIEKQAQHLCQVLRDYSGKAVDLSEDFTVASSNVI 217 +R++ S L CT L VI K +QHL + SG + + F + Sbjct: 473 ----YRQMVGSLLYAMTCTRPDLSYVITKLSQHLS---KPNSGDWIMIKHVFRYIKHTLN 315 Query: 218 TTLTFSK 224 LTF K Sbjct: 314 YCLTFRK 294 638 STSNCVTMNQSDYLQNVLQKFGFDNCKQRSSPCKQVPYSYHNQESIKSNGSMIKL 474 473 YRQMVGSLLYAMTCTRPDLSYVITKLSQHLSKPNSGDWIMIKHVFRYIKHTLN 315 314 YCLTFRK 294 >gnl|ti|653058100 1095949490108 Length = 1087 Score = 33.9 bits (76), Expect = 3.2 Identities = 16/37 (43%), Positives = 22/37 (59%) Frame = -1 Pretty doubtful match to mid CYP21 region Query: 229 SSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFP 265 S LQ L + ++ S W SPW+ +LD LL+KFP Sbjct: 124 SGRSLQWLLQWYSQQSSQWYSPWLHSLDKCRLLKKFP 14     MNUbc. / t u $ % a b ' ( d e      W X   K L ?@ *h5l7CJOJQJaJ" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJphh5l7CJOJQJaJFNc/ u % b ( e   X  L ed`lM @}4q"0SOd3tXed`lM@|}34pq!"/0RSNOcd23stWXKL./g" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJph *h5l7CJOJQJaJh5l7CJOJQJaJNL/h+h9fz1nrsed`lMed`lMgh*+gh89efiyz01mnqrsABab8:;<=>blh5l7B*CJOJQJaJph *h5l7CJaJ *h5l7CJaJh5l7CJaJ *h5l7CJOJQJaJ *h5l7CJOJQJaJh5l7CJOJQJaJEBb;<=>m$Co&cWed`lMed`lMlm#$BCbno%&bcVWbBhi@A A E F ! !!!!"!A!^! *h5l7CJOJQJaJh5l7" *h5l7B*CJOJQJaJphh5l7CJOJQJaJh5l7B*CJOJQJaJphLiA F !"!_!!!"S""" #G###ed`lMed`lM^!_!!!!!""A"R"S""""" # #A#F#G#######:$;$A$w$x$$$$$$$%%/%0%~%%%%%%%& &0&d&e&&&&&' ' '0'S'T'b'c''''''''0(8(9(r(s((((()h5l7B*CJOJQJaJph *h5l7CJOJQJaJh5l7CJOJQJaJQ##;$x$$$$%0%%%% &e&&& ' 'T'c''''9(s((())L)h)ed`lM))()))0)K)L)N)g)h)))))**[*\*****+++O+P+++++,,,C,D,,,,,,,----N-O-k-------.2.3.N.O.R..... / //I/J/////000=0>0_0h5l7B*CJOJQJaJph *h5l7CJOJQJaJh5l7CJOJQJaJ *h5l7CJOJQJaJOh)))*\***+P+++,D,,,,--O----3.O... /J///ed`lM/0>0b0001E1g1111:2w22233R3S3T33334T444#5\55ed`lM_0a0b00000011D1E1f1g1111111192:2v2w2222222333Q3R3S3T33333333344S4T444444"5#5[5\5z555555555556.6/6o6p6~66666" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJphh5l7CJOJQJaJ *h5l7CJOJQJaJM55555/6p6667C77778X8889L999:@:}:::4;q;;ed`lM677B7C77777777888W8X88888899K9L999999::?:@:|:}::::::3;4;p;q;;;;;;;;<<T<U<<<<<<<<<<<<< ==J=K=U=" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJphI;;;<U<<<=K===>>>c>>>>?=?z???7@|@@@A@A}Aed`lMU=====>>>>>U>b>c>>>>>>>>??/?5?GGed`lMDD-E.E4EjEkEEEEE F F+F,F4FhFiFtFuFFFFFFFFF4G=G>GGGGG H H4HIHJHjHkHlHHHHHIIMINIqIrIIJKFLGLHL}L~LLLLLLLMM?M@M|M}MMMMMMh5l7B*CJaJphh5l7CJaJh5l7B*CJOJQJaJph *h5l7CJOJQJaJh5l7CJOJQJaJMGG HJHkHHHINIrIGLHL~LLLM@M}MMM4NrNNNNNOOmOed`lMed`lMM3N4N]NqNrNNNNNNNNOOO9OGOTOlOmO~OOOOOOOOPPQPRPPPPPQ Q2Q4Q5Q6QQQQQQQQRR"R#RoRpRrRRRRЗ" *h5l7B*CJOJQJaJph *h5l7CJOJQJaJ *h5l7CJOJQJaJ *h5l7CJOJQJaJh5l7B*CJOJQJaJphh5l7CJOJQJaJh5l7B*CJaJphh5l7CJaJ;mOOOPRPPP Q5Q6QQQQR#RpRRRSDSESSSSS+TgTvTTed`lMed`lMRRRSSASCSDSES~SSSSSSSS*T+TfTgTuTvTTTTTTTT1U2UhUiUUUUUUUUUUVVoqosotouoooooopppDpEppҳ *h5l7CJOJQJaJ" *h5l7B*CJOJQJaJphh5l7CJOJQJaJh5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphF\k]kkkk lIlllll$mdmzm{mmmm;n~nnno>otouooopEped`lMpppppppqq q qTq]q^qbqqqqqqqq%r&rbrcrrrrrrrrr sssRsSskslsnssssstttRtStatctdtetttttuu uu(u)udużųŦ *h5l7CJOJQJaJ *h5l7CJaJ *h5l7CJaJh5l7CJaJ *h5l7CJOJQJaJ *h5l7CJOJQJaJ *h5l7CJOJQJaJh5l7CJOJQJaJAEpppp q q^qqqq&rcrrrrsSslssstStdtetttu)ueued`lMed`lMdueuuuuuuuuuvvvYvZvvvvvvvv-w.w=w>wwwwwwww xxNxOxYxZxxxxxxxx&y'yfygyuyvyyyyyzzHzIzzzzzzzz{{"{#{`{a{{{{{{{{||I|J|||||||}} *h5l7CJOJQJaJh5l7CJOJQJaJ[euuuuuvZvvvv.w>wwwwxOxZxxxx'ygyvyyyzIzzzed`lMzz{#{a{{{{|J||||}>}Z}}}~$~%~u~~~~1ned`lMed`lM}=}>}J}Y}Z}}}}}~~#~$~%~J~t~u~~~~~~~01Jmn56JԀՀ؀)*Jfg:;NORłƂh5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJph *h5l7CJOJQJaJ *h5l7CJOJQJaJh5l7CJaJh5l7CJOJQJaJE6Հ*g;OƂ@Arb{ed`lMed`lM?@Aqrabz{Z[Ʌ˅̅ WXQRއ߇CD *h5l7CJOJQJaJ" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7CJOJQJaJh5l7B*CJOJQJaJph?[̅XR߇Dfʉ8uڊ <~ed`lMefɉʉ78tuيڊ ;<}~\]̌͌ FGҍӍ ׳׳ס׳׳" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJph *h5l7CJOJQJaJh5l7CJOJQJaJ?~]͌ GӍ7tY9lm=ed`lM 67stXY89klm$*<=FI‘ﭭh5l7CJaJ *h5l7CJOJQJaJh5l7CJOJQJaJ" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJph6=‘E9uɓCRy0Vn%3yіed`lMed`lMDE89tuȓɓBCQRxyɔ/0UVmnɕ$%23xyɖЖі !/:JKopɗҗӗLMƘǘɘ@AI *h5l7CJOJQJaJh5l7CJOJQJaJ *h5l7CJOJQJaJUіKpӗMǘAMN™8u,d(ed`lMILMN™78tu+,cd~'(_`؜ٜWX֝ם NOz{12no{Ɵǟ89uv{ *h5l7CJOJQJaJh5l7B*CJOJQJaJphh5l7CJOJQJaJ *h5l7CJOJQJaJO(`ٜXם O{2oǟ9v-j!^آed`lM,-ij{ !]^{עآ=>z{DE{~ΤϤST!"FGHIv̦ͦ$%HI9:vwh5l7B*CJOJQJaJph *h5l7CJOJQJaJh5l7CJOJQJaJUآ>{EϤT"GHIͦ%I:ed`lM:w9M|3p'dޫXҬ!^حRed`lM89LMP{|23op&'cdݫޫWXѬҬ !]^׭حQRˮ̮ QTUV¯ï'(ghh5l7B*CJOJQJaJph *h5l7CJOJQJaJ *h5l7CJOJQJaJh5l7CJOJQJaJP̮ UVï(h.n Gq9gh'med`lMh߰-.Dmn FGpq89fgh&'lm?@˵̵./0NO|}¶&34AB" *h5l7B*CJOJQJaJphh5l7CJOJQJaJh5l7B*CJOJQJaJphOm@̵/0O}¶4B6s?|ع=ed`lM56Brs>?B{|׹ع<=?BNOȺɺBZ[»û<=yz޺޺ޮޮ޺ޮh5l7CJOJQJaJ" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphC=Oɺ[û=zͼ9v)jy¾ed`lMed`lM̼ͼ 89uv()ijxy{¾;<Y\]^z{ȿɿ˿&'cd%&4оܾܬܾܬ" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7CJOJQJaJh5l7B*CJOJQJaJphh5l7B*CJaJphh5l7CJaJD¾<]^{ɿ'd&5PN'led`lM45OPMN&')kl 34^bpqVWq}ѷѪј" *h5l7B*CJOJQJaJph *h5l7CJOJQJaJ *h5l7CJOJQJaJ *h5l7CJOJQJaJh5l7CJOJQJaJ" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJph64qW ]@^)*h^hed`lM & Fed`lMed`lM} \] ?@]^`  ()*+CDxy{ N[\~Ҹҫҙ" *h5l7B*CJOJQJaJph *h5l7CJOJQJaJ *h5l7CJOJQJaJ *h5l7CJOJQJaJh5l7CJOJQJaJh5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJph<*Dy\@}QR-gFed`lM~ ?@|}PQR(,-NTfg~EFcd *h5l7CJOJQJaJ" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJphh5l7CJOJQJaJ?d7hTI=GH}ed`lM67ghjSTHI<=DFGHo|}568|}*,-WX￿ﷷh5l7CJaJh5l7CJOJQJaJ" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJphH6}-X1n3xV Jed`lMed`lM01mn237jswx|7UV 7IJ7=>ehijֲֲֲֲ֠" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7CJaJ?>ij:w%P|3p\ed`lMj79:vw$%*KOPR{|23op~ո" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJph *h5l7CJOJQJaJ" *h5l7B*CJOJQJaJph *h5l7CJOJQJaJh5l7CJOJQJaJ:[\^TU@ADcd$&'_`MN12ɷɮɷ *h5l7CJaJ *h5l7CJaJ *h5l7CJaJh5l7CJaJh5l7CJOJQJaJ" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJphD\UAd'`N2ed`lMed`lM9:vw;<VYpx|},./0stҸ҈vҚ" *h5l7B*CJOJQJaJphh5l7 *h5l7B*CJaJph *h5l7CJaJh5l7B*CJaJph *h5l7CJaJh5l7CJaJ" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7CJOJQJaJ.:w<}/0tY"A}ed`lMed`lMed`lMDVXY!"@A|}#%&')ST -μΪΘ΋h5l7CJOJQJaJ *h5l7CJOJQJaJ" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJph *h5l7B*CJaJphh5l7B*CJaJphh5l7CJaJ,&'T 12Vw Nq?gabcded`lM-012UVvw MNpqu>?fguu`abcdu?EFGuIJIJ" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7CJOJQJaJ *h5l7CJOJQJaJCdG*g1n#g1N Jed`lM)*fgu01mn"#fg01MNP IJ=>z{12noҸ *h5l7CJOJQJaJ *h5l7CJOJQJaJh5l7CJOJQJaJh5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphE>{2o$:t<r.k0ed`lMed`lM#$+9:st;<qr-.jk~/0lm#$`a暚 *h5l7CJOJQJaJ *h5l7CJOJQJaJ *h5l7CJOJQJaJh5l7CJaJ *h5l7CJOJQJaJh5l7B*CJOJQJaJphh5l7CJOJQJaJ *h5l7CJOJQJaJ<0m$a1n&cWK\]ed`lM01mn%&bcVW JKX[\] ^_ !abc23pq" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJph *h5l7CJOJQJaJh5l7CJOJQJaJN _!bc3qK6red`lMJK56qr #./kl"#_`134=>ʸʸʸʸ *h5l7CJOJQJaJ" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJph *h5l7CJOJQJaJ *h5l7CJOJQJaJh5l7CJOJQJaJC /l#`4>A"#<\ed`lM>@A!"#;<A[\A C D l m     A B           - A I J l m           F G z { |      " *h5l7B*CJOJQJaJphh5l7CJOJQJaJh5l7B*CJOJQJaJphRD m   B      J m      G { |   @ }    Ued`lM ? @ | }        TU  HI<=yz01mnQR  GH" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJphIU I=z1nR H9ed`lM89:CD89 TU !45dewxy  PQ()opDEwxy%&'JKLyzh5l7B*CJOJQJaJph^9:D9U!5exy Q)ped`lMpExy&'KLz.vK hed`lM-.uvJKv ghvWXv  N O v        !!:!;!v!!!!!""V"W""""""###?#@#|#}######$$$/$0$E$F$$$h5l7B*CJOJQJaJph^X O     !;!!!"W""""#@#}####$0$F$ed`lMF$$$&%'%2%e%|%%%:&Y&Z&&&& '7'8'j'''(((('(r(((ed`lM$$$%%%&%'%1%2%d%e%{%|%~%%%%%&9&:&U&X&Y&Z&&&&&&&'' '6'7'8':'i'j'''''((((((&('(q(r((((((()))o)p))))))D*E*********+ +@+" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJphP())p)))E***** +A+`++++3,A,X,Y,,,,, -M----.ed`lM@+A+_+`++++++2,3,@,A,W,X,Y,,,,,,,,,- -L-M-------..,.-...V.W...........///E/F////////000n0o0s000000,1-1W1X1s11111111122O2" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJphW.-...W......//F////00o0000-1X111112P2|22ed`lMO2P2s2{2|222222223343[3\3s3333333?4@4s4|4}444444445L5M5O5n5o5s555556 6!6"6#6$6F6G6H6Q6R6^66666 7 7S7T7^77777(8)8 *h5l7CJOJQJaJ *h5l7CJOJQJaJ" *h5l7B*CJOJQJaJphh5l7CJOJQJaJh5l7B*CJOJQJaJphH2243\3333@4}44445M5o555!6"6#6$6G6H6R666 7T777ed`lM7)8p88889\99999:Y:::;W;;;,<s<<=H=e=f====ed`lM)8^8o8p88888899[9\9^99999999 ::X:Y:^:::::;;V;W;;;;;<+<,<r<s<<<===G=H=d=e=f=~======>9>:>Y>Z>[>>>>>>>?7?8?t?u?????????@@@[@\@@@@@0A1Ah5l7B*CJOJQJaJph^=:>Z>[>>>>8?u??????@\@@@1AxAAAAA6BZB[BBBBed`lM1AwAxAAAAAAAA5B6BYBZB[B~BBBBBBB4C5C{C|CCCCCCCCDDhDiDDDDDDEELEMENEvEwEEEEEEE=F>FWFXFFFFF,G-GXGvGwGGGHHKHLHXHHHHHHHHHHHIIKILIXIyIzI{IIIh5l7B*CJOJQJaJph^B5C|CCCCCDiDDDEMENEwEEEE>FXFFF-GwGGHLHHHHed`lMHHHHILIzI{III1J2J3J4JNJOJYJJJ KTKKK)LpLLLMM+Med`lMIII0J1J2J3J4JMJNJOJXJYJJJJJ K KSKTKKKKK L(L)LoLpLLLLLMMM M*M+MjMkMMMMMN N N NVNWNmNnNNNNNNNN O$O%O0O1O2O\O]OOOOOOO'P(PdPePfPPPPPP Q QLQMQiQjQQQQQh5l7B*CJOJQJaJph^+MkMMM N NWNnNNNNN%O1O2O]OOOO(PePfPPP QMQjQQQRed`lMQQRRRRPRQRuRvRRRRRRRRRRRR=S>SSSSSSSSS5T6TTTTTTTUU6UCUDUEUHUgUhUUUUUVV(V*V+V6VpVqVsVVVVV4W5W6WqWrWWWWW" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJph" *h5l7B*CJOJQJaJphh5l7B*CJOJQJaJphIRRRQRvRRRRRRR>SSSSS6TTTTUDUEUhUUUV+VqVVed`lMVV5WrWWWWX&X'XDXEXXX%Y&YfYYYY Z!Z^ZZZZZZZZed`lMWWWXX%X&X'X6XCXDXEXXXXX$Y%Y&Y6YeYfYYYYYYYZ Z!Z]Z^ZZZZZZZZZZZZ[[K[L[W[X[Y[[[[[[8\9\:\\\\\\]]]f]g]]]]]]]G^H^^^^^^^__<_=_m_n_o_~_h5l7CJOJQJaJh5l7B*CJOJQJaJphZZ[L[X[Y[[[9\:\\\]]g]]]]H^^^^_=_n_o___`I``ed`lM~_____``H`I`````````````aacadaoapaqaaaabbPbQbRbbbbbb1c2c3c>c~ccccddd>dQdRddddddd.e/e>eqereeeeee f f#f$f%f>fMfNffffffff6g7g>gggggghh>hh5l7CJOJQJaJ````````adapaqaabQbRbbb2c3cccddRdddd/ereeed`lMeee f$f%fNfffff7gggghchdhhhhh9iziiiiiiied`lM>hbhchdhhhhhhhhhh8i9iyiziiiiiiiiiiiii0j1j@uv@UVŃƃ56mn݄ބ6MNޅ߅ !jkvwxĆņ Z[\h5l7B*CJOJQJaJph3fh5l7CJOJQJaJZ΁>vVƃ6nބN߅!kwxņ[ed`lM[\?@ֈ#$qU2VNj S݌ed`lM>?@Ոֈ"#$pqTU12UVƋNj  RS܌݌234\]GH()tuv  UVW678h5l7CJOJQJaJ`34]H)uv VW78̑2G`ed`lM8ˑ̑12FG_`67{|89PQRz{ÔĔϔДєܔdeܕEFܖޖߖ&'rstܗSTU̘͘ܘ   LM֙h5l7B*CJOJQJaJphh5l7CJOJQJaJZ`7|9QR{ĔДєeFߖ'sted`lMTU͘  MיRSs  W8̜͜ed`lM֙יܙQRSrs   VW78˜̜͜`aABўҞSTUtu|)*nos'()QRsh5l7B*CJOJQJaJphh5l7CJOJQJaJZaBҞTUu*o()R<ed`lM;<sԢբhijsHIJfgsԤդ֤GHSTU`456`ɦʦ`bc34STU`ިߨ!":;<=\]`tuvh5l7CJOJQJaJ`բijIJgդ֤HTU56ʦed`lMc4TUߨ";<=]uvB֪ת#ked`lMABժ֪ת"#jkKL,-xyzŭƭ YZ[tuŮƮͮ*+lm;<OPQpqSTh5l7B*CJOJQJaJphh5l7CJOJQJaJZL-yzƭZ[uƮ+m<PQqed`lMT5}ɲʲ)*J`$;<eNed`lM45|}Ȳɲʲ()*IJR_`#$:;<deMN./z{|Ƕȶ[\]mnz{ȷɷ RSW¸øڸ۸h5l7B*CJOJQJaJphh5l7CJOJQJaJZN/{|ȶ\]n{ɷSø۸ܸNZ[ed`lM۸ܸMNWYZ[:;<WϺкWhi/01234[\4DEнѽ  4IJKԾվ4TU¿ÿ34?@A !"h5l7CJOJQJaJ`;<кi0123\Eѽ JKed`lMվU¿ÿ4@A!"n(In+IJed`lM"mn"'(HImn"*+HIJij"MN"./vwCD/0;<=ijh5l7B*CJOJQJaJphh5l7CJOJQJaJZJjN/wD0<=ed`lMj&Ipq+KLlO0xed`lM%&HIopq*+JKLklNO/0wx567yz?@()tuv  UVW}~h5l7CJOJQJaJ`67z@)uv VW~ed`lMAB PQ\]^=>? FGij89:YZqrs<=ef"#$(fgh5l7CJOJQJaJ`B Q]^>? Gj9:Zred`lMrs=f#$g,ued`lM(+,tu(abc(BCD#$%(pq(QR()*no89{|%&123xyh5l7CJOJQJaJ`bcCD$%qR)*o9|ed`lM&23y/bnoIJjed`lM./abmnoHIJijLM-.uvVW/0qr,-_`klmLMh5l7CJOJQJaJ`M.vW0r-ed`lM-`lmMNct67`4ed`lMMNbcst567_`34;<=\]tuv<=+,Z[\  #$%MNh5l7B*CJOJQJaJphh5l7CJOJQJaJZ<=]uv=,[\ $%Ned`lM]^_`45ab/0Z[f%&bcfWXfqtuv *h5l7CJOJQJaJh5l7B*CJOJQJaJphh5l7CJOJQJaJU^_`5b0[&ed`lM&cXuvGu34Zed`lMFGftu2345YZ5CD$%5pqr235pq )*567 cd "#h5l7CJOJQJaJ`D%qr3q*67ded`lMd#:;{)\h,-/02356gd9ed`lM#9:;z{ ()[\gh+,-./0123456789:;<=>?@ABCDEFGh9h5l7jh5l7Uh5l7CJOJQJaJ9689:;<=>?@ABCDEFGed`lMgd901h/R / =!"#$% x666666666vvvvvvvvv666666>6666666666666666666666666666666666666666666666666hH66666666666666666666666666666666666666666666666666666666666666666p62&6FVfv2(&6FVfv&6FVfv&6FVfv&6FVfv&6FVfv&6FVfv8XV~ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@66666_HmH nH sH tH H`H Normal CJOJQJ_HaJmH sH tH DA D Default Paragraph FontRiR 0 Table Normal4 l4a (k ( 0No List D/D msonormaldd[$\$OJQJHB`H 0 Body TextB*CJOJQJaJphH/H 0Body Text CharCJOJPJQJaJ8Z`"8 0 Plain TextOJQJN/1N 0Plain Text CharCJOJPJQJ^JaJ4@B4 90Header  H$B/QB 90 Header CharCJOJPJQJaJ4 @b4 90Footer  H$B/qB 90 Footer CharCJOJPJQJaJPK![Content_Types].xmlN0EH-J@%ǎǢ|ș$زULTB l,3;rØJB+$G]7O٭Vj\{cp/IDg6wZ0s=Dĵw %;r,qlEآyDQ"Q,=c8B,!gxMD&铁M./SAe^QשF½|SˌDإbj|E7C<bʼNpr8fnߧFrI.{1fVԅ$21(t}kJV1/ ÚQL×07#]fVIhcMZ6/Hߏ bW`Gv Ts'BCt!LQ#JxݴyJ] C:= ċ(tRQ;^e1/-/A_Y)^6(p[_&N}njzb\->;nVb*.7p]M|MMM# ud9c47=iV7̪~㦓ødfÕ 5j z'^9J{rJЃ3Ax| FU9…i3Q/B)LʾRPx)04N O'> agYeHj*kblC=hPW!alfpX OAXl:XVZbr Zy4Sw3?WӊhPxzSq]y GZ @gl^!)_06U=DMR[X_fXkpdu} Ih4}~j-> $@+O2)81AIQW~_>holxR8֙۸"M#G  !#&(*,.02579;=?ACEGHJLNPRTVXZ\_bdfiknpsux{} #h)/5;}AGmOTDZE`f\kEpeuz~=і(آ:m=¾*\d0U9pF$(.27=BH+MRVZ`einzsx2}΁[`NJr-&d6G    "$%')+-/13468:<>@BDFIKMOQSUWY[]^`aceghjlmoqrtvwyz|~T # @H 0(  0(  b S  ?C""Vn2 + p^`phH() ^`hH. pp^p`hH. @ @ ^@ `hH. ^`hH. ^`hH. ^`hH. ^`hH. PP^P`hH."Vn"Vn ( ( S) IV8- ) ,Ģ        `lM5l79-/@G@Unknown G.[x Times New Roman5^Symbol3. *Cx Arial7Courier?Z PTimesTimes9=  @ ConsolasC.,*{$ Calibri Light7.*{$ CalibriA$BCambria Math"1hS'S'J`J`%0--B@P $P'92!xx`Sxi NormalHydra Terry Mark MajorMooney, Charles P Oh+'0   @ L Xdlt|'Hydra Terry Mark MajorNormalMooney, Charles P2Microsoft Office Word@@hg4E@hg4EJ` ՜.+,0 hp  '(Center of Genomics and Bioinformatics- Hydra Title  !"#$%&'()*+,-./0123456789:;<=>?@ABCDEFGHIJKLMNOPQRSTUVWXYZ[\]^_`abcdefghijklmnopqrstuvwxyz{|}~      !"#$%&'()*+,-./0123456789:;<=>?@ABCDEFGHIJKLMNOPQRSTUVWXYZ[\]^_`abcdefghijklmnopqrstuvwxyz{|}~Root Entry F9vJE1Table'WordDocument2ZSummaryInformation(DocumentSummaryInformation8CompObjr  F Microsoft Word 97-2003 Document MSWordDocWord.Document.89q