>gnl|ti|647066038 1095898227332  34% to 17A1 35% to 2U1 fugu 33% to 2U1 human

74% to 1095901734433

1097567032902 1096761105127 1096123323522 1096088745900

Combined seq from CN769290 and CN769570 39% to CYP17A

EST = CN769570.1

mate pair of 1096088745900 had partial match to N-term. Walked upstream to

1097672588127; N-term still missing, end of this exon seq not certain

cannot walk upstream any further

 

(1) AFNRNTNSLINSDPGPRFKILRKLASSSLKIYAEGLLGMERIAISEYCELSKKLQSIKEKPVSVHKIM (1)

AGCATTCAACAGAAATACGAACAGCCTCATTAACAGTGATCCAGGCCCGCGTTTTAAAATTTTA

CGAAAGTTAGCATCATCTTCTTTGAAAATTTACGCTGAGGGTTTATTGGGAATGGAAAGA

ATAGCAATCAGTGAATATTGTGAACTGAGTAAAAAGTTACAATCAATAAAAGAAAAACCA

GTATCGGTTCATAAAATAATGGGT
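
The protein line for this exon can be checked against its DNA by translating in the matching reading frame. The following is a minimal sketch, not the method used to produce these notes: the function name `translate` and the `offset` parameter are hypothetical, and the offset of 1 is an assumption chosen only because it reproduces the start of the protein line above.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna, offset=0):
    """Translate DNA starting at `offset`; unknown codons become 'X'."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    codons = (dna[p:p + 3] for p in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(c, "X") for c in codons)


# First 25 bases of the exon DNA above (hypothetical variable name).
exon_start = "AGCATTCAACAGAAATACGAACAGC"
print(translate(exon_start, offset=1))  # AFNRNTNS
```

Trying offsets 0, 1 and 2 on a whole exon and comparing each result with the protein line is a quick way to spot a frameshift or a misplaced exon boundary.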

 

(0) QSTLNIICTILFNHRYEDDNQEFQNIIKYSSLIVQTFNETSYVSS

IPLLRYFPTATSRNIFEIIRLRDPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGE

ELTEKITDDNIEFLLNDFMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRY

VSLKDRPMLHLMQAAIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHH

DESYWKNAMSFYPERWLEKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLK

DYRFEMPTGKELPCLDGRSGITSPPNDFEVVIIPRN*

AGCAAAGCACACTTAACATTA

TTTGTACCATTCTTTTTAATCATCGCTACGAGGATGACAACCAGGAGTTTCAGAATATCA

TAAAATACTCAAGTTTAATCGTTCAAACTTTTAATGAAACCAGTTACGTATCTTCCATTC

CATTGCTGCGCTATTTCCCAACGGCAACGTCGCGAAATATTTTTGAAATCATAAGGCTTC

GTGATCCGATTTTAAAACGAAAACTCCAAGAGCACAGAAAATCTTACGATAAGAATAATT

TACGTGACATAACCGATGCATTAATAAAAGTGTCTTTAGATTCAGAGATGGGTGAAGAAT

TAACTGAAAAGATTACTGATGATAATATTGAGTTTCTTTTAAACGATTTTATGATTGCTG

GATCCGAAACTTCATCAAGTACTATTCTTTGGTTTATTGTTTACATGTTACATTGGCCAG

AATACCAAAATAAACTTTATGATGAAATTACTAAAGTAGCATCAGATAACCGTTATGTAT

CTTTAAAGGATCGACCTATGCTTCATTTAATGCAAGCTGCAATTCATGAAACACTTAGAC

TGTCATCGGTGGTACCTCTTGGTTTGGTTCATAAAGCAATGGAGAACAGTAGCATTTGTG

GCAAGTTTGTTCCTAAGGGAGCTCTTATTTTAACAAATTTATGGAGTATGCATCACGATG

AAAGCTATTGGAAAAATGCAATGAGTTTTTACCCGGAACGTTGGCTGGAAAAATCTGGCG

AGTTCAATTATAAATTGGGGTACGCATATTTACCGTTTTCTAATGGACCTCGTAGTTGTT

TAGGAGAAACATTGGCAAAAACAGAGTTGTTTGTGTTTATTACACGATTACTTAAAGATT

ACCGATTTGAAATGCCAACTGGAAAAGAGTTACCTTGTTTAGATGGTCGTTCTGGAATCA

CCTCCCCTCCTAATGACTTTGAAGTCGTGATAATTCCAAGAAATTAA

 

>complete combined seq CN566859 CN566581 CYP2 clan member [gene 2]

1097326058990

32% to 2X9 aa 26-146 34% to CYP17A

MFLEVIGAVFIPPLIWTIWVYIKHLIDCLHYPRGPIPLPFIGNGYLIRKAEPYKELVNLGKIYG

DVFSFSVGSVRYVIVNSLEGIQEVLVKKGWQFAGRPKGP ()

SWDRSIHGLIQRDPSKKFKILRKLATSSLKIFADGLAGMESKAIEESFQLNKKLLETNGKPFSMQEIT (1)

1097329249233

(1) TLCVLNIICSILFNHRYKEDDLEFQDIIKYSNICFKERGVNNYIISIPWLRY

FPSASSRNLDEMIKIRDPLL

KKKVQEHKRSYDEYNLRDLTDALIKASNSETGQDPDEKVTDDNIVFILN

NFILAGSETSSNTILWFIVYILHWPEYQDKLYDEILKVTSGSRYPCLKDRPSLHLMQAAI

YETLRLSSVAPFGLHHKAMEKSSICGKSI

PKGALIITNLWSIHHDESYWKNAMSFYPERWLENSGEFNSKLGNAYLPFSSGPRSCIGETL 395

394 AKTELFIFISRLINDFRFVKPILEELPRLEGSFGITCTPYDFKVEIVPRSKNLLV* 227

AGATGCAACTGATAATAAATTCGCGATCCGCTATAAAGAAAAAA

GTCCAAGAGCACAAAAGATCGTATGACGAATATAATTTACGCGATCTAACAGATGCTTTA

ATAAAAGCATCAAACTCGGAGACGGGACAAGATCCGGATGAAAAAGTTACTGATGATAAT

ATTGTATTTATCTTAAATAATTTTATACTCGCAGGATCAGAGACTTCATCAAATACGATT

CTTTGGTTCATTGTTTATATTTTACATTGGCCGGAGTATCAAGATAAACTTTATGATGAA

ATTTTAAAAGTAACATCAGGTAGCCGTTACCCTTGCTTAAAAGATCGCCCGTCACTACAT

TTAATGCAAGCTGCAATTTATGAAACACTTAGGTTGTCATCGGTCGCACCTTTTGGTTTA

CATCATAAAGCCATGGAGAAAAGTAGCATTTGTGGAAAATCTATCCCTAAAGGCGCTCTT

ATAATAACCAATCTATGGAGTATACACCATGACGAAAGCTACTGGAAAAATGCAATGAGT

TTTTACCCTGAACGTTGGTTGGAAAGTTCTGGCGAATTTAACTCTAAACTAGGAAATGCG

TATTTACCATTCTCTAGTGGACCTCGTAGCTGTATTGGAGAAACATTAGCAAAAACTGAG

 

>1097329039870 1096703377333 1097664213870

MLFKVIGTILIPPLIWVVWIYIKHLVDCLSYPQGPFPLPFIGNAHLIRNRESYKVF

SEFQKIYGSVFGFSIGSTRYVVVNNLEGVQEVLIKKGSQFAGRPRRA (1)

ATGCTCTTTAAAGTCATTGG

TACAATCTTGGTTCCACCTTTAATATGGGTTGTATGGATTTATATCAAACATCTTGTTGA

CTGCTTGTCCTATCCTCAAGGACCATTTCCTCTCCCATTTATAGGAAATGCTCATTTAAT

AAGAAATAGGGAGTCTTATAAAGTGTTTTCTGAATTTCAGAAGATTTATGGCAGCGTTTT

TGGATTTAGCATTGGCTCAACCAGATATGTGGTTGTAAATAACTTAGAAGGAGTTCAAGA

GGTTTTGATCAAAAAAGGTTCACAGTTTGCAGGCCGCCCAAGACGAGCAAGT

 

>1096703827379 1095896863976

MFPEIVGAIMLPPLIWAAWIYIKHLVDCLVYPRGPFPLPFVGNAYLFSKGKPYKEFVKLG 103

KTYGDVFGFSIGSIRYVVVNSLEGIKKXXXXXXXXXXXXXXXX

ATGTTTCCTGAAATCGTTG

GCGCAATTATGCTTCCTCCCTTGATATGGGCAGCGTGGATTTACATAAAACATCTTGTTG

ACTGTTTAGTTTATCCCCGAGGACCATTTCCACTACCTTTTGTAGGAAATGCATATCTCT

TCAGTAAAGGCAAACCTTATAAAGAATTTGTTAAACTTGGAAAAACTTACGGCGATGTAT

TTGGCTTTAGCATTGGTTCAATACGATATGTAGTCGTGAACAGCTTGGAAGGTATCAAGA

AGT

 

>1095899160393 frameshifted

MFFEVIRAFFTPPLVWIIMVYIKNLIDYLYYPREPIPLPFIGNGDLIRKAEPFKEL

VNLEKKYGDVFSFRIGLVRFVVVSSLEVILEILVKKGWQANGRPKAP (1)

ATGTTTTTTGAAGTTATTCGCGCCTTCTTTACTCCACCTTTGGTATGGATTATAATGGTTTATATAAAA

AATTTAATCGATTATTTGTATTATCCACGAG

AACCGATACCACTACCATTTATTGGAAATGGTGATTTGATAAGAAAAGCAGAACCGTTTA

AAGAGTTGGTTAACCTGGAAAAAAAATATGGCGATGTTTTTAGTTTTAGGATTGGTTTAG

TCAGATTTGTGGTTGTTTCA

AGTTTAGAAGTAATTTTAGAAATACTAGTAAAAAAAGGGTG

GCAGGCAAATGGTCGTCCAAAAGCTCCAAGT

 

 

 

>1097329360095 4 aa diffs to CN566859 from PKG

FYLNNFILAGSETSSNTILWFIVYILHWPEYQDKLYDEILKVTSGSRYPCLKDRPSLHLM

QAAIYETLRLSSVAPFGLHHKAMEKSSICGKSIPKGALIITNLWSIHHDESYWKNAMSFY

PERWLESSGEFNSKLGNAYLPFSSGPRSCIGETLAKTELFIFISRLINDFRFVKPISEEL

PRLDGSFGITCTPYDFKVEIVPRSKNLLF*

TTTTATCTTAATAATTTTATACTTGCAGGATCAGAGACTTCAT

CAAATACGATTCTTTGGTTCATTGTTTATATTTTACATTGGCCGGAGTATCAAGATAAAC

TTTATGATGAAATTTTAAAAGTAACATCAGGTAGCCGTTACCCTTGCTTAAAAGATCGCC

CGTCACTACATTTAATGCAAGCTGCAATTTATGAAACACTTAGGTTGTCATCGGTCGCAC

CTTTTGGTTTACATCATAAAGCCATGGAGAAAAGTAGCATTTGTGGAAAATCTATCCCTA

AAGGCGCTCTTATAATAACCAATCTATGGAGTATACACCATGACGAAAGCTACTGGAAAA

ATGCAATGAGTTTTTACCCTGAACGTTGGTTGGAAAGTTCTGGCGAATTTAACTCTAAAC

TAGGAAATGCGTATTTACCATTCTCTAGTGGACCTCGTAGCTGTATTGGAGAAACATTAG

CAAAAACTGAGTTGTTTATTTTTATATCCCGATTAATAAATGATTTCCGATTTGTAAAAC

CGATATCAGAGGAATTACCGCGTTTAGATGGTAGTTTTGGCATCACTTGTACTCCTTATG

ACTTTAAAGTTGAAATAGTTCCAAGGAGTAAAAATTTACTGTTTTAA

 

>1097509039345 92% identical to 1096064108200, probably joins with 1095898835518

1096625274183  1095900033599  1095896933215 100% match so this similar seq is real

1097206379175 1097678021634

MFLEVAFGVVTPLFLYVIATYLDHLFKCRFYPPGPFPLPIIGNLHLIGKKPHEKFVEYSK 538

KYGEVFSLSFGMHRVVIVSGKDSIREVLVQKSNIFAGRPKNYIANIVSRGYKNIGYGDIG 718

PKWKILRKIAHSSLKNYGESTAHLETLVVRESEELHKNLYKKSNRSTKLEHKF (1)

>gnl|ti|649400787 1095898835518 93% identical to 1096064108200, 39% to 17A1 fugu

35% to 2U1

gnl|ti|647175227 1095898288652 1096602038000

(1) GVAVLNVICSIVFGKRYEYENCEFKEILTYMNYVFTGVAGTNAISFIPWLRFLPLDGLR

KLKKGLSIRDPVLRKQLLYHRETYNESNLRDYTDYVIQFSRDEAILKKFGEQLTDDY

LELLLNDIFIAGTETALTTLLWSIIYLIHWPKFQDKIYNEIVSAIGKNRYPSMKDRNMLP

LVNAALSETLRLSSVTPLGVPHKAMEDTTLLNDLKIPKGTTILTNLWQLHHNKNCWENPH

EFNPYRWFTNDQTLDSIKSMNFLPFSAGTRVCLGKGIAEVELFLFYSRLVRDFKFEVKP

GDSLPSLYGNCGLL*

AGGTGTTGCGGTATTAAATGT

CATTTGCTCTATTGTATTTGGAAAACGCTATGAGTACGAAAATTGTGAATTTAAAGAAAT

CCTAACCTACATGAATTATGTTTTTACTGGTGTAGCTGGTACAAACGCAATTTCTTTTAT

TCCGTGGCTTCGTTTCCTTCCATTAGATGGATTACGAAAATTAAAAAAAGGACTTTCAAT

TAGAGATCCGGTTCTTCGGAAGCAGTTGTTATATCACAGAGAGACCTACAATGAAAGTAA

CCTGCGTGACTATACAGACTATGTCATACAATTTTCAAGAGATGAGGCCATCTTGAAAAA

GTTTGGAGAACAGCTAACTGATGACTACTTAGAGCTTTTACTTAATGATATATTTATAGC

TGGAACTGAAACTGCATTGACAACTTTACTTTGGTCAATTATCTACCTTATTCACTGGCC

AAAGTTTCAAGACAAAATTTACAATGAAATTGTTTCAGCTATTGGTAAAAATAGATATCC

TTCTATGAAAGATCGTAATATGCTGCCTCTTGTTAACGCTGCGTTATCAGAAACATTGCG

GTTATCTTCTGTTACTCCATTAGGAGTACCTCACAAAGCTATGGAAGATACAACTCTCTT

GAATGATTTAAAGATTCCCAAAGGCACCACAATTTTAACGAACCTTTGGCAATTACATCA

CAATAAAAACTGTTGGGAAAATCCACATGAGTTTAATCCATATAGATGGTTTACTAATGA

TCAAACACTTGATTCTATAAAATCTATGAATTTTTTACCTTTTTCTGCTGGTACCAGAGT

GTGTTTAGGAAAGGGTATTGCTGAAGTTGAACTTTTTCTTTTTTACTCAAGGCTGGTTCG

TGATTTTAAGTTTGAAGTAAAACCCGGCGATAGTCTTCCAAGTTTATATGGAAATTGTGG

ATTACTCTAA

 

>gnl|ti|648017453 1095896110991     52   1e-05 35% to 17A1 fugu 34% to 2U1 fugu

gnl|ti|647987527 1095895119635

1096703762277 used this seq to walk upstream past a repeat; could not go further

71% to 1095898227332

(1) ELTTLNIICTILFNQRYEQDDDEFQNIIKYSNLSFKAFSASNLLSSIPWLRYFPTTASKYIQ 707

706 EIERLRDPILKRKLQEHRKSYDENNLRDITDALIKASIHLNAEKDSLIKVTDDNIQFILN 527

526 DLILAGSETSSSTITWFIVYMLHYPEYQDKIFNEVIKVTSGNRYPCLNDRPLLHLLQATI 347

346 HETLRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERW 167

166 LNETGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDFYFEKSVEEDLPRLDSF 374

375 PGVTRSPYDFKVVVVSRS*

 

>gnl|ti|647193621 1095899233960 1096082123583 1097696262164 1096620040714

1097206342731

Combined sequences BP505786 and CB073123 and CB271974 40% to CYP17A [gene 3]

CN570733 same as CN570522 BP505786

50% to 1095898835518 37% to 17A1

(1) VTGVMNVLCGIVFGTQYEENDKELEKVISFKQLILDGVADTFAISFLPWLRFFPSNGLKKVRK

GVLIRDKLLRFQLKKHRETYNPVQIRDYTDYVLKYSKEFETSRNIDEQLSEDNMEMM

LQDIFISGSETTISTLLWFAVYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFES

AMKETLRLSSVIPLGLPHRSLEETSIKKFKIPKNTNVMINLWQLHHDSKSWSDPHTFNPY

RWLNDKNIFDKSKNPNYLPFSTGLRACLGYHTTESIIFLFFTRLIRDFNLCLKPGASTP

SLNGVLRVTLTPDTSYIILKPRSNNLISQKIEA*

AGTTACTGGAGTGATGAACGTTCTTTG

TGGAATTGTTTTTGGTACACAATATGAAGAAAATGATAAAGAACTTGAAAAAGTCATATC

TTTTAAACAGTTAATATTAGATGGAGTAGCAGATACATTCGCAATATCTTTTTTGCCGTG

GTTAAGGTTTTTTCCTTCAAACGGATTAAAGAAAGTACGAAAAGGCGTGTTGATAAGAGA

TAAACTACTTAGGTTTCAATTAAAAAAACATCGAGAAACATACAATCCAGTTCAAATAAG

AGATTACACTGATTACGTACTTAAATACTCAAAAGAGTTCGAAACTTCAAGAAACATAGA

TGAGCAGTTAAGTGAAGATAATATGGAAATGATGCTTCAGGATATTTTCATTAGTGGTAG

CGAAACAACTATATCAACACTTCTTTGGTTTGCTGTTTATTTAGTTAACTGGCCAAAGTA

TCAAGATGATATCTATGATGAAACTATTAAAATAGTCGGTAATGATAGGTATCCTAGTCT

TTCAGATCGTCCAAAGCTTCATTTATTTGAAAGTGCTATGAAAGAAACTCTGCGTTTGTC

GTCTGTCATTCCATTAGGTTTACCTCACAGAAGTCTTGAAGAAACCAGCATAAAAAAATT

TAAAATTCCTAAAAATACAAACGTAATGATTAATCTGTGGCAGTTGCACCATGATAGTAA

ATCTTGGAGTGATCCTCATACATTTAATCCATATAGATGGTTAAATGACAAGAATATCTT

TGACAAAAGCAAAAACCCAAACTATCTTCCATTTTCAACCGGATTAAGAGCCTGCTTAGG

TTATCACACAACCGAATCCATCATTTTTTTGTTTTTTACCCGATTGATAAGAGATTTTAA

TCTTTGTTTGAAACCTGGCGCATCTACTCCAAGTTTAAACGGTGTTTTGCGAGTAACCTT

AACTCCTGATACGTCATACATTATTCTAAC

 

>gnl|ti|648033522 1095897342515  39% to 17A1 N-term

1095899118747 1095900033599 1096071090512  1096703396910  1096608233968  

MFLEIAFGVTAPLLLYVIATYLDHLFKCRFYPP

GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK

SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK

ESEELHKRLFKNCNRSTELEDEF (1)

ATGTTCTTAGAAATTGCTTTTGGAGTAACAGCTCCTCTGCTTTTGTATGTCATTGCAACTTATCTAG

ATCATTTGTTTAAATGCAGATTTTACCCGCCAGGCCCTTTTCCTTTACCGATTATTGGGA

ACTTACATTTGATTGGAAAAAAACCACATGAAAAGTTTGTAGAATATTCAAAAAAGTATG

GAGAAGTATTCAGTCTAAGTTTTGGAATGCATCGTGTTGTTATTGTTTCAGGAAAAGATT

CTATTAGAGAGGTTTTGGTTCAAAAATCAAACATTTTTGCAGGGCGTCCTAAAAACTACA

TTGCTAATATTGTATCTCGTGGTTATAAAAATATTGGCTACGGAGATATTGGACCTAAAT

GGAAAATTTTGAGGAAAATTGCTCACTCTTCTTTAAAAAACTATGGAGAGTCAACTAAAC

ATTTGGAAACGCTTGTCGTAAAAGAAAGCGAAGAGCTACACAAAAGACTTTTTAAAAATT

GTAACAGATCCACAGAGCTAGAAGATGAGTTTGGT

1096064108200 93% to 1095898835518  1097206931796(9 aa diffs)

1097206498632 walked up to 1096081234652 found mate pair 1096071090512

already known N-term seq matches 1095897342515 100%

1095897342515 38% to 17A1 fugu whole seq.

MFLEIAFGVTAPLLLYVIATYLDHLFKCRFYPP

GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK

SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK

ESEELHKRLFKNCNRSTELEDEF (1)

(1) GVAVLNVICFIVFAKRYENKDSEFKKILMYMNYVFSGVASTNFASFIPWLRFFPLDGLR

KLKKGLSIRDPVLRKQLLYHRETYNESNLRDYTDYVIQFSRDEAILKKFGEQLTDDYLEL

LLNDIFIAGTETALTTLLWSIIYLIHWPKFQDEIYNEIVSTIGKDRYPSMKDRNMLPLVN

AALSETLRLSSVTPLGVPHKAMEDTTLLNDLKIPKGTTILTNLWQLHHNENCWENPHEFNPYRWF

TNDQALDSIKSMNFLPFSAGTRVCLGKGIAEVELFLFYSRLVRDFKFEVKPGDSLPSLDG

NYGITLTPRIFTTFVVARNDSLVAQNHSL*

 

 

>gnl|ti|647182814 1095899213949  1095958075467  1095733042694

1097672545497

54% to 1095898835518, 36% to 17A1 36% to 2U1

walked upstream to 1097672406696 which mate pairs to exon 2 below

(1) GVAVLNVICFIVFGERYQYSDPAFIEILTTINNIVSGLSNTTAVDFLPGLRYLQFSEIK 256

257 KLKSSLVIYFRLLNDQLKKHKKTFDENNIRDFTDSIIKFSKDETMENKFEEELTDEHLEH 436

437 VIGDMFIAGSETTLTSLLWLIIYMIHYPKYQEEIFEEITRVIGENRYPQLSDRDSLHLVK 616

617 ASIKECLRLSSIIPLGVPHKTMSDTTLIGYNIPKNTTVIINHWQIHNDTNHWKNPNEFNP 796

797 HRWIDDDSKFDATRATSYLPFSAGTRVCLGKTVAETELFFFFTRLIRDFKFE

GVPGCPLPSLIGKCSITLAPEEFNVHVTPRINSLMFSKNVLPE*

 

>combined seq CN774619 CN775634 CYP2 clan member  [Gene 1]

 

32% to CYP1C1 aa 173-297 29% to 17A2

  2 ESEELHKRLLMKSKTSVDLKTEFGAAIINVICFIVFGERYQYSNSEFKEVLTTINNIV 175

176 DGLSNTTAVGFLPWLRFLPFSPIKKLSISLSKYIRFLNDKLTKHKETFNENKIRDS 343

344 TDSIIN 361

>1096526199166 frame3_ORF1 7aa diffs to CN774619 may be same gene

(1) GAAIINVICFIVFGERYQYSDSEFKEVLTTINDIVDGLSNTTAVGFLPWLRFLPFSPIKKLSIS

LSKYVRFLNDKLKKHKETFDEKKIRDFTDSIINFSNNEAVKQKFKNVDEHLEPVIGDLFI

TGSETTLTSLLWLILYMMHYPKYQQEIFKEITTVIGEDRYPCLNDRDSLHLVKAALKECL

RLSSIVPLGLPHKTTKETVLMGHSIPGNATVMINHWQIHNDTNYWENPNEFNPYRWIGKD

KKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFNFECIPGCPPPSLIGKCN

ITHAPKQFCAYLTPRINNLM*

AGGTGCTGCAATTATAAACGTGATTTGTTTCATTGTTTTTGGGGAAAGATACCAGTATTCAGATTCAGAAT

TTAAAGAAGTTCTTACAACAATAAATGATATAGTCGATGGGTTGTCAAATACAACTGCTG

TTGGATTTTTGCCGTGGTTGAGATTTTTACCGTTTTCTCCAATAAAAAAACTGAGTATTT

CACTTTCAAAATATGTTCGTTTTTTAAACGATAAGTTGAAAAAACATAAGGAAACATTTG

ATGAAAAGAAAATTCGAGATTTTACTGATTCTATTATAAATTTTTCTAATAACGAAGCTG

TCAAACAAAAATTTAAAAACGTTGATGAACATTTAGAGCCTGTGATTGGGGATTTATTTA

TAACGGGTAGTGAGACCACATTAACATCTTTATTGTGGTTAATTCTTTATATGATGCATT

ATCCCAAATATCAACAAGAAATTTTTAAAGAAATTACAACGGTTATTGGTGAAGACCGGT

ACCCATGTTTAAATGACCGTGATTCTTTGCATCTTGTTAAAGCCGCATTAAAAGAGTGTC

TGCGTTTATCTTCAATTGTTCCTCTTGGATTACCACACAAAACAACCAAAGAAACAGTTC

TTATGGGACATAGCATTCCTGGGAATGCAACAGTCATGATTAATCATTGGCAGATTCATA

ACGATACTAACTACTGGGAAAATCCTAACGAATTTAATCCTTATCGGTGGATTGGTAAAG

ATAAGAAATTTGATCCAAGTAAAGCAACAAGTTTTTTACCTTTTTCAGCCGGTACAAGAG

TTTGTTTAGGGAAAACAGTTGCTGAAAATGAACTATTTTTCTTCTTTTCTAGATTAATTC

GAGATTTTAACTTTGAGTGCATACCTGGTTGTCCACCTCCAAGTTTAATTGGTAAATGCA

ATATTACTCATGCTCCAAAACAGTTTTGCGCATACTTGACTCCAAGAATAAACAATCTAATGTAA

 

>whole gene 1095899272864 1096526199166

MWYEIICGLIISILLYIIGSYLMHLLECRKYPLGPFPIPIFGNLHLLGTEPHKILAAYS

KKYGAVFSISLGLQRIVIISDITTTREALVQKASIFAGRPKSYLIQLISSGYKGIAFMDY

GSFWKVLRKVSHSSLKIYGEGHERFEKILTKESEELHKRLLKKSNNSVELKSEF (1)

GAAIINVICFIVFGERYQYSDSEFKEVLTTINDIVDGLSNTTAVGFLPWLRFLPFSPIKKLSIS

LSKYVRFLNDKLKKHKETFDEKKIRDFTDSIINFSNNEAVKQKFKNVDEHLEPVIGDLFI

TGSETTLTSLLWLILYMMHYPKYQQEIFKEITTVIGEDRYPCLNDRDSLHLVKAALKECL

RLSSIVPLGLPHKTTKETVLMGHSIPGNATVMINHWQIHNDTNYWENPNEFNPYRWIGKD

KKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFNFECIPGCPPPSLIGKCN

ITHAPKQFCAYLTPRINNLM*

 

>gnl|ti|655005893 1095958068757     44   0.002 43% to 4V5 fugu 36% to 4T5

gnl|ti|651153924 1095901025079 N-term

gnl|ti|651153911 1095901025066

1097206604076 1097206339312 

complete gene no introns ESTs = CV566433.1 CX054637.1 CV566166.1

MVSVFYILFSGLVFYVVSKILWKLWRNSYGLSSIVTPPNVPFFGTSLYLHSDA

RKFFFQLYDYTRRYGDVFCIWLGPKPVICSSSVKFSEAVLSSQKVITKGFSYDFLHDWLK

TGLLTSTGSKWKTRRRLLTPSFHFSILNNFIKIFEEQASILVDKLAVAADNKEVVDVQVP

IGLATLDIICETSMGVKVNAQSHPDSEYVKAITVLNEEIQMRQKFPWLWFDAIYKLL 568

567 PCGKRFYKALDVAHKLSFDVINERMQMKIQESYCETASDEKKFFLDLLLDIYRKGKI 397

396 DTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQKKLHKEIDEIELNGGSLYDKVR 217

216 QSKYLEIILKESLRMHPPVPMYGRTVEEDMTIDGQFVPKGAQIVLLVLILHSNPDYWEN 40

39  PNDFIPERFEADSYEKRNPYSYVPFSAGPRNCIGQKFAMIEEK

ILLYSIMKNFHLKSMQNENEVFGTLDIIHKSINGINIKFTRR*

ATGGTATCAGTTTTTTATATATTATTTAGTGGACTT

GTTTTCTATGTTGTTAGTAAGATATTGTGGAAGTTATGGAGAAATTCATATGGTTTATCA

TCAATAGTTACACCTCCAAATGTACCATTTTTTGGAACATCTTTGTACTTGCATAGTGAT

GCCCGCAAATTTTTTTTCCAACTATATGACTACACAAGAAGATATGGCGATGTGTTTTGC

ATTTGGTTGGGGCCAAAACCAGTAATATGTTCTTCCTCTGTAAAATTCTCAGAAGCAGTA

TTAAGTAGTCAGAAAGTTATCACCAAAGGATTTTCTTATGATTTTTTGCATGACTGGTTA

AAAACTGGGTTACTTACAAGCACAGGATCAAAATGGAAAACACGTAGAAGGCTACTAACT

CCAAGTTTTCATTTTTCTATACTCAATAACTTTATTAAAATATTCGAAGAGCAAGCATCC

ATTCTGGTGGACAAACTAGCTGTAGCTGCTGACAACAAGGAAGTTGTAGATGTGCAAGTA

CCTATTGGTTTGGCAACCTTGGATATAATCTGCGAAACTTCAATGGGTGTAAAAGTAAAT

GCACAAAGTCATCCAGATTCTGAGTATGTTAAAGCT

ATCACAGTTTTAAATGAAGAAATTCAAATGCGTCAAAAGTTTCCTTGGCTTTG

GTTTGATGCCATTTACAAACTGTTGCCTTGTGGGAAAAGGTTTTATAAGGCTTTAGATGT

TGCTCATAAGCTATCTTTTGATGTAATAAATGAACGCATGCAAATGAAAATTCAAGAATC

TTATTGTGAGACTGCGTCAGATGAAAAGAAATTTTTTTTAGATTTATTGTTAGATATATA

TCGCAAAGGTAAAATTGACACTGAAGGTATTCAAGAAGAAGTTGATACTTTTATGTTTGA

AGGTCATGATACAACTTCAGCTGCACTAGGTTGGACTCTTTGGTTGTTAGGAAAAAATCC

AGATGTTCAAAAAAAGCTGCACAAAGAAATTGATGAGATAGAGTTAAATGGAGGTTCACT

TTATGATAAAGTCAGACAGTCTAAATACCTTGAAATTATTCTTAAAGAATCATTACGAAT

GCATCCTCCTGTCCCTATGTATGGAAGAACAGTTGAGGAAGATATGACTATTGATGGTCA

GTTTGTTCCCAAAGGAGCACAAATAGTTCTTTTAGTTTTAATCTTGCACTCAAACCCTGA

TTATTGGGAAAACCCAAATGATTTTATACCTGAACGT

TTTGAAGCTGATAGTTATGAAAAGCGCAACCC

ATACAGTTATGTACCTTTTTCTGCTGGACCAAGGAATTGCATTGGCCAAAAATTTGCCAT

GATTGAAGAGA

AAATATTACTGTATAGCATAATGAAAAACTTCCATCTTAAGTCAATGCAGAATGAAAATG

AGGTTTTTGGTACTCTTGATATAATTCATAAGTCAATTAATGGAATTAATATAAAGTTCA

CAAGAAGATAA

 

>1096064105622 very similar to 1095958068757 varies at N-term 86%

346 MIYASYLVLVGLFVFFVSKILWKLWKSSYGLETIATPPNIPVFGTSLYLHSDARKFFFQL 525

526 SEFTKKYGTVFCIWLGPKPMIISSSVKFSEAVLSSQKVITKGFSYDFLHDWLKTGLLTST 705

706 GSKWKTRRRLLTPSFHFSILNNFIKIFEEQASILVDKLAVAADNKEVVDVQVPIGLATLD 885

886 IICETSMGVKVNAQSHPDSEYVKAITVLNEEFVMRIKYPWXLWFDVIYKLLPCGKR

34 aa gap between these two seqs

>CV564924.1 EST 93% to 1095958068757

EKKFFLDLLWDIYRKGEIDTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQRKLHKEIDEIE

LNGGSLYDKVRQSKYLENILKESLRMHPPVPMYGRTVEEDMTIDDQFIPKGAQIILLVLM

LHSNPEYWENPNDFMPERFEADSYEKRNPYSYVPFSAGPRNCIGQKFAMIEEKILLYSIM

KNFHLKSMQDENEVFGTVDVIHKSINGINIMFTRR

GAAAAGAAATTTTTTTTAGATTTGTTATGGGATATATATCGAAAAGGTGAAATTGACACTGAAGGTATTCAAGAAGAAGTTGATACTTTTATGTTTGAAGGTCATGATACTACTTCAGCTGCACTAGGTTGGACTCTTTGGTTGTTAGGAAAAAATCCAGACGTTCAAAGGAAGTTGCACAAAGAAATTGATGAAATAGAGTTAAATGGAGGTTCACTTTATGATAAAGTTAGACAGTCTAAATACCTTGAAAATATTCTTAAAGAATCATTACGAATGCATCCTCCTGTCCCTATGTATGGAAGAACAGTTGAGGAAGATATGACTATTGATGATCAGTTTATTCCCAAAGGAGCACAAATTATTCTTTTAGTCCTAATGTTGCATTCGAACCCAGAATATTGGGAAAATCCAAATGATTTCATGCCTGAACGTTTTGAAGCTGATAGTTATGAAAAGCGCAACCCATACAGTTATGTACCTTTTTCTGCTGGACCAAGGAATTGCATTGGCCAAAAATTTGCCATGATTGAAGAGAAAATATTACTGTACAGCATAATGAAAAACTTCCATCTTAAGTCAATGCAGGATGAAAATGAAGTATTTGGGACTGTTGATGTAATCCATAAATCAATTAATGGAATTAATATAATGTTCACCAGAAGAAAAGGAAAAACTTATCTTGTTTAGTTTAGTTCATTATTTATCAGTAATTTGAAATAAT

 

>1096064105622 90% to 1095958068757 varies at N-term

1096071088011 joins CV564924.1 EST

MIYASYLVLVGLFVFFVSKILWKLWKSSYGLETIATPPNIPVFGTSLYLHSDARKFFFQL

SEFTKKYGTVFCIWLGPKPMIISSSVKFSEAVLSSQ

KVITKGFSYDFLHDWLKTGLLTSTGSKWKTRRRLLTPSFHFSILNNFIKIFEEQASILV

DKLAVAADNKEVVDVQVPIGLATLDIICETSMGVKVNAQSHPDSEYVKAITVLNEEFVMR

IKYPWLWFDVIYKLLPCGKRFYKALDVAHKLSFDVINERMQMKIRESYCETASDEKKFFL

DLLLDIYQKGEIDTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQRKLHKEIDEI

ELNGGSLYDKVRQSKYLENILKESLRMHPPVPMYGRTVEEDMTIDNQFIPKGAQIILLVL

MLHSNPEYWENPNDFMPDRFEADSYEKRNPYSYVPFSAGPRNCIGQKFAMIEEKILLYSIM

KNFHLKSMQDENEVFGTVDVIHKSINGINIMFTRR

 

 

>gnl|ti|655009968 1095963046224     42   0.010 46% to CYP20 35% to 27B1

419 DGGIHKFLVENHKRLGPMFSFYWGKELAVSLACPILFKEVATLFNRP 559

 

>gnl|ti|646849327 1095897329284 1097672251908 mate pair =  1097672200068 has exon 2

40% to 2X2 N-term

MLLQITCGFLFPPLIWIVWTYIKHLYDCLSYPQGPIPLPFIGNAHLLRKGEPYKELVNLGKIYGDVFGFSIGSIRYVVVNNLEGIKEVLIKKGSQFAGRPRLKFTI (1)

ATGCTTCTTGAAATTACTTGTGGGGTTCTGTTCCCACC

GTTAATATGGATTGTCTGGACATATATTAAACATCTTTATGATTGTTTGAGTTATCCACA

AGGACCAATACCACTGCCATTTATAGGAAATGCTCATCTTTTAAGAAAAGGTGAACCTTA

CAAGGAATTAGTTAATCTTGGAAAGATATATGGTGATGTTTTTGGATTTAGTATTGGTTC

AATTAGATATGTAGTTGTAAACAATTTAGAAGGTATTAAGGAAGTTTTGATTAAAAAAGG

TTCACAGTTTGCTGGTCGTCCAAGGCTAAAGTTTACTATTAGT

 

Exon 2 1097331043073  1097206900216  1097672200068  mate pair = 1097672251908

This mate pair has 2 aa diffs to 1095897329284

one nuc diff same aa seq

1096124035195 1096041114543 1097329360644 1096625189581

1095958061778 1095898207031

(1) ALSRGMNGLIMSDPSPHFRILRKLASSSLKIYAEGLDGMEKKAINEYSYLHKKLSTMNGKAVSLKRMI (1)

AGCTTTGAGTAGGGGTATGAATGGCCTTATTATGAGTGATCCT

TCACCACATTTTAGAATTTTACGAAAATTAGCATCATCTTCGTTAAAAATTTATGCTGAA

GGATTAGACGGGATGGAAAAAAAAGCTATAAATGAGTACAGTTATTTGCATAAAAAATTA

TCAACAATGAATGGAAAGGCTGTATCTTTAAAAAGAATGATAGGT

 

>1096124019772 related exon 2 5 aa diffs to 1097331043073

1096123858905 1096123680637

(1) ALTRAMNGLIISDPSPHFKILRKLASSSLKLYAEGLDGMEKKAINEYSYLHKKLSTMNGKAVSLKRMI (1)

 

>1097265020030 new N-term weak with frameshifts and a stop codon

no exact matches exist so this may be poor quality sequence

TCGCTYPQKIWNVL

WTDIKHLSDSESYPQGPISLPI

XXXAHIERKGETYREIDRLR*IYGDDIGMCIGTLRYVDVNNLEGIRDVLIYTGTQFL

ACGTGTGGGTGTACGTACCCACAGAAAATATGGAATGTC

TGGACAGATATAAAACATCTCTCAGATAGTGAGAGTTATCCACAAGGACCAATATCACTGCCAATT

GCACATATAGAAAGAAAAGGTGAGACATACAGGGAGATAGATAGACTTAGATAG

ATATATGGTGATGATATAGGTATGTGTATCGGTACACTTAGATATGT

AGATGTAAACAATTTAGAAGGTATTAGGGACGTTTTGATTTACACAGGTACACAGTTTCT

CTGGT

 

>1096110062131  related exon 2 73% to 1095897329284

1097331675401  1097646001099  1096704247756

(1) AWSRALNGLVACDPGPRFKVLRKLASSSLKIYAEGLDGMEKKAADEYSHLNKKLQTMNGKPVSLQNMI (1)

mate pair of 1097646001099 = 1097664041480, continues on 1096703402618

1097329754969 1097664053056

possible frameshift at NDRP_LHL

(1) ELGTLNIICTILFNHRYEEDDKEFQDIIKYSNLTVKIFGGTSILSSIPWLRFLPSASSRSIYE

IVRIRDPLLKKKLQEHKSSFDENNLRDVTDVLIKVSLGSDIAKGSEEKITDENIEFLLND

FIIAGSETSSSTILWFIVYLLHWPEYQDKLYNEIIKVTSGKRYPCLNDRP ?

LHLTQATIHETLRLSSVGPLAIVHKAMENSSICGKPVPKGAFILTNLWSTH

HDESYWKNPMCFYPERWLEKSGEFNSKLGYAFLPFSGGPRSCLGEALARTELFVFFSRLV

TDYRFEKPNGEELPRLNGRFGLTCSPFDFKSVVVPRC*

AGAGTTAGGTACCCTCAACATCATTTGTACTATTTTGTTCAATCATCGATATGAAGAAGAT

GATAAAGAATTTCAGGATATCATCAAATACTCAAATCTGACTGTTAAAATTTTTGGTGGA

ACAAGCATTTTATCTTCTATTCCATGGCTGCGTTTTTTACCATCAGCTTCTTCAAGAAGC

ATATATGAGATAGTAAGAATACGTGATCCACTTTTGAAAAAAAAGCTACAAGAGCACAAG

AGCTCGTTTGATGAGAATAACTTACGTGATGTGACTGATGTATTAATTAAGGTTTCTTTG

GGTTCAGATATTGCAAAAGGTTCCGAAGAAAAAATTACTGACGAAAACATAGAGTTTCTT

TTAAACGATTTCATAATTGCCGGATCAGAAACTTCATCAAGTACAATTCTTTGGTTTATT

GTTTATCTTTTACATTGGCCAGAATACCAAGATAAACTTTATAACGAAATTATAAAAGTT

ACATCAGGTAAGCGTTACCCATGTTTAAACGATCGCCCc

CTTCATTTAACGCAAGCCACAATTCATGAAACACTTCGATTGTCATCAGTAGGTCCTC

TTGCTATAGTTCATAAAGCGATGGAAAACAGTTCCATATGTGGAAAACCAGTTCCCAAAG

GAGCTTTTATACTAACAAATTTATGGAGTACACATCATGATGAAAGTTATTGGAAAAATC

CAATGTGTTTTTATCCAGAACGTTGGTTAGAAAAATCTGGTGAGTTTAATTCTAAGTTAG

GGTATGCATTTTTGCCGTTTTCAGGCGGACCTCGTAGCTGTTTAGGAGAAGCACTTGCAA

GAACAGAGTTGTTTGTCTTTTTTTCACGATTAGTAACAGATTATCGGTTTGAAAAACCAA

ATGGTGAGGAGTTACCGCGTTTGAATGGTCGTTTTGGTCTCACTTGCTCTCCTTTTGACT

TTAAATCGGTGGTTGTTCCAAGATGTTAA

 

>1097206642797 related exon 2 61% to 1095897329284

1096761288099 1096082164704 1097567110690 1097672343044

(1) DWSRTMNSLINNDLNATFKVLRKITSSSLKIYAEGLVGMEKRAIEEYTHLNKKLLSLKGQAVSIKNMI (1)

AGATTGGAGTAGAACAATGAACAGCCTCATCAATAACGACTTAAATGCAACCT

TTAAAGTTTTACGAAAAATAACATCCTCATCATTAAAGATTTATGCGGAAGGATTGGTGG

GAATGGAAAAAAGAGCTATTGAGGAATACACCCACTTAAATAAAAAGCTTTTATCATTGA

AAGGGCAAGCAGTATCTATTAAAAACATGATTGGT

 

>1097206059080 5 aa diffs to 1095898809307 might be the same gene

(1) GPCKPSHIICTILFNHRYDENDQEFQDIIKYSNLSVRASSATSLISSIPWLRFFPSTASR

NIYEIIRLRDPILKRKLQEHRSSYDENNLRDVTDSLIKVSLDSALENNSHEKITDDNIEF

LLNDFIIAGSETSSNTVLWFIVYMLHWPEYQDKLYNEILKITSGNRYPCLSDRPMLHLMQ

AAIHETLRLSSVAPLGVGHKAMESSSICGKPVPKGAFILTNLWSIHHDETHWNNAMSFYP

ERWLEKSGEFNLKLGEAYLPFSSGPRSCLGETLAKIELFVFISRLVKDYRFEKPTEEDLP

NLKGESGITRTPSEFKVMAIPRN*

AGGGCCGTGCAAACCGTCTCACATAATTTGCACAATACTTTTTAATCATCGATATGATG

AAAATGATCAAGAATTTCAAGATATCATAAAATATTCAAATTTGTCTGTTAGAGCATCTA

GTGCAACCAGTCTTATATCTTCTATTCCATGGTTACGGTTTTTTCCTTCAACTGCTTCAA

GAAATATTTATGAAATAATAAGACTTCGTGATCCGATTTTGAAACGGAAACTTCAAGAAC

ACCGAAGTTCTTATGATGAAAATAATTTACGCGATGTGACTGATTCCTTAATAAAAGTCT

CTTTGGATTCAGCATTGGAAAACAATTCACATGAGAAAATCACAGATGATAACATTGAGT

TTCTTTTAAACGATTTTATAATTGCTGGATCAGAAACGTCGTCAAACACTGTTCTTTGGT

TTATTGTTTATATGTTGCATTGGCCAGAATATCAAGATAAACTTTATAATGAAATTTTAA

AGATAACATCCGGAAATCGTTATCCATGTTTAAGCGATCGCCCTATGCTTCATTTGATGC

AAGCTGCAATTCATGAAACACTTAGACTGTCGTCAGTAGCACCTTTGGGTGTAGGTCATA

AAGCAATGGAAAGCAGTAGCATCTGTGGTAAACCTGTTCCAAAGGGTGCTTTTATATTAA

CAAACTTGTGGAGCATACATCACGATGAGACTCATTGGAATAATGCCATGAGTTTTTATC

CAGAACGTTGGCTGGAAAAATCTGGTGAGTTTAATTTGAAACTTGGTGAAGCGTACTTAC

CATTTTCAAGTGGACCGCGTAGTTGTTTGGGAGAAACATTAGCTAAAATTGAATTGTTTG

TATTTATATCACGGTTAGTAAAAGATTATCGGTTTGAAAAACCAACTGAAGAAGACTTAC

CAAACTTAAAAGGTGAATCTGGCATAACTCGCACTCCTTCTGAATTTAAAGTTATGGCTA

TTCCAAGAAATTAA

 

>gnl|ti|649393684 1095898809307 45% to 17A1 C-term. No exact matches

(1) VYLKLGEAYLPFSSGPRSCLGEALAKIELFIFISRLVKDYRFEKPTEEELPNLKGESGITRIPSEFKVMTIPRN*

AGTTTATTTGAAACTTGGTGAAGCGTACTTACCATTTTC

AAGTGGACCGCGTAGTTGTTTGGGAGAAGCATTAGCAAAAATAGAGTTGTTTATATTTAT

ATCACGGTTAGTAAAAGATTATCGGTTTGAAAAACCAACTGAAGAAGAGTTACCAAACTT

AAAAGGTGAATCTGGCATAACTCGCATTCCTTCTGAATTTAAAGTTATGACTATTCCAAGAAATTAA

 

>gnl|ti|646968536 1095898162561 83% to 1095897329284 37% to 2X2 N-term

1096041100060 1097672010393 1096602125478

MILKVIGSIFFPPLIWFVYSYIKHLIECLYYPKGPVPLPFIGNTNLLRKKETCKEFVNLGKIYGDIFGFSIGSIRYVIVNNLEGIHEVLIKKGSQFSGRPRII (1)

ATGATTCTTAAAGTCATTGGTAGCATTTTTTTC

CCGCCTCTTATTTGGTTTGTCTACAGTTACATCAAACATCTTATAGAATGTTTGTACTAT

CCGAAAGGACCAGTTCCTCTACCGTTCATAGGAAATACAAACTTATTAAGAAAAAAGGAA

ACTTGTAAAGAGTTTGTTAATCTTGGGAAGATATATGGTGATATTTTTGGATTCAGCATT

GGTTCTATTAGATATGTAATTGTTAACAACTTAGAAGGTATTCATGAAGTTTTAATTAAA

AAAGGCTCACAATTTTCTGGTCGACCAAGGATTATATGT

 

>1097509072583 new exon 3 boundary wrong

(0) LWSYTCDKESGTNLTVLDDLSNLSFDIVGDVGFGYQFNTITSHSSNEFTSAVRNLTKMQI 694

NASVFSKVLITCFPFLVKFLLLFGKRRNLIQIVYKTLNK (2)

AGCTTTGGTCATATACATGCGATAAAGA

AAGTGGGACAAACCTAACTGTTCTGGATGATTTGTCTAATCTGTCATTCGATATAGTTGG

TGATGTTGGTTTTGGGTACCAATTTAACACAATCACTTCTCATTCTAGTAATGAATTTAC

TTCAGCTGTTCGGAATTTGACTAAAATGCAAATCAATGCTAGTGTGTTCTCAAAAGTTTT

AATAACTTGTTTTCCATTTTTGGTCAAATTCTTGTTATTGTTTGGAAAGCGTAGAAATCT

TATACAGATTGTTTATAAAACTTTGAACAAGT

 

>gnl|ti|648014530 1095896049543 41% to CYP21

LKYLDCVVK

PANTILRTHVSSIHMNETIYPDPHSFKHERFMTG

AGTCCTGCAAATACAATCC

TACGAACTCATGTTAGCAGTATACACATGAATGAAACTATTTATCCAGATCCTCATTCAT

TTAAACATGAAAGGTTTATGACAGGT

>1096082202706 probably the same as 1095896049543 which has errors

1097664076692 1095994179331

(0) NIEVQEKLREDIQKNILDVNNISFEEVMSLKYLDCVVKETLRLHGPAPLLGRRTISATKF

GEYEVPANTILRTHVSSIHMNETIYPDPHSFKPERFMT (1)

AGAACATAGAAGTTCAAGAGAAACTTAGAGAAGATATCCAGAA

AAACATATTGGATGTAAATAATATTTCTTTTGAGGAAGTTATGAGTTTAAAATATTTGGA

TTGTGTCGTTAAAGAAACCTTGCGCTTACATGGACCTGCACCACTTTTAGGCAGAAGGAC

CATTAGTGCAACAAAATTTGGTGAATATGAAGTTCCTGCAAATACAATCCTACGAACTCA

TGTTAGCAGTATACACATGAATGAAACTATTTATCCAGATCCTCATTCATTTAAACCTGA

AAGGTTTATGACAGGT

 

 

>1097309000937 1097206907008 1095901911044

MFLVCLALIVLFIGLFLLCYLLKRTFHPLRLLPSPKEQLITGHNRYFHGRDHTSTYLSFN 858

EKFKEEGLCTLDTLY (1)

ATGTTTCTAGTATGTCTAGCACTCATAGTTTTATTTATTGGATTA

TTTTTACTGTGTTATTTATTAAAACGTACCTTTCACCCTCTTCGACTTTTACCATCACCA

AAAGAACAACTTATTACTGGTCATAATAGGTACTTTCACGGCCGCGACCATACTAGCACC

TATTTGAGTTTCAACGAAAAGTTTAAAGAAGAAGGTTTATGTACGCTAGATACATTATATGGT

 

>1096091465110 88% to 1097331817678

1096625274441 1096123742264 1097265046825 1095964362241

MFLICLALLILSIGLFFLRYLLKRIFHPLQLLPSPKEQLITGHISHFQGRDHSNTFLGF 701

NEKFKEEGLCTLDTLY (1)

 

>1097331817678  1096526275245  1096124165677 1096110023112 1096761988512

1096701884902

walked down from end of 1096526275245

walked farther from end of 1097672563082 ran into a repeat region

MFVICLALITLFIGLFFLRCLLKRIFHPLRLLPSPKEHLITGHISHFQGRDHSNTFLSFNEKFKEE

GLCTLDTLY (1)

ATGTTTGTGATATGTCTAGCACTCATAACTTTGTTTATTGGATTATTTTTCCTGC

GTTGTTTATTAAAACGTATCTTTCACCCTCTTCGATTATTACCATCACCAAAAGAACATC

TCATTACTGGTCATATTAGTCACTTTCAAGGCCGTGACCATTCTAACACCTTTTTGAGCT

TCAACGAAAAATTTAAAGAAGAAGGTTTATGCACGCTAGATACATTATATGGT

 

>1095899139433 1096703930092 1097509100606 1097675339850  new exon 2

VPRYVYLIAPEFIKKIFADGKLFQRPTTLKILAPLIGNSMLGSNYEDHHWQRKLFNGAFT 549

SQQLKNYFPAFLKHTNLLMK

AGTGCCCAGGTATGTTTATCTAATTGCTCCAGAATTCATTAAAAAGATATT

TGCAGATGGGAAACTTTTTCAAAGGCCTACTACATTAAAAATCTTGGCACCATTAATTGG

AAACAGCATGCTTGGTTCAAATTACGAAGACCATCATTGGCAAAGAAAGTTATTCAATGG

AGCATTTACTTCACAACAGCTGAAAAATTATTTTCCTGCATTTTTAAAGCATACTAATTT

GCTAATGAAAGT

 

>new exon 2 1095899339221 1097206043402 1097672369437 possible frameshift/insertion

(1) GFKFIYLLMPEYIKTMVSNGKVFQKSTAMKVIFPLVGNGMLVSNYEHHHWQRKLFNEAFS

AQQLKKYFPAFKEHT DLLIK (0)

AGGGTTCAAATTTATTTACCTTTTAATGCCAGAATATATTAAAACAA

TGGTTTCTAATGGCAAGGTTTTTCAAAAATCGACTGCAATGAAAGTTATATTTCCTCTAG

TTGGCAACGGTATGCTTGTGTCAAATTATGAACATCACCATTGGCAAAGAAAATTATTTA

ATGAAGCATTTTCTGCACAACAGTTAAAAAAATATTTTCCTGCATTTAAAGAGCATACTA

ATAAAAGATTTACTAATAAAAGT

 

>1095964240637 1097516021618 1096705343938 1095900018167  1096607016658  new exon 2

(1) GFRFVDLLLPEFIKTIFSDGKVFHRSNVLKVLFPLVGNGMIVSNYEDHHWQRKVLNEAFT 854

SQQLKNYFPAFTLHTDLLMK (0)

AGGTTTCAGATTTGTTGATCTATTATTGCCAGAATTTATTAAAACAA

TATTTTCTGATGGTAAAGTTTTTCACAGATCGAATGTTTTGAAAGTTTTGTTTCCTCTAG

TTGGAAATGGTATGATTGTATCAAATTATGAAGATCATCATTGGCAAAGAAAAGTTTTAA

ATGAAGCTTTTACCTCCCAACAGCTAAAGAATTATTTTCCTGCTTTTACATTGCATACTG

ATTTGCTAATGAAAGT

 

>1097675832709 new exon 1 with one possible frameshift or there is another exon

MCMVYIAVLILLCLIVFF

ANVLKRFYHPLRNFPSPQENLITGHYSYFYRYDHVKTLLNFGKQFEKNGLYTLDTLN (1)

ATGTGTATGGTTTATATAGCAGTATTGATTTTAT

TATGTTTAATAGTATTCTTTGCTAATGTTTTAAAGCGTTTTTATCATCCGCTTCGTAAT

TTTCCCTCACCTCAAGAAAATTTAATTACAGGCCATTATAGCTATTTTTATCGTTATGAT

CATGTCAAGACTTTGTTAAATTTTGGAAAGCAGTTTGAAAAGAATGGCTTATATACATTA

GATACATTAAATGGT

 

N-terminal EST sequences for hydra P450s

>DN812964.1 ACAC-aac48b12.g1 Hydra EST UCI 7..same as DN812371.1

MYTIGIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFG 280

KQFKERGLYTLDTLN

>DN810769.1 ACAC-aac19b13.g1 Hydra EST UCI 7.. same as DN812371.1

MYTIGIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFG 256

KQFKERGLYTLDTLN

>DN816152.1 ACAC-aac24b14.g1 Hydra EST UCI 7.. same as DN812371.1

IAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFKE 199

RGLYTLDTLN

>CN775805.1 tae77f11.x1 Hydra EST Darmstadt .. same as DN812371.1

IAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFKE 185

RGLYTLDTLN

>BP514308.1 BP514308 Hydra magnipapillata c...have this one

MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFG 208

KEFKDYGLYTINTL

>BP514307.1 BP514307 Hydra magnipapillata c...same as BP514308.1

MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFG 208

KEFKDYGLYTINTL

>BP505238.1 BP505238 Hydra magnipapillata c... same as BP514308.1

MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFG 209

KEFKDYGLYTINTL

>CO509836.1 tai58f02.y1 Hydra EST UCI 5 ALP .. same as BP514308.1

IYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFGKEF 181

KDYGLYTINTL

>DN813094.1 ACAC-aab89g09.g1 Hydra EST UCI 7..= 1097675463974

MFLICLALLILSIGLFFLRYLLKRIFHPLQLLPSPKEQLITGHISHFQGRDHSNTFLGFN 303

EKFKEEGLCTLDTL

>DN603400.1 ACAC-aac10m18.g1 Hydra EST UCI 7..= 1097675463974

same as 1096091465110 DN813094.1 DN137655.1

MFLICLALLILSIGLFFLRYLLKRIFHPLQLLPSPKEQLITGHISHFQGRDHSNTFLGFN 283

EKFKEEGLCTLDTL

>DN137655.1 ACAE-aaa07c04.g1 Hydra EST UCI 5.. ..= 1097675463974

same as 1096091465110 DN813094.1 DN137655.1

LICLALLILSIGLFFLRYLLKRIFHPLQLLPSPKEQLITGHISHFQGRDHSNTFLGFNEK 192

FKEEGLCTLDTL

>CN567598.1 tag12b09.x1 Hydra EST -Kiel 1 Hy..we have this one

LLKRIFHPLRFLPSPKEQLITGHINHFQGRDHSSTYLSFNEKFKEESLCTLDTLH

>CX833403.1 ACAC-aaa40d06.g1 Hydra EST UCI 7..

91% to 1095899139433 exon 2

IIFTVLFW*RTFHPLQLLPSPKEQLITGHNMYFHGRDHTSTYLSFNKQFKK*GLCTQHTLX

VPRYVYLIAPQFITKIFAYGKLFQRPTTLKILAPLIGNSMLGSNYKDHHWQKKLFNGAFT 431

SQQLKNYFPAFLKHTT*LMKHWSYTCDKESGTNLTVLDDLSNLSFNIVGDVGFLGFGYQFTQ

ITSHASNEYTS

 

>1097675877620 new exon 1 with one possible frameshift or there is another exon

MYMICIAAIVILCFL

VLAVMLKRFYYPLCMLPSPKENLFTAHYRYFYGHDHINAFLNFQNQFKDYGLYTLDLLLG (1)

ATGTATATGATTTGCATAGCAGCCATTGTTATATTATGTTTTCTC

GTCCTTGCTGTTATGTTAAAACGTTTTTATTATCCGCTTTGTATGCTTCCATCACCCAAA

GAAAATTTATTTACAGCTCATTATAGATATTTTTATGGTCATGATCATATCAACGCTTTT

TTAAATTTTCAAAACCAGTTTAAAGACTATGGCTTGTATACATTAGATTTATTACTTGGT

 

>1095901177607 new exon 1 only 5 aa diffs from 1097675877620

all three of these sequences seem to have a frameshift at the same position, so this is

probably evidence for another upstream exon rather than a real frameshift

MYMICIAAIVILCFL

VLAVMLKRFYYPLCMLPSPKENLFTAHYRYIYGHDHINAFLYFQNQFKEYGLYTLDIFL

ATGTATATGATTTGCATAGCAGCCATTGTTATATTATGTTTTCTC

GTCCTTGCTGTTATGTTAAAACGTTTTTATTATCCGCT

TTGTATGCTTCCATCACCCAAAGAAAATTTATTTACAGCTCATTATAGATATATTTATGG

TCATGATCATATCAACGCTTTTTTATATTTTCAAAACCAGTTTAAAGAATATGGCTTGTA

TACATTAGATATATTTCTTGGT
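
The "aa diffs" counts in these notes can be re-checked with a simple position-by-position comparison. This is a minimal sketch under stated assumptions: the two sequences are compared without gaps over their shared length, which only makes sense for near-identical exons like this pair, and the helper name `count_diffs` is hypothetical. It is not necessarily how the counts in these notes were obtained.

```python
def count_diffs(a, b):
    """Return (mismatches, positions compared) for an ungapped comparison."""
    n = min(len(a), len(b))  # ignore any overhang on the longer sequence
    diffs = sum(1 for x, y in zip(a[:n], b[:n]) if x != y)
    return diffs, n


# Exon 1 translations copied from the two entries above.
seq_1097675877620 = (
    "MYMICIAAIVILCFL"
    "VLAVMLKRFYYPLCMLPSPKENLFTAHYRYFYGHDHINAFLNFQNQFKDYGLYTLDLLLG"
)
seq_1095901177607 = (
    "MYMICIAAIVILCFL"
    "VLAVMLKRFYYPLCMLPSPKENLFTAHYRYIYGHDHINAFLYFQNQFKEYGLYTLDIFL"
)

diffs, compared = count_diffs(seq_1097675877620, seq_1095901177607)
print(diffs, "aa diffs over", compared, "positions")  # 5 aa diffs over 74 positions
```

For sequences that differ by insertions or deletions, an alignment tool is needed instead, since a single gap would shift every downstream position and inflate the count.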

 

>DN813094.1 ACAC-aab89g09.g1 Hydra EST UCI 7..= 1097675463974

MFLICLALLILSIGLFFLRYLLKRIFHPLQLLPSPKEQLITGHISHFQGRDHSNTFLGFN 303

EKFKEEGLCTLDTLY (1)

ATGTTTCTGATTTGTCTAGCACTTTTAATTTTATCTATTGGATTATTTTTTTTGCGT

TATTTATTAAAACGTATCTTTCACCCTCTTCAACTTTTACCATCACCAAAAGAACAACTC

ATTACTGGTCATATTAGTCACTTTCAAGGCCGCGACCATTCCAACACCTTTTTGGGTTTC

AACGAAAAGTTTAAAGAAGAAGGTTTATGTACGCTAGATACATTATATGTGCCCAGGTAT

 

>1097675463974 mate pair to exon 2 1097675525814

also 1096123961892 1097329363407 1097509311387

1096064006710 1097509202292 1096703858851 1097675514139 1096607047135

walked up to 1096526789565 continued on 1097516017620

(1) VPRYVYLIAPEFIKKIFADGKLFQRATSLKVLAPIIGNSMLTSNYEDHHWQRKLFNGAFT 565

SQQLKNYFPAFLTHTDFLMK (0)

(1) AGTGCCCAGGTATGTTTATCTAATTGCTCCAGAATT

TATAAAAAAGATATTTGCAGATGGAAAATTTTTTCAAAGAGCTACTTCATTAAAGGTTTT

GGCACCTATAATTGGAAATAGCATGCTTACTTCAAATTACGAAGACCATCATTGGCAAAG

AAAGTTATTCAATGGAGCATTCACTTCACAACAGCTAAAAAACTACTTTCCTGCATTTTT

AACGCATACTGATTTTCTAATGAAAGTAAGTTTTGATAATATTTAGTTATAGTTTTGTTG

TTTTTATTATAATAATGCAAAACAAATTCTTTTAGCTTGTAAGTACATGGT
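
Note: the (0), (1), (2) marks beside the peptides read as intron phases, and the genomic fragments start with the AG of the acceptor site. The sketch below shows how a fragment can be translated under that reading. It assumes the phase is the number of bases of the split codon that lie in the upstream exon, so the first 3 - phase bases after the AG complete that codon and are skipped here. The function name translate_exon is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_exon(genomic, start_phase):
    """Translate an exon fragment that begins with the AG acceptor.

    start_phase is 0, 1 or 2. Bases that complete a codon started in the
    upstream exon are skipped, so the residue spanning the intron is not
    part of the returned peptide. Unknown codons are reported as X.
    """
    seq = genomic.upper()
    if not seq.startswith("AG"):
        raise ValueError("expected the fragment to begin with the AG acceptor")
    exon = seq[2:]                   # drop the acceptor dinucleotide
    skip = (3 - start_phase) % 3     # bases finishing the split codon
    body = exon[skip:]
    return "".join(
        CODON_TABLE.get(body[i:i + 3], "X")
        for i in range(0, len(body) - 2, 3)
    )


if __name__ == "__main__":
    # First DNA line of the phase 1 exon 2 of 1097675463974.
    print(translate_exon("AGTGCCCAGGTATGTTTATCTAATTGCTCCAGAATT", 1))
```

With phase 1 the output starts at the P of VPRYVYL, because the V codon is split between exon 1 and exon 2.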

>gnl|ti|647058148 1095898198167 I-helix mate pair = 1095898261914

1095898261914  1097206705284  1096761285028  1096761249195  1097206596155

1097675525814  1097675035030  1097329367631  1097206911388 1097206828844

1097460197276

LWSYTSDKESGTNLTVLDDLSNLSFDIIGDVGFGYQFNTINSHSGNEFT

SAFRYLTELQHNASVFSKVLISCFPFLAQFLLLFGKRRKLIQVVHKTLNK (0)

1097664219266 1097664219266 1096123852196 1097206273255

these have 2 aa diffs 1097331817534 1097206329872 1096526011941

(0) LIEKRKKEIDDGISTEEKDIITIVLKDQQQESSKLTNDLIRDNLLLFLIAGHETTSTAMTWCLYMLGT (0)

AGCTTTGGTCTTATACAAGTGATAAAGAAAGTGGGACAA

ACTTAACTGTTTTGGATGATTTGTCTAATCTGTCCTTTGATATAATTGGTGATGTTGGTT

TTGGCTACCAATTTAACACAATCAACTCTCATTCTGGTAATGAATTTACATCAGCTTTTA

GATATTTGACTGAACTGCAACATAATGCTAGTGTGTTCTCAAAAGTTTTGATAAGTTGTT

TTCCGTTTTTGGCGCAATTTTTGTTATTGTTTGGAAAACGTAGAAAACTTATACAAGTTG

TCCATAAAACTTTGAATAAGGT

1096123183594 1097335028467  mate pair 1097335034435  = 100% match to 1095898198167

1097325031454  1097509311387 

(0) NLEVQEKLREEIQKNILDKKNITFEEILSLKYLDCVVKETLRLHGPAPILGRRNINATKF 957

GEYEVPANTVLRTH VSSLHMNETIYPDPHSFKPERFMT (1)

AGAACTTAGAAGTTCAAGAAAAACTTAGAGAAGAGATCCAGAAAAATATATTGGATAAAAAAAAT

ATTACTTTTGAAGAAATCTTGAGTTTGAAATACTTAGATTGTGTCGTTAAAGAAACCTTG

CGCTTGCATGGACCAGCACCAATTTTAGGCAGAAGAAACATTAATGCAACAAAATTTGGC

GAATATGAAGTTCCTGCCAACACAGTACTACGAACTCAT

EST = DN137322.1 extends to end of gene = 1096071008743 1097672372091

1097509003243 1097329517617

LLFLIAGHETTSTAMTWCLYMLGT NLEVQEKLREEIQKNILDKKNITFEEILSLKYLDCV 579

VKETLRLHGPAPILGRRNINATKFGEYEVPANTVLRTH

VSSLHMNETIYPDPHSFKPERFMT

1096625230620 1096705372268 1097675934526 1097622179423 1097265071715

1097675205779

GEIPATFYLTFGHGIYNCIGKNFALLEIKTFLVKALLQFEFSVDPEHISYKKFIWLTTITAEPLSIRVKPIAD*

GTTAGCAGTCTACACATGAATGAGACTATTTATCCAGATCCTCATTCGTT

TAAACCTGAAAGGTTTATGACAGGCGAAATACCAGCAACATTCTATCTTACTTTTGGGCA

CGGTATATATAACTGTATTGGAAAGAATTTTGCTTTGCTTGAAATCAAAACATTCTTGGT

CAAAGCGTTGTTGCAATTCGAATTTTCTGTTGACCCTGAACATATCAGTTACAAAAAGTT

TATTTGGTTAACTACGATAACAGCAGAACCATTGTCAATTAGAGTAAAACCTATTGCAGATTGA

 

>1096703991752 1096123270489 1096526190394 probably same as 1096625230620

GEIPATFYLTFGHGIYNCIGKNFALLEIKTFLVKALLQFEFSVDPEHISYTKFVWLTTXXXXXXIRVNLIAD*

AGGCGAAATACCAGCAACATTCTAT

CTTACTTTTGGGCACGGTATATATAACTGCATTGGAAAGAATTTTGCTTTGCTTGAAATC

AAAACATTCTTGGTCAAAGCGTTGTTGCAATTCGAATTtTCTGTTGACCCTGAACATATCA

GTTACACGAAGTTTGTTTGGTTAACTACGXXXXXXXXXXXXXXXXXXGTCATTAGAGTAAACC

TAaTTGCAGATTaa

 

>these have 2 aa diffs from 1097675463974,   1097331817534 1097206329872 1096526011941

(0) LIEKRKKEIDDGISTEEKDIITIVLKDQQQESSKLTNDLIRDNLLLFLVAGHETTSNAMTWCLYMLGT (0)

 

>1096071008743 84% to CN567799 1096602116307 1096123983311 1096124057195

886 (0) NLEVQDKLREEILKNILDVNNISFEEVMSLKYLDCVVKETLRLHGPAPILRRRTMNAIKF 707

706 GEYEVPANTVLQTHISSLHMNETIYADPHLFKPERFMT (1) 593
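
Note: the numbers flanking these peptide lines look like nucleotide coordinates on the trace, and descending values suggest a reverse-strand hit. Under that assumption the coordinate span should equal three times the peptide length. The helper names below are hypothetical.

```python
def span_nt(start, end):
    """Number of nucleotides covered by an inclusive coordinate pair."""
    return abs(start - end) + 1


def strand(start, end):
    """'-' when coordinates descend (reverse-strand hit), else '+'."""
    return "-" if start > end else "+"


def span_matches_peptide(start, end, peptide):
    """True when the coordinate span is exactly 3 nt per residue.

    Spaces inside the peptide string are ignored. A mismatch may point to
    a frameshift, a gap in the alignment, or a mistyped coordinate.
    """
    residues = len(peptide.replace(" ", ""))
    return span_nt(start, end) == 3 * residues


if __name__ == "__main__":
    exon5_a = "NLEVQDKLREEILKNILDVNNISFEEVMSLKYLDCVVKETLRLHGPAPILRRRTMNAIKF"
    exon5_b = "GEYEVPANTVLQTHISSLHMNETIYADPHLFKPERFMT"
    print(strand(886, 707), span_matches_peptide(886, 707, exon5_a))
    print(strand(706, 593), span_matches_peptide(706, 593, exon5_b))
```

Both lines of the 1096071008743 entry pass this check: 180 nt for 60 residues and 114 nt for 38 residues.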

 

>gnl|ti|648470985 1095898761545 N-term mate pair = C-term 1095899295538

1097264057439 extends N-term down 1097325864056 joins N and C-terms

681 MFLVYSLLVVIFSYFLIKISWKLWIYSYGLSTVPTPPTIPFFGNCLQLESDSVKFNKQI 854

855 REWSKIYGNVFCVWIGLTPMIYSSSVNFSEAILSSQKVLKKASVYEFLYEWLQTGLLTSTGNK

WKLRRRLLTPSFHFSILNNFLKIFEEQGACLVDKLRIYAKSGGNFDIQVPIGLATLDIICETSM

GVKVNAQSHPDSEYAKAIGILSEEIPKRIKYPWLWPDIIYKHLACGKRYYKALDVAHKLSLDVI

KERVKTLIQNKSEVTSNKNKK

ATGTTTTTGGTGTACAGTCTATTGGTTGTTATTTTTTCATACT

TTTTAATTAAAATATCTTGGAAACTTTGGATTTATTCTTATGGTCTCTCGACTGTTCCAA

CACCTCCAACCATACCATTTTTTGGCAATTGTCTTCAGCTTGAAAGTGATTCTGTAAAGT

TTAACAAACAAATACGCGAGTGGAGCAAAATATACGGAAATGTTTTCTGCGTTTGGATAG

GCCTTACGCCAATGATATACTCATCTTCTGTAAATTTCTCGGAAGCAATCTTAAGCAGTC

AAAAAGTCCTCAAAAAAGCATCTGTTTATGAATTTTTGTATGAATGGCTTCAAACCGGG

TTACTGACAAGCACAGGAAATAAG

TGGAAACTGCGTCGTCGACTTCTTACACCAAGCTTTCATTTTTCTATACTCAATAATT

TTTTAAAAATTTTTGAAGAGCAAGGAGCTTGTTTAGTTGATAAATTACGTATTTATGCCA

AAAGTGGTGGAAATTTCGATATCCAGGTACCTATTGGATTAGCAACTTTAGATATAATAT

GCGAGACATCAATG

GGAGTAAAAGTAAATGCACAGAGTCACCCAGACTCAGA

GTATGCTAAAGCCATCGGTATATTAAGTGAAGAAATACCAAAAAGAATTAAGTACCCATG

GTTATGGCCAGATATTATTTATAAACATCTTGCTTGTGGAAAAAGATATTATAAAGCTCT

AGATGTTGCTCATAAATTATCTCTTGATGTAATAAAAGAAAGAGTTAAAACACTTATTCA

AAATAAAAGCGAGGTTACATCAAATAAAAACAAAAAA

GAATCAGGCTCTGAAAAAAAAAA

ATTTTTTTTAGACTTATTGTTAGATATGCATAAAAAAGGTGAAATTGATACTGAAGGGAT

TCAAGAAGAGGTTGATACTTTTATGTTTGAAGGTCATGATAGCACTTCATCAGCATTAAG

CTGGATGCTGTGGTTGCTAGGAAGATATCCACAAGTTCAACAGAAACTGCATTCAGAAAT

TGATGAAGTGgAA

1095899295538 1096703530556 1096705948493 1096526527227 1096625218937

1097191001062

ESGSEKKKFFLDLLLDMHKKGEIDTEGIQEEVDTFMFEGHDSTSSALSWMLWLLGRYPQVQQKLHSEIDEVE

LTGGSLYEKVRNFKYLENVVKESMRIHPPVPLIGRHIEEDMVIDGQFVPKSSEIVLLVMM

MQSSPEYWKDPYDFIPERFEQEDFVKRNPYIYIPFSAGPRNCIGQKFAMIEEKMLLYIIM

KNFYVQSIQNENEILLALNIIHKSSNGIIMKFTER*

TTAACTGGAGGTTCACTTTATGAAAAAGTAAGAAACT

TTAAATATCTTGAAAACGTTGTAAAAGAAAGTATGCGAATTCACCCACCTGTTCCTTTAA

TTGGCAGGCATATTGAAGAAGACATGGTAATTGATGGTCAGTTTGTTCCTAAAAGTTCTG

AAATTGTTTTACTTGTAATGATGATGCAATCAAGTCCTGAATACTGGAAAGATCCATATG

ATTTCATACCTGAAAGGTTTGAACAAGAAGATTTTGTTAAGCGCAATCCATATATCTATA

TTCCATTTTCAGCAGGTCCAAGAAACTGTATTGGTCAGAAGTTCGCAATGATTGAAGAGA

AAATGCTGTTATATATCATAATGAAAAACTTTTACGTCCAATCCATCCAGAATGAAAATG

AAATACTTCTTGCTCTAAATATTATACATAAATCGAGTAATGGTATCATAATGAAATTCA

CTGAAAGATGA

 

>1096083942127 1097329109827 clearly best match to 4V sequences

MAFILLIFFLLLITLFLIWIYWVRSYNLNFVPSPLRFPLFGCALFLKSESH

ELFKQVRWFFSEFGSAFCLWIGPKPVLMTGNIDHIQTVLKSQKIITKSSSYTFLNE

WLGTGLLTSTGAKWKSRRKVLTKAFHFSIINSYVDSFYQNSVSLSNHLENHSGVPINIQA

LMSLFTLDIICETAMGFKLNSMKNLNCDYVNAVEEVKILLIERQKSPWLWNKFVYKLFSS

GKKFYTQLQVLKSFTKKIVNKRIKNYSLSSNGCKSFLDLLIDAYNQGKIDLEGIYEEVDT

FMFAGHDTTAAALSYIFLMLGTHPKVQKKLHEEIDTNVNINSYENLSEKIRKMEYLDCVI

KESLRLHPPVSVFGRILEDDTIFSNHLVGKGADIVLCPETLHTDPLYWENHRSFIPERFS

NVEFAFCQPYLYIPFSAGPRNCIGQKFALMEIKIAIFVVMSKFIVTAVEQCLSPM

ATFIQRYENGVLMLFEDEKRFLYML*

 

 

>1097329374310 no introns very similar to 1095899295538 seq

1096608398403 1096761840588 1097460256370

67% to 1095958068757 88% to 1095898761545

MFIAYSLLVVVSLYFVIKLFWK

FWIYSYGLSTVPTPPTIPFFGNSLQLESDSVKFNKQLCEWSKIYGNVFCVWVGLR

PTIFSSSVNFSEAILSSQEVLKKASIYEFLHDWLKTGLLTSTGNK

WKLRRRLLTPSFHFSILNNFLKIFEEQGACLVDKLRTYAKSGENFDIQVPIGLATLDIICETSMGVKVNAQSHP

DSAYVKAINILSEEIPRRFKYPWLWPDIIYKHLACGKRYYKALDVAHKLSLDVINERIETLFQNE

NNVTTNKNKEVSSEKKKFFLDLLLDIHKKGEIDTEGIQEEVDTFMFEGHDTTSSALSWIL

WLLGRYPQVQQKLHSEIDEVELTGGSLYEKVRNFKYLENIIKESLRIHPPVPLIGRHIEK

DMVIDGQFIPKKSEIGVLVMMMHSSPEYWKDPYDFIPERFEQEDFVKRNPYIYIPFSAGPRNCIG

QKFAMIEEKMLLYSIMKNFYVQSMQNENEILPSLDLIRKSVNGIILKLTER*

ATGTTTATTGCGTACAGTTTGTTGGTTGTAGTTTCTTTATACTTTGTAATTAAATTATTTTGGAAG

TTTTGGATTTATTCTTATGGTCTCTCGACTGTTCCAACACCTC

CAACCATACCATTTTTTGGGAATTCTCTTCAACTTGAAAGTGATTCTGTTAAGTTTAATA

AACAACTATGCGAGTGGAGCAAAATATACGGAAATGTGTTCTGTGTTTGGGTAGGCCTTA

GGCCAACTATTTTCTCATCTTCTGTAAATTTCTCGGAAGCAATTTTAAGCAGTCAAGAAG

TCCTTAAAAAAGCATCAATTTATGAATTTTTGCATGACTGGCTTAAAACTGGATTACTAA

CAAGCACAGGAAATAAGTGGAAACTGCGTCGTCGACTC

CTTACACCAAGCTTTCATTTTTCTATACTCAATAATTTTTTAAAAATT

TTTGAAGAGCAAGGAGCTTGTTTAGTTGATAAACTACGTACTTATGCCAAAA

GTGGTGAAAATTTTGATATCCAGGTACCTATTGGATTAGCAACTTTAGATATAATATGTG

AGACATCAATGGGAGTAAAAGTAAATGCACAGAGTCACCCAGATTCAGCGTATGTTAAAG

CCATTAATATTTTAAGTGAGGAAATACCAAGGAGATTTAAATACCCATGGTTGTGGCCAG

ATATTATTTATAAACATCTTGCTTGTGGAAAGAGATATTATAAAGCACTAGATGTTGCTC

ACAAATTGTCTCTAGATGTAATAAATGAAAGAATTGAAACACTTTTTCAAAATGAAAACA

ATGTTACCACAAATAAGAACAAAGAAGTTAGCTCAGAAAAAAAAAAGTTTTTTTTAGACC

TACTGTTAGATATACATAAAAAAGGTGAAATTGATACTGAAGGGATTCAAGAAGAGGTTG

ATACTTTTATGTTTGAAGGTCATGATACCACCTCATCAGCATTAAGCTGGATACTTTGGT

TGCTAGGAAGATATCCACAAGTTCAACAGAAACTGCATTCAGAAATTGATGAAGTTGAAT

TAACCGGAGGTTCACTTTATGAAAAAGTAAGAAACTTTAAATATCTAGAAAACATCATAA

AAGAAAGCCTGCGAATTCATCCGCCTGTTCCTTTAATTGGCAGACATATTGAAAAAGATA

TGGTAATTGATGGTCAGTTTATTCCTAAAAAATCTGAAATTGGTGTTCTTGTCATGATGA

TGCATTCAAGTCCTGAATATTGGAAAGATCCATATGATTTCATTCCTGAAAGGTTTGAAC

AAGAAGATTTTGTTAAGCGCAATCCCTATATCTATATTCC

ATTTTCTGCAGGTCCGAGAAATTGTATTGGTCAGAAGTTCGCAATGATTGAAGAGAAAAT

GTTGTTATATAGCATAATGAAAAACTTTTACGTCCAATCCATGCAGAATGAAAATGAAAT

ACTTCCTTCTCTAGATCTTATACGTAAGTCGGTTAATGGTATCATATTAAAACTTACTGA

ACGATAA

 

>1095964281471 1097672357643 1097675038710 1096526281478 1097675573844

MFSNIKMIYTLCIIICGFYFLIKILWMCWKYSYGLTSIATPPNTPFLGTSFYFLSDS

RKSYFQLCNYTKQFGNVFCIWLGPKPMIVSSSVKFLKAVLSSEKITTKGFSYDWIHDWLK

TGLLTSSGPKWKARRKLL

TSSFHFSVFNRLKIIIEEQACILVDKISFAADNKKVVDVQTLIGLATLDVICETIMGVKINAQ 780

SYPDSEYVKAISVLHKEIVNRMKFPWLWFDVIYKLLPCGKRFYKALDVAHKFTFDIINKR 600

MEISVNESYIDTPLEEKSYFLDLLLNIHKKKEIDMEGIQEEVDTF

IFAGHDTISVALSWTLWLLGKYSEIQRKLHKSIDEIELNGGSLFEKVRNFKYLENII

KESMRIHPPVPMYGRTVEENMTIDGQFVPKGAQIILLVLMLHSDPNIWENPKEFIPERFE

TDDWKIKNSYSYLPFSAGSRNCLGQKFAMIEAKMLLYSIM

KKFSLKSMQDENEVYGTVDILHKSINGINILFTRR*

 

 

>gnl|ti|648478468 1095898788708 N-term EST = CV564880.1

1097672125473  1097509103730  1096123847153  1097664004740  1097329293298 

1096092407854  1097325278081  1097675392269  1097672158546 1097206129107

1095899351259

76% to 1097329374310 similar to 4V5

MFLTFMFLFLIYFLIKVFWKLWIYSYGLSTVSTPPTLPLFGNCLQIKSDPVKASKQL

FEWSRVYGKVFCVWVGIRPTIFSSSVNFSEAILSSQKIIQKGFVYNFLHEWLKTGLLTST

GNKWKLRLRLLTPSFHFSILNNFLKIFEEQGNCLIDKFRVLAQNGKYFDIQVPIGLATLD

IICETSMGVKINAQYQPDSEYVTAINILSEEIVRRFKYPWLWPNIFYKHFSCGKRYFKAL

DIAHKLSLNVIHERIQTSLQNESENVLINKLDNKSVLNNEEELGVRKKRFFLDLLLDMHK

KGEIDVDGIQEEVDTFMFEGHDTTSS

AMCWTLWLLGRYPQIQQKLHAEVDEVELTSGSLYEKVRNFKYLE

NVLKESLRLHPPVPLISRYIEEDMMIDGQFIPKKSEIAILVMMIHLNPEYWKDPHSFIPE

RFDQDDFVKRNPYTYIPFSAGPRNCIGQKFAMIEEKMLLYNIMKHFYVESMQNENEILRT

QDLISKSANGIMMKFYER*

ATGTTTTTAACTTTTATGTTTTTGTTTCTTATTTATTTTCTAATTA

AAGTATTTTGGAAGCTTTGGATTTATTCTTATGGCCTGTCAACTGTTTCTACACCTCCCA

CATTACCATTATTTGGCAATTGTCTTCAAATCAAAAGTGATCCTGTAAAAGCCAGCAAAC

AACTATTCGAGTGGAGCAGAGTATACGGAAAAGTGTTTTGTGTTTGGGTTGGCATTCGGC

CAACTATATTCTCATCTTCTGTTAATTTTTCCGAAGCAATTTTAAGCAGTCAAAAAATAA

TTCAAAAAGGATTTGTGTACAATTTTTTGCATGAATGGCTTAAAACTGGTCTACTAACAA

GTACGGGAAATAAGTGGAAATTGCGTCTTCGACTTCTAACGCCAAGCTTTCATTTTTCTA

TACTCAATAACTTTTTAAAAATTTTTGAAGAGCAAGGAAATTGTTTAATTGATAAATTTC

GCGTTCTTGCCCAAAATGGAAAATATTTTGATATTCAGGTGCCTATTGGGTTAGCTACAT

TAGATATAATATGCGAGACGTCAATGGGAGTGAAAATAAACGCGCAGTATCAGCCAGATT

CCGAATATGTTACTGCCATTAACATCTTAAGTGAGGAAATAGTTAGACGGTTTAAGTACC

CGTGGTTGTGGCCAAATATTTTTTATAAGCATTTTTCTTGTGGAAAACGGTACTTTAAAG

CATTAGACATTGCTCATAAACTGTCTCTTAATGTAATTCATGAAAGAATTCAAACTAGTT

TACAAAACGAAAGTGAGAATGTGTTAATCAATAAACTTGACAATAAGAGCGTGTTGAACA

ATGAAGAGGAACTCGGTGTACGTAAAAAGAGGTTTTTCTTAGATTTATTGTTAGACATGC

ATAAAaAAGGTGAAATT

GATGTTGATGGGATTCAAGAGGAGGTGGATACATTTATGTTTGAAGGTCACGACACCACC

TCATCAGCAATGTGTTGGACATTATGGTTGCTGGGAAGATATCCACAAATTCAACAGAAA

CTGCATGCTGAAGTTGATGAAGTTGAACTAACTTCGGGTTCACTATATGAAAAAGTACGA

AACTTTAAATATCTTGAAAATGTTTTAAAAGAAAGCCTGAGACTTCATCCACCAGTTCCC

TTAATCAGTAGGTATATTGAAGAAGATATGATGATTGATGGTCAGTTTATTCCTAAAAAA

TCTGAAATCGCTATTCTTGTGATGATGATACACTTAAATCCTGAGTATTGGAAAGATCCT

CACAGCTTTATACCTGAAAGATTTGATCAAGATGATTTTGTAAAGCGTAATCCATACACT

TACATTCCATTCTCCGCTGGCCCTAGAAATTGCATTGGTCAAAAGTTTGCAATGATAGAA

GAAAAAATGCTGTTATATAACATAATGAAACATTTTTATGTAGAATCCATGCAGAATGAA

AATGAAATTTTAAGAACTCAAGATCTTATAAGTAAATCAGCTAATGGTATCATGATGAAGTTCTATGAAAGATGA

 

>Combined CN627429 CN775805 27% to 4T5 [gene 6]

GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFK

ERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNGIFVSNYEDHH

WQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDK

DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNAVHKALLA

FFPFLMHLSFMYGKRKRAEQVICNTLNM

LINKRKKEIDHRIAADQKDFLTVVLK

DQQKEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK

 

>EST DN812371.1 joins with CN627429 CN775805 and 1095901729505 1097325001902

CN776982 and CN770283 1097206896815 1096110026952 1096110107596

MYTIGIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFKERGLYTLDTLN

GFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNGIFVSNYEDHH

WQRKVLNEAFTLQQLKNYFPAFTVHIDLLMK

LWSYSCDKDNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQL

RFQLNAVHKALLAFFPFLMHLSFMYGKRKRAEQVICNTLNM

LINKRKKEIEDGIAADQKDFLTVVLKDQQKEGSKMTNDLIKDNLMTLLIAGHETTSTAMQWCLYMLGT (0)

NLGVQNKLREEIKKNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTKFGDFDV

PAGSFLRIPIDSAHMNESVYHDPYSFRPERFLTGEIPPLSFLTFGQGIYNCIGKNFALLE

IKTFLVKALLQFEFSVDLKHLNYKKLISITNKTVEPLWIRVKPI*

 

EST matches 1097325001902 1097265052814 for exon 1

  1 tcggcatagc agtattaatt tttttgtgtt tttcactgtt ttttgctaat attttaaaac

 61 gtttttatca tccgcttcgt aagttgccat cacctaaaga aaatttcttt actgctcatt

121 atggctactt taatggctat gatcaaataa atgctgtaat aaattttgga aaacagttta

181 aagagcgtgg cttgtataca ttagatacat taaat ggatt tagatttgtt aatcttttaa

241 tgccagaatt tattaaaaca gtgttttctg atggaaactc attccaaaga tcgaccgcta

301 caaaagttat atttcctcta gttggaaatg gtatttttgt gtcaaattat gaagatcatc

361 attggcaaag aaaagtgtta aatgaagctt ttactttaca acagctaaaa aattattttc

421 cagcttttac agtgcacatt gatttgctaa tgaaactttg gtcatattca tgtgacaagg

481 ataatggtac taacataatt gttttggatg acttatctaa tttatcattt gatataattg

541 gggatgttgg ttttggctat ca

 

>1096526100337 74% to CN776982

1096526100337  1097206278072  1096123494736 

(0) NLDVQNKLREEIKKNVFDIKSILREEVLSIKYLDCVVKETLRMHPPASFISRKNKTETKL 308

GDYDIPAGTFLRISINNVHMNESVYPDPYLFKPERFMT (1)

1095898850029 same as 1096520314506 = mate pair match to 1096526207508

1097206730806

DEIPPSSFLSFGQGIYNCIGKNFALLEIKTFLVKALLHFEVSVDPSHVNYTKQILLTLNTVEPIWIRVKSIEE*

AGAATTTAGATGTTCAAAACAAACTAAGAGAAGAAATAAAGAAAAA

TGTCTTTGATATAAAAAGTATTTTACGGGAAGAAGTTTTAAGCATCAAGTACTTGGATTG

TGTAGTTAAAGAGACATTACGCATGCATCCACCTGCGTCATTTATATCTCGAAAAAACAA

AACTGAAACAAAGTTGGGTGATTATGATATACCTGCTGGCACGTTTTTAAGAATTTCAAT

TAACAACGTACATATGAATGAGTCTGTTTATCCTGATCCTTATTTATTTAAGCCGGAACG

ATTTATGACAGGT

AGATGAAATACCACCATCGTCTTTTCTCTCATTTGGGCAAGGTATTTATAATTGTATTGGAAAGAAT

TTTGCTTTGCTTGAAATTAAAACGTTTTTGGTTAAAGCATTATTACATTTTGAAGTTTCT

GTCGACCCAAGTCATGTGAATTATACAAAACAGATTTTGTTAACTTTAAATACCGTTGAA

CCCATTTGGATAAGAGTGAAATCTATTGAAGAATAA

 

>1097696222067 new exon 6 1097375001145 1097672638278

1096041191032 1097678083218

GEIQPYSYLTFGQGIFNCIGKNFALLEIKTFLVKALLQFEFSVDLEHMNYIKKIFISTKTVEPLWIRVKPI*

AGGTGAAATACAACCAT

ATTCCTACCTCACATTTGGGCAAGGTATTTTTAATTGTATTGGAAAGAATTTTGCTTTGC

TTGAAATTAAAACATTCTTGGTCAAAGCGTTGTTGCAATTCGAATTTTCTGTTGACCTTG

AGCATATGAATTATATAAAGAAAATTTTCATTTCTACTAAAACTGTTGAACCGTTATGGA

TAAGAGTGAAACCTATATAA

 

>1097331770349 new exon six with stop, no other exact matches

DEIPSSSYLTFGYGIYNCIGKNFALLEIKTFLIKAL*QFEFLVDPEQLSYKKQISIST 330

KTAEPLWIRVKSI*

AGATGAAATACCATCTTCATCCTAC

CTTACATTTGGGTATGGTATTTATAACTGTATTGGAAAGAATTTTGCTTTGCTTGAAATT

AAAACATTTTTGATTAAAGCGTTGTAACAATTTGAGTTTTTGGTTGACCCTGAGCAATTA

AGTTATAAAAAGCAGATTTCAATTTCTACTAAAACAGCTGAACCGTTATGGATAAGAGTA

AAGTCTATATAA
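
Note: an exon "with stop" like this one can be flagged automatically by looking for a * that is not the final character of the translated peptide. This is a minimal sketch; the function name internal_stops is hypothetical and it only inspects the peptide text, not the DNA.

```python
def internal_stops(peptide):
    """Return 1-based positions of stop symbols (*) before the last residue.

    A single trailing * is treated as the normal end of the protein and is
    not reported. Internal stops may indicate a pseudogene, a sequencing
    error, or a wrong exon boundary.
    """
    pep = peptide.strip()
    core = pep[:-1] if pep.endswith("*") else pep
    return [i + 1 for i, aa in enumerate(core) if aa == "*"]


if __name__ == "__main__":
    # Peptide of 1097331770349 (both lines joined).
    pep = ("DEIPSSSYLTFGYGIYNCIGKNFALLEIKTFLIKAL*QFEFLVDPEQLSYKKQISIST"
           "KTAEPLWIRVKSI*")
    print(internal_stops(pep))
```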

 

>1097329444796 new exon 6 no other exact matches, most like CN567799

GEIPASFYLPFGHGVYNCIGKNFALLEIKTFLVKALLQFEFSVDPKNINYTKVIWLTTRTVEPLLIRVKPLQPV

AGGCGAAATACCAGCATCGTTCTATCTTCCTT

TTGGACACGGTGTTTATAACTGCATTGGAAAGAATTTTGCTTTGCTTGAAATTAAAACAT

TTTTGGTCAAAGCGTTGTTGCAATTCGAATTTTCTGTCGATCCTAAGAATATAAATTATA

CAAAGGTTATTTGGTTAACTACGAGAACAGTTGAACCATTGCTTATAAGAGTAAAGCCAT

TACAGCCCGTAC

 

>1097325113147 1097690001285 1097942838551

GEIPATFYLPFGHGVYNCIGKNFALLEIKTFLVKALLQFEFSVDPKHANYTKVIWLTAKT 290

TEPLSIRVKPIVD*

AGGCGAAATAC

CAGCAACATTCTATCTTCCTTTTGGGCATGGTGTTTATAACTGTATTGGAAAGAATTTTG

CTTTGCTTGAAATCAAAACATTTTTGGTCAAAGCGTTGTTGCAATTCGAATTTTCTGTTG

ACCCTAAGCATGCAAATTATACAAAGGTTATTTGGCTAACTGCAAAAACAACTGAACCAT

TGTCAATCAGAGTAAAGCCTATTGTAGATTGA

 

>1097206250175 1095899118096

GEVPPFSFLTFGRSNYNCIGKNFVLLDIKAFLVKALLQFKFSVDP 360

MHLNYKKPISITNKAVDPLWIRVKTI*

AGGTGAAGTACCGCCATTTTCCTTTCTAACAT

TTGGGCGAAGTAATTATAATTGTATTGGAAAGAATTTTGTTCTGCTTGACATCAAAGCAT

TCTTGGTCAAAGCGTTATTGCAGTTTAAATTTTCAGTAGACCCTATGCATTTGAATTATA

AGAAGCCGATTTCTATTACTAATAAAGCCGTTGATCCCTTATGGATTAGAGTAAAGACTA

TATAA

 

>1096123749751 boundary is not right, no other exact matches

GETPASLYLPFGHGVYKVIGKNFSLLEIKTLSVKALLQLEKVVDPKNINYSKVIWLTSRT 211

VEPLFIRVKLIVD*

GGTGAAACACCAGCATCGCTTTATCT

TCCTTTTGGACACGGTGTTTATAAAGTCATTGGAAAGAATTTTTCTTTGCTTGAAATTAA

AACATTGTCGGTCAAAGCATTGTTGCAATTAGAAAAGGTTGTCGATCCTAAGAATATAAA

TTATTCAAAGGTTATTTGGTTAACTTCGAGAACAGTTGAACCATTGTTTATAAGAGTAAA

GCTTATTGTAGATTAA

 

>gnl|ti|654999901 1095901768752 87% to 1095901729505

1095901905311 mate pair links to 1095901795880 exon 5

1097331953492 1097664070304 1096761841205 1097675516783 1096602049536

1096761821875

 (0) LINKRKKEIEDGIETGEKDFLTIVLKDQQKEGSKMTNDLIRNNLVTLLIAGHETTSVAMQWCLYILGT (0)

AGCTTATCAACAAACGTAAAAAAGAAATAGAAGATGGAATAGAAACTGGTGAAAAAGATTTTTTAACA

ATTGTTTTAAAAGATCAACAAAAAGAGGGCAGCAAGATGACAAATGATTTGATTAGAAAT

AATCTAGTAACACTTTTAATTGCTGGTCATGAAACAACTTCTGTAGCAATGCAATGGTGC

TTATACATTCTTGGCACAGT

1095901795880 1097491021716  1096123686039 1097672412446

(0) NSDVQNKLREDIKKNVFDIKSITCEEVLSIKYLDCVVKEVLRLHPPVSFIGRINTR

QTNFGEYNVPAGSYLRVPINSAHMNESVYPDPYSFKPERFLT (1)

AGAATTCAGATGTTCAAAACAAGCTACGAGAAGACATAAAGAAAAATGTCTTTGATATAA

AAAGTATTACGTGTGAAGAAGTTTTAAGTATTAAGTATTTAGATTGTGTAGTTAAAGAAG

TGTTGCGCTTGCATCCGCCTGTATCATTTATAGGTAGAATCAACACTAGACAAACAAACT

TTGGTGAATATAATGTACCTGCTGGCTCTTATCTACGAGT

 

>1097206350025 1097675534489 1096602217388 all with frameshift

(0)   LIDKRKKEIEDGIATDEKDLLTIALKDQQKENSKMTX

    NLIRDNLMTFLIAAHETTSTGMQWCLYMLGT (0)

AGCTTATCGACAAACG

AAAAAAAGAAATAGAAGACGGAATAGCAACTGATGAAAAAGATTTATTAACAATCGCTTT

AAAAGATCAGCAAAAAGAAAACAGCAAGATGACCTTNAATTTAATTAGAGATAATCTAATG

ACATTTTTAATTGCTGCTCATGAAACAACTTCTACGGGAATGCAATGGTGTTTGTATATG

CTTGGCACAGT

 

>1097331459342 frameshift and short by 2 aa (pseudogene?)

LIDKRKKEIEDGIATDEKDLLTIALKDQQKENSKMT

NLIRDNLMTFLIAAHETTSTGMQWCLYML

AGCTTATCGACAAACGAAAAAAAGAAATAGAAGACGGAATAGCAACTGATGAA

AAAGATTTATTAACAATCGCTTTAAAAGATCAGCAAAAAGAAAACAGCAAGATGACCTTA

ATTTAATTAGAGATAATCTAATGACATTTTTAATTGCTGCTCATGAAACAACTTCTACGG

GAATGCAATGGTGTTTGTATATGCTG

 

>1097263640455 new exon 5

(0) NLDVQEKLREGIKKNVSDIKNISYEEVLSNKYLDCVVKEALRIHPPRS

AGAATTTAGACGTTCAAGAAAAACTAAGAGAAGGGATAAAGAAGAATGTA

TCTGATATAAAGAATATTTCATATGAAGAGGTTTTAAGTAACAAGTACTTAGATTGTGTA

GTTAAAGAAGCATTGCGCATCCATCCACCGCGCTCCAGCTA

 

>1096526374787 no 100% matches to this seq, best match is 1095901795880

but intron boundaries do not match

this may be a poor quality sequence or pseudogene

(1)EKVINIKYLDCVVKEVLRLHPPVLFIGRINTRQTNLGKYIETAGSNQRVPINNAHMNESVYPDPYSFMPKRLLT (1)

AGAAAAAGTTATAAATATTAAGTATTTAGATTGTGTAGTTAAAGAAGTGTTGCGCTTGCA

TCCGCCTGTATTATTTATAGGTAGAATCAACACTAGACAAACAAACTTAGGTAAATATAT

AGAAACTGCTGGCTCTAATCAACGAGTTCCTATTAACAATGCTCATATGAATGAGTCTGT

TTATCCTGATCCTTATTCATTTATGCCAAAGAGGTTGCTGACAGGT

 

>CN567598.1 tag12b09.x1 Hydra EST -Kiel 1 Hy..

LLKRIFHPLRFLPSPKEQLITGHINHFQGRDHSSTYLSFNEKFKEESLCTLDTLH

 

>Combined N and C-terms CN567799 CN567598 tag12b09.x1 [gene 4] N-term

N-term has an extension

1096761916754 probably 1095964418219 (poor quality seq)

RVRFLLRYLLKRIFHPLRFLPSPKEQLITGHINHFQGRDHSSTYLSFNEKFKEESLCTLDTLH (1)

VPRFVYLIAPEFIKKIFADGKLFQRSKSIRTLAPLIGNSMVGSNYEHHHWQRKLFNG

AFTSQQLKNYFPAFLKHTNLLMK

(0) LWSYTCDKESGTNLTVLDDLSNLSF

CN567598 part of three exons

  1 cacgcgtccg atttttactg cgttatctat taaagcgcat ctttcaccct cttcgatttt

 61 taccatcacc aaaagaacaa ctcattactg gtcatattaa tcactttcaa ggccgcgacc

121 attctagcac ctatttgagt ttcaacgaaa agtttaaaga agaaagttta tgcacgctag

181 atacattaca t

     gtgcccagg tttgtttatc taattgctcc agagtttatt aaaaagatat

241 ttgcagatgg aaaacttttt caaaggtcaa aatcaataag aactttggcc cctttaattg

301 gaaacagcat ggttggttca aattacgaac accatcattg gcaaagaaag ttattcaatg

361 gagctttcac ttcacaacaa ctgaaaaatt attttccagc atttttaaaa catactaatt

421 tgcttatgaa g

     ctttggtca tatacatgtg ataaagaaag tgggacaaat ttaactgttt

481 tggatgattt gtctaatctg tcatttg

DIVGDVGFGYHFNTITSHSGNEVTKAFQKY

CQLRHSLHPFYKALFAYFPFLMRLSFMFGKHKKAEQVISYTXXX (0)

AGCTTTGGTCATATACATGCGATAAAGAAAGTGGTACCAACATAATTGTTTT

GGATGATTTGTCTAATCTATCATTTGATATAGTTGGTGATGTTGGTTTCGGCTATCATTT

TAACACCATAACTTCTCATTCCGGTAATGAAGTTACAAAAGCCTTCCAAAAGTATTGTCA

ACTACGACATAGCTTGCATCCCTTTTATAAAGCTTTATTTGCTTATTTTCCATTTTTAAT

GCGTCTATCATTCATGTTTGGAAAACATAAAAAAGCTGAGCAAGTTATAAGTTATACT

 

>1096081231152 new exon 3

IWSYTCDKENGTKIIVLDDLSNLSLDIIGDVGYGYQFNTLTSHSGNEFTKAFQSYCQLQY 135

NIKPIYKALSAFFPFLMGLSIMFGKRKKTEEILRNNLNM

AGATTTGGTCATATACATGTGATAAAGAAAATGGTACCAAAATAATTGTTTT

AGATGACTTGTCTAATTTATCACTTGATATAATTGGTGATGTTGGTTATGGCTATCAATT

TAACACCTTAACTTCTCATTCTGGTAATGAATTTACAAAGGCTTTTCAAAGTTATTGTCA

ACTACAATATAACATAAAGCCAATCTATAAAGCTCTATCAGCTTTTTTTCCTTTCCTAAT

GGGGCTGTCAATCATGTTTGGAAAACGAAAGAAAACAGAGGAAATTTTACGTAATAATCT

AAACATGGT

 

>1095898814465 new exon 2 mate pair to 1095899110069

1095993318769  1095963238168  1096704141526  1095899349182  1097664110168 

1097622218011

(1) VPRYVYLIAPEFIKKIFADGKLFQRTTSIRIMAPSIGNSMLSSNYEDHHWQRKLFNGAFT 471

SQQLKNYFPSFLTHTNLLMK (0)

AGTGCCCAGGTATGTTTATCTAATTGCTCCAGAATTTATTAAAAAAATATTTGCTGATGGCAAACTTTTT

CAAAGAACTACTTCAATTAGAATTATGGCACCTTCAATTGGAAACAGCATGCTTAGTTCA

AATTACGAAGACCATCATTGGCAAAGAAAATTATTCAATGGAGCATTCACTTCACAACAG

CTAAAAAACTATTTTCCTTCATTTTTAACGCATACTAATTTACTGATGAAAGT

1097567103129 1097675494277 1095899110069

(0) IWSYTCDKESGTNLTVLDDLSNLSFDIIGDVGFGYQFNTITSHSRNEFTSAIRYLAEIQL 657

NASVFLKVLISYFPFLIQLLVMFGKRRKFIQIVRKTLNK (0)

AGATTTGGTCTTATACATGTGATAAAGAAAGTGGGACAAACT

TAACTGTTTTGGATGATTTGTCTAATCTGTCATTTGATATAATCGGTGATGTTGGTTTTG

GTTACCAATTTAACACAATTACATCTCATTCTCGTAATGAATTTACTTCAGCTATTCGGT

ATTTGGCTGAAATTCAACTCAATGCTAGTGTGTTCTTAAAAGTTTTAATAAGTTATTTTC

CATTTTTAATTCAATTGTTGGTAATGTTTGGGAAGCGTAGAAAATTTATACAGATTGTCC

GTAAAACATTGAACAAGGT

 

39% to 3A27 trout aa 307-472 58% to CN770283 [gene 4]

683 FFIAGYETISTTLTLCLYMLAINLEVQEKLREEIQKNKLDVNNISFEEVTSLKYLDCVVK 504

503 ETLRLHGLAPVLGRETINAIKFGEYEIPANTVLQTHVSNLHMNETIYRDPHSFKPERFMT 324

323 GEIPASFYLPFGHGVYNCIGKNFALLEIKTFLVKALLQFKFSIDPMHINYTK 168

167 IIWLTMRTVEPLLIRVKPIAE* 102

TTTTTCATAGCTGGTTATGAAACAATTTCTACTACTTTGACTTTGTGTTTATATATGCTA

GCCATTAACTTAGAGGTTCAAGAGAAACTTAGAGAAGAGATTCAGAAAAATAAATTGGAT

GTAAATAATATTTCTTTTGAAGAAGTTACGAGTTTAAAATATTTGGATTGTGTCGTTAAA

GAAACCTTGCGCTTGCATGGACTTGCACCAGTTTTAGGCAGAGAGACCATTAATGCAATA

AAATTTGGCGAATATGAAATTCCTGCAAACACAGTACTTCAAACTCATGTTAGCAATCTA

CACATGAATGAGACTATTTATCGAGATCCTCATTCATTTAAACCTGAAAGGTTTATGACA

GGGGAAATACCAGCATCATTCTATCTTCCTTTTGGGCACGGTGTTTATAACTGTATTGGA

AAGAACTTTGCTTTGCTTGAAATTAAAACATTCTTGGTCAAAGCGTTGTTGCAATTCAAA

TTTTCTATTGACCCTATGCATATAAATTATACAAAGATTATTTGGTTAACTATGAGAACA

GTTGAACCATTGCTAATTAGAGTAAAACCTATTGCAGAATAA

 

>gnl|ti|648014530 1095896049543 41% to CYP21

LKYLDCVVKETLRLHGXXXXXXXXXXXXX

KFGEYEVPANTILRTHVSSIHMNETIYPDPHSFKHERFMTG

GTTTAAAATATTTGGATTGTGTCGTAAAG

GAAACCTTGCGCTTACATGGA

AAATTTGGTGAATATGAAGTCCCTGCAAATACAATCC

TACGAACTCATGTTAGCAGTATACACATGAATGAAACTATTTATCCAGATCCTCATTCAT

TTAAACATGAAAGGTTTATGACAGGT

 

>1096082202706 probably the same as 1095896049543 which has errors

1095994179331

(0) NIEVQEKLREDIQKNILDVNNISFEEVMSLKYLDCVVKETLRLHGPAPLLGRRTISATKF

GEYEVPANTILRTHVSSIHMNETIYPDPHSFKPERFMT (1)

AGAACATAGAAGTTCAAGAGAAACTTAGAGAAGATATCCAGAA

AAACATATTGGATGTAAATAATATTTCTTTTGAGGAAGTTATGAGTTTAAAATATTTGGA

TTGTGTCGTTAAAGAAACCTTGCGCTTACATGGACCTGCACCACTTTTAGGCAGAAGGAC

CATTAGTGCAACAAAATTTGGTGAATATGAAGTTCCTGCAAATACAATCCTACGAACTCA

TGTTAGCAGTATACACATGAATGAAACTATTTATCCAGATCCTCATTCATTTAAACCTGA

AAGGTTTATGACAGGT

 

>1096526207508 with frameshifts probably same seq as 1096526100337

mate pair = 1096520314506 same as 1095898850029

(0) NLDVQNKLREEIKKNVFDIKSILR

EEVLSIKYL

DCVVKETLRMHPPASFISRKNKTETKLGDYDLPAGTFLRISINNVHMNESVLSWIPYLFKPER

AGAATTTAGATGTTCAAAACAAACTAAGAGAAGAAATAAAGAAAAATGTCTTTGATATAAAAAGTATT TTACGG

GAAGAAGTTTTAAGCATCAAGTACTT

GATTGTGTAGTTAAAGAGACATT

ACGCATGCATCCACCTGCGTCATTTATATCTCGAAAAAACAAAACTGAAACAAAGTTGGG

TGATTATGATCTACCTGCTGGCACGTTTTTAAGAATTTCAATTAACAACGTACATATGAA

TGAGTCTGTTTTATCCTGGATCCCTTATTTATTTAAGCCCGAACGAA

 

>BP514308  N-term 25% to 46a [gene 9]

1096761991009 1096082187152 1096123591182 1095899045709 1097383004013

MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFGKEFKDYGLYTINTLI (1)

ATGTATTCGATATACATAGCGATTATAATAGTTC

CTTTAGTTTTTTTCGTTGCTGTTTTTTTTAAACGTTTTTATCATCAATTTCGTTTGTTGC

CATCACCCAAAGAGAGTTTAATTACATGTCACTATAGTTATTTTGATGTTCATGACCATG

TTAACACTCTGTTAAACTTTGGTAAAGAGTTTAAAGATTACGGATTATATACAATTAATA

CTTTAATTGGT

1096124094276 1095964247544 1095901005745 1095899045709

GPRQVHLLLPHFIKTVIADGKFFQRSPVFKAVFPLVGNSMIVSNYEDHHWQRKLF

NQAFTSQQLKRYFLAFTLHTDLLMK (0)

AGGACCCAGACAAGTTCATCTTTTATTGC

CACATTTCATTAAAACAGTAATTGCAGATGGAAAGTTTTTTCAAAGATCACCAGTTTTTA

AAGCCGTATTTCCTCTTGTTGGAAACAGTATGATCGTTTCTAATTATGAAGATCATCATT

GGCAAAGAAAATTATTTAATCAAGCCTTTACTTCGCAACAATTAAAAAGATATTTTTTAG

CTTTTACTCTGCATACTGATTTGCTAATGAAGGT

LWSCTCDKENGTNLNVWSDLSNLSFDIIGDVGFGYQFNTITSHSGNAFTKALRSY

INLRFNSSVVHNVLIAYFPFLMRFLSKFGNLNKAEQ (0)

1095958061820 82% to 1095901729505

1095964290917 1096124019775 1096528662475 1096159548758 1097622041589, 1097672472615

1096064134288 mate pair = 1096041094868 exon 3,

(0) LIDKRKKEIENGLVKEEKDFLSIVLKDQQQEKSKLTNDLIRDNLMTLLIAGHETTSTAMLWCLYTLGT (0)

AGCTTATCGATAAGCGTAAAAAAGAAATAGAAAATGGATTAGTAAAAGAAGA

GAAAGATTTTTTATCAATTGTTTTAAAAGATCAACAACAAGAAAAGAGCAAACTGACAAA

TGATTTGATTAGAGATAATTTAATGACGCTTTTAATTGCTGGTCATGAAACTACTTCTAC

TGCAATGCTGTGGTGTTTATACACATTAGGAACAGT

run into seq gap downstream

 

>gnl|ti|655009968 1095963046224   KYG region 46% to CYP20 35% to 27B1

(2) LGNLGSLTFDGGIHKFLVENHKRLGPMFSFYWGKELAVSLACPILFKEVATLFNRP (1)

AGATTGGGAAATCTTGGCTCTCTAACATTTGA

TGGTGGAATTCACAAGTTTCTTGTTGAAAACCATAAAAGGCTTGGTCCAATGTTCAGCTT

TTATTGGGGCAAAGAACTGGCTGTTAGTCTAGCTTGTCCAATTCTTTTTAAGGAGGTT

GCCACTCTATTTAATCGACCAGGT

 

>1097263613070 mate pair to 1097206643989 I-helix/J helix boundary?

1097329235455  1097672289528  1096705876537  1096110072452 

LTWLVYFLCKHPEVESKVYNEIKEFTEKDLDMELLTKFS (2)

AGTATTGACATGGCTTGTTTATTTCTTATGTAAACATCCAGAAGTGGAATCTAAGGTATACA

ATGAGATAAAAGAATTTACAGAAAAAGATCTAGATATGGAATTACTTACAAAATTTAGGT

>BP508840 Best match in Fugu, human and Ciona is CYP20

1096602177777 1097264059772  1096123966865  1097672368420 

1097206643989  1096110028119 

(2) YTKQVIDEVMRIAVLAPYAARYSDYDIIVDGHLIPKK (0)

(0) TPIILALGTVFQDETIFPEPDR (2)

(2) FDPDRFSDKQIEERSALAFQPFGFAGKRKCPGYRLAYAETLTYTFYIIKNFHISL

    FDKQSVKMHYGFVTKPSEEIWIKVLRRKNI*

 

1096111030955

AGTTACACAAAGCAAGTTATTGATGAAGTTATGAGAATTGCTGTACTTGCTCCATATG

CAGCTAGATATTCGGATTATGATATAATCGTTGATGGACATCTAATACCAAAAAAGGT

 

AGATTTGACCCTGACCGTTTTAGTGATAAACAAATTGAAGAACG

TTCAGCGTTAGCTTTTCAGCCGTTTGGATTTGCGGGAAAAAGAAAGTGTCCTGGATATAG

ATTAGCATATGCTGAAACATTGACGTACACATTTTATATCATCAAGAATTTTCATATTTC

GCTATTCGATAAGCAATCTGTGAAAATGCATTATGGTTTTGTCACAAAACCGTCTGAAGA

AATTTGGATTAAAGTGTTACGACGTAAAAATATCTAG

 

>CYP20 amphioxus 39% to CYP20 Danio

    MLDYAIFAITFVVFLIATVLYLYP (0)

(0) GANKITTIPGLEPSDPK (2)

(2) DGNLGDVGRAGSLHEFLLKLHTEYGDIASFWWGQQLVVSLGAPELWKQH ERIFDRP (1)

(1) ALLFKGFEPLIGAKSIQYANSVDGRTRRKLYDPSYGHNAMKHYYSIFQE (0)

(0) LGQEMAKKWESMKGDQHIPLHAHIIALAMKAITRSSFGDAFKDEKECVQFGRNYDI (0)

(0) CWNDMEERIKGSHPTEGSPREKKFKE (1)

(1) ALGKLHATIARVAKYRRENPSPPQEQLFIDVLIEGNLPEEQ (0)

(0) VLCDAMTFTVGGIHTSGN (1)

(1) LLTWALYYIATHEEVEEKLHQELSDVLGKKGEVTPDNISQLV (2)

(2) YLRQVLDESLRCAVIAPWGARYMDLDAEVGGHIVPAK (0)

(0) QTPVIHAFGVVLQDERIWPEPNK (2)

    FDPDRFDAENSKGRHKLAFQPFGFAGGRKCP (1)

(1) GYRFTYTWTSVFLSILCRQFKLHLVDGQVVKPCHGLVTRPVDEIWITVTKRD*

 

1096111030955

ATCTCCATCAGTTAATTGCATAGATGCGAGAATGCATGTTAAGTGCATGTAACTCTGAATATTAGCGACAAAGTTTAGTATCACTATCTATGATAGTATTTTTAGTATTTATATGATTCATATTTTTCAGCATAACTCTAATAATAAATATTAATCTAAATTTTAATGCTTTTTTTTTTAAATGATTGAATATTTTTAAAACATGTATTAAATAGTTTATTAACTAAATTTTTAAAGTATTTTTAAGTACTAATAAATTTAAAAAATAAAAAAAGATGTTTATGTTTCAAAGTTTATATTTCAGTTACACAAAGCAAGTTATTGATGAAGTTATGAGAATTGCTGTACTTGCTCCATATGCAGCTAGATATTCGGATTATGATATAATCGTTGATGGACATCTAATACCAAAAAAGGTTTATACTAATATTA

 

 

 

>gnl|ti|646862798 1095898098005     41   0.018 35% to 17A1 34% to 2P4

gnl|ti|647168675 1095899196297

1097622027233 1096703988838 1097329365089 1097329154279

42% to 1095898227332

 

MLVFQQLIFAVLVPAFLYFVFSYLQHLWICSKYPKGLFPLPLLGNIH

QLGKNSSQTFSSLTKIYGDIFSVSIGTQRLVILNSMESIHEALLTKGSTFGGRPTEF

TSNVFTKGYKNLSHTDYGPNLKALRKVIHLSVQKYAGGLTRQEQMITFERDELCKKLFN

TEKEIALRCEI (1)

ATGCTTGTTTTTCAACAATTAATATTCGCCG

TACTTGTTCCGGCTTTTTTATATTTTGTTTTTTCTTATTTGCAACATTTATGGATTTGTA

GTAAGTACCCAAAGGGTCTGTTTCCATTACCGTTGTTAGGAAACATTCATCAATTAGGTA

AAAACTCTTCTCAAACATTTTCATCTTTAACAAAAATTTATGGAGATATATTTAGTGTGA

GTATTGGTACCCAGCGACTCGTTATACTCAATAGTATGGAAAGCATACATGAAGCTTTGTTAACCAAAGGTTCAACTT

TTGGTGGTAGACCAACTGAgGTTACGTCAAATGTTTTTACAAAAGGATATAAAAACTTATC

GCACACTGATTATGGACCGAATTTAAAAGCGTTGCGAAAAGTTATTCATCTTTCCGTTCA

AAAATATGCTGGCGGACTAACGAGACAAGAACAGATGATAACTTTTGAAAGAGACGAACT

TTGTAAAAAACTTTTTAATACTGAAAAGGAAATAGCTTTACGTTGTGAAATTGGT

(1) DFCTVNVMSGYLFNERFLNQNSEFKDVVKSIQLLLDNSGITDKTTFIHWLRYLPLREWN

793 EIKQARLVLNPWVEKKVEDHWRKYNENEIINVTDSMIQHFLTKYDGLDTDFAKKYITLL 617

616 LIELLVAGTETTAITICWMVLYLIHNPEYQEEIYKEITLNIGCRYPTLAEKNLFPLL 446

445 QAFIQETLRITSVVPLNLAHKALKDTSICGKIIPKDAIVITNLWNLHHDNRYFKNPNEFD 266

265 PKRWINENGLFDSISQKYFKPFSAGARVCLGETLAKNQLFLIISGLIMNFIFTSAPGKD 89

88  LPSLEGQFGITFRPNSFKVL* 29

AGATTTTTGCACTGTAAATGTAATGTCGGGGTATTTATTCAATGAACGCTTTCTGAACCAAAATTCCG

AGTTTAAAGATGTCGTAAAAAGTATTCAACTTTTGCTAGATAACTCTGGAATTACAGATA

AAACCACGTTCATACATTGGCTTCGTTACTTGCCATTGCGGGAATGGAATGAAATAAAAC

AAGCGAGACTTGTCTTAAACCCGTGGGTCGAAAAAAAGGTTGAAGATCATTGGAGAAAGT

ATAATGAAAATGAAATCATTAATGTAACTGATAGCATGATTCAACATTTTTTAACAAAGT

ACGATGGTTTAGACACTGATTTTGCAAAGAAATACATTACCTTATTATTGATCGAATTAC

TTGTTGCCGGTACCGAAACGACAGCTATTACTATTTGCTGGATGGTTTTATATCTAATAC

ATAACCCTGAGTATCAAGAAGAAATTTATAAAGAAATTACATTAAATATTGGTTGTAGAT

TGCGAATAACATCTGTTGTGCCACTAAACTTGGCTCACAAAGCATTAAAAGATACCAGCA

TTTGTGGAAAAATTATTCCTAAAGACGCTATAGTAATTACAAATTTATGGAATCTTCATC

ACGACAACAGATACTTTAAAAATCCTAATGAATTTGATCCTAAACGCTGGATAAACGAAA

ATGGTCTATTTGACTCAATTTCTCAAAAATATTTTAAACCTTTTTCGGCTGGAGCGAGAG

TATGTCTTGGCGAGACATTAGCCAAAAATCAACTTTTTTTAATCATCTCCGGTCTAATTA

TGAATTTTATTTTCACATCTGCACCAGGAAAAGACTTACCTAGTCTTGAAGGACAATTTG

GAATCACATTCCGTCCCAATAGTTTTAAGGTTTTATAA

 

>gnl|ti|651477674 1095901303788     39  0.11 39% to CYP21 39% to 2R1 40% to 2P4

49% to 1095898227332

1096703646566 mate pair = 1096703498438 = N-terminal exon

1097675091467

MFLFVVFEVVFGLIIPVLLYVI

VVYIYHIWECQRYPPGPFPLPVIGNYNLLANDPVKALCDLEIIYGDVFSLSLGTVR

VVVVSSHESIYDVLVGDGSNFSGRPREYSSLLFTGGFENLSHMDNNPLTKKIRKVFYSKL

KTNGSILAHNENIVKHESELLHQRLLQNEGSVTNLRYEI (1)

(1) DLCIVNSICSIIFGNRLSDTCEVHEILKATRLLLKNLSNIEIMHYLPWM

RFFLLKKQNEISESRNICKFWIQTQLHKRKKSLKNENISDILLNLWDQQKQENP

NEEQYRMILVELVMAGSETTAATITWLIFYLLHWPHYQSILYKEIKNVCGDQYPTFNDIKSMPIMQATILE

TLRLSSVVPLSLSHKAVNNAKINKFTIPKDTIIITNLWGVHHNEKYWEKPFEFNPMRWLD

KNGELSTAKRLGYFPFSAGPRGCIGESFARMQMFIICSRLIKDFSFELPQSGETPKLDGD

IGITLTPLPYNAVAKQRT*

ATGTTTCTTTTTGTAGTTTTTGAAGTTGTATTTGGGCTGATAATTCCCGTTTTACTTTACGTAATAGTT

GTTTATATTTATCATATTTGGGAATGTCAAAGATACCCACCAGGT

CCATTTCCTCTTCCGGTAATTGGAAACTACAACTTGTTAGCAAATGATCCTGTGAAGGCA

TTGTGCGATCTAGAAATTATTTACGGAGATGTTTTCAGTTTAAGTTTAGGAACCGTTCGG

GTGGTTGTTGTAAGCAGCCACGAGAGTATTTACGATGTTTTAGTTGGAGATGGATCAAAT

TTTTCCGGAAGACCCAGGGAGTATTCATCTTTACTTTTTACTGGAGGTTTTGAAAACCTT

TCCCATATGGATAACAACCCGTTGACTAAAAAAATCAGAAAAGTTTTTTATTCAAAACTT

AAAACAAACGGAAGTATTTTAGCACACAATGAAAATATTGTCAAACATGAAAGTGAACTT

TTACATCAAAGACTACTGCAAAACGAAGGAAGCGTCACCAATCTTCGTTATGAAATCGGT

 

AGATCTTTGTATTGTTAACAGCA

TATGCAGTATTATTTTTGGTAACCGGCTTAGTGATACTTGTGAAGTTCATGAAATTTTAA

AAGCGACCAGGTTACTTCTAAAAAACTTGTCAAACATTGAAATTATGCATTATTTACCAT

GGATGAGATTTTTTTTATTAAAAAAGCAAAACGAAATCAGCGAATCTAGAAACATTTGCA

AATTTTGGATTCAAACCCAGTTGCATAAACGAAAAAAAAGTTTAAAAAACGAAAATATCT

CAGATATTCTTTTGAACCTTTGGGACCAACAAAAACAAGAAAACCCTAATGAGGAACAAT

ACAGAATGATTTTAGTTGAGTTAGTTATGGCTGGTTCCGAAACAACAGCCGCAACGATAAC

TTGGCTAATCTTTTATCTTTTGCATTGGCCTCACTATCAAAGCATTCTTTACAAAGAAAT

CAAAAATGTTTGTGGTGATCAGTACCCTACGTTTAATGATATTAAATCAATGCCTATAAT

GCAAGCAACTATACTTGAAACTTTAAGGTTGTCTTCTGTCGTTCCTTTAAGCTTATCTCA

CAAAGCCGTAAATAACGCGAAAATTAATAAATTCACAATCCCTAAAGATACAATAATAAT

AACAAATTTATGGGGCGTACATCATAATGAAAAATACTGGGAAAAACCGTTTGAATTCAA

TCCTATGCGTTGGCTTGATAAAAATGGCGAACTTTCAACAGCAAAGCGTTTAGGATATTT

CCCTTTTTCAGCCGGCCCAAGAGGTTGCATTGGTGAGTCATTTGCAAGAATGCAAATGTT

TATTATATGCTCTCGACTGATAAAAGATTTCTCCTTTGAGTTGCCTCAAAGCGGAGAAAC

CCCAAAACTAGATGGTGATATTGGAATTACACTAACGCCCCTTCCTTATAATGCAGTAGC

TAAACAGCGAACCTAA

 

>gnl|ti|654998190 1095901734433  33% to CYP21 33% to CYP17 33% to 2U1

gnl|ti|651148169 1095901003210

gnl|ti|651162328 1095901096755

74% to 1095898227332

possible pseudogene

870 FQDIIKTHNET

837 SYISSIPWLRYFPTATSRNMIILNKNKYNIFEIIRLRDPILIRKLQEHKRTYD*SNLGDV 658

657 TDALIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSNMILWFIVYILHRPEYQDKLYDEIFKVFSG

IGYPSLNDRPRFHLIQAIIHETLRLLSVAPLGLCHKALENGSICGKFVPKG (frameshift)

LLILTNLWSIHHDERYWKNAINFFPERWLDNSGNFNYNLGYAYLPFS 147

146 GGPRSCLGETLAKTELFVFISRLVKDYRFEKPNGKDSLDGRSGVTCLPYEFEIVMIPRS*

 

>gnl|ti|655009845 1095963045220 near C-helix region poor match

gnl|ti|648592188 1095595897239

NOT A P450, MATCHED FISH AND DROSOPHILA SEQUENCES

637 STSNCVTMNQSDYLQNVLQKFGFDNCKQRSSPCKQVPYSYHNQESIKSNGSMIKL 473

472 YRQMVGSLLYAMTCTRPDLSYVITKLSQHLSKPNSGDWIMIKHVFRYIKHTLN 314

313 YCLTFRK 293

 

 

>gnl|ti|648047811 1095899057643 I-helix 4 aa diffs to 1095898198167

(0) LIEKRKKEIDDGISTKEKDIITIVLKDQQQESSKLTNDLIRDNLLLFLIAGHKTTSTTMTWCLYILGT (0)

AGCTTATTGAAAAACGTAAAAAAGAAATCGACGATGGAATATCAACAAAAGAGA

AGGATATTATCACAATTGTCTTAAAAGATCAACAGCAAGAAAGCAGCAAACTAACAAATG

ATTTGATTAGAGATAATTTATTATTATTTCTCATAGCTGGTCATAAAACAACTTCTACTA

CTATGACTTGGTGTTTATATATACTAGGCACTGT
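
The exon above is written with its flanking intron dinucleotides (AG before the first codon, GT after the last) and phase (0) at both ends. A minimal Python sketch of how such a fragment could be translated for checking against the peptide line. The names translate and translate_exon are illustrative only, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: no
# alternative code is needed for these nuclear genes).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate DNA in frame 1; unknown codons become X, a trailing partial codon is dropped."""
    dna = dna.upper().replace(" ", "")
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def translate_exon(dna, lead=2, trail=2):
    """Strip `lead` bases (e.g. the AG acceptor) and `trail` bases (e.g. the GT donor), then translate."""
    dna = dna.upper().replace(" ", "")
    return translate(dna[lead:len(dna) - trail])


# First 12 codons of the exon above, with a GT donor appended for illustration.
print(translate_exon("AGCTTATTGAAAAACGTAAAAAAGAAATCGACGATGGAGT"))  # LIEKRKKEIDDG
```

For an exon that starts in phase 1 or 2, lead would need to be adjusted so that the first complete codon begins the translated core.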

 

CYP1 like (only one seq)

 

1095899272864 5 aa diffs in N-term overlap region

MWYEIICGLIISILLYIIGSYLMHLLECRKYPLGPFPIPIFGNLHLLGTEPHKILAAYS

KKYGAVFSISLGLQRIVIISDITTTREALVQKASIFAGRPKSYLIQLISSGYKGIAFMDY

GSFWKVLRKVSHSSLKIYGEGHERFEKILTKESEELHKRLLKKSNNSVELKSEF (1)

>gnl|ti|648485307 1095899272864 57% to 1095897342515

ATGTGGTATGAAAT

TATCTGCGGACTGATCATTTCGATTTTGCTATATATTATTGGTTCTTACTTGATGCACTT

GCTGGAATGCAGGAAGTATCCTCTTGGACCTTTTCCAATACCAATCTTTGGTAACTTGCA

TTTATTAGGAACAGAGCCACATAAAATACTTGCTGCATACTCAAAAAAGTATGGAGCAGT

CTTTAGCATAAGTTTAGGATTGCAAAGAATTGTTATAATTTCTGACATTACTACAACTAG

AGAAGCACTAGTTCAAAAAGCATCCATATTTGCAGGTAGACCAAAATCTTATTTAATTCA

ATTAATTTCAAGTGGGTACAAAGGCATTGCATTTATGGACTATGGTTCCTTCTGGAAAGT

TTTGCGTAAAGTTAGTCATTCTTCATTAAAAATATATGGAGAAGGACATGAACGTTTTGA

AAAGATACTTACAAAAGAAAGTGAAGAGCTACATAAAAGACTTTTAAAGAAATCAAATAA

TTCCGTAGAGCTGAAATCTGAATTTGGT

>CN775634

tae83e09.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to

SW:CP11_OPSTA Q92095 CYTOCHROME P450 1A1

AGAGAGTGAAGAGCTACATAAAAGACTTTTAATGAAATCAAAAACTTCCGTAGATCTGAAAACTGAATTT

GGTGCTGCAATTATAAACGTGATTTGTTTCATTGTTTTTGGGGAAAGATACCAGTATTCAAATTCAGAAT

TTAAAGAAGTTCTTACAACAATAAACAATATAGTCGATGGGTTGTCAAATACAACTGCTGTCGGTTTTTT

GCCGTGGTTGAGATTTTTACCGTTTTCTCCAATAAAAAAACTGAGTATTTCACTTTCAAAATATATTCGT

TTTTTAAACGATAAGTTGACCAAACATAAGGAAACATTTAATGAAAACAAAATTCGAGATTCTACTGATT

CTATTATAAAC

 

32% to CYP1C1 aa 173-297

  2 ESEELHKRLLMKSKTSVDLKTEF (1)

GAAIINVICFIVFGERYQYSNSEFKEVLTTINNIV 175

176 DGLSNTTAVGFLPWLRFLPFSPIKKLSISLSKYIRFLNDKLTKHKETFNENKIRDS 343

344 TDSIIN 361

 

 

opposite end of clone =

>CN774619

tae83e09.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to

SW:CPT7_CHICK P12394 CYTOCHROME P450 17

TTTAAACTACATAAGATTGTTTACTCTTGGAATCAAGTATGCGCAAAATTGTTTTGGAGCATGAGTAATA

TTGCATTTTCCAATTAAACTTGGAGGTGGACAACCAGGTGTGCACTCAAACTTAAAATCTCGAATTAATC

TAGAAAAGAAGAAAAATAGTTCATTTTCAGCAACTGTTTTCCCTAAACAAACTCTTGTACCGGCTGAAAA

AGGTAAGAAACTTGTTGCTTTACTTGGATCAAATTTCTTATCTTTGCCAATCCGCCGATAAGGAATTAAA

TTCATTAGGATTTTCCCAGGAGTTAGTAACGTTATGAATCTGCCAATGATTAATCATGACAGTAGCAATT

TTCAGGAATGCTATGTGACATGAGAACTCCCCCCCAGAGGT

 

38% to CYP17A2 Fugu aa 383-485

391 TSGGEFSCHIAFLKIATVMINHWQIHNVTNSWENPNEFN 275

273 PYRRIGKDKKFDPSKATSFLPFSAGTRVCL

(1) GKTVAENELFFFFSRLIRDFKFECTPGCPP 94

 93 PSLIGKCNITHAPKQFCAYLIPRVNNLM* 7

 

1097672474909

AGGGAAA GTTGCTGAAAATGAACTATTTTTC TTCTTTTCTAGATTAATTCGAGATTTTAAGTTTG

AGTGCACACCT GGTTGTCCACCTCCAAGTTTAGTTGGAAAATGCAATATTACTCATGCT

CCAAAACAATTTTGCGCATACTTGATTCCAAGAATAAACAATCTTATGTAG

 

>1096526199166 frame3_ORF1 7 aa diffs to CN774619, may be same gene

(1) GAAIINVICFIVFGERYQYSDSEFKEVLTTINDIVDGLSNTTAVGFLPWLRFLPFSPIKKLSIS

LSKYVRFLNDKLKKHKETFDEKKIRDFTDSIINFSNNEAVKQKFKNVDEHLEPVIGDLFI

TGSETTLTSLLWLILYMMHYPKYQQEIFKEITTVIGEDRYPCLNDRDSLHLVKAALKECL

RLSSIVPLGLPHKTTKETVLMGHSIPGNATVMINHWQIHNDTNYWENPNEFNPYRWIGKD

KKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFNFECIPGCPPPSLIGKCN

ITHAPKQFCAYLTPRINNLM*

AGGTGCTGCAATTATAAACGTGATTTGTTTCATTGTTTTTGGGGAAAGATACCAGTATTCAGATTCAGAAT

TTAAAGAAGTTCTTACAACAATAAATGATATAGTCGATGGGTTGTCAAATACAACTGCTG

TTGGATTTTTGCCGTGGTTGAGATTTTTACCGTTTTCTCCAATAAAAAAACTGAGTATTT

CACTTTCAAAATATGTTCGTTTTTTAAACGATAAGTTGAAAAAACATAAGGAAACATTTG

ATGAAAAGAAAATTCGAGATTTTACTGATTCTATTATAAATTTTTCTAATAACGAAGCTG

TCAAACAAAAATTTAAAAACGTTGATGAACATTTAGAGCCTGTGATTGGGGATTTATTTA

TAACGGGTAGTGAGACCACATTAACATCTTTATTGTGGTTAATTCTTTATATGATGCATT

ATCCCAAATATCAACAAGAAATTTTTAAAGAAATTACAACGGTTATTGGTGAAGACCGGT

ACCCATGTTTAAATGACCGTGATTCTTTGCATCTTGTTAAAGCCGCATTAAAAGAGTGTC

TGCGTTTATCTTCAATTGTTCCTCTTGGATTACCACACAAAACAACCAAAGAAACAGTTC

TTATGGGACATAGCATTCCTGGGAATGCAACAGTCATGATTAATCATTGGCAGATTCATA

ACGATACTAACTACTGGGAAAATCCTAACGAATTTAATCCTTATCGGTGGATTGGTAAAG

ATAAGAAATTTGATCCAAGTAAAGCAACAAGTTTTTTACCTTTTTCAGCCGGTACAAGAG

TTTGTTTAGGGAAAACAGTTGCTGAAAATGAACTATTTTTCTTCTTTTCTAGATTAATTC

GAGATTTTAACTTTGAGTGCATACCTGGTTGTCCACCTCCAAGTTTAATTGGTAAATGCA

ATATTACTCATGCTCCAAAACAGTTTTGCGCATACTTGACTCCAAGAATAAACAATCTAATGTAA
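
Notes such as "7 aa diffs" come from comparing two translations over their shared region. A small Python sketch of that comparison, assuming the two fragments are already aligned without gaps and have equal length. The function name count_diffs is illustrative, and the example uses only a short stretch of the CN774619 and 1096526199166 translations, not the full overlap.

```python
def count_diffs(a, b):
    """Return a list of (position, residue_a, residue_b) for mismatches; positions are 1-based."""
    if len(a) != len(b):
        raise ValueError("fragments must be pre-aligned to equal length")
    return [(i + 1, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


est_frag = "GKTVAENELFFFFSRLIRDFKFECTPGCPP"      # from the CN774619 translation
genomic_frag = "GKTVAENELFFFFSRLIRDFNFECIPGCPP"  # from 1096526199166

for pos, x, y in count_diffs(est_frag, genomic_frag):
    print(pos, x, "->", y)
```

A gapped comparison would need a real alignment step first; this sketch only counts substitutions.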

 

>whole gene 1095899272864 1096526199166

MWYEIICGLIISILLYIIGSYLMHLLECRKYPLGPFPIPIFGNLHLLGTEPHKILAAYS

KKYGAVFSISLGLQRIVIISDITTTREALVQKASIFAGRPKSYLIQLISSGYKGIAFMDY

GSFWKVLRKVSHSSLKIYGEGHERFEKILTKESEELHKRLLKKSNNSVELKSEF (1)

GAAIINVICFIVFGERYQYSDSEFKEVLTTINDIVDGLSNTTAVGFLPWLRFLPFSPIKKLSIS

LSKYVRFLNDKLKKHKETFDEKKIRDFTDSIINFSNNEAVKQKFKNVDEHLEPVIGDLFI

TGSETTLTSLLWLILYMMHYPKYQQEIFKEITTVIGEDRYPCLNDRDSLHLVKAALKECL

RLSSIVPLGLPHKTTKETVLMGHSIPGNATVMINHWQIHNDTNYWENPNEFNPYRWIGKD

KKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFNFECIPGCPPPSLIGKCN

ITHAPKQFCAYLTPRINNLM*

 

 

CYP2 like (2 different sequences)

 

>CN566581

taf98h10.x1 Hydra EST -Kiel 1 Hydra magnipapillata cDNA 3' similar to SW:CPC8_HUMAN

P10632 CYTOCHROME

CACGCGTCCGCTTTTTAGTCCCTGTGTAATAAGTATTTCTTAGACATAATTTTAAAATGTTTCTTGAAGT

TATTGGCGCAGTCTTTATTCCACCTTTGATATGGACTATATGGGTTTACATTAAACATTTAATTGATTGT

TTGCATTATCCAAGAGGACCAATACCACTACCATTTATTGGAAATGGTTATTTGATAAGAAAAGCTGAAC

CATATAAAGAGTTGGTTAACTTAGGAAAAATATATGGCGATGTTTTTAGTTTTAGCGTTGGTTCAGTCAG

ATATGTAATTGTCAACAGTTTAGAAGGAATTCAAGAAGTACTAGTTAAAAAAGGGTGGCAATTTGCTGGT

CGTCCAAAAGGTCCAAGTTGGGATAGATCCATTCACGGTCTAATCCAACGTGATCCAAGTAAAAAATTTA

AAATATTACGGAAGCTAGCAACATCATCTTTGAAAATCTTTGCTGATGGATTGGCAGGGATGGAAAGTAA

AGCTATA

 

32% to 2X9 aa 26-146

 57 MFLEVIGAVFIPPLIWTIWVYIKHLIDCL

144 HYPRGPIPLPFIGNGYLIRKAEPYKELVNLGKIYGDVFSFSVGSVRYVIVNSLEGIQEVL 323

324 VKKGWQFAGRPKGPS

WDRSIHGLIQRDPSKKFKILRKLATSSLKIFADGLAGMESKAI 497

EESFQLNKKLLETNGKPF

 

opposite end of clone =

>CN566859

taf98h10.y1 Hydra EST -Kiel 1 Hydra magnipapillata cDNA 5' similar to SW:CPT7_SQUAC

Q92113 CYTOCHROME P450 17

TAATCATAAATGTGTATTTAAAATAAAAATTAAAACTCTTTTTGATAGACAATAGTTAGCTAATAGAAAT

TAGAAAAATTTTTAATTAACACATTTTGCCTAAAATTTAAAGGTTAAGCAAACATTTACTAATCGTTTCA

TTTTCAAATTATTTACTATTTATCTAGTATCTAATTTTTCAATCAATATAGTTTATATAAATTTAAAAAA

AAAAACATTAGCAAAAT TAAAG CAATAAATTTTTACT CCTTGGAACTATTTCAACTTTAAAGTCATAAGG

AGTACAAGTGATGCCAAAACTACCTTCTAAGCGCGGTAATTCTTCTAAAATTGGTTTCACAAATCGGAAA

TCATTTATCAATCGTGATATAAAAATAAACAACTCAGTTTTTGCCAATGTTTCTCCAATACAGCTACGAG

GTCCACTAGAGAATGGTAAATAAGCGTTTCCTAGTTTAGAATTAAATTCGCCAGAATTTTCTAACCAACG

TTCAGGGTAAAAACTCATTGCATTTTTCCAGTAACTTTCGTCATGGTGTATACTCCATAGGTTGGTTATT

ATAAGAGCCCCTTT GGGGAATAGGTTTTCCACAAATGCTACCTTTCTCC

 

44% to 17A1 fugu aa 378-485

607 RKVAFVENLFPKGALIITNLWSIHHDESYWKNAMSFYPERWLENSGEFNSKLGNAYLPFSSGPRSCIGETL 395

394 AKTELFIFISRLINDFRFVKPILEELPRLEGSFGITCTPYDFKVEIVPRSKNLLV* 227

 

CYP3 like (two different sequences)

 

>CN567799 opposite end = CN567598 tag12b09.x1

tag12b09.y1 Hydra EST -Kiel 1 Hydra magnipapillata cDNA 5' similar to TR:Q9PVE8 Q9PVE8

CYTOCHROME P450 3A30

TTACTTTGTATTTAAAAATCATCAAAAGAAAACCCCAAACAATCATTATTATAAATGTTAGGACTTAAAG

TTTTAAATAAAGTTTATTCTTCTGTTGAATATTATTCTGCAATAGGTTTTA CTCTAATTAGCAATGGTTC

AACTGTTCTCATAGTTAACCAAATAATCTTTGTATAATTTATATGCATAGGGTCAATAGAAAATTTGAAT

TGCAACAACGCTTTGACCAAGAATGTTTTAATTTCAAGCAAAGCAAAGTTCTTTCCAATACAGTTATAAA

CACCGTGCCCAAAAGGAAGATAGAATGATGCTGGTATTTCCCCTGTCATAAACCTTTCAGGTTTAAATGA

ATGAGGATCTCGATAAATAGTCTCATTCATGTGTAGATTGCTAACATGAGTTTGAAGTACTGTGTTTGCA

GGAATTTCATATTCGCCAAATTTTATTGCATTAATGGTCTCTCTGCCTAAAACTGGTGCAAGTCCATGCA

AGCGCAAGGTTTCTTTAACGACACAATCCAAATATTTTAAACTCGTAACTTCTTCAAAAGAAATATTATT

TACATCCAATTTATTTTTCTGAATCTCTTCTCTAAGTTTCTCTTGAACCTCTAAGTT AATGGCTAGCATA

TATAAACACAAAGTCAAAGTAGTAGAAATTGTTTCATAACCAGCTATGAAAAA

 

Combined N and C-terms

 

>CN567598 tag12b09.x1 [gene 4] N-term

RVRFLLRYLLKRIFHPLRFLPSPKEQLITGHINHFQGRDHSSTYLSFNEKFKEESLCTLD

TLHVPRFVYLIAPEFIKKIFADGKLFQRSKSIRTLAPLIGNSMVGSNYEHHHWQRKLFNG

AFTSQQLKNYFPAFLKHTNLLMKLWSYTCDKESGTNLTVLDDLSNLSF

 

39% to 3A27 trout aa 307-472 58% to CN770283 [gene 4]

683 FFIAGYETISTTLTLCLYMLAI

(0) NLEVQEKLREEIQKNKLDVNNISFEEVTSLKYLDCVVK 504

503 ETLRLHGLAPVLGRETINAIKFGEYEIPANTVLQTHVSNLHMNETIYRDPHSFKPERFMT (1)

323 GEIPASFYLPFGHGVYNCIGKNFALLEIKTFLVKALLQFKFSIDPMHINYTK 168

167 IIWLTMRTVEPLLIRVKPIAE* 102

 

>CN770283 58% to CN567799

tad87b02.y2 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to

SW:CP4C_BLADI P29981 CYTOCHROME P450 4C1

AAAGAATGTATTTGATATAAAAAGTGTTTCATATGAAGAAGTTTTGAGTATCAAGTACTTAGATTGTGTT

GTTAAAGAAACATTGCGCATGCATACACCTGTAGCATTTATTGGTAGAATAAATAAGAATCAAACAAAGT

TTGGTGACTTTGATGTACCTGCTGGCTCTTTTTTACGAATTCCTATTGACAGTGCACATATGAACGAGTC

TGTTTATCATGATCCTCATTCATTTAGACCACAACGATTCTTGACAGGTGAAATACCACCATTATCCTTC

CTTACATTTGGGCAAGGTACATATAATTGTATCGGAAAGAATTTTGCTTTGCTTGAAATCAAAACATTCT

TGGTCAAAGCGTTGCTGCAATTCAAATTTTCAGTAGACCTTAAGCGTTTGGAAATTAACAAGCTGAAT

 

35% to 3A27 trout aa 343-472

2 KNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTKFGDFDVPAGSFLRI

PIDSAHMNESVYHDPHSFRPQRFLTGEIPPLSFLTFGQGTYNCIGKNFALLEIKTFLVKA

LLQFKFSVDLKRLEINKLN 418

 

>CN776982 taf28f06.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3'

           similar to TR:Q9VXY0 Q9VXY0 CG9081 PROTEIN. ;.

          Length = 316

Same as 1096041191032

Query: 79  RPQRFLTGEIPPLSFLTFGQGTYNCIGKNFALLEIKTFLVKALLQFKFSVDLKRLEINKL 138

           R +RFLTGEIPPLSFLTFGQG YNCIGKNFALLEIKTFLVKALLQF+FSVDLK L   KL

Sbjct: 307 RQERFLTGEIPPLSFLTFGQGIYNCIGKNFALLEIKTFLVKALLQFEFSVDLKHLNYKKL 128

 

QRTRQERFLT

GEIPPLSFLTFGQGIYNCIGKNFALLEIKTFLVKALLQFEFSVDLKHLNY

KKLISITNKTVEPLWIRVKPI*

AGGTGAAATACCACCATTATCCTTCCTTACATTTGGGCAAGGTATATATAATTGTATCGGAAAGAAT

TTTGCTTTGCTTGAAATCAAAACATTCTTGGTCAAAGCGTTGCTGCAATTCGAATTTTCA

GTAGACCTTAAGCATTTGAATTATAAGAAGCTGATTTCGATTACTAATAAAACCGTTGAA

CCGTTATGGATAAGAGTGAAGCCTATATAA

 

Combined CN776982 and CN770283 [gene 5]

(0) NLGVQNKLREEIKKNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTK

FGDFDVPAGSFLRIPIDSAHMNESVYHDPHSFRPQRFLT ()

GEIPPLSFLTFGQGIYNCIGKNFALLEIKTFLVKALLQFEFSVDLKHLNY

KKLISITNKTVEPLWIRVKPI*

 

AGAACTTAGGTGTTCAAAACAAATTGAGAGAAGAGATAAAAAAGAATGT

ATTTGATATAAAAAGTGTTTCATATGAAGAAGTTTTGAGTATCAAGTACTTAGATTGTGT

TGTTAAAGAAACATTGCGCATGCATACACCTGTAGCATTTATTGGTAGAATAAATAAGAA

TCAAACAAAGTTTGTTGACTTTGATGTACCTGCTGGCTCTTTT

 

CYP4 like

 

>CN627429

tae92b11.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to

SW:CP51_CANGA P50859 CYTOCHROME P450 51

GATAATGGTACTAACATAATTGTTTTGGATGACTTATCTAATTTATCATTTGATATAATTGGGGATGTTG

GTTTTGGCTATCAATTTAACACAATTACTTCTCATTCTGGTAATGAGTTTACAAAAGCGCTTCAGAGTTA

TTGTCAACTACGATTTCAATTGAATGCCGTCCATAAAGCTCTACTAGCTTTCTTTCCATTTTTAATGCAT

CTGTCATTTATGTATGGAAAACGTAAACGAGCTGAGCAAGTCATCTGTAATACTTTAAACATGCTTATTA

ATAAACGCAAAAAAGAGATAGACCACCGAATAGCAGCTGATCAAAAAGATTTTTTAACAGTTGTTTTAAA

AGATCAACAAAAAGAAGGCAACAAGATGACAAATGACTTGATTAAAAATAATCTGATGACGCCTTTAATT

GCAGGTCACAAAACAACTTCCACTGACATGCCATGGTGTTTCAACGTGCTTGCGCCAAACCCAAGTGCTA

CCAAACACATGCAAAGAAACAGAAAGAAA GAATACATCT CGACCACAAAA

 

31% to 4T5 aa 183-347

  1 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNAVHKALLA 189

190 FFPFLMHLSFMYGKRKRAEQVICNTLNM

LINKRKKEIDHRIAADQKDFLTVVLK 351

352 DQQKEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK 540

 

CN770090 taf75f08.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5'

           similar to TR:Q40411 Q40411 PUTATIVE CYTOCHROME P-450.

           ;.

          Length = 299

 

 Score =  113 bits (283), Expect = 1e-26

 Identities = 54/56 (96%), Positives = 55/56 (98%)

 Frame = -1

 

Query: 1   DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNA 56

           DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQL +

Sbjct: 170 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLKS 3
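
The Identities/Positives/Gaps line of each pasted BLAST report can be read mechanically. A hedged Python sketch follows; parse_stats and floor_pct are illustrative names, and the regular expression only assumes the "Name = n/m (p%)" layout seen in this file. The percentages printed in these reports look truncated rather than rounded (for example 21/274 is shown as 7%), so integer division is used for the check.

```python
import re

STAT_RE = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Map each statistic name to (matches, aligned_length, reported_percent)."""
    return {name: (int(n), int(m), int(p)) for name, n, m, p in STAT_RE.findall(line)}


def floor_pct(n, m):
    """Percentage truncated to an integer, as the reports in this file appear to print it."""
    return 100 * n // m


stats = parse_stats(" Identities = 54/56 (96%), Positives = 55/56 (98%)")
for name, (n, m, reported) in stats.items():
    print(name, n, m, reported, floor_pct(n, m) == reported)
```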

 

CN775805 tae77f11.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3'

           similar to TR:Q42700 Q42700 CYTOCHROME P450 ;.

          Length = 562

 

 Score = 58.5 bits (140), Expect = 5e-10

 Identities = 27/27 (100%), Positives = 27/27 (100%)

 Frame = +3

 

Query: 1   DNGTNIIVLDDLSNLSFDIIGDVGFGY 27

           DNGTNIIVLDDLSNLSFDIIGDVGFGY

Sbjct: 480 DNGTNIIVLDDLSNLSFDIIGDVGFGY 560

 

>Combined CN627429 CN775805 27% to 4T5 [gene 6]

GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFK

ERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNGIFVSNYEDHH

WQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDK

1 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQL  RFQLNAVHKALLA 189

190 FFPFLMHLSFMYGKRKRAEQVICNTLNMLINKRKKEIDHRIAADQKDFLTVVLK 351

352 DQQKEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK 540

 

Combined CN776982 and CN770283 [gene 5] = DN812371.1

(0) NLGVQNKLREEIKKNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTK

FGDFDVPAGSFLRIPIDSAHMNESVYHDPHSFRPQRFLT ()

GEIPPLSFLTFGQGIYNCIGKNFALLEIKTFLVKALL QFEFSVDLKHLNY

KKLISITNKTVEPLWIRVKPI*

 

AGAACTTAGGTGTTCAAAACAAATTGAGAGAAGAGATAAAAAAGAATGT

ATTTGATATAAAAAGTGTTTCATATGAAGAAGTTTTGAGTATCAAGTACTTAGATTGTGT

TGTTAAAGAAACATTGCGCATGCATACACCTGTAGCATTTATTGGTAGAATAAATAAGAA

TCAAACAAAGTTTGTTGACTTTGATGTACCTGCTGGCTCTTTT

EST DN812371.1 joins with CN627429 CN775805 and 1095901729505

GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFK

ERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNGIFVSNYEDHH

WQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDK

DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQL

RFQLNAVHKALLAFFPFLMHLSFMYGKRKRAEQVICNTLNM

LINKRKKEIEDGIAADQKDFLTVVLKDQQKEGSKMTNDLIKDNLMTLLIAGHETTSTAMQWCLYMLGT

NLGVQNKLREEIKKNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTKFGDFDV

PAGSFLRIPIDSAHMNESVYHDPYSFRPERFLTGEIPPLSFLTFGQGIYNCIGKNFALLE

IKTFLVKALLQFEFSVDLKHLNYKKLISITNKTVEPLWIRVKPI*

 

1095901729505 I-helix part of DN812371.1

1097675072038 1097672494604 1097567117390

(0) LINKRKKEIEDGIAADQKDFLTVVLKDQQKEGSKMTNDLIKDNLMTLLIAGHETTSTAMQWCLYMLGT (0)

AGCTTATTAATAAACGCAAAAAAGAAATAGAAG

ATGGAATAGCAGCTGATCAAAAAGATTTTTTAACAGTTGTTTTAAAAGATCAACAAAAAG

AAGGCAGCAAGATGACAAATGACTTGATTAAAGATAATCTGATGACGCTTTTAATTGCTG

GTCACGAAACAACTTCTACTGCAATGCAATGGTGTTTATACATGCTTGGCACAGT

 

 

 

CYP17 like (4 different sequences)

 

>CN769570

taf31c10.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to

SW:CPT7_CHICK P12394 CYTOCHROME P450 17

ACAGAAAATCTTACGATGAGAATAATTTACGTGACATAACCGATGCATTAATAAAAGTGTCTTTAGATTC

AGAGATGGGTGAAGAATTAACTGAAAAGATTACTGATGATAATATTGAGTTTCTTTTAAACGATTTTATG

ATTGCTGGATCCGAAACTTCATCAAGTACTATTCTTTGGTTTATTGTTTACATGTTACATTGGCCAGAAT

ACCAAAATAAACTTTATGATGAAATTACTAAAGTAGCATCAGATAACCGTTATGTATCTTTAAAGGATCG

ACCTATGCTTCATTTAATGCAAGCTACAATTCATGAAACACTTAGACTGTCATCGGTGGTACCTCTTGGT

TTGGTTCATAAAGCAATGGAGAACAGTAGCATTTGTGGCAAGTTTGTTCCTAAGGGAGCTCTTATTTTAA

CAAATTTATGGAGTATGCATCACGATGAAAGCTATTGGAAAAATGCAATGAGTTTTTACTCGGAACGTTG

GCTGGAAAAATCTGGCGAGTTCCATTATAAATTGGGGTACGCATAATTACCGTTTTCTATAGGG

 

35% to CYP17A zebrafish  aa 266-449

3 RKSYDENNLRDITDALIKVSLDSEMGEELTEKITDDNIEFLLNDFMIAGSETSSSTILWF

IVYMLHWPEYQNKLYDEITKVASDNRYVSLKDRPMLHLMQATIHETLRLSSVVPLGLVHK

AMENSSICGKFVPKGALILTNLWSMHHDESYWKNAMSFYSERWLEKSGEFHYKLGYA*LP

FSIG 554

 

>CN769290 opposite end = CN769570 taf31c10.y1

taf31c10.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to

SW:CPT7_RANDY O57525 CYTOCHROME P450 17

TTATCTTGCTTTTCACGTTTCCTTGGTTATTCCATATTTTTGAAATTTTTAAATATTATATAAAGCAAAA

ATACAGAAAAGTGAAGCAAAAATAGTTAATTTCTTGGAATTATCACGACTTCAAAGTCATTAGGAGGGGA

GGTGATTCCAGAACGACCATCTAAACAAGGTAACTCTTTTCCAGTTGGCATTTCAAATCGGTAATCTTTA

AGTAATCGTGTAATAAACACAAACAACTCTGTTTTTGCCAATGTTTCTCCTAAACAACTACGAGGTCCAT

TAGAAAACGGTAAATATGCGTACCCCAATTTATAATTGAACTCGCCAGATTTTTCCAGCCAACGTTCCGG

GTAAAAACTCATTGCATTTTTCCAATAGCTTTCATCGTGATGCATACTCCATAAATTTGTTAAAATAAGA

GCTCCCTTAGGAACAAACTTGCCACAAATGCTACTGTTCTCCATTGCTTTATGAACCAAACCAAGAGGTA

CCACCGATGACAGTCTAAGTGTTTCATGAATTGTAGCTTGCATTAAATGAAGCATAGGTCGATCCTTTAA

AGATACATAACGGTTATCTGATGCTACTTTAGTAATTTCATCATAAAGTTTATTTTGGTATTCTGGCCAA

TGTAACATGTAAACAATAAACCAAAGAATAGTACTTGATGAAGTTTCGGATCCAGCAATCATAAAATCGT

TTACAAGAAACTCAATATTATCCTCAGT

 

42% to CYP17A aa 299-503

728 TEDNIEFLVNDFMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLKDR

PMLHLMQATIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDESYWK

NAMSFYPERWLEKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKDYRFEM

PTGKELPCLDGRSGITSPPNDFEVVIIPRN*

 

>Combined seq from CN769290 and CN769570 39% to CYP17A [gene 7]

RKSYDENNLRDITDALIKVSLDSEMGEELTEKITDDNIEFLLNDFMIAGSETSSSTILWF

IVYMLHWPEYQNKLYDEITKVASDNRYVSLKDR

PMLHLMQATIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDESYWK

NAMSFYPERWLEKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKDYRFEM

PTGKELPCLDGRSGITSPPNDFEVVIIPRN*

 

 

>CN774619

tae83e09.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to

SW:CPT7_CHICK P12394 CYTOCHROME P450 17

TTTAAACTACATAAGATTGTTTACTCTTGGAATCAAGTATGCGCAAAATTGTTTTGGAGCATGAGTAATA

TTGCATTTTCCAATTAAACTTGGAGGTGGACAACCAGGTGTGCACTCAAACTTAAAATCTCGAATTAATC

TAGAAAAGAAGAAAAATAGTTCATTTTCAGCAACTGTTTTCCCTAAACAAACTCTTGTACCGGCTGAAAA

AGGTAAGAAACTTGTTGCTTTACTTGGATCAAATTTCTTATCTTTGCCAATCCGCCGATAAGGAATTAAA

TTCATTAGGATTTTCCCAGGAGTTAGTAACGTTATGAATCTGCCAATGATTAATCATGACAGTAGCAATT

TTCAGGAATGCTATGTGACATGAGAACTCCCCCCCAGAGGT

 

38% to CYP17A2 Fugu aa 383-485

391 TSGGEFSCHIAFLKIATVMINHWQIHNVTNSWENPNEFN 275

273 PYRRIGKDKKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFKFECTPGCPP 94

 93 PSLIGKCNITHAPKQFCAYLIPRVNNLM* 7

 

>CN570733 same as CN570522 BP505786

tag42d11.y1 Hydra EST -Kiel 2 Hydra magnipapillata cDNA 5' similar to SW:CPT7_ORYLA

P70085 CYTOCHROME P450 17

AGCTGGTTTCTTCAAGACTTCTGTGAGGTAAACCTAATGGAATGACAGACGACAAACGCAGAGTTTCTTT

CATAGCACTTTCAAATAAATGAAGCTTTGGACGATCTGAAAGACTAGGATACCTATCATTACCGACTATT

TTAATAGTTTCATCATAGATATCATCTTGATACTTTGGCCAGTTAACTAAATAAAC

 

VYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFESAMKETLRLSSVIPLGLPHRSLEETS
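
The peptide above reads off the reverse strand of the CN570733 EST, so the read has to be reverse-complemented before translation. A minimal Python sketch, assuming the standard genetic code; revcomp and translate_minus are illustrative names. The example uses only the last 18 bases of the EST, which encode the first six residues of the peptide.

```python
from itertools import product

BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def revcomp(dna):
    """Reverse complement of a DNA string (A/C/G/T only; other symbols pass through unchanged)."""
    return dna.translate(COMPLEMENT)[::-1]


def translate_minus(dna, offset=0):
    """Translate the reverse strand, starting `offset` bases into the reverse complement."""
    rc = revcomp(dna.upper())[offset:]
    usable = len(rc) - len(rc) % 3
    return "".join(CODON_TABLE.get(rc[i:i + 3], "X") for i in range(0, usable, 3))


tail = "CCAGTTAACTAAATAAAC"   # 3' end of the CN570733 read
print(revcomp(tail))          # GTTTATTTAGTTAACTGG
print(translate_minus(tail))  # VYLVNW
```

The offset argument selects among the three reverse frames (0, 1 or 2).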

 

>CN570522 same as CN570733

tag42d11.x1 Hydra EST -Kiel 2 Hydra magnipapillata cDNA 3' similar to SW:CPT7_ORYLA

P70085 CYTOCHROME P450 17

GGTGTTTATTTAGTTAACTGGCCAAAGTATCAAGATGATATCTATGATGAAACTATTAAAATAGTCGGTA

ATGATAGGTATCCTAGTCTTTCAGATCGTCCAAAGCTTCATTTATTTGAAAGTGCTATGAAAGAAACTCT

GCGTTTGTCGTCTGTCATTCCATTAGGTTTACCTCACAGAAGTCTTGAAGAAACCAGC

 

43% to CYP17 aa 326-389 same as BP505786

GVYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFESAMKETLRLSSVIPLGLPHRSLEETS

 

>CN566859 opposite end = CN566581 taf98h10.x1

taf98h10.y1 Hydra EST -Kiel 1 Hydra magnipapillata cDNA 5' similar to SW:CPT7_SQUAC

Q92113 CYTOCHROME P450 17

TAATCATAAATGTGTATTTAAAATAAAAATTAAAACTCTTTTTGATAGACAATAGTTAGCTAATAGAAAT

TAGAAAAATTTTTAATTAACACATTTTGCCTAAAATTTAAAGGTTAAGCAAACATTTACTAATCGTTTCA

TTTTCAAATTATTTACTATTTATCTAGTATCTAATTTTTCAATCAATATAGTTTATATAAATTTAAAAAA

AAAAACATTAGCAAAAT TAAAG CAATAAATTTTTACT CCTTGGAACTATTTCAACTTTAAAGTCATAAGG

AGTACAAGTGATGCCAAAACTACCTTCTAAGCGCGGTAATTCTTCTAAAATTGGTTTCACAAATCGGAAA

TCATTTATCAATCGTGATATAAAAATAAACAACTCAGTTTTTGCCAATGTTTCTCCAATACAGCTACGAG

GTCCACTAGAGAATGGTAAATAAGCGTTTCCTAGTTTAGAATTAAATTCGCCAGAATTTTCTAACCAACG

TTCAGGGTAAAAACTCATTGCATTTTTCCAGTAACTTTCGTCATGGTGTATACTCCATAGGTTGGTTATT

ATAAGAGCCCCTTT GGGGAATAGGTTTTCCACAAATGCTACCTTTCTCC

 

Combined seq

 

Opposite end

MFLEVIGAVFIPPLIWTIWVYIKHLIDCLHYPRGPIPLPFIG

NGYLIRKAEPYKELVNLGKIYGDVFSFSVGSVRYVIVNSLEGIQEVLVKKGWQFAGRPKG

PSWDRSIHGLIQRDPSKKFKILRKLATSSLKIFADGLAGMESKAI

 

44% to 17A1 fugu aa 378-485 [gene 2]

607 RKVAFVENLFPKGALIITNLWSIHHDESYWKNAMSFYPERWLENSGEFNSKLGNAYLPFSSGPRSCIGETL 395

394 AKTELFIFISRLINDFRFVKPILEELPRLEGSFGITCTPYDFKVEIVPRSKNLLV* 227

 

 

 

CYP46 like (only one seq)

 

>CN775805

tae77f11.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to TR:Q42700

Q42700 CYTOCHROME P450

TCGGCATAGCAGTATTAATTTTTTTGTGTTTTTCACTGTTTTTTGCTAATATTTTAAAACGTTTTTATCA

TCCGCTTCGTAAGTTGCCATCACCTAAAGAAAATTTCTTTACTGCTCATTATGGCTACTTTAATGGCTAT

GATCAAATAAATGCTGTAATAAATTTTGGAAAACAGTTTAAAGAGCGTGGCTTGTATACATTAGATACAT

TAAATGGATTTAGATTTGTTAATCTTTTAATGCCAGAATTTATTAAAACAGTGTTTTCTGATGGAAACTC

ATTCCAAAGATCGACCGCTACAAAAGTTATATTTCCTCTAGTTGGAAATGGTATTTTTGTGTCAAATTAT

GAAGATCATCATTGGCAAAGAAAAGTGTTAAATGAAGCTTTTACTTTACAACAGCTAAAAAATTATTTTC

CAGCTTTTACAGTGCACATTGATTTGCTAATGAAACTTTGGTCATATTCATGTGACAAGGATAATGGTAC

TAACATAATTGTTTTGGATGACTTATCTAATTTATCATTTGATATAATTGGGGATGTTGGTTTTGGCTAT

CA

 

N-term 26% to CYP46a zebrafish aa 10-203

  3 GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINF 167

168 GKQFKERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNG 332

333 IFVSNYEDHHWQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDKDNGTNII 500

501 VLDDLSNLSFDIIGDVGFGY 560

 

>gi|47138506|gb|CN627429.1|CN627429 tae92b11.y1 Hydra EST Darmstadt I Hydra

magnipapillata cDNA 5'

           similar to SW:CP51_CANGA P50859 CYTOCHROME P450 51 ;.

          Length = 540

 

 Score = 58.5 bits (140), Expect = 5e-10

 Identities = 27/27 (100%), Positives = 27/27 (100%)

 Frame = +1

 

Query: 160 DNGTNIIVLDDLSNLSFDIIGDVGFGY 186

           DNGTNIIVLDDLSNLSFDIIGDVGFGY

Sbjct: 1   DNGTNIIVLDDLSNLSFDIIGDVGFGY 81

DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNAVHKA

LLAFFPFLMHLSFMYGKRKRAEQVICNTLNMLINKRKKEIDHRIAADQKDFLTVVLKDQQ

KEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK

 

Combined CN775805 and CN627429 [gene 8]

3 GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINF 167

168 GKQFKERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNG 332

333 IFVSNYEDHHWQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDKDNGTNII 500

501 VLDDLSNLSFDIIGDVGFGY 560

QFNTITSHSGNEFTKALQSYCQLRFQLNAVHKA

LLAFFPFLMHLSFMYGKRKRAEQVICNTLNMLINKRKKEIDHRIAADQKDFLTVVLKDQQ

KEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK
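
The combined sequence above joins the CN775805 and CN627429 translations through their shared 27-residue stretch (DNGTNII...GFGY). A Python sketch of that kind of join, assuming an exact suffix/prefix overlap with no mismatches; overlap_len and merge are illustrative names, and the min_overlap threshold is an arbitrary guard against chance one- or two-residue joins.

```python
def overlap_len(left, right):
    """Length of the longest suffix of `left` that is also a prefix of `right`."""
    for k in range(min(len(left), len(right)), 0, -1):
        if left.endswith(right[:k]):
            return k
    return 0


def merge(left, right, min_overlap=5):
    """Join two fragments through their exact overlap; refuse if the overlap is too short."""
    k = overlap_len(left, right)
    if k < min_overlap:
        raise ValueError("no sufficient exact overlap")
    return left + right[k:]


n_part = "LWSYSCDKDNGTNIIVLDDLSNLSFDIIGDVGFGY"        # end of the CN775805 translation
c_part = "DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEF"  # start of the CN627429 translation
print(overlap_len(n_part, c_part))
print(merge(n_part, c_part))
```

Fragments that differ by a residue or two inside the overlap would need an alignment-based join instead.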

 

 

BP514308 BP514308 Hydra magnipapillata cDNA library Hydra magnipapillata

           cDNA clone hydmg002bw_87.

          Length = 586

 

 Score = 66.6 bits (161), Expect = 1e-12

 Identities = 28/56 (50%), Positives = 37/56 (66%)

 Frame = +2

 

Query: 5   LIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFK 60

           +I +    F A   KRFYH  R LPSPKE+  T HY YF+ +D +N ++NFGK+FK

Sbjct: 53  IIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFGKEFK 220

 

>BP514308 N-term 25% to 46a [gene 9]

1096761991009 1096082187152 1096123591182 1095899045709 1097383004013

MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFGKEFKDYGLYTINTLI (1)

GPRQVHLLLPHFIKTVIADGKFFQRSPVFKAVFPLVGNSMIVSNYEDHHWQRKLF

NQAFTSQQLKRYFLAFTLHTDLLMK (0)

LWSCTCDKENGTNLNVWSDLSNLSFDIIGDVGFGYQFNTITSHSGNAFTKALRSY

INLRFNSSVVHNVLIAYFPFLMRFLSKFGNLNKAEQVIYNTLNM (0)

 

ATGTATTCGATATACATAGCGATTATAATAGTTC

CTTTAGTTTTTTTCGTTGCTGTTTTTTTTAAACGTTTTTATCATCAATTTCGTTTGTTGC

CATCACCCAAAGAGAGTTTAATTACATGTCACTATAGTTATTTTGATGTTCATGACCATG

TTAACACTCTGTTAAACTTTGGTAAAGAGTTTAAAGATTACGGATTATATACAATTAATA

CTTTAATTGGT

1096041094868 1096013042315 1097383004013 1097675345181 1096602222013

AGCTCTGGTCATGCACATGTGATAAAGAAAATGGCACTAACTTAAACGTTTGGAGTGACTTGTCTAATCTTTC

ATTTGATATAATTGGTGACGTTGGTTTTGGCTATCAATTCAACACTATTACATCTCATTC

TGGAAATGCGTTTACAAAAGCACTTCGAAGTTATATTAACTTACGATTTAATTCTAGCGT

AGTGCACAATGTTCTAATAGCTTATTTTCCATTCTTAATGCGTTTTTTATCAAAGTTTGG

AAATCTTAATAAAGCTGAGCAAGTTATTTACAATACCCTGAACATGGT

 

>BP508840 BP508840 Hydra magnipapillata cDNA library Hydra magnipapillata

           cDNA clone hmp_03437.

          Length = 452

 

BLAST with CYP20 Fugu C-term

 

Query: 88  VDQHLIPKESLVIYALGVILQDSDTWNAPYRFDPDRFEEESVKK----SFHLLGFSGSQT 143

           VD HLIPK++ +I ALG + QD   +  P RFDPDRF ++ +++    +F   GF+G +

Sbjct: 18  VDGHLIPKKTPIILALGTVFQDETIFPEPDRFDPDRFSDKQIEERSALAFQPFGFAGKRK 197

 

Query: 144 CPELRFAYTVATVLLSVLVRQLKLHRLKDTLMEVRSELVSTPRDETWI 191

           CP  R AY         +++   +       +++    V+ P +E WI

Sbjct: 198 CPGYRLAYAETLTYTFYIIKNFHISLFDKQSVKMHYGFVTKPSEEIWI 341

 

Note: the best match for this seq in Fugu, human and Ciona is CYP20

 

DYDIIVDGHLIPKKTPIILALGTVFQDETIFPEPDRFDPDRFSDKQIEERSALAFQPFGF

AGKRKCPGYRLAYAETLTYTFYIIKNFHISLFDKQSVKMHYGFVTKPSEEIWIKVLRRKNI*

 

 

 

>gnl|ti|647066038 1095898227332

          Length = 1123

 

 Score = 59.7 bits (143), Expect = 5e-08

 Identities = 66/274 (24%), Positives = 119/274 (43%), Gaps = 21/274 (7%)

 Frame = -3

 

Query: 226 PEAGSKRETEFLKHRRVLEDIIRRIIQERKEGEDLQELPFI-DSMLQ-NYDSE------D 277

           P A S+   E ++ R   + I++R +QE ++  D   L  I D++++ + DSE      +

Sbjct: 866 PTATSRNIFEIIRLR---DPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGEELTE 696

 

Query: 278 KIIADAISF-----MVGGFHTSGYMFTWMLWYLSSHPESQDXXXXXXXXXXXXXXXXXXK 332

           KI  D I F     M+ G  TS     W + Y+   PE Q+                  K

Sbjct: 695 KITDDNIEFLLNDFMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLK 516

 

Query: 333 EYSLRADTFLRQVQDETIRLSTLAPWA-ARYSDKKVTVCGYTIPAKTPMIHALGVGLKNK 391

           +  +     ++    ET+RLS++ P      + +  ++CG  +P    ++  L     ++

Sbjct: 515 DRPML--HLMQAAIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDE 342

 

Query: 392 TVWENTDSWDPDRFSP-----NGRRGNDFCPFGVHSRRKCPGYLFSYFEVGVFASILLS- 445

           + W+N  S+ P+R+       N + G  + PF  +  R C G   +  E+ VF + LL 

Sbjct: 341 SYWKNAMSFYPERWLEKSGEFNYKLGYAYLPFS-NGPRSCLGETLAKTELFVFITRLLKD 165

 

Query: 446 -RFEIVPVEGQTVIQVHGLVTEPKDDIKIYIRSR 478

            RFE+   +    +     +T P +D ++ I  R

Sbjct: 164 YRFEMPTGKELPCLDGRSGITSPPNDFEVVIIPR 63

 

37% to 2U1 fugu

866 PTATSRNIFEIIRLRDPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGEELTE 696

695 KITDDNIEFLLNDFMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLK 516

515 DRPMLHLMQAAIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDE 342

341 SYWKNAMSFYPERWLEKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKD 165

164 YRFEMPTGKELPCLDGRSGITSPPNDFEVVIIPR 63
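
Each coordinate-flanked peptide line above should span three nucleotides per residue, whichever strand it is on. A small Python sketch of that sanity check; check_span is an illustrative name, and the line layout (start coordinate, peptide, end coordinate) is assumed from the lines in this block. A mismatch could indicate a gap, a frameshift, or simply a transcription slip.

```python
import re

LINE_RE = re.compile(r"^\s*(\d+)\s+([A-Z*]+)\s+(\d+)\s*$")


def check_span(line):
    """Return (nucleotide_span, 3 * residues, consistent) for a 'start PEPTIDE end' line."""
    m = LINE_RE.match(line)
    if m is None:
        raise ValueError("line is not in 'start PEPTIDE end' form")
    start, pep, end = int(m.group(1)), m.group(2), int(m.group(3))
    span = abs(start - end) + 1          # inclusive, works for minus-strand hits too
    return span, 3 * len(pep), span == 3 * len(pep)


for text in (
    "866 PTATSRNIFEIIRLRDPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGEELTE 696",
    "164 YRFEMPTGKELPCLDGRSGITSPPNDFEVVIIPR 63",
):
    print(check_span(text))
```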

 

 

>gnl|ti|648017453 1095896110991

          Length = 1042

 

 Score = 52.0 bits (123), Expect = 1e-05

 Identities = 58/226 (25%), Positives = 95/226 (42%), Gaps = 19/226 (8%)

 Frame = -1

 

Query: 241 RVLEDIIRRIIQERKEGEDLQELPFIDSML--------QNYDSEDKIIADAISFMVG--- 289

           R+ + I++R +QE ++  D   L  I   L           DS  K+  D I F++   

Sbjct: 697 RLRDPILKRKLQEHRKSYDENNLRDITDALIKASIHLNAEKDSLIKVTDDNIQFILNDLI 518

 

Query: 290 --GFHTSGYMFTWMLWYLSSHPESQDXXXXXXXXXXXXXXXXXXKEYSLRADTFLRQVQD 347

             G  TS    TW + Y+  +PE QD                   +  L     L+   

Sbjct: 517 LAGSETSSSTITWFIVYMLHYPEYQDKIFNEVIKVTSGNRYPCLNDRPLL--HLLQATIH 344

 

Query: 348 ETIRLSTLAPWAARY-SDKKVTVCGYTIPAKTPMIHALGVGLKNKTVWENTDSWDPDRF- 405

           ET+RLS++AP   R+ + +  T+C   +   T +I  L     ++  W+N  S+ P+R+

Sbjct: 343 ETLRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERWL 164

 

Query: 406 SPNG----RRGNDFCPFGVHSRRKCPGYLFSYFEVGVFASILLSRF 447

           +  G    + GN + PF     R C G   +  E+ V  S L++ F

Sbjct: 163 NETGEFDYKLGNAYIPFS-GGPRACLGETLAKTELFVIISRLVTDF 29

38% to 17A1 fugu 37% to 2U1 fugu

697 RLRDPILKRKLQEHRKSYDENNLRDITDALIKASIHLNAEKDSLIKVTDDNIQFILNDLI 518

517 LAGSETSSSTITWFIVYMLHYPEYQDKIFNEVIKVTSGNRYPCLNDRPLLHLLQATIH 344

343 ETLRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERWL 164

163 NETGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDF 29

 

>gnl|ti|655005893 1095958068757

          Length = 952

 

 Score = 44.3 bits (103), Expect = 0.002

 Identities = 33/145 (22%), Positives = 59/145 (40%), Gaps = 5/145 (3%)

 Frame = -2

 

Query: 265 FIDSMLQNYDSEDKIIADAI-----SFMVGGFHTSGYMFTWMLWYLSSHPESQDXXXXXX 319

           F+D +L  Y  + KI  + I     +FM  G  T+     W LW L  +P+ Q      

Sbjct: 438 FLDLLLDIY-RKGKIDTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQKKLHKEI 262

 

Query: 320 XXXXXXXXXXXXKEYSLRADTFLRQVQDETIRLSTLAPWAARYSDKKVTVCGYTIPAKTP 379

                       K   +R   +L  +  E++R+    P   R  ++ +T+ G  +P   

Sbjct: 261 DEIELNGGSLYDK---VRQSKYLEIILKESLRMHPPVPMYGRTVEEDMTIDGQFVPKGAQ 91

 

Query: 380 MIHALGVGLKNKTVWENTDSWDPDR 404

           ++  + +   N   WEN + + P+R

Sbjct: 90  IVLLVLILHSNPDYWENPNDFIPER 16

44% to 4V5 fugu 42% to 4T5

438 FLDLLLDIYRKGKIDTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQKKLHKEI 262

261 DEIELNGGSLYDKVRQSKYLEIILKESLRMHPPVPMYGRTVEEDMTIDGQFVPKGAQ 91

90  IVLLVLILHSNPDYWENPNDFIPER 16

 

 

>gnl|ti|655009968 1095963046224

          Length = 1057

 

 Score = 42.0 bits (97), Expect = 0.010

 Identities = 21/47 (44%), Positives = 26/47 (55%)

 Frame = +2

46% to CYP20 35% to 27B1

Query: 65  GSLHQFLLHLHDNGKTPVTSFWWGKTHVVSFCSPQAFKESAVFVNRP 111

           G +H+FL+  H     P+ SF+WGK   VS   P  FKE A   NRP

Sbjct: 422 GGIHKFLVENHKR-LGPMFSFYWGKELAVSLACPILFKEVATLFNRP 559

 

 

>gnl|ti|646862798 1095898098005

          Length = 963

 

 Score = 41.2 bits (95), Expect = 0.018

 Identities = 56/246 (22%), Positives = 99/246 (40%), Gaps = 18/246 (7%)

 Frame = -3

 

Query: 235 EFLKHRRVLEDIIRRIIQE--RKEGEDLQELPFIDSMLQN----YDSEDKIIADA----- 283

           E  + R VL   + + +++  RK  E+ + +   DSM+Q+    YD  D   A      

Sbjct: 793 EIKQARLVLNPWVEKKVEDHWRKYNEN-EIINVTDSMIQHFLTKYDGLDTDFAKKYITLL 617

 

Query: 284 -ISFMVGGFHTSGYMFTWMLWYLSSHPESQDXXXXXXXXXXXXXXXXXXKEYSLRADTFL 342

            I  +V G  T+     WM+ YL  +PE Q+                  ++        L

Sbjct: 616 LIELLVAGTETTAITICWMVLYLIHNPEYQEEIYKEITLNIGCRYPTLAEK---NLFPLL 446

 

Query: 343 RQVQDETIRLSTLAPW-AARYSDKKVTVCGYTIPAKTPMIHALGVGLKNKTVWENTDSWD 401

           +    ET+R++++ P   A  + K  ++CG  IP    +I  L     +   ++N + +D

Sbjct: 445 QAFIQETLRITSVVPLNLAHKALKDTSICGKIIPKDAIVITNLWNLHHDNRYFKNPNEFD 266

 

Query: 402 PDRF-SPNGR----RGNDFCPFGVHSRRKCPGYLFSYFEVGVFASILLSRFEIVPVEGQT 456

           P R+ + NG         F PF     R C G   +  ++ +  S L+  F      G+

Sbjct: 265 PKRWINENGLFDSISQKYFKPFSA-GARVCLGETLAKNQLFLIISGLIMNFIFTSAPGKD 89

 

Query: 457 VIQVHG 462

           +  + G

Sbjct: 88  LPSLEG 71

35% to 17A1 34% to 2P4

793 EIKQARLVLNPWVEKKVEDHWRKYNEN-EIINVTDSMIQHFLTKYDGLDTDFAKKYITLL 617

616 LIELLVAGTETTAITICWMVLYLIHNPEYQEEIYKEITLNIGCRYPTLAEKNLFPLL 446

445 QAFIQETLRITSVVPLNLAHKALKDTSICGKIIPKDAIVITNLWNLHHDNRYFKNPNEFD 266

265 PKRWINENGLFDSISQKYFKPFSAGARVCLGETLAKNQLFLIISGLIMNFIFTSAPGKD 89

88  LPSLEG 71

 

>gnl|ti|651477674 1095901303788

          Length = 819

 

 Score = 38.5 bits (88), Expect = 0.11

 Identities = 40/175 (22%), Positives = 69/175 (39%), Gaps = 7/175 (4%)

 Frame = +2

 

Query: 289 GGFHTSGYMFTWMLWYLSSHPESQDXXXXXXXXXXXXXXXXXXKEYSLRADTFLRQVQDE 348

           G F T+    TW+++YL   P  Q                       +++   ++    E

Sbjct: 29  GWFRTTAATITWLIFYLLHWPHYQSILYKEIKNVCGDQYPTFN---DIKSMPIMQATILE 199

 

Query: 349 TIRLSTLAPWAARYSD-KKVTVCGYTIPAKTPMIHALGVGLKNKTVWENTDSWDPDRF-S 406

           T+RLS++ P +  +       +  +TIP  T +I  L     N+  WE    ++P R+ 

Sbjct: 200 TLRLSSVVPLSLSHKAVNNAKINKFTIPKDTIIITNLWGVHHNEKYWEKPFEFNPMRWLD 379

 

Query: 407 PNGRRGN----DFCPFGVHSRRKCPGYLFSYFEVGVFASILLSRFEI-VPVEGQT 456

            NG         + PF     R C G  F+  ++ +  S L+  F   +P  G+T

Sbjct: 380 KNGELSTAKRLGYFPFSA-GPRGCIGESFARMQMFIICSRLIKDFSFELPQSGET 541

39% to CYP21 39% to 2R1 40% to 2P4

29  GWFRTTAATITWLIFYLLHWPHYQSILYKEIKNVCGDQYPTFNDIKSMPIMQATILE 199

200 TLRLSSVVPLSLSHKAVNNAKINKFTIPKDTIIITNLWGVHHNEKYWEKPFEFNPMRWLD 379

380 KNGELSTAKRLGYFPFSAGPRGCIGESFARMQMFIICSRLIKDFSFELPQSGET 541

 

>gnl|ti|654998190 1095901734433

          Length = 1030

 

 Score = 38.1 bits (87), Expect = 0.15

 Identities = 48/200 (24%), Positives = 74/200 (37%), Gaps = 21/200 (10%)

 Frame = -2

 

Query: 197 FGDIFKDENELSKMAESYHVCWRTMEEGVPEAGSKRETEFLKHRRVLEDIIR-------R 249

           F DI K  NE S ++    + W       P A S+      K++  + +IIR       R

Sbjct: 870 FQDIIKTHNETSYISS---IPWLRY---FPTATSRNMIILNKNKYNIFEIIRLRDPILIR 709

 

Query: 250 IIQERKEGEDLQELPFIDSML--------QNYDSEDKIIADAISF-----MVGGFHTSGY 296

            +QE K   D   L  +   L           DS +KI  D   F     M+ G  TS 

Sbjct: 708 KLQEHKRTYD*SNLGDVTDALIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSN 529

 

Query: 297 MFTWMLWYLSSHPESQDXXXXXXXXXXXXXXXXXXKEYSLRADTFLRQVQDETIRLSTLA 356

           M  W + Y+   PE QD                   +        ++ +  ET+RL ++A

Sbjct: 528 MILWFIVYILHRPEYQDKLYDEIFKVFSGIGYPSLNDRP--RFHLIQAITHETLRLLSVA 355

 

Query: 357 PWA-ARYSDKKVTVCGYTIP 375

           P      + +  ++CG  +P

Sbjct: 354 PLGLCHKAMENGSICGKFVP 295

30% to CYP21 25% to 1C2

870 FQDIIKTHNETSYISSIPWLRYFPTATSRNMIILNKNKYNIFEIIRLRDPILIR 709

708 KLQEHKRTYD*SNLGDVTDALIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSN 529

528 MILWFIVYILHRPEYQDKLYDEIFKVFSGIGYPSLNDRPRFHLIQAITHETLRLLSVA 355

354 PLGLCHKAMENGSICGKFVP 295

 

>gnl|ti|651148169 1095901003210

          Length = 1130

 

 Score = 37.4 bits (85), Expect = 0.25

 Identities = 39/137 (28%), Positives = 54/137 (39%), Gaps = 20/137 (14%)

 Frame = -2

 

Query: 197 FGDIFKDENELSKMAESYHVCWRTMEEGVPEAGSKRETEFLKHRRVLEDIIR-------R 249

           F DI K  NE S ++    + W       P A S+      K++  + +IIR       R

Sbjct: 544 FQDIIKTHNETSYISS---IPWLRY---FPTATSQNMIILNKNKYNIFEIIRLRDPILIR 383

 

Query: 250 IIQERKEGEDLQELPFIDSML--------QNYDSEDKIIADAISF-----MVGGFHTSGY 296

            +QE K   D   L  +  +L           DS +KI  D   F     M+ G  TS 

Sbjct: 382 KLQEHKRTYD*SNLGDVTDVLIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSN 203

 

Query: 297 MFTWMLWYLSSHPESQD 313

           M  W + Y+   PE QD

Sbjct: 202 MILWFIVYILHRPEYQD 152

Same as above

544 FQDIIKTHNETSYISSIPWLRYFPTATSQNMIILNKNKYNIFEIIRLRDPILIR 383

382 KLQEHKRTYD*SNLGDVTDVLIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSN 203

202 MILWFIVYILHRPEYQD 152

 

 

  Database: fasta.hydra_magnipapillata.001

    Posted date:  May 16, 2005  8:55 AM

  Number of letters in database: 513,442,738

  Number of sequences in database:  500,000

 

 

gnl|ti|647066038 1095898227332     60   5e-08

gnl|ti|648017453 1095896110991     52   1e-05

gnl|ti|655005893 1095958068757     44   0.002

gnl|ti|655009968 1095963046224     42   0.010

gnl|ti|646862798 1095898098005     41   0.018

gnl|ti|651477674 1095901303788     39   0.11

gnl|ti|654998190 1095901734433     38   0.15

gnl|ti|651148169 1095901003210     37   0.25

 

CYP21 danio search

 

gnl|ti|647066038 1095898227332              177   2e-43

gnl|ti|649400787 1095898835518              153   3e-36

gnl|ti|648017453 1095896110991              150   2e-35

gnl|ti|647182814 1095899213949              142   8e-33

gnl|ti|646862798 1095898098005              141   1e-32

gnl|ti|651477674 1095901303788              141   1e-32

gnl|ti|647193621 1095899233960              133   3e-30

gnl|ti|647987527 1095895119635               97   4e-19

gnl|ti|651162328 1095901096755               96   9e-19

gnl|ti|654998190 1095901734433               91   2e-17

gnl|ti|655006784 1095958075467               81   2e-14

gnl|ti|651148169 1095901003210               74   3e-12

gnl|ti|648033522 1095897342515               72   8e-12

gnl|ti|647134594 1095899118747               72   8e-12

gnl|ti|648485307 1095899272864               71   2e-11

gnl|ti|648026854 1095896933215               70   4e-11

gnl|ti|651118815 1095900033599               70   4e-11

gnl|ti|655005893 1095958068757               60   5e-08

gnl|ti|647175227 1095898288652               45   5e-07

gnl|ti|648589386 1095733042694               56   8e-07

gnl|ti|649393684 1095898809307               54   2e-06

gnl|ti|646849327 1095897329284               51   2e-05

gnl|ti|646968536 1095898162561               49   9e-05

gnl|ti|647168675 1095899196297               48   2e-04

gnl|ti|649448444 1095899351259               33   0.079

gnl|ti|648014530 1095896049543               35   1.9 

gnl|ti|655009845 1095963045220               34   3.2 

gnl|ti|648592188 1095595897239               34   3.2 

gnl|ti|653058100 1095949490108               34   3.2 
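
A minimal sketch for reading a hit list like the one above into records, assuming each line has the form "gnl|ti|<trace id> <read id> <bit score> <E value>". The names HIT_LINE, parse_hits and SAMPLE are illustrative only, and the 1e-10 cutoff is an arbitrary example, not a threshold used in these notes.

```python
import re

# One hit-list line: trace id, read id, bit score, E value (assumed layout).
HIT_LINE = re.compile(r"^gnl\|ti\|(\d+)\s+(\d+)\s+(\d+)\s+(\S+)\s*$")


def parse_hits(text):
    """Return (trace_id, read_id, score, evalue) tuples for matching lines."""
    hits = []
    for line in text.splitlines():
        m = HIT_LINE.match(line.strip())
        if m:
            ti, read_id, score, evalue = m.groups()
            hits.append((ti, read_id, int(score), float(evalue)))
    return hits


# Three lines copied from the hit list above.
SAMPLE = """\
gnl|ti|647066038 1095898227332              177   2e-43
gnl|ti|655005893 1095958068757               60   5e-08
gnl|ti|648014530 1095896049543               35   1.9
"""

hits = parse_hits(SAMPLE)
# Example filter: keep only hits below an arbitrary E-value cutoff.
strong = [h for h in hits if h[3] < 1e-10]
print(len(hits), [h[1] for h in strong])
```
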

 

>gnl|ti|647066038 1095898227332

          Length = 1123

 

 Score =  177 bits (449), Expect = 2e-43

 Identities = 109/320 (34%), Positives = 168/320 (52%), Gaps = 8/320 (2%)

 Frame = -3

 

Query: 215  NVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFSRLMK 274

            N+I T+ F+  Y+  + E Q + +  + IV  +     S + S PLLR FP      + +

Sbjct: 1010 NIICTILFNHRYEDDNQEFQNIIKYSSLIVQTFNET--SYVSSIPLLRYFPTATSRNIFE 837

 

Query: 275  EVARRDELIGKHIEEFKKSEHKEG-GTLTSSLLKC-LEPQQGAANHXXXXXXXXXXXXXX 332

             +  RD ++ + ++E +KS  K     +T +L+K  L+ + G                 

Sbjct: 836  IIRLRDPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGEELTEKITDDNIEFLLND 657

 

Query: 333  XLIGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVL-DVRYPQYSDRHKLPYLCALIS 391

             +I G+ET ++ + W + ++LH PE Q+K+Y+E+  V  D RY    DR  L  + A I

Sbjct: 656  FMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLKDRPMLHLMQAAIH 477

 

Query: 392  EMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFL 451

            E LRL  V PL + H+A+ NSSI G F+PK  +I+ NL+  HHD   W +  SF PER+L

Sbjct: 476  ETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDESYWKNAMSFYPERWL 297

 

Query: 452  EGGGGSLRSL----IPFGGGARLCLGEAVAKMEMFLFTAYLLREFKF-LPASKEEPLPEL 506

            E  G     L    +PF  G R CLGE +AK E+F+F   LL++++F +P  KE  LP L

Sbjct: 296  EKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKDYRFEMPTGKE--LPCL 123

 

Query: 507  RGVASVVLKVKPYTVIAHPR 526

             G + +      + V+  PR

Sbjct: 122  DGRSGITSPPNDFEVVIIPR 63

34% to 17A1 35% to 2U1 fugu 33% to 2U1 human

1010 NIICTILFNHRYEDDNQEFQNIIKYSSLIVQTFNETSYVSSIPLLRYFPTATSRNIFE 837

836  IIRLRDPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGEELTEKITDDNIEFLLND 657

656  FMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLKDRPMLHLMQAAIH 477

476  ETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDESYWKNAMSFYPERWL 297

296  EKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKDYRFEMPTGKELPCL 123

122  DGRSGITSPPNDFEVVIIPR 63
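
The extracted fragments above keep the subject nucleotide coordinates on either side of the peptide. Because the hits are translated reads, the nucleotide span should equal three times the residue count once alignment gaps are removed. The sketch below checks that for one line; span_matches_length is a hypothetical helper name, and a stop codon written as "*" is assumed to count as one residue.

```python
def span_matches_length(line):
    """Check that |end - start| + 1 equals 3 x residues for one fragment line.

    Expected layout (assumed): "<start> <peptide> <end>".
    Gap characters ("-") are dropped before counting residues.
    """
    parts = line.split()
    start, seq, end = int(parts[0]), parts[1], int(parts[2])
    residues = seq.replace("-", "")
    return abs(end - start) + 1 == 3 * len(residues)


# First fragment line of read 1095898227332, copied from above.
LINE = "1010 NIICTILFNHRYEDDNQEFQNIIKYSSLIVQTFNETSYVSSIPLLRYFPTATSRNIFE 837"
ok = span_matches_length(LINE)
print(ok)
```

A line that fails this check usually means a residue was lost or duplicated when the gapped alignment row was copied into the notes.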

 

>gnl|ti|649400787 1095898835518

          Length = 1120

 

 Score =  153 bits (387), Expect = 3e-36

 Identities = 97/309 (31%), Positives = 161/309 (52%), Gaps = 11/309 (3%)

 Frame = +2

 

Query: 211 VASSNVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFS 270

           VA  NVI ++ F K Y+  + E +++   +N + +  G    +A+   P LR  P    

Sbjct: 44  VAVLNVICSIVFGKRYEYENCEFKEILTYMNYVFT--GVAGTNAISFIPWLRFLPLDGLR 217

 

Query: 271 RLMKEVARRDELIGK----HIEEFKKSEHKEGGTLTSSLLKCLEPQQGAANHXXXXXXXX 326

           +L K ++ RD ++ K    H E + +S  ++    T  +++    +             

Sbjct: 218 KLKKGLSIRDPVLRKQLLYHRETYNESNLRD---YTDYVIQFSRDEAILKKFGEQLTDDY 388

 

Query: 327 XXXXXXXL-IGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDV-RYPQYSDRHKLP 384

                  + I GTET    L W++ +L+H P+ QDK+Y E+   +   RYP   DR+ LP

Sbjct: 389 LELLLNDIFIAGTETALTTLLWSIIYLIHWPKFQDKIYNEIVSAIGKNRYPSMKDRNMLP 568

 

Query: 385 YLCALISEMLRLRPVAPLAVPHRAIRNSSIAGHF-IPKNTIIIPNLYGAHHDPEVWDDPY 443

            + A +SE LRL  V PL VPH+A+ ++++     IPK T I+ NL+  HH+   W++P+

Sbjct: 569 LVNAALSETLRLSSVTPLGVPHKAMEDTTLLNDLKIPKGTTILTNLWQLHHNKNCWENPH 748

 

Query: 444 SFKPERFLEGGG--GSLRSL--IPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASK 499

            F P R+        S++S+  +PF  G R+CLG+ +A++E+FLF + L+R+FKF    

Sbjct: 749 EFNPYRWFTNDQTLDSIKSMNFLPFSAGTRVCLGKGIAEVELFLFYSRLVRDFKF-EVKP 925

 

Query: 500 EEPLPELRG 508

            + LP L G

Sbjct: 926 GDSLPSLYG 952

>gnl|ti|649400787 1095898835518 44% to 1095898227332, 39% to 17A1 fugu 35% to 2U1

44  VAVLNVICSIVFGKRYEYENCEFKEILTYMNYVFTGVAGTNAISFIPWLRFLPLDGLR 217

218 KLKKGLSIRDPVLRKQLLYHRETYNESNLRDYTDYVIQFSRDEAILKKFGEQLTDDY 388

389 LELLLNDIFIAGTETALTTLLWSIIYLIHWPKFQDKIYNEIVSAIGKNRYPSMKDRNMLP 568

569 LVNAALSETLRLSSVTPLGVPHKAMEDTTLLNDLKIPKGTTILTNLWQLHHNKNCWENPH 748

749 EFNPYRWFTNDQTLDSIKSMNFLPFSAGTRVCLGKGIAEVELFLFYSRLVRDFKFEVKP 925

926 GDSLPSLYG 952

 

 

>gnl|ti|648017453 1095896110991

          Length = 1042

 

 Score =  150 bits (380), Expect = 2e-35

 Identities = 98/286 (34%), Positives = 150/286 (52%), Gaps = 8/286 (2%)

 Frame = -1

 

Query: 215 NVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFSRLMK 274

           N+I T+ F++ Y++   E Q + +  N     + +   + L S P LR FP    S+ ++

Sbjct: 877 NIICTILFNQRYEQDDDEFQNIIKYSNLSFKAFSAS--NLLSSIPWLRYFPTTA-SKYIQ 707

 

Query: 275 EVAR-RDELIGKHIEEFKKSEHKEG-GTLTSSLLKCLEPQQGAANHXXXXXXXXXXXXXX 332

           E+ R RD ++ + ++E +KS  +     +T +L+K         +               

Sbjct: 706 EIERLRDPILKRKLQEHRKSYDENNLRDITDALIKASIHLNAEKDSLIKVTDDNIQFILN 527

 

Query: 333 XLI-GGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDV-RYPQYSDRHKLPYLCALI 390

            LI  G+ET ++ + W + ++LH PE QDK++ E+  V    RYP  +DR  L  L A I

Sbjct: 526 DLILAGSETSSSTITWFIVYMLHYPEYQDKIFNEVIKVTSGNRYPCLNDRPLLHLLQATI 347

 

Query: 391 SEMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERF 450

            E LRL  VAPL + H+A+ NS+I    + K T+II NL+  HHD   W +P SF PER+

Sbjct: 346 HETLRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERW 167

 

Query: 451 LEGGGGSLRSL----IPFGGGARLCLGEAVAKMEMFLFTAYLLREF 492

           L   G     L    IPF GG R CLGE +AK E+F+  + L+ +F

Sbjct: 166 LNETGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDF 29

 

877 NIICTILFNQRYEQDDDEFQNIIKYSNLSFKAFSASNLLSSIPWLRYFPTTASKYIQ 707

706 EIERLRDPILKRKLQEHRKSYDENNLRDITDALIKASIHLNAEKDSLIKVTDDNIQFILN 527

526 DLILAGSETSSSTITWFIVYMLHYPEYQDKIFNEVIKVTSGNRYPCLNDRPLLHLLQATI 347

346 HETLRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERW 167

166 LNETGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDF 29

 

>gnl|ti|647182814 1095899213949

          Length = 1074

 

 Score =  142 bits (357), Expect = 8e-33

 Identities = 89/291 (30%), Positives = 145/291 (49%), Gaps = 7/291 (2%)

 Frame = +2

 

Query: 211 VASSNVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFS 270

           VA  NVI  + F + Y  S     ++   +N IVS  G    +A+D  P LR       

Sbjct: 83  VAVLNVICFIVFGERYQYSDPAFIEILTTINNIVS--GLSNTTAVDFLPGLRYLQFSEIK 256

 

Query: 271 RLMKEVARRDELIGKHIEEFKKS-EHKEGGTLTSSLLKCLEPQQGAANHXXXXXXXXXXX 329

           +L   +     L+   +++ KK+ +       T S++K  + +                

Sbjct: 257 KLKSSLVIYFRLLNDQLKKHKKTFDENNIRDFTDSIIKFSKDETMENKFEEELTDEHLEH 436

 

Query: 330 XXXXL-IGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVL-DVRYPQYSDRHKLPYLC 387

               + I G+ET    L W + +++H P+ Q++++EE+  V+ + RYPQ SDR  L  +

Sbjct: 437 VIGDMFIAGSETTLTSLLWLIIYMIHYPKYQEEIFEEITRVIGENRYPQLSDRDSLHLVK 616

 

Query: 388 ALISEMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKP 447

           A I E LRL  + PL VPH+ + ++++ G+ IPKNT +I N +  H+D   W +P  F P

Sbjct: 617 ASIKECLRLSSIIPLGVPHKTMSDTTLIGYNIPKNTTVIINHWQIHNDTNHWKNPNEFNP 796

 

Query: 448 ERFLEG----GGGSLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKF 494

            R+++           S +PF  G R+CLG+ VA+ E+F F   L+R+FKF

Sbjct: 797 HRWIDDDSKFDATRATSYLPFSAGTRVCLGKTVAETELFFFFTRLIRDFKF 949

 

>gnl|ti|647182814 1095899213949

54% to 1095898835518, 36% to 17A1 36% to 2U1

83  VAVLNVICFIVFGERYQYSDPAFIEILTTINNIVSGLSNTTAVDFLPGLRYLQFSEIK 256

257 KLKSSLVIYFRLLNDQLKKHKKTFDENNIRDFTDSIIKFSKDETMENKFEEELTDEHLEH 436

437 VIGDMFIAGSETTLTSLLWLIIYMIHYPKYQEEIFEEITRVIGENRYPQLSDRDSLHLVK 616

617 ASIKECLRLSSIIPLGVPHKTMSDTTLIGYNIPKNTTVIINHWQIHNDTNHWKNPNEFNP 796

797 HRWIDDDSKFDATRATSYLPFSAGTRVCLGKTVAETELFFFFTRLIRDFKF 949

 

>gnl|ti|646862798 1095898098005

          Length = 963

 

 Score =  141 bits (356), Expect = 1e-32

 Identities = 71/193 (36%), Positives = 113/193 (58%), Gaps = 4/193 (2%)

 Frame = -3

 

Query: 334 LIGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDVRYPQYSDRHKLPYLCALISEM 393

           L+ GTET A  + W V +L+H PE Q+++Y+E+   +  RYP  ++++  P L A I E

Sbjct: 604 LVAGTETTAITICWMVLYLIHNPEYQEEIYKEITLNIGCRYPTLAEKNLFPLLQAFIQET 425

 

Query: 394 LRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLEG 453

           LR+  V PL + H+A++++SI G  IPK+ I+I NL+  HHD   + +P  F P+R++ 

Sbjct: 424 LRITSVVPLNLAHKALKDTSICGKIIPKDAIVITNLWNLHHDNRYFKNPNEFDPKRWINE 245

 

Query: 454 GG----GSLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLPELRGV 509

            G     S +   PF  GAR+CLGE +AK ++FL  + L+  F F  A  ++ LP L G

Sbjct: 244 NGLFDSISQKYFKPFSAGARVCLGETLAKNQLFLIISGLIMNFIFTSAPGKD-LPSLEGQ 68

 

Query: 510 ASVVLKVKPYTVI 522

             +  +   + V+

Sbjct: 67  FGITFRPNSFKVL 29

 

 

>gnl|ti|651477674 1095901303788

          Length = 819

 

 Score =  141 bits (355), Expect = 1e-32

 Identities = 75/196 (38%), Positives = 111/196 (56%), Gaps = 5/196 (2%)

 Frame = +2

 

Query: 336 GGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDVRYPQYSDRHKLPYLCALISEMLR 395

           G   T AA + W + +LLH P  Q  +Y+E+  V   +YP ++D   +P + A I E LR

Sbjct: 29  GWFRTTAATITWLIFYLLHWPHYQSILYKEIKNVCGDQYPTFNDIKSMPIMQATILETLR 208

 

Query: 396 LRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLEGGG 455

           L  V PL++ H+A+ N+ I    IPK+TIII NL+G HH+ + W+ P+ F P R+L+  G

Sbjct: 209 LSSVVPLSLSHKAVNNAKINKFTIPKDTIIITNLWGVHHNEKYWEKPFEFNPMRWLDKNG 388

 

Query: 456 ----GSLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKF-LPASKEEPLPELRGVA 510

                      PF  G R C+GE+ A+M+MF+  + L+++F F LP S E   P+L G 

Sbjct: 389 ELSTAKRLGYFPFSAGPRGCIGESFARMQMFIICSRLIKDFSFELPQSGE--TPKLDGDI 562

 

Query: 511 SVVLKVKPYTVIAHPR 526

            + L   PY  +A  R

Sbjct: 563 GITLTPLPYNAVAKQR 610

 

29  GWFRTTAATITWLIFYLLHWPHYQSILYKEIKNVCGDQYPTFNDIKSMPIMQATILETLR 208

209 LSSVVPLSLSHKAVNNAKINKFTIPKDTIIITNLWGVHHNEKYWEKPFEFNPMRWLDKNG 388

389 ELSTAKRLGYFPFSAGPRGCIGESFARMQMFIICSRLIKDFSFELPQSGETPKLDGDI 562

563 GITLTPLPYNAVAKQR 610

 

 

>gnl|ti|647193621 1095899233960

          Length = 1050

 

 Score =  133 bits (335), Expect = 3e-30

 Identities = 96/310 (30%), Positives = 149/310 (48%), Gaps = 10/310 (3%)

 Frame = +2

 

Query: 215 NVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFSRLMK 274

           NV+  + F   Y+++  EL+K+      I+   G     A+   P LR FP+    ++ K

Sbjct: 110 NVLCGIVFGTQYEENDKELEKVISFKQLILD--GVADTFAISFLPWLRFFPSNGLKKVRK 283

 

Query: 275 EVARRDELIG----KHIEEFKKSEHKEGGTLTSSLLKCLEPQQGAANHXXXXXXXXXXXX 330

            V  RD+L+     KH E +   + ++    T  +LK  +  + + N            

Sbjct: 284 GVLIRDKLLRFQLKKHRETYNPVQIRD---YTDYVLKYSKEFETSRNIDEQLSEDNMEMM 454

 

Query: 331 XXXL-IGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVL-DVRYPQYSDRHKLPYLCA 388

              + I G+ET  + L W   +L++ P+ QD +Y+E   ++ + RYP  SDR KL    +

Sbjct: 455 LQDIFISGSETTISTLLWFAVYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFES 634

 

Query: 389 LISEMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPE 448

            + E LRL  V PL +PHR++  +SI    IPKNT ++ NL+  HHD + W DP++F P

Sbjct: 635 AMKETLRLSSVIPLGLPHRSLEETSIKKFKIPKNTNVMINLWQLHHDSKSWSDPHTFNPY 814

 

Query: 449 RFLEGGG----GSLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLP 504

           R+L            + +PF  G R CLG    +  +FLF   L+R+F  L        P

Sbjct: 815 RWLNDKNIFDKSKNPNYLPFSTGLRACLGYHTTESIIFLFFTRLIRDFN-LCLKPGASTP 991

 

Query: 505 ELRGVASVVL 514

            L GV  V L

Sbjct: 992 SLNGVLRVTL 1021

>gnl|ti|647193621 1095899233960

50% to 1095898835518 37% to 17A1

110 NVLCGIVFGTQYEENDKELEKVISFKQLILDGVADTFAISFLPWLRFFPSNGLKKVRK 283

284 GVLIRDKLLRFQLKKHRETYNPVQIRDYTDYVLKYSKEFETSRNIDEQLSEDNMEMM 454

455 LQDIFISGSETTISTLLWFAVYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFES 634

635 AMKETLRLSSVIPLGLPHRSLEETSIKKFKIPKNTNVMINLWQLHHDSKSWSDPHTFNPY 814

815 RWLNDKNIFDKSKNPNYLPFSTGLRACLGYHTTESIIFLFFTRLIRDFNLCLKPGASTP 991

992 SLNGVLRVTL 1021

 

>gnl|ti|647987527 1095895119635

          Length = 1003

 

 Score = 96.7 bits (239), Expect = 4e-19

 Identities = 57/137 (41%), Positives = 74/137 (54%), Gaps = 4/137 (2%)

 Frame = +3

 

Query: 394 LRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLEG 453

           LRL  VAPL + H+A+ NS+I    + K T+II NL+  HHD   W +P SF PER+L 

Sbjct: 18  LRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERWLNE 197

 

Query: 454 GGGSLRSL----IPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLPELRGV 509

            G     L    IPF GG R CLGE +AK E+F+  + L+ +F F   S EE LP L  

Sbjct: 198 TGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDFYF-EKSVEEDLPRLDSF 374

 

Query: 510 ASVVLKVKPYTVIAHPR 526

             V      + V+   R

Sbjct: 375 PGVTRSPYDFKVVVVSR 425

 

>gnl|ti|647987527 1095895119635

Same as 1095896110991

18  LRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERWLNE 197

198 TGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDFYFEKSVEEDLPRLDSF 374

375 PGVTRSPYDFKVVVVSR 425

 

>gnl|ti|651162328 1095901096755

          Length = 986

 

 Score = 95.5 bits (236), Expect = 9e-19

 Identities = 59/181 (32%), Positives = 94/181 (51%), Gaps = 5/181 (2%)

 Frame = -1

 

Query: 334 LIGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLD-VRYPQYSDRHKLPYLCALISE 392

           +I G+ET + ++ W + ++LHRPE QDK+Y+E+  V   + YP  +DR +   + A+I E

Sbjct: 752 MIAGSETSSNMILWFIVYILHRPEYQDKLYDEIFKVFSGIGYPSLNDRPRFHLIQAIIHE 573

 

Query: 393 MLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLE 452

            LRL  VAPL + H+A+ N SI G F+PK       L        + +    F      

Sbjct: 572 TLRLLSVAPLGLCHKALENGSICGKFVPKGASYSD*LMEYTS**ALLEKCNKFFS*ALAR 393

 

Query: 453 GGGG----SLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLPELRG 508

             G         +  F GG R CLGE +AK E+ +F + L+++++F   + ++ L    G

Sbjct: 392 *FGQF*L*FRLCIFVFSGGPRSCLGETLAKTELLVFISRLVKDYRFEKPNGKDSLDGRSG 213

 

Query: 509 V 509

           V

Sbjct: 212 V 210

>gnl|ti|651162328 1095901096755

2 aa diffs to 1095901734433

752 MIAGSETSSNMILWFIVYILHRPEYQDKLYDEIFKVFSGIGYPSLNDRPRFHLIQAIIHE 573

572 TLRLLSVAPLGLCHKALENGSICGKFVPKGASYSD*LMEYTS**ALLEKCNKFFS*ALAR 393

392 *FGQF*L*FRLCIFVFSGGPRSCLGETLAKTELLVFISRLVKDYRFEKPNGKDSLDGRSG 213

212 V 210
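
A small sketch of how the "2 aa diffs" count can be reproduced. It compares the part of read 1095901096755 before its first stop codon with the overlapping fragment of read 1095901734433, both copied from the extracted fragments in these notes. The function name aa_differences is illustrative, and the comparison assumes the two fragments are already in register (no gaps).

```python
def aa_differences(a, b):
    """Return (1-based position, residue in a, residue in b) for each mismatch."""
    if len(a) != len(b):
        raise ValueError("fragments must be the same length")
    return [(i + 1, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


# Read 1095901096755, frame -1, up to the frameshifted region.
READ_A = (
    "MIAGSETSSNMILWFIVYILHRPEYQDKLYDEIFKVFSGIGYPSLNDRPRFHLIQAIIHE"
    "TLRLLSVAPLGLCHKALENGSICGKFVP"
)
# Read 1095901734433, frame -2, same region.
READ_B = (
    "MIAGSETSSNMILWFIVYILHRPEYQDKLYDEIFKVFSGIGYPSLNDRPRFHLIQAITHE"
    "TLRLLSVAPLGLCHKAMENGSICGKFVP"
)

diffs = aa_differences(READ_A, READ_B)
print(len(diffs), diffs)
```
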

 

>gnl|ti|654998190 1095901734433

          Length = 1030

 

 Score = 90.9 bits (224), Expect = 2e-17

 Identities = 57/182 (31%), Positives = 93/182 (51%), Gaps = 13/182 (7%)

 Frame = -2

 

Query: 253 SALDSFPLLRKFP----------NPPFSRLMKEVARRDELIGKHIEEFKKS-EHKEGGTL 301

           S + S P LR FP          N     + + +  RD ++ + ++E K++ +    G +

Sbjct: 837 SYISSIPWLRYFPTATSRNMIILNKNKYNIFEIIRLRDPILIRKLQEHKRTYD*SNLGDV 658

 

Query: 302 TSSLLKC-LEPQQGAANHXXXXXXXXXXXXXXXLIGGTETIAALLNWTVAFLLHRPEVQD 360

           T +L+K  LE      +H               +I G+ET + ++ W + ++LHRPE QD

Sbjct: 657 TDALIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSNMILWFIVYILHRPEYQD 478

 

Query: 361 KVYEELCCVLD-VRYPQYSDRHKLPYLCALISEMLRLRPVAPLAVPHRAIRNSSIAGHFI 419

           K+Y+E+  V   + YP  +DR +   + A+  E LRL  VAPL + H+A+ N SI G F+

Sbjct: 477 KLYDEIFKVFSGIGYPSLNDRPRFHLIQAITHETLRLLSVAPLGLCHKAMENGSICGKFV 298

 

Query: 420 PK 421

           PK

Sbjct: 297 PK 292

 

 

 

 Score = 73.6 bits (179), Expect = 4e-12

 Identities = 41/107 (38%), Positives = 61/107 (57%), Gaps = 5/107 (4%)

 Frame = -3

 

Query: 410 RNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLEGGGGSLRSL----IPFG 465

           R     G+  P+  +I+ NL   HHD   W +  +F PER+L+  G    +L    +PF

Sbjct: 326 RTVVFVGNLFPRELLILTNL*SIHHDECYWKNAINFFPERWLDNSGNFNYNLGYAYLPFS 147

 

Query: 466 GGARLCLGEAVAKMEMFLFTAYLLREFKF-LPASKEEPLPELRGVAS 511

           GG R CLGE +AK E+F+F + L+++++F  P  KE  LP L G +S

Sbjct: 146 GGPRSCLGETLAKTELFVFISRLVKDYRFEKPNGKE--LPSLDGRSS 12

 

837 SYISSIPWLRYFPTATSRNMIILNKNKYNIFEIIRLRDPILIRKLQEHKRTYD*SNLGDV 658

657 TDALIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSNMILWFIVYILHRPEYQD 478

477 KLYDEIFKVFSGIGYPSLNDRPRFHLIQAITHETLRLLSVAPLGLCHKAMENGSICGKFV 298

297 PKELLILTNL*SIHHDECYWKNAINFFPERWLDNSGNFNYNLGYAYLPFS 147

146 GGPRSCLGETLAKTELFVFISRLVKDYRFEKPNGKELPSLDGRSS 12

 

>gnl|ti|655006784 1095958075467

          Length = 931

 

 Score = 80.9 bits (198), Expect = 2e-14

 Identities = 46/141 (32%), Positives = 73/141 (51%), Gaps = 6/141 (4%)

 Frame = -2

 

Query: 392 EMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFL 451

           E LRL  + PL VPH+ + ++++  + +     +I N +  H+D   W +P    P R++

Sbjct: 744 ECLRLSSIIPLGVPHKTMSDTTLIAYTLNIYGTVIINHWQIHNDTNHWKNPNESTPHRWI 565

 

Query: 452 EGGGG----SLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKF--LPASKEEPLPE 505

           +           S +PF  G R+CLG+ VA+ E+F F   L+R+FKF  +P     PLP

Sbjct: 564 DDDSKFDATRATSYLPFSAGTRVCLGKTVAETELFFFFTRLIRDFKFEGVPGC---PLPS 394

 

Query: 506 LRGVASVVLKVKPYTVIAHPR 526

           L G  S+ L  + + V   PR

Sbjct: 393 LIGKCSITLAPEEFNVHVTPR 331

>gnl|ti|655006784 1095958075467

Same as 1095899213949

744 ECLRLSSIIPLGVPHKTMSDTTLIAYTLNIYGTVIINHWQIHNDTNHWKNPNESTPHRWI 565

564 DDDSKFDATRATSYLPFSAGTRVCLGKTVAETELFFFFTRLIRDFKFEGVPGCPLPS 394

393 LIGKCSITLAPEEFNVHVTPR 331

 

>gnl|ti|651148169 1095901003210

          Length = 1130

 

 Score = 73.9 bits (180), Expect = 3e-12

 Identities = 50/170 (29%), Positives = 83/170 (48%), Gaps = 13/170 (7%)

 Frame = -2

 

Query: 253 SALDSFPLLRKFP----------NPPFSRLMKEVARRDELIGKHIEEFKKS-EHKEGGTL 301

           S + S P LR FP          N     + + +  RD ++ + ++E K++ +    G +

Sbjct: 511 SYISSIPWLRYFPTATSQNMIILNKNKYNIFEIIRLRDPILIRKLQEHKRTYD*SNLGDV 332

 

Query: 302 TSSLLKC-LEPQQGAANHXXXXXXXXXXXXXXXLIGGTETIAALLNWTVAFLLHRPEVQD 360

           T  L+K  LE      +H               +I G+ET + ++ W + ++LHRPE QD

Sbjct: 331 TDVLIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSNMILWFIVYILHRPEYQD 152

 

Query: 361 KVYEELCCVLD-VRYPQYSDRHKLPYLCALISEMLRLRPVAPLAVPHRAI 409

           K+Y+E+  V   + YP  +DR +   + A+I E LRL  VAPL   H+ +

Sbjct: 151 KLYDEIFKVFSGIGYPSLNDRPRFHLIQAIIHETLRLLSVAPLG*SHKPV 2

 

 

>gnl|ti|648033522 1095897342515

          Length = 1108

 

 Score = 72.4 bits (176), Expect = 8e-12

 Identities = 46/143 (32%), Positives = 80/143 (55%), Gaps = 2/143 (1%)

 Frame = +3

 

Query: 69  GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128

           GP  LP++GN L L            +K YG ++ L+ G    +V+++  + IRE LV+K

Sbjct: 153 GPFPLPIIGN-LHLIGKKPHEKFVEYSKKYGEVFSLSFGM-HRVVIVSGKDSIREVLVQK 326

 

Query: 129 WSDFAGRPYSYTGXDIVSGGGRTISLGDFSEEWKAHRRVTHSALQRC--TTDSLHSVIEK 186

            + FAGRP +Y   +IVS G + I  GD   +WK  R++ HS+L+    +T  L +++ K

Sbjct: 327 SNIFAGRPKNYIA-NIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK 503

 

Query: 187 QAQHLCQVLRDYSGKAVDLSEDF 209

           +++ L + L     ++ +L ++F

Sbjct: 504 ESEELHKRLFKNCNRSTELEDEF 572

 

>gnl|ti|648033522 1095897342515 39% to 17A1 N-term

153 GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK 326

327 SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK 503

504 ESEELHKRLFKNCNRSTELEDEF 572

 

>gnl|ti|647134594 1095899118747

          Length = 1050

 

 Score = 72.4 bits (176), Expect = 8e-12

 Identities = 46/143 (32%), Positives = 80/143 (55%), Gaps = 2/143 (1%)

 Frame = -1

 

Query: 69  GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128

           GP  LP++GN L L            +K YG ++ L+ G    +V+++  + IRE LV+K

Sbjct: 612 GPFPLPIIGN-LHLIGKKPHEKFVEYSKKYGEVFSLSFGM-HRVVIVSGKDSIREVLVQK 439

 

Query: 129 WSDFAGRPYSYTGXDIVSGGGRTISLGDFSEEWKAHRRVTHSALQRC--TTDSLHSVIEK 186

            + FAGRP +Y   +IVS G + I  GD   +WK  R++ HS+L+    +T  L +++ K

Sbjct: 438 SNIFAGRPKNYIA-NIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK 262

 

Query: 187 QAQHLCQVLRDYSGKAVDLSEDF 209

           +++ L + L     ++ +L ++F

Sbjct: 261 ESEELHKRLFKNCNRSTELEDEF 193

 

612 GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK 439

438 SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK 262

261 ESEELHKRLFKNCNRSTELEDEF 193

 

>gnl|ti|648485307 1095899272864

          Length = 944

 

 Score = 70.9 bits (172), Expect = 2e-11

 Identities = 46/143 (32%), Positives = 80/143 (55%), Gaps = 2/143 (1%)

 Frame = -2

 

Query: 69  GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128

           GP  +P+ GN+  L  +   I L A +K YG ++ ++ G    +V++++    REALV+K

Sbjct: 799 GPFPIPIFGNLHLLGTEPHKI-LAAYSKKYGAVFSISLG-LQRIVIISDITTTREALVQK 626

 

Query: 129 WSDFAGRPYSYTGXDIVSGGGRTISLGDFSEEWKAHRRVTHSALQRCTT--DSLHSVIEK 186

            S FAGRP SY    ++S G + I+  D+   WK  R+V+HS+L+      +    ++ K

Sbjct: 625 ASIFAGRPKSYL-IQLISSGYKGIAFMDYGSFWKVLRKVSHSSLKIYGEGHERFEKILTK 449

 

Query: 187 QAQHLCQVLRDYSGKAVDLSEDF 209

           +++ L + L   S  +V+L  +F

Sbjct: 448 ESEELHKRLLKKSNNSVELKSEF 380

 

>gnl|ti|648485307 1095899272864 57% to 1095897342515

799 GPFPIPIFGNLHLLGTEPHKILAAYSKKYGAVFSISLGLQRIVIISDITTTREALVQK 626

625 ASIFAGRPKSYLIQLISSGYKGIAFMDYGSFWKVLRKVSHSSLKIYGEGHERFEKILTK 449

448 ESEELHKRLLKKSNNSVELKSEF 380

 

>gnl|ti|648026854 1095896933215

          Length = 1081

 

 Score = 70.1 bits (170), Expect = 4e-11

 Identities = 46/143 (32%), Positives = 78/143 (54%), Gaps = 2/143 (1%)

 Frame = +1

 

Query: 69  GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128

           GP  LP++GN L L            +K YG ++ L+ G    +V+++  + IRE LV+K

Sbjct: 175 GPFPLPIIGN-LHLIGKKPHEKFVEYSKKYGEVFSLSFGM-HRVVIVSGKDSIREVLVQK 348

 

Query: 129 WSDFAGRPYSYTGXDIVSGGGRTISLGDFSEEWKAHRRVTHSALQRC--TTDSLHSVIEK 186

            + FAGRP +Y   +IVS G + I  GD   +WK  R++ HS+L+    +T  L +++ +

Sbjct: 349 SNIFAGRPKNYIA-NIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTAHLETLVVR 525

 

Query: 187 QAQHLCQVLRDYSGKAVDLSEDF 209

           +++ L + L   S ++  L   F

Sbjct: 526 ESEELHKNLYKKSNRSTKLEHKF 594

 

175 GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK 348

349 SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTAHLETLVVR 525

526 ESEELHKNLYKKSNRSTKLEHKF 594

 

>gnl|ti|651118815 1095900033599

          Length = 1071

 

 Score = 70.1 bits (170), Expect = 4e-11

 Identities = 46/143 (32%), Positives = 78/143 (54%), Gaps = 2/143 (1%)

 Frame = -1

 

Query: 69  GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128

           GP  LP++GN L L            +K YG ++ L+ G    +V+++  + IRE LV+K

Sbjct: 516 GPFPLPIIGN-LHLIGKKPHEKFVEYSKKYGEVFSLSFGM-HRVVIVSGKDSIREVLVQK 343

 

Query: 129 WSDFAGRPYSYTGXDIVSGGGRTISLGDFSEEWKAHRRVTHSALQRC--TTDSLHSVIEK 186

            + FAGRP +Y   +IVS G + I  GD   +WK  R++ HS+L+    +T  L +++ +

Sbjct: 342 SNIFAGRPKNYIA-NIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTAHLETLVVR 166

 

Query: 187 QAQHLCQVLRDYSGKAVDLSEDF 209

           +++ L + L   S ++  L   F

Sbjct: 165 ESEELHKNLYKKSNRSTKLEHKF 97

 

516 GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK 343

342 SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTAHLETLVVR 166

165 ESEELHKNLYKKSNRSTKLEHKF 97

 

>gnl|ti|655005893 1095958068757

          Length = 952

 

 Score = 59.7 bits (143), Expect = 5e-08

 Identities = 70/308 (22%), Positives = 122/308 (39%), Gaps = 8/308 (2%)

 Frame = -2

 

Query: 150 RTISLGDFSEEWKAHRRVTHSALQRCTTDSLHSVIEKQAQHLCQVLRDYSG--KAVDLSE 207

           +T  L     +WK  RR+   +      ++   + E+QA  L   L   +   + VD+ 

Sbjct: 921 KTGLLTSTGSKWKTRRRLLTPSFHFSILNNFIKIFEEQASILVDKLAVAADNKEVVDVQV 742

 

Query: 208 DFTVASSNVITTLTFS---KAYDKSSAELQKLQECLNEIVSLWGS-PWISALDSFPLLRK 263

              +A+ ++I   +      A     +E  K    LNE + +    PW+     + LL 

Sbjct: 741 PIGLATLDIICETSMGVKVNAQSHPDSEYVKAITVLNEEIQMRQKFPWLWFDAIYKLL-- 568

 

Query: 264 FPNPPFSRLMKEVARRDELIGKHIEEFKKSEHKEGGTLTSSLLK--CLEPQQGAANHXXX 321

              P   R  K +    +L    I E  + + +E    T+S  K   L+          

Sbjct: 567 ---PCGKRFYKALDVAHKLSFDVINERMQMKIQESYCETASDEKKFFLDLLLDIYRKGKI 397

 

Query: 322 XXXXXXXXXXXXLIGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDVRYPQYSDRH 381

                       +  G +T +A L WT+  L   P+VQ K+++E+  +       Y   

Sbjct: 396 DTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQKKLHKEIDEIELNGGSLYDKVR 217

 

Query: 382 KLPYLCALISEMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDD 441

           +  YL  ++ E LR+ P  P+        + +I G F+PK   I+  +   H +P+ W++

Sbjct: 216 QSKYLEIILKESLRMHPPVPM-YGRTVEEDMTIDGQFVPKGAQIVLLVLILHSNPDYWEN 40

 

Query: 442 PYSFKPER 449

           P  F PER

Sbjct: 39  PNDFIPER 16

 

921 KTGLLTSTGSKWKTRRRLLTPSFHFSILNNFIKIFEEQASILVDKLAVAADNKEVVDVQV 742

741 PIGLATLDIICETSMGVKVNAQSHPDSEYVKAITVLNEEIQMRQKFPWLWFDAIYKLL 568

567 PCGKRFYKALDVAHKLSFDVINERMQMKIQESYCETASDEKKFFLDLLLDIYRKGKI 397

396 DTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQKKLHKEIDEIELNGGSLYDKVR 217

216 QSKYLEIILKESLRMHPPVPMYGRTVEEDMTIDGQFVPKGAQIVLLVLILHSNPDYWEN 40

39  PNDFIPER 16

 

 

>gnl|ti|647175227 1095898288652

          Length = 1081

 

 Score = 45.1 bits (105), Expect(2) = 5e-07

 Identities = 21/54 (38%), Positives = 33/54 (61%)

 Frame = -1

 

Query: 461 LIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLPELRGVASVVL 514

           +  F  G R+CLG+ +A++E+FLF + L+R+FKF      + LP L G   + L

Sbjct: 961 IFTFSAGTRVCLGKGIAEVELFLFYSRLVRDFKF-EVKPGDSLPSLYGNCGITL 803

 

 

 

 Score = 31.2 bits (69), Expect(2) = 5e-07

 Identities = 10/26 (38%), Positives = 17/26 (65%)

 Frame = -3

 

Query: 425  IIPNLYGAHHDPEVWDDPYSFKPERF 450

            I+ NL+  HH+   W++P+ F P R+

Sbjct: 1079 ILTNLWQLHHNKNCWENPHEFNPYRW 1002

 

ILTNLWQLHHNKNCWENPHEFNPYRWXXXXXXXXXX

IFTFSAGTRVCLGKGIAEVELFLFYSRLVRDFKFEVKPGDSLPSLYGNCGITL

 

>gnl|ti|648589386 1095733042694

          Length = 1032

 

 Score = 55.8 bits (133), Expect = 8e-07

 Identities = 48/189 (25%), Positives = 84/189 (44%), Gaps = 6/189 (3%)

 Frame = +1

 

Query: 211 VASSNVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFS 270

           VA  NVI  + F + Y  S     ++   +N IV+  G    +A+D  P LR       

Sbjct: 454 VAILNVICFIVFGERYQYSDPAFIEILTTINNIVA--GLSNTTAVDFLPGLRYLQFSEIK 627

 

Query: 271 RLMKEVARRDELIG----KHIEEFKKSEHKEGGTLTSSLLKCLEPQQGAANHXXXXXXXX 326

           +L   +     L+     KH E F ++  ++    T S++K  + +             

Sbjct: 628 KLKSSLVIYFRLLNDQLKKHKETFDENNIRD---FTDSIIKFSKDETMENKFEEELTDEH 798

 

Query: 327 XXXXXXXL-IGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVL-DVRYPQYSDRHKLP 384

                  + IGG+ET    L W + +++H P+ Q++++EE+  V+ + RYP+ SDR  L

Sbjct: 799 LEHVIGDMFIGGSETTLTSLLWLIIYMIHYPKYQEEIFEEITRVIGENRYPKLSDRDSLH 978

 

Query: 385 YLCALISEM 393

            + A I  +

Sbjct: 979 LVKASIKRV 1005

 

454 VAILNVICFIVFGERYQYSDPAFIEILTTINNIVAGLSNTTAVDFLPGLRYLQFSEIK 627

628 KLKSSLVIYFRLLNDQLKKHKETFDENNIRDFTDSIIKFSKDETMENKFEEELTDEH 798

799 LEHVIGDMFIGGSETTLTSLLWLIIYMIHYPKYQEEIFEEITRVIGENRYPKLSDRDSLH 978

979 LVKASIKRV 1005

 

>gnl|ti|649393684 1095898809307

          Length = 1093

 

 Score = 54.3 bits (129), Expect = 2e-06

 Identities = 25/65 (38%), Positives = 43/65 (66%)

 Frame = +2

 

Query: 462 IPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLPELRGVASVVLKVKPYTV 521

           +PF  G R CLGEA+AK+E+F+F + L+++++F   ++EE LP L+G + +      + V

Sbjct: 50  LPFSSGPRSCLGEALAKIELFIFISRLVKDYRFEKPTEEE-LPNLKGESGITRIPSEFKV 226

 

Query: 522 IAHPR 526

           +  PR

Sbjct: 227 MTIPR 241

 

>gnl|ti|649393684 1095898809307 45% to 17A1 C-term

LPFSSGPRSCLGEALAKIELFIFISRLVKDYRFEKPTEEELPNLKGESGITRIPSEFKVMTIPR

 

>gnl|ti|646849327 1095897329284

          Length = 980

 

 Score = 51.2 bits (121), Expect = 2e-05

 Identities = 30/68 (44%), Positives = 39/68 (57%)

 Frame = +2

 

Query: 69  GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128

           GP  LP +GN   L +      L  L K YG+++  + GS    VV+NN E I+E L+KK

Sbjct: 302 GPIPLPFIGNAHLLRKGEPYKELVNLGKIYGDVFGFSIGSIR-YVVVNNLEGIKEVLIKK 478

 

Query: 129 WSDFAGRP 136

            S FAGRP

Sbjct: 479 GSQFAGRP 502

 

>gnl|ti|646849327 1095897329284 40% to 2X2 N-term

GPIPLPFIGNAHLLRKGEPYKELVNLGKIYGDVFGFSIGSIRYVVVNNLEGIKEVLIKKGSQFAGRP

 

>gnl|ti|646968536 1095898162561

          Length = 1074

 

 Score = 48.9 bits (115), Expect = 9e-05

 Identities = 32/91 (35%), Positives = 44/91 (48%), Gaps = 2/91 (2%)

 Frame = +1

 

Query: 48  FPKLLHSLYKLFFSTVSPTI--SGPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLN 105

           FP L+  +Y      +       GP  LP +GN   L +         L K YG+I+  +

Sbjct: 598 FPPLIWFVYSYIKHLIECLYYPKGPVPLPFIGNTNLLRKKETCKEFVNLGKIYGDIFGFS 777

 

Query: 106 CGSTSAMVVLNNSEIIREALVKKWSDFAGRP 136

            GS    V++NN E I E L+KK S F+GRP

Sbjct: 778 IGSIR-YVIVNNLEGIHEVLIKKGSQFSGRP 867

 

>gnl|ti|646968536 1095898162561 83% to 1095897329284

FPPLIWFVYSYIKHLIECLYYPKGPVPLPFIGNTNLLRKKETCKEFVNLGKIYGDIFGFSIGSIRYVIVNNLEGIHEVLIKKGSQFSGRP

 

>gnl|ti|647168675 1095899196297

          Length = 998

 

 Score = 48.1 bits (113), Expect = 2e-04

 Identities = 18/48 (37%), Positives = 32/48 (66%)

 Frame = -2

Same as 1095898098005

Query: 334 LIGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDVRYPQYSDRH 381

           L+ GTET A  + W V +L+H PE Q+++Y+E+   +  RYP  ++++

Sbjct: 175 LVAGTETTAITICWMVLYLIHNPEYQEEIYKEITSNIGCRYPTLAEKN 32

 

 

>gnl|ti|649448444 1095899351259

          Length = 1086

 

 Score = 32.7 bits (73), Expect(2) = 0.079

 Identities = 15/32 (46%), Positives = 19/32 (59%)

 Frame = -3

 

Query: 414  IAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSF 445

            I G FIPK + I   +   H +PE W DP+SF

Sbjct: 1039 IDGQFIPKKSEIAILVMMIHLNPEYWKDPHSF 944

 

 

 

 Score = 25.4 bits (54), Expect(2) = 0.079

 Identities = 10/31 (32%), Positives = 17/31 (54%)

 Frame = -2

 

Query: 462 IPFGGGARLCLGEAVAKMEMFLFTAYLLREF 492

           IPF  G R C+G+  A +E  +    +++ F

Sbjct: 890 IPFSAGPRNCIGQKFAMIEEKMLLYNIMKHF 798

 

>gnl|ti|649448444 1095899351259 similar to 4V5

IDGQFIPKKSEIAILVMMIHLNPEYWKDPHSFIP

IPFSAGPRNCIGQKFAMIEEKMLLYNIMKHF

AGTAGGTATATTGAAGAAGATATGATGATTGATGGTCAGT

TTATTCCTAAAAAATCCGAAATCGCTATTCTTGTGATGATGATACATTTAAATCCTGAGT

ATTGGAAAGATCCTCACAGCTTTATAcCTGAAAGATTTGATCAAGATGATTTTGTAAAGCG

TAATCCATACACTTACATTCCATTCTCCGCTGGCCCTAGAAATTGCATTGGTCAAAAGTT

TGCAATGATAGAGGAAAAAATGCTGTTATATAACATAATGAAACATTTTTATGTAGAATC

CATGCAGAATGAAAATGAAATTTTAAGAACTCAAGATCTTATAAGTAAATCAGCTAATGG

TATCATGATGAAGTTCTATGAAAGATGA
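The two peptide fragments listed for this read can be checked against the nucleotide sequence by direct translation. This is a minimal Python sketch, assuming the standard genetic code and treating the lowercase c as an ordinary base. That the lowercase letter marks a base added to restore the reading frame is only a guess based on the two BLAST frames (-3 and -2); the notes do not state it.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, frame: int = 0) -> str:
    """Translate a DNA string in the given forward frame (0, 1 or 2)."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(frame, len(dna) - 2, 3)
    )


# Nucleotide sequence as written in the notes for 1095899351259.
DNA = (
    "AGTAGGTATATTGAAGAAGATATGATGATTGATGGTCAGT"
    "TTATTCCTAAAAAATCCGAAATCGCTATTCTTGTGATGATGATACATTTAAATCCTGAGT"
    "ATTGGAAAGATCCTCACAGCTTTATAcCTGAAAGATTTGATCAAGATGATTTTGTAAAGCG"
    "TAATCCATACACTTACATTCCATTCTCCGCTGGCCCTAGAAATTGCATTGGTCAAAAGTT"
    "TGCAATGATAGAGGAAAAAATGCTGTTATATAACATAATGAAACATTTTTATGTAGAATC"
    "CATGCAGAATGAAAATGAAATTTTAAGAACTCAAGATCTTATAAGTAAATCAGCTAATGG"
    "TATCATGATGAAGTTCTATGAAAGATGA"
)

PEPTIDES = (
    "IDGQFIPKKSEIAILVMMIHLNPEYWKDPHSFIP",
    "IPFSAGPRNCIGQKFAMIEEKMLLYNIMKHF",
)

protein = translate(DNA, frame=0)
for pep in PEPTIDES:
    print(pep, "found" if pep in protein else "missing")
```

With the sequence as written, both fragments fall in the same forward frame, and that frame runs to the terminal TGA stop codon.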

 

>gnl|ti|648014530 1095896049543

          Length = 1075

 

 Score = 34.7 bits (78), Expect = 1.9

 Identities = 14/34 (41%), Positives = 22/34 (64%)

 Frame = -3

 

Query: 420 PKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLEG 453

           P NTI+  ++   H +  ++ DP+SFK ERF+ G

Sbjct: 971 PANTILRTHVSSIHMNETIYPDPHSFKHERFMTG 870

 

 

>gnl|ti|648014530 1095896049543 41% to CYP21

PANTILRTHVSSIHMNETIYPDPHSFKHERFMTG

 

>gnl|ti|655009845 1095963045220

          Length = 1143

 

 Score = 33.9 bits (76), Expect = 3.2

 Identities = 35/127 (27%), Positives = 50/127 (39%), Gaps = 10/127 (7%)

 Frame = -3

 

Query: 108 STSAMVVLNNSEIIREALVKKWSDFAGR--------PYSYTGXDIVSGGGRTISLGDFSE 159

           STS  V +N S+ ++  L K   D   +        PYSY   + +   G  I L    

Sbjct: 637 STSNCVTMNQSDYLQNVLQKFGFDNCKQRSSPCKQVPYSYHNQESIKSNGSMIKL----- 473

 

Query: 160 EWKAHRRVTHSAL--QRCTTDSLHSVIEKQAQHLCQVLRDYSGKAVDLSEDFTVASSNVI 217

               +R++  S L    CT   L  VI K +QHL    +  SG  + +   F      +

Sbjct: 472 ----YRQMVGSLLYAMTCTRPDLSYVITKLSQHLS---KPNSGDWIMIKHVFRYIKHTLN 314

 

Query: 218 TTLTFSK 224

             LTF K

Sbjct: 313 YCLTFRK 293

 

>gnl|ti|655009845 1095963045220 near C-helix region, poor match

637 STSNCVTMNQSDYLQNVLQKFGFDNCKQRSSPCKQVPYSYHNQESIKSNGSMIKL 473

472 YRQMVGSLLYAMTCTRPDLSYVITKLSQHLSKPNSGDWIMIKHVFRYIKHTLN 314

313 YCLTFRK 293

 

>gnl|ti|648592188 1095595897239

          Length = 1036

 

 Score = 33.9 bits (76), Expect = 3.2

 Identities = 35/127 (27%), Positives = 50/127 (39%), Gaps = 10/127 (7%)

 Frame = -3

 

Query: 108 STSAMVVLNNSEIIREALVKKWSDFAGR--------PYSYTGXDIVSGGGRTISLGDFSE 159

           STS  V +N S+ ++  L K   D   +        PYSY   + +   G  I L    

Sbjct: 638 STSNCVTMNQSDYLQNVLQKFGFDNCKQRSSPCKQVPYSYHNQESIKSNGSMIKL----- 474

 

Query: 160 EWKAHRRVTHSAL--QRCTTDSLHSVIEKQAQHLCQVLRDYSGKAVDLSEDFTVASSNVI 217

               +R++  S L    CT   L  VI K +QHL    +  SG  + +   F      +

Sbjct: 473 ----YRQMVGSLLYAMTCTRPDLSYVITKLSQHLS---KPNSGDWIMIKHVFRYIKHTLN 315

 

Query: 218 TTLTFSK 224

             LTF K

Sbjct: 314 YCLTFRK 294

 

>gnl|ti|648592188 1095595897239

638 STSNCVTMNQSDYLQNVLQKFGFDNCKQRSSPCKQVPYSYHNQESIKSNGSMIKL 474

473 YRQMVGSLLYAMTCTRPDLSYVITKLSQHLSKPNSGDWIMIKHVFRYIKHTLN 315

314 YCLTFRK 294

 

>gnl|ti|653058100 1095949490108

          Length = 1087

 

 Score = 33.9 bits (76), Expect = 3.2

 Identities = 16/37 (43%), Positives = 22/37 (59%)

 Frame = -1

Pretty doubtful match to mid CYP21 region

Query: 229 SSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFP 265

           S   LQ L +  ++  S W SPW+ +LD   LL+KFP

Sbjct: 124 SGRSLQWLLQWYSQQSSQWYSPWLHSLDKCRLLKKFP 14