48 Tetrahymena P450 genes

 

This file is being updated to complete these genes.  Currently 37 are

complete (magenta).  There are very few short introns present.

There are four full length pseudogenes and one solo exon pseudogene.

Note: this is the first time CYP names have used four digit numbers for families.

 

Blast server including Tetrahymena P450s

 

Tree of 47 Tetrahymena P450s

 

sequence alignment of 47 Tetrahymena P450s

 

D. Nelson April 23, 2004

 

>CYP5001A1 8254555 596115 bp 4 contigs 32% to CYP689A1 This gene's N-term is very

hard to detect.  I have assumed one intron, but this may not be true.

MPIQHNSQPKEVQTCEQMMKQLENNGFIRLKSYRPFQEEDLFKYDDELYSVKQ

FKSRYPNAKGFICYSDSSQKVPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGD

SLLFNLSDIWRMKRKVYGQMFHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGY

NASQYDLQSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNITC (2)

YSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFL

105041 DQICQDIIIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLN 105217

105218 YLEAVLKETLRHYGPTTTILPKVVMERHSFMNDKVFLEEGTIVSLAIQELNYDPQHYENP 105397

105398 FKFYPERWFDENLKQKEKKIFMPFSYGSHKCIGDKLALPVMKMLYITLNEYFSIDIDPSF 105577

105578 KLKLVALIQYTPSTQIPFLITKKKNIQSNNEVYFS* 105685

 

ATTCTAGGAGAATGGTCTCTTCATCTATTTTTCGGATATAATGCTAGCTAATATGATTTA

                           F  F  G  Y  N  A  S  Q  Y  D  L

CAATCTAAATTAAAAGCAGCTTATAACAATGCTCGCTCTGCCATACTCCTTTTACAGCCC

Q  S  K  L  K  A  A  Y  N  N  A  R  S  A  I  L  L  L  Q  P

                                        P  Y  S  F  Y  S  P

CAAAATATAATAAACTGTCTTGACCCTTTAAAGGAAGTCCTCCTTGAATATCTATACATC

Q  N  I  I  N  C1 L  D  P  L  K  E  V0 L  L  E  Y  L  Y  I

 K  I  Q  Q  T  V  L  T  L  Q  R  K  S  S  L  N  I  Y  T  S 

ACTTCTGATTAGATAAAAAGCGATCCTGATTACTAAAGCAAACCTTAAAATAACATCACA

T  S  D  Q  I  K  S  D  P  D  Y  Q  S  K  P  Q  N  N  I  T

 L  L  I  R2 Q  K  A1 I  L  I  T  K  A1 N  L  K  I  T  S  H

TGGTATAAAATTTAAAAATTAATTAAAAACAATAAAATAAATGTTTTTTTTAGTTAGTTA

W2 Y  K  I  Q  K  L  I  K  N  N  K  I  N  V0 F  F  S1 Q2 L 

 G  I  K  F  K  N  Q  L  K  T  I  K  Q  M  F  F  L  V  S2 Y

TTCATTCATTCATTCATTTTACTTACTAATAATTTTTTTGAAATTTAAAAACAGGTTTTT

F  I  H  S  F  I  L  L  T  N  N  F  F  E  I  Q  K  Q  V0

 S  F  I  H  S  F  Y  L  L  I  I  F  L  K  F  K  N  R2 F  L

GCTAAATAGCAAAGAAGTTTAATGGAACAAAGATTTCCTTGATTAAATTTGCTAAGATAT

 L  N  S2 K  E1 V1 Q  W  N  K  D1 F  L  D  Q  I  C  Q  D  I

TATCATCTTTTTCTTCGCTGGAAGAGATACGATGGCTCATACTCTTTAAATGATGTTTTA

             F  A  G  R  D  T

TTATTTATGTATTTACCCTGAATATAAAAAGAAGATAGACGAAGAAATAAATAGCCTAAA

CGGAGATTATTCTGTTTAAAATATTTCAAATTTAAATTATTTAGAAGCTGTTTTAAAAGA

 

5 POSSIBLE INTRON LOCATIONS THAT MAKE SENSE.

 

VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM

FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL

QSKLKAAYNNARSAILLLQPQNIIN (1) SILITKANLKITSH

GIKFKNQLKTIKQMFFLVSYSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFLDQICQDI

IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK

 

VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM

FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL

QSKLKAAYNNARSAILLLQPQNIIN (1) SNLKITSH

GIKFKNQLKTIKQMFFLVSYSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFLDQICQDI

IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK

 

MY CHOICE BASED ON COMPARISON TO PARAMECIUM P450S

VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM

FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGY NASQYDL

QSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNITC

 (2) YSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFLDQICQDI

IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK

 

VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM

FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL

QSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNIT

WYKIQKLIKNNKINVFF (1) KVQWNKDFLDQICQDI

IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK

 

VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM

FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL

QSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNIT

WYKIQKLIKNNKINVFF (1) IQWNKDFLDQICQDI

IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK

 

>CYP5002A1 8254654 510086 bp 17 contigs 30% to CYP696C1

MLTELFYTVILLIGVLFYLTFLRPLYKLLMLKIKHGKEILIKFEPIIGIHAYLKKSEKIHGDPMKLLKLEMAKN

PKLKAVACSLGFGVLITSVDPDMNKSFLIDNYQKSVKLAPIFNYKILKNGIIFSE

GEQWKKQRKMISDSFHFDSKYYFFSQKFNLLTLIMFKKVQNR (1)

VSVDSQLFFTKIKGEIGIRVFFGPDFGDHQIDGKNISEELKE (0)

IVDQTIVYSYTSIFFQLK

RIFLGDELAMKFLTAKEKKLLSRSDRINQIAMEQIEKKIKYYEGQPSVQSDYLIDV

292370 LVRNYLTKQDPNLTKEQ (0)

ICQQCITMFIAALDNTGCIIA 292543

292544 WIIFCLANYKEEGDKVRQEINSVFPTQETLDNIQFDDLKNLNYTSAFIDECFRHYTSSMG 292723

292724 VFLRRVVEPFKSGNFEIQK (1)

       GDIIQSLWH

292894 PTQFNPDWFENPDKFDVNRFLNGKEKQGNSYAFTPFSIGPRNCIGKQMSLVEVKVSLIYF 293073

293074 VKKFNCIRNSVPLKTTASIVYKPVDQNLIKISFCL* 293181

 

AATATTTAACTATAAAATACTTAAAAATGGAATAATCTTTTCAGAAGGAGAACAGTGGAA

                                                       W  K

GAAATAGCGCAAAATGATATCAGATTCCTTCCATTTCGATAGTAAATACTATTTTTTCAG

 K  Q  R  K  M  I  S  D  S  F  H  F  D  S1 K  Y  Y  F  F  S1

TCAAAAATTTAATTTATTAACTCTCATTATGTTCAAAAAGGTTTAAAATCGAGGTTTTAA

 Q  K  F  N  L  L  T  L  I  M2 F  K  K  V0 Q  N  R  G1 F  K2

GTTATTGCATAAGTAACTAAAGAAACAATTGACAGTCTCAGCAAATATGTAGGAAAGGAT

 L  L  H  K2 Q  L  K  K  Q  L  T  V0 S  A  N  M2 Q  E  R  I 

TACTTCCCTTTATTTGATGAAATGATGAAGCATTCAGGTAATTACTTTAATTAACATTTT

 T  S  L  Y  L  M  K  *

AGTTTCAGTTGATTCTCAACTTTTTTTTACTAAAATTAAAGGCGAAATAGGTATTAGGGT

 V1 S  V1 D                    K  I  K  G  E  I  G  I  R  V

TTTCTTTGGACCAGATTTTGGTGATCACCAAATAGATGGTAAGAATATATCTGAAGAGCT

 F  F  G  P  D  F  G  D  H  Q  I  D  G  K  N  I  S  E  E  L

TAAGGAGGTAAATGATTAATTGATTCATTTATTTATTTATTTCGTAGGTATATTATAATT

 K  E  V

TGTATATGATTTATCTTAACATCAAGAATGAAAACATATTTATATAAATGCAAAAATAAA

                               K  H  I  Y  I  N  A  K  I  K

ATTAAAGATTGTAGATCAGACTATTGTTTACAGCTATACTTCTATTTTCTTCTAATTAAA

 L  K  I  V  D  Q  T  I  V  Y  S  Y  T  S  I  F  F  Q  L  K

GAGAATCTTTTTGGGAGATGAATTGGCAATGAAGTTTTTAACTGCGAAAGAAAAGAAACT

 R  I  F  L  G  D  E  L  A  M  K  F  L  T  A  K  E  K  K  L

TTTGTCTAGGAGTGATAGGATAAATTAAATTGCTATGGAATAAATAGAAAAAAAAATAAA

ATACTATGAAGGTTAACCTAGTGTACAATCAGACTACCTTATAGATGTATTAGTGAGAAA

TTATCTCACAAAATAAGACCCAAATCTAACAAAAGAATAAGTAATCCATTTTATATTAAG

                D  P  N  L  T  K  E  Q  V  I  H  F  I  L  S

CTTTTTTTAAAAATAAATGTCTTCATTTATATCTAATTAGATATGCTAATAGTGCATAAC

 F  F  Q  K  Q  M  S  S  F  I  S  N  Q  I  C

TATGTTTATTGCAGCTTTAGACAACACTGGATGCATAATAGCTTGGATAATTTTTTGTTT

GGCTAACTATAAAGAAGAAGGAGATAAAGTAAGATAAGAAATCAACTCCGTCTTTCCAAC

ATAGGAGACCTTGGATAACATCTAATTTGATGATTTAAAAAATCTTAATTATACTTCAGC

ATTTATTGATGAATGTTTCAGGCACTATACAAGTTCTATGGGCGTTTTTCTCCGTAGAGT

                      H  Y  T  S  S  M  G  V  F  L  R  R  V

AGTTGAACCCTTTAAGTCTGGAAACTTTGAAATATAAAAAGGTACTGATTAATCTATCAT

 V  E  P  F  K  S  G  N  F  E  I  Q  K  G

TAATAAATATTAATTATTTTAATTTAAAACTAAACTAATTAACTAAAAATTTTTAAATTA

AATAAAGGAGATATTATTTAAAGTTTATGGCATCCTACCTAATTTAATCCTGACTGGTTT

   K  G  D  I  I  Q  S

GAAAATCCTGATAAATTTGATGTGAATAGATTCTTAAATGGAAAAGAAAAGTAAGGAAAC

 

>CYP5003A1 8254752 1171415 bp INTRONS 1, 3 AND 4 HAVE UNDEFINED BOUNDARIES 29% TO CYP694C3

MILLIIGLLIFSLFSYFAYLIFVKPYVRNKWYVKQYGKICKPIYYPILGNIKFVRESFKKYG

DSLQWIRELLKNDNDVQFLIGYNNIHIVDAQLAKDITLNNLQNIRKDMGMQFGDLLFKQG

IVFSEGEKWKKQRQILSHSFSFDVLKNRVSKINNIVKEMINKIQIEE (1)

GKPTKIVTECTKITAEVVLRSFFGERFSQQQHKGINLGIYLAQL

LLSVFDQTRRLRMLIPSFFFPKFMSKIFADEKYYEVQKNCIEFRQLIEKEVNLRIENYDQ

NKTQKDFLDILVDEYKLNYSKNDGKVEEYITKESIIHQFIVFFFA (1)

GMDTTGNLTGIMLYW (2)

VSKRKDIYEKLVQEIKSVFGQNKDPEINDEQLKKLNYCHMFIQECLRYHCPAMLL (2)

FTRRAERDFYIGDNILVQKGMQVNISLHGVLRRE

262363 KYFQNPDEFIPERFSEENKKNINHLAFIPFSSGPKNCIGQHMALIEAKIILVQFILNFDI 262184

262183 TNNENVPLKMDSNSLYCPVNDELLFLKRKQ* 262091

 

AATCTCTTTCAGCTCTACGTGTGAACCTTAAAAATTTACAAAAGTTAATTTTAATAGACT

 D1 R2 E  A1 R  R  T  F  R2 L  F  K  C  F  N  I  K  I  S  Q

                                                   L  L  S0

AAAAATATTATTTATTTACAATAACATTGCTGGGCAGTGGTATCTGAGACATTCTTAGAT

F  I  N  N  I  Q  L2 L  M2 A  P

AAACATGTGGCAATAATTTAATTTCTTTAGTTATTCGTCATTGATTTCAGGATCTTTATT

                                                          

TTACCCAAATACACTTTTTATTTCTTAAACTAATTTCTCATATATATCTTTTCTTTTAGA

                                                   R  K  S

AACCCTATTTGATAATAATTTACATTTAATTATTATTTTTATTATTTTTATAATATCTAC

V  R2                                                      W

CAATAAAGCATGATGCCAGTCAAATTTCCAGTTGTATCCATACCTAAATTTTAAATGAGT

 

 

CAAAAAAGGATCTTAAAACAACTTCTGCCGTTATCTT AGTGCATTCAGTTACAATTTTAG

                         A  T  I  K0

TTGGTTTTCCTATAAAAAAATAATTAAATTATTTACATACGATATTGTTTTAACTTTATA

 P  K  G1 I

TTCATCAAATCAGATTAATTATAAATAACTAAATTACCTTCTTCAATTTATATTTTATTA

                          V  L  N  G  E  E  I  Q  I  K  N  I

ATCATTTCTTTAACAATATTATTGATTTTAGATACTCTATTTTTCAACACATCAAAAGAG

 

>CYP5004A1 8254638 979488 bp 2 contigs 31% to CYP694C3 COMPLETE, NO INTRONS

747540 MIILIASLLTILTFCILLLIFSPKIKLLHLAKKYKDTEVLFSPVGGIVNLYQQSFEKY 747367

747366 GDALYFIRDLIQKKPNTKAILYNLGVDIVVNFQDAELVKEVHQSYSDYYEKLKGILTLQY 747187

747186 LFDKGMLMTDGDEWQKQRKLLANTFLFQNIKSRLPVTKQVVQEIYSSKQLDLSKPINALH 747007

747006 MSEKVTSDVIFQTFFGISFRGKQINGVEIQKELSNVLIEGFNYIQSSFYYKLKMVILK 746833

746832 EKAVGFKPFKGEEDLLKRMKNLRDTTEEVIQNRLKQIKSGNLDISQSELFLDILLRDYI 746656

746655 QKNSESEKDIQQMIDQFMTFFFGGTDTTADLVALTLYHLSHNQNFQTKLRQEIKSIINNF 746476

746475 EELNYDNLNQMNYLQCVIKESLRIHPPAVGVLPRVCIKDHKVGQIEMKKGMLMDTHFIGV 746296

746295 LNNPKYYDNPQDFNPDRWNDSKKMESAPFSPFGIGKRSCIGQHLGMMNAKVIICFLVMNY 746116

746115 LVKPNHQQKLRMSLNTVYTPFNDDLVYFEKIN* 746017

 

>CYP5005A1 8254815a 909308 bp 16 contigs 10 genes all in (+) 29% to CYP694C2

       MILVSYIALGA

241470 IVLLIAIFIYNLILYPQIRLNLMKQKHGDKIKTFFHIGEGIFKEYYTNLTQKKDSVAFIR 241649

241650 GIHQKNIKALAYNLGTRICLNFVDPELVKQVHQNPEAFRKNDASLAITFLFGNSILFA 241823

241824 KGKSWQRQRQFLGKSFHFEEIKNYLPLIKEICDKVFERVSEKLTQNDQLEINAVKICEQV 242003

242004 TSEVVFRAFFGSTTQNLVITRQDGSQIPIADELIQAIMNSFQLLKKDKIALFKYLIFGRN 242183

242184 STKFLRTQGEQSILNRLIAIKESCLQVVQQRKNQLQNDPTQAKKNFLDQYLYDMITNKQ 242360

242361 SEVNNEEIIDNFLGLFFAGTDTTGNMTGVALYYLSRYPDIQKQAREEVIQILSQNSNKRN 242540

242541 HSELFSQLTFENLQNMNLINSILKESLRLIPPAIEVFPRVAIHNMKIGEFQIKKGDLVST 242720

242721 YFIYNQSNPELYQDPEIFNPQRWMNVKD 242804 (2)

242867 SQNNFNFTPFSLGPRNCIGQHLAMIEGKCMLACILLNFDILPNYQQEVVKELRVIY 243034

243035 GFQKDNLIYFKQRKQNN* 243088

 

ATCCAGAAATATTTAATCCTCAAAGGTGGATGAATGTAAAAGAGTAAGTCTTTTATATTT

                          W  M  N  V  K  E2

ATTATATTCTATATTTATAATGTATCTTCTTAATATAAATCAAAGCTCATAGAATAATTT

                                           S2 S  Q0 N  N  F

CAACTTTACTCCGTTTTCACTAGGCCCTAGAAATTGCATCGGTTAGCATCTTGCTATGAT

 N  F  T  P  F  S

 

>CYP5005A2 8254815b 909308 bp 16 contigs 10 genes all in (+) 30% to CYP694C1

244655 MLLNSIIISFVIILIIYKFLLYPKLRLIQIKRKYGDKIKVIHHYGPGLLVEYKK 244816

244817 SYTQNKDSLSFLRQMAYQNGKSIQAYAFNIGTNVGYSFVDPELIKQVHQNHDAFQKIDV 244993

244994 SQAIAYLFSNSLLFATGKSWQKQRQFLGKSFHYDEIKNYFPSIKDICLKVSDRISEELI 245170

245171 QNPNKEIKVVKTCEQITSEVVFRAFFGATSENIFISTENGSTKIPISQELVSVIMDSFRI 245350

245351 LQQSKLIQLKSQLLKRKTFNIFPLKEEKKLLNRLITLKQACKNIVLKRK 245497

245498 EELVSDPSLYKKNFIDLYLKEIIQNSNNEITIEEIVDNFCALFFAGTDTTGNMTGAALYF 245677

245678 LSLNPKIQQEAREEVMQIIKQKANSSDLKDIYNYLSFEDLTKLDLINSILKESLRLLPPASGV 245866

245867 LPRISNRDIKIGQFEVKKGDLVNTHFIYHQSNPEVYENQDQFNPYRWMKGKE 246043 (2)

246053 QNNAFNFTPFSLGPRNCIGQHLAMIEGKCILINFLINFDIMPIPQKQV 246229

246230 QYEMKVIYGLQPDDLVYFRKRNN* 246301

 

TAGATGGATGAAAGGAAAAGAGTAAAACATTTTTATATAATTATAAAAAAATTAAATAAT

    W  M  K  G  K  E

TTTTAAATAAATATAAATTAAAAGGTAAAATAATGCATTCAATTTTACTCCTTTTTCGTT

                         Q  N  N  A  F  N  F  T  P  F  S  L

AGGACCTAGAAACTGTATAGGATAACATTTGGCGATGATAGAAGGTAAGTGTATTCTTAT

 

>CYP5005A3 8254815c 909308 bp 16 contigs 10 genes all in (+) 31% to CYP694C1

247451 MIFVAIYTILGILALIFAIALYNLVFYPYLRLKIMKQKYGDKIQVFFNIGDGILKKYKNDLE 247636

247637 DKGDSLAFIRGMHQKNLKAIGFNLGAKVGLSFVDPDLIKQVHQNHDSFHKIDAAMAIT 247810

247811 FLFGNSILYAKGKDWQRQRQFLGKSFHFEEIKNYLPLIKEVCQNTFQNVNKQLALKEQTE 247990

247991 IQAVKICEQITSEVVFRVFFGSTSQNLDIKKEDGTRIPIAHELVETIMNSFQLLQNDKIA 248170

248171 LIKWLLFKRNSTKFLRTQGEESILNRLITVKQTCLQVVIKRRDELFKDPSQAKKNF 248338

248339 LDQYLIDMIQNKNSKVTYDEIVDNFSGLFFAGTDTTGNMTGVALYYLSLYPEIQQQAREE 248518

248519 VIKVLSQKQKEKKTDQLF

248573 NQLTFDDLSNLDLINSILKESLRLVPPANEVFPRIADHDMKIGDFQIEKGDLVNTHFIYN 248752

248753 QSNPEYYPNPDIFDPYRWMNAKD 248836 (2)

248875 QQNTFHFTPFSLGPRNCIGQHLAMIEGKCMLASILLQFEILPNHSAKIVKEVKVIYGLKN

       DNIIFFKKLKAN* 249093

 

CCCTGAATATTATCCAAATCCAGATATTTTTGATCCTTATAGATGGATGAACGCAAAAGA

                                        R  W  M  N  A  K  E

GTAAATTTTATTTTTATATTCATTCTTAAAAAATAATATTTAACTAAAAATAGTTAATAG

                                                   S  Q  Q

AACACATTTCACTTTACTCCATTCTCTTTAGGTCCCAGAAATTGCATAGGTTAACATCTA

N  T  F  H  F  T  P  F  S  L

 

>CYP5005A4 8254815d 909308 bp 16 contigs 10 genes all in (+) 29% to CYP696C1

249796 MFVENINLIPVLFILGILFILYKMILYPKIRLMQLQKKYGESIKVVHHYGTGLLMEYKKGFQQLND 249993

249994 SMAFIRKMSKENQSNVEAFAFNIGPKIGISFVYPELIKQVHKNHDAFQKIDVSQAVVYLF 250173

250174 SNSLIYATGKQWQRQRQFLGKSFHFEEIKNYLPCIKDICIKTSNQISEDIRINPQKEIQV 250353

250354 VKICEKVTSEVVFRVFFGSTQENILIEREDGQKLSISEELVSIVMDSFRILQQDKLLLIK 250533

250534 SLILKRYSMKIFPLREEKSLHNRLIQLKKACETIVLQRKKQLQLDPALYKNNFLDLYL 250707

250708 KEMIQNSNTQITIDEIIANFCGLFFAGTDTTGNMTGVALYYLSLNPQIQKEAREEVIQI 250884

250885 ISKKNSNLDLKDQFNYITFEDLSSMNLINSILKESLRLIPPAIGVFPRYANRDIKI 251052

251053 GQFELKKGDLVNTHFIYNQSNPSIFQNPEQFDPKRWMNGND (2)

251242 LQFAFSFTPFSLGPRNCIGQHLAMIEGKCMLANFLLKYDILPNKSQNIGLEMKIIYGLSPDNLVYFKQR* 251451

 

TACAATTAGTCAAATCCTTCAATATTTTAAAATCCAGAACAGTTTGATCCTAAAAGATGG

                                                      R  W

ATGAATGGAAATGAGTAATTAATTAATTAATTAATCATTTAATTTTTTATTATTTATTTA

M  N  G  N  E

CAAAAATTTATTAAATATAGTTTGTAATTTGCGTTTAGCTTTACACCATTTTCATTAGGC

                     L  Q  F  A  F  S  F  T  P  F  S  L

 

>CYP5005A5P 8254815e 909308 bp 16 contigs 10 genes all in (+) (+) 29% to CYP694C1

one in frame stop and a frameshift, stop codon missing at end, may be a pseudogene

252767 MLKILITFLGLFLFLLVFVVYKVFIYPYLRLSTFSKKHGKKVKVYFNFGVGLL 252925

252926 KEYKQSLIQYKDSLAFFRNLHQKSLQAIAFCFGTKVGLIFVDSNLIKQVHQNHEAFM 253096

253097 KIDASMAIIYLFGDSLIYAKGKEWQRQRKFLGKSFHFEEIKNYLPSIKETSIKIFGEIKD 253276

253277 QLKQSDQMEIQVVKTCESVTSEVVFRVFFGQTSQNLKNTKEDGSQISVAQELVSTITNSF 253456

253457 QILQNDKISLIK*IFFERNTIKYFPTKNEQNLQKRLIEIKKQCMQVVEKRRIELIE 253624

253625 VIKSAKNNFLHQYLKEMIVNEKSKISNEEIIDNFLALFFAGTDTTGNMTRVALYYLSLYP 253804

253805 DIQNKAREEIIKLASSRINSTNPVDLFNSLTFEDIQNLNFLNSILKESLRLIPPAIEVFP 253984

253985 RLAIQDIQIGD 254017 (fs)

       YEVKKGDFINTYFIYNFSNPEIYPEPDNFDPSRWMKQID (2)

254199 QQNTFNFTPFSLGPRNCIGQHLAMIEGKSMLAYILLNFEKFPNKNQEVVKEMKIIYGFQKDNLVYFKNRSQ 254411

RKKSGNIQLTQQFYIQIKKYLFLLCLKFQYVFFTTKKQKPSILQKFIKEKQNKFKHQKQTRFYFQHSKQKEVEINMI

LIISIIILQIKKQNMRKNISYKQKQFKQKNFLSLIELLFLSFKLKFFIFQK*

 

ATTTTGATCCTTCAAGATGGATGAAACAAATAGAGTATTTATTTAATATTCATAAATAAT

                 W  M  K  Q  I  E2

TAGTTAACATTAATGTATTTTTTTAATGAATAAAAAGTTAGTAGAATACATTTAATTTCA

                                      Q  Q  N  T  F  N  F  T

CTCCCTTCTCATTAGGACCAAGAAATTGCATTGGATAGCATCTAGCCATGATTGAAGGAA

  P  F  S  L

 

>CYP5005A6 8254815f 909308 bp 16 contigs 10 genes all in (+) 30% to CYP696C1

285979 MLIIFQIIFAIIASILTLAIYSLVLYPFFWFILMKYKYGDKIKVFFNIGQGMLKEYKKGLIE 286164

286165 KNDGVAFIRNMHQKNLKAMAFNYGSKVGFSFLDPELIKQVHQNHEYFIKMDATMAVTF 286338

286339 LFGNSILYAKGKEWQKQRQFLGKSFHFDEIKSYFPQIKEICQNTFRNTYQKLTNEEQINI 286518

286519 QAVKICETVTSEVIFRTFFGKTSQNLNITKKDGSQILLAFELVEVVKSAFQMLT 286680

286681 IDKIALIKWIFLQRNSARYFRTEKEEDLFQRLTAIKECCLQIVQKRRDELIKEPTQAKK 286857

286858 IFLDQYLIDTMQNQSSQVTDDEIIDNFCGMFFAGTDTTANMTGVALYYLSIYPEIQRQAR 287037

287038 EEIIKILSSKSNDKNPDSLFSQFSIEDL 287121

       SNLDLINSILKESMRLIPPAIQVFPRIACQDIKIGEFEIKKGQLVTTNFVYN

       QSNPEIFPKPDNFDPFRWMNEDN 287346 (2)

       TQSNTFNFTPFSQGPRNCIGQHLAMIEGKCMLVYILLLFDILPNPSVLVAKEVKV 287569

287570 IYGFQNDNLIYFKKRQNII* 287629

 

region around intron in CYP5006A6

TCAAATCCTGAAATATTTCCTAAGCCAGATAATTTTGATCCATTTAGATGGATGAATGAA

                                                W  M  N  E

GATAAGTAAGTTTAATTAATATTTATATTTTTAGAAAATATTTATCATTAATTTAACAAA

D  K2 Q  V0

 

TAGCACTTAGAGTAATACATTTAACTTTACTCCTTTCTCTTAAGGCCCAAGAAATTGTAT

 S2 T  Q0 S2 N  T  F  N  F  T  P  F 

 

>CYP5005A7P 8254815g 909308 bp 16 contigs 10 genes all in (+) 30% to CYP694C1

one in frame stop codon same as CYP5005A5, might be a pseudogene

       MLKILFTFLGLFL

289100 LLLVFVVYKVFIYPYLRLSTFSKKHGKKVKVYFNFGVGLLKEYKQSLSQYKDSLAFFR 289273

289274 NLHQKSLQAIAFCFGTKVGLIFVDSNLIKQVHQNHEAFMKIDASMAIIYLFGDSLLYA 289447

289448 KGKEWQRQRQFLGKSFHFEEIKNYLPSIKETSMKLFGEIKEQLKQSDQIEIQVVKTCESV 289627

289628 TSEVSFRVFFGQTSQNLKITKEDGSQISVAQELVSTITNSFQILQNDKISLIK*IF 289795

289796 FERNTIKYFPTKNEQNLQKRLIEIKKQCMQVVEKRRIELIEVIKSAKNNFLHQYLK 289963

289964 EMIVNEKSKISNEEIIDNFLALFFAGTDTTGNMTRVALYYLSLYPDIQNKAREEIIKLAS 290143

290144 SRTKSTNPVDLFNSLTFE 290197

290198 DIQNLNFLNSVLKESLRLIPPAIEVFPRVAIQDIQIGDFEVKKGDFINTFFIYNFS 290365

290366 NPEIYPEPNNFDPSRWMKQID 290428 (2)

290492 QQNTFNFTPFSLGPRNCIGQHLAMIEGKSMLAYILLNFEIFPNKNQEVVKEMK

       IIYGFQKDNLVYFKNRSQ* 290707

 

region around intron in CYP5006A7

TCCTTCAAGATGGATGAAACAAATAGAGTATTTATTTAATATTCATAAATAATTAGTTAA

       R  W  M  K  Q  I  E

CATTAATGTATTTTTTTAATGAATAAAAAGTTAGTAGAATACATTTAATTTCACTCCCTT

                            S2 Q0 Q0 N  T  F  N  F  T  P  F

CTCATTAGGACCAAGAAATTGCATTGGATAGCATCTAGCCATGATTGAAGGAAAATCTAT

 

>CYP5005A8 8254815h 909308 bp 16 contigs 10 genes all in (+) 30% to CYP694C1

381326 MISLLHLIFGVLLIPIILIIYKVILYPLTRLLYLKYVHLDKVRIFYTYGQGFFPQFKKNLLEKK 381517

381518 DSLAFVREMYSKKLQTALFSIGTKVGFIFLDPELIKQVHQNADAFKKNDGSLAITYLF 381691

381692 SNSLLFVKGKEWQRQRQFLAKSFNFQDIKNYLPIFKDVSSKIFGLIRNNLEISGTQEIEI 381871

381872 VKTCEKVTSEAVFRVFFGQTSQNLMVTSKDGSSTLLAHELVAVVVDSFHMLLHDKLLLI 382048

382049 KFTMLGKNSINILPTQSELKLLNRLKEVKRVCLEIVEKRRNELLKDQSQFKNNFLDQYL 382225

382226 KETLINQNKLITDEEIIENFIGLFFAGTDTTGNMTGVALYYLSLYPEIQQKAREEIQKVLS 382408

382409 SKCNSNENLDELFNSLTFEDLQQLDLINSILKESLRLIPPAPSVFPRIAERDIKIGDFQL 382588

382589 KKGDFVNTYFIYNFYNPDQFSNPEVFDPYRWMNQNE 382702 (2)

382764 SQNVFNFTPFSLGPRNCIGQHFGMIEGKCMLIYALLNFDILPNKNQEVKKI

       MGTIYGFEKDNLVYFKKRNINN* 382970

 

TTACAATTTCTACAATCCTGATTAATTTAGTAACCCAGAAGTCTTCGACCCATACAGGTG

                                                       R  W

GATGAACCAAAATGAGTAAACTTTTTTATTTTAATTTTTTTGATTTAAGATCTATTTTAA

 M  N  Q  N  E2

TTAAAAGGTCATAAAATGTATTTAACTTCACGCCTTTCTCCTTAGGCCCAAGAAATTGTA

        S  Q  N  V  F  N  F  T  P  F  S  L

TAGGTCAGCATTTTGGTATGATAGAAGGAAAATGCATGCTTATCTATGCTTTGCTTAATT

 

>CYP5005A9 8254815i 909308 bp 16 contigs 10 genes all in (+) (+) 32% to CYP694C1

       MIGTLFYSILGI

392542 IILTFLVLLYKLVFYPKLRLAAIKSKYGDKVKTYFYVGEGLLAEYRNNLLKYNDSVYFM 392718

392719 RNLHQKNLKALAFNFSTKVCISFLDPMLIKQFHQNHDAFKKVDVSMAITYLFGDSIIF 392892

392893 ARGKQWQRQRQFLGKSFHFEEIKNYLPQIKQISENVFGNINLNGSENEEICAVQICQKVT 393072

393073 SEVVFQIFFGSTSQNLVITTQKGHQIKVAEELISAIVDSMQIFETDKIALLKWIF 393237

393238 LEKNSTKYFTTKSEKQLKDRLIAIKQACLNVIKKRNDQLSQDISLVKQNFLDLYLIEM 393411

393412 ITNQNTQITYEEIIDNYLTLFFAGTDTTGNMTGVALYYLSVNPQIQEQARDEILQKLSQK 393591

393592 TASKDPKYLFDSFSFDDLASLNLLNSILKESLRLIPPAIDVFPREAVQDIKIGEFEIKKG 393771

393772 DIVTSHFIYNQSNNEIFSNPDIFDPQRWMNGKE 393870 (2)

       QQNAYNFTPFSLGPRNCIGQHLAMIEGKCMLVYILLNYQILPNTNQQLIRENNLV 394107

394108 YGFKKDNLIFFKKIIL* 394158

 

GATCCCTAAAGATGGATGAATGGCAAAGAGTAATTAATTATTACAATATTTCTTTAATTT

         R  W  M  N  G  K  E2

TTATATTAATTAATATTTTATTCAATTTTTTTAAAAAAAAGGTAATAAAATGCATACAAT

                                          Q  Q  N  A  Y  N

TTCACTCCTTTCTCATTAGGACCTAGAAACTGTATTGGCTAACACTTAGCTATGATAGAA

F  T  P  F  S  L

 

>CYP5005A10 8254815j 909308 bp 16 contigs 10 genes all in (+) (+) 30% to CYP696C1

404751 MIMAILITVFAVLTIIIVKLHFIPIIKIMILKRKHPNQIESIVHLGPSLVREE 404909

404910 KYNLKHYDDNIYSIRNIYKKNQDVKLILYKNAKGQLGYIFVDPELIKQVHQNPEVFQKQM 405089

405090 SNPIIKYLFQDSLLFKTGKEWQRQRQFLGKSFHFDEIKNYLQDIKQVTKEELQKVKLKIE 405269

405270 SQPDRSIEIVKTCQNITQEVVFRIFFGSTNLKVSKPDGKEIPIADEMISSIESFQVLFRS 405449

405450 NYLLFAKFVILGQNVLEVLPTNKEREIKQRFIDLKKACKLIISQRKEQL 405596

405597 IKNPTLYKYNFLDQYLKEVIVNNNQSITDEEIIQNSLSLFFAGTDTTGNLVGSALYYLSL 405776

405777 NLTIQNQAREEVLKVLSQKKLNENLEEKL

405864 ASLTLQDLSNMNFLNSILKETLRLIPPAPNVLSRICVQDFQIGDFFIKKGTPVGTYFISS 406043

406044 QMNPKLYPNPETFDPNRWMNAQE 406112 (2)

406173 QNSINFTPFSLGPRNCIGQHLAIIEAKCILSYILINYDILPNENQKIQFRSKFVYGIQPDNFITLKIR* 406379

 

TTGATCCAAATAGATGGATGAACGCACAAGAGTAATTAATATTTTATTTAATTAATTTAT

           R  W  M  N  A  Q  E2

TCATTCAAATTAATAAAATTATATCTTATAGACAAAATTCAATAAATTTCACTCCATTTT

                                Q  N  S  I  N  F  T  P  F 

 

>CYP5005A11P 8254665d  877599 bp pseudogene fragment, no additional P450 seq. found

332385 QNNAFNFTPLSLGHRNCIGQYLAIIEEKYMLSNNFLNYNILLYPSEKLINNLWYLSRQFCLF* 332197

 

>CYP5005A12P 8254665a  877599 bp pseudogene N-term is missing, 80% to 8254665c

362819 LDPEIVKQVHQNHDAFLKMDISQAFI (fs)

FLFSNSLLCVSGKEKQRQRHF (fs)

FLDKSFYFD 362652

362651 EIKNDFPIIKEVSLVVSDEITQDLNNSAQKMIQVVK (fs)

KICEKLTSEVAFRAFFSQNSKNIIIKRED

GSTILFSSEIVQLLIDSIRILQTKKLLLLKDVILKRKSQNIFPMKKEK (fs)

TLFSYLNVY (fs)

KYTFKQFIMQRKEELQKDPSLFIRNFLDLYLKEIPLNENKETIIDEIIINLCALIF 362121

362120 VGTDKIGNMTGVSLYYLSQNSKIQEEARDEVINILKSIIQSQDPQDLFKALTFEDLSQML 361941

361940 LVHSILKVSQRLIPPSE (fs)

361890 AVLGRYANKDIEIGEFLVKKGYLVNTHFIYSQSNPDLYQN 361771

361770 PEKFDFYRWMNQKE 361729 (NO INTRON BOUNDARY)

       QINAFNFTPFSL (fs)

361640 LSRNCIGKYLAIEENCILTNYLLNYNIL 361557

       PNPSQEPIKKTKKNVLFFSRLVSLF*

 

 

>CYP5005A13P 8254665b  877599 bp pseudogene N-term missing, several frameshifts

       EIVKQVHQNHDAFLKMDISQAFVFLFSNSLLCVSVK 366568

366567 EQQRQINFLCQSFYFDEIKNDFPTIKQVSHVISDEITQDLNNSAQKMIQVVK 366412 (fs)

366412 ICQKLTSEVPFRALFSQTSKDITIKRQDESTILFSSEIVQLLIDLICILQTNKLLLLK 366239

366238 YVILKRKSQNIFPMKNETSFTCLNVYKVIMQRKEELQKDSSLFKRNLLDLYLKE 366077

366076 IPLNENKEITIDEIITNFCALFFAGTDKIGNMTGVSLYCLSQNPEIQEEARDEV 365915 (fs)

       INILKFFVQSQD (fs)

365879 KTLKTYLKHLHSK

365840 IYQMPLVHSILKESLRLIPPAAAVLGRYANKDIEIGEFYVKKGYLFNTKLxx 365691 (fs)

       SQRNPDLYQNPEKFDLYRWMNQKE (no intron boundary)

       KINAFNFTPx (fs and small deletion)

365545 PRNCIGKHLAIEVNCILANYLLNYNVLPNPSQKILKRQKKINGFFLDNLVYFELRK* 365375

 

>CYP5005A14 8254665c  877599 bp 4 genes 29% to CYP694C2 complete

368910 MISGFATFALGLLLTLLLFAFYIVILPNIKLNRIKRMYGDKIKIIYHFGVGLLGEYRKGLKNKNDSL 368710

368709 SFLREMALTEKSKVKAYSFNIGLKLGYCFLDP ELIKQVHQNHDAFSKMDISQAFVFLFS 368533

368532 NSLLCVSGKEWQRQRHFLGKSFHFDEIKNYFPTIKEVSLVVSDEITQDLKNSSQKKMIQV 368353

368352 VKTCEKVTSEVAFRAFFGQTSKNITIKREDGSTVLASSEIVQLLTDSFRILQTNKLLLLK 368173

368172 YVILKRNSQNIFPMKEEKSLFARLNVFKDTCKQVIMQRKEELQKDPSLFKKNFLD 368008

368007 LYLKEILLNENKEITIDEIIDNFCALFFAGTDTTGNMTGVSLYYLSQNPKIQEEARDEV 367831

367830 INILKSRTQSQDPKDLFNSLAFEDLSQMP 367744

       LVHSILKESLRLIPPAAAVLGRYANRDIEIGEFSVKKGDLVNTHFIYSQS 367594

367593 NPDLYPNPEKFDPYRWMNQKE 367531 (2)

367445 QKNAFNFTPFSLGPRNCIGQHLAIIEGKCMIANYLLNYNILPNPSQ 367308

367307 EPIKEAKTIYGLCPDNLVYFELRK* 367233

 

>CYP5005A15 8254543 195986 bp 3 contigs 28% to CYP694C2

82913 MSLLITIICIILLLLTCIAFKIVVYPQIQIFYFKNKYGKDVEVLYHFGSGLLQEFKQSLEKEG 82725

82724 DSFGRIRKIMQINPDAQAIIHNNASGKMAYSFIDPDLIKQVHQQPEYFEKQIAHPAIISL 82545

82544 FSKSILYLSGKKWQKQRQFFGKSFHFDEIRNYLPVIKQTSQDVFSSIKQSLNNSLQQEVQ 82365

82364 VVKTCEKVTSEVVVKVFFGSTSQNIIVTRSDGTKLPVAQELIEIMVDSFRVFRNSKFAFV 82185

82184 KFLLLRQRSFQILPTQSEKDLAQRIHNAREECNQIIQDRKQQLLNDPT 82041

82040 QFKSNFLDMYLKEIIINKNEDIKEEYIIDNFLALLFAGTDTTGNMTGAAFYYLSLNTDIQ 81861

81860 SRAREEIISILNRKGQQNNTEELYNSM 81780

81779 TFEDIANLNLINSILKESLRLIPPAAAVFPRRVIKDIKIGNFQLKKGDAINTQFICNHYN 81600

81599 PQIYKNPDIFDPNRWMEQKE (2)

81473 QNLFNFTPFSLGPRNCIGQHLAMIEGKCMIAYVLLNFEIIPNHEVQVKKEIKLTYGLLPDNLVYFKKRQQI* 81258

 

CCAATACAATTTCTAGGACCTAGTGAGAAAGGAGTGAAATTGAATAAGTTTTATCTTAAT

                               P  T  F  N  F  L  N  Q

ATATTTAAATAATAATTAAAATACGAAAAACTTTTAGAAAAATAAATTTAAATTAATTAC

TCTTTTTATTCCATCCATCTGTTTGGATCAAAAATATCTGGATTTTTGTAAATCTAAGGA

             M  W

 

>CYP5005A16 8254716b 1191929 bp 2 genes 30% to CYP694C3

374097 MSFLAYFALIATVTVIYLAFRIVIFPKLKIKKLKNKYGDQVIVLYNFGSGLI 373942

373941 QEKKKSLKQQGDSLGTIRKLLWSNPKAKAIIHNNASGQIAYSFIDPELIKQVHQQPEFFV 373762

373761 KQVNHPAVLYLFSKSILCMQGKEWQKQRQFFGKSFHFEEIRNYLPIIKETSQSVFGSIKE 373582

373581 NLSDSSQIEIQVVKICESVTSEVVFKAFFGFNSQNLINGSQNSLVQEIVSIMTDSYRIFQ 373402

373401 FNKLAFIKFLIFRKRSYDFFPLQCEREIVQRLRKVKEECDKIVQQRKQQLLNNPSL 373234

373233 YKYNFLDQYLKEILLNKNQDIREENIIDNFLALLFAGTDTTGNMVGTSIFYLSQNPEI 373060

373059 QNKARDEVIEL 373027 LRKNHQCNKMQ

372993 DLYEQMTFEDITNFNYLNSVLKESLRLIPPAVNVFPRKVIKDLQIGKFQLKKGDAVNTNF 372814

372813 ICGHYNPQVFKNPEQFDPSRWMNANE 372721 (2)

372681 QNQFSFTPFSLGPRNCIGQHLAMIEGKCMLAYILLNFEIIPNLKVKVRKEAKMSYGLI 372508

372507 PDNLVFFKKNCKNYD* 372460

 

>CYP5005A17 8254693 27987 bp 1 contigs 31% to CYP694C2 complete

3973 MLVESLLYITLFFALGIIFLIFRAIIYPKIRLLQLKKKYGDKIQIVHHYGTGLPM 3809

3808 EYRKGFVERNDSMAFVRKMAAEKTNKMQVFGFNIGYNVGLSFLDPELIKQVHQSHDAFQ 3632

3631 KINISQAFNYLFSDSLFYATGKNWQRQRQFLGKSFHFDEIRNYLPSIKELCSKVTDQLN 3455

3454 KDLKANPQQEIQVINVCEKITSEVVFRVFFGSTQENMMITREDGSQLHLSEELVSLIMDS 3275

3274 FRILQQNKLLLLKNLILKSYSFKIFPLKEEKWLLNRLVQVKAACEKVVLKRKEELQ 3107

3106 QNLDQYKNNFLDLYLKEMIENKNSGITYDEIIANFCGLFFAGTDTTGNMTGVALYYLSL 2930

2929 NPQIQKEARDEVIRVISKKNQDIKGNDLFSSLQFED

2821 LSNMNLCNSILKESLRLIPPAFGVFPRYVTRDIKIGQYELKKGDLVNTHFIYNQSNPQTF 2642

2641 TNPDKFDPYRWMNGNE 2594 (2)

     SQNAFN

2521 FTPFSLGPRNCIGQHLAMIEGKCMLANFLLNYDILPNESQKVQLEMKIIYGFSQDNLVYFKKRQQ* 2324

 

>CYP5005A18 8254583a 93101 bp 2 genes 29% to CYP694C1

74438 MIVLIYSSLIIFATIIAIGFYIFVIYPTMRFLVMKYKYGHKVKVFFSIGTGIL 74596

74597 KFYNQSLIQNKDSIAFIRNMHQKGLQAIAFNLGTKVGLSFLDPELQKQVHQSYQSFR 74767

74768 RIDAIAAMTYLLGNSMIYAADKEWKRQRLFLGKFFHFDEIKNYYPTIQETTKQVLQGVNE 74947

74948 QLNQKEQIEIKAVYIIQKITSEATFKMFFGLTTESLMITRKDGTQIAFSDELALLIRHSF 75127

75128 QLLQTDKIALLKWILLKRKSTDYFSTKGEQDLMDRLKNLRQGFIKIVSKRKEELSQ 75295

75296 DPSKAKNNFLDYYLTEMVQNQYSGITYDEIVDNFTGIFLAGTDTTSNMVGSALYYLSINP 75475

75476 EIQQKAREEVVQVVSSKQDFKKHDQLASQLRFEDITHFNFLNCILKESLRLIPPVIQITS 75655

75656 RIANHNMKIGEFDIKQGDLVTTNFAYNLSNPDIFPNPELFNPQRWMTANN 75805 (2)

75861 YSETFSFTPFSQGPRNCIGQHLAMIEGKSILASILLQFEILPNPNTEVIMDM 76016

76017 KMMYGFQNDNLIYFKKLKQ* 76076

 

AAATTTTGCATATAACCTATCAAATCCTGATATATTCCCAAATCCTGAATTATTTAATCC

TTAAAGGTGGATGACTGCAAATAAGTAAAATAATTTGTCTTTTTATAAGTATTTTATTTA

    R  W  M  T  A  N  K

AATCTAAATTTTATAATAGTTATTCCGAAACATTTAGCTTTACTCCTTTCTCACAAGGTC

                    Y  S  E  T  F  S  F  T  P  F  S  Q

CTAGAAATTGTATTGGCTAGCATCTTGCTATGATAGAAGGAAAGTCTATTCTTGCTTCTA

 

>CYP5005A19 8254583b 93101 bp 2 genes 29% to CYP694C1

78030 MIILFYIILGIIATILAIGVYILVIYPIIRFSEMRRKYGDKIKVYFSIGKGIFNFYQQD 78206

78207 LIKKNDSLAFIRSMHHENLKAFAFNYGTKVGLSFVEPEHQRQLFQNIHSFSKADGPI 78377

78378 AITFLFGHSIAFAPEKEWRRQRQFLSKSFHFEEIKNYLPLIKQSCEKIVKKLDTKLLQND 78557

78558 QIEVKVVGISQEVTSEISYKIFFGSASENQMKITKKDGTQIPISEEIVQIIIDSFTLLKT 78737

78738 DKLALIKWILLKRNSVNYFPTKGEKEILIRLQALKQVQIDVVTKRNEELIKDPSQA 78905

78906 KKNFLDQYLIEMIQDKNSGITYDEIISNFAAIYFAGTDTSGNMVGVALYYLSRFPEIQQQ 79085

79086 ARAEIIKVLSSKKEIMKTEFSQLEFEDIQNLNFIHCILKESLRIMGPSISSQIKQADH 79259

79260 NIKIGEFDIKKGDLVTAHYFYNHTNPSVFENPDDFKPERWLDSNN 79394 (2)

      LKQQSNFSPFSQGPRSCIGQHLAIIEGKCLLASILLQFEIIPNT 79578

79579 SQQVIRELKSIYTFQNDNLVFFKKIKQINQKTEMMI* 79692

 

ACAATCACACTAATCCTTCAGTTTTTGAAAATCCAGACGATTTCAAACCTGAAAGATGGT

                                                     R  W  L

TGGACTCAAATAAGTATAATATTTTTTTCATATTTTTTATAATTTATTAATTATTTTTAA

  D  S  N  K2

AATAGTCTAAAGCAACAATCAAACTTTTCTCCTTTCTCACAAGGGCCAAGAAGTTGCATT

      L  K  Q  Q  S  N  F  S  P  F  S  Q

 

>CYP5005A20 8254626 31828 bp 2 contigs 30% to CYP694C1 complete

27801 MIFLIYISLGIIASIIAFVSYILILYPIFRFSQMKRKYGDKVKVFFSIGQGILKFYNQDLAN 27986

27987 KNDSIAFIRNMHEPNLKAIAFNFGTKVGFSFVDPELQKQVLQNPHSYSKVDGPMAITF 28160

28161 LFANSIAYATEKEWKRQRLFLGKSFHFEEIKNYFPLIIETCQKTIKKIDQKLIERETINI 28340

28341 QILKTCQEITSEVSFKVFFGSNNENLTIITRKDGSSTTISNELVQVLIDSYYLFQTDKVA 28520

28521 LIKWILLKRKSTSFFLTKGEEQLLNRLKALRQACTAIVSKRKEELTQDRLLAKKN 28685

28686 FLDQYIIEMAQNENSGITYDEIIDNFSAIYLAGTDTTSNMAGVALYYLSLYPEIQQQARE 28865

28866 EVIKVLSVKLKENKTDQLFSLLAFEDLQNLNLINSILKESLRLMPPAIQSQTKFANQDIK 29045

29046 IGEFDVKKGDLVTNHFSYNLSNPEVFPNPDVFNPNRWM 29159

      AANI (2)

      HNEMANFTPFSQGPRNCIG

      QHLAMIEGKCILASLLLQYEILPNPAEKVVRQMRIVYGFQYDNLVYFKKINV* 29441

 

ATTTATCAAATCCTGAAGTATTCCCTAACCCAGATGTCTTTAATCCTAATAGATGGATGG

                                                  R  W  M  A

CTGCAAATATGTAAATTTTTTTATTTCATTTTAAAGTATTAAAGTATTCATTTTATATAA

  A  N  M2

ATAGTCACAACGAAATGGCAAACTTTACTCCCTTTTCATAAGGGCCAAGGAATTGCATTG

     H  N  E  M

 

>CYP5006A1 8254798 983259 bp 23 contigs 28% to CYP689A1 no introns

753493 MSIIQLFIGFIIGTIIYQTVIKTLYLYYVYKRRYGKDIIVLFYPVIGWYYFVNQ 753654

753655 SFKKYGDSHQFYKTLIKENPNAKAILALSGFTNIITIIDTNWVKKCLTNQHLFFKKDGPF 753834

753835 GFDLLFGEGLLFKNKQEWKNQRAFLQGNFHFNALMERFQMFKDCNLQFCSEVPSNGEYVK 754014

754015 LDYYNEAAKVSSDIISYSFMGRSINEIKSNQKSFSIELINLVKDCFVYRITNPF 754176

754177 FMIKASVFGPRKSTTSFNSSTEKGLLSRIKSINNCLYNIIEEREQQLKK 754323

       KYKNHNEMVPANFLDMYLKEIWKQEEENKESKNKITLT

754438 RKEIVNQYITVIFAGTDTTSHLIGNILFELSRNPDVYQRLQKEIDDNILSFENMKYEDLN 754617

754618 NLPYLRLVIKETQRLYPAIFYLFGRFMDKSYTDEDLFISKDLEVQIRLVSQQEKQSGLFS 754797

754798 DIQKFNPDRYNSTDKEDPYDFIPFSAGPRNCIGQYLALIETKVAIVYALKKFHLKKRDDY 754977

754978 ELKMFQEFVYSPVEKDLFYIKRK* 755049

 

>CYP5007A1 8254605 384764 bp 4 contigs 29% to CYP694C2

93611 MIKYIVYLVALILTLLSLIILRQAYKSLRLCLIYRKKYGNDVKILFYPILGIGYFMNQ 93438

93437 SFEKHDNSYEWMKRIVRENPNVKAILILSGLFDIIILQDEKWVKNFSQDHILYYKN 93270

93269 DGNFGYTFAFEKGLFYSSFGDWKRQRIFLNSSFHFESLKSYVPIIKEQTKKLLANLKEED 93090

93089 GFVKLNIFQELQKITSEIITISYFGKSLVNVKCKSGIELYKENNDIIQDSYIYR 92928

92927 FTSPYFILKACIFGAKKASMIYHNQAEKKLIARMKDIFETYEKIIDEELEKIKMSKI 92757

92756 DPMDIQPKTMIELYLKEYLIQQ 92691

      QNRATMNPNDIFE

92651 KKEIIHQFLTFYFAGSETTSHLVAMTFYELTKNQQIYDKLMKEINENAKDFDKVEHSDLS 92472

92471 KFQYLDIVFKETGRLHNAIAFTSPRVTDQDYIKEDMYLNKDLLVCHRLTQYSQQCEVLKD 92292

92291 SDEYIPERYSRNTKGFSSFDVIPFSTGPRNCIGQHLALIKSKFLVIYFLKNFELKELPGY 92112

92111 KLRTMQKLSNHPIDEEIVLIKRKKQ* 92034

 

>CYP5007B1 8254600b 816542 bp 2 genes 30% to CYP694A1 50% to 8254600a complete

505285 MIFNLLIGFGLIILLILLRITYKTARLYLIYKNKYGNQVKILFYPILGLAYFMQNSLKLHDDGY 505476

505477 QFLKQIVKENPKVKAIIILSGFMDIFIANDNEWIKYLSQDQKQFYKDDAFFGYDSLH 505647

505648 GEGMPFCDYNRWKKQRLFLNSSFHFDSLKQRVPILKDEINNFCSFLKPSDGFVKINCYNQ 505827

505828 LKLITSEFFTRTFFGRPFKDVKCKNGKNISLEICDICSDGYTLRISSLYFIIKA 505989

505990 AIFGCAKASHILQSQAEKKIMNRIRDLYSIVEGVVDEKISEMSKMKDLIQYQPTN 506154

506155 FSELYVKEYLLQQKNIQNYKKEDIISKVELVQQYLTFFFAGTDTTAQLTSMCLYEISKNPSI

KEKIFSELKQCGKNYEQLESSDLQKLPYLDAIIRECNRLYSVVAFLFPRRTDTNYQRGDL

YIDKDLLATNRLIQYADEKEFFNESEKFIPERFLNQKNVQQLSPYDMIPFSSGERICIGQ

NMAIMEAKFLILYILTHFEIAEVPNYKLRMSQFMINQPVDLEIIQIKRI*

 

>CYP5007C1 8254600a 816542 bp 6 contigs 2 genes 30% to CYP689A1 complete

508388 MINLIKISFYLFLTVFSILIAIQVWKTLRIYLIYKKKYGNQIEFMFFPLLGLIYFM 508555

508556 QKSLKENDDGAHFIKKVCREKPHVKAIIILSGLFNYFIAIDNEWIKFLSQDQDRFYKN 508729

508730 DGIFAFSKMYGDSLLFTWSKMWKRQRALLNSSFHYDSLKARVPIIKDEAKNFCNFLKPE 508906

508907 QGFVKVDMFDETQKITSEIITRTFFNREFKNKFISNGKSLTQEIGDITQEAYNY 509068

509069 RLSSPYFILKSSFLGNTGASFILNSPTEKEYLQRMDQILKIIEQDVDQSIQEVIST 509236

509237 GIDIFEYQPRNFIEIYIKEYLIQQ 509308

       KNLKSIPQADIITKKEIVQQYLTFYFAGTETTAHLISMTLYALA 509440

509441 NNKQVYEKLMKEIKENIKQYDEFQHNNLLQLNYLDLVIKESTRLYVAAPFIFPRVTNQDY 509620

509621 KRENLFIRKDLLVMYKIAQYSEDQSQVFKDSDLFIPERFLNGKINNPYEMIPFSSGSRNC 509800

509801 IGQHMAILEAKYILIHILMNFEIEPVPNYKLSIIQKVTNAPLDQKLVLIKRK* 509959

 

>CYP5008A1 8254716a 1191929 bp 2 genes 37% to CYP694C1 = EST seq 5 and seq 4

       MIGYIAIALVLLFV

414374 YRFIIKNLMIYFTYKSKFGDKVVFMFYPLLGNFGISRKSFRQYGDSLHILKTIQRTKPDV 414553

414554 KAVMVFNGFIIQIIIIDTQLQKVFLQNTKPYYKVEGPFGARDAFGQGLVVAEEKVWTRQ 414730

414731 RNFLSNSFHFNALKNRVLSSRRLPRSSWATLPSDGKTPITIIEELQNITSE

       VVIQTFFGENLKGMTVNGLQPSVEISKIIGDGFSYKANSFAYFLKLMVFGQEKASRVLNT

       TFEKNFLKRVENYNQFIEGIVDKRLSELEKLTDTSKVDENFLNLYLLEYIKQQKALKENPKIYADYEIIP

415274 KREIVHQFTTFFFAGMDTTANQTGICLWVLAQHPELQQKIRAEIDSVIQTFDDLKHEDLN 415453

415454 KLEYFNAFFKESLRVYPTAPQVIPRVSARDHMVDDFPIPKGAFVSNLTIQYNEQKFPLLC 415633

415634 KDIDTFNPDRFLDKNIIQDHFSFIPYSAGPRNCIGQHLALIEAKIMIAYILKNYVVLPNE 415813

415814 EHQKVRFNHLFLYTSYERDIIKLKKLTA* 415900

 

>CYP5008A2 8254176 54804 bp 4 contigs 34% to CYP694C2

32112 MLGYLLIPLFLYFVYRFIIINLIIYFRYKIKFGDKVAFLFYPLLGGLGISRKSYR 31948

31947 KYGDSLHILKQIQRNKPDMKAVVIFNGFNIQITLIDTGLLKVFLQNTKNYYKAEGPFGAR 31768

31767 YVFGLGLVKSEGKIWARQRSLLSNTFHFNALKDRVPVIKEITKEYLQKLDSYKDSAVPI 31591

31590 IEELQNITSEVIIQTFFGENLRDKKVNDLQPSVEISKIILEGFKYKANSFPYFLKLMVF 31414

31413 GQEKAFRVLNTQFEKDFIKRVENFNQFIEEIIEIRLNQLQNTYETSQVDENFLNLYLL 31240

31239 EYIKQQKALKENSKQFADYELIS

31170 KREIAHQFSTFFFAGMDTTANQTGICLWLLAKYPEIQKRVSDEINQAVNNFDQLKHEDLS 30991

30990 KLDYFNAFFKESLRMYPTAPHIIPRVSACDHFVEDFFVPKGAIINGVTIQHNEQKYPSLC 30811

30810 KEIDTFNPDRFLNQNVIQDHFSFIPFSAGPRNCIGQHLALIEAKIIIIYMLKNYVIIPND 30631

30630 ELENVEFNHLFVYTSIQRNLIKLKKIKSE* 30541

 

>CYP5009A1 8254437 547543 bp 2 contigs 29% to CYP691A1

118190 MITTILIALTFIGIALLLFKAFIQPLYRISFYTKQGLKQKFFVPFLGL 118047

118046 LYQMIKDMNGKQKDAMYTLKHLPQNIKEQNLDGFVANLGSTPVIYLLDATLIGEFLQKQ 117870

117869 TEYYQKSEFATKMSILLFGKGVVFSEGEEWKTKRKLLSNTFHYNLIQDQVKLVLNVTEKI 117690

117689 FEQQDQNSVCFLQTVEKIGGDVGLMSFLGADINNYTLFGQEPTAGINQYINQLM 117528

117527 SYMKNPLTVLMGGYLFNNQVRGSDKKVKQGSQEIKNLLKKVVLDVIDRTKNQMNDGKQNV 117348

117347 DTSIISLMLKNGQLKTEEEIDELVILTLNFYFAAKDTVSRLVANLMYYIGKNDHIYNKLE 117168

117167 KEILSFKDEEINIENMKKYNYLEAVIKESLRVCGPSPFLIQRTALQDHNL 117018 (1)

116968 GKYKIQKGTDVNCVFIYNFYNEKYFENPFEFNPERWLDQAQLEKIKINPFSYLPFSGGS 116792

116791 RNCIGQYFAMMEIKAIMIYFMRTYEKFQIPENFQYHIIIKTTIETKDHLTCSLKKR 116624

 

CAGTTCCTTTCTATATTTTGTATTTACCTATTTATAAAATTTAAAATAAGAATTTTTATG

                   Y  K  G  I

AAAATATTCAGTTTACCTAAATTATGATCTTAAAGTGCTGTCCTCTGAATTAAAAATGGA

              G1 L  N  H  D

 

>CYP5010A1 8254362 275164 bp 1 contigs 26% to CYP694C2

80265 MGLLQLLVLVCLVFIILLFLKYYVLTKIAMRYYSKQGIKEYKYYPFTGIASEWIDQSLDD 80086

80085 SLKTIKYIYFDDKFKSQDAILSNFLDKALLIFINPNLINEFLQNQQNYVKFENAL 79921

79920 TGFKQFLGNGIAFEEGEKWKSKRRFVSQLFKFDVVVNYLSSIRKVTQMMLNRVEDSNFQ 79744

79743 VQKQFEALNSAISVQIFMGANIDEYKIDDKNSYDAIDQMCSDIFYHYQN 79597

79596 ITTAFLGEKFVKLKLRQSDKVICDKVQYVRQQMKNIILDTINREKINIQQNKESEN 79429

79428 ISIIALLLKQGHQIQEEKDIIDVLELTFSLYFASRDTLASVLTIMLYYLLKNPDCFQTVE 79249

79248 KEILSLPQNYTFSDIQKLDFLQACFKETIRINTSAPIILMREAKVDHKIGNYKIQKG 79078

79077 TFVNVGLISNQLNERYFKDPLTFNPNRWFDQSTENVIKENPFVYLPFSGGMRNCIGQHLA 78898

78897 NLQAKIVLIEFIKKYQ 78850

      QPELPKDFVLELVIKQNYQIKNPFILNLQKRQNN* 78745

 

>CYP5010A2 8254542 59795 bp 2 contigs 27% to CYP691A1 complete

8155 MTILDFFGIS

8125 LLTFVIFLVLKYFVFTKMAMRFYANQGINEYKYYPISGMAREWMDKSSNDNLKTVKYIK 7949

7948 FDDKYKGEDAILSNLLDRALLIFIEPNLINEFLQNQQNYVKFEAPLIGFKQFLGNGI 7778

7777 ALEEGEKWKIQRKFISKLFKFDVVVNYLSSIRRVSKTMLDRIEDSNFQVLKQFEAINSAI 7598

7597 SVQIFMGANIDDYKIDGKNPYNAIDTMCAEIFHHYMSIIPIIFGEKFVK 7451

7450 LKLRKSDRIICEKVQYVRQQMKNIILDTIKREKFHIEQNKEQENINIIALLIKEGH 7283

7282 KMEDEQEITDILNLTFSLYFASRDTLASVLTMMLYYLIKNPDYFKQVEKEILNLPPNYTF 7103

7102 SDIQKLDFLQACLKETIRINTSAPILFTREAKVDHKIGNYNIQKGTYVNVGLISNQL 6932

6931 DEKYFKDPLNFNPNRWLDHSTENMIKENPFVYLPFSGGMRNCIGQHLANLQAKIVFAEFI 6752

6751 KKYQLPELPNDFVLEFTLKQNYQIKNPLILNLKKRQSN* 6635

 

>CYP5010A3 8254220 4918 bp 2 contigs 27% to CYP694C2 complete

1829 MAILELIGAS

1799 LLAFIIFLVIKYYVLTKMAMRFYANQGIKEYKYYPVTGMAKEWMDYSSSDNLKTIKYI 1626

1625 KFDDKFKGEDAILSNLLDRALLIFTNPNLINEFLQNQQNYIKFEAPVVGFKQFLGNG 1455

1454 IALEEGEKWKSKRKFISKLFKFDVVVNYLNSIRKISKMMLDRVENDSNFSVQKSFEAINS 1275

1274 AISVQIFMGANIDDYKIDGKNPYDAIDTMCFEIFQHYLNLIPGILGERFVKLK 1116

1115 LRKSDRIICEKVQYVRQQMKNIILDTIKREKSHAEQNKESENINIIALLIKE 960

 959 GHKLDEEKEIIDILELTFSLYFASRDTLASVLTMMLYYLIKNPDYFKLVEKEVLSLPQN 783

 782 YSFSDIQKLDFLQACFKETIRINTSAPILFTREAKVDHKIGNYNIQKGTYVNVGLIS 612

 611 NQIDETYFKDPLTFNPNRWLDHSAENMIKENPFVYLPFSGGMRNCIGQHLANLQAKIVLV 432

 431 EFVKKYQIPQMPNDFVLEFVIKQNYQIKNPLILNLTKRQAY* 306

 

>CYP5010A4 8254598b 267927 bp 3 genes 27% to CYP694C2

267521 MAFLELIGASILAFIIFLVIKYYVLTKMAMRFYANQGIKEYKYYPVTGMAKEWMDYSSSDNLKTIKYI 267318

267317 KFDDKFKGEDAILSNLLDRALLIFTNPNLINEFLQNQQNYIKFEAPVVGFKQFLGNG 267147

267146 IALEEGEKWKSKRKFISKLFKFDVVVNYLNSIRKISKMMLDRVENDSNFSVQKSFEAINS 266967

266966 AISVQIFMGANIDDYKIDGKNPYDAIDTMCFEIFQHYLNLIPGILGERFVKLK 266808

266807 LRKSDRIICEKVQYVRQQMKNIILDTIKREKSHAEQNKESENINIIALLIKE 266652

266651 GHKLDEEKEIIDILELTFSLYFASRDTLASVLTMMLYYLIKNPDYFKLVEKEVLSLPQN 266475

266474 YSFSDIQKLDFLQACFKETIRINTSAPILFTREAKVDHKIGNYNIQKGTYVNVGLIS 266304

266303 NQIDETYFKDPLTFNPNRWLDHSAENMIKENPFVYLPFSGGMRNCIGQHLANLQAKIVLV 266124

266123 EFVKKYQIPQMPNDFVLEFVIKQNYQIKNPLILNLTKRQAY* 265998

 

>CYP5010A5 8254598a 267927 bp 2 genes 27% to CYP694C2 1aa diff to 8254598b 3aa diffs to 8254220

258120 MAFLELIGASILAFIIFLVIKYYVLTKMAMRFYANQGIKEYKYYPVTGMAKEWMDYSSSDNLKTIKYI 257917

257916 KFDDKFKGEDAILSNLLDRALLIFTNPNLINEFLQNQQNYIKFEAPVVGFKQFLGNG 257746

257745 IALEEGEKWKSKRKFISKLFKFDVVVNYLNSIRKISKMMLDRVENDSNFSVQKSFEAINS 257566

257565 AISVQIFMGANIDDYKIDGKNPYDAIDTMCFEIFQHYLNLIPGILGERFVKLK 257407

257406 LRKSDRIICEKVQYVRQQMKNIILDTIKREKSHAEQNKESENINIIALLIKE 257251

257250 GHKLDEEKEIIDILELTFSLYFASRDTLASVLTMMLYYLIKNPDYFKLVEKEVLSLPQN 257074

257073 YSFSDIQKLDFLQACFKETIRINTSAPILFTREAKVDHKIGNYNIQKGTYVNVGLIS 256903

256902 NQIDETYFKDPLTFNPNRWLDHSAENIIKENPFVYLPFSGGMRNCIGQHLANLQAKIVLV 256723

256722 EFVKKYQIPQMPNDFVLEFVIKQNYQIKNPLILNLTKRQAY* 256597

 

>CYP5010B1 8254645 1076301 bp 3 contigs 27% to CYP696C1

1020997 MIFLIIQALVISFVLYLVGKFIVLPKLRMQFYRNQNIKEYNFFPFYGWFWDSILQK 1021164

1021165 TNKHKDCIHQIKHISESADFKDQDIILSNLNSKASLLIVNPDLISEFLQCQQHYIKYEL 1021341

1021342 PFHSLNKLIGTGLAKHYGDGWKKSRKILSNMFTFEQIALQLGSIRELSKRYISQQNEKSF 1021521

1021522 NIQKVFEMTAGASDLSIFLGFDMDEYKNKDGKTLGELIGSFANDVYLRDVGMF 1021680

1021681 SMIFGTKATELGLRKSDKDLNERGKNLRKFMYEIILECKKREQQPGYQPKYPSFI 1021845

1021846 HHLVQENLLNTEEELEDLLAISFNLFLAAKETTSKIAANVVYYAIKYPEQYKKIQEEIKT 1022025

1022026 YVPNDDYTFADLQKMNILQANVKECLRYDTSGPFLFQREVVQDHTLGKYNIKKGTLVNCG 1022205

1022206 YVVNFFNEKYFENPFSYQPERWLDSNNDSKMKNSHLVYIPFSGGARNCIGQHLAVMQIKA 1022385

1022386 MAIEFFKQFKLPEIPKDFDTEKIMTLSYGFKNDLLLNLERI* 1022511

 

>CYP5010C1 8253873a 25773 bp 1 contigs 3 genes 26% to CYP694C3 complete

 9357 MLLELIVLAIFSIFLYYFAYYVVLPKYRMQFYKNQGLKEYEFRPVYGWLTDITLMKRNQ 9533

 9534 WNDCLYYIKRLSQNPDYKDQDIILSNLADQALLTLINPELINEFLQRQNEYIKYQLPVQG 9713

 9714 IEDFLGNGLSRQYGEKWKNGRKVISNMFTFENITAQLQRVKEISRKYINNEDTEKFNIK 9890

 9891 YVFEKISGSQNLQVLLGAEVEKYKDSEGRHLGECVQQLSNDICALYISPFAMI 10049

10050 FGQKILKWNLREADRQIQKRMKDMRELMTKILIDCVNHEKDPSYEAKFPSYI 10205

10206 TFLLRAGYLQTDEEINDLRATAFSIFLGAKESTSKLANMTIYNLIKHPEQYKLVYDEIQ 10382

10383 KYVSHDDYNYLDIQKLNHLQANIKETQRTDPSSAFLRLRVAQNDHTLGKYQIKKGSIINV 10562

10563 GYLANFYNEKYFRDPFTYNPSRWFDKQEDLKIKENHFVFIPFSAGSRNCIGQHLATMQIK 10742

10743 VMIVEFIKKFHLPQIPPDFDDEKTMIQSYGFKNPLIVKLQKRS* 10874

 

>CYP5010C2 8254686 541833 bp 6 contigs 28% to CYP694C2

314867 MIFQLISFAICVVIFYYAAYYFLIPKYRMQFYKNQGIKEYKFVPIYGWLKDILFLKKN 314694

314693 QWNDSLYQFKRISQDPEFKDEDILLGNVANQAFLTLIKPELINEFLQRQNEYEKYSVP 314520

314519 LRGVMNFLGNGLARIYGDKWKKSRKVISNMFTFENITAQVGRIKEVSRKYINNENPEKF 314343

314342 NIKYVFEKISGSQDLQVLLGAEVEKYKDSQGRHLGECIEQLSSDICIHY 314196

314195 ISPFAMIFGQKILTWNLRESDRYISKRMKDMRQFMTQILTDCVNREKDPTYQAK 314034

314033 YPSYITFLLKAGYLQKEEEINELLATSFSLFMAAKDTTSKLTSMAIYNLIKHPEQYKLV 313857

313856 FEEIQKYVSHDDYDYNDLQKLSHLHANLKETLRMDPSVPFILSRVAQADHMLGKYKIKKG 313677

313676 TIINIGYLCNLYNEKYFKDPFNYNPSRWLDNEEDMKMKENHFVFIPFSAGPRNCIGQHLA 313497

313496 MMQMKVMIVEFIKKFYLPQIPKDYDDEKILTQSYGFKNPLTVKLQVRN* 313350

 

>CYP5011A1 8253811 495369 bp 4 contigs 28% to CYP694C3 complete INTRONS?

369901 MIFVAIKILFFLILCLSFFKMVIYPVANLIRYRIQGVSIIKYYPISGLWGILQKDFKKYGDSLRFFN (2)

       VEKKNIQEVKLYKQIGGNIALYLIDPEMIKEFY

       SLNCYQKSPFIVKMFSRVLGNGFLFDEISHNRKRNIFSKLFHFDNLKQVVPKIEQITQNW

       VIQKCPLVKKTVQRIDMIDLSQHISGDIITQQFFGINMNNQRYKEKTIPQNLYHLNTATC

       NQMRTIRYILTGTYTFDKGWCRFDRVVNEDIKNIKQILIDALYSRINKQ (1)

       NKNSIIDQIIKNGYILDKKSSDMSKQEKEELTKSNKILPEELLEEYLTFYLAGMETT (1)

368711 GHTVGFCFYFYCKYPH 368661 (0)

       KLRKEINQHFSHDQPITYEKISQLNYLDCFIKEIFRFYGPTSTLMYRI (0)

368353 ATQDHKIKDIEIKKGSMVNLAILTNFYNSKIFKNPEEFNPDRWLEKNQINSYAYLPFSAG 368174

368173 RRNCIGQHLGLLTIKIVFSVILRDFEVTLTNPDYQMDRLQKFVHGPSSPIYCNFERLKQKN* 367988

 

POSSIBLE INTRONS

 

NQMRTIRYILTGTYTFDKGWCRFDRVVNEDIKNIKQILIDALYSRINKQ (1)

NTENSIIDQIIKNGYILDKKSSDMSKQEKEELTKSNKILPEELLEEYLTFYLAGMETT (1)

 

MY CHOICE

NQMRTIRYILTGTYTFDKGWCRFDRVVNEDIKNIKQILIDALYSRINKQ (1)

NKNSIIDQIIKNGYILDKKSSDMSKQEKEELTKSNKILPEELLEEYLTFYLAGMETT (1)

 

NQMRTIRYILTGTYTFDKGWCRFDRVVNEDIKNIKQILID

ALYSRINKQSQQQKQSQSQEDFQVDTENSIIDQIIKN (1)

ELTKSNKILPEELLEEYLTFYLAGMETT (1)

 

CCATTCCAGCTAAATAAAAGGTCAAGTACTCTTCTAAAAGTTCTTCTGGTAATATTTTGT

                         Y0 E1 E1 L  L  E1 E1 P  L  I  K  N

TTGACTTAGTTAGCTCTTCCTTTTCCTGTTTACTCATATCTGAGGATTTTTTATCTAATA

 S0 K  T  L0 E1 E0 K  E0 Q  K  S1 M  D1 S  S  K  K  D1 L  I

TATAACCATTTTTGATGATCTGATCGATGATTGAATTTTCTGTATCTACTTAAAAATCTT

 Y  G1 N  K  I  I0 Q  D  I  I  S  N  E1 T  D1 V0 Q  F  D  E

CTTAGCTTTAAGATTATTTTTATTGTTAACTTTATTTATTTATCCTTGAGTACAAAGCAT

 Q  S  Q  S  Q  K  Q  Q  Q  S1

CAATTAGAATTTGTTTAATATTTTTTATGTCTTCATTAACTACTCTGTCAAATCTACACC

AACCTTTATCAAATGTGTAAGTTCCTGTTAGAATATATCTGATTGTTCTCATTTAATTGC

AAGTAGCTGTATTTAAATGATATAGGTTTTGGGGAATGGTTTTCTCTTTGTAACGTTAAT

TATTCATATTTATACCAAAAAACTATTATGTGATAATATCTCCACTGATGTGTTAAGATA

AATCAATCATATCAATTCTTTAAACTGTTTTTTTAACTAATGGACATTTTTATATAACCC

AATTTTAAGTTATCTACTCAATCTTTGGCACCACTTGCTTCAAATTGTCAAAATGAAAAA

GCTTAGAAAAAATATTTCGTTTTCTATTATGGGAAATTTCGTCAAATAAAAATCCATTTC

CTAATACTCTACTAAACATTTTCACTATGAAAGGACTTTTTTAGTAGCAGTTTAGAGAGT

AAAATTCTTTTATCATTTCAGGATCTATCAAGTAGAGAGCAATGTTTCCTCCTATTTATT

TATATAGTTTTACTTCTTATATGTTCTTTTTTTCCACACTTATTAATTTACCTATGTTTA

                                     S2 I  L  K  G1 I  N  I

TAGCTAAGGATTTAGCATCAGGATGCTCAATTCCTACTTACTTAAAGAACCTAAGAGAGT

 A1 L  S  K  A  D                       K2 F  F  R2 L  S  D

CACCATACTTTTTAAAATCTTTCTGTAAAATACCCCATAAACCTGATATAGGATAGTATT

 G  Y  K

 

>CYP5012A1 8254582b 1075194 bp 7 contigs 2 genes 27% to CYP694C3 complete

626541 MNILIQALLLIAVLLLLMILFFVNKYYIKTRSLMRYYENQGINNSYYFPLVGKYKRMLD 626717

       SIKEHGDCMHYYKQLLTNQPDCKYTVTNL (1)

       AHNVCIYFYDLELLNQLYKCSDLVKHQFGV

626951 EIFKQVMAGLVFNEGEEHSRKRKILSQAFHYQLLQNMIPQLSQIYEEFFEKIRNKRVS 627124

627125 DCIMMGQEIVAESIVRLFFSNSFQGLKFKGKTMTEAVSGIIERVGSMAK 627271

627272 SASYALFGKICFSFGQMASVKKDLKQIKEIYKEIITNRINSTNLEELQHKKQKDLIDYL 627448

627449 IESKGISKIESEESITIDQIIEEFVTFQVAGLDSS (1)

       GHTLGMVLYYLCIFPE (0)

       LVEEIEQTIKTPDDLNTESIKKMKYLGAFVNEVLRYYSVADQIFPRT (0)

       CNKDIQIGDLIIKK

627962 GTQVNVGIIENHYNPKYFKDPHQFNPMRWLDGSLDKLNSYAFNPFSSGKHNCIGQHFAQLEIKVGII 628162

628163 KFLQEFDLELDQDYKFILKFSFLTEPLNPIPITFKSKKQ* 628282

 

>CYP5012A2 8254582a 1075194 bp 7 contigs 2 genes 28% to CYP696C1 complete and boundaries checked

MGLILQIIIFILTIISLLALYLVYKFYVKPKILMG

FYEKQGVKNLCFYPFIGKYSFVSRDIMLKGDGFYSIKDFLAQNP

GCKYSMTNL (1)

623793 AEFVCIQLYDPDIIKEFYNCKHIVKHKFGVEIFQEFMNGIVFAEGEEHTRKRKIISQAFHYQLLQS 623990

623991 MISPLSEIYDEFIDKVKDKKVDDCISMFQDISGESVLRLFFSKSFKDYKYRNKTMTETLS 624170

624171 DLINCAGQMAKSTEYAFLGKGALRFGKLSTLQKTTKDIKKIFKEVITNRIDS 624326

624327 TNLDDLKHKQRKDMIDYLIENQGIDKIESENKITIETIIEEFITFYVAGLDTT 624485 (1)

GHTLGMAMYYLSVYPE (0)

LSEEIKATIKSQADLTSENIKNMKYLQAFVNEVLRHYTIADQLFMRT (0)

ATQDIQLGNLKIKKGTVLNVGNLE

624929 NHFDPKYFKDPHQFNPQRWLDGSLDKLPPYIFNPFSAGKNNCIGQHFAQINIKIGLI 625099

625100 KFIQSFDFEIDPNYKLILTFQFLIEPVQPIALTFKSKKL* 625219

 

>CYP5013A1 8254607c 557789 bp 5 contigs 3 genes 29% to CYP694C1 = EST seq 1

113493 MFIEILIIILFFAALRLVIIPYFKFLKYKKYGDGRFVPLLGELVEIKEAIK

       KYEDVDYFVSHQCDENPDLRLYVVNL 113723 (1)

       GSKIKLRL

113801 VDPDLMRDFFLNQQYYEKDTFYIGNVLRCAPQGIAFVEGEQ WKKARKMFSQAFHFEYLT 113977

113978 SLAPLIEQIASKVFNQAMESSEILANYDPLVYSQKITGQVVIATFFGEQVNE 114133

114134 KKFRGMDLVSALTHMLNLLGEQSMSLQYFLFGADFFKLRLTKSQRYVDDIIKDFRSFMK 114310

114311 DLIEEKIESFQKELKEQGKISVPSILAQIVSTQEITEESKKYLFDDFTTFYAAGTDTT 114484

114485 GHLLAMTFYYITQNPELQKALQREVDANQDQSPQGLAQLNYLNAFLKETLRYYGPANF 114658

114659 TFDRI (0)

114752 ATKDHYLGDIPIAKGTIVVPISQSTHRNKKYYEDPHRYNPERWLDNKQKQIHNYAYMPFS 114931

114932 SGQRNCIGQHLAMIVARITLNKFVKMFEFSCDPNYKLVITQRFMSEPLDPIPLR 115093

 

TAAGATTATATGTTGTCAATCTTGGTAATAAAATTACTTAAAAGTATTAAATTTTACTAT

  R  L  Y  V  V  N  L  G

GTAATCTTAAAAAATAGGCTCTAAGATAAAGCTTAGATTAGTAGATCCCGACTTAATGAG

                   S  K  I  K  L  R  L

 

TTTTGATAGAATTGTAAACAAACAATTTTAAAACAAAAAAAATATTTAAAAAAATGTGAT

 F  D  R  I  V

TTTACAAATTAATTATTTTTTTAATAAAAAGGCAACTAAAGATCATTACTTAGGAGATAT

                               A  T  K  D

 

>CYP5013B1 8254607b 557789 bp 5 contigs 3 genes 26% to CYP694C1

note: the name given previously was incorrect,

this seq is not CYP5013C1.  (3/9/2009, DRN)

117635 MIFYFILIALGIYCYYYLVKPFKIFNKYKKQGEGFFFPLIGEFLLQNSCFEKFGDVDYIY 117814

117815 KHINDNPANQKKMFVENI 117868 (1)

117921 GKNIKIKLTSIDMMKDFYSYLGDKLTIDTFYTQNFKRIFGNGLIFSQGER 118070

118071 WKNMRKMFSPAFHFEYLKNLSSSIEKIVDNALLEHINQDELQNVNIINLIFDMTSQV 118241

118242 VFFSFLGDNLQDIKFKGKSLAQAITHILSALSNQVMELGYLVFGAKYLKLGIF 118400

118401 KQHRELENYLQEFKSFFLDIIKSKKEELQKEFQETGDIQQNCVLSILIKENNIQNLSDSD 118580

118581 LLDNLMTFFIGGTDTTSHLIAMAIYYVCQNEHFYSMTQKEVDYYKQDKSKSMQDLPFM 118754

118755 NAVIKETLRIYGPGNKSFDQK (0)

       VTEDIMINNIKIEKDIVLTSYVN 118934

118935 SVHRDPQIYKDPHTFRPQRWINQETNDLPSY 119027 AFIPFSSG

119052 KKNCIGQHLAMIEARIILFKFFDYFDIEPQKDYVLKIKYGLLPQPVNPLLFNLKTRQNNID* 119237

 

TTATTAAAAAGTTTATGTAACCAATATTGGTAAATTATTTATTTATTTATGAGCAAATGA

                V  T  N  I  G

AATTATTTTATTAAAAAATATATAAATAAATAGGTCCTTATATTAAATTAAGAATATTTG

                                   P  Y  I  K  L

 

CAAATATTCTCTTTGATAGAATCGTAATATTTTTGTTAATTTAAAATGTTGCAAACATAT

              D  R  I  V

TAAGAAACCTAATACTGCAAAACATTATATAATTATTTTATTTTAATATAAAGGCTACAT

                                                     A  T  S

CAGATCACGTTATTGGAGGCATACCTATTAAAAAAGGAATGATTATTACCCCTTTTGCTA

  D  H  V

 

>CYP5013C1 8254607a 557789 bp 5 contigs 3 genes 33% to CYP694A1

54% to CYP5013C2

note: the name given previously was incorrect,

this seq is not CYP5013B1.  (3/9/2009, DRN)

115472 MNIILILVLFLAAFIIVKLFILPYKLHKKYKKYGQGQFVPILGEILEFQQNLQKNQDMEYSLK 115660

115661 HQLDKDPYQKVYVTNI 115708 (1)

       GPYIKLRIFEPEIIRDFFQNQHNYKKFE 115856

115857 FFTGNLMRFVNGIITAEGQDWKSSRKLFSQAFHFDYIQKLAPLIEKSTNTIIDEIIA 116027

116028 KNELQNFNSITCMQELTGRVVIASFFGESLENERLKGRTIVETLSYILNALARQTGT 116198

116199 LIYLIFGKKFFQLGLTKHSREINQLIEEFNSFLQNKIASKIEEIQNDIKQNGNTS 116363

116364 NFSILAQLVSTSQIDQISRQQLFEDFKAFYLAGMDTTGHLLGMTMYYLTQYKDVYSKLQQ 116543

116544 EIDTNQDQSIQCISQLPYLNAVIKESLRYYGPANILFDRI 116663 (0)

116754 ATSDHVIGGIPIKKGMIITPFAISMHRNQNIFENPHAYNPSRWLDKKISEINSFSYIPFS 116933

116934 SGQRNCIGQHLAQMQTKIILNKFIKKFNFSCPENYKLVMAFKFLSEPVDPLCLKLTLRQ* 117113

 

TTAAAAAAAGATGTTTGTTGAAAATATAGGTATTTAAAAATTTTAATTGATTTATTATAT

                V  E  N  I  G

TCATTCTCATATTTTATAAAGGAAAAAATATCAAAATAAAGCTGACATCCATTGATATGA

                    G  K  N  I  K

 

AATCATTTGATCAAAAGGTATGATTTGCATTTTAAAGATTTTAGTAATTATATTAAAAAA

  S  F  D  Q  K  V

ATTAGGTTACAGAAGATATCATGATCAACAATATTAAAATTGAGAAAGATATAGTTCTTA

     V  T  E  D  I

 

>CYP5013C2 8254373 875719 bp 4 contigs 29% to CYP694C3

486130 MIFELILIAVALFAYFKIAKPYFSYLKYRKYGKGFYYPILGEMIEQEQDLKQHADADYSV 486309

486310 HHALDKDPDQKLFVTNL 486360 (1)

       GTKVKLRL

486439 IEPEIIKDFFSKSQYYQKDQTFIQNITRFLKNGIVFSEGNTWKESRKLFSPAFHYEYIQ 486615

486616 KLTPLINDITDTIFNLAVKNQELKNFDPIAQIQEITGRVIIASFFGEVIEGEKFQGLTI 486792

486793 IQCLSHIINTLGNQTYSIMYFLFGSKYFELGVTEEHRKFNKFIAEFNKYLLQKI 486954

486955 DQQIEIMSNELQTKGYIQNPCILAQLISTHKIDEITRNQLFQDFKTFYIAGMDTTGHLLG 487134

487135 MTIYYVSQNKDIYTKLQSEIDSNTDQSAHGLIKNLPYLNAVIKETLRYYGPGNILFDRI 487311 (0)

487422 AIKDHELAGIPIKKGTIVTPYAMSMQRNSKYYQDPHKYNPSRWLEKQSSDLHPDANIPFS 487601

487602 AGQRKCIGEQLALLEARIILNKFIKMFDFTCPQDYKLMMNYKFLSEPVNPLPLQLTLRKQ* 487784

 

TATTCTGTTCATCATGCTTTAGATAAAGATCCAGATTAAAAATTGTTTGTAACAAATTTA

                                                V  T  N  L 

GGTAATTATTTTTTAATGAATATTCCTTACTGAATTTATTTTTGAATCTTATAAGGAACA

G                                                     G  T 

AAAGTCAAATTAAGATTAATTGAACCTGAAATAATTAAGGATTTCTTCTCAAAATCTCAG

K  V  K  L

 

AAAGAAACTCTTAGATATTATGGTCCTGGAAATATTTTATTTGATAGAATAGTAATTAAT

                                          D  R  I  V0

TAATTACTTTTTTATTTATTTTATTATATATAATAAAATTAATAACTTTAAGAATTATCT

AAGTATTTGTTTATATTAAATAAAATATATCTATTAAAAAGGCTATTAAAGATCATGAGC

                                         A  L  K  D  H

 

>CYP5013D1 8254719 598405 bp 5 contigs 31% to CYP694A1 = EST seq 2 and seq 3a

NO INTRONS

491991 MVSYFALAGLAIVLYILYVFIINPYLQYRKYLKWGKGSFYPFVGVFYGAGLRVK 492152

492153 QYKDVDHHLKHMYDDGSDPKIYVENNATGAIIKISDPEYIKEFVQLENKAYQKTTLLID 492329

492330 NIIRLVGQGIIFSEGPQWKKNRNVLSGVFHFEQLSKRVPSIEKITKEVYKRYIDSGNV 492503

492504 KNVDVIELFQEITAEVVIQTFFSNISKDKSFYGMSLSVALSYLINS 492641

492642 VGKQISTPFYFLFRTNFFKWGIRESDRKLNKQIKEFRQMIGDIINERIKEEEELEKRGE 492818

492819 QTTKEDLVYYLKKNNLLGVLSLDEIISEFMTFYVAGMDTTGHLCGMAIWFLTQHPEIKKK 492998

492999 LQEELDANTDYSQNGLLKLPYLNGVIQETQRLYGPAGQLFNRVALRDHMLKDIPIKKG 493172

493173 TIVKPSPCSVHRHPKYFEDPHSFKPERWFNKNTVTPYTFIPFNAGPRNCIGQHLALIEAR 493352

493353 IMINYFMKTFDFESDPNFEMVLNYAFLIEPVDKLRINLKLKENPLIYQ* 493499

 

>CYP5013E1 8254431 715652 bp 1 contigs 28% to 693A1 COMPLETE

603788 MIYIIITLILMSILYKYLVKPYSLYRKYKKYGDGYFFPLFGEAAEYYQNLQKHQDCEYS 603964

603965 FKHYFDKRKPEDNR 604006

604007 IIVSNM (1)

       GTSVKINL

604112 VDPDLIKQFYQKSENYHMDHFFMSNLYRLFGTEEEDQKLHRNFRKQFAPAFN 604267

604268 FEYLQNLSSQITKITDNKINSLIQANNFKDIDPIAFSSEITGQVILLSFLGDTLENLS 604441

604442 FRNMKLPHALSYILNCLCGQMGEIGYILFGAKYTQKGYFKQHKEVEDFMQEFKNY 604606

604607 LRVILKQKIEIYRKEFEQTGNIKQTCILSLFIQNGEIDKIDLDDLLKYFLNFFLAGTETT 604786

604787 SNLVGMTIYYLTQNKEVQQKLQQEIDQNTDYSAEALSNLPYLNATIKETLRYQGPGNS 604960

604961 LLDRI (0)

605028 AVKEHYLEDIKIEKGVIVNVYMKAVHRDSRFYSDPHTYNPQRWLDSKENKSLHPFTYLPF 605207

605208 SAGQRNCIGQHLALIEARIILNSMIKNFNFKINEGYKLILDFKFSPQPLNPLQIHFDRR* 605381

 

CTCATTCAAGCATTATTTTGACAAAAGAAAGCCTGAAGATAATAGGATAATAGTTAGTAA

                                        N  R  I  I  V  S  N

CATGGGTAACTTATATTAATTAAAAAATGTAATTATTAGTAATTAAATTTATTTTAATAA

 M  G  N  L  Y  Q  L  K  N  V  I  I  S  N  Q  I  Y  F  N  N

TAAAATAGGGACTTCTGTCAAAATCAATTTAGTTGATCCAGATTTGATAAAATAGTTTTA

 K  I  G

 

TATAAAAGAAACACTTAGATATTAAGGTCCTGGCAACAGTCTATTGGATAGAATCGTATT

       E  T  L  R  Y  Q  G  P  G  N  S  L  L  D  R  I  V  L

ATTAAAAATTTATTTAATATTTATTAAATCATTTTCATTTAAAAAAGGCTGTTAAGGAAC

                                            K  A  V  K  E  H

 L  K  I  Y  L  I  F  I  K  S  F  S  F  K  K  G

ATTATCTTGAAGATATTAAAATCGAAAAAGGTGTAATTGTAAATGTATATATGAAAGCAG

  Y  L  E  D  I  K  I  E  K