Honeybee
Cytochrome P450s from version 1 of the Apis mellifera genome
These sequences were submitted Feb. 11, 2004 by May Berenbaum in collaboration with Gene Robinson (genome sequencing), Hugh Robertson (genome annotation) and Reed Johnson (P450 annotation).
There are 4 CYP clans in insects: CYP2, CYP3, CYP4 and mito.
CYP2 is the clan with CYP18 in it. Sometimes it is called the CYP18 clan.
CYP2 also has CYP303, 304, 305, 306, 307
CYP3 has CYP6 and 9 in it and CYP28, 308, 309, 310, 317 (CYP6 subfam)
CYP4 has CYP4, 311, 312, 313, 316, 318
mito clan has CYP12, 49, 301, 302, 314, 315
The honeybee sequences have been sorted into these main CYP clan bins. 32 genes have been named.
Feb 20 version 1PM, D. Nelson
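The clan bins listed above can be captured as a small lookup table. The following is a minimal sketch, assuming that the family number can be read from names of the form CYP, digits, letters, digits. The table holds only the families listed above, and the names `CLAN_FAMILIES` and `clan_of` are hypothetical.

```python
import re

# Family numbers per clan, copied from the clan lists above.
CLAN_FAMILIES = {
    "CYP2": {18, 303, 304, 305, 306, 307},
    "CYP3": {6, 9, 28, 308, 309, 310, 317},
    "CYP4": {4, 311, 312, 313, 316, 318},
    "mito": {12, 49, 301, 302, 314, 315},
}


def clan_of(name):
    """Return the clan for a name such as 'CYP6AS1', or None if the family is not listed."""
    match = re.match(r"CYP(\d+)", name)
    if match is None:
        return None  # not a CYP-style name, e.g. 'AmGroupUn.897'
    family = int(match.group(1))
    for clan, families in CLAN_FAMILIES.items():
        if family in families:
            return clan
    return None
```

Families first named on this page (CYP334, CYP335, CYP336) are not in the table, so `clan_of("CYP335A1")` returns `None` until they are added.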
CYP2 clan (11 seqs)
>AmGroup13.1a
(71904-74217) plus, 524 aa, 5 exons (71904-72236, 72653-72774, 72894-74323, 73404-73668, 73794-74217)
60% to CYP18A1m = CYP18A1 honeybee
MGGTRIEVLCTFLVFLGVLLVARCLQWLRYVRSLPPGPWGVPVFGYLPFLKGDVHLRYG
ELAKKYGPMFSARLGTQLVVVLSDHRTIRDTFRREEFTGRPHTEFINILGGYG(1)
IINTEGAMWKDQRKFLHDKLRGFGMTYMGGGKKIMESRIM(0)
REVKTFLRGLASKRGTPTDVSASLGMSISNVICSIIMGVRFQHGDARFKRFMDLIEEGFKLFGSMAAVNFIPVMR
YLPCLQKVRNKLAENRAEMAGFFQETVDQHRATFDEGTMRDLVDAYLLEIEKAKGEGRATTLFQGKNHD(1)
RQMQQILGDLFSAGMETVKTTLEWAIILMLHHPDAAIAVQEELDQVVGKSRMP
VLEDLPFLPITEATILEVLRRSSVVPLGTTHATTR(2)
DVTLHGYTIPAGSQVVPLLHAVHMDPELWEKPEEFRPSRFLSAEGKVQKPEYFMPFGVGRRMCLG
DVLARMELFLFFSSLMHTFELRSPQGSSLPSLRGNAGVTVTPDPFDVCLLPRNLDLIEDNDMISTGAILRNIGSH*
>CYP306A1 AmGroup13.1b
(76876-78884) minus, 499 aa, 7 exons (78884-78557, 78493-78345, 78209-78007, 77901-77595, 77538-77383, 77298-77123, 77057-76876)
AADG02005913.1
46% to 306A1 = CYP306A1. Cyan region does not match; CYP306s are longer in this region. Check for frameshift. Look for YEPECILEH. In 18clan/2clan.
CYP306B1 may be the ortholog of CYP306A1 in Diptera (flies and mosquitoes)
MILDHYIAIFVLPFLLLLYVVRKNRKARRLPPGPWQLPLLGYLPWIDAEKPHETLTR
LSRVYGPVCGFRMGSVYTVLLSDPQLIRQSFAKDSITNRAPLYLTHGIMKGYG(1)
IICAEGEQWKDQRKFISNCLRNFGMVKHEGAKRDKMEERISDAVNECVS(0)
VLRDRGANGPIDPLDTLHHCLGNLVNSIVFGKTYEEEDRIWKWLRHLQEEGVKQIGVAGPLNFLPFLR(2)
FLPQYGRVIRSIVDGKDKTHEIYRQILD EHRARVDSGNGCKIDSFLAAFDEQMRKK
DGAESGYFTEP
QLYHLLADLFGAGTDTTLTTLRWFLLFMAAHPMEQ(0)
EKIQSEMDLCLREGEQPTLNDRIVMPRLEAAIAEVQRIRSVTPLGIPHGTSE(0)
DVEIGGYDIPCGAMIVPMQWAIHTDPAYWRDPLEFRPDRFLSEDGTFFKPESFLPFQNG(1)
KRVCVGEELARMILFLFAGRILRAFSVRVPAGEIADLEGECGITLVPKPHRLAFVGRDR*
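The exon coordinates in a header can be checked against the stated protein length. The sketch below assumes that the listed exons are coding sequence only and that the last exon includes a 3 nt stop codon; neither is stated on this page. The function names are hypothetical.

```python
def exon_lengths(exons):
    """Length in nucleotides of each (start, end) exon, inclusive, on either strand."""
    return [abs(end - start) + 1 for start, end in exons]


def coding_length_check(exons, aa_count):
    """Return (summed exon length, expected length, difference).

    Expected length assumes 3 nt per residue plus a 3 nt stop codon.
    """
    total = sum(exon_lengths(exons))
    expected = 3 * (aa_count + 1)
    return total, expected, total - expected


# Exon coordinates of CYP306A1 (minus strand), copied from its header above.
CYP306A1_EXONS = [
    (78884, 78557), (78493, 78345), (78209, 78007), (77901, 77595),
    (77538, 77383), (77298, 77123), (77057, 76876),
]
```

`coding_length_check(CYP306A1_EXONS, 499)` gives 1501 nt against an expected 1500 nt. A nonzero difference only flags an entry for a closer look, for example alongside the frameshift note in the header; it does not say where the problem lies.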
>AmGroup14.9 (18918-22680) minus, 508 aa, 3 exons (22680-22278, 20154-19363, 19246-18921)
CYP307B1 55% to 307B1 Anopheles, probably the ortholog of 307B1
MIPLTATTCFLIAITFLALALILLDHLRSKKTTKSVVPGDDDQHALPEPPGPKPWPILGSLHILGRYD
VPYKAFADLVRDFDCQVIKLRMGSVPCVVVNGLENIKEVLTVKGHHFDSRPNFARYHLLFGGNKENS(1)
LAFCNWSDVQKARREMLRAHTFPRAFSTRFNELNGIIGDEMEFMVNHLDSLSGTSVHAKPLILHCCANIFI
TYLCSKNFHLEHDGFRNMVENFDKVFFEVNQGYAADFLPFLMPLHHRNMARMAHWSHEIRRFVIKNIIADR
VNSWNDVVPEKDYVDCLINHVKSGTEPQMSWNTALFVMEDIIGGHTAIGNLLVKVLGFLATRPEIQRLAQD
EIDALGLAGNFVGLENRRSLPYVEAIILETIRIIASPIVPHVANQDSSIAG(1)
FRIKKDTFIFLNNYDLNMSTDLWTSPEEFMPDRFVQNGRLLKPEHFLPFGGGRRSCMGYKLVQYVSFAILA
SILKNFTITPVQKEDYTIPIGNLALPEMTYKFRFERR*
>CYP303A1 AmGroupUn.2253 incomplete on N-terminus
(55-1889) plus, 382 aa, 8 exons
43% to 303A1m, 44% to 303A1 anoph. CYP303A1 in 2 clan/18 clan
may be the ortholog of 303As
(1)LLLVDGNLWNEQRRFVLKHLRDFGFGRQS(1)
LYMNANEYTGNNVTQSQLGTIISMHNIFGITVLNSLWKMLAGKRL(2)
YNIDDKELIYFQRILSITLNEIDMLGAPFSHFPLLRFIAPEISGYKSFVKIHEELWKFFK(0)
DEVNNHKNTFNSDSPGNLIDIYLTILNSENYGKTFSD(1)
VSEPQLVAICVDLFMAGSETTSKVLGFCFLYLVLFPHVQKKAHEEIDRVIGRNKLPTAEDKAK(2)
MTYMNAIVLESLRMFAGRSLNLPHRVQRDTKISDYKIPK(0)
NTIIITNFNGILMDESWGDPENFRPERFIDGSGNIVTPSRFLPFSAG(1)
KHRCMGENLAKTNIFIIATTLLQAFTFSEIPGEKPTIEHFIDGTTISPKPYRVNVSLRI*
>AmGroupUn.6110
incomplete on both ends (1-622) minus, 126 aa, 2 exons
39% to 303A1
LDEQLMMILIDLFLAGFTTTSTTLDFLFLIVTLFPDVQRKVQKEIDSVIPYDRLPNMEDKAK(2)LPYVEAVISETYRLWPVFPIIGPRRVLCDTNIDKYVIPKDTTILFNTYSINKDPTLYPDPDKFM
>AmGroupUn.7901
incomplete on C-terminus (1-459) minus, 126 aa, 2 exons
40% to 15B1
MWFVILCFVIVLIKILFDYSRPINFPPG(1)
PRGLPFIGNILDIIRLINETKYYSDTWCRLAEKYGSVVGLRLGLDQPLIIVSGKSAVTEMLNRSEFDGRPSGFLYKYRCGGMQQGILFTDTDVWHSQR
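The parenthesized digits in the sequences sit at exon boundaries. This sketch reads them as intron phases, an interpretation the page itself does not spell out. It strips them, joins the sequence lines and counts residues so the result can be compared with the aa count in the header. The function name is hypothetical.

```python
import re

PHASE_MARKER = re.compile(r"\((\d)\)")


def parse_entry_sequence(lines):
    """Join sequence lines; return (residues, marker digits in order of appearance)."""
    text = "".join(line.strip() for line in lines)
    phases = [int(d) for d in PHASE_MARKER.findall(text)]
    residues = PHASE_MARKER.sub("", text)
    # Keep uppercase letters only: drops the stop '*', '?' and stray spaces.
    # Note that runs of 'X' used as gap placeholders still count as residues.
    residues = re.sub(r"[^A-Z]", "", residues)
    return residues, phases


# Sequence lines of the AmGroupUn.7901 entry above (header states 126 aa, 2 exons).
UN7901_LINES = [
    "MWFVILCFVIVLIKILFDYSRPINFPPG(1)",
    "PRGLPFIGNILDIIRLINETKYYSDTWCRLAEKYGSVVGLRLGLDQPLIIVSGKSAVTEMLNRSEFDGRPSGFLYKYRCGGMQQGILFTDTDVWHSQR",
]
```

For this entry the residue count matches the 126 aa in the header, and the single marker matches the 2 exons.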
>AmGroupUn.10970
incomplete on N-terminus (112-677) plus, 121 aa, 2 exons
probable 18clan/2clan, 41% to 15B1, 40% to 304B1, 41% to 305A2
(0)DTTVLLDFHSAHNDPAYWDHPEEFRPQRFLDANGRFCQNNANIPFGLA(1)
IPFFLGKRRCPGEMLARTSLFLYFAYVIHYFDIEISPEHGKPDLNGHDGFTISPKSYYLKITARSDVTNCSTI*
>AmGroupUn.8493
incomplete on both ends (509-1028) minus, 141 aa, 2 exons
39% to 304A1
(1)LPRLPIIGSYWHLLWHDYEYPYNGIIHYVNKLQSKIVTCYFG
SHKTIIANDYKSIKEVLTKQEFNGRPINVDIVLQRAFGKSLG(1)
IFFTEGTLWHEQRRFALRHMRDFGFGRRHEIFETNVMEEIAILVDMLKEGPINDEEK(0)
>AmGroupUn.897
incomplete, missing exon 5 (51606-53773) plus, 357 aa, 5 exons
37% to 305a1m, 35% to 303A1m
MLYVVISLLLALYCIFCIYDCVKPHNFPPG(1)
PKWLPLIGCFLTFRRLKLKHKYTYVAFQELSKTYGPILGLKL
GSQKLVVISTHDLVKKVLLQDEFNGRPDGFFFRVRAFGKRKG(1)
ILFTEGSMWSQCRRFTMRHLRSFGLGQSTMEKYLTVEAENLVNYL
RRVSTKGPVPMHTAFDIAVLNSLWCMFAGHRFDYENEKLAEILEIVHDSFR(2)
LMDTMGGIISQMPFLRFIIPELSGYNNLMEILRKLWNFLDEEINNHEKHLSGNQPQDLIEAFLLEISSRNGVQNDSIFDS(1)???
(1)KRRCPGEMLARTSLFLYFAYVIHYFDIEISPEHGKPDLNGHDGFTISPKSYYLKITARSDVTNCSTI*
>AmGroupUn.7452
incomplete on both ends (700-954) minus, 85 aa, 1 exon
no good match, 38% to 305A2
(1)PFSWPFIGNQILLKRLSRKFGGQHKAFMELSKRYNSDIITVNISY
EKIIVVSGSKFCDMILQNEEFQGRPWNEFIKVRNMGKKQG(1)
>AmGroupUn.960
incomplete on both ends (8311-11562) minus, 147 aa, 2 exons
42% to 305a3
(1)NQLLYIIKDLFSAGVDTTNSTIGFIIAFLVVHQDVQSKVYDEISRVIDKDIYPSLSDKDR(2)LPYLKAVIAEVSRLANIGPTSIPHRAVKDSTFLGFEIKKNYTLLANFKSIHMDKEHWGDPEIFRPERFINEKGDFINDSWLMPFGLG(1)
>BI513047 EST 50% to 305A1ps C-term
2 FFFLQFGKGKRRCPGDILAKATIFILFVGIMQKYTLLPVPGKGPHSIKINSGITLTPQPYNVLVEKR* 205
CYP3 clan (36 seqs)
>CYP6AQ1 AmGroup12.14 (315628-318450) plus, 500 aa, 5 exons (315628-316108, 316238-316604, 316677-316861, 316989-317277, 318272-318450)
45% to 6K1, 42% to 6g2m, 43% to 6G1ps, new subfamily in CYP6
cyan = missing seq. from EST BE844578
yellow = EST BE844462, underlined seq = EST BE844394, green = EST BE844353
magenta = EST BE844352, gray = EST BE844331; all ESTs from antennae
MNLLTPYWSLDILIVSSSLMIAVYLYASWKLKYWSRRGIMQITPSPLFGNFKKCILFQKSVSEIIRELYGQNEGLPFMGFY
IFYKPFFLVRDIELVKHILVKDFNTFANKHTSADSKNDRIGYSNLFIIKNPAWKYLRGKLTSVFTSGKLKKMFDLMLIIG(1)
KNLEKHLELLNLDG
NGKEVELKDLCANFTTDLIGTTAFGVNLNSLKDPNSDFRENGRLVFDYNLKRAFEFFSIFFFPNLS
KYVSIKFFGKATDYFRNSFWSVINQRIESNVKRNDLIDCLIELREKHKNDESFEGFR(1)
FDGDDLVSQAAIFFTGGFETSSTTISFTLYELALNKDIQKTVRTEIHEALAQTDGKITYDM(0)
ITNLPYLDMVVSETLRKYPPLGFLDRVALHDYKIPNSDVTIDKDTPVIIPMIAFHYD
PKYFPNPEKYDPLRFSEEVKKTRPSYVYMPFGEGPHICIG(1)
MRLGLLQSKLGIIEILKDYEVSPCEKTKIPMVLDPKGLTTTALGGLYLNIRKITIAAG*
>CYP6AR1 AmGroupUn.19 (44801-47550) plus, 502 aa, 5 exons
50% to AmGroupUn.5496, 47% to AmGroupUn.792b, 38% to 6a13ps, all best hits to 6as
probable new subfamily in CYP6
MSWLMIETVGLIATVFFLLYYYSMSKLDYWRKRGVKGPKPLPFLGNFKDVLLAKESTMDCFERAYKEFKDEPMVGMYGSHEPLLILRDLDLIKDVLIKDFNKFAQRTQGAIRE(0)VEPLSEQLFRLDAERWRPLRLKLSSFFSSGKLKEMFHLFVECSDNFEKYLEKMVEKGGLVECRDAAAKFSTDVIGACAFSIHTNALTDENSQFRKMGKQALATNLQQFLNDRLREYPFLFKIFGRFFVDHEVTNFFANSIKDAMDYRIQNNVHLRDVIDILADIRENPTKCGLKE(1)ADNLFLTSQAVLFFLAGFENASLTISNALYELAWKPEIQEKARAEIVNVLQKYDGKITYDGLEEMKYLEACIFE(1)TLRMYPVLQWLSREAMETYTFTGTKVTIPKGQQVFLPIYAIQRDPDIYPNPDNFDPERFTDDKIKTRHSMTHLPFGDGPRHCSG(1)IRLAKKQLKVGLVTVLSKFKVEVCEKTRKIYQKDKKPLFLLQPVDGIHLKISKVSV*
>CYP6AS1 AmGroupUn.5496 (264-2651) plus, 498 aa, 5 exons
44% to 6A14, 64% to AmGroupUn.792b
MDYFQILCAISIVILTIYYYYSSKYTFWKKRGISGPKPIIFFGNFVDSIIQKRSTSEAVKKWYDDYKHESVFGIFGGTTPLLVINDLDMIKDVLIRDFSLFVDRGFHIFPK(0)IEPLSEHLFLLEAERWRPMRMKLSPIFTSGKLKEM
FFLIMESAGNLEKYLDEVIKKDEMVECRELAAKFMTDVIGSCAFGINTNSLLE
EDSEFRRMGKKISTPNLKVMLGNICKEFFPPLYEIVGSIF
TLKDVNEFFINLVSDTMKYRKDNNIIRSDFINMLMQLKEHPEKMENIE(1)
LTNTLLTAQAVVFFVAGFETSSSTMAFSLYELAQNQEIQDKLREEIRNMHEKNKGVLTYTDVKEMKYLDKVFKE(1)TLRKYPILPMLFRQAMENYTFKDTKITIPKGMKLWVPVHGIHHDPNIYPEPEVFDPERFEDDAFASRHPMSYLPFGDGPRNCIG(1)ARFAHYQSKVGLITILRHHKVNVCEKTTIPFKADERSFLLTLKGGVHLKITKI*
>CYP6AS2 AmGroupUn.792a incomplete on C-terminus of exon 2, missing 3rd exon
(8375-10702) minus, 356 aa, 5 exons; cyan is from EST BE844607 from antennae
86% to AmGroupUn.5496
MDYFQILCAISIVILTIYYYYSLKYAFWKDRGISGPKPIIFFGNL?NSIIKKKSLSETVKKWYDDYKHESVFGIYEGTIPVLVINDLDMIKDVLIRDFSIFVDRGFHTFPK(0)IEPLTQHLFLLEAERWRPMRMKLSPIFTSRKLKEM???(1)
gap of 53 aa
EDSEFRRMGQE 511
509
IFSPSLKLMIGNTCKVFFPSLYEVIGNIFTMKDVDEFFINLVSDTMKYRKDND 351
350 IVRSDFINMLMQLKEHPEKMDNIE
LTDTLLTAQAVVFFIAGFETSSSTIAFGLYELAQNQEIQDKLREEIRKMHEKNKGILTYTDIKEMKYLDKVFKE(1)
TLRKYPILSTLSRKAMEN
YTFKGTKITIPKGTKVWVPVYGIQHDPNIYPKPEVFDPERFEDDAFASRHPMSYLPFGDGPRNCIG(1)ARFAHYQSKVGLITILRNHKVNVCEKTTIPFKADERSFLLALKGGVHLKITKI*
BE844607 EST = 100% to AmGroupUn.792a, 83% to AmGroupUn.5496
543 EDSEFRRMGQE 511
509 IFSPSLKLMIGNTCKVFFPSLYEVIGNIFTMKDVDEFFINLVSDTMKYRKDND 351
350 IVRSDFINMLMQLKEHPEKMDNIE
LTDTLLTAQAVVFFIAGFETSSSTIAFGLYELAQNQ 171
170 EIQDKLREEIRKMHEKNKGILTYTDIKEMKYLDKVFKETLRKYPILSTLSRKAMEN 3
>CYP6AS3 AmGroupUn.792b (3396-5727) minus, 499 aa, 5 exons
64% to AmGroupUn.5496
MDYFQLLCVIGALLFAIYYYLTLTFDTWKNRGIPGPKPTIFFGNFQEVILKKISLAEKTKQLYQEYKNELVFGIFQGRTPILVINDLEMIKDVLIRDFSVFPDRGIHVNPK(0)VEPIFQTLFSLKSKTWRPLRMKLSPVFTSGKLKDMFPLILDCAKNLEEFVEKVRNSGEPVDCRDMAAKFTTDVIGSCAFGVCMNSLSPEGSEFRRMGEQLGKFSFKKLARDFTRLYMPFLFDIIGGYLQSHEVNNFFINLIRDSIKYRQENNVYRPDFVNTLKELKEHPEKLENIE(1)LTDALLTSQALVFFLAGFETSSTTISNALYELAQNPEMQDKLRKEIKEVYENNGGALSYTDVKEMKYLDKVFKE(1)TLRKYPVLAALSRQATENYTFKDTKIKISKGTRIWIPVYGIQHDPNIYPEPEVFDPERFEDDAFTSRHPMTYLPFGDGPRNCIG(1)ARFAHYQSKVGLITILRDNKVEVCAKTLIPYKSEPRNILMIPKGGKVELGITKV*
>CYP6AS4 AmGroupUn.1753 incomplete on C-terminus of exon 1 (2509-4596) minus, 411 aa, 5 exons
80% to AmGroupUn.42b, 58% to AmGroupUn.792b, 41% to 6a17, 44% to 6a13, new subfamily in CYP6?
MLHHFHILTAFVAIFLALYYYL(0)AELFSVNLFSVDATRWRPLRMRLSPVFTSGKLKEMFPLILECAEHLEQCLEDAVKRGGPVDCFEIPARYTTDVIGSCAFGINMNALSDERSEFRKMGRNMFDQNMIKFTRNLLRDFFPRFYNLLGFVLPYTESTVFMTKLIKGTIKYREENDVVRPDFVNLLMELKKHPEKLKNIE(1)ITDTLLAAQASVFFAAGFETSSTTMAHALYEMALNPDIQDKLRNEMKEFHAKNNGNLKYEDIKEMKYLDKVFRE(1)TLRKYPPGMLLRRKCNSNYTFHGTKVSIPAGTSVIIPLYAIQIDPKFYENPDVFDPERFNEDAVAARHPMTYLPFGDGPRNCVG(1)ARFAVYQTKVGLIKILQNFRVDVCEKTMIPYVKKINSITLAPRDGIFLKIEKITD*
>CYP6AS5 AmGroupUn.42a pseudogene? stop codon in exon 4 before heme binding site and missing exon 5 (126603-129298) minus, 439 aa, 4 exons
56% to AmGroupUn.5496, 43% to 6M2, new subfamily in CYP6?
MASSFEILCGIAVLFLALYYYLTSTFDFWKSRGVVGPKPVPFFGTTKDLILVKKSTAHFVKDIYEKYKNEPMVGLYATRSPFLLLNDPELIKDILIRDFSKFANRGLGVFER(0)TEPLSPHLLNLEVERWRPLRSRLSPIFTSGKLKEMFYLIIECSLNLEMYLDKLIEKNEPIECRELTARFTTDVIGSCAFGIDMSSMTNENSEFRRMGREVFAVNFMNVMRMKLKQFMPRLYDLLGYVMPDRTFAPFFTRVVTDTIKYRNDNNIVRPDFINMLMELQKNPQKLENIK(1)LTDSLIAAQAFVFFLAGFETSSTTMSNALYELALNQDVQKKLREEINTFCPQNNKELKYDDKEMEYLDKVFKE(1)TLRMYPPASILMRKAISDYTFNDTKITIPKEMKIWIPAFAIHRDSVINPNPNSFDPERFDKDAMASRHPMHYLPFGDG*
>CYP6AS6 AmGroupUn.42b alternate splice for exon 3? (121844-124324) minus, 497 aa, 5 exons
41% to 6a17, best matches all CYP6As, 80% to AmGroupUn.1753
MFDYFQILIAFVASFLALYYYLTSNFDFWKNRNVVAPKPIPFFGNTKDVVLKKIEISNFIAELYKKYENEAMFGIFFGGSPNLILRDLDLIKDVLIKDFSTFDERGFKISER(0)ADPLNANLFNMDVTRWRPLRIKLSPVFTSGKLKEMFPLILKCAERLEQCLEDAVKRGGPVDCFEISARYTTDVIGSCAFGINMNALSDERSEFRRIGKRIFDLDKNILRSFLRQFFPRFYNLLGFVIPYSETSKFVTKFISEMIKYREENNVVKADFVNLLMELKKHPEKLQNIK(1)ITDNLLAAQAFVFFAAGFETSSTTMAHALYEMALNPNIQDKLRKEIKEFYANNNFTYEEVKKMKYLDKVFKE(1)TLRKYPPGVFLKRKCNSNYTFKGTKVSIPAGTSVIIPVYSIQTDPKFYENPDVFDPERFNEDAVAARHPMTYLPFGDGPRKCIG(1)IRFGVYQTKVGLIKMLKNFKVNVCEKTMIPYIKKINSFTLAPKDGIFLKVEKNN*
>CYP6AS7 AmGroupUn.42c incomplete on N-terminus
(115419-117055) minus, 265 aa, 3 exons
45% to 6M2, 66% to AmGroupUn.1753
VTKFLTNIIVSTMKYRQENNIVRPDFVNMLIELKKHPDKLENIK(1)LTDTLLTAQAFVFFIAGFETSSSAISNALYELALNPEVQNKLRQEIKEYFNKHNELKYEYIKNMIYLDLVFRE(1)TLRKYPPGPLILRKSITNYTFNNTKVSIPEESFVWIPLYAIHHDPKIYPNPDAFIPERFNDDAIATRHPMHYLPFGDGPRNCIG(1)ARFAVYQSKIGLITILWNYKVEVCDKTMIPYEINPAAFLLTPKGGIYLKFTKIKNNEEILN*
>CYP6AS8 AmGroupUn.4533 (10741-13406) minus, 500 aa, 5 exons
53% to AmGroupUn.2631, 42% to 6P4, 48% to 6N1 partial
MYISLEIFCGIVVALIALYYYLTVNNNFWKNRGIAGPEPVLGFGNMKKVLLGKESMSQFLTKIYHEYKNEPIIGIFTTRTPQLIIKDPDLIKTILIKDFSKIMNRGLLPMVS(0)
GEPISQHLFNIEAERWRPLRIHLTPVFTANKLRGMFSLILECSMHFVSYVDSLVKKGEPVNVREVAARFTTDVVGSCGFGVEMNSLSEKESEFRRVGKSVFATNYARIIKHRIREFMPRLYNYILYLWPTDEMAEKIIKLTRERLEYREKNNLFRPDFMNILLDLKKHPEKIGLD(1)VTNEFLAAQAFIFFVAGFETSSSTISNALYELALNPDVQDKLRKEIKEFAAKNDGEWRYETIKEMEYLGKVFQE(1)TLRKYPSLPFLTRELIEDYTFESNKVTIPKGLKIWIPTYAIHNDPDIYPDPDKFDPERFSDDKIKQRHPMHFLPFGHGPRNCIG(1)ARFAIYQTKIGLINILRNFKLDVCDKTLIPYKHHPRGLLLMPLTDLYLKITRLTN*
>AmGroupUn.4532
incomplete on both ends (1-757) minus, 131 aa, 2 exons
83% to AmGroupUn.4533
LAAQVFIFFAAGFETSSTLISNALYELALNPNIQDKLRKEIKKFESQNDEEWKYETIRNMDYLEKVIQE(1)TLRKYPPVPFLNRELIDDYTFESNKVTIPKGLKIWIPTYAIHNDPDIYPDPDKFDPERFSED
>CYP6AS9 AmGroupUn.2792 incomplete on N-terminus, pseudogene??? heme binding site probably not functional due to change in splice donor
(995-3268) minus, 367 aa, 4 exons
65% to AmGroupUn.4533
(1)IISSPVFTSGKLKGTFAQILNCSNDLVTHIDTLSKMEDSINMREVAAKFTTDVIGSCVFGIKMNSLSEIENEFRRIGRNIFATNFTNILKIRLLQSTPFLHSLLCKILPDEEMEMFMRITKETMEYREKNNLVRPDFMNILLELKKHPERIADIE(1)LTDDLLAAQAYIFFAAGFETSSTTISNALYELALNHDMQDKLREEIKEFEAKNDGEWRYETIKEMQYLGKIFQE(1)TLRKYPALSFLSRESIIDDYTFRDTKLTISKGTLVWIPVFPIHHDPNIYPEPEKFDPERFTEDKIKERNLMHYFPFGHGPRNC(1)ARFAIYQTKIGLIKILRNYKVDVCSKTLIPYKYDPFSFILVPLGGLYLKITRIQD*
>AmGroupUn.4458
incomplete on both ends (1-1218) plus, 203 aa, 2 exons
3 aa diffs to AmGroupUn.2792, 61% to AmGroupUn.4533
VFTSGKLKGTFAQILNCSNDLVTHIDTLLKMEDSINMREVAAKFTTDVIGSCVFGIKMNSLSEIENEFRRIGRNIFATNFTNILKIRLLQSTPLLHSLLCKILPDEEMEMFMRITKETMEYREKNNLVRPDFMNILLELKKHPERIADIE(1)ITDDLLAAQAYIFFAAGFETSSTTISNALYELALNHDMQDKLREEIKEFEAKN
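Difference counts such as "3 aa diffs" can be reproduced for fragments that are already aligned without gaps. This is a minimal sketch under that assumption. It is not how the percent identities on this page were computed, which the page does not say, and the function names are hypothetical.

```python
def count_differences(seq_a, seq_b):
    """Count mismatched positions between two equal-length, gap-free aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for a, b in zip(seq_a, seq_b) if a != b)


def percent_identity(seq_a, seq_b):
    """Percent of identical positions over the aligned length."""
    matches = len(seq_a) - count_differences(seq_a, seq_b)
    return 100.0 * matches / len(seq_a)


# Short matching excerpts from AmGroupUn.2792 and AmGroupUn.4458 above,
# covering one of the positions where the two entries differ.
FRAG_2792 = "HIDTLSKMEDSINMREVAAKF"
FRAG_4458 = "HIDTLLKMEDSINMREVAAKF"
```

Here `count_differences(FRAG_2792, FRAG_4458)` is 1. Comparing whole entries needs a real alignment first, since the two sequences start at different positions.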
>CYP6AS10 AmGroupUn.2631 incomplete on C-terminus of exon 1 (2686-5804) minus, 495 aa, 5 exons
40% to 6M2, new subfam in CYP6? 53% to AmGroupUn.4533
MAAFEILCGFIIFIFAFYYYLIKPQEYWKNRGVPGPKPIPIFGNFFRLTFARISIGDLMTKFYKEYKHEPVFGLYMRNVRVLAINNPDLIKTVLIKDFSKFAHRGLA???(0)TEPLSQHLFVLEPKRWRPLRTKLSPIFTSGKLKDMFSLIIECSNTLENYVEHLISKNDRVEVRDLAAKFTTDVIGSCGFGVDMNAMSDVQCKFRDIGREFFGPSFKQILKIRLRENLPRLYTFLGYILPRDETTTFFTNVVLDMIKYRKTNDIYRPDFINALINIQNHPEKLDIE(1)LTEPLLVAQAFLFFVAGFETSSLTIATALYELAQNQDIQDKLRDEITEHHKLNNGEWQYENIKNMPYLDAVFKE(1)TLRKYVPLTVLMRQSLEDYTFESINLTIPKDTRIFIPIYAIHRDPDIYPNPEVFDINRFSKEAEATRHPMHYLPFGDGPRNCIG(1)ARFAIFQTKIGLIKILRTYKVDVCNETQIPFINEPRTFTLAPKHDLTLKITKIEN*
>AmGroupUn.248a
incomplete on C-terminus (38475-40756) plus, 338 aa, 4 exons
49% to 6Aa14, 73% to AmGroupUn.4533
MYIGLEILCGIVITLIAFYCYLTINNNYWKNRGIPGPKPVPGFGNMKNVIFGKESVSQFLTRMYNEYKDEPMIGVFSKRTPVLIVKDVDLIKTILIKEFPKFANRGLFPIFS(0)
110 aa gap here
ILSIRIQDMLPWLYNSFLYVLPRDEKTRIIMKLMTETMEYREENNVFRPDFINMLLNLKKHPEKIDIE(1)LTDDLLAAQIFIFFAAGFETSSSTISNALYELALNPDIQEKLRKEIKEFEARNNGEWRYEIMKEMEYLEKVFQE(1)TLRKYPSLPFLNRKLINDYTFESNNVTVSKDLKIWIPVYGIHHDPDIYPDPEKFDPERFSKEEIMKRHPMHFLPFGHGPRNCIG(1)
>AmGroupUn.248b
incomplete on C-terminus and N-terminus of exon 2
(43061-45682) plus, 245 aa, 4 exons
78% to AmGroupUn.4533
MYINLEIFCAIVIAFIAFYYYLTINNNFWKHRGISGPKPVLGFGNMKKIILGEESMSQYLTKLYHKYKNESMIGIFRLRTPALIIKDPDLIKIVLIKDFSKFMNRGLLPIIS(0)
sequence gap here, 165 aa
NIILELKKHPEKINID(1)ITNELLAAQIFIFFAAGFETSSTLISNALYELALNPNIQDKLREEIKKFESQNDEEWKYETIKKMDYLEKVIQE(1)TLRKYPPVPFLNRELIDDYTFESNKVTIPKGLKIWIPTYAIHN
>AmGroupUn.248c
incomplete on C-terminus (47350-47685) plus, 112 aa, 1 exon
94% to AmGroupUn.248b, only 6 aa diffs at N-term
MYIGFEIIYGIVIVFIAFYYYLTINNNFWKHRGISGPKPVLGFGNMKKIILGEESMSQYLTKLYHKYKNESMIGIFRLRTPALIIKDPDLIKIVLIKDFSKFMNRGLLPIIS(0)
>AmGroupUn.9652
incomplete on C-terminus (993-1236) plus, 81 aa, 1 exon
72% to AmGroupUn.248a
MNISLEILCGIIVALIVFYYYLIINNNFWKNRKISGPKPVIGFGNMLSIILGKESTSQFLTRIYNEYKNEPMIGIFSKNNP
>AmGroupUn.8460
incomplete on both ends (123-685) plus, 139 aa, 2 exons
71% to AmGroupUn.4533
(1)TLRKYPVVPFLNRELISDYTFENSKITIPKGLKIWIPVYGIHHDPDIYPNPEKFDPERFSEDKIKERHSMHYLPFGHGPRNCIG(1)SRFGTYQTKIGLVKIIRKYKVEICDKTLIPYKFNSFANFLMPSTGLYLMITDVEN*
>AmGroupUn.8178
incomplete on both ends (801-1322) minus, 174 aa, 1 exon
64% to AmGroupUn.4533
(1)PLSQNLFGLEVERWRPLRIHFSPIFTTNKLKGLCSLILECSEQLEKYMDILIRKGEPLDIREIAARFTTDVIGSCAFGIEMNSLSENESEFRRLGKGVFNTTFRRIVKTRIRNLMPWLYNFFLRILPWDEITKKIVKLTTETIEYRNKNNIVRSDFINVLLNLKKHPEKIAEIG(1)
>AmGroupUn.8712
incomplete on N-terminus (13-1433) minus, 267 aa, 4 exons
60% to AmGroupUn.42c
DLLGPLVPEREVTPFFIKVVTDAMKYKKESNVFRPDFIDTLMKLRDDPESLSDIE(1)LTDAFLTAQAYVFFAAGFETGASTISNTLYELAQNQGMQDRLREEIREHCDKYGGELMYENIKEMEYLDKVFKE(1)TLRKYPPGTLIPRRSVSEYTFKNTNVTIPKGTMIWIPAFPIHRDPNIYPNPDDFNPENFTEDAINNRHPMNYLAFSNGPRNCIG(1)ARFANYQVKIGLIMILRNYKVEVCEKTVIPYQFDPNLFLLGPKGGIYLRVTKVE*
>AmGroupUn.2707
incomplete on N-terminus and probably missing exon 3
(1-3597) plus, 368 aa, 4 exons
59% to AmGroupUn.4533
LTRVYNEFKNEALIGVFMKTYPALVVKDPDLIKDIMIKDFYKFPNRGFPKSDS(0)ADPLTQHLFLVEEEKWRPLRTQLSPVFSTGKLRGTFTQILDCSNHLVTYMDKLVEIGEPIDVREVTAKFTTDVIGSCVFGIKMNSLSGKESEFRRFGRQIFAMNFLKILRLRIKQFLPMLHYLLVRILPPDEETKIMLKLTRDTFKFREAHNIVRPDFMNILMELKKHPEKVPSLG(1)???
70 aa gap here
(1)TLRKYPVLPYLSRRSIEDYTFEGTKVSIPKNTLICIPVYPIHHDSSIYPNPEKFDPERFSEDEVKKRHSMHYFPFGHGPRNCIG(1)LRFAIYQSKIALIKILSNYKIEICDKTLIPYKYDPFSFISLPLTGIFLKITKLQN*
>AmGroupUn.2162
incomplete on both ends (1-852) plus, 223 aa, 3 exons
39% to 6g1 aa175-381, probable new subfamily in CYP6
NSMNKYLDDEFSHDTKTKTIMIKDVTLKYTTNVISSVAFGIQVNSFNPKTIQFYEEG(1)LKTTFSRSMQLFISFFFPKLSPYLNTRMLGSSTNFFRKVFWNSMDNREITKTKREDLIDSLIELKNSKQDKDFK(1)FEGDALLSQSAIFFIAGRETSISIICLTLYELAKHPEIQKRTREEINEKLKEHGMTYEGVQSMKYLHQVVSEILRIYPPTPIIDRVAVADYK(0)
>AmGroupUn.2637
incomplete on C-terminus (7-2478) plus, 330 aa, 4 exons
45% to 6P4, new subfamily in CYP6? 46% to 6F1
MLYKPYLIVNDPNLIRDILTKEFTNFHDRGIFYNEEVDPLSGHLFQLPGKKWRNLRVKLTPTFTSGKIKQFFPILNEAGNILAKYLEEEARKGSTID(0)ICSRSIIFSRYSTDIIMSVAFGISCDSFKEPNNEFRYWGKKIFDPKPLWNALILFAPQILNFFSISYTEKSVTKFFTNMFKQTVKYRESNNIERKDFLNLLIQLMKNGYVDADDESLSNNVNAAK(1)NKLTMMEAAAQAYVFFLAGFETSSTTVTFCLYELAKNQDIQNKVREEIQTMIKKNGDLTYNALNDMNYLHKVISE(1)TLRKYPPVVILNRICTNDVKLSTTDFCIPKGTC???
>AmGroupUn.10510
incomplete on both ends (122-647) minus, 176 aa, 1 exon
54% to AmGroupUn.42a, 39% to 6a16m; all best hits = CYP6As from aa107-274
probably a new CYP6 subfam or CYP6a
(0)VEPLSQHLFNLEPKRWRPLRSKLSPMFTSKKLKEMFGLILECGRHFEKYVDGLAARRQPVDFCEVAAKYTTDVIGSCAFGINMNAMSSEGSEFREAGRKIFEPTWNSIIRLKFKITMPTLYDLLGPLVPEREVTPFFIKVVTDAMKYRKERNVFRSDFIDTLMKLRDDPESLSDIG(1)
>AmGroupUn.6966
incomplete on C-terminus (878-1100) minus, 72 aa, 1 exon
40% to 28A5 N-term, 38% to 6a20, 57% to AmGroupUn.42a
MAYVEILCVGIIVSMLAFYYYLTSAFNFWKIRGIPGPKPKFLFGNIRDIILSRISTPAFIKNVCDTYTNEPM(0)
>AmGroupUn.4500
incomplete on N-terminus (298-1025) minus, 84 aa, 2 exons
52% to AmGroupUn.5496, 53% to 6AM1
AIFNPERFTEENKRTRHPYAYLPFGEGPRNCIG(1)MRFALLQIKVGIISFLRNHRVETCQKTITPIKFSRRSLVTTSEKGFWLRIK*
>AmGroupUn.5730
incomplete on both ends (990-741) minus, 84 aa, 1 exon
54% to 6d4m, 60% to AmGroupUn.8460, 60% to 6AM1
(1)TLRKYPPVVILNRICTNDVKLSTTDFCIPKGTCIAIPVFGLHRDSNIFPNPEKFDPERFSEENIKTRHPYVYLPFGEGPRICIG(1)
New families in the 3 clan
>CYP335A1 AmGroup14.7a (400127-401674) minus, 515 aa, 1 exon
40% to 9f2m, new subfamily in CYP9, 63% to AmGroup14.7b, 39% to 9D1, 38% to 9K1, 50% to 9J8 partial
MEVSHLTTFELLLLTLIFIILAKLVSILYTQFTYWKRNKVPYIRSSPLFGTAWRVFFRLVSFPNYCKYIYNYYPDARYVGVMDFATPTVIVRDPKLIKEIAVKNFDNFPDHRSFVTEEMDPVFGKNVFSLKGDRWREMRNTLSPSFTANKMRFMFDLVSKCSHDFVSCLHDRLESSSSEIEGKNLFTRYSNDVIATVAFGISVNSIEHPDNEFYRRGIDVSTFSGTFRFIKFMLFRLNPRLTRMAGFTFLSRATSKFFWRVISETVTARKRRGIVRPDMIHLLMQATDSKKKSIHQTMTIDDIVAQAFIFFLAGFDTTSTLMCYVVHELALHQDVQRRLREEVDRVLDDGTEISYEDTLGMEYLEMVISETLRMHPPTLLIDRQCAKEFHLPPAGPGYESVTIHPGENIWFPVLAIHRDPAHFPDPDKFDPERFNRENRNGIDPYTYIPFGVGPRKCIGNRFALMETKLLIIRLLRKFVIKPCERTMDPIVYKKGNFTLMPKDGFWVTFEKRNDH*
>CYP335A2 AmGroup14.7b (395285-396839) minus, 519 aa, 1 exon
39% to 9f2m, 63% to AmGroup14.7a, 40% to 9K1, 51% to 9E2 partial, 54% to 9J7 partial
MESPTLLFSFELLAIGLTAIVLAKFVSLLHHQYNYWRKRRVPHVGAVPVLGSSWRIFTRRMSLPNFCSLVYKHRPGSRYLGMMDCFTPVVVVRDPNLIKEIAVKNFDHFPDHHSFINEKIDPIFGKNVFSLKGDRWREMRNTLSPSFTASKMRFMFDLVSNCSEEFVRYLYDHPEFSSSIEAKDAFTRYTNDVIATVAFGISVNSMENRDNEFYTKGADATNFGGIFRLFKFMLFRVNPRLTRMAGLSFLSRGTATFFHRVVRETVRARDERRIVRPDMIHLLMQARDKEDRRPVATVDNRMTIDDITAQAFIFFLAGFDTSSTLMCYVAHELALNPPVQERLREEVDRFMDGGNGAITYEALLKMEYMDMVTSETLRKYPPIVFIDRLCVEKFELPPAEQGYDHLIVHPDNIVWFPVYGLHHDPKYFPEPEKFDPERFNDANKRNIVPYTYMPFGLGPRKCIGNRFALMETKILIAYMLRKFRIKRTEKTRASIEFSKTNFSLTPDHGFWIGLEKRDP*
>CYP335B1 AmGroup14.7c (305231-306757) minus, 510 aa, 1 exon
39% to 9E2, 35% to 9f2m, new family, 57% to AmGroup14.7d, 42% to 14.7a
MDYLQLGLTLLAILVAVYYLSTRNHKLLKRHGIVHIPPTPLFGNLGPLVRRKCHMEDVIQRVYDLDPDARYVGMYEFTTPLIIIRDPELIKTIGVKEITNFTNHRPFVDVGVDPMLGEVLFAMQGDRWREHRTMLTTLFTSSKIKSMFVLMSDCAKRFADYLSKVEREIELKSVLTRYTNDVIARCVYGVSVDSVNEPENIFYRYGQVASQLSTFKQNLMIFVHRNSPRLARLFNLKILPVHIEKFFHRLVMDTIETRRREGVHGLDMLQQLMDMQSRRKESEEGKRGMTVTDIANHAFSFFFGSVDTMATQISLISHMLAVNPDVQQRLQEEIDEVLSASEDKQVGYDVIQEMKYLDAVMSEAMRYHPILLFVDRVCGETFELPPALPGARPFKLERGMNIWFPVKAIHHDPKYFENPDRFDPDRFLRDGKGIASSGAYMPFGMGPRKCIGSRFALTEMKILLFNILAKCSFKVGSKTMVPLKFKEGVFNPVAKNGFWLKIERRENSCC*
>CYP335B2 AmGroup14.7d (252241-253839) plus, 532 aa, 1 exon
57% to AmGroup14.7c, same subfamily
MEFLSLALVLAAISIIAYYYCFVRKNFNLFQEHGILHVPPSPLVGNFGPLIRGKENVHDTIQRIYNIHPDAKYVGIFEFLTPVIMIRDLDLIKSITMKNFDQFPDHRPMFCKSVDPMLGEMLFIMDGERWKEHRNMLSPTFTSSKIKTMFVHMSECAKRFAHHLSKLPEKDRETEMKALLTRYTNDVIAACIYGVNVDSIKEPRNVFYMYGRVGATLIGLKKNLKIMVHRNMPWLANLLRLNILERHIAKFFTDLVVETVEERERNGTTNSDLIQLMMDTRNKKESGKKNLTVQNMANHAFSFFFGGFDTVSSQTCVLLHMLVENPEVQQRLQQEIDETLESNNGQLSYDVIQEMRYLDAVINEILRLHPIAVFIDRMCVKSFELPPALPGDVPFTVKPGMNVWIPVKAIHHDPRYYDEPEKFKPERFLDNGKNIIGSGAYFPFGIGPRICIGNRFALIEMKVLVCHILAVCDIKAGARTGIPLEFEKGVFNATAKTGFWLKIEPRKYSYHSGQINGLVNNHVINGACKTGI*
>CYP335B3 AmGroup14.7e (266474-268027) plus, 518 aa, 1 exon
58% to AmGroup14.7d, same subfamily; ESTs BI508364 BI516506 BI505081 BI505012
MDYLTISLSLITVFVAVYYLATRNNDFFKKHGIPHVPPVPFLGNMGSLVRQKSNLHDVIDRTYNLDPGAKYVGIYEFTTPIIILRDLDLIKTITMKYLDHFPDHRSFAYEGADPVFGSMLFAMKGERWKEHRNMLTPTLTSSKIKGMFKLMTECAVRFADFLSVLPENERETEMKALLSRYANDVIASCVYGVSVDSINDPKNIFYVYGRRGTNVVGLKKSMFVLIHRNMPWLAKLFGLRFLEKHVQKFFYDLVYETIESREKLGTNRSDVLQLLMDIRDKANSSGKMTTMTVENVAIHAFTFFFGGFDSITSVTTLLTQMLAEHPDVQARLQQEIDETLRSNDGVLTYDAVHGMKYMDAVINETMRFCPVLPFLDRMCVESFQLPAPVPGGQPFTLRPGMNVWIPLAAIGRDPEYFEDPDKFDPDRFLNPEAGIKNSGAHFPFGLGQRKCIGERFAMMEMKVLLCYVLAACNVRIGSKTTVPMKLEKGLINANVKGGFWLKIEPRKVTYYNSSRSN*
>CYP335C1 AmGroup14.7f (258230-259825) minus, 531 aa, 1 exon
44% to AmGroup14.7d, 49% to AmGroup14.7g
MLDSWSITAAIVAVLAIAYYQLIWKYKHFERIGIPCYHSIPLLGSFWEAVIQRNNFAEISRKIYNSYPDTKYMGMYDTTTPVLLIRDTELIKAISVKHFEQFPDHRSFQNEATDPLFAKNLFALRGDRWREIRNLLSPAFTSSKMKSMFILMRDCAKEYGDYFASLTGDESTIELKDAFTRYTNDVIATCAFGVEVNSMKDRKNKFYVYGREGTTFGSWASIKFFVTRVLPVSVCTLLRIRLIRKEISDFFIDLVSTTIKTREEKGIVRPDMIQLMMESKGKLGAGKEMSMIDICAQAFVFFFGGFESTSTLMCFAAYEIAVNEDIQRRLQNEIDQVLEERDGEVTYAAVNEMKFLDAIIYEALRMYPVVVATDRVCMKPFELPPNRPGEKPYLLKEGDNVWFPIYAIQRDPQYYPEPDKFDPDRFLNDTKQMINSGLFLTFGIGPRMCIGNRFAMLETKVLLFHLFARCNLVPCSKTTIPMKLNRKGFSMTAENGFWFKIEPRSAKKEEKIAVPGTTMLIDKIPDRYPDN*
>CYP335D1 AmGroup14.7g (261868-263392) minus, 508 aa, 1 exon
49% to AmGroup14.7f
MAIFALLLIVLGILGSYHLLKSQNPFKEHGLPYKSYLPILGSTWESILRRKSFAVVIQEIYNLAPSARYVGFYNRTTPIVMIRDPELIKTIAVKNFDAFRNHRTVNDTQTDDVLLSGNLLLLRDNRWREVRSLNTPAFSTSKIRSMYRSMSEIAINVARYLSTLAPGQNIVEMKDIFTRYANDVFATCAFGISVDSLSDRENKFYELGREALDIHSTPILKLILIFAFPKLARRLGVSLVSKEATNFFTRVVSENIKMREEKGITRPDFIQAMIDKRNGRGRDDELTVEDITAQAFVFFFGGFETTSGLLSFAVHELAANPEIQGKVHAEIDRVLVSNNEITFERVNGLVYLDAVINETLRMYPIIPITDRECSKRFELPPVLPDAKPYVLKEGSHVWFPIYAIQRDPRYFEKPDCFDPDRFLDDNKKRSDAFNGDAYMPFGAGPRNCIGNRFSMVETKVALFHILAKCRLDVCPKTTIPMELRKRGVFLTAKNGFWLRIVPRHPVT*
>CYP336A1 AmGroup2.15 (200938-202428) plus, 496 aa, 1 exon
34% to 6AH1 Anopheles, new family? EST BI946448 from brain
MASAFLTLVTGALLLLCFYLYLKYTYWKRNGIP
YSKGYYPIIGHFLPLIMKKQSYSEIIEEIYRDSNHSMVGMYKGMKPVLILRDINLIKTVLQSNFSKFHENAVKIDPKLDPLLAKNPFFCYGELWQTGRKRLTYAFSNARLKILFAAVYEVCTKFRNFLDRRLESSKKYEVELKSLFLKFTSEVVANAGLGIEGFCFEDGKVQSIFTNLDNNDFLDTFLVGIIMHFPFLTKLLRIKFLPTKHDKFF
RTVVKKNLELRKSDPIPRNDFIQLMIEMEQTGEKIDEEIVAAHAVSFYLDGVETSSVTLNFIGCQLAIHQDVQEKLRKEVRSTLEKHGGVLTFEALKDMTYMNQVISESQRYFSALGFLGKICTDEFELQGSDGLNYRAKPGTELLIPICGLHKDPKYWDNPEIFDPERFSDENKQRIEKMAFIPFGEGPRICVGMRMAMLQMKSCLATLMKDYKLEVSPKMQLPLKLSPTYFLSAPLGGGWVLISKA*
CYP4 clan (5 seqs)
>AmGroupUn.2145
(673-4743) minus, 545 aa, 7 exons
CYP4G11 63% to 4g15m, 100% to AF207948 partial seq
MAAASATGFSASSVFLSLLIPALILYFIYFRISRRHLLELAEKIPGPPALPLIGNALDLFGT(1)
MFSQVLKKAENFKDVVKIWVGPKLVICLIDPRDVEIILSSNVYIDKSTEYRFFKPWLGDGLLISTG(1)
QKWRNHRKLIAPTFHLNVLKSFIDLFNANARSVVEKMRKENG
KEFDCHNYMSELTVDILLETAMGVSKPTRDHNAFEYAMAVMK(2)
MCDILHLRHTKIWLRPDWLFNLTKYGKNQIKLLEIIHGLTKKVIQLKKEEYKSGKRNIIDNSAQKTESK(0)
XXXXXXXXXXXXXXXXXXXXXXXXXXXX
TNNIVVEGVSFGQSVGLKDDLDIDDDVGEKKRQAFLDLLIEAGQNGVLLTDKEVKEQVDTIMFE(0)
GHDTTASGSSFFLAVMGCHPDIQEKVIQELDEIFGDSDRPATFQDTLEMKYLERCLLETLRMYPPVPLIAREIKTDLKLA(1)
SGDYTIPAGCTVVIGTFKLHRQPHIYPNPDVFDPDNFLPEKTANRHYYAFV
PFSAGPRSCVGRKYAMLKLKIVLSTILRNFRVRSDVKESEFRLQADIILKRADGFKIRLEPRKQVASTA*
>CYP4AV1 AmGroupUn.2000 incomplete on N-terminus (1-4856) plus, 496 aa, 7 exons
41% to 4c3, probable new subfamily in CYP4
AADG02019009.1
MIKWGKELGDMYLIWVGMRPFIFLYKAEAIQPLLSSSVHIDKSLEYQYLQPWLGSGLVTSTG(1)
EKWHFHRKLLTPTFHSGLLELYLKTTIREAQILISCLRKEIGKPEFDIVPYAKRAALDIICD(1)
SSMGCNINAQKNFENEYVQAVNT(2)
LASISQRRFLNVWMSFDPIFKLTSWGKRHDHALSVTHGFVNK(0)
XXXXXXXXXXXXXXXXXXXXX
WKDRKDTNFNEKSHKRQALLDLLLELSKDGKVLTDDDIRDEVNTFMFAGHDTTATSVSWILYALGRHPQYQ(0)
ELIIEEYDETVGTKELTLDILSKLTWLEACIKESWRLYPVTPLIARQIYHPITIL(1)
GHEIPIGSTVLVNSFLLHRDSRYFPEPDIYRPERFLPDGPKYPSYAFVPFSAGSRNCIGWKYGTMIVKVLILYILKNFHVESLDTEDQLRFISELVLHNADGLRLKITPRK*
>AmGroupUn.8281
incomplete on both ends (644-1499) plus, 132 aa, 3 exons
44% to 4c3 aa97-209, 45% to 4M5
(1)ELWKFLTQLSKQYYPIYRMWTFLEAYVHICHPDDIE(0)
TILGNIKFTKKGFGYKYLKPWFNTGLLTSSG(1)
HKWHVRRKILTSAFHFNVLRQFVDIFIEDAERLIKTLESEEGIFVENLLQLTSEHTLNVICG(1)
C-helix
>AmGroupUn.2540
incomplete on N-terminus (5369-5938) minus, 71 aa, 2 exons
52% to 4ac1m, 53% to 4J9
PYAYVPFSAGPRNCIG(1)
QRFAMLELKTYLGLLLYNYYFEPIDYLKDVTFVSGIVLRLENPVRMKFIPVKKIC*
>AmGroupUn.3324
incomplete on N-terminus (901-1990) plus, 285 aa, 3 exons
50% to 4aa1, 55% to 4AN3 partial
(2)GQIMLLYRMIRPWLLIEWIYRLTKYGREEEKQRKNLFDTCFKMVKEKRDLLQSKDRISNNDIKKNKN
ISLLEYMVEINEKNPCFSDEDIVEECCTFMLAGQDSVGTATAMTIFLLANHPEWQNKCIEEIDEIFNGDT
RFPTISDLKEMKCLEMCIKESLRLYPSVPIIGRTLGEDIKIG(1)
XXXXXXXXXXXXXXX
NTHHLPHHFPDPDTFKPERFNSENSEKRHPYAYIPFSAGPRNCIG(1)
YKFAMLEMKSIISAILRKCRLQSIPGKKEIRPKFRMTIRAQGGLWVKIIERDQILKSIAA*
mito clan (8 seqs)
>CYP314A1 AmGroup5.25 AADG02002903.1 incomplete on N-terminus (17670-19852) minus, 471 aa, 9 exons (19852-19708, 19574-19378, 19298-19111, 19031-18829, 18707-18579, 18471-18305, 18238-18160, 18072-17875, 17777-17670)
43% to 314A1, 55% to pea aphid (Acyrthosiphon pisum) P450 below
cyan is extra seq added based on CF588143
note: This seq is less than 55% identical to Drosophila and Anopheles 314A1s, but there appears to be only one orthologous seq in each species and it does not make sense to give the ortholog a different subfamily.
MLLSSAWFEVIAAVLLTILIFVTSHRPAWWFWTATSHEASGKSVKILLPRFT (1) possible N-term exon
LPGPFSLPIFGTRWIFSCIGYYKLNKIHDAYKD(1)
LNQRYGALCKEEALWNFPMISVFSRQDIETIIRRNSRYPLRPPQEVISHYRRTRRDRYTNLGLVNE(2)
QGQTWHDLRVALTSELTAASTVLGFFPALNIVADSFIELIRRQRVGYKVTGFEELAYKMGLES(1)
TCTLILGRHLGFLKPDSSSELATRLAEAVRIHFTASRDAFYGLPLWKLLPTCAYKQLIESEDAIYN
(2) revised
IISEIIETTIQEKRDDAKDESVEAIFQSILRQKNLDIRDKKAAIVDFIAAGIHT(0)
LGNTLVFLFDLIGRNPTVQNKLYEETYALAPAGCDLTIDNLRKAKYLRACITESLR(2)
LIPTTTCIARILDEPIELSGYRLTAG(0)
TVVLLHTWIAGLNEENFKDAKKYLPERWTTPTTPHSPLLVAPFGAGRRICPGKRFVDLALQLILAK(0)
IIREFEIIVEEELDLQFEFILAPKGPVSLGFRDRS*
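The note in the CYP314A1 header refers to a 55% identity cutoff for subfamily membership. A minimal sketch of that rule of thumb follows, assuming the customary P450 convention of roughly 40% identity for a family (a threshold not stated on this page) alongside the 55% subfamily cutoff. `suggest_rank` and `only_ortholog` are hypothetical names, and the override mirrors the reasoning in the note rather than any formal rule.

```python
def suggest_rank(percent_identity, only_ortholog=False,
                 family_cutoff=40.0, subfamily_cutoff=55.0):
    """Suggest how a new sequence relates to its best named match.

    The cutoffs are rules of thumb; naming decisions on this page
    also weigh orthology, as the header notes show.
    """
    if only_ortholog:
        # Exception described in the note: a lone ortholog keeps the
        # subfamily name even when identity falls below the cutoff.
        return "same subfamily"
    if percent_identity >= subfamily_cutoff:
        return "same subfamily"
    if percent_identity >= family_cutoff:
        return "same family, new subfamily"
    return "new family"
```

For example, `suggest_rank(43, only_ortholog=True)` corresponds to the CYP314A1 case above, where 43% identity would otherwise point to a new subfamily.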
>CB336480
Tribolium castaneum embryonic cDNA
MFEKIFQSLDVTSLLIIAIFFLFLEYRPPWWYRNNDCKKGVKLIPGPLAL
PGLGTTWIFFFGGFSFNRLHLYYENMYKR
YGPVMKEEYWCNIPVINLFEKREIVKVLKAGGKYPLRPPVEAVAHYRRSRLIDTLALG
>CF588143
pea aphid (Acyrthosiphon pisum) 48% to 314A1ps
2 RVLRQSGKYPIRPPNEVTANYRKSRPDRYTNTGLVNEQGEVWAMLRNKLTPELTSPRTIR 181
182 RFLPEVNQLADDFNNLISLARDGNNVVRGFEGYCNRMGLES 304
305 TCTLILGRRIGFLDGEVSETATRLADSVTSQFRASQEAFYGLPLWKLIPTKAYKDFVAS 481
482 EDALYDIVSEFVESALIDEQQSFTDVRSVFVSILQASELDNRDKKAAIIDYIAAGIKT 655
LGNTLVFILYLV 691
>CYP334A1 AmGroup9.6 alternate splicing possible? (1158493-1161102) minus, 514 aa, 9 exons (1158603-1158493, 1158868-1158668, 1159034-1158959, 1159285-1159117, 1159526-1159346, 1159843-1159593, 1160068-1159927, 1160289-1160153, 1161102-1160830)
34% to 12A5m, a new family in the mito clan
MTESQTASVDESLRTDTIPLLDHTATTEVSPTTFEVSTMKVDTQIFDKA
PLPFDEIPGPAILKIWEKYWKYVPLLGTQLLSSLLINRFTQG(0)
VPCCRPEHIAEVFKQEGDTPVRSGIDILQHYRLNYRKYRLAGPFSM(2)
QGTEWLEIRDKVEDTFNQISSTFFTKIDTCCNELITRICKIRNRQNE(0)
VPVSFYEDLIRWAMECFCDLTFNKRLGFLEPIGYNSSSEASKLINALTTAHKY
MSRCETGFQVWRFFLTPFARKLFEACDVLDK(2)
VIGKYVRQAQCKLRIRKSHSEESSMTERSPVLEKLLLNEGIHPDDICTMLMDMIILGIQA(0)
TVNSEAFLLYHLAKNPRTQRKVYDEIISVLSNDNSSFTEKSLKNMPYLKACIQETLR(2)
LHPAIPYITRLLPKTISLHGYTIPKG(0)
TFVIMANQITSQREENFEDPFKFWPERWLSNSSKEDVHFSYLPFGHGIRSCLGKNMAEAKMMLLTAK(0)
LVRQFRIEYDYADIKSRFMMVNVPNKPLRFRFVNRN*
>CYP315A1 AmGroupUn.1189 (10257-12510) minus, 535 aa, 6 exons
38% to 315a1m, probable mito clan, maybe 315Bx
note: This seq is less than 55% identical to Drosophila and Anopheles 315A1s, but there appears to be only one orthologous seq in each species and it does not make sense to give the only ortholog a different subfamily.
MNLAQNILKSGKSVSLSSNVIALKYNVPGCGYAGASQTSRIDDLSDISKSTDGGNRSKIEITEKLRDRNYGTVAVATSESILQEMPEPRGIPVFGTLFSFILSGGPKKQHEYVDKRHKELGPVYKERIGPTTAVFVNSIHEFRKIFRLEGSTPKHFLPEAWTLYNEIRKCRRGLLFM(2)NGEEWVYFRKILNKVMLLPDPTNLMIAPCQEVAIELKRKWQKQIKTNNIISNLQVQLYQWSIEA(1)MMATLMGSYWYSYKHQLSRDFEILAETLHEIFEYSAKLSIIPVKLAMNLRLPVWKKFVASADTAFEIVRMLVPEMAKLGGNGLLKKMMDEGIRAEDAICIVTDFILAAGDT(0)TATTLQWILLLLCNHPEKQEELFKHLKDLSQEDILRLPLLKGIIKESLRLYPIAPFISRYLPEDSVIGNYFVPKG(0)ELLVLSLYSSGRDAANFPQPNEFRPERWIRTQKGIYQGVVHPHASLPFALGARSCIGRKLAEIQISFALAE(0)LIKSFKIECINKNQVKLILHLISVPSQSIKLKLMERN*
>CYP302A1 N-term AmGroupUn.2216 incomplete on C-terminus (289-2832) minus, 202 aa, 3 exons
45% to CYP302A1 aa19-183, in CYP302 family, mito clan (disembodied)
MCTLLKKCNQSIRKKLFIKFYSNEFTKSKIKINHSQPKAFYDIPGPKSLPIIGTLYKYLPFIG(1)
EYSFTNLYESGKKKLKCFGPLVREEIIPNVNVIWIYRPEDIAEIFKAESGLHPERRSHLALLKYRKDRPNIYNTGGLLPT(2)
NGSEWWRLRKEFQKVSSKPQDVINYLKETDCVIQEFVELCNNEKFADFLPLLSRLFLEC(1)
>CYP302A1 C-term AmGroupUn.3145 incomplete on N-terminus (122-2230) plus, 250 aa, 4 exons
51% to 302A1 (disembodied)
note: these two pieces are probably from a single gene
(2)IALELVSRKKNNMKIRYNKSFLDAYLENPVLDIKDIVGMACDMLLAGIDT(0)
TSYSTAYILYHLAKNQNIQEKLRIEATQLLKNHNEPISINILRNASYTKAVIKESLRLNPISIGIGRILQTDVVLSGYRVPKG(0)
SVVVTQNQIICRLPEYFEEPNLFIPERWLREYSENNNKINYKKTVHPYVLLPFGHGPRSCIARRFAEQNMQILLLR(0)
ICRRLKISWHGDDLGMISLLINKPNALLKFNFHDILNNNSV*
>CYP49A1 ortholog AmGroupUn.423a incomplete on N-terminus and C-terminus of exon 6 (214-3539) plus, 312 aa, 7 exons
AADG02012456 Amel_1.1_Contig12456, AADG02012457 Amel_1.1_Contig12457
45% to 49A1, mito clan
(2)IKKIRNKNDEVPDDFLNEIHKWSLES(1)
IARVALDVRLGCLDDDANIETQQLIDAVTTFFKNVGILELKIPFWKLFNTPTWLKYVNALDTILS(2)
ITSRYTTVALSRTKEAEKSDKEPSLLERVLALENDTKLATILSLDLFLVGIDT(0)
TSSTVASTLYQLALHPDEQDRAYNEVCNILPSKDMQLDGKHLDKLKYLKACIKETLR(2)
MYPVVIGNGRCMTKDTIIKGYRVPKG(0)
VQVVFQHYVISNLDKYFPHSDKFLPERWLQSDGVRHSFASLPFGYGRRMCXXXXXXXXXXXXXXXX (0)
ILQRYKIEYHHEKLEYYINPMYTPKGSLNLKFIDR*
>CYP301A1 ortholog AmGroupUn.423b incomplete on N-terminus and missing exon (3686-6941) minus, 532 aa, 8 exons
CYP301Ax 66% to 301a1m, revised intron 1 boundary
MFCNETIRLKIEEVETDVKQVENGEAFCPTLSDREQASLP
cgt RI AADG02008009 possible N-term??
INFEVLSLYLHIFISNSTRAHFPLYCRVRTGAITSTKSCMVD
(not part of protein)
HDTTTIQQGKPYKDIPGPRPIPILGNTWRLFPMIGQYEISDIAKLSQIFYDEYGKIVRLTGLIGRPDLLFVYDVDEIEKIYRQEGPTPFRPSMPCLVHYKSVVRKDFFGSLPGVVGV
(2) HGEPWREFRTRVQKPILQPQTVRKYTVRKYITPIEMVTSDFIQR(2)
IQEIKGEDGEVPGDFDNEIHKWALEC(1)
IGRVALDVRLGCLSSNLTSDSEPQKIIDAAKFALRNVAILELKAPYWRYVPTLLWSRYVRNMNYFIE(2)VCMKYIDATMERLKTKKAVDEYDLSLMERILAKETDPKIAYILALDLILVGIDT(0)
ISMAVCSILYQLATRPEEQEKIYQELVEILPDPSVPLNMSHLDKAIYMKAFIREVFRYY*???(0)
VQVVFPTVVTGNMEKYVTDAKIFKPMRWLKESTKTLHPFASLPYGHGARMCLGRRFADLEIQVLLAK(0)
LIRSYKLEYHHKPLKYKITFMYAPDGELKFKVLPR*
>AmGroupUn.10899
incomplete on both ends (1221-1394) minus, 58 aa, 1 exon
probable CYP301A seq. 61% to 301A1ps aa126-182, 57% to 301A1 honeybee aa 115-171
EGLLGRPDMVFIYDANEIERIFRQEERMPYRPSMPSLNYYKHVLRKEFFKENAGVIAV(2)