Honeybee Cytochrome P450s from version 1 of the Apis mellifera genome

 

These sequences were submitted Feb. 11, 2004 by May Berenbaum in collaboration with

Gene Robinson (genome sequencing), Hugh Robertson (genome annotation) and Reed

Johnson (P450 annotation).

 

There are 4 CYP clans in insects: CYP2, CYP3, CYP4 and mito.

CYP2 is the clan with CYP18 in it.  Sometimes it is called the CYP18 clan.

CYP2 also has CYP303, 304, 305, 306, 307

CYP3 has CYP6 and 9 in it and CYP28, 308, 309, 310, 317 (CYP6 subfam)

CYP4 has CYP4, 311, 312, 313, 316, 318

mito clan has CYP12, 49, 301, 302, 314, 315
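
The clan lists above can be written as a small lookup table. The Python sketch below is only an illustration: the names CLAN_FAMILIES and clan_of are hypothetical, and the table holds only the families listed above, so any other family (for example the new honeybee families further down) returns None.

```python
import re

# Family numbers per clan, copied from the lists above.
CLAN_FAMILIES = {
    "CYP2": {18, 303, 304, 305, 306, 307},
    "CYP3": {6, 9, 28, 308, 309, 310, 317},
    "CYP4": {4, 311, 312, 313, 316, 318},
    "mito": {12, 49, 301, 302, 314, 315},
}


def clan_of(cyp_name):
    """Return the clan for a name such as 'CYP6AQ1', or None if unlisted."""
    match = re.match(r"CYP(\d+)", cyp_name)
    if match is None:
        return None
    family = int(match.group(1))  # leading digits are the family number
    for clan, families in CLAN_FAMILIES.items():
        if family in families:
            return clan
    return None
```

For example, clan_of("CYP18A1") looks up family 18 and reports the CYP2 clan.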

 

The honeybee sequences have been sorted into these main CYP clan bins.

32 genes have been named.

Feb 20 version 1PM, D. Nelson

 

CYP2 clan (11 seqs)

 

>AmGroup13.1a (71904-74217) plus, 524 aa, 5 exons (71904-72236,

72653-72774, 72894-74323, 73404-73668, 73794-74217)

60% to CYP18A1m = CYP18A1 honeybee

MGGTRIEVLCTFLVFLGVLLVARCLQWLRYVRSLPPGPWGVPVFGYLPFLKGDVHLRYG

ELAKKYGPMFSARLGTQLVVVLSDHRTIRDTFRREEFTGRPHTEFINILGGYG(1)

IINTEGAMWKDQRKFLHDKLRGFGMTYMGGGKKIMESRIM(0)

REVKTFLRGLASKRGTPTDVSASLGMSISNVICSIIMGVRFQHGDARFKRFMDLIEEGFKLFGSMAAVNFIPVMR

YLPCLQKVRNKLAENRAEMAGFFQETVDQHRATFDEGTMRDLVDAYLLEIEKAKGEGRATTLFQGKNHD(1)

RQMQQILGDLFSAGMETVKTTLEWAIILMLHHPDAAIAVQEELDQVVGKSRMP

VLEDLPFLPITEATILEVLRRSSVVPLGTTHATTR(2)

DVTLHGYTIPAGSQVVPLLHAVHMDPELWEKPEEFRPSRFLSAEGKVQKPEYFMPFGVGRRMCLG

DVLARMELFLFFSSLMHTFELRSPQGSSLPSLRGNAGVTVTPDPFDVCLLPRNLDLIEDNDMISTGAILRNIGSH*
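
Headers such as the one for AmGroup13.1a give an overall gene span followed by per-exon ranges, so the two can be checked against each other. The sketch below is a hypothetical helper (parse_ranges and exons_outside_span are not part of any annotation tool) that lists exon ranges falling outside the stated span. Applied to the AmGroup13.1a header it reports the third range, 72894-74323, whose end lies beyond the stated gene end of 74217; the listing does not say what the intended coordinate is.

```python
import re


def parse_ranges(text):
    """Return (start, end) integer pairs for every 'a-b' range in text."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)-(\d+)", text)]


def exons_outside_span(span, exons):
    """Return the exons that are not fully inside the gene span.

    Coordinates may be written high-to-low for minus-strand genes,
    so each pair is normalised with min/max before comparing.
    """
    lo, hi = min(span), max(span)
    return [e for e in exons if min(e) < lo or max(e) > hi]


# Values copied from the AmGroup13.1a header above.
span = (71904, 74217)
exons = parse_ranges(
    "71904-72236, 72653-72774, 72894-74323, 73404-73668, 73794-74217"
)
flagged = exons_outside_span(span, exons)
print(flagged)
```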

 

>CYP306A1 AmGroup13.1b (76876-78884) minus, 499 aa, 7 exons (78884-78557,

78493-78345, 78209-78007, 77901-77595, 77538-77383, 77298-77123,

77057-76876)

AADG02005913.1

46% to 306A1 = CYP306A1. Cyan region does not match. CYP306s are longer in this

region; check for frameshift. Look for YEPECILEH. In 18clan/2clan.

CYP306B1 may be the ortholog of CYP306A1 in Diptera (flies and mosquitos)

MILDHYIAIFVLPFLLLLYVVRKNRKARRLPPGPWQLPLLGYLPWIDAEKPHETLTR

LSRVYGPVCGFRMGSVYTVLLSDPQLIRQSFAKDSITNRAPLYLTHGIMKGYG(1)

IICAEGEQWKDQRKFISNCLRNFGMVKHEGAKRDKMEERISDAVNECVS(0)

VLRDRGANGPIDPLDTLHHCLGNLVNSIVFGKTYEEEDRIWKWLRHLQEEGVKQIGVAGPLNFLPFLR(2)

FLPQYGRVIRSIVDGKDKTHEIYRQILD  EHRARVDSGNGCKIDSFLAAFDEQMRKK

DGAESGYFTEP  QLYHLLADLFGAGTDTTLTTLRWFLLFMAAHPMEQ(0)

EKIQSEMDLCLREGEQPTLNDRIVMPRLEAAIAEVQRIRSVTPLGIPHGTSE(0)

DVEIGGYDIPCGAMIVPMQWAIHTDPAYWRDPLEFRPDRFLSEDGTFFKPESFLPFQNG(1)

KRVCVGEELARMILFLFAGRILRAFSVRVPAGEIADLEGECGITLVPKPHRLAFVGRDR*
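
The sequence lines in these records carry extra marks: digits in parentheses at segment ends, internal spaces, and a trailing asterisk. The Python sketch below separates those marks from the residues so that the residue count can be compared with the "aa" figure in the header. It assumes the parenthesised digits are intron phase markers and that the asterisk is the stop codon; the listing itself does not define either, and the function name is hypothetical.

```python
import re


def split_annotated_sequence(lines):
    """Return (residues, markers, has_stop) for one record's sequence lines.

    Assumption: '(0)', '(1)' and '(2)' are intron phase markers and a
    trailing '*' marks the stop codon. Characters such as '?' or 'X'
    are left in place because they stand for unknown residues.
    """
    text = "".join(lines)
    markers = [int(p) for p in re.findall(r"\((\d)\)", text)]
    residues = re.sub(r"\(\d\)", "", text)    # drop the markers
    residues = re.sub(r"\s+", "", residues)   # drop spaces and newlines
    has_stop = residues.endswith("*")
    return residues.rstrip("*"), markers, has_stop


# Shortened fragments, for illustration only.
protein, markers, has_stop = split_annotated_sequence(
    ["MILD HY(1)", "IICAEG(0)", "DR*"]
)
print(len(protein), markers, has_stop)
```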

 

>AmGroup14.9 (18918-22680) minus, 508 aa, 3 exons (22680-22278,

20154-19363, 19246-18921)

CYP307B1  55% to 307B1 Anopheles, probably the ortholog of 307B1

MIPLTATTCFLIAITFLALALILLDHLRSKKTTKSVVPGDDDQHALPEPPGPKPWPILGSLHILGRYD

VPYKAFADLVRDFDCQVIKLRMGSVPCVVVNGLENIKEVLTVKGHHFDSRPNFARYHLLFGGNKENS(1)

LAFCNWSDVQKARREMLRAHTFPRAFSTRFNELNGIIGDEMEFMVNHLDSLSGTSVHAKPLILHCCANIFI

TYLCSKNFHLEHDGFRNMVENFDKVFFEVNQGYAADFLPFLMPLHHRNMARMAHWSHEIRRFVIKNIIADR

VNSWNDVVPEKDYVDCLINHVKSGTEPQMSWNTALFVMEDIIGGHTAIGNLLVKVLGFLATRPEIQRLAQD

EIDALGLAGNFVGLENRRSLPYVEAIILETIRIIASPIVPHVANQDSSIAG(1)

FRIKKDTFIFLNNYDLNMSTDLWTSPEEFMPDRFVQNGRLLKPEHFLPFGGGRRSCMGYKLVQYVSFAILA

SILKNFTITPVQKEDYTIPIGNLALPEMTYKFRFERR*

 

>CYP303A1 AmGroupUn.2253 incomplete on N-terminus (55-1889) plus, 382 aa, 8 exons

43% to 303A1m, 44% to 303A1 anoph. CYP303A1 in 2 clan/18 clan

may be the ortholog of 303As

(1)LLLVDGNLWNEQRRFVLKHLRDFGFGRQS(1)

LYMNANEYTGNNVTQSQLGTIISMHNIFGITVLNSLWKMLAGKRL(2)

YNIDDKELIYFQRILSITLNEIDMLGAPFSHFPLLRFIAPEISGYKSFVKIHEELWKFFK(0)

DEVNNHKNTFNSDSPGNLIDIYLTILNSENYGKTFSD(1)

VSEPQLVAICVDLFMAGSETTSKVLGFCFLYLVLFPHVQKKAHEEIDRVIGRNKLPTAEDKAK(2)

MTYMNAIVLESLRMFAGRSLNLPHRVQRDTKISDYKIPK(0)

NTIIITNFNGILMDESWGDPENFRPERFIDGSGNIVTPSRFLPFSAG(1)

KHRCMGENLAKTNIFIIATTLLQAFTFSEIPGEKPTIEHFIDGTTISPKPYRVNVSLRI*

 

>AmGroupUn.6110 incomplete on both ends (1-622) minus, 126 aa, 2 exons

39% to 303A1

LDEQLMMILIDLFLAGFTTTSTTLDFLFLIVTLFPDVQRKVQKEIDSVIPYDRLPNMEDKAK(2)LPYVEAVISETYRLWPVFPIIGPRRVLCDTNIDKYVIPKDTTILFNTYSINKDPTLYPDPDKFM

 

>AmGroupUn.7901 incomplete on C-terminus (1-459) minus, 126 aa, 2 exons

40% to 15B1

MWFVILCFVIVLIKILFDYSRPINFPPG(1)

PRGLPFIGNILDIIRLINETKYYSDTWCRLAEKYGSVVGLRLGLDQPLIIVSGKSAVTEMLNRSEFDGRPSGFLYKYRCGGMQQGILFTDTDVWHSQR

 

>AmGroupUn.10970 incomplete on N-terminus (112-677) plus, 121 aa, 2 exons

probable 18clan/2clan 41% to 15B1 40% to 304B1, 41% to 305A2

(0)DTTVLLDFHSAHNDPAYWDHPEEFRPQRFLDANGRFCQNNANIPFGLA(1)

IPFFLGKRRCPGEMLARTSLFLYFAYVIHYFDIEISPEHGKPDLNGHDGFTISPKSYYLKITARSDVTNCSTI*

 

>AmGroupUn.8493 incomplete on both ends (509-1028) minus, 141 aa, 2 exons

39% to 304A1

(1)LPRLPIIGSYWHLLWHDYEYPYNGIIHYVNKLQSKIVTCYFG

SHKTIIANDYKSIKEVLTKQEFNGRPINVDIVLQRAFGKSLG(1)

IFFTEGTLWHEQRRFALRHMRDFGFGRRHEIFETNVMEEIAILVDMLKEGPINDEEK(0)

 

>AmGroupUn.897 incomplete, missing exon 5 (51606-53773) plus, 357 aa, 5 exons

37% to 305a1m 35% to 303A1m

MLYVVISLLLALYCIFCIYDCVKPHNFPPG(1)

PKWLPLIGCFLTFRRLKLKHKYTYVAFQELSKTYGPILGLKL

GSQKLVVISTHDLVKKVLLQDEFNGRPDGFFFRVRAFGKRKG(1)

ILFTEGSMWSQCRRFTMRHLRSFGLGQSTMEKYLTVEAENLVNYL

RRVSTKGPVPMHTAFDIAVLNSLWCMFAGHRFDYENEKLAEILEIVHDSFR(2)

LMDTMGGIISQMPFLRFIIPELSGYNNLMEILRKLWNFLDEEINNHEKHLSGNQPQDLIEAFLLEISSRNGVQNDSIFDS(1)???

 

(1)KRRCPGEMLARTSLFLYFAYVIHYFDIEISPEHGKPDLNGHDGFTISPKSYYLKITARSDVTNCSTI*

 

>AmGroupUn.7452 incomplete on both ends (700-954) minus, 85 aa, 1 exon

no good match 38% to 305A2

(1)PFSWPFIGNQILLKRLSRKFGGQHKAFMELSKRYNSDIITVNISY

EKIIVVSGSKFCDMILQNEEFQGRPWNEFIKVRNMGKKQG(1)

 

>AmGroupUn.960 incomplete on both ends (8311-11562) minus, 147 aa, 2 exons

42% to 305a3

(1)NQLLYIIKDLFSAGVDTTNSTIGFIIAFLVVHQDVQSKVYDEISRVIDKDIYPSLSDKDR(2)LPYLKAVIAEVSRLANIGPTSIPHRAVKDSTFLGFEIKKNYTLLANFKSIHMDKEHWGDPEIFRPERFINEKGDFINDSWLMPFGLG(1)

 

>BI513047 EST 50% to 305A1ps C-term

2 FFFLQFGKGKRRCPGDILAKATIFILFVGIMQKYTLLPVPGKGPHSIKINSGITLTPQPYNVLVEKR* 205

 

 

CYP3 clan (36 seqs)

 

>CYP6AQ1 AmGroup12.14 (315628-318450) plus, 500 aa, 5 exons (315628-316108,

316238-316604, 316677-316861, 316989-317277, 318272-318450)

45% to 6K1, 42% to 6g2m 43% to 6G1ps new subfamily in CYP6

cyan = missing seq. from EST BE844578

yellow = EST BE844462, underlined seq = EST BE844394, green = EST BE844353

magenta = EST BE844352, gray = EST BE844331 all ESTs from antennae

 

MNLLTPYWSLDILIVSSSLMIAVYLYASWKLKYWSRRGIMQITPSPLFGNFKKCILFQKSVSEIIRELYGQNEGLPFMGFY

IFYKPFFLVRDIELVKHILVKDFNTFANKHTSADSKNDRIGYSNLFIIKNPAWKYLRGKLTSVFTSGKLKKMFDLMLIIG(1)

KNLEKHLELLNLDG

NGKEVELKDLCANFTTDLIGTTAFGVNLNSLKDPNSDFRENGRLVFDYNLKRAFEFFSIFFFPNLS

KYVSIKFFGKATDYFRNSFWSVINQRIESNVKRNDLIDCLIELREKHKNDESFEGFR(1)

FDGDDLVSQAAIFFTGGFETSSTTISFTLYELALNKDIQKTVRTEIHEALAQTDGKITYDM(0)

ITNLPYLDMVVSETLRKYPPLGFLDRVALHDYKIPNSDVTIDKDTPVIIPMIAFHYD

PKYFPNPEKYDPLRFSEEVKKTRPSYVYMPFGEGPHICIG(1)

MRLGLLQSKLGIIEILKDYEVSPCEKTKIPMVLDPKGLTTTALGGLYLNIRKITIAAG*

 

>CYP6AR1 AmGroupUn.19 (44801-47550) plus, 502 aa, 5 exons

50% to AmGroupUn.5496, 47% to AmGroupUn.792b, 38% to 6a13ps all best hits to 6as

probable new subfamily in CYP6

MSWLMIETVGLIATVFFLLYYYSMSKLDYWRKRGVKGPKPLPFLGNFKDVLLAKESTMDCFERAYKEFKDEPMVGMYGSHEPLLILRDLDLIKDVLIKDFNKFAQRTQGAIRE(0)VEPLSEQLFRLDAERWRPLRLKLSSFFSSGKLKEMFHLFVECSDNFEKYLEKMVEKGGLVECRDAAAKFSTDVIGACAFSIHTNALTDENSQFRKMGKQALATNLQQFLNDRLREYPFLFKIFGRFFVDHEVTNFFANSIKDAMDYRIQNNVHLRDVIDILADIRENPTKCGLKE(1)ADNLFLTSQAVLFFLAGFENASLTISNALYELAWKPEIQEKARAEIVNVLQKYDGKITYDGLEEMKYLEACIFE(1)TLRMYPVLQWLSREAMETYTFTGTKVTIPKGQQVFLPIYAIQRDPDIYPNPDNFDPERFTDDKIKTRHSMTHLPFGDGPRHCSG(1)IRLAKKQLKVGLVTVLSKFKVEVCEKTRKIYQKDKKPLFLLQPVDGIHLKISKVSV*

 

>CYP6AS1 AmGroupUn.5496 (264-2651) plus, 498 aa, 5 exons

44% to 6A14, 64% to AmGroupUn.792b

MDYFQILCAISIVILTIYYYYSSKYTFWKKRGISGPKPIIFFGNFVDSIIQKRSTSEAVKKWYDDYKHESVFGIFGGTTPLLVINDLDMIKDVLIRDFSLFVDRGFHIFPK(0)IEPLSEHLFLLEAERWRPMRMKLSPIFTSGKLKEM FFLIMESAGNLEKYLDEVIKKDEMVECRELAAKFMTDVIGSCAFGINTNSLLE

EDSEFRRMGKKISTPNLKVMLGNICKEFFPPLYEIVGSIF

TLKDVNEFFINLVSDTMKYRKDNNIIRSDFINMLMQLKEHPEKMENIE(1)

LTNTLLTAQAVVFFVAGFETSSSTMAFSLYELAQNQEIQDKLREEIRNMHEKNKGVLTYTDVKEMKYLDKVFKE(1)TLRKYPILPMLFRQAMENYTFKDTKITIPKGMKLWVPVHGIHHDPNIYPEPEVFDPERFEDDAFASRHPMSYLPFGDGPRNCIG(1)ARFAHYQSKVGLITILRHHKVNVCEKTTIPFKADERSFLLTLKGGVHLKITKI*

 

>CYP6AS2 AmGroupUn.792a incomplete on C-terminus of exon 2, missing 3rd exon

(8375-10702) minus, 356 aa, 5 exons cyan is from EST BE844607 from antennae

86% to AmGroupUn.5496

MDYFQILCAISIVILTIYYYYSLKYAFWKDRGISGPKPIIFFGNL?NSIIKKKSLSETVKKWYDDYKHESVFGIYEGTIPVLVINDLDMIKDVLIRDFSIFVDRGFHTFPK(0)IEPLTQHLFLLEAERWRPMRMKLSPIFTSRKLKEM???(1)

gap of 53 aa

EDSEFRRMGQE 511

509 IFSPSLKLMIGNTCKVFFPSLYEVIGNIFTMKDVDEFFINLVSDTMKYRKDND 351

350 IVRSDFINMLMQLKEHPEKMDNIE

LTDTLLTAQAVVFFIAGFETSSSTIAFGLYELAQNQEIQDKLREEIRKMHEKNKGILTYTDIKEMKYLDKVFKE(1)

TLRKYPILSTLSRKAMEN YTFKGTKITIPKGTKVWVPVYGIQHDPNIYPKPEVFDPERFEDDAFASRHPMSYLPFGDGPRNCIG(1)ARFAHYQSKVGLITILRNHKVNVCEKTTIPFKADERSFLLALKGGVHLKITKI*

 

BE844607 EST = 100% to AmGroupUn.792a, 83% to AmGroupUn.5496

543 EDSEFRRMGQE 511

509 IFSPSLKLMIGNTCKVFFPSLYEVIGNIFTMKDVDEFFINLVSDTMKYRKDND 351

350 IVRSDFINMLMQLKEHPEKMDNIE  LTDTLLTAQAVVFFIAGFETSSSTIAFGLYELAQNQ 171

170 EIQDKLREEIRKMHEKNKGILTYTDIKEMKYLDKVFKETLRKYPILSTLSRKAMEN 3
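
The EST lines above carry a number on each side of the translated segment. Assuming these are nucleotide positions on the EST (as in translated search output, which the listing does not state explicitly), the span between them should be three times the residue count. The sketch below, with the hypothetical name check_hit_line, performs that check.

```python
def check_hit_line(line):
    """Parse 'start RESIDUES end' and compare the span with the residue count.

    Returns (nucleotide_span, residue_count, consistent), where consistent
    is True when the span equals three nucleotides per residue.
    Assumption: the flanking numbers are nucleotide coordinates.
    """
    fields = line.split()
    start, end = int(fields[0]), int(fields[-1])
    residues = "".join(fields[1:-1])  # rejoin segments split by spaces
    span = abs(end - start) + 1
    return span, len(residues), span == 3 * len(residues)


print(check_hit_line("543 EDSEFRRMGQE 511"))
```

The first BE844607 line spans 33 nucleotides for 11 residues, so it passes the check.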

 

>CYP6AS3 AmGroupUn.792b (3396-5727) minus, 499 aa, 5 exons

64% to AmGroupUn.5496

MDYFQLLCVIGALLFAIYYYLTLTFDTWKNRGIPGPKPTIFFGNFQEVILKKISLAEKTKQLYQEYKNELVFGIFQGRTPILVINDLEMIKDVLIRDFSVFPDRGIHVNPK(0)VEPIFQTLFSLKSKTWRPLRMKLSPVFTSGKLKDMFPLILDCAKNLEEFVEKVRNSGEPVDCRDMAAKFTTDVIGSCAFGVCMNSLSPEGSEFRRMGEQLGKFSFKKLARDFTRLYMPFLFDIIGGYLQSHEVNNFFINLIRDSIKYRQENNVYRPDFVNTLKELKEHPEKLENIE(1)LTDALLTSQALVFFLAGFETSSTTISNALYELAQNPEMQDKLRKEIKEVYENNGGALSYTDVKEMKYLDKVFKE(1)TLRKYPVLAALSRQATENYTFKDTKIKISKGTRIWIPVYGIQHDPNIYPEPEVFDPERFEDDAFTSRHPMTYLPFGDGPRNCIG(1)ARFAHYQSKVGLITILRDNKVEVCAKTLIPYKSEPRNILMIPKGGKVELGITKV*

 

>CYP6AS4 AmGroupUn.1753 incomplete on C-terminus of exon 1 (2509-4596)

minus, 411 aa, 5 exons

80% to AmGroupUn.42b, 58% to AmGroupUn.792b, 41% to 6a17, 44% to 6a13,

new subfamily in CYP6?

MLHHFHILTAFVAIFLALYYYL(0)AELFSVNLFSVDATRWRPLRMRLSPVFTSGKLKEMFPLILECAEHLEQCLEDAVKRGGPVDCFEIPARYTTDVIGSCAFGINMNALSDERSEFRKMGRNMFDQNMIKFTRNLLRDFFPRFYNLLGFVLPYTESTVFMTKLIKGTIKYREENDVVRPDFVNLLMELKKHPEKLKNIE(1)ITDTLLAAQASVFFAAGFETSSTTMAHALYEMALNPDIQDKLRNEMKEFHAKNNGNLKYEDIKEMKYLDKVFRE(1)TLRKYPPGMLLRRKCNSNYTFHGTKVSIPAGTSVIIPLYAIQIDPKFYENPDVFDPERFNEDAVAARHPMTYLPFGDGPRNCVG(1)ARFAVYQTKVGLIKILQNFRVDVCEKTMIPYVKKINSITLAPRDGIFLKIEKITD*

 

>CYP6AS5 AmGroupUn.42a pseudogene? stop codon in exon 4 before heme binding

site and missing exon 5 (126603-129298) minus, 439 aa, 4 exons

56% to AmGroupUn.5496 43% to 6M2 new subfamily in CYP6?

MASSFEILCGIAVLFLALYYYLTSTFDFWKSRGVVGPKPVPFFGTTKDLILVKKSTAHFVKDIYEKYKNEPMVGLYATRSPFLLLNDPELIKDILIRDFSKFANRGLGVFER(0)TEPLSPHLLNLEVERWRPLRSRLSPIFTSGKLKEMFYLIIECSLNLEMYLDKLIEKNEPIECRELTARFTTDVIGSCAFGIDMSSMTNENSEFRRMGREVFAVNFMNVMRMKLKQFMPRLYDLLGYVMPDRTFAPFFTRVVTDTIKYRNDNNIVRPDFINMLMELQKNPQKLENIK(1)LTDSLIAAQAFVFFLAGFETSSTTMSNALYELALNQDVQKKLREEINTFCPQNNKELKYDDKEMEYLDKVFKE(1)TLRMYPPASILMRKAISDYTFNDTKITIPKEMKIWIPAFAIHRDSVINPNPNSFDPERFDKDAMASRHPMHYLPFGDG*

 

>CYP6AS6 AmGroupUn.42b alternate splice for exon 3? (121844-124324) minus,

497 aa, 5 exons

41% to 6a17, best matches all CYP6As 80% to AmGroupUn.1753

MFDYFQILIAFVASFLALYYYLTSNFDFWKNRNVVAPKPIPFFGNTKDVVLKKIEISNFIAELYKKYENEAMFGIFFGGSPNLILRDLDLIKDVLIKDFSTFDERGFKISER(0)ADPLNANLFNMDVTRWRPLRIKLSPVFTSGKLKEMFPLILKCAERLEQCLEDAVKRGGPVDCFEISARYTTDVIGSCAFGINMNALSDERSEFRRIGKRIFDLDKNILRSFLRQFFPRFYNLLGFVIPYSETSKFVTKFISEMIKYREENNVVKADFVNLLMELKKHPEKLQNIK(1)ITDNLLAAQAFVFFAAGFETSSTTMAHALYEMALNPNIQDKLRKEIKEFYANNNFTYEEVKKMKYLDKVFKE(1)TLRKYPPGVFLKRKCNSNYTFKGTKVSIPAGTSVIIPVYSIQTDPKFYENPDVFDPERFNEDAVAARHPMTYLPFGDGPRKCIG(1)IRFGVYQTKVGLIKMLKNFKVNVCEKTMIPYIKKINSFTLAPKDGIFLKVEKNN*

 

>CYP6AS7 AmGroupUn.42c incomplete on N-terminus (115419-117055) minus, 265 aa, 3 exons

45% to 6M2, 66% to AmGroupUn.1753

VTKFLTNIIVSTMKYRQENNIVRPDFVNMLIELKKHPDKLENIK(1)LTDTLLTAQAFVFFIAGFETSSSAISNALYELALNPEVQNKLRQEIKEYFNKHNELKYEYIKNMIYLDLVFRE(1)TLRKYPPGPLILRKSITNYTFNNTKVSIPEESFVWIPLYAIHHDPKIYPNPDAFIPERFNDDAIATRHPMHYLPFGDGPRNCIG(1)ARFAVYQSKIGLITILWNYKVEVCDKTMIPYEINPAAFLLTPKGGIYLKFTKIKNNEEILN*

 

>CYP6AS8 AmGroupUn.4533 (10741-13406) minus, 500 aa, 5 exons

53% to AmGroupUn.2631 42% to 6P4 48% to 6N1 partial

MYISLEIFCGIVVALIALYYYLTVNNNFWKNRGIAGPEPVLGFGNMKKVLLGKESMSQFLTKIYHEYKNEPIIGIFTTRTPQLIIKDPDLIKTILIKDFSKIMNRGLLPMVS(0)

GEPISQHLFNIEAERWRPLRIHLTPVFTANKLRGMFSLILECSMHFVSYVDSLVKKGEPVNVREVAARFTTDVVGSCGFGVEMNSLSEKESEFRRVGKSVFATNYARIIKHRIREFMPRLYNYILYLWPTDEMAEKIIKLTRERLEYREKNNLFRPDFMNILLDLKKHPEKIGLD(1)VTNEFLAAQAFIFFVAGFETSSSTISNALYELALNPDVQDKLRKEIKEFAAKNDGEWRYETIKEMEYLGKVFQE(1)TLRKYPSLPFLTRELIEDYTFESNKVTIPKGLKIWIPTYAIHNDPDIYPDPDKFDPERFSDDKIKQRHPMHFLPFGHGPRNCIG(1)ARFAIYQTKIGLINILRNFKLDVCDKTLIPYKHHPRGLLLMPLTDLYLKITRLTN*

 

>AmGroupUn.4532 incomplete on both ends (1-757) minus, 131 aa, 2 exons

83% to AmGroupUn.4533

LAAQVFIFFAAGFETSSTLISNALYELALNPNIQDKLRKEIKKFESQNDEEWKYETIRNMDYLEKVIQE(1)TLRKYPPVPFLNRELIDDYTFESNKVTIPKGLKIWIPTYAIHNDPDIYPDPDKFDPERFSED

 

>CYP6AS9 AmGroupUn.2792 incomplete on N-terminus, pseudogene??? heme binding

site probably not functional due to change in splice donor

(995-3268) minus, 367 aa, 4 exons

65% to AmGroupUn.4533

(1)IISSPVFTSGKLKGTFAQILNCSNDLVTHIDTLSKMEDSINMREVAAKFTTDVIGSCVFGIKMNSLSEIENEFRRIGRNIFATNFTNILKIRLLQSTPFLHSLLCKILPDEEMEMFMRITKETMEYREKNNLVRPDFMNILLELKKHPERIADIE(1)LTDDLLAAQAYIFFAAGFETSSTTISNALYELALNHDMQDKLREEIKEFEAKNDGEWRYETIKEMQYLGKIFQE(1)TLRKYPALSFLSRESIIDDYTFRDTKLTISKGTLVWIPVFPIHHDPNIYPEPEKFDPERFTEDKIKERNLMHYFPFGHGPRNC(1)ARFAIYQTKIGLIKILRNYKVDVCSKTLIPYKYDPFSFILVPLGGLYLKITRIQD*

 

>AmGroupUn.4458 incomplete on both ends (1-1218) plus, 203 aa, 2 exons

3aa diffs to AmGroupUn.2792 61% to AmGroupUn.4533

VFTSGKLKGTFAQILNCSNDLVTHIDTLLKMEDSINMREVAAKFTTDVIGSCVFGIKMNSLSEIENEFRRIGRNIFATNFTNILKIRLLQSTPLLHSLLCKILPDEEMEMFMRITKETMEYREKNNLVRPDFMNILLELKKHPERIADIE(1)ITDDLLAAQAYIFFAAGFETSSTTISNALYELALNHDMQDKLREEIKEFEAKN

 

>CYP6AS10 AmGroupUn.2631 incomplete on C-terminus of exon 1 (2686-5804)

minus, 495 aa, 5 exons

40% to 6M2 new subfam in CYP6? 53% to AmGroupUn.4533

MAAFEILCGFIIFIFAFYYYLIKPQEYWKNRGVPGPKPIPIFGNFFRLTFARISIGDLMTKFYKEYKHEPVFGLYMRNVRVLAINNPDLIKTVLIKDFSKFAHRGLA???(0)TEPLSQHLFVLEPKRWRPLRTKLSPIFTSGKLKDMFSLIIECSNTLENYVEHLISKNDRVEVRDLAAKFTTDVIGSCGFGVDMNAMSDVQCKFRDIGREFFGPSFKQILKIRLRENLPRLYTFLGYILPRDETTTFFTNVVLDMIKYRKTNDIYRPDFINALINIQNHPEKLDIE(1)LTEPLLVAQAFLFFVAGFETSSLTIATALYELAQNQDIQDKLRDEITEHHKLNNGEWQYENIKNMPYLDAVFKE(1)TLRKYVPLTVLMRQSLEDYTFESINLTIPKDTRIFIPIYAIHRDPDIYPNPEVFDINRFSKEAEATRHPMHYLPFGDGPRNCIG(1)ARFAIFQTKIGLIKILRTYKVDVCNETQIPFINEPRTFTLAPKHDLTLKITKIEN*

 

>AmGroupUn.248a incomplete on C-terminus (38475-40756) plus, 338 aa, 4 exons

49% to 6Aa14, 73% to AmGroupUn.4533

MYIGLEILCGIVITLIAFYCYLTINNNYWKNRGIPGPKPVPGFGNMKNVIFGKESVSQFLTRMYNEYKDEPMIGVFSKRTPVLIVKDVDLIKTILIKEFPKFANRGLFPIFS(0)

110aa gap here

ILSIRIQDMLPWLYNSFLYVLPRDEKTRIIMKLMTETMEYREENNVFRPDFINMLLNLKKHPEKIDIE(1)LTDDLLAAQIFIFFAAGFETSSSTISNALYELALNPDIQEKLRKEIKEFEARNNGEWRYEIMKEMEYLEKVFQE(1)TLRKYPSLPFLNRKLINDYTFESNNVTVSKDLKIWIPVYGIHHDPDIYPDPEKFDPERFSKEEIMKRHPMHFLPFGHGPRNCIG(1)

 

>AmGroupUn.248b incomplete on C-terminus and N-terminus of exon 2

(43061-45682) plus, 245 aa, 4 exons 78% to AmGroupUn.4533

MYINLEIFCAIVIAFIAFYYYLTINNNFWKHRGISGPKPVLGFGNMKKIILGEESMSQYLTKLYHKYKNESMIGIFRLRTPALIIKDPDLIKIVLIKDFSKFMNRGLLPIIS(0)

sequence gap here 165aa

NIILELKKHPEKINID(1)ITNELLAAQIFIFFAAGFETSSTLISNALYELALNPNIQDKLREEIKKFESQNDEEWKYETIKKMDYLEKVIQE(1)TLRKYPPVPFLNRELIDDYTFESNKVTIPKGLKIWIPTYAIHN

 

>AmGroupUn.248c incomplete on C-terminus (47350-47685) plus, 112 aa, 1 exon

94% to AmGroupUn.248b, only 6aa diffs at N-term

MYIGFEIIYGIVIVFIAFYYYLTINNNFWKHRGISGPKPVLGFGNMKKIILGEESMSQYLTKLYHKYKNESMIGIFRLRTPALIIKDPDLIKIVLIKDFSKFMNRGLLPIIS(0)

 

>AmGroupUn.9652 incomplete on C-terminus (993-1236) plus, 81 aa, 1 exon

72% to AmGroupUn.248a

MNISLEILCGIIVALIVFYYYLIINNNFWKNRKISGPKPVIGFGNMLSIILGKESTSQFLTRIYNEYKNEPMIGIFSKNNP

 

>AmGroupUn.8460 incomplete on both ends (123-685) plus, 139 aa, 2 exons

71% to AmGroupUn.4533

(1)TLRKYPVVPFLNRELISDYTFENSKITIPKGLKIWIPVYGIHHDPDIYPNPEKFDPERFSEDKIKERHSMHYLPFGHGPRNCIG(1)SRFGTYQTKIGLVKIIRKYKVEICDKTLIPYKFNSFANFLMPSTGLYLMITDVEN*

 

>AmGroupUn.8178 incomplete on both ends (801-1322) minus, 174 aa, 1 exon

64% to AmGroupUn.4533

(1)PLSQNLFGLEVERWRPLRIHFSPIFTTNKLKGLCSLILECSEQLEKYMDILIRKGEPLDIREIAARFTTDVIGSCAFGIEMNSLSENESEFRRLGKGVFNTTFRRIVKTRIRNLMPWLYNFFLRILPWDEITKKIVKLTTETIEYRNKNNIVRSDFINVLLNLKKHPEKIAEIG(1)

 

>AmGroupUn.8712 incomplete on N-terminus (13-1433) minus, 267 aa, 4 exons

60% to AmGroupUn.42c

DLLGPLVPEREVTPFFIKVVTDAMKYKKESNVFRPDFIDTLMKLRDDPESLSDIE(1)LTDAFLTAQAYVFFAAGFETGASTISNTLYELAQNQGMQDRLREEIREHCDKYGGELMYENIKEMEYLDKVFKE(1)TLRKYPPGTLIPRRSVSEYTFKNTNVTIPKGTMIWIPAFPIHRDPNIYPNPDDFNPENFTEDAINNRHPMNYLAFSNGPRNCIG(1)ARFANYQVKIGLIMILRNYKVEVCEKTVIPYQFDPNLFLLGPKGGIYLRVTKVE*

 

>AmGroupUn.2707 incomplete on N-terminus and probably missing exon 3

(1-3597) plus, 368 aa, 4 exons

59% to AmGroupUn.4533

LTRVYNEFKNEALIGVFMKTYPALVVKDPDLIKDIMIKDFYKFPNRGFPKSDS(0)ADPLTQHLFLVEEEKWRPLRTQLSPVFSTGKLRGTFTQILDCSNHLVTYMDKLVEIGEPIDVREVTAKFTTDVIGSCVFGIKMNSLSGKESEFRRFGRQIFAMNFLKILRLRIKQFLPMLHYLLVRILPPDEETKIMLKLTRDTFKFREAHNIVRPDFMNILMELKKHPEKVPSLG(1)???

70 aa gap here

(1)TLRKYPVLPYLSRRSIEDYTFEGTKVSIPKNTLICIPVYPIHHDSSIYPNPEKFDPERFSEDEVKKRHSMHYFPFGHGPRNCIG(1)LRFAIYQSKIALIKILSNYKIEICDKTLIPYKYDPFSFISLPLTGIFLKITKLQN*

 

>AmGroupUn.2162 incomplete on both ends (1-852) plus, 223 aa, 3 exons

39% to 6g1 aa175-381 probable new subfamily in CYP6

NSMNKYLDDEFSHDTKTKTIMIKDVTLKYTTNVISSVAFGIQVNSFNPKTIQFYEEG(1)LKTTFSRSMQLFISFFFPKLSPYLNTRMLGSSTNFFRKVFWNSMDNREITKTKREDLIDSLIELKNSKQDKDFK(1)FEGDALLSQSAIFFIAGRETSISIICLTLYELAKHPEIQKRTREEINEKLKEHGMTYEGVQSMKYLHQVVSEILRIYPPTPIIDRVAVADYK(0)

 

>AmGroupUn.2637 incomplete on C-terminus (7-2478) plus, 330 aa, 4 exons

45% to 6P4 new subfamily in CYP6? 46% to 6F1

MLYKPYLIVNDPNLIRDILTKEFTNFHDRGIFYNEEVDPLSGHLFQLPGKKWRNLRVKLTPTFTSGKIKQFFPILNEAGNILAKYLEEEARKGSTID(0)ICSRSIIFSRYSTDIIMSVAFGISCDSFKEPNNEFRYWGKKIFDPKPLWNALILFAPQILNFFSISYTEKSVTKFFTNMFKQTVKYRESNNIERKDFLNLLIQLMKNGYVDADDESLSNNVNAAK(1)NKLTMMEAAAQAYVFFLAGFETSSTTVTFCLYELAKNQDIQNKVREEIQTMIKKNGDLTYNALNDMNYLHKVISE(1)TLRKYPPVVILNRICTNDVKLSTTDFCIPKGTC???

 

>AmGroupUn.10510 incomplete on both ends (122-647) minus, 176 aa, 1 exon

54% to AmGroupUn.42a, 39% to 6a16m all best hits = CYP6As from aa107-274

probably a new CYP6 subfam or CYP6a

(0)VEPLSQHLFNLEPKRWRPLRSKLSPMFTSKKLKEMFGLILECGRHFEKYVDGLAARRQPVDFCEVAAKYTTDVIGSCAFGINMNAMSSEGSEFREAGRKIFEPTWNSIIRLKFKITMPTLYDLLGPLVPEREVTPFFIKVVTDAMKYRKERNVFRSDFIDTLMKLRDDPESLSDIG(1)

 

>AmGroupUn.6966 incomplete on C-terminus (878-1100) minus, 72 aa, 1 exon

40% to 28A5 N-term 38% to 6a20 57% to AmGroupUn.42a

MAYVEILCVGIIVSMLAFYYYLTSAFNFWKIRGIPGPKPKFLFGNIRDIILSRISTPAFIKNVCDTYTNEPM(0)

 

>AmGroupUn.4500 incomplete on N-terminus (298-1025) minus, 84 aa, 2 exons

52% to AmGroupUn.5496 53% to 6AM1

AIFNPERFTEENKRTRHPYAYLPFGEGPRNCIG(1)MRFALLQIKVGIISFLRNHRVETCQKTITPIKFSRRSLVTTSEKGFWLRIK*

 

>AmGroupUn.5730 incomplete on both ends (990-741) minus, 84 aa, 1 exon

54% to 6d4m 60% to AmGroupUn.8460 60% to 6AM1

(1)TLRKYPPVVILNRICTNDVKLSTTDFCIPKGTCIAIPVFGLHRDSNIFPNPEKFDPERFSEENIKTRHPYVYLPFGEGPRICIG(1)

 

 

New families in the 3 clan

 

>CYP335A1 AmGroup14.7a (400127-401674) minus, 515 aa, 1 exon

40% to 9f2m new subfamily in CYP9 63% to AmGroup14.7b 39% to 9D1, 38% to 9K1

50% to 9J8 partial

MEVSHLTTFELLLLTLIFIILAKLVSILYTQFTYWKRNKVPYIRSSPLFGTAWRVFFRLVSFPNYCKYIYNYYPDARYVGVMDFATPTVIVRDPKLIKEIAVKNFDNFPDHRSFVTEEMDPVFGKNVFSLKGDRWREMRNTLSPSFTANKMRFMFDLVSKCSHDFVSCLHDRLESSSSEIEGKNLFTRYSNDVIATVAFGISVNSIEHPDNEFYRRGIDVSTFSGTFRFIKFMLFRLNPRLTRMAGFTFLSRATSKFFWRVISETVTARKRRGIVRPDMIHLLMQATDSKKKSIHQTMTIDDIVAQAFIFFLAGFDTTSTLMCYVVHELALHQDVQRRLREEVDRVLDDGTEISYEDTLGMEYLEMVISETLRMHPPTLLIDRQCAKEFHLPPAGPGYESVTIHPGENIWFPVLAIHRDPAHFPDPDKFDPERFNRENRNGIDPYTYIPFGVGPRKCIGNRFALMETKLLIIRLLRKFVIKPCERTMDPIVYKKGNFTLMPKDGFWVTFEKRNDH*

 

>CYP335A2 AmGroup14.7b (395285-396839) minus, 519 aa, 1 exon

39% to 9f2m 63% to AmGroup14.7a 40% to 9K1 51% to 9E2 partial, 54% to 9J7 partial

MESPTLLFSFELLAIGLTAIVLAKFVSLLHHQYNYWRKRRVPHVGAVPVLGSSWRIFTRRMSLPNFCSLVYKHRPGSRYLGMMDCFTPVVVVRDPNLIKEIAVKNFDHFPDHHSFINEKIDPIFGKNVFSLKGDRWREMRNTLSPSFTASKMRFMFDLVSNCSEEFVRYLYDHPEFSSSIEAKDAFTRYTNDVIATVAFGISVNSMENRDNEFYTKGADATNFGGIFRLFKFMLFRVNPRLTRMAGLSFLSRGTATFFHRVVRETVRARDERRIVRPDMIHLLMQARDKEDRRPVATVDNRMTIDDITAQAFIFFLAGFDTSSTLMCYVAHELALNPPVQERLREEVDRFMDGGNGAITYEALLKMEYMDMVTSETLRKYPPIVFIDRLCVEKFELPPAEQGYDHLIVHPDNIVWFPVYGLHHDPKYFPEPEKFDPERFNDANKRNIVPYTYMPFGLGPRKCIGNRFALMETKILIAYMLRKFRIKRTEKTRASIEFSKTNFSLTPDHGFWIGLEKRDP*

 

 

>CYP335B1 AmGroup14.7c (305231-306757) minus, 510 aa, 1 exon

39% to 9E2 35% to 9f2m new family 57% to AmGroup14.7d 42% to 14.7a

MDYLQLGLTLLAILVAVYYLSTRNHKLLKRHGIVHIPPTPLFGNLGPLVRRKCHMEDVIQRVYDLDPDARYVGMYEFTTPLIIIRDPELIKTIGVKEITNFTNHRPFVDVGVDPMLGEVLFAMQGDRWREHRTMLTTLFTSSKIKSMFVLMSDCAKRFADYLSKVEREIELKSVLTRYTNDVIARCVYGVSVDSVNEPENIFYRYGQVASQLSTFKQNLMIFVHRNSPRLARLFNLKILPVHIEKFFHRLVMDTIETRRREGVHGLDMLQQLMDMQSRRKESEEGKRGMTVTDIANHAFSFFFGSVDTMATQISLISHMLAVNPDVQQRLQEEIDEVLSASEDKQVGYDVIQEMKYLDAVMSEAMRYHPILLFVDRVCGETFELPPALPGARPFKLERGMNIWFPVKAIHHDPKYFENPDRFDPDRFLRDGKGIASSGAYMPFGMGPRKCIGSRFALTEMKILLFNILAKCSFKVGSKTMVPLKFKEGVFNPVAKNGFWLKIERRENSCC*

 

>CYP335B2 AmGroup14.7d (252241-253839) plus, 532 aa, 1 exon

57% to AmGroup14.7c same subfamily

MEFLSLALVLAAISIIAYYYCFVRKNFNLFQEHGILHVPPSPLVGNFGPLIRGKENVHDTIQRIYNIHPDAKYVGIFEFLTPVIMIRDLDLIKSITMKNFDQFPDHRPMFCKSVDPMLGEMLFIMDGERWKEHRNMLSPTFTSSKIKTMFVHMSECAKRFAHHLSKLPEKDRETEMKALLTRYTNDVIAACIYGVNVDSIKEPRNVFYMYGRVGATLIGLKKNLKIMVHRNMPWLANLLRLNILERHIAKFFTDLVVETVEERERNGTTNSDLIQLMMDTRNKKESGKKNLTVQNMANHAFSFFFGGFDTVSSQTCVLLHMLVENPEVQQRLQQEIDETLESNNGQLSYDVIQEMRYLDAVINEILRLHPIAVFIDRMCVKSFELPPALPGDVPFTVKPGMNVWIPVKAIHHDPRYYDEPEKFKPERFLDNGKNIIGSGAYFPFGIGPRICIGNRFALIEMKVLVCHILAVCDIKAGARTGIPLEFEKGVFNATAKTGFWLKIEPRKYSYHSGQINGLVNNHVINGACKTGI*

 

>CYP335B3 AmGroup14.7e (266474-268027) plus, 518 aa, 1 exon

58% to AmGroup14.7d same subfamily ESTs BI508364 BI516506 BI505081 BI505012

MDYLTISLSLITVFVAVYYLATRNNDFFKKHGIPHVPPVPFLGNMGSLVRQKSNLHDVIDRTYNLDPGAKYVGIYEFTTPIIILRDLDLIKTITMKYLDHFPDHRSFAYEGADPVFGSMLFAMKGERWKEHRNMLTPTLTSSKIKGMFKLMTECAVRFADFLSVLPENERETEMKALLSRYANDVIASCVYGVSVDSINDPKNIFYVYGRRGTNVVGLKKSMFVLIHRNMPWLAKLFGLRFLEKHVQKFFYDLVYETIESREKLGTNRSDVLQLLMDIRDKANSSGKMTTMTVENVAIHAFTFFFGGFDSITSVTTLLTQMLAEHPDVQARLQQEIDETLRSNDGVLTYDAVHGMKYMDAVINETMRFCPVLPFLDRMCVESFQLPAPVPGGQPFTLRPGMNVWIPLAAIGRDPEYFEDPDKFDPDRFLNPEAGIKNSGAHFPFGLGQRKCIGERFAMMEMKVLLCYVLAACNVRIGSKTTVPMKLEKGLINANVKGGFWLKIEPRKVTYYNSSRSN*

 

>CYP335C1 AmGroup14.7f (258230-259825) minus, 531 aa, 1 exon

44% to AmGroup14.7d 49% to AmGroup14.7g

MLDSWSITAAIVAVLAIAYYQLIWKYKHFERIGIPCYHSIPLLGSFWEAVIQRNNFAEISRKIYNSYPDTKYMGMYDTTTPVLLIRDTELIKAISVKHFEQFPDHRSFQNEATDPLFAKNLFALRGDRWREIRNLLSPAFTSSKMKSMFILMRDCAKEYGDYFASLTGDESTIELKDAFTRYTNDVIATCAFGVEVNSMKDRKNKFYVYGREGTTFGSWASIKFFVTRVLPVSVCTLLRIRLIRKEISDFFIDLVSTTIKTREEKGIVRPDMIQLMMESKGKLGAGKEMSMIDICAQAFVFFFGGFESTSTLMCFAAYEIAVNEDIQRRLQNEIDQVLEERDGEVTYAAVNEMKFLDAIIYEALRMYPVVVATDRVCMKPFELPPNRPGEKPYLLKEGDNVWFPIYAIQRDPQYYPEPDKFDPDRFLNDTKQMINSGLFLTFGIGPRMCIGNRFAMLETKVLLFHLFARCNLVPCSKTTIPMKLNRKGFSMTAENGFWFKIEPRSAKKEEKIAVPGTTMLIDKIPDRYPDN*

 

>CYP335D1 AmGroup14.7g (261868-263392) minus, 508 aa, 1 exon

49% to AmGroup14.7f

MAIFALLLIVLGILGSYHLLKSQNPFKEHGLPYKSYLPILGSTWESILRRKSFAVVIQEIYNLAPSARYVGFYNRTTPIVMIRDPELIKTIAVKNFDAFRNHRTVNDTQTDDVLLSGNLLLLRDNRWREVRSLNTPAFSTSKIRSMYRSMSEIAINVARYLSTLAPGQNIVEMKDIFTRYANDVFATCAFGISVDSLSDRENKFYELGREALDIHSTPILKLILIFAFPKLARRLGVSLVSKEATNFFTRVVSENIKMREEKGITRPDFIQAMIDKRNGRGRDDELTVEDITAQAFVFFFGGFETTSGLLSFAVHELAANPEIQGKVHAEIDRVLVSNNEITFERVNGLVYLDAVINETLRMYPIIPITDRECSKRFELPPVLPDAKPYVLKEGSHVWFPIYAIQRDPRYFEKPDCFDPDRFLDDNKKRSDAFNGDAYMPFGAGPRNCIGNRFSMVETKVALFHILAKCRLDVCPKTTIPMELRKRGVFLTAKNGFWLRIVPRHPVT*

 

>CYP336A1 AmGroup2.15 (200938-202428) plus, 496 aa, 1 exon

34% to 6AH1 Anopheles new family? EST BI946448 from brain

MASAFLTLVTGALLLLCFYLYLKYTYWKRNGIP

YSKGYYPIIGHFLPLIMKKQSYSEIIEEIYRDSNHSMVGMYKGMKPVLILRDINLIKTVLQSNFSKFHENAVKIDPKLDPLLAKNPFFCYGELWQTGRKRLTYAFSNARLKILFAAVYEVCTKFRNFLDRRLESSKKYEVELKSLFLKFTSEVVANAGLGIEGFCFEDGKVQSIFTNLDNNDFLDTFLVGIIMHFPFLTKLLRIKFLPTKHDKFF

RTVVKKNLELRKSDPIPRNDFIQLMIEMEQTGEKIDEEIVAAHAVSFYLDGVETSSVTLNFIGCQLAIHQDVQEKLRKEVRSTLEKHGGVLTFEALKDMTYMNQVISESQRYFSALGFLGKICTDEFELQGSDGLNYRAKPGTELLIPICGLHKDPKYWDNPEIFDPERFSDENKQRIEKMAFIPFGEGPRICVGMRMAMLQMKSCLATLMKDYKLEVSPKMQLPLKLSPTYFLSAPLGGGWVLISKA*

 

CYP4 clan (5 seqs)

 

>AmGroupUn.2145 (673-4743) minus, 545 aa, 7 exons

CYP4G11 63% to 4g15m, 100% to AF207948 partial seq

MAAASATGFSASSVFLSLLIPALILYFIYFRISRRHLLELAEKIPGPPALPLIGNALDLFGT(1)

MFSQVLKKAENFKDVVKIWVGPKLVICLIDPRDVEIILSSNVYIDKSTEYRFFKPWLGDGLLISTG(1)

QKWRNHRKLIAPTFHLNVLKSFIDLFNANARSVVEKMRKENG

KEFDCHNYMSELTVDILLETAMGVSKPTRDHNAFEYAMAVMK(2)

MCDILHLRHTKIWLRPDWLFNLTKYGKNQIKLLEIIHGLTKKVIQLKKEEYKSGKRNIIDNSAQKTESK(0)

XXXXXXXXXXXXXXXXXXXXXXXXXXXX

TNNIVVEGVSFGQSVGLKDDLDIDDDVGEKKRQAFLDLLIEAGQNGVLLTDKEVKEQVDTIMFE(0)

GHDTTASGSSFFLAVMGCHPDIQEKVIQELDEIFGDSDRPATFQDTLEMKYLERCLLETLRMYPPVPLIAREIKTDLKLA(1)

SGDYTIPAGCTVVIGTFKLHRQPHIYPNPDVFDPDNFLPEKTANRHYYAFV PFSAGPRSCVGRKYAMLKLKIVLSTILRNFRVRSDVKESEFRLQADIILKRADGFKIRLEPRKQVASTA*

 

>CYP4AV1 AmGroupUn.2000 incomplete on N-terminus (1-4856) plus, 496 aa, 7 exons

41% to 4c3 probable new subfamily in CYP4

AADG02019009.1

MIKWGKELGDMYLIWVGMRPFIFLYKAEAIQPLLSSSVHIDKSLEYQYLQPWLGSGLVTSTG(1)

EKWHFHRKLLTPTFHSGLLELYLKTTIREAQILISCLRKEIGKPEFDIVPYAKRAALDIICD(1)

SSMGCNINAQKNFENEYVQAVNT(2)

LASISQRRFLNVWMSFDPIFKLTSWGKRHDHALSVTHGFVNK(0)

XXXXXXXXXXXXXXXXXXXXX

WKDRKDTNFNEKSHKRQALLDLLLELSKDGKVLTDDDIRDEVNTFMFAGHDTTATSVSWILYALGRHPQYQ(0)

ELIIEEYDETVGTKELTLDILSKLTWLEACIKESWRLYPVTPLIARQIYHPITIL(1)

GHEIPIGSTVLVNSFLLHRDSRYFPEPDIYRPERFLPDGPKYPSYAFVPFSAGSRNCIGWKYGTMIVKVLILYILKNFHVESLDTEDQLRFISELVLHNADGLRLKITPRK*

 

>AmGroupUn.8281 incomplete on both ends (644-1499) plus, 132 aa, 3 exons

44% to 4c3 aa97-209 45% to 4M5

(1)ELWKFLTQLSKQYYPIYRMWTFLEAYVHICHPDDIE(0)

TILGNIKFTKKGFGYKYLKPWFNTGLLTSSG(1)

HKWHVRRKILTSAFHFNVLRQFVDIFIEDAERLIKTLESEEGIFVENLLQLTSEHTLNVICG(1) C-helix

 

>AmGroupUn.2540 incomplete on N-terminus (5369-5938) minus, 71 aa, 2 exons

52% to 4ac1m. 53% to 4J9

PYAYVPFSAGPRNCIG(1)

QRFAMLELKTYLGLLLYNYYFEPIDYLKDVTFVSGIVLRLENPVRMKFIPVKKIC*

 

>AmGroupUn.3324 incomplete on N-terminus (901-1990) plus, 285 aa, 3 exons

50% to 4aa1 55% to 4AN3 partial

(2)GQIMLLYRMIRPWLLIEWIYRLTKYGREEEKQRKNLFDTCFKMVKEKRDLLQSKDRISNNDIKKNKN

ISLLEYMVEINEKNPCFSDEDIVEECCTFMLAGQDSVGTATAMTIFLLANHPEWQNKCIEEIDEIFNGDT

RFPTISDLKEMKCLEMCIKESLRLYPSVPIIGRTLGEDIKIG(1)

XXXXXXXXXXXXXXX

NTHHLPHHFPDPDTFKPERFNSENSEKRHPYAYIPFSAGPRNCIG(1)

YKFAMLEMKSIISAILRKCRLQSIPGKKEIRPKFRMTIRAQGGLWVKIIERDQILKSIAA*

 

mito clan (8 seqs)

 

>CYP314A1 AmGroup5.25 AADG02002903.1 incomplete on N-terminus (17670-19852) minus, 471 aa, 9

exons (19852-19708, 19574-19378, 19298-19111, 19031-18829,

18707-18579, 18471-18305, 18238-18160, 18072-17875, 17777-17670)

43% to 314A1, 55% to pea aphid (Acyrthosiphon pisum) P450 below

cyan is extra seq added based on CF588143

Note: this seq is less than 55% identical to the Drosophila and Anopheles 314A1s,

but there appears to be only one orthologous seq in each species, and it does not make sense to give the ortholog a different subfamily.
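
The note refers to a percent-identity cutoff for subfamily assignment. The sketch below shows how such a figure could be computed from two already-aligned sequences and mapped onto naming levels. The 55% subfamily cutoff comes from the note; the 40% family cutoff is the customary rule of thumb in P450 nomenclature and is an assumption here, as are the function names. As the note itself shows, these cutoffs are guidelines, and orthology can override them.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over aligned columns where neither side is a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b)
             if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)


def naming_level(identity, family_cutoff=40.0, subfamily_cutoff=55.0):
    """Map a percent identity onto the rule-of-thumb naming level."""
    if identity > subfamily_cutoff:
        return "same subfamily"
    if identity > family_cutoff:
        return "same family"
    return "different family"


# 43% is the figure quoted for this record against 314A1.
print(naming_level(43.0))
```

By the cutoffs alone, 43% would place the sequence in the same family but a different subfamily, which is the case the note argues against on orthology grounds.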

MLLSSAWFEVIAAVLLTILIFVTSHRPAWWFWTATSHEASGKSVKILLPRFT (1) possible N-term exon

LPGPFSLPIFGTRWIFSCIGYYKLNKIHDAYKD(1)

LNQRYGALCKEEALWNFPMISVFSRQDIETIIRRNSRYPLRPPQEVISHYRRTRRDRYTNLGLVNE(2)

QGQTWHDLRVALTSELTAASTVLGFFPALNIVADSFIELIRRQRVGYKVTGFEELAYKMGLES(1)

TCTLILGRHLGFLKPDSSSELATRLAEAVRIHFTASRDAFYGLPLWKLLPTCAYKQLIESEDAIYN (2) revised

IISEIIETTIQEKRDDAKDESVEAIFQSILRQKNLDIRDKKAAIVDFIAAGIHT(0)

LGNTLVFLFDLIGRNPTVQNKLYEETYALAPAGCDLTIDNLRKAKYLRACITESLR(2)

LIPTTTCIARILDEPIELSGYRLTAG(0)

TVVLLHTWIAGLNEENFKDAKKYLPERWTTPTTPHSPLLVAPFGAGRRICPGKRFVDLALQLILAK(0)

IIREFEIIVEEELDLQFEFILAPKGPVSLGFRDRS*

 

>CB336480 Tribolium castaneum embryonic cDNA

MFEKIFQSLDVTSLLIIAIFFLFLEYRPPWWYRNNDCKKGVKLIPGPLAL

PGLGTTWIFFFGGFSFNRLHLYYENMYKR

YGPVMKEEYWCNIPVINLFEKREIVKVLKAGGKYPLRPPVEAVAHYRRSRLIDTLALG

 

>CF588143 pea aphid (Acyrthosiphon pisum) 48% to 314A1ps

  2 RVLRQSGKYPIRPPNEVTANYRKSRPDRYTNTGLVNEQGEVWAMLRNKLTPELTSPRTIR 181

182 RFLPEVNQLADDFNNLISLARDGNNVVRGFEGYCNRMGLES 304

305 TCTLILGRRIGFLDGEVSETATRLADSVTSQFRASQEAFYGLPLWKLIPTKAYKDFVAS 481

482 EDALYDIVSEFVESALIDEQQSFTDVRSVFVSILQASELDNRDKKAAIIDYIAAGIKT 655

    LGNTLVFILYLV 691

 

>CYP334A1 AmGroup9.6 alternate splicing possible? (1158493-1161102) minus,

514 aa, 9 exons (1158603-1158493, 1158868-1158668, 1159034-1158959,

1159285-1159117, 1159526-1159346, 1159843-1159593, 1160068-1159927,

1160289-1160153, 1161102-1160830)

34% to 12A5m a new family in the mito clan

MTESQTASVDESLRTDTIPLLDHTATTEVSPTTFEVSTMKVDTQIFDKA

PLPFDEIPGPAILKIWEKYWKYVPLLGTQLLSSLLINRFTQG(0)

VPCCRPEHIAEVFKQEGDTPVRSGIDILQHYRLNYRKYRLAGPFSM(2)

QGTEWLEIRDKVEDTFNQISSTFFTKIDTCCNELITRICKIRNRQNE(0)

VPVSFYEDLIRWAMECFCDLTFNKRLGFLEPIGYNSSSEASKLINALTTAHKY

MSRCETGFQVWRFFLTPFARKLFEACDVLDK(2)

VIGKYVRQAQCKLRIRKSHSEESSMTERSPVLEKLLLNEGIHPDDICTMLMDMIILGIQA(0)

TVNSEAFLLYHLAKNPRTQRKVYDEIISVLSNDNSSFTEKSLKNMPYLKACIQETLR(2)

LHPAIPYITRLLPKTISLHGYTIPKG(0)

TFVIMANQITSQREENFEDPFKFWPERWLSNSSKEDVHFSYLPFGHGIRSCLGKNMAEAKMMLLTAK(0)

LVRQFRIEYDYADIKSRFMMVNVPNKPLRFRFVNRN*

 

>CYP315A1 AmGroupUn.1189 (10257-12510) minus, 535 aa, 6 exons

38% to 315a1m, probable mito clan, maybe 315Bx

Note: this seq is less than 55% identical to the Drosophila and Anopheles 315A1s,

but there appears to be only one orthologous seq in each species, and it does not make sense to give the only ortholog a different subfamily.

MNLAQNILKSGKSVSLSSNVIALKYNVPGCGYAGASQTSRIDDLSDISKSTDGGNRSKIEITEKLRDRNYGTVAVATSESILQEMPEPRGIPVFGTLFSFILSGGPKKQHEYVDKRHKELGPVYKERIGPTTAVFVNSIHEFRKIFRLEGSTPKHFLPEAWTLYNEIRKCRRGLLFM(2)NGEEWVYFRKILNKVMLLPDPTNLMIAPCQEVAIELKRKWQKQIKTNNIISNLQVQLYQWSIEA(1)MMATLMGSYWYSYKHQLSRDFEILAETLHEIFEYSAKLSIIPVKLAMNLRLPVWKKFVASADTAFEIVRMLVPEMAKLGGNGLLKKMMDEGIRAEDAICIVTDFILAAGDT(0)TATTLQWILLLLCNHPEKQEELFKHLKDLSQEDILRLPLLKGIIKESLRLYPIAPFISRYLPEDSVIGNYFVPKG(0)ELLVLSLYSSGRDAANFPQPNEFRPERWIRTQKGIYQGVVHPHASLPFALGARSCIGRKLAEIQISFALAE(0)LIKSFKIECINKNQVKLILHLISVPSQSIKLKLMERN*

 

>CYP302A1 N-term AmGroupUn.2216 incomplete on C-terminus (289-2832) minus, 202 aa, 3 exons

45% to CYP302A1 aa19-183 in CYP302 family, mito clan (disembodied)

MCTLLKKCNQSIRKKLFIKFYSNEFTKSKIKINHSQPKAFYDIPGPKSLPIIGTLYKYLPFIG(1)

EYSFTNLYESGKKKLKCFGPLVREEIIPNVNVIWIYRPEDIAEIFKAESGLHPERRSHLALLKYRKDRPNIYNTGGLLPT(2)

NGSEWWRLRKEFQKVSSKPQDVINYLKETDCVIQEFVELCNNEKFADFLPLLSRLFLEC(1)

 

>CYP302A1 C-term AmGroupUn.3145 incomplete on N-terminus (122-2230) plus, 250 aa, 4 exons

51% to 302A1 (disembodied)

Note: these two pieces are probably from a single gene.

(2)IALELVSRKKNNMKIRYNKSFLDAYLENPVLDIKDIVGMACDMLLAGIDT(0)

TSYSTAYILYHLAKNQNIQEKLRIEATQLLKNHNEPISINILRNASYTKAVIKESLRLNPISIGIGRILQTDVVLSGYRVPKG(0)

SVVVTQNQIICRLPEYFEEPNLFIPERWLREYSENNNKINYKKTVHPYVLLPFGHGPRSCIARRFAEQNMQILLLR(0)

ICRRLKISWHGDDLGMISLLINKPNALLKFNFHDILNNNSV*

 

>CYP49A1 ortholog AmGroupUn.423a incomplete on N-terminus and C-terminus of exon 6

(214-3539) plus, 312 aa, 7 exons

AADG02012456 Amel_1.1_Contig12456, AADG02012457 Amel_1.1_Contig12457

45% to 49A1, mito clan

(2)IKKIRNKNDEVPDDFLNEIHKWSLES(1)

IARVALDVRLGCLDDDANIETQQLIDAVTTFFKNVGILELKIPFWKLFNTPTWLKYVNALDTILS(2)

ITSRYTTVALSRTKEAEKSDKEPSLLERVLALENDTKLATILSLDLFLVGIDT(0)

TSSTVASTLYQLALHPDEQDRAYNEVCNILPSKDMQLDGKHLDKLKYLKACIKETLR(2)

MYPVVIGNGRCMTKDTIIKGYRVPKG(0)

VQVVFQHYVISNLDKYFPHSDKFLPERWLQSDGVRHSFASLPFGYGRRMCXXXXXXXXXXXXXXXX (0)

ILQRYKIEYHHEKLEYYINPMYTPKGSLNLKFIDR*

 

>CYP301A1 ortholog AmGroupUn.423b incomplete on N-terminus and missing exon

(3686-6941) minus, 532 aa, 8 exons

CYP301Ax 66% to 301a1m revised intron 1 boundary

MFCNETIRLKIEEVETDVKQVENGEAFCPTLSDREQASLP cgt RI AADG02008009 possible N-term??

INFEVLSLYLHIFISNSTRAHFPLYCRVRTGAITSTKSCMVD (not part of protein)

HDTTTIQQGKPYKDIPGPRPIPILGNTWRLFPMIGQYEISDIAKLSQIFYDEYGKIVRLTGLIGRPDLLFVYDVDEIEKIYRQEGPTPFRPSMPCLVHYKSVVRKDFFGSLPGVVGV (2) HGEPWREFRTRVQKPILQPQTVRKYTVRKYITPIEMVTSDFIQR(2)

IQEIKGEDGEVPGDFDNEIHKWALEC(1)

IGRVALDVRLGCLSSNLTSDSEPQKIIDAAKFALRNVAILELKAPYWRYVPTLLWSRYVRNMNYFIE(2)VCMKYIDATMERLKTKKAVDEYDLSLMERILAKETDPKIAYILALDLILVGIDT(0)

ISMAVCSILYQLATRPEEQEKIYQELVEILPDPSVPLNMSHLDKAIYMKAFIREVFRYY*???(0)

VQVVFPTVVTGNMEKYVTDAKIFKPMRWLKESTKTLHPFASLPYGHGARMCLGRRFADLEIQVLLAK(0)

LIRSYKLEYHHKPLKYKITFMYAPDGELKFKVLPR*

 

>AmGroupUn.10899 incomplete on both ends (1221-1394) minus, 58 aa, 1 exon

probable CYP301A seq. 61% to 301A1ps aa126-182 57% to 301A1 honeybee aa 115-171

EGLLGRPDMVFIYDANEIERIFRQEERMPYRPSMPSLNYYKHVLRKEFFKENAGVIAV(2)