Honeybee Cytochrome P450 Sequences

 

48 Honeybee P450 protein sequences plus 13 fragments, revised Sept 27, 2005-09-22 edited by Reed Johnson, Rene Feyereisen and David Nelson

 

CYP2 clan CYP15, CYP18, CYP303, CYP304, CYP305, CYP306,

CYP307, CYP342 (name changed to CYP369A1), CYP343 (note CYP304 is not found in honeybee)

 

>CYP15A1

MLYVVISLLLALYCIFCIYDCVKPHNFPP (1)

GPKWLPLIGCFLTFRRLKLKHKYTYVAFQELSKTYGPILGLKLGSQKLVVISTHDLVKKVL

LQDEFNGRPDGFFFRVRAFGKRK (1)

GILFTEGSMWSQCRRFTMRHLRSFGLGQSTMEKYLTVEAENLVNYLRRVSTKGPVPMHTAF

DIAVLNSLWCMFAGHRFDYENEKLAEILEIVHDSFR (2)

LMDTMGGIISQMPFLRFIIPELSGYNNLMEILRKLWNFLDEEINNHEKHLSGNQPQDLIEA

FLLEISSRNGVQNDSIFDR (1)

ENLLILCLDLFLAGSKTTTDTLSTSILFLSLHSEWIKILQEELDNVVGRSRSPTLEDYSSL

PIMESFLAE (0)

IQRFLILAPLGVPHKTTKDVILNGYNIPK (0)

DTTVLLDFHSAHNDPAYWDHPEEFRPQRFLDANGRFCQNNANIPFGL (1)

GKRRCPGEMLARTSLFLYFAYVIHYFDIEISPEHGKPDLNGHDGFTISPKSYYLKITARSD

VTNCSTI*

 

>CYP18A1

MLVEHAAQWAWQAMGGTRIEVLCTFLVFLGVLLVARCLQWLRYVRSLPPGPWGVPVFGYLP

FLKGDVHLRYGELAKKYGPMFSARLGTQLVVVLSDHRTIRDTFRREEFTGRPHTEFINILG

GY (1)

GIINTEGAMWKDQRKFLHDKLRGFGMTYMGGGKKIMESRIM (0)

REVKTFLRGLASKRGTPTDVSASLGMSISNVICSIIMGVRFQHGDARFKRFMDLIEEGFKL

FGSMAAVNFIPVMRYLPCLQKVRNKLAENRAEMAGFFQETVDQHRATFDEGTMRDLVDAYL

LEIEKAKGEGRATTLFQGKNH (1)

DRQMQQILGDLFSAGMETVKTTLEWAIILMLHHPDAAIAVQEELDQVVGKSRMPVLEDLPF

LPITEATILEVLRRSSVVPLGTTHATTR (2)

DVTLHGYTIPAGSQVVPLLHAVHMDPELWEKPEEFRPSRFLSAEGKVQKPEYFMPFGVGRR

MCLGDVLARMELFLFFSSLMHTFELRSPQGSSLPSLRGNAGVTVTPDPFDVCLLPRNLDLI

EDNDMISTGAILRNIGSH*

 

>CYP303A1 hybrid seq from parts not joined in genome assembly

MFSMVILCIVLLLLLLYLGLQKPKGYPPG (1)

XXXXXIIGSRLRILSFTLQNHYLFKTFSVLSKKYGPIIGMKIGINHVVVLNEYESIK

SMLSNEDCDGRPFGILYDKRTGGINR (1)

GLLLVDGNLWNEQRRFVLKHLRDFGFGRQSMMSIIIEE (1)

NLYMNANEYTGNNVTQSQLGTIISMHNIFGITVLNSLWKMLAGKR(2)

YNIDDKELIYFQRILSITLNEIDMLGAPFSHFPLLRFIAPEISGYKSFVKIHEELWKFFK (0)

DEVNNHKNTFNSDSPGNLIDIYLTILNSENYGKTFS (1)

EPQLVAICVDLFMAGSETTSKVLGFCFLYLVLFPHVQKKAHEEIDRVIGRNKLPTAEDKAK (2)

MTYMNAIVLESLRMFAGRSLNLPHRVQRDTKISDYKIPK(0)

NTIIITNFNGILMDESWGDPENFRPERFIDGSGNIVTPSRFLPFSA (1)

GKHRCMGENLAKTNIFIIATTLLQAFTFSEIPGEKPTIEHFIDGTTISPKPYRVNVSLRI*

 

>CYP305D1

MFVIMLIVILILTIIFIKIEKNTKLYPP (1)

GPFSWPFIGNQILLKRLSRKFGGQHKAFMELSKRYNSDIITVNISYEKIIVVSGSKFCDMI

LQNEEFQGRPWNEFIKVRNMGKKQ (1)

GITMNDGTEWKELRNWMMRTMKIFGFGKSEMIEMIQHQLVIFSENLNKNKLHQLKLLFVPA

VINVLWNFITGELVAFNQQQK (2)

LEHFLDLLDRRSRCFDITGGLLAAFPWIRYIAPEISGYNIMCMLNKELKDFLM (0)

KTINDHKEKYIEGKEADLIDMFIQEMRKNEKSSIFT (1)

DEQLMMILIDLFLAGFTTTSTTLDFLFLIVTLFPDVQRKVQKEIDSVIPYDRLPNMEDKAK (2)

LPYVEAVISETYRLWPVFPIIGPRRVLCDTNIDKYVIPKDTTILFNTYSINKDPTLYPDPD

KFMPERFIKNGVFEPDEYSLQFGK (1)

GKRRCPGDILAKATIFILFVGIMQKYTLLPVPGKGPHSIKINSGITLTPQPYNVLVEKR*

 

>CYP306A1

MILDHYIAIFVLPFLLLLYVVRKNRKARRLPPGPWQLPLLGYLPWIDAEKPHETLTRLSRV

YGPVCGFRMGSVYTVLLSDPQLIRQSFAKDSITNRAPLYLTHGIMKGY (1)

GIICAEGEQWKDQRKFISNCLRNFGMVKHEGAKRDKMEERISDAVNECVS (0)

VLRDRGANGPIDPLDTLHHCLGNLVNSIVFGKTYEEEDRIWKWLRHLQEEGVKQIGVAGPL

NFLPFLR (2)

FLPQYGRVIRSIVDGKDKTHEIYRQILDEHRARVDSGNGCKIDSFLAAFDEQMRKKDGAES

GYFTEPQLYHLLADLFGAGTDTTLTTLRWFLLFMAAHPMEQ (0)

EKIQSEMDLCLREGEQPTLNDRIVMPRLEAAIAEVQRIRSVTPLGIPHGTSE (0)

DVEIGGYDIPCGAMIVPMQWAIHTDPAYWRDPLEFRPDRFLSEDGTFFKPESFLPFQN (1)

GKRVCVGEELARMILFLFAGRILRAFSVRVPAGEIADLEGECGITLVPKPHRLAFVGRDR*

 

>CYP307B1

MIPLTATTCFLIAITFLALALILLDHLRSKKTTKSVVPGDDDQHALPEPPGPKPWPILGSL

HILGRYDVPYKAFADLVRDFDCQVIKLRMGSVPCVVVNGLENIKEVLTVKGHHFDSRPNFA

RYHLLFGGNKEN (1)

SLAFCNWSDVQKARREMLRAHTFPRAFSTRFNELNGIIGDEMEFMVNHLDSLSGTSVHAKP

LILHCCANIFITYLCSKNFHLEHDGFRNMVENFDKVFFEVNQGYAADFLPFLMPLHHRNMA

RMAHWSHEIRRFVIKNIIADRVNSWNDVVPEKDYVDCLINHVKSGTEPQMSWNTALFVMED

IIGGHTAIGNLLVKVLGFLATRPEIQRLAQDEIDALGLAGNFVGLENRRSLPYVEAIILET

IRIIASPIVPHVANQDSSIA (1)

GFRIKKDTFIFLNNYDLNMSTDLWTSPEEFMPDRFVQNGRLLKPEHFLPFGGGRRSCMGYK

LVQYVSFAILASILKNFTITPVQKEDYTIPIGNLALPEMTYKFRFERR*

 

>CYP342A1X name changes to CYP369A1 due to a naming conflict with a Nereis sequence.

MISFLFIIFLLLIIYKIYNSVIHVSSNTPPC(1) LPRLPIIGSYWHLLWHDYEYPYNGIIHYVNKLQSKIVTCYFGSHKTIIANDYKSIKEVLTK

QEFNGRPINVDIVLQRAFGKSL (1)

GIFFTEGTLWHEQRRFALRHMRDFGFGRRHEIFETNVMEEIAILVDMLKEGPINDEEK (0)

KFLKNGYACFPDILYPYVANVILNIMFGERFDRSQYHKLIYFCESSMMFQKSLDTSGGAIF

QFWFLKYFGNIFGYTNAIKATYQMINFIE (0)

EYIDNKKDLDDYDKGLIGRYLKILKEKNNITSTFSQKQLIMTLVDFMFPATSALPSALVHA

IKLVMHHPRVVNNIQEEIDRVVGTGRLVTWSDRKN (2)

LPYIEATIRESLRYETLTPLSVFHKTLKKTTLCDYDIPKDTLVVTNLVALNTDPDLWGDPE

NFRPERFLDENNELRKDFTFPFGF (1)

GHRVCPGETYSRYNMFEVFAVLMQNFNFSFVEGEPTGLDDKESGLIVTPKKTWIQVKARNMK*

 

>CYP343A1

MWFVILCFVIVLIKILFDYSRPINFPP (1)

GPRGLPFIGNILDIIKLINETKYYSDTWCRLAEKYGSVVGLRLGLDQPLIIVSGKSAVTEM

LNRSEFDGRPSGFLYKYRCGGMQQGILFTDTDVWHSQRR (2)

FALKTLKQFGFGKNSMEHILQHDAIALTNIIIELTKDGTVKNIRSIISAAVLSNLWLLIDGTK (2)

FDIGMENSNLKEAINIVQDIVKSSNVSGGIINQFPFLRHLFPNLTGFSAFVERQKRINNFFM (0)

EVIAKHKWKKINEEGTNFIDVYLQEIQKKNSSHSFFN (1)

ENQLLYIIKDLFSAGVDTTNSTIGFIIAFLVVHQDVQSKVYDEISRVIDKDIYPSLSDKDR (2)

LPYLKAVIAEVSRLANIGPTSIPHRAVKDSTFLGFEIKKNYTLLANFKSIHMDKEHWGDPE

IFRPERFINEKGDFINDSWLMPFGL (1)

GRRKCLGETLAKNTVFLFVACMLQRLHFMLPSNHPPPCLQGIDGFVIAPPMMDIIAVQRF*

 

CYP4 clan CYP4G11, CYP4AV1, CYP4AB3, CYP4AA1

Note: CYP4 clan extremely reduced compared to flies and mosquitos

 

>CYP4G11

MAAASATGFSASSVFLSLLIPALILYFIYFRISRRHLLELAEKIPGPPALPLIGNALDLFGSPD (1)

AMFSQVLKKAENFKDVVKIWVGPKLVICLIDPRDVEIILSSNVYIDKSTEYRFFKPWLGDGLLIST (1)

GQKWRNHRKLIAPTFHLNVLKSFIDLFNANARSVVEKMRKENGKEFDCHNYMSELTVDILL

ETAMGVSKPTRDHNAFEYAMAVMK (2)

MCDILHLRHTKIWLRPDWLFNLTKYGKNQIKLLEIIHGLTKKVIQLKKEEYKSGKRNIIDN

SAQKTESK (0)

TNNIVVEGVSFGQSVGLKDDLDIDDDVGEKKRQAFLDLLIEAGQNGVLLTDKEVKEQVDTIMFE (0)

GHDTTASGSSFFLAVMGCHPDIQEKVIQELDEIFGDSDRPATFQDTLEMKYLERCLLETLR

MYPPVPLIAREIKTDLKL (1)

ASGDYTIPAGCTVVIGTFKLHRQPHIYPNPDVFDPDNFLPEKTANRHYYAFVPFSAGPRSC

VGRKYAMLKLKIVLSTILRNFRVRSDVKESEFRLQADIILKRADGFKIRLEPRKQVASTA*

 

>CYP4AV1

MSSSTIIMGVSWLTMILSICLMTIVVLLLVRRGKFLYALRKVPCPPAFPIIGNAYELCCSPE (1)

EAFKKMIKWGKELGDMYLIWVGMRPFIFLYKAEAIQPLLSSSVHIDKSLEYQYLQPWLGSGLVTST (1)

GEKWHFHRKLLTPTFHSGLLELYLKTTIREAQILISCLRKEIGKPEFDIVPYAKRAALDIIC (1)

DSSMGCNINAQKNFENEYVQAVNT (2)

LASISQRRFLNVWMSFDPIFKLTSWGKRHDHALSVTHGFVNK (0)

IIAERKAEWKDRKDTNFNEKSHKRQALLDLLLELSKDGKVLTDDDIRDEVNTFMFAGHDTT

ATSVSWILYALGRHPQYQ (0)

ELIIEEYDETVGTKELTLDILSKLTWLEACIKESWRLYPVTPLIARQIYHPITIL (1)

GHEIPIGSTVLVNSFLLHRDSRYFPEPDIYRPERFLPDGPKYPSYAFVPFSAGSRNCIGWK

YGTMIVKVLILYILKNFHVESLDTEDQLRFISELVLHNADGLRLKITPRK*

 

>CYP4AB3 revised to add 27 amino acids at N-term

name changed from 4AZ1 since wasp has a large clade

of 23 CYP4AB genes and 4AZ1 belongs with this group

MISAILFFIFLLATLHYFLLHHRKFGKMINLIPGPEPLPILGNIPTFHNISP (1)

SELWKFLTQLSKQYYPIYRMWTFLEAYVHICHPDDIE (0)

TILGNIKFTKKGFGYKYLKPWFNTGLLTSS (1)

GHKWHVRRKILTSAFHFNVLRQFVDIFIEDAERLIKTLESEEGIFVENLLQLTSEHTLNVIC (1)

ETAMGTSLKNKEKFQYEYRKAVYNMGCIFANR(2)

IVKPWFYYDFFFNLSPEGWQQSKLLKILHNFTRK(0)

IIQERKEYHDKTNGRYLNDFHENINENDNNNDYND(1)

FRRKRLAMLDLLIEAHRNNKIDDEGIREEVDTFMFR(0)

GHDTTAISFCFSIMLLAEHKEIQ(0)

DRARAEIKAAIEENGGKLNITVLQNLPYLERCIKESLRLFPSVPRISRKLETSVKL (1)

SNYEIPSNTIINVNIFDTHRDPKFWPNPNKFDPDRFLPENSKKRHPYAYVPFSAGPRNCI (1)

GQRFAMLELKTYLGLLLYNYYFEPIDYLKDVTFVSGIVLRLENPVRMKFIPVKKIC*

 

>CYP4AA1 name changed from CYP4BA1 (ortholog)

hybrid seq from parts not joined in genome assembly

MGKVSSSLTSIYILYILRTYIRSIIYISQLNGPKTIPIIGNANSILKEN (1)

FLYRLAYESQMYGRIVRIWLTIFPYVMFLEPEDIQLILNNAKHTQKVFFYKLLDNFLGKGL

ITRDIKSWRIHRRLLQPAFHLHVLEKFTNTFAKHADHLMNKILEKNNQEINITTFINDSVYNILS (1)

ETVLGINRINRINEMDDLPFRK (2)

GQIMLLYRMIRPWLLIEWIYRLTKYGREEEKQRKNLFDTCFKMVKEKRDLLQSKDRISNNDIKKNKNISLLEYMVEINEKNPCFSDEDIVEECCTFMLAGQDSVGTATAMTIFLLANHPEWQNKCIEEIDEIFNGDTRFPTISDLKEMKCLEMCIKESLRLYPSVPIIGRTLGEDIKI (1)

GKHIIPAGCSVLISPYSTHHLPHHFPDPDTFKPERFNSENSEKRHPYAYIPFSAGPRNCI (1)

GYKFAMLEMKSIISAILRKCRLQSIPGKKEIRPKFRMTIRAQGGLWVKIIERDQILKSIAA*

 

CYP3 clan CYP6, CYP9 CYP336

 

>CYP6AQ1

MNLLTPYWSLDILIVSSSLMIAVYLYASWKLKYWSRRGIMQITPSPLFGNFKKCILFQKSV

SEIIRELYGQNEGLPFMGFYIFYKPFFLVRDIELVKHILVKDFNTFANKHTSADSKNDRIG

YSNLFIIKNPAWKYLRGKLTSVFTSGKLKKMFDLMLIIGKNLEKHLELLNLD (1)

GNGKEVELKDLCANFTTDLIGTTAFGVNLNSLKDPNSDFRENGRLVFDYNLKRAFEFFSIF

FFPNLSKYVSIKFFGKATDYFRNSFWSVINQRIESNVKRNDLIDCLIELREKHKNDESFEGF (1)

RFDGDDLVSQAAIFFTGGFETSSTTISFTLYELALNKDIQKTVRTEIHEALAQTDGKITYDM(0)

ITNLPYLDMVVSETLRKYPPLGFLDRVALHDYKIPNSDVTIDKDTPVIIPMIAFHYDPKYF

PNPEKYDPLRFSEEVKKTRPSYVYMPFGEGPHICI (1)

GMRLGLLQSKLGIIEILKDYEVSPCEKTKIPMVLDPKGLTTTALGGLYLNIRKITIAAG*

 

>CYP6AR1

MIETVGLIATVFFLLYYYSMSKLDYWRKRGVKGPKPLPFLGNFKDVLLAKESTMDCFERAY

KEFKDEPMVGMYGSHEPLLILRDLDLIKDVLIKDFNKFAQRTQGAIRE (0)

VEPLSEQLFRLDAERWRPLRLKLSSFFSSGKLKEMFHLFVECSDNFEKYLEKMVEKGGLVE

CRDAAAKFSTDVIGACAFSIHTNALTDENSQFRKMGKQALATNLQQFLNDRLREYPFLFKI

FGRFFVDHEVTNFFANSIKDAMDYRIQNNVHLRDVIDILADIRENPTKCGLK (1)

EADNLFLTSQAVLFFLAGFENASLTISNALYELAWKPEIQEKARAEIVNVLQKYDGKITYD

GLEEMKYLEACIF (1)

ETLRMYPVLQWLSREAMETYTFTGTKVTIPKGQQVFLPIYAIQRDPDIYPNPDNFDPERFT

DDKIKTRHSMTHLPFGDGPRHCS (1)

GIRLAKKQLKVGLVTVLSKFKVEVCEKTRKIYQKDKKPLFLLQPVDGIHLKISKVSV*

 

>CYP6AS1

MDYFQILCAISIVILTIYYYYSSKYTFWKKRGISGPKPIIFFGNFVDSIIQKRSTSEAVKK

WYDDYKHESVFGIFGGTTPLLVINDLDMIKDVLIRDFSLFVDRGFHIFPK (0)

IEPLSEHLFLLEAERWRPMRMKLSPIFTSGKLKEMFFLIMESAGNLEKYLDEVIKKDEMVE

CRELAAKFMTDVIGSCAFGINTNSLLEEDSEFRRMGKKISTPNLKVMLGNICKEFFPPLYE

IVGSIFTLKDVNEFFINLVSDTMKYRKDNNIIRSDFINMLMQLKEHPEKMENI (1)

ELTNTLLTAQAVVFFVAGFETSSSTMAFSLYELAQNQEIQDKLREEIRNMHEKNKGVLTYT

DVKEMKYLDKVFK (1)

ETLRKYPILPMLFRQAMENYTFKDTKITIPKGMKLWVPVHGIHHDPNIYPEPEVFDPERFE

DDAFASRHPMSYLPFGDGPRNCI (1)

GARFAHYQSKVGLITILRHHKVNVCEKTTIPFKADERSFLLTLKGGVHLKITKI*

 

>CYP6AS2

MDYFQILCAISIVILTIYYYYSLKYAFWKDRGISGPKPIIFFGNFGNSIIKKKSLSETVKK

WYDDYKHESVFGIFEGTIPVLVINDLDMIKDILIRDFSLFVDRGFHIFPK (0)

IEPLTQHLFLLEAERWRPMRMKLSPIFTSGKLKEMFSLIVESAGNLEKYLDEVIKKNEMVE

CRDLAAKFTTDVIGSCAFGINTNSLLEEDSEFRRMGKKIFSPSLKLMIGNTCKVFFPSLYE

VIGNIFTMKDVDEFFINLVSDTMKYRKDNDIVRSDFINMLMQLKEHPEKMDNI (1)

ELTDTLLTAQAVVFFIAGFETSSSTIAFGLYELAQNQEIQDKLREEIRKMHEKNKGILTYT

DIKEMKYLDKVFK (1)

ETLRKYPILSTLSRKAMENYTFKGTKITIPKGTKVWVPVYGIQHDPNIYPKPEVFDPERFE

DDAFASRHPMSYLPFGDGPRNCI (1)

GARFAHYQSKVGLITILRNHKVNVCEKTTIPFKADERSFLLALKGGVHLKITKI*

 

>CYP6AS3

MDYFQLLCVIGALLFAIYYYLTLTFDTWKNRGIPGPKPTIFFGNFQEVILKKISLAEKTKQ

LYQEYKNELVFGIFQGRTPILVINDLEMIKDVLIRDFSVFPDRGIHVNPK (0)

VEPIFQTLFSLKSKTWRPLRMKLSPVFTSGKLKDMFPLILDCAKNLEEFVEKVRNSGEPVD

CRDMAAKFTTDVIGSCAFGVCMNSLSPEGSEFRRMGEQLGKFSFKKLARDFTRLYMPFLFD

IIGGYLQSHEVNNFFINLIRDSIKYRQENNVYRPDFVNTLKELKEHPEKLENI (1)

ELTDALLTSQALVFFLAGFETSSTTISNALYELAQNPEMQDKLRKEIKEVYENNGGALSYT

DVKEMKYLDKVFK (1)

ETLRKYPVLAALSRQATENYTFKDTKIKISKGTRIWIPVYGIQHDPNIYPEPEVFDPERFE

DDAFASRHPMTYLPFGDGPRNCI (1)

GARFAHYQSKVGLITILRNNKVEVCAKTLIPYKSEPRNILMIPKGGKVELRITKV*

 

>CYP6AS4

MLHHFHILTAFVAIFLALYYYLTSKFDFWKNRGVSGPRPVPFFGNAKDVLLRKIGIGSFIA

ELYKRYDNEAMFGIFIGRSPNLVLRDLDLIKDVLIKDFSIFDNRGLNIPER (0)

AEPFSVNLFSVDATRWRPLRMRLSPVFTSGKLKEMFPLILECAEHLEQCLEDAVKRGGPVD

CFEIPARYTTDVIGSCAFGINMNALSDERSEFRKMGRNMFDQNMIKFTRNLLRDFFPRFYN

LLGFVLPYTESTVFMTKLIKGTIKYREENDVVRPDFVNLLMELKKHPEKLKNI (1)

EITDTLLAAQASVFFAAGFETSSTTMAHALYEMALNPDIQDKLRNEMKEFHAKNNGNLKYE

DIKEMKYLDKVFR (1)

ETLRKYPPGMLLRRKCNSNYTFHGTKVSIPAGTSVIIPLYAIQIDPKFYENPDVFDPERFN

EDAVAARHPMTYLPFGDGPRNCV (1)

GARFAVYQTKVGLIKILQNFRVDVCEKTMIPYVKKINSITLAPRDGIFLKIEKITD*

 

>CYP6AS5

MASSFEILCGIAVLFLALYYYLTSTFDFWKSRGVVGPKPVPFFGTTKDLILVKKSTAHFVK

DIYEKYKNEPMVGLYATRSPFLLLNDPELIKDILIRDFSKFANRGLGVFER (0)

TEPLSPHLLNLEVERWRPLRSRLSPIFTSGKLKEMFYLIIECSLNLEMYLDKLIEKNEPIE

CRELTARFTTDVIGSCAFGIDMSSMTNENSEFRRMGREVFAVNFMNVMRMKLKQFMPRLYD

LLGYVMPDRTFAPFFTRVVTDTIKYRNDNNIVRPDFINMLMELQKNPQKLENI (1)

KLTDSLIAAQAFVFFLAGFETSSTTMSNALYELALNQDVQKKLREEINTFCPKNNKELKYD

DIKEMEYLDKVFK (1)

ETLRMYPPASILMRKAISDYTFNDTKITIPKEMKIWIPAFAIHRDSAIYPNPDSFDPERFD

KDAMASRHPMHYLPFGDGPRNCI (1)

GARFAVYQTKVGLITILRNHKVEVCEKTVIPYEFDPGAFLLSPKDGIYLKITKI*

 

>CYP6AS7

MPSLEIIFGIILLFPAIYYYFTSTFDFWKVRGVPGPKPIPIFGNIKNVMLLKTSMCHYLKK

LCEEYKHEPMIGIFTRKTPILIIQDPDLIKDVLIRDFSKFANRGIPIHEK (0)

AEPLSPHLFNLEVERWRPLRTRLSPVFTSGKLKEMFPLILDCAKHLEQYLDKLVLREEFIE

CRELTAKYTTDVIGSCAFGIEMNALSDEESEFRRIGRKVFSNSFGQILRFRFRQIFPRIYN

LLGFVLPPMEVTKFLTNIIVSTMKYRQENNIVRPDFVNMLIELKKHPDKLENI (1)

KLTDTLLTAQAFVFFIAGFETSSSAISNALYELALNPEVQNKLRQEIKEYFNKHNELKYEY

IKNMIYLDLVFR (1)

ETLRKYPPGPLILRKSITNYTFNNTKVSIPEESFVWIPLYAIHHDPKIYPNPDAFIPERFN

DDAIATRHPMHYLPFGDGPRNCI (1)

GARFAVYQSKIGLITILWNYKVEVCDKTMIPYEINPAAFLLTPKGGIYLKFTKIKNNEEILN*

 

>CYP6AS8

MYISLEIFCGIVVALIALYYYLTVNNNFWKNRGIAGPEPVLGFGNMKKVLLGKESMSQFLT

KIYHEYKNEPIIGIFTTRTPQLIIKDPDLIKTILIKDFSKIMNRGLLPMVS (0)

GEPISQHLFNIEAERWRPLRIHLTPVFTANKLRGMFSLILECSMHFVSYVDSLVKKGEPVN

VREVAARFTTDVVGSCGFGVEMNSLSEKESEFRRVGKSVFATNYARIIKHRIREFMPRLYN

YILYLWPTDEMAEKIIKLTRETLEYREKNNLFRPDFMNILLDLKKHPEKIGL (1)

DVTNEFLAAQAFIFFVAGFETSSSTISNALYELALNPDVQDKLRKEIKEFAAKNDGEWRYE

TIKEMEYLGKVFQ (1)

ETLRKYPSLPFLTRELIEDYTFESNKVTIPKGLKIWIPTYAIHNDPDIYPDPDKFDPERFS

DDKIKQRHPMHFLPFGHGPRNCI (1)

GARFAIYQTKIGLINILRNFKLDVCDKTLIPYKHHPRGLLLMPLTDLYLKITRLTN*

 

>CYP6AS9P PSEUDOGENE? GC boundary at CIG, IFGG = IFQE in v1.2

MFINLETLCGFVIVLIAFYYYLTINNNFWKNRGIPGPKPTIGFGNMWTVMFGKESFSQLLT

TIYNKYKDEPMIGIFFRRRPVLLLKDFDLIKDVLIKDFSKFANRGFLKTNP (0)

KVPLTNHLFALEVKRWRPLRNHLXSPVFTSGKLKGTFAQILNCSNDLVTHIDTLSKMEDSI

NMREVAAKFTTDVIGSCVFGIKMNSLSEIENEFRRIGRNIFATNFTNILKIRLLQSTPFLH

SLLCKILPDEEMEMFMRITKETMEYREKNNLVRPDFMNILLELKKHPERIADI (1)

ELTDDLLAAQAYIFFAAGFETSSTTISNALYELALNHDMQDKLREEIKEFEAKNDGEWRYE

TIKEMQYLGKIFG (9)

GTLRKYPALSFLSRESIIDDYTFRDTKLTISKGTLVWIPVFPIHHDPNIYPDPEKFDPERF

TEDKIKERNLMHYFPFGHGPRNCI (1)

GARFAIYQTKIGLIKILRNYKVDVCSKTLIPYKYDPFSFILVPLGGLYLKITRI*

 

>CYP6AS10

MAAFEILCGFIIFIFAFYYYLIKPQEYWKNRGVPGPKPIPIFGNFFRLTFARISIGDLMTK

FYKEYKHEPVFGLYMRNVRVLAINNPDLIKTVLIKDFSKFAHRGLALNEV (0)

TEPLSQHLFVLEPKRWRPLRTKLSPIFTSGKLKDMFSLIIECSNTLENYVEHLISKNDRVE

VRDLAAKFTTDVIGSCGFGVDMNAMSDVQCKFRDIGREFFGPSFKQILKIRLRENLPRLYT

FLGYILPRDETTTFFTNVVLDMIKYRKTNDIYRPDFINALINIQNHPEKLDI (1)

ELTEPLLVAQAFLFFVAGFETSSLTIATALYELAQNQDIQDKLRDEITEHHKLNNGEWQYE

NIKNMPYLDAVFK (1)

ETLRKYVPLTVLMRQSLEDYTFESINLTIPKDTRIFIPIYAIHRDPDIYPNPEVFDINRFS

KEAEATRHPMHYLPFGDGPRNCI (1)

GARFAIFQTKIGLIKILRTYKVDVCNETQIPFINEPRTFTLAPKHDLTLKITKIEN*

 

>CYP6AS11

MYINLEIFCAIVIAFIAFYYYLTINNNFWKHRGISGPKPVLGFGNMKKIILGEESMSQYLT

KLYHKYKNESMIGIFRLRTPALIIKDPDLIKIVLIKDFSKFMNRGLLPIIS (0)

TEPISHHLFALEAERWHPLRKHLTSGFTSNKLKGMFCMIHECSKHLVNYLDILVRKEEPVN

VREVAARFTTDVVGSCGFGVEMNSLSEQESEFRRLGKSIFNTNVQKIIKDRIRELTPQVYN

FLLYILPLDGISPKILKLMKETIKYRKKYDIFRPDFMNIILELKKHPEKINI (1)

DITNELLAAQIFIFFAAGFETSSTLISNALYELALNPNIQDKLREEIKKFESQNDEEWKYE

TIKKMDYLEKVIQ (1)

ETLRKYPPVPFLNRELIDDYTFESNKVTIPKGLKIWIPTYAIHNDPDIYPDPDKFDPERFS

EDNIKQRHPMHFLPFGHGPRNCI (1)

GIRFAEYQTKIGLINILRNFKLDVCDKTLIPYKLHPRGLILIPLTDLYLKITRLTN*

 

>CYP6AS12

MAYIEILCGIIVSMLVFYYYFTSVFNFWKIRGIPGPKPKFLFGNIRDIILSRISTPTFIKN

IYDTYTNEPMVGLYMGRNPILLLKDPELIKDVLIRDFSKFADRGFNVHEK (0)

VEPLSQHLFNLEPKRWRPLRSKLSPMFTSKKLKEMFGLILECGHHFEKYVDGLAARRQPVD

FCEVAAKYTTDVIGSCAFGINMNAMSSEGSEFREAGRKIFEPTWNSIIRLKFKITMPTLYD

LLGPLVPEREVTPFFIKVVTDAMKYRKERNVFRPDFIDTLMKLRDDPESLSDI (1)

ELTDAFLTAQAYVFFAAGFETGASTISNTLYELAQNQGMQDRLREEIREHCDKYGGELMYE

NIKEMEYLDKVFK (1)

ETLRKYPPGTLIPRRSVSEYTFKNTNVTIPKGTMIWIPAFPIHRDPNIYPNPDDFNPENFT

EDAINNRHPMNYLAFSNGPRNCI (1)

GARFANYQVKIGLIMILRNYKVEVCEKTVIPYQFDPNLFLLGPKGGIYLRVTKVE*

 

>CYP6AS13

MDFFQIFCAICIMLLAIYYYYTSIYNYWKVRGIPGPEPTIIIGNFMEVFLKKISINDKLRF

LYNKYKNEPMFGIFEGSSPILVLNDLDLIKDVLIKDFSIFSNRGFRIFPK (0)

AEPLGEHLFALETERWRPMRAKLSPIFTSGKLKEMFPLIIECSKNMEPYLDKIAERGKYIE

CRDLAAKFTTDVIGSCAFGIDMNSISDKDSEFRIIGRKLFTPTFKTIVRDVCRQFLPGLYD

VIGHKLQIEEVNEFLTNLIKDTINYRKENKIVRPDFVNTLIELKDHPEKLETI (1)

KLTDSMIASQAFVFFVAGFETSSSTISHALYELAQNQEIQDKLREEIREVYEKHGELTYDV

IKNMKYLDKVLK (1)

ETLRKYPIMAMLTREAQENYTFKGTKVTIEKGIKVWILPYGIQNDPDIFPNPDIFDPERFD

EEAVAARHPMSYLPFGDGPRNCI (1)

GARFAQFQSKIGIITIVRNHKIDVCEQTKIPYESDPFQFLLALKGGINLKISKI*

 

>CYP6AS14

MYIGLEILCGIVITLIAFYCYLTINNNYWKNRGIPGPKPVPGFGNMKNVIFGKESVSQFLT

RMYNEYKDEPMIGVFSKRTPVLIVKDVDLIKTILIKEFPKFANRGLFPIFS (0)

RDPLTHHLFNLEVERWKPLRTQFTPLFTSSKLKEMFSLILECSNHLESYMDTLIKKGEPID

MREVSARYTTDVVGSCAFGIDMNSLSEKESVFRRLGKLIFATNLRKILSIRIQDMLPWLYN

SFLYVLPRDEKTRIIMKLMTETMEYREENNVFRPDFINMLLNLKKHPEKIDI (1)

ELTDDLLAAQIFIFFAAGFETSSSTISNALYELALNPDIQEKLRKEIKEFEARNNGEWRYE

IMKEMEYLEKVFQ (1)

ETLRKYPSLPFLNRKLINDYTFESNNVTVSKDLKIWIPVYGIHHDPDIYPDPEKFDPERFS

KKEEIMKRHPMHFLPFGHGPRNCI (1)

GARFAVYQTKIGLIKILRNFEVQVCNKTLIPYKVNPYTSLLIPITGLYLNVVKLEN*

 

>CYP6AS15

MNISLEILCGIIVALIVFYYYLIINNNFWKNRKISGPKPVIGFGNMLSIILGKESTSQFLT

RIYNEYKNEPMIGIFSKNNPALVIKNPDLIKTVLIKDFHKFANRGLFPVNS (0)

REPLSQNLFGLEVERWRPLRIHFSPIFTTNKLKGLCSLILECSEQLEKYMDILIRKGEPLD

IREIAARFTTDVIGSCAFGIEMNSLSENESEFRRLGKGVFNTTFRRIVKTRIRNLMPWLYN

FFLRILPWDEITKKIVKLTTETIEYRNKNNIVRSDFINVLLNLKKHPEKIAEI (1)

ELTNDLLSAQTFVFFGAGFETSSTTISNALYELALNHDIQYKLREEIKEFEKKNDGKWTYE

SIKEMQYLNKIFQ (1)

ETLRKYPVVPFLNRELISDYTFENSKITIPKGLKIWIPVYGIHHDPDIYPNPEKFDPERFS

EDKIKERHSMHYLPFGHGPRNCI (1)

GSRFGTYQTKIGLVKIIRKYKVEICDKTLIPYKFNSFANFLMPSTGLYLMITDVEN*

 

>CYP6AS16P PSEUDOGENE? Frameshift at X

MIANLEIFCGIIVILIAFYYYIRLEQFWKFXGIPGPEPLPGFGNVLMIVLGKEAPFQFLT

RVYNEFKNEALIGVFMKTYPALVVKDPDLIKDIMIKDFYKFPNRGFPKSDS (0?)

XNPLTQHLFLVEEEKWRPLRTQLSPVFSTGKLRGTFTQILDCSNHLVTYMDKLVEIGEPID

VREVTAKFTTDVIGSCVFGIKMNSLSGKESEFRRFGRQIFAMNFLKILRLRIKQFLPMLHY

LLVRILPPDEETKIMLKLTRDTFKFREAHNIVRPDFMNILMELKKHPEKVPSL (1)

DLTDGLLAAQAFIFFAAGFETSSTTVANTLYELALNPDIQEKLRQEIKEFEANNDGEWKYE

TIQEMKYLDKTFK (1)

ETLRKYPVLPYLSRRSIEDYTFEGTKVSIPKNTLICIPVYPIHHDSSIYPNPEKFDPERFS

EDEVKKRHSMHYFPFGHGPRNCI (1)

GLRFAIYQSKIALIKILSNYKIEICDKTLIPYKYDPFSFISLPLTGIFLKITKLQN*

 

>CYP6AS17 name changed from CYP6AS6v1, two genes, not alleles

MFDYFQILIAFVASFLALYYYLTSNFDFWKNRNVVAPKPIPFFGNTKDVVLKKIEISNFIA

ELYKKYENEAMFGIFFGGSPNLILRDLDLIKDVLIKDFSTFDERGFKISER (0)

ADPLNANLFNMDVTRWRPLRIKLSPVFTSGKLKEMFPLILKCAERLEQCLEDAVKRGGPVD

CFEISARYTTDVIGSCAFGINMNALSDERSEFRRIGKRIFDLDKNILRSFLRQFFPRFYNL

LGFVIPYSETSKFVTKFISEMIKYREENNVVKADFVNLLMELKKHPEKLQNI (1)

KITDNLLAAQAFVFFAAGFETSSTTMAHALYEMALNPNIQDKLRKEIKEFYANNNFTYEEV

KKMKYLDKVFK (1)

ETLRKYPPGVFLKRKCNSNYTFKGTKVSIPAGTSVIIPVYSIQTDPKFYENPDVFDPERFN

EDAVAARHPMTYLPFGDGPRKCI (1)

GIRFGVYQTKVGLIKMLKNFKVNVCEKTMIPYIKKINSFTLAPKDGIFLKIEKNN*

 

>CYP6AS18 name changed from CYP6AS6v2, two genes, not alleles

MFDYFQILIAFVASFLALYYYLTSNFDFWKNRNVVAPKPIPFFGNTKDVVLKKIEISNFIA

ELYKKYENEAMFGIFFGGSPNLILRDLDLIKDVLIKDFSTFDERGFKISER (0)

ADPLNANLFNLDVTRWRSLRIKLSPVFTSGKLKEIFPLILKCAERLEQCLEDAVKRGGSVD

CFEISARYTTDVIGSCAFGINMNALSDERSEFRRIGKRIFDLDKNILRSFLRQFFPRFYNL

LGFVIPYSETSKFVTKFISEMIKYREENNVVKADFVNLLMELKKHPEKLQNI (1)

KITDNLLAAQAFVFFAAGFETSSTTMAHALYEMALNPNIQDKLRKEIKEFYANNNFTYEEV

KKMKYLDKVFK (1)

ETLRKYPPGVFLKRKCNSNYTFKGTKVSIPAGTSVIIPVYSIQTDPKFYENPDVFDPERFN

EDAVAARHPMTYLPFGDGPRKCI (1)

GIRFGVYQTKVGLIKMLKNFKVNVCEKTMIPYIKKINSFTLAPKDGIFLKVEKNN*

 

>CYP6BC1 MWNIIRELLEQFLLPGLFLGILYCFLTSTFDFWKNRGVPFRKPTVLFGNFAPMLLFRKSLP

EGIKEMYEWFKDERYFGAFRVRSPVLILRDPDLVKNICVKNFTSFSNRGIPVNSQ (0)

DPLSAHLFNLEGKKWKSLRSKLTPAFSSGKLKRMFYLLAECGEEFEKLIDISSETDRPYEI

RELAAKFTIDVIGTCAFGIQINALTDEESEFHRAAKKLSKPSYKATLWRMLRTAMPRLYKF

LGVQVIDPGVTKFFKDVVSQMIKQRGEYGIKRHDFMDLLIELKNKGTLDEFG (1)

KLDENSIAAQAFVFFAAGYETSSNTIAFCLHELALNTEIQEKTRRDIQDAIDSRNGNLTYD

AVQDMKYLDMVIA (1)

ETLRKYPPASMLSRRCEYQYQIPNSKVELPAGIRVIIPIYGLHHDPDYYPSPAIFNPERFT

EENKRTRHPYAYLPFGEGPRNCI (1)

GMRFALLQIKVGIISFLRNHRVETCQKTITPIKFSRRSLVTTSEKGFWLRIK*

 

>CYP6BD1

MAMQNSLFMPIIFSIAFFVIILILYLKYTRTYWLRRGVPTVPGHWLFGNIKKILDLKKPPA

YVISDIYQKCSENDDILGIYIFFKPFLLIKDPAIIKQILIKDFNYFPDRNFTIQSFYDEIG

NKSLFTLKNPQWKYLR (2)

TKLSPIFSSAKVKKLFHLMVEAANSMNKYLDDEFSNDTKTKTIMIKDVTLKYTTNVISSVA

FGIQVNSFNPKTIQFYEE (1)

AQKGLKTTFSRSMQLCISFFFPKLSPYLNTRMLGSSTNFFRKVFWNSMDNREITKTKREDL

IDSLMELKNAKQDKDF (1)

KFEGDALLSQSAIFFIAGRETSISIICLTLYELAKHPEIQKRTREEINEKLKEHGMTYEGV

QSMKYLHQVVSEILRIYPPTPIIDRVAVADYK (0)

IPGTDIVIEKGTSVFIVLTALHNDPKYHPDPLRFNPDRFSDENKENIKPFTYIPFGEGPRICI(1)

GARIGQLQSIIGLITIIKNYEISFISKCKGDLDVRNIFITPINDFSLNLTKI*

 

>CYP6BE1

MFLTTWLIPDIIAVASLITVGLYFYYKLYLFKFWHKKGIFYVKPSFPTGNIMPIINGKLSL (1)

AEFFRDIYEHNKHHRLVGIYMLYKPYLIVNDPNLIRDILTKEFTNFHDRGIFYNEEVDPLS

GHLFQLPGKKWRNLRVKLTPTFTSGKIKQFFPILNEAGNILAKYLEEEARKGSTIDVKDIF

AR(2)YSTDIIMSVAFGISCDSFKEPNNEFRYWGKKIFDPKPLWNALILFAPQILNFFSIS

YTEKSVTKFFTNMFKQTVKYRESNNIERKDFLNLLIQLMKNGYVDADDESLSNNVNAA (1)

KNKLTMMEAAAQAYVFFLAGFETSSTTVTFCLYELAKNQDIQNKVREEIQTMIKKNGDLTY

NALNDMNYLHKVIS (1)

ETLRKYPPVVILNRICTNDVKLSTTDFCIPKGTCIAIPVFGLHRDSNIFPNPEKFDPERFS

EENIKTRHPYVYLPFGEGPRICI (1)

GLRFGLIQTKIAIINALLKNKFKFGPNTPSTLEFEKGSLILIGKGGIHLNIEPI*

 

>CYP9P1 old 335A1

MEVSHLTTFELLLLTLIFIILAKLVSILYTQFTYWKRNKVPYIRSSPLFGTAWRVFFRLVSFPNYCKYIYNYYPDARYVGVMDFATPTVIVRDPKLIKEIAVKNFNNFPDHRSFVTEEMDPVFGKNVFSLKGDRWREMRNTLSPSFTANKMRFMFDLVSKCSHDFVSCLHDRLESSSSEIEGKNLFTRYSNDVIATVAFGISVNSIEHPDNEFYRRGIDVSTFSGTFRFIKFMLFRLNPRLTRMAGFTFLSRATSKFFWRVISETVTARKRRGIVRPDMIHLLMQATDSKKKSIHQTMTIDDIVAQAFIFFLAGFDTTSTLMCYVVHELALHQDVQRRLREEVDRVLDDGTEISYEDTLGMEYLEMVISETLRMHPPTLLIDRQCAKEFHLPPAGPGYESVTIHPGENIWFPVLAIHRDPAHFPDPDKFDPERFNRENRNGIDPYTYIPFGVGPRKCIGNRFALMETKLLIIRLLRKFVIKPCERTMDPIVYKKGNFTLMPKDGFWVTFEKRNDH*

 

>CYP9P2 old 335A2

MESPTLLFSFELLAIGLTAIVLAKFVSLLHHQYNYWRKRRVPHVGAVPVLGSSWRIFTRRMSLPNFCSLVYKHRPGSRYLGMMDCFTPVVVVRDPNLIKEIAVKNFDHFPDHHSFINEKIDPIFGKNVFSLKGDRWREMRNTLSPSFTASKMRFMFDLVSNCSEEFVRYLYDHPEFSSSIEAKDAFTRYTNDVIATVAFGISVNSMENRDNEFYTKGADATNFGGIFRLFKFMLFRVNPRLTRMAGLSFLSRGTATFFHRVVRETVRARDERRIVRPDMIHLLMQARDKEDRRPVATVDNRMTIDDITAQAFIFFLAGFDTSSTLMCYVAHELALNPPVQERLREEVDRFMDGGNGAITYEALLKMEYMDMVTSETLRKYPPIVFIDRLCVEKFELPPAEQGYDHLIVHPDNIVWFPVYGLHHDPKYFPEPEKFDPERFNDANKRNIVPYTYMPFGLGPRKCIGNRFALMETKILIAYMLRKFRIKRTEKTRASIEFSKTNFSLTPDHGFWIGLEKRDP*

 

>CYP9Q1 old 335B1

MDYLQLGLTLLAILVAVYYLSTRNHKLLKRHGIVHIPPTPLFGNLGPLVRRKCHMEDVIQRVYDLDPDARYVGMYEFTTPLIIIRDPELIKTIGVKEITNFTNHRPFVDVGVDPMLGEVLFAMQGDRWREHRTMLTTLFTSSKIKSMFVLMSDCAKRFADYLSKVEREIELKSVLTRYTNDVIARCVYGVSVDSVNEPENIFYRYGQVASQLSTFKQNLMIFVHRNSPRLARLFNLKILPVHIEKFFHRLVMDTIETRRREGVHGLDMLQQLMDMQSRRKESEEGKRGMTVTDIANHAFSFFFGSVDTMATQISLISHMLAVNPDVQQRLQEEIDEVLSASEDKQVGYDVIQEMKYLDAVMSEAMRYHPILLFVDRVCGETFELPPALPGARPFKLERGMNIWFPVKAIHHDPKYFENPDRFDPDRFLRDGKGIASSGAYMPFGMGPRKCIGSRFALTEMKILLFNILAKCSFKVGSKTMVPLKFKEGVFNPVAKNGFWLKIERRENSCC*

 

>CYP9Q2 old name 335B2

MEFLSLALVLAAISIIAYYYCFVRKNFNLFQEHGILHVPPSPLVGNFGPLIRGKENVHDTIQRIYNIHPDAKYVGIFEFLTPVIMIRDLDLIKSITMKNFDQFPDHRPMFCKSVDPMLGEMLFIMDGERWKEHRNMLSPTFTSSKIKTMFVHMSECAKRFAHHLSKLPEKDRETEMKALLTRYTNDVIAACIYGVNVDSIKEPRNVFYMYGRVGATLIGLKKNLKIMVHRNMPWLANLLRLNILERHIAKFFTDLVVETVEERERNGTTNSDLIQLMMDTRNKKESGKKNLTVQNMANHAFSFFFGGFDTVSSQTCVLLHMLVENPEVQQRLQQEIDETLESNNGQLSYDVIQEMRYLDAVINEILRLHPIAVFIDRMCVKSFELPPALPGDVPFTVKPGMNVWIPVKAIHHDPRYYDEPEKFKPERFLDNGKNIIGSGAYFPFGIGPRICIGNRFALIEMKVLVCHILAVCDIKAGARTGIPLEFEKGVFNATAKTGFWLKIEPRKYSYHSGQINGLVNNHVINGACKTGI*

 

>CYP9Q3 old 335B3

MDYLTISLSLITVFVAVYYLATRNNDFFKKHGIPHVPPVPFLGNMGSLVRQKSNLHDVIDRTYNLDPGAKYVGIYEFTTPIIILRDLDLIKTITMKYLDHFPDHRSFAYEGADPVFGSMLFAMKGERWKEHRNMLTPTLTSSKIKGMFKLMTECAVRFADFLSVLPENERETEMKALLSRYANDVIASCVYGVSVDSINDPKNIFYVYGRRGTNVVGLKKSMFVLIHRNMPWLAKLFGLRFLEKHVQKFFYDLVYETIESREKLGTNRSDVLQLLMDIRDKANSSGKMTTMTVENVAIHAFTFFFGGFDSITSVTTLLTQMLAEHPDVQARLQQEIDETLRSNDGVLTYDAVHGMKYMDAVINETMRFCPVLPFLDRMCVESFQLPAPVPGGQPFTLRPGMNVWIPLAAIGRDPEYFEDPDKFDPDRFLNPEAGIKNSGAHFPFGLGQRKCIGERFAMMEMKVLLCYVLAACNVRIGSKTTVPMKLEKGLINANVKGGFWLKIEPRKVTYYNSSRSN*

 

>CYP9R1 old 335C1

MLDSWSITAAIVAVLAIAYYQLIWKYKHFERIGIPCYHSIPLLGSFWEAVIQRNNFAEISRKIYNSYPDTKYMGMYDTTTPVLLIRDTELIKAISVKHFEQFPDHRSFQNEATDPLFAKNLFALRGDRWREIRNLLSPAFTSSKMKSMFILMRDCAKEYGDYFASLTGDESTIELKDAFTRYTNDVIATCAFGVEVNSMKDRKNKFYVYGREGTTFGSWASIKFFVTRVLPVSVCTLLRIRLIRKEISDFFIDLVSTTIKTREEKGIVRPDMIQLMMESKGKLGAGKEMSMIDICAQAFVFFFGGFESTSTLMCFAAYEIAVNEDIQRRLQNEIDQVLEERDGEVTYAAVNEMKFLDAIIYEALRMYPVVVATDRVCMKPFELPPNRPGEKPYLLKEGDNVWFPIYAIQRDPQYYPEPDKFDPDRFLNDTKQMINSGLFLTFGIGPRMCIGNRFAMLETKVLLFHLFARCNLVPCSKTTIPMKLNRKGFSMTAENGFWFKIEPRSAKKEEKIAVPGTTMLIDKIPDRYPDN*

 

>CYP9S1 old 335D1

MAIFALLLIVLGILGSYHLLKSQNPFKEHGLPYKSYLPILGSTWESILRRKSFAVVIQEIYNLAPSARYVGFYNRTTPIVMIRDPELIKTIAVKNFDAFRNHRTVNDTQTDDVLLSGNLLLLRDNRWREVRSLNTPAFSTSKIRSMYRSMSEIAINVARYLSTLAPGQNIVEMKDIFTRYANDVFATCAFGISVDSLSDRENKFYELGREALDIHSTPILKLILIFAFPKLARRLGVSLVSKEATNFFTRVVSENIKMREEKGITRPDFIQAMIDKRNGRGRDDELTVEDITAQAFVFFFGGFETTSGLLSFAVHELAANPEIQGKVHAEIDRVLVSNNEITFERVNGLMYLDAVINETLRMYPIIPITDRECSKRFELPPVLPDAKPYVLKEGSHVWFPIYAIQRDPRYFEKPDCFDPDRFLDDNKKRSDAFNGDAYMPFGAGPRNCIGNRFSMVETKVALFHILAKCRLDVCPKTTIPMELRKRGVFLTAKNGFWLRIVPRHPVT*

 

>CYP336A1

MASAFLTLVTGALLLLCFYLYLKYTYWKRNGIPYSKGYYPIIGHFLPLIMKKQSYSEIIEEIYRDSNHSMVGMYKGMKPVLILRDINLIKTVLQSNFSKFHENAVKIDPKLDPLLAKNPFFCYGELWQTGRKRLTYAFSNARLKILFAAVYEVCTKFRNFLDRRLESSKKYEVELKSLFLKFTSEVVANAGLGIEGFCFEDDKVQSMFTNLDNNDFLDTFLIGIIVHFPFLTKLLRIQFLPTKHDKFFRTVVKKNLELRKSDPIPRNDFIQLMIEMEQTGEKIDEEIVAAHAVSFYLDGVETSSVTLNFIGCQLAIHQDVQEKLRKEVRSTLEKHGGVLTFEALKDMTYMNQVISESQRYFSALGFLGKICTDEFELQGSDGLNYRAKPGTELLIPICGLHKDPKYWDNPEIFDPERFSDENKQRIEKMAFIPFGEGPRICVGMRMAMLQMKSCLATLMKDYKLEVSPKMQLPLKLSPTYFLSAPLGGGWVLISKA*

 

CYPmito clan CYP12, CYP49, CYP301, CYP302, CYP314, CYP315, CYP334

 

>CYP301B1 name changed from CYP49A1 because it has higher seq

identity to CYP301, but Drosophila, Aedes, Anopheles, and Bombyx all have CYP49 related sequences, so it seems logical that Apis will also. This is the best candidate for a CYP49A1 ortholog.

50% to 301A1 Anoph. 46% to 49A1 Anoph. 50% to 301A1 Anoph. (but better expect value for 49A1 –150 vs. –142)

49% to 301A1 Dros. pseudo. 45% to 49A1 Dros. pseudo.

MQILNCKFTKLLKNTVKQSNNIIKTLETLTTEVEEQDWSRCRPYSEIPGPKPIPFLGNTWRFIPFI (1)

GDFKIQAVDQVSKKLYKEFGDIVKVEGLLGRPDMVFIYDANEIERIFRQEERMPYRPSMPS

LNYYKHVLRKEFFKENAGVIAV (2)

HGESWYNFRSKVQQVMLQPRTARMYITSMEEASLAFLER (2)

IKKIRNKNDEVPDDFLNEIHKWSLE (1)

SIARVALDVRLGCLDDDANIETQQLIDAVTTFFKNVGILELKIPFWKLFNTPTWLKYVNALDTILS (2)

ITSRYTTVALSRTKEAEKSDKEPSLLERVLALENDTKLATILSLDLFLVGIDT (0)

TSSTVASTLYQLALHPDEQDRAYNEVCNILPSKDMQLDGKHLDKLKYLKACIKETLR (2)

MYPVVIGNGRCMTKDTIIKGYRVPKG (0)

VQVVFQHYVISNLDKYFPHSDKFLPERWLQSDGVRHSFASLPFGYGRRMCLGRRFAELEMLVVISK (0)

ILQRYKIEYHHEKLEYYINPMYTPKGSLNLKFIDR*

 

>CYP301A1 68% to 301A1 Drosophila 45% to 49A1 Drosophila

67% to 301A1 Anopheles, 46% to 49A1 Anopheles

MMTGKTRYLFFKIHFLLLYVLI (2)

CMVDHDTTTIQQGKPYKDIPGPRPIPILGNTWRLFPMIGQYEISDIAKLSQIFYDEYGKIV

RLTGLIGRPDLLFVYDVDEIEKIYRQEGPTPFRPSMPCLVHYKSVVRKDFFGSLPGVVGV (2)

HGEPWREFRTRVQKPILQPQTVRKYITPIEMVTSDFIQR (2)

IQEIKGEDGEVPGDFDNEIHKWALE (1)

CIGRVALDVRLGCLSSNLTSDSEPQKIIDAAKFALRNVAILELKAPYWRYVPTLLWSRYVRNMNYFIE (2)

VCMKYIDATMERLKTKKAVDEYDLSLMERILAKETDPKIAYILALDLILVGIDT (0)

ISMAVCSILYQLATRPEEQEKIYQELVEILPDPSVPLNMSHLDKAIYMKAFIREVFR (2)

VYSTVIGNGRTLQNDTIICGYKVPKG (0)

VQVVFPTVVTGNMEKYVTDAKIFKPMRWLKESTKTLHPFASLPYGHGARMCLGRRFADLEIQVLLAK (0)

LIRSYKLEYHHKPLKYKVTFMYAPDGELKFKVLPR*

 

>CYP302A1 hybrid seq from parts not joined in genome assembly

MCTLLKKCNQSIRKKLFIKFYSNEFTKSKIKINHSQPKAFYDIPGPKSLPIIGTLYKYLPFIG (1)

EYSFTNLYESGKKKLKCFGPLVREEIIPNVNVIWIYRPEDIAEIFKAESGLHPERRSHLALLKYRKDRPNIYNTGGLLPT (2)

NGSEWWRLRKEFQKVSSKPQDVINYLKETDCVIQEFVELCNNEKFADFLPLLSRLFLEL (1)

TCLVVFDIRLNSFSKEERCENSISSKLIKAAFATNSAILKLDNGLQLWRLFETPLYRKLRKAQTYMEM (2)

IALELVSRKKNNMKIRYNKSFLDAYLENPVLDIKDIVGMACDMLLAGIDT (0)

TSYSTAYILYHLAKNQNIQEKLRIEATQLLKNHNEPISINILRNASYTKAVIKESLRLNPI

SIGIGRILQTDVVLSGYRVPKG (0)

SVVVTQNQIICRLPEYFEEPNLFIPERWLREYSENNNKINYKKTVHPYVLLPFGHGPRSCI

ARRFAEQNMQILLLR (0)

ICRRLKISWHGDDLGMISLLINKPNALLKFNFHDILNNNSV*

 

>CYP314A1

MLLSSAWFEVIAAVLLTILIFVTSHRPAWWFWTATSHEAS (1)

APAEGKFKTVSKVPGPFSLPIFGTRWIFSCIGYYKLNKIHDAYK (1)

DLNQRYGALCKEEALWNFPMISVFSRQDIETIIRRNSRYPLRPPQEVISHYRRTRRDRYTNLGLVNE (2)

QGQTWHDLRVALTSELTAASTVLGFFPALNIVADSFIELIRRQRVGYKVTGFEELAYKMGLE (1)

STCTLILGRHLGFLKPDSSSELATRLAEAVRIHFTASRDAFYGLPLWKLLPTCAYKQLIESEDAIYN (2)

IISEIIETTIQEKRDDAKDESVEAIFQSILRQKNLDIRDKKAAIVDFIAAGIHT (0)

LGNTLVFLFDLIGRNPAVQNKLYEETYALAPAGCDLTIDNLRKAKYLRACITESLR (2)

LIPTTTCIARILDEPIELSGYRLTAG (0)

TVVLLHTWIAGLNEENFKDAKKYLPERWTTPTTPHSPLLVAPFGAGRRICPGKRFVDLALQLILAK (0)

IIREFEIIVEEELDLQFEFILAPKGPVSLGFRDRS*

 

>CYP315A1

MNLAQNILKSGKSVSLSSNVIALKYNVPGCGYAGASQTSRIDDLSDISKSTDGGNRSKIEI

TEKLRDRNYGTVAVATSESILQEMPEPRGIPVFGTLFSFILSGGPKKQHEYVDKRHKELGP

VYKERIGPTTAVFVNSIHEFRKIFRLEGSTPKHFLPEAWTLYNEIRKCRRGLLFM (2)

NGEEWVYFRKILNKVMLLPDPTNLMIAPCQEVAIELKRKWQKQIKTNNIISNLQVQLYQWSIE (1)

AMMATLMGSYWYSYKHQLSRDFEILAETLHEIFEYSAKLSIIPVKLAMNLRLPVWKKFVAS

ADTAFEIVRMLVPEMAKLGGNGLLKKMMDEGIRAEDAICIVTDFILAAGDT (0)

TATTLQWILLLLCNHPEKQEELFKHLKDLSQEDILRLPLLKGIIKESLRLYPIAPFISRYL

PEDSVIGNYFVPKG (0)

ELLVLSLYSSGRDAANFPQPNEFRPERWIRTQKGIYQGVVHPHASLPFALGARSCIGRKLA

EIQISFALAE (0)

LIKSFKIECINKNQVKLILHLISVPSQSIKLKLMERN*

 

>CYP334A1

MTESQTASVDESLRTDTIPLLDHTATTEVSPTTFEVSTMKVDTQIFDKAPLPFDEIPGPAI

LKIWEKYWKYVPLL (1)

GRLTWNRNITPLKYLFNEYGCIVRINGPLSGDIVMIHR (2)

PEHIAEVFKQEGDTPVRSGIDILQHYRLNYRKYRLAGPFSM (2)

QGTEWLEIRDKVEDTFNQISSTFFTKIDTCCNELITRICKIRNRQNE (0)

VPVSFYEDLIRWAMECFCDLTFNKRLGFLEPIGYNSSSEASKLINALTTAHKYMSRCETGF

QVWRFFLTPFARKLFEACDVLDN (2)

VIGKYVRQAQCKLRIRKSHSEESSMTERSPVLEKLLLNEGIHPDDICTMLMDMIILGIQA (0)

TVNSEAFLLYHLAKNPRTQRKVYDEIISVLSNDNSSFTEKSLKNMPYLKACIQETLR (2)

LHPAIPYITRLLPKTISLHGYTIPKG (0)

TFVIMANQITSQREENFEDPFKFWPERWLSNSSKEDVHFSYLPFGHGIRSCLGKNMAEAKMMLLTAK (0)

LVRQFRIEYDYADIKSRFMMVNVPNKPLRFRFVNRN*

 

Fragments of P450s

 

>Am2_6006 INCOMPLETE 5 aa diffs to 6AS12

YGRIVHGTESDSLLKDPELIKDVLIRDFSKFADRGFNVHEK (0)

VEPLSQHLFNLEPKRWRPLRSKLSPMFTSKKLKEMFGLILECGRHFEKYVDGLAARRQPVD

FCEVAAKYTTDVIGSCAFGINMNAMSSEGSEFREAGRKIFEPTWNSIIRLKFKITMPTLYD

LLGPLVPEREVTPFFIKVVTDAMKYKKESNVFRPDFIDTLMKLRDDPESLSDI (1)

ELTDAFLTAQAYVFFAAGFETGASTISNTLYELAQNQGMQDRLREEIREHCDKYGGELMYE

NIKEMEYLDKVFK (1)

XTLRKYPPGTLIPRRSVSEYTFKNTNVTIPKGTMIWIPAFPIHRDPDIYPNPDDFNPENFT

EDAINNRHPMNYLAFSNGPRNCIG (1)

 

>CYP6AS2-1like FRAGMENT GB19197-PA 209aa 3 exons gnl|Amel_2.0|GroupUn.5353 minus (2931-2710,2531-2286,2114-1950) 51% to D. melanogaster CYP6a17(AC020447) pseudogene fragment, similar to C-term of CYP6AS2 100%

(1)LTDILLTAQAVVFFVAGFETSSSTMAFSLYELAQNQEIQDKLREEIRKVHEKNKGVLT

YTDIKEMKYLDKVFK (1)

ETLRKYPILSTLSRKVMENYTFKGTKITIPKGTKIWVHGIQHDPNIYPEPEVFDPERFEDD

AFASRHPMSYLPFGDGPRNCI (1)

GLVLPHYQSKVGLITILRNHKXVNVCEKTTIPFKADERSFLLVPKGGIHLKIIKI*

 

>Am2_101397060 FRAGMENT GB19113-PA first 29 aa same as 6AS14

XTLRKYPSLPFLNRKLINDYTFESNNVTVSQRFKRFWIPVYRNLTPISDIYPDPEKSHL*K

ISLGGNYGEGAPKAFLPFGQGPQNMSCVWPIK (0)

 

>Am2_98681617 FRAGMENT GB12136-PA nearly identical to 6AS12

VDVAFYYYFTSALNFWKIRGIPGPHXPKFLFGNIRDIILSRISTPTFIKNVYDTYTNEPMV

GLYMARNPILLLKDPELIKDVLIRDFSKFADRGFNVHEK (0)

 

>Am2_98611857 FRAGMENT GB14612-PA

PRDGSSQEKFDPERFSEENIKLGHPEVYLPFGEGPRIL (9)

IGLRFGLIQTKIAIINALLKNKFKFRPNTPSTSEFEKGSLILIGTGGIHLNIEPL*

 

>CYP6AS2-2like FRAGMENT GB11798-PA 1st exon of CYP6AS2

MDYFQILCAISIVILTIYYYYSLKYAFWKDRGISGPKPIIFFGNLGNSIIKKKSLSETVKK

WYDDYKHESVFGIFEGTIPVLVINDLDMIKDILIRDFSLFVDRGFHIFPK(0)

 

>Am2_79040249 FRAGMENT GB11754-PA

(0)KVPLTNHLFALEVKRWRPLRNHLSPVFTSGKLKGTFAQILNXAPMILXDSYRHFIEDG

RLYKHAREVAAKFTTDVIGSCVFGIKMNSLAEREXEFRRIGRNIFATNFTNILKIRLLTST

PLLHSLLCRILPDEEMRXMFFRITKETMEYREKNNPPQAGFYEYTSRLETEV

 

>Am2_48488186 FRAGMENT GB12885-PA

DTLSKMEDSINMREVAAKFTTDVIGXSCVFGIKMNSLSEIENEFRRIGRNIFATNFTNILK

IRLLQSTPFLHSLLCKILPDEEMEMFMRITKETMEYREKNNLVRPDFMNILLELKKHPERIADIG (1)

 

>CYP6AS11-like FRAGMENT GB11027-PA

MYIGFEIIYGIVIVFIAFYYYLTINNNFWKHRGISGPKPVLGFGNMKKIILGEESMSQYLT

KLYHKYKNESMIGIFRLRTPALIIKDPDLIKIVLIKDFSKFMNRGLLPIIS(0)

 

>CYP4AZ1-like FRAGMENT GB10905-PA 55% to CYP4AB1(AAP97090.1)

(0)DRPPAEIKAAIEENGGKLNITVLQFFSDLERCIKESLRLFPTVAGGSKLETSRVKF (1)

 

>Am2_Un.5369 FRAGMENT GB18672-PA 96% to CYP6AS9

EIENEFRRIGRNIFATPFYKYIEDRLLQSTPLLHSLLCKILPDEEMEMFMRITKETMEYRE

KNNLVRPDFMNILLELKKHPERIADI (1)

EITDDLLAAQAYIFFAAGFETSSTTISNALYELALNHDMQDKLREEIKEFEAKNDGEWRYE

TIKEMQYLGKIFQE  (1)

ETLRKYPALSFLSRESIIDDYTFRDTKLTISKGTLVWIPVFPIHHDPNIYPEPEKFDPERF

TEDKIKERNLMHYFPFGHGPRNCI (1)

GARFAIYQTKIGLIKILRNYKVDVCSKTLIPYKYDPFSFILVPLGGLYLKITRIQD*

 

>Am2_57889394 FRAGMENT GB17793-PA

ETVEDTERNGTTNSDLIQLMMDTRNKKESGKKNLTVQNMANHAFSFFFGGFDTVSSQTCVL

LHMLVENPEVQQRLQQEIDETLESMMNYSKYQFS (1)

 

>Am2_14.6 PSEUDOGENE 38% to CYP9L1(AAAB01008986.1)

DSFRYSAHFANSSS*KNPVGPVLAQSSSKREILLDFTLNPSINQGDSFDRFIDRSSTRNIFSF*GAKGRKR*GILSPSFTMNEIDVRNCSQVCGSFALIYPRWTQRKILSSATT*SFIYQILLESV*IRWKTTSQFPLMRFRVHD**DY*E*FLKSTIFDFVPRVVSNGIEWRTRHR*TGYTAFIARNKEKSSIDKMTIDDIVSRAIRFSTWMALTLALSNFMRFVLYDPAFGWRGKEIDG*NILRILRRVYTGMMMSETLGKYSGLIFVDRVCTRKFALSGAGYDGATLYPGDIVFSPYVGSRS*IFFRSRNEKNEGNIVPYWNLLLEIKIFMIPSRRFSIRLNEKTPEAYRLSEK*SCIHS*RRNLD*IGRKERL*ILWLVKNFVHYFAKRFM