Honeybee Cytochrome P450s
from version 1.2 of the Apis mellifera genome
Sept. 17, 2004

These sequences were submitted Feb. 11, 2004 by May Berenbaum in collaboration with Gene Robinson (genome sequencing), Hugh Robertson (genome annotation) and Reed Johnson (P450 annotation). On Sept. 2, 2004 Reed Johnson submitted the revised sequences from Honeybee Version 1.2. These sequences were further edited and revised by D. Nelson on Sept. 13-17. Many partial sequences have been combined, and the number of complete genes increased from 20 to 33. The number of P450 genes including fragments is now 55, including 22 partial sequences.

There are 4 CYP clans in insects: CYP2, CYP3, CYP4 and mitochondrial.
CYP2 is the clan with CYP18 in it. Sometimes it is called the CYP18 clan. CYP2 also has CYP303, 304, 305, 306, 307
CYP3 has CYP6 and 9 in it and CYP28, 308, 309, 310, 317 (a CYP6 subfam)
CYP4 has CYP4, 311, 312, 313, 316, 318
the mito clan has CYP12, 49, 301, 302, 314, 315

The honeybee sequences have been sorted into these main CYP clan bins. 38 genes have been named. 3 named genes are pseudogenes.

CYP2 clan (9 sequences, 6 complete, 3 incomplete)

>CYP18A1 honeybee Am1.2_13.1b (32196-34509) plus, 524 aa, 5 exons version 1.1 = Am1.1_13.1a (71904-74217) plus, 524 aa, 5 exons (71904-72236, 72653-72774, 72894-74323, 73404-73668, 73794-74217) 60% to CYP18A1 D.
melanogaster MGGTRIEVLCTFLVFLGVLLVARCLQWLRYVRSLPPGPWGVPVFGYLPFLKGDVHLRYG ELAKKYGPMFSARLGTQLVVVLSDHRTIRDTFRREEFTGRPHTEFINILGGYG(1) IINTEGAMWKDQRKFLHDKLRGFGMTYMGGGKKIMESRIM(0) REVKTFLRGLASKRGTPTDVSASLGMSISNVICSIIMGVRFQHGDARFKRFMDLIEEGFKLFGSMAAVNFIPVMR YLPCLQKVRNKLAENRAEMAGFFQETVDQHRATFDEGTMRDLVDAYLLEIEKAKGEGRATTLFQGKNHD(1) RQMQQILGDLFSAGMETVKTTLEWAIILMLHHPDAAIAVQEELDQVVGKSRMP VLEDLPFLPITEATILEVLRRSSVVPLGTTHATTR(2) DVTLHGYTIPAGSQVVPLLHAVHMDPELWEKPEEFRPSRFLSAEGKVQKPEYFMPFGVGRRMCLG DVLARMELFLFFSSLMHTFELRSPQGSSLPSLRGNAGVTVTPDPFDVCLLPRNLDLIEDNDMISTGAILRNIGSH* >CYP306A1 Am1.2_13.1a (39182-37170) minus, 499 aa, 7 exons version 1.1 = CYP306A1 Am1.1_13.1b (76876-78884) minus, 499aa, 7 exons (78884-78557, 78493-78345, 78209-78007, 77901-77595, 77538-77383, 77298-77123, 77057-76876) 44% to 306A1 Anopheles gambiae = CYP306A1 in 18clan/2clan CYP306A1 is the probable ortholog of CYP306A1 in diptera (flies and mosquitos) the % identity is below the usual cutoff for subfamily membership, but it makes sense to name orthologs with the same name. 
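The note above relies on percent identity and on a cutoff for subfamily membership. A minimal sketch of how such a figure can be computed from a pre-aligned pair of sequences is given here. The function names are hypothetical, and the 40% (family) and 55% (subfamily) thresholds are assumptions taken from the commonly used P450 nomenclature convention, not values defined by this file's authors. Gapped columns are ignored, which is one of several possible conventions.

```python
# Sketch only: percent identity over a pre-aligned pair, plus the
# conventional P450 naming thresholds (assumed, not taken from this file).

FAMILY_CUTOFF = 40.0     # assumption: above this, usually the same CYP family
SUBFAMILY_CUTOFF = 55.0  # assumption: above this, usually the same subfamily


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns where neither sequence has a gap ('-')."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def naming_level(identity: float) -> str:
    """Map a percent identity to the naming level suggested by the cutoffs."""
    if identity > SUBFAMILY_CUTOFF:
        return "same subfamily"
    if identity > FAMILY_CUTOFF:
        return "same family"
    return "different family"
```

Under these assumed thresholds, the 44% identity reported for CYP306A1 against Anopheles gambiae would only support family-level membership; the note explains that probable orthology is what justifies reusing the subfamily name.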
>CYP306A1 Am1.2_13.1a (39182-37170) minus, 495 aa, 7 exons MILDHYIAIFVLPFLLLLYVVRKNRKARRLPPGPWQLPLLGYLPWIDAEKPHETLTR LSRVYGPVCGFRMGSVYTVLLSDPQLIRQSFAKDSITNRAPLYLTHGIMKGYG (1) IICAEGEQWKDQRKFISNCLRNFGMVKHEGAKRDKMEERISDAVNECVS (0) VLRDRGANGPIDPLDTLHHCLGNLVNSIVFGKTYEEEDRIWKWLRHLQEEGVKQIGVAGPLNFLPFLR (2) FLPQYGRVIRSIVDGKDKTHEIYRQILDEHRARVDSGNGCKIDSFLAAFDEQMRKK DGAESGYFTEPQLYHLLADLFGAGTDTTLTTLRWFLLFMAAHPMEQ (0) EKIQSEMDLCLREGEQPTLNDRIVMPRLEAAIAEVQRIRSVTPLGIPHGTSE (0) DVEIGGYDIPCGAMIVPMQWAIHTDPAYWRDPLEFRPDRFLSEDGTFFKPESFLPFQNG (1) KRVCVGEELARMILFLFAGRILRAFSVRVPAGEIADLEGECGITLVPKPHRLAFVGRDR* >CYP307B1 Am1.2_14.10 (757965-761727) plus, 507 aa, 3 exons version 1.1 = Am1.1_14.9 (18918-22680) minus, 507 aa, 3 exons (22680-22278, 20154-19363, 19246-18921) CYP307B1 55% to 307B1 Anopheles, probably the ortholog of 307B1 MIPLTATTCFLIAITFLALALILLDHLRSKKTTKSVVPGDDDQHALPEPPGPKPWPILGSL HILGRYDVPYKAFADLVRDFDCQVIKLRMGSVPCVVVNGLENIKEVLTVKGHHFDSRPNFARYHLLFGGNKENS (1) LAFCNWSDVQKARREMLRAHTFPRAFSTRFNELNGIIGDEMEFMVNHLDSLSGTSVHAKPLILHCCANIFITYL CSKNFHLEHDGFRNMVENFDKVFFEVNQGYAADFLPFLMPLHHRNMARMAHWSHEIRRFVIKNIIADRVNSWND VVPEKDYVDCLINHVKSGTEPQMSWNTALFVMEDIIGGHTAIGNLLVKVLGFLATRPEIQRLAQDEIDALGLAG NFVGLENRRSLPYVEAIILETIRIIASPIVPHVANQDSSIAG (1) FRIKKDTFIFLNNYDLNMSTDLWTSPEEFMPDRFVQNGRLLKPEHFLPFGGGRRSCMGYKLVQYVSFAILASIL KNFTITPVQKEDYTIPIGNLALPEMTYKFRFERR* >CYP303A1 Am1.2_Un.1361 incomplete on N-terminus (5389-7223) minus, 378 aa, 8 exons AADG03013520.1 (WGS) version 1.1 = CYP303A1 Am1.1_Un.2253 incomplete on N-terminus (55-1889) plus, 378 aa, 8 exons 43% to 303A1 D.mel., 44% to 303A1 Anoph. 
CYP303A1 in 2 clan/18 clan may be the ortholog of 303A1, first exon below is 75% to 304A1 (1)LLLVDGNLWNEQRRFVLKHLRDFGFGRQN (1) LYMNANEYTGNNVTQSQLGTIISMHNIFGITVLNSLWKMLAGKR (2) YNIDDKELIYFQRILSITLNEIDMLGAPFSHFPLLRFIAPEISGYKSFVKIHEELWKFFK (0) DEVNNHKNTFNSDSPGNLIDIYLTILNSENYGKTFSD (1) VSEPQLVAICVDLFMAGSETTSKVLGFCFLYLVLFPHVQKKAHEEIDRVIGRNKLPTAEDKAK (2) MTYMNAIVLESLRMFAGRSLNLPHRVQRDTKISDYKIPK (0) NTIIITNFNGILMDESWGDPENFRPERFIDGSGNIVTPSRFLPFSAG (1) KHRCMGENLAKTNIFIIATTLLQAFTFSEIPGEKPTIEHFIDGTTISPKPYRVNVSLRI* >CYP305D1? Am1.2GroupUn.127b (70044-73538) minus, 490 aa, 8 exons (first exon is a guess) version 1.1 Am1.1_Un.7452 incomplete on both ends (700-954) minus, 85 aa, exon 2 version 1.1 Am1.1_Un.6110 incomplete on both ends (1-622) minus, 126 aa, exons 6,7 Two fragments from v.1.1 have been combined in this sequence. The first exon is a best guess. BI513047.1 (EST) AADG03008281.1 (WGS) 39% to 305A2, 39% to 305B1 Bombyx 36% to 305C1 partial seq. MNKNFVKIFNLLLYLFIDVFNNLFFIECD (1) GPFSWPFIGNQILLKRLSRKFGGQHKAFMELSKRYNSDIITVNISYEKIIV VSGSKFCDMILQNEEFQGRPWNEFIKVRNMGKKQG (1) ITMNDGTEWKELRNWMMRTMKIFGFGKSEMIEMIQHQLVIFSENLNKNKLHQLKLLFVPAVINVLWNFITGELVAFNQQQK(2) LEHFLDLLDRRSRCFDITGGLLAAFPWIRYIAPEISGYNIMCMLNKELKDFLM (0) KTINDHKEKYIEGKEADLIDMFIQEMRKNEKSSIFT (1) DEQLMMILIDLFLAGFTTTSTTLDFLFLIVTLFPDVQRKVQKEIDSVIPYDRLPNMEDKAK (2) LPYVEAVISETYRLWPVFPIIGPRRVLCDTNIDKYVIPKDTTILFNTYSINKDPTLYPDPDKFMPE (frameshift) RFIX (frameshift) KWSFLNQMIFFTIGK (1) GKRRCPGDILAKATIFILFVGIMQKYTLLPVPGKGPHSIKINSGITLTPQPYNVLVEKR* >CYP15A1 Am1.2_Un.127a (66491-68918) plus, 504 aa, 8 exons XM_392687.1 partial gnl|Amellifera1|165771382 BCM Apis mellifera 12/11/2003 Am1.1_Un.897 incomplete, missing exon 5 (51606-53773) plus, 357 aa, 5 exons Am1.1_Un.10970 incomplete on N-terminus (112-677) plus, 121 aa, 2 exons Two fragments from v.1.1 have been combined. 
18clan/2clan 47% to CYP15A1 Diploptera punctata (probable ortholog) 39% to CYP15B1 Anopheles gambiae 36% to 303A1, 35% to 305A3, 31% to 304B1 MLYVVISLLLALYCIFCIYDCVKPHNFPP (1) GPKWLPLIGCFLTFRRLKLKHKYTYVAFQELSKTYGPILGLKLGSQKLVVI STHDLVKKVLLQDEFNGRPDGFFFRVRAFGKRK (1) GILFTEGSMWSQCRRFTMRHLRSFGLGQSTMEKYLTVEAENLVNYLRRVST KGPVPMHTAFDIAVLNSLWCMFAGHRFDYENEKLAEILEIVHDSFR (2) LMDTMGGIISQMPFLRFIIPELSGYNNLMEILRKLWNFLDEEINNHEKHLSGNQPQDLIEAFLLEISSRNGVQNDSIFDR (1) ENLLILCLDLFLAGSKTTTDTLSTSILFLSLHSEWIKILQEELDNVVGRSRSPTLEDYSSLPIMESFLAE (0) IQRFLIWAPLGVPHKTTKDVILNGYNIPK (0 GC boundary?) DTTVLLDFHSAHNDPAYWDHPEEFRPQRFLDANGRFCQNNANIPFGL (1) GKRRCPGEMLARTSLFLYFAYVIHYFDIEISPEHGKPDLNGHDGFTISPKSYYLKITARSDVTNCSTI* >CYP342A1 Am1.2_8.3 incomplete on N-terminus (1269-3097) plus, 505 aa, 7 exons version 1.1 Am1.1_Un.8493 incomplete on both ends (509-1028) minus, 141 aa, 2 exons 36% to 304A1 36% to 304B1 18clan/2clan MISFLFIIFLLLIIYKIYNSVIHVSSNTPP (1) CLPRLPIIGSYWHLLWHDYEYPYNGIIHYVNKLQSKIVTCYFG SHKTIIANDYKSIKEVLTKQEFNGRPINVDIVLQRAFGKSL (1) GIFFTEGTLWHEQRRFALRHMRDFGFGRRHEIFETNVMEEIAILVDMLKEGPINDEEK (0) KFLKNGYACFPDILYPYVANVILNIMFGERFDRSQYHKLIYFCESS MMFQKSLDTSGGAIFQFWFLKYFGNIFGYTNAIKATYQMINFIE (0) EYIDNKKDLDDYDKGLIGRYLKILKEKNNITSTFSQKQLIMTLVDFMFPATSALPSA LVHAIKLVMHHPRVVNNIQEEIDRVVGTGRLVTWSDRKN (2) LPYIEATIRESLRYETLTPLSVFHKTLKKTTLCDYDIPKDTLVVTNLVALNTDPDLWGDP ENFRPERFLDENNELRKDFTFPFGF (1) GHRVCPGETYSRYNMFEVFAVLMQNFNFSFVEGEPTGLDDKESGLIVTPKKTWIQVKARNMK* >Am1.2_Un.3241 (94-459) minus, 96 aa, 2 exons at N-terminus up to C-helix version 1.1 = Am1.1_Un.7901 incomplete on C-terminus (1-459) minus, 126 aa, 2 exons 40% to 15B1 MWFVILCFVIVLIKILFDYSRPINFPPG(1) PRGLPFIGNILDIIRLINETKYYSDTWCRLAEKYGSVVGLRLGLDQPLIIVSGKSAVTEMLNRSEFDG RPSGFLYKYRCGGMQQGILFTDTDVWHSQR >Am1.2_Un.6423 incomplete on N-terminus (67-247) minus, 254 aa, 4 exons AADG03019874.1 (WGS) Am1.2_Un.1305 incomplete on N-terminus (2043-2647) plus, 146 aa, 2 exons Am1.1_Un.960 incomplete on both ends (8311-11562) minus, 147 aa, 2 exons One fragment in 
v.1.1 was divided into two in v.1.2, but they are probably from one gene. 43% to 305a4 (0) EVIAKHKWKKINEEGTNFIDVYLQEIQKKNSSHSFFN (1) ENQLLYIIKDLFSAGVDTTNSTIGFIIAFLVVHQDVQSKVYDEISRVIDKDIYPSLSDKDR (2) LPYLKAVIAEVSRLANIGPTSIPHRAVKDSTFLGFEIKKNYTLLANFKSIHMDK EHWGDPEIFRPERFINEKGDFINDSWLMPFGL (1) GRRKCLGETLAKNTVFLFVACMLQRLHFMLPSNHPPPCLQGIDGFVIAPPMMDIIAVQRF* CYP3 clan (34 seqs, 19 complete and 15 incomplete) >CYP6AQ1 Am1.2_12.14 (417204-420026) plus, 514 aa, 5 exons CYP6AQ1 Am1.1_12.14 (315628-318450) plus, 514 aa, 5 exons (315628-316108, 316238-316604, 316677-316861, 316989-317277, 318272-318450) 45% to 6K1, 42% to 6g2m 43% to 6G1ps new subfamily in CYP6 cyan = missing seq. from EST BE844578 yellow = EST BE844462, underlined seq = EST BE844394, green = EST BE844353 magenta = EST BE844352, gray = EST BE844331 all ESTs from antennae MNLLTPYWSLDILIVSSSLMIAVYLYASWKLKYWSRRGIMQITPSPLFGNFKKCILFQKSVSEIIRELYGQNEGLPFMGFY IFYKPFFLVRDIELVKHILVKDFNTFANKHTSADSKNDRIGYSNLFIIKNPAWKYLRGKLTSVFTSGKLKKMFDLMLIIG (1) KNLEKHLELLNLDG NGKEVELKDLCANFTTDLIGTTAFGVNLNSLKDPNSDFRENGRLVFDYNLKRAFEFFSIFFFPNLS KYVSIKFFGKATDYFRNSFWSVINQRIESNVKRNDLIDCLIELREKHKNDESFEGFR (1) FDGDDLVSQAAIFFTGGFETSSTTISFTLYELALNKDIQKTVRTEIHEALAQTDGKITYDM (0) ITNLPYLDMVVSETLRKYPPLGFLDRVALHDYKIPNSDVTIDKDTPVIIPMIAFHYD PKYFPNPEKYDPLRFSEEVKKTRPSYVYMPFGEGPHICIG (1) MRLGLLQSKLGIIEILKDYEVSPCEKTKIPMVLDPKGLTTTALGGLYLNIRKITIAAG* >CYP6AR1 Am1.2_5.5 (430078-432827) minus, 502 aa, 5 exons CYP6AR1 Am1.1_Un.19 (44801-47550) plus, 502 aa, 5 exons 50% to AmGroupUn.5496, 47% to AmGroupUn.792b, 38% to 6a13ps all best hits to 6as MSWLMIETVGLIATVFFLLYYYSMSKLDYWRKRGVKGPKPLPFLGNFKDVLLAKESTMDCFERA YKEFKDEPMVGMYGSHEPLLILRDLDLIKDVLIKDFNKFAQRTQGAIRE (0) VEPLSEQLFRLDAERWRPLRLKLSSFFSSGKLKEMFHLFVECSDNFEKYLEKMVEKGGLVECRDAAAK FSTDVIGACAFSIHTNALTDENSQFRKMGKQALATNLQQFLNDRLREYPFLFKIFGRFFVDHEVTNFFAN SIKDAMDYRIQNNVHLRDVIDILADIRENPTKCGLKE (1) ADNLFLTSQAVLFFLAGFENASLTISNALYELAWKPEIQEKARAEIVNVLQKYDGKITYDGLEEMKYLEACIF (1) 
ETLRMYPVLQWLSREAMETYTFTGTKVTIPKGQQVFLPIYAIQRDPDIYPNPDNFDPERFTDDKIKTRHSMTHLPFGDGPRHCS (1) GIRLAKKQLKVGLVTVLSKFKVEVCEKTRKIYQKDKKPLFLLQPVDGIHLKISKVSV* >CYP6AS1 Am1.2_Un.6491 (264-2651) plus, 498 aa, 5 exons CYP6AS1 Am1.1_Un.5496 (264-2651) plus, 498 aa, 5 exons 44% to 6A14, 64% to AmGroupUn.792b MDYFQILCAISIVILTIYYYYSSKYTFWKKRGISGPKPIIFFGNFVDSIIQ KRSTSEAVKKWYDDYKHESVFGIFGGTTPLLVINDLDMIKDVLIRDFSLFVDRGFHIFPK (0) IEPLSEHLFLLEAERWRPMRMKLSPIFTSGKLKEM FFLIMESAGNLEKYLDEVIKKDEMVECRELAAKFMTDVIGSCAFGINTNSLLE EDSEFRRMGKKISTPNLKVMLGNICKEFFPPLYEIVGSIF TLKDVNEFFINLVSDTMKYRKDNNIIRSDFINMLMQLKEHPEKMENI (1) ELTNTLLTAQAVVFFVAGFETSSSTMAFSLYELAQNQEIQDKLREEIRNMHEKNKGVLTYTDVKEMKYLDKVFK (1) ETLRKYPILPMLFRQAMENYTFKDTKITIPKGMKLWVPVHGIHHDPNIYPEPEVFDPERFEDDAFASRHPMSYLPFGDGPRNCI (1) GARFAHYQSKVGLITILRHHKVNVCEKTTIPFKADERSFLLTLKGGVHLKITKI* >CYP6AS2 Am1.2_Un.601a (2171-4675) plus, 498 aa, 5 exons CYP6AS2 Am1.1_Un.792a incomplete on C-terminus of exon 2, missing 3rd exon (8375-10702) minus, 356 aa, 5 exons cyan is from EST BE844607 from antennae 86% to AmGroupUn.5496 cyan = places that differ between ver1.1 and ver 1.2 MDYFQILCAISIVILTIYYYYSSKYAFWKDRGISGPKPIVFFGNFGNSIVKKRSISET VKKWYDDYKHESVFGIYEGTIPVLVINDLDMIKDVLIRDFSIFVDRGFHTFPK (0) IEPLTQHLFLLEAERWRPMRMKLSPIFTSGKLKEMFSLIVESAGNLEKYLDEVIKKNEMVECRDLAAK FTTDVIGSCAFGINTNSLLEEDSEFRRMGKKIFSPSLKLMIGNTCKVFFPSLYEVIGNIFTMKDVDE FFINLVSDTMKYRKDNDIVRSDFINMLMQLKEHPEKMDNI (1) ELTDTLLTAQAVVFFIAGFETSSSTIAFGLYELAQNQEIQDKLREEIRKMHEKNKGILTYTDIKEMKYLDKVFK (1) ETLRKYPILSTLSRKAMENYTFKGTKITIPKGTKVWVPVYGIQHDPNIYPKPEVFDPERFEDDAFASRHPMSYLPFGDGPRNCI (1) GARFAHYQSKVGLITILRNHKVNVCEKTTIPFKADERSFLLALKGGVHLKITKI* >CYP6AS fragment Am1.2_Un.1326b incomplete on C-terminus (4-243) minus, 80 aa, 1 exon 1 aa diff to Am1.2_Un.5413 >Am1.2_Un.5413 stop codon in 3rd exon, incomplete on both ends, (1-1334) plus, 315 aa, 3 exons 60% to 6AS2, 71% to Am1.2_Un.6932 in one overlapping exon 3 New seq in ver 1.2 MDFFQIFCAICIMLLAIYYYYTSFYNYWKVRGIPGPEPTIIIGNFMEVFLKKISINDKLRFLYNKYKN 
EPMFGIFEGSSPILVLNDLDLIKDVLIKDFSIFSNRGFRIFPK(0) AEPLGEHLFALETERWRPMRAKLSPIFTSGKLKEMFPLIIECSKNMEPYLDKIAERGKYIECRDLAAK FTTDVIGSCAFGIDMNSISDKDSEFRIIGRKLFTPTFKTIVRDVCRQFLPGLYDVIGHKLQIEEVNEF LTNLIKDTINYRKENKIVRPDFVNTLIELKDHPEKLETIK(1) LTDSMIASQAFV*GVAGFETSSSTISHALYELAQNQEIQDKL >CYP6AS fragment Am1.2_Un.6932 incomplete on N-terminus (41-1021) minus, 209 aa, 3 exons, 92% to last 3 exons of CYP6AS2 New seq in ver 1.2 (1)LTDILLTAQAVVFFVAGFETSSSTMAFSLYELAQNQEIQDKLREEIRKVHEKNKGVLTYTDIKEMKYLDKVFK (1) ETLRKYPILSTLSRKVMENYTFKGTKITIPKGTKIWVHGIQHDPNIYPEPEVFDPERFEDDAFASRHPMSYLPFGDGPRNCI (1) GARFAHYQSKVGLITILRNHKVNVCEKTTIPFKADERSFLLVPKGGIHLKIIKI* >CYP6AS3 Am1.2_Un.601b (7365-9709) plus, 499 aa, 5 exons CYP6AS3 Am1.1_Un.792b (3396-5727) minus, 499 aa, 5 exons 64% to AmGroupUn.5496 MDYFQLLCVIGALLFAIYYYLTLTFDTWKNRGIPGPKPTIFFGNFQEVILKKISLA EKTKQLYQEYKNELVFGIFQGRTPILVINDLEMIKDVLIRDFSVFPDRGIHVNPK (0) VEPIFQTLFSLKSKTWRPLRMKLSPVFTSGKLKDMFPLILDCAKNLEEFVEKVRNSGEPVDC RDMAAKFTTDVIGSCAFGVCMNSLSPEGSEFRRMGEQLGKFSFKKLARDFTRLYMPFLFDII GGYLQSHEVNNFFINLIRDSIKYRQENNVYRPDFVNTLKELKEHPEKLENI (1) ELTDALLTSQALVFFLAGFETSSTTISNALYELAQNPEMQDKLRKEIKEVYENNGGALSYTDVKEMKYLDKVFK (1) ETLRKYPVLAALSRQATENYTFKDTKIKISKGTRIWIPVYGIQHDPNIYPEPEVFDPERFEDDAFTSRHPMTYLPFGDGPRNCI (1) GARFAHYQSKVGLITILRDNKVEVCAKTLIPYKSEPRNILMIPKGGKVELGITKV* >CYP6AS4 Am1.2_Un.1740a (1295-3956) plus, 501 aa, 5 exons CYP6AS4 Am1.1_Un.1753 incomplete on C-terminus of exon 1 (2509-4596) minus, 411 aa, 5 exons 80% to AmGroupUn.42b, 58% to AmGroupUn.792b, 41% to 6a17, 44% to 6a13, MLHHFHILTAFVAIFLALYYYLTSKFDFWKNRGVSGPRPVPFFGNAKDVLLRKIGIG SFIAELYKRYDNEAMFGIFIGRSPNLVLRDLDLIKDVLIKDFSIFDNRGLNIPER (0) AEPFSVNLFSVDATRWRPLRMRLSPVFTSGKLKEMFPLILECAEHLEQCLEDAVKRGGPVDCFE IPARYTTDVIGSCAFGINMNALSDERSEFRKMGRNMFDQNMIKFTRNLLRDFFPRFYNLLGFVL PYTESTVFMTKLIKGTIKYREENDVVRPDFVNLLMELKKHPEKLKNI (1) EITDTLLAAQASVFFAAGFETSSTTMAHALYEMALNPDIQDKLRNEMKEFHAKNNGNLKYEDIKEMKYLDKVFR (1) ETLRKYPPGMLLRRKCNSNYTFHGTKVSIPAGTSVIIPLYAIQIDPKFYENPDVFDPERFNEDAVAARHPMTYLPFGDGPRNCV (1) 
GARFAVYQTKVGLIKILQNFRVDVCEKTMIPYVKKINSITLAPRDGIFLKIEKITD* >CYP6AS5 Am1.2_13.15b (125743-128770) minus, 499 aa, 5 exons, now complete CYP6AS5 Am1.1_Un.42a pseudogene? stop codon in exon 4 before heme binding site and missing exon 5 (126603-129298) minus, 439 aa, 4 exons 56% to AmGroupUn.5496 43% to 6M2 new subfamily in CYP6? MASSFEILCGIAVLFLALYYYLTSTFDFWKSRGVVGPKPVPFFGTTKDLILVKKSTAHFV KDIYEKYKNEPMVGLYATRSPFLLLNDPELIKDILIRDFSKFANRGLGVFER (0) TEPLSPHLLNLEVERWRPLRSRLSPIFTSGKLKEMFYLIIECSLNLEMYLDKLIE KNEPIECRELTARFTTDVIGSCAFGIDMSSMTNENSEFRRMGREVFAVNFMNVMR MKLKQFMPRLYDLLGYVMPDRTFAPFFTRVVTDTIKYRNDNNIVRPDFINMLMELQKNPQKLENI (1) KLTDSLIAAQAFVFFLAGFETSSTTMSNALYELALNQDVQKKLREEINTFCPKNNKELKYDDIKEMEYLDKVFK(1) ETLRMYPPASILMRKAISDYTFNDTKITIPKEMKIWIPAFAIHRDSAIYPNPDSFDPERFDKDAMASRHPMHYLPFGDGPRNCI (1) GARFAVYQTKVGLITILRNHKVEVCEKTVIPYEFDPGAFLLSPKDGIYLKITKI* >CYP6AS6 Am1.2_13.15c (120924-123194) minus, 497 aa, 5 exons CYP6AS6 Am1.1_Un.42b alternate splice for exon 3? (121844-124324) minus, 497 aa, 5 exons 41% to 6a17, best matches all CYP6As 80% to AmGroupUn.1753 I = V in ver 1.1 MFDYFQILIAFVASFLALYYYLTSNFDFWKNRNVVAPKPIPFFGNTKDVVLKKIE ISNFIAELYKKYENEAMFGIFFGGSPNLILRDLDLIKDVLIKDFSTFDERGFKISER (0) ADPLNANLFNMDVTRWRPLRIKLSPVFTSGKLKEMFPLILKCAERLEQCLEDAVKR GGPVDCFEISARYTTDVIGSCAFGINMNALSDERSEFRRIGKRIFDLDKNILRSFL RQFFPRFYNLLGFVIPYSETSKFVTKFISEMIKYREENNVVKADFVNLLMELKKHPEKLQNI (1) KITDNLLAAQAFVFFAAGFETSSTTMAHALYEMALNPNIQDKLRKEIKEFYANNNFTYEEVKKMKYLDKVFK (1) ETLRKYPPGVFLKRKCNSNYTFKGTKVSIPAGTSVIIPVYSIQTDPKFYENPDVFDPERFNEDAVAARHPMTYLPFGDGPRKCI (1) GIRFGVYQTKVGLIKMLKNFKVNVCEKTMIPYIKKINSFTLAPKDGIFLKIEKNN* >CYP6AS6-se1[5] Am1.2_13.15a solo exon pseudogene (132407-132573) minus, 54 aa, 1 exon 1 aa diff to last exon of CYP6AS6, but in different region on 13.15 the two sequences are on AADG03005752.1 11483 bp apart new sequence in ver 1.2 14302 (1)IRFGVYQTKVGLIKMLKNFKVNVCEKTMIPYIKKINSFTLAPKDGIFLKVEKNN* 14138 >CYP6AS7 Am1.2_13.15d (115784-118302) minus, 487 aa, 5 exons AADG03005751.1 (WGS) CYP6AS7 
Am1.1_Un.42c incomplete on N-terminus (115419-117055) minus, 265 aa, 3 exons 45% to 6M2, 66% to AmGroupUn.1753 Nearly complete sequence. BI505169.1 EST (N-terminal) MXXXXXXXXXXXXXFLAIYYYFTSTFDFWKVRGV PGPKPIPIFGNIKNVMLLKTSMCHYLKKLCEEYKH EPMIGIFTRKTPILIIQDPDLIKDVLIRDFSKFANRGIPIHEK (0) AEPLSPHLFNLEVERWRPLRTRLSPVFTSGKLKEMFPLILDCAKHLEQYLDKLVLREEFIE CRELTAKYTTDVIGSCAFGIEMNALSDEESEFRRIGRKVFSNSFGQILRFRFRQIFPRIYN LLGFVLPPMEVTKFLTNIIVSTMKYRQENNIVRPDFVNMLIELKKHPDKLENI (1) KLTDTLLTAQAFVFFIAGFETSSSAISNALYELALNPEVQNKLRQEIKEYFNKHNELKYEYIKNMIYLDLVFR (1) ETLRKYPPGPLILRKSITNYTFNNTKVSIPEESFVWIPLYAIHHDPKIYPNPDAFIPERFNDDAIATRHPMHYLPFGDGPRNCI (1) GARFAVYQSKIGLITILWNYKVEVCDKTMIPYEINPAAFLLTPKGGIYLKFTKIKNNEEILN* >CYP6AS8 Am1.2_Un.5081 (10839-13505) minus, 500 aa, 5 exons CYP6AS8 Am1.1_Un.4533 (10741-13406) minus, 500 aa, 5 exons 53% to AmGroupUn.2631 42% to 6P4 48% to 6N1 partial T = R in ver 1.1 MYISLEIFCGIVVALIALYYYLTVNNNFWKNRGIAGPEPVLGFGNMKKVLL GKESMSQFLTKIYHEYKNEPIIGIFTTRTPQLIIKDPDLIKTILIKDFSKIMNRGLLPMVS (0) GEPISQHLFNIEAERWRPLRIHLTPVFTANKLRGMFSLILECSMHFVSYVDSLVKKGEP VNVREVAARFTTDVVGSCGFGVEMNSLSEKESEFRRVGKSVFATNYARIIKHRIREFMP RLYNYILYLWPTDEMAEKIIKLTRETLEYREKNNLFRPDFMNILLDLKKHPEKIGL (1) DVTNEFLAAQAFIFFVAGFETSSSTISNALYELALNPDVQDKLRKEIKEFAAKNDGEWRYETIKEMEYLGKVFQ (1) ETLRKYPSLPFLTRELIEDYTFESNKVTIPKGLKIWIPTYAIHNDPDIYPDPDKFDPERFSDDKIKQRHPMHFLPFGHGPRNCI (1) GARFAIYQTKIGLINILRNFKLDVCDKTLIPYKHHPRGLLLMPLTDLYLKITRLTN* >CYP6AS9 Am1.2_Un.1054b frameshift in exon 2, possibly a pseudogene because splice site donor for intron 4 removes 2 aa from heme binding region (1823-4511) minus, 499 aa, 5 exons CYP6AS9 Am1.1_Un.2792 incomplete on N-terminus, pseudogene??? 
heme binding site probably not functional due to change in splice donor (995-3268) minus, 367 aa, 4 exons 65% to 6AS8 Am1.1_Un.4458 incomplete on both ends (1-1218) plus, 203 aa, 2 exons 3aa diffs to AmGroupUn.2792 61% to 6AS8 AADG03012612.1 AADG03012613.1 MFINLETLCGFVIVLIAFYYYLTINNNFWKNRGIPGPKPTIGFGNMWTVMFGKESFSQLLTTIYNKYKDE PMIGIFFRRRPVLLLKDFDLIKDVLIKDFSKFANRGFLKTNP (0) KVPLTNHLFALEVKRWRPLRNHL (FRAMESHIFT) SPVFTSGKLKGTFAQILNCSNDLVTHIDTLSKMEDSINMREVAAKFTTDVIGSCVFGIKMNSLSEIENEF RRIGRNIFATNFTNILKIRLLQSTPFLHSLLCKILPDEEMEMFMRITKETMEYREKNNLVRPDFMNILLELKKHPERIADIE (1) LTDDLLAAQAYIFFAAGFETSSTTISNALYELALNHDMQDKLREEIKEFEAKNDGEWRYETIKEMQYLGKIFQE (1) TLRKYPALSFLSRESIIDDYTFRDTKLTISKGTLVWIPVFPIHHDPNIYPEPEKFDPERFTEDKIKERNLMHYFPFGHGPRNC (1) ARFAIYQTKIGLIKILRNYKVDVCSKTLIPYKYDPFSFILVPLGGLYLKITRIQD* Am1.1_Un.4458 and Am1.1_Un.2792 from v.1.1 match best to CYP6AS9 in v.1.2. >CYP6AS10 Am1.2_Un.1054a (7016-10145) minus, 499 aa, 5 exons CYP6AS10 Am1.1_Un.2631 incomplete on C-terminus of exon 1 (2686-5804) minus, 495 aa, 5 exons AADG03012613.1 AADG03012614.1 gnl|Amellifera1|174017188 BCM Apis mellifera 12/11/2003 40% to 6M2 53% to AmGroupUn.6AS8 MAAFEILCGFIIFIFAFYYYLIKPQEYWKNRGVPGPKPIPIFGNFFRLTFA RISIGDLMTKFYKEYKHEPVFGLYMRNVRVLAINNPDLIKTVLIKDFSKFAHRGLALNEV (0) TEPLSQHLFVLEPKRWRPLRTKLSPIFTSGKLKDMFSLIIECSNTLENYVEHLISKNDRVE VRDLAAKFTTDVIGSCGFGVDMNAMSDVQCKFRDIGREFFGPSFKQILKIRLRENLPRLYT FLGYILPRDETTTFFTNVVLDMIKYRKTNDIYRPDFINALINIQNHPEKLDI (1) ELTEPLLVAQAFLFFVAGFETSSLTIATALYELAQNQDIQDKLRDEITEHHKLNNGEWQYENIKNMPYLDAVFK (1) ETLRKYVPLTVLMRQSLEDYTFESINLTIPKDTRIFIPIYAIHRDPDIYPNPEVFDINRFSKEAEATRHPMHYLPFGDGPRNCI (1) GARFAIFQTKIGLIKILRTYKVDVCNETQIPFINEPRTFTLAPKHDLTLKITKIEN* >CYP6AS11P (pseudogene) Am1.2_Un.5080a incomplete on N-terminus (1-1697) plus, 229 aa, 4 exons with three frameshifts and a stop codon gnl|Amellifera1|166350743 BCM Apis mellifera 12/11/2003 Am1.1_Un.4532 incomplete on N-terminus (1-757) minus, 131 aa, 2 exons 83% to 6AS8 = differences in ver 1.1 HLPSRFTSNKL frameshift 
KGMFCMIHECSKHLVNY frameshift LDIXVEKEDPVNVRE frameshift VAARFTTDVVGSCGFGVEMNSWSEQESEFRRLGKSIFNTNVQKILKDRFRE*RPQVYNFL LYILPLDGISPKILKLMKETIKYRKKYDIFRPDXNIILELKKHPEKINI (1) DITNELLAAQIFIFFAAGFETSSTLISNALYELALNPNIQDKLREEIKKFESQNDEEWKYETIKKMDYLEKVIQ (1) ETLRKYPPVPFLNRELIDDYTFESNKVTIPKGLKIWIPTYAIHNDPDIYPDPDKFDPERFSEDNIKQRHPMHFLPFGHGPRNCI (1) GIRFAEYQTKIGLINILRNFKLDVCDKTLIPYKLHPRGLILIPLTDLYLKITRLTN* >Am1.2_1.19, incomplete on C-terminus (860909-863534) plus, 381 aa, 3 exons in CYP6, 46% to 6AS5 missing EXXR to end new in ver 1.2 MWNIIRELLEQFLLPGLFLGILYCFLTSTFDFWKNRGVPFRKPTVLFGNFAPMLLFRKSLPE GIKEMYEWFKDERYFGAFRVRSPVLILRDPDLVKNICVKNFTSFSNRGIPVNSQ(0) DPLSAHLFNLEGKKWKSLRSKLTPAFSSGKLKRMFYLLAECGEEFEKLIDISSETDRPYE IRELAAKFTIDVIGTCAFGIQINALTDEESEFHRAAKKLSKPSYKATLWRMLRTAMPRLY KFLGVQVIDPGVTKFFKDVVSQMIKQRGEYGIKRHDFMDLLIELKNKGTLDEFG (1) KLDENSIAAQAFVFFAAGYETSSNTIAFCLHELALNTEIQEKTRRDIQDAIDSRNGNLTYDAVQDMKYLDMVIAG(1) >Am1.2_13.16 incomplete on C-terminus (2365-4885) minus, 445 aa, 4 exons Am1.1_Un.248a incomplete on C-terminus (38475-40756) plus, 338 aa, 4 exons 49% to 6Aa14, 73% to 6AS8 MYIGLEILCGIVITLIAFYCYLTINNNYWKNRGIPGPKPVPGFGNMKNVIFGKESV SQFLTRMYNEYKDEPMIGVFSKRTPVLIVKDVDLIKTILIKEFPKFANRGLFPIFS (0) RDPLTHHLFNLEVERWKPLRTQFTPLFTSSKLKEMFSLILECSNHLESYMDTLI KKGEPIDMREVSARYTTDVVGSCAFGIDMNSLSEKESVFRRLGKLIFATNLRKI LSIRIQDMLPWLYNSFLYVLPRDEKTRIIMKLMTETMEYREENNVFRPDFINMLLNLKKHPEKIDI (1) ELTDDLLAAQIFIFFAAGFETSSSTISNALYELALNPDIQEKLRKEIKEFEARNNGEWRYEIMKEMEYLEKVFQ (1) ETLRKYPSLPFLNRKLINDYTFESNNVTVSKDLKIWIPVYGIHHDPDIYPDPEKFDPERFSKEEIMKRHPMHFLPFGHGPRNCIG (1) >Am1.2_Un.5080b incomplete on C-terminus (3231-4470) plus, 287 aa, 2 exons Am1.1_Un.248b incomplete on C-terminus and N-terminus of exon 2 (43061-45682) plus, 245 aa, 4 exons 78% to 6AS8 Am1.1_Un.248c incomplete on C-terminus (47350-47685) plus, 112 aa, 1 exon 94% to AmGroupUn.248b, only 6aa diffs at N-term Am1.1_Un.248b and Am1.1_Un.248c match best to the same sequence in v.1.2, suggesting that the aa differences between them were errors. 
The C-terminal portion of Am1.1_Un.248b matches exactly to the C-terminal part of Am1.1_Un.248a/Am1.2_13.16. The C-terminal part of Am1.1_Un.248b was not found on Am1.2_Un.5080b and may have been the product of misassembly in v.1.1. MYIGFEIIYGIVIVFIAFYYYLTINNNFWKHRGISGPKPVLGFGNMKKIIL GEESMSQYLTKLYHKYKNESMIGIFRLRTPALIIKDPDLIKIVLIKDFSKFMNRGLLPIIS (0) TEPISHHLFALEAERWHPLRKHLTSGFTSNKLKGMFCMIHECSKHLVNYLDNLVRKEEPVNV REVAAKFTTDVVGNCGFGVEMNSLSEKESEFRRLGKSIFNTNLQKIIKDRIRELTPQIYNFL LYILPLDGISPKILKLMEETITYRKQYDIFRPDFMTIILELKNHPEKFNIG(1) >Am1.2_Un.7762 incomplete on C-terminus (1072-1314) plus, 81 aa, 1 exon gnl|Amellifera1|2043985811 BCM Apis mellifera 8/15/2003 Am1.1_Un.9652 incomplete on C-terminus (993-1236) plus, 81 aa, 1 exon 72% to AmGroupUn.248a note: this exon 1 may join with Am1.2_Un.218 to make a complete sequence MNISLEILCGIIVALIVFYYYLIINNNFWKNRKISGPKPVIGFGNMLSIILGKESTSQFLTRIYNEYKNEPMIGIFSKNNP ALGIRNPDLIETVLIKDFHKFANRGLFPVNS (0) >Am1.2_Un.218 incomplete on N-terminus (12841-15336) minus, 389 aa, 4 exons Am1.1_Un.8460 incomplete on both ends (123-685) plus, 139 aa, 2 exons 71% to 6AS8 Am1.1_Un.8178 incomplete on both ends (801-1322) minus, 174 aa, 1 exon 64% to 6AS8 (0)REPLSQNLFGLEVERWRPLRIHFSPIFTTNKLKGLCSLILECSEQLEKYMDILIRK GEPLDIREIAARFTTDVIGSCAFGIEMNSLSENESEFRRLGKGVFNTTFRRIVKTRIRN LMPWLYNFFLRILPWDEITKKIVKLTTETIEYRNKNNIVRSDFINVLLNLKKHPEKIAEI (1) ELTNDLLSAQTFVFFGAGFETSSTTISNALYELALNHDIQYKLREEIKEFEKKNDGKWTYESIKEMQYLNKIFQ (1) ETLRKYPVVPFLNRELISDYTFENSKITIPKGLKIWIPVYGIHHDPDIYPNPEKFDPERFSEDKIKERHSMHYLPFGHGPRNCI (1) GSRFGTYQTKIGLVKIIRKYKVEICDKTLIPYKFNSFANFLMPSTGLYLMITDVEN* >CYP6AS12P Am1.2_Un.1740b possible pseudogene (6374-9226) minus, 500 aa, 5 exons AADG03014312.1 Am1.1_Un.8712 incomplete on N-terminus (13-1433) minus, 267 aa, 4 exons 60% to 6AS5 Am1.1_Un.10510 incomplete on both ends (122-647) minus, 176 aa, 1 exon 54% to AmGroupUn.42a, 39% to 6a16m all best hits = CYP6As from aa107-274 probably a new CYP6 subfam or CYP6a Am1.1_Un.6966 incomplete on C-terminus 
(878-1100) minus, 72 aa, 1 exon 40% to 28A5 N-term 38% to 6a20 57% to AmGroupUn.42a Possible pseudogene because of stop in first exon. DIFFERENCES WITH VER 1.1 MAYVEILCVGIIVSMLAFYYYFTSAFNFWKIRGIPGPKPKFLFGNIRDIILSRISTPAFIKNVCDTYTN EPMVG*YMGRNTLLLLKGPELIKDVLIRDFSKFADRGFNVHEK (0) VEPLSQHLFNLEPKRWRPLRSKLSPMFTSKKLKEMFGLILECGRHFEKYV DGLAARRQPVDFCEVAAKYTTDVIGSCAFGINMNAMSSEGSEFREAGRKIFEPTWNSII RLKFKITMPTLYDLLGPLVPEREVTPFFIKVVTDAMKYKKESNVFRPDFIDTLMKLRDDPESLSDI (1) ELTDAFLTAQAYVFFAAGFETGASTISNTLYELAQNQGMQDRLREEIREHCDKYGGELMYENIKEMEYLDKVFK (1) ETLRKYPPGTLIPRRSVSEYTFKNTNVTIPKGTMIWIPAFPIHRDPNIYPNPDDFNPENFTEDAINNRHPMNYLAFSNGPRNCI (1) GARFANYQVKIGLIMILRNYKVEVCEKTVIPYQFDPNLFLLGPKGGIYLRVTKVE* >Am1.2_Un.1326a incomplete, missing 3rd exon (4844-8009) minus, 430 aa, 4 exons Am1.1_Un.2707 incomplete missing exon 3 (1-3597) plus, 368 aa, 4 exons 59% to AmGroupUn.4533 note XM_396006.1 mRNA predicted by program GNOMON skips exon 3 MIANLEIFCGIIVIVIAFYYYITARNNFWKIRGIPGPEPLPGFGNVLMIVLGKE APFQFLTRVYNEFKNEALIGVFMKTYPALVVKDPDLIKDIMIKDFYKFPNRGFPKSDS (0) ADPLTQHLFLVEEEKWRPLRTQLSPVFSTGKLRGTFTQILDCSNHLVTYMDKLVEIGEP IDVREVTAKFTTDVIGSCVFGIKMNSLSGKESEFRRFGRQIFAMNFLKILRLRIKQFLP MLHYLLVRILPPDEETKIMLKLTRDTFKFREAHNIVRPDFMNILMELKKHPEKVPSL (1) missing exon 3 ETLRKYPVLPYLSRRSIEDYTFEGTKVSIPKNTLICIPVYPIHHDSSIYPNPEKFDPERFSEDEVKKRHSMHYFPFGHGPRNCI (1) GLRFAIYQSKIALIKILSNYKIEICDKTLIPYKYDPFSFISLPLTGIFLKITKLQN* >Am1.2_Un.416 incomplete on both ends (10974-18858) minus, 223 aa, 3 exons gnl|Amellifera1|236106554 BCM Apis mellifera 12/11/2003 Am1.1_Un.2162 incomplete on both ends (1-852) plus, 223 aa, 3 exons 39% to 6g1 aa175-381 probable new subfamily in CYP6 41% to 6AQ1 NSMNKYLDDEFSHD TKTKTIMIKDVTLKYTTNVISSVAFGIQVNSFNPKTIQFYEE (1) GLKTTFSRSMQLFISFFFPKLSPYLNTRMLGSSTNFFRKVFWNSMDNREITKTKREDLIDSLIELKNSKQDKDFS (1) FEGDALLSQSAIFFIAGRETSISIICLTLYELAKHPEIQKRTREEINEKLKE HGMTYEGVQSMKYLHQVVSEILRIYPPTPIIDRVAVADYK (0) >Am1.2_Un.6686 incomplete on N-terminus (2630-5871) minus, 453 aa, 5 exons missing about 54 aa 
gnl|Amellifera1|2049199734 BCM Apis mellifera 8/15/2003 Am1.1_Un.2637 incomplete on C-terminus (7-2478) plus, 330 aa, 4 exons 45% to 6P4 new subfamily in CYP6? 46% to 6F1 Am1.1_Un.5730 incomplete on both ends (990-741) minus, 84 aa, 1 exon 54% to 6d4m 60% to AmGroupUn.8460 60% to 6AM1 intron boundary revised. 44% to 6AS5 FIAEFFRDIYEHNKHHRLVGIYMLYKPYLIVNDPNLIRDILTKEFTNFHDRGIFYNEEVDPLS GHLFQLPGKKWRNLRVKLTPTFTSGKIKQFFPILNEAGNILAKYLEEEARKGSTIDVKDIFAR (2) YSTDIIMSVAFGISCDSFKEPNNEFRYWGKKIFDPKPLWNALILFAPQILNFFSISY TEKSVTKFFTNMFKQTVKYRESNNIERKDFLNLLIQLMKNGYVDADDESLSNNVNAA (1) KNKLTMMEAAAQAYVFFLAGFETSSTTVTFCLYELAKNQDIQNKVREEIQTMIKKNGDLTYNALNDMNYLHKVIS (1) ETLRKYPPVVILNRICTNDVKLSTTDFCIPKGTCIAIPVFGLHRDSNIFPNPEKFDPERFSEENIKTRHPYVYLPFGEGPRICI (1) GLRFGLIQTKIAIINALLKNKFKFGPNTPSTLEFEKGSLILIGKGGIHLNIEPI* >Am1.2_Un.383 incomplete on N-terminus (6360-7086) plus, 84 aa, 2 exons Am1.1_Un.4500 incomplete on N-terminus (298-1025) minus, 84 aa, 2 exons 52% to AmGroupUn.5496 53% to 6AM1 AIFNPERFTEENKRTRHPYAYLPFGEGPRNCI (1) GMRFALLQIKVGIISFLRNHRVETCQKTITPIKFSRRSLVTTSEKGFWLRIK* New families in the 3 clan CYP335 and CYP336 >CYP335A1 Am1.2_14.13a (10454-12001) plus, 515 aa, 1 exon CYP335A1 Am1.1_14.7a (400127-401674) minus, 515 aa, 1 exon 40% to 9f2m new subfamily in CYP9 63% to 335A2 39% to 9D1, 38% to 9K1 50% to 9J8 partial MEVSHLTTFELLLLTLIFIILAKLVSILYTQFTYWKRNKVPYIRSSPLFGTAWRVFFRLVSFPNYCKYIYNYYPDARYVGVMDFAT PTVIVRDPKLIKEIAVKNFDNFPDHRSFVTEEMDPVFGKNVFSLKGDRWREMRNTLSPSFTANKMRFMFDLVSKCSHDFVSCLHDR LESSSSEIEGKNLFTRYSNDVIATVAFGISVNSIEHPDNEFYRRGIDVSTFSGTFRFIKFMLFRLNPRLTRMAGFTFLSRATSKFF WRVISETVTARKRRGIVRPDMIHLLMQATDSKKKSIHQTMTIDDIVAQAFIFFLAGFDTTSTLMCYVVHELALHQDVQRRLREEVD RVLDDGTEISYEDTLGMEYLEMVISETLRMHPPTLLIDRQCAKEFHLPPAGPGYESVTIHPGENIWFPVLAIHRDPAHFPDPDKFD PERFNRENRNGIDPYTYIPFGVGPRKCIGNRFALMETKLLIIRLLRKFVIKPCERTMDPIVYKKGNFTLMPKDGFWVTFEKRNDH* >CYP335A2 Am1.2_14.13c (15290-16849) plus, 519 aa, 1 exon CYP335A2 Am1.1_14.7b (395285-396839) minus, 519 aa, 1 exon 39% to 9f2m 63% to 335A1 40% to 9K1 51% 
to 9E2 partial, 54% to 9J7 partial MESPTLLFSFELLAIGLTAIVLAKFVSLLHHQYNYWRKRRVPHVGAVPVLGSSWRIFTRRMSLPNFCSLVYKHRPGSRYLGMMDCF TPVVVVRDPNLIKEIAVKNFDHFPDHHSFINEKIDPIFGKNVFSLKGDRWREMRNTLSPSFTASKMRFMFDLVSNCSEEFVRYLYD HPEFSSSIEAKDAFTRYTNDVIATVAFGISVNSMENRDNEFYTKGADATNFGGIFRLFKFMLFRVNPRLTRMAGLSFLSRGTATFF HRVVRETVRARDERRIVRPDMIHLLMQARDKEDRRPVATVDNRMTIDDITAQAFIFFLAGFDTSSTLMCYVAHELALNPPVQERLR EEVDRFMDGGNGAITYEALLKMEYMDMVTSETLRKYPPIVFIDRLCVEKFELPPAEQGYDHLIVHPDNIVWFPVYGLHHDPKYFPE PEKFDPERFNDANKRNIVPYTYMPFGLGPRKCIGNRFALMETKILIAYMLRKFRIKRTEKTRASIEFSKTNFSLTPDHGFWIGLEK RDP* >CYP335B1 Am1.2_14.13h (106877-108409) plus, 510 aa, 1 exon CYP335B1 Am1.1_14.7c (305231-306757) minus, 510 aa, 1 exon 39% to 9E2 35% to 9f2m 57% to 335B2 42% to 335A1 MDYLQLGLTLLAILVAVYYLSTRNHKLLKRHGIVHIPPTPLFGNLGPLVRRKCHMEDVIQRVYDLDPDARYVGMYEFTTPLIIIRD PELIKTIGVKEITNFTNHRPFVDVGVDPMLGEVLFAMQGDRWREHRTMLTTLFTSSKIKSMFVLMSDCAKRFADYLSKVEREIELK SVLTRYTNDVIARCVYGVSVDSVNEPENIFYRYGQVASQLSTFKQNLMIFVHRNSPRLARLFNLKILPVHIEKFFHRLVMDTIETR RREGVHGLDMLQQLMDMQSRRKESEEGKRGMTVTDIANHAFSFFFGSVDTMATQISLISHMLAVNPDVQQRLQEEIDEVLSASEDK QVGYDVIQEMKYLDAVMSEAMRYHPILLFVDRVCGETFELPPALPGARPFKLERGMNIWFPVKAIHHDPKYFENPDRFDPDRFLRD GKGIASSGAYMPFGMGPRKCIGSRFALTEMKILLFNILAKCSFKVGSKTMVPLKFKEGVFNPVAKNGFWLKIERRENSCC* >CYP335B2 Am1.2_14.13g (159215-160813) minus, 532 aa, 1 exon CYP335B2 Am1.1_14.7d (252241-253839) plus, 532 aa, 1 exon 57% to 335B1 MEFLSLALVLAAISIIAYYYCFVRKNFNLFQEHGILHVPPSPLVGNFGPLIRGKENVHDTIQRIYNIHPDAKYVGIFEFLTPVIMI RDLDLIKSITMKNFDQFPDHRPMFCKSVDPMLGEMLFIMDGERWKEHRNMLSPTFTSSKIKTMFVHMSECAKRFAHHLSKLPEKDR ETEMKALLTRYTNDVIAACIYGVNVDSIKEPRNVFYMYGRVGATLIGLKKNLKIMVHRNMPWLANLLRLNILERHIAKFFTDLVVE TVEERERNGTTNSDLIQLMMDTRNKKESGKKNLTVQNMANHAFSFFFGGFDTVSSQTCVLLHMLVENPEVQQRLQQEIDETLESNN GQLSYDVIQEMRYLDAVINEILRLHPIAVFIDRMCVKSFELPPALPGDVPFTVKPGMNVWIPVKAIHHDPRYYDEPEKFKPERFLD NGKNIIGSGAYFPFGIGPRICIGNRFALIEMKVLVCHILAVCDIKAGARTGIPLEFEKGVFNATAKTGFWLKIEPRKYSYHSGQIN GLVNNHVINGACKTGI* >CYP335B3 Am1.2_14.13d (145027-146580) minus, 517 aa, 1 exon CYP335B3 
Am1.1_14.7e (266474-268027) plus, 518 aa, 1 exon 58% to 335B2 ESTs BI508364 BI516506 BI505081 BI505012 MDYLTISLSLITVFVAVYYLATRNNDFFKKHGIPHVPPVPFLGNMGSLVRQKSNLHDVIDRTYNLDPGAKYVGIYEFTTPIIILRD LDLIKTITMKYLDHFPDHRSFAYEGADPVFGSMLFAMKGERWKEHRNMLTPTLTSSKIKGMFKLMTECAVRFADFLSVLPENERET EMKALLSRYANDVIASCVYGVSVDSINDPKNIFYVYGRRGTNVVGLKKSMFVLIHRNMPWLAKLFGLRFLEKHVQKFFYDLVYETI ESREKLGTNRSDVLQLLMDIRDKANSSGKMTTMTVENVAIHAFTFFFGGFDSITSVTTLLTQMLAEHPDVQARLQQEIDETLRSND GVLTYDAVHGMKYMDAVINETMRFCPVLPFLDRMCVESFQLPAPVPGGQPFTLRPGMNVWIPLAAIGRDPEYFEDPDKFDPDRFLN PEAGIKNSGAHFPFGLGQRKCIGERFAMMEMKVLLCYVLAACNVRIGSKTTVPMKLEKGLINANVKGGFWLKIEPRKVTYYNSSRS N* >CYP335C1 Am1.2_14.13f (153220-154824) plus, 531 aa, 1 exon CYP335C1 Am1.1_14.7f (258230-259825) minus, 531 aa, 1 exon 44% to AmGroup14.7d 49% to AmGroup14.7g MLDSWSITAAIVAVLAIAYYQLIWKYKHFERIGIPCYHSIPLLGSFWEAVIQRNNFAEISRKIYNSYPDTKYMGMYDTTTPVLLIR DTELIKAISVKHFEQFPDHRSFQNEATDPLFAKNLFALRGDRWREIRNLLSPAFTSSKMKSMFILMRDCAKEYGDYFASLTGDEST IELKDAFTRYTNDVIATCAFGVEVNSMKDRKNKFYVYGREGTTFGSWASIKFFVTRVLPVSVCTLLRIRLIRKEISDFFIDLVSTT IKTREEKGIVRPDMIQLMMESKGKLGAGKEMSMIDICAQAFVFFFGGFESTSTLMCFAAYEIAVNEDIQRRLQNEIDQVLEERDGE VTYAAVNEMKFLDAIIYEALRMYPVVVATDRVCMKPFELPPNRPGEKPYLLKEGDNVWFPIYAIQRDPQYYPEPDKFDPDRFLNDT KQMINSGLFLTFGIGPRMCIGNRFAMLETKVLLFHLFARCNLVPCSKTTIPMKLNRKGFSMTAENGFWFKIEPRSAKKEEKIAVPG TTMLIDKIPDRYPDN* >CYP335D1 Am1.2_14.13e (151195-149678) minus, 507 aa, 1 exon CYP335D1 Am1.1_14.7g (261868-263392) minus, 507 aa, 1 exon 49% to 335C1 MAIFALLLIVLGILGSYHLLKSQNPFKEHGLPYKSYLPILGSTWESILRRKSFAVVIQEIYNLAPSARYVGFYNRTTPIVMIRDPE LIKTIAVKNFDAFRNHRTVNDTQTDDVLLSGNLLLLRDNRWREVRSLNTPAFSTSKIRSMYRSMSEIAINVARYLSTLAPGQNIVE MKDIFTRYANDVFATCAFGISVDSLSDRENKFYELGREALDIHSTPILKLILIFAFPKLARRLGVSLVSKEATNFFTRVVSENIKM REEKGITRPDFIQAMIDKRNGRGRDDELTVEDITAQAFVFFFGGFETTSGLLSFAVHELAANPEIQGKVHAEIDRVLVSNNEITFE RVNGLVYLDAVINETLRMYPIIPITDRECSKRFELPPVLPDAKPYVLKEGSHVWFPIYAIQRDPRYFEKPDCFDPDRFLDDNKKRS DAFNGDAYMPFGAGPRNCIGNRFSMVETKVALFHILAKCRLDVCPKTTIPMELRKRGVFLTAKNGFWLRIVPRHPVT* >CYP336A1 Am1.2_Group2.13 
(199618-201108) plus, 496 aa, 1 exon CYP336A1 Am1.Am1.2.15 (200938-202428) plus, 496 aa, 1 exon 34% to 6AH1 Anopheles EST BI946448 from brain MASAFLTLVTGALLLLCFYLYLKYTYWKRNGIP YSKGYYPIIGHFLPLIMKKQSYSEIIEEIYRDSNHSMVGMYKGMKPVLILRDINLIKTVLQSNFSKFHENAVKIDPKLDPLLAKNP FFCYGELWQTGRKRLTYAFSNARLKILFAAVYEVCTKFRNFLDRRLESSKKYEVELKSLFLKFTSEVVANAGLGIEGFCFEDGKVQ SIFTNLDNNDFLDTFLVGIIMHFPFLTKLLRIKFLPTKHDKFF RTVVKKNLELRKSDPIPRNDFIQLMIEMEQTGEKIDEEIVAAHAVSFYLDGVETSSVTLNFIGCQLAIHQDVQEKLRKEVRSTLEK HGGVLTFEALKDMTYMNQVISESQRYFSALGFLGKICTDEFELQGSDGLNYRAKPGTELLIPICGLHKDPKYWDNPEIFDPERFSD ENKQRIEKMAFIPFGEGPRICVGMRMAMLQMKSCLATLMKDYKLEVSPKMQLPLKLSPTYFLSAPLGGGWVLISKA* CYP4 clan (4 seqs, 2 complete, 2 incomplete) >CYP4G11 Am1.2_Un.3078 (673-4557) minus, 545 aa, 7 exons Am1.1_Un.2145 (673-4743) minus, 545 aa, 7 exons CYP4G11 63% to 4g15m, 100% to AF207948 partial seq MAAASATGFSASSVFLSLLIPALILYFIYFRISRRHLLELAEKIPGPPALPLIGNALDLFGT (1) MFSQVLKKAENFKDVVKIWVGPKLVICLIDPRDVEIILSSNVYIDKSTEYRFFKPWLGDGLLIST (1) GQKWRNHRKLIAPTFHLNVLKSFIDLFNANARSVVEKMRKENG KEFDCHNYMSELTVDILLETAMGVSKPTRDHNAFEYAMAVMK (2) MCDILHLRHTKIWLRPDWLFNLTKYGKNQIKLLEIIHGLTKKVIQLKKEEYKSGKRNIIDNSAQKTESK (0) TNNIVVEGVSFGQSVGLKDDLDIDDDVGEKKRQAFLDLLIEAGQNGVLLTDKEVKEQVDTIMFE (0) GHDTTASGSSFFLAVMGCHPDIQEKVIQELDEIFGDSDRPATFQDTLEMKYLERCLLETLRMYPPVPLIAREIKTDLKLA (1) SGDYTIPAGCTVVIGTFKLHRQPHIYPNPDVFDPDNFLPEKTANRHYYAFV PFSAGPRSCVGRKYAMLKLKIVLSTILRNFRVRSDVKESEFRLQADIILKRADGFKIRLEPRKQVASTA* >CYP4AV1 Am1.2_Un.1387 (15087-18877) plus, 124 aa, 2 exons Am1.2_Un.1625 (113-5079) plus, 437 aa, 7 exons >CYP4AV1 Am1.1_Un.2000 incomplete on N-terminus (1-4856) plus, 496 aa 7 exons 39% to 4c3 modified intron boundary Am1.1_Un.2000 appears to have been split into Am1.2_Un.1387 and Am1.2_Un.1625 in v.1.2. MSSSTIIMGVSWLTMILSICLMTIVVLLLVRRGKFLYALRKVPCPPAFPIIGNAYELCAVREV (1?) 
AFKKMIKWGKELGDMYLIWVGMRPFIFLYKAEAIQPLLSSSVHIDKSLEYQYLQPWLGSGLVTST (1) GEKWHFHRKLLTPTFHSGLLELYLKTTIREAQILISCLRKEIGKPEFDIVPYAKRAALDIIC (1) DSSMGCNINAQKNFENEYVQAVNT (2) LASISQRRFLNVWMSFDPIFKLTSWGKRHDHALSVTHGFVNK (0) IIAERKAEWKDRKDTNFNEKSHKRQALLDLLLELSKDGKVLTDDDIRDEVNTFMFAGHDTTATSVSWILYALGRHPQYQ (0) ELIIEEYDETVGTKELTLDILSKLTWLEACIKESWRLYPVTPLIARQIYHPITIL (1) GHEIPIGSTVLVNSFLLHRDSRYFPEPDIYRPERFLPDGPKYPSYAFVPFSAGSRNCIGWKYGT MIVKVLILYILKNFHVESLDTEDQLRFISELVLHNADGLRLKITPRK* >Am1.2_Un.1527 (1556-4363) plus, 478 aa, 10 exons Am1.1_Un.8281 incomplete on both ends (644-1499) plus, 132 aa, 3 exons Am1.1_Un.2540 incomplete on N-terminus (5369-5938) minus, 71 aa, 2 exons 43% to 4M5 MISAILFFIFLLATLHYFLLHHRKFGKMINLIPGPEPLPILGNIPTFHNISP (1) SELWKFLTQLSKQYYPIYRMWTFLEAYVHICHPDDIE (0) TILGNIKFTKKGFGYKYLKPWFNTGLLTSS (1) GHKWHVRRKILTSAFHFNVLRQFVDIFIEDAERLIKTLESEEGIFVENLLQLTSEHTLNVIC (1) ETAMGTSLKNKEKFQYEYRKAVYNMGCIFANR (2) IVKPWFYYDFFFNLSPEGWQQSKLLKILHNFTRK (0) IIQERKEYHDKTNGRYLNDFHENINENDNNNDYNDCR (1) IRRKRLAMLDLLIEAHRNNKIDDEGIREEVDTFMFR (0) GHDTTAISFCFSIMLLAEHKEIQ (0) DRARAEIKAAIEENGGKLNITVLQNLPYLERCIKESLRLFPSVPRISRKLYIEVLMNYCTID (?) 43aa exon with PKG and PERF motifs (0?) 
*RHPYAYVPFSAGPRNCI (1) GQRFAMLELKTYLGLLLYNYYFEPIDYLKDVTFVSGIVLRLENPVRMKFIPVKKIC* >Am1.2_Un.1327 incomplete on N-terminus (901-1990) plus, 299 aa, 3 exons Am1.1_Un.3324 incomplete on N-terminus (901-1990) plus, 285 aa, 3 exons 50% to 4aa1 55% to 4AN3 partial (2)GQIMLLYRMIRPWLLIEWIYRLTKYGREEEKQRKNLFDTCFKMVKEKRDLLQSKDRISNNDIKKNK NISLLEYMVEINEKNPCFSDEDIVEECCTFMLAGQDSVGTATAMTIFLLANHPEWQNKCIEEIDEIFNG DTRFPTISDLKEMKCLEMCIKESLRLYPSVPIIGRTLGEDIKI (1) GKHIIPAGCSVLISPYSTHHLPHHFPDPDTFKPERFNSENSEKRHPYAYIPFSAGPRNCI (1) GYKFAMLEMKSIISAILRKCRLQSIPGKKEIRPKFRMTIRAQGGLWVKIIERDQILKSIAA* mito clan (7 sequences, 5 complete, 2 incomplete but from a single gene) >CYP314A1 Am1.2_5.28 (13954-16400) plus, 516 aa, 10 exons (first exon is a guess) CYP314A1 Am1.1_5.25 AADG02002903.1 incomplete on N-terminus (17670-19852) minus, 471 aa, 9 exons (19852-19708, 19574-19378, 19298-19111, 19031-18829, 18707-18579, 18471-18305, 18238-18160, 18072-17875, 17777-17670) 43% to 314A1, 55% to pea aphid (Acyrthosiphon pisum) P450 below cyan is extra seq added based on CF588143 note This seq is less than 55% identical to Drosophila and Anopheles 314A1s, but there appears to be only one orthologous seq in each species and it does not make sense to give the ortholog a different subfamily. 
MLLSSAWFEVIAAVLLTILIFVTSHRPAWWFWTATSHEASGKS (0) GKFKTVSKVPGPFSLPIFGTRWIFSCIGYYKLNKIHDAYK (1) DLNQRYGALCKEEALWNFPMISVFSRQDIETIIRRNSRYPLRPPQEVISHYRRTRRDRYTNLGLVNE (2) QGQTWHDLRVALTSELTAASTVLGFFPALNIVADSFIELIRRQRVGYKVTGFEELAYKMGLE (1) STCTLILGRHLGFLKPDSSSELATRLAEAVRIHFTASRDAFYGLPLWKLLPTCAYKQLIESEDAIYN (2) IISEIIETTIQEKRDDAKDESVEAIFQSILRQKNLDIRDKKAAIVDFIAAGIHT (0) LGNTLVFLFDLIGRNPAVQNKLYEETYALAPAGCDLTIDNLRKAKYLRACITESLR (2) LIPTTTCIARILDEPIELSGYRLTAG (0) TVVLLHTWIAGLNEENFKDAKKYLPERWTTPTTPHSPLLVAPFGAGRRICPGKRFVDLALQLILAK (0) IIREFEIIVEEELDLQFEFILAPKGPVSLGFRDRS* >CB336480 Tribolium castaneum embryonic cDNA MFEKIFQSLDVTSLLIIAI FFLFLEYRPPWWYRNNDCKKGVKLIPGPLALPGLGTTWIFFFGGFSFNRLHLYYENMYKR YGPVMKEEYWCNIPVINLFEKREIVKVLKAGGKYPLRPPVEAVAHYRRSRLIDTLALG >CF588143 pea aphid (Acyrthosiphon pisum) 48% to 314A1 D. pseudoobscura 2 RVLRQSGKYPIRPPNEVTANYRKSRPDRYTNTGLVNEQGEVWAMLRNKLTPELTSPRTIR 181 182 RFLPEVNQLADDFNNLISLARDGNNVVRGFEGYCNRMGLES 304 305 TCTLILGRRIGFLDGEVSETATRLADSVTSQFRASQEAFYGLPLWKLIPTKAYKDFVAS 481 482 EDALYDIVSEFVESALIDEQQSFTDVRSVFVSILQASELDNRDKKAAIIDYIAAGIKT 655 LGNTLVFIL 682 >CYP334A1 Am1.2_9.8 (72740-75349) plus, 530 aa, 10 exons (second exon is a guess) CYP334A1 Am1.1_9.6 alternate splicing possible? 
(1158493-1161102) minus, 514 aa, 9 exons
(1158603-1158493, 1158868-1158668, 1159034-1158959, 1159285-1159117, 1159526-1159346, 1159843-1159593, 1160068-1159927, 1160289-1160153, 1161102-1160830)
34% to 12A5m
a new family in the mito clan
MTESQTASVDESLRTDTIPLLDHTATTEVSPTTFEVSTMKVDTQIFDKAPLPFDEIPGPAILKIWEKYWKYVPLLG(1)
RLTWNRNITPLKYLFNEYGCIVRINGPLSGDIVMIHR(2)
PEHIAEVFKQEGDTPVRSGIDILQHYRLNYRKYRLAGPFSM(2)
QGTEWLEIRDKVEDTFNQISSTFFTKIDTCCNELITRICKIRNRQNE(0)
VPVSFYEDLIRWAMECFCDLTFNKRLGFLEPIGYNSSSEASKLINALTTAHKYMSRCETGFQVWRFFLTPFARKLFEACDVLDN (2)
VIGKYVRQAQCKLRIRKSHSEESSMTERSPVLEKLLLNEGIHPDDICTMLMDMIILGIQA(0)
TVNSEAFLLYHLAKNPRTQRKVYDEIISVLSNDNSSFTEKSLKNMPYLKACIQETLR(2)
LHPAIPYITRLLPKTISLHGYTIPKG(0)
TFVIMANQITSQREENFEDPFKFWPERWLSNSSKEDVHFSYLPFGHGIRSCLGKNMAEAKMMLLTAK(0)
VRQFRIEYDYADIKSRFMMVNVPNKPLRFRFVNRN*

>CYP315A1 Am1.2_Un.768 (10162-7910) minus, 535 aa, 6 exons
CYP315A1 Am1.1_Un.1189 (10257-12510) minus, 535 aa, 6 exons
38% to 315a1m, probable mito clan, maybe 315Bx
note: This seq is less than 55% identical to Drosophila and Anopheles 315A1s, but there appears to be only one orthologous seq in each species, and it does not make sense to give the only ortholog a different subfamily.
MNLAQNILKSGKSVSLSSNVIALKYNVPGCGYAGASQTSRIDDLSDISKSTDGGNRSKIEITEKLRDR
NYGTVAVATSESILQEMPEPRGIPVFGTLFSFILSGGPKKQHEYVDKRHKELGPVYKERIGPTTAVFV
NSIHEFRKIFRLEGSTPKHFLPEAWTLYNEIRKCRRGLLFM (2)
NGEEWVYFRKILNKVMLLPDPTNLMIAPCQEVAIELKRKWQKQIKTNNIISNLQVQLYQWSIE (1)
AMMATLMGSYWYSYKHQLSRDFEILAETLHEIFEYSAKLSIIPVKLAMNLRLPVWKKFVASA
DTAFEIVRMLVPEMAKLGGNGLLKKMMDEGIRAEDAICIVTDFILAAGDT (0)
TATTLQWILLLLCNHPEKQEELFKHLKDLSQEDILRLPLLKGIIKESLRLYPIAPFISRYLPEDSVIGNYFVPKG (0)
ELLVLSLYSSGRDAANFPQPNEFRPERWIRTQKGIYQGVVHPHASLPFALGARSCIGRKLAEIQISFALAE (0)
LIKSFKIECINKNQVKLILHLISVPSQSIKLKLMERN*

>CYP302A1 N-term Am1.2_Un.8443 incomplete on C-terminus (289-2450) minus, 202 aa, 3 exons
CYP302A1 N-term Am1.1_Un.2216 incomplete on C-terminus (289-2832) minus, 202 aa, 3 exons
45% to CYP302A1 aa19-183 in CYP302 family, mito clan (disembodied)
MCTLLKKCNQSIRKKLFIKFYSNEFTKSKIKINHSQPKAFYDIPGPKSLPIIGTLYKYLPFI (1)
GEYSFTNLYESGKKKLKCFGPLVREEIIPNVNVIWIYRPEDIAEIFKAESGLHPERRSHLALLKYRKDRPNIYNTGGLLPT (2)
NGSEWWRLRKEFQKVSSKPQDVINYLKETDCVIQEFVELCNNEKFADFLPLLSRLFLEC (1)

>CYP302A1 C-term Am1.2_Un.2118 incomplete on N-terminus (3101-5209) minus, 250 aa, 4 exons
CYP302A1 C-term Am1.1_Un.3145 incomplete on N-terminus (122-2230) plus, 250 aa, 4 exons
51% to 302A1 (disembodied)
note: these two pieces are probably from a single gene, missing about 70 aa in between
(2)IALELVSRKKNNMKIRYNKSFLDAYLENPVLDIKDIVGMACDMLLAGIDT (0)
TSYSTAYILYHLAKNQNIQEKLRIEATQLLKNHNEPISINILRNASYTKAVIKESLRLNPISIGIGRILQTDVVLSGYRVPKG(0)
SVVVTQNQIICRLPEYFEEPNLFIPERWLREYSENNNKINYKKTVHPYVLLPFGHGPRSCIARRFAEQNMQILLLR (0)
ICRRLKISWHGDDLGMISLLINKPNALLKFNFHDILNNNSV*

>CYP49A1 ortholog Am1.2_Un.343b (13080-19643) minus, 531 aa, 10 exons
CYP49A1 ortholog Am1.1_Un.423a incomplete on N-terminus and C-terminus of exon 6 (214-3539) plus, 312 aa, 7 exons
AmGroupUn.10899 incomplete on both ends (1221-1394) minus, 58 aa, 1 exon
46% to 49A1, 50% to 301A1 mito clan
Combined Am1.1_Un.423a and Am1.1_Un.10899,
MLKIFVIKFLCCRVKMQILNCKFTKLLKNTVKQSNNIIKTLETLTTEVEEQDWSRCRPYSEIPGPKPIPFLGNTWRFIPFI (1)
GDFKIQAVDQVSKKLYKEFGDIVKVEGLLGRPDMVFIYDANEIERIFRQEERMPYRPSMPSLNYYKHVLRKEFFKENAGVIAV (2)
HGESWYNFRSKVQQVMLQPRTARMYITSMEEASLAFLER (2)
IKKIRNKNDEVPDDFLNEIHKWSLE (1)
SIARVALDVRLGCLDDDANIETQQLIDAVTTFFKNVGILELKIPFWKLFNTPTWLKYVNALDTILS (2)
ITSRYTTVALSRTKEAEKSDKEPSLLERVLALENDTKLATILSLDLFLVGIDT (0)
TSSTVASTLYQLALHPDEQDRAYNEVCNILPSKDMQLDGKHLDKLKYLKACIKETLR (2)
MYPVVIGNGRCMTKDTIIKGYRVPKG (0)
VQVVFQHYVISNLDKYFPHSDKFLPERWLQSDGVRHSFASLPFGYGRRMCLGRRFAELEMLVVISK (0)
ILQRYKIEYHHEKLEYYINPMYTPKGSLNLKFIDR*

>CYP301A1 ortholog Am1.2_Un.343a (9993-12955) plus, 532 aa, 9 exons (exon 1 is a guess)
CYP301A1 ortholog Am1.1_Un.423b incomplete on N-terminus and missing exon (3686-6941) minus, 534 aa, 8 exons
69% to 301a1 D. melanogaster
MMTGKTRYLFFKIHFLLLYVLI (2)
CMVDHDTTTIQQGKPYKDIPGPRPIPILGNTWRLFPMIGQYEISDIAKLSQIFYDEYG
KIVRLTGLIGRPDLLFVYDVDEIEKIYRQEGPTPFRPSMPCLVHYKSVVRKDFFGSLPGVVGV (2)
HGEPWREFRTRVQKPILQPQTVRKYITPIEMVTSDFIQR (2)
IQEIKGEDGEVPGDFDNEIHKWALE (1)
CIGRVALDVRLGCLSSNLTSDSEPQKIIDAAKFALRNVAILELKAPYWRYVPTLLWSRYVRNMNYFIE(2)
VCMKYIDATMERLKTKKAVDEYDLSLMERILAKETDPKIAYILALDLILVGIDT (0)
ISMAVCSILYQLATRPEEQEKIYQELVEILPDPSVPLNMSHLDKAIYMKAFIREVFR (2)
VYSTVIGNGRTLQNDTIICGYKVPKG (0)
VQVVFPTVVTGNMEKYVTDAKIFKPMRWLKESTKTLHPFASLPYGHGARMCLGRRFADLEIQVLLAK (0)
LIRSYKLEYHHKPLKYKVTFMYAPDGELKFKVLPR*