Honeybee Cytochrome P450s from version 2 of the Apis mellifera genome

 

Last modified Feb. 22, 2005

 

These sequences were submitted Feb. 11, 2004 by May Berenbaum in collaboration with

Gene Robinson (genome sequencing), Hugh Robertson (genome annotation) and Reed

Johnson (P450 annotation).

 

On Sept. 2, 2004 Reed Johnson submitted the revised sequences from Honeybee Version

1.2.  These sequences were further edited and revised by D. Nelson on Sept. 13-17.

Many partial sequences have been combined and the number of complete genes

increased from 20 to 33.  The number of P450 genes including fragments for ver 1.2

is 55, including 22 partial sequences. 

 

On Feb. 9, 2005 Reed Johnson submitted the revised sequences from Honeybee Version 2.

More sequences are complete and some contamination from cow DNA was noted (CYP2E1).

The gene statistics are 44 complete genes, 3 incomplete but expected to be full length, 2 pseudogenes plus 5-6 small fragments.

 

There are 4 CYP clans in insects: CYP2, CYP3, CYP4 and mitochondrial.

CYP2 is the clan with CYP18 in it.  Sometimes it is called the CYP18 clan.

CYP2 has CYP15, 18, 303, 304, 305, 306, 307, 342, 343

CYP3 has CYP6 and 9 in it and CYP28, 308, 309, 310, 317 (a CYP6 subfam)

CYP4 has CYP4, 311, 312, 313, 316, 318

the mito clan has CYP12, 49, 301, 302, 314, 315
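The clan bins listed above can be expressed as a small lookup table. The following is only a sketch: the family numbers are transcribed from the list above, while the names CLAN_FAMILIES and clan_of are hypothetical helpers introduced for illustration and are not part of the original annotation.

```python
import re

# Family-to-clan table transcribed from the list above (insect families only).
CLAN_FAMILIES = {
    "CYP2": {15, 18, 303, 304, 305, 306, 307, 342, 343},
    "CYP3": {6, 9, 28, 308, 309, 310, 317},
    "CYP4": {4, 311, 312, 313, 316, 318},
    "mito": {12, 49, 301, 302, 314, 315},
}

def clan_of(cyp_name):
    """Return the clan for a name such as 'CYP6AS13', or None if the family is not listed."""
    match = re.match(r"CYP(\d+)", cyp_name)
    if match is None:
        return None
    family = int(match.group(1))
    for clan, families in CLAN_FAMILIES.items():
        if family in families:
            return clan
    return None

for name in ("CYP18A1", "CYP6AS13", "CYP306A1"):
    print(name, clan_of(name))
```

Unnamed fragments (for example the Am2_Un.* entries) return None and still have to be binned by sequence similarity.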

 

The honeybee sequences have been sorted into these main CYP clan bins.

48 genes have been named. 1 named gene is a pseudogene.

 

CYP2 clan (8 sequences, 7 complete, 1 incomplete)

 

>CYP18A1 Am2_13.1 (31931-34244), 524 aa, 5 exons

version 1.2 = CYP18A1 Am1.2_13.1b (32196-34509) plus, 524 aa, 5 exons

version 1.1 = Am1.1_13.1a (71904-74217) plus, 524 aa, 5 exons (71904-72236,

72653-72774, 72894-73323, 73404-73668, 73794-74217)

60% to CYP18A1 D. melanogaster

MGGTRIEVLCTFLVFLGVLLVARCLQWLRYVRSLPPGPWGVPVFGYLPFLKGDVHL

RYGELAKKYGPMFSARLGTQLVVVLSDHRTIRDTFRREEFTGRPHTEFINILGGYG(1)

IINTEGAMWKDQRKFLHDKLRGFGMTYMGGGKKIMESRIM(0)

REVKTFLRGLASKRGTPTDVSASLGMSISNVICSIIMGVRFQHGDARFKRFMDLIEEGFKLFGSMAAVNFIPV

MRYLPCLQKVRNKLAENRAEMAGFFQETVDQHRATFDEGTMRDLVDAYLLEIEKAKGEGRATTLFQGKNHD(1)

RQMQQILGDLFSAGMETVKTTLEWAIILMLHHPDAAIAVQEELDQVVGKS

RMPVLEDLPFLPITEATILEVLRRSSVVPLGTTHATTR(2)

DVTLHGYTIPAGSQVVPLLHAVHMDPELWEKPEEFRPSRFLSAEGKVQKPEYFMPFGVGRRMCLGDVLARMEL

FLFFSSLMHTFELRSPQGSSLPSLRGNAGVTVTPDPFDVCLLPRNLDLIEDNDMISTGAILRNIGSH*
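The "(0)", "(1)" and "(2)" markers in the sequence mark intron positions and their phase, so the residue and exon counts in a header can be checked directly from the sequence lines. The sketch below does this for the CYP18A1 record above. The function name split_exons is a hypothetical helper, and the pattern only handles bare single-digit markers; annotated markers such as "(0 GC boundary?)" would need extra handling.

```python
import re

# Sequence lines of the CYP18A1 record above, copied with their phase markers.
RECORD = """
MGGTRIEVLCTFLVFLGVLLVARCLQWLRYVRSLPPGPWGVPVFGYLPFLKGDVHL
RYGELAKKYGPMFSARLGTQLVVVLSDHRTIRDTFRREEFTGRPHTEFINILGGYG(1)
IINTEGAMWKDQRKFLHDKLRGFGMTYMGGGKKIMESRIM(0)
REVKTFLRGLASKRGTPTDVSASLGMSISNVICSIIMGVRFQHGDARFKRFMDLIEEGFKLFGSMAAVNFIPV
MRYLPCLQKVRNKLAENRAEMAGFFQETVDQHRATFDEGTMRDLVDAYLLEIEKAKGEGRATTLFQGKNHD(1)
RQMQQILGDLFSAGMETVKTTLEWAIILMLHHPDAAIAVQEELDQVVGKS
RMPVLEDLPFLPITEATILEVLRRSSVVPLGTTHATTR(2)
DVTLHGYTIPAGSQVVPLLHAVHMDPELWEKPEEFRPSRFLSAEGKVQKPEYFMPFGVGRRMCLGDVLARMEL
FLFFSSLMHTFELRSPQGSSLPSLRGNAGVTVTPDPFDVCLLPRNLDLIEDNDMISTGAILRNIGSH*
"""

def split_exons(text):
    """Split an annotated protein into per-exon segments and intron phases."""
    joined = "".join(text.split())            # drop line breaks and spaces
    parts = re.split(r"\((\d)\)", joined)     # alternates segment, phase, segment, ...
    segments = parts[0::2]
    phases = [int(p) for p in parts[1::2]]
    # Remove the stop-codon asterisk and skip empty leading/trailing segments.
    exons = [s.rstrip("*") for s in segments if s.rstrip("*")]
    return exons, phases

exons, phases = split_exons(RECORD)
print(len(exons), "exons,", sum(len(e) for e in exons), "aa, phases", phases)
```

The counts can then be compared with the "524 aa, 5 exons" given in the header line.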

 

>Am2_Un.5851 new (1417-191), this is a Bos taurus CYP2E1 low similarity to CYP18 mid region

(1)IIFNNGSTWRDTRRFSLTTLRDLGMGKQGNEQRIQREAHFLLEVLRKTQG(1)QPFDPTFVVGFAPYNVISDILFHKRFDYKDQTSLRLMSLFNENFYLLSSPWIQVKHSPRFCLFCTLLQFDD(1)YLQYLPGSHRKLLKNVSEVKSYALERVKDHQKSLEPSCPRGFLDTMLIEMAKVHS

 

>gi|2511604|emb|AJ001715.1|BTCYP2E Bos taurus mRNA for cytochrome P450 (CYP2E)

          Length = 1603

 

 Score =  313 bits (803), Expect = 2e-84

 Identities = 157/173 (90%), Positives = 160/173 (92%)

 Frame = +3

 

Query: 1   IIFNNGSTWRDTRRFSLTTLRDLGMGKQGNEQRIQREAHFLLEVLRKTQGQPFDPTFVVG 60

           IIFNNGSTWRDTRRFSLTTLRDLGMGKQGNEQRIQREAHFLLEVLRKTQGQPFDPTFVVG

Sbjct: 360 IIFNNGSTWRDTRRFSLTTLRDLGMGKQGNEQRIQREAHFLLEVLRKTQGQPFDPTFVVG 539

 

Query: 61  FAPYNVISDILFHKRFDYKDQTSLRLMSLFNENFYLLSSPWIQVKHSPRFCLFCTLLQFD 120

           FAPYNVISDILFHKRFDYKDQTSLRLMSLFNENFYLLSSPWIQ+ ++           F

Sbjct: 540 FAPYNVISDILFHKRFDYKDQTSLRLMSLFNENFYLLSSPWIQLYNN-----------FP 686

 

Query: 121 DYLQYLPGSHRKLLKNVSEVKSYALERVKDHQKSLEPSCPRGFLDTMLIEMAK 173

           DYLQYLPGSHRKLLKNVSEVKSYALERVKDHQKSLEPSCPRGFLDTMLIEMAK

Sbjct: 687 DYLQYLPGSHRKLLKNVSEVKSYALERVKDHQKSLEPSCPRGFLDTMLIEMAK 845
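The identity and positive percentages in the BLAST summary can be recomputed from the fractions. A minimal sketch, assuming the summary line has the layout shown above; parse_fraction_stats is a hypothetical helper name. The percentages printed in this file agree with truncation rather than rounding (157/173 is 90.75%, reported as 90%).

```python
import re

def parse_fraction_stats(line):
    """Extract 'Name = n/m (p%)' entries from a BLAST summary line."""
    stats = {}
    for name, num, den, pct in re.findall(r"(\w+) = (\d+)/(\d+) \((\d+)%\)", line):
        stats[name] = (int(num), int(den), int(pct))
    return stats

line = " Identities = 157/173 (90%), Positives = 160/173 (92%)"
stats = parse_fraction_stats(line)
for name, (num, den, pct) in stats.items():
    # Integer division truncates, matching the percentages printed in the report.
    print(name, f"{num}/{den}", "reported", pct, "recomputed", 100 * num // den)
```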

 

>CYP306A1 Am2_13.1 (38911-36903), 499 aa, 7 exons

version 1.2 = CYP306A1 Am1.2_13.1a (39182-37170) minus, 499 aa, 7 exons

version 1.1 = CYP306A1 Am1.1_13.1b (76876-78884) minus, 499 aa, 7 exons (78884-78557,

78493-78345, 78209-78007, 77901-77595, 77538-77383, 77298-77123,

77057-76876)

44% to 306A1 Anopheles gambiae = CYP306A1 in 18clan/2clan

CYP306A1 is the probable ortholog of CYP306A1 in Diptera (flies and mosquitoes).

The % identity is below the usual cutoff for subfamily membership,

but it makes sense to name orthologs with the same name.

MILDHYIAIFVLPFLLLLYVVRKNRKARRLPPGPWQLPLLGYLPWIDAEKPHETLTRLS

RVYGPVCGFRMGSVYTVLLSDPQLIRQSFAKDSITNRAPLYLTHGIMKGYG(1)

IICAEGEQWKDQRKFISNCLRNFGMVKHEGAKRDKMEERISDAVNECVS(0)

VLRDRGANGPIDPLDTLHHCLGNLVNSIVFGKTYEEEDRIWKWLRHLQEEGVKQIGVAGPLNFLPFLR(2)

FLPQYGRVIRSIVDGKDKTHEIYRQILDEHRARVDSGNGCKIDSFLAAFDEQ

MRKKDGAESGYFTEPQLYHLLADLFGAGTDTTLTTLRWFLLFMAAHPMEQ(0)

EKIQSEMDLCLREGEQPTLNDRIVMPRLEAAIAEVQRIRSVTPLGIPHGTSE(0)

DVEIGGYDIPCGAMIVPMQWAIHTDPAYWRDPLEFRPDRFLSEDGTFFKPESFLPFQNG(1)

KRVCVGEELARMILFLFAGRILRAFSVRVPAGEIADLEGECGITLVPKPHRLAFVGRDR*
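Exon coordinate lists such as the version 1.1 list for CYP306A1 above can be sanity-checked by summing the exon lengths and testing for overlapping intervals. This is only a sketch: exon_lengths and has_overlap are hypothetical helper names, and the expected coding length assumes the stop codon is included in the last exon, which the source does not state.

```python
def exon_lengths(exons):
    """Inclusive length of each (start, end) pair, whatever the orientation."""
    return [abs(end - start) + 1 for start, end in exons]

def has_overlap(exons):
    """True if any two exon intervals share at least one position."""
    spans = sorted((min(s, e), max(s, e)) for s, e in exons)
    return any(prev[1] >= cur[0] for prev, cur in zip(spans, spans[1:]))

# Version 1.1 exon coordinates listed for CYP306A1 above (minus strand).
cyp306a1 = [(78884, 78557), (78493, 78345), (78209, 78007), (77901, 77595),
            (77538, 77383), (77298, 77123), (77057, 76876)]

total = sum(exon_lengths(cyp306a1))
expected = (499 + 1) * 3   # assumption: 499 codons plus one stop codon
print("summed exon length:", total)
print("expected coding length:", expected)
print("overlapping exons:", has_overlap(cyp306a1))
```

A difference of a nucleotide or two between the two lengths may only reflect a coordinate convention, whereas overlapping exons or a large difference usually point to a transcription slip that is worth checking by hand.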

 

>Am2_Un.5840 new (1363-466), 74 aa, 2 exons 41% to 306A1 this is a CYP2E seq from Bos taurus

(0)GTVVIPTLDSVLHDRQEFPEPEKFKPEHFLNENGKFKYSDHFKAFSAG(1)KRVCVGEGLARMELFLLLAAILQHFT*

 

>gi|2511604|emb|AJ001715.1|BTCYP2E Bos taurus mRNA for cytochrome P450 (CYP2E)

          Length = 1603

 

 Score =  123 bits (308), Expect = 3e-27

 Identities = 62/73 (84%), Positives = 62/73 (84%)

 Frame = +3

 

Query: 1    GTVVIPTLDSVLHDRQXXXXXXXXXXXHFLNENGKFKYSDHFKAFSAGKRVCVGEGLARM 60

            GTVVIPTLDSVLHDRQ           HFLNENGKFKYSDHFKAFSAGKRVCVGEGLARM

Sbjct: 1176 GTVVIPTLDSVLHDRQEFPEPEKFKPEHFLNENGKFKYSDHFKAFSAGKRVCVGEGLARM 1355

 

Query: 61   ELFLLLAAILQHF 73

            ELFLLLAAILQHF

Sbjct: 1356 ELFLLLAAILQHF 1394

 

>CYP307B1 Am2_14.4 (871542-875286), 507 aa, 3 exons

version 1.2 = CYP307B1 Am1.2_14.10 (757965-761727) plus, 507 aa, 3 exons

version 1.1 = Am1.1_14.9 (18918-22680) minus, 507 aa, 3 exons (22680-22278,

20154-19363, 19246-18921)

CYP307B1  55% to 307B1 Anopheles, probably the ortholog of 307B1

MIPLTATTCFLIAITFLALALILLDHLRSKKTTKSVVPGDDDQHALPEPPGPKPWPILGSLHILGRYDVPYKAFADLVRDFDCQVIKLRMGSVPCVVVNGLENIKEVLTVKGHHFDSRPNFARYHLLFGGNKENS(1)

LAFCNWSDVQKARREMLRAHTFPRAFSTRFNELNGIIGDEMEFMVNHLDSLSGTSVHAKPLILHCCANIFITYLCSKNFHLEHDGFRNMVENFDKVFFEVNQGYAADFLPFLMPLHHRNMARMAHWSHEIRRFVIKNIIADRVNSWNDVVPEKDYVDCLINHVKSGTEPQMSWNTALFVMEDIIGGHTAIGNLLVKVLGFLATRPEIQRLAQDEIDALGLAGNFVGLENRRSLPYVEAIILETIRIIASPIVPHVANQDSSIAG(1)

FRIKKDTFIFLNNYDLNMSTDLWTSPEEFMPDRFVQNGRLLKPEHFLPFGGGRRSCMGYKLVQYVSFAILASILKNFTITPVQKEDYTIPIGNLALPEMTYKFRFERR*

 

>CYP303A1 Am2_Un.6309 (6408-4574), 375 aa, 8 exons

version 1.2 = CYP303A1 Am1.2_Un.1361 incomplete on N-terminus (5389-7223) minus, 378 aa, 8 exons

AADG03013520.1 (WGS)

version 1.1 = CYP303A1 Am1.1_Un.2253 incomplete on N-terminus (55-1889) plus, 378 aa, 8 exons

43% to 303A1 D.mel., 44% to 303A1 Anoph. CYP303A1 in 2 clan/18 clan

may be the ortholog of 303A1, first exon below is 75% to 304A1

(1)LLLVDGNLWNEQRRFVLKHLRDFGFGRQN (1)

LYMNANEYTGNNVTQSQLGTIISMHNIFGITVLNSLWKMLAGKR (2)

YNIDDKELIYFQRILSITLNEIDMLGAPFSHFPLLRFIAPEISGYKSFVKIHEELWKFFK (0)

DEVNNHKNTFNSDSPGNLIDIYLTILNSENYGKTFSD (1)

VSEPQLVAICVDLFMAGSETTSKVLGFCFLYLVLFPHVQKKAHEEIDRVIGRNKLPTAEDKAK (2)

MTYMNAIVLESLRMFAGRSLNLPHRVQRDTKISDYKIPK (0)

NTIIITNFNGILMDESWGDPENFRPERFIDGSGNIVTPSRFLPFSAG (1)

KHRCMGENLAKTNIFIIATTLLQAFTFSEIPGEKPTIEHFIDGTTISPKPYRVNVSLRI*

 

>CYP305D1 Am2_7.2 frameshift repaired in 7th exon  (72431-68935), 490 aa, 8 exons

version 1.2 = CYP305D1 Am1.2GroupUn.127b (70044-73538) minus, 490 aa, 8 exons (first exon is a guess)

version 1.1 Am1.1_Un.7452 incomplete on both ends (700-954) minus, 85 aa, exon 2

version 1.1 Am1.1_Un.6110 incomplete on both ends (1-622) minus, 126 aa, exons 6,7

Two fragments from v.1.1 have been combined in this sequence.  The first exon is a best guess. BI513047.1 (EST) AADG03008281.1 (WGS) 39% to 305A2, 39% to 305B1 Bombyx

39% to 305B1 silkworm

MNKNFVKIFNLLLYLFIDVFNNLFFIECDG(1)

PFSWPFIGNQILLKRLSRKFGGQHKAFMELSKRYNSDIITVNISYEKIIVVSGSK

FCDMILQNEEFQGRPWNEFIKVRNMGKKQG(1)

ITMNDGTEWKELRNWMMRTMKIFGFGKSEMIEMIQHQLVIFSENLNKNKLHQLKLLFVPAVINVLWNFITGELVAFNQQQK(2)

LEHFLDLLDRRSRCFDITGGLLAAFPWIRYIAPEISGYNIMCMLNKELKDFLM(0)

KTINDHKEKYIEGKEADLIDMFIQEMRKNEKSSIFTD(1)

EQLMMILIDLFLAGFTTTSTTLDFLFLIVTLFPDVQRKVQKEIDSVIPYDRLPNMEDKAK(2)

LPYVEAVISETYRLWPVFPIIGPRRVLCDTNIDKYVIPKDTTILFNTYSINKDPTL

YPDPDKFMPERFIKNGVFEPDEYSLQFGKG(1)

KRRCPGDILAKATIFILFVGIMQKYTLLPVPGKGPHSIKINSGITLTPQPYNVLVEKR*

 

>CYP15A1 Am2_7.2  (65177-67812), 500 aa, 8 exons

W->L, changed splice donor on penultimate intron and both splice sites on final intron

version 1.2 = CYP15A1 Am1.2_Un.127a (66491-68918) plus, 504 aa, 8 exons XM_392687.1 partial

gnl|Amellifera1|165771382 BCM Apis mellifera 12/11/2003

Am1.1_Un.897 incomplete, missing exon 5 (51606-53773) plus, 357 aa, 5 exons

Am1.1_Un.10970 incomplete on N-terminus (112-677) plus, 121 aa, 2 exons

Two fragments from v.1.1 have been combined.  18clan/2clan

47% to CYP15A1 Diploptera punctata (probable ortholog)

39% to CYP15B1 Anopheles gambiae 36% to 303A1, 35% to 305A3, 31% to 304B1

MLYVVISLLLALYCIFCIYDCVKPHNFPP (1)

GPKWLPLIGCFLTFRRLKLKHKYTYVAFQELSKTYGPILGLKLGSQKLVVI

STHDLVKKVLLQDEFNGRPDGFFFRVRAFGKRK (1)

GILFTEGSMWSQCRRFTMRHLRSFGLGQSTMEKYLTVEAENLVNYLRRVST

KGPVPMHTAFDIAVLNSLWCMFAGHRFDYENEKLAEILEIVHDSFR (2)

LMDTMGGIISQMPFLRFIIPELSGYNNLMEILRKLWNFLDEEINNHEKHLSGNQPQDLIEAFLLEISSRNGVQNDSIFDR (1)

ENLLILCLDLFLAGSKTTTDTLSTSILFLSLHSEWIKILQEELDNVVGRSRSPTLEDYSSLPIMESFLAE (0)

IQRFLILAPLGVPHKTTKDVILNGYNIPK (0 GC boundary?)

DTTVLLDFHSAHNDPAYWDHPEEFRPQRFLDANGRFCQNNANIPFGL (1)

GKRRCPGEMLARTSLFLYFAYVIHYFDIEISPEHGKPDLNGHDGFTISPKSYYLKITARSDVTNCSTI*

 

Note: I still argue for a GC boundary at YNIPK, because this is a conserved sequence and it should not be deleted to make a conventional GT boundary at TTKD. The same is also true at the heme signature intron.  A GC boundary preserves conserved motifs at this site.

 

>Am2_Un.4815  new (4673-4457), 72 aa, 1 exon, similar to 15A1 this is Bos taurus CYP2E1

(2)MGLPWRQRGSPHGLLSLQLAERYGPVFTLYLGSQRAVVVHGYKPVKEVLLDYKNEFSGRGENPGFQMHKNNG(0)

 

>CYP2E1 TC189658 = AJ001715 79% to human 2E1 Length = 1900

        Length = 496

 

 Score = 291 (102.4 bits), Expect = 7.1e-29, P = 7.1e-29

 Identities = 54/55 (98%), Positives = 55/55 (100%)

 

Query:    18 QLAERYGPVFTLYLGSQRAVVVHGYKPVKEVLLDYKNEFSGRGENPGFQMHKNNG 72

             +LAERYGPVFTLYLGSQRAVVVHGYKPVKEVLLDYKNEFSGRGENPGFQMHKNNG

Sbjct:    59 RLAERYGPVFTLYLGSQRAVVVHGYKPVKEVLLDYKNEFSGRGENPGFQMHKNNG 113

 

>CYP342A1 Am2_8.3 (44357-46424), 505 aa, 7 exons

version 1.2 = CYP342A1 Am1.2_8.3 incomplete on N-terminus (1269-3097) plus, 505 aa, 7 exons

version 1.1 Am1.1_Un.8493 incomplete on both ends (509-1028) minus, 141 aa, 2 exons

36% to 304A1 36% to 304B1 18clan/2clan

MISFLFIIFLLLIIYKIYNSVIHVSSNTPPC(1)

LPRLPIIGSYWHLLWHDYEYPYNGIIHYVNKLQSKIVTCYFGSHKTIIANDYKSIKEVL

TKQEFNGRPINVDIVLQRAFGKSLG(1)

IFFTEGTLWHEQRRFALRHMRDFGFGRRHEIFETNVMEEIAILVDMLKEGPINDEEK(0)

KFLKNGYACFPDILYPYVANVILNIMFGERFDRSQYHKLIYFCESSMMFQKSLDTSG

GAIFQFWFLKYFGNIFGYTNAIKATYQMINFIE(0)

EYIDNKKDLDDYDKGLIGRYLKILKEKNNITSTFSQKQLIMTLVDFMFPATSALPSA

LVHAIKLVMHHPRVVNNIQEEIDRVVGTGRLVTWSDRKN(2)

LPYIEATIRESLRYETLTPLSVFHKTLKKTTLCDYDIPKDTLVVTNLVALNTDPDLW

GDPENFRPERFLDENNELRKDFTFPFGFG(1)

HRVCPGETYSRYNMFEVFAVLMQNFNFSFVEGEPTGLDDKESGLIVTPKKTWIQVKARNMK*

 

>CYP343A1 3241+6423 Am2_Un.573, R->K (725-7080), 496 aa, 8 exons  combined Am1.2_Un.3241 and Am1.2_Un.6423 to complete

version 1.2 = Am1.2_Un.3241 (94-459) minus, 96 aa, 2 exons at N-terminus up to C-helix

version 1.1 = Am1.1_Un.7901 incomplete on C-terminus (1-459) minus, 126 aa, 2 exons 40% to 15B1

version 1.2 = Am1.2_Un.6423 incomplete on N-terminus (67-247) minus, 254 aa, 4 exons

AADG03019874.1 (WGS)

Am1.2_Un.1305 incomplete on N-terminus (2043-2647) plus, 146 aa, 2 exons

Am1.1_Un.960 incomplete on both ends (8311-11562) minus, 147 aa, 2 exons

One fragment in v.1.1 was divided into two in v.1.2, but they are probably from

one gene. 34% to 305a4, 36% to CYP15A1, new family in the CYP2 clan

MWFVILCFVIVLIKILFDYSRPINFPPG(1)

PRGLPFIGNILDIIKLINETKYYSDTWCRLAEKYGSVVGLRLGLDQPLIIVSGKSAVTEMLNR

SEFDGRPSGFLYKYRCGGMQQGILFTDTDVWHSQRR(2)

FALKTLKQFGFGKNSMEHILQHDAIALTNIIIELTKDGTVKNIRSIISAAVLSNLWLLIDGTK(2)

FDIGMENSNLKEAINIVQDIVKSSNVSGGIINQFPFLRHLFPNLTGFSAFVERQKRINNFFM(0)

EVIAKHKWKKINEEGTNFIDVYLQEIQKKNSSHSFFNE(1)

NQLLYIIKDLFSAGVDTTNSTIGFIIAFLVVHQDVQSKVYDEISRVIDKDIYPSLSDKDR(2)

LPYLKAVIAEVSRLANIGPTSIPHRAVKDSTFLGFEIKKNYTLLANFKSIHMDKEHWGDPEIF

RPERFINEKGDFINDSWLMPFGLG(1)

RRKCLGETLAKNTVFLFVACMLQRLHFMLPSNHPPPCLQGIDGFVIAPPMMDIIAVQRF*

 

note: intact seq 36% to 15A1, 35% to 15B1, 34% to 305A4

 

CYP3 clan (31 seqs, 28 complete and 3 incomplete, 2 of the incomplete are pseudogenes, 5-6 fragments)

 

>CYP6AQ1 Am2_12.16   (1067523-1070348), 514 aa, 5 exons

version 1.2 = CYP6AQ1 Am1.2_12.14 (417204-420026) plus, 514 aa, 5 exons

version 1.1 = CYP6AQ1 Am1.1_12.14 (315628-318450) plus, 514 aa, 5 exons (315628-316108,

316238-316604, 316677-316861, 316989-317277, 318272-318450)

45% to 6K1, 42% to 6g2, 43% to 6G1ps, new subfamily in CYP6

cyan = missing seq. from EST BE844578

yellow = EST BE844462, underlined seq = EST BE844394, green = EST BE844353

magenta = EST BE844352, gray = EST BE844331 all ESTs from antennae

 

MNLLTPYWSLDILIVSSSLMIAVYLYASWKLKYWSRRGIMQITPSPLFGNFKKCILFQKSVSEIIRELYGQNEGLPFMGFY

IFYKPFFLVRDIELVKHILVKDFNTFANKHTSADSKNDRIGYSNLFIIKNPAWKYLRGKLTSVFTSGKLKKMFDLMLIIG (1)

KNLEKHLELLNLDG

NGKEVELKDLCANFTTDLIGTTAFGVNLNSLKDPNSDFRENGRLVFDYNLKRAFEFFSIFFFPNLS

KYVSIKFFGKATDYFRNSFWSVINQRIESNVKRNDLIDCLIELREKHKNDESFEGFR (1)

FDGDDLVSQAAIFFTGGFETSSTTISFTLYELALNKDIQKTVRTEIHEALAQTDGKITYDM (0)

ITNLPYLDMVVSETLRKYPPLGFLDRVALHDYKIPNSDVTIDKDTPVIIPMIAFHYD

PKYFPNPEKYDPLRFSEEVKKTRPSYVYMPFGEGPHICIG (1)

MRLGLLQSKLGIIEILKDYEVSPCEKTKIPMVLDPKGLTTTALGGLYLNIRKITIAAG*

 

>CYP6AR1 Am2_5.8 (434944-432195), 502 aa, 5 exons

version 1.2 = CYP6AR1 Am1.2_5.5 (430078-432827) minus, 502 aa, 5 exons

version 1.1 = CYP6AR1 Am1.1_Un.19 (44801-47550) plus, 502 aa, 5 exons

50% to AmGroupUn.5496, 47% to AmGroupUn.792b, 38% to 6a13ps all best hits to 6as

MSWLMIETVGLIATVFFLLYYYSMSKLDYWRKRGVKGPKPLPFLGNFKDVLLAKESTMDCFERA

YKEFKDEPMVGMYGSHEPLLILRDLDLIKDVLIKDFNKFAQRTQGAIRE(0)

VEPLSEQLFRLDAERWRPLRLKLSSFFSSGKLKEMFHLFVECSDNFEKYLEKMVEKGGLVECRDAAAKFSTDVIGACAFSIHTNALTDENSQFRKMGKQALATNLQQFLNDRLREYPFLFKIFGRFFVDHEVTNFFANSIKDAMDY

RIQNNVHLRDVIDILADIRENPTKCGLKE(1)

ADNLFLTSQAVLFFLAGFENASLTISNALYELAWKPEIQEKARAEIVNVLQKYDGKITYDGLEEMKYLEACIFE(1)

TLRMYPVLQWLSREAMETYTFTGTKVTIPKGQQVFLPIYAIQRDPDI

YPNPDNFDPERFTDDKIKTRHSMTHLPFGDGPRHCSG(1)

IRLAKKQLKVGLVTVLSKFKVEVCEKTRKIYQKDKKPLFLLQPVDGIHLKISKVSV*
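The record headers follow a loose pattern (name, scaffold, coordinates, length, exon count) that can be pulled apart with a regular expression. The sketch below handles only the simplest header form, as in the CYP6AR1 record above; HEADER and parse_header are hypothetical names, and headers with extra notes between the fields will not match and return None. Treating descending coordinates as a hint of minus-strand orientation is an assumption, not something the source states.

```python
import re

# Matches headers of the form: >NAME SCAFFOLD (start-end), N aa, M exons
HEADER = re.compile(r"^>(\S+)\s+(\S+)\s+\((\d+)-(\d+)\),\s*(\d+)\s*aa,\s*(\d+)\s+exons")

def parse_header(line):
    """Parse a simple record header into a dict, or return None if it does not match."""
    m = HEADER.match(line)
    if m is None:
        return None
    start, end, aa, exons = (int(x) for x in m.group(3, 4, 5, 6))
    return {
        "name": m.group(1),
        "scaffold": m.group(2),
        "start": start,
        "end": end,
        "span": abs(end - start) + 1,
        "descending": start > end,   # assumption: may indicate the minus strand
        "aa": aa,
        "exons": exons,
    }

rec = parse_header(">CYP6AR1 Am2_5.8 (434944-432195), 502 aa, 5 exons")
print(rec)
```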

 

>CYP6AS1 Am2_Un.5363 (3492-1105), 498 aa, 5 exons

version 1.2 = CYP6AS1 Am1.2_Un.6491 (264-2651) plus, 498 aa, 5 exons

version 1.1 = CYP6AS1 Am1.1_Un.5496 (264-2651) plus, 498 aa, 5 exons

44% to 6A14, 64% to AmGroupUn.792b

MDYFQILCAISIVILTIYYYYSSKYTFWKKRGISGPKPIIFFGNFVDSIIQKRSTSEAVKK

WYDDYKHESVFGIFGGTTPLLVINDLDMIKDVLIRDFSLFVDRGFHIFPK(0)

IEPLSEHLFLLEAERWRPMRMKLSPIFTSGKLKEMFFLIMESAGNLEKYLDEVIKKDEMVECRELAAKFMTDVIGSCAFGINTNSLLEEDSEFRRMGKKISTPNLKVMLGNICKEFFPPLYEIVGSIFTLKDVNEFFIN

LVSDTMKYRKDNNIIRSDFINMLMQLKEHPEKMENIE(1)

LTNTLLTAQAVVFFVAGFETSSSTMAFSLYELAQNQEIQDKLREEIRNMHEKNKGVLTYTDVKEMKYLDKVFKE(1)

TLRKYPILPMLFRQAMENYTFKDTKITIPKGMKLWVPVHGIHHDPNI

YPEPEVFDPERFEDDAFASRHPMSYLPFGDGPRNCIG(1)

ARFAHYQSKVGLITILRHHKVNVCEKTTIPFKADERSFLLTLKGGVHLKITKI*

 

>CYP6AS2 Am2_Un.476 (25830-23485), 498 aa, 5 exons

version 1.2 = CYP6AS2 Am1.2_Un.601a (2171-4675) plus, 498 aa, 5 exons

version 1.1 = CYP6AS2 Am1.1_Un.792a incomplete on C-terminus of exon 2, missing 3rd exon

(8375-10702) minus, 356 aa,  EST BE844607 from antennae

86% to AmGroupUn.5496

MDYFQILCAISIVILTIYYYYSLKYAFWKDRGISGPKPIIFFGNFGNSIIKKKSLSETVK

KWYDDYKHESVFGIFEGTIPVLVINDLDMIKDILIRDFSLFVDRGFHIFPK(0)

IEPLTQHLFLLEAERWRPMRMKLSPIFTSGKLKEMFSLIVESAGNLEKYLDEVIKKNEMVECRDLAAK

FTTDVIGSCAFGINTNSLLEEDSEFRRMGKKIFSPSLKLMIGNTCKVFFPSLYEVIGNIFTMKDVDEF

FINLVSDTMKYRKDNDIVRSDFINMLMQLKEHPEKMDNIE(1)

LTDTLLTAQAVVFFIAGFETSSSTIAFGLYELAQNQEIQDKLREEIRKMHEKNKGILTYTDIKEMKYLDKVFKE(1)

TLRKYPILSTLSRKAMENYTFKGTKITIPKGTKVWVPVYGIQHDPNI

YPKPEVFDPERFEDDAFASRHPMSYLPFGDGPRNCIG(1)

ARFAHYQSKVGLITILRNHKVNVCEKTTIPFKADERSFLLALKGGVHLKITKI*

 

>Am2_Un.4631 new (605-937), 111 aa, 1 exon

only one aa diff to 6AS2 probably the same seq.

MDYFQILCAISIVILTIYYYYSLKYAFWKDRGISGPKPIIFFGNLGNSIIKKKSLSETVKKWYDDYKHESVFGIFEGTIPVLVINDLDMIKDILIRDFSLFVDRGFHIFPK(0)

 

>gi|15354903|gb|BI504529.1|   BB170023B10C08.5 Bee Brain Normalized/Subtracted Library, BB17 Apis

           mellifera cDNA clone BB170023B10C08 5', mRNA sequence.

          Length = 657

 

 Score =  190 bits (483), Expect = 1e-48

 Identities = 88/114 (77%), Positives = 97/114 (85%), Gaps = 7/114 (6%)

 Frame = +3

 

Query: 18  YYYYSLKYAFWKDRGISGPKPIIFFGNLGNSIIKKKSLSETVKKWYDDYKHESVFGIFEG 77

           YYYYS KYAFWKDRGISGPKPI+FFGN GNSI+KK+S+SETVKKWYDDYKHESVFGI+EG

Sbjct: 3   YYYYSSKYAFWKDRGISGPKPIVFFGNFGNSIVKKRSISETVKKWYDDYKHESVFGIYEG 182

 

Query: 78  TIPVLVINDLDMIKDILIRDFSLFVDRGFHIFPK-------LLLVDGNLWNEQR 124

           TIPVLVINDLDMIKD+LIRDFS+FVDRGFH FPK       L L++   W   R

Sbjct: 183 TIPVLVINDLDMIKDVLIRDFSIFVDRGFHTFPKIEPLTQHLFLLEAERWRPMR 344

 

an EST  95% to 6AS2 above, exact match to Un.601a from ver 1.2

There might be alternative splicing of the first exon.

YYYYSSKYAFWKDRGISGPKPIVFFGNFGNSIVKKRSISETVKKWYDDYKHESVFGIYEG

TIPVLVINDLDMIKDVLIRDFSIFVDRGFHTFPKIEPLTQHLFLLEAERWRPMRMKLSPI

FTSGKLKEMFSLIVESAGNLEKYLDEVIKKNEMVECRDLAAKFTTDVIGSCAFGINTNSL

LEEDSEFRrMGKKIFSPSLKLMIGNTCKVFFPSLYEVI

 

>CYP6AS13 CYP6ASf1 Am2_Un.5370 (9442-11518), 497 aa, 5 exons,  completion on C-terminus stop codon changed to A (63% to 6AS2)

version 1.2 = CYP6AS fragment Am1.2_Un.1326b incomplete on C-terminus (4-243) minus, 80 aa, 1 exon

1 aa diff to Am1.2_Un.5413

version 1.2 Am1.2_Un.5413 stop codon in 3rd exon, incomplete on both ends, (1-1334) plus, 315 aa, 3 exons 60% to 6AS2, 71% to Am1.2_Un.6932 in one overlapping exon 3

MDFFQIFCAICIMLLAIYYYYTSIYNYWKVRGIPGPEPTIIIGNFMEVFLKKISINDKLRFL

YNKYKNEPMFGIFEGSSPILVLNDLDLIKDVLIKDFSIFSNRGFRIFPK(0)

AEPLGEHLFALETERWRPMRAKLSPIFTSGKLKEMFPLIIECSKNMEPYLDKIAERGKYIECRDLAAKFTTDVIGSCAFGIDMNSISDKDSEFRIIGRKLFTPTFKTIVRDVCRQFLPGLYDVIGHKLQIEEVNEFLT

NLIKDTINYRKENKIVRPDFVNTLIELKDHPEKLETIK(1)

LTDSMIASQAFVFFVAGFETSSSTISHALYELAQNQEIQDKLREEIREVYEKHGELTYDVIKNMKYLDKVLKE(1)

TLRKYPIMAMLTREAQENYTFKGTKVTIEKGIKVWILPYGIQNDPDIFPNPDI

FDPERFDEEAVAARHPMSYLPFGDGPRNCIG(1)

ARFAQFQSKIGIITIVRNHKIDVCEQTKIPYESDPFQFLLALKGGINLKISKI*

 

>CYP6AS2P1 CYP6ASf2 Am2_Un.5353 (2878-1897), 209 aa, 3 exons,  4 aa change at C-end of last exon due to frameshift, frameshift in middle of last exon

version 1.2 = CYP6AS fragment Am1.2_Un.6932 incomplete on N-terminus (41-1021) minus, 209 aa, 3 exons, 92% to last 3 exons of CYP6AS2

(1)LTDILLTAQAVVFFVAGFETSSSTMAFSLYELAQNQEIQDKLREEIRKVHEKNKGVLTYTDIKEMKYLDKVFKE(1)

TLRKYPILSTLSRKVMENYTFKGTKITIPKGTKIWVHGIQHDPNIYPEPEVFDPERFEDDAFASRHPMSYLPFGDGPRNCIG(1)

LVLPHYQSKVGLITILRNHKXVNVCEKTTIPFKADERSFLLVPKGGIHLKIIKI*

 

CYP6AS13 (6ASf1, top) compared to 6ASf2

Query: 1   LTDSMIASQAFVFFVAGFETSSSTISHALYELAQNQEIQDKLREEIREVYEKH-GELTYD 59

           LTD ++ +QA VFFVAGFETSSST++ +LYELAQNQEIQDKLREEIR+V+EK+ G LTY

Sbjct: 1   LTDILLTAQAVVFFVAGFETSSSTMAFSLYELAQNQEIQDKLREEIRKVHEKNKGVLTYT 60

 

Query: 60  VIKNMKYLDKVLKETLRKYPIMAMLTREAQENYTFKGTKVTIEKGIKVWILPYGIQNDPD 119

            IK MKYLDKV KETLRKYPI++ L+R+  ENYTFKGTK+TI KG K+W+  +GIQ+DP+

Sbjct: 61  DIKEMKYLDKVFKETLRKYPILSTLSRKVMENYTFKGTKITIPKGTKIWV--HGIQHDPN 118

 

Query: 120 IFPNPDIFDPERFDEEAVAARHPMSYLPFGDGPRNCIGARFAQFQSKIGIITIVRNHK-I 178

           I+P P++FDPERF+++A A+RHPMSYLPFGDGPRNCIG     +QSK+G+ITI+RNHK +

Sbjct: 119 IYPEPEVFDPERFEDDAFASRHPMSYLPFGDGPRNCIGLVLPHYQSKVGLITILRNHKXV 178

 

Query: 179 DVCEQTKIPYESDPFQFLLALKGGINLKISKI 210

           +VCE+T IP+++D   FLL  KGGI+LKI KI

Sbjct: 179 NVCEKTTIPFKADERSFLLVPKGGIHLKIIKI 210

 

the bottom seq has some defects.  The last frameshift X should not be counted as an extra amino acid since it adds to the length and this does not match 6AS2.

 

The heme region is out of phase

 

     1: K  C  I  F  D  A  V  F  L  *  I  L  I  L  I  F  V  I  I  F 

     2:  N  A  F  S  M  Q  F  F  Y  K  F  *  F  *  F  S  *  L  F  F

     3:   M  H  F  R  C  S  F  F  I  N  F  N  F  N  F  R  N  Y  F  F

        AAATGCATTTTCGATGCAGTTTTTTTATAAATTTTAATTTTAATTTTCGTAATTATTTTT

    901 ---------!---------!---------!---------!---------!---------! 960

        TTTACGTAAAAGCTACGTCAAAAAAATATTTAAAATTAAAATTAAAAGCATTAATAAAAA

 

 

     1: F  S  A  R  F  A  A  L  S  K  *  G  W  T  Y  N  N  S  S  Q 

     2:  S  V  L  V  L  P  H  Y  Q  S  K  V  G  L  I  T  I  L  R  N

     3:   Q  C  S  F  C  R  I  I  K  V  R  L  D  L  *  Q  F  F  A  I

        TTCAGTGCTCGTTTTGCCGCATTATCAAAGTAAGGTTGGACTTATAACAATTCTTCGCAA

    961 ---------!---------!---------!---------!---------!---------! 1020

        AAGTCACGAGCAAAACGGCGTAATAGTTTCATTCCAACCTGAATATTGTTAAGAAGCGTT

 

The green region shows a PROBABLE FRAMESHIFT

 

The second small 2 aa deletion is in a region where length is usually conserved, between the PKG and PERF motifs.  These multiple defects tag this sequence as a pseudogene.
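The three-frame display above can be reproduced with a short translation routine, which makes it easier to see where the reading frame changes. This is a minimal sketch using the standard genetic code. translate_frames is a hypothetical helper name, and the input is the top strand of the first block (positions 901-960) as printed above.

```python
# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate_frames(dna):
    """Translate the three forward reading frames; unknown codons become 'X'."""
    dna = dna.upper()
    frames = []
    for offset in range(3):
        codons = [dna[i:i + 3] for i in range(offset, len(dna) - 2, 3)]
        frames.append("".join(CODON_TABLE.get(codon, "X") for codon in codons))
    return frames

top_strand = "AAATGCATTTTCGATGCAGTTTTTTTATAAATTTTAATTTTAATTTTCGTAATTATTTTT"
frames = translate_frames(top_strand)
for number, protein in enumerate(frames, start=1):
    print(number, protein)
```

Frames 2 and 3 come out one residue shorter than in the display above because the display borrows bases from the next 60-nucleotide block to complete the last codon.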

 

 

>CYP6AS3 Am2_Un.476 (20769-18418), 499 aa, 5 exons

version 1.2 = CYP6AS3 Am1.2_Un.601b (7365-9709) plus, 499 aa, 5 exons

version 1.1 = CYP6AS3 Am1.1_Un.792b (3396-5727) minus, 499 aa, 5 exons

64% to AmGroupUn.5496

MDYFQLLCVIGALLFAIYYYLTLTFDTWKNRGIPGPKPTIFFGNFQEVILKKISLAEKTKQLY

QEYKNELVFGIFQGRTPILVINDLEMIKDVLIRDFSVFPDRGIHVNPK(0)

VEPIFQTLFSLKSKTWRPLRMKLSPVFTSGKLKDMFPLILDCAKNLEEFVEKVRNSGEPVDCRDMAAKFTTDVIGSCAFGVCMNSLSPEGSEFRRMGEQLGKFSFKKLARDFTRLYMPFLFDIIGGYLQSHEVNNFFI

NLIRDSIKYRQENNVYRPDFVNTLKELKEHPEKLENIE(1)

LTDALLTSQALVFFLAGFETSSTTISNALYELAQNPEMQDKLRKEIKEVYENNGGALSYTDVKEMKYLDKVFKE(1)

TLRKYPVLAALSRQATENYTFKDTKIKISKGTRIWIPVYGIQHDPNI

YPEPEVFDPERFEDDAFASRHPMTYLPFGDGPRNCIG(1)

ARFAHYQSKVGLITILRNNKVEVCAKTLIPYKSEPRNILMIPKGGKVELRITKV*

 

>CYP6AS4 Am2_Un.476 (1718-4379), 501 aa, 5 exons

version 1.2 = CYP6AS4 Am1.2_Un.1740a (1295-3956) plus, 501 aa, 5 exons

version 1.1 = CYP6AS4 Am1.1_Un.1753 incomplete on C-terminus of exon 1 (2509-4596) minus, 411 aa, 5 exons

80% to AmGroupUn.42b, 58% to AmGroupUn.792b, 41% to 6a17, 44% to 6a13,

MLHHFHILTAFVAIFLALYYYLTSKFDFWKNRGVSGPRPVPFFGNAKDVLLRKIGIGSFIAELY

KRYDNEAMFGIFIGRSPNLVLRDLDLIKDVLIKDFSIFDNRGLNIPER(0)

AEPFSVNLFSVDATRWRPLRMRLSPVFTSGKLKEMFPLILECAEHLEQCLEDAVKRGGPVDCFEIPARYTTDVIGSCAFGINMNALSDERSEFRKMGRNMFDQNMIKFTRNLLRDFFPRFYNLLGFVLPYTESTVFMT

LIKGTIKYREENDVVRPDFVNLLMELKKHPEKLKNIE(1)

ITDTLLAAQASVFFAAGFETSSTTMAHALYEMALNPDIQDKLRNEMKEFHAKNNGNLKYEDIKEMKYLDKVFRE(1)

TLRKYPPGMLLRRKCNSNYTFHGTKVSIPAGTSVIIPLYAIQIDPKFYENPDVF

DPERFNEDAVAARHPMTYLPFGDGPRNCVG(1)

ARFAVYQTKVGLIKILQNFRVDVCEKTMIPYVKKINSITLAPRDGIFLKIEKITD*

 

>CYP6AS5 Am2_Un.4238 (9913-6869), 498 aa, 5 exons,  pseudogene?, stop codon is back in 4th exon

version 1.2 = CYP6AS5 Am1.2_13.15b (125743-128770) minus, 499 aa, 5 exons, now complete

version 1.1 = CYP6AS5 Am1.1_Un.42a pseudogene? stop codon in exon 4 before heme binding

site and missing exon 5 (126603-129298) minus, 439 aa, 4 exons

56% to AmGroupUn.5496 43% to 6M2 new subfamily in CYP6?

MASSFEILCGIAVLFLALYYYLTSTFDFWKSRGVVGPKPVPFFGTTKDLILVKKSTAHFVK

DIYEKYKNEPMVGLYATRSPFLLLNDPELIKDILIRDFSKFANRGLGVFER(0)

TEPLSPHLLNLEVERWRPLRSRLSPIFTSGKLKEMFYLIIECSLNLEMYLDKLIEKNEPIECRELTARFTTDVIGSCAFGIDMSSMTNENSEFRRMGREVFAVNFMNVMRMKLKQFMPRLYDLLGYVMPDRTFAPFF

TRVVTDTIKYRNDNNIVRPDFINMLMELQKNPQKLENIK(1)

LTDSLIAAQAFVFFLAGFETSSTTMSNALYELALNQDVQKKLREEINTFCPKNNKELKYDDIKEMEYLDKVFKE(1)

TLRMYPPASILMRKAISDYTFNDTKITIPKEMKIWIPAFAIHRDSAI

YPNPDSFDPERFDKDAMASRHPMHYLPFGDG*RNCIG(1)

ARFAVYQTKVGLITILRNHKVEVCEKTVIPYEFDPGAFLLSPKDGIYLKITKI*

 

>CYP6AS6 Am2_13.15 (124021-121751), 497 aa, 5 exons

version 1.2 = CYP6AS6 Am1.2_13.15c (120924-123194) minus, 497 aa, 5 exons

version 1.1 = CYP6AS6 Am1.1_Un.42b alternate splice for exon 3? (121844-124324) minus,

497 aa, 5 exons

41% to 6a17, best matches all CYP6As 80% to AmGroupUn.1753

MFDYFQILIAFVASFLALYYYLTSNFDFWKNRNVVAPKPIPFFGNTKDVVLKKIEISN

FIAELYKKYENEAMFGIFFGGSPNLILRDLDLIKDVLIKDFSTFDERGFKISER(0)

ADPLNANLFNMDVTRWRPLRIKLSPVFTSGKLKEMFPLILKCAERLEQCLEDAVKRGGPVDCFEISARYTTDVIGSCAFGINMNALSDERSEFRRIGKRIFDLDKNILRSFLRQFFPRFYNLLGFVIPYSETSKFVTK

FISEMIKYREENNVVKADFVNLLMELKKHPEKLQNIK(1)

ITDNLLAAQAFVFFAAGFETSSTTMAHALYEMALNPNIQDKLRKEIKEFYANNNFTYEEVKKMKYLDKVFKE(1)

TLRKYPPGVFLKRKCNSNYTFKGTKVSIPAGTSVIIPVYSIQTDPKF

YENPDVFDPERFNEDAVAARHPMTYLPFGDGPRKCIG(1)

IRFGVYQTKVGLIKMLKNFKVNVCEKTMIPYIKKINSFTLAPKDGIFLKIEKNN*

 

>CYP6AS6v2 Am2_Un.4238 (4309-2045), 497 aa, 5 exons, completes on N-term what appeared to be a single exon fragment in version 1.2

version 1.2 = CYP6AS6-se1[5] Am1.2_13.15a solo exon pseudogene (132407-132573) minus, 54 aa, 1 exon

1 aa diff to last exon of CYP6AS6, but in different region on 13.15

the two sequences are on AADG03005752.1 11483 bp apart

MFDYFQILIAFVASFLALYYYLTSNFDFWKNRNVVAPKPIPFFGNTKDVVLKKIEISNF

IAELYKKYENEAMFGIFFGGSPNLILRDLDLIKDVLIKDFSTFDERGFKISER(0)

ADPLNANLFNLDVTRWRSLRIKLSPVFTSGKLKEIFPLILKCAERLEQCLEDAVKRGGSVDCFEISARYTTDVIGSCAFGINMNALSDERSEFRRIGKRIFDLDKNILRSFLRQFFPRFYNLLGFVIPYSETSKFVTKFISEMI

KYREENNVVKADFVNLLMELKKHPEKLQNIK(1)

ITDNLLAAQAFVFFAAGFETSSTTMAHALYEMALNPNIQDKLRKEIKEFYANNNFTYEEVKKMKYLDKVFKE(1)

TLRKYPPGVFLKRKCNSNYTFKGTKVSIPAGTSVIIPVYSIQTDPKF

YENPDVFDPERFNEDAVAARHPMTYLPFGDGPRKCIG(1)

IRFGVYQTKVGLIKMLKNFKVNVCEKTMIPYIKKINSFTLAPKDGIFLKVEKNN*

 

This has only 5 aa diffs to 6AS6.  Four are in exon 2. Is it a different gene? Are the introns different? Is it an allele?
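Counting the differences between two such near-identical sequences is easy to script. The sketch below compares the first 65 residues of exon 2 from the CYP6AS6 and CYP6AS6v2 records above; differences is a hypothetical helper name, and it assumes the two sequences are already aligned without gaps.

```python
def differences(seq_a, seq_b):
    """List (position, residue_a, residue_b) for each mismatch; positions are 1-based."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    return [(i + 1, a, b) for i, (a, b) in enumerate(zip(seq_a, seq_b)) if a != b]

# Start of exon 2 in CYP6AS6 and CYP6AS6v2, copied from the records above.
as6 = "ADPLNANLFNMDVTRWRPLRIKLSPVFTSGKLKEMFPLILKCAERLEQCLEDAVKRGGPVDCFEI"
as6v2 = "ADPLNANLFNLDVTRWRSLRIKLSPVFTSGKLKEIFPLILKCAERLEQCLEDAVKRGGSVDCFEI"

diffs = differences(as6, as6v2)
for position, a, b in diffs:
    print(f"position {position}: {a} -> {b}")
```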

 

>CYP6AS7 Am2_13.15 (119129-116557), 505 aa, 5 exons,  completed on N-terminus

version 1.2 = CYP6AS7 Am1.2_13.15d (115784-118302) minus, 487 aa, 5 exons AADG03005751.1 (WGS)

version 1.1 = CYP6AS7 Am1.1_Un.42c incomplete on N-terminus (115419-117055) minus, 265 aa, 3 exons

45% to 6M2, 66% to AmGroupUn.1753 Nearly complete sequence.

BI505169.1 EST (N-terminal)

MPSLEIIFGIILLFPAIYYYFTSTFDFWKVRGVPGPKPIPIFGNIKNVMLLKTSMCH

YLKKLCEEYKHEPMIGIFTRKTPILIIQDPDLIKDVLIRDFSKFANRGIPIHEK(0)

AEPLSPHLFNLEVERWRPLRTRLSPVFTSGKLKEMFPLILDCAKHLEQYLDKLVLREEFIECRELTAKYTTDVIGSCAFGIEMNALSDEESEFRRIGRKVFSNSFGQILRFRFRQIFPRIYNLLGFVLPPMEVTKFLTNI

IVSTMKYRQENNIVRPDFVNMLIELKKHPDKLENIK(1)

LTDTLLTAQAFVFFIAGFETSSSAISNALYELALNPEVQNKLRQEIKEYFNKHNELKYEYIKNMIYLDLVFRE(1)

TLRKYPPGPLILRKSITNYTFNNTKVSIPEESFVWIPLYAIHHDPKI

YPNPDAFIPERFNDDAIATRHPMHYLPFGDGPRNCIG(1)

ARFAVYQSKIGLITILWNYKVEVCDKTMIPYEINPAAFLLTPKGGIYLKFTKIKNNEEILN*

 

>CYP6AS8 Am2_13.16 (153493-156158), 500 aa, 5 exons

version 1.2 = CYP6AS8 Am1.2_Un.5081 (10839-13505) minus, 500 aa, 5 exons

version 1.1 = CYP6AS8 Am1.1_Un.4533 (10741-13406) minus, 500 aa, 5 exons

53% to AmGroupUn.2631 42% to 6P4 48% to 6N1 partial

MYISLEIFCGIVVALIALYYYLTVNNNFWKNRGIAGPEPVLGFGNMKKVLLGKESM

SQFLTKIYHEYKNEPIIGIFTTRTPQLIIKDPDLIKTILIKDFSKIMNRGLLPMVS(0)

GEPISQHLFNIEAERWRPLRIHLTPVFTANKLRGMFSLILECSMHFVSYVDSLVKKGEPVNVREVAARFTTDVVGSCGFGVEMNSLSEKESEFRRVGKSVFATNYARIIKHRIREFMPRLYNYILYLWPTDEMAEKIIKLTR

ETLEYREKNNLFRPDFMNILLDLKKHPEKIGLD(1)

VTNEFLAAQAFIFFVAGFETSSSTISNALYELALNPDVQDKLRKEIKEFAAKNDGEWRYETIKEMEYLGKVFQE(1)

TLRKYPSLPFLTRELIEDYTFESNKVTIPKGLKIWIPTYAIHNDPDI

YPDPDKFDPERFSDDKIKQRHPMHFLPFGHGPRNCIG(1)

ARFAIYQTKIGLINILRNFKLDVCDKTLIPYKHHPRGLLLMPLTDLYLKITRLTN*

 

>CYP6AS9 Am2_13.17 (7981-5161), 497 aa, 5 exons,  probably pseudogene, frameshift in 2nd exon, no good splice donor site for 3rd exon, missing IG from heme binding region; there are 2 fragments that are similar to CYP6AS9

version 1.2 = CYP6AS9 Am1.2_Un.1054b frameshift in exon 2, possibly a pseudogene because splice site donor for intron 4 removes 2 aa from heme binding region (1823-4511) minus, 499 aa, 5 exons

version 1.1 = CYP6AS9 Am1.1_Un.2792 incomplete on N-terminus, pseudogene??? heme binding

site probably not functional due to change in splice donor

(995-3268) minus, 367 aa, 4 exons

65% to 6AS8

Am1.1_Un.4458 incomplete on both ends (1-1218) plus, 203 aa, 2 exons

3aa diffs to AmGroupUn.2792 61% to 6AS8

note: GG (9) boundary was QE (1) in ver 1.2

AADG03012612.1 AADG03012613.1

MFINLETLCGFVIVLIAFYYYLTINNNFWKNRGIPGPKPTIGFGNMWTVMFGKE

SFSQLLTTIYNKYKDEPMIGIFFRRRPVLLLKDFDLIKDVLIKDFSKFANRGFLKTNP(0)

KVPLTNHLFALEVKRWRPLRNHLXSPVFTSGKLKGTFAQILNCSNDLVTHIDTLSKMEDSINMREVAAKFTTDVIGSCVFGIKMNSLSEIENEFRRIGRNIFATNFTNILKIRLLQSTPFLHSLLCKILPDEEMEMFMRI

TKETMEYREKNNLVRPDFMNILLELKKHPERIADIE(1)

LTDDLLAAQAYIFFAAGFETSSTTISNALYELALNHDMQDKLREEIKEFEAKNDGEWRYETIKEMQYLGKIFGG(9)

TLRKYPALSFLSRESIIDDYTFRDTKLTISKGTLVWIPVFPIHHDPNI

YPDPEKFDPERFTEDKIKERNLMHYFPFGHGPRNCIG (1 GC boundary)

ARFAIYQTKIGLIKILRNYKVDVCSKTLIPYKYDPFSFILVPLGGLYLKITRI*

 

version 1.2 = CYP6AS9 Am1.2_Un.1054b frameshift in exon 2, possibly a pseudogene because splice site donor for intron 4 removes 2 aa from heme binding region (1823-4511) minus, 499 aa, 5 exons

MFINLETLCGFVIVLIAFYYYLTINNNFWKNRGIPGPKPTIGFGNMWTVMFGKESFSQLLTTIYNKYKDEPMIGIFFRRRPVLLLKDFDLIKDVLIKDFSKFANRGFLKTNP (0)

KVPLTNHLFALEVKRWRPLRNHL (FRAMESHIFT)

SPVFTSGKLKGTFAQILNCSNDLVTHIDTLSKMEDSINMREVAAKFTTDVIGSCVFGIKMNSLSEIENEFRRIGRNIFATNFTNILKIRLLQSTPFLHSLLCKILPDEEMEMFMRITKETMEYREKNNLVRPDFMNILLELKKHPERIADIE (1)

LTDDLLAAQAYIFFAAGFETSSTTISNALYELALNHDMQDKLREEIKEFEAKNDGEWRYETIKEMQYLGKIFQE (1)

TLRKYPALSFLSRESIIDDYTFRDTKLTISKGTLVWIPVFPIHHDPNIYPEPEKFDPERFTEDKIKERNLMHYFPFGHGPRNC

(1)

ARFAIYQTKIGLIKILRNYKVDVCSKTLIPYKYDPFSFILVPLGGLYLKITRIQD*

 

>Am2_79040249 new (255-761), 168 aa, 1 exon highly similar to 6AS9

(0)KVPLTNHLFALEVKRWRPLRNHLSPVFTSGKLKGTFAQILN?APMILXDSYRHFIEDGRLYKHAREVAAKFTTDVIGSCVFGIKMNSLAEREXEFRRIGRNIFATNFTNILKIRLLTSTPLLHSLLCRILPDEEMRXMFFRITKETMEYREKNN

PPQAGFYEYTSRLETEV

 

>Am2_Un.5369 (5212-3283), 300 aa, 4 exons C-terminal end of Am1.2_6AS9, also appears to be a pseudogene

version 1.2 = NUn.5369 UnOriented 6AS?? pseudogene

EIENEFRRIGRNIFATPFYKYIED*RLLQSTPLLHSLLCKILPDEEMEMFMRITKE

TMEYREKNNLVRPDFMNILLELKKHPERIADIE(1)

ITDDLLAAQAYIFFAAGFETSSTTISNALYELALNHDMQDKLREEIKEFEAKNDGEWRYETIKEMQYLGKIFQE(1)

TLRKYPALSFLSRESIIDDYTFRDTKLTISKGTLVWIPVFPIHHDPNIYPEPEKFDPERFTEDKIKERNLMHYFPFGHGPRNC(1)ARFAIYQTKIGLIKILRNYKVDVCSKTLIPYKYDPFSFILVPLGGLYLKITRIQD*

 

>Am2_48488186  like 2nd exon of Am1.2_6AS9 (826-453), 125 aa, 1 exon

DTLSKMEDSINMREVAAKFTTDVIGXSCVFGIKMNSLSEIENEFRRIGRNIFATNFTNILKIRLLQSTPFLHSLLCKILPDEEMEMFMRITKETMEYREKNNLVRPDFMNILLELKKHPERIADIG(1)

 

>CYP6AS10 Am2_13.17 (13740-10480), 499 aa, 5 exons

version 1.2 = CYP6AS10 Am1.2_Un.1054a (7016-10145) minus, 499 aa, 5 exons

version 1.1 = CYP6AS10 Am1.1_Un.2631 incomplete on C-terminus of exon 1 (2686-5804)

minus, 495 aa, 5 exons AADG03012613.1 AADG03012614.1

gnl|Amellifera1|174017188 BCM Apis mellifera 12/11/2003

40% to 6M2 53% to AmGroupUn.6AS8

MAAFEILCGFIIFIFAFYYYLIKPQEYWKNRGVPGPKPIPIFGNFFRLTFARISIGDL

MTKFYKEYKHEPVFGLYMRNVRVLAINNPDLIKTVLIKDFSKFAHRGLALNEV(0)

TEPLSQHLFVLEPKRWRPLRTKLSPIFTSGKLKDMFSLIIECSNTLENYVEHLISKNDRVEVRDLAAKFTTDVIGSCGFGVDMNAMSDVQCKFRDIGREFFGPSFKQILKIRLRENLPRLYTFLGYILPRDETTT

FFTNVVLDMIKYRKTNDIYRPDFINALINIQNHPEKLDIE(1)

LTEPLLVAQAFLFFVAGFETSSLTIATALYELAQNQDIQDKLRDEITEHHKLNNGEWQYENIKNMPYLDAVFKE(1)

TLRKYVPLTVLMRQSLEDYTFESINLTIPKDTRIFIPIYAIHRDPDI

YPNPEVFDINRFSKEAEATRHPMHYLPFGDGPRNCIG(1)

ARFAIFQTKIGLIKILRTYKVDVCNETQIPFINEPRTFTLAPKHDLTLKITKIEN*

 

>CYP6AS11 Am2_13.16 (147797-150778), 500 aa, 5 exons,  not a pseudogene after all??? complete on N-terminus, 11 aa changes

version 1.2 = CYP6AS11P (pseudogene) Am1.2_Un.5080a incomplete on N-terminus (1-1697) plus, 229 aa,

4 exons with three frameshifts and a stop codon

gnl|Amellifera1|166350743 BCM Apis mellifera 12/11/2003

Am1.1_Un.4532 incomplete on N-terminus (1-757) minus, 131 aa, 2 exons

83% to 6AS8

MYINLEIFCAIVIAFIAFYYYLTINNNFWKHRGISGPKPVLGFGNMKKIILGEESMSQ

YLTKLYHKYKNESMIGIFRLRTPALIIKDPDLIKIVLIKDFSKFMNRGLLPIIS(0)

TEPISHHLFALEAERWHPLRKHLTSGFTSNKLKGMFCMIHECSKHLVNYLDILVRKEEPVNVREVAARFTTDVVGSCGFGVEMNSLSEQESEFRRLGKSIFNTNVQKIIKDRIRELTPQVYNFLLYILPLDGIS

PKILKLMKETIKYRKKYDIFRPDFMNIILELKKHPEKINID(1)

ITNELLAAQIFIFFAAGFETSSTLISNALYELALNPNIQDKLREEIKKFESQNDEEWKYETIKKMDYLEKVIQE(1)

TLRKYPPVPFLNRELIDDYTFESNKVTIPKGLKIWIPTYAIHNDPDI

YPDPDKFDPERFSEDNIKQRHPMHFLPFGHGPRNCIG(1)

IRFAEYQTKIGLINILRNFKLDVCDKTLIPYKLHPRGLILIPLTDLYLKITRLTN*

 

>CYP6AS11alt Am2_13.16 (152312-152647), 112 aa, 1 exon this sequence is just upstream from the start of CYP6AS8 (about 2000bp) or about 1500 bp from 6AS11

version 1.2 = Am1.2_Un.5080b incomplete on C-terminus (3231-4470) plus, 287 aa, 2 exons

version 1.1 = Am1.1_Un.248b incomplete on C-terminus and N-terminus of exon 2

only 6 amino acid differences to 6AS11P in an 11 amino acid window

MYIGFEIIYGIVIVFIAFYYYLTINNNFWKHRGISGPKPVLGFGNMKKIILGEESMSQYLTKLYHKYKNESMIGIFRLRTPALIIKDPDLIKIVLIKDFSKFMNRGLLPIIS(0)

 

>CYP6BC1 383+1.19 Am2_1.19 (858065-862228), 476 aa, 5 exons, still incomplete on N-term of 4th exon, combines Am1.2_Un.383 and Am1.2_1.19 about 46% to 6AS7

version 1.2 = Am1.2_1.19, incomplete on C-terminus (860909-863534) plus, 381 aa, 3 exons

version 1.2 = Am1.2_Un.383 incomplete on N-terminus (6360-7086) plus, 84 aa, 2 exons

version 1.1 = Am1.1_Un.4500 incomplete on N-terminus (298-1025) minus, 84 aa, 2 exons

MWNIIRELLEQFLLPGLFLGILYCFLTSTFDFWKNRGVPFRKPTVLFGNFAPMLLFRKSLPEGIK

EMYEWFKDERYFGAFRVRSPVLILRDPDLVKNICVKNFTSFSNRGIPVNSQ(0)

DPLSAHLFNLEGKKWKSLRSKLTPAFSSGKLKRMFYLLAECGEEFEKLIDISSETDRPYEIRELAAKFTIDVIGTCAFGIQINALTDEESEFHRAAKKLSKPSYKATLWRMLRTAMPRLYKFLGVQVIDPGVTKFF

KDVVSQMIKQRGEYGIKRHDFMDLLIELKNKGTLDEFGK(1)

LDENSIAAQAFVFFAAGYETSSNTIAFCLHELALNTEIQEKTRRDIQDAIDSRNGNLTYDAVQDMKYLDMVIAG(1)

KVELPAGIRVIIPIYGLHHDPDYYPSPAIFNPERFTEENKRTRHPYAYLPFGEGPRNCIG(1)

MRFALLQIKVGIISFLRNHRVETCQKTITPIKFSRRSLVTTSEKGFWLRIK*

 

>Am1.2_1.19 (version 1.2), incomplete on C-terminus (860909-863534) plus, 381 aa, 3 exons

in CYP6, 46% to 6AS5 missing EXXR to end

MWNIIRELLEQFLLPGLFLGILYCFLTSTFDFWKNRGVPFRKPTVLFGNFAPMLLFRKSLPE

GIKEMYEWFKDERYFGAFRVRSPVLILRDPDLVKNICVKNFTSFSNRGIPVNSQ(0)

DPLSAHLFNLEGKKWKSLRSKLTPAFSSGKLKRMFYLLAECGEEFEKLIDISSETDRPYE

IRELAAKFTIDVIGTCAFGIQINALTDEESEFHRAAKKLSKPSYKATLWRMLRTAMPRLY

KFLGVQVIDPGVTKFFKDVVSQMIKQRGEYGIKRHDFMDLLIELKNKGTLDEFG (1)

KLDENSIAAQAFVFFAAGYETSSNTIAFCLHELALNTEIQEKTRRDIQDAIDSRNGNLTYDAVQDMKYLDMVIAG(1)

>Am1.2_Un.383 (version 1.2) incomplete on N-terminus (6360-7086) plus, 84 aa, 2 exons

version 1.1 = Am1.1_Un.4500 incomplete on N-terminus (298-1025) minus, 84 aa, 2 exons

52% to AmGroupUn.5496 53% to 6AM1

AIFNPERFTEENKRTRHPYAYLPFGEGPRNCI (1)

GMRFALLQIKVGIISFLRNHRVETCQKTITPIKFSRRSLVTTSEKGFWLRIK*
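
The sequences in this listing carry exon boundary markers such as (0), (1) and (2), which appear to give the intron phase at each splice site. A short Python sketch, under that assumption, for stripping the markers and recovering the residue count and exon count of an entry is shown below. The name summarize is hypothetical. Markers outside the 0-2 range, such as the "(9)" used in a few fragments, are left in place and would be counted as residues.

```python
import re

# Boundary markers look like "(0)", "(1)", "(2)", sometimes "(1?)" or " (1)".
MARKER = re.compile(r"\(\s*[0-2]\??\s*\)")

def summarize(seq_lines):
    """Count residues and exons for one entry given its sequence lines."""
    text = "".join(seq_lines)
    # Split at every boundary marker and drop whitespace inside each piece.
    pieces = [re.sub(r"\s+", "", p) for p in MARKER.split(text)]
    # Ignore empty pieces produced by a marker at the very start or end.
    exons = [p for p in pieces if p.strip("*")]
    protein = "".join(exons)
    return {
        "aa": len(protein.rstrip("*")),   # trailing "*" is the stop codon
        "exons": len(exons),
        "has_stop": protein.endswith("*"),
    }

# The Am1.2_Un.383 fragment from the entry above (annotated as 84 aa, 2 exons).
lines = [
    "AIFNPERFTEENKRTRHPYAYLPFGEGPRNCI (1)",
    "GMRFALLQIKVGIISFLRNHRVETCQKTITPIKFSRRSLVTTSEKGFWLRIK*",
]
print(summarize(lines))
```

Internal stop codons in pseudogene entries are counted as residues by this sketch, so its totals may differ from the annotated lengths for those entries.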

 

>CYP6AS14 13.16 Am2_13.16 (142794-146154), 501 aa, 5 exons,  complete on C-terminus, -->K

version 1.2 = Am1.2_13.16 incomplete on C-terminus (2365-4885) minus, 445 aa, 4 exons

version 1.1 = Am1.1_Un.248a incomplete on C-terminus (38475-40756) plus, 338 aa, 4 exons 49% to 6Aa14, 73% to 6AS8

MYIGLEILCGIVITLIAFYCYLTINNNYWKNRGIPGPKPVPGFGNMKNVIFGKESVSQ

FLTRMYNEYKDEPMIGVFSKRTPVLIVKDVDLIKTILIKEFPKFANRGLFPIFS(0)

RDPLTHHLFNLEVERWKPLRTQFTPLFTSSKLKEMFSLILECSNHLESYMDTLIKKGEPIDMREVSARYTTDVVGSCAFGIDMNSLSEKESVFRRLGKLIFATNLRKILSIRIQDMLPWLYNSFLYVLPRDE

KTRIIMKLMTETMEYREENNVFRPDFINMLLNLKKHPEKIDIE(1)

LTDDLLAAQIFIFFAAGFETSSSTISNALYELALNPDIQEKLRKEIKEFEARNNGEWRYEIMKEMEYLEKVFQE(1)

TLRKYPSLPFLNRKLINDYTFESNNVTVSKDLKIWIPVYGIHHDPDI

YPDPEKFDPERFSKKEEIMKRHPMHFLPFGHGPRNCIG(1)

ARFAVYQTKIGLIKILRNFEVQVCNKTLIPYKVNPYTSLLIPITGLYLNVVKLEN*

 

>Am2_101397060 (301-576), 91 aa, 1 exon very like Am2_13.16, 29aa in a row

(0)TLRKYPSLPFLNRKLINDYTFESNNVTVSQRFKRFWIPVYRNLTPISDIYPDPEKSHL*KISLGGNYGEGAPKAFLPFGQGPQNMSCVWPIK(0)

 

>CYP6AS15 218+7762 Am2_Un.210 (2742-6273), 501 aa, 5 exons, combines Am1.2_Un.218 and Am1.2_Un.7762  68% to 6AS8

MNISLEILCGIIVALIVFYYYLIINNNFWKNRKISGPKPVIGFGNMLSIILGKESTSQFL

TRIYNEYKNEPMIGIFSKNNPALVIKNPDLIKTVLIKDFHKFANRGLFPVNS(0)

REPLSQNLFGLEVERWRPLRIHFSPIFTTNKLKGLCSLILECSEQLEKYMDILIRKGEPLDIREIAARFTTDVIGSCAFGIEMNSLSENESEFRRLGKGVFNTTFRRIVKTRIRNLMPWLYNFFLRILPWDE

ITKKIVKLTTETIEYRNKNNIVRSDFINVLLNLKKHPEKIAEIE(1)

LTNDLLSAQTFVFFGAGFETSSTTISNALYELALNHDIQYKLREEIKEFEKKNDGKWTYESIKEMQYLNKIFQE(1)

TLRKYPVVPFLNRELISDYTFENSKITIPKGLKIWIPVYGIHHDPDI

YPNPEKFDPERFSEDKIKERHSMHYLPFGHGPRNCIG(1)

SRFGTYQTKIGLVKIIRKYKVEICDKTLIPYKFNSFANFLMPSTGLYLMITDVEN*

 

version 1.2 = Am1.2_Un.7762 incomplete on C-terminus (1072-1314) plus, 81 aa, 1 exon

gnl|Amellifera1|2043985811 BCM Apis mellifera 8/15/2003

version 1.1 = Am1.1_Un.9652 incomplete on C-terminus (993-1236) plus, 81 aa, 1 exon

72% to AmGroupUn.248a

note: this exon 1 may join with Am1.2_Un.218 to make a complete sequence

MNISLEILCGIIVALIVFYYYLIINNNFWKNRKISGPKPVIGFGNMLSIILGKESTSQFLTRIYNEYKNEPMIGIFSKNNP

ALGIRNPDLIETVLIKDFHKFANRGLFPVNS (0)

 

version 1.2 = Am1.2_Un.218 incomplete on N-terminus (12841-15336) minus, 389 aa, 4 exons

version 1.1 = Am1.1_Un.8460 incomplete on both ends (123-685) plus, 139 aa, 2 exons

71% to 6AS8

version 1.1 = Am1.1_Un.8178 incomplete on both ends (801-1322) minus, 174 aa, 1 exon

64% to 6AS8

(0)REPLSQNLFGLEVERWRPLRIHFSPIFTTNKLKGLCSLILECSEQLEKYMDILIRK

GEPLDIREIAARFTTDVIGSCAFGIEMNSLSENESEFRRLGKGVFNTTFRRIVKTRIRN

LMPWLYNFFLRILPWDEITKKIVKLTTETIEYRNKNNIVRSDFINVLLNLKKHPEKIAEI (1)

ELTNDLLSAQTFVFFGAGFETSSTTISNALYELALNHDIQYKLREEIKEFEKKNDGKWTYESIKEMQYLNKIFQ (1)

ETLRKYPVVPFLNRELISDYTFENSKITIPKGLKIWIPVYGIHHDPDI

YPNPEKFDPERFSEDKIKERHSMHYLPFGHGPRNCIG(1)

GSRFGTYQTKIGLVKIIRKYKVEICDKTLIPYKFNSFANFLMPSTGLYLMITDVEN*

 

>CYP6AS12 Am2_Un.476 (9611-6794), 499 aa, 5 exons,  no longer a pseudogene???  complete on C-terminus, 4 aa changes including *->L

version 1.2 = CYP6AS12P Am1.2_Un.1740b possible pseudogene (6374-9226) minus, 500 aa, 5 exons AADG03014312.1

version 1.1 = Am1.1_Un.8712 incomplete on N-terminus (13-1433) minus, 267 aa, 4 exons

60% to 6AS5

version 1.1 = Am1.1_Un.10510 incomplete on both ends (122-647) minus, 176 aa, 1 exon

54% to AmGroupUn.42a, 39% to 6a16m all best hits = CYP6As from aa107-274

probably a new CYP6 subfam or CYP6a

version 1.1 = Am1.1_Un.6966 incomplete on C-terminus (878-1100) minus, 72 aa, 1 exon

40% to 28A5 N-term 38% to 6a20 57% to AmGroupUn.42a

MAYIEILC-GIIVSMLVFYYYFTSVFNFWKIRGIPGPKPKFLFGNIRDIILSRISTPTFIKNIYDT

YTNEPMVGLYMGRNPILLLKDPELIKDVLIRDFSKFADRGFNVHEK(0)

VEPLSQHLFNLEPKRWRPLRSKLSPMFTSKKLKEMFGLILECGHHFEKYVDGLAARRQPVDFCEVAAKYTTDVIGSCAFGINMNAMSSEGSEFREAGRKIFEPTWNSIIRLKFKITMPTLYDLLGPLVPEREVTPF

FIKVVTDAMKYRKERNVFRPDFIDTLMKLRDDPESLSDIE(1)

LTDAFLTAQAYVFFAAGFETGASTISNTLYELAQNQGMQDRLREEIREHCDKYGGELMYENIKEMEYLDKVFKE(1)

TLRKYPPGTLIPRRSVSEYTFKNTNVTIPKGTMIWIPAFPIHRDPNI

YPNPDDFNPENFTEDAINNRHPMNYLAFSNGPRNCIG(1)

ARFANYQVKIGLIMILRNYKVEVCEKTVIPYQFDPNLFLLGPKGGIYLRVTKVE*

 

>Am2_98681617 new (840-543), 99 aa, 1 exon 91% to 6AS12P

VDVAFYYYFTSALNFWKIRGIPGPHXPKFLFGNIRDIILSRISTPTFIKNVYDTYTNEPMVGLYMARNPIL

LLKDPELIKDVLIRDFSKFADRGFNVHEK(0)

 

>Am2_Un.6006 (2537-189), 375 aa, 4 exons, like 6AS12, but with nucleotide deletion at X in first exon and missing both ends probably the same sequence

 =changes from CYP6AS12

YGRIVHGTESDS

LLKDPELIKDVLIRDXFSKFADRGFNVHEK(0)

VEPLSQHLFNLEPKRWRPLRSKLSPMFTSKKLKEMFGLILECGRHFEKYVDGLAARRQPVDFCEVAAKYTTDVIGSCAFGINMNAMSSEGSEFREAGRKIFEPTWNSIIRLKFKITMPTLYDLLGPLVPEREV

TPFFIKVVTDAMKYKKESNVFRPDFIDTLMKLRDDPESLSDIE(1)

LTDAFLTAQAYVFFAAGFETGASTISNTLYELAQNQGMQDRLREEIREHCDKYGGELMYENIKEMEYLDKVFKE(1)

TLRKYPPGTLIPRRSVSEYTFKNTNVTIPKGTMIWIPAFPIHRDPDI

YPNPDDFNPENFTEDAINNRHPMNYLAFSNGPRNCIG(1)

 

>CYP6AS16 1326a Am2_Un.5370 (792-4695), 499 aa, 5 exons, completes 3rd exon for Am1.2_1326a, frameshift at X (64% to 6AS9)

version 1.2 = Am1.2_Un.1326a incomplete, missing 3rd exon (4844-8009) minus, 430 aa, 4 exons

version 1.1 = Am1.1_Un.2707 incomplete missing exon 3

(1-3597) plus, 368 aa, 4 exons

59% to AmGroupUn.4533 

note XM_396006.1 mRNA predicted by program GNOMON skips exon 3

MIANLEIFCGIIVILIAFYYYIRLEQFWKFXGIPGPEPLPGFGNVLMIVLGKEAPFQFLT

RVYNEFKNEALIGVFMKTYPALVVKDPDLIKDIMIKDFYKFPNRGFPKSDSVN(1)

PLTQHLFLVEEEKWRPLRTQLSPVFSTGKLRGTFTQILDCSNHLVTYMDKLVEIGEPIDVREVTAKFTTDVIGSCVFGIKMNSLSGKESEFRRFGRQIFAMNFLKILRLRIKQFLPMLHYLLVRILPPDEETKIMLKL

TRDTFKFREAHNIVRPDFMNILMELKKHPEKVPSLD(1)

LTDGLLAAQAFIFFAAGFETSSTTVANTLYELALNPDIQEKLRQEIKEFEANNDGEWKYETIQEMKYLDKTFKE(1)

TLRKYPVLPYLSRRSIEDYTFEGTKVSIPKNTLICIPVYPIHHDSSI

YPNPEKFDPERFSEDEVKKRHSMHYFPFGHGPRNCIG(1)

LRFAIYQSKIALIKILSNYKIEICDKTLIPYKYDPFSFISLPLTGIFLKITKLQN*

 

>Am2_Un.5351 like Am1.2_Un.1326a (1327-110), 100% to Am1.2_Un.1326a

(1)GLLAAQAFIFFAAGFETSSTTVANTLYELALNPDIQEKLRQEIKEFEANNDGEWKYETIQEMKYLDKTFKE(1)

TLRKYPVLPYLSRRSIEDYTFEGTKVSIPKNTLICIPVYPIHHDSSI

YPNPEKFDPERFSEDEVKKRHSMHYFPFGHGPRNCIG(1)

 

>Am2_13.17 (752-9), 143 aa, 2 exons, like Am1.2_Un.1326a, but lacking the frameshift and missing C-terminus

MIANLEIFCGIIVIVIAFYYYITARNNFWKIRGIPGPEPLPGFGNVLMIVLGKEAPFQFLTRVYNEFKNEALIGVFMKTYPALVVKDPDLIKDIMIKDFYKFPNRGFPKSDS(0)

ADPLTKHLFLVEEEKWRPLRTQCPQCSQPVN

 

>CYP6BD1 Un.416 Am2_11.13 (431584-429035), 515 aa, 6 exons, missing aa on C-terminus, changed splice acceptor site on 2nd intron, OK 5aa changes,

version 1.2 = Am1.2_Un.416 incomplete on both ends (10974-18858) minus, 223 aa, 3 exons

gnl|Amellifera1|236106554 BCM Apis mellifera 12/11/2003 42% to 6AQ1

Am1.1_Un.2162 incomplete on both ends (1-852) plus, 223 aa 3 exons

Note there is a seq duplication of the last exon

I used the end of the duplicated sequence to finish the protein sequence.

429191 ARIGQLQSIIGLITIIKNYEISFISKCKGDLDVRNIFITPINDFSL 429054

and                 429000 ISLNSKCKGDLDVRNIFITPINDFSLNLTKI* 428905

39% to 6g1 aa175-381 probable new subfamily in CYP6 41% to 6AQ1

MQNSLFMPIIFSIAFFVIILILYLKYTRTYWLRRGVPTVPGHWLFGNIKKILDLKKPPAYVISDIYQKCSENDDILGIYIFFKPFLLIKDPAIIKQILIKDFNYFPDRNFTIQSFYDEIGNKSLFTLKNPQWKYLR(2)

TKLSPIFSSAKVKKLFHLMVEAANSMNKYLDDEFSNDTKTKTIMIKDVTLKYTTNVISSVAFGIQVNSFNPKTIQFYEEA(1)

QKGLKTTFSRSMQLCISFFFPKLSPYLNTRMLGSSTNFFRKVFWNSMDNREITKTKREDLIDSLMELKNAKQDKDFK(1)

FEGDALLSQSAIFFIAGRETSISIICLTLYELAKHPEIQKRTREEINEKLKEHGMTYEGVQSMKYLHQVVSEILRIYPPTPIIDRVAVADYK(0)

IPGTDIVIEKGTSVFIVLTALHNDPKYHPDPLRFNPDRFSDENKENIKPFTYIPFGEGPRICIG(1)

ARIGQLQSIIGLITIIKNYEISFISKCKGDLDVRNIFITPINDFSLNLTKI*

 

>CYP6BE1 Un.6686 Am2_Un.1086 (577-5116), 512 aa, 6 exons, complete on N-terminus 42% to 6AS2

version 1.2 = Am1.2_Un.6686 incomplete on N-terminus (2630-5871) minus, 453 aa, 5 exons missing about 54 aa

gnl|Amellifera1|2049199734 BCM Apis mellifera 8/15/2003

Am1.1_Un.2637 incomplete on C-terminus (7-2478) plus, 330 aa, 4 exons

45% to 6P4 new subfamily in CYP6? 46% to 6F1

Am1.1_Un.5730 incomplete on both ends (990-741) minus, 84 aa, 1 exon

54% to 6d4m 60% to AmGroupUn.8460 60% to 6AM1

MFLTTWLIPDIIAVASLITVGLYFYYKLYLFKFWHKKGIFYVKPSFPTGNIMPIINGKLSLA(1)EFFRDIYEHNKHHRLVGIYMLYKPYLIVNDPNLIRDILTKEFTNFHDRGIFYNEEVDPLSGHLFQLPGKKWRNLRVK

LTPTFTSGKIKQFFPILNEAGNILAKYLEEEARKGSTIDVKDIFAR(2)

YSTDIIMSVAFGISCDSFKEPNNEFRYWGKKIFDPKPLWNALILFAPQILNFFS

ISYTEKSVTKFFTNMFKQTVKYRESNNIERKDFLNLLIQLMKNGYVDADDESLSNNVNAAK(1)

NKLTMMEAAAQAYVFFLAGFETSSTTVTFCLYELAKNQDIQNKVREEIQTMIKKNGDLTYNALNDMNYLHKVISE(1)

TLRKYPPVVILNRICTNDVKLSTTDFCIPKGTCIAIPVFGLHRDSNI

FPNPEKFDPERFSEENIKTRHPYVYLPFGEGPRICIG(1)

LRFGLIQTKIAIINALLKNKFKFGPNTPSTLEFEKGSLILIGKGGIHLNIEPI*

 

>Am2_98611857 new (16-745), 93 aa, 2 exons, 89% to Am1.2_Un.6686, CYP6P like

PRDGSSQE

KFDPERFSEENIKXLGHPEVYLPFGEGPRIL(9)IGLRFGLIQTKIAIINALLKNKFKFRPNTPSTSEFEKGSLILIGTGGIHLNIEPL*

 

New families in the CYP3 clan: CYP335 and CYP336

>CYP335A1 Am2_14.6   (434001-435548), 515 aa, 1 exon

version 1.2 = CYP335A1 Am1.2_14.13a (10454-12001) plus, 515 aa, 1 exon

version 1.1 = CYP335A1 Am1.1_14.7a (400127-401674) minus, 515 aa, 1 exon

40% to 9f2m new subfamily in CYP9 63% to 335A2 39% to 9D1, 38% to 9K1

50% to 9J8 partial

MEVSHLTTFELLLLTLIFIILAKLVSILYTQFTYWKRNKVPYIRSSPLFGTAWRVFFRLVSFPNYCKYIYNYYPDARYVGVMDFATPTVIVRDPKLIKEIAVKNFNNFPDHRSFVTEEMDPVFGKNVFSLKGDRWREMRNTLSPSFTANKMRFMFDLVSKCSHDFVSCLHDRLESSSSEIEGKNLFTRYSNDVIATVAFGISVNSIEHPDNEFYRRGIDVSTFSGTFRFIKFMLFRLNPRLTRMAGFTFLSRATSKFFWRVISETVTARKRRGIVRPDMIHLLMQATDSKKKSIHQTMTIDDIVAQAFIFFLAGFDTTSTLMCYVVHELALHQDVQRRLREEVDRVLDDGTEISYEDTLGMEYLEMVISETLRMHPPTLLIDRQCAKEFHLPPAGPGYESVTIHPGENIWFPVLAIHRDPAHFPDPDKFDPERFNRENRNGIDPYTYIPFGVGPRKCIGNRFALMETKLLIIRLLRKFVIKPCERTMDPIVYKKGNFTLMPKDGFWVTFEKRNDH*

 

>CYP335A2 Am2_14.6 (443001-444560), 519 aa, 1 exon

version 1.2 = CYP335A2 Am1.2_14.13c (15290-16849) plus, 519 aa, 1 exon

version 1.1 = CYP335A2 Am1.1_14.7b (395285-396839) minus, 519 aa, 1 exon

39% to 9f2m 63% to 335A1 40% to 9K1 51% to 9E2 partial, 54% to 9J7 partial

MESPTLLFSFELLAIGLTAIVLAKFVSLLHHQYNYWRKRRVPHVGAVPVLGSSWRIFTRRMSLPNFCSLVYKHRPGSRYLGMMDCFTPVVVVRDPNLIKEIAVKNFDHFPDHHSFINEKIDPIFGKNVFSLKGDRWREMRNTLSPSFTASKMRFMFDLVSNCSEEFVRYLYDHPEFSSSIEAKDAFTRYTNDVIATVAFGISVNSMENRDNEFYTKGADATNFGGIFRLFKFMLFRVNPRLTRMAGLSFLSRGTATFFHRVVRETVRARDERRIVRPDMIHLLMQARDKEDRRPVATVDNRMTIDDITAQAFIFFLAGFDTSSTLMCYVAHELALNPPVQERLREEVDRFMDGGNGAITYEALLKMEYMDMVTSETLRKYPPIVFIDRLCVEKFELPPAEQGYDHLIVHPDNIVWFPVYGLHHDPKYFPEPEKFDPERFNDANKRNIVPYTYMPFGLGPRKCIGNRFALMETKILIAYMLRKFRIKRTEKTRASIEFSKTNFSLTPDHGFWIGLEKRDP*

>Am2_14.6 (440001-441185), 379 aa, 1 exon, pseudogene like 335A2

new in version 2.0

DSFRYSAHFANSSS*KNPVGPVLAQSSSKREILLDFTLNPSINQGDSFDRFIDRSSTRNIFSF*GAKGRKR*GILSPSFTMNEIDVRNCSQVCGSFALIYPRWTQRKILSSATT*SFIYQILLESV*IRWKTTSQFPLMRFRVHD**DY*E*FLKSTIFDFVPRVVSNGIEWRTRHR*TGYTAFIARNKEKSSIDKMTIDDIVSRAIRFSTWMALTLALSNFMRFVLYDPAFGWRGKEIDG*NILRILRRVYTGMMMSETLGKYSGLIFVDRVCTRKFALSGAGYDGATLYPGDIVFSPYVGSRS*IFFRSRNEKNEGNIVPYWNLLLEIKIFMIPSRRFSIRLNEKTPEAYRLSEK*SCIHS*RRNLD*IGRKERL*ILWLVKNFVHYFAKRFM

 

>CYP335B1 Am2_14.6 (530001-531533), 510 aa, 1 exon

version 1.2 = CYP335B1 Am1.2_14.13h (106877-108409) plus, 510 aa, 1 exon

version 1.1 = CYP335B1 Am1.1_14.7c (305231-306757) minus, 510 aa, 1 exon

39% to 9E2 35% to 9f2m 57% to 335B2 42% to 335A1

MDYLQLGLTLLAILVAVYYLSTRNHKLLKRHGIVHIPPTPLFGNLGPLVRRKCHMEDVIQRVYDLDPDARYVGMYEFTTPLIIIRDPELIKTIGVKEITNFTNHRPFVDVGVDPMLGEVLFAMQGDRWREHRTMLTTLFTSSKIKSMFVLMSDCAKRFADYLSKVEREIELKSVLTRYTNDVIARCVYGVSVDSVNEPENIFYRYGQVASQLSTFKQNLMIFVHRNSPRLARLFNLKILPVHIEKFFHRLVMDTIETRRREGVHGLDMLQQLMDMQSRRKESEEGKRGMTVTDIANHAFSFFFGSVDTMATQISLISHMLAVNPDVQQRLQEEIDEVLSASEDKQVGYDVIQEMKYLDAVMSEAMRYHPILLFVDRVCGETFELPPALPGARPFKLERGMNIWFPVKAIHHDPKYFENPDRFDPDRFLRDGKGIASSGAYMPFGMGPRKCIGSRFALTEMKILLFNILAKCSFKVGSKTMVPLKFKEGVFNPVAKNGFWLKIERRENSCC*

 

>CYP335B2 Am2_14.6 (589998-588400), 532 aa, 1 exon

version 1.2 = CYP335B2 Am1.2_14.13g (159215-160813) minus, 532 aa, 1 exon

version 1.1 = CYP335B2 Am1.1_14.7d (252241-253839) plus, 532 aa, 1 exon

57% to 335B1

MEFLSLALVLAAISIIAYYYCFVRKNFNLFQEHGILHVPPSPLVGNFGPLIRGKENVHDTIQRIYNIHPDAKYVGIFEFLTPVIMIRDLDLIKSITMKNFDQFPDHRPMFCKSVDPMLGEMLFIMDGERWKEHRNMLSPTFTSSKIKTMFVHMSECAKRFAHHLSKLPEKDRETEMKALLTRYTNDVIAACIYGVNVDSIKEPRNVFYMYGRVGATLIGLKKNLKIMVHRNMPWLANLLRLNILERHIAKFFTDLVVETVEERERNGTTNSDLIQLMMDTRNKKESGKKNLTVQNMANHAFSFFFGGFDTVSSQTCVLLHMLVENPEVQQRLQQEIDETLESNNGQLSYDVIQEMRYLDAVINEILRLHPIAVFIDRMCVKSFELPPALPGDVPFTVKPGMNVWIPVKAIHHDPRYYDEPEKFKPERFLDNGKNIIGSGAYFPFGIGPRICIGNRFALIEMKVLVCHILAVCDIKAGARTGIPLEFEKGVFNATAKTGFWLKIEPRKYSYHSGQINGLVNNHVINGACKTGI*

 

>Am2_57889394 (4-286), 95 aa, 1 exon, 335B2 fragment, 2aa changes, probably sequenced from the other haplotype

ETVEDTERNGTTNSDLIQLMMDTRNKKESGKKNLTVQNMANHAFSFFFGGFDTVSSQTCVLLHMLVENPEVQQRLQQEIDETLESMMNYSKYQFS(1)

 

>CYP335B3 Am2_14.6 (574998-573445), 517 aa, 1 exon

version 1.2 = CYP335B3 Am1.2_14.13d (145027-146580) minus, 517 aa, 1 exon

version 1.1 = CYP335B3 Am1.1_14.7e (266474-268027) plus, 518 aa, 1 exon

58% to 335B2 ESTs BI508364 BI516506 BI505081 BI505012

MDYLTISLSLITVFVAVYYLATRNNDFFKKHGIPHVPPVPFLGNMGSLVRQKSNLHDVIDRTYNLDPGAKYVGIYEFTTPIIILRDLDLIKTITMKYLDHFPDHRSFAYEGADPVFGSMLFAMKGERWKEHRNMLTPTLTSSKIKGMFKLMTECAVRFADFLSVLPENERETEMKALLSRYANDVIASCVYGVSVDSINDPKNIFYVYGRRGTNVVGLKKSMFVLIHRNMPWLAKLFGLRFLEKHVQKFFYDLVYETIESREKLGTNRSDVLQLLMDIRDKANSSGKMTTMTVENVAIHAFTFFFGGFDSITSVTTLLTQMLAEHPDVQARLQQEIDETLRSNDGVLTYDAVHGMKYMDAVINETMRFCPVLPFLDRMCVESFQLPAPVPGGQPFTLRPGMNVWIPLAAIGRDPEYFEDPDKFDPDRFLNPEAGIKNSGAHFPFGLGQRKCIGERFAMMEMKVLLCYVLAACNVRIGSKTTVPMKLEKGLINANVKGGFWLKIEPRKVTYYNSSRSN*

 

 

>CYP335C1 Am2_14.6 (581001-582596), 531 aa, 1 exon

version 1.2 = CYP335C1 Am1.2_14.13f (153220-154824) plus, 531 aa, 1 exon

version 1.1 = CYP335C1 Am1.1_14.7f (258230-259825) minus, 531 aa, 1 exon

44% to AmGroup14.7d 49% to AmGroup14.7g

MLDSWSITAAIVAVLAIAYYQLIWKYKHFERIGIPCYHSIPLLGSFWEAVIQRNNFAEISRKIYNSYPDTKYMGMYDTTTPVLLIRDTELIKAISVKHFEQFPDHRSFQNEATDPLFAKNLFALRGDRWREIRNLLSPAFTSSKMKSMFILMRDCAKEYGDYFASLTGDESTIELKDAFTRYTNDVIATCAFGVEVNSMKDRKNKFYVYGREGTTFGSWASIKFFVTRVLPVSVCTLLRIRLIRKEISDFFIDLVSTTIKTREEKGIVRPDMIQLMMESKGKLGAGKEMSMIDICAQAFVFFFGGFESTSTLMCFAAYEIAVNEDIQRRLQNEIDQVLEERDGEVTYAAVNEMKFLDAIIYEALRMYPVVVATDRVCMKPFELPPNRPGEKPYLLKEGDNVWFPIYAIQRDPQYYPEPDKFDPDRFLNDTKQMINSGLFLTFGIGPRMCIGNRFAMLETKVLLFHLFARCNLVPCSKTTIPMKLNRKGFSMTAENGFWFKIEPRSAKKEEKIAVPGTTMLIDKIPDRYPDN*

 

>CYP335D1 Am2_14.6 (577001-578524), 507 aa, 1 exon, 1 aa change

version 1.2 = CYP335D1 Am1.2_14.13e (151195-149678) minus, 507 aa, 1 exon

version 1.1 = CYP335D1 Am1.1_14.7g (261868-263392) minus, 507 aa, 1 exon

49% to 335C1

MAIFALLLIVLGILGSYHLLKSQNPFKEHGLPYKSYLPILGSTWESILRRKSFAVVIQEIYNLAPSARYVGFYNRTTPIVMIRDPELIKTIAVKNFDAFRNHRTVNDTQTDDVLLSGNLLLLRDNRWREVRSLNTPAFSTSKIRSMYRSMSEIAINVARYLSTLAPGQNIVEMKDIFTRYANDVFATCAFGISVDSLSDRENKFYELGREALDIHSTPILKLILIFAFPKLARRLGVSLVSKEATNFFTRVVSENIKMREEKGITRPDFIQAMIDKRNGRGRDDELTVEDITAQAFVFFFGGFETTSGLLSFAVHELAANPEIQGKVHAEIDRVLVSNNEITFERVNGLMYLDAVINETLRMYPIIPITDRECSKRFELPPVLPDAKPYVLKEGSHVWFPIYAIQRDPRYFEKPDCFDPDRFLDDNKKRSDAFNGDAYMPFGAGPRNCIGNRFSMVETKVALFHILAKCRLDVCPKTTIPMELRKRGVFLTAKNGFWLRIVPRHPVT*

 

>CYP336A1 Am2_2.8 (230001-231491), 496 aa, 1 exon,  5aa changed

version 1.2 = CYP336A1 Am1.2_Group2.13 (199618-201108) plus, 496 aa, 1 exon

version 1.1 = CYP336A1 Am1.Am1.2.15 (200938-202428) plus, 496 aa, 1 exon

34% to 6AH1 Anopheles EST BI946448 from brain

MASAFLTLVTGALLLLCFYLYLKYTYWKRNGIPYSKGYYPIIGHFLPLIMKKQSYSEIIEEIYRDSNHSMVGMYKGMKPVLILRDINLIKTVLQSNFSKFHENAVKIDPKLDPLLAKNPFFCYGELWQTGRKRLTYAFSNARLKILFAAVYEVCTKFRNFLDRRLESSKKYEVELKSLFLKFTSEVVANAGLGIEGFCFEDDKVQSMFTNLDNNDFLDTFLIGIIVHFPFLTKLLRIQFLPTKHDKFFRTVVKKNLELRKSDPIPRNDFIQLMIEMEQTGEKIDEEIVAAHAVSFYLDGVETSSVTLNFIGCQLAIHQDVQEKLRKEVRSTLEKHGGVLTFEALKDMTYMNQVISESQRYFSALGFLGKICTDEFELQGSDGLNYRAKPGTELLIPICGLHKDPKYWDNPEIFDPERFSDENKQRIEKMAFIPFGEGPRICVGMRMAMLQMKSCLATLMKDYKLEVSPKMQLPLKLSPTYFLSAPLGGGWVLISKA*
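
For the single-exon CYP335 and CYP336 genes above, the version 2 coordinates and the stated protein lengths can be cross-checked against each other: the coordinate span in nucleotides appears to equal 3 x (aa + 1), that is, the coding sequence plus its stop codon. The Python sketch below runs that check on the values copied from the headers. The helper names are hypothetical, and the rule is only expected to hold for intronless entries without frameshifts, so the Am2_14.6 pseudogene is left out.

```python
def span_nt(start, end):
    """Inclusive length of a coordinate range; works for either strand orientation."""
    return abs(end - start) + 1

def matches_single_exon(start, end, aa):
    """True if the range holds exactly aa codons plus one stop codon."""
    return span_nt(start, end) == 3 * (aa + 1)

# (start, end, aa) copied from the version 2 header lines above.
single_exon_genes = {
    "CYP335A1": (434001, 435548, 515),
    "CYP335A2": (443001, 444560, 519),
    "CYP335B1": (530001, 531533, 510),
    "CYP335B2": (589998, 588400, 532),
    "CYP335B3": (574998, 573445, 517),
    "CYP335C1": (581001, 582596, 531),
    "CYP335D1": (577001, 578524, 507),
    "CYP336A1": (230001, 231491, 496),
}

for name, (start, end, aa) in single_exon_genes.items():
    status = "ok" if matches_single_exon(start, end, aa) else "check"
    print(name, span_nt(start, end), "nt,", aa, "aa:", status)
```

A mismatch would not necessarily mean an annotation error; it would only flag an entry whose coordinates or length deserve a second look.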

 

CYP4 clan (4 seqs, 3 complete, 1 incomplete)

 

>CYP4G11 Am2_16.8 (1440635-1444519), 545 aa, 7 exons

version 1.2 = CYP4G11 Am1.2_Un.3078 (673-4557) minus, 545 aa, 7 exons

version 1.1 = Am1.1_Un.2145 (673-4743) minus, 545 aa, 7 exons

CYP4G11 63% to 4g15m, 100% to AF207948 partial seq

MAAASATGFSASSVFLSLLIPALILYFIYFRISRRHLLELAEKIPGPPALPLIGNALDLFGT (1)

MFSQVLKKAENFKDVVKIWVGPKLVICLIDPRDVEIILSSNVYIDKSTEYRFFKPWLGDGLLIST (1)

GQKWRNHRKLIAPTFHLNVLKSFIDLFNANARSVVEKMRKENG

KEFDCHNYMSELTVDILLETAMGVSKPTRDHNAFEYAMAVMK (2)

MCDILHLRHTKIWLRPDWLFNLTKYGKNQIKLLEIIHGLTKKVIQLKKEEYKSGKRNIIDNSAQKTESK (0)

TNNIVVEGVSFGQSVGLKDDLDIDDDVGEKKRQAFLDLLIEAGQNGVLLTDKEVKEQVDTIMFE (0)

GHDTTASGSSFFLAVMGCHPDIQEKVIQELDEIFGDSDRPATFQDTLEMKYLERCLLETLRMYPPVPLIAREIKTDLKLA (1)

SGDYTIPAGCTVVIGTFKLHRQPHIYPNPDVFDPDNFLPEKTANRHYYAFVPFSAGPRSCVGRKYAMLKLKIVLSTILRNFRVRSDVKESEFRLQADIILKRADGFKIRLEPRKQVASTA*

 

>CYP4AV1 Am2_Un.282 (45375-55154), 501 aa, 8 exons, insertion of nucleotide at C-end of 1st exon changing 5 aa and donor splice site

version 1.2 = CYP4AV1 Am1.2_Un.1387 (15087-18877) plus, 124 aa, 2 exons

version 1.2 = Am1.2_Un.1625 (113-5079) plus, 437 aa, 7 exons

version 1.1 = CYP4AV1 Am1.1_Un.2000 incomplete on N-terminus (1-4856) plus, 496 aa 7 exons

39% to 4c3

modified intron boundary

MSSSTIIMGVSWLTMILSICLMTIVVLLLVRRGKFLYALRKVPCPPAFPIIGNAYELCCSPEE (1?)

AFKKMIKWGKELGDMYLIWVGMRPFIFLYKAEAIQPLLSSSVHIDKSLEYQYLQPWLGSGLVTST (1)

GEKWHFHRKLLTPTFHSGLLELYLKTTIREAQILISCLRKEIGKPEFDIVPYAKRAALDIIC (1) DSSMGCNINAQKNFENEYVQAVNT (2)

LASISQRRFLNVWMSFDPIFKLTSWGKRHDHALSVTHGFVNK (0)

IIAERKAEWKDRKDTNFNEKSHKRQALLDLLLELSKDGKVLTDDDIRDEVNTFMFAGHDTTATSVSWILYALGRHPQYQ (0)

ELIIEEYDETVGTKELTLDILSKLTWLEACIKESWRLYPVTPLIARQIYHPITIL (1)

GHEIPIGSTVLVNSFLLHRDSRYFPEPDIYRPERFLPDGPKYPSYAFVPFSAGSRNCIGWKYGT

MIVKVLILYILKNFHVESLDTEDQLRFISELVLHNADGLRLKITPRK*

 

>CYP4AZ1 Un.1527 Am2_Un.5748 (1270-4633), 487 aa, 12 exons, added 7th intron, completed 11th exon

version 1.2 = Am1.2_Un.1527 (1556-4363) plus, 478 aa, 10 exons

version 1.1 = Am1.1_Un.8281 incomplete on both ends (644-1499) plus, 132 aa, 3 exons

version 1.1 = Am1.1_Un.2540 incomplete on N-terminus (5369-5938) minus, 71 aa, 2 exons

50% to 4AB2 fire ant complete seq, revised to match fire ant seq

MISAILFFIFLLATLHYFLLHHRKFGKMINLIPGPEPLPILGNIPTFHNISPS(1)

ELWKFLTQLSKQYYPIYRMWTFLEAYVHICHPDDIE(0)

TILGNIKFTKKGFGYKYLKPWFNTGLLTSSG(1)

HKWHVRRKILTSAFHFNVLRQFVDIFIEDAERLIKTLESEEGIFVENLLQLTSEHTLNVICE(1)

TAMGTSLKNKEKFQYEYRKAVYNMGCIFANR(2)

IVKPWFYYDFFFNLSPEGWQQSKLLKILHNFTRK(0)

 IIQERKEYHDKTNGRYLNDFHENINENDNNNDYND (1)

[fire ant seq. QYLKNFNQSIITDNEEIVGS]]]

[fire ant seq. QYLKNLNKDVVPNETETIGI]]]

VRRKRLAMLDLLIEAHRNNKIDDEGIREEVDTFMFR(0)

GHDTTAISFCFSIMLLAEHKEIQ(0)

DRARAEIKAAIEENGGKLNITVLQNLPYLERCIKESLRLFPSVPRISRKLETSVKL(1)

                            [[[fire ant seq. ISRITTEEAQL

                            [[[fire ant seq. ISRITSEETEL

RNYEIPSNTIINVNIFDTHRDPKFWPNPNKFDPDRFLPENSKKRHPYAYVPFSAGPRNCIG(1)

KSHLIP]]]

KTYLIP]]]

QRFAMLELKTYLGLLLYNYYFEPIDYLKDVTFVSGIVLRLENPVRMKFIPVKKIC*

 

>CYP4AZ fragment Am2_49183273 new (163-31), 56 aa, 1 exon 79% to Un.1527 in 4 clan

(0)DRPPAEIKAAIEENGGKLNITVLQFFSDLERCIKESLRLFPTVAG frameshift

GSKLETSRVKF (0?)

 

>CYP4BA1 Un.1327 Am2_Un.1509   (16763-15674), 299 aa, 3 exons

version 1.2 = Am1.2_Un.1327 incomplete on N-terminus (901-1990) plus, 299 aa, 3 exons

version 1.1 = Am1.1_Un.3324 incomplete on N-terminus (901-1990) plus, 285 aa, 3 exons

52% to 4aa1 55% to 4AN3 partial

(2)GQIMLLYRMIRPWLLIEWIYRLTKYGREEEKQRKNLFDTCFKMVKEKRDLLQSKDRISNNDIKKNK

NISLLEYMVEINEKNPCFSDEDIVEECCTFMLAGQDSVGTATAMTIFLLANHPEWQNKCIEEIDEIFNG

DTRFPTISDLKEMKCLEMCIKESLRLYPSVPIIGRTLGEDIKI (1)

GKHIIPAGCSVLISPYSTHHLPHHFPDPDTFKPERFNSENSEKRHPYAYIPFSAGPRNCI (1)

GYKFAMLEMKSIISAILRKCRLQSIPGKKEIRPKFRMTIRAQGGLWVKIIERDQILKSIAA*

 

mito clan (6 sequences, 6 complete)

 

>CYP314A1 Am2_5.28 (288891-286071), 517 aa, 10 exons (first intron is a guess)

version 1.2 = CYP314A1 Am1.2_5.28 (13954-16400) plus, 516 aa, 10 exons (first exon is a guess)

version 1.1 = CYP314A1 Am1.1_5.25 AADG02002903.1 incomplete on N-terminus (17670-19852) minus, 471 aa, 9

exons (19852-19708, 19574-19378, 19298-19111, 19031-18829,

18707-18579, 18471-18305, 18238-18160, 18072-17875, 17777-17670)

43% to 314A1, 55% to pea aphid (Acyrthosiphon pisum) P450 below

cyan is extra seq added based on CF588143

note This seq is less than 55% identical to Drosophila and Anopheles 314A1s,

but there appears to be only one orthologous seq in each species and it does not make

sense to give the ortholog a different subfamily.

 

MLLSSAWFEVIAAVLLTILIFVTSHRPAWWFWTATSHEASA(1)

PAEGKFKTVSKVPGPFSLPIFGTRWIFSCIGYYKLNKIHDAYKD(1)

LNQRYGALCKEEALWNFPMISVFSRQDIETIIRRNSRYPLRPPQEVISHYRRTRRDRYTNLGLVNE(2)

QGQTWHDLRVALTSELTAASTVLGFFPALNIVADSFIELIRRQRVGYKVTGFEELAYKMGLES(1)

TCTLILGRHLGFLKPDSSSELATRLAEAVRIHFTASRDAFYGLPLWKLLPTCAYKQLIESEDAIYN(2)

IISEIIETTIQEKRDDAKDESVEAIFQSILRQKNLDIRDKKAAIVDFIAAGIHT(0)

LGNTLVFLFDLIGRNPAVQNKLYEETYALAPAGCDLTIDNLRKAKYLRACITESLR(2)

LIPTTTCIARILDEPIELSGYRLTAG(0)

TVVLLHTWIAGLNEENFKDAKKYLPERWTTPTTPHSPLLVAPFGAGRRICPGKRFVDLALQLILAK(0)

IIREFEIIVEEELDLQFEFILAPKGPVSLGFRDRS*

 

>Am2_59135075 (72-1), 24 aa, 1 exon, like 314A1, but with 2aa changes, probably from the other haplotype

(0)LGNTLVCLNDLIGRNPAVQNKLYE(0)
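
Notes such as "2aa changes" for a haplotype fragment can be checked by sliding the fragment along the matching exon of the named gene and counting mismatches at the best ungapped position. The Python sketch below does this for the fragment above against the corresponding CYP314A1 exon. The function name is hypothetical, and the approach assumes there are no insertions or deletions between the two sequences.

```python
def best_ungapped_match(fragment, reference):
    """Return (offset, mismatches) for the best ungapped placement of fragment in reference."""
    best = None
    for offset in range(len(reference) - len(fragment) + 1):
        window = reference[offset:offset + len(fragment)]
        mismatches = sum(1 for a, b in zip(fragment, window) if a != b)
        # Keep the first placement with the fewest mismatches.
        if best is None or mismatches < best[1]:
            best = (offset, mismatches)
    return best

# Am2_59135075 fragment and the CYP314A1 exon that begins with LGNTLVFLFD.
fragment = "LGNTLVCLNDLIGRNPAVQNKLYE"
exon = "LGNTLVFLFDLIGRNPAVQNKLYEETYALAPAGCDLTIDNLRKAKYLRACITESLR"

offset, mismatches = best_ungapped_match(fragment, exon)
print(offset, mismatches)
```

Entries with frameshifts or missing residues would need a gapped alignment instead of this simple comparison.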

 

>CB336480 Tribolium castaneum embryonic cDNA

MFEKIFQSLDVTSLLIIAI

FFLFLEYRPPWWYRNNDCKKGVKLIPGPLALPGLGTTWIFFFGGFSFNRLHLYYENMYKR

YGPVMKEEYWCNIPVINLFEKREIVKVLKAGGKYPLRPPVEAVAHYRRSRLIDTLALG

 

>CF588143 pea aphid (Acyrthosiphon pisum) 48% to 314A1 D. pseudoobscura

  2 RVLRQSGKYPIRPPNEVTANYRKSRPDRYTNTGLVNEQGEVWAMLRNKLTPELTSPRTIR 181

182 RFLPEVNQLADDFNNLISLARDGNNVVRGFEGYCNRMGLES 304

305 TCTLILGRRIGFLDGEVSETATRLADSVTSQFRASQEAFYGLPLWKLIPTKAYKDFVAS 481

482 EDALYDIVSEFVESALIDEQQSFTDVRSVFVSILQASELDNRDKKAAIIDYIAAGIKT 655

    LGNTLVFIL 682

 

>CYP334A1 Am2_9.4 (736455-733846), 531 aa, 10 exons, second exon is a guess

version 1.2 = CYP334A1 Am1.2_9.8 (72740-75349) plus, 530 aa, 10 exons (second exon is a guess)

version 1.1 = CYP334A1 Am1.1_9.6 alternate splicing possible? (1158493-1161102) minus,

514 aa, 9 exons (1158603-1158493, 1158868-1158668, 1159034-1158959,

1159285-1159117, 1159526-1159346, 1159843-1159593, 1160068-1159927,

1160289-1160153, 1161102-1160830)

34% to 12A5m a new family in the mito clan

MTESQTASVDESLRTDTIPLLDHTATTEVSPTTFEVSTMKVDTQIFDKAPLPFDEIPGPAILKIWEKYWKYVPLLG(1)

RLTWNRNITPLKYLFNEYGCIVRINGPLSGDIVMIHR(2)

PEHIAEVFKQEGDTPVRSGIDILQHYRLNYRKYRLAGPFSM(2)

QGTEWLEIRDKVEDTFNQISSTFFTKIDTCCNELITRICKIRNRQNE(0)

VPVSFYEDLIRWAMECFCDLTFNKRLGFLEPIGYNSSSEASKLINALTTAHKYM

SRCETGFQVWRFFLTPFARKLFEACDVLDN(2)

VIGKYVRQAQCKLRIRKSHSEESSMTERSPVLEKLLLNEGIHPDDICTMLMDMIILGIQA(0)

TVNSEAFLLYHLAKNPRTQRKVYDEIISVLSNDNSSFTEKSLKNMPYLKACIQETLR(2)

LHPAIPYITRLLPKTISLHGYTIPKG(0)

TFVIMANQITSQREENFEDPFKFWPERWLSNSSKEDVHFSYLPFGHGIRSCLGKNMAEAKMMLLTAK(0)

LVRQFRIEYDYADIKSRFMMVNVPNKPLRFRFVNRN*

 

>CYP315A1 Am2_Un.248 (31787-34039), 535 aa, 6 exons

version 1.2 = CYP315A1 Am1.2_Un.768 (10162-7910) minus, 535 aa, 6 exons

version 1.1 = CYP315A1 Am1.1_Un.1189 (10257-12510) minus, 535 aa, 6 exons

38% to 315a1m, probable mito clan, maybe 315Bx

note This seq is less than 55% identical to Drosophila and Anopheles 315A1s,

but there appears to be only one orthologous seq in each species and it does not make

sense to give the only ortholog a different subfamily.

MNLAQNILKSGKSVSLSSNVIALKYNVPGCGYAGASQTSRIDDLSDISKSTDGGNRSKIEITEKLRDRNYGTVAVATSESILQEMPEPRGIPVFGTLFSFILSGGPKKQHEYVDKRHKELGPVYKERIGPTTAVFVNSIHEFRK

IFRLEGSTPKHFLPEAWTLYNEIRKCRRGLLFM(2)

NGEEWVYFRKILNKVMLLPDPTNLMIAPCQEVAIELKRKWQKQIKTNNIISNLQVQLYQWSIEA(1)

MMATLMGSYWYSYKHQLSRDFEILAETLHEIFEYSAKLSIIPVKLAMNLRLPVWKKFVASA

DTAFEIVRMLVPEMAKLGGNGLLKKMMDEGIRAEDAICIVTDFILAAGDT(0)

TATTLQWILLLLCNHPEKQEELFKHLKDLSQEDILRLPLLKGIIKESLRLYPIAPFISRYLPEDSVIGNYFVPKG(0)

ELLVLSLYSSGRDAANFPQPNEFRPERWIRTQKGIYQGVVHPHASLPFALGARSCIGRKLAEIQISFALAE(0)

LIKSFKIECINKNQVKLILHLISVPSQSIKLKLMERN*

 

>CYP302A1N Am2_Un.4726 (2916-5374), 228 aa, 4 exons, added to C-terminus

version 1.2 = CYP302A1 N-term Am1.2_Un.8443 incomplete on C-terminus (289-2450) minus, 202 aa, 3 exons

version 1.1 = CYP302A1 N-term Am1.1_Un.2216 incomplete on C-terminus (289-2832) minus, 202 aa, 3 exons

45% to CYP302A1 aa19-183 in CYP302 family, mito clan (disembodied)

MCTLLKKCNQSIRKKLFIKFYSNEFTKSKIKINHSQPKAFYDIPGPKSLPIIGTLYKYLPFIG(1)

EYSFTNLYESGKKKLKCFGPLVREEIIPNVNVIWIYRPEDIAEIFKAESGLHPERRSHLALLKYRKDRPNIYNTGGLLPT(2)

NGSEWWRLRKEFQKVSSKPQDVINYLKETDCVIQEFVELCNNEKFADFLPLLSRLFLEL(1)

TCLVVFDIRLNSFSKEERCENSISSK

 

>CYP302A1C Am2_Un.1798 (7517-4855), 295 aa, 5 exons, added to N-terminus

version 1.2 = CYP302A1 C-term Am1.2_Un.2118 incomplete on N-terminus (3101-5209) minus, 250 aa, 4 exons

version 1.1 = CYP302A1 C-term Am1.1_Un.3145 incomplete on N-terminus (122-2230) plus, 250 aa, 4 exons

51% to 302A1 (disembodied)

note these two pieces are probably from a single gene missing about 70aa in between

SSKLIKAAFATNSAILKLDNGLQLWRLFETPLYRKLRKAQTYMEM(2)

IALELVSRKKNNMKIRYNKSFLDAYLENPVLDIKDIVGMACDMLLAGIDT(0)

TSYSTAYILYHLAKNQNIQEKLRIEATQLLKNHNEPISINILRNASYTKAVIKESLRLNPISIGIGRILQTDVVLSGYRVPKG(0)

SVVVTQNQIICRLPEYFEEPNLFIPERWLREYSENNNKINYKKTVHPYVLLPFGHGPRSCIARRFAEQNMQILLLR(0)

ICRRLKISWHGDDLGMISLLINKPNALLKFNFHDILNNNSV*

 

>CYP302A1 (Am2_Un.4726, 225 aa and Am2_Un.1798, 295 aa), combined seqs 520 aa, 7 exons, only 3 aa overlap between CYP302A1N and CYP302A1C, 50% to 302A1 Dm

MCTLLKKCNQSIRKKLFIKFYSNEFTKSKIKINHSQPKAFYDIPGPKSLPIIGTLYKYLPFIG(1)

EYSFTNLYESGKKKLKCFGPLVREEIIPNVNVIWIYRPEDIAEIFKAESGLHPERRSHLALLKYRKDRPNIYNTGGLLPT(2)

NGSEWWRLRKEFQKVSSKPQDVINYLKETDCVIQEFVELCNNEKFADFLPLLSRLFLEL(1)

TCLVVFDIRLNSFSKEERCENSISSKLIKAAFATNSAILKLDNGLQLWRLFETPLYRKLRKAQTYMEM(2)

IALELVSRKKNNMKIRYNKSFLDAYLENPVLDIKDIVGMACDMLLAGIDT(0)

TSYSTAYILYHLAKNQNIQEKLRIEATQLLKNHNEPISINILRNASYTKAVIKESLRLNPISIGIGRILQTDVVLSGYRVPKG(0)

SVVVTQNQIICRLPEYFEEPNLFIPERWLREYSENNNKINYKKTVHPYVLLPFGHGPRSCIARRFAEQNMQILLLR(0)

ICRRLKISWHGDDLGMISLLINKPNALLKFNFHDILNNNSV*
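
The combined CYP302A1 sequence above joins the N-terminal and C-terminal pieces through a short shared stretch (3 aa). A minimal Python sketch of that kind of join is given below: it looks for the longest suffix of the first piece that is also a prefix of the second and merges the two at that point. The function name and the default minimum overlap of 3 residues are illustrative assumptions; an overlap this short can arise by chance, so a merge like this is a hypothesis to confirm against the genome, not proof that the pieces belong together.

```python
def merge_by_overlap(n_part, c_part, min_overlap=3):
    """Join two fragments at their longest suffix/prefix overlap.

    Returns (merged_sequence, overlap_length), or (None, 0) if no overlap
    of at least min_overlap residues is found.
    """
    max_k = min(len(n_part), len(c_part))
    for k in range(max_k, min_overlap - 1, -1):
        if n_part[-k:] == c_part[:k]:
            return n_part + c_part[k:], k
    return None, 0

# Last line of CYP302A1N and first line of CYP302A1C (phase marker removed).
n_tail = "TCLVVFDIRLNSFSKEERCENSISSK"
c_head = "SSKLIKAAFATNSAILKLDNGLQLWRLFETPLYRKLRKAQTYMEM"

merged, overlap = merge_by_overlap(n_tail, c_head)
print(overlap)
print(merged)
```

Applied to the full fragments, the same arithmetic gives 228 + 295 - 3 = 520 aa, matching the combined length stated in the header.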

 

>Am2_98065759 new (612-33), 72 aa, 2 exons 62% to 302A1

(0)IIALHLWRFLAPPLYRKLRNAQTYMET(9)NWYLVTPNHMKIRSHKSFLHPYLENPALHIKAILGMACVMNYLSL

 

>CYP49A1 ortholog Am2_Un.86 (102533-109096), 531 aa, 10 exons

version 1.2 = CYP49A1 ortholog Am1.2_Un.343b (13080-19643) minus, 531 aa, 10 exons

version 1.1 = CYP49A1 ortholog Am1.1_Un.423a incomplete on N-terminus and C-terminus of exon 6

(214-3539) plus, 312 aa, 7 exons

AmGroupUn.10899 incomplete on both ends (1221-1394) minus, 58 aa, 1 exon

46% to 49A1, 50% to 301A1 mito clan Combined Am1.1_Un.423a and Am1.1_Un.10899,

MLKIFVIKFLCCRVKMQILNCKFTKLLKNTVKQSNNIIKTLETLTTEVEEQDWSRCRPYSEIPGPKPIPFLGNTWRFIPFIG(1)

DFKIQAVDQVSKKLYKEFGDIVKVEGLLGRPDMVFIYDANEIERIFRQEERMPYRPSMPSLNYYKHVLRKEFFKENAGVIAV(2)

HGESWYNFRSKVQQVMLQPRTARMYITSMEEASLAFLER(2)

IKKIRNKNDEVPDDFLNEIHKWSLES(1)

IARVALDVRLGCLDDDANIETQQLIDAVTTFFKNVGILELKIPFWKLFNTPTWLKYVNALDTILS(2)

ITSRYTTVALSRTKEAEKSDKEPSLLERVLALENDTKLATILSLDLFLVGIDT(0)

TSSTVASTLYQLALHPDEQDRAYNEVCNILPSKDMQLDGKHLDKLKYLKACIKETLR(2)

MYPVVIGNGRCMTKDTIIKGYRVPKG(0)

VQVVFQHYVISNLDKYFPHSDKFLPERWLQSDGVRHSFASLPFGYGRRMCLGRRFAELEMLVVISK(0)

ILQRYKIEYHHEKLEYYINPMYTPKGSLNLKFIDR*

 

>CYP301A1 Am2_Un.86 (112558-109218), 516 aa, 10 exons, changed to better splice sites for 2nd intron

version 1.2 = CYP301A1 ortholog Am1.2_Un.343a (9993-12955) plus, 532 aa, 9 exons (exon 1 is a guess)

version 1.1 = CYP301A1 ortholog Am1.1_Un.423b incomplete on N-terminus and missing exon

(3686-6941) minus, 534 aa, 8 exons

69% to 301a1 D. melanogaster

MMTGKTRYLFFKIHFLLLYVLI (2)

CMVDHDTTTIQQGKPYKDIPGPRPIPILGNTWRLFPMIGQYEISDIAKLSQIFYDEYG

KIVRLTGLIGRPDLLFVYDVDEIEKIYRQEGPTPFRPSMPCLVHYKSVVRKDFFGSLPGV (1)

FTILRRHGEPWREFRTRVQKPILQPQTVRKYITPIEMVTSDFIQR (2)

IQEIKGEDGEVPGDFDNEIHKWALE (1)

CIGRVALDVRLGCLSSNLTSDSEPQKIIDAAKFALRNVAILELKAPYWRYVPTLLWSRYVRNMNYFIE(2)

VCMKYIDATMERLKTKKAVDEYDLSLMERILAKETDPKIAYILALDLILVGIDT (0)

ISMAVCSILYQLATRPEEQEKIYQELVEILPDPSVPLNMSHLDKAIYMKAFIREVFR (2)

VYSTVIGNGRTLQNDTIICGYKVPKG (0)

VQVVFPTVVTGNMEKYVTDAKIFKPMRWLKESTKTLHPFASLPYGHGARMCLGRRFADLEIQVLLAK (0)

LIRSYKLEYHHKPLKYKVTFMYAPDGELKFKVLPR*