Monarch butterfly P450s

88 sequences from

http://monarchbase.umassmed.edu/tools/IPR_query.cgi?id=IPR001128

 

Nov. 29, 2011

 

4 seqs were joined, leaving 86 sequences; 41 are now complete

 

60/88 seqs were incorrectly assembled.

CYP2 and mito clan seqs have been revised.

44 sequences in the CYP3 and CYP4 clans are short and need work

 

CYP2 clan (8 sequences, all complete)

 

>CYP15C DPGLEAN22546

scaffold1433:- 11116-17566

Best Drosophila hit  Cyp303a1 (4e-70)

63% to CYP15C1   Bombyx mori AADK01010670.1 BABH01011971

MWPLIIFILFLIIIKDLIQKPLHYPPGPTLIPIVGNLLSVWLKVRKVKYYHKVWQHWSNI
YGNILGLRLGFVNIVIISGKELIKDVLSRDVFDGRPDGFFYLMRSFGKKIGLVFSDGRVW
NRTRRTVLKYLKNFGYGSRAMESIISEECKALINLRLQDAGTTIIVNRMFDVSIVNILWK
LVAGKRYNLKDKQLLSLCNLITQSFKVADMSGGILNFLPFLRFIMPNLIGYTELKNIHNT
LYDFLKNTIEEHKATIDVHKPRDVIDVFLIESMEHNSPTSDEELQVICLDLLEAGVETVN
NTAVFMLLHIVRNDSIQRKLQQEIDAEIGRSRLPTLSHRSSMVYTEAVILESLRISSVAA
VGIPHMALEDARIGKYVIPKGTFILLAIHDLHNGPYWKDPEVFRPDRFLTKEGNLIQNDF
FMPFGSGKRRCIGEGLARSELFLFLVHILQRFNLKIPGGDPMPCPEPIDGVTLSAKEFGI
VFEPRF
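
Each record in this file follows the same layout: a ">" header with a CYP name and a DPGLEAN gene model, a scaffold line with strand and coordinates, and a best-hit line with an E-value in parentheses. The following is a minimal Python sketch of reading those three lines. The regular expressions and the function name parse_record are assumptions inferred from the records shown here, not part of any MonarchBase tool.

```python
import re

# Assumed line shapes, inferred from the records in this file.
HEADER = re.compile(r"^>(?P<name>CYP\w+)\s+(?P<gene>DPGLEAN\d+)")
LOCUS = re.compile(
    r"^scaffold(?P<scaffold>\d+):(?P<strand>[+-])\s*(?P<start>\d+)-(?P<end>\d+)"
)
EVALUE = re.compile(r"\((?P<evalue>\d+(?:\.\d+)?(?:e-\d+)?)\)")


def parse_record(lines):
    """Collect name, gene model, locus and best-hit E-value from one record."""
    rec = {}
    for line in lines:
        line = line.strip()
        if m := HEADER.match(line):
            rec["name"], rec["gene"] = m["name"], m["gene"]
        elif m := LOCUS.match(line):
            rec["scaffold"] = int(m["scaffold"])
            rec["strand"] = m["strand"]
            rec["start"], rec["end"] = int(m["start"]), int(m["end"])
        elif line.startswith("Best") and (m := EVALUE.search(line)):
            rec["evalue"] = float(m["evalue"])
    return rec


# The CYP15C record above, reduced to its three annotation lines.
example = [
    ">CYP15C DPGLEAN22546",
    "scaffold1433:- 11116-17566",
    "Best Drosophila hit  Cyp303a1 (4e-70)",
]
record = parse_record(example)
```

Free-text notes such as "short", "complete" or "missing N-term" are deliberately ignored in this sketch, since their wording varies from record to record.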

 

>CYP18A1 DPGLEAN09331

scaffold152:- 12546-14415

Best Drosophila hit  cytochrome P450-18a1, isoform A (9e-146) short

89% to CYP18A1v1   Heliconius melpomene

missing N-term (added back from genomic seq)

REVISED SEQ.
14778 MITILSNSKLLWGLWQVMSYCTSRTTMPLLLTVGLTFLVMRLINLVREIRKLPPGPWSPP 14599
14598 VVGYLPFLGVRHKTFLELARSYGALFSARLGNQLTVVLSDYKLIREAFRREEFTGRPSTPL 14416

MHILDGLGIVNSEGRLWKSQRRFLHEKLREFGMTYMGNGKKIMEARIKNEVHDLMDNLQC
TEAAPVDPNPLLALGVSNVICGITMSVRFSHGDARFARLNHLIEEGMRLFGEIHYGEYIP
LYNYLPGKAQVQEKVAKNREEMFAFYQTLIDEHRNTLDINNSRDLIDVYLIEIEKAKLEG
REGDLFDGRNHELQLKQILGDLFSAGMETIKSSLLWMIVFMLRNPDVKRRVQEELDTVIG
RERLPTIEDMSSLPYTETTILETLRMSSIVPLATTHSPTKDVHLNGYRIPAGSQVVPLIN
CVHMDPNLWDEPNKFNPSRFIDENGKIKRPEYFMPFGVGRRMCLGDVLARMEMFMFFASL
MHQFDVQTEAGAAPPSLEGTVGATISPQPFRVKFVPRSPPAPPVVLVHEHPHLRHVGSH
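
The two coordinate-flanked lines in the revised N-terminus above can be sanity-checked by comparing the genomic span with the number of residues. The sketch below assumes the flanking numbers are inclusive nucleotide positions and that neither segment starts or ends inside a split codon; the function name span_matches is hypothetical.

```python
def span_matches(start, end, residues):
    """True if the inclusive genomic span codes for exactly len(residues) codons."""
    return abs(start - end) + 1 == 3 * len(residues)


# (start, residues, end) copied from the two revised N-terminal lines of CYP18A1.
segments = [
    (14778, "MITILSNSKLLWGLWQVMSYCTSRTTMPLLLTVGLTFLVMRLINLVREIRKLPPGPWSPP", 14599),
    (14598, "VVGYLPFLGVRHKTFLELARSYGALFSARLGNQLTVVLSDYKLIREAFRREEFTGRPSTPL", 14416),
]
checks = [span_matches(start, end, seq) for start, seq, end in segments]
```

Segments that carry an intron phase marker such as (1) or (2) elsewhere in this file would not be expected to pass this simple test, because their boundary codons are shared with the neighbouring exon.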

 

>CYP303A1 DPGLEAN08762

scaffold6:- 376081-378310

Best Drosophila hit  Cyp303a1 (1e-120)

77% to CYP303A1   Heliconius melpomene

MWLSVLFVASFIGLLLYLDTRKPRKFPPGPNWLPVLGSALEVKKSRDKTGYLYRSIKQLS
NTYCKYSPLIGLKIGKDRIVMVNTVEANKEMLHNEFIDGRPMGIFYQTRTWGERRGVLLT
DGELWKEQRKFLIRHLKEFGFGRSGMSETAVVEAEHMTQDILDIMGENKSATIQMHNFFN
TYILNTLWTMMAGKRYNPSHPEMVLLQTILFELFAAIDMVGCSFSHFPILSILAPTLSGY
KDFIRTHKRIWKFLRNEIASHKSRFDPNKEDKDFMDVYLRVLREHGEVNTYSEGQLVATC
MDMFMAGTETTTKSMSFCFSYLVREQKVQKKAQEEIDRVIGRNKTPALNDRPNMPYNEAI
MLESIRHFMGRTFGVPHRALQNTTLAGYNIPKDTMVVSNFPNILMDDDLYPEPYSFKPER
FMLDGRLSVPDHFFPFGLSKHRCMGDVLAKCNMFVFITTMLQRFSFLPVPGEPLPSLDHV
DGATASAAPFKALIIPRMQDE

 

>CYP304F DPGLEAN03543

scaffold53:+ 142212-151719

Best NR hit (blastp)  cytochrome P450 CYP304F2 [Zygaena filipendulae] (0.0)

65% to CYP304F6   Heliconius melpomene

MIGLLILVFFIIFYCVRCFKNAYKRPSDKFPPGPPKLPIHGAYWIVVLKKINNLAASFKM
LSKEYKTKVLGLYLGNFVTIIVDDPHLIKECLNREEFDGRVDFLVARLRSFWKKLGIFFT
DGYFWHVQRRFSLRYMRDFGFGRREETLETCFSNDIKEMINMATYGPNYPAEKAIIKGDL
ICLQDYFAVPFINGMLHVLVRNPLPRSEYKVLWELSRYALKFQRGTNDLGGAVSLTPCLK
DLMPKLSGYHDLRSGNQYLLDFFNKIIKDQMSTYDTTHLRHFIDVYVKKMKEEEKDTKRS
TFSVEQLQLICTDYMFPAASAVQAVLTFLVERLLIQPEVQDRIHEEIDRVVGRDRMPSLD
DRQNMPYTEACIREIMRFETLVPLGVLHRTVKPTTIDEYHIPENTVVAFNYVSLHMDKKL
WGDPENFRPERFIKNGILNLSADKSLPFGSGKRLCAGETYARQSMFLVFSAFMQAFHVST
VTGKPLKKPASRIQGIITTIKDFWVKITPRS

 

>CYP305B DPGLEAN15169

scaffold7:- 436394-441710

Best NR hit (blastp)  cytochrome P450 Cyp305B1 [Bombyx mori] (1e-155)

71% to CYP305B1   Heliconius melpomene Hm1250079b
REVISED SEQ.
472761 MLLFILIVMLVVVYMTKTVIKPKNYPP 472681
441758 GPKWYPVVGCTKIVNE

MSRQYGCQWKGLSNLAKQYSTQVLGIKLGLEPIVVVYGDHNVRRVFTEKVFEGRPNNFFL
RLRCLGKKMGITFADGELWKEHRQFTVKHLKNVGFGKTAMEGEIQRELQYIIQYIDNHNN
TPINPKTILAISVMNILWKFTAGEHINMERLIFLLDLLKARSKAFTMAGGWLNQWPWIRF
LFPNGSGYNLIRKINQQLSDIIEEAILKHKNESIPGNDFIYMFLNKIKENRESFTEEQLK
VICLDLLIAGSQTTSNTLEFAILGVLRNKIIQEKIFKEIHTILGNKMPSWSDSNRLVYTM
AYLYEIQRYFTIVPLAGPRRVSKDVTMDDFLIPKDTTVLISVGDVHFDPEMWEAPDKFMP
ERFIDDKGKLTKTENIYPFGIGRRRCPGDALAKSFIFIVFVGILQKYQIVCRDDELPSDE
PIVGLISSARPYTAEFRMRNT

 

>CYP305B DPGLEAN15167

AGBW01010572.1 Danaus plexippus

scaffold7:- 458170-472737

Best NR hit (blastp)  cytochrome P450 Cyp305B1 [Bombyx mori] (4e-59) short

53% to CYP305B2 Heliconius melpomene Hm1250079a
REVISED SEQ.
MLVVVYMTKTVIKPKNYPPGPQWYPFFGCNNIIHAKAIKFGSQYKALQEMAKEYRTQVLG
LKMGSDMVVAVFGEKNIRFVCNDLNFDARPESFFIKLRTFGKKCGITFVDGPLWREHRHF
VMKCLRKEGFAKEVMESDIQAKTEKILEILKNSNGTPVNIRFLVAMAVTNILWKYVAGEH
LRETQLHKFILLMSKRSEVFTMAGGWLNQWPWLRFIAPNWSGFSIIQKMNSEMFEIIH (0) 460797
460701 ESIRKHREGKVKGSDYIYKFMNEMYRESFT (1) 460612
460196 EDQLRGVLVDFLIASTQTTGSIIGFAIISLIRNKDIYDKIYNEINMILGKNPPKAKDCTR (2) 460017

LVYTSAFLYEILRKFTIAPFSGPRRT
LKDTIIDGYNIPKGTTILISLDDLHSSSEIWNDPEQLIPERFIDENGVLKNTNKVYPFGL
GRRACPGEPLSKSFIFIFIVSIIQKYKIECSDGEMPSDEPIIGILTEPKPFTARFIERPD
A

 

>CYP306A1 DPGLEAN09332

scaffold152:+ 8483-11568

Best NR hit (blastp)  cytochrome P450 CYP306A1 [Spodoptera littoralis] (0.0)

72% to CYP306A1   Spodoptera littoralis

MDVIFLWLVTIVLGFLLLKQLYRWRSLPPGPWGLPVFGYLPFLDPEQPHLTLTKLAERYG
PIYGIDMGSTYAVIISDYKLVREAFAKDVFSGRAPLYLTHGIMRGFGIICSEGARWKDQR
KLVTTWLKSFGMSKHSVSRDKLEKRIASGVHEVVQNLRENNGSPVNLSDMITHSLGNVVN
DIIFGFKYDADDKTWRWFRQIQEEGCHEMGVSAIVNFLPFMRFISPSIQKTMEVLIRGQA
QTHRLYASIIARRRKMLGLQKPAGAERPAHDKLFDEHPEGFIKCTKYSKNASNDEVHFFN
PDVLIPSQDECILDKFLIEQKRRYENKEESAIFVTDEQLHFLLADMFGAGLDTTSVTLSW
FLLYMALYPDEQELVREEILSVYSEECEIDSSKLPKLMAAICETQRIRSIVPVGIPHGCL
QDTYLGNYRIPKGAMIVPLQWAIHMDPNIWEDPHIFKPSRFLDENGKLLKPQEFIPFQTG
KRMCPGDELSRMLTVGFMVQLFRSFRVRLESKPPSTKEMQGKVGVTLSPPHVLFVCDSL

 

>CYP307A1 DPGLEAN00751

scaffold5170:+ 3599-7502

Best NR hit (blastp)  cytochrome P450 CYP307A1 [Spodoptera littoralis] (0.0) spookier

81% to CYP307A1   Heliconius melpomene

MSALLILAFCAIIYKLFSRRTIKWRRATKYGEEKVEDLKQAPGPIALPIVGSLHLLGKHE
SPFQAFTELSKDYGDIYSIKLGSSECLVVNNLDLIREVLNQNGKFFGGRPDFLRFHKLFA
GDRNNSLALCDWSNLQLRRRNLARRHCGPKHHTDNFSRIGQVATFEAIELVHTLKNITRN
STDSINLKPHLMATAMNMFSNYMCNVRFDDNDAEFRKIVDNFDEIFWEINQGYAVDFLPW
LAPFYKKHMDKLSNWSQDIRTFILSRIVEQRETNLDMDGPEKDFLDGLLRVLHDDQTVDR
NTIIFMLEDFLGGHSSVGNLVMLCLAAIARDPEVGKKIRSEIDGVTKGKRPVNLTDRPSL
PYTEATILECLRYASSPIVPHVATENANVAGYGVEKGTIVFINNYELNTSTKYWDEPQKF
DPSRFLERTKIRARRNSLCDSGMESDSERTGPINRATEIEKEVVSVKRNIPHFLPFSIGK
RTCIGQTLVTTMSFVMLANIVQEFDVGAVNLEDMKQKPACAALPKDTFNLYLLPRKY

 

CYP3 clan (36 sequences, 9 complete)

 

>CYP6AB  DPGLEAN11529

scaffold3734:- 10943-12394

Best NR hit (blastp)  CYP6AB4 [Bombyx mandarina] (0.0) short

69% to CYP6AB25   Heliconius melpomene Hm1245305

missing N-term ~59 aa

MVKHDKPVPFLGNNARVYTMQMSVTQLSDEVYWRYPTERVVGFYRGLLPELVVRDPELVK
RVLVTDFASFYPRGLNRHREVELLLRNLFFVDGDLWRLLRQRMTPAFTSGKLKAMFPLIE
ERAERLRSRALSASAKGRPVDARDLMARYTTDFIGACGFGLDSDSLQDENSAFRKLGAKI
FDIPIKDVILSVLKEMFPSTFKNIRVMARVEKDLMALVNEVLRQRNHEPSGRNDFIDLML
ECKKKGIITGDSIEKVNSNGSPEKASVEMTNELIAAQVFIFFAAGFETSSSATSTTLHEL
AYHPEVQKKVQNEIDTVLAKYNNKLCYEAIKEMTYLEWAFKEGMRIFPSLGFLIRQCAKP
YTFEDLGLTIDEGVKIIIPLQALHNDPQYFDNPSEFRPERFDPQNFDSRNRFVYLPFGEG
PRACIGERMGMMQSLAGLAAILSRFSVQPAPGAPRKPTIDTRSNIVQVIRGGLPLIFTER
RPD

 

>CYP6AB DPGLEAN13403

scaffold1828:+ 35191-35463

Best NR hit (blastp)  cytochrome P450 CYP6AB5 [Bombyx mandarina] (3e-24) short

65% to CYP6AB1 Trichoplusia ni

C-helix region

MSTTQISEEMYWKYPTERVVGCFYGMQPGLIVRDPELVRHILTTDFSSFYSRGINSHQKI
EPLLQNLFFADGDLWRLLRKHMTLPLQILN

 

>CYP6AB DPGLEAN13404

scaffold1828:+ 35522-36556

Best NR hit (blastp)  CYP6AB4 [Bombyx mandarina] (3e-126)

64% to CYP6AB25   Heliconius melpomene

missing N-term

LTIDARDIMARYTTDVIGRCVFGLESDSLHDENSAFRKLSQTIFNITTKDAIVIYLKQMF
PSTFKNFKVLNRVEKEMLALVDELLRQRNYKSTGRNDFIDLLLECKKKGVITGDSIEYMN
QDGSAKKATVDLTYSLIAAQLFIFFAAGFETTCSSTSITLHELAHHPEVQKKVQEEIDRV
LKRYNNKLCYEAIKEMTYLEWTFKEGMRIFPSLGFLIRQCTRRYTFNDLDLTIDKGVKII
IPLQAMQNNPKYFDNPSEFCPERFDPKNIENGSKFVYLPFGAGPRACIGERMALLQSLAG
LAVVLSRFSVHPVPEAPRKPFVDPRSNIVQVIRKGLPLTFTVRK

 

>CYP6AB DPGLEAN03899

scaffold1021:- 15632-16213

Best NR hit (blastp)  CYP6AB4 [Bombyx mandarina] (1e-48) short

57% to CYP6AB8   Bombyx mori

MLECKRKGKITCDSIERVSADGLPEKATIEMTYELIAAQ

XXXXXXXXXXX

ASRYTSITLHELAHHPEIQKK
VQREIDTVLANYDNRLCYEAIKEMTYLESAFKEGMRTFPSLGFLVRQCAKRYTFEDL

XXXXXXXXXX

IITLQALHNDPKYFENPTEFRPERF

 

This part not P450 seq

TRPSPAAMISSCTCRLARVRARVSASGWVIGTYICIVHATIYIYIYIYIIK

 

>CYP6AB DPGLEAN07566

scaffold1697:- 22159-23604

Best NR hit (blastp)  CYP6AB7 [Depressaria pastinacella] (5e-119)

74% to CYP6AB15   Heliconius melpomene

MFPLIVERTEKLAKLAEGMAETCDELDIRDLMARYTTDFIGACGFGIDSAALNDENSDFR
KLGKRIVTVTKRDHFVSLLKRMMPVLCKDLHFLAPEIENNTVSIVQQIMAQRNYKPSGRN
DFVDMMLELREKGNL

(gap ~20 aa)

VELDDQLIAAQVFIFFVAGFETSSSASSFLLHMLAYHPEVQEKCR
KEVDEVLKNHEGKLSFDAVKDMKYLEMSLKESVRFLTSPGFLIRRTVNKCTLPGTNFTLD
ENMVMIVSSQAMNMDGELFENPEEFRPERFNPDNIGDIKKCTFMPFGDGPRSCIGERLGI
MQSLAGVATILSKFTLSPSRSSLRKPRIDPMSTIVQTVIGGLPLSLKRRKDIL

 

>CYP6AN DPGLEAN15918

scaffold337:+ 268848-270578

Best NR hit (blastp)  cytochrome P450 6AN5 [Manduca sexta] (1e-164)

62% to CYP6AN13 Heliconius melpomene Hm1249436a
58% to CYP6AN1   Helicoverpa armigera
complete

MFLIILACLVALYFYTTRNHRYWAKKNIKHERPIPLFGTHFRNIVALKSASELIHEFYVK
FGDEKVIGAYRGNDKELIVKDPDIIRDILMVDFNSFHNRGLGRDPEKEPLLKNLFHAEDD
LWKLLRHKLTPAFTTAKLKSMFPLIVNCAEKLVGLGENIIAKGGDCDIREVMAKFTTEFI
GACGLGIEMDTINNESSIFRQIGKKMFSRSDMEVFLFAVWDVFPEIRNLMNLSNKKVEQV
FFDMVTKIFEQRNYKPIGRNDFVDLLLDYLSQGKIQSESIKYKNPDGTAKQVEMEMDIEV
LVAQVFVFFAAGFETSSSTTSYTLHELAYHPETQKKIQDEIDRVLSKYDNKLCYDAVNEM
TLLDMAMKEALRIMPAVGILNRQCVKPYTIKQVGLTIDPDVNIVIPVQSLHLDEKYFDEP
YEFKPERFADEDFNQRTKNVYLPFGAGPRACIGARLGQMQSLAGLAVMLHNFSVEPSENT
KRTLDINPRLNVVQGVLHGVPIKLVKRK

 

>CYP6AW DPGLEAN21828

scaffold461:- 34292-42101

Best NR hit (blastp)  cytochrome P450 [Bombyx mandarina] (3e-139) fusion?

74% to CYP6AW2   Heliconius melpomene
missing the N-term

MRLSYTKMLQFFYDKYRDERYVGIFQARRPALMLIDLELIKFVLSKDFQHFTDRISVSTD
TQREPLLRNLANMSGTEWQKMRHIVTPTFSSAKMKAMFPLVVDCARTLQTTLENESIEDI
EVPKLMCRFTTDVLGSCAFGVDPGSLKDKMSPFFIMSQKMFKTDRSTILKRYCRSFSPRL
FKFLNLRTYSLDVEVFFTNIINQVLNERRTTGKQRSDFLQLMLNVQKTEIGFTMTDELII
SNSFIFMLAGLETSATTLSFCLYQLAKDIDLQNRLRDEVRECIENHGGFNYDAIGAMRLA
TQTYLETLRLHPPTPLTTRLCTSPCTLPGTGLNMKVRDAVLVPIHQIHKDPRHFPDPEKF
DPERFGGAMNVNGFIAFGDGPRSCPGGRFAQMMVVAGLATILQNFSVEPCSKTTPTIQYE
TRSVTDIEVLQAINQGDFIEDCLNIFV

 

This part not P450 seq

EFSNNKHYEHNIEPNIPLSVLDRGSELEVRAQK
SEFNLSPTLAYTRPPPTANLVAAVADVINNITLYKRAPVQLWNEKGLYRVLFRCISQPNV
SQRSFAHTAVCRALAASSTHKCVRVALANTKDCVYHLLLTLTPIESDPSWVLIASCLSSV
LCSSVRARSFVVHRQLFRDISGVLHTMRDHLTLMGKPIDVIRNANHEPTLNTLNWVLILA
SSMMVDNPPAKDRLSEDIAASLTRLWPWCMMTEELRNSVMQFLLIFTNDCPKGAMAELCI
SCRPSTENRRRYDWVSREMLNLDITQIVILPQTRVAPSENYYFADVHGRRYLHQVGRLLV
SLS

 

>CYP6CT DPGLEAN16301

scaffold108:+ 72312-78084

Best NR hit (blastp)  cytochrome P450 CYP6CT1 [Zygaena filipendulae] (9e-174)

55% to CYP6CT1   Zygaena filipendulae GQ915316
complete

MAFTVLYVWFQYKFTYWSSKGVFSPTPVFLFGNIQDVIKRKTQFFQPYCDNYFKYKHLPY
IGMYCFNKPVLSIHDAELAKHILIKDFEHFQSHGIFSGGVGDPLAGHLFNLHGSAWKLLR
NKITSAFSSCKLKCMYPLVEKISKEALGYGDLLHARSESINFSEFYEKYTMEIIGNVGFG
VECNGFKNSNSEFYLRGHEYFNPNSMYWTLIRALAFFMPNFFDKLKIRRINPDIINFFDN
LVRETVEYRRKHSYKRNDFLQTLIDLNNDSSKCEERESQKGVFTLTDVTSNTMLYMFAGY
ETSATTGQFAAYELAKNPHIQTKAREEIRRVLAKYDGECSYEAQGEMTYMNMILDETMRM
YPPLRSLYRGCTKEYRIPDSDVTIEEGTLVLIPIHAIQMDPEIFQDPETFDPERFSPDRK
KLIHPCHWMPFGEGPRKCLGLRQGYIQSKLALVKLLHKYELLLDDRTAVPMKIKATSLAC
AADGGVWIRLKKLTDAVN

 

>CYP6FA DPGLEAN14423

scaffold12:- 153238-154799

Best NR hit (blastp)  cytochrome P450 6AE32 [Manduca sexta] (9e-145) short

59% to CYP6FA2   Heliconius melpomene

MVKDPELIKLILSKDFYYFNMREIADFIHNEPVTQNMFFGGGDRWRVIRQNLSPLFSSIK
LKSMFYIIENCAKTMEATIREEMNESGTFEVKSLLTRYTIDCVGSCVFGVNTGTLGKNYD
KNPFTLIAGQLFDVTSYGGFKLVTRAMWPSIFYGLGLKMFSDNVNNFFSNLLIGVFESRD
HKESGRHDFVDLLLNWKQKDFLCGDSITNMKTGEKKTIQLNIDEGLIIGQCALLFAAGYE
TTATTTSLLLYELAKNKDVQARVIDEVDEYYKRHHGKIEFEAINEMPYLQACIDETLRFY
PVLGLLTREAVEGYTLPTGLHLPKGTRIHIPVYHIHHNPKYFSEPERWLPERFYGEGKHF
ITPFTYMPFGEGPRVCLGKRFSKMPMYAGLLTIFKNYELELDPKTPSSYKFQPRTFVTQA
LGGIFLKFIPRK

 

>CYP6FA DPGLEAN14425

scaffold12:- 140835-142166

Best NR hit (blastp)  cytochrome P450 6AE32 [Manduca sexta] (3e-132) short

59% to CYP6FA2   Heliconius melpomene Hm1250808b

missing N-term

MFFTGGDQWKVLRQNLSPMFSSKKLKNMFYLIEVCSKTITEYMRSEIQHSNVIEVRSLLA
KYTMDCIGNCVFGVNTGTLGKNTDNNPFTIMGNKIFDISTYRGFKMVARAMWPSIFYSLQ
LKMFSEDINKFFHNLLTDVFKTRKYKESGRHDFVDLLLCWKQKDFIVGDSVSNNIKGEKS
QVQIKVDDECLISQCVLLFAAGYETTATTTSFILYELAKHKDAQIRVVKEIDEYYERHEG
KIEFESINEMPYLQACICETLRLYPVLGVITREVFEDYTLPTGLLLEKGTRIHIPIYHNQ
MNTKYFPNPEEFRPERFLGEEKKNIKPYTYLPFGEGPRMCIGMRFAKMPMVTALLAIFKD
YRIETTERTPKSFDFEPRAISTQPLGGIYLKFLSRN

 

>CYP6 DPGLEAN14441

scaffold12:+ 138276-139838

Best NR hit (blastp)  cytochrome P450 [Helicoverpa armigera] (1e-127)

53% to CYP6AE35   Heliconius melpomene
missing N-term

MTKDFYYFNGREVTQYNTREELTNNMFFHSGDKWKVVRQNLTPLFTSMKMKKMFHLVKDI
GDSFEKILEDELKSSSDVEIKSFLVKYTLECILSCGFGISGKLLEKGNKKDDNVFFMIGS
SIFDSSPSRGMKQVFRSIWPKLFYGIGLNVFDPKIFAMFKNLIAEVFQSRLSEQSNRNDF
VDLILSWKKEHYIYGDSLNPNSEKYKKISIEVTNELLIPQCVLFFAAGFETTSTTLSFLM
YELSKHRDVQDRLIEEVDDYYKKHNGVVEYECIGEMPYLDACIDEVLRLYPVLGVVAREV
MEDYTLPTGLKLEKGNRIHIPLYHIHRDPKFFPNPEEFRPDRFFGEERKNIVPYSFMPFG
EGPRICIGQRFAKMVMYNLILKIFKKYRIESSSLTPKSLEFNNISLVTQCNTSIYVKLVE
R

 

>CYP6 DPGLEAN00371

scaffold154:- 377523-378975

Best NR hit (blastp)  cytochrome P450 6AN5 [Manduca sexta] (8e-76) short

Missing the N-term

53% to CYP6AB3 heme1 Depressaria pastinacella
53% to CYP6AN5 Manduca sexta

MKKVQIFVDLVKKVCETRNYKPSGRHDFIDLLLEVALNGKLKGESNEKKNPDGSPAPIEF
DLSDPYNFAAQVFVFFTGGFETSATTTSFALHQLAMHPDIQRRVQNEIDEVLSKFDNKLC
YDAVRRMTLLEMVFLEAMRMFPPGGVLVRVCTKKYTFPGTSTTIEPGVHILIPVAVMHMD
ETIFENPEKFHPERFSSGYKESTGDMYIPFGVGRRMCIGSSLGHMQSTAGLAAVLQRFNV
ELAPTATEHVKPDRNIFYIHKAAGGLPIKLVPRILKS

 

>CYP6 DPGLEAN14424

scaffold12:- 147379-151790

Best NR hit (blastp)  cytochrome P450 [Helicoverpa armigera] (1e-103) short

50% to CYP6FA2   Heliconius melpomene Hm1250808b

MSVTTPQDIVTPKALSAGTRASLAVVNLLQPEHGPSRPGKHLTEDRREVVFNGCISSDDR
DSGLPFTLPTFSFHSPLIRNVPCLKPLPFFGNYKDYILLRKKFLFVTDEICKQFPNEPYI
GTLYGTEPTLIVKDPNLLKLITTQDFYFFSGREISNYNHKETVTNNMFFHGGDKWKVVRQ
NLTPLFTSMKMKKMFHLIKNIGDSFEKVLQVEVESASDTEIKTKLVKYTLECILSCGFGI
SAELLQSEGNDSIFINVGNSLFDNSKTRHFKNIIRSIWPKFFYALGLNLFDPKVAPVFNN
LLSDVFQSRLNKTSNRNDFVDLILSWKNEKHIYGDSLDPRSDKKISFDVTDVLLISQCIL
FYAAGFETTSATLSFLMYELSKHRDVQDRLIEEVDNYYKKHNGVIEYECIGEMPYLDACI
HEVLRLYFLL

 

>CYP6 DPGLEAN14442

scaffold12:+ 145023-146632

Best NR hit (blastp)  cytochrome P450 [Helicoverpa armigera] (5e-131)

50% to CYP6AE35   Heliconius melpomene

MWFNTLVLATLSFFFAFYYFVVRRLNYWKSRNVPCLKPLPFFGNYKDYILLQKKFLFVTD
EICKQFPNEPYIGTLYGTEPTLIVKDPKLLKLITTQDFYFFSGREITDYNHKETVTNNMF
FHGGDKWKVVRQNLTPLFTSMKMKKMFHLIKNIGDSFEKVLQVEVESASDTEIKTKLVKY
TLECILSCGFGISAELLQSEGNDSIFINVGNSLFDNSKTRHFKNIIRSIWPKFFYALGLN
LFDPKVAPVFNNLLSDVFQSRLNKTSNRNDFVDLILSWKNEKHIYGDSLDPRSDKKISFD
VTDVLLISQCILFYAAGFETTSATLSFLMYEL

(seq gap ~40 aa)

IHEVLRLYPVLGVLTREVMKSYTLPTGL
KLEKGNRIHIPLYHIHRDPKFFPNPEEFRPERFFGEEKKKVVPYSFMPFGEGPRICIGQR
FAKMVMHNIVLKIFKNCKLAESSLTPKKLDLTDVSIVTQTNDEIYVKLIKRL

 

>CYP6 DPGLEAN01161

scaffold614:+ 82963-84364

Best NR hit (blastp)  cytochrome P450 6AB13 [Manduca sexta] (6e-62) short

54% to CYP6AB21   Heliothis subflexa

missing the N-term

MFSFITDGAEKLQSLANDISELDSYDIKELMARYTTDFIGACGFGINMDTLGNDSSEFRK

(seq gap ~120 aa)
LDWQLKVQNEIDRVLIKYNGKITYEAVNEMQCLEKCFYEAMRLYPAVGFLMRKCTTSSYT
FPEIGLTIDKGVKVIIPIQAIHNDEKYFYEPNKFHPDRFTSSKDMKNSIFLPFGDGPRAC
IAARLGKLLAMAGIAAILHKFQVEPSAESKQFPEPDPMGIVSESFIGGLPIKLRRR

 

>CYP6 DPGLEAN07053

scaffold4254:+ 2947-8531

best match low CYP6AB7 [Depressaria pastinacella] (1e-11) short

a C-term exon fused to some unknown sequence downstream

60% to CYP6AB15 Heliconius melpomene gray region only

PERF to HEME

MRQTIVSTEAVNMGEELFCDPEEFRPERVKPENIGDIKCIFMPFGDGPALVSDLESCTLW
LEWPQFLVSSRCLHHVVRYVHHVSIRCPQSFRLVSADHEDVTRTCEESWENSSVCLSANI
HTTLNTPVLVRSPKLSNVGRALTIPTTKGVDNDNKLAPEDVKIDVFIDRLLKSNNFDRLV
ETIAEKIAQRIGHCDKVKQRVELKETSQDPIKRRAHKRKDAEENTSVTENKLMLKYKLFT
KNFDSDSRRSGSESRNRGLLLDFN
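
For gene models like the one above, where a P450 exon is fused to unrelated sequence, scanning for the heme-binding signature is one quick way to see whether the P450 part is intact. The sketch below uses the widely cited F-x-x-G-x-R-x-C-x-G pattern as a plain regular expression. That pattern is an assumption of this example rather than a rule used by the curator, and genuine P450s can deviate from it.

```python
import re

# Widely cited P450 heme-binding signature, F-x-x-G-x-R-x-C-x-G.
# Real sequences can deviate from it, so treat a miss as a flag, not a verdict.
HEME = re.compile(r"F..G.R.C.G")


def heme_motif(seq):
    """Return (0-based offset, matched text) of the first signature hit, or None."""
    m = HEME.search(seq)
    return (m.start(), m.group()) if m else None


# Second-to-last line of the CYP15C DPGLEAN22546 sequence (complete gene).
cyp15c_tail = "FMPFGSGKRRCIGEGLARSELFLFLVHILQRFNLKIPGGDPMPCPEPIDGVTLSAKEFGI"
# First line of DPGLEAN07053, where the exon runs into unknown sequence.
fused = "MRQTIVSTEAVNMGEELFCDPEEFRPERVKPENIGDIKCIFMPFGDGPALVSDLESCTLW"

hit_complete = heme_motif(cyp15c_tail)
hit_fused = heme_motif(fused)
```

In this pair the complete CYP15C sequence contains the signature, while the fused model breaks off after FGDGP, before the conserved arginine and cysteine.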

 

>CYP9A DPGLEAN04988

scaffold73:+ 53941-58113

Best Drosophila hit  Cyp9f2, isoform A (2e-102)

66% to CYP9A41   Heliconius melpomene
complete

MIVFLLWVVVLVAALALYYRQIYSRFSKFGVKQNNIVPFFGNMLKITLGMEHFAENIDSM
YKNFPEERFIGRFEFSKPGVLIRDLELIKKVTVKDFEHFLDHRGFVDDKVEPLFARNLFS
LKGQEWKDMRSTLSPAFTSSKIKQMVPFMEEVGEMMVRAVKKNIENSKVGQIEIDVKDLT
SRYSNDVIASCAFGLKVDSHTTEDNEFYEMGKKASTFKFKQMLLFFMNSALPFVARLFKL
KLFSRETTDFFVDLVQDTMKEREDKKIIRPDMIHLLMEAKKGKLTYDVANGKADSNVGFS
TVEESDLGQKPVTRVWSDSDLIAQAVMFFITAFETISTVASFTLYELAVNPDVQDKLFQE
IKEHNAKFGGKLDFNSIQNLTYLDMVVSEVLRLWPPAIVLDRVCNKDYNLGKPNSKATED
FIVRKGELVGVPVMSIHRDPQYFQNPNKFDPERFSEENRHLIDLMTYMPFGYGPRNCIGS
RFALCEVKVLTYQIILNFEISPAEKTCIPAKLSTGNLNLRLKEGHGLLFKLR

 

>CYP9A DPGLEAN04989

scaffold73:+ 62041-65458

Best NR hit (blastp)  antennal cytochrome P450 CYP9 [Mamestra brassicae] (0.0)

62% to CYP9A41   Heliconius melpomene
complete

MSIRNRICVLRLQMILEIWIVVLFVAAVLYYRQVYSRFTKYGVKQQTPFPLFGNMWRVTL
RLEHFTDTMNSMYKEFPEERFIGRFEFIKPCLFVTDIELIKKITVKDFEHFLDRSGFSVE
VDPLFSRNLFSLKGQEWKDMRSTLSPAFTSSKIKQMVPFMEEVGEMMIQAVKNKIKNSGS
EWIEIDCKDLTTRYTNDVIASCAFGIKVDSHTNVDNDFYKIGKIASTFKTRDVILFQLYS
AAPFLGKFFKLELLSTKTRNFFIDLVLSTMKERETHNIIRPDMIHLLMEAKKGRLHHEKD
VINADDGFATVEESEVGKNNKHKSWSDTDLIAQAVLFFIAAFEVVSTAMTFTLTELALHP
DVKERLALEIKEHNEKYGGKIDLHSIQTLTYLDMVISEVLRLWVPTILIERMCVKDYNMG
KSNGEAKEDFILRKGESLHIPTWCIHRDPQFFPNPNKFDPERFSKENRHLIHPIAYMPFG
LGPRNCIGSRFALCELKLLVYQILLNFDLSPTKKTCIPARLNKSSFHLQLEGGHALKFRS
RYNLIL

 

>CYP9G DPGLEAN03586

scaffold2875:- 6924-10657

Best NR hit (blastp)  cytochrome P450 9G3 [Bombyx mori] (0.0)

96% to CYP9G7 Danaus plexippus (monarch butterfly)
complete

MITEIVVFLLTSLIVYFIYIHKSIHKYFDDRGIKYLPGVPFFGNNLRSSFLKVHVLEDLE
AVYKAFPEERYVGYIEGTTKIIIIRDPEVMRNITVKDFQHFTDHKTFVSEDSEPLFGRSL
FFMKGERWHSMRTTLSPAFTSSKMKLMMPFMSEISSNIVEYLKGHVNEDVDVQDLMRRYT
NDVIASAAFGLPVNSVKDRDNEFFTIGRNLFSFTFFQKIYSIFVALFPNFMKKLGITVYP
TKVINFFKGIVVNTMQNREKNNIHRPDMIQLLMEASKGILKDESDTNGNKSQNNETSKQK
PIKEWSVEELSSQVFLFFAAGYESSASTLVMCVHELALNPDVQEKLYQEIKEYKEKHGEI
TFEHIHNLKYLDWVLSETSRKWSTIFVLDRTCTLPYELPPPREGLKPVQLKPGDVVYNMV
NCIHKDPIHHPNPEKFDPERFSDENKHKIKPFTYMPFGMGPRNCIGIRFAQLELKILIFD
IVSNYKIVKCTKTMDPVVLKPHAFNLQPRDGSIVRFVPRHDN

 

>CYP9G DPGLEAN12525

scaffold3896:+ 9024-13188

Best NR hit (blastp)  cytochrome P450 9G3 [Bombyx mori] (9e-44) short

72% to CYP9G9   Heliconius melpomene
C-term

MDRACTIPYELPPPREGLKPVQLQPGDVVYNMVNCIHMDPIYHPSPEKFEPERFSDENKH
KIKPFTYMPFGMGPRICIGIRFAQLEIKILIFELLLNFKIVKCPKTMDPVVLKPDTFNIL
PKDDSIVRFVPRYDH

 

>CYP9G DPGLEAN10930

scaffold34310:- 3041-3651

Best NR hit (blastp)  cytochrome P450 9G3 [Bombyx mori] (8e-33) short

69% to CYP9G9   Heliconius melpomene

C-term

MNFLHPEKVKPGDIVYNMVICIHKDPIYHPNPEKFDPERFSDENKHKIKPFTYMPFGMGP
RICIG
IRFAQLEIKILIFELLLNFKIVKCPKTMDPVVLKPDTFNILPKDDSIVRFVPRYDH

 

>CYP9G DPGLEAN12677

scaffold2384:- 3701-6076

Best NR hit (blastp)  cytochrome P450 9G3 [Bombyx mori] (2e-85) short

61% to CYP9G9   Heliconius melpomene
missing the N-term

MQHREKNNIHRPDMIQLLMEASKGTLKDERYANGNKSQNNEAFKQRPEKEWSVEELSSQV
FLFFAAGYESSASTLVMCVHELALNPDVQEKLYQEIKEYKEKHGEITFEHIHNLKYLDWV
LSETSRKWSTVLVLDRTCTLPYELPPPREGLKPVQVSYLLPTLCLWLQLKHGDLVYNMVK
SIHMDPIYHPNPEKFDPERFSDENKHKIKPFTYMPFGMGPRNCIGIRFAQLELKILIFDI
VSNYKIVKCTKTMDPVVLKPHAFNLQPRDGSIVRFVPRHAH

 

>CYP321C DPGLEAN20672

scaffold1213:+ 39371-40861

Best NR hit (blastp)  cytochrome p450 321B1 [Spodoptera litura] (5e-65) short

56% to CYP321C1   Amyelois transitella

missing middle

MFFSVLVIFFVILPLIIYTIGKRNENYWKRRGIVFHSKNKITGPFWEVFTSGEPFYLHLQ
NIYNQYPDQAAVGIGGFLTPSLYVRDPKNVQAILSSDFNSFYHRGFEVNENDKLANNILF
LNGSKWKLMRQSMTPLFTSLKLKNMFYIMDKSAQDFVQYIKESPEIRKGNTFQTLSTFCS
AAIGASVF

(seq gap)

GLELEPTYALLAAQAFFFFNAGVEPVAAAMFSALIEIGKNPEIQKKVHEEID
GTFDKNNGKLNFDIIAEMVYLDMVINEAMRMYPPIGFLSRQCVEDTVLPSGNIAVEKGTK
IFVPIFEFHHDPKYFPNPEEFNPERFSRENKKKMSDITYLPFGKGNRICIGMRYAHMQTK
TGLVHLLRNFNVKTNIGKGGIKYVKHQFQMRLTNVDVEFLNR

 

>CYP321 DPGLEAN20673

scaffold1213:+ 43910-44956

Best NR hit (blastp)  cytochrome P450 monooxygenase CYP321A1 [Helicoverpa zea] (2e-96)

54% to CYP321C2   Heliconius melpomene

C-term part

MDKSAQDFVKYLKNDRELRKYNAYQKLSMFCCAAICGTVFGIGANSIFDLPFLKIAQKSF
RSTLKINMRFALSTLSRTLFNKFKIHFFKEYEDFFIGAIKQVVHQREQDNIKKHDFVDTW
LALQKSGKLKDSDNGFELETTNELLAAQAFSFFIAGVEPTATAMFATLFELAKNPEIQQK
VHAEIDRVFENHSGALTYEMISKMVYLDMAIKEAMRLHPSIATLSRRCVQNTVLPNGNIV
VEKDTKIFIPVYELHHDEKYFPDPEAYKPERLSRENKHEIIEFTYLPFGKGNRTCIGMQY
AHMQIKTGLVHMLRHFTVHTDIQQGKQKYLKHLIQLRLDHNDIKFLTR

 

>CYP321 DPGLEAN20671

scaffold1213:+ 36083-37177

Best NR hit (blastp)  cytochrome p450 321B1 [Spodoptera litura] (2e-114) short

58% to CYP321C2   Heliconius melpomene

missing N-term

MTPLFTASKLKSMFYIIDKSAHDFLKYLRENKEIQKGDAFKTLSTFCSAAICATVFGVNT
ESIFDSPFLKIARGSFRSTLITNIQFLMSHFAEKFCQILNIKAFKQFEDFFVGAIKQVIR
QREIENIRRHDFADICIALKNSGKLIDPDTGLELETTDELLAAQAFFFFVAGVEPIATAM
FATFVELGKNSEIQKRVQDEVDGVFEEHNESLSYDIISEMVYLDMVIKEAMRLHPAIGFV
TRKCVQNTILPVGNIKVDKGTKIFVPIYELHHDPKYFPEPELFKPDRFSSENKHNILDIT
YLPFGKGKRICIGMRYAHMQVKTGLVHILRHFSIKTNINRDGIKYMKHQIQVRLENVDVE
FVPR

 

>CYP324A DPGLEAN11237

scaffold5202:+ 11854-14073

Best NR hit (blastp)  cytochrome P450 332A5 [Manduca sexta] (2e-88)

56% to CYP324A3   Heliconius melpomene
complete

MSVLFLLFSLMTCFLAYLTLRWNKVKNYWAQRGVPHSPPNPFLGSLTFIQRKNVGVWIRD
MYEHFKSPYIGIWLIWKPALIINDPEIARRILVKDSSIFRDRYLSSGSSDPIGALNIFTM
NDPDWSNIRRKLTNLFTAAKLRFVQSFALVKAKELVQRVDRDRNKGLELKTLFVDYTTDV
IGTFAFGLESNATLTSEGPLRKVTDDFMRFSVYRGLCWFSIFFWPGLVDIFRFSLFPRDT
TDFFKKIYLNIMDQRHKHPDGKQYKDLVDALIEIKKESEEKNQNYPDDLYLAQAAIVLLG
GFDSTASALTYMTYELAHDSESQEKLYRELKEAERNGANFDAQTLTELTYLNCVFKEVLR
KYAPMGWLDRIAATDYNIDENLTIAAGTVIYVNAIGMHYDPKYFPEPYKFNPDRFLPENE
SNIEPYTFMPFGDGPRVCIGQRFAYMSARTAASQLFLKYKVQPIPGSPKPKDVKIESKGL
FLGPGEPVHVEFIPRTENGHD

 

>CYP324A DPGLEAN01474

scaffold513:+ 1830-4244

Best NR hit (blastp)  cytochrome P450 332A5 [Manduca sexta] (9e-81)

55% to CYP324A3   Heliconius melpomene

missing C-term

MYEHFKSPYIGIWLIWKPALIINDPEIARRILVKDSLIFRDRYLSSGSSDPIGALNLFTV
NDPVWTSIRRKLSNVFTVAKLKALHHYTLSKVEELMRRIERDREKGLELKRLFVDYTTDV
TGTFSFGIESNATLTSKGPLREITADFGKFSIYRGICWFSIFFWPDLVDIFRFTMFPKKS
MHSFKRIFETTLNRHSNDIGGKDFKDIVDGLIEFKKEKEQKHQEVSDEFLIAQAAILLFG
GFDTTASNLTYMTYELAFNSECQEKLYNELKEAEERNGGNFDADTVSELTYLNCVLKECL
RKYPPMGWLDRIAATDYKIDDKLTIKAGTVVYVNSIGFHYDPKYFPEPTKFNPDRFLPEN
INKIKPYTFLPFGDGPRVCIG

 

>CYP324A DPGLEAN14574

scaffold8136:+ 960-2341

Best NR hit (blastp)  antennae-rich cytochrome P450 [Phyllopertha diversa] (7e-60)

56% to CYP324A3   Heliconius melpomene

missing the N-term

MVDYTTDVIGTFAFGVDSKATLTGEGPLRDITRDFIKLSLYRGLSMFSIFFWPEITDIFR
SKFFPRSSTKYFCEVVDRIMTHRKKENIRRNDLIDCLIKIERESEENNEKLPNGMIAAQA
AILLLGGFDTSAMTLTYAIYEIAHCPDIQEKLYRELLKAKERYGNEDFDMQTLGEMTYMN
CVIQETLRKYAAMTWLDRIAQSDYKIDDTLTIQGGTPVYINILGMHYDPEYFPDPYTFNP
DRFLPENNTNIKPYTFMPFGEGPRSCIGKRFGYMTMRTALAALFLNYEIFPLPNTPTPAQ
TKIDPYSMVYTPGEKLRVEFVPRNK

 

>CYP324 DPGLEAN18255

scaffold1806:- 22834-25048

Best NR hit (blastp)  cytochrome P450 CYP332A1 [Bombyx mori] (1e-24)

52% to CYP324A2   Heliconius melpomene
N-term

MFSGVLLFTSLVIIFGLLYIKWRQVSRYWSDRGVFYLAPNPFLGSLNFLLRENAALWLIR
MYEQCRRPYMGIWLIWKPALLVNCPNLARKILNQDFVNFRDRHISSGTTDPVGSLNLFTV
KDPLWSNIRSKITGIFTNAKLKSHQDYTRSKAKELVQRIQNDGYTVLHLKVHTTYTNKNK
TKTKQCPFMLN

 

>CYP324 DPGLEAN18256

scaffold1806:- 16565-20658

Best NR hit (blastp)  antennae-rich cytochrome P450 [Phyllopertha diversa] (7e-62)

56% to CYP324A3   Heliconius melpomene

C

MVDYATDVIGTFAFGVESNATLTGEGPLRDITRDFIKLTLYRGLSMFSIFFWPEVTDIFR
LKFFPRSSTKYFCEVVDRIMTHRKKENIRRNDLIDCLIKIERESEENNEKLPNGLIVAQA
AVLLLGGFDTSAMTLTYAIYEIAHCPDIQEKLYRELLEAKERYGNEDFDMQTLGEMTYLN
CVIQEALRKYAAMTWLDRIAQSDYKIDDTLTIQGGTPVYINILGMHYDPEYFPDPYTFNP
DRFLPENNTNIKPYTFMPFGEGPRSCIGKRFGYMTMRTALAALFLNYEIFPLPNTPTPAQ
TKIDPYSMFYTPGEKLRVEFVPR

 

This part not P450 seq.

NKKSLEDIIDLLKQADAVISDLLPVFVAKELHKLPPI
TFDHLDATRLLRDILKLQESMSLMQGEMESVKQECSSVRHDMKEFCTANSRQKTSQHEAL
QETWPLAHDLPLLGKVSGSFDFTCKTVVDTTAGILKGRPYGGVACCGEKMYLTLSLSYIV
VVLD

 

>CYP332A DPGLEAN17562

scaffold1344:- 9332-11893

Best NR hit (blastp)  cytochrome P450 332A4 [Manduca sexta] (3e-155)

61% to CYP332A1   Heliconius melpomene

missing about 76 aa at N-term

MRRSFSEYAYDLSVKYPRDYLGIYLGMRPALVVQTPQLARKVFSSEYDNFQDRHLYCHES
DPMGSLNIFTVKNPLWKMLRYRLSPMFTSHRLRKITELMNANATELVKKVQRDFVEKKKH
VNLKEIFSMYTSDTVAYSVFGIRVSALSDQDSPLWHITNHMVKWTFYRGLEITFIFFLPA
LGALMRLKLFSKAANDYIKQLYWKIVKDRQNHIKYDDSDLVNHLLKIKEQLELPPEADED
YADNILLAQVAVFILGSIETSSSVLSYSLHELAYHPDEQEKLYSELNNAFLSSEKDYLNY
NELLQVDYLTACIHETMRKFPLLPLIDRLCGQTCVLEDGLKIEKGVPVLVNVVAIHNNEK
YYPEPEKWKPERFMTGNKQDNREFTFLPFGDGPRFCIGKRYGMMQVRAALSQLIYNYKIE
PVVPYKVKPDPHSVILAPQDGLSVKFVPRRTEK

 

>CYP337D1 DPGLEAN18796

scaffold905:- 11572-13143

Best NR hit (blastp) 45% to cytochrome p450 CYP337B1 [Helicoverpa armigera] (7e-123)

45% to CYP337C5   Heliconius melpomene Hm1249880
44% to CYP337A1   Bombyx mori
complete

MVLYALLVTFLILFGYYYRTEKLNYWKKRNVVQINKNTWHFIFGKYSLSEFYKNMYDLFE
AENFVGTFVGLKPVLIIKNPEDIKTVLQSDFQTFHSRGFKTNPKDDLAQNLLLIDNYTKW
RLIRQKISPTFTSKKLKNMYNILERSSSDFVKYIEKNPHMKDNPYKALHKFTSASISASV
FGINPNTKNLIDSPLVDIIWNVADSFASFNFKLALANIFPKLHNFLNLKVFGAQKDVVVD
AIKNILKYRRNTKERCHDFIDACMEMENEGVIKDNVTQYKLKVTPEFLGAQAYSLFFAGV
DTVANSMHFTLLELSNNSEILKKVHDEIDNVFDNCEGSISLKDIMNLKYLDMVISESLRK
YPPIGLMQRICANETFLSSNVKVDKGCVVIVPIYGIHRDPRHFPNPDKFDPERFSPQNRM
NISKFCYIPFGEGNRMCLGARFAMIQMKSGLAWLLKHYTLRGYNYMPNCFEPSLFVIRDP
KARYDLIVRNETVIS

 

>CYP338A DPGLEAN17199

scaffold2254:+ 1221-2961

Best NR hit (blastp)  cytochrome P450 6AB4 [Bombyx mori] (1e-47)

52% to CYP338A1   Bombyx mori
complete

MFFESFVLNLSFLAVLVVALTFDYVTKYFNYWYIRHIPNKTPIPFFGTDYHRVLGLRNTS
EEVSTLYKKYEKEKYIGCIKSRIPDLIVKDPESVKKILSTDFSNFHCRGYGLDRSQDVCL
RNNLFYAEGEKWTLLRRKFELLLNNMNNNIQDSLHDCLSGTNGDTNVQQLLSDILDTVFK
DMLIDNNNDGSAIKYMRKIIQKRTLSDKIKSYLKCIFPSIYNVLGFLVIPRQSLKQFVNV
IKESRIMKVMKKTELIVQDEMMNDVKKYDTDSEFETAFSIIAHFISEGYVPCQYLLTVLL
FELAKNPDIQSKARDSIKILANKDEENNYFKNTIREALRLHPPYSVITRKCMKLYQFNKD
IVIDKGVTITIPVEALHNDDKNYKDANIFNPSRFSDVNEKNYTFIPFGAGPRKCIGEQLA
LEIVGSITSTILEKFYLETCDKTPIVLPVTDFNSARCIERDIWLKFKPIKDLK

 

>CYP354A DPGLEAN18455

scaffold85:- 79184-87591

Best NR hit (blastp)  cytochrome P450 354A5 [Manduca sexta] (1e-164)

58% to CYP354A6   Heliconius melpomene CYP3 clan
complete

MDTNSTFNSLLLNISENWRSLLIFSLIFFIYYHFTETFDYFEKRNIKFRKPVIFFGNSFR
RLMGTISYPNFQLENYNYFKGERVGGIFEGRRPVLYIIDPDLIKSVLIGDSDHFIDRNSL
RSREPRFLSRSLINLKGAEWKAVRSLLTPTFSSSRLKHMQHLIQHCSNQLVKYLEHYDKK
EVELKDLMGHLTLEVTGTCVFGISTDALNDENAEFVKTAENFNYMSLKKRFGLLFVLFFM
PSLLKYLNISLLNYEANKKLVDILINTKSKRILSNTRRNDFLQLLVDASLKEESELSHPG
AKRYLDEDTVNAQALLFLQAGYETSSTLLSFAIHSLAVHQDIQEKLRAHIQDVTKGEELC
YDHLTQLDYLEGVIYETLRLYPSLSRLDRECTKPYIIPGTSIKVNKGDMVSVPVYGVHMD
SDIYPDAESFIPERFNSDAGKKKSHLFLSFGAGPRNCIGLRFALNVAKTTLVSLIRNYKF
STCDKTENPVTFNKRSMLLKAEKGLWVRVDKI

 

>CYP365A DPGLEAN21086

scaffold2003:+ 46017-46861

Best Drosophila hit  Cyp6d2 (3e-15) weak short

70% to CYP365A1 Bombyx mori BABH01031525.1 BABH01031526.1
from aa 48-164

MYDYDYWTNLGIFSPPALPIVGHIWSVVARKEQGGVCFQKLYNEYKDKKIIGVHQFYRRT
LVICDPVLIKRVCVNDFVYFTDRGFYFDKALDPLAESVLYLRGNEWKRLRAKISPLFS

 

>CYP365A DPGLEAN12284

scaffold2653:- 22754-27902

Best NR hit (blastp)  PREDICTED: similar to cytochrome P450 CYP6AX1 protein [Acyrthosiphon pisum] (4e-49) short

65% to CYP365A1   Bombyx mori CYP3 clan
C-term part

MRTFWPAFVLFFKIRIIPKKTHDFFFNVVSNVIKERESGIQEKRGDFIDMMMALKDERDD
DDEDDFRITDMVIAGNAFIIFLGGFETTSSTLAFLFLELAAQPDVQEQMRAEIARALAKH
GAITSDMLQDLTYMDMVIQETLRLYPPFPTIQRVCTRPYQLPGRDVIIPPGTIILFPTLG
LHRDEQYFENASSFTPDRWRRSPPPGVYAPFGDGPRYCIGKRFALIQMKCCLAKLLPRVF
ISPVTDRRGPFPVDLKSPMTLHPADSRVCLSLLTPTQC

 

CYP4 clan (30 sequences, 13 complete)

 

>CYP4G DPGLEAN17271

scaffold67:- 255-1087

Best NR hit (blastp)  cytochrome P450 CYP4G48 [Zygaena filipendulae] (9e-84) short

72% to CYP4G9   Helicoverpa armigera N-term

MTVVEDIQPHQWNSTRLYFYPLVILATSLWLLYRWQQQTKMYKLGNKIPGPMALPIFGNA
LLAVNKKPEQLLSIALEYYAYYGSVVRGWLGNNLIIFLADPNDVEVILNSNVHIDKASEY
KFFQPWLGEGLLISSGEKWRSHRKMIAPTFHINILKSFVGVFNQNSKNVVDKMRGEMGKV
FDVHDYMSGVTVDILLGEILFL

 

>CYP4G DPGLEAN12197

scaffold175:+ 370-3217

Best NR hit (blastp)  cytochrome P450 4G49 [Manduca sexta] (3e-175) short

79% to CYP4G74   Spodoptera frugiperda

MGITKETQDQSGFDYAMAVMKMCDIIHQRHYKIWLRFDAIFKFTSFFEKQKKLLAIIHGL
TNKVIKNKKERYLQNKAKGIIPPTIDEISKSAPKTENYNVLANEKTLADTVFKGYRDDLD
FNDENDVGEKKRLAFLDLMIESAQNGSNKITDFEIKEEVDTIMFEGHDTTAAGSSFVLCL
LGIHQDIQARVYDELYSIFGDSDRPATFEDTLQMKYLERVIFESLRMYPPVPIIARKINR
DVKIATNDYVLPAGCTVVIGTYGIHRNPKYYENPDVFNPDNFLPEKTQNRHYYSYIPFSA
GPRSCVGRKYAILKLKILLSTILRNYKMVSDITEDKFVLQADIILKRHDGFRVQIEPRKR
VPSTA

 

>CYP4G DPGLEAN02258

AGBW01008262.1 Danaus plexippus

scaffold193:- 88148-91793

Best NR hit (blastp)  cytochrome P450 4G4 [Manduca sexta] (0.0)

88% to CYP4G71   Heliconius melpomene
revised seq complete
MSYAAAETVAASSTWAATNLFYVLLVPALLLWYTYWRMSRRHMYELAEKIAGPPGYPLIG
NALEFTGGSDEIFKNIMKRSFEFKNETAVRIWIGPRLLVFLYDPRDVELILSSHVHIDKA
EEYRFFKPWLGDGLLISTGQKWRSHRKLIAPTFHLNVLKSFIDLFNANSRAVVNKLKKES
GEFDCHDYM
SECTVEILL (1) 4630

ETAMGVNKNTQDSGFEYAMAVMKMCDILHLRHTKIWLRPDLLFNFTQYAKV
QNKLLSIIHGLTTKVIKRKKEEFKSGKKPSILETEVTTKDTKTTSVEGLSFGQSAGLKDD
LDVDEDVGQKKRLAFLDLLLESAQGGIVISDTEIKEQVDTIMFEGHDTTAAGSSFFLSLM
GIHQDIQAKVVEELDQIFGDSDRPATFQDTLEMKYLERCLMETLRMFPPVPIIARHLNQD
ITLPSSGKKVPAGTTVVIGTYKLHRSESIYPNPDKFDPDNFLPERSANRHYYAFVPFSAG
PRSCVGRKYAMLKLKIILSTILRNFRVISDLKEEDFKLQADIILKRAEGFKVRLEPRKRL
VKA

 

>CYP4L DPGLEAN03311

scaffold140:- 130028-138940

Best NR hit (blastp)  cytochrome P450 CYP4L4 [Mamestra brassicae] (2e-179)

61% to CYP4L22 Heliconius melpomene
complete

MIIIFSLLALVVTGLISWTILLRDSKKYNVPGPTPYPIIGNGYLFISKSSDFLPQLRRLM
LEYGNAYRVFLLHEPYIVLSHPKYVEPLLSHSELIVKGKSYSYLLSWLGNGLLTSTGQRW
RNTRKFLTPAFHFNILQNFLPVFLKNEKILIKKLRDHADGRAFNVLPIMALTALDNVTES
IMGVAVNAQTNRESKYVKSIETLAKITSSRMRNPLLGEDIIFNMLSAKKQQDEAVKYVHS
ETNKVIKARRQELKLSKITRLEKNNDMGIKNKHAFLDLLLLAEVDGKPINDEHVREEVDT
FMFEGHDTTASGLAFSLYCMSLYPNVQEKILEEQKTILGDDLTRDPTYSEVQQMKYLDCV
IRESLRIFPSVPLIERMITEDSQVGELRIPKNTSVIINILELQRHPDLYEDPMEFRPERF
ETMNAKNAFSWIAFSAGPRNCIGQKFAMLELKATLASIVQKFRILPADSAEPILCAELVL
RSENGVRIKLMPRKSE

 

>CYP4L DPGLEAN19968

scaffold6335:+ 14-1218

Best NR hit (blastp)  cytochrome P450 [Helicoverpa armigera] (7e-37)

MLQFTYASLDLHKYTVTYVADISTVSFIVYVSLLKKYIV

67% to CYP4L5   Heliothis virescens
C-term 

EVGELRIPKNTSVIINILELQ
RHPDLYEDPMEFRPERFETMNAKNAFSWIAFSAGPRNCIGQKFAMLELKATLASIVQKFR
ILPADSAEPILCAELVLRSENGVRIKLMPRKSE

 

>CYP4M DPGLEAN08129

AGBW01003773

scaffold300:- 59305-65744

Best NR hit (blastp)  CYP4M25 protein [Cnaphalocrocis medinalis] (7e-118)

56% to CYP4M27 Heliconius melpomene (partial length)
50% to CYP4M15 Spodoptera frugiperda (full length)

revised seq complete

3391 MLLILLLCSIIFLIILLDWLHKDEGFEKIHGPKCVFILGNSLDILKDS (1) 3248

3158 VKIFYQFRKLAHNYKNCFKLKIFTKKILMVYNPQDVE (0) 3048

ILLSGKKHSSKGFLYNFLRPWLKDGLLLSDGIKWHSRRKILTPTFHFNILRHFNS
VLLEKVQAFVERLNSEVGNSATEIFPLITEFSLNSICETAMGTALEDEKCDMGQKYKDSI
HTLGTYVFSRSTKFWIYPDFIFNLSSLGMNQKKVLEVISQFRNNVIAKRKLDTNVRQIED
DDCDDEYFISKKQRLAMLDLLLQAEKEGHIDSDGISEEVDTFMFEGHDTVATGLQYIFML
LANHEDIQDKVFLECEEILSRNNGRPSINDMAQMKFLEACIKEALRLYPPVYFISRIGDE
PIELSNGHTCPAGTDFNILIYELHRQADQFVEPYKFLPERFLKEPTWHPFSYLPFSAGPR
NCIGQKFAMMEMKLLLSKVLTVYRLLPITKPQDIIFYADILLRSNQVKVKIEKRKK

 

>CYP4M DPGLEAN08130

scaffold300:- 52259-56621

Best NR hit (blastp)  cytochrome P450 [Spodoptera litura] (3e-170)

57% to CYP4M10v1   Helicoverpa armigera
complete

MLTIICVIVLSLCLIHLAFNYNYKARLLRKIPGPDESFIVGNAPKLFVSPGLELFAMSRQ
WSSTFKGIYRFYAFPVASVNIYNPEDVETITSNMKYHEKSLIYKFLEPWLKDGLLISNGT
KWQNRRKVLTSAFHFDVLKKYFDSIDGNSQRLVRVLQQTNGQSVDVVPILSEYTLNTICE
SAMGTALSDESSSEGKMYKNSIYQMGHTLFQRFINIYLHPDFIFNLSFLGRKQKQHLSVI
HNFTKNVIKNRRKLLDIDNISETKVAQEDGNISNVFKKKKAAMLDLLLLAEKEDLIDGDG
IQEEVDTFMFEGHDTTQAALTYCLMSLANEEFVQQKAYAEQECIFAGDNRPATLADLSEM
TYLECCIKESLRLYPPVPFISRKINEPTTLSNYTVPAGASCHIHIYDLHRQESIYKNALK
FDPDRFLKENSVGRHTYAYIPFSAGPRNCIGQKFAMMEMKSSLSAVLRNFKLVPVTSPDD
LCFMSDIILRNHAPVYLKFIKRNRIC

 

>CYP4M DPGLEAN05231

scaffold4749:+ 5992-8071

Best NR hit (blastp)  cytochrome P450 [Spodoptera litura] (2e-87) short

55% to CYP4M15   Spodoptera frugiperda

missing N-term and C-term

MSRQWSSTFKGIYRFYAFPVASVNIYNPEDVEIITSNMKYHEKSRIYKFFAPWLKEGLLT
SNGTKWQHRRKVLTSAFHFDVLKKYFDSIDGNSQRLVRVLQQTNGQSVDVVPILSEYTLN
TICESAMGTTLSDESSSEGKMYKNSIYQMGHTLFQRFINIYLHPDFIFNLSFLGRKQKQH
LSVIHNFTKNVIKNRRKLLDIDNISETKVAQEDGNISNVFKKKKAAMLDLLLLAEKEDLI
DGDGIQEEVDTFMFEGHDTTQAALTYCLMSLANEELVQVNIQKKRQRVRRVHLHIIFRYP
T

 

>CYP4S DPGLEAN09010

scaffold5075:+ 191-10345

Best NR hit (blastp)  cytochrome p450 CYP4S1 [Helicoverpa armigera] (8e-115)

60% to CYP4S17 Heliothis virescens

DUPLICATION OF THE ETAM EXON TO THE I-HELIX

ARE THERE TWO GENES?

MLWPIFFGLLVCVVVWRLIKQEPNDLAHLPGPPRTPIFGSVFMFLGKSHSELFKMLVELP
KKYGNRLVIKAMHRYILHVYKVEDIEIVLTHSRNIKKNKPYTFIEPWLGTGLLISNGSKW
QKRRKILTPTF
HFDILKGFVKVFEEQSRNLTTMLRKKLQESNVVDTMAIMSDFTLYIICE

TAMGIRLNADKSAEKMMYKKAIMEIGQIVMKRLTTVWLHSDLIFYNMPIGKKFTKCLENV
HSFADNVILERKKKYESVANEDGGRRRLAFLDLLLEAERNGEIDLEGVREEVNTFMFEGH
DTT
ATALAFGLVLLADSEEVQ

 

TAMGIRLNADKSAEKMMYKKAIMDIGQIVMKRLTTVWLHSDLIFYNMPIGKKFTKCLENV

HSFADNVILERKKKYESVANEDGGRRRLAFLDLLLEAERNGEIDLEGVREEVNTFMFEGH

DTTATALAFGLVLLADSEEVQ

 

ERLFEECQRVGPEPSVSE
LNDMKYLEAVVKEILRLYPSVPFIGREITEDFMLDDIKVKKGCEVVVHIYDVHRRPDLYP
DPVAFKPERFLDEEKRHPYSYVPFSAGPRNCIGQKFAKLQMKVVISEIVRNFKLSPLVAG
ARPDLKVDLVLRPAETIYVKFYPR
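
The duplicated ETAM-exon region noted for CYP4S DPGLEAN09010 above can be checked position by position. The sketch below compares the first line of each copy as listed in the record; the function name is an assumption, and the comparison is a plain equal-length scan, not an alignment.

```python
def substitutions(copy_a, copy_b):
    """List (position, residue_a, residue_b) where two equal-length copies differ.

    Positions are 1-based. No gaps are handled; this is not an alignment.
    """
    if len(copy_a) != len(copy_b):
        raise ValueError("copies must have the same length for this comparison")
    return [
        (pos, a, b)
        for pos, (a, b) in enumerate(zip(copy_a, copy_b), start=1)
        if a != b
    ]


# First line of each copy of the duplicated region, as listed in the record.
copy_1 = "TAMGIRLNADKSAEKMMYKKAIMEIGQIVMKRLTTVWLHSDLIFYNMPIGKKFTKCLENV"
copy_2 = "TAMGIRLNADKSAEKMMYKKAIMDIGQIVMKRLTTVWLHSDLIFYNMPIGKKFTKCLENV"

print(substitutions(copy_1, copy_2))
```

For these two lines the scan reports a single E/D difference at position 24, which is the only difference between the two copies of this stretch as listed.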

 

>CYP4S DPGLEAN13323

scaffold7535:- 1154-4055

Best NR hit (blastp)  cytochrome p450 CYP4S1 [Helicoverpa armigera] (4e-21) weak, short

71% to CYP4S9v2 Spodoptera litura gray region

MEASSSDYSDEEVGVLARGQKLWVRGVRSHTTCADIVRALQDAPED

 

DIKVKKGCEVVVHIYDVHRRPDLYPDPVAFKPERFLDEEKRHPYSYVPFSAGPRNCI

 

This part is not P450 seq

ETDATHRNNVKPIDATE
IIEAINKLRLDKCPGSDNSTNETLKVGHKTLALTLAELFNKILQNSETPLKWSESNIILK
YKKDDPKEMMNQLATESM

 

>CYP4AU DPGLEAN18469 join with DPGLEAN18470

scaffold604:- 77341-80254

Best Drosophila hit  cytochrome P450-4d1, isoform A (5e-33) weak, short

70% to CYP4AU2   Bombyx mori

DPGLEAN18470

scaffold604:- 73288-75386

Best Drosophila hit  cytochrome P450-4g1 (4e-30) weak, short

C-term

75% to CYP4AU2   Bombyx mori

MIIFLMVVFLFCMYWLFWMSNTRRIEKFTASLPMPPSLPVIGHAALFLGNTEKILKNLED
IASIALKNNGAVKLWLGPKLYIAIGNPEDAQLILENCLDKDVVYRFLRPWLGQGLFIAPL
RLWKMHRKILLPVFHNKVIEEYIGVISKQADVLTERLEEQSGKETFDVLSYISACTLDIV
FETSMGEKMDVQHWPDTPYLRARHTVMEILNKRLFKVWLQPDCLFKLTRYAKEQKKNIDL
THKFTDEVVQKRRLQFEAKEAIGINNTKDSGIKKII

(seq gap)

MQYLERVIKESMRLYTVVPIIARNIDRDIYLPYSKVTIPSGVGAVVGAFAIHRSVDLWGS
NANEFDPDRFLPERSKNRHPCSFIPFSHGSRNCIGRNFGMIIIKGIISNVIRSFRIQADE
VGPLKIEMLLFPIRGHQIKITKRMN

 

>CYP4AU DPGLEAN13009

scaffold4227:+ 1745-8625

Best NR hit (blastp)  PREDICTED: similar to cytochrome P450 [Nasonia vitripennis] (2e-97)

There may be a fusion of two genes here. There are two N-terms

 

72% to CYP4AU3

MIIFLMVVFLLCMYWLFWMSKTRRIEKFTA

SLPMPPCLPVIGHAALFLGNTETIG

NPEDAQLILENCLDKDVVYRFLRPWLGQGLFIAPLRLWKMHRKILLPVFHNKVIEEYIGV

ISKQADVLTERLEEQSGKETFD

 

72% to CYP4AU1   Helicoverpa armigera

SLPDTGCLPILGHTHWFIGGPEKILSNILELAQLVDDSGGIGKIWIGPSLYIVSF

NPEDVQKILEGSLQKDSSYRFLQDWLGNGLFVAPVDLWKIHRKVLLSVFHNRIIEDYVGV

FGEQGKVLVGRLEDQVGKKEFD

 

VFRYITSCMLDIVFETAMGEKMNVQYNPDTPYLRARNT

VISIIGMRLFKAWMQPNALFNLTSYSKTQKENIEATHKFTDEVVKKKR
ILFEAREKIVTRGRRDLLELLLDRDTKFTDEELREHIDSITIAGNDTTALVICYALLLLG
QHSEAQDKVYNELRDIFGDSLRSPTKEDLNKMEYLERVIKETMRLYTVVPIIGRETQKEI
KLSKCTVPAGVGCAVLLFVMHRSKRIWGPDADTFNPDRFLPENSAKRHPCSYIPFSYGNR
NCIGRHFGMLAMKSILANIIRSYKITSKRCERLKIEILLYPVPGHLISIEKR

 

>CYP4CG DPGLEAN12405

scaffold4379:+ 707-7950

Best NR hit (blastp)  cytochrome P450 4CG1 [Manduca sexta] (5e-153)

59% to CYP4CG4   Heliconius melpomene
missing the N-term

MSEKNLFHYLRTLHNTYGELNQVHALSVQAVNVFSPEDMEVILSSTKHNDKQLPYTFLKP
WLGEGLLTSNGLKWHQRRKLLTKAFHFNILKKYSATFTEQTQEFIKKVHEETKKSKTDVL
PLICSATLHIMCETAIPATRNEGIQTITQKYFKSIHTVGEAVVERMCRVWLYFDPFFKLT
KTAKEQETALKELHTFTNKIIADRKEFVKNFDVSKYIDSDEYDNSKGKLTMLDLLLENEK
TGNIDLESIREEVDTFMFEGHDTTAMALSYFIMAIANEPAIQRKIYEEMEQIFGDSKRLA
TMADLHEMRYLECCIKESLRLYPSVPFIARNLTQETVLSGYTVPANTFVHLFIYDLHRRP
DLFPDPERFIPERFLPQNSLNRHPYAYIPFSAGSRNCIGQKFAMLEMKTVLSSLIRQFHI
EPVTKPSELRFRTDLVLRTTHPIYVKFKNRE

 

>CYP4 DPGLEAN05230

scaffold4749:+ 3650-3990

Best NR hit (blastp)  cytochrome CYP4M5 [Bombyx mandarina] (5e-13) short

53% to CYP4M10v2   Helicoverpa armigera

N-term part

MRKIPGPKETFIFGHALKIMVSPVELFALGRQWSSTFKGIFRFYAFSVASVNIYNPEDVE
VMYLQLI

 

>CYP340N DPGLEAN03260

scaffold2253:- 14286-17303

Best NR hit (blastp)  cytochrome P450, family 4, subfamily V, polypeptide 2 [Danio rerio] (1e-48) short

61% to CYP340N1   Heliconius melpomene Hm1250201f
missing the N-term

MQAEVGRGWFDQSKYVKQNFMETICLTALDDCVTPEEANEYVEAFENYLNGNILRFQTFW
LHPDITFKFSKLKKKLDASIKVLHEMSDKVLKNKRALRKLNETESSAENSPKLKVFMDLL
MDLDGGVLTDQEIRDEMNTIIMAGHETSANVIVFALILIGSYPEVQEKLHEELQRVFGDS
DRDIEKQDLSQLIYMEAVLKETMRFFVMAPFVGRHIDREVKLKNCTLKPGNNCLILYYGL
HRHPIWGPDVNEFKPERWLDPATLPKNPNAFGGFSIGKRNCIGKTYAMMSMKSTLSYVFR
RFKMQADHTKLKFKLDVLLKPITGHYVTIQNRL

 

>CYP340 DPGLEAN17514

AGBW01010921.1 allele seems to be on AGBW01003303.1 46-48kb

scaffold7304:- 266-1790

48% to CYP340Q2 Heliconius melpomene CYP4 clan
used allele seq to fill in missing parts
revised seq complete

      MLLLILVPILAFLFWLWWKRETNENEPPEYVFHLEWRKLHNVVSDGE (1)

42410 RLWRRAKKMSAECLKRDGTMKMTIGPRTVY (1) 42499

 5063 VITDPDDFLTVSNACLEKDPFYNFCKNWLGNGLISAP (1) 4953

 4174 TSKWKVHRKLLNPLFSQRAVDSLMEVFNNISRQLVENLEVEVGKGPFDHYPYTRRNALDTIC (1) 3989

 1023 MTALGAKFMENTEVRNKYIHLTETLMKIYISRLKNFFYHYDFTYNFSSLKRKEDKCTDTVRTITNA (0) 826

      ILNQIKEGSKNEDKCSSTEKNQSQ (1)

  463 GLKFKPFIRVLLEMFSNSDAFTDSDIRQHVDTFVVAGEDTTAGVSMYCMLLVGSYSQVQDKIYEE (2) 269

47720 GDDERDVTKEDLSRLVYLEAVLKETMRFYPIVPIIGRHLDKDVKL (1) 47854

47992 RNCTLSKGRTAILSIYGIHRHPMWGPDADEFRPERWLSPLSNIQKMFAGFSMGRRICI (1) 48165

48241 GKTYAMASLKVILVHVFRNYIIHSDHTKCTFDLDITLKPTSGHHITIERRNKNKPSAES* 48420
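
The exon-by-exon layout used for the revised sequence above (optional start coordinate, peptide, a digit in parentheses, optional end coordinate) can be read mechanically. The sketch below is a hedged illustration: the regular expression and function names are assumptions, and the parenthesized digit is assumed to be an intron phase. The consistency check assumes the coordinates cover whole codons, with a trailing "*" counted as one codon.

```python
import re

# Assumed layout: [start] PEPTIDE[*] [(phase)] [end]
EXON_RE = re.compile(
    r"^\s*(?:(?P<start>\d+)\s+)?"
    r"(?P<pep>[A-Z]+\*?)\s*"
    r"(?:\((?P<phase>[012])\))?\s*"
    r"(?P<end>\d+)?\s*$"
)


def parse_exon_line(line):
    """Split one exon line into start, peptide, phase and end (None if absent)."""
    m = EXON_RE.match(line)
    if m is None:
        raise ValueError("line does not follow the assumed exon layout")

    def to_int(value):
        return int(value) if value is not None else None

    return {
        "start": to_int(m["start"]),
        "pep": m["pep"],
        "phase": to_int(m["phase"]),
        "end": to_int(m["end"]),
    }


def coords_match_peptide(exon):
    """True if the coordinate span equals 3 nt per residue; None if a coordinate is missing."""
    if exon["start"] is None or exon["end"] is None:
        return None
    span_nt = abs(exon["end"] - exon["start"]) + 1
    return span_nt == 3 * len(exon["pep"])


exon = parse_exon_line("42410 RLWRRAKKMSAECLKRDGTMKMTIGPRTVY (1) 42499")
print(exon["phase"], len(exon["pep"]), coords_match_peptide(exon))
```

Joining the "pep" fields of consecutive lines gives the assembled protein; lines without coordinates are still parsed but cannot be checked against a span.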

 

>CYP340 DPGLEAN02857

scaffold1880:- 40864-48367

Best Drosophila hit  Cyp4d21 (3e-31) short

49% to CYP340R4   Heliconius melpomene
missing the N-term

MFLLILLWTLVFLLWIWLKRDTNKDEPPTFFALDLRKLYRITEDTEGFEFETLAEIILKI
CENNSSQFTDQDIRQHIDTFIAAGEDTSAGFIMLCLVTVGSYPRVQEEIHKELKQVFGDE
DRDVTKEDLNKLVYLDAVIKETMRFYPMVPAIARYVDKNVKLRNCTLSKGRTVILSIYGI
HRHPMWGPDADEFRPERWLSQQTNYYKYFASFGFGRRVCIGKAMAIASLKVTFAHIFQNY
RIHGDHKNMKLKYEITLKAVSGQYITIERKNRKQ

 

>CYP340 DPGLEAN02858

scaffold1880:- 9890-17135

Best Drosophila hit  Cyp4d21 (5e-28) short

53% to CYP340R4   Heliconius melpomene

missing the N-term

MSRECQKLDGVMKVPLGPRTIYGFEFEPFAETILKISENNTQFTDQDIRQHVDTFIAAGE
DTS
AGVIMLCMITVGSYPRVQKEIHKELKQIFGDEDRDVTREDLSKLVYLEAVIKETMRF
YPIVPIVARDLDKDIKLSNCTLSKGRSAVLSIYGIHRHPIWGPDADEFRPERWLDLPLNY
QKYFAAFSLGRRICIGKTMAIASLKVTMAHIFRNYIIHGEHTNMKLKFELTLKAVSGHHI
SIGRRIKNKP

 

>CYP340 DPGLEAN01985

scaffold8727:- 129-683

Best Drosophila hit  Cyp313a4 (2e-09) short

63% to CYP340R5   Heliconius melpomene

MLLIGSYPRVQNKIIEELQTVFGNDDRDVTKEDLSKLVYLDAVIKESIRLYPTVALTGRD
IEEDLKLRKQQLHIIKRGFSVSLDICVIPPSSVGP

 

>CYP340 DPGLEAN04820

scaffold3990:- 6943-7952

Best Drosophila hit  Cyp4d21 (5e-23)

52% to CYP340R5   Heliconius melpomene

C-term

MLLIGSYPRVQNKIIEELQTVFGNDDRDVTKEDLSKLVYLDAVIKESIRLYPTVALTGRD
IEEDLKLRNYTLSKGASVYLSIYALYHHPQWGPDVEEFKPERWLDPSLLPSWANSIGFGV
GRRFCIGKTYALMSIKTTIVHVCRNFLIYGNHKNMKLKLDVLLKPASGHYITIKKRT

 

>CYP340 DPGLEAN20713

scaffold1696:+ 45914-48420

Best NR hit (blastp)  cytochrome P450 CYP366A1 [Bombyx mori] (2e-42) short

56% to CYP340R5   Heliconius melpomene

C-term part

MFVYQPCGQPAYGWLAYTKETLTHFALLIREVPGATPSPPRRSSSGPSPQACPSTRPSDC
FSQGCFLTWTHGRQSPRPPSSDSLHRLPDGRLGHHVPTPECLKFKPFIRVLLEMFSNSDA
FTDSDIRQHVDTFVVAGEDTTAGVSMYCMLLVGSYSQVQDKIYEEINEIFGDDERDVTKE
DLSRLVYLEAVLKETMRFYPIVPIIGRHLDKDVKLRNCTLSKGRTAILSIYGIHRHPMWG
PDADEFRPERWLSPLSNIQKMFAGFSMGRRICIGKTYAMASLKVILVHVFRNYIIHSDHT
KCTFDLDITLKPTSGHHITIERRNKNKPSAES

 

>CYP341A DPGLEAN05062

AGBW01002935.1 Danaus plexippus

scaffold1540:- 12338-13373

Best NR hit (blastp)  cytochrome P450 [Papilio xuthus] (2e-85) short

58% to CYP341A1 Bombyx mori
revised seq complete

20003  MIVLFLLCTLLVFTIVFRFRRRRLYELAALLPGTKSAHPVLGVIPLLIGNSE (1) 19848

19589  ATMTSLQDFSYEIMEPGIGKGWLNHILYI (1)  19503

18254  LVVNPEDIEMITKTCLEKDDLHRFFRRLLGYGGIFAP  (1) 18137

18066  VSIWRRRRKILVPTFSPKILKNFVDIFAQQSNKLADKLYDKTSKTPFSIWPFLVSYNLDSVG (1)  17881

17704  ESVMGVRLNSQNIHSYSPFLNSMTTILNITCERIFRLWWHPECLFRCFPIYSIHEKCIKL  17525

17524  LHSFVDK (0)  17504

13415  QVIETKAIELQKDKMSQQNECHHFDLQEFQKKSFLENLINLSGGAQSLTNDEL

REEILTLIVAATDTSAVAMGYTLILLGKYPKIQDKVYKELNEIFGNSNRPLESEDLNKLK

YLERVIKESLRLYPPVPFIIRKIEKEIKLPSGNCLPAGSGAVLSIWGVHRNPKCWGADAE

HFDPDRFLPERFKLVKSGSYLPFSIGPRNCLGYQYALMSIKTALSTILRNYKVLGEPESS

PIPRIRVKMEIMMKAVDGFNVTLQKRE

 

>CYP341A DPGLEAN04461 join with DPGLEAN04462

AGBW01013812.1 Danaus plexippus

scaffold982:- 107774-110082

Best NR hit (blastp)  cytochrome P450 [Papilio xuthus] (2e-58) short

62% to CYP341A2   Papilio xuthus

N-term

CYP341A DPGLEAN04462

scaffold982:- 106264-107728

Best Drosophila hit  cytochrome P450-4c3 (9e-48) short

70% to CYP341A1   Bombyx mori 
C-term part
The join is at the ETAM exon
Whole seq 65% to CYP341A1 Bombyx mori
Possible frameshift (&) before the ETAM exon in a run of 9 A bases,
or an insertion; the typical seq is not seen before the intron. Pseudogene?
Compare to DPGLEAN05062
complete

MILWYLLLALAFWTVAFKYKRRRMYKLAALVPGPKNEYPIIGVAHDLVGTTEDILRTLQR
YSYEAMEHNGTVRGWLNHILYFLVVHPTDLEMILKTCLEKDDLHRFIQKVIGYGGIFAPV
SIWRRRRKILLPTFSPKILETFVEVFSEQSEKLVSRLGERAPQTSFSIWPFVSSYTLDSK & KNR (1)

ETAMGVKINAQSDPTSSPFLASMTSLLNIVCARIFHLWLQPDWLFKLFPQFNEHEKCIKTLHD
FTDEVIEKKRMELQEDKTSQTEVDHHLDLQDYQRKSFLDLLIKLSGGEKGYTNVELREEV
MTLTIAGTDTSAVAIGFTLILLGKYPKIQDKVYEELYGVFGDSKRPLVKEDLLKLKYLER
VVKESLRLFPPVPFIIRKIDKEIELPTGKRLPAGAGAVISIWGCHRNPEFWGPDAECFDP
DRFLPERFDLVKPGSYLPFSNGPRNCLGYQYALMSIKTALCAILRNYKILGEPEATPIPH
IRVKLDVMMKAVDGYQVCLEKRKSAA

 

>CYP366 DPGLEAN13663

scaffold10970:+ 353-1106

Best NR hit (blastp)  cytochrome P450 CYP366A1 [Bombyx mori] (1e-35) short

53% to CYP366A1 Bombyx mori

MVYREAVIYETLRMFPPIPTVMRYADRDLKLDSYTIPRGTTVVLNAWGSGRSESLWGKQA
SHFRPERWLYGEPCMSSCLAFSVSRRSCIGRSYAKAILKTMLAHCVREFRFNSDAKNLSI
KMDLVLRPISGHLLKVELRNDSH

 

>CYP367A DPGLEAN15541

scaffold1:+ 457859-463411

Best Drosophila hit  cytochrome P450-4d1, isoform A (1e-43)

62% to CYP367A1   Bombyx mori CYP4 clan
complete

MEVTYIVIIIIILPVTLYWFYWRWTHRRMLDLAKSVPGPPALPLVGNALLFMVNPKEQLK
IVNQLLNKYGDYVKFWLGPDLNICVKNPADIRFLLTSNKVTQKGPVYEFFKSFVGHGILS
GGKHWKAHRKIVMPSYNKKAVNLFSTVFNKEAEDLAKVLSQKDPHQTFNVYFDVVHCTTQ
SVCQTLMGLTKEESLNVARLREVMLETHNMYQLIHLKMTRWWLHIPIIYYLSGRKRIENK
YIKMTEDLSSDILQKRKNALKHEVTDENSMNAVDRLILEGLDEKEIKLEVFTLFTTSQEA
SAKIVAGVLLFLAHLPEWQEKVYDEILATVGFTAEVTDEHLKNLHYLDMVYKEVLRYLAI
GAMIQRSVEKEITINNGKITLPVKTSLVIPIHELHRDSRYWDEPNKVKPERFMPENVKKR
DPNAFVPFSLGPMDCLGRVYATKLIKTIVVQVIRQLKLEADGTLEELELDIAISVKFAKG
YNIRAKKRNNDATSA

 

>CYP367B DPGLEAN15542

AGBW01000003.1

scaffold1:+ 469771-473311

PREDICTED: similar to cytochrome P450 CYP4C39 [Nasonia vitripennis] (1e-70)

55% to CYP367B2 Helicoverpa armigera

revised sequence complete

5509  MFWTVTLTITALMVLWLRWRYRNRKLIK

MADKIPGPPSLPILGNALKFMVHNKELKILIKNLMNQYGEVVKFWLGMDLNILVSNPDDL
KLLLTNNKTSVKGEQYKYFKNYIGAGLLSGSGPEWRKHRKIATPNFGKQATGSYVHVFNM
EADLLLENLKETSSGDQIDVYQYIVMSTSYAVCQTLMGLTRKETLGLPHLQYVIEESPLV
YTIIFERMTKWFMQIDPVYWLTKSYQVEKEFVSRTNELSGAILKLRINKLTDIDENKLKL
LDTEKDSFNNTELSVVDRFILSRELDATDLKTEIFTIFTASQEATAKVAASLLLMLAFHP
ECQKKVYAEITSVIRNKNKHITEEDLKQMPYLEMIFKEVLRLFPTGVMLQRKINKDISIS
SCTLPAGSSLVIPLYHMHRDSRFWENPESFDPERFSAENMKKRNAYCYFPFSLGPMDCLG
RFFGTSIIKIIVIKIIQEYYLTSTYKYSDLQVGFSISVTSSIGYPVILTPRN

 

>CYP367B DPGLEAN15543

AGBW01000003.1

scaffold1:+ 475343-481009

Best Drosophila hit  cytochrome P450-4c3 (3e-39)

56% to CYP367B2 Helicoverpa armigera

revised sequence complete

MLLAVVLLALVVLGVWVHWRYKNRRLLEMSTKIPGPPTIPILGNAFYFMCRPEEMIKITK
QLIDEYGLVLRFWLGTDLNIVVSNPDDIK (0)

VLLTNNKVSVKGPQYKYMSDLIGVGILSGS (1)

14219 GSKWRKHRKLVTPNYGKRAVESYNEVCYRETKILIDNLNKVHQAGVLDIYK

HIVKTTSYIVCRES (1) 14413

ESLIGLTREETLRIPCLQNVIDESPRLYDFIFDRMTKWYLQIDPVYWLTESYKIQKKIMRD
ISEMNMFIINNRKEALTDINEDYLELLNSEQDSVKNTKLSVIDRLILSQELNHKELIEET
FTIFTSSQEATAKITSFLLLMMAYHPKCQDELYSEILNVIGNNYGPITDDYLKHMPYLEK
CVKEVLRLYPIGVMLQRTVKEDVEISTCTLPAGSSLVVPIFNLHRDPRFWEDPEAFDPER
FSTENMKKRNPFCYIPFSLGPMDCLGRFVAAKFIKTIAIMVLHEFRLSSVNDYKDLNVVM
AISAKSANGYPVILTPRKQCN

 

>CYP405A.a DPGLEAN11628

AGBW01005413.1 Danaus plexippus

scaffold2846:- 23471-27195

Best NR hit (blastp)  cytochrome P450 CYP379A2 [Zygaena filipendulae] (2e-30) short

54% to CYP405A2 Zygaena filipendulae

66% to CYP405A1 Epiphyas postvittana over 391 aa at C-term

revised sequence complete

14108  MILFIFLVIVILFHEYQRKSSYRWKKLSEFPGDAPLPFFGNGLQLGFDAD (1)

       EASPKLMEMWNRHGKQNFRLTIGSEDWIMLSDPDDVG (0)

13443  TILNHPSELSKPLERNAAMKPFFGNSISSSE (1)

12123  GEKWKSTRKLMTPSFHFKTLEKRVEDVNKHCDRLFKILDTFNDKSTINLYTYLRPYMLDILC (1) 11938

11867  STLMGVDSNFLGNINHPYLEASGR (2)

       TIKIITQNYFSYWRN  11682

11681  ISKIFELSPQYRVMIKTVKALRDTSAT (0)  11601

11408  ILKDRKAIFNTMREEIAANNKIADMNIKTLLHNKVSDSGCLLDKFLLVELPNGDPIPDDI  11229

11228  INEEITLLCYT  (0) 11196

11119  GHYTTTMTICHTLYCIAKYPDIQNRIIEEQRSIFKNNFFKCPTNQDLND

       MKYLEAVLKESVRVIPTVTKIGRQLHEDLKFK (1)

       DGRIAPAGSSVVVFFEAMYQNPKIFPEPEKYDPER (2)

       FFNNMHTFAFVPFSAGPRNCI (1)

10157  GFRYAWVAMKATLSNMLRRYEVFPCDPADEPQFAHRIITESKNGINIRLKK  10005

       EINSKKSNTVFIIY* 9960

 

>CYP405A.b DPGLEAN11918

AGBW01013686.1 Danaus plexippus

scaffold971:- 8524-9686

Best NR hit (blastx)  cytochrome P450 CYP379A2 [Zygaena filipendulae] (6e-14) short

73% to CYP405A.a monarch
54% to CYP405A2 Zygaena filipendulae
59% to CYP405A1 Epiphyas postvittana at C-term

revised sequence complete

13896  MIFLLIVSFLIIFVIGREYQRKSSLRWKKLQAFPGDKPLPLIGNSLQLGFDSD (1)
       EVSQKLMAMWERHGKQNFRVSVGSEDWIMISDPDDVG (0)
11765  ALLNHPSELSKPLERNTAIKPFFGNSVSSSD (1) 11673
11011  GERWKSTRKLMTPSFHFKTLEKRVMDINNHCDNLFNILDVFQGRSSVNLYTYLRPYMFDILC (1)
       STLMGVDSNFLNNLDHPYLEANG 10353
10352  RVVNIITQNYFSYWRNIPTFFKISPQYRRMMKTVKTIIDTSTA (0) 10224
       ILNQRKKILNKIKEAMDPDHKSSDIEIVTLLNKKADDDGCMLDKFLLNKLSNG
       NPIPDDIIEEEISLICFT (0) 9902
9797   GHYTTTLTICHTLYYVAKYPEIQNRIIEEQQCIFKNDMSKKPTHKNLNDMKYLEAVIK  9624
       ESMRVLPPVTKIGRQLKKDFRFD (1) 9555
9368   DGRIAPAGSSVIVFFEAMYQNPNIFPEPEKFDPERFFNSMHLFSFVPFSAGPRNCL (1) 9201
8691   GFRYAWVAMKATLSNMLRRYEVVPCDPADEPQFAYRILTESKNGIHLKLKKRYFN*  8524

 

>CYP421 DPGLEAN05022

scaffold5975:+ 3055-9597

Best Drosophila hit  Cyp4ac1 (3e-44)

46% to CYP421A1   Heliconius melpomene CYP4 clan
complete

MIIYLGLLFGCLGVYWTYVKLSSYNNKLIPKIPKKFNLNNFRVCFKELSKWLIYAGEYSN
DVGCVSKFNYGPVVVYVVSNPEDASTVLTRCTSKSFVYDLIKPYVGSQLIASDYPVWKRN
RRLLDLAFKQNILNGYVAMFNKRADALVIDMSKEINKEVDFTHLFSRYLLGTACYTTLGV
NVNDQEVSIDSYLKSVDKLIKIMLNRFVQPWLLLDFVFKFSELKKEQDVALEIVQNFSEE
VIRKKKMQFALNSDNPGDVDEDNAESLLDILIKNRNEDEISDRNMREIVDNVILAAYDSN
VYMMLYILVCIGSYPEVQKKVYDEVISVTGQTDRDITHEDLPKLVYLEAVVKEAIRLYPA
GPIVGRVTTFDTQLKEYVLPAGCQVIVHLGAINRNKEHWGQDADDFRPERWFDSLPKHPA
AFSSFSPGRRSCIGKSYSIMLLKTMAVKIIKKFNISSDDKKLDFEFGLFLKPIAGHHIKL
ELR

 

Mito clan (12 sequences, 11 complete)

 

>CYP49A DPGLEAN11434

scaffold427:+ 11619-15371

Best Drosophila hit  Cyp49a1, isoform D (2e-95)

78% to CYP49A1 Bombyx mori 

missing the N-term. The yellow region does not match Heliconius, but

it is supported by EST EY259541.1 at 98%. Retained intron?

Most CYP49s have either 22 or 13 aa between PERWL and PFGFG

This one has 50 aa

revised seq

MGVARKFATIPGPRPLPILGNSWRFAIGQRPWRTRSLDATLWNLRALAGMGGAAKVAKLF
GHPDLIFPFCADETAKIYRREDTMPHRAVAPCLRHYKQELRKEFFGDEPGLIGVHGTAWS
TFRSKVSKALAAPQAAQVAVPSLDCVSNDFVHRMESILDHNRELPCDFLTELYKWALESV
GAWALGTRLGCLKDNDTDAMEIINNIHGFFHSVPELEL

SAPIWRLYRTRAYKTYIEALDSFR
NLCLKRLTDKGVCAQIALNSGE
KVATILALDLLLVGVDTTAAAAASTMYLLATNPRAQRRLQTELDINMPTDRSMNHRDLNN
LPYLKACIKEALRIKPVILGNGRCIQSDAVIAGYEVPKGSHIVFPHYVMSNEERYFPSPN
EYIPERWLRD

DTNKAGTVIPNISNEKHIEAARSVCEHAGVASVVK

KQRDIGIHPFASLPF
GFGRRMCIGKRFAEAELQLLIARAFQKYNVSWYHGELTYSVTPTYIPNEPLRFRLDSRTK
KLT
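
The spacing note for CYP49A DPGLEAN11434 above (50 aa between PERWL and PFGFG) can be reproduced by counting the residues between the two motifs. This is a minimal sketch; the function name is an assumption, and the fragment is the C-terminal stretch of the revised sequence with its line breaks removed.

```python
def residues_between(seq, left_motif, right_motif):
    """Count residues between the end of left_motif and the start of right_motif.

    Whitespace in seq is ignored. Returns None if either motif is not found.
    """
    seq = "".join(seq.split()).upper()
    left = seq.find(left_motif)
    if left == -1:
        return None
    gap_start = left + len(left_motif)
    right = seq.find(right_motif, gap_start)
    if right == -1:
        return None
    return right - gap_start


# C-terminal stretch of the revised CYP49A sequence, pieces joined in order.
cyp49a_cterm = (
    "EYIPERWLRD"
    "DTNKAGTVIPNISNEKHIEAARSVCEHAGVASVVK"
    "KQRDIGIHPFASLPF"
    "GFGRRMCIGK"
)

print(residues_between(cyp49a_cterm, "PERWL", "PFGFG"))
```

For this fragment the count is 50, in line with the note. The same call on another CYP49 sequence would show whether it falls in the 22 or 13 aa group.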

 

>CYP301A1 DPGLEAN14479

scaffold1074:+ 11966-22018

Best Drosophila hit  Cyp301a1 (0.0)

88% to CYP301A1   Bombyx mori

MGRSLHIFLKPFAPKYVRTATTGCPFSKRQRSQIAPTAALNEEIFANAKPYSEVPGPRPI
PLLGNTWRMVPVIGQYDISEFAKVTKLFLEKYGRIVRLGGLIGRPDLLFVYDADEIERMY
RREGPTPFRPAMPCLVKYKSEVRKDFFGELPGVVGVHGEQWRRFRSKVQRPILQPQTVKK
YVTPIEMVTEDFIKYMEKARDDNKDLPHEFDNDIHRWSLECIGRVALDVRLGCLSPDASS
EEPQRIIDAAKFALRNVAVLELKAPYWRYIPTPLWTKYVNNMNFFVELCSKYINEALERL
KSKQVTSENDLSLLERVLQSEGDPKIATIMALDLILVGIDTISMAVCSILYQAATRLKQQ
EKMAEEIRRVLPDPDKPLTYSDLDKLHYTKAFVREVFRMYSTVIGNGRTLQEDDVICGYH
IPKGVQVVFPTIVTGNMEQFVSNPQEFRPERWLEADGRLHSFASLPYGFGARICLGRRFA
DLEIQVLLAKLLRRYRLEYHHEPLEYAVTFMYAPDGPLRLRMIER

 

>CYP301B1 DPGLEAN14480

scaffold1074:+ 29067-40901

Best NR hit (blastp)  cytochrome P450 301B1 [Tribolium castaneum] (1e-161)

80% to CYP301B1   Heliconius melpomene

MRHSLNIVRWPLRVCKRRLASQQRAVATDGETGKISHLQICPAKGHRARSTQAVAPLESI
ASVKPWEEVPGPRPLPLLGNTWRFIPYIGGYSVEHVDKVCLSLRQEYGDCVKMEGLLGRP
DMLFVFDASEVERVFRGEDSAPHRPSMPSLNYYKHTLRKDFFGAEKDCAGVIAVHGDSWS
AFRTKVSRVALSAGAAAQYTDQVGEVADAFVKRLRKIRNERKETPDDFLNEVHKWSLESL
GLIALDTRLGCLEQHEGSETQQLIDAVNTFFLCVGELELKAPWWRLYPTAMFKRYVAALD
TILSVTQSHVSRALKECQANPNGSKSLLQDLVSAAGPRVAAVAALDMFLVGIDTTSNAVA
STLYQLSLRPDVQEKLYKEISGVLQGRPIRPGDVNKMPYLKACIKEVLRMYPVVIGNGRQ
LSKDTVICGYNIPKGTQVIFQHYVMGNSDDYFTNASQFCPERWLQRSIYKHHPFASLPFG
FGKRMCLGRRFAELEINIIICKMVQSFQMEYHHEPLEYHVHPMYTPNGPIRLKLIDR

 

>CYP302A DPGLEAN03864

AGBW01007575.1

scaffold446:- 7211-11161

Best NR hit (blastp)  cytochrome P450 CYP302A1 [Manduca sexta] (2e-180) fusion?

Top part not P450 sequence

tRNA threonylcarbamoyladenosine biosynthesis protein

MSDVSAIAVTVEPGLPLSLVIGMKYAKHLARKYKKPIIPIHHMEAHALIARMVHDISFPF
LTLLISGGHCLLAVAQDVNEFKLLGQSLDSAPGEVFDKAARRLKLRNIPELSQMSGGQAI
ETAASKATNHNIFKFPLPLAETKDCNFSFNGFKSTALYHILKKEREHNIEGDQVIPEVND
LCLAMLMGVTRHLLHRTQRAMEFCEINNLLPENKKQLVVSGGVACNNYIFKALSILCQEF
DYEIYRPPPKLCTDNGSMIAWNGLEKYKKQIDIVTDLNQININPSSPLGES

 

Revised seq complete
65% to CYP302A1 Manduca sexta

9928 MITATAIFKFKRVKTLWSFNKYFSVNNFKHQFQ

9829 TDHKNVKDFYDIPGPKSYPVVGTLYKYFPFF (1) 9737

GDYDAEELHHCAWVNWC

KYGDIVREVPGINLVHVYDPEVIEDVFRQKEKYPARRSHIAMLHYRLSKPNVY
NTGGLLSTNGPDWWRIRSAFQKNFSSPQNAKQYVDITDNIAYNLVQIIKTNKITHNEDFL
DYLNRFFLDVIGAIAFDKNFDSFSESEQHPDSCSSKIIKAAFGSNSGIMKLDKGILWRFC
KTPLYRQLEKSQEYLEKISTEILLNKIQFYKKDDSTDRSLLASFVKLSNIDLKDIVGVMV
DILMAAIDTTSYTTSFALYNLAQNKTCQEKLYNEVLTLLPSTDSKITADVLAKAVYLRSC
VKESLRLNPVSIGVGRVLQNDVILKGYLVPKGTVIVTQNMIASRLPQYLKDPSEFKPERW
IRNSPEYEHIHPFLSLPFGFGSRACIARHLAEQNISITLMRLIKEFHIKWMGGKLGVKTY
LINKPDQPISLQFTTRSS

 

>CYP314A1 DPGLEAN14158

scaffold848:+ 77093-93262

Best NR hit (blastp)  cytochrome P450 CYP314A1 [Manduca sexta] (0.0) shade

79% to CYP314A1     Manduca sexta

MCSVIVTHIRCGETIEPGGLREAKRKQSPHSPSAPPPLVDWSGVPTLILALVALVMAVTA
LLTKTVDGKRPTRLPGPPALPFLGSRWLFWSRYKMNKLHEAYEDMFRRYGPVFAETTPGG
ALLVSIADRTALEAVLKTPARRPYRPPTEIVQVYRRSRPDRYASTGIVNEQGEKWHHLRR
HLTSELTSPQTIHGFVPELNNICDDFINLLRNSRRPDGSVLGFDQLTNRVGLESVCGLML
GTRLGFLERWMSGRAAALAAAVKAHFRAQRDSYYGAPLWKFAPTSVYKTFVRSEETIHMI
VSELMEEARCRSRGAAQDDPLQEIFLKILANPELDMRDKKAAIIDFITAGIDTLANSLVF
LLYLLSGRVDWQQRIRSELPSCGELRIEELSSAPSVRAAINEAFRLLPTAPFLARLLDAP
MTIGEYRLPAGTFVLAHTGAACRREENFHRAGEYLPERWVEPREPHAPGLLAPFGRGRRM
CPGKRFVEMQLHLILAKIIQHWRVEFDGELDIQFDFLLSPKSPVSLRLVEW

 

>CYP315A1 DPGLEAN16126

scaffold11654:+ 945-5016

Best NR hit (blastp)  cytochrome P450 CYP315A1 [Manduca sexta] (4e-168)

62% to CYP315A1 Manduca sexta
missing the N-term
revised seq
855 MFRLKKISQIKIVKQKSYILPVKNYLTISD

MPKPKSLPIIGTKLEFLAAGGANKLHKYVNFRHKNLGPIFKESLGGNTDLVFLSDPELIK
TLFMNLEGKYPIHILPEPWVLYEKIYGAQRGLFFMNGEEWLNNRRILNKILLRESDDWIA
DSIEKTIRDTMQTWKLNLEKGCRFPNIESELYRLSSNVIINILLGTHSLELSDHYNEMLS
LFSDSMKKIFQTTTKLYSIPVNWCQKLNLKVWRDFKESVDLTLFLGRKITREMMFNKNKS
DGLLKRMTEENMSPEIITRIVSDLIIAAGDTTTYTALWTLLLLTRNEDTLKESRKGDQKY
IKYIVKESMRLYPVAPFLTRILPQETILGDYKLSKGTPIIASIYTTGRDKQNFSEPNSFL
PYRWDKTDPRKKDLINHVPPATLPFALGSRSCIGKKIAMKQLSEFISQITYNFDLKCNNN
QQIKSVTSQILIPDQNIDFSLSVRKQ

 

>CYP333A DPGLEAN06270

scaffold485:+ 46243-48693

Best NR hit (blastp)  cytochrome P450 333A3 [Manduca sexta] (4e-172)

60% to CYP333A1 Helicoverpa armigera

MGVLFRNLMFKNHRSWSSLRCYSSEVKPFTSVPGLSSLPILGPIHHFLPVIGSVGPRPNF
YDLAKVLHEKFGSIVKLDGIFSRSSMLILYDPKDFDQVYRSEEPIPSRPGFDTLNYYRKN
LEKSKYEGIHSLTTSEGVHWRDTRTKVNPVLLKPKLVKLYSPLLEEIADEMVQRIKNLDK
DARYMQENFDFEITKWSLESVAVIALGARLGCFQNDLPESHPARILIECAKDIINLSWKL
EFRPSMWKYIATKNFKKIIKAFDLQWDTSVYFIREAEKKIKERGHDVPEEDKSIIEKLLS
IDEKVALAMGNEMLLAGIDTVSFTTIGLLYYLAKYPEVQEKIRKEINSSEPSKRYLKACL
KESLRLHAVVPANLRRTTREHVVAGYLIPKGIDVIAPNEYLSKLDEHYPRAKEFIPERWL
VEKSDPLYYGNCHPMITLPFGFGVRSCIGRRIAEMEIEILVTKLLKNMKISWNGPPIQVV
TRLMNSFRKPFYFKFESIS

 

>CYP333B DPGLEAN14757

scaffold2950:- 49-9286

Best NR hit (blastp)  cytochrome P450 333B11 [Manduca sexta] (1e-152)

Gene model has some duplicated seq in it but contig does not

56% to CYP333B11   Manduca sexta
revised seq
9286 MNTIRRSVINIVVVNQRCL
9062 SFRFTSTNTIKDSEEGNNVKSWKEIPGPSTLPIIGQLHNFLPG
      GSLYGRFDEKLPGYLYETYGPIVRLEGNFGSPPMIFLYDAEPAMH
8610 IFRNENILPLRPGFQSLEYFRKTYNKKSDAVESTGLLTE
8350  HEEAWKKFRSAVNPVMLQPKTIKLYTGILAEVADDMISR
8150 MKNLRNKDNIIESELNMEMNLWSLESIGLVALGRRLNCFDPNLPEDSPVKELIKV
VHEIFYCMDKLDFQPSLWRYFSTPAFKRAMKHYEDQIR

LSEYFIEMAIKELKEEKKYQNG
EKGILEKLLEIDQKVAVIMASDMLLAGVDT

5028 TATTIIALLYLLANNPEKQKKLRNEIMTAKDKRPYLRACIKESLRLLPVVGGNFRQTTKDYDVLGYRIPKK
4663 MFVVVANQYMSQMEKYYPKPHEFIPERWIVDKNDPLYYGNTHPFVHTPFGYGVRSCI
4281 GRRIAELEIETFAAKLIQNFDVRWFGPPIKVRATTLNYFVGPYNFIFNNV* 4129

 

 

 

 

extra heme region exon

31899 GRRIAELETETFLSKVIENFHVEWFGPPLR 31810

 

 

This part is not P450 seq

GQTSRRPTWWRCRPWTSTKEFCGCVATLVWERERKRWEKGKGGKRERATGSLTHRTKT
TSRRFSVRGWCFPGRASPYSNCRDENFSAFHGFTVRVLNPSSSKKRSFNSVWDCDASRCR
ERSFNSAWDLRPRCRLKGPQLTRVNRSPPPSKLPSLLNQIQKEPTRADFEQVIGPSLLVA
CRKIKPLYLSFPLLPRTNPRRSFFQ

 

>CYP333B DPGLEAN04903

scaffold5556:+ 1609-4922

Best NR hit (blastp)  cytochrome P450 333B11 [Manduca sexta] (4e-166)

55% to CYP333B11   Manduca sexta

MNKLYMKNFIFATEINIRLAHSSSCRKIKSWKEIPGPSSLPIIGQLHHYFPGGSLYQCNG
FEFQEKLYKNYGPLVRLNTVYAGKPAIFVFDPDSMAQVLRGENWLPIRPGFDVLYHYRNF
YNQPKGGAPGLTGLISDHGQKWKQLRSLVNPIIMHPDNIKLYDTPIGEVAQDVVQRIKDL
RDEDGMITKNFDYLMYLWALESVGVVALGSRLNTFNENLESDSVVRRLITLIHEFFAISE
NLDIKPSLWRYYPTPAFKRAMKVFCDIDSITRSLVLKAKDELSQRGHSADDKKGVLEKLL
EVDEKIALIMAGDLLFTGVDTVGNTMSCTLYLLASHPEKQNTLRQEVNSGDERKSYLKAC
IKESLRVMPVAGGNIRQCTKEYNLLGYEIPKDMFVVFPHQYLSKMESQYPRANEFIPERW
LVDKDHALYHGNAHPFAYNPFGFGARICIGRRIAELELESLLSKIIQNFELEWRGPPPTM
YQSAMNYFKGPFNFVFKDIK

 

>CYP333B DPGLEAN12123

scaffold5050:+ 6034-7716

Best NR hit (blastp)  cytochrome P450 333B11 [Manduca sexta] (5e-28) short

56% to CYP333B3 Helicoverpa armigera

N-term, runs off the end of the scaffold at 8134

MQVFKRSTVCLAPINKNISRSVSVNKVSVNVDATEIKLKSWREIPGPPTLPLIGQLHHYF
PGGMLEDVSNLGDVLYEKYGPIAKLDGVFGGEPVIFLFDAEAASQVLWGENRMPYRPSFI
SLEYYRKVYSNKGKTDIVTGL 

ISSHGKAWKKFRSTVNPIMLQPKTIILYSNTLTEVAEDMI 8133

 

>CYP333? DPGLEAN17012

scaffold335:+ 5982-17811

Best NR hit (blastp)  cytochrome P450 333B11 [Manduca sexta] (9e-35) mito

55% to CYP333-un1 Bombyx mori
may be a functional version of Bombyx pseudogene

MSKTVLARQCFFSRPRRQFSTSSVKRTSTSPQRYNMSEPAVRSGNVKKFSDIPGPLALPI
MRHHAHVLPRIGCFHHTVGLGLLEGLQQRYGDLVRFAKATRSRPVLYVFDPELMKEVYDS
NMTVSPQWSQSPLNEHRKNADTKCPMQSDESKAVWTALRTLLQDDSFLRNFDKAFDDIAV
DVTRRLEGLRHAGNALNEELQTEIYRWAIEIIGVLIFGMRLGCLDGNVHVPTEENRNPEK
TSMDDHIKDLCSLTKKSERDLSPAERFVRCSLDVANENYLVRSEDTLRQESETFNEALKT
FDKHYSLTDHFLVRALENLNNEELKAEQVLLNKLRPLERRILPLAADAFLAGVDPVIGAD
GDQHAVRAVAAATGGVVRRSVGELSVAGYEVPEGVDIVLAHGVSSKLEKEWGRAKSFIPE
RWHSDGWRPLNASRAHRCASMPFGQTCPAAGVVNKMLSSLAIRVLDTYRIEWHGTAPRVT
AAGVNKIQRPYYFVLQNAG

 

>CYP339A DPGLEAN11745

scaffold2021:- 10056-13482

Best NR hit (blastp)  cytochrome P450 CYP339A1 [Bombyx mori] (0.0) mito

62% to CYP339A1   Bombyx mori

MQTSKSIELNTNALQKKVLSEEIAPTSNIITRALFTNRKSYWNLSTLFDEYGPVVRFVSP
VGSDIVLINHPDLIQKVFSLEGDCPVRSNLESLERYRTEHKNHAYGGLYTVHGQEWARQR
SIMNPKTKELLSHHVHGLYDITEKFTQKIYNVRNYQDEMSKDLYKELHKWAFDSMGLITF
AKNFCLLDTELIYNQCDASWMYHSLEHATDAIIKCESGLHFWKFFTTPAWYTLVKYCDNL
DSLIGKFVLEIENDICTKSQQENISPNSLTSAMVLNDEKFKTEDIATILMDMLLIGVNTI
SSSISFLLYNIAKHQKAQKQLYQEVGNLYKDNMTVDINNLKKNTPYLQACIKESLRLVPP
IPLLTRVLSKNITLDQYNIPRGTLIIMSTKDCSMKESNFDEADKFMPERWLREDAKDYHA
FASIPFGFGARRCIGQNIAETMISLITIKVLQKFKLEYHYGDIQATKSFIAKPNKPLKIR
FVDRI