Below is a P450 collection from Phyophthora ramorum

 

P450s found in the genome blast server at JGI

 

D. Nelson July 27, 2004

 

A tree of 65 Stramenopile or Chromista P450 sequences

 

 

>CYP5014B1 scaffold 20 seq d minus strand 46% to seq 20 a

MSSVDPTSALL

YAASCSLALLVGAKVLFPEPKRAARHPDSLPLLGETWAALKYADEYYDWEAAMTEKMEGR

PWLFDVVGRPSEFVIGKPEIIEDVLRTHAESFGKGEYVHEVLSGLLGDGIIAVDGHKWAR

QRKTASNLFSLRELRESMATVVQDNVLTLNGIFQHAMDRGESLDLFQLLNRFTFEVIAEI

AFGIKFGGLATGSKHPLEAAFNCAQQRMFMRFLEPTWWWKLQRWLNVGPEGAFKKQVQII

DETCYSIISRSMKERKAKRSSHDADTLEGSASTQRQKSNIISLFLDGVSDDEAKAGDGLD

PKFLRDIVVTFMTAGRDTTASALSWFFYTLSQHPQTEEKIRQEMASKLPELANGAVSSPS

MLQANELVYLEAALKETLRLYPAVPTNIREALEDVVLCDGTVVKAGETVSWSTYALGRMP

HVWGEDAKEFKPERWVDADGKLIAVSPFKYPLFSAGPRVCLGGKLALMEMKITAASVLSK

YHFTIVPGQNVTYRIGLTFGMKNGLHVKVSEAAPAI*

 

&&&&&&&&&&&&&&&&&

 

>CYP5014C1 scaffold 20 seq c plus strand 54% to scaf 20 b +

MLSIASLKLEHPLYHALTVTSIVLLPIVFQL

SRGSSSSSTDDEERADSELERRDAD RPPWTLPVLHNTLNWILGGDGIHEWITRNCERFKG

RPFTVKALGLPEMLVVSTPEAFEDVLKNQFMNFPKGPHVKENLQDLLGDGIFAADGVKWA

HQRDVARGLFRMRELRDCMTEAITRHTKALHDVLGKVCARNRSVDLHKLLSCFSTEAFAE

ISFGMKMGCLRANKELPFQAAFDSAQRLTAQRFVRPRWFWKMQRRLGLGAEDQLQLDIKE

IDAAVLNIVQRVLSQRALVPDDGAPKSTNMLSLFLDTIAKSPKAEEQLYDPAYLRDVVVN

FLVAGRDTTAQALSWFFYNVSQNPHVEAKLRREIYKKLPELVNSEVCVPTLQQVNRLVYL

EAVMKETLRLYPPVPMSPKYAVRDAMLSDGTFVAAGSMVCLPMYAMGRMPHVWGPDAAEF

NPERWIDPATKKIVSVSAFKFVAFNAGPRMCLGTTLAGLELKLVAASLLSRFHIHVENPE

DVTHEFSLTLPVKGPMNVRLARVQAAVA*

 

>scaffold 86 seq a 86112 = cyan

KIADES MKKNRPQNSDVEAAIRDEIAEKL

PKGGRNATTSATATMQDVSQLVYLEAALKETLRLHPPVPMAPKYVVEDTTLSDGTFVKGK

SMIVLATYVMARMQEVWGPDAEVFKPERWIDPTTGKLIPVSAYKFASFNAGPRMCLGMNL

AMLEMKLVVAGLLSKFHVEVLNPEDVTYELSLTLPVKGPLNVKVSPAKLPTSPDFA*

 

>scaffold 86 seq b 76832 = cyan 86113 starts at MGCL = scaf 20 c +

MRELRDCMTEAITRHTKALHDVLGKVCARNRSVDLHKLLSCFSTEA

FAEISFGMKMGCLRANKELPFQAAFDSAQRLTAQRFVRPRWFWKMQRRLGLGAEDQLQLD

IKEIDAAVLNIVQRVLSQRALVPDDGAPKSTNMLSLFLDTIAKSPKAEEQLYDPAYLRDV

VVNFLVAGRDTTAQALSWFFYNVSQNPHVEAKLRREIYKKLPELVNSEVCVPTLQQVNRL

VYLEAVMKETLRLYPPVPMSPKYAVRDAMLSDGTFVAAGSMVCLPMYAMGRMPHVWGPDA

AEFNPERWIDPATKKIVSVSAFKFVAFNAGPRMCLGTTLAGLELKLVAASLLSRFHIHVE

NPEDVTHAFSLTLPVKGPMNVRLARVQAAVA*

 

>scaf_2029 C-term frag = scaf 86 b/ scaffold 20 c

PMYAMGRMPHVWGPDAAEFNPERWIDPATKKIVSVSAFKFVAFNAGPRMCLGTTLAGLEL

KLVAASLLSRFHIHVENPEDVTHAFSLTLPVKGPMNVRLARVQAAVA*

 

&&&&&&&&&&&&&&&&&&&

 

>CYP5014D1 scaffold 20 seq e minus strand 58% to seq 20 a

MTDKLSSSVAVAALSGLVTLPLAWYLLSTA

HGEKQLGTRKVVRPSTTKPLIGNTLDILYNLPIRHDWITSLCEEAKGEPVLLQSLGTPDM

TLLSTPGAFEDVFKNQFDNFPKGPKKSEYLRELLGEGIFAVDNEKWYRQRKTASNLFTMR

ALRDSMTSTIQRHLVVLERIFRRAAETNASIDMFRLLNRFTMEAFTEIGFGVEMNCLDSD

QEHPFQTAFDRSQQSLALRFVRPSWFWKTQRMLGLGPEGQLQQDMKVINSTICDIVAKTL

QNSARGAPKPDDKAAMDIVSLFLDDLNKSSDVDANCFDPTYLRDIVVNFIIAGRDTTAQA

LSWFFYCLSQNKQVETKIREELLAKLPDLFNGQCSPSMDAVGELTYVEAALRETLRLYPS

VPIVSKQAVQDAVLSDGTFIAAGAMAGLPMYALGRMPHVWGPDAADFKPERWVDAQTGKL

ISVSAYQFVAFNAGPRLCLGKNLAMLEMKLIVASLLSKYHVELETPKTVTYAISFTLPVK

GQLNAKISAV*

 

>CYP5014D2 scaffold 20 seq b plus strand (has one intron) 76831 = cyan

very similar to scaf 86 seq a

MLKWISLFDHSGVAPVHPLLIADRFDVDLVSIDLVRIQ (0?)

RKVHRPDSTLPLLENTLDILQAARDGDIHDRTVRACRGFKGEPVLIRSIGLPDKLVVSTPEAFEDVLKTQFNNFP

KGSYMCENLRDLLGDGIFVADGDQWVHQRKTASNLFTMRALRDSMTVVIQRHAVVLYDIL

QRASDNKETLDLFRLFNRFTIEAFAEIGFGVHMGCLDSDEEHPFQKAFDRAQRALLLRFV

RPGWFWKTQQWLSVGAEGRLKQDIEVINATVLEIVEKALAKRSTVSNDEIGDEDKNIVTL

FLDSVGGFASADSQQPDPMHLRDIVVNFLIAGRDTTAQTLSWFFLNLTKNSDVEAAIRDE

IAEKLPKGGRNATTSATATMQDVSQLVYLEAALKETLRLHPPVPMAPKYVVEDTTLSDGT

FVKGKSMIVLATYVMARMQEVWGPDAEVFKPERWIDAATGKLVNVSAYKFASFNAGPRLC

LGVNLAMLEMKLVVAGLLSKFHVEVLNPEDVTYELSLTLPVKGPLNVKVSPAKLPTSPDFA*

 

>CYP5014D3 scaffold 20 seq a 70% to scaffold 20 seq b plus strand

MLPIAELVENSSVTIGGLALLALALSFWHKMRVDEKS

KVHRPDSTLPFLENTLDLIIHGGGKGDLHDFTTQMAKQFDAE

PVFLRALGIPLNIMLFTPEAFEDVLKTQFHNFEKGPFVCENLKDLLGEGIFAVDGDQWVH

QRKTASNLFTMRALRDSMTVVIQRHAVVLYDILQRASDNKETLDLFRLFNRFTIEAFAEI

GFGVHMGCLDSDEEHPFQKAFDRAQRALLLRFTRPGWFWKTQRWLGVGVEGQLRRDIQVI

DKTVLDIVEKALAQRAQRVTHGDGGYDNGDNKVKKVGNIVSLFMDNLANSQQFDPKYLRD

IVVNFLVAGRDTTAQALSWFFVDLSKNPHVEVAIRKEMAEKLQPAEVNRPGAVSSTMEDV

SHRVYLEAALKEALRLHPSVPVVPKQAVQDTMLSDGTFVPAGSAIGLANYAMARMPQVWG

PDAEEFKPERWIDPTTGKLIPVSAYKFASFNAGPRMCLGMNLAMLEMKLVVAGLLSKFHV

EVLNPEDVTYDLSLTLPVKGALNVKVSSIGRSASPAYA*

 

>CYP5014E1 scaffold 36 plus strand 45% to scaf 6 +

MEPWSHGAKAVATVCRPADIRYRANLVADPPDLKALSRTFRPGHKSIRVV

MRTADVVACIQILWPTFPPWASVLVECLVISDRNEGGGGGGINTISYPNSTSLGQVGFKLSRLISLT

MLLPLQLMGDARAPVGLLACGLLVS

AVAVVRVRSHTQQKPHDTATRVPYLPTLVPLLGNTLELATNVGRFHEWVTGHSQQRDGKP

FALRLLGKNDLVFIARPEHFEEVLKTQSRNFCKSDTTREVFDDFLGEDIVLLNGKRWQFH

RKILASLITTRALREYMTPIIQENILRLQRTLKQWSETKGSVDFHKLVRHFTIDTFAEIG

FGCKLDVLASGEEHPFEVAFNDANRISSERITSPTWIWKLKRWLNIGDERRLREAIEEMN

ELMMKLIFTAFGLLQAGADDNQQSAHQNLISIIVSSEREITPTEVRDIALSGLEAGRNTT

ADTLSWLFHALSHNPRVETTLRAEIAAKLPKLEESDSYVPSFEEIQEVPYLEATIREVLR

LYPTVPTVPYHCVNDTVLADNTFIPAGTDVFLTLYAAGRLASVWGHNATEFSPERFIDEK

TGKLCQDSPYSAFSDGPRVCLGRNLAMLQLKIVAATIISRFHLSEDPEQDVQPILDLTIG

MKDPLMMRVETTQQEEAASFS*

 

>CYP5014F1 scaffold 36 seq a 62% to scaffold   36 seq b

MLQSMFKSPVVPSLVTAAVLVTLYWTKSAKVSAKIKGDKVKSA

VILPSTLPVLGNAVELASNAARMHDWLADQFATTDGEAFIVRLPGKDDMMFIAKPEHLEA

VLKTQFDVFPKSEYIHDVFCDMLGDGIVVTNGETWKRQRNVVVGLFSARALREHMTPLVQ

KYTVQLVDILADAAASNAPLDVFDLLHRYTFDVFGEIGFGARMGSMKGAFQPFAEAMDEA

QFLAGKRFKQPMWYWKLRRWLNVGDEKKLRENVRVIDEHLMGIIADAIERRRIRVEQKKE

GRAAALADKDIVSIVLDTMDTKGLPVNPVEVRNIALASIIAGRDTTADCMGWLFHLLSQN

PRAEAKLREEVLAKIPQLSADKHYTPTVEDINKVPYLEACIRELLRLYPPAPLITTHCIK

DTVFPDGTFVPAKTDIGIALFSAGRLTSVWGEDALEFKPERFLDTETGEVISMTATKFSA

FSAGPRICVGQNLAFIETKIVIASIVSRFRMTPEPGQNVAYTEGISLGMMDPLMMRLESVN*

 

>CYP5014F2 scaffold 36 seq b scaffold   36 seq a

MLQSFFEDKLSFLGPVVPGLI

AAAVVVAVYYSTSSTHVSPIGDNEVCDGKVKPKRVRYLPSKIPVLGNAIELLSNAERMHD

WIADQIVPFDGEPFTLSLPGKDDMMFIAKPEHIEQVLKTQFDNFPKSQHIHDLFFDLLGD

GVVITNGETWKRQRRVLVNLFSARALREHMTPISQKYVVQLRKIFEEAAASKEPMDAFGL

MHRYTLDVFAEIGFGTEMKLLEGRYQPFAEAIEESQYIVSARFKQPDALWKIKRWLNIGS

EKKLRHAVQVIDEHVMGIISGAIQRRQQRVEAARSGKEAKPVDRDIVSIILDSMESNNQV

VDPVEVRNIAAAALIAGRDTTADALGWLIHVLSENPRVEAKLRSELLAHLPKLATDIDYV

PSAEELSQVHYLEATIRELLRILPAGPAIATHCVRDTVFPDGTFVPKNTDIGISFYTTGR

LSGVWGEDVSEFKPERFLDADTGEVVKVSSSKFCAFSAGPRICVGRNLAFLEMKIVIANI

LGRFHLVPEPGQKPAYTQGITLGMQTPLMMRVEAVSSQDSSVAA*

 

>CYP5014G1 scaffold 86 49% to scaf 6 minus

MLLWFLDSDDPLKKFGLLMLGVLIGAAVASQCLPSDTTDKKLIKQQQSVTPGAHGPVRVPFL

PSMIPVVGHAVLMAYHAGRFLDWVTDVFVSRGGAPFTLRLPFQRDMILTANPEHYEHVMK

IQYDNFLKGDHIYDLLVDLLGDSILIVEGDEWKFHRRVFVNLFSSRALREHMAPIIQKHV

RVLQGVLSNAAQSKQAIDFFTYSGRFTLDAFGEIAFGFNMSTLTLQHEHPFERAFVDAQH

ITAARLVVPTWYWKLKRWLNVGSEKNLREALTTVDQFVMDVISKTMDKRNAPPNDAEDKT

RNRDIVSLILANETVDGKPVDPILVRNVVLMALIAGRDTAADAIAWLFHLLTLNPRVEAK

LRADLLARLPKLGTDLDYVPDSQDVQGLAYLEATICEALRLFSPVGLAQKLCVHDTVFPD

GTFVPKGANIALVYHAMARMPSVWGPDAASFVPERFLDPETGDLLKVSSGKFSAFNTGPR

VCVGRKLAMLEMKMVVACVVARFHLDEVPGQDVACSGGITIGMKNPLLMRVNELLPAAKS

DEVAVGVAA*

 

>CYP5014H1 scaffold 6 minus strand 49% to scaf 6 plus strand

MLRQWLVKHRVLSPLGPAGLVLLAGAAAVAAYISTRSSGVEVKEFDLKEEE

STDKPRVVPYLPSKVPLLGNMLELAGNAHRFHSWMAEQCVIHNGVFKLYLPGQSDMLVTA

VPEHYEHVVKTQFEHFSKGKQQYDMFVDLMGHSVLIIEGERWKYHRRLLVRLFSARALRE

HMTPVIQRHTRMLQTVFLKAAVAKKPVDVYMLMHRFTFKAFAEMVFNNTLDSIDSEHEHP

FEQAFDEAQSIVAGRLQQPVWFWKLKRWLNVGLEHKLREDVALIDEFIMSIISTAIETRR

RRQEDLKAGRPVKPADKDIVSIVLECMEQDGDMVSPTDVRNIAVAALGAGRDTSADAMSW

LLHTLTQHPEVENKLRAELLEKLPKLAVSATYVPSMDEVHGLVYLEATIRELLRLQTPVP

FTLRECIHDTVFPDGTFVPKGTNVGMCHFGAARRTEVWGPDAAEFKPERFVDPVTGKLLH

TPMAKFNAFSGGQRVCVGKALAMLEMKLVIATLVGRFHFSEVPGQDVQYAMGITIGMRSS

LMMNVQPVIRGAAGAAA*

 

>CYP5014K1 scaffold 6 plus strand 50% to scaffold 36 seq b

MTPPPPHHHHGGHQTSFAFVHRPVTMLSPSAVLTHPLVVGLLTAAAVAALSSVATSYAADA

DSGDEDTKPVKPVPYLPGAHPVLGHTLLMAKNLDRFQDWLVETSVARNGEPFVLRQPGKN

DWLFSARPEDFEQILRVHFDTFIKGPQVRELLDDFMGENIVIINGHRWKFQRKALVNLFT

ARALRDHMTPIVQKCALALQRVFAQAATSGEAIDVHHVMGKFTLETFAEIEFGSQLGLLE

SGQEHAFETAIDDANHISLERFAVPMWVWKLKRWLNVGSERRLREDMAVISSFVMSCISG

AMERRKQRQEAAARGEPLGPVAKDIVSILLDSEDTIGEPVLPKDVFNISLAGVLAGKDTT

GDATSWLMHMLHENPRVENKLRTILLAEVPKLAVDESYVPSMEELDGVTYLEATIRESLR

LKPPAPCVTQHCTQDTVFPDGTFVSKGTDTTLLYHASALLPSVWGPDAAEFKPERFLDEN

DKLLVLPPLKFIAFSAGPRKCVGRKLAMIEMKVVTACLLSRFHLVQVPGQDIRGTMGISL

GMKYGMKVNVQRTPGVAIRA*

 

>CYP5015A1 scaffold 63 43% to 1485 early

MGESKSSLWVALPAAAAAAVLVYLLVPDERQRAIRRLPAPASTLPL

LGNTLDLMSLELPRLHDWLAEQCKAFGGRTWRLQVLGAPPLVVVSSVECFEDVLKTQFDV

FDKGARMNEIFRDVAGGGIVAVDGPQWLAQRKMLSRLFTMRAFRETISQCVHDYTLVLGR

KLNDAVRTGAPLDFADLMHRFSFDVFTDISFGLQANALEGGEHSQFMEAMGKIVHHIEMR

FHSPDWLWKLKRALKLGGEKELAQEIAILDKMVFTIINKNMERKFNPDAAANEWPPRPPR

STKDVVSLFLDAHDEQKEAGGSPLDASFLRDIAVVVLLAGKDTTAWSLSWLIIMLNRNPK

VETKLRQELREKLPRLFSDPTYVPTMDDVENLVYLDAVLRENLRLNPLVPLNAKEANRDT

TLVDGTFVKKGTRVYIPSYTLGRMKSVWGRDASKFKPERWLMEDPWTGELTIRPVSAFQF

VSFHAGPRTCLGMRFAMLEMKTVTAYMLSKYHFTTKENPKSYTYDVASLLQVKGPLVVKV

QRAG*

 

>CYP5015B1 scaffold 27 seq d same as 1485 early 49% to 1485 later

MLPFAIAASLVAAA

VAYLTSPNDQDRAVCELPTPRSTLPVIKNTLDLAVRQRARIYDWILEQCREHGGRPWRVR

VLGRPPAVIVSSPEAMEDILKTQFEVFVKGSAVATISKDLLGDGIFAVDGSQWKHQRKAA

SHFFSMNMIRDAMEHVVRDHSVRLTKKLSEAAVSGEVVNIKRVLDFYTMDVFTKIGFGVE

LRGLETGGNSDFMEAFERATRRIMARFQQPMCIWKLARWLSVGAERQMASDMKLINGVVY

DVIHRNLEGKKQRKQSGGTFNSGREDVISLFLEKANVEYSDDDHVKMTPTMLRDMSMVFL

FAGRDSTSLTMTWYIIEMNRHPEALANVRRELTDKLPRLGLNDAETPSMEDIDELVYLEA

AIRECIRLNPVAPVLQRTAAQDTTLYDGTFVKAGTRVILPHYAMGRLETVWGPDAEQYKP

ERWIDPDTGKLVHASPYKFAAFLAGPRMCLGMRFALAEMKLTLATVLSKFHIETVEDPFE

FTYVPSVTLQVKGPVDVRVTRPSSM*

 

>CYP5015B1 scaffold 1485 early = 77926, 87040 starts ar DRAVC 49% to 1485 later

MLPPFAIAASLVAAAVAYLTSPNDQDRAVCELPTPRSTLPVIKNTLDLAVRQRARIYDWILEQ

CREHGGRPWRVRVLGRPPAVIVSSPEAMEDILKTQFEVFVKGSAVATISKDLLGDGIFAV

DGSQWKHQRKAASHFFSMNMIRDAMEHVVRDHSVRLTKKLSEAAVSGEVVNIKRVLDFYT

MDVFTKIGFGVELRGLETGGNSDFMEAFERATRRIMARFQQPMCIWKLARWLSVGAERQM

ASDMKLINGVVYDVIHRNLEGKKQRKQSGGTFNSGREDLISLFLEKANVEYSDDDHVKMT

PTMLRDMSMVFLFAGRDSTSLTMTWYIIEMNRHPEALANVRRELTDKLPRLGLNDAETPS

MEDIDELVYLEAAIRECIRLNPVAPVLQRTAAQDTTLYDGTFVKAGTRVILPHYAMGRLE

TVWGPDAEQYKPERWIDPDTGKLVHASPYKFAAFLAGPRMCLGMRFALAEMKLTLATVLS

KFHIETVEDPFEFTYVPSVTLQVKGPVDVRVTRPSSM*

 

>CYP5015C1 scaffold 27 seq c same as 1485 later 49% to 1485 early

MKLEAMTAWVSPASVAATCVAALLVYLATPSAHDRAVKHLPGPDGDLPILRN

TLEIIAAQKSGTFHDWALKYCRKYQGKPWRLRVVGKEPTVVLCCPEAFEDIQKTQFEAFD

KSRFVSAAMYDVLGQGIFAISGPLWQHQRKTASHLFTTQMLQYAMEVVVPEKGDELIKRL

DGVCLKEKTADRVVSMKRLLDLYTMDIFAKVGFDVDLHGVQSNQNAELLDSFDRMSVRIL

ERIQQPMWYWKLLRWLHVGPEKQLAEDVKKLDDLVYGVMARSIEDKNRQDASQSARKDLI

SLFIDKSDVEYTKGVHTKKDLKLMRDFVISFLAAGRETTATAMSWVILMMNRYPDVLKRV

RQELNEKLPGLASGKLRAPSMEDAQKLVFLEAVVRETLRLFPVVAITGRSATRDVYLYEG

TLIKAGTRVVMPHYAMGRMSTVWGPDVDEFKPDRWIDPITGKGKVVSPFKFSVFLGGPRI

CLGKKFALAEIKISLAKLLSQFDFKTVRDPFDFTYGSSITLQIKGPLDVVVSRLH*

 

>CYP5015C1 scaffold 1485 later 49% to 1485 early

MKLEAMTAWVSPASVAATCVAALLVYLATPSAHDRAVKHLPGPDGD

LPILRNTLEIIAAQKSGTFHDWALKYCRKYQGKPWRLRVVGKEPTVVLCCPEAFEDIQKT

QFEAFDKSRFVSAAMYDVLGQGIFAISGPLWQHQRKTASHLFTTQMLQYAMEVVVPEKGD

ELIKRLDGVCLKEKTADRVVSMKRLLDLYTMDIFAKVGFDVDLHGVQSNQNAELLDSFDR

MSVRILERIQQPMWYWKLLRWLHVGPEKQLAEDVKKLDDLVYGVMARSIEDKNRQDASQS

ARKDLISLFIDKSDVEYTKGVHTKKDLKLMRDFVISFLAAGRETTATAMSWVILMMNRYP

DVLKRVRQELNEKLPGLASGKLRAPSMEDAQKLVFLEAVVRETLRLFPVVAITGRSATRD

VYLYEGTLIKAGTRVVMPHYAMGRMSTVWGPDVDEFKPDRWIDPITGKGKVVSPFKFSVF

LGGPRICLGKKFALAEIKISLAKLLSQFDFKTVRDPFDFTYGSSITLQIKGPLDVVVSRLH*

 

>CYP5015D1 scaffold 27 seq a 46% to scaf 1485 early

MQACDYLQVDARYGCYHIVGRIARGGEVEGASSWLALSHLRASAQAPTMWSSLKFAIEASTTS

WGALLLCSLLVGWHLLSMRKQARAPSKFARPQSTLPVLGNTVDLMFTHQEDIHDWMLAEC

RRCKGRPWVLAALGRPMSLVLSNVEAFEDVLQRKFDQFGKRSAWLVSDVFGDGIFAADGV

SWIHQRKTASHLFSLHMMRESMEQVVREQAAVLCKTLFAHLNSNQSSSMADQHGVPVNLK

YTMDWYATNVFTRVGFGVDLDSLPSQEHDEFFRAFTRLPIAIHRRIQQPGWLWRIKRALN

LGYEKQLKLDMKRVDDVIYQVISRSMTSKSSEDPRRLPDLISLFLAKESNEYRDRDTKQE

GGAAATRSVKTTPKLIRDMAFNFTAAGRGTTSQSLQWFIIMLNRYPSVERKIREELLAKL

PQLFESNSSPPTMNDVQQLVYLEAAIKESLRLNPVAPLIGRTATQDVCFSDGTFITSGTR

VVIPTYAVARLKSIWGEDAAEFNPERWIDPQTGKLLVISPYKFLVFLAGPRSCLGAKLAM

LELKVALATVLSKFHLRVLRDPFEIGYDASISLPVKGDVLAVVEPAEMVVPPSAENVERSAGAA*

 

>scaffold 27 seq b introns? duplication, assembly error

366140 PDGDLPILRNTLEIIAAQK 366196

 

366212 DWALKYCRKYQGKPWRLRVVGKEPTVVLCCPEAFEDIQKTQFEAFDKSRF 366361

366362 VSAAMYDVLGQGIFAISGPLWQHQRKTASHLFTTQMLQYAMEVVVPEKGD 366511

366512 EL 366517

 

XXXXXXXXXXXXXXXXPVAPVLQR = scaf 1485 early, duplication or assembly error

TAAQDTTLYDGTFVKAGTRVILPHYAMGRLETVWGPDAEQYKPERWIDPDTGKLVHASPY

KFAAFLAGPRMCLGMRFALAEMKLTLATVLSKFHIETVEDPFEFTYVPSVTLQVKGPVDV

RVTRPSSM*

 

>transcript 87040 predicted transcript by JGI cyan = CYP5015C1

green = CYP5015B1

this is a fusion of three genes, the third is not a P450

DRAVCELPTPRSTLPVIKNTLDLAVRQRARIYDWILEQCREHGGRPWRVRVLGRPPAVIV

SSPEAMEDILKTQFEVFVKGSAVATISKDLLGDGIFAVDGSQWKHQRKAASHFFSMNMIR

DAMEHVVRDHSVRLTKKLSEAAVSGEVVNIKRVLDFYTMDVFTKIGFGVELRGLETGGNS

DFMEAFERATRRIMARFQQPMCIWKLARWLSVGAERQMASDMKLINGVVYDVIHRNLEGK

KQRKQSGGTFNSGREDLISLFLEKANVEYSDDDHVKMTPTMLRDMSMVFLFAGRDSTSLT

MTWYIIEMNRHPEALANVRRELTDKLPRLGLNDAETPSMEDIDELVYLEAAIRECIRLNP

VAPVLQRTAAQDTTLYDGTFVKAGTRVILPHYAMGRLETVWGPDAEQYKPERWIDPDTGK

LVHASPYKFAAFLAGPRMCLGMRFALAEMKLTLATVLSKFHIETVEDPFEFTYVPSVTLQ

VKGPVDVR PEE MKLEAMTAWVSPASVAATCVAALLVYLATPSAH DRAVKHLPGPDGDLPI

LRNTLEIIAAQKSGTFHDWALKYCRKYQGKPWRLRVVGKEPTVVLCCPEAFEDIQKTQFE

AFDKSRFVSAAMYDVLGQGIFAISGPLWQHQRKTASHLFTTQMLQYAMEVVVPEKGDELI

KRLDGVCLKEKTADRVVSMKRLLDLYTMDIFAKVGFDVDLHGVQSNQNAELLDSFDRMSV

RILERIQQPMWYWKLLRWLHVGPEKQLAEDVKKLDDLVYGVMARSIEDKNRQDASQSARK

DLISLFIDKSDVEYTKGVHTKKDLKLMRDFVISFLAAGRETTATAMSWVILMMNRYPDVL

KRVRQELNEKLPGLASGKLRAPSMEDAQKLVFLEAVVRETLRLFPVVAITGRSATRDVYL

YEGTLIKAGTRVVMPHYAMGRMSTVWGPDVDEFKPDRWIDPITGKGKVVSPFKFSVFLGG

PRICLGKKFALAEIKISLAKLLSQFDFKTVRDPFDFT SPRALLKMLRSSWRLRLSLRRFS

TATASSSPRQLQFVTSNAIDRRLLQGLTSADDDARPVNLIQDEEGARRVLDKIKALGPDH

FHACDTEVGQIDIKAVGPVGNGVVTCLSLYSGPDVDYGNGPYVWVDNLDSAEGTLQYFKT

FLESKQYRKVWHNYSFDRHVLYNHGINVQGLGGDTMHMARLWNTARFQNGGYSLEALTAD

LLLQRKKPMKELFGIPKLKKDGSKGKERLMPSVEELQRFPEFRKRWIRYSVYDAESTWFL

HRVLQHKLDQTFWFEKPPKTGEGEVEPQVGSMYDFYRQYIIPFGECLTDIERKGMHVDLE

YLAGVEKQALQDRARLERLVLKWASRYCEESERMNLFSAAQKQQLLFAPYFDQKKKKEVL

PAER

 

>CYP5015E1 scaffold 41 seq a 79501 89% to scaf 41 b

MKAVSELLGDRNDVAVAAAAAVAVSLGLSLLLHSSKKNATIEA

RKLPPMPKTTLPILKNILDAGGNAERFHDWLNEQSTEFDNRPWMFTIPGRPANIVLSSPE

IFEDVLKTQDDVFLRGPTGQHISYDLFGNGMVITDGDLWFYHRKTASHLFSMQMMKDVME

ATVGEKLGVFLDVLDIYHKRGKPFSIKEELSHFTMDAIAKIGFGIEMDTLKNSPDREEDH

EFLKAFNEGSVAFGVRIQSPLWLWELKKYLNIGWEKILMDNTRIMHEFINDVIV

QSMNKKAELAAKGEKLVARDLISLFMESKLRQTEEMHIEDDDATIMRDMVMTFVFAGKDSTAHSMG

WFIVNMNRYPEVLQKIREEMKDKLPGLLTGEIKVPTEQQIRDLVYLEAVVKENVRLHPST

GFIVREAMESTTLVDGTFVEKGQTMMVSSYCNARNKRTWGEDALEFKPERMIDPETGKLR

VLSPYVFSAFGSGQHVCIGQKFAQMEIKLAMATLFSKFDIKTVEDPWTLTYEFSLTIPVK

GPLNVEVTPLAPLATAASA*

 

>CYP5015E2 scaffold 41 seq b 89% to scaf 41 a

MKAVSELLGDRNDVAVTAAAAVAVSLGLSLLLHSSKKNATIEARKLP

PMPKTTLPILKNILDAGGNAERFHDWLNEQSTEFDNRPWMFTIPGRPANIVLSSPEIFED

VLKTQDDVFLRGPTGQHISYDLFGNGMVITDGDLWFYHRKTASHLFSMQMMKDVMEATVC

EKLSVFLDVLGVYHTRGRTFSVKQELSHFTMDVIAKIAFGIELDTLKNSPDRDDDHEFLK

AFNKACVAFGVRIQSPMWLWELKRYLNVGWERVFKENNTIIQKFIN DVIVQSMNKKAELA

AKGEKLVARDL ISLFMESKLRQTEEMHIEDDDATIMRDMVMSFAFAGKDSTADNMCWFIV

NMNRYPEVLQKIREEMKDKLPGLLTGEIKVPTQDQVRDLVYLEAVMKENMRLHPSTGFIV

REAMESTTLVDGTFVEKGQTLMISSYCNARNKRTWGEDALEFKPERMIDPETGKLRVLSP

YVFSGFGSGQHVCIGQKFAMMEIKLTLATLFSKFDIKTIEDPWTLTYEFSLTTPVKGGLS

VEVTPLTPAASA*

 

>CYP5015E3 scaffold 41 seq c 70% to scaf 41b

MKTVSELFGDRNDLAVTAAAAVAVSLGLSLLVHKSKKNKSTEARKLPPMPKTTLPIFKSLFDIGN

NLDRFHDWLLERSAEFDNKPWMYSIPGRPVTIVLTAPEYLKDVMVTQEDVFLRGPLTQYM

SEDIFGNGMIVTDGDPWFFHRKTSSHLFSMQMMKDVMETTVRDKLDVFLDVLDIYHKRGK

PFSIKKDISHFTTDAICRIGFGIELDTLRNGPDSDEDHTLLKAFDLASIAFVVRVQTPWW

IWEPKRFFNVGWEKVFKDNVKIVHDFIDEVILQSMNRKAELAAKGEKMEARDLITMFMES

NLKETEHMNFQDNQATIMRDMVTTFMFAGKDTVGHSLSWFIVHINRYPETLKKIREEMKE

KLPGLLTGEIRVPTHEQLRELIYLEAAVRENVRLFPSTGFIAREAMRDTTLVDGTFVGKG

QTIMVSSYCNTRNKKLWGEDALEFNPDRMIDPETGKLRVFSPYQYSGFGSGQHVCIGQKF

AMMQMKLAMATLLSKFDIKTVEDPWKLTYDFSLTIPVHGPLDVVVTPLTPLSSA*

 

>CYP5015F1 scaffold 1 55% to scaffold   21 minus

MWSSTSHDGAQQSVLMALGALTVVYASWKILNLPVPMPDPGMEQLFRPASTLPVLGNTLDVLLFNRYRMSD

WINDQTDASSGQPWILQLLFQPPWVVLSMPNDLHDVFVDQFQVFEKGGTLGDISFDVLGN

GLLNVSGDKWKQQRRAASHLFSTQSIRDVMEPVIREKTLQLRDVLAQSAGRKQTVSMKSL

LGKFTSDVFTRIGFGVELDQLGGDVFKDEQHPLDIALHAVQNRFQTPAWMWKLARFLNVG

AEKRLRESMKTVNDMVRDIMVRSISEKSSGDQKKNLLTLLMKDNAAADPRELQDTAVNFF

IAGKDTTSFSLSWLIVMMNRYPRVLQKIREEIRSVLPTLLTGEMDAPTLEDTQKLVYLDA

AVKESVRLQAVSTYRCTTRDTTLTDGAFIKKGTVVVVSKYAAARRKGVWGEDAAEYKPER

WFDEKTGEPKNITPPQFITFSTGPRKCIGMRLAMLEMKTVMAVLFSRFDIETVEDPFKIT

YDFSFVLPVKGPLAVRVRDRAPLSA*

 

&&&&&&&&&&&&&&&

 

>CYP5015F2 scaffold 21 minus strand N-term from scaf 1663 55% to scaf 1

MIEQQALPSLLATVFALLLVWWKLLNK

PSTNSQKLFRPASTLPILGNTLDLLWFQKHRLHDWMT EQSLASGGKPWLLTGIGQLPRVVVTSP

AAYEEVFKTQFDVFVRGPGETVLEVLGEGIFNVDGDKWRRQRRVTSHLFSMHMLKDCMNA

VVREKTVKLRDVLAMCAERGDPVSMKSLLNKFTADAFTRIGFGVELNGLDDPADVDTSQP

LDAALQVVQIRLQSPVWLWKLRRFFDVGSERVMRESMQQVHDTIQHIMAKSLADKEEQAA

ASEEATRTSSHKDLMTLMLQTGDFKDTREIRDVAVN FYAAGKDTTAFSLSWFIVMMNRHS

HVLCKVRDELRCVAPELFTGELDTPTLEHLQQLTYLEAALKESLRLNSLAVYRLANRDTT

LSDGTFVPKGARVVFSMYGSARQPGVWGPDAAEYKPERWIDETTGKLKNISSFQFVTFSA

GPRQCIGMRLAMMEMMTVLAVVYSRFDLKTVEDAFDITYDFSLVLPVKGPLAVRVHSLAAHKA*

 

>CYP5015F2 scaffold 1663 nearly identical to scaf 21 minus strand

MIEQQALPSLLATVFALLLVWWKLLNK PSTNSQKLFRPASTLPILGNTLDLLWFQKHRLHDWMT

(sequence gap) 87200 = cyan

VSFYAAGKDTTAFSLSWFIVM MNRHSHVLCKVRDELR

CVAPELFTGELDTPTLEHLQQLTYLEAALKESLRLNSLAVYRLANRDTTLSDGTFVPKGA

RVVFSMYGSARQPGVWGPDAAEYKPERWIDAKTGKLKNISSFQFVTFSAGPRQCIGMRLA

MMEMMTVLAVVYSRFDLKTVEDAFDITYDFSLVLPVKGPLAVRVHSLAAHKA*

 

&&&&&&&&&&&&&&&

 

>CYP5015G1 scaffold 11 from JGI seq a 75133 65% to scaf 11 seq b

MWGIAQHQVNER

QAVIAVGALSGLYLGYKL LSAVYSDMKITRALDSQGLHRPKSTLPILGNTLDVMFFQKDR

LQDWMADQSQISDGKPWVLSIIGRPQTLIITSPEACEDVFKTQFDNFGRGDELVDLQHDI

FGEGVAGVDGEKWLKQRRIASHMFSMKMLRDVMDEVIIEKSKKLRDVLAACAKEGRIAPM

KSLLGKFSSDVFTKIGFGVDLHGLDGDINSEMDHPFIEAVDGYAEVFGARLQSPMWFWKL

KRFLNIGDERMLKRCIKVATDLLNDVMLKSMSNKTAEDWNSKTDLLTLFVDSTGNTDSSD

LRDAMMNFFLAGKETTSFSMAWIIVNLNRHPRVLAKLRAQIRENLPELLTGELEVPTMED

LQKIPYVEAVLKESLRLNMTGVHRTPMRSTTLSEGTFVPFGSYVVMSVYAAARVKNVWGE

DAAEFNPDRWIDEETGKVKFVNPFQFITFGGGPHQCLGMRFALLEMQTVIAVLFSRFDIK

TLEDPFKITYDYSVTLPVKGPLECAINEVAAPAF*

 

>CYP5015G2 scaffold 11 from JGI seq b 65% to scaf 11 seq a

MWGLAQHQGNQREAMLAVGTVSALYVSYKVLSAMYKSNSIGRAFDAQG

LHRPRSTLPLLGNTLDVMFYQKERLWDWMAEQSNLSDGKPWVLSIIGRPDALIVTSPEAC

EDVFKTQFDNFGRGADLRDVIYDIFGDGIAGVDGEEWQKQRRVASHLFSMKMLRDVMDEV

IIEKVTKLREVLAGCAKEGKVVPMKSLFGKFTSEVFTKIGFGVDLHSLESDPCCDSNNAF

IKAVDVYAEVFGARVQSPAWFWK LKRFFSIGDEGRLKESAKVA EGLTQEVLAKSLEARRR

DSSEVKR TDLLTLFVETNTNIDPKAVHDTLMSFLLASKDTTSFSLSWVLINLNRYPAV LA

KLRDEIREKLPGLMTGEIKVPTMEDLQKLPYLEAVVKESLRLYMAVTNR MAKTSTTLSDG

TFVPEGCAVMVPI YASARVKNVWGDDAEEYKPERWIDLSTGKVKPVSPFKFFTFAAGPRQ

CLGMRFALLQIQTTVAVLFSHFDLKTQENPFDLTYDFAITLPVKGPLNITVRDIVPAAF*

 

&&&&&&&&&&&&&&&

 

>CYP5015G8P scaffold 11 seq c pseudogene? 75116 = cyan almost identical to scaf 2789

87865 = cyan 2 aa diffs 64% to CYP5015G6

305733 QHPSTNHNTFETAAGLSALYTWWRIASELYSQHIIDSTLKK*GPHIPPST 305882

305883 VPILDNTLDALVFQKEHFWDWLTEQSNLSGGKPWMLHLVGRPTTLL 306020

306020 ETLEDIFKSHFDTFERGDDL*ELIYPFFGDGIVGADGENWLKQRRAGRH 306166

306156 RDV MGAVVKEKTLQLHDVLVKFSKEGRTVDMTSLFGKFSSDTFTKIAFGV 306305

306306 DLNGLAGDVGAEAYHPFNAAVGVMAEMLGSRLLSPTWVWKLKHFLNIGDE 306455

306456 RKLKEACDIVHELTY* VMIESIQKKIGDADQ 306548

306562 LHDTVMNFLLAGKDTTTFSMTWILVNLNRHPEALAKLRVEIKENLPGLLTGE 306717

 

>scaffold 2789 possible pseudogene see CYP5015G8P

SALYTWWRIASELYSQHIIDSTLKKQ 913

912 GPHIPPSTVPILDNTLDALVFQKECFWDWMTEQSNLSGGKPWMLHLVGRP 763

751 ETLEDIFKSHFDTFERGDDL*ELIYPFFGDGIVGADGENWLKQRRAGRH 605

615 RDVMGAVVKEKTLQLHDVLVKFSKEGRTVDMTSLFGKFSSDTFTKIAFGV 466

465 DLNGLAGDVDAEAYHPFNAAVGVMAEMLGSRLLSPTWVWKLKQFLNIGDE 316

315 RKLKE

209 LHDTVMNFLLAGKDTTTFSMTWILVNLNRHPEALAKLRVEIKENLPGLLT 60

59 GE 54

 

&&&&&&&&&&&&&

 

>CYP5015G9P scaffold 18 pseudogene 76458 use scaff 11 b for comparison

242453 LTTLYSKRVVDAAIERQKLHSPQSTLPFLGNTLDVLFYQKGRLWDWMAEQ 242602

242603 SKLSKGRPWVLRIVGRPAIIIFASPEAFEDIFKTKFEIFX

242724 RGPDMQELFYAFFGEGVIGADGDKWLKHRPTPSLMVTTRTLRDIVDAAAK 242873

242874 EKAVQLRDVLGE*VKQRRVVSMNSLLGMFSSDVFTKIGFSVDLNGLGGDV 243023

243024 NSETGHPFIEAVEVYAEVLGSRLQSPTWLWK 243116

95 aa gap in 4 nucleotide space = pseudogene

243121 LSKLREEVRDKLCGLITGEIDAPTFEDLQNLPYLETVIKESLRLYVTATNR 243273 frameshift

243273 CSKPLKMYASARMEDVWGEDADKYKRERWIDSEIGK*RFKFIS 243398

243399 NFIAGPRQCIGMRFALLQMRIATAVIFSRFDLQTVEDAFAIMYDVAFTLPVKGPLNITVHEIAC*

 

>76458 predicted transcript Cyan = P450 C-term, the rest is not P450

MYASARMEDVWGEDADKYKRERWIDSEIG NFIAGPRQCIGMRFALLQMRIATAVIFSRFD

LQTVEDAFAIMYDVAFTLPVKGPLNIT TATSRPTASQAPRKNGSTGPKSAKKKKPQPKSN

GKDAAGPQPVAPPPVSGPWSNPNPATSSAPPPGFAPSPAKPGFSDAHQRLLRHRALFAFR

FLVGKAVELQLVASDERYAGVLDCVDPDDFSVVLKSTRRVSSASDAKPFEDGSTVIFRRH

QLAHLVADGTANYTDGAFVAGVTAAAGGFRTDTEISGRQGEHLIGRELETASSWLDPALD

TGALEDSAPNGRRHGKNQGKPGNWNQFEANEKLFGVVSTYDENIYTTKLDKTKISTEQSR

EAERLAQEIERQSAAGNFHLQEERGQAVRGGKHANDLDEEARYSSVDRRGAPPSSGGNAY

VPPALRNAQRQSSDGKSKAKATKTPPASASPATPAPPVPTAAEPVPAATTPAALSPSKPL

SFSEAVTGRSTATAAPPKSESEEKKAAAPAQVNDKDAKPAATKPKSSPKKKENGKEKKAE

SKTEESKDSFSSSTSTTTTTTTTKQTKEEAPKAAPKKELNPKAKEFKLSAAAVEFTPTFS

VPPAVKEQVSSPYRGGSPGHMPYPHPGMGYPPPMQEDWMYDGGMGGEEGGEMGMPPYGYG

VPVGPNGVPMMYPPMMPQQNMRMMGGQRGYGGYQQQGYNPRGYYSPPNGGYPAGGVPYAG

GPGGVGPQPPLPREAPTPTDDESPSESAGVPSPGAPAPPVPETTAPEVVSTPASSKKQQP

AKNK*

 

&&&&&&&&&&&&&&&&&&&

 

>CYP5016A1 scaffold 92 33% to scaf 20 e minus

MVPTNLLNTRGLVTFSPGPLLGSTVALMLLWAMRHGRERRLSWTSTPLPPRGYRVLPT

ASTSPVLVRVGMTAEQPSEGSPAFYDWVFAITTRFHGKPWLLHRPGRPDVLVVSSPGSFE

DIQRTFAVQFEKIEGDADGLAHDVHGGATALVCTGQLRPSVNMQRQLAASVLGSPALRQQ

ASALVNQHLDSLLRVLEGAARLGTGNVSTDLDVSKMMRQFAMEVFTELGFGLQLGALRSP

CYQASKLENAIDDVQRRMAERSQRPAVSWKLERLLAVGSEAALSRSIDVVSTITLEAVHA

KRSKHRAGSGCDSPLAGSRVDMLDLLLGQKCSSKSSKDPEFLAGFVLGLVVAARDSMAHA

LTKCLQCLAQNPEEQEKLLRELKEAEEEGKDPRSIARLEAVIKEALRLHPSKPFVRRRAR

QDTVLSDGTFVAEGTEVAMDLYSMARRENVWGPGSAQFRPQRWIDATNSKLRPVSKYKFN

AFLGGPRACLGADIALAELKTVIAAVVSKVYLDAIEQSDGKDKALTRGVPCGEAMRIQVR

RRDFSRPGHYS*

 

>CYP5017A1 scaffold 14 30% to scaf 20 a

MKMPWHLFFDGAIYVSSVLLQYQENLLW

WSANRSLFVQITDPKDVQHILSTNVNNYVKPQGFLDAFQEIFLNSFFALNHHPQAPDGGA

RWRLQRKVAAKVFTTANFRVFTEQVFARHADKTLASAQAGAIQAEAREGDDQATGGSFCC

DMQEISASYTLQSIFDVAFGLPLGEIEGTEDFAGHMGFVNEHCAQRLFVKQYYKMFRWFM

PSERELRRHTRAIHAVAESVLLRRLQESEEKVSSRSDLLSLFICKARELAFDGKNGHDGP

EVASLLGPETLRSIILTFVFAGRDTTAECLTYAFYAIARHPRVQKRIVEELGSAKPNDDT

PFTFDEVKHLRYLEAVVYETVRLYPALPYNVKNAVKDDYLPDGTFVPAGVDIVYSPWYMG

RNGPLWGDDPLEFRPERWLEMAKRPSAYEFPAFQAGPRVCLGMKMAVLEAKLFLATTLLR

FDVAIAPGEKHERGYMLKSGLFMGGGLPLQMTPRPRNAPPA*

 

>CYP_un1 scaf_7 C-term frag or accidental match or pseudogene frag

WLSPPTRRRFSLSPRNCLGGHVADALRTFLVCRCLSPLPPWHHRHMPVEQQWSGVSPSDSANR

DWPWQPCWKLVQRSSKRIEK*