P450s from red algae genome projects

 

Cyanidioschyzon merolae P450s 5 sequences

 

Galdieria sulphuraria P450s 8 sequences

 

These two algal genomes have P450s that cluster into five groups.

#331 is a CYP51, as is contig 1016.  #4765 and #4211 are both CYP710 sequences.  Both algae seem to have two CYP710 sequences that arose by recent duplication within each lineage, not in the common ancestor.  #444 is probably the ortholog of contig 1062.  #201 is the probable ortholog of contig 1041.

Contigs 454, 981 and 989 are unique to Galdieria.  They cluster together on a tree.  Contigs 981 and 989 are quite similar (40%) and are a recent duplication.  Any orthologs of these sequences have been lost in Cyanidioschyzon.  The alignment below suggests that some of the sequences may need revision.
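
A similarity figure such as the 40% quoted for contigs 981 and 989 can be checked directly against aligned rows. The sketch below is a minimal illustration, not the method used to produce that number: it assumes one common definition of percent identity (matching residues divided by the columns where neither sequence has a gap), and the function name percent_identity is hypothetical. The two example strings are one 50-column block of the alignment for con981 and con989 with the spaces removed, so the value printed describes that block only, not the full-length sequences.

```python
def percent_identity(a: str, b: str, gap: str = "-") -> float:
    """Percent identity over columns where neither sequence has a gap.

    Assumption: identity = matches / columns compared. Other definitions
    (for example, dividing by the full alignment length) give other values.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == gap or y == gap:
            continue  # skip any column that contains a gap
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# One 50-column block of the alignment for the two Galdieria contigs
con981 = "ERFRKQYGDVFQIWSFYRQIVVISHPDDIKYIIVTKNFPKA-----EEFN"
con989 = "LQYAKQYGPTFQLWYLNRRTIIVANPEDAKFVLATRNYPKS-----PIFC"

print(f"{percent_identity(con981, con989):.1f}% identity in this block")
```

Running the same function on the full 628-column rows would give a whole-sequence figure under this definition.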

 

Alignment of the 13 sequences (Phylip format)

 

13    628

331        -------MGT ILAALANQFQ ALLARARDGD TDAIGWLALG VAALVWLMTR

CYP51con10 ---------- ---MLSQDSI ALSTLTSSLE AYCWALVYIL STILFFGILW

4211       -MIRRPRYAL DASGSSRVGG SVARHCLRRA TLRWDRR-LG YRQPCAGVRC

4765       MIETASLTRS TPVAAAEWVV SVARHCLRRA DAALGIAGWG TASRALAYVA

710B_gen   ---------- ---------- ---------- MDIVSFNSLS SGNLIILLVV

710B1_like ---------- ---------- -------MQL TEFDSFNKFL SGNLVFLGVS

con981     ---------- ---------- ---------- ---------- -MLQLWIVLV

con989     ---------- ---------M MMSCLAVSLL QLSNLSQDWS RVFKLFILAA

con454     ---------- ---------- ---------- ---MASPNEI ARWFLQTRTK

con1062    ---------- ---------- ---------- ---------- -MWIGLLLFF

444        ---------- ---------- --MFPVFIWF VIVAIAIVTF TFGVPGTLLR

201        ---------- ---------- -----MILAR NLLHVPRPSL YVVAALGAFS

con1041    ---------- ------MSLM KRVMFLGGQV ARWLVNGGLL SLVFVDLAFS

 

           AAQALATLLS PAKPAEGLEL AYPPLYTEGF PVVGNVAAFA R---NPLQLA

           RITGSFFLSK LGIAREVKGQ QLPPTYKEGL PLVGNLIAFA K---GPLNVV

           AAGARPRDRR QVRLRRWAGY GRIPGPRFVI PLIGSIVEMV L---SPYGFW

           LLVLGLAIAE QVRLRRWAGY GRIPGPRFVI PLIGSIVEMV L---SPYGFW

           ITMICYFILE QLHYFWWKRS SKLPGPSFTL PFLGSIIEMV K---NPYQFW

           IALVCYLLFE QLRYFWWKRS SKLRGPSFTL PFLGSIIEMV K---NPYEFW

           TFSCLFLYV- FILPKWRNRH IPGPRPSLLL GNVSELSRQG G---TAPLVF

           LFWTVFKFLK YVYPYWRFRN IPGPPPKWPV GNIFELLRKP G---QEHRIL

           LHSYLFSYCF MFLAPRIALV SPEAAKHVMV KNVRNYVKPP M---VRQGLS

           VLVFTLYLVR QNTCTGNKAN LSYSPVCKGL PLLGSALEFG K---NPLKFL

           RQRRKQLRGD TFPVRDVSAE RLPPCIRGWC PYVRAGFQFA FGAAPPQSFL

           LFQVGSTVAK YALWFWKQKR AGRPVSRGLL EVVTALGPVP VPG-FGNALY

           RWSLEDLWCT PPSQSFNCIG QGRDHSMDHK SKQVIFVYII FDCNLPSLPG

 

           KRAYAQLGDV FTLRVGPKRF TFLVGPRAHE VFFRANDDEL DQG---PVYG

           QRGYQSCGDI FTFKVFHKHI TFLVGPKAHE IFFQGTDDEL DQN---EVYA

           ERQRRLNPCG LSWNALAGFF VLFVTDTDLS RYVLSENGPH AFE---MILH

           ERQRRLNPCG LSWNALAGFF VLFVTDTDLS RYVLSENGPH AFE---MILH

           EKQRLLDPQG VSANFLVGRI TLFVTDSALV RAILNNNSAR TFL---LALH

           EKQRLLDPQG VSANFSLGRI TLFVTDSALV RAILNNNSAK TFL---LALH

           ERFRKQYGDV FQIWSFYRQI VVISHPDDIK YIIVTKNFPK A-----EEFN

           LQYAKQYGPT FQLWYLNRRT IIVANPEDAK FVLATRNYPK S-----PIFC

           N-LLGNKGIL LAEGDDHARQ RRIILPAFHF DALVHLGPIF R--------A

           QECRKQYGDV FTVLLPGRRM TFIFAPTQEL RKIFFNGSPN LIS-----FT

           EAARKQHGDV FTVEMFGRRM TFLFSKNGIG QFFDSAPSKV SFIRAVEPFT

           LSRVGFFRAL YESCLERQVT VFWLSSTPLL IVSAPDLVEQ VLT----SRT

           PSPWPIVGNC IPLSSNLYQT LYQYVEQPIS LYFIASTPFV VVTDEAAVRK

 

           FSVPVFGRGV VYDAPLAIRL EQFRFVSTAL RAARLREYVP LMVAEAETYF

           FSVPIFGKGV VYDAPLEKRL QQLRIMSAAL RPARMYGYVD QMVLEAVQFF

           PNGWRILGRN NIAFKSGEEH KLLRQSFLRL FTPKALGVYV SIQERLIREH

           PNGWRILGRN NIAFKSGEEH KLLRQSFLRL FTPKALGMYV SIHERLIGEH

           PSARLILGKN NIAFMHGQEH KELRKSFLSL FTRKALGVYL TLQETSIKSH

           PSARLILGKN NIAFMHGPEH KELRKSFLAL FTRKALGVYL TLQETTIKSH

           LSLSPLAGRG LLTVGKSQHQ ERRRAISKHF NEDFLRQLHR HMRVELMILL

           RCFSPLG-HG LLTLSQEEHP VQRKAISQRF NEEFLQSLHH HLTAELEVFM

           QGQQVVQRWL NRPEEAIDVH LDMTQVTMNV IALAAFGYDP NTDSGQELYR

           AGVEPLTCRI FGISKKGFSM AHRSLLTTLR SELGAKHIPQ LAHRLINRYL

           DGIFGLAPTY FEKILHMLLV QLREELHLDK GSDRGLARHL TGFAEALRMA

           FEKPQYFGYR SRTVKTALEL HQRAELLREK IETADPDQQQ RVREDPSRAA

           VLGSGMYQKP KYFGYRSSTI RYSVEMNQKL ILTNEQMRQQ QADSSRKALK

 

           KRHWHG---- TDGTAD--LL KSLSELIILT ASRCLMGREV REQLFEEVST

           RKWGD----- -QGQVD--IL ESLSDLIILT ASRCLMGREV REQLFEKVSK

           LAAWLS---- GRGTDGTTAA FEVRPRIRDL NLRTSQTVFV GPYLRD-RER

           LAAWLS---- GRGTDGTAAA FEVRPRIRDL NLRTSQTVFV GPYLRD-RER

           LQKWIQ---- LSKEN---DE MEMSFLCRDL NLETSQYVFA GPYIGEQRDQ

           LQRWIE---- LCKEK---SP LEMSFFCRDL NLETSQYVFA GPYIGERRDE

           SKLQQV---- TERKESIDFD KEATSYTLDV MCRTGFGCTA NTQEDA-SHP

           AQMDAL---- CDTERVVDLD ALISALTLDV IARTAFGVSF TAQTSQ-HHP

           AYRDIF---- TQRPPSRMLA MLFSLLPSWL LQSMPLSRLL RRQQSN-VRL

           FTFRTV---- WGKEDEKEAS NLLTETLSDA SLRVIFGDEF ANASPS---L

           MRQQMASLDG QAVAHVDDLF DLCGRLIFTA SFQTVFGREC ANALNK-DGR

           LLDLIDR--- SLTEIASSTE KFLAELRAED NVGKDASKLI QRFYVQLNCK

           VMIDS----- -KVSDIIDGM IEAAEAVVHA VDGREQVENI RRKVIELNLN

 

           LYHQLDQGMQ PLSVFAPHFP CKAHW----- QRDRARREMR RLFASIIANR

           LYHDLDQGMQ PISVFAPYLP ISAHR----- KRDKAREEMV QLFRTVIQNR

           FCADYLLITQ GFLSFPLALP GTGLW----- RAIRARERVL RDLTACVRAS

           FCADYLLITQ GFLSFPLALP GTGLW----- RAIRARERVL RDLTACVRAS

           FCHWYITVTK AFISAPVFLP GTNLW----- KAYFARKKIV ALLENAVIQS

           FCSWYITVTK AFISAPVFFP GTNLW----- KAYFARKKIV ALLENAVIQS

           ISRAVNVSLR EMYHNLVAYP IRNCFGLYSS PALKNATGVI REFASQVIEA

           MPHAVLTLLD ELVNNMIFYP YRFWLSHITQ KRLNEAINVI RKFCNMVIDL

           VKKKVTEIVQ KRREEYEALL VKDSN----- AMG--KSTTN RDLLDMLVAA

           FKDFVDFDEW FELAATPLLP HFLLR----- PFVKSRRKLL DTISQNWKYT

           LYETFVAFDR EFELAALPIP QFLLR----- SFSRARRQLL RAMQRAVRLV

           VLFGLVVDNA EATRIASAIE KAGAEFSR-- RMILPQRALY AWFANVSYIY

           VLFGYKNDKD VGSLSHIIFE AGKEFILR-- -TVNPFRIGW RWMANFRFFQ

 

           RKRYQEIAAQ AESVGQDPQQ ALEAAKEVDV LQVFMDSQYR DGS-RLTDDQ

           RR-------- ---------- --RNVKEDDM LQTFMDASYR DGS-RPSEYE

           KERF----RK DAEAEPQCLL DFWTVSVLEE VAAAKRENRA PPK-YSADHE

           KERF----RK DAEAEPQCLL DFWTVSVLEE VAAAKRENRA PPK-YSADHE

           KK------YI GNGGTPRCLL DFWTQRVLEE MEEATQQDKE MPS-YSNNRK

           KR------YM ADGGSPRCLL DFWTQRVLEE VEEAAQQGR- SVS-YANNRK

           RRT------- ------ESEE D-KTRRPLDL LDIFLKMDN- -----LSDQN

           RLQ------- ------ESRE E-KSNRVRDL LDIFLESDE- -----TRD-N

           RD-------- ---------- --------PE LEKKSSHLP- ----YLTDEE

           KN-------- ---------- --------AP IHKLTEAYG- ------NDGN

           EP-------- ---------- -------ATP AGKLLAKLD- -----SDEKV

           HTS------- --------VL LIFGQKILRH LRISSNSWIN GWLGKAGRLR

           YVFS------ ---------L ITIGRRVCQH MDSQPATWVH GWVGKVGKIG

 

           ITGLLIAVLF AGQHTSTITG TWTGLLMLRK PELVTRVRAE QEQVLYDDDG

           VAGLLIALLF AGQHTSSITG SWTGMLLLRN KDVFERVKKE QDTIIEEHG-

           MADAMLDFLF ASQDASTASL VWTTVLVAER PDVLQRVREE QQRLRPHDE-

           MADAMLDFLF ASQDASTASL VWTTVLVAER PDVLQRVREE QQRLRPHDE-

           MAETLMDFLF ASQDASTASL TWTLALMSDY PDVLKKVQEE QKRLRPNNE-

           MAETMMDFLF ASQDASTASL TWTLALMADH PDILKRVQEE QKRLRPNNE-

           IIAEIATFLV AGHDTTSHTM SWLIYEVCQH PEIEQKIQQE VDTIWGDRQD

           VIAHVATFML AGHDSTSHTL SFCMYEIAQN RDIERKLQEE SD-RFIVAQD

           ITSQALTFMA AGQVTTAVLL SWTLFELSIH PSAQEKLRQE LQTMETTLST

           VPSLLLSALW ATWSNVSPTS FWTLTHILAD EKAKVKVLAE VEKSCPLLLS

           CASVLLTLLW ASSANTLLSA GWLIVLSAEW CNRARPVTES MA--------

           RLAKVLGLLM AATQTVPIAA SWALILVSVH EDVREKLACE ARRLLSEDLS

           KLGKVVGLIM ASSQTVPTTC LWLLFLLSKY PQVVEKIREE TSRVLHSTKK

 

           C--------- ---------- ---------- --FKIDYDAL LR-LDVMHRC

           ---------- ---------- ---------- --DELNYDVL SK-MNLLHLC

           ---------- ---------- ---------- ---PITYELL EQ-MVYTRAV

           ---------- ---------- ---------- ---PITYELL EQ-MVYTRAV

           ---------- ---------- ---------- ---PLSFELV ES-MTYTRQV

           ---------- ---------- ---------- ---PLSFELV EN-MTFTRQV

           ---------- ---------- ---------- --WMLSFEEI GQ-LEYLNKV

           ---------- ---------- ---------- --RIVPFDQV GH-LDYTRMV

           QD-------- ---------- ---------- --ITEMVQHL DK-LEYLDVV

           SK-------- ---------- ---------- --TELSLEWI FSNLPFTAYC

           ---------- ---------- ---------- ---------- WE-------L

           VNGGRGVAQQ GSQTSQTTPA QKANLTASDI ATDVNRLSKL LKKHSYFDAV

           QS-------- ---------- ---------- --MEEFTVDD LNELAYVDCV

 

           IKEALRMYPP LIFLMREVVI PRTYRDYV-- -IPKGDIVVV SPPLAMSLPE

           IKETLRMYPP LILLMRKVLK PKFYKEYV-- -IPENDIVMV SPAASGRLEN

           VLEVLRFRPP PVMVP-QVAS KRVQLPNG-- -YEVPRGALV VPSLWTACMQ

           VLEVLRFRPP PVMVP-QVAS KRVQLPNG-- -YEVPRGALV VPSLWTACMQ

           VKEILRYRPP AVMVP-QNAM GSVPLTEN-- -VTVPKGSFV MPSIWSSCMQ

           VKEILRYRPP A--------- ---------- ---------- ----------

           WKETLRKHPV AATGTLRRLD TDVTLPSCGM LLRKNTAILV PIYLVHRNPE

           WNEALRTHPA AANTSVRCAD RDDVLPGSGI PITKGTGLMV SSYLIHHLPQ

           LHESLRLHPP VLFITRQAVQ DDEILGFP-- -ISQGAIVNI PIVALHRDPE

           VSETLRLYAS VVDIR-KVVE NLEFREFI-- -IRKGDYLCI SPAVSHRETT

           VTETLRLASS GIAVR-IGCE PLYVDDFR-- -IPAGDYLCI SPWLAHQDEH

           IAETLRLFPP FPLIQRQAQC DTHLGDVF-- -VPGGTLVAA VPWLQHHHPA

           VKECLRLYPP FPLLQREPEM DDILENVK-- -IPARTPVYI VPWLLHHHPK

 

           VF-ADPDRYD PDRYA---PP REED--KRVP FSHIGFGGGR HACMGEQFAY

           VF-KNPNAWD PDRFG---PN REED--KKAP FSFIGFGGGR HGCMGEQFAY

           GF-PSPERFD PERML---PP RQEE--QKYR KHFLTFGCGP HMCVGRNYAI

           GF-PSPERFD PERML---PP RQEE--QKYR KHFLTFGCGP HMCVGRNYAI

           GF-PDAYKFD PDRMS---PE RQED--IKYR QNFLTFGIGP HVCVGREYAI

           ---------- ---------- ---------- ---------- ----------

           FW-PDPETFE PERFT---RE NTM---KRHP FAFQAFSNGP RNCIGQFFAT

           YW-ENPDHFI PERHT---KE AVR---QRSP YYFLPFSRGS RNCIGQFVAN

           QWGPDAESFR PERFLSSDKN NVVI--QRHA MAWLPFLYGT RACTGQRFAM

           LF-PQSEDFI PDRFQ--KQG THPN--AVFD KDLLTFGGGF YKCPGQSFAM

           RG---GKRFD PCRYQHIEDK KALF--RGRT RQLYTFGGGM YRCPGQEYAL

           YW-KQAESFN PERWISPTDN VARHGDAPSD YCYIPFGRGR RMCAGNPLAM

           YWK-QPEDFI PDRFMY---- NASHGDAPSD FVYIPFGRGN KMCAGYHLAL

 

           LQIKTIWSVI LRDYDLEPVG PL----PLPD YSAMVVGPKP PCLVRWRRRT

           LQIKTIWTVL VRSFDLEPIG DL----SQPD YNAMVVGPRP PCLLKYRKKK

           NHLMCYLAVL ATTVDWTRVR TV----HSDE IIYLPTIYPA DSVIRARWRV

           NHLMCYLAVL ATTVDWTRVR TV----HSDE IIYLPTIYPA DSVIRARWRV

           NNLIAFLALI STECKFQRYR TK----KSDD IIYLPTIYPG DCLMKFV---

           ---------- ---------- ---------- ---------- ----------

           HEALTTLSSL YHFFTFRLAC RA----EDVK PYHAMTMKPS VGKVSEDAKG

           HEALTILSTI YKRYEIRLAV GA----QEVE EYFRVTMKPH CRFYVQGKKD

           LEAKTILFEL LTKVSVRLQP GC----EVKG YGMVSVPRDV RLQVVDLHKE

           VEIVLLIALV FYLYDIQLVD RV----PKMK ESQSVGIKKP SCSCRIHYLW

           HELVLFIQIF FEFVERVELG VAGDPLTSMD AYRLVGIKRP TRPFPGKLFL

           VELKCLLLLV ALQEPDLLFD LEP---QETS PEPQAGMRFP PLTMRPPRFA

           LELKILTIYV CQYYDWKCSF PQ----GKEP VSKKYPIETI THSSCNRFFF

 

           P--------- ---------- --------

           DSFLDRVSLY A--------- --------

           ADASTGAGTG AGTDTACTSE AVPVVVAS

           ADASTGAGTG AGTDTACTSE AVPVVVAS

           ---------- ---------- --------

           ---------- ---------- --------

           --VSEYVKLP VWVTPRNTMA HLREE---

           PSLDAHLGLP VKIYSRKCYS --------

           ---------- ---------- --------

           KRRLAGMEEI ---------- --------

           ISSCCDLRCG DPHPSERW-- --------

           VRQEEPIHRK LCQQNSNEL- --------

           IMQLLSIGNV S--------- --------
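
The interleaved layout above (a header line, a first block with names, then unnamed blocks in the same sequence order) can be read back into one string per sequence with a short parser. The sketch below is a minimal example under stated assumptions: the name field occupies the first 10 characters of each row in the first block, every later block lists the sequences in the same order, and blank or whitespace-only lines carry no information. The function name parse_interleaved_phylip and the toy input are hypothetical and are not part of the original data.

```python
def parse_interleaved_phylip(text: str) -> dict[str, str]:
    """Parse interleaved PHYLIP text into {name: aligned_sequence}.

    Assumes strict 10-character name fields in the first block and the
    same sequence order in every later block.
    """
    # Drop blank or whitespace-only lines so double-spaced text still parses
    lines = [ln for ln in text.splitlines() if ln.strip()]
    ntax, nchar = (int(tok) for tok in lines[0].split()[:2])

    names: list[str] = []
    chunks: list[list[str]] = []
    for i, line in enumerate(lines[1:]):
        if i < ntax:
            # First block: columns 1-10 hold the name, the rest is sequence
            names.append(line[:10].strip())
            chunks.append([])
            residues = line[10:]
        else:
            # Later blocks: sequence only, cycling through the same order
            residues = line
        chunks[i % ntax].append("".join(residues.split()))

    seqs = {name: "".join(parts) for name, parts in zip(names, chunks)}

    # Report rows whose total length disagrees with the header
    bad = {n: len(s) for n, s in seqs.items() if len(s) != nchar}
    if bad:
        raise ValueError(f"lengths differ from header ({nchar}): {bad}")
    return seqs


# Toy two-sequence input in the same double-spaced interleaved layout
toy = "\n".join([
    "2 12",
    "seqA       ACDEFGHIKL",
    "",
    "seqB       ACDE-GHIKL",
    " ",
    "           MN",
    "",
    "           MN",
])

alignment = parse_interleaved_phylip(toy)
print(alignment)
```

Because the parser raises an error when a sequence length disagrees with the header, it also serves as a quick consistency check on hand-edited alignments.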