Micromonas P450 sequences (Photosynthetic marine picoeukaryotes)

 

Micromonas sp. RCC299 (v3.0 at JGI)

 

Micromonas pusilla CCMP1545 (v2.0 at JGI)

 

Closely related to but different from Micromonas sp. CCMP490 (some ESTs available)

 

D. Nelson, April 13, 2009

 

 

>CYP51G1 Micromonas sp. RCC299

EuGene.1200010398|MicpuN3

81% to CYP51G1 Ostreococcus lucimarinus, ortholog

81% to CYP51G1 Ostreococcus tauri, ortholog

MSVELIADAIGVPVWALVPVALAAAAVLVFILDGLSHSVLKPGKSPPVIGTTPVFGGMLEFLKGPIGLMARAYPKYGEAF

TVPVFHKRITFLIGPKVSEHFFKARDQEMSQKEVYEFNVPTFGKGVVFDVDHITRAEQFRFFANSLKSDRLRQYVGMMVK

EAEDYFAKWGDEGEVDLLDALSELIVLTASRCLLGREIRETLFSEVTTLVHDLDKGMVPLSVFFPYAPIEAHRKRDKARK

ELAAIFDKVIQGRRESGAVEPDVLQTFIDARYKDGSRLSNDQVLGMLIAVLFAGQHTSSITSTWTGLLSIANKERIFPNL

EKEQKDVMAKHGDKIDFDILAEMDELHFCIKEALRMHPPLIMLLRQCHVPFEVETTKGKKFVVPKGHIVATSPAFAHRMD

EVYSEPNVYKPERFKGESPEDKRAYASFIGFGSGRHGCMGETFAYMQIKTIWSHLLRNFDFELVGKLPEPDYEGMVVGPK

HPCTVRYKRRKL*

 

>CYP51G1 Micromonas pusilla CCMP1545

estExt_fgenesh1_pg.C_100222|MicpuC2

83% to CYP51G1 EuGene.1200010398|MicpuN3

MDVLAQVTEFVGVPAWGLLPVALVAAVVLFVVFDALSHVVLKPFKSPPVIGTWPVIGGMIQFLKGPIGLMNNAFPKYGEV

FTVPVFHKRITFLIGPHVSEHFFKARDADMSQKEVYEFNVPTFGKGVVFDVDHLTRAEQFRFFADSLKSDRLRSYVGMMV

KEAEDYFGKWGESGEVDLLDALSELIVLTASRCLLGREIRETLFSEVTTLVHDLDKGMVPLSVFFPYAPIEAHRKRDAAR

RDLAAIFDRVIQARREANAHEPDVLQTFIDARYRDGSRLTNDQVLGMLIAVLFAGQHTSSITSTWTGLLTIANKERVMPT

LEGEQKKVMEKHGGKIDFDVLAEMDELHYAIKEALRMHPPLIMLLRQCHKPFKVTTSKGKEFVVPKGHIVATSPAFSHRL

NNVFSDADTYKPSRFRQPSPEDKEKFASFIGFGGGRHGCMGETFAFMQIKTIWSILLRNFEFELVGKLPEPDYEGMVVGP

KHPCTIRYKRRKL*

 

>CYP97A17 Micromonas pusilla CCMP1545

estExt_Genewise1.C_50257|MicpuC2

83% to CYP97A18 EuGene.0600010504|MicpuN3 probable ortholog

MPVAAGDIREIAGQPVFVPLYKLFLAYGEMFVLAIGPKKFVVVSDNDVAREMLKDQATSFSKGLLSEILEFVMGTGLIPA

DGETWKVRRRTVVPSLHKKYVASMVDMFGDCGLNGSAQLARSEMNGDTVEMENFYSRLALDIIGKAVFNYDFNSLKMDDP

VIKAVYTVLREAEYRSVTFIPYWKVPPLRWLVPRQKACQEALVVVNDTLNMLIARTKKLVEEEDEEFVEEYLNKADPSIL

HFLIASGDDVTSKQLRDDLMTLLIAGHETTAAVLTWTTYLLATHPEIKARVQAEVDEVCGDRNPTIADMMDLKFTTRVIN

ESMRLYPQPPVLIRRALEPVTLDGYKIDAGTDFFISVWNLHRNPRLWENPDKFDPDRFPIDQKMPNEITENFAYLPFGGG

QRKCVGDQFALFESIITLAMVCRRFDFELDAKFHPDGECGMTTGATIHTTGGLHVKLKRRDGAGGNEMLGSIDCGDGVKC

ASLGELSDVNTGDGTPESGAASFDEAERAQARDLKEAAVVLGAKTAGASELKSGSGGAKGEKPSIDGAALEEAVKEAEAL

YEAEAEREAAMKKELEQSL*

 

>CYP97A17 Micromonas pusilla CCMP1545

EuGene.0000050221|MicpuC2

77% to CYP97A18, probable ortholog

same sequence as CYP97A17 estExt_Genewise1.C_50257|MicpuC2 with N-term extension

MLSLAASATTLVGANAGASAAAASTRRGGASLSARDATATTKPLRFVARGASSDRGRALTTRVNAKSLEERIASGEFTQK

RSTPAESVLNGIRDVIKDVPQSRGLSYQLAKLSRKWRKESMSK

MPVAAGDIREIAGQPVFVPLYKLFLAYGEMFVLAIGP

KKFVVVSDNDVAREMLKDQATSFSKGLLSEILEFVMGTGLIPADGETWKVRRRTVVPSLHKKYVASMVDMFGDCGLNGSA

QLARSEMNGDTVEMENFYSRLALDIIGKAVFNYDFNSLKMDDPVIKAVYTVLREAEYRSVTFIPYWKVPPLRWLVPRQKA

CQEALVVVNDTLNMLIARTKKLVEEEDEEFVEEYLNKADPSILHFLIASGDDVTSKQLRDDLMTLLIAGHETTAAVLTWT

TYLLATHPEIKARVQAEVDEVCGDRNPTIADMMDLKFTTRVINESMRLYPQPPVLIRRALEPVTLDGYKIDAGTDFFISV

WNLHRNPRLWENPDKFDPDRFPIDQKMPNEITENFAYLPFGGGQRKCVGDQFALFESIITLAMVCRRFDFELDAKFHPDG

ECGMTTGATIHTTGGLHVKLKRRDGAGGNEMLGSIDCGDGVKCASLGELSDVNTGDGTPESGAASFDEAERAQARDLKEA

AVVLGAKTAGASELKSGSGGAKGEKPSIDGAALEEAVKEAEALYEAEAEREAAMKKELEQSL*
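
The note on this entry (same sequence as the shorter gene model, plus an N-terminal extension) can be checked mechanically. The following is a minimal Python sketch, assuming both models are held as plain one-letter strings. The function name n_term_extension and the truncated toy sequences are hypothetical, chosen only for illustration; they are not the full records.

```python
def n_term_extension(short_model: str, long_model: str):
    """Return the residues that long_model adds in front of short_model,
    or None if long_model does not end with short_model."""
    # Drop the trailing stop marker used in this listing before comparing.
    short_model = short_model.rstrip("*")
    long_model = long_model.rstrip("*")
    if not long_model.endswith(short_model):
        return None
    return long_model[: len(long_model) - len(short_model)]


# Truncated excerpts for illustration only (assumption: not full sequences).
short = "MPVAAGDIREIAGQ*"
long_ = "KWRKESMSK" + short
print(n_term_extension(short, long_))
```

A return value of None would indicate that the two models differ somewhere other than at the N-terminus.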

 

>CYP97A18 Micromonas sp. RCC299

EuGene.0600010504|MicpuN3

83% to CYP97A15 Ostreococcus tauri estExt_fgenesh1_pm.C_Chr_13.00010043

80% to CYP97A15 Ostreococcus lucimarinus estExt_GenewiseEukaryote.C_Chr_130084

probable ortholog

MQSSLATGRAPTLGRARVAPSASSRSNLGAGRAMCGFGSESLRGSGRAAGVTLPRRRAATPCRAKTLEERIASGEFTKPR

SSPAEDLLNGFRGLIRNVDNPQARGLSVSLAKLSRKWRNESMSRMPVAAGDIREIAGQPVFVPLYKLFLAYGEMFILAIG

PKKFVVVSDNEVAKEMLLTQANSFSKGLLSEILDFVMGTGLIPANGETWKIRRRTVVPSLHKKYVASMVDMFGDCGVHGS

AQLAKSEREGKTVEMENFYSRLALDIIGKAVFNYDFDSLKKDDPVIKAVYTVLREAEYRSVTFIPYWKVPPLRWLVPRQK

ACQEALVVVNDTLNMLIERTKKIVEDSDEEFVEEYLSGDDPSILNFLIASGDDVTSKQLRDDLMTLLIAGHETTAAVLTW

TTYLLATHPEQMRKVQEEVDRVVGDRRPTIQDMMELKYTTRVINESMRLYPQPPVLIRRALEPVTLDGYKIETGTDFFIS

VWNLHRNPRLWPEPDKFIPERFPLDQKMPNEVTENFAYLPFGGGQRKCVGDQFALFESIITLAMVCRRFDIDLDPAFHPD

GECGMTTGATIHTTGGLHVKLTRRAGMGGDEMGYDETTRSLDELEDVNTRGGTPEAAVGSFDEREIEEARDVKEAAVVVG

AKTAGATEVRSGSGGAKGETPAIDGERFDQAVAEAEDILAKEEAARVEAQKAR*

 

>CYP97C18 Micromonas sp. RCC299

estExt_Genewise2Plus.C_Chr_140254|MicpuN3

84% to CYP97C18 Ostreococcus tauri

82% to CYP97C18 eugene.0900010237 Ostreococcus lucimarinus

probable ortholog

MAAPDWLTQLNRLWGGNSEIPVADAKLEDITGLLGGGLFQPLFKWMKEAGPVYLLPTGPVTSYVVVSDPDCIKQILFNYG

SKYIKGTIAEAGEFLFGLGVALQENEAWKIRRKAVAPSLHRRYVEAMVDRCFGPCAERMVELVESAIDSEGGEKKRLNME

SKFSQAALDIIGISVFNYDFKALTSAAPVIQATYTALKEVETRSMDLLPTWRLPEPFLRVVSPRQKAAQDAVKIIQEVTT

KLVDDCKRMVEEEEKVGGAEEWARDYLNDSNPSVLRYLIAAREEVSSTQLRDDLLSLLVAGHETTASVLTWGTYELLKPE

NAEQLRLLRAELDEVLGDKPFPTYEDMTKMPYLERCFHESMRLYPQPPVYTRRAVVEDVLPKGLGVVPAGQDLLVSIYNL

HRSPENWGPNSQVFEPMRFGPLALGQPNELNTGYRYTPFSAGPRRCPGDKFAVLEGMAIWAVMFRRLDLTLVAGHDVVMT

SGATIHTRDGMLVNARRRRGEKGQKVDWAQLKPEKDIGAGWFEKGIEAKGSSGGGKCPVAH*

 

>CYP97C18 Micromonas pusilla CCMP1545

estExt_Genewise1Plus.C_30232|MicpuC2

88% to CYP97C18 Micromonas sp. RCC299, probable ortholog

MAAPDWMTQLNRLWGGKSEIPVADAKLDDITGLLGGGLFQPLFKWMKEAGPVYLLPTGPITSYVVVSDPDCIKQVLFNYG

SKYIKGTIAEAGAFLFGLGVALQENEAWKIRRKAVAPSLHRRYVEAMVDRCFGPCADRMVSLVEDQINADGGRRERVNME

SKFSQAALDIIGISVFNYDFKALTSAAPVIQATYTALKEVETRSMDLLPTWRLPEQFLRIVSPRQKAAQDAVTVIQEVTT

KLVDDCKKMVEEEEAVGGAEAWARDYLNDANPSVLRYLIAAREEVSSTQLRDDLLSLLVAGHETTASVLTWGTFELLKPE

NAEQLRLLRAELDEVLGDKPFPDYADMLKLPYLERCFHESMRLYPQPPVYTRRAVVEDVLPHGLGTIPAGQDLLVSIYNL

HRSPANWGPASQAFEPMRFGPLSAGQPNELNTGYRYTPFSAGPRRCPGDKFAVLEGMAIWAVLFRRLDMELVAGHDVVMT

SGATIHTRDGMLVNATRRETRRKGGGDAVDWANLRPAKDIGEGWWERGIETKGGSGADAKSKCPMPFVK*

 

>CYP97E5 Micromonas sp. RCC299

estExt_Genewise2Plus.C_Chr_160324|MicpuN3

80% to CYP97E5 Ostreococcus tauri estExt_gwp_GeneWisePlus.C_Chr_01.00010469

82% to CYP97E5 Ostreococcus lucimarinus eugene.0100010571

probable ortholog

MPQPPSTEEDIDDDFVPSQLGIKDIISLWITQILQTYGDEESKDGAPVCEGSVDDLVGGPIFLALYPYFLRYGGVFKLAF

GPKVFMVLSDPVVVREVLKEKPFAFSKGVLAEILEPIMGQGLIPAPYAVWKNRRRQLVPGFHKAWLDHMVGLFGDCSTQL

VKNLDAEIAKGNGSAIVDMEERFCSVSLDIIGLAVFNYDFGSTTRESPIIKAVYTCLQEAAHRSTFYFPYWNLPLADVLV

PRQREFKNNMNLINETLNGLIKKAQAFEGTEDLEELQNRDYSKVKDPSLLRFLVDIRGADVTDSQLRDDLMTMLIAGHET

TAAVLTWCLYCLAQDRELMARVVAEIDDVMGPADGETPTAPNYEQIQKMELVRLCLAEALRLYPEPPILIRRCLEDVPLP

KGAGDANVTLIKGMDVFISVWNLHRHPDCWEEPLKFDPTRFKRPFQNPGVKDWAGYNPDLISGLYPNEVTSDFAFIPFGA

GARKCIGDQFAMLEATSCLAMTLRRYDFEMTKDASEVGMEMGATIHTAGGLPMKVTRR*

 

>CYP97E5 Micromonas pusilla CCMP1545

e_gw1.13.59.1|MicpuC2

86% to CYP97E5 ortholog

cyan probable intron seq

MERPSQTYGDEESKDGAPVCEGSVDDLVGGPIFLALYPYFLKYGGVFKLAFGPKVFMVLSDPVIVRRVLKEKPFAFSKGV

LAEILEPIMGQGLIPAPYAVWKNRRRQLVPGFHKAWLDHMVGLFGDCSAQLVKNLGASHLTLTDASIAAGNGVARIDMEE

RFCSVSLDIIGLAVFNYDFGSTTRESPIIKAVYTCLQEAAHRSTFYFPYWNIPFMCDIVPRQREFKANMKLINDTLNGLI

TQAQQFEGTEDLEELQNRDYSKVKDPSLLRFLVDIRGADVTDLQLRDDLMTMLIAGHETTAAVLTWCLFCLVRDKPLMKK

VVEEIDSVMGPVAEEARAPNYEEIQKLELVRLCLAEALRLYPEPPILIRRCLEDVPLPKGAGDADVTLIKGMDVFISVWN

LHRHPDCWEEPLKFDPFRFKKPYSNPGVKDWAGYNPDLISGMYPNEVTSDFAFVPFGAGARKCIGDQARSCLHWSPYDRF

AMLEATSCLAMTLQRYDFELDKDAAEVGMEMGATIHTAGGLPMRVTRRK*

 

>CYP97E6 Micromonas sp. RCC299

e_gw2.16.55.1|MicpuN3

77% to CYP97E6 Ostreococcus tauri estExt_fgenesh1_pm.C_Chr_15.00010014

73% to CYP97E6 fgenesh1_pg.C_Chr_14000053 Ostreococcus lucimarinus

probable ortholog

MPAPTRFTRAPRRAHPGSRPAERGARVARRAVEEPKTRGGDSAPGRIYRGALFPGVEVPEAGADVWRGISRVFPWGNGAP

VTEGVLGDLLKEEVRAAPLFVPLYDYYRQYGGVYNLGAGPKWFVVVSDPVVVRHMFKDNADAFSKGILTDIMEPIMGDGL

IPAPKEIWAKRRPTVGAGFHGAWLKHMTNLFGASATNLADKLEREWCDKDVAVNLEDELYAMALDVIGKAVFNYEFGALR

EETPLIKAVYRVLRESEHRSTFPLQYWNIPGAMDVVPRQKQFKEDIAAINAELSKLIADALADRNETDLAEMESRDYANV

EDASLLRFLVDVRGETVSSTQLRDDLMTMLIAGHETTAAVLTWTMYLLATHPEEAELARAEVDAIVADPSGVPTVEEIRK

LERTRLCLAEGMRMYPAPPILIRRALEDVTLPAGGMGREITLKKGTDCFVAVWNLHRSPDLWDRPDVFDPARFKREFKNP

KIEGWNGLSPELVTGLYPNEQSTDFAYVPFGGGQRRCAGDMFAMMEATVALSVLLKRFEFELGCDESEVEMITGATIHTK

AGMPVKLRSRSAK*

 

>CYP97E6 Micromonas pusilla CCMP1545

EuGene.0000090537|MicpuC2

79% to CYP97E6, ortholog

MSGRRRRRRRSASTTTTTRAAIEEPKPKALVPPYRGALFPGVEIPEAGQDAFAAISKALFPWGNGAPITEGVLGDLLKEE

VRAAPLFVPLYDYYRKYGGVYNLGAGPKWFVVVSDPVAVRTMFKDDADSFSKGILTDIMEPIMGDGLIPAPKEVWAKRRP

VVGAGFHGAWLKHMVSLFGDSANNLAAKLAPDAAGGKTVEIESKLYAMALDVIGKAVFNYEFNSLAEETPLIKAVYRVLR

ESEHRSTFPLQYWNIPGAMELVPRQKRFKEDIEMINDELSVLIAAALKSRNETDLAEMEARDYANVDDASLLRFLVDVRG

EEATGTQLRDDLMTMLIAGHETTAAVLTWTTYLLATHPEEARKIQEEIDAVVSDPGGAPTVEEIRAMEKTRLALAESMRL

YPAPPILIRRALRDVTLPRGGMGKAITLKKGTDCFVAVWNLHRSPDLWENPEKFDPSRFKRPFQNPAVEGWRGLQPELAT

GLYPNETSTDFAYVPFGGGQRRCAGDMFAMMEATVALSVLMKRFDVELACEKEDVEMITGATIHTKAGMPVKLTPRR*

 

>CYP745B1 Micromonas sp. RCC299

e_gw2.06.182.1|MicpuN3

65% to CYP745B1 Ostreococcus tauri. 62% to CYP745B1 Ostreococcus lucimarinus

probable ortholog

MIALYLILKVVVGAVASVLGFVRKELVLARAPMAPGFVPFIGHTITLFKAVGSYPCTWDLFAQWSTATAPKPVRVQIFTQ

HCVVIADPSTMKRVMSSNLKNYSKDLEFSYAPFLEILGTGLVTSGGDTWRKMRGHISKALRVEILDEIIAIATRAVDRLC

VKLDAIKGTGESIDMENEFRLLTLQVIGEAILSLSPEESDELFPSLYLPIMDECNARSLSPWRTYLPTPEWFAHRKRVRQ

LDAAIIKIVRDRWDKKRSGQNVPDDILERVLDQELNEKRSEVERQLCYEIKTFLLAGHETSAAMLTWTMHELSNNGEATE

KIRKESDKVFGRLKKDALPTRESLDTLDYSLAALKETLRLYSVVPVVNRVAEEDDDLGGVRIPKGTTVIMSLQGVHHRED

LWPDPLVYKPERFTDPGFNENEGFKFLPFIQGPRNCLGQYLALLEARVVLCTLIRKYKWTKASAENGKKHTKAIPIAPAN

GMHFTVA*

 

>CYP745B1 Micromonas pusilla CCMP1545

EuGene.0000050037|MicpuC2

73% to CYP745B1 e_gw2.06.182.1|MicpuN3, ortholog

MAVAPPSEPVLIATSVALAVLTVAFYVSRAVAGFLYGVARWHRQAFVLRHTPTAPGYVPLIGHTIALFRAVGNYPCTWDL

FAMWATATAPKPARVQIFDRHCVVIADPSTMKRVMATNLKNYQKDLEFSYAPFLEILGTGLVTSGGETWRKMRGHISKAL

RVEILDDIIAIATRAVERLCVKLDAAKASAAAVDMEQEFRLLTLQVIGEAILSLSPEDSDELFPSLYLPIMDECNARSLS

PWRAWIPTREWFAHKARSIHWFPYDRVGVVNASRVRELDDAIISIVRARWRKKQAGEDVPDDVLERVLEQVREDEYGADV

ETQLCFEIKTFLLAGHETSAAMLTWTLHELSKAPDMMREVKRESDRVFGRTRKGSLPTRDQLASMEYTLAAFKETLRLYS

VVPVVTRVAVEDDELGGTRVPAGTTVIMSLQGVHHRADLWPEPLKYDPARFVKADENDEMRDFKFLPFIQGPRNCLGQYL

ALLEARVVLGTLVRRYAFAPSKAQGKKHTKAIPIAPANGMHFTVS*

 

>CYP746A1 Micromonas sp. RCC299

estExt_Genewise2Plus.C_Chr_040155|MicpuN3

56% to CYP746A1 Ostreococcus tauri, probable ortholog

56% to CYP746A1 eugene.0300010366 Ostreococcus lucimarinus

52% to CYP746A1 Volvox

51% to CYP746A1 Chlamydomonas

48% to CYP746B1 moss

MQTAALRGVADVPSQLAKQLQRAAAPPPAGFPPGPPDDVAVALGADPLAFIVDTQRRHGDVVGLSLAGERVVLVSDPKVA

ADVMIDRAGLFVKEGTAFFPGSSLAGEGLLVSDGESWARQRRLSNPAFRAAAVDAYATAMARAGAELLRNEWGARSVRDC

YEDFNDLTLRIVAEALFGADVRGKRAREINGAIKEAFEFFGRRSATGMIVPEWVPIPDNFAYNAAVTRLDKAVYSLIAER

RQKRVNGLESVENPDLLDRLLDARDDGEGGDGGGMDDVSLRDELMTLMVAGQETSAILLSWCCVNVASNPRCAAKLSEEA

AAVIGPDPLNDLPNASHYSRLKYAEAAVLETMRMQPPAYMVGRCCAEDVFIAGGKYALPKGTTVLIAPYLLHNDARYWDA

PGEFRPERWLEPGGGMGPGGVYVPFGAGPRVCIGTGFAMMESALLLAMVARAVDVRLRPGADPPRPRALITLRPENVRLD

VVPKRRWW*

 

>CYP746A1 Micromonas pusilla CCMP1545

EuGene.0000120135|MicpuC2

69% to CYP746A1 estExt_Genewise2Plus.C_Chr_040155|MicpuN3, ortholog

yellow regions are questionable

MSAATAPASRGFAPSRALRCRGRGEETAAVARSRRVAARDLVASSRRFERPRATASDDASTSSSSSSSSSSSSTSTAPPA

PKPTPEDIASVAAFFLETTAAATASVPLVGDVARAFARAFEPTPADFPPGPPGDAAVELAADPIAFIVKTQREHGDVVGL

RLAGEHVVLVSDPDVVRDVAIDRADLFVKEGTAFFPGSSLAGEGLLVSDGEKWARQRRLSNPAFRAAAVDAYATAMTRAG

ERLLRGPWSARTIRDAYADFNDLTLEIVADALFGADVRGARAKEINAAIAEAFEFFGKRSSSGFIVPEWAPTPENAGYAA

AVARLDDAVYALIAERR

AKREKPRYERLELLDEDREDEFVEEEEEEEEEEVPGGSSSTTPSRGP

DLLDRLLDARDDGDGG

DGGGMTDASLRDELMTLMVAGQETSAILLSWCCALLAQHPEVAERCAAEARAVLGDGSGGGDGDDEIEIASPTAENVGEM

KYVEAVVLETMRLYPPAYMVGRCAARDVVVGGHSLRKGTTILIAPYLLHRDARRWDDPDAFDPERWL

RPGGAPKKSESESESESESATDDRPMTDAPP

MAKGALRGTGPGGVYLPFGAGPRVCIGTGFAMMEATILVAMIASTVDLKTLPGRAPPRPRAL

ITLRPDNVELDVIPKRRGARARTGVGGVARSE*

 

>CYP746A1 gw1.12.369.1|MicpuC2

66% to CYP746A1 estExt_Genewise2Plus.C_Chr_040155|MicpuN3

same seq as CYP746A1 EuGene.0000120135|MicpuC2 with some deleted

SPTAENVGEMKYVEAVVLETMRLYPPAYMVGRCAARDVVVGGHSLRKGTTILIAPYLLHRDARRWDDPDAFDPER

 

GTGPGGVYLPFGAGPRVCIGTGFAMME

 

>CYP747A1 Chlorella sp. NC64A EST

59% to CYP747A1 Chlamydomonas, probable ortholog, N-term

GAAQQQEEGDGAMQTPGPSPFSXQSLMDVSLIFTEGLHEAMLRFSARYGPVSRFANPSAL

NGAAGWVFLNSPGDIAHVCAANPKNYAERFLPDIYKFVTHEKGLLGSGGDYNRRHRRLCG

PPFRSGELLRKFGEVVVDRVSRVADTFVSQPGPFATNVAVQTQRLTLDVVGLTAFSHDFG

ECQAIARDVGGRAQPSESDQDRLLWAVNAFGQVLGEVFITPMPLLRLMDAVGLRQLRTLR

EAVGVMRSSMLGVIAERRQHL

 

>CYP800A2 Micromonas sp. RCC299

EuGene.0400010034|MicpuN3

59% to EST from Micromonas sp. CCMP490 EC847127.1

61% to gwEuk.7.488.1 Ostreococcus lucimarinus CP000587.1 60-62kb range

61% to CYP800A1 fgenesh1_pg.C_Chr_07.0001000025 Ostreococcus tauri

40% to CYP747A1 in three pieces

47% to EuGene.0300010745|MicpuN3

cyan possible intron

MMLATPRLSAATSAGPSSRRVSGSNRLVRRFTTRRAAVVRRVAADRENESSDAVTSRDLVDLSCGAPPCDDVVHEVPADV

PTPVEAGKSGDCPYTAAKDAVSSLLPGQRMRNKMPEGGPVMLPSNAFFKSLLKAGSSPVGMPEAMLEWVNMTGHETVGIK

NAVGPMCVSTVDPDIVEYVCHTNAKNYRLRMLPDAFRYVIKNKGITGSDGAYNREHRLMCQKPFMNSFSLDQFSRTVEER

VGLLCDAWAKAAARTDEGNLAIDIDDHSQRLTLDIVTSLAFNMDFKQVEGIDAQLNGGEEEWRKRSSGGDLDYDIIDKVL

HAYNNTSEIMGELFITPVPILKLQNLLGIGRVRELREGYAVLED

VGINHIIGRRRKELEENKQRGDNE

DYCLLDVLLKAT

DADGNPLPKEDVWGDVNDIMAAGHRTTASNLTVNLHHLARLPEVQRKVEQEVAALGGRAPTFRDVQEGRLQYTQRVVKES

LRKYAPINLFPRVVEGDDTLPTGHEVKQGDFILLSSWAMGRNPRVWDKPDEFNPDRFTEESLRQNAERLAKESCGPDATE

EELRLQMERMSRRILAGRDFTYTPFGSGPRSCIGGTFALLATTVALATVVQRFKWSTVATKDPGFEIPFLYDTTITFPKG

VHVTATPRAVPLGAERGSESQAGVGEESMAPAR*

 

>CYP800A2 Micromonas pusilla CCMP1545

est_orfs.12_7471_4272609:1|MicpuC2

75% to EuGene.0400010034|MicpuN3, probable ortholog

MERPVEIDVDDHSQRLTLDIVTSLAFNKDFKQVEGIDAQLNGGAGPDDSEISAMLDAYNNTSEIMGQLFITPVPLLKLQN

VLGIGRVRELKEGYAVLEKGIDNIINERRTQLEINSRVGDDEDYCLLDVLLRAKDSEGNPLPQEDIWGDVNDIMAAGHRT

TASNLTVNLHHVARMPEVQAAIEREVATLRGRAPTFKDVQEGRLQYTQRVVKESLRKYAPINLFPRIVESDDVLPSGHAV

EAGDFILLSSWAMGRNPRVWARPNAFDPERFTEENLRVNAERLARESAGPDADEETLRIQMERMSRRIAGGRDFTYTPFG

SGPRSCIGGTFALLATTVSLASVVQRFSFEKSSDPRKDHGFEIPFVYDTTITFPKGVHVCAVPRKVRLGADAEGGVGVGV

EEGSASEAAARDFARQAPAPAR*

 

>CYP800A2 EST from Micromonas sp. CCMP490 EC847127.1

N-term seq.

59% to EuGene.0400010034|MicpuN3

DTVGVSARGASSRPRVNGVSMPPRSRHTRVRRLAIASAPSDAGDGVGAYAVADTSEQRAR

GAAVCDEEFDTTTTTAVNLETSKESTDCPYTSLNASLGGILPTKSKVPIGGPTMLDNSVF

FKSLLKAGSSPVGMPEAMLEWCDLTGQETVGIKNLVGPLCVSTIDPDIVEYVCHTNAKNY

KLRMLPDAFRYVIKNKGITGSDGRYNREHRRMC

 

>CYP800B1 Micromonas sp. RCC299

EuGene.0300010745|MicpuN3

45% to CYP800A1 fgenesh1_pg.C_Chr_07.0001000025 Ostreococcus tauri

46% to CYP800A1 gwEuk.7.488.1 Ostreococcus lucimarinus CP000587.1 60-62kb range

45% to CYP747A1 Chlamydomonas from I-helix to PERF

42% to CYP747A1 from heme signature to end

47% to Micromonas CP001325.1, EuGene.0400010034|MicpuN3

63% to Ostreococcus lucimarinus CCE9901 XM_001418784.1 I-helix to PERF

MSAIHSAVTGVVAGAVPAANARASARRAPCPRALAKSSSSSPLIIDGVSLRAGRSRARLARGARTRALSPGKEGDKTDAG

SGDSQAASGGECPVGFGKNGRFSAVGEAINGVKNAAYLSFMSSSTPLTKEEAAASAAAANPDDVVILENKIFLNALRQAI

ADPVGMPAAMLEWHRLVGHDTVGIENPIGPGCVSTIDPRTVEYICKTNAANYRDRLLPDIFRLVLKDLGVTGSQGEYNRQ

HRKVCQKPFINNNFLKTFSTVVEKGVGHLIESFDMASAKAGPGVGIVEDVDLHSQHLLLDIISPISFDYNFNLLDKSRTV

ITGEGKFEANPMLEAYHRSAEIMGQVFITPLPLLKLGAKFGVGRLQELIDAYDSLERMGDDIVKQRREMHKARRERGEEI

EQVCLLDTMLYLEDDQGNLAYTDEELWGDMNDIMAAGHQTQAATMTMSLLYVSRNPDVKAKIEEELASLGGRAPTFEDVW

DGRLTYTQSVVKETLRLHPPIHMFPRIATDKDVMPSGHKVEPGDLILLSTWAMGRNPKVWEEPTKFDPERFTDERLERLA

REQNPGADADEIERAVTMLKSGRDFIYTPFGAGPRSCIGGLFSLLTVTTIIASCIQRYDFEPDDDFLPADGEIPLRYDVT

MCFPKGLKMKLTKRDVEPGSRVQTPPAVASAR*

 

>CYP800B1 Micromonas pusilla CCMP1545

est_orfs.17_5534_4271487:1|MicpuC2

70% to EuGene.0300010745|MicpuN3, probable ortholog

MTGVAVDVSNRAQGEKRSAERDRNPSTRADWLESRSMLTVDFTVSPASRARASVTPGPRRALPHSPPRAPSKTMRACAAT

PAASAAASPRRRASSSSSSSSSSPSSLRSHRKTSVVGRRSTTTRAIDRARLRATSTDDTTASSSSSSSSSSGDEKSDNSG

GECPLGFGKNGRITNAVNGVKNAAYLAFMSSSTPLTMEEARASAAAADPNDITILENKVFLNSLKEAISDPVGMPAAMLD

WHKYYDRDTVGIVNPLGPGCVSTIDPRTVEYVCKTNAANYRDRLLPDVFRVVLKDLGVTGSQGEYNRQHRKVCQKPFINA

NFLKTFSSVVESGVGHLCDSWDMAAAMPGNEKGFTQDVDLHSQHLLLDIISPISFDYSFDLLGKSRNVITGVGTFKPNKM

LEAYHRSAEIMGEVILVPLPLLKLGERFGIGRLRELIEAYDSLEVMGEEIIKRRRDAHKSAEAEGKVFEQNCGREDGNLA

YTDEELWGDVNDIMAAGHQTQAATMSTALLYVSRDEKVKRAIEREVAALGGRAPTFEDVSEGRLAYTQSVIKETLRLHPP

IHMFPRLAAEDDVMPTGHEVKAGDLILLSTWAMGRNPSVWESPLEFDPDRFTDERLVKLARDQNPGADDEEIDRAVTMLK

SGRDFIYTPFGAGPRSCIGGLFSLLTVTTIVASCVQRFDFTPDEESLPADAEIPLRYDVTMCFPKGLKMRLRRRDLDGAA

EAATPGPAVAAAR*

 

>CYP801A2 Micromonas sp. RCC299

EuGene.0200010345|MicpuN3

C-term 52% to estExt_fgenesh1_pg.C_Chr_07.00010215 Ostreococcus tauri,

C-term 52% to eugene.0700010342 Ostreococcus lucimarinus

Shows some fragmentary similarity to CYP769A1 in Chlamydomonas (EXXR to PERF and heme signature)

33% to Thalassiosira pseudonana XM_002288591.1 (alignment has some gaps, so the true identity is better than 33%)

38% to Phaeodactylum tricornutum XM_002178008.1 C-term part

MLAGAPGTAAAAAAASVLALEGGEHALAWGVALYVICVATWTLADAVGARLVFRGKIPSVPWRPNLMFNVYATPRRNLLD

RISRRMAAARDGNDATTKHRHRHRAFATVVGNCPFAHVGGAALARAALDRQPVKSPLYRAFEAFAGVGIFTAEGDDWAAK

RSEVLGAFATAGLEPLAAASMRAAAGLTAEIDAQLAAAARGKGETRRSPLDDAEKGEGGEGRGGVGLEVDTAVEMDMLPR

LQRATLRATFEYLAGVDLPTAAAAEAGDALHDATDQPARPAGVWEDEYLAAATDLRALIPARARSVWMLSDWAYALSPLG

RLERTRIREARRLPELALRAAVPGSPLDHLRRGPAHSRRRLAGRSGGSIFGDWFLGLFRGLFRLGSRDPLLDEATTLLFA

GHDTQSATLSWALLRLAGDPAAQRELRASLADDPAAMESLGLAAGDKAASASTTRTEPVQPAWRTSAGAPVLEACLRETL

RLHPVAPFVARKLTSDAVASAGADGGDALTLPAGCAAGVWLHAVHRDPAVWVDPDSFDPSRWLITDSNAGGQGAGFSPRA

DSIHHSQGSNTPGVRVRFKGSGFMPFATGPRACVGQHLAWVYMRVVLARLVCAYEVRLAEGEGEGDALTPSVGFTVTPAN

AARVRLVPVGG*
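
The CYP801A2 note above, that gaps make the 33% figure an underestimate, reflects how percent identity depends on whether gapped alignment columns are counted. The following is a minimal Python sketch of both conventions, assuming two pre-aligned strings of equal length with "-" as the gap character. The function name percent_identity and the toy alignment are hypothetical, and this is not necessarily how the percentages in this listing were computed.

```python
def percent_identity(aln_a: str, aln_b: str, count_gaps: bool = True) -> float:
    """Percent identity between two aligned sequences.

    count_gaps=True  -> gapped columns count in the denominator.
    count_gaps=False -> only columns with a residue in both sequences count.
    """
    if len(aln_a) != len(aln_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    columns = 0
    for x, y in zip(aln_a, aln_b):
        if x == "-" and y == "-":
            continue  # column is empty in both sequences; ignore it
        gapped = x == "-" or y == "-"
        if gapped and not count_gaps:
            continue
        columns += 1
        if not gapped and x == y:
            matches += 1
    return 100.0 * matches / columns if columns else 0.0


# Toy alignment (assumption: illustrative only, not taken from the listing).
a = "ACDEFGHIK"
b = "ACD--GHLK"
print(round(percent_identity(a, b, count_gaps=True), 1))
print(round(percent_identity(a, b, count_gaps=False), 1))
```

With the toy alignment, excluding the two gapped columns raises the reported identity, which is the effect the note describes.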

 

>CYP801A2 Micromonas pusilla CCMP1545

fgenesh1_pg.C_scaffold_4000274|MicpuC2

52% to EuGene.0200010345|MicpuN3, probable ortholog

MIAGTPGAAATSAAVVILGAEGGARGLVLGAGCYAILCVVALVVDLARARLALRGRVPIVPWRPNLMFRVYATSKRSLLD

RVSRRMANPARVGVGDGDGDGDGATTKGGGGALDADPPARPHPPRAFGTVIGTCPFAHVGGASLARAALRAQPVKAPLYR

AFEAFAGGGIFTEEGERWEAKRAEVLRSFAVVGLDALAEASTRVATELARDIEDSASSSSGVETEMLPRLQRATLRATFE

YLTGVTVPEAAAAAAAVAAARDRAKRATSSSSRGDPLEVPLRGAPASFADAEEDEARRTAARWEDEYLAAATALRHLIPA

RARSVWMASDWAYRLSPVGRIERRSIRAARKLPTLAVRVAKPGSPLDVLSKGVAHGGGGGGGGGKGAIRLRRRLAPRRDD

DDGFGPPPPKALVDEAVTLLFAGHDTQSATLSWALLRLASDEKTQTRLRDSILADADAARDLGLDDLLPADAPPVDRAVS

RPSPTSGDANRRQRQPAWSAAARAPVLEAVLRETLRLHPVAPLVVRKLTSDAVDDGDATSSGGTTTLPEGCAVGVWLHAV

HRDPAVWSEPETFSIGRWLTTARAWRAERAGADASREDDGGSGGVGGGDDEHGDDVVVRFKGAGFMPFASGPRACVGQHL

AWVFMRLTLARLACAFDVRPGSRREGEEEGEEEDPLTPSVGFTVTPANAARVRLIPRRAGRA*

 

>CYP802A2 Micromonas sp. RCC299

EuGene.0800010406|MicpuN3

54% to ost_07_008_042|Ost9901_3 Ostreococcus lucimarinus XM_001418626.1

49% to fgenesh1_pg.C_Chr_07.0001000106 Ostreococcus tauri

MTRGQRRERTDTSQTPSPRLGHVPLEFTSRMRVSVYSTSVHSTSCLGAYTAAARDARRLRHRRLATTVCRNKREISSKLN

LDLDLGQVLGTVKRNEGLYDNLPPGKVGLFGLRETFAYLNDPNKFIRTRVEKYGPVFKTAFFFKPAVVFGSPEAIREFKD

FEGDLPADAALPETFRELHTEYGALRMSGERHKATRANFGKVLGRSALTHYTPILAKLTRDFVKGELLEKRTLQPGYDCR

QFCLKALFQLFLGTVPPQDIMEKMYFYNEGLLALGKLSPEFTEGKKALEDLQEFCLKHFRTVRAQGKLDDPEYFFLKQYS

QATDENGDLFTDERVAVTTILMIWGAFIEAAASMGHTTWLLMRNPDKAKKVRAECRSSFSREELDSGKLTLDDVYTKLTF

TECAIKEALRVMPQTAGGLRVNPETRTLAGFTVPSGYVLTADPRIAFLNPDFFPDPEDYRPERFLPAENPAITPDNFFPG

GMGQHKCPGISLSNLMVSIYLLYLYSCFDKWEPDMSAEEMDSEDPQYIQVPIVIIDDRYQLKLERNWQYEM*

 

>CYP802A2 Micromonas pusilla CCMP1545

fgenesh1_pg.C_scaffold_5000331|MicpuC2

62% to EuGene.0800010406|MicpuN3, probable ortholog

MPTSATAAGALASARALAPPPPRIRRAPRDSSPARQRRRPRRVIATRAAGFETDIGQAIGSIVRKEVTLNLPPGRVGLSG

IRETFEYLQDPRGFIERRVEAYGPVFKTGFFFKPAIVFGSAEAIEEYKRFEGELPADEALPETFRELHTAYGALRQSGAQ

YKATRANFGKVLGRAALTHYAPIIARQTRDFVRGDLLASGTLQPGYECRQHCLRSLFELFLGAIPPEDTMMKMYDYNEGL

LSLGKLTNEFEQGKAALETLTEFVLQTYRRVRASGELETDPRYFFLRQYSTATDENDELFPDDRVATTVVLMVWGAYIEA

AASMGHCAWLLMRNPDAAAKVRAECKRVLSPDALASGNVPIETLMELKYTEACVKEALRTVPQTAGGLRINPTTRTLAGY

DVPAGYVLTADPRIPFLDAKNYPEREKFQPERFLPEGAAAGNVVNGETYFPGGMGQHQCPGISLSTLMTQTFLAYMTTTF

DGWTPDLSGEGSEDPAYVQIPIVIIDDRYRLKLEKNWQFDTFDAK*

 

>CYP803A1 Micromonas sp. RCC299

EuGene.0800010288|MicpuN3

no good hit, 28% to CYP747A1 Chlamydomonas

69% to Micromonas sp. CCMP490 EST EC847008.1

MSALVANAVGPVAVARRPPRRMRHAPRERVSAPRASRADDLPGILLGAAAKKLEEDVNSFIGLFDEDAPTHERPPTLPVA

GNTLDIAQGGHRQLLEWAETYGVADGVHEVKMLSQTILHLTDPKLARELMFERSDSFPDRGVSAMAKFFREDQAAFVNTS

GEQWMAYRKMGTATVNGGALDRLAGKVAERSEALVTRWVRDASASGDGRNATEVDISDASQAVTLEVIHEALFSEQLDVI

DGERNAVALARSFREFNVANQDLLNDFLTLYQRFETPERARRDTHRRRLRAHFDERADARRAAIARDGAAAAPRDLLTAL

LTARDPATGAALTRDDVNLTLTEMMVAGHDTTAATVACMMCLLASHPEVRASVTGEVDEFRRNNGGRLPSSVADANALVK

LDDAMKETMRLYPAVLIVVRKAEEGTGGVFTKGPGREVRIPEGSGMWVSPYVLGRLARHWGGDEEDVKRFRPSRFEEARE

RGDSLDAYMPFGGGPRVCLGSRFAMLEGKVLAAHILADWDVELAAETRDAIARNDGELPIAYAAGLMSFPEPLRLRVRRR

GAASGPR*

 

>CYP803A1 Micromonas pusilla CCMP1545

EuGene.0000020033|MicpuC2

57% to EuGene.0800010288|MicpuN3, probable ortholog

yellow may be part of an intron

MKEAMRLYPAVLVVVREDG

LFFPADLSAHQPSVSIPALGAFQLRLTPLNSTPTFASLVWTLDPQ

AEELGDGGDIVVEPGP

DDSARGRRTIRVPPGTGLWMSPYVYGRLPKCWGEDSDEAVARYRPERWAAMRENGESAPDAYMPFGGGPRVCLGSRFATL

EGKARAISHWSPYDRVRVVNADP*

 

>CYP803A1 Micromonas sp. CCMP490 EST EC847008.1

69% to EuGene.0800010288|MicpuN3, probable ortholog

45% to CYP769A1 Chlamydomonas, 42% to CYP746A1 Chlamydomonas

GAQMWISPYVMGRLPHLWGGDVEDTKRFRPERFAEFRDQGIKEP

EAFMPFGGGPRVCLGSRFAMLEGKVTAAAILRDFD

 

>CYP804A1 Micromonas sp. RCC299

EuGene.0900010042|MicpuN3

no good match

Hyaloperonospora parasitica CU855855.1 7300-8000 range ~36% match in two pieces

(a stramenopile; Oomycete)

60% to Micromonas sp. CCMP490 EST EC846298.1

41% to Phytophthora infestans CV943686.1

MGGYFTSDRAAKYGWGVPRLALVTLFAFEIVPRAFLAGSRAPVDALVSTPLPFTARAAAAPFAPLTLVPYLHSLRWIAHA

YTGSIGAGVNVQVLGLLGNLAIMAVAGGLCLWDLTKDISLVVYFLKTFCGDQLAGKLDTAEWKLVLAFTFALPLAMWGAA

GANGAAFAMFLPYARIAPILFAIQAVCEFGDAHLEYHPVIGKFFRHRYGFEAFTLCALTMMPGGITAPELRVVQFDLVIC

LFYRVANLGIILHKQGFAVAAAASLAVTIRKGCFRVFGALSGQRIVNVTDAEVATAVMRASDVKGDALERHVATPAWRPL

LSLESVDHELYRNMLRDFHAVVKACPPPQRVGEIARAKVDELMYRTYSEEAEEASPVRGGAEPSPPPSPERPVVDVTLAD

SPVDSPHPHGGKGGDVGKCPFVQMQRTMRGGASGQSNAAGRSTPARSDAAPVIDADDVARLSLSVFIEYLFGREWEPKFE

TLLAASWEWRKEIAVRGRADPGVKKAAVELVVDDLIKNSHLWDLFGEKWREPRYYSLIMQPFLVSPAINVGDIAVAMKAH

PDLALEPAMRRMHPFPIFERWVDKDVVVDGRIAVRADTQVIMFTSDFANSKHLWPAFGTGPRACAGTSMALGVLNAIHQK

MLGRPGFEPERGHKFSGRNNDGVTSLSEVWYFAKTVLPVVFGFGGEKTTEAAALERAAAAALE*

 

>CYP804A1 Micromonas pusilla CCMP1545

EuGene.0000070606|MicpuC2

60% to EuGene.0900010042|MicpuN3, probable ortholog

MSGASRGDGGGYFTNDRAVKYGYGVPRLALVTLFAFEIVPRLFLAAERGGSATSAATSSASAADAAFEFFTSSPPADPRR

WALTLAPYLASRRFIGHVYIGSIGAGVNVQVLGFFGNILMLLLAAFACVWDAKRDLSIVIYLMKTFAGDQLAGKLDTAEW

KLVLFFTFGCMAPWVYFASGSMDAAASAFALYARILPILFLVQAVCEYGDAHYERFVFFRHRYGFEFFTLLALTSLPGGI

TPEELRVVQYDLVICLMYRVSNLIIIANKNGLCSGVLGCFTIAARKLCFAAFGVLRGQRLVNVTDPEVATAVMRASVVKG

DALERHVATPAWRPLLSLESVDGELHSSMMRDFHDLARRLPSPGRLGEIATRRVTELIRAHELKARIAKAAKAKELGVDV

DVAIDARAVARLSLTVFVEYVFGRRWEPAFEPLLDATWEWRKEIAVRGKADVKLKRIAVDVVVNDLLRKHPVLWHVHGED

WAKPRYYSLILQPYLVSPAINVGDIAVAMARHPNLSLENAMRRMHPFPIFERYVDRDVLIRGKVVVRAHTQVIMFTTDFA

STRYRNAQWPVFGAGPRACAGTQLALGILAAIRNNMVGHELFEPTIGHRHSGRHNDGVTTPAEAFYSAATILPIVMGWRG

DEFSDDAESLELAAAAALSGRVR*

 

>CYP804A1 Micromonas sp. CCMP490 EST EC846298.1

60% to EuGene.0900010042|MicpuN3, probable ortholog

56% to EuGene.0000070606|MicpuC2

DAEIKKKAVELVVNDLLRNCDKLWAIHGEDWQEPRYYSLIMQPFLISPAINVGDVAVSLK

RNPHLKLEHAMRASHPFPIFERFVDEDVFIGSGLKKKLAVRKNTQVVMFTSDFCGSAIPW

PVFGAGPRMCAGTGMALGVLRAVATGFQNTDRFEPERGHKYSGRHNERVASFAELAYFVK

TVVPIVLGLG