May 20, 1999

A possible Chloroplast P450

A Genbank entry AF107765 from Prunus dulcis (almond) (Mar. 9, 1999) reports a fragment of a P450 sequence that is said to be from a chloroplast gene encoding a chloroplast protein. The sequence is 64-71% identical to 71D sequences and if these have a recent common ancestor, as would be expected from the percent identity between them, the whole 71D subfamily may represent chloroplast P450s as well. At present, there is no publication showing that these are chloroplast genes or proteins, but if it is true, it would be the first example of a P450 sequence from this organelle. It would also be the first P450 in a eukaryote that was not encoded by the nuclear genome. This raises questions of its origin. Did this gene go into the chloroplast genome from the nucleus, since so many other P450s from the CYP71 family are known, and all are nuclear genes in the Arabidopsis genome. The chloroplast genome from several plants like Zea mays and tobacco has been sequenced and there are no P450s in these genomes. This would make it very unlikely that the almond would have a P450 in its genome. The other explanation may be that this is due to a cloning artifact. Prunus dulcis cytochrome P450 (P450) gene, chloroplast gene encoding chloroplast protein, partial cds. Ma,R. and Oliveira,M.M. A putative Prunus dulcis cytochrome P450 gene sequence Unpublished MFVPRECREKCEIDGYEIPVKSKVIVNAWAIGRDPNYWNEPDSFNPDRFLDSSIDY KGTNFEYIPFGAGRRMCPGMSFGLANVELPLALLLYHFDWKLPDG Alignments: CYP71D7 gb|U48435|SCU48435 Solanum chacoense putative cytochrome P450 gene, complete cds Length = 3107 Score = 166 bits (415), Expect = 2e-40 Identities = 71/101 (70%), Positives = 85/101 (83%) Frame = +3 Query: 1 MFVPRECREKCEIDGYEIPVKSKVIVNAWAIGRDPNYWNEPDSFNPDRFLDSSIDYKGTN 60 + VPRECRE+ EI+GY IPVK+KV+VN WA+GRDP YW++ +SF P+RF SID+ G N Sbjct: 2376 LLVPRECREETEINGYTIPVKTKVMVNVWALGRDPKYWDDAESFKPERFEQCSIDFIGNN 2555 Query: 61 FEYIPFGAGRRMCPGMSFGLANVELPLALLLYHFDWKLPDG 101 FEY+PFG GRR+CPG+SFGLANV LPLA LLYHFDWKLP G Sbjct: 2556 FEYLPFGGGRRICPGISFGLANVYLPLAQLLYHFDWKLPTG 2678 CYP71D10 gb|AF022459|AF022459 Glycine max cytochrome P450 monooxygenase CYP71D10p (CYP71D10) mRNA, complete cds Length = 1691 Score = 165 bits (413), Expect = 4e-40 Identities = 67/100 (67%), Positives = 89/100 (89%) Frame = +1 Query: 1 MFVPRECREKCEIDGYEIPVKSKVIVNAWAIGRDPNYWNEPDSFNPDRFLDSSIDYKGTN 60 + VPR RE+C+I+GYEIP K+++I+NAWAIGR+P YW E +SF P+RFL+SSID++GT+ Sbjct: 1150 LLVPRVSRERCQINGYEIPSKTRIIINAWAIGRNPKYWGETESFKPERFLNSSIDFRGTD 1329 Query: 61 FEYIPFGAGRRMCPGMSFGLANVELPLALLLYHFDWKLPD 100 FE+IPFGAGRR+CPG++F + N+ELPLA LLYHFDWKLP+ Sbjct: 1330 FEFIPFGAGRRICPGITFAIPNIELPLAQLLYHFDWKLPN 1449 CYP71D8 emb|Y10493|GMC450CP7 G.max mRNA for putative cytochrome P450, clone CP7 Length = 1800 Score = 164 bits (410), Expect = 9e-40 Identities = 69/98 (70%), Positives = 85/98 (86%) Frame = +2 Query: 3 VPRECREKCEIDGYEIPVKSKVIVNAWAIGRDPNYWNEPDSFNPDRFLDSSIDYKGTNFE 62 +PREC IDGYEIP+K+KV++N WAIGRDP YW++ D F P+RF DSSID+KG +FE Sbjct: 1193 IPRECIISTNIDGYEIPIKTKVMINTWAIGRDPQYWSDADRFIPERFNDSSIDFKGNSFE 1372 Query: 63 YIPFGAGRRMCPGMSFGLANVELPLALLLYHFDWKLPD 100 YIPFGAGRRMCPGM+FGLA++ LPLALLLYHF+W+LP+ Sbjct: 1373 YIPFGAGRRMCPGMTFGLASITLPLALLLYHFNWELPN 1486 CYP71D9 emb|Y10490|GMC450CP3 G.max mRNA for putative cytochrome P450, clone CP3 Length = 1754 Score = 161 bits (403), Expect = 6e-39 Identities = 67/101 (66%), Positives = 84/101 (82%) Frame = +3 Query: 1 MFVPRECREKCEIDGYEIPVKSKVIVNAWAIGRDPNYWNEPDSFNPDRFLDSSIDYKGTN 60 + +PREC + CEI+GY IP KS+VIVNAWAIGRDP W E + F P+RF++ SI+YK + Sbjct: 1122 LLLPRECGQACEINGYHIPAKSRVIVNAWAIGRDPRLWTEAERFYPERFIERSIEYKSNS 1301 Query: 61 FEYIPFGAGRRMCPGMSFGLANVELPLALLLYHFDWKLPDG 101 FE+IPFGAGRRMCPG++FGL+NVE LA+L+YHFDWKLP G Sbjct: 1302 FEFIPFGAGRRMCPGLTFGLSNVEYVLAMLMYHFDWKLPKG 1424 CYP71D11 gb|AF000403|AF000403 Lotus japonicus putative cytochorome P450 mRNA, partial cds Length = 1641 Score = 161 bits (402), Expect = 8e-39 Identities = 69/97 (71%), Positives = 83/97 (85%) Frame = +1 Query: 5 RECREKCEIDGYEIPVKSKVIVNAWAIGRDPNYWNEPDSFNPDRFLDSSIDYKGTNFEYI 64 RECR+ CEI+GY IP KS V+VN +AIG D YW EP+ F P+RF+DSSIDYKGTNFE++ Sbjct: 1075 RECRQACEINGYHIPAKSTVLVNTFAIGTDSKYWAEPERFCPERFIDSSIDYKGTNFEHL 1254 Query: 65 PFGAGRRMCPGMSFGLANVELPLALLLYHFDWKLPDG 101 PFGAGRR+CPG+++G+ANVEL LALLLYHFDW LP G Sbjct: 1255 PFGAGRRICPGINYGMANVELVLALLLYHFDWTLPKG 1365 CYP71D6 gb|U48434|SCU48434 Solanum chacoense cytochrome P450 mRNA, complete cds Length = 1641 Score = 155 bits (387), Expect = 4e-37 Identities = 67/101 (66%), Positives = 81/101 (79%) Frame = +3 Query: 1 MFVPRECREKCEIDGYEIPVKSKVIVNAWAIGRDPNYWNEPDSFNPDRFLDSSIDYKGTN 60 + VPRECR++ EI+GY IPVK+KV+VN WA+GRDP YW++ + F P+RF SID+ G N Sbjct: 1113 LLVPRECRKETEINGYTIPVKTKVMVNVWALGRDPKYWDDVECFKPERFEQCSIDFIGNN 1292 Query: 61 FEYIPFGAGRRMCPGMSFGLANVELPLALLLYHFDWKLPDG 101 FEY+PFG GRR+CPG SFGLAN LPLA LL HFDWKLP G Sbjct: 1293 FEYLPFGGGRRICPGTSFGLANDYLPLAQLLCHFDWKLPTG 1415