rice chr 1 P450s in annotated list in order on the chr [46 genes]

from http://rgp.dna.affrc.go.jp/rgp/complete-chr/chr1/chr1-complete.html

and details from:

http://ricegaas.dna.affrc.go.jp/chr1-bin/search_table.pl

 

This list is taken from the chromosome 1 annotations found by a

keyword search for P450.  Not all P450s on chr 1 were annotated in this table.

 

Of the 46 genes annotated here 12 agree 100% with my annotations

2 are fusions of two P450s, 2 more are fusions to other genes

71T2 711A2 and 711A3 are split into two genes each

 

Three pseudogenes are represented as intact genes by

creative splicing to avoid frameshifts and stop codons,

and an artificial choice of N and C-termini to finish the gene.

 

The pseudogenes CYP94D8P, CYP715B3P, 71AA1P, 71AA4P, 72A36P, 76H12P

are missed in this annotation, but the annotation does not

cover pseudogenes.

 

CYP734A6, CYP71AA3, CYP71C18, CYP71C19 are also missed

 

Gene No. : 1-1_001 CYP715B2P

8174..8624 , 8639..8844 , 8948..9019 (-)

>1-1_001

MAEGDEWARHRCIVAPAFSATNLNDMIGVMEETTSKMLGEWSDIVALGHSCIDIEKGVVR

NAAEIIAKASFSIAADDATVFHK AAGDAVPLHAVPLASLLHIRADRATYEAWKLGRKIDA

LLLDIIESRRRCEGGGRKTTTTDLLWLLLAGNEASAAAERKLTTALALSWTLLMLATHPE

WRAAVREEVEEVTGWSGPMDAAAMGKLTKMGCMLNEVLRLYPPSPNVQRPAACDAEVVRG

KR

 

>AP003727.3 $P CYP715B2P chromosome 1 clone:P0672D08 Pseudogene fragment

missing N and C-terminal and part of I-helix 39% to 715A1

NRMPMFGRGRVMAEGDEWARHRCIVAPAFSATNLN

DMIGVMEETTSKMLGEWSDIVALGHSCIDIEKGVVRNAAEIIAKASFSIAADDATVFHK (frameshift)

VRLVSVPLASLLHIRADRATYEAWKLGRKIDALLLDIIESRRRCEGGGRK

TTTTDLLWLLLAGNEASAAAERKLTTALALSWTLLMLATHPEWRAAVRE this is missing from AP004123

EVEEVTGWSGPMDAAAMGKLTKMGCMLNEVLRLYPPSPNVQRPAACDAEV

 

Gene No. : 1-2_239 CYP96D1 100% agreement

1216403..1217523 , 1217636..1218047 (+)

>1-2_239

MGPLWTFILLYPEIFLAIICFFWFSLFRPIRQRQKSNLPVNWPVFGMLPFLVQNLHYIHD

KVADVLREAGCTFMVSGPWFLNMNFLITCDPATVNHCFNANFKNYPKGSEFAEMFDILGD

GLLVADSESWEYQRRMAMYIFAARTFRSFAMSTITRKTGSVLLPYLDHMAKFGSEVELEG

VFMRFSLDVTYSTVFAADLDCLSVSSPIPVFGQATKEAEEAVLFRHVIPPSVWKLLRLLN

VGTEKKLTNAKVVIDQFIYEEIAKRKAQASDGLQGDILSMYMKWSIHESAHKQKDERFLR

DTAVGFIFAGKDLIAVTLTWFFYMMCKHPHVEARILQELKGLQSSTWPGDLHVFEWDTLR

SAIYLQAALLETLRLFPATPFEEKEALVDDVLPNGTKVSRNTRIIFSLYAMGRIEGIWGK

DCMEFKPERWVSKSGRLRHEPSYKFLSFNTGPRSCLGKELSLSNMKIIVASIIHNFKVEL

VEGHEVMPQSSVILHTQNGMMVRLKRRDAA

 

Gene No. : 1-2_240 CYP96E1

1220501..1221994 , 1226247..1226672 (+)

>1-2_240

MELLPWLLGFVVKYPEIMASAACFLLLFCRFRRRSKRIPTNWPVVGALPAIVANAGRVHD

WVTEFLRAAAMSHVVEGPWGSPGDVLITADPANVAHMFTANFGNYPKGEEFAAMFDVLGG

GIFNADGESWSFQRRKAHALLSDARFRAAVAASTSRKLGGGLVPLLDGVAASGAAVDLQD

VFMRLTFDLTAMFVFGVDPGCLAADFPTVPFAAAMDDAEEVLFYRHVAPVPWLRLQSYLK

IGHYKKMAKAREVLDASIAELIALRRERKAADANATGDADLLTAYLACQDEIGMDGAAFD

AFLRDTTLNLMVAGRDTTSSALTWFFWLLSNHPGVEARILAELRAHPPSPTGAELKRLVY

LHAALSESLRLYPPVPFEHKAAARPDTLPSGAAVGPTRRVIVSLYSMGRMEAVWGKGCEE

FRPERWLTPAGRFRHERSCKFAAFNVGPRTCLGRDLAFAQMKAVVAAVVPRFRVAAAAAP

PRPKLSIILHMRDGLKVK RRDPVQGGGGHRRGHHHEICRCPQSGSGELDKATAMAADEEE

EVAPNLVFVTIQLPPSSSSSPLKTTQQLDGEGEELIGVQPKEEDRRLEEEEGGGVAADLA

VSRGPSRQACRCTGQESRAVGRGRKQGERRPEGEGICRR

 

>AP002484b $F CYP96E1 CDS  80463..81980 43% to 96A1

MELLPWLLGFVVKYPEIMASAACFLLLFCRFRRRSKRIPTNWPV

VGALPAIVANAGRVHDWVTEFLRAAAMSHVVEGPWGSPGDVLITADPANVAHMFTANF

GNYPKGEEFAAMFDVLGGGIFNADGESWSFQRRKAHALLSDARFRAAVAASTSRKLGG

GLVPLLDGVAASGAAVDLQDVFMRLTFDLTAMFVFGVDPGCLAADFPTVPFAAAMDDA

EEVLFYRHVAPVPWLRLQSYLKIGHYKKMAKAREVLDASIAELIALRRERKAADANAT

GDADLLTAYLACQDEIGMDGAAFDAFLRDTTLNLMVAGRDTTSSALTWFFWLLSNHPG

VEARILAELRAHPPSPTGAELKRLVYLHAALSESLRLYPPVPFEHKAAARPDTLPSGA

AVGPTRRVIVSLYSMGRMEAVWGKGCEEFRPERWLTPAGRFRHERSCKFAAFNVGPRT

CLGRDLAFAQMKAVVAAVVPRFRVAAAAAPPRPKLSIILHMRDGLKVK VHRRQED*

 

 

Gene No. : 1-2_394 CYP90D2

2035743..2035993 , 2036078..2036405 , 2036438..2036531 , 2036850..2037055 , 2037375..2037623 , 2038901..2039086 , 2039597..2039718 , 2043035..2043131 (+)

 

>1-2_394

MVSAAAGWAAPAFAVAAVVIWVVLCSELLRRRRRGAGSGKGDAAAAARLPPGSFGWPVVG

ETLEFVSCAYSPRPEAFVDKRRKLHGSAVFRSHLFGSATVVTADAEVSRFVLQSDARAFV

PWYPRSLTELMGKSSILLINGALQRRVHGLVGAFFKSSHLKSQLTADMRRRLSPALSSFP

DSSLLHVQHLAKS LLDEIEWVDELEEEQSGWAWASAGVRAHVRAQMRMERNVIARNGDEM

QMQ VVFEILVRGLIGLEAGEEMQQLKQQFQEFIVGLMSLPIKLPGTRLYRSLQAKKKMAR

LIQRIIREKRARRAAASPPRDAIDVLIGDGSDELTDELISDNMIDLMIPAEDSVPVLITL

AVKFLSECPLALHQLE VITETLRLGNIIGGIMRKAVRDVEVKGHLIPKGWCVFVYFRSVH

LDDTLYDEPYKFNPWRWKEKDMSNGSFTPFGGGQRLCPGLDLARLEASIFLHHLVTSFRW

VAEEDHIVNFPTVRLKRGMPIRVTAKEDDD

 

>AP003244 $F CYP90D2 59% to 90D1

CDS join(30874..31124,31209..31536,32037..32186,32506..32754,

33832..33921,34032..34217,34728..34849,38166..38262)

AQ157843 64% identical to AQ290163 75% TO 90C1 AT HEME BINDING REGION

C97894 Rice callus Oryza sativa cDNA clone C0085_11A, mRNA sequence

extreme C-term 71% to CYP90C1 opp end = C97895 (probably 3 prime untranslated)

MVSAAAGWAAPAFAVAAVVIWVVLCSELLRRRRRGAGSGKGDAA

AAARLPPGSFGWPVVGETLEFVSCAYSPRPEAFVDKRRKLHGSAVFRSHLFGSATVVT

ADAEVSRFVLQSDARAFVPWYPRSLTELMGKSSILLINGALQRRVHGLVGAFFKSSHL

KSQLTADMRRRLSPALSSFPDSSLLHVQHLAKS VVFEILVRGLIGLEAGEEMQQLKQQ

FQEFIVGLMSLPIKLPGTRLYRSLQAKKKMARLIQRIIREKRARRAAASPPRDAIDVL

IGDGSDELTDELISDNMIDLMIPAEDSVPVLITLAVKFLSECPLALHQLE EENIQLKR

RKTDMGETLQWTDYMSLSFTQH VITETLRLGNIIGGIMRKAVRDVEVKGHLIPKGWCV

FVYFRSVHLDDTLYDEPYKFNPWRWKEKDMSNGSFTPFGGGQRLCPGLDLARLEASIF

LHHLVTSFRWVAEEDHIVNFPTVRLKRGMPIRVTAKEDDD

 

Gene No. : 12_245 CYP94E2 100% agreement

1349356..1350990 (-)

>12_245

MDGTLAPLLLLLLLFLPALLLYLRRRPAAASRINNNHCPHPNPVLGNALPFLRNRHRFLD

WATDLLAAAPTSTIEVRGALGLGSGVATANPAVVDHFLRASFPNYVKGARFAVPFEDLLG

RGLFAADGRLWALQRKLASYSFSSRSLRRFSARVLRAHLHRRLVPLLDAAAGSGEAVDLQ

DVLGRFGFDNICNVAFGVESSTLLEGGDRRHEAFFAAFDAAVEISVARVFHPTTLVWRAM

RLANVGSERRMRDAIRVIDEYVMAIVASEERLRLRRGEDEREHEQHLLSRFAASMEEEGG

ELAAMFGSPGAKRRFLRDVVVSFVMAGKDSTSSALTWLFWLLAANPRCERRVHEEVSSSR

HADPRRADAGEDGHGDGYDELRRMHYLHAAISEAMRLYPPVPIDSRVAVAADALPDGTAV

RAGWFADYSAYAMGRMPQLWGEDCREFRPERWLSDGGEFVAVDAARYPVFHAGPRACLGR

EMAYVQMKAVAAAVIRRFAVEPVQAPASMETPPACEVTTTLKMKGGLLVRIRKREDDAAQ

QKLT

 

>AP003735.2b $F CYP94E2 genomic DNA, chromosome 1, BAC clone:B1147A04, complete

61% to AP003735 4872-6534

8263 MDGTLAPLLLLLLLFLPALLLYLRRRPAAASRINNNHCPHPNPVLGNALPFLRNRHRFLDW 8445

8446 ATDLLAAAPTSTIEVRGALGLGSGVATANPAVVDHFLRASFPNYVKGARFAVPFEDLLGR 8625

8626 GLFAADGRLWALQRKLASYSFSSRSLRRFSARVLRAHLHRRLVPLLDAAAGSGEAVDLQD 8805

8806 VLGRFGFDNICNVAFGVESSTLLEGGDRRHEAFFAAFDAAVEISVARVFHPTTLVWRAMR 8985

8986 LANVGSERRMRDAIRVIDEYVMAIVASEERLRLRRGEDEREHEQHLLSRFAASMEEEGGE 9165

9166 LAAMFGSPGAKRRFLRDVVVSFVMAGKDSTSSALTWLFWLLAANPRCERRVHEEVSSSRH 9345

9346 ADPRRADAGEDGHGDGYDELRRMHYLHAAISEAMRLYPPVPIDSRVAVAADALPDGTAVR 9525

9526 AGWFADYSAYAMGRMPQLWGEDCREFRPERWLSDGGEFVAVDAARYPVFHAGPRAC 9693

9694 LGREMAYVQMKAVAAAVIRRFAVEPVQAPASMETPPACEVTTTLKMKGGLLVRIRKR 9864

     EDDAAQQKLT* 9897

 

Gene No. : 12_246 CYP94E1 100% agreement

1352719..1354368 (-)

>12_246

MDSSYLHLVLPAAAAAVVVAVVLLLSLWRRCQTTSNHRPQANPILGNLVAFLANGHRFLD

WSTGLLAAAPASTMQVHGPLGLGYCGVATASPDAVEHMLRASFHNYVDKGDRVRDAFADL

LGDGLFLANGRLWRLQRKLAASSFSPRLLRLFAGRVVLDQLRRRLLLFFDAAADARRVFD

LQDVLKRFAFDNICSVAFGVDRDDSSPSSSSPSRLEAGGDGRDDAFFAAFDDAIDISFGR

ILHPTTLAWKAMKLLDVGSERRLRQAIGVVDEYVTAIMESKQRCSDSEEESDLLSRFTAA

MMEEDGGNELGAMFDSPEAKRRFLRDTVKTFVLAGKDTTSSALTWLFWFLAANPECERRV

YEEVTALRGDTAGDERDDGYEELKRMHYLHAAITETMRLYPPVPLASRVAAADDVLPDGT

VVRAGWFADYSSYAMGRMPQLWERDCGEFRPERWLDGGGGGGGRFVAVDAARYPVFHAGP

RSCLGKEMAYVQMKAVAAAVVRRFSVEVVPAAAANAPPSPPPHETAVTLRMKGGLRVLLT

RRRGVLSHA

 

Gene No. : 12_317 CYP71AA2 100% agreement

1719193..1720104 , 1720260..1720934 (+)

>12_317

MAGIMDSTTASYYTTLLCGALLLAAVVFKLKTAAAFSRHNAGVNLPPGPWALPVIGSIHC

LLGSLPHHAMRELSRRYGPVMLLRLGHVRTLVLSSPEAAREVMKTHDVAFANRAVTPTAS

IDIVFAPFGKHLRELRKLCALELLSPRRVRSFRHVREEEAARLARSVAAAASASSAVNVS

ELVKIMTNDVTMRAIIGDRCPQREEYLEALDKTMDLLAGFNLVDLFPGSPLARVLGGRSL

RTTKRVHEKLHQITEAIIQGHGIKDTVGDEHHECEDILDVLLRFQRDGGLGITLTKEIVS

AVLFDLFAGGSETTSTTILWAMSELMRSPHVMEQAKYEIRQVLQGKAMVSEADIEGRLHY

LQLVIKETLRLHPPVPIVIPRLCSKPNSKIMGYDIPQGTSVLVNVSAIGRDEKIWKDVNE

FRPERFKDDIVDFSGTDFRFIPGGSGRRMRPGLTFGVSNIEIALVTLLYHFDWKLPSETD

THELDMRETYGLTTRRRSDLLLKATPSYARLGWSTNMQIYSVKCLVYE

 

this pseudogene not annotated

>AP004326.2d $P CYP71AA1P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence Gene 4 pseudogene

81031 DFFFCGLETTTTATIWAI*EFIKNPHAVEKAQSDIRKILGGKSIVEEADIEGQLHYFQMVN 81213 frameshift

81213 QETLRLHPPVPLLLPRLWSEPCKIMGYDIP 81302 frameshift

81304  KNTAIFVNTWALGR 81345 frameshift

81344 KIKNTGLMQVSSGLKY

81393 SRMGIVDFNGLDFRFLPCGAGRRICLGLMFELSDIELTLASLLYHSSWRLPTRSYSNK 81566

81567 LDMTEANGITTHRRIDIWLEATPFVPR 81647

 

this gene not annotated

>AP004326.2b $F CYP71AA3 genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence

Gene 2 no good matches in NR 79% to AP004326.2c

71860 MAGIVDTAAFCT

71896 LLCLLLTLVVFKLKTATSSRHNAGVNLPPGPWALPVIGSIHCLLGSLPHHAMRELSRR 72069

72070 YGPVMLLRLGHVRTLVLSSPEAAREVMKTHDAAFATRAVTPTASILTYGARDIVFAPF 72243

72244 SKHLRELRKLCTLELLSPRRVRSFRHVRDEEAARLARSVAAAAPAVVNVSELVKI 72408

72409 MANNIIMTAIIGDTCPQRDEYLEALDKTMDLMNGFNLIDLFPGSRLARVLGARSLRATKR 72588

72589 VHQKLHQITDTIIQGHEIIKDGSVGDDTIQETVGTHNMHGHGHKCEDILDVLL 72747

72748 RFHRDGGLGITLTKEIVSAVLF 72813 (0)

73327 DLFAAGSETTSTTIIWAMSELVRTPHVMERAQSEIRQVLQGKTVVSEADIEGRLHYL 73497

73498 QLVIRETLRLHPPVPFLIPRLCSEANSKIMRYNIPQGAMVLVNISAIGRDEKIWKNANEF 73677

73678 RPERFKDDMVDFSGTDFRFIPGGAGRRMCPGLTFGLSNIEIALASLLYHFDWKLPNDASS 73857

73858 CKLDMRETHGVTARRRTELLLKATPLYT* 73944

 

this pseudogene not annotated

>AP004326.2a $P CYP71AA4P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence

Length = 102983

4 genes 71B like

Gene 1 pseudogene 71 family

67989 LPPVPWPLPVIGSMH*LLGSLPHH 68060 frameshift with deletion

68060 RPACAVELLSPRRARSFRRVREAEPARLVRAVAASPAWPLVNVVGGEHVAAMMTAV 68227

68228 GARP 68239 frameshift with small deletion

68238 RCPRQEEYLEELGKVAKLAAGFNLVDLFPESRLVRAAQAAHGKIHSIMDAMVQ 68396

68397 DHLKAMEERREEVADGVVDDGDGDGADRDEELLSILLRFQRDGGLGITLTNGNHQRDS 68570 (0)

68886 GILAGGSDTTTTTVMWAMSELLRCPRAMQ 68972 frameshift with deletion

69023 YMQLVIKETLRLHLPFPLLFPRLCTETCKIMGFDVPKGTIVIVNNWAISRDERCWEDAED 69202

69203 F*PERFEHDDTDYNGTYFQFLSGGFGRRMFPDFIFAQFNIEIALANLLYHFDWELPCSEN 69382

69383 RMELDMTESAGLTASRLTDLFG* 69451

 

Gene No. : 1-3_149 CYP710A5 this seq has long N-term

677956..679547 , 683244..683265 (-)

>1-3_149

MAIPGSKEHKCNCLSRSSRAFSTPRT MRTSTDPSGSIESFHGLVHLRTAAPLLAAAVALY

MLIEQLSYHRKKGSMPGAPLVVPFLGSAAHLIRDPVGFWDVQAALARKSGAGLAADFLFG

RFTVFIRDSELSHRVFANVRADAFHVVSHPFGKKLFGEHNLVYLVGEEHKDLRRRIAPNF

TPRALSTYAVIQQRVIISHLRRWLDRSASNGGKAEPIRVPCRDMNLETSQTVFVGPYLTE

KARERFDRDYNLFNVGFITLPVDLPGFAFRRARLAGARLMHTLGDCARQSRQRMLGGGEP

ECLLDYLMQETVREIDEATAAGL PPPPHTSDVEVGALLFGFLFAAQDASTSSLCWAVSAL

DSHPNVLARVRAEVAALWSPESGEPITAEMMSAMKYTQAVAREVVRYHPPATLVPHIAVE

AFQLTAQYTIPKGTMVFPSVYESSFQGFQDADAFDPDRFFSEARREDVVYKRNFLAFGAG

PHQCVGQRYALNHLVIFMALFVSLVDFRRERTEGCDVPVYMPTMVPRDGCVVYLKQR

 

>AP002092a $F CYP710A5 CDS complement(54602..56137) 60% TO 710A1

THIS SEQ IS THE SAME AS AP002093a CDS complement(98133..99668)

MRTSTDPSGSIESFHGLVHLRTAAPLLAAAVALYMLI EQLSYHR

KKGSMPGAPLVVPFLGSAAHLIRDPVGFWDVQAALARKSGAGLAADFLFGRFTVFIRD

SELSHRVFANVRADAFHVVSHPFGKKLFGEHNLVYLVGEEHKDLRRRIAPNFTPRALS

TYAVIQQRVIISHLRRWLDRSASNGGKAEPIRVPCRDMNLETSQTVFVGPYLTEKARE

RFDRDYNLFNVGFITLPVDLPGFAFRRARLAGARLMHTLGDCARQSRQRMLGGGEPEC

LLDYLMQETVREI

DEATAAGL this may be too long vs arab. Check for intron

PPPPHTSDVEVGALLFGFLFAAQDASTSSLCWAVSAL

DSHPNVLARVRAEVAALWSPESGEPITAEMMSAMKYTQAVAREVVRYHPPATLVPHIA

VEAFQLTAQYTIPKGTMVFPSVYESSFQGFQDADAFDPDRFFSEARREDVVYKRNFLA

FGAGPHQCVGQRYALNHLVIFMALFVSLVDFRRERTEGCDVPVYMPTMVPRDGCVVYL

KQR*

 

Gene No. : 1-3_150 CYP710A6 100% agreement

684165..685703 (-)

>1-3_150

MVESFHGLVVVDLRTAAPLLATAVALYILIEQLSYHRKKGSMPGPPLVVVPFLGSVTHLF

RDPVGFWDLQATRASKSGAGLTADFLFGRLMVFIRDSELSRRVFANVRADAFHLVGHPFG

KKLFGDHNLIYMVGKEHKDLRRRIAPNFTPRALSTYAVIQQRVILSHLRRWIDRSVANGG

KAEPIRVPCRDMNLETSQTVFVGPYLTVETRERFDRDYNLFNHGFITLPIDLPGSAFRRA

RLAVPRLKHILEDCARQSKQRMRGGGEPECLVDYLMQETVREIDEAAAAGLPPPPHTSDM

ETGNLLFDFLFAAQDASTSSLCWAVSALDSHPDVLARVRAEVAALWSPESGEPITAEMMT

EMKYTQAVAREVVRYWPPGPVVPHIAGEAFQLTEQYTIPKGTIVFPSVYESSFQGFPDAG

TFDPERFFSEARREDVVYKRNFLAFGAGPHQCVGQRYALNHLVIFMALLASLIDFRRERT

EGCDVPVYMPTIVPRDGCVVHLKQRCAKLPSF

 

Gene No. : 1-3_155 CYP710A7 100% agreement

704151..705665 (-)

>1-3_155

MVDSLLYGLLDLRMAAPLLAAAVALYVLVEQLSYHRKKGSLPGPPLVVPFIGSATHMIRD

PTGFWEMQAARARKSGVGFTADFLAGKFTIFIRDSELSNRVFANVRPDAFFVIGHPFGKK

LFGDHNLIYLFGDDHKDLRRRMATNFTPRALSTYAAIQQRGIVSHLRRWLDRSAANGGKA

EPIRVPCRDMNLETSQTVFAGPYLTEEARERFKSDYNLFNVGLLAFPVDLPGLAFRRARQ

AVARLVRMLRDCARESKARMRAGGEPECLVDYWMQETVREIDEAKAAGLPPPAHISDDEE

IGGFLFDFLFAAQDASTSSLCWAVSALDSHPDVLARVRAEVASLWSPDSGEPITADKIAE

MKYTKAVAREVVRHRPPATLMPHIALQNFQLTESYTIPKGTLVLPSMYESSFQGFHDPDA

FDPERFFSEARREDVVYKRNFLAFGAGPHQCVGQRYALNHLVIFMALFVSLVDFRRERTE

GCDVPVYMPTMVPRDGCVVYLKQR

 

Gene No. : 1-3_159 CYP710A8 100% agreement

723800..725326 (-)

>1-3_159

MAAVVDFLDLRAAAPFVVAALAFYFLVEQLSYHRKKGPLPGPPLVVPFVGSVAHMIRDPT

GFWDAQAARARKSGAGLAADFLIGRFVVFIRDSELSHRVFANVRPDAFHLIGHPFGKKLF

GDHNLIYMFGEDHKDLRRRIAPNFTPRALSTYAAIQQRVILSHLRRWLDRSAANGGKAEP

IRVPCRDMNLETSQTVFAGPYLTKEAREKFERDYNFFNVGLMALPVDLPGFAFRSARLGV

ARLVRTLGECARASKARMRAGGEPECLVDFWMQETVREIDEAKAAGKPPPAHTDDEELGG

FLFDFLFAAQDASTSSLCWAVSALDSHPDVLAGVRAEVASLWSPESGEPITAEKIAEMKY

TQAVAREVVRHRPPATLVPHIAGEEFQLTEWYTIPKGTIVFPSVYESSFQGFPEPDTFDP

ERFFSEARREDVVYKRNFLAFGAGPHQCVGQRYALNHLVLFMALFVSVVDFRRDRTEGCD

EPVYMPTIVPRDSCTVYLKQRCAKFPSF

 

Gene No. : 1-3_343 CYP71T1 100% agreement

1694984..1695988 , 1696589..1697260 (+)

>1-3_343

MELSSSLAAVLHSPLFLLAALLLLPVFTLLSFSSAKKPGDGGGWRLPLPPSPRGVPFLGH

LPLLGSLPHRKLRSMAEAHGPVMLLWFGRVPTVVASSAASAQEAMRARDAAFASRARVRM

AERLIYGRDMVFAPYGEFWRQARRVSVLHLLSPRRIASFRGVREQEVAALLDRVRRRCGV

RGGGETVNLSDLLMSYANGVISRAAFGDGAYGLDGDEGGEKLRELFANFEALLGTATVGE

FVPWLAWVDKLMGLDAKAARISAELDGLLERVIADHRERRRLSQPDGGDGDGDGDENVDH

RDFVDVLLDVSEVEEGAGAGEVLLFDTVAIKAIILDMIAAATDTTFTTLEWAMAELINHP

PVMRKLQCEIRAAVGVPGASGGAEVTEDHLGELRLLRAVVKETLRLHAPVPLLVPRETVE

DTELLGYRVPARTRVIINVWAIGRDAAAWGDRAEEFVPERWLDGGGEEVEYAAQLGQDFR

FVPFGAGRRGCPGAGFAAPSIELALTNLLYHFDWELPPHADGAAAATAARLDMGELFGLS

MRMKTTLNLVAKPWSSDV

 

Gene No. : 1-3_344 CYP71T2 exon 1 only + C-term extension

1699829..1700905 (+)

>1-3_344

MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLPLPPSPPGVPLLGH

LPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRTRDLAFASRPRVRM

SERLFYGRDMAFAPYGEFWRQARRVTVLHLLSPRRVLSFRGVREQEVAALLDRVRRRCGG

GGETVNLSDLLMSYAHGVISRAAFGHGGAHGFDGDEGGEKLRKLFADFEGLLGTMTVGEF

VPWLAWVDKLTGLDAKVARTSAAMDGLLERVIADHRERRRSRGQAVGDGEADADHRDFVD

VMLDVSEAEEGAGAGAGGVLFDTVAIKAVIL VRTPLVVVLLTCRSADATVDYFLWNQT

 

Gene No. : 1-3_345 CYP71T2 exon 2 only + N-term extension

1702184..1702900 (+)

>1-3_345

MNERFIEQ DMMAAGTDSSFTTTEWVMAELINHPRVMRKLQDEIRAVVGTSSASAAAAATG

GGQVTEDHLGELPFLRAVIKEMLRLHAPGPLLLPRETVEDTELLGYRIPARTRVIINVWA

IGRDAAAWGDSAEEFVPERWLDGGGGGGVEYAQQLGKDSRFVPFGAGRRGCPGAGFAALS

VELALANLLYHFDWELPPPAASGIMATTRLDMDELFGLSVRLKADLNLVAKPWSPGAS

 

>AP003434.1b $F CYP71T2 chromosome 1, PAC clone:P0452F10, complete = AA754300

AA754300      42% IDENTICAL TO 71A14   1/98 I-HELIX 43% to 703A2

39698 MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLP

39839 LPPSPPGVPLLGHLPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRT 40018

40019 RDLAFASRPRVRMSERLFYGRDM AFAPYGEFWRQARRVTVLHLLSPRRVLSFRGVREQE 40195

40196 VAALLDRVRRRCGGGGETVNLSDLLMSYAHGVISRAAFGHGGAHGFDGDEGGEKLRKLFA 40375

40376 DFEGLLGTMTVGEFVPWLAWVDKLTGLDAKVARTSAAMDGLLERVIADHRERRRSRGQA 40552

40553 VGDGEADADHR DFVDVMLDVSEAEEGAGAGAGGVLFDTVAIKAVIL (0)

42074 DMMAAGTDSSFTTTEWVMAELINHPRVMRKL 42169

42170 QDEIRAVVGTSSASAAAAATGGGQVTEDHLGELPFLRAVIKEMLRLHAPGPLLLPRETVE 42349

42350 DTELLGYRIPARTRVIINVWAIGRDAAAWGDSAEEFVPERWLDGGGGGGVEYAQQLGKDS 42529

42530 RFVPFGAGRRGCPGAGFAALSVELALANLLYHFDWELPPPAASGIMATTRLDMDELFGLS 42709

42710 VRLKADLNLVAKPWSPGAS* 42769

 

Gene No. : 1-3_346 CYP71T3 100% agreement

1708142..1709101 , 1711273..1711902 (+)

>1-3_346

MAVSLVVVVVVVIAIVVPLLYLVLLPAWKPARRDDGDGGMRRRLPPSPPWGLPLLGHLHL

LGALPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRTRDLEFASRPRVAMAER

LLYGGRDVAFAPYGEYWRQTRRICVVHLLSARRVLSFRRVREEEAAALVARVRAAGGAVD

LVEHLTAYSNTVVSRAVFGDESARGLYGDVDRGRVLRKLFDDFVELLGQEPMGELLPWLG

WVDALNGMEVKVQRTFEALDGILEKVIDDHRRRRREVGRQMDDGGGGDHRDFVDVLLDVN

ETDMDAGVQLGTIEIKAIILDMFAAGTDTTTTVIEWAMAELITHPDAMRNAQDEIKAVVG

ITSHITEDHLDRLPYLKAVLKETLRLHPPLPLLVPHEPSSDTKILGYSIPACTRIVINAW

TIGRDQATWGEHAEEFIPERFLESGLDYIGQDFVLVPFGAGRRGCPGVGFAVQAMEMALA

SLLYNFDWETRVVDRRSEFGTSSLDMSEMNGLSVRLKYGLPLIAISRFP

 

Gene No. : 1-3_348 CYP71T4 extension of exon 2 not correct

1718250..1719233 , 1719660..1720331 (+)

>1-3_348

MAVSLLPAVLVLLAIVAPLLYLVLLPAVKYTTRNGAARWEDDGGDGRRRRRLPPSPRGLP

LLGHLHLLGALPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDMEFASRP

RMAMAELLLYGGRDVAFAPYGEYWRQARRICVVHLLSARRVLSFRRVREEEAAALVGRVR

AAAADVVVDLSDLLIAYSNTVLTRIAFGDESARGGGGGGDRGRELRKVFDDFARLLGTEP

MGELLPWFWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRRMDDDGGGDHRDF

VDVLLDVNETDKDAGIQLGTVEIKAIIM LICFLLHGHEQ DMFVGGSDTTTTMMAWTMAEL

INHPRAMRKAQNEIWAVVGNTSHVTKDHVDKLPYLKAVFKETLRLHPPLPLLIPREPPAD

TQILGYTIPAHTRVVINAWAIGRDAAAWGQQPDEFSPEKFLNSTIDYKGQDFELLPFGAG

RRGCPGIVFGVSAMEIALASLLYHFDWEAAATDHRRRGSQAWALPVDMSEVNGIAVHLKY

GLHVVAKPRMP

 

>AP003434.1d $F CYP71T4 chromosome 1, PAC clone:P0452F10, complete like 71A

58119 MAVSLLPAVL

58149 VLLAIVAPLLYLVLLPAVKYTTRNGAARWEDDGGDGRRRRRLPPSPRGLPLLGHLHLLGA 58328

58329 LPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDMEFASRPRMAMAELLLY 58508

58509 GGRDVAFAPYGEYWRQARRICVVHLLSARRVLSFRRVREEEAAALVGRVRAAAADVVVD 58685

58686 LSDLLIAYSNTVLTRIAFGDESARGGGGGGDRGRELRKVFDDFARLLGTEPMGELLPW 58859

58860 FWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRRMDDDGGGDHRDFVDVLLD 59036

59037 VNETDKDAGIQLGTVEIKAIIM 59102 (0)

59562 DMFVGGSDTTTTMMAWTMAELINHPRAMRKAQNEIWAVVGNTSHVTKDHVDK 59717

59718 LPYLKAVFKETLRLHPPLPLLIPREPPADTQILGYTIPAHTRVVINAWAIGRDAAAWGQQ 59897

59898 PDEFSPEKFLNSTIDYKGQDFELLPFGAGRRGCPGIVFGVSAMEIALASLLYHFDWEAAA 60077

60078 TDHRRRGSQAWALPVDMSEVNGIAVHLKYGLHVVAKPRMP* 60200

 

Gene No. : 3-2_172 CYP709D1 missing end of exon 4

907473..907752 , 907912..908135 , 908724..908977 , 909301..909613 ,

910180..910611 (+)

>3-2_172

MDVPSVVIPILVVLVSRLLTSALVHLLWKPYAITKLFRGQGITGPKYRLFVGSLPEIKRM

KAAAAADEVAAGAHSHDFIPIVLPQHSKWATDHGKTFLYWLGAVPAVSLGRVEQVKQVLL

ERTGSFTKNYMNANLEALLGKGLILANGEDWERHRKVVHPAFNHDKLKFMSVVMAESVES

MVQRWQSQIQQAGNNQVELDLSRELSELTSDVITRSAFGSSHEEGKEVYQAQKELQELAF

SSSLDVPALVFLRGNTRAHQLVKKSRTMLMEIIEGRLAKVEAAEAGYGSDLLGLMLEARA

LEREGNGLVLTTQEIIDECKTFFFAGQDTTSNHLVWTMFLLSSNAQWQDKLREEVLTV NM

VLLESLRLYSPVVIIRRIAGSDIDLGNLKIPKGTVLSIPIAKIHRDRDVWGPDADEFNPA

RFKNGVSRAASYPNALLSFSQGPRGCIGQTFAMLESQIAIAMILQRFEFRLSPSYVHAPM

EAITLRPRFGLPVVLRNLQG

 

>AP003258.2 $F CYP709D1 genomic DNA, chr 1, PAC clone:P0463A02, complete 46% to 709B2

N-term runs off end of contig identical to AP003764.2 (has N-term)

       MLKSTIELYIFTTAIAKKSLHSQTKHKSKMDVPSVVIP

151039 ILVVLVSRLLTSALVHLLWKPYAITKLFRGQGITGPKYRLFVGSLPEIKRMKAAAAADEVA 150857

150856 AGAHSHDFIPIVLPQHSKWATDHG (1)

       KTFLYWLGAVPAVSLGRVEQVKQVLLERTGSFTKNYMNANLEA 150497

150496 LLGKGLILANGEDWERHRKVVHPAFNHDKLK 150404 (0)

149815 FMSVVMAESVESMVQRWQSQIQQAGNNQVELDLSRELSELTSDVITRSAFGSSHEEGKE 149639

149638 VYQAQKELQELAFSSSLDVPALVFLR 149561 (2)

149237 GNTRAHQLVKKSRTMLMEIIEGRLAKVEAAEAGYGSDLLGLMLEARALEREGNGLVLTTQE 149055

149054 IIDECKTFFFAGQDTTSNHLVWTMFLLSSNAQWQDKLREEVLT VCGDAIPTPDMANRLKL 148875 (0)

148359 VNMVLLESLRLYSPVVIIRRIAGSDIDLGNLKIPKGTVLSIPIAKIHRDRDVWGPDADEF 148180

148179 NPARFKNGVSRAASYPNALLSFSQGPRGCIGQTFAMLESQIAIAMILQRFEFRLSPSYVH 148000

147999 APMEAITLRPRFGLPVVLRNLQG* 147928

 

Gene No. : 4_108 CYP71K1 100% agreement

580030..580668 , 580739..581656 (-)

>4_108

MAELPLYLLLLALLVAVPFLCLTRWSLRHGGGGGGRLPPSPWALPVIGHLHHVAGALPHR

AMRDLARRHGPLMLLRLCELRVVVACTAEAAREVTKTHDLAFATRPITPTGKVLMADSVG

VVFAPYGDGWRTLRRICTLELLSARRVRSFRAVREEEVGRLLRAVAAAAAVAALTTPGAT

AAVNLSERISAYVADSAVRAVIGSRFKNRAAFLRMLERRMKLLPAQCLPDLFPSSRAAML

VSRMPRRMKRERQEMMDFIDDIFQEHHESRAAAGAEEDLLDVLLRIQSQDKTNPALTNDN

IKTVIIDMFVASSETAATSLQWTMSELMRNPRVMRKAQDEVRRALAIAGQDGVTEESLRD

LPYLHLVIKESLRLHPPVTMLLPRECRETCRVMGFDVPEGVMVLVNAWAIGRDPAHWDSP

EEFAPERFEGVGAADFKGTDFEYIPFGAGRRMCPGMAFGLANMELALAALLYHFDWELPG

GMLPGELDMTEALGLTTRRCSDLLLVPALRVPLRDHER

 

Gene No. : 8-1_133 CYP76M14 100% agreement

696859..698433 (-)

>8-1_133

MEKSSELWLLWAVFSASLVFLYLTIRRRSGAGAGGKPPLPPGPTPLPLIGNLLDLRGGVI

HDKLAALARVYGPVMMIKLGLNDAVIISSRDAAREAFTRYDRHLAARAIPDTFRANGFHE

RSAVFLPSSDERWKALRGIQGTHIFTPRGLAAVRPVRERKVRDIIAYFRDHAGEELVIRQ

AIHTGVLNLVSSSFFSMDIAGMGSETARELREHVDEIMTVFAQPNVSDYFPFLRRLDLQG

LRRSTKRRFDRIFSILDDIVERRLVDRGERGGEGGASSNSSKSKHQYDGGDFLDALLELM

VTGKMERDDVTAMLFEAFVAGGDTVAFTLEWVMADLLRNPPVMAKLRAELDDVLGGKDQS

AIEEHDAGRLPYLQAVLKESMRLHSVGPLLHHFAAEDGVVVGGYAVPRGATVLFNTRAIM

RDPAAWERPEEFAPERFLAREGKAPVDFRGKEADFIPFGSGRRLCPGIPLAERVMPYILA

LMLREFEWRLPDGVSPEELDVSEKFMSVNVLAVPLKAVPVKVIN

 

Gene No. : 8-1_562 CYP72A31P pseudogene, not intact

3057089..3057514 , 3057625..3058009 , 3058183..3058256 (-)

>8-1_562

MLYTPYHKEMYMSVLLTSHGSNLPM SLPIENNRKMHQINKEIESILRGLIGKRMQAMKEG

ESTKDDLLGILLESNTKHMEENGQSSQGLTIKDIVEECKLFYFAGAETTSVLLTWTMLLL

SIHPEWQDHAREEIMGLFRKNKPDYEGLSRLKIVTMIFYEVLRLHPPFIEIGWKTYKEME

IGGVTYPAGVSIKIPVLFIHHDPDSWGSDVHEFKPERFSEGISKASKDPGAFLPFGWGPR

ICIGQNFALLESKMALCLILQRLEFELAPSYTHAPHTMVTLHPMHGAQMKVRAI

 

>AP003278 $P CYP72A31P chromosome 1, PAC clone:P0518F01, similar to 72A22 missing N-term half

AP003330.1 chromosome 1 clone B1085F01 CYP72A like

Pseudogene, no N-term in 9000bp upstream until next p450 ends near 22400

31539 SLPIENNRKMHQINKEIESILRGLIGKRMQAMKEGESTKDDLLGILLESNTKHMEENGQSS 31718

31719 QGLTIKDIVEECKLFYFAGAETTSVLLTWTMLLLSIHPEWQDHAREEIMGLFRKNKPDYE 31898

31899 GLSRLKI

32030 VTMIFYEVLRLHPPFIEIGWKTYKEMEIGGVTYPAGVSIKIPVLFIHHDPDSWGSDVHEF 32209

32210 KPERFSEGISKASKDPGAFLPFGWGPRICIGQNFALLESKMALCLILQRLEFELAPSYTH 32389

32390 APHTMVTLHPMHGAQMKVRAI 32452 or frameshift after KVR to

      SYMIISDYSVFYYYNSWL* (compare with end of 72A33)

 

Gene No. : 8-1_564 CYP72A32 end of gene is incorrect, missing heme signature

3066867..3066964 , 3067301..3067529 , 3067638..3068022 , 3068485..3068729 , 3069130..3069681 (-)

>8-1_564

MVLGGWLLMWAPASSPTILVAFGLLFGLVLAWQAGLQLHRLWWRPRRLEKALRARGLRGS

SYRFLTGDLAEESRRRKEAWARPLPLRCHDIAPRIEPFLHDAVVRPEQHYGKPCITWLGP

TPEVHVTDPELAKVVMSNKFGHFEKIRFQALSKLLPQGLSYHEGEKWAKHRRILNPAFQL

EKLKLMLPVFSACCEELISRWMGAIGSDGSYEVDCWPELKSLTGDVISRTAFGSSYLEGR

RIFELQGELFERVMKSVEKIFIPGYMYLPTENNRKMHQINKEIESILRSMIGKRMQAMKE

GESTKDDLLGILLESNMRHTEENSQSSQGLTIKDIMEECKLFYFAGADTTSVLLTWTILL

LSMHPEWQDRARKEILGLFGKNKPEYDGLNNLKIVTMILYEVLRLYPPFIELKRRTYKEM

KIGGVTYPAGVIINLPVLFIHHDLKIWGSDVHEFKPERFSEGISKASKDP VYGVIDFCDT

FDRLSYPLRFLMYDMVNFLQCV

 

>AP003278a $F CYP72A32 19863-22437 chromosome 1, PAC clone:P0518F01, similar to 72A22

AP003330.1 50023-47446 chromosome 1 clone B1085F01, CYP72A like 536aa

AP004738.1 Oryza sativa chromosome 6 clone OSJNBa0090D06 chrom. conflict

50023 MVLGGWLLMWAPASSPTILVAFGLLFG

49942 LVLAWQ AGLQLHRLWWRPRRLEKALRARGLRGSSYRFLTGDLAEESRRRKEAWARPLPLR 49763

49762 CHDIAPRIEPFLHDAVVRPEQHYGKPCITWLGPTPEVHVTDPELAKVVMSNKFGHFEKIR 49583

49582 FQALSKLLPQGLSYHEGEKWAKHRRILNPAFQLEKLK 49472 (0)

49071 LMLPVFSACCEELISRWMGAIGSDGSYEVDCWPELKSLTGDVISRTAFGSSYLEGR 48904

48903 RIFELQGELFERVMKSVEKIFIPGYM 48826 (2)

48363 YLPTENNRKMHQINKEIESILRSMIGKRMQAMKEGESTKDDLLGILLESNMRHT 48202

48201 EENSQSSQGLTIKDIMEECKLFYFAGADTTSVLLTWTILLLSMHPEWQDRARKEILGLFG 48022

48021 KNKPEYDGLNNLKI (0)

      VTMILYEVLR 47842

47841 LYPPFIELKRRTYKEMKIGGVTYPAGVIINLPVLFIHHDLKIWGSDVHEFKPERFSEGIS 47662

47661 KASKDP GAFLPFGWGPRICIGQNFALLEAKMALCLILQRLEFELAPTYTHAPHTMITLHP 47482

47481 MHGAQIKIRAI* 47446

 

Gene No. : 8-1_565 CYP72A33 end is wrong and two exon boundaries disagree

3071048..3071076 , 3075238..3075367 , 3075707..3075777 , 3076421..3076649 , 3076758..3077142 , 3077951..3078246 , 3078684..3079205 (-)

>8-1_565

MWAPASSPTILAAFGLVGLVLAWQ AGLQLHRLWWRPRRLEKALRARGLRGSRYRFLTGDL

AEEGRRRKEAWARPLPLRCHDIAPRVEPFLHGAVGVGAAHGKPRITWFGPTPEVHVADPE

LARVVLSNKFGHFEKVSFPELSKLIPQGLSAHEGEKWAKHRRILNPVFQLEKLK SILFLY

LIIEMSSENVQ LMLPVFSACCEELISRWMGSIGSDGSYEVDCWPEFKSLTGDVISRTAFG

SSYLEGRRIFELQGELFERVIKSIQKMFIPG YM YLPTENNRKMHQMNKEIESILRGMIGK

RMQAMKEGESTKDDLLGILLESNTRHMEVNGQSNQGLTIKDIMEECKLFYFAGADTTSVL

LTWTMLLLSMHPEWQDRAREEILGLFGKNKPDYDGLSRLKIVTMILYEVLRLYPPFIELT

RKTYKEMEIGGITYPAGVIINLPVMFIHHDPEIWGSDVHEFKPERFSEGISKASKDP VEV

PRRMEIHRSFDLPRDNIQMSQNRSPCGAPPSSSYRQIRYPEQLADEPIRLSPPDGNGDIL

DLRGIKLEHFASS

 

>AP003278b $F CYP72A33 chromosome 1, PAC clone:P0518F01, 82% to 72A22

AP003330.1 59493-56536 chromosome 1 clone B1085F01, CYP72A like 516aa

N-term does not match in both, 3278 has MVLGGGWLSMWAPASSPTILAAFGLVGLVLAWQ

before the AGLQ seq.

59493 MVLEGK AGLQLHRLWWRPRRLEKALRARGLRGSRYRFL

      TGDLAEEGRRRKEAWARPLPLRCHDIAPRVEP 59284

59283 FLHGAVGVGAAHGKPRITWFGPTPEVHVADPELARVVLSNKFGHFEKVSFPELSKLIPQG 59104

59103 LSAHEGEKWAKHRRILNPVFQLEKLK 59026 (0)

58537 LMLPVFSACCEELISRWMGSIGSDGSYEVDCWPEFKSLTGDVISRTAFGSSYLEG 58373

58372 RRIFELQGELFERVIKSIQKMFIPG 58298 (2)

57483 YLPTENNRKMHQMNKEIESILRGMIGKRMQAMKEGESTKDDLLGILLESN 57334

57333 TRHMEVNGQSNQGLTIKDIMEECKLFYFAGADTTSVLLTWTMLLLSMHPEWQDRAREEIL 57154

57153 GLFGKNKPDYDGLSRLKI (0) VTMIL 56977

56976 YEVLRLYPPFIELTRKTYKEMEIGGITYPAGVIINLPVMFIHHDPEIWGSDVHEFKPER 56800

56799 FSEGISKASKDP GAFLPFGWGPRICIGQNFALLEAKMALCLILQRLEFELATSYTHVPHT 56620

56619 IISLHPMHGAQIKVKSYMTISDYSVFY* 56536

 

 

Gene No. : 8-2_301 CYP72A17 N-term extension, C-term is wrong,

missing middle exons 2 and 3

1594648..1594915 , 1598386..1598464 , 1598535..1598717 , 1601636..1601742 , 1601972..1602084 , 1602475..1602841 , 1604468..1604841 , 1604937..1605347 , 1605464..1605643 (+)

>8-2_301

MEPSTRTRRLQPNRAPLGRYGEGGSRRIRRRRGDQNRILEAIPRWRGARGFVQQQQHQEK

GGGFGGEERRRQERRRQQEQKGNSTSSMGFLSTCAYGYLGRVDLQNSVHQSTCTVSTTSS

ASSHICFLYINLRIMSISIENDVNDNCDRNSNGGNGNGSIGNDNINTTTSESRAFCGFSF

LRPYRAVRYLRDLQPYILSSIQSASRVPPPEAPPSLPACSFGRSRPPPSLISNLSTVDHA

DAGDASPAYKREKEA MGIGIGIGIGIGIGTGTGAALPFGEASPWSLLGGAVAALLLVWAA

QMLEWAWLAPRRMERALRAQGLRGTQYRFLHGDLTEDLRLVTAARSKPVPMDRPHDFIPR

VAPLLHRALEEH ENNRRMKAIDREIKSILRGIIEKRQKATKNGEASKDDLLGLLLQSNMD

YYSDEDGKSSKGMTVEEIIDECKLFYFAGMETTAVLLTWTMVALSMHPEWQDRAREEILQ

VFGRNKPDINGVSRLKV VTMVLHEVLRLYPPVVMMNRRTYKEIELGGVRYPAGVMLSLPV

LFIHRDAAAWGHDAGEFDPGRFAEGVARACKDPGAGAFFPFSWGPRICIGQNFALLEAKV

ALGMILQRFAFELSPAYAHAPYTVLTLHPQHGVP NTFADKHWKLHVPGIRHSEISISDMK

AEHYLLTTAAAVPCIVEVYEYPYSISRVFSWSS

 

>CYP72A17 $F AP002839 Oryza sativa genomic DNA, chromosome 1 36553-39431

AG025591.1 strain ND3008 PCR from rice genomic DNA clone T8121T.Length = 401

AG025107.1 strain NC2542 PCR from rice genomic DNA clone T5184T.Length = 504

AU071192 very similar to AQ050520 = 72A17

AP002744 CYP72A17 join(109468..109819,110022..110245,110529..110781,

111446..111819,111915..112346)

MGIGIGIGIGIGIGTGTGAALPFGEASPWSLLGGAVAALLLVWAAQMLEWAWLAPRRMERALR

AQGLRGTQYRFLHGDLTEDLRLVTAARSKPVPMDRPHDFIPRVAPLLHRALEEH (phase 1 intron)

GRVSFTWFGPMPRVTITDPDLVREVLSNKFGHF

EKTKLATRLSKLLVGGLVILHGEKWVKHRRIMNPAFHAEKLK (phase 0 intron)

RMLPAFSASCSELIGRWENAVAASVGKAELDIWPDFQNLSGDVISRAAFGVRHHEGRQ

IFLLQAEQAERLVQSFRSNYIPGLS (phase 2 intron)

LLPT ENNRRMKAIDREIKSILRGIIEKRQKATKNGEASKDDLLGLLLQSNMDYYSDEDGKS

SKGMTVEEIIDECKLFYFAGMETTAVLLTWTMVALSMHPEWQDRAREEILQVFGRNKPDI

NGVSRLKV (phase 0 intron)

VTMVLHEVLRLYPPVVMMNRRTYKEIELGGVRYPAGVMLSLPVLFIHRDAAAWGHDAGEF

DPGRFAEGVARACKDPGAGAFFPFSWGPRICIGQNFALLEAKVALGMILQRFAFELSPAY

AHAPYTVLTLHPQHGVP VRLRRL*

 

Gene No. : 8-2_302 CYP72A18 C-term extension is wrong. 

1605912..1606000 , 1606047..1606200 , 1606390..1606478 , 1607050..1607241 , 1607596..1607992 , 1608401..1608779 , 1609428..1609672 , 1609767..1609987 , 1610630..1610930 (-)

>8-2_302

MLMMLGAASQWILAAAAAAAVAALLWLAVSTLEWAWWTPRRLERALRAQGIRGNRYRLFT

GDVPENVRLNREARKKPLPLGCHDIIPRVLPMFSKAVEEHGKPSFTWFGPTPRVMISDPE

SIREVMSNKFGHYGKPKPTRLGKLLASGVVSYEGEKWAKHRRILNPAFHHEKIKRMLPVF

SNCCTEMVTRWENSMSIEGMSEVDVWPEFQNLTGDVISKTAFGSSYEEGRRIFQLQAESA

ERIIQAFRTIFIPGYWFLPTKNNRRLREIEREVSKLLRGIIGKRERAIKNGETSNGDLLG

LLVESNMRESNGKAELGMTTDEIIEECKLFYFAGMETTSVLLTWTLIVLSMHPEWQERAR

EEVLHHFGRTTPDYDSLSRLKIVTMILYEVLRLYPPVVFLTRRTYKEMELGGIKYPAEVT

LMLPILFIHHDPDIWGKDAGEFNPGRFADGISNATKYQTSFFPFGWGPRICIGQNFALLE

AKMAICTILQRFSFELSPSYIHAPFTVITLHPQH VKYITTQSLHSDTSHCENRAGLLGTG

RYVPQDFHQICGSQEQNPFFVIASSLQPTPLGFDRRLKGLVMLIENPVIPLLPNRPGNQG

CDGVESPSVWQAAIVELIGHPTSIEIHQPHEVTTSRRRQRPPNRLVLELHPFQFPQTETD

EGIGLLGEVIQGVDCRPQIEIFEDCLDP

 

>CYP72A18 $F AP002839 Oryza sativa genomic DNA, chromosome 1 44993-41630

AU100789.1 Rice callus Oryza sativa cDNA clone C50810.Length = 419 C-term

AU102126.1 Rice callus cDNA clone C10756.Length = 571

AZ130306.1 OSJNBb0103O04r CUGI Rice BAC genomicLength = 320

C26802 36% TO 72  8/97 N-TERMINAL 19-67 REGION opposite end = C96903

C96903, C97406 58% IDENTICAL TO 72 C-TERM 65% to AQ050520

C96799, C28139 219-340 REGION 55% IDENTICAL TO 72 opposite end = C97406

D22332        48% TO 72     12/93  7/98 C-HELIX 89-191 REGION

AU081507.1 Rice callus Oryza sativa cDNA clone C12518_12Z.Length = 581

C26235        36% IDENTICAL TO 72     8/97 AMINO ACIDS 89-216 REGION

AP002744 complement(join(114545..114970,115379..115757,

116406..116650,116745..116965,117608..117908))

D21882        53% TO 72   5/93  7/98 245-352 REGION = 72A18

MLMMLGAASQWILAAAAAAAVAALLWLAVSTLEWAWWTPRRLERALRAQG

IRGNRYRLFTGDVPENVRLNREARKKPLPLGCHDIIPRVLPMFSKAVEEH (phase 1 intron)

GKPSFTWFGPTPRVMISDPESIREVMSNKFGHYGKPKPTRLGKLLASGVV

SYEGEKWAKHRRILNPAFHHEKIK (phase 0 intron)

RMLPVFSNCCTEMVTRWENSMSIEGM

SEVDVWPEFQNLTGDVISKTAFGSSYEEGRRIFQLQAESAERIIQAFRTIFIPGYW (phase 2 intron)

FLPTKNNRRLREIEREVSKLLRGIIGKRERAIKNGETSNGDLLG

LLVESNMRESNGKAELGMTTDEIIEECKLFYFAGMETTSVLLTWTLIVLS

MHPEWQERAREEVLHHFGRTTPDYDSLSRLKI (phase 0 intron)

VTMILYEVLRLYPPVVFL

TRRTYKEMELGGIKYPAEVTLMLPILFIHHDPDIWGKDAGEFNPGRFADG

ISNATKYQTSFFPFGWGPRICIGQNFALLEAKMAICTILQRFSFELSPSY

IHAPFTVITLHPQH GAQIKLKKI*

 

Gene No. : 8-2_305 CYP72A19 one exon boundary does not agree

1617645..1618070 , 1618263..1618587 , 1618765..1619009 , 1619127..1619606 (-)

>8-2_305

MVYGLLGLALLWQVHRLLVRLWWQPRRLERALRAQGVRGTSYRFLTGDLKDYGRLSKEAW

ARPLPLRCHDIAPRVAPFVHRTIAEHGKACLSWFGPIPKVTIADAEIAKDVLSNKMGHFE

KLKFPVLSKLLADGVANYEGEKWAKHRRILNPAFHLEKLKLMLPAFSACCEELVGRWAAS

LGSDGSNEIDVWPEMQSLTGDVISRTAFGSSYLEGRRIFQLQAEQQELFMGAIQKISIPG

YMSLPTKNNRRMYQIKNEVESIIRDLVQKRMHAMKDGERTKDDLLGILLESSTRHADENG

HSGPGMTIEEVMEECKVFYFAGMETTAILLTWTMVVLSMHPEWQHRAREEV TMILYEVLR

LYPPGIGFVRQTYKEMEIGGVKYPAGVMIELPLLFIHHDPDIWGSDVNEFKPERFAEGIS

RASNDHGAFFPFGWGPRICMGQNFALLEAKMALCMILQRFEFELAPSYTHAPHIVLMLRP

MHGAPIKLRAI

 

>CYP72A19 $F AP002839 Oryza sativa genomic DNA, chromosome 1 comp(53699-51708)

AU100635.1 Rice callus clone C10787.Length = 594

AG024141.1 strain ND3053 PCR from ricegenomic clone ND3053_0_734_1A.Length = 374

AP002744 complement(join(124623..125048,125181..125565,

125743..125987,126105..126584)) this annotation missing first 10 aa

MDPTSVPWSSMVYGLLGLALLWQVHRLLVRLWWQPRRLERALRAQGVRGTSYRFLTGDLKDYGRLS

KEAWARPLPLRCHDIAPRVAPFVHRTIAEHGKACLSWFGPIPKVTIADAEIAKDVLSNKMGHFEKLKFPVLS

KLLADGVANYEGEKWAKHRRILNPAFHLEKLK (phase 0 intron)

LMLPAFSACCEELVGRWAASLGSDGSNEIDVWPEMQSLTGDVISRTAFG

SSYLEGRRIFQLQAEQQELFMGAIQKISIPGYM (phase 2 intron)

SLPTKNNRRMYQIKNEVESIIRDLVQKR

MHAMKDGERTKDDLLGILLESSTRHADENGHSGPGMTIEEVMEECKVFYFAGMETTAILL

TWTMVVLSMHPEWQHRAREEV LSLFQKNKLDYEGLSKLKT (intron joint not correct missing one

base?)

VTMILYEVLRLYPPGIGFVRQTYKEMEIGGVKYPAG

VMIELPLLFIHHDPDIWGSDVNEFKPERFAEGISRASNDHGAFFPFGWGPRICMGQNFAL

LEAKMALCMILQRFEFELAPSYTHAPHIVLMLRPMHGAPIKLRAI

 

 

Gene No. : 8-2_308 CYP72A20 two boundaries do not agree

1622500..1622830 , 1623331..1623551 , 1623658..1623905 , 1624026..1624341 , 1624513..1624941 (+)

>8-2_308

MEEATG MEVGVSLEAGKPAAAPWGMLYYGVPALLVLGALYRAAERCWLGPRRVAGALQGQ

GLRGTAYRFPAGDLPENARRSKEARAKPMPPCHDIVPRVAPLLQDIVKEYGNVCITWFGT

TPRVVIAEPELVKDILSNKFGHFEKFTLKSLGKLIALGLASYEGEKWARHRRILNPAFHL

EKLKHMLPAFSTCCSEMIDRWDSKLAGSDGPFELDIWQEFQNLTGDVISRTAFGSSFMEG

RRIFQLQEEQADRIIKTIQYIYIPGYL YFPTENNRRMKENSREIEGLLRGIIEKRSRAVE

NGELSGDDLLGLMLKSNMDSGEPSNLRMSTEDVIEECKLFYFAGMETTSVLLTWTLVVLS

MHPEWQHRAREE VTMILHEVLRLYPPAVTLSRRTFKEIQIGGITYPAGVGLELPIILIHH

NTDVWGKDAHEFKPERFADGISKATKTNQQAFFPFGWGPRICIGQNFAMLEAKMALCVIL

QNFEFQLSPSYTHAPYASVTLHPQHGAQIILTRL

 

>CYP72A20 $F AP002839 Oryza sativa genomic DNA, chromosome 1 56581-59004

AP002744 join(129496..129808,130309..130529,130636..130883, 131004..131919)

MEVGVSLEAGKPAAAPWGMLYYGVPALLVLGALYRAAERCWLGPRRVAGALQGQGLRGTAYRFPAGDL

PENARRSKEARAKPMPPCHDIVPRVAPLLQDIVKEY (PHASE 1 INTRON)

GNVCITWFGTTPRVVIAEPELVKDILSNKFGHFEKFTLKSLGKLIALGLASYEGEKWARH

RRILNPAFHLEKLK (phase 0 intron)

HMLPAFSTCCSEMIDRWDSKLAGSDGPFELDIWQEFQNLTGDVISRTAFGSSFMEGRRI

FQLQEEQADRIIKTIQYIYIPGYL (phase 2 intron)

VSCR YFPTENNRRMKENSREIEGLLRGIIEKRSRAVENGELSGDDLLGLMLKSNMDSGE

PSNLRMSTEDVIEECKLFYFAGMETTSVLLTWTLVVLSMHPEWQHRAREEV LSAFGRDKP

NFDGLSRLKT (intron joint not correct as in gene 3)

VTMILHEVLRLYPPAVTLSRRTFKEIQIGGITYPAGVGLELPIILIHHNTDVWGKDAHEFKPERFADGISKAT

KTNQQAFFPFGWGPRICIGQNFAMLEAKMALCVILQNFEFQLSPSYTHAPYASVTLHPQH

GAQIILTRL*

 

 

Gene No. : 8-2_309 CYP72A21 C-term missing

1627909..1628439 , 1628545..1628789 , 1628910..1629294 , 1629402..1629592 , 1629673..1629694 (+)

>8-2_309

MVLGAWLMSPASVPWSLLAYGVLGLVLLWQAGRLLHSLWWRPRRLELALRAQGLRGTRYR

FLTGDLGEHGRLNREAWARPLPLRCHDIAPRVAPFLHNAVREHGSACFTWFGPTPKVTIT

DPDLAKGVLSNKFGHFEKPKFPTLTKLFSDSLANHEGEKWVKHRRILNPAFHLEKLKLML

PAFSACCEELVSKWMESLGSDGSYEVDVWPEMQILTGDVISRTAFGSSYLEGRRIFQLQA

EQTERLLKCMQKIVIPGYMSLPTKNNRKMHQIKKETDSILRGLVDKRMQAMKEGECTKDD

LLGLLLESNMRHTEEDGQSNHGLTIEEVIEECKLFYFAGMETTSVLLTWTILLLSMHPEW

QDRAREEILGLFGKNKPEYEGLSRLKIVTMILYEVLRLYPPAVTFTRKTYKQMEIGGVTY

PAGVIVELPVLLIHHDPNIWGSDAHEFKPDR SKLRIA

 

>CYP72A21 $F AP002839 Oryza sativa genomic DNA, chromosome 1 61993-63890

AZ045374.1 nbeb0080P16f CUGI Rice BAC genomic Length = 843

AQ857269.1 nbeb0005G04r CUGI Rice BAC genomic Length = 855

AQ865258.1 nbeb0025C03f CUGI Rice BAC genomicLength = 738

AP002744 join(134887..135417,135523..135767,135888..136272,

136380..136805) this annotation adds first 7 aa

AQ050520, AQ272173, AQ159375, AU031882 (opposite end = D24685)

AQ575977 nbxb0088K02r

D24685.1 RICR2374A Rice root cDNA clone R2374_1A.Length = 419 = 72A

MVLGAWLMSPASVPWSLLAYGVLGLVLLWQAGRLLHSLWWRPRRLELALRAQGLRGTRYRFLTGD

LGEHGRLNREAWARPLPLRCHDIAPRVAPFLHNAVREHGSACFTWFGPTPKVTITDPDLA

KGVLSNKFGHFEKPKFPTLTKLFSDSLANHEGEKWVKHRRILNPAFHLEKLK (phase 0 intron)

LMLPAFSACCEELVSKWMESLGSDGSYEVDVWP

EMQILTGDVISRTAFGSSYLEGRRIFQLQAEQTERLLKCMQKIVIPGYM (phase 2 intron)

SLPTKNNRKMHQIKKETDSILRGLVDKRMQA

MKEGECTKDDLLGLLLESNMRHTEEDGQSNHGLTIEEVIEECKLFYFAGMETTSVLLTWT

ILLLSMHPEWQDRAREEILGLFGKNKPEYEGLSRLKI (PHASE 0 intron)

VTMILYEVLRLYPPAVTFTRKTYKQMEIGGVTYPAGVIVELPVLLIHHDPNIWGSDAHEF

KPDR FVEGISKASKNPGAFLPFGWGPRICIGQNFALLEAKMALCMILQCFKLELMPSYTH

APYSMVTLRPMHGAQIKLRAI*

 

Gene No. : 8-2_311 fusion of CYP72A22 missing C-term and CYP72A23

1632372..1632902 , 1633047..1633300 , 1633441..1633645 , 1633936..1634126 , 1636460..1636722 , 1638075..1638625 , 1638719..1638963 , 1639103..1639487 , 1639603..1640028 (+)

>8-2_311

MVLGAGLRCPASVPWSSLAYGLLGLVLLWQGGRLLHRLWWRPRRLELALRAQGLRGTRYR

FLTGDLGEHGRLNREAWARPLPLRCHDIAPRVAPFLHSSVREHGKACFSWFGPIPKVTIA

NPDLAKDVLSNKFGHLEKHKFQGLTKLLSDGVASHEGEKWVKHRRILNPAFHLEKLKTLQ

RMLPAFSTCCEELISRWMESLGSEGSYEVDVWPEMQSLTGDVISRTAFGSSYLEGRRIFQ

LQAEQAERLLKCVQKIIIPGYMSLPTKNNRKMHQIKKEIDSILRGLIGKRMQAMREGEST

KDDLLGLLLESNMRHTAEHGQSSQGLTIEE VTMILYEVLRLYPPAVTLTRQTYKQIEIGG

VTYPAGVIIELPLLLIHSDPDIWGSDVHKFNPER RLKVWAFVSDVRGHWASASPAQSWSR

GWRCPLLPAAATSPAAHLAPAAAGGASLLRDSPAGAEREAPLAAAAAAAGRGVGRVGRRR

GESTA MVFGELFSRASLPPPWSLLAYGLVGPVLLWQAGRLLDRLWWRPRRLERALRAQGL

RGTAYRFLLGDLREFGRLNEEAWSSAPLPLGCHDIVPRVTPFVHRNVRDNGRPCCFSWFG

PIPSVTITDPAQVRDVLSNKLGHFEKPKLPALTKLLADGLTSHDGEKWVKHRRIMNPAFH

LEKLKLMLPAFSTCCEELVGKWMDSLGPDGSCELDVWPEMQSLTGDVISRTAFGSSYSEG

RRIFQLQTEQAELFIGAIQKFVIPGYMYLPTKKNRRMRRINSEVESILRGIIGKRMQAIA

EGESTNDDLLGLLLESNMRHADENGRSSPGMTTEDVIEECKLFYFAGMETTSVLLTWTMV

VLSMHPEWQDRAREEVLGLFGRDKPEYEGLSRLKTVTMVLYEVLRLYPPAIVFSRKTYKE

MEIGGVVYPRGVILELPVLFIHHDREIWGRDVHEFRPERFAEGISRASNDRGAFLPFGWG

PRVCIGQNFALLEAKMALCMILQRFEFELAASYTHAPHTVMTLHPMHGAQMKLRMI

 

>CYP72A22 $F AP002839 Oryza sativa genomic DNA, chromosome 1 66435-68424

AP002744 join(139350..139880,140025..140278,140419..140806,

140914..141339)

MVLGAGLRCPASVPWSSLAYGLLGLVLLWQGGRLLHRLWWRPRRLELALRAQGLRGTRYRFLTGDL

GEHGRLNREAWARPLPLRCHDIAPRVAPFLHSSVREHGKACFSWFGPIPKVTIANPDLAKDVLSNK

FGHLEKHKFQGLTKLLSDGVASHEGEKWVKHRRILNPAFHLEKLK (phase 0 intron)

RMLPAFSTCCEELISRWMESLGSEGSYEVDVWPEMQSL

TGDVISRTAFGSSYLEGRRIFQLQAEQAERLLKCVQKIIIPGYM (phase 2 intron)

SLPTKNNRKMHQIKKEIDSILRGLIGKRMQAMREGESTKDDLLGLLLESNMRHTAEHGQSS

QGLTIEE VIEECKLFYFAGMETTSVLLTWTMLLLSMHPEWQDHAREEILGLFGKNKPEYE

GLSRLKI (intron joint not correct)

VTMILYEVLRLYPPAVTLTRQTYKQIEIGGVTYPAGVIIELPLLLIHSDPDIWGSDVHKF

NPER FAEGISKASKDPGAFLPFSWGPRICIGQNFALLETKMALCMILQHLELELALSYTH

APQSIITLRPTHGAQIKLRAI*

 

>CYP72A23 $F AP002839 Oryza sativa genomic DNA, chromosome 1 72149-74091

AP002744 join(145064..145603,145697..145941,146081..146465,

146581..147006)

AU067870 Rice callus Oryza sativa cDNA clone C10320_12Z, CYP72 like Nterm

AU067871 AU067869 very similar to AQ050520 K-helix to heme

MVFGELFSRASLPPPWSLLAYGLVGPV

LLWQAGRLLDRLWWRPRRLERALRAQGLRGTAYRFLLGDLREFGRLNEEAWSSAPLPLGC

HDIVPRVTPFVHRNVRDNGRPCCFSWFGPIPSVTITDPAQVRDVLSNKLGHFEKPKLPAL

TKLLADGLTSHDGEKWVKHRRIMNPAFHLEKLK (PHASE 0 INTRON)

LMLPAFSTCCEELVGKWMDSLGPDGSCELDVWPEMQSLTGDVISRTAFGSSYSEGR

RIFQLQTEQAELFIGAIQKFVIPGYM (PHASE 2 INTRON)

YLPTKKNRRMRRINSEVESILRGIIGKRMQAIAEGESTNDDLLGLLLESNMRHADENGRS

SPGMTTEDVIEECKLFYFAGMETTSVLLTWTMVVLSMHPEWQDRAREEVLGLFGRDKPEY

EGLSRLKT (PHASE 0 INTRON)

VTMVLYEVLRLYPPAIVFSRKTYKEMEIGGVVYPRGVILELPVLFIHH

DREIWGRDVHEFRPERFAEGISRASNDRGAFLPFGWGPRVCIGQNFALLEAKMALCMILQ

RFEFELAASYTHAPHTVMTLHPMHGAQMKLRMI*

 

 

Gene No. : 8-2_319 CYP72A24 C-term incorrect

1675901..1676207 , 1678114..1678334 , 1678425..1678672 , 1678819..1679209 , 1679296..1679700 , 1680310..1680444 (+)

>8-2_319

MG MVVFAAGDERPLMLVWAAVAGAVLAWCAVRAMEWAWWRPRRLERALRAQGLRGTPYRS

PAGDAPLNVQLSAEARARTMPLGCHDVVPRAMPLFHQAMKEHGKVSITWFGPVPRVTITK

PELVREVLSNKFGHFEKLKFGRFQRLLHNGLGSHEGEKWAKHRRIINPAFHLEKLKRMLP

AFAACCTELVDKWEGLAKGGDEPYEVDVWPEMQSLTGDVISRAAFGSSYLEGKRIFQLQG

EQIELIVATMNKIHIPGYIHLPTKSNRRMKQIAAEIEGMLKRIIAKRESALKAGEASSDD

DLLGLLLESNLDHSKGNGGAASSGISIDDVIGECKLFYFAGMETTSVLLTWTMVVLSMHP

EWQDRAREEVLHVFGSRAPDYDGLSRLRIVTMVLYEVLRLYTPLTALQRKTYKPMELGGV

RYPAGVVLTLPLLCVHHDKDVWGADADEFRPERFAEGISKASREAPAFFPFGWGPRICIG

QNFALLEAKMGLSMILQRFSFDLSPSYTHAPFPVGLLQPEHGAQ RCLVRRHLHRLMEPVS

HWSPSPSSHLFNGYNEQPSDVPSSPRHL

 

>CYP72A24 $F AP002839 Oryza sativa genomic DNA, chromosome 1 109970-113787

AQ864347.1 nbeb0023A03r CUGI Rice BAC genomicLength = 735

MVVFAAGDERPLMLVWAAVAGAVLAWCAVRAMEWAWWRPRRLERALRAQGLRGTPYRSPA

GDAPLNVQLSAEARARTMPLGCHDVVPRAMPLFHQAMKEH (PHASE 1 INTRON)

GKVSITWFGPVPRVTITKPELVREVLSNKFGHFEKLKFGRFQRLLHNGLGSHEGEKWAKH

RRIINPAFHLEKLK(PHASE 0 INTRON)

RMLPAFAACCTELVDK

WEGLAKGGDEPYEVDVWPEMQSLTGDVISRAAFGSSYLEGKRIFQLQGEQIELIVATMNK

IHIPGYI (PHASE 2 INTRON)

HLPTKSNRRMKQIAAEIEGMLKRIIAKRESALKAGEASSDDDLLGLLLESNLDHSKGNGGA

ASSGISIDDVIGECKLFYFAGMETTSVLLTWTMVVLSMHPEWQDRAREEVLHVFGSRAPD

YDGLSRLRI (PHASE 0 INTRON)

VTMVLYEVLRLYTPLTALQRKTYKPMELGGVRYPAGVVLTLPLLCVHHDKDVWGADADEF

RPERFAEGISKASREAPAFFPFGWGPRICIGQNFALLEAKMGLSMILQRFSFDLSPSYTH

APFPVGLLQPEHGAQ VRLTRLN*

 

Gene No. : 8-2_320 CYP72A25 one intron joint does not agree

1681409..1681715 , 1681748..1682031 , 1682190..1682427 , 1682557..1682927 , 1683004..1683429 (+)

>8-2_320

MEIVDGASPPLHPWSLLLYALGALAALWWAWRALDRFWLRPRRLGRALRSQGLRGTDYRF

PSGYLKEFARLLAAALAAPMPPLSHDVASRALPFELAAIKQHAAYTHAPANVRDQTEVAW

NAGNVCVTWFGPEVRVIVSDPKLFREILANKNGRFGKQKSILWVQNLLADGLTSHQGEKW

VAHRRIMNHAFHLEKLKRMLPAFAACSSELISRWQDSVGADGAQEIDVWPEFQNLTGDVI

SRSAFGSSFSEGRRIFQLQSEQARNVMKMAKALYFP ELNRRTKANAREVRELLKGIITKR

ESAMKDGHAVNDDLLGLLLETNIKESQEAGSSKPTMTTKDIIEELKLLYFAGSDTTAVLL

TWTMVLLSMHPEWQDRAREEVLRVFGKNSPDFEGINHLKVVTMILHEVLRLYPPILLLGR

EAYEETELGGVTYPPGVTFALPIAGIHHDPDVWGEDVGEFKPERFAEGVSRASKDSPALV

PFSWGPRICVGQNFALLEAKMALSMILQRFSFGLSPSYTHAPFPIPTLQPQHGAQIKLTK

L

 

>CYP72A25 $F AP002839 Oryza sativa genomic DNA, chromosome 1 115472-117492

AG021553.1 strain NC0134 PCR from rice genomic DNA clone NC0134_0_102_1A.

Length = 636 AG021553

AG023207.1 strain NC2780 PCR from rice clone NC2780_0_701_1A.Length = 474

MEIVDGASPPLHPWSLLLYALGALAALWWAWRALDRFWLRPRRLGRALRSQGLRGTDYRFPSGYLKEFARLL

AAALAAPMPPLSHDVASRALPFELAAIKQH (PHASE 1 INTRON)

GNVCVTWFGPEVRVIVSDPKLFREILANKNGRFGKQKSILWVQNLLADGLTSHQGEKWVA

HRRIMNHAFHLEKLK (PHASE 0 INTRON)

VQRMLPAFAACSSELISRWQDSVGADGAQEIDVWPEFQNLTGDVISRSAFGSSFSEGRRI

FQLQSEQARNVMKMAKALYFP GYR (PHASE 2 INTRON

FLPT ELNRRTKANAREVRELLKGIITKRESAMKDGHAVNDDLLGLLLETNIKESQEAGSS

KPTMTTKDIIEELKLLYFAGSDTTAVLLTWTMVLLSMHPEWQDRAREEVLRVFGKNSPDF

EGINHLKV (PHASE 0 INTRON)

VTMILHEVLRLYPPILLLGREAYEETELGGVTYPPGVTFALPIAGIHHDPDVWGEDV

GEFKPERFAEGVSRASKDSPALVPFSWGPRICVGQNFALLEAKMALSMILQRFSFGLSPS

YTHAPFPIPTLQPQHGAQIKLTKL*

 

Gene No. : 9-2_186 CYP706C2 N-term extension, one boundary does not agree

947923..948172 , 949734..950614 , 950768..951400 (+)

>9-2_186

MVVWVVQAHCGLMAVTRDSGYGSGDRQVGLARRRSAWAAQRRRHRSEHQAAVQFSTRSSR

HGRVVDSGAAARQWLNGYRRLAAIMQP MATANLLYAALLVPTVLYLAVTRRRSRRLPPGP

VGLPLVGSLPFIDPNLHTYFASLAAKHGPILSIRLGSKVDIVVNSAQLAREVLRDQDSVF

ANRVMLDAGDAVSFGGAQNIVGNPLGPMWRLLRRVCVQEMMSPAGLASVHGLRRREFRST

LRYLHSKPGEPVDVGAQMFLNTMNVITGTMWGGTIGSESERSAVGSEFRGLVAEVTELLG

TPNVSDLFPVLKPFDLQGIRRKMERLRSRFDLLFTKIIQQRMRSQQDGGEMTTDFLECLL

KMEKEGSDGKTTFTMDN EMVVGGTDTTSNSVEWIMAELLQNPQVLNKVQQELDSIVGRDA

VVEESHLPQLHYLRMVIKETLRLHPPVPLLVPHSPSAAATVGGYHVPEGCRVLINVWAIQ

RNPLVWNKPLDFNPDRFARDGGHKGDFTGSQLDYLPFGSGRRMCAGMAMGEKVMVYSVAM

LLQAFDWKLPQGVQLDLSEKFGIVMKKATPLVAIPTPRLSKPELYYS

 

>AP003378.1a $F CYP706C2 chromosome 1 clone P0047E11, 49% to 706A5

no ortholog in indica on 9/6/02

28850 MATANLLYAALLVPTVLYLAVTRRRSRRLPPGPVGLPLVGSLPFIDPNLHTYFASLAAKH 29029

29030 GPILSIRLGSKVDIVVNSAQLAREVLRDQDSVFANRVMLDAGDAVSFGGAQNIVGNPLGP 29209

29210 MWRLLRRVCVQEMMSPAGLASVHGLRRREFRSTLRYLHSKPGEPVDVGAQMFLNTM 29377

29378 NVITGTMWGGTIGSESERSAVGSEFRGLVAEVTELLGTPNVSDLFPVLKPFDLQGIRRKM 29557

29558 ERLRSRFDLLFTKIIQQRMRSQQDGGEMTTDFLECLLKMEKEGSDGKTTFTMDN VKGFLL 29737 (0)

      EMVVGGTDTTSNSVE 29917

29918 WIMAELLQNPQVLNKVQQELDSIVGRDAVVEESHLPQLHYLRMVIKETLRLHPPVPLLVP 30097

30098 HSPSAAATVGGYHVPEGCRVLINVWAIQRNPLVWNKPLDFNPDRFARDGGHKGDFTGSQL 30277

30278 DYLPFGSGRRMCAGMAMGEKVMVYSVAMLLQAFDWKLPQGVQLDLSEKFGIVMKKATPLV 30457

30458 AIPTPRLSKPELYYS* 30505

 

Gene No. : 9-2_191 CYP711A2 only finds N-term

979355..979581 , 979692..979844 , 981484..981559 (+)

>9-2_191

MEISTVLGAILAEYAVTLVAMAVGFLVVGYLYEPYWKVRHVPGPVPLPLIGHLHLLAMHG

PDVFSVLTRKYGPIFRFHMGRQPLVMVADAELCKEVGVKKFKNFPNRSMPSPITNSPVHQ

KGLFFTS LALSALIPSSMLVSEDEWMRRNWG

 

>AP003254a $F CYP711A2 60% to 711A1 no indica ortholog found on 9/6/02

CDS join(126724..126950,127061..127213,130152..131133, 131374..131706)

MEISTVLGAILAEYAVTLVAMAVGFLVVGYLYEPYWKVRHVPGP

VPLPLIGHLHLLAMHGPDVFSVLTRKYGPIFRFHMGRQPLVMVADAELCKEVGVKKFK

NFPNRSMPSPITNSPVHQKGLFFTS GSRWTTMRN MILSIYQPSHLATLIPSMESCIER

AAENLEGQEEINFSKLSLSFTTDVLGQAAFGTDFGLSKKLASSDDDEDTRKIAADTCA

EAKASSEFIKMHVHATTSLKMDMSGSLSIIVGQLLPFLHEPFRQVLKRLRWTADHEID

RVNLTLGRQLDRIVAERTAAMKRDPAALQQRKDFLSVMLTARESNKSSRELLTPDYIS

ALTYEHLLAGSATTAFTLTTALYLVAKHPEVEEKLLREIDGFGPRDRVPTAEDLQTKF

PYLDQ

VLKEAMRYYPSSPLIARELNQQLEIGGY

PLPKGTWVWMAPGVLGKDPKNFPEPEVFRPERFDPNGEEEKRRHPYALFPFGIGPRAC

IGQKFAIQEMKLSAIHFYRHYVFRPSPSMESPPEFVYSIVSNFKNGAKLQVIKRHI

 

Gene No. : 9-2_192 CYP711A2 finds rest of 711A2 with retained intron seq.

982811..983764 , 984005..984337 (+)

>9-2_192

MILSIYQPSHLATLIPSMESCIERAAENLEGQEEINFSKLSLSFTTDVLGQAAFGTDFGL

SKKLASSDDDEDTRKIAADTCAEAKASSEFIKMHVHATTSLKMDMSGSLSIIVGQLLPFL

HEPFRQVLKRLRWTADHEIDRVNLTLGRQLDRIVAERTAAMKRDPAALQQRKDFLSVMLT

ARESNKSSRELLTPDYISALTYEHLLAGSATTAFTLTTALYLVAKHPEVEEKLLREIDGF

GPRDRVPTAEDLQTKFPYLDQ ARQTLPRRVYIVRHEFLTAHRARMQ VLKEAMRYYPSSPL

IARELNQQLEIGGYPLPKGTWVWMAPGVLGKDPKNFPEPEVFRPERFDPNGEEEKRRHPY

ALFPFGIGPRACIGQKFAIQEMKLSAIHFYRHYVFRPSPSMESPPEFVYSIVSNFKNGAK

LQVIKRHI

 

Gene No. : 9-2_195 CYP711A3 only finds N-term

994882..995108 , 995221..995373 , 996465..996477 (+)

>9-2_195

MEIISTVLGSTAEYAVTLVAMAVGLLLLGYLYEPYWKVRHVPGPVPLPFIGHLHLLAMHG

PDVFTVLARKYGPVFRFHMGRQPLVMVADAELCKEVGVKKFKSIPNRSMPSAIANSLINQ

KGLCFT SKPK

 

>AP003254b $F CYP711A3 82% to seq on same contig

CDS join(142251..142477,142590..142742,145800..146613,

146730..146825,146909..147244)

no indica ortholog found on 9/6/02

MEIISTVLGSTAEYAVTLVAMAVGLLLLGYLYEPYWKVRHVPGP

VPLPFIGHLHLLAMHGPDVFTVLARKYGPVFRFHMGRQPLVMVADAELCKEVGVKKFK

SIPNRSMPSAIANSLINQKGLCFT RGSRWTALRN MIISIYQPSHLASLIPTMQSCIEC

VSKNLDGQEDITFSDLALGFATDVIGQAAFGTDFGLSKISASSNDDDIDKIATDTSAE

AKASSEFIRMHVHATTSLKMDLSGSLSIIIGQLLPFLQEPFRQVLKRIPWTADHEIDH

VNLALGGQMDKIVAERAAAMERDQAAPHAQQRKDFLSVVLAARESNKSWRELLTPDYI

SALTYEHLLAGSATTAFTLSTVLYLVSKHPEVEEKLLREIDGFGPHDHAPTAEDLQTK

FPYLDQ VVKESMRFYFLSPLIA RETCEQVEIGGYALPKGTWVWLAPGVLAKDPKNFPE

PEVFRPERFDPNGEEEKRRHPYAFIPFGIGPRACIGQKFSIQEIKLSVIHLYRNYVFR

HSPSMESPLEFQYSIVCNFKYGVKLR VIKRHTA

 

 

Gene No. : 9-2_196 CYP711A3 finds rest of 711A3 with retained intron seq.

and incorrect C-term extension and some bad intron boundaries

998459..999299 , 999407..999456 , 999540..999851 , 1000430..1000886 ,

1001963..1002132 (+)

>9-2_196

MIISIYQPSHLASLIPTMQSCIECVSKNLDGQEDITFSDLALGFATDVIGQAAFGTDFGL

SKISASSNDDDIDKIATDTSAEAKASSEFIRMHVHATTSLKMDLSGSLSIIIGQLLPFLQ

EPFRQVLKRIPWTADHEIDHVNLALGGQMDKIVAERAAAMERDQAAPHAQQRKDFLSVVL

AARESNKSWRELLTPDYISALTYEHLLAGSATTAFTLSTVLYLVSKHPEVEEKLLREIDG

FGPHDHAPTAEDLQTKFPYLDQ ACMPFFDIGRIYGLRLWSP RETCEQVEIGGYALPKGTW

VWLAPGVLAKDPKNFPEPEVFRPERFDPNGEEEKRRHPYAFIPFGIGPRACIGQKFSIQE

IKLSVIHLYRNYVFRHSPSMESPLEFQYSIVCNFKYGVKLR LSLRLIVCTSTQTHSPTLI

HLGGDGNERGAVAEPPKQEAEASLTNPIEPETEATRSLPITAAHGKTVESTDCAPPTATI

WTLALALATTGSQPPSALTADASKDSVRHILTIVTLSTPTVAPRALPLMMVTASLHPLST

TSSPPPTSHVGLVVVTALWFVSQHGEDDPNLDAVTSSVYLAKSASTIKGIRLVTVYLEVT

CRIQDFGKD

 

Gene No. : 9-2_197 CYP711A4 missing exon 4 and C-term stops too soon

compare end to 711A3

1003560..1003783 , 1004069..1004221 , 1004743..1005550 , 1006236..1006505 (+)

>9-2_197

MDISEVLGATAEWAVTLVAMAVGLLVVAYLYEPYRKVWHVPGPVPLPLIGHLHLLAMHGP

DVFSVLARKHGPVFRFHMGRQPLIIVADAELCKEVGVKKFKSIPNRSMPSPIANSPIHKK

GLFFIRGPRWTSMRNMIISIYQPSHLASLIPTMESCIQRASKNLDGQKEITFSDLSLSLA

TDVIGLAAFGTDFGLSKLPVTPDDSNIDKIAADTSVEAKASSEFIKMHMHATTSLKMDLS

GSLSILVGMLLPFLQEPFRQVLKRIPGMGDYKIDRVNRALKTHMDSIVAEREAAMEHDLA

ASQQRKDFLSVVLTARESNKSSRELLTPDYISALTYEHLLAGSTTTAFTLSTVLYLVAKH

PEVEEKLLKEIDAFGPRYCVPMADDLQTKFPYLDQ GTWVWLAPGVLAKDPKNFPEPEIFR

PERFDPNGEEERRRHPYAFIPFGIGPRVCIGQKFSIQEIKLSMIHLYRHYVFRHSPSMES

PLEF

 

>AP003378.1b $F CYP711A4 chromosome 1 clone P0047E11, 64% to 711A one in frame stop

continues AP003254 contig = AQ859680.1 nbeb0013M03f BAC genomic

AQ860765.1 nbeb0015B04f CUGI Rice BAC genomic

82666 MDISEVLGATAEWAVTLVAMAVGLLVVAYLYEPYRKVWHVPGPVPLPLIGHLHLLAMHG 82842

82843 PDVFSVLARKHGPVFR 82890 (2)

83176 FHMGRQPLIIVADAELCKEVGVKKFKSIPNRSMPSPIANSPIHKKGLFFIR 83322 (2)

83850 GPRWTSMRNMIISIYQPSHLASLIPTMESCIQRASKNLDGQKEITFSDLSLSLAT 84014

84015 DVIGLAAFGTDFGLSKLPVTPDDSNIDKIAADTSVEAKASSEFIKMHMHATTSLKMDLS 84191

84192 GSLSILVGMLLPFLQEPFRQVLKRIPGMGDYKIDRVNRALKTHMDSIVAEREAAMEHDLA 84371

84372 ASQQRKDFLSVVLTARESNKSSRELLTPDYISALTYEHLLAGSTTTAFTLSTVLYLVAKH 84551

84552 PEVEEKLLKEIDAFGPRYCVPMADDLQTKFPYLDQ (0)

      VVKESMRFYIMSPLLARETLEQVEIGGYVLPK 84848 (0)

85342 GTWVWLAPGVLAKDPKNFPEPEIFRPERFDPNGEEERRRHPYAFIPFGIGPRVCIGQKF 85518

85519 SIQEIKLSMIHLYRHYVFRHSPSMESPLEF*FAIICDFKYGVKLQAIKRHHA* 85677

 

Gene No. : 9-2_453 CYP72A35 missing part of an exon

2374696..2375121 , 2376055..2376433 , 2377089..2377252 , 2377612..2377832 , 2378579..2378867 (-)

>9-2_453

MLGEAASPWSLAGAGAAVALLWLCAWTLQWAWWTPRRLERALRAQGLRGTRYRLFIGDVA

ENGRLNREAASRPLPLGSHDVVPRVMPFFCNVLKEHGKLSFVWTGPKPFVIIRDPDLARE

ILSNKSGNFAKQTTAGIAKFVVGGVVTYEGEKWAKHRRILNPAFHQEKIKRMLPVFLACC

TKMITRWVNSMSSEGISELDVWDEFQNLTGDVISRTAFGSSYQEG YLPIENNRRIREIDQ

EIRTILRGIIVKRDKAVRNGEGSNDDLLGLLVESNMRQSNEKEDVGMSIEDMIEECKLFY

AAGSETTSMLLTWTLILLSMHPEWQEQAREEVMHHFGRTTPDHDGLSRLKIVTMILHEVL

RLYPPVVFLQRTTHKEIELGGIKYPEGVNFTLPVLSIHHDPSIWGQDAIKFNPERFANGV

SKATKFQTAFFSFAWGPRICLGQSFAILEAKMALATILQSFSFELSPSYTHAPHTVLTLQ

PQYGSPIKLKKL

 

>AP002899 $F CYP72A35 52% to 72A14 = AQ161379

complement(join(51630..52055,52989..53367,53942..54186,

54546..54766,55513..55801))

MLGEAASPWSLAGAGAAVALLWLCAWTLQWAWWTPRRLERALRA

QGLRGTRYRLFIGDVAENGRLNREAASRPLPLGSHDVVPRVMPFFCNVLKEHGKLSFV

WTGPKPFVIIRDPDLAREILSNKSGNFAKQTTAGIAKFVVGGVVTYEGEKWAKHRRIL

NPAFHQEKIKRMLPVFLACCTKMITRWVNSMSSEGISELDVWDEFQNLTGDVISRTAF

GSSYQEG WRIFQLQEEQAKRVLKAFQRIFIPGYWY LPIENNRRIREIDQEIRTILRGI

IVKRDKAVRNGEGSNDDLLGLLVESNMRQSNEKEDVGMSIEDMIEECKLFYAAGSETT

SMLLTWTLILLSMHPEWQEQAREEVMHHFGRTTPDHDGLSRLKIVTMILHEVLRLYPP

VVFLQRTTHKEIELGGIKYPEGVNFTLPVLSIHHDPSIWGQDAIKFNPERFANGVSKA

TKFQTAFFSFAWGPRICLGQSFAILEAKMALATILQSFSFELSPSYTHAPHTVLTLQP

QYGSPIKLKKL

 

Gene No. : 9-4_219 CYP94D13 splices out a stop codon near PERF motif

1139526..1140716 , 1140846..1141055 (+)

>9-4_219

MEFSSSSTSLFLLLSILPLLYFLCQRNDPKKQPHAHGLKSYPVVGIVPHFTKNNDRFLEF

TTEIMKRSPTQTMSFKALGLTGGGVITANPANVEYTLKTNFGNYPKGELAVSMVVDFLGH

GIFNSDGEQWQWQRKAASYEFNKRSLRNFVVDTVRSEVVERLLPLLERAERDGRTLDVQD

VLERFAFDNICQVAFDEDPACLAEDSMASPQSAEFMRAFNDAQIAVRDRFMSPVKSLWRF

KRLFNMEPERRMREALATIHGFAERIVRERRERGKAGLARSDDFLSRFAASGEHSDESLR

DVVTNFLLAGRDTTSSALTWFFWVLSGRPDVEDKIVREIHAVRRASGSTSDATFSFDELR

DMQYLHAAITESMRLYPPVAMDTHSCKEDDFLPDGTF SPFKYPVFHAGPRMCLGKEMADI

QMKSIVASVLERFSLQYAGGEGHPGLVLSVTLRMKGGLPMQVATRG

 

>AP003232.1h $F CYP94D13 chromosome 1 clone P0034E02 one in frame stop 53% to 94D2

100994 MEFSSSSTSLFLLLSILPLLYFLCQRNDPKKQPHAHGLKSYPVVGIVPHFTKNNDRF 100824

100823 LEFTTEIMKRSPTQTMSFKALGLTGGGVITANPANVEYTLKTNFGNYPKGELAVSMVVDF 100644

100643 LGHGIFNSDGEQWQWQRKAASYEFNKRSLRNFVVDTVRSEVVERLLPLLERAERDGRTLD 100464

100463 VQDVLERFAFDNICQVAFDEDPACLAEDSMASPQSAEFMRAFNDAQIAVRDRFMSPVKSL 100284

100283 WRFKRLFNMEPERRMREALATIHGFAERIVRERRERGKAGLARSDDFLSRFAASGEHSDE 100104

100103 SLRDVVTNFLLAGRDTTSSALTWFFWVLSGRPDVEDKIVREIHAVRRASGSTSDATFSFD 99924

99923  ELRDMQYLHAAITESMRLYPPVAMDTHSCKEDDFLPDGTF VGKGWLVTY*AYAMARVEDI 99744

99743  WGADCEEFRPERWLDEVGAFRPE SPFKYPVFHAGPRMCLGKEMADIQMKSIVASVLERFS 99564

99563  LQYAGGEGHPGLVLSVTLRMKGGLPMQVATRG* 99465

 

Gene No. : 9-4_222 fusion of CYP94D12 and CYP94D11

1149455..1150963 , 1154325..1155779 (+)

>9-4_222

MEFSSSSTSLFLLLSILPLLYFLCQRHDPKKQPHAHGLKSYPVVGTLPHFAKNKDRFLEF

ITEIMKRSPTHTLSFKALGLTGGVITANPANVEYTLKTNFGNYPKGELAVSMLVDFLGHG

IFNSDGEQWQWQRKAASYEFNKRSLRNFVVDTVRSEVVERLLPLLERAERDGRTLDVQDV

LERFAFDNICHVAFDEDPACLAEDSMASPQSAKFMRAFSDAQNAVMDRFMSPVKSRWRFK

RLFNMEPERQMREALATIHGFAERIVRERRERGEAGLARSDDFLSRFAASGDHSDESLRD

VVTNFLIAGRDTTSTALTWFFWLLSGRPDVEDKIVREIHAVRRASGGTGDPTFNLDELRD

MQYLHAAITESMRLYPPVAMDSHSCKEDDFLPDGTFVGKGWFVSYSAYAMARVEDIWGAD

CEEFRPERWLDEAGAFRPESPFKYPVFHAGPRMCLGKEMAYIQMKSIVASVLERFSLRYA

GGEGHPGFVLWLTLRMKGGLPMQ DTKKQPAGSNGLKSYPVVGTLPHFAKNRHRFLEWSTD

VMKRSPTHTMTFKALGLTGGVITANVANVEHILKTNFSNYPKGELSVSLLEDLLGHGIFN

SDGEQWLWQRKAASYEFNQRSLRSFVVDTVRFEVVERLLPLLEWARRDGRTLDVQDVLER

FAFDNICHVVFHEDPACLAEDSMVSSQSAEFIRACSDAQNAIIARFMSPVKSLWRVKRLF

NLDPERRMRDALTTIHGYADRIVRERRARGEAGLARSDDFLSRFAAGGEHSDESLRDVVT

NFLIAGRDSTSSALTWFFWLVSSRPDVEDKIVHEIRAVRSASSSGGTSSATFSFDELRDM

HYLHAAITESMRLYPPVHLDTHSCKEDDFLPDGTFVGKGWLVTYCAYAMGRVEDIWGADC

EEFRPERWLDEAGAFRPDSPFKYPIFHAGPRMCLGKEMAYIQMKSIVACVLEQFSLRYAG

GDGHPGFVLWSTLRMEGGLPMQVTTRE

 

>AP003232.1g $F CYP94D12 chromosome 1 clone P0034E02

AZ135305.1 OSJNBb0115G10r CUGI Rice BAC genomic 53% to 94D2

AZ131112.1 OSJNBb0104L22f CUGI Rice BAC genomic

91065 MEFSSSSTSLFLLLSILPLLYFLCQRHDPKKQPHAHGLKSYPVVGTLPHFAKNKDRF 90895

90894 LEFITEIMKRSPTHTLSFKALGLTGGVITANPANVEYTLKTNFGNYPKGELAVSMLVDFL 90715

90714 GHGIFNSDGEQWQWQRKAASYEFNKRSLRNFVVDTVRSEVVERLLPLLERAERDGRTLDV 90535

90534 QDVLERFAFDNICHVAFDEDPACLAEDSMASPQSAKFMRAFSDAQNAVMDRFMSPVKSRW 90355

90354 RFKRLFNMEPERQMREALATIHGFAERIVRERRERGEAGLARSDDFLSRFAASGDHSDES 90175

90174 LRDVVTNFLIAGRDTTSTALTWFFWLLSGRPDVEDKIVREIHAVRRASGGTGDPTFNLDE 89995

89994 LRDMQYLHAAITESMRLYPPVAMDSHSCKEDDFLPDGTFVGKGWFVSYSAYAMARVEDIW 89815

89814 GADCEEFRPERWLDEAGAFRPESPFKYPVFHAGPRMCLGKEMAYIQMKSIVASVLERFSL 89635

89634 RYAGGEGHPGFVLWLTLRMKGGLPMQ VTTRG* 89539

 

>AP003232.1f $F CYP94D11 chromosome 1 clone P0034E02

86279 MKFSSTSTPLFILLLPFLPLLYFLYLYQ DTKKQPAGSNGLKSYPVVGTLPHFAKNRHRFLEWST 86088

86087 DVMKRSPTHTMTFKALGLTGGVITANVANVEHILKTNFSNYPKGELSVSLLEDLLGHGIF 85908

85907 NSDGEQWLWQRKAASYEFNQRSLRSFVVDTVRFEVVERLLPLLEWARRDGRTLDVQDVLE 85728

85727 RFAFDNICHVVFHEDPACLAEDSMVSSQSAEFIRACSDAQNAIIARFMSPVKSLWRVKRL 85548

85547 FNLDPERRMRDALTTIHGYADRIVRERRARGEAGLARSDDFLSRFAAGGEHSDESLRDVV 85368

85367 TNFLIAGRDSTSSALTWFFWLVSSRPDVEDKIVHEIRAVRSASSSGGTSSATFSFDELRD 85188

85187 MHYLHAAITESMRLYPPVHLDTHSCKEDDFLPDGTFVGKGWLVTYCAYAMGRVEDIWGAD 85008

85007 CEEFRPERWLDEAGAFRPDSPFKYPIFHAGPRMCLGKEMAYIQMKSIVACVLEQFSLRYA 84828

84827 GGDGHPGFVLWSTLRMEGGLPMQVTTRE* 84741

 

Gene No. : 9-4_224 CYP94D10 with C-term extension

putative cyst nematode resistance protein (partial) see NM_192720.1

1159307..1160838 , 1161725..1161756 , 1161873..1162141 , 1162201..1162359 , 1162605..1162652 (+)

>9-4_224

MELSPISASLLLILILLAFLPLLYFLYMHQDPKKKPRIHGLKSYPVVGTLPHIIKNKHRF

LKWSTSIMKCSPTNTMSYKALGLTGGVITANPANVEHILKTNFDNYPKGKLTVSMLEDFL

GHGIFNSDGEQWLWQRKAASYEFNKRSLRNFVVDTVRFEIVKRLLPLLEQAGLDGRTLDL

QDVLERFAFDNICLVAFGEDPACLTKERMAAPQSAEFMRAFNDAQNAILARFNSPAKSLW

RVKKLFNMEPERRMREALATIHGFAERIVRERRERGEAGLARGDDFLSRFAASGEHSDES

LRDVVTNFVLAGRDTTSSALTWFFWIVSGRPDVEDRVVREIRAVRASSGSTDATFSFDEL

REKHYLHAAITESMRLYPPVAIDTHSCKEDDFLPDGTFVGKGWLVMYSAYAMGRMEGIWG

ADCEEYRPERWLDEAGAFRPESTFKYPVFNAGPRICIGKEMAYIQMKSIVACVLEKFSLR

YASDANERPRSVLSLTLRMKWGLPMKVTIR NDIHNYRKWGDDQWLLDLGL PTDVGWTIEL

IDQLVTVWSAIQTIELTEHEDDQISWKLTSHGQYTAASAYNAQLLGTTANNFNNLIWKPW

APRKCKTFAWLIHMSLMSSGPGDGSTPPRRLLLHKENMGNPSGMDGVQQYSPQPLTAGFV

GVRM FSPSIFNEIGGSPIR

 

>AP003232.1e $F CYP94D10 chromosome 1 clone P0034E02 51% to 94D1 4 diffs with BE230752

3 diffs with AQ509587 78% to AZ135305

81213 MELSPISASLLLILILLAFLPLLYFLYMHQDPKKKPRIHGLKSYPVVGTLPHIIKNK 81043

81042 HRFLKWSTSIMKCSPTNTMSYKALGLTGGVITANPANVEHILKTNFDNYPKGKLTVSML 80866

80865 EDFLGHGIFNSDGEQWLWQRKAASYEFNKRSLRNFVVDTVRFEIVKRLLPLLEQAGLDGR 80686

80685 TLDLQDVLERFAFDNICLVAFGEDPACLTKERMAAPQSAEFMRAFNDAQNAILARFNSPA 80506

80505 KSLWRVKKLFNMEPERRMREALATIHGFAERIVRERRERGEAGLARGDDFLSRFAASGEH 80326

80325 SDESLRDVVTNFVLAGRDTTSSALTWFFWIVSGRPDVEDRVVREIRAVRASSGSTDATF 80149

80148 SFDELREKHYLHAAITESMRLYPPVAIDTHSCKEDDFLPDGTFVGKGWL 80002

80001 VMYSAYAMGRMEGIWGADCEEYRPERWLDEAGAFRPESTFKYPVFNAGPRICIGKE 79834

79833 MAYIQMKSIVACVLEKFSLRYAS 79765

      DANERPRSVLSLTLRMKWGLPMKVTIR K* 79678

 

Gene No. : 9-4_225 CYP94D9 100% agreement

1164142..1165689 (+)

>9-4_225

MELSSISASLLLILPLLPLLYFLYLHQDPKKQPRAHGLKSYPVVGTLPHFIKHKNHILEW

SAGVLKRSPMHTMSFKALGLTGGVFTANPANVEHMLKTNFGNYVKGEAIITMLEDFLGRG

IFNSDGEKWLWQRKATSYEFSKRTLRNFVVDTVQFEVIERLLPLLERAGRDGRTLDVQSV

LERFAFDNICRVVFDEDPACLAKDSVASPHIAEFMGACNDAQNAILARFNSPIKSLWRVK

RLFNIESERRLREALATIHAYTDRIIRERRERGEARGDDFLSRFAAGDKHSDESLHDVIT

NLVLAGRETTASALTWFFWLVSGRPDVEDNIVREIRAVRRASSSNGVTSGAAFSPHELRD

MHYLHAAITESMRLYPPVSLDTYVCKEEDFLPDGTFVGKGWQVTYCAYAMARVEDIWGTD

CEEFRPERWLDEAGVFRPESSFKYPVFHGGPRMCLGKEIAYIQMKSIVSCVFDRFTLRYT

GGEGHPGLVTSLALRMEGGLPMQVLLTNRGQAVSC

 

this annotation missed CYP94D8P

 

>AP003232.1c $P CYP94D8P chromosome 1 clone P0034E02 pseudogene fragments

no indica ortholog found on 9/6/02

67707 FT*VFNDAQNTIVSRFLS*VKSL*RFKRLFNMEPKRQMWEALAR 67576

67575 HDPATPSGSFSKHNN*SLREVVTSFLLA 67492

67313 TFVGKGWLVIYYAYAMRYVEDIRGSDCEEFRLEQWMNKAGVF*PKSSFK 67167

67164 FEYPIFYIGQRMCLGKEMTYIHGFGFTALEKLMQRVA 67054

 

Gene No. : 9-4_229 CYP94D7 with C-term extension

1179800..1181300 , 1181481..1181899 (+)

>9-4_229

MELSSTSASLLLILLLTLVYFLYLHQDPKKKPRTHGLKSYPVVGTLPHFINNKDRFLEWS

TGVMKRSPTHTMSFKELGLTGGVITANPANVEHILKANFGNYPKGELAVSLLEDFLGHGI

FNSDGEQWLWQRKAASYEFNKRSLRNFVVDTVRFEVVERLLPLLEYAGRHGRTLDVQDVL

ERFAFDNICRVAFDEDPACLTEESMAAPQSAEFMRAFNDAQNAILDRFNSPAKSLWRIKK

LFNMEPERRMRDSLATIHGYAERIVRERRERREARLERRDDFLSRFAASGEHSDESLRDV

VTNFILAGRDTTSSALTWFFWLLSGRPDVEDKIVREIRAVRQSSAGSEGTRGATFSLDEL

RDMQYLHAAITESMRLYPPVPFDTHSCKEEEFLPDGTFAGKGWLVTYCAYAMGRVEDIWG

ADCEEFRPERWLDEAGAFRPESTFKYPVFHAGPRMCLGKEMAYIQMKSIVACVLEQFSLR

YAGDAKGHPGLVVALTLRME ATLFGIHVSSAAVFGAPSEMRWARVVGYRIADLQEKLFAR

VDGLPMARPHRANLNTSSQPKSMPWRSGVGTGDKAEQLIQPLVLGVHRFILSPSPLINDE

WARIWNVSNSVGRIGERTCFFFRMRKRVSGLVFFANTQR

 

>AP003232.1b $F CYP94D7 chromosome 1 clone P0034E02

BE230752.1 99MJ354 Rice Seedling cDNA clone 99MJ354 65% to 94C1

60720 MELSSTSASLLLILLLTLVYFLYLHQDPKKKPRTHGLKSYPVVGTLPHFINNKDRF 60553

60552 LEWSTGVMKRSPTHTMSFKELGLTGGVITANPANVEHILKANFGNYPKGELAVSLLEDFL 60373

60372 GHGIFNSDGEQWLWQRKAASYEFNKRSLRNFVVDTVRFEVVERLLPLLEYAGRHGRTLDV 60193

60192 QDVLERFAFDNICRVAFDEDPACLTEESMAAPQSAEFMRAFNDAQNAILDRFNSPAKSLW 60013

60012 RIKKLFNMEPERRMRDSLATIHGYAERIVRERRERREARLERRDDFLSRFAASGEHSDES 59833

59832 LRDVVTNFILAGRDTTSSALTWFFWLLSGRPDVEDKIVREIRAVRQSSAGSEGTRGATFS 59653

59652 LDELRDMQYLHAAITESMRLYPPVPFDTHSCKEEEFLPDGTFAGKGWLVTYCAYAMGRVE 59473

59472 DIWGADCEEFRPERWLDEAGAFRPESTFKYPVFHAGPRMCLGKEMAYIQMKSIVACVLEQ 59293

59292 FSLRYAGDAKGHPGLVVALTLRME GGLPMKVTIRE* 59185

 

 

Gene No. : 9-4_234 fusion of CYP94D6 (wrong N-term and splices out a frameshift)

and some unknown gene plus a ribonuclease E -related seq

1196669..1196829 , 1197128..1197950 , 1198031..1198583 , 1200577..1200837 , 1201013..1202439 (+)

>9-4_234

MPWTTTMDANEDGRCRRRTGVGAGGGGLTTAVPGGEDDGATNGRDRVARNKGN STRKQPR

ADGLKAYPIVGILPHFVRNQHRLLEWSAGVVARCPTHTMSFNFKGFGLIAGAITGNPANV

EHIVKTNFQNYSKGEYVVSVMEDFLGHGIFNSDGDQWLWQRKAASYEFNKRSLRNFVVDT

VRSEVVDRLLPLLTRAERDGRTLDEQDVLERFAFDNVCCVAFDEDPACLTEEGMGTNART

EFLRAFNDAQNILMARFMSPVEWAWRAKRLLDLEPERRMREALATIHGYADRIVRERRER

GAAGLARKDDFLSRFAATGKHSDESLRD PDVEDRIAREIRAVRASSGSTDAAAFSFDELR

EMHYLHAAITEAMRLYPPVAMDSHCCQNDDVLPDGTFVGKGWQVTYSAYAMARLEELWGA

DCEEFRPERWLDEDGVFRPESPFKYPVFHGGPRMCLGKEMAYIQMKSIAACVFERFSFRF

VGGEGRPGLVFSVTLRMEGGLPMRVKKRRDSV SNEEAEVKVSDAETAAESAAGEAEADDV

EEEETKELRRNGVAAAPIGDALRERRPPPIAAAEILPEWSERAAIGETRASAMFGESRCD

GALLHNQPPDLIAVGRRRRRMDPSRPLLGRGALITSSAHAAAALLLVAFLFLTLRNLPIS

LSPPTAALTPTTSHLEQQDQASCDTTSTLDCADPQLFHLMMRRAIDAFPDVHFNRFGRPV

PGDPPSSSCDMAWRARSTASANYKDYRRFSVARDPVTCAYSVTSIGEYHSGPLARKPRRG

GTNATAPPPPPALSRSQFAAGKYLSYLGGGDRCKPMPHYLRSLLCSIAEARYLNRTLVLD

LSVCLAAAYAGGMPEEGKRLAFYIDIEHLQSVVGIVEHKRFWEDWDKWGAQGQLGVRIIE

DSRVAPTKFSKSRDPLIVRKFGDVEPGNYWYNVCEGEAEHVLRPPQGAIRTAPSLMDIVD

GIISRMQVDFDSVHVGGNDGNLRRRIEERLNGG GRQVYVAGEGINVVLLDALKAKYSSVH

YLDAFEELWARDSKWFLEMKRLNGGVPVEFDGYMRELVDREVFLKGKKKVEVLV

 

>AP003232.1a $F CYP94D6 chromosome 1 clone P0034E02

AZ135359.1 OSJNBb0115O14f CUGI Rice BAC genomicLength = 843 50% to 94D2

43934 MGTELSLTSSVPLLLLLLFPLLCFLCLRHG STRKQPRADGLKAYPIVGILPHFVRNQHRL 43755

43754 LEWSAGVVARCPTHTMSFNFKGFGLIAGAITGNPANVEHIVKTNFQNYSKGEYVVSVMED 43575

43574 FLGHGIFNSDGDQWLWQRKAASYEFNKRSLRNFVVDTVRSEVVDRLLPLLTRAERDGRTL 43395

43394 DEQDVLERFAFDNVCCVAFDEDPACLTEEGMGTNARTEFLRAFNDAQNILMARFMSPVEW 43215

43214 AWRAKRLLDLEPERRMREALATIHGYADRIVRERRERGAAGLARKDDFLSRFAATGKHSD 43035

43034 ESLRDV X 43017 frameshift

43014 TNFVLAGRDTTSSALTWFFWLVSGQ PDVEDRIAREIRAVRASSGSTDAAAFSFDELREMH 42835

42834 YLHAAITEAMRLYPPVAMDSHCCQNDDVLPDGTFVGKGWQVTYSAYAMARLEELWGADCE 42655

42654 EFRPERWLDEDGVFRPESPFKYPVFHGGPRMCLGKEMAYIQMKSIAACVFERFSFRFVG 42478

42477 GEGRPGLVFSVTLRMEGGLPMRVKKRRDSV C* 42382

 

Gene No. : 9-5_074 CYP73A35P erroneously converts pseudogene into a complete gene

386581..387133 , 387239..387391 , 387488..387651 , 387728..387815 ,

387886..388247 (-)

>9-5_074

MDLLFVERLLVGLLAAAVVAIAVSKLRGRKLRLPPGPTPVPVFGNWLQVGDDLNHRNLAA

LARRFGDIFLLRMGQRNLVVVSSPPLAREVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFT

V GTAPGGRPRPRPWWTASAPTPPPPPAAHD ALNGERSRLAQSFEYNYGDFIPILRPFLRG

YLRICEEVKETRLKLFKDFFLEERK KLASTKAMDN NGLKCAIDHILEAQQKGEINEDNVL

YIVENINVA GTRASVQ VNHGEIQEKLRRELDTVLGPGRQITEPDTHRLPYLQAVVKETLR

LRMAIPLLVPHMNLRDAELAGYGIPAESKVLVNAWYLANDPGRWRRPEEFRPERFLEEER

NVEANGNDFRYLPSGAGRRSCPGIVLALPILGVTIGRLVQNFELLPPPGKDRVDTTEKGG

QFSLHILKHSTIVAKPRAF

 

>CYP73A35P   $P Oryza sativa (rice)  GenEMBL AP003446.1 March 29, 2001

AP003302.1  Feb. 21, 2001 Both sequences have the same frameshifts

MDLLFVERLLVGLLAAAVVAIAVSKLRGRKLRLPPGPTPVPVFGNWLQVGDDLNHRNLAA

LARRFGDIFLLRMGQRNLVVVSSPPLAREVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFT

V YGDHWRKMRRIMTVPFFTGKVVQRHRAGWEAEAAAVVDGLRADPAAA 168 (25 nuc. deletion and frameshift in both sequences)

175 RRRLQLMMYSNVYRIMFDRRFESADDPLFLRLK ALNGERSRLAQSFEYNYGDFIPILRPF

LRGYLRICEEVKETRLKLFKDFFLEERK (2?)

NGLKCAIDHILEAQQKGEINEDNVLYIVENINVA (1)

AIETTLWSMEWAIAEL (2 nuc. insertion and frameshift in both seqs.)

VNHGEIQEKLRRELDTVLGPGRQITEPDTHRLPYLQAVVKETLRLRMAIPLLVPHMNLRD

AELAGYGIPAESKVLVNAWYLANDPGRWRRPEEFRPERFLEEERNVEANGNDFRYLPSGA

GRRSCPGIVLALPILGVTIGRLVQNFELLPPPGKDRVDTTEKGGQFSLHILKHSTIVAKPRAF*

 

Gene No. : 9-6_096 CYP86A9 with N-term extension (may be correct)

486465..488102 (+)

>9-6_096

MATDGGVLQLHPYA MAAAAVALASAYMVWFWALSRRLSGPRMWPLVGSLPSVVLNRARVH

DWIADNLRATGDAATYQTCILPLPFLARRQGLVTVTCNPRNLEHILRARFDNYPKGPMWQ

ASFHDLLGQGIFNSDGETWLIQRKTAALEFTTRTLRQAMARWANRSIKYRLWRILDDHCN

AAASVDLQDLLLRLTFDNICGLTFGKDPETLSPGLPENPFANAFDEATEATMQRFLFPSL

LWRIKKAFGVGSERSLRDSLAVVDRHMTETIAARKATPSDDLLSRFMKKRDSKGKAFPED

VLQWIALNFVLAGRDTSSVALSWFFWTLMQRRDVERKVVLEIASVLRETRGDDTARWTEE

PLNFDELERLVYLKAALTETLRLYPSVPQDSKYVVADDVLPDGTVVPAGSAITYSIYSVG

RMESIWGKDCAEFRPERWLSADGSRFEPVKDAYRFVAFNGGPRTCLGKDLAYLQMKSIAS

AVLLRNSVELVPGHKVEQKMSLTLFMKNGLRVHVKPRDIASYVEPSEPAPPQGSLVIPTT

TAAAA

 

>AP003442.1 $F CYP86A9 chromosome 1 clone B1096A10 72% to 86A1

AQ954084.1 nbeb0054A03r CUGI Rice BAC genomicLength = 403 57% to 86A1

D41651  71% to 86A1 almost identical to AP003442

49790 MAAAAVALASAYMVWFWALSRRLSGPRMWPLVGSLPSVVLNRARVHDWIADNLRATGDAATYQTCILPLPFLARRQGLV

50026

50027 TVTCNPRNLEHILRARFDNYPKGPMWQASFHDLLGQGIFNSDGETWLIQRKTAALEFTTR 50206

50207 TLRQAMARWANRSIKYRLWRILDDHCNAAASVDLQDLLLRLTFDNICGLTFGKDPETLSP 50386

50387 GLPENPFANAFDEATEATMQRFLFPSLLWRIKKAFGVGSERSLRDSLAVVDRHMTETIAA 50566

50567 RKATPSDDLLSRFMKKRDSKGKAFPEDVLQWIALNFVLAGRDTSSVALSWFF 50722

50723 WTLMQRRDVERKVVLEIASVLRETRGDDTARWTEEPLNFDELERLVYLKAALTETLR 50893

50894 LYPSVPQDSKYVVADDVLPDGTVVPAGSAITYSIYSVGRMESIWGKDCAEFRPERWLSAD 51073

51074 GSRFEPVKDAYRFVAFNGGPRTCLGKDLAYLQMKSIASAVLLRNSVELVPGHKVEQKM 51247

51248 SLTLFMKNGLRVHVKPRDIASYVEPSEPAPPQGSLVIPTTTAAAA* 51385

 

Gene No. : 9-6_137 fusion of CYP94C3 with an RNA helicase and additional unknown seq.

738785..738862 , 738881..739121 , 741031..741099 , 741481..741552 , 741601..741712 , 741741..741829 , 742152..742215 , 742298..742474 , 743125..743304 , 743588..743853 , 743896..743951 , 744169..744316 , 744369..744639 , 744737..744758 , 745157..745287 , 746102..746147 , 746280..747800 (-)

>9-6_137

MGVVEAEALHGAVEALAGSLQPHVATAFFVFSACTVALAALLAVVRLRPPWWCDCTVCEA

FLTASWAGEFDNLCDWYAHLLRTSPAQTVHVHVLRNVLTANPVTVDHVLRARFDNYPKGA

PFSAILADFLGRGIFNVDGDAWLFQRKLAAAELASPALRAFAARVVASELRCRLIPLLHS

ASREGNGKVLDLQDMFRRFAFDSICKISFGLDPGCLELSMPVSTLVEAFDTASTLSARRA

TVPMQIIWRLKRFLNVGDERKLRDAVRLVDALAAEVIRQRRKLGGAATGSDLLSRFMGSI

DDDKYLRDIVVSFMLAGRDTIASALTAFFLLLSDHPEVATAIRDEVARVTGDGNRTMAAT

FDKLKDMHYVHAAMYESMRLFPPVQFDSKFAAGDDTLPDGTVVAKGTRVTYHAYAMGRME

SVWGPDCAEFRPERWLRDGRFVPESPYRYPVFQAGVRVCIGKELALMEMKAVIVAVVRSF

DIEAIARSSRRPKFAPGLTATFAGGLPV TCIMSIMIDDSVTTRTKLGKFHGASEEGRPEI

STCQKVRMPSANKTPTPRLIPFIGCMGQLATAGAWDRASGKVAELVICPIYANLPAELQA

KIFEPAPAGARKVVLATNIADAETPYNPRTAMESFLVAPVSRASAEQRACRSGIQVRVAA

AVASAAAAASATAVIDETLGVLYRSNRVLIPICFGWNCNLTTKENEHERLLVLVDRDSDT

IGAYVCNPTTRRHDGAFIAFDPAVSQTRRLACGVVGRTPTLDLLIPESSSGEDELAPSPE

ERTLLLRVFSVSHAAAASASQVVGCSCQVRLRRRALTTRRPHKMDQVNLQCVQQKQFLVR

HFHCKLVAVNGFCRHPFRLVPSVDFSSAENDNACARIQDSGNHRAQPSPSDRSRQLFFFL

HEGLVRTGASVPIGSPGTRRGVDSGRSSPAVQPTGTKASGGRCRRHRRWLLSSCCLGPCR

RTQVEIFCGDAKFFSFRSDASTFIFGRPLQAHLSTSTTEAELQATVVGGWRRQQIDEKDL

LFFLLMRRGPFLQNWLYPSKPCYIRHSALLLVGILRACSDAVTEKREHIGTAMVNGVAPP

PPTSGAAPPLSSSDRPSPHPLSRLSCDGSGDGDRGWIRPRWWRPRTDLAAAAPSDPSRLP

PTVDLTALTVSAPSPERRWAREPVAAESSDADNGNDGGGE

 

>AP003289 $F CYP94C3 55% to 94C1 CDS complement(91864..93438)

MGVVEAEALHGAVEALAGSLQPHVATAFFVFSACTVALAALLAV

VRLRPPWWCDCTVCEAFLTASWAGEFDNLCDWYAHLLRTSPAQTVHVHVLRNVLTANP

VTVDHVLRARFDNYPKGAPFSAILADFLGRGIFNVDGDAWLFQRKLAAAELASPALRA

FAARVVASELRCRLIPLLHSASREGNGKVLDLQDMFRRFAFDSICKISFGLDPGCLEL

SMPVSTLVEAFDTASTLSARRATVPMQIIWRLKRFLNVGDERKLRDAVRLVDALAAEV

IRQRRKLGGAATGSDLLSRFMGSIDDDKYLRDIVVSFMLAGRDTIASALTAFFLLLSD

HPEVATAIRDEVARVTGDGNRTMAATFDKLKDMHYVHAAMYESMRLFPPVQFDSKFAA

GDDTLPDGTVVAKGTRVTYHAYAMGRMESVWGPDCAEFRPERWLRDGRFVPESPYRYP

VFQAGVRVCIGKELALMEMKAVIVAVVRSFDIEAIARSSRRPKFAPGLTATFAGGLPV

RVRRRRARASGHNPPI

 

NOT IN THIS ANNOTATION

>AP006237.3 CYP734A6 (japonica cultivar-group) genomic DNA, chromosome 1, BAC

             clone:OSJNBb0008D07

          Length = 156874

complement(join(69916..70392,70471..70870,71055..71529,72013..72430))

72340 MGWWGWAAAAAAAAAWVAV

72373 KVLEVLWWRPRRVEEHFARQGITGPRYRFLVGCVREMVALMVAASAKPMPPPYRSHNVLP 72194

72193 RVLAFYHHWKKIYGNPPPPPLLLNSILSQKQQPRTRRWQVAVVGERFAPGRYDIDMMAAL (1) 72014

71530 GSTFLIWFGPTPRLAIADPELIREVLLARADRFDRYESHPMVRQLEGEGLVSLRGDKWAH 71351

71350 HRRVLTPAFHMDNLRLLLPCVGMTVLDMADKWRAMAEADKSGEVEIDVSDWFQVVTEDAI 71171

71170 TRTAFGRSYEDGKVVFKLQAQLMAFASEAFRKVFIPGYR (2) 71054

70869 FLPTKKNTSSWKLDKEIRKNLVTLIGRRQEAGDDEKLDGCAKDLLGLMINAAASS 70705

70704 NGGKRSALPVSPITVNDIVEECKTFFFAGKQTTSNLLTWAIVVLAMHPEWQERARQEV 70531

70530 LDVCGADGVPSREQLAKLKT (0)

70392 LGMILNETLRLYPP 70351

70350 AVATVRRAKADVELGGYLRIPRDTELLIPIMAVHHDARLWGPDAAQFNPARFAGGVARAA 70171

70170 RHPAAFIPFGLGARMCIGQNLAILEAKLTVAVILHRFEFRLSARYVHAPTVLMLLHPQYG 69991

69990 APIVFRPRSSSQPTCEKMNPLTSS* 69916

 

NOT IN THIS ANNOTATION

>AP004232.1 $F CYP71C18 chromosome 1 clone OSJNBa0051H17 like CYP71C4

90% to aaaa01002989.1 version 4 in Genbank does not allow for

frameshift and skips beginning of heme signature

probably not an ortholog no 99% match in indica 9/5/02

56483 MEQAAGLVYQLFQNEMFPWILALFPFLLLALHYLATNHRTPTTCKETRNHLSPPSP 56650

56651 PRLPIIGHLHLIGDLPHVSLRELAHRYGPNLMLLHLGQVQNLVVSSPHAAEAVLR 56815

56816 THDHVFASRPHSLIGDILLYGPSDVGLSPYGEQWRRSRRIVTTHLLTNKKVRSYHVAREE 56995

56996 E 56998 (0)

57929 VHKVMTKVHELSTKGMGVDMFELFSTYSNDLICRLVSGKNFQGDKGRNKMFRQLFKANY 58105

58106 VLLAGFNLEDYYPGLARLKAVSWVMCAKARNTRKLWDELLDEIINDRMSKQPCEHDRGND 58285

58286 DQDEMDFVDVLLLQERGITRDHLKAIL 58366 (0)

58462 DMFQAGTETTSVVLVFAMAELMQKPHLMAKLQAELRTNIPK*GRELITECDQTNMTYLKA 58641

58642 VIKETLRLHPPSPLLVPHLAMADCDIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 58821

58822 RFVDGGSAANVDFIGTDFQFLPFGAX 58896 frameshift

58899 RRICPGINFASASMEIILANLLYHFDWDVSAEAAIDKDGIDMAEAFGLSVQLKEKLLLVP 59078

59079 VEYKGSVQDSAVIL* 59123

 

NOT IN THIS ANNOTATION

>AP004233.1 $F CYP71C19 chromosome 1 clone OSJNBa0065J17 50% to CYP71C4

= AQ857130 duplicate of the AP004232 gene at 27203-29726

probably not ortholog only 91%

21862 MEQAAGLVYQLFQHEMFPWTFSVLALFPFLLLVLHYLATNHRTPTTCKETKNHHPP

21694 PPSPPRLPIIGHLHLIGGLLHVSLRELAHRYGPDLMLLHLGQVPNLIVSSPRAAEAVLR 21518

21517 THDLVFASRPYSLIADILLYGPSDVGLSPYGE*WRRRIITTHLLTNKKVRSYRVAREE 21344

21343 E 21335 (0)

      VHKVMAKVHELSTKGMAVDMTELFSTFSNDLICRLVSGKNFQGEGRNKLFRQLFKAN

      SVLLAGFNLKDYYPGLARLKAVSMVMCAKARNTRKLWDELLDEIIDERMS

      KQQCEHDEGNDQDEMNFVNVLLLQEQGITREHLKAIL (0)

20004 DMYQAGTETSSVVLVFAMAELMQKPHLMAKLQAELRTTIPKQGHELITERDLTDMTYLKA 19825

19824 VIKETLRLHPPTPLLLPHLAMADCNIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 19645

19644 RFVDDGSAANVDFIGTDFQFLPFGAGRRICPGINFASASMEIILANLLYHFDWDVSAEAA 19465

19464 IDKDGIDMAEAFGLSVQLKEKLLLVPVDYKDGMQDSAVILL* 19339

 

NOT IN THIS ANNOTATION

>AP003142.2 $P CYP72A36P chromosome 1, PAC clone:P0435H01 probable pseudogene

53% to 72A15 86% to AP002899

5354 YLPIENNRRIREIY*EIRKILRGLIVKGDKAIRNGENTNDDLLGLLVESNMRQSNERE 5181

5180 EVGMSIED 5157

1115 IIEECRLFYFAGSETTS 1065 frameshift

1065 MLLT*TLIMLSMHPEWQERAREEVMHHFRRTTPDHDGLSRLKIVHM 928

 

NOT IN THIS ANNOTATION

>AP003261.1 $P CYP76H12P chromosome 1 clone P0471B04,

N-terminal exon of a CYP75 like sequence 70% identical to X70824 but no other part of

the gene is found.  Identical to AP003227 87336-87082 almost identical to AP003214

132381 MELAALCTDPVVLSAAFLCLLLNLSLCSYRPPSPGGGRRLPLGPPGVPVLGALPLVGPAPHADLASLARK 132172

132171 YGPIMYLKMGTCGVV 132127