Rice P450 sequences

Dec. 29, 2003 D. Nelson

 

#n are the numbers assigned to ortholog pairs or unique sequences.  489 numbers were given out; 31 of these were combined and 4 were not from rice.  Therefore, there are 454 unique rice sequences.  Fragments get the same number as their parents.  Order is by CYP name.

Three sequences (aaaa01039155.1, aaaa01093055.1, aaaa01067419.1) are probable fungal P450 contaminants.  One sequence (aaaa01062516.1) is a probable insect P450 contaminant.  These are not counted in the total.

CYP names have now been assigned to all 454 sequences.

27 sequences are partial, and some may join to make a smaller number of genes.

This will probably reduce the gene count by 4, to 450 genes and pseudogenes.
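
The counts quoted above can be checked with a few lines of arithmetic. This is only a sketch of the bookkeeping; the variable names are illustrative and are not part of the original notes.

```python
# Bookkeeping for the sequence counts stated in the notes above.
numbers_issued = 489      # "#n" numbers given out
combined = 31             # numbers later merged with another number
not_rice = 4              # probable fungal/insect contaminants

unique_rice_sequences = numbers_issued - combined - not_rice

# The notes expect joining of partial sequences to remove about 4 more entries.
expected_reduction = 4
projected_gene_count = unique_rice_sequences - expected_reduction

print(unique_rice_sequences, projected_gene_count)
```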

 

#300

>aaaa01012243.1 $FI CYP51G1 = old CYP51A5 Indica rice genome CYP51 New April 24, 2002

ortholog of AB025047 99%

5108 MTLADPNHRLIAGAALLVATLAFIKLLLSSAGGGKKRLPPTIPAAPLVGGLLRFMRGPIPM 4926

4925 IREEYARLGSVFTVPILSRKITFLIGPEVSAHFFKGNEAEMSQQEVYKFNVPTFGPGVVF 4746

4745 DVDYSVRQEQFRFFTEALRANKLRSYVDQMVVEAE 4641

2774 EYFSKWGESGTVDLKYELEHLIILTASRCLLGREVREKLFDDVSALFHDLDNGMQPVSV 2598

2597 IFPYLPIPAHRRRDRARQRLKEIFATIIKSRKASGQAEEDMLQCFIDSKYKSGRSTTEGE 2418

2417 ITGLLIAALFAGQHTSSITSTWTGAYMLRFKQYFAAAEEEQKEVMKRHGDKIDHDILAEMDVLYR 2223

2222 CIKEALRLHPPLIMLLRQSHNDFSVTTKDGKEFDIPKGHIVATSPAFANRLPHIFKNPDS 2043

2042 YDPDRYAPGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWTHLLRNFEFELVSPF 1863

1862 PETNWKAMVVGIKDEVMVNFKRRKLVVDN* 1773
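
The translation lines above carry a coordinate on each side of the residues. A minimal sketch for sanity-checking such a line follows, assuming (this is not stated in the notes) that the numbers are inclusive nucleotide positions on the contig, that a descending pair means the minus strand, and that a terminal `*` counts as one codon. The function name `check_line` is hypothetical.

```python
import re

# start coordinate, residues (with optional '*' stops), end coordinate
LINE_RE = re.compile(r"^\s*(\d+)\s+([A-Z*]+)\s+(\d+)\s*$")

def check_line(line):
    """Return strand and a length consistency flag for one translation line."""
    m = LINE_RE.match(line)
    if m is None:
        return None  # not a simple "start residues end" line
    start, residues, end = int(m.group(1)), m.group(2), int(m.group(3))
    span = abs(end - start) + 1          # inclusive nucleotide span
    return {
        "strand": "+" if end >= start else "-",
        "residues": len(residues),
        "consistent": span == 3 * len(residues),
    }

first = check_line("5108 MTLADPNHRLIAGAALLVATLAFIKLLLSSAGGGKKRLPPTIPAAPLVGGLLRFMRGPIPM 4926")
last = check_line("1862 PETNWKAMVVGIKDEVMVNFKRRKLVVDN* 1773")
print(first, last)
```

Lines with annotations after the end coordinate, or with a missing coordinate, return None here and would need separate handling.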

 

>AB025047 CYP51G1 = old CYP51A5 rice (partial)   80% to 51A2 missing N-term 64 aa

BE040549.1 OE08G10 OE Oryza sativa cDNA 5' Length = 255 I-helix CYP51

BE230288.1 99AS641 Rice Seedling cDNA clone 99AS641. Length = 586

BE230302.1 99AS655 Rice Seedling cDNA clone 99AS655. Length = 627

BE607441.1 OE202C10 OE cDNA clone ID707 C-term CYP51 Length = 428

REEYARLGSVFTVPILSRKITFLIGPEVSAHFFKGNEAEMSQQE

VYKFNVPTFGPGVVFDVDYSVRQEQFRFFTEALRANKLRSYVDQMVVEAEEYFSKWGE

SGTVDLKYELEHLIILTASRCLLGREVREKLFDDVSSLFHDLDNGMQPVSVIFPYLPI

PAHRRRDRARQRLKEIFATIIKSRKASGRAEEDMLQCFIDSKYKSGRSTTEGEITGLL

IAALFAGQHTSSITSTWTGAYMLRFKQYFAAAEEEQKEVMKRHGDKIDHDILAEMDVL

YRCIKEALRLHPPLIMLLRQSHNDFSVTTKDGKEFDIPKGHIVATSPAFANRLPHIFK

NPDSYDPDPYAPGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWTHLLRNFEF

ELVSPFPETNWKAMVVGIKDEVMVNFKRRKLVVDN
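
Percent identities such as the "99%" quoted for this ortholog pair were presumably taken from a database search. The sketch below is not that method; it only counts matching positions between two equal-length, ungapped segments, which is enough to spot single differences between the indica and cDNA translations. The function name is hypothetical, and the two segments are copied from the entries above.

```python
def ungapped_identity(a, b):
    """Count identical positions between two equal-length segments."""
    if len(a) != len(b):
        raise ValueError("segments must have the same length")
    matches = sum(x == y for x, y in zip(a, b))
    return matches, len(a)

# Same region from aaaa01012243.1 (coordinates 2774-2598) and from AB025047.
indica = "EYFSKWGESGTVDLKYELEHLIILTASRCLLGREVREKLFDDVSALFHDLDNGMQPVSV"
cdna = "EYFSKWGESGTVDLKYELEHLIILTASRCLLGREVREKLFDDVSSLFHDLDNGMQPVSV"

matches, length = ungapped_identity(indica, cdna)
print(f"{matches}/{length} identical ({100.0 * matches / length:.1f}%)")
```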

 

#300

>aaaa01066056.1 CYP51G1 = old CYP51A5 (indica cultivar-group) = aaaa01012243.1 $FI Indica rice genome CYP51

 591 DPNHRLIAGAALLVATLAFIKLLLSSAGGGKKRLPPTIPAAPLVGGLLRFMRGPIPM 761

 762 IREEYARLGSVFTVPILRRKITFLI 836
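
Header lines in this listing follow a loose pattern: an accession, an optional `$` flag, the CYP name, and often `= old <name>`. A minimal sketch for pulling those fields out is given below. The regular expressions are assumptions fitted to the headers shown here, the `$` flags are returned uninterpreted because this section does not define them, and `parse_header` is a hypothetical helper.

```python
import re

CYP_RE = re.compile(r"\bCYP\d+[A-Z]+\d+P?\b")        # e.g. CYP51G1, CYP51G4P
OLD_RE = re.compile(r"=\s*old\s+(CYP\d+[A-Z]+\d+P?)")  # e.g. "= old CYP51A5"

def parse_header(line):
    """Split a '>' header into accession, flag, CYP name and old name."""
    if not line.startswith(">"):
        return None
    tokens = line[1:].split()
    if not tokens:
        return None
    # Only treat a '$' token as this entry's flag when it follows the accession.
    flag = tokens[1] if len(tokens) > 1 and tokens[1].startswith("$") else None
    name = CYP_RE.search(line)
    old = OLD_RE.search(line)
    return {
        "accession": tokens[0],
        "flag": flag,
        "cyp": name.group(0) if name else None,
        "old": old.group(1) if old else None,
    }

h1 = parse_header(">aaaa01012243.1 $FI CYP51G1 = old CYP51A5 Indica rice genome CYP51 New April 24, 2002")
h2 = parse_header(">aaaa01066056.1 CYP51G1 = old CYP51A5 (indica cultivar-group) = aaaa01012243.1 $FI Indica rice genome CYP51")
print(h1)
print(h2)
```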

 

#346

>aaaa01014709.1 CYP51G3 = old CYP51A15 (indica cultivar-group) 49% to 51A2

602 MDLTTGTIWLFLAQ

560 LFIIATILSKIATRERTRTTSTKISRPPPPPMARGAPLVGVLPSLLAKGPVAFIRHHYEK 381

380 MGSVFTVSLLQQKVTFLVGSEAASHFYKGLDSEISQDEVSQFTIPTFGPGVAFDVDYATR 201

200 HEQFRFFGDIMKPAKLRTYVDLMVAEVE 117

 

no japonica ortholog found 9/11/02

 

#418

>aaaa01028263.1 CYP51G3 = old CYP51A15 (indica cultivar-group) 73% to AP003866.1

8   GQHTSSSTSTWTGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKET 187

188 LRLHPPALMLLRHARRSFVVHSEDSGGGEREYEVPEGHTMASPLLLHNALPRVYRDPGE 364

365 FDPGRFGAGREEGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQL 526

    VSPFPETDWTVVMPGPKGKVMVTYKRRKLT*

 

no japonica ortholog found 9/12/02

 

All three fragments of CYP51G3 = old CYP51A15 (#418, #453, #346) joined; this reduces the gene count by 2.

602 MDLTTGTIWLFLAQ

560 LFIIATILSKIATRERTRTTSTKISRPPPPPMARGAPLVGVLPSLLAKGPVAFIRHHYEK 381

380 MGSVFTVSLLQQKVTFLVGSEAASHFYKGLDSEISQDEVSQFTIPTFGPGVAFDVDYATR 201

200 HEQFRFFGDIMKPAKLRTYVDLMVAEVE 117

(0) GYFARWGQSGTVNMKQEFEQLVTLIASRCLLGEEVRDKMFDEV

    STLLRELNDGMRLVTILFPRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSVGGGGGAPH

    DDMLQCLIDARYKDGRATTETEVAGMLVAALFA

8   GQHTSSSTSTWTGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKET 187

188 LRLHPPALMLLRHARRSFVVHSEDSGGGEREYEVPEGHTMASPLLLHNALPRVYRDPGE 364

365 FDPGRFGAGREEGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQL 526

    VSPFPETDWTVVMPGPKGKVMVTYKRRKLT*

 

#453

>aaaa01065204.1 CYP51G3 = old CYP51A15 (indica cultivar-group) exon 3

ortholog of AY022669.1 searched Genbank for extensions

(0) GYFARWGQSGTVNMKQEFEQLVTLIASRCLLGEEVRDKMFDEV

STLLRELNDGMRLVTILFPRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSVGGGGGAPH

DDMLQCLIDARYKDGRATTETEVAGMLVAALFAGQHT

 

>AY022669.1 CYP51G3 = old CYP51A15 (partial)   microsatellite MRG4994 containing (CCG)X8, Length = 224

82% to CYP51 pseudogene above

222 PRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSGGGGGGGGARHDDMLQCLIDARYKDGRATTETEVAGMLVAA 1

 

>AK107185.1 CYP51A15 (japonica cultivar-group) cDNA clone:002-124-H08

AC135914.2 genomic seq

    MDLTTGAIWLFLAQLFVAATMLSKIATRERTRTTGTKFSRPPPPPLARGAPLVGVLPSLLANGPVEFIRH 182

183 HYEKMGSVFTVSLLQQKVTFLVGSEASSHFYKGLDSEISQDEVSQFTIPTFGPGVAFDVD 362

363 YATRHEQFRFFGDIMKPAKLRTYVDLMVAEVE (0)

    GYFARWGQSGTVNMKQEFEQLVTLIASR 542

543 CLLGEEVRDKMFDEVSTLLRELNDGMRLVTILFPRLPIPAHRRRDRARARLGEIFSDIVR 722

723 SRRGSSGGGGGGGGARHDDMLQCLIDARYKDGRATTETEVAGMLVAALFAGQHTSSSTST 902

903 WAGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKETLRLHPPALML 1082

1083 LRHARRSFVVRGGSGEREYEVPEGHTVASPLLLHNALPRVYRDPGEFDPGRFGAGRE 1253

1254 EGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQLVSPFPETDWTVVMPGPK 1433

1434 GKVMVTYNRRKLT* 1475

 

#182

>aaaa01005681.1b $PI CYP51G4P = old CYP51A16P (indica cultivar-group) ortholog of AP003866.1b

4595 VRFLHRKVTFLVGPEESSHFFTGLDAEISQDEVSRFIIPTFGS*VAFDA 4741

5197 GYATQ*EQFRFFGDTMKPTKLRSYVSHMVYEVE 5295

6670 VVTPIATRCLFGEVRSKMLGEVSTLLRELNDSMRLITIVFPYLPIPAHRRRDSARARLGE 6849

6850 IFVEIVRSRRSSPGGGGAGHDDMLQCLIDARYKDGRATTEG

6975 AEVAGMLVSALLAGQYTSSSTSTWTG 7052 frameshift

7055 ARLLTHPEHLRAAVREQEELVLVRHRHGGDVVDHDALQRMGHLHRCVKETLRLHPPSLML 7234

7235 LRHARRSFVVRARGSGDAEYEVPAGHTVAS 7324

     PMVIHNALPHVY 7359

7360 EDAGSFDPGRFGPAREEYRAYAADHAYTVFGGGRHACVGE 7479 frameshift

7482 VIKVIWSHLLRNFELELMLPFPETDWNDVMPGPKGKVMVLPPSHNKIISVTVGFSVQL 7655

 

>AP003866.1b $P CYP51G4P = old CYP51A16P chromosome 7 clone OJ1092_A07

No obvious N-terminal, two in frame stops, three frameshifts = Pseudogene

82%  to AY022669.1 seems to be a CYP51 pseudogene

54048 VRFLHRKVTFLVGPEESSHFFTGLDSEISQDEVSRFIIPTFGS*VAFD 54191 (intron no boundaries)

54642 AGYATQ*EQFRFFGDTMKPTKLRSYVSHMVYEVE 54743 (0) missing 20 aa

56119 VVTLIATRCLFGEVRSKMLGEVPTLLRELNDSMRLITIVFPYLPIPAHRRRDSARARL 56292

56293 GEIFVEIVRSRRSSPGGGGAGHDDMLQCLIDARYKDGRATTEG 56421 frameshift

56424 AEVAGMLVSALFAGQYTSSSTSTWTGAR 56507 frameshift

56510 LLTHPEHLRAAVREQEELVLVRHRHGGDVVDHDALQRMGHLHRCVKETLRLHPPSLMLLRHAR 56698

56699 RSFVVRARGSGDAEYEVPAGHTVASPMVIHNALPHVYEDAGSFDPGRFGPAREEYRAYAA 56878

56879 DHAYTVFGGGRHACVGE 56929 frameshift

56932 VIKVIWSHLLRNFELELMLPFPETDWNDVMPGPKGKVMVLPPSHNKIISVIVGFSVQL* 57108
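
The pseudogene call for this entry rests on counting in-frame stops and frameshifts. A rough sketch of that tally over the annotated lines is shown below. It assumes that `*` marks a stop, that a trailing `*` is the normal terminator rather than an in-frame stop, and that each line tagged "frameshift" marks one frameshift. `summarize` is a hypothetical helper, and the record text is copied from the AP003866.1b entry above.

```python
import re

# start coordinate, residues, end coordinate, then any trailing annotation
SEQ_RE = re.compile(r"^\s*\d+\s+([A-Z*]+)\s+\d+(.*)$")

def summarize(record_text):
    """Return (internal stop count, frameshift annotation count)."""
    pieces = []
    frameshifts = 0
    for line in record_text.splitlines():
        m = SEQ_RE.match(line)
        if m is None:
            continue
        pieces.append(m.group(1))
        if "frameshift" in m.group(2):
            frameshifts += 1
    protein = "".join(pieces)
    internal_stops = protein.rstrip("*").count("*")  # ignore terminal stop
    return internal_stops, frameshifts

RECORD = """\
54048 VRFLHRKVTFLVGPEESSHFFTGLDSEISQDEVSRFIIPTFGS*VAFD 54191 (intron no boundaries)
54642 AGYATQ*EQFRFFGDTMKPTKLRSYVSHMVYEVE 54743 (0) missing 20 aa
56119 VVTLIATRCLFGEVRSKMLGEVPTLLRELNDSMRLITIVFPYLPIPAHRRRDSARARL 56292
56293 GEIFVEIVRSRRSSPGGGGAGHDDMLQCLIDARYKDGRATTEG 56421 frameshift
56424 AEVAGMLVSALFAGQYTSSSTSTWTGAR 56507 frameshift
56510 LLTHPEHLRAAVREQEELVLVRHRHGGDVVDHDALQRMGHLHRCVKETLRLHPPSLMLLRHAR 56698
56699 RSFVVRARGSGDAEYEVPAGHTVASPMVIHNALPHVYEDAGSFDPGRFGPAREEYRAYAA 56878
56879 DHAYTVFGGGRHACVGE 56929 frameshift
56932 VIKVIWSHLLRNFELELMLPFPETDWNDVMPGPKGKVMVLPPSHNKIISVIVGFSVQL* 57108
"""

stops, shifts = summarize(RECORD)
print(stops, shifts)
```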

 

#110

>aaaa01003099.1b CYP51H1 = old CYP51A6 (indica cultivar-group)  Nterm aa 4-160

ortholog to AP005448.1b 100%

10626 VTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 10793

10794 RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 10973

10974 RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 11117

 

>aaaa01003099.1c CYP51H1 = old CYP51A6 (indica cultivar-group)  Nterm aa 61-160

ortholog to AP005448.1b 100%; these two are duplicates, only count once

11261 DGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYL 11437

11438 APESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 11617

 

>aaaa01003099.1d CYP51H1 = old CYP51A6 (indica cultivar-group)  Nterm aa 61-160

ortholog to AP005448.1b 100%; these two are duplicates, only count once

11761 DGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYL 11937

11938 APESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 12117
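
Entries flagged as duplicates can be found mechanically by grouping records whose joined translations are identical. The sketch below assumes each record has already been reduced to one concatenated residue string; `group_identical` is a hypothetical helper, and exact string equality is a deliberately strict criterion (a fragment contained in a longer entry, such as .1c within .1b, is not merged).

```python
from collections import defaultdict

def group_identical(records):
    """Group record names that share exactly the same residue string."""
    groups = defaultdict(list)
    for name, seq in records.items():
        groups[seq].append(name)
    return sorted(groups.values())

TAIL = ("DGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYL"
        "APESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ")

records = {
    # .1b is longer: it carries extra N-terminal residues before the shared part
    "aaaa01003099.1b": ("VTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA"
                        "RAMAR" + TAIL),
    "aaaa01003099.1c": TAIL,
    "aaaa01003099.1d": TAIL,
}

groups = group_identical(records)
print(groups)
```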

 

>aaaa01003099.1e CYP51H1 = old CYP51A6 (indica cultivar-group) nearly  gene, runs off end

ortholog to AP005448.1b $F 99% plus one frameshifted region

21127 LQKRKISSPAAAAPPVVRGAGLVRLRARHGEGRAAGGDPRAAGEAGERVTAIAPF 20963

20962 GLFKVTFLIGPEVSSHFYLAAESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWD 20783

20782 VLKPRSIEARVGAMAEEVQ 20726 (0?)

18574 NYFSRWGEQGTVDLKKELERVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 18401

18400 TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTAGNGDDVLQRLIDGRYKD 18236

18235 ERALTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLAAVIAEQDRLMASRARTD 18056

18055 DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 17876

17875 LSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 17696

17695 KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRR 17570

 

>AP005448.1b $F CYP51H1 = old CYP51A6 (japonica cultivar-group) chromosome 7 21 June 2002

100% to AP005188.2c

32724 MDHVTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 32900

32901 RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 33080

33081 RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 33224

35381 NYFSRWGEQGTVDLKKELEQVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 35554

35555 TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTADNGDDVLQRLIDGRYKD 35719

35720 ERDLTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLVAVIAEQDRLMASRARTD 35899

35900 DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 36079

36080 MSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 36259

36260 KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRRNLTCK* 36403

 

>AP005188.2c $F CYP51H1 = old CYP51A6 (japonica cultivar-group) chr 7 orth to aaaa01003099.1e 99%

55155 MDHVTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 55331

55332 RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 55511

55512 RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ

57812 NYFSRWGEQGTVDLKKELEQVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 57985

57986 TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTADNGDDVLQRLIDGRYKD 58150

58151 ERDLTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLVAVIAEQDRLMASRARTD 58330

58331 DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 58510

58511 MSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 58690

58691 KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRRNLTCK* 58834

 

note: sequences aaaa01003099.1b to e are all probably from a single gene

 

#109

>aaaa01003099.1a CYP51H2P = old CYP51A7P (indica cultivar-group)  Nterm aa 4-94

ortholog of AP005448.1a 100%

7681 TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 7857

7858 PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 7962

 

>AP005188.2b $P CYP51H2P = old CYP51A7P (japonica cultivar-group) chr 7 N-term fragment

orth to aaaa01003099.1a 100% after frameshift

52199 MDHLTSS (frameshift)

      TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 52375

52376 PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 52480

 

>AP005448.1a CYP51H2P = old CYP51A7P (japonica cultivar-group) chromosome 7 21 June 2002

29768 TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 29944

29945 PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 30049

 

#63

>aaaa01001626.1 $FI CYP51H3 = old CYP51A8 (indica cultivar-group) Cterm ONE FRAMESHIFT

ortholog to AP005188.2a 98%

22316 MQLTSSTVFWLTTAVAFLLINTVILRALQKRKSSPAAAAAAPPPPVVQGVGLVRFV

      RAMARDGPLEAIREQQAKLGSVFTASAPLGTFLIGSEVSSHFYVAPDSEISMGRLY

      EFTVPIFGPGVLYGVDLETRKEQIRFNWDILKPRSLKASVGAMAEEVE (0) 22795

23040 NYFSRWGDQGTVDLKHELEQVLMLTASRCLLGKELRER 23153 (FS)

23156 VPGKLCELFGELDNGLHLISGLLPYLPIPAH

23249 RRRDRARQRLGEIITEVIRSRRNSSRGAAGTDENNDDMLQCLINSRYKDGCAMTDAE 23419

23420 TAGLVVALMFAGKHTSSGVSIWTGVHLLSNPNHLAAVVAEQDRLMASCPGRTDDYHRLD 23596

23597 YDTVQEMRSLHCCVKEALRLHPPVAAVSQAYKHFTVQTKEGKEYTIPGGHMVVSTILVNH 23776

23777 YLPHIYKDPHVFDPQRFAPGREEEKVAGRFSFLSFSAGRHACAGESFSYTQIKVLWSYLL 23956

23957 SNFEIKMVSPFLETEWSTVIPEPKGKVMVSYRRRTAPK* 24073

 

>AP005188.2a $F CYP51H3 = old CYP51A8 (japonica cultivar-group) chr 7

12878 MDQLTSSTVFWLTTAVAFLLINTVILRALQKRKSSPAAAAAAAPPPVVQGVGLVRFVRAMAR 13063

13064 DGPLEAIREQQAKLGSVFTASAPLGLFKVTFLIGSEVSSHFYVASYSEISMGRLYEFTVP 13243

13244 IFGPGVLYGVDLETRKEQIRFNWDILKPRSLKASVGAMAEEVE 13372

13617 NYFSRWGDQGTVDLKHELEQVLMLTASRCLLGKELRER 13730

13727 ESVPGKLCELFGELDNGLHLISGLLPYLPIPAHRRRDRARQRLGEIITEVIRLRRNSSR 13903

13904 GAAGTDENNDDMLQCLINSRYKDGCAMTDAEIAGLVVALMFAGKHTSSGVSIWTGVHLLS 14083

14084 NPNHLAAVVAEQDRLMASCPGRTDDYHRLDYDTVQEMRSLHCCVKEALRLHPPVAAVRQA 14263

14264 YKHFTVQTKEDKEYTIPGGHMVVSTILVNHYLPHIYK

      DPHVFDPQRFAPGREEDKVAGRFSFLSFSAGRHACAGESFSYTQIKVLWSYLLSN 14540

14541 FEIKMVSPFPET 14576 (frameshift)

      QWSTVIPEPKGKVMVSYRRRTAPK* 14649

 

Note this cluster continues on AP005188.2b and 2c

 

#256

>aaaa01009323.1 CYP51H4 = old CYP51A9 (indica cultivar-group) 55% to AP005448.1b $F

orth of AP004890.1

6368 DTEVTGLLVALLFAGHHTSSTVAVWTALRLLTHPEHLRAVRAEQERLVAAAEQQRSHHGG 6547

6548 GGGGGIDYGVLLQMDVLHRCIKEALRLHPVTPMILRRARRGFTVRDKEGGEYSVPAGRLL 6727

6771 YKDPHVFDPDRFAAGRAEDKAVAGARDLAYLSFGAGKHACMGEGYAYQQIKVILSHLVSN 6950

6951 FELK 6962

 

>AP004890.1 $F CYP51H4 = old CYP51A9 (japonica cultivar-group) chr 2

78968 MDHIFSTNAWLTIALVFIITLAAKVVRSSVTLPAEKTSKPRPPPEAKGAPLV 79123

79124 GIIPAVLRRGLQAVIREQHRALGSVFTLRSLGLAVTFLVGPECSDHFFHAPELEIAID 79297

79298 GLYEVTVPIFGKEVGYDIDLDTRNEQHRFFAKMLRPAKLRGHVLPMVCEIEX 79450

79551 YFGKWGECGVVDLMQEVDHVLMLIASRCLLGKEVRENMFDEVASLFHELMGGMHLISMFF 79730

79731 PYLPTPGHRRRDKARAKLGEIFSQIVKTRKMSGRVEDDMLQDLIDSTYGDGRATT 79895

79896 DTEVTGLLVALLFAGHHTSSTVAVWTALRLLTHPEHLRAVRAEQERLVAAAEQQRSHHGG 80075

80076 GGGGGIDYGVLLQMDVLHRCIKEALRLHPVTPMILRRARRGFTVRDKEGGEYSVPAGRLL 80255

80256 ASPLVVNTLLPNIYKDPHVFDPDRFAAGRAEDKAVAGARDLAYLSFGAGKHACMGEGYAY 80435

80436 QQIKVILSHLVSNFELKLESPFPETEDMLSMRPKGKAIVSYKRRTLS 80576

 

#404

>aaaa01023253.1 CYP51H5 = old CYP51A10 (indica cultivar-group) orth of AP004090.1 $F chromosome 2

3179 YFAQWGEDGVVDIKYEMGNLILLIANRCLLGKQFGESKLEQVSTLLHELFDNGFHLISLF 3000

2999 FPYLPTPQHRRRDKARAMLGEMIHEAVRSRRNSGVAEDDVLQKFLDSKYINGRCMTENEI 2820

2819 AGLLICMMFAAQHTSSSTSTWTGACLLSHGHRSYLAAAIQEQKRIIQQHGDRINWGILLQ 2640

2639 MTTLTHCIKEALRLHP 2592

2584 LLIRHASKSFSVQTRQGHRYQIPKGHTLATCTTVGNRLPYIYKDPNVYDPSRFGPGR 2414

2412 EEDKVGGKFSYTPFSAGRHVCLGEDFAYMPN*GDMEPFAQGNFDLELISPFPEEEWEKFI 2233

2232 PGPKGKVMVTYKRRRL 2185

 

>AP004090.1 $F CYP51H5 = old CYP51A10 chr 2 clone OJ1399_H05 49% to 51A2

AQ843111.1 nbxb0005D03r CUGI Rice BAC genomic clone Length = 507 49% to 51A2

78158 MAVLFVATKMIQQRPRTLCLYEKENKEEELLLPPVMSVVSVLTAYLPTLIAKGLPAVIHDLH

77972 SRLVTLLVTAHFFQASESEIRQSNIYKVTVPVFGRGVLYDVDLATRSRQISFCTDS 77805

77804 IKPINLRGHVDSMVHEVE 77751 (0)

76666 GYFAQWGEDGVVDIKYEMGNLILLIANRCLLGKQFGESKLEQVSTLLHELFDNGFHLISLF 76484

76483 FPYLPTPQHRRRDKARAMLGEMIHEAVRSRRNSGVAEDDVLQKFLDSKYINGRCMTENEI 76304

76303 AGLLICMMFAAQHTSSSTSTWTGACLLSHGHRSYLAAAIQEQKRIIQQHGDRINWGILLQ 76124

76123 MTTLTHCIKEALRLHPPANLLIRHASKSFSVQTRQGHRYQIPKGHTLATCTTVGNRLPYI 75944

75943 YKDPNVYDPSRFGPGREEDKVGGKFSYTPFSAGRHVCLGEDFAYMQIKVIWSHLLRNFDL 75764

75763 ELISPFPEEEWEKFIPGPKGKVMVTYKRRRLL* 75665

 

#404

>aaaa01024682.1 CYP51H5 = old CYP51A10 (indica cultivar-group) orth of AP004090.1 $F chr 2

Nterm join with AAAA01023253.1 see this accession for ortholog

1522 LSMAVLFVATKMIQQRPRTLYLYEKENKEEELLLPPVMSVVSVLTAYLPTLIAKGLPAVI 1701

1702 HDLHSRLGSVFTVSVFGLKKVTLLVTAHFFQASESEIRQSNIYKVTVPVFGRGVLY 1869

1870 DVDLATRSRQISFCTDSIKPINLRGHVDSMVHEVE 1974

 

#246

>aaaa01008685.1 CYP51H6 = old CYP51A11 (indica cultivar-group) orth of AC108875.1a $F chr 5 100%

7539 DGRATTEAEVAGLITGVLFAAKHTSTHTSVWTGARLLTHEKFLAAAVDEQDQIVRKHGII 7360

7359 NGRIVTDHYGFLMEMHMLHICIKETLRLHPPAPMIVRTALRQFTVRTREGHEYCVPAGHT 7180

7179 MASPIVISNRVPYIYKDAHLYDPDRFGPRREEDKVGGKFSYTSFGGGRNSCVGENYA 7009

7008 YMQIKAIWSHLLRNF 6964

 

>AC108875.1a $F CYP51H6 = old CYP51A11 chr 5 51% to 51A2 same as AQ050946 AQ687182 AQ258479

58% to AP004090 this might require subfamilies in CYP51

70310 MELTSSSAMWLAMAILAITAALTKIALGGGRRRCLSESSDLTCKTPPPPPVVNCIALLGL 70489

70490 LPALFRGDVPATMQQLYAKFGSVFTVSVAGLLKATFLVGPEVSAHFFQGLESEVSHGDLF 70669

70670 EFTVPMFGKEVGHGVDNATRIEQGRFFAEALKPVRLRIHVDPMVQEVE 70813 (0?)

71263 DYFAKWGQHGTVDLKHELEQLLLLISGRCLLGKEVMGTKFDEVCNLFRDIEGGVNLMSV 71439

71440 FFPYTPLIPSNRRRDMARERLHAIFSDIVRSRKQQQGDQEEVNDKDVLQSFIDSR 71604 (2?)

71910 YKADGRATTEAEVAGLITGVLFAAKHTSTHTSVWTGARLLTHEKFLAAAVDEQDQIVR 72083

72084 KHGIINGRIVTDHYGFLMEMHMLHICIKETLRLHPPAPMIVRTALRQFTVRTREGHEYCV 72263

72264 PAGHTMASPIVISNRVPYIYKDAHLYDPDRFGPRREEDKVGGKFSYTSFGGGRNSCVGEN 72443

72444 YAYMQIKAIWSHLLRNFELKLLSPFPKTDWSKLVPEPQGKVMVSYKRRQL 72593

 

#276

>aaaa01010435.1 CYP51H7 = old CYP51A12 (indica cultivar-group) orth of AC108875.1b $F 99% chr 5 similar to 51A2

1984 WGQHGTVDLRRELEQLLLLISGRCLLGKEVMGTMFDEVCNLFRDIEGGVNLMSVFFPYTP 2163

2164 LIPSNRRRDMARKRLHAIFSDIVRSRKQREGDNVDKDVLQSLIDSRYKADGRATTEA 2334

2335 *VAALMICLLFAAKHTSAYTSVWTGARLLSHERFLTAAVDEQDKIAREHSNINGGGRITD 2514

2515 DRYGSLMEMRTLHSCIKETLRLHPPVPMLVRTAHKQFTVRTREGHEYAVPAGHTIASP 2688

2689 IVISNQVPYIYMDGHLYDPDRFGPAGREEDKVGGKFSYASFGGGRTGCVGEGYAYMQIK 2865

2866 AIWSHLLRNF 2895

 

>AC108875.1b $F CYP51H7 = old CYP51A12 chromosome 5 48% to 51A2

80009 MAIILAITAAVTKIARGGRRRSATDPTCKMPPPPPVVNSIALLRLLPTLFRSGLPA 80176

80177 ILHELYTKFGSVFTINLAGLLKMTFLVGPEVSAHFFQGLESEISHGNLLEFTVPMFGKEI 80356

80357 AHGVDSATRNEQARFFVDALKPARLRIHVDPMVQEVE 80467 (0?)

84741 DYFAKWGQHGTVDLRRELEQLLLLISGRCLLGKEVMGTMFDEVCNLFRDIEGGVNLMSV 84917

84918 FFPYTPLIPSNRRRDMARKRLHAIFSDIVRSRKQREGDNVDKDVLQSLIDSRYKADGRAT 85097

85098 TEA*VAALMICLLFAAKHTSAYTSVWTGARLLSHERFLTAAVDEQDKIAREHSNINGGGR 85277

85278 ITDDRYGSLMEMRTLHSCIKETLRLHPPVPMLVRTAHKQFTVRTREGHEYAVPAGHTIAS 85457

85458 PIVISNQVPYIYKDGHLYDPDRFGPAGREEDKVGGKFSYASFGGGRTGCVGEGYAYMQIK 85637

85638 AIWSHLLRNFELRLLSPLPKSDFTKFVPEPHGELMVSYKRRQL 85766

 

#276

>aaaa01067145.1 CYP51H7 = old CYP51A12 (indica cultivar-group) orth of AC108875.1b $F chromosome 5 1 diff see aaaa01010435.1 for ortholog

27  GRTGCVGEGYAYMQIKAIWSHLLRNFELR*LSPLPKSDFTKFVPEPHGELMVSYKRRQL 203

 

#140

>aaaa01004091.1 CYP51H8 = old CYP51A13 (indica cultivar-group) orth of AC108875.1c $F chr 5 similar to 51A2

12032 GSVIFPYIPIPSHIRRDKAHAKLAKIFSKIVRSRRDSNRPAEQDVLQYLIDSKHRDGSS 12208

12209 TTEQEVTGWIISMVFAGKHTSTNSTTWTGACLLTHDKFLTEALDEQKHMIQKHGDHIDYN 12388

12389 VLLDMDILHCCIKEALRMHPVAPIIYRKAQKSFVVRTREGDAYDIPEGHNLLSP 12550

12551 MIFNNRLPYIYKDPHMYDLDRFAPKREEDKVGGMFSYTSFGGGRHICIGEAYAYMQIKV 12727

12728 IWSHLLRNF 12754

 

>AC108875.1c $F CYP51H8 = old CYP51A13 chromosome 5 50% to 51A2

122577 MDLVSISIYMWAMAALCTIITAMVTTKLARVRRPITLNPKSKRPLPPVVNVIA 122735

122736 LLEHLPRLCTKGVIPVYKGSVFTVSLFGLKATFLVGPEVSAHFYQGMDSEISQGDLYEFT 122915

122916 VPLFGKGVGFDIDNATRTEHLRFFIDAIKTSKLRNHVNSMVQEVE 123050 (0?)

123296 DYFAKWGENGIVDIKHEFEKLLMLISGHCLLGKEVRDNMFDEVFSLF 123436

123437 HELDSGVGLGSVIFPYIPIPSHIRCDKAHAKLAKIFSKIVRSRRDSNRPAEQDVLQYLID 123616

123617 SKHRDGNSTTEQEVTGWIISMVFAGKHTSTNSTTWTGACLLTHDKFLTEALDEQKHMIQK 123796

123797 HGDHIDYNVLLDMDILHCCIKEALRMHPVAPIIYRKAQKSFVVRTREGDAYDIPEGHNLL 123976

123977 SPMIFNNRLPYIYKDPHMYDPDRFAPKREEDKVGGMFSYTSFGGGRHICIGEAYAYMQIK 124156

124157 VIWSHLLRNFELKLESPFPKTNWSKILLEPWGKVMVSYKRRRL 124285

 

#181

>aaaa01005681.1a CYP51H9 = old CYP51A14 (indica cultivar-group) orth AP003866.1a $F chr 7 >99%

2692 EQGTVDLKRELDLLILTIASRVLLGKEVRETMFADVVASFHELMDNSMHLISLC 2853

2854 FPNLPIPRHRRRDTASARLKELFSRAIQLRRGSGRAEDDVLQRFLESRYR 3003

3004 DGRAMSDNEITGMLIALVVAGQHMSSSASTWTGAFLLRDPKHLAAAVDEQRRLIG 3168

3169 DDRVDYDALTTGMSTLHRCIKEALRMHPPAPALVRTVRRGFAVRTREGKEYRMPAGHS 3342

3343 VVSYAAFNHRLGYVYRDPDEYDPERFGPERKEDRVAGKFSFTAFGGGRHACLGEHYAFLK 3522

3523 MKVIWSYLLRNFELELVSPFPEVEL 3597

 

>AP003866.1a $F CYP51H9 = old CYP51A14 chr 7 clone OJ1092_A07 53% to 51A2

AQ326645 and AQ291927 mid to K-helix region 52% identical to wheat CYP51

60% identical to AQ327456 68% to EST T88278 705 family

AQ689048.1 nbxb0078H10r CUGI Rice BAC genomic clone Length = 737

AQ396185.2 nbxb0066K16r CUGI Rice BAC genomic clone Length = 327

50920 MDMAAAAAVWFSAIAAVLLAASTIAVVVVAKMTGKRNGGAAAAAAAAAEAELPL

51082 PPVVSGVSLIIPVITRGPMAVADELYVKLGSVFT (0)

      GGFYSRPE 51261

51262 SEVHQGGTYRMTVPMFGRGVMYDVDVATRSEQIAVCFEALRPTKLRSSTVTMVRETE 51432 (0)

52114 EYFAKWGEQGTVDLKRELDLLILTIASRVLLGKEVRETMFADVVASFHELMDNSMHLI 52299

52300 SLCFPNLPIPRHRRRDTASARLKELFSRAIQLRRGSGRAEDDVLQRFLESRYRDGRAMSD 52479

52480 NEITGMLIALVVAGQHMSSSASTWTGAFLLRDPKHLAAAVDEQRRLIGDDRVDYDALT 52653

52654 TGMSTLHRCIKEALRMHPPAPALVRTVRRGFAVWTREGKEYRMPAGHSVVSYAAFNHRLG 52833

52834 YVYRDPDEYDPERFGPERKEDRVAGKFSFTAFGGGRHACLGEHYAFLKMKVIWSYLLRNF 53013

53014 ELELVSPFPEVELNNIMLGPRGEVMVRYKRRKLTST* 53124

 

no japonica ortholog found 9/11/02

 

#10

>aaaa01000238.1f $FI CYP71C12 (indica cultivar-group) AP003909.1a 99%

also aaaa01079567.1 (98%)

44400 MAEMLDGLRHDEQASLHAPQKASTMPTMSCSDLLLAMMCPLILLLIIFRCYAYATRSGGM 44221

44220 LSRVPSPPGRLPVIGHMHLISSLPHKSLRDLATKHGPDLMLLHLGAVPTLVVSSARTAQA 44041

44040 ILRTHDRVFASRPYNTIADILLYGATDVAFSPYGDYWRQIKKIVTMNLLTIKKVHSYGQT 43861

43860 RQQEVRLVMAKIVEEAATHMAIDLTELLSCYSNNMVCHAVSGKFFREEGRNQLFKELIEI 43681

43680 NSSLLGGFNLEDYFPSLARLPVVRRLLCAKAYHVKRRWDQLLDQLIDDHASKRRSSMLDN 43501

43500 NDEESDFIDVLLSIQQEYGLTKDNIKANLVVMFEAGTDTSYIELEYAMAELIQKPQLMAK 43321

43320 LQAEVRGVVPKGQEVVTEEQLGRMPYLKAVIKETLRLHPAAPLLVPHVSMVDCNVEGYTI 43141

43140 PSGTRVIVNAWAIARDPSYWENAEEFIPERFLGNTMAGYNGNNFNFLPFGTGRRICPGMN 42961

42960 FAIAAIEVMLASLVYRFDWKLPIDQAANGGIDMTETFGITIHLKEKLLLVPHLP* 42796

 

>AP003909.1a $F CYP71C12 chromosome 8 clone OJ1300_E01 55% to 71C4

orth aaaa01000238.1f

50394 MAQMLDGLRHDEQASLHAPQEASTMPTMSCSD

50298 LLLAMMCPLILLLIIFRCYAYATRSGGMLSRVPSPPGRLPVIGHMH 50161

50160 LISSLPHKSLRDLATKHGPDLMLLHLGAVPTLVVSSARTAQAILRTHDRVFASRPYNTIA 49981

49980 DILLYGATDVAFSPYGDYWRQIKKIVTMNLLTIKKVHSYGQTRQQEVRLVMAKIVEEAAT 49801

49800 HMAIDLTELLSCYSNNMVCHAVSGKFFREEGRNQLFKELIEINSSLLGGFNLEDYFPSLA 49621

49620 RLPVVRRLLCAKAYHVKRRWDQLLDQLIDDHASKRRSSMLDNNDEESDFIDVLLSIQQEY 49441

49440 GLTKDNIKANLVVMFEAGTDTSYIELEYAMAELIQKPQLMAKLQAEVRGVVPKGQEIVTE 49261

49260 EQLGRMPYLKAVIKETLRLHPAAPLLVPHVSMVDCNVEGYTIPSGTRVIVNAWAIARDPS 49081

49080 YWENAEEFMPERFLSNTMAGYNGNNFNFLPFGTGRRICPGMNFAIAAIEVMLASLVY 48910

48909 RFNWKLPIDQAANGGIDMTETFGITIHLKEKLLLVPHLP* 48790

 

#10 part

>aaaa01079567.1 CYP71C12 (indica cultivar-group) orth AP003909.1a $F chr 8

99% 98% to aaaa01000238.1f $FI see aaaa01000238.1f for ortholog

672 DQLLDQLIDDHASKRRSSMLDNNDEESDFIDVLLSIQQEYGLTKDNIKANLVVM 511

510 FEAGTDTSYIELEYAMAELIQKPQLMAKLQAEVRGVVPKGQEIVTEEQLGRMPYLKAVI 334

333 KETLRLHPAAPLLVPHVSMVDCNVEGYTIPSGTRVIVNAWAIARDPSYWENAEEFMPERF 154

153 LSNTMAGYNGNNFNFLPFGTGRRICPGMNFAIAAIEVMLASLVYRFDWKL 4

 

#11

>aaaa01000238.1g $PI CYP71C13P (indica cultivar-group) end of clone poor quality seq

allowing frameshifts (fs) and deletions this seq 95% to AP003909.1b

(plus strand)

46070 MAQMLGALLLFQDSQMSTMTRMSYSLLLPILCPLILLLLFRCYAYATRSGGL 46225

46226 LDKLPSPPGRLPLIGHMHLIGSFPHMSLRDLATKHGPDLMLLHLGTVPTLVVSSSRMAQV 46405

46406 ILRTHDRVFASRQQSAIT 46459 gap (frameshift) XILF (deletion and fs)

46485 YGDYWRQIKKIVTTNLLTI (fs) KKIRSYSQT (fs) RQQE (fs) VRL (fs) VM (fs)

      AKI*EATTHMAV 46628 (deletion)

(minus strand)

49427 LYAKAYDVKKRWDQLLDKLIDDHSSKHRSSLLDNNEX 49320

49320 ESDFIDVLLSIQQEYGLTKDNIKANLAIMFEAGTDTSFIELEYAMAELMQKPQMIAKLQA 49141

49140 EVRGVVSKGQEIVTEEHLGRMPYLKAVIKETLRLHPAAPLLAPHVSVVDCNVEGYTTPSG 48961

48960 TRVIVNAWAIAR (fs) DPSY*ENAEEF (fs)

      XQRFLSNTMADYNGNNFNFLPFWTGRRICPGINFA 48787

48786 ITTIEIMLASLVYRFDWKLFTSRIDMTETFGATIHLKEKLFLVPHLPQDN* 48658

 

>AP003909.1b $P CYP71C13P chromosome 8 clone OJ1300_E01, 4 in frame stops pseudogene

orth aaaa01000238.1g note this seq is out of order in this gene cluster

54948 MAQMLGALLLFQDSLMSTMTRMSY

54876 SLLLPILCPLILLLLFRCYAYATRSGGMLDKLPSPPGRLPLIGHM 54742

54741 HLIGFFPHMSLRDLATDLMLLHLGTVPTLVVSSSRMAQVILRTHDRVFASR*QSAI 54574

54573 TDILF*GATDVAFSPYGDYWRQIKKIVTTNLLTIKKIRSYSQTRQQEVRLVMAKIVEEAA 54394

54393 THMA IDLTELLSCYSNNMVCHAVSGMFFCEEGRNQLFKELI*INSSLLGGFNIEDYFPSL 54214

54213 ARLPVRRL LYAKAYDVKKRWDQLLDKLIDDHSSKHRSSLLDNNDVESDFIDVLLSIQQE 54037

54036 YGLTKDNIKANLAIMFEAGTDTSFIELEYSMAELM*KPQMIAKLQAEVRGVVSKGQDIVT 53857

53856 EEHLGRMPYLKAVIKETLRLHPAAPLLAPHVSVVDCNVEGYTIPSGTRVIVNAWAIARDP 53677

53676 SYWENAEEFMPERFLSNTMADYNGNNFNFLPFRTGRRICPGINFAITTIEIMLASLV 53506

53505 YRFDWKLFTSRIDMTETFGATIHLKEKLFLVPHLPQDN* 53389

 

#9

>aaaa01000238.1e $FI CYP71C14 (indica cultivar-group) AP003909.1c 99%

      MAVMLVPIPLLLLHQHHNHEHEH

40499 PSPVAPQPTMASYYTLLLALLCPLLLLLIKLCRAKTRDDELFDKLPSPPGRLPVIGHL 40326

40325 HLIGSLPYVSFRELAIKHGPDLMLLRLGTVPTLVVSSARAAQAILRTNDHVFASRTYSAV 40146

40145 TDILFYGSSDVAFSPYGEYWRQVKKIATTHLLTNKKVRSYSRARQQEVRLVMARINEAAV 39966

39965 ARTTVDLSELLNWFTNDIVCHAVSGKFFREEGRNQMFWELIQANSLLLSGFNLEDYFPNL 39786

39785 ARVTTVRRLLCAKAHNVNKRWDQLLDKLIDDHATKRSSSVLDLDNEESDFIDVLLSIQHE 39606

39605 YGLTRDNVKAILVIMFEGGTDTAYIELEYAMAELIRKPQLMAKLQAEVRSVVPRGQEIVT 39426

39425 EEQLGRMPYLKAVIKETLRLHLAGPLLVPHLSIAECDIEGYTIPSGTRVFVNAWALSRDP 39246

39245 SFWENAEEFIPERFLNSIAPDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLVYRF 39066

39065 DWEIPADQAAKGGIDMTEAFGLTVHRKEKLLLVPRLTQD* 38946

 

>AP003909.1c $F CYP71C14 chromosome 8 clone OJ1300_E01

orth aaaa01000238.1e

same as AP004462.1 152574-152287 region

58316 MAVMLVPIPLLLLHQHHNHEHEHPSPVAPQPTM

58217 ASYYTLLLALLCPLLLLLIKLCRAKTRDDELFDKLPSPPGRLPV 58086

58085 IGHLHLIGSLPYVSFRELAIKHGPDLMLLRLGTVPTLVVSSARAAQAILRTNDHVFASRT 57906

57905 YSAVTDILFYGSSDVAFSPYGEYWRQVKKIATTHLLTNKKVRSYSRARQQEVRLVMARIN 57726

57725 EAAVARTTVDLSELLNWFTNDIVCHAVSGKFFREEGQNQMFWELIQANSLLLGGFNLEDY 57546

57545 FPNLARVTTVRRLLCAKAHNVNKRWDQLLDKLIDDHATKRSSSVLDLDNEESDFIDVLLS 57366

57365 IQHEYGLTRDNVKAILVIMFEGGTDTAYIELEYAMAELIRKPQLMAKLQAEVRSVVPRGQ 57186

57185 EIVTEEQLGRMPY 57153 frameshift

57147 LKAVIKEMLRLHLAGPLLVPYLSIAECDIEGYTIPSGTRVFVNAWALSRDPSFWENAEEF 56968

56967 IPERFLNSIAPDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLVYRFDWEIPA 56797

56796 DQAAKGGIDMTEAFGLTVHRRRSSSLFLGSHKIK* 56692

 

#8

>aaaa01000238.1c $FI CYP71C15 (indica cultivar-group) AP003909.1d 99%

25643 LLLPVALLLLLLRFARATTLAGDRNSELLLSKLPSPPLRLPVIGHMHLVGSLPHVSLRD 25467

25466 LAAKHGRDGLMLVHLGSVPTLVVSSPRAAEAVLRTHDLAFASRPRAMVPDIISYGATDSC 25287

25286 YGPYGDHFRKVRKAVTVHLLNSHKVQAYRPAREEEVRLVIAKLRGAAAMAGAPVDMTELL 25107

25106 HSFANDLICRAVSGKFFREEGRNKLFRELIDTNASLLGGFNLEDYFPSLARTKLLSKVIC 24927

24926 VRAMRVRRRWDQLLDKLIDDHATRLVRRHDHDQQQDSDFIDILLYHQEEYGFTRDNIKAI 24747

24746 LVX 24741

24592 MFEAGTDTSYLVLESAMVELMRKPHLLAKLKDEVRRVIPKGQEIVNEDNIVDMVYLKAVI 24413

24412 KETLRLHPPAPLYIPHLSREDCSISGYMIPTGIRVFVNAWALGRDAKFWDMPDEFLPERF 24233

24232 MDSNIDFKGHDFHYLPFGSG*RMCPGIHSATVTLEIMLANLMYCFNWKLPAGVKEEDI 24059

24058 DMTEVFGLTVHRKEKLFLVP 23999

 

>AP003909.1d $F CYP71C15 chromosome 8 clone OJ1300_E01

orth aaaa01000238.1c

AQ868830.1 nbeb0032E11f CUGI Rice BAC genomic Length = 759 57% to 76C5

same as AP004462.1 139663-140091 region

68223 MELNNTEPLTASRAQAAAVFLLLPVALLLLLLRFARATTMAGDRNSELLLSKLPS 68387

68388 PPLRLPVIGHMHLVGSLPHVSLRDLAAKHGRDGLMLVHLGSVPTLVVSSPRAAEAVLRTH 68567

68568 DLAFASRPRAMVPDIITYGATDSCYGPYGDHFRKVRKAVTVHLLNSHKVQAYRPAREEEV 68747

68748 RLVIAKLRGAAAMAGAPVDMTELLHSFANDLICRAVSGKFFREEGRNKLFRELIDTNASL 68927

68928 LGGFNLEDYFPSLARTKLLSKVICVRAMRVRRRWDQLLDKLIDDHATRLVRRHDHDQQQD 69107

69108 SDFIDILLYHQEEYGFTRDNIKAILV 69185 (0)

69331 DMFEAGTDTSYLVLESAMVELMRKPHLLAKLKDEVRRVIPKGQEVVNEDNIVDMVYLKAV 69510

69511 IKETLRLHPPAPLYIPHLSREDCSISGYMIPTGIRVFVNAWALGRDAKFWDMPDEFLPER 69690

69691 FMDSNIDFKGHDFHYLPFGSGRRMCPGIHSATVTLEIMLANLMYCFNWKLPAGVKE 69858

69859 EDIDMTEVFGLTVHRKEKLFLVPQAA* 69939

 

#7

>aaaa01000238.1b $FI CYP71C16 (indica cultivar-group) AP003909.1e 100%

14489 LLPLALLFYFARAAISSRDSKTRELILSKLPSPPFKLPVIGHMHLIGPLPYVSLRDLAA 14313

14312 KHGRDGLMLVRLGSVPTLVVSSPRAAEAVLRTHDLAFASRPRSMVTDIIMYGALDSCFAP 14133

14132 YSDHFRSVKKVVTVHLLNSKRVQAYRHVREEEVRLVMARLRGAAAAAAAVDLSQTLQFFA 13953

13952 NDLICRAVSGKFLCEQGRNKVFRDLMEANSNLLGGFNLEAYFPGLARMPLISKLICARAI 13773

13772 RIRRRWDQLLDMLIDDHVASARDRAKNDDDDFIHVLLSLQDEYGFTRDHIKAISIX 13608

13134 MFEAGTDTSHLVLEYAMVELTRKPHILTKLQDEVRRITPKGQHMVTEDDIVGMVYLKAVI 12955

12954 KETLRLHAPGGFTIPHLAREDCNVDGYMIPAGTRVLINLWALSRDANYWDKPDEFLPERF 12775

12774 MDGSNKNTDFKGQDFQFLPFGSGRRMCPGIHSGKVTLEIMLANLVYCFNWKLPSGMKKE 12598

12597 DIDMTDVFGLAIHRKEKLFLVPQI 12526

 

>AP003909.1e $F CYP71C16 chromosome 8 clone OJ1300_E01

orth aaaa01000238.1b

same as AP004462.1 128584-129021 region

78935 MELILQLEAKTAAQAVVTVFFFFLLPLALLFYFARAAISSRDSKTRELILSKLPSPPFK 79111

79112 LPVIGHMHLIGPLPYVSLRDLAAKHGRDGLMLVRLGSVPTLVVSSPRAAEAVLRTHDLAF 79291

79292 ASRPRSMVTDIIMYGALDSCFAPYSDHFRSVKKVVTVHLLNSKRVQAYRHVREEEVRLVM 79471

79472 ARLRGAAAAAAAVDLSQTLQFFANDLICRAVSGKFLCEQGRNKVFRDLMEANSNLLGGFN 79651

79652 LEAYFPGLARMPLISKLICARAIRIRRRWDQLLDMLIDDHVASARDRAKNDDDDFIHVL 79828

79829 LSLQDEYGFTRDHIKAISI 79885 (0)

80359 DMFEAGTDTSHLVLEYAMVELTRKPHILTKLQDEVRRITPKGQHMVTEDDIVGMVYLKAV 80538

80539 IKETLRLHAPGGFTIPHLAREDCNVDGYMIPAGTRVLINLWALSRDANYWDKPDEFLPER 80718

80719 FMDGSNKNTDFKGQDFQFLPFGSGRRMCPGIHSGKVTLEIMLANLVYCFNWKLPSGMKK 80895

80896 EDIDMTDVFGLAIHRKEKLFLVPQIANY* 80982

 

#6

>aaaa01000238.1a $FI CYP71C17 (indica cultivar-group) orth of AP003909.1f

2 diffs N-terminal Met not identified

     MVVQLMLFFHDKFMAPMAEEPLPF

3340 VLIMIIILLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHVSL 3161

3160 RDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGSSN 2981

2980 ISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMREL 2801

2800 LGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLRRV 2621

2620 VSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLDRQ 2441

2440 QEYNLTRHNIHAILM (0) 2396

2206 DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTKMVSEDDLNNMPYLKAVV 2036

2035 KETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDARCWENSEEFMPERF 1856

1855 MDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMDK 1676

1675 DDVDMTDQFALTMARKEKLYLIPRSHVIKIT* 1580

 

#6

>AP003909.1f $F CYP71C17 chromosome 8 clone OJ1300_E01

AK067200 2151 bp mRNA linear PLN 24-JUL-2003

Oryza sativa (japonica cultivar-group) cDNA clone:J013097P19, full insert sequence.

orth aaaa01000238.1a

AZ127316.1 OSJNBb0086E03f CUGI Rice BAC genomic Length = 498 54% to 71A14

AQ871024.1 nbeb0042C09f CUGI Rice BAC genomic Length = 495 56% to 71B23

same as AP004462.1 147428-146820 region

63826 MVVQLMLFFHDKFMAPMAEEPLPFVLIMIII

63733 LLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHV 63581

63580 SLRDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGS 63401

63400 SNISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMR 63221

63220 ELLGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLR 63041

63040 RVVSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLD 62861

62860 RQQEYNLTRHNIHAILM 62810 (0)

62626 DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTQMVSEDDLNNMPYLKAV 62453

62452 VKETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDAKCWENSEEFMPER 62273

62272 FMDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMD 62093

62092 KDDVDMTDQFALTMARKEKLYLIPRSHVIKIT* 61994

 

#6 duplicate

>aaaa01000238.1d $FI CYP71C17 (indica cultivar-group) AP003909.1f 99%

this seq 100% identical to aaaa01000238.1a, probably an error in assembly

only count this gene once; see aaaa01000238.1a for ortholog

N-terminal Met not identified

      MVVQLMLFFHDKFMAPMAEEPLPF

30181 VLIMIIILLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHVSL 30360

30361 RDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGSSN 30540

30541 ISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMREL 30720

30721 LGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLRRV 30900

30901 VSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLDRQ 31080

31081 QEYNLTRHNIHAILM (0) 31137

31309 DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTKMVSEDDLNNMPYLKAVV 31485

31486 KETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDARCWENSEEFMPERF 31665

31666 MDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMDK 31845

31846 DDVDMTDQFALTMARKEKLYLIPRSHVIKIT* 31941

 

#105

>AP004232.1 $F CYP71C18 chromosome 1 clone OSJNBa0051H17 like CYP71C4

90% to aaaa01002989.1. Version 4 in GenBank does not allow for the frameshift and skips the beginning of the heme signature.

probably not an ortholog; no 99% match in indica 9/5/02

56483 MEQAAGLVYQLFQNEMFPWILALFPFLLLALHYLATNHRTPTTCKETRNHLSPPSP 56650

56651 PRLPIIGHLHLIGDLPHVSLRELAHRYGPNLMLLHLGQVQNLVVSSPHAAEAVLR 56815

56816 THDHVFASRPHSLIGDILLYGPSDVGLSPYGEQWRRSRRIVTTHLLTNKKVRSYHVAREE 56995

56996 E 56998 (0)

57929 VHKVMTKVHELSTKGMGVDMFELFSTYSNDLICRLVSGKNFQGDKGRNKMFRQLFKANY 58105

58106 VLLAGFNLEDYYPGLARLKAVSWVMCAKARNTRKLWDELLDEIINDRMSKQPCEHDRGND 58285

58286 DQDEMDFVDVLLLQERGITRDHLKAILV 58369 (0)

58462 DMFQAGTETTSVVLVFAMAELMQKPHLMAKLQAELRTNIPK*GRELITECDQTNMTYLKA 58641

58642 VIKETLRLHPPSPLLVPHLAMADCDIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 58821

58822 RFVDGGSAANVDFIGTDFQFLPFGAX 58896 frameshift

58899 RRICPGINFASASMEIILANLLYHFDWDVSAEAAIDKDGIDMAEAFGLSVQLKEKLLLVP 59078

59079 VEYKGSVQDSAVIL* 59123

 

>AK100062.1 CYP71C18 UniGene info Oryza sativa (japonica cultivar-group) cDNA

MEQAAGLVYQLFQNEMFPWILALFPFLLLALHYLATNHRTPTTCKETRNHLSPPSP 187

188 PRLPIIGHLHLIGDLPHVSLRELAHRYGPNLMLLHLGQVQNLVVSSPHAAEAVLRTHDHV 367

368 FASRPHSLIGDILLYGPSDVGLSPYGEQWRRSRRIVTTHLLTNKKVRSYHVAREEEVHKV 547

548 MTKVHELSTKGMGVDMFELFSTYSNDLICRLVSGKNFQGDKGRNKMFRQLFKANYVLLAG 727

728 FNLEDYYPGLARLKAVSWVMCAKARNTRKLWDELLDEIINDRMSKQPCEHDRGNDDQDEM 907

908 DFVDVLLLQERGITRDHLKAILVDMFQAGTETTSVVLVFAMAELMQKPHLMAKLQAELRT 1087

1088 NIPK*GRELITECDQTNMTYLKAVIKETLRLHPPSPLLVPHLAMADCDIDGYTVRSGTRV 1267

1268 IVNAWAIGRNSESWEAAEEFLPERFVDGGSAANVDFIGTDFQFLPFGAX 1411

1414 RRICPGINFASASMEIILANLLYHFDWDVSAEAAIDKDGIDMAEAFGLSVQLKEKLLLVP 1593

1594 VEYKGSV 1614

 

#447

>aaaa01059480.1 CYP71C19 (indica cultivar-group) orth of AP004233.1 100%

602 VHKVMAKVHELSTKGMAVDMTELFSTFSNDLICRLVSGKNFQGEGRNKLFRQLFKANSV 426

425 LLAGFNLKDYYPGLARLKAVSMVMCAKARNTRKLWDELLDEIIDERMSKQQCEHDEGNDQ 246

245 DEMNFVNVLLLQEQGITREHLKAIL (0)

81  DMYQAGTETSSVVLVFAMAELMQKPHL 1

 

>AP004233.1 $F CYP71C19 chromosome 1 clone OSJNBa0065J17 50% to CYP71C4

= AQ857130 duplicate of the AP004232 gene at 27203-29726

probably not ortholog only 91%

21862 MEQAAGLVYQLFQHEMFPWTFSVLALFPFLLLVLHYLATNHRTPTTCKETKNHHPP

21694 PPSPPRLPIIGHLHLIGGLLHVSLRELAHRYGPDLMLLHLGQVPNLIVSSPRAAEAVLR 21518

21517 THDLVFASRPYSLIADILLYGPSDVGLSPYGE*WRRRIITTHLLTNKKVRSYRVAREE 21344

21343 E 21335 (0)

      VHKVMAKVHELSTKGMAVDMTELFSTFSNDLICRLVSGKNFQGEGRNKLFRQLFKAN

      SVLLAGFNLKDYYPGLARLKAVSMVMCAKARNTRKLWDELLDEIIDERMS

      KQQCEHDEGNDQDEMNFVNVLLLQEQGITREHLKAIL (0)

20004 DMYQAGTETSSVVLVFAMAELMQKPHLMAKLQAELRTTIPKQGHELITERDLTDMTYLKA 19825

19824 VIKETLRLHPPTPLLLPHLAMADCNIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 19645

19644 RFVDDGSAANVDFIGTDFQFLPFGAGRRICPGINFASASMEIILANLLYHFDWDVSAEAA 19465

19464 IDKDGIDMAEAFGLSVQLKEKLLLVPVDYKDGMQDSAVILL* 19339

 

#135

>aaaa01003879.1 $FI CYP71C20 (indica cultivar-group) ortholog to AP004757.1a 97%

4645 MAQMLAAFLLDDLISHEHGHESLGAPPQAGTMAWYSLVLMTSLLFPLLVLLVMRCYVTRS 4824

4825 GAKLLDKLPSVPGRLPVIGHLHLIGSLPHISLRNLATKHSPDMMLLHLGAVPTLVVSSSR 5004

5005 VAQSILHTHDDIFASRPYSPIANILFYGATDVGFSPYNEYWWQIKKITTTHLLTVKKVRS 5184

5185 YVSARQREVRIVIARITEAASKHEVVDLTEMLSCYSNNIVCHVVCGKFS*KEGWNQLLRK 5364

5365 LVKVNTSLLGGFNIEDYFPSFTRLAAVRRLLLSCAKAHNINKRWDQLLEKLIDDHTTKHI 5544

5545 RSSSMLNHYDEEAGFIDVLLSIQHEYGLTKDNIKANLAAMLMAGTDTSFIELEYAMAELM 5724

5725 QKPHVMGKLQAEVRRVMPKGQDIVTEEQLGCMPYLKAVIKETLRLYPPAPLLMPHLSMSD 5904

5905 CNINGYTIPSGTRVIVNVWALARDSNYWENADEFIPERFIVNTLGDYNGNNFHFLSFGSG 6084

6085 RRIYPGINFAIATIEIMLANLVYRFDWELPADQAAKGGIDMTETFGVAVHRKEKLLLIPH 6264

6265 LHLR 6276

 

>AP004757.1a $F CYP71C20 52% to AF321858 Lolium rigidum 70% to AP003909

chromosome 6 clone P0652D10

      MAQMLAAFLLDGLISHEHGHESLGAPPQAGTMAWYSLVLMTS

79980 LLFPLLVLLVMRCYVTRSGAKLLDKLPSVPGRLPVIGHLHLIGSLPHISLRDLATKH 79810

79809 SPDMMLLHLGAVPTLVVSSSRVAQSILHTHDDIFASRPYSPIANILFYGATDVGFSPYNE 79630

79629 YWRQIKKITTTHLLTMKKVRSYVSARQREVRIVMARITEAASKHVVVDLTEMLSCYSNN 79453

79452 IVCHAVCGKFSLKEGWNQLLRELVKVNTSLLGGFNIEDYFPSFTRLAAVRRLLLSCAKA 79276

79275 HNINKRWDQLLEKLIDDHTTKHIRSSSMLNHYDEEAGFIDVLLSIQHEYGLTK 79117

79116 DNIKANLAAMLMAGMDTSFIELEYAMAELMQKPHVMGKLQAEVRRVMPKGQDIVTEEQLG 78937

78936 CMPYLKAVIKETLRLHPPAPLLMPHLSISDCNINGYTIPSGTRVIVNVWALARDSN 78769

78768 YWENADEFIPERFIVNTLGDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLV 78601

78600 YRFDWELPADQAAKGGIDMTETFGVAVHRKEKLLLIPHLHLR* 78472

 

#104

>aaaa01002989.1 $FI CYP71C21 (indica cultivar-group) 91% to AP004233.1

6020 MEQAAGLVYQLFQQEMFPWTFSVLALFPFLLL frameshift

     SLHYLATNNRTPTTCKETKNHHPPPPSPPRLPIIGHLHLIGDLLHVSLRELA 5770

5769 HRYGPDLMLLHLGQVP (?)

5107 NLIVSSPRAAEAVLRTHDLVFVSRPYSLIADILLYGPSDIGLSPYGEQWRQSRRIVTTHL 4928

4927 LTNKKVRSYRVAREEE 4871 (0?)

4088 VHKVMAKVHELSTKGMAVDMTELFSTFSNDLICRLVSGKNFQGDDGRNKLFRQLFKANS 3909

3908 VLLAGFNLEDYYPSLARLKAVSRVMCAKARKTRKLWDELLDKIIDDRMSKQQCEHDRGND 3729

3728 DQDEMDFVDVLLLQERGITREHLKAIL 3648 (0?)

3552 DMFQAGTETTSVVLVFAMAELMHKPHLMAKLQAELRTNISKQGQELLTECDLTNMTYLNA 3373

3372 VIKETLRLHPPTPLLLPHLAMADCDIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLSE 3193

3192 RFVDGGSAANVDLTGTDFQFLPFGAGRRICPGINFASASMEIILANLLYHFDWDVPAEAA 3013

3012 IDKAGIDMAEAFGLSVQLEEKLLLVPIEYKDSM 2914

 

#488

>aaaa01000238.1 CYP71C22P

Duplicated end of exon 1 from CYP71C17 = AP003909.1g

ESDFVDILLDHQQEYNLTRHNIHAILM 32576

 

#488

>AP003909.1g $P CYP71C22P chromosome 8 clone OJ1300_E01 lone pseudogene fragment

identical to Duplicated end of exon 1 on aaaa01000238.1a

61469 EQESDFVDILLDHQQEYNLTRHNIHAILM 61383

 

#488

>aaaa01000238.1a $FI CYP71C22P (indica cultivar-group) orth of AP003909.1g

Duplicated end of exon 1 same as AP003909.1g

1052 EQESDFVDILLDHQQEYNLTRHNIHAILM 966

 

#136

>aaaa01006247.1 CYP71C23P (indica cultivar-group) orth of AP004757.1b 2 diffs

2898 FAIATIEIMLANLVYRFDWEILVDQAAKGGIDMTEAFGLTVHRKQKLLLVSWLPQD 3065

 

>AP004757.1b $P CYP71C23P chr 6 Pseudogene fragment last exon similar to AP003909

103765 FAIATIEIMLANLVYRFDWEILVDQAAKGGIDMTEAFGLAVHRKEKLLLVSWLPQD* 103595

 

#28

>aaaa01000733.1 $FI CYP71E4 (indica cultivar-group) 99% to AC092559.2

5765 MATLLSKLLALPQQWQLLLLLLLLPIASLLLVIGRNTGGRRRRRHLRLPPGPARLPVLGN 5586

5585 LLQLGALPHRSLRDLARRHGPVMMLRLGAVPAVVVSSPEAAQEVLRTHDADCCSRPSSPG 5406

5405 PMRLSYGYKDVAFAPYDAYSRAARRLFVAELFSAPRVQAAWRARQDQ 5265

3896 VEKLIGKLTRPEPEPVELNDHIFALTDGIIGAVAFGSIYGTERFAGGGRKRFHHLLDDV 3720

3719 MDMLASFSAEDFFPNAAAARLFDHLTGLVARRERVFQQLDAFFEMVIEQHLDSDSSNAGG 3540

3539 GGGNLVGALIGLWKQGKQYGDRRFTRENVKAIIF 3438

3337 DAFIGGIGTSSVTILWAMAELMRSPRVMRKVQAEIRATVGDRDGGGMVQPDDLPRLAYLK 3158

3157 MVVKETLRLHPPATLLMPRETMRDVRIGGYEVAARTRVMVNAWAIGRDAARWEEAEVFDP 2978

2977 DRFEAKRVEFNGGHFELLPFGSGRRICPGIAMAAANVEFTLANLLHCFDWALPVGMAPEE 2798

2797 LSMEESGGLVFHRKAPLVLVPTRYIQL 2717

 

>AC092559.2 $F CYP71E4 chromosome 3 clone OSJNBb0096M04, 45% to 71B37

same as AC096688.3 chromosome 3

96529 MATLLSKLLALPQQWQLLLLLLLLPIASLLLVIGRNTGGRRRRRHLR

96388 LPPGPARLPVLGNLLQLGALPHRSLRDLARRHGPVMMLRLGAVPAVVVSSPEAAQEVLR 96212

96211 THDADCCSRPSSPGPMRLSYGYKDVAFAPYDAYGRAARRLFVAELFSAPRVQAAWRARQDQ 96017 (0)

94678 VEKLIGKLTRPEPEPVELNDHIFALTDGIIGAVAFGSIYGTERFAGGGRKRFHHLLDDVMDMLASFSAEDFFPNA 94454

94453 AAARLFDHLTGLVAHRERVFQQLDAFFEMVIEQHLDSDSSNAGGGGGNLVGALIGL 94286

94285 WKQGKQYGDRRFTRENVKAIIF 94220 (0)

94119 DAFIGGIGTSSVTILWAMAELMRSPRVMRKVQAEIRATVGDRDGGGMVQPDDLPRLAY 93946

93945 LKMVVKETLRLHPPATLLMPRETMRDVRIGGYEVAARTRVMVNAWAIGRDAARWEEAEVF 93766

93765 DPDRFEAKRVEFNGGHFELLPFGSGRRICPGIAMGAANVEFTLANLLHCFDWALPVGMAP 93586

93585 EELSMEESGGLVLHRKAPLVLVPTRYIQL* 93496

 

#296

>aaaa01011852.1 $FI CYP71E5 (indica cultivar-group) ortholog of AL731888.1

58% to AC092559.2 46% to 71B23

8074 MAISLITSLLFSLPQQWQP

8017 VVLTGLLPVIVSLVLLARKGRLKMPPGPEQVPLLGNLHQLAGPQPHRALRDLARVHGPV 7841

7840 MRLRLGKASAVVLTSAEAAWEALRGHDLDCCTRPVSAGTRRVTYGMKNVAFAPYGAYWR 7664

7663 EVRKLLMVELLSARRVKAAWYARHEQ (0) 7586

     VEKLLSTLRRAEGKPVALDEHILSLSDGIIGRVAFGNIYGSDKFSQNK

     NFQHALDDVMEMLSGEGSSAEDLQLPAAVGRLVDRLTGFAARRERIFRQLDSF

     FEMVIEQHLDPNRAPPENGGDLVDVLIDHWKKNEPRGTFSFTKDNVKAIIF (0)

6324 STFVAGIDTNAATILWAMSELARKPRVLKKVQAEIRAAVGVNGRVQPDDITKLSYLRK 6151

6150 VVKETLRLHPPTPLLLPRETMRHIQISGYDVPAKTRIYVNAWAIGRDPASWPDEPEEFNP 5971

5970 ERFEANEIDFKGEHPELMPFGTGRRICPGMAMAMANVEFTLANLLFAFQWSLPEGTTPDN 5791

5790 VCMEEEGRLVCHRKTPLVLVPTVYRHGLE* 5701

 

>AL731888.1 CYP71E5 chr 12

31348 MAISLITSLLFSLPQQWQPVVLTGLLPVIVSLVLLARKGRLKMPPGPEQVPLLGNLHQLA 31169

31168 GPQPHRALRDLARVHGPVMRLRLGKASAVVLTSAEAAWEALRGHDLDCCTRPVSAGTRRV 30989

30988 TYGMKNVAFAPYGAYWREVRKLLMVELLSARRVKAAWYARHEQ 30860

30546 VEKLLSTLRRAEGKPVALDEHILSLSDGIIGTVAFGNIYGSDKFSQNKNFQHALDDV 30376

30375 MEMLSGEGSSAEDLQLPAAVGRLVDRLTGFAARRERIFRQLDSFFEMVIEQHLDPNRAPP 30196

30195 ENGGDLVDVLIGHWKKNEPRGTFSFTKDNVKAIIF 30091

29601 STFVAGIDTNAATILWAMSELARKPRVLKKVQAEIRAAVGVNGRVQPDDITKLSYLRKVV 29422

29421 KETLRLHPPTPLLLPRETMRHIQISGYDVPAKTRIYVNAWAIGRDPASWPDEPEEFNPER 29242

29241 FEANEIDFKGEHPELMPFGTGRRICPGMAMAMANVEFTLANLLFAFQWSLPEGTTPDNVC 29062

29061 LEEEGRLVCHRKTPLVLVPTVYRHGLE 28981

 

#352

>aaaa01015254.1 CYP71E6 (indica cultivar-group) 94% to AC084319.5a 7 diffs

53% to AC092559.2

3134 LAVSVVLIFWSRHRRNPSSRLKLPPGPTRLPIIGNLHQIGRLPHRSLGALAGWHGPVMAL 3313

3314 WLGTVPVVVLSSPKAEREALQVHDPECCNRSPT 3412

 

>aaaa01021677.1 CYP71E6 (indica cultivar-group) orth AC084319.5a  99%

838 DTFVGGITTTSVTLHWAMSELIRNPRVMKKAQDEIRAVVGEKERVQHHDMPKLKYLK 668

667 MVVKETFRLHPPATLLVPRETTRHFKVGGYDIPEKTKVIVNEWAIGRDPNIWKDPEEFIP 488

487 ERFEEMDIDFNGAHFELVPFGSGRRICPGLAMGVANIEFILASMLFCFDWELPHGVRKED 308

307 IDMEEAGKLTFHKKIPLLLVP 245

 

>AC084319.5a CYP71E6 chr 3 GenBank translation is wrong at N-terminal

does not identify frameshift and conserved motifs PPGPXXLPIIGNL

same as AC084404.8 partial

2204 MAASLLLELLPQQWQLSITSLIL

2273 LAVSVVLIFWSRRRRNPSSRLKLPPGPTRLPIIGNLHQIGRLPHRSLEALAGWHG 2437 (fs)

2437 LGTVSVVVLSSPKAAREALKVHDPECCSRSPS 2532

2533 AGPRMLSYGYKDVAFSPYSNYVRNMRKLFVVELLSMRRVQAA 2658

8170 VMLPDYYCCM 8199

8599 VEKLIEKLTRNGRNAVAINEHIFSTVDGIIGTFALGETYAAEEFKDISETMDLLSSSSAE 8778

8779 DFFPGSVAGRLVDRLTGLAARREAIFRKLDRFFERIVDQHAAADDDGPAAARRKADDKGS 8958

8959 AGSDLVHELIDLWKMEGNTKQGFTKDHVKAMLL 9057

9159 DTFVGGITTTSVTLHWAMSELIRNPRVMKKAQDEIRAVVGEKERVQHHDMPKLKYLKMVV 9338

9339 KETFRLHPPATLLVPRETTRHFKVGGYDIPEKTKVIVNA*AIGRDPNIWKDPEEFIPERF 9518

9519 EEMDIDFNGAHFELVPFGSGRRICPGLAMGVANIEFILASMLFCFDWELPHGVRKEDIDM 9698

9699 EEAGKLTFHKKIPLLLVPTPNKAPN* 9776

 

>AC084404.8 CYP71E6 chr 3 incomplete = AC084319.5a

153211 MAASLLLELLPQQWQLSITSLIL 153279

153280 LAVSVVLIFWSRRRRNPSSRLKLPPGPTRLPIIGNLHQIGRLPHRSLEALAGWHG 153444 (fs)

153444 LGTVSVVVLSSPKAAREALKVHDPECCSRSPS 153539

153540 AGPRMLSYGYKDVAFSPYSNYVRNMRKLFVVELLSMRRVQAA 153665

 

#254

>aaaa01009177.1b CYP71K1 (indica cultivar-group) orth AP002968 $F 98%

1277 LYLLLLALLVAVPFLCLTRWSLRHGGGGGGRLPPSPWALPVIGHLHHVAGALPHRAMRDL 1456

1457 ARRHGPLMLLRLCELRVVVACTAEAAREVTKTHDLAFATRPITPTGKVLMADSVGVVFAP 1636

1637 YGDGWRTLRRICTLELLSARRVRSFRAVREEEVGRLLRAVAAAAAVAALTTPGATAAVNL 1816

1817 SERISAYVADSAVRAVIGSRFKNRAAFLRMLERRMKLLPAQCLPDLFPSSRAAML 1981

1982 VSRMPRRMKRERQEMMDFIDDIFQEHHESRAAAGAEEDLLDVLLRIQSQDKTNP 2143

2144 ALTNDNIKTVIX 2176

2244 DMFVASSETAATSLQWTMSELMRNPRVMRKAQDEVRRALAVAGQDGVTEESLPDLPYL 2423

2424 HLLIKESLRLHPPVTMLLPRECREPCRVMGFDVPEGVMVLVNAWAIGRDPAHWDSPEEFA 2603

2604 PERFEGGGAADFKGTDFEYIPFGAGRRMCPGMAFGLANMELALAALLYHFDWELPGGMLP 2783

2784 GELDMTEALGLTTRRCSDLLLVPAL 2858

 

>AP002968 $F CYP71K1 40% to 71B24 complement(1875..2513,2584..3501)

AP003204 40% to 71B24 CDS complement(121487..122125,122196..123113)

AQ870215.1 nbeb0036N08f CUGI Rice BAC genomic Length = 754 58% to 99A1

MAELPLYLLLLALLVAVPFLCLTRWSLRHGGGGGGRLPPSPWAL

PVIGHLHHVAGALPHRAMRDLARRHGPLMLLRLCELRVVVACTAEAAREVTKTHDLAF

ATRPITPTGKVLMADSVGVVFAPYGDGWRTLRRICTLELLSARRVRSFRAVREEEVGR

LLRAVAAAAAVAALTTPGATAAVNLSERISAYVADSAVRAVIGSRFKNRAAFLRMLER

RMKLLPAQCLPDLFPSSRAAMLVSRMPRRMKRERQEMMDFIDDIFQEHHESRAAAGAE

EDLLDVLLRIQSQDKTNPALTNDNIKTVIIDMFVASSETAATSLQWTMSELMRNPRVM

RKAQDEVRRALAIAGQDGVTEESLRDLPYLHLVIKESLRLHPPVTMLLPRECRETCRV

MGFDVPEGVMVLVNAWAIGRDPAHWDSPEEFAPERFEGVGAADFKGTDFEYIPFGAGR

RMCPGMAFGLANMELALAALLYHFDWELPGGMLPGELDMTEALGLTTRRCSDLLLVPA

LRVPLRDHER

 

#253

>aaaa01009177.1a $P CYP71K2P (indica cultivar-group) AP002968 $F 97%

only 363 bp away from start of second gene, cannot be complete gene

309 LYLLLLALLVAVPFLCLTRSSRRHGCGGGSRLPPSPWALPVIGHLHHVAGALPHRAMRDL 488

489 ARRHGPLMLLRLCELRVVVASTAEAAREVTKTHDLAFATRPITPTGKVLMADSVGVVFAP 668

669 YGDGWRTLRRICTLELLSARRVRSFRAVREEEVGRLLRAVAAAAAAAAAALTTPGATAAV 848

849 NLSERISAYVADSAVRAVIGSR 914

 

no ortholog found in japonica 9/13/02, may be indica unique pseudogene

does not exist on AP002968 or AP003204 so it might be a sequence assembly error.
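
The 363 bp figure in the note above follows from the coordinates listed for the two entries on this contig: aaaa01009177.1a ends at 914 and aaaa01009177.1b starts at 1277. A small worked check, with variable names chosen here for illustration and assuming both entries are numbered on the same strand of the same contig:

```python
# Coordinates copied from the two entries on contig aaaa01009177.1.
end_of_1a = 914      # last nucleotide listed for aaaa01009177.1a (CYP71K2P)
start_of_1b = 1277   # first nucleotide listed for aaaa01009177.1b (CYP71K1)

gap_bp = start_of_1b - end_of_1a        # distance between the two entries
max_extra_codons = gap_bp // 3          # upper bound, ignoring introns

print(gap_bp, max_extra_codons)
```

The gap leaves room for at most 121 further codons, which is too few for the remainder of a P450 of typical length (roughly 500 residues), consistent with the pseudogene call.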

 

#209

>aaaa01007181.1a CYP71K3 (indica cultivar-group) orth AP003990.1h $F chr 2 99%

N-term (orientation probably incorrect on either a or b)

1038 YLLLLPLLVVPLLYLAASSSRRSGRLRLPPGPWALPVIGHLHHLALAGAPTHRAMRDMAR 859

858  RHGPLMLLRFCELPVVVASSPDAAREIMRTHDVAFASRPIGPMLRLVFQGAEGVIFAPYG 679

678  DGWRQLRKICTVELLSHRRVHSFRPVRADELGRLLRAVADQAALSSSSPVNLTGMISAFV 499

498  ADSTVRAIIGSRSRHRDTFLRLVEDGLKIMPGMSLPDLFPSSRLAMLLSRVPAKI 334

333  ERRRRGMMGFIDTIIQEHQESRAAAEDEDLLDVLLRLQKDMDSQYPLTTMN 181

180  IKSILX 166

88    DMFGAGSETSATTLQWAMAELMRNPAV 2

 

>aaaa01007181.1b CYP71K3 (indica cultivar-group) orth AP003990.1h $F chr 2 100% C-term

6892 VMRRAQDEVRRELAVAGNDRVTEDTLPSLHYLRLVIKETLRLHPPAPLLLPRECGGACKV 7071

7072 FGYDVPAGTMVLVNAWAIGRDAAAWGAAAEEFSPERFERCERDFRGADFELIPFGAGRRI 7251

7252 CPGMAFGLAHVELALAALLFHFDWRLPGGMAAGEMDMTEAAGITVRRRSDL 7404

 

>AP003990.1h $F CYP71K3 chromosome 2 clone OJ1073_F05

66403 MATELTEYLLLLPLLVVPLLYLAASSSRRSGRLRLPPGPWALPVIGHLHHLALAGAPTHR 66582

66583 AMRDMARRHGPLMLLRFCELPVVVASSPDAAREIMRTHDVAFASRPIGPMLRLVFQGAEG 66762

66763 VIFAPYGDGWRQLRKICTVELLSHRRVHSFRPVRADELGRLLRAVADQAASSSS 66924

66925 SPVNLTGMISAFVADSTVRAIIGSRSRHRDTFLRLVEDGLKIMPGMSLPDLFPSSRLAML 67104

67105 LSRVPAKIERRRRGMMGFIDTIIQEHQESRAAAEDEDLLDVLLRLQKDMDSQYPLTTMN 67281

67282 IKSILI 67299 (0)

67380 DMFGAGSETSATTLQWAMAELMRNPAVMRRAQDEVRRELAVAGNDRVTEDTLPSLHYL 67553

67554 RLVIKETLRLHPPAPLLLPRECGGACKVFGYDVPAGTMVLVNAWAIGRDAAAWGAAAEEF 67733

67734 SPERFERCERDFRGADFELIPFGAGRRICPGMAFGLAHVELALAALLFHFDWRLPGGMA 67910

67911 AGEMDMTEAAGITVRRRSDLLVFAVPRVPVPAQ* 68012

 

#210

>aaaa01007181.1c CYP71K4 (indica cultivar-group) orth AP003990.1i $F chr 2 99%

8111 LPPGPWALPVIGHLHHLAGDLPHRALSALARRHGALMLLRLGEVQAVVASSPDAAREIMR 8290

8291 THDAAFASRPLSPMQQLAYARDAEGVIFAPYGDGWRHLRKICTGELLSARRVQSFRPVRE 8470

8471 AELVRLLRSVAEATSSSSSGSLVNLTELISAFVADSTVRAIIGSRFEHRDAYLRML 8638

8639 QDGLKIVPGMTLPDLFPSSRLALFLSRVPGR 8731

9019 DMFGAGSESSATVLQWTMAELMRNPRVMQKAQDEVRRALAGHDKVTEPNLTNLPYLRL 9192

9193 VIKETLRLHPPAPLLLPRKCGSTCKILGFDVPEGVMVIVNAWAIGRDPTYWDKPEEFVPE 9372

9373 RFEHNGRDFKGMDFEFIPFGAGRRICPGITFGMAHVELVL 9492 frameshift

9494 LYHFDWELPQGMAAKDLDMTEDFGVTTQRRSNLLVRPI 9607

 

>AP003990.1i $F CYP71K4 chromosome 2 clone OJ1073_F05

68586 MPLVVLLLATIPLLFFTIKRSAQRRGGGGGGEGRLPPGPWALPVIGHLHHLAGDLPHRA 68762

68763 LSALARRHGALMLLRLGEVQAVVASSPDAARDIMRTHDAAFASRPLSPMQQLAYGRDAEG 68942

68943 VIFAPYGDGWRHLRKICTAELLSARRVQSFRPVREAELGRLLRSVAEATSSSSSA 69107

69108 SLVNLTELISAFVADSTVRAIIGSRFEHRDAYLRMLQDGLKIVPGMTLPDLFPSSRLALF 69287

69288 LSRVPGRIEHHRQGMQRFIDAIIVEHQEKRAAAAANDDDDEDEDFLDVLLKLQKEMGSQH 69467

69468 PLTTANIKTVML (0)

      DMFGAGSESSATVLQWT 69647

69648 MAELMRNPRVMQKAQDEVRRALAGHDKVTEPNLTNLPYLRLVIKETLRLHPPAPLLLP 69821

69822 RKCGSTCKILGFDVPEGVMVIVNAWAIGRDLTYWDKPEEFVPERFEHNGRDFKGMDFEF 69998

69999 IPFGAGRRICPGITFGMAHVELVLSALLYHFDWELPQGMAAKDLDMTEDFGVTTQRRSNL 70178

70179 LVRPIHRVSVPVE* 70220

 

#211

>aaaa01007181.1d CYP71K5 (indica cultivar-group) orth AP003990.1j $F chr 2 99%

10296 LLILLGSERRTAARTRLPPGPWALPVVGHLHHLAGGLPPNRAMRDLARWHGPLMLLRLGE 10475

10476 VEX 10481 frameshift

10486 VVASSPDAAREIMRTHDVAFASRPVGPMSRLWFQGADGLVFAPYGEAWRRLRRVCTQELL 10665

10666 SHRRVQSFRPVREDELGRLLRAVDAAAAAGTAVNLTAMMSTYVADSTVRAIIGSRRLKDR 10845

10846 DAFLRMLDELFTIMPGMSLPDLFPSSRLAMLVSRAPGRIMRYRRRMRRIMDSII 11007

11008 HEHQERRAAADAAGDDDDDDDEDLVDVLLRLQKEVGAQYPLTTENIKTVM 11157

11249 QDIFGAASETSSTTLEWVMAELMRSPSAMRKAQDEVRRALAAGAAGHDTVTEDILPNLNY 11428

11429 LKLVVKETLRLHPPAPLLAPRRCDSPREVLVLGHDVPAGATVLVNAWAIGRDTAAWGGAA 11608

11609 EEFSPERFERCERDFRGADFELIPFGAGRRMCPGMAFGLVHVE 11737

 

>AP003990.1j $F CYP71K5 chromosome 2 clone OJ1073_F05

70828 MAGELAFYLLLVGLVAVPLLILLGSERRTAARTRLPPGPWALPVVGHLHHLAGGLPPHRA 71007

71008 MRDLARRHGPLMLLRLGEVEAVVASSPDAAREIMRTHDVAFASRPVGPMSRLWFQGADGL 71187

71188 VFAPYGEAWRRLRRVCTQELLSHRRVQSFRPVREDELGRLLRAVDAAAAAGT 71343

71344 AVNLTAMMSTYVADSTVRAIIGSRRLKDRDAFLRMLDELFTIMPGMSLPDLFPSSRLAML 71523

71524 VSRAPGRIMRYRRRMRRIMDSIIHEHQERRAAADAAGDDDDDDDEDLVDVLLRLQKEVGA 71703

71704 QYPLTTENIKTVMM 71745 (0)

71837 DIFGAASETSSTTLEWVMAELMRSPSAMRKAQDEVRRALAAGAAGHDTVTEDILPNLSYL 72016

72017 KLVVKETLRLHPPAPLLAPRRCDSPREVLVLGHDVPAGATVLVNAWAIGRDTAAWGGAA 72193

72194 EEFSPERFERCERDFRGADFELIPFGAGRRMCPGMAFGLVHVELALAALLFHFDWSLPG 72370

72371 GMAADELDMAESSGLTTRRRLPLLVVARPHAALPTKYCN* 72490

 

#249

>aaaa01012971.1b CYP71K6 (indica cultivar-group) orth AP003523.1c $F chr 6 99%

7052 LLRYLFSVPMLFFIVPLLFLVCSPGRRRGRGSCRLPPSPWALPVVGHLHHLAGALQHRAM 7231

7232 RDIARRHGPLVLLRLGRLPVVVASSADAAREVMRTSDVAFAARPVNRMIRVVFPEGSEGV 7411

7412 IFAPYGETWRQLRKICTAELLSARRVHSFRSVREEEAGRMLRAVASAAAQTTVNLSELM 7588

7589 SAYAADSSARAMIGRRLKDRDTFLAMVERGIKLFGEQSLPNLYPSSRLAVLLSTMP 7756

7757 RRMKRHRERMTAYLDAIIEEHQESRASREDDEDLLDVLL 7873

 

>AP003523.1c $F CYP71K6 chromosome 6 clone P0416A11 six different genes

73% to AP003523.1d 64% to AP003523.1f

128476 MAAELVHLLRYLFSVPM

128425 LFFIVPLLFLVCSPRRRRGRGSCRLPPSPWALPVVGHLHHLAGALQHRAMRDIARRHGPL 128246

128245 VLLRLGRLPVVVASSADAAREVMRTSDVAFAARPVNRMIRVVFPEGSEGVIFAPYGETWR 128066

128065 QLRKICTAELLSARRVHSFRSVREEEAGRMLRAVASAAAQTTVNLSELMSA 127913

127912 YAADSSARAMIGRRLKDRDTFLAMVERGIKLFGEQSLPNLYPSSRLAVLLSTMPRRMKRH 127733

127732 RERMTAYLDAIIEEHQESRASREDDEDLLDVLLRM 127628 frameshift

       QREGDLEVSRESIRSTIG bad exon boundary should be phase 0

126439 DMFIGGSEPPAITLQWIMAELIRNPEVMQKVQDEVRQLLVGQHRVTEESLSKLGY 126275

126274 MNLVIKETLRLHPPGPRLLLRVCRTTCQVLGFDVPKGTMVLVNMWAINRDPKYWSQAEEF 126095

126094 IPERFENAGINFKGTNFEYMPFGAGRRMCPGMAFGLATLELALASLLYHFDWKLPDGV 125921

125920 EIDMKEQSGVTTRRVHDLMLVPIIRVPLPV* 125828

 

#249

>aaaa01008944.1 CYP71K6 (indica cultivar-group) orth AP003523.1c $F chr 6 98%

see aaaa01012971.1b for ortholog

1338 DMFIGGSEPPAITLQWIMAELIRNPEVMQKVQDEVRQLLVGQHRVTEESLSKLGYMNL 1511

1512 VIKETLRLHPPGPRLLLRVCRTTCQVLGFDVPKGTMVLVNMWAINRDPKYWSQAEEFIPE 1691

1692 RFENAGINFKGTNFEYMPFGAGRRMCPGMAFSLVMLELALASLLYHFDWKLPDGVEIDM 1868

1869 KEQSGVTTRRVHDLMLVPII 1928

 

#42

>aaaa01001026.1a $PI CYP71K7P (indica cultivar-group) 3 defects, probable pseudogene

probable ortholog of AP003523.1d

     MAEVVQLHHLILLLPLFILPSSSSVR (fs)

3368 RRRGACGRLPPSPWALPVIGHLHHLAGALPHRAMRDIARRHGPLVLLRLGELPVVVIASS 3189

3188 ADAARNVMKTHDLAFATRPITHMMRLVFPEGSEGIIFSPYGETWRQLRKICTVELLSARR 3009

3008 VNSFRSVREEEVNRLLRAVAAAAASATSPAKMVNLSELMSAYAADSSVRAMIGRRCKDRD 2829

2828 KFLEMLERGIKLFVTPSLPDLYPSSRLAMVVSRMPRRMRRHREEVFAFLDAIIAEHQENR 2649

2648 ASGEDEEDLLDVLLRIQREGCMEST (fs) 2580

2572 PLLSTESIRTTIG bad boundary 0 expected 2540

1662 DLFNGGSETTATTLQWIMAELMRNPRVMQKAQDEVQRVFIGQHKVTEENLSNLSYMYLVI 1483

1482 KEALRLHPPGPPLLPRECRTTCQVLGFDVPKGTIVLVNMWAINRDPKYWDQSKEFIPERF 1303

1302 EHVDINFKGMNFEYMPFGAGRRMCPGMAFGLVNLELVLASLLYHFDWKLSDKIKVGDLDM 1123

1122 TEERGATTRRLHDLLLVPVIRVPLPLDSRS* 1030

 

>AP003523.1d $F CYP71K7P chromosome 6 clone P0416A11 six different genes

probable ortholog of aaaa01001026.1a

138301 MAEVVQLHHLILLLPLFILPFLLLRSSRRRRGACGRLPPSPWALPVIGHLHHLAGALPHRAMRDIA 138152

138151 RRHGPLVLLRLGELPVVVASSADAARDVMKTHDLAFATRPITRMMRLVFPEGSEGIIFSP 137972

137971 YGETWRQLRKICTVELLSARRVNSFRSVREEEVNRLLRAVAAAAASATSPAKTVNL 137804

137803 SELMSAYAADSSVRAMIGRRCKDRDKFLAMLERGIKLFVTPSLPDLYPSSRLAMVVSRMP 137624

137623 RRMRRHREEVFAFLDAIIAEHQENRASGEDEEDLLDVLLRIQREGCME 137480 frameshift

       SPLLSTESIRTTIG bad exon boundary should be phase 0

136562 DLFNGGSETTATTLQWIMAKLMRNPRVMQKAQDEVQRVFIGQHKVTEENLSNLSYMYL 136389

136388 VIKEALRLHPPRPPLLPRECRTTCQVLGFDVPKGTIVLVNMWAINRDPKYWDQSEEFILE 136209

136208 RFEHVDINFKGMNFEYMPFGAGRRMCPGMAFGLVNLELVLASLLYHFDWKL 136056

136039 GDLDMTQERGATTRRLHDLLLVPVIRVPLPLDSRS* 135942

 

#42

>aaaa01012971.1a CYP71K7P (indica cultivar-group) orth AP003523.1d $F chr 6 99%

see aaaa01001026.1a for ortholog

4777 GSETTATTLQWIMAKLMRNPRVMQKAQDEVQRVFIGQHKVTEENLSNLSYMYLVIKE 4607

4606 ALRLHPPGPPLLPRECRTTCQVLGFDVPKGTIVLVNMWAINRDPKYWDQSEEFILERFEH 4427

4426 VDINFKGMNFEYMPFGAGRRMCPGMAFGLVNLELVLASLLYHFDWKLSDKIKVGDLDMTE 4247

4246 ERGATTRRLHDLLLVPVI 4193

 

#43

>aaaa01001026.1b $PI CYP71K8 (indica cultivar-group) Pseudogene of AP003523.1e

japonica gene does not look like a pseudogene

      MAGFPVYL (deletion and fs) LAA (fs) LIILPMANLIRSARHRRLAGAR (fs)

16438 PPPGPWALPVIGHLHHLAGKLPHHHKLRDLAARHGPLMLLRFGELPVVVASSAGAAREITK 16620

16621 THDLAFATRPVTRTARLTLPEGAEGIIFAPYGDGWRQLRKICTLELLSARRVQSFRAVRE 16800

16801 EEVRRLLLAVASPSPEGTTATASVVNLSRMISSCVADSSVRAIIGSGRFKDRETFLRLME 16980

16981 RGIKLFSGPSLPDLFPSSRLAMLVSRVPGRMRRQRKEMMEFMDTIIEEHQAAREASM 17151

17152 ELEKEDLVDVLLRVQRDGSLQFSLTTDNIKAAIA (0) 17253

this segment is homologous to 108 aa region before the sequence gap at 17972

gene may not be assembled correctly

RAMIGSRFKDRN*FLAMLERGIKLFVTPSLPDLYPSSRLAMVVSRMPRRMRRHREEVFAFLDAIIAEHQENRASGEDEEDLLDVLLRIQREGCMESTVSTESIRTTIG (0)

      missing I-helix exon

19112 YLHLVIKETLRLHPPAPLLLARECREPCQILGFDVPKGAMVLINAWSIGRDPSNWHAPKK 19291

19292 FMPERFEQNNIDFKRTSFKYIPFGAGRRICPGMTFGLANIELLLASLLYHFDWELPHGMQ 19471

19472 AGDLDMTETLAVTARRKADLLVVPVVRVPIVG* 19570

 

>AP003523.1e $F CYP71K8 chromosome 6 clone P0416A11 six different genes AQ331067 AQ364007.2

AQ331067 55% identical to AQ328148 47% identical to C74921 57% to 71B4 58% to 76C1 64% to AP003523.1f

AQ364007.2 nbxb0060E04f CUGI Rice BAC Length = 393 65% to 99A1

End of this gene matches AP003571 at 155149

151229 MAGFPVYLLFLAALIILPMANLIRSARHRRLAGARRPPPGPWALPVIGHLHHLLAGKLPH 151408

151409 HHKLRDLAARHGPLMLLRFGELPVVVASSADAAREIAKAHDLAFATRPVTRTARLTLPEG 151588

151589 GEGVIFAPYGDGWRQLRKICTLELLSARRVLSFRAVREQEVRCLLLAVASPSPEGTTAT 151765

151766 ASVVNLSRMISSCVADSSVRAIIGSGRFKDRETFLRLMERGIKLFSCPSLPDLFPSSR 151939

151940 LAMLVSRVPGRMRRQRKEMMEFMETIIEEHQAARQASMELEKEDLVDVLLRVQRDGSLQF 152119

152120 SLTTDNIKAAIA 152155 (0)

166133 DLFIGGSETAATTLQWAMSELLNNPKVMQKAQDEIRQVLYGQERITEETISSLHYLHL 166306

166307 VIKETLRLHPPTPLLLPRECREPCQILGFDVSKGAMVLINAWSIGRDPSNWHAPEKFMPE 166486

166487 RFEQNNIDFKETSFEYIPFGAGRRICPGMTFRLANIELLLASLLYHFDWELPYGMQAGD 166663

166664 LDMTETLAVTARRKADLLVVPVVRVPIVG* 166753

 

#44

>aaaa01001026.1c $FI CYP71K9 (indica cultivar-group) C-terminal differs from AAAA01001026.1b and AP003523.1f, may be a frameshift (check)

may be ortholog of AP003523.1f 95%

20983 MAAAASSVLAYLLVVALLAIVVPFVYFNRVARRRGGDVRLPPSPWGLPVIGHLHHLVGAL 21162

21163 PHVAMRDLARRHGPLMLLRLGELPVVVASSAEAAREVMKTRDLDFATRPMSRMARLVFPE 21342

21343 GGEGIIFAPYGDRWRELRKICTVELLSARRVQSFRPVREEEAGRLLRAVAAASSPSPAQ 21519

21520 AAVNLSALLSAYAADSAVRAIIGSRFKDRDKYLMLLERGLKLFARHTLPDLYPSSRLAMW 21699

21700 LSRMPRRMMQHRREAYAFTDAIIREHQENRAAGAGDDKEDLLDVLLRIQREGDLQF 21867

21868 PLSTERIKTTVG (0) 21903

22325 DMFAGGSETAGTALQWIMAELIRNPRVMHKVQDEVRQTLAGRDRVTEDAISNLNYMHL 22498

22499 VIKEVLRLHPPVPLLLPRECRNTCQVLGFDVPKGAMVLVNAWAISRDPQYWDEPEEFIPE 22678

22679 RFEDSNIDFKGTNFEYTPFGAGRRMCPGIAFGLANVELMLASLLYHFDWQLPDGMDTADL 22858

22859 DMTEEMVVSARRLXXXXXXXVVHVPLPVASS 22951

 

>AP003523.1f $F CYP71K9 chromosome 6 clone P0416A11 six different genes

AU096456 AU096455 71% to AP002968 65% to 71B24 also = AU032983

may be ortholog of aaaa01001026.1c 95%

168538 MAAAASSVLAYLLVVALLAIVPLVYFGWVARRRGEGGRLPPSPWGLPVIGHLHHLAGALPHHAMRDLA 168741

168742 RRHGPLMLLRLGELPVVVASSAEAAREVMRTRDIEFATRPMSRMTRLVFPAGTEGIIFAP 168921

168922 YGDEWRELRKVCTVELLSARRVQSFRAVREEEVGRLLRAVAATSSSPSPAQAAVNL 169089

169090 SALLSAYAADSAVHAIIGSRFKDRDKYLMLLERGLKLFARHTLPDLYPSSRLAMWLSRMP 169269

169270 RRMMQHRREAYAFTDAIIREHQENRAAGAGDGDGDDKEDLLDVLLRIQREGDLQFPLSTE 169449

169450 RIKTTVG 169470 (0)

169893 DMFAGGSETAGTALQWIMAELIRNPRVMHKVQDEVRQTLAGRDRVTEDAISNLNYMHL 170066

170067 VIKEALRLHPPVPLLLPRECRNTCQVLGFDVPKGAMVLVNAWAISRDPQYWDEPEEFIPE 170246

170247 RFEDSNIDFKGTNFEYTPFGAGRRMCPGIAFGLANVELMLASLLYHFNWQLPDGMDTAD 170423

170424 LDMTEEMVVSARRLHDLLLVPVVHVPLPVASS* 170522

 

#45

>aaaa01001026.1d $FI CYP71K10 (indica cultivar-group) orth of AP003571.1h $F 99%

28790 MAAFLVYVLVLVPLAVVPFVYFNRVARRRGGDVRLPPSPWGLPVIGHLHHLVGALPHVA 28966

28967 MRDLARRHGPLMLLRLGELPVVVASSAEAAREVMKTRDLDFATRPMSRMARLVFPEGGEG 29146

29147 IIFAPYGDRWRELRKICTVELLSARRVQSFRPVREEEAGRLLRAVAAASPGQAVN 29311

29312 LSELLSAHAADSSVRAIMGDRFRDRDAFLAMLERGLKLFAKPALPDLYPSSRLAMLLSRM 29491

29492 PRRMKQHHRDMVAFLDAIIQEHQENRSAAGDDDDNDLLDVLLRIQREGDLQFPLSS 29659

29660 ESIKATIG (0) 29683

29867 DMLVGGSETAATTLHWIMAELVRNPKVMQKAQDEIRRELIGHRKVTEDTLCRLNYMHMVI 30046

30047 KEALRLHPPGSLLLPRECRRTCQVLGYDIPKGATVFVNVSAIGRDPKYWDEAEEFIPERF 30226

30227 EHSDVDFKGTHFEYTPFGAGRRMCPGMAFGLANVELTLASLLYHFNWELPSGIHAENLDM 30406

30407 TEEMRFTTRRLHDLVLIPVVHVPLPTI* 30490

 

>AP003571.1h $F CYP71K10 chromosome 6 clone P0458E02 continuation of contig AP003523.1

40% to 71B23

AQ687385 nbxb0074N19f 50% to 71B20

AQ258331 nbxb0020M04r 71-like sequence 36% to 71B33

145371 MAAFLVYVLVLVPLAVVPFVYFNRVARRRGGDVRLPPSPWGLPVIGHLHHLVGALPH 145201

145200 VAMRDLARRHGPLMLLRLGELPVVVASSAEAAREVMKTRDLDFATRPMSRMARLVFPEG 145024

145023 GEGIIFAPYGDRWRELRKICTVELLSGRRVQSFRPVREEEAGRLLRAVAAASPG 144862

144861 QAVNLSELLSAHAADSSVRAIMGDRFRDRDAFLAMLERGLKLFAKPALPDLYPSSRLAM 144685

144684 LLSRMPRRMKQHHRDMVAFLDAIIQEHQENRSAAADDDNDLLDVLLRIQREGDLQFPLS 144508

144507 SESIKATIG 144481 (0)

144297 DMLVGGSETAATTLHWIMAELVRNPKVMQKAQDEVRRELIGHRKVTEDTLCRLNYMHM 144124

144123 VIKEALRLHPPGSLLLPRECRRTCQVLGYDIPKGATVFVNVSAIGRDPKYWDEAEEFIPE 143944

143943 RFEHSDVDFKGTHFEYTPFGAGRRMCPGMAFGLANVELTLASLLYHFNWELPSGIHAENL 143764

143763 DMTEEMRFTTRRLHDLVLIPVVHVPLPTI* 143674

 

#248

>aaaa01008885.1 $FI CYP71K11 (indica cultivar-group) almost same as AAAA01011410.1

ortholog to AC118346.1a gene 1

6884 MADQLVHLPQQLLVLLLFIAPFFFFFLIRSMRRRDGGSVRLPPSPWALPVIGHLHHLMGA 6705

6704 LPPHHAMRDIALRHGPLVRLRLGGLQVILASSVDAAREVMRTHDLAFATRPSTRVMQLVF 6525

6524 PEGSQ (0)

     GIVFTPYGDSWRNLR 6345

6344 KICTVELLSAKRVQSFRPIREEEVGRLLRAVAAASPARRAVNLSELISAYSADSTMRALI 6165

6164 GSRFKDRDRFLMLLERGVKLFATPSLPDLYPSSRLAELISRRPRQMRRHRDEVYAFLDII 5985

5984 IKEHQENRSSSDDQEDLDLVDVLLRIQRKGDFPLSTDNIKTTIG 5853

5758 DLFNGGSETTATTLKWIMAELVRNPRVMQKAQDEVRRALGKHHKVTEEALKNLSYLHL 5585

5584 VIKEGLRLHPPGLPLLLRESRTTSQVLGFDVPQGTMILVNMWAISRDPMYWDQAEEFIPE 5405

5404 RFEHVNIDYYGTDVKYMPFGVGRRICPGIAFGLVNLELVLASLLYHFDWELPDGTELGNL 5225

5224 DMKEEMGAIARRLHDLSLVPVIRHPLPVDM 5135

 

>AC118346.1a $F CYP71K11 Gene 1, 94448-96200, 3 exons 97% identical to gene 2 (12 diffs)

36% to 41% with 71As and 71Bs

94448 MADQLVHLPQQLLVLLLFIAPFFFFFLIRSMRRRDGGSVRLPPSPWALPVIGHLHHLMGA 94627

94628 LPPHHAMRDIALRHGPLVRLRLGGLQVI 94711

94712 LASSVDAAREVMRTHDLAFATRPSTRVMQLVFPEGSQ (0)

      GIVFTPYGDSWRNLRKICTVELLSAKRVQSFRPIREEEVGRLL 95071

95072 RAVAAASPARRAVNLSELISAYSADSTMRALIGSRFKDRDRFLMLLERGVKLFATPSLPD 95251

95252 LYPSSRLAELISRRPRQMRRHRDEVYAFLDIIIKEHQENRSSSDDQEDLDLVDVLLRIQR 95431

95432 KGDFPLSTDNIKTTIG (0) 95479

95574 DLFNGGSETTATTLKWIMAELVRNPRVMQKAQDEVRRALGKHHKVTEEALKNLSYLHL 95747

95748 VIKEGLRLHPPGLPLLLRESRTTSQVLGFDVPQGTMILVNMWAISRDPMYWDQAEEFIPE 95927

95928 RFEHVNIDYYGTDVKYMPFGVGRRICPGIAFGLVNLELVLASLLYHFDWELPDGTELGNL 96107

96108 DMKEEMGAIARRLHDLSLVPVIRHPLPVDM* 96200

 

#286

>aaaa01011410.1 $FI CYP71K12 (indica cultivar-group) Ortholog to AC118346.1b gene 2

4935 MADQLVHLPQQLLVL

     LLFIAPFFFFFLIRSIRRRDGGSVRLPPSPWALPVIGHLHHLMGALPPQHAMRNIALRH 5156

5157 GPLVRLRLGGLQVILASSVDAAREVMRRHDLAFATRPSTRVMQLVFPEGSQ (0) 5309

5428 GIVFTPYGDSWRNLRKICTVELLSAKRVQSFRPIREEEVGRLLRAVAAASPARRAVNLSE 5607

5608 LISAYSADSTMRALIGSRFKDRDKFLMLLERGVKLFATPSLPDLYPSSRLAELISRRPR 5784

5785 QMRRHRDEVYEFLDIIIKEHQENRSSSDDQEDLDLVDVLLRIQRKGDFPLSTDNIKTTIG (0) 5964

6058 DLFNGGSETTATTLKWIMAELIRNPRVMQKAQDEVRQVLGKHHKVTEEALR 6210

6211 NLSYLHLVIKEGLRLHPPGLPLLLRESRTTSQVLGFHVPQGTMILVNMWAISRDPMYWD 6387

6388 QAEEFIPERFEHVNIDYYGTDVKYMPFGVGRRICPGIAFGLVNLELVLASLLYHFNWELP 6567

6568 DETELGNLDMKEEMGAIARRLHDLSLVPVIRHPLPVDM* 6684

 

>AC118346.1b $F CYP71K12 (japonica cultivar-group) chromosome 11 clone Ba0039F06,

Gene 2 = AU096586.1, D48250 97% identical to gene 1 (12 diffs)

113634 MADQLVHLPQQLLVLLLFIAPFFFFFLIRSIRRRDGGSVRLPPSPWALPVIGHLHHLMGA 113813

113814 LPPQHAMRNIALRHGPLVRLRLGGLQVI 113897

113898 LASSVDAAREVMRTHDLAFATRPSTRVMQLVFPEGSQ (0) 114008

114127 GIVFTPYGDSWRNLRKICTVELLSAKRVQSFRPIREEEVGRLLRAVAAASPARRAVNL 114300

114301 SELISAYSADSTMRALIGSRFKDRDKFLMLLERGVKLFATPSLPDLYPSSRLAELISRRP 114480

114481 RQMRRHRDEVYEFLDIIIKEHQENRSSSDDQEDLDLVDVLLRIQRKGDFPLSTDNIKTTIG (0) 114663

       DLFNGGSETTATTLKWIMAELIRNPRVM 114840

114841 QKAQDEVRQVLGKHHKVTEEALRNLSYLHLVIKEGLRLHPPGLPLLLRESRTTSQVLGFH 115020

115021 VPQGTMILVNMWAISRDPMYWDQAEEFIPERFEHVNIDYYGTDVKYMPFGVGRRICPGIA 115200

115201 FGLVNLELVLASLLYHFNWELPDETELGNLDMKEEMGAIARRLHDLSLVPVIRHPLPVDM* 115383
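
Notes such as "97% identical to gene 1 (12 diffs)" can be reproduced by counting mismatched positions between two ungapped peptide strings of equal length. The sketch below is only an illustration: the function names are made up here, and the method actually used to obtain the percentages in this file is not stated. It compares the first 60 residues of gene 1 and gene 2 as listed above.

```python
def count_diffs(a: str, b: str) -> int:
    """Count mismatched positions between two equal-length, ungapped sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be the same length (gaps are not handled)")
    return sum(1 for x, y in zip(a, b) if x != y)

def percent_identity(a: str, b: str) -> float:
    """Percent of positions that are identical in the ungapped comparison."""
    return 100.0 * (len(a) - count_diffs(a, b)) / len(a)

# First peptide line of AC118346.1a (gene 1) and AC118346.1b (gene 2).
gene1 = "MADQLVHLPQQLLVLLLFIAPFFFFFLIRSMRRRDGGSVRLPPSPWALPVIGHLHHLMGA"
gene2 = "MADQLVHLPQQLLVLLLFIAPFFFFFLIRSIRRRDGGSVRLPPSPWALPVIGHLHHLMGA"

print(count_diffs(gene1, gene2), round(percent_identity(gene1, gene2), 1))
```

Over these 60 residues the two genes differ at one position (M versus I). Running the same count over the full-length peptides would give the total number of differences, provided the two sequences align without gaps.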

 

#287

>AC118346.1c Gene 3 $P CYP71K13P pseudogene 56% to AC118346.1 genes 1, 2 no ortholog

117756 DAAREVMRTHDLAFATRPSTRVMQLVFLEGSQ 117661

117553 GDRFTPYGDIWRNLRRSAPLAVSAKRVQFFRPIHQEEVCRLLQAVAVASPA 117395

117394 RGPPETLTSSFRPTWATLQCAP**GARLRDRDKSLMLLYRGVKPIRHARACQIFTQSIAL 117215

117214 ADLIIKSLSPMRRASYPMSNLLDIIFK 117134

117108 SDNHMDLTLVAFLLRFHKKGACPLSFCYIRKQFG*AF 116998

 

#172

>aaaa01005413.1 $FI CYP71P1 (indica cultivar-group) ortholog to AL713951.1

5862 MSLALLVLSAAYVLVALRRSRSSSLKPRRLPPSPPGWPVIGHLHLMSGMPHHALAELARTMRAPLFRMRL

     GSVPAVVISKPDLARAALTTNDAALASRPHLLSGQFLSFGCSDVTFAPAGPYHRMARR

     VVVSELLSARRVATYGAVRVKELRRLLAHLTKNTSPAKPVDLSECFLNLANDVLCRVAFG

     RRFPHGEGDKLGAVLAEAQDLFAGFTIGDFFPELEPVASTVTGLRRRLKKCLADLREACD

     VIVDEHISGNRQRIPGDRDEDFVDVLLRVQKSPDLEVPLTDDNLKALVL (0) 4972

4875 DMFVAGTDTTFATLEWVMTELVRHPRILKKAQEE

4773 VRRVVGDSGRVEESHLGELHYMRAIIKETFRLHPAVPLLVPRESVAPCTLGGYDIPARTR 4594

4593 VFINTFAMGRDPEIWDNPLEYSPERFESAGGGGEIDLKDPDYKLLPFGGGRRGCPGYTFA 4414

4413 LATVQVSLASLLYHFEWALPAGVRAEDVNLDETFGLATRKKEPLFVAVRKSDAYEFKGEE 4234

4233 LSEV* 4219

 

>AL713951.1 $F CYP71P1 chromosome 12 clone Monsanto- 39% to 83B1

AF088221 BI305808.1 49% to 76C6 mRNA

44616 MSLALLVLSAAYVLVALRRSRSSSSKPRRLPPSPPGWPVIGHLHLMSGMPHHALAELART 44437

44436 MRAPLFRMRLGSVPAVVISKPDLARAALTTNDAALASRPHLLSGQFLSFGCSDVTFAPAG 44257

44256 PYHRMARRVVVSELLSARRVATYGAVRVKELRRLLAHLTKNTSPAKPVDLSECFLNLAND 44077

44076 VLCRVAFGRRFPHGEGDKLGAVLAEAQDLFAGFTIGDFFPELEPVASTVTGLRRRLKKCL 43897

43896 ADLREACDVIVDEHISGNRQRIPGDRDEDFVDVLLRVQKSPDLEVPLTDDNLKALVL (0)

      DMFVAGTDTTFATLEWVMTELVRHPRILKKA 43537

43536 QEEVRRVVGDSGRVEESHLGELHYMRAIIKETFRLHPAVPLLVPRESVAPCTLGGYDIPA 43357

43356 RTRVFINTFAMGRDPEIWDNPLEYSPERFESAGGGGEIDLKDPDYKLLPFGGGRRGCPGY 43177

43176 TFALATVQVSLASLLYHFEWALPAGVRAEDVNLDETFGLATRKKEPLFVAVRKSDAYEFK 42997

42996 GEELSEV* 42973

 

#264

>aaaa01009895.1 $PI CYP71P2P (indica cultivar-group) orth of AP003544.1

6273 GSMPAMVISKPNLARPALTTNDAVLASRQHLLNG*FLSFG frameshift

     CSDVTFAPAGPYHRM frameshift

     QMAR 6094

6093 GVEVSELLSAHHVAMYGVVRVKELQRLLAHLTNNTSSAKPIDLSECFLNLANDVLCRVAF 5914

5913 GRRFPRDEGDKLSAVLANAQDL 5848 frameshift

5848 LAGFTISDFFLELEPVASTVTGLCHRLKKCLADLYEACDVIVDVHISGNRQRIPSDREED 5669

5668 FVDVLLRVQ 5642

 

>AP003544.1 $P CYP71P2P chr 6 clone P0599C12 same as AP003686.1 8668-8037 pseudogene of AL713951.1

this gene matches a barley EST BF255745.2 75% and a sorghum EST BE354971.1 79%

so there must be a functional copy of this gene in rice 

107880 GSMPAVVISKPNLARPALTTNDAVLASRQHLLNG*FLSF 107764 frameshift

107762 GCSDVTFAPAGPYHRM 107715 frameshift

107713 QMARGVEVSELLSAHHVAMYGVVRVKELQRLLAHLTKNTSSAKPIDLSECFLNLANDVLCRVAF 107521

107520 GRRFPRDEGDKLSAVLANAQDLL 107452 frameshift

107452 AGFTISDFFLELEPVASTVTGLCHRLKKCLADLCEACDVIVDVHISGNRQRIPSDREEDFVDVLLRVQ 107249

 

 

#127

>aaaa01003512.1 i CYP71Q1 (indica cultivar-group) ortholog to AP004346.1a 95%

6414 SYNYLDVAFAPYSDYWREMRKLFVVELTSVSRVHSFAYARTAEVARLVDTLAASPPGVPF 6593

6594 DISCTLYQLLDGIIGTVAFGKVYGAAQWSTERAVFQDVLSELLLVLGSFSFEDFFPSSAL 6773

6774 ARWGDALAGVERRRRRIFRQVDGFLDSVIDKHLEPERLSAGVQEDMVDALVRMWREQQDR 6953

6954 PSGVLTREHIKAILM 6998

8541 NTFAGGIDTTAITAIWIMSELMRNPRVMQKAQAEVRNTVKNKPLVDEEDIQNLKYLE 8711

8712 MIIKENFRLHPPGTLLVPRQTMQPCLIGGYNVPSGTRVFINIWAMGRDPMIWDNPEEFYP 8891

8892 ERFEDRNIDFRGSHFELVPFGSGRRICPGIAMAVASLELVVANLLYCFDWKLPKGM*EED 9071

9072 IDMEEIGQLSFHRKVELFIVPVKHEQCEP*DQLMGH 9179

 

>AP004346.1a $F CYP71Q1 two genes and a pseudogene

71B like 47% to AC092559.2 75% to AP004346.1b

22020 MADDFLSSQPQPW 22058

22059 PPLLQLSAAVLFFLLPLLYLLFLRGSNGEVRGRQGNSASAPSLPGPCRQLPVLGNLLQIG 22238

22239 SRPHRYFQAVSRRYGPVVQVQLGGVRTVVVHSPEAAEDVLRTNDVHCCSRPPSPG 22405 (2)

26427 SYNYLDVAFAPYSDYWREMRKLFVVELTSVSRVRSFAYARAAEVARLVDTL 26577

26578 AASPPGVPVDLSCALYQLLDGIIGTVAFGKGYGAAQWSTERAVFQDVLSELLLVLG 26745

26746 SFSFEDFFPSSALARWADALAGVERRRRRIFRQVDGFLDSVIDKHLEPERLSAGVQED 26919

26920 MVDALVKMWREQQDRPSGVLTREHIKAILM 27009 (0)

28586 NTFAGGIDTTAITAIWIMSEIMRNPRVMQKARAEVRNTVKNKPLVDEEDSQNLKYLEMIIKEN 28774

28775 FRLHPPGNLLVPRQTMQPCLIGGYNVPSGTRVFINIWAMGRGPMIWDNPEEFYPERFE 28948

28949 DRNMDFRGSNFELVPFGSGRRICPGVAMAVTSLELVVANLLYCFDWKLPKGMKEEDIDM 29125

29126 EEIGQISFISFRRKVELFIVPVKHEQYQLMGHIN* 29221

 

#127

>aaaa01043764.1 CYP71Q1 (indica cultivar-group) orth AP004346.1a $F 97%

see aaaa01003512.1 for ortholog

905 LSAAVLFFFLLPFLYLLFLRGSNGEVRGRQGNSASAPSPPGPCRQLPVLGNLLQIGSRPH 726

725 RYFQAVSRRYGPVVQVQLGGVRTVVVHSPEAAEDVLRTNDVHCCSRP 585

 

#121

>aaaa01017559.1 CYP71Q2 (indica cultivar-group) orth AP004346.1b $F 98%

see aaaa01003239.1b below for ortholog

3049 DCCLHPVCTRFFSPYSAYWREMRKLLVIELTSIRRVQSFAYARAAEVAR 3195

3196 LVDTLAASLAGVPVDLSSALYTFSDGVIGTVAFGKVYGSAAWSSSEWGGSFQEAMDETM 3372

3373 QVLGSFSFEDFFPSSALARWADALTGAAGQRRRVFHRIDGFFDAVIDKHLEPERLSAGV 3549

3550 QEDMVDATVKVWREQKDEAFGLTCDHIKAIL 3642

 

#121

>aaaa01025743.1 CYP71Q2 (indica cultivar-group) orth AP004346.1b $F 99%

see aaaa01003239.1b for ortholog

1225 FLLLPLVYLLFFKGDGNGGVMDSASAPSPPGPPRQLPVLGNLLQIGSRPHRYFQAVARRY 1404

1405 GPVVQVQLGSIRTVVVHSPEAAKDVLRTNDLQCCSRPSS 1521

 

#121

>aaaa01003239.1b i CYP71Q2 (indica cultivar-group) exon 2 ortholog to AP004346.1b 95%

17640 LQDAFVGGIDTTAVTTTWIMSELMRNPRVVQKA*AEVHNIVKNKSKVCKEDIQNMKYLKM 17461

17460 IIKENFRLHPPGTLLIPRQTMKTCTIGGYSVPSETRIYVNVWAMGRDPNICDNPEQFYPE 17281

17280 RFEDKGIDFRGSHFELLPFGSGQRICPGIAMGVANVELVVANLLYCFDWQLPKGMKEEDI 17101

17100 DMDEIGQLAFRK 17065

 

>AP004346.1b $F CYP71Q2 two genes and a pseudogene 48% to AC092559.2 75% to AP004346.1a

69192 MATELLASQLLPWQPLVQLLAAGLFLLPLVYLLFFKGDGNGG 69317

69318 VMDSASAPSPPGPPRQLPVLGNLLQIGSRPHRYFQAVARRYVPVVQVQLGSIRTVVVHS 69494

69495 PEAAKDVLRTNDLQCCSRPSSPG 69565 (2)

72047 NYNYLDVAVSPYS 72083 (frameshift)

72085 YWREMRKLLVIELTSIRRVQSFAYARAAEVARLVDTLAASPAGVPVDLSSALYTF 72249

72250 SDGVIGTVAFGKVYGSAAWSSWEWGASFQEAMDETMQVLGSFSFEDFFPSSALARWADALTGA 72438

72421 AGRRRRVFHRIDGFFDAVIDKHLEPERLSAGVQEDMVDAMVMVWREQKDEAFGLTRDHIKAILL 72630 (0)

84351 DAFVGGIDTTAVTVTWIMSELMRNPRVMQKAQAEVHNIVKNKSKVCEEDIQNMKYLKM 84524

84525 IIKENFRLHPPGTLLIPRQTMKTCTIGGYSVPSETRIYVNVWAMGRDPNIWDNPEQFY 84698

84699 PERFEDKGIDFRGSHFELLPFGSGRRICPGIAMGVANVELVVANLLYCFNWQLPKGMKEE 84878

84879 DIDMDEIGQLAFRKNFLF* 84935

 

note there is another gene on AP004346.1

 

#120

>aaaa01003239.1a $PI CYP71Q3P (indica cultivar-group) exon 2 ortholog to AP004346.1c 96%

2517 DAFAGGIDTTVVTTTWIMSELMRNPTVMQ 2431 frameshift

2432 KAQAEVHNIVKNKSKVYEENIQNMKYLKMIIKENCKLHPPGTLLIPRHTMKTCTIGGYN 2253

2252 VPSKTRIYVNVWAMWRDPNIWDNPEQFYLERFEDKGIDFRGSHFELLT 2109 (?)

1820 FGSGQRICPGIAMGVANVELVVANLLYCFDWQLPKGTKEEDIDMDEIG*LAFRK 1659

 

>AP004346.1c $P CYP71Q3P two genes and a pseudogene

probable pseudogene 89% to AP004346.1b

92865 DAFAGGIDTTVVTTTWIMSELMRNPRVMQK 92954 (frameshift)

92956 AQAEVHNIVKNKSKVYEENIQNMKYLKMIIKENCRLHPPGTLLIPRHTMKTCTIGGYSV 93132

93133 PSKRRIYVNVWAMWRDPNIWDNLEQFYLERFEDKGIDFRGSHFELLT 93273 (insertion)

93561 FGSGQRICPGIAMGVANVELVVANLLYCFDWQLPKGMKEEDTDMDEIG*LAFRKKLPLFI 93740

93741 VPMKH* 93758

 

#84

>aaaa01002200.1 $PI CYP71Q4P (indica cultivar-group) ortholog of AC087599.11 $P 94% PERF region resembles CYP71Q sequences

2930 PAPILVPRQSTATAVVQG*EILAKTSLFINVWAMGRYPAAWNSPEEFWPERFLASREAMD 3109

3110 FQGNNYQLILFITDRRICPDINFAVPVLETALVGLLHPTNELLGGGGGLMWLQRSCSRAR 3289

3290 RLRSTGHRRHRSGTHPAAAVAAAAT 3364

 

>AC087599.11 $P CYP71Q4P chromosome 10 clone OSJNBa0057L21, pseudogene fragment like 71A1 44% to AAAA01006105.1b

16812 GGGGRWTETLEWIMAELTANTRVMAKLQDEISRAADGK 16925 24 aa deletion and frameshift

16931 PAPILVPRQSTATAVVQG*EILAKTSLFINVWAMGRYPAAWNSSEEFWPEQFLASREAVD 17110

17111 FQGNNYQLILFITDRRIFPDINFAVPVLETALVGLLHPTNELLGGG 17248

17249 GGLMWLQRSCSRARRPRSTAHRRHRSGTHPAAIAAAAAT* 17368

 

#145

>aaaa01004200.1 CYP71R1 (indica cultivar-group)

16322 MAAVQLDFGLLVGFLFLATCLAVAIRSYLRSGGAAIPSPPALPVIGNLHQLGR 16480

16481 GRHHRALRELARRHGPLFQLRLGSVRALVVSSAPMAEAVLRHQDHVFCGRPQQRTARGTL 16660

16661 YGCRDVAFSPYGERWRRLRRVAVVRLLSARRVDSFRALREEEVASFVNRIRAASGGGV 16834

16835 VNLTELIVGLTHAVVSRAAFGKKLGGVDPAKVRETIGELADLLETIAVSDMFPRLRWVDW 17014

17015 ATGLDARTKRTAAKLDEVLEMALRDHEQSRGDDDDGGGGDGEPRDLMDDLLSMANDGGGD 17194

17195 HGHKLDRIDVKGLILV (1)

      NMFIA (frameshift)

      GTDTIYKSIEWT 17374

17375 MAELIKNPAEMAKVQAEVRHVAAAAHGDEDEDTVAVVREQQLGKMTLLRAA 17527

17528 MKEAMRLHPPVPLLIPREAIEDTVLHGHRVAAGTRVMINAWAIGRDEAAWEGAAEFRPGR 17707

17708 FAGGGDAAGVEYYGGGDFRFVPFGAGRRGCPGVAFGTRLAELAVANMACWFEWELPDGQ 17884

17885 DVESFEVVESS 17917

 

>AK062293                2395 bp    mRNA    linear   PLN 24-JUL-2003

DEFINITION  Oryza sativa (japonica cultivar-group) cDNA clone:001-100-F01, full

            insert sequence.

  56 MAAVQLDSGLLVGFLFLATCLAVAIRSYLRSGGAAIPSPPALPVIGNLHQLGRGRHHRAL 235

 236 RELARRHGPLFQLRLGSVRALVVSSAPMAEAELRHQDHVFCGRPQQRTARGTLYGCRDVA 415

 416 FSPYGERWRRLRRVAVVRLLSARRVDSFRALREEEVASFVNRIRAASGGGVVNLTELIVG 595

 596 LTHAVVSRAAFGKKLGGVDPAKVRETIGELADLLETIAVSDMFPRLRWVDWATGLDARTK 775

 776 RTAAKLDEVLEMALRDHEQSRGDDDDGGGGDGEPRDLMDDLLSMANDGGGDRGHKLDRID 955

 956 VKGLILDMFIAGTDTIYKSIEWTMAELIKNPAEMAKVQAEVRHVAAAAH 1102

1103 GDEDEDTVAVVREEQLGKMTLLRAAMKEAMRLHPPVPLLIPREAIEDTVLHGHRVAAGTR 1282

1283 VMINAWAIGRDEAAWEGAAEFRPGRFAGGGAAAGVEYYGGGDFRFVPFGAGRRGCPGVAF 1462

1463 GTRLAELAVANMACWFEWELPDGQDVESFEVVESS 1567

 

#214

>aaaa01007242.1 $PI CYP71R2P (indica cultivar-group) ortholog of AP003575.1 99%

11280 MAAVQLDSGLLIGFLFLATCLVVAVRSYLRSGGADGGRGGAAITSPPALPVIGNLHQLGR 11459

11460 GRHHRALRELARRHGPLFQLRLGSVRALVVSSASMAEAVLRHQDHVFCGRPQQHTARGTL 11639

11640 YGCRDVAFSPYGERWRRLRRVAVVHLLSARRVDSFRALREEEVASFVNRIRAASGGGGGV 11819

11820 VNLTELIVGLTHAVVSRAAFGKKLGGVEPAKVRETVGELADLLGTIAVSDMFPRLRWVDW 11999

12000 ATGLDARTKRTAAKLDEVLEMVLRDHEQSRGDDDDDDGDGEARDLMDDLLSMANGGDDHG 12179

12180 YKLDRIDVKGLLILV (0)

      DMFAAGTDTVYKSIE frameshift

      MAEL 12359

12360 IKNPAEMAKVQAEVRHVVAAAHGGEGDEDAVVIVKEEQASS frameshift

12482 LGKMTLLRAAMKEAMRLHPPLPLLIPREAIQDTVLHGHRVAAGTRVMINAWAIGRDEAAW 12661

12662 EDAGEFRPGRFADGGDNAGVEYYGGGGDFRFMPFGAGRRGCPGMAFATRLAELVVANMAC 12841

12842 WFEWELPDGQDVQSFEPSRSSSREVCRLGSSSGKIGNHVYAVETT 12976

 

>AP003575.1 $P CYP71R2P chromosome 6 clone P0528B02, similar to 71A24 one in frame stop codon 39% to 71A14

54816 MAAVQLDSGLLIGFLFLATCLVVAVRSYLRSGGADGGRGGAA

54690 ITSPPALPVIGNLHQLGRGRHHRALRELARRHGPLFQLRLGSVRALVVSSASMAEAVLRH 54511

54510 QDHVFCGRPQQHTARGTLYGCRDVAFSPYGERWRRLRRVAVVHLLSARRVDSFRALREEE 54331

54330 VASFVNRIRAASGGGGGVVNLTELIVGLTHAVVSRAAFGKKLGGVEPAKVRETVGEL 54160

54159 ADLLGTIAVSDMFPRLRWVDWATGLDARTKRTAAKLDEVLEMVLRDHEPSRGDDDDDDGD 53980

53979 GEARDLMDDLLSMANGGDDHGYKLDRIDVKGLLIL 53875 (0)

      DMFAAGTDTVYKSIE*TMAELIKNPAEMAKVQAEVRHVVAAAHGGEGDEDAVVIVKEEQAS 53616 (fs)

53613 LGKMTLLRAAMKEAMRLHPPLPLLIPREAIQDTVLHGHRVAAGTRVMINAWAIGRDEAAW 53434

53433 EDAGEFRPGRFADGGDNAGVEYYGGGGDFRFMPFGAGRRGCPGMAFATRLAELVVANMAC 53254

53253 WFEWELPDGQDVQSFEPSRSSSREVCRLGSSSGKIGNHVYVVQTTRM* 53110

 

#448

>aaaa01059584.1 CYP71R3 (indica cultivar-group) 59% to CYP71R1

596 VAAKTRVIINTWAIGRDSIIRENAEEFLPERFIDNGIDYNSKDFSFIPFGAGRRGCPGIAFATRLA 793

794 ELALANLMYHFDWELQEGQDLESFQLVSPSVIQTWGSS 907

 

no japonica ortholog found 12/23/03 in nr, est, htgs

 

#362

>aaaa01016223.1 CYP71S1 (indica cultivar-group) orth AL606614.1b $F chr 4 96%

1585 PRPRGLPLIGNLHQVGALPHRSLAALAARHATPLMLLHLGSVPTLVVSTADAARALFRDN 1764

1765 DRALSGRPALYAATRLSYGQKNISFAPDGAYWRAARRACMSALLGAPRVRELRDAREREA 1944

1945 AALIAAVAAAGASPVNLSDMVAATSSRIVRRVALGDGDGDESMDVKAVLDETQA 2106

2107 LLGGLWVADYVPWLRWVDTLSGMRRRLELRFHQLDALYERVIDDHLNNRKHASDEE 2274

2275 DDLVDVLLRLHGDPAHRSTFGSRSHIKGIL 2364

2699 DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVVAAGDKVREADLPELHY 2872

2873 LRLVIKETLRLHPAAPLLVPREMTEPFRTAHGVEIPARTRVVVNAMAIHTDPGVWGPDAE 3052

3053 RFVPERHRDDADGCAQQHDGFALVPFGIGRRRCPGVHFAAAAVELLLANLLFCFDWR 3223

3224 APPGREVDVEEENGLVVHKKNPLVLI 3301

 

>AL606614.1b $F CYP71S1 chromosome 4 clone OSJNBb0011N17 40% to 71A25

23233 MSMASLQAPEFLASCLLLATILFFKQLLAPSSKQRAASPSLPRPRGLPLIGNLHQVGALPHRSLAALAAR 23096

23095 HAAPLMLLRLGSVPTLVVSTADAARALFRDNDRALSGRPALYAATRLSYGQKSISFAPD 22919

22918 GAYWRAARRACMSELLGPPRVRGLRDAREREAAALVAAVAAAGASPVNLSDMVAATSSR 22742

22741 IVRRVAFGDGDGDESMDVKAVLNETQALLGGLWVADYVPWLRWVDTLSGKRWRLERRFRQ 22562

22561 LDALYERVIDDHLNKRKHASDEEDDLVDVLLRLHGDPAHRSTFGSRSHIKGILT 22400 (0)

22059 DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVVAAGDKVREADLPELHYLR 21889

21888 LVIKETLRLHPAAPLLVPRETTEPFRTAHGVEIPARTRVVVNAMAIHTDPGVWGPDAERF 21709

21708 VPERHRDDADGCAQQHDGFALVPFGIGRRRCPGVHFAAAAVELLLANLLFCFDWRAP 21538

21537 PGREVDVEEENGLAVHKKNPLVLIATKSKRNTGGH* 21427

 

#362

>aaaa01040889.1 CYP71S1 (indica cultivar-group) orth AL606614.1b $F chr 4 100%

see aaaa01016223.1 for ortholog

822 DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVVAAGDKVREADLPELHY 986

987 LRLVIKETLRLHPAAPLLVPRETTEPFRTAHGVEIPARTRVVVNAMAIHTDPGVWGPDAE 1166

1167 RFVPERHRD 1193

 

#298

>aaaa01011971.1 CYP71S2 (indica cultivar-group) orth AL606614.1a $F chr 4 96%

987  DMFIAGSDTSAVTVQWAMTELVRNPDVLAR 1085

     AQHEVRRVVAAAGGGDKDGAMVREADLPELHYLRLVIKETLRLHPASPLVQR 1240

1241 ETTEPFRTAHGVEIPARTRVVINAMAIHTDPGVWGPDAERFVPERHRAHDADGGQQHDGF 1420

1421 ALVPFGIGRRSC 1456

1450 ELLLANLLFCFDWSAPPGREVDVEEENGLAVRKKNPLVLI 1569

 

>AL606614.1a $F CYP71S2 chromosome 4 clone OSJNBb0011N17 90% to AL606614.1b

19270 MASLQAPEFLASCLLLLATILLFKQLLAPSSKKRAASPSLPRPKGLPLIGNLHQVGALPHRSLAAL 19073

19072 AARHAAPLMLLRLGSVPTLVVSTADAARALFRNNDRALSGRPALYAATRLSYGQKNISF 18896

18895 APDGAYWRAARRACMSALLGAPRVCELRDAREREAAALIAAVAAAGASPVNLSDMVAAT 18719

18718 SSRIVRRVAFGDGDGDESMDVKAVLDETQSLLGGLWVADYVPWLRWVDTLSGMRRRLERR 18539

18538 FRQLDAFYERVIDDHINKRKHASDEEDDLVDVLLRLHGDPAHRSMFGSRTHIKGILT (0) 18368

17337 DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVIAGGGGGDKDGAMVREADL 17149

17148 PELHYLRLVIKETLRLHPASPLVQRETTEPFRTAHGVEIPARTRVVINAMAIHTDPGVWG 16969

16968 PNAERFLPERHRAHDADGEQQHEHDGFALVPFGIGRRSCPGVHFAAAAAELLLANLLFCF 16789

16788 DWRALPGREVDVEEENGLAVRKKNPLVLIATKSKSNRDAH* 16666

 

#24

>aaaa01000559.1 $FI CYP71T1 (indica cultivar-group) 98% to AP003434.1a

22050 MELSSSLAAVLHSPLFLLAALLLLPVFTLLSFSSAKKPGDGGGRRLPLPPSPRGVPFLGH 21871

21870 LPLLGSLPHRKLRSMAEAHGPVMLLWFGRVPTVVASSAAAAQEAMRARDAAFASRARVSM 21691

21690 AERLIYGRDMVFAPYGEFWRQARRVSVLHLLSPRRIASFRGVREQEVAALLDRVRRRCGV 21511

21510 RGGGETVNLSDMLMSYANGVISRAAFGDGAYGLDGDEGGGKLRELFANFEALLGTATVGE 21331

21330 FVPWLAWVDKLMGLDAKAARISAELDGLLERVIADHRERRRLSQPDGGDGDGDGDENVDH 21151

21150 RDFVDVLLDVSEVEEGAGAGEVLLFDAVAIKAIIL 21046

20444 DMIAAATDTTFTTLEWAMAELINHPPVMRKLQCEIRAAVGVPGASGGAEVTEDHLGELRL 20265

20264 LRAVVKETLRLHAPVPLLVPRETVEDTELLGYRVPARTRVIINVWAIGRPXAAWGDRAEE 20085

20084 FVPERWLDGGGGGEAVEYAAQLGQDFRFVPFGAGRRGCPGAGFAAPSIELALTNLL 19917

      YHFDWELPPHADGAAAATAARLDMGELFGLSMRMKTTLNLVAKPWSSDV 19769

 

>AP003434.1a $F CYP71T1 chromosome 1, PAC clone:P0452F10, complete 41% to 71A24 = C98812

C98812 52% identical to D48413 43% to 71A13, 44% to 71B10

34853 MELSSSLAAVLHSPLFLLAAL

34916 LLLPVFTLLSFSSAKKPGDGGGWRLPLPPSPRGVPFLGHLPLLGSLPHRKLRSMAEAHGP 35095

35096 VMLLWFGRVPTVVASSAASAQEAMRARDAAFASRARVRMAERLIYGRDMVFAPYGEFWR 35272

35273 QARRVSVLHLLSPRRIASFRGVREQEVAALLDRVRRRCGVRGGGETVNLSDLLMSYANGV 35452

35453 ISRAAFGDGAYGLDGDEGGEKLRELFANFEALLGTATVGEFVPWLAWVDKLMGLDAKAA 35629

35630 RISAELDGLLERVIADHRERRRLSQPDGGDGDGDGDENVDHRDFVDVLLDVSEVEEGAG

      AGEVLLFDTVAIKAIIL (0)

36458 DMIAAATDTTFTTLEWAMAELINHPPVMRKLQCEIRAAVGVPGASGGAEVTEDHLGELR 36634

36635 LLRAVVKETLRLHAPVPLLVPRETVEDTELLGYRVPARTRVIINVWAIGRDAAAWGDRAE 36814

36815 EFVPERWLDGGGEEVEYAAQLGQDFRFVPFGAGRRGCPGAGFAAPSIELALTNLLYHFDW 36994

36995 ELPPHADGAAAATAARLDMGELFGLSMRMKTTLNLVAKPWSSDV* 37129

 

#76

>aaaa01002066.1 $PI CYP71T2 (indica cultivar-group) ortholog of AP003434.1b $F 99%

1728 MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLPLPPSPPGVPLLGH 1549

1548 LPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRTRDLAFASRPRVRM 1369

1368 SERLFYGRDM 1339 frameshift and deletion

     DFVDVMLDVSEAEEGAGAGAGGVLLDTVAIKAVIL 1233

 

>AP003434.1b $F CYP71T2 chromosome 1, PAC clone:P0452F10, complete = AA754300

AA754300      42% IDENTICAL TO 71A14   1/98 I-HELIX 43% to 703A2

39698 MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLP

39839 LPPSPPGVPLLGHLPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRT 40018

40019 RDLAFASRPRVRMSERLFYGRDM AFAPYGEFWRQARRVTVLHLLSPRRVLSFRGVREQE 40195

40196 VAALLDRVRRRCGGGGETVNLSDLLMSYAHGVISRAAFGHGGAHGFDGDEGGEKLRKLFA 40375

40376 DFEGLLGTMTVGEFVPWLAWVDKLTGLDAKVARTSAAMDGLLERVIADHRERRRSRGQA 40552

40553 VGDGEADADHR DFVDVMLDVSEAEEGAGAGAGGVLFDTVAIKAVIL (0)

42074 DMMAAGTDSSFTTTEWVMAELINHPRVMRKL 42169

42170 QDEIRAVVGTSSASAAAAATGGGQVTEDHLGELPFLRAVIKEMLRLHAPGPLLLPRETVE 42349

42350 DTELLGYRIPARTRVIINVWAIGRDAAAWGDSAEEFVPERWLDGGGGGGVEYAQQLGKDS 42529

42530 RFVPFGAGRRGCPGAGFAALSVELALANLLYHFDWELPPPAASGIMATTRLDMDELFGLS 42709

42710 VRLKADLNLVAKPWSPGAS* 42769

 

note there are 4 sequences on AP003434.1

 

#206

>aaaa01006724.1 CYP71T3 (indica cultivar-group) orth AP003434.1c $F chr 1 99% also AU163704.1

7723 RRRLPPSPPWGLPLLGHLHLLGALPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEE 7544

7543 VMRTRDLEFASRPRVAMAERLLYGGRDVAFAPYGEYWRQTRRICVVHLLSARRVLSFRRV 7364

7363 REEEAAALVARVRAAGGAVDLVEHLTAYSNTVVSRAVFGDESARGLYGDVDRGRVLR 7193

7192 KLFDDFVELLGQEPMGELLPWLGWVDALNGMEVKVQRTFEALDGILEKVIDDHRRRRR 7019

7018 EVGRQMDDGGGGDHRDFVDVLLDVNETDMDAGVQLGTIEIKAII 6887

5268 DMFAAGTDTTTTVIEWAMAELITHPDAMRNAQDEIKAVVGITSHITEDHLDRLPYLK 5095

5094 AVLKETLRLHPPLPLLVPHEPSSDTKILGYSIPARTRIVINAWTIGRDQATWGEHAEEFI 4915

4914 PERFLESGLDYIGQDFVLVPFGAGRRGCPGVGFAVQAMEMALASLLYNFDWETRVVDRRS 4735

4734 EFGTSSLDMSEMNGLSVHLKYGLPLIAI 4651

 

>AP003434.1c $F CYP71T3 AU163704.1 chromosome 1, PAC clone:P0452F10, complete 44% 71A14

48011 MAVSLVVVVVV

48044 VIAIVVPLLYLVLLPAWKPARRDDGDGGMRRRLPPSPPWGLPLLGHLHLLGALPHRALRS 48223

48224 LAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRTRDLEFASRPRVAMAERLLYGGRDVAFA 48403

48404 PYGEYWRQTRRICVVHLLSARRVLSFRRVREEEAAALVARVRAAGGAVDLVEHLTAY 48574

48575 SNTVVSRAVFGDESARGLYGDVDRGRVLRKLFDDFVELLGQEPMGELLPWLGWVDALN 48748

48749 GMEVKVQRTFEALDGILEKVIDDHRRRRREVGRQMDDGGGGDHRDFVDVLLDVNETDMD 48925

48926 AGVQLGTIEIKAIIL 48970 (0)

51142 DMFAAGTDTTTTVIEWAMAELITHPDAMRNAQDEIKAVVGITSHITEDHLDRLPYLK 51312

51313 AVLKETLRLHPPLPLLVPHEPSSDTKILGYSIPACTRIVINAWTIGRDQATWGEHAEEFI 51492

51493 PERFLESGLDYIGQDFVLVPFGAGRRGCPGVGFAVQAMEMALASLLYNFDWETRVVDRRS 51672

51673 EFGTSSLDMSEMNGLSVRLKYGLPLIAISRFP* 51771

 

#204

>aaaa01006398.1c $FI CYP71T4 (indica cultivar-group) AP003434.1d 95%

probably orthologs since 6398 and 3434 have only 3 nuc diffs and two

1 nuc indels in the intron.

9740  MAVSLLVVLLVVLAIVVPLLYLVLLPAGNTTRNGAARWEDDGGDGRRRRRLPPSPRGLPL 9919

9920  LGHLHLLGALPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDVEFASRP 10096

10097 RMAMAELLLYGGRDVAFAPYGEYWRQAPRICVVHLLSARRILSFRRVREEEAAALVGRV 10273

10274 RAAAADVVDLSDLLIAYSNTVLTRIAF GDESARGGGGGDRGRELRKVFDDFARL 10435

10436 LGTEPMGELLPWFWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRLMDDDGGG 10615

10616 DHRDFVDVLLDVNETDKDAGIQLGTVEIKAIIM (0) 10711

11174 DMFVGGSDTTTTMIAWTMAELINHPRAMHKAQNEIRAVVGNTSHVTKDHVDKLPYLKAVF 11353

11354 KETLRLHPPLPLLIPREPLADAQILGYTIPAHTRVVINAWAIGRDPAAWGQQPDEFSPEK 11533

11534 FLNGAIDYKGQDFELLPFGAGRRGCPGIVFGVSAMEIALASLLYHFDWE 11680

11681 AAATDHRRRGSQAWALPVDMSEVNGIAVHLKYGLHVVAKPRMP* 11812

 

>AP003434.1d $F CYP71T4 chromosome 1, PAC clone:P0452F10, complete like 71A

58119 MAVSLLPAVL

58149 VLLAIVAPLLYLVLLPAVKYTTRNGAARWEDDGGDGRRRRRLPPSPRGLPLLGHLHLLGA 58328

58329 LPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDMEFASRPRMAMAELLLY 58508

58509 GGRDVAFAPYGEYWRQARRICVVHLLSARRVLSFRRVREEEAAALVGRVRAAAADVVVD 58685

58686 LSDLLIAYSNTVLTRIAFGDESARGGGGGGDRGRELRKVFDDFARLLGTEPMGELLPW 58859

58860 FWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRRMDDDGGGDHRDFVDVLLD 59036

59037 VNETDKDAGIQLGTVEIKAIIM 59102 (0)

59562 DMFVGGSDTTTTMMAWTMAELINHPRAMRKAQNEIWAVVGNTSHVTKDHVDK 59717

59718 LPYLKAVFKETLRLHPPLPLLIPREPPADTQILGYTIPAHTRVVINAWAIGRDAAAWGQQ 59897

59898 PDEFSPEKFLNSTIDYKGQDFELLPFGAGRRGCPGIVFGVSAMEIALASLLYHFDWEAAA 60077

60078 TDHRRRGSQAWALPVDMSEVNGIAVHLKYGLHVVAKPRMP* 60200

 

#205 = #203 = #445 reduce gene count by 2

>aaaa01006398.1d $PI CYP71T5 (indica cultivar-group) seq gap before 12414

first exon has two frameshifts and part is missing (1205) no ortholog

      GDESARG (fs) RALRKLFENFARLLGTEPMGELLPWLGWVDAV (fs)

      WLDGKVQRTFEALDSIIEKVIDDHRRRRRRREVGRQMDSDDDGGGGG

      DHRDFVDVLLDVNETDKDAGIRLGTIEIKAIIL (0)

12924 DMFAAGTDTTTTAMEWAMAELITHRDAMHKVQDEIRAVVGVTGCVTEDHIDRLPYLKAVL 13103

13104 KETLRLHPPNPLLVPHVPLADTEILGYTVPTHTRVLINAWTIGRDPATWGEHAEKFIPER 13283

13284 FLNNNVDYKGQDFGLVPFGAGRRGCPGMGFAVPTIEMALASLLYNFSWETRPVDRRCKSG 13463

13464 TSSLDMSEMNGISVRLKYGLPLIAKSHFP* 13553

 

#203 = #205 = #445 reduce gene count by 2

>aaaa01006398.1b $FI CYP71T5 (indica cultivar-group) no ortholog

4560 MAVSLLPAVLVLLAIVAPLLYLVLLPAVKYTTSNGAARWEDDDGGDGRRRRRLPPSPRGLPLLGHLHLLGAL 4775

4776 PHRALRSLAAAHGPVLLLRLGRVPAVVVSSAAAAEEVMRARDVAFASRPRVAMAERLLYG 4955

4956 GRDVAFAPYGEYWRHARRICVVHLLSARRVLSFRRVREEEAAALVARVRAAARAPGAR 5129

5130 GAVDLVEHLTAYSNTVVSRAVFGDESARGLYGDVDRGRALRKLFDDFVELLGQEPMGELL 5309

5310 PWLGWVDAVRGLDGKVQRTFEALDSIIEKVIDDHRRRRRRHEVGRQMDSDDDGG 5471

5472 GGGDHRDFVDVLLDVNETDKDAGIRLGTIEIKAIIL (0)

     DMFAAGTDTTTTAMEWAMAELITHRDAMHKVQDEIRAVVGVTG 5831

5832 CVTEDHIDRLPYLKAVLKETLRLHPPNPLLVPHVPLADTEILGYTVPTHTRVLINAWTIG 6011

6012 RDPVTWGEHAEKFIPERFLNNNVDYKGQDFGLVPFGAGRRGCPGMGFAVPTIEMALASLL 6191

6192 YNFSWETRPVDRRCKSGTSSLDMSEVNGISVHLKYGLPLMAKFYSS* 6332

 

aaaa01006398.1b no ortholog found in japonica 9/7/02

 

#445 = #203 = #205 reduce gene count by 2

>aaaa01054542.1 CYP71T5 (indica cultivar-group) 76% to AP003434.1c $F

96% to aaaa01006398.1d $PI >99% over 970 bp even outside the coding region

581 DMFAAGTDTTTTAMEWAMAELITHRNAMHKVQDEIRAVVGVTGCVTEDH 408

407 IDRLPYLKAVLKETLRLHPPNPLLVPHVPLADTEILGYTVPTHTRVLINAWTIGRDPATW 228

227 GEHAEKFIPERFLNNYVDYKGQDYGLVPFGAGRRGCPGMGFAVPTIEMALASLLYISAWE 48

47  TRPVDR 30

 

no japonica ortholog found 9/12/02
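
Note on the "reduce gene count" remarks: when several entry numbers are judged to be the same gene (as with #205 = #203 = #445 above), a group of k numbers collapses to one gene, so the count drops by k - 1. A minimal Python sketch of that arithmetic follows. The function name `count_reduction` is hypothetical and only the merged group listed above is used as input.

```python
def count_reduction(groups):
    """Each group of k entry numbers that are one gene removes k - 1 from the count."""
    return sum(len(set(group)) - 1 for group in groups)

# Merged group taken from the notes above: #205 = #203 = #445
merged = [[205, 203, 445]]
print(count_reduction(merged))  # 2
```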

 

#202

>aaaa01006398.1a CYP71T6 (indica cultivar-group) (partial)

1854 MVVVVVVVAIAIVVPLLYLVLLPPARRGGGDSARRRLPPSPRGLPLLGHLHLLGALP 2024

2025 HRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRARDVAFASRPRVAMAERLLY 2198

2199 GGRDVAFAPYGEYWR 2243 sequence gap here

 

aaaa01006398.1a no ortholog found in japonica 12/23/03 nr, est, htgs

 

#89

>aaaa01002288.1 $FI CYP71T7 (indica cultivar-group)

13560 MDISLASLVLVLLAFVLPLLYLLLQLPGKKSGGGGGDGPRLPPSPAGCLPLLGHLHL 13390

13389 LGPLPHVALRSMAAAHGPVLRLRLGRVPTVVVSSAAAAEEVLRARDAAFSSRPRSAMAER 13210

13209 ILYGRDIAFAPYGEYWRQARRVCVVHLLSAQRVSSFRRVREEEAAALADAVRAAGRGGG 13033

13032 RAFDLSGLIVAYASAVVSRAAFGDESARGMYGGADGGRAVRKAFSDFSHLFGTKPVSDYL 12853

12852 PWLGWVDTLRGRERKARRTFEALDGVLDKVIDDHRRRRDSGRRQTGDADAGHRDFVDVL 12676

12675 LDVNEMDNEAGIHLDAIEIKAIIM 12604

12529 DMFVAGSDATSKPMEWAMAELVSHPRHMRRLQDEIRAVVGGGRVTEDHVDKLPYLRAAL 12353

12352 KEALRLHAPLPLLVARETVADTEIMGYHVAARTRVVINGWAIGRDTAVWGETAEEFMPER 12173

12172 FLAGGNGGGAAAADYKVQGFEMLPFGGGRRGCPGVTFGMATVE 12044

12041 SAVASLLYHFDWEAAAADGKGGREGTPLLDMSETSGISMGLKHGLPLVAKPRFP 11880

 

aaaa01002288.1 $FI has no ortholog in nr or HTGS on 9/5/02

 

#33

>aaaa01000893.1 CYP71T8 (indica cultivar-group) Nterm 49% to AP003434.1

33765 MSSYVVVAAALLVFVVVVVAAIKNLGKGKLPPSPPSLPFVGHLHLVGELPH 33917

33918 RSLDALHRRYGSDGGLMFLRLGRAGALVVSTAAAAADLYRGHDLAFASRPPSHSAERLFY 34097

34098 GGRNMSFAPLGDAWRRTKKLAVAHLLSPRRARPRRRGR 34211

 

>AK120767 CYP71T8      2904 bp    mRNA    linear   PLN 29-OCT-2003

DEFINITION (japonica cultivar-group) cDNA clone:J023007M03, full insert sequence.

AP005682.1 (japonica cultivar-group) chromosome 9 clone

C-term = aaaa01035499.1 = CYP71Z11

869 MSSYVVVAAALLVFVVVVVAAIKNLGKGKLPPSPPSLPFVGHLHLVGELPHRSLDALHRR 690

689 YGSDGGLMFLRLGRAGALVVSTAAAAADLYRGHDLAFASRPPSHSAERLFYGGRNMSFAP 510

509 LGDAWRRTKKLAVAHLLSPRRAR 441 (fs)

439 RAQRGAGAVQPRELRTPNKK

GVITRVAAGGSGATAERFRKMMADTSELLAGFQWVDRLPEAAGWAARKLTGLNK

KLDDMADESDRFLGEILAAHDDEKAEGEEEDFVDVLLRLRRQGAAAAGGLELAEDNVKAIIK (0)

DIMGAATDTSFVTLEWIMTELIRNTQVMSKLQNEIIQVT

GSKPTVTEEDLTKLDYLKAVIKEVLRLHPPAPLLIPHHSTMPTTIQGYHIPAKTIAFINV

WAIGRDPAAWDTPDEFRPERFMGSAVDFRGNDYKFIPFGAGRRLCPGIILALPGLEMVIA

SLLYHFDWELPDGMDVQDLDMAEAPGLTTPPMNPVWLIPRCRTI*

 

#437

>aaaa01042159.1 CYP71AK1 = old CYP71T9 (indica cultivar-group) 60% to AP003434.1b

58% to wheat AL821861 This seq may belong in another family or

subfamily like CYP703 = 71AK1

DIMGAATDTSFVTLEWIMTELIRNTQVMSKLQNEIIQVTGS 3

 

no japonica ortholog found 9/12/02

 

#275 = #377 reduce gene count by 1

>aaaa01010398.1 CYP71T10 (indica cultivar-group) not an exact match 71 like pseudogene fragment 75% to 71T5 in a small region

2295 LMYLVLLPDVNRSNRPERWEDSDGWQRLPP*PRRLPLLRYLHLLSVPLHQAFHPLPR 2465

2466 HMAWCCYSSSNACRWWLSSFAATSRPKSAMAEQLLYGCDVAFAPYGEY* 2612

2613 S*ECRICVLFFRCIREEEVAVLVKHVRHPCR 2705

 

no japonica ortholog found 9/10/02

 

#377 = #275 reduce gene count by 1

>aaaa01017833.1 CYP71T10P (indica cultivar-group) N-term pseudogene fragment

3692 LMYLVLLPDVNRSNRPERWEDGDGWQRLPP*PRRLPLLRYLHLLGAPLHQAFHPLPR 3862

3863 HMAWCYYSSSNACRWWLSSFAATSRPKSAMAEQLLYGCDVAFAPYGEY* 4009

4010 S*ECRICVLFFRCIREEEVAVLVKHVRHPCR 4102

 

No japonica ortholog found 9/11/02
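
Note on percent values: the "N% to ..." remarks in this file come from sequence comparisons whose exact method is not stated here. As a rough illustration only, the Python sketch below computes ungapped percent identity between two fragments of equal length, such as matching lines of the #275 = #377 pair above. The function name `percent_identity` is hypothetical, and real comparisons (for example BLAST alignments) handle gaps and partial overlaps that this sketch does not.

```python
def percent_identity(a, b):
    """Ungapped percent identity of two pre-aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return 100.0 * matches / len(a)

# Last lines of the two fragments above (identical in both contigs)
print(percent_identity("S*ECRICVLFFRCIREEEVAVLVKHVRHPCR",
                       "S*ECRICVLFFRCIREEEVAVLVKHVRHPCR"))
```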

 

#179

>aaaa01005635.1 $PI CYP71U1P (indica cultivar-group) one stop codon, one fs

62% to AAAA01000843.1

8796 MDELSAGSLYLVVLGTLALALAFKRVLRGKETGVKLPPGPWNLPIIGSLHHLVGAHLPHRALLRVSR 8596

8595 RQGPLMLLRLGEVPAVVVSSPEAAMEFLRTRDPVFASRPRGALRSTSSASAVK 8437

8436 GSSWRHTASTGGRCARSAWWSCSAQGRCSGWSLSGRRRGGVPPRRGHRHDIA 8281

8280 CYSQHRHDPDASGAQ*RHHREGGVRRQVPTAGLRYLRVLKVVATLAGSFN 8131

8130 MVDLFPSSRLVRWLSCVERRLREHHAQTVRIVDSIIQDRKENEASASPGASAEDDDNDDL 7951

7950 LDVLLRLQREDNLTFPITAEIIGALIS (0) 7852

7202 DIFGAATDTTGSTLEWAMAELMRNPRTMEKAKQEVQNALGQGRAMVTGADIGDLHYLQMV 7023

7022 IKETLR (fs) 7005

7000 LHPSIPLIVRASEESTLVMGYDIPQGTNIFINAFAVARDPRYWKDADEFMPERFEKN 6830

6829 GDDIKATTVHMGFIPFGAGR (deletion of 18 aa heme signature region in seq gap) 6770

6669 NLLYHFDWTLINGESPESLDMGEVWGISIHRRSDLRLHAALSVSSGFLRHSDRDS* 6496

 

aaaa01005635.1 no ortholog in japonica on 9/7/02

 

#31

>aaaa01000843.1 $FI CYP71U2 (indica cultivar-group) 63% to AAAA01005635.1 37% to 71B2

92% to AP004872.1 and AP005536.1 (00843 is best indica match for these seqs)

21357 MDELSIENHSPISMDELSFG

21297 SLCLVAMATLALALALMVVMGAHRRGGEKGATTGAKNLPPGPWNLPVTGSLHHLLGASP 21121

21120 PPHRALLRLSRRHGPLMLVRLGEVPTVIVSGSDAAMEGWVLKAHDPAFADRARSTTVDAV 20941

20940 SFGGKGIIFAPYGEHWRQARRVCLAELLSARQVRRLESIRQEEVSRLVGSIAGSSNA 20770

20769 AAVDMTRALAALTNDVIARAVFGGKCARQEEYLRELGVLTALVAGFSMADLFPSSRV 20599

20598 VRWLSRRTERRLRRSHAQMARIVGSIIEERKEKKASDDGVGAKDEDDDLLGVLLRLQEED 20419

20418 SLTSPLTAEVIGALVI (0) 20371

17791 DIFGAATDTTASTLEWVMVELMRNPRAMEKAQQEVRNTLGHEKGKLIGTDISELHYLRMV 17612

17611 IKETLRLHPSSALILRQS (fs) 17558

17558 QGNCRVMGYDIPQATPVLINTFAVARDAKYWDNAEEFKPERFENSGADIRTSTAHLGFVP 17379

17378 FGAGCRQCPGALFATTTLELILANLLYHFDWALPDGVSPESLDMSEVMGITLHRSSSLHL 17199

17198 HATLSRLGFVSHSGQ* 17151

 

aaaa01000843.1 $FI may not have an ortholog

 

#41

>AP004872.1 $F CYP71U3 (japonica cultivar-group) chr 2 = AP005536.1  92% to aaaa01000843.1 (best match in indica) low percent for an ortholog

98325 MDELSIENHSPISMDELSFGSLCMVAMATLALALALMVMGAHRRGGEKGATTGAKNLPP 98149

98148 GPWNLPVIGSLHHLLGASPPHRALLRLSRRHGPLMLVRLGEVPTVIVSGSDAAMEVL 97978

97977 KARDPAFADRARSTTVDAVSFGGKGVIFAPYGEHWRHARRVCLAELLSARQVRRLESIRQ 97798

97797 EEVSRLVDSIIAGSSNAAAVDMTRALAALTNDVIARAVFGGKCARQEEYRRELGVLTTLV 97618

97617 AGYSMVDLFPSSRVVRWLSRRTERRLRRSHAEMARIVGSIIEERKEKKGSDAGVGAKDED 97438

97437 DDLLGVLLRLQEEDGLTSPLTAEVIAALV 97351

94360 XDIFGAATDTTASTLEWIMVELMRNPRAMDKAQQEVRNTLGHEKGKLIGIDISELHYLCMV 94181

94180 IKETLRLHPASALILRQSRENCRVMGYDIPQATPVLINTFAVARDPKYWDNAEEFKPE 94007

94006 RFENSGADIRTSIAHLGFIPFGAGCRQCPGALLATTTLELTLANLLYHFDWALPDGVSPK 93827

93826 SLDMSEVMGITLHRRSSLHLHTTLTRSGFFSHSGR 93722

 

#486 incorrectly labeled as #200

>aaaa01025826.1 CYP71V1 (indica cultivar-group) orth AC096855.1 $F chr 3 98%

604 YFFFLQSLLLCIAAVALLQLAKVAATMRRRPRTPPGPWRLPVIGSMHHLVNALPHRALRD 783

784 LAGVHGPLMMLRLGETPVVVASSRGAARAVLKTHDANFATRPRLLAGEIVGYGWADILFS 963

964 PSGDYWRKLRQLCAAEILSPKRVLSFRHIREDEVTARVEEIRAAAAPSTPV 1116

1672 DMFAGGTGTSASALEWAMSELMRNPAVMKKLQGQIREAFHGKAVVMEADLQASNLRYL 1851

1852 KLVIKEALRLHPPAPLLVPRESIDTCELDGYTIPAKSRVIVNVWAIGRHPKYWDDAEEFK 2031

2032 PERFDDGAIDFMGGSYKFIPFGSGRRMCPGFNYGLASMELVLVAMLYHFDWSLPVGVKEV 2211

2212 DMEEAPGLGVRRRSPLLL 2265

 

>AC096855.1 $F CYP71V1 chromosome 3 clone OJ1365_D05 54% to AC087550 frameshift before PERF?

= AQ326032 AQ329780 73% to AF321860 Lolium rigidum similar to CYP71D sequences

87309 MDDYFFLQSLLLCVAAVALLQLAKVAATMRRRPRTPPGPWRLPVIGSMHHLVNALPHR 87136

87135 AMRDLAGVHGPLMMLRLGETPVVVASSRGAARAVLKTHDANFATRPRLLAGEIVGYGW 86962

86961 ADILFSPSGDYWRKLRQLCAAEILSPKRVLSFRHIREDEVTARVEEIRAAAAPSTPVNLS 86782

86781 VLFHSTTNDIVARAAFGRKRKSAPEFMAAIKAGVGLSSGFKIPDLFPTWTTALAAVTGMK 86602

86601 RSLRGIHKTVDAILQEIIDERRCVRGDKINNGGAADDQNADENLVDVLIALQEKGGF 86431 (1)

86339 GKSVTTPWVIVTHMICTLDVQDMFAGGTGTSASALEWAMSELMRNPAVMKKLQGQIREAFH 86157

86156 RKAVVTEADLQASNLRYLKLVIKEALRLHPPAPLLVPRESIDTCELDGYTIPAKSRVIVN 85977

85976 VWAIGRDPK Y*Y*E DAEEFKPEQFDDDAIDFMGGSYEFIPFGSGRRMCPGFNYGLASMEL 85797

85796 VLVAMLYHFDWSLLVGVKEVDMEEAPGLGVRRRSPLLLCATPFVPAAVSADY* 85638

 

#200

>aaaa01006345.1 CYP71V2 (indica cultivar-group) 77% to AC096855.1 $F

no orth 9/15/02

10460 MDELFYQSLLLSVAAVT 10410

10409 VLQLLKLLLVRHRRPRTPPGPWRLPVIGSMHHLVNVLPHRKLRELAAVHGPLMMLQLGET 10230

10229 PLVVATSKETARAVLKTHDTNFATRPRLLAGEIVGYEWADILFSPSGDYWRKLRQLCAAE 10050

10049 ILSPKRVLSFRHIREDE 9999

9730 VNLSVMFHSVTNSIVSRAAFGKKRKNAAEFLAAIKSGVGLASGFNIPDLFPT 9575

9574 WTGILATVTGMKRSLRAIYTTVDGILEEIIAERKGIRDEKISGGAENVDENLVDVLIGL 9398

9397 QGKGGFGFHLDNSKIKAIILQDMFA 9218

9217 GGTGTSASAMEWGMSELMRNPSVMKKLQAEIREVLRGKTTVTEADMQAGNLRYLKMVI 9044

9043 REALRLHPPAPLLVPRESIDVCELDGYTIPAKSRVIINAWAIGRDPKYWDNPEEFRPERF 8864

8863 EDGTLDFTGSNYEFIPFGSGRRMCPGFNYGLASMELMFTGLLYHFDWSLPEGVNEVDMAE 8684

8683 APGLGVRRRSPLMLCATPFVPVVSAN* 8603

 

#139

>aaaa01004037.1  CYP71V3 (indica cultivar-group) ortholog to AL732378.3 99%

12867 LPPGPRNLPVVGSAHRLVNTLAHRVLRDLADVHGPLMHLRVGQVPVVVVTSKELARD 12697

12696 ILKTHDANFATRPKLVAGGIVAYDWTDILFSPSGDYWRKLRRLCIQEILSAKRILSFEHI 12517

12516 REDEVRMLADEIRAVGPSVAVDLSARLHRITNTIVSRAAFGNKRSNAADFLVAIKQSVIM 12337

12336 ASGFYVPDLFPRFSVLLCWLTGMRRTLHGIRDTIDSILEEIISEKEEAKQQQDNNL 12169

12168 VDVLLSLKDKGDFGFPITRDTIKAIVL 12088

11880 DIFAGGSGTSANAMEWAMSELMMNPRVMNKVQAEIRDAFHGKQSIGEADLRARDLKYLKL 11701

11700 VMKETLRLHPPAPLLVPRESIDACEINGYMIPAKARVIVNAWAISRDPRYWEDAEEFKPE 11521

11520 RFAEGGIDFYGSNYEYTPFGSGRRMCPGYNYGLASMELTLAQLLHSFDWSMPDGATEV 11347

11346 DMTEAPGLGVRRKTPLLLCAAPYVASPI 11263

 

>AL732378.3 $F CYP71V3

      MAWLDDVLSLCNNNTRMCNALVLSVVVVSFLQLLKHVLLTPSRLP

64951 LPPGPRNLPVVGSAHRLVNTLAHRVLRDLADVHGPLMHLRVGQVPVVVVTSKELARDILK 64772

64771 THDANFATRPKLVAGGIVAYDWTDILFSPSGDYWRKLRRLCIQEILSAKRILSFEHIRED 64592

64591 EVRMLADEIRAVGPSVAVDLSARLHRITNTIVSRAAFGNKRSNAADFLVAIKQSVIMASG 64412

64411 FYVPDLFPRFSVLLCWLTGMRRTLHGIRDTIDSILEEIISEKEEAKQQQDNNLVDVLLSL 64232

64231 KDKGDFGFPITRDTIKAIVL 64172

63964 DIFAGGSGTSANAMEWAMSELMMNPRVMNKVQAEIRDAFHGKQSIGEADLRARDLKYLKL 63785

63784 VMKETLRLHPPAPLLVPRESIDACEINGYMIPAKARVIVNSWAISRDPRYWEDAEEFKPE 63605

63604 RFAEGGIDFYGSNYEYTQFGSGRRMCPGYNYGLASMELTLAQLLHSFDWSMPDGATEVDM 63425

63424 TEAPGLGVRRKTPLLLCAAPYVASHIYA* 63338

 

#193

>aaaa01006105.1a $FI CYP71V4 (indica cultivar-group) similar to Lolium rigidum AF321859

MDELLYRALLLSVLAVALLQIIEAFLIIIRAKPAAPPLPPGPWRLPVIGSMHHLAGKLPHRALRD 3196

LAAAHGPLMMLRLGETPLVVASSREMAREVLRTHDANFATRPRLLAGEVVLYGGADILF 3019

SPSGEYWRRLRQLCAAEVLGPKRVLSFRHIREQE (0) 2914

MESQVEEIRAAGPSTP

VDLTAMFSFLVISNVSRASFGSKHRNAKKFLSAVKTGVTLASGFKIPDLFPTWRKVLAAV 1750

TGMRRALEDIHRVVDSTLEEVIEERRSAREDKARCGMVGTEENLVDVLIGLHEQGGC 1579

LSRNSIKSVIFDMFTAGTGTLSSTLGWGMSELMRSPMVMSKLQGEIREVFYGKATVGEED 1399

IQASRLTYLGLFIKETLRLHPPVPLLVPRESIDTCEIKGYMIPARSRIIVNAWAIGRDPR 1219

YWDDAEEFKPERFEKNIVDFTGSCYEYLPFGAGRRMCPGVAYGIPILEMALVQLLYHFDW 1039

SLPKGVVDVDMEESSGLGARRKTPLLLCATPFVVPVL* 925

 

aaaa01006105.1a  no japonica ortholog found 9/7/02

 

#439

>aaaa01045745.1 CYP71V4 (indica cultivar-group) 64% to AC096855.1 $F

98% to aaaa01006105.1a $FI 2 diffs

2   ESIDTCEIKGYMIPARSRIIVNAWAIGRDPRYWDDAEEFKPKRFEKNMVDFTGSCYEYLP 181

182 FGAGRRMCPGVAYGIPILEMALVQLLYHFDWSLPKGVVDVDMEESSGLGARRKTPLLL 355

 

no japonica ortholog found 9/12/02

 

#194

>aaaa01006105.1b $FI CYP71V5 (indica cultivar-group)

6903 MDGLLYQALLLSALAVAVLQIVKLAVVNRGKKQAAAAAPTPPGPWRLPVIGSMHHLAGKLAHRALRD 6703

6702 LAAVHGPLMMLQLGETPLVVVSSREVAREVLRTHDANFATRPRLLAGEVVLYGGADILF 6526

6525 SPSGEYWRKLRQLCAAEVLGPKRVLSFRHIREQE (0) 6421

     MASRVERIRAVGPSVP

5914 VDVSALFYDMAISIVSCASFGKKQRNADEYLSAIKTGISLASGFKIPDLFPTWRTVLAAV 5735

5734 TGMRRALENVHRIVDSTLEEVIEERRGAARECKGRLDMEDNEENLVDVLIKLHEQGG 5564

5563 HLSRNSIKSVIFDMFTAGTGTLASSLNWGMSELMRNPRVMTKLQGEIREAFHGKATV 5393

5392 GEGDIQVSNLSYLRLFIKETLRLHPPVPLLVPRESIDMCEVNGYTIPARSRIVVNAWAIG 5213

5212 RDPKYWDDPEEFKPERFEGNKVDFAGTSYEYLPFGAGRRICPGITYALPVLEIALVQLIY 5033

5032 HFNWSLPKGVTEVDMEEEPGLGARRMTPLLLCATPFVVPVL* 4907

 

aaaa01006105.1b no japonica ortholog found 9/7/02

 

#195

>aaaa01006105.1c $PI CYP71V6P (indica cultivar-group) 79% to AAAA01006105.1b

but no Nterm exon present in 2000bp to end of clone

12642 (0) IASRVDLICAVGPLTL

12594 VDVSALFYDITISIASCASFGKKHRNVDEYLSSIKTRVSLASRFKIPDLFPSWRTMLAMV 12415

12414 TGMRRALEEVHGIVDSTLEDVIEERQGEKEDKTRPDMVDTKENLVDVLIGLHENGA 12247

12246 HLSRDSIKAVIFDMFTAGTGTLASALNWGMSKLMRNPRVMTKLQGEIRKAFHGKVTVG 12073

12072 EDDIQAANLPYIRLFIEETLLLHPVVPLLVPRESIDVCEVNGYTILARSRIVVNAWAIGR 11893

11892 DPKYWDNPEEFKPEWFEGNIVDFPGSSYEYLPFGAG*RMCPGIAYGLPVLEMALVQLLYH 11713

11712 FD*SLPNGVMKVDMEEEPGLGARRKTPLLLNLFVIPVLQGQQ*  11578

 

aaaa01006105.1c no japonica ortholog found 9/7/02

 

#406

>aaaa01023722.1 $FI CYP71W1 (indica cultivar-group)

71  MELTTLLLLALISFFFLVKLIARYASPSGRESALRLPPGPSQLPLIGSLHHLLLSRYGDL 250

251 PHRAMRELSLTYGPLMLLRLGAVPTLVVSSAEAAAEVMRAHDAAFAGRHLSATIDILSC 427

428 GGKDIIFGPYTERWRELRKVCALELFNHRRVLSFRPVREDEVGRLLRSVSAASAEGGA 601

602 ACFNLSERICRMTNDSVVRAAFGARCDHRDEFLHELDKAVRLTGGINLADLYPSSRLVRR 781

782 LSAATRDMARCQRNIYRIAESIIRDRDGAPPPERDEEDLLSVLLRLQRSGGLKF 943

944 ALTTEIISTVIF (0) 979

1125 DIFSAGSETSSTTLDWTMSELMKNPRILRKAQSEVRETFKGQDKLTEDDVAKLSYLQLVI 1304

1305 KETLRLHPPAPLLIPRECRETCQVMGYDVPKGTKVFVNVWKIGREGEYWGDGEIFRPERF 1484

1485 ENSTLDFRGADFEFIPFGAGRRMCPGIALGLANMELALASLLYHFDWELPDGIKSEELDM 1664

1665 TEVFGITVRRKSKLWLHAIPRVPYYSTY* 1751

 

no japonica ortholog found 9/12/02

 

#330

>aaaa01013880.1b CYP71W2P (indica cultivar-group) orth of AC120537.1a

stops are even in the same location

4325 ARRAQSPRHVR*D*QAGYLVHAVVGECAPGGAGAVVPISEKISRMVNDSVVRPAIGSRCAR 4507

     gap

4868 ETFFNMDNLRTHDTYRKKNHSGNSQHCTASFIVFSFSELQLKMTIWQSHHYKLPINLRK 5044

5045 YFQQGARQLNDTLVGNI*ASEKYPQVMQKAQTEVREKFR 5161

     G*DKLIKDDMNRLSYLHLVIQE 5226

5227 TLRLH 5241

 

>AC120537.1a CYP71W2P chromosome 3 clone pseudogene fragment

2543 ARRVQSPRHVR*D*QAGYLVHAVVGECAPGGAGAVVPISEKISRMVNDSVVRPAIGSRCA 2364

2363 RRDEFLHVQARGLRQARGRVQLGRPVPIVVASELAQRRAAVGRPSVAAGAFARCGRPAET 2184

2183 FFNMDNLRTHDTYRKKNHSGNSQHCTAFSALSFSELQLKMTIWQSHHYKLPINLREIFS  

2006 SAGSETLNDTLVGNI*ANEKYPQVMQKAQTEVREKFRG*DKLIKDDMNRLSYLHL 1846

1845 VIQETLRLH

 

#329

>aaaa01013880.1a $FI CYP71W3 (indica cultivar-group) ortholog of AC120537.1b AQ869247.1

 801 MEVSLPLLIGVVLAFLLLFVLVNVKNSCRSWWPPPEKEKKKLRLPPGPWRLPLVGSLHHVLLS (fs) 989

 991 RHADLPHRALRELAGKYGPLMMLRFGAVPTLVVSSAEAAREVLKTYDAAFASRYLTPTLA 1170

1171 VLSRGGRDILFSPYCDLWRQLRRICVHELLSARRVQSLRHVREDEAARLVRSVAAECAG 1347

1348 RGGAAVVSVGELISRAVNDSVVRSAVGARSARRDEFVRELDESVRLSGGFNLADLYPSSW 1527

1528 LARRLSCAMRETERCNRSLMAIMDDIIREHGDGEEDLLGVLLRLQRNGDVQCPL 1689

1690 TTDLITNVVL (0)

2513 DMFAAGSETSSTTLEWALTELVRNPHIMEKAQSEVREIFRGENKLTEEVMDKL 2671

2672 SYLRLVIRETLRLHLPVPFLLPRQCREPCSVMGYDIPVGTKVLVNAWAIARDNQYWDDPE 2851

2852 VFKPERFENNRVDFKGIDFEFIPFGAGRRICPGIALGLANIELMLASLLYHFDWEFLDRD 3031

3032 RNDEIDLSETFGITAKRKSKLMVYATQRIPCLG* 3133

 

>AC120537.1b $F CYP71W3 chromosome 3 clone

AQ869247.1 nbeb0034D08r CUGI Rice BAC genomic Length = 447 53% to 99A1

AZ130570.1 OSJNBb0104D19r CUGI Rice BAC genomic Length = 327

80150 MEVSLPLLIGVVLAFLLLFVLVNIKNSCRSWWPPPEKEKKKLRLPPGPWQLPLVGSLHHV 80329

80330 LLSRHADLPHRALRELAGKYGPLMMLRFGAVPTLVVSSAEAAREVLKTYDAAFASRYL 80503

80504 TPTLAVLSRGGRDILFSPYCDLWRQLRRICVHELLSARRVQSLRHGREDEAARLVRSVAA 80683

80684 ECAARGGAAVVNVGELISRAVNDSVVRSAVGARSARRDEFVRELDESVRLSGGFNLADLY 80863

80864 PSSWLARRLSGAMRETERCNRSLMAIMDDIIREHGDGEEDLLGVLLRLQRNGDVQCPLTT 81043

81044 DLITNVVL (0) 81067

81865 DMFAAGSETSSTTLEWALTELVRNPHIMEKAQSEVREIFRGENKLTEEMMDKLSYLRLVI 82044

82045 RETLRLHLPVPFLLPRQCREPCSVMGYDIPVGTKVLVNAWAIARDNQYWDDPEVFKPERF 82224

82225 ENNRVDFKGIDFEFIPFGAGRRICPGIALGLANIELMLASLLYHFDWEFLDRDRNDEIDL 82404

82405 SETFGITAKRKSKLMVYATQRIPCLG 82482

 

#392

>aaaa01021177.1 $FI CYP71W4 (indica cultivar-group) ortholog to AC120537.1c

AAAA01039974.1 (indica cultivar-group) Nterm 132 aa

396 MEMTLPLLGAAVVLAAFLLFFLVKNNRCCWSPAAERRLRLPPGPWRLPLVGSLHHVLLSR 217

216 HGDLPHRALRELAGRYGALMLLRFGAVPTLVVSSAEAAREVLKTHDACFASRHMTPTLAV 37

36  FTRGGRDILF 7

3930 SPYGDLWRQLRRICVLELFSARRVQSLRHVREDEAARLVRAVAEECAIGGGGGAVVPIGD 3751

3750 MMSRMVNDSVVRSAIGGRCARRDEFLRELEVSVRLTGGFNLADLYPSSSLARWLSGALRE 3571

3570 TEQCNRRVRAIMDDIIRERAAGKDDGDGEDDLLGVLLRLQKNGGVQCPLTTDM 3412

3411 IATVIM (0) 3394

2892 EIFSAGSETASTTLEWAISELVRNPKVMDKAQSEVRKLFEGQDNLTEDDMSRLSYLHLVI 2713

2712 RETLRLHAPAPFLLPRECREQCNVMGYDITEGTRVLVNAWAIARDTRYWEDPEIFKPERF 2533

2532 NANLVDFKGNYFEYIPFGSGRRVCPGITLGLTSMELVLASLLYYFDWELPGGKRCEEIDM 2353

2352 SEAFGITVRRKSKLVLHATPRVPCLH* 2272

 

>AC120537.1c $F CYP71W4 chromosome 3 clone OSJNBb0042N11

AQ573952 nbxb0083G09r 60% to AQ259669 52% to 99A1 51% to 71B23

BM039053 clone V013G04.Length = 527

103769 MEMTLPLLGAAVVLAAFLLFFLVKNNRCCWSPAAERR

103880 LRLPPGPWRLPLVGSLHHVLLSRHGDLPHRALRELAGRYGALMLLRFGAVPTLVVSSAE 104056

104057 AAREVLKTHDACFASRHMTPTLAVFTRGGRDILF SPYGDLWRQLRRICVLELFSARRVQS 104236

104237 LRHVREDEAARLVRAVAEECAIGGGGGAVVPIGDMMSRMVNDSVVRSAIGGRCARRDEFL 104416

104417 RELEVSVRLTGGFNLADLYPSSSLARWLSGALRETEQCNRRVRAIMDDIIRERAAGKDDG 104596

104597 DGEDDLLGVLLRLQKNGGVQCPLTTDMIATVIM (0) 104695

105190 EIFSAGSETASTTLEWAISELVRNPKVMDKAQSEVRKLFEGQDNLTEDDMSRLS 105351

105352 YLHLVIRETLRLHAPAPFLLPRECREQCNVMGYDITEGTRVLVNAWAIARDTRYWEDPEI 105531

105532 FKPERFNANLVDFKGNDFEYIPFGSGRRVCPGITLGLTSMELVLASLLYHFDWELPGGKR 105711

105712 CEEIDMSEAFGITVRRKSKLVLHATPRVPCLH* 105810

 

#393

>AC078894.1 $P CYP71W5P chromosome 10 clone OSJNBa0096G08 12 unordered pieces starts 123

probable pseudogene fragment 47% to 71B1 2 diffs with AP004175.1 $P

80% to 71W4

51119 LRRICVLELFSAHRV*SLHHVREEEAAPLVRVVADIRSPLGP 50994

 

#394

>AP004175.1 $P CYP71W6P chromosome 2 clone OJ1006_B12 pseudogene fragment

94% to AC078894 51119-50994

64520 LRRICMLELFSAHRV*SLHHVREEEAARLVRVVA 64419
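
The "2 diffs" and "94%" notes on these two pseudogene fragments can be reproduced with a simple ungapped comparison. The sketch below is illustrative only: the helper name `identity` is hypothetical, and it assumes the two fragments align from their first residue with no gaps, which is not necessarily how the percentages in this file were originally computed.

```python
def identity(a: str, b: str) -> tuple[int, float]:
    """Return (mismatches, percent identity) over the ungapped overlap of a and b."""
    n = min(len(a), len(b))                      # compare only the shared length
    diffs = sum(1 for x, y in zip(a[:n], b[:n]) if x != y)
    return diffs, 100.0 * (n - diffs) / n

# Fragments copied from the AC078894.1 and AP004175.1 entries above.
ac078894 = "LRRICVLELFSAHRV*SLHHVREEEAAPLVRVVADIRSPLGP"
ap004175 = "LRRICMLELFSAHRV*SLHHVREEEAARLVRVVA"

diffs, pct = identity(ac078894, ap004175)
print(diffs, round(pct))                         # 2 mismatches over 34 residues
```

Under that assumption, 32 of the 34 overlapping positions match, which is consistent with the 94% figure.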

 

#311

>aaaa01012657.1 CYP71X1P (indica cultivar-group) orth AP003990.1g $P chr 2 99%

7170 FTPLFLLAVLPLKLTNGGDGV*LPPGPWRLPVIGSMHHLMGESLVHRAMADLARRLDAPL 7349

7350 MYLKLGEVPVVLASSPCAAREIMRVHDVAFASRP 7451

7490 RQLRKICVVELLSARRVRTFRRVREEE 7570

6081 DLFSGGSETSATVIQWAMSELMKNPRVMRKVQAELRDKLAGKPRVTEDDLSDLKYLKL 5908

5907 VIKETLRLHPAAPLLVPRQCREPCKIMGYDIPKGTTVFVNAW 5782

5782 AIGRDPNYWDDAEVFRLERFANSTIDFKGMDMEFIPFGAGRRMCPGLAFAEAIMDLLFST 5603

5602 LLFHFDWELPCGMTASELDMIEEMALTVRRKNDLHLRPIL 5483

 

>AP003990.1g $P CYP71X1P chromosome 2 clone OJ1073_F05 pseudogene

42530 MDHVLACVGILVAFTPLFLLAVLPLKLTNGGDGVKLPPGPWRLPVIGSMHHLMGESLVHRAMAD 42721

42722 LARRLDAPLMYLKLGEVPVVLASSPCAAREIMRAHDVAFASRPLSPTVRRMR 42877

42878 PPPPRRRQLRKICVVELLSARRVRTFRRVREEEVARLVGALVCLAHVA 43021 gap

      AMIGARFERRDEFLE

      missing mid region

43862 DLFSGGSETSATVIQWAMSELMKNPRVMRKVQAELRDKLAGKPRVTEDDLSDLKYLKL 44035

44036 VIKETLRLHPAAPLLVPRQCREPCKIMGYDIPKGTTVFVNAW 44161 frameshift

44161 AIGRDPNYWDDAEVFRLERFANSTIDFKGMDMEFIPFGAGRRMCSGLAFAEAIIDLLFS 44337

44338 TLLFHFDWELPCGMTASELDMIEEMALTVRRKNDLHLRPILRVPQTQTSSALLF* 44502

 

 

#222

>aaaa01007431.1b CYP71X2 (indica cultivar-group) orth AP003990.1f

10785 MYDAVACVVAVVVVVIFAMLRVKLARSGDGGGGGGGGVR

10668 LPPGPWRLPVIGSLHHVVGDRLLHRSMARIARRLGDAPLVYLQLGEVPVVVASSPGAARE 10489

10488 VTRTHDLAFADRALNPTARRLRPGGAGVALAPYGALWRQLRKICVVELLSARRVRSFRRV 10309

10308 REEEAGRLVGALAAAAASPGEEAAVNFTERIAEAVSDAALRAMIGDRFERRDEFLQ 10141

10140 ELTEQMKLLGGFSLDDLFPSSWLASAIGGRARRAEANSRKLYELMDCAIRQHQQQRAE 9967

9966  AAVVDGGAGVEDDKNQDLIDVLLNIQKQGELETPLTMEQIKAVIL 9790

9595 DLFSGGSETSATTLQWAMSELIKNPMVMQKTQAELRDKLRRKPTVTEDDLSGLKYVKL 9422

9421 IIKETLRLHPVVPLLVARECRESCKVMGYDVPKGTTVFVNVWAIGRDPKYWDDAEEFRPE 9242

9241 RFEHSTVDFKGVDLEFIPFGAGRRICPGMAFAEAIMELLLAALLYHFDWELPNGMVASEL 9062

9061 DMTEEMGITVRRKNDLHLRPXXXXXXXXXXXXXXXRERERHFV 8936

 

>AP003990.1f $F CYP71X2 chromosome 2 clone OJ1073_F05

38238 MYDAVACVVAVVVVVVFAMLWVKLARSGDGGGGGSGGVRLPPGPWRLPVIGSLHHVVGDRLLHRSMA 38438

38439 RIARRLGDAPLVYLQLGEVPVVVASSPGAAREVTRTHDLAFADRALNPTARRLRPGGAGV 38618

38619 ALAPYGALWRQLRKICVVELLSARRVRSFRRVREEEAGRLVGALAAAAASPGEEA 38783

38784 AVNFTERIAEAVSDAALRAMIGDRFERRDEFLQELTEQMKLLGGFSLDDLFPSSWLASAI 38963

38964 GGRARRAEANSRKLYELMDCAIRQHQQQRAEAAVVDGGAGVEDDKNQDLIDVLLNIQKQG 39143

39144 ELETPLTMEQIKAVIL 39191 (0)

39428 DLFSGGSETSATTLQWAMSELIKNPMVMQKTQAELRDKLRRKPTVTEDDLSGLKYVKL 39601

39602 IIKETLRLHPVVPLLVARECRESCKVMGYDVPKGTTVFVNAWAIGRDPKYWDDAEEFRPE 39781

39782 RFEHSTVDFKGIDLEFIPFGAGRRICPGMAFAEAIMELLLAALLYHFDWELPNGMAASE 39958

39959 LDMTEEMGITVRRKNDLHLRPHPPCVVRSNFRSFVERERERHFV* 40093

 

#221

>aaaa01007431.1a CYP71X3 (indica cultivar-group) orth AP003990.1e $F chr 2 99%

lower case does not match japonica seq, but matches seq b

4252 NLPPSPSRLPFIGSFHLLRRSPLVHRALADVVRQLGAPPLMYMEIGEVPAIVVSCADAAR 4431

4432 EIMKTHDINFASRPWPPTVQKLRAQGKGIFFEPYGALWRQLRKICIVKLLSVRRVSSFHG 4611

4612 VREEEAGRLVAAVAATPPGQAVNLTERIKVAIADTTMRPMIGERFERREDFLEV 4773

4774 LPEIVKLASGFSLDDLFPSSwlagaiggsrrRGEAVNRASYELVDSAFRQRQQQKEAM 4947

4948 AAPPPDIAKEEEDDLMDELIRIHKEGSLEVPLTAGNLKAVI  5070

5313 ELFCAGSETSSNAIQWAMSELVRNPRVMEKAQNEVRSILKGKPTVTEADMVDLTY 5486

5487 VKMIVKETHRLHPVLPLLTPRVC*QTCQIMGYDVPQGSVIFINSWAIMRDPKHWDDAETF 5666

5667 KPERFEDSEIDLKGTNYEFIPYGAGRRICPGLALAQVSIEFILTTLLYHFNWELPKGAAP 5846

5847 KELDMTEDMGLTIRRKNDLYLLPTL 5921

 

>AP003990.1e $F CYP71X3 chromosome 2 clone OJ1073_F05

33613 MEQVSCFAAAAAAVLVVLSLARMLLAPRREWD

33709 GLNLPPSPSRLPFIGSFHLLRRSPLVHRALADVVRQLGAPPLMYMEIGEVPAIVVSCADA 33888

33889 AREIMKTHDINFASRPWPPTVQKLRAQGKGIFFEPYGALWRQLRKICIVKLLSVRRVSSF 34068

34069 HGVREEEAGRLVAAVAATPPGQAVNLTERIKVAIADTTMRPMIGERFERRE 34221

34222 DFLEVLPEIVKLASGFSLDDLFPSS 34296 check joint

      GSPAPSAARGEAVNRASYELVDSAFRQRQQQKEAMAAPPPDIAKEE

      EDDLMDELIRIHKEGSLEVPLTAGNLKAVIL 34528 (0)

34777 ELFCAGSETSSNAIQWAMSELVRNPKVMEKAQNEVRSILKGKPTVTEADMVDLTY 34941

34942 VKMIVKETHRLHPVLPLLTPRVCQQTCQIMGYDVPQGSVIFIKSWAIMRDPKHWDDAETF 35121

35122 KPERFEDSEIDLKGTNYEFTPYGAGRRICPGLALAQVSIEFILTMLLYHFNWELPNGAA 35298

35299 PEELDMTEDMGLTIRRKNDLYLLPTLRVPLTA* 35397

 

#221

>aaaa01070587.1 CYP71X3 (indica cultivar-group) orth AP003990.1e $F chr 2 96%

see aaaa01007431.1a for ortholog

84  YQVKVSHMLHFGIV*ELFCAGSETSSNAIQWAMSELVRNPRVMEKAQNEVRSILKGKP 257

258 TVTEANMVDLTYVKMIVKETHRLHPVLPLLTPRVCQQTCQIMGYDVPQGSVIFINSWTIM 437

438 RDPKHWDDAETFKPERFEDSEIDLKGTNYEFTPYGAGRRICPGLALAQVSIEFILATLLY 617

618 HFNWELPNGAAPKELDMTEDMGLTIRRKNDLYLLPTL 728

 

#376

>AP003990.1d $F CYP71X4 chromosome 2 clone OJ1073_F05 no ortholog

27391 MEQVSCFAAAAAVVVVVLLLARMLLAPRGEWDGLNLPPSPPRLPFIGSFHLLRRSPLVHRALADVARQL 27597

27598 GSPPLMYMRIGELPAIVVSSADAAREVMKTHDIKFASRPWPPTIRKLRAQGKGIFFEPYG 27777

27778 ALWRQLRKICIVKLLSVRRVSSFHGVREEEAGRLVAAVAATPPGQAVNLTE 27930

27931 RIEVVIADTTMRPMIGERFERREDFLELLPEIVKIASGFSLDDLFPSSWLACAIGGSQRR 28110

28111 GEASHRTSYELVDSAFRQRQQQREAMAASPPDIAKEEEDDLMDELIRIHKEGSLEVPLTA 28290

28291 GNLKAVIL 28314 (0)

28577 DLFGAGSETSSDALQWAMSELMRNPRVMEKAQNEVQSILKGKPSVTEADVANLKY 28741

28742 LKMIVKETHRLHPVLPLLIPRECQQTCQIMGYDVPQGSVIFINSWAIMRDPKHWDDAETF 28921

28922 KPERFEDGEIDLKGTNYEFTPFGAGRRICPGLALAQASIEFMLATLLYHFDWELPNRAA 29098

29099 PEELDMTEEMGITIRRKKDLYLLPTLRVPLTA* 29197

 

#375

>aaaa01017763.1 CYP71X5 (indica cultivar-group) orth AP003990.1c $F chr 2 97%

1306 FLFLALMVVRLRTKRRGDNNGGVKLPPGPWRLPLVGNLHQVMARGPLVHRTMADLARRLD 1485

1486 APLMSLRLGEVPVVVASSADAAREITKTHDVAFATRPWSPTTRRLRCDGEGVVFATYGAL 1665

1666 WRQLRKLCVVELLGARRVRSFRRVREEEARRLVAAVAASPRGEAVNVSERITAVITDAT 1842

1843 MRAMIGDRFGRRDEFLELLADIVKIGSGFSLDDLFPSWRLAGAIGGMARRAEANH 2007

2008 RRTYELMDSVFQQHEQRRVHVAAPADGAMDDAEEDLVDVLFRIQKDGGLEVPLTIGNI 2181

2182 KAIIL

2562 DLFNAGSETSANTLQWVMSELMRNPKVMRKAQAELRNNLQGKTTVTEDDLTNLKY 2735

2736 LKLVIKETLRLHPVLPLLLPRECQEACNVIGYDVPKYTTVFINVWAINRDPKYWDMAEMF 2915

2916 KPERFDNSMIDFKGTDFEFVPFGAGRRICPGIAFAQSNMELVLATLLYHFDWELPSGMSP 3095

3096 EELDMTEDMGLSVRRKNDLYLHPTV 3170

 

>AP003990.1c $F CYP71X5 chromosome 2 clone OJ1073_F05 one in frame stop at W

AQ259669 61% identical to AQ328148 53% to 76C2 also has stop at W

AQ690680.1 nbxb0082B18f CUGI Rice BAC genomic clone Length = 768

AQ579195 nbxb0084A11f AQ509836 nbxb0094K16f 72% identical to AQ259671

16507 MEKVAWCACFLLLALMVVRLTAKRRGDNGAERLPPGPWRLPLVGNLHQVMARGPLVHRTMADLARRLD 16710

16711 APLMSLRLGEVPVVVASSADAAREIMRTHDVAFATRPWNPTTRRLRCDGEGVVFATYGAL 16890

16891 WRQLRKLCVVELLGARRVRSFRRVREEEARRLVAAVAASPRGEAVNVSERI 17043

17044 TAVITDATMRAMIGDRFGRRDEFLELLADIVKIGSGFSLDDLFPSWRLAGAIGGMARRAE 17223

17224 ANHRRTYELMDSVFQQHEQRRVHVAAPADGAMDDAEEDLVDVLFRIQKDGGLEVPLTIGN 17403

17404 IKAIIL 17421 (0)

17795 DLFNAGSETSANTLQWVMSELMRNPKVMRKAQAELRNNLQGKTTVTEDDLTNLKYLKL 17968

17969 VIKETLRLHPVLPLLLPRECREACNVIGYDVPKYTTVFINV*AINRDPKYWDMAEMFKPE 18148

18149 RFDNSMIDFKGTDFEFVPFGAGRRICPGIAFAQSNMELVLATLLYYFDWELPSGMSPEE 18325

18326 LDMTEDMGLSVRRKNDLYLHPTVCVPL* 18409

 

#75

>aaaa01002047.1c $PI CYP71X6 (indica cultivar-group) ortholog of AP003990.1b 99%

20381 MEKVAWCACFLFLALMVVRLRTKRRGDNNGGVKLPPGPWRLPLVGNLHQVMARGPLVHRT 20560

20561 MADLARRLDAPLMSLRLGEVPVVVASSADAAREITKTHDVAFATRPWSSTIRVMMSEGVG 20740

20741 LVFAPYGALWRQLRKIAMVELLSARRVQSFRRIREDEVGRLVAAVAAAQPGEAVNVSERI 20920

20921 AALVSDAAVRTIIG 20962

20979 VAWCACFLFLALMVVRLRTKRRGDNNGGVKLPPGPWRLPLVGNLHQVMA 21125 aa 452 out of sequence

21126 DLFPSSRLASFIGGTTRRAEANHRKNFELMECALKQHEEKRAAAAAAAAGAVEDDEDI 21299

21300 VDVLLRIQKEGSLQVPLTMGNIKAVVL 21380

22004 DLFSAGSETSANTLQ*AMAELIMNPRTMLKAQAELRDALQGKQIVSEYDLVKLKYLKLII 22183

22184 KETLRLHPVVPLLLPRECQETCKVMDYDIPIGTIVLVNVWVIGRDPKYWD 22333

22334 DAKTFRLERFEDGHVDFKGMNFEYLPFGAGRRMCPGVAFAEAIMELALASLLYHFDWEFP 22513

22514 DGILPAKMDMMEVMGSTV*KKNDLYLVPNAHVPVAP 22621

 

>AP003990.1b $P CYP71X6 chromosome 2 clone OJ1073_F05

11020 MEKVAWCACFLFLALMVVRLRTKRRGDNNGGVKLPPGPWRLPLVGNLHQVMARGPLVHRTMADLA 11214

11215 RRLDAPLMSLRLGEVPVVVASSADAAREITKTHDVAFATRPWSSTIRVMMSDGVGLVFAP 11394

11395 YGALWRQLRKIAMVELLSARRVQSFRRIREDEVGRLVAAVAAAQPGEAVNV 11547

11548 SERIAALVSDAAVRTIIGDRFERRDEFLEGLAEGIKITSGFSLGDLFPSSRLASFIGGTT 11727

11728 RRAEANHRKNFELMECALKQHEEKRAAAAAAAAGAVEDDEDIVDVLLRIQKEGSLQVPLT 11907

11908 MGNIKAVVL 11934 (0)

12556 DLFGAGSETSANTLQ*AMAELIMNPRTMLKAQAELRDALQGKQIVSEYDLVKLKYLKL 12729

12730 IIKETLRLHPVVPLLLPRE 12786 frameshift

      CQETCKVMDYDVPIGTIVLVNMWVIGRDPKYWEDAKTFRPERFEDGHIDFKGMNF 12955

12956 EYLPFGAGRRMCPGVAFAEAIMELALASLLYHFDWEFPDGISPAKMDMMEVMGSTVRKKN 13135

13136 DLYLVPNAHVPVAP* 13180

 

note cluster continues on AP003990.1 to sequence j

 

#74

>aaaa01002047.1b $FI CYP71X7 (indica cultivar-group) ortholog of AP003990.1a 99%

16718 MADLELEKVASFLLAALLPLVLFKLAAAKRGGGDGGMRLPPGPWRLPVIGNLHQIMAGGQ 16897

16898 LVHRTMAGLARGLGDAPLLSLRLGEVPVVVASSADAAREIMSRHDAKFATRPWSPTVRVQ 17077

17078 MVDGEGLAFAPYGALWRQLRKITMVELLSPRRVRSFRRVREEEVGRLVVAVATAATRRPG 17257

17258 EAAVNVGERLTVLITDIAMRTIIGDRFERREDFLDAAAEWVKIMSGFSLGDLFPSSRLAS 17437

17438 FVSGTVRRAEANHRKNFELMDYALKQHEEKRAAAAAAGAGAVEDDEDIVDVLLRIQKEGG 17617

17618 LEVPLTMGVIKGVIR 17662

17911 DLFGAGSETSANTLQWTMSELVRNPRVMQKAQTELRDCLRGKQSVSEDDLIGLKYLK 18081

18082 LVIKETLRLHPVVPLLLPRECQETCNIMGYDVPKGTNVLVNVWAICRDPRHWENAETFIP 18261

18262 ERFEDSTVDFKGTDFEFIPFGAGRRMCPGLAFAQVSMELALASLLYHFDWELPSGVAPSN 18441

18442 LDMEEEMGITIRRKNDLYLVPKVHVPL 18522

 

>AP003990.1a $F CYP71X7 chromosome 2 clone OJ1073_F05 42% to 71B24

AQ259671 323-379 region I-helix 55% to 71B4

AQ691116.1 nbxb0088K01f CUGI Rice BAC genomic clone Length = 544

7359 MADLELEKVASFLLAALLPLVLFKLAAAKRGGGDGGMRLPPGPWRLPVIGNLHQIMAGGQ 7538

7539 LVHRTMAGLARGLGDAPLLSLRLGEVPIVVASSADAAREIMSRHDAKFATRPWSPTVRVQ 7718

7719 MVDGEGLAFAPYGALWRQLRKITMVELLSPRRVRSFRRVREEEVGRLVVAVATAA 7883

7884 TRRPGEAAVNVGERLTVLITDIAVRTIIGDRFERREDFLDAAAEWVKIMSGFSLGDLFPS 8063

8064 SRLASFVSGTVRRAEANHRKNFELMDYALKQHEEKRAAAAAAGAGAVEDDEDIVDVLLRI 8243

8244 QKEGGLEVPLTMGVIKGVIR 8303 (0)

8551 DLFGAGSETSANTLQWTMSELVRNPRVMQKAQTELRDCLRGKQSVSEDDLIGLKY 8715

8716 LKLVIKETLRLHPVVPLLLPRECQETCNIMGYDVPKGTNVLVNVWAICRDPRHWENAETF 8895

8896 IPERFEDSTVDFKGTDFEFIPFGAGRRMCPGLAFAQVSMELALASLLYHFDWELPSGVA 9072

9073 PSNLDMEEEMGITIRRKNDLYLVPKVRVPL* 9165

 

#72

>aaaa01002047.1a $FI CYP71X8 (indica cultivar-group)

ortholog of AP004000.1a 99%

2682 MAMVQDITGYLCLSLALLLLTLVLHKVARKATGNGAGKPRLPPGPWRLPVIGNLHQVAMG 2861

2862 GPLVHRTMADLARRLDAPLMSLRLGELRVVVASSANAAREITKTHDVAFATRPWSSTIRV 3041

3042 LMSDGVGLVFAPYGALWRQLRKIAVVELLSARRVQSFRRIREDEVGRLVAAVAAAPAAQP 3221

3222 VNVSERIAALISDSAVRTIIGDRFERRDEFLEGLAEAIKITSGFSLGDLFPSSRLASFVG 3401

3402 GTTRRAEANHRKNFELIECALRQHEERRAAGAVDDDEDLVDVLLRVQKDGSLQMPLTMGN 3581

3582 IKAVVL 3599

4057 ELFGAGSETSANTLQWAMTELIMNPRVMLKAQAELSNVIKGKQTISEDDLVELKYLKLII 4236

4237 KETLRLHPVVPLLLPRECRETCEVMGYDIPIGTIVLVNVWAIGRDPKYWEDAETFIPERF 4416

4417 EDGHIDFKGTNFEFIPFGAGRRMCPGMVFAEVIMELALASLLYHFDWELPDGISPTKVDM 4596

4597 MEELGATIRRKNDLYLIPAVRVPLSTVL 4680

 

>AP004000.1a $F CYP71X8 chromosome 2 clone OJ1115_B01

95316 MAMVQDITGYLCLSLALLLLTLVLHKVARKATGNGAGKPRLPPGPWRLPVIGNLHQVAMGGPLVHRTMADLA 95101

95100 RRLDAPLMSLRLGELRVVVASSANAAREITKTHDVAFATRPWTSTIRVLMSDGVGLVFAP 94921

94920 YGALWRQLRKIAVVELLSARRVQSFRRIREDEVGRLVAAVAAAPAAQPVNV 94768

94767 SERIAALISDSAVRTIIGDRFERRDEFLEGLAEAIKITSGFSLGDLFPSSRLASFVGGTT 94588

94587 RRAEANHRKNFELIECALRQHEERRAAGAVDDDEDLVDVLLRVQKDGSLQMPLTMGNIKA 94408

94407 VVL 94399 (0)

93945 ELFGAGSETSANTLQWAMTELIMNPRVMLKAQAELSNVIKGKQTISEDDLVELKYLKL 93772

93771 IIKETLRLHPVVPLLLPRECRETCEVMGYDIPIGTTVLVNVWAIGRDPKYWEDAETFIPE 93592

93591 RFEDGHIDFKGTNFEFIPFGAGRRMCPGMAFAEVIMELALASLLYHFDWELPDGISPTK 93415

93414 VDMMEELGATIRRKNDLYLIPTVRVPLSTVL* 93319

 

#72

>aaaa01027906.1 CYP71X8 (indica cultivar-group) orth AP004000.1a $F chr 2 99%

see aaaa01002047.1a for ortholog

1930 LFGAGSETSANTLQWAMTELIMNPRVMLKAQAELSNVIKGKQTISEDDLVELKYLKLI 1757

1756 IKETLRLHPVVPLLLPRECRETCEVMGYDIPIGITVLVNVWAIGRDPKYWEDAETFIPER 1577

1576 FEDGHIDFKGTNFEFIPFGAGRRMCPGMAFAEVIMELALASLLYHFDWELPDGISPTKMD 1397

1396 MMEELGATIRRKNDLYLIPAV 1334

 

#72

>aaaa01012191.1 CYP71X8 (indica cultivar-group) orth AP004000.1a $F chr 2 98%

see aaaa01002047.1a for ortholog

762 KPRLPPGPWRLPVIGNLHQIMVGGPLVHRTMADLARRLDAPLMSLRLGELRVVVLYYRFI 583

582 *IPALSPFYLATRPWSSTIRVLMSDGVGLVFAPYGALWRQLRKIAVVELLSARRVQSF 409

408 RRIREDEVGRLVAAVAAAAAASAAQPVNVSERIAALISDSAVRTIIGDRFERRDEF 241

240 LEGLAEGIKITSGFSLGDLFPS 175

411 TARRKADLHLRPCL 370

 

#73

>aaaa01030108.1 CYP71X9P orth of AP004000.1b

1684 RVIASSTGAACREFTETHDVKFATRPWSSTVRVLMADGLG 1565

1556 GLVFAPYGALWRQLRKIAMVELLSARRVQSHRRYRRRGDAAR 1431

 

>AP004000.1b $P CYP71X9P chromosome 2 clone OJ1115_B01 pseudogene fragment

3 aa diffs with AAAA01030108.1

101503 RVVASSTDAACREFTKTHDVKFATRPWSSTVRVLMADGLG 101393

 

#414

>aaaa01025401.1 CYP71X10 (indica cultivar-group) orth AP004000.1c $F chr 2 98%

2324 KPRLPPGPWRLPVIGNLHQVAMGGPLVHRTMADLARRLDAPLMSLRLGELRVVVASSADA 2145

2144 AREITKTHDVAFATRPWSPTIRVLMSDGVGLVFAPYGALWRQLRKIAMVELLSARRVQSF 1965

1964 RRIREDEVGRLVADVAAAQPGEAVNVSERITALISDSAVRTIMGDRFEK 1818

917 LFGAGSETSASTLHWAMTELIMNPKVI 837

832 DELSNVIKGKQTISEDDLVELRYLKLVIKETLRLHPVVPLLLPRECRETCEVMGY 659

658 DIPIGTTMLVNVWAIGRDPKYWEDAETFRPERFEDGHIDFKGTDFEFIPFGAGRRKCPGM 479

478 AFAEAIMELVLASLLYHFDWELPDGISPTKVDMMEELGATIRKKNDLYLVPTV 320

 

>AP004000.1c $F CYP71X10 chromosome 2 clone OJ1115_B01

109813 MAMVQYVTGYLCLLSLALLLLTLVLHKVARKATGNGAGKPRLPPGPWRL

       PVIGNLHQVAMGGPLVHRTMADLA 109595

109594 RRHDAPLMSLRLGELRVVVASSADAAREITKTHDVAFATRPWSSTIRVMMSDGVGLVFAP 109415

109414 YGALWRQLRKIAVVELLSARRVQSFRRIREDEVCRLVAAVAAAQPGEAVNV 109262

109261 SERITALISDSAVRTIMGDRFEKRDEFLEGLAEGDRIASGFSLGDLFPSSRLASFVGGTT 109082

109081 RRAEANHRKNFGLIECALRQHEERRAAGAVDDDEDLVDVLLRVQKEGSLQVPLTMGNIKAVIL 108893 (0)

107479 ELFGAGSETSASTLHWAMTELIMNPKVMLKAQDELSNVIKGKQTISEDDLVELRYLKL 107306

107305 VIKETLRLHPVVPLLLPRECRETCEVMGYDIPIGTTMLVNVWAIGRDPKYWEDAETFRPE 107126

107125 RFEDGHIDFKGTDFEFIPFGAGRRMCPGMAFAEAIMELVLASLLYHFDWELPDGISPTK 106949

106948 VDMMEELGATIRKKNDLYLVPTVRVPMSTAL* 106853

 

#106

>aaaa01002996.1a $FI CYP71X11 (indica cultivar-group) ortholog of AP004000.1d >99%

13279 MEEPTSCYGYYHYLALAVAVLVLVRVTRTRGGGSDGVRLPPGPWRLPVIGSLHHLAGKPL 13458

13459 VHRALADLARRMDAPLMYLRLGEVPVVVATSPGAAREVMRTHDVAFATRPVSPTVRIMTA 13638

13639 DGEGLVFAPYGALWRQLRRIAILELLSARRVQSFRRVREEEAARLAAAVAAAAPHGEAAV 13818

13819 NVSERIAVLIADSAVRAMIGDRFKKRDEFLEALAEGLKLVSGFSLADLFPSSWLASFVTG 13998

13999 AARRAQENHRKNFELMDRAIEQHQERRAAAAAASGDVVEDDDLVDVLLRIQKGGGLDVPL 14178

14179 TMGIIKAVIL 14208

14345 DLFSAGSETSATTIQWAMSELMRNPRVMKRAQAELRDNLQGKPKVTEEDLADLNYLKLII 14524

14525 KETLRLHLPAPLLLPRESRESCKIFGYDVPKGTTVFVNAWAIGRDPKYWDDPEEFKPERF 14704

14705 EDSKIDFKGLDFEFLPFGSGRRMCPGIMFAQPNIELALATLLYHFDWSLPAGVKPSELDM 14884

14885 TEEMGITVRRKNDLYLHAVVRVPLHATTP 14971

 

>AP004000.1d $F CYP71X11 chromosome 2 clone OJ1115_B01

123850 MEEPTSCYGYYHYLALAVAVLVLVRVTRTRGGGSDGVRLPPGPWRLPVIGSLHHLAGKPLVHRALAD 124050

124051 LARRMDAPLMYLRLGEVPVVVATSPGAAREVMRTHDVAFATRPVSPTVRIMTADGEGLVF 124230

124231 APYGALWRQLRRIAILELLSARRVQSFRRVREEEAARLAAAVAAAAPHGEAAV 124389

124390 NVSERIAVLIADSAVRAMIGDRFKKRDEFLEALAEGLKLVSGFSLADLFPSSWLASFVTG 124569

124570 AARRAQENHRKNFELMDRAIEQHQERRAAAAAASGDVVEDDDLVDVLLRIQKGGGLDVPL 124749

124750 TMGIIKAVIL 124779 (0)

124917 DLFSAGSETSATTIQWAMSELMRNPRVMKRAQAELRDNLQGKPKVTEEDLTDLNYLKL 125090

125091 IIKETLRLHLPAPLLLPRESRESCKIFGYDVPKGTTVLVNAWAIGRDPKYWDDPEEFKPE 125270

125271 RFEDSKIDFKGLDFEFLPFGSGRRMCPGIMFAQPNIELALATLLYHFDWSLPAGVKPSE 125447

125448 LDMTEEMGITVRRKNDLYLHAVVRVPLHATTP* 125546

 

#107

>aaaa01002996.1b $FI CYP71X12 (indica cultivar-group) ortholog of AP004000.1e 99%

16497 MAQDVAEYLSIFLALVVAPLLLLRVARRARGNGAGRPRLPPGPWRLPVIGSLHHLMGKPQ 16676

16677 VHRAMADLARRHGAPLMYLRLGEVPFVVASSPDAAREVLRAQDANFASRPWSPTLRVMMA 16856

16857 DGKGLTFARHGAHWRLRKICVLELLGPRRVRSFRRVREEEVARLLAAVAAAAAAGADAVV 17036

17037 NVSERAAVLVTDTTVRAMIGDRFEMRDEYLEGVAEVGKLLLGLSLGDLFP 17186

17187 SSRLASLVSGTARRAAASHRKMFELMDCAIRHHQERKAAMDADEDILDVLLRMQKEGGHD 17366

17367 APLTMGDVKDTIL 17405

17534 DLFAAGTETSTATLQWAMSEVVRNPRIMQKAQAELRNKLQGKPSVTEDDLVGLTYLKLVI 17713

17714 KETLRLHPAAPMLVPRECGESCKVLGYDVPRGTNVLINAWAIGRDPNYWDDTETFKPDRC 17893

17894 ENNKYDFRGTDFEYIPFGSRRKICPCPAFTHAILELALAALLYHFDWELPCGVAQ 18058 frameshift

18055 SGEVDMAEETGVVVRPKNDLYLRPVVRVPPGAASSGNGGT 18174

 

>AP004000.1e $F CYP71X12 chromosome 2 clone OJ1115_B01

AP004066.1 chromosome 2 clone OJ1572_F02, 55% to 71B17 aa 342-511 runs off beginning

contig of AA751324 and AQ327456 54% IDENTICAL TO 71B24   1/98 K-HELIX

58% identical to AQ328148

127153 MAQDVAEYLSIFLALVVAPLLLLRVARRARGNGAGRPRLPPGPWRLPVIGSLHHLMGKPHVHRAMA 127350

127351 DLARRHGAPLMYLRLGEVPFVVASSPDAAREVLRAQDANFASRPWSPTLRVMMADGEGLA 127530

127531 FARHGAHWRLRKICVLELLGPRRVRSFRRVREEEVARLLAAVAAAAAAGADAV 127689

127690 VNVSERAAVLVTDTX 127731 frameshift

127734 VRAMIGDRFEMRDEYLEGVAEVGKLLLGLSLGDLFPSSRLASLVSGTARRAAASHRKMFE 127913

127914 LMDCAIRHHQERKAAMDADEDILDVLLRIQKEGGHDAPLTMGDVKDTIL 128060 (0)

128189 DLFAAGTETSTATLQWAMSEVVRNPRIMQKAQAELRNKLQGKPSVTEDDLVGLTYLKL 128362

128363 VIKETLRLHPAAPMLVPRECGESCKVLGYDVPRGTNVLINAWAIGRDPNYWDDTETFKPD 128542

128543 RCENNKYNFRGTDFEYIPFGSRRKICPGPAFTHAILELALAALLYHFDWELPCGVAPGE 128719

128720 VDMAEETGVVVRPKNDLYLRPVVRVPPGAASSGNGGT* 128833

 

#96

>aaaa01002645.1a $PI CYP71X13P (indica cultivar-group) sequence gap at 950 sequence

similarity stops at 236 (80% identical to AAAA01002645.1b)

ortholog to AP005385.1b $P 99%

949 PLMRLQLGGVPVVVASSPDAAREVTCTHDAAFASRPWPPTVRRLRPHREGVVFATYGAM 773

772 WRQLRKVCIVEMLSARRVRSFRRVREEEAASLAAAVAASLSSPPARRDAVNVSALVALAV 593

592 ADATMRVVIGDRLERREEFLESMTEAVRSFTGFSLDDLFPSSRIAAAVGGMTRRAEASHR 413

412 KGNELIESAIRQHEQVRDAMAAQGGGGAMEEDLLDTLLRIQKEGALDMPLTMDNIKAVI 236

 

>AP005385.1b $P CYP71X13P (japonica cultivar-group) chr 2 = aaaa01012992.1

146254 MDQVACWSICAFLALLLLVRIGGKRGRGGDGARLRQPPPGPWRLPVIGNLHQLMLRGP 146427

146428 LVHRTMADLARGLDDAPLMRLQLGGVPVVVASSPDAAREVTCTHDAAFASRPWPPTVRRL 146607

146608 RPHREGVVFAPYGAMWRQLRKVCIVEMLSARRVRSFRRVREEEAANLAAAVAASLSSPPA 146787

146788 RRDAVNVSALVAAAVADATMRVVIGDRLERREEFLESMTEAVRSFTGFSLDDLFP 146952

146953 SSRIAAAVGGMTRRAEASHRKGNELIESAIRQHEQVRDAMAAQGGGGAMEEDLLDTLL 147126

147127 RIQKEGALDMPLTMDNIKAVI 147189

147557 DIFGAGSDTSSNIIQW

       FS and Small deletion 19aa

147612 RNTLQGKHPVKEDDLVNIKYLKLIIKETLRLHPVVPLLLPRECLHACKVMGYDVPKGTTV 147791

147792 FVNIWAINRDPKHWDDPEVFKPERFDDGKIDFKGANFEYIPFGAGRRSCPGVTFGHATVE 147971

147972 LMLATLLYHFKWELLEGVAPNELDMTEEIGINVGRKNPLWLCPIVRVPLQ* 148124

 

#96

>aaaa01012992.1 CYP71X13P (indica cultivar-group) 80% to AAAA01002645.1b

runs off end of clone (partialI) orth of AP005385.1b

see aaaa01002645.1a for ortholog

175 DIFGAGSDTSSNIIQWAMSELMRNPKVMQKAQVELRNTLQGKHPVKEDDLVNIKYLKL 348

349 IIKETLRLHPMVPLLLPRECLHACKVMGYDVPKGTTVFVNIWAINRDPKHWDDPEVFKPE 528

529 RFDDGKIDFKGANFEYIPFGAGRRSCPGVTFGHATVELMLATLLYHFKWELLEGVAPNEL 708

709 DMT 717 (fs)

717 EEIGINVGRKNPLWLCPIVRVPLQ* 791

 

#97

>aaaa01002645.1b $FI CYP71X14 (indica cultivar-group) no introns 40% to 71B23

ortholog to AP005385.1a $F 99%

2048 MEQVAYWFICACAFLALVLLVRLGAARRDVVRLPPGPWRLPVVGNLHQLMLRGPLVHRTMADLA 2239

2240 RGLDDAPLMRLQLGGVPVVVASSADAAREVTRTHDLDFASRPWPPTVRRLRPHREGVVF 2416

2417 APYGAMWRQLRKVCVVEMLSARRVRSFRRVREEEAARLVASIASSSSSSPTGHDGGAAP 2593

2594 AVNVSAPIAAAVADATMRAVIGDRFERREEFLESITEAVRSFTGFSLDDLFPSSRLAAAV 2773

2774 GGMTRRAEASIRKGHQLMDSAFRQHQQLRDAMAAQPHLDDCAMEEDLLDTLLRIQKEDNL 2953

2954 DVPLTTGNIKAVLLDIF 3133

3134 GARSDTSSHMVQWVLSELMRNPEAMHKAQTELRSTLQGKQMVSEDDFASLTYLKLVIKET 3313

3314 LRLHPMVPLLLPRECRQTCKVMGYDVPQGTTVFVNVWAINRDPRHWDEPEVFKPERFHSG 3493

3494 KIDFKGANFEYIPFGAGRRICPGMTFGHATVELMLAMLLYHFDWELPKGVAPNELDMTEE 3673

3674 MGITVGRKNALYLHPIVRVSLEQASMS* 3757

 

>AP005385.1a CYP71X14 (japonica cultivar-group) chr 2

142591 MEQVAYWFICACAFLALVLLVRLGAARRDVVRLPPGPWRLPVVGNLHQLMLRGPLVHRTM 142770

142771 ADLARGLDDAPLMRLQLGGVPVVVASSADAAREVTRTHDLDFASRPWPPTVRRLRPHREG 142950

142951 VVFAPYGAMWRQLRKVCVVEMLSARRVRSFRRVREEEAARLVASIASSSSSSPTGHDGGA 143130

143131 APAVNVSAPIAAAVADATMRAVIGDRFERREEFLESITEAVRSFTGFSLDDLFPSSRLAA 143310

143311 AVGGMTRRAEASIRKGHQLMDSAFRQHQQLRDAMAAQPHLDDCAMEEDLLDTLLRIQKED 143490

143491 NLDVPLTTGNIKAVLL (0)

143668 DIFGAGSDTSSHMVQWVLSELMRNPEAMHKAQIELRSTLQGKQMVSEDDLASLTYLKLVIK 143850

143851 ETLRLHPVVPLLLPRECRQTCKVMGYDVPQGTTVFVNVWAINRDPRHWDEPEVFKPERFH 144030

144031 SGKIDFKGANFEYIPFGAGRRICPGMTFGHATVELMLAMLLYHFDWELPKGVAPNELDMT 144210

144211 EEMGITVGRKNALYLHPIVRVPLEQATMS 144297

 

#237

>aaaa01008333.1a $FI CYP71X15 (indica cultivar-group) very similar to AP003990.1

6914 MAMVQDATGYLSLFLALLSITLVLHKVARKASGDGAGKPRLPPGPWRLPVIGNLHQIAMGG 6732

6731 PLVHRTMADLARRHDAPLMSLRLGEVPVVVASSADAAREITKTHDVAFATRPWSSTVRVL 6552

6551 MSDGVGLVFAPYGALWRQLRKIAMVELLSARRVQSFRGIREDEVGRLVAAVAAASAAQ 6378

6377 PGEAVNVSERIAVLIADSVVRALMGDRFDRRDEFLDQLAERVKITSGFSLGDLFPSSRL 6201

6200 ASFIGGTTRRAEANHRKNFELIECALRQHEERRAARAGAAAAGAVDDDEDLVDVLLRIQK 6021

6020 EGKLEVPLTMGNINAVIY (0) 5967

5378 DLFGAGSETSANTLQWVMSELILNPRVMLKLQAELRGILQGKQRVTEDDLVELKYLKLVI 5199

5198 KETLRLHPVVPLLLARECQDTCKIMGYDIPVGTIVFVNVWVICRESKYWKDAETFRPERF 5019

5018 ENVCVDFKGTHFEYIPFGAGRRMCP 4944

4945 PGVAFAEASMELVLASLLYHFDWKLPNDILPTKLDMTEEMGLSIRRKNDLYLIPTICVPPLAA* 4754

 

no japonica ortholog on 9/7/02

 

#238

>aaaa01008333.1b CYP71X16 (indica cultivar-group) runs off end of clone (partialI)

like AP004000 exon 2

11117 EMFGAGSETSANTLQWLMSELILNPRVMSKAQVELSDTLRGKQTVTEDDLAGLKYLKLII 10938

10937 KENLRLHPVVPLLLPRECQKTCKVMMYDVPVGTTVLVNVWSINRDPKYWEDPETFKPERF 10758

10757 EDGHIDFKGTDFEFIPFGAGRRMCPGITFAEAIMELALASLLYHFDWKLLGNGISSTKLD 10578

10577 MTEELGATVRRKNDLYLVPTIRVPLPADS* 10488

 

no japonica ortholog on 12/24/03 NR, EST, HTGS

 

#68

>aaaa01001712.1 $PI CYP71X17P (indica cultivar-group) missing C-terminal exon

not found in 20000bp of seq.

6693 MAMAQDVTGYLCLFVALLVLLKVVRKASGNGAAGRLRLPPGPWRLPVIGNLHQVAMGG 6866

6867 PLVHRTMADMARRLDAPLMSLRLGEIPVVVASSADAAREITKTHDVAFATRPLSSTIRVM 7046

7047 VSDGEGLVFTPYGALWRRLRKIAMLELLSARRVQSFRRVREEEVGRLVAAVAAAAAAR 7220

7221 PGEAVNLSQLIAELISDTAARTIIGDRFEKRQELLEGLTEGIRISSGFSLGDLFPSSRL 7397

7398 ANLIGGTTRRAEANHRKNLALIECALRQHEERRAAGDEEDDEDLVDVLLRVQKEGG 7565

7566 GEVPLTMGNVKVVIR (0) 7610

 

aaaa01001712.1 $PI no ortholog yet, no match in nr or HTGS 9/5/02

 

#413

>aaaa01025223.1 CYP71Y1 (indica cultivar-group) orth? AP003571.1g $F chr 6 95%

1835 QRLPPGPWMLPAIGSLHHLAGKLPHRAMRDLARRHGPVMLLRIGEVPTLVVSSRDAAREV 1656

1655 MKTHDTAFATRPLSATLRVLTNGGRDLVFAPYGDYWRQVRKIAVTELLTARRVHSFRSIR 1476

1475 EEEVAAVLRAVAVAAGTVEMRAALSALVSDITARTVFGNRCKDRGEFLFL 1326

1325 LDRTIEFAGGFNPADLWPSSRLAGRLSGVVRRAEECRNSVYKILDGIIQEHQER 1164

1163 TGAGGEDLVDVLLRIQKEGELQFPLAMDDIKSII 1062

991 QDIFSAGSETSATTLAWAMAELIRNPTAMHKATPEVRRAFAAAGAVSEDALGELPYLH 818

817 LVIRETLRLHPPLPLLLPRECREPCRVLGYDVPRGTQVLVNAWAIGRDERCWPGGSPEEF 638

637 RPERF 623

588 RGADFEFLPFGAGRRMCPGMAFGLANVELPLASLLFHFDWEVPGLADPAKLDMTEAFGI 412

411 TARRKADLHLRPCL 370

 

>AP003571.1g $F CYP71Y1 chromosome 6 clone P0458E02

139036 MEDATHGYVYVGLALVSLFVVLLARRRRSPPPAAHGDGGLRLPPGPWTLPIIGSLHHLVGQIP 139224

139225 HRAMRDLARRHGPVMLLRIGEVPTLVVSSRDAAREVTKTHDTAFAMRPLSATLRVLTN 139398

139399 GGRDLVFAPYGDYWRQVRKIAVTELLTARRVHSFRSIREEEVAALLRAVAVA 139554

139555 AGTVEMRAALSALVSDITARTVFDNRCKDRGEF

       LVLLERTIEFAGGFNPADLWPS 139719 (?) bad exon boundary

140222 SRLAGRLSSVVRRAEECRNSVYKILDGIIQEHQERTSAGGEDLVDVLLRIQKEGG 140386

140387 LQFPLAMDDIKSIIF 140428 (0)

       DIFSAGSETSATTLAWAMAELIRNPTAMHKVMAEVRRAFAAAGAVSEDALGE 140655

140656 LRYLQLVIRETLRLHPPLPLLLPRECREPCRVLGYDVTRGTQVLVNAWAIGLDERYWPGG 140835

140836 SPEEFRPERFEDGEATAAVDFRGTDFEFLPFGAGRRMCPGMAFGLANVELPLASLLFHFD 141015

141016 WEVPGLADPAKLDMTEAFGITARRKADLHLRPCLLVSVPGV* 141141

 

#474

>aaaa01101459.1 CYP71Y2P (indica cultivar-group) orth AP003571.1f $P chr 6 97%

463 LDFRGADFELLPFGXARRMCPGMAFGLANVELPLSSLLFHFDWEVPGMADPTKLDMTEAF 284

283 GITSRRKENLHLRPLL 236

 

>AP003571.1f $P CYP71Y2P chromosome 6 clone P0458E02 pseudogene fragment

136961 VSEDALGELRYLQLVIRETLRLHPPLPLLLPRECTIGR 137074

137075 DERYWPGGSPEEFRPERFDDGEATAAVDFRGADFELLPFGGGRRMCPGMAFGLANVELPL 137254

137255 SSLLFHFDWEVPGMADPTKLDMTEAFGITSRRKENLHLRPLLRVSVPG 137398

 

#215

>aaaa01007286.1 CYP71Y3 (indica cultivar-group) orth AP003571.1e $F chr 6 100%

4839 YFYLGLALASLLVVLFARRRRSAAHGDGGLRLPPGPWQLPVIGSLHHLAGKLPHRAMRDL 4660

4659 ARRHGPVMMLRLGEVPTLVVSSRDAAREVMRAHDAAFASRPLSATVRVLTSGGRGIIFAP 4480

4479 YGGSWRQLRKIAVTELLTARRVASFRAIREEEVAAMLRAVAAAAAAGRAVELRAALSALV 4300

4299 AETTVRAVIGDRCKDRDVFLRKLQRTIELSAGFNPADLWPSSRLAGRLGG 4150

4149 AVREAEECHDTVYGILDGIIQEHMERTSSGSCGAGDGDGDGEDLLDVLL 4003

 

>AP003571.1e $F CYP71Y3 chromosome 6 clone P0458E02

128926 MADDYFYLGLALASLLVVLFARRRRSAAHGDGGLRLPPGPWQLPVIGSLHHLAGKLPHRAMRDLA 129120

129121 RRHGPVMMLRLGEVPTLVVSSRDAAREVMRAHDAAFASRPLSATVRVLTSGGRGIIFAP 129297

129298 YGGSWRQLRKIAVTELLTARRVASFRAIREEEVAAMLRAVAAAAAAGRAVELRAA 129462

129463 LSALVAETTVRAVIGDRCKDRDVFLRKLQRTIELSAGFNPADLWPSSRLAGRLGGA 129630

129631 VREAEECHDTVYGILDGIIQEHMERTSSGSCGAGDGDGDGEDLLDVLLRIQKEGGLEFPV 129810

129811 DMLAIKQVIF 129837 (0)

132964 DIFGAGSETSATTLEWVMAELIRNPKAMRKATAEVRRAFAADGVVLESALGKLHYMHLVI 133143

133144 RETFRLHTPLPLLLPRECREPCRVLGYDVPRGTQVLVNVWAIGRDERYWPGGSPEEFRPE 133323

133324 RFEDGEAAAAVDFRGADFELLPFGAGRRMCPGLAFGLANVELALASLLFHFDWEAPDVAD 133503

133504 PAEFDMTEGFGITARRKADLPLRPTLRVPVLVSVG* 133611

 

#215

>aaaa01048884.1 CYP71Y3 (indica cultivar-group) orth AP003571.1e $F chr 6

97% 3 diffs see aaaa01007286.1 for ortholog

1024 VRLHTPLPLLLPRECREPCRVLGYDVPRGSQVLVNVWAIGRDERYWPGGSPEEFRPERFE 845

844  DGEAAAAVDLRGADFELLPFGAGRRMCPGLAFGLANVELALASLLFHFDWEAPDVADPAE 665

664  FDMTEGFGITARRKANLPLRPTL 596

 

#215

>aaaa01040160.1 CYP71Y3 (indica cultivar-group) orth AP003571.1e $F chr 6 96%

see aaaa01007286.1 for ortholog

667 MQDIFGAGSETSATTLEWVMAELIRNPKAMRKATAEVRRAFAANGVVSESALGKLHYL 840

841 HLVIRETFRLHTPLPLLLPRECREPCRVLGYDVPRGSQVLVNVWAIGRDERYWPGGSPEE 1020

1021 FRPERFED 1044

1064 LDFRGADFELLPFGAGRWMCPGFGVRARQRG 1156

 

#301

>aaaa01012291.1 CYP71Y4 (indica cultivar-group) orth AP003571.1d $F chr 6 99%

6215 DAGLRLPPGPWQLPVIGSLHHLAGKLPHRAMRDLARRHGPVMMLRLGEVPTLVVSSRDAA 6394

6395 REVMRTHDAAFASRPLSASVRAATKGGRDIAFAPYGDYWRQLRKIAVTELLSARRVLSFR 6574

6575 PIREEEVGRSPATLQPGQHAASGRTVELRAALCALVADSTVRAVVGERCAGLDVF 6739

6740 LRQLDRAIELAAGLNVADLWPSSRLAGRPSQRRRAPGREVRDTMFGVLDGII 6895

6896 QAHLEKTGGAGEDILDVLLRIHKEGGLEFPLDMDAVKCV

     DVISGGCETSATTLGWAFAELIRNPAAMK 7249

7250 KATAEVRRDFEAAGAVSESALSVGELPYLRLVVRETLRLHPPLPLLLPRECREPCRVLGY 7429

7430 DVPRGAQVLVNAWAIGRDERYWPGGSPEEFRPERFGDGEAAAAVDFKGADFELLPFGGGR 7609

7610 RMCPGMAFGLANVELPLASLLFHFDWEASGVADPTEFDMTEAFGITARRKANLLLRPIL 7786

 

>AP003571.1d $F  CYP71Y4 chromosome 6 clone P0458E02

118418 MADGYFYLGLALVSLLVVLFARRRRSAAAAHGDAGLRLPPGPWQLPVIGSLHHLAGKLPHRAMRDLA 118618

118619 RRHGPVMMLRLGEVPTLVVSSRDAAREVMRTHDAAFASRPLSASVRAATKGGRDIAFAP 118795

118796 YGDYWRQLRKIAVTELLSARRVLSFRPIREEEVAATLRAVAAAAADGRTVELRAA 118960

118961 LCALVADSTVRAVVGERCAGLDVFLRQLDRAIELAAGLNVADLWPSSRLAGRLSGAVRQ 119137

119138 AERCRDTMFGVLDGIIQAHLEKTGGAGEDILDVLLRIHKEGGLEFPLDMDA 119290

119291 VKCVVV 119308 (0)

119451 DVISGGCETSATTLGWAFAELIRNPAAMKKATAEVRRDFEAAGAVSESALAVGELPYLRL 119630

119631 VVRETLRLHPPLPLLLPRECREPCRVLGYDVPRGAQVLVNAWAIGRDERYWPGGSPEEFR 119810

119811 PERFGDGEAAAAVDFKGADFELLPFGGGRRMCPGMAFGFANVELPLASLLFHIDWEASGV 119990

119991 ADPTEFDMTEAFGITARRKANLLLRPILRVPVPGV* 120098

 

#390

>aaaa01020516.1 CYP71Y5 (indica cultivar-group) orth AP003571.1c $F chr 6 95%

3   IVFAPYGDYWRQLRKITVTELLSARRVASFRAIREEEVAAMLRAVAASAAAGRAVEMRPL 182

183 LSALVSDSTVRAVMGDQFPHRDVFLRELDRSIELVAGFNPADLWPSSRLAGCLT 344

345 GTMRQAKKCWDTMSSVLESTIQEHLQKNGSSGGGAGATDEDLIDVLLRIQKEGGLQFP 518

519 FDMDVIKSVI 548

 

>AP003571.1c $F CYP71Y5 chromosome 6 clone P0458E02

AQ328148 49% identical to C72289 58% to AQ327456 61% to AQ259669

56% to 71B3

106935 MADLHTYLYLGLALVSLLAVQLARRRRSSAAHGSGALRLPPGPWQLPVIGSLHHLVGKL 107111

107112 PHQAMRDLARRHGPVMMLRLGEVPTLVVSSPEAAREVTKTHDVSFATRPLSSTTRVFS 107285

107286 NGGRDIVFAPYGDYWRQLRKITVTELLSARRVASFRAIREEEVAAMLRAVGGYAA 107450

107451 AGCAVEIRPLLAALVSDSTVRAVMGDRFPHRDVFLRELDRSIELTAGFNPADLWPSSRL 107627

107628 AGCLTGTIRQAKKCWDTMSSVLESTIQEHLQKNGSSGGGAGATDEDLIDVLLRIQKEGGL 107807

107808 QFPFDMDVIKSVIH 107846 (0)

110756 NVFGAGSETSATTLGWAIAELIRNPMAMKKATAEVRQAFAAAGVVSEAALSELRYLHLVI 110935

110936 KETLRLHPPGPLLLPRECREQCKVLGYDVPRGTQVLVNVWAIGRDPRYWPGGSPEEFRPE 111115

111116 RFGDGEPAAALDFKGTDYELLPFGAGRRMCPGLAFGLANVELPLASLLFHFDWEVPGMAD 111295

111296 PTKLDMTEAFGIGVRRKADLIIRPILRVPVPGV* 111397

 

#390

>aaaa01021346.1 CYP71Y5 (indica cultivar-group) orth AP003571.1c $F chr 6 98%

1 diff see aaaa01020516.1

158 LPPGPWQLPIIGSLHHLVGKLPHQAMRDLARRHGPVMMLRLGEVPTLVVSS 6

 

#390

>aaaa01083019.1 CYP71Y5 (indica cultivar-group) orth AP003571.1c $F chr 6 99%

see aaaa01020516.1 for ortholog

501 MQNVFGAGSETSATTLGWAIAELIRNPMAMKKATAEVRQAFAAAGVVSEAALSELRYL 328

327 HLVIKETLRLHPPGPLLLPRECREQCKVLGYDVPRGTQVLVNVWAIGRDPRYWPGGSPEE 148

147 FRPERFGDGEPAAA*DFKGTDYELLTFGAGRRMCPGLAFGLANVELPL 4

 

#390

>aaaa01032282.1 CYP71Y5 (indica cultivar-group) orth AP003571.1c $F chr 6

100% see aaaa01020516.1 for ortholog

297 NVFGAGSETSATTLGWAIAELIRNPMAMKKATAEVRQAFAAAGVVSEAALSELRYL 470

471 HLVIKETLRLHPPGPLLLPRECREQCKVLGYDVPRGTQVLVNVWAIGRDPRYWPGGSPEE 650

651 FRPERFGDGEPAAALDFKGTDYELLPFGAGRRMCPGLAFGLANVELPLASLLFHFDWEVP 830

831 GMADPTKLDMTEAFGIGVRRKADLIIRPIL 920

 

#390

>aaaa01032612.1 CYP71Y5 (indica cultivar-group) orth AP003571.1c $F chr 6 100%

see aaaa01020516.1 for ortholog

1464 LPPGPWQLPVIGSLHHLVGKLPHQAMRDLARRHGPVMMLRLGEVPTLVVSSPEAAREVTK 1643

1644 THDVSFATR 1670
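
The numbers flanking each segment line appear to be nucleotide coordinates on the genomic clone, so a segment of n residues should span 3n nucleotides, and a start greater than the end suggests the reverse strand. The sketch below checks that for plain "start SEQ end" lines. It is a minimal illustration under that assumption; the names `SEGMENT` and `check_segment` are hypothetical, and lines carrying extra notes such as "(0)", "(fs)" or "frameshift" are skipped rather than interpreted.

```python
import re

# start coordinate, amino-acid segment (may contain * for stops), end coordinate
SEGMENT = re.compile(r"^\s*(\d+)\s+([A-Z*]+)\s+(\d+)\s*$")

def check_segment(line: str):
    """Return (strand, span_is_consistent) or None if the line is not a plain segment."""
    m = SEGMENT.match(line)
    if m is None:
        return None
    start, seq, end = int(m.group(1)), m.group(2), int(m.group(3))
    strand = "+" if end >= start else "-"        # descending coordinates: reverse strand
    span = abs(end - start) + 1                  # inclusive nucleotide span
    return strand, span == 3 * len(seq)

# Lines copied from the CYP71Y5 entries above.
fwd = "1464 LPPGPWQLPVIGSLHHLVGKLPHQAMRDLARRHGPVMMLRLGEVPTLVVSSPEAAREVTK 1643"
rev = "158 LPPGPWQLPIIGSLHHLVGKLPHQAMRDLARRHGPVMMLRLGEVPTLVVSS 6"

print(check_segment(fwd))
print(check_segment(rev))
```

A segment that fails the check is not necessarily wrong, since frameshifts, gaps and hand-joined exons all break the 3n relationship; the check is only a quick way to flag lines worth a second look.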

 

#444

>aaaa01053818.1 CYP71Y6 (indica cultivar-group) orth AP003571.1b $F chr 6 100%

600 LLTKRSRKATAQRLPPGPWQLPVIGSLHHLAGKLPHHAMRDLARRHGPVMMLRLGEVPTL 779

780 VVSSPEAAQEVMRTHDAVFATRALSATVRAATMGGRDIAFAPYGDRWRQLRKIAATQLLS 959

960 ARRVASF 980

 

>AP003571.1b $F CYP71Y6 chromosome 6 clone P0458E02

80945 MEDASHGYVYLAMAVVALLGVLLTKRSRKATAQRLPPGPWQLPVIGSLHHLAGKLPHHAMRDLARRHG 81148

81149 PVMMLRLGEVPTLVVSSPEAAQEVMRTHDAVFATRALSATVRAATMGGRDIAFAPYGDR 81325

81326 WRQLRKIAATQLLSARRVASFRAIREEEVATMLRAVAAAAADGRAVEMRAALCVV 81490

81491 VADSTARAMVGESCQERDAFLREIDRSMELVSGFNPEDLWPSSRLAGRLSGAVRKIEAS 81667

81668 LHTVLGILDRIIQKRLQEKIGGAGAAAASEDILDVLLRIHKDGGAGGLQVPLDMDDITLV 81847

81848 IT 81853 (0)

83437 SMQLQNAHACALFLTIVSSTYSYSLFNDPPSLHMQDLFSGGGETVATLLVWAMAELIRN 83613

83614 PMAMQKATAEVRRAFALPGVVSEGEGALGELRYLHLVIRETFRLHPPGPLLLPRECSEPC 83793

83794 QVLGYDVPRGTQVLVNVWAIGRDERCWPAAAGGGSPEEFWPERFEDGAEAVDLRGNNFEL 83973

83974 LPFGAGRRMCPGVAFALANIELTLASLLFHFDWEVPGMADPAKLDMAEALGITARRKGDL 84153

84154 LLRPVLRMPVPGV* 84195

 

#444

>aaaa01076398.1 CYP71Y6 (indica cultivar-group) orth AP003571.1b $F chr 6

99% see aaaa01053818.1 for ortholog

725 QLLSARRVASFRAIREEEVATMLRAVAAAAADGRAVEMRAALCVVVADSTARAMVGESCQ 546

545 ERDAFLREIDRSMELVSGFNPEDLWPSSRLAGRLSGAVRKIEASLHTVLGILDRIIQKRL 366

365 QEKIGGAGAAAASEDILDVLLRIHKDGGAGGLQVPLDMDDITLVITVSDQL 213

 

#444

>aaaa01098934.1 CYP71Y6 (indica cultivar-group) orth AP003571.1b $F chr 6 99%

see aaaa01053818.1 for ortholog

131 MQDLFSGGGETVATLLVWAMAELIRNPMAMQKATAEVRRAFALPGVVSEGEGALGELRYL 310

311 HFVIRETFRLHPPGPLLLPRECSEPCQVLGYDVPRGTQVLVNVWAIGRDERCWPAAAGGG 490

491 SPEEFWPERFED 526

 

#310

>aaaa01012578.1 CYP71Y7 (indica cultivar-group) orth AP003571.1a $F chr 6 99%

5308 LVALLGVLLTKRSRTATAQRRLPPGPWQLPVIGSLHHLIGKLPHHAMRDLTRRHGPVMML 5487

5488 RLGEVPTLVVSSPEAAQEVMRTHDAVFATRALSATVRAGTMGGRDIAFAPYGDYWRQLRK 5667

5668 IAATELLSAPRVASFRAIREEEVAATLRTVAAAAADGRAVELRAALCALVTDSTSRAVVG 5847

5848 DRCKESDALIRAFDRSMELASGFNPADLWPSSRLAGLLSGGVREIEANLHTVFGIL 6015

6016 DRLIEKRLQQKKTAPSSAAGEDILDALLRIHKEGGGLQFPLDMDSIKLII 6165

 

>AP003571.1a $F CYP71Y7 chromosome 6 clone P0458E02

67847 MADVLSQGYVYLAMALVALLGVLLTKCSRTATAQRRLPPGPWQLPVIGSLHHLIGKLPHHAMRDLTRRHG 68056

68057 PVMMLRLGEVPTLVVSSPEAAQEVMRTHDAVFATRALSATVRAGTMGGRDIAFAPYGDY 68233

68234 WRQLRKIAATELLSAPRVASFRAIREEEVAATLRTVAAAAADGRAVELRAALCAL 68398

68399 VTDSTSRAVVGDRCKESDALIRAFDRSMELASGFNPAADLWPSSRLAGLLSGGVREIEA 68575

68576 NLHTVFGILDRLIEKRLQQKKTAPSSAAGEDILDALLRIHKEGGGLQFPLDMDSIKLIIA 68755 (0)

73915 DLFSGGGETVATLLVWAMAELIRNPMAMQKATTEVRRAFALAGAVSEGKGALGELRYLHL 74094

74095 VIKEASRLHPPAPLLLPRECSEPCQVLGYDVPRGTQVLVNAWAIGRDERCWTGGSGDGSS 74274

74275 PEEFRPERFEDGAEAVDLRGNNFELLPFGAGRRMCPGMAFALANIELTLASLLFHFDWEV 74454

74455 PDMADPAKLDMTETLGITARRKGDLLLRPVLRMPVPGVY* 74574

 

#289

>aaaa01011521.1b $FI CYP71Y8 (indica cultivar-group)

6071 MADTSHGYVYIGLALVSLFVVLLDRRRRSPPPPAAH

6179 GDGGLRLPPGPWTLPIIGSLHHLVGKLPHHAMRDLARRHGPVMLLRIGQVPTLVVSSRDA 6358

6359 AREMMKTHDMAFATRPLSATLHVITCDGRDLVFAPYGDYWRQLRKIAVTELLTARRVNS 6535

6536 YRAIREEEVAAMLRAVAAAAEGSGAAAGTVEMRAALTALSTDITARAVFGNRCKDREEYL 6715

6716 AQVDHTIELTAGFNPADLWPSSRLAGRLSGIVRRAEECRDTAFKILDRIIQERLE 6880

6881 MARSDGAAGEYLIDVLLRIQKEGGLQFPLAMDDIKANIF (0) 6994

7066 DIFGAGSETSGTALAWAMAELIRNPTVMRKATAEVRRAFAAAGAVSEDGLGELPYLHLVI 7245

7246 RETFRLHPPLPLLLPRECREPCRLLGYDVPRGTQVLVNAWALGRDERYWPGGSPEEFRPE 7425

7426 RFEDGEATAAVNFRGADFEFLPFGGGRRMCPGIAFALATVELPLASLLFHFDWEVPGMAD 7605

7606 PTKLDMTEAFGITARRKADLHLRPLLRVSVPGV* 7707

 

no japonica ortholog found 9/10/02

 

#288

>aaaa01011521.1a $PI CYP71Y9P (indica cultivar-group)

1377 MADASDGYVYVG

1413 LAVVSLFVVLLAWRSRSPAAHGVGDGGLRLPPGPWTLPVIGSLHHLAGQLPHRAMRDLAR 1592

1593 RHGPLMLLRIGEVPTLVVSSRDAAREVMKTHDMAFATRPLSATLRVITCDGRDLVFAPY 1769

1770 GDYWRQVRKIAVTELLTVRRVSSFRSIREEEVAAVLRAVAAAAAVEEATPAMATVEMRAA 1949

1950 LSALVTDITARTAFGNRCKDREEYLVLLERIVEIAGGFNPADLWPSSRLAGRLKRCRAPR 2129

2130 RGVPQLGVILDGIIQEERTGAGSEDLVDVLLRIQKEGELQFPLAMDD 2270

2271 IKSIDIFNAGIETSGTTLQWAMAELIRNPTVM 2450

2451 HKATAEVRHAFAAAGDVSEDALGELRYLQL 2540 (deletion of about 104 aa)

2539 FDWEVPGMADLTKLDMTEAFGITARRKENLHLRPLLRVSVPAASS 2673

2674 RLRWTTTAFSICCHDTHLV*

 

no japonica ortholog found 9/10/02

 

#285

>aaaa01011369.1 CYP71Z1 (indica cultivar-group) orth AL606625.1 $F chr 4 99%

8999 LWFGEVGTVFASSPEAAREVLRSHDLAFADRHLTAAAAAFSFGGRDVVLSPYGERWRQLR 8820

8819 KLLTQELLTASRVRSFRRVREEEVARLMRDLSAAATAGAAVNLSEMVTRMVNDTVLRCSV 8640

8639 GSRCEHSGEYLAALHAVVRLTSGLSVADLFPS 8544

5576 KSLFQDMFAGGTDTSSTTLIWAMAELIRSPRVMAKVQSEMRQIFDGKNTITEDDLVQL 5403

5402 SYLKMVIKETLRLHCPLPLLAPRKCRETCKIMGYDVPKGTSAFVNVWAICRDSKYWEDAE 5223

5222 EFKPERFENNDIEFKGSNFEFLPFGSGRRVCPGINLGLANMEFALANLLYHFDWKLPNRM 5043

5042 LHKDLDMREAPGLLVYKHTSLNVCPVTH 4959

 

>AL606625.1 $F CYP71Z1 chromosome 4 clone OSJNBa0032I19 similar to 71B28 = AQ858445.1

AQ858445.1 nbeb0013M22r CUGI Rice BAC genomic Length = 824 54% to 71B23

82576 MGASILLVVVVSKLMISFAAKPRLNLPPGPWTLPLIGSIHHVVSSRESVHSAMRRLARRHGAPLM 82770

82771 QLWFGEVGTVVASSPEAAREVLRSHDLAFADRHLTAAAAAFSFGGRDVVLSPYGERWRQL 82950

82951 RKLLTQELLTASRVRSFRRVREEEVARLMRDLSAAATAGAAVNLSEMVTRMVNDTVLRCS 83130

83131 VGSRCEHSGEYLAALHAVVRLTSGLSVADLFP 83226

83227 SSRLAAMVSAAPRAAIANRDKMVRIIEQIIRERKAQIEADDRAADSKSC 83373

83374 ACSLDDLLRLQKEGGSPIPITNEVIVVLLM 83463 (0)

84970 DMFAGGTDTSSTTLIWAMAELIRSPRVMAKVQSEMRQIFDGKNTITEDDLVQLSY 85134

85135 LKMVIKETLRLHCPLPLLAPRKCRETCKIMGYDVPKGTSAFVNVWAICRDSKYWEDAEEF 85314

85315 KPERFENNDIEFKGSNFEFLPFGSGRRVCPGINLGLANMEFALANLLYHFDWKLPNGMLH 85494

85495 KDLDMREAPGLLVYKHTSLNVCPVTHIASSCA* 85593

 

#88

>aaaa01002274.1a CYP71Z2 (indica cultivar-group) ortholog of AP003805.1 $F 100%

a duplicate of 1b

2088 MEDKVLLAVGASLVLVFLSKLISSYAKKPRLNLPPGPWTLPLIGSAHHLVSWSESVHSVI 1909

1908 GKLAREHGPVMQLWLGEVPTVVASSPEAAQEILRDHDLIFADRHLTSTTAAITFGGTDVV 1729

1728 MAQYGERWRHLRKLLTQELLTVARVRSFRRVREEEVARLVRDLSAAAASGATVNLTDMVN 1549

1548 RLVNDTVLRCSVGSRCKYREEFLAALHAILHQTSALSVADLFPSSKLASMVATGPRNVLA 1369

1368 NRNKVERIIEEIIQERKNQIETDMMSGNDDVGDKAAVESKSCSLDVLLRLQKEGGTPIPI 1189

1188 TNQVITVLLW 1159

 

>aaaa01002274.1b CYP71Z2 (indica cultivar-group) ortholog of AP003805.1 $F 100%

duplicate of 1a count only once

22179 MEDKVLLAVGASLVLVFLSKLISSYAKKPRLNLPPGPWTLPLIGSAHHLVSWSESVHSVI 22358

22359 GKLAREHGPVMQLWLGEVPTVVASSPEAAQEILRDHDLIFADRHLTSTTAAITFGGTDVV 22538

22539 MAQYGERWRHLRKLLTQELLTVARVRSFRRVREEEVARLVRDLSAAAASGATVNLTDMVN 22718

22719 RLVNDTVLRCSVGSRCKYREEFLAALHAILHQTSALSVADLFPSSKLASMVATGPRNVLA 22898

22899 NRNKVERIIEEIIQERKNQIETDMMSGNDDVGDKAAVESKSCSLDVLLRLQKEGGTPIPI 23078

23079 TNQVITVLLW 23108

 

>AP003805.1 $F CYP71Z2 chromosome 7 clone OJ1080_F08, similar to AC087550.2

39% to 71B23

10416 MEDKVLLAVGASLVLVFLSKLISSYAKKPRLNLPPGPWTLPLIGSAHHLVSWSESVHSVIGKLAREHGPVMQ 10201

10200 LWLGEVPTVVASSPEAAQEILRDHDLIFADRHLTSTTAAITFGGTDVVMAQYGERWRHLR 10021

10020 KLLTQELLTVARVRSFRRVREEEVARLVRDLSAAAASGATVNLTDMVNRLVNDTVLRCSV 9841

9840  GSRCKYREEFLAALHAILHQTSALSVADLFPSSKLASMVATGPRNVLANRNKVERI 9673

9672  IEEIIQERKNQIETDMMSGNDDVGDKAAVESKSCSLDVLLRLQKEGGTPIPITNQVITVLLW (0) 9487

3785  DMFGAGTDTSSTTLIWTMAELMRSPRVMAKVQAEMRQAFQGKNTITEDDLAQLSYLKMVL 3606

3605  KESFRLHCPVPLLSPRKCRETCKIMGYDVPKGTSVFVNVWAICRDSMYWKNAEEFKPERF 3426

3425  EDNDIELKGSNFKFLPFGSGRRICPGINLGWANMEFALANLLYHFDWNLPDGMLHKDLDM 3246

3245  QESPGLVAAKCSDLNVCPVTHISSSCA* 3162

 

#13

>aaaa01000275.1 CYP71Z3 (indica cultivar-group) orth AC087550.2 $F chr 10, 100%

same as AAAA01002847.1 $FI see that accession below for ortholog

42371 MEDKRPLALTVLSVSVLIAVVISKLVSYATKPRLNLPPGPWKLPVIGSLHHLVGSHAIHRSMRALAE 42171

42170 KHGRHHLMQISLGEVFAVVVSSPEAAEEILR 42078

 

#13

>aaaa01002847.1a $FI CYP71Z3 (indica cultivar-group) ortholog to AC087550.2a 99%

also aaaa01000275.1 part

14507 MDDKLLQLLLLALAVSVVSSIVTISKLVYRATNKPRLNLPPGPWTLPVIGSLHHLVMRSP 14328

14327 SIHRSMRALAEKHGPLMQVWLGEVPAVVVSSTEAAEEVLKNQDARFADRFITTTLGAI 14154

14153 TFGGGDLAFAPYGERWRHLKMLCTQQLLTAARVRSFRRIREEEVARLVRDLAASAGGGSE 13974

13973 VAVNLSERVARLVNDIMVRCCVGGRSKHRDEFLGALCTALSQTSWLTVADLFPSSRLARM 13794

13793 LGTAPRRALASRKKMELILEQIIQEREEMTTDRSGDGEAGPTNECFLDVLLRLQK 13629

13628 EGDTPIPITMELIVMLLF 13575

12405 DIVSGGTETSTIVLNWTMAELIRTPRVMAKAHAEVRQTFQAKSTITEDDDISGL 12244

12243 TYLKMVIKESLRMHCPVPLLGPRRCRETCKVMGYDILKDTTVFVNVWAMCRSSIYWNDAE 12064

12063 EFKPERFENKCIDYKGSNFEFVPFGSGRRMCAGMNLGMADVEFPLASLLYHFDWKLPDGM 11884

11883 SPEDIDMQEAPGLFGGRRTSLILYPITRVAPSDLQVI 11773

 

>AC087550.2a $F CYP71Z3 chromosome 10 clone nbeb0016G17 74% to AC087554 seq 14167

132002 MDDKLLQLLLLALAVSVVSIVTISKLVYRATNKPRLNLPPGPWTLPVIGSLHHLVMRSPS 131823

131822 IHRSMRALAEKHGPLMQVWLGEVPAVVVSSTEAAEEVLKNQDARFADRFITTTLGAITF 131646

131645 GGGDLAFAPYGERWRHLKMLCTQQLLTAARVRSFRRIREEEVARLVRDLAASAAGGGEVA 131466

131465 VNLSERVARLVNDIMVRCCVGGRSKHRDEFLGALCTALSQTSWLTVADLFPSSRLARML 131289

131288 GTAPRRALASRKKMELILEQIIQEREEMTTDRSGDGEAGPTNECFLDVLLRLQK 131127

131126 EGDTPIPITMELIVMLLF 131073 (0)

       DIVSGGTETSTIVLNWTMAELIRTPRVMTKAHAEVRQTFQAKSTITEDDDISGL 129741

129740 TYLKMVIKESLRMHCPVPLLGPRRCRETCKVMGYDILKDTTVFVNAWAMCRSSIYWNDAE 129561

129560 EFKPERFENKCIDYKGSNFEFIPFGSGRRMCAGMNLGMADVEFPLASLLYHFDWKLPDGM 129381

129380 SPEDIDMQEAPGLFGGRRTSLILCPITRVAPSDLQVIV* 129264

 

#101

>aaaa01002847.1b $FI CYP71Z4 (indica cultivar-group) = aaaa01000275.1

ortholog to AC087550.2b >99% 1 diff

21763 MEDKRPLALTVLSVSVLIAVVISKLVSYATKPRLNLPPGPWKLPVIGSLHHLVGSHAIHR 21584

21583 SMRALAEKHGRHHLMQISLGEVFAVVVSSPEAAEEILRNQDVTFADRFLSTTIGVITFGG 21404

21403 NDMAFAPYGERWRQLRKLCTLELLSAARVRSFRRIREEEVARLVRDLAASAAAGEAVNLS 21224

21223 GRIAKLINDVVVRCCVGGRSEHRDEFLDALRTALDQTTWLTVADVFPSSKLARMLGTAPR 21044

21043 KALASRKKIEHILEQIIQERKRIMDRSSHGGDGDGEAMNTSECFLDVLLRLQKDGNTPIP 20864

20863 ITNEVIVVLLF 20831

18797 DMFSGGSETSSSTLIWTMAELIRKPKVMAKAHVEVRQAFQGKNTITEDDGVNELTYLKMV 18618

18617 IKESLRMHCPVPLLGPRKCRETCKVMGYDIPKDTTVFVNAWAICRDPKYWDDAEEFQPER 18438

18437 FENKSIDFKGSNFEFLPFGSGRRMCAAMNLGIANVELPLASLLYHFDWKLPDGMMPEDVD 18258

18257 MQDAPGILVGKRSSLIMCPVTRVAPSNPQVIAS 18159

 

>AC087550.2b $F CYP71Z4 chromosome 10 clone nbeb0016G17 same as seq on AC087544 from 1-3082

AQ330340 nbxb0046P18r 60% to D48250 65% to 76C4 almost identical to AC087550.2

139422 MEDKLPLALTVLSVSVLIAVVISKLVSYATKPRLNLPPGPWKLPVIGSLHHLVGSHAIHR 139243

139242 SMRALAEKHGRHHLMQISLGEVFAVVVSSPEAAEEILRNQDVTFADRFLSTTIGVITFGG 139063

139062 NDMAFAPYGERWRQLRKLCTLELLSAARVRSFRRIREEEVARLVRDLAASAAAGEAVNLS 138883

138882 GRIAKLINDVVVRCCVGGRSEHRDEFLDALRTALDQTTWLTVADVFPSSKLARMLGTAP 138706

138705 RKALASRKKIEHILEQIIQERKRIMDRSSHGGDGDGEAMNTSECFLDVLLRLQKDGNT 138532

138531 PIPITNEVIVVLLF (0)

136455 DMFSGGSETSSSTLIWTMAELIRKPKVMAKAHVEVRQAFQGKNTITEDDGVNELTYLKMV 136276

136275 IKESLRMHCPVPLLGPRKCRETCKVMGYDIPKDTTVFVNAWAICRDPKYWDDAEEFQPER 136096

136095 FENKSIDFKGSNFEFLPFGSGRRMCAAMNLGIANVELPLASLLYHFDWKLPDGMMPEDVD 135916

135915 MQDAPGILVGKRSSLIMCPVTRVAPSNPQVIAS* 135814

 

#184

>aaaa01005737.1 $FI CYP71Z5 (indica cultivar-group) orth of AP004790.1 >99%

14185 MEDKTILLSLALSMLLAILLSKLVSISKKPRLNLPPGPWTLPVIGSIHHLASNPNTHRALRALSQK 13988

13987 HGPLMQLWLGEVPAVVASTPEAAREILRNQDLRFADRHVTSTVATVSFDASDIFFSPY 13814

13813 GERWRQLRKLCTQELLTATRVRSFSRVREDEVARLVRELAGGGGAAVDLTERLG 13652

13651 RLVNDVVMRCSVGGRCRYRDEFLGALHEAKNQLTWLTVADLFPSSRLARMLGAAPRRGLA 13472

13471 SRKRIERIIADIVREHEGYMGSGGGGGDEAAAAAAGKDCFLSVLLGLQKEGGTPIPITNEIIVVLLF (0) 13271

10674 DMFSGGSETSATVMIWIMAELIRWPRVMTKVQAEVRQALQGKVTVTEDDIV 10522

10521 RLNYLKMVIKETLRLHCPGPLLVPHRCRETCKVMGYDVLKGTCVFVNVWALGRDPKYWED 10342

10341 PEEFKPERFENSDMDYKGNTFEYLPFGSGRRICPGINLGIANIELPLASLLYHFDWKLPD 10162

10161 EMASKDLDMQEAPGMVAAKLTSLCVCPITRVAPLISA* 10048

 

>AP004790.1 $F CYP71Z5 (japonica cultivar-group) chr 2

51668 MEDKTILLSLALSMLLAILLSKLVSISKKPRLNLPPGPWTLPVIGSIHHLASNPNTHRAL 51847

51848 RALSQKHGPLMQLWLGEVPAVVASTPEAAREILRNQDLRFADRHVTSTVATVSFDASDIF 52027

52028 FSPYGERWRQLRKLCTQELLTATRVRSFSRVREDEVARLVRELAGGGGAAVDLTERLGRL 52207

52208 VNDVVMRCSVGGRCRYRDEFLGALHEAKNQLTWLTVADLFPSSRLARMLGAAPRRGLASR 52387

52388 KRIERIIADIVREHEGYMGSGGDGGDEAAAAAAGKDCFLSVLLGLQKEGGTPIPITNEII 52567

52568 VVLLF 52582

55179 DMFSGGSETSATVMIWIMAELIRWPRVMTKVQAEVRQALQGKVTVTEDDIVRLNYLK 55349

55350 MVIKETLRLHCPGPLLVPHRCRETCKVMGYDVLKGTCVFVNVWALGRDPKYWEDPEEFMP 55529

55530 ERFENSDMDYKGNTFEYLPFGSGRRICPGINLGIANIELPLASLLYHFDWKLPDEMASKD 55709

55710 LDMQEAPGMVAAKLTSLCVCPITRVAPLISA 55802

 

#16

>aaaa01000393.1 $FI CYP71Z6 (indica cultivar-group) 89% to AP005114.1b

9756 MEDKLILALCLSALFVVVLSKLVSSAVKPRLNLPPGPWTLPLIGSLHHLAMTKSPQTHRSLRALS 9562

9561 EKHGPIMQLWMGEVPAVVVSSPAVAEEVLKNQDLRFADRHLTATTEEIFFGGRDVIFGP 9385

9384 YGERWRHLRKICMQELLTAARVRSFRGVREGEVARLVRELAASAAGAGAGAVGAAAGVNL 9205

9204 NERISKLANDIVMVSSVGGRCSHRDEFMEALEVAKKQITWLSVADLFPSSKLARMVAVAP 9025

9024 RKGLASRKRMELVIRRIIQERKDQLMDDSAAGAGEAAAGKDCFLDVLLRLQKEGGTPVP 8848

8847 VTDEIIVVLLF (0) 8815

4681 DMISGASETSPTVLIWTLAELMRNPRIMAKAQAEVRQAVAGKTTITEDDIVG 4526

4525 LSYLKMVIKETLRLHPPAPLLNPRKCRETSQVMGYDIPKGTSVFVNMWAICRDSRYWEDP 4346

4345 EEYKPERFENNSVDYKGNNFEFLPFGSGRRICPGINLGVANLELPLASLLYHFDWKLPNG 4166

4165 MAPKDLDMHETSGMVAAKLITLNICPITHIAPSSA* 4058

 

aaaa01000393.1 has no ortholog in nr or HTGS 9/2/02

 

#328

>aaaa01013736.1 $FI CYP71Z7 (indica cultivar-group) ortholog of BI811079.1 AP005114.1b

6446 MEDNKLILALGLSVLFVLLSKLVSSAMKPRLNLPPGPWTLPLIGSLHHLVMKSPQIHRSLRALSE 6252

6251 KHGPIMQLWMGEVPAVVVSSPAVAEEVLKHQDLRFADRHLTATIEEVSFGGRDVTFAPYS 6072

6071 ERWRHLRKICMQELLTAARVRSFQGVREREVARLVRELAADAGAGGDAGVNLNERISKLA 5892

5891 NDIVMVSSVGGRCSHRDEFLDALEVAKKQITWLSVADLFPSSKLARMVAVAPRKGLASRK 5712

5711 RMELVIRRIIQERKDQLMDDSAAGAGEAAAGKDCFLDVLLRLQKEGGTPVPVTDEIIVVL 5532

5531 LF (0) 5526

4289 DMFTGASETSPTVLIWILAELMRCPRVMAKAQAEVRQAAVGKTRITENDIVGLSYLKMVI 4110

4109 KEALRLHSPAPLLNPRKCRETTQVIGYDIPKGTSVFVNMWAICRDPNYWEDPEEFKPERF 3930

3929 ENNCVDFKGNNFEFLPFGSGRRICPGINLGLANLELALASLLYHFDWKLPNEMLPKDLDM 3750

3749 QETPGIVAAKLTTLNMCPVTQIAPSSAEDAS* 3654

 

>AP005114.1b $F CYP71Z7 (japonica cultivar-group) chromosome 2

BI811079.1 clone K015D02. Length = 347 57% to AC087550.2 C-helix

41% to 71B11

120645 MEDNKLILALGLSVLFVLLSKLVSSAMKPRLNLPPGPWTLPLIGSLHHLVMKSPQIHRSL 120824

120825 RALSEKHGPIMQLWMGEVPAVIVSSPAVAEEVLKHQDLRFADRHLTATIEEVSFGGRDVT 121004

121005 FAPYSERWRHLRKICMQELLTAARVRSFQGVREREVARLVRELAADAGAGGDAGVNLNER 121184

121185 ISKLANDIVMVSSVGGRCSHRDEFLDALEVAKKQITWLSVADLFPSSKLARMVAVAPRKG 121364

121365 LASRKRMELVIRRIIQERKDQLMDDSAAGAGEAAAGKDCFLDVLLRLQKEGGTPVPVTDE 121544

121545 IIVVLLF (0) 121565

122803 DMFTGASETSPTVLIWILAELMRCPRVMAKAQAEVRQAAVGKTRITENDIVGLSYLKMVI 122982

122983 KEALRLHSPAPLLNPRKCRETTQVMGYDIPKGTSVFVNMWAICRDPNYWEDPEEFKPERF 123162

123163 ENNCVDFKGNNFEFLPFGSGRRICPGINLGLANLELALASLLYHFDWKLPNGMLPKDLDM 123342

123343 QETPGIVAAKLTTLNMCPVTQIAPSSAEDAS* 123438

 

#29

>aaaa01000805.1a CYP71Z8 partial (indica cultivar-group) 100% to AC087544.2

4553 MEDKLLLLLALAVSVLVAVVISKLVSYATKPRLNLPPGPWTLPVIGSIHHLVGSHPIHRS 4374

4373 MRALAEKHGRDLMQVWLGELPAVVVSSPEAARDVLRSQDLAFADRYVSTTIAAIYLGGRD 4194

4193 LAFAPYGERWRQLRKLCTQRLLTAARVRSFRCVREEEVARLVRDLAASAAAGEAVDLTAR 4014

4013 VAELVNDVVVRCCIGGRRSRYRDEFLDALRTALDQTTWLTVADVFPSSKLARMLGTAPRK 3834

3833 ALASRKKMERILEQIIQERKQIKERSTGAGAGADDEAAAAGNECFLDVLLRLQKEGDTPI 3654

3653 PITNETMMLLLH 3618 sequence gap

 

#29

>aaaa01000805.1b CYP71Z8 partial (indica cultivar-group) 100% to AC087544.2

duplicate of first 46 aa probably an assembly error. Count only once.

18349 MEDKLLLLLALAVSVLVAVVISKLVSYATKPRLNLPPGPWTLPVIG 18212

 

>AC087544.2 $F CYP71Z8 chromosome 10 clone nbxb0046P18,

47% to CYP71D7

AZ131846.1 OSJNBb0111D08r CUGI Rice BAC Length = 377 59% to 71B9

AZ132319.1 OSJNBb0062F12r CUGI Rice BAC genomic Length = 683

14167 MEDKLLLLLALAVSVLVAVVISKLVSYATKPRLN

14065 LPPGPWTLPVIGSIHHLVGSHPIHRSMRALAEKHGRDLMQVWLGELPAVVVSSPEAARDV 13886

13885 LRSQDLAFADRYVSTTIAAIYLGGRDLAFAPYGERWRQLRKLCTQRLLTAARVRSFRCVR 13706

13705 EEEVARLVRDLAASAAAGEAVDLTARVAELVNDVVVRCCIGGRRSRYRDEFLDALRTALD 13526

13525 QTTWLTVADVFPSSKLARMLGTAPRKALASRKKMERILEQIIQERKQIKERSTGAGAGAD 13346

13345 DEAAAAGNECFLDVLLRLQKEGDTPIPITNETMMLLLH 13232 (0)

10760 NMFSAGSETSSTTLNWTMAELIKSPRVMAKVHDEVRQAFQGKNTITDDDVAKLSYLKMVT 10581

10580 KESLRMHCPVPLLGPRRCRETCKVMGYDVPKGTIVFVNAWAICRDSKYWKSAEEFKPERF 10401

10400 ENISIDYNGNNFEFLPFGSGRRICPGITLGMANVEFPLASLLYHFDWKLPNQMEPEEIDM 10221

10220 REAPGLVGPKRTSLYLHPVTRVAPSSV* 10119

 

#29

>aaaa01011405.1 CYP71Z8 (indica cultivar-group) orth AC087544.2 $F chr 10 99%

see aaaa01000805.1a = aaaa01000805.1b for ortholog

2160 SPRVMAKVHDEVRQAFQGKNTITDDDVAKLSYLKMVTKESLRMHCPVPLLGPRRCRET 2333

2334 CKVMGYDVPKGTIVFVNAWAICRDSKYWKSAEEFKPERFENISIDYNGNNFEFLPFGSGR 2513

2514 KICPGITLG 2540

 

#466

>aaaa01088222.1 CYP71Z10 i  not an exact match 64% to AP005114.1b $F

651bp frag. N-term runs off the end 69% to Zea mays BG836429

247 MEENKALLAAVSLSILLVILSKLKSFLATKPKLNLSPGPWTLPVIG

SLHHLVRSPNIYRAMRALAQKHGQLMTLRLGEVQCM 2

 

no japonica ortholog found 12/24/03

 

#468

>aaaa01092069.1 CYP71Z9 (indica cultivar-group) 61% to AC087544.2 frag = 623bp

91% to Zea mays BG836429

623 AIAMAF RQTSVLTLADLFPSSRLMQALGTAPRKVLACRDKIQRILEQVIQEKAQEMGRGDEATAGNEGFV 414

413 GVLLRLQKEGSTPVQLTNDTIIAVLY (0) 351

207 DMFSAGSETSSTTLNWCMTELVRSPVVMAKAQ AELRDAFKGKNTITENDLEGLSYLKLVI 28

27  KEALRMHAP 1

 

no japonica ortholog found 12/24/03

 

>Note 71Z9 and 71Z10 may be from the same gene. An ortholog of Zea mays

CG360193, BG836429, CG360202, CC013336, CG376419  Zea mays

MEQKVLVAVGVAVLLVVVLSKLKSVLVTKPKLNLPPGPWTLPLIGSTHHLVTS

PSIYRAMRDLAQKYGPLMMLRLGEVPTL VVSSPEAAQAITKTHDIAFADRHMNTTIGVLTFNGT

DLVFGPYGERWRQLRKICVLELFSVARVQSFQRIREEEVARFMQSLAASAGTVNLSK

MISRFINDTFVREFIGSRCKYQDEYLD AFDTAVRQTSVLTVADLFPSSRLMQAVGT

APRNALKCRNRITRILEQIIREKVEAMGRGEKTAHEGLIGVLLRLQKEANLPTLLTNDTIVALMF (0)

DLFGAGSDTSSTTLNWCITELIRHPAAMAKAQAEVREAFKGKARIISEDDLAGAGLSYLK

LVIKEALRMHCP

LPLLLPRLCRETCQVMGYDIPKGTAVFINVWAVCRDAKYWEDPEEFRPERFEDTNLEYN

YKGTNYEFLPFGSGRRMCPGANLGLGNIELALASLLYHYDWKLPDGVKPQDVQVWEGPGL

IAKKKTGLLLRPVTCIAFACSSG*

 

#93

>aaaa01002599.1a CYP71AA1P (indica cultivar-group) orth of AP004326.2d 100%

even the frameshifts are the same

8731 FFFCGLETTTTATIWAI*EFIKNPHAVEKAQSDIRKILGGKSIVEEADIEGQLHYFQMVN 8552

8552 QETLRLHPPVPLLLPRLWSEPCKIMGYDIP

     KNTAIFVNTWALGR

     KIKNTGLMQVSSG 8382

8381 LKYSRMGIVDFNGLDFRFLPCGAGRRICLGLMFELSDIELTLASLLYHSSWRLPTRSYSN 8202

8201 KLDMTEANGITTHRRIDIWLEATPFVPR 8118

 

>AP004326.2d $P CYP71AA1P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence Gene 4 pseudogene

81031 DFFFCGLETTTTATIWAI*EFIKNPHAVEKAQSDIRKILGGKSIVEEADIEGQLHYFQMVN 81213 frameshift

81213 QETLRLHPPVPLLLPRLWSEPCKIMGYDIP 81302 frameshift

81304  KNTAIFVNTWALGR 81345 frameshift

81344 KIKNTGLMQVSSGLKY

81393 SRMGIVDFNGLDFRFLPCGAGRRICLGLMFELSDIELTLASLLYHSSWRLPTRSYSNK 81566

81567 LDMTEANGITTHRRIDIWLEATPFVPR 81647

 

#94

>aaaa01002599.1b $FI CYP71AA2 (indica cultivar-group) ortholog of AP004326.2c $F 99%

12298 MAGIMDSTTASYYTTLLCGALLLAAVVFKLKTAAAFSRHNAGVNLPPGPWALPVIGSIHC 12119

12118 LLGSLPHHAMRELSRRYGPVMLLRLGHVRTLVLSSPEAAREVMKTHDVAFANRAVTPTAS 11939

11938 ILTYGARDIVFAPFGKHLRELRKLCALELLSPRRVRSFRHVREEEAARLARSVAAAASAS 11759

11758 SAVNVSELVKIMTNDVTMRAIIGDRCPQREEYLEALDKTMDLLAGFNLVDLFPGSPLARV 11579

11578 LGGRSLRTTKRVHEKLHQITEAIIQGHGIKDTVGDEHHECEDIL 11447

11446 DVLLRFQRDGGLGITLTKEIVSAVLF 11369

11213 DLFAGGSETTSTTILWAMSELMRSPHVMEQAKYEIRQVLQGKAMVSEADIEGRLHYLQLV 11034

11033 IKETLRLHPPVPIVIPRLCSKPNSKIMGYDIPQGTSVLVNVSAIGRDEKIWKNVNEFRPE 10854

10853 RFKDDIVDFSGTDFRFIPGGSGRRMCPGLTFGVSNIEIALVTLLYHFDWKLPSETDTHEL 10674

10673 DMRETYGLTTRRRSELLLKATPSY 10602

 

>AP004326.2c $F CYP71AA2 genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence Gene 3 39% to 71B11

77487 MAGIMDSTTASYYTTLLCG

77544 ALLLAAVVFKLKTAAAFSRHNAGVNLPPGPWALPVIGSIHCLLGSLPHHAMRELSRRY 77717

77718 GPVMLLRLGHVRTLVLSSPEAAREVMKTHDVAFANRAVTPTASIDIVFAPFG 77873

77874 KHLRELRKLCALELLSPRRVRSFRHVREEEAARLARSVAAAASASSAVNVSELVKIM 78044

78045 TNDVTMRAIIGDRCPQREEYLEALDKTMDLLAGFNLVDLFPGSPLARVLGGRSLRTTKRV 78224

78225 HEKLHQITEAIIQGHGIKDTVGDEHHECEDILDVLLRFQRDGGLGITLTKEIVSA 78389

78390 VLF 78398 (0)

78554 DLFAGGSETTSTTILWAMSELMRSPHVMEQAKYEIRQVLQGKAMVSEADIEGRLHYLQLV 78733

78734 IKETLRLHPPVPIVIPRLCSKPNSKIMGYDIPQGTSVLVNVSAIGRDEKIWKDVNEFRPE 78913

78914 RFKDDIVDFSGTDFRFIPGGSGRRMRPGLTFGVSNIEIALVTLLYHFDWKLPSETDTHEL 79093

79094 DMRETYGLTTRRRSDLLLKATPSYARLGWSTNMQIYSVKCLVYE* 79228

 

#95

>aaaa01002599.1c $FI CYP71AA3 (indica cultivar-group) ortholog of AP004326.2b $F >99%

22237 MAGIVDTAAFCTLLCLLLTLVVFKLKTATSSRHNAGVNLPPGPWALPVIGSIHCLLGSLP 22058

22057 HHAMRELSRRYGPVMLLRLGHVRTLVLSSPEAAREVMKTHDAAFATRAVTPTASILTYGA 21878

21877 RDIVFAPFSKHLRELRKLCTLELLSPRRVRSFRHVRDEEAARLARSVAAAAPAVVNVSEL 21698

21697 VKIMANNIIMTAIIGDTCPQRDEYLEALDKTMDLMNGFNLIDLFPGSRLARVLGARSLRA 21518

21517 TKRVHQKLHQITDTIIQGHEIIEDGSVGDDTIQETVGTHNMHGHGHKCEDILDVLLRFHR 21338

21337 DGGLGITLTKEIVSAVLF 21284

20770 DLFAAGSETTSTTIIWAMSELVRTPHVMERAQSEIHQVLQGKTVVSEADIEGRLHYLQLV 20591

20590 IRETLRLHPPVPFLIPRLCSEANSKIMRYNIPQGAMVLVNISAIGRDEKIWKNANEFRPE 20411

20410 RFKDDMVDFSGTDFRFIPGGAGRRMCPGLTFGLSNIEIALASLLYHFDWKLPNDASSCKL 20231

20230 DMRETHGVTARRRTELLLKATPLYT 20156

 

>AP004326.2b $F CYP71AA3 genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence

Gene 2 no good matches in NR 79% to AP004326.2c

71860 MAGIVDTAAFCT

71896 LLCLLLTLVVFKLKTATSSRHNAGVNLPPGPWALPVIGSIHCLLGSLPHHAMRELSRR 72069

72070 YGPVMLLRLGHVRTLVLSSPEAAREVMKTHDAAFATRAVTPTASILTYGARDIVFAPF 72243

72244 SKHLRELRKLCTLELLSPRRVRSFRHVRDEEAARLARSVAAAAPAVVNVSELVKI 72408

72409 MANNIIMTAIIGDTCPQRDEYLEALDKTMDLMNGFNLIDLFPGSRLARVLGARSLRATKR 72588

72589 VHQKLHQITDTIIQGHEIIKDGSVGDDTIQETVGTHNMHGHGHKCEDILDVLL 72747

72748 RFHRDGGLGITLTKEIVSAVLF 72813 (0)

73327 DLFAAGSETTSTTIIWAMSELVRTPHVMERAQSEIRQVLQGKTVVSEADIEGRLHYL 73497

73498 QLVIRETLRLHPPVPFLIPRLCSEANSKIMRYNIPQGAMVLVNISAIGRDEKIWKNANEF 73677

73678 RPERFKDDMVDFSGTDFRFIPGGAGRRMCPGLTFGLSNIEIALASLLYHFDWKLPNDASS 73857

73858 CKLDMRETHGVTARRRTELLLKATPLYT* 73944

 

cluster continues on AP004326.2 seq a

 

#334

>aaaa01014066.1 CYP71AA4P (indica cultivar-group) orth AP004326.2a $P

chr 1 100%

5245 YMQLVIKETLRLHLPFPLLFPRLCTETCKIMGFDVPKGTIVIVNNWAISRDERCWEDAED 5424

5425 F*PERFEHDDTDYNGTYFQFLSGGFGRRMFPDFIFAQFNIEIALANLLYHFDWELPCSEN 5604

5605 RMELDMTESAGLT 5643

 

>AP004326.2a $P CYP71AA4P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence

Length = 102983

4 genes 71B like

Gene 1 pseudogene 71 family

67989 LPPVPWPLPVIGSMH*LLGSLPHH 68060 frameshift with deletion

68060 RPACAVELLSPRRARSFRRVREAEPARLVRAVAASPAWPLVNVVGGEHVAAMMTAV 68227

68228 GARP 68239 frameshift with small deletion

68238 RCPRQEEYLEELGKVAKLAAGFNLVDLFPESRLVRAAQAAHGKIHSIMDAMVQ 68396

68397 DHLKAMEERREEVADGVVDDGDGDGADRDEELLSILLRFQRDGGLGITLTNGNHQRDS 68570 (0)

68886 GILAGGSDTTTTTVMWAMSELLRCPRAMQ 68972 frameshift with deletion

69023 YMQLVIKETLRLHLPFPLLFPRLCTETCKIMGFDVPKGTIVIVNNWAISRDERCWEDAED 69202

69203 F*PERFEHDDTDYNGTYFQFLSGGFGRRMFPDFIFAQFNIEIALANLLYHFDWELPCSEN 69382

69383 RMELDMTESAGLTASRLTDLFG* 69451

 

#442

>aaaa01051575.1 CYP71AA5 (indica cultivar-group) 69% to AP004326.2b

855 DVFAAGSETTATATIWAMSELVRTPRLMERAQAEIRQLLQGKTRVAEEDIQGRLPYLQMV 676

675 IKETLRLHPPAPLILPRLCAESTKILGFDVPEGTTVFVNAWALGRDDKSWVDANEFKPER 496

495 FEDDDRVDFSGADFRFIPGGSGRRMCPGLTFGLANIETTLANLLYHFDWKLPGGANPYEL 316

315 DMAESYGITARRTTDLLLEATPYVPHGSVS* 223

 

no japonica ortholog found 12/24/03; best japonica match = 69% but Zea = 74%

 

>CG055548, CG372667, CG055546, CG349250  Zea mays genomic clones

74% possible ortholog of 71AA5

60% to 71AA2,

AETTPAVWYTLLCLLAGVAVLLKLKTKAIASRHSAGSLN CA138911 Saccharum officinarum

                          KPSPSRHGAGASA

LPLYLLPPGPRPLPVIGNLHCLLGALPHH

     LPPGPRPLPVIGNLHCLLGALPHH CA138911 Saccharum officinarum

AMRALARRYGDVVLLRLGHVPTVVVSSPEAAREVMRTHDAVVSNRPLYVTADVLSYGGQN

IAFAPSGSPHWKELRRLCAAELLSPRRVLSFRPVREEEAACLVRSVDAASPPPFLVNVSE

RVKALMNDVLMRCAVGDTCRMRDEYIAELDEALRLLAGFNLVDLFPGSRLARALGAGSLR

AAREVHGRVHRIVQAIIQDHASKAADDGAGSRDEDILDVLLRLQRDGGLETVLTTQVLCGVLF (0?)

DVFAAGSETTATTTIWAMSELAKNPGVMRRAQSEVRRVLEGKTRVAEADIQGRLPYLQAV

IREALRLHPPLPLILPRSCAEPITILGHHVPKGTTVFVNAWAIGRDERWWPDASQFKPER

FEGEGVVDFSGADFRFLPGGGGRRMCPGLTFGVANVEIALASLLYHFDWELPGGADPGAL

DMGEAYGITARRKTDLVLKATPFVPTN*

 

#273

>aaaa01010273.1 $FI CYP71AB1 (indica cultivar-group) ortholog of AC113337.1

6315 MANLIYYSLLIILPFLFLIKFYKAMFSSRKQARRLPPCPWQLPIMGSIHHLIGDLPHRAL 6494

6495 RDLSRRYGPVMLLKFGQVPFIIVSSPEAAKDIMKTHDSIFATRPQSEIMKIITKRGQGLV 6674

6675 FAPYDDQWRQLRKICIRELLCAKRVQSFCAIREEEAARLVKSISSDQAHLVNLSKKLADY 6854

6855 ATDAAIRIITGTRFENQE VRDKFQYYQDEGVHLAASFCPANLCPSLQLGNTLSRTAHKA 7031

7032 EIYREGMFAFIGGIIDEHQERRAQDMSHKEDLIDVLLRIQQEGSLESPVSMETIKFLIF (0) 7208

7297 DILAGGSETVTTVLQWAMAELMRNPTVMSKVQDEVREVFKWKEMVSNDDINKLTYLQFVI 7476

7477 KETLRLHTPGPLFMRECQEQCQVMGYDMPKGTKFLLNLWSISRDPKYWDDPETFKPERF 7653

7654 EDDARDFKGNDF EFISFGAGRRMCPGMLFGLANIELALANLLFYFDWSLPDGVLPSELDM 7833

7834 TENFGVTVRKKEDLLLHASLYAQLSC* 7914

 

>AC113337.1 $F CYP71AB1 (japonica cultivar-group) cultivar Nipponbare clone OSJNBa0061H20,

from chromosome 10

AC074355.2 Oryza sativa clone OSJNBa0071I20, gene 1 43% to 71A13

AQ288798 65-164 region C-helix 54% to 71A12 same as AC074355.2

AQ840770.1 nbxb0071I20f CUGI Rice BAC genomic clone Length = 754

AQ840078.1 nbxb0051B18f CUGI Rice genomic clone Length = 694

similar to lotus 71D

AQ865944.1 nbeb0026D10f BAC genomic Length = 473 59% to 99A1 69% to AP004000.1

23542 MANLIYYSLLIILPFLLLINFYKAMFSSRKQAGRLPPCPWQLPIMGSIHHLIGDLPHRSL 23721

23722 HDLSRRYGPVMLLKFGQVPFIIVSSPEAAKDIMKTHDSIFAMRPQSEIMKIITKRGQGLV 23901

23902 FAPYDDQWRQLRKICIRELLCAKRVQSFCAIREEEAARLVKSISSDQAHLVNLSKKLADY 24081

24082 ATDAAIRIITGTRFENQEVRDKFQYYQDEGVHLAASFCTANLCPSLQLGNTLSRTARKAE 24261

24262 IYREGMFAFIGGIIDEHQERRAQDMYHKEDLIDVLLRIQQEGSLESPVSMETIKFLIF (0)

      DILAGGSETVTTVLQWAMTELMRNPTVMSKAQ 24621

24622 DEVREVFKWKKMVSNDDINKLTYLQFVIKETVRLHTPGPLFMRECQEQCQVMGYDVPKGT 24801

24802 KFLLNLWSISRDPKYWDDPETFKPERFENDARDFKGNDFEFIPFGAGRRMCPGMLFGLAN 24981

24982 IELALANLLFYFDWSLPDGVLPSELDMTENFGVTVRKKEDLLLHASLYAQLSC* 25143

 

#263

>aaaa01009869.1 CYP71AB2 (indica cultivar-group) orth AP004684.1b $F chr 6 98%

2834 LPLVHYLITLFLHGSRDSDLRLPPGPWRLPLIGSLHHLFFGALPHRALRDLARRHGPLML 3013

3014 LAFGDAPVVVVASTAAAAREILRTHDDNFSSRPLSAVVKACTRRGAGITFAPYGEHWRQV 3193

3194 RKICRLELLSPRRILAFRAIREEEAARLVRAIGVASPPLVTNLSQLLGNYVTDTTVHIV 3370

3371 MGERFRERDALLRYVDEAVRLAGSLTMADLFPSSRLAHAMSSTTLRRAEAFVES 3532

3533 LMEFMDRVIREHLEKKRSCQGGEREEDLIDVLLRLQAEGSLHFELTMGIIRAVIF

     DLFSGGSETATTT 3886

3887 LQWAMAELMRNPGVMSRAQAEVREAYKDKMEVTEEGLTNLTYLQCIIKETLRLHTPGP 4060

4061 LALPRECQEQCRILGYDIPKGATVLVNVWAICTDTEFWDESEKFMPERFEGSTIEHKGNN 4240

4241 FEFIPFGAGRRICPGMQFGIANIELALANLLFHFDWTLPEGTIHSDLDMTETMGITARRK 4420

4421 EDL 4429

 

>AP004684.1b $F CYP71AB2 chromosome 6 clone P0012H03, Length = 163117

New seq similar to AP004000 57% to AP003523.1 78% to AP004688.1 

36% to 41% with 71A and 71B sequences possibly new subfamily in 71

117908 MDAAVFCCLLALLPLLHYLITLFLHGSRDSDLRLPPGPWRLPLIGSLHHLF 118060

118061 FGALPHRALRDLARRHGPLMLLAFGDAPVVVVASTAGAAREILRTHDDNFSSRPLSAV 118234

118235 VKVCTRRGAGITFAPYGEHWRQVRKICRLELLSPRRILAFRAIREEEAARLVRAIGVASP 118414

118415 PLVTNLSELLGNYVTDTTVHIVMGERFRERD ALLRYVDEAVRLAGSLTMADLFPSSRLAR 118594

118595 AMSSTTLRRAEAFVESLMEFMDRVIREHLEKKRSCQGGEREEDLIDVLLRLQAEGSLHFE 118774

118775 LTMGIIRAVIF 118807 (0)

118932 DLFSGGSETATTTLQWAMAELMRNPGVMSRAQAEVREAYKDKMEVTEEGLTNLTYLQCII 119111

119112 KETLRLHTPGPLALPRECQEQCQILGYDIPKGATVLVNVWAICTDNEFWDESEKFMPERF 119291

119292 EGSTIEHKGNNFEFIPFGAGRRICPGMQFGIANIELALANLLFHFDWTLPEGTLHSDLDM 119471

119472 TETMGITARRKEDLYVHAIPFVQLP* 119549

 

#71

>aaaa01002000.1 CYP71AB3 (indica cultivar-group) ortholog of AP004688.1 $F 98%

6046 METADLCCLLALLPLVYCLLTLFHGSRESDLRLPPGPWRLPLIGSLHHLFGRTLPHHALR 5867

5866 DLARLHGPLMLLSFGQASPVVIASTAIAAREIMRTHDDNFSTRPLSTVLKVCTRYGAGMT 5687

5686 FVPYGEHWRQVRKICSLELLSPRRILKFRSIREEEVARLVLAIASSSTPTPTPPAPVNLS 5507

5506 KLLSNYMTDATVHIIMGQCFRDRDTLVRYVDEAVRLASSLTMADLFPSWRLPRVMCATTL 5327

5326 HRAEVFVESVMEFMDRVISEHLEKRSCQGGDREEDLIDVLLRLQAEGNLEFELTTSIIKA 5147

5146 IIF 5138

     ELLAGGSEAPITTLQWAMAELMRNPDVMSRAQAEVREAYKEKMKV 4911

4910 TEEGLTNLPYLHCIIKETLRLHTPGPFVLPRECQEQCQILGYDVPKRATVVVNIWAICRD 4731

4730 AEIWDEPEKFMPDRFEGSAIEHKGNHFEFIPFGAGRRICPGMNFALANMELALASLLFYF 4551

4550 DWSLPEDVLPGDLDMTETMGLTARRKEDLYVCAIPFVQLP 4431

 

>AP004688.1 $F CYP71AB3 chromosome 6 clone P0036C11, Length = 137929

New seq similar to AP004000 37% to 71B23 52% to AP003523.1 78% to AP004684.1b 

57304 METAELCCLLALLPLVYCLLTLFHGSRESDLRLPPGPWRLPLIGSLHHLFGRTLPHRA 57477

57478 LRDLARLHGPLMLLSFGQAAPVVIASTAIAAREIMRTHDDNFSTRPLSTVLKVCTRYGA 57654

57655 GMTFVPYGEHWLQVRKICSLELLSPRRILKFRSIREEEVARLVLAIASSSTPTPTPPAPV 57834

57835 NLSKLLSNYMTDATVHIIMGQCFRDRDTLVRYVDEAVRLASSLTMADLFPSWRLPRVMCA 58014

58015 TTLHRAEVFVESVMEFMDRVISEHLEKRSCQGGDREEDLIDVLLRLQAEGNLEFELSTSI 58194

58195 IKAIIF 58209 (0)

58290 ELLAGGSEAPITTLQWAMAELMRNPDVMSRAQAEVREAYKEKMKVTEEGLTNLPY 58469

58470 LHCIIKETLRLHTPGPFVLPRKCQEQCQILSYDVPKRATVVVNIWAICRDAEIWDEPEKF 58649

58650 MPDRFEGSAIEHKGNHFEFIPFGAGRRICPGMNFALANMELALASLLFYFDWSLPEDVLP 58829

58830 GDLDMTETMGLTARRKEDLYVCAIPFVQLP* 58919

 

#267

>aaaa01010030.1b CYP71AC1 (indica cultivar-group) orth AP003523.1b $F chr 6  99%

6579 QDMFAGGSESTSTTLEWALSELVRNPHVMQKAQAEIRHALQGRTRVTEDDLINLKYPK 6406

6405 NVIKETLRLHPVAPLLVPKECQESCKILGYDVPKGTIMFVNAWAIGRDPRYWNDAEVFMP 6226

6225 ERFEKVAVDFRGTNFEFIPFGAGRRMCPGITFANATIEMALTALLYHFDWHLPPGVTPDG 6046

6045 LDMEEEFGMSVSRKRDLYLRPTLH 5974

 

>AP003523.1b $F CYP71AC1 chromosome 6 clone P0416A11 six different genes

54-58% with genes in AP003090 and AP004000 group

38% to 41% with 71A and 71B sequences possibly new subfamily in 71

118132 MDLMKSNPLQGSPWSL

118084 LNLLVLIIVAAMICGELCRRRRRRRGDENGGATRLPPGPWRLPFVGSLHHLAVMRPRGVV 117905

117904 VHRALAELARRHDAPVMYLRLGELPVVVASSPEAAREVLKTHDAAFATRAMSVTVRESIG 117725

117724 DKVGILFSPYGKKWRQLRGICTLELLSVKRVRSFRPIREEQVARLVDAIAAAAASS 117557

117556 TAEAAAVNISRQITGPMTDLALRAIMGECFRWREEFLETLAEALKKTTGLGVADMFPSSR 117377

117376 LLRAVGSTVRDVKLLNAKLFELVECAIEQHREQIRAAHDNGGDDDDAHGHGDKECFLNTL 117197

117196 MRIQKEGDDLDD 117161 (frameshift) LTMATVKAVIL (0)

       DMFAGGSESTSTTLEWALSELVRNPHVMQKAQAEIRHALQGRTRV 115967

115966 TEDDLINLKYPKNIIKETLRLHPVAPLLVPKECQESCKILGYDVPKGTIMFVNAWAIGRD 115787

115786 PRYWNDAEVFMPERFEKVAVDFRGTNFEFKPFGAGRRMCPGITFANATIEMALTALLYH 115610

115609 FDWHLPPGVTPDGLDMEEEFGMSVSRKRDLYLRPTLHMGLETI* 115478

 

#267

>aaaa01031277.1 CYP71AC1 (indica cultivar-group) orth AP003523.1b $F chr 6 99%

see AP003523.1b above for ortholog

1393 LPPGPWRLPFVGSLHHLAVMRPRGVVVHRALAELARRHDAPVMYLRLGELPVVVASSPEA 1572

1573 AREVLKTHDAAFATRAMSVTVRESIGDKVGILFSPYGKKWRQLRGICTLELLSVKRVRSF 1752

1753 RPIREEQVARLVDAIAAGA 1809

 

#25

>aaaa01000575.1 $FI CYP71AC2 (indica cultivar-group) 74% to AP003523.1

same seq as aaaa01002303.1 $FI orth to AP005610.1 AP005192.1

31724 MDMEMGKLLHRPWKWSLNSPLL

31558 LLLIVPVMIHVQLKLRRRRKNAAAGTRLPPGPWRLPVIGSLHHLAMNPKAVHRALADLAR 31379

31378 RCGGGGGVMYLRLGELPVVVASSRDAAREVLRTHDAAFATRAMSVTVRDSIGDTVGILF 31202

31201 SPYGERWRRLRGICSLELLNARRVRSFRPIREEQVARLVGAIAAAAAAPGGDQPPPVNVS 31022

31021 WQIAGALTDLTLRAIMGECGFRWREEFLETLGEAQRKASRFGVADLFPSSRLLRAVGSTA 30842

30841 VRDVRALNAKLFELVDRAIEQHREAAATTAAGGDHDDGGDDDARDDNECLLNTLMRIQKE 30662

30661 GGGTLSMSTVKAVIL (0) 30617

28748 DMFAGGSETTSTILEWAMSELVKNPQVMQKAQAQIRLALQGRSRITEDDLINLSY 28584

28583 PKNIIKETLRLHPVAPLLGXKECQESCKILGYNIPKGSIMLVNVWAIGRDHRYWDDAEVF 28404

28403 LPERFEEITVDFGGTNYEFIPFGGGRRICPGITFAHATLEWALTALLYHFDWHLPPSVTP 28224

28223 DGLDMEEEFGMNVRRKRDLHLHPVIHVGVEKGIMS* 28116

 

#25

>aaaa01002303.1 $FI CYP71AC2 (indica cultivar-group) same seq as AAAA01000575.1

except 2 aa diffs and one short frameshifted region

see AAAA01000575.1 for ortholog

3994 MDMEMGKLLHRPWKWSLNSPLLLLLIVPVMIHVQLKLRRRRKNAAVGTRLPPGPWRLPVI 4173

4174 VQSAPPRHEPEGGARALADLARRCGGGGGVMYLRLGELPVVVASSRDAAREVLRTHDAAF 4353

4354 ATRAMSVTVRDSIGDTVGILFSPYGERWRRLRGICSLELLNARRVRSFRPIREEQVARLV 4533

4534 GAIAAAAAAPGGDQPPPVNVSWQIAGALTDLTLRAIMGECGFRWREEFLETLGEAQRKAS 4713

4714 RFGVADLFPSSRLLRAVGSTAVRDVRALNAKLFELVDRAIEQHREAAATTAAGGDHDDGG 4893

4894 DDDARDDNECLLNTLMRIQKEGGGTLSMSTVKAVIL 5001

6869 DMFAGGSETTSTILEWAMSELVKNPQVMQKAQAQIRLALQGRSRITEDDLINLSYPKNII 7048

7049 KETLRLHPVAPLLGXKECQESCKILGYNIPKGSIMLVNVWAIGRDHRYWDDAEVFLPERF 7228

7229 EEITVDFGGTNYEFIPFGGGRRICPGITFAHATLEWALTALLYHFDWHLPPSVTPDGLDM 7408

7409 EEEFGMNVRRKRDLHLHPVIHVGVEKGIMS 7498

 

>AP005610.1 $F CYP71AC2 (japonica cultivar-group) chr 6 = AP005192.1

115277 MDMEMGKLLHRPWKWSLNSPLLLLLIVPVMIHVQLKLRRRRKNAAAGTRLPPGPWRLPVI 115456

115457 GSLHHLAMNPKAVHRALADLARRCGGGGGVMYLRLGELPVVVASSRDAAREVLRTHDAAF 115636

115637 ATRAMSVTVRDSIGDTVGILFSPYGERWRRLRGICSLELLNARRVRSFRPIREEQVARLV 115816

115817 GAIAAAAAAPGGDQPPPVNVSWQIAGALTDLTLRAIMGECGFRWREEFLETLGEAQRKAS 115996

115997 RFGVADLFPSSRLLRAVGSTAVRDVRALNAKLFELVDRAIEQHREAAATTAAGGDHDDGG 116176

116177 DDDARDDNECLLNTLMRIQKEGGGTLSMSTVKAVIL 116284

118153 DMFAGGSETTSTILEWAMSELVKNPQVMQKAQAQIRLALQGRSRITEDDLINLSYPKNII 118332

118333 KETLRLHPVAPLLMPKECQESCKILGYNIPKGSIMLVNVWAIGRDHRYWDDAEVFLPKRF 118512

118513 EEITVDFGGTNYEFIPFGGGRRICPGITFAHATLELALTALLYHFDWHLPPSVTPDGLDM 118692

118693 EEEFGMNVRRKRDLHLHPVIHVGVEKGIMS 118782

 

>AP005192.1 $F CYP71AC2 (japonica cultivar-group) chr 6

83856 MDMEMGKLLHRPWKWSLNSPLLLLLIVPVMIHVQLKLRRRRKNAAAGTRLPPGPWRLPVI 83677

83676 GSLHHLAMNPKAVHRALADLARRCGGXGGVMYLRLGELPVVVASSRDAAREVLRTHDAAF 83497

83496 ATRAMSVTVRDSIGDTVGILFSPYGERWRRLRGICSLELLNARRVRSFRPIREEQVARLV 83317

83316 GAIAAAAAAPGGDQPPPVNVSWQIAGALTDLTLRAIMGECGFRWREEFLETLGEAQRKAS 83137

83136 RFGVADLFPSSRLLRAVGSTAVRDVRALNAKLFELVDRAIEQHREAAATTAAGGDHDDGG 82957

82956 DDDARDDNECLLNTLMRIQKEGGGTLSMSTVKAVIL 82849

80980 DMFAGGSETTSTILEWAMSELVKNPQVMQKAQAQIRLALQGRSRITEDDLINLSYPKNII 80801

80800 KETLRLHPVAPLLMPKECQESCKILGYNIPKGSIMLVNVWAIGRDHRYWDDAEVFLPKRF 80621

80620 EEITVDFGGTNYEFIPFGGGRRICPGITFAHATLELALTALLYHFDWHLPPSVTPDGLDM 80441

80440 EEEFGMNVRRKRDLHLHPVIHVGVEKGIMS 80351
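
The coordinate-flanked lines above can be sanity-checked: the nucleotide span between the two numbers should equal three times the number of residues printed between them. The following is a minimal sketch of that check, not part of the original annotation. The function name codon_span_matches is hypothetical, and it assumes each line has the layout "start residues end", optionally with an intron-phase mark such as (0).

```python
import re

def codon_span_matches(line):
    """Return True if the nucleotide span on a coordinate line equals 3 x residues."""
    # Drop intron-phase marks such as "(0)" before splitting into fields.
    fields = re.sub(r"\(\d\)", " ", line).split()
    start, residues, end = int(fields[0]), fields[1], int(fields[2])
    # abs() handles minus-strand lines, where the start exceeds the end.
    span = abs(end - start) + 1
    # A trailing "*" (stop) is simply counted as one more codon.
    return span == 3 * len(residues)

# One minus-strand line (aaaa01000575.1) and one plus-strand line (AP005610.1).
minus = "31558 LLLIVPVMIHVQLKLRRRRKNAAAGTRLPPGPWRLPVIGSLHHLAMNPKAVHRALADLAR 31379"
plus = "115277 MDMEMGKLLHRPWKWSLNSPLLLLLIVPVMIHVQLKLRRRRKNAAAGTRLPPGPWRLPVI 115456"
print(codon_span_matches(minus), codon_span_matches(plus))
```

A line that fails this check is a candidate for a frameshift, an unmarked gap, or a typing error in a coordinate.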

 

#208

>aaaa01007044.1 $PI CYP71AC3P (indica cultivar-group) seq gap at 5222 no Nterm exon

might be in this gap but also one frameshift so probably a pseudogene of an

AP003523 like gene.

6146 DMFAGGSETTSTTLEWA 6196 (frameshift and deletion of )

6196 PEVMQKAQAEIRHALQGKSRVTEDDLINLKYPKNIIKETMRLHPLASLLVPRKCQESCKI 6375

6376 LGYDIPKGTILIMNVWTIGRDHRYWDDAEVFIPERFEDTTIDFKGTHFEFIPFGAGRRMC 6555

6556 LGMTFAHATIELALTALLYHFDWHLPHGVTHDGMDMEEQFSVTVSRKRDLYLHPIQHVGVEEI* 6747

 

aaaa01007044.1 no japonica ortholog found 9/7/02

 

#427

>aaaa01034252.1 CYP71AC4P = same gene as AC3P (indica cultivar-group)

77% to AP003523.1b may be a pseudogene

1319 R*VVHRALADLVRRCDDLAPLMYLCLSELRVVVASTPDAAREVLKTHDAAMSTVVSAN 1146

1122 FAPYGKRWRHLRGICTLELLSAKRVRSFRPIREEQDARLVGAVVAAAAPSGESVNVRRLI 943

942  GGPMTDLALRAIMGE 898

 

>AC137921.2 Oryza sativa chromosome 3 BAC OSJNBa0027H16 genomic sequence, complete sequence

          Length = 163108

 

CYP71AC3P/4P combined

MDKSGTDELNLLWSCSLNLTLLLLLLVPAGIHIVSLKLRRRRENASTDGLRLP (AA 51)

deletion of 15 aa

160156 (AA 67) AMNR*VVHRALADLVRRCDDLAPLMYLCLSELRVVVASTPDAAREVLKTHDAAMSTAVSANF 159971

159950 FAPYGKRWRHLRGICTLELLSAKRVRSFRPIREEQDTRLVGAVVAAAAPSGEPVNVRRLI 159771

159770 GRPMTDLALRAIMGE 159726 (AA 219)

deletion of 34 aa

159714 (AA 254) VRTAATSAVRDVKLLSAKLYDMVGRAIEQHQEHADDGGTHGNRECLLS

TLLRIPKEGDNNDDGGDLTMANVKAVIL (AA 336)

THIS PIECE = ORTHOLOG OF aaaa01007044.1 = 71AC3P  2 AA DIFFS

(AA 337) DMFAGGSETTSTTLEWAL (FS and 6 aa deletion)

PEVMQKAQAEIRHALQGKSRVTEDDLINLKYPKNIIKETMRL

HPLASLLVPRKCQESCKILGYDIPKGTILIVNVWTIGRDHRYWDDAEVFIPERFEDTT

IDFKGTHFEFISFGAGRRMCLGMTFAHATIELALTALLYHFDWHLPHGVTHDGMDMEE

QFSVTVSRKRDLYLHPIQHVGVEEI*

 

#37

>aaaa01021566.1 CYP71AC5P (indica cultivar-group) orth of AL606658.1 2 diffs

lone pseudogene fragment

1048 SALNVSRQITGTLTDLTLRAIMGECGFRWHEEFLETLGEAQKKATRFGVADLFPSSRLLP 1227

1228 AVGSRSGD 1251

 

>AL606658.1 $P CYP71AC5P chromosome 4 clone OSJNBb0016D16 lone pseudogene fragment

72% to AP003523 118132-115478

120987 SALNVSWQITGTLTDLTLHAIMGECGFRWHEEFLETLGEAQKKATRFGVADLFPSSRLLPAVGSRSGD 120784
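
Notes such as "2 diffs" can be reproduced by a position-by-position comparison of two fragments of equal length. The sketch below is illustrative only: count_diffs is a hypothetical helper, and it assumes the two fragments are already aligned without gaps, which holds for the CYP71AC5P indica and japonica fragments above.

```python
def count_diffs(a, b):
    """Count mismatched positions between two equal-length, ungapped sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be the same length (no gaps handled)")
    return sum(1 for x, y in zip(a, b) if x != y)

# aaaa01021566.1 (indica), two lines joined, and AL606658.1 (japonica).
indica = ("SALNVSRQITGTLTDLTLRAIMGECGFRWHEEFLETLGEAQKKATRFGVADLFPSSRLLP"
          "AVGSRSGD")
japonica = "SALNVSWQITGTLTDLTLHAIMGECGFRWHEEFLETLGEAQKKATRFGVADLFPSSRLLPAVGSRSGD"

diffs = count_diffs(indica, japonica)
identity = 100.0 * (len(indica) - diffs) / len(indica)
print(diffs, round(identity, 1))
```

Fragments with internal deletions or frameshifts need a real alignment first; this count is only meaningful when the residues line up one to one.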

 

#38

>aaaa01013200.1 $PI CYP71AC6P (indica cultivar-group) 3 diffs with AL606658.1 95%

94% to AP004571.1 and AP004327.1 lone pseudogene fragment

7465 ALNVSRQITGTLTDLTLHAIMGECGFRWREEFLETLGEAQKKATRFGVADLFPSSRLLPA 7644

7645 VESRSGD 7665

 

>AP004571.1 $P CYP71AC6P (japonica cultivar-group) chr 6 94% to AAAA01013200.1

identical to AP004327.1 lone pseudogene fragment

60465 ALNVSRQITGTLTDLTLRAIMWECGFRWREEFLETLGEAQKKATRFGVADLFLSSRLLPA 60286

60285 VGSRSGD 60265

 

>AP004327.1 $P CYP71AC6P (japonica cultivar-group) chr 6 94% to AAAA01013200.1 4 diffs

identical to AP004571.1 lone pseudogene fragment

105764 ALNVSRQITGTLTDLTLRAIMWECGFRWREEFLETLGEAQKKATRFGVADLFLSSRLLPA 105943

105944 VGSRSGD 105964

 

#39

>aaaa01017762.1 $PI CYP71AC7P (indica cultivar-group) 89% to AL606658.1

92% to AAAA01013200.1 lone pseudogene fragment

4196 ALNVSRQITGTLTDLTLRAIMGECGFRWREEFLETLGEAQKKATRFGVADLFPLSRLLPV 4017

4016 IRSRSGD 3996

 

#290

>aaaa01011555.1 CYP71AD1 (indica cultivar-group) orth AC109595.1 $F chr 5 >99%

8324 NARRRLAPAPRGLPVIGNLHQVGALPHRALRALAAATGAPHLLRLRLGHVTALVASSPAA 8145

8144 AAAVMREHDHVFATRPYFRTAEILTYGFKDLVFAPYGEHWRHARRLCSEHVLSAARSH 7971

7970 RY 7965

7950 QEVALLVNAIRTEAAAAAVDVSKALYAFTNAVICRAVSGRLSREDEGRSELFRELIEE 7777

7776 NATLLGGFCVGDYFPALAWADAFLSGFAARACRNLRRWDELLEEVIAEHEARLRGGDDG 7600

7599 GGEEHREEDFVDVLLALQEESQRHDGSFKLTRDIIKSLLQDMFAAGTDTSFITLEWAMSE 7420

7419 LVKNPAAMRKLQDEVRRGGGATTAATPYLKAVVKETLRLHPPVPLLVPREC 7267

7266 ARDTDDDATVLGYHVAGGTRVFVNAWAIHRDAGAWSSPEEFRPERFLPGGGEAEAVDLRG 7087

7086 GHFQLVPFGAGRRVCPGMQFALATVELALASLVRLFDWEIPPPGELDMSDDPGFTVRRR 6910

6909 IPLRLV 6892

 

>AC109595.1 $F CYP71AD1 chr 5 39% to 71As 40% to 71Bs

44% to 71Cs clone OJ1212B02, Length = 126962

72095 MEIELSPVLLLLPFLLLGFLYLTGGVLRSGGNARRRLAPAPRGLPVIGNLHQVGALP 71925

71924 HRALRALAAATGAPHLLRLRLGHVTALVASSPAAAAAVMREHDHVFATRPYFRTAEILTY 71745

71744 GFKDLVFAPYGEHWRHARRLCSEHVLSAARSHRYGPMREQEVALLVNAIRTEAAAAAV 71571

71570 DVSKALYAFTNAVICRAVSGRLSREDEGRSELFRELIEENATLLGGFCVGDYFPALAWA 71394

71393 DAFLSGFAARACRNLRRWDELLEEVIAEHEARLRGGDDGGGEEHREEDFVDVLLALQE 71220

71219 ESQRHDGSFKLTRDIIKSLLQDMFAAGTDTSFITLEWAMSELVKNPAAMRKLQDEVRRGG 71040

71039 GATTAATPYLKAVVKETLRLHPPVPLLVPRECARDTDDDATVLGYHVAGGTRV 70881

70880 FVNAWAIHRDAGAWSSPEEFRPERFLPGGGEAEAMDLRGGHFQLVPFGAGRRVCPGMQFA 70701

70700 LATVELALASLVRLFDWEIPPPGELDMSDDPGFTVRRRIPLRLVAKPVGSEDDK* 70536

 

#383

>aaaa01019060.1 $FI CYP71AE1 (indica cultivar-group) one stop in exon 2

4042 MASLATVPNLPLLLLLHYALATFTASRARKNNKDRLPPSPLALLVIGHLLHLMGSLPRTSPSAASPHG 3839

3838 TGPTCSSGLAPCRCSLRRRRVPAAEAILRTHDHVFASRPRTVLLANIVFYRSRDVRFAPY 3659

3658 GDHWRQARKLVTTHLLSAKKVRSLRLAREEE (0) 3584

2413 VSLVMTKISKAATASAVVDIGQILRSFTNDMICRTVSGKCPRDDR*KRIFQELANETSLL 2234

2233 LGGFDIEEYFPVLARVGLVGKMMCLKAERLKKRWDELLEELINDHENDDHSCNLISDQND 2054

2053 EDFVDILLSVRQEYGFTREHVKAIL (0) 1979

1622 DVFFGGIDTSALVLEFTIAELMQRPRMLKKLQDEVRACIPKGQKIVSEVDINNMAYLRAV 1443

1442 IKEGIRLHPVAPVLAPHISMDDCNIDGYMIPSGTRVLVNVWAIGRDPRFWEDAEEFVPER 1263

1262 FIDSMSSAAANVNFTENDYQYLPFGYGRRMXPGMKFGIAVVEIMLANLMWKFDWTLPPG 1086

1085 TEIDMSEVFGLSVHRKEKLLLVPNNMSSC* 996

 

No japonica ortholog found 9/11/02

 

#382 = #425 = #40 reduce gene count by 2

>AQ573853 nbxb0085A03r CYP71AE2 (partial)   50% to 71A24 AQ691042 nbxb0086M20r

AQ795917.1 nbxb0058F03f CUGI Rice BAC genomic clone Length = 684

No indica ortholog found

MAARTWLWLLLSPLILLLLHYALALLTARRARKNPLPPSPPALPFIGHLHLIGALPHVSLCCLAT

KHAPDLMFLRLGTSLPVLVASSPCAAEAILRTHDDVFASRPRTVLADIIFYGSRDIGFAPYGEDWRQAR

 

#40 = #382 = #425 reduce gene count by 2

>AQ573853 nbxb0085A03r CYP71AE2 (partial)   50% to 71A24 AQ691042 nbxb0086M20r

AQ795917.1 nbxb0058F03f CUGI Rice BAC genomic clone Length = 684

No indica ortholog found 9/3/02

MAARTWLWLLLSPLILLLLHYALALLTARRARKNPLPPSPPALPFIGHLHLIGALPHVSLCCLAT

KHAPDLMFLRLGTSLPVLVASSPCAAEAILRTHDDVFASRPRTVLADIIFYGSRDIGFAPYGEDWRQAR

 

#425 = #382 = #40 reduce gene count by 2

>aaaa01032609.1 CYP71AE2 (indica cultivar-group) 70% to AAAA01019060.1

orth of AC132003.1 100%

138 VKIVMEKISKAAFAREAVDIGQILCSFTNDLACRVVSRKLVGDDRQKKLLQELVNKTIKL 317

318 LSIFNVEEYFSILARIGVIGKVMCARAERLKKKWDMLLKKLIADHESKCDSYLVCGRNK

    DDFVDILLSVRKEYGLTEEHVKAILE

1064 DVFIAGTQSSARVIEFTFAELMRKPHMLKKVQDEVRACIPNGQAIVSEVQVNNMTYLRAV 1243

1244 VKEVLRLHPVAPLLATHVSMADCNINGYMIPSGMRVLVNAWAIGRDERFWYDPEKFMPER 1423

1424 FVESVNGSATASVNFWVNNYQYLPFGSGRRMCLGMNFAMAVIEITLANLLWKFDWALP 1597

1598 AHAMEVDMSEEFGLSVRLKEKLLLVPKQHV* 1690

 

>AC132003.1 $F CYP71AE2 (japonica cultivar-group) chr 11 65% to aaaa01019060.1

MDEMAARTWLWLLLSPLI

LLLLHYALALLTARRARKNPLPPSPPALPFIGHLHLIGALPHVSLCCLATKHAPDLMFLR

LGTSLPVLVASSPCAAEAILRTHDDVFASRPRTVLADIIFYGSRDIGFAPYGEDWRQARK

LVNTHLLSVNKVQSLWLAREEE

34501 VKIVMEKISKAAFAREAVDIGQILCSFTNDLACRVVSRKLVGDDRQKKLLQELVNKTIKL 34322

34321 LSIFNVEEYFSILARIGVIGKVMCARAERLKKKWDMLLKKLIADHESKCDSYLVCGRNKD 34142

34141 DFVDILLSVRKEYGLTEEHVKAILE 34067 (0)

33575 DVFIAGTQSSARVIEFTFAELMRKPHMLKKVQDEVRACIPNGQAIVSEVQVNNMTYLRA 33399

33398 VVKEVLRLHPVAPLLATHVSMADCNINGYMIPSGMRVLVNAWAIGRDERFWYDPEKFMPE 33219

33218 RFVESVNGSATASVNFWVNNYQYLPFGSGRRMCLGMNFAMAVIEITLANLLWKFDWALPA 33039

33038 HAMEVDMSEEFGLSVRLKEKLLLVPKQHV* 32949

 

#266

>aaaa01010030.1a CYP71AF1 (indica cultivar-group) ortholog of AP003523.1a

see aaaa01010030.1c for ortholog

3362 LILSLAFVKLRPRNNGENPPPGPWQLPVIGSLHHLAGALPHRALRDLAARHGELMLLRLG 3541

3542 ELPVVVASSPAAAREVMRTHDAAFATRPQTATLRALTRDGLGVAFAPQGEHWRCLRKLCV 3721

3722 TELLGARRVRCLRRAREAEAAALVASLSTTTPEPVNVSSLVARYVTDAVVRAVVGDRI 3895

3896 SDRDAFLERLEEGVKVAAGFTLADVFPSSRLARALSGTARRAEAHSREMTRLMD 4057

4058 GVIEEHRQRRAATGWRDEEDEDLLDVLLRIQKDGGLQIPLDMGTIRAIIIVSSPT 4222

4308 DLFSAGSETTGTTLQWAMAELMRNPAALRKAQAEVRGVLAGHSHVTEDALPDLHYLHL 4481

4482 VIKETLRLHVAVPLLLPRECQEPRLRVLGYDVPERAMVLVNAWAICRDTAVWGPDAEEFR 4661

4662 PERFDGGAVDFKGTDFEFVPFGAGRRMCPGVAFAVAIMELGLASLLFHFDWELAGGAAAG 4841

4842 ELDMAEGLGITARRKSDLWL 4901

 

#266

>aaaa01010030.1c CYP71AF1 (indica cultivar-group) AP003523.1a $F chr 6 100%; 1 diff with 1a; may be an accidental duplication in assembly; count only once

9521 LILSLAFVKLRPRNNGENPPPGPWQLPVIGSLHHLAGALPHRALRDLATRHGELMLLRLG 9700

9701 ELPVVVASSPAAAREVMRTHDAAFATRPQTATLRALTRDGLGVAFAPQGEHWRCLRKLCV 9880

9881 TELLGARRVRCLRRAREAEAAALVASLSTTTPEPVNVSSLVARYVTDAVVRAVVGDRI 10054

10055 SDRDAFLERLEE 10090

 

>AP003523.1a $F CYP71AF1 chr 6 clone P0416A11 six different genes

AQ271656 nbxb0026B09r 53% to 71A16 N-term

35% to 40% with 71A and 71B sequences possibly new subfamily in 71

112855 MEQYLFLATLLILSLAFVKLRPRNNGENPPPGPWQLPVIGSLHHLAGALPHRALRDLA 113028

113029 TRHGELMLLRLGELPVVVASSPAAAREVMRTHDAAFATRPQTATLRALTRDGLGVAFAPQ 113208

113209 GEHWRCLRKLCVTELLGARRVRCLRRAREAEAAALVASLSTTTPEPVNVS 113358

113359 SLVARYVTDAVVRAVVGDRISDRDAFLERLEEGVKVAAGFTLADVFPSSRLARALSGTAR 113538

113539 RAEAHSREMTRLMDGVIEEHRQRRAATGWRDEEDEDLLDVLLRIQKDGGLQIPLDMGTIRAIII 113730 (0)

113831 DLFSAGSETTGTTLQWAMAELMRNPAALRKAQAEVRGVLAGHSHVTEDALPDLHYLHL 114004

114005 VIKETLRLHVAVPLLLPRECQEPRLRVLGYDVPERAMVLVNAWAICRDTAVWGPDAEEFR 114184

114185 PERFDGGAVDFKGTDFEFVPFGAGRRMCPGVAFAVAIMELGLASLLFHFDWELAGGTAA 114361

114362 GELDMAEGLGITARRKSDLWLHATVSVPVPNTETS* 114469

 

#266

>aaaa01041444.1 CYP71AF1 (indica cultivar-group) orth AP003523.1a $F chr 6 100%

see AP003523.1a above for ortholog

17  EEFRPERFDGGAVDFKGTDFEFVPFGAGRRMCPGVAFAVAIMELGLASLLFHFDWELAGG 196

197 TAAGELDMAEGLGITARRKSDLWL 268

 

#456

>aaaa01066426.1 CYP71AG1 (indica cultivar-group) 39% to AP003434.1

C-helix to J-helix

    APHGPYWX (frameshift)

809 RARKASVRHLLSPPRVRAYRAVREQEVAALLRRVTEQACGGGVVRLSELLSGFAKDVA 636

635 GRIVLGVRAGAGGDGGGWR ARMDALLEESNVLLGAFHAGDYVPWLSWVSAVDGT 474

473 DARVRTAFEKIDRILDEIVDAAAARDTPSSSPGPGNG TDGDSDAFIHLLLSLQREGTEEW 294

293 RLTRDNVKALLE (0)

    DLFGAGTEATIIVLEWAMAELLRDKGAMGNLQREVR 57

 

no japonica ortholog found 12/25/03

 

>CK207340   FGAS018961 Triticum aestivum FGAS: Library 5 GATE 7 Triticum

           aestivum cDNA.

          Length = 1048

 

Query:     1 APHGPYWXRARKASVRHLLSPPRVRAYRAVREQEVAALLRRVTEQACGGGVVRLSELLSG 60

             APHGPYW RARK  V HLLSP RVRAYRAVRE+EV ALL +V  QA    VV LSELL+G

Sbjct:   130 APHGPYWLRARKTCVLHLLSPARVRAYRAVREEEVGALLDKVRRQA---RVVPLSELLAG 186

 

Query:    61 FAKDVAGRIVLGVRAGAGGDGGGWRARMDALLEESNVLLGAFHAGDYVPWLSWVSAVDGT 120

             F KDV GRIV G  A    DG G R ++DALLEE   LLG FH G+Y P    ++  DG

Sbjct:   187 FGKDVIGRIVFGASAATRADGWGARXQVDALLEEGKALLGTFHGGEYFPTWRGLAPWDGR 246

 

Query:   121 DARVRTAFEKIDRILDEIVDAAAARDTPSSSPGPGNG 157

             +A+V   F++I  +L+E+ D    R    +  G G G

Sbjct:   247 EAQVGKGFDRIHGVLEEMADPGE-RPMGGAQRGKGLG 282

 

MEDAILLFLLPV

ATTMSILLLLRAARSNPQKRPHRSKLATVPPPSPGGGALVGNLHXLAGGRLPHRALAALA

AAHGPVMLLRLGQVPAVVLSSPDAAREVMLAQDHVFATRPSLAIPSKLFYGCTDVAF APH

GPYWLRARKTCVLHLLSPARVRAYRAVREEEVGALLDKVRRQARVVPLSELLAGFGKDVI

GRIVFGASAATRADGWGARXQVDALLEEGKALLGTFHGGEYFPTWRGLAPWDGR

EAQVGKGFDRIHGVLEEMADPGERPMGGAQRGKGLG

 

#431

>aaaa01035499.1 CYP71AK1 = OLD CYP71Z11 (indica cultivar-group)

3% to AP005114.1b (partial)

no ortholog in known set, IN a new subfamily WITH AK120674 CYP71AK2 (64%)

3   TMPTTIQGYHIPAKTIAFINVWAIGRDPAAWDTPDEFRPERFMGSAVDFRGNDYKFIPFG 182

183 AGRRLCPGIILALPGLEMVIASLLYHFDWELPDGMDVQDLDMAEAPGLTTPPMNPVWLIP 362

363 RCRTI* 380

 

#475

>BI808626.1 CYP71AK2 = OLD CYP71Z12 (partial) clone D005B07.Length = 538 EST with numerous frameshifts similar to AAAA01035499.1 and 71Bs no ortholog found in indica

no extensions in htgs nr gss or est sections of Genbank 8/3/02

66% to AY104083.1 Zea mays

11 LFVHAWAIGRDPXAWXXPEEFRPDRFLXXSVDFRGNDYQLVPFGAAPRICPGISFXX

PVLEMALFALLHHFDWELPAGMXXXXXDMSEAPGLTTPLRVPLRLVPKRKARLPRHIYKRNVIGE*

 

>AK120674   CYP71AK2    1763 bp    mRNA    linear   PLN 29-OCT-2003

DEFINITION  Oryza sativa (japonica cultivar-group) cDNA clone:J013161I12, full

            insert sequence. PROBABLY NOT 71Z SUBFAMILY 45% TO 71T4

55% to 71T8 names need revision

  44 MYHYVFLAAVALLAVVGY 97

  98 GVKNRRRRSAKLPPSPPSVPFLGHLHLLGPLLHRSLHELHLRYGTDGGLLLLQLGRRRTL 277

 278 VVSTAAAAADLYRNHDLAFASRPLVAAAHKLSYGSKNITFAPFGEQWRRAKKTAVVHALS 457

 458 PRRVEAFAPVRAAEAAALVAATRRAADAAADGGAVELRDLLYSYTNAVVTRAATG 622

 623 AAGTTAEKLKQLLGNATSLVAGVQADDLLPGMAAKAVRWATGLEKQYDASMEEWDKF 793

 794 LSPIMAEHAEKKKKKREDIGAGEEDFIDVLLRLKEEDTELTDTHVKSRVVDLI 952

 953 AAATETTSVTLEWTMAELAANPRVMAKLQEEIARATGGKPAITEAEVGGMEYMKAVVKEV 1132

1133 LRLHPPAPILVPHESTAAAAVQGYEIPARTSLFVNAWAIGRDPAAWGSPEEFRPERFLA 1309

1310 GGPAVDFRGNDYQLVPFGAGRRICPGISFAVPVLEMALVALLHHFDWELPAGMR 1471

1472 AAELDMSEAPGLTTPLRVPLRLVPKRKAPLA* 1567

 

#218

>aaaa01007330.1b (indica cultivar-group) orth CYP72A17 $F AP002839 100%

8732 LMIDGADLWQVTMVLHEVLRLYPPVVMMNRRTYKEIELGGVRYPAGVMLSLPVLFIHR 8559

8558 DAAAWGHDAGEFDPGRFAEGVARACKDPGAGAFFPFSWGPRICIGQNFALLEAKVALGM 8382

8381 ILQRFAFELSPAYAHAPYTVLTLHPQHGVPVRLR 8280

 

>CYP72A17 $F AP002839 Oryza sativa genomic DNA, chromosome 1 36553-39431

AG025591.1 strain ND3008 PCR from rice genomic DNA clone T8121T.Length = 401

AG025107.1 strain NC2542 PCR from rice genomic DNA clone T5184T.Length = 504

AU071192 very similar to AQ050520 = 72A17

AP002744 CYP72A17 join(109468..109819,110022..110245,110529..110781,

111446..111819,111915..112346)

MGIGIGIGIGIGIGTGTGAALPFGEASPWSLLGGAVAALLLVWAAQMLEWAWLAPRRMERALR

AQGLRGTQYRFLHGDLTEDLRLVTAARSKPVPMDRPHDFIPRVAPLLHRALEEH (phase 1 intron)

GRVSFTWFGPMPRVTITDPDLVREVLSNKFGHF

EKTKLATRLSKLLVGGLVILHGEKWVKHRRIMNPAFHAEKLK (phase 0 intron)

RMLPAFSASCSELIGRWENAVAASVGKAELDIWPDFQNLSGDVISRAAFGVRHHEGRQ

IFLLQAEQAERLVQSFRSNYIPGLS (phase 2 intron)

LLPTENNRRMKAIDREIKSILRGIIEKRQKATKNGEASKDDLLGLLLQSNMDYYSDEDGKS

SKGMTVEEIIDECKLFYFAGMETTAVLLTWTMVALSMHPEWQDRAREEILQVFGRNKPDI

NGVSRLKV (phase 0 intron)

VTMVLHEVLRLYPPVVMMNRRTYKEIELGGVRYPAGVMLSLPVLFIHRDAAAWGHDAGEF

DPGRFAEGVARACKDPGAGAFFPFSWGPRICIGQNFALLEAKVALGMILQRFAFELSPAY

AHAPYTVLTLHPQHGVPVRLRRL*
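
The join(...) annotation quoted above lists exon boundaries, so exon lengths and the total coding length follow by simple arithmetic. This is a hedged sketch using the AP002744 CYP72A17 coordinates from this entry. The helper name exon_lengths is hypothetical, and the regular expression assumes plain "start..end" pairs with no partial-location symbols.

```python
import re

def exon_lengths(location):
    """Return the length in nucleotides of each start..end segment in a join() string."""
    pairs = re.findall(r"(\d+)\.\.(\d+)", location)
    # Coordinates are inclusive, so each segment is end - start + 1 long.
    return [int(end) - int(start) + 1 for start, end in pairs]

loc = ("join(109468..109819,110022..110245,110529..110781,"
       "111446..111819,111915..112346)")
lengths = exon_lengths(loc)
total = sum(lengths)
# A complete coding sequence should be a multiple of 3 nucleotides.
print(lengths, total, total % 3)
```

The same function works on complement(join(...)) strings, since only the numeric pairs are read; strand has to be tracked separately.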

 

#217

>aaaa01007330.1a (indica cultivar-group) orth CYP72A18 $F AP002839 100%

5638 LLQVTMILYEVLRLYPPVVFLTRRTYKEMELGGIKYPAEVTLMLPILFIHHDPDIWGKD 5814

5815 AGEFNPGRFADGISNATKYQTSFFPFGWGPRICIGQNFALLEAKMAICTILQRFSFE 5985

5986 LSPSYIHAPFTVITLHPQHGAQIKLK 6063

 

>CYP72A18 $F AP002839 Oryza sativa genomic DNA, chromosome 1 44993-41630

AU100789.1 Rice callus Oryza sativa cDNA clone C50810.Length = 419 C-term

AU102126.1 Rice callus cDNA clone C10756.Length = 571

AZ130306.1 OSJNBb0103O04r CUGI Rice BAC genomic Length = 320

C26802 36% TO 72  8/97 N-TERMINAL 19-67 REGION opposite end = C96903

C96903, C97406 58% IDENTICAL TO 72 C-TERM 65% to AQ050520

C96799, C28139 219-340 REGION 55% IDENTICAL TO 72 opposite end = C97406

D22332        48% TO 72     12/93  7/98 C-HELIX 89-191 REGION

AU081507.1 Rice callus Oryza sativa cDNA clone C12518_12Z.Length = 581

C26235        36% IDENTICAL TO 72     8/97 AMINO ACIDS 89-216 REGION

AP002744 complement(join(114545..114970,115379..115757,

116406..116650,116745..116965,117608..117908))

D21882        53% TO 72   5/93  7/98 245-352 REGION = 72A18

MLMMLGAASQWILAAAAAAAVAALLWLAVSTLEWAWWTPRRLERALRAQG

IRGNRYRLFTGDVPENVRLNREARKKPLPLGCHDIIPRVLPMFSKAVEEH (phase 1 intron)

GKPSFTWFGPTPRVMISDPESIREVMSNKFGHYGKPKPTRLGKLLASGVV

SYEGEKWAKHRRILNPAFHHEKIK (phase 0 intron)

RMLPVFSNCCTEMVTRWENSMSIEGM

SEVDVWPEFQNLTGDVISKTAFGSSYEEGRRIFQLQAESAERIIQAFRTIFIPGYW (phase 2 intron)

FLPTKNNRRLREIEREVSKLLRGIIGKRERAIKNGETSNGDLLG

LLVESNMRESNGKAELGMTTDEIIEECKLFYFAGMETTSVLLTWTLIVLS

MHPEWQERAREEVLHHFGRTTPDYDSLSRLKI (phase 0 intron)

VTMILYEVLRLYPPVVFL

TRRTYKEMELGGIKYPAEVTLMLPILFIHHDPDIWGKDAGEFNPGRFADG

ISNATKYQTSFFPFGWGPRICIGQNFALLEAKMAICTILQRFSFELSPSY

IHAPFTVITLHPQHGAQIKLKKI*

 

#423

>aaaa01029515.1 (indica cultivar-group) orth CYP72A19 $F AP002839 chr 1 100%

950 LYEVLRLYPPGIGFVRQTYKEMEIGGVKYPAGVMIELPLLFIHHDPDIWGSDVNE 786

785 FKPERFAEGISRASNDHGAFFPFGWGPRICMGQNFALLEAKMALCMILQRFEFELAP 615

614 SYTHAPHIVLMLRPMHGAPIKLR 546

 

>CYP72A19 $F AP002839 Oryza sativa genomic DNA, chromosome 1 comp(53699-51708)

AU100635.1 Rice callus clone C10787.Length = 594

AG024141.1 strain ND3053 PCR from rice genomic clone ND3053_0_734_1A.Length = 374

AP002744 complement(join(124623..125048,125181..125565,

125743..125987,126105..126584)) this annotation missing first 10 aa

MDPTSVPWSSMVYGLLGLALLWQVHRLLVRLWWQPRRLERALRAQGVRGTSYRFLTGDLKDYGRLS

KEAWARPLPLRCHDIAPRVAPFVHRTIAEHGKACLSWFGPIPKVTIADAEIAKDVLSNKMGHFEKLKFPVLS

KLLADGVANYEGEKWAKHRRILNPAFHLEKLK (phase 0 intron)

LMLPAFSACCEELVGRWAASLGSDGSNEIDVWPEMQSLTGDVISRTAFG

SSYLEGRRIFQLQAEQQELFMGAIQKISIPGYM (phase 2 intron)

SLPTKNNRRMYQIKNEVESIIRDLVQKR

MHAMKDGERTKDDLLGILLESSTRHADENGHSGPGMTIEEVMEECKVFYFAGMETTAILL

TWTMVVLSMHPEWQHRAREEV LSLFQKNKLDYEGLSKLKT (intron joint not correct, missing one base?)

VTMILYEVLRLYPPGIGFVRQTYKEMEIGGVKYPAG

VMIELPLLFIHHDPDIWGSDVNEFKPERFAEGISRASNDHGAFFPFGWGPRICMGQNFAL

LEAKMALCMILQRFEFELAPSYTHAPHIVLMLRPMHGAPIKLRAI

 

#304

>aaaa01012480.1 (indica cultivar-group) orth CYP72A20 $F AP002839 98% chr 1

4577 DTNNLLQVTMILHEVLRLYPPAVTLSRRTFKEIQIGGITYPAGVGLELPIILIHH 4413

4412 NTDVWGKDAHEFKPERFADGISKATKTNQRAFFPFGWGPRICIGQNFAMLEAKMVLCV 4239

4238 ILQNFEFQLSPSYTHAPYASVTLHPQHGAQIIL 4140

 

>CYP72A20 $F AP002839 Oryza sativa genomic DNA, chromosome 1 56581-59004

AP002744 join(129496..129808,130309..130529,130636..130883, 131004..131919)

MEVGVSLEAGKPAAAPWGMLYYGVPALLVLGALYRAAERCWLGPRRVAGALQGQGLRGTAYRFPAGDL

PENARRSKEARAKPMPPCHDIVPRVAPLLQDIVKEY (PHASE 1 INTRON)

GNVCITWFGTTPRVVIAEPELVKDILSNKFGHFEKFTLKSLGKLIALGLASYEGEKWARH

RRILNPAFHLEKLK (phase 0 intron)

HMLPAFSTCCSEMIDRWDSKLAGSDGPFELDIWQEFQNLTGDVISRTAFGSSFMEGRRI

FQLQEEQADRIIKTIQYIYIPGYL (phase 2 intron)

VSCRYFPTENNRRMKENSREIEGLLRGIIEKRSRAVENGELSGDDLLGLMLKSNMDSGE

PSNLRMSTEDVIEECKLFYFAGMETTSVLLTWTLVVLSMHPEWQHRAREEVLSAFGRDKP

NFDGLSRLKT (intron joint not correct as in gene 3)

VTMILHEVLRLYPPAVTLSRRTFKEIQIGGITYPAGVGLELPIILIHHNTDVWGKDAHEFKPERFADGISKAT

KTNQQAFFPFGWGPRICIGQNFAMLEAKMALCVILQNFEFQLSPSYTHAPYASVTLHPQH

GAQIILTRL*

 

#459

>aaaa01070145.1 (indica cultivar-group) orth CYP72A21 $F AP002839 chr 1, 100%

364 LYEVLRLYPPAITFTRKTYKQMEIGGVTYPAGVIVELPVLLIHHDPNIWGSDAHEFKPD 540

541 RFAEGISKASKNPGAFLPFGWGPRICIGQNFALLEAKMALCMILQCFKLELMPSYTH 711

712 APYSMVTLRPMHGAQIKLR 768

 

>CYP72A21 $F AP002839 Oryza sativa genomic DNA, chromosome 1 61993-63890

AZ045374.1 nbeb0080P16f CUGI Rice BAC genomic Length = 843

AQ857269.1 nbeb0005G04r CUGI Rice BAC genomic Length = 855

AQ865258.1 nbeb0025C03f CUGI Rice BAC genomic Length = 738

AP002744 join(134887..135417,135523..135767,135888..136272,

136380..136805) this annotation adds first 7 aa

AQ050520, AQ272173, AQ159375, AU031882 (opposite end = D24685)

AQ575977 nbxb0088K02r

D24685.1 RICR2374A Rice root cDNA clone R2374_1A.Length = 419 = 72A

MVLGAWLMSPASVPWSLLAYGVLGLVLLWQAGRLLHSLWWRPRRLELALRAQGLRGTRYRFLTGD

LGEHGRLNREAWARPLPLRCHDIAPRVAPFLHNAVREHGSACFTWFGPTPKVTITDPDLA

KGVLSNKFGHFEKPKFPTLTKLFSDSLANHEGEKWVKHRRILNPAFHLEKLK (phase 0 intron)

LMLPAFSACCEELVSKWMESLGSDGSYEVDVWP

EMQILTGDVISRTAFGSSYLEGRRIFQLQAEQTERLLKCMQKIVIPGYM (phase 2 intron)

SLPTKNNRKMHQIKKETDSILRGLVDKRMQA

MKEGECTKDDLLGLLLESNMRHTEEDGQSNHGLTIEEVIEECKLFYFAGMETTSVLLTWT

ILLLSMHPEWQDRAREEILGLFGKNKPEYEGLSRLKI (PHASE 0 intron)

VTMILYEVLRLYPPAVTFTRKTYKQMEIGGVTYPAGVIVELPVLLIHHDPNIWGSDAHEF

KPDR FVEGISKASKNPGAFLPFGWGPRICIGQNFALLEAKMALCMILQCFKLELMPSYTH

APYSMVTLRPMHGAQIKLRAI*

 

#379

>aaaa01018418.1 (indica cultivar-group) orth CYP72A22 $F AP002839 100%

2436 MVQVTMILYEVLRLYPPAVTLTRQTYKQIEIGGVTYPAGVIIELPLLLIHSDPDI 2272

2271 WGSDVHKFNPERFAEGISKASKDPGAFLPFSWGPRICIGQNFALLETKMALCMILQH 2101

2100 LELELALSYTHAPQSIITLRPTHGAQIKLR 2011

 

>CYP72A22 $F AP002839 Oryza sativa genomic DNA, chromosome 1 66435-68424

AP002744 join(139350..139880,140025..140278,140419..140806,

140914..141339)

MVLGAGLRCPASVPWSSLAYGLLGLVLLWQGGRLLHRLWWRPRRLELALRAQGLRGTRYRFLTGDL

GEHGRLNREAWARPLPLRCHDIAPRVAPFLHSSVREHGKACFSWFGPIPKVTIANPDLAKDVLSNK

FGHLEKHKFQGLTKLLSDGVASHEGEKWVKHRRILNPAFHLEKLK (phase 0 intron)

RMLPAFSTCCEELISRWMESLGSEGSYEVDVWPEMQSL

TGDVISRTAFGSSYLEGRRIFQLQAEQAERLLKCVQKIIIPGYM (phase 2 intron)

SLPTKNNRKMHQIKKEIDSILRGLIGKRMQAMREGESTKDDLLGLLLESNMRHTAEHGQSS

QGLTIEEVIEECKLFYFAGMETTSVLLTWTMLLLSMHPEWQDHAREEILGLFGKNKPEYE

GLSRLKI (intron joint not correct)

VTMILYEVLRLYPPAVTLTRQTYKQIEIGGVTYPAGVIIELPLLLIHSDPDIWGSDVHKF

NPERFAEGISKASKDPGAFLPFSWGPRICIGQNFALLETKMALCMILQHLELELALSYTH

APQSIITLRPTHGAQIKLRAI*

 

#373

>aaaa01017648.1 (indica cultivar-group) orth of CYP72A23 $F AP002839

2676 YLPTKKNRRMRRINSEVESILRGIIGKRMQAIAEGESTNDDLLGLLLESNMRHADENGRS 2497

2496 SPGMTTEDVIEECKLFYFAGMETTSVLLTWTMVVLSMHPEWQDRAREEVLGLFGRDKP 2323

2322 EYEGLSRLKTVSTRRNNN

2159 EVLRLYPPAIVFSRKTYKEMEIGGVVYPRGVILELPVLFIHHDREIWGRDVHEFR 1995

1994 PERFAEGISRASNDRGAFLPFGWGPRVCIGQNFALLEAKMALCMILQRFEFELAASYT 1821

1820 HAPHTVMTL 1794

 

>CYP72A23 $F AP002839 Oryza sativa genomic DNA, chromosome 1 72149-74091

AP002744 join(145064..145603,145697..145941,146081..146465,

146581..147006)

AU067870 Rice callus Oryza sativa cDNA clone C10320_12Z, CYP72 like Nterm

AU067871 AU067869 very similar to AQ050520 K-helix to heme

MVFGELFSRASLPPPWSLLAYGLVGPV

LLWQAGRLLDRLWWRPRRLERALRAQGLRGTAYRFLLGDLREFGRLNEEAWSSAPLPLGC

HDIVPRVTPFVHRNVRDNGRPCCFSWFGPIPSVTITDPAQVRDVLSNKLGHFEKPKLPAL

TKLLADGLTSHDGEKWVKHRRIMNPAFHLEKLK (PHASE 0 INTRON)

LMLPAFSTCCEELVGKWMDSLGPDGSCELDVWPEMQSLTGDVISRTAFGSSYSEGR

RIFQLQTEQAELFIGAIQKFVIPGYM (PHASE 2 INTRON)

YLPTKKNRRMRRINSEVESILRGIIGKRMQAIAEGESTNDDLLGLLLESNMRHADENGRS

SPGMTTEDVIEECKLFYFAGMETTSVLLTWTMVVLSMHPEWQDRAREEVLGLFGRDKPEY

EGLSRLKT (PHASE 0 INTRON)

VTMVLYEVLRLYPPAIVFSRKTYKEMEIGGVVYPRGVILELPVLFIHH

DREIWGRDVHEFRPERFAEGISRASNDRGAFLPFGWGPRVCIGQNFALLEAKMALCMILQ

RFEFELAASYTHAPHTVMTLHPMHGAQMKLRMI*

 

#86

>aaaa01002256.1a (indica cultivar-group) orth CYP72A24 $F AP002839 chr 1 100%

18321 GVRYPAGVVLTLPLLCVHHDKDVWGADADEFRPERFAEGISKASREAPAFFPFGWGP 18491

18492 RICIGQNFALLEAKMGLSMILQRF 18563

 

>CYP72A24 $F AP002839 Oryza sativa genomic DNA, chromosome 1 109970-113787

AQ864347.1 nbeb0023A03r CUGI Rice BAC genomic Length = 735

MVVFAAGDERPLMLVWAAVAGAVLAWCAVRAMEWAWWRPRRLERALRAQGLRGTPYRSPA

GDAPLNVQLSAEARARTMPLGCHDVVPRAMPLFHQAMKEH (PHASE 1 INTRON)

GKVSITWFGPVPRVTITKPELVREVLSNKFGHFEKLKFGRFQRLLHNGLGSHEGEKWAKH

RRIINPAFHLEKLK(PHASE 0 INTRON)

RMLPAFAACCTELVDK

WEGLAKGGDEPYEVDVWPEMQSLTGDVISRAAFGSSYLEGKRIFQLQGEQIELIVATMNK

IHIPGYI (PHASE 2 INTRON)

HLPTKSNRRMKQIAAEIEGMLKRIIAKRESALKAGEASSDDDLLGLLLESNLDHSKGNGGA

ASSGISIDDVIGECKLFYFAGMETTSVLLTWTMVVLSMHPEWQDRAREEVLHVFGSRAPD

YDGLSRLRI (PHASE 0 INTRON)

VTMVLYEVLRLYTPLTALQRKTYKPMELGGVRYPAGVVLTLPLLCVHHDKDVWGADADEF

RPERFAEGISKASREAPAFFPFGWGPRICIGQNFALLEAKMGLSMILQRFSFDLSPSYTH

APFPVGLLQPEHGAQVRLTRLN*

 

#87

>aaaa01002256.1b (indica cultivar-group) orth CYP72A25 $F AP002839 2 diffs

21490 RRTKANAREVRELLKGIITKRESAMKDGHAVNDDLLGLLLETNIKESQEAGSSKP 21654

21655 TMTTKDIIEELKLLYFAGSDTTAVLLTWTMVLLSMHPKWQDRAREEVLRVFGKNSPDFE 21831

21832 GINHLK

      VTMILHEVLRLYPPILLLGRE 22008

22009 AYEETELGGVTYPPGVTFALPIACIHHDPDVWGEDVGEFKPERFAEGVSRASKDSP 22176

22177 ALVPFSWGPRICVGQNFALLEAKMALSMILQRF 22275

 

>CYP72A25 $F AP002839 Oryza sativa genomic DNA, chromosome 1 115472-117492

AG021553.1 strain NC0134 PCR from rice genomic DNA clone NC0134_0_102_1A.

Length = 636 AG021553

AG023207.1 strain NC2780 PCR from rice clone NC2780_0_701_1A.Length = 474

MEIVDGASPPLHPWSLLLYALGALAALWWAWRALDRFWLRPRRLGRALRSQGLRGTDYRFPSGYLKEFARLL

AAALAAPMPPLSHDVASRALPFELAAIKQH (PHASE 1 INTRON)

GNVCVTWFGPEVRVIVSDPKLFREILANKNGRFGKQKSILWVQNLLADGLTSHQGEKWVA

HRRIMNHAFHLEKLK (PHASE 0 INTRON)

VQRMLPAFAACSSELISRWQDSVGADGAQEIDVWPEFQNLTGDVISRSAFGSSFSEGRRI

FQLQSEQARNVMKMAKALYFPGYR (PHASE 2 INTRON)

FLPTELNRRTKANAREVRELLKGIITKRESAMKDGHAVNDDLLGLLLETNIKESQEAGSS

KPTMTTKDIIEELKLLYFAGSDTTAVLLTWTMVLLSMHPEWQDRAREEVLRVFGKNSPDF

EGINHLKV (PHASE 0 INTRON)

VTMILHEVLRLYPPILLLGREAYEETELGGVTYPPGVTFALPIAGIHHDPDVWGEDV

GEFKPERFAEGVSRASKDSPALVPFSWGPRICVGQNFALLEAKMALSMILQRFSFGLSPS

YTHAPFPIPTLQPQHGAQIKLTKL*

 

#259

>aaaa01009673.1 CYP72A31P (indica cultivar-group) orth AP003278 $P chr 1 96%

4799 LYEVLRLYPPFIEIGRKTYKEMEIGGVTYPAGVSIKIPVLFIHHDPDTWGSDVHEFKPE 4975

4976 RFSEGISKASKDPGAFLPFGWGPRICIGQNFALLEAKMALCLILQRLEFELAPSYTH 5146

5147 APHTMVTLHPMHGAQI 5194

 

>AP003278 $P CYP72A31P chromosome 1, PAC clone:P0518F01, similar to 72A22 missing N-term half

AP003330.1 chromosome 1 clone B1085F01 CYP72A like

Pseudogene, no N-term in 9000bp upstream until next p450 ends near 22400

31539 SLPIENNRKMHQINKEIESILRGLIGKRMQAMKEGESTKDDLLGILLESNTKHMEENGQSS 31718

31719 QGLTIKDIVEECKLFYFAGAETTSVLLTWTMLLLSIHPEWQDHAREEIMGLFRKNKPDYE 31898

31899 GLSRLKI

32030 VTMIFYEVLRLHPPFIEIGWKTYKEMEIGGVTYPAGVSIKIPVLFIHHDPDSWGSDVHEF 32209

32210 KPERFSEGISKASKDPGAFLPFGWGPRICIGQNFALLESKMALCLILQRLEFELAPSYTH 32389

32390 APHTMVTLHPMHGAQMKVRAI 32452 or frameshift after KVR to

      SYMIISDYSVFYYYNSWL*

 

Note there are two more P450s on AP003278 called a and b

 

#259

>aaaa01035549.1 CYP72A31P (indica cultivar-group) orth AP003278 $P chr 1

see aaaa01009673.1 for ortholog

964 EVLRLHPPFIEIGWKTYKEMEIGGVTYPAGVSIKIPVLFIHHDPDSWGSDVHEFKPER 791

790 FSEGISKASKDPGAFLPFGWGPRICIGQNFALLESKMALCLILQRLEFELAPSYTHA 620

619 PHTMVTLHPMHGAQM 575

 

#332

>aaaa01013993.1 CYP72A32 (indica cultivar-group) orth AP003278a $F chr 1 100%

6953 LYEVLRLYPPFIELKRRTYKEMKIGGVTYPAGVIINLPVLFIHHDLKIWGSDVHEFKPE 6777

6776 RFSEGISKASKDPGAFLPFGWGPRICIGQNFALLEAKMALCLILQRLEFELAPTYTH 6606

6605 APHTMITLHPMHGAQIKIR 6549

 

>AP003278a $F CYP72A32 19863-22437 chromosome 1, PAC clone:P0518F01, similar to 72A22

AP003330.1 50023-47446 chromosome 1 clone B1085F01, CYP72A like 536aa

AP004738.1 Oryza sativa chromosome 6 clone OSJNBa0090D06 chrom. conflict

50023 MVLGGWLLMWAPASSPTILVAFGLLFG

49942 LVLAWQ AGLQLHRLWWRPRRLEKALRARGLRGSSYRFLTGDLAEESRRRKEAWARPLPLR 49763

49762 CHDIAPRIEPFLHDAVVRPEQHYGKPCITWLGPTPEVHVTDPELAKVVMSNKFGHFEKIR 49583

49582 FQALSKLLPQGLSYHEGEKWAKHRRILNPAFQLEKLK 49472 (0)

49071 LMLPVFSACCEELISRWMGAIGSDGSYEVDCWPELKSLTGDVISRTAFGSSYLEGR 48904

48903 RIFELQGELFERVMKSVEKIFIPGYM 48826 (2)

48363 YLPTENNRKMHQINKEIESILRSMIGKRMQAMKEGESTKDDLLGILLESNMRHT 48202

48201 EENSQSSQGLTIKDIMEECKLFYFAGADTTSVLLTWTILLLSMHPEWQDRARKEILGLFG 48022

48021 KNKPEYDGLNNLKI (0)

      VTMILYEVLR 47842

47841 LYPPFIELKRRTYKEMKIGGVTYPAGVIINLPVLFIHHDLKIWGSDVHEFKPERFSEGIS 47662

47661 KASKDPGAFLPFGWGPRICIGQNFALLEAKMALCLILQRLEFELAPTYTHAPHTMITLHP 47482

47481 MHGAQIKIRAI* 47446

 

#113

>aaaa01003187.1a CYP72A33 (indica cultivar-group) orth of AP003278b $F chr 1, 82% to 72A22

12486 FAGADTTSVLLTWTMLLLSMHPEWQDRAREEILGLCGKNKPDYDGLSRLKIVSIL

      VTMILYEVLRLYPPFIELTRKTYKEMEIGGIT 12130

12129 YPAGVIINLPVMFIHHDPEIWGSDVHEFKPERFSEGISKASKDLGAFLPFGWGPRICIGQ 11950

11949 NFALLEAKMALCLILQRLEFE 11887

 

>aaaa01003187.1b CYP72A33 (indica cultivar-group) orth of AP003278b $F chr 1, 82% to 72A22; these two seem identical, only count once

20687 FAGADTTSVLLTWTMLLLSMHPEWQDRAREEILGLCGKNKPDYDGLSRLKIVSIL

      VTMILYEVLRLYPPFIELTRKTYKEMEIGGIT 20331

20330 YPAGVIINLPVMFIHHDPEIWGSDVHEFKPERFSEGISKASKDLGAFLPFGWGPRICIGQ 20151

20150 NFALLEAKMALCLILQRLEFE 20088

 

>AP003278b $F CYP72A33 chromosome 1, PAC clone:P0518F01, 82% to 72A22

AP003330.1 59493-56536 chromosome 1 clone B1085F01, CYP72A like 516aa

N-term does not match in both, 3278 has MVLGGGWLSMWAPASSPTILAAFGLVGLVLAWQ

before the AGLQ seq.

59493 MVLEGK AGLQLHRLWWRPRRLEKALRARGLRGSRYRFL

      TGDLAEEGRRRKEAWARPLPLRCHDIAPRVEP 59284

59283 FLHGAVGVGAAHGKPRITWFGPTPEVHVADPELARVVLSNKFGHFEKVSFPELSKLIPQG 59104

59103 LSAHEGEKWAKHRRILNPVFQLEKLK 59026 (0)

58537 LMLPV FSACCEELISRWMGSIGSDGSYEVDCWPEFKSLTGDVISRTAFGSSYLEG 58373

58372 RRIFELQGELFERVIKSIQKMFIPG 58298 (2)

57483 YLPTENNRKMHQMNKEIESILRGMIGKRMQAMKEGESTKDDLLGILLESN 57334

57333 TRHMEVNGQSNQGLTIKDIMEECKLFYFAGADTTSVLLTWTMLLLSMHPEWQDRAREEIL 57154

57153 GLFGKNKPDYDGLSRLKI (0) VTMIL 56977

56976 YEVLRLYPPFIELTRKTYKEMEIGGITYPAGVIINLPVMFIHHDPEIWGSDVHEFKPER 56800

56799 FSEGISKASKDPGAFLPFGWGPRICIGQNFALLEAKMALCLILQRLEFELATSYTHVPHT 56620

56619 IISLHPMHGAQIKVKSYMTISDYSVFY* 56536

 

note: there is one more gene on AP003278

 

#258

>aaaa01009485.1 CYP72A34 (indica cultivar-group) ortholog of AC119289.1 AQ916317.1

101end

1444 GKLSFIWFGPVPRVMIPDPELVREVFNKFDQFGKPKMIRVGKLLATGVVSYEGEKWAKHR 1623

1624 RILNHAFHHEKIK 1662 (0?)

1906 RMLPVFANCCTEMVTRWENSISLEAASEIDVWPEFRNLTGDVISRTAFGSSYQEGRRIF 2082

2083 QLQEELAQYLTEALQKLFIPGYW 2151 (2?)

2765 YRYLPTKNNRRMREIDREVHKILLEIIGNKERAITNGENSNDNMLGLLVESNTKQPELGM 2944

2945 STDDIIEECKLFYFAGMETTSVLLTWTLIVLSMHPEWQERAREEVLHHFGRTTTPDYDSL 3124

3125 SRLKI 3139 (0?)

3654 VTMILYEVLRLYPPVVLLNRRTFKETNLGGIKFPADMNLILPILFIHHDPEIWGKDASEF 3833

3834 NPGRFADGISNASKYHDASFFPFGWGLRICIGQSFALLEAKMALSMILQRFSLELSPSYI 4013

4014 HAPYIVLTLRPQHGAQIKLKRI* 4082

 

>AC119289.1 $F CYP72A34 (japonica cultivar-group) chromosome 5 clone

AQ916317.1 nbeb0063E04r CUGI Rice BAC genomic Length = 521 52% to 72A7

AQ871967.1 nbeb0045H22r CUGI Rice BAC genomic Length = 824

50424 MLIMLGLGLVPAGAAAALAVALVCLAAAAWWTVERAPRRLERALRAQGVGGGRY

      QLLLGGDVAENGRLNREAWSRPLPLGCHRIAPRVLPLLWNAVRDH (1) 50720

54713 GKLSFIWFGPVPRVMIPDPELVREVFNKFDQFGKPKMIRVGKLLATGVVSYEGEKWAK 54886

54887 HRRILNHAFHHEKIK (0) 54931

55175 RMLPVFANCCTEMVTRWENSISLEAASEIDVWPEFRNLTGDVISRTAFGSSYQEGRRIF 55351

55352 QLQEELAQYLTEALQKLFIPGYW (2?) 55420

56034 YRYLPTKNNRRMREIDREVRKILLEIIGNKERAITNGENSNDDMLGLLVESNTKQPELRM 56213

56214 STDDIIEECKLFYFAGMETTSVLLTWTLIVLSMHPEWQERAREEVLHHFGRTTTPDYDSL 56393

56394 SRLKI 56417 (0?)

56923 VTMILYEVLRLYPPVVLLNRRTFKETNLGGIKFPADMNLILPILFIHHDPEIWGKDASEF 57102

57103 NPGRFADGISNASKYHDASFFPFGWGPRICIGQSFALLEAKMALSMILQRFSLELSPSYI 57282

57283 HAPYIVLTLRPQHGAQIKLKRI* 57351

 

#56

>aaaa01001473.1 CYP72A35 (indica cultivar-group) orth of AP002899 $F 99%

6721 IVFLQVTMILHEVLRLYPPVVFLQRTTHKEIELGGIKYPEGVNFTLPVLSIHHDPSIWG 6897

6898 QDAIKFNPERFANGISKATKFQTAFFSFAWGPRICLGQSFAILEAKMALATILQS 7062

7063 FSFELSPSYTHAPHTVLTLQPQYGSPIKLK 7152

 

>AP002899 $F CYP72A35 52% to 72A14 = AQ161379

complement(join(51630..52055,52989..53367,53942..54186,

54546..54766,55513..55801))

MLGEAASPWSLAGAGAAVALLWLCAWTLQWAWWTPRRLERALRA

QGLRGTRYRLFIGDVAENGRLNREAASRPLPLGSHDVVPRVMPFFCNVLKEHGKLSFV

WTGPKPFVIIRDPDLAREILSNKSGNFAKQTTAGIAKFVVGGVVTYEGEKWAKHRRIL

NPAFHQEKIKRMLPVFLACCTKMITRWVNSMSSEGISELDVWDEFQNLTGDVISRTAF

GSSYQEGWRIFQLQEEQAKRVLKAFQRIFIPGYWYLPIENNRRIREIDQEIRTILRGI

IVKRDKAVRNGEGSNDDLLGLLVESNMRQSNEKEDVGMSIEDMIEECKLFYAAGSETT

SMLLTWTLILLSMHPEWQEQAREEVMHHFGRTTPDHDGLSRLKIVTMILHEVLRLYPP

VVFLQRTTHKEIELGGIKYPEGVNFTLPVLSIHHDPSIWGQDAIKFNPERFANGVSKA

TKFQTAFFSFAWGPRICLGQSFAILEAKMALATILQSFSFELSPSYTHAPHTVLTLQP

QYGSPIKLKKL

 

#156

>aaaa01004480.1 CYP72A36P (indica cultivar-group) ortholog of AP003142.2

7679 YLPIENNRRIREIY*EIRKILRGLIVKGDKAIRNGENTNDDLLGLLVESNMRQSNEREEV 7500

7499 GMSIED 7482

 

>AP003142.2 $P CYP72A36P chromosome 1, PAC clone:P0435H01 probable pseudogene

53% to 72A15 86% to AP002899

5354 YLPIENNRRIREIY*EIRKILRGLIVKGDKAIRNGENTNDDLLGLLVESNMRQSNERE 5181

5180 EVGMSIED 5157

1115 IIEECRLFYFAGSETTS 1065 frameshift

1065 MLLT*TLIMLSMHPEWQERAREEVMHHFRRTTPDHDGLSRLKIVHM 928

 

#187

>aaaa01005990.1 CYP72A37P (indica cultivar-group) orth of AP004019.1b

5834 LSLMSLAFPTCNEELVWR*TE 5772

5772 NPFNDDGLCVLLDVWPEIQRFH*DAISRTASGGGYRGRRRIFQLQSEQSE 5623

 

>AP004019.1b $P CYP72A37P chromosome 2 clone OJ1118_C03 similar to CYP72A23 of rice Pseudogene fragment 2

37945 LSLMSLAFPTCNEELVWR*TE 37883 frameshift

37883 NPFNDEGLCVLLDVWPEIQRFH*DAISRTASGGGYRERRRIFQLQSEQSE 37734

 

#188

>AP004019.1a $P CYP72A38P chromosome 2 clone OJ1118_C03 similar to CYP72A23 of rice Pseudogene fragment 1 no indica ortholog 9/7/02

19871 FSTRKNSLLGYKTESLGDDGLCELLDIWREMQRLNGCHFPHSIRQPSYCEMRRIF 19707

 

#147

>aaaa01004255.1 $PI CYP73A35P (indica cultivar-group) ortholog of 73A35P

16157 MDLLFVERLLVGLLAAAVVAIAVSKLRGRKLRLPPGPTPV 16038

16037 PVFGNWLQVGDDLNHRNLAALARRFGDIFLLRMGQRNLVVVSSPPLAREVLHTQGVEFGS 15858

15857 RTRNVVFDIFTGKGQDMVFTVYGDHWRKMRRIMTVPFFTGKVVQRHRAGWEAEAAAVVDG 15678

15677 LRADPA 15660 (fs)

15660 RRRL*LMMYNNVYRIMFDRRFESADDPLFLRLKALNGERSRLAQSFEYNYGDFIPILRPF 15481

15480 LRGCLRICEEVKETRLKLFKDFFLEERK

      KAMDNNGLKCAIDHILEAQQKGEINEDNVLYIVENINVA

15092 AIETTLWSMEWAIAEL 15045

      VNHGEIQEKLRRELDTVLGPGRQITEPDTHRLP 14944

14943 YLQAVVKETLRLRMAIPLLVPHMNLRDAELAGYGIPAESKVLVNAWYLANDPGRWRRPEE 14764

14763 FRPERFLEEERHVEANGNDFRYLPSGAGRRSCPGIVLALPILGVTIGRLVQNFELLP 14593

14592 PPGKDRVDTTEKGGQFSLHILKHSTIVAKPRAF* 14491

 

>CYP73A35P   $P Oryza sativa (rice)  GenEMBL AP003446.1 March 29, 2001

AP003302.1  Feb. 21, 2001 Both sequences have the same frameshifts

MDLLFVERLLVGLLAAAVVAIAVSKLRGRKLRLPPGPTPVPVFGNWLQVGDDLNHRNLAA

LARRFGDIFLLRMGQRNLVVVSSPPLAREVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFT

VYGDHWRKMRRIMTVPFFTGKVVQRHRAGWEAEAAAVVDGLRADPAAA 168 (25 nuc. deletion and frameshift in both sequences)

175 RRRLQLMMYSNVYRIMFDRRFESADDPLFLRLKALNGERSRLAQSFEYNYGDFIPILRPF

LRGYLRICEEVKETRLKLFKDFFLEERK (2?)

NGLKCAIDHILEAQQKGEINEDNVLYIVENINVA (1)

AIETTLWSMEWAIAEL (2 nuc. insertion and frameshift in both seqs.)

VNHGEIQEKLRRELDTVLGPGRQITEPDTHRLPYLQAVVKETLRLRMAIPLLVPHMNLRD

AELAGYGIPAESKVLVNAWYLANDPGRWRRPEEFRPERFLEEERNVEANGNDFRYLPSGA

GRRSCPGIVLALPILGVTIGRLVQNFELLPPPGKDRVDTTEKGGQFSLHILKHSTIVAKPRAF*
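
Header lines (those starting with ">") carry an accession, a CYP name and, in many cases, a "$"-prefixed tag such as $F, $P, $FI or $PI. The sketch below extracts these fields without interpreting the tag, since its meaning is not defined in this part of the listing. The function name and the returned keys are hypothetical.

```python
import re

FLAG = re.compile(r"\$([A-Z]+)")
# CYP name: family number, subfamily letter(s), member number, optional P suffix.
CYP = re.compile(r"\bCYP(\d+)([A-Z]+)(\d+)(P?)\b")

def parse_header(line):
    if not line.startswith(">"):
        return None
    fields = line[1:].split()
    flag = FLAG.search(line)
    cyp = CYP.search(line)
    return {
        "accession": fields[0] if fields else None,
        "flag": flag.group(1) if flag else None,  # tag left uninterpreted
        "cyp": cyp.group(0) if cyp else None,
        "family": int(cyp.group(1)) if cyp else None,
        "subfamily": cyp.group(2) if cyp else None,
        # Names ending in P are the ones annotated as pseudogenes here.
        "pseudogene_name": bool(cyp and cyp.group(4)),
    }

h = parse_header(
    ">aaaa01004255.1 $PI CYP73A35P (indica cultivar-group) ortholog of 73A35P"
)
print(h["accession"], h["flag"], h["cyp"], h["pseudogene_name"])
```

The tag is searched anywhere in the line because it appears both before the CYP name and near the end of the header in different entries.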

 

#90

>aaaa01002376.1 $FI CYP73A38 (indica cultivar-group) ortholog to AQ256364.1

73% to 73A5

899  MDALLVEKVLLGLFVAAVLALVVAKLTGKRLRLPPGPAGAPIVGNWLQVGDDLN 1060

1061 HRNLMALARRFGDILLLRMGVRNLVVVSSPDLAKEVLHTQGVEFGSRTRNVVFDIFTGKG 1240

1241 QDMVFTVYGDHWRKMRRIMTVPFFTNKVVAQNRAGWEEEARLVVEDVRRDPTAATSGVVI 1420

1421 RRRLQLMMYNDMFRIMFDRRFDSVDDPLFNKLKAFNAERSRLSQSFEYNYGDFIPVLRPF 1600

1601 LRRYLARCHQLKSQRMKLFEDHFVQERK 1684 (2)

3697 RVMEQTGEIRCAMDHILEAERKGEINHDNVLYIVENINVA 3816 (1)

4917 AIETTLWSIEWGIAELVNHPSIQSKVREEMASVLGGAAVTEPDLERLPYLQAVVKETLRLRMAIP 5093

5094 LLVPHMNLADGKLAGYDIPAESKILVNAWFLANDPKRWVRPDEFRPERFLEEEKAVE 5264

5265 AHGNDFRFVPFGVGRRSCPGIILALPIIGITLGRLVQSFDLLPPPGMDKVDTT 5423 (fs)

EKPGQFSTQILKHATVVCQ (fs)

PIDA*
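
Where consecutive segments of an entry do not have adjoining coordinates, the jump presumably marks an intron (or an unsequenced gap). The sketch below lists such jumps from (start, end) pairs. The function name is hypothetical, and the sizes are only approximate because split codons at the boundaries are not resolved by these coordinates.

```python
def gaps_between(segments):
    """Return (previous_end, next_start, gap_length) for non-adjacent segments."""
    gaps = []
    for (_, e1), (s2, _) in zip(segments, segments[1:]):
        gap = abs(s2 - e1) - 1  # abs() lets this work for descending coordinates
        if gap > 0:
            gaps.append((e1, s2, gap))
    return gaps

# Coordinates taken from the aaaa01002376.1 (CYP73A38) entry.
cyp73a38 = [(1421, 1600), (1601, 1684), (3697, 3816), (4917, 5093)]
for prev_end, next_start, length in gaps_between(cyp73a38):
    print(prev_end, next_start, length)
```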

 

>AQ256364.1 CYP73A38 (partial) nbxb0016O01r CUGI Rice BAC clone nbxb0016O01r.Length = 607

84% to 73A5 82% to AP003446 probable CYP73A rice gene

ortholog to aaaa01002376.1 complete

AQ576280.1 nbxb0088L20r CUGI Rice BAC genomic clone

AQ289078.1 nbxb0034K09r CUGI Rice BAC genomic clone

(2) RRVMEQTGEIRCAMDHILEAERKGEINHDNVLYIVENINVA (1)

 

#191

>aaaa01006093.1a $FI CYP73A39 (indica cultivar-group) 99% to AP004850.1b

6441 MAASAMRVAIATGASLAVHLFVKSFVQAQHPALTLLLP

6327 VAVFVGIAVGAKGGSGGDGKAPPGPAAVPVFGNWLQVGNDLNHRFLAAMSARYGPVFRLR 6148

6147 LGVRNLVVVSDPKLATEVLHTQGVEFGSRPRNVVFDIFTANGADMVFTEYGDHWRRMRRV 5968

5967 MTLPFFTARVVQQYKAMWEAEMDAVVDDVRGDAVAQGAGFVVRRRLQLMLYNIMYRMMFD 5788

5787 ARFESVDDPMFIEATRFNSERSRLAQSFEYNYGDFIPILRPFLRGYLNKCRDLQSRRLAF 5608

5607 FNNNYVEKRR (2) 5578

5431 NKLRCAIDHILEAEKNGELTAENVIYIVENINVAAIETTLWSIEWALAEVVNHPAVQSKV 5252

5251 RAEINDVLGDDEPITESNIHKLTYLQAVIKETLRLHSPIPLLVPHMNLEEAKLGGYTIPK 5072

5071 GSKVVVNAWWLANNPALWENPEEFRPERFLEKESGVDATVAGKVDFRFLPFGVGRRSCPG 4892

4891 IILALPILALIVGKLVRSFEMVPPPGVEKLDVSEKGGQFSLHIAKHSVVAFHPISA* 4721

 

>AP004850.1b $F CYP73A39 (japonica cultivar-group) chromosome 2 clone OJ1342_D02

100% identical to AP004859.1a AU093333.1

48977 MAASAMRVAIATGASLAVHLFVKSFVQAQHPALTLLLPVAVFVGIAVGAKGGSGGDGKAP 49156

49157 PGPAAVPVFGNWLQVGNDLNHRFLAAMSARYGPVFRLRLGVRNLVVVSDPKLATEVLHTQ 49336

49337 GVEFGSRPRNVVFDIFTANGADMVFTEYGDHWRRMRRVMTLPFFTARVVQQYKAMWEAEM 49516

49517 DAVVDDVRGDAVAQGTGFVVRRRLQLMLYNIMYRMMFDARFESVDDPMFIEATRFNSERS 49696

49697 RLAQSFEYNYGDFIPILRPFLRGYLNKCRDLQSRRLAFFNNNYVEKRR 49840

      NKLRCAIDHILEAEKNGELTAENVIYIVENINVAAI 50094

50095 ETTLWSIEWALAEVVNHPAVQSKVRAEINDVLGDDEPITESSIHKLTYLQAVIKETLRLH 50274

50275 SPIPLLVPHMNLEEAKLGGYTIPKGSKVVVNAWWLANNPALWENPEEFRPERFLEKESGV 50454

50455 DATVAGKVDFRFLPFGVGRRSCPGIILALPILALIVGKLVRSFEMVPPPGVEKLDVSEKG 50634

50635 GQFSLHIAKHSVVAFHPISA 50694
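
The percent values quoted between ortholog pairs (for example 99% for this pair) can be illustrated with a simple position-by-position comparison. This is only a sketch for stretches that are already aligned and of equal length; it is not the method used to produce the values in this listing, and the function name is hypothetical.

```python
def percent_identity(a, b):
    """Return (matches, percent) for two pre-aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches, 100.0 * matches / len(a)

# The same 38-residue window from the indica and japonica CYP73A39 entries.
indica = "DAVVDDVRGDAVAQGAGFVVRRRLQLMLYNIMYRMMFD"
japonica = "DAVVDDVRGDAVAQGTGFVVRRRLQLMLYNIMYRMMFD"
matches, pct = percent_identity(indica, japonica)
print(matches, round(pct, 1))
```

A short window containing one difference scores lower than the full-length comparison, so values from fragments should not be compared directly with the whole-sequence percentages.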

 

#192

>aaaa01006093.1b $FI CYP73A40 (indica cultivar-group) 99% to AP004850.1a

98% to AAAA01006093.1a

11410 MAASVVRVAIATGASLAVHLFVKSFLQAQHPALTLLLP

11524 VAVFAGIAVGAKGGNGGDGKAPPGPAAVPVFGNWLQVGNDLNHRFLAAMSARYGPVFRLR 11703

11704 LGVRNLVVVSDPKLATEVLHTQGVEFGSRPRNVVFDIFTANGADMVFTEYGDHWRRMRRV 11883

11884 MTLPFFTARVVQQYKAMWEAEMDAVVDDVRGDAVAQGTGFVVRRRLQLMLYNIMYRMMFD 12063

12064 ARFKSVDDPMFIEATRFNSERSRLAQSFEYNYGDFIPILRPFLRGYLNKCRDLQSRRLAF 12243

12244 FNNNYVEKRR (2) 12273

12431 NKLRCAIDHILEAEKNGELTAENVIYIVENINVAAIETTLWSIEWALAEVVNHPAVQSKV 12610

12611 RAEINDVLGDDEPITESSIHKLTYLQAVIKETLRLHSPIPLLVPHMNLEEAKLGGYTIPK 12790

12791 GSKVVVNAWWLANNPALWENPEEFRPERFLEKESSVDATVAGKVDFRFLPFGVGRRSCPG 12970

12971 IILALPILALIVGKLVRSFEMVPPPGVEKLDVSEKGGQFSLHIAKHSVVAFHPISA* 13141

 

>AP004850.1a $F CYP73A40 (japonica cultivar-group) chromosome 2 clone OJ1342_D02

98% identical to AP004850.1b  100% to AP004859.1b

32815 MAASVVRVAIATGASLAVHLFVKSFLQAQHPALTLLLPVAVFAGIAVGAKGGNGGDGKAP 32636

32635 PGPAAVPVFGNWLHVGNDLNHRFLAAMSARYGPVFRLRLGVRNLVVVSDPKLATEVLHTQ 32456

32455 GVEFGSRPRNVVFDIFTANGADMVFTEYGDHWRRMRRVMTLPFFTARVVQQYKAMWEAEM 32276

32275 DAVVDDVRGDAVAQGTGFVVRRRLQLMLYNIMYRMMFDARFESVDDPMFIEATRFNSERS 32096

32095 RLAQSFEYNYGDFIPILRPFLRGYLNKCRDLQSRRLAFFNNNYVEKRR 31952

31797 NKLRCAIDHILEAEKNGELTAENVIYIVENINVAAIETTLWSIEWALAEVVNHPAVQSK 31621

31620 VRAEINDVLGDDEPITESSIHKLTYLQAVIKETLRLHSPIPLLVPHMNLEEAKLGGYTIP 31441

31440 KGSKVVVNAWWLANNPALWENPEEFRPERFLEKESGVDATVAGKVDFRFLPFGVGRRSCP 31261

31260 GIILALPILALIVGKLVRSFEMVPPPGVEKLDVSEKGGQFSLHIAKHSVVAFHPISA 31090

 

#83

>aaaa01002175.1 CYP74A4 (indica cultivar-group) orth of AC099043.1 $F chromosome 3 100%

4310 VIRRQTRASASASATDRQEVVSPKRRLPLRKVPGDYGPPVVGAIRDRYEYFYGPGGRDGF 4489

4490 FAARVRAHRSTVVRLNMPPGPFVARDPRVVALLDAASFPVLFDTSLVDKTDLFTGTFMPS 4669

4670 TDLTGGYRVLSYLDPSEPNHAPLKTLLFYLLSHRRQQVIPKFREVYGDLFGLMENDLARV 4849

4850 GKADFGVHNDAAAFGFLCQGLLGRDPAKSALGRDGPKLITKWVLFQLSPLLSLGLPTLVE 5029

5030 DTLLHSLRLPPALVKKDYDRLADFFRDAAKAVVDEGERLGIAREEAVHNILFALCFNSFG 5209

5210 GMKILFPTLVKWLGRAGARVHGRLATEVRGAVRDNGGEVTMKALAEMPLVKSAVYEALRI 5389

5390 EPPVAMQYGRAKRDMVVESHDYGYEVREGEMLFGYQPMATKDPRVFARPEEYVPDRFLGE 5569

5570 DGARLLRHVVWSNGPETAAPTLHDKQCAGKDFVVLVARLLLVELFLRYDSFDVEVGTSTL 5749

5750 GSSVTVTSLKKATF 5791

 

>AC099043.1 $F CYP74A4 chromosome 3 clone OSJNBa0079B15, Length = 155583

63% to AP004181.1 chromosome 2 56% to 74A Arab

AU031582 AU031583

62290 MATAAACISFASPSPARVVIR

62227 RQTRASASASATDRQEVVSPKRRLPLRKVPGDYGPPVVGAIRDRYEYFYGPGGRDGFFAA 62048

62047 RVRAHRSTVVRLNMPPGPFVARDPRVVALLDAASFPVLFDTSLVDKTDLFTGTFMPSTD 61871

61870 LTGGYRVLSYLDPSEPNHAPLKTLLFYLLSHRRQQVIPKFREVYGDLFGLMENDLARVGK 61691

61690 ADFGVHNDAAAFGFLCQGLLGRDPAKSALGRDGPKLITKWVLFQLSPLLSLGLPTLVED 61514

61513 TLLHSLRLPPALVKKDYDRLADFFRDAAKAVVDEGERLGIAREEAVHNILFALCFNSF 61340

61339 GGMKILFPTLVKWLGRAGARVHGRLATEVRGAVRDNGGEVTMKALAEMPLVKSAVYEAL 61163

61162 RIEPPVAMQYGRAKRDMVVESHDYGYEVREGEMLFGYQPMATKDPRVFARPEEYVPDRFL 60983

60982 GEDGARLLRHVVWSNGPETAAPTLHDKQCAGKDFVVLVARLLLVELFLRYDSFDVEV 60812

60811 GTSTLGSSVTVTSLKKATF* 60752

 

#397

>aaaa01021750.1 CYP74A5 (indica cultivar-group) orth to AC107226.1 $F chromosome 3 100%

521 LPRRPVPGSYGVPFVSAVRDRLDFYYLQGQDKYFESRAERYGSTVVRINVPPGPFMARDP 700

701 RVVALLDAKSFPVLFDVAKVEKRDVFTGTFMPSTSLTGGYRVCAYLDPSEPNHAKIKQLL 880

881 LSLLVSRKDAFVPVFRSNFGALLDTVESQLASGGGKSDFTALNDATSFEFIGEAYFGVRP 1060

1061 SASSSLGTGGPTKAALWLLWQLAPLTTLGLPMIIEDPLLHTLPLPPFLISSDYKALYAYF 1240

1241 AAAASQALDAAEGLGLSREEACHNLLFATVFNSYGGFKLLLPQILSRVAQAGEKLHERLA 1420

1421 AEIRSAVADAGGNVTLAALEKMELTRSVVWEALRLDPPVRFQYGRAKADLEIESHDASFA 1600

1601 IKKGEMLFGYQPCATRDPRVFGATAREFVGDRFVGEEGRKLLQYVYWSNGRETENPSVDN 1780

1781 KQCPGKNLVVLVGRLLLVELFLRYDTFTAEAGKKVVITGVTKAS 1912

 

>AC107226.1 $F CYP74A5 chromosome 3

allene oxide synthase (AOS) gene, complete cds.AY062258

98% to AY055775 allene oxide synthase (AOS) probably same gene

N-term matches C72393, D47977, C99549 95%

Same as AC107207.1 chromosome 3 clone OSJNBb0106M04

MELGVPLPRRPVPGSYGVPFVSAVRDRLDFYYLQGQDKYFESRA

ERYGSTVVRINVPPGPFMARDPRVVALLDAKSFPVLFDVAKVEKRDVFTGTFMPSTSL

TGGYRVCAYLDPSEPNHAKIKQLLLSLLVSRKDAFVPVFRSNFGALLDTVESQLASGG

GKSDFTALNDATSFEFIGEAYFGVRPSASSSLGTGGPTKAALWLLWQLAPLTTLGLPM

IIEDPLLHTLPLPPFLISSDYKALYAYFAAAASQALDAAEGLGLSREEACHNLLFATV

FNSYGGFKLLLPQILSRVAQAGEKLHERLAAEIRSAVADAGGNVTLAALEKMELTRSV

VWEALRLDPPVRFQYGRAKADLEIESHDASFAIKKGEMLFGYQPCATRDPRVFGATAR

EFVGDRFVGEEGRKLLQYVYWSNGRETENPSVDNKQCPGKNLVVLVGRLLLVELFLRY

DTFTAEAGKKVVITGVTKASTSAVNRTA

 

#58

>aaaa01001499.1 CYP74E1 (indica cultivar-group) 85% to AP004181.1 $F

8039 IRSRPAMAPPPVNSGDAAAAATGEKSKLSPSGLPIREIPGGYGVPFFSPLRDRLDYFYFQ 7860

7859 GAEEYFRSRVARHGGATVLRVNMPPGPFISGDPRVVALLDARSFRVLLDDSMVDKADTLD 7680

7679 GTFMPSRALFGGHRPLAFLDAADPRHAKIKRVVMSLAAARMHHVAPAFRAAFAAMFDAVE 7500

7499 AGLGAAVEFNKLNMRYMLDFTCAALFGGEPPSKVVGDGAVTKAMAWLAFQLHPIA 7335

7334 SKVVRPWPLEELLLHTFSLPPFLVRRGYADLKAYFADAAAAVLDDAEKSHPGIPRDELLD 7155

7154 NLVFVAIFNAFGGFKIFLPHIVKWLARAGPELHAKLATEVRAAADDGITLAAVERMP 6984

6983 LVKSVVWEALRMNPPVEFQYGHARRDMIVESH 6888

 

>AP004996.1a $F CYP74E1 (japonica cultivar-group) chr 2

62616 MAPPPVNSGDAAAAATGEKSKLSPSGLPIREIPGGYGVPFFSPLRDRLDYFYFQ 62777

62778 GAEEYFRSRVARHGGATVLRVNMPPGPFISGNPRVVALLDARSFRVLLDDSMVDKADTLD 62957

62958 GTYMPSRALFGGHRPLAFLDAADPRHAKIKRVVMSLAAARMHHVAPAFRAAFAAM 63122

63143 GXGAXXEFNKLNMRDMLDFTCAALFGGEPPSKVVGDGAVTKAMAWLAFQLHPIASKVVKP 63322

63323 WPLEELLLHTFSLPPFLVRRGYADLKAYFADAAAAVLDDAEKSHTGIPRDELLDNLVFVA 63502

63503 IFNAFGGFKIFLPHIVKWLARAGPELHAKLATEVRATVPTGEDDGITLAAVERMPLVKSV 63682

63683 VWEALRMNPPVEFQYGHARRDMVVESH 63763

      DAAYEVRKGEMLFGYQPLATRDEKVFDRAGEFVADRFVAGGAAGDRPLLEHVVWSNGPETRAPSE

      GNKQCPGKDMVVAVGRLMVAELFRRYDTFAADVVEAPVEPVVTFTSLTRASSG*

 

#350

>aaaa01015196.1 CYP74E2 (indica cultivar-group) orth to AP004181.1 $F chromosome 2 96%

6580 EALRMNPPVEFQYGRARRDMVVESHDAAYEVRKGEMLFGYQPLATRDEKVFDRAGEFVPD 6401

6400 RFVSGAGGAARPLLEHVVWSNGPETGTPSEGNKQCPGKDMVVAVGRLMVAEMFRRYDTFA 6221

6220 ADVEELPLEPVVSFTSLTRAA 6158

 

>AP004181.1 $F CYP74E2 chromosome 2 clone OJ1136_G09, AU184424.1 45% to 74A of Arabidopsis

6988 MAPPRANSGDGNDGAVGGQSKLSPSGLLIREIPGGYGVPFLSPLRDRLDYYYFQGADEFFRS 7173

7174 RVARHGGATVLRVNMPPGPFLAGDPRVVALLDARSFRVLLDDSMVDKADTLDGTFMPSLA 7353

7354 LFGGHRPLAFLDAADPRHAKIKRVVMSLAAARMHHVAPAFRAAFAAMFDEVDAGLVAGGP 7533

7534 VEFNKLNMRYMLDFTCAALFGGAPPSKAMGDAAVTKAVKWLIFQLHPLASKVVKPWPLED 7713

7714 LLLHTFRLPPFLVRREYGEITAYFAAAAAAILDDAEKNHPGIPRDELLHNLVFVAVFNAY 7893

7894 GGFKIFLPHIVKWLARAGPELHAKLASEVRAAAPAGGGEITISAVEKEMPLVKSVVWEAL 8073

8074 RMNPPVEFQYGRARRDMVVESHDAAYEVRKGELLFGYQPLATRDEKVFDRAGEFVPDRFV 8253

8254 SGAGSAARPLLEHVVWSNGPETGTPSEGNKQCPGKDMVVAVGRLMVAGLFRRYDTFAADV 8433

8434 EELPLEPVVTFTSLTRAADGDGAARRGV* 8520

 

>AP004996.1b $F CYP74E2 (japonica cultivar-group) chr 2 = AP004181.1

72208 MAPPRANSGDGNDGAVGGQSKLSPSGLLIREIPGGYGVPFLSPLRDRLDYYYFQGADE 72381

72382 FFRSRVARHGGATVLRVNMPPGPFLAGDPRVVALLDARSFRVLLDDSMVDKADTLDGTFM 72561

72562 PSLALFGGHRPLAFLDAADPRHAKIKRVVMSLAAARMHHVAPAFRAAFAAMFDEVDAGLV 72741

72742 AGGPVEFNKLNMRYMLDFTCAALFGGAPPSKAMGDAAVTKAVKWLIFQLHPLASKVVKPW 72921

72922 PLEDLLLHTFRLPPFLVRREYGEITAYFAAAAAAILDDAEKNHPGIPRDELLHNLVFVAV 73101

73102 FNAYGGFKIFLPHIVKWLARAGPELHAKLASEVRAAAPAGGGEITISAVEKEMPLVKSVV 73281

73282 WEALRMNPPVEFQYGRARRDMVVESH 73359

      DAAYEVRKGELLFGYQPLATRDEKVFDRAGEFVPDRFVSGAGSAARP

      LLEHVVWSNGPETGTPSEGNKQCPGKDMVVAVGRLMVAGLFRRYDTFAADVEELPLEPVV

      TFTSLTRAADGDGAARRGV*

 

#430

>aaaa01034814.1 CYP74E3P (indica cultivar-group) 76% to AP004181 $F

8 diffs with AP004996.1a 94%

820 AVAWLVFQLHPIASKVVKPWPLEDLLLHTFSLPPFLVRRGYATLKAYFADAAEKNHPGIP 641 (fs)

641 RDVLLDNLVFVAIFNAFGGFKIFLPHIVKWLARASPELHAKLATEVRATVPTGGAHEPAG 462

175 LVWEALRMNPPVEFQYGHARRDM 104 frameshift

102 VVESHDAAYEVRKGEMLFG 43

 

no japonica ortholog found 9/12/02

 

#183

>aaaa01005693.1 CYP74F1 (indica cultivar-group) 53% to 74B2 orth to AP004752.1 99%

911  RPIPGSYGPPLLGPLRDRLDYFWFQGPDDFFRRRAADHKSTVFRANIPPTFPFFLGVDPR 1090

1091 VVAVVDAAAFTALFDPALVDKRDVLIGPYVPSLAFTRGTRVGVYLDTQDPDHARTKAFSI 1270

1271 DLLRRAARNWAAELRAAVDDMLAAVEEDLNRAPDPAAASASYLIPLQKCIFRFLCKA 1441

1442 LVGADPAADGLVDRFGVYILDVWLALQLVPTQKVGVIPQPLEELLLHSFPLPSFVVKPG 1618

1619 YDLLYRFVEKHGAAAVSIAEKEHGISKEEAINNILFVLGFNAFGGFSVFLPFLVMEVGKP 1798

1799 GRDDLRRRLREEVRRVLGGGDGDGGEAGFAAVREMALVRSTVYEVLRMQPPVPLQFGRAR 1978

1979 RDFVLRSHGGAAYEVGKGELLCGYQPLAMRDPAVFDRPEEFVPERFLGDDGEALLQYVYW 2158

2159 SNGPETGEPSPGNKQCAAKEVVVATACMLVAELFRRYDDFECDGTS 2296

 

>AP004752.1 $F CYP74F1 (japonica cultivar-group) chr 2

32688 MVPSFPQPASAAAAT

32733 RPIPGSYGPPLLGPLRDRLDYFWFQGPDDFFRRRAADHKSTVFRANIPPTFPFFLGVDPR 32912

32913 VVAVVDAAAFTALFDPALVDKRDVLIGPYVPSLAFTRGTRVGVYLDTQDPDHARTKAFSI 33092

33093 DLLRRAARNWAAELRAAVDDMLAAVEEDLNRAPDPAAASASYLIPLQKCIFRFLCKALVG 33272

33273 ADPAADGLVDRFGVYILDVWLALQLVPTQKVGVIPQPLEELLLHSFPLPSFVVKPGYDLL 33452

33453 YRFVEKHGAAAVSIAEKEHGISKEEAINNILFVLGFNAFGGFSVFLPFLVMEVGKPGRDD 33632

33633 LRRRLREEVRRVLGGGDGGEAGFAAVREMALVRSTVYEVLRMQPPVPLQFGRARRDFV 33806

33807 LRSHGGAAYEVGKGELLCGYQPLAMRDPAVFDRPEEFVPERFLGDDGEALLQYVYWSNGP 33986

33987 ETGEPSPGNKQCAAKEVVVATACMLVAELFRRYDDFECDGTSFTKLDKRELTPS* 34151

 

#465

>aaaa01085971.1 CYP75A11 (indica cultivar-group) 45% to AC021892.5 48% to 75B1

AAAA01009188.1 (indica cultivar-group) orth of AC125784.1

598 LASLARKYGPVMYLKMGTCGVVVASSPCAARSFLKALDARFANRPAVASAVDITYNYQN 422

421 MVFANYGARWKLMRKLASVHLLGARALADWAAVRRDEARRLLRGVAEASAAGRP 260

10552 YKDMIVSLLTGAGLFNISDFVPALAWLDLQGVQAKLRRIHDQFDVLITKLLADHAATNAD 10373

10372 RARAGRTDFVDRLRAAVGVDDEDGETISEVNIKGLIF 10262

6592 FRPERFMPGGAAERVDPLGNYFELIPFGAGRRICAGKLAGMVFVQYFLGTLLHSFDWR 6419

6418 LPDGEDKVDMSETFGLALPKAVPLRALVTPRLAPAAYA 6305

 

>AC125784.1 $F CYP75A11 ortholog of AAAA01085971.1 100% AAAA01009188.1 1 aa diff

AU071096 BI796894 BI796059  50% to C99304 like 75B1

93834 MELAALCTDPVVLSAAFLCLLLHLSLRSYRPPSPGGGRRLPPGPPGLPVLGALPLVGPAP 94013

94014 HAGLASLARKYGPVMYLKMGTCGVVVASSPCAARSFLKALDARFANRPAVASAVDITYNY 94193

94194 QNMVFANYGARWKLMRKLASVHLLGARALADWAAVRRDEARRLLRGVAEASAAGRP 94361

95165 YKDMIVSLLTGAGLFNISDFVPALAWLDLQGVQAKLRRIHDQFDVLITKLLADHAATAAD 95344

95345 RARAGRTDFVDRLRAAVGVDDEDGETISEVNIKGLIF 95455

99273 DMFTAGTDTSSIIVEWAMAEMMKNPAVMARAQEEMDRVVGRGRRLEESDIASLPYLQAVC 99452

99453 KEAMRLHPSTPLSLPHFSFDECDVDGYRIPANTRLLINIYAIGRDPSAWEDPLEFRPERF 99632

99633 MPGGAAERVDPLGNYFELIPFGAGRRICAGKLAGMVFVQYFLGTLLHSFDWRLPDGEDKV 99812

99813 DMSETFGLALPKAVPLRALVTPRLAPAAYA 99902

 

#178

>aaaa01005609.1 $FI CYP75B3 (indica cultivar-group) orth AC021892.5 $F chr 10 98% compare to aaaa01006229.1a and 1b 100% match

     MDVVPLPLLLGSLAVSAAVWYLVYFLRGGTGANAARK

7136 RRPLPPGPRGWPVLGNLPQLGDKPHHTMCALARQYGPLFRLRFGCAEVVVAASAPVAAQF 7315

7316 LRGHDANFSNRPPNSGAEHVAYNYQDLVFAPYGARWRALRKLCALHLFSAKALDDLRAVR 7495

7496 EGEVALMVRNLARQQAASVALGQEANVCATNTLARATIGHRVFAVDGGEGAREFKEMV 7669

7670 VELMQLAGVFNVGDFVPALRWLDPQGVVAKMKRLHRRYDNMMNGFINERKAGAQ 7831

7832 PDGVAAGEHGNDLLSVLLARMQEEQKLDGDGEKITETDIKALLL

     NLFTAGTDTTSSTVEWALAELIRHPDVLKEAQHELDTVVGRGR 8185

8186 LVSESDLPRLPYLTAVIKETFRLHPSTPLSLPREAAEECEVDGYRIPKGATLLVNVWAIA 8365

8366 RDPTQWPDPLQYQPSRFLPGRMHADVDVKGADFGLIPFGAGRRICAGLSWGLRMVTLMTA 8545

8546 TLVHGFDWTLANGATPDKLNMEEAYGLTLQRAVPLMVQPVPRLLPSAYGV

 

>AC021892.5 $F CYP75B3 chromosome 10 clone OSJNBa0053D03 21 unordered pieces

          Length = 175567 65% to 75B1 compare to aaaa01006229.1a and 1b

62759 MDVVPLPLLLGSLAVSAAVWYLVYFLRGGSGGHAA

      RKRRPLPPGPRGWAVLGNLPQLGDKPHHTMCALGRQYGPLFRLRFGCAEVVEAATS 62487

62479 AAQFLRGHDANFSNRPPNSGAEHVAYNYQDLVFAPYGARWRALRKLCALHLFSAKALDDL 62300

62299 RAVREGEVALMVRNLARQQAASVALGQEANVCATNTLARATIGHRVFAVDGGEGARE 62129

62128 FKEMVVELMQLAGVFNVGDFVPALRWLDPQGVVAKMKRLHRRYDNMMNGFINER 61967

61966 KAGAQPDGVAAGEHGNDLLSVLLARMQEEQKLDGDGEKITETDIKALLL

61786 NLFTAGTDTTSSTVEWALAELIRHPDVLKEAQHELDTVVG 61607

61606 RGRLVSESDLPRLPYLTAVIKETFRLHPSTPLSLPREAAEECEVDGYRIPKGATLLVNVW 61427

61426 AIARDPTQWPDPLQYQPSRFLPGRMHADVDVKGADFGLIPFGAGRRICAGLSWGLRMVTL 61247

61246 MTATLVHGFDWTLANGATPDKLNMEEAYGLTLQRAVPLMVQPVPRLLPSAYGV* 61085

 

#178

>aaaa01006229.1a CYP75B3 (indica cultivar-group) orth AC021892.5 $F chr 10

aaaa01006229.1a and aaaa01006229.1b are exactly identical (probably an error in assembly)

also 100% to aaaa01005609.1; count only once; see aaaa01005609.1 for ortholog

2552 RRPLPPGPRGWPVLGNLPQLGDKPHHTMCALARQYGPLFRLRFGCAEVVVAASAPVAAQF 2373

2372 LRGHDANFSNRPPNSGAEHVAYNYQDLVFAPYGARWRALRKLCALHLFSAKALDDLRAVR 2193

2192 EGEVALMVRNLARQQAASVALGQEANVCATNTLARATIGHRVFAVDGGEGAREFKEMV 2019

2018 VELMQLAGVFNVGDFVPALRWLDPQGVVAKMKRLHRRYDNMMNGFINERKAGAQ 1857

1856 PDGVAAGEHGNDLLSVLLARMQEEQKLDGDGEKITETDIKALLL

     NLFTAGTDTTSSTVEWALAELIRHPDVLKEAQHELDTVVGRGR 1503

1502 LVSESDLPRLPYLTAVIKETFRLHPSTPLSLPREAAEECEVDGYRIPKGATLLVNVWAIA 1323

1322 RDPTQWPDPLQYQPSRFLPGRMHADVDVKGADFGLIPFGAGRRICAGLSWGLRMVTLMTA 1143

1142 TLVHGFDWTLANGATPDKLNMEEAYGLTLQRAVPLMVQPV 1023

 

#178

>aaaa01006229.1b CYP75B3 (indica cultivar-group) orth AC021892.5 $F chr 10

aaaa01006229.1a and aaaa01006229.1b are exactly identical (probably an error in assembly)

11898 RRPLPPGPRGWPVLGNLPQLGDKPHHTMCALARQYGPLFRLRFGCAEVVVAASAPVAAQF 12077

12078 LRGHDANFSNRPPNSGAEHVAYNYQDLVFAPYGARWRALRKLCALHLFSAKALDDLRAVR 12257

12258 EGEVALMVRNLARQQAASVALGQEANVCATNTLARATIGHRVFAVDGGEGAREFKEMV 12431

12432 VELMQLAGVFNVGDFVPALRWLDPQGVVAKMKRLHRRYDNMMNGFINERKAGAQ 12593

12594 PDGVAAGEHGNDLLSVLLARMQEEQKLDGDGEKITETDIKALLL

      NLFTAGTDTTSSTVEWALAELIRHPDVLKEAQHELDTVVGRGR 12947

12948 LVSESDLPRLPYLTAVIKETFRLHPSTPLSLPREAAEECEVDGYRIPKGATLLVNVWAIA 13127

13128 RDPTQWPDPLQYQPSRFLPGRMHADVDVKGADFGLIPFGAGRRICAGLSWGLRMVTLMTA 13307

13308 TLVHGFDWTLANGATPDKLNMEEAYGLTLQRAVPLMVQPV 13427
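
Entries that are exact duplicates, like the two aaaa01006229.1 copies, are counted only once. A minimal sketch for spotting exact duplicates by grouping on the residue string is shown below. The function name is hypothetical, and only exact string matches are grouped, so a longer copy of the same gene is not merged.

```python
from collections import defaultdict

def group_identical(entries):
    """Group entry names that share exactly the same residue string."""
    groups = defaultdict(list)
    for name, residues in entries:
        groups[residues].append(name)
    return [names for names in groups.values() if len(names) > 1]

# Last segment of each CYP75B3 indica entry in this listing.
entries = [
    ("aaaa01005609.1", "TLVHGFDWTLANGATPDKLNMEEAYGLTLQRAVPLMVQPVPRLLPSAYGV"),
    ("aaaa01006229.1a", "TLVHGFDWTLANGATPDKLNMEEAYGLTLQRAVPLMVQPV"),
    ("aaaa01006229.1b", "TLVHGFDWTLANGATPDKLNMEEAYGLTLQRAVPLMVQPV"),
]
print(group_identical(entries))
```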

 

#281

>aaaa01010988.1 CYP75B4 (indica cultivar-group) orth AC119149.2 $F chr 10 99%

see aaaa01018574.1 for ortholog

8341 MQNLFVAGTDTTSTIVEWTMAELIRHPDILKQAQEELDVVVGRDRLLLESDLSHLTFF 8168

8167 HAIIKETFRLHPSTPLSLPRMASEECEIAGYRIPKGAELLVNVWGIARDPAIWPDPLEYK 7988

7987 PSRFLPGGTHTDVDVKGNDFGLIPFGAGRRICAGLSWGLRMVTMTAATLVHAFDWQLPAD 7808

7807 QTPDKLNMDEAFTLLLQRAEPLVVHPV 7727

 

#281

>aaaa01018574.1 CYP75B4 (indica cultivar-group) orth AC119149.2 $F chr 10 99%

1103 SLLLTTVALSVIVCYALVFSRAGKARAPLPLPPGPRGWPVLGNLPQLGGKTHQTLHEMTK 1282

1283 VYGPLIRLRFGSSDVVVAGSAPVAAQFLRTHDANFSSRPRNSGGEHMAYNGRDVVFGPYG 1462

1463 PRWRAMRKICAVNLFSARALDDLRAFREREAVLMVRSLAEASAAPGSSSPAAVVLGKEVN 1642

1643 VCTTNALSRAAVGRRVFAAGAGEGAREFKEIVLEVMEVGGVLNVGDFVPALRWLDPQGVV 1822

1823 ARMKKLHHRFDDMMNAIIAERRAGSLLKPTDSREEGKDLLGLLLAMVQEQEWLAAG 1990

1991 EDDRITDTEIKALI 2032

 

>AC119149.2 $F CYP75B4 Nipponbare strain, clone OSJNBb0079E01, from ch 10

53% to 75B1

D48413, AU032067 AU173484.1 AU222694.1 47% IDENTICAL TO 75B1 N-TERMINAL REGION

C99304 opposite end = C19579 65% to 75B1 74% to AC021892

3 prime UTR matches AU173485 Rice root Oryza sativa cDNA clone R3314,

which is the opposite end of AU173484, so these are from the same gene

3 prime UTR Also = AU032964 Rice shoot Oryza sativa cDNA clone S0065_2Z.

AU032242 58% identical to C97611 47% to AA754300 47% to 75B1 I helix

74682 MEVAAMEISTSLLLTTVALSVIVCYALVFSRAGKARAPLPLPPGPRGWPVLGNLPQLGGK 74503

74502 THQTLHEMTKVYGPLIRLRFGSSDVVVAGSAPVAAQFLRTHDANFSSRPRNSGGEHMAYN 74323

74322 GRDVVFGPYGPRWRAMRKICAVNLFSARALDDLRAFREREAVLMVRSLAEASAAPGSSSP 74143

74142 AAVVLGKEVNVCTTNALSRAAVGRRVFAAGAGEGAREFKEIVLEVMEVGGVLNVGDFVPA 73963

73962 LRWLDPQGVVARMKKLHRRFDDMMNAIIAERRAGSLLKPTDSREEGKDLLGLLLAMVQEQ 73783

73782 EWLAAGEDDRITDTEIKALIL (0) 73720

65761 NLFVAGTDTTSTIVEWTMAELIRHPDILKHAQEELDVVVGRDRLLSESDLSHLTFFHAII 65582

65581 KETFRLHPSTPLSLPRMASEECEIAGYRIPKGAELLVNVWGIARDPAIWPDPLEYKPSRF 65402

65401 LPGGTHTDVDVKGNDFGLIPFGAGRRICAGLSWGLRMVTMTAATLVHAFDWQLPADQTPD 65222

65221 KLNMDEAFTLLLQRAEPLVVHPVPRLLPSAYNIA* 65117

 

#64

>aaaa01001667.1a $FI CYP76H4 (indica cultivar-group) ortholog of AC118672.1b

3640 MELLLLCTPCVILLLSSLYLLRLFVDARRNLPPGPRPQPLIGNILDLGSQPHRSLARLAGRYGPLMTL 3437

3436 RLGTVTTVVASSPGAARDILQRHDAAFSARSVPDAARACGHDGFSMGMLPPSSALWRALR 3257

3256 RVCAAELFAPRSLDAHQRLRRDKVRQLVSHVARLARDGAAVDVGRAAFTASLNLLSSTIF 3077

3076 SADLADFGDARAESSVGDLRDLISEFTIVVGVPNVSDFFPAVAPLDPQRLRRRVARVFER 2897

2896 LQAVFDGHIERRLRDRAAGEPPNNDFLDALLDYRSPEDGRGFDRPTLQFLFT (0)

     DLFSAGSDTSAVTVEWAMAQLLQNP 2537

2536 PAMAKAREELARVIGSKQEIEESDISQLKYLEAVVKETLRLHPPAPFLLPHQAETT 2369 (fs)

2367 TEIRGYTVPRGARVLVNVWAIGQDPELWPEPEVFTPERFLEKEMDFRGKDFELLP*GSGR 2188

2187 RVCPGLPLAVRMVHLMLASLLHRFEWRLLPEVEKNGVDMAEKFGMILELATPLRAVAIPV* 2005

 

>AC118672.1b $F CYP76H4 (japonica cultivar-group) chromosome 3 (3 frameshifts)

C72289       63% to 76C2 337-453 region clone E1370 like AC078944

AU093938.1 Rice panicle cDNA clone E1370. Length = 642 43% to 77A4

BE230674.1 99AS896 Rice Seedling cDNA Library cDNA clone 99AS896.

BI813139 cDNA clone J002F10.Length = 569 = C72289

29723 MELLLLCTPCVILLLSSLYLLRLFVDARRNLPPGPRPQPLIGNILDLGSQPHRSLARLAG 29902

29903 RYGPLMTLRLGTVTTVVASSPGAARDILQRHDA 30001 (fs)

30003 AFSARSVPD 30029 (fs)

30031 AARACGHDGFSMGMLPPTSALWRAL 30105 (fs)

30107 RRVCAAELFAPRSLDAHQRLRRDKVRQLVSHVARLARDGAAVDVGRAAFTA 30259

30260 SLNLLSSTIFSADLADFGDARAESSVGDLRDLISEFTIVVGVPNVSDFFPAVAPLDPQRL 30439

30440 RRRVARVFERLQAVFDGHIERRLRDRAAGEPPKNDFLDALLDYRSPEDGRGFDRPTLQFLFT (0) 30625

      DLFSAGSDTSAVTVE 30799

30800 WAMAQLLQNPPAMAKAREELARVIGSKQEIEESDISQLKYLEAVVKETLRLHPPAPFLLP 30979

30980 HQAETTTQVGGYTVPKGTRVLVNVWAIGRDSKVWSDPDKFMPERFLQSEVDLRGRDFELI 31159

31160 PFGSGRRICPGLPLAVRMVYLMLASLLHRFEWRLLPEVEKNGVDMAEKFGMILELATPLR 31339

31340 AVAIPV* 31360

 

#65

>aaaa01001667.1b CYP76H5 (indica cultivar-group) ortholog of AC118672.1a

11859 MELLLLYAPCVILLVSSLYLLRLFSDARRNLPPGPRPLPLVGNLLELGAKPHRSLARLAERHGPLMTL 11656

11655 RLGAVTTIVASSPDAARDILQRHDAAFSTRPVPDIVRACGHDRFAMPWLPPSSPQCRALR 11476

11475 KVCSAELFAPRRLDAQQRLRREKARRLVSHVARMAREGAAVDVRRVVFTTLLNMLSCTLF 11296

11295 SADLADLDEGRAGSAGELADTVAEFAGTVGVPNVVDYFPAVAAFDPQRLRRRLSRVFTR 11119

11118 LFAEFDEQIERRMRERDAGEPPKNDFLDVLLDYRTTEDGRQFDRQTLRSRFT (0) 10963

10235 DLFSAGSDTSAVTVEWAMAQLLQSPSSMMKAREELTRVIGSKPEIDESDIDSLEYLQAVV 10056

10055 KETFRLHPPAPLLLSHRAETDTEIGGYTVPKGATVMVNIWA 9933

 

>AC118672.1a $F CYP76H5 (japonica cultivar-group) chromosome 3 45% to 76C2

20322 MELLLLYAPCVILLVSSLYLLRLFSDARRNLPPGPRPLPLVGNLLELGAKPHRSLARLAE 20501

20502 RHGPLMTLRLGAVTTIVASSPDAARDILQRHDAAFSTRPVPDIVRACGHDRFAMPWLPPS 20681

20682 SPQWRALRKVCSAELFAPRRLDAQQRLRREKARRLVSHVARMAREGAAVDVRRVVFTTLL 20861

20862 NMLSCTLFSADLADLDEGRAGSAGELADTVAEFAGTVGVPNVVDYFPAVAAFDPQRLRR 21038

21039 WLSRVFTRLFAEFDEQIERRMRERDAGEPPKNDFLDVLLDYRTTEDGRQFDRQTLRSRFT 21218

21879 DLFSAGSDTSAVTVEWAMAQLLQSPSSMMKAREELTRVIGSKPEIDESDIDSLEYLQAVV 22058

22059 KETFRLHPPAPLLLSHRAETDTEIGGYTVPKGATVMVNIWAIGRDSKVWFEPDKFIPERF 22238

22239 LQKEVDFRGRDFELIPFGSGRRICPGLPLAVRMVHLMLASLLHRFEWRLLPEVERNGVNM 22418

22419 EEKFGIVMTLATPLQAIATPI* 22484

 

#149

>aaaa01004378.1a CYP76H6 (indica cultivar-group) orth AC116603.1a $F chr 10, 98%

11306 GLHDDALRSVFT (0) 11271 fragment of N-term exon

seq gap here may contain missing 76H6 N-terminal exon

 9739 DLFAAGSDTSSSTVEWAMAELLRNPLPMAKACDELQRVIGSTRRIEESDIGRLPY 9575

 9574 LQAVIKETFRLHPPVPFLLPRQATTTIQILGYTIPKGAKVFINVWAMGRDKDIWPEAEKF 9395

 9394 MPERFLERATDFKGADFELIPFGAGRRICPGLPLAVRMVHVVLASLLINFKWRLPIKVER 9215

 9214 DGVNMTEKFGVTLAKAIPL 9158

 

>AC116603.1a $F CYP76H6 Nipponbare strain, clone OSJNBb0015J03, from chromosome 10

CB000966 = EST for C-term exon; it starts at exon 2 (DLF), so cannot tell

which exon 1 is used.

9135 MAALFLWLSWLVL 9097 (frameshift)

9094 SLLSVYLLDLLAQSRRRLPPGPHPLPLIGSLHLLGDQPHRSLAGLAKTYGPLM 8936

8935 SLRLGAVTTVVVSSPDVAREFLQKHDAVFATRSAPDASGDHARNSVALLPNSPRWRELRK 8756

8755 IMATELFSTSRLDALHELRQEKVVELVDHVARLAREGAAVDVGRVAFTTSLNLLSHTIFS 8576

8575 RDLTSLDDHGASKEFQQVVTDIMGAAGSPNLSDFFPALAAADLQGWRRRLAGLFERLRRV 8396

8395 FDAEIEHRRRVVGKEHGKVKDDFLRVLLRLAARDDDTAGLHDDALQSIFT (0) 8246

6711 DLFAAGSDTSSSTVEWAMAELLRNPLPMAKACDELQRVIGSTRRIEESDIGRLPYLQ 6541

6540 AVIKETFRLHPPVPFLLPRQATTTIQILGYTIPKGAKVFINVWAMGRDKDIWPEAEKFMP 6361

6360 ERFLERATDFKGADFELIPFGAGRRICPGLPLAVRMVHVVLASLLINFKWRLPVKVERDG 6181

6180 VNMTEKFGVTLAKAIPLCAMATST* 6106

 

>possible Zea ortholog of 76H6

>CG052632 CG208675 CG328504 CG328491   OGYBY35TV ZM_0.7_1.5_KB Zea mays genomic clone ZMMBMa0644F21.

          Length = 974

MGALLPWLAWLLVSLVGVYLLGHLVQARRRRGLPPGPHPLPIIGSLHLLGNQPHRSLARLAKT

HGPLVSLRLGSVTTVV

ASSPAAAREILQRHDAAFSNRSVPDAPGAHARNSTVWLPNAPRWRALRKIMGTQLFAPHR

LDALQHLRRDKARELVDHVRRLARRGEPVNVGRVAFTTSLNLVSRTIFSRDLASLEDDGA

SRKFQEVVTDIMEAVGSPNVSDFFPALAVADLQGCRRRLARLFARLHRIFDEEIDARLRG

RDAGEPKKNDFLDLLLDAAEDDDNTAGLDRDTLRSLFT (0)

381 DLFSAGSDTSSSSVEWAMVELLRSPGSMAKACDELATVIGPRKDIEESDIGRLPYLQ 211

AVVKETFRLHPAAPLLLPRRAQADVKMMGYVIPEGSRVFVNVWAMGRDEETWPEPEKFLPERFLGKKT

QQAVDLRGGDFDLIPFGGGRRICP

GMPLAIRMVHLLLASLLNQFAWRLPAELERNGVDMAENFGITLTKAVPLCAIATEI*

 

#150

>aaaa01004378.1b CYP76H7 (indica cultivar-group) orth AC116603.1b $F chr 10

This exon is present in both japonica and indica upstream of 76H6.  It may be

a pseudogene or an alternative first exon to 76H6

sequence gap here, missing 13 aa at N-term

13189 LSLLSIYLLD frameshift

      ILAHSRRRLPPGP frameshift

13156 RPLPLIGSLHLLGDQPHRSLAGLAKTYGPLMSLRLGAVTTVVVSSPDVA 12977

12976 REFLQKHDAVFATRSAPDAAGDHTRNSVPWLPPGPRWRELRKIMATELFATHRLDALH 12803

12802 ELRQEKVSELVDHVARLARDGAAVDVGRVAFTTSLNLLSRTIFSRDLTSLDDRGASKEFQ 12623

12622 QVVTDIMGAAGSPNLSDFFPALAAADLQGWRRRLAGLFERLHRVFT 12485

      AEIEHRRRVAGEEHGKVKDDFLRVLLRLAARDDDTAGLDDDTLRSVFT (0) 12341

 

>AC116603.1b CYP76H7 (partial) Nipponbare strain, clone OSJNBb0015J03, from chromosome 10

12428 MASALFLWLSWLVLSLLSIYLLDLLAHSRRRLPPGPRPLPLIGSLHLLGDQPHRSL 12261

12260 AGLAKTYGPLMSLRLGAVTTVVVSSPDVAREFLQKHDAVFATRSAPDAAGDHTRNSVPWL 12081

12080 PPGPRWRELRKIMATELLATHRLDALHELRQEKVSELVDHVARLARDGAAVDVGRVAFTT 11901

11900 SLNLLSRTIFSRDLTSLDDRGASKEFQQVVTDIMGAAGSPNLSDFFPALAAADLQGWRRR 11721

11720 LAGLFERLHRVFDAEIEHRRRVAGEEHGKVKDDFLRVLLRLAARDDDTAGLDDDTLRS 11547

11546 VFT (0) 11538

 

#384

>aaaa01019517.1 CYP76H8 (indica cultivar-group) ortholog to AQ869312.1 AC116603.1c

N-term does not match as well. 47% to 76C2 78% to AC078944.5

4547 ALLPWLPWLLAALLSVYLLDLLAHSRRRLPPGPRPLPLIGSLHLLGDRPHRSL  (fs)

4373 PHHSLAGLAKKYGPLMSLRLGAVTTVVASSPEVAREFLQKHDAVFATRSTPDATGD 4206

4205 HARNSVAWLPPGPRWRELRKIMATELFSTRRLDALHELRQEKVAELVDHVARLARDGTAV 4026

4025 DIGRVAFTTSLNLVARTIFSHDLTSLDDHGASKEFQRLITDVMEAVGSPNLSDFFPALAA 3846

3845 VDLQGWRRRLSGLFARLHRLFDAEMDHRRLHGMKEKDGDFLEVLLRLAARDDDTARLDGD 3666

3665 TLRSLFT (0?) 3645

2339 DLFTAGSDTSSSTVEWAMAELLQNPISMAKLCDELRRVVGSRRRIEESEIGQLPYLQAVI 2160

2159 KETFRLHSPAPLLLPRQATRTIQIMGYTIPKGTRVLINVWAMGRDEDIWPEAGKFMPERF 1980

1979 LERTIDYKGGDLELIPFGAGRRICPGMPLAVRMVHVLLASLLIHFKWRLPAEVEGNRIDM 1800

1799 TEKFGVTLAKANHLCAMAAPT* 1734

 

>AC116603.1c $F CYP76H8 Nipponbare strain, clone OSJNBb0015J03, from chromosome 10,

AQ869312.1 70% to AC078944.1 168857-163366 same as AQ913628.1

41763 MAALLLWLSWLLLSLLSIYLLDLLAHSRRCLPPGPRPLPLIGSLHLLGDLPHRSL 41605

41604 AGLAKTYGPLMSLRLGAVTTVVASSPEVAREFLQKHDAVFATRSTPDATGDHARNSVAWL 41425

41424 PPGPRWRELRKIMATELFSTRRLDALHELRQEKVAELVDHVAGLARDGTAVDIGRVAFTT 41245

41244 SLNLVARTIFSHDLTSLDDHGASKEFQRLITDVMEAVGSPNLSDFFPALAAVDLQGWRRR 41065

41064 LSGLFARLHRLFDAEMDHRRLHGMKEKDGDFLEVLLRLAARDDDTARLDGDTLRSLFT 40891

39623 DLFTAGSDTSSSTVEWAMAELLQNPISMAKLCDELRRVVGSRRRIEESEIGQLPYLQAVI 39444

39443 KETFRLHSPAPLLLPRQATRTIQIMGYTIPKGTRVLINVWAMGRDEDIWPEAGKFIPERF 39264

39263 LERTIDYKGGDLELIPFGAGRRICPGMPLAVRMVHVLLASLLIHFKWRLPAEVEGNRIDM 39084

39083 TEKFGVTLAKANHLCAMAAPT* 39018

 

#384

>aaaa01025351.1 CYP76H8 (indica cultivar-group) orth? AC116603.1c $F 94% 2 diffs

1 aa diff and 7 aa deletion vs aaaa01019517.1; see aaaa01019517.1 for ortholog

2705 LPPGPRPLPLIGSLHLLGDLPHHSLAGLAKKYGPLMSL 2818

 

#59

>aaaa01001606.1a CYP76H9 (indica cultivar-group) ortholog of AC078944.5b >99%

AAAA01032734.1 (indica cultivar-group) missing Cterm, runs off end

2001 MATWILGWLLWLPVFLISLYLVDILAHSCRRLPPGPRPLPFIGSLHLLGDQPHRSLAGLA 1822

1821 KKYGPLMSLRLGAVTTVVVSSPEVAREFVQKHDAVFADRSIPDSIGDHTKNSVIWLNPGP 1642

1641 RWRALRRIMATELFSPHQLDALQQLRQEKVAELVDHVARLARESAAVDVGRVAFATSLNL 1462

1461 LSRTIFSRDLTSLDDRGASREFKQVITDIMEAAGSPNLSDFYPAIAAVDLQGWRRRCARL 1282

1281 FTQLHRLFDDEMDHRKLHSRHGG 1213 frameshift

1211 PGENGKEKDDFLEVLLRLGARDDDIAGLDGDTLRSLF 1101

572 DLFAAGSDTSSSTIEWAMVELLKNTLSMGKACDELAQVVGSRRRIEESEIGQLPYLQAVI 393

392 KETLRLHPPVPLLPHRAKMAMQIMGYTIPNGTKILINVWAMGRDKNIWTEPEKFMPERFL 213

212 DRTIDFRGGDLELIPFGAGRRICPGMPLAIRMVHVVLASLLIHFKWRLPVEVERNGIDMT 33

EKFGLTLVKA 3

 

>AC078944.5b $F CYP76H9 clone OSJNBa0089D15

47% TO AJ237995.1 Vitis vinifera 76F2 45% to 76C3

145298 MATWILGWLLWLPVFLISLYLVDILAHSCRRLPPGPRPLPFIGSLHLLGDQPHRSLAGLAKKYGP 145492

145493 LMSLRLGAVTTVVVSSPEVAREFVQKHDAVFADRSIPDSIGDHTKNSVIWLNPGPRWRAL 145672

145673 RRIMATELFSPHQLDALQQLRQEKVAELVDHVARLAREGAAVDVGRVAFATSLNLLS 145843

145844 RTIFSRDLTSLDDRGASREFKQVITDIMEAAGSPNLSDFYPAIAAVDLQGWRRRCARLFT 146023

146024 QLHRLFDDEMDHRKLHSRHGGPGENGKEKDDFLEVLLRLGARDDDIAGLDGDTLRSLF 146197 (0)

150187 DLFAAGSDTSSSTIEWAMVELLKNTLSMGKACDELAQVVGSRRRIEESEIGQLP 150348

150349 YLQAVIKETLRLHPPVPLLPHRAKMAMQIMGYTIPNGTKILINVWAIGRD 150498

150499 KNIWTEPEKFMPERFLDRTIDFRGGDLELIPFGAGRRICPGMPLAIWMVHVVLASLLIH 150675

150676 FKWRLPVEVERNGIDMTEKFGLTLVKAIPLCALATPT* 150789

 

#60

>aaaa01026521.1 CYP76H10 (indica cultivar-group) ortholog of AC078944.5a 99%

AAAA01001606.1b (indica cultivar-group)

    1 RRIMAAELFAPHRLDALRRLRREKVQELVDHVARLAEREGGAAAVDVGRVAFATSLNLLS 180

  181 STIFSRNLTSLDDHGESMEFKEVVVEIMEAGGCPNVSDFFPAIAAADLQGWRRRMACLFA 360

  361 RLHRVFDAVVEERLSERDAGEARKGDFLDVLLDVAARDNDSAGLDRDTLRSLFT 522

25247 DLFAAGSDTSSSIVEWAIAELMRNPLCMIRACDELS 25140 (fs)

25139 QAIGSGTNIEESDIGQLPYLQAVVKETFRLHPPVPLLLPRQAETTTNIAGYTIPKGARVF 24960

24959 VNVWAIGRHKDTWSQPEKFMPERFFERNIDFRGVHFELIPFGAGRRICPGLPLANRMVHL 24780

24779 VLGSLLNQFKWNLPVDIERNGIDMSEKFGLTLVKAIPLCALVTPISVESGDH 24624

 

>AC078944.5a $P CYP76H10 clone OSJNBa0089D15 2 large insertions probably make this a pseudogene 47% to 76C2

92923  MASFLPWLPWLLAALLSVYLLDLLAHSRRLLPPGPRPLPLIGSLHLLGDQPHRSLAGLAK 93102

93103  MYGPLMSLRLGAVTTVVVSSPDVAREFLQRHNAAFASRSVPDATGDHATNSVAWLPNSPR 93282

93283  WRALRRIMAAELFAPHRLDALRRLRREKVQELVDHVARLAEREGGAAAVDVGRVAFATSL 93462

93463  NLLSSTIFSRNLTSLDDHGESMEFKEVVVEIMEAGGCPNVSDFFPAIAAADLQGWRRRMA 93642

93643  GLFARLHRVFDAVVEERLSERDAGEARKGDFLDVLLDVAARDNDSAGLDRDTLRSLFT   93816 (0)

9.4kb insertion here

103281 DLFAAGSDTSPSIVEWAIAELMRNPLCMIRAC 103376 probable 10.5 kb insertion here

113843 DELSQVIGSGTNIEESDIGQLPYL 113914

113915 QAVVKETFRLHPPVPLLLPRQAETTTNIAGYTIPKGARVFVNVWAIGRHK 114064

114065 DTWSQPEKFMPERFFERNIDFRGVHFELIPFGAGRRICPRMPLANRMVHLVLGSLLNQFK 114244

114245 WNLPVDIERNGIDMSEKFGLTLVKAIPLCALVTPISVESGDH* 114373

 

#307

>aaaa01012542.1 CYP76H11 (indica cultivar-group) orth AP004183.1 $P chr 8 99%

5820 WLLGSLLSVYLLDLLAHSRRRLPPGPRPLPFIGSLHLLGDQPHRSLAALAMAYGPLMSLR 5641

5640 LGAVTTVVASSPAVARELLHRHDAAFASRSSPDSTGDHARSSVAWLPSSAPRWRALRR 5467

5466 IMATELFAPHRLDAAAPRRLRREKVRELVAHVARLAAGEGGKPAVVDVGRVAFATSLNLL 5287

5286 SRTIFSRDLTSLDDHGGSKGFQEAVARIMEAGGRPIVSDFFPVLAAADLQGWRRRLA 5116

5115 RLFARLHRVFDAEVDARLREHDAGEASKGDFLDVLLGIAARRDDAAELDRDTLRSLF 4945

4944 TVIQL 4930

4816 DLLGGALSLLPLLEWFRLVVGPRGFVVVCDHAFFRHVLCEKG 4691

991 DVTEKLEVEKN*TQQLSQVIGLGPNIKESEIGQLPYLQAVVKETFRLHPPAPLLLPRQ 818

817 AEMTMKIAGYTIPKGTRIFINVWAMGRDKDIWPEPEKFIPERFLGSKIDFKGVHFELIPF 638

637 GAGRRICPGMPLANRMVHLILGSLLNQFKWNLPVEVERNGIDMSEKFGLTLAKATPL 467

 

>AP004183.1 $P CYP76H11 chr 8 clone OJ1221_H04, Pseudogene 74% to AC078944.1 118250-117423

AP004751.1 chromosome 8 clone P0494D11

51271 MAFLFPSLLLPWLPWLLGSLLSVYLLDLLAHSRRRLPPGPRPLPFIGSLHLLGDQPHRSLAALAMAYG 51068

51067 PLMSLRLGAVTTVVASSPAVA 51005 frameshift

51003 REILHRHDAAFASRPRGDSTG 50962 numerous frameshifts in the next 60 nucleotides

possible reconstructed seq. NHARE PVAWPAHNAPGWGA LR

      RIMATELFAPQPVDSGRPGRLRREKCRSCAHTPKVGARQGGKAAGVD 50738 probable intron here

      but no AG junction

50467 GRVAFATSLNLLSRTIFSRDLTSLDDP 50387 frameshift

50385 GGSKGFQEAVARIMEAGGRPNVSDFFPVLAAADLQGWRRRLARLFARLHRVFDAEVDAR 50209

50208 LREHDAGEARKGDFLDVLLGIAARRDDAAELDRDTLRSLFT 50086 (0)

45752 DLFCAGSDTSSSTVEWAMAELMQNPKSMSRVCDELSQVIGLGRNIKESEIGQLPYLQ 45582

45581 AVVKETFRLHPPAPLLLPRQAEMTMKIAGYTIPKGTRIFVNVWAMGRDKDIWPEPEKFIP 45402

45401 ERFLGSKIDFKGVHFELIPFGAGRRICPGMPLANRMVHLILGSLLNQFKWNLPVKVERNG 45222

45221 IDMSEKFGLTLAKATPLCALVTPISVKPADHQE* 45120

 

#421

>aaaa01028537.1 CYP76H12P (indica cultivar-group) orth AP003261.1 $P chr 1 98% 1 diff

817 RRLPPGPPGVPVLGALPLVGPAPHADLASLARKYGPIMYLKMGTCGVVAV 668

684 AASSPCAARVFL 649

 

>AP003261.1 $P CYP76H12P chromosome 1 clone P0471B04,

N-terminal exon of a CYP75-like sequence, 70% identical to X70824, but no other part of

the gene is found.  Identical to AP003227 87336-87082; almost identical to AP003214

132381 MELAALCTDPVVLSAAFLCLLLNLSLCSYRPPSPGGGRRLPLGPPGVPVLGALPLVGPAPHADLASLARK 132172

132171 YGPIMYLKMGTCGVV 132127

 

#422

>AP003214.3 $P CYP76H13P almost identical to AP003227 related to CYP75

90% to aaaa01028537.1 89% to AP003261.1

165814 MELAVLCTDPVVLSGPFLCLLLHLSLCSYR 165725 (2?)

157244 RPPSPGGGRRLPPD 157203 frameshift

157203 PPGVPVLDALPLVGPAPHTDLASLARKYGPIMYLKMGTCGVV 157078 frameshift

AASSPCAARAFL 157053

 

#308

>AP003622 $P CYP76H14P chromosome 6 clone P0633E08 related to AC025783.5 exon 1

also 1 difference to aaaa01012542.1

105535 DLLGGALSLLPLLEWFRLVVGPRGFVVVCDHAFSRHVLCE 105654 frameshift and 6 aa deletion

105656 KGLVAEVSKFLFG 105694

 

#226

>aaaa01007618.1 CYP76H15P (indica cultivar-group) ortholog to AC025783.5b

4002 DLLGGAFSLLPLLEWGRAVSSSSATTPFPGTCYARKGLVAEVSEFLFGSGFSTA*DALW 4178

 

>AC025783.5b $P CYP76H15P chromosome 10 clone OSJNBa0001O14 1 ordered pieces Length = 180630

Extra 97C first exon (pseudogene)

       DLLGGAFSLLPLLEWGRAVSSSSATTPFPG 124558

124557 TCYARKGLVAEVSEFLFGSGFSTA*DALW 124471

 

#477

>AC078944.1 CYP76H16P (partial) clone OSJNBa0089D15,9 unordered pieces

51% to 71C4 this seq is not found in version 5 of this accession

no indica ortholog found 9/3/02 41% to 76H10 very decayed hard to place

140438 PFQGLCS KGTFRLPAPVPRLLPGRACPSSDVAGYTLPSGARVLVYVWA WADSCVRGLVRE 140259

140258 NLCQCVFRVRSAFRGVRLELFHWGQVPVWPGIAPAA*LVH*GLGS 140124

 

#231

>aaaa01008113.1 $FI CYP76K1 (indica cultivar-group) orth of AP005308.1b 99%

6975 MELTTISPVFLISLLGVPLLYLLWSKASKSPSGAPAAPPPPPGPTPFPVIGNIPDLLRGGEL 7160

7161 HRALTGLAASYGPVMSLRFGMASTVVLSSPDVAHEALHKKDGAISSRWVPDNANVLGHQD 7340

7341 VSMAWLPSSSPLWKHMRTLASTLLFTSRRLGASRGIRERKARELVDYLGARSGRPVRVGL 7520

7521 AVFGSVLNFMSNVFFSEDVVELGSETGQEFQQLIADSVAETAKPNISDFFPFLSA 7685

7686 LDLSRRRRAAAKNLKKFYDFFDDVIDRRLSSGEKPGDLLDSLLELHAKSQL 7838

7839 ERPLIRALMT (0) 7868

8303 DLFIAGSHTTTTTVEWALAELLRNPSKMAKARAELGEAFGRGAIEEGELARLPYLNAVI 8479

8480 KETLRLHPPAPLLLPHRVSSDSEPAGGVTLGGYSVPSGARVLINAWAIGRDPAAWSPEPD 8659

8660 AFSPERFLGREADYWGRTLEFIPFGSGRRACPGIPLAVAVVPMVVAAMVHSLEWRLPEGM 8839

8840 APGDVDVGDRFGAVLELATPLWAVPVKV* 8926

 

>AP005308.1b $F CYP76K1 (japonica cultivar-group) chr 9 = AP005575.1 74864-72921

145251 MELTTISPVFLISLLGVPLLYLLWSKASKSPSGAPAAPPPPPGPTPFPVIGNIPDLLRGG 145430

145431 ELHRALTGLAASYGPVMSLRLGMASTVVLSSPDVAHEALHKKDGAISSRWVPDNANVLGH 145610

145611 QDVSMAWLPSSSPLWKHMRTLASTLLFTSRRLGASRGIRERKARELVDYLGAR 145769

145770 SGRPVRVGLAVFGSVLNFMSNVFFSEDVVELGSETGQEFQQLIADSVAETAKPNISDFFP 145949

145950 FLSALDLSRRRRAAAKNLKKFYDFFDDVIDRRLSSGEKPGDLLDSLLELHAKSQLER 146120

146121 PLIRALM 146141

146574 DLFIAGSHTTTTTVEWAMAELLRNPSKMAKARAELGEAFGRGAVEEGELARLPYLN 146741

146742 AVIKETLRLHPPAPLLLPHRVSSDSEPAGGVTLGGYSVPSGARVLINAWAIGRDPAAWSP 146921

146922 EPDAFSPERFLGREADYWGRTLEFIPFGSGRRACPGIPLAVAVVPMVVAAMVHSLEWRL 147098

147099 PEGMAPGDVDVGDRFGAVLELATPLWAVPVKV 147194

 

#230

>aaaa01007971.1 CYP76K2P (indica cultivar-group) orth of AP005308.1a

71% to AP005308.1b

2493 EDVIEPGSETRQEFQELIADSVAETGVSDFFRFVSALDLSSRRCAATRNLSRFYDF 2326

 

>AP005308.1a $P CYP76K2P (japonica cultivar-group) chr 9 = AP005575.1 97551-97718

121122 EDVIEPGSETRQEFQELIADSVAETGVSDFFRFVSALDLSSRRCAATRNLSRFYDF 120955

 

#180

>aaaa01005655.1 $FI CYP76L1 (indica cultivar-group) orth of AP005308.1c 100%

12123 MEASTILWLLYVSLASCLLYKVFVSTKNGHPKIAARRPPGPTPVLLLGNVFDLRGELHL 11947

11946 ALARLAEEHGPVMSLKLGTATAVVASSAAAARDALQRYDHVLAARAVCDAARALGTH 11776

11775 ERSIVWLPGSSALWKRLRAVCTNHLFSARGLDATRAVREAKVRELVEHLRGHAAGAGEEE 11596

11595 AAAVDVGRVVFSAVINLVSNVLFSEDVADLSSDRAQELEMLVRDTVEEATKPNLSDLFP 11419

11418 VLAALDLQGRRRRTAVHIRKFHDFFDEIISRRQNAGGEGERKEDFLDVLLQLH 11260

11259 SADQLSLDTIKTFLG (0) 11215

10259 DLFTAGTDTNSITVEWAMAELLRHPAAMSRARAELRDALGAKPHPDESDIGRLPYLSAVV 10080

10079 METMRLHPPSPLLMPHEAVADGAAVGGYAVPRGTKVIVNVWSIMRDPASWPRPEEFEPER 9900

9899  FVAAGGSFRGGEMLEFMPFGAGRRACPGTPMATRVVTLVLASLLHAFEWRLPGGMRPCDV 9720

      DVRGRFGTSLNMVTPLKAVPVPVPARP* 9636

 

>AP005308.1c $F CYP76L1 (japonica cultivar-group) chr 9 = AP005575.1 67708-65225

152407 MEASTILWLLYVSLASCLLYKVFVSTKNGHPKIAARRPPGPTPVLLLGNVFDLRGELHLA 152586

152587 LARLAEEHGPVMSLKLGTATAVVASSAAAARDALQRYDHVLAARAVCDAARALGTHERSI 152766

152767 VWLPGSSALWKRLRAVCTNHLFSARGLDATRAVREAKVRELVEHLRGHAAGAGEEEAAAV 152946

152947 DVGRVVFSAVINLVSNVLFSEDVADLSSDRAQELEMLVRDTVEEATKPNLSDLFPVLAAL 153126

153127 DLQGRRRRTAVHIRKFHDFFDEIISRRQNAGGEGERKEDFLDVLLQLHSADQLSLDTIKT 153306

153307 FLG 153315

154270 DLFTAGTDTNSITVEWAMAELLRHPAAMSRARAELRDALGAKPHPDESDIGRLPYLSA 154443

154444 VVMETMRLHPPSPLLMPHEAVADGAAVGGYAVPRGTKVIVNVWSIMRDPASWPRPEEFEP 154623

154624 ERFVAAGGSFRGGEMLEFMPFGAGRRACPGTPMATRVVTLVLASLLHAFEWRLPGGMRPC 154803

154804 DVDVRGRFGTSLNMVTPLKAVPVPVPARP 154890

 

#112

>aaaa01003137.1 $FI CYP76M1 (indica cultivar-group) ortholog to AU173996.1

44% to 76C4 (C-term does not match cDNA; check for fs in cDNA)

2741 MDVNQLWLLWATLAVSLLYYISNRRRRVGGRRRCPPGPMPLPLVGNLLNLRGHLPPALAR 2920

2921 LARTYGPVMMLKMGLTTTVVISSGDAAREAFTKHDRHLAARTVLDVTRSLDFADRSMIWL 3100

3101 PSSDTVWKTLRGVTAASIFSPRGLAALRGVRESKVRDLVGYFRGRAGEVVDVRHAVYGCM 3280

3281 LSLVSSAFFSVDVVDLSAESENEFRQSMTFLMEVVSKTNVSDFFPFLRPLDLQGWRRLTE 3460

3461 RYLGRVTCFLDDVIDRRFAADASANRHGDFLDSLLDLVSTGKIVRENVTTILLDVFIAGS 3640

3641 DTITATVEWAMAELLRNPSEMAKVRAEMDGALGGKKTVDEPDIARLPYLQAVVKEAMRLH 3820

3821 PAAPLLLPHRAVEDGVEVGGYCVPKGSMVIFNVWAIMRDPAAWERPEEFMPERFIRRGDD 4000

4001 DEVDFWGKTFEFIPFGSGRRVCAGLPMAERVVPFMLASLLRAFEWRLPDGVSAEELDMRH 4180

4181 RFTIANFRAIPLKAVPVVVS* 4243

 

>AU173996.1 CYP76M1 (partial) cDNA clone S12633. Length = 451 New seq 66% to AP003623.1 aa 413-458 no larger pieces are known

12  EVDXWGKTFEFIPFGSGRRVCAGLPMAERVVPFMLASLLRAFEWRLPDGVSAEELDMR

probable fs here; does not match aaaa01003137.1

PGLPLPTSVPSLQGRARCGQLVALFHCINGELWAYVSPFGPHMSSWIL*

 

#161

>aaaa01004597.1b $PI CYP76M2 (indica cultivar-group) missing Nterm 42% to 76C4

89% to Nterm of AU172561.1 6 diffs with BI808700.1

ortholog of AP005254.1a

RRLPPGPTPLPVIGNVLSLRGNMHHALARLARERYGPVMA

LKLGLVTAVVVSSPDAAREAFTKHDRRLAARAVPDTSRVRGFADRSMIWLPSSDTRWKTL

RGVVATHVFSPRSIAAARGVRERKVRDIVGYFAAHVGEVVDVGEAVYSGVVNLVSNAFFS

GDVVDVGEESAHGLREAVEDIILAIAKPNVSDLFPFLRPLDLQGWRRWAEKRYDTVFDIL

DNITNSRLADASAGNHAGDFLDSLLGLMSYGKIARDDVTTIMFDVFGAGTDTIAITVQWA

MAELLRNPSIMAKARTEMEDVLAGKKTIEENDTEKLPYLRAVIKEAMRLHPVAPILLPHQ

AAEDGVEIGGYAVPKGSTVIFNVWAIMRDPTAWERPDEF

MPERFLQRAEVDFRGKDFEFMPFGAGRRLCPGLPMAERVVPFIL (2) 14530

Inverted orientation

ASLLHAFEWRLPDGMSAEELD

Small deletion

14623 VTVPLKAVPILASSASELQAS* 14688

 

 

>AP005254.1a $F CYP76M2 (japonica cultivar-group) chromosome 8 = AF488522.1 PM-II

42% to 76C4 = BI808700.1

59535 MERDAWLLCAALAAATVVYYLACTTSRRAQRRRLPPGPTPLPVIGNVLSLRGNMHHALARLARER 59341

59340 YGPVMALKLGLVTAVVVSSPDAAREAFTKHDRRLAARAVPDTSRVRGFADRSMIWLPSSD 59161

59160 TRWKTLRGVVATHVFSPRSIAAARGVRERKVRDIVGYFAAHVGEVVDVGEAVYSGVVNLV 58981

58980 SNAFFSGDVVDVGEESAHGLREAVEDIILAIAKPNVSDLFPFLRPLDLQGWRRWAEKRYD 58801

58800 TVFDILDNITNSRLADASAGNHAGDFLDSLLGLMSYGKIARDDVTTIMFDVFGAGTDTIA 58621

58620 ITVQWAMAELLRNPSIMAKARTEMEDVLAGKKTIEENDTEKLPYLRAVIKEAMRLHPVAP 58441

58440 ILLPHQAAEDGVEIGGYAVPKGSTVIFNVWAIMRDPTAWERPDEFMPERFLQRAEVDFRG 58261

58260 KDFEFMPFGAGRRLCPGLPMAERVVPFILASLLHAFEWRLPDGMSAEELDVSEKFTTANV 58081

58080 LTVPLKAVPILASSASELQAS* 58015

 

#160

>aaaa01004597.1a CYP76M3P (indica cultivar-group) 3 diffs with BI808700.1

runs off end of scaffold; see aaaa01004597.1b for AP005254.1a seq

same as aaaa01018916.1 and aaaa01004597.1b; possibly two similar genes

close together, and this one may be the same as aaaa01018916.1

2   WAIMRDPTAWERPDEFMPERFLQRAEVDFRGKDFEFMPFGAGRRLCPGLPMAERVVPFI 178

179 LASLLHAFEWRLPDGMSAEELDVSEKFTTANVLTVPLKAVPILASSASELQAS* 340

 

#160

>aaaa01018916.1 $PI CYP76M3P (indica cultivar-group) missing Nterm, inverted seq at end

85% to Nterm of AU172561.1 4 diffs with BI808700.1 96% to AP005254.1a $F chr 8

4748 RRLPPGPTPLPVIGDVLGLRGNMHHALARLARERYGPVMTLKLGLVTAVVVSSPGA 4581

4580 AREAFTRHDRRLAARTVPDISRARGLAGRSMIWLPSSDPRWKTLRGVVAAHVFSPRSLAA 4401

4400 ARGVRERKVRDIVGYFAAHVGEVVDVGEAVYSGVVNLVSNAFFSGDVVDVGEESAHGLRE 4221

4220 AVEDMISAIAKPNVSDLFPFLRPLDLQGWRRWAEKRYDTVFDILDNITNSRLADASAGNH 4041

4040 AGDFLTPSLGLMSYGKIARDDVTTIMFDVFGAGTDTIAITVQWAMAELLRNPSIMAKART 3861

3860 EMEDVLAGKKTIEENDTEKLPYLRAVIKEAMRLHPVAPILLPHQAAEDGVEIGGYAVPKG 3681

3680 STVIFNVWAIMRDPTAWERPDEFMPERFLQRAEVDFRGKDFEFMPFGAGRRLCPGLPMAE 3501

3500 RVVPFIL

Inverted sequence orientation

ASLLHAFEWRLPDGMSAEELD

 

VSEKFTTANVLTVPLKAVPILASSASELQAS* 3321

 

may not have a complete ortholog

 

#331

>aaaa01013893.1 $PI CYP76M4P (indica cultivar-group) pseudogene (gap in sequence)

3 diffs with Nterm of AU172561.1 93% to Cterm of AU172561.1

ortholog of AP005254.1b (5 diffs)

3604 MESEVCWLLCAALAAAMACYYLTGTMRRRSRRLPPGPTPLPVIGNVLSLRGNMHHALERL 3783

3784 AGEHGPVMALKLGLVTAVVVSSAGAAREAFTKHDRRLAARAVPDTTRARGFASRSMIWL 3960

3961 PSSDPRWKTLRGVAATHVFSPRSLAAARGVRERKVRDIVGHLAGHAGEVVDVGKVVYGGV 4140

4141 LNL 44 amino acid gap

     RRLDLQGWRRWAE 4188

4189 KRYDKVFGIFDSVINSRLADASTGKHADAGAGDFLDSLLDLMSAGTIARDDVTSIMYDLF 4368

4369 GAGTDTIAITVEWAMAELLRNPSVMAKARAEMNHVLAGKVKATEMEENDVEKLPYLQAVV 4548

4549 KEGMRLHPAAPILVPHRAEEDDAEIGGYAVPKGSTVIFNVWAIMRDPVAWERPEEFMPER 4728

4729 FLDMAEEVDFRGKDHKFMPFGTGRRLCPGLSMAKRVVPFILASLLHAFEWRLPAGVTAEA 4908

4909 LDLSEKFTTVNVLVTPIKAIPILASDQI* 4995

 

>AP005254.1b $P CYP76M4P (japonica cultivar-group) chromosome 8

82126 MESEVCWLLCAALAAAMACYYLTGTMRRRSRRLPPGPTPLPVIGNVLSLRGNMHHALARL 81947

81946 AGEHGPVMALKLGLVTTVVVSSAGAAREAFTKHDRRLAARAVPDTTRARGFASRSMIWL  81770

81769 PSSDPRWKTLRGVAATHVFSPRSLAAARGVRERKVRDIVGHLAGHAGEVVDVG (fs)   81611

      QVVYGGVLN 44 amino acids missing

81582 LRRLDLQGWRRWA (fs) 81544

81542 EKRYDKVFGIFDSVINSRLADASTGKHADAGAGDFLDSLLDLMSAGTIARDDVTSIMYDL 81363

81362 FGAGTDTIAITVEWAMAELLRNPSVMAKARAEMNHVLAGKVKATEMEENDVEKLPYLQAV 81183

81182 VKEVMRLHPAAPILVPHRAEEDDAEIGGYAVPKGSTVIFNVWAIMRDPVAWERPEEFMPE 81003

81002 RFLDMAEEVDFRGKDHKFMPFGTGRRLCPGLSMAKRVVPFILASLLHAFEWRLPAGVTAE 80823

80822 ALDLSEKFTTVNVLVTPIKAIPILASDEI* 80733

 

#331

>aaaa01093009.1 CYP76M4P (indica cultivar-group) orth AP005254.1b $P chr 8 100%

see aaaa01013893.1 for ortholog

3   TIAITVEWAMAELLRNPSVMAKARAEMNHVLAGKVKATEMEENDVEKLPYLQAVVKEVMR 182

183 LHPAAPILVPHRAEEDDAEIGGYAVPKGSTVIFNVWAIMRDPVAWERPEEFMPERFLDMA 362

363 EEVDFRGKDHKFMPFGTGRRLCPGLSMAKRVVPFILASLLHAFEWRLPAGVTAEALDLSE 542

543 KFTTVNVLVTPIKAIPIL 596

 

#331

>aaaa01076605.1 CYP76M4P (indica cultivar-group) ortholog of AP005254.1b 100%

AU172561.1 (1 diff) see aaaa01013893.1 for ortholog

423 MESEVCWLLCAALAAAMACYYLTGTMRRRSRRLPPGPTPLPVIGNVLSLRGNMHHALARL 602

603 AGEHGPVMALKLGLVTTVVVSSAGAAREAFTKHDRRLAARAVP 731
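
Fragments carry the same #n as their parent sequence, which is why #331 heads three separate blocks above. A minimal sketch of collecting accessions under their #n follows. The function name group_by_number is hypothetical, and the sketch assumes that every header starts with > and that its first whitespace-delimited token is the accession.

```python
from collections import defaultdict

def group_by_number(lines):
    """Map each #n to the accessions of the '>' headers listed under it."""
    groups = defaultdict(list)
    current = None
    for raw in lines:
        line = raw.strip()
        if line.startswith("#") and line[1:].isdigit():
            current = int(line[1:])          # a new "#n" block starts here
        elif line.startswith(">") and current is not None:
            # Assumption: the first token after '>' is the accession.
            # Headers such as ">CONTIG OF C72003, C99202" break this and
            # would need special handling.
            groups[current].append(line[1:].split()[0])
    return dict(groups)

sample = [
    "#331",
    ">aaaa01013893.1 $PI CYP76M4P (indica cultivar-group) pseudogene (gap in sequence)",
    ">AP005254.1b $P CYP76M4P (japonica cultivar-group) chromosome 8",
    "#331",
    ">aaaa01093009.1 CYP76M4P (indica cultivar-group) orth AP005254.1b $P chr 8 100%",
    "#331",
    ">aaaa01076605.1 CYP76M4P (indica cultivar-group) ortholog of AP005254.1b 100%",
]
grouped = group_by_number(sample)
```

With the sample headers, all four accessions end up under 331, so repeated #n blocks merge into one ortholog set.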

 

#295

>aaaa01011645.1 $FI CYP76M5 (indica cultivar-group) one frameshift, no introns (partial)

runs off end 66% to AP005254.1a continues on AAAA01039014.1

1329 MEARELWVLAAALAVSLLYYLTVLMRYAGGGSSRSSRPRLPPGPTPLPLIGNLLSLR 1159

1158 GVLHHRLASLARVHGPVMALRLGLTTAVVVSSRDAAAEAFTKHDRRLAARVVPDSNRA 985

984  HGFSDRSVIWLPSSDPRWKTLRGIQATHLFSPRGLAAVRAVRESKVRDIVAYFRSRAGEE 805

804  VVFGEAIYSGVLNLVSSSFFSVNMAGVGSEEAHGLRELVEDLVEAVAKPNVSDLFPFLR 628

627  QLDLQGLRRRTEERMARAFGILDGIIDRRLANRTHGDRHGDFLDALLDLVSEG 469

468  KMARDHVTIMLFEVFGAGSDTMSISLEWAMAELLRNPRAMRKARAELEDAAAV 310

309  VEESDAARLPYLQAVVKEAMRLHPVGPILLPHRAVEDGVEIGGYAVPRGAMVI 151 (fs)

149  FNAWAIMRDPAAWERPDEFVPERFMETTTAIDFRGKEYEYLPFGSGR 3

RLCPGLPLAERVVPFVLASLLRAFEWRLPDGVSAEDLDVSERFNTANVLAVPLKVVPVIVN*

 

No japonica ortholog found 9/10/02

 

#242

>aaaa01008405.1 CYP76M6 (indica cultivar-group) top part of ortholog to BI813130.1

AAAA01075650.1 (indica cultivar-group) ortholog of BI813130.1 (4 diffs)

Orth to AP005114.1a

580 MEKLKSELWMTAVATCMSLLLYLTILRRRHASGGRSLSLPPGPTPLPLIGNLFCLGGIFH 401

400 QTLAKLARVHGPVMTLKLGLTTAVVVSSAEAAREAYTKHDQRLAARPVPDAFRANGFSER 221

220 SIVFSPSSDPQWKNLRGIHATHIFSPRALAALRGIRARKVRDIVGYIRTVAGEEMCVREV 41

40  VHNGVLNLIST 8

ELLRNPRVMAKVRAEVMDALGGKESFDEGDAASLTYLQCVFKEAMRLHPVGSILVPHLAQ

QDGVEIGGYAVPKGTTVIFNAWAIMRDPAAWESPDQFLPERFLHKESSSPPLELRGKDYEYIPF 368

367 GSGRRLCPGLPLAERAVPFILASLLHAFEWRLPDGMSPDDMDMTEKFATANVLATPLKAV 188 (fs)

186 QSSHTSYIYSLRPKI* 139 check end vs AP005114.1a

 

>AP005114.1a $F CYP76M6 (japonica cultivar-group) chromosome 2 clone P0689H05

BI813130.1 cDNA clone J002D01.Length = 497 75% to BI808700.1 68% to AP003623

BI813468.1 K008G06 Oryza sativa mature leaf library induced by M.grisea Oryza

sativa cDNA clone K008G06.Length = 474

best rice match AP003623.1 chromosome 6 63% clone P0642B07 41% to 76C4 = AU095771

26813 MEKLKSELWMTAVATCMSLLLYLTILRRRHASGGRSLALPPGPTPLPLIGNLLCLGGIFHQTLA 27004

27005 KLARVHGPVMTLKLGLTTAVVVSSAEAAREAYTKHDQRLAARPVPDAFRANGFSERSIVF 27184

27185 SPSSDPQWKNLRGIHATHIFSPRALAALRGIRERKVRDIVGYIRTVAGEEMCVREVVHNG 27364

27365 VLNLISTSFFSMDMADVRSESARGLRGLIEDIIATVAGPNVSDFFPFLRQLDLQGLRRQT 27544

27545 GSHLGIVFGLLDDIIDRRMAETRDHPDKQRHGDFLDALISLASAGKIPRYHITYLLFDVF 27724

27725 AAGADTMTTTVEWAMAELLRNPRVMAKVRAEVTDALGGRESFDEGDAASLTYLQCVFKEA 27904

27905 MRLHPVGSILVPHLAVQDGVEIGGYAVPKGTTVIFNAWAIMRDPAAWESPDQFLPERFLH 28084

28085 KEESSSPPLELRGKDYEYIPFGSGRRLCPGLPLAERAVPFILASLLHAFEWRLPDGMSPD 28264

28265 DMDMTEKFATANVLATPLKAVPVASHTS* 28351

 

#164

>aaaa01004667.1b $FI CYP76M7 (indica cultivar-group) ortholog of AP003623.1 

7298 MENSQVWLLWGALSVAVLFYLSTLRRRYAGGKPLPPGPTPLPLIGNLHLAGGTFHHKLR 7122

7121 DLARVHGPVMTLKLGLATNVVISSREAAIEAYTKYDRHLAARATPDTFRACGFADRSMVF 6942

6941 IPSSDPQWKALRGIQGSHVFTPRGLAAVRPIRERKVGDLIAYLRAHAGKEVLLGQAMYTG 6762

6761 LLNLVSFSYFSIDIVDMGSQMARDLREVVDDIISVVGKPNISDFYPFLRPLDLQGLRRWT 6582

6581 TKRFNRVFSIMGDIIDRRLAHIRDGKPRHDDFLDSLLELMATGKMERVNVVNMLFEAFVA 6402

6401 GVDTMALTLEWVMAELLHNPAIMARVRAELSDVLGGKEAVEEADAARLPYLQAVLKEAMR 6222

6221 LHPVGALLLPHFAAEDGVEIGGYAVPRGSTVLFNAWAIMRDPAAWERPDEFVPERFLGRS 6042

6041 PPLDFRGKDVEFMPFGSGRRLCPGLPLAERVVPFILASMLHTFEWKLPGGMTAEDVDVSE 5862

5861 KFKSANVLAVPLKAVPVLIK* 5799

 

>AP003623.1 $F CYP76M7 chromosome 6 clone P0642B07 41% to 76C4 = AU095771

89% to CONTIG OF C72003, C99202

D46292        42% to 71B6 and 71B8   3/95   8/95 17-91 REGION

AU032289 29% to 76C3, 30% to 71B4

84245 MENSQVWLLWGALSVAVLFYLSTLRRRHAGGKPLPPGPTPLPLIGNLHLAGGTSFHHKLRDLAR 84436

84437 VHGPVMTLKLGLATNVVISSREAAIEAYTKYDRHLAARATPDTFRACGFADRSMVFIPSS 84616

84617 DPRWKALRGIQGSHVFTPRGLAAVRPIRERKVGDLMAYLRAHAGEEVLLGQAMHTGLL 84790

84791 NLVSFSYFSIDIVDMGSQMARDLREVVDDIISVVGKPNISDFYPFLRPLDLQGLRRWTTKRFNRVFSIMGDI 85006

85007 IDRRLAHIRDNKPSHNDFLDSLLELMAAGKIDRVNVLDMLFEAFVAGADTMALTLEWVMA 85186

85187 ELLKNPGVMAKARAELRDVLGDKEVVEEADAARLPYLQAVLKEAMRLHPVGALLLPH 85357

85358 FAVEDGVEVGGYAVPKGSTVLFNAWAIMRDPAAWERPDEFVPERFVERAPLLDF 85519

85520 RGKDAEFMPFGSGRRLCPGLPLAERVMPFILASMLHTFEWKLPGGMTAEDV 85672

85673 DVSEKFKSANVLAVPLKAVPVLIK* 85747

 

#240

>aaaa01008385.1 CYP76M8 (indica cultivar-group) missing the Nterminal

ortholog of C72003 contig

8309 TLKLGLATNVIISSREAAAEAYTKYDRHLAARATPDTFRACGFADRSMVFIPSSDPQW 8136

8135 KALRGIHASHVFTPRVLAAVRPIRERKVGDLIAYLRAHAGEEVLVGHAMYTGILNMVSFS 7956

7955 YFSIDIVGMGSRMARELREVVDDIIVVVGKPNVSDFYPFLRPLDLQGLRRWTTKRFNRVF 7776

7775 SIMGDIIDRRLAHIRDNKPRHDDFLDSILELMAAGKIDRVNVLNMLFEAFVAGADTMALT 7596

7595 LEWVMAELLKNPGVMAKARAELRDVLGDKEIVEEADAARLPYLQAVLKEAMRLHPVGALL 7416

7415 LPHFAMEDGVEVGGYAVPKGSTVLFNAWAIMRDPAAWERPDEFVPERFVERTPQLDFRGK 7236

7235 DVEFMPFGSGRRLCPGLPLAERVVPFILASMLHTFEWELPGGMTAEELDVSEKFKTANVL 7056

7055 AVPLKAVPVLIK 7020

 

>CONTIG OF C72003, C99202 CYP76M8 (partial) 34% to 76C4 some identical runs with AU032289

34% TO 76C4 same clone = C19444

90% to AP003623.1 opposite end of AU176462

AU176462.1 clone E10426.Length = 475 90% to AP003623.1 also AU182232.1

opposite end of C19444 CONTIG OF C72003, C99202 lower case from AP003623.1

There are two closely related genes

This part 97% to AAAA01008385.1 93% to AAAA01004667.1

ATNVVISSREAAIEAYTKYDRHLAARATPDTFRACGFADRSMVFIPSSDPQWKA

LRGIHASHVFTPRVLAAVRPIRERKVGDLIAYLRAHAGEEVLVGHAMYTGILNMVS

FSYFSVDIVDMGSQMARELREVVDDIILVVGKPNVSDFYPFLRPLDLQGLRRW

TTKRFNRVFSIMGDIIDRRLAHIRDNKPRHDDFLDSILELXAAGKIDRVQVLNMLFEGF

 

This part 98% to AAAA01008385.1 90% to AAAA01004667.1 from opposite end of C19444

2   FNAWAIMRDAAAWERPDEFVPERFVERTPQLDFRGKDVEFMPFGSGRRLCPGLPLAERVVP 184

185 FILASMLHTFEWELPGGMTAEELDVSEKFKTANVXAVPLKAVPVLIK* 328

 

#30

>aaaa01000831.1 $FI CYP76M9 extra Nterm seq orth of AP004684.1a

12751 MQQQVTEFASAGSDTHAERKQSELTAAEMGHELWVLWATLAVSLLCYLYLTSHRLGSRRR 12930

12931 RWPPGPRPLPLLGNLLDLRGGNLHHTLARLARAHGAPVMRLQLGLSPAVVISSPGAAREA 13110

13111 FTAHDRRLAARAVPDANHALGFCDRSMIWLPSADPMWRTLRGVVAAHAFSPRALAAARAV 13290

13291 HERKVRDLVAYLRGRAGREVDVKDAVYGGVLNLVSSALFSADVVDVGGESAQGFRELVEE 13470

13471 LIESIAKPNVSDLFPFLRPFDLQGWRRWTSGHLAKIYKVLDDIIDRRSAEDDAAMDKRGD 13650

13651 FLDVLLELMSTGKIAREYLTNILFDVFTAGSDTMSLTVVWAMAELLRNPGVMAKARAEID 13830

13831 AALGGREAVEEADVARMPYVQAVLKEAMRLHPVAPVMLPRKAAEDGVEIGGFEVPRGCAV 14010

14011 IFNTWAIMRDPAAWERPDEFVPERFVGRSRATEEMDFRGKDFGFLPFGSGRRLCPGVPMA 14190

14191 ERVLPLIMASLLHAFEWRLPDGMSAEQLDVSEKFTTANVLAVPLKAVPVVIAC 14349

 

>AP004684.1a $F CYP76M9 chromosome 6 clone P0012H03, Length = 163117 41% to

76C2 New seq 76% to CONTIG OF C72003 (c-term) 59% to AP003623.1 70% to AU173996.1

10968 MQQQVTEFANAGSDTHAERKQSELTAAEMGHELWVLW

11079 ATLAVSLLCYLYLTSHRLGSRRRRWPPGPRPLPLLGNLLDLRGG 11210

11211 NLHHTLARLARAHGAPVMRLQLGLSPAVVISSPGAAREAFTAHDRRLAARAVPDAN 11378

11379 HALGFCDRSMIWLPSADPMWRTLRGVVAAHAFSPRALAAARAVHERKVRDLVA 11537

11538 YLRGRAGREVDVKDAVYGGVLNLVSSALFSADVVDVGGESAQGFRELVEELIESIAKPN 11714

11715 VSDLFPFLRRFDLQGWRRWTSGHLAKIYKVLDDIIDRRSAEDDAAMDKRGDFLDVL 11882

11883 LELMSTGKIAREYLTNILFDVFTAGSDTMSLTVVWAMAELLRNPGVMAKAR 12035

12036 AEIDAALGGREAVEEADVARMPYVQAVLKEAMRLHPVAPVMLPRKAAEDGVEIGG 12200

12201 FEVPRGCAVIFNTWAIMRDPAAWERPDEFVPERFVGRSRATEEMDFRGKDFGFLPFGSG 12377

12378 RRLCPGVPMAERVLPLIMASLLHAF 12452

12453 EWRLPDGMSAEQLDVSEKFTTANVLAVPLKAVPVVIAC* 12569

 

#22

>aaaa01000493.1a CYP76M10 (indica cultivar-group) 3 pseudogene fragments

100% to AP005254.1c with a deletion in the I-helix

3400 ADASTKKHGDFLDSLLELMSAGKIACDDVTTVMFDAFGA 3516 frameshift

3515 LLRNPSIMAKVRAEMEDVLAGKKTIEENDTEKLPYLRAVIKEAMRLHPVAPILLPHRA 3688

3689 AEDGVEIGGYAVPKGSTVIFNVWTIMRDPAAWERPEEFMPERFLQRAEVDFRGKDFEFI 3865

3866 PFGAGRRLCPGLPMTERVVPFILASLLHAFEWRLPVGVAAETLDLSEKFTTVNVLVTPLK 4045

4046 AIPILAS 4066

 

>AP005254.1c $F CYP76M10 (japonica cultivar-group) chromosome 8 = AF488521.1 PM-I

1 diff with BM038607.1 = AP005245.1 15190-16707 41% to 76C2

103909 MEREAWLLCAALAAAMVYYYYYYLACTTRRAQRRLPPGPTPLPVIGNVLSLSGDMHHEL

104086 ARLAREQYGPVMTLKLGLFTAVVVSSPDAAREAFTKHDRRLAARTVPDISRARGLTGRSM

104266 IWLPSSDPRWKTLRSAVATHFFSPRSLAAARGVRERKVRDIVNYFAGHAAEVIDVGEAVY

104446 GGVINIVSNAFFSADVVDVGKESAHGLRETLEDIILAIAKPNVSDLFPFLRRLDLQGWRR

104626 WAEKRYDKVFGILDDKINSRLADADADASTKKHGDFLDSLLELMSAGKIACDDVTTVMFD

104806 AFGAGTDTISNTVVWAMAELLRNPSIMAKVRAEMEDVLAGKKTIEENDTEKLPYLRAVIK

104986 EAMRLHPVAPILLPHRAAEDGVEIGGYAVPKGSTVIFNVWTIMRDPAAWERPEEFMPERF

105166 LQRAEVDFRGKDFEFIPFGAGRRLCPGLPMTERVVPFILASLLHAFEWRLPVGVAAETLD

105346 LSEKFTTVNVLVTPLKAIPILASHQI* 105426

 

#22

>aaaa01069756.1 CYP76M10 (indica cultivar-group) orth AP005254.1c $F chr 8 100%

see aaaa01000493.1a for ortholog

364 AQRRLPPGPTPLPVIGNVLSLSGDMHHELARLAREQYGPVMTLKLGLFTAVVVSSPDAAR 543

544 EAFTKHDRRLAARTVPDISRARGLTGRSMIWLPSSDPRWKTLRSAVATHFFSPRSLAAAR 723

724 GVRERK 741

 

#23

>aaaa01000493.1b CYP76M11P (indica cultivar-group) 3 pseudogene fragments

1 aa diff with AP005254.1d but shorter

9051 EKRYDKVFGIFDSVINSRLADASTGKHADAGAGDFLD 8941 frameshift

8939 SLLDLMSAGKIARDDVTSIMFDLFGAGTDTIAITVEWAMAELLRNPSVMTKARAEMNHAL 8760

8759 AGKKTIEENDVEKLPYLQAVLREAMRLHPAAPILVPHRAEEDGAEIGGYAVPKGSTVIFN 8580

8579 VWAIMRDPAAWERPEEFMPERFMDMAEEVDFRGKDYKFIPFGAGRRLCPGLLMAERVVPF 8400

8399 ILASLLHSFEWRLPGGMTAESLDLSEKFTTVNVLVTPLKAIPILASKNENIRE 8241

 

#23

>aaaa01000493.1c CYP76M11P (indica cultivar-group) 3 pseudogene fragments

100% to AP005254.1d but shorter

26182 LRRLDLQGWRRWAEKRYDKVFGIFDSVINSRLADASTGKHADAGAGDFLDSLLDLMSAGK 26361

26362 IARDDVTSIMFDLFGAGTDTIAITVEWAMAELLRNPSVMTKARAEMNHALAGKKTIEEND 26541

26542 VEKLPYLQAVLREAMRLHPAAPILVPHRAEEDGAEI 26649

 

>AP005254.1d CYP76M11P (japonica cultivar-group) chromosome 8 aa 227-504

same as AP005245.1 21717-20866 ortholog of AAAA01000493.1 99% partial

upstream seq missing in both accession numbers and in AAAA01000493.1

probable pseudogene

AAAA01000493.1 has two copies of this and a third sequence

       GGVLN

110436 LRRLDLQGWRRWAEKRYDKVFGIFDSVINSRLADASTGKHADAGAGDFLDSLLDLMSAGK 110257

110256 IARDDVTSIMFDLFGAGTDTIAITVEWAMAELLRNPSVMTKARAEMNHALAGKKTIEEND 110077

110076 VEKLPYLQAVLREAMRLHPAAPILVPHRAEEDGAEIGGYAVPKGSTVIFNVWAIMRDPAA 109897

109896 WERPEEFMPERFMDMAEEVDFRGKDYKFIPFWAGRRLCPGLLMAERVVPFILASLLHSFE 109717

109716 WRLPGGMTAESLDLSEKFTTVNVLVTPLKAIPILASKNENIRE* 109585

 

#163

>aaaa01004667.1a $PI CYP76M12P (indica cultivar-group)

Cterm pseudogene fragment

1103 LHGGTTAEDVDVSEKFKSANVLAVPLQAVPVL (fs)

                                     IKFVGRT*

 

#476

>AL713947.1 $F CYP76M13 chromosome 12 clone Monsanto-OJ1003_A04,

AQ259099 41-60 N-term similar to 71B7 no ortholog found in indica 9/3/03

39% to 76C2 57% to 76M2

MASGLWFLGISLAVLLLCYVGTNRRGDGQRPPGPRTLPIVGNLLDLRGGNLHHKL

ASLAHAHGPVMTLKLGLVTTVFISSRDAAWEAFAKHDRRLAARTVPDTRRALAHAERSM

VWLPSYDPLWKTLRSIAA (fs) 45837

45835 THVFSPRSLGVARSARERKVHDMVDSFRRRAGQEVDIGQVLYHGMFDLLANVLLSVD

AHPNLRDLMEDIVAILAKPNASDFFPLLRPLDLQGIRHWTAIHMSRVLHILDSIIDC

RLAQGTDDQCKDVLDSLLVLMSTGKLSRRDVKILLFDILAAGTETTKITVEWAMAELL

RNPNVMATTRAEMKAALGGNGTITEADVVNLPYLQAAVKESMRLHPVAPLLLPHLVVEDG

VRIGGYAVSKGTTVIFNSWAIMRDSTAWERPDDFLPDRFLGKTELDLWGKQAKFIP

LGSGRRLCPALPMVELVVPFTVASLLHAFEWHLPKGMSAEEVDVTERYTSNDILVMATPLKAVPLIVT*

 

#478

>AP003267.1 $F CYP76M14 chromosome 1 clone P0496H05 38% to 76C4

no indica ortholog found 9/3/02 64% to 76M7

55979 MEKSSELWLLWAVFSASLVFLYLTIRRRSGAGAGGKPPLPPGPTPLPLIGNLLDLRGGVIHDKLAALARVYG 55764

55763 PVMMIKLGLNDAVIISSRDAAREAFTRYDRHLAARAIPDTFRANGFHERSAVFLPSSDER 55584

55583 WKALRGIQGTHIFTPRGLAAVRPVRERKVRDIIAYFRDHAGEELVIRQAIHTGVLNLVSS 55404

55403 SFFSMDIAGMGSETARELREHVDEIMTVFAQPNVSDYFPFLRRLDLQGLRRSTKRRFDR 55227

55226 IFSILDDIVERRLVDRGERGGEGGASSNSSKSKHQYDGGDFLDALLELMVTGKM 55065

55064 ERDDVTAMLFEAFVAGGDTVAFTLEWVMADLLRNPPVMAKLRAELDDVLGGKDQSAIEEH 54885

54884 DAGRLPYLQAVLKESMRLHSVGPLLHHFAAEDGVVVGGYAVPRGATVLFNTRAIMRDPAA 54705

54704 WERPEEFAPERFLAREGKAPVDFRGKEADFIPFGSGRRLCPGIPLAERVMPYILALMLRE 54525

54524 FEWRLPDGVSPEELDVSEKFMSVNVLAVPLKAVPVKVIN* 54405

 

#343

>aaaa01014464.1 $FI CYP76N1 (indica cultivar-group) 92% to AP005069.1a

4147 MAASLAWLLVAIVLASLYLAMQHRSVAAARRRRLPLGPTPLPLVGNLLSVSR 3992

3991 SSPHRSLARLAERYGQLMRVRLGAVDYVVASSPAAAGDIHHHSHNAHLASRPLFDAWRGA 3812

3811 QHHRNSVIALPPHGEWRAQRRLATEEVMSPGRLDALAPLRREKVRELVRRVAGRAARGG 3635

3634 EPVEVGREAFEAFLGILSRTAFSADLVDPDLRDAVQEATKLAATPNVSDFFPAV 3473

3472 AAADLQGLRRRMGKLVARAYGIIDELLARRKGGREAGE

     PRKDDLLDVVLDNQDEWKKENNPVIDRNNIKGLIA (0) 3254

3048 DLFVAGTDSGSTAVEWAIAELLQSPQSMQKVKNEFRRVMCTRTEIEESDISQLPYLQAVL 2869

2868 KETLRLHPSVPMTYYKAEATVEVQGYTIPKDTNIILNIWAIHRKPDVWADPDRFMPERF 2692

2691 METDTNFFGKHPEFIPFGGGRRICLGLPLAYRMVHMVLASLLFHFDWKLPEGAEKDGVDM 2512

2511 REKYGMVLHKETPLKALAIETYNRM* 2432

 

no japonica ortholog found 9/11/02

 

>AP005069.1a CYP76N1 (japonica cultivar-group) chr 2 no indica ortholog found 9/11/02

17537 MAASLAWLLVAIVLASLYLAMHHRVAAARRRRLPPGPTPLPLVGNLLSVSRSGPHRSLA 17361

17360 RLAERYCPLMRVRLGVVDYVVASSPAVAGDIHHHSHNAHLASRPLFDVWRGAEHHRNSVI 17181

17180 VLPLHGVWRAQRRLATEEVMSPRRLDALAPTRREKVRELMRCVARRAARGEPVEVGLEA 17004

17003 FEAFLGILSCTAFSADLVDP 16944 frameshift

16944 DLRDAVQEATKLAATPNASDFFPAMAAAD 16858

16686 LQGLRRRMGKLVARAYGIIDELLARRKGGREAGEPRKDDMLDVALDNEDEWKNNNPVID 16510

16509 RNNIKGLIA 16483

16286 DLFVAGTDSGSTAIEWAIVELLQNPQSMQKVKDEFRRVLGTRTEIEESDISQLPYLQAVL 16107

16106 KETLRLHPSVPMTYYKAEATVEVQGYIIPKGTNIILNIWAIHRKPDVWADPDRFMPERFM 15927

15926 ETDTNFFGKHPEFIPFGGGRRICLGLPLAYRMVHMVLASLLFHFDWKLPEGAEKDGVDMR 15747

15746 EKYGMVLHKETPLKALAIETYNR 15678

 

#229

>aaaa01007897.1 $FI CYP76N2 (indica cultivar-group)

9748 MPAMAASLAWLLLALLLASLYTATHRIAAARRRLPPGPTPLPLVGNLLSVSRTSP 9584

9583 HRSLARLAARYGPLMRVRLGVVDYVVVSSPAVADDIYHSRHNAHLSSRPPYDAWWGEKHR 9404

9403 LNSVIALPPHAVWRAQRRLAMEEVMSPGRLDALAPLRREKVRELLVRVRRVAAA 9242

9241 RGDGDGELVPVEVGQAAFEGFLSILSSTMVSVDLADSDLRDVVREAAILAATPNVSDIF 9065

9064 PAIAAADLQGYRRRMGELVARGYGIFEELLARRKGGREAGERRKDDLLDVVL 8909

8908 DREDELKKESNPVLDRNAIKGLIT (0)

7478 DLMVAGTDTSSSTIEWAMAELLQNSESMQKVKDELRRVIGTRTQIEESDISHLPYLQAII 7299

7298 KETLRLHSNVPMSYYMAEATVEVQGYTIPKGTNIIVNIWAIHHQPNVWVDPDKFMPERF 7122

7121 IGKDTNFFGKHPELIPFGGGRRICLGLPLAYRMVHVVLASLLFHFDWKLPEGAKKDGIDM 6942

6941 SEKFGLVLSMATPLKALATRSCNDM* 6864

 

aaaa01007897.1 has no japonica ortholog in nr or HTGS on 9/7/02

 

#274

>aaaa01010283.1 CYP76N3 (indica cultivar-group) orth AC074054.1a $F chr 10 97% 2 aa diffs

9319 SLLLVFIISYIFQPLLDARRRFPPGPHRLPVISNLHNIGKNPHHAFARLADRYG 9480

9481 PLMSIRLGGVRAVVATPADAAREILQL 9561

 

>AC074054.1a $F CYP76N3 chromosome 10 clone OSJNBa0090A14 gene 1

AC074355.2 23873-18195 clone OSJNBa0071I20, gene 2 39% to 76C2

21949 MAFFHLCISSLLLVFIISYIFQPLLDARRRFPPGPHRLPVISNLHNIGKNPHHAFARLADRYGPLM 21752

21751 SIRLGGVRAVVAMSADAAREILQRNNADITGRGGMDSWHACGHHANSSIALWPRWKWCA 21575

21574 MRMLCTEELL 21545

5536 GVTHAMREEVARELAHRVSDGSAGGMPVSVAREAFAAVAGVLWWSMFSEDMDAATTRQLRDVIE 5345

5344 EAVVVAGAPNLSDYFPVIAAADVMGVRRRMDNLVGWVYGIIDVQIDRRRRRRIVCEPRKN 5165

5164 DLLDVAFDMEGEVESEGWVMNQDTMRGM 5081

4770 AIYQDLLVAGSGSTSSTIEWAMAELLQNPKSMIQLPEELKGLMGTKTHVAESDISQLPYL 4591

4590 QAVIKETLRLHPTVPIAFNKAEATVEIQGYKIPQGTTVYVNIWAICRRAKIWDDLDKFM 4414

4413 PYRFLGRDINFLGTNFEFIPFGAGRRICLGMPLAEGML 4300

4291 HLMLASLLHRFEWTIPDEVKGDDLDMAEEFGLVLSMAKPLRAVAKET* 4148

 

#274

>aaaa01049788.1 CYP76N3 (indica cultivar-group) orth AC074054.1a $F chr 10 98%

see aaaa01010283.1 for ortholog

5   TIEWAMAELLQNPKSMIQLPEELKGLIGTKTHVAESDISQLPYLQAVIKETLRLHPTV 178

179 PIAFNKAEATVEIQG*KIPQGTTVYVNIWAICRRAKIWDDLDKFMPYRFLGRDINFLGT 355

356 NFEFIPFGAGRRICLGM 406

 

#464

>aaaa01082499.1 CYP76N4P (indica cultivar-group) 52% to aaaa01007897.1 76N2 $FI

637 DLFVGATDRSSNTIEWALAELLQNSQTMRRPREELRAVISSKSRFDVSIQNINDQK 470

469 F*AVLKETLRLHSVVPMLSSKAEAXX 398 FRAMESHIFT

390 TVHGYTVPEGNNVNVLVNVWAIHDNA

 

no japonica ortholog found 9/12/02

 

#344

>aaaa01014488.1 CYP76P1 (indica cultivar-group) orth AC074105.1 $F chr 10 99%

14 WAMAELLQNPQTMTKLQEELKKVIGSKTCIDEEDIDQLPYLQAVIKETHRLHPAIPLL 187

188 MYKAAVPVEIQGYKIPKETTVIVNTWAIHQNSEVWIEPDKFIPERFLQKEISLSSGSTN 364

365 IELIPFSAGRRFCLGYPVANRMLHVMLASLVHQFQWTLPEVVKKNGGVDMAEKFGITLSM 544

545 ATPLHAI 565

 

>AC074105.1 $F CYP76P1 chr 10 clone OSJNBa0030B02 19 unordered pieces 39% to 76F2

40% to 76C2

60125 MAIFIGCICS

60095 LALLLLCSHVFQLLSDARRRLPPGPRPLPVIGNLLDVAGELPHRSLACVAERYGPL 59928

59927 VTLRLGTMLAVVASSPATARDVLHRHGASITDRGTPDAWSTDGHDGNSIFAFPTRHHRWR 59748

59747 ALRRLGAEQLFSPRRVEEQRPLRRDAVRGLLRHVAELAAASGGGGAAVVDVGRAAFAAMA 59568

59567 SLLFGALFSAGIDAATSCRFRDAAREFALLTMTPNVSEFFPVVAMADLQGL 59415

59414 RRRTARHITWMYQLIDGHVERRMRGRETAGGCGAAHGEKEKDLLDVMLDMSEKEEQNDD 59238

59237 SSLTMN 59220

57988 DLLMAGSETSSAVIEWAMAELLQNPQTMTKLQEELKKVIGSKTCIDEEDIDQLPYLQAVI 57809

57808 KETHRLHPAIPLLMYKAAVPVEIQGYKIPKETTVIVNTWAIHQNSEVWIEPDKFIPERF 57632

57631 LQKEISLSSGSTNMELIPFSAGRRFCLGYPVANRMLHVMLASLVHQFQWTLPEVVKKN 57458

57457 GGVDMAEKFGITLSMATPLHAIAKNIV* 57374

 

#344

>aaaa01014797.1 CYP76P1 (indica cultivar-group) orth AC074105.1 $F chr 10 99%

see aaaa01014488.1 above

1978 DARRRLPPGPRPLPVIGNLLDVAGELPHRSLACVAERYGPLVTLRLGTMLAVVASSPATA 1799

1798 RDVLHRHGASITDRGTPDAWSTDGHDSNSIFAFPTRHHRWRALRRLGAEQLFSPRRVEEQ 1619

1618 RPLRRDAVRGLLRHVAELAAASGGGGAAVVDVGRAAFAAMASLLFGALFSAGIDAATSCR 1439

1438 FRDAAREFALLTMTPNVSEFFPVVAMADLQGLRRRTARHITWMYQLIDGH 1289

1288 VERRMRGRETPGGCGAAHGEKEKDLLDVMLDMSEKEEQNDDSSLTMNRGVIRAFMAVS 1115

1114 ICILYISVFYQIIQFTFN 1061

 

#265

>aaaa01010028.1 CYP76P2 (indica cultivar-group) ortholog to AL831811.1 99%

3898 DARRPLPPGPRPLPVIGNLLDVAGELPHRSLSRVAQRYGPLVTLRLGTTLAVVASSPATA 4077

4078 REVLHRHGASITDRGTPDAWRTDGHETNSIFALPTRHHRWRALRRLGAEQLFSPRRVEKQ 4257

4258 RPLRRDAVRGLLRHVSELAAASGGGTGTAVVDVGRAAFAAMASLLFGSLFSVGIDAATSC 4437

4438 RFRDAAREFALLTLTPNVSEFFPVVAMADLQGLRRRTARHITWMYQLIDG 4587

4588 HVERRMRGRETAGAHGEKEKDLLDVMLDISEKQEQNDDSLTINRGVIR 4731

6586 DLLTAGSETSSAVIEWAMAELLQNPQTMRKLQEELKKVIGSKTYIDEEDIDQLPYLQ 6759

6760 AVIKETHRLHPAIPLLMYKAAVPVEIQGYKIPKETTVVVNTWAIHQNSEVWIEPDKFIP 6936

6937 ERFLQKEISLSSGSTNMELVPFSAGRRFCLGYPVANRMLHLMLGSLVHQFQWTLPEVVKK 7116

7117 NGGVDMAEKFGLTLSMATPLHAI 7185

 

>AL831811.1 $F CYP76P2 chr 12

      MAIFIGCICSLALLLLCSHVFQLLS

31323 DARRPLPPGPRPLPVIGNLLDVAGELPHRSLSRVAQRYGPLVTLRLGTTLAVVASSPATA 31144

31143 REVLHRHGASITDRGTPDAWRTDGHETNSIFALPTRHHRWRALRRLGAEQLFSPRRVEKQ 30964

30963 RPLRRDAVRGLLRHVSELAAASGGGTGTAVVDVGRAAFAAMANLLFGSLFSVGIDAATSC 30784

30783 RFRDAAREFALLTLTPNVSEFFPVVAMADLQGLRRRTARHITWMYQLIDGHVERRMRGRE 30604

30603 TAGALGEKEKDLLDVMLDISEKQEQSDDSLTINRGVIR 30490

28669 DLLTAGSETSSAVIEWAMAELQQNPQTMRKLQEELKKVIGSKTYIDEEDINQLPYLQAV 28493

28492 IKETHRLHPAIPLLMYKAAVPVEIQGYKIPKETTVVVNTWAIHQNSEVWIEPDKFIPERF 28313

28312 LQKEISLSSGSTNMELVPFSAGRRFCLGYPVANRMLHLMLGSLVHQFQWTLPEVVKKNGG 28133

28132 VDMAEKFGLTLSMATPLHAIAKNIV* 28055

 

#61

>aaaa01001621.1a $FI CYP76P3 (indica cultivar-group) ortholog of AC092749.1b >99%

9137 MVFLLVCMCSLLLMFLISYALQLFGDARRRLPPGPTPLPLIGNLLDIASDLPHRSLARLA 9316

9317 GRHGPLMAVRLGTVVAVVASSPSTAREVLQTHNGSLTGRVPPDAWHGVGHAANSVFVLPP 9496

9497 RRKWRALRRIGAEHLLSARQLDGRRLLPLLRDAVLDLLRRVSEMSAASGGGAGAPVQVGH 9676

9677 AAFAAMMDMQWRAMFSAGLEDDDARVLQDAAREAVALSLKPNLSDFYPALAAVDLQGLRR 9856

9857 RFAGRVGTVYHLVDEQIERRMRRRREAAGDDGEARSEDDLLDVLLDMSEHGKDDGKVAID 10036

10037 RDLIRTFLT 10063 (0?)

12305 DIFLATVDTIASTLEWAMAELLQDRETMRKLQEELKKVLGSKTHAEYADMDRLPYLRAVI 12484

12485 KETLRLHPVVPIVPNVAEEMVEIHGHVVPRGSTILVNLWAVHRDAEAWPEPNRFLPERFM 12664

12665 LRQHGQEAAGRALGTATTEFELIPFSAGRRVCLGLPLATRMLHAMLGSLLHRFEWTLPLE 12844

12845 VKENGVDMSENLGLTMTMATPLQAIAKSI 12931

 

>AC092749.1b $F CYP76P3 clone OSJNBb0023M11, from chromosome 10, complete sequence

35% to 76C3 53% to AC074105.1

71148 MVFLLVCMCSLLLMFLISYALQLFGDARRRLPPGPTPLPLIGNLLDIASDLPHRSLA 70978

70977 RLAGRHGPLMAVRLGTVVAVVASSSSTAREVLQTHNGSLTGRVPPDAWHGVGHAANSVFV 70798

70797 LPPRRKWRALRRIGAEHLLSARQLDGRRLLPLLRDAVLDLLRRVSEMSAASGGGAGAPVQ 70618

70617 VGHAAFAAMMDMQWRAMFSAGLEDDDARVLQDAAREAVALSLKPNLSDFYPALAAV 70450

70449 DLQGLRRRFAGRVGTVYHLVDEQIERRMRRRREAAGDDGEARSEDDLLDVLL 70294

70293 DMSEHGKDDGKVAIDRDLIRTFLT (0)

67984 DIFLATVDTIASTLEWAMAELLQDRETMRKLQEELKKVLGSKTHAEYADMDRLPYLRAVIKETLRLH 67784

67783 PVVPIVPNVAEEMVEIHGHVVPRGSTILVNLWAVHRDAEAWPEPNRFLPERFMLRQHGQ 67607

67606 EAAGRALGTATTEFGLIPFSAGRRVCLGLPLATRMLHAMLGSLLHRFEWTLPLEVEENGV 67427

67426 DMSENLGLTMTMATPLQAIAKSI* 67355

 

#62

>aaaa01001621.1b $PI CYP76P4P (indica cultivar-group) ortholog of AC092749.1a $P 1 aa diff

16168 LMAVRLGTVVSTVASSPSTARQILQTHNGSLTGRV

      AWHGVGVGHAANSVFVLPPRRKWRALRRIGAEHLLSTRQLDGACMPLLR 16414 frameshift

16416 SEMSAASGGAPVQVGHAAHAVFAAMVDMQWRAMFSAG 16526

 

>AC092749.1a $P CYP76P4P clone OSJNBb0023M11, from chromosome 10, complete sequence

probable pseudogene fragments aa 66-198 upstream of complete gene

64120 LMAVRLGTVVSIVASSPSTARQILQTHNGSLTGRV 64006

64019 AWHGVGVGHAANSVFVLPPRRKWRALRRIGAEHLLSTRQLDGACMPLLR 63874

63872 SEMSAASGGAPVQVGHAAHAVFAAMVDMQWRAMFSAG 63762
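
The "1 aa diff" note for this ortholog pair can be reproduced by comparing the fragments position by position. A minimal sketch follows. The function name count_diffs is hypothetical, and the sketch assumes the two fragments are already aligned and of equal length (no gaps), which holds for the first fragment of each entry above.

```python
def count_diffs(a, b):
    """Return (1-based position, residue in a, residue in b) for each mismatch.

    Assumes a and b are pre-aligned and gap-free, so equal length is required."""
    if len(a) != len(b):
        raise ValueError("sequences must be the same length for a direct comparison")
    return [(i + 1, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]

# First fragment of aaaa01001621.1b (indica) and of AC092749.1a (japonica).
indica = "LMAVRLGTVVSTVASSPSTARQILQTHNGSLTGRV"
japonica = "LMAVRLGTVVSIVASSPSTARQILQTHNGSLTGRV"
diffs = count_diffs(indica, japonica)   # [(12, 'T', 'I')]
```

The other two fragments of the pair are identical as listed, so the single T/I change at residue 12 of the first fragment accounts for the one annotated difference.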

 

#185

>aaaa01005754.1 CYP76P5 (indica cultivar-group) orth AC074054.1b $P chr 10 99%

3717 FFLPLAFSLFLAVISAYVLQLLADARRRLPPGPWPLPLIGNLHQLDHLPHRSLARLAARH 3538

3537 GPLMSLRLGTVRAVVASSPEMAREVLQRHNADIAARSFGDSMRAGGHCENSVVCLPPRLR 3358

3357 WRALRRLSTVGLFSPRRLKKNFYIRVLSDLKLKTAEK*TSIKNPKIISK 3211

3210 FKVEIQNLTDKYKHKRKDNTLVIMHD 3133

2979 GGETTSHTMECAMAELLQCPNSMRRVQEELKSVIGTKQQMDEHDITKLPYLQAVIKET 2806

2805 LRLHPPVPLPPYEAEATVEIQGYTIPKGAKVLINLWAINRCANAWTEPDKFMPERFYDS 2629

2628 DITFMGRDFQLIPFGAGRRICLGLPLAHRMVHLMLGSLLHRFTWTLPAEAGKNGVDMRER 2449

2448 FGLTLSFVVPLYVI 2407

 

>AC074054.1b $P CYP76P5 chromosome 10 clone OSJNBa0090A14 gene 2 41% to 76F2

42% to 76C4

same sequence as AC074355 33336 34721-32944 gene 3

56068 MAFFLPLAFSLFLAVISAYVLQLLADARRRL

      PPGPWPLPLIGNLHQLDHLPHRSLARLAARHGPLMSLRLGTVRAVVASSPEMAREVLQRH 55796

55795 NADIAARSFGDSMRAGGHCENSVVCLPPRRRWRALRRLSTVGLFSPRRLDAMRALLEEKV 55616

55615 AELVRRVSGHAARGEAVDVGHAAHVAALGVLSRTMFSVDLDPEAAREVSDIVDEASVL 55442

55441 GTGPNVSDFFPAIAPADLQGVRRRMARLVKRMYAIIDEQIERRMHGRTAGEPRKNDLL 55268

55267 DVMLEEGESKEDSNEINRDAIRGL 55196

54837 DLFTGGETTSHTMECAMAELLQCPNSMRRVHELKSVIGSKQQMDEHDITKLPY 54679

      (16 aa deletion and frameshift missing the K-helix)

54682 LPPYEAEATIEIQGYTIPKGAKVLINLWAINRCANTWTEPDKFMPERFYDSDITFMGRDF 54503

54502 QLIPFGAGKRICLGLPLAHRMVHLMLGSLLHRFTWTLPAEAGKNGVDMRERFGLTLSFVAPLYVIAQEIQ*

54290

 

#219

>aaaa01007399.1 CYP76P6 (indica cultivar-group) orth AC105746.1 $F chr 10 >99%

6641 SFLLVSIMLSLLLVLFSHLLQRIAAARRRLPPGPCPLPLIGNLLDIGDLPHRSFARLAER 6820

6821 YGPLMTVRLGAATCVVASSPATARAVLQTHNASLAGRGRQDAWHAGGHAENSVFVLPPGR 7000

7001 KWRLLRKLGAAHLFSRRKLAELAPLRDEIVGGLLRRVAERADHRGGAPVNVGRLALAANV 7180

7181 ELLWRSVFSTRLDAATLDVLCDVAREAAVLLGTPNVSDFFPAVAALDLQGLRRR 7342

7343 LAELMKNTYRLVDAQIDHRMGCRELRGGRGGEAMDLLDVLLDMSEQERED 7492

8017 DLFVGGSDSTATTVEWAMAELLQNPEIMKTLQQEIKMVLGTRSQVEESDIGQLPYLQ 8190

8191 AIVKETLRLHPIVPLRLYEAERTVEIEGHTIPKGSKVIVNAWAIHQSVKVWIQPEKFLP 8367

8368 ERFITKDIDFAGRHFEFIPFGSGRHICIGLPLANRMLHMILGSLMHQFKWTMPQMVNRNG 8547

8548 LDMAE 8562

 

>AC105746.1 $F CYP76P6 chromosome 10 clone OSJNBb0086I08, Length = 123484

55% to AC074105.1 chr 10 40% to 76C2

1-115

      MSAFCWSATRSPSRQRLIENGFLFACKHHALTPARPLLPPLQ

31481 RIAAARRRLPPGPCPLPLIGNLLDIGXLPH

      RSFARLAERYGPLMTVRLGAATCVVASSPATARAV 31288

31287 LQTHNASLAGRGRQDAWHAGG

116-306

102155 HAENSVFVLPPGRKWRLLRKLGAAHLFSRRKLAELAPLRDEIVGGLLRRVAERADHRGG 101979

101978 APVNVGRLALAANVELLWRSVFSTRLDAATLDVLCDVAREAAVLLGTPNVSDFFPA 101811

101810 VAALDLQGLRRRLAELMKNTYRLVDAQIDHRMRCRELRGGRGGEAMDLLDVLLDMSE 101640

101639 QEREDGDDEVINR

307-504

101097 DLFVGGSDSTATTVEWAMAELLQNPEIMKTLQQEIKMVLGTRSQVEESDIGQL 100939

100938 PYLQAIVKETLRLHPIVPLRLYEAERTVEIEGHTIPKGSKVIVNAWAIHQSVKVWIQP 100765

100764 EKFLPKRFITKDIDFAGRHFEFIPFGSGRHICIGLPLANRMLHMILGSLMHQF 100606

100605 KWTMPQMVNRNGLDMAEKFGLAVSMATRPNIIA 100507

 

#100

>aaaa01002827.1 $FI CYP76Q1 (indica cultivar-group) one frameshift

12880 MAFFLVACLPWVCFILLSLYVFQLFADARRRLPPGPWPPKPLIGDLLALGKGDQQ 12716

12715 HRSLARLADRYGPVMSLRLGTVLTVVVSTPDAMREIFHKNKDNLAGRPTADAFNAMGHSA 12536

12535 NSLLGLEHPGVRWRAIRRFSTAELLAPRRLAALQPLCRDKVRGLVRGVSELAARGEPVHV 12356

12355 RRVALDMALSLI (fs) 12320

12322 LSAIYSVDLDPESTAVFRSVVEEAMLLIGTANLSDLFPAIAALDLQGVRRRVA

      ELFTITYRQYDEQVARRRPERDAGEAGKNDLLNVVLDMEREWQQKGSVLSHDAMRVLFT (0)

8887  DLYGAGASTTSVLIEWAIADLLQNPESMRKIKEEITNVIGTNAQIQESDIARLPYLQAVV 8708

8707  KETLRLRAVAPLVPRRAEATIEVQGFTIPKGTNVILNLWAINRDARAWNDPDKFMPERF 8531

8530  IGNDINYLGQNFQFVPFGVGRRICLGLPLAQKVMYLVLGTLVHQFEWTLPEELKDTGIDM 8351

8350  TEKCGMVLCLANPLKVMAKKM* 8285

 

aaaa01002827.1 no ortholog in nr or HTGS on 9/5/02

 

#294

>aaaa01011593.1 CYP76Q2 (indica cultivar-group) orth AP003522.1 $F chr 6 100%

2115 IDQDLYGAGASTTAALIEWGMVDLIQNPEVMTKVREELTNVLGDKLVMDESDIARLPY 2288

2289 LQAVVKETLRLRTVVPLVPRKAEVDIEVNGYRIPKGTNVILNAWAINRSADAWSEPDKF 2465

2466 IPERFLGGETRGYLGQDFEMIPFGLGRRICPGMPLAQKLIPLIIGTLLHRFEWELPADAK 2645

2646 EGGIDMTEKCGVVLSLVNPLKAIP 2717

 

>AP003522.1 $F CYP76Q2 chromosome 6 clone P0036B02, 37% to 76C2

AP004726.1 chromosome 6 clone P0486G02

AU096446.1 Rice green shoot cDNA clone S13713.Length = 477 49% to 76C2

129851 MAASSFLVECLSWLVVVLFSLYIFQLLRDARRRLPPGPWPPKPLVGDLLDLGEDGKQHRTFLRLAGR 130051

130052 YGGLMCLRFGMVPHVIVSTPDALRAVFAAAGAGGGGGGEGKKVDGIAGLPSLDVLSAMGH 130231

130232 RAHTIFALPSQDGKWRALRKFAAAEMLAPRRISSAAAGAQLQTKIVEALRREVSGHAARG 130411

130412 AAVVFRHAVLDSILSLLLGVLYSTDLEREERAMFRDLIEEIVGMLGTANVSDVFP 130576

130577 PVAALDLQGLRRRMTDLLTIMYRHFDDQVALRRRSRDAGEARKNDVLDTVLDKEES 130744

130745 EWKQEGSLLSHDVMRVLLS 130801 (0)

131025 DLYGAGASTTAALIEWGMVDLIQNPEVMTKVREELTNVLGDKLVMDESDIARLPYLQ 131195

131196 AVVKETLRLRTVVPLVPRKAEVDIEVNGYRIPKGTNVILNAWAINRSADAWSEPDKFIP 131372

131373 ERFLGGETRGYLGQDFEMIPFGLGRRICPGMPLAQKLIPLIIGTLLHRFEWELPADAK 131546

131547 EGGIDMTEKCGVVLSLVNPLKAIPKEI* 131630

 

#294

>aaaa01063083.1 CYP76Q2 (indica cultivar-group) orth AP003522.1 $F chr 6 96%

see aaaa01011593.1 for ortholog

300 LYGAGASTTAALIEWGMVDLIQNPEVMTKVREELTNVLGDKLVMDESDIARLPYLQAV 127

126 VKETLRLRTVVPLVPRKAELDIEVNGYRFPRG 31

 

#396

>aaaa01021406.1 CYP77A9 (indica cultivar-group) ortholog of AL731641.1 AU070007

(1 diff) runs off end 53% to 77A4

AAAA01011465.1 (indica cultivar-group) Cterm of CYP77A

2975 MQPTWLATMPAASSLVVGVAFTAAVAVAVAAAVARRAWRHRGLRLPPGPPSWPVVGNLLQ 3154

3155 VVFAGKPFIHYIRDLRREYGPIVKLQMGVRTLVVISSAELVHEALVEKGREFATRPAESP 3334

3335 IRSIFSSGKFTVNSAVYGPEWRSLRRNMVSGMLSAARLREFRPARLRAMERFVARVRAEA 3514

3515 AASRDGASVWVLRNVRFAMFCVLLDMTFGLLDLDEELVVRVDAVMKRVVLAVAARIDDYL 3694

3695 PFLRPFLWRQHRQAVALRREQIDTVLPLINRRRAIVRGMRAGSPPDPAVAAPY 3853

aa 294-338 missing

8908 LARVMDNPSIQARLHGEIMQRVGDARPVDDRDTDGMPYLQAFVKELLRKHPPTYFALSHA 8729

8728 AVEPGSKLAGYDVPVDANLDIFLPTISEDPKLWERPTEFDPDRFLAGGETADITGSAGVR 8549

8548 MIPFSAGRRICPGVGMGTAHIALMVARMVQAFEWRAHPSQPPLDFEDKVEFTVVMKRPLL 8369

8368 AMVTPRKLSF* 8336

 

>AL731641.1 $F CYP77A9 chromosome 4 clone OSJNBa0042I15 = AU070007

C19435 same clone as C99199 60% with 77A4 297-428 region

C-term of wheat  = NQPLLATVKPRKISL* from BE518078.1 this seq not found yet in rice

C-term of barley = NQPLLATVKPRKISL* from BG416410.1 this seq not found yet in rice

42403 MQPTWLATMPAASSLVVGVAFTAAVAVAVAAAVARRAWRHRGLRLPPGPPGWPVVGNLLQ 42582

42583 VVFAGKPFIHYIRDLRREYGPIVKLQMGVRTLVVISSAELVHEALVEKGREFATRPAESP 42762

42763 IRSIFSSGKFTVNSAVYGPEWRSLRRNMVSGMLSAARLREFRPARLRAMERFVARVRAEA 42942

42943 AASRDGASVWVLRNVRFAMFCVLLDMTFGLLDLDEELVVRVDAVMKRVVLAVAARIDDYL 43122

43123 PFLRPFLWRQHRQAVALRREQIDTVLPLINRRRAIVRGMRAGSPPDPAVAAPYSYLDSLL 43302

43303 DLRVEGRDAVPTDEELVTLCAEMINGGTDTTATAIEWAMARVMDNPSIQARLHGEIMQRV 43482

43483 GDARPVDDRDTEGMPYLKAFVKELLRKHPPTYFALSHAAVEPGSKLAGYDVPVDANLDIF 43662

43663 LPTISEDPKLWERPTEFDPDRFLAGGETADITGSAGVRMIPFSAGRRICPGVGMGTAHIA 43842

43843 LMVARMVQAFEWRAHPSQPPLDFEDKVEFTVVMKRPLLAMVTPRKLSF* 43989

 

#126

>aaaa01003438.1 $FI CYP77B2 (indica cultivar-group) ortholog to AP003723.1 100% Cterminal part

AAAA01022256.1 (indica cultivar-group) 100% Nterminal part

1463 MVDMNDVLLVVSAAVLAAMWWRRCSRTGGADGLPPGPPGWPVVGNLFQVILQRRPFMYVV 1642

1643 RDLREKYGPIFTMRMGQRTLIVVTDADLIHDALVKQGAAFASRPEDSPTRLLFSVGKCTV 1822

1823 NSAPYGPLWRALRRNFVAEIVSPPRVKGFSWIREWAVGSHLRRLRAEFAATGAVRMMANC 2002

2003 RLSICSILICICFGAKIPDELIREIEEVLKDVMMMTMPKLPDFLPLLTPLFTKQLAAARE 2182

2183 LRRRQLGCLAPLVRARREFIRGGGERNADGNTVVGGVEMVSAPGEAYVDSLFDLEPPGRG 2362

2363 KRLGEDELVTLCSEVMSAGTDTSATALEWAMMHLVLDAGVQDKLYGEVVSKVGTTARITE 2542

2543 ADVEAMPYLQ 2572

 463 AVVKETFRRHPPSHFVLSHAATRDTELGGYRVPADASVEFYTAWVTENPATWPDPEA 633

 634 WRPERFLEGGEGFDTDITATRALRMMPFGAGRRICPAATLGVLHIQLMLANMVREFRWVP 813

 814 PAGEGPPDPTETFAFTVVMKNPLRAALVERRVGGELATGGGGGAAASA 957

 

>AP003723.1 $F CYP77B2 chromosome 6 clone P0003H08 60% to CYP77B1

AU068699.1 Rice callus cDNA clone C50079_1A.Length = 330 64% to 77B1

AA753883, D41585 AA753996 64% to 77B1 11/94 268-350 region

127597 MVDMNDVLLVVSAAVLAAMWWRRCSRTGGADG

127693 LPPGPPGWPVVGNLFQVILQRRPFMYVVRDLREKYGPIFTMRMGQRTLIVVTDADLIHD 127869

127870 ALVKQGAAFASRPEDSPTRLLFSVGKCTVNSAPYGPLWRALRRNFVAEIVSPPRVKGFSW 128049

128050 IREWAVGSHLRRLRAEFAATGAVRMMANCRLSICSILICICFGAKIPDELIREIEEVLKD 128229

128230 VMMMTMPKLPDFLPLLTPLFTKQLAAARELRRRQLGCLAPLVRARREFIRGGGERNA 128400

128401 DGNTVVGGVEMVSAPGEAYVDSLFDLEPPGRGKRLGEDELVTLCSEVMSAGTDTSATAL 128577

128578 EWAMMHLVLDAGVQDKLYGEVVSKVGTTARITEADVEAMPYLQ 128706 (0)

129997 AVVKETFRRHPPSHFVLSHAATRDTELGGYRVPADASVEFYTAWVTENPATWPDP 130161

130162 EAWRPERFLEGGEGFDTDITATRALRMMPFGAGRRICPAATLGVLHIQLMLANMVREFRW 130341

130342 VPPAGEGPPDPTETFAFTVVMKNPLRAALVERRVGGELATGGGGGAAASA* 130494

 

#126

>aaaa01092197.1 CYP77B2 (indica cultivar-group) orth AP003723.1 $F chr 6 100%

see aaaa01003438.1 for ortholog

585 AWVTENPATWPDPEAWRPERFLEGGEGFDTDITATRALRMMPFGAGRRICPAATLGVLH 409

408 IQLMLANMVREFRWVPPAGEGPPDPTETFAFTVVMKNPL 292

 

#345

>aaaa01014577.1 CYP78A11 (indica cultivar-group) orth AC083943.6 $F 100%

2273 QEMIFRGTDTTALVTEWCMAEVVRNPAVQARLRAEVDAAVGGDGCPSDGDVARMPYLQ 2100

2099 AVVKETLRAHPPGPLLSWARLATADVGLANGMVVPAGTTAMVNMWAITHDGEVWADPEAF 1920

1919 APERFIPSEGGADVDVRGGDLRLAPFGAGRRVCPGKNLGLATVTLWVARLVHAFDWFLPD 1740

1739 GSPPVSLDEVLKLSLEMKTPL 1677

 

>AC083943.6 $F CYP78A11 clone OSJNBa0044A10, 61% to 78A7

orth of aaaa01014577.1

40753 MAMATATASSCVDATWWAYALPALLGADTLCAHPALLAGAVLLAFATAAVLAWAASPGGPAWAHGRGRLGA 40965

40966 TPIEGPRGLPVFGSIFALSRGLPHRALDAMSRDAAAPRARELMAFSVGETPAVVSSCPAT 41145

41146 AREVLAHPSFADRPLKRSARELLFARAIGFAPSGEYWRLLRRIASTHLFSPRRVAAHEPG 41325

41326 RQADATAMLSAMAAEQSATGAVVLRPHLQAAALNNIMGSVFGRRYDVSSSSGAAADEAEQ 41505

41506 LKSMVREGFELLGAFNWSDHLPWLAHLYDPNHVARRCAALVPRVQAFVRGVIRDHRLRRD 41685

41686 SSSTAADNADFVDVLLSLEAHENLAEDDMVAVLW 41787 (0)

41874 EMIFRGTDTTALVTEWCMAEVVRNPAVQARLRAEVDAAVGGDGCPSDGDVARMP 42035

42036 YLQAVVKETLRAHPPGPLLSWARLATADVGLANGMVVPAGTTAMVNMWAITHDGEVWADP 42215

42216 EAFAPERFIPSEGGADVDVRGGDLRLAPFGAGRRVCPGKNLGLATVTLWVARLVHAFDWF 42395

42396 LPDGSPPVSLDEVLKLSLEMKTPLAAAATPRRRRAA* 42506

 

#151

>aaaa01004390.1 $PI CYP78B4 (indica cultivar-group) Nterm missing and one frameshift

abrupt break in seq compared to 78B5, possible pseudogene, no ortholog

12657 SDRPVKDAARGLLFHRAMG 12713

12714 FAPSGDYWRALRRVSANHLFTPRRVAASAPRRLAIGERMLDRLSALAGGEIGM 12872

12873 RRVLHAASLDHVMDTVFGTRYDGDSQEGAELEAMVKEGYDLLGMFNWGDHLPLLKWL 13043

13044 DLQGVRRRCRTLVQRVDVFVRSIIDEHRQRKRRTGGNGGGEELPGDFVDVLLGLQGEEKM 13223

13224 TESDMVAVLW (0?)

      EMIFRGTDTVAILLEWI 13403

13404 MARMVLHPDIQAKAQAELDAVVGRERAVSDGDVAGLRYLQCVVKEALRVHPPGPLL 13571

      SWARLAVRDAHVGGHA frameshift

13622 VPAGTTAMVNMWAIAHDPELWPEPDEFRPERFAEEDVSVLGGDLRLAPFGAGRRACPGKT 13801

13802 LALATVHLWLAQLLHRFEWAPVGGGVHLLERLNMSLEMEKPLVCKAKPRW* 13954

 

aaaa01004390.1 no ortholog found in japonica 9/6/02

 

#130

>aaaa01003568.1 $FI CYP78B5 (indica cultivar-group) orth of AP005175.1

15269 MALSSMAAAQESSLLLFLLPTSAASVFPPLISVVVLAALLLWLSPGGP 15412

15413 AWALSRCRGTPPPPGVAGGAASALSGPAAHRVLAGMSRAVEGGAAVMSLSVGL 15571

15572 TRLVVASRPETAREILVSPAFG DRPVKDAARQLLFHRAMGFAPSGDAHWRGLRRVSAAHL 15751

15752 FGPRRVAGSAPEREAIGARIVGDVASLMSRRGEVPLRRVLHAASLDHVMATVFGKRHGDL 15931

15932 SIQDGELLEEMVTEGYDLLGKFNWADHLPLLRWLDLQGIRRRCNRLVQKVEVF 16090

16091 VGKIIQEHKAKRAAGGVAVADGVLGDFVDVLLDLQGEEKISDSDMIAVLW

16332 EMIFRGTDTVAILMEWVMARMVMHPEIQAKAQAEVDAAVGGRRGGVADGDVASLPYIQ 16505

16506 SIVKETLRMHPPGPLLSWARLAVHDARVGGHAVPAGTTAMVNMWAIAHDAAVWPEPEAF 16682

16683 RPERFSEGEDVGVLGGDLRLAPFGAGRRVCPGRMLALATAHLWLAQLLHAFDWSP 16847

16848 TAAGVDLSERLGMSLEMAAPLVCKAVAR 16931

 

>AP005175.1 $F CYP78B5 (japonica cultivar-group) chr 7

116502 MALSSMAAAQESSLLLFLLPTSAASVFPPLISVVVLAALLLWLSPGGPAWALSRCRGTPP 116323

116322 PPGVAGGAASALSGPAAHRVLAGISRAVEGGAAVMSLSVGLTRLVVASRPETAREILVSP 116143

116142 AFGDRPVKDAARQLLFHRAMGFAPSGDAHWRGLRRASAAHLFGPRRVAGSAPEREAIGAR 115963

115962 IVGDVASLMSRRGEVPLRRVLHAASLGHVMATVFGKRHGDISIQDGELLEEMVTEGYDLL 115783

115782 GKFNWADHLPLLRWLDLQGIRRRCNRLVQKVEVFVGKIIQEHKAKRAAGGVAVADGVLGD 115603

115602 FVDVLLDLQGEEKMSDSDMIAVLW (0?)

115439 EMIFRGTDTVAILMEWVMARMVMHPEIQAKAQAEVDAAVGGRRGRVADGDVASLPYIQ 115266

115265 SIVKETLRMHPPGPLLSWARLAVHDARVGGHAVPAGTTAMVNMWAIAHDAAVWPEPDAFR 115086

115085 PERFSEGEDVGVLGGDLRLAPFGAGRRVCPGRMLALATAHLWLAQLLHAFDWSPTAAGVD 114906

114905 LSERLGMSLEMAAPLVCKAVAR 114840

 

#398

>aaaa01022265.1 CYP78B6 (indica cultivar-group) orth AC097276.6 chr 3 99%

see aaaa01023161.1 for ortholog

3584 GTDTVAILLEWVLARMALHPDVQAKAQAEIDAAAVSGDAAALPYLHCVVKEC 3429

3428 LRMHPPGPLLSWARLATRDAHLDLGADAGGRAALVPAGTTAVVNMWAIARDGGLWRDPGV 3249

3248 FRPERFLGDGEAAGVGVAGGAGGYDLRLAPFGAGRRACPGRALAMATVHLWLAQLLRSFR 3069

3068 WVPSGDRGVDMSERLGMSLEMEKPL 2994

 

#398

>aaaa01023161.1 CYP78B6 (indica cultivar-group) ortholog of AC097276.6

AAAA01022265.1 (indica cultivar-group) Cterminal missing 5 aa

2560 MVSFSYLDSTFLPLLATTMASPLHACLLLALLFLALAFFHPGGVAWALSSSGGHGAAAIP 2381

2380 GPRGVLLAFAGPNPHRALASLAASTRGATRLMAFSVGLTQFVVASHPDTAREILAGAAFA 2201

2200 DRPVKEAAAELMFHRAMGFAPHGGYWRRLRRLASAHALAPGRLAARRRAIGEETV 2036

2035 RRVAAAMARDGAVGVSRLLHLASLDNVMASVFGVGLGELGAGAVSELEEMVGQGYELLGT 1856

1855 FNWGDHLPLLRLLDVHGVRRKSRALASRVKVFVSKIIEEHQTRRDAKYGGCDGDGDFVDA 1676

1675 LLGLEGEERLEEEDMVAVLW (0) 1616

3584 XXXXXGTDTVAILLEWVLARMALHPDVQAKAQAEIDAAAVSGDAAALPYLHCVVKECLRMHPPGP 3405

3404 LLSWARLATRDAHLDLGADAGGRAALVPAGTTAVVNMWAIARDGGLWRDPGVFRPERFLG 3225

3224 DGEAAGVGVAGGAGGYDLRLAPFGAGRRACPGRALAMATVHLWLAQLLRSFRWVPSGDRG 3045

3044 VDMSERLGMSLEMEKPLICLALPRTSST* 2958

 

>AC097276.6 CYP78B6 chr 3 clone OJ1519_A12 38% to AC073867 57% to 78A2

C-terminal from version 1 of accession (absent in version 6)

Gap not filled in any section of GenBank

Ortholog of aaaa01023161.1 98% (6 diffs) lower case

44847 MVSFYLDSTFLPLLATTMASPLHACLLLALLFLALAFFHPGGVAWALSSSGGHGAAAIPG 45026

45027 PRGVLLAFAGPNPHRALASLAASTRGATRLMAFSVGLTQFVVASHPDTAREILAGAAFAD 45206

45207 RPVNEA (32 aa gap in seq aaelmfhramgfaphggywrrlrrlasahala)  PG 45386

45387 RLAARRRAIGEEPVRRVAPAMARDGAVGVRRLLHLASLDNVMASVFGVGLGELGAGAVSE 45566

45567 LEEMVGQGYELLGTFNWGDHLPLLRLLDVHGVRRKSRALASRVKVFVSKIIEEHKTRRDA 45746

45747 KYGGCDGDGDFVDVLLGLEGEERLEEEDMVAVLW (0?) 45848

48030 EMIFRGTDTVAILLEWVLARMALHPDVQSKAQAEIDAAAVSGDAAALPYLHCVV 48191

48192 KECLRMHPPGPLLSWARLATRDAHLDLGADAGGRAALVPAGTTAVVNMWAIARDGGLWRD 48371

48372 PGVFRPERFLGDGEAAGVGVAGGAGGYDLRLAPFGAGRRACPGRALAMATVHLWLAQLLR 48551

48552 SFRWVPSGDRGVDMSERLGMSLEMEKPLICLALPRTSST* 48671

 

#17

>aaaa01000394.1 CYP78C5 (indica cultivar-group) >99% to AP004704.1a

13114 MDMDSSPSTQDCGGWLLYVSLAAKCGGDPCRVVGFVAVAVVAFAVTSLLHWLSPGGPAWG 13293

13294 RYWWNRRGGLGIAAAIPGPRGLPVLGSMSLMAGLAHRKLAAAAGGSPARRRLMALSLGET 13473

13474 RVVVTADPGVARELLASAAFADRPVKESAYGMLFHRAIGFAPYGTYWRALRRVASTHLFS 13653

13654 PRQVSASAAQRAVIARQMVEAMRSAAAAAAGGGVAARPFLKRASLHNVMWSVFGRKYELA 13833

13834 APESEETAELRSMVDEGYDLLGQLNWSDHLPWLAPFDLQKTRSRCSSLVPRVNRFVTRII 14013

14014 DEHRARLSLAVDA 14052 small 16aa seq gap

14189 LHGGDXLSDADXVAVLWVCT 14212

14307 EMIFRGTDTVAVLIEWVAARLVLHQDVQARVHDELDRVVGSDRAVTESDASKLVYLQ 14477

14478 AVIKEVLRLHPPGPLLSWARLATSDVHVGGFLIPSGTTAMVNMWAITHDPAVWPDPNEFK 14657

14658 PERFVAGPSSDQAAEFPIMGSDLRLAPFGSGRRSCPGKSLAIATVGFWVATLLHEFDWLP 14837

14838 LSDKSRGVDLSEVLKLSCEMATPLEARLRPRRKV 14939

 

>AP004704.1a $F CYP78C5 chromosome 8 clone P0544G09, Length = 137478

New 61% to 78A9

42285 MDMDSSPSTQDCGGWLLYVSLAAKCGGDPCRVVGFVAVAVVAFAVTSLLHWLSPGGPAWGRYWWNRRGGLGIAAAI

42057 PGPRGLPVLGSMSLMAGLAHRKLAAAAGGSPARRRLMALSLGETRVVVTADPGVARELLA 41878

41877 SAAFADRPVKESAYGMLFHRAIGFAPYGTYWRALRRVASTHLFSPRQVSASAAQRA 41710

41709 VIARQMVEAMRSAAAAAAGGGVAARPFLKRASLHNVMWSVFGRKYELAAPESEETAEL 41536

41535 RSMVDEGYDLLGQLNWSDHLPWLAPFDLKKTRSRCSSLVPRVNRFVTRIIDEHRARLSLA 41356

41355 VDAAVDFTDVLLSLHGGDKLSDADMVAVLWVCT 41257 (0)

41183 EMIFRGTDTVAVLIEWVAARLVLHQDVQARVHDELDRVVGSDRAVTESDAS 41010

41009 KLVYLQAVIKEVLRLHPPGPLLSWARLATSDVHVGGFLIPSGTTAMVNMWAITHDPAVWP 40830

40829 DPNEFKPERFVAGPSSDQATEFPIMGSDLRLAPFGSGRRSCPGKSLAIATVGFWVATLLH 40650

40649 EFDWLPLSDKSRGVDLSEVLKLSCEMATPLEARLRPRRKV* 40527

 

#370

>aaaa01017257.1 CYP78C6 (indica cultivar-group) small seq gap of about 3 aa

might be a frameshifted region before that just after VIARQMV (partialI)

cannot complete in indica or japonica seq.

     MATPEDTGSWLLYLSLAAKCSGDGDGQPHRLLGFVVVCAVAGLVTCLLHWSFPGGPAWGRW

3808 WWTRRRRRGSPCGVAAVPGLRGLPVIGSMWLMTGLAHRKLAAAAEAAGAGRLMALSLGE 3984

3985 TRVVVAAHPDVAREILHGAAFADRPVKESAYGLLFHRAIGFAPHGAYWRALRRVA 4149

4150 STHLFSPWQVAASAPQRAVIARQMVRAIKLQQRSRSGDSAAGA 4278 small seq gap

     RRASLHNVMWSVFGRRYELQLDPGKESDEVRELRALVDEGYDLLGQLN

     WSDHLPWLARFDLQSTRARCSRLVPRVNRFVTRIIDEHRSSAPVAAAI

     DFTDVLLSLQGSDKLADSDMVAVLWVRH (0)

4829 EMVFRGTDTVAVLIEWVLARLVLQQDVQARVHDELGRVVGLDRDVTESDTASLVYLH 4999

5000 AVIKETLRLHPPGPLLSWARLATSDVHVDGYLIPAGTTAMVNMWAIAHDPDVWAEPMEFR 5179

5180 PERFIGKAAEFSVMGSDLRLAPFGSGRRSCPGKSLAMATVAFWLATLLHEFALLPSPDP 5356

5357 AHGVDLSEVLRLSCEMATPLAVTAWPRRVV* 5449

 

No japonica ortholog found 9/11/02

 

#250

>aaaa01008990.1 CYP78C7 (indica cultivar-group) orth AC099399.1 $F chr 3 100%

5202 PGPKGLPVVGSLGLMSGLAHCSLAAEAARRPGAKRLMALSLGPVRAVVTSHPDVAKEIL 5378

5379 DNPAFADRPLNHAAYGLMFHRSIGFAEHGPYWRALRRVAAGHLFGPRQVDAFAPYRAR 5552

5553 VAGGVVAALRGAGGEAAVQVRGVLRRASLYYIMRFVFGKEYDVSRGAPESGEEVE 5717

5718 ELLEMVHEGYDLLGKENWCDYFP 5786

6294 QEMIFRGTDAMAVLMEWTLARVVLHPDVQANVHRELDAVVGRSNTVAESAVPSLPYLQ 6467

6468 ALLKEALRMHPPGPLLSWRHRAISDTYVDGHLVPAGTTAMVNQWAMSRDADVWDAPLEFQ 6647

6648 PERFLPGGKAHGVSVLGADGRLVPFGSGRRSCPGKSLAMTTVTAWMATLLHEFEWTPA 6821

6822 SGAVDLSEVLRLSCEMAVPLEV 6887

 

>AC099399.1 $F CYP78C7 chromosome 3 clone OJ1006_F06, Length = 122577

56% to 78A9

49121 MATSFVYLAIFACLAWAGTALLYWAHPGGPAWGKYRRARGQSPRCSI

48980 PGPKGLPVVGSLGLMSGLAHCSLAAEAARRPGAKRLMALSLGPVRAVVTSHPDVAKEIL 48804

48803 DNPAFADRPLNHAAYGLMFHRSIGFAEHGPYWRALRRVAAGHLFGPRQVDAFAPYRA 48633

48632 RVAGGVVAALRGAGGEAAVQVRGVLRRASLYYIMRFVFGKEYDVSRGAPESGEEVEELL 48456

48455 EMVHEGYDLLGKENWCDYFPGLAAVDPQGVGARCAELMPRVNRFVRGIIQEHRGKAIA 48282

48281 GGEARDFVDILLSLQESEGLADADIAAVLW 48192 (0)

47885 EMIFRGTDAMAVLMEWTLARVVLHPDVQANVHRELDAVVGRSNTVAESAVPSLPYLQALL 47706

47705 KEALRMHPPGPLLSWRHRAISDTYVDGHLVPAGTTAMVNQWAMSRDADVWDAPLEFQPER 47526

47525 FLPGGKAHGVSVLGADGRLVPFGSGRRSCPGKSLAMTTVTAWMATLLHEFEWTPASGA 47352

47351 VDLSEVLRLSCEMAVPLEVRVSARRNV* 47268

 

#306

>aaaa01012488.1 $FI CYP78D1 (indica cultivar-group) 47% to CYP78A5 orth AA751892

5682 MRNEVLSTIFLLLIFFTTTINPSSSQLPWLFSLLYLSLAMAVVALPPLLAKRHGHARRVN 5503

5502 GGGAAIPGPRGWPLLGSLPAVSGPLMHRRLAALAYAHGGGARR LMSLTLGATPVVVSSHP 5323

5322 DTAREILAGAAFRDRPARAAARE LMFLRAVGFAPAAGDDGGAYWRRLRRAAGAGMLS PRR 5143

5142 AAALAALRARVARRTSEAVSRGMAVPPGRVAMRALLHAASLDNMVGSVLGLEHHDHHGGV 4963

4962 ISDMGDMVREGYELVGKFNLGDYYSTTQYQCLWGLLDFHGVGPRCQRLAARVREQFGRVM 4783

4782 EERRKVSDLHKRDDLLSYMLSMPQEERIEDSDVIAVLW (0) 4669

3107 EMIFRGTDVVAILLEWAMARMVLHPDIQSKVQEELDRAVGHRPMTDSDIPSLRFLHCVIK 2928

2927 ETLRMHPPGPLLSWARLAVHDTYVGKHLVPAGTTAMVNMWAISHDETIWGDPWVFRPERF 2748

2747 MEEDINVLGSDLRLAPFGSGRRVCPGRMMGLSTSYLWFGRMLQEYKWSPAQPVKLTECLR 2568

2567 LSMEMKKPLVCHAVPRSKTG* 2505

 

>AA751892 CYP78D1 (partial)  52% to 78A5     1/98 469-519 REGION

Ortholog to aaaa01012488.1 complete 3 diffs no nr or HTGS match on 9/10/02

RXVCPGRMMGLSTAYLWFGRMLQEYKWAAAQPVKLTECLRLSMEMXKPLVCHAVPRXKTG*

 

#353

>aaaa01016663.1 CYP79A7 (indica cultivar-group) orth AC091302.4 AC084319.5b

chr 3 99%

DIMIATVDNPSNAVEWALAEMMNKPEVMRKAMNELDTVVGRDRLVQE 2600

2599 SDVRDLNYLKACIREAFRLHPYHPFNPPRVAMADTTIAGYTIPKGSQVILSRVGLGRNPR 2420

2419 VWDDPLEFRPERHLSPYPAGGRGDAGVVALTEAELRFVSFSTGRRGCPGVSLGTLITVTL 2240

2239 FARLLQGFEWSKPAGVKRVELREEAASLVLAQPLVLQATP 2120

 

>AC091302.4 CYP79A7 chr 3 = AC084319.5b 50% to 79B3

18071 MSEAMAVTMPPMRAPALVAMLVVVLVALVRRRRHRSKGAGGRLESLPPGPVGLPVIGNMH 17892

17891 QMLVNKPVFRWVHRLLADAGGEIVCVRLGPVHVVAVTSPEMAREVLRKNDAVFADRPTTF 17712

17711 AAESFSVGYRSASISPHGDQWRKMRRVLTAEILSPATEHRLRGARGEEADHLVRYVLVRC 17532

17531 GRDGAAVDVRHVARHFCGNVIRRLTLGRRHFREPRADDEDAAAPGRDEAEHVDALFATLN 17352

17351 YLDAFCVSDYFPALVGLDLDGQEKVIKKVMRTLNRLHDPVVEERVEEWRLLRKAGERRDV 17172

17171 ADFLDVLASLDDAAGRPLLTVEEIKAQTI 17085

14173 DIMIATVDNPSNAVEWALAEMMNKPEVMRKAMDELDTVVGRDRLVQESDVRDLNYLKACI 13994

13993 REAFRLHPYHPFNPPRVAMADTTIAGYTIPKGSQVILSRVGLGRNPRVWDDPLEFRPERH 13814

13813 LSPYPAGGRGDAGVVALTEAELRFVSFSTGRRGCPGVSLGTLITVTLFARLLQGFEWSKP 13634

13633 AGVERVELREEAASLVLAQPLVLQATPRLAAHLYGAGK 13520

 

>AC084319.5b $F CYP79A7 chr 3 one diff with indica seq aaaa01016663.1 = AC091302.4

start Met not identified; at least 5 to pick from

122542 MSEAMAVTMPPMRAPALVAMLVVVLVALVRRRRHRSKGAGGRLESLPPGPVGLPVIGNMH 122363

122362 QMLVNKPVFRWVHRLLADAGGEIVCVRLGPVHVVAVTSPEMAREVLRKNDAVFADRPTTF 122183

122182 AAESFSVGYRSASISPHGDQWRKMRRVLTAEILSPATEHRLRGARGEEADHLVRYVLVRC 122003

122002 GRDGAAVDVRHVARHFCGNVIRRLTLGRRHFREPRADDEDAAAPGRDEAEHVDALFATLN 121823

121822 YLDAFCVSDYFPALVGLDLDGQEKVIKKVMRTLNRLHDPVVEERVEEWRLLRKAGERRDV 121643

121642 ADFLDVLASLDDAAGRPLLTVEEIKAQTI 121556

118644 DIMIATVDNPSNAVEWALAEMMNKPEVMRKAMDELDTVVGRDRLVQESDVRDLNYLKACI 118465

118464 REAFRLHPYHPFNPPRVAMADTTIAGYTIPKGSQVILSRVGLGRNPRVWDDPLEFRPERH 118285

118284 LSPYPAGGRGDAGVVALTEAELRFVSFSTGRRGCPGVSLGTLITVTLFARLLQGFEWSKP 118105

118104 AGVERVELREEAASLVLAQPLVLQATPRLAAHLYGAGK 117991

 

#212

>aaaa01007189.1 $FI CYP79A9 (indica cultivar-group) ortholog of AL662990.1

     MPSQLIAQNDHAYVLTFL

4128 MSIAMAILLLVALFYRIKKQAAAMAAKRKQQPKLPPGLATMPVVRNMHQMLMNKPVFRWIHR 3943

3942 LLDEMDTEILCLRFGRVHVIAVASPEMARELLRKKDAMLASRH 3814

3814 SSFASRTFSFGYKNTIMSPGGDQWRKMRRVLTSEILSPAMERRMLGRRVEEADHLVNYVY 3635

3634 GHCNDGTVDVRHVTRHFCGNIIRKLVFGRRHFNSGDGNIGPGRDEEAHIDALFTALDYHG 3455

3454 AFSVSDYFPTLVGLDLDGHEEVVNGLMNTFNRLHDPIIMERIEEWKSLRTKGDKRREVAD 3275

3274 FLDVLISLEDAQGKPFLSVDEIKAETL (0) 3194

2567 EIILATVDNPSNAVEWALAEMVNNPKVMKKAVDELDMVVGRERLVEESDIHNLTYLKACI 2388

2387 REAFRLHPYHPFNPPHVAIADTTVAGYMIPKGSHVMLSRIGLGRNPRAWDKPLEFQPERH 2208

2207 LKNTGTVVLAEPELRFVSFSAGRRGCPAVSLGTSITMMLFARLLQGFSWSIPPGGDRIEL 2028

2027 QESATSLQLSKPLFMQAKPRLLLHLYEADVLN* 1929

 

>AL662990.1 CYP79A9 chr 4 clone OSJNBb0015C06, 58% to 79A2 67% to CYP79 sorghum

C98773  146-257 REGION C-HELIX AND ON 46% to 79B2 opp end = C98774 (probably 3

prime untranslated) = AA752332.1 cannot complete from GenBank use the indica seq.

Ortholog to aaaa01007189.1 98% shown in lower case to fill missing seq.

     mpsqliaqndhayvltfl

4128 msiamailllvalfyrikkqaaamaakrkqqpklppglatmpvvrnmhqmlmnkpvfrwihr 3943

3942 lldemdteilclrfgrvhviavaspemarellrkkdamlasrh 3814

3814 ssfasrtfsfgykn

TIMSPAGDQWRKMRQVLTSEILSPAMERRMLGRRVEEADHLVNYVYSHCNDGTVDVR

HVTRHFCGNIIRKLVFGRRHFNSGDGNIGPGRDEEAHIDALFTALDYHGAFSVSDLLP

TLVGLDLDGHEEVVNGLMNTFNRLHDPIIMERIEEW (gap? kslrtkgdkrrevadf)

119313 LDVLISLEDAQGKPFLSVDEIKAETL (0) 119236

118617 EIILATVDNPSNAVEWALAEMVNNPKVMKKAVDELDMVVGRERLVEESDIHNLTYLKACI 118438

118437 REAFRLHPYHPFNPPHVAIADTTVAGYMIPKGSHVMLSRIGLGRNPRAWDKPLEFQPERH 118258

118257 LKNTGTVVLAEPELRFVSFSAGRRGCPAVSLGTSITMMLFARLLQGFSWSIPPGGDRIEL 118078

118077 QESATSLQLSKPLFMQAKPRLLLHLYEADVLN* 117979

 

#347

>aaaa01014830.1 $FI CYP79A10 similar to AL662990.1 a CYP79

4669 MFPSSVNMREQNNII

4624 IISIAMTILLLVVFFCRMLGNMAGKNKRKKQPKLPPGPATMPVLGNIHQILMNKPVFRW 4448

4447 IHRLLDEMDTEILCLRLGSVHVIAIASPEMAREALRRNDAVLTSRPVSFAWRAFSFGYKN 4268

4267 TVGSTGDQWKKMRRMLASEILSSAMERRMLGQRVEEADHLVNYIYRNCNSGTVDIRHVT 4091

4090 RHFCGNIIRKLVFGRRHFAFGAGNIGPGRDEEAHIDALFTALDYLGAFSISDYFPSL 3920

3919 VLNGLMSTFRRLHDPIIMERMEEWRAPRRNGDERREVADFLDVLISLDDAQGK 3761

3760 PLLSLDEVKAETL 3722 (0?)

3092 EIILNSVDNPSNAVEWALAEMVNNPKVMKKAVDELDMVVGKERLVEESDIHSLTYLKACI 2913

2912 REAFRIHPYHPFNPSHVAIANITIAGFMIPKGSHIILSRIGLGRNPRAWDNPLEFRPERH 2733

2732 LKNTDNVVLAEPELRFLSFSAGRRGCPAVSLGTSITMMLFARLLQGFSWSISPGANRIEL 2553

2552 QESVTSLQLSKPLLMQAKPRLLLHLYEMDIAKQG* 2448

 

no japonica ortholog found 9/11/02

 

#239

>aaaa01008340.1 CYP79A11 (indica cultivar-group) 81-84% to aaaa01007189 (partialI)

11247 MFFPSTANMREQNN

11205 TIIVSIAMTILLLVAFFCRIKKQAAMAAKNKRKKQPKLPPGPATMPVLGNMHQMLMNKPVFRWI 11014

11013 HRLLDEMDIEILCLRLGRVHVITVASPEMAREVLRKNDALMTSRPASFAWRAFSFGYKN 10837

10836 TIGSTGDQWKKMRRALASEILSPAME 10759

sequence gap here

10649 IIMERMHEGRALRRNGDERREVADFLDVLVSLEEAQGNPLLSLDEVKAETL (0?) 10497

7815 EIFIATVDNPSNAVEWALAEMVNNPNVMKKAVDELDVVVGKERLVEESDIQNLTYLKAC 7639

7638 IREAFRIHPYHPFNPPHVAISDTIIAGYLIPKDSHVMLSRIGLGRNPRVWVNPLEFRPER 7459

7458 HLNNATSTMVLAEPELRFVSFGASRRGCPAVSLGTSITMMLFARLLQGFTWSIPPGADKI 7279

7278 ELQESASSLQLSKPLLMQAKPRLLLHLYELDRL* 7177

 

10836 TIGSTGDQWKKMRRALASEILSPAME 10759

 

missing region resembles this

 

RRMLGQRVEEADHLVNYIYRNCNSGTVDIRHVT 4091

4090 RHFCGNIIRKLVFGRRHFAFGAGNIGPGRDEEAHIDALFTALDYLGAFSISDYFPSL 3920

3919 VLNGLMSTFRRLHDP

 

10649 IIMERMHEGRALRRNGDERREVADFLDVLVSLEEAQGNPLLSLDEVKAETL (0?) 10497

 

no japonica ortholog on 9/7/02

 

#322

>aaaa01013363.1 $FI CYP81A5 (indica cultivar-group) ortholog of AC084282a 100%

3727 MDKAYIAVFSIVILFLLVDYLRRLRGGGTSNGKNKGMRLPPGLPAVPIIGHLHLVKKPMH 3906

3907 ATLSRLAARHGPVFSLRLGSRRAVVVSSPGCARECFTEHDVAFANRPRFESQLLMSFDGT 4086

4087 ALAMASYGPHWRNLRRVAAVQLLSARRVGLMSGLIAGEVRAMVRSLCRRPAAAAPVQLKR 4266

4267 RLFELSLSVLMETIAQSKATRPETTDTDTDMSMEAQEYKQVVEEILERIGTGNLCDYLPA 4446

4447 LRWFDVFGVRNRILAAVSRRDAFLRRLIYAARWRMDDGEKKSMIAVLLTLQKTQPEVYTD 4626

4627 NMITALCS (0) 4650

6103 NLLGAGTETTSTTIEWAMSLLLNHPETLKKAQAEIDASVGNSRLITADDVPRITYLQCIV 6282

6283 RETLRLYPAAPMLIPHESSADCEVGGYSVPRGTMLLVNAYAIHRDPAAWEEPERFVPERF 6462

6463 EGGGCDGNLSMPFGMGRRRCPGETLALHTVGLVLGTLIQCFDWERVDGVEVDMAEGGGLT 6642

6643 MPKVVPLEAVCRPRDAMGGVLREL* 6717

 

>AC084282a $F CYP81A5 46% to 81D5 CDS join(53081..54004,55290..55904)

AQ160774 nbxb0005P20f 58% identical to AU032242 47% to AA754300

AQ288789 perf to end 59% to 81D2 78% identical to C97610

MDKAYIAVFSIVILFLLVDYLRRLRGGGTSNGKNKGMRLPPGLP

AVPIIGHLHLVKKPMHATLSRLAARHGPVFSLRLGSRRAVVVSSPGCARECFTEHDVA

FANRPRFESQLLMSFDGTALAMASYGPHWRNLRRVAAVQLLSARRVGLMSGLIAGEVR

AMVRSLCRRPAAAAPVQLKRRLFELSLSVLMETIAQSKATRPETTDTDTDMSMEAQEY

KQVVEEILERIGTGNLCDYLPALRWFDVFGVRNRILAAVSRRDAFLRRLIYAARWRMD

DGEKKSMIAVLLTLQKTQPEVYTDNMITALCSNLLGAGTETTSTTIEWAMSLLLNHPE

TLKKAQAEIDASVGNSRLITADDVPRITYLQCIVRETLRLYPAAPMLIPHESSADCEV

GGYSVPRGTMLLVNAYAIHRDPAAWEEPERFVPERFEGGGCDGNLSMPFGMGRRRCPG

ETLALHTVGLVLGTLIQCFDWERVDGVEVDMAEGGGLTMPKVVPLEAVCRPRDAMGGV

LREL

 

#80

>aaaa01002164.1a $FI CYP81A6 (indica cultivar-group) ortholog of AC084282b $F 100%

1187 MDNAYIIAILSVAILFLLHYYLLGRGNGGAARLPPGPPAVPILGHLHLVKKPMHATMSRL 1366

1367 AERYGPVFSLRLGSRRAVVVSSPGCARECFTEHDVTFANRPRFESQLLVSFNGAALATAS 1546

1547 YGAHWRNLRRIVAVQLLSAHRVGLMSGLIAGEVRAMVRRMYRAAAASPAGAARIQLKRRL 1726

1727 FEVSLSVLMETIAHTKATRPETDPDTDMSVEAQEFKQVVDEIIPHIGAANLWDYLPALRW 1906

1907 FDVFGVRRKILAAVSRRDAFLRRLIDAERRRLDDGDEGEKKSMIAVLLTLQKTEPEVYTD 2086

2087 NMITALTA 2110

2837 NLFGAGTETTSTTSEWAMSLLLNHPDTLKKAQAEIDASVGNSRLITADDVTRLGYLQCIV 3016

3017 RETLRLYPAAPMLLPHESSADCKVGGYNIPRGSMLLINAYAIHRDPAVWEEPEKFMPERF 3196

3197 EDGGCDGNLLMPFGMGRRRCPGETLALRTVGLVLGTLIQCFDWERVDGVEVDMTEGGGLT 3376

3377 IPKVVPLEAMCRPRDAMGGVLRELV 3451

 

>AC084282b $F CYP81A6 46% to 81D5 N-term extension or too long

CDS join(57467..57739,58238..58330,59557..59693,60071..61148,

61875..62492)

C97610 61% to 81D2 C-terminal 78% identical to AQ288789

C97611.1 Rice callus Oryza sativa cDNA clone C60478_8A.Length = 661 57% to 81D4

MAFLGWAVDIARDSGASSSVVLTCDGYGSALYFSPWDSVPLPAT

ASPDDGFLLPRFPDVCVQRSQFTNHLAPANGTGGGGSRTGVKEEASEVLSWPPTSKQS

VRRLEVAEHWYRLYKTDNQRLSPDSQQVSVLAESHCDLASGNWKEISIHHKKMPSSTT

TKTTTPSRDAWIVSARSDPFHLLLEAQAPLGIKADALSQIAAVHQSHRNTSHIRELSLA

Upper part may be error (fusion to another gene?)

MDNAYIIAILSVAILFLLHYYLLGRGNGGAARLPPGPPAVPILGHLHLVKKPMHATM

SRLAERYGPVFSLRLGSRRAVVVSSPGCARECFTEHDVTFANRPRFESQLLVSFNGAA

LATASYGAHWRNLRRIVAVQLLSAHRVGLMSGLIAGEVRAMVRRMYRAAAASPAGAAR

IQLKRRLFEVSLSVLMETIAHTKATRPETDPDTDMSVEAQEFKQVVDEIIPHIGAANL

WDYLPALRWFDVFGVRRKILAAVSRRDAFLRRLIDAERRRLDDGDEGEKKSMIAVLLT

LQKTEPEVYTDNMITALTANLFGAGTETTSTTSEWAMSLLLNHPDTLKKAQAEIDASV

GNSRLITADDVTRLGYLQCIVRETLRLYPAAPMLLPHESSADCKVGGYNIPRGSMLLI

NAYAIHRDPAVWEEPEKFMPERFEDGGCDGNLLMPFGMGRRRCPGETLALRTVGLVLG

TLIQCFDWERVDGVEVDMTEGGGLTIPKVVPLEAMCRPRDAMGGVLRELV

 

#81

>aaaa01002164.1b $FI CYP81A7 (indica cultivar-group) ortholog of AC084282.1c $F 100%

6328 MDKAYIAVFSIAILFLLVDYFRCRRRRGSGSNNGENKGMLQLPPSPPAIPFFGHLHLIDK 6507

6508 PLHAALSRLAERHGPVFSLRLGSRNAVVVSSPECARECFTDNDVCFANRPQFPSQMPATF 6687

6688 YGAGFGFANYGAHWRNLRRIATVHLLSAHRVRGMAGVVSGEIRPMVQRMYRAAAAAGVGV 6867

6868 ARVQLKRRLFELSLSVLMEAIAQTKTTRPEADDADTDMSVEAQEFKNVLDELNPLLGAAN 7047

7048 LWDYLPALRVFDVLGVKRKIATLANRRDAFVRRLIDAERQRMDNGVDGGDDGEKKSVISV 7227

7228 LLSLQKTEPEVYKDIVIVNLCA 7293

8136 ALFAAGTETTAMTIEWAMSLLLNHPKILKKAKAEIDASVGNSRLINGDDMPHLSYLQCII 8315

8316 NETLRLYPVAPLLIPHESSADCKVNGYHIPSGTMLLVNVIAIQRDPMVWKEPNEFKPERF 8495

8496 ENGESEGLFMIPFGMGRRKCPGETMALQTIGLVLGALIQCFDWDRVDGAEVDMTQGSGLT 8675

8676 NPRAVPLEAMCKPREAMSDVFRELL 8750

 

>AC084282c $F CYP81A7 41% to 81H1 CDS join(65367..66332,67175..67792)

MDKAYIAVFSIAILFLLVDYFRCRRRRGSGSNNGENKGMLQLPP

SPPAIPFFGHLHLIDKPLHAALSRLAERHGPVFSLRLGSRNAVVVSSPECARECFTDN

DVCFANRPQFPSQMPATFYGAGFGFANYGAHWRNLRRIATVHLLSAHRVRGMAGVVSG

EIRPMVQRMYRAAAAAGVGVARVQLKRRLFELSLSVLMEAIAQTKTTRPEADDADTDM

SVEAQEFKNVLDELNPLLGAANLWDYLPALRVFDVLGVKRKIATLANRRDAFVRRLID

AERQRMDNGVDGGDDGEKKSVISVLLSLQKTEPEVYKDIVIVNLCAALFAAGTETTAM

TIEWAMSLLLNHPKILKKAKAEIDASVGNSRLINGDDMPHLSYLQCIINETLRLYPVA

PLLIPHESSADCKVNGYHIPSGTMLLVNVIAIQRDPMVWKEPNEFKPERFENGESEGL

FMIPFGMGRRKCPGETMALQTIGLVLGALIQCFDWDRVDGAEVDMTQGSGLTNPRAVP

LEAMCKPREAMSDVFRELL

 

#82

>aaaa01002164.1c $FI CYP81A8 (indica cultivar-group) ortholog of AC084282.1d $F 99%

11246 MV*AYIAIFSIAVLLLIHFLFRRRGRSNGMPLPPSPPAIPFFGHLHLIDKPFHAALSRLA 11425

11426 ERHGPVFSLRLGSRNAVVVSSPECARECFTDNDVCFANRPRFPSQMLATFNGTSLGSANY 11605

11606 GPHWRNLRRIATVHLLSSHRVSGMSGIISGQARHMVRRMYRAATASAAGVARVQLNRRLF 11785

11786 ELSLSVLMEAIAQSKTTRREAPDADTDMSMEAQELRHVLDELNPLIGAANLWDYLPALRW 11965

11966 FDVFGVKRKIVAAVNRRNAFMRRLIDAERQRMDNNDVDGGDDGEKKSMISVLLTLQKTQP 12145

12146 EVYTDTLIMTLCA

      PLFGAGTETTSTTIEWAMSLLLNHPEILKKAQAEIDMSVGNSRLISVV 12505

12506 DVHRLGYLQCIINETLRMYPAVPLLLPHESSADCKVGGYHIPSGAMLLVNVAAIQRDPVI 12685

12686 WKEPSEFKPERFENGRFEGLFMIPFGMGRRRCPGEMLALQTIGLVLGTMIQCFDWGRVDD 12865

12866 AMVDMTQSNGLTSLKVIPLEAMCKPREAMCDVLRKFM 12976

 

>AC084282d $F CYP81A8 40% to 81H1 join(70282..71220,71398..72015)

EST D46694, D46695 from this gene

MVKAYIAIFSIAVLLLIHFLFRRRGRSNGMPLPPSPPAIPFFGH

LHLIDKPFHAALSRLAERHGPVFSLRLGSRNAVVVSSPECARECFTDNDVCFANRPRF

PSQMLATFNGTSLGSANYGPHWRNLRRIATVHLLSSHRVSGMSGIISGQARHMVRRMY

RAATASAAGVARVQLNRRLFELSLSVLMEAIAQSKTTRREAPDADTDMSMEAQELRHV

LDELNPLIGAANLWDYLPALRWFDVFGVKRKIVAAVNRRNAFMRRLIDAERQRMDNND

VDGGDDGEKKSMISVLLTLQKTQPEVYTDTLIMTLCAPLFGAGTETTSTTIEWAMSLL

LNHPEILKKAQAEIDMSVGNSRLISVVDVHRLGYLQCIINETLRMYPAAPLLLPHESS

ADCKVGGYHIPSGAMLLVNVAAIQRDPVIWKEPSEFKPERFENGRFEGLFMIPFGMGR

RRCPGEMLALQTIGLVLGTMIQCFDWGRVDDAMVDMTQSNGLTSLKVIPLEAMCKPRE

AMCDVLRKFM

 

Note AC084282 continues this cluster on AC084282a

 

#417

>aaaa01027888.1 CYP81L2 (indica cultivar-group) orth AP005285.1a gene 1 $F 100%

2157 AGTETSAAVVEWAMSLLLNNPGAMARARGEIDACVGQPAARLLEAADLPKLHYLRCVVME 1978

1977 TLRLYPPVPLLAHESSADCDVAGFHVRKGTMLLVNTFAIHRDPQVWDEPESFFPDR 1807

 

>AP005285.1a gene 1 $F CYP81L2 43% to 81D2 (one frameshift)

MVDAMSGGVLVALMVLLLVAAPALLSQLERRRRSPSGPVALPVV (1?)

GPRGSPQNREPHPAPLP

98846 AQRPGAPVICRRFGSPPLAVVSSAPAAEECLGPHDLAFADKPRLPSGEILSYEWSTMGTA 98667

98666 RYGPYWRHIRRITVTELLSAHRVQHFAGVNAREVRAMARRLYRRAAAAAAS 98514 (fs)

98512 RVELKSRLFELFMNIMMAMICDRTFYGDGDDEVSEEARWFRSVVKETMELSGASTAWDF

      LPAAARWLFARRLTRRMRELSDSRTRFYQRLITDHRTKEKTDDDNAAAGDHSPAPRRRTM

      IGVLLSLQSKDPDACPDQLIRALCI (0) 98081

97990 GSLQAGTETSAAVVEWAMSLLLNNPGAMARARGEIDACVGQPAARLLEAADLPKLHYLRCVVMETL 97793

97792 RLYPPVPLLAHESSADCDVAGFHVRKGTMLLVNTFAIHRDPQVWDEPESFFPDR (2) 97631

94075 FADGQNEAKMVIPFGMGRRGCPGENLAMQMVGLTLGTLIQCFDWERVGEELEDMGESSG 93899

93898 ITMPKKLPLEAFYQPRACMVHLLSS* 93821

 

#417

>aaaa01048292.1 CYP81L2 (indica cultivar-group) ortholog to AP005285.1a gene 1

clone is only 1060 bp long, missing N and C terminals

see aaaa01027888.1 for ortholog

1   AFADKPRLPSGEILSYEWSTMGTASYGPYWRHIRRITVTELLSAHRVQHFAGVNAREVRA 180

181 MARRLYRRAAAAAASAGGRARVELKSRLFELFMNIMMAMICDRIFYGDGDDEVSEEARWF 360

361 RSVVKETMELSGASTAWDFLPAAARWLFARRLTRRMRELSDSRTRFYQRLITDHRTKEKT 540

541 DDDNSAAGDHSPAPRRRTMIGVLLS

617 LQSKDPDACPDQLIRALCI (0)

    GSLQAGTETSA 796

797 AVVEWAMSLLLNNPGAMARARGEIDACVGQPAARLLEAADLPKLHYLRCVVMETLRLYPP 976

977 VPLLAHESSADCDVAGFHVRKGTMLLVN 1060

 

#417

>aaaa01050076.1 CYP81L2 (indica cultivar-group) orth AP005285.1a gene 1 $F 96% 2 diffs

see aaaa01027888.1 for ortholog

66  VIPFGMGRRGCPGESLAMQMVGLTLGTLIQCFDWERVGEELVDMGESSGITMPKKLPLE 242

 

#165

>aaaa01004834.1a CYP81L3 (indica cultivar-group) ortholog of AP005285.1b gene 2

8825 MAVDAMFGSVAVALLAVVVAAAALR 8751

     Missing 48 amino acids shown from ortholog

rwrrrrrrgggrplpgpvalpvvghlhlfrrplhrtlarlaarhgaav

7227 MGLRFGSRRVAVVSSAPAAEECLGPHDLAFANRPRLPSGEILAYEWSTMGTASYGPYWRH 7048

7047 IRRIAVTELLSAHRVQHFADVNVREVRALARRLYRRAAAAAAAGARTRVELKSRLF 6880

6879 ELLMNTMMSMICERTFYGA (2)

     DDDEVSEEARWFRSVV 6700

6699 KETMELSGASTVWDFLPAPARWLDAGRMTRRMRELSDSRTRFLQRLIDDQRKDMDADSDE 6520

6519 HAPAKRRTMIGVLLSLQSKDPDSCPDQLIRSLCI (0) 6418

6331 GSLQAGTDTSAATVEWAMSLLLNNPGAMARARGEIDACVGQPAARLLEAADLPKLHYLRC 6152

6151 VVMETLRLYPPVPLLAPHESSADCVVAGFHVPQGTMLLVNTFAIHRDPQVWDEPEAFIPDR (2) 5969

5398 FADGKNEGKMVIPFGMGRRRCPGENLGMQMVGLALGTLIQCFDWERVGEELVDMRECS 5225

5224 GLTMPKELPLEALYQPRASMVDLLTKI* 5141

 

>AP005285.1b gene 2 CYP81L3 (partial)   sequence gap with C-helix (about 47 amino acids) 81D like

108306 MAVDAMFGSVAVALLAVVVAAAALRRWRRRRRRGGGRPLPGPVALPVVGHLHLFRRPLHRT 108124

108123 LARLAARHGAAVMGLRFGSRRVAVVSSAPAAEECLGPHDLAFANR (?) 107989

sequence gap with C-helix (about 47 amino acids) lower case sequence shown

from ortholog aaaa01004834.1a

 

prlpsgeilayewstmgtasygpywrhirriavtellsahrvqhfad

 

107525 VNVREVRALARRLYRRAAAAAAAGARTRVELKSRLFELLMNTMMSMICERTFYGA (2) 107361

       DDDEVSEEARWFRSVVKETMELSGASTVWDFLPAPARWL 107169

107168 DAGRMTRRMRELSDSRTRFLQRLIDDQRKDMDADSDDHAPAKRRTMIGVLLSLQRKD 106998

106997 PDSCPDQLIRSLCI (0) 106956

106869 GSLQAGTDTSAATVEWAMSLLLNNPGAMARARGEIDACVGQPAARLLEAADLPKLHYLRCV 106687

106686 VMETLRLYPPVPLLAPHESSADCVVAGFHVPQGTMLLVNTFAIHRDPQVWDEPEAFIPDR (2) 106507

106178 FADGKNEGKMVIPFGMGRRRCPGENLGMQMVGLALGTLIQCFDWERVGEELVDMRECSG 106002

106001 LTMPKELPLEALYQPRASMVDLLTKI* 105922

 

#165

>aaaa01089015.1 CYP81L3 (indica cultivar-group) orth? AP005285.1b gene 2 97%

2 diffs/59 aa see aaaa01004834.1a for ortholog

375 MVLLLVAAPALLSRLERRRRPPPGPVALPVVGHLHLLRRPLHRTLARLAARHGA 536

537 AAVMGLRFGSRRVAVVSSAPAAEECLGPHDL 629

 

#166

>aaaa01004834.1b $FI CYP81L4 (indica cultivar-group) ortholog of AP005285.1c gene 3

14689 MDALLIALFLLLLIALMETARVRRSGTQRRAGNVPPPPPEPAGLPLVGHLHLFRKPLHRT 14510

14509 LARLAARHGGAVFGLRLGSRRVAVVSSAPAAEECLGAHDVAFADRPRLPSGRILSYDWST 14330

14329 MGTASYGPYWRHVRRVAVTEILSARRVQHFADVHVREARAMARHLHRAAVRHGVGGAARV 14150

14149 RVELKSRLFELLMNTMMAMICDKTYYGD (2) 14066

13973 DDDGEVSKEARWFREMVEETMALSGASTVWNFLPAALRWVDVGGVGRRLWRLRESRTRFL 13794

13793 QGLINDERKEMEQEQGGDRAQPAARRRTMIGVLLSVQRQDPDACPDQLIRSLCI (0) 13632

12083 SSLEAGTDTSADTIEWAMSLLLNNPNVMRKARDEIDAFIGQPVRLLEASDLTKLQYLQCII 11901

11900 METLRLYPPAPLLVPHEASTDCSIAGFHITRGTMLLVNTFAIHRDPQVWNEPTSFIPER (2) 11721

      FENGRSEGKMAIPFGMGRRKCPAENLGMQMVGLAL 11541

11540 GTMIQCFEWERVGEELVDMTEGSGLTMPKEVPLQAFYQPRASLMHLLY 11397

 

>AP005285.1c $F CYP81L4 gene 3 one frameshift 43% to 81H1 very similar to BI806420.1 15 diffs

114128 MDALLIALFLLLLIALMETARVRRSGTQRRAGNVPPPPPEPAG

       LPLVGHLHLFRKPLHRTLARLAARHGGAVFGLRLGSRRVA

       VVSSAPAAEECLGAHDVAFADRPRLPSGRILSYDWSTMGTASYGPYWRHVRRVAVTEILS

       ARRVQHFADVHVREARAMARHLHRAAVRHGVGGAARVRVELKSRLFELLMNTMMAMICDKTYYGD (2) 113505

113412 DDDGKVSKEARWFREMVEETMALSGASTVWDFLPAALRWVDVGGVGRRLWRLRESRTRF

       LQGLINDERKEMEQEQGGDRAQPAARRRTMIGVLLSVQRQDPDACPDQLIRSLC 113074

111518 AGTDTSADTIEWAMSLLLNNPNVMRKARDEIDAFIGQP (fs) 111405

111405 SRLLEASDLTKLQYLQCIIMETLRLYPPAPLLVPHEASTDCSIAGFHITRGTMLLVNTFA

       IHRDPQVWNEPTSFIPER (2)

       FENGRSEGKMAIPFGMGRCKCPAENLGMQMVGLALGTMIQCFEWERVGEELVDMTEGSG

       LTMPKEVPLQAFYQPRASLMHLLY* 110842

 

#167

>aaaa01004834.1c CYP81L5P = END OF 81L6 (indica cultivar-group) ortholog of AP005285.1d BI806420.1

16396 SLEAGTGTSTDTIEWAMSLLLNNPDVMRKARDEIDAFIGQPVRLLEADDLPKLQYLRCI 16220

16219 IMETLRLYPPAPLLVPHESSSDCTVAGFHIPRGTMLLVNTFDIHRDPHIWDEPTSFIPER (2?) 16040

15947 FEDGRSEGKMAIPFGMGRRKCPAENLGMQMVGLGLGTMIQCFEWERVGEELVDMTEGSG 15771

15770 LTMPKKVPLEAFYQPRASVMHLLS* 15696

 

>AP005285.1d CYP81L5P BI806420.1 (partial) clone S063A11. 57% to AL606630.1 54% to 81D2 many frameshifts ortholog of aaaa01004834.1c = AP005285.1d at 115000 region

This is between gene 3 and gene 4 on AP005285.1 (missed earlier)

There is no N-terminal (gene 4 might be the N-term, but it is in the opposite orientation and 27000 bp away; it might have been split by a rearrangement)

The seq below is 100% identical to aaaa01004834.1c ortholog

Even the pseudogenes are highly conserved

115879 SLEAGTGTSTDTIEWAMSLLLNNPDVMRKARDEIDAFIGQPVRLLEADDLPKLQYLRCII 115700

115699 METLRLYPPAPLLVPHESSSDCTVAGFHIPRGTMLLVNTFDIHRDPHIWDEPTSFIPER 115523

115436 FEDGRSEGKMAIPFGMGRRKCPAENLGMQMVGLGLGTMIQCFEWERVGEELVDMTEGSG 115254

115253 LTMPKKVPLEAFYQPRASVMHLLS 115182

 

#129

>aaaa01003554.1 CYP81L6 = 81L5 (indica cultivar-group) ortholog to AP005285.1e gene 4

3364 MDTSTLLIALTLVLLLLLLLLLTARRRRSGRPRLRLPPEPAGLPLVGHLHLFRKPLHRTL 3543

3544 ARLAARHGAVFRLRLGSRRVAVVVSSAPAAEECLGAHDVAFAGRPRLPSAGILSYGWSTM 3723

3724 GTAAYGPYWRHVRRVAVAEILSAHRVRQFAGAHDREARATARRLCRAASRQRHHGAGAAA 3903

3904 GRVRVELKSRLFELLMNTMMAMICDKTYYGA (2) 3996

4070 DDDGEVSEEARWFREMVEETMALSGASTVWDFLPAVLRWVDVGGVGRRLWRLRESRTRF 4246

4247 LQGLIDDQRKEMEHDGDGRELPAAAARPRSMIGVLLSVQRQ 4369 100 ns in seq

missing C-terminal

 

>AP005285.1e gene 4 CYP81L6 (partial)   one frameshift, runs off end of clone missing last two exons

38% to 81H1 same as gene a on AP004022.1

AP004022.1a comp(2300-3335) chromosome 2 clone OJ1126_B06, 52% to 705A21

1-321 runs off beginning of contig about 1639 cannot extend 10/11/01

NOTE 81L5 AND 81L6 ARE FROM THE SAME GENE

142199 MDTSTLLIALTLVLLLLLLTARRRRSGRLRLRLPPEPPGLPLVGHLHLFRKPLHRTLAR 142375

142376 LAARHGAVFRLRLGSRRVAVVVSSAPAAEECLGAHDVAFAGRPRLPSAGILSYGWSTMG 142552

142553 TAAYGPYWRHVRRVAVAEILSAHRVRQFAGAHAREARATARRLCRAASRQRHGAGAAAGR 142732

142733 VRVELKSRLFELLMNTMMAMICDKTYYGA (2) 142816

142893 DDDGEVSEEARWFREMVEETMALSGASTVWDFLPAALRWVDVGGVGRRLWRLRESR 143060

143061 TRFLQGLIDDQRKEMEHDGDGRELPAAAARPRSMIGVLLSVQR 143189

143191 QDPEECPDQLISSLCI (0) 143238 end of clone

AP005285.2 OLD 81L5P

27539 AGTGTSTDTIEWAMSLLLNNPDVMRKARDEIDAFIGQPVRLLEADDLPKLQYLRCIIM 27712

27713 ETLRLYPPAPLLVPHESSSDCTVAGFHIPRGTMLLVNTFDIHRDPHIWDEPTSFIPER (2) 27886

27979 FEDGRSEGKMAIPFGMGRRKCPAENLGMQMVGLGLGTMIQCFEWERVGEELVDMTEGSG 28155

28156 LTMPKKVPLEAFYQPRASVMHLLS* 28230

 

#416

>aaaa01027323.1 CYP81M1 (indica cultivar-group) orth AL606630.1 $F chromosome 4 100%

1106 PPPPPAEPAVPPRHRGHLHLFKKPLHRALSGLAATHGPVLLLHFGSRAVLHVTDPAVAEE 1285

1286 CLTDHDVTFANRPRLPSSCHLSNGYTTLGSSSYGPNWRNLRRIATVEY 1429

 

>AL606630.1 $F CYP81M1 chromosome 4 clone OSJNBb0046P18 45% to 81D3

70830 MANTTLSSLLFLSMASALFLLTLLRILRSKKQQR 70931 frameshift

70931 PPPPPAEPAVPPRHRGHLHLFKKPLHRALSGLAATHGPVLLLHFGSRAVLHVTDPAVAE 71107

71108 ECLTDHDVTFANRPRLPSSCHLSNGYTTLGSSSYGPNWRNLRRIATVEVFSAHRLLRSAD 71287

71288 VRGGEVPHMARWLYLAAPAAGPSEPARADVKARAFELVLNVVARMVAGKQYYGGEGDAEA 71467

71468 ETEEAARFREMVREYFAMHGASNLQDFVLLLGLVDIGGAKRRAVKLSRERNTWAQRLID 71644

71645 EHRATATAAAATEARTMVGDLLKMQASEPEAYSDKVITALCL 71770 (0)

73597 SILQTGTDTSSSTIEWGMALLLNHPAAMAKARAEIDRFVGTGRVVEEADLPNLPYL 73764

73765 QCIIRENLRLYPVGPLLAPHESSADCSVSVAGGGRYAVPAGTMLLVNVHAMHRDARFWGP 73944

73945 DPESFSPERFEGGRSEGKWMLPFGMGRRRCPGEGLAVKVVGLALATLVQCFEWRRV 74112

74113 GDEEVDMTEGSGLTMPKAVPLEALYWPRPEMVPALSGIFFIYNFFY* 74253

 

#372

>aaaa01017554.1 $FI CYP81N1 (indica cultivar-group) added 8 aa to Nterm

orth of AL772426.1a

483 MTGGLEVAMVAGGSNGGAAVLVGITVLLFVVVVVVVVLVRWWSGGEGGAAPSPPALPVLGHLHLLKKPLHR 695

696 SLAAVAAGVGAPVVSLRLGARRALVVSTHAAAEECFTACDAALAGRPRTLAGEILGYDHT 875

876 IVLWAPHGDHWRALRRFLAVELLSAPRLAALAADRHAEAASLVDAILRDAAGGAKVTLR 1052

1053 PRLFELVLNVMLRAATTRRRHASVDARKLQEIIEETFSVNGTPSVGDFFPALR 1211

1212 WVDRLRGKVGSLKKLQARRDAMVTGLIDDHRQWRSGSAGDGDQDKEKKGVIDALLA 1379

1380 LQETDPDHYTDNVVKGIIL (0) 1436

1896 SLLFAGTDTSALTIEWAMAQLVTHPETMKKARAEIDANVGTARLVEEADMANLPYIQC 2069

2070 VIKETLRLRTAGPVIPAHEAMEDTTVGGFRVARGTMVLVNAWAIHRNGDVWDAPEEFRPE 2249

2250 RFVDSDAGGAVTAPMMPFGLGRRRCPGEGLAMRVVGVSVAALVQCFDWEVGDDDVV 2417

2418 DMTEGGGLTMPMATPLAAVCRPREFVKTILSTSM* 2522

 

>AL772426.1a = CYP81N1 ortholog of AAAA01017554.1 $FI 99%

69722 MTGGLEVAMVAGGGNGGAAVLVGITVLLFVVVVVVVVLVRWWSG

69590 GEGGAAPSPPALPVLGHLHLLKKPLHRSLAAVAAGVGAPVVSLRLGARRALVVSTHA 69420

69419 AAEECFTACDAALAGRPRTLAGEILGYDHTIVLWTPHGDHWRALRRFLAVELLSAPRLAA 69240

69239 LAADRHAEAASLVDAILRDAAGGAKVTLRPRLFELVLNVMLRAATTRRRHASVDA 69075

69074 RKLQEIIEETFSVNGTPSVGDFFPALRWVDRLRGKVGSLKKLQARRDAMVTGLIDD 68907

68906 HRQWRSGSAGDGDQDKEKKGVIDALLALQETDPDHYTDNVVKGIIL 68769 (0?)

68310 SLLFAGTDTSALTIEWAMAQLVTHPETMKKARAEIDANVGTARLVEEADMANLPYIQCVI 68131

68130 KETLRLRTAGPVIPAHEAMEDTTVGGFRVARGTKVLVNAWAIHRDGDVWDAPEEFRPERF 67951

67950 VDSDAGGAVTAPMMPFGLGRRRCPGEGLAVRVVGVSVAALVQCFDWEVGDDDVVDMTE 67777

67776 GGGLTMPMATPLAAVCRPREFVKTILSTS* 67687

 

note there are 2426.1b and 2426.1c that are different from this

 

#268

>aaaa01010047.1a CYP81N2 (indica cultivar-group) join with c and d

5663 HPEAMTKVRPEIDANVGAARLVEEPDMASLPYLQCVVKETLRLRPVGPVIPAHEAMEDCK 5484

5483 VGGYHVRRGTMILVNAWAIHRDGDVWGSPEEFRPERFMDDGAGAGAVTAVTAPMLPFGLG 5304

5303 RRRCPGEGLAVRLVGLTVAALVQCFDWEIGEGGAVDMAEGGGLTMPMATPLAAVCRPREF 5124

5123 VKTVVSDCF* 5094

 

>aaaa01010047.1d CYP81N2 (indica cultivar-group) (plus strand)

join with a and c

9533 DASLVVVVGVLFLMVAVVVMTRLGDGGAAPSPPAMPVLGHLHLIKKPLHRSLAEVAARVG 9712

9713 AAPVVSLRLGARRALLVSTHAAAEECFTACDAAVAGRPRLLAGDVLGYGHTTVVWASHGD 9892

9893 HWRALRRLLGVELFSN

     ARLAALAADRRAEVASLVDAVLRDAAAGGGGGGTVTLRPRLFELV 10075

 

>aaaa01010047.1c CYP81N2 (indica cultivar-group) ortholog of BE039828

join with a and d (minus strand)

7897 ARLSALAADRRAEVASLVDAVLRDAAAGGGGGGTVTLRPRLFELVLNVMLRAVTARRHAG 7718

7717 DETRRFQEIVEETFAASGAPTVGDFFPALRWVDRLRGVVATLQSLQKRRDAFVAGLVDDH 7538

7537 RRTRRAAAAAADKDQKKNGIIDALLTLQETDPDHYTDNVVKGIVLVLLTAGTDTSALTTE 7358

7357 WAMAQLVAHPEAMTKVR 7307
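
The strand labels on these fragments (plus strand, minus strand) can be read from the flanking coordinates: a line whose start coordinate is larger than its end coordinate runs on the minus strand. The sketch below is a hypothetical helper, not part of the original annotation. It assumes a line has the simple form "start SEQUENCE end" with no trailing notes, and that each residue (including a terminal * for the stop codon) covers 3 nt, so the span should equal 3 times the sequence length.

```python
import re

# Hypothetical parser for lines of the form "start SEQUENCE end".
# Lines carrying extra notes such as "(0)" or "(fs)" are not handled here.
LINE_RE = re.compile(r"^\s*(\d+)\s+([A-Za-z*]+)\s+(\d+)\s*$")

def parse_coord_line(line):
    """Return coordinates, strand, and a span check, or None if the line does not fit."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    start, seq, end = int(m.group(1)), m.group(2), int(m.group(3))
    strand = "plus" if end >= start else "minus"
    span_nt = abs(end - start) + 1
    return {
        "start": start,
        "end": end,
        "seq": seq,
        "strand": strand,
        "span_nt": span_nt,
        # True when the nucleotide span matches 3 nt per listed residue
        "consistent": span_nt == 3 * len(seq),
    }

minus_line = parse_coord_line("7357 WAMAQLVAHPEAMTKVR 7307")
plus_line = parse_coord_line(
    "9533 DASLVVVVGVLFLMVAVVVMTRLGDGGAAPSPPAMPVLGHLHLIKKPLHRSLAEVAARVG 9712"
)
```

A line that fails the span check is not necessarily wrong; it may simply cross an intron or a frameshift that the curator noted separately.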

 

These fragments point in different directions and may be out of order.  One way to assemble an intact gene is to join 1d to 1c to 1a

 

This gives:

>aaaa01010047.1dca hybrid CYP81N2 (indica cultivar-group) ortholog of AL772426.1b BE039828 46% to 81D3

9491 MVGLEVATTAVTGGDASLVVVVGVLFLMVAVVVMTRLG

     DGGAAPSPPAMPVLGHLHLIKKPLHRSLAEVAARVG 9712

9713 AAPVVSLRLGARRALLVSTHAAAEECFTACDAAVAGRPRLLAGDVLGYGHTTVVWASHGD 9892

9893 HWRALRRLLGVELFSN

7897 ARLSALAADRRAEVASLVDAVLRDAAAGGGGGGTVTLRPRLFELVLNVMLRAVTARRHAG 7718

7717 DETRRFQEIVEETFAASGAPTVGDFFPALRWVDRLRGVVATLQSLQKRRDAFVAGLVDDH 7538

7537 RRTRRAAAAAADKDQKKNGIIDALLTLQETDPDHYTDNVVKGIVLVLLTAGTDTSALTTE 7358

7357 WAMAQLVA

5663 HPEAMTKVRPEIDANVGAARLVEEPDMASLPYLQCVVKETLRLRPVGPVIPAHEAMEDCK 5484

5483 VGGYHVRRGTMILVNAWAIHRDGDVWGSPEEFRPERFMDDGAGAGAVTAVTAPMLPFGLG 5304

5303 RRRCPGEGLAVRLVGLTVAALVQCFDWEIGEGGAVDMAEGGGLTMPMATPLAAVCRPREF 5124

5123 VKTVVSDCF* 5094
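
The d, c, a join used for this hybrid can be sketched in code. The function below is an illustration only, not the procedure used to build the hybrid: it appends one peptide fragment to another and collapses the longest exact suffix/prefix overlap. The name join_with_overlap and the min_overlap parameter are assumptions introduced here.

```python
def join_with_overlap(left, right, min_overlap=5):
    """Append right to left, collapsing the longest exact suffix/prefix overlap.

    If no overlap of at least min_overlap residues exists, the two
    fragments are simply concatenated.
    """
    max_k = min(len(left), len(right))
    for k in range(max_k, min_overlap - 1, -1):
        if left.endswith(right[:k]):
            return left + right[k:]
    return left + right

# End of fragment 1c and start of fragment 1a, copied from the listing above.
end_of_1c = "WAMAQLVAHPEAMTKVR"
start_of_1a = "HPEAMTKVRPEIDANVGAARLVEE"
joined = join_with_overlap(end_of_1c, start_of_1a)
```

Exact matching is enough for the 1c to 1a junction, where both fragments share HPEAMTKVR. It would not merge the 1d to 1c junction, because the two fragments differ at one residue there (ARLAAL in 1d, ARLSAL in 1c); a tolerant comparison or a manual cut, as in the hybrid above, is needed in that case.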

 

>AL772426.1b $F CYP81N2 = ortholog of aaaa01010047.1dca hybrid 99%

BE039828 (partial) similar to Jerusalem artichoke CYP81B1

95445 MAGLEVATTAVTGGDASLVVVVGVLFLMVAVVVMTRLGDGGAAPSPPAMPVLGHLHLIKKPL 95260

95259 HRSLAEVAARVGAAPVVSLRLGARRALLVSTHAAAEECFTACDAAVAGRPRLLAGDVLG 95083

95082 YGHTTVVWASHGDHWRALRRLLGVELFSNARLAALAADRRAEVASLVDAVLRDAAAGGGG 94903

94902 GGTVTLRPRLFELVLNVMLRAVTARRHAGDETRRFQEIVEETFAASGSPTVGDFFPALR 94726

94725 WVDRLRGVVATLQSLQKRRDAFVAGLVDDHRRTRRAAAAAADKDQKKNGIIDALLT 94558

94557 LQETDPDHYTDNVVKGIVLVLLTAGTDTSALTTEWAMAQLVAHPEAMTKVRAEIDANVG 94381

94380 AARLVEEADMASLPYLQCVVKETLRLRPVGPVIPAHEAMEDCKVGGYHVRRGTMILVNAW 94201

94200 AIHRDGDVWGSPEEFRPERFMDDGAGAGAVTAVTAPMLPFGLGRRRCPGEGLAVRLVGLT 94021

94020 VAALVQCFDWEIGEGGAVDMAEGGGLTMPMATPLAAVCRPREFVKTVVSDCF* 93862

 

note aaaa01010047.1b = aaaa01002713.1 #98

 

#485

>aaaa01002187.1 CYP81N3P (indica cultivar-group) 2 diffs with AL772426.1

16247 GPAAGAARVDARRGTGERFTACDAAMAGRPPAARR 16143

 

>AL772426.1 CYP81N3P P450 fragment

63156 PSQPVLGHLHLLKRPPLHRSX 63097

63092 AAAARAPLVSLRLGRAAGAARVDARRGAGERFTACDAAMAGRPPAARR 62949

 

#98

>aaaa01002713.1 $FI CYP81P1 (indica cultivar-group) runs off end of clone

aaaa01010047.1b (indica cultivar-group) ortholog to AL772426.1c AF140486

new 81 subfamily

3811 MEISQAFVFASLLLLLLLTWLLFHLLSYQAPPPNGDGGRRIPSPPALPVVGHLHLLKKPL 3632

3631 HRSLAALAARYGGGAGLLLLRFGARPVVLVSSQAATDECFTAHDAALAGRPGLASRRLL 3455

3454 TDGCPTIATAGHGARWRHLRRLATVHALCARRLAATSPARDAEARAMAARLYSSSSSSS 3278

3277 AASAVVVGVKPAAYGFVASVIMTMVAGERMAEEDVLRFKAITEAGLAAAGAANRQDFLPF 3098

3097 LRLLDFGRARRRLAGIAKERHDFGQRIVDEYRRRHRRRLAVAADDSSSSPPRRTVIGDL 2921

2920 LRQQESSPESYADEVIRTVCL 2858 (0?)

5877 SLLQAGTDTSASTIEWAMALLLNNPDVLRKATDEIDSVVGMSRLLQEPDLANLPY 6041

6042 LRCIITETLRLYPLAPHLVPHEASRDCMVAGHVIARGTMVLVDVYSMQRDPRVWEDPDKF 6221

6222 IPERFKGFKVDRSGWMMPFGMGRRKCPGEGLALRTVGMALGVMIQCFQWERLGKKKVDMS 6401

6402 EGSGLTMPMAVPLMAMCLPRVEMESVLKSL* 6494

 

>AL772426.1c = CYP81P1 ortholog of AAAA01002713.1 and aaaa01010047.1b 99% to these two

AF140486 mRNA 57% to C97610 57% to CYP81G1 56% to 81D2 = AU173235 clone R1815

99775 MEISQAFVFASLLLLLLLTWLLFHLLSYQAPPPNGDGGRRIPSPPALPVVGHLHLLKKPL 99596

99595 HRSLAALAARYGGGAGLLLLRFGARPVVLVSSQAAADECFTAHDAALAGRPGLASRRLLT 99416

99415 DGCPTIATAGHSARWRHLRRLATVHALCARRLAATS 99308 (fs)

99308 PARDAEARAMAARLYSSSSSSSAASAVVVGVKPAAYGFVASVIMSMVAGERMAEEDVLRF 99129

99128 KAITEAGLAAAGAANRQDFLPFLRLLDFGRARRRLAGIAKERHDFGQRIVDEYRRRHRRR 98949

98948 LAVAADDFSSSPPRRTVIGDLLRQQESSPESYADEVIRTVCL 98823 (0?)

98238 SLLQAGTDTSASTIEWAMALLLNNPDVLRKATDEINSVVGMSRLLQEPDLANLPYLRCII 98059

98058 TETLRLYPLAPHLVPHEASRDCMVAGHVIARGTMVLVDVYSMQRDPRVWEDPDKFIPERF 97879

97878 KGFKVDGSGWMMPFGMGRRKCPGEGLALRTVGMALGVMIQCFQWERVGKKKVDMSEGSGL 97699

97698 TMPMAVPLMAMCLPRVEMESVLKSL* 97621

 

#111

>aaaa01003110.1 $FI CYP84A5 (indica cultivar-group) ortholog of AC073867.4b $F 99%

15437 MADMVKFTMEWLQDPLSLAIVVTVAVLIMRMQRRRAAPFPPGPKPLPIVGNMAMMDQLTH 15616

15617 RGLAALAKEYGGLMHLRLGRLHAFAVSTPEYAREVLQAQDGAFSNRPATTAIAYLTYDRA 15796

15797 DMAFAHYGPFWRQMRKLCVVKLFSRRRAETWLAVRDESAALVRAVAASRGEAAVNLGELI 15976

15977 FNLTKNVIFRAAFGTRDGEGHDEFIAILQEFSKLFGAFNIGDFIPWLSWADTNGINARLV 16156

16157 AARTALDRFIDKIIDEHMERGKNPDDADADMVDDMLAFLAEAKPHAGKAAAAAAAAGDGA 16336

16337 DDLQNTLRLTRDNIKAIIM 16393

20528 DVMFGGTETVASAIEWAMAEMMHSPDDLRRVQEELAAVVGLGRDVAESDLDKLPFLRC 20701

20702 VIKETLRLHPPIPILLHETAADCLVAGYSVPRGSRVMVNVWAINRDRAAWGPDADAFPPS 20881

20882 RFAAGAAAEGLDFRGGCFEFLPFGSGRRSCPGMALGLYALELAVARLAHGFNWSLPDGMK 21061

21062 PSELDMSDIF 21091

 

>AC073867.4b $F CYP84A5 chromosome 10 clone OSJNBa0055O03, 1 ordered pieces 63% to CYP84A

BE230538.1 99AS755 Rice Seedling cDNA clone 99AS755.Length = 501

68246 MADMVKFTMEWLQDP

68196 LSLAIVVTVAVLIMRMQRRRAAPFPPGPKPLPIVGNMAMMDQLTHRGLAALAKEYGGLM 68020

68019 HLRLGRLHAFAVSTPEYAREVLQAQDGAFSNRPATTAIAYLTYDRADMAFAHYGPFWRQM 67840

67839 RKLCVVKLFSRRRAETWLAVRDESAALVRAVAASRGEAAVNLGELIFNLTKNVIFRAA 67666

67665 FGTRDGEGHDEFIAILQEFSKLFGAFNIGDFIPWLSWADTNGINARLVAARTALD 67501

67500 RFIDKIIDEHMERGKNPDDADADMVDDMLAFLAEAKPHAGKAAAAAAGAGDGADDLQNTL 67321

67320 RLTRDNIKAIIM 67285

62628 DVMFGGTETVASAIEWAMAEMMHSPDDLRRVQEELAAVVGLGRDVAESDLDKLPFLRC 62455

62454 VIKETLRLHPPIPILLHETAADCLVAGYSVPRGSRVMVNVWAIARDRAAWGPDADAFRP 62278

62277 SRFAAGAAAEGLDFRGGCFEFLPFGSGRRSCPGMALGLYALELAVARLAHGFNWSLPDGM 62098

62097 KPSELDMSDIFGLTAPRATRLSAVATPRLTCPLY* 61993

 

#148

>aaaa01004359.1 CYP84A6 (indica cultivar-group) ortholog of AC121490.1 AQ577123

deletion of I-helix to heme

1854 MDPWLVLWLVLASMAFALLHLRRRARRGAPPLPPGPRPLPIIGNMLMMDQLTHRGLAAMAARY 2042

2043 GGLLHLRLGRVHMVVVSSPEHAREVLQVQDGDFSNRPASIAIAYLTYGRADMAFSHYGHF 2222

2223 WRQVRKLSALRLFSRRRAQSWRAVRDESAKLVGAIARRAGEAVDLGELIFGLTKDVIFRA 2402

2403 AFGTRDGGGHGELEVLLQEFSKLFGAFNVGDFIPWLAWLDPHGINRRLRAARAALDSVID 2582

2583 RIIDEHVSNPAGDEDADMVDDMLAFLDEAGRDHTGGGGELQGTLRLTR

     gap in seq.

     GRRACPAIVLGMYELELVVARLVHAFGWAPPGGVAPEE 2942

2943 LDMADGFGLTAPRAARLRAVPTPRLTCPM* 3032

 

>AC121490.1 $F CYP84A6 (japonica cultivar-group) chromosome 3 clone

AQ577123 nbxb0090O10r 57% to CYP84 C-term

92194 MDPWLVLWLVLASMAFALLHLRRRARRGAPPLPPGPRPLPIIGNMLMMDQLTHRGLAAMA 92015

92014 ARYGGLLHLRLGRVHMVVVSSPEHAREVLQVQDGDFSNRPASIAIAYLTYGRADMAFSHY 91835

91834 GHFWRQVRKLSAVRLFSRRRAQSWRAVRDESAKLVGAIARRAGEAVDLGELIFGLTKDVI 91655

91654 FRAAFGTRDGGGHGELEVLLQEFSKLFGAFNVGDFIPWLAWLDPHGINRRLRAARAALDS 91475

91474 VIDRIIDEHVSNPAGDEDADMVDDMLAFLDEAGRDQTGGGGELQGTLRLTRDNIKAIIM (0)

      DFVFGGTE

      TVASAIEWAMAELLHSPGDLRRLQAELADVVGLGRGVEEGDLEKLPFLRCVAMETLRLHP

      PIPLLLHEAAADCVVGGYSVPRGARVVVNVWSVGRDAGAWKGDAGAFRPARFMAGGEAAG

      MDLRGGCFELLPFGSGRRACPAIVLGMYELELVVARLV 90725

90724 HAFGWAPPGGVAPEELDMADGFGLTAPRAARLRAVPTPRLTCPM* 90590

 

#354

>aaaa01015479.1 $FI CYP84A7 (indica cultivar-group) ortholog of AQ160861

4482 MEYYSQDAYFVTASLISIAFFLWYASRLRRTAILLPPGPPGLPVLGNLLSVHQFTHRGLA 4661

4662 KLSKIHGGFFHLRVGQVNVFVVSSPETVREIIHENDSVFSHRPVTAAMVYVSYDLADMAF 4841

4842 AHYGPFWRQMRKLCVLKLFSPRRDVSWRVVRGEVDALVRSVAELRRVAGSVGDLVFKFAT 5021

5022 NVTFRAAFGAQSREDEKVFVDIILELSEIFMAFNMGDYIPCLGWLDLNGIGKRMAAARHA 5201

5202 LDVFIDRIIDEHLAKLRNGDVSASDMVDDMIAYLVDAPGGRHKRADGVELGDLHLTRDNI 5381

5382 KGLIMARN (0)

     DIMFGGTKTVASTVEWALSELLRNPDEL 5561

5562 RRAQDELAGVVGLRRRVNQDDLDNLPHLRCVTKEVLRLHPPLPLLLRESLHDCAIG (1?) 5738

5731 GYTVPRGSRIWINNWAMCRDEVLWGTDAAAFRPSRFADESARVEFKGGDFQYLPFGSGRR 5910

5911 SCPGMQLGMFAVELGLAELLHCFDWSLPAGTEPLELDMDDVFGLTAPKAERLCAVPSPRL 6090

6091 SCPLL* 6108

 

>AQ160861 CYP84A7 (partial) N-terminal to C-helix 59% to 84A1 cannot extend

ortholog of aaaa01015479.1 $FI 94%

MEYYSQDAYFVTASLISIAFFLWYASRLRRTAIL

LPPGPPGLPVLGNLLSVHQFTHRGLAKLSKIHGGFFHLRVGQPNVFVVSSPETVRE

IIHENDSVFSHCPVTVAMVYVXYDLADMAFAHYGPFWRQMRKLCVFKLLSPR

RDVLWRCVRGKVDALVRSV

 

#213

>aaaa01007228.1 $PI CYP84A8P (indica cultivar-group) orth of AL606587.1 100%

3578 MESWWLPTWQPLLVLLPTMLLLYHTVSSWHCGERCLPLPPGPRGLPFVGNILHTSDMTHR 3757

3758 GLAQLASRYGGLLHLRLGRLRTVVVSTPEMARLVLHVNDRAFADRPTTAAIDYLTYDRAD 3937

3938 MVFAPYGP 3961

4166 YGPLWRQLRKLCINRLFNRRRAASWAAVHDGVDSLLREVTKNSGAVVNVGELVFGMSMKI 4345

4346 TLRLRLELAGGVPGMQLGMLAVELALARLLHGFDWSLPGGTGSAGELDMEETYGLTAPRA 4525

4526 VRLSAVPVPRLSHL 4567

 

>AL606587.1 $P CYP84A8P chromosome 4 clone OSJNBa0011L07,

probable pseudogene of AC073867 68246-61993

29446 MESWWLPTWQPLLVLLPTMLLLYHTVSSWHCGE

29347 RCLPLPPGPRGLPFVGNILHTSDMTHRGLAQLASRYGGLLHLRLGRLRTVVVSTPEMARL 29168

29167 VLHVNDRAFADRPTTAAIDYLTYDRADMVFAPYGP 29063 no GT-AG boundary

28858 YGPLWRQLRKLCINRLFNRRRAASWAAVHDGVDSLLREVTKNSG-AVVNVGELVFGMSMK 28682

28681 ITLR 28670 missing internal sequence from aa 191-467

28642 PGMQLGMLAVELALARLLHGFDWSLPGGTGSAGELDMEETYGLTAPRAVRLSAVPVPRLSHL* 28454

 

#171

>aaaa01005294.1 CYP85A1 (indica cultivar-group) orth of AC092778.2 $F chr 3 CYP85A like

8096 NTTSSSSSTKSERFCRYGSVFRTHILGCPTVVCMEAELNRRALASEGRGFVPGYPQSMLD 8275

8276 ILGRNNIAAVQGPLHRAMRGAMLSLVRPAMIRSSLLPKIDAFMRSHLAAWSSS 8434

8435 SSSSAVVDIQAK 8470

9375 SDALKAELYTLVLGTISLPINLPGTNYYQGFK

     ARKKLVAMLEQMIAERRSSGQVHDDMLDALLTGVEGTREKLTDEQIIDLIITLIYS 9731

9912 FDIRKGKAPEDAIDWNDFKSMTFT 9983

10063 TMQVIFETLRLATVVNGLLRKTTQDVEMNG

      YVIPKGWRIYVYTREINYDPFLYPDPMTFNPWRWL 10356

10481 HFMLFGGGSRMCPGKEVGTVEIATFLHYFVTQYR

      WEEEGNNTILKFPRVEAPNGLHIRV 10747

 

>AC092778.2 $F CYP85A1 chromosome 3 clone OSJNBa0015G17, 59% to 85A2

AQ271015 nbxb0015G17f 173-222 56% to tomato CYP85

AQ159479 nbxb0014E19f 75% identical to N-term of 85 49-71

AQ795731.1 nbxb0057K13r Rice BAC Length = 728 76% to tomato CYP85

C97147 38% identical to C73729 71% to 85 324-401 region

AU100843.1 Rice callus cDNA clone C52418.Length = 710

BAC45000.1 whole protein seq.

105850 MVLVAIGVVVAAAVVVSSLLLRWNEVRYSRKRGLPPGTMGWPLFGETTEFLKQGPSFMKARRL (?) 106038

       RYGSVFRTHILGCPTVVCMEAELNRRALASEGRGFVPGYPQS 106945

106946 MLDILGRNNIAAVQGPLHRAMRGAMLSLVRPAMIRSSLLPKIDAFMRSHLAAWSSSSSSA 107125

107126 VVDIQAKTKE (0)

       MALLSALRQIAGVSAGPLSDALKAELYTLVLGTISLPI 108109

108110 NLPGTNYYQGFK (0) 108145 contig ends 108418 rest of gene missing

       ARKKLVAMLEQMIAERRSSGQVHDDMLDALLTGVEGTREKLTDEQIIDLIITLIYSGYETMSTTSMM

       AVKYLSDHPKALEQLR (0)

       KEHFDIRKGKAPEDAIDWNDFKSMTFTRA (0)

       VIFETLRLATVVNGLLRKTTQDVEMN (1?)

       GYVIPKGWRIYGYTREINYDPFLYPDPMTFNPWRWL (?)

       EKNMESHPHFMLFGGGSRMCPGKEVGTVEIATFLHYFVTQY (?)

       RWEEEGNNTILKFPRVEAPNGLHIRVQDY*

 

#198

>aaaa01006154.1 CYP86A9 (indica cultivar-group) orth of AP003442.1 $F chr 1 86A like

8151 DLLSRFMKKRDSKGKAFPEDVLQWIALNFVLAGRDTSSVALSWFFWTLMQRRDVERKVVL 7972

7971 EIASVLRETRGDDTARWTEEPLNFDELERLVYLKAALTETLRLYPSVPQDSKYVVADDVL 7792

7791 PDGTVVPAGSAITYSIYSVGRMESIWGKDCAEFRPERWLSADGSRFEPVKDAYRFVAFNG 7612

7611 GPRTCLGKDLAYLQMKSIASAVLLRNSVELVPGHKVEQKMSLTLFMKNGLRVHVKPRD 7438

 

>AP003442.1 $F CYP86A9 chromosome 1 clone B1096A10 72% to 86A1

AQ954084.1 nbeb0054A03r CUGI Rice BAC genomicLength = 403 57% to 86A1

D41651  71% to 86A1 almost identical to AP003442

49790 MAAAAVALASAYMVWFWALSRRLSGPRMWPLVGSLPSVVLNRARVHDWIADNLRATGDAATYQTCILPLPFLARRQGLV 50026

50027 TVTCNPRNLEHILRARFDNYPKGPMWQASFHDLLGQGIFNSDGETWLIQRKTAALEFTTR 50206

50207 TLRQAMARWANRSIKYRLWRILDDHCNAAASVDLQDLLLRLTFDNICGLTFGKDPETLSP 50386

50387 GLPENPFANAFDEATEATMQRFLFPSLLWRIKKAFGVGSERSLRDSLAVVDRHMTETIAA 50566

50567 RKATPSDDLLSRFMKKRDSKGKAFPEDVLQWIALNFVLAGRDTSSVALSWFF 50722

50723 WTLMQRRDVERKVVLEIASVLRETRGDDTARWTEEPLNFDELERLVYLKAALTETLR 50893

50894 LYPSVPQDSKYVVADDVLPDGTVVPAGSAITYSIYSVGRMESIWGKDCAEFRPERWLSAD 51073

51074 GSRFEPVKDAYRFVAFNGGPRTCLGKDLAYLQMKSIASAVLLRNSVELVPGHKVEQKM 51247

51248 SLTLFMKNGLRVHVKPRDIASYVEPSEPAPPQGSLVIPTTTAAAA* 51385

 

#232

>aaaa01008150.1 CYP86A10 (indica cultivar-group) orth of AP004139.1 $F chr 2,

9567 DDLLSRFMRKGSYSDESLQHVALNFILAGRDTSSVALSWFFWLVSTHPAVERKIVREL 9394

9393 CTVLAASRGADDPALWLAAPLNFEELDQLVYLKAALSETLRLYPSVPEDSKHVVADDYLP 9214

9213 DGTFVPAGSSVTYSIYSAGRMKTVWGEDCLEFRPERWLSADGSKFEPHDSYKFVAFNAGP 9034

9033 RICLGKDLAYLQMKNIAGSVLLRHRLAVAQGHRVEQKMSL 8914

 

>AP004139.1 $F CYP86A10 chromosome 2 clone OJ1486_E07 70% to 86A2

55281 MEVGTWAVVVAVAAAYMTWFWRMSRGLS

55335 GPRVWPVVGSLPGLVQHAENMHEWIAANLRRAGGTYQTCIFAVPGVARRGGLVTVTCDP 55511

55512 RNLEHVLKSRFDNYPKGPFWHAVFRDLLGDGIFNSDGETWVAQRKTAALEFTTRTLRTAM 55691

55692 SRWVSRSIHHRLLPILDDAAAGKAHVDLQDLLLRLTFDNICGLAFGKDPETLAKGLPENA 55871

55872 FASAFDRATEATLNRFIFPEYLWRCKKWLGLGMETTLASSVAHVDQYLAAVIKARKLELA 56051

56052 GNGKCDTVAMHDDLLSRFMRKGSYSDESLQHVALNFILAGRDTSSVALSWFFWLV 56216

56217 STHPAVERKVVHELCAVLAASRGAHDPALWLAAPFTFEELDSLVYLKAALSETLRLYP 56390

56391 SVPEDSKHVVADDYLPDGTFVPAGSSVTYSIYSAGRMKTVWGEDCLEFRPERWLSADGSK 56570

56571 FEPHDSYKFVAFNAGPRICLGKDLAYLQMKNIAGSVLLRHRLAVAQGHRVEQKMSLTL 56744

56745 FMKNGLRMEVRPRDLAPVADELRGADVRATAPCA* 56849

 

#5

>aaaa01000236.1 CYP86A11 (indica cultivar-group) orth of AL606687.1 $F chr 4 100%

34409 VAAVAAYMAWFWRMSRGLSGPRVWPVVGSLPGLVRHAEDMHEWIAANLRRTRGTYQTCIF 34588

34589 AVPGLARRGGLVTVTCDPRNLEHVLKSRFDNYPKGPFWHGVFGDLLGDGIFNSDGETWVA 34768

34769 QRKTAALEFTTRTLRTAMSRWVSRSIHSRLLPILSDAAAAGGGGGGATVDLQDLLLRLTF 34948

34949 DNICGLAFGKDPETLARGLPENDFASAFDRATEATLNRFIFPECVWRFKKWMGLGMETTL 35128

35129 ARSVQHVDRW 35158

9099 SVPEDSKHVVADDVLPDGTFVPAGSSVTYSIYSAGRMKTVWGDDCLEFRPERWLSADGTK 9278

9279 FEPHDSFRFVAFNAGPRICLGKDLAYLQMRNIAGSVLLRHRLAVAPGHRVEQKMSLTLF 9455

9456 MKHGLRMEVRPR 9491

 

>AL606687.1 $F CYP86A11 chromosome 4 clone OSJNBa0084K11 similar to AP004139

orth aaaa01000236.1

C91843        72% IDENTICAL TO 86A4    4/98 47 TO 128 REGION

AU093465.1 Rice panicle cDNA clone E31913.Length = 609

BE228888.1 98AS3245 Immature Seed cDNA clone 98AS3245. Length = 483 76% to 86A2

94178 MEAGTWAVVVAAVAAYMAWFWRMSRGLSGPRVWPVVGSLPGLVRHAEDMH

      EWIAANLRRTRGTYQTCIFAVPGLARRGGLVTVTCDPR 94441

94442 NLEHVLKSRFDNYPKGPFWHGVFGDLLGDGIFNSDGETWVAQRKTAALEFTTRTLRTA 94615

94616 MSRWVSRSIHSRLLPILSDAAAAGGGGGGATVDLQDLLLRLTFDNICGLAFGKDPETLAR 94795

94796 GLPENDFASAFDRATEATLNRFIFPECVWRFKKWMGLGMETTLARSVQHVDRYLSAVIKA 94975

94976 RKLELAAGNGKGDASSATPHDDLLSRFMRKGTYSDESLQHVALNFILAGRDTSSVA 95143

95144 LSWFFWLVSTHPAVERKIVRELCTVLAASRGADDPALWLAAPLNFEELDQLVYLKAALSE 95323

95324 TLRLYPSVPEDSKHVVADDVLPDGTFVPAGSSVTYSIYSAGRMKTVWGDDCLEFRPERWL 95503

95504 SADGTKFEPHDSFRFVAFNAGPRICLGKDLAYLQMRNIAGSVL

      LRHRLAVAPGHRVEQKMSLTLFMKHGLRMEVRPRDLAPIVDELRGAGEYAAAARATAACA* 95815

 

#158

>aaaa01004523.1 CYP86A12P (indica cultivar-group) orth AL606448.1 $P two short pseudogene fragments 1 diff

1027 WRVRAFGDLLGNGIFNSDGETWVAQRKTPA 938

 

>AL606448.1 $P CYP86A12P two short pseudogene fragments = D15209 6 diffs with AL606687.1

D15209  69% IDENTICAL TO 86A1 Arabidopsis T04172 EST  5/93  7/98 113-131 REGION

very similar to C91843 identical to AL606448.1

135204 AFGDLLGNGIFNSDGETWVAQRKTLA 135278

135631 STSIKARKLELAAESGKGDASS 135696
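
Notes such as "1 diff" or "2 diffs" between orthologs can be reproduced by counting mismatched positions over an already aligned, equal-length region. The sketch below assumes an ungapped alignment; the function names are hypothetical, and the percentages quoted elsewhere in this file may have been computed over different regions or with a different tool.

```python
def count_diffs(a, b):
    """Count mismatched positions between two aligned, equal-length peptides."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for x, y in zip(a, b) if x != y)

def percent_identity(a, b):
    """Percent of identical positions over the aligned region."""
    return 100.0 * (len(a) - count_diffs(a, b)) / len(a)

# First pseudogene fragment of CYP86A12P, copied from the entries above.
# The indica sequence is trimmed of its leading WRVR so both cover the same region.
indica = "AFGDLLGNGIFNSDGETWVAQRKTPA"    # aaaa01004523.1
japonica = "AFGDLLGNGIFNSDGETWVAQRKTLA"  # AL606448.1
diffs = count_diffs(indica, japonica)
identity = percent_identity(indica, japonica)
```

For this pair the count agrees with the "1 diff" note on the aaaa01004523.1 header.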

 

>AP003767.1 $P CYP86A12P two short pseudogene fragments only 4 diffs with AL606448.1

72817 FGDLLGNGIFNFDGETWVAQHKTTA 72891

73228 STSIKARKLELAAGSGKSDASS 73293

 

#158

>aaaa01028711 CYP86A12P (indica cultivar-group) orth AL606448.1 $P

two short pseudogene fragments = D15209 see aaaa01004523.1 for ortholog

82 AFGDLLGNGIFNSDGETWVAQRKTLA 159

514 STSIKARKLELAAESGKGDASS 577

 

#177

>aaaa01005521.1 CYP86A13P (indica cultivar-group) orth AP003230.1 $P pseudogene fragment 100%

8618 PFWHTVFRDLLGDGIFNSDGETWVA*RNTAALEFTT 8487

 

>AP003230.1 $P CYP86A13P pseudogene fragment at C-helix 2 diffs with AP004139.1

65348 PFWHTVFRDLLGDGIFNSDGETWVA*RNTAALEFTT 65241

 

#380

>aaaa01018882.1 CYP86A14P (indica cultivar-group) 91% to AP004139.1 $F

4729 PFWHAVFRDLLGDGIFNSEGETWVTQRKTTALEFTTMAAAAAVA 4598

 

No japonica ortholog found 9/11/02

 

#299

>aaaa01012078.1 CYP86A15P (indica cultivar-group) 91% AP004139.1 $F

identical to AP005298.1 and AP005389.1

4312 PFWHAVFRDLLGDNIFNSDGETWVAQWKMAALEFTTTATAAAAAGVDRTL 4163

 

>AP005389.1 CYP86A15P (japonica cultivar-group) chr 8 C-helix region

79974 PFWHAVFRDLLGDNIFNSDGETWVAQWKMAALEFTTTATAAAAAGVDRTL 80123

 

>AP005298.1 CYP86A15P (japonica cultivar-group) chr 8

11629 PFWHAVFRDLLGDNIFNSDGETWVAQWKMAALEFTTTATAAAAAGVDRTL 11480

 

#472

>aaaa01098941.1 CYP86A16P (indica cultivar-group)

61% to AP004139.1, 59% to 86As

3   VRLYPSLPYNVKNAVEDDHLPDGTFVPAGTDVVYSPWFMGRSESFWGKNALEFRPERWL 179

RVCACGMNMAILEAKIFTAVMLRHFHVKIIDGEPQDRGYVL

KSGLFMAGGLPLAMTPRARISS*

 

no japonica ortholog found 9/12/02

 

#408

>aaaa01024304.1 CYP86A17P (indica cultivar-group) ortholog of AP003242.1 1 diff

2219 ISYSIFSTGVLQITM*QGEILLHFWPSWL*LVDGKRPFGPMDGHLLLAYN 2070

 

>AP003242.1 $P CYP86A17P 38% to perf region of AP004139.1

pseudogene fragment

11819 ISYSIFSTGVLQITM*QGEILLHFWPSWL*LVDGKRPFGPMDGHLLIAYN 11670

 

#2

>aaaa01000147.1 CYP86B3 (indica cultivar-group) orth of AC087182.8 $F chr 10 100%

MMALPEVQTVELLVAVSIFVAIHSLRQRRSQGLPSWPLVGMLPSLLLGLRGDMYEWLTGV 25899

25898 LASRGGTFTFHGPWLTNLHCVVTSDPRNLEHMLKTKFGSFPKGPYFRDTVRDLLG 25734

25733 DGIFGADDEVWRRQRKAASLEFHSAEFRALTASSLVELVHRRLLRVLGDAEEAGDAVDLQ 25554

25553 DVLLRLTFDNVCMIAFGVDP 25494

16855 VVEDEVFPDGTVLKKGTKVIYAMYTMGRMESIWGEDCREYKPERWLRDGRFMGESAY 16685

16684 KFTAFNGGPRLCLGKDFAYYQMKFAAASILRRYHVRVVDGHPVAPKMALTMYMKHGLKVK 16505

16504 LTKR 16493

 

>AC087182.8 $F CYP86B3 chromosome 10 clone OSJNBa0029C15, 66% to 86B1

orth of aaaa01000147.1

D47545, D41610, D47561, D49287, AU032597 AU067853 77% identical to 86B1

AU067852 N-terminal region

AU096913.1 Rice green shoot Oryza sativa cDNA clone S16487

AU032659 66% identical to contig of D47545 63% to 86B1 same clone = D47454

53685 MDAVMPGAAGAHNATAAAAAGRRGGGIVAGMMALPEVQTVELLVAVSIFVAIHSLRQRRSQGLPSW 53882

53883 PLVGMLPSLLLGLRGDMYEWLTGVLASRGGTFTFHGPWLTNLHCVVTSDPRNLEHMLKTK 54062

54063 FGSFPKGPYFRDTVRDLLGDGIFGADDEVWRRQRKAASLEFHSAEFRALTASSLVELVHR 54242

54243 RLLRVLGDAEEAGDAVDLQDVLLRLTFDNVCMIAFGVDPGCLRPGLPEIPFAKAFEDATE 54422

54423 ATIVRFVTPTAVWRAMRALGVGHERVLQRSLAGVDRFAYDVIRQRKEEVAAGGGGGGGGR 54602

54603 SDLLTIFTKMRDADTGAAAYSDKFLRDICVNFILAGRDTSSVALAWFFWLLNKNPAVEAK 54782

54783 ILEEIDDIVAARRSSPPAPAVAANGADEDDLVFHPEEVKKMEYLHAALSEALRLYPSVPVDHKE 54974 (0)

56977 VVEDEVFPDGTVLKKGTKVIYAMYTMGRMESIWGEDCREYKPERWLRDGRFMGESAYKF 57153

57154 TAFNGGPRLCLGKDFAYYQMKFAAASILRRYHVRVVDGHPVAPKMALTMYMKHGLKVK 57327

57328 LTKRDKSKL* 57357

 

#91

>aaaa01002439.1 CYP86E1 (indica cultivar-group) orth AP004092.1 $F chromosome 2

6997 ARRRGAPVLWPLVGIVPTLFVHRDDIYEWGSAALLRAGGVFPYRGTWGGGSSG 7155

7156 VITSAPANVEHVLRANFGNYPKGPYYRERFVELLGGGIFNAD 7281

7415 LPDVPFARAFELATELSLLRFVTPPFIWKAKRLLRAGSERRLVEATRAVREFAERAVADR 7594

7595 RNEMRKVGSLRGRCDLLSRLMSSAPGADYSNEFLRDFCISFILAGRDTSSVGLAWFFW 7768

7769 LLAGHPDVESRVVGDVLAAGGDIKRMDYLHAALTEAM 7879 frameshift

7881 RLYPPVPVDFKEALADDVLPDGTPVRARQRVIYYTYAIGRDPASWGDDAAAFRPER*MRA 8060

8061 GAFAGGESPFKYAVFNAGPRLCIGKRFAYTQMKTAAAAVLSRFAVEVVPGQEVKPKLT 8234

8235 TTLYMKNGLMV 8267

 

>AP004092.1 $F CYP86E1 chromosome 2 clone OJ1568_B05, similar to 86C3

C99701 43% identical to D39760 60% to 86B1 same clone as C73927

84356 MAIASMAAAAAAAIVGAVREHVRASDLAVAGAVL

84458 FAFSAAVSAVRARRRGAPVLWPLVGIVPTLFVHRDDIYEWGSAALLRAGGVFPYRGTWG 84634

84635 GGSSGVITSAPANVEHVLRANFGNYPKGPYYRERFVELLGGGIFNADGEAWRAQRRAATA 84814

84815 EMHSSRFVEFSVRSIEQLVYGRLVPLAERLSGGGAAVDLQEVLLRFTFDNICAVAFGVDA 84994

84995 GCLADGLPDVPFARAFELATELSLLRFVTPPFIWKAKRLLRAGSERRLVEATRAVREFAE 85174

85175 RAVADRRNEMRKVGSLRGRCDLLSRLMSSAPGADYSNEFLRDFCISFILAGRDTSSVG 85348

85349 LAWFFWLLAGHPDVESRVVGDVLAAGGDIKRMDYLHAAL 85465

85466 TEAMRLYPPVPVDFKEALADDVLPDGTPVRARQRVIYYTYAIGRDPASWGDDAAAFRPER 85645

85646 WMRGGAFAGGESPFKYAVFNAGPRLCIGKRFAYTQMKTAAAAVLSRFAVEVVPGQEVK 85819

85820 PKLTTTLYMKNGLMVRFRRRSPPPPSPPPRHVVADDDDDDDVAAGRHVAVGSCNSNHL* 85996

 

#21

>aaaa01000478.1b CYP87A4 (indica cultivar-group) ortholog of AL607001.1 gene 1

39957 LVLLLRHQARRWRNPRCGGQLPPGSMGLPLVGETFQFFSSDASLDIPPFIRHRLA

39713 RYGPIFKTSLVGHPVVVSADEELNHMVFQQEGQLFQSWYPDSFVEILGKDNVGEQQGAMF 39534

39533 RYLKNMVLRYFGPESLKEGIIRDVERAVSSSLCTWSTLPAVELKEAVST

      MVFDLAASKLLGLEPSRSKILRKSFFDFVRGLISFPLYLPGTAYYSCMQ

39058 QGRRRAMVVLEQVLEERKQSTGLQRGGEAQQHGDFLDYVIQEITKEKPVMTEKMALDLMF 38879

38878 VLLFASFHTTSLALTLAVKLLADHPLVLEELT 38783

      VEHETILKDREAGSELDRITWKEYKSMAFTSQ 38589

35958 YTIPAGWGVMVCPPAVHLNPYIYPDPLTFIP 35866

32561 EINRGSKHFMAFGGGLRFCVGADFSKLQLAIFLHFLVTKYR 32683

 

>AL607001.1a $F CYP87A4 chromosome 4 clone OSJNBA0088I22, gene 1 92976-97891

48% to AC083944.6 57% to 87A2

92976 MYAYVGLVGGAAALVLLLLLRHQARRWRNPRCGGQLPPGSMGLPLVGETFQFFSSDASLDIPPFIRHRLAR  93188

(2)

93267 YGPIFKTSLVGHPVVVSADEELNHMVFQQEGQLFQSW 93377

93378 YPDSFVEILGKDNVGEQQGAMFRYLKNMVLRYFGPESLKEGIIRDVERAVSSSLCTWSTL 93557

93558 PAVELKEAVST 93590 (0)

93686 MVFDLAASKLLGLEPSRSKILRKSFFDFVRGLISFPLYLPGTAYYSCMQ 93832 (0)

93922 GRRRAMVVLEQVLEERKQSTGLQRGGEAQQHGDFLDYVIQEITKEKPVMTEKMALDLMF 94098

94099 VLLFASFHTTSLALTLAVKLLADHPLVLEELT 94194 (0)

      VEHETILKDREAGSELDRITWKEYKSMAFTSQ

94575 VINETVRLANIAPVIFRKALKDIRFN 94652 (1)

97033 GYTIPAGWGVMVCPPAVHLNPYIYPDPLTFIPSRFK 97140 (0)

97486 DKPEINRGSKHFMAFGGGLRFCVGADFSKLQLAIFLHFLVTKYR 97617 (2)

97802 WIPLGASRVVRTPGLEFPDGYRIKVIQRH* 97891

 

#291 combine with #293, #363 reduce gene count by 2

>aaaa01011592.1a CYP87A5 (indica cultivar-group) 85% to AL607001.1c chr 4, gene 3

4th line =

6806 METLVCVAVWAVAMAMVVASVMWAYRWSHPRANGRLPPGSLGLPLLGETLQFFAPNTTCDISPFVKERLN 7018

7111 RYGSIFKTSVVGRPVVVTADPELNYYVFQQEGKLFESWYPDTFTEIFGRDNVGSLHGFMY 7290

7291 KYLKSLVLRLYGQENLRAVLLDETDRACRASLASWAAQPSVELKDSISAV 7440

7554 ILTEGIALDLMFVLLFASFETTSLALTLGVRLLAENPTVLDALT 7685

 

no japonica ortholog found 9/10/02

 

#293 combine with #292, #363 reduce gene count by 2

>aaaa01011592.1b CYP87A5 (indica cultivar-group) 82% to AL607001.1c chr 4, gene 3

first two lines identical to AAAA01011592.1a

4th line = aaaa01016295.1

8837 YGSIFKTSVVGRPVVVTADPELNYYVFQQEGKLFESWYPDTFTEIFGRDNVGSLHGFMYK 8658

8657 YLKSLVLRLYGQENLRAVLLDETDRACRASLASWAAQPSVELKDSISA 8511

8400 MIFDLTAKKLISYEPSKSSENLRKNFVAFIRGLISFPVDIPGTAYHECMK 8251

7781 EEHEAIVRGRKEGCDAAGLTWAEYKSMTFTSQ 7876

 

no japonica ortholog found 9/10/02

 

#363 combine with #292, #293 reduce gene count by 2

>aaaa01016295.1 CYP87A5 (indica cultivar-group) 81% to AL607001.1c chr 4,

gene 3 3rd line = aaaa01011592.1a

8   YLCCVQGRRNAMKVLKKMMRER 73

78  EEPGRQCEDFFDVLIEELGREKPV

LTEGIALDLMFVLLFASFETTSLALTLGVRLLAENPTVLDALT 278

374 EEHEAIVRGRKEGCDAAGLTWAEYKSMTFTSQ 469

1092 VTLEMVRLANIVPGIFRKALQDIEFKG 1187

1285 YTIPAGWGVMVCPPAVHLNPEIYEDPLAFNPWRWQ 1389

2420 EITGGSKHFMAFGGGLRFCVGTDLSKVLIATFIHHLVTKY

     RWKTVKGGNIVRTPGLSFPDGFHVQFFPKN* 2746

 

#363

>aaaa01016295.1 CYP87A5 (indica cultivar-group) 81% to AL607001.1c chr 4, gene 3

3rd line = aaaa01011592.1a

8   YLCCVQGRRNAMKVLKKMMRER 73

78  EEPGRQCEDFFDVLIEELGREKPV

LTEGIALDLMFVLLFASFETTSLALTLGVRLLAENPTVLDALT 278

374 EEHEAIVRGRKEGCDAAGLTWAEYKSMTFTSQ 469

1110 RLANIVPGIFRKALQDIEFKG 1187

1285 YTIPAGWGVMVCPPAVHLNPEIYEDPLAFNPWRWQ 1389

2420 EITGGSKHFMAFGGGLRFCVGTDLSKVLIATFIHHLVTKY

     RWKTVKGGNIVRTPGLSFPDGFHVQFFPK 2740

 

No japonica ortholog found 9/11/02

 

Combined parts from #291, #292, #293, #363 CYP87A5 81% to AL607001.1c

6830 METLVCVAVWAVAMAMVVASVMWAYRWSHPRANGRLPPGSLGLPLLGETLQFFAPNTTCDISPFVKERLN 7018

7111 RYGSIFKTSVVGRPVVVTADPELNYYVFQQEGKLFESWYPDTFTEIFGRDNVGSLHGFMY 7290

7291 KYLKSLVLRLYGQENLRAVLLDETDRACRASLASWAAQPSVELKDSISA 7437

8400 MIFDLTAKKLISYEPSKSSENLRKNFVAFIRGLISFPVDIPGTAYHECMK 8251

8    GRRNAMKVLKKMMRER 73

78  EEPGRQCEDFFDVLIEELGREKPV

    LTEGIALDLMFVLLFASFETTSLALTLGVRLLAENPTVLDALT 278

374 EEHEAIVRGRKEGCDAAGLTWAEYKSMTFTSQ 469

1092 VTLEMVRLANIVPGIFRKALQDIEFKG 1187

1285 YTIPAGWGVMVCPPAVHLNPEIYEDPLAFNPWRWQ 1389

2411 DKVEITGGSKHFMAFGGGLRFCVGTDLSKVLIATFIHHLVTKY

     RWKTVKGGNIVRTPGLSFPDGFHVQFFPKN* 2746
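
The combined entry above is made by reading the peptide segments of the fragment contigs in order. A minimal Python sketch of that joining step follows. It assumes each line has the layout "start PEPTIDE end", with either coordinate optional. join_exon_lines is a hypothetical helper name and is not part of the original annotation work.

```python
import re

# Assumption: the peptide is the first run of capital letters (or '*') on a line.
# Coordinates, blank lines and phase marks such as "(0)" are ignored.
PEPTIDE = re.compile(r"[A-Z*]+")

def join_exon_lines(lines):
    """Concatenate the peptide segments found on each exon line, in order."""
    parts = []
    for line in lines:
        match = PEPTIDE.search(line)
        if match:
            parts.append(match.group(0))
    return "".join(parts)

# Three lines copied from the combined CYP87A5 entry.
combined = [
    "8    GRRNAMKVLKKMMRER 73",
    "78  EEPGRQCEDFFDVLIEELGREKPV",
    "    LTEGIALDLMFVLLFASFETTSLALTLGVRLLAENPTVLDALT 278",
]
protein = join_exon_lines(combined)
print(protein)
```

The sketch only joins text. It does not check that neighbouring fragments overlap or that the reading frame is kept across contigs, so the order of the input lines still has to come from the curator.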

 

#85

>aaaa01002235.1 CYP87A6 (indica cultivar-group) ortholog of AL607001.1c gene 3 chr 4

6264 ALLCAALAAVVALLRWAYRWSHPRSNGRLPPGSLGLPVIGETLQFFAPNPTCDLSPFVKE 6443

6444 RIK

6563 RYGSIFKTSVVGRPVVVSADPEMNYYVFQQEGKLFESWYPDTFTEIFGRDNVGSLHGFM 6742

6743 YKYLKTLVLRLYGQENLKSVLLAETDAACRGSLASWASQPSVELKEGIST

     MIFDLTAKKLIGYDPSKPSQVNLRKNFGAFIRG 7102

7103 LISFPLNIPGTAYHECME

7455 GRKNAMKVLRGMMKERMAEPERPCEDFFDHVIQELRREKPLLTETIALDLMFVLLFASF 7634

7635 ETTALALTIGVKLLTENPKVVDALX 7706

7850 EEHEAIIRNRKDPNSGVTWAEYKSMTFTSQXXXXXX 7945

8422 RLANIVPGIFRKALQDVEIKX 8481

8589 YTIPAGWGIMVCPPAVHLNPEIYEDPLAFNPWRWQ 8693

9679 EITGGTKHFMAFGGGLRFCVGTDLSKVLMATFIHALVTKYR 9804

9888 WRTVKGGNIVRTPGLSFPDGFHIQLFPK 9971

 

>AL607001.1c $F CYP87A6 chromosome 4 clone OSJNBA0088I22, gene 3 136970-140694 61% to 87A2

136970 MAYIALLCAALAAVVALLRWAYRWSHPRSNGRLPPGSLGLPVIGETLQFFAPNPTCDLSPFVKERIKR 137173

(2?)

137287 YGSIFKTSVVGRPVVVSADPEMNYYVFQQEGKLFESWYPDTFTEIFGRDNVGSLHGFM 137460

137461 YKYLKTLVLRLYGQENLKSVLLAETDAACRGSLASWASQPSVELKEGIST (0)

       MIFDLTAKKLIGYDPSKPSQVNLRKNFGAFICG 137820

137821 LISFPLNIPGTAYHECME 137874 (0)

138176 GRKNAMKVLRGMMKERMAEPERPCEDFFDHVIQELRREKPLLTETIALDLMFVLLFASF 138352

138353 ETTALALTIGVKLLTENPKVVDALX 138424 (0)

138574 EEHEAIIRNRKDPNSGVTWAEYKSMTFTSQ (0)

139123 IMEIVRLANIVPGIFRKALQDVEIK 139197 (1?)

139305 XYTIPAGWGIMVCPPAVHLNPEIYEDPLAFNPWRWQ 139421 (0)

140396 XXEITGGTKHFMAFGGGLRFCVGTDLSKVLMATFIHSLVTKYR 140518 (2)

140605 WRTVKGGNIVRTPGLSFPDGFHIQLFPKN* 140694
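
A quick consistency check can be run on exon lines like the ones above. If the flanking numbers are nucleotide coordinates on the clone (an assumption; these notes do not say so explicitly), the span should be about three times the peptide length, with small offsets where an intron splits a codon. The Python sketch below reports that difference. span_mismatch is a hypothetical helper name.

```python
import re

# Assumed line layout: "<start> <PEPTIDE> <end>", optionally followed by a phase mark.
EXON = re.compile(r"^\s*(\d+)\s+([A-Z*]+)\s+(\d+)")

def span_mismatch(line):
    """Return (nucleotide span) - 3 * (number of residues), or None if the
    line does not carry both coordinates. A stop '*' counts as one codon."""
    m = EXON.match(line)
    if m is None:
        return None
    start, peptide, end = int(m.group(1)), m.group(2), int(m.group(3))
    span = abs(end - start) + 1  # abs() also covers minus-strand lines (start > end)
    return span - 3 * len(peptide)

# Last exon line of AL607001.1c, copied from the entry above.
last_exon = "140605 WRTVKGGNIVRTPGLSFPDGFHIQLFPKN* 140694"
print(span_mismatch(last_exon))
```

A result of 0 means the coordinates account for every residue on the line. A nonzero value can point to a partial codon at an exon boundary or to a typo in a coordinate.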

 

AUTHORS   Chaban,C.

  TITLE     Phytochrome response in rice coleoptile - just a matter of auxin?

  JOURNAL   Thesis (2002) Department of Botany, University of Freiburg,

            Freiburg, Germany

AJ459255: the authors assigned the name CYP87A3; it should be CYP87A6

MQPYLQLASLRLATTIPLAPRLYDANLLAASGAAMASSMAYIAL

LCAALAAVVALLRWAYRWSHPRSNGRLPPGSLGLPVIGETLQFFAPNPTCDLSPFVKE

RIKRYGSIFKTSVVGRPVVVSADPEMNYYVFQQEGKLFESWYPDTFTEIFGRDNVGSL

HGFMYKYLKTLVLRLYGQENLKSVLLAETDAACRGSLASWASQPSVELKEGISTMIFD

LTAKKLIGYDPSKPSQVNLRKNFGAFICGLISFPLNIPGTAYHECMEGRKNAMKVLRG

MMKERMAEPERPCEDFFDHVIQELRREKPLLTETIALDLMFVLLFASFETTALALTIG

VKLLTENPKVVDALREEHEAIIRNRKDPNSGVTWAEYKSMTFTSQVIMEIVRLANIVP

GIFRKALQDVEIKGYTIPAGWGIMVCPPAVHLNPEIYEDPLAFNPWRWQGKPEITGGT

KHFMAFGGGLRFCVGTDLSKVLMATFIHSLVTKYSWRTVKGGNIVRTPGLSFPDGFHI

QLFPKN

 

#146

>aaaa01004251.1 $FI CYP87B1 (indica cultivar-group) ortholog of AQ157412 contig

44% to 87A2

2320 MEKSEIFVGAHSYAALCAFTLIIGWLAHWVYRWINPPCNGRLPPGSMGFPIVGETFQFFR 2499

2500 TSPSIDMPIYYKRRLER (2) 2550

2709 YGPIFKTNIGGQHVVISLDPEVNQFIFQQEGKLFQSWFPETTLNIFGKKTLTTYNRTAH 2885

2886 KLIRSFVCKLYGPENVKKSLLPELENSMRESLASWIGKPSVEVNDGVSN (0) 3032

3130 MIFGLAAKHLIGLDITNSGELKKNFQEIFQVMVSIPFPIYFPGTSFYRCMQ (0) 3282

3407 GRRNVWTTLTNVMKKRLSAPGNKFGDLVDLIVEELRSENPTI

     DESFAIDTLSGLLFASFAPLSCTLTTTFKFLNDNPEVFDKLK (0) 3658

3778 EEHEMILKKREGANSGFTWEEYKSLKFSTQ (0) 3867

3965 VVNEINRITTVIPGGFRKALTDVEVN (1)

4133 GYTIPSGWLVMISPMGVHLNPKLFEDPLKFDPWRWT (0)

4343 EEKRISMQRNFMPFGGGIRMCPAVEFNKLFITLFLHIVVTEYR 4471 (2)

4752 WKDIDGGNVKRISEVLVAQEYHIQLVPQT* 4841

 

>AQ157412 CYP87B1 (partial) nbxb0009M10r AQ577511 nbxb0091K23r

C72168 C73926 opposite end C99700

(probably 3 prime untranslated) 46% to 87A2