341 japonica Rice P450 sequences sorted by clan and family. 
There are 9 P450 clans in plants. CYP727A1 is currently not classified to a clan.
Last modified July 8, 2002 there are now over 250 full length japonica P450s in 
this set.  $F means seq is complete $P means pseudogene fragment is already maximal size.  
226 $F + 64 $P = 285 finished japonica sequences or pseudogene fragments
40 partial japonica sequences need new sequence data not yet in Genbank to complete.
Clan  Members

51    51
71    71, 73, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 89, 92, 93, 97, 98, 99
      701, 703, 705, 706, 712 (712 = a 705 subfamily), 719? 723?
72    72, 709, 714, 715, 721
74    74
85    85, 87, 88, 90, 702, 707, 708, 716, 718, 720, 722, 724, 725
86    86, 94, 96, 704
97    97
710   710
711   711

CYP51 clan contains only one family CYP51 (8 sequences)

>AB025047 CYP51 rice (partial)   80% to 51A2 missing N-term 64 aa
BE040549.1 OE08G10 OE Oryza sativa cDNA 5' Length = 255 I-helix CYP51
BE230288.1 99AS641 Rice Seedling cDNA clone 99AS641.Length = 586
BE230302.1 99AS655 Rice Seedling cDNA clone 99AS655.Length = 627
BE607441.1 OE202C10 OE cDNA clone ID707 C-term CYP51 Length = 428
REEYARLGSVFTVPILSRKITFLIGPEVSAHFFKGNEAEMSQQE
VYKFNVPTFGPGVVFDVDYSVRQEQFRFFTEALRANKLRSYVDQMVVEAEEYFSKWGE
SGTVDLKYELEHLIILTASRCLLGREVREKLFDDVSSLFHDLDNGMQPVSVIFPYLPI
PAHRRRDRARQRLKEIFATIIKSRKASGRAEEDMLQCFIDSKYKSGRSTTEGEITGLL
IAALFAGQHTSSITSTWTGAYMLRFKQYFAAAEEEQKEVMKRHGDKIDHDILAEMDVL
YRCIKEALRLHPPLIMLLRQSHNDFSVTTKDGKEFDIPKGHIVATSPAFANRLPHIFK
NPDSYDPDPYAPGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWTHLLRNFEF
ELVSPFPETNWKAMVVGIKDEVMVNFKRRKLVVDN

>AP005448.1a (japonica cultivar-group) chromosome 7 21 June 2002
29768 TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 29944
29945 PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 30049

>AP005448.1b $F (japonica cultivar-group) chromosome 7 21 June 2002
32724 MDHVTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 32900
32901 RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 33080
33081 RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 33224
35381 NYFSRWGEQGTVDLKKELEQVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 35554
35555 TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTADNGDDVLQRLIDGRYKD 35719
35720 ERDLTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLVAVIAEQDRLMASRARTD 35899
35900 DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 36079
36080 MSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 36259
36260 KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRRNLTCK* 36403

>AC108875.1a $F chromosome 5 51% to 51A2 same as AQ050946 AQ687182 AQ258479
58% to AP004090 this might require subfamilies in CYP51
70310 MELTSSSAMWLAMAILAITAALTKIALGGGRRRCLSESSDLTCKTPPPPPVVNCIALLGL 70489
70490 LPALFRGDVPATMQQLYAKFGSVFTVSVAGLLKATFLVGPEVSAHFFQGLESEVSHGDLF 70669
70670 EFTVPMFGKEVGHGVDNATRIEQGRFFAEALKPVRLRIHVDPMVQEVE 70813 (0?)
71263 DYFAKWGQHGTVDLKHELEQLLLLISGRCLLGKEVMGTKFDEVCNLFRDIEGGVNLMSV 71439
71440 FFPYTPLIPSNRRRDMARERLHAIFSDIVRSRKQQQGDQEEVNDKDVLQSFIDSR 71604 (2?)
71910 YKADGRATTEAEVAGLITGVLFAAKHTSTHTSVWTGARLLTHEKFLAAAVDEQDQIVR 72083
72084 KHGIINGRIVTDHYGFLMEMHMLHICIKETLRLHPPAPMIVRTALRQFTVRTREGHEYCV 72263
72264 PAGHTMASPIVISNRVPYIYKDAHLYDPDRFGPRREEDKVGGKFSYTSFGGGRNSCVGEN 72443
72444 YAYMQIKAIWSHLLRNFELKLLSPFPKTDWSKLVPEPQGKVMVSYKRRQL 72593

>AC108875.1b $F chromosome 5 48% to 51A2
80009 MAIILAITAAVTKIARGGRRRSATDPTCKMPPPPPVVNSIALLRLLPTLFRSGLPA 80176
80177 ILHELYTKFGSVFTINLAGLLKMTFLVGPEVSAHFFQGLESEISHGNLLEFTVPMFGKEI 80356
80357 AHGVDSATRNEQARFFVDALKPARLRIHVDPMVQEVE 80467 (0?)
84741 DYFAKWGQHGTVDLRRELEQLLLLISGRCLLGKEVMGTMFDEVCNLFRDIEGGVNLMSV 84917
84918 FFPYTPLIPSNRRRDMARKRLHAIFSDIVRSRKQREGDNVDKDVLQSLIDSRYKADGRAT 85097
85098 TEA*VAALMICLLFAAKHTSAYTSVWTGARLLSHERFLTAAVDEQDKIAREHSNINGGGR 85277
85278 ITDDRYGSLMEMRTLHSCIKETLRLHPPVPMLVRTAHKQFTVRTREGHEYAVPAGHTIAS 85457
85458 PIVISNQVPYIYKDGHLYDPDRFGPAGREEDKVGGKFSYASFGGGRTGCVGEGYAYMQIK 85637
85638 AIWSHLLRNFELRLLSPLPKSDFTKFVPEPHGELMVSYKRRQL 85766

>AC108875.1c $F chromosome 5 50% to 51A2
122577 MDLVSISIYMWAMAALCTIITAMVTTKLARVRRPITLNPKSKRPLPPVVNVIA 122735
122736 LLEHLPRLCTKGVIPVYKGSVFTVSLFGLKATFLVGPEVSAHFYQGMDSEISQGDLYEFT 122915
122916 VPLFGKGVGFDIDNATRTEHLRFFIDAIKTSKLRNHVNSMVQEVE 123050 (0?)
123296 DYFAKWGENGIVDIKHEFEKLLMLISGHCLLGKEVRDNMFDEVFSLF 123436
123437 HELDSGVGLGSVIFPYIPIPSHIRCDKAHAKLAKIFSKIVRSRRDSNRPAEQDVLQYLID 123616
123617 SKHRDGNSTTEQEVTGWIISMVFAGKHTSTNSTTWTGACLLTHDKFLTEALDEQKHMIQK 123796
123797 HGDHIDYNVLLDMDILHCCIKEALRMHPVAPIIYRKAQKSFVVRTREGDAYDIPEGHNLL 123976
123977 SPMIFNNRLPYIYKDPHMYDPDRFAPKREEDKVGGMFSYTSFGGGRHICIGEAYAYMQIK 124156
124157 VIWSHLLRNFELKLESPFPKTNWSKILLEPWGKVMVSYKRRRL 124285

>AP004090.1 $F chromosome 2 clone OJ1399_H05 49% to 51A2
AQ843111.1 nbxb0005D03r CUGI Rice BAC genomic cloneLength = 507 49% to 51A2
78158 MAVLFVATKMIQQRPRTLCLYEKENKEEELLLPPVMSVVSVLTAYLPTLIAKGLPAVIHDLH
77972 SRLVTLLVTAHFFQASESEIRQSNIYKVTVPVFGRGVLYDVDLATRSRQISFCTDS 77805
77804 IKPINLRGHVDSMVHEVE 77751 (0)
76666 GYFAQWGEDGVVDIKYEMGNLILLIANRCLLGKQFGESKLEQVSTLLHELFDNGFHLISLF 76484
76483 FPYLPTPQHRRRDKARAMLGEMIHEAVRSRRNSGVAEDDVLQKFLDSKYINGRCMTENEI 76304
76303 AGLLICMMFAAQHTSSSTSTWTGACLLSHGHRSYLAAAIQEQKRIIQQHGDRINWGILLQ 76124
76123 MTTLTHCIKEALRLHPPANLLIRHASKSFSVQTRQGHRYQIPKGHTLATCTTVGNRLPYI 75944
75943 YKDPNVYDPSRFGPGREEDKVGGKFSYTPFSAGRHVCLGEDFAYMQIKVIWSHLLRNFDL 75764
75763 ELISPFPEEEWEKFIPGPKGKVMVTYKRRRLL* 75665

>AP003866.1a $F chromosome 7 clone OJ1092_A07 53% to 51A2
AQ326645 and AQ291927 mid to K-helix region 52% identical to wheat CYP51
60% identical to AQ327456 68% to EST T88278 705 family
AQ689048.1 nbxb0078H10r CUGI Rice BAC genomic clone Length = 737
AQ396185.2 nbxb0066K16r CUGI Rice BAC genomic cloneLength = 327
50920 MDMAAAAAVWFSAIAAVLLAASTIAVVVVAKMTGKRNGGAAAAAAAAAEAELPL
51082 PPVVSGVSLIIPVITRGPMAVADELYVKLGSVFT (0)
      GGFYSRPE 51261
51262 SEVHQGGTYRMTVPMFGRGVMYDVDVATRSEQIAVCFEALRPTKLRSSTVTMVRETE 51432 (0)
52114 EYFAKWGEQGTVDLKRELDLLILTIASRVLLGKEVRETMFADVVASFHELMDNSMHLI 52299
52300 SLCFPNLPIPRHRRRDTASARLKELFSRAIQLRRGSGRAEDDVLQRFLESRYRDGRAMSD 52479
52480 NEITGMLIALVVAGQHMSSSASTWTGAFLLRDPKHLAAAVDEQRRLIGDDRVDYDALT 52653
52654 TGMSTLHRCIKEALRMHPPAPALVRTVRRGFAVWTREGKEYRMPAGHSVVSYAAFNHRLG 52833
52834 YVYRDPDEYDPERFGPERKEDRVAGKFSFTAFGGGRHACLGEHYAFLKMKVIWSYLLRNF 53013
53014 ELELVSPFPEVELNNIMLGPRGEVMVRYKRRKLTST* 53124

>AP003866.1b $P chromosome 7 clone OJ1092_A07 
No obvious N-terminal, two in frame stops, three frameshifts = Pseudogene
82%  to AY022669.1 seems to be a CYP51 pseudogene
54048 VRFLHRKVTFLVGPEESSHFFTGLDSEISQDEVSRFIIPTFGS*VAFD 54191 (intron no boundaries)
54642 AGYATQ*EQFRFFGDTMKPTKLRSYVSHMVYEVE 54743 (0) missing 20 aa
56119 VVTLIATRCLFGEVRSKMLGEVPTLLRELNDSMRLITIVFPYLPIPAHRRRDSARARL 56292
56293 GEIFVEIVRSRRSSPGGGGAGHDDMLQCLIDARYKDGRATTEG 56421 frameshift
56424 AEVAGMLVSALFAGQYTSSSTSTWTGAR 56507 frameshift
56510 LLTHPEHLRAAVREQEELVLVRHRHGGDVVDHDALQRMGHLHRCVKETLRLHPPSLMLLRHAR 56698
56699 RSFVVRARGSGDAEYEVPAGHTVASPMVIHNALPHVYEDAGSFDPGRFGPAREEYRAYAA 56878
56879 DHAYTVFGGGRHACVGE 56929 frameshift
56932 VIKVIWSHLLRNFELELMLPFPETDWNDVMPGPKGKVMVLPPSHNKIISVIVGFSVQL* 57108

>AY022669.1 (partial)   microsatellite MRG4994 containing (CCG)X8, Length = 224
82% to CYP51 pseudogene above
222 PRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSGGGGGGGGARHDDMLQCLIDARYKDGRATTETEVAGMLVAA 1

CYP71 clan contains 25 families 71, 73, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 
89, 92, 93, 97, 98, 99, 701, 703, 705, 706, 712 (712 = a 705 subfamily), 719? 723?
161 sequences

>AC087599.11 $P chromosome 10 clone OSJNBa0057L21, pseudogene fragment like 71A1 44% to AAAA01006105.1b
16812 GGGGRWTETLEWIMAELTANTRVMAKLQDEISRAADGK 16925 24 aa deletion and frameshift
16931 PAPILVPRQSTATAVVQG*EILAKTSLFINVWAMGRYPAAWNSSEEFWPEQFLASREAVD 17110
17111 FQGNNYQLILFITDRRIFPDINFAVPVLETALVGLLHPTNELLGGG 17248
17249 GGLMWLQRSCSRARRPRSTAHRRHRSGTHPAAIAAAAAT* 17368

>BI808626.1 (partial) clone D005B07.Length = 538 EST with numerous frameshifts similar to AAAA01035499.1 and 71Bs no ortholog found
no extensions in htgs nr gss or est sections of Genbank 8/3/02
66% to AY104083.1 Zea mays
11 LFVHAWAIGRDPXAWXXPEEFRPDRFLXXSVDFRGNDYQLVPFGAAPRICPGISFXX
PVLEMALFALLHHFDWELPAGMXXXXXDMSEAPGLTTPLRVPLRLVPKRKARLPRHIYKRNVIGE*

>AP004684.1a $F chromosome 6 clone P0012H03, Length = 163117 41% to 76C2
New seq 76% to CONTIG OF C72003 (c-term) 59% to AP003623.1 70% to AU173996.1
10968 MQQQVTEFANAGSDTHAERKQSELTAAEMGHELWVLW
11079 ATLAVSLLCYLYLTSHRLGSRRRRWPPGPRPLPLLGNLLDLRGG 11210
11211 NLHHTLARLARAHGAPVMRLQLGLSPAVVISSPGAAREAFTAHDRRLAARAVPDAN 11378
11379 HALGFCDRSMIWLPSADPMWRTLRGVVAAHAFSPRALAAARAVHERKVRDLVA 11537
11538 YLRGRAGREVDVKDAVYGGVLNLVSSALFSADVVDVGGESAQGFRELVEELIESIAKPN 11714
11715 VSDLFPFLRRFDLQGWRRWTSGHLAKIYKVLDDIIDRRSAEDDAAMDKRGDFLDVL 11882
11883 LELMSTGKIAREYLTNILFDVFTAGSDTMSLTVVWAMAELLRNPGVMAKAR 12035
12036 AEIDAALGGREAVEEADVARMPYVQAVLKEAMRLHPVAPVMLPRKAAEDGVEIGG 12200
12201 FEVPRGCAVIFNTWAIMRDPAAWERPDEFVPERFVGRSRATEEMDFRGKDFGFLPFGSG 12377
12378 RRLCPGVPMAERVLPLIMASLLHAF 12452
12453 EWRLPDGMSAEQLDVSEKFTTANVLAVPLKAVPVVIAC* 12569

>AP004684.1b $F chromosome 6 clone P0012H03, Length = 163117
New seq similar to AP004000 57% to AP003523.1 78% to AP004688.1  
36% to 41% with 71A and 71B sequences possibly new subfamily in 71
117908 MDAAVFCCLLALLPLLHYLITLFLHGSRDSDLRLPPGPWRLPLIGSLHHLF 118060
118061 FGALPHRALRDLARRHGPLMLLAFGDAPVVVVASTAGAAREILRTHDDNFSSRPLSAV 118234
118235 VKVCTRRGAGITFAPYGEHWRQVRKICRLELLSPRRILAFRAIREEEAARLVRAIGVASP 118414
118415 PLVTNLSELLGNYVTDTTVHIVMGERFRERD ALLRYVDEAVRLAGSLTMADLFPSSRLAR 118594
118595 AMSSTTLRRAEAFVESLMEFMDRVIREHLEKKRSCQGGEREEDLIDVLLRLQAEGSLHFE 118774
118775 LTMGIIRAVIF 118807 (0)
118932 DLFSGGSETATTTLQWAMAELMRNPGVMSRAQAEVREAYKDKMEVTEEGLTNLTYLQCII 119111
119112 KETLRLHTPGPLALPRECQEQCQILGYDIPKGATVLVNVWAICTDNEFWDESEKFMPERF 119291
119292 EGSTIEHKGNNFEFIPFGAGRRICPGMQFGIANIELALANLLFHFDWTLPEGTLHSDLDM 119471
119472 TETMGITARRKEDLYVHAIPFVQLP* 119549

>AP003523.1a $F chromosome 6 clone P0416A11 six different genes 
AQ271656 nbxb0026B09r 53% to 71A16 N-term
35% to 40% with 71A and 71B sequences possibly new subfamily in 71
112855 MEQYLFLATLLILSLAFVKLRPRNNGENPPPGPWQLPVIGSLHHLAGALPHRALRDLA 113028
113029 TRHGELMLLRLGELPVVVASSPAAAREVMRTHDAAFATRPQTATLRALTRDGLGVAFAPQ 113208
113209 GEHWRCLRKLCVTELLGARRVRCLRRAREAEAAALVASLSTTTPEPVNVS 113358
113359 SLVARYVTDAVVRAVVGDRISDRDAFLERLEEGVKVAAGFTLADVFPSSRLARALSGTAR 113538
113539 RAEAHSREMTRLMDGVIEEHRQRRAATGWRDEEDEDLLDVLLRIQKDGGLQIPLDMGTIRAIII 113730 (0)
113831 DLFSAGSETTGTTLQWAMAELMRNPAALRKAQAEVRGVLAGHSHVTEDALPDLHYLHL 114004
114005 VIKETLRLHVAVPLLLPRECQEPRLRVLGYDVPERAMVLVNAWAICRDTAVWGPDAEEFR 114184
114185 PERFDGGAVDFKGTDFEFVPFGAGRRMCPGVAFAVAIMELGLASLLFHFDWELAGGTAA 114361
114362 GELDMAEGLGITARRKSDLWLHATVSVPVPNTETS* 114469

>AP003523.1b $F chromosome 6 clone P0416A11 six different genes
54-58% with genes in AP003090 and AP004000 group
38% to 41% with 71A and 71B sequences possibly new subfamily in 71
118132 MDLMKSNPLQGSPWSL
118084 LNLLVLIIVAAMICGELCRRRRRRRGDENGGATRLPPGPWRLPFVGSLHHLAVMRPRGVV 117905
117904 VHRALAELARRHDAPVMYLRLGELPVVVASSPEAAREVLKTHDAAFATRAMSVTVRESIG 117725
117724 DKVGILFSPYGKKWRQLRGICTLELLSVKRVRSFRPIREEQVARLVDAIAAAAASS 117557
117556 TAEAAAVNISRQITGPMTDLALRAIMGECFRWREEFLETLAEALKKTTGLGVADMFPSSR 117377
117376 LLRAVGSTVRDVKLLNAKLFELVECAIEQHREQIRAAHDNGGDDDDAHGHGDKECFLNTL 117197
117196 MRIQKEGDDLDD 117161 (frameshift) LTMATVKAVIL (0)
       DMFAGGSESTSTTLEWALSELVRNPHVMQKAQAEIRHALQGRTRV 115967
115966 TEDDLINLKYPKNIIKETLRLHPVAPLLVPKECQESCKILGYDVPKGTIMFVNAWAIGRD 115787
115786 PRYWNDAEVFMPERFEKVAVDFRGTNFEFKPFGAGRRMCPGITFANATIEMALTALLYH 115610
115609 FDWHLPPGVTPDGLDMEEEFGMSVSRKRDLYLRPTLHMGLETI* 115478

>AL606658.1 $P chromosome 4 clone OSJNBb0016D16 lone pseudogene fragment
72% to AP003523 118132-115478
120987 SALNVSWQITGTLTDLTLHAIMGECGFRWHEEFLETLGEAQKKATRFGVADLFPSSRLLPAVGSRSGD 120784

>AP003523.1c $F chromosome 6 clone P0416A11 six different genes
73% to AP003523.1d 64% to AP003523.1f
128476 MAAELVHLLRYLFSVPM
128425 LFFIVPLLFLVCSPRRRRGRGSCRLPPSPWALPVVGHLHHLAGALQHRAMRDIARRHGPL 128246
128245 VLLRLGRLPVVVASSADAAREVMRTSDVAFAARPVNRMIRVVFPEGSEGVIFAPYGETWR 128066
128065 QLRKICTAELLSARRVHSFRSVREEEAGRMLRAVASAAAQTTVNLSELMSA 127913
127912 YAADSSARAMIGRRLKDRDTFLAMVERGIKLFGEQSLPNLYPSSRLAVLLSTMPRRMKRH 127733
127732 RERMTAYLDAIIEEHQESRASREDDEDLLDVLLRM 127628 frameshift
       QREGDLEVSRESIRSTIG bad exon boundary should be phase 0
126439 DMFIGGSEPPAITLQWIMAELIRNPEVMQKVQDEVRQLLVGQHRVTEESLSKLGY 126275
126274 MNLVIKETLRLHPPGPRLLLRVCRTTCQVLGFDVPKGTMVLVNMWAINRDPKYWSQAEEF 126095
126094 IPERFENAGINFKGTNFEYMPFGAGRRMCPGMAFGLATLELALASLLYHFDWKLPDGV 125921
125920 EIDMKEQSGVTTRRVHDLMLVPIIRVPLPV* 125828

>AP003523.1d $F chromosome 6 clone P0416A11 six different genes 
73% to AP003523.1c 
138301 MAEVVQLHHLILLLPLFILPFLLLRSSRRRRGACGRLPPSPWALPVIGHLHHLAGALPHRAMRDIA 138152
138151 RRHGPLVLLRLGELPVVVASSADAARDVMKTHDLAFATRPITRMMRLVFPEGSEGIIFSP 137972
137971 YGETWRQLRKICTVELLSARRVNSFRSVREEEVNRLLRAVAAAAASATSPAKTVNL 137804
137803 SELMSAYAADSSVRAMIGRRCKDRDKFLAMLERGIKLFVTPSLPDLYPSSRLAMVVSRMP 137624
137623 RRMRRHREEVFAFLDAIIAEHQENRASGEDEEDLLDVLLRIQREGCME 137480 frameshift
       SPLLSTESIRTTIG bad exon boundary should be phase 0
136562 DLFNGGSETTATTLQWIMAKLMRNPRVMQKAQDEVQRVFIGQHKVTEENLSNLSYMYL 136389
136388 VIKEALRLHPPRPPLLPRECRTTCQVLGFDVPKGTIVLVNMWAINRDPKYWDQSEEFILE 136209
136208 RFEHVDINFKGMNFEYMPFGAGRRMCPGMAFGLVNLELVLASLLYHFDWKL 136056
136039 GDLDMTQERGATTRRLHDLLLVPVIRVPLPLDSRS* 135942

>AP003523.1e $F chromosome 6 clone P0416A11 six different genes AQ331067 AQ364007.2
AQ331067 55% identical to AQ328148 47% identical to C74921 57% to 71B4 58% to 76C1 64% to AP003523.1f
AQ364007.2 nbxb0060E04f CUGI Rice BAC Length = 393 65% to 99A1
End of this gene matches AP003571 at 155149
151229 MAGFPVYLLFLAALIILPMANLIRSARHRRLAGARRPPPGPWALPVIGHLHHLLAGKLPH 151408
151409 HHKLRDLAARHGPLMLLRFGELPVVVASSADAAREIAKAHDLAFATRPVTRTARLTLPEG 151588
151589 GEGVIFAPYGDGWRQLRKICTLELLSARRVLSFRAVREQEVRCLLLAVASPSPEGTTAT 151765
151766 ASVVNLSRMISSCVADSSVRAIIGSGRFKDRETFLRLMERGIKLFSCPSLPDLFPSSR 151939
151940 LAMLVSRVPGRMRRQRKEMMEFMETIIEEHQAARQASMELEKEDLVDVLLRVQRDGSLQF 152119
152120 SLTTDNIKAAIA 152155 (0)
166133 DLFIGGSETAATTLQWAMSELLNNPKVMQKAQDEIRQVLYGQERITEETISSLHYLHL 166306
166307 VIKETLRLHPPTPLLLPRECREPCQILGFDVSKGAMVLINAWSIGRDPSNWHAPEKFMPE 166486
166487 RFEQNNIDFKETSFEYIPFGAGRRICPGMTFRLANIELLLASLLYHFDWELPYGMQAGD 166663
166664 LDMTETLAVTARRKADLLVVPVVRVPIVG* 166753

>AP003523.1f $F chromosome 6 clone P0416A11 six different genes
AU096456 AU096455 71% to AP002968 65% to 71B24 also = AU032983
168538 MAAAASSVLAYLLVVALLAIVPLVYFGWVARRRGEGGRLPPSPWGLPVIGHLHHLAGALPHHAMRDLA 168741
168742 RRHGPLMLLRLGELPVVVASSAEAAREVMRTRDIEFATRPMSRMTRLVFPAGTEGIIFAP 168921
168922 YGDEWRELRKVCTVELLSARRVQSFRAVREEEVGRLLRAVAATSSSPSPAQAAVNL 169089
169090 SALLSAYAADSAVHAIIGSRFKDRDKYLMLLERGLKLFARHTLPDLYPSSRLAMWLSRMP 169269
169270 RRMMQHRREAYAFTDAIIREHQENRAAGAGDGDGDDKEDLLDVLLRIQREGDLQFPLSTE 169449 
169450 RIKTTVG 169470 (0)
169893 DMFAGGSETAGTALQWIMAELIRNPRVMHKVQDEVRQTLAGRDRVTEDAISNLNYMHL 170066
170067 VIKEALRLHPPVPLLLPRECRNTCQVLGFDVPKGAMVLVNAWAISRDPQYWDEPEEFIPE 170246
170247 RFEDSNIDFKGTNFEYTPFGAGRRMCPGIAFGLANVELMLASLLYHFNWQLPDGMDTAD 170423
170424 LDMTEEMVVSARRLHDLLLVPVVHVPLPVASS* 170522

>AC118346.1 $F Gene 1, 94448-96200, 3 exons 97% identical to gene 2 (12 diffs)
36% to 41% with 71As and 71Bs
94448 MADQLVHLPQQLLVLLLFIAPFFFFFLIRSMRRRDGGSVRLPPSPWALPVIGHLHHLMGA 94627
94628 LPPHHAMRDIALRHGPLVRLRLGGLQVI 94711
94712 LASSVDAAREVMRTHDLAFATRPSTRVMQLVFPEGSQ (0)
      GIVFTPYGDSWRNLRKICTVELLSAKRVQSFRPIREEEVGRLL 95071
95072 RAVAAASPARRAVNLSELISAYSADSTMRALIGSRFKDRDRFLMLLERGVKLFATPSLPD 95251
95252 LYPSSRLAELISRRPRQMRRHRDEVYAFLDIIIKEHQENRSSSDDQEDLDLVDVLLRIQR 95431
95432 KGDFPLSTDNIKTTIG (0) 95479
95574 DLFNGGSETTATTLKWIMAELVRNPRVMQKAQDEVRRALGKHHKVTEEALKNLSYLHL 95747
95748 VIKEGLRLHPPGLPLLLRESRTTSQVLGFDVPQGTMILVNMWAISRDPMYWDQAEEFIPE 95927
95928 RFEHVNIDYYGTDVKYMPFGVGRRICPGIAFGLVNLELVLASLLYHFDWELPDGTELGNL 96107
96108 DMKEEMGAIARRLHDLSLVPVIRHPLPVDM* 96200

>AC118346.1 $F (japonica cultivar-group) chromosome 11 clone Ba0039F06, 
Gene 2 = AU096586.1, D48250 97% identical to gene 1 (12 diffs)
113634 MADQLVHLPQQLLVLLLFIAPFFFFFLIRSIRRRDGGSVRLPPSPWALPVIGHLHHLMGA 113813
113814 LPPQHAMRNIALRHGPLVRLRLGGLQVI 113897
113898 LASSVDAAREVMRTHDLAFATRPSTRVMQLVFPEGSQ (0) 114008
114127 GIVFTPYGDSWRNLRKICTVELLSAKRVQSFRPIREEEVGRLLRAVAAASPARRAVNL 114300
114301 SELISAYSADSTMRALIGSRFKDRDKFLMLLERGVKLFATPSLPDLYPSSRLAELISRRP 114480
114481 RQMRRHRDEVYEFLDIIIKEHQENRSSSDDQEDLDLVDVLLRIQRKGDFPLSTDNIKTTIG (0) 114663
       DLFNGGSETTATTLKWIMAELIRNPRVM 114840
114841 QKAQDEVRQVLGKHHKVTEEALRNLSYLHLVIKEGLRLHPPGLPLLLRESRTTSQVLGFH 115020
115021 VPQGTMILVNMWAISRDPMYWDQAEEFIPERFEHVNIDYYGTDVKYMPFGVGRRICPGIA 115200
115201 FGLVNLELVLASLLYHFNWELPDETELGNLDMKEEMGAIARRLHDLSLVPVIRHPLPVDM* 115383

>AC118346.1 Gene 3 $P pseudogene 56% to AC118346.1 genes 1, 2
117756 DAAREVMRTHDLAFATRPSTRVMQLVFLEGSQ 117661
117553 GDRFTPYGDIWRNLRRSAPLAVSAKRVQFFRPIHQEEVCRLLQAVAVASPA 117395
117394 RGPPETLTSSFRPTWATLQCAP**GARLRDRDKSLMLLYRGVKPIRHARACQIFTQSIAL 117215
117214 ADLIIKSLSPMRRASYPMSNLLDIIFK 117134
117108 SDNHMDLTLVAFLLRFHKKGACPLSFCYIRKQFG*AF 116998

>AP003571.1 $F chromosome 6 clone P0458E02 continuation of contig AP003523.1
40% to 71B23
AQ687385 nbxb0074N19f 50% to 71B20
AQ258331 nbxb0020M04r 71-like sequence 36% to 71B33 89% to AP003571 4 diffs probably same
145371 MAAFLVYVLVLVPLAVVPFVYFNRVARRRGGDVRLPPSPWGLPVIGHLHHLVGALPH 145201
145200 VAMRDLARRHGPLMLLRLGELPVVVASSAEAAREVMKTRDLDFATRPMSRMARLVFPEG 145024
145023 GEGIIFAPYGDRWRELRKICTVELLSGRRVQSFRPVREEEAGRLLRAVAAASPG 144862
144861 QAVNLSELLSAHAADSSVRAIMGDRFRDRDAFLAMLERGLKLFAKPALPDLYPSSRLAM 144685
144684 LLSRMPRRMKQHHRDMVAFLDAIIQEHQENRSAAADDDNDLLDVLLRIQREGDLQFPLS 144508
144507 SESIKATIG 144481 (0)
144297 DMLVGGSETAATTLHWIMAELVRNPKVMQKAQDEVRRELIGHRKVTEDTLCRLNYMHM 144124
144123 VIKEALRLHPPGSLLLPRECRRTCQVLGYDIPKGATVFVNVSAIGRDPKYWDEAEEFIPE 143944
143943 RFEHSDVDFKGTHFEYTPFGAGRRMCPGMAFGLANVELTLASLLYHFNWELPSGIHAENL 143764
143763 DMTEEMRFTTRRLHDLVLIPVVHVPLPTI* 143674

>AP003571.1e $F chromosome 6 clone P0458E02 
128926 MADDYFYLGLALASLLVVLFARRRRSAAHGDGGLRLPPGPWQLPVIGSLHHLAGKLPHRAMRDLA 129120
129121 RRHGPVMMLRLGEVPTLVVSSRDAAREVMRAHDAAFASRPLSATVRVLTSGGRGIIFAP 129297
129298 YGGSWRQLRKIAVTELLTARRVASFRAIREEEVAAMLRAVAAAAAAGRAVELRAA 129462
129463 LSALVAETTVRAVIGDRCKDRDVFLRKLQRTIELSAGFNPADLWPSSRLAGRLGGA 129630
129631 VREAEECHDTVYGILDGIIQEHMERTSSGSCGAGDGDGDGEDLLDVLLRIQKEGGLEFPV 129810
129811 DMLAIKQVIF 129837 (0)
132964 DIFGAGSETSATTLEWVMAELIRNPKAMRKATAEVRRAFAADGVVLESALGKLHYMHLVI 133143
133144 RETFRLHTPLPLLLPRECREPCRVLGYDVPRGTQVLVNVWAIGRDERYWPGGSPEEFRPE 133323
133324 RFEDGEAAAAVDFRGADFELLPFGAGRRMCPGLAFGLANVELALASLLFHFDWEAPDVAD 133503
133504 PAEFDMTEGFGITARRKADLPLRPTLRVPVLVSVG* 133611

>AP003571.1f $P chromosome 6 clone P0458E02 pseudogene fragment
136961 VSEDALGELRYLQLVIRETLRLHPPLPLLLPRECTIGR 137074
137075 DERYWPGGSPEEFRPERFDDGEATAAVDFRGADFELLPFGGGRRMCPGMAFGLANVELPL 137254
137255 SSLLFHFDWEVPGMADPTKLDMTEAFGITSRRKENLHLRPLLRVSVPG 137398

>AP003571.1g $F chromosome 6 clone P0458E02
139036 MEDATHGYVYVGLALVSLFVVLLARRRRSPPPAAHGDGGLRLPPGPWTLPIIGSLHHLVGQIP 139224
139225 HRAMRDLARRHGPVMLLRIGEVPTLVVSSRDAAREVTKTHDTAFAMRPLSATLRVLTN 139398
139399 GGRDLVFAPYGDYWRQVRKIAVTELLTARRVHSFRSIREEEVAALLRAVAVA 139554
139555 AGTVEMRAALSALVSDITARTVFDNRCKDRGEF
       LVLLERTIEFAGGFNPADLWPS 139719 (?) bad exon boundary
140222 SRLAGRLSSVVRRAEECRNSVYKILDGIIQEHQERTSAGGEDLVDVLLRIQKEGG 140386
140387 LQFPLAMDDIKSIIF 140428 (0)
       DIFSAGSETSATTLAWAMAELIRNPTAMHKVMAEVRRAFAAAGAVSEDALGE 140655
140656 LRYLQLVIRETLRLHPPLPLLLPRECREPCRVLGYDVTRGTQVLVNAWAIGLDERYWPGG 140835
140836 SPEEFRPERFEDGEATAAVDFRGTDFEFLPFGAGRRMCPGMAFGLANVELPLASLLFHFD 141015
141016 WEVPGLADPAKLDMTEAFGITARRKADLHLRPCLLVSVPGV* 141141

>AP003571.1c $F chromosome 6 clone P0458E02
AQ328148 49% identical to C72289 58% to AQ327456 61% to AQ259669 
56% to 71B3
106935 MADLHTYLYLGLALVSLLAVQLARRRRSSAAHGSGALRLPPGPWQLPVIGSLHHLVGKL 107111
107112 PHQAMRDLARRHGPVMMLRLGEVPTLVVSSPEAAREVTKTHDVSFATRPLSSTTRVFS 107285
107286 NGGRDIVFAPYGDYWRQLRKITVTELLSARRVASFRAIREEEVAAMLRAVGGYAA 107450
107451 AGCAVEIRPLLAALVSDSTVRAVMGDRFPHRDVFLRELDRSIELTAGFNPADLWPSSRL 107627
107628 AGCLTGTIRQAKKCWDTMSSVLESTIQEHLQKNGSSGGGAGATDEDLIDVLLRIQKEGGL 107807
107808 QFPFDMDVIKSVIH 107846 (0)
110756 NVFGAGSETSATTLGWAIAELIRNPMAMKKATAEVRQAFAAAGVVSEAALSELRYLHLVI 110935
110936 KETLRLHPPGPLLLPRECREQCKVLGYDVPRGTQVLVNVWAIGRDPRYWPGGSPEEFRPE 111115
111116 RFGDGEPAAALDFKGTDYELLPFGAGRRMCPGLAFGLANVELPLASLLFHFDWEVPGMAD 111295
111296 PTKLDMTEAFGIGVRRKADLIIRPILRVPVPGV* 111397

>AP003571.1d $F chromosome 6 clone P0458E02
118418 MADGYFYLGLALVSLLVVLFARRRRSAAAAHGDAGLRLPPGPWQLPVIGSLHHLAGKLPHRAMRDLA 118618
118619 RRHGPVMMLRLGEVPTLVVSSRDAAREVMRTHDAAFASRPLSASVRAATKGGRDIAFAP 118795
118796 YGDYWRQLRKIAVTELLSARRVLSFRPIREEEVAATLRAVAAAAADGRTVELRAA 118960
118961 LCALVADSTVRAVVGERCAGLDVFLRQLDRAIELAAGLNVADLWPSSRLAGRLSGAVRQ 119137
119138 AERCRDTMFGVLDGIIQAHLEKTGGAGEDILDVLLRIHKEGGLEFPLDMDA 119290
119291 VKCVVV 119308 (0)
119451 DVISGGCETSATTLGWAFAELIRNPAAMKKATAEVRRDFEAAGAVSESALAVGELPYLRL 119630
119631 VVRETLRLHPPLPLLLPRECREPCRVLGYDVPRGAQVLVNAWAIGRDERYWPGGSPEEFR 119810
119811 PERFGDGEAAAAVDFKGADFELLPFGGGRRMCPGMAFGFANVELPLASLLFHIDWEASGV 119990
119991 ADPTEFDMTEAFGITARRKANLLLRPILRVPVPGV* 120098

>AP003571.1a $F chromosome 6 clone P0458E02
67847 MADVLSQGYVYLAMALVALLGVLLTKCSRTATAQRRLPPGPWQLPVIGSLHHLIGKLPHHAMRDLTRRHG 68056
68057 PVMMLRLGEVPTLVVSSPEAAQEVMRTHDAVFATRALSATVRAGTMGGRDIAFAPYGDY 68233
68234 WRQLRKIAATELLSAPRVASFRAIREEEVAATLRTVAAAAADGRAVELRAALCAL 68398
68399 VTDSTSRAVVGDRCKESDALIRAFDRSMELASGFNPAADLWPSSRLAGLLSGGVREIEA 68575
68576 NLHTVFGILDRLIEKRLQQKKTAPSSAAGEDILDALLRIHKEGGGLQFPLDMDSIKLIIA 68755 (0)
73915 DLFSGGGETVATLLVWAMAELIRNPMAMQKATTEVRRAFALAGAVSEGKGALGELRYLHL 74094
74095 VIKEASRLHPPAPLLLPRECSEPCQVLGYDVPRGTQVLVNAWAIGRDERCWTGGSGDGSS 74274
74275 PEEFRPERFEDGAEAVDLRGNNFELLPFGAGRRMCPGMAFALANIELTLASLLFHFDWEV 74454
74455 PDMADPAKLDMTETLGITARRKGDLLLRPVLRMPVPGVY* 74574

>AP003571.1b $F chromosome 6 clone P0458E02
80945 MEDASHGYVYLAMAVVALLGVLLTKRSRKATAQRLPPGPWQLPVIGSLHHLAGKLPHHAMRDLARRHG 81148
81149 PVMMLRLGEVPTLVVSSPEAAQEVMRTHDAVFATRALSATVRAATMGGRDIAFAPYGDR 81325
81326 WRQLRKIAATQLLSARRVASFRAIREEEVATMLRAVAAAAADGRAVEMRAALCVV 81490
81491 VADSTARAMVGESCQERDAFLREIDRSMELVSGFNPEDLWPSSRLAGRLSGAVRKIEAS 81667
81668 LHTVLGILDRIIQKRLQEKIGGAGAAAASEDILDVLLRIHKDGGAGGLQVPLDMDDITLV 81847
81848 IT 81853 (0)
83437 SMQLQNAHACALFLTIVSSTYSYSLFNDPPSLHMQDLFSGGGETVATLLVWAMAELIRN 83613
83614 PMAMQKATAEVRRAFALPGVVSEGEGALGELRYLHLVIRETFRLHPPGPLLLPRECSEPC 83793
83794 QVLGYDVPRGTQVLVNVWAIGRDERCWPAAAGGGSPEEFWPERFEDGAEAVDLRGNNFEL 83973
83974 LPFGAGRRMCPGVAFALANIELTLASLLFHFDWEVPGMADPAKLDMAEALGITARRKGDL 84153
84154 LLRPVLRMPVPGV* 84195

>AC091302.3 $F chromosome 3 clone OSJNBb0015I21 69% to 79F2
AC084319 50% to 79B3 complement(join(117988..118644,121556..122530))
122145 MAVTMPPMRAPALVA
122100 MLVVVLVALVRRRRHRSKGAGGRLESLPPGPVGLPVIGNMHQMLVNKPVFRWVHRLLADA 121921
121920 GGEIVCVRLGPVHVVAVTSPEMAREVLRKNDAVFADRPTTFAAESFSVGYRSASISPHGD 121741
121740 QWRKMRRVLTAEILSPATEHRLRGARGEEADHLVRYVLVRCGRDGAAVDVRHVARHFCGN 121561
121560 VIRRLTLGRRHFREPRADDEDAAAPGRDEAEHVDALFATLNYLDAFCVSDYFPALVGLD 121384
121383 LDGQEKVIKKVMRTLNRLHDPVVEERVEEWRLLRKAGERRDVADFLDVLASLDDAAGRPL 121204
121203 LTVEEIKAQTI 121171 (0)
       DIMIATVDNPSNAVEWALAEMMNKPEVMRKAMDELDTVVGRDRLVQESD 118113
118112 VRDLNYLKACIREAFRLHPYHPFNPPRVAMADTTIAGYTIPKGSQVILSRVGLGRNPRVW 117933
117932 DDPLEFRPERHLSPYPAGGRGDAGVVALTEAELRFVSFSTGRRGCPGVSLGTLITVTLFA 117753
117752 RLLQGFEWSKPAGVERVELREEAASLVLAQPLVLQATPRLAAHLYGAGL* 117603

>AL606614.1b $F chromosome 4 clone OSJNBb0011N17 40% to 71A25
23233 MSMASLQAPEFLASCLLLATILFFKQLLAPSSKQRAASPSLPRPRGLPLIGNLHQVGALPHRSLAALAAR 23096
23095 HAAPLMLLRLGSVPTLVVSTADAARALFRDNDRALSGRPALYAATRLSYGQKSISFAPD 22919
22918 GAYWRAARRACMSELLGPPRVRGLRDAREREAAALVAAVAAAGASPVNLSDMVAATSSR 22742
22741 IVRRVAFGDGDGDESMDVKAVLNETQALLGGLWVADYVPWLRWVDTLSGKRWRLERRFRQ 22562
22561 LDALYERVIDDHLNKRKHASDEEDDLVDVLLRLHGDPAHRSTFGSRSHIKGILT 22400 (0)
22059 DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVVAAGDKVREADLPELHYLR 21889
21888 LVIKETLRLHPAAPLLVPRETTEPFRTAHGVEIPARTRVVVNAMAIHTDPGVWGPDAERF 21709
21708 VPERHRDDADGCAQQHDGFALVPFGIGRRRCPGVHFAAAAVELLLANLLFCFDWRAP 21538
21537 PGREVDVEEENGLAVHKKNPLVLIATKSKRNTGGH* 21427

>AL606614.1a $F chromosome 4 clone OSJNBb0011N17 90% to AL606614.1b
19270 MASLQAPEFLASCLLLLATILLFKQLLAPSSKKRAASPSLPRPKGLPLIGNLHQVGALPHRSLAAL 19073
19072 AARHAAPLMLLRLGSVPTLVVSTADAARALFRNNDRALSGRPALYAATRLSYGQKNISF 18896
18895 APDGAYWRAARRACMSALLGAPRVCELRDAREREAAALIAAVAAAGASPVNLSDMVAAT 18719
18718 SSRIVRRVAFGDGDGDESMDVKAVLDETQSLLGGLWVADYVPWLRWVDTLSGMRRRLERR 18539
18538 FRQLDAFYERVIDDHINKRKHASDEEDDLVDVLLRLHGDPAHRSMFGSRTHIKGILT (0)18368
17337 DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVIAGGGGGDKDGAMVREADL 17149
17148 PELHYLRLVIKETLRLHPASPLVQRETTEPFRTAHGVEIPARTRVVINAMAIHTDPGVWG 16969
16968 PNAERFLPERHRAHDADGEQQHEHDGFALVPFGIGRRSCPGVHFAAAAAELLLANLLFCF 16789
16788 DWRALPGREVDVEEENGLAVRKKNPLVLIATKSKSNRDAH* 16666

>AP003522.1 $F chromosome 6 clone P0036B02, 37% to 76C2
AP004726.1 chromosome 6 clone P0486G02
AU096446.1 Rice green shoot cDNA clone S13713.Length = 477 49% to 76C2
129851 MAASSFLVECLSWLVVVLFSLYIFQLLRDARRRLPPGPWPPKPLVGDLLDLGEDGKQHRTFLRLAGR 130051
130052 YGGLMCLRFGMVPHVIVSTPDALRAVFAAAGAGGGGGGEGKKVDGIAGLPSLDVLSAMGH 130231
130232 RAHTIFALPSQDGKWRALRKFAAAEMLAPRRISSAAAGAQLQTKIVEALRREVSGHAARG 130411
130412 AAVVFRHAVLDSILSLLLGVLYSTDLEREERAMFRDLIEEIVGMLGTANVSDVFP 130576
130577 PVAALDLQGLRRRMTDLLTIMYRHFDDQVALRRRSRDAGEARKNDVLDTVLDKEES 130744
130745 EWKQEGSLLSHDVMRVLLS 130801 (0)
131025 DLYGAGASTTAALIEWGMVDLIQNPEVMTKVREELTNVLGDKLVMDESDIARLPYLQ 131195
131196 AVVKETLRLRTVVPLVPRKAEVDIEVNGYRIPKGTNVILNAWAINRSADAWSEPDKFIP 131372
131373 ERFLGGETRGYLGQDFEMIPFGLGRRICPGMPLAQKLIPLIIGTLLHRFEWELPADAK 131546
131547 EGGIDMTEKCGVVLSLVNPLKAIPKEI* 131630

>AP003990.1a $F chromosome 2 clone OJ1073_F05 42% to 71B24
AQ259671 323-379 region I-helix 55% to 71B4
AQ691116.1 nbxb0088K01f CUGI Rice BAC genomic clone Length = 544
7359 MADLELEKVASFLLAALLPLVLFKLAAAKRGGGDGGMRLPPGPWRLPVIGNLHQIMAGGQ 7538
7539 LVHRTMAGLARGLGDAPLLSLRLGEVPIVVASSADAAREIMSRHDAKFATRPWSPTVRVQ 7718
7719 MVDGEGLAFAPYGALWRQLRKITMVELLSPRRVRSFRRVREEEVGRLVVAVATAA 7883
7884 TRRPGEAAVNVGERLTVLITDIAVRTIIGDRFERREDFLDAAAEWVKIMSGFSLGDLFPS 8063
8064 SRLASFVSGTVRRAEANHRKNFELMDYALKQHEEKRAAAAAAGAGAVEDDEDIVDVLLRI 8243
8244 QKEGGLEVPLTMGVIKGVIR 8303 (0)
8551 DLFGAGSETSANTLQWTMSELVRNPRVMQKAQTELRDCLRGKQSVSEDDLIGLKY 8715
8716 LKLVIKETLRLHPVVPLLLPRECQETCNIMGYDVPKGTNVLVNVWAICRDPRHWENAETF 8895
8896 IPERFEDSTVDFKGTDFEFIPFGAGRRMCPGLAFAQVSMELALASLLYHFDWELPSGVA 9072
9073 PSNLDMEEEMGITIRRKNDLYLVPKVRVPL* 9165

>AP003990.1b $P chromosome 2 clone OJ1073_F05 
11020 MEKVAWCACFLFLALMVVRLRTKRRGDNNGGVKLPPGPWRLPLVGNLHQVMARGPLVHRTMADLA 11214
11215 RRLDAPLMSLRLGEVPVVVASSADAAREITKTHDVAFATRPWSSTIRVMMSDGVGLVFAP 11394
11395 YGALWRQLRKIAMVELLSARRVQSFRRIREDEVGRLVAAVAAAQPGEAVNV 11547
11548 SERIAALVSDAAVRTIIGDRFERRDEFLEGLAEGIKITSGFSLGDLFPSSRLASFIGGTT 11727
11728 RRAEANHRKNFELMECALKQHEEKRAAAAAAAAGAVEDDEDIVDVLLRIQKEGSLQVPLT 11907
11908 MGNIKAVVL 11934 (0)
12556 DLFGAGSETSANTLQ*AMAELIMNPRTMLKAQAELRDALQGKQIVSEYDLVKLKYLKL 12729
12730 IIKETLRLHPVVPLLLPRE 12786 frameshift
      CQETCKVMDYDVPIGTIVLVNMWVIGRDPKYWEDAKTFRPERFEDGHIDFKGMNF 12955
12956 EYLPFGAGRRMCPGVAFAEAIMELALASLLYHFDWEFPDGISPAKMDMMEVMGSTVRKKN 13135
13136 DLYLVPNAHVPVAP* 13180

>AP003990.1c $F chromosome 2 clone OJ1073_F05 one in frame stop at W 
AQ259669 61% identical to AQ328148 53% to 76C2 also has stop at W
AQ690680.1 nbxb0082B18f CUGI Rice BAC genomic clone Length = 768
AQ579195 nbxb0084A11f AQ509836 nbxb0094K16f 72% identical to AQ259671
16507 MEKVAWCACFLLLALMVVRLTAKRRGDNGAERLPPGPWRLPLVGNLHQVMARGPLVHRTMADLARRLD 16710
16711 APLMSLRLGEVPVVVASSADAAREIMRTHDVAFATRPWNPTTRRLRCDGEGVVFATYGAL 16890
16891 WRQLRKLCVVELLGARRVRSFRRVREEEARRLVAAVAASPRGEAVNVSERI 17043
17044 TAVITDATMRAMIGDRFGRRDEFLELLADIVKIGSGFSLDDLFPSWRLAGAIGGMARRAE 17223
17224 ANHRRTYELMDSVFQQHEQRRVHVAAPADGAMDDAEEDLVDVLFRIQKDGGLEVPLTIGN 17403
17404 IKAIIL 17421 (0)
17795 DLFNAGSETSANTLQWVMSELMRNPKVMRKAQAELRNNLQGKTTVTEDDLTNLKYLKL 17968
17969 VIKETLRLHPVLPLLLPRECREACNVIGYDVPKYTTVFINV*AINRDPKYWDMAEMFKPE 18148
18149 RFDNSMIDFKGTDFEFVPFGAGRRICPGIAFAQSNMELVLATLLYYFDWELPSGMSPEE 18325
18326 LDMTEDMGLSVRRKNDLYLHPTVCVPL* 18409

>AP003990.1d $F chromosome 2 clone OJ1073_F05 
27391 MEQVSCFAAAAAVVVVVLLLARMLLAPRGEWDGLNLPPSPPRLPFIGSFHLLRRSPLVHRALADVARQL 27597
27598 GSPPLMYMRIGELPAIVVSSADAAREVMKTHDIKFASRPWPPTIRKLRAQGKGIFFEPYG 27777
27778 ALWRQLRKICIVKLLSVRRVSSFHGVREEEAGRLVAAVAATPPGQAVNLTE 27930
27931 RIEVVIADTTMRPMIGERFERREDFLELLPEIVKIASGFSLDDLFPSSWLACAIGGSQRR 28110
28111 GEASHRTSYELVDSAFRQRQQQREAMAASPPDIAKEEEDDLMDELIRIHKEGSLEVPLTA 28290
28291 GNLKAVIL 28314 (0)
28577 DLFGAGSETSSDALQWAMSELMRNPRVMEKAQNEVQSILKGKPSVTEADVANLKY 28741
28742 LKMIVKETHRLHPVLPLLIPRECQQTCQIMGYDVPQGSVIFINSWAIMRDPKHWDDAETF 28921
28922 KPERFEDGEIDLKGTNYEFTPFGAGRRICPGLALAQASIEFMLATLLYHFDWELPNRAA 29098
29099 PEELDMTEEMGITIRRKKDLYLLPTLRVPLTA* 29197

>AP003990.1e $F chromosome 2 clone OJ1073_F05 
33613 MEQVSCFAAAAAAVLVVLSLARMLLAPRREWD
33709 GLNLPPSPSRLPFIGSFHLLRRSPLVHRALADVVRQLGAPPLMYMEIGEVPAIVVSCADA 33888
33889 AREIMKTHDINFASRPWPPTVQKLRAQGKGIFFEPYGALWRQLRKICIVKLLSVRRVSSF 34068
34069 HGVREEEAGRLVAAVAATPPGQAVNLTERIKVAIADTTMRPMIGERFERRE 34221
34222 DFLEVLPEIVKLASGFSLDDLFPSS 34296
      GSPAPSAARGEAVNRASYELVDSAFRQRQQQKEAMAAPPPDIAKEE
      EDDLMDELIRIHKEGSLEVPLTAGNLKAVIL 34528 (0)
34777 ELFCAGSETSSNAIQWAMSELVRNPKVMEKAQNEVRSILKGKPTVTEADMVDLTY 34941
34942 VKMIVKETHRLHPVLPLLTPRVCQQTCQIMGYDVPQGSVIFIKSWAIMRDPKHWDDAETF 35121
35122 KPERFEDSEIDLKGTNYEFTPYGAGRRICPGLALAQVSIEFILTMLLYHFNWELPNGAA 35298
35299 PEELDMTEDMGLTIRRKNDLYLLPTLRVPLTA* 35397

>AP003990.1f $F chromosome 2 clone OJ1073_F05 
38238 MYDAVACVVAVVVVVVFAMLWVKLARSGDGGGGGSGGVRLPPGPWRLPVIGSLHHVVGDRLLHRSMA 38438
38439 RIARRLGDAPLVYLQLGEVPVVVASSPGAAREVTRTHDLAFADRALNPTARRLRPGGAGV 38618
38619 ALAPYGALWRQLRKICVVELLSARRVRSFRRVREEEAGRLVGALAAAAASPGEEA 38783
38784 AVNFTERIAEAVSDAALRAMIGDRFERRDEFLQELTEQMKLLGGFSLDDLFPSSWLASAI 38963
38964 GGRARRAEANSRKLYELMDCAIRQHQQQRAEAAVVDGGAGVEDDKNQDLIDVLLNIQKQG 39143
39144 ELETPLTMEQIKAVIL 39191 (0)
39428 DLFSGGSETSATTLQWAMSELIKNPMVMQKTQAELRDKLRRKPTVTEDDLSGLKYVKL 39601
39602 IIKETLRLHPVVPLLVARECRESCKVMGYDVPKGTTVFVNAWAIGRDPKYWDDAEEFRPE 39781
39782 RFEHSTVDFKGIDLEFIPFGAGRRICPGMAFAEAIMELLLAALLYHFDWELPNGMAASE 39958
39959 LDMTEEMGITVRRKNDLHLRPHPPCVVRSNFRSFVERERERHFV* 40093

>AP003990.1g $P chromosome 2 clone OJ1073_F05 pseudogene
42530 MDHVLACVGILVAFTPLFLLAVLPLKLTNGGDGVKLPPGPWRLPVIGSMHHLMGESLVHRAMAD 42721
42722 LARRLDAPLMYLKLGEVPVVLASSPCAAREIMRAHDVAFASRPLSPTVRRMR 42877
42878 PPPPRRRQLRKICVVELLSARRVRTFRRVREEEVARLVGALVCLAHVA 43021 gap
      AMIGARFERRDEFLE
      missing mid region
43862 DLFSGGSETSATVIQWAMSELMKNPRVMRKVQAELRDKLAGKPRVTEDDLSDLKYLKL 44035
44036 VIKETLRLHPAAPLLVPRQCREPCKIMGYDIPKGTTVFVNAW 44161 frameshift
44161 AIGRDPNYWDDAEVFRLERFANSTIDFKGMDMEFIPFGAGRRMCSGLAFAEAIIDLLFS 44337
44338 TLLFHFDWELPCGMTASELDMIEEMALTVRRKNDLHLRPILRVPQTQTSSALLF* 44502

>AP003990.1h $F chromosome 2 clone OJ1073_F05 
66403 MATELTEYLLLLPLLVVPLLYLAASSSRRSGRLRLPPGPWALPVIGHLHHLALAGAPTHR 66582
66583 AMRDMARRHGPLMLLRFCELPVVVASSPDAAREIMRTHDVAFASRPIGPMLRLVFQGAEG 66762
66763 VIFAPYGDGWRQLRKICTVELLSHRRVHSFRPVRADELGRLLRAVADQAASSSS 66924
66925 SPVNLTGMISAFVADSTVRAIIGSRSRHRDTFLRLVEDGLKIMPGMSLPDLFPSSRLAML 67104
67105 LSRVPAKIERRRRGMMGFIDTIIQEHQESRAAAEDEDLLDVLLRLQKDMDSQYPLTTMN 67281
67282 IKSILI 67299 (0)
67380 DMFGAGSETSATTLQWAMAELMRNPAVMRRAQDEVRRELAVAGNDRVTEDTLPSLHYL 67553
67554 RLVIKETLRLHPPAPLLLPRECGGACKVFGYDVPAGTMVLVNAWAIGRDAAAWGAAAEEF 67733
67734 SPERFERCERDFRGADFELIPFGAGRRICPGMAFGLAHVELALAALLFHFDWRLPGGMA 67910
67911 AGEMDMTEAAGITVRRRSDLLVFAVPRVPVPAQ* 68012

>AP003990.1i $F chromosome 2 clone OJ1073_F05 
68586 MPLVVLLLATIPLLFFTIKRSAQRRGGGGGGEGRLPPGPWALPVIGHLHHLAGDLPHRA 68762
68763 LSALARRHGALMLLRLGEVQAVVASSPDAARDIMRTHDAAFASRPLSPMQQLAYGRDAEG 68942
68943 VIFAPYGDGWRHLRKICTAELLSARRVQSFRPVREAELGRLLRSVAEATSSSSSA 69107
69108 SLVNLTELISAFVADSTVRAIIGSRFEHRDAYLRMLQDGLKIVPGMTLPDLFPSSRLALF 69287
69288 LSRVPGRIEHHRQGMQRFIDAIIVEHQEKRAAAAANDDDDEDEDFLDVLLKLQKEMGSQH 69467
69468 PLTTANIKTVML (0) 
      DMFGAGSESSATVLQWT 69647
69648 MAELMRNPRVMQKAQDEVRRALAGHDKVTEPNLTNLPYLRLVIKETLRLHPPAPLLLP 69821
69822 RKCGSTCKILGFDVPEGVMVIVNAWAIGRDLTYWDKPEEFVPERFEHNGRDFKGMDFEF 69998
69999 IPFGAGRRICPGITFGMAHVELVLSALLYHFDWELPQGMAAKDLDMTEDFGVTTQRRSNL 70178
70179 LVRPIHRVSVPVE* 70220

>AP003990.1j $F chromosome 2 clone OJ1073_F05 
70828 MAGELAFYLLLVGLVAVPLLILLGSERRTAARTRLPPGPWALPVVGHLHHLAGGLPPHRA 71007
71008 MRDLARRHGPLMLLRLGEVEAVVASSPDAAREIMRTHDVAFASRPVGPMSRLWFQGADGL 71187
71188 VFAPYGEAWRRLRRVCTQELLSHRRVQSFRPVREDELGRLLRAVDAAAAAGT 71343
71344 AVNLTAMMSTYVADSTVRAIIGSRRLKDRDAFLRMLDELFTIMPGMSLPDLFPSSRLAML 71523
71524 VSRAPGRIMRYRRRMRRIMDSIIHEHQERRAAADAAGDDDDDDDEDLVDVLLRLQKEVGA 71703
71704 QYPLTTENIKTVMM 71745 (0)
71837 DIFGAASETSSTTLEWVMAELMRSPSAMRKAQDEVRRALAAGAAGHDTVTEDILPNLSYL 72016
72017 KLVVKETLRLHPPAPLLAPRRCDSPREVLVLGHDVPAGATVLVNAWAIGRDTAAWGGAA 72193
72194 EEFSPERFERCERDFRGADFELIPFGAGRRMCPGMAFGLVHVELALAALLFHFDWSLPG 72370
72371 GMAADELDMAESSGLTTRRRLPLLVVARPHAALPTKYCN* 72490

>AP004000.1a $F chromosome 2 clone OJ1115_B01
95316 MAMVQDITGYLCLSLALLLLTLVLHKVARKATGNGAGKPRLPPGPWRLPVIGNLHQVAMGGPLVHRTMADLA 95101
95100 RRLDAPLMSLRLGELRVVVASSANAAREITKTHDVAFATRPWTSTIRVLMSDGVGLVFAP 94921
94920 YGALWRQLRKIAVVELLSARRVQSFRRIREDEVGRLVAAVAAAPAAQPVNV 94768
94767 SERIAALISDSAVRTIIGDRFERRDEFLEGLAEAIKITSGFSLGDLFPSSRLASFVGGTT 94588
94587 RRAEANHRKNFELIECALRQHEERRAAGAVDDDEDLVDVLLRVQKDGSLQMPLTMGNIKA 94408
94407 VVL 94399 (0)
93945 ELFGAGSETSANTLQWAMTELIMNPRVMLKAQAELSNVIKGKQTISEDDLVELKYLKL 93772
93771 IIKETLRLHPVVPLLLPRECRETCEVMGYDIPIGTTVLVNVWAIGRDPKYWEDAETFIPE 93592
93591 RFEDGHIDFKGTNFEFIPFGAGRRMCPGMAFAEVIMELALASLLYHFDWELPDGISPTK 93415
93414 VDMMEELGATIRRKNDLYLIPTVRVPLSTVL* 93319 

>AP004000.1b $P chromosome 2 clone OJ1115_B01 pseudogene fragment
101503 RVVASSTDAACREFTKTHDVKFATRPWSSTVRVLMADGLG 101393

>AP004000.1c $F chromosome 2 clone OJ1115_B01
109813 MAMVQYVTGYLCLLSLALLLLTLVLHKVARKATGNGAGKPRLPPGPWRL
       PVIGNLHQVAMGGPLVHRTMADLA 109595
109594 RRHDAPLMSLRLGELRVVVASSADAAREITKTHDVAFATRPWSSTIRVMMSDGVGLVFAP 109415
109414 YGALWRQLRKIAVVELLSARRVQSFRRIREDEVCRLVAAVAAAQPGEAVNV 109262
109261 SERITALISDSAVRTIMGDRFEKRDEFLEGLAEGDRIASGFSLGDLFPSSRLASFVGGTT 109082
109081 RRAEANHRKNFGLIECALRQHEERRAAGAVDDDEDLVDVLLRVQKEGSLQVPLTMGNIKAVIL 108893 (0)
107479 ELFGAGSETSASTLHWAMTELIMNPKVMLKAQDELSNVIKGKQTISEDDLVELRYLKL 107306
107305 VIKETLRLHPVVPLLLPRECRETCEVMGYDIPIGTTMLVNVWAIGRDPKYWEDAETFRPE 107126
107125 RFEDGHIDFKGTDFEFIPFGAGRRMCPGMAFAEAIMELVLASLLYHFDWELPDGISPTK 106949
106948 VDMMEELGATIRKKNDLYLVPTVRVPMSTAL* 106853 

>AP004000.1d $F chromosome 2 clone OJ1115_B01
123850 MEEPTSCYGYYHYLALAVAVLVLVRVTRTRGGGSDGVRLPPGPWRLPVIGSLHHLAGKPLVHRALAD 124050
124051 LARRMDAPLMYLRLGEVPVVVATSPGAAREVMRTHDVAFATRPVSPTVRIMTADGEGLVF 124230
124231 APYGALWRQLRRIAILELLSARRVQSFRRVREEEAARLAAAVAAAAPHGEAAV 124389
124390 NVSERIAVLIADSAVRAMIGDRFKKRDEFLEALAEGLKLVSGFSLADLFPSSWLASFVTG 124569
124570 AARRAQENHRKNFELMDRAIEQHQERRAAAAAASGDVVEDDDLVDVLLRIQKGGGLDVPL 124749
124750 TMGIIKAVIL 124779 (0)
124917 DLFSAGSETSATTIQW AMSELMRNPRVMKRAQAELRDNLQGKPKVTEEDLTDLNYLKL 125090
125091 IIKETLRLHLPAPLLLPRESRESCKIFGYDVPKGTTVLVNAWAIGRDPKYWDDPEEFKPE 125270
125271 RFEDSKIDFKGLDFEFLPFGSGRRMCPGIMFAQPNIELALATLLYHFDWSLPAGVKPSE 125447
125448 LDMTEEMGITVRRKNDLYLHAVVRVPLHATTP* 125546

>AP004000.1e $F chromosome 2 clone OJ1115_B01
AP004066.1 chromosome 2 clone OJ1572_F02, 55% to 71B17 aa 342-511 runs off beginning
contig of AA751324 and AQ327456 54% IDENTICAL TO 71B24   1/98 K-HELIX 
58% identical to AQ328148
127153 MAQDVAEYLSIFLALVVAPLLLLRVARRARGNGAGRPRLPPGPWRLPVIGSLHHLMGKPHVHRAMA 127350
127351 DLARRHGAPLMYLRLGEVPFVVASSPDAAREVLRAQDANFASRPWSPTLRVMMADGEGLA 127530
127531 FARHGAHWRLRKICVLELLGPRRVRSFRRVREEEVARLLAAVAAAAAAGADAV 127689
127690 VNVSERAAVLVTDTX 127731 frameshift
127734 VRAMIGDRFEMRDEYLEGVAEVGKLLLGLSLGDLFPSSRLASLVSGTARRAAASHRKMFE 127913
127914 LMDCAIRHHQERKAAMDADEDILDVLLRIQKEGGHDAPLTMGDVKDTIL 128060 (0)
128189 DLFAAGTETSTATLQWAMSEVVRNPRIMQKAQAELRNKLQGKPSVTEDDLVGLTYLKL 128362
128363 VIKETLRLHPAAPMLVPRECGESCKVLGYDVPRGTNVLINAWAIGRDPNYWDDTETFKPD 128542
128543 RCENNKYNFRGTDFEYIPFGSRRKICPGPAFTHAILELALAALLYHFDWELPCGVAPGE 128719
128720 VDMAEETGVVVRPKNDLYLRPVVRVPPGAASSGNGGT* 128833

>AP004688.1 $F chromosome 6 clone P0036C11, Length = 137929
New seq similar to AP004000 37% to 71B23 52% to AP003523.1 78% to AP004684.1b  
57304 METAELCCLLALLPLVYCLLTLFHGSRESDLRLPPGPWRLPLIGSLHHLFGRTLPHRA 57477
57478 LRDLARLHGPLMLLSFGQAAPVVIASTAIAAREIMRTHDDNFSTRPLSTVLKVCTRYGA 57654
57655 GMTFVPYGEHWLQVRKICSLELLSPRRILKFRSIREEEVARLVLAIASSSTPTPTPPAPV 57834
57835 NLSKLLSNYMTDATVHIIMGQCFRDRDTLVRYVDEAVRLASSLTMADLFPSWRLPRVMCA 58014
58015 TTLHRAEVFVESVMEFMDRVISEHLEKRSCQGGDREEDLIDVLLRLQAEGNLEFELSTSI 58194
58195 IKAIIF 58209 (0)
58290 ELLAGGSEAPITTLQWAMAELMRNPDVMSRAQAEVREAYKEKMKVTEEGLTNLPY 58469
58470 LHCIIKETLRLHTPGPFVLPRKCQEQCQILSYDVPKRATVVVNIWAICRDAEIWDEPEKF 58649
58650 MPDRFEGSAIEHKGNHFEFIPFGAGRRICPGMNFALANMELALASLLFYFDWSLPEDVLP 58829
58830 GDLDMTETMGLTARRKEDLYVCAIPFVQLP* 58919

>AC092559.2 $F chromosome 3 clone OSJNBb0096M04, 45% to 71B37
same as AC096688.3 chromosome 3
96529 MATLLSKLLALPQQWQLLLLLLLLPIASLLLVIGRNTGGRRRRRHLR
96388 LPPGPARLPVLGNLLQLGALPHRSLRDLARRHGPVMMLRLGAVPAVVVSSPEAAQEVLR 96212
96211 THDADCCSRPSSPGPMRLSYGYKDVAFAPYDAYGRAARRLFVAELFSAPRVQAAWRARQDQ 96017 (0)
94678 VEKLIGKLTRPEPEPVELNDHIFALTDGIIGAVAFGSIYGTERFAGGGRKRFHHLLDDVMDMLASFSAEDFFPNA 
94454
94453 AAARLFDHLTGLVAHRERVFQQLDAFFEMVIEQHLDSDSSNAGGGGGNLVGALIGL 94286
94285 WKQGKQYGDRRFTRENVKAIIF 94220 (0)
94119 DAFIGGIGTSSVTILWAMAELMRSPRVMRKVQAEIRATVGDRDGGGMVQPDDLPRLAY 93946
93945 LKMVVKETLRLHPPATLLMPRETMRDVRIGGYEVAARTRVMVNAWAIGRDAARWEEAEVF 93766
93765 DPDRFEAKRVEFNGGHFELLPFGSGRRICPGIAMGAANVEFTLANLLHCFDWALPVGMAP 93586
93585 EELSMEESGGLVLHRKAPLVLVPTRYIQL* 93496 

>AP004346.1a $F two genes and a pseudogene
71B like 47% to AC092559.2 75% to AP004346.1b
22020 MADDFLSSQPQPW 22058
22059 PPLLQLSAAVLFFLLPLLYLLFLRGSNGEVRGRQGNSASAPSLPGPCRQLPVLGNLLQIG 22238
22239 SRPHRYFQAVSRRYGPVVQVQLGGVRTVVVHSPEAAEDVLRTNDVHCCSRPPSPG 22405 (2)
26427 SYNYLDVAFAPYSDYWREMRKLFVVELTSVSRVRSFAYARAAEVARLVDTL 26577
26578 AASPPGVPVDLSCALYQLLDGIIGTVAFGKGYGAAQWSTERAVFQDVLSELLLVLG 26745
26746 SFSFEDFFPSSALARWADALAGVERRRRRIFRQVDGFLDSVIDKHLEPERLSAGVQED 26919
26920 MVDALVKMWREQQDRPSGVLTREHIKAILM 27009 (0)
28586 NTFAGGIDTTAITAIWIMSEIMRNPRVMQKARAEVRNTVKNKPLVDEEDSQNLKYLEMIIKEN 28774
28775 FRLHPPGNLLVPRQTMQPCLIGGYNVPSGTRVFINIWAMGRGPMIWDNPEEFYPERFE 28948
28949 DRNMDFRGSNFELVPFGSGRRICPGVAMAVTSLELVVANLLYCFDWKLPKGMKEEDIDM 29125
29126 EEIGQISFISFRRKVELFIVPVKHEQYQLMGHIN* 29221

>AP004346.1b $F two genes and a pseudogene 48% to AC092559.2 75% to AP004346.1a
69192 MATELLASQLLPWQPLVQLLAAGLFLLPLVYLLFFKGDGNGG 69317
69318 VMDSASAPSPPGPPRQLPVLGNLLQIGSRPHRYFQAVARRYVPVVQVQLGSIRTVVVHS 69494
69495 PEAAKDVLRTNDLQCCSRPSSPG 69565 (2) 
72047 NYNYLDVAVSPYS 72083 (frameshift)
72085 YWREMRKLLVIELTSIRRVQSFAYARAAEVARLVDTLAASPAGVPVDLSSALYTF 72249
72250 SDGVIGTVAFGKVYGSAAWSSWEWGASFQEAMDETMQVLGSFSFEDFFPSSALARWADALTGA 72438
72421 AGRRRRVFHRIDGFFDAVIDKHLEPERLSAGVQEDMVDAMVMVWREQKDEAFGLTRDHIKAILL 72630 (0)
84351 DAFVGGIDTTAVTVTWIMSELMRNPRVMQKAQAEVHNIVKNKSKVCEEDIQNMKYLKM 84524
84525 IIKENFRLHPPGTLLIPRQTMKTCTIGGYSVPSETRIYVNVWAMGRDPNIWDNPEQFY 84698
84699 PERFEDKGIDFRGSHFELLPFGSGRRICPGIAMGVANVELVVANLLYCFNWQLPKGMKEE 84878
84879 DIDMDEIGQLAFRKNFLF* 84935

>AP004346.1c $P two genes and a pseudogene
probable pseudogene 89% to AP004346.1b
92865 DAFAGGIDTTVVTTTWIMSELMRNPRVMQK 92954 (frameshift)
92956 AQAEVHNIVKNKSKVYEENIQNMKYLKMIIKENCRLHPPGTLLIPRHTMKTCTIGGYSV 93132
93133 PSKRRIYVNVWAMWRDPNIWDNLEQFYLERFEDKGIDFRGSHFELLT 93273 (insertion)
93561 FGSGQRICPGIAMGVANVELVVANLLYCFDWQLPKGMKEEDTDMDEIG*LAFRKKLPLFI 93740
93741 VPMKH* 93758

>AL606625.1 $F chromosome 4 clone OSJNBa0032I19 similar to 71B28 = AQ858445.1
AQ858445.1 nbeb0013M22r CUGI Rice BAC genomic Length = 824 54% to 71B23
82576 MGASILLVVVVSKLMISFAAKPRLNLPPGPWTLPLIGSIHHVVSSRESVHSAMRRLARRHGAPLM 82770
82771 QLWFGEVGTVVASSPEAAREVLRSHDLAFADRHLTAAAAAFSFGGRDVVLSPYGERWRQL 82950
82951 RKLLTQELLTASRVRSFRRVREEEVARLMRDLSAAATAGAAVNLSEMVTRMVNDTVLRCS 83130
83131 VGSRCEHSGEYLAALHAVVRLTSGLSVADLFP 83226
83227 SSRLAAMVSAAPRAAIANRDKMVRIIEQIIRERKAQIEADDRAADSKSC 83373
83374 ACSLDDLLRLQKEGGSPIPITNEVIVVLLM 83463 (0)
84970 DMFAGGTDTSSTTLIWAMAELIRSPRVMAKVQSEMRQIFDGKNTITEDDLVQLSY 85134
85135 LKMVIKETLRLHCPLPLLAPRKCRETCKIMGYDVPKGTSAFVNVWAICRDSKYWEDAEEF 85314
85315 KPERFENNDIEFKGSNFEFLPFGSGRRVCPGINLGLANMEFALANLLYHFDWKLPNGMLH 85494
85495 KDLDMREAPGLLVYKHTSLNVCPVTHIASSCA* 85593

>AP003909.1a $F chromosome 8 clone OJ1300_E01 55% to 71C4
50394 MAQMLDGLRHDEQASLHAPQEASTMPTMSCSD
50298 LLLAMMCPLILLLIIFRCYAYATRSGGMLSRVPSPPGRLPVIGHMH 50161
50160 LISSLPHKSLRDLATKHGPDLMLLHLGAVPTLVVSSARTAQAILRTHDRVFASRPYNTIA 49981
49980 DILLYGATDVAFSPYGDYWRQIKKIVTMNLLTIKKVHSYGQTRQQEVRLVMAKIVEEAAT 49801
49800 HMAIDLTELLSCYSNNMVCHAVSGKFFREEGRNQLFKELIEINSSLLGGFNLEDYFPSLA 49621
49620 RLPVVRRLLCAKAYHVKRRWDQLLDQLIDDHASKRRSSMLDNNDEESDFIDVLLSIQQEY 49441
49440 GLTKDNIKANLVVMFEAGTDTSYIELEYAMAELIQKPQLMAKLQAEVRGVVPKGQEIVTE 49261
49260 EQLGRMPYLKAVIKETLRLHPAAPLLVPHVSMVDCNVEGYTIPSGTRVIVNAWAIARDPS 49081
49080 YWENAEEFMPERFLSNTMAGYNGNNFNFLPFGTGRRICPGMNFAIAAIEVMLASLVY 48910
48909 RFNWKLPIDQAANGGIDMTETFGITIHLKEKLLLVPHLP* 48790

>AP003909.1b $P chromosome 8 clone OJ1300_E01, 4 in frame stops pseudogene 
54948 MAQMLGALLLFQDSLMSTMTRMSY
54876 SLLLPILCPLILLLLFRCYAYATRSGGMLDKLPSPPGRLPLIGHM 54742
54741 HLIGFFPHMSLRDLATDLMLLHLGTVPTLVVSSSRMAQVILRTHDRVFASR*QSAI 54574
54573 TDILF*GATDVAFSPYGDYWRQIKKIVTTNLLTIKKIRSYSQTRQQEVRLVMAKIVEEAA 54394
54393 THMAIDLTELLSCYSNNMVCHAVSGMFFCEEGRNQLFKELI*INSSLLGGFNIEDYFPSL 54214
54213 ARLPVRRLLYAKAYDVKKRWDQLLDKLIDDHSSKHRSSLLDNNDVESDFIDVLLSIQQE 54037
54036 YGLTKDNIKANLAIMFEAGTDTSFIELEYSMAELM*KPQMIAKLQAEVRGVVSKGQDIVT 53857
53856 EEHLGRMPYLKAVIKETLRLHPAAPLLAPHVSVVDCNVEGYTIPSGTRVIVNAWAIARDP 53677
53676 SYWENAEEFMPERFLSNTMADYNGNNFNFLPFRTGRRICPGINFAITTIEIMLASLV 53506
53505 YRFDWKLFTSRIDMTETFGATIHLKEKLFLVPHLPQDN* 53389

>AP003909.1c $F chromosome 8 clone OJ1300_E01 
same as AP004462.1 152574-152287 region
58316 MAVMLVPIPLLLLHQHHNHEHEHPSPVAPQPTM
58217 ASYYTLLLALLCPLLLLLIKLCRAKTRDDELFDKLPSPPGRLPV 58086
58085 IGHLHLIGSLPYVSFRELAIKHGPDLMLLRLGTVPTLVVSSARAAQAILRTNDHVFASRT 57906
57905 YSAVTDILFYGSSDVAFSPYGEYWRQVKKIATTHLLTNKKVRSYSRARQQEVRLVMARIN 57726
57725 EAAVARTTVDLSELLNWFTNDIVCHAVSGKFFREEGQNQMFWELIQANSLLLGGFNLEDY 57546
57545 FPNLARVTTVRRLLCAKAHNVNKRWDQLLDKLIDDHATKRSSSVLDLDNEESDFIDVLLS 57366
57365 IQHEYGLTRDNVKAILVIMFEGGTDTAYIELEYAMAELIRKPQLMAKLQAEVRSVVPRGQ 57186
57185 EIVTEEQLGRMPY 57153 frameshift
57147 LKAVIKEMLRLHLAGPLLVPYLSIAECDIEGYTIPSGTRVFVNAWALSRDPSFWENAEEF 56968
56967 IPERFLNSIAPDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLVYRFDWEIPA 56797
56796 DQAAKGGIDMTEAFGLTVHRRRSSSLFLGSHKIK* 56692

>AP003909.1d $F chromosome 8 clone OJ1300_E01 
AQ868830.1 nbeb0032E11f CUGI Rice BAC genomicLength = 759 57% to 76C5
same as AP004462.1 139663-140091 region
68223 MELNNTEPLTASRAQAAAVFLLLPVALLLLLLRFARATTMAGDRNSELLLSKLPS 68387
68388 PPLRLPVIGHMHLVGSLPHVSLRDLAAKHGRDGLMLVHLGSVPTLVVSSPRAAEAVLRTH 68567
68568 DLAFASRPRAMVPDIITYGATDSCYGPYGDHFRKVRKAVTVHLLNSHKVQAYRPAREEEV 68747
68748 RLVIAKLRGAAAMAGAPVDMTELLHSFANDLICRAVSGKFFREEGRNKLFRELIDTNASL 68927
68928 LGGFNLEDYFPSLARTKLLSKVICVRAMRVRRRWDQLLDKLIDDHATRLVRRHDHDQQQD 69107
69108 SDFIDILLYHQEEYGFTRDNIKAILV 69185 (0)
69331 DMFEAGTDTSYLVLESAMVELMRKPHLLAKLKDEVRRVIPKGQEVVNEDNIVDMVYLKAV 69510
69511 IKETLRLHPPAPLYIPHLSREDCSISGYMIPTGIRVFVNAWALGRDAKFWDMPDEFLPER 69690
69691 FMDSNIDFKGHDFHYLPFGSGRRMCPGIHSATVTLEIMLANLMYCFNWKLPAGVKE 69858
69859 EDIDMTEVFGLTVHRKEKLFLVPQAA* 69939

>AP003909.1e $F chromosome 8 clone OJ1300_E01 
same as AP004462.1 128584-129021 region
78935 MELILQLEAKTAAQAVVTVFFFFLLPLALLFYFARAAISSRDSKTRELILSKLPSPPFK 79111
79112 LPVIGHMHLIGPLPYVSLRDLAAKHGRDGLMLVRLGSVPTLVVSSPRAAEAVLRTHDLAF 79291
79292 ASRPRSMVTDIIMYGALDSCFAPYSDHFRSVKKVVTVHLLNSKRVQAYRHVREEEVRLVM 79471
79472 ARLRGAAAAAAAVDLSQTLQFFANDLICRAVSGKFLCEQGRNKVFRDLMEANSNLLGGFN 79651
79652 LEAYFPGLARMPLISKLICARAIRIRRRWDQLLDMLIDDHVASARDRAKNDDDDFIHVL 79828
79829 LSLQDEYGFTRDHIKAISI 79885 (0)
80359 DMFEAGTDTSHLVLEYAMVELTRKPHILTKLQDEVRRITPKGQHMVTEDDIVGMVYLKAV 80538
80539 IKETLRLHAPGGFTIPHLAREDCNVDGYMIPAGTRVLINLWALSRDANYWDKPDEFLPER 80718
80719 FMDGSNKNTDFKGQDFQFLPFGSGRRMCPGIHSGKVTLEIMLANLVYCFNWKLPSGMKK 80895
80896 EDIDMTDVFGLAIHRKEKLFLVPQIANY* 80982

>AP003909.1f $F chromosome 8 clone OJ1300_E01 
AZ127316.1 OSJNBb0086E03f CUGI Rice BAC genomic Length = 498 54% to 71A14
AQ871024.1 nbeb0042C09f CUGI Rice BAC genomic Length = 495 56% to 71B23
same as AP004462.1 147428-146820 region
63826 MVVQLMLFFHDKFMAPMAEEPLPFVLIMIII
63733 LLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHV 63581
63580 SLRDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGS 63401
63400 SNISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMR 63221
63220 ELLGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLR 63041
63040 RVVSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLD 62861
62860 RQQEYNLTRHNIHAILM 62810 (0)
62626 DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTQMVSEDDLNNMPYLKAV 62453
62452 VKETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDAKCWENSEEFMPER 62273
62272 FMDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMD 62093
62092 KDDVDMTDQFALTMARKEKLYLIPRSHVIKIT* 61994

>AP003909.1g $P chromosome 8 clone OJ1300_E01 lone pseudogene fragment
61469 EQESDFVDILLDHQQEYNLTRHNIHAILM 61383

>AP004757.1 $F 52% to AF321858 Lolium rigidum 70% to AP003909
chromosome 6 clone P0652D10
      MAQMLAAFLLDGLISHEHGHESLGAPPQAGTMAWYSLVLMTS
79980 LLFPLLVLLVMRCYVTRSGAKLLDKLPSVPGRLPVIGHLHLIGSLPHISLRDLATKH 79810
79809 SPDMMLLHLGAVPTLVVSSSRVAQSILHTHDDIFASRPYSPIANILFYGATDVGFSPYNE 79630
79629 YWRQIKKITTTHLLTMKKVRSYVSARQREVRIVMARITEAASKHVVVDLTEMLSCYSNN 79453
79452 IVCHAVCGKFSLKEGWNQLLRELVKVNTSLLGGFNIEDYFPSFTRLAAVRRLLLSCAKA 79276
79275 HNINKRWDQLLEKLIDDHTTKHIRSSSMLNHYDEEAGFIDVLLSIQHEYGLTK 79117
79116 DNIKANLAAMLMAGMDTSFIELEYAMAELMQKPHVMGKLQAEVRRVMPKGQDIVTEEQLG 78937
78936 CMPYLKAVIKETLRLHPPAPLLMPHLSISDCNINGYTIPSGTRVIVNVWALARDSN 78769
78768 YWENADEFIPERFIVNTLGDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLV 78601
78600 YRFDWELPADQAAKGGIDMTETFGVAVHRKEKLLLIPHLHLR* 78472

>AP004757.1 $P chr 6 Pseudogene fragment last exon similar to AP003909
103765 FAIATIEIMLANLVYRFDWEILVDQAAKGGIDMTEAFGLAVHRKEKLLLVSWLPQD* 103595

>AP005285.1 gene 1 $F  43% to 81D2 (one frameshift)
MVDAMSGGVLVALMVLLLVAAPALLSQLERRRRSPSGPVALPVV (1?)
GPRGSPQNREPHPAPLP
98846 AQRPGAPVICRRFGSPPLAVVSSAPAAEECLGPHDLAFADKPRLPSGEILSYEWSTMGTA 98667
98666 RYGPYWRHIRRITVTELLSAHRVQHFAGVNAREVRAMARRLYRRAAAAAAS 98514 (fs)
98512 RVELKSRLFELFMNIMMAMICDRTFYGDGDDEVSEEARWFRSVVKETMELSGASTAWDF 
      LPAAARWLFARRLTRRMRELSDSRTRFYQRLITDHRTKEKTDDDNAAAGDHSPAPRRRTM 
      IGVLLSLQSKDPDACPDQLIRALCI (0) 98081 
97990 GSLQAGTETSAAVVEWAMSLLLNNPGAMARARGEIDACVGQPAARLLEAADLPKLHYLRCVVMETL 97793
97792 RLYPPVPLLAHESSADCDVAGFHVRKGTMLLVNTFAIHRDPQVWDEPESFFPDR (2) 97631
94075 FADGQNEAKMVIPFGMGRRGCPGENLAMQMVGLTLGTLIQCFDWERVGEELEDMGESSG 93899
93898 ITMPKKLPLEAFYQPRACMVHLLSS* 93821

>AP005285.1 gene 2 (partial)   sequence gap with C-helix (about 47 amino acids) 81D like
108306 MAVDAMFGSVAVALLAVVVAAAALRRWRRRRRRGGGRPLPGPVALPVVGHLHLFRRPLHRT 108124
108123 LARLAARHGAAVMGLRFGSRRVAVVSSAPAAEECLGPHDLAFANR (?) 107989
sequence gap with C-helix (about 47 amino acids) lower case sequence shown 
from ortholog aaaa01004834.1a

prlpsgeilayewstmgtasygpywrhirriavtellsahrvqhfad 

107525 VNVREVRALARRLYRRAAAAAAAGARTRVELKSRLFELLMNTMMSMICERTFYGA (2) 107361
       DDDEVSEEARWFRSVVKETMELSGASTVWDFLPAPARWL 107169
107168 DAGRMTRRMRELSDSRTRFLQRLIDDQRKDMDADSDDHAPAKRRTMIGVLLSLQRKD 106998
106997 PDSCPDQLIRSLCI (0) 106956
106869 GSLQAGTDTSAATVEWAMSLLLNNPGAMARARGEIDACVGQPAARLLEAADLPKLHYLRCV 106687
106686 VMETLRLYPPVPLLAPHESSADCVVAGFHVPQGTMLLVNTFAIHRDPQVWDEPEAFIPDR (2) 106507
106178 FADGKNEGKMVIPFGMGRRRCPGENLGMQMVGLALGTLIQCFDWERVGEELVDMRECSG 106002
106001 LTMPKELPLEALYQPRASMVDLLTKI* 105922

>BI806420.1 (partial) clone S063A11. 57% to AL606630.1 54% to 81D2 many frameshifts ortholog of aaaa01004834.1c = AP005285.1 at 115000 region 

IMEPLRLYPRPPLLVPSESSSXXTVXXXXXVPRGTLLLVNTCDIHRDPHIWDEPTSFIPERFEDGRSEGKMAI
PFGMGVRKCPAENLGMQMVALALGTMIQCFEWERVGEELVDMTEGSALTMPKKVPLEAFYQPRASVMHLLS*
This is between gene 3 and gene 4 on AP005285.1 (missed earlier)
There is no N-terminal (gene 4 might be the N-term but it is opposite orientation and 27000 bp away. It might have been split by a rearrangement)
The seq below is 100% identical to aaaa01004834.1c ortholog
Even the pseudogenes are highly conserved
1      SLEAGTGTSTDTIEWAMSLLLNNPDVMRKARDEIDAFIGQPVRLLEADDLPKLQYLRCII 60
       SLEAGTGTSTDTIEWAMSLLLNNPDVMRKARDEIDAFIGQPVRLLEADDLPKLQYLRCII
115879 SLEAGTGTSTDTIEWAMSLLLNNPDVMRKARDEIDAFIGQPVRLLEADDLPKLQYLRCII 115700

61     MEPLRLYPRPPLLVPSESSSXXTVXXXXXVPRGTLLLVNTCDIHRDPHIWDEPTSFIPER 120
       ME LRLYP  PLLVP ESSS  TV     +PRGT+LLVNT DIHRDPHIWDEPTSFIPER
115699 METLRLYPPAPLLVPHESSSDCTV-AGFHIPRGTMLLVNTFDIHRDPHIWDEPTSFIPER 115523

121    FEDGRSEGKMAIPFGMGVRKCPAENLGMQMVALALGTMIQCFEWERVGEELVDMTEGSA 179
       FEDGRSEGKMAIPFGMG RKCPAENLGMQMV L LGTMIQCFEWERVGEELVDMTEGS 
115436 FEDGRSEGKMAIPFGMGRRKCPAENLGMQMVGLGLGTMIQCFEWERVGEELVDMTEGSG 115254

180    LTMPKKVPLEAFYQPRASVMHLLS 203
       LTMPKKVPLEAFYQPRASVMHLLS
115253 LTMPKKVPLEAFYQPRASVMHLLS 115182


>AP005285.1 $F gene 3 one frameshift 43% to 81H1 very similar to BI806420.1 15 diffs
114128 MDALLIALFLLLLIALMETARVRRSGTQRRAGNVPPPPPEPAG
       LPLVGHLHLFRKPLHRTLARLAARHGGAVFGLRLGSRRVA 
       VVSSAPAAEECLGAHDVAFADRPRLPSGRILSYDWSTMGTASYGPYWRHVRRVAVTEILS
       ARRVQHFADVHVREARAMARHLHRAAVRHGVGGAARVRVELKSRLFELLMNTMMAMICDKTYYGD (2) 113505
113412 DDDGKVSKEARWFREMVEETMALSGASTVWDFLPAALRWVDVGGVGRRLWRLRESRTRF
       LQGLINDERKEMEQEQGGDRAQPAARRRTMIGVLLSVQRQDPDACPDQLIRSLC 113074
111518 AGTDTSADTIEWAMSLLLNNPNVMRKARDEIDAFIGQP (fs) 111405
111405 SRLLEASDLTKLQYLQCIIMETLRLYPPAPLLVPHEASTDCSIAGFHITRGTMLLVNTFA
       IHRDPQVWNEPTSFIPER (2)
       FENGRSEGKMAIPFGMGRCKCPAENLGMQMVGLALGTMIQCFEWERVGEELVDMTEGSG
       LTMPKEVPLQAFYQPRASLMHLLY* 110842

>AP005285.1 gene 4 (partial)   one frameshift, runs off end of clone missing last two exons
38% to 81H1 same as gene a on AP004022.1
AP004022.1a comp(2300-3335) chromosome 2 clone OJ1126_B06, 52% to 705A21
1-321 runs off beginning of contig about 1639 cannot extend 10/11/01
142199 MDTSTLLIALTLVLLLLLLTARRRRSGRLRLRLPPEPPGLPLVGHLHLFRKPLHRTLAR 142375
142376 LAARHGAVFRLRLGSRRVAVVVSSAPAAEECLGAHDVAFAGRPRLPSAGILSYGWSTMG 142552
142553 TAAYGPYWRHVRRVAVAEILSAHRVRQFAGAHAREARATARRLCRAASRQRHGAGAAAGR 142732
142733 VRVELKSRLFELLMNTMMAMICDKTYYGA (2) 142816
142893 DDDGEVSEEARWFREMVEETMALSGASTVWDFLPAALRWVDVGGVGRRLWRLRESR 143060
143061 TRFLQGLIDDQRKEMEHDGDGRELPAAAARPRSMIGVLLSVQR 143189 (fs)
143191 QDPEECPDQLISSLCI (0) 143238 end of clone

>AP004022.1b $F chromosome 2 clone OJ1126_B06, 54% to CYP92A5
C71836        58% TO 706A1  9/97 344-394 I-K helix REGION opposite end =
C98588 70% identical to AQ327338 54% to 71B4 C-term 53% to 92A2
64% to D24290 61% to AP003870.1 52% to aaaa01010885.1
59881 MEVQELVPSPWSSSSSFLVLVLATLLFVAAFLRRRQGARRKYNIPPGPRPWPVIGNLNLIGALPYRSIRDL 59669
59668 SRRYGPLMSLRFGSFPVVVGSSVDMARYFLRANDLAFLDRPRTAAGRYTVYNYAGVLWSH 59489
59488 YGEYWRQARRLWVTELLSARRLASTEHVRAEEVRAMLRGLSRRAGAGTAVVLKEHMLMVT 59309
59308 LNVISRMVFGKKYIVEEGEGSSPTTAEEFR 59219 (2)
59007 WMIEEIFFLNGVFNIGDMVPWLGWLDPQGYIGRMKRLGGMFDRFLEHILDEHVERRRR 58834
58833 EGDGFAARDMVDLLLQFADDPSLKVPIQRDGVKAFIL 58729 (0)
57247 ELITGSTDTTSVSVEWAMSEVLRNPSVLARATDELDRVVGRRRLVAEGDIPNLPYLDA 57074
57073 VVKESMRLHPVVPLLVPRVSREDAFSVSVAGAAASYDIPAGTRVLVNVWAIGRDPAVWGD 56894
56893 DAEEFRPERFAAGGERGGVDVKGQDFELLPFGSGRRMCPGFGLGLKMVQLTLANLLHGFA 56714
56713 WRLPGGAAAEELSMEEKFGISVSRLVQLKAIPEPKLPAHLYDE* 56582

>AQ573853 nbxb0085A03r (partial)   50% to 71A24 AQ691042 nbxb0086M20r
AQ795917.1 nbxb0058F03f CUGI Rice BAC genomic clone Length = 684
No indica ortholog found
MAARTWLWLLLSPLILLLLHYALALLTARRARKNPLPPSPPALPFIGHLHLIGALPHVSLCCLAT 
KHAPDLMFLRLGTSLPVLVASSPCAAEAILRTHDDVFASRPRTVLADIIFYGSRDIGFAPYGEDWRQAR

>AC113337.1 $F (japonica cultivar-group) cultivar Nipponbare clone OSJNBa0061H20, 
from chromosome 10
AC074355.2 Oryza sativa clone OSJNBa0071I20, gene 1 43% to 71A13
AQ288798 65-164 region C-helix 54% to 71A12 same as AC074355.2
AQ840770.1 nbxb0071I20f CUGI Rice BAC genomic cloneLength = 754
AQ840078.1 nbxb0051B18f CUGI Rice genomic cloneLength = 694
similar to lotus 71D
AQ865944.1 nbeb0026D10f BAC genomic Length = 473 59% to 99A1 69% to AP004000.1
23542 MANLIYYSLLIILPFLLLINFYKAMFSSRKQAGRLPPCPWQLPIMGSIHHLIGDLPHRSL 23721
23722 HDLSRRYGPVMLLKFGQVPFIIVSSPEAAKDIMKTHDSIFAMRPQSEIMKIITKRGQGLV 23901
23902 FAPYDDQWRQLRKICIRELLCAKRVQSFCAIREEEAARLVKSISSDQAHLVNLSKKLADY 24081
24082 ATDAAIRIITGTRFENQEVRDKFQYYQDEGVHLAASFCTANLCPSLQLGNTLSRTARKAE 24261
24262 IYREGMFAFIGGIIDEHQERRAQDMYHKEDLIDVLLRIQQEGSLESPVSMETIKFLIF (0)
      DILAGGSETVTTVLQWAMTELMRNPTVMSKAQ 24621
24622 DEVREVFKWKKMVSNDDINKLTYLQFVIKETVRLHTPGPLFMRECQEQCQVMGYDVPKGT 24801
24802 KFLLNLWSISRDPKYWDDPETFKPERFENDARDFKGNDFEFIPFGAGRRMCPGMLFGLAN 24981
24982 IELALANLLFYFDWSLPDGVLPSELDMTENFGVTVRKKEDLLLHASLYAQLSC* 25143

>AC078894.1 $P chromosome 10 clone OSJNBa0096G08 12 unordered pieces starts 123
probable pseudogene fragment 47% to 71B1
51119 LRRICVLELFSAHRV*SLHHVREEEAAPLVRVVADIRSPLGP 50994

>AP004233.1 $F chromosome 1 clone OSJNBa0065J17 50% to CYP71C4
= AQ857130 duplicate of the AP004232 gene at 27203-29726 
21862 MEQAAGLVYQLFQHEMFPWTFSVLALFPFLLLVLHYLATNHRTPTTCKETKNHHPP 
21694 PPSPPRLPIIGHLHLIGGLLHVSLRELAHRYGPDLMLLHLGQVPNLIVSSPRAAEAVLR 21518
21517 THDLVFASRPYSLIADILLYGPSDVGLSPYGE*WRRRIITTHLLTNKKVRSYRVAREE 21344
21343 E 21335 (0)
      VHKVMAKVHELSTKGMAVDMTELFSTFSNDLICRLVSGKNFQGEGRNKLFRQLFKAN
      SVLLAGFNLKDYYPGLARLKAVSMVMCAKARNTRKLWDELLDEIIDERMS
      KQQCEHDEGNDQDEMNFVNVLLLQEQGITREHLKAILV (0)
20004 DMYQAGTETSSVVLVFAMAELMQKPHLMAKLQAELRTTIPKQGHELITERDLTDMTYLKA 19825
19824 VIKETLRLHPPTPLLLPHLAMADCNIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 19645
19644 RFVDDGSAANVDFIGTDFQFLPFGAGRRICPGINFASASMEIILANLLYHFDWDVSAEAA 19465
19464 IDKDGIDMAEAFGLSVQLKEKLLLVPVDYKDGMQDSAVILL* 19339

>AP004232.1 $F chromosome 1 clone OSJNBa0051H17 like CYP71C4
56483 MEQAAGLVYQLFQNEMFPWILALFPFLLLALHYLATNHRTPTTCKETRNHLSPPSP 56650
56651 PRLPIIGHLHLIGDLPHVSLRELAHRYGPNLMLLHLGQVQNLVVSSPHAAEAVLR 56815
56816 THDHVFASRPHSLIGDILLYGPSDVGLSPYGEQWRRSRRIVTTHLLTNKKVRSYHVAREE 56995
56996 E 56998 (0)
57929 VHKVMTKVHELSTKGMGVDMFELFSTYSNDLICRLVSGKNFQGDKGRNKMFRQLFKANY 58105
58106 VLLAGFNLEDYYPGLARLKAVSWVMCAKARNTRKLWDELLDEIINDRMSKQPCEHDRGND 58285
58286 DQDEMDFVDVLLLQERGITRDHLKAIL 58366 (0)
58462 DMFQAGTETTSVVLVFAMAELMQKPHLMAKLQAELRTNIPK*GRELITECDQTNMTYLKA 58641
58642 VIKETLRLHPPSPLLVPHLAMADCDIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 58821
58822 RFVDGGSAANVDFIGTDFQFLPFGAX 58896 frameshift
58899 RRICPGINFASASMEIILANLLYHFDWDVSAEAAIDKDGIDMAEAFGLSVQLKEKLLLVP 59078
59079 VEYKGSVQDSAVIL* 59123

>AC109595.1 $F chr 5 39% to 71As 40% to 71Bs new 71 subfam? 44% to 71Cs
clone OJ1212B02, Length = 126962
72095 MEIELSPVLLLLPFLLLGFLYLTGGVLRSGGNARRRLAPAPRGLPVIGNLHQVGALP 71925
71924 HRALRALAAATGAPHLLRLRLGHVTALVASSPAAAAAVMREHDHVFATRPYFRTAEILTY 71745
71744 GFKDLVFAPYGEHWRHARRLCSEHVLSAARSHRYGPMREQEVALLVNAIRTEAAAAAV 71571
71570 DVSKALYAFTNAVICRAVSGRLSREDEGRSELFRELIEENATLLGGFCVGDYFPALAWA 71394
71393 DAFLSGFAARACRNLRRWDELLEEVIAEHEARLRGGDDGGGEEHREEDFVDVLLALQE 71220
71219 ESQRHDGSFKLTRDIIKSLLQDMFAAGTDTSFITLEWAMSELVKNPAAMRKLQDEVRRGG 71040
71039 GATTAATPYLKAVVKETLRLHPPVPLLVPRECARDTDDDATVLGYHVAGGTRV 70881
70880 FVNAWAIHRDAGAWSSPEEFRPERFLPGGGEAEAMDLRGGHFQLVPFGAGRRVCPGMQFA 70701
70700 LATVELALASLVRLFDWEIPPPGELDMSDDPGFTVRRRIPLRLVAKPVGSEDDK* 70536

>AP004326.2a $P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence
Length = 102983
4 genes 71B like
Gene 1 pseudogene 71 family
67989 LPPVPWPLPVIGSMH*LLGSLPHH 68060 frameshift with deletion
68060 RPACAVELLSPRRARSFRRVREAEPARLVRAVAASPAWPLVNVVGGEHVAAMMTAV 68227
68228 GARP 68239 frameshift with small deletion
68238 RCPRQEEYLEELGKVAKLAAGFNLVDLFPESRLVRAAQAAHGKIHSIMDAMVQ 68396
68397 DHLKAMEERREEVADGVVDDGDGDGADRDEELLSILLRFQRDGGLGITLTNGNHQRDS 68570 (0)
68886 GILAGGSDTTTTTVMWAMSELLRCPRAMQ 68972 frameshift with deletion
69023 YMQLVIKETLRLHLPFPLLFPRLCTETCKIMGFDVPKGTIVIVNNWAISRDERCWEDAED 69202
69203 F*PERFEHDDTDYNGTYFQFLSGGFGRRMFPDFIFAQFNIEIALANLLYHFDWELPCSEN 69382
69383 RMELDMTESAGLTASRLTDLFG* 69451

>AP004326.2b $F genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence
Gene 2 no good matches in NR 79% to AP004326.2c 
71860 MAGIVDTAAFCT
71896 LLCLLLTLVVFKLKTATSSRHNAGVNLPPGPWALPVIGSIHCLLGSLPHHAMRELSRR 72069
72070 YGPVMLLRLGHVRTLVLSSPEAAREVMKTHDAAFATRAVTPTASILTYGARDIVFAPF 72243
72244 SKHLRELRKLCTLELLSPRRVRSFRHVRDEEAARLARSVAAAAPAVVNVSELVKI 72408
72409 MANNIIMTAIIGDTCPQRDEYLEALDKTMDLMNGFNLIDLFPGSRLARVLGARSLRATKR 72588
72589 VHQKLHQITDTIIQGHEIIKDGSVGDDTIQETVGTHNMHGHGHKCEDILDVLL 72747
72748 RFHRDGGLGITLTKEIVSAVLF 72813 (0)
73327 DLFAAGSETTSTTIIWAMSELVRTPHVMERAQSEIRQVLQGKTVVSEADIEGRLHYL 73497
73498 QLVIRETLRLHPPVPFLIPRLCSEANSKIMRYNIPQGAMVLVNISAIGRDEKIWKNANEF 73677
73678 RPERFKDDMVDFSGTDFRFIPGGAGRRMCPGLTFGLSNIEIALASLLYHFDWKLPNDASS 73857
73858 CKLDMRETHGVTARRRTELLLKATPLYT* 73944

>AP004326.2c $F genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence
Gene 3 39% to 71B11
77487 MAGIMDSTTASYYTTLLCG
77544 ALLLAAVVFKLKTAAAFSRHNAGVNLPPGPWALPVIGSIHCLLGSLPHHAMRELSRRY 77717
77718 GPVMLLRLGHVRTLVLSSPEAAREVMKTHDVAFANRAVTPTASIDIVFAPFG 77873
77874 KHLRELRKLCALELLSPRRVRSFRHVREEEAARLARSVAAAASASSAVNVSELVKIM 78044
78045 TNDVTMRAIIGDRCPQREEYLEALDKTMDLLAGFNLVDLFPGSPLARVLGGRSLRTTKRV 78224
78225 HEKLHQITEAIIQGHGIKDTVGDEHHECEDILDVLLRFQRDGGLGITLTKEIVSA 78389
78390 VLF 78398 (0)
78554 DLFAGGSETTSTTILWAMSELMRSPHVMEQAKYEIRQVLQGKAMVSEADIEGRLHYLQLV 78733
78734 IKETLRLHPPVPIVIPRLCSKPNSKIMGYDIPQGTSVLVNVSAIGRDEKIWKDVNEFRPE 78913
78914 RFKDDIVDFSGTDFRFIPGGSGRRMRPGLTFGVSNIEIALVTLLYHFDWKLPSETDTHEL 79093
79094 DMRETYGLTTRRRSDLLLKATPSYARLGWSTNMQIYSVKCLVYE* 79228

>AP004326.2d $P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence
Gene 4 pseudogene
81031 DFFFCGLETTTTATIWAI*EFIKNPHAVEKAQSDIRKILGGKSIVEEADIEGQLHYFQMVN 81213 frameshift
81213 QETLRLHPPVPLLLPRLWSEPCKIMGYDIP 81302 frameshift
81304  KNTAIFVNTWALGR 81345 frameshift 
81344 KIKNTGLMQVSSGLKY
81393 SRMGIVDFNGLDFRFLPCGAGRRICLGLMFELSDIELTLASLLYHSSWRLPTRSYSNK 81566
81567 LDMTEANGITTHRRIDIWLEATPFVPR 81647

>AP004175.1 $P chromosome 2 clone OJ1006_B12 pseudogene fragment 
94% to AC078894 51119-50994
64520 LRRICMLELFSAHRV*SLHHVREEEAARLVRVVA 64419

>AC120537.1c $F chromosome 3 clone OSJNBb0042N11
AQ573952 nbxb0083G09r 60% to AQ259669 52% to 99A1 51% to 71B23
BM039053 clone V013G04.Length = 527
103769 MEMTLPLLGAAVVLAAFLLFFLVKNNRCCWSPAAERR
103880 LRLPPGPWRLPLVGSLHHVLLSRHGDLPHRALRELAGRYGALMLLRFGAVPTLVVSSAE 104056
104057 AAREVLKTHDACFASRHMTPTLAVFTRGGRDILF SPYGDLWRQLRRICVLELFSARRVQS 104236
104237 LRHVREDEAARLVRAVAEECAIGGGGGAVVPIGDMMSRMVNDSVVRSAIGGRCARRDEFL 104416
104417 RELEVSVRLTGGFNLADLYPSSSLARWLSGALRETEQCNRRVRAIMDDIIRERAAGKDDG 104596
104597 DGEDDLLGVLLRLQKNGGVQCPLTTDMIATVIM (0) 104695
105190 EIFSAGSETASTTLEWAISELVRNPKVMDKAQSEVRKLFEGQDNLTEDDMSRLS 105351
105352 YLHLVIRETLRLHAPAPFLLPRECREQCNVMGYDITEGTRVLVNAWAIARDTRYWEDPEI 105531
105532 FKPERFNANLVDFKGNDFEYIPFGSGRRVCPGITLGLTSMELVLASLLYHFDWELPGGKR 105711
105712 CEEIDMSEAFGITVRRKSKLVLHATPRVPCLH* 105810

>AC120537.1b $F chromosome 3 clone 
AQ869247.1 nbeb0034D08r CUGI Rice BAC genomic Length = 447 53% to 99A1
AZ130570.1 OSJNBb0104D19r CUGI Rice BAC genomic Length = 327
80150 MEVSLPLLIGVVLAFLLLFVLVNIKNSCRSWWPPPEKEKKKLRLPPGPWQLPLVGSLHHV 80329
80330 LLSRHADLPHRALRELAGKYGPLMMLRFGAVPTLVVSSAEAAREVLKTYDAAFASRYL 80503
80504 TPTLAVLSRGGRDILFSPYCDLWRQLRRICVHELLSARRVQSLRHGREDEAARLVRSVAA 80683
80684 ECAARGGAAVVNVGELISRAVNDSVVRSAVGARSARRDEFVRELDESVRLSGGFNLADLY 80863
80864 PSSWLARRLSGAMRETERCNRSLMAIMDDIIREHGDGEEDLLGVLLRLQRNGDVQCPLTT 81043
81044 DLITNVVL (0) 81067
81865 DMFAAGSETSSTTLEWALTELVRNPHIMEKAQSEVREIFRGENKLTEEMMDKLSYLRLVI 82044
82045 RETLRLHLPVPFLLPRQCREPCSVMGYDIPVGTKVLVNAWAIARDNQYWDDPEVFKPERF 82224
82225 ENNRVDFKGIDFEFIPFGAGRRICPGIALGLANIELMLASLLYHFDWEFLDRDRNDEIDL 82404
82405 SETFGITAKRKSKLMVYATQRIPCLG 82482

>AC120537.1a chromosome 3 clone pseudogene fragment
2543 ARRVQSPRHVR*D*QAGYLVHAVVGECAPGGAGAVVPISEKISRMVNDSVVRPAIGSRCA 2364
2363 RRDEFLHVQARGLRQARGRVQLGRPVPIVVASELAQRRAAVGRPSVAAGAFARCGRPAET 2184
2183 FFNMDNLRTHDTYRKKNHSGNSQHCTAFSALSFSELQLKMTIWQSHHYKLPINLREIFS   
2006 SAGSETLNDTLVGNI*ANEKYPQVMQKAQTEVREKFRG*DKLIKDDMNRLSYLHL 1846
1845 VIQETLRLH 

>AL662933.1 $F OSJN00145 chromosome 4 clone OSJNBa0096F01, Length = 141695
63% to 99A1 sorghum 
86110 MEINSEATVTLVSVVTLPILLALLTRKSSSKKRRPPGPWNLPLVGGLLHLLRSQPQVALRDLAGKYGP 85907
85906 VMFLRTGQVDTVVISSPAAAQEVLRDKDVTFASRPSLLVSEIFCYGNLDIGFAPYGAYWR 85727
85726 MLRKLCTVELLSTKMVRQLAPIRDGETLALVRNIEAAAGGKKPFTLATLL 85577
85576 ISCTNTFTAKAAFGQACGGELQEQFLTALDEALKFSNGFCFGDLFPSLRFIDAMTGLRSR 85397
85396 LERLRLQLDTVFDKIVAQCESNPGDSLVNVLLRIKDQGELDFPFSSTHVK 85247
85246 AIIL (0?)
84901 DMFTGGTETTSSTTEWLMSELMRNPEVMAKVQAEVRGVFDNKSPQDHEGLLENLSYMKLV 84722
84721 IKETLRLNPVLPLLLPHLCRETCEIGGYEIVEGTRVLINSWAMARSPEYWDDAEKFIPER 84542
84541 FEDGTADFKGSRFEYLPFGTGRRRCPGDIFAMATLELIVARLLYYFDWSLPDG 84383
84382 MQPGDIDMELVVGATARRKNHLQLVASPYKPISMQS* 84272

>AL713947.1 $F chromosome 12 clone Monsanto-OJ1003_A04, 
AQ259099 41-60 N-term similar to 71B7 no ortholog found in indica
39% to 76C2
MASGLWFLGISLAVLLLCYVGTNRRGDGQRPPGPRTLPIVGNLLDLRGGNLHHKL 
ASLAHAHGPVMTLKLGLVTTVFISSRDAAWEAFAKHDRRLAARTVPDTRRALAHAERSM 
VWLPSYDPLWKTLRSIAA (fs) 45837
45835 THVFSPRSLGVARSARERKVHDMVDSFRRRAGQEVDIGQVLYHGMFDLLANVLLSVD
AHPNLRDLMEDIVAILAKPNASDFFPLLRPLDLQGIRHWTAIHMSRVLHILDSIIDC 
RLAQGTDDQCKDVLDSLLVLMSTGKLSRRDVKILLFDILAAGTETTKITVEWAMAELL 
RNPNVMATTRAEMKAALGGNGTITEADVVNLPYLQAAVKESMRLHPVAPLLLPHLVVEDG 
VRIGGYAVSKGTTVIFNSWAIMRDSTAWERPDDFLPDRFLGKTELDLWGKQAKFIP 
LGSGRRLCPALPMVELVVPFTVASLLHAFEWHLPKGMSAEEVDVTERYTSNDILVMATPLKAVPLIVT* 

>AL713951.1 $F chromosome 12 clone Monsanto- 39% to 83B1
AF088221 BI305808.1 49% to 76C6 mRNA
44616 MSLALLVLSAAYVLVALRRSRSSSSKPRRLPPSPPGWPVIGHLHLMSGMPHHALAELART 44437
44436 MRAPLFRMRLGSVPAVVISKPDLARAALTTNDAALASRPHLLSGQFLSFGCSDVTFAPAG 44257
44256 PYHRMARRVVVSELLSARRVATYGAVRVKELRRLLAHLTKNTSPAKPVDLSECFLNLAND 44077
44076 VLCRVAFGRRFPHGEGDKLGAVLAEAQDLFAGFTIGDFFPELEPVASTVTGLRRRLKKCL 43897
43896 ADLREACDVIVDEHISGNRQRIPGDRDEDFVDVLLRVQKSPDLEVPLTDDNLKALVL (0)
      DMFVAGTDTTFATLEWVMTELVRHPRILKKA 43537
43536 QEEVRRVVGDSGRVEESHLGELHYMRAIIKETFRLHPAVPLLVPRESVAPCTLGGYDIPA 43357
43356 RTRVFINTFAMGRDPEIWDNPLEYSPERFESAGGGGEIDLKDPDYKLLPFGGGRRGCPGY 43177
43176 TFALATVQVSLASLLYHFEWALPAGVRAEDVNLDETFGLATRKKEPLFVAVRKSDAYEFK 42997
42996 GEELSEV* 42973

>AP003544.1 $P chromosome 6 clone P0599C12 same as AP003686.1 8668-8037 pseudogene of AL713951.1
this gene matches a barley EST BF255745.2 75% and a sorghum EST BE354971.1 79% 
so there must be a functional copy of this gene in rice  
107880 GSMPAVVISKPNLARPALTTNDAVLASRQHLLNG*FLSF 107764 frameshift
107762 GCSDVTFAPAGPYHRM 107715 frameshift
107713 QMARGVEVSELLSAHHVAMYGVVRVKELQRLLAHLTKNTSSAKPIDLSECFLNLANDVLCRVAF 107521
107520 GRRFPRDEGDKLSAVLANAQDLL 107452 frameshift
107452 AGFTISDFFLELEPVASTVTGLCHRLKKCLADLCEACDVIVDVHISGNRQRIPSDREEDFVDVLLRVQ 107249

>AC105746.1 $F chromosome 10 clone OSJNBb0086I08, Length = 123484
55% to AC074105.1 chr 10 40% to 76C2
New rice seq 
1-115
      MSAFCWSATRSPSRQRLIENGFLFACKHHALTPARPLLPPLQ
31481 RIAAARRRLPPGPCPLPLIGNLLDIGXLPH
      RSFARLAERYGPLMTVRLGAATCVVASSPATARAV 31288
31287 LQTHNASLAGRGRQDAWHAGG
116-306
102155 HAENSVFVLPPGRKWRLLRKLGAAHLFSRRKLAELAPLRDEIVGGLLRRVAERADHRGG 101979
101978 APVNVGRLALAANVELLWRSVFSTRLDAATLDVLCDVAREAAVLLGTPNVSDFFPA 101811
101810 VAALDLQGLRRRLAELMKNTYRLVDAQIDHRMRCRELRGGRGGEAMDLLDVLLDMSE 101640
101639 QEREDGDDEVINR
307-504
101097 DLFVGGSDSTATTVEWAMAELLQNPEIMKTLQQEIKMVLGTRSQVEESDIGQL 100939
100938 PYLQAIVKETLRLHPIVPLRLYEAERTVEIEGHTIPKGSKVIVNAWAIHQSVKVWIQP 100765
100764 EKFLPKRFITKDIDFAGRHFEFIPFGSGRHICIGLPLANRMLHMILGSLMHQF 100606
100605 KWTMPQMVNRNGLDMAEKFGLAVSMATRPNIIA 100507

>AC078944.1 (partial)   clone OSJNBa0089D15,9 unordered pieces STARTS 347 51% to 71C4
this seq is not found in version 5 of this accession
no indica ortholog found
140438 PFQGLCS KGTFRLPAPVPRLLPGRACPSSDVAGYTLPSGARVLVYVWA WADSCVRGLVRE 140259
140258 NLCQCVFRVRSAFRGVRLELFHWGQVPVWPGIAPAA*LVH*GLGS 140124

>AQ256364.1 (partial)   nbxb0016O01r CUGI Rice BAC clone nbxb0016O01r.Length = 607 
84% to 73A5 82% to AP003446 probable CYP73A rice gene
ortholog to aaaa01002376.1 complete
AQ576280.1 nbxb0088L20r CUGI Rice BAC genomic clone
AQ289078.1 nbxb0034K09r CUGI Rice BAC genomic clone
(2) RRVMEQTGEIRCAMDHILEAERKGEINHDNVLYIVENINVA (1)

>AP004859.1a (japonica cultivar-group) chromosome 2 clone OJ1756_B04
6 diffs to AP004850.1a 32815 seq identical after aa 75
AU058037.1 Rice callus Oryza sativa cDNA clone C62026_6Z. 58% to 73A5 
AU093333.1 this seq is partial but = AP004850.1b that is complete
73615 MAASAMRVAIATGASLAVHLFVKSFVQAQHPALTLLLPVAVFVGIAVGAKGGSGGDGKAP 73436
73435 PGPAAVPVFGNWLQVGNDLNHRFLAAMSARYGPVFRLRLGVRNLVVVSDPKLATEVLHTQ 73256
73255 GVEFGSRPRNVVFDIFTANGADMVFTEYGDHWRRMRRVMTLPFFTARVVQQYKAMWEAEM 73076
73075 DAVVDDVRGDAVAQGTGFVVRRRLQLMLYNIMYRMMFDARFESVDDPMFIEATRFNSERS 72896
72895 RLAQSFEYNYGDFIPILRPFLRGYLNKCRDLQSRRLAFFNNNYVEKRR 72752
      small sequence gap
72588 WSIEWALAEVVNHPAVQSKVRAEINDVLGDDEPITESSIHKLTYLQAVIKETLRLHSPIP 72409
72408 LLVPHMNLEEAKLGGYTIPKGSKVVVNAWWLANNPALWENPEEFRPERFLEKESGVDATV 72229
72228 AGKVDFRFLPFGVGRRSCPGIILALPILALIVGKLVRSFEMVPPPGVEKLDVSEKGGQFS 72049
72048 LHIAKHSVVAFHPISA 72001

>AP004859.1b $F (japonica cultivar-group) chromosome 2 clone OJ1756_B04
AU058037.1 Rice callus Oryza sativa cDNA clone C62026_6Z. 58% to 73A5 
89777 MAASVVRVAIATGASLAVHLFVKSFLQAQHPALTLLLPVAVFAGIAVGAKGGNGGDGKAP 89956
89957 PGPAAVPVFGNWLHVGNDLNHRFLAAMSARYGPVFRLRLGVRNLVVVSDPKLATEVLHTQ 90136
90137 GVEFGSRPRNVVFDIFTANGADMVFTEYGDHWRRMRRVMTLPFFTARVVQQYKAMWEAEM 90316
90317 DAVVDDVRGDAVAQGTGFVVRRRLQLMLYNIMYRMMFDARFESVDDPMFIEATRFNSERS 90496
90497 RLAQSFEYNYGDFIPILRPFLRGYLNKCRDLQSRRLAFFNNNYVEKRR 90640
      NKLRCAIDHILEAEKNGELTAENVIYIVENINVAAI 90894
90895 ETTLWSIEWALAEVVNHPAVQSKVRAEINDVLGDDEPITESSIHKLTYLQAVIKETLRLH 91074
91075 SPIPLLVPHMNLEEAKLGGYTIPKGSKVVVNAWWLANNPALWENPEEFRPERFLEKESGV 91254
91255 DATVAGKVDFRFLPFGVGRRSCPGIILALPILALIVGKLVRSFEMVPPPGVEKLDVSEKG 91434
91435 GQFSLHIAKHSVVAFHPISA 91494

>AP004850.1a $F (japonica cultivar-group) chromosome 2 clone OJ1342_D02
98% identical to AP004850.1b  100% to AP004859.1b
32815 MAASVVRVAIATGASLAVHLFVKSFLQAQHPALTLLLPVAVFAGIAVGAKGGNGGDGKAP 32636
32635 PGPAAVPVFGNWLHVGNDLNHRFLAAMSARYGPVFRLRLGVRNLVVVSDPKLATEVLHTQ 32456
32455 GVEFGSRPRNVVFDIFTANGADMVFTEYGDHWRRMRRVMTLPFFTARVVQQYKAMWEAEM 32276
32275 DAVVDDVRGDAVAQGTGFVVRRRLQLMLYNIMYRMMFDARFESVDDPMFIEATRFNSERS 32096
32095 RLAQSFEYNYGDFIPILRPFLRGYLNKCRDLQSRRLAFFNNNYVEKRR 31952
31797 NKLRCAIDHILEAEKNGELTAENVIYIVENINVAAIETTLWSIEWALAEVVNHPAVQSK 31621
31620 VRAEINDVLGDDEPITESSIHKLTYLQAVIKETLRLHSPIPLLVPHMNLEEAKLGGYTIP 31441
31440 KGSKVVVNAWWLANNPALWENPEEFRPERFLEKESGVDATVAGKVDFRFLPFGVGRRSCP 31261
31260 GIILALPILALIVGKLVRSFEMVPPPGVEKLDVSEKGGQFSLHIAKHSVVAFHPISA 31090

>AP004850.1b $F (japonica cultivar-group) chromosome 2 clone OJ1342_D02
100% identical to AP004859.1a AU093333.1
48977 MAASAMRVAIATGASLAVHLFVKSFVQAQHPALTLLLPVAVFVGIAVGAKGGSGGDGKAP 49156
49157 PGPAAVPVFGNWLQVGNDLNHRFLAAMSARYGPVFRLRLGVRNLVVVSDPKLATEVLHTQ 49336
49337 GVEFGSRPRNVVFDIFTANGADMVFTEYGDHWRRMRRVMTLPFFTARVVQQYKAMWEAEM 49516
49517 DAVVDDVRGDAVAQGTGFVVRRRLQLMLYNIMYRMMFDARFESVDDPMFIEATRFNSERS 49696
49697 RLAQSFEYNYGDFIPILRPFLRGYLNKCRDLQSRRLAFFNNNYVEKRR 49840
      NKLRCAIDHILEAEKNGELTAENVIYIVENINVAAI 50094
50095 ETTLWSIEWALAEVVNHPAVQSKVRAEINDVLGDDEPITESSIHKLTYLQAVIKETLRLH 50274
50275 SPIPLLVPHMNLEEAKLGGYTIPKGSKVVVNAWWLANNPALWENPEEFRPERFLEKESGV 50454
50455 DATVAGKVDFRFLPFGVGRRSCPGIILALPILALIVGKLVRSFEMVPPPGVEKLDVSEKG 50634
50635 GQFSLHIAKHSVVAFHPISA 50694

>CYP73A35P   $P Oryza sativa (rice) 
            GenEMBL AP003446.1 March 29, 2001
                    AP003302.1  Feb. 21, 2001
            Both sequences have the same frameshifts so
            This is probably a pseudogene

MDLLFVERLLVGLLAAAVVAIAVSKLRGRKLRLPPGPTPVPVFGNWLQVGDDLNHRNLAA
LARRFGDIFLLRMGQRNLVVVSSPPLAREVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFT
VYGDHWRKMRRIMTVPFFTGKVVQRHRAGWEAEAAAVVDGLRADPAAA 168 (25 nuc. deletion and frameshift in both sequences)
175 RRRLQLMMYSNVYRIMFDRRFESADDPLFLRLKALNGERSRLAQSFEYNYGDFIPILRPF
LRGYLRICEEVKETRLKLFKDFFLEERK (2?)
NGLKCAIDHILEAQQKGEINEDNVLYIVENINVA (1)
AIETTLWSMEWAIAEL (2 nuc. insertion and frameshift in both seqs.)
VNHGEIQEKLRRELDTVLGPGRQITEPDTHRLPYLQAVVKETLRLRMAIPLLVPHMNLRD
AELAGYGIPAESKVLVNAWYLANDPGRWRRPEEFRPERFLEEERNVEANGNDFRYLPSGA
GRRSCPGIVLALPILGVTIGRLVQNFELLPPPGKDRVDTTEKGGQFSLHILKHSTIVAKPRAF*

>AP003261.1 $P chromosome 1 clone P0471B04, 
N-terminal exon of a CYP75 like sequence 70% identical to X70824 but no other part of 
the gene is found.  Identical to AP003227 87336-87082 almost identical to AP003214
132381 MELAALCTDPVVLSAAFLCLLLNLSLCSYRPPSPGGGRRLPLGPPGVPVLGALPLVGPAPHADLASLARK 132172
132171 YGPIMYLKMGTCGVV 132127

>AP003214.3 $P almost identical to AP003227 related to CYP75
165814 MELAVLCTDPVVLSGPFLCLLLHLSLCSYR 165725 (2?)
157244 RPPSPGGGRRLPPD 157203 frameshift 
157203 PPGVPVLDALPLVGPAPHTDLASLARKYGPIMYLKMGTCGVV 157078 frameshift
AASSPCAARAFL 157053

>AC021892.5 $F chromosome 10 clone OSJNBa0053D03 21 unordered pieces
          Length = 175567 65% to 75B1
62759 MDVVPLPLLLGSLAVSAAVWYLVYFLRGGSGGHAA
      RKRRPLPPGPRGWAVLGNLPQLGDKPHHTMCALGRQYGPLFRLRFGCAEVVEAATS 62487
62479 AAQFLRGHDANFSNRPPNSGAEHVAYNYQDLVFAPYGARWRALRKLCALHLFSAKALDDL 62300
62299 RAVREGEVALMVRNLARQQAASVALGQEANVCATNTLARATIGHRVFAVDGGEGARE 62129
62128 FKEMVVELMQLAGVFNVGDFVPALRWLDPQGVVAKMKRLHRRYDNMMNGFINER 61967
61966 KAGAQPDGVAAGEHGNDLLSVLLARMQEEQKLDGDGEKITETDIKALLL
61786 NLFTAGTDTTSSTVEWALAELIRHPDVLKEAQHELDTVVG 61607
61606 RGRLVSESDLPRLPYLTAVIKETFRLHPSTPLSLPREAAEECEVDGYRIPKGATLLVNVW 61427
61426 AIARDPTQWPDPLQYQPSRFLPGRMHADVDVKGADFGLIPFGAGRRICAGLSWGLRMVTL 61247
61246 MTATLVHGFDWTLANGATPDKLNMEEAYGLTLQRAVPLMVQPVPRLLPSAYGV* 61085

>AC119149.2 $F Nipponbare strain, clone OSJNBb0079E01, from chromosome 10
53% to 75B1
D48413, AU032067 AU173484.1 AU222694.1 47% IDENTICAL TO 75B1 N-TERMINAL REGION
C99304 opposite end = C19579 65% to 75B1 74% to AC021892
3 prime UTR matches AU173485 Rice root Oryza sativa cDNA clone R3314. 
Which is the opposite end of AU173484 so these are from the same gene
3 prime UTR Also = AU032964 Rice shoot Oryza sativa cDNA clone S0065_2Z.
AU032242 58% identical to C97611 47% to AA754300 47% to 75B1 I helix
74682 MEVAAMEISTSLLLTTVALSVIVCYALVFSRAGKARAPLPLPPGPRGWPVLGNLPQLGGK 74503
74502 THQTLHEMTKVYGPLIRLRFGSSDVVVAGSAPVAAQFLRTHDANFSSRPRNSGGEHMAYN 74323
74322 GRDVVFGPYGPRWRAMRKICAVNLFSARALDDLRAFREREAVLMVRSLAEASAAPGSSSP 74143
74142 AAVVLGKEVNVCTTNALSRAAVGRRVFAAGAGEGAREFKEIVLEVMEVGGVLNVGDFVPA 73963
73962 LRWLDPQGVVARMKKLHRRFDDMMNAIIAERRAGSLLKPTDSREEGKDLLGLLLAMVQEQ 73783
73782 EWLAAGEDDRITDTEIKALIL (0) 73720
65761 NLFVAGTDTTSTIVEWTMAELIRHPDILKHAQEELDVVVGRDRLLSESDLSHLTFFHAII 65582
65581 KETFRLHPSTPLSLPRMASEECEIAGYRIPKGAELLVNVWGIARDPAIWPDPLEYKPSRF 65402
65401 LPGGTHTDVDVKGNDFGLIPFGAGRRICAGLSWGLRMVTMTAATLVHAFDWQLPADQTPD 65222
65221 KLNMDEAFTLLLQRAEPLVVHPVPRLLPSAYNIA* 65117

>AP003623.1 $F chromosome 6 clone P0642B07 41% to 76C4 = AU095771
89% to CONTIG OF C72003, C99202 
D46292        42% to 71B6 and 71B8   3/95   8/95 17-91 REGION 
AU032289 29% to 76C3, 30% to 71B4 
84245 MENSQVWLLWGALSVAVLFYLSTLRRRHAGGKPLPPGPTPLPLIGNLHLAGGTSFHHKLRDLAR 84436
84437 VHGPVMTLKLGLATNVVISSREAAIEAYTKYDRHLAARATPDTFRACGFADRSMVFIPSS 84616
84617 DPRWKALRGIQGSHVFTPRGLAAVRPIRERKVGDLMAYLRAHAGEEVLLGQAMHTGLL 84790
84791 NLVSFSYFSIDIVDMGSQMARDLREVVDDIISVVGKPNISDFYPFLRPLDLQGLRRWTTKRFNRVFSIMGDI 85006
85007 IDRRLAHIRDNKPSHNDFLDSLLELMAAGKIDRVNVLDMLFEAFVAGADTMALTLEWVMA 85186
85187 ELLKNPGVMAKARAELRDVLGDKEVVEEADAARLPYLQAVLKEAMRLHPVGALLLPH 85357
85358 FAVEDGVEVGGYAVPKGSTVLFNAWAIMRDPAAWERPDEFVPERFVERAPLLDF 85519
85520 RGKDAEFMPFGSGRRLCPGLPLAERVMPFILASMLHTFEWKLPGGMTAEDV 85672
85673 DVSEKFKSANVLAVPLKAVPVLIK* 85747

>CONTIG OF C72003, C99202 (partial) 34% to 76C4 some identical runs with AU032289
34% TO 76C4 same clone = C19444 
90% to AP003623.1 opposite end of AU176462
AU176462.1 clone E10426.Length = 475 90% to AP003623.1 also AU182232.1
opposite end of C19444 CONTIG OF C72003, C99202 lower case from AP003623.1
There are two closely related genes
This part 97% to AAAA01008385.1 93% to AAAA01004667.1
ATNVVISSREAAIEAYTKYDRHLAARATPDTFRACGFADRSMVFIPSSDPQWKA
LRGIHASHVFTPRVLAAVRPIRERKVGDLIAYLRAHAGEEVLVGHAMYTGILNMVS
FSYFSVDIVDMGSQMARELREVVDDIILVVGKPNVSDFYPFLRPLDLQGLRRW
TTKRFNRVFSIMGDIIDRRLAHIRDNKPRHDDFLDSILELXAAGKIDRVQVLNMLFEGF

This part 98% to AAAA01008385.1 90% to AAAA01004667.1 from opposite end of C19444
2   FNAWAIMRDAAAWERPDEFVPERFVERTPQLDFRGKDVEFMPFGSGRRLCPGLPLAERVVP 184
185 FILASMLHTFEWELPGGMTAEELDVSEKFKTANVXAVPLKAVPVLIK* 328

*******100, 68F japonica, 13 japonica partials, 19 japonica pseudogenes
21 indica accessions in 20 seqs

>AP005254.1a $F (japonica cultivar-group) chromosome 8 = AF488522.1 PM-II
42% to 76C4 = BI808700.1
59535 MERDAWLLCAALAAATVVYYLACTTSRRAQRRRLPPGPTPLPVIGNVLSLRGNMHHALARLARER 59341
59340 YGPVMALKLGLVTAVVVSSPDAAREAFTKHDRRLAARAVPDTSRVRGFADRSMIWLPSSD 59161
59160 TRWKTLRGVVATHVFSPRSIAAARGVRERKVRDIVGYFAAHVGEVVDVGEAVYSGVVNLV 58981
58980 SNAFFSGDVVDVGEESAHGLREAVEDIILAIAKPNVSDLFPFLRPLDLQGWRRWAEKRYD 58801
58800 TVFDILDNITNSRLADASAGNHAGDFLDSLLGLMSYGKIARDDVTTIMFDVFGAGTDTIA 58621
58620 ITVQWAMAELLRNPSIMAKARTEMEDVLAGKKTIEENDTEKLPYLRAVIKEAMRLHPVAP 58441
58440 ILLPHQAAEDGVEIGGYAVPKGSTVIFNVWAIMRDPTAWERPDEFMPERFLQRAEVDFRG 58261
58260 KDFEFMPFGAGRRLCPGLPMAERVVPFILASLLHAFEWRLPDGMSAEELDVSEKFTTANV 58081
58080 LTVPLKAVPILASSASELQAS* 58015

>AP005254.1b $P (japonica cultivar-group) chromosome 8 
82126 MESEVCWLLCAALAAAMACYYLTGTMRRRSRRLPPGPTPLPVIGNVLSLRGNMHHALARL 81947
81946 AGEHGPVMALKLGLVTTVVVSSAGAAREAFTKHDRRLAARAVPDTTRARGFASRSMIWL  81770
81769 PSSDPRWKTLRGVAATHVFSPRSLAAARGVRERKVRDIVGHLAGHAGEVVDVG (fs)   81611
      QVVYGGVLN 44 amino acids missing
81582 LRRLDLQGWRRWA (fs) 81544 
81542 EKRYDKVFGIFDSVINSRLADASTGKHADAGAGDFLDSLLDLMSAGTIARDDVTSIMYDL 81363
81362 FGAGTDTIAITVEWAMAELLRNPSVMAKARAEMNHVLAGKVKATEMEENDVEKLPYLQAV 81183
81182 VKEVMRLHPAAPILVPHRAEEDDAEIGGYAVPKGSTVIFNVWAIMRDPVAWERPEEFMPE 81003
81002 RFLDMAEEVDFRGKDHKFMPFGTGRRLCPGLSMAKRVVPFILASLLHAFEWRLPAGVTAE 80823
80822 ALDLSEKFTTVNVLVTPIKAIPILASDEI* 80733

>AU172561.1 (partial) clone E2548.Length = 362 similar to AC092559.2 opposite end = AU0172562
AU172562.1 clone E2548. Length = 445 67% to AP003623.1 also BI809979.1
95-96% to pseudogene aaaa01013893.1 95% to 98% to AP005254.1b
this might be AP005254.1b 
68 MESEVCWLLCAALAAAMACYYLTGTMRRRSRRLPPGPTPLPVIGNVLSLRGNMHHALARLA
   GEHGPVMALKLGLVTTVVVSSAGAAREAFTKHDRALG 361
Gap 
MEENDVENLPYLQPVKEVMRLHPA
APILVPHRAEEDDAEIGGYAVPKGSTVIFNVWAIMRDPVAWERPKEFMPKRFLDMAEEVDFRGKDHKFMPFGTGRRLCPGLSMAKR
VVPFILASLLHAFEWRLPAGVTTEALDLNEKFTTVNVLVTPIKAIPILASDQI*

>AP005254.1c $F (japonica cultivar-group) chromosome 8 = AF488521.1 PM-I
1 diff with BM038607.1 = AP005245.1 15190-16707 41% to 76C2
103909 MEREAWLLCAALAAAMVYYYYYYLACTTRRAQRRLPPGPTPLPVIGNVLSLSGDMHHEL 
104086 ARLAREQYGPVMTLKLGLFTAVVVSSPDAAREAFTKHDRRLAARTVPDISRARGLTGRSM 
104266 IWLPSSDPRWKTLRSAVATHFFSPRSLAAARGVRERKVRDIVNYFAGHAAEVIDVGEAVY 
104446 GGVINIVSNAFFSADVVDVGKESAHGLRETLEDIILAIAKPNVSDLFPFLRRLDLQGWRR 
104626 WAEKRYDKVFGILDDKINSRLADADADASTKKHGDFLDSLLELMSAGKIACDDVTTVMFD 
104806 AFGAGTDTISNTVVWAMAELLRNPSIMAKVRAEMEDVLAGKKTIEENDTEKLPYLRAVIK 
104986 EAMRLHPVAPILLPHRAAEDGVEIGGYAVPKGSTVIFNVWTIMRDPAAWERPEEFMPERF 
105166 LQRAEVDFRGKDFEFIPFGAGRRLCPGLPMTERVVPFILASLLHAFEWRLPVGVAAETLD 
105346 LSEKFTTVNVLVTPLKAIPILASHQI* 105426

>AP005254.1d (japonica cultivargroup) chromosome 8 aa 227-504
same as AP005245.1 21717-20866 ortholog of AAAA01000493.1 99% partial
upstream seq missing in both accession numbers and in AAAA01000493.1
probable pseudogene
AAAA01000493.1 has two copies of this and a third sequence
       GGVLN
110436 LRRLDLQGWRRWAEKRYDKVFGIFDSVINSRLADASTGKHADAGAGDFLDSLLDLMSAGK 110257
110256 IARDDVTSIMFDLFGAGTDTIAITVEWAMAELLRNPSVMTKARAEMNHALAGKKTIEEND 110077
110076 VEKLPYLQAVLREAMRLHPAAPILVPHRAEEDGAEIGGYAVPKGSTVIFNVWAIMRDPAA 109897
109896 WERPEEFMPERFMDMAEEVDFRGKDYKFIPFWAGRRLCPGLLMAERVVPFILASLLHSFE 109717
109716 WRLPGGMTAESLDLSEKFTTVNVLVTPLKAIPILASKNENIRE* 109585

>AP005114.1a $F (japonica cultivar-group) chromosome 2 clone P0689H05
BI813130.1 cDNA clone J002D01.Length = 497 75% to BI808700.1 68% to AP003623
BI813468.1 K008G06 Oryza sativa mature leaf library induced by M.grisea Oryza
sativa cDNA clone K008G06.Length = 474
best rice match AP003623.1 chromosome 6 63% clone P0642B07 41% to 76C4 = AU095771
26813 MEKLKSELWMTAVATCMSLLLYLTILRRRHASGGRSLALPPGPTPLPLIGNLLCLGGIFHQTLA 27004
27005 KLARVHGPVMTLKLGLTTAVVVSSAEAAREAYTKHDQRLAARPVPDAFRANGFSERSIVF 27184
27185 SPSSDPQWKNLRGIHATHIFSPRALAALRGIRERKVRDIVGYIRTVAGEEMCVREVVHNG 27364
27365 VLNLISTSFFSMDMADVRSESARGLRGLIEDIIATVAGPNVSDFFPFLRQLDLQGLRRQT 27544
27545 GSHLGIVFGLLDDIIDRRMAETRDHPDKQRHGDFLDALISLASAGKIPRYHITYLLFDVF 27724
27725 AAGADTMTTTVEWAMAELLRNPRVMAKVRAEVTDALGGRESFDEGDAASLTYLQCVFKEA 27904
27905 MRLHPVGSILVPHLAVQDGVEIGGYAVPKGTTVIFNAWAIMRDPAAWESPDQFLPERFLH 28084
28085 KEESSSPPLELRGKDYEYIPFGSGRRLCPGLPLAERAVPFILASLLHAFEWRLPDGMSPD 28264
28265 DMDMTEKFATANVLATPLKAVPVASHTS* 28351

>AP005114.1b $F (japonica cultivar-group) chromosome 2 
BI811079.1 clone K015D02.Length = 347 57% to AC087550.2 C-helix
41% to 71B11
120645 MEDNKLILALGLSVLFVLLSKLVSSAMKPRLNLPPGPWTLPLIGSLHHLVMKSPQIHRSL 120824
120825 RALSEKHGPIMQLWMGEVPAVIVSSPAVAEEVLKHQDLRFADRHLTATIEEVSFGGRDVT 121004
121005 FAPYSERWRHLRKICMQELLTAARVRSFQGVREREVARLVRELAADAGAGGDAGVNLNER 121184
121185 ISKLANDIVMVSSVGGRCSHRDEFLDALEVAKKQITWLSVADLFPSSKLARMVAVAPRKG 121364
121365 LASRKRMELVIRRIIQERKDQLMDDSAAGAGEAAAGKDCFLDVLLRLQKEGGTPVPVTDE 121544
121545 IIVVLLF (0) 121565
122803 DMFTGASETSPTVLIWILAELMRCPRVMAKAQAEVRQAAVGKTRITENDIVGLSYLKMVI 122982
122983 KEALRLHSPAPLLNPRKCRETTQVMGYDIPKGTSVFVNMWAICRDPNYWEDPEEFKPERF 123162
123163 ENNCVDFKGNNFEFLPFGSGRRICPGINLGLANLELALASLLYHFDWKLPNGMLPKDLDM 123342
123343 QETPGIVAAKLTTLNMCPVTQIAPSSAEDAS* 123438

>AU173996.1 (partial) cDNA clone S12633. Length = 451 New seq 66% to AP003623.1 aa 413-458 no larger pieces are known
12  EVDXWGKTFEFIPFGSGRRVCAGLPMAERVVPFMLASLLRAFEWRLPDGVSAEELDM
R PGLPLPTSVPSLQGRARCGQLVALFHCINGELWAYVSPFGPHMSSWIL*

>AP003267.1 $F chromosome 1 clone P0496H05 38% to 76C4
55979 MEKSSELWLLWAVFSASLVFLYLTIRRRSGAGAGGKPPLPPGPTPLPLIGNLLDLRGGVIHDKLAALARVYG 55764
55763 PVMMIKLGLNDAVIISSRDAAREAFTRYDRHLAARAIPDTFRANGFHERSAVFLPSSDER 55584
55583 WKALRGIQGTHIFTPRGLAAVRPVRERKVRDIIAYFRDHAGEELVIRQAIHTGVLNLVSS 55404
55403 SFFSMDIAGMGSETARELREHVDEIMTVFAQPNVSDYFPFLRRLDLQGLRRSTKRRFDR 55227
55226 IFSILDDIVERRLVDRGERGGEGGASSNSSKSKHQYDGGDFLDALLELMVTGKM 55065
55064 ERDDVTAMLFEAFVAGGDTVAFTLEWVMADLLRNPPVMAKLRAELDDVLGGKDQSAIEEH 54885
54884 DAGRLPYLQAVLKESMRLHSVGPLLHHFAAEDGVVVGGYAVPRGATVLFNTRAIMRDPAA 54705
54704 WERPEEFAPERFLAREGKAPVDFRGKEADFIPFGSGRRLCPGIPLAERVMPYILALMLRE 54525
54524 FEWRLPDGVSPEELDVSEKFMSVNVLAVPLKAVPVKVIN* 54405

>AC078944.5a $P clone OSJNBa0089D15 large insertion probably makes this a pseudogene
47% to 76C2
92923  MASFLPWLPWLLAALLSVYLLDLLAHSRRLLPPGPRPLPLIGSLHLLGDQPHRSLAGLAK 93102
93103  MYGPLMSLRLGAVTTVVVSSPDVAREFLQRHNAAFASRSVPDATGDHATNSVAWLPNSPR 93282
93283  WRALRRIMAAELFAPHRLDALRRLRREKVQELVDHVARLAEREGGAAAVDVGRVAFATSL 93462
93463  NLLSSTIFSRNLTSLDDHGESMEFKEVVVEIMEAGGCPNVSDFFPAIAAADLQGWRRRMA 93642
93643  GLFARLHRVFDAVVEERLSERDAGEARKGDFLDVLLDVAARDNDSAGLDRDTLRSLFT   93816 (0)
103281 DLFAAGSDTSPSIVEWAIAELMRNPLCMIRAC 103376 probable 10.5 kb insertion here
113843 DELSQVIGSGTNIEESDIGQLPYL 113914
113915 QAVVKETFRLHPPVPLLLPRQAETTTNIAGYTIPKGARVFVNVWAIGRHK 114064
114065 DTWSQPEKFMPERFFERNIDFRGVHFELIPFGAGRRICPRMPLANRMVHLVLGSLLNQFK 114244
114245 WNLPVDIERNGIDMSEKFGLTLVKAIPLCALVTPISVESGDH* 114373

>AC078944.5b $F clone OSJNBa0089D15 
47% TO AJ237995.1 Vitis vinifera 76F2 45% to 76C3
145298 MATWILGWLLWLPVFLISLYLVDILAHSCRRLPPGPRPLPFIGSLHLLGDQPHRSLAGLAKKYGP 145492
145493 LMSLRLGAVTTVVVSSPEVAREFVQKHDAVFADRSIPDSIGDHTKNSVIWLNPGPRWRAL 145672
145673 RRIMATELFSPHQLDALQQLRQEKVAELVDHVARLAREGAAVDVGRVAFATSLNLLS 145843
145844 RTIFSRDLTSLDDRGASREFKQVITDIMEAAGSPNLSDFYPAIAAVDLQGWRRRCARLFT 146023
146024 QLHRLFDDEMDHRKLHSRHGGPGENGKEKDDFLEVLLRLGARDDDIAGLDGDTLRSLF 146197 (0)
150187 DLFAAGSDTSSSTIEWAMVELLKNTLSMGKACDELAQVVGSRRRIEESEIGQLP 150348
150349 YLQAVIKETLRLHPPVPLLPHRAKMAMQIMGYTIPNGTKILINVWAIGRD 150498
150499 KNIWTEPEKFMPERFLDRTIDFRGGDLELIPFGAGRRICPGMPLAIWMVHVVLASLLIH 150675
150676 FKWRLPVEVERNGIDMTEKFGLTLVKAIPLCALATPT* 150789

>AC116603.1 $F Nipponbare strain, clone OSJNBb0015J03, from chromosome 10, 
AQ869312.1 70% to AC078944.1 168857-163366 same as AQ913628.1
41763 MAALLLWLSWLLLSLLSIYLLDLLAHSRRCLPPGPRPLPLIGSLHLLGDLPHRSL 41605
41604 AGLAKTYGPLMSLRLGAVTTVVASSPEVAREFLQKHDAVFATRSTPDATGDHARNSVAWL 41425
41424 PPGPRWRELRKIMATELFSTRRLDALHELRQEKVAELVDHVAGLARDGTAVDIGRVAFTT 41245
41244 SLNLVARTIFSHDLTSLDDHGASKEFQRLITDVMEAVGSPNLSDFFPALAAVDLQGWRRR 41065
41064 LSGLFARLHRLFDAEMDHRRLHGMKEKDGDFLEVLLRLAARDDDTARLDGDTLRSLFT 40891
39623 DLFTAGSDTSSSTVEWAMAELLQNPISMAKLCDELRRVVGSRRRIEESEIGQLPYLQAVI 39444
39443 KETFRLHSPAPLLLPRQATRTIQIMGYTIPKGTRVLINVWAMGRDEDIWPEAGKFIPERF 39264
39263 LERTIDYKGGDLELIPFGAGRRICPGMPLAVRMVHVLLASLLIHFKWRLPAEVEGNRIDM 39084
39083 TEKFGVTLAKANHLCAMAAPT* 39018

>AC116603.1 (partial) Nipponbare strain, clone OSJNBb0015J03, from chromosome 10 
12419 ALFLWLSWLVLSLLSIYLLDLLAHSRRRLPPGPRPLPLIGSLHLLGDQPHRSL 12261
12260 AGLAKTYGPLMSLRLGAVTTVVVSSPDVAREFLQKHDAVFATRSAPDAAGDHTRNSVPWL 12081
12080 PPGPRWRELRKIMATELLATHRLDALHELRQEKVSELVDHVARLARDGAAVDVGRVAFTT 11901
11900 SLNLLSRTIFSRDLTSLDDRGASKEFQQVVTDIMGAAGSPNLSDFFPALAAADLQGWRRR 11721
11720 LAGLFERLHRVFDAEIEHRRRVAGEEHGKVKDDFLRVLLRLAARDDDTAGLDDDTLRS 11547
11546 VFT 11538

>AC116603.1 $F Nipponbare strain, clone OSJNBb0015J03, from chromosome 10 
9109 MACALSLLSVYLLDLLAQSRRRLPPGPHPLPLIGSLHLLGDQPHRSLAGLAKTYGPLM 8936
8935 SLRLGAVTTVVVSSPDVAREFLQKHDAVFATRSAPDASGDHARNSVALLPNSPRWRELRK 8756
8755 IMATELFSTSRLDALHELRQEKVVELVDHVARLAREGAAVDVGRVAFTTSLNLLSHTIFS 8576
8575 RDLTSLDDHGASKEFQQVVTDIMGAAGSPNLSDFFPALAAADLQGWRRRLAGLFERLRRV 8396
8395 FDAEIEHRRRVVGKEHGKVKDDFLRVLLRLAARDDDTAGLHDDALQSIFT 8246
6711 DLFAAGSDTSSSTVEWAMAELLRNPLPMAKACDELQRVIGSTRRIEESDIGRLPYLQ 6541
6540 AVIKETFRLHPPVPFLLPRQATTTIQILGYTIPKGAKVFINVWAMGRDKDIWPEAEKFMP 6361
6360 ERFLERATDFKGADFELIPFGAGRRICPGLPLAVRMVHVVLASLLINFKWRLPVKVERDG 6181
6180 VNMTEKFGVTLAKAIPLCAMATST* 6106

>AP004183.1 $P chr 8 clone OJ1221_H04, Pseudogene 74% to AC078944.1 118250-117423
AP004751.1 chromosome 8 clone P0494D11
51271 MAFLFPSLLLPWLPWLLGSLLSVYLLDLLAHSRRRLPPGPRPLPFIGSLHLLGDQPHRSLAALAMAYG 51068
51067 PLMSLRLGAVTTVVASSPAVA 51005 frameshift
51003 REILHRHDAAFASRPRGDSTG 50962 numerous frameshifts in the next 60 nucleotides
possible reconstructed seq. NHARE PVAWPAHNAPGWGA LR
      RIMATELFAPQPVDSGRPGRLRREKCRSCAHTPKVGARQGGKAAGVD 50738 probable intron here
      but no AG junction
50467 GRVAFATSLNLLSRTIFSRDLTSLDDP 50387 frameshift
50385 GGSKGFQEAVARIMEAGGRPNVSDFFPVLAAADLQGWRRRLARLFARLHRVFDAEVDAR 50209
50208 LREHDAGEARKGDFLDVLLGIAARRDDAAELDRDTLRSLFT 50086 (0)
45752 DLFCAGSDTSSSTVEWAMAELMQNPKSMSRVCDELSQVIGLGRNIKESEIGQLPYLQ 45582
45581 AVVKETFRLHPPAPLLLPRQAEMTMKIAGYTIPKGTRIFVNVWAMGRDKDIWPEPEKFIP 45402
45401 ERFLGSKIDFKGVHFELIPFGAGRRICPGMPLANRMVHLILGSLLNQFKWNLPVKVERNG 45222
45221 IDMSEKFGLTLAKATPLCALVTPISVKPADHQE* 45120

>AC074105.1 $F chr 10 clone OSJNBa0030B02 19 unordered pieces 39% to 76F2 
40% to 76C2
60125 MAIFIGCICS
60095 LALLLLCSHVFQLLSDARRRLPPGPRPLPVIGNLLDVAGELPHRSLACVAERYGPL 59928
59927 VTLRLGTMLAVVASSPATARDVLHRHGASITDRGTPDAWSTDGHDGNSIFAFPTRHHRWR 59748
59747 ALRRLGAEQLFSPRRVEEQRPLRRDAVRGLLRHVAELAAASGGGGAAVVDVGRAAFAAMA 59568
59567 SLLFGALFSAGIDAATSCRFRDAAREFALLTMTPNVSEFFPVVAMADLQGL 59415
59414 RRRTARHITWMYQLIDGHVERRMRGRETAGGCGAAHGEKEKDLLDVMLDMSEKEEQNDD 59238
59237 SSLTMN 59220
57988 DLLMAGSETSSAVIEWAMAELLQNPQTMTKLQEELKKVIGSKTCIDEEDIDQLPY LQAVI 57809
57808 KETHRLHPAIP LLMYKAAVPVEIQGYKIPKETTVIVNTWAIHQNSEVWIEPDKFIPERF 57632
57631 LQKEISLSSGSTNMELIPFSAGRRFCLGYPVANRMLHVMLASLVHQFQWTLPEVVKKN 57458
57457 GGVDMAEKFGITLSMATPLHAIAKNIV* 57374

>AC074054.1b $P chromosome 10 clone OSJNBa0090A14 gene 2 41% to 76F2 
42% to 76C4
same sequence as AC074355 33336 34721-32944 gene 3
56068 MAFFLPLAFSLFLAVISAYVLQLLADARRRL 
      PPGPWPLPLIGNLHQLDHLPHRSLARLAARHGPLMSLRLGTVRAVVASSPEMAREVLQRH 55796
55795 NADIAARSFGDSMRAGGHCENSVVCLPPRRRWRALRRLSTVGLFSPRRLDAMRALLEEKV 55616
55615 AELVRRVSGHAARGEAVDVGHAAHVAALGVLSRTMFSVDLDPEAAREVSDIVDEASVL 55442
55441 GTGPNVSDFFPAIAPADLQGVRRRMARLVKRMYAIIDEQIERRMHGRTAGEPRKNDLL 55268
55267 DVMLEEGESKEDSNEINRDAIRGL 55196
54837 DLFTGGETTSHTMECAMAELLQCPNSMRRVHELKSVIGSKQQMDEHDITKLPY 54679
      (16 aa deletion and frameshift missing the K-helix)
54682 LPPYEAEATIEIQGYTIPKGAKVLINLWAINRCANTWTEPDKFMPERFYDSDITFMGRDF 54503
54502 QLIPFGAGKRICLGLPLAHRMVHLMLGSLLHRFTWTLPAEAGKNGVDMRERFGLTLSFVAPLYVIAQEIQ* 
54290

>AC074054.1a $F chromosome 10 clone OSJNBa0090A14 gene 1
AC074355.2 23873-18195 clone OSJNBa0071I20, gene 2 39% to 76C2
21949 MAFFHLCISSLLLVFIISYIFQPLLDARRRFPPGPHRLPVISNLHNIGKNPHHAFARLADRYGPLM 21752
21751 SIRLGGVRAVVAMSADAAREILQRNNADITGRGGMDSWHACGHHANSSIALWPRWKWCA 21575
21574 MRMLCTEELL 21545
5536 GVTHAMREEVARELAHRVSDGSAGGMPVSVAREAFAAVAGVLWWSMFSEDMDAATTRQLRDVIE 5345
5344 EAVVVAGAPNLSDYFPVIAAADVMGVRRRMDNLVGWVYGIIDVQIDRRRRRRIVCEPRKN 5165
5164 DLLDVAFDMEGEVESEGWVMNQDTMRGM 5081
4770 AIYQDLLVAGSGSTSSTIEWAMAELLQNPKSMIQLPEELKGLMGTKTHVAESDISQLPYL 4591
4590 QAVIKETLRLHPTVPIAFNKAEATVEIQGYKIPQGTTVYVNIWAICRRAKIWDDLDKFM 4414
4413 PYRFLGRDINFLGTNFEFIPFGAGRRICLGMPLAEGML 4300
4291 HLMLASLLHRFEWTIPDEVKGDDLDMAEEFGLVLSMAKPLRAVAKET* 4148

>AC118672.1a $F (japonica cultivar-group) chromosome 3 45% to 76C2
20322 MELLLLYAPCVILLVSSLYLLRLFSDARRNLPPGPRPLPLVGNLLELGAKPHRSLARLAE 20501
20502 RHGPLMTLRLGAVTTIVASSPDAARDILQRHDAAFSTRPVPDIVRACGHDRFAMPWLPPS 20681
20682 SPQWRALRKVCSAELFAPRRLDAQQRLRREKARRLVSHVARMAREGAAVDVRRVVFTTLL 20861
20862 NMLSCTLFSADLADLDEGRAGSAGELADTVAEFAGTVGVPNVVDYFPAVAAFDPQRLRR 21038
21039 WLSRVFTRLFAEFDEQIERRMRERDAGEPPKNDFLDVLLDYRTTEDGRQFDRQTLRSRFT 21218
21879 DLFSAGSDTSAVTVEWAMAQLLQSPSSMMKAREELTRVIGSKPEIDESDIDSLEYLQAVV 22058
22059 KETFRLHPPAPLLLSHRAETDTEIGGYTVPKGATVMVNIWAIGRDSKVWFEPDKFIPERF 22238
22239 LQKEVDFRGRDFELIPFGSGRRICPGLPLAVRMVHLMLASLLHRFEWRLLPEVERNGVNM 22418
22419 EEKFGIVMTLATPLQAIATPI* 22484

>AC118672.1b $F (japonica cultivar-group) chromosome 3 (3 frameshifts)
C72289       63% to 76C2 337-453 region clone E1370 like AC078944
AU093938.1 Rice panicle cDNA clone E1370. Length = 642 43% to 77A4
BE230674.1 99AS896 Rice Seedling cDNA Library cDNA clone 99AS896.
BI813139 cDNA clone J002F10.Length = 569 = C72289
29723 MELLLLCTPCVILLLSSLYLLRLFVDARRNLPPGPRPQPLIGNILDLGSQPHRSLARLAG 29902
29903 RYGPLMTLRLGTVTTVVASSPGAARDILQRHDA 30001 (fs)
30003 AFSARSVPD 30029 (fs)
30031 AARACGHDGFSMGMLPPTSALWRAL 30105 (fs)
30107 RRVCAAELFAPRSLDAHQRLRRDKVRQLVSHVARLARDGAAVDVGRAAFTA 30259
30260 SLNLLSSTIFSADLADFGDARAESSVGDLRDLISEFTIVVGVPNVSDFFPAVAPLDPQRL 30439
30440 RRRVARVFERLQAVFDGHIERRLRDRAAGEPPKNDFLDALLDYRSPEDGRGFDRPTLQFLFT (0) 30625
      DLFSAGSDTSAVTVE 30799
30800 WAMAQLLQNPPAMAKAREELARVIGSKQEIEESDISQLKYLEAVVKETLRLHPPAPFLLP 30979
30980 HQAETTTQVGGYTVPKGTRVLVNVWAIGRDSKVWSDPDKFMPERFLQSEVDLRGRDFELI 31159
31160 PFGSGRRICPGLPLAVRMVYLMLASLLHRFEWRLLPEVEKNGVDMAEKFGMILELATPLR 31339
31340 AVAIPV* 31360

>AL731641.1 $F chromosome 4 clone OSJNBa0042I15 CYP77A = AU070007 
C19435 same clone as C99199 60% with 77A4 297-428 region
C-term of wheat  = NQPLLATVKPRKISL* from BE518078.1 this seq not found yet in rice
C-term of barley = NQPLLATVKPRKISL* from BG416410.1 this seq not found yet in rice
42403 MQPTWLATMPAASSLVVGVAFTAAVAVAVAAAVARRAWRHRGLRLPPGPPGWPVVGNLLQ 42582
42583 VVFAGKPFIHYIRDLRREYGPIVKLQMGVRTLVVISSAELVHEALVEKGREFATRPAESP 42762
42763 IRSIFSSGKFTVNSAVYGPEWRSLRRNMVSGMLSAARLREFRPARLRAMERFVARVRAEA 42942
42943 AASRDGASVWVLRNVRFAMFCVLLDMTFGLLDLDEELVVRVDAVMKRVVLAVAARIDDYL 43122
43123 PFLRPFLWRQHRQAVALRREQIDTVLPLINRRRAIVRGMRAGSPPDPAVAAPYSYLDSLL 43302
43303 DLRVEGRDAVPTDEELVTLCAEMINGGTDTTATAIEWAMARVMDNPSIQARLHGEIMQRV 43482
43483 GDARPVDDRDTEGMPYLKAFVKELLRKHPPTYFALSHAAVEPGSKLAGYDVPVDANLDIF 43662
43663 LPTISEDPKLWERPTEFDPDRFLAGGETADITGSAGVRMIPFSAGRRICPGVGMGTAHIA 43842
43843 LMVARMVQAFEWRAHPSQPPLDFEDKVEFTVVMKRPLLAMVTPRKLSF* 43989

>AA751892     (partial)  52% to 78A5     1/98 469-519 REGION
Ortholog to aaaa01012488.1 complete 3 diffs
RXVCPGRMMGLSTAYLWFGRMLQEYKWAAAQPVKLTECLRLSMEMXKPLVCHAVPRXKTG*

>AC097276.6 chromosome 3 clone OJ1519_A12 38% to AC073867 57% to 78A2
C-terminal from version 1 of accession (absent in version 6)
Gap not filled in any section of GenBank
Ortholog of aaaa01023161.1 98% (6 diffs) lower case
44847 MVSFYLDSTFLPLLATTMASPLHACLLLALLFLALAFFHPGGVAWALSSSGGHGAAAIPG 45026
45027 PRGVLLAFAGPNPHRALASLAASTRGATRLMAFSVGLTQFVVASHPDTAREILAGAAFAD 45206
45207 RPVNEA (32 aa gap in seq aaelmfhramgfaphggywrrlrrlasahala)  PG 45386
45387 RLAARRRAIGEEPVRRVAPAMARDGAVGVRRLLHLASLDNVMASVFGVGLGELGAGAVSE 45566
45567 LEEMVGQGYELLGTFNWGDHLPLLRLLDVHGVRRKSRALASRVKVFVSKIIEEHKTRRDA 45746
45747 KYGGCDGDGDFVDVLLGLEGEERLEEEDMVAVLW (0?) 45848
48030 EMIFRGTDTVAILLEWVLARMALHPDVQSKAQAEIDAAAVSGDAAALPYLHCVV 48191
48192 KECLRMHPPGPLLSWARLATRDAHLDLGADAGGRAALVPAGTTAVVNMWAIARDGGLWRD 48371
48372 PGVFRPERFLGDGEAAGVGVAGGAGGYDLRLAPFGAGRRACPGRALAMATVHLWLAQLLR 48551
48552 SFRWVPSGDRGVDMSERLGMSLEMEKPLICLALPRTSST* 48671

Duplication of N-terminal (probably a cloning error)
50744 MVSFYLDSTFLPLLATTMASPLHACLLLALLFLALAFFHPGGVAWALSSSGGHGAAAIPG 50565
50564 PRGVLLAFAGPNPHRALASLAASTRGATRLMAFSVGLTQFVVASHPDTAREILAGAAFAD 50385
50384 RPVKEAPRSHVHRAWGCPHGGTGA

>AC099399.1 $F chromosome 3 clone OJ1006_F06, Length = 122577
56% to 78A9
49121 MATSFVYLAIFACLAWAGTALLYWAHPGGPAWGKYRRARGQSPRCSI
48980 PGPKGLPVVGSLGLMSGLAHCSLAAEAARRPGAKRLMALSLGPVRAVVTSHPDVAKEIL 48804
48803 DNPAFADRPLNHAAYGLMFHRSIGFAEHGPYWRALRRVAAGHLFGPRQVDAFAPYRA 48633
48632 RVAGGVVAALRGAGGEAAVQVRGVLRRASLYYIMRFVFGKEYDVSRGAPESGEEVEELL 48456
48455 EMVHEGYDLLGKENWCDYFPGLAAVDPQGVGARCAELMPRVNRFVRGIIQEHRGKAIA 48282
48281 GGEARDFVDILLSLQESEGLADADIAAVLW 48192 (0)
47885 EMIFRGTDAMAVLMEWTLARVVLHPDVQANVHRELDAVVGRSNTVAESAVPSLPYLQALL 47706
47705 KEALRMHPPGPLLSWRHRAISDTYVDGHLVPAGTTAMVNQWAMSRDADVWDAPLEFQPER 47526
47525 FLPGGKAHGVSVLGADGRLVPFGSGRRSCPGKSLAMTTVTAWMATLLHEFEWTPASGA 47352
47351 VDLSEVLRLSCEMAVPLEVRVSARRNV* 472968

>AC083943.6 $F CYP78A11 clone OSJNBa0044A10, 61% to 78A7
40753 MAMATATASSCVDATWWAYALPALLGADTLCAHPALLAGAVLLAFATAAVLAWAASPGGPAWAHGRGRLGA 40965
40966 TPIEGPRGLPVFGSIFALSRGLPHRALDAMSRDAAAPRARELMAFSVGETPAVVSSCPAT 41145
41146 AREVLAHPSFADRPLKRSARELLFARAIGFAPSGEYWRLLRRIASTHLFSPRRVAAHEPG 41325
41326 RQADATAMLSAMAAEQSATGAVVLRPHLQAAALNNIMGSVFGRRYDVSSSSGAAADEAEQ 41505
41506 LKSMVREGFELLGAFNWSDHLPWLAHLYDPNHVARRCAALVPRVQAFVRGVIRDHRLRRD 41685
41686 SSSTAADNADFVDVLLSLEAHENLAEDDMVAVLW 41787 (0)
41874 EMIFRGTDTTALVTEWCMAEVVRNPAVQARLRAEVDAAVGGDGCPSDGDVARMP 42035
42036 YLQAVVKETLRAHPPGPLLSWARLATADVGLANGMVVPAGTTAMVNMWAITHDGEVWADP 42215
42216 EAFAPERFIPSEGGADVDVRGGDLRLAPFGAGRRVCPGKNLGLATVTLWVARLVHAFDWF 42395
42396 LPDGSPPVSLDEVLKLSLEMKTPLAAAATPRRRRAA* 42506

>AP004704.1 $F chromosome 8 clone P0544G09, Length = 137478
New 61% to 78A9
42285 MDMDSSPSTQDCGGWLLYVSLAAKCGGDPCRVVGFVAVAVVAFAVTSLLHWLSPGGPAWGRYWWNRRGGLGIAAAI
42057 PGPRGLPVLGSMSLMAGLAHRKLAAAAGGSPARRRLMALSLGETRVVVTADPGVARELLA 41878
41877 SAAFADRPVKESAYGMLFHRAIGFAPYGTYWRALRRVASTHLFSPRQVSASAAQRA 41710
41709 VIARQMVEAMRSAAAAAAGGGVAARPFLKRASLHNVMWSVFGRKYELAAPESEETAEL 41536
41535 RSMVDEGYDLLGQLNWSDHLPWLAPFDLKKTRSRCSSLVPRVNRFVTRIIDEHRARLSLA 41356
41355 VDAAVDFTDVLLSLHGGDKLSDADMVAVLWVCT 41257 (0) 
41183 EMIFRGTDTVAVLIEWVAARLVLHQDVQARVHDELDRVVGSDRAVTESDAS 41010
41009 KLVYLQAVIKEVLRLHPPGPLLSWARLATSDVHVGGFLIPSGTTAMVNMWAITHDPAVWP 40830
40829 DPNEFKPERFVAGPSSDQATEFPIMGSDLRLAPFGSGRRSCPGKSLAIATVGFWVATLLH 40650
40649 EFDWLPLSDKSRGVDLSEVLKLSCEMATPLEARLRPRRKV* 40527 

>AL662990.1 chromosome 4 clone OSJNBb0015C06, 58% to 79A2 67% to CYP79 sorghum 
C98773  146-257 REGION C-HELIX AND ON 46% to 79B2 opp end = C98774 (probably 3 
prime untranslated) = AA752332.1 cannot complete from GenBank use the indica seq.
Ortholog to aaaa01007189.1 98% shown in lower case to fill missing seq.
     mpsqliaqndhayvltfl
4128 msiamailllvalfyrikkqaaamaakrkqqpklppglatmpvvrnmhqmlmnkpvfrwihr 3943
3942 lldemdteilclrfgrvhviavaspemarellrkkdamlasrh 3814
3814 ssfasrtfsfgykn
TIMSPAGDQWRKMRQVLTSEILSPAMERRMLGRRVEEADHLVNYVYSHCNDGTVDVR
HVTRHFCGNIIRKLVFGRRHFNSGDGNIGPGRDEEAHIDALFTALDYHGAFSVSDLLP
TLVGLDLDGHEEVVNGLMNTFNRLHDPIIMERIEEW (gap? kslrtkgdkrrevadf)
119313 LDVLISLEDAQGKPFLSVDEIKAETL (0) 119236
118617 EIILATVDNPSNAVEWALAEMVNNPKVMKKAVDELDMVVGRERLVEESDIHNLTYLKACI 118438
118437 REAFRLHPYHPFNPPHVAIADTTVAGYMIPKGSHVMLSRIGLGRNPRAWDKPLEFQPERH 118258
118257 LKNTGTVVLAEPELRFVSFSAGRRGCPAVSLGTSITMMLFARLLQGFSWSIPPGGDRIEL 118078
118077 QESATSLQLSKPLFMQAKPRLLLHLYEADVLN* 117979

>AC093612.15 $P chromosome 10 clone OSJNBa0003P07, C-terminal seq from aa 166 like 98A3
probable pseudogene, no N-term on this contig between end of 
one in frame stop codon and no introns
39185 GEEGKPMVMRKHLSMVAFNNITRLTFGKRFIDAAGELDEQGSELKAIVNNGIKIGASLTI 39364
39365 AEHIRVLRWLNPVDEELYNAHSARRDRFTRRIMDEHARELERHGAKQHFVDALFTLRDKY 39544
39545 DLSDDTVIGLLCDMIAAGTDTTVITVEWAMAELVRNPRVQMKAQEELDRVIGRGRVMLE 39721
39722 ADIPNLPYLQAVVKESFRLHPPTPLMLPHKASTSVKIAGYDVPKDASVVVNVWAVARDPG 39901
39902 VWDNPLE*RPERFLEESIDIKGSDYRVLPFGAGRRVCPGAQLGISLVASMIGHLLHQF 40075
40076 TWALQDGTRPEDLDMMESPGLVTFMATPLQVVAMPRLDKEELFKRVPVDMS* 40231

>CYP98A4 AC108505.1 $F chromosome 5 clone OJ2086B07, Length = 79991
AC112160. chromosome 5 clone OJ1579G03
Same as BF430459 66% to 98A3 73% to AC093612.1 C74921, D47937 93% to 98A1 
AQ330678 nbxb0047B10f 95% identical to Sorghum bicolor 98A1 78% to AC093612.1
BE040993.1 OF16A05 cDNA 5' 62% to 98A3.Length = 795 77% to AC093612.1
Make assumption that BE040993 is part of same sequence based on percent identity to 98A1
72630 MDVASLLPFALALVAIPISLALLDRLRLGRIPPGPRPWPMVGNLWQIKPVRCRGFLEWAERYGPIVSVW 72836
72837 FGSSLNVVVSTSELAKEVLKENDQLLADRPRNRSTQRFSRNGMDLIWADYGPHYIKVRKL 73016
73017 CNLELFTPKRLEALRPIREDEVTAMVESVHRAVTQP (1) 73124
74708 GSEHKPIVVRNHLAMVAFNNITRLAFGKRFMNANGDIDEQGREFKTIV 74851
74852 NNGIKIGASLSVAEYIWYLRWLCPLNEELYKTHNERRDRLTKKIIDEHAKALKESGAKQ 75028
75029 HFVDALFTLREQYDLSDDTVIGLLW 75103 (0)
75681 DMITAGMDTTVISVEWAMAELVRNPRVQKKLQEELDRVVGRDRVMSETDFQSLPYLNAVV 75860
75861 KESLRLHPPTPLMLPHKASTNVKIGGYNIPKGANVMVNVWAIARDPKVWSNPLEYRPERF 76040
76041 IEENIDIKGSDFRVLPFGAGRRVCPGAQLGINLVASMIGHLLHQFEWSLPEGTRPEDVNM 76220
76221 MESNGVVTFMSTSLQVIAKPRLDNPDLYKRFPVEM* 76328

>AC105377.3 $P like 98A3 chromosome 10 clone OSJNBa0034E15 probable pseudogene 5 frameshifts
Same as AC093612.3 chromosome 10 65758-66265 and 122465-121428 region
3339 MDTPALLPLALALVAIPITILLDLQSDPPRQAVPRAPGVAGGNLFDITLVRCRCFMEWAG 3160
3159 KYGPIMTVWLGTSPTIVVSTSELAREVFKNV 3067 (frameshift)
3065 NDQQLADRPRNHSAERLSRGGTDLIWADYGPHYVKVRKLCNLELFAPRRMEALRPIS 2895
2894 EDEVNAMVESIYRAVTAP 2841 (1)
969 GEEGKPIGVRKHLSMVAFNNITRLTFGKRFIVAAGELDEQGCEQKAIVKAGIKIGASLPI 790
789 AEHILVLRWLNLVDEELYNAHSARRDRFTRRIMDEHARELERHGAKQHFV 640 (frameshift)
641 FTLRDQYDLSDDTV 600 (frameshift)
598 VIGLLWDMIAAGSDTAVITAEWAMAELVRNPRVQMKAQEELDRVIGRGRVMLEADIPNLP 419
418 YLQAVVKESFRLHTPTPLMLPHKASA 341
341 RVKIAGYD 318 (frameshift)
317 VPKGANVIVNVWAVGRDLSVWDSPLP 240 (frameshift)
240 PLEYRPERFLEESIDIKGSDYRVLPFGAGRRVCPGAQLGISLVASMIGHLLHQFTWALPD 61
60  GTWPEDLDMMESSGLVTFMA 1
25696 TPLQVVAMPRLDKEELFKRVPVDMS* 25773 from AC093612.15

>AP003446.1 $P chromosome 1 clone OJ1529_G03 CYP73A 9aa deletion and two frameshifts
probable pseudogene of CYP73A 
84699 MDLLFVERLLVGLLAAAVVAIAVSKLRGRKLRLPPGPTPVPVFGNWLQVGDDLNHRNLAALARRFGDIF 84493
84492 LLRMGQRNLVVVSSPPLAREVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGDHWRKM 84313
84312 RRIMTVPFFTGKVVQRHRAGWEAEAAAVVDGLRADP 84205 9 aa deletion and frameshift
84202 RRRLQLMMYSNVYRIMFDRRFESADDPLFLRLKALNGERSRLAQSFEYNYGDFIPILRPF 84023
84022 LRGYLRICEEVKETRLKLFKDFFLEERK (2)
83842 KLASTKAMDNNGLKCAIDHILEAQQKGEINEDNVLYIVENINVA (1)
83634 AIETTLWSMEWAIAEL 83587 frameshift
      VNHGEIQEKLRRELDTVLGPGRQITEPDTHRL 83489
83488 PYLQAVVKETLRLRMAIPLLVPHMNLRDAELAGYGIPAESKVLVNAWYLANDPGRWRRPE 83309
83308 EFRPERFLEEERNVEANGNDFRYLPSGAGRRSCPGIVLALPILGVTIGRLVQNFELLPPP 83129
83128 GKDRVDTTEKGGQFSLHILKHSTIVAKPRAF* 83033

CYP81 family related

>AL606630.1 $F chromosome 4 clone OSJNBb0046P18 45% to 81D3
70830 MANTTLSSLLFLSMASALFLLTLLRILRSKKQQR 70931 frameshift 
70931 PPPPPAEPAVPPRHRGHLHLFKKPLHRALSGLAATHGPVLLLHFGSRAVLHVTDPAVAE 71107
71108 ECLTDHDVTFANRPRLPSSCHLSNGYTTLGSSSYGPNWRNLRRIATVEVFSAHRLLRSAD 71287
71288 VRGGEVPHMARWLYLAAPAAGPSEPARADVKARAFELVLNVVARMVAGKQYYGGEGDAEA 71467
71468 ETEEAARFREMVREYFAMHGASNLQDFVLLLGLVDIGGAKRRAVKLSRERNTWAQRLID 71644
71645 EHRATATAAAATEARTMVGDLLKMQASEPEAYSDKVITALCL 71770 (0)
73597 SILQTGTDTSSSTIEWGMALLLNHPAAMAKARAEIDRFVGTGRVVEEADLPNLPYL 73764
73765 QCIIRENLRLYPVGPLLAPHESSADCSVSVAGGGRYAVPAGTMLLVNVHAMHRDARFWGP 73944
73945 DPESFSPERFEGGRSEGKWMLPFGMGRRRCPGEGLAVKVVGLALATLVQCFEWRRV 74112
74113 GDEEVDMTEGSGLTMPKAVPLEALYWPRPEMVPALSGIFFIYNFFY* 74253


>AC084282a $F 46% to 81D5 CDS join(53081..54004,55290..55904)
AQ160774 nbxb0005P20f 58% identical to AU032242 47% to AA754300 
AQ288789 perf to end 59% to 81D2 78% identical to C97610
MDKAYIAVFSIVILFLLVDYLRRLRGGGTSNGKNKGMRLPPGLP 
AVPIIGHLHLVKKPMHATLSRLAARHGPVFSLRLGSRRAVVVSSPGCARECFTEHDVA
FANRPRFESQLLMSFDGTALAMASYGPHWRNLRRVAAVQLLSARRVGLMSGLIAGEVR
AMVRSLCRRPAAAAPVQLKRRLFELSLSVLMETIAQSKATRPETTDTDTDMSMEAQEY
KQVVEEILERIGTGNLCDYLPALRWFDVFGVRNRILAAVSRRDAFLRRLIYAARWRMD
DGEKKSMIAVLLTLQKTQPEVYTDNMITALCSNLLGAGTETTSTTIEWAMSLLLNHPE
TLKKAQAEIDASVGNSRLITADDVPRITYLQCIVRETLRLYPAAPMLIPHESSADCEV
GGYSVPRGTMLLVNAYAIHRDPAAWEEPERFVPERFEGGGCDGNLSMPFGMGRRRCPG
ETLALHTVGLVLGTLIQCFDWERVDGVEVDMAEGGGLTMPKVVPLEAVCRPRDAMGGV
LREL

>AC084282b $F 46% to 81D5 N-term extension or too long
CDS join(57467..57739,58238..58330,59557..59693,60071..61148,
61875..62492)
C97610 61% to 81D2 C-terminal 78% identical to AQ288789
C97611.1 Rice callus Oryza sativa cDNA clone C60478_8A.Length = 661 57% to 81D4
MAFLGWAVDIARDSGASSSVVLTCDGYGSALYFSPWDSVPLPAT
ASPDDGFLLPRFPDVCVQRSQFTNHLAPANGTGGGGSRTGVKEEASEVLSWPPTSKQS
VRRLEVAEHWYRLYKTDNQRLSPDSQQVSVLAESHCDLASGNWKEISIHHKKMPSSTT
TKTTTPSRDAWIVSARSDPFHLLLEAQAPLGIKADALSQIAAVHQSHRNTSHIRELSLA
Upper part may be error (fusion to another gene?)
MDNAYIIAILSVAILFLLHYYLLGRGNGGAARLPPGPPAVPILGHLHLVKKPMHATM
SRLAERYGPVFSLRLGSRRAVVVSSPGCARECFTEHDVTFANRPRFESQLLVSFNGAA
LATASYGAHWRNLRRIVAVQLLSAHRVGLMSGLIAGEVRAMVRRMYRAAAASPAGAAR
IQLKRRLFEVSLSVLMETIAHTKATRPETDPDTDMSVEAQEFKQVVDEIIPHIGAANL
WDYLPALRWFDVFGVRRKILAAVSRRDAFLRRLIDAERRRLDDGDEGEKKSMIAVLLT
LQKTEPEVYTDNMITALTANLFGAGTETTSTTSEWAMSLLLNHPDTLKKAQAEIDASV
GNSRLITADDVTRLGYLQCIVRETLRLYPAAPMLLPHESSADCKVGGYNIPRGSMLLI
NAYAIHRDPAVWEEPEKFMPERFEDGGCDGNLLMPFGMGRRRCPGETLALRTVGLVLG
TLIQCFDWERVDGVEVDMTEGGGLTIPKVVPLEAMCRPRDAMGGVLRELV

>AC084282c $F 41% to 81H1 CDS join(65367..66332,67175..67792)
MDKAYIAVFSIAILFLLVDYFRCRRRRGSGSNNGENKGMLQLPP
SPPAIPFFGHLHLIDKPLHAALSRLAERHGPVFSLRLGSRNAVVVSSPECARECFTDN
DVCFANRPQFPSQMPATFYGAGFGFANYGAHWRNLRRIATVHLLSAHRVRGMAGVVSG
EIRPMVQRMYRAAAAAGVGVARVQLKRRLFELSLSVLMEAIAQTKTTRPEADDADTDM
SVEAQEFKNVLDELNPLLGAANLWDYLPALRVFDVLGVKRKIATLANRRDAFVRRLID
AERQRMDNGVDGGDDGEKKSVISVLLSLQKTEPEVYKDIVIVNLCAALFAAGTETTAM
TIEWAMSLLLNHPKILKKAKAEIDASVGNSRLINGDDMPHLSYLQCIINETLRLYPVA
PLLIPHESSADCKVNGYHIPSGTMLLVNVIAIQRDPMVWKEPNEFKPERFENGESEGL
FMIPFGMGRRKCPGETMALQTIGLVLGALIQCFDWDRVDGAEVDMTQGSGLTNPRAVP
LEAMCKPREAMSDVFRELL

>AC084282d $F 40% to 81H1 join(70282..71220,71398..72015)
EST D46694, D46695 from this gene
MVKAYIAIFSIAVLLLIHFLFRRRGRSNGMPLPPSPPAIPFFGH
LHLIDKPFHAALSRLAERHGPVFSLRLGSRNAVVVSSPECARECFTDNDVCFANRPRF
PSQMLATFNGTSLGSANYGPHWRNLRRIATVHLLSSHRVSGMSGIISGQARHMVRRMY
RAATASAAGVARVQLNRRLFELSLSVLMEAIAQSKTTRREAPDADTDMSMEAQELRHV
LDELNPLIGAANLWDYLPALRWFDVFGVKRKIVAAVNRRNAFMRRLIDAERQRMDNND
VDGGDDGEKKSMISVLLTLQKTQPEVYTDTLIMTLCAPLFGAGTETTSTTIEWAMSLL
LNHPEILKKAQAEIDMSVGNSRLISVVDVHRLGYLQCIINETLRMYPAAPLLLPHESS
ADCKVGGYHIPSGAMLLVNVAAIQRDPVIWKEPSEFKPERFENGRFEGLFMIPFGMGR
RRCPGEMLALQTIGLVLGTMIQCFDWGRVDDAMVDMTQSNGLTSLKVIPLEAMCKPRE
AMCDVLRKFM

>AL606587.1 $P chromosome 4 clone OSJNBa0011L07, 
probable pseudogene of AC073867 68246-61993
29446 MESWWLPTWQPLLVLLPTMLLLYHTVSSWHCGE
29347 RCLPLPPGPRGLPFVGNILHTSDMTHRGLAQLASRYGGLLHLRLGRLRTVVVSTPEMARL 29168
29167 VLHVNDRAFADRPTTAAIDYLTYDRADMVFAPYGP 29063 no GT-AG boundary 
28858 YGPLWRQLRKLCINRLFNRRRAASWAAVHDGVDSLLREVTKNSG-AVVNVGELVFGMSMK 28682
28681 ITLR 28670 missing internal sequence from aa 191-467
28642 PGMQLGMLAVELALARLLHGFDWSLPGGTGSAGELDMEETYGLTAPRAVRLSAVPVPRLSHL* 28454

>AC073867.4 $F chromosome 10 clone OSJNBa0055O03, 1 ordered pieces 63% to CYP84A 
BE230538.1 99AS755 Rice Seedling cDNA clone 99AS755.Length = 501
68246 MADMVKFTMEWLQDP 
68196 LSLAIVVTVAVLIMRMQRRRAAPFPPGPKPLPIVGNMAMMDQLTHRGLAALAKEYGGLM 68020
68019 HLRLGRLHAFAVSTPEYAREVLQAQDGAFSNRPATTAIAYLTYDRADMAFAHYGPFWRQM 67840
67839 RKLCVVKLFSRRRAETWLAVRDESAALVRAVAASRGEAAVNLGELIFNLTKNVIFRAA 67666
67665 FGTRDGEGHDEFIAILQEFSKLFGAFNIGDFIPWLSWADTNGINARLVAARTALD 67501
67500 RFIDKIIDEHMERGKNPDDADADMVDDMLAFLAEAKPHAGKAAAAAAGAGDGADDLQNTL 67321
67320 RLTRDNIKAIIM 67285
62628 DVMFGGTETVASAIEWAMAEMMHSPDDLRRVQEELAAVVGLGRDVAESDLDKLPFLRC 62455
62454 VIKETLRLHPPIPILLHETAADCLVAGYSVPRGSRVMVNVWAIARDRAAWGPDADAFRP 62278
62277 SRFAAGAAAEGLDFRGGCFEFLPFGSGRRSCPGMALGLYALELAVARLAHGFNWSLPDGM 62098
62097 KPSELDMSDIFGLTAPRATRLSAVATPRLTCPLY* 61993

>AQ160861 (partial) N-terminal to C-helix 59% to 84A1 cannot extend
ortholog of aaaa01015479.1 $FI 94%
MEYYSQDAYFVTASLISIAFFLWYASRLRRTAIL
LPPGPPGLPVLGNLLSVHQFTHRGLAKLSKIHGGFFHLRVGQPNVFVVSSPETVRE 
IIHENDSVFSHCPVTVAMVYVXYDLADMAFAHYGPFWRQMRKLCVFKLLSPR 
RDVLWRCVRGKVDALVRSV

>AC121490.1 $F (japonica cultivar-group) chromosome 3 clone
AQ577123 nbxb0090O10r 57% to CYP84 C-term
92194 MDPWLVLWLVLASMAFALLHLRRRARRGAPPLPPGPRPLPIIGNMLMMDQLTHRGLAAMA 92015
92014 ARYGGLLHLRLGRVHMVVVSSPEHAREVLQVQDGDFSNRPASIAIAYLTYGRADMAFSHY 91835
91834 GHFWRQVRKLSAVRLFSRRRAQSWRAVRDESAKLVGAIARRAGEAVDLGELIFGLTKDVI 91655
91654 FRAAFGTRDGGGHGELEVLLQEFSKLFGAFNVGDFIPWLAWLDPHGINRRLRAARAALDS 91475
91474 VIDRIIDEHVSNPAGDEDADMVDDMLAFLDEAGRDQTGGGGELQGTLRLTRDNIKAIIM (0)
      DFVFGGTE
      TVASAIEWAMAELLHSPGDLRRLQAELADVVGLGRGVEEGDLEKLPFLRCVAMETLRLHP
      PIPLLLHEAAADCVVGGYSVPRGARVVVNVWSVGRDAGAWKGDAGAFRPARFMAGGEAAG
      MDLRGGCFELLPFGSGRRACPAIVLGMYELELVVARLV 90725
90724 HAFGWAPPGGVAPEELDMADGFGLTAPRAARLRAVPTPRLTCPM* 90590

>AA754418 (partial) 74% to 89A5 423-488 region  C-TERMINAL 86% to BI797636.1
ortholog to aaaa01016355.1 $FI 95%
IRHEXXXLTGSKEIKXMPFGAGRRVCPGMALALLHLEYFVANLVWEFDWREVAG
DEVDLTEKLEFTVVMKRPLKATAVPLRXTRSAAVE* 

>AP004811.1 $F (japonica cultivar-group) chromosome 6 
BI797636.1 clone H081H03.Length = 636 86% to AA754418 62% to AC068924.9
68% to 89A4, also = BI809563.1, BI809355.1 
(end of this message seems to be vector sequence KRRSRGSKLTYACDVIALL,
this is not the true end.  It is sequencing vector and it appears 
many times in rice ESTs) beta-galactosidase alpha peptide part of 
pSPORT2 vector used in cloning this EST
113179 MEDWIFYSLTLLLCLACSLLLRARASAAAVEVAPLPPGPRTVPVLGPLLFLARRD 113015
113014 FDVEPTLRRIAAEHGPVFTFAPLGPSRPTIFVAARGPAHRALVQRGAAFASRPRGVSPA 112838
112837 SVLLTSGGRNVSSAQHGPIWRALRRCISSGVLNPARLRAFSDARRWVLDALVSHIRGEGG 112658
112657 APLTVMEPFQYAMFCLLVYMCFGDRPGDARVREIEALQRELLSNFLSFEVFAFLPPITRL 112478
112477 VFRRRWNKLVSLRRRQEELFAPLIRARREAGAGGDCYVDSLVKLTIPEDGGRGLTDVEIV 112298
112297 SLCSEFMSAGTDTTATALQWILANLVKNPAMQDKLREEITAAAVDGEVREEDLQAMPYL 112121
112120 KAVVLEGLRRHPPDHFLLPHTVEEETTLDGYRVPANTPVNFAVGEIGLDSEVWTSPEVFR 111941
111940 PERFLAGGEGEDVDLTGSKEIKMMPFGAGRRICPGMALALLHLEYFVANLVREFEWREVA 111761
111760 GDEVDLTQKLQFTVVMKRPLKATAVPLRGDRSAAAAVTGSA* 111635

>AC090873 $F 60128..61720 47% to 89A4 same as AQ327293 AU090603
nbxb0040cH09f AQ327293 nbxb0040P17f 40% to 89A4 N-terminal
MELTWLLLLATLLLSTTLLVFLFHGGSSATGGEKRRRLPPGPAT
VPVLGNLLWATNSGMDIMRAVRRLHARHGPMLGLRMGSRLEVIVADRRLAHAALVESG
AAMADRPEFASRALLGLDTATISNSSYGPLWRLFRRNFVAEVAHP ARLRQFAPARAAV
LEELTDKLRRRQEDAGAGTILETFQYAMFFLLVAMCFGELLDERAVRDIAAAQRDLLL
HSSKKLRVFAFLPAITTRLFAGRMKAMIAMRQRLKGMFMPLIDARRARKNLVDDHGDA
TAPPPPAASATTLPHSYVDTLLNLRINDNGGERALTDDEMVALCSEFLNGGTDTTSTA
LEWIMAELVKNPTIQDKLHGEIKGAITSNSGKVSEEDVQKMPYLKAVVMEGLRRHPPG
HFVLPHAPAEDMELGGYTIPKGTLVNFTVADMGMDGAAWDRPREFLPERFMAGGDGEG
VDITGTREIRMMPFGAGRRICPGLGVATLHLEYFVANMVAAFEWRAAEGEAVDVDGEK
LEFTVVMEKPLRARLLPRAVTV

>AP004769.1 $F (japonica cultivargroup) chromosome 2 clone P0017C12, Length = 
135357 New 45% to 89A4 65% to Lolium rigidum clone Lol2 AF321865
50173 MEETWLFLLFSISLVAVLLATARRRRSSSIKARLPPGPSPLLFLAKFLRLRRSIFDLGPLLRDL 49982
49981 HARHGPVISIRLFGTTLVFVADRRLAHRALVQGGSTFADRPPLPELGRLFTSDTRDINSS 49802
49801 PYGPYWRLVRRNLASEALSPARVALFAPARRRARDVLVRGLRDRGGDGSRPVELRPLLRR 49622
49621 AMFELLLYMSLGARLAPEALEEVERLELWMLRAFTSFPVFSFFPAITKRLFRNQWAAHVA 49442
49441 VRRRVGEIYVPLINARRAGDGDGDDPPCYTDSLLQLRVAEEGDRPLTDDEIIALCSEFLN 49262
49261 AGTDTTVTLVEWIMAELVNRPDIQAKVHDEVRRRPELTEADLQAMPYLKAVVL 49103
49102 EGLRLHPPAQFLLPHGVQSDAEVGGYVVPRGAELNVWVAELGRDEVVWTAAREFMPERFM 48923
48922 DGGEVEVDVTGSREITMMPFGVGRRMCPGYTVGTLHAEYLVGSLVRELEWLP 48767
      ETEGEAADMAEELDFTTVMKHPLRARVLPRPSSLY* 48655

>AC087192 $F similar to CYP89A4 missing about 97 aa internally not assembled well
CDS join(104934..105302,105476..106045,106726..107058)
AC073867.4 chr 10 clone OSJNBa0055O03, 1 ordered pieces starts 40% to 89As
     MHAELTSTTTMEMGSLLPHAASLFAVSMASLMIAAVLSIVRRP
4971 WPWKTAAISREAVLRLLGVRLGDVPTTVVRDGAVAVDALVRRADAFSDRPAG 5126
5127 GGATSIVSGGRAHNINTVPHGPLWVALRRNLTSEAFHP  VHGLARAAPGRSSRTSRPPRR 5303
5304 APPAEGQA 5327
     VRDCLYAALFALNVATCFGDGVDGE 5413
5414 LVGAMRAAQQEFLRFLPRARVFSTFQKAARLVYPDRWKQLLRHRRRQEEMYLPLIRAIN 5590
5591 EQRRTRGTPSPPPPTT
5639 YVDTLLYLEVPADDGRRRRKLSDGEMVGLVSEYLGAATGTVVAQLEWALANLVRRPDIQT 5818
5819 RLCGEVEAAAGGEPCAYLRAVVMECLRRHPPVSSVQRHMVRDVMLGGAHV 5968
5969 ARGNV VSFAIEEIGR 6013
6667 WTSSEEFSPERFMEGGEEGVRLAIGSKQEATTKVKMMPFSAGRRTCPGMGYAILHLEYFL 6846
6847 ANLVTAFEWRRVPWEEEVDLTADYGFITTMQHPLRALVVPLSNDRSTVV* 6996

AC087192.14 smea as above but with wrong N-term
CDS             join(104934..105302,105476..106045,106726..107058)
                     /gene="OSJNBa0005K07.19"
                     /codon_start=1
                     /product="putative cytochrome P450"
                     /protein_id="AAK84448.1"
MEMGSLLPHAASLFAVSMASLMIAAVLSIVRRPWPWKTAAISRE
AVLRLLGVRLGDVPTTVVRDGAVAVDALVRRADAFSDRPAGGGATSIVSGGRAHNINT
VPHGPLWVALRRNLTSEAFHP LVGAMRAAQQEFLRFLPRARVFSTFQKAARLVYPDRW
KQLLRHRRRQEEMYLPLIRAINEQRRTRGTPSPPPPTTYVDTLLYLEVPADDGRRRRK
LSDGEMVGLVSEYLGAATGTVVAQLEWALANLVRRPDIQTRLCGEVEAAAGGEPCAYL
RAVVMECLRRHPPVSSVQRHMVRDVMLGGAHVARGNVI WTSSEEFSPERFMEGGEEGV
RLAIGSKQEATTKVKMMPFSAGRRTCPGMGYAILHLEYFLANLVTAFEWRRVPWEEEV
DLTADYGFITTMQHPLRALVVPLSNDRSTVV"

>AC097280.1 $P chromosome 3 clone OSJNBb0111B07, Length = 129338 New like 89A7
this seq like 89As from AC068924 missing N-terminal and K-helix region
      PQQQAAVEALVVDIAARVGGG
83429 SGSDGEVLIRDVVHDTLFPVAAWFCFGERDVRDLQRVLREFELDVVVEGFGGSMLANLV 83605
83606 HWWRLRRFVASGRRQAEVFLPLISQQRRTQHRGEHKFCPYVD 83731 this segment 10 aa short vs 89As
83732 SLLDLRVPVGDNAAAGEGKEEHRLSHCALTDDEMVGLVSEFLGSGTEST 83879  I-helix 
frameshift and 73aa deletion
83878 RVEHLSLPSGGVRVHFILGDIGRDGKAWTDPDEFRPDRFMASGEAEGVSPLP 84033
84034 RPKEVRMMPFGAGRRYCPSMGLGVAHACLLVAALVREFEWAAAATSGVELMEGER* 84201

>AL662964.1a $F chromosome 4 clone OSJNBa0027H09, Length = 155601
new rice seq 39% to 89A4
91854 MDTSVLLLVLFVFTLLCCLALLRRAARAPAAV
91950 QLPTIEISDAAVARRALVDYADVFANRPFLPFPVALVTGRRRRRSDNLTSVPYGPHWRVL 92129
92130 RRNLTAGIFHPSRLGLLAPLQRDAVDDLVAGISESAAGGAVPVVVRDVAYAAVFRLAARM 92309
92310 CFGDGVGERQVRALRRVIREFVLDVGVANNVFPVSTSTALARLRRWRRIRRLLSSRR 92480
92481 RQAELYLPLIDERRRRMAARRRDRDDDGGTFRPYVDALIELRVPGDGESTPAPLTDDEMV 92660
92661 SLLMEFLAASTETVVSCIEWTLAHLVLNAEAQSKLRREVGDGEHVHGRHRGRTPYM 92828
92829 RAVILESLRLHPPVPFVIREVVGGAAPPVLDELAAMPMPGGGARVHFVIGDIGR 92990
92991 DSKAWRDPKEFRPERFMAGGEAEGVGPVPGPKEVRMMPFGAGRRSCPGMGVAMAHVGLF 93167
93168 VAALVREFEWTPAAGGGVDLTQQDDFFNVMRTPLRARATPRPRAPA* 93308

>AL662964.1b $F chromosome 4 clone OSJNBa0027H09, Length = 155601
Almost identical to AC097280.1 chr 3
Same as BI799272.1 BI798940.1 rice ESTs
104753 MMLLLTLLLVLTLFLCLAVFRRTTSRARRAPVSLRQPTVEIHDGDVARRALLDHADAFVNRP 104938
104939 AIGAEPRGRRSDNLTTVRYGPQWRVLWRNLTAGFLRPSRVGLLAPLQQKAVDAL 105100
105101 VADIAARGADGGEVGVRDVVHDALFPLAVRFCFGDDIDERHVRDLRRVMQELKLDVVVK 105277
105278 GFSGSMLANLVHWWRLRRFIASGRRRAEIFLPLIAQRRRTQHRDADGGVF 105427
105428 RPYVDSLLDLRVPVGHDADADAAGCEDNEGRNSGRALTDDEMVGLVAEFLSGGTETVVSC 105607
105608 VEWTLAHLVIEPEIQDKLCRQVVAAADHHGGERGTTPAYLRAVILESLRMHPPVPLTMRD 105787
105788 VRSPQAVEHLSLPDGGARVHFILGDIGRDGKAWTDPDEFRPERFMAGGEAEGVG 105949
105950 PLPGPREVRMMPFGAGRRYCPGMGLGVAHACLLVAALVREFEWAPTAVAATGGVDLTEVN 106129
106130 GFFKMMRTPLRARATPRGTSA* 106195

>AC068924.9a $F chromosome 10 clone OSJNBa0026L12 1 ordered pieces
          Length = 154022   10 different genes
51% to 89A2 gene 1
15237 MDNLYLILGAILAAAVVPFILFRGNGGKRGRRLPPGPPAVPLLGSVVLLTKALTDVEPEL 15058
15057 LLQRLIARYGPIVSLRMGTRVSVFVADRRLAHAALVEGGAALADRPGVPASRLLGEND 14884
14883 NIITRAGYGPVWRLLRRNLVSETLHPSRARLFAPARYWVHRVIVDKLAASGQAPHDVVD 14707
14706 TLQYAMFCLLVNMCFGERLDEATVRAVEDAQRDLLIYITSQMAVFAYFPAITKHLFRGRL 14527
14526 EKIYALRRRQRELFMPLINARREYKKHGGETTKTTNKETTLEHSYVDTLLDIKLPEDGNR 14347
14346 ALTDDEIIKLCSEFLNAGTDTTSTALQWIMAELVKNPSIQSKLHDEIKSKTGDDQPEVTE 14167
14166 EDVHGMPYLRAVVLEGLRKHPPGHFVLPHRAAEDVEVGGYLIPKGATVNFMVAEIGRDER 13987
13986 EWAKPMEFIPERFLPGGDGEGVDVTGSKGIRMMPFGVGRRICAGLSFAMHHLEYFVANMV 13807
13806 REFEWKEVAGDEVEFAEKREFTTVMAKPLRARLVPRRTT* 13687

>AC068924.9b $F 49% to 89A2 gene 2
30867 MDAWQLFVPAIAILIPLLRLILFRRGDDGRRGRLPPGPPAVPLLGSTVWLTNSLYDAEPV 31046
31047 VKRLMSRHGPVVSLRFGSQLLVFVADRRLAHAALVESGASLADRPSQAASARLVGEGDTM 31226
31227 ISRASYGPVWRLLRRNLVADTLHPSRAHLFAPVRARVRRLLVDRLREEHGEAEAAPRAVV 31406
31407 ETFQYAMFCLLVLMCFGEQLDEDAVRAIGAAERDTMLYMSSEMGVFNFFPAITRHLFRGR 31586
31587 LQKAHALRRRKEELFVPLINSRREYKKNGGEPKKETTTFTHSYVDNLLDINLPEDG 31754
31755 NRALTDDELVMLCSEFLVAGTDSTSAALQWIMAELVRNPSIQSKLYEEIKSKTGGGGQHE 31934
31935 VSEEDVHDMPYLKAVVLEGLRKHPPAHMLLPHKAAEDMDVGGYLIPKGTIVNFMVAEMGR 32114
32115 DEKEWEKPMEFMPERFLPGGDGEGVDVTSSKGIRMMPFGVGRRICPGLGIAMLHLEYFVA 32294
32295 NMVREFEWKEVAGDEVDFAEKREFNTVMAKPLRVRLVPRSTLE* 32426

>AC068924.9c $F 51% to 89A2 no introns gene 3
AU092198.1 Rice callus Oryza sativa cDNA clone C11370.Length = 319
AU062487 59% to 89A2 same as AC068924 gene 3 two diffs
37901 MDTWHVAVAAILVLIPFLRLILSRRGGRGGGKRGRLPPGPPAVPLLGSTVWLTNSLADAE 37722
37721 PLLRRLIARHGPVVSLRVASRLLVFVADRRLAHAALVEKGASLADRPAMASTRLLGESDN 37542
37541 LISRASYGPVWRLLRRNLVAETLHPSRVRLFAPARAWVRRVLVEKLRDENGDAAAPHAVV 37362
37361 ETFQYAMFCLLVLMCFGERLDEDAVRAIAVAQRDALLYLSSKMPVFAFFPAVTKHLFRGR 37182
37181 LQKAHALRRRQTELFVPLINARREYKKRQGGANGEPKKETTFEHSYVDTLLDIKLPDDGN 37002
37001 RPLTDDEMVNLCSEFLNAGTDTTSTALQWIMAELVKNPSIQSKLHDEIMAKTGGGGGGGQ 36822
36821 REVSEEDIHDMPYLKAVVLEGLRKHPPGHMVLPHRAAEDMEIGGYLIPKGTTVNFMVAEM 36642
36641 GRDEKEWEKPMEFMPERFLAGGDGEGVDVTGSREIRMMPFGVGRRICAGLGVAMLHVEYF 36462
36461 VANMVSEFEWKEVAGDEVDFAEKIEFTTVMAKPLRARLVPRRA* 36330

>AC074105.3 $P chromosome 10 clone OSJNBa0030B02 19 unordered pieces from 87-152 
region 58% to rice 89 seqs 89% to AP002854 bracketed by known genes, rest absent
109989 SSFMADCCVAHATLVERGAALANRLEVARSLLGENDYTITRCRCGPVWQILRRNLVAEMLHPSRVRL 110189

>AP002854.1 $P genomic DNA, chromosome 6, BAC clone:OSJNBa0014B15, 89% to AC074105
23716 SSFVADCCVAHATLVERGTTLADRLEVARSLLGENGYTITRCRCGPMWQILRRNLVAETL 23895
23896 HPSRVRL 23916

>AC068924.9d $F 50% to 89A2 gene 4
38532 MDNLVILAGVLLLPLLFLLRNAASSRRRRLPPGPPAVPLFGNLLWLRHSAADVEPLLLTLFKK 38720
38721 YGPVVTLRIGSRLSIFVADRHLAHAALIAAGAKLADRPQAATSTLLGVSDNIITRANYGA 38900
38901 MWRLLRRNLVSQTLQQSRVDQFAPARVWVRRVLMEKLRGSGEEAPNVMEAFQYTMFCLL 39077
39078 VLMCFGERLDEPAVRDVEEAERAWLLYISRRMSVFFFFPWITKHLFRGRLEAAHALRRRQ 39257
39258 KELFVPLIEARREYKRLASQGLPPARETTFQHSYVDTLLDVKIPEEGNRALTDDEI 39425
39426 VTLCSEFLNAGTDTTSTGLQWIMAELVKNPAVQEKLYAEINATCGGDDELLERNVRDKD 39602
39603 NKMPYLNAVVKEGLRKHPPGHFVLPHKAAEDMDVGGYLIPKGATVNFMVAEIGRDEREWE 39782
39783 NPMQFMPERFLDGGHGAGVDMHGTKGIKMMPFGVGRRICAGLNIAMLHLEYFVGSMVMEF 39962
39963 EWKEVEGHEVEFAEKREFTTVMAKPLRPRLVPRRS* 40070

>AC068924.9e $F 49% to 89A2 gene 5
49550 MDNLAILTGVLLLPLLVFLRDVTSRRRRRLPPGPPAVPLFGNLLWLRHSAAELEPLLLKLFDK 49362
49361 YGPVVTLRIGWRLSIFVADRHLAHAALVGAGATLADRPRTA 49239 frameshift
48% to 89A2 starts at 103
49240 PPACSSASARIITRSNYGAMWRLLRRNLVSQTLQQSRVDQFTPARVWARRVLMEKLRE 49067
49066 TTSGVSGGGGDAPPNVTEAFQHAVFCLLLLMCFGERLHEHAVRDIEDAERAWLLYISKKT 48887
48886 SVFFFLPWITKHLFRGRLEAGHALLRRQKELFVPLIEARREYKKLASQGLPPARETT 48716
48715 FQHSYVDTLLDVKIPEEGNRSLTDGEIVTLCSEFLNAGTDTTSTGLQWIMAELVKNPAVQ 48536
48535 EKLHAEINATCGDDGELLERSVRDKDNKMPYLNPVVMEGLRKHPPSHTLLPHKAAEDIDV 48356
48355 GGYLIPKGATVNFMVAEIGRDEREWEKPMEFIPERFLAGGDGDGVDVTGSREIRMMPFGA 48176
48175 GRRICAGLNVGVMHLEYFVGSMVMEFEWKEVAGDEVEFAEKLEFTTAMAKPLRPRLVPRRVRH* 47984

>AC068924.9f $F 50% to 89A2 gene 6
58837 MANVDASQWLMLLLAFLVALFILLSLRKCGGRSRVPPGPLAVPVLGNLLWLSHSSADLEPLLRRLIA 58637
58636 VYGPVVSLRVGSHLSIFVADRRVAHAALVERGAALADRPEVTRALLGENGNTISRGNYGP 58457
58456 TWRLLRRNLVAETLHPSRARAAFAPARSWARRALVDGLVGGGAVLADAFRHAMFCL 58289
58288 LVLMCFGEWLDEAAVRAIGDAQHGWLLHYATKMKVFAFCPAVTKHIFRGRIQTSLALRRR 58109
58108 QKELFMPLISARRERKNQLAERAVPEKETTTFEHSYVDTLLDIKLPEDGGDRALTDDEMV 57929
57928 RLCSEFLDAGTDTMSTTLQWIMAELVKNPSIQSKLHDEIKSKTSDDHDEITEEDTQKMPY 57749
57748 LKAVILEGLRKHPPGHFALPHKAAEDMEVGGYLIPKGATVNFMVAEMGRDEKEWENPMEF 57569
57568 MPERFLPGGDGEGVDVTGSKGIRMMPFGVGRRICAGLNTAMLHLEYFVANMVREFEWREI 57389
57388 AGEEVDFAEKLEFTTVMAKPLRAQLVRRRMN* 57293

>AC068924.9g $P Starts at 259 gene 7 pseudogene fragments
64765 KRPATVPENDTTFEHSYVDTLLDLRLPEDGDRALTDKAMVCLCSE 64631
64626 LDSGADTMSTTLPWIMAELVKNPSIQAKLYEEIKATVSDDHDEITEEDTQKMPYLKAVIL 64447
64446 EGLRKHPLGHFALPHKAAEDMDVGGYLIPKGATVNFMVAEMGRDEREWENPMQFMPERFL 64267
64266 PGGDGEGVDMTGSKRIRMMPFGVGRRMCAGINTAMLHLEYFVANMVREFEWKAVVGDKVD 64087
64086 FAEKL 64072 EFTTVMAKPLRAQLVLRSVD 64013

>AC068924.9h $P 47% to 89A2 two stop codons and frame shift gene 8
69888 MTIDASPWLLLLVAFLFPFFVLRSLRSDGKGGGRGGGCRVPPGPLAVPVLGNLLWLWHSPADLE 69691
69690 PLLRRLIARHGPVVSLGVGSRLSIFVADRRVAHAALVERGAALADRPEVTRALLGENGNT 69511
69510 ITRASYGPTWRLLRRGLVSGTLHPSTTRARVFAPARSWARRVLVGKLAAASGQAPHGVMD 69331
69330 TLQYAMFCLLVVMCFGERLDEADVRAIATAQHDWIVYFATKMRVFAFCSTITKHLFRGRI 69151
69150 KMALALRRRQKELFVPLINARRERKTRTQPTLPENGTTFEHSYVDTLLDLRLPEDGNRAL 68971
68970 TDKEMVSLCSEFLDAGTDTMSTALQWIMAELVKNPSI*SKLYDEIKATVSDDHDEITEDD 68791
68790 TQKMPYLKAVILEGLRKHPPGHFVLAHKAAEDIEVGGYLIPKGATVNFMVAEMGRDEREW 68611
68610 ENPMQFMPERFLPGGDGEGVDMTGSKRIRMMPFGVGRRMCAGLNTAMLHL*YFVANMVRE 68431
68430 FEWKAVVGDKVDFAEKLEFTTVMAKPLRAQLVLRSVD* 68318

>AC068924.9i $F 51% to 89A2 no introns gene 9
BI812666.1 mature leaf library cDNA clone F008D03.Length = 448
86433 MATDTSWLLLLVALVLPLVVLLARRRRSGGGSRRIPPGPLAVPVLGSLLWLRHSSANLEPLLQRLI 86236
86235 ARYGPVVSLRVGSRLSIFVADRRVAHAALVERGAALADRPDVTRSLLGETGNTITRSS 86062
86061 YGPVWRVLRRNLVAETLHPSRVRLFAPARSWVRRVLVDKLADGARPESEPPRPRVVVETF 85882
85881 RYAMFCLLVLMCFGERLDEAAVRAIGAAQRDWLLYVARKTSVFAFYPAVTKHIFRRRLQI 85702
85701 GLALRRQQKELFVPLIDARRARKNHIQQAGGPPVPEKETTFEHSYVDTLLDVSLPDTDGD 85522
85521 SALTEDELVMLCSEFLNAGTDTTATALQWIMAELVKNPSIQSKLHDEIKSKTSDDEIT 85348
85347 EEDTHDMPYLKAVILEGLRKHPPAHFVLPHKAAEDVEVGGYLIPKGATVNFMVAEMGRDE 85168
85167 REWEKPMEFIPERFLAGGDGEGVDVTGSREVRMMPFGVGRRICAGLGVAMLHLEYFVANL 84988
84987 VKEFEWKEVAGDEVDLTEKNEFTTVMAKPLRAQLVKRA* 84871

>AC068924.9j $F 48% to 89A2 gene 10
92213 MAIDATQWLLLLVVFLVAFLFTLLAKHGAVKRKHGVRVPPGPLAVPVLGSLVWLTHSSSANLEPLLR 92013
92012 RLIARHGPVVSLRVGSRLSIFVADRRVAHAALVGRGAALADRPPDVTHSLLGESRNTITR 91833
91832 SGYGPVWRLLRRNLVVETTHPSRVRLFAPARSWVRRVLVDKLADAGAHPASPPRVLEVFR 91653
91652 YAMFSLLVLMCFGERLDEAAVRAIGAAQHDFLLYLGRKTSVFMFYPAITKHLFRGRVHLG 91473
91472 LAVRRRQKELFMPLIDARRERKKQIQQSGDSAASEKKKDDNTTFNHSYVDTLLTIRLQDV 91293
91292 DGDGDRALTDDEMVSLCSEFLSAGTDTTATALQWIMAELVKNPSIQSKLYEEIKATMSGD 91113
91112 NDDEINEDDARNNLPYLKAVILEGLRKHPPMHLLLPHKAAEDVEVGGYLIPKGATVNFMV 90933
90932 AEIGRDEKEWEKPTEFIPERFMAGGGDGEGVDVTGSREIRMMPFGAGRRICAALSVAMLH 90753
90752 LEYFVANMVKEFEWKEVAGDEVDFAERLEFTTVMAKSLRVRLIKRA* 90612

>AP003723.1 $F chromosome 6 clone P0003H08 60% to CYP77B1
AU068699.1 Rice callus cDNA clone C50079_1A.Length = 330 64% to 77B1
AA753883, D41585 AA753996 64% to 77B1 11/94 268-350 region
127597 MVDMNDVLLVVSAAVLAAMWWRRCSRTGGADG
127693 LPPGPPGWPVVGNLFQVILQRRPFMYVVRDLREKYGPIFTMRMGQRTLIVVTDADLIHD 127869
127870 ALVKQGAAFASRPEDSPTRLLFSVGKCTVNSAPYGPLWRALRRNFVAEIVSPPRVKGFSW 128049
128050 IREWAVGSHLRRLRAEFAATGAVRMMANCRLSICSILICICFGAKIPDELIREIEEVLKD 128229
128230 VMMMTMPKLPDFLPLLTPLFTKQLAAARELRRRQLGCLAPLVRARREFIRGGGERNA 128400
128401 DGNTVVGGVEMVSAPGEAYVDSLFDLEPPGRGKRLGEDELVTLCSEVMSAGTDTSATAL 128577
128578 EWAMMHLVLDAGVQDKLYGEVVSKVGTTARITEADVEAMPYLQ 128706 (0)
129997 AVVKETFRRHPPSHFVLSHAATRDTELGGYRVPADASVEFYTAWVTENPATWPDP 130161
130162 EAWRPERFLEGGEGFDTDITATRALRMMPFGAGRRICPAATLGVLHIQLMLANMVREFRW 130341
130342 VPPAGEGPPDPTETFAFTVVMKNPLRAALVERRVGGELATGGGGGAAASA* 130494

>AP005419.1 Oryza sativa (japonica cultivar-group) chromosome 9 clone
AP005570.1a 4 genes and a pseudogene seq  of gene 1 shown below
AQ575117.1 nbxb0086B24r CUGI Rice BAC genomic cloneLength = 830 
70% to rice 92 seq above ortholog of AAAA01038529.1 aaaa01083800.1 AAAA01003216.1
1053 MEVALCASFVAILLTTMLFLKAISTRCRRRKYNLPPGPKPWPIIGNLNLVGALPHRSIHE 1232
1233 LSRRYGPLVYLRFGSFPVVVGSSVEMARFFLKTRDAAFIDRPRTAAGKHTAYNYRDITWS 1412
1413 PCDAYWRQARRVVLTELFSARRIESYEHIRREEVHALLRDLHYASSSGGRRAIVIKDYLS 1592
1593 TASLNMITRMVMGKRYVQGEVVHEEPGSARTTLAQFKELLEELFFLNGVFNVGDQIPWLE 1772
1773 WLDLQGYVKRMKKVSKALDQLLEHVVDEHSERRQREGNGFVAGDMVDVLLRLADDSSLEV 1952
1953 KLSRDSIKAFTQ 1988 (0?)
4609 DLIAGGTESSSETIEWAISELLRKPEMFAKATEELDRIVGHRRWVNEKDILDLPYIEAIV 4788
4789 KETMRLHPIGPLLAPRLSREDTSVGGYDIPTGTRVFVNVWAIARDPTLWDASEEFMPERF 4968
4969 LGKKIDVKGQDFELLPFGSGRRMCPGYNLGLKVIQLSIANLLHGFTWRLPKDMVKEDLSM 5148
5149 EEIFGLSMPRKFPLEVAVEPKLSSHLYKGD 5238

>AP005570.1b 4 genes and a pseudogene seq  of gene 2 shown below
16178 MEFLLWVSYLTITLATILLFLRTLILRHNRRVYNLPPGPKPWPIIGNLNLMGSLPHRSIH 15999
15998 SLSKKYGPLMHLRFGSFPVVVGSSVEMAKFFLKTHDVVFADRPKTAAGKHTTYNYSDMTW 15819
15818 SPYGAYWRQARKVCLAELFSAKRIESYEHIRREEVRALLRDLHAASGRVVALKDYL 15651
15650 SAASLNVISRMVLGKKYLEREVVHEGEVVTTPERFRWMIDELFLLNGVLDIGDSIPWL 15477
15476 GWLDLQGYIRRMKKLSKMFDQFLEYVLDEHENRMCREGESFVAKDMVDVLLNVASDPSLE 15297
15296 VKFSRDSVKAFTQ 15258 (0?)
13216 DLIAGGTESSSATVDWAIAELLRKPEVFAKVTEELDRVVGRGRWVTEK 13073
13072 DIPSLPYIDAIMKETMRMHPVAPMLAPRLSREDTSVDGYDIPAGTRVLVGVWSIGRDPKL 12893
12892 WDAPEEFMPERFIGSKIDVKGQDFELLPFGSGRRMCPGYSLGLRVIQVSLANLLHGFAWR 12713
12712 LPDGMTKEQLSMEEIFGLSTPRKFPLEVVVEPKLPADLY 12596

>AP005570.1c 4 genes and a pseudogene seq  of gene 3 shown below
= D24290 $F AU070773 N-TERMINAL 67% to 92A2 C19482
AQ327338 AU068589 67% identical to C72289 76% to 92A2 PKG to end
AU068590 I-helix region 77% to CYP92A2
AU091996.1 AU091996 Rice cDNA clone R10225.Length = 435 198-328 region
D49064 60% to 92A2  3/95  8/95 432-508 REGION very similar to AQ327338
BI306309
21127 MLPPWASFVGVVLATVLFLKAVLGRSRRVYNLPPGPKPWPVIGNLNLVGTLPHRSIHNLSKKYG 20936
20935 PLMYLRFGSFPVVVGSSVEMAKFFLKTHDVVFTDRPKTAAGKHTTYNYSDITWSPYGAYW 20756
20755 RQARKMCLTELFSAKRLESYEYIRGEEVRALLRDLHGAAGGVVVLKDYLSTVSLNV 20588
20587 ITRMVLGKKYLDKDAGGSVTTPEEFKWMLDELFLLNGVLNIGDSIPWLDWLDLQG 20423
20422 YIKRMKKLGKMFDRFLEHVVDEHNERRRREGESFVAKDMVDVLLQFADNPNLEVKLKREG 20243
20242 VKAFTQ 20225 (0?)
19497 DLIAGGTESSAVTVEWALSELLKKPEVFAKATEELDRVVGRGRWVTEKDVPSLT 19336
19335 YVDAIVKETMRLHPVAPMLVPRLSREDTSVDGYDIPAGTRVLVSVWTIGRDPKLWDAPEE 19156
19155 FMPERFIGNKIDVKGQDFELLPFGSGRRMCPGYSLGLKVIQLSLANLLHGFAWRLPDGVT 18976
18975 REQLSMEEIFGLSTPRKFPLEAVVEPKLPAHLY 18877

>AP005570.1d 4 genes and a pseudogene seq  of pseudogene 4 shown below
pseudogene possible ortholog of AAAA01006146.1a $PI 4 diffs 94%
32984 MELPPWPSLLVIVLTAVVFFLATILCHGRRVYRLPLGPKPWPIIGNLNLIGALPHRSIHE 33163
33164 LSKRYGPLIQLRFGS 33208

>AP005570.1e 4 genes and a pseudogene seq  of gene 5 shown below
ortholog of AAAA01006146.1c $PI 100% first 199 aa (not a pseudo in japonica)
47412 MELPSWASVLVIVLAAVVFLLATILRHGRRAYRLPPGPNPWPIIGNLNLIGALPHRSIHE 47591
47592 LSKRYGPLMQLRFGSFPVVVGSSAEMARFFLKSHDIVFTDRPRTAAGKHTTYNYTDILWS 47771
47772 PYGAYWRQARKMCVTELFSARRLESFEHIRGEEVRALLRDLHGAAAPVLLRDYLS 47936
47937 TATLGVISRMVLGKKYVGVEGASAGNGGGGEGTSPAATPEEFKMMMDELFLLSGVLNIGD 48116
48117 FIPWLDWLDLQGYIRRMKRVGKKLDRFMEHVLDEHDKVRRQQGDRFAARDLVDVLLQLAD 48296
48297 DPNLEVQLRRDNVKALTQ 48350
48452 DLIAGGTDTSAITVEWAISELLRKPEILAKATEELDRVVGRGRLVTETDMTSLPYVEAI 48628
48629 VKETMRVHPVAPLLAPHVAREDASVGGYDIPAGTRVLVNVWTIARDPALWDSPEEFMPER 48808
48809 FIGSKIDVKGQDFQLLPFGSGRRLCPGHSLGLKVIQLSLASLLHGFEWRLPDGVSAGELS 48988
48989 MEEVFGLSTPRKVPLEVVVKPKLPAHLYTG 49078

>AP003711.1 $F chromosome 6 clone P0417G12 45% to 93D
85884 MDHQLVARGLFKPLLLFVAGLIVLYALRRRRRHRRSSGLR 
85764 LPPSPFGLPILGHLHLLAPLPHQALHRLAARHGPLLFLRLGSVPCVAACSPDAAREVLKT 85585
85584 HEAAFLDRPKPAAVHRLTYGGQDFSFSAYGPYWRFMKRACVHELLAGRTLDRLRHVRR 85411
85410 EEVARLVGSLRASADGGERVDVDAALMGLTGDIVSRMVMGRRWTGDDNDAEEMRSV 85243
85242 VAETAELTGTFNLQDYIGVFKYWDVQGLGKRIDAVHRKFDAMMERILTAREAKRKLRRQA 85063
85062 AADGEDDEKDLLDMLFDMHEDEAAEMRLTRDNIKAFML (0)
84873 DIFAAGTDTTTITLEWALSELINNPPVLRKLQAELDAVVGGARLADESDIPSLPYLQ 84703
84702 AVAKETLRLHPTGPLVVRRSLERATVAGYDVPAGATVFVNVWAIGRDAAWWPEPTAFRP 84526
84525 ERFVSGGGGGGTAADVRGQHFHLLPFGSGRRICPGASLAMLVVQAALAAMVQCFEWSPVG 84346
84345 GAPVDMEEGPGLTLPRKRPLVCTVSPRIHPLPAAASASLT* 84223

>AP004070.1 $P chromosome 2 clone OJ1705_E12 two pseudogene fragments like AP003711
107009   AAVHELLAGRTLDRLRHVGREEVARLVGSLSRSA 107110
108243 RRAAVHELLAGRTLDRLRHVGREEVARLVGSLSRSAANGEHVDADAALMGLTSDIVSRMV 108064
108041 GDDNDTEEMWSVVAETAELTSTLNLQCEVIIILVVAVEQPGEVVVVVGAALVNEEVNAVHRR 107856 

>AP004013.1 $P chromosome 8 clone OJ1575_B01 pseudogene fragment like AP003711
AP004155 76218-76039 same sequence
74046 RRPAGPGLRLPPSPFGLPILGHLHLLTPLPHQALHRLAACHGPLLYLRLGSVPCVAACSP 74225

>AP003759.1 $P chromosome 7 clone OJ1567_G09, pseudogene fragments like AP003711
100882 RRRRRHLPQQGRRLPRREAGGAPLPRAALAHDGARHG 100772
112892 AAVHELLAGRTLDRLRHVGREEVARLVGSLSRSA 112993
114111 AAVHELLAGRTLDRLRHVGREEVARLVGSLSRSAANGEHVDADAALMGLTSDIVSRMVN 113935
113934 W*WAGDDNDTEEMWSVVAETAELTSTLNLQGEVIVILVVAVEQPGEVVVVVGAALVNEEV 113755
113754 DAVHRR 113737

>AL606632.1 $P chromosome 4 clone OSJNBa0042L16 pseudogene fragments like AP003711
79792 VHELLAGLTLDRLRHVRREEVARLVGSLSWSATDAEHVDVDAALMGVTSNIVPRMVMGRR 79613
79612 WTGDDNDMEEMRSVVAETAELTGDLVYQD 79526
79515 RGRQAELDAVVGDLQAVAKETLRLHPT 79435

>AC096855.1 $F chromosome 3 clone OJ1365_D05 54% to AC087550 frameshift before PERF?
= AQ326032 AQ329780 73% to AF321860 Lolium rigidum similar to CYP71D sequences
87309 MDDYFFLQSLLLCVAAVALLQLAKVAATMRRRPRTPPGPWRLPVIGSMHHLVNALPHR 87136
87135 AMRDLAGVHGPLMMLRLGETPVVVASSRGAARAVLKTHDANFATRPRLLAGEIVGYGW 86962
86961 ADILFSPSGDYWRKLRQLCAAEILSPKRVLSFRHIREDEVTARVEEIRAAAAPSTPVNLS 86782
86781 VLFHSTTNDIVARAAFGRKRKSAPEFMAAIKAGVGLSSGFKIPDLFPTWTTALAAVTGMK 86602
86601 RSLRGIHKTVDAILQEIIDERRCVRGDKINNGGAADDQNADENLVDVLIALQEKGGF 86431 (1)
86339 GKSVTTPWVIVTHMICTLDVQDMFAGGTGTSASALEWAMSELMRNPAVMKKLQGQIREAFH 86157
86156 RKAVVTEADLQASNLRYLKLVIKEALRLHPPAPLLVPRESIDTCELDGYTIPAKSRVIVN 85977
85976 VWAIGRDPK Y*Y*E DAEEFKPEQFDDDAIDFMGGSYEFIPFGSGRRMCPGFNYGLASMEL 85797
85796 VLVAMLYHFDWSLLVGVKEVDMEEAPGLGVRRRSPLLLCATPFVPAAVSADY* 85638

>AC087544.2 $F chromosome 10 clone nbxb0046P18, 
47% to CYP71D7
AZ131846.1 OSJNBb0111D08r CUGI Rice BAC Length = 377 59% to 71B9
AZ132319.1 OSJNBb0062F12r CUGI Rice BAC genomicLength = 683
14167 MEDKLLLLLALAVSVLVAVVISKLVSYATKPRLN
14065 LPPGPWTLPVIGSIHHLVGSHPIHRSMRALAEKHGRDLMQVWLGELPAVVVSSPEAARDV 13886
13885 LRSQDLAFADRYVSTTIAAIYLGGRDLAFAPYGERWRQLRKLCTQRLLTAARVRSFRCVR 13706
13705 EEEVARLVRDLAASAAAGEAVDLTARVAELVNDVVVRCCIGGRRSRYRDEFLDALRTALD 13526
13525 QTTWLTVADVFPSSKLARMLGTAPRKALASRKKMERILEQIIQERKQIKERSTGAGAGAD 13346
13345 DEAAAAGNECFLDVLLRLQKEGDTPIPITNETMMLLLH 13232 (0)
10760 NMFSAGSETSSTTLNWTMAELIKSPRVMAKVHDEVRQAFQGKNTITDDDVAKLSYLKMVT 10581
10580 KESLRMHCPVPLLGPRRCRETCKVMGYDVPKGTIVFVNAWAICRDSKYWKSAEEFKPERF 10401
10400 ENISIDYNGNNFEFLPFGSGRRICPGITLGMANVEFPLASLLYHFDWKLPNQMEPEEIDM 10221
10220 REAPGLVGPKRTSLYLHPVTRVAPSSV* 10119

>AC087550.2 $F chromosome 10 clone nbeb0016G17 same as seq on AC087544 from 1-3082
AQ330340 nbxb0046P18r 60% to D48250 65% to 76C4 almost identical to AC087550.2
139422 MEDKLPLALTVLSVSVLIAVVISKLVSYATKPRLNLPPGPWKLPVIGSLHHLVGSHAIHR 139243
139242 SMRALAEKHGRHHLMQISLGEVFAVVVSSPEAAEEILRNQDVTFADRFLSTTIGVITFGG 139063
139062 NDMAFAPYGERWRQLRKLCTLELLSAARVRSFRRIREEEVARLVRDLAASAAAGEAVNLS 138883
138882 GRIAKLINDVVVRCCVGGRSEHRDEFLDALRTALDQTTWLTVADVFPSSKLARMLGTAP 138706
138705 RKALASRKKIEHILEQIIQERKRIMDRSSHGGDGDGEAMNTSECFLDVLLRLQKDGNT 138532
138531 PIPITNEVIVVLLF (0)
136455 DMFSGGSETSSSTLIWTMAELIRKPKVMAKAHVEVRQAFQGKNTITEDDGVNELTYLKMV 136276
136275 IKESLRMHCPVPLLGPRKCRETCKVMGYDIPKDTTVFVNAWAICRDPKYWDDAEEFQPER 136096
136095 FENKSIDFKGSNFEFLPFGSGRRMCAAMNLGIANVELPLASLLYHFDWKLPDGMMPEDVD 135916
135915 MQDAPGILVGKRSSLIMCPVTRVAPSNPQVIAS* 135814

>AP003805.1 $F chromosome 7 clone OJ1080_F08, similar to AC087550.2
39% to 71B23
10416 MEDKVLLAVGASLVLVFLSKLISSYAKKPRLNLPPGPWTLPLIGSAHHLVSWSESVHSVIGKLAREHGPVMQ 10201
10200 LWLGEVPTVVASSPEAAQEILRDHDLIFADRHLTSTTAAITFGGTDVVMAQYGERWRHLR 10021
10020 KLLTQELLTVARVRSFRRVREEEVARLVRDLSAAAASGATVNLTDMVNRLVNDTVLRCSV 9841
9840  GSRCKYREEFLAALHAILHQTSALSVADLFPSSKLASMVATGPRNVLANRNKVERI 9673
9672  IEEIIQERKNQIETDMMSGNDDVGDKAAVESKSCSLDVLLRLQKEGGTPIPITNQVITVLLW (0)9487
3785  DMFGAGTDTSSTTLIWTMAELMRSPRVMAKVQAEMRQAFQGKNTITEDDLAQLSYLKMVL 3606
3605  KESFRLHCPVPLLSPRKCRETCKIMGYDVPKGTSVFVNVWAICRDSMYWKNAEEFKPERF 3426
3425  EDNDIELKGSNFKFLPFGSGRRICPGINLGWANMEFALANLLYHFDWNLPDGMLHKDLDM 3246
3245  QESPGLVAAKCSDLNVCPVTHISSSCA* 3162

>AC087550.2 $F chromosome 10 clone nbeb0016G17 74% to AC087554 seq 14167
132002 MDDKLLQLLLLALAVSVVSIVTISKLVYRATNKPRLNLPPGPWTLPVIGSLHHLVMRSPS 131823
131822 IHRSMRALAEKHGPLMQVWLGEVPAVVVSSTEAAEEVLKNQDARFADRFITTTLGAITF 131646
131645 GGGDLAFAPYGERWRHLKMLCTQQLLTAARVRSFRRIREEEVARLVRDLAASAAGGGEVA 131466
131465 VNLSERVARLVNDIMVRCCVGGRSKHRDEFLGALCTALSQTSWLTVADLFPSSRLARML 131289
131288 GTAPRRALASRKKMELILEQIIQEREEMTTDRSGDGEAGPTNECFLDVLLRLQK 131127
131126 EGDTPIPITMELIVMLLF 131073 (0)
       DIVSGGTETSTIVLNWTMAELIRTPRVMTKAHAEVRQTFQAKSTITEDDDISGL 129741
129740 TYLKMVIKESLRMHCPVPLLGPRRCRETCKVMGYDILKDTTVFVNAWAMCRSSIYWNDAE 129561
129560 EFKPERFENKCIDYKGSNFEFIPFGSGRRMCAGMNLGMADVEFPLASLLYHFDWKLPDGM 129381
129380 SPEDIDMQEAPGLFGGRRTSLILCPITRVAPSDLQVIV* 129264

>AP005114.1b $F (japonica cultivar-group) chromosome 2 
BI811079.1 clone K015D02.Length = 347 57% to AC087550.2 C-helix
120645 MEDNKLILALGLSVLFVLLSKLVSSAMKPRLNLPPGPWTLPLIGSLHHLVMKSPQIHRSL 120824
120825 RALSEKHGPIMQLWMGEVPAVIVSSPAVAEEVLKHQDLRFADRHLTATIEEVSFGGRDVT 121004
121005 FAPYSERWRHLRKICMQELLTAARVRSFQGVREREVARLVRELAADAGAGGDAGVNLNER 121184
121185 ISKLANDIVMVSSVGGRCSHRDEFLDALEVAKKQITWLSVADLFPSSKLARMVAVAPRKG 121364
121365 LASRKRMELVIRRIIQERKDQLMDDSAAGAGEAAAGKDCFLDVLLRLQKEGGTPVPVTDE 121544
121545 IIVVLLF (0) 121565
122803 DMFTGASETSPTVLIWILAELMRCPRVMAKAQAEVRQAAVGKTRITENDIVGLSYLKMVI 122982
122983 KEALRLHSPAPLLNPRKCRETTQVMGYDIPKGTSVFVNMWAICRDPNYWEDPEEFKPERF 123162
123163 ENNCVDFKGNNFEFLPFGSGRRICPGINLGLANLELALASLLYHFDWKLPNGMLPKDLDM 123342
123343 QETPGIVAAKLTTLNMCPVTQIAPSSAEDAS* 123438

>AQ869247.1 (partial) nbeb0034D08r CUGI Rice BAC genomic Length = 447 53% to 99A1
AZ130570.1 OSJNBb0104D19r CUGI Rice BAC genomic Length = 327
(0) DMFAAGSETSSTTLEWALTELVRNPHIMEKAQSEVREIFRGENKLTEEMMD 279
280 KLSYLRLVIRETLRLHLPVPFLLPRQCREPCSVMGYDIPVGTKVLVNAWAIARDNQ 447

>AP005302.1c $F (japonica cultivar-group) chromosome 6 
AF088220 48% to 701A3 mRNA BF430785 OG04F06T3
51052 MESMLVAGAGAAAVAAVGGLVAAAALADKLVAAPPPRKNRANPPP (1) 51186
      AVPGLPIIGNLHQLKEKKP 54671
54672 HQTFAKWSETYGPIYTIKTGASPVVVLNSTEVAKE (0?)
56126 AMIDKFSSISTRKLPKAMSVLTRKSMVAISDYGDYQKMAKRNIMIGMLGFNAQ (0?) 56290
56388 FRGTRERMISNVLSTLHKLVSLDPHSPLNFRDVYINELFSLSLIQ (0?) 56525
56680 SLGEDVSSVYVEEFGREISKDEIFDVLVHEMMMCAVEADWRDYFPYLSWLPNKSFDTIV 56856
56857 STTEFRRDAIMNALIKKQKERIARGE (0?)
57020 ARASYIDFLLEAERSAQLTDDQLMLLLSESILAAADTVLVTTEWTMYEIAKNPDKQ (0) 57187
57534 ELLYQEIREACGGEAVTEDDLPRLPYLNAVFHETLRLHSPVPVLPPRFVHDDTTLAGYD 57710
57711 IAAGTQMMINVYACHMDEKVWESPGEWSPERFLGEGFEVADRYKTMAFGAGRRTCAGSLQ 57890
57891 AMNIACVAVARLVQELEWRLREGDGDKEDTMQFTALKLDPLHVHLKPRGRM* 58046

>AP005302.1d (japonica cultivargroup) chromosome 6 new ortholog to AAAA01009744.1 
88169 MESLLAAGAGGIGVAAAAAVVAATLAVVPPKDRGNNPPP 88285 (1)
      AVPGLPVIGNMHQLKEKK 91433
91434 PHHTFTKWSKTYGPIYTIKTGASSVVVLNSTEVAKE 91541 (0?)
96519 QAMIEKFSSISTKKLPKALSVISRKNMVSISDYGDFYKMAKRNIMLAILGFNAQ (0?)
      FCDTRERMVSNVLSSLHKLVAVDPHSPLNF 96878
96879 REVYTTELFGLSLIQ (0) 96923
97128 NLGEDVCSVYVEEFGREISKEEIFHVLVHEILSCVVEPDWRDYFPYLSWLPNKSFETIV 97304
97305 SSTEFRRDAVMNALIKRQKERIARGE
      ARISYIDFLLEAKNSTQLTDHQLMLLLAESIAAAVDTVLVTTEWAMYELAK 97664
97665 NPDKQ (0) gc boundary
      EWLYREIREVCGGK 97844
97845 AVTEEDLPRLPYLDAVLHETLRLHSPVPVLPTRFVHDDTTLAGYDVPAGTQVMINVFGCH 98024
98025 MDEEAWESPGEWSPERFLGEGFKLADRYKTLAFGAGRRTCAGSQQAVSIACVAIARFVQE 98204
98205 LQWTLREGDGDKEDTMQYTALKLHPLHVHLKPRG 98306

>AP005302.1a (japonica cultivar-group) chromosome 6 = AC087597.1a N-terminal missing
identical to AC087597.1c (duplicate)
     AVPGLPIIGNLHQLKE 2571
2572 KKPHQTFAKWAEIYGPIYTIRTGASSVVVLNSTEVAKE 2685
2780 AMVAKFSSISTRKLSKALTVLTRDKSMVATSDYCDFHKMVKRYVMSSMLGTSAQ 2950 (0?)
3102 FRDIRDMMIHNMLSTFHKLVKDDPHAPLIFRDVFKDELFRLSMIQ 3236
3320 SLGEDVSSVYVDEFGRDISKEEIYNATVTDMMMCAIEVDWRDFFPYLSWVPNKSFETR 3493
3494 VFTTETRRTAVMRALIKQQKERIVRGE 3574
3691 AKTCYLDFLLAENTLTDEQLMMLVWEALIEAADTTLVTTEWAMYELAKNPDKQ (0)
     ERLYQEIREVCGDETVTEEHLPRLPYLNAV 4038
4039 FHETLRRHSPVPLIPPRFVHEDTKLAGYDVPAGTE 4143
4711 MVINLYGCNMNRKEWESPEEWVPERFAGGRLEVADMYKTMAFGAGRRACAGSLQATHIA 4887
4888 CAAVARFVQEFGWRLREGDEEKVDTVQLTAYKLHPLHVHLTRRGRM 5025

>AC087597.1a $F chromosome 1 clone OSJNba0088C22 
CYP703A1-like BI118632.1 EST identical to AP005302.1b (duplicate)
66118 MESLLAAGAGGIGVAAAAVGGFIAAATLAVAPPKNRRNPPP 65996 (1) 
61809 AVPGLPIIGNLHQLKEKKPHQTFTKWAEIYGPIYTIRTGASSVVVLNSTEVAKE 61648 (0)
61535 AMVAKFSSISTRKLSKALTVLSHDKSMVATSDSGDFHKMGKRYIMLSMLGTSAQ 61374 (0)
      FRDTRDMIINN 61179
61178 MLSTFHQLVKDDPHAPLIFRDVFKDELFRLSMIQ 61077 (0)
60993 SLGEDVSSVYVDEFGRDISKEEIYNATVTDMMMCAIEVDWRDFFPYLSWVPNKSFETRV 60817
60816 FTTESRRTAVMRALIKQQKERIVRGE 60739 (0)
      ARTCYLDFLLAENTLTDEQLMMLVWEALIEAADT 59693
59692 TLVTTEWAMYELAKNPDKQ (0) gc boundary = BI118632
      ERLYQ 59513
59512 EIREVCGDEAVTEEHLPWLPYLNAVFQETLRRHSPVPLIPPRFVNEDTMLAGYDVPAGTE 59333
57544 MVINLYGCNMNKKEWESPEEWAPERFAGGRFKVADMYKTMAFGAGRRVCAGSLQATHIA 57368
57367 CAAIARFVQEFGWRLREGDEEKVDTVQLTAYKLHPLHVHLTPRGRM* 57227

>AP005302.1b $F (japonica cultivar-group) chromosome 6 = AC087597.1b see below
21490 MESLLAAGAGGIGVAAAAVGGFIAAATLAVAPPKNRRNPPP (1) 21612
      AVPGLPIIGNLHQLKEKKPHQTFTKWAEIYGPIYTIRTGASSVVVLNSTEVAKE 25960
26073 AMVAKFSSISTRKLSKALTVLSHDKSMVATSDSGDFHKMGKRYIMLSMLGTSAQ
      FRDTRDMIINN 26429
26430 MLSTFHQLVKDDPHAPLIFRDVFKDELFRLSMIQ 26531
      SLGEDVSSVYVDEFGRDISKEEIYNATVTDMMMCAIEVDWRDFFPYLSWVPNKSFETR 26788
26789 VFTTESRRTAVMRALIKQQKERIVRGE 26869
27814 ARTCYLDFLLAENTLTDEQLMMLVWEALIEAADTTLVTTEWAMYELAKNPDKQ (0) 27981
      ERLYQEIREVCGDEAVTEEHLPWLPYL 28161
28162 NAVFQETLRRHSPVPLIPPRFVNEDTMLAGYDVPAGTE 28275
30064 MVINLYGCNMNKKEWESPEEWAPERFAGGRFKVADMYKTMAFGAGRRVCAGSLQATHIA 30240
30241 CAAIARFVQEFGWRLREGDEEKVDTVQLTAYKLHPLHVHLTPRGRM 30378

>AC087597.1b $F chromosome 1 clone OSJNba0088C22 
CYP703A1 like
115151 MEAFVSGGAGGVGAAAVVGVFVAAAVVGGFVAAVALAERAGVIAPRKRPNAPP 115309 (1)
116986 AVPGLPIIGNLHQLKEKKPHQTFTKWAEIYGPIYTIRIGASSVVVLNSTEVAKE 117147 (0)
       AMVAKFSSISTRKLSKALTVLTRDKSMVATSDYGDFHKMVKRYVMSSTLGTSAQ 117420 (0)
117581 FRDTRDMMINNMLSTFHKLVKDDPHVPLIFRDVFKDELFRLSMIQ 117715 (0)
117799 SLGEDVSSVYVDEFGRDISKEEIYNATVTDMMMCAIEVDWRDFFSYLSWVPNKSFETRV 117975
117976 FTAEARRTAVMRALIKQQKERIVRGE 118053 (0)
119442 AKICYLDFLLAENTLTDEQLTMLVWEELIEAADTTLVATEWAMYELAKNPDKQ 119612 (0) 
119689 ERLYQEIREVCGDETVTEEHLPRLPYLNAVFHETLRRHSPVPLIPPRFVHEDTKLAGYD 119865
119866 VPAGTE (0)
120626 MVINLYGCNMNKKEWESPEEWVPERFTGGRLEVADMYKTMAFGAGRRACAGSLQVMHIA 120802
120803 CTAIARFVQEFGWRLTEGDEEKVDTVQFTAYKLHPLHVHLTPRGRM* 120943

>AC087597.1c $F chromosome 1 clone OSJNba0088C22 N-terminal exon =  AU163509.1
CYP703A1 like, BE039821.1 OC08F06 OC cDNA 5' similar to ga3.Length = 993 59% to 701A3
128653 MEAFVPGGAGAAAXAVGGFVAAAALAERAGVIAPRKRPNAPP 128781 (1) alt exon 1
129691 PGGVGAAAVAAVGGFVAAAALAERAGVIAPRKRSNAPP 129807 (1) alt exon 1
131498 MEAFVPGGAGAAAPAVAAVGGFVAAAALAERAGVIAPRKRTNAPP 131632 (1) 
134410 AVPGLPIIGNLHQLKEKKPHQTFAKWAEIYGPIYTIRTGASSVVVLNSTEVAKE 134571 (0)
134666 AMVAKFSSISTRKLSKALTVLTRDKSMVATSDYCDFHKMVKRYVMSSMLGTSAQ 134827 (0)
134988 FRDIRDMMIHNMLSTFHKLVKDDPHAPLIFRDVFKDELFRLSMIQ 135122 (0)
135206 SLGEDVSSVYVDEFGRDISKEEIYNATVTDMMMCAIEVDWRDFFPYLSWVPNKSFETRV 135382
135383 FTTETRRTAVMRALIKQQKERIVRGE 135460 (0)
135577 AKTCYLDFLLAENTLTDEQLMMLVWEALIEAADTTLVTTEWAMYELAKNPDKQ 135747 (0)
       ERLYQEIREVCGDETVTEEHLPRLPYLNAV 135924
135925 FHETLRRHSPVPLIPPRFVHEDTKLAGYDVPAGTE 136029 (0)
136597 MVINLYGCNMNRKEWESPEEWVPERFAGGRLEVADMYKTMAFGAGRRACAGSLQATHIA 136773
136774 CAAVARFVQEFGWRLREGDEEKVDTVQLTAYKLHPLHVHLTRRGRM* 136914

>AP004591.1 $F chromosome 8 clone P0582D05, Length = 150428
Same as AU064265 C73739 71% to 703A2 same clone as C99519 C73767
C99519.2 Rice panicle cDNA cloneE20326_8A.Length = 710 same as AU094669
AU094669.1 Rice panicle clone E20429.Length = 453 same as C99519
AQ163513 nbxb0007K19r very similar to C99519 71% to 703A2
AU176480.1 cDNA clone E20774. Length = 300 334-430 
This last EST is probably the C-term of a single 703A gene in rice (78% to 703A2)
But it might belong to a second gene.  I combined them because Arab. Has only 1 703A 
gene
4227 MDPFLLSIILCSW
4188 IFVVVSWKKLNCMRRRLPPGPPRWPIFGNLLQLSPLPHKDFARFCTKYGPLVYLRLGTID 4009
4008 AITTDDPEVIREILIRQDEVFASRPRTLAAVHLAYGCGDVALAPLGPNWKRMRRVCMEHL 3829
3828 LTTKRLESFAAHRALEAEHLCQFVWAKAQSGKPVNLREVLGAFSMNNVTRMLLGKQYFGL 3649
3648 QSAGPGEAMEFMHITHELFWLLGLIYLGDYLPAWRWLDPYGCEKKMREVEKKVDDFHQ 3475
3474 KIIDEHRKAREAKKSASLDDDNKEDMDFVDVLLSLPGENGKEHMDDVEIKALMQ 3313
2680 DMIAAATDTSSVTNEWVMAEVIKNPRVLRKIQEELDGVVGRGRMVAESDLGQLTYLRCVV 2501
2500 RESFRMHPAGPFLIPHESLKPTTIMGYDIPARTRIFINTHALGRNTRIWDDVDAFRPERH 2321
2320 LPASADGGRVEISHLPDFKILPFSAGKRKCPGAPLGVILVLMALARLFHCFDWSPPDGLR 2141
2140 PDDIDTQEVYGMTMPKAKPLVAVATPRLPPQMYGRHGKQV* 2018

>AP003870.1 $F chromosome 8 clone OJ1116_A04  70% to D24290 related to 92A
same as AP004653.1 chromosome 8
9007 MEPSALATLLSMALAAFLLLGLIKRKGSRRGYN
8908 LPPGPTPWPVIGNFNLIGALPHRSIHELSRKYGELMLLRFGSFPVVVGSSVAMARLVLKT 8729
8728 HDAVFIDRPRTASRKHTTYGYADITWSPYGAYWRQARRICVTELFSARRVASFEHIRA 8555
8554 DEVRALVRGLFAAASSGRSGAVHLNRDHLSTLSMNVITRMVLGKRFFGEGADAAEGPVST 8375
8374 LSEFKWMLDELLLLNGVLNVGDWIPWVDWMDLQGYVRRMKKVGKMFDAFMEHVLDEHSER 8195
8194 RRREGEAFVARDMVDVLMDLADDPSLEIKLGRVGVKAFTQDLIAGGTESSSVTVEWALSE 8015
8014 LFKNPAIFATATDELDRVVGRCRWVTEKDIPNLPYLDAIMKETMRMHPIVPLLIPRVARD 7835
7834 DAAVAGYDIPKGARVLINVWTIGRDPELWDAAEEFMPERFIGSRIDVKGQDFELLPFGSG 7655
7654 RRMCPGYNLGLKVMQLSLANLLHGFAWRLPEGMKEEELSMDEVFGLSTTRKYPLQVVVEPRLPVQLYSL* 7445

>AP003927.1 $P chromosome 7 clone OJ1120_A01, pseudogene fragment related to CYP92A
same as AP003846 109108-108690
58933 PLLVPHHARENTVVAGYDVPAGTRVLVNLWAIARDPASWLDRPDAFLPERFLPGAGSCDD 59112
59113 VVDVHGQHFELLPFESGRQIYPATNLAKKMVALGVASLLQGFAWR 59247 7 aa deletion and frameshift
59250 EDVSMEELVGLSTRRKVPLVAVAEPRLPAHERR* 59351

>D40799 $P 56% TO 76C1  11/94 447-476 REGION 2 diffs with AP003927.1 = AQ578139
RRISPATNLAKKMVALGVASLLQGFAWRLPD (fs) EDVSMEELVRLSTRRKVPLVTVAEPRLPPHLYAAAAA*

>AT003629.1 Magnaporthe grisea infected rice cDNA cDNA clone mgir2I19.Length = 468
nearly identical to AP003927.1 exact match to AP003846 in caps region
cannot extend ortholog to aaaa01010885.1 $FI 3 diffs, 2 at kn end
119 knvsmkxLVGLSTRRKVPLVAVAEPRLPAHlyagtaa* 232

>AP003378.1 $F chromosome 1 clone P0047E11, 49% to 706A5
28850 MATANLLYAALLVPTVLYLAVTRRRSRRLPPGPVGLPLVGSLPFIDPNLHTYFASLAAKH 29029
29030 GPILSIRLGSKVDIVVNSAQLAREVLRDQDSVFANRVMLDAGDAVSFGGAQNIVGNPLGP 29209
29210 MWRLLRRVCVQEMMSPAGLASVHGLRRREFRSTLRYLHSKPGEPVDVGAQMFLNTM 29377
29378 NVITGTMWGGTIGSESERSAVGSEFRGLVAEVTELLGTPNVSDLFPVLKPFDLQGIRRKM 29557
29558 ERLRSRFDLLFTKIIQQRMRSQQDGGEMTTDFLECLLKMEKEGSDGKTTFTMDNVKGFLL 29737 (0)
      EMVVGGTDTTSNSVE 29917
29918 WIMAELLQNPQVLNKVQQELDSIVGRDAVVEESHLPQLHYLRMVIKETLRLHPPVPLLVP 30097
30098 HSPSAAATVGGYHVPEGCRVLINVWAIQRNPLVWNKPLDFNPDRFARDGGHKGDFTGSQL 30277
30278 DYLPFGSGRRMCAGMAMGEKVMVYSVAMLLQAFDWKLPQGVQLDLSEKFGIVMKKATPLV 30457
30458 AIPTPRLSKPELYYS* 30505

>AP004704.1 $F chromosome 8 clone P0544G09, Length = 137478
New 65% to AP003378.1 49% to 706A4 same as AU174693.1 clone F10219
73% to AP003378.1 3 diffs with BE228457.1 72% to AP003254
BE228457.1 98AS2635 Rice Seed cDNA clone 98AS2635.Length = 503 51% to 706A5
100654 MEVTSTPAMLLYAALFAAALLYLAVAVRRGRGAGLPPGPTGLPLVGSLLSLDPELHTYFAGLAARYG 100454
100453 PIFSIRLGSKLGVVVTSPALAREVLRDHDLVFSNRDTPDAACSISYGGGQNIVWNPVGPT 100274
100273 WRLLRRICVHEMIGPAGLDSLHGLRRREFMATLHHLRARSGEPVNVGAQMFLTVMNVVTG 100094
100093 ALWGGNVGSESERTTVGKEFRELVADITELLGAPNVSDFFPALAPLDIQGIRNKSDLLKD 99914
99913  RFDDIFARIIQKRTESDHAAAAGETASDFLEYMLKLEKEGGDGKTAFTMTNVKALLM 99743 (0)
       DMVIGGTETTSNTVEWGMAEMLQNRGTLRKVRE 99561
99560  ELDAVVGRDGVVEESHLPKLHYLNLVVKETLRLHPALPLMVPHCPGEDATVGGHRVPAGA 99381
99380  RVFVNVWAIQRDPAVWKDPEHFIPERFLPADGGGGRRLDFTGSEQEYMPFGSGRRICAGV 99201
99200  AMAERMVAYSLAMLVQAFDWELPAGERLDLAERFGIVMKKATPLVAVPTPRLSNPQLYSA* 99018

>AP003752.1a $F chromosome 7 clone OJ1332_C12, Best match zea mays AF391808
45% to 709B2
7127 MGYGWALAAVLATWCLLDALSR
7061 LVWRPRAVAAALRRQGVRGPGYRLLVGSLGDINKLRADAAGAGGAAALDVASHDFIPFVQ 6882
6881 PQFRQWIPLY (1)
6770 GRVFLYWFGWTPDMCVADVEVAKQVLSDRTGLFPKNVTTPMLLKLFGRGLVLA 6612
6611 NGDEWQRHKKVVHPAFNTDKLK 6546 (0)
6457 MMTATMAGVARSMVSRWEEKVASHGGKVVIELSSQFEELTADVISHTAFGSSYAEGKQV 6281
6280 FMAIKELQFIAFSSLLSVQIPGS 6212 (2)
6152 RYFPTKKNLKVWRLDRKVRSTLMQIISNRLAAAAKEKAGGGGGGYGEDLLGLMLEAS 5982
5981 APPELGTKRRQPPVLSMDEIIDECKTFFFAGQETTSHLLSWTMFLLSTHPDWQDKLREEA 5802
5801 VRECAGAGAGDDDDQLPTYDMLGKLKLVGMQF 5706 (0)
5130 FLLETLRLYSPVPAIRRRTAAAVEMGGVTVPGGTMLTFPIATMHRDEEVWGAD 4975
4974 AGVFDPMRFDGGGGAMAKLLSFSTGPRACVGQSFA 4870
4869 MVEAKAVVAAILRRFRLELSPEYVHAPTDVITLRPKHGLMVVTRVHA* 4723

>AP003752.1b $F chromosome 7 clone OJ1332_C12, Best match zea mays AF391808
18593 MGNLGWMVAAAVAAVVASWAFDAVVK
18515 LVWRPRAITRRLRAQGVGGPGYRFFSGNLGEIRRLRDEGAGVVLDVSSHDFVPIVQPHFR 18336
18335 KWIPLY 18318 (1)
18171 GKTFMYWFGARPTICLADVSMVRQVLSDRTGMYPKNVSNPYFARLLGKGLVLTDGDEWKR 17992
17991 HRKVVHPAFNMDKLKMMTVTMSDCAQSMISEWESELGTKGDIVEIELSRRFEELTA 17824
17823 DVISHTAFGSSYKEGKQVFLAQRELQFLAFSTFLSIQIPGSSYLPTKKNLKTWSV 17659
17658 DKKVRSMLTDIIKSRLNNKDVAGYGNDLLGLMLEACAPE 17542
17541 HGESQPQLSMDEIIAECKTFFFAGHDTTSHLLTWTMFLLSTHPEWQEKLREEVATECDGK 17362
17361 VPTGDMLNKLKLVNMFLLETLRLYGPVAFIQRRVNAELELGGITVPEGTVLS 17206
17205 IPIATIHRDKEVWGEDADIFKPERFKNGVSKAGKYPNAL 17089
17088 LSFSSGPRACIGQNFAMIEAKAVIAMILQRFSFTLSPKYVHVPTDVITLRPKYGLPMI 16915
16914 LKSLKV* 16894

>AP003752.1c $F chromosome 7 clone OJ1332_C12, Best match zea mays AF391808
same as AP004334.1 3175-4374 region
25600 MGFEWTVATAAAAVAASWAFIAVVVK
25522 LVW*PRAITRRLGAQGVAGPGYRFFSGNLSEIRRLRAEGANLVLDVSSHDFVPIVQPHIR 25343
25342 KWIPLY 25325
25229 GKTFLYWFGTRPNICLADMNMVRQVLSDRTGMFPKYIDNMQFARLLGKGLVLTDDDEWKR 25050
25049 HYKVVHPAFDMDKLKMMTETISDYAQSMMFEWESELGMKGGSTEIELSRWFEELTA 24882
24881 DVISRTAFGSSYREGKQVFLAQRKLQFLAFSVFLTIQIPGFSYLLTKK 24738
24737 NLKTWSLDKKVRSMLMNIIKSRLTNKETMGYGNDLLGLMLEACVPEHGGSQPQLSMDD 24564
24563 IIAECKTFFFAGHDTTSQLLTWTMFLLSTHQHWMEKLRKEVRMVCNDEVPTGDMLNKLKL 24384
24383 VNMFLLETLRLYGPVSLVTRRAGTDVKLGSIKVPKGTILTIPIATIHRDKEV 24228
24227 WGEDADEFKPERSENGVLNAAKHPSALLSFSIGLRSCIG 24111
24110 QNFAMIEARTIIAMILQRFSFTLSPKYVHTPISVITLRPKYGLPMILRSLKVKRDR* 23937

>AP003752.1d $P chromosome 7 clone OJ1332_C12, Best match zea mays AF391808
same as AP004334.1 8332-8514 region lone C-term pseudogene fragment 
29328 NGVSKEAKNPNALLSFSSGPRSCIGQNFAMLEAKAVITMILQRFFFTLSPKYVHA 29164
29163 PTSLITLWPKYGLPKILRSLKMCRMESQELSQPEGGEWLVYPKT* 28979

>AP003752.1e $F chromosome 7 clone OJ1332_C12, Best match zea mays AF391808
same as AP004334.1 11654-12850 region
34102 MGNFVWMVAAAAAAVASWAFIAVVVK
34024 LVWRPRAISRRLRAQAVGGPGYRFFSGNLGEIRRLRAEGAGVVLDVSSHDFVPIVQPHFR 33845
33844 KWVSLY (1)
33706 GKTFLFWFGAQPNICLADINIVRQVLSDRTGMYPKDLTNPYFAHLLGKGLVLIDGDEWKR 33527
33526 HYKVVHPAFDMDKLKMMTVTISDCTGSMMSEWESELGMKGGSAEIELSQRFQELTA 33359
33358 DVISRTAFGSSYSEGKQVFLAQRKLQFLAFSMFLTIQIPGFRYLPTKK 33215
33214 NLKIWSLDKKVRSMLRNIIKIRLANKDTMGYGNDLLGLMLETCAPEHDESQQLS 33053
33052 MDEIIAECKTFFFGGHDTTSHLLTWTMFLLSTHPEWMRKIRKEVTTMCGDEVPTGDMLNK 32873
32872 MNLLNMFLLETLRLYSPVSLISRRTGTNAKFGGIKVPEGTILRIPIATIHRD 32717
32716 KEVWGEDADEFKPARFENGVSKAAKHPNALLSFSNGPRS 32600
32599 CIGQNFAMIEAKAVITMILQRFSFTLSPKYVHTPISVITLRPKYGLPMILRSLKVKRDL* 32420

>AP003745.1 $P chromosome 7 clone OJ1123_B01 
AP004264.1  chromosome 7 clone P0025D09 missing first 113 aa same as AP003745
two in frame stops pseudogene same as AP003745
sequence starts in mid exon, probable pseudogene related to AP003752 18593-16894
60823 VLSDRTGIYPKNLTNPHSVRLLGKGLVLTDGD 60917
60918 EWKRHRKVVHPAFNMDKLKMMTMTMSDCSRSMMSEWESELGAKGGLAEIELSRRFEELTA 61097
61098 NVISHTVFGSSYKEGKQVFLAQRELQFLAFSTFLTVQVPGF 61220 (2)
61310 SYLPTMKNFKTWSLDKKVRGMLMDIIKTRHANKDVVGYGNDLLGLLLEACAPEHRESHPQ 61489
61490 LSMDEIIDECKTFFFAGHDTTSHLLT*TMFLLSTHPDW*EKLREEIAMECGDEMPTGDML 61669
61670 NKLKMVNMFLLETLRLYSPVLLIRRKVGTDIELGGIKMPEGALLMIPIATIHRDKEVWGE 61849
61850 DADEFRPERFENGVTRAAKHPNALLSFSSGPRSCIGQNFAMIEAKAVIAMILQRFSFTLS 62029
62030 PKYVHAPTDVITLRPKYGLPMILKSLKL* 62116

>AP004264.1 $F chromosome 7 clone P0025D09 missing first 113 aa 
Like AP003752 and CYP709B2 47% to 709B1
83% to AU164994 80% to C73388 91% to AU181168 possibly = C96617 
= AU093233 clone C11668  AU091277 opp end = AU062564 
58901 MGLLAWMVAAAAAAVLASWAFSAVVHLVWRPRAISRRLRAQGVGGPGYRFFSGNLGEIKR 59080
59081 FRGDGAGVVLNVSSHDFLPIVQPHFRKWIPLY 59176 (1)
59544 GRTFLYWFGAQPNICLADVSMVWQVLSDRTGIYPKNLTNPHFVRLLGKGLVLTDGDEWKR 59723
59724 HRKVVHPAFNMDKLKMMTMTMSDCSRSMMSEWESELAAKGGLVEIELSRRFEELTADVIS 59903
59904 HTAFGSSYKEGKQVFLAQRELQFLAFSTFLTVQIPGF 60014 (2)
60110 SYLPTMKNFKTWSLDKKVRGMLMDIIKTRHANKDVAGYGNDLLGLMLEACAPEHGESCPQ 60289
60290 LSMDEIIDECKTFFFAGHDTTSHLLTWTMFLLSTHPDWQEKLREEIAMECGDKVPTGDMLNKLKMVNMFL 60499
60500 LETLRLYSPVSLIRRKVDTDIELGGIKMPEGALLTIPIATIHRDKEVWGEDADEFRPER 60676
60677 FENGVTRAAKHPNALLSFSSGPRSCIGQNFAMIEAKAVIAMILQRF 60814
60815 SFTLSPKYVHAPTDVITLRPKYGLPMILKSLKL* 60916

>AC084319 $F same as AC084404 but end is present CDS 2242..2659 modified
join(8171..8199,8599..9057,9159..9776) modified 44% to 71B2
AC084404 56% to 71B N-term contig ends in repeat region
MAALNHQPHTPRRLRRPHLLEPAPKEPQQPPQAAARPDEAPHHR
QPAPDRPSAAPEPGGAGGVARLGTVSVVVLSSPKAAREALKVHDPECCSRSPSAGPRM
LSYGYKDVAFSPYSNYVRNMRKLFVVELLSMRRVQAAL (1)
MLPDYYCCMVEKLIEKLTRNGRNAVAINEHIFSTVDGIIGTFAL
GETYAAEEFKDISETMDLLSSSSAEDFFPGSVAGRLVDRLTGLAARREAIFRKLDRFF
ERIVDQHAAADDDGPAAARRKADDKGSAGSDLVHELIDLWKMEGNTKQGFTKDHVKAM
LLDTFVGGITTTSVTLHWAMSELIRNPRVMKKAQDEIRAVVGEKERVQHHDMPKLKYL
KMVVKETFRLHPPATLLVPRETTRHFKV 
GGYDIPEKTKVIVNA*AIGR (stop codon TAG instead of W = TGG)
DPNIWKDPEEFIPERFEEMDIDFNGAHF
ELVPFGSGRRICPGLAMGVANIEFILASMLFCFDWELPHGVRKEDIDMEEAGKLTFHK
KIPLLLVPTPNKAPN

>AP002968 $F 40% to 71B24
AP003204 40% to 71B24 CDS complement(join(121487..122125,122196..123113))
AQ870215.1 nbeb0036N08f CUGI Rice BAC genomic Length = 754 58% to 99A1
MAELPLYLLLLALLVAVPFLCLTRWSLRHGGGGGGRLPPSPWAL
PVIGHLHHVAGALPHRAMRDLARRHGPLMLLRLCELRVVVACTAEAAREVTKTHDLAF
ATRPITPTGKVLMADSVGVVFAPYGDGWRTLRRICTLELLSARRVRSFRAVREEEVGR
LLRAVAAAAAVAALTTPGATAAVNLSERISAYVADSAVRAVIGSRFKNRAAFLRMLER
RMKLLPAQCLPDLFPSSRAAMLVSRMPRRMKRERQEMMDFIDDIFQEHHESRAAAGAE
EDLLDVLLRIQSQDKTNPALTNDNIKTVIIDMFVASSETAATSLQWTMSELMRNPRVM
RKAQDEVRRALAIAGQDGVTEESLRDLPYLHLVIKESLRLHPPVTMLLPRECRETCRV
MGFDVPEGVMVLVNAWAIGRDPAHWDSPEEFAPERFEGVGAADFKGTDFEYIPFGAGR
RMCPGMAFGLANMELALAALLYHFDWELPGGMLPGELDMTEALGLTTRRCSDLLLVPA
LRVPLRDHER

******200, 57F japonica, 23 japonica partials, 20 pseudogenes
110 seqs total
11 indica accessions in 10 sequences, 2F, 3P, 6 partial


CYP72 clan contains five families 72, 709, 714, 715, 721 (39 sequences)


>AL662934.1 $F chromosome 4 clone OSJNBa0027O01 Length = 150012
59% to 99A1
4399 MEINSAATLTLVSLLTLPILLALLTRKSSSKKRRPPGPWNLPLVGGLLHLLRSHPQVALRELASKYGP 4602
4603 VMFLRMGQIDTVVVSSPAAAQEVLRDKDVMFASRPSLLVSEIFCYDNLDVGFAPYGAYWR 4782
4783 MLRKLCTVELLSTKVVRQLAPVRNDETLTLVRNIKAASSGHGGGGGKKPVTLARLLTTCT 4962
4963 NTITAKAAFGQACGVELQEQFLTALDVGLKFSGGFCFGDLFPSLRFIDAMTGLRSRLW 5136
5137 RARGQLDSVFDKIIAQCEEHQGDSLVNVLLRIRDQGDLEFPFGTTNIKAIIL (0)
     DMFTGGTETTSSAAEWVMSELMRNPEVMAKVQA 5481
5482 EVRRVFDNKSPQDHEGLIDNLRYMKMVIKETMRLNPVLPLLMPHLCRETCDIGGYEVVEG 5661
5662 TRVVINSWAMARSPEYWDDAEEFKPERFEDGMADYKGSRFEYLPFGTGRRRCPGDTFGMV 5841
5842 LLELIVARLLYYFDWSLPAGMQPDDVDMDFVVTATTRRKNHLQLVASPYKLAPIQI* 6012

>AC104708.1a two $F 72A like seqs 46% to 721
same as BI305681.1 D15759 42% TO 72B1 REGION = AQ913924
27908 MAALLGILLAAFLAAAAVLFALRLLHSFLWVPRRLERRFRRQGIRGPPRRPLSGNAAGY 28084
28085 RDLLAAAQSAPLASFHHAVVARVAPHHREWPERYGRPFVFWLGPRPRLVVSDPEVVKAVL 28264
28265 TDTTGTFGKAGPGGNNPLARQLFSEGLVGLTGEAWARHRRVIAPAFNMERVK 28420 (0)
29030 AWIPEITSITSSMLDKWELQDEVRTEFEIDVNKEFHTLIADVISCVAFGSSYEEGKRVFQLQ 29215
29216 EEQLKLVILAMRTVYIPGFR 29272 (2)
29354 FVPTKKNQRRKIVNQEIRNSLRKLIEINGRKCEDSNNLLGMMLSASKIDSEF 29509
29510 KMGIEEIIDECKTFYFAGKETTATLLTWATLLLALHQEWQNKARDEVLQVCGKYEHPKAE 29689
29690 NLSELKI 29710 (0)
29798 VNMVLKETLRLYPPAVFLNKIANRDVKLGKLDIPAGTQLQLPILDIHHDVSIWGADADEF 29977
29978 DPSRFAEGKSYHLGAYFPFGIGPTICVGQNLAMVEAKVALAMILQRFALVVSPSYVH 30148
30149 APMLVVTLQPQYGAQVLVHKI* 30214

>AC104708.1b two $F 72A like seqs
same as AQ686474 nbxb0072B17f AQ853376.1 nbxb0011G03f 49% to 721 
74% identical to D15759 probably in the 721 family
AQ862098.1 nbeb0018C10r CUGI Rice BAC genomicLength = 781 52% to 72A8
41941 MAALLGILLVAFLAAAVPFALRLLHSFLWVPRRLERRFRRQGIRGPPRRPLSGNAAGYRDLLA 42129
42130 AAQSAPLASFHHAVVARVAPHYREWPERYGRPFVFWLGPRPRLVVSDPEVVKAVLTDPTG 42309
42310 TFDKAGSGGGNPLARQLFGEGLVGLTGEKWARHRRVIAPAFNMERIK 42450 (0)
43002 GWISEITAITSSMLDKWEVQDEARAEFEIDVHKEFHTLIADVISSVAFGSSYEEGKRVFELQEEQLKLAI 43211
43212 LAMTTVYIPGFR 43244 (2)
43277 FVPTKKNRRRHMLNQEVRNSLRKLIEINGRKC 43435
43436 EDSNNLLGMMLSASKLGSEFRMGIEEIIDECKTFYFTGKETTANLLTWATLL 43591
43592 LALHQEWQNKARDEVLQACGKSEHPNAENLSNLKI 43696 (0)
      VNMVLKETLRLYPPAMFLNRMVNRDVKLGKLDIPAGT 43909
43910 QLHFPILDIHHDVNIWGTNADEFDPSRFAEGKSYHLGAYFPFGIGPTICVGQNLTMV 44080
44081 EEKVALAMILQRFALVVSR 44137 (frameshift)
44137 SYVHAPMHGVTLQPQYGAQVLAHKI* 44214

Nine rice CYP72A sequences

>CYP72A17 $F AP002839 Oryza sativa genomic DNA, chromosome 1 36553-39431
AG025591.1 strain ND3008 PCR from rice genomic DNA clone T8121T.Length = 401
AG025107.1 strain NC2542 PCR from rice genomic DNA clone T5184T.Length = 504
AU071192 very similar to AQ050520 = 72A17
AP002744 CYP72A17 join(109468..109819,110022..110245,110529..110781,
111446..111819,111915..112346)
MGIGIGIGIGIGIGTGTGAALPFGEASPWSLLGGAVAALLLVWAAQMLEWAWLAPRRMERALR
AQGLRGTQYRFLHGDLTEDLRLVTAARSKPVPMDRPHDFIPRVAPLLHRALEEH (phase 1 intron)
GRVSFTWFGPMPRVTITDPDLVREVLSNKFGHF
EKTKLATRLSKLLVGGLVILHGEKWVKHRRIMNPAFHAEKLK (phase 0 intron)
RMLPAFSASCSELIGRWENAVAASVGKAELDIWPDFQNLSGDVISRAAFGVRHHEGRQ
IFLLQAEQAERLVQSFRSNYIPGLS (phase 2 intron)
LLPTENNRRMKAIDREIKSILRGIIEKRQKATKNGEASKDDLLGLLLQSNMDYYSDEDGKS
SKGMTVEEIIDECKLFYFAGMETTAVLLTWTMVALSMHPEWQDRAREEILQVFGRNKPDI
NGVSRLKV (phase 0 intron)
VTMVLHEVLRLYPPVVMMNRRTYKEIELGGVRYPAGVMLSLPVLFIHRDAAAWGHDAGEF
DPGRFAEGVARACKDPGAGAFFPFSWGPRICIGQNFALLEAKVALGMILQRFAFELSPAY
AHAPYTVLTLHPQHGVPVRLRRL*

>CYP72A18 $F AP002839 Oryza sativa genomic DNA, chromosome 1 44993-41630
AU100789.1 Rice callus Oryza sativa cDNA clone C50810.Length = 419 C-term
AU102126.1 Rice callus cDNA clone C10756.Length = 571
AZ130306.1 OSJNBb0103O04r CUGI Rice BAC genomicLength = 320
C26802 36% TO 72  8/97 N-TERMINAL 19-67 REGION opposite end = C96903
C96903, C97406 58% IDENTICAL TO 72 C-TERM 65% to AQ050520
C96799, C28139 219-340 REGION 55% IDENTICAL TO 72 opposite end = C97406
D22332        48% TO 72     12/93  7/98 C-HELIX 89-191 REGION
AU081507.1 Rice callus Oryza sativa cDNA clone C12518_12Z.Length = 581
C26235        36% IDENTICAL TO 72     8/97 AMINO ACIDS 89-216 REGION
AP002744 complement(join(114545..114970,115379..115757,
116406..116650,116745..116965,117608..117908))
D21882        53% TO 72   5/93  7/98 245-352 REGION = 72A18
MLMMLGAASQWILAAAAAAAVAALLWLAVSTLEWAWWTPRRLERALRAQG 
IRGNRYRLFTGDVPENVRLNREARKKPLPLGCHDIIPRVLPMFSKAVEEH (phase 1 intron)
GKPSFTWFGPTPRVMISDPESIREVMSNKFGHYGKPKPTRLGKLLASGVV 
SYEGEKWAKHRRILNPAFHHEKIK (phase 0 intron)
RMLPVFSNCCTEMVTRWENSMSIEGM 
SEVDVWPEFQNLTGDVISKTAFGSSYEEGRRIFQLQAESAERIIQAFRTIFIPGYW (phase 2 intron)
FLPTKNNRRLREIEREVSKLLRGIIGKRERAIKNGETSNGDLLG 
LLVESNMRESNGKAELGMTTDEIIEECKLFYFAGMETTSVLLTWTLIVLS 
MHPEWQERAREEVLHHFGRTTPDYDSLSRLKI (phase 0 intron)
VTMILYEVLRLYPPVVFL 
TRRTYKEMELGGIKYPAEVTLMLPILFIHHDPDIWGKDAGEFNPGRFADG 
ISNATKYQTSFFPFGWGPRICIGQNFALLEAKMAICTILQRFSFELSPSY 
IHAPFTVITLHPQHGAQIKLKKI*

>CYP72A19 $F AP002839 Oryza sativa genomic DNA, chromosome 1 comp(53699-51708)
AU100635.1 Rice callus clone C10787.Length = 594
AG024141.1 strain ND3053 PCR from ricegenomic clone ND3053_0_734_1A.Length = 374
AP002744 complement(join(124623..125048,125181..125565,
125743..125987,126105..126584)) this annotation missing first 10 aa
MDPTSVPWSSMVYGLLGLALLWQVHRLLVRLWWQPRRLERALRAQGVRGTSYRFLTGDLKDYGRLS
KEAWARPLPLRCHDIAPRVAPFVHRTIAEHGKACLSWFGPIPKVTIADAEIAKDVLSNKMGHFEKLKFPVLS
KLLADGVANYEGEKWAKHRRILNPAFHLEKLK (phase 0 intron)
LMLPAFSACCEELVGRWAASLGSDGSNEIDVWPEMQSLTGDVISRTAFG
SSYLEGRRIFQLQAEQQELFMGAIQKISIPGYM (phase 2 intron) 
SLPTKNNRRMYQIKNEVESIIRDLVQKR
MHAMKDGERTKDDLLGILLESSTRHADENGHSGPGMTIEEVMEECKVFYFAGMETTAILL
TWTMVVLSMHPEWQHRAREEVLSLFQKNKLDYEGLSKLKT (intron joint not correct missing one 
base?)
VTMILYEVLRLYPPGIGFVRQTYKEMEIGGVKYPAG
VMIELPLLFIHHDPDIWGSDVNEFKPERFAEGISRASNDHGAFFPFGWGPRICMGQNFAL
LEAKMALCMILQRFEFELAPSYTHAPHIVLMLRPMHGAPIKLRAI

>CYP72A20 $F AP002839 Oryza sativa genomic DNA, chromosome 1 56581-59004
AP002744 join(129496..129808,130309..130529,130636..130883, 131004..131919)
MEVGVSLEAGKPAAAPWGMLYYGVPALLVLGALYRAAERCWLGPRRVAGALQGQGLRGTAYRFPAGDL
PENARRSKEARAKPMPPCHDIVPRVAPLLQDIVKEY (PHASE 1 INTRON)
GNVCITWFGTTPRVVIAEPELVKDILSNKFGHFEKFTLKSLGKLIALGLASYEGEKWARH
RRILNPAFHLEKLK (phase 0 intron)
HMLPAFSTCCSEMIDRWDSKLAGSDGPFELDIWQEFQNLTGDVISRTAFGSSFMEGRRI
FQLQEEQADRIIKTIQYIYIPGYL (phase 2 intron)
VSCRYFPTENNRRMKENSREIEGLLRGIIEKRSRAVENGELSGDDLLGLMLKSNMDSGE
PSNLRMSTEDVIEECKLFYFAGMETTSVLLTWTLVVLSMHPEWQHRAREEVLSAFGRDKP
NFDGLSRLKT (intron joint not correct as in gene 3) 
VTMILHEVLRLYPPAVTLSRRTFKEIQIGGITYPAGVGLELPIILIHHNTDVWGKDAHEFKPERFADGISKAT
KTNQQAFFPFGWGPRICIGQNFAMLEAKMALCVILQNFEFQLSPSYTHAPYASVTLHPQH
GAQIILTRL*

>CYP72A21 $F AP002839 Oryza sativa genomic DNA, chromosome 1 61993-63890
AZ045374.1 nbeb0080P16f CUGI Rice BAC genomic Length = 843
AQ857269.1 nbeb0005G04r CUGI Rice BAC genomic Length = 855
AQ865258.1 nbeb0025C03f CUGI Rice BAC genomicLength = 738
AP002744 join(134887..135417,135523..135767,135888..136272,
136380..136805) this annotation adds first 7 aa
AQ050520, AQ272173, AQ159375, AU031882 (opposite end = D24685) 
AQ575977 nbxb0088K02r 
D24685.1 RICR2374A Rice root cDNA clone R2374_1A.Length = 419 = 72A
MVLGAWLMSPASVPWSLLAYGVLGLVLLWQAGRLLHSLWWRPRRLELALRAQGLRGTRYRFLTGD
LGEHGRLNREAWARPLPLRCHDIAPRVAPFLHNAVREHGSACFTWFGPTPKVTITDPDLA
KGVLSNKFGHFEKPKFPTLTKLFSDSLANHEGEKWVKHRRILNPAFHLEKLK (phase 0 intron)
LMLPAFSACCEELVSKWMESLGSDGSYEVDVWP
EMQILTGDVISRTAFGSSYLEGRRIFQLQAEQTERLLKCMQKIVIPGYM (phase 2 intron)
SLPTKNNRKMHQIKKETDSILRGLVDKRMQA
MKEGECTKDDLLGLLLESNMRHTEEDGQSNHGLTIEEVIEECKLFYFAGMETTSVLLTWT
ILLLSMHPEWQDRAREEILGLFGKNKPEYEGLSRLKI (PHASE 0 intron)
VTMILYEVLRLYPPAVTFTRKTYKQMEIGGVTYPAGVIVELPVLLIHHDPNIWGSDAHEF
KPDRFVEGISKASKNPGAFLPFGWGPRICIGQNFALLEAKMALCMILQCFKLELMPSYTH
APYSMVTLRPMHGAQIKLRAI* 

>AP004019.1 $P chromosome 2 clone OJ1118_C03 similar to CYP72A23 of rice
Pseudogene fragment 1
19871 FSTRKNSLLGYKTESLGDDGLCELLDIWREMQRLNGCHFPHSIRQPSYCEMRRIF 19707

>AP004019.1 $P chromosome 2 clone OJ1118_C03 similar to CYP72A23 of rice
Pseudogene fragment 2
37945 LSLMSLAFPTCNEELVWR*TE 37883 frameshift 
37883 NPFNDEGLCVLLDVWPEIQRFH*DAISRTASGGGYRERRRIFQLQSEQSE 37734

>CYP72A22 $F AP002839 Oryza sativa genomic DNA, chromosome 1 66435-68424
AP002744 join(139350..139880,140025..140278,140419..140806,
140914..141339)
MVLGAGLRCPASVPWSSLAYGLLGLVLLWQGGRLLHRLWWRPRRLELALRAQGLRGTRYRFLTGDL
GEHGRLNREAWARPLPLRCHDIAPRVAPFLHSSVREHGKACFSWFGPIPKVTIANPDLAKDVLSNK
FGHLEKHKFQGLTKLLSDGVASHEGEKWVKHRRILNPAFHLEKLK (phase 0 intron)
RMLPAFSTCCEELISRWMESLGSEGSYEVDVWPEMQSL
TGDVISRTAFGSSYLEGRRIFQLQAEQAERLLKCVQKIIIPGYM (phase 2 intron)
SLPTKNNRKMHQIKKEIDSILRGLIGKRMQAMREGESTKDDLLGLLLESNMRHTAEHGQSS
QGLTIEEVIEECKLFYFAGMETTSVLLTWTMLLLSMHPEWQDHAREEILGLFGKNKPEYE
GLSRLKI (intron joint not correct)
VTMILYEVLRLYPPAVTLTRQTYKQIEIGGVTYPAGVIIELPLLLIHSDPDIWGSDVHKF
NPERFAEGISKASKDPGAFLPFSWGPRICIGQNFALLETKMALCMILQHLELELALSYTH
APQSIITLRPTHGAQIKLRAI*

>CYP72A23 $F AP002839 Oryza sativa genomic DNA, chromosome 1 72149-74091
AP002744 join(145064..145603,145697..145941,146081..146465,
146581..147006)
AU067870 Rice callus Oryza sativa cDNA clone C10320_12Z, CYP72 like Nterm
AU067871 AU067869 very similar to AQ050520 K-helix to heme
MVFGELFSRASLPPPWSLLAYGLVGPV
LLWQAGRLLDRLWWRPRRLERALRAQGLRGTAYRFLLGDLREFGRLNEEAWSSAPLPLGC
HDIVPRVTPFVHRNVRDNGRPCCFSWFGPIPSVTITDPAQVRDVLSNKLGHFEKPKLPAL
TKLLADGLTSHDGEKWVKHRRIMNPAFHLEKLK (PHASE 0 INTRON)
LMLPAFSTCCEELVGKWMDSLGPDGSCELDVWPEMQSLTGDVISRTAFGSSYSEGR
RIFQLQTEQAELFIGAIQKFVIPGYM (PHASE 2 INTRON)
YLPTKKNRRMRRINSEVESILRGIIGKRMQAIAEGESTNDDLLGLLLESNMRHADENGRS
SPGMTTEDVIEECKLFYFAGMETTSVLLTWTMVVLSMHPEWQDRAREEVLGLFGRDKPEY
EGLSRLKT (PHASE 0 INTRON)
VTMVLYEVLRLYPPAIVFSRKTYKEMEIGGVVYPRGVILELPVLFIHH
DREIWGRDVHEFRPERFAEGISRASNDRGAFLPFGWGPRVCIGQNFALLEAKMALCMILQ
RFEFELAASYTHAPHTVMTLHPMHGAQMKLRMI*

>AP003278a $F 19863-22437 chromosome 1, PAC clone:P0518F01, similar to 72A22
AP003330.1 50023-47446 chromosome 1 clone B1085F01, CYP72A like 536aa
AP004738.1| Oryza sativa chromosome 6 clone OSJNBa0090D06 chrom. conflict
50023 MVLGGWLLMWAPASSPTILVAFGLLFG
49942 LVLAWQ AGLQLHRLWWRPRRLEKALRARGLRGSSYRFLTGDLAEESRRRKEAWARPLPLR 49763
49762 CHDIAPRIEPFLHDAVVRPEQHYGKPCITWLGPTPEVHVTDPELAKVVMSNKFGHFEKIR 49583
49582 FQALSKLLPQGLSYHEGEKWAKHRRILNPAFQLEKLK 49472 (0)
49071 LMLPVFSACCEELISRWMGAIGSDGSYEVDCWPELKSLTGDVISRTAFGSSYLEGR 48904
48903 RIFELQGELFERVMKSVEKIFIPGYM 48826 (2)
48363 YLPTENNRKMHQINKEIESILRSMIGKRMQAMKEGESTKDDLLGILLESNMRHT 48202
48201 EENSQSSQGLTIKDIMEECKLFYFAGADTTSVLLTWTILLLSMHPEWQDRARKEILGLFG 48022
48021 KNKPEYDGLNNLKI (0) 
      VTMILYEVLR 47842
47841 LYPPFIELKRRTYKEMKIGGVTYPAGVIINLPVLFIHHDLKIWGSDVHEFKPERFSEGIS 47662
47661 KASKDPGAFLPFGWGPRICIGQNFALLEAKMALCLILQRLEFELAPTYTHAPHTMITLHP 47482
47481 MHGAQIKIRAI* 47446

>AP003278b $F chromosome 1, PAC clone:P0518F01, 82% to 72A22
AP003330.1 59493-56536 chromosome 1 clone B1085F01, CYP72A like 516aa
N-term does not match in both, 3278 has MVLGGGWLSMWAPASSPTILAAFGLVGLVLAWQ 
before the AGLQ seq.
59493 MVLEGK AGLQLHRLWWRPRRLEKALRARGLRGSRYRFL
      TGDLAEEGRRRKEAWARPLPLRCHDIAPRVEP 59284
59283 FLHGAVGVGAAHGKPRITWFGPTPEVHVADPELARVVLSNKFGHFEKVSFPELSKLIPQG 59104
59103 LSAHEGEKWAKHRRILNPVFQLEKLK 59026 (0)
58537 LMLPV FSACCEELISRWMGSIGSDGSYEVDCWPEFKSLTGDVISRTAFGSSYLEG 58373
58372 RRIFELQGELFERVIKSIQKMFIPG 58298 (2)
57483 YLPTENNRKMHQMNKEIESILRGMIGKRMQAMKEGESTKDDLLGILLESN 57334
57333 TRHMEVNGQSNQGLTIKDIMEECKLFYFAGADTTSVLLTWTMLLLSMHPEWQDRAREEIL 57154
57153 GLFGKNKPDYDGLSRLKI (0) VTMIL 56977
56976 YEVLRLYPPFIELTRKTYKEMEIGGITYPAGVIINLPVMFIHHDPEIWGSDVHEFKPER 56800
56799 FSEGISKASKDPGAFLPFGWGPRICIGQNFALLEAKMALCLILQRLEFELATSYTHVPHT 56620
56619 IISLHPMHGAQIKVKSYMTISDYSVFY* 56536

>AP003330.1 $F chromosome 1 clone B1085F01 CYP72A like
BE230608.1 99AS821 Rice Seedling cDNA clone 99AS821.Length = 649 63% to 72A9
AP003514.1 $F chromosome 6 clone P0698A06, 
same start with short exon possible MAI (0) LVWR...
4264 MAI (0)
4363 LVWRPYAVGRALGQQGVRGPAYRLLVGNIGEANEMRAAASGGVLDRRCHDVVPRVLPHY 4539
4540 RAWMSRYGKVFVSWTGPFPALCVGDYAMAKEILADRTGLYAKPDPGASILALFGNGL 4710
4711 AFVNGDDWARHRRVVHPAFAMDKLKMMAKTMAECARKVIQAWEARAAAAADGERMVQVEV 4890
4891 GEQFHELTADVISHTAFGSSYRQGKEVFVAQRELQYIAMSALNSVRIPGSQYIPTKANIR 5070
5071 RRQLAKKVRGTLMAIIRERQATAAAAKEDRGYGNDLLGLMLEANAAAGGGEKSMTM 5238
5239 DEIVDECKTFFFAGHDTTSHLLTWAMFLLGTHPEWQQRLREEVLRECCGGGGGDTEALPN 5418
5419 GDALNKLKL 5445 (0?)
5528 MTMVLYETLRLYGPVSMLVRTATADAELGGVRVPKGTMTMMPVAILHRDADVWGADAGEF 5707
5708 DPLRFRGGVNKAAAHAGALLAFSLGQRSCIGQDFAMMEAKTTLAMILRRFAFEVSPEYVH 5887
5888 APLDLLTLQPKCGLPMVLKLLDQ* 5959

>AP003514.1 $F chromosome 6 clone P0698A06, start MET is missing
possible solution requires short 3 aa exon MAT (0) LVWR... 48% to 709B2
9625 MAT (0)
9530 LVWRPYAVARAFGRQRIRGPAYRLFVGSDGEANAMRAATRGDVLDRRCHDIVPRVMPHY 9354
9353 RAWMSRYGKVFVSWSGATPALCVGDYAMVKQILSDRTGLYGKPEPEPSILALIGNGL 9183
9182 IFIGGDDWARHRRVVHPAFAMDKLKMMAKTMADCAREVIREWEARATADGERRVQVEVGG 9003
9002 QLQELTADVISHTAFGSSYREGKEVFVAQRELQSIAFSAINSTRFPGSQYIPTKTNLRRR 8823
8822 HLAKKVRGTLMAIIRERQAAAAKEDSGHGNDLLGLMLEANATDACGSGGEKSMTMDEIV 8646
8645 DECKTFFFAGHDTTSHLLTWAMFLLGTHPEWQHRLREEVLRECGGGGDTEALPNGDALNK 8466
8465 LKL 8457 (0?)
8374 MTMVLYETLRLYGPVSLMARTATADAELGGVRVPKGTMTMIPMAILHRDADVWGAGAGEF 8195
8194 DPLRFRGGVNKAAAHAGALLAFSLGQRSCIGQDFAMMEAKTTLAMILRRFAFEVSPEYVH 8015
8014 APLDLLTLQPKCGLPMVLKLLDQ* 7943 

>AP002899 $F 52% to 72A14 = AQ161379
complement(join(51630..52055,52989..53367,53942..54186,
54546..54766,55513..55801))
MLGEAASPWSLAGAGAAVALLWLCAWTLQWAWWTPRRLERALRA
QGLRGTRYRLFIGDVAENGRLNREAASRPLPLGSHDVVPRVMPFFCNVLKEHGKLSFV
WTGPKPFVIIRDPDLAREILSNKSGNFAKQTTAGIAKFVVGGVVTYEGEKWAKHRRIL
NPAFHQEKIKRMLPVFLACCTKMITRWVNSMSSEGISELDVWDEFQNLTGDVISRTAF
GSSYQEGWRIFQLQEEQAKRVLKAFQRIFIPGYWYLPIENNRRIREIDQEIRTILRGI
IVKRDKAVRNGEGSNDDLLGLLVESNMRQSNEKEDVGMSIEDMIEECKLFYAAGSETT
SMLLTWTLILLSMHPEWQEQAREEVMHHFGRTTPDHDGLSRLKIVTMILHEVLRLYPP
VVFLQRTTHKEIELGGIKYPEGVNFTLPVLSIHHDPSIWGQDAIKFNPERFANGVSKA
TKFQTAFFSFAWGPRICLGQSFAILEAKMALATILQSFSFELSPSYTHAPHTVLTLQP
QYGSPIKLKKL

>AP003142.2 $P chromosome 1, PAC clone:P0435H01 probable pseudogene 
53% to 72A15 86% to AP002899
5354 YLPIENNRRIREIY*EIRKILRGLIVKGDKAIRNGENTNDDLLGLLVESNMRQSNERE 5181
5180 EVGMSIED 5157
1115 IIEECRLFYFAGSETTS 1065 frameshift
1065 MLLT*TLIMLSMHPEWQERAREEVMHHFRRTTPDHDGLSRLKIVHM 928

>AP003278 $P chromosome 1, PAC clone:P0518F01, similar to 72A22 missing N-term half
AP003330.1 chromosome 1 clone B1085F01 CYP72A like
Pseudogene, no N-term in 9000bp upstream until next p450 ends near 22400
31539 SLPIENNRKMHQINKEIESILRGLIGKRMQAMKEGESTKDDLLGILLESNTKHMEENGQSS 31718
31719 QGLTIKDIVEECKLFYFAGAETTSVLLTWTMLLLSIHPEWQDHAREEIMGLFRKNKPDYE 31898
31899 GLSRLKI
32030 VTMIFYEVLRLHPPFIEIGWKTYKEMEIGGVTYPAGVSIKIPVLFIHHDPDSWGSDVHEF 32209
32210 KPERFSEGISKASKDPGAFLPFGWGPRICIGQNFALLESKMALCLILQRLEFELAPSYTH 32389
32390 APHTMVTLHPMHGAQMKVRAI 32452 or frameshift after KVR to 
      SYMIISDYSVFYYYNSWL* 

>CYP72A24 $F AP002839 Oryza sativa genomic DNA, chromosome 1 109970-113787
AQ864347.1 nbeb0023A03r CUGI Rice BAC genomicLength = 735
MVVFAAGDERPLMLVWAAVAGAVLAWCAVRAMEWAWWRPRRLERALRAQGLRGTPYRSPA
GDAPLNVQLSAEARARTMPLGCHDVVPRAMPLFHQAMKEH (PHASE 1 INTRON)
GKVSITWFGPVPRVTITKPELVREVLSNKFGHFEKLKFGRFQRLLHNGLGSHEGEKWAKH
RRIINPAFHLEKLK(PHASE 0 INTRON)
RMLPAFAACCTELVDK
WEGLAKGGDEPYEVDVWPEMQSLTGDVISRAAFGSSYLEGKRIFQLQGEQIELIVATMNK
IHIPGYI (PHASE 2 INTRON) 
HLPTKSNRRMKQIAAEIEGMLKRIIAKRESALKAGEASSDDDLLGLLLESNLDHSKGNGGA
ASSGISIDDVIGECKLFYFAGMETTSVLLTWTMVVLSMHPEWQDRAREEVLHVFGSRAPD
YDGLSRLRI (PHASE 0 INTRON)
VTMVLYEVLRLYTPLTALQRKTYKPMELGGVRYPAGVVLTLPLLCVHHDKDVWGADADEF
RPERFAEGISKASREAPAFFPFGWGPRICIGQNFALLEAKMGLSMILQRFSFDLSPSYTH
APFPVGLLQPEHGAQVRLTRLN*

>CYP72A25 $F AP002839 Oryza sativa genomic DNA, chromosome 1 115472-117492
AG021553.1 strain NC0134 PCR from rice genomic DNA clone NC0134_0_102_1A.
Length = 636 AG021553
AG023207.1 strain NC2780 PCR from rice clone NC2780_0_701_1A.Length = 474
MEIVDGASPPLHPWSLLLYALGALAALWWAWRALDRFWLRPRRLGRALRSQGLRGTDYRFPSGYLKEFARLL
AAALAAPMPPLSHDVASRALPFELAAIKQH (PHASE 1 INTRON)
GNVCVTWFGPEVRVIVSDPKLFREILANKNGRFGKQKSILWVQNLLADGLTSHQGEKWVA
HRRIMNHAFHLEKLK (PHASE 0 INTRON)
VQRMLPAFAACSSELISRWQDSVGADGAQEIDVWPEFQNLTGDVISRSAFGSSFSEGRRI
FQLQSEQARNVMKMAKALYFPGYR (PHASE 2 INTRON
FLPTELNRRTKANAREVRELLKGIITKRESAMKDGHAVNDDLLGLLLETNIKESQEAGSS
KPTMTTKDIIEELKLLYFAGSDTTAVLLTWTMVLLSMHPEWQDRAREEVLRVFGKNSPDF
EGINHLKV (PHASE 0 INTRON)
VTMILHEVLRLYPPILLLGREAYEETELGGVTYPPGVTFALPIAGIHHDPDVWGEDV
GEFKPERFAEGVSRASKDSPALVPFSWGPRICVGQNFALLEAKMALSMILQRFSFGLSPS
YTHAPFPIPTLQPQHGAQIKLTKL*

>AC119289.1 $F (japonica cultivar-group) chromosome 5 clone
AQ916317.1 nbeb0063E04r CUGI Rice BAC genomicLength = 521 52% to 72A7
AQ871967.1 nbeb0045H22r CUGI Rice BAC genomicLength = 824
50424 MLIMLGLGLVPAGAAAALAVALVCLAAAAWWTVERAPRRLERALRAQGVGGGRY
      QLLLGGDVAENGRLNREAWSRPLPLGCHRIAPRVLPLLWNAVRDH (1) 50720
54713 GKLSFIWFGPVPRVMIPDPELVREVFNKFDQFGKPKMIRVGKLLATGVVSYEGEKWAK 54886
54887 HRRILNHAFHHEKIK (0) 54931
55175 RMLPVFANCCTEMVTRWENSISLEAASEIDVWPEFRNLTGDVISRTAFGSSYQEGRRIF 55351
55352 QLQEELAQYLTEALQKLFIPGYW (2?) 55420
56034 YRYLPTKNNRRMREIDREVRKILLEIIGNKERAITNGENSNDDMLGLLVESNTKQPELRM 56213
56214 STDDIIEECKLFYFAGMETTSVLLTWTLIVLSMHPEWQERAREEVLHHFGRTTTPDYDSL 56393
56394 SRLKI 56417 (0?)
56923 VTMILYEVLRLYPPVVLLNRRTFKETNLGGIKFPADMNLILPILFIHHDPEIWGKDASEF 57102
57103 NPGRFADGISNASKYHDASFFPFGWGPRICIGQSFALLEAKMALSMILQRFSLELSPSYI 57282
57283 HAPYIVLTLRPQHGAQIKLKRI* 57351

>BE041018.1 OF16D12 OF cDNA 5'Length = 816 70% to 709A1 (partial)
no indica ortholog cannot extend
GPPSADHLSKLTVLQMIIQETLRLYPPATLLPRMAFEDIQLGGLRLPRGLTVWIPVLAIHHDEFIWGPDAH

>AP004142.1 $F chromosome 8 clone OJ1567_B05 53% to 709A2
66952 MAMAAAVLVAIALPVSLALLL
66889 VAKAVWVTVSCYYLTPARIRRVLASQGVRGPPPRPLVGNLRDVSALVAESTAADMASLSH 66710
66709 DIVARLLPHYVLWSNTY 66659 (1)
66576 GRRFVYWYGSEPRVCVTEAGMVRELLSSRHAHVTGKSWLQRQGAKHFIGRGLLMANGATW 66397
66396 SHQRHVVAPAFMADRLK (0) 
      GRVGHMVECTRQTVR 66217
66216 ALRDAVARSGNEVEIGAHMARLAGDVIARTEFDTSYETGKRIFLLIEELQRLTARSSRYL 66037
66036 WVPGSQ 66019 (2)
65469 YFPSKYRREIKRLNGELERLLKESIDRSREIADEGRTPSASPCGRGLLGMLLAEMEKK 65296
65295 EAGGNGGGEVGYDAQMMIDECKTFFFAGHETSALLLTWAIMLLATHPAWQDKARAEVAAV 65116
65115 CGGGAPSPDSLPKLAVLQMVINETLRLYPPATLLPRMAFEDIELGGGALRVPSGASVWI 64939
64938 PVLAIHHDEGAWGRDAHEFRPDRFAPGRPRPPAGAFLPFAAGPRNCVGQAYAMVEAKVAL 64759
64758 AMLLSSFRFAISDEYRHAPVNVLTLRPRHGVPVRLLPLPPPRP* 64627 

>AC082644 $F 50% to 709B2 EST AU066048, AU032967 from this gene
CDS join(140516..140786,141950..142173,142311..142561,
142647..143025,143212..143643)
140516 MATLLLLAVAAAAAAWVWWGRYAWRARAVARRLAAQGVRGPRRGG 140650
140651 LLRGCNDEVRRRKAEAEADGVAMDVGDHDYLRRVVPHFVAWKELY 140785 (1?)
141949 GTPFLYWFGPQPRICVSDYNLVKQILSKKYGHFVKNDAHPAILSMIGKGLVLVEGADWVR 142128
142129 HRRVLTPAFTMDKLK 142173 (0?)
142311 VMTKTMASCAECLIQGWLDHASNSKSIEIEVEFSKQFQDLTADVICRTAFGSNSEKGKEV 142490
142491 FHAQKQLQAIAIATILNLQLPGFK 142559 (?)
142648 YLPTKRNRCKWKLENKLRNTLMQIIQSRITSEGNGYGDDLLGVMLNACFSTEQGEKRDEL 142827
142828 ILCVDEIIDECKTFFFAGHETTSHLLTWTMFLLSVYPEWQDRLREEVLRECRKENPNADM 143007
143008 LSKLKE 143025
143212 MTMVLLETLRLYPPVIFMFRKPITDMQLGRLHLPRGTAIVIPIPILHRDKEVWGD 143376
143377 DADEFNPLRFANGVTRAAKIPHAHLGFSIGPRSCIGQNFAMLEAKLVMAMILQKFSFALS 143556
143557 PKYVHAPADLITLQPKFGLPILLKALDA* 143643

>AQ865256.1 AQ795997.1 AQ871363 AQ913328 AQ272179 AZ130379 AQ159150 AU166330.1
(partial) ortholog to aaaa01013573.1 $FI 99% lower case
MEKMLALIVVLVILLSLALFYLCNILWLRAVKIRKKLRRQGIRGPKPTFLYGNTKEIKRI
RQELKFSQKQGTNNFISTLFPHFLLCRETYGMHML
GPVFLYSTGALEILQVSHPDMVKDIGRWTPSELGKPNYLKKSRKALFGGGLFTENGDEWA 399
YQRKIIAPEFFMDKIK
MIQLIEDATVTVLEAWEDMIDDVG 
42 aa gap gcreivvddylrnlsadviaracfgssftegeeifcklrqlq
YLPTKSNQEIQMLDEQVRLLILDVAKEQHHYQDSHNSLVNAIIDGAQDGRS 
AAEAEDFIVGNCKTIYFGGHE
47? aa gap stvvtaiwclmllathsewqerarseamevcrgrstldvdalrrlki 
vtmviqetlrlyppasvmmqealtdvklg
NIEVPRGTIVQVPRLMLHLDKEAWGADADEFRPDRFANGVAAACRAAHMYVPFGHGPRTC 
IGQNLAMAELKVVLARLLTKFAFSPSPRYRHSPAFRLTIEPGFGLPLMVTKLP* 

>AQ575202.1 AG023995.1 (partial) 
only 2 diffs to AAAA01006291.1 AAAA01013573.1 and AAAA01083300.1
they seem not to continue
118 MELFSSQQWLALLPPIILCILLFSYVYIILWLRPERLRQKLRSQGVRGPKPSFLFGNIPEMR 297
298 RIQQLAKSAHEQEAGSTDMFSSNYVATLFPYFLHWSRVY 414

>AP003258.2 $F genomic DNA, chr 1, PAC clone:P0463A02, complete 46% to 709B2
N-term runs off end of contig identical to AP003764.2 (has N-term)
       MLKSTIELYIFTTAIAKKSLHSQTKHKSKMDVPSVVIP
151039 ILVVLVSRLLTSALVHLLWKPYAITKLFRGQGITGPKYRLFVGSLPEIKRMKAAAAADEVA 150857
150856 AGAHSHDFIPIVLPQHSKWATDHG (1)
       KTFLYWLGAVPAVSLGRVEQVKQVLLERTGSFTKNYMNANLEA 150497
150496 LLGKGLILANGEDWERHRKVVHPAFNHDKLK 150404 (0)
149815 FMSVVMAESVESMVQRWQSQIQQAGNNQVELDLSRELSELTSDVITRSAFGSSHEEGKE 149639
149638 VYQAQKELQELAFSSSLDVPALVFLR 149561 (2)
149237 GNTRAHQLVKKSRTMLMEIIEGRLAKVEAAEAGYGSDLLGLMLEARALEREGNGLVLTTQE 149055
149054 IIDECKTFFFAGQDTTSNHLVWTMFLLSSNAQWQDKLREEVLTVCGDAIPTPDMANRLKL 148875 (0)
148359 VNMVLLESLRLYSPVVIIRRIAGSDIDLGNLKIPKGTVLSIPIAKIHRDRDVWGPDADEF 148180
148179 NPARFKNGVSRAASYPNALLSFSQGPRGCIGQTFAMLESQIAIAMILQRFEFRLSPSYVH 148000
147999 APMEAITLRPRFGLPVVLRNLQG* 147928

>AC082644.10 $F chr 3 BAC OSJNBa0013M12 genomic sequence, complete sequence
48% to 709B3 annotation in Genbank not full length 
ends corrected here, one in frame stop may be a seq error
EST C26232, D15749, AU091837 from this gene 
AQ795536 nbxb0056O10r
146763 MASLALVFLALLLPTISWLWDYIIVPHIWRPYIAAKRLREQGIRGPPY 146906
146907 KFLKGCNEDVKKMKDAADDLVLDVHDHNYLPRVMP*YLKWRSQY 147038 (1) 
147228 GRALCILVWGKA
147262 RICIFNYEWARQILSSKSGHFLKNDTTPTVLALLGKGLVLVEGIDWERHRRV 147417
147418 INPAFTMDKIK 147450 (0)
147771 MMTKTMVACAQNMVKELEDQASSNKNGETQVELDKQFQELTADIISHTAFGSS 147929
147930 YKLGIEAFHAQKELQEIAVKSLLNVQIPG 148016 (?)
148136 FSYLPTKGNWRKLTLEKKLRGTLMQIIQSRLSSKGSGYGSDLLGLMLEACIATDQGREQH 148315
148316 QLSLSIDEIIHECKTFFFAGHETTSLLLTWTVFLLSVYPEWQARLRLEALRECGKENPNG 148495
148496 DNLSKLKE 148519 (0?)
148671 MSMVFLETLRLYGPALFLQRKPLTDITVGETKIPKDHAIIIPSAIMHRDKEIWGDDTDE 148847
148848 FNPLRFQNGVTRAAKVPHALLAFSIGPRSCIGQNFAMLEAKSVMAMILKKFSFTLSPNYV 149027
149028 HAPVDLLTLQPKFGLPVVLRLLDA* 149102

>AC087220.7 $F chromosome 3 clone OSJNBb0097F01 similar to 709B2 
(this clone missing N-term 37 aa)
one in frame stop and one bad phase 0 boundary, might be a pseudogene
identical to AC082644 CDS join(154905..155190,158210..158433,158677..158712) partial
AQ363984.2 nbxb0060A22r
154905 MDSILLLPLVALLVVAISWLWDYTVVRLIWRPHCIAK 155015 this piece from AC082644.10
132837 EFREKQGIRGPAYKFLGGNNGEISRLKEEADGQVLDNLRDHNYLPRIAPHFLKWRAQY (1)
129645 GEAFLFWYGAKP RICIFDYELARQILSSKSGHFLKNDAPPTLVALMGKGLVLLEGTDWVR 129466
129465 HRRVINPAFNMDKLK 129421 (0)
129177 MMISTMTGCAQSLAKELEDVAAKNKDRVTEVDLNQKFR*LTADIIAHTAFGSS 129019
129018 YQLGKEAFQAQHELTEITMATLFQVQLPGLN 128935 (2)
128834 YLPTERNRRKWRLQKNLRDTLMQIIRSRLSSKDGGYGNDLLGLMLGACASDEQ 128667
128666 GEASSLSMDEIVDECKTFFLAGHETTSLLLTWTVFLLSVYPEWQERLRNEVLRECGTDQ 128490
128489 CPDANSLGKLKE 128457 (0) bad boundary 
127940 MTMVLLETLRLYNPALFIQRKPTADITVGSLAIPAGVAVYIPVPIMHRDREVWGHDAGEF 127761
127760 NPLRFRDGAARAAAAAGIPHALLSFSIGPRSCIGQGFAMLEAKAAMAAMLRRLSFRVSPG 127581
127580 YVHAPVDLITLKPKFGLPVIVRLPDA* 127500 

>AP005099.1 $F (japonica cultivar-group) chromosome 7 44% to 714A1
AQ258184.1 nbxb0019H10f
73606 MVVVVAAAMAAASLCCGVAAYLYYVLWLAPERLRAHLRRQG
73483 IGGPTPSFPYGNLADMRSHAAAAAGGKATGEGRQEGDIVHDYRQAVFPFYENWRKQY 73313 (1)
70996 GPVFTYSVGNMVFLHVSRPDIVRELSLCVSLDLGKSSYMKATHQPLF 70856
70855 GEGILKSNGNAWAHQRKLIAPEFFPDKVK 70769 (0)
70616 GMVDLMVDSAQVLVSSWEDRIDRSGGNALDLMIDDDIRAYSADVISRTCFGSSYVKGKQI 70437
70436 FDMIRELQKTVSTKKQNLLAEMTGLSFLFPKASGRAAWRLNGRVRALILDLVGENGEEDGG
      NLLSAMLRSARGGGGGGGEVAAAAEDFVVDNCKNIYFAGYESTA
      VTAAWCLMLLALHPEWQDRVRDEVQAACCGGGGRSPDFPALQKMKN (0) 69984
69906 LTMVIQETLRLYPAGAVVSRQALRELSLGGVRVPRGVNIYVPVSTLH 69766
69765 LDAELWGGGAGAAEFDPARFADARPPLHAYLPFGAGARTCLGQTFAMAELKVLLSLVLCR 69586
69585 FEVALSPEYVHSPAHKLIVEAEHGVRLVLKKVRSKCDWAGFD* 69460

>AQ856705.1 (partial) nbeb0004C17f CUGI Rice BAC genomicLength = 767 53% to 714A2
ortholog AAAA01009691.1 appears to be a pseudogene missing some seq
EFNPERFSNAQPQLHSYLPFGAGARTCLGQGFAMAELKTLI
SLIISKFVLKLSPNYEHSPTLKLIVEPEFGVDLSLTRVQGAYRH*

>AP004026.1 $F chromosome 2 clone OJ1136_C04, 52% to 715A1
AQ290362 nbxb0036J18f 55% to 715A1 N-term
AQ288588 56% to 715A1 C-term 94% to AP004026 may be same gene
AQ795581.1 nbxb0056A12f CUGI Rice BAC genomic cloneLength = 558
88341 MELVVQALAAAAALLAVFFLSTLYLSPAATARRLRNAGFRGPTPSFPLGNL
      REIASSLASNNDTDESNTKGGDIHAAVFPYFARWRRA
88605 FGKVFVYWLGTEPFLYVADPEFLKAATAGALGKLWGKPDVFRRDRMPMFGRGLVMAEGDE 88784
88785 WARHRHIIAPAFSATNLNDMIGVMEETTAKMLGEWGDMVASGRSCVDVEKGVVRNAAE 88958
88959 IIARASFGISADDDDATGARVFHKLQAMQAILFRSTRLVGVPLAGLLHIRAT 89114
89115 YEAWKLGREIDALLLDIIESRRRREGGGGGKKKKKTTSNDLLSLLLAGSEASAGAERK 89288
89289 LTTRELVDECKTFFFGGHETTALALSWTLLMLAAHPEWQAAVREEVVEVAGRSGPLDAAA 89468
89469 LGKLTKMGCVLSEVLRLYPPSPNVQRQALQDVVVVAGDGEKKVVIPKGTNMWIDVVAMH 89645
89646 RDGELWGEEASEFRPERFMREGVQGGCRHRMGYVPFGFGGRICVGRNLTAMELRVVLAM 89822
89823 VLRRFAVEVAPEYRHAPRIMLSLRPSHGIQLRLTPLC* 89936

>AP003727.3 $P chromosome 1 clone:P0672D08 Pseudogene fragment 
missing N and C-terminal and part of I-helix 39% to 715A1 
NRMPMFGRGRVMAEGDEWARHRCIVAPAFSATNLN
DMIGVMEETTSKMLGEWSDIVALGHSCIDIEKGVVRNAAEIIAKASFSIAADDATVFHK (frameshift)
VRLVSVPLASLLHIRADRATYEAWKLGRKIDALLLDIIESRRRCEGGGRK
TTTTDLLWLLLAGNEASAAAERKLTTALALSWTLLMLATHPEWRAAVRE this is missing from AP004123
EVEEVTGWSGPMDAAAMGKLTKMGCMLNEVLRLYPPSPNVQRPAACDAEV

>AP004123.1 $P chromosome 2 clone OJ1654_A02, related to AP004026.1 dashes 
indicate deletions, missing N and C-terminal sequences
GAFGKL*GKPAVFRHDRMP (MWP) insertion
MFGRGRIMAEGDEWARHRCIIAPAFSATNLN
DTIGVMEETTSKMLGEWSDMVALGHSCIDIEKGVVRNAAEIIAKANFSIAAA-DAT---VFHK (frameshift)
LQAM---LFHST (frameshift)
RLVGVPLASLLHIRATYEAWKVGRKIDALLLDIIDSRRRREGGG
MK deletion of I-helix region 49aa
EVEEVTGWSGPMDAAVMEKLTKM
GCMLNEVLRLYPPSPNVQRPAACDAEV

>AP004365.1 $P chromosome 1 clone P0458E05, Length = 194415
93% to AP004026.1 $F chromosome 2 715 like
184935 ETTSKMLGEWSDMVVSG*SCIDIEKGVVWNAAEIIAKASFGIGTDDDATGAGVFHKLQAMQAMLFRSTRLVGVP 184714 frameshift and deletion
       RANYEAWKLGREIDALLLDIIESRRHXXXXXXXX frameshift
       TNDLL*LLLGGNKASAGA frameshift
184557 ERKLTTRELVDECKTFFFXXXXXXXXX missing I-helix region 184504
184108 LSWTLLMLAAHPEWQAAVREEVVEAAGRSGLLDAAALGKLTKMGCVLNEVLRLYQPSPNVQRQALQDVVV 183902

>AP003822.1 $F chromosome 7 clone OJ1316_A04, 68% to 72B
52729 MWSSWAWTWSWSGAAAVAVAAAAAWVAVYAAAARAAEAL
52846 WWRPRRVERHFAAQGVRGPGYRFFVGSSIELVRLMVDAASRPMEPPTSHDILPRVLPFYH 53025
53026 HWRKLY 53043 (1)
55138 GPMHLIWFGRTPRLVVSEPELIREVLLTRADHFDRYEAHPMICQFEGYGLSNLHGERWA 55314
55315 RRRRVLTPAFHTENLRMIAPFVAGTVTRMLDELAERARAGGAGEAEVDVAEWFQRVPQEA 55494
55495 ITFAAFGRRNYDDGAAVFRLQDELAGYATEAHSKVYIPGYRFLPTRKNRRVWQLDREIRS 55674
55675 HLAKFVTGLQSCSSSHGDDADDGGDGGGGMREFMSFMAPAMTAGEIIEESKNF 55833
55834 FFAGKETLSNLLTWTTVALAMHPEWQERARREVVAVCGRGDLPTKDHLPKLKTV 55995
56559 LGMILNETLRLYPPAVAMIRTAKEDVELGGCVVPAGTEVMIPIMAVHHDAAAWGDDAAEF 56738
56739 NPARFAADDDGGRRRHPMAFMPFGGGARVCIGQNMALMEAKVALAVVLRRFEFRLS 56906
56907 PAYVHAPRVLMILSPQFGAPVIFRPLTSAAA* 57002

>AP003612.1 $F chromosome 6 clone P0457B11, 62% to 72B
109843 MMEAVAVAAAVLLLLHVAARVADAVWWRPRRLEAHFAGQGVRGPPYRFLVGCVR 109682
109681 EMVALMAEATAKPMPPAAPHNALPRVLAFYHYWRKIY 109571 (1)
109284 GPTFLIWFGPTPRLTVAEPEMVREIFLTRAEAFDRYEAHPVVRQLEGDGLVSLHGDKW 109111
109110 AHHRRVLTPGFYPDNLN (0) 
108953 RLVPHVGRSVAALAERWRAMACAGGGEVEVDVAEWFQAVAEEAITRATFGRSYDSGRV 108780
108779 VFRLQARLMAFASEAFRKVLVPGYR 108705 (2)
108606 FLPTKKNRMSWGLDREIRRGLVRLIGRRSGGDGGEEDETTTELKDKQDSGFNDLLGLMI 108430
108429 NAGVDRTMPVEDMVEECKTFFFAGKQTTTNLLTWATVLLAMHPDWQDRARREVLAVCGDA 108250
108249 AGELPTKDHLPKLKT (0) 
       LGMILNETLRL 108070
108069 YPPAVATIRRAKFDVTLGGGGDGDAGGIHIPRDTELLVPIMAIHHDARLWGPDAAQFNPA 107890
107889 RFASGAARAAKHPLAFIPFGLGSRMCIGQSLAILEAKLTMAVLLQRFDLALSPTYVHAPT 107710
107709 VLMLLHPQYGAPLIFRPRQSQPSN* 107635


CYP74 clan contains only one family CYP74 (3 sequences)

>AC099043.1 $F chromosome 3 clone OSJNBa0079B15, Length = 155583
63% to AP004181.1 chromosome 2 56% to 74A Arab
AU031582 AU031583
62290 MATAAACISFASPSPARVVIR
62227 RQTRASASASATDRQEVVSPKRRLPLRKVPGDYGPPVVGAIRDRYEYFYGPGGRDGFFAA 62048
62047 RVRAHRSTVVRLNMPPGPFVARDPRVVALLDAASFPVLFDTSLVDKTDLFTGTFMPSTD 61871
61870 LTGGYRVLSYLDPSEPNHAPLKTLLFYLLSHRRQQVIPKFREVYGDLFGLMENDLARVGK 61691
61690 ADFGVHNDAAAFGFLCQGLLGRDPAKSALGRDGPKLITKWVLFQLSPLLSLGLPTLVED 61514
61513 TLLHSLRLPPALVKKDYDRLADFFRDAAKAVVDEGERLGIAREEAVHNILFALCFNSF 61340
61339 GGMKILFPTLVKWLGRAGARVHGRLATEVRGAVRDNGGEVTMKALAEMPLVKSAVYEAL 61163
61162 RIEPPVAMQYGRAKRDMVVESHDYGYEVREGEMLFGYQPMATKDPRVFARPEEYVPDRFL 60983
60982 GEDGARLLRHVVWSNGPETAAPTLHDKQCAGKDFVVLVARLLLVELFLRYDSFDVEV 60812
60811 GTSTLGSSVTVTSLKKATF* 60752

>AY055775 $F allene oxide synthase (AOS) mRNA, complete cds. 
Ha,S.-B., Lee,B.-C. and Back,K.
Molecular characterization and expression of rice allene oxide synthase
Unpublished
AQ686628.1 nbxb0072F10r CUGI Rice BAC genomic cloneLength = 445 62% to 74A
AU101779.1 Rice green shoot cDNA clone S13809.Length = 644
EXXR to end also BI809245.1, BI811033.1, AU182268.1
C72393, D47977, C99549 52-172 REGION opp end = C99548 (probably 3 prime 
untranslated)
MELGVPLPRRPVPGSYGVPFVSAVRDRLDFYYLQGQDKYFESRA
ERYGSTVVRINVPPGPFMARDPRVVALLDAKSFPVLFDVAKVEKRDVFTGTFMPSTSL
TGGYRVCAYLDPSEPNHAKIKQLLLSLLVSRKDAFVPVFRSNFGALLDTVQSQLASGG
GKSDFTALNDATSFEFIGKAYFGVRPSASSSLGTGGLDQGRLWLLWQLAPLTTLGLPM
IIEDPLLHTLPLPPFLISSDYKALYAYFAAAASQALDAAEGLGLSREEACHNLLFATV
FNSYGGFKLLLPQILSRVAQAGEKLHERLAAEIRSAVADAGGNVTLAALEKMELTRSV
VWEALRLDPPVRFQYGRAKADLEIESHDASFAIKKGEMLFGYQPCATRDPRVFGATAR
EFVGDRFVGEEGRKLLQYVYWSNGRETENPSVDNKQCPGKNLVVLVGRLLLVELFLRY
DTFTAEAGKKVVITGVTKASTSAVNRTA

>AC107226.1 $F chromosome 3 clone OSJNBa0081P02, Length = 149791 
74 like 98% to AY055775 allene oxide synthase (AOS) probably same gene
N-term matches C72393, D47977, C99549 95%
Same as AC107207.1 chromosome 3 clone OSJNBb0106M04
108155 MELGVPLPR
108182 RPVPGSYGVPFVSAVRDRLDFYYLQGQDKYFESRAERYGSTVVRINVPPGPFMARDPRV 108358
108359 VALLDAKSFPVLFDVAKVEKRDVFTGTFMPSTSLTGGYRVCAYLDPSEPNHAKIKQLLLS 108538
108539 LLVSRKDAFVPVFRSNFGALLDTVESQLASGGGKSDFTALNDATSFEFIGEAYFGVRPSA 108718
108719 SSSLGTGGPTKAALWLLWQLAPLTTLGLPMIIEDPLLHTLPLPPFLISSDYKALYAYFA 108895
108896 AAASQALDAAEGLGLSREEACHNLLFATVFNSYGGFKLLLPQILSRVAQAGEKLHERL 109069
109070 AAEIRSAVADAGGNVTLAALEKMELTRSVVWEALRLDPPVRFQYGRAKADLEIESHDAS 109246
109247 FAIKKGEMLFGYQPCATRDPRVFGATAREFVGDRFVGEEGRKLLQYVYWSNGRETEN 109417
109418 PSVDNKQCPGKNLVVLVGRLLLVELFLRYDTFTAEAGKKVVITGVTKASTSAVNRTA* 109591

>AP004181.1 $F chromosome 2 clone OJ1136_G09, AU184424.1 45% to 74A of Arabidopsis
6988 MAPPRANSGDGNDGAVGGQSKLSPSGLLIREIPGGYGVPFLSPLRDRLDYYYFQGADEFFRS 7173
7174 RVARHGGATVLRVNMPPGPFLAGDPRVVALLDARSFRVLLDDSMVDKADTLDGTFMPSLA 7353
7354 LFGGHRPLAFLDAADPRHAKIKRVVMSLAAARMHHVAPAFRAAFAAMFDEVDAGLVAGGP 7533
7534 VEFNKLNMRYMLDFTCAALFGGAPPSKAMGDAAVTKAVKWLIFQLHPLASKVVKPWPLED 7713
7714 LLLHTFRLPPFLVRREYGEITAYFAAAAAAILDDAEKNHPGIPRDELLHNLVFVAVFNAY 7893
7894 GGFKIFLPHIVKWLARAGPELHAKLASEVRAAAPAGGGEITISAVEKEMPLVKSVVWEAL 8073
8074 RMNPPVEFQYGRARRDMVVESHDAAYEVRKGELLFGYQPLATRDEKVFDRAGEFVPDRFV 8253
8254 SGAGSAARPLLEHVVWSNGPETGTPSEGNKQCPGKDMVVAVGRLMVAGLFRRYDTFAADV 8433
8434 EELPLEPVVTFTSLTRAADGDGAARRGV* 8520 

CYP85 clan contains 12 families 85, 87, 88, 90, 702, 707, 708, 716, 718, 720, 
722, 724 (29 sequences)

>AQ157412 (partial) nbxb0009M10r AQ577511 nbxb0091K23r C72168 C73926 opp end C99700 
(probably 3 prime untranslated) 46% to 87A2
AU101385.1 Rice panicle cDNA clone E20933.Length = 449
C72007 AQ445750 nbxb0054D22r AU165390 48% to 87A2 45% to 708A1 48% to 90C1 379-464 
region  9/97 85 CLAN
C99624 71% identical to C72122 51% to 87A2 91% to AQ863358.1
AU093892.1 Rice panicle cDNA clone E1139.Length = 706 opposite end = C72168
Very similar to AL607001.1 chromosome 4 clone OSJNBA0088I22, gene 2 105584-109445 
ortholog to aaaa01004251.1 $FI 7 diffs
MEKSEIFVGAHSYAAICAFTLIIGWLAHWVYRWINPPCNGRLPPGSMGFPIVGETFQFFR
TSPSIDMPIYYKRRLER
YGPIFKTNIGGQHVVISLDPEVNQFIFQQEGKLFQSWFPETTLNIFGKKTLTTYNRTAH
KLIRSFVCKLYGPENVKKSLLPELENSMR 3
(gap) 
ANSGFTWEEYKSLK
FSTQVVNEITRISNVAPGVFRKALTDVQVNGYTIPSGWLVMISPMGVHLNPKLFEDPLKFDPWRWTEEKRISMQRNFMPFGGGIRMC 
PAAEFNKLFITLFLHIVVTEYRWKDIDGGNVKRISEVLVAQEYHIQLVPQT* 

>AQ576451 (partial) nbxb0089M21r 80% to C72007 67% to 702A8 and 87A2 84% to AL607001 gene 2
ortholog to aaaa01000436.1 both are partials cannot extend
KEHPTVPEQTSQSKFGVPWEEYNPLMFTSQ (0)
VVHEFTRISNVAPGIFRKTLADVKVK (1)
GYTIPAGWLVMISPMAVDLNPTLFEDPLEFNPWRWT 203 (0)
DKTKQSELLRNYMPFGGGIRLCLGAEF

>AL607001.1a $F chromosome 4 clone OSJNBA0088I22, gene 1 92976-97891
48% to AC083944.6 57% to 87A2
92976 MYAYVGLVGGAAALVLLLLLRHQARRWRNPRCGGQLPPGSMGLPLVGETFQFFSSDASLDIPPFIRHRLAR  93188 
(2)
93267 YGPIFKTSLVGHPVVVSADEELNHMVFQQEGQLFQSW 93377
93378 YPDSFVEILGKDNVGEQQGAMFRYLKNMVLRYFGPESLKEGIIRDVERAVSSSLCTWSTL 93557
93558 PAVELKEAVST 93590 (0)
93686 MVFDLAASKLLGLEPSRSKILRKSFFDFVRGLISFPLYLPGTAYYSCMQ 93832 (0)
93922 GRRRAMVVLEQVLEERKQSTGLQRGGEAQQHGDFLDYVIQEITKEKPVMTEKMALDLMF 94098
94099 VLLFASFHTTSLALTLAVKLLADHPLVLEELT 94194 (0)
94575 VINETVRLANIAPVIFRKALKDIRFN 94652 (1)
97033 GYTIPAGWGVMVCPPAVHLNPYIYPDPLTFIPSRFK 97140 (0)
97486 DKPEINRGSKHFMAFGGGLRFCVGADFSKLQLAIFLHFLVTKYR 97617 (2)
97802 WIPLGASRVVRTPGLEFPDGYRIKVIQRH* 97891

>AL607001.1b $F chromosome 4 clone OSJNBA0088I22, gene 2 105584-109445 
= C25461, C72122, AU075642 clone E1025_6Z, AQ159117, AQ863358, AQ795088.1  
47% to 87A2 78% to AQ157412
105584 MEKSELLLGSYSYAALCGVTLIIGWLAHWVYKWMNPPCIGRLPPGSMGFPIIGETFQFFRASPSIDMPSYYKQRLER 
(2?) 105814
105556 YGPLFKTSLVGRPVIISLDPEVNRFIFQQEGKLFQSWYPETAINIFGKKSLTTYNGTIHKFIRGVAAK 106159
106160 LFGLENLKESLLPELENSMRESFASWTGKPSVEVQDGVSD 106279 (0)
106493 MIFDLVAKKLIGLDVTNSRELRKNFQDFFQGMVSFPIYFPGTSFYRSMQ (0)
       GRRNVRNTLTDIMKERLSAPGKKYGDLVDLIVEELQSEKPM 108258
108259 IDENFAIDALAALLFTSFATLSSTLTVAFKYLTDNPKVVEELK (0)
       EEHGTILKKREGVNSGFTWEEYRSLKFSTQ (0)
       VMNEITRISNVTPGVFRKTLTDVQVK 108747 (1?)
108863 GYTIPSGWLVMISPMAVHLNPKLFEDPLKFDPWRWR (0)
       EEKRSSMLKNYMPFGGGVRLCLGAEFSKLFIALFLHILVTEYR 109231 (2)
109356 WTEIEGGEVLRISEIMFPQGYHIQLVPQT* 109445

>AL607001.1c $F chromosome 4 clone OSJNBA0088I22, gene 3 136970-140694 61% to 87A2
136970 MAYIALLCAALAAVVALLRWAYRWSHPRSNGRLPPGSLGLPVIGETLQFFAPNPTCDLSPFVKERIKR 137173 
(2?)
137287 YGSIFKTSVVGRPVVVSADPEMNYYVFQQEGKLFESWYPDTFTEIFGRDNVGSLHGFM 137460
137461 YKYLKTLVLRLYGQENLKSVLLAETDAACRGSLASWASQPSVELKEGIST (0) 
       MIFDLTAKKLIGYDPSKPSQVNLRKNFGAFICG 137820
137821 LISFPLNIPGTAYHECME 137874 (0)
138176 GRKNAMKVLRGMMKERMAEPERPCEDFFDHVIQELRREKPLLTETIALDLMFVLLFASF 138352
138353 ETTALALTIGVKLLTENPKVVDALX 138424 (0)
138574 EEHEAIIRNRKDPNSGVTWAEYKSMTFTSQ (0)
139123 IMEIVRLANIVPGIFRKALQDVEIK 139197 (1?)
139305 XYTIPAGWGIMVCPPAVHLNPEIYEDPLAFNPWRWQ 139421 (0)
140396 XXEITGGTKHFMAFGGGLRFCVGTDLSKVLMATFIHSLVTKYR 140518 (2)
140605 WRTVKGGNIVRTPGLSFPDGFHIQLFPKN* 140694

>AC090482.3 $F 45% to 87A2 in the CYP85 clan 2 diffs with AQ868994
       MAVLATACWLVFVKRNTSRPSG (frameshift?) 
135052 RLPPGSRGFPIIGETLEFLTESPANQLPAFFKRRLDR 134942 
116833 RYGPIFKTNMIMEDLIVSLDPEVSNFVLQQEEKLFQIWCPGSFMRITGAESIITTHALLHR 116651
116650 HIRNLVLRIFGPGNLRQEIIQEMHKTAEASLSSWLNHPSIELKEAISS 116519 (0?)
113034 MIFSVTAKKLISYDSSTSDGKMWKHYDAFAQGLITLPLCIPGTAFYKCMQ 112876 (0?)
98384  GRKNVMKMLKEILNERKKIEGRHESIDFLDVLIEEVKEDNPSMTENTALNLLFSL 98211
98210  LFGSFDTTSSGITGMLKFLTDNPEALRELT (0?)
97814  EHNNIRRRRADLNSEITWEEYKSMKFTSH 97704 
97599  VIHEALRLASITPMMFREAIEDVHIK 97519 (1)
96537  GFAIPKESKIMICPSTVHLNPVVYEDPI 96457 frameshift
       TFNPLRWK (0)
95926  DIAEPAGGASKDFMPFGGGLRLCVGAHFAKLQMAVFVHCL 95831
95446  LSTLVSWKAIKGGRMVLSPGLRFLEGFHIQLFPKD* 95339

>AC083944.6 $F clone OSJNBa0047G15 three bad AG boundaries 47% to 87A2
AQ689865.1 AQ868994.1 very similar 70-75% to AC090482.3
AQ868994.1 nbeb0032J22r BAC genomic clone nbeb0032J22r.Len = 762 80% to 708A1
AQ689865.1 nbxb0080B14f CUGI Rice BAC genomic cloneLength = 792
AQ272902 405-441 region PERF 54% to 87A2 and 90A1 85 CLAN AQ689865* nbxb0080B14f
40470 MAVLATACWLVIRLVFVKGNTDRRPCEADGMRMVPGSRGLPIIGETLEFLALSPS 40634
40635 LQIPAFFQRRFD 40670 (2) bad AG boundary
43217 YGPLFKTNMIMEDLIVSLDPEVTNFVFQQEERLFQMWYPDSFIRIIGTDSLIRTHGLLHR 43396
43397 HIKNLALRLFGPENLRREIIHDVQKTVEASLSSWLNQPSIELKEAVSS 43540 (0)
44000 MIFKVTAKKLISYDSSTSDGKMWKQYDAFTQGLFAFPLCIPGTAFYKCMQ 44149 (0)
46292 GRKNVIKMLKEILDERKNTEEHHESVDFFDVIEEVKAKNPDQMTENAVLDLLFLLLF 46462
46463 ASFETTSSGITAMLIFLTDNPEALQELI 46546 (0)
47088 EHNNIRKRKADKNSEITWEEYKSMKFTSH 47174 (0)
47314 VIHEALRLASIAPVMFREAIEDVHIK 47391 (1)
47742 GFAIPKGSKIMICPYTVHLNPVVYKDPNTFNPWRWK 47849 (0) bad AG boundary
48552 RNIPEPASGASKDFMAFGGGLRLCVGAHFAKLQMAVFLHCLVTS 48683 (0) bad AG boundary
48972 LSTIVSWKAIKGGRMALSPGLRFPEGFHIQLFPKA* 49079

>AC092778.2 $F chromosome 3 clone OSJNBa0015G17, 59% to 85A2
AQ271015 nbxb0015G17f 173-222 56% to tomato CYP85
AQ159479 nbxb0014E19f 75% identical to N-term of 85 49-71
AQ795731.1 nbxb0057K13r Rice BAC Length = 728 76% to tomato CYP85 
C97147 38% identical to C73729 71% to 85 324-401 region
AU100843.1 Rice callus cDNA clone C52418.Length = 710
No overlap seen between N and C-terminal parts of this gene so it might be chimeric
arkiiv from tomato 85A1 to fill in missing sequence 
105850 MVLVAIGVVVAAAVVVSSLLLRWNEVRYSRKRGLPPGTMGWPLFGETTEFLKQGPSFMKA 106029
106030 RRLRY (1)
       GSVFRTHILGCPTVVCMEAELNRRALASEGRGFVPGYPQS 106945
106946 MLDILGRNNIAAVQGPLHRAMRGAMLSLVRPAMIRSSLLPKIDAFMRSHLAAWSSSSSSA 107125
107126 VVDIQAKTKE (0)
       MALLSALRQIAGVSAGPLSDALKAELYTLVLGTISLPI 108109
108110 NLPGTNYYQGFK 108145 (0) contig ends 108418 rest of gene missing
       arkiiv (from tomato 85A1 to fill in missing sequence) 
       AMLEQMIAERRSSGQVHDDMLDALLTGVEGTREKLTDEQIIDLIITLIYSGYETMSTTSMM
       AVKYLSDHPKALEQLR (?)
       KEHFDIRKGKAPEDAIDWNDFKSMTFTRA (?)
       VIFETLRLATVVNGLLRKTTQDVEMNG (?)
       YVIPKGWRIYGYTREINYDPFLYPDPMTFNPWRWLEKNMES
       HPHFMLFGGGSRMCPGKEVGTVEIATFLHYFVTQYRWEEEGNNTILKFPRVEAPNGLHIRVQDY* 

>D48181 (partial) 45% to 708A1 and 702A2  3/95  8/95 28-105 REGION 85 CLAN 44% to 87A2
AI978308.1 RZ83.F EcoRI Rice cDNA clone RZ83.Length = 436
This N-terminal of gene shown below as AQ446313
AQ446313.2 nbxb0069A16r CUGI Rice BAC genomic cloneLength = 537 
BI805182.1 AI978309.1 AU173983.1 BI805304.1 71% to AC090482.3 61% to 87A2
BM037780 cannot extend
This is C-terminal of gene shown above as D48181
Ortholog to aaaa01003340.1 $FI 1 diff
MAVLASWLIRLVSLKWNSSHPCKADEGSRLPPGSRGLPLLGESLEFFTSSTSLELPVFFK
RRLNRYGPVFRTNMVGQDLIVCLDPEVNSFVFQQEERLFQCWYP 
(gap)
GFFYPKGSKIMICPSTVHLNPMIYKDPNIFNPWRWKDTAEPTGGASKDFMAFGGGLRLCVGX (fs)
DFAKLQTAIFLHCLVTKYRWKAIKGGTMVLGPGLRFPEGFHIQLFPKP*

>AP000616.1 $F chr 6, clone:P0514G12 Length = 138739 = AP002805
equivalent to CYP88A dwarf3 in Zea mays 55% to 88A4
also on AP002805.1 genomic DNA, chr 6, BAC clone:OSJNBa0004I20
AG025156.1 strain NC2581 PCR from rice genomic clone T5558T.Length = 536
85% to AQ577548 may be same seq with a few errors
AQ577548 nbxb0091A04r heme 76% identical to CYP88 of Zea mays 85% to AP000616 exon 4, 5
AQ157673.1 nbxb0009L10f CUGI Rice BAC Library Length = 506
125060 MEGMGMAAAWAAGDLWVLAAAVVAGVVLVDAVVRRAHDWVR 125182
125183 VAALGAERRSRLPPGEMGWPMVGSMWAFLRAFKSGNPDAFIASFIR 125320
127992 RFGRTGVYRTFMFSSPTILAVTPEACKQVLMDDEGFVTGWPKATVTLIGPKSFVNMSYDD 128171
128172 HRRIRKLTAAPINGFDALTTYLSFIDQTVVASLRRWSSPESGQVEFLTELRRMTFKIIVQ 128351
128352 IFMSGADDATMEALERSYTDLNYGMRAMAINLPGFAYYRALRARRKLVSVLQGVLDGRRA 128531
128532 AAAKGFKRSGAMDMMDRLIEAEDERGRRLADDEIVDVLIMYLNAGHESSGHITMWATVFL 128711
128712 QENPDIFARAKAEQEEIMRSIPAT 128891
128892 QNGLTLRDFKKMHFLSQ 128942
129448 VVDETLRCVNISFVSFRQATRDIFVN 129525
129704 GYLIPKGWKVQLWYRSVHMDDQVYPDPKMFNPSRWE 129811
129937 GPPPKAGTFLPFGLGARLCPGNDLAKLEISVFLHHFLLGYK 130059
130322 LKRANPKCRVRYLPHPRPVDNCLATITKV 130408

>AC079935.3 $F (japonica cultivar-group) cultivar 35% to 88A3 may be new family
155791 MMSGMQTPVVAVGATDWAWRLGALVAGAVPLLALAV
       WHCTDAFHRAAFAFRRRGTRRARRLPPGHMGLPFV 155579
155578 GETLALLWYFNLARRPDAFIQAKRRRYCYGDGDDDAGIYRTHLFGSPAVLVCSPASNGFV 155399
155398 FRSAPPGSFGVGWPDPELVGVSSIVNVHGGRHARLRRFVLGAINSPNSLRSFAEVVQPRV 155219
155218 AAALRSWAAKGTITAATEIKK (0) 
       VTFENICRMFVSMEPSPATAKIDEWFAGLVAGFRALQLDIPGTAFHHARK (0)
       CRRKLNSVFREEVKRRKLKAKLEEHDDVMSGLMRMEDEQGRRLG
154678 DDEVVDNIVSLVLGGYKSTSSAIMWATYYLAKLSAVLAKLR 154556 (0)
148738 EENLAIAKEKNGASFITLDDISKMKYTAK 148652 (0)
148565 VVEETIRLANISPMLYRVALRDVEYR 148488 (1)
147994 GYTIPEGWKVIVWIRSLHVDPKYYDDPLSFNPDRWD 147887 (0)
146817 KAAKPGTYQVFGGGERICAGNMLARLQLTIMLHHLSCGYK 146698 (2)
146576 WELLNPDAGVVYLPHPRPTDGAAMSFSKL* 146499

>C73729 (partial) 39% to 88A1 43% to 707A1 44% to 90A1  9/97 304-406 REGION 85 CLAN
83% to AC079935.2 may be same seq with frameshifts and errors
STASAIMWATYHLAKSPSALAKLREENLAIAKEKNGDGFITLEDVSKMKYTAKVVEDTIRLANIAPMAH
RVALRDVEYRGYTIPKGWKVIVWFRSLHVGPXYYDXPL

>AP004129.1 $F chromosome 2 clone OJ1007_A03 71% to 707A1
same as AP004052.1 97413-99348
AU077439.1 Rice callus cDNA clone C12940_8A.Length = 432 77% to 707A1
AU057960 84% to 707A1 heme to end AU057961 3 prime untranslated region
76179 MGAFLLFVCVLAPFLLVCAVRGRRRQAGSSEAAACGLPLPPGSMGWPYVGETFQLYSSKNPNVFF 76373
76374 NKKRNKYGPIFKTHILGCPCVMVSSPEAARFVLVTQAHLFKPTFPASKERMLGPQAIFFQ 76553
76554 QGDYHAHLRRIVSRAFSPESIRASVPAIEAIALRSLHSWDGQFVNTFQEMKT 76709 (0)
76814 YALNVALLSIFGEEEMRYIEELKQCYLTLEKGYNSMPVNLPGTLFHKAMKARKRLGA 76984
76985 IVAHIISARRERQRGNDLLGSFVDGREALTDAQIADNVIGVIFAARDTTASVLTWMVKF 77161
77162 LGDHPAVLKAVT 77197 (0)
77278 EEQLQIAKEKEASGEPLSWADTRRMKMTSRVIQETMRVASILSFTFREAVED 77433
77434 VEYQ 77445 (1)
77627 GYLIPKGWKVLPLFRNIHHNPDHFPCPEKFDPSRFE (0) 77734
77890 VAPKPNTFMPFGNGTHSCPGNELAKLEMLVLFHHLATKYR 78009
78010 WSTSKSESGVQFGPFALPLNGLPMSFTRKNTEQE* 78114

>AP004162.1 $F chromosome 8 clone OJ1320_D12, = AY023201.1 searched with 707A3 
very similar to AP004129 same as AP004457.1 chromosome 8
AQ365446.2 nbxb0063F16f CUGI Rice BAC genomic cloneLength = 800 68% to 707A4
AQ365217.2 nbxb0063K16f CUGI Rice BAC genomic cloneLength = 767 53% to 707A4
35847 MAFLLFFVFVTAAVLCFVVPAFLLLCTSVQRRRDVGQGGGRDWQKKK
35706 KLRLPPGSMGWPYVGETLQLYSQDPNVFFASKQKR 35602 (2)
35464 YGEIFKTNLLGCPCVMLASPEAARFVLVSQARLFKPTYPPSKERMIGPSALFFHQGEYH 35288
35287 LRLRRLVQAALAPDSLRALVPDVDAAVAATLAAWSGGHVASTFHAMKK 35144 (0)
34944 LSFDVGVVTIFGGRLGRRHREELRTNYSVVERGYNCFPNRFPGTLYHKAIQ 34792 (0)
34706 ARKRLRAILSEIVAERRARGGGGGGGGDDLLGGLMRSRDDGTAGAVALLTDDQIADNV 34533
34532 VGVLFAAQDTTASVLTWILKYLHDSPKLLEAVK 34434 (0)
      AEQMAIYVANEGGKRPLTWTQTRSMTLTHQ (0)
33645 VILESLRMASIISFTFREAVADVEYK 33568 (1)
32047 GFLIPKGWKVMPLFRNIHHNPDYFQDPQKFDPSRFK 31940 (0)
31338 VAPRPSTFLPFGSGVHACPGNELAKLEMLVLVHRLVTAYR 31219 (2)
31105 WEIVGASDEVEYSPFPVPRGGLNAKLWKQEAEEDMYMAMGTITAAGA* 30962

>AL606442.1 $F chromosome 4 clone OSJNBa0068L06 43% to 93D1
AU070399 C-helix cDNA clone S14304_12A, 43% to 93A1 43% to 93D1
AA754485      51% TO 76C3    1/98 K-HELIX 97SN1835 46% to 81F4
AU066130 AU066198 57% to 93B2 57% to 81D4
AU096562.1 Rice green shoot cDNA clone S14304.Length = 365 C-term 47% to 705A23
122031 MASLMEVQVPLLGMGTTMGALALALVVVVVVHVAVNAFGRRR
122157 LPPSPASLPVIGHLHLLRPPVHRTFHELAARLGPLMHVRLGSTHCVVASSAEVAAELIR 122333
122334 SHEAKISERPLTAVARQFAYESAGFAFAPYSPHWRFMKRLCMSELLGPRTVEQLRPVRRA 122513
122514 GLVSLLRHVLSQPEAEAVDLTRELIRMSNTSIIRMAASTVPSSVTEEAQELVKVVAELV 122690
122691 GAFNADDYIALCRGWDLQGLGRRAADVHKRFDALLEEMIRHKEEARMRKKTDTDV 122855
122856 GSKDLLDILLDKAEDGAAEVKLTRDNIKAFII 122951 (0)
123050 DVVTAGSDTSAAMVEWMVAELMNHPEALRKVREEIEAVVGRDRIAGEGDLPRLPYLQAAYKETLRL 123247
123248 RPAAPIAHRQSTEEIQIRGFRVPAQTAVFINVWAIGRDPAYWEEPLEFRPERFLAGGGGE 123427
123428 GVEPRGQHFQFMPFGSGRRGCPGMGLALQSVPAVVAALLQCFDWQCMDNKLIDMEEADG 123604
123605 LVCARKHRLLLHAHPRLHPFPPLL* 123679

>AL607097.1 $F chromosome X clone OSJNBa0082A03 = AY021309.1 39% to 93D1
118528 MEEGVVGGGGAAVLVALLVTVVLAVMRSAGSRSSKRGRLPPSPMALPIIGHLHLIRPPPHRAFDRILARH 118737
118738 GPLVYLRLGPSTHCVVIGSADVARDFLKFEASIPERPPTAVTRQLAYGKAGFAFAPYGA 118914
118915 YWRFVKRLCMSELLGPRTVELLRPVRAAELADVLRAAQSAAERGEGVDMSHELVRMA 119085
119086 NNSIMRMVASALPGEMAEAARDCAKQVAELVGAFNAEDFVAVCRGWDLQGIGRRT 119250
119251 NEVHARFDALLETIIEAKEEARRRSLRLGRRESSSKDLLDMLMDAAEDDTAEVKLTRD 119424
119425 NIKAFVL (0) 
       DIF 119604
119605 TAGSDTTATTVEWMLAELVNHPECMAKLRGELDAVVGRSRLVGEQDVARLPYLQAVLKET 119784
119785 LRLRPPAVFAQRVTVEPVQVRGYTIPTDTQVFFNIFSIGRDATYWDQPLHFRPDRFL 119955
119956 PDGAGATVDPKGQHPQLMPFGSGRRACPGMGLAMQAVPAFLAALVQCFDWAPPPSQP 120126
120127 LPLDMEEAAGLVSARKHPLLLLPTPRIQPLPSFYS* 120234

>AL606588.1 $F chromosome 4 clone OSJNBa0016O02 = D15214 AQ576430 nbxb0089K21f 85 clan 50% to 724A1
165880 MVGGELVLAALVILLALLLTLVLSHFLPLLLNPKAPK
165769 GSFGWPLLGETLRFLSPHASNTLGSFLEDHCSR 165671 (2)
165313 YGRVFKSHLFCTPTIVSCDQELNHFILQNEERLFQCSYPRPIHGILGKSSMLVVLGEDH 165137
165136 KRLRNLALALVTSTKLKPSYLGDIEKIALHIVGSWHGKSKDKGMVNVIAFCEEARK 164999 (0)
164739 FAFSVIVKQVLGLSPEEPVTAMILEDFLAFMKGLISFPLYIPGTPYAKA 164587 (0)
164467 ARARISSTVKGIIEERRNAGSSNKGDFLDVLLSSNELSDEEKVSFVLDSLLGGYETTSLLISMVVYFLGQS 164255
164254 AQDLELVK (0) 
       REHEGIRSKKEKDEFLSSEDYKKMEYTQH 164042 (0)
163752 VINEALRCGNIVKFVHRKALKDVRYK 163675 (1)
163488 EYLIPSGWKVLPVFSAVHLNPLLHGNAQQFQPCRWE 163381 (0)
162923 GASQGTSKKFTPFGGGPRLCPGSELAKVEAAFFLHHLVLNYR 162798 (2)
162134 WRIDGDDIPMAYPYVEFQRGLPIEIEPLCSES* 162036

>AC104473.1 $F chromosome 3 clone OJ1626B05, Length = 127982
63% to AL606588.1 85 clan 
same as AQ689106 nbxb0078N12r 77% to 90B1
      MAAMMASITSELLFFLPFILLALLTFYTTTVAKCHGGHWWRGGTTPA
34958 KRKRMNLPPGAAGWPLVGETFGYLRAHPATSVGRFMEQHIAR 34833 (2)
34664 YGKIYRSSLFGERTVVSADAGLNRYILQN 34578
34577 EGRLFECSYPRSIGGILGKWSMLVLVGDPHREMRAISLNFLSSVRLRAVLLPEVERHTLL 34398
34397 VLRAWPPSSTFSAQHQAKK 34341 (0)
34237 FTFNLMAKNIMSMDPGEEETERLRREYITFMKGVVSAPLNLPGTPYWKALK 34082 (0)
31271 SRAAILGVIERKMEERVEKLSKEDASVEQDDLLGWALKQSNLSKEQILDLLLSLLFAGHE 31092
31091 TSSMALALAIFFLEGCPKAVQELR (0)
30871 EEHLGIARRQRLRGECKLSWEDYKEMVFTQC 30776 (0)
29986 VINETLRLGNVVRFLHRKVIKDVHYK 29909 (1)
29811 GYDIPSGWKILPVLAAVHLDSSLYEDPQRFNPWRWK 29704 (0)
28816 SSGSSGGLAQSSSFMPYGGGTRLCAGSELAKLEMAVFLHHLVLNFRW 28676
28675 ELAEPDQAFVFPFVDFPKGLPIRVHRIAQDDEQE* 28571

>AP003244 $F 59% to 90D1
CDS join(30874..31124,31209..31536,32037..32186,32506..32754,
33832..33921,34032..34217,34728..34849,38166..38262)
AQ157843 64% identical to AQ290163 75% TO 90C1 AT HEME BINDING REGION
C97894 Rice callus Oryza sativa cDNA clone C0085_11A, mRNA sequence
extreme C-term 71% to CYP90C1 opp end = C97895 (probably 3 prime untranslated)
MVSAAAGWAAPAFAVAAVVIWVVLCSELLRRRRRGAGSGKGDAA
AAARLPPGSFGWPVVGETLEFVSCAYSPRPEAFVDKRRKLHGSAVFRSHLFGSATVVT
ADAEVSRFVLQSDARAFVPWYPRSLTELMGKSSILLINGALQRRVHGLVGAFFKSSHL
KSQLTADMRRRLSPALSSFPDSSLLHVQHLAKSVVFEILVRGLIGLEAGEEMQQLKQQ
FQEFIVGLMSLPIKLPGTRLYRSLQAKKKMARLIQRIIREKRARRAAASPPRDAIDVL
IGDGSDELTDELISDNMIDLMIPAEDSVPVLITLAVKFLSECPLALHQLEEENIQLKR
RKTDMGETLQWTDYMSLSFTQHVITETLRLGNIIGGIMRKAVRDVEVKGHLIPKGWCV
FVYFRSVHLDDTLYDEPYKFNPWRWKEKDMSNGSFTPFGGGQRLCPGLDLARLEASIF
LHHLVTSFRWVAEEDHIVNFPTVRLKRGMPIRVTAKEDDD

>AP004314.1 $P chromosome 7 clone P0495H05, Length = 205470 90D like
137016 QAKKKMVRLIQRIIWDKRARRATTGPP
136946 RDAIDVLFRDTSDELTDELISDTMIDLMIPAGDSVPVLITLTVKFLSECLLALHQLEXXXXXX 136776
135719 KRRKTSMGETLQWTDMSLSFTXX 135657
135520 VITETLWLGNIIGGIMHKAVRGVEVNGYLIPKGWC 135416 frameshift
135413 SVHLDDMLYDEPYKFNP*RSK
134847 EKDMSNGSFTPFGGGQRLCPGLDLARLEASIFFHHLVTSFRY 134722

>AP003895.1 $P chromosome 7 clone OJ1564_F08 pseudogene of AP003244 gene (CYP90D like)
same as AP003851.1 52402-50108
AL607093.1 137125-139380 chromosome 7 clone OSJNBa0046C22 identical to AP003895.1
AL607094.1 128525-130820 chromosome 7 clone OSJNBa0046C22 identical to AP003895.1
missing N-term 242 aa
59055 QAKKKMVRLIQRIIWDKRARRA 59120 frameshift
59119 PPRDAIDVLFRDTSDELTDELISDTMIDLMIPAGDSVPVLITLTVKFLS 59265
59266 ECLLALHQLE 59295 (0) 
      EENMQL 60350 frameshift
60352 KRRKTSMGETLQWTDMSLSFTH 60414 (0) boundary shift
60551 VITETLWLGNIIGGIMHKAVRGVEVNGYLIPKGWC 60655 6 aa deletion and frameshift
60658 SVHLDDMLYDEPYKFNP*RSK 60720 (0)
61224  EKDMSNGSFTPFGGGQRLCPGLDLARLEASIFFHHLVTSFRY 61349
Missing C-terminal

>AP003850.1a $P chromosome 7 clone OJ1793_E11 pseudogene fragment last half of ex 4 
84% to gene 6
14670 SLRYLVRQFRWKLCCKEEGYRNATPMPVLGLPIELETRSPPDYAHA* 14810

>AP003850.1b $F chromosome 7 clone OJ1793_E11 gene 1 36% to 725A1 83% to gene 2
37% to 716A1 may be new family
22610 MSMDSSMLLALLLALFIPILLHLVTRRKYASYNLPSGSLGFPLIGQTISLLRALRKNT 22783
22784 DYQWYQDRIKKYGPVSKMSVFGSPTVLLTGPAANRFAFCNPDLIFTQTKALNALV 22948
22951 RSILMLSGEELKQVRSAVQGYLRPEMVTKYIWKMDKEVRRHIDLHWVGQKTLT (0)
      VAPLAKRLTFNITCSVFFGEEAGP 23310
23311 IREALATDFEALVKATLSIPVNIPFTKFNKGLSASWRIRKLLSRIARERE 23460
23461 AALQQGRCSSTDDFFTYMIALRSEG 23535 frame shift 4bp deletion
23535 HTLTVEDIVDNAILLLTAGYETTSVLIIFLLRYLANEPDILGNITE (1) 23672
      EQEEIARNKGPNEPLTWDDVSRMKYTWKVAMETLRTVPAIFGSFRT 23894
23895 AIKDIEYQGYHIPKGWQ 23945 (0)
24091 IFTDQIVTHLDTNFFDGPRKFDPARFHNQSSIPPYCFVPFGGGPRMCPGNEFAKTGTLV 24267
24268 AMHYLVRQFRWKLCCKEEGYRKDPTPMPLLGLPIDLETRSPPGYAHS* 24411
                     H  T   L  T  V  E
23521 ttacgctccg aaggcacaca cttacagtgg aggacattgt ggacaatgca atcttactcc
            S  E   G  T  H

>AP003850.1c $P chromosome 7 clone OJ1793_E11 pseudogene fragment end of exon 2 
73% to gene 2
24875 SFTVEDIVDNAIFLLIAGYETTSVLITFIPWHLDKEPEILGK 25000

>AP003850.1d $F chromosome 7 clone OJ1793_E11 gene 2 37% to 725A1 59% to Zea AF090447
AQ290163 40% identical to AQ050520 55% TO 716A1
26522 MDSSMPFALLLALLIPILLHFVIRRKYSSYNLPPGSLGFPVIGQSISLLRAL 26677
26678 RSNTDYQWYQDRIKKYGPVSKMSVFGSPTVLLTGPAANRFVFSNQDLIITETKAANALIG 26857
26858 RSILTLSGEELKQVRSALQGYLRTEMVTKCIRKMDEEVRRHIDLNWVGQKTVT 27016 (0)
27144 VAPLAKRLTFDIICSVIFGQGAGPIREALAADFKKMVQAMLSIPVNIPFTKFNKGL 27311
27312 SASRRVRKVLRQIARDREAALQQGHSSSADDFFTYMLVLRSEGTHSL 27452
27453 TVEDIVDNAILLLLAGYETSSVLITFLLRYLANEPDILGKITE 27581 (1)
27674 EQEEIARYKGPDEPLTWDDVSRMKYTWKVAMETLQTVPPIFGSLRTAIKDIEYQGYHIPKGWQ 27862 (0)
28024 VFTAIIITHLDANFSDDPNKFNPARFHNQSSVPPYCFVPFGGGPRMCPGNEFARTETLV 28200
28201 AMHYLVRQFRWKLCCKDEGYRKDPVPMPLELPIELETRSSPWICLK* 28341 

>AP003850.1e $F chromosome 7 clone OJ1793_E11 gene 3 37% to 725A1 85% to gene 2
32688 MSMDSSIPFALLLALLIPILLHVVIRRKYSSYNLPPGSLGFPVIGQTISLLRALHSNTDYQ 32870
32871 WCQDRIEKYGAVSKMSLFGSPTVLLAGPGANHFVFSNQDLIFTETKAINALVGRSILTLS 33050
33051 GEELKQVRGALHGYLRPEMVTKYMRKMDEEVRRHIDLNWVGHKTVT (0)
      VAPLARRLAFDIICSVIFGQGVGPIREALAADFETMVK 33410
33411 AMLSIPVNIPFTKFNKGLNASRRIRKVLRQIARDMEGALQQGYSSSA 33551
33552 DDFFTYMLVLRSKGTHSLTVEDIVDNAIVLLAAGYETSSVLITFLIRCLANEPDIFGKITD 33734 (1)  
      EQEEIARSKGPNEPLTWDDVSRMKYT 33911
33912 WKVALEILRTISPIFGSFRTAIKDIEYRGYHIPKGWQ 34022
34162 VFHAQSITHLDGKFFNDPIKFDPTRFDNQSLIPPYCFVPFGGGPSMCPGNEFPRTETLV 34338
34339 AMHYLVRQFRWKLCCKEEGYRKDPLPTPVLGLPIELETRTPPEYAHA* 34482

>AP003850.1f $F chromosome 7 clone OJ1793_E11 gene 4 37% to 725A1 87% to gene 2
AG021457.1 strain NC0074 PCR genomic clone NC0074_0_402_1A.Length = 657
AG023373.1 strain ND0041 PCR genomic clone ND0041_0_401_1A.Length = 522
35% to 716A2 almost identical to AQ857616.1 may be same gene = gene 4
44135 MPFALLLALLIPILLHFVIRRKYSSYNLPPGSLGFPVIGQSISLLRAL 44278
44279 RSNTDYQWYQDRIKKYGPVFKMSLFGSPTVLMAGPAANHFVFSNQDLIFTQTKAINTIIG 44458
44459 RSILTLSGEELKQVRGALQGHLRPEMVTKYMRKMDEEVRRHIDLNWVGHKTVK (0)
      VAPLAKRLTFDIICSVVFGQGIGPIREALATD 44818
44819 FETLVQALLSLPVNIPFTKFNKGLSASRRIRKVLRQIAREREAALQQ 44959
44960 GHSSSADDFFTYMLVLRSEGTHSLTVEDIVDNAIVLLTAGYGNSAVLITFLLRYLANDPDILGKITE 45160 (1)
45259 EQEEIASSRGPNEPLTWDDVSRMKYTWKVALETLRTVPPIFGSFRTAIKDIEYRGYHIPKGWK (0) 
45601 VFTAQSITHLDGNFFNDPVKFDPTRFDNQTSIPPYCFVPFGGGPRMCPGNEFARTETLVTMHYL 45792
45793 VRQFRWKLCCKEEGYRKDPLPMPVLGLPMELETRSPP* 45906

>AP003850.1g $P chromosome 7 clone OJ1793_E11 pseudogene 5 missing N-terminal 29 aa
33% to 725A1 44% to gene 2 35% to 716A2
48172 SIPPGSLGLPLIGQSLALLRAMP 48240 frameshift
48243 RNTAERWLQDRIDRYGPVSKLSLFGAPTVLLAGPAANKVVFLHEALAPKQPRSLATI 48413 frameshift
48415 PREEEHPGARQRADAVPQAGDAAAVRGKDRRRGE
48517 VRRHLADRWAGRRTVTVLPLMKTLTFDIIATLLFGLERGAIREQLADAYTGMLEGTQSVP 48696
48697 VDL 48705 frameshift
48711 PFTAFRNGLKAIERDGAPAARGDRELREKKANLEEQGKSS 48830
48831 PTDDPISCLVSLRLRRRRRPAAADRWRRRSSITPCSNALVAGHDTSAILLTFMVRHLAND 49010
49011 PATLAAMVQ (1)  bad exon boundary
      EHDEIAKSKQEGEALTRE 49190
49191 DVTRMKLSWRVAQESTR 49241 14 aa deletion and frameshift
49243 DIEVDGYVIPKGWQAFWSPCVTHMDPAKFLPSRFDAQAASAAPPPPL 49383
49384 P*SFVAFGGGPRVCPGMELARVETLVTTHYLVRHFRWQLCC 49506 10bp insertion and frameshift
49517 EENTFVRDPLPTPLNGLPVELDRIAPLW* 49603

>AP003850.1h $F chromosome 7 clone OJ1793_E11 gene 6 0ne in frame stop codon
38% to 725A1 85% to gene 2
AQ857616.1 nbeb0006M07f CUGI Rice BAC genomicLength = 837 44% to 87A2
90% to gene 6 probably same gene
60284 MDSSMPFALLLALLIPTLLRFVIRRKY
60365 SSYNLPPGSLGFPLIGQSISLLRALRSNTDYQWYQDRIKKYGPVFKMSLFGSPTVLMAGP 60544
60545 AANHFVFSNQDLIFTQTKAINTILGRSILTLSGEELKRVRSALQGYLRLEMVTKYMRKMD 60724
60725 EEVRRHIDLNWVGHKTVK  (0) 
60884 AAPLAKRLTFDIICSVVFGQGIGPIREALATDFETLVQALLSLPVNIPFTKFIKGLRASRRIRKVLR 61084
61085 QIAREREAALQQGHSSSADDFFTYMLVLRSEGTHSLTVEDIVDNAIV 61225
61226 LLTAGYGNSAVLITFLLRYLANDPDILGKITE 61321 (1)
61420 EQEEIARRKGPNEPLTWDDVSRMKYTWKVALETLRTVPPIFGSFRTAVKDIEYHGYHIPKGWQ (0) 
61762 VFTAQRITHLDGNFFNDPVKFDPTRFDNHTSIPPYCFVPFGGGPRMCPGNEFARTEILVTMH*L 61953
61954 VRQFRWKLCCEEEGYRKDPVPIPVLGLPIELETRSPPEYAHA* 62082

>AP003850.1i $F chromosome 7 clone OJ1793_E11 gene 7 36% to 716A2 79% to AP004051
65671 MALAVVVVALLVAFLTPLAVYLAGRSTRTKPPPRRNLPPGSLGLPLVGQSLSLLRAMRRNTAER 65862
65863 WLQGRIDRYGPVSKLSLFGAPTVLLAGPAANKAVFLSEALAPKQPRSLAAIIGRRNMLEL 66042
66043 VGDDHRRVRGALAQFLRPEMLRRYVGRIDGEVRRHLAGRWAGRRTVAVLPLMKLL 66207
66208 TLDVIATLLFGLARGAVRERLAAAFADMLEGLWAVPLDLPFTAFRRSLRASARARRL 66378
66379 LAATVREKKANLEQGESSPSDDLISYLVSLRDGDGGGGRPLLTEEEIIDNSIVCLTAGHD 66558
66559 TSAILLTFMVRHLADDPAILAAM (0) 
      HEEIARSKRD 66738
66739 GEALTWEDVARMKLTWRVAQETLRMVPPVFGSFRRALEDVELDGGYVIPKGWQVFW 66906
66907 APCVTHMDPAIYHDPDKFDPSRFDAQAAASAPPPYSFVAFGGGPRICPGMELARVETLV 67083
67084 TMHYLVRHFRWRLCCGGEENTFVRDPLPSPANGLPVELDHIAPLRCDEFNS* 67239

>AP004051.1 $F chromosome 7 clone OJ1218_C12 79% to AP003850 gene 7
AP004266.1 148535-147580 region chromosome 7 clone P0038F10, = AP004051
27221 MAVVAVAVVIALLAAFLTPLAVYLARRASPPPP
27122 PRRNLPPGSLGLPLIGQSLSLLRAMRRNTADRWLQDRIDRYGPVSKLSLFGAPTVLVAG 26946
26945 PAANKVVFHHEALAPKQPRSLAAIIGRRNILELVGDDHRRVRGAILQFLRPDMV 26784
26783 RRYVGKIDSEVRRHLAARWAGRRTVAVFPLMKTLAFDVIATLLFGLDRGAIREQLA 26616
26615 DAFDGMHEGLWTVPVDLPFTPFRRGLMASARARRLVEATVREKAAKLEH 26469
26468 GESSPSDDLISCLLSLRDGGRQLLTEEEIVDNSVLALVAGHDTSAVLLTFMLRH 26307
26306 LANDPATLAAM 26292 (0?) defective joint compared to AP003850
      HEEIARGKRDGEALTCEDVAKMKLSWRVAQETLRMVPPVLGSFRRAPVDVEFEGYTI 26017
26016 PRGWQIFWSPSVTHMDPAIFHEPTKFEPSRFDGAAAAAAYSFVPFGGGPRICPGMEL 25846
25845 ARVETLVTAHYLVRHFRWKLCLGEEKNTFLRDPMPTPHDGLPVELDHIAPLC* 25687

>AC092557.2 $F chromosome 3 clone OSJNBa0096I06 38% to 722A1
Similar to CYP90C in CYP85 clan 
11388 MAVFGAAVLVALAVTCGLIWSRSRRLSKEMRDIPGTMGWPVVGETFSFISGFSSPAGILSFMRDRQKR 11185 (2) 
11068 FGKVFKTYVLGRMTVFMTGREAAKILLSGKDGVVSLNLFYTGKQVLGPTSLLTTNGD 10898
10897 EHKKLRRLIGEPLSIDALKKHFDFINDLAVQTLDTWLDRRVLVLEEASS 10751 (0) 
10664 FTLKVIANMLISLEPEGEEQEKFRANFKIISSSFASLPLKIPGTAFHRGLK 10512 (0)
10413 ARNRMYAMLDSVIARRRDGGEVRNDFLQTLLRKHAKDGTAADEDDGGGGGDRTLNG 10258
10257 DADKLTDAQLKDNILTLLVAGHDTTTAGLTWLIKFLGENPEALQKLR 10117 (0) 
      EEHMEIKERLDGSSHLRWSDVNSMPYTNK (0) bad boundary see Zea mays EST AW224879
9726  VMNETLRRATILPWFSRKAAQDFSID 9649 (1)
9538  GYEIKKGTSVNLDVVSIHHDPSVFADPYKFDPNRFD (0) 
9345  GTLKPYSFLGFGSGPRMCPGMSLARLEICVFIHHLVCRYS 9244 (2)
9165  WTPLEDDDSVQPTLVRMLRNKYPIVAAAI* 9076

>AC073405.2 $F chromosome 5 clone P0036D10, complete sequenceLength = 156772 44% to 722A1
AQ576503 nbxb0089E02f 42% with 707A3 I-helix region = AC073405
      MESLAAGAWWVVVLLLLVLTIVASWYRSWWKTTEAGGPLLPPPAAGAGPWWVWVWQWR
72407 ETAAFLASHGSGRGFYHFVQERYKLYKGEGEGEATCCFRTALMGRVHVFVSASHPAASQL 72228
72227 LTAEPPHLPKRYARTAADLLGPHSILCSTSHAHHRHARRALATTLFATPSTAAFAAAFDR 72048
72047 LVIRHWTTLLPPHNQNQVVVVLDAALHISYRAICEMLLGAGGGKLRPLQSDVFAVTQAM 71871
71870 LALPLRWLPGTRFRRGLHARKRIMAALREEMAARNHHHHHHHHHHDLLSVLMQRRQLGH 71694
71693 PDALTEDQILDNMLTLIIAGQVTTATAITWMVKYLSDNRLIQDKLR 71556
      AEAFRLELKGDYSLTMQHLNAMDYAYK
71216 AVKESLRMATIVSWFPRVALKDCQVA 71139
70844 GFHIKKDWIVNIDARSLHYDPDVFDNPTVFDPSRFDV 70734
70609 QPQKRRLLVFGAGGRTCLGMNHAKIMMLIFLHRLLTNFR 70493
70397 WEMADDDPSLEKWAMFPRLKNGCPILLTPIHNS* 70296

CYP86 clan contains four families 86, 94, 96, 704 (41 sequences)

>AP003232.1h $F chromosome 1 clone P0034E02 one in frame stop 53% to 94D2
100994 MEFSSSSTSLFLLLSILPLLYFLCQRNDPKKQPHAHGLKSYPVVGIVPHFTKNNDRF 100824
100823 LEFTTEIMKRSPTQTMSFKALGLTGGGVITANPANVEYTLKTNFGNYPKGELAVSMVVDF 100644
100643 LGHGIFNSDGEQWQWQRKAASYEFNKRSLRNFVVDTVRSEVVERLLPLLERAERDGRTLD 100464
100463 VQDVLERFAFDNICQVAFDEDPACLAEDSMASPQSAEFMRAFNDAQIAVRDRFMSPVKSL 100284
100283 WRFKRLFNMEPERRMREALATIHGFAERIVRERRERGKAGLARSDDFLSRFAASGEHSDE 100104
100103 SLRDVVTNFLLAGRDTTSSALTWFFWVLSGRPDVEDKIVREIHAVRRASGSTSDATFSFD 99924
99923  ELRDMQYLHAAITESMRLYPPVAMDTHSCKEDDFLPDGTFVGKGWLVTY*AYAMARVEDI 99744
99743  WGADCEEFRPERWLDEVGAFRPESPFKYPVFHAGPRMCLGKEMADIQMKSIVASVLERFS 99564
99563  LQYAGGEGHPGLVLSVTLRMKGGLPMQVATRG* 99465

>AP003232.1g $F chromosome 1 clone P0034E02 
AZ135305.1 OSJNBb0115G10r CUGI Rice BAC genomic 53% to 94D2
AZ131112.1 OSJNBb0104L22f CUGI Rice BAC genomic 
91065 MEFSSSSTSLFLLLSILPLLYFLCQRHDPKKQPHAHGLKSYPVVGTLPHFAKNKDRF 90895
90894 LEFITEIMKRSPTHTLSFKALGLTGGVITANPANVEYTLKTNFGNYPKGELAVSMLVDFL 90715
90714 GHGIFNSDGEQWQWQRKAASYEFNKRSLRNFVVDTVRSEVVERLLPLLERAERDGRTLDV 90535
90534 QDVLERFAFDNICHVAFDEDPACLAEDSMASPQSAKFMRAFSDAQNAVMDRFMSPVKSRW 90355
90354 RFKRLFNMEPERQMREALATIHGFAERIVRERRERGEAGLARSDDFLSRFAASGDHSDES 90175
90174 LRDVVTNFLIAGRDTTSTALTWFFWLLSGRPDVEDKIVREIHAVRRASGGTGDPTFNLDE 89995
89994 LRDMQYLHAAITESMRLYPPVAMDSHSCKEDDFLPDGTFVGKGWFVSYSAYAMARVEDIW 89815
89814 GADCEEFRPERWLDEAGAFRPESPFKYPVFHAGPRMCLGKEMAYIQMKSIVASVLERFSL 89635
89634 RYAGGEGHPGFVLWLTLRMKGGLPMQVTTRG* 89539

>AP003232.1f $F chromosome 1 clone P0034E02 
86279 MKFSSTSTPLFILLLPFLPLLYFLYLYQDTKKQPAGSNGLKSYPVVGTLPHFAKNRHRFLEWST 86088
86087 DVMKRSPTHTMTFKALGLTGGVITANVANVEHILKTNFSNYPKGELSVSLLEDLLGHGIF 85908
85907 NSDGEQWLWQRKAASYEFNQRSLRSFVVDTVRFEVVERLLPLLEWARRDGRTLDVQDVLE 85728
85727 RFAFDNICHVVFHEDPACLAEDSMVSSQSAEFIRACSDAQNAIIARFMSPVKSLWRVKRL 85548
85547 FNLDPERRMRDALTTIHGYADRIVRERRARGEAGLARSDDFLSRFAAGGEHSDESLRDVV 85368
85367 TNFLIAGRDSTSSALTWFFWLVSSRPDVEDKIVHEIRAVRSASSSGGTSSATFSFDELRD 85188
85187 MHYLHAAITESMRLYPPVHLDTHSCKEDDFLPDGTFVGKGWLVTYCAYAMGRVEDIWGAD 85008
85007 CEEFRPERWLDEAGAFRPDSPFKYPIFHAGPRMCLGKEMAYIQMKSIVACVLEQFSLRYA 84828
84827 GGDGHPGFVLWSTLRMEGGLPMQVTTRE* 84741

>AP003232.1e $F chromosome 1 clone P0034E02 51% to 94D1 4 diffs with BE230752
3 diffs with AQ509587 78% to AZ135305
81213 MELSPISASLLLILILLAFLPLLYFLYMHQDPKKKPRIHGLKSYPVVGTLPHIIKNK 81043
81042 HRFLKWSTSIMKCSPTNTMSYKALGLTGGVITANPANVEHILKTNFDNYPKGKLTVSML 80866
80865 EDFLGHGIFNSDGEQWLWQRKAASYEFNKRSLRNFVVDTVRFEIVKRLLPLLEQAGLDGR 80686
80685 TLDLQDVLERFAFDNICLVAFGEDPACLTKERMAAPQSAEFMRAFNDAQNAILARFNSPA 80506
80505 KSLWRVKKLFNMEPERRMREALATIHGFAERIVRERRERGEAGLARGDDFLSRFAASGEH 80326
80325 SDESLRDVVTNFVLAGRDTTSSALTWFFWIVSGRPDVEDRVVREIRAVRASSGSTDATF 80149
80148 SFDELREKHYLHAAITESMRLYPPVAIDTHSCKEDDFLPDGTFVGKGWL 80002
80001 VMYSAYAMGRMEGIWGADCEEYRPERWLDEAGAFRPESTFKYPVFNAGPRICIGKE 79834
79833 MAYIQMKSIVACVLEKFSLRYAS 79765
      DANERPRSVLSLTLRMKWGLPMKVTIRK* 79678

>AC105364.1 $F chromosome 3 clone OJ1743A09, Length = 145936
53% to 94D2
53146 MAAILLLLLVFLVSVALVVRTWSASRRSHEKARYATAELRPY
53273 PLVGHLPQFLANRHRILDWMTEVLSRQPTRTFVLRRPGGVRGVITANPANVERF 53434
53435 LRSGFDNYPKGERFASLLHDFLGRGIFNADGEAWRSQRKAASYEFNTRSLRAFVARCVHG 53614
53615 ELHGRLLPLLRRAAAEGRAIDLQDALERFAFDNICRVAFDHDPGQLPDASGGGALAEADD 53794
53795 GSTASGRFADAFRDAANLSAGRFRYAVPWFWRVKKALHIGSERRLRESIAIVHDFADRII 53974
53975 RSRREEIRAGLEKHDLLSRFMASHDESYTEVALRDVVISFLLAGRETTSSALTWFFW 54145
54146 LLSSRPDVERRIREEVATVRARRGDGDVDRVGFDLDELREMQYVHAAITESMRLYPP 54316
54317 VPVDSLHAQEDDVLPDGTAVEAGWFVAYNSYAMGRMESVWGKDAAEFRAERWLEDAAAAT 54496
54497 FRPESPFRYVSFHGGPRVCLGKEMAYIQMKSIIACVLQELELAVDGAYRPRQVTSLTLRM 54676
54677 ADGLPTRVKVRGN* 54718

>AP003232.1d $F chromosome 1 clone P0034E02 48% to 94D2
76378 MELSSISASLLLILPLLPLLYFLYLHQDPKKQPRAHGLKSYPVVGTLPHFIKHKNHI 76208
76207 LEWSAGVLKRSPMHTMSFKALGLTGGVFTANPANVEHMLKTNFGNYVKGEAIITMLEDFL 76028
76027 GRGIFNSDGEKWLWQRKATSYEFSKRTLRNFVVDTVQFEVIERLLPLLERAGRDGRTLDV 75848
75847 QSVLERFAFDNICRVVFDEDPACLAKDSVASPHIAEFMGACNDAQNAILARFNSPIKSLW 75668
75667 RVKRLFNIESERRLREALATIHAYTDRIIRERRERGEARGDDFLSRFAAGDKHSDES 75497
75496 LHDVITNLVLAGRETTASALTWFFWLVSGRPDVEDNIVREIRAVRRASSSNGVTSGAAFS 75317
75316 PHELRDMHYLHAAITESMRLYPPVSLDTYVCKEEDFLPDGTFVGKGWQVTYCAYAMARVE 75137
75136 DIWGTDCEEFRPERWLDEAGVFRPESSFKYPVFHGGPRMCLGKEIAYIQMKSIVSCVFDR 74957
74956 FTLRYTGGEGHPGLVTSLALRMEGGLPMQVLLTNRGQAVSC* 74831 

>AP003232.1c $P chromosome 1 clone P0034E02 pseudogene fragments 
67707 FT*VFNDAQNTIVSRFLS*VKSL*RFKRLFNMEPKRQMWEALAR 67576
67575 HDPATPSGSFSKHNN*SLREVVTSFLLA 67492
67313 TFVGKGWLVIYYAYAMRYVEDIRGSDCEEFRLEQWMNKAGVF*PKSSFK 67167
67164 FEYPIFYIGQRMCLGKEMTYIHGFGFTALEKLMQRVA 67054

>AP003232.1b $F chromosome 1 clone P0034E02 
BE230752.1 99MJ354 Rice Seedling cDNA clone 99MJ354 65% to 94C1
60720 MELSSTSASLLLILLLTLVYFLYLHQDPKKKPRTHGLKSYPVVGTLPHFINNKDRF 60553
60552 LEWSTGVMKRSPTHTMSFKELGLTGGVITANPANVEHILKANFGNYPKGELAVSLLEDFL 60373
60372 GHGIFNSDGEQWLWQRKAASYEFNKRSLRNFVVDTVRFEVVERLLPLLEYAGRHGRTLDV 60193
60192 QDVLERFAFDNICRVAFDEDPACLTEESMAAPQSAEFMRAFNDAQNAILDRFNSPAKSLW 60013
60012 RIKKLFNMEPERRMRDSLATIHGYAERIVRERRERREARLERRDDFLSRFAASGEHSDES 59833
59832 LRDVVTNFILAGRDTTSSALTWFFWLLSGRPDVEDKIVREIRAVRQSSAGSEGTRGATFS 59653
59652 LDELRDMQYLHAAITESMRLYPPVPFDTHSCKEEEFLPDGTFAGKGWLVTYCAYAMGRVE 59473
59472 DIWGADCEEFRPERWLDEAGAFRPESTFKYPVFHAGPRMCLGKEMAYIQMKSIVACVLEQ 59293
59292 FSLRYAGDAKGHPGLVVALTLRMEGGLPMKVTIRE* 59185

>AP003232.1a $F chromosome 1 clone P0034E02 
AZ135359.1 OSJNBb0115O14f CUGI Rice BAC genomicLength = 843 50% to 94D2
43934 MGTELSLTSSVPLLLLLLFPLLCFLCLRHGSTRKQPRADGLKAYPIVGILPHFVRNQHRL 43755
43754 LEWSAGVVARCPTHTMSFNFKGFGLIAGAITGNPANVEHIVKTNFQNYSKGEYVVSVMED 43575
43574 FLGHGIFNSDGDQWLWQRKAASYEFNKRSLRNFVVDTVRSEVVDRLLPLLTRAERDGRTL 43395
43394 DEQDVLERFAFDNVCCVAFDEDPACLTEEGMGTNARTEFLRAFNDAQNILMARFMSPVEW 43215
43214 AWRAKRLLDLEPERRMREALATIHGYADRIVRERRERGAAGLARKDDFLSRFAATGKHSD 43035
43034 ESLRDVX 43017 frameshift
43014 TNFVLAGRDTTSSALTWFFWLVSGQPDVEDRIAREIRAVRASSGSTDAAAFSFDELREMH 42835
42834 YLHAAITEAMRLYPPVAMDSHCCQNDDVLPDGTFVGKGWQVTYSAYAMARLEELWGADCE 42655
42654 EFRPERWLDEDGVFRPESPFKYPVFHGGPRMCLGKEMAYIQMKSIAACVFERFSFRFVG 42478
42477 GEGRPGLVFSVTLRMEGGLPMRVKKRRDSVC* 42382

>AP004239.1 $P chr 6 clone OJ1115_C07 90% identical to I-helix region of AP003232
136094 (aa309) SSALTWFFWILSGRPDVEKR aa336 136153

>AC090870.8 $P chromosome 10 clone OSJNBb0015K05 
Pseudogene fragment from aa 63-337 Nearly identical to AP003232 91065-89539
AQ509587 nbxb0096J15f 69% to 96A1
104876 KVMKRSPTHTMSFKALGLTGGVITANPANVEYTLKTNFGNYPKGELAVSMLVDFLGHGIF 105055
105056 NSDGEQWLWQQKAASYKFNKRSLRNFVVDTVRFEVVERLLPLLERAERDGRMLDVQDVLE 105235
105236 CFAFDNICHVAFDEDPACLA 105295 frameshift
105297 EESMASPQSAEFMRAFNDAQNAVMDRFMSPVKSRWCFKRLFNMEPERRLQEVLATIHGFA 105476
105477 ERIVGERRERVVAGLARSDDFLLRFAASGEHSNESLRDVVTNFLLVGHDTTSSALTWFLC 105656
105657 RPVVEKREEKE 105689

>AP003735.2a $F genomic DNA, chromosome 1, BAC clone:B1147A04, complete 39% to 94A2 43% to 94D2
4872 MYMQMDSSYLHLVLPAAAAAVVVAVVLLLSLWRRCQ
4981 TTSNHRPQANPILGNLVAFLANGHRFLDWSTGLLAAAPASTMQVHGPLGLGYCGVATASP 5160
5161 DAVEHMLRASFHNYVDKGDRVRDAFADLLGDGLFLANGRLWRLQRKLAASSFSPRLLRLF 5340
5341 AGRVVLDQLRRRLLLFFDAAADARRVFDLQDVLKRFAFDNICSVAFGVDRDDSSPSSSSP 5520
5521 SRLEAGGDGRDDAFFAAFDDAIDISFGRILHPTTLAWKAMKLLDVGSERRLRQAIGVVDE 5700
5701 YVTAIMESKQRCSDSEEESDLLSRFTAAMMEEDGGNELGAMFDSPEAKRRFLRD 5862
5863 TVKTFVLAGKDTTSSALTWLFWFLAANPECERRVYEEVTALRGDTAGDERDDGYEELKRM 6042
6043 HYLHAAITETMRLYPPVPLASRVAAADDVLPDGTVVRAGWFADYSSYAMGRMPQLWERDC 6222
6223 GEFRPERWLDGGGGGGGRFVAVDAARYPVFHAGPRSCLGKEMAYVQMKAVAAAVVRRFSV 6402
6403 EVVPAAAANAPPSPPPHETAVTLRMKGGLRVLLTRRRGVLSHA* 6534 

>AP003735.2b $F genomic DNA, chromosome 1, BAC clone:B1147A04, complete 
61% to AP003735 4872-6534
8263 MDGTLAPLLLLLLLFLPALLLYLRRRPAAASRINNNHCPHPNPVLGNALPFLRNRHRFLDW 8445
8446 ATDLLAAAPTSTIEVRGALGLGSGVATANPAVVDHFLRASFPNYVKGARFAVPFEDLLGR 8625
8626 GLFAADGRLWALQRKLASYSFSSRSLRRFSARVLRAHLHRRLVPLLDAAAGSGEAVDLQD 8805
8806 VLGRFGFDNICNVAFGVESSTLLEGGDRRHEAFFAAFDAAVEISVARVFHPTTLVWRAMR 8985
8986 LANVGSERRMRDAIRVIDEYVMAIVASEERLRLRRGEDEREHEQHLLSRFAASMEEEGGE 9165
9166 LAAMFGSPGAKRRFLRDVVVSFVMAGKDSTSSALTWLFWLLAANPRCERRVHEEVSSSRH 9345
9346 ADPRRADAGEDGHGDGYDELRRMHYLHAAISEAMRLYPPVPIDSRVAVAADALPDGTAVR 9525
9526 AGWFADYSAYAMGRMPQLWGEDCREFRPERWLSDGGEFVAVDAARYPVFHAGPRAC 9693
9694 LGREMAYVQMKAVAAAVIRRFAVEPVQAPASMETPPACEVTTTLKMKGGLLVRIRKR 9864
     EDDAAQQKLT* 9897

>AP003442.1 $F chromosome 1 clone B1096A10 72% to 86A1
AQ954084.1 nbeb0054A03r CUGI Rice BAC genomicLength = 403 57% to 86A1
D41651  71% to 86A1 almost identical to AP003442
49790 MAAAAVALASAYMVWFWALSRRLSGPRMWPLVGSLPSVVLNRARVHDWIADNLRATGDAATYQTCILPLPFLARRQGLV 
50026
50027 TVTCNPRNLEHILRARFDNYPKGPMWQASFHDLLGQGIFNSDGETWLIQRKTAALEFTTR 50206
50207 TLRQAMARWANRSIKYRLWRILDDHCNAAASVDLQDLLLRLTFDNICGLTFGKDPETLSP 50386
50387 GLPENPFANAFDEATEATMQRFLFPSLLWRIKKAFGVGSERSLRDSLAVVDRHMTETIAA 50566
50567 RKATPSDDLLSRFMKKRDSKGKAFPEDVLQWIALNFVLAGRDTSSVALSWFF 50722
50723 WTLMQRRDVERKVVLEIASVLRETRGDDTARWTEEPLNFDELERLVYLKAALTETLR 50893
50894 LYPSVPQDSKYVVADDVLPDGTVVPAGSAITYSIYSVGRMESIWGKDCAEFRPERWLSAD 51073
51074 GSRFEPVKDAYRFVAFNGGPRTCLGKDLAYLQMKSIASAVLLRNSVELVPGHKVEQKM 51247
51248 SLTLFMKNGLRVHVKPRDIASYVEPSEPAPPQGSLVIPTTTAAAA* 51385

>AC073556.1 $F unknown clone OSJNBa0091P11 14 unordered pieces 73% to 86E1
C73892        40% to 86A8  9/97 3-57 REGION
49583 MKSPMEEAHAMPVTSFFPVAGIHKLIAIFLVVLSWILVHKWSL
49712 RNQKGPRSWPIIGATVEQLKNYHRMHDWLVEYLSKDRTVTVDMPFTSYTYI 49864
49865 ADPVNVEHVLKTNFTNYPK 49921
50010 GEVYRSYMDVLLGDGIFNADGEMWRKQRKTASFEFASKNLRDFSTVVFREYSLKLSSI 50183
50184 LSQACKAGRVVDMQ 50225
50317 ELFMRMTLDSICKVGFGVEIGTLSPDLPENSFAQAFDAANIIVTLRFIDPLWRLKK 50484
50485 FLHVGSEALLEQSMKLVDDFTYSVIRRRK 50571
50718 NGEQIKHDILSRFIELGEAGGDEGGGSFGDDKSLRDVVLNFVIAGRDTTATTLSWFTYMA 50897
50898 MTHPAVADKLRRELAAFEDERARELSYDAVGKLVYL 51077
51078 HACVTETLRLYPAVPQDPKGIVEDDVLPDGTKVRAGGMVTYVPYSMGRMEYNWGPDAASF 51257
51258 RPERWLSGDGGAFRNASPFKFTAFQAGPRICLGKDSAYLQMKMALAILFRFYTFDLV 51428
51429 EDHPVKYRMMTILSMAHGLKVRVSTSV* 51513

>AP004092.1 $F chromosome 2 clone OJ1568_B05, similar to 86C3
C99701 43% identical to D39760 60% to 86B1 same clone as C73927
84356 MAIASMAAAAAAAIVGAVREHVRASDLAVAGAVL
84458 FAFSAAVSAVRARRRGAPVLWPLVGIVPTLFVHRDDIYEWGSAALLRAGGVFPYRGTWG 84634
84635 GGSSGVITSAPANVEHVLRANFGNYPKGPYYRERFVELLGGGIFNADGEAWRAQRRAATA 84814
84815 EMHSSRFVEFSVRSIEQLVYGRLVPLAERLSGGGAAVDLQEVLLRFTFDNICAVAFGVDA 84994
84995 GCLADGLPDVPFARAFELATELSLLRFVTPPFIWKAKRLLRAGSERRLVEATRAVREFAE 85174
85175 RAVADRRNEMRKVGSLRGRCDLLSRLMSSAPGADYSNEFLRDFCISFILAGRDTSSVG 85348
85349 LAWFFWLLAGHPDVESRVVGDVLAAGGDIKRMDYLHAAL 85465
85466 TEAMRLYPPVPVDFKEALADDVLPDGTPVRARQRVIYYTYAIGRDPASWGDDAAAFRPER 85645
85646 WMRGGAFAGGESPFKYAVFNAGPRLCIGKRFAYTQMKTAAAAVLSRFAVEVVPGQEVK 85819
85820 PKLTTTLYMKNGLMVRFRRRSPPPPSPPPRHVVADDDDDDDVAAGRHVAVGSCNSNHL* 85996

>AP003230.1 $P pseudogene fragment at C-helix 2 diffs with AP004139.1
65348 PFWHTVFRDLLGDGIFNSDGETWVA*RNTAALEFTT 65241

>AP003242.1 $P 38% to perf region of AP004139.1 possible pseudogene fragment 
11819 ISYSIFSTGVLQITM*QGEILLHFWPSWL*LVDGKRPFGPMDGHLLIAYN 11670

>AP004139.1 $F chromosome 2 clone OJ1486_E07 70% to 86A2
55281 MEVGTWAVVVAVAAAYMTWFWRMSRGLS
55335 GPRVWPVVGSLPGLVQHAENMHEWIAANLRRAGGTYQTCIFAVPGVARRGGLVTVTCDP 55511
55512 RNLEHVLKSRFDNYPKGPFWHAVFRDLLGDGIFNSDGETWVAQRKTAALEFTTRTLRTAM 55691
55692 SRWVSRSIHHRLLPILDDAAAGKAHVDLQDLLLRLTFDNICGLAFGKDPETLAKGLPENA 55871
55872 FASAFDRATEATLNRFIFPEYLWRCKKWLGLGMETTLASSVAHVDQYLAAVIKARKLELA 56051
56052 GNGKCDTVAMHDDLLSRFMRKGSYSDESLQHVALNFILAGRDTSSVALSWFFWLV 56216
56217 STHPAVERKVVHELCAVLAASRGAHDPALWLAAPFTFEELDSLVYLKAALSETLRLYP 56390
56391 SVPEDSKHVVADDYLPDGTFVPAGSSVTYSIYSAGRMKTVWGEDCLEFRPERWLSADGSK 56570
56571 FEPHDSYKFVAFNAGPRICLGKDLAYLQMKNIAGSVLLRHRLAVAQGHRVEQKMSLTL 56744
56745 FMKNGLRMEVRPRDLAPVADELRGADVRATAPCA* 56849 

>AL606687.1 $F chromosome 4 clone OSJNBa0084K11 similar to AP004139
C91843        72% IDENTICAL TO 86A4    4/98 47 TO 128 REGION
AU093465.1 Rice panicle cDNA clone E31913.Length = 609
BE228888.1 98AS3245 Immature Seed cDNA clone 98AS3245. Length = 483 76% to 86A2
94178 MEAGTWAVVVAAVAAYMAWFWRMSRGLSGPRVWPVVGSLPGLVRHAEDMH
      EWIAANLRRTRGTYQTCIFAVPGLARRGGLVTVTCDPR 94441
94442 NLEHVLKSRFDNYPKGPFWHGVFGDLLGDGIFNSDGETWVAQRKTAALEFTTRTLRTA 94615
94616 MSRWVSRSIHSRLLPILSDAAAAGGGGGGATVDLQDLLLRLTFDNICGLAFGKDPETLAR 94795
94796 GLPENDFASAFDRATEATLNRFIFPECVWRFKKWMGLGMETTLARSVQHVDRYLSAVIKA 94975
94976 RKLELAAGNGKGDASSATPHDDLLSRFMRKGTYSDESLQHVALNFILAGRDTSSVA 95143
95144 LSWFFWLVSTHPAVERKIVRELCTVLAASRGADDPALWLAAPLNFEELDQLVYLKAALSE 95323
95324 TLRLYPSVPEDSKHVVADDVLPDGTFVPAGSSVTYSIYSAGRMKTVWGDDCLEFRPERWL 95503
95504 SADGTKFEPHDSFRFVAFNAGPRICLGKDLAYLQMRNIAGSVL 
      LRHRLAVAPGHRVEQKMSLTLFMKHGLRMEVRPRDLAPIVDELRGAGEYAAAARATAACA* 95815

>AL606448.1 $P two short pseudogene fragments = D15209 6 diffs with AL606687.1
D15209  69% IDENTICAL TO 86A1 Arabidopsis T04172 EST  5/93  7/98 113-131 REGION
very similar to C91843 identical to AL606448.1
135204 AFGDLLGNGIFNSDGETWVAQRKTLA 135278
135631 STSIKARKLELAAESGKGDASS 135696

*****300, 69F japonica

>AP003767.1 $P two short pseudogene fragments only 4 diffs with AL606448.1
72817 FGDLLGNGIFNFDGETWVAQHKTTA 72891
73228 STSIKARKLELAAGSGKSDASS 73293

>AC087182.8 $F chromosome 10 clone OSJNBa0029C15, 66% to 86B1
D47545, D41610, D47561, D49287, AU032597 AU067853 77% identical to 86B1
AU067852 N-terminal region
AU096913.1 Rice green shoot Oryza sativa cDNA clone S16487
AU032659 66% identical to contig of D47545 63% to 86B1 same clone = D47454 
53685 MDAVMPGAAGAHNATAAAAAGRRGGGIVAGMMALPEVQTVELLVAVSIFVAIHSLRQRRSQGLPSW 53882
53883 PLVGMLPSLLLGLRGDMYEWLTGVLASRGGTFTFHGPWLTNLHCVVTSDPRNLEHMLKTK 54062
54063 FGSFPKGPYFRDTVRDLLGDGIFGADDEVWRRQRKAASLEFHSAEFRALTASSLVELVHR 54242
54243 RLLRVLGDAEEAGDAVDLQDVLLRLTFDNVCMIAFGVDPGCLRPGLPEIPFAKAFEDATE 54422
54423 ATIVRFVTPTAVWRAMRALGVGHERVLQRSLAGVDRFAYDVIRQRKEEVAAGGGGGGGGR 54602
54603 SDLLTIFTKMRDADTGAAAYSDKFLRDICVNFILAGRDTSSVALAWFFWLLNKNPAVEAK 54782
54783 ILEEIDDIVAARRSSPPAPAVAANGADEDDLVFHPEEVKKMEYLHAALSEALRLYPSVPVDHKE 54974 (0)
56977 VVEDEVFPDGTVLKKGTKVIYAMYTMGRMESIWGEDCREYKPERWLRDGRFMGESAYKF 57153
57154 TAFNGGPRLCLGKDFAYYQMKFAAASILRRYHVRVVDGHPVAPKMALTMYMKHGLKVK 57327
57328 LTKRDKSKL* 57357

>D39760  (partial) 46% to 704A1 48% to 94B1 53% to 94D1   11/94 298-366 region I-HELIX
AQ840682 AQ288368 AQ271455 D41746 CANNOT COMPLETE THIS SEQ
54% to 96A9 53% to 86B1 86 CLAN 50% identical to AU031131
AQ325920.1 nbxb0021P14r CUGI Rice BAC genomic cloneLength = 938 86 like C-term
48% to 86A4
ortholog of aaaa01003917.1 $FI 1 diff and one 7aa fs region
MNLFDLFFVLQLLLSGVCVLVICYRYQRLKSMKKCSLGVVQWPIVGVLPAIVANMHRLLD 
GVTFLLATSQLNFQCRFWLAGFRFFVTCDPDNVR 

KHRADDESKSRQGGGG
GGDKQSVADLLSSFLCDDEIAGSADADVYIRDMAMNLLVAGRDTTSSA
LSWFFYLLSTNPRVEQKLLQELAPIASRKPQL 
QQGGSSPATAGMVTFDAS

WITENGKLRYVPSNKFVAFNSGPRTCLGK
EMALVQMKVTAAAMAWNFAVEVVPGHVVEPRLSVILHMKNGLLVRVKRREPVMNT*

>AP003289 $F 55% to 94C1 CDS complement(91864..93438)
MGVVEAEALHGAVEALAGSLQPHVATAFFVFSACTVALAALLAV
VRLRPPWWCDCTVCEAFLTASWAGEFDNLCDWYAHLLRTSPAQTVHVHVLRNVLTANP
VTVDHVLRARFDNYPKGAPFSAILADFLGRGIFNVDGDAWLFQRKLAAAELASPALRA
FAARVVASELRCRLIPLLHSASREGNGKVLDLQDMFRRFAFDSICKISFGLDPGCLEL
SMPVSTLVEAFDTASTLSARRATVPMQIIWRLKRFLNVGDERKLRDAVRLVDALAAEV
IRQRRKLGGAATGSDLLSRFMGSIDDDKYLRDIVVSFMLAGRDTIASALTAFFLLLSD
HPEVATAIRDEVARVTGDGNRTMAATFDKLKDMHYVHAAMYESMRLFPPVQFDSKFAA
GDDTLPDGTVVAKGTRVTYHAYAMGRMESVWGPDCAEFRPERWLRDGRFVPESPYRYP
VFQAGVRVCIGKELALMEMKAVIVAVVRSFDIEAIARSSRRPKFAPGLTATFAGGLPV
RVRRRRARASGHNPPI

>AC116949.3 $F chromosome 11 NO INTRONS
BI804955.1 clone S002F11.Length = 411 62% to AP003289 54% to 94C1
BI805571.1
AU056498 AU056805 64% to 94B1 from aa  79-123
AU056681.1 Oryza sativa mature leaf cDNA clone S20781_1A.Length = 419
73586 MEMELPWGARCAGLAFFAFSVCLAALGVVLLVARRWPWCSCHVCRAYLTGSWAREFTNLG 73407
73406 DWYAHLLRRSPTGTVHVHVLGCTVTANPANVEHMLRTRFDNFPKGRPFAALLGDLLGDGI 73227
73226 FNVDGHAWRHQRKMASLELGSVAVRSYAYKIIAQEVEARLMPVLADAADRGAVLD 73062
LQDVFRRFAFDXIC
KISFGLDPGCLDREMPVSELADAFDAASRLSAMRGAAASPLLWRAKRFLNVGSERELRKA
IKVVDELAAAMIRERQKLGVGSSHDLLSRFMASTGVDDAAADDKFLRDIVVSFLLAGRDT
VSTALTTLFMLLSKNPEVAAAMRAEAEAGDGGETGAAITYEHLKGLHYTHAVLHENMRL
FPPVQFDSKFCAAADVLPDGTYVGGDARVMYHPYAMGRMPHIWGADYAAFRPARWLTGPG
ASFVPANPYKYPVFQAGQRVCLGKELAVTEMKAASVAVVRAFDVEVVGENGRSGGAAAAP
RFVPGLTASISGGLQVRVRRRVHT* 72048

>AC118132.1 $F chromosome 10 39% to 96A10
AU031131 45% to 96A1 and 86A4 440-512 region C-terminal 86 CLAN
137194 MDKAMAFIPPRDLVVVLPFLALLLPLYIYLRYSRSAKANPSLPTEWPLVGMLPSLVANIH 137373
137374 NLFDYATALLAASGNSFEARGPPMSSLRFFVTCDPDNVRHIFINNFANYPKGEEFASFFD 137553
137554 VMGDSFFNADGESWRRQRARVQHVMSNPRLLASMAACCRGKVEKGLLPILDRMASAGAPF 137733
137734 DLQRLLTRFAFDVTAMAVFGVDTCRLSIDMPPLDVANAMDAVMEVGFFRHTVPVSCWKLM 137913
137914 RSLRIGPERKLTAAQRLLRRFVAEMIEKRRVAGGACKATDDEQGGVPPPADIVSSYINDP 138093
138094 EYVDEDGNPREFMYATFINYMVAGRDTVGTALSWLFFNLTEHPRVVARIREELEPIASSK 138273
138274 AGGGGMVVFDPEETKPLVYLQAALFESMRLYPPGPIERKATLAEDVLPSGHTVRAGDNIL 138453
138454 IPVYSMGRMASVWGKDSGEYRPERWVTEDGKLRHVPAHRFMPFNA 138588 (fs)
138590 GPRLCLGKDISVLQMKSVAAA 138652
138653 VAWNFDLEVVAGHAVEPKVSMVMQIKNALMVKVKKR* 138763

>AP002484a  CDS  $F join(76365..77485,77598..78009) 42% T0 96A5 new 96 subfamily
MGPLWTFILLYPEIFLAIICFFWFSLFRPIRQRQKSNLPVNWPV
FGMLPFLVQNLHYIHDKVADVLREAGCTFMVSGPWFLNMNFLITCDPATVNHCFNANF
KNYPKGSEFAEMFDILGDGLLVADSESWEYQRRMAMYIFAARTFRSFAMSTITRKTGS
VLLPYLDHMAKFGSEVELEGVFMRFSLDVTYSTVFAADLDCLSVSSPIPVFGQATKEA
EEAVLFRHVIPPSVWKLLRLLNVGTEKKLTNAKVVIDQFIYEEIAKRKAQASDGLQGD
ILSMYMKWSIHESAHKQKDERFLRDTAVGFIFAGKDLIAVTLTWFFYMMCKHPHVEAR
ILQELKGLQSSTWPGDLHVFEWDTLRSAIYLQAALLETLRLFPATPFEEKEALVDDVL
PNGTKVSRNTRIIFSLYAMGRIEGIWGKDCMEFKPERWVSKSGRLRHEPSYKFLSFNT
GPRSCLGKELSLSNMKIIVASIIHNFKVELVEGHEVMPQSSVILHTQNGMMVRLKRRD
AA*

>AP002484b $F    CDS  80463..81980 43% to 96A1
MELLPWLLGFVVKYPEIMASAACFLLLFCRFRRRSKRIPTNWPV
VGALPAIVANAGRVHDWVTEFLRAAAMSHVVEGPWGSPGDVLITADPANVAHMFTANF
GNYPKGEEFAAMFDVLGGGIFNADGESWSFQRRKAHALLSDARFRAAVAASTSRKLGG
GLVPLLDGVAASGAAVDLQDVFMRLTFDLTAMFVFGVDPGCLAADFPTVPFAAAMDDA
EEVLFYRHVAPVPWLRLQSYLKIGHYKKMAKAREVLDASIAELIALRRERKAADANAT
GDADLLTAYLACQDEIGMDGAAFDAFLRDTTLNLMVAGRDTTSSALTWFFWLLSNHPG
VEARILAELRAHPPSPTGAELKRLVYLHAALSESLRLYPPVPFEHKAAARPDTLPSGA
AVGPTRRVIVSLYSMGRMEAVWGKGCEEFRPERWLTPAGRFRHERSCKFAAFNVGPRT
CLGRDLAFAQMKAVVAAVVPRFRVAAAAAPPRPKLSIILHMRDGLKVKVHRRQED*

>D46472, D46467 N-TERMINAL 1-127 REGION 38% to 96A5 40% to 96A3 opp end = AU086015.1 AU095825
AU086015.1 Rice green shoot clone S11166.Length = 413 48% to 96A2 64% to AC087182.8
also = BI810846.1, AU095825.1 BM421002 (clone U022E12) cannot extend
(Partial) ortholog of aaaa01022933.1 $FI 8 diffs
MALSLTPSLQDLVSSLMILLVLLYISYWRSKYQSLFPVDWPIVGLLPTLVANLHRLHDKITEYLAASGISFTGN
GPAASNMRLFVTCDPDNVRHIFTTNHDNYPKGHEFAEIFDIMAGGFFTMDGXL 
(gap)
FETLXLYPPGWIERKTVVADDVMPSGHEVRAGDAVLISIYSM
GRMESLWGDDCRVYRPERXLYDSRGGRKMRHVPHKFLSFNSGPRMCPGKNIAVMQMKIIAAPVVWNFDLEVVEGQAV
GPKLSCLLQMKNGVMVKVKKRAV*

>AP000391 $F 53% to 704A2 CDS complement(join(110051..110242,110615..110815,
             111352..111690,111998..112282,112493..112687))
             /note="ESTs AU056036(S20239),C72753(E2173),
             AU056035(S20239) correspond to a region of the predicted gene.
             53% TO 704A2 GENBANK ENTRY MISSING N-TERM SHOWN BELOW
AQ395857.2 nbxb0067E02f CUGI Rice BAC genomic cloneLength = 417 C-helix
AU056035 AQ395866 nbxb0066G06f C26743 45% to 704A1  57-133 C-helix REGION
AU057206 AU057207 AU056036 58% identical to D47545 contig similar to AU033151
C72753 46% to 704A2 9/97 I-HELIX TO K-HELIX 
MVTQLTYLAASAGVCLASALAIALLSIALYIIGVVASFAVLCAKEFAERAHDRP
113301 PLVGTVFRQLKNFDRMFDEHVNYATAHRTSRIVYPGHCEVFTSDPAVVEHVLK 113143
NSFSKYSKGDFLTTA
MKDLFGDGIFATDGDMWRHQRKLASYEFSTKVLRDFSSDTFRRN
AAKLAEKISCAAANRISINIQDLLMRATMDSIFKVGFGFELNTLSGSDESGIQFSKAF
DEANSLVYYRFVDIMWKLKRYLNIGSEAKLKRNIQIIDSFVMKLIHQKREQMKIAADY
KTKEDILSRFVLASEQDPGTMDDRYLRDIVLNFLIAGKDTTGNTLTWFFYLLCKNPIV
QDKVALEIREFVEWSKEDNTIESFTKRLDEGAISKMHYLQATISETLRLYPAVPVDAK
MADEDDVLPNGYRVVKGDGINYMIYAMGRMTYLWGEDAQEFRPERWLVNGVYQQESPF
KFVSFNAGPRICLGKEFAHRQMKIMAATLIHFFKFRLEDESKEPIYKTMFTLHIDNGL
HLLANPREISP

>AL606640.1 $F chromosome 4 clone OSJNBa0019K04, 54% to 704A2 and AP000391
BE039709.1 OC02G02 OC cDNA 5' Length = 955 50% to 704A1
BE228790.1 98AS3140 Immature Seed cDNA clone 98AS3140.Length = 517 35% to 704A2
77409 MESPLSHPAMVALSLLLLVALYLARRAVLGKKRRYPPVAGTMF 77537
77538 HQLLNFGRLLEYHTELSRKYRTFRMLTPTCNYVYTVEPANVEHILKTNFANYGKGPMTHD 77717
77718 VLEDLLGDGIFNVDGGMWRQQRKVASLEFSTRVLRDYSSAVFRDTAAELAGILERGPAAK 77897
77898 GRERVDMQDLLMRATLDSFFRVGFGVNLGVLSGSSKEGLVFARAFDDASEQVLFRFFDLL 78077
78078 WKVKRFLNISSEATMKQSIRTINDFVYSIIDRKIEQMSREQHEF (0)
      AKKEDILSRFLLEREKDPGCFDNKYIRDIILNFVIA 78437
78438 GRDTTAGTLSWFLYAVCKNQRVQDKIAREVRDATTGDRDVGVQDFSSFLTEDAINKMQY 78614
78615 LHAALTETLRLYPGVPL 78665 (0)
78800 DVKYCFSDDTLPDGHAVKKGDMVNYQPYPMGRMKFLWGDNAEEFKPERWLDDSGMFVAES 78979
78980 PFKFTAFQ 79000 (0)
79119 AGPRICLGKEFAYRQMKIVSAVLLYFFRFEMWDDDATVGYRPMLTLKMDGPFYLRALAR* 79298

>AC074232.2a $F chromosome 10 clone OSJNBb0005J14, 5 unordered pieces gene 1 51% to 704A2
1-14932: contig of 14932 bp in length
D48478 AU056906 52% IDENTICAL TO 704A1 C-helix to I helix AU056907 C-term
D41685, D41400, D41416, D41661 64% to 704A2 355-451 region
BE229539.1 98MJ0246 Rice cDNA clone 98MJ0246.Length = 489
BE229554.1 98MJ0272 Rice Immature Seed cDNA clone 98MJ0272.Length = 349
AC078894.1 chr 10 clone OSJNBa0096G08 
2688 MGEDGGVNSSSNSPAAAVGLVLVVAICTYLAVVATRKQKRRRRRRP
2822 PVVGTAFHQLYHVRRVHDYHTALSREHMTFRLLVPAGREQIYTCDPAVVEHI 2977
2978 LRTNFANYGK 3007
4645 GSFNHGNMSDLFGDGIFAVDGDKWKQQRKIASYDFTTRALR 4767
4943 LQGFLMRATMDSIFTIAFGQDLNTLDGSGEGRRFAAAFDDASEFTMLRYLNPFWKLSR 5116
5117 LLNVGAEAMLKERIKVVDGFVYKLIRDRSDELSNTKAHDT 5242
5337 DILTRFIQATTSDSGTVDYKYLRDIILNIVIAGKDTTAGSLAWFLYMMCKHPEVQEKICH 5516
5517 EAMEATNAGEAASIDEFSQSLTDEALNKMHYLHAALTETLRLYPAVPL 5672
5746 DNKQCFSDDVLPNGFNVSKGDIVFYIPYAMGRMESLWGKDAESFRPERWL 5925
5926 DENGVFQQESPFKFTAFQA 6096
6097 GPRICLGKDFAYRQMKIFAAVLLRFFVLKLRDEKEIISYRTMITLSVDQGLHLTAMAR* 6273

>AC074232.2b $F chromosome 10 clone OSJNBb0005J14, 5 unordered pieces gene 2
14983-40291: contig of 25309 bp in length 47% to 704A1
AQ870554.1 nbeb0039H06r CUGI Rice BAC genomicLength = 746
AZ047399.1 nbeb0093H22r CUGI Rice BAC genomicLength = 400
AZ047398.1 nbeb0093H20r CUGI Rice BAC genomicLength = 400
AQ860889.1 nbeb0015N10f CUGI Rice BAC genomicLength = 494
28049 MGMDGDSSYSPALAAVAGAVALVAFCSYYLAVTRATGDGEARRRRRRHP
28196 PVVGTVFHQLYHVRRLHDYYTALCREHTTFRLLATPGRRNIYTCDPAVVEHI 28351
28352 LRTNFPSYGK 28381
29955 GPLNSEILNDLFGEGIFAVDGEKWKTQRKIASYDFTTRALRDFSSDVFKRNAAKLAGVV 30131
30132 SNHAASNSKCSFLQGLLTRATMDSIFTIAF 30311
30312 GQDLNTLDGSGEGRHFAKAFDDAGEYLLLRYLNPFWKLARLLNVGAEATLKERIKVVD 30485
30486 EFVYKLIRARSDELS 30530
30653 DDLLSRFIQATTSDSGTVDYKYLRDIVLNIVIAAKDSTSGSLAWFLYMACKRPEVQEKIF 30832
30833 DEVMETTNAGDCASIDEFLTSLTDQALNKMHYLHAALTETLRLYPSVPLVI 30985
31122 NGFSVSKGDGVFYMPYAMGRMEFLWGKDAEAFRPERWLDEHGVFQQESPFKFTAF 31286 
31388 AGPRICIGKDFAYRQMKIFAAVLIRSFVFKLRDKKDNVSYRTAITLAIDQDLHLTATAR* 31567

>AC074232.2c $F chromosome 10 clone OSJNBb0005J14, 5 unordered pieces gene 3
82798-97479: contig of 14682 bp in length. 51% to 704A2
AC078894.1 chr 10 clone OSJNBa0096G08 95774-96370
AU086052.1 Rice shoot Oryza sativa cDNA clone S4234.Length = 359
AZ127799.1 OSJNBb0092H20r CUGI Rice BAC genomicLength = 697
AZ135483.1 OSJNBb0116I23r CUGI Rice BAC genomic Length = 794
AU033151 60% to 704A2
AQ859370.1 nbeb0012F15r CUGI Rice BAC genomicLength = 422
AZ125054.1 OSJNBb0079L07f CUGI Rice BAC genomicLength = 480
85065 MGAERGGDSCYSPAAVMATAGALALVAICSYLAVTSNKQKRRRRP
85198 PVVGTVFHQLYNVRRIHDYHTALSREHTTFRMLVPAGGDQIYTCDPAVVEHI 85353
85354 LKTNFANYGK 85383
87710 GPFNHGNAKDLFGDGIFAIDGEKWKQQRKIASYDFSTRALR 87832
      DFSCAVFKRNAAKLAGIVSNHAA
      TYGLCRTFVQGLMLRATMDSIFTIAFGTDLNTLDGSGEGSRFAAAFDDASEFTM
      LRYISPLWKLARLLNVGVEAMLKERIKVVDEFVYRLIRARSDELS
88423 DILSRFLQATTSDSVVDYKYLRDIILNIVIAGKDTTAGALAWFLYMVCKHPEVQ 88584
88585 EKICHEAMVATSAGDTASVDEFLQSLTDQALNNMHYLHAALTETLRLYPSVPM 88764
88765 ENKQCFSDDVLPNGFNVSKGDIVFFIPYA 88944
88945 MGRMESLWGKDAEYFRPERWLDENGVFQQESPFKFTAFQ 
      AGPRICLGKEFAYRQMKIFAAVLLRFFVLKLRDEKEIVSYRTTLTLAI 89298
      DQGLHLTATAR* 89334

>AC074282.1 $P chromosome 10 clone OSJNBa0049K09 93 unordered pieces 48% TO 704A2
PROBABLE PSEUDOGENE 
183681 VRCFIFIKQAQRRHLVPPEEGHRQRHAFAMDIIRARRQSASVQDRDDVLSRFVASDKHNE 183860
183861 *VLRDIVLIFLIAGCETRSSGLSWFFWLLSSQPDVMARIADVVHVVRKATGACPSEL 184031
184032 FGFDALREMHYLHAVLMESMRLYPP 184106 FRAME SHIFT
184111 QACAADDTLPDGTLVHAGWSVTYNAYAMGRLAAIWGEDCLEYKPER*LGEDGVFQPASPRK 184293

CYP97 clan contains only one family 97 (4 sequences)

>AP004028.1 $F chromosome 2 clone OJ1136_C12, same as AP004048 131962-137032 97A 71% to 97A3
sequence same as AG020939.1 strain H0219 PCR from rice genomic clone H0219_0_62_1A
AG025179.1 strain NC2610 PCR from rice genomic clone T5719T 73% to 97D1
AG023069.1 strain NC2690 PCR from rice genomic clone NC2690_0_402_1A 78% to 97D1
AG022824.1 strain NC2584 PCR from rice genomic clone T5578T 82% to 97D1
AG020732.1 strain H0006 PCR from rice genomic DNA clone H0006_2_202_1A 85% to 97D1
91296 MAATSSAAAAAPPPCRLLGSGQAHLRLPPSAAAAAASARRRLLLRCAA
      SGGNGKGGGGDGSGSDPVLEERRRRRQAELAARIASGEFTAQGPA (2) 91018 exon 1
90920 WIAPLAVGLAKLGPPGELAAALLTKVAGGGGPEIPQAVGSMSAVTGQAFFIPLYDLFLTYGGIFRLNFGPK (0) 
90708 exon 2
      SFLIVSDPAIAKHILRDNSKAYSK 90537 (0) exon 3
      GILAEILEFVMGTGLIPADGEIWRVRRRAIVPAMHQK 90339 (0) exon 4
      YVTAMISLFGYASDRLCQKLDKAATDGED 90171
90170 VEMESLFSRLTLDVIGKAVFNYDFDSLSYDNGIVE (0) exon 5
      AVYVTLREAEMRSTSPIPTWEIPIWKDISPRQKKVNEALALINKTLDELIDICK 89802 (0) exon 6
89715 RLVEEEDLQFHEEYMNEQDPSILHFLLASGDD 89620 (0) exon 7
89554 VSSKQLRDDLMTMLIAGHETSAAVLTWTFYLLSK 89438 (0) exon 8 
      YPNVMAKLQDE (0) exon 9
89160 ADTVLGDRLPTIEDVKKLKYTTRVINE (0) exon 10
88998 SLRLYPQPPVLIRRSIEEDMLGGYPIGR 88915 (2) exon 11
88254 GEDIFISVWNLHHCPKHWDGADVFNPERWPLDGPNPNETNQNFS 88117 (2) exon 12
86884 YLPFGGGPRKCVGDMFATFE 86819 (0) exon 13
86584 TVVATAMLVRRFDFQMAPGAPP 86519 (0) exon 14
86441 VEMTTGATIHTTEGLKMTVTRRTKPPVIPNLEMKVISDSPENMSTTTS 86298
86297 MPVSAASIASGEDQQGQVSATRI* 86226 exon 15

>AU173808.1 (partial) cDNA clone R3811. Length = 467 233-380 new seq 48% to AC025783.5
end of gene is on AU173809
AU173809.1 clone R3811.Length = 444 66% to AC025783.5 end of gene AU173808 cannot extend
ortholog of aaaa01006047.1
LIKNAKETRQEADVEKLQQRDYSSLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHET 185
186 TAAVLTWSVFLLAQNPSKMRKAQAEVDSVLSNETINVDQLKKLEYIRLIIVEALRLYP 359
360 QPPLLIRRALRPDKLPGGYNGAKEGYEIPAGTDIFL
gap of 52 aa
QFALLESTVALALLLQKFDVELRGSPDEVEMVTGATIHTKSGLWCRVRRRT*

>AC025783.5a $F chromosome 10 clone OSJNBa0001O14 1 ordered pieces Length = 180630 
CYP97C
77% identical to 97C1. N-terminals on these sequences vary up to SWVSP
111679 MAAAAAAAVPCVPFLCPPPPPLVSPRLRRGHVRLRLRPPRSSGGGGGGGAGGDEPPITT
111856 SWVSPDWLTALSRSVATRLGGGDDSGIPVASAKLDDVRDLLGGALFLPLFKWFREEGPVY 112035
112036 RLAAGPRDLVVVSDPAVARHVLRGYGSRYEKGLVAEVSEFLFGSGFAIAEGALWT phase 0 intron
113153 VRRRSVVPSLHKRFLSVMVDRVFCKCAERLVEKLETSALSGKPVNMEARFSQMTLDVIGL 113332
113333 SLFNYNFDSLTSDSPVIDAVYTALKEAELRSTDLLPYWK phase 0 intron
113582 IDLLCKIVPRQIKAEKAVNIIRNTVEDLITKCKKIVD 113692
113693 AENEQIEGEEYVNEADPSILRFLLASREE phase 0 intron
114115 VTSVQLRDDLLSMLVAGHETTGSVLTWTIYLLSK 114216 phase 0 intron
114327 DPAALRRAQAEVDRVLQGRLPRYEDLKELKYLMRCINESMRLYPHPP 114467 phase 0 intron
114538 VLIRRAIVDDVLPGNYKIKAGQDIMISVYNIHRSPE 114645 phase 0 intron
114955 VWDRADDFIPERFDLEGPVPNETNTEYR 115038 phase 2 intron
115197 FIPFSGGPRKCVGDQFALLEAIVALAVVLQKMDIELVPDQKINMTTGATIHTTNVSVPLSMLMTASRWH* 
115406

there may be a frame shift in the region KLETSALSGKPVNMEARFS in exon 2
compared to 97C1 from Arabidopsis.

>AC025783.5b $P chromosome 10 clone OSJNBa0001O14 1 ordered pieces Length = 180630 
Extra 97C first exon (pseudogene)
       DLLGGAFSLLPLLEWGRAVSSSSATTPFPG 124558
124557 TCYARKGLVAEVSEFLFGSGFSTA*DALW 124471

>AP003622 $P chromosome 6 clone P0633E08 related to AC025783.5 exon 1
105535 DLLGGALSLLPLLEWFRLVVGPRGFVVVCDHAFSRHVLCE 105654 frameshift and 6 aa deletion
105656 KGLVAEVSKFLFG 105694

CYP710 clan contains only one family 710 (9 sequences)

>AP002092b $F CDS complement(60811..62349) = AP002093 comp(104342-105880) 
60% to 710A1
MVESFHGLVVVDLRTAAPLLATAVALYILIEQLSYHRKKGSMPG
PPLVVVPFLGSVTHLFRDPVGFWDLQATRASKSGAGLTADFLFGRLMVFIRDSELSRR
VFANVRADAFHLVGHPFGKKLFGDHNLIYMVGKEHKDLRRRIAPNFTPRALSTYAVIQ
QRVILSHLRRWIDRSVANGGKAEPIRVPCRDMNLETSQTVFVGPYLTVETRERFDRDY
NLFNHGFITLPIDLPGSAFRRARLAVPRLKHILEDCARQSKQRMRGGGEPECLVDYLM
QETVREIDEAAAAGLPPPPHTSDMETGNLLFDFLFAAQDASTSSLCWAVSALDSHPDV
LARVRAEVAALWSPESGEPITAEMMTEMKYTQAVAREVVRYWPPGPVVPHIAGEAFQL
TEQYTIPKGTIVFPSVYESSFQGFPDAGTFDPERFFSEARREDVVYKRNFLAFGAGPH
QCVGQRYALNHLVIFMALLASLIDFRRERTEGCDVPVYMPTIVPRDGCVVHLKQRCAK
LPSF

>AP002092a $F CDS complement(80797..82311) 61% to 71A10
note="ESTs AU086027(S2303),D40339(S2251) correspond to a
region of the predicted gene.
D40339 D40368 N-term 52% to 710A3 or 710A4 = AP002092 80797
THIS GENE = AP002093 CDS complement(124328..125842)
MVDSLLYGLLDLRMAAPLLAAAVALYVLVEQLSYHRKKGSLPGP
PLVVPFIGSATHMIRDPTGFWEMQAARARKSGVGFTADFLAGKFTIFIRDSELSNRVF
ANVRPDAFFVIGHPFGKKLFGDHNLIYLFGDDHKDLRRRMATNFTPRALSTYAAIQQR
GIVSHLRRWLDRSAANGGKAEPIRVPCRDMNLETSQTVFAGPYLTEEARERFKSDYNL
FNVGLLAFPVDLPGLAFRRARQAVARLVRMLRDCARESKARMRAGGEPECLVDYWMQE
TVREIDEAKAAGLPPPAHISDDEEIGGFLFDFLFAAQDASTSSLCWAVSALDSHPDVL
ARVRAEVASLWSPDSGEPITADKIAEMKYTKAVAREVVRHRPPATLMPHIALQNFQLT
ESYTIPKGTLVLPSMYESSFQGFHDPDAFDPERFFSEARREDVVYKRNFLAFGAGPHQ
CVGQRYALNHLVIFMALFVSLVDFRRERTEGCDVPVYMPTMVPRDGCVVYLKQR*

>AP003575.1 $F chromosome 6 clone P0528B02, similar to 71A24 one in frame stop codon 395 to 71A14
54816 MAAVQLDSGLLIGFLFLATCLVVAVRSYLRSGGADGGRGGAA
54690 ITSPPALPVIGNLHQLGRGRHHRALRELARRHGPLFQLRLGSVRALVVSSASMAEAVLRH 54511
54510 QDHVFCGRPQQHTARGTLYGCRDVAFSPYGERWRRLRRVAVVHLLSARRVDSFRALREEE 54331
54330 VASFVNRIRAASGGGGGVVNLTELIVGLTHAVVSRAAFGKKLGGVEPAKVRETVGEL 54160
54159 ADLLGTIAVSDMFPRLRWVDWATGLDARTKRTAAKLDEVLEMVLRDHEPSRGDDDDDDGD 53980
53979 GEARDLMDDLLSMANGGDDHGYKLDRIDVKGLLIL 53875 (0)
      DMFAAGTDTVYKSIE*TMAELIKNPAEMAKVQAEVRHVVAAAHGGEGDEDA (0) may be incorrect joint
53613 LGKMTLLRAAMKEAMRLHPPLPLLIPREAIQDTVLHGHRVAAGTRVMINAWAIGRDEAAW 53434
53433 EDAGEFRPGRFADGGDNAGVEYYGGGGDFRFMPFGAGRRGCPGMAFATRLAELVVANMAC 53254
53253 WFEWELPDGQDVQSFEPSRSSSREVCRLGSSSGKIGNHVYVVQTTRM* 53110

>AC092749.1 $F clone OSJNBb0023M11, from chromosome 10, complete sequence
35% to 76C3 53% to AC074105.1
71148 MVFLLVCMCSLLLMFLISYALQLFGDARRRLPPGPTPLPLIGNLLDIASDLPHRSLA 70978
70977 RLAGRHGPLMAVRLGTVVAVVASSSSTAREVLQTHNGSLTGRVPPDAWHGVGHAANSVFV 70798
70797 LPPRRKWRALRRIGAEHLLSARQLDGRRLLPLLRDAVLDLLRRVSEMSAASGGGAGAPVQ 70618
70617 VGHAAFAAMMDMQWRAMFSAGLEDDDARVLQDAAREAVALSLKPNLSDFYPALAAV 70450
70449 DLQGLRRRFAGRVGTVYHLVDEQIERRMRRRREAAGDDGEARSEDDLLDVLL 70294
70293 DMSEHGKDDGKVAIDRDLIRTFLT (0)
67984 DIFLATVDTIASTLEWAMAELLQDRETMRKLQEELKKVLGSKTHAEYADMDRLPYLRAVIKETLRLH 67784
67783 PVVPIVPNVAEEMVEIHGHVVPRGSTILVNLWAVHRDAEAWPEPNRFLPERFMLRQHGQ 67607
67606 EAAGRALGTATTEFGLIPFSAGRRVCLGLPLATRMLHAMLGSLLHRFEWTLPLEVEENGV 67427
67426 DMSENLGLTMTMATPLQAIAKSI* 67355

>AC092749.1 $P clone OSJNBb0023M11, from chromosome 10, complete sequence
probable pseudogene fragments aa 66-198 upstream of complete gene
64120 LMAVRLGTVVSIVASSPSTARQILQTHNGSLTGRV 64006 
64019 AWHGVGVGHAANSVFVLPPRRKWRALRRIGAEHLLSTRQLDGACMPLLR 63874
63872 SEMSAASGGAPVQVGHAAHAVFAAMVDMQWRAMFSAG 63762

>AP003434.1a $F chromosome 1, PAC clone:P0452F10, complete 41% to 71A24 = C98812
C98812 52% identical to D48413 43% to 71A13, 44% to 71B10
34853 MELSSSLAAVLHSPLFLLAAL
34916 LLLPVFTLLSFSSAKKPGDGGGWRLPLPPSPRGVPFLGHLPLLGSLPHRKLRSMAEAHGP 35095
35096 VMLLWFGRVPTVVASSAASAQEAMRARDAAFASRARVRMAERLIYGRDMVFAPYGEFWR 35272
35273 QARRVSVLHLLSPRRIASFRGVREQEVAALLDRVRRRCGVRGGGETVNLSDLLMSYANGV 35452
35453 ISRAAFGDGAYGLDGDEGGEKLRELFANFEALLGTATVGEFVPWLAWVDKLMGLDAKAA 35629
35630 RISAELDGLLERVIADHRERRRLSQPDGGDGDGDGDENVDHRDFVDVLLDVSEVEEGAG
      AGEVLLFDTVAIKAIIL (0) 
36458 DMIAAATDTTFTTLEWAMAELINHPPVMRKLQCEIRAAVGVPGASGGAEVTEDHLGELR 36634
36635 LLRAVVKETLRLHAPVPLLVPRETVEDTELLGYRVPARTRVIINVWAIGRDAAAWGDRAE 36814
36815 EFVPERWLDGGGEEVEYAAQLGQDFRFVPFGAGRRGCPGAGFAAPSIELALTNLLYHFDW 36994
36995 ELPPHADGAAAATAARLDMGELFGLSMRMKTTLNLVAKPWSSDV* 37129

>AP003434.1b $F chromosome 1, PAC clone:P0452F10, complete = AA754300
AA754300      42% IDENTICAL TO 71A14   1/98 I-HELIX 43% to 703A2
39698 MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLP
39839 LPPSPPGVPLLGHLPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRT 40018
40019 RDLAFASRPRVRMSERLFYGRDMAFAPYGEFWRQARRVTVLHLLSPRRVLSFRGVREQE 40195
40196 VAALLDRVRRRCGGGGETVNLSDLLMSYAHGVISRAAFGHGGAHGFDGDEGGEKLRKLFA 40375
40376 DFEGLLGTMTVGEFVPWLAWVDKLTGLDAKVARTSAAMDGLLERVIADHRERRRSRGQA 40552
40553 VGDGEADADHRDFVDVMLDVSEAEEGAGAGAGGVLFDTVAIKAVIL (0) 
42074 DMMAAGTDSSFTTTEWVMAELINHPRVMRKL 42169
42170 QDEIRAVVGTSSASAAAAATGGGQVTEDHLGELPFLRAVIKEMLRLHAPGPLLLPRETVE 42349
42350 DTELLGYRIPARTRVIINVWAIGRDAAAWGDSAEEFVPERWLDGGGGGGVEYAQQLGKDS 42529
42530 RFVPFGAGRRGCPGAGFAALSVELALANLLYHFDWELPPPAASGIMATTRLDMDELFGLS 42709
42710 VRLKADLNLVAKPWSPGAS* 42769

>AP003434.1c $F AU163704.1 chromosome 1, PAC clone:P0452F10, complete 44% 71A14
48011 MAVSLVVVVVV
48044 VIAIVVPLLYLVLLPAWKPARRDDGDGGMRRRLPPSPPWGLPLLGHLHLLGALPHRALRS 48223
48224 LAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRTRDLEFASRPRVAMAERLLYGGRDVAFA 48403
48404 PYGEYWRQTRRICVVHLLSARRVLSFRRVREEEAAALVARVRAAGGAVDLVEHLTAY 48574
48575 SNTVVSRAVFGDESARGLYGDVDRGRVLRKLFDDFVELLGQEPMGELLPWLGWVDALN 48748
48749 GMEVKVQRTFEALDGILEKVIDDHRRRRREVGRQMDDGGGGDHRDFVDVLLDVNETDMD 48925
48926 AGVQLGTIEIKAIIL 48970 (0)
51142 DMFAAGTDTTTTVIEWAMAELITHPDAMRNAQDEIKAVVGITSHITEDHLDRLPYLK 51312
51313 AVLKETLRLHPPLPLLVPHEPSSDTKILGYSIPACTRIVINAWTIGRDQATWGEHAEEFI 51492
51493 PERFLESGLDYIGQDFVLVPFGAGRRGCPGVGFAVQAMEMALASLLYNFDWETRVVDRRS 51672
51673 EFGTSSLDMSEMNGLSVRLKYGLPLIAISRFP* 51771

>AP003434.1d $F chromosome 1, PAC clone:P0452F10, complete like 71A
58119 MAVSLLPAVL
58149 VLLAIVAPLLYLVLLPAVKYTTRNGAARWEDDGGDGRRRRRLPPSPRGLPLLGHLHLLGA 58328
58329 LPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDMEFASRPRMAMAELLLY 58508
58509 GGRDVAFAPYGEYWRQARRICVVHLLSARRVLSFRRVREEEAAALVGRVRAAAADVVVD 58685
58686 LSDLLIAYSNTVLTRIAFGDESARGGGGGGDRGRELRKVFDDFARLLGTEPMGELLPW 58859
58860 FWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRRMDDDGGGDHRDFVDVLLD 59036
59037 VNETDKDAGIQLGTVEIKAIIM 59102 (0)
59562 DMFVGGSDTTTTMMAWTMAELINHPRAMRKAQNEIWAVVGNTSHVTKDHVDK 59717
59718 LPYLKAVFKETLRLHPPLPLLIPREPPADTQILGYTIPAHTRVVINAWAIGRDAAAWGQQ 59897
59898 PDEFSPEKFLNSTIDYKGQDFELLPFGAGRRGCPGIVFGVSAMEIALASLLYHFDWEAAA 60077
60078 TDHRRRGSQAWALPVDMSEVNGIAVHLKYGLHVVAKPRMP* 60200

>AP002092 $F CDS complement(54602..56137) 60% TO 710A1
THIS SEQ IS THE SAME AS AP002093 CDS complement(98133..99668)
MRTSTDPSGSIESFHGLVHLRTAAPLLAAAVALYMLIEQLSYHR
KKGSMPGAPLVVPFLGSAAHLIRDPVGFWDVQAALARKSGAGLAADFLFGRFTVFIRD
SELSHRVFANVRADAFHVVSHPFGKKLFGEHNLVYLVGEEHKDLRRRIAPNFTPRALS
TYAVIQQRVIISHLRRWLDRSASNGGKAEPIRVPCRDMNLETSQTVFVGPYLTEKARE
RFDRDYNLFNVGFITLPVDLPGFAFRRARLAGARLMHTLGDCARQSRQRMLGGGEPEC
LLDYLMQETVREI
DEATAAGL this may be too long vs arab. Check for intron
PPPPHTSDVEVGALLFGFLFAAQDASTSSLCWAVSAL
DSHPNVLARVRAEVAALWSPESGEPITAEMMSAMKYTQAVAREVVRYHPPATLVPHIA
VEAFQLTAQYTIPKGTMVFPSVYESSFQGFQDADAFDPDRFFSEARREDVVYKRNFLA
FGAGPHQCVGQRYALNHLVIFMALFVSLVDFRRERTEGCDVPVYMPTMVPRDGCVVYL
KQR*

>AP002093 $F CDS complement(143977..145503) 65% to 710A1
AP002092 CDS complement(100446..101972)
note="EST D40368(S2303) corresponds to a region of the
predicted gene.
THIS SEQ = AP002093 complement(143977..145503)
MAAVVDFLDLRAAAPFVVAALAFYFLVEQLSYHRKKGPLPGPPL
VVPFVGSVAHMIRDPTGFWDAQAARARKSGAGLAADFLIGRFVVFIRDSELSHRVFAN
VRPDAFHLIGHPFGKKLFGDHNLIYMFGEDHKDLRRRIAPNFTPRALSTYAAIQQRVI
LSHLRRWLDRSAANGGKAEPIRVPCRDMNLETSQTVFAGPYLTKEAREKFERDYNFFN
VGLMALPVDLPGFAFRSARLGVARLVRTLGECARASKARMRAGGEPECLVDFWMQETV
REIDEAKAAGKPPPAHTDDEELGGFLFDFLFAAQDASTSSLCWAVSALDSHPDVLAGV
RAEVASLWSPESGEPITAEKIAEMKYTQAVAREVVRHRPPATLVPHIAGEEFQLTEWY
TIPKGTIVFPSVYESSFQGFPEPDTFDPERFFSEARREDVVYKRNFLAFGAGPHQCVG
QRYALNHLVLFMALFVSVVDFRRDRTEGCDEPVYMPTIVPRDSCTVYLKQRCAKFPSF*

CYP711 clan contains only one family 711 (4 sequences)

>AP003254a $F CYP711 ortholog 60% to 711
CDS join(126724..126950,127061..127213,130152..131133, 131374..131706)
MEISTVLGAILAEYAVTLVAMAVGFLVVGYLYEPYWKVRHVPGP
VPLPLIGHLHLLAMHGPDVFSVLTRKYGPIFRFHMGRQPLVMVADAELCKEVGVKKFK
NFPNRSMPSPITNSPVHQKGLFFTSGSRWTTMRNMILSIYQPSHLATLIPSMESCIER
AAENLEGQEEINFSKLSLSFTTDVLGQAAFGTDFGLSKKLASSDDDEDTRKIAADTCA
EAKASSEFIKMHVHATTSLKMDMSGSLSIIVGQLLPFLHEPFRQVLKRLRWTADHEID
RVNLTLGRQLDRIVAERTAAMKRDPAALQQRKDFLSVMLTARESNKSSRELLTPDYIS
ALTYEHLLAGSATTAFTLTTALYLVAKHPEVEEKLLREIDGFGPRDRVPTAEDLQTKF
PYLDQ
ARQTLPRRVYIVRHEFLTAHRARMQ is this an intron?
VLKEAMRYYPSSPLIARELNQQLEIGGY
PLPKGTWVWMAPGVLGKDPKNFPEPEVFRPERFDPNGEEEKRRHPYALFPFGIGPRAC
IGQKFAIQEMKLSAIHFYRHYVFRPSPSMESPPEFVYSIVSNFKNGAKLQVIKRHI

>AP003254b $F CYP711 ortholog 82% to seq on same contig
CDS join(142251..142477,142590..142742,145800..146613,
146730..146825,146909..147244)
MEIISTVLGSTAEYAVTLVAMAVGLLLLGYLYEPYWKVRHVPGP
VPLPFIGHLHLLAMHGPDVFTVLARKYGPVFRFHMGRQPLVMVADAELCKEVGVKKFK
SIPNRSMPSAIANSLINQKGLCFTRGSRWTALRNMIISIYQPSHLASLIPTMQSCIEC
VSKNLDGQEDITFSDLALGFATDVIGQAAFGTDFGLSKISASSNDDDIDKIATDTSAE
AKASSEFIRMHVHATTSLKMDLSGSLSIIIGQLLPFLQEPFRQVLKRIPWTADHEIDH
VNLALGGQMDKIVAERAAAMERDQAAPHAQQRKDFLSVVLAARESNKSWRELLTPDYI
SALTYEHLLAGSATTAFTLSTVLYLVSKHPEVEEKLLREIDGFGPHDHAPTAEDLQTK
FPYLDQVVKESMRFYFLSPLIARETCEQVEIGGYALPKGTWVWLAPGVLAKDPKNFPE
PEVFRPERFDPNGEEEKRRHPYAFIPFGIGPRACIGQKFSIQEIKLSVIHLYRNYVFR
HSPSMESPLEFQYSIVCNFKYGVKLRVIKRHTA

>AP003378.1 $F chromosome 1 clone P0047E11, Like 711A two in frame stops
continues AP003254 contig = AQ859680.1 nbeb0013M03f BAC genomic 
AQ860765.1 nbeb0015B04f CUGI Rice BAC genomic 
82666 MDISEVLGATAEWAVTLVAMAVGLLVVAYLYEPYRKVWHVPGPVPLPLIGHLHLLAMHG 82842
82843 PDVFSVLARKHGPVFR 82890 (2)
83176 FHMGRQPLIIVADAELCKEVGVKKFKSIPNRSMPSPIANSPIHKKGLFFIR 83322 (2)
83850 GPRWTSMRNMIISIYQPSHLASLIPTMESCIQRASKNLDGQKEITFSDLSLSLAT 84014
84015 DVIGLAAFGTDFGLSKLPVTPDDSNIDKIAADTSVEAKASSEFIKMHMHATTSLKMDLS 84191
84192 GSLSILVGMLLPFLQEPFRQVLKRIPGMGDYKIDRVNRALKTHMDSIVAEREAAMEHDLA 84371
84372 ASQQRKDFLSVVLTARESNKSSRELLTPDYISALTYEHLLAGSTTTAFTLSTVLYLVAKH 84551
84552 PEVEEKLLKEIDAFGPRYCVPMADDLQTKFPYLDQVCVAYIFLCVHGFVRPIFFFLT 84722
84723 TALFTWL*MQVVKESMRFYIMSPLLARETLEQVEIGGYVLPK 84848 (0)
85342 GTWVWLAPGVLAKDPKNFPEPEIFRPERFDPNGEEERRRHPYAFIPFGIGPRVCIGQKF 85518
85519 SIQEIKLSMIHLYRHYVFRHSPSMESPLEF*FAIICDFKYGVKLQAIKRHHA* 85677

>AP003765.1 $F CYP727A1 = AQ574081 = AP003832 37701-42682
140359 MASPCEHHVPYTLLGALLSGGGPHAAACGGAAFLRDYAERGTNALL
       WAALLAVTWLLVLRLAALLRLWALGARLPGPPAFPADPGLAAGDIT (1) 140634
142650 GYLSKLHGSYGPVVRLWLGPSQLLVSVKDSRVIKELLTKAEDKLPLTGKTYNLACGKLGLFISSFEK (0) 142851
143011 VKSRRESLKSFLDEKLSVGTGGSSFKIIQIVLDRVDSIMARDFLDCRYFSQHMAFNIVGSAL
       FGDAFFDWSDASAYEELMMTVAKDACFWASYAVPPFWKPDYRRYRTLCARLKLLTQGIVAKS
       RNQNGVLSLIDLSSCQRSERMIKDPCRGFSLLDGVISSRCLNEAAEGPLSSEEEICGNIMGL
       MLHGISTCANLIGNILTRLALYPNLQCQ 143652 (0)
144033 LHSEIVSGHSESSELKIDDVLRMKFLLATVCESARLLPAGPLLQRCSLQQ 144182 (1)
144578 DVNLNSSITIPAGAILVIPLHLVQMEASTWGNDACQFNPNRFLKKEINFE 144721 (1)
144825 EILAAAHKGSNGINLFTDECDKTESFLPFGSGSRACVGQKFAVLGIAMLIASLLRSYE 144992 (0)
145233 VQPHPALSQEMESLVDSNSLHHIPNPKIILKKRSI* 145340


69 indica sequences = 32 full length indica, 7 indica pseudogenes, 29 partials (one full hybrid)

330 japonica sequences = 226 full length, 65 pseudogenes, 39 partials 

>AL772426.1 = ortholog of AAAA01002713.1 and aaaa01010047.1b 99% to these two
AF140486 mRNA 57% to C97610 57% to CYP81G1 56% to 81D2 = AU173235 clone R1815
99775 MEISQAFVFASLLLLLLLTWLLFHLLSYQAPPPNGDGGRRIPSPPALPVVGHLHLLKKPL 99596
99595 HRSLAALAARYGGGAGLLLLRFGARPVVLVSSQAAADECFTAHDAALAGRPGLASRRLLT 99416
99415 DGCPTIATAGHSARWRHLRRLATVHALCARRLAATS 99308 (?)
99308 PARDAEARAMAARLYSSSSSSSAASAVVVGVKPAAYGFVASVIMSMVAGERMAEEDVLRF 99129
99128 KAITEAGLAAAGAANRQDFLPFLRLLDFGRARRRLAGIAKERHDFGQRIVDEYRRRHRRR 98949
98948 LAVAADDFSSSPPRRTVIGDLLRQQESSPESYADEVIRTVCL 98823 (0?)
98238 SLLQAGTDTSASTIEWAMALLLNNPDVLRKATDEINSVVGMSRLLQEPDLANLPYLRCII 98059
98058 TETLRLYPLAPHLVPHEASRDCMVAGHVIARGTMVLVDVYSMQRDPRVWEDPDKFIPERF 97879
97878 KGFKVDGSGWMMPFGMGRRKCPGEGLALRTVGMALGVMIQCFQWERVGKKKVDMSEGSGL 97699
97698 TMPMAVPLMAMCLPRVEMESVLKSL* 97621

>AL772426.1 = ortholog of aaaa01010047.1dca hybrid 99% 
BE039828 (partial) similar to Jerusalem artichoke CYP81B1
95445 MAGLEVATTAVTGGDASLVVVVGVLFLMVAVVVMTRLGDGGAAPSPPAMPVLGHLHLIKKPL 95260
95259 HRSLAEVAARVGAAPVVSLRLGARRALLVSTHAAAEECFTACDAAVAGRPRLLAGDVLG 95083
95082 YGHTTVVWASHGDHWRALRRLLGVELFSNARLAALAADRRAEVASLVDAVLRDAAAGGGG 94903
94902 GGTVTLRPRLFELVLNVMLRAVTARRHAGDETRRFQEIVEETFAASGSPTVGDFFPALR 94726
94725 WVDRLRGVVATLQSLQKRRDAFVAGLVDDHRRTRRAAAAAADKDQKKNGIIDALLT 94558
94557 LQETDPDHYTDNVVKGIVLVLLTAGTDTSALTTEWAMAQLVAHPEAMTKVRAEIDANVG 94381
94380 AARLVEEADMASLPYLQCVVKETLRLRPVGPVIPAHEAMEDCKVGGYHVRRGTMILVNAW 94201
94200 AIHRDGDVWGSPEEFRPERFMDDGAGAGAVTAVTAPMLPFGLGRRRCPGEGLAVRLVGLT 94021
94020 VAALVQCFDWEIGEGGAVDMAEGGGLTMPMATPLAAVCRPREFVKTVVSDCF* 93862



>AL772426.1 P450 fragment
63156 PSQPVLGHLHLLKRPPLHRSX 63097
63092 AAAARAPLVSLRLGRAAGAARVDARRGAGERFTACDAAMAGRPPAARR 62949

>AL772426.1 = ortholog of AAAA01017554.1 $FI 99%
69722 MTGGLEVAMVAGGGNGGAAVLVGITVLLFVVVVVVVVLVRWWSG
69590 GEGGAAPSPPALPVLGHLHLLKKPLHRSLAAVAAGVGAPVVSLRLGARRALVVSTHA 69420
69419 AAEECFTACDAALAGRPRTLAGEILGYDHTIVLWTPHGDHWRALRRFLAVELLSAPRLAA 69240
69239 LAADRHAEAASLVDAILRDAAGGAKVTLRPRLFELVLNVMLRAATTRRRHASVDA 69075
69074 RKLQEIIEETFSVNGTPSVGDFFPALRWVDRLRGKVGSLKKLQARRDAMVTGLIDD 68907
68906 HRQWRSGSAGDGDQDKEKKGVIDALLALQETDPDHYTDNVVKGIIL 68769 (0?)
68310 SLLFAGTDTSALTIEWAMAQLVTHPETMKKARAEIDANVGTARLVEEADMANLPYIQCVI 68131
68130 KETLRLRTAGPVIPAHEAMEDTTVGGFRVARGTKVLVNAWAIHRDGDVWDAPEEFRPERF 67951
67950 VDSDAGGAVTAPMMPFGLGRRRCPGEGLAVRVVGVSVAALVQCFDWEVGDDDVVDMTE 67777
67776 GGGLTMPMATPLAAVCRPREFVKTILSTS* 67687

>AC125784.1 $F ortholog of AAAA01085971.1 100% AAAA01009188.1 1 aa diff
AU071096 BI796894 BI796059  50% to C99304 like 75B1
93834 MELAALCTDPVVLSAAFLCLLLHLSLRSYRPPSPGGGRRLPPGPPGLPVLGALPLVGPAP 94013
94014 HAGLASLARKYGPVMYLKMGTCGVVVASSPCAARSFLKALDARFANRPAVASAVDITYNY 94193
94194 QNMVFANYGARWKLMRKLASVHLLGARALADWAAVRRDEARRLLRGVAEASAAGRP 94361
95165 YKDMIVSLLTGAGLFNISDFVPALAWLDLQGVQAKLRRIHDQFDVLITKLLADHAATAAD 95344
95345 RARAGRTDFVDRLRAAVGVDDEDGETISEVNIKGLIF 95455
99273 DMFTAGTDTSSIIVEWAMAEMMKNPAVMARAQEEMDRVVGRGRRLEESDIASLPYLQAVC 99452
99453 KEAMRLHPSTPLSLPHFSFDECDVDGYRIPANTRLLINIYAIGRDPSAWEDPLEFRPERF 99632
99633 MPGGAAERVDPLGNYFELIPFGAGRRICAGKLAGMVFVQYFLGTLLHSFDWRLPDGEDKV 99812
99813 DMSETFGLALPKAVPLRALVTPRLAPAAYA 99902