Alphabetical list of 57 new Arabidopsis ESTs

Last modified Sept 15, 1999 AI992420 98% to 71A13 3 diffs AI992562 exact match to 97B3 AI992649 exact match to 72A15 middle region w/o *s AI992806 97% to 83A2 3 diffs AI992811 97% to 86B1 2 diffs AI992955 93% to 89A5 probably a new sequence AI993005 98% to 71B20 1 diff AI993097 96% to 710A1 2 diffs AI993108 91% to 71B18 probable new sequence AI993171 exact match to 708A2 AI993216 CYP706A1 AI993366 71B28 1 diff AI993708 71B28 2 diffs AI993723 91% to 72A13 probable new seq. AI993752 98% to 90C1 1 diff AI993784 exact match to 90A1 AI993845 exact match to 71B20 AI993868 97% to 76C2 4 diffs AI994017 exact match to 83A1 AI994062 88A3 1 diff AI994171 exact match to CYP85 AI994216 96% to 89A9 5 diffs AI994541 60% to 90C1 new P450 sequence AI995175 87A2 I diff AI995260 98% to 72A11 2 diffs AI995263 97% to 86A1 2 diffs AI995272 exact match to B25606 GSS fragment and W43241 EST similar to CYP77A3 AI995327 71A25 1 diff AI995358 exact match to 708A3 AI995549 99% to 78A9 (complete sequence confidential) AI995768 71% to 97A1 new sequence AI996074 CYP81D4 1 diff AI996081 exact match to 706A1 or 706A2 AI996368 exact match to 81F4 AI996369 94C1 I diff AI996432 exact match to 81F2 AI996557 exact match to 72A7 AI996574 80% to 94B1 new P450 sequence AI997093 56% to 81D1 new P450 sequence AI997107 exact match to 706A6 AI997182 exact match to 72A8 AI997467 very poor N-terminal sequence most like CYP93D1 AI997548 71B5 I diff AI997651 exact match to 71B6 AI997686 exact match tp 72A12P AI997903 exact match 71B3 except for stop AI998034 96A4 1 diff AI998104 exact match to 707A3 AI998324 exact match to 51A2 AI998859 exact match to 71B2 AI999004 exact match to 71A20 AI999087 exact match to 98A3 AI999275 75% to 96A3 new P450 sequence AI999358 exact match to 71A12 AI999699 exact match to 81D5 AW004264 89% to 72A15 may be new P450 sequence AW004486 exact match to 84A1

Nine Probable New P450 sequences extracted from the list above

>AI993108 701495348 91% to 71B18 probable new sequence 7 NIFLGRIDTGALTMIWAMTELARNPGVMKKVQGEIRDRLRRNKERVTEEDNNKVPYLNLV 186 187 VKETFRLHYPVPLLLPRETMAHIKVQGYDIPPKRRILVNAWAIGRDPKLWTNPREFDPG 363 364 RFIDSPVDYKGQHFELLPFGSGR 432 >AI993723 701497314 91% to 72A13 probable new seq. 405-514 338 TLPGGVQLTLPILLIQRDRDLWGNDAGESKPDRLKDGLSKATKNQVSFFPFAWX 180 175 PRTCIGQNFALLEAKMAMTLILRKFSFELSPSYVHAPYTVLTTHPQFGAQLMLHKL* 5 >AW004264 701669957 89% to 72A15 may be new P450 sequence 420-513 343 QHDIELWGNDAAEVKPDRVKDGLSKATKSQVSFFPFAWGPRICIGQNFALLEAKRARALI 164 163 LRRFSFEISPS*CHAPDTGITSHPQFGAQLIMHKR* 62 >AI997093 701552028 49% to 81B1 new P450 sequence 397-486 SDLSSLPYLRCVIYETLRLHPAAPILPPH 475 XXXXXXXLGNYEIPENTVLLVNAWAVHRDGELWEEADVFKPERFEEFVGDRDGFRFLPFG 305 304 VGRRACPAAGLAMRVVSLAVGALVQCFEWEKVEKEDIDMRPAFSVAMDRAEPLIALLKPCPEMVPILSQL* >AI992955 701495129 93% to 89A5 probably a new sequence 1 SEFLNGGTDTTATALQWIMANLVKNPDIQKRLYEEIKSVVGEEANEVEEEDAQKMPYLEAVVMEG 197 198 LRRHPPGHFVLPHSVTEDTVLGGYKVPKNGTINFMVAEIGRDPKVLEAPNAFKPERFM 371 372 EEAVDITGSTRIKMMPFGPGKRISPGIGLTILXLEYYFANMVREFXXREVQ >AI994541 701498549 60% to 90C1 3 diffs with R30379 EST exact to R30380 N-term 35-128 210 PHGSLGWPVIGETIEFVSSAYSDRPESFMDKRRLMYGRVFKSHIFGTATIVSTDAEVNRA 389 390 VLQSDSTAFVPFYPKTVTELMGKSSILLINGSLHRRVHGLVGSFLKSQLLKAQI The above sequence is related to a citrus EST dbj|C95372.1|C95372 C95372 Citrus unshiu Miyagawa-wase maturation stage Citrus unshiu cDNA clone pcMAlM1104-93, mRNA sequence Length = 484 Score = 98.3 bits (241), Expect = 4e-21 Identities = 43/57 (75%), Positives = 51/57 (89%) Frame = +1 Query: 2 PHGSLGWPVIGETIEFVSSAYSDRPESFMDKRRLMYGRVFKSHIFGTATIVSTDAEV 58 P G+LGWP++GE++EF+S A +DRPESFMDKRR MYG+VFKSHIFG TIVSTDAEV Sbjct: 298 PLGTLGWPIVGESLEFISCALTDRPESFMDKRRCMYGKVFKSHIFGXPTIVSTDAEV 468 >AI996574 701667509 80% to 94B1 new P450 sequence 86% to H76099 EST 527 FWLLTEDDDVDGRFLEEVDPLVSLGLGFEDLKEMAYTKACLCEAMRLYPPVSWDSKHAANDDV LPDGTRVKRGDKVTYFPYGMGR 348 347 METLWGTDSEEFNPNRWFDSEPGSTRPVLKPISPYKFPVFQAGPRVCVGKEMAFMQMK 174 173 YVVGSVLSRFEIVPVNKDRXVFVPLLTAHMAGGLKVKIKRRSHILNNV* 27 >AI999275 701555244 75% to 96A3 new P450 sequence 495 DVLPSGHKVDANSKIVICIYALGRMRSVWGEDALDFKPERWISDNGGLRHEPSYKFMA 322 321 FNSGPRTCLGKNLALLQMKMVALEIIRNYDFKVIEGHKVEPIPSILLRMKHGLKV 157 >AI995768 701548254 71% to 97A1 new sequence 403 GGXPRKXIGDMFXSFENVVAIAMLIRRFNFQIAPGAXPVKMTTGATIHTTEGLKLTVTKR 224 223 TKPLDIHPYRYFQWILTG* 218

44 new ESTs that match existing P450s

>AI995549 701674596 99% to 78A9 1 diff (complete sequence confidential) ITDTIIDGRRVPAGTTAMVNMWAIAHDPHVWENPLEFKPERFVAKEGEVEFSVLGSDLRLA 265 264 PFGSGRRVCPGKNLGLTTVTFWTATLLHEFEMLTPSDEKTVDLSEKLRLSCEMANPLAAK 85 84 LRPRR >AI998104 701672101 exact match to 707A3 408-430 465 FGSGIHSCPGNELXKLEISVLIH 397 >AI993752 701497405 98% to 90C1 1 diff 31-75 290 KGMIPNGSLGWPVIGETLNFIACGYSSRPVTFMDTRKSLYGKVFKTNIIG 439 >AI993784 701514527 exact match to 90A1 57-182 36 FHRRESTRYGSVFMTHLFGEPTIFSADPETNRFVLQNEGKLFECSYPASICNLLGKHSLL 215 216 LMKGSLHKRMHSLTMSFANSSIIKDHLMLDIDRLVRFNLDSWSSRVLLMEEAKKITFE 389 390 LTVKQL 407 >AW004486 701931810 exact match to 84A1 442-521 199 GFFIPKKSRVMINAFAIGRDPTSWTDPDTFRPSRFLEPGVPDFKGSNFEFIPFGSXXXPC 20 >AI997903 701671309 exact match 71B3 except for stop 409-499 422 NPELWXNPXXFXP*RFIDCPMDYXXNSFEXLPFGSGXKICPGXAFGIATVELXLLNL 252 251 LYYFDWRLAEEDKDIDMEXAGDATIVKKVXL 159 >AI993005 701494021 98% to 71B20 1 diff 311-371 66 GIDTGALTMIWAMTELVKNPKLIKKVQGEIREQLGSNKARITEEDIDKVPYLKMVVKETF 245 246 RLH 254 >AI994171 701499941 exact match to CYP85 387-475 196 VIYETSRLATIVNGVLRKTTRDLEINGYLIPKGWRIYVYTREINYDANLYEDPLIFNPWR 375 376 WMKKSLESQNSCFVFGGGTRLCPGK 450 >AI996432 701665727 exact match to 81F2 401-466 411 EPEKFMPERFEDQEASKKLMVFGNGRRTCPGATLGQRMVLLALGSLXQCFDWEKV 247 246 NGEDVD 229 >AI993097 701495327 96% to 710A1 2 diffs 194-334 1 GPYLDKEAKNRFRTDYNLFNLGSMALPIDLPGFAFGEARRAVKRLRETLGICAGKSKARI 180 181 AAGE*PACLIDFWMQAIVAEYPQPPHSGDEEIGGLLFDFLFAAQDASTSSLLWAVTLHEL 360 361 EPEVLDRVRXX 387 393 ISKIWSTESN 422 >AI998324 701545267 exact match to 51A2 442-473 230 ELVSPFPEIDWNAMVVGVKGNVMVRYKRRQLS 135 >AI995358 701673389 exact match to 708A3 360-456 501 YTIPAGWIVAVAPSAVHFDPAIYENPFEFNPWRWEGKEMIWGSKTFMAFGYGVRLCV 331 330 GAEFSRLQMAIFLHHLVAYYDFSMVQDSEIIRSPFHQYTKDLLI 199 >AI994062 701498421 88A3 1 diff 321 1 KMEFLSKVVDETLRVITFSLTAFREAKTDVEMNGYLIPKGWKVLTWFRDVHIDPE 165 166 VFPDPRKFDPARWDNGFVPKAGAFLPFGAGSHLCPGNDLAKLEISIFLHHFLLKYQ 333 334 VKRSNP 351 >AI993171 701495585 exact match 708A2 321-443 92 QMTFTNMVINETLRMANMAPIMYRKAVNDVEIKGYTIPAGWIVAVIPPAVHFNDA 256 257 IYENPLEFNPWRWEGKELRSGSKTFMVFGGGVRQCVGAEFARLQISIFIHHLVTTYD 427 428 FSL 436 >AI992562 701558271 exact match to 97B3 406-513 516 VPKGTDIFISVYNLHRSPYFWDNPHDFEPERFLRTKESNGIEGWAGFDPSRSPGALYPN 340 339 EIIADFAFLPFGGGPRKCIGDQFALMESTVALAMLFQKFDVELRGTPESVELVSGA 172 171 TIHAKNGMWCKLKR 130 >AI996557 701667434 exact match to 72A7 450-514 275 VSFFPFGWGPRICIGQNFAMLEAKMAMALILQKFSFELSPSYVHAPQTVMTTRPQFGAHL 96 95 ILHKL 81 >AI992649 701558580 exact match to 72A15 middle region w/o *s 13 MVPAFHQSCREVVGEWDQLVSDKGSSCEVDVWPGLVSMTADVISRTAFGSSYKEGQRIF 189 190 ELQAELAQLIIQAFRKAFIPGYR**KTKLVF*YCNLLK*NGALRIFEPIFCVVISQQKV 366 367 TEE*KQQPEKSKLY*EGS*TKG*GPEKLVKPKRRFAGNLLESNLRQTEGNGMSPEDLMEE 546 547 CKLSQSVG 570 >AI997686 701670043 exact match tp 72A12P 397 HISLPIMLVQRDTMMWEFKPERFKDGLSKATKSHVSFFPFAWGPKICIGQNFAML 233 232 EAKMAMALILQRFSFELSPSYVHAPQIVVTIHPQFGAHLILRKL 101 >AI995263 701503279 97% to 86A1 2 diffs 1 NSPSDDLLSRFLKKRDVNGNVLPTDVLQRIALNFVLAGRDTSSVALSWFFWLVMNNRDVD 180 181 TKIVNELSMVL*ETRGNDQEKWTEEPLEFDEADRLVYLKAALAETLRLYPSVPQDFKYVV 360 361 ENX 366 371 VLPDGTFVPRGSTVT 415 >AI995175 701502466 87A2 I diff 38 LTWEEYKSMTYTFQFINETARLANIVPAIFRKALRDIKFKDYTIPAGWAVMVCPPAVHL 214 215 NPEMYKDPLVFNPSRWEGSKVTNASKHFMAFGGGMRFCVGTDFTKLQMAAFLHS 376 377 LVTKYRWEEIKGGNITRTPGLQFPNGYHVKLHKK 478 >AI995260 701503269 98% to 72A11 2 diffs 5 KAREIQVILRGIVNKRLRAREAGEAPNDDLLGILLESNLGQTKGNGMSTEDLMEECKLFY 184 185 FVGQETTSVLLVWTMVPLSQHQDWQARAREEVKQVFGDKEDAEGLNQL 331 332 KVMTMILYEVLRLYPPIPQ 388 >AI997182 701552523 exact match to 72A8 499 LNEVLRLYPPGILLGRTVEKETKLGEDMTLPGGAQVVIPVLMVHRDPELWGEDVHEFNPE 320 319 RFADGISKATKNQVSFLPFGWGPRFCPGQNFALMEAKMALVLILQRFSFELSPSYTH 149 148 APHTVLTLHPQFG 110 >AI992811 701493659 97% to 86B1 2 diffs 2 VFPDGTMLKKGDKVIYAIYAMGRMEAIWGKDCLEFRPERWLRDGRFMSESVYKLTAF 172 173 NGGPRLCLGKDFAYYQMKS 229 >AI996369 701665128 94C1 I diff 574 VWSDGTFVNSGTRVTYHAYAMGRMDXIWGPDYEEFKPERWLDNEGKFRPENPVKYPVF 401 400 QAGARVCIGKEMAIMEMKSIAVAIIRRFETRVASPETTETLRFAPGLTATVNGGLPVMIQ 221 220 ER 215 >AI998034 701672680 96A4 1 diff 470 DVLPSXHKVDXNWRVVIPIYSLGRMKXVWGDDAEDFRPERWISDSGMLRQESXYKFLA 297 296 FNAGPRTCLGKRLTFLQMKTVAGEXIRNYDIKVVEGHKPKPXPSVLLRMQHGXK 135 >AI992420 701557676 98% to 71A13 3 diffs 6 MTELIRSPKSMKKLQDEIRSTIRPHGSYIKEKEVENMKYLKAVIKEVLRLHPSLPMILPR 185 186 LLSEDVKVKGYNIAAGTEVIINAWAIQRDTAIWGPDAEEFKPERHLDSALDYHGENLNYI 365 366 PFGSGRRICPGMNLALGLAEVTVANLVGRFDWRVEAGPNGDQ 491 >AI993845 701515154 exact match to 71B20 19 ISFLCFCLITLASLIFFAKKIKHLKWNLPPSPPKFPVIGNLHQIGELPHRSLQHLAER 192 193 YGPVMLLHFGFVPVTVVSSREAAEEVLRTHDLDCCSRPKLVGTRLLSRNFKDVCFTPYGN 372 373 EWKARRKFALRELFCLKKVQSFRHIREEECNFLVKQLSESAVNRSPVDL 519 >AI993868 701515245 97% to 76C2 4 diffs 38 ERDFVDVLLDLTEGDEAELNINDIVHLLLDLFGAGTDTNSSTVEWAMAELLRNP 199 200 ETMVKAQAEIDCVIGQKGVVEESDISTLPYLQAVVKETFRLHPAAPLLVPRKAESDVEV 376 377 LGFMVPKDTQVFVNVWAIGKDPNVWENSSRFKPERFLGEDIDLRGRDYELTPFGA 541 >AI999358 701555558 exact match to 71A12 267 IPFGSGRRICPGINLALGLVEVTVANLVGRFDWRAEAGPNGDQPDLTEAFGLDVCRKFPL 88 87 IAFPSSVI 64 >AI995327 701673108 71A25 1 diff 475 YHIPAGTQVMMNAWAIGREVATWGPDAEEFKPERHLDTSVDFRGQNFELLPFGAGRRICP 296 295 AVSFAVVLNEVVLANLVHGFDWKLPEESXEDKTDVAESSGFSVHREFPLXAFAS 134 >AI997651 701669823 exact match to 71B6 556 YMKMVIKETWRLHAPSPILIPREAMTNFKIKGYDIYPGTRIHVNAWAIGRNPDVWKDPD 380 379 EFIPERFVDSNVETKGTSFELLPFGSGRRGCPAMYVGLSTVEYTLANLLYHFDWKAT 209 208 EEVSVEEAPGLTSHRKHPLHLVPVNVI 128 >AI997107 701552072 exact match to 706A6 569 IEESHITRLPFISAIMKETLRLXPTIPLLVPHRPSETALVGGYTIPKNTKIFINVWSIQR 390 389 DPNVWEYPTEFRPERFLDKKSCDFTGTDYSYLPFGSGRRICAGIALAERMILYTLATLL 213 212 HSFDWKIPEGHILDLKEKFGIVLKLKSPLVALP 114 >AI994017 701496312 exact match to 83A1 506 IPLLIPRACIQDTKIAGYDIPAGTTVNVNAWAVSRDEKEWGPNPDEFRPERFLEKEVDFK 327 326 GTDYEFIPFGSGRRMCPGMRLGAAMLEVPYANLLLSFNFKLPNGMKPD 183 >AI993708 701497233 71B28 2 diffs 25 IMSVFLCFLCLLPLILIFLKNLKPSKWKLPPGPKKLPIIGNLHQRRELHPRNSRNLS 195 196 EKYGPIVFLRYGFVPVVVISSKEAAEEVLKTHDLECCSRPETVGTRAISYNFKDIGFAPC 375 376 GEDWRTMRX 399 402 VSVVELFSSKKLQSFRYIREEENDLCVKKLSDLASRRSLVNLEKTLFTL 548 >AI997548 701668923 71B5 I diff 570 KKVEYLKMVIEEAFRLHPPAPLLLPRLTMSDINIQGYSIPKNTMIQINTYTIGRDPKNW 394 393 TKPDEFIPERFVDNPIEYKGQHFELLPFGAGRRVCPGMATGITIVELGLLSLLYFFDWSL 214 213 PNGMTTKDIDMEEDGAFVIAKKVSLELVPT 124 >AI996081 701549901 exact match to 706A1 or 706A2 504 GYTVPKDSKIFINVWAIHRDPKNWDEPNEFKPERFLENSLDFNGGDFKYLPFGSGRRIC 328 327 AAINMAERLVLFNIASLLHSFDWKAPQGQKFEVEEKFGLVLKLKSPLVAIP 175 >AI993366 701496129 71B28 1 diff 19 ILDDHLKPGRESSDIIDVMIDMMKKQEKEGDSFKFTTDHLKGMITDIFLAGVGTSSTTLI 198 199 WAMTELIRNPRVMKKVQDEIRTTLGDKKERITEEDLNQLHYFKLMVKEIFRLHPAAPLL 375 >AI998859 701547150 exact match to 71B2 496 VVPRXXMSHIKIQGYDXPPKTQIQLNVWTIGRDPKRWNDPEEXNPERFXNSSXXFRGQH 320 319 XDLLPFGSGRRICPGMPMAIASVELALMNLLYYFDWSMPDGTKGEDIDMEEAGNXSIVKK 140 139 IPL 131 >AI996368 701665124 exact match to 81F4 571 VSETLRLHPAAPVLVPRSTAEDIKIGGYDVPRDTMVMVNAWAIHRDPDLWTEPERFNPE 395 394 RFNGGEGEKDDVRMLIAFGSGRRICPGVGLAHKIVTLALGSLIQCFDWKKVNEKEI 227 226 DMSEGPGMAMRMMVPLRA 173 >AI992806 701493639 97% to 83A2 3 diffs 69 LLGTLFFSDLFPYFGFLDNLTGLSARLKKAFKELDTYLQELLDETLDPNRPKQEPESFID 248 249 LLMQIYKDQPFSIKFTHENVKAMILDIVVPGTDTVAAVVVWVMTYLIKYPEAMKKAQDEV 428 429 RSVIGDKGYVSEEDIPYLPYL*AVIKGV 512 >AI995272 701503329 exact match to B25606 GSS FRAGMENT AND W43241 EST VERY SIMILAR TO CYP77A3 2 IEWGIAQLIVNPEIQSRLYDEIKSTVGDREVEEKDVDKMVFLQAVVKEILRKHPPTYF 175 176 TLTHSVTEPTTVAGYDVPVGINVEFYLPGINEDPKLWSDPKKFNPDRFIXXXXXXXXXXXXX 322 361 VKMMPFGIGRRICPG 405 >AI999699 701557091 exact match to 81D5 535 VEESDIVNLHYLQNIVSETLRLYPAVPLLLPHFSSDECKVAGYDMPRRTLLLTNVWAMHR 356 355 DPGLWEEPERFKPERFEKEGEARKLMPFXX 272 267 GRRACPGAELGKRLVSLALGCLIQSFEWERVGAELVDMTEGEGITMPKATPLRA 106 >AI994216 701500344 96% to 89A9 5 diffs 2 ATSMQWIMAIMVKYPEIQRKVYEEMKSAFSGEEGEREEIREEDLGKLSYLKAVILECLRR 181 182 HPPGHYLSYHKVTHDTVLGGFLIPRQGTINFMVGEMGRDPKIWEDPLTFKPERFLENGE 358 359 ACDFDMTGTREIKMMPFGAGRRMCPGYALSLLHLEIYVA 475 >AI996074 701549876 CYP81D4 (1 diff) 612 VLETLRMYPAVPLLLPHLSSEDCKVGGYDIPSGTMVLTNAWAMHR 475 476 NPEVWXDPEIFKPERFEKEGEAEKLISFGMGRRACPGAGLAHRLINQALGSLVQ 315 314 CFEWERVGEDFVDMTEDKGATLPKAIPLRA 225 >AI999004 exact match to 71A20 >AI993216 CYP706A1 ALLAVVCYFWIQGKSKSKNGPPLPPGPWPLPIVGNLPFLNSDVLHTQFQALTLKHG 314 PLMKIHLGSKLAIVVSSPDMAREVLKTHDI 404 >AI997467 very poor N-terminal sequence most like CYP93D1 93D1 MVDLQYFSVIILVCLGITVLIQAITNRLRDRLPLPPSPTALPIIGHIHLLGPIAHQ-ALHK |||||| ||||| | | | AI997467 LPPSPTXLPIIGXXHXINKXXLXXAXHX 93D1 LSIRYGPLMYLFIGSIPNLIVSS | ||| | || || AI997467 XSXNYGPVLFXKFGCXXVLIXSS >AI999087 CYP98A3 420 EDXLEEDVDMKGHDXRLLPFGAGRRVCPGAQLGINXGNFXMSHLLHHFVWTPPQGTKPEEIDMSENPGL 213