ࡱ>  Ybjbjв 2к_к_?  8,$302jjj$c!$jjjjjhjjjjjjjjj ;Pjj03j$j$jj$~ jjjjjjjjjjjj3jjjj$jjjjjjjjj X r: Purple sea urchin P450s From the sea urchin genome project http://annotation.hgsc.bcm.tmc.edu/Urchin/cgi-bin/pubSearchGene.cgi search was with P450 as synonym. Found 137 hits including one reductase. Nov. 29, 2006 (in progress, expression data links still being added. These are not named yet.) GLEAN is a gene prediction program that merged four other gene prediction algorithm results from sea urchin. (see p. 14 of the supp. section of the Science paper (Science 314, 939-952 2006) A quote from http://www.systemix.org/sea-urchin/primer.html "[The] GLEAN set for example is very conservative and therefore misses many existing genes." >GLEAN3_00279  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_00279&Submit2=Array+data" expression data for GLEAN3_00279 This prediction is short (219 aa), missing N-terminal half MKESKDQNGNDVFCADHVVMATWDMFLGGYEATYKTIKWAIAFLIHHPEVQAKIQSEM TAGFGDRMPTWEDRAKLPYLEATVLETLRQSSMSSMNGPRRTTCDTQVKGYDVAKDTT VFCNLWWVHHDPKNWKDPMSFRPEHFLTEDGKVSVPRAFLAFSLGSRACIGSQLALMQ LILLLGGLLQRFNLRQAVGEPLPDLEPGSELVRTVGDYTMCLEVR >GLEAN3_00436  HYPERLINK "http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&cut=&CN=&log=&min=&max=&org=SU&expt=NASA:signal&scaffold=237&strand=C&begin=230573&end=257235" expression data for GLEAN_00436 MEINFLGFSFSLTWFLVGAIFIIYICNDLWIRSYWTRHGFKSPKSVPIFGNTLQWTKG IQHAFLDYFKKQGTVFGVYDFHRPGLCITDLDLLRNIMVKDFGSFTNRWATPLLNKDL LDALFFSKDDHWKGIRNIITPTFSAAKMKLMSPLINKCADRLLKHLEKHQKIHGNIE CRELVGAFVMDTIASCAFGLNVDCQENRHHPFVTNAKKAFAGNTFSSPLLRIDYLQLL LDAQMRDITGNNKNKADGIHGDPEDEAIDEGGLQKKSSSKIRLNESEVISQALIFFLA GYETTNTTCGYMLYLLATHPDVQDKLVNEIDDVAPEAEDVGYQSISKMPYLEQIFCET ERIYPPALMTDRVCNEPFDINGFTVPKGMRIFIPIFTIHHDPNLWPDPETYDPDRFSK ENREKHHPCAWMPFGTGPRNCVGMRFAMMEAKMVIVRILQKYQIETCPQTEIPPKQGT NGLISPPNGITLRAVKRN >GLEAN3_00645  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_00645&Submit2=Array+data" expression data for GLEAN3_00645 This gene is about 75 aa longer than usual at the N-terminal MATRLSRFLARARPSRSSGVRPQTFHDAHASSCPLAAVPSTLSPHASTTTPSTPSTTK TSTTSTTSAPSSISDLPSNNNLSLGVSNGCTFDQAPASINPSERNCTIFDDVKSFKDI PGPKGWPILGTLGTYLSGKGLERIYDHQIDFTKKYGPIWKERLGTLEFVNLAKPDLVE VMYRNDSRYPRRLDMKPWKVYRAHRDESLGVLTHEGKEWHRARRTIGRKIMPPREVAK FTGVINEIVTDMVERLRFVRDTKGEGDGVVPELQNEMYKWSMESIFKLLLETRIGCK EPIPEKSQGFINAVGQMFASGQIMFLTSPEWFRRWNLKPWRDHEAAWDYIFNFAKKSV DARLQELATLAEEGDVVEDSGILTHLLASQQLSMKEVYSNACELLLAGVDTTSNTLAW ALYELSRHPDTQDRLAKEVEGALAAKGRCIPEHDDLPNMPLLKGVIKETLRLYPVVPA NNRALDKDVTIGGYHIPKGTMIGSLQYVMGRDPEIYPDPDTFLPERWLKGSEAISSEE KASSFSSLPFGFGPRMCIGRRIAELEMHLALARISTSFKLRYHGTEEVFPAIRGILVP GSPLKISFRDR >GLEAN3_00746  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_00746&Submit2=Array+data" expression data for GLEAN3_00746 Missing the C-terminal MSRNIGFAHLATRYGGLISLRLGTKLTVVLNDVDSVKQSLVKEADVFTDRNVPIEYSL ALPLEGSLAFASGDKWRELRTFVAKAFRQFEVEKEKLESVIVRESERLCKAFQDEANK SSSDSQGFDPHHIINNAVSNIICSMCFGQSYEYSDPEFHHILTVVGNSFRSLSPSGLL LISRYFYYTPLYREVRHNLAVLQDFIRGIVQRHQETFDPDNTRDIIDMYLAEVRQLKE DGMDTYLKEENIWRGVFDIFLAASESMTSALRWIFLYLIDNPELQRQV >GLEAN3_01228  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_01228&Submit2=Array+data" expression data for GLEAN3_01228 no seq. provided, Scaffold 91975 12823-12912, 11663-11790, 11303-11390, 10483-10647 (minus strand exons) This seq derived partially from two of those sequences best hit is to a Fugu partial sequence like CYP4V DFVLDGYKVPKGYAVIYVSSIAHRDPNVFKNPNSFDPTRWAN RTEKEKLLFCYGDGPRCCVGTSLMKNLI CYP4V5-like fragment from Fugu FS:S000209 Scaffold_209 52% to 4V5 Score = 55.8 bits (133), Expect = 1e-11 Identities = 23/40 (57%), Positives = 29/40 (72%) Query: 1 DFVLDGYKVPKGYAVIYVSSIAHRDPNVFKNPNSFDPTRW 40 D + G+KVPKG+ IYVS HRDP+VF+ P+SF P RW Sbjct: 1 DSTIAGFKVPKGHGAIYVSHAVHRDPDVFQQPDSFLPERW 40 SEQUENCE 180 Ciona intestinalis Score = 212 (74.6 bits), Expect = 6.3e-20, P = 6.3e-20 Identities = 35/68 (51%), Positives = 48/68 (70%) Query: 4 LDGYKVPKGYAVIYVSSIAHRDPNVFKNPNSFDPTRWAN-RTEKEKLLFCYGDGPRCCVG 62 L+ Y +PK YA++YV+ AHRDP VF PNSF+P RW +E L+ C+G GPRCCVG Sbjct: 310 LESYDLPKDYAIMYVTHAAHRDPEVFPEPNSFNPDRWITCNAGQEDLVLCFGSGPRCCVG 369 Query: 63 TSLMKNLI 70 T L++ ++ Sbjct: 370 TDLIRRIL 377 >GLEAN3_01525  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_01525&Submit2=Array+data" expression data for GLEAN3_01525 This seq is short on both ends YSILPKDLKYKREVRAAANEIRAMGRRCILARMDALRRGDQVPRDILTYILQESNNSE GGIKDFDLEELVDEFFTFFGAGQETTFNLLSFTLLHLGRNPQVMKKLRDEIDTVLKGR NYVEYSDVSKMKYLTLVLKEALRINPPFGHIHRLLPHEMDLCGYKVPKGSVVMIPVYG MGRNEKHFKNPEKFDPERFTRDEDSPLFAYIPFSLGARACIGQTFAMEGFQLDTVIR >GLEAN3_01622  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_01622&Submit2=Array+data" expression data for GLEAN3_01622 MDLYTFLDPYAFFIMLMTVLVLYTQWFEYPISNVKPLPGPRKWSILGNLPQLIAARFD INDFHERLRKMYGAVVGYRIYHHNVVVINDMKLAREVANHPDFCGRMRDELHANLSKE PNAGILTSYGHTWKEHHDVLSGFVTALSKSKQLEDICTIEATKFIKVLTDQTDKDLII NHYSVQTLAVIFCQVIFGKVLDVNDDIVKEVVDIVMERQRESPSATRRLLPSYVNNLT GSGAKRRHHLDVQFVALVKTLVGDHTKQTEILIPCFVRRYWDERHEKLQSMEDGSMTS LTSDNLLWACIDLVTVCVDEASLLLTQSLLLLASHPEVQSKLKEDIRPEAKEQDDQEP SAYLAAVQLETQRLAPDNPWSTSMMALSDVKLGDFEILQDAVVVANWRYLRQGDDQGW DEPTQFQPGRFLGEDGSLVEHPQMISFPAGKRACPADAMSSSASSMLIRHICQRFKLS LPAQTTHPCDKNGLIQRSSLFKTKLRIVPH >GLEAN3_01792  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_01792&Submit2=Array+data" expression data for GLEAN3_01792 MVSFYVGPKPVILLNSYPVIFEAFRHPDLQDRPSSRLLQEIMGLKNSGIAFSNGEVWK EQRKFTHSVFRSLGVGKKSYEDIVSAEMTQLSSVIKEKKSSPFDPSTLFMQTVSNIIC SIVFGTQYTYDDDGFKKILHLVNLNVQLSGGGGTFLFLPIPGISKIPFGKMKLMTNNV RSLNAFIQSQIESHVKNWDPENQRDFIDQYLNKLDETKDTVSSFTKLNLTGCIADLFL AGSDTTTTTLNWCILFMMAYPQVQSRVQDELDHVVGRERIPRLDDIKDLPYTNAVILE VQRKGVVVPLGVPHVAAADTSIRGYTIPKGAVIVSNISEVLNSEELWKDSGAFKPERF LTADGELIKRDELVFFSTGKTKDQYFFFLCLIRGFIDYIYMVTFKDTILQIKFT >GLEAN3_02371  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_02371&Submit2=Array+data" expression data for GLEAN3_02731 MSHLYSICVGFVNLSSVFILMAMISLLAVGYFIKTRGSRFPAGPLGVPVLGYLPFLDC RRLHRSLMSLGRRYGNVFSLKIGAQTVVVLNSADVIRECLVGKATSFDGRPVWMLNKV NQGRGIANEQPTKKWQAHRKAWSKVTRSLTLHDLDMEGKISHDIARILSVLESSQGHP IDVSKTVHMALCNIICSLCFGKSFSYNDLDFQHLLKMADKFFCYLSSASAVNFFPILW YLPLKANKAVAEGYEGIFGFVKNLVEEKQDGVGRMQDRGFIDVCLDETKSKKNCDVVK MKSKKTSADMPGRNIDRNAETTNDNKNDSISSADSHCHQPCGNGTACKEMDNDDENDV LETLDEYSRRVICEDVTYVATDMFIGGAETTHAGVMWSIAFMVLYPDIQIRVQKELDD MVGDQHLPVWSDRHHLPYTQACLQEILRLGNVIPIAIPHVTTRDVTMSNGCHLPQGTT VLSNLYACHMDPSAWESPREFRTERFLDSEGNPKRFDHFMPFSIGRRMCLGEQLARME LFLAFTHIFIKYDVSFAPNERQPCLEGRPGITHSPYRFKICAIPRKATKMER >GLEAN3_02380  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_02380&Submit2=Array+data" expression data for GLEAN3_02380 MDVFNEQSFIMAKKMEKFADQSEPFNIFPQITYCVLDIICDTAMGKSINAQGEGDNEY VTAVISMTNLVQERMKKPWFWPDLLYDNIQSGKKHANNLRVLHDMTTKIIKQRLQEPP RALDGSEEDAVAGKRRRIAFLDLLLQMHREDPSFTLEDIREEVDTFMFEGHDTTAAAA SWTILMIGRHPEVQTRLHEELDEVFGDSDRPITADDLQKLQYLNCVFKETLRLCPSVP MIGRDLEEDCVIDGKVVPKGTLVVLGIYALHRDPEQFPDPEKFDPDRFLLENSTKRHP YAYVPFSAGPRNCIGQKFAMMEDKVILANLMRKFSVQAIQTMEETNPLGELIMRPRDG IYVKLSRRK >GLEAN3_02590  HYPERLINK "http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&cut=&CN=&log=&min=&max=&org=SU&expt=NASA:signal&scaffold=41502&strand=W&begin=-1984&end=9572" expression data for GLEAN3_02590 MVDTFNASADHLVNYLMERADGLQEVRMYDAFNNITLDIIAKVTFNMELNIIDDPDNP FPNSILTCLNAIMTSINNPWMQYGLTKKDRKTKADVKAACDFLRGFGRDCVNARLAAK ARGEELPNDMLTNILQASQDLEGNENFGIDEMLDEFVTFFIAGQETTSNLMSFTLEML GKHPHVLQKVQDELDEKLGSQMFISFQDMGKLDYLMLVLKESLRMYAPAPVVTRVTGT EVKASTGLIIPKGSQLNLSPFVMGRMSEYFEDPLVFRPERFVESKHSPYAYFPFALGQ RSCIGQQFALIEARIIMAKLLQQFKFDLVPGQNWNLSYAVTIRPDDGCRNYVSLRH >GLEAN3_02656  HYPERLINK "http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&cut=&CN=&log=&min=&max=&org=SU&expt=NASA:signal&scaffold=498&strand=W&begin=-2381&end=20027" expression data for GLEAN3_02656 MSSVVNLLLSVIEYIGIANLLTTITICLVTFWYIRMNYGASFHYRLPPGPPVIPYPFD FLMALFPSSNRFGRMIFLSKRYGDITCLQLGRRMIIMISSPSLVKELIVKQADVTSNR SAPPFFTSVMDYKGGIIFSDGEPWQEIRRFSLSALRNFGMGRKGIETRIQEEARNLCN AFAQHSSEAFDAMHLTNAAVSNIICSITFGRRLEYDDPVFMDMIQRLRNVTSGVGINP IIRELMMRFPILLKTPLLGKPKANLLGVKNFITKEVIEHIESYHSSDIRDIIDMYLTE AERLEEEKKKTNEDRTKPYLNKDDIWMCVFDLFLAGTETTSTALLWFLLVMAAKPDVQ EKVISEIDLVVGRERAPCMEDRKDMPYTDATLTEVLRFRPVAPLGVPHVTAKDVEIGE YTIPRDTEIMINIIGIHLDPRLWDEPEEFNPHRFLSEDGKTVIKPQDAFTVFGAGRRV CLGEQLARMEFYLFATTLMQRFRFRLPEGESGDFGFEHCLGTIAPKNFKILAQER >GLEAN3_02658  HYPERLINK "http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&cut=&CN=&log=&min=&max=&org=SU&expt=NASA:signal&scaffold=498&strand=W&begin=39775&end=61911" expression data for GLEAN3_02658 MESLKGYVTDWWNDMGIITSLLFGLFGSVIVFGIVRQTFGKREKLPPGPLKIPFPLNI IASLFSKEGKMDMILSAKRKYGDLVSIEIDGQPKILVGSPELVRELFVKHGDVSSSRA ASKPLKAMINYTGGIIFGESWQPLRRFSLSALRTFGMGKRSIASKITEEARIFTDGLW ESQTVEGVNPSGLLNKAISNVICSITFGKRFEYSDPDYIIQVENIKKIMAGGNIPPVL RIVAQRFPVIFRSPLSKNAKKRLQSLKDFIVQQVDEHDRTFEGDDIRDIVDSFLAECR RLQEKGEQECYIRKEDIWMSVWELFIAGTETTASALAWFFLFMAAHPDIQEQVFKEIV TTIGSERAPVYDDRKNMPYTEATLAEVLRHRPVAPFGLPHVADETIHVRDYVIPKGST ILANILGIHHDPELFPEPSKFDPNRFLSDDGQTFVKNDAYTAFGLGRRSCLGEQLAKM EFFLFATSVLQRFTIRLPAGATPDYSIGHRMMTLLPSDFYIHLDKR >GLEAN3_02660  HYPERLINK "http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&cut=&CN=&log=&min=&max=&org=SU&expt=NASA:signal&scaffold=498&strand=W&begin=104931&end=158815" expression data for GLEAN3_02660 MELNAGFRILLYALLVITSTVGVAFVVFSWLLYRFNGKYGHLPCPRPTSFFFGHLSHI NAVSKKMDMQTLTLMHQWMRDLGPVYCIRFILQPMVICSNPPTVKVGFDMDINAVENS DCPFPSAIALALQGMDATFKNPLLMVDPRSKARQYRRSVRDAVKLVRNIGQDCIEKRR EAKKKGEELPKDILTCILQAEDDVGNGEKIETADLIDEFGTFFVAGQETTSNLLAFTL LEIGRHKDVAKRLKDEVDEVLGSKSIIEHSDLTKLEYMSRVFKETMRLNPPVSGASRE LASDVQSCGYVIPKGASVLFLSYLTSRLEEYFDDPLLFNPDRFIPTDETPRHFFAYFP FAIGQRNCIGQQLALIEAKVILAKLLQSFDFRLEQSQRHIIVSQVTNKPIDGCKNYIS LRD >GLEAN3_02831  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_02831&Submit2=Array+data" expression data for GLEAN3_02831 only half of gene from I-helix on, missing PERF motif MSELMPKTAYFCIFIAGIQTTVETLYWAMALLATYPEIQEKIRAEIDDVIGRDRLPTI NDRGKLPYTEASLYEVLRYSSIAPTALPHATTRDTEFEHFMDGSGTIRDHPPSFLPFG AGRRGCLGEAVAKADLFLIFTWFLQNYTFSKVQGKESEDILKMIPQTASGRLLLSYEI MINKRD >GLEAN3_02832  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_02832&Submit2=Array+data" expression data for GLEAN3_02832 short at N-term, green = C-helix MLPSPFHVIHFQVILSSSQHVRKGFLYRLLEPWLGLGLLTSTGQKWFHRRKMLTPTFH FSILQCFMDVFNEQSTILVKKLEKFADKSETINIFPLVTNCVLDIICDTAMGRCTNAQ EDGDNEYVQAVGRMGELVIARAKNPLTWPDYLFGKLNAGKEHDKTLKILHDVTDNMIQ ERLKEPPSVTHEDEDETVARRRKRIAFLDLLLAMHREDASFTLKDIREEVDTFMFEGH DTTAAAISWAILEIGQHPDIQERLHAELDEVFGDSIRPVTSDDLSRLSYLTRIVKESL RIIPAVPMVARSLDEDIVLDGKVVPKEAMIMLHIYALHQDPQQFPDPDQFDPDRFLPE NAEKRHPYAFVPFSAGPRNCIGQKFAMMETKLTLANIFRRFSIESVQTIEGAKPAGQL ILRPVEGNILVKLSWRK >GLEAN3_02884  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_02884&Submit2=Array+data" expression data for GLEAN3_02884 missing N-term half, seems long on the end MRILIDLANKSDDSSSKEGLYEAIKNQTIEFLTSGMLSLSSAASTMLYHLAKKPSVLA KVREELNDYGLLTDYSPLTLNKINQCRYVMQVIKESLRMVPPIGATFRTALKTFELDG KQIPKGWTVFWSIRETIHLSDSFTDTKNFDPDRFSPQRQEDKVAGRFSMPVFGHGTHS CIGQNFALLTLRILLIELARTCDFETPNVDKIKFSWIPSPKPKCGMPTRFTRKILVDQ NENEVSYSSKSSTCPQGEDSRERSEEGVHQRSTTCHAA >GLEAN3_02898  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_02898&Submit2=Array+data" expression data for GLEAN3_02898 MDLYTFLDPYAFFIMLMTVLVLYTQWFEYPISNVKPLPGPRTWSILGNLPQLIAARFD INDFHERLRKMYGAVVGYRIYHHNVVVINDLKLVREVANHPDFCGRMRSKLHANLSKE PNVGILTSYGHTWKEHHDVLSGFVTALSKSKQLEDICTIEATKFIKVLTDQTDKDLII HHYSVQTLAVIFCQVIFGKVLDVNDDIVKEVVDIVMERQRESASATRRLLPSYVNNLT GSGAKRRHQLNVQFVALVKTLVGDHTKQTEILIPCFVRRYWDERHEKLQSMEDGSMTS LTSDNLLWACIDLVTVCVDEASLLLTQSLLLLASHPEVQSKLKEDICPKAREQDDQEP SAYLAAVQLETQRLAPDNPWSTSLMALSDVKLADFEILQDAVVVANWRYLRQGDDQDW DEPTQFQPGRFLGEDGSLVEHPQMISFPAGKRACPADAMSSSASSMLIRHICQRFKLS LPAQTTHPCDKNGLIQRSSLFKTKLRIVPH >GLEAN3_02899  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_02899&Submit2=Array+data" expression data for GLEAN3_02899 C-terminal half only NGTVSSYDELNTVFSLADIFAAASETTTTTLRWLMLRMVTHPDIQKKIQDEIDDA VGRDRLPTLLDKPKLPYTEATILEVMRLNFVTPLGVPHQYTGRGELSFHGYDIPE GAIVFANYWSISRHPKLWDDPEVFRPERFLNEEGKCVKPDELTPFSIGKRVCIGESL AKMELSLLFTHLLHRFTFTLPDGVPSYSMKDSLGAVNTPVSYQL >GLEAN3_03038  HYPERLINK "http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&expt=NASA%3Asignal&org=SU&CN=&cut=&log=&scaffold=14873&strand=W&begin=-18111&end=6000&min=&max=&submit=GO" expression data for GLEAN3_03038 MGYDQLFNSVTVTTYLVLIITVVLITRYWINTKRESEKVARSGGRPLPGPWGLPVIGN MLSLDQSGPHLSLMKLAKDYGNVFKIQMGSRPVLVLNGLEATKKALTKQPVVFAGRPD LFTFKVINETNVYGPSISFSNYSEQWKLHRKLAETSLRHFTAGGQIQVLEKVIESEAQ ELVRYLKASNKKSGANLKGLINLSVSNIVLWYLLSKRSSYKDKKLIELLGFADKFSKA TGNGNPVDFLPWLKFIMTKSTRKLKEWQEQFGQMLQGIIEEHHAVYEPGSERDIVDHI ITTSTKGGDEELKRLGLDGKTLLQIAYDFFGAGYETTSATLEWMLLYMASYPQVQSDV QGEIDQFVGRGRLPTLNDFDNLPLTQSCLLEIQRHASVVPFAVPHSTTKDTVLDGYFV PKDMVVFVNLYSAHFDPEVWDQPEAFNARRFLAQDGTLDKDKEKLVVAFGLGRRKCIG SDLARINIFIYFTTLLHQLNLSCPQGQKVSLESAFGLVRRPVEFNVEVVPRS >GLEAN3_03231  HYPERLINK "http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&cut=&CN=&log=&min=&max=&org=SU&expt=NASA:signal&scaffold=1629&strand=C&begin=61841&end=85073" expression data for GLEAN3_03231 MIVIASAVGTALCAVYYLKEHYDYKVRKLPPGPTSIPFIGDVLSINPKSPHLSLIELA KTYGDIFSIKLGSERVVVLNNAELIKSAYRGVDISHRPDLFCMDVLAGGKGFLTCKDE IQRQMHMKICRQAMRVVNNADMGEKILQEANNLVNHFEAQNEKPFDPQCDLHIASLNI LCNFVFGERYEHDHPEMREILDFSAEISKLLSPMHPVNAMPWLRHFPNKWFASLFKAK NQRDRILMKKYVEHVVTYKAGRVRDMLDGLLAETTEAVAENNKQAVALLTPEHIIINM WLIFFAGSDTVTNTLQWSLLYMAVFPKKQAKAQEELERIIGQSKRLSLEHKSQLPYLQ AVINEVLRYSSLTILGVPHAAARDTEINGYFIPKGTSVMANFWSVHHNEDAWDQPEEF LPERFLNDQGNLKEPSQLPHFMPFSVGKRRCLGANVAKAELFLMLGRLLQEFSFEMPQ DVEADLQGEAAVSLIPKPYKIVAKKRHIIQPETI >GLEAN3_03232  HYPERLINK "http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&expt=NASA%3Asignal&org=SU&CN=&cut=&log=&scaffold=1629&strand=C&begin=39000&end=60000&min=&max=&submit=GO" expression data for GLEAN3_03232 MDSASQLLLLHPTSVILPYGHSFENFTLNLYDMLSIKTTALQPVSAPHGSMAPILIPL MIVVSIIVYYCQEWMEFNVDKVPLGPTSLPFIGNVLQLKSKALHLSLTDMAAKFGNVF SMKLGRQRVVVLNDAQTVKTAYNGPNFTNRPSVFSIEFFVSKGFMACQKKHDFRIHTK LMKHAFQAVTYTSLGDKLVEEAMDLINQFDSYNGDAFNPREDLNLVSLNILHNIAFGK RYQKGDTELHEIFDYSSRIMKGVSPVHPVNLIPWLQRYPNPWMKDLSDARDKRDNMLL RFYEEHKETYVEGEIRDMVDALIKVEQDAEEAGDMETLRLLSPLHIITNIWTIFFAGT DTIHNALLWVLLYMAVFPEVQRKVQEEIDRVIGDRTPCLEDEQTLPYLCATFYETLRF SCLSLLGVPHAAVCDTTLNGYHIPAGTQVLANFWAINHDETTWDNPSEFRPERFLNDN GQLRNMKDFPHFMPFSTGRRACLGKNIGKSGIIVLSACLLQKLNIAVPTGEGAQQPTL EPEVHFDLVPKQYDIVVTRRDSMRS >GLEAN3_03606  HYPERLINK "http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&expt=NASA%3Asignal&org=SU&CN=&cut=&log=&scaffold=1111&strand=W&begin=46000&end=74958&min=&max=&submit=GO" expression data for GLEAN3_03606 N-term seems about 150 aa too long MEAYVTEGTTKEGDDILEEPGHQQFKPEATLLLGDISLECDRGHYLNKDLETCVPCPA GYYLDVASEICTPCNRGFFQDDQAQTDCKPCKNGTSTRNQGATHSNQCEVLCGPGYFS STGFSPCMWCEQGSYQSLHGQTRCEKCPLMSTTPDHSGTSIDDCQDKMFDLSATSVFL GVVIFLLCLWVVRRPKNLPPGPWSLPIIGYNFKPGLIHEAYMDLAKKYGPIFSLRRGS FVFVVLNDRESITQALVKSGEFFSDRFVPGHFNWAIPDPGKKATISFSNGKPWVDLRK FSLPALRSFGFGKQSLVPQINLEARYLSEEIRNLRGEPTDLAITFSKATANIICQLLF SRRYEYSDGEMSKVLKDNEEILSLIPETDLVNIFEPLIHTSNQRYKRYREANFANRDF IMSHLNSHRETFQKDNIRDFTDAFLADDISKEFELEHFWRVVFDLFAGGTDTTAVVTS WAILYLSVHADVQRKVQTELDTVVGRGRQPNTLDRPDLPYCNATITEVMRIRPILPVS VPHMTSDNVSFRGFTIPKGSIIVPNLWAVHHDPKEWNEPDRFNPDRFLSADGKQFQKN EAWMPFGVGRRDCVGSQLAKMETFLLFTNLFQQFEFKLPPNQPNPSMRGANGITMNPE PYKVCAIER >GLEAN3_03607  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_03607&Submit2=Array+data" expression data for GLEAN3_03607 missing N-term, green = C-helix MYTPCATIAFSNGKPWVDLRRFSLPALRSFGFGKQSLVPQINLEARYLSEEIRDLRGE PTDLATTFSKATANIICQLLFSQRYEYSDGEMSRILKSMDEFVSLIPETDLVNIFETL IHTPKYKPYRESIFSDRDFIMSHLNSHRETFQKDNIRDFTDAFLADDISKEFELEHFW RVLLDFFIGGTDTTAVITSWAILFLSVHSDVQRKVQAELDTVVGRGRQPNTLDRPDLP YCNATLTEVMRIRPILPVSVPHMTSDNVSFRGFTIPKGSIIVPNLWAVHHDPKEWNEP YKFNPDRFLSADGKQFQKNEAWMPFGVGRRECVGSQLAKMETFLLFTNLFQQFEFKLP PNQPNPSMRGANAFTMNPQPYKICAIER >GLEAN3_03687  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_03687&Submit2=Array+data" expression data for GLEAN3_03687 missing N-term and C-term (no heme signature) MVMTNSKHTKSHVDYDGFQAIFGSRFAGRSLVNETDHSRWSQRRALFNPAFHRQYVGF GLDLNAMEDPSCPFPVAATKVLQGLQKSLLVPWYSYIPSAHARNFRKEVKEACHLIRE IGRDCILSRIEAKARGDATPQDILTYLLDASKELQGDQNFGLEEMIDEFVTFFVAGQE TTGNHLSFTLQQICRYPEVLKKLLIEIEEVLGDKPFVDYSDLPKLEYLMLVMKESMRQ FPPVSGSTRSLAHEIECCGYTIPKGTRLRVNHFIMGKMEKYFDDPEEFRPERFQVSDE TYKSRVGVE >GLEAN3_05160  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_05160&Submit2=Array+data" expression data for GLEAN3_05160 MAAVEFITENAGTLLVFVTTFTIVLTFLRNRTKTGFNIPGGPTALPFFGNLFSLKPHD EFLDSLVGWSKEYGSIYSLYLGRKLILIVNGSEAVKELTCKTGLDFADRGESVAFLAL NPNRLGLFDTLADEEWKRQRKFAHTTLRGMGFGKVSLEPLIQTEARHVISYFKTKVGS LLDPSVAFSVTSSNIINKLVFNRRVDHGSESGDELVRHVRNLMVSDEGAVLTLFPSLW PVMRYADSKWMKLYDSWAHIESFFHQEIKIREDRIDACKGELEASDFLTAYLIHRRAD PEGYSLEALDFVLVDLFLAGTDTVSTTLTWAVLYLVSHPEIQQKIHDEIINEIGDKEV SMSHRSGLTYTQAALLEVQRLATVAPLIPRESSRDAVLCGHKLPKGTTLLINLWSLHY DQDVWKDPKQFNPDRFLNDDGTLVKQPESFLPFSAGKRMCMGDQLAKMELFLFFTMIV QQIKFSLPPGADPLPMDGLLEIARKPKPFQVQVSAWSD >GLEAN3_05253  HYPERLINK "http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&expt=NASA%3Asignal&org=SU&CN=&cut=&log=&scaffold=118&strand=C&begin=225000&end=264312&min=&max=&submit=GO" expression data for GLEAN3_05253 missing N-term, green = C-helix, missing middle of gene from C-helix to EXXR motif MGHGLVSEVDHEKWFHRRAIMNPAFRRKLQAEVDEVLGQKHTIQYEDLAKLEYMMRVL KETLRLYSPVGGTTRLTAHPINYNGITIPAKCTVTVMSYVMSRMEEYFDDPLLFNPDR FKPSEDGSLPRHLYAYFPFSMGQRSCIGQQFALIEARVILAKLLQRFEFRLEQSQSMG ILDELTAKPMGRCRNYITLRA >GLEAN3_05439  HYPERLINK "http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&expt=NASA%3Asignal&org=SU&CN=&cut=&log=&scaffold=1053&strand=C&begin=20000&end=40000&min=&max=&submit=GO" expression data for GLEAN3_05439 MGLFVVIVVYVLGLFACLLGIGLLVFVAAILRTEQKFSDLPGPKRDSFLFGNIFSIFE AKKKLGQPIGLIIYTLSQDHGPVFWIRVFARGHVFASAPEIVKELLLNSRHIKPQSFY KNFQYVFGARFMGNGLVSEINHEKWFHRRAIMNPAFRRKYLMGMMEKFNAESEKLCEH LRDKADGRNEVEMLNELNNVTLDVIANVSFSMTTGAVGDPSCPFPAAVTLGLRGMQTI IDKPWYSFDPRSESRKTRQETREAVNLIRETGRQQIEERIAARQRGDHVPNDILTVIL DCANDLVGDKDFQMENMIDEFVTIFFAGQETTSNLLAFTIQQLGRNPDVAKKVQAEVD EVLGQKPYVEYDDLAKLEYMMRVFKETLRLYPPVIGTTRVTAHPVIYKDMTIPAGVSV SVLSVVMSRMEEYFDDPLLFNPDRFKPQEDENLPRHFYAYIPFSVGQRSCIGQQFAMI EARVILGKLLQRFEFHLNQSQRLDMLDELTLKPLDKCKNYLTLRA >GLEAN3_05521  HYPERLINK "http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_05521&Submit2=Array+data" expression data for GLEAN3_05521 missing the middle of the gene, no I-helix or EXXR motif MAVLGTISGLFSMTTVYLMVAVVVIFFIYDYWTHTYFKRLGIPGPPPKPIFGNALDFG EQPIFELLDEYANKYGPVVGIYMWRRPLLLVTDLDILRDIMVKDYHRFYNKFPMPVSS GSFDNALFLLRDAQTDRECGETCSYKGYKIEKGMQIWIPAYTIQRDPTYWPDPLKFDP ERFTKENREGRNPFTWLPFGAGPRICIGMRFALMEAKMAIVRSLQVVHFEISPLTKIP PDVGLTSLLSKKSSYTLKITQRH >GLEAN3_05655  HYPERLINK "http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&expt=NASA%3Asignal&org=SU&CN=&cut=&log=&scaffold=7510&strand=C&begin=12000&end=36195&min=&max=&submit=GO" expression data for GLEAN3_05655 MENEYGSNNPIPPRGSRFPAGPLGVPVLGYLPFLDCRRLHRSLMSLGRRYGNVFSLKI GAQTVVVLNSADVIRECLVQKATSFDGRPVWMLNKVNQGRGIANEQPTKKWQAHRKAW SKVTRSLTSYDLDMEGKISHDIDRILSVLESSQGHPIDVSKTVHMALCNIICSLCFGK SFSYNDLDFQHLLKMADKFFRYLSSASAVNFFPILWYLPLKANKAVAEGYEGIFGFVK NLVEEKEDGVGRTQDRGFIDVCLDDAKSKKNCDGVKMRSEKTFADMPGGNIDRSAETT ADNKNDSTSSAASHCHRPCGNSRACKEMDDDENDILKTLDEYSRRIICEDVTYVATDM FIGGAETTHAGVMWAIVFMVLYPDIQIRVQKELDDMIGDQHLPVWSDRHHLPYTQACL QEILRLGNVIPVAIPHVTTRDVTMSNGCHLPQGTTVLSNLYSCHMDPSAWESPREFRP ERFLDSEGNPKRFDHFMPFSIGRRMCLGEQLARMELFLAFTHIFIKYDVSLAPNERQP CLEGRPGITHSPYRFKICAVPRKATKWNGKDSQTSSD >GLEAN3_05665  HYPERLINK "http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&expt=NASA%3Asignal&org=SU&CN=&cut=&log=&scaffold=82902&strand=C&begin=53000&end=79201&min=&max=&submit=GO" expression data for GLEAN3_05665 sequence starts in I-helix, missnig N-term MGFGKVALEPIIQEEVIGDEPVCLSHRGSMPYTEATLMELQRLASVVPIVPHAAFRDS ELGGYHIPKGTRMWINMWSLFRDPVDWPNPNKCDPHRFLNKDGTKVVHPDSFMPFSAG RRVCMGEQLAKMELFLFFVSLMQRFKYVFADPDTTPSFEGVVGLNRQPRPYKVKAILR NK >GLEAN3_05668 MTVFAPCRNLLEINVDIIQTMMFSVFVTTFLLGCTVSICCAWLLRRPKNLPPGPWSLP IIGYRAGSGLIHECYASLAQRYGPIFSFRRGPFLVVVLNNREIIQQALVKNADVFSDR FTPKHVNLRIPASNPNASVVWSSGKPWGDLRKFGLTALRAFGMGKKSIEPHINLEVRY LCEEIEKLRGHPTNLLTMLNNTTANVICRIVFGRRFEYDDEEFQGILRGFFENFSTIS GSDLVNVFSFLLYMPWYKDFRRNLSQSKSFIMAQLKLHTSSFEKDNIRDIIDAYLADE GISKDFTVEEFCRVVLDFFAAGTETTSTALSWAILYLCLYPDVQTKVQSELDTVVGRG RQPAISDRARLPYCEAALMESMRIRPVLPLSLPHTTSRSVSLGGFTIPQGTTIIPNIW AAHHDPEVWDDPDDFRPERFLENHDSPTVKKNDSWIPFGLVKYDIFLFFPDKLRSSRV SGGSASQDGVLSTVQRFEFKLPPDQSTHSKRGISGLTMPPEPYEVCAIER >GLEAN3_05931 MALWNLAPVTVLVLSVLFSFLGYRCLLCMIAVWTEVKRIVHIRRTLPGAPPHWLFGNV LQEPGPVTPGFEWHRNMPKSYRRLHVFWAGWKPIVVLNHPESVRNVLNGSVGTVKSEV YRLFDEWLGQPMATTDGNLWKRHRRLITNSFHFNVLKSHIPKMNQIADTLISVISQRG VAGELLELHKTTSAFSSDVILQCAFSYESGCQEGVSEYMKSVSTLAGIVMKRSLSPHL TYDFIFRRSSLYKSWRKEIDVLHRLQEKLISDRREQHRQTGCTEFSKGKDILDTLMLA KDTEGGLTDREMRDEINSFVFGGIDTTSSALTWLLYLLATHPEYQTKVQEEIDVLFQD RDPEVCSEDLHRTPFLMKCVKESQRMYSFVTPGRLLTEPLVVDGLTVPAGTELTLFTY QLHHNPDVWGDDHMTFRPNRFDRQNVEGRDPFAFIPFSAGARNCIAQQFALQEIQVAA IRIFDKFGFTLVRDSTPVLRVVSVPEDEILLGNTFYNEIYFNLLLVLLQREDLHRTPF LMKCVKESQRMYSFVSPGRLLTEPLVVDGLTVPAGTELTLFTYQLHHNPDVWGDDHMT FRPSRFDRQNVEGRDPFAFIPFSAGARNCIAQQFALQEIQVAMIRIFDKFCFTLVRDS KPVLRVVSVPEDEILLGIHPRSKK >GLEAN3_06386 MVPCVSRVIRVGFAHLATRYGGLISLRLGTKLTVVLNDVDSVKQSLVKEADVFTDRNV PIEYSLALPLEGSLAFASGDKWRELRTFVAKAFRQFEVEKEKLESVIVRESERLCKAF QDEANKSSSDIQGFDPHHIINNAVSNIICSMCFGQSYEYSDPEFHHILTVVGNSFRSL SPSGLLLVSRYFYYTPLYREVRRNLAVLQDFIRGIVQRHQETFDPDNTRDIIDMYLMS NSEVGCRVWAPKLDAGVVCRSCMPGLGAEVRYLGAEIGCRDWLPRLGAGVGCRGWVSE LGLEFGYRGLATRMAADAEIRADILRVIGPTRNPTIKDRDELPLIEATCMEVLRLRPP APMGGPRQAKRDVEVKGYVIPEGTLVLTNVWYIHQNAENWHEPERFNPSRFLSEDGKE LVKNPAFMPYGAX >GLEAN3_06574 MGFTGTEPGYWLLAGVLGLLVAWLVKHLLTRPKNLPPGPTGLPFLGVAWQFVKGGSNP LALFASWSKKYGEMVSFYVGPKPIIILNSYPVIFEAFRHPDLQDRLSSRILQEIMGLK NSGIVLSIGEVWKEQRKFTHSVFRSLGVGKKSYEDTVAAEMTQLCSVIKEKKGSPFDP STHFMQAISNVICSIVFGTQYTYDDAAFNKDVTRGGPRSRRVHVKAPCVSFPVALLGA NRILDLVNMNVQLAGGGGSFVFLPIPGISKIPFGKMKLFTDNIRTFTAFIQSQIESHE KNRDPDSQRDFIDQYLNKLDETKDTVSSFTKLNLTVCIADLFSAGSDTTTTTLKWCIL YMMAYPQVQSRVQDELDHVVGRERIPRLDDIKDLPYTNAVILESFFGCNPDFYLSKVQ RKGVVVPLGVPHVAAADTTIRGYTIPKGAIIVSNIFEVLNSEELWKDSGAFEPERFLT ADGELIKRDELIFFSTGRRVCIGEQLARMETFLGFTSLLHRFTFKKPDDSSTLSFEGV LGISRNPVPYMTCAVVRE >GLEAN3_06989 No seq. provided Scaffold 72639 63174-64724 (no introns) MADFQFLNSMSVVTYVVVIATSILITRYWINTKRKSEKVAKSGGRPLPGPWGIPVFGN MLSLDQSGPHLPLMKLAKVYGNVFKIQMGSRPVLVLNGLEAIKKALTKQPAVFAGRPD LFTFNVINETNVYGPSIGFSTYSEQWKLHRKLAEASLRHFTAGGQVQVLEKVVESEAQ ELVRYLKASYEEKNTNLKGMLSLSVSNILLWYLLSKRSSYEDKKLIELLGFADKFSKA TGSGNPIDFLPWLRFIMTKHTMKLKEFQKQLGQMLQGIIEEHHAVYEPGSERDIVDHI ITTSTEIGEEELKRLGLAGNTLLQTVHDFFGAGYETTSATLEWMLLYMASYPQVQSDV QGEIDKVIGRHRLPTLNDFDNLPLTQSCLLEIQRHASVIPFAIPHSTTKDTVLDEYFV PKDMVVFVNLYSAHFDPEVWDQPEVFNARRFLAQDGTLDKDKEKLVVAFGLGRRKCIG SDLARINLFIYFTTLLHQLNFSCPQGQKVSLDSAFALIRRPLEFNVELVPRS >GLEAN3_07306 MYVFKRKAFVDLARKYGPVFSVRRGPLLFVVLNDRESMKQALVKSGEFFSDRFIPGHL RWGIPDTEKNASIVWSNGKPWDDQRKFSLPALRSFGFGKKSLVPQINLESRYLADEIR ALHGEPTDLSALLNKATANVISQLVFGRRYEYDDPEFIGALEALADAFGLVSETDPVI VFESLVHTPWYKPYRGCIFKLRDFVMSHLHNHQETFQKDNIRDFVDAFLAHDISKDYS LDTFWRIVLDFFAAGTETTAVVTSWAILYLAVHPDVQKKVQDELDAVVGRGRQPDTSD RPNLPYCDATLMEIMRIRPVLPVSLPHLTSADVSIGPYTIPKGTIVIPNLWAVHHDPK EWCDPHLFNPDRFLSADGQTVVKNEAWMPFSIGRRDCLGMQLAKMESFLLFANLFQQF DFKLPPDQPTPSMRGHTGLSMPPESYQICAIER >GLEAN3_07335 seq begins at I-helix, missing N-term half VNTFLLGGIDTTSSALTWLLYLLATHPEYQTKIQEEVDELFQDREIREVRREDLHRTP FLMKCVKESQRMYSFVSSSRLLTEPLVVDGLTVPAGTDLAIYPYQLHHNPDVWSDDHM TFRPSRFDRQNVEGRDPFAFIPFSAGARNCIAQQFALQEIQVAAIRIYDKFGFTLVRD STPVFRIVSVPEDEILLGIHPRRET >GLEAN3_07404 MSESKAKSLPGPWGLPILGNLLNLGSDEPHVALMKLAKEYGNVYQIRLGSRPVVVLCG QEAIRSALVRQPVAFAGRPDLASFQFIRKNTDVTVAFQTNDATWKLHRKLAESSIRFF VSGQRKSSLESTVTQEVESLINYWTNNNNNDDNLQIDPVNLIKLSVSNVMLWCILGKR HSLDDSRLREFVSKSDDFAVAVGSGNIADFMPWVRFVTVRTTQNLKKLLNTFRGWFMP YIEEHRLQYEQDSEKDILDYLVTSSNKLDEADLQQLGVSREALQISTVFDLFGAGFDT TSAIITWQLYYAIRHPAVIARVQAELDDVVGRDRLPNLTDRDHLPLTQAFLLETFRHS SVVPFTIPHSTTTDTTLLGYHIPKDTVVFVNLYSVHHDPDTWHRPSDFDPERFIDNET GLVDKTKADAIMPFGAGRRRCIGAEMGRIQSFLYFAILIHQCHVSSLDKASLDNKPIY GLTMRPSPFNICVSRR >GLEAN3_07406 No sequence data provided >GLEAN3_07409 MIDLLGAGESGFGQFNALTITATLGVVTATLALWTIMRPNRFPPGPRGLPIVGSIFSL NDSPEVVFGEWAKKYGDIFGFKAGERWMVVLNRQALIKEAVLKQGVDFAGRPDFYSVN IFTEGFKDIVFSPYSETWKLHRKLAHSALRHFATGKPLQGLISSVYPKVEKKLAMTEG QPIDPKVLITLIMYNVLAQMCFGLSYELEDPNVTQWMDVNNDVNEKLGLGLAADIFSW AKYIPTSGPRMIKEITETMFGFLRSQVDEAREHYDPENINDFYSLLLKAQEDARKEGE NVDKLTDTHIFQTVADIFGAGIQTTVETLYWAMALLVTYPEIQAKIRAEIDDVIGRDR LPTINDRGNLPYTEASLYEVLRYSSIAPIAVPHATSRDTEFGGYHIPKGTVVFINTHS MHYDPQEWDQPDKFLPEHFMDGSGTVREHPPSFLPFGAGRRGCLGEAVAKADLFLIFS WFLQNYTFSKVPGKESEDILNMIPQTAFGRLLLSYEIMINKRD >GLEAN3_07558 MSVIFKLVITVATLCLLHRVYRCVRRWLRLYRRANYMNRNVPSVEPHWLWGNVFDNPG IFQAGLDWHIKAVSRCRRMLVYWGFWTPCLIVNHPDTMKDVLKSSTISRRPAGRTGYH ILGEWLGQGIATSDGPHWRRNRKIVTPFFQSNCVRKHHVPVINKTTDIVLQKLSIKAD SATPFDVSHLMFNLSGDIIYRVALSHELGCQEGGSPVLKSLNILTEIGTSRILNPIMA NDFIFRMTSQYNMWKEALEVVNSLSEKLIRERKRMHEETSCETATKSRDLLDTLILST YSDDKRLSHREIRDEGGKSHLPVLTREVNTCIFGGVDTTGSALTWFFMDMAKHPEHQT KVQEEIDDALGDHPDDYIDSFDQHTFPYLMQCIRESLRLHAMLMPARGLIEPLVVEGV TIPPKTGIMIDFYQLHHNPDIWGDDHMEYRPERFDQGNVARREALAFLPFSVGPHQCI GQRIAMQELEIIGIRVLRRFSLGLVKEAKTKFRIVGKPASEVLLTIKHRQNSSNL >GLEAN3_08152 MEWIVQSDTSLWLMTGAIALVVMWFVKGLMSRPKNLPPGPVGLPFLGVAWEMMRAKPD PLALYSSWADQYGDVVSFKVGPMTFVLLNSYEVIVEAFSHPDLNNRPKSQMVEEALGL ANNGVILANGKVWEEQRKFAHSVFRSLGVGKKSYEDTVATEMDQLRAAIEEKEGSSFD PNLLFGQAVANIVCSVAFGTQYKYSDVEFRQILNLMNDNMELSSGGGALLFLPLPGIS KIPFGAPKKMVKNMKEVNKFLLNQIESHEKCLDLDHPRDFIDLYLKKMAETKVLETSS YSKINLNSAIGDLLFAGTETTTTTLKWCILYMMAHPEMQSRVQRELDHVVGRERLPRL DDRNDLPYTNAVLIEVQRKGAVAALGVPHVAAEDTSFRGHTIPKGATILSNIWKVLNS EELWKDSDAFKPDRFLTEDGKLIKREELIFFSVGRRVCLGEQLARMETFLGFTSLLHR FTFKKPDNSPPLSFEGVLGITRNTMPYLTCALSRD >GLEAN3_08632 MLAVRWYRQRSRLPGKLPPGPWGLPLIGSIFSLGKQPHLTLMDMAKKYGNVFTLNLAG QLVVVLNGYESVREALVKKAGVFAGRPHLALTQELTEGQGIVVADYGPIWREQRKFTL TALRNFGFGKESFEAEIQEEIRYMLAAFKEKDGSSFNSDHIVEIAVSNVVCNFNFGRR FEYQDPKFQLLVDSIYRFAEIGTNAAAINFFPFLKHLPIPEIREVFEIDKVVRSFLQR MIEDHKETLDETRQRDFIDAYLLEIKRRQEAMKKEEKISEADASFTETYLFHILNDLI FAGTETMSSTIRWAILYMIVQPEVQKKVQEELDTVIAPGELPSWRDRNRMPYTEATLM EIQRMANITALTFPHKTTEDTELAGYSIPKDTHVFINLYSVHVDPTQWKEPEKFMPER FLSDEGGVIHNPALMPFSAGRRACPGEKLARMELFLFFSSMLRYFTLTPAPENPNPSI YKNYGISLNPKPYKICVIPRQ >GLEAN3_09118 seq begins at I-helix, missing N-term half MPVSKLQETHRFIHEHLRDYSRMNGSHKAGILTTVETLYWAMALLATYPEIQAKIRVE IDDVIGRDRLPTINDRGKLPYTEASLYEVLRYSSIVPNALPHATTRDTEFGGYHIPKG TTVIINTYSMHYDPQEWDQPDKFLPEHFMDGSGTVRDHPPSFLPFGAGRRGCLGEAVA KADLFLIFTWFMQNYTFSKVPGKESEEILKMIPQTASGRLLLSYEIMINKRD >GLEAN3_09512 missing N-term green = C-helix MVSFYVGPKPIIILNSYPVIFEAFRHPDLQDRLSSRLLQEIMGLKNSGIGFSNGEVWK EQRKFTHSVFRSLGVGKKSYEDTVSAEMTQLSSVIKEKKSSPFDPSTLFMQAVSNIIC SIIFGTQYTYDDEAFKEILHLVNLNVKLSGGGGSFLFLPIPGISKIPFGKMKLMTNTI RSLNAFIQSQIESHEKNWDPENQKDFIDQYLNKLDETKDTVSSFTKLNLTGCIADLFL AGSDTTATTLKWCILFMMAYPQVQSRVQDELDHVVGRERIPRLDDIKDLPYTNAVILE VQRKGVVVPLGVPHVAAADTTIRGYTIPKGAVIVSNISEVLNSEELWKDSGAFNPERF LTADGELIKRDELIFFSSGRRACPGEQLARMETFLGFTSLLQRFAFKKPENSLPLSFE GVLGVSRNPVPYMTCTVVRE >GLEAN3_09692 missing C-term, no EXXR, PERF or heme signature MGFTETEPGYWLLAGVLAGVLGTWLVKHLITRPKNLPPGPTGLPLLGVAWQIYKGGSD PLALFASWSKKYGEIVSFYVGPKPIVILNSYPVIFEAFRHPDLQDRLSSRLLQEIMGL KNSGIGFSNGEVWKEQRKFTHSVFRSLGVGKKSYEDTVSAEMIQLSSVIKEKKSSPFD PSTLFMQAVSNIICSIIFGTQYTYDDDAFKKILHLVNLNVKLSGGGGSFLFLPIPGIS KIPFGKMKLMTNTIRSLNAFIQSQIESHEKNWDPENQRDFIDQYLNKLDETKDTISSF NKLNLTGCIADLFLAGSDTTATTLKWCILFMMAYPQVQSRVQDELDHVVGRERIPRLD DIKDQRLWGLQT >GLEAN3_09825 missing N-term MSRVLERMNETLTLIPETDLVNIFEPLIHTSIQRYKRYRESIFSDRDFIMSHLKSHQE TFQKDNIRDFTDAFLADDISKEFKLEHFWRVLLDFFAGGTDTTAVVTSWAILFLSVHP DVQRKVQTELDTVVGRGRQPNTLDRPDLPYCNATLTEVMRIRPVLPVSVPHMTSDNVS FRGFTIPKGSIIIPNLYAVHHDPKEWNEPDKFNPDRFLSADGKQFQMNDAWMPFGVGR RDCVGSQLAKMEAFLLFTNLFQQFEFKLPPNQPTPSMCGANGITMKPKPYKICAIER >GLEAN3_10143 MYTPCATIAFSNGKPWVDLRRFSLPALRSFGFGKQSLVPQINLEARYLSEEIRNLRGE PTDLATTFSKATANIICQLLFSQRYEYSDGEMSRVLKSMEEFLSLIPETDLVNIFETL IHTPRYKPYRESIFSDRDFIMSHLNSHRETFQKDNIRDFTDAFLADDISKEFELEHFW RVLLDFFVGGTDTTAVVTSWAILFLSVHPDVQRKVQAELDTVVGRGRQPNTLDRPDLP YCNATLTEVMRIRPILPVSVPHMTSDNVSFRGFTIPKGSIIVPNLWAVHHDPKEWNEP YKFNPDRFLSADEKQFQKNEAWMPFGVGRRDCVGSQLAKMETFLLFTNLFQQFEFKLP PNQPNPSMRGANGITMNPEPYKVCAIER >GLEAN3_10246 sequence begins at I-helix, missing N-term MFFAATDTIHEVLLWLTLFIATRPDIQLRMRKEIDTVMNSGGSESLYDQRSRLPVVRS IILEVLRMRPVAPMGVPHVVSDTGSIDGYSIPKGSQIFSNIWGMHNDPDIWKDPSVFN PDRFLSNTGVLDEVKAKLVIPFSIGPRSCPGEQVARRIIFQTLVSLISRYRISLPAGR PTPRLVNEERSFTMSPERFKIVFSRIQADPVQLKDLEIPTTE >GLEAN3_10549 missing C-term, green = C-helix MVAIELVILVLVLWMVKSFVSKRYGKQYRLPPGPPALPIIGNLPVFMTGKAPYRVMKN LADKYGSVFSLKIGTTPFVVLNDVDSIREALHHRSVDYAGRPCLYSVSLMSRNNCDIV FGDYTPGWRLHRRATAAALHKNIRGVGCEAMGIKITREAEYFTQLLMKKAGPVVDIEN ELNLAIINIISQLTFATRYTLDDPEYQALVDSNQKFSSLLKPGDPIDVFPALKVRGNT IVIFNCFYLGQHNI >GLEAN3_10563 MFSLLSTSIILGALVFLICFWFVRRPKNLPPGPWTLPIIGYRFKTGHVHEAYVELAKK YGPVFSIRRGPFLFVVLNDRESVKQALVKSGEFFSDRSVPGPINWTIPDAKKKASIVW SNGKSWVDLRKFSLPALRSFGFGKKSLVPQINLEARYLAEEIKALHGEPMDPSAVLNK ATANVMAQLIFGRRYEYDDAEFIEILQVMVDIFAFVSDNDPVNVFEPLIHTPWYKPYR EMMIKQNKFIHAQLYFHRETFQKDNIRDFVDAFLADDISKEYTLEDFGRIVLDFFTAG TDTTAVVTSWALLFLSVHSDVQRKVQNELDAVVGRGRQPDTSDRPNLPYCDATLMEIM RLRPVVPISLPHMTSADASLGPYTIPKGTIIVPNLWAVHHDPKEWCDPHLFNPNRFLS ADGQTVVKNEAWMPFSTGRRDCLGMQLAMMETFLLFTNLFQQFEFKLPPDQPTHSMRG HPGLTMPPESYKICAIER >GLEAN3_10576 MFFLSSTSVLLGVLGFLFCLWFVRRPKNLPPGPWSLPIIGYRFDSGLIHEAFVDLARK YGPVFSVRRGPFLFVVLNDRASMKQALVKSGEFFSDRFVPGHIRWGIPDAEKNAAIVW SNGKPWDDQRKFSLPALRSFGFGKKSLVPQINLEARYLAEEIKALHGEPMDPSALLNK ATANIIVQLVFGRRYEYDDPEFIGALQAMVDIFALVSDTDPVNVFESLVHTPWYKPYR DCTFKLRDFIMSHLYNHQETFQKDNIRDFVDAFLAHDISKDYSMDTFWRIVLDFFAAG TETTAVVTSWAILYLAVYPDVQKKVQNELDAVVGRGRQPNTSDRPNLPYCDATLMEVM RIRPVLPVALPHMTSADVSIGPYTIPKGTIVIPNLWAVHHDPKEWCDPQLFNPDRFLS ADGQTVVKNEAWMPFSIGRRDCLGVQLAKMESFLLFTNLFQQFEFKLPPDQPTHSMRG HPGLSMPPESYKICAIER >GLEAN3_10719 No seq. provided Scaffold 58259 989-2431 no introns MADNQFLNGRPLPGPWGLPVIGNMLSLDQTGPHLSLMKLAKVYGNVFKIHMGNRPVLV LNGFQAIKEALTKQPAIFAGRPDLFTFKVIDETNVYGPSISFSTYSEQLKLHRKLAET CLRHFTEGGQAQFLEGVAKGEAEELVRHLKVSKCEGGLDIRRLIRLSVTNMILWYMFS KRSSYDSKLLNDILNLTDKAEKPSGSGNPVDFLPWLRFITTKYKTTLQEYRKKFGKLL SGVIEEHAVLYEPGSERDVIDHLISTGHNSGEEELQRLGLAGKALYQSAFDFFPAGTE TVLATLEWIMVYMVTYPNVQSDVQHEIDQFVGRGRLPTLNDRDNLPLTQSCLLEIQRH AAAVPFAVPHSTTQDTVLDGYYVPKDMLVFVSLYSAHFDPDVWEKPEVFNPRRFLAQD GTLDEEKTKLAIPFSLGPRRCLGSGVARIQLFIYFTTFLHQLNFKCPDGQKASFDATY KFPRRPKVFNVQIDPR >GLEAN3_10720 No seq. provided Scaffold 58259 10218-11660 no introns nearly identical to GLEAN3_10719 MADNQFLNGRPLPGPWGLPVIGNMLSLDQTGPHLSLMKLAKVYGNVFKIHMGNRPLLV LNGFQAIKEALTKQPAVFAGRPDLFTFKVIDETNVYGPSISFSTYSEQLKLHRKLAET CLRHFSEGGQAQFLEGVAKGEAEELVRHLKVSKCEGGLDIRRLIRLSVTNMILWYMFS KRSSYDSKLLNDILNLTDKAEKPSGSGNPVDFLPWLRFITTKYKTTLQEYRKKFGKLF SGVIEEHAVLYEPGSERDVIDHLISTGHNSGEEELQRLGLAGKALYQSAFDFFPAGTE TVLATLEWIMVYMVTYPNVQSDVQHEIDQFVGRGRLPTLNDRDNLPLTQSCLLEIQRH AAAVPFAVPHSTTQDTVLDGYYVPKDMLVFVSLYSAHFDPDVWEKPEVFNPRRFLAQD GTLDEEKTKLAIPFSLGPRRCLGSGVARIQLFIYFTTFLHQLNFKCPDGQKASFDATY KFPRRPKVFNVQIDPR >GLEAN3_10721 No seq. provided Scaffold 58259 20505-21947 no introns nearly identical to GLEAN3_10719 MADNQFLNGRPLPGPWGLPVIGNMLSLDQTGPHLSLMKLAKVYGNVFKIHMGNRPVLV LNGFQAIKEALTKQPAVFAGRPDLFTFKVIDETNVYGPSISFSTYSEQLKLHRKLAET CLRHFSEGGQAQFLEGVAKGEAEELVRHLKVSKCEGGLDIRRLIRLSVTNMILWYMFS KRSSYDSKLLNDILNLTDKAEKPSGSGNPVDFLPWLRFITTKYKTTLQEYRKKFGKLF SGVIDEHAVLYEPGSERDVIDHLISTGHNSGEEELQRLGLAGKALYQSAFDFFPAGTE TVLATLEWIMVYMVTYPNVQSDVQHEIDQFVGRGRLPTLNDRDNLPLTQSCLLEIQRH AAAVPFAVPHSTTQDTVLDGYYVPKDMLVFVSLYSAHFDPDVWETPEVFNPRRFLAQD GTLDEEKTKLAIPFSLGPRRCLGSGVARIQLFIYFTTFLHQLNFKCPEGQKASLDATY KFPRRPKVFNVQIDPR >GLEAN3_11009 MVFTILGFTVSWPLVGTLVAIIVGYEIWHMTYFKRKGVKGPIPFPVVGHSIDLFQGIH WKSQEYVRKYGNVFGMFMFSSPVFFISDLDMVKDIAVKSFSKFMNHKTFVLKSRPFDK ALIGLKDQHWKDVRNVITPTFSAVKMKQLSGLLNESSDTLVKNFGAIQEKEGNVKVFS LFEAFTMDGIAKCAFGIQVDSQNDPSDPFVTHAKDLMKNRFALKFVIAGIFPRIGKIM DYFEVSTLPPEVSSFFISVVDQAIAARKESGGSKMKDFLQLMLNAAQENKSEKEMIEE KPQDEKVEGDLYALIEDQDNKSYFDLKKSKKTTLTKDEIYGQAMLFFLAGYDTINTTL GFLTYSLATNPEVQDKLIEEIDNETPARDDVGYNSVAKMSYLDNVVCEILRLYPAGLI IERQCNETHVCNGLTIPKDSQIFFPVYAIHRDPAIWPDPEKFDPERFTKENREGRHPL AWMPFGAGPRNCIGMRFALMEIKMATVRLLQKYRFETCPETQIPPKFAKMSLKPDDDL YLRVVERS >GLEAN3_12080 MAVVLPAGIGVTVCIVALSAFLFYYIARFLRIFLLISKFDGPPAVPIFGNTLQFKSDP REHFLYDVGVREKYRHQSGGVIRMWFGPLPIVLIYNAVHCEALMNSSRHINKGTEYTF MEPWLGKGLLTSGGQKWFHRRKLLTPTFHFSILQSFMEVFNEQSMILVKKLGKFADES ETVNIFPLVTHCVLDIICDTAMGKRTNAQSDNENEYVKAVGRMSHLIVSRIRNFMLWP DWIYNRTDAGKDHEKALQILHGVTNKVITTKTQMTPNGGTYEDDAVTRKRKRIAFLDL LLQMHREDATFTLEAIREEVDTFMFEGHDTTAAAVNWALLLIGQHPTVQARLHDEIDQ VFGDSERPITSDDLSELSYLSCVVKESLRLLPSVPGIGRDLDEDIIVNGKVVPKGASV FLSIYGIHHDPEQFPDPERFDPDRFLPENSTKRHPFAFIPFSAGPRNCIGQKFAMMED KVLLINILRRFSIKSLQTLDETKPAGQIVLRPADGNILVKLSHRK >GLEAN3_12081 MAAVLSAGIGVTVCIVALSAFLFYYIARLLRIFLLISKFDGPPAVPIFGNTLQFKSDP RDIFLYYQRNFERYRQETGGITRIWYGPLPAVIVYSAKHGEALMNSSHHTSKGIQYKC LEPWLGKGLLTSSGKKWTHRRKLLTPTFHFSILQNFMEVFNEQSIILVKKLEKFAEKS ETFNIFPFVTHCVLDIICDTAMGKRTNAQEDSENEYVRAVDSMSRLVVRRMRNPLCWS DWIYYNMGVGKEHEKSLNVLHKVTNEMIQERLSDIPSNENGTNQDDAVTRKRKRIAFL DLLLQMHREDATFTLEDIREEVDTFMFEGHDTTSAAVSWALLLIGQHPEVQVRLHDEI DQVFGESERPVTSDDLSKLPYLSRVIKETLRITPPVPGLSRELDEDIVVDGKVLPKEA MVIINIYGIHHDPEQFPDPDRFDPDRFLPENSTKRHPFAFIPFSAGPRNCIGQKFAMM EDKVILINLLRRFSVKSLQTLDEAKPAGLLILRPAEGTILVKLSLRK >GLEAN3_12148 N-term only MANSVNIFESVGGVFGEMTLATMILVSTIFVLGIAWAFKSLLGPQNKDIKLPPRLPTG IPFLGQAVAFNKSPIDFLEDAYEKYGDVFSFTMVGKTFTYLIGSEASALLFNSKNENL NAEEVYSNLTVPVFGKGVAYDVPNPVSLINLIIDYIRGA >GLEAN3_12500 missing N-term, seq begins at KYG motif MDMAKKYGPIFSLRRGPFLVVVLNDIATIKQAFVKSSEFFQNRFVPGHIRWGVPDTNK NASIVWSSGKPWKDHRKFSLHALRSFGFGKKSLVPQINLEARYLADEIKNLRGAPSDL SPTLNKATANVVAQLVFGRRYEYDDLQYIEVLQAMVAIFAGISITDPVVVFESLIHTP WYKHYRDATFKARDFIMCQLYMHRDTFQKDNIRDYVDAFLADDISKEYTLEEFWRIVL DLFAAGTETTTLVTSWAILYLAVHPDVQKKVQNELDRVVGRGRQPTTTDRPDLPYCDA TLMEVMRIRPVLPVLSPRECLGIQLARMESFLLFTNLFQQFEFKLPSDQPTASLRGQP GLSMHPHPYHVCAIER >GLEAN3_12926 missing C-term MGPFVVIVVYVLGLFACLLGIGLLVFVAAILRTEQKFSDLPGPKRDSFLFGNIFSIFE AKKKLGQPIGLIIYTLSQDHGPVFWIRVFARGHVFASAPEIVKELLLNSRHIKPQSFY KNFQYVFGARFMGNGLVSEINHEKWFHRRAIMNPAFRRKYLMGMMEKFNAESEKLCEH LRDKADGRNEVEMLNELNNVTLDVIANVSFSMTTGAVGDPSCPFPAAVTLGLRGMQTI IDKPWYSFDPRSESRKTRQETREAVNLIRETGRQQIEERIAARQRGDHVPNDILTVIL DCANDLVGDEDFQMENMIDEFVTIFFAGKLFSAPENCNYYS >GLEAN3_13039 MDFRWHQLTPYHLLVTALVGLVAAWFVKHLMRLPKNLPPGPTGLPLLGVISKFIGTSD PLALLSEWADRYGEITSFKLGPQLVVLLNSYDAVAEAFRHPDLQGKPRSRMTEEASGF GPNAGLLIGAGQSWKEQRKFTLSVFRNLGVGKKSYEDTVAAEMAQLGRAIKETKGSPI DPNILVGQAVANVICTVIFGTQYKYDDSEFQHLLYLMNFNMSRSGAGGWLFFIPIPGM SKVPFGVIKAMMTNAKEFYAFIDHHIEAHKKNGGPDIPKDFIDQYLIKLKEMKGTESS FSHSSLKAVISDLFFAGMETTTTTLKWCILYMMAYPEVQSRVQAEMDQVVGRERLPGL DDRQNLPYTCAVLMEVQRKGAVVAMGVPHMAAADVTIKGYTIPKGTIIFPNILNVLNN KNFWDDSDAFTPERFLSDDGTLIKREELIFFSTGRRVCLGEQLARMETFLGFTSLLHR FTFKKPDNSPPLSFEAVLGGTRSSLPYLTCAIPRDPTENVA >GLEAN3_13102 Missing N-term, green = C-helix VTPLFSDDLKDALFNSKDEHWKGVRNIVTPTFSAAKMKLMSPLINKCADRLLKHLEKR QELHGNIECRELVGAFVMDTIASCAFGLNIDCQENRFGPFVTNAKKAFSGNIFTSPYL LLVSLFPWTLPILEYLDFNIIDRKSTQFFADIIKKTSAVRRESKNSDSAKRIDYLQLL LDAQERDITGNNKNKADGIHGDPEDEAIDEGGLQKKSSKKIRLNESEVISQALIFFLA GYETTNTTCGFLLYFLATNPDVQDKLVDEINDVAPKAEDVGYQSLSKMPYLEQIFCET ERIYPPAIMIDRVCNEPFNVNGFTVPKGMRIFIPIYTIHRNPNLWPDPETFDPDRFRK ENREKHHPCAWMPFGTGPRACVGMRFAIMEAKMVIVRILQKYQIETCPQTEMPPKQAT NGFLSPPNGITLRAVKRN >GLEAN3_13199 MEKCEISSSQLESSLPYPSSFTSVILCVITLYLVYRLFRSLREKKDDGRRYPPGPKGL PLIGSLLDLRSDVLMKMKRFADEYGSIYTINLGPTNLVVLNNKDVIKEAFVTNAACMS HRPQVMFDTLKQSQGIILKQPGDLHRKNRNFLSTGFRMIKLGVPEFSYHLHPHIDNFK DKLIASKGQPLPICSTSHRLLFEIFTDLMLGEVPDPNSEKVNKLLDILYIALSPNPML ALVGNLIPNWVKSALSDWIDMADCLAKVTRYLKEHIHQHRETYEGDPTNLINAFLHKE KLELEKTSKLREESSSGGDSDTEIEKAAIGTSFMVNDQEENEKSLAAILLDVFQAGCT SVVASLQWLLLDLAIHQDVQDKLRMEVDRNVEQRNQRPSLLLRSDNFPYTKAVMLESI RLHPPAPLGAPHCAAQDTKLAGYDIDKGTIILPNFLMALQDFNQWEDACDFRPERFLN EDGTLATRETLGFSIGPRQCPGKLLSNTVISSILTDVIQDFTLHLPPDHNFSLKGNPG IIYNPQPFSIIAKQRTS >GLEAN3_14092 MTVLGMISGLFSMTTVYLSGAVVVIFFIYDYWTHTYFKRLGIPGPPPKPIFGNALDFS KPMYDLLDEYANKYGPVVGIYMMRRPLLLIRDLDMLRDIMVKDYHKFYNKFRVPISSG NLDNALFLLRDAQWKRVRDIVTPTFTGKKMKMMSGILNEGADTMVEKLLQSCKEDGNI DSLELFGSFVMDSVASCAFGLKVNSQEDKDNLFITNGKRLFDFIFNSPVFMAVYVIQF FTDVTEKAVALRESHPSKQRVDFLQLLIDAKNGKSKMASENDKDDIHNKYFEDTGAEE NISTKTQKYMTREELLAQGMIFLAGGYETTASLLTFASYLLASNPDCQDKLIAEIDDI APKRDDVGYTTISKMPYLDQVVCEVLRIYPPGPLTDRECGETCTYKGYKIEKGTQILI PTYTIQHDPTYWPDPLKFDPERFTKANREGRNPFTWLPFGAGPRICIGMRFALMEAKM ALVRSLQVVRFGISPLTKIPPDLGNTSLLSKKSSYTLKVVQRH >GLEAN3_14573 MAFTILGFTVSWPLVGTLVAIIVGYEIWHMTYFKRKGVKGPIPFPVVGHSIDLFQGIH WKTQEYVKKYGNVFGLFMLSSPVFVISDLEMVKDITVKSFSKFMNHKTFVLKSRPFDK VLTSLKDQHWKDVRNVITPSFSAVKMKQLSVLLNESSDTLVKNFGAIQRREGKVKVFD LFEAFTMDGIAKCGFGIQVDSQNDPSDPFVTNAKDLMKNTFSLKFLIAATFPGIGKIM DYYGVSTISPKIASFFLSVVDQAIAARKESGGSKMKDFLQLMLNAAQENKSEKEKPQE EKVEGDLHALIEDQDNKSYFDLKKSKKTTLTKDEIYGQAIIFFLAGYATIHTTLGFVT YCLATNPEVQDKLIEEIDNETPTRDDVGYNSIAKMSYLDNVVCEILRLYPASVLVERQ CNETHVCNGITIPKDSQVMFPVFAIHRDPALWPDPEKFDPERFTKENREGRHPLAWMP FGAGPRNCIGMRFALMEIKMATVRLLQKYRFETCPETPIPVKFAKMGLKPDDDLYLRV VERS >GLEAN3_14843 MDKNLPDVLTDDVLINAVSVTTYLMLIITSILFTRYWINAKRESDKVAKSGGRPLPGP WGLPVFGSMLSLDPSGPHLSLMKLAKVYGHVFKIQMGSRPVLVLNGLKAIRNALVKQA VVFAGRPDLLTLKAINSANLFGPSLSFSQYSEEWKLHRKITETSLRHFTAGSQVPFVE SVAKGEAEELVRYLKVPKCERSVDIPCLVRLSVSNLMLWFMFRKRASYDDKKFIESIN ILDGLSEAGSGNLVDFLPWLRFFSLDSSTGFLEAQKRFNKSVRGLIDERHELYDPESA QHVFMDEIIALGHVREEDEFQRLGLAGRTVLQNAFDFFGAGLGTTSATLEWAMVHMAL FPKAQYEVQCEIDKVVGRDRLPTLNDREYLPLTQSCLLEIQRYAAVTPFGIPHSTTKD TILDGYFVPKDMVVFVNLYSANFDPEVWDEPEVFNPRRFLAQDGALDDEKMKLLKAFG LGRRRCVGSGLASINLFIYFSTLLHQLRFSCPEGGEVNLEFRFGITLHPKNLNVHIES R >GLEAN3_14874 seq starts at I-helix, missing N-term half MIFLSAGYETTAALLTYASYLLASNPDCQDKLIAEIDDIAPKRDDVGYTTISKMPYLD QVVCEVLRIYPPGILSDRECGETCTYKGYKIEKGLQIWIPTFTIQRDPTHWPDPLKFN PERFTKANREGRNPFTWLPFGAGPRICIGMRFALMEAKMALVRSLQVVRFEISPLTKI PPDVGNTSLLSKKSSYTLKIVQRH >GLEAN3_15096 missing PERF and heme signature at C-term MDLDTSFITDVSLVGGALKATFGVLLLYFIYCYFNDNGSQYHHPPGPKGLPFLGSVLS MIFTRKHPHEVSLKGVAFANGEVWKEQRRLLMSVLKKADVDGISYDDIIVSQAERMLA EFEKTNGQSFDPRDLIHKTIGNVVMKILTGVTYDYDDLEFSRVMTNTVRVFTLLGEAG ILASIPALAAIPSPTKWKIIKAWEEIYGFLKEIIERRKARFDTNQKATDFMGSFVKAM CAKGTEIGPASCDEINLMVSSFDLLLAGWDTISTTLAWGLHTLASYPEAQRRVHQEIK DVIGLDRLPQFSDHHSMPVTMATIAECMRFRPTLAVHIPHVATQDCKIGGYDIPKGGQ ITLNMWSKGLSWRATGKN >GLEAN3_15120 short at the N-term MVLKANRYREKLGRIPVHLSLLKCVSFILLRLFGPICSITLRVSDSRISTPMKNKIRH FYSALSSYEFAPSVKKLGDAKIDWGVSGWRPTKEAAGLSRTQHGIVSETPGGSSHISY FFVSFALITLIAYQNLVQTFQLNQSSFTVVDIAVVVTVSAAVIIVVVVVVVFALLSGI KRYMRQVIDETLRVSILAPYAARYQDFDVVLGGHVIPKGTPVIHALGVSLQSEKYFPN PKTFNPENFSAANVKKRPRDAYHPFGFAGRRVCPGQQFAYKEVAIFLAVFIRAFKVKV VPGQKPEHVHGLVSHLVNKDGGEVWITVEKR >GLEAN3_15156 sequence starts in I-helix, missing N-term MNPDIQEKLINEVDEVTPSRDSVGYSSIATMPYLDQVVCETLRLYPPAGISNRMCNET FVHNGLMIEKGTQVFIPVYTIHRDPDYWPEPEKFDPDRFTKEAKAKQHPFAWQPFGAG PRNCIGMRFALMETKMAVVRTLQNYRFETCAETENFWEQTLSGNLY >GLEAN3_15256 MDSVPLDQLGTLLIVTLLCMVGLRFLISWIRDGKKRKLPPGPTAWPLVGSIPFMIRSK LPPLDLLTALADDHGDILRLRIMTTEVVILSSYDIIKEAYNHPNLQGRPRIVPIEDYI GRRNIGVAGSSGPVWREHRQFVFSAFRHFSTGPQNIDHIIHVEARTLVSEVSTLSRKP FNPHKVLNNAVSNVIAVMLFGKAYNYADDDFNQLQEIINTNLQEAASGGIYLFIPFLA KVPFSPANKIKQGIKRFTDFTKKIVNEHRHVFEPDHPRDIIDLYLKKIQEDHEGGIES SFDETNLHVLCGDLFGASTETTNASLKWSIIYMMIHPEVQTRVQEELDRVVGRDRLPE LDDRASLPFTLATLHEIQRMGHVFPLSIPHACTADMRLADYDIPEGTLVVSNLWRLSR DSRIWSDPNEFRPDRFLNENGDCIKPEALIPFSIGKRVCIGESIARKEIFVFFSSLLH RFHFSIPAGGSPPSLQGRLGLVYDTQDYITCASPRT >GLEAN3_15442 MIETPASSRDEPNLTVISSLLSLTSLTTSSLLLCVLVLVLVLVILTVDRRNSVTPVED GVTLLPGPRGALATVKVLLGMTKRASRYMFERTKTYGKVFGIGDGPDSYVIIMDYASL EEAFVTKADLCSDRGGLIRTMLESVLPERGSIFNNGTAWNEVRGFVAKTVKNPDFGAT TISRCVSETIPTLCSELDALVDGPFDPKRTYQCFCVDVFHRIAFGIGISYENCTVKKF IDNLDTVVAQTLDPSTLLLTRPSLINSFLFRGYRSKLRSSSSYVKTRAEEVLAKLDAP EQSSSLIALFVKYIEGIRSSGKASYSEPDDGWVIVYNMFFAATDTIHEVLLWLTLFIA TRPDIQLRMRKEIDTVMNSGGSESLYDQRSRLPVVRSIILEVLRMRPVAPMGVPHVVS DTGSIDGYSIPKGSQIFSNIWGMHNDPDIWKDPSAFNPDRFLSNTGVLDEVKAKLVIP FSIGPRSCPGEQVARRIIFQTLVSLISRYRISLPAGRPTPQLVNEERSFTMSPERFKV IFSRIQADPVQLKDLEIPTTE >GLEAN3_15568 missing N-term MDLFDESATRLRDHLIPLADDKTLVRMLEHFNNVALDVIGKVGFDMDINAVENSDCPF PSAVALALQGINVAFQNPLTLVDPRSKARQYRRSVRDAVKLVFKETMRLNPPVSGLSR VLASDVQSCGYVIPKGTTIMFMTYVTSRLEEFFDDPLLFNPDRFIPTDETPRHFFAYY PFAIGQRNCIGQQIALIESKVILAKLLQSFDFRLEQSQRHIILREVTNKPIDGCKNYI SLRD >GLEAN3_16056 missing N-term starts at KYG motif EYFQKYGKIVGVYNFRKPTLLVSDPDVVRKILVKDFDHFVNRQEAAIKTDLLAGGMLA AKDERWRKLRSLLSPSFTSVKLKSMVPLMRECAEVLVDIFTKREEEGKTIECKELYGP YAMDVIGSCAFGLRVNSQTNQDEPFIKYAKRIFNFNVTNPAFLISMLFPWTAPIFNFF KLGSVPKDIEQFFLGVFEKAIADRQANGDTGKSDFLELMMSAHKEDDDVNIMNDAEDD IADLAQETSWKPKQKTALNRQEIIGNSITFLLAGYETSSTALGFVSYLLATNQHVQDL LVAEVKEMAPSQSDVTFETVTKMEYLNMVIMEAMRIYPPGEVLERVCNKEITYNNITI KKGQYIFIPIWNLQHDSDLWPDPETFDPERFSKDRRGNNHPCGWLPFGIGPHTCIGVR FAMLEIKVAMVCILQKFRFDTCPETEIPPKLGNLSFLTPPNGITLRITPHSRYNLRSR >GLEAN3_16152 missing the middle of the gene between C-helix and heme signature MEVEVLGNAMSLTSLLVLIALGCIVLLLLNDYWAHGYWKRRGIPFEPPLPIIANMRVF AKDIPQAAGELRSMVSILEGERWKEVRNILTPTFSAAKMKQMSNLINTNADTLVKYIG LKQEEKGRIECSEFTKEAKAKQHPLAWQPFGAGPRNCIAMRFALMETKMAVVRVLQNY RFETCAETEIPPTFGRSGFLTPPNGVTLSAVPRTDKVA >GLEAN3_16251 MSYRCAVCYCDHEMEADMRVYSRNGRSDLLALFSSWADNYGDIVSFNVGPKLIVLLNS YEVVVEAFRNPDLQNRPGSQIMKEISGGGKNGVLLSVGDTWREQRKFVHSVFRSLGVG KKSYEDIVAREMDQLGSAIEEKRGTPFDPNTIFGQAVANIICSIVLGTQYKYDDTDFK HIIELLIRNIELFGGGAWIFFLPLPGISKIPFGAVKALVSNTRELNTFVRTMIKSHEL NLDPEHPKDFIDQYLNKLNETKGTDSSFSQANLEAAIKDLFFAGTDTTTNTLNWSILF MMAHPDIQSRVQTELDQVVGRHRLPKLDDRKNLPYTSAVLMEVQRKGAVLALGVPHVA AADTQVRGYTIPKGATILSNLYKVLNREDLWGDTNDFRPERFLSKEGEFIKREELIFF SSGRRMCLGEQLARMETYLGFTSLLQRFTFKKPDHTPPLSFAGVLGGTRNTVPYVTCA IPRETKI >GLEAN3_16442 MALLTDSYGAALAIISLVLTLIWYLKNSSSWRYRNLPPGPFGLPILGILPYLDEKNPS VDVLRMSKRYGPIFGGFLGSHRAVFLNDYDTIKETFARTDDVFSDRPRIAIFELYSDG QGVACCYLENHWKEQRRFTLRALRNYGMGRLQGSLRNAIDVEVSHLMTAMSKIKKPFD PRIYVESAVCNITCKMVFGRRYEYEDAQFTEFLHNLEKIFELSGVAGVVNYLSWMKYV PFSGYHALGDCVRHLENGLFTKECEEHQKVFDPNRSPRDLIEAFLLELHKKQQEKVEN NGEDMAPTGFNHQQMLHFIADVFAAGTDTTASTLKWSLIHLMKFEDEQRKIQEELDHV IGRDRMPSLEDKPNLPYTQAFLAESLRYGCAGPLGVPHGAKEDTKLRGYDIPKGTVIV ANLYAVLYDPKVFPEPDKFNPSRFIDDEGRVKTSLIERAHSQFGIGVPHGAKDDTKLR GYDIPKGTVIVANLYAVLYDPKVFPEPDKFNPSRFIDEEGRVKTSLIERAHSQFGIGR RQCVGMDMALMERFVFITNLIHRFNARAPNGPDSVTIDARGGLAFVPMPFEMQLTER >GLEAN3_16816 starts at C-helix MKCSIIFEEGKSWSDMRRFFLLVFRKFGMGKRSAGDRVNEEARVLCKEFILAEGKPFD PFRTVNYAVANIIRSITAGQRFEYSNGRFQQMVQLITDVLGAENFYGPLEMMPWLYFT PLFTKERNMCNALIGLMKQEVVEHRETVDVDNPRDVIDMCLAEIDRAKDAGEETFFDE SLLWRTLLDIYLAGTETVATTLTWAMLYMAGYPHIQEKVCQEIDDVIGHEGTPHYDDR TRMPYTEATIAELLRIRPIAPLGIPHINTESIVFRGYLLPERSIIMMNMVGMHLDPEL WPDPESFDPTRFLSEDGKATKKPEGHMPFGAGRRICLGEQLAKVELFLFFTNLLQKFT FRIPDGVLPDYGFGHRTSTLAPKKFDIVASLR >GLEAN3_17582 MIDLLGAGESGFGQFNALTITATLGVVTATLALWTIVRPKRFPPGPRGLPIVGSIFSM NDSPEIVFGEWAKRYGDIFGFKAGERWMVVLNRQALIKEAVLKQGVDFAGRPDFYSLE IFTDGFKDIAFSPYSETWKLHRKLAHTALRHFATGKPLQDLISSVYPKVDKKLAMTEG QPIDPKVLINLIMYNVLAQMCFGRSYELEDPTVAQWMDTNDDLNEKFGLGLAADMFSW ARYIPTSGPRMIKELTEKLWGFLRLQVDETREHYDPENINNFYSLLLKAQEDARKEGE NVDKLTDTHVFQTISDIFGGGYHIPKGTVVMINTHSMHYDPQEWDQPDKFLPEHFMDG SGTVRDHPPSFLPFGAGRRGCLGEAVAKADLFLIFTWFMQNYTFSRAPGKESEDILKM IPQTASGRILLSYEIMINKRD >GLEAN3_17986 MSSSVFYYGFTPFILAGLIAFLLSFILVRSGKKKYKNLPPRGPMEWPVLGGLPSLSGK DMPHVIIADMAKKYGPIFGMKMGSFYAVILSDYSLVRQAFAKSNDEFSDRPKITMIEN IVQGKGLIVSYFGQMQTEHRRFSLSALRSLGMGKFKMEETIVDEARQLAMIFTSRQSK PFVPFHDIVVSVSNIICWLSLGKRFDYNDESFKAILASLFESFEIAEIAGLFNFLPHW SGKCTTEEQEKSRRSPTCYVEMYTEQMIEAERETPGEHTFTKENMYHAVADLFAAGTE TTATTLKWALMYMILHQDVQEKIHKELDDVVGRNRLPSLTDRPNLPYTEATLLEIQRF ATITPLGVPHAPVQDTVLNGYDIPKGTVVLPNLWAIHHEPDLWKNPDEFNPARFLNPD TKQVVQREELIPFSIGRRRCLGEQLAKVELFIFFTHLLHRFQFTTPEGAPPLTMKARN GATLNPEEFEICASLIDDVM >GLEAN3_18242 MPHLRLVDLNSFSRSERRPPSWSPMSSYPDMYSGPQQMLQRCVIVQKDEKGYGLTVTG DNPVFIQSVKDHGAAYKAGVQTGDRIIKELLTRSKYLKAPDQYRGLKSLYGARTLGNG LISELNHEVWMKKRALFNPAFHRKYLMGLMNEFNSCSAKLVNHLIPLSDGQTEVVMTK EFERLTMEIIGKVGFGLEDDIIGNPDSPLCQLFPKVMSGMQSVYRRPLLKYSILPKDI KYKHEVRAAANEIRAVGRRCILARIDALRRGDQVPQDILTYILQESNNLEGIKDFDLE DMVDEFVTFFGAGQETTSNLLSFTLLHLGRNPQVLKKLRDEIDTVLKGRNYVEYSEVS KMKYLTLVLKETLRMNPPVGMLNRVLPYEMDLCGYKVPKGSTVLMPIYGMGRDEKHFK NPEKFDPERFTRDEDSPLFAYIPFSLGARSCIGQTFAMIEAKVVICKLIQQLEFQLVP NQSFEFVQEVTLKPKDGCKSYITMRNL >GLEAN3_18372 MGSLLDLVDTSTALIALLVFLLSLVVTHRFYQNGDDSKSLPPRGPREWPVIGIIPSLV GDEPYKLLARMSPQYGPVFLTRIGMISTLIINDFDSMKEALVRSGDDFADRPKFGMME YITKGKGIISAHVGDSQREKRRFGLAALRELGMGKSRLVENISDEIETLCDVFMRYEG KPFNPAHDITVSVSNIISWIVFGKRFKHNDTAFKRFLEIIHEGIEICESTGAAGFIPM LQYLPLPVWKKLADNDREWKEFDAVMIAEALQRPASEEPCFVQMYNEEMKRMEQRAKA TAAGSKASDGTDGIDASEETVIFNEEDMRQTLGDLFAAGTDTTATTLNWAILYLILHP DVQCKIHEELDNVFGRNRKPSLEDRKYIPYTEAVLLEVQRKATIVPLGVPHAASWDTK LNGHHIPAKTLILPNIWAVHHDPKIWKNPDEFQPERFLDDKGTTVLQREELIPFCIGR RKCLGEQLAKMELYYFFTHLLHRFEFRISEGALPPSTVGKPGATRNPMPFSVCAIPRF S >GLEAN3_18468 MAGIVSVLSVVVQSVGSPALVVGGITILLYLIYQVFYQGGKKYPPGPTGLPVFGSVFS LLLTKRQQHEVVNEWSHKYGPIFAFKMLGTRVYVLTDPGLVSDAFHSSPDINDRMSMA IADEITGRPNNGLVASVGEVWKEHRRFFLTVFKKIDSGVIRFEEIIGSEVERFLKEVE KKDGESFDPCNAINVGVANIITRLVTGVNYDHEDKDFQHLIDCSDRIFQIMGPAGLFT LIPILAKIPSPVKTEITNLSLEILKMVRKSIEIHKTEFDPEEPARDFISCYLKEMHRR KEENDPGTFDDINLLTSSFDALMGGWETSTTAISWMTFTLAAHPEAQRRVRQEILDVV GSDRFPRFSDRRSMPITEATLAECTRLRPAAAIHIPHIASRDCKLGGYDVTKGSAITT DLWYLALSPNLWKDPQDFKPERFLDAEGQFNKKLEPLPFGYGKRVCPGEQLARMEIFI FVTYLLQNFSLTLPEGAPTSNSGKHGLTNKPNPFKINAKKI >GLEAN3_18515 MANSGYVASSLFFGFQEICYRECIIVLTTFLFVRLLVSRGLFKKSTPPSNPPPPSPTG LPFFGNLFSFQEDMRHLFRKWTNEIGGIFRLRIGPVMDIVVINDVEAVREAFVEKADA FSDRMVPPLLGEALGKRGNNHLYKHKKTYTRSLIWPAGGGWKKTRKFGHSAFRSMGLG HRRIEEKVVEEGRILVQACEMKEGEPFNPRVLITNAVGNMIACLVTGSRYDYGDPDFV KIVGDIERFSQNLSLTSLGNVFPLLYYTPLYTDFRRPMRDIISYIKSMVKQHQQNFDP LNVRDIIDLYIVELKQQREQQIPGTVEENLEWRLSIELYIAGMVTTSDSMLWAILIVA QHPQLMKEIQKEIDDVIGDERSPKMADQDALPLIVATLMEVSRIRPAVPLGVPRLTSR DATLQGYHIPKDTVVIMNLWEVLNSPKYWDKPEEFNPYRFLSEDRRSVVNHPAFIPFG IGLRACLGEKLARMQMFIIFTSLLQRFTFCLADDSRSAGMKGKAGMAYSPLKYDIILK RRENSD >GLEAN3_19082 missing N-term MASHGLTYGNSLFLLFAPSALKDNIRDFVDAFLADDISKEYTLEDFGRIVLDFFTAGT DTTAVVTSWALLFLSVHSDVQRKVQNELDAVVGRGRQPDTSDRPNLPYCDATLMEIMR LRPVVPISLPHMTSADASLGPYTIPKGTIVVPNLWAVHHDPKEWCDPYLFNPNRFLSA DGQTVVKNEAWMPFSTGRRDCLGMQLAMMETFLLFTNLFQQFEFKLPPDQPTHSMRGH PGLTMPPESYKICATER >GLEAN3_19464 MFIPADTLDFFRDVNKKTIALRKSSSSGPNSKRVDFLQLLLDAQKQENIDKESKPEEE EEDDDDIHADLDKHHDVQYHSRGLGKKATLTDNEVFAQGLIFFTAGSETTTTLLGFIA YSLATNPEAQEKLIDEIDRVTPSRDSVGYKSVATMPYLDQVVCETLRIFPPAAVSNRM CNETFVHNGLMIEKGTQVFIPVYTIHRDPDYWPEPEKFVPERFTKEAKAKQHPLAWQP FGAGPRNCIGMRFALMETKMAVVRVFQNYRLETNAQTEIPPKFNRTGFLTPPNGITLT AVPRTDKVAY >GLEAN3_19883 MESADAVALTLCGVVLLASLAVWSMSRPRGFPPGPLGVPILGSVFSMTASPEEIFVRW SKQYGDVFGFKVGQRMIVVINHQDAIREALLKHGIEFAGRPDFFSFGVFTYGFKDIIF SSYSDTWKLHHKFAHAALRQFASGKRLENLLHGSIAKMEGYLSKMDDSEPVDPKLVIT LLLYNVMSTMCFGKDYDFLDENLIAWKENNEKLNQEFGMGVAADFFGWAQYLPTPGVK MVHRMRDELHRFVWDELDSHKKNFDPDNVRDVFDMLLLAQKEAADSGNQDAEKLSDTH LVLTVSNMFAAGIQNTTETLYWALAFMVEHRDIQSRCQAEIDRAIGRKRVPVVDDRGS LPYTEATLYEVMRYSSILPIAVPHATTCDVNFRGYRIPKGAVVMFNTYSMHYDPNAWE SPTSFKPEHFLEDDGSVQQHPSSFLPFGAGRRSCLGEVAAKTDLIIIFTWLLQNYCFK TVPGKETTSLLRVIENSAVGRNLENYELIMEMRK >GLEAN3_19898 MAVLEISSIVTIPTLVALFVVVVLYLIYRTNSRTALRHPPGPSELPFIGSIFTLLFTK LQQHELLFKWSGEYGGIFSFTLAGGRVVVLSDKELIQETLQDPFVNDRIPFPACITIY GRENTGIAFASGAVWKEQQRFFLTVFRKVNTSVIRFEDIIQTQGQRLLNELDRLEGQL FDPSNIVTVTISNVILKIVTGKTYEYADPGLLRMATCSDRLADIIGPAGLLGMIPGLA SLPLPAKRQLTDLMREMIDFIRDHVEEHKANFNPDEPATDFMSCYLKEIANSSGIPDT TFDETNMLQTCWDAMFPGYETTISAFRWALHLLASHPQVQSRIRQEIFDAVERDRLPA YTDRHNMPYMESTVVECLRMRPVIPYVAPHIVSRDRKIAGYDVAVGTRITVNLWFLAR SPTLWEDPDEFRPERFLDENGCYDRSREPLSFSYGRRTCPGDQLARMELFLILSYILQ RFTLTLPDGAPRNLEGSMGATYKPYQYELRATKNHGARSSASNSVPE >GLEAN3_19899 MSGDQIKDLVNWLEVVNDWTNKYGDVISFKLLNIRVYVLNNVQMSHDVMGSLSMMGRM PVPIIKTLTGRHNAGVALSNGDVWKEQRRFFLSVFKRADAKGIRFEEIVGAQAERMLT EIENTQSARFDPQPIIYTSIGNIITKIVTGITYDYDDDDFHRFMTHSNRVLDLMGPAG ILASVPILAVLPSPAKRPLTKEWDAMYDFIKKSIASHKSEFNPNQEAADYIESYLHEI AKRTTENNKGSFDEINLLACSFDLIVAGLETSATSISWSLHTLACYPEAQHRVRQEIF DVIGQERDPRFSDRHRMPIAMATIAECMRLRPAIDLHIARVAEQDCQVGEYDIPKGSV VTTNMSHLFLSPSLWEEPLEFRPDRFLTEDGQFDRKREPIFFGTGRRVCPGEQLARTE IFLFITSILQRFTLSLAEGVPKDLPCYHSITNRPDPYQIHAEKIV >GLEAN3_20229 missing N-term starts with C-helix MKTIYNYHDQSFCSWGQKWFHRRKMLTPTFHFSILQSFMDVFNEQSMILVKKLEKFAE KSETVNIFPLVAYCVLDIICITAMGRSSNAQENSENEYVKAVGRMSELVTIRGKNPLL WPDFIFDKIKTGKEHKETLKILHGVTNKERLKESPKDTNTDDDDVDVVARTRKRIAFL DLLLQMHREDASFTLGDIREEVDTFMFEGHDTTAAAAGWAILLIGHHPDVQARLHMEI DEVFGDSMRPVTSDDLSRLQYLTCVVKETLRLIPSVPSIFRKLDEDIMLDGKVVPKET LVALSIYGLHQDPDEFPNPERFDPDRFLPVNAEERHPYAYVPFSAGPRNCIGQKFAMM EDKIILASILRRFSIKSIQTIDEAKPAGQLILRPAEGNMLVNLSRRK >GLEAN3_20233 missing N-term MPMEINKMGDAAAKGMFDFLEKILKRHRQAYNPEDEPKDLIEAFLHEEANKKAQNVDV GSFTHQCLKQVVWDLMLGGIETVSTSLSWYFLFMAQYPDVQAKVQAELDKVIGRDRLP SLSDRQNLPFTESTSVECYRFCRLINVQIPHRTRCDVKLGGYDIPKGTNVGTNASWLS SSPSLWENSEEFVPDRFIDKETGLYNSKLEAGLVEFGVGRRVCPGEQLARTELFVIFT HILHRFTLRLPEGTPITRNGINGLTHHPKPYKITAAPRR >GLEAN3_20234 MAQVMSTDFLTHNLSTLLTTGLTIILVVLFSLRRKNDSSKKLTHPPGPSGIPVLGNLL TVAFSKLPQHELINGWSKKYGDVFWFKIFGTGVVIINDKEVMQKTFQNTDVCDRAPFF ASDYVLGRENNGIAFANGGSWKEQRKFFVSIFRKVDIGVSRLEDIVGGQVKNIVTQVK NLKGEKFDPAIMVNMSIANIITRIITGVTYDWEDKAFQDIVAYSGKLFDISGPAGLFS LNPVFARLPLPVNFEIRRLCRTIMDFIQVSIDEHRKNFDPNAPVKDFIGSYLKEMASA DANNLKSFDDTNMIVSCFDALLGGWETVASTLRWTIYLLSKHPEAQGRMRQEIHDLVG DERFPTIADRQYLPLVEATIAECMRFIPAVPIHLPHVTSQDCNIGGFDVPKGSAVAAN LLHFAKSPMFWEDPEVFRPERFLDDAGNFMRGLEPVPFGYGKRQCPGELLARMEMFIF TTFLVQNFELKLPAGIKDDLQGVLGLTWRPNHYFIEGHPTNANEE >GLEAN3_20751 missing both N and C-terms, stops in heme signature MVDEHGETLDDQHIRDIIDLYILEIREQKKRGDEIVFNETTVWRGIFDMISAGTGTTS TTLLWGILLMAAHPDAMEQVQNEIDRVIGDHRPPSMSDREDLPLTMATIMEIQRFRPV SPLGFPRFTNKDAEVKGYSIPKGTAAIVNIWELMHGTKYWEEPDKFKPSRFLSDDGRS IIKPDAFIPFGIGRL >GLEAN3_20752 missing N-term MVDEHGKTLDDQHIRDIIDLYILEIREQKKRGDEIVFDETTVWRGIFDMISAGTGTTS TTLLWGILLMAAHPDAMEQVQNEIDRVIGDHRPPSMSDREDLPLTMATIMEVQRFRPV SPLGFPRFTNKDAEVQGYSIPKGTAAIVNIWELMHSTKYWEEPDKFKPSRFLSDDGRS IIKPDAFIPFGIGSRVCLGEILAKMDVFLTFTFLLQRFTFRLPEDSPTN >GLEAN3_20753 MASSGYVASSIFLGLQEISFRECIVFVMTCLFVRLLVSRGSFKKSTPPSNPPPPSPTG LPFFGNLFSFQEDMRHLFRKWTNEIGGIFRLRIGPVMDIVVINDVEAVREAFVEKADA FSDRMVPPLLGEALGKRGSLIWPAGGGWKKTRKFGHSAFRSMGLGHRRIEEKVVEEGR ILVQACEMKEGEPFNPRVLITNAVGNMIACLVTGSRYDYGDPDFVKIVGDIERFSQNL SLTSLGNVFPLLYYTPLYTDFRRPMQDIISYIKSMVKQHQQNFDPLNVRDIIDLYIVE LKQQREQQIPGTVEENLEWRLSIELYIAGMVTTSDSMLWAILIVAQHPQLMKEIQKEI DDVIGDERSPKMADQDALPLIVATLMEVSRIRPAVPLGLPRLTSRDATLQGYHIPKDT GCYSLLYNNYNNINSSSVVIMNLWEVLNSPKYWDKPEEFNPYRFLSEDRRSVVNHPAF IPFGIGLRACLGEKLARMQMFIIFTSLLQRFTFCLADDSRSAGMKGKAGMAYSPLKYD IILKRRENSD >GLEAN3_20754 MEMGLFKVERLRDGLICFAVFLLAKWLLGSKTRKSKLPLPPSPPGLPLIGHLLMFRRD FRKVFRGLKEKYGSVFSIRVGPQRTVVMSEIDAVKEAFVNQAEAFSDRHMPPLLKQGL GSAGSFAFDNGPVWKARRRLGLSAMRTFGVGKYSMEQRIIEEAKMLCETIENNQETSF NPKGLITHSVSSIICGITFGQRFDLNDPEFNDLISRIHSALSSISFFTVGQLFPFLLY TPLYSKFRNLISSLKTLVTRMVDEHGKTLDDQHIRDIIDLYILEIREQKKRGDEIVFD ETRVWRGIFDLFAAGTGTTSTTLLWGILLMAAHPDAMEQVQNEIDRVIGDHRPPSMSD REALPLTMATIMEIQRFRPVSPLGFPRFTNKDAEVKGYSIPKGTAAIVNIWELMHSTK YWEEPDKFKPSRFLSDDGRSIVKPDAFIPFGIGSRVCLGEILAKMEVFLTFTCLLQRF TFHLSEDSPTNIDAIPGVALIPYDFLIMAEKRV >GLEAN3_20756 MANRSYVASSMFLGLQEISFRECIVFVTTCLFVRLLVSRGSFKKSTPPSNPPPPSPTG LPFFGNLFSFQEDMRHLFRKWTNEIGGIFRLRIGPVMDIVVINDVEAIREAFVEKADA FSDRMVPPLLGVAFGKRGSLIWPAGDGWKKTRKFGHSAFRSMGLGNRRIEDKIVEEGR ILVQACEMKEGEPFNPRVLITNAVGNMIACLVTGSRYDYGDPDFVKIVGDIERLSENT SLTSLGNVFPFLYYTPLYTGFRRPMQDIISYIKSMVKQHQQNFDPLNVRDIIDLYMVE LKQQREQQIPGVVEENLEWRLVVELFLAGMVTTSDSLLWAILIVAQHPQLMKEIQKEI DDVIGEERPPKMADQDALPLIVATLMEVSRIRPVTPLALPRLTSRDATVQGYHIPKDT VVLMNLWEVLNTPKNWDKPEEFNPYRFLSEDRRSVVDHPAFIPFGIGLRACLGEKLAR MQMFIIFTSLLQKFTFCLADDSRSVGMKGKAGIAYSPLKYDIILKRRENSD >GLEAN3_20876 MAFMFSVWTFAVPVCLLYLIKKLWYWYNISKSDPNCSAPLPGGSMGWPIIGETFHVLF QGPSFYEKNFAKFGRVFKTHLFLNPTVRVHGPDNIAKILQGENDIVTSTKPVRMAIEK WCQSDEVLAANVLRHLSFRVAADVLCGFHYTDEDVTELCQKFDDIMKGIMTIPYDVPF SRFRRALQARDFIKQKIDENVTRKANLDPEEDTQDAMRILIDLANKSDDSSSKEGLYE AIKNQTIEFLTSGMLSLSSAASTMLYHLAKKPSVLAKVREELNDYGLLTDYSPLTLNK INQCRYVMQVIKESLRMVPPIGATFRTALKTFELDGKQIPKGWTVFWSIRETIHLSDS FTDTKNFDPDRFSPQRQEDKVAGRFSMPVFGHGTHSCIGQNFALLTLRILLIELARTC DFETPNVDKIKFSWIPSPKPKCGMPTRFTRKILVDQNENEVSYSSKSSTCPQGEDSRE RSEEGVHQRSTTCHAA >GLEAN3_21087 MIDLLGDSSLVQVNGLTITATLGVVAISLALWTMVRPRNFPPGPRGLPIVGSLFSLND NLEKIFTGWIKQYGDIFGFKLGEHWMVVLNHQEIIKEALLKQSVQFAGRPRSPSAEWM TDGYKDIAFGMYGETWKLHRKLGHTSLRHFATGEALQKLLSTTFPKLEVEVAKTAGKP VDPKNIISLLLNNVLSEMCFGKRFEMHDPKVLEFIKLNDEMNAEIGMGFPEDVFSILQ YVPYKKRDIIRGFMNKIKELVIEQIGDHKETHTQGEVTDIYHLLLDAQEKAMAEDSNM EKLSDEHLLWTIMDIFSAGIQTSTDTLYWALALMTEHPEIADRVRKEIDDVIGGDRIP VIEDRGSLPYAEATLYEVLRYSSIAPILLPHATTCDTTLRGHAIPKGTTVMMNVYSMH FDPREWKDPENFCPEHFLAENGKVRDHPPSFLPFGAGRRVCLGEAVAKADLFLIFTWF MQNYTFSKVPGKESESLTNMIPMSAAGRLLVPYEIMINKRE >GLEAN3_21185 missing N-term MLKYDVEFSGRPQFYSAEIASDGFKDIVFSNYSDTWKLHRKLAHSAFRKFATGHKLDD LVHGVIPQVQQMLTENSGKPFDPKLMITLTLYNVFSKMCFSESYDFSDPRMLGYITMS QELAEKVGFGLVADYFPWARFLSYNGASYIQRMKQKYMEWTSPQLQRHRETLDPTGIE TSMQRLYWAIALMTEYPDVQAKIQGEIEVEIGRGRHPVIKDRGNLPYTEATLYEMIRY STVGPVGVPHATMVDTKFRGYLIPKGTTVLLNQYSMHFDEREWTDPHSFRPEHFLDES GTTRLHPSSFQPFGTGRRSCLGEAVAKADLFLIFTWLLQNYTFRKVPEYSEENLLKLL PETAAGRVLQNYKILIEER >GLEAN3_21251 missing N-term MPSMQGVQYALDGYIKKYGNCCGFYQFLTPAILVADPDMIKQIMVKHFDRFHNRMMSE MVREPASRLVQNLKKVCGTDRQGIVELKELFGAYIMDTIGLCAFGLNVDSQKNKDDPF VLNAKRILLQFRLYITGKIPGVSHILPLLGLTVFPPEVLEFFIAIIDQAVEQRKTNKA AKTVDFLQLVIDSTKDNACEDEDDDDDATTNGTSSSSHAIDKKRKRQALTRQELTSQA LAFFVAGYETTTTALCFTAYLLATNPDQQDRLIDEIDRLVPTAEDITYDNLKKLTCLE NFISESLRLYPPGAL >GLEAN3_21255 MEVLQQESPGSCLTVWLVFAATALLILYLTWNYTYFLRRGIPGPLPFPVLGNVLDLKE GVQYALDGYIKKYGNCCGFYQFLTPAILVADPDMIKQIMVKHFDRFHNRMYIPVDNPM LRSALFLSSDDQWKKIRHTLTPAFSGRKMKLMSEMVREPASRLVQNLRKACGTDRQGI VELKELFGAYIMDTIGLCAFGLNVDSQKNKDDPFVLNAKRFFSSLDFTSPAMIISMLT SKIPGVSHILPLLGLTVFPPDVLEFFIAIIDQAVEQRKMNKAAKTVDFLQLVIDSTKD NACEDEDDDDDATTNGTSSSSHAIDKKRKRQPLTRQELTSQALAFFVAGYETTTTALC FTAYLLATNPDQQDRLIDEIDRLVPTAEDITYDNLKKLTCLENFISESLRLYPPGALI NRVCNQTTTINGHVFQAGDSVSLPIWSVHRDPEHWPNPEKFDPDRFNKENAAGRAPLA WLPFGGGPRFCIGLRFAMMEIRTATVMLLQQFRLEVAPETEIPPVLGKMGFINPPNGI TLRVVERSK >GLEAN3_22109 missing N-term MSDLALGFFSNDFSSLMSSKGSIFGQRWKQLRRFGHRALRHFGINQGSMEDSIIEEGR LMAEAFEMKEGTQFNPKVLVTNTVGNVITRFVFGFRFEYGDPVFEKLAEDFELSFESF SITSLGSVFPFLYYTPLYTNFRRPIQDVVSYIKSVVNDHRRSFDPLHLRDVIDLHMAE VKGQRDENKMAAIEEGWEWKFIFELYAAGLATTTDTLRWAILIAAQHPQLVHEVQKEI DDVIGGRSPKFSDQSLMPLTGATMLEITRIRPVVPLGVPRCTSRDAVVQGYHIPKNTT VLMNLWELQTSPKYWKEPEKFNPYRFLSEDRKTIVSHRALNPFGIGLRVCLGEKLARM QLFIILTSLLQRFTLRLADDSLSVGMEGKAGIAYGPLDYNIVLEKRHV >GLEAN3_22110 METVKKSSLDFFADYAIYGREVLVFATVLLFVRWILLQWSYNKNSKPSAPPLPYPPGL PFFGNMFSFRTDMRDILRKWTNDLGGLFRLRIGPMVDIVVISDVDAIQEAFVDQANAF SERDIPPLLEVAMGTNGSIFGQRWKQLRRFGHRALRHFGINQGSMEDSIIEEGRLMAE AFEMKEGTQFNPKVLVTNTVGNVITRFVFGFRFEYGDPVFEKLAEDFEHFFESFSITS LGSVFPFLYYTPLYTNFRRPIQNVVSYIKSVVNDHRRSFDPLHLRDVIDLHMAEVKGQ RDENKMAAMEEGWEWKFIFELYSAGLATTTDTLRWAILIAAQHPQLVHEVQKEIDDVI GGRSPKFSDQSLMPLTGATMLEITRIRPVVPLGVPRCTSRDAVVQGYHIPKNTTVLMN LWELQTSPKYWKEPDKFNPYRFLSEDHKTIVSHRALNPFGIGLRVCLGEKLARMQLFI IFTSLLQRFTLRLADDSLAVGMEGKAGLAYGPLDYNIVLEKRNV >GLEAN3_22432 MDLNQMTIYHLLVTALVGLVAVWIVKHLMTRPKNLPPGPTGIPLLGVAFKLIRSRSDL LALFSSWADKYGDITSFRAGPKLVVHLNSYEVISEAFRHPDLQSRPHSQMLTEMSGVG QLGLILGVGDSWKEHRKFTFSVFRSLGVGKKSYEDTIAAEMAQLGGAIEEKKGTAFNP NVLFEQAVANVICSIVFGTQYQYSDTDFQHVLYLLNNIARLAGTGGWAFFIPIPGISK TPFGFVKQFVNTVDRLLAFSQAQLDSHMKDLDPDHPKDLIDQYLIKLEETQGAASYFD QLNLKYLISDLFIAGTETTTTTLKWCFIYMMAFPEVQSCVQAELDQVVGRERMPGWQD RKNLPYTCAVLMEVQRKGAVGAMGVPHVAAADVTIKGYTIPKDTIIFSNIWNVLNNKN FWNDSDAFTPERFLSEDGTLVKRDDLIIFSTGRRVCIGEQIARMETFLGFTSLLHRFT FKKPDNSPPLSFDGVLGGTRSCLPYTTCALKRE >GLEAN3_22590 MDFGISLTTFLLGMVTFLVTLMIVWSRTGRTYANLPPKGPPEWPIIGSLLSLAGEHPP HQILANMAKKYGPIFSMQMGSFYAVVLSDYSLIKQAFAKSGDDFSDRPKIAMIEHLAE GKGLISGYNSPAQIEQRRFTFSALRSLGMGKFRMEETISEEIQHLIKSFTERNTQPFI PFHDITVSVSNVICWLNFGKRFEYNDDDFKGVLEALFGIVEITEIGGVFNFLPFLRFL PGSGLKKCFKHKAIFDKHMYPLIRKIKQSYDDKFESAECYIHMYADKIAEAAKQNIVP NLFNDGNMVLAVADLFVAGTETTATTLKWALLYMVVNQDVQKRVQEELDRVVGVDRLP SLLDKPNLPYTEATLLEIQRKASIVPLGVPHAPVQDTILYGYDIPKGTVVLPNLWAIH HDPNLWKNPDEFNPDRFLKPDTKHVVQREELIPFSIGRRRCIGEELAKVELYLFFTHL LCRFHFILPPGAPQPTMEGRSLATFVPHDYSLCAIPRCRDKDFEES >GLEAN3_22593 MYASVEKMSSSVFYYGFTPFILAGLIAFLLSFILVRSGKKKYKNLPPRGPREWPVLGG LPSLAGTDMPHVTIADMAKKYGPIFGMKMGSFYAVILSDYSLVRQAFAKSNDEFSDRP KITMIENLVQGKGLIVSYFGQMQTEHRRFSLSALRSLGMGKFKMEETIVDEARQLATI FASRQSKPFVPFHDIVVSVSNIICWLSLGKRFDYNDESFKAILASLFESFEIAEVAGL FNFLPFLRFLPGSGFKRMIALQQNHDRFMKPLVLKVHEEQEKSRRSPTCYVEMYTEQM IEAERETPGKHTFTKENMYHAVADLFAAGTETTATTLKWALMYMLLHKDVQEKIHKEL DDVVGRNRLPSLTDRPSLPYTEATLLEIQRFATITPLGVPHAPVQDTVLNGYDIPEGT VILPNLWAIHHDPDLWKNPDEFNPDRFLSPDTKQVVQREELIPFSIGRRRCLGEQLAK VELFIFFTHLLHRFQFTTPEGAPPLTMKARNGATLNPEEFEICASLIDDVM >GLEAN3_23067 MSLLRFLLRISTKANRQTVLLTVASVFLALLTAYRHTLTRTKHGRRRRRLPPGPTNYP IVGILPFLGRDDPGKSVLELSKKYGPVFYGRLGNFDAIFLNDYESVREAFAKSGDAFS DRPRFTTFEAWSGGRGVVCCYYQNHWKEQRRFLLKLLRNLGMGRANNSLEDRILQEAV LLTRSIGETNGESFDPRIPMTMAVSNIICGLTFGKRFDYDNKEFEKFICTLERCFEMT DIAGPTNYFTLLKYVPFSSLKKMGAMARSLERGLFTREKDAHKLILNTNDCKDMIDFF LKEVRENESKEGRGEAPNSSFKEEFLPSLIGDIFAAGTDTSACAMYWVMLYSIKYPKY QRRAQEELDRVVGRHRLPRQSDRPNLPFVVAMLAESLRCSAGGPFGVPHAAVEDTTFR GYDVPKGTIMIANHYALLHDPEVFGVDTEVFNPCRFFDDDGTVIKSSKTGLAVSRRAC PGEQLAQSERFIFFSHLIHQFDFSAPDGPDSVSLDSHFGLSRSPFPYKLRAERRPIDL QLPDITL >GLEAN3_23068 MKKYGAASPKTLKSTLQFLLRLMGNRRHTTYILALISLVLATISLRIGGQGRRRRRGK LPPGPMGWPIVGMLPYMDPENPAQSVWDMSKQYGSVFCGRLGSHLAVFINDYETIKEA FSRKDDAFSDRPRITMFEVYTNGHGVATCYFDNHWKEQRRFAMKAFRHFGVGKPDSVF EEYLMRESKHLLHAYSRKHVAFNLKPFLDMAACNIISSMTFGNRFSYTERTFQQFLQA LDDIFHAADVAGVTNYLSFMKYLPFSGYSSLDRRSRELESGLFVREREGHKRTYVASE TRDLIDAFLHEIRKKKEEGVDSDKTGFSDKMILHFIADMFAAGTDTTSNSTQWLLLFA AKYEKEQLRVQEELDRVVGRDRLPTLSDRKNLPYTEAFMVESMRCAVGGPFAVPHAAV RDTEFKGFFIPKGTVIVSNILAVLRDPKMFPEPEEFRPSRFLSEDGNLRKDLIERVSA QFGVGRRTCIGDQMARAERYIFLTHLIHTYNLSGASGPDSLSLKSHGGLTRNPVPYDV ILTERTPTVASAKKSHGF >GLEAN3_23808 ELWNLTYFWRRGIKCSRPLPLLGNILSCTKGFYSYIYTCQQNYGNIFGLYFFRSPLIV VCDPDVLRDVLVKSFSKFSNHYQFFLKNRPLDQGLLDLRDQRWKDVRSVISPTFSALK MKQMSPVINECCDVLVTNLKKKQGEDINVNSVFEAFTMDGIAKCAFGLQVDSQNNPDD PFVKHAKRIMDGTVESPVAMIAGVFPPAAYLFNALDIALFPAETRKFFRDVVVKAIEL KKRSAGNERRDFLQLMMNAHNDDEHDSRSKVIKDDNDLHNLIEDHQDKHVSNAQNTKT QLTMDEIFAQAVLFFVAGYETTNVTLGFLAYALATNPDIQDRLIEEVDEVTPTRDSVD YNSIAKMSLLDMVVCETLRLYPPAVMGDRCCSETHTVNGLTIEKGVQFLYSIYNIQRD PTLWPEPEKFDPSRFTKENRANRHPFAWIPFGAGPRNCIGMRFALMEIKMAVVRILQK YRFVPSPKTDIPVKFGTGNTLKPDGGIFLRVEERQ* >GLEAN3_24078 MLPPNINQNGTQNKLRITIMIDATAFLTRGTAFLAGETIECAITFSCPEATGSGQPEC LAWASAQLHCQCNVKESKVIIPPQPITRRPSEEWENIQSSSNTSFAASRGLDEATIFE GANHTDAANPFLEGDRPKNTLLDRAVQILTTITSRKSPSVAFSNGDVWKEQRRFLLSV FKRADAKGIRFEEIVGAQAERMLTEIEKTKSARFDPQPIINTSIGSIITKIVSGITYE FDDDDFHRLVTHSNRVFDLLGPAGGILSSVPILAAVPSPAKRQLTKEWDTMYDFIKKN IASHKSEFNPNQEAADYIESYLHEIAKRTTENNKGSFDEINLLECSFDVLLGGWETSA TTISWSLHTLACYPEAQNRVRQEIFDVIGQERDPRFSDRHQMPIAMSTIAECTRLRPA FDFHLPHVAEQDCQVGEYDIPKGSVVTTNMSHLFLSPTLWEEPLEFRPDRFLTEDGQF DRKREPIIFGTAFHPQSGRRRSKRSPMLPQVHQSTGSLPDTC >GLEAN3_24113 MFIPRINHFAMEFDWHQLTPYHLLATALVGLMAAWFVKHLMRLPKNLPPGPTGLPLLG VVSKFFGTSDPLALLSEWADMYGEITSFKIGTQLVVLLNSYDAVAEAFRHPDLQDRPR CRIKEELSGCGRNAGLLIGVGQSWKEQRKFTLSVFRSLGVGKKSYEDTVAAEMTQLGR AIKEKKGSPFDPNILLGQAVANVICTIIFGTQYTYSDTEFQQLLSLLNFSMRRSGVGG WLFFIPIPGISKVPFGVIKTMASNAKEFYAFIDDHIETHKKNGVPDVPRDFIDQYLIK LKEMKGTKSSFSHHSLKSVIADLFFAGMETTTTTLKWFILYMMAYPEVQSRVQAELDH VVGRERLPGLSDGKNLPYTCAVIMEVQRKGAVVAMGVPHLASVDVTIRGYTIPKGTII YPNIWKVMNSKNFWDDSDAFKPERFLSEDGKLTNRQELIYFSTGRRICPGEQLARMET FLGFTSLLHRFTFKKPDNSPPLSLDAILGGTRSSLPYLTCAIPRDPAENVA >GLEAN3_24320 missing the middle between KYG and I-helix motifs MFDLSATSVFLGGVIFLLCFWVVRRPRNLPPGPWSLPIIGYNFKPGLIHEAFMDLAKK YGPIFSLRRGSFVFVILNDRESITQALVKSGEFFSDRFVPGHFNWGIPDPGKKGGTDT TAVVTSWAILYLSVHPEVQRKVQTELDTVVGRGRQPNTLDRPDLPYCNATLTEVMRIR PVLPVSIPHMTSDNVSFRGFTIPKGSIIIPNLYAVHHDPKEWNDPDTFNPDRFLSADG KQFQMNDAWMPFGVGRRDCVGSQLAKMEAFLLFTNLFQQFEFKLPPNQPTPSMCGANG ITMKPKPYKICAIER >GLEAN3_24836 MASMAQQILHRHVGTMVTSEHLEDAKPYDEIPGPKGVPFFGSLFDYTGLGPYKLDKLH EATVDRFRNFGPIFKETLAGVTHVHIIDPTDVRELFRNEGRSPKRTPIDAMVKYRRTR KRHIGMANTEGEEWQRLRKPVQHLLMKPQSVYAYIPIMEECADDFVTLMKETQDIESQ EVPEFNHKMQRWTLESVCAIVFDTRMGCLNTGLDNNPEALDMIVSVSNFFDSLKDLTF SFPFWKFGITTKAWTKFTEAQDYFFGMSKKYATIALDRMKSAVGTSTQSESEGTFLEC LLARRDFDIDSLCDLMNDLMIAAVETTASTLAFNLYCLAKNPEVQERVFQEINQAIPP GTKITAQSLQNLPYLKACIKETVRVFPTVDGTNRIPSREIALAGYRIPPDTIVRIHCI AGLMEEYFPSPEKFKPERWLRGDAECENIDPYLILPFGYGSRMCTGRRMAEQDLYIML IKIVQNFKIEWHHEEDMELIFRLTNVPDRPAQFRFVSRT >GLEAN3_25299 missing some seq between pro rich region and C-helix MIVDLVRDVFDVGFSSILLGIATTLVLVWMMKWMSTRRRLNLPPGPMAMPWPLNFVMD LLLLLQAKGGIVFSEGDDWKQHRRFGLAALRNFGMGKKSLENSINTEARILNEVLSGK IGKPFDPSFVINNAVSNIICAITFGRRFEYSDPKFKEMIQRINLFAAEDPGILSDLPT FISTARRRNLLEVRKFLEEEINEHKSTFEPSDVRDVIDLYLKEIYRAREAGEKNELDL TKAWPLVFDFFLAGTETTSTTLLWAFLLMAGHPEVQEKIVAEISSVIGSGATPRYEDR KLMPYTEATLVEVLRYRPIAPIGVPHRATSDLKVKGYDIVEDVIIVINILHIHHDPKI WGDPEVFRPERFLTEDGTALIKHEAYMPFGVGRRVCLGEQLAKMELFLFFTNILQQFK ITMPPGVQPNYDFGHRALTTLVPKAYEILLEER >GLEAN3_25595 missing C-term after EXXR motif MNDLSDVQYGDVFSFTMVGKTFTYLIGSEASALLFNSKNENLNAEEVYSNLTVPVFGK GVAYDVPNPVFVEQKKMLKTGLNIQQFKRHIPLIEEETREYFNRWGDSGEKNLFVALS ELIILTASRCLHGKEIRSMLDEEVAQLYADLDGGFTHLAWLAPSWIPFPSFLRRDRAH RKIKQIFYKAIAKRRETGVENDMLQTLIDSKYKSGRPLSDDEIAGMCIGLLLAGQHTS STTSAWLGFFLARDKEVQDRCNAEQIKVCGDASTEVSYDQLKDMQLLDHCVKEALRLR PPIMTMMRVAKSPLVSVPIIGPVVPRES >GLEAN3_25829 MAMILTVLGYVALLICVILGLTIAVLSAAILRTEYVLKDVPTPKNKSFFLGDIKTFGV WSEKIGMSVANILYLMTKDLGPVYYVRVFWKAHFFASAPEIVKELLLNSRHIKSQEEY HSFQKLYGERFMGHGLVSEVDHEKWFHKRAIMNPAFRRKYLISLMEEYNSGSEALCQK LKEKADGKIEVPMLDELNKVTLDVIAKIAFSMDTNAIGDPDCPFPAAITAALQGLQSL HERPWSTIDPRCISMRRSTRKAVHFIRETGRDQIKERIQARKRGDHVPSDLLNLILDI ANDLEGDKDFEMENMLDEFVTLFIAGQETTSNLLAFTILQLGRHPDVAKKLQAEIDEV LGQKHKIQYEDLAKLEYMMRVLKETLRLYSPVGGTTRLTAHPINYNGITIPAKCSLTV MSYVTSRMEEYFDDPLLFNPDRFKPSEDGSLPRHLYAYFPFSMGQRSCIGQQFALIEA RVILAKLLQRFEFRLEQSQSMGILDELTAKPMGRCRNYITLRA >GLEAN3_25830 MAMILTVLGYVALLICVILGLAIAVLSAAILRKEYELKDVPTPKNKSFFLGDIKTIGA WREKELLLNSRHIKPQEIYHGFQKLYGERFMGHGLVSEVDHEKWFHRRAIMNPAFRRK YLISLMEEYNSGSEALCQKLKEKADGKIEVPMLDELNKVTLDVIAKIAFSMDTNAIGD PDCPFPAAITAALQGLQSLHERPWSTMDPRCISMRRSTRKAIHLIRETGRDQIKERIR ARKRADHVPSDLLNLILDIANDLEGDKDFGMENMLDEFVTLFIAGQETTSNLLAFTIL QLGRHPDVAKKLQAEVDEVLGQKHTIQYEDLAKLEYMMRVLKETLRLYSPVGGTTRLT AHPVKYNGITIPAKATVTVMSYVMSRMEEYFDDPLLFNPDRFKPSEDGSLPRHLYAYF PFSMGQRSCIGQQFALIEARVILAKLLQRFEFRLEQSQSMGILDELTSKPMGRCKNYI TLRA >GLEAN3_25863 MGVDPVRDVFDVGSSSILLGIATSLVLVWMVRWMSTRRRLNLPPGPMPVPWPLNIVTE LLLKLQGTNPFVLLGKLAERYGEVVYIDNGGRRFVFLASPEAVNEVLVKQADITSGRE DFWELKAVTHYHGGIIFSEGDDWIQHRRFGLAALRNFGMGKKSLENSVNAEARILNEV LSEKIGKSFDPFFAMNNAVSNIICAITFGRRFEYSDPKFKEMIQRINYFVAEDPGPMS NLPTFISNARRRNLVAVRKFLEEEVNEHKSTFEPSDVRDVIDLYLKEIYRAREAGEKN ELDLTKAWPLVFDFFLAGTETTSTTLLWAFLFMAGHPEVQEKIVAEINSVIGSGATPR FEDRKLMPYTEATLVEVLRYRPIAPTGVPHRATSDLKVKGYAIPDGVNIAVNILYIHH DPKIWGDPEVFRPERFLTEDGKALIKHEAYMPFGVGRRICLGEQLAKMELFLFFTNIL QQFKITLPPGVQPNYDFGHRVTTLVPKAYEIILEER >GLEAN3_25956 missing N-term starts with C-helix MVLYDLFEVYYREQTTASSVIWENGDKWKSSRRTTLTAFRKSLSSPSTLGATVKSSWQ HLRVSLQSLNGQHCNPRSEITSAIAEIISSLCFGHCVDVRSDPKIADIMQAQHNISRG LSGISLGHIFPHLYHTPLYSKLRRDIEQYTQFMREVVKDHKLRLVGNDDGKCKDGNRK EYITDVTDCYLQQTQHGGKVAVEEEDVWRVMFDLMGAGTETTSETLLWGILYLCLFPD VQRQVHSEIDENMALYGPDDVINYDQRKSLPYTQATILELLRCACVAPLSLPRVTTRD LTVLGYSIPRGTEIWVNEWAINRDRNVWNKPHDFNPARFLDDSGRVDPKLKESIISFG EGRRMCLGRCLAEHMLFVFFTNLVRWFTFRLPEDCRETPNVNEGNFGLTYSPPQFDVI ITPRVFGN >GLEAN3_25969 missing N-term MTLWYMFSKRSSYDSPLQNDILNLTDKAEKPSGSGNPVDFLPWLRFITTKYKTTLQEY RKKFGKLFSGVIEEHAVLYEPGSERDVIDHLISTGHNSGEEELQRLGLAGKALYQSAF DFFPAGTETVLATLEWIMVYMVTYPNVQSDVQHEIDQFVGRGRLPTLNDRDNLPLTQS CLLEIQRHAAAVPFAVPHSTTQDTVLDGYYVPKDMLVFVSLYSAHFDPDVWETPEVFN PRRFLAQDGTLDEEKTKLAIPFSLGPRRCLGSGVARIQLFIYFTTFLHQLNFKCPEGQ KASLDATYKFPRRPKVFNVQIDPR >GLEAN3_26188 missing C-term after EXXR motif MWRRPLLLVRDLDMLRDIMVKDHHRFYNKFPLPMSSGSFDNALFLLRDAQWKRVHDIM TPTFTGKKMKMMSGILNEGADDMVEKLLQSCKEDGNIDSLDNGLPYAYSVFTFQCGIY PSLLYIKIKLNGLKKIVGIIEKRVDFLQLLIDAKNGKSKMANENDEDDIHNKYFKDAG ADENILTKTQKYMTREELLAQSMVFLVAGYDTTATLLTFASYLLASNPDCQDKLIAEI DDIAPKRDDVGYTTISKMPYLDQVVCEVLRIYPPAVV >GLEAN3_26225 MTYFKRKGIKGPSPYPVVGNSLDLFQGIHWTTQDYVKKYGNVFGMFMFSSPVFFISDL EMVKDITVKSFSKFMNHKTFVLKSRPFDKALTGLKDQHWKDVRNVITPTFSAVKMKQL SVLLNESSDTLVKNFGAIQEKDGKVKVFGLFEAFTMDGIAKCAFGIQVDSQNDPSDPF VTNAKDLMKNRFSLSTMIAVTFPRIGKIMDYYGVSTMSPKISSFFISVVDQAIAARKE SGGSKMKDFLQLMLNAAQENKSEKEMIEEKPQDEKVEGDLHALIEDQGNKSYFDLKKS KKTTLTKDEIYGQAIIFFLAGYETINTTLGFLTYSLATNPEVQDKLIEEIDNETPTRE DVGYNSIAKMSYLDNVVCEILRLYPAGVIVERQCNETHVCNGLTIPKDSQVMFPVFAI HRDPAIWPDPEKFDPERFTKENREGRHPLAWMPFGAGPRNCIGMRFALMEIKMATVRL LQKYRFETCPETQIPPKFAKMSLKPDDDLYLRVVERS >GLEAN3_26360 missing N-term starts at C-helix MGFELLCTKATIAFSNGKPWVDLRRFALPALRSFGFGKQSLVPQINLEARYLSEEIRN LRGEPTDLATTFSKATANIICQLLFSQRYEYSDGEMSRILKSMDEFLSLIHETDLVNI FETLIHTPRYKPYRESIFSDRDFIMSHLNSHRETFQKDNIRDFTDAFLADDISKEFEL EHFWRVLLDLFIGGTDTTASVTSWTILFLSVHPDVQRKVQAELDTVVGRGRQPNTLDR PDLPYCNATLTEVMRIRPILPVSVPHMTSDDVSFRGFTIPKGSIIIPNLWAVHHDPKE WNEPDTFNPDRFLSADGKQFQKNEAWMPFGVGRRDCVGSQLAKMETFLLFTNLFQQFE FKLPPNQPNPSMRGANRFTMNPQPYKICAIER >GLEAN3_26373 MAMAYRRLRHRLLLPPGPWFLQLRMVYYMLTMRLPCRLEKIYEAQSKKYGKMFSFSLG SDIKIVISDPDIIREAFGKHRAAFSDRRSPSVYAKSFGGEHNHRILFANGRLWKTSRR VVAPCLRNIGTWGQQMESIILREADSLTDKLAELGSKPFDPRHHLTRAVANIIHMMVH GYNLDPDDPRLQQMLDSFQLVVTHPMVTSNTLRYLPFLYYLPTTRPLRQAFQLLKTML STAVSTQANGYDQSSINTIVKRIMQAAEEDDDVGEDRLKTKEVWRLVYDLMLAGSDTT SNTLKWAVLVMAYYPDIQKKVHNEIDCVLGERSPCLQDKKSMSYIQATIMEVLRVRPV LPLALEHLTSETTVLRGYTIPKGAILIPNVWSCHHNPEYWDDPEVFNPGRFLSEDNQS LGEEPHAFLPFSFGPRKCLGDQLAKNELFLIFAKIMQRFHIDFAPGDRPESFDEGFDK GLILRPNNMKVRLQPRKIYSHEE >GLEAN3_26477 METKVLPFDVNSYTLGVAFCTCFLSWVAVKRYRTEAKEESLHSLPPGPPKWPLLGNIP GMMMAGSSSKYLSDLVSEYGPVYTLYMGPRPVVVLNKYDIMKQALMDGHLLSDRGDLP ALDDYFKGRGMMMAGSSSKYLSDLVSEYGPVYTLYMGPRPVVVLNKYDIMKQALMDGH LLSDRGDLPALDDYFKGRGIVGSHYTEYWKNTRSFVIHSLRQFGVDRQSFESVIASQA DILVEDIRKRGTFDPLLCLQSNVCSVICGVVFGKQYNLDNPRFTDLLETVYKALRAIE YAAVSSLVPGLHYLPGCSYQKFYACMDKLMRFFREEINAHDETLDPDSPRDVIDMYLS KIRRASPEDPTYGLFSKTNLEFVVNDLFMAGTETSASVLYWAILLIASNPGIQERLQK EVDDVTTHCDHVSIADAASMPYTRATIDESLRYASVAISSMRSTTSTMTLGKYRFEVG TWVMLNYEYAMFDPDHWSEPKRFKPERFLDDHGQYCPDEQLIPFSIGKRNCIGEKLGR MELFIFFTNIIKNFELVFPDDAPHPSVEAATGGMMRVPNRYSITFKDRTTTAA >GLEAN3_27152 missing N-term starts with I-helix MSELMLKFLCFIAGIQTTVETLYWAMALLATYPEIQAKIRAEIDDVIGRDRLPTINDR GKLVYTEASLYEVLRYSSIVPTNLHATTRDTEFGGYHIPKGTVVLINTHSMHYDPQEW DQPDKFLPEHFMDGSGTVREHPPSFLPFGAGRRGCLGEAVAKADLFLIFSWFLQNYTF SKVPGKESEDILNMIPQTAFGRLLISYEIMINKRD >GLEAN3_27153 MIDLLGAGESGFGQINALTITATLGMVTATLALWTIVRPKRFPPGPRGLPIVGSIYSM NDSPEVVFGEWAKKYGDIFGFKAGERWIVVLNRQALIKEAVLKQSVGFADRPDFLSLD IFSDGFKDIAFSPYSETWKLHRKLAHSALRHFATGKPLQDLISSVYPKVEKKLAMTEG QPIDPKVLITLLMYNVLAQMCFGRSLLLKAQEDARKEGENVDKLTDTHIFQTIADIFT AGILTTVETLYWAMALLVTYPEIQAKIRVEIDDVIGRDRLPTINDRGKLPYTEASLYE VLRYSSIVPNALPHATTRDTEFGGYHIPKGTTVIINTYSMHYDPQEWDQPDKFLPEHF MDGSGTVRDHPPSFLPFGAGRRGCLGEAVAKADLFLIFTWFMQNYTFSKAPGKESEDI LKMIPQTASGRLLLSYEIMINKRD >GLEAN3_27613 MHLTCTVVKVVMVVGMLMVMTPNSDALSGKTPPGPGLLGSFTQLKRVVSDFFRSRSAA EMISEKEGGLYQLNLGLSKICFINSRGFRDEAFLEKYHLFSDRGVESNKKQLPEELMQ WRGTIRDNGAASDEARAFVNSTLKETSYLRTTVETQIEEAIAFFDDELTGAIPSDLVV PKMIFREHVLDVMTRIAFDTAGHRNTISGQWQFQKFFEDVDALSNSVLFQPGFISTVL PNTKVDDYETNLVGVAKYLKERAEEAIYTNRETSLVQRYSDFFLKRESSIAQPEDGWR LMYDFFTAGVNTVPEVITWDIYYLASDLDLQDKIRQEMKNFGTDVTYDDRTSLPRTMS FLTEVHRLRPVATLGIPHAASADGTIGGYAIDKETLIIPNHWAMHNDPSDWQSDPQVF DATRFLKLNGNDLTFDSDKARIVTPFSVGPRSCPGQEVARRTIFLTIANLVQKYTVRF PEGPVVDVSNVKGSFTMYAPKFEVLLEPRSRPPPQRN >GLEAN3_27796 missing N-term and some middle seq starts at C-helix MDSIASCGFGLKVNSQEDKDDPFVEHVARFFKVSFTSPAFMAVSFMPFLKHIFEYLNI TLFPRDVIEFFSDVTEKAVALRENNTSEKRADFLQLLIDAKNGRTETANENDDDDIHN KYFKNAGTEDDVPTKAQKYMTREELLGQAIIFLAAGYETTSALLTLTSYLLATNPDHQ DKLIAEIDDIAPKRDDVSYSTISKMPYLDQVVCEALRIYPPATLTDRECGETCTYKGI KIEKGAHIWIPPYTLHHDPHHWPNPTKFDPDRFSKENREGRNPFTWIPFGAGPRICIG MRFALMETKMALVRSLQTVRFEVSPLTKIPPDIGQSQILSSTASYMLKVVRRH >GLEAN3_28152 MAIGIGIIAFIMALVAIVTVPLLKFIRLWIVFNKFHGPPAIPLLGNALQFKQDPNELV EQVSSWVEEYRTKSKGLMRLWLGPFPVLFIYNPHEVEPGKEPTRHIYEESEGKRLGRQ ITGKKWFHRRKMLTPTFHFAILHNFIEVFDEQSNTLCQKFGEHAQKQDLVNICPLVTL CVLDIISETAMGKQLHAQDESNNDYVNAVMRMSDIIQQRQKKPWFWPDPLFDNTKSGK EHAQCLQILHKMTNTLIDERSKEENQKVDSNDNDGSKRRIAFLDLLLKMQRDDPSFTK SDIREEVDTFMFEGHDTTAALASWAMFLIGHHTRVQKKLHQELDSVFGDSDRPVTADD LQKLPYLTCVLKETLRIFPSVPIVGRDLQEDCIIDGKLAPRGTLLIIAIGSLHRDPTQ FPEPLMFDPDRFLPEFSSKRHPFSYVPFSAGPRNCIGQRFALMEDKVLLANVLRCFSL ESTQSLKDTMPIAELILRPSEGIHMKITRRKGISNLDSKLLQRKQNNQRSRLVTV >GLEAN3_28699 missing N-term FVVLNDRESITQALVKSGEFFSDRFVPGHFNWGIPDPGKKATIAHSNGKPWVDLRRFS LPALRSFGFGKQSLVPQINLEARYLSEEIRNLRGEPTDLLTTFSKATANIICQLLFSQ RYEYSDGEMSRVLERMNETLTLIPETDLVNIFEPLIHTSIQRYKRYRESLFSDRDFIM SHLKSHQETFQKDNIRDFTDAYLADDISEEFELEHFWRVLLDFFAGGTDTTAVVTSWA ILFLSVHPEVQGKVQTELDTVVGRGRQPNTLDRPDLPYCNATLTEVMRIRPVLPVSVP HMTSDNVSFRGFTIPKGSIIIPNLYAVHHDPKEWNEPDTFNPDRFLSADRKQFQMNDA WMPFGVGRRDCVGSQLAKMEAFLLFTNLFQQFEFKLPPNQPTPSMCGANGITMKPKPY KICAIER >GLEAN3_28922 missing C-term MGFTETEPGYWLLAGVLGLLVAWLVKHLITGPKNLPPGPTGLPLLGVAWQIYKGGSDP LALFAAWSKKYGEMVSFFVGPKPIIILNSYPVIFEAFRHPDLQDRLGSRLLQEIIGLK NSGIAFSNGDVWKEQRKFTHSVFRSLGVGKKSYEDTVSAEMTQLSSVIKEKKGSPFDP STLFMQAVSNIICSIVFGTQYTYDDEAFKEIPFGKMKLMTNNIRSVNAFIQSQIESHE KNRDPDNQRDFIDQYLNKLDETKVK >GLEAN3_28934 MFFLSSTSVLLGVLGFLFCLWFVRRPKNLPPGPWSLPIIGYRFDSGLIHEAFVDLTRK YGPVFSVRRGPFLFVVLNDRESMKQALVKSGEFFSERFVPGHIRWGIPDAEKNASISW SNGKPWDDQRKFSLPALRSFGFGKKSLVPQINLEARYLADEIKALHGEPMDLSAVLNK ATANIIVQLIFSRRYEYDDPEFIGALEAMADIFSLVSDTDPVNVFESLVHTPWYKPYR DCTSKLRDFIMSHLNNHQETFQKDNIRDFVDAFLAHDISKDYSLDKFWRILLDFFAGG TETTAVVTSWAILYLAVHPDVQKKVQTELDVVVGRGRQPNTNDRPNLPYCDATLMEIM RIRPVIPVSLPHMTSADVSLGPYTLPKGTIVIPNLWAVHHDPKEWCDPHLFNPDRFLS ADGQTVVKNEAWMPFSIGRRDCLGMQLAKMESFLLFANLFQQFDFKLPPDQPTHSMRG HPGLSMPPESYKICAIER     >?  . / 0 y z * + - . / O P Q   < = j k l y z { | % & ' F G H ķķĥķķĥhbzB*OJQJph hbz0JjhbzB*UphhbzB*ph#hbzB*CJOJQJ^JaJphhbzB*CJaJph#hT'hbz5B* CJ \aJ phhbzA? / 0 z + Q  = k l z H 3ed@ed@ 23mn123@ABCp78nrs#$]^nIJn678XYZnpq!"\]n3 hbz0JjhbzB*UphhbzB*phhbzB*OJQJphU3n23A8s$^JZq"]ed@W)TqrH{ ?@%&?Ued@345UVW()25MQST^hpqrGHz{  >?@$%&>?TUnopq~78غhbzB*CJOJQJaJph *hbzB*OJQJphhbzB*OJQJph hbz0JjhbzB*UphhbzB*phIUopq9X CDR G3nB}ed@89WX BCDQRST   FG23mnAB|}- . d e f s t u v ! ! !,!-!.!h!i!! hbz0JjhbzB*UphhbzB*OJQJphhbzB*phU. e f t .!i!!!"U"""#A#v#w##?$z$$$+%f%%%%%&ed@!!!!!""T"U""""""##@#A#u#v#w######$$$=$>$?$y$z$$$$$$*%+%e%f%%%%%%%%%%%d&e&f&&&&&&&&8'9's't'''''''''''((((((((9) hbz0JjhbzB*UphhbzB*phhbzB*OJQJphU&&&9't'''''((:)u)))&*a*****++&,a,,,-M-ed@ed@9):)t)u))))))%*&*`*a***********+++++++++%,&,`,a,,,,,,--L-M-----------q.r.s....... / /E/F////////1020506070D0E0F0G00hbz hbz0JjhbzB*UphhbzB*phhbzB*OJQJphTM-----.. /F////206070E0051p111111223M33ed@ed@00000004151o1p11111111111112222222223 333L3M333333338494s4t44444444+5,5-5M5N5O5y5z555555*6+6e6f666Ͻhbz0JOJQJhbzOJQJjhbzOJQJU *hbzhbz hbz0JjhbzB*UphhbzB*phJ33394t4444O5z555+6f6666V7778B8}888.9M9N9\9ed@ed@6666666273747T7U7V77777788A8B8|8}888888-9.9L9M9N9[9\9]9^99999:::*:+:b:c:::::;;;;;;;;;;;;;;&<'<a<b<<<<<<==M=N=========>> hbz0JjhbzB*UphhbzB*phhbzY\9:+:c:::;;;;'<b<<<=N=====>>?L????8@s@ed@ed@> > >>>>>>>??K?L????????7@8@r@s@@@@@@@@@@@AAAAAAAA/B0BjBkBBBBBBCCVCWCCCCCCCCCCCCCDDDDDDDDD-E.Ehbz0JOJQJhbzOJQJjhbzOJQJUhbz hbz0JhbzB*phjhbzB*UphMs@@@@@AA0BkBBBCWCCCCCCDD.EiEEEFUFFFGed@ed@.EhEiEEEEEEFFTFUFFFFFFGG@GAG{G|GGGGGGGGG+H,H-HMHNHOHnHoH~HHHHHHI IZI[IIIIIIIIIIIIIJJJJJJJJJ KKK KZK[KKKKKK L LLLL$L%L *hbz hbz0JjhbzB*UphhbzB*phhbzWGAG|GGGGOHoHHH I[IIIIIIJJ K[KKK LLL%LLMed@ed@%L&L'LLLLLLLLMMTMUMMMMMMNN@NAN{N|NNNNNNNNNNNOOOOOOOPP(P-PUPVPPPPPPPPPPPPPQQQQQQQRR@RAR{R|RRRRRR,S-SgShSSSSSSS *hbzhbz hbz0JhbzB*phjhbzB*UphWMUMMMNAN|NNNNNOPVPPPPPPQRAR|RRR-ShSSSed@ed@SSSTTUIUUUUUUVV6WqWWW"X]XXXXXYYZIZZed@ed@SSSSSvTwTxTTTTTTT UUHUIUUUUUUUUUUUUUVVVVVVVV5W6WpWqWWWWWW!X"X\X]XXXXXXXXXYY Y YYYYYYY ZZHZIZZZZZZZZZZZZ [ [G[H[[hbz hbz0JjhbzB*UphhbzB*phXZZZZZZ [H[[[[4\o\\\\\']b]]]^N^^^^:_u__ed@ed@[[[[[[3\4\n\o\\\\\\\\\&]']a]b]]]]]]^^M^N^^^^^^^^9_:_t_u_________``M`N````````9a:aGaHaIaVaWaaaaabbBbCbWb}b~bbbbb.c/cWcicjc|c}c~cccchbzOJQJhbzB*phhbz]____`N````:aHaIaWaaabCb~bbb/cjc}c~cccd;dvded@ed@cccd:d;dWdudvddddd&e'eWeaebeeeeeeeeffVfWfffffggBgCg}g~gggggggghhhOhPhhhhhhhhhhi)i*idieiiiiijjjPjQjjjjjjjjjjjjjkkkJkKkkkkhbzB*phhbzhbzOJQJ]vddd'ebeeeeefWfffgCg~ggggghPhhhhhh*ieied@ed@eiiijQjjjjjjjjkKkkkk7lrlllll#m^mmmnJned@ed@kkkk6l7lqlrlllllllll"m#m]m^mmmmmmnnInJnnnnnnnnnooAoBo|o}oooooo-p.phpippppppqqqqqqMqNqqqqqqq9r:rNrtrurrrrrsssss:s;sNsusvsssss t!t"thbzB*phhbz`JnnnnnoBo}ooo.pipppqqqqNqqqq:rurrrsssed@ed@s;svsss!t"t0tOtttu;uvuuuvvv@v{vvv,wgwwwwwed@ed@"t/t0tNtOttttttttu:u;uNuuuvuuuuuvvvvv?v@vzv{vvvvvw+w,wfwgwwwwwwwwwwxxxBxCx}x~xxxxxxyyy?yz*+;<=JKYZzϐА *hbzB*OJQJphhbzB*phhbzB*OJQJphWËы G3nٍڍ/jɎ?z+<=KZed@ZА F0kW˓&aהMed@ EFz/0jkzVWzʓ˓%&`a֔ה&LM &Z[Жі &FGΗϗЗݗޗ& *hbzB*OJQJphhbzB*phhbzB*OJQJphW [і GϗЗޗTʘ@{,gݚed@ed@&STɘʘ&?@z{+,fgܚݚRSțɛ RSȜɜ>?yz*+,-.;<fgܞݞ012?@ijߟhbzhbzB*phhbzB*OJQJph[ݚSɛ Sɜ?z+-.<gݞ12@jed@ed@jV̠ߠ=x)IJX)*8s$ed@ed@UVˠ̠ޠߠ<=wx()HIJWX()*78rs#$8^_Ԥդ4568CD~/08jk8VWlmn{|ŧƧhbzB*phhbz`$_դ56D0kWmn|Ƨ<w|}ed@ed@;<vw{|}#$^_ԩթJK֪ת(-LMit34no Z["#]^ӮԮIJ *hbzhbzB*phhbz^$_թKתM4o [ed@ed@#^ԮJ V̰B}#^Բed@ed@ '-UV˰̰AB|}"#]^ӲԲIJ^_`ano Z[^дѴ FG[\]^jkVW^̶ͶBC^_`mn *hbzhbzB*phhbz^J`ao [Ѵ G\]kWͶC_`ned@ed@nYZϸи EF̹͹BC}~./ijݻ޻STɼʼ?@z{ʽ˽̽ڽ۽PQƾǾʾؾپھ"#]^ʿӿhbzB*phhbz`Zи F͹C~/j޻Tʼ@ed@ed@@{̽۽QǾپھ#^Կ)dPed@ed@ӿԿ()cdOP#$%23mnYZn :;<IJn56npq#$FGY_n23mn *hbzhbzB*phhbz^$%3nZ ;<J6q$Ged@ed@3n2m UA| !/ced@ed@123lm 3TU3@A{| !./3bc#$%23AB|}$%&34no Z[ FGQRShbzB*OJQJphhbzhbzB*ph[$%3B}%&4o [ GRSaMed@ed@S`aLM89Z[\ijUVABuvw56pq!"\]mno|}-.hiTUi~hbzB*OJQJphhbzB*ph]M9[\jVBvw6q"]no}ed@}.iUN"1lXhed@ed@MNi!"01ADiklWXghivw'(bc(NO'(bc(DEFST(? *hbzhbzhbzB*ph^hiw(cO(cEFT@{ed@ed@?@z{(+,XYZghST?@abcpq!"\] HIwxy78rs#$^_RShbzB*phhbzOJQJhbz]{,YhT@bcq"]Ixyed@ed@8s$_S?zB}.ed@ed@>?yzAB|}-.;<>[vw{'(2Obco{ GH{34no{hbzOJQJhbzB*OJQJphhbzB*phhbzX.<w(c H4oZed@ed@YZ{ EF{*+,9:kl:WX'(:bc:NO .01:CIkl *hbzB*OJQJphhbzB*phhbzB*OJQJphW F+,:lX(cOed@1lXZ ()7r#^;ed@:WXYZ '()67qr"#]^r:;<IJr56pq!"&'(56pq!"\ *hbzB*OJQJphhbzB*phhbzB*OJQJphW;<J6q"'(6q"]34BeQed@\]q 234ABdeqPQq   ()cd)OPhijwx ))./ij *hbzhbz *hbzB*OJQJphhbzB*phhbzB*OJQJphTQ  )dPijx/jed@ed@)UV!")\] HIijkxy)*dePQhijwx( ) c d        O P             - . h i      *hbzhbzB*phhbz^V"]Ijky*eQijxed@ed@) d    P      . i      M    9 t     ed@ed@       L M       8 9 s t         LM89stSTu|:;<IJ56pq!"YZ[hiwx *hbzhbzB*phhbz^ M9tT;<J6q"Zed@ed@Z[ix)d,;v'ABP<wed@ed@()cd+,:;uv&';@ABOP;<vw'(:;<=>?@ABCDEFGHIJKLMNOPQRSTUVWXYhT'jhbzUhbzB*phhbz *hbzW(;<=>?ABDEGHJKLMNOPQRSTUVWXgdT'ed@XYed@01h/R / =!"#$% x666666666vvvvvvvvv666666>6666666666666666666666666666666666666666666666666hH66666666666666666666666666666666666666666666666666666666666666666p62&6FVfv2(&6FVfv&6FVfv&6FVfv&6FVfv&6FVfv&6FVfv8XV~ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@66666_HmH nH sH tH H`H Normal CJOJQJ_HaJmH sH tH L`L  Heading 1$@&5B* CJ \aJ phDA D Default Paragraph FontRiR 0 Table Normal4 l4a (k ( 0No List 4U`4 0 Hyperlink >*phDV D 0FollowedHyperlink >*phR/R Heading 1 CharCJ OJPJQJ^JaJ ph/TD/"D msonormaldd[$\$OJQJ@B`2@ 0 Body TextB*OJQJphH/AH 0Body Text CharCJOJPJQJaJ4@R4 T'0Header  H$B/aB T'0 Header CharCJOJPJQJaJ4 @r4 T'0Footer  H$B/B T'0 Footer CharCJOJPJQJaJPK![Content_Types].xmlN0EH-J@%ǎǢ|ș$زULTB l,3;rØJB+$G]7O٭Vj\{cp/IDg6wZ0s=Dĵw %;r,qlEآyDQ"Q,=c8B,!gxMD&铁M./SAe^QשF½|SˌDإbj|E7C<bʼNpr8fnߧFrI.{1fVԅ$21(t}kJV1/ ÚQL×07#]fVIhcMZ6/Hߏ bW`Gv Ts'BCt!LQ#JxݴyJ] C:= ċ(tRQ;^e1/-/A_Y)^6(p[_&N}njzb\->;nVb*.7p]M|MMM# ud9c47=iV7̪~㦓ødfÕ 5j z'^9J{rJЃ3Ax| FU9…i3Q/B)LʾRPx)04N O'> agYeHj*kblC=hPW!alfpX OAXl:XVZbr Zy4Sw3?WӊhPxzSq]y Ye  38!9)06>.E%LS[ck"t{{2&nӿS?\ Y3U&M-3\9s@GMSZ_vdeiJnsww|4ËZݚj$@M}h{.;Q ZXY.Oz&FA 7 X 4 U 7R t ,=e "##%r&&E((()**,,-M-.3/T/\112333666899;<<?,@M@ABB%DDDFGGHIIKwLLMNNQQQYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXT # @H 0(  0(  b S  ?C"@T'bz?A@YP@Unknown G.[x Times New Roman5^Symbol3. *Cx ArialC.  PLucida Grande7Courier?Z PTimesTimesC.,*{$ Calibri Light7.*{$ CalibriA$BCambria Math"1hU•'U•'((%0B@P $P'T'2!xx3% NormalPurple Sea urchin P450s Stephen NelsonMicrosoft Office User Oh+'0 (4 T ` lx'Purple Sea urchin P450s Stephen NelsonNormalMicrosoft Office User2Microsoft Office Word@F#@. P@. P( ՜.+,D՜.+,H hp  'Private Purple Sea urchin P450s Title* 8@ _PID_HLINKS'Ad*^Zhttp://www.systemix.org/codes/plot-header.php?tr=glean-davidson&expt=NASA%3Asignal&org=SU&CN=&cut=&log=&scaffold=82902&strand=C&begin=53000&end=79201&min=&max=&submit=GOWhttp://www.systemix.org/codes/plot-header.php?tr=glean-davidson&expt=NASA%3Asignal&org=SU&CN=&cut=&log=&scaffold=7510&strand=C&begin=12000&end=36195&min=&max=&submit=GOG7Thttp://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_05521&Submit2=Array+dataQhttp://www.systemix.org/codes/plot-header.php?tr=glean-davidson&expt=NASA%3Asignal&org=SU&CN=&cut=&log=&scaffold=1053&strand=C&begin=20000&end=40000&min=&max=&submit=GODNhttp://www.systemix.org/codes/plot-header.php?tr=glean-davidson&expt=NASA%3Asignal&org=SU&CN=&cut=&log=&scaffold=118&strand=C&begin=225000&end=264312&min=&max=&submit=GOB3Khttp://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_05160&Submit2=Array+dataB;Hhttp://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_03687&Submit2=Array+dataB3Ehttp://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_03607&Submit2=Array+dataBhttp://www.systemix.org/codes/plot-header.php?tr=glean-davidson&expt=NASA%3Asignal&org=SU&CN=&cut=&log=&scaffold=1111&strand=W&begin=46000&end=74958&min=&max=&submit=GO?http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&expt=NASA%3Asignal&org=SU&CN=&cut=&log=&scaffold=1629&strand=C&begin=39000&end=60000&min=&max=&submit=GO[<http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&cut=&CN=&log=&min=&max=&org=SU&expt=NASA:signal&scaffold=1629&strand=C&begin=61841&end=85073F9http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&expt=NASA%3Asignal&org=SU&CN=&cut=&log=&scaffold=14873&strand=W&begin=-18111&end=6000&min=&max=&submit=GOB;6http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_02899&Submit2=Array+dataC;3http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_02898&Submit2=Array+dataO:0http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_02884&Submit2=Array+dataI1-http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_02832&Submit2=Array+dataJ1*http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_02831&Submit2=Array+data_'http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&cut=&CN=&log=&min=&max=&org=SU&expt=NASA:signal&scaffold=498&strand=W&begin=104931&end=158815.|$http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&cut=&CN=&log=&min=&max=&org=SU&expt=NASA:signal&scaffold=498&strand=W&begin=39775&end=61911=z!http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&cut=&CN=&log=&min=&max=&org=SU&expt=NASA:signal&scaffold=498&strand=W&begin=-2381&end=20027Chttp://www.systemix.org/codes/plot-header.php?tr=glean-davidson&cut=&CN=&log=&min=&max=&org=SU&expt=NASA:signal&scaffold=41502&strand=W&begin=-1984&end=9572@:http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_02380&Submit2=Array+dataA5http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_02371&Submit2=Array+dataF8http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_01792&Submit2=Array+dataG3http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_01622&Submit2=Array+dataC3http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_01525&Submit2=Array+dataI3 http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_01228&Submit2=Array+dataB4 http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_00746&Submit2=Array+data@4http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_00645&Submit2=Array+dataH http://www.systemix.org/codes/plot-header.php?tr=glean-davidson&cut=&CN=&log=&min=&max=&org=SU&expt=NASA:signal&scaffold=237&strand=C&begin=230573&end=257235H7http://www.systemix.org/codes/plot-header.php?org=SU&expt=NASA%3Asignal&tr=glean-davidson&ext=20000&gene=GLEAN3_00279&Submit2=Array+data  !"#$%&'()*+,-./0123456789:;<=>?@ABCDEFGHIJKLMNOPQRSTUVWXYZ[\]^_`abcdefghijklmnopqrstuvwxyz{|}~     Root Entry F0;P1Table$WordDocument2SummaryInformation(DocumentSummaryInformation8+CompObjr  F Microsoft Word 97-2003 Document MSWordDocWord.Document.89q