Purple sea urchin P450s

 

From  the sea urchin genome project

http://annotation.hgsc.bcm.tmc.edu/Urchin/cgi-bin/pubSearchGene.cgi

search was with P450 as synonym.  Found 137 hits including one reductase.

Nov. 29, 2006  (in progress, expression data links  still being added.

These are not named yet.)

 

GLEAN is a gene prediction program that merged four other gene prediction

algorithm results from sea urchin. (see p. 14 of the supp. section of the Science

paper (Science 314, 939-952 2006)

A quote from http://www.systemix.org/sea-urchin/primer.html

"[The] GLEAN set for example is very conservative and therefore misses many existing genes."

 

>GLEAN3_00279

expression data for GLEAN3_00279

This prediction is short (219 aa), missing N-terminal half

MKESKDQNGNDVFCADHVVMATWDMFLGGYEATYKTIKWAIAFLIHHPEVQAKIQSEM

TAGFGDRMPTWEDRAKLPYLEATVLETLRQSSMSSMNGPRRTTCDTQVKGYDVAKDTT

VFCNLWWVHHDPKNWKDPMSFRPEHFLTEDGKVSVPRAFLAFSLGSRACIGSQLALMQ

LILLLGGLLQRFNLRQAVGEPLPDLEPGSELVRTVGDYTMCLEVR

 

>GLEAN3_00436

expression data for GLEAN_00436

MEINFLGFSFSLTWFLVGAIFIIYICNDLWIRSYWTRHGFKSPKSVPIFGNTLQWTKG

IQHAFLDYFKKQGTVFGVYDFHRPGLCITDLDLLRNIMVKDFGSFTNRWATPLLNKDL

LDALFFSKDDHWKGIRNIITPTFSAAKMKLMSPLINKCADRLLKHLEKHQKIHGNIE

CRELVGAFVMDTIASCAFGLNVDCQENRHHPFVTNAKKAFAGNTFSSPLLRIDYLQLL

LDAQMRDITGNNKNKADGIHGDPEDEAIDEGGLQKKSSSKIRLNESEVISQALIFFLA

GYETTNTTCGYMLYLLATHPDVQDKLVNEIDDVAPEAEDVGYQSISKMPYLEQIFCET

ERIYPPALMTDRVCNEPFDINGFTVPKGMRIFIPIFTIHHDPNLWPDPETYDPDRFSK

ENREKHHPCAWMPFGTGPRNCVGMRFAMMEAKMVIVRILQKYQIETCPQTEIPPKQGT

NGLISPPNGITLRAVKRN

 

>GLEAN3_00645

expression data for GLEAN3_00645

This gene is about 75 aa longer than usual at the N-terminal

MATRLSRFLARARPSRSSGVRPQTFHDAHASSCPLAAVPSTLSPHASTTTPSTPSTTK

TSTTSTTSAPSSISDLPSNNNLSLGVSNGCTFDQAPASINPSERNCTIFDDVKSFKDI

PGPKGWPILGTLGTYLSGKGLERIYDHQIDFTKKYGPIWKERLGTLEFVNLAKPDLVE

VMYRNDSRYPRRLDMKPWKVYRAHRDESLGVLTHEGKEWHRARRTIGRKIMPPREVAK

FTGVINEIVTDMVERLRFVRDTKGEGDGVVPELQNEMYKWSMESIFKLLLETRIGCK

EPIPEKSQGFINAVGQMFASGQIMFLTSPEWFRRWNLKPWRDHEAAWDYIFNFAKKSV

DARLQELATLAEEGDVVEDSGILTHLLASQQLSMKEVYSNACELLLAGVDTTSNTLAW

ALYELSRHPDTQDRLAKEVEGALAAKGRCIPEHDDLPNMPLLKGVIKETLRLYPVVPA

NNRALDKDVTIGGYHIPKGTMIGSLQYVMGRDPEIYPDPDTFLPERWLKGSEAISSEE

KASSFSSLPFGFGPRMCIGRRIAELEMHLALARISTSFKLRYHGTEEVFPAIRGILVP

GSPLKISFRDR

 

>GLEAN3_00746

expression data for GLEAN3_00746

Missing the C-terminal

MSRNIGFAHLATRYGGLISLRLGTKLTVVLNDVDSVKQSLVKEADVFTDRNVPIEYSL

ALPLEGSLAFASGDKWRELRTFVAKAFRQFEVEKEKLESVIVRESERLCKAFQDEANK

SSSDSQGFDPHHIINNAVSNIICSMCFGQSYEYSDPEFHHILTVVGNSFRSLSPSGLL

LISRYFYYTPLYREVRHNLAVLQDFIRGIVQRHQETFDPDNTRDIIDMYLAEVRQLKE

DGMDTYLKEENIWRGVFDIFLAASESMTSALRWIFLYLIDNPELQRQV

 

>GLEAN3_01228

expression data for GLEAN3_01228

no seq. provided, Scaffold 91975 12823-12912, 11663-11790, 11303-11390, 10483-10647

(minus strand exons) This seq derived partially from two of those sequences

best hit is to a Fugu partial sequence like CYP4V

DFVLDGYKVPKGYAVIYVSSIAHRDPNVFKNPNSFDPTRWAN

RTEKEKLLFCYGDGPRCCVGTSLMKNLI

 

CYP4V5-like fragment from Fugu FS:S000209 Scaffold_209 52% to 4V5

 

 Score = 55.8 bits (133), Expect = 1e-11

 Identities = 23/40 (57%), Positives = 29/40 (72%)

 

Query: 1  DFVLDGYKVPKGYAVIYVSSIAHRDPNVFKNPNSFDPTRW 40

          D  + G+KVPKG+  IYVS   HRDP+VF+ P+SF P RW

Sbjct: 1  DSTIAGFKVPKGHGAIYVSHAVHRDPDVFQQPDSFLPERW 40

 

SEQUENCE 180  Ciona intestinalis

 

 Score = 212 (74.6 bits), Expect = 6.3e-20, P = 6.3e-20

 Identities = 35/68 (51%), Positives = 48/68 (70%)

 

Query:     4 LDGYKVPKGYAVIYVSSIAHRDPNVFKNPNSFDPTRWAN-RTEKEKLLFCYGDGPRCCVG 62

             L+ Y +PK YA++YV+  AHRDP VF  PNSF+P RW      +E L+ C+G GPRCCVG

Sbjct:   310 LESYDLPKDYAIMYVTHAAHRDPEVFPEPNSFNPDRWITCNAGQEDLVLCFGSGPRCCVG 369

 

Query:    63 TSLMKNLI 70

             T L++ ++

Sbjct:   370 TDLIRRIL 377

 

 

>GLEAN3_01525

expression data for GLEAN3_01525

This seq is short on both ends

YSILPKDLKYKREVRAAANEIRAMGRRCILARMDALRRGDQVPRDILTYILQESNNSE

GGIKDFDLEELVDEFFTFFGAGQETTFNLLSFTLLHLGRNPQVMKKLRDEIDTVLKGR

NYVEYSDVSKMKYLTLVLKEALRINPPFGHIHRLLPHEMDLCGYKVPKGSVVMIPVYG

MGRNEKHFKNPEKFDPERFTRDEDSPLFAYIPFSLGARACIGQTFAMEGFQLDTVIR

 

>GLEAN3_01622

expression data for GLEAN3_01622

MDLYTFLDPYAFFIMLMTVLVLYTQWFEYPISNVKPLPGPRKWSILGNLPQLIAARFD

INDFHERLRKMYGAVVGYRIYHHNVVVINDMKLAREVANHPDFCGRMRDELHANLSKE

PNAGILTSYGHTWKEHHDVLSGFVTALSKSKQLEDICTIEATKFIKVLTDQTDKDLII

NHYSVQTLAVIFCQVIFGKVLDVNDDIVKEVVDIVMERQRESPSATRRLLPSYVNNLT

GSGAKRRHHLDVQFVALVKTLVGDHTKQTEILIPCFVRRYWDERHEKLQSMEDGSMTS

LTSDNLLWACIDLVTVCVDEASLLLTQSLLLLASHPEVQSKLKEDIRPEAKEQDDQEP

SAYLAAVQLETQRLAPDNPWSTSMMALSDVKLGDFEILQDAVVVANWRYLRQGDDQGW

DEPTQFQPGRFLGEDGSLVEHPQMISFPAGKRACPADAMSSSASSMLIRHICQRFKLS

LPAQTTHPCDKNGLIQRSSLFKTKLRIVPH

 

>GLEAN3_01792

expression data for GLEAN3_01792

MVSFYVGPKPVILLNSYPVIFEAFRHPDLQDRPSSRLLQEIMGLKNSGIAFSNGEVWK

EQRKFTHSVFRSLGVGKKSYEDIVSAEMTQLSSVIKEKKSSPFDPSTLFMQTVSNIIC

SIVFGTQYTYDDDGFKKILHLVNLNVQLSGGGGTFLFLPIPGISKIPFGKMKLMTNNV

RSLNAFIQSQIESHVKNWDPENQRDFIDQYLNKLDETKDTVSSFTKLNLTGCIADLFL

AGSDTTTTTLNWCILFMMAYPQVQSRVQDELDHVVGRERIPRLDDIKDLPYTNAVILE

VQRKGVVVPLGVPHVAAADTSIRGYTIPKGAVIVSNISEVLNSEELWKDSGAFKPERF

LTADGELIKRDELVFFSTGKTKDQYFFFLCLIRGFIDYIYMVTFKDTILQIKFT

 

>GLEAN3_02371

expression data for GLEAN3_02731

MSHLYSICVGFVNLSSVFILMAMISLLAVGYFIKTRGSRFPAGPLGVPVLGYLPFLDC

RRLHRSLMSLGRRYGNVFSLKIGAQTVVVLNSADVIRECLVGKATSFDGRPVWMLNKV

NQGRGIANEQPTKKWQAHRKAWSKVTRSLTLHDLDMEGKISHDIARILSVLESSQGHP

IDVSKTVHMALCNIICSLCFGKSFSYNDLDFQHLLKMADKFFCYLSSASAVNFFPILW

YLPLKANKAVAEGYEGIFGFVKNLVEEKQDGVGRMQDRGFIDVCLDETKSKKNCDVVK

MKSKKTSADMPGRNIDRNAETTNDNKNDSISSADSHCHQPCGNGTACKEMDNDDENDV

LETLDEYSRRVICEDVTYVATDMFIGGAETTHAGVMWSIAFMVLYPDIQIRVQKELDD

MVGDQHLPVWSDRHHLPYTQACLQEILRLGNVIPIAIPHVTTRDVTMSNGCHLPQGTT

VLSNLYACHMDPSAWESPREFRTERFLDSEGNPKRFDHFMPFSIGRRMCLGEQLARME

LFLAFTHIFIKYDVSFAPNERQPCLEGRPGITHSPYRFKICAIPRKATKMER

 

>GLEAN3_02380

expression data for GLEAN3_02380

MDVFNEQSFIMAKKMEKFADQSEPFNIFPQITYCVLDIICDTAMGKSINAQGEGDNEY

VTAVISMTNLVQERMKKPWFWPDLLYDNIQSGKKHANNLRVLHDMTTKIIKQRLQEPP

RALDGSEEDAVAGKRRRIAFLDLLLQMHREDPSFTLEDIREEVDTFMFEGHDTTAAAA

SWTILMIGRHPEVQTRLHEELDEVFGDSDRPITADDLQKLQYLNCVFKETLRLCPSVP

MIGRDLEEDCVIDGKVVPKGTLVVLGIYALHRDPEQFPDPEKFDPDRFLLENSTKRHP

YAYVPFSAGPRNCIGQKFAMMEDKVILANLMRKFSVQAIQTMEETNPLGELIMRPRDG

IYVKLSRRK

 

>GLEAN3_02590

expression data for GLEAN3_02590

MVDTFNASADHLVNYLMERADGLQEVRMYDAFNNITLDIIAKVTFNMELNIIDDPDNP

FPNSILTCLNAIMTSINNPWMQYGLTKKDRKTKADVKAACDFLRGFGRDCVNARLAAK

ARGEELPNDMLTNILQASQDLEGNENFGIDEMLDEFVTFFIAGQETTSNLMSFTLEML

GKHPHVLQKVQDELDEKLGSQMFISFQDMGKLDYLMLVLKESLRMYAPAPVVTRVTGT

EVKASTGLIIPKGSQLNLSPFVMGRMSEYFEDPLVFRPERFVESKHSPYAYFPFALGQ

RSCIGQQFALIEARIIMAKLLQQFKFDLVPGQNWNLSYAVTIRPDDGCRNYVSLRH

 

>GLEAN3_02656

expression data for GLEAN3_02656

MSSVVNLLLSVIEYIGIANLLTTITICLVTFWYIRMNYGASFHYRLPPGPPVIPYPFD

FLMALFPSSNRFGRMIFLSKRYGDITCLQLGRRMIIMISSPSLVKELIVKQADVTSNR

SAPPFFTSVMDYKGGIIFSDGEPWQEIRRFSLSALRNFGMGRKGIETRIQEEARNLCN

AFAQHSSEAFDAMHLTNAAVSNIICSITFGRRLEYDDPVFMDMIQRLRNVTSGVGINP

IIRELMMRFPILLKTPLLGKPKANLLGVKNFITKEVIEHIESYHSSDIRDIIDMYLTE

AERLEEEKKKTNEDRTKPYLNKDDIWMCVFDLFLAGTETTSTALLWFLLVMAAKPDVQ

EKVISEIDLVVGRERAPCMEDRKDMPYTDATLTEVLRFRPVAPLGVPHVTAKDVEIGE

YTIPRDTEIMINIIGIHLDPRLWDEPEEFNPHRFLSEDGKTVIKPQDAFTVFGAGRRV

CLGEQLARMEFYLFATTLMQRFRFRLPEGESGDFGFEHCLGTIAPKNFKILAQER

 

>GLEAN3_02658

expression data for GLEAN3_02658

MESLKGYVTDWWNDMGIITSLLFGLFGSVIVFGIVRQTFGKREKLPPGPLKIPFPLNI

IASLFSKEGKMDMILSAKRKYGDLVSIEIDGQPKILVGSPELVRELFVKHGDVSSSRA

ASKPLKAMINYTGGIIFGESWQPLRRFSLSALRTFGMGKRSIASKITEEARIFTDGLW

ESQTVEGVNPSGLLNKAISNVICSITFGKRFEYSDPDYIIQVENIKKIMAGGNIPPVL

RIVAQRFPVIFRSPLSKNAKKRLQSLKDFIVQQVDEHDRTFEGDDIRDIVDSFLAECR

RLQEKGEQECYIRKEDIWMSVWELFIAGTETTASALAWFFLFMAAHPDIQEQVFKEIV

TTIGSERAPVYDDRKNMPYTEATLAEVLRHRPVAPFGLPHVADETIHVRDYVIPKGST

ILANILGIHHDPELFPEPSKFDPNRFLSDDGQTFVKNDAYTAFGLGRRSCLGEQLAKM

EFFLFATSVLQRFTIRLPAGATPDYSIGHRMMTLLPSDFYIHLDKR

 

>GLEAN3_02660

expression data for GLEAN3_02660

MELNAGFRILLYALLVITSTVGVAFVVFSWLLYRFNGKYGHLPCPRPTSFFFGHLSHI

NAVSKKMDMQTLTLMHQWMRDLGPVYCIRFILQPMVICSNPPTVKVGFDMDINAVENS

DCPFPSAIALALQGMDATFKNPLLMVDPRSKARQYRRSVRDAVKLVRNIGQDCIEKRR

EAKKKGEELPKDILTCILQAEDDVGNGEKIETADLIDEFGTFFVAGQETTSNLLAFTL

LEIGRHKDVAKRLKDEVDEVLGSKSIIEHSDLTKLEYMSRVFKETMRLNPPVSGASRE

LASDVQSCGYVIPKGASVLFLSYLTSRLEEYFDDPLLFNPDRFIPTDETPRHFFAYFP

FAIGQRNCIGQQLALIEAKVILAKLLQSFDFRLEQSQRHIIVSQVTNKPIDGCKNYIS

LRD

 

>GLEAN3_02831

expression data for GLEAN3_02831

only half of gene from I-helix on, missing PERF motif

MSELMPKTAYFCIFIAGIQTTVETLYWAMALLATYPEIQEKIRAEIDDVIGRDRLPTI

NDRGKLPYTEASLYEVLRYSSIAPTALPHATTRDTEFEHFMDGSGTIRDHPPSFLPFG

AGRRGCLGEAVAKADLFLIFTWFLQNYTFSKVQGKESEDILKMIPQTASGRLLLSYEI

MINKRD

 

>GLEAN3_02832

expression data for GLEAN3_02832

short at N-term, green = C-helix

MLPSPFHVIHFQVILSSSQHVRKGFLYRLLEPWLGLGLLTSTGQKWFHRRKMLTPTFH

FSILQCFMDVFNEQSTILVKKLEKFADKSETINIFPLVTNCVLDIICDTAMGRCTNAQ

EDGDNEYVQAVGRMGELVIARAKNPLTWPDYLFGKLNAGKEHDKTLKILHDVTDNMIQ

ERLKEPPSVTHEDEDETVARRRKRIAFLDLLLAMHREDASFTLKDIREEVDTFMFEGH

DTTAAAISWAILEIGQHPDIQERLHAELDEVFGDSIRPVTSDDLSRLSYLTRIVKESL

RIIPAVPMVARSLDEDIVLDGKVVPKEAMIMLHIYALHQDPQQFPDPDQFDPDRFLPE

NAEKRHPYAFVPFSAGPRNCIGQKFAMMETKLTLANIFRRFSIESVQTIEGAKPAGQL

ILRPVEGNILVKLSWRK

 

>GLEAN3_02884

expression data for GLEAN3_02884

missing N-term half, seems long on the end

MRILIDLANKSDDSSSKEGLYEAIKNQTIEFLTSGMLSLSSAASTMLYHLAKKPSVLA

KVREELNDYGLLTDYSPLTLNKINQCRYVMQVIKESLRMVPPIGATFRTALKTFELDG

KQIPKGWTVFWSIRETIHLSDSFTDTKNFDPDRFSPQRQEDKVAGRFSMPVFGHGTHS

CIGQNFALLTLRILLIELARTCDFETPNVDKIKFSWIPSPKPKCGMPTRFTRKILVDQ

NENEVSYSSKSSTCPQGEDSRERSEEGVHQRSTTCHAA

 

>GLEAN3_02898

expression data for GLEAN3_02898

MDLYTFLDPYAFFIMLMTVLVLYTQWFEYPISNVKPLPGPRTWSILGNLPQLIAARFD

INDFHERLRKMYGAVVGYRIYHHNVVVINDLKLVREVANHPDFCGRMRSKLHANLSKE

PNVGILTSYGHTWKEHHDVLSGFVTALSKSKQLEDICTIEATKFIKVLTDQTDKDLII

HHYSVQTLAVIFCQVIFGKVLDVNDDIVKEVVDIVMERQRESASATRRLLPSYVNNLT

GSGAKRRHQLNVQFVALVKTLVGDHTKQTEILIPCFVRRYWDERHEKLQSMEDGSMTS

LTSDNLLWACIDLVTVCVDEASLLLTQSLLLLASHPEVQSKLKEDICPKAREQDDQEP

SAYLAAVQLETQRLAPDNPWSTSLMALSDVKLADFEILQDAVVVANWRYLRQGDDQDW

DEPTQFQPGRFLGEDGSLVEHPQMISFPAGKRACPADAMSSSASSMLIRHICQRFKLS

LPAQTTHPCDKNGLIQRSSLFKTKLRIVPH

 

>GLEAN3_02899

expression data for GLEAN3_02899

C-terminal half only

NGTVSSYDELNTVFSLADIFAAASETTTTTLRWLMLRMVTHPDIQKKIQDEIDDA

VGRDRLPTLLDKPKLPYTEATILEVMRLNFVTPLGVPHQYTGRGELSFHGYDIPE

GAIVFANYWSISRHPKLWDDPEVFRPERFLNEEGKCVKPDELTPFSIGKRVCIGESL

AKMELSLLFTHLLHRFTFTLPDGVPSYSMKDSLGAVNTPVSYQL

 

>GLEAN3_03038

expression data for GLEAN3_03038

MGYDQLFNSVTVTTYLVLIITVVLITRYWINTKRESEKVARSGGRPLPGPWGLPVIGN

MLSLDQSGPHLSLMKLAKDYGNVFKIQMGSRPVLVLNGLEATKKALTKQPVVFAGRPD

LFTFKVINETNVYGPSISFSNYSEQWKLHRKLAETSLRHFTAGGQIQVLEKVIESEAQ

ELVRYLKASNKKSGANLKGLINLSVSNIVLWYLLSKRSSYKDKKLIELLGFADKFSKA

TGNGNPVDFLPWLKFIMTKSTRKLKEWQEQFGQMLQGIIEEHHAVYEPGSERDIVDHI

ITTSTKGGDEELKRLGLDGKTLLQIAYDFFGAGYETTSATLEWMLLYMASYPQVQSDV

QGEIDQFVGRGRLPTLNDFDNLPLTQSCLLEIQRHASVVPFAVPHSTTKDTVLDGYFV

PKDMVVFVNLYSAHFDPEVWDQPEAFNARRFLAQDGTLDKDKEKLVVAFGLGRRKCIG

SDLARINIFIYFTTLLHQLNLSCPQGQKVSLESAFGLVRRPVEFNVEVVPRS

 

>GLEAN3_03231

expression data for GLEAN3_03231

MIVIASAVGTALCAVYYLKEHYDYKVRKLPPGPTSIPFIGDVLSINPKSPHLSLIELA

KTYGDIFSIKLGSERVVVLNNAELIKSAYRGVDISHRPDLFCMDVLAGGKGFLTCKDE

IQRQMHMKICRQAMRVVNNADMGEKILQEANNLVNHFEAQNEKPFDPQCDLHIASLNI

LCNFVFGERYEHDHPEMREILDFSAEISKLLSPMHPVNAMPWLRHFPNKWFASLFKAK

NQRDRILMKKYVEHVVTYKAGRVRDMLDGLLAETTEAVAENNKQAVALLTPEHIIINM

WLIFFAGSDTVTNTLQWSLLYMAVFPKKQAKAQEELERIIGQSKRLSLEHKSQLPYLQ

AVINEVLRYSSLTILGVPHAAARDTEINGYFIPKGTSVMANFWSVHHNEDAWDQPEEF

LPERFLNDQGNLKEPSQLPHFMPFSVGKRRCLGANVAKAELFLMLGRLLQEFSFEMPQ

DVEADLQGEAAVSLIPKPYKIVAKKRHIIQPETI

 

>GLEAN3_03232

expression data for GLEAN3_03232

MDSASQLLLLHPTSVILPYGHSFENFTLNLYDMLSIKTTALQPVSAPHGSMAPILIPL

MIVVSIIVYYCQEWMEFNVDKVPLGPTSLPFIGNVLQLKSKALHLSLTDMAAKFGNVF

SMKLGRQRVVVLNDAQTVKTAYNGPNFTNRPSVFSIEFFVSKGFMACQKKHDFRIHTK

LMKHAFQAVTYTSLGDKLVEEAMDLINQFDSYNGDAFNPREDLNLVSLNILHNIAFGK

RYQKGDTELHEIFDYSSRIMKGVSPVHPVNLIPWLQRYPNPWMKDLSDARDKRDNMLL

RFYEEHKETYVEGEIRDMVDALIKVEQDAEEAGDMETLRLLSPLHIITNIWTIFFAGT

DTIHNALLWVLLYMAVFPEVQRKVQEEIDRVIGDRTPCLEDEQTLPYLCATFYETLRF

SCLSLLGVPHAAVCDTTLNGYHIPAGTQVLANFWAINHDETTWDNPSEFRPERFLNDN

GQLRNMKDFPHFMPFSTGRRACLGKNIGKSGIIVLSACLLQKLNIAVPTGEGAQQPTL

EPEVHFDLVPKQYDIVVTRRDSMRS

 

>GLEAN3_03606

expression data for GLEAN3_03606

N-term seems about 150 aa too long

MEAYVTEGTTKEGDDILEEPGHQQFKPEATLLLGDISLECDRGHYLNKDLETCVPCPA

GYYLDVASEICTPCNRGFFQDDQAQTDCKPCKNGTSTRNQGATHSNQCEVLCGPGYFS

STGFSPCMWCEQGSYQSLHGQTRCEKCPLMSTTPDHSGTSIDDCQDKMFDLSATSVFL

GVVIFLLCLWVVRRPKNLPPGPWSLPIIGYNFKPGLIHEAYMDLAKKYGPIFSLRRGS

FVFVVLNDRESITQALVKSGEFFSDRFVPGHFNWAIPDPGKKATISFSNGKPWVDLRK

FSLPALRSFGFGKQSLVPQINLEARYLSEEIRNLRGEPTDLAITFSKATANIICQLLF

SRRYEYSDGEMSKVLKDNEEILSLIPETDLVNIFEPLIHTSNQRYKRYREANFANRDF

IMSHLNSHRETFQKDNIRDFTDAFLADDISKEFELEHFWRVVFDLFAGGTDTTAVVTS

WAILYLSVHADVQRKVQTELDTVVGRGRQPNTLDRPDLPYCNATITEVMRIRPILPVS

VPHMTSDNVSFRGFTIPKGSIIVPNLWAVHHDPKEWNEPDRFNPDRFLSADGKQFQKN

EAWMPFGVGRRDCVGSQLAKMETFLLFTNLFQQFEFKLPPNQPNPSMRGANGITMNPE

PYKVCAIER

 

>GLEAN3_03607

expression data for GLEAN3_03607

missing N-term, green = C-helix

MYTPCATIAFSNGKPWVDLRRFSLPALRSFGFGKQSLVPQINLEARYLSEEIRDLRGE

PTDLATTFSKATANIICQLLFSQRYEYSDGEMSRILKSMDEFVSLIPETDLVNIFETL

IHTPKYKPYRESIFSDRDFIMSHLNSHRETFQKDNIRDFTDAFLADDISKEFELEHFW

RVLLDFFIGGTDTTAVITSWAILFLSVHSDVQRKVQAELDTVVGRGRQPNTLDRPDLP

YCNATLTEVMRIRPILPVSVPHMTSDNVSFRGFTIPKGSIIVPNLWAVHHDPKEWNEP

YKFNPDRFLSADGKQFQKNEAWMPFGVGRRECVGSQLAKMETFLLFTNLFQQFEFKLP

PNQPNPSMRGANAFTMNPQPYKICAIER

 

>GLEAN3_03687

expression data for GLEAN3_03687

missing N-term and C-term (no heme signature)

MVMTNSKHTKSHVDYDGFQAIFGSRFAGRSLVNETDHSRWSQRRALFNPAFHRQYVGF

GLDLNAMEDPSCPFPVAATKVLQGLQKSLLVPWYSYIPSAHARNFRKEVKEACHLIRE

IGRDCILSRIEAKARGDATPQDILTYLLDASKELQGDQNFGLEEMIDEFVTFFVAGQE

TTGNHLSFTLQQICRYPEVLKKLLIEIEEVLGDKPFVDYSDLPKLEYLMLVMKESMRQ

FPPVSGSTRSLAHEIECCGYTIPKGTRLRVNHFIMGKMEKYFDDPEEFRPERFQVSDE

TYKSRVGVE

 

>GLEAN3_05160

expression data for GLEAN3_05160

MAAVEFITENAGTLLVFVTTFTIVLTFLRNRTKTGFNIPGGPTALPFFGNLFSLKPHD

EFLDSLVGWSKEYGSIYSLYLGRKLILIVNGSEAVKELTCKTGLDFADRGESVAFLAL

NPNRLGLFDTLADEEWKRQRKFAHTTLRGMGFGKVSLEPLIQTEARHVISYFKTKVGS

LLDPSVAFSVTSSNIINKLVFNRRVDHGSESGDELVRHVRNLMVSDEGAVLTLFPSLW

PVMRYADSKWMKLYDSWAHIESFFHQEIKIREDRIDACKGELEASDFLTAYLIHRRAD

PEGYSLEALDFVLVDLFLAGTDTVSTTLTWAVLYLVSHPEIQQKIHDEIINEIGDKEV

SMSHRSGLTYTQAALLEVQRLATVAPLIPRESSRDAVLCGHKLPKGTTLLINLWSLHY

DQDVWKDPKQFNPDRFLNDDGTLVKQPESFLPFSAGKRMCMGDQLAKMELFLFFTMIV

QQIKFSLPPGADPLPMDGLLEIARKPKPFQVQVSAWSD

 

>GLEAN3_05253

expression data for GLEAN3_05253

missing N-term, green = C-helix, missing middle of gene from C-helix to EXXR motif

MGHGLVSEVDHEKWFHRRAIMNPAFRRKLQAEVDEVLGQKHTIQYEDLAKLEYMMRVL

KETLRLYSPVGGTTRLTAHPINYNGITIPAKCTVTVMSYVMSRMEEYFDDPLLFNPDR

FKPSEDGSLPRHLYAYFPFSMGQRSCIGQQFALIEARVILAKLLQRFEFRLEQSQSMG

ILDELTAKPMGRCRNYITLRA

 

>GLEAN3_05439

expression data for GLEAN3_05439

MGLFVVIVVYVLGLFACLLGIGLLVFVAAILRTEQKFSDLPGPKRDSFLFGNIFSIFE

AKKKLGQPIGLIIYTLSQDHGPVFWIRVFARGHVFASAPEIVKELLLNSRHIKPQSFY

KNFQYVFGARFMGNGLVSEINHEKWFHRRAIMNPAFRRKYLMGMMEKFNAESEKLCEH

LRDKADGRNEVEMLNELNNVTLDVIANVSFSMTTGAVGDPSCPFPAAVTLGLRGMQTI

IDKPWYSFDPRSESRKTRQETREAVNLIRETGRQQIEERIAARQRGDHVPNDILTVIL

DCANDLVGDKDFQMENMIDEFVTIFFAGQETTSNLLAFTIQQLGRNPDVAKKVQAEVD

EVLGQKPYVEYDDLAKLEYMMRVFKETLRLYPPVIGTTRVTAHPVIYKDMTIPAGVSV

SVLSVVMSRMEEYFDDPLLFNPDRFKPQEDENLPRHFYAYIPFSVGQRSCIGQQFAMI

EARVILGKLLQRFEFHLNQSQRLDMLDELTLKPLDKCKNYLTLRA

 

>GLEAN3_05521

expression data for GLEAN3_05521

missing the middle of the gene, no I-helix or EXXR motif

MAVLGTISGLFSMTTVYLMVAVVVIFFIYDYWTHTYFKRLGIPGPPPKPIFGNALDFG

EQPIFELLDEYANKYGPVVGIYMWRRPLLLVTDLDILRDIMVKDYHRFYNKFPMPVSS

GSFDNALFLLRDAQTDRECGETCSYKGYKIEKGMQIWIPAYTIQRDPTYWPDPLKFDP

ERFTKENREGRNPFTWLPFGAGPRICIGMRFALMEAKMAIVRSLQVVHFEISPLTKIP

PDVGLTSLLSKKSSYTLKITQRH

 

>GLEAN3_05655

expression data for GLEAN3_05655

MENEYGSNNPIPPRGSRFPAGPLGVPVLGYLPFLDCRRLHRSLMSLGRRYGNVFSLKI

GAQTVVVLNSADVIRECLVQKATSFDGRPVWMLNKVNQGRGIANEQPTKKWQAHRKAW

SKVTRSLTSYDLDMEGKISHDIDRILSVLESSQGHPIDVSKTVHMALCNIICSLCFGK

SFSYNDLDFQHLLKMADKFFRYLSSASAVNFFPILWYLPLKANKAVAEGYEGIFGFVK

NLVEEKEDGVGRTQDRGFIDVCLDDAKSKKNCDGVKMRSEKTFADMPGGNIDRSAETT

ADNKNDSTSSAASHCHRPCGNSRACKEMDDDENDILKTLDEYSRRIICEDVTYVATDM

FIGGAETTHAGVMWAIVFMVLYPDIQIRVQKELDDMIGDQHLPVWSDRHHLPYTQACL

QEILRLGNVIPVAIPHVTTRDVTMSNGCHLPQGTTVLSNLYSCHMDPSAWESPREFRP

ERFLDSEGNPKRFDHFMPFSIGRRMCLGEQLARMELFLAFTHIFIKYDVSLAPNERQP

CLEGRPGITHSPYRFKICAVPRKATKWNGKDSQTSSD

 

>GLEAN3_05665

expression data for GLEAN3_05665

sequence starts in I-helix, missnig N-term

MGFGKVALEPIIQEEVIGDEPVCLSHRGSMPYTEATLMELQRLASVVPIVPHAAFRDS

ELGGYHIPKGTRMWINMWSLFRDPVDWPNPNKCDPHRFLNKDGTKVVHPDSFMPFSAG

RRVCMGEQLAKMELFLFFVSLMQRFKYVFADPDTTPSFEGVVGLNRQPRPYKVKAILR

NK

 

>GLEAN3_05668

 

MTVFAPCRNLLEINVDIIQTMMFSVFVTTFLLGCTVSICCAWLLRRPKNLPPGPWSLP

IIGYRAGSGLIHECYASLAQRYGPIFSFRRGPFLVVVLNNREIIQQALVKNADVFSDR

FTPKHVNLRIPASNPNASVVWSSGKPWGDLRKFGLTALRAFGMGKKSIEPHINLEVRY

LCEEIEKLRGHPTNLLTMLNNTTANVICRIVFGRRFEYDDEEFQGILRGFFENFSTIS

GSDLVNVFSFLLYMPWYKDFRRNLSQSKSFIMAQLKLHTSSFEKDNIRDIIDAYLADE

GISKDFTVEEFCRVVLDFFAAGTETTSTALSWAILYLCLYPDVQTKVQSELDTVVGRG

RQPAISDRARLPYCEAALMESMRIRPVLPLSLPHTTSRSVSLGGFTIPQGTTIIPNIW

AAHHDPEVWDDPDDFRPERFLENHDSPTVKKNDSWIPFGLVKYDIFLFFPDKLRSSRV

SGGSASQDGVLSTVQRFEFKLPPDQSTHSKRGISGLTMPPEPYEVCAIER

 

>GLEAN3_05931

MALWNLAPVTVLVLSVLFSFLGYRCLLCMIAVWTEVKRIVHIRRTLPGAPPHWLFGNV

LQEPGPVTPGFEWHRNMPKSYRRLHVFWAGWKPIVVLNHPESVRNVLNGSVGTVKSEV

YRLFDEWLGQPMATTDGNLWKRHRRLITNSFHFNVLKSHIPKMNQIADTLISVISQRG

VAGELLELHKTTSAFSSDVILQCAFSYESGCQEGVSEYMKSVSTLAGIVMKRSLSPHL

TYDFIFRRSSLYKSWRKEIDVLHRLQEKLISDRREQHRQTGCTEFSKGKDILDTLMLA

KDTEGGLTDREMRDEINSFVFGGIDTTSSALTWLLYLLATHPEYQTKVQEEIDVLFQD

RDPEVCSEDLHRTPFLMKCVKESQRMYSFVTPGRLLTEPLVVDGLTVPAGTELTLFTY

QLHHNPDVWGDDHMTFRPNRFDRQNVEGRDPFAFIPFSAGARNCIAQQFALQEIQVAA

IRIFDKFGFTLVRDSTPVLRVVSVPEDEILLGNTFYNEIYFNLLLVLLQREDLHRTPF

LMKCVKESQRMYSFVSPGRLLTEPLVVDGLTVPAGTELTLFTYQLHHNPDVWGDDHMT

FRPSRFDRQNVEGRDPFAFIPFSAGARNCIAQQFALQEIQVAMIRIFDKFCFTLVRDS

KPVLRVVSVPEDEILLGIHPRSKK

 

>GLEAN3_06386

MVPCVSRVIRVGFAHLATRYGGLISLRLGTKLTVVLNDVDSVKQSLVKEADVFTDRNV

PIEYSLALPLEGSLAFASGDKWRELRTFVAKAFRQFEVEKEKLESVIVRESERLCKAF

QDEANKSSSDIQGFDPHHIINNAVSNIICSMCFGQSYEYSDPEFHHILTVVGNSFRSL

SPSGLLLVSRYFYYTPLYREVRRNLAVLQDFIRGIVQRHQETFDPDNTRDIIDMYLMS

NSEVGCRVWAPKLDAGVVCRSCMPGLGAEVRYLGAEIGCRDWLPRLGAGVGCRGWVSE

LGLEFGYRGLATRMAADAEIRADILRVIGPTRNPTIKDRDELPLIEATCMEVLRLRPP

APMGGPRQAKRDVEVKGYVIPEGTLVLTNVWYIHQNAENWHEPERFNPSRFLSEDGKE

LVKNPAFMPYGAX

 

>GLEAN3_06574

MGFTGTEPGYWLLAGVLGLLVAWLVKHLLTRPKNLPPGPTGLPFLGVAWQFVKGGSNP

LALFASWSKKYGEMVSFYVGPKPIIILNSYPVIFEAFRHPDLQDRLSSRILQEIMGLK

NSGIVLSIGEVWKEQRKFTHSVFRSLGVGKKSYEDTVAAEMTQLCSVIKEKKGSPFDP

STHFMQAISNVICSIVFGTQYTYDDAAFNKDVTRGGPRSRRVHVKAPCVSFPVALLGA

NRILDLVNMNVQLAGGGGSFVFLPIPGISKIPFGKMKLFTDNIRTFTAFIQSQIESHE

KNRDPDSQRDFIDQYLNKLDETKDTVSSFTKLNLTVCIADLFSAGSDTTTTTLKWCIL

YMMAYPQVQSRVQDELDHVVGRERIPRLDDIKDLPYTNAVILESFFGCNPDFYLSKVQ

RKGVVVPLGVPHVAAADTTIRGYTIPKGAIIVSNIFEVLNSEELWKDSGAFEPERFLT

ADGELIKRDELIFFSTGRRVCIGEQLARMETFLGFTSLLHRFTFKKPDDSSTLSFEGV

LGISRNPVPYMTCAVVRE

 

>GLEAN3_06989

No seq. provided Scaffold 72639 63174-64724 (no introns)

MADFQFLNSMSVVTYVVVIATSILITRYWINTKRKSEKVAKSGGRPLPGPWGIPVFGN

MLSLDQSGPHLPLMKLAKVYGNVFKIQMGSRPVLVLNGLEAIKKALTKQPAVFAGRPD

LFTFNVINETNVYGPSIGFSTYSEQWKLHRKLAEASLRHFTAGGQVQVLEKVVESEAQ

ELVRYLKASYEEKNTNLKGMLSLSVSNILLWYLLSKRSSYEDKKLIELLGFADKFSKA

TGSGNPIDFLPWLRFIMTKHTMKLKEFQKQLGQMLQGIIEEHHAVYEPGSERDIVDHI

ITTSTEIGEEELKRLGLAGNTLLQTVHDFFGAGYETTSATLEWMLLYMASYPQVQSDV

QGEIDKVIGRHRLPTLNDFDNLPLTQSCLLEIQRHASVIPFAIPHSTTKDTVLDEYFV

PKDMVVFVNLYSAHFDPEVWDQPEVFNARRFLAQDGTLDKDKEKLVVAFGLGRRKCIG

SDLARINLFIYFTTLLHQLNFSCPQGQKVSLDSAFALIRRPLEFNVELVPRS

 

>GLEAN3_07306

MYVFKRKAFVDLARKYGPVFSVRRGPLLFVVLNDRESMKQALVKSGEFFSDRFIPGHL

RWGIPDTEKNASIVWSNGKPWDDQRKFSLPALRSFGFGKKSLVPQINLESRYLADEIR

ALHGEPTDLSALLNKATANVISQLVFGRRYEYDDPEFIGALEALADAFGLVSETDPVI

VFESLVHTPWYKPYRGCIFKLRDFVMSHLHNHQETFQKDNIRDFVDAFLAHDISKDYS

LDTFWRIVLDFFAAGTETTAVVTSWAILYLAVHPDVQKKVQDELDAVVGRGRQPDTSD

RPNLPYCDATLMEIMRIRPVLPVSLPHLTSADVSIGPYTIPKGTIVIPNLWAVHHDPK

EWCDPHLFNPDRFLSADGQTVVKNEAWMPFSIGRRDCLGMQLAKMESFLLFANLFQQF

DFKLPPDQPTPSMRGHTGLSMPPESYQICAIER

 

>GLEAN3_07335

seq begins at I-helix, missing N-term half

VNTFLLGGIDTTSSALTWLLYLLATHPEYQTKIQEEVDELFQDREIREVRREDLHRTP

FLMKCVKESQRMYSFVSSSRLLTEPLVVDGLTVPAGTDLAIYPYQLHHNPDVWSDDHM

TFRPSRFDRQNVEGRDPFAFIPFSAGARNCIAQQFALQEIQVAAIRIYDKFGFTLVRD

STPVFRIVSVPEDEILLGIHPRRET

 

>GLEAN3_07404

MSESKAKSLPGPWGLPILGNLLNLGSDEPHVALMKLAKEYGNVYQIRLGSRPVVVLCG

QEAIRSALVRQPVAFAGRPDLASFQFIRKNTDVTVAFQTNDATWKLHRKLAESSIRFF

VSGQRKSSLESTVTQEVESLINYWTNNNNNDDNLQIDPVNLIKLSVSNVMLWCILGKR

HSLDDSRLREFVSKSDDFAVAVGSGNIADFMPWVRFVTVRTTQNLKKLLNTFRGWFMP

YIEEHRLQYEQDSEKDILDYLVTSSNKLDEADLQQLGVSREALQISTVFDLFGAGFDT

TSAIITWQLYYAIRHPAVIARVQAELDDVVGRDRLPNLTDRDHLPLTQAFLLETFRHS

SVVPFTIPHSTTTDTTLLGYHIPKDTVVFVNLYSVHHDPDTWHRPSDFDPERFIDNET

GLVDKTKADAIMPFGAGRRRCIGAEMGRIQSFLYFAILIHQCHVSSLDKASLDNKPIY

GLTMRPSPFNICVSRR

 

>GLEAN3_07406

No sequence data provided

 

>GLEAN3_07409

MIDLLGAGESGFGQFNALTITATLGVVTATLALWTIMRPNRFPPGPRGLPIVGSIFSL

NDSPEVVFGEWAKKYGDIFGFKAGERWMVVLNRQALIKEAVLKQGVDFAGRPDFYSVN

IFTEGFKDIVFSPYSETWKLHRKLAHSALRHFATGKPLQGLISSVYPKVEKKLAMTEG

QPIDPKVLITLIMYNVLAQMCFGLSYELEDPNVTQWMDVNNDVNEKLGLGLAADIFSW

AKYIPTSGPRMIKEITETMFGFLRSQVDEAREHYDPENINDFYSLLLKAQEDARKEGE

NVDKLTDTHIFQTVADIFGAGIQTTVETLYWAMALLVTYPEIQAKIRAEIDDVIGRDR

LPTINDRGNLPYTEASLYEVLRYSSIAPIAVPHATSRDTEFGGYHIPKGTVVFINTHS

MHYDPQEWDQPDKFLPEHFMDGSGTVREHPPSFLPFGAGRRGCLGEAVAKADLFLIFS

WFLQNYTFSKVPGKESEDILNMIPQTAFGRLLLSYEIMINKRD

 

>GLEAN3_07558

MSVIFKLVITVATLCLLHRVYRCVRRWLRLYRRANYMNRNVPSVEPHWLWGNVFDNPG

IFQAGLDWHIKAVSRCRRMLVYWGFWTPCLIVNHPDTMKDVLKSSTISRRPAGRTGYH

ILGEWLGQGIATSDGPHWRRNRKIVTPFFQSNCVRKHHVPVINKTTDIVLQKLSIKAD

SATPFDVSHLMFNLSGDIIYRVALSHELGCQEGGSPVLKSLNILTEIGTSRILNPIMA

NDFIFRMTSQYNMWKEALEVVNSLSEKLIRERKRMHEETSCETATKSRDLLDTLILST

YSDDKRLSHREIRDEGGKSHLPVLTREVNTCIFGGVDTTGSALTWFFMDMAKHPEHQT

KVQEEIDDALGDHPDDYIDSFDQHTFPYLMQCIRESLRLHAMLMPARGLIEPLVVEGV

TIPPKTGIMIDFYQLHHNPDIWGDDHMEYRPERFDQGNVARREALAFLPFSVGPHQCI

GQRIAMQELEIIGIRVLRRFSLGLVKEAKTKFRIVGKPASEVLLTIKHRQNSSNL

 

>GLEAN3_08152

MEWIVQSDTSLWLMTGAIALVVMWFVKGLMSRPKNLPPGPVGLPFLGVAWEMMRAKPD

PLALYSSWADQYGDVVSFKVGPMTFVLLNSYEVIVEAFSHPDLNNRPKSQMVEEALGL

ANNGVILANGKVWEEQRKFAHSVFRSLGVGKKSYEDTVATEMDQLRAAIEEKEGSSFD

PNLLFGQAVANIVCSVAFGTQYKYSDVEFRQILNLMNDNMELSSGGGALLFLPLPGIS

KIPFGAPKKMVKNMKEVNKFLLNQIESHEKCLDLDHPRDFIDLYLKKMAETKVLETSS

YSKINLNSAIGDLLFAGTETTTTTLKWCILYMMAHPEMQSRVQRELDHVVGRERLPRL

DDRNDLPYTNAVLIEVQRKGAVAALGVPHVAAEDTSFRGHTIPKGATILSNIWKVLNS

EELWKDSDAFKPDRFLTEDGKLIKREELIFFSVGRRVCLGEQLARMETFLGFTSLLHR

FTFKKPDNSPPLSFEGVLGITRNTMPYLTCALSRD

 

>GLEAN3_08632

 

MLAVRWYRQRSRLPGKLPPGPWGLPLIGSIFSLGKQPHLTLMDMAKKYGNVFTLNLAG

QLVVVLNGYESVREALVKKAGVFAGRPHLALTQELTEGQGIVVADYGPIWREQRKFTL

TALRNFGFGKESFEAEIQEEIRYMLAAFKEKDGSSFNSDHIVEIAVSNVVCNFNFGRR

FEYQDPKFQLLVDSIYRFAEIGTNAAAINFFPFLKHLPIPEIREVFEIDKVVRSFLQR

MIEDHKETLDETRQRDFIDAYLLEIKRRQEAMKKEEKISEADASFTETYLFHILNDLI

FAGTETMSSTIRWAILYMIVQPEVQKKVQEELDTVIAPGELPSWRDRNRMPYTEATLM

EIQRMANITALTFPHKTTEDTELAGYSIPKDTHVFINLYSVHVDPTQWKEPEKFMPER

FLSDEGGVIHNPALMPFSAGRRACPGEKLARMELFLFFSSMLRYFTLTPAPENPNPSI

YKNYGISLNPKPYKICVIPRQ

 

>GLEAN3_09118

seq begins at I-helix, missing N-term half

MPVSKLQETHRFIHEHLRDYSRMNGSHKAGILTTVETLYWAMALLATYPEIQAKIRVE

IDDVIGRDRLPTINDRGKLPYTEASLYEVLRYSSIVPNALPHATTRDTEFGGYHIPKG

TTVIINTYSMHYDPQEWDQPDKFLPEHFMDGSGTVRDHPPSFLPFGAGRRGCLGEAVA

KADLFLIFTWFMQNYTFSKVPGKESEEILKMIPQTASGRLLLSYEIMINKRD

 

>GLEAN3_09512

missing N-term green = C-helix

MVSFYVGPKPIIILNSYPVIFEAFRHPDLQDRLSSRLLQEIMGLKNSGIGFSNGEVWK

EQRKFTHSVFRSLGVGKKSYEDTVSAEMTQLSSVIKEKKSSPFDPSTLFMQAVSNIIC

SIIFGTQYTYDDEAFKEILHLVNLNVKLSGGGGSFLFLPIPGISKIPFGKMKLMTNTI

RSLNAFIQSQIESHEKNWDPENQKDFIDQYLNKLDETKDTVSSFTKLNLTGCIADLFL

AGSDTTATTLKWCILFMMAYPQVQSRVQDELDHVVGRERIPRLDDIKDLPYTNAVILE

VQRKGVVVPLGVPHVAAADTTIRGYTIPKGAVIVSNISEVLNSEELWKDSGAFNPERF

LTADGELIKRDELIFFSSGRRACPGEQLARMETFLGFTSLLQRFAFKKPENSLPLSFE

GVLGVSRNPVPYMTCTVVRE

 

>GLEAN3_09692

missing C-term, no EXXR, PERF or heme signature

MGFTETEPGYWLLAGVLAGVLGTWLVKHLITRPKNLPPGPTGLPLLGVAWQIYKGGSD

PLALFASWSKKYGEIVSFYVGPKPIVILNSYPVIFEAFRHPDLQDRLSSRLLQEIMGL

KNSGIGFSNGEVWKEQRKFTHSVFRSLGVGKKSYEDTVSAEMIQLSSVIKEKKSSPFD

PSTLFMQAVSNIICSIIFGTQYTYDDDAFKKILHLVNLNVKLSGGGGSFLFLPIPGIS

KIPFGKMKLMTNTIRSLNAFIQSQIESHEKNWDPENQRDFIDQYLNKLDETKDTISSF

NKLNLTGCIADLFLAGSDTTATTLKWCILFMMAYPQVQSRVQDELDHVVGRERIPRLD

DIKDQRLWGLQT

 

>GLEAN3_09825

missing N-term

MSRVLERMNETLTLIPETDLVNIFEPLIHTSIQRYKRYRESIFSDRDFIMSHLKSHQE

TFQKDNIRDFTDAFLADDISKEFKLEHFWRVLLDFFAGGTDTTAVVTSWAILFLSVHP

DVQRKVQTELDTVVGRGRQPNTLDRPDLPYCNATLTEVMRIRPVLPVSVPHMTSDNVS

FRGFTIPKGSIIIPNLYAVHHDPKEWNEPDKFNPDRFLSADGKQFQMNDAWMPFGVGR

RDCVGSQLAKMEAFLLFTNLFQQFEFKLPPNQPTPSMCGANGITMKPKPYKICAIER

 

>GLEAN3_10143

MYTPCATIAFSNGKPWVDLRRFSLPALRSFGFGKQSLVPQINLEARYLSEEIRNLRGE

PTDLATTFSKATANIICQLLFSQRYEYSDGEMSRVLKSMEEFLSLIPETDLVNIFETL

IHTPRYKPYRESIFSDRDFIMSHLNSHRETFQKDNIRDFTDAFLADDISKEFELEHFW

RVLLDFFVGGTDTTAVVTSWAILFLSVHPDVQRKVQAELDTVVGRGRQPNTLDRPDLP

YCNATLTEVMRIRPILPVSVPHMTSDNVSFRGFTIPKGSIIVPNLWAVHHDPKEWNEP

YKFNPDRFLSADEKQFQKNEAWMPFGVGRRDCVGSQLAKMETFLLFTNLFQQFEFKLP

PNQPNPSMRGANGITMNPEPYKVCAIER

 

>GLEAN3_10246

sequence begins at I-helix, missing N-term

MFFAATDTIHEVLLWLTLFIATRPDIQLRMRKEIDTVMNSGGSESLYDQRSRLPVVRS

IILEVLRMRPVAPMGVPHVVSDTGSIDGYSIPKGSQIFSNIWGMHNDPDIWKDPSVFN

PDRFLSNTGVLDEVKAKLVIPFSIGPRSCPGEQVARRIIFQTLVSLISRYRISLPAGR

PTPRLVNEERSFTMSPERFKIVFSRIQADPVQLKDLEIPTTE

 

>GLEAN3_10549

missing C-term, green = C-helix

MVAIELVILVLVLWMVKSFVSKRYGKQYRLPPGPPALPIIGNLPVFMTGKAPYRVMKN

LADKYGSVFSLKIGTTPFVVLNDVDSIREALHHRSVDYAGRPCLYSVSLMSRNNCDIV

FGDYTPGWRLHRRATAAALHKNIRGVGCEAMGIKITREAEYFTQLLMKKAGPVVDIEN

ELNLAIINIISQLTFATRYTLDDPEYQALVDSNQKFSSLLKPGDPIDVFPALKVRGNT

IVIFNCFYLGQHNI

 

>GLEAN3_10563

MFSLLSTSIILGALVFLICFWFVRRPKNLPPGPWTLPIIGYRFKTGHVHEAYVELAKK

YGPVFSIRRGPFLFVVLNDRESVKQALVKSGEFFSDRSVPGPINWTIPDAKKKASIVW

SNGKSWVDLRKFSLPALRSFGFGKKSLVPQINLEARYLAEEIKALHGEPMDPSAVLNK

ATANVMAQLIFGRRYEYDDAEFIEILQVMVDIFAFVSDNDPVNVFEPLIHTPWYKPYR

EMMIKQNKFIHAQLYFHRETFQKDNIRDFVDAFLADDISKEYTLEDFGRIVLDFFTAG

TDTTAVVTSWALLFLSVHSDVQRKVQNELDAVVGRGRQPDTSDRPNLPYCDATLMEIM

RLRPVVPISLPHMTSADASLGPYTIPKGTIIVPNLWAVHHDPKEWCDPHLFNPNRFLS

ADGQTVVKNEAWMPFSTGRRDCLGMQLAMMETFLLFTNLFQQFEFKLPPDQPTHSMRG

HPGLTMPPESYKICAIER

 

>GLEAN3_10576

MFFLSSTSVLLGVLGFLFCLWFVRRPKNLPPGPWSLPIIGYRFDSGLIHEAFVDLARK

YGPVFSVRRGPFLFVVLNDRASMKQALVKSGEFFSDRFVPGHIRWGIPDAEKNAAIVW

SNGKPWDDQRKFSLPALRSFGFGKKSLVPQINLEARYLAEEIKALHGEPMDPSALLNK

ATANIIVQLVFGRRYEYDDPEFIGALQAMVDIFALVSDTDPVNVFESLVHTPWYKPYR

DCTFKLRDFIMSHLYNHQETFQKDNIRDFVDAFLAHDISKDYSMDTFWRIVLDFFAAG

TETTAVVTSWAILYLAVYPDVQKKVQNELDAVVGRGRQPNTSDRPNLPYCDATLMEVM

RIRPVLPVALPHMTSADVSIGPYTIPKGTIVIPNLWAVHHDPKEWCDPQLFNPDRFLS

ADGQTVVKNEAWMPFSIGRRDCLGVQLAKMESFLLFTNLFQQFEFKLPPDQPTHSMRG

HPGLSMPPESYKICAIER

 

>GLEAN3_10719

No seq. provided Scaffold 58259 989-2431 no introns

MADNQFLNGRPLPGPWGLPVIGNMLSLDQTGPHLSLMKLAKVYGNVFKIHMGNRPVLV

LNGFQAIKEALTKQPAIFAGRPDLFTFKVIDETNVYGPSISFSTYSEQLKLHRKLAET

CLRHFTEGGQAQFLEGVAKGEAEELVRHLKVSKCEGGLDIRRLIRLSVTNMILWYMFS

KRSSYDSKLLNDILNLTDKAEKPSGSGNPVDFLPWLRFITTKYKTTLQEYRKKFGKLL

SGVIEEHAVLYEPGSERDVIDHLISTGHNSGEEELQRLGLAGKALYQSAFDFFPAGTE

TVLATLEWIMVYMVTYPNVQSDVQHEIDQFVGRGRLPTLNDRDNLPLTQSCLLEIQRH

AAAVPFAVPHSTTQDTVLDGYYVPKDMLVFVSLYSAHFDPDVWEKPEVFNPRRFLAQD

GTLDEEKTKLAIPFSLGPRRCLGSGVARIQLFIYFTTFLHQLNFKCPDGQKASFDATY

KFPRRPKVFNVQIDPR

 

>GLEAN3_10720

No seq. provided Scaffold 58259 10218-11660 no introns nearly identical to GLEAN3_10719

MADNQFLNGRPLPGPWGLPVIGNMLSLDQTGPHLSLMKLAKVYGNVFKIHMGNRPLLV

LNGFQAIKEALTKQPAVFAGRPDLFTFKVIDETNVYGPSISFSTYSEQLKLHRKLAET

CLRHFSEGGQAQFLEGVAKGEAEELVRHLKVSKCEGGLDIRRLIRLSVTNMILWYMFS

KRSSYDSKLLNDILNLTDKAEKPSGSGNPVDFLPWLRFITTKYKTTLQEYRKKFGKLF

SGVIEEHAVLYEPGSERDVIDHLISTGHNSGEEELQRLGLAGKALYQSAFDFFPAGTE

TVLATLEWIMVYMVTYPNVQSDVQHEIDQFVGRGRLPTLNDRDNLPLTQSCLLEIQRH

AAAVPFAVPHSTTQDTVLDGYYVPKDMLVFVSLYSAHFDPDVWEKPEVFNPRRFLAQD

GTLDEEKTKLAIPFSLGPRRCLGSGVARIQLFIYFTTFLHQLNFKCPDGQKASFDATY

KFPRRPKVFNVQIDPR

 

>GLEAN3_10721

No seq. provided Scaffold 58259 20505-21947 no introns nearly identical to GLEAN3_10719

MADNQFLNGRPLPGPWGLPVIGNMLSLDQTGPHLSLMKLAKVYGNVFKIHMGNRPVLV

LNGFQAIKEALTKQPAVFAGRPDLFTFKVIDETNVYGPSISFSTYSEQLKLHRKLAET

CLRHFSEGGQAQFLEGVAKGEAEELVRHLKVSKCEGGLDIRRLIRLSVTNMILWYMFS

KRSSYDSKLLNDILNLTDKAEKPSGSGNPVDFLPWLRFITTKYKTTLQEYRKKFGKLF

SGVIDEHAVLYEPGSERDVIDHLISTGHNSGEEELQRLGLAGKALYQSAFDFFPAGTE

TVLATLEWIMVYMVTYPNVQSDVQHEIDQFVGRGRLPTLNDRDNLPLTQSCLLEIQRH

AAAVPFAVPHSTTQDTVLDGYYVPKDMLVFVSLYSAHFDPDVWETPEVFNPRRFLAQD

GTLDEEKTKLAIPFSLGPRRCLGSGVARIQLFIYFTTFLHQLNFKCPEGQKASLDATY

KFPRRPKVFNVQIDPR

 

>GLEAN3_11009

MVFTILGFTVSWPLVGTLVAIIVGYEIWHMTYFKRKGVKGPIPFPVVGHSIDLFQGIH

WKSQEYVRKYGNVFGMFMFSSPVFFISDLDMVKDIAVKSFSKFMNHKTFVLKSRPFDK

ALIGLKDQHWKDVRNVITPTFSAVKMKQLSGLLNESSDTLVKNFGAIQEKEGNVKVFS

LFEAFTMDGIAKCAFGIQVDSQNDPSDPFVTHAKDLMKNRFALKFVIAGIFPRIGKIM

DYFEVSTLPPEVSSFFISVVDQAIAARKESGGSKMKDFLQLMLNAAQENKSEKEMIEE

KPQDEKVEGDLYALIEDQDNKSYFDLKKSKKTTLTKDEIYGQAMLFFLAGYDTINTTL

GFLTYSLATNPEVQDKLIEEIDNETPARDDVGYNSVAKMSYLDNVVCEILRLYPAGLI

IERQCNETHVCNGLTIPKDSQIFFPVYAIHRDPAIWPDPEKFDPERFTKENREGRHPL

AWMPFGAGPRNCIGMRFALMEIKMATVRLLQKYRFETCPETQIPPKFAKMSLKPDDDL

YLRVVERS

 

>GLEAN3_12080

MAVVLPAGIGVTVCIVALSAFLFYYIARFLRIFLLISKFDGPPAVPIFGNTLQFKSDP

REHFLYDVGVREKYRHQSGGVIRMWFGPLPIVLIYNAVHCEALMNSSRHINKGTEYTF

MEPWLGKGLLTSGGQKWFHRRKLLTPTFHFSILQSFMEVFNEQSMILVKKLGKFADES

ETVNIFPLVTHCVLDIICDTAMGKRTNAQSDNENEYVKAVGRMSHLIVSRIRNFMLWP

DWIYNRTDAGKDHEKALQILHGVTNKVITTKTQMTPNGGTYEDDAVTRKRKRIAFLDL

LLQMHREDATFTLEAIREEVDTFMFEGHDTTAAAVNWALLLIGQHPTVQARLHDEIDQ

VFGDSERPITSDDLSELSYLSCVVKESLRLLPSVPGIGRDLDEDIIVNGKVVPKGASV

FLSIYGIHHDPEQFPDPERFDPDRFLPENSTKRHPFAFIPFSAGPRNCIGQKFAMMED

KVLLINILRRFSIKSLQTLDETKPAGQIVLRPADGNILVKLSHRK

 

>GLEAN3_12081

MAAVLSAGIGVTVCIVALSAFLFYYIARLLRIFLLISKFDGPPAVPIFGNTLQFKSDP

RDIFLYYQRNFERYRQETGGITRIWYGPLPAVIVYSAKHGEALMNSSHHTSKGIQYKC

LEPWLGKGLLTSSGKKWTHRRKLLTPTFHFSILQNFMEVFNEQSIILVKKLEKFAEKS

ETFNIFPFVTHCVLDIICDTAMGKRTNAQEDSENEYVRAVDSMSRLVVRRMRNPLCWS

DWIYYNMGVGKEHEKSLNVLHKVTNEMIQERLSDIPSNENGTNQDDAVTRKRKRIAFL

DLLLQMHREDATFTLEDIREEVDTFMFEGHDTTSAAVSWALLLIGQHPEVQVRLHDEI

DQVFGESERPVTSDDLSKLPYLSRVIKETLRITPPVPGLSRELDEDIVVDGKVLPKEA

MVIINIYGIHHDPEQFPDPDRFDPDRFLPENSTKRHPFAFIPFSAGPRNCIGQKFAMM

EDKVILINLLRRFSVKSLQTLDEAKPAGLLILRPAEGTILVKLSLRK

 

>GLEAN3_12148

N-term only

MANSVNIFESVGGVFGEMTLATMILVSTIFVLGIAWAFKSLLGPQNKDIKLPPRLPTG

IPFLGQAVAFNKSPIDFLEDAYEKYGDVFSFTMVGKTFTYLIGSEASALLFNSKNENL

NAEEVYSNLTVPVFGKGVAYDVPNPVSLINLIIDYIRGA

 

>GLEAN3_12500

missing N-term, seq begins at KYG motif

MDMAKKYGPIFSLRRGPFLVVVLNDIATIKQAFVKSSEFFQNRFVPGHIRWGVPDTNK

NASIVWSSGKPWKDHRKFSLHALRSFGFGKKSLVPQINLEARYLADEIKNLRGAPSDL

SPTLNKATANVVAQLVFGRRYEYDDLQYIEVLQAMVAIFAGISITDPVVVFESLIHTP

WYKHYRDATFKARDFIMCQLYMHRDTFQKDNIRDYVDAFLADDISKEYTLEEFWRIVL

DLFAAGTETTTLVTSWAILYLAVHPDVQKKVQNELDRVVGRGRQPTTTDRPDLPYCDA

TLMEVMRIRPVLPVLSPRECLGIQLARMESFLLFTNLFQQFEFKLPSDQPTASLRGQP

GLSMHPHPYHVCAIER

 

>GLEAN3_12926

missing C-term

MGPFVVIVVYVLGLFACLLGIGLLVFVAAILRTEQKFSDLPGPKRDSFLFGNIFSIFE

AKKKLGQPIGLIIYTLSQDHGPVFWIRVFARGHVFASAPEIVKELLLNSRHIKPQSFY

KNFQYVFGARFMGNGLVSEINHEKWFHRRAIMNPAFRRKYLMGMMEKFNAESEKLCEH

LRDKADGRNEVEMLNELNNVTLDVIANVSFSMTTGAVGDPSCPFPAAVTLGLRGMQTI

IDKPWYSFDPRSESRKTRQETREAVNLIRETGRQQIEERIAARQRGDHVPNDILTVIL

DCANDLVGDEDFQMENMIDEFVTIFFAGKLFSAPENCNYYS

 

>GLEAN3_13039

MDFRWHQLTPYHLLVTALVGLVAAWFVKHLMRLPKNLPPGPTGLPLLGVISKFIGTSD

PLALLSEWADRYGEITSFKLGPQLVVLLNSYDAVAEAFRHPDLQGKPRSRMTEEASGF

GPNAGLLIGAGQSWKEQRKFTLSVFRNLGVGKKSYEDTVAAEMAQLGRAIKETKGSPI

DPNILVGQAVANVICTVIFGTQYKYDDSEFQHLLYLMNFNMSRSGAGGWLFFIPIPGM

SKVPFGVIKAMMTNAKEFYAFIDHHIEAHKKNGGPDIPKDFIDQYLIKLKEMKGTESS

FSHSSLKAVISDLFFAGMETTTTTLKWCILYMMAYPEVQSRVQAEMDQVVGRERLPGL

DDRQNLPYTCAVLMEVQRKGAVVAMGVPHMAAADVTIKGYTIPKGTIIFPNILNVLNN

KNFWDDSDAFTPERFLSDDGTLIKREELIFFSTGRRVCLGEQLARMETFLGFTSLLHR

FTFKKPDNSPPLSFEAVLGGTRSSLPYLTCAIPRDPTENVA

 

>GLEAN3_13102

Missing N-term, green = C-helix

VTPLFSDDLKDALFNSKDEHWKGVRNIVTPTFSAAKMKLMSPLINKCADRLLKHLEKR

QELHGNIECRELVGAFVMDTIASCAFGLNIDCQENRFGPFVTNAKKAFSGNIFTSPYL

LLVSLFPWTLPILEYLDFNIIDRKSTQFFADIIKKTSAVRRESKNSDSAKRIDYLQLL

LDAQERDITGNNKNKADGIHGDPEDEAIDEGGLQKKSSKKIRLNESEVISQALIFFLA

GYETTNTTCGFLLYFLATNPDVQDKLVDEINDVAPKAEDVGYQSLSKMPYLEQIFCET

ERIYPPAIMIDRVCNEPFNVNGFTVPKGMRIFIPIYTIHRNPNLWPDPETFDPDRFRK

ENREKHHPCAWMPFGTGPRACVGMRFAIMEAKMVIVRILQKYQIETCPQTEMPPKQAT

NGFLSPPNGITLRAVKRN

 

>GLEAN3_13199

MEKCEISSSQLESSLPYPSSFTSVILCVITLYLVYRLFRSLREKKDDGRRYPPGPKGL

PLIGSLLDLRSDVLMKMKRFADEYGSIYTINLGPTNLVVLNNKDVIKEAFVTNAACMS

HRPQVMFDTLKQSQGIILKQPGDLHRKNRNFLSTGFRMIKLGVPEFSYHLHPHIDNFK

DKLIASKGQPLPICSTSHRLLFEIFTDLMLGEVPDPNSEKVNKLLDILYIALSPNPML

ALVGNLIPNWVKSALSDWIDMADCLAKVTRYLKEHIHQHRETYEGDPTNLINAFLHKE

KLELEKTSKLREESSSGGDSDTEIEKAAIGTSFMVNDQEENEKSLAAILLDVFQAGCT

SVVASLQWLLLDLAIHQDVQDKLRMEVDRNVEQRNQRPSLLLRSDNFPYTKAVMLESI

RLHPPAPLGAPHCAAQDTKLAGYDIDKGTIILPNFLMALQDFNQWEDACDFRPERFLN

EDGTLATRETLGFSIGPRQCPGKLLSNTVISSILTDVIQDFTLHLPPDHNFSLKGNPG

IIYNPQPFSIIAKQRTS

 

>GLEAN3_14092

MTVLGMISGLFSMTTVYLSGAVVVIFFIYDYWTHTYFKRLGIPGPPPKPIFGNALDFS

KPMYDLLDEYANKYGPVVGIYMMRRPLLLIRDLDMLRDIMVKDYHKFYNKFRVPISSG

NLDNALFLLRDAQWKRVRDIVTPTFTGKKMKMMSGILNEGADTMVEKLLQSCKEDGNI

DSLELFGSFVMDSVASCAFGLKVNSQEDKDNLFITNGKRLFDFIFNSPVFMAVYVIQF

FTDVTEKAVALRESHPSKQRVDFLQLLIDAKNGKSKMASENDKDDIHNKYFEDTGAEE

NISTKTQKYMTREELLAQGMIFLAGGYETTASLLTFASYLLASNPDCQDKLIAEIDDI

APKRDDVGYTTISKMPYLDQVVCEVLRIYPPGPLTDRECGETCTYKGYKIEKGTQILI

PTYTIQHDPTYWPDPLKFDPERFTKANREGRNPFTWLPFGAGPRICIGMRFALMEAKM

ALVRSLQVVRFGISPLTKIPPDLGNTSLLSKKSSYTLKVVQRH

 

>GLEAN3_14573

MAFTILGFTVSWPLVGTLVAIIVGYEIWHMTYFKRKGVKGPIPFPVVGHSIDLFQGIH

WKTQEYVKKYGNVFGLFMLSSPVFVISDLEMVKDITVKSFSKFMNHKTFVLKSRPFDK

VLTSLKDQHWKDVRNVITPSFSAVKMKQLSVLLNESSDTLVKNFGAIQRREGKVKVFD

LFEAFTMDGIAKCGFGIQVDSQNDPSDPFVTNAKDLMKNTFSLKFLIAATFPGIGKIM

DYYGVSTISPKIASFFLSVVDQAIAARKESGGSKMKDFLQLMLNAAQENKSEKEKPQE

EKVEGDLHALIEDQDNKSYFDLKKSKKTTLTKDEIYGQAIIFFLAGYATIHTTLGFVT

YCLATNPEVQDKLIEEIDNETPTRDDVGYNSIAKMSYLDNVVCEILRLYPASVLVERQ

CNETHVCNGITIPKDSQVMFPVFAIHRDPALWPDPEKFDPERFTKENREGRHPLAWMP

FGAGPRNCIGMRFALMEIKMATVRLLQKYRFETCPETPIPVKFAKMGLKPDDDLYLRV

VERS

 

>GLEAN3_14843

MDKNLPDVLTDDVLINAVSVTTYLMLIITSILFTRYWINAKRESDKVAKSGGRPLPGP

WGLPVFGSMLSLDPSGPHLSLMKLAKVYGHVFKIQMGSRPVLVLNGLKAIRNALVKQA

VVFAGRPDLLTLKAINSANLFGPSLSFSQYSEEWKLHRKITETSLRHFTAGSQVPFVE

SVAKGEAEELVRYLKVPKCERSVDIPCLVRLSVSNLMLWFMFRKRASYDDKKFIESIN

ILDGLSEAGSGNLVDFLPWLRFFSLDSSTGFLEAQKRFNKSVRGLIDERHELYDPESA

QHVFMDEIIALGHVREEDEFQRLGLAGRTVLQNAFDFFGAGLGTTSATLEWAMVHMAL

FPKAQYEVQCEIDKVVGRDRLPTLNDREYLPLTQSCLLEIQRYAAVTPFGIPHSTTKD

TILDGYFVPKDMVVFVNLYSANFDPEVWDEPEVFNPRRFLAQDGALDDEKMKLLKAFG

LGRRRCVGSGLASINLFIYFSTLLHQLRFSCPEGGEVNLEFRFGITLHPKNLNVHIES

R

 

>GLEAN3_14874

seq starts at I-helix, missing N-term half

MIFLSAGYETTAALLTYASYLLASNPDCQDKLIAEIDDIAPKRDDVGYTTISKMPYLD

QVVCEVLRIYPPGILSDRECGETCTYKGYKIEKGLQIWIPTFTIQRDPTHWPDPLKFN

PERFTKANREGRNPFTWLPFGAGPRICIGMRFALMEAKMALVRSLQVVRFEISPLTKI

PPDVGNTSLLSKKSSYTLKIVQRH

 

>GLEAN3_15096

missing PERF and heme signature at C-term

MDLDTSFITDVSLVGGALKATFGVLLLYFIYCYFNDNGSQYHHPPGPKGLPFLGSVLS

MIFTRKHPHEVSLKGVAFANGEVWKEQRRLLMSVLKKADVDGISYDDIIVSQAERMLA

EFEKTNGQSFDPRDLIHKTIGNVVMKILTGVTYDYDDLEFSRVMTNTVRVFTLLGEAG

ILASIPALAAIPSPTKWKIIKAWEEIYGFLKEIIERRKARFDTNQKATDFMGSFVKAM

CAKGTEIGPASCDEINLMVSSFDLLLAGWDTISTTLAWGLHTLASYPEAQRRVHQEIK

DVIGLDRLPQFSDHHSMPVTMATIAECMRFRPTLAVHIPHVATQDCKIGGYDIPKGGQ

ITLNMWSKGLSWRATGKN

 

>GLEAN3_15120

short at the N-term

MVLKANRYREKLGRIPVHLSLLKCVSFILLRLFGPICSITLRVSDSRISTPMKNKIRH

FYSALSSYEFAPSVKKLGDAKIDWGVSGWRPTKEAAGLSRTQHGIVSETPGGSSHISY

FFVSFALITLIAYQNLVQTFQLNQSSFTVVDIAVVVTVSAAVIIVVVVVVVFALLSGI

KRYMRQVIDETLRVSILAPYAARYQDFDVVLGGHVIPKGTPVIHALGVSLQSEKYFPN

PKTFNPENFSAANVKKRPRDAYHPFGFAGRRVCPGQQFAYKEVAIFLAVFIRAFKVKV

VPGQKPEHVHGLVSHLVNKDGGEVWITVEKR

 

>GLEAN3_15156

sequence starts in I-helix, missing N-term

MNPDIQEKLINEVDEVTPSRDSVGYSSIATMPYLDQVVCETLRLYPPAGISNRMCNET

FVHNGLMIEKGTQVFIPVYTIHRDPDYWPEPEKFDPDRFTKEAKAKQHPFAWQPFGAG

PRNCIGMRFALMETKMAVVRTLQNYRFETCAETENFWEQTLSGNLY

 

>GLEAN3_15256

MDSVPLDQLGTLLIVTLLCMVGLRFLISWIRDGKKRKLPPGPTAWPLVGSIPFMIRSK

LPPLDLLTALADDHGDILRLRIMTTEVVILSSYDIIKEAYNHPNLQGRPRIVPIEDYI

GRRNIGVAGSSGPVWREHRQFVFSAFRHFSTGPQNIDHIIHVEARTLVSEVSTLSRKP

FNPHKVLNNAVSNVIAVMLFGKAYNYADDDFNQLQEIINTNLQEAASGGIYLFIPFLA

KVPFSPANKIKQGIKRFTDFTKKIVNEHRHVFEPDHPRDIIDLYLKKIQEDHEGGIES

SFDETNLHVLCGDLFGASTETTNASLKWSIIYMMIHPEVQTRVQEELDRVVGRDRLPE

LDDRASLPFTLATLHEIQRMGHVFPLSIPHACTADMRLADYDIPEGTLVVSNLWRLSR

DSRIWSDPNEFRPDRFLNENGDCIKPEALIPFSIGKRVCIGESIARKEIFVFFSSLLH

RFHFSIPAGGSPPSLQGRLGLVYDTQDYITCASPRT

 

>GLEAN3_15442

MIETPASSRDEPNLTVISSLLSLTSLTTSSLLLCVLVLVLVLVILTVDRRNSVTPVED

GVTLLPGPRGALATVKVLLGMTKRASRYMFERTKTYGKVFGIGDGPDSYVIIMDYASL

EEAFVTKADLCSDRGGLIRTMLESVLPERGSIFNNGTAWNEVRGFVAKTVKNPDFGAT

TISRCVSETIPTLCSELDALVDGPFDPKRTYQCFCVDVFHRIAFGIGISYENCTVKKF

IDNLDTVVAQTLDPSTLLLTRPSLINSFLFRGYRSKLRSSSSYVKTRAEEVLAKLDAP

EQSSSLIALFVKYIEGIRSSGKASYSEPDDGWVIVYNMFFAATDTIHEVLLWLTLFIA

TRPDIQLRMRKEIDTVMNSGGSESLYDQRSRLPVVRSIILEVLRMRPVAPMGVPHVVS

DTGSIDGYSIPKGSQIFSNIWGMHNDPDIWKDPSAFNPDRFLSNTGVLDEVKAKLVIP

FSIGPRSCPGEQVARRIIFQTLVSLISRYRISLPAGRPTPQLVNEERSFTMSPERFKV

IFSRIQADPVQLKDLEIPTTE

 

>GLEAN3_15568

missing N-term

MDLFDESATRLRDHLIPLADDKTLVRMLEHFNNVALDVIGKVGFDMDINAVENSDCPF

PSAVALALQGINVAFQNPLTLVDPRSKARQYRRSVRDAVKLVFKETMRLNPPVSGLSR

VLASDVQSCGYVIPKGTTIMFMTYVTSRLEEFFDDPLLFNPDRFIPTDETPRHFFAYY

PFAIGQRNCIGQQIALIESKVILAKLLQSFDFRLEQSQRHIILREVTNKPIDGCKNYI

SLRD

 

>GLEAN3_16056

missing N-term starts at KYG motif

EYFQKYGKIVGVYNFRKPTLLVSDPDVVRKILVKDFDHFVNRQEAAIKTDLLAGGMLA

AKDERWRKLRSLLSPSFTSVKLKSMVPLMRECAEVLVDIFTKREEEGKTIECKELYGP

YAMDVIGSCAFGLRVNSQTNQDEPFIKYAKRIFNFNVTNPAFLISMLFPWTAPIFNFF

KLGSVPKDIEQFFLGVFEKAIADRQANGDTGKSDFLELMMSAHKEDDDVNIMNDAEDD

IADLAQETSWKPKQKTALNRQEIIGNSITFLLAGYETSSTALGFVSYLLATNQHVQDL

LVAEVKEMAPSQSDVTFETVTKMEYLNMVIMEAMRIYPPGEVLERVCNKEITYNNITI

KKGQYIFIPIWNLQHDSDLWPDPETFDPERFSKDRRGNNHPCGWLPFGIGPHTCIGVR

FAMLEIKVAMVCILQKFRFDTCPETEIPPKLGNLSFLTPPNGITLRITPHSRYNLRSR

 

>GLEAN3_16152

missing the middle of the gene between C-helix and heme signature

MEVEVLGNAMSLTSLLVLIALGCIVLLLLNDYWAHGYWKRRGIPFEPPLPIIANMRVF

AKDIPQAAGELRSMVSILEGERWKEVRNILTPTFSAAKMKQMSNLINTNADTLVKYIG

LKQEEKGRIECSEFTKEAKAKQHPLAWQPFGAGPRNCIAMRFALMETKMAVVRVLQNY

RFETCAETEIPPTFGRSGFLTPPNGVTLSAVPRTDKVA

 

>GLEAN3_16251

MSYRCAVCYCDHEMEADMRVYSRNGRSDLLALFSSWADNYGDIVSFNVGPKLIVLLNS

YEVVVEAFRNPDLQNRPGSQIMKEISGGGKNGVLLSVGDTWREQRKFVHSVFRSLGVG

KKSYEDIVAREMDQLGSAIEEKRGTPFDPNTIFGQAVANIICSIVLGTQYKYDDTDFK

HIIELLIRNIELFGGGAWIFFLPLPGISKIPFGAVKALVSNTRELNTFVRTMIKSHEL

NLDPEHPKDFIDQYLNKLNETKGTDSSFSQANLEAAIKDLFFAGTDTTTNTLNWSILF

MMAHPDIQSRVQTELDQVVGRHRLPKLDDRKNLPYTSAVLMEVQRKGAVLALGVPHVA

AADTQVRGYTIPKGATILSNLYKVLNREDLWGDTNDFRPERFLSKEGEFIKREELIFF

SSGRRMCLGEQLARMETYLGFTSLLQRFTFKKPDHTPPLSFAGVLGGTRNTVPYVTCA

IPRETKI

 

>GLEAN3_16442

MALLTDSYGAALAIISLVLTLIWYLKNSSSWRYRNLPPGPFGLPILGILPYLDEKNPS

VDVLRMSKRYGPIFGGFLGSHRAVFLNDYDTIKETFARTDDVFSDRPRIAIFELYSDG

QGVACCYLENHWKEQRRFTLRALRNYGMGRLQGSLRNAIDVEVSHLMTAMSKIKKPFD

PRIYVESAVCNITCKMVFGRRYEYEDAQFTEFLHNLEKIFELSGVAGVVNYLSWMKYV

PFSGYHALGDCVRHLENGLFTKECEEHQKVFDPNRSPRDLIEAFLLELHKKQQEKVEN

NGEDMAPTGFNHQQMLHFIADVFAAGTDTTASTLKWSLIHLMKFEDEQRKIQEELDHV

IGRDRMPSLEDKPNLPYTQAFLAESLRYGCAGPLGVPHGAKEDTKLRGYDIPKGTVIV

ANLYAVLYDPKVFPEPDKFNPSRFIDDEGRVKTSLIERAHSQFGIGVPHGAKDDTKLR

GYDIPKGTVIVANLYAVLYDPKVFPEPDKFNPSRFIDEEGRVKTSLIERAHSQFGIGR

RQCVGMDMALMERFVFITNLIHRFNARAPNGPDSVTIDARGGLAFVPMPFEMQLTER

 

>GLEAN3_16816

starts at C-helix

MKCSIIFEEGKSWSDMRRFFLLVFRKFGMGKRSAGDRVNEEARVLCKEFILAEGKPFD

PFRTVNYAVANIIRSITAGQRFEYSNGRFQQMVQLITDVLGAENFYGPLEMMPWLYFT

PLFTKERNMCNALIGLMKQEVVEHRETVDVDNPRDVIDMCLAEIDRAKDAGEETFFDE

SLLWRTLLDIYLAGTETVATTLTWAMLYMAGYPHIQEKVCQEIDDVIGHEGTPHYDDR

TRMPYTEATIAELLRIRPIAPLGIPHINTESIVFRGYLLPERSIIMMNMVGMHLDPEL

WPDPESFDPTRFLSEDGKATKKPEGHMPFGAGRRICLGEQLAKVELFLFFTNLLQKFT

FRIPDGVLPDYGFGHRTSTLAPKKFDIVASLR

 

>GLEAN3_17582

MIDLLGAGESGFGQFNALTITATLGVVTATLALWTIVRPKRFPPGPRGLPIVGSIFSM

NDSPEIVFGEWAKRYGDIFGFKAGERWMVVLNRQALIKEAVLKQGVDFAGRPDFYSLE

IFTDGFKDIAFSPYSETWKLHRKLAHTALRHFATGKPLQDLISSVYPKVDKKLAMTEG

QPIDPKVLINLIMYNVLAQMCFGRSYELEDPTVAQWMDTNDDLNEKFGLGLAADMFSW

ARYIPTSGPRMIKELTEKLWGFLRLQVDETREHYDPENINNFYSLLLKAQEDARKEGE

NVDKLTDTHVFQTISDIFGGGYHIPKGTVVMINTHSMHYDPQEWDQPDKFLPEHFMDG

SGTVRDHPPSFLPFGAGRRGCLGEAVAKADLFLIFTWFMQNYTFSRAPGKESEDILKM

IPQTASGRILLSYEIMINKRD

 

>GLEAN3_17986

MSSSVFYYGFTPFILAGLIAFLLSFILVRSGKKKYKNLPPRGPMEWPVLGGLPSLSGK

DMPHVIIADMAKKYGPIFGMKMGSFYAVILSDYSLVRQAFAKSNDEFSDRPKITMIEN

IVQGKGLIVSYFGQMQTEHRRFSLSALRSLGMGKFKMEETIVDEARQLAMIFTSRQSK

PFVPFHDIVVSVSNIICWLSLGKRFDYNDESFKAILASLFESFEIAEIAGLFNFLPHW

SGKCTTEEQEKSRRSPTCYVEMYTEQMIEAERETPGEHTFTKENMYHAVADLFAAGTE

TTATTLKWALMYMILHQDVQEKIHKELDDVVGRNRLPSLTDRPNLPYTEATLLEIQRF

ATITPLGVPHAPVQDTVLNGYDIPKGTVVLPNLWAIHHEPDLWKNPDEFNPARFLNPD

TKQVVQREELIPFSIGRRRCLGEQLAKVELFIFFTHLLHRFQFTTPEGAPPLTMKARN

GATLNPEEFEICASLIDDVM

 

>GLEAN3_18242

MPHLRLVDLNSFSRSERRPPSWSPMSSYPDMYSGPQQMLQRCVIVQKDEKGYGLTVTG

DNPVFIQSVKDHGAAYKAGVQTGDRIIKELLTRSKYLKAPDQYRGLKSLYGARTLGNG

LISELNHEVWMKKRALFNPAFHRKYLMGLMNEFNSCSAKLVNHLIPLSDGQTEVVMTK

EFERLTMEIIGKVGFGLEDDIIGNPDSPLCQLFPKVMSGMQSVYRRPLLKYSILPKDI

KYKHEVRAAANEIRAVGRRCILARIDALRRGDQVPQDILTYILQESNNLEGIKDFDLE

DMVDEFVTFFGAGQETTSNLLSFTLLHLGRNPQVLKKLRDEIDTVLKGRNYVEYSEVS

KMKYLTLVLKETLRMNPPVGMLNRVLPYEMDLCGYKVPKGSTVLMPIYGMGRDEKHFK

NPEKFDPERFTRDEDSPLFAYIPFSLGARSCIGQTFAMIEAKVVICKLIQQLEFQLVP

NQSFEFVQEVTLKPKDGCKSYITMRNL

 

>GLEAN3_18372

MGSLLDLVDTSTALIALLVFLLSLVVTHRFYQNGDDSKSLPPRGPREWPVIGIIPSLV

GDEPYKLLARMSPQYGPVFLTRIGMISTLIINDFDSMKEALVRSGDDFADRPKFGMME

YITKGKGIISAHVGDSQREKRRFGLAALRELGMGKSRLVENISDEIETLCDVFMRYEG

KPFNPAHDITVSVSNIISWIVFGKRFKHNDTAFKRFLEIIHEGIEICESTGAAGFIPM

LQYLPLPVWKKLADNDREWKEFDAVMIAEALQRPASEEPCFVQMYNEEMKRMEQRAKA

TAAGSKASDGTDGIDASEETVIFNEEDMRQTLGDLFAAGTDTTATTLNWAILYLILHP

DVQCKIHEELDNVFGRNRKPSLEDRKYIPYTEAVLLEVQRKATIVPLGVPHAASWDTK

LNGHHIPAKTLILPNIWAVHHDPKIWKNPDEFQPERFLDDKGTTVLQREELIPFCIGR

RKCLGEQLAKMELYYFFTHLLHRFEFRISEGALPPSTVGKPGATRNPMPFSVCAIPRF

S

 

>GLEAN3_18468

MAGIVSVLSVVVQSVGSPALVVGGITILLYLIYQVFYQGGKKYPPGPTGLPVFGSVFS

LLLTKRQQHEVVNEWSHKYGPIFAFKMLGTRVYVLTDPGLVSDAFHSSPDINDRMSMA

IADEITGRPNNGLVASVGEVWKEHRRFFLTVFKKIDSGVIRFEEIIGSEVERFLKEVE

KKDGESFDPCNAINVGVANIITRLVTGVNYDHEDKDFQHLIDCSDRIFQIMGPAGLFT

LIPILAKIPSPVKTEITNLSLEILKMVRKSIEIHKTEFDPEEPARDFISCYLKEMHRR

KEENDPGTFDDINLLTSSFDALMGGWETSTTAISWMTFTLAAHPEAQRRVRQEILDVV

GSDRFPRFSDRRSMPITEATLAECTRLRPAAAIHIPHIASRDCKLGGYDVTKGSAITT

DLWYLALSPNLWKDPQDFKPERFLDAEGQFNKKLEPLPFGYGKRVCPGEQLARMEIFI

FVTYLLQNFSLTLPEGAPTSNSGKHGLTNKPNPFKINAKKI

 

>GLEAN3_18515

MANSGYVASSLFFGFQEICYRECIIVLTTFLFVRLLVSRGLFKKSTPPSNPPPPSPTG

LPFFGNLFSFQEDMRHLFRKWTNEIGGIFRLRIGPVMDIVVINDVEAVREAFVEKADA

FSDRMVPPLLGEALGKRGNNHLYKHKKTYTRSLIWPAGGGWKKTRKFGHSAFRSMGLG

HRRIEEKVVEEGRILVQACEMKEGEPFNPRVLITNAVGNMIACLVTGSRYDYGDPDFV

KIVGDIERFSQNLSLTSLGNVFPLLYYTPLYTDFRRPMRDIISYIKSMVKQHQQNFDP

LNVRDIIDLYIVELKQQREQQIPGTVEENLEWRLSIELYIAGMVTTSDSMLWAILIVA

QHPQLMKEIQKEIDDVIGDERSPKMADQDALPLIVATLMEVSRIRPAVPLGVPRLTSR

DATLQGYHIPKDTVVIMNLWEVLNSPKYWDKPEEFNPYRFLSEDRRSVVNHPAFIPFG

IGLRACLGEKLARMQMFIIFTSLLQRFTFCLADDSRSAGMKGKAGMAYSPLKYDIILK

RRENSD

 

>GLEAN3_19082

missing N-term

MASHGLTYGNSLFLLFAPSALKDNIRDFVDAFLADDISKEYTLEDFGRIVLDFFTAGT

DTTAVVTSWALLFLSVHSDVQRKVQNELDAVVGRGRQPDTSDRPNLPYCDATLMEIMR

LRPVVPISLPHMTSADASLGPYTIPKGTIVVPNLWAVHHDPKEWCDPYLFNPNRFLSA

DGQTVVKNEAWMPFSTGRRDCLGMQLAMMETFLLFTNLFQQFEFKLPPDQPTHSMRGH

PGLTMPPESYKICATER

 

>GLEAN3_19464

MFIPADTLDFFRDVNKKTIALRKSSSSGPNSKRVDFLQLLLDAQKQENIDKESKPEEE

EEDDDDIHADLDKHHDVQYHSRGLGKKATLTDNEVFAQGLIFFTAGSETTTTLLGFIA

YSLATNPEAQEKLIDEIDRVTPSRDSVGYKSVATMPYLDQVVCETLRIFPPAAVSNRM

CNETFVHNGLMIEKGTQVFIPVYTIHRDPDYWPEPEKFVPERFTKEAKAKQHPLAWQP

FGAGPRNCIGMRFALMETKMAVVRVFQNYRLETNAQTEIPPKFNRTGFLTPPNGITLT

AVPRTDKVAY

 

>GLEAN3_19883

MESADAVALTLCGVVLLASLAVWSMSRPRGFPPGPLGVPILGSVFSMTASPEEIFVRW

SKQYGDVFGFKVGQRMIVVINHQDAIREALLKHGIEFAGRPDFFSFGVFTYGFKDIIF

SSYSDTWKLHHKFAHAALRQFASGKRLENLLHGSIAKMEGYLSKMDDSEPVDPKLVIT

LLLYNVMSTMCFGKDYDFLDENLIAWKENNEKLNQEFGMGVAADFFGWAQYLPTPGVK

MVHRMRDELHRFVWDELDSHKKNFDPDNVRDVFDMLLLAQKEAADSGNQDAEKLSDTH

LVLTVSNMFAAGIQNTTETLYWALAFMVEHRDIQSRCQAEIDRAIGRKRVPVVDDRGS

LPYTEATLYEVMRYSSILPIAVPHATTCDVNFRGYRIPKGAVVMFNTYSMHYDPNAWE

SPTSFKPEHFLEDDGSVQQHPSSFLPFGAGRRSCLGEVAAKTDLIIIFTWLLQNYCFK

TVPGKETTSLLRVIENSAVGRNLENYELIMEMRK

 

>GLEAN3_19898

MAVLEISSIVTIPTLVALFVVVVLYLIYRTNSRTALRHPPGPSELPFIGSIFTLLFTK

LQQHELLFKWSGEYGGIFSFTLAGGRVVVLSDKELIQETLQDPFVNDRIPFPACITIY

GRENTGIAFASGAVWKEQQRFFLTVFRKVNTSVIRFEDIIQTQGQRLLNELDRLEGQL

FDPSNIVTVTISNVILKIVTGKTYEYADPGLLRMATCSDRLADIIGPAGLLGMIPGLA

SLPLPAKRQLTDLMREMIDFIRDHVEEHKANFNPDEPATDFMSCYLKEIANSSGIPDT

TFDETNMLQTCWDAMFPGYETTISAFRWALHLLASHPQVQSRIRQEIFDAVERDRLPA

YTDRHNMPYMESTVVECLRMRPVIPYVAPHIVSRDRKIAGYDVAVGTRITVNLWFLAR

SPTLWEDPDEFRPERFLDENGCYDRSREPLSFSYGRRTCPGDQLARMELFLILSYILQ

RFTLTLPDGAPRNLEGSMGATYKPYQYELRATKNHGARSSASNSVPE

 

>GLEAN3_19899

MSGDQIKDLVNWLEVVNDWTNKYGDVISFKLLNIRVYVLNNVQMSHDVMGSLSMMGRM

PVPIIKTLTGRHNAGVALSNGDVWKEQRRFFLSVFKRADAKGIRFEEIVGAQAERMLT

EIENTQSARFDPQPIIYTSIGNIITKIVTGITYDYDDDDFHRFMTHSNRVLDLMGPAG

ILASVPILAVLPSPAKRPLTKEWDAMYDFIKKSIASHKSEFNPNQEAADYIESYLHEI

AKRTTENNKGSFDEINLLACSFDLIVAGLETSATSISWSLHTLACYPEAQHRVRQEIF

DVIGQERDPRFSDRHRMPIAMATIAECMRLRPAIDLHIARVAEQDCQVGEYDIPKGSV

VTTNMSHLFLSPSLWEEPLEFRPDRFLTEDGQFDRKREPIFFGTGRRVCPGEQLARTE

IFLFITSILQRFTLSLAEGVPKDLPCYHSITNRPDPYQIHAEKIV

 

>GLEAN3_20229

missing N-term starts with C-helix

MKTIYNYHDQSFCSWGQKWFHRRKMLTPTFHFSILQSFMDVFNEQSMILVKKLEKFAE

KSETVNIFPLVAYCVLDIICITAMGRSSNAQENSENEYVKAVGRMSELVTIRGKNPLL

WPDFIFDKIKTGKEHKETLKILHGVTNKERLKESPKDTNTDDDDVDVVARTRKRIAFL

DLLLQMHREDASFTLGDIREEVDTFMFEGHDTTAAAAGWAILLIGHHPDVQARLHMEI

DEVFGDSMRPVTSDDLSRLQYLTCVVKETLRLIPSVPSIFRKLDEDIMLDGKVVPKET

LVALSIYGLHQDPDEFPNPERFDPDRFLPVNAEERHPYAYVPFSAGPRNCIGQKFAMM

EDKIILASILRRFSIKSIQTIDEAKPAGQLILRPAEGNMLVNLSRRK

 

>GLEAN3_20233

missing N-term

MPMEINKMGDAAAKGMFDFLEKILKRHRQAYNPEDEPKDLIEAFLHEEANKKAQNVDV

GSFTHQCLKQVVWDLMLGGIETVSTSLSWYFLFMAQYPDVQAKVQAELDKVIGRDRLP

SLSDRQNLPFTESTSVECYRFCRLINVQIPHRTRCDVKLGGYDIPKGTNVGTNASWLS

SSPSLWENSEEFVPDRFIDKETGLYNSKLEAGLVEFGVGRRVCPGEQLARTELFVIFT

HILHRFTLRLPEGTPITRNGINGLTHHPKPYKITAAPRR

 

>GLEAN3_20234

MAQVMSTDFLTHNLSTLLTTGLTIILVVLFSLRRKNDSSKKLTHPPGPSGIPVLGNLL

TVAFSKLPQHELINGWSKKYGDVFWFKIFGTGVVIINDKEVMQKTFQNTDVCDRAPFF

ASDYVLGRENNGIAFANGGSWKEQRKFFVSIFRKVDIGVSRLEDIVGGQVKNIVTQVK

NLKGEKFDPAIMVNMSIANIITRIITGVTYDWEDKAFQDIVAYSGKLFDISGPAGLFS

LNPVFARLPLPVNFEIRRLCRTIMDFIQVSIDEHRKNFDPNAPVKDFIGSYLKEMASA

DANNLKSFDDTNMIVSCFDALLGGWETVASTLRWTIYLLSKHPEAQGRMRQEIHDLVG

DERFPTIADRQYLPLVEATIAECMRFIPAVPIHLPHVTSQDCNIGGFDVPKGSAVAAN

LLHFAKSPMFWEDPEVFRPERFLDDAGNFMRGLEPVPFGYGKRQCPGELLARMEMFIF

TTFLVQNFELKLPAGIKDDLQGVLGLTWRPNHYFIEGHPTNANEE

 

>GLEAN3_20751

missing both N and C-terms, stops in heme signature

MVDEHGETLDDQHIRDIIDLYILEIREQKKRGDEIVFNETTVWRGIFDMISAGTGTTS

TTLLWGILLMAAHPDAMEQVQNEIDRVIGDHRPPSMSDREDLPLTMATIMEIQRFRPV

SPLGFPRFTNKDAEVKGYSIPKGTAAIVNIWELMHGTKYWEEPDKFKPSRFLSDDGRS

IIKPDAFIPFGIGRL

 

>GLEAN3_20752

missing N-term

MVDEHGKTLDDQHIRDIIDLYILEIREQKKRGDEIVFDETTVWRGIFDMISAGTGTTS

TTLLWGILLMAAHPDAMEQVQNEIDRVIGDHRPPSMSDREDLPLTMATIMEVQRFRPV

SPLGFPRFTNKDAEVQGYSIPKGTAAIVNIWELMHSTKYWEEPDKFKPSRFLSDDGRS

IIKPDAFIPFGIGSRVCLGEILAKMDVFLTFTFLLQRFTFRLPEDSPTN

 

>GLEAN3_20753

MASSGYVASSIFLGLQEISFRECIVFVMTCLFVRLLVSRGSFKKSTPPSNPPPPSPTG

LPFFGNLFSFQEDMRHLFRKWTNEIGGIFRLRIGPVMDIVVINDVEAVREAFVEKADA

FSDRMVPPLLGEALGKRGSLIWPAGGGWKKTRKFGHSAFRSMGLGHRRIEEKVVEEGR

ILVQACEMKEGEPFNPRVLITNAVGNMIACLVTGSRYDYGDPDFVKIVGDIERFSQNL

SLTSLGNVFPLLYYTPLYTDFRRPMQDIISYIKSMVKQHQQNFDPLNVRDIIDLYIVE

LKQQREQQIPGTVEENLEWRLSIELYIAGMVTTSDSMLWAILIVAQHPQLMKEIQKEI

DDVIGDERSPKMADQDALPLIVATLMEVSRIRPAVPLGLPRLTSRDATLQGYHIPKDT

GCYSLLYNNYNNINSSSVVIMNLWEVLNSPKYWDKPEEFNPYRFLSEDRRSVVNHPAF

IPFGIGLRACLGEKLARMQMFIIFTSLLQRFTFCLADDSRSAGMKGKAGMAYSPLKYD

IILKRRENSD

 

>GLEAN3_20754

MEMGLFKVERLRDGLICFAVFLLAKWLLGSKTRKSKLPLPPSPPGLPLIGHLLMFRRD

FRKVFRGLKEKYGSVFSIRVGPQRTVVMSEIDAVKEAFVNQAEAFSDRHMPPLLKQGL

GSAGSFAFDNGPVWKARRRLGLSAMRTFGVGKYSMEQRIIEEAKMLCETIENNQETSF

NPKGLITHSVSSIICGITFGQRFDLNDPEFNDLISRIHSALSSISFFTVGQLFPFLLY

TPLYSKFRNLISSLKTLVTRMVDEHGKTLDDQHIRDIIDLYILEIREQKKRGDEIVFD

ETRVWRGIFDLFAAGTGTTSTTLLWGILLMAAHPDAMEQVQNEIDRVIGDHRPPSMSD

REALPLTMATIMEIQRFRPVSPLGFPRFTNKDAEVKGYSIPKGTAAIVNIWELMHSTK

YWEEPDKFKPSRFLSDDGRSIVKPDAFIPFGIGSRVCLGEILAKMEVFLTFTCLLQRF

TFHLSEDSPTNIDAIPGVALIPYDFLIMAEKRV

 

>GLEAN3_20756

MANRSYVASSMFLGLQEISFRECIVFVTTCLFVRLLVSRGSFKKSTPPSNPPPPSPTG

LPFFGNLFSFQEDMRHLFRKWTNEIGGIFRLRIGPVMDIVVINDVEAIREAFVEKADA

FSDRMVPPLLGVAFGKRGSLIWPAGDGWKKTRKFGHSAFRSMGLGNRRIEDKIVEEGR

ILVQACEMKEGEPFNPRVLITNAVGNMIACLVTGSRYDYGDPDFVKIVGDIERLSENT

SLTSLGNVFPFLYYTPLYTGFRRPMQDIISYIKSMVKQHQQNFDPLNVRDIIDLYMVE

LKQQREQQIPGVVEENLEWRLVVELFLAGMVTTSDSLLWAILIVAQHPQLMKEIQKEI

DDVIGEERPPKMADQDALPLIVATLMEVSRIRPVTPLALPRLTSRDATVQGYHIPKDT

VVLMNLWEVLNTPKNWDKPEEFNPYRFLSEDRRSVVDHPAFIPFGIGLRACLGEKLAR

MQMFIIFTSLLQKFTFCLADDSRSVGMKGKAGIAYSPLKYDIILKRRENSD

 

>GLEAN3_20876

MAFMFSVWTFAVPVCLLYLIKKLWYWYNISKSDPNCSAPLPGGSMGWPIIGETFHVLF

QGPSFYEKNFAKFGRVFKTHLFLNPTVRVHGPDNIAKILQGENDIVTSTKPVRMAIEK

WCQSDEVLAANVLRHLSFRVAADVLCGFHYTDEDVTELCQKFDDIMKGIMTIPYDVPF

SRFRRALQARDFIKQKIDENVTRKANLDPEEDTQDAMRILIDLANKSDDSSSKEGLYE

AIKNQTIEFLTSGMLSLSSAASTMLYHLAKKPSVLAKVREELNDYGLLTDYSPLTLNK

INQCRYVMQVIKESLRMVPPIGATFRTALKTFELDGKQIPKGWTVFWSIRETIHLSDS

FTDTKNFDPDRFSPQRQEDKVAGRFSMPVFGHGTHSCIGQNFALLTLRILLIELARTC

DFETPNVDKIKFSWIPSPKPKCGMPTRFTRKILVDQNENEVSYSSKSSTCPQGEDSRE

RSEEGVHQRSTTCHAA

 

>GLEAN3_21087

MIDLLGDSSLVQVNGLTITATLGVVAISLALWTMVRPRNFPPGPRGLPIVGSLFSLND

NLEKIFTGWIKQYGDIFGFKLGEHWMVVLNHQEIIKEALLKQSVQFAGRPRSPSAEWM

TDGYKDIAFGMYGETWKLHRKLGHTSLRHFATGEALQKLLSTTFPKLEVEVAKTAGKP

VDPKNIISLLLNNVLSEMCFGKRFEMHDPKVLEFIKLNDEMNAEIGMGFPEDVFSILQ

YVPYKKRDIIRGFMNKIKELVIEQIGDHKETHTQGEVTDIYHLLLDAQEKAMAEDSNM

EKLSDEHLLWTIMDIFSAGIQTSTDTLYWALALMTEHPEIADRVRKEIDDVIGGDRIP

VIEDRGSLPYAEATLYEVLRYSSIAPILLPHATTCDTTLRGHAIPKGTTVMMNVYSMH

FDPREWKDPENFCPEHFLAENGKVRDHPPSFLPFGAGRRVCLGEAVAKADLFLIFTWF

MQNYTFSKVPGKESESLTNMIPMSAAGRLLVPYEIMINKRE

 

>GLEAN3_21185

missing N-term

MLKYDVEFSGRPQFYSAEIASDGFKDIVFSNYSDTWKLHRKLAHSAFRKFATGHKLDD

LVHGVIPQVQQMLTENSGKPFDPKLMITLTLYNVFSKMCFSESYDFSDPRMLGYITMS

QELAEKVGFGLVADYFPWARFLSYNGASYIQRMKQKYMEWTSPQLQRHRETLDPTGIE

TSMQRLYWAIALMTEYPDVQAKIQGEIEVEIGRGRHPVIKDRGNLPYTEATLYEMIRY

STVGPVGVPHATMVDTKFRGYLIPKGTTVLLNQYSMHFDEREWTDPHSFRPEHFLDES

GTTRLHPSSFQPFGTGRRSCLGEAVAKADLFLIFTWLLQNYTFRKVPEYSEENLLKLL

PETAAGRVLQNYKILIEER

 

>GLEAN3_21251

missing N-term

MPSMQGVQYALDGYIKKYGNCCGFYQFLTPAILVADPDMIKQIMVKHFDRFHNRMMSE

MVREPASRLVQNLKKVCGTDRQGIVELKELFGAYIMDTIGLCAFGLNVDSQKNKDDPF

VLNAKRILLQFRLYITGKIPGVSHILPLLGLTVFPPEVLEFFIAIIDQAVEQRKTNKA

AKTVDFLQLVIDSTKDNACEDEDDDDDATTNGTSSSSHAIDKKRKRQALTRQELTSQA

LAFFVAGYETTTTALCFTAYLLATNPDQQDRLIDEIDRLVPTAEDITYDNLKKLTCLE

NFISESLRLYPPGAL

 

>GLEAN3_21255

MEVLQQESPGSCLTVWLVFAATALLILYLTWNYTYFLRRGIPGPLPFPVLGNVLDLKE

GVQYALDGYIKKYGNCCGFYQFLTPAILVADPDMIKQIMVKHFDRFHNRMYIPVDNPM

LRSALFLSSDDQWKKIRHTLTPAFSGRKMKLMSEMVREPASRLVQNLRKACGTDRQGI

VELKELFGAYIMDTIGLCAFGLNVDSQKNKDDPFVLNAKRFFSSLDFTSPAMIISMLT

SKIPGVSHILPLLGLTVFPPDVLEFFIAIIDQAVEQRKMNKAAKTVDFLQLVIDSTKD

NACEDEDDDDDATTNGTSSSSHAIDKKRKRQPLTRQELTSQALAFFVAGYETTTTALC

FTAYLLATNPDQQDRLIDEIDRLVPTAEDITYDNLKKLTCLENFISESLRLYPPGALI
NRVCNQTTTINGHVFQAGDSVSLPIWSVHRDPEHWPNPEKFDPDRFNKENAAGRAPLA

WLPFGGGPRFCIGLRFAMMEIRTATVMLLQQFRLEVAPETEIPPVLGKMGFINPPNGI

TLRVVERSK

 

>GLEAN3_22109

missing N-term

MSDLALGFFSNDFSSLMSSKGSIFGQRWKQLRRFGHRALRHFGINQGSMEDSIIEEGR

LMAEAFEMKEGTQFNPKVLVTNTVGNVITRFVFGFRFEYGDPVFEKLAEDFELSFESF

SITSLGSVFPFLYYTPLYTNFRRPIQDVVSYIKSVVNDHRRSFDPLHLRDVIDLHMAE

VKGQRDENKMAAIEEGWEWKFIFELYAAGLATTTDTLRWAILIAAQHPQLVHEVQKEI

DDVIGGRSPKFSDQSLMPLTGATMLEITRIRPVVPLGVPRCTSRDAVVQGYHIPKNTT

VLMNLWELQTSPKYWKEPEKFNPYRFLSEDRKTIVSHRALNPFGIGLRVCLGEKLARM

QLFIILTSLLQRFTLRLADDSLSVGMEGKAGIAYGPLDYNIVLEKRHV

 

>GLEAN3_22110

METVKKSSLDFFADYAIYGREVLVFATVLLFVRWILLQWSYNKNSKPSAPPLPYPPGL

PFFGNMFSFRTDMRDILRKWTNDLGGLFRLRIGPMVDIVVISDVDAIQEAFVDQANAF

SERDIPPLLEVAMGTNGSIFGQRWKQLRRFGHRALRHFGINQGSMEDSIIEEGRLMAE

AFEMKEGTQFNPKVLVTNTVGNVITRFVFGFRFEYGDPVFEKLAEDFEHFFESFSITS

LGSVFPFLYYTPLYTNFRRPIQNVVSYIKSVVNDHRRSFDPLHLRDVIDLHMAEVKGQ

RDENKMAAMEEGWEWKFIFELYSAGLATTTDTLRWAILIAAQHPQLVHEVQKEIDDVI

GGRSPKFSDQSLMPLTGATMLEITRIRPVVPLGVPRCTSRDAVVQGYHIPKNTTVLMN

LWELQTSPKYWKEPDKFNPYRFLSEDHKTIVSHRALNPFGIGLRVCLGEKLARMQLFI

IFTSLLQRFTLRLADDSLAVGMEGKAGLAYGPLDYNIVLEKRNV


>GLEAN3_22432

MDLNQMTIYHLLVTALVGLVAVWIVKHLMTRPKNLPPGPTGIPLLGVAFKLIRSRSDL

LALFSSWADKYGDITSFRAGPKLVVHLNSYEVISEAFRHPDLQSRPHSQMLTEMSGVG

QLGLILGVGDSWKEHRKFTFSVFRSLGVGKKSYEDTIAAEMAQLGGAIEEKKGTAFNP

NVLFEQAVANVICSIVFGTQYQYSDTDFQHVLYLLNNIARLAGTGGWAFFIPIPGISK

TPFGFVKQFVNTVDRLLAFSQAQLDSHMKDLDPDHPKDLIDQYLIKLEETQGAASYFD

QLNLKYLISDLFIAGTETTTTTLKWCFIYMMAFPEVQSCVQAELDQVVGRERMPGWQD

RKNLPYTCAVLMEVQRKGAVGAMGVPHVAAADVTIKGYTIPKDTIIFSNIWNVLNNKN

FWNDSDAFTPERFLSEDGTLVKRDDLIIFSTGRRVCIGEQIARMETFLGFTSLLHRFT

FKKPDNSPPLSFDGVLGGTRSCLPYTTCALKRE

 

>GLEAN3_22590

MDFGISLTTFLLGMVTFLVTLMIVWSRTGRTYANLPPKGPPEWPIIGSLLSLAGEHPP

HQILANMAKKYGPIFSMQMGSFYAVVLSDYSLIKQAFAKSGDDFSDRPKIAMIEHLAE

GKGLISGYNSPAQIEQRRFTFSALRSLGMGKFRMEETISEEIQHLIKSFTERNTQPFI

PFHDITVSVSNVICWLNFGKRFEYNDDDFKGVLEALFGIVEITEIGGVFNFLPFLRFL

PGSGLKKCFKHKAIFDKHMYPLIRKIKQSYDDKFESAECYIHMYADKIAEAAKQNIVP

NLFNDGNMVLAVADLFVAGTETTATTLKWALLYMVVNQDVQKRVQEELDRVVGVDRLP

SLLDKPNLPYTEATLLEIQRKASIVPLGVPHAPVQDTILYGYDIPKGTVVLPNLWAIH

HDPNLWKNPDEFNPDRFLKPDTKHVVQREELIPFSIGRRRCIGEELAKVELYLFFTHL

LCRFHFILPPGAPQPTMEGRSLATFVPHDYSLCAIPRCRDKDFEES

 

>GLEAN3_22593

MYASVEKMSSSVFYYGFTPFILAGLIAFLLSFILVRSGKKKYKNLPPRGPREWPVLGG

LPSLAGTDMPHVTIADMAKKYGPIFGMKMGSFYAVILSDYSLVRQAFAKSNDEFSDRP

KITMIENLVQGKGLIVSYFGQMQTEHRRFSLSALRSLGMGKFKMEETIVDEARQLATI

FASRQSKPFVPFHDIVVSVSNIICWLSLGKRFDYNDESFKAILASLFESFEIAEVAGL

FNFLPFLRFLPGSGFKRMIALQQNHDRFMKPLVLKVHEEQEKSRRSPTCYVEMYTEQM

IEAERETPGKHTFTKENMYHAVADLFAAGTETTATTLKWALMYMLLHKDVQEKIHKEL

DDVVGRNRLPSLTDRPSLPYTEATLLEIQRFATITPLGVPHAPVQDTVLNGYDIPEGT

VILPNLWAIHHDPDLWKNPDEFNPDRFLSPDTKQVVQREELIPFSIGRRRCLGEQLAK

VELFIFFTHLLHRFQFTTPEGAPPLTMKARNGATLNPEEFEICASLIDDVM

 

>GLEAN3_23067

MSLLRFLLRISTKANRQTVLLTVASVFLALLTAYRHTLTRTKHGRRRRRLPPGPTNYP

IVGILPFLGRDDPGKSVLELSKKYGPVFYGRLGNFDAIFLNDYESVREAFAKSGDAFS

DRPRFTTFEAWSGGRGVVCCYYQNHWKEQRRFLLKLLRNLGMGRANNSLEDRILQEAV

LLTRSIGETNGESFDPRIPMTMAVSNIICGLTFGKRFDYDNKEFEKFICTLERCFEMT

DIAGPTNYFTLLKYVPFSSLKKMGAMARSLERGLFTREKDAHKLILNTNDCKDMIDFF

LKEVRENESKEGRGEAPNSSFKEEFLPSLIGDIFAAGTDTSACAMYWVMLYSIKYPKY

QRRAQEELDRVVGRHRLPRQSDRPNLPFVVAMLAESLRCSAGGPFGVPHAAVEDTTFR

GYDVPKGTIMIANHYALLHDPEVFGVDTEVFNPCRFFDDDGTVIKSSKTGLAVSRRAC

PGEQLAQSERFIFFSHLIHQFDFSAPDGPDSVSLDSHFGLSRSPFPYKLRAERRPIDL

QLPDITL

 

>GLEAN3_23068

MKKYGAASPKTLKSTLQFLLRLMGNRRHTTYILALISLVLATISLRIGGQGRRRRRGK

LPPGPMGWPIVGMLPYMDPENPAQSVWDMSKQYGSVFCGRLGSHLAVFINDYETIKEA

FSRKDDAFSDRPRITMFEVYTNGHGVATCYFDNHWKEQRRFAMKAFRHFGVGKPDSVF

EEYLMRESKHLLHAYSRKHVAFNLKPFLDMAACNIISSMTFGNRFSYTERTFQQFLQA

LDDIFHAADVAGVTNYLSFMKYLPFSGYSSLDRRSRELESGLFVREREGHKRTYVASE

TRDLIDAFLHEIRKKKEEGVDSDKTGFSDKMILHFIADMFAAGTDTTSNSTQWLLLFA

AKYEKEQLRVQEELDRVVGRDRLPTLSDRKNLPYTEAFMVESMRCAVGGPFAVPHAAV
RDTEFKGFFIPKGTVIVSNILAVLRDPKMFPEPEEFRPSRFLSEDGNLRKDLIERVSA

QFGVGRRTCIGDQMARAERYIFLTHLIHTYNLSGASGPDSLSLKSHGGLTRNPVPYDV
ILTERTPTVASAKKSHGF

 

>GLEAN3_23808

ELWNLTYFWRRGIKCSRPLPLLGNILSCTKGFYSYIYTCQQNYGNIFGLYFFRSPLIV

VCDPDVLRDVLVKSFSKFSNHYQFFLKNRPLDQGLLDLRDQRWKDVRSVISPTFSALK

MKQMSPVINECCDVLVTNLKKKQGEDINVNSVFEAFTMDGIAKCAFGLQVDSQNNPDD

PFVKHAKRIMDGTVESPVAMIAGVFPPAAYLFNALDIALFPAETRKFFRDVVVKAIEL

KKRSAGNERRDFLQLMMNAHNDDEHDSRSKVIKDDNDLHNLIEDHQDKHVSNAQNTKT

QLTMDEIFAQAVLFFVAGYETTNVTLGFLAYALATNPDIQDRLIEEVDEVTPTRDSVD

YNSIAKMSLLDMVVCETLRLYPPAVMGDRCCSETHTVNGLTIEKGVQFLYSIYNIQRD

PTLWPEPEKFDPSRFTKENRANRHPFAWIPFGAGPRNCIGMRFALMEIKMAVVRILQK

YRFVPSPKTDIPVKFGTGNTLKPDGGIFLRVEERQ*

 

>GLEAN3_24078

MLPPNINQNGTQNKLRITIMIDATAFLTRGTAFLAGETIECAITFSCPEATGSGQPEC

LAWASAQLHCQCNVKESKVIIPPQPITRRPSEEWENIQSSSNTSFAASRGLDEATIFE

GANHTDAANPFLEGDRPKNTLLDRAVQILTTITSRKSPSVAFSNGDVWKEQRRFLLSV

FKRADAKGIRFEEIVGAQAERMLTEIEKTKSARFDPQPIINTSIGSIITKIVSGITYE

FDDDDFHRLVTHSNRVFDLLGPAGGILSSVPILAAVPSPAKRQLTKEWDTMYDFIKKN

IASHKSEFNPNQEAADYIESYLHEIAKRTTENNKGSFDEINLLECSFDVLLGGWETSA

TTISWSLHTLACYPEAQNRVRQEIFDVIGQERDPRFSDRHQMPIAMSTIAECTRLRPA

FDFHLPHVAEQDCQVGEYDIPKGSVVTTNMSHLFLSPTLWEEPLEFRPDRFLTEDGQF

DRKREPIIFGTAFHPQSGRRRSKRSPMLPQVHQSTGSLPDTC

 

>GLEAN3_24113

MFIPRINHFAMEFDWHQLTPYHLLATALVGLMAAWFVKHLMRLPKNLPPGPTGLPLLG

VVSKFFGTSDPLALLSEWADMYGEITSFKIGTQLVVLLNSYDAVAEAFRHPDLQDRPR

CRIKEELSGCGRNAGLLIGVGQSWKEQRKFTLSVFRSLGVGKKSYEDTVAAEMTQLGR

AIKEKKGSPFDPNILLGQAVANVICTIIFGTQYTYSDTEFQQLLSLLNFSMRRSGVGG

WLFFIPIPGISKVPFGVIKTMASNAKEFYAFIDDHIETHKKNGVPDVPRDFIDQYLIK

LKEMKGTKSSFSHHSLKSVIADLFFAGMETTTTTLKWFILYMMAYPEVQSRVQAELDH

VVGRERLPGLSDGKNLPYTCAVIMEVQRKGAVVAMGVPHLASVDVTIRGYTIPKGTII

YPNIWKVMNSKNFWDDSDAFKPERFLSEDGKLTNRQELIYFSTGRRICPGEQLARMET

FLGFTSLLHRFTFKKPDNSPPLSLDAILGGTRSSLPYLTCAIPRDPAENVA

 

>GLEAN3_24320

missing the middle between KYG and I-helix motifs

MFDLSATSVFLGGVIFLLCFWVVRRPRNLPPGPWSLPIIGYNFKPGLIHEAFMDLAKK

YGPIFSLRRGSFVFVILNDRESITQALVKSGEFFSDRFVPGHFNWGIPDPGKKGGTDT

TAVVTSWAILYLSVHPEVQRKVQTELDTVVGRGRQPNTLDRPDLPYCNATLTEVMRIR

PVLPVSIPHMTSDNVSFRGFTIPKGSIIIPNLYAVHHDPKEWNDPDTFNPDRFLSADG

KQFQMNDAWMPFGVGRRDCVGSQLAKMEAFLLFTNLFQQFEFKLPPNQPTPSMCGANG

ITMKPKPYKICAIER

 

>GLEAN3_24836

MASMAQQILHRHVGTMVTSEHLEDAKPYDEIPGPKGVPFFGSLFDYTGLGPYKLDKLH

EATVDRFRNFGPIFKETLAGVTHVHIIDPTDVRELFRNEGRSPKRTPIDAMVKYRRTR

KRHIGMANTEGEEWQRLRKPVQHLLMKPQSVYAYIPIMEECADDFVTLMKETQDIESQ

EVPEFNHKMQRWTLESVCAIVFDTRMGCLNTGLDNNPEALDMIVSVSNFFDSLKDLTF

SFPFWKFGITTKAWTKFTEAQDYFFGMSKKYATIALDRMKSAVGTSTQSESEGTFLEC

LLARRDFDIDSLCDLMNDLMIAAVETTASTLAFNLYCLAKNPEVQERVFQEINQAIPP

GTKITAQSLQNLPYLKACIKETVRVFPTVDGTNRIPSREIALAGYRIPPDTIVRIHCI

AGLMEEYFPSPEKFKPERWLRGDAECENIDPYLILPFGYGSRMCTGRRMAEQDLYIML

IKIVQNFKIEWHHEEDMELIFRLTNVPDRPAQFRFVSRT

 

>GLEAN3_25299

missing some seq between pro rich region and C-helix

MIVDLVRDVFDVGFSSILLGIATTLVLVWMMKWMSTRRRLNLPPGPMAMPWPLNFVMD

LLLLLQAKGGIVFSEGDDWKQHRRFGLAALRNFGMGKKSLENSINTEARILNEVLSGK

IGKPFDPSFVINNAVSNIICAITFGRRFEYSDPKFKEMIQRINLFAAEDPGILSDLPT

FISTARRRNLLEVRKFLEEEINEHKSTFEPSDVRDVIDLYLKEIYRAREAGEKNELDL

TKAWPLVFDFFLAGTETTSTTLLWAFLLMAGHPEVQEKIVAEISSVIGSGATPRYEDR

KLMPYTEATLVEVLRYRPIAPIGVPHRATSDLKVKGYDIVEDVIIVINILHIHHDPKI

WGDPEVFRPERFLTEDGTALIKHEAYMPFGVGRRVCLGEQLAKMELFLFFTNILQQFK

ITMPPGVQPNYDFGHRALTTLVPKAYEILLEER

 

>GLEAN3_25595

missing C-term after EXXR motif

MNDLSDVQYGDVFSFTMVGKTFTYLIGSEASALLFNSKNENLNAEEVYSNLTVPVFGK

GVAYDVPNPVFVEQKKMLKTGLNIQQFKRHIPLIEEETREYFNRWGDSGEKNLFVALS

ELIILTASRCLHGKEIRSMLDEEVAQLYADLDGGFTHLAWLAPSWIPFPSFLRRDRAH

RKIKQIFYKAIAKRRETGVENDMLQTLIDSKYKSGRPLSDDEIAGMCIGLLLAGQHTS

STTSAWLGFFLARDKEVQDRCNAEQIKVCGDASTEVSYDQLKDMQLLDHCVKEALRLR

PPIMTMMRVAKSPLVSVPIIGPVVPRES

 

>GLEAN3_25829

MAMILTVLGYVALLICVILGLTIAVLSAAILRTEYVLKDVPTPKNKSFFLGDIKTFGV

WSEKIGMSVANILYLMTKDLGPVYYVRVFWKAHFFASAPEIVKELLLNSRHIKSQEEY

HSFQKLYGERFMGHGLVSEVDHEKWFHKRAIMNPAFRRKYLISLMEEYNSGSEALCQK

LKEKADGKIEVPMLDELNKVTLDVIAKIAFSMDTNAIGDPDCPFPAAITAALQGLQSL

HERPWSTIDPRCISMRRSTRKAVHFIRETGRDQIKERIQARKRGDHVPSDLLNLILDI

ANDLEGDKDFEMENMLDEFVTLFIAGQETTSNLLAFTILQLGRHPDVAKKLQAEIDEV

LGQKHKIQYEDLAKLEYMMRVLKETLRLYSPVGGTTRLTAHPINYNGITIPAKCSLTV

MSYVTSRMEEYFDDPLLFNPDRFKPSEDGSLPRHLYAYFPFSMGQRSCIGQQFALIEA

RVILAKLLQRFEFRLEQSQSMGILDELTAKPMGRCRNYITLRA

 

>GLEAN3_25830

MAMILTVLGYVALLICVILGLAIAVLSAAILRKEYELKDVPTPKNKSFFLGDIKTIGA

WREKELLLNSRHIKPQEIYHGFQKLYGERFMGHGLVSEVDHEKWFHRRAIMNPAFRRK

YLISLMEEYNSGSEALCQKLKEKADGKIEVPMLDELNKVTLDVIAKIAFSMDTNAIGD

PDCPFPAAITAALQGLQSLHERPWSTMDPRCISMRRSTRKAIHLIRETGRDQIKERIR

ARKRADHVPSDLLNLILDIANDLEGDKDFGMENMLDEFVTLFIAGQETTSNLLAFTIL

QLGRHPDVAKKLQAEVDEVLGQKHTIQYEDLAKLEYMMRVLKETLRLYSPVGGTTRLT

AHPVKYNGITIPAKATVTVMSYVMSRMEEYFDDPLLFNPDRFKPSEDGSLPRHLYAYF

PFSMGQRSCIGQQFALIEARVILAKLLQRFEFRLEQSQSMGILDELTSKPMGRCKNYI

TLRA

 

>GLEAN3_25863

MGVDPVRDVFDVGSSSILLGIATSLVLVWMVRWMSTRRRLNLPPGPMPVPWPLNIVTE

LLLKLQGTNPFVLLGKLAERYGEVVYIDNGGRRFVFLASPEAVNEVLVKQADITSGRE

DFWELKAVTHYHGGIIFSEGDDWIQHRRFGLAALRNFGMGKKSLENSVNAEARILNEV

LSEKIGKSFDPFFAMNNAVSNIICAITFGRRFEYSDPKFKEMIQRINYFVAEDPGPMS

NLPTFISNARRRNLVAVRKFLEEEVNEHKSTFEPSDVRDVIDLYLKEIYRAREAGEKN

ELDLTKAWPLVFDFFLAGTETTSTTLLWAFLFMAGHPEVQEKIVAEINSVIGSGATPR

FEDRKLMPYTEATLVEVLRYRPIAPTGVPHRATSDLKVKGYAIPDGVNIAVNILYIHH

DPKIWGDPEVFRPERFLTEDGKALIKHEAYMPFGVGRRICLGEQLAKMELFLFFTNIL

QQFKITLPPGVQPNYDFGHRVTTLVPKAYEIILEER

 

>GLEAN3_25956

missing N-term starts with C-helix

MVLYDLFEVYYREQTTASSVIWENGDKWKSSRRTTLTAFRKSLSSPSTLGATVKSSWQ

HLRVSLQSLNGQHCNPRSEITSAIAEIISSLCFGHCVDVRSDPKIADIMQAQHNISRG

LSGISLGHIFPHLYHTPLYSKLRRDIEQYTQFMREVVKDHKLRLVGNDDGKCKDGNRK

EYITDVTDCYLQQTQHGGKVAVEEEDVWRVMFDLMGAGTETTSETLLWGILYLCLFPD

VQRQVHSEIDENMALYGPDDVINYDQRKSLPYTQATILELLRCACVAPLSLPRVTTRD

LTVLGYSIPRGTEIWVNEWAINRDRNVWNKPHDFNPARFLDDSGRVDPKLKESIISFG

EGRRMCLGRCLAEHMLFVFFTNLVRWFTFRLPEDCRETPNVNEGNFGLTYSPPQFDVI

ITPRVFGN

 

>GLEAN3_25969

missing N-term

MTLWYMFSKRSSYDSPLQNDILNLTDKAEKPSGSGNPVDFLPWLRFITTKYKTTLQEY

RKKFGKLFSGVIEEHAVLYEPGSERDVIDHLISTGHNSGEEELQRLGLAGKALYQSAF

DFFPAGTETVLATLEWIMVYMVTYPNVQSDVQHEIDQFVGRGRLPTLNDRDNLPLTQS

CLLEIQRHAAAVPFAVPHSTTQDTVLDGYYVPKDMLVFVSLYSAHFDPDVWETPEVFN

PRRFLAQDGTLDEEKTKLAIPFSLGPRRCLGSGVARIQLFIYFTTFLHQLNFKCPEGQ

KASLDATYKFPRRPKVFNVQIDPR

 

>GLEAN3_26188

missing C-term after EXXR motif

MWRRPLLLVRDLDMLRDIMVKDHHRFYNKFPLPMSSGSFDNALFLLRDAQWKRVHDIM

TPTFTGKKMKMMSGILNEGADDMVEKLLQSCKEDGNIDSLDNGLPYAYSVFTFQCGIY

PSLLYIKIKLNGLKKIVGIIEKRVDFLQLLIDAKNGKSKMANENDEDDIHNKYFKDAG
ADENILTKTQKYMTREELLAQSMVFLVAGYDTTATLLTFASYLLASNPDCQDKLIAEI

DDIAPKRDDVGYTTISKMPYLDQVVCEVLRIYPPAVV

 

>GLEAN3_26225

MTYFKRKGIKGPSPYPVVGNSLDLFQGIHWTTQDYVKKYGNVFGMFMFSSPVFFISDL

EMVKDITVKSFSKFMNHKTFVLKSRPFDKALTGLKDQHWKDVRNVITPTFSAVKMKQL

SVLLNESSDTLVKNFGAIQEKDGKVKVFGLFEAFTMDGIAKCAFGIQVDSQNDPSDPF

VTNAKDLMKNRFSLSTMIAVTFPRIGKIMDYYGVSTMSPKISSFFISVVDQAIAARKE

SGGSKMKDFLQLMLNAAQENKSEKEMIEEKPQDEKVEGDLHALIEDQGNKSYFDLKKS

KKTTLTKDEIYGQAIIFFLAGYETINTTLGFLTYSLATNPEVQDKLIEEIDNETPTRE

DVGYNSIAKMSYLDNVVCEILRLYPAGVIVERQCNETHVCNGLTIPKDSQVMFPVFAI

HRDPAIWPDPEKFDPERFTKENREGRHPLAWMPFGAGPRNCIGMRFALMEIKMATVRL

LQKYRFETCPETQIPPKFAKMSLKPDDDLYLRVVERS

 

>GLEAN3_26360

missing N-term starts at C-helix

MGFELLCTKATIAFSNGKPWVDLRRFALPALRSFGFGKQSLVPQINLEARYLSEEIRN

LRGEPTDLATTFSKATANIICQLLFSQRYEYSDGEMSRILKSMDEFLSLIHETDLVNI

FETLIHTPRYKPYRESIFSDRDFIMSHLNSHRETFQKDNIRDFTDAFLADDISKEFEL

EHFWRVLLDLFIGGTDTTASVTSWTILFLSVHPDVQRKVQAELDTVVGRGRQPNTLDR

PDLPYCNATLTEVMRIRPILPVSVPHMTSDDVSFRGFTIPKGSIIIPNLWAVHHDPKE

WNEPDTFNPDRFLSADGKQFQKNEAWMPFGVGRRDCVGSQLAKMETFLLFTNLFQQFE

FKLPPNQPNPSMRGANRFTMNPQPYKICAIER

 

>GLEAN3_26373

MAMAYRRLRHRLLLPPGPWFLQLRMVYYMLTMRLPCRLEKIYEAQSKKYGKMFSFSLG

SDIKIVISDPDIIREAFGKHRAAFSDRRSPSVYAKSFGGEHNHRILFANGRLWKTSRR

VVAPCLRNIGTWGQQMESIILREADSLTDKLAELGSKPFDPRHHLTRAVANIIHMMVH

GYNLDPDDPRLQQMLDSFQLVVTHPMVTSNTLRYLPFLYYLPTTRPLRQAFQLLKTML

STAVSTQANGYDQSSINTIVKRIMQAAEEDDDVGEDRLKTKEVWRLVYDLMLAGSDTT

SNTLKWAVLVMAYYPDIQKKVHNEIDCVLGERSPCLQDKKSMSYIQATIMEVLRVRPV

LPLALEHLTSETTVLRGYTIPKGAILIPNVWSCHHNPEYWDDPEVFNPGRFLSEDNQS

LGEEPHAFLPFSFGPRKCLGDQLAKNELFLIFAKIMQRFHIDFAPGDRPESFDEGFDK

GLILRPNNMKVRLQPRKIYSHEE

 

>GLEAN3_26477

METKVLPFDVNSYTLGVAFCTCFLSWVAVKRYRTEAKEESLHSLPPGPPKWPLLGNIP

GMMMAGSSSKYLSDLVSEYGPVYTLYMGPRPVVVLNKYDIMKQALMDGHLLSDRGDLP

ALDDYFKGRGMMMAGSSSKYLSDLVSEYGPVYTLYMGPRPVVVLNKYDIMKQALMDGH

LLSDRGDLPALDDYFKGRGIVGSHYTEYWKNTRSFVIHSLRQFGVDRQSFESVIASQA

DILVEDIRKRGTFDPLLCLQSNVCSVICGVVFGKQYNLDNPRFTDLLETVYKALRAIE

YAAVSSLVPGLHYLPGCSYQKFYACMDKLMRFFREEINAHDETLDPDSPRDVIDMYLS

KIRRASPEDPTYGLFSKTNLEFVVNDLFMAGTETSASVLYWAILLIASNPGIQERLQK

EVDDVTTHCDHVSIADAASMPYTRATIDESLRYASVAISSMRSTTSTMTLGKYRFEVG

TWVMLNYEYAMFDPDHWSEPKRFKPERFLDDHGQYCPDEQLIPFSIGKRNCIGEKLGR

MELFIFFTNIIKNFELVFPDDAPHPSVEAATGGMMRVPNRYSITFKDRTTTAA

 

>GLEAN3_27152

missing N-term starts with I-helix

MSELMLKFLCFIAGIQTTVETLYWAMALLATYPEIQAKIRAEIDDVIGRDRLPTINDR

GKLVYTEASLYEVLRYSSIVPTNLHATTRDTEFGGYHIPKGTVVLINTHSMHYDPQEW

DQPDKFLPEHFMDGSGTVREHPPSFLPFGAGRRGCLGEAVAKADLFLIFSWFLQNYTF

SKVPGKESEDILNMIPQTAFGRLLISYEIMINKRD

 

>GLEAN3_27153

MIDLLGAGESGFGQINALTITATLGMVTATLALWTIVRPKRFPPGPRGLPIVGSIYSM

NDSPEVVFGEWAKKYGDIFGFKAGERWIVVLNRQALIKEAVLKQSVGFADRPDFLSLD

IFSDGFKDIAFSPYSETWKLHRKLAHSALRHFATGKPLQDLISSVYPKVEKKLAMTEG

QPIDPKVLITLLMYNVLAQMCFGRSLLLKAQEDARKEGENVDKLTDTHIFQTIADIFT

AGILTTVETLYWAMALLVTYPEIQAKIRVEIDDVIGRDRLPTINDRGKLPYTEASLYE

VLRYSSIVPNALPHATTRDTEFGGYHIPKGTTVIINTYSMHYDPQEWDQPDKFLPEHF

MDGSGTVRDHPPSFLPFGAGRRGCLGEAVAKADLFLIFTWFMQNYTFSKAPGKESEDI

LKMIPQTASGRLLLSYEIMINKRD

 

>GLEAN3_27613

MHLTCTVVKVVMVVGMLMVMTPNSDALSGKTPPGPGLLGSFTQLKRVVSDFFRSRSAA

EMISEKEGGLYQLNLGLSKICFINSRGFRDEAFLEKYHLFSDRGVESNKKQLPEELMQ

WRGTIRDNGAASDEARAFVNSTLKETSYLRTTVETQIEEAIAFFDDELTGAIPSDLVV

PKMIFREHVLDVMTRIAFDTAGHRNTISGQWQFQKFFEDVDALSNSVLFQPGFISTVL

PNTKVDDYETNLVGVAKYLKERAEEAIYTNRETSLVQRYSDFFLKRESSIAQPEDGWR

LMYDFFTAGVNTVPEVITWDIYYLASDLDLQDKIRQEMKNFGTDVTYDDRTSLPRTMS

FLTEVHRLRPVATLGIPHAASADGTIGGYAIDKETLIIPNHWAMHNDPSDWQSDPQVF

DATRFLKLNGNDLTFDSDKARIVTPFSVGPRSCPGQEVARRTIFLTIANLVQKYTVRF

PEGPVVDVSNVKGSFTMYAPKFEVLLEPRSRPPPQRN

 

>GLEAN3_27796

missing N-term and some middle seq starts at C-helix

MDSIASCGFGLKVNSQEDKDDPFVEHVARFFKVSFTSPAFMAVSFMPFLKHIFEYLNI

TLFPRDVIEFFSDVTEKAVALRENNTSEKRADFLQLLIDAKNGRTETANENDDDDIHN

KYFKNAGTEDDVPTKAQKYMTREELLGQAIIFLAAGYETTSALLTLTSYLLATNPDHQ

DKLIAEIDDIAPKRDDVSYSTISKMPYLDQVVCEALRIYPPATLTDRECGETCTYKGI

KIEKGAHIWIPPYTLHHDPHHWPNPTKFDPDRFSKENREGRNPFTWIPFGAGPRICIG

MRFALMETKMALVRSLQTVRFEVSPLTKIPPDIGQSQILSSTASYMLKVVRRH

 

>GLEAN3_28152

MAIGIGIIAFIMALVAIVTVPLLKFIRLWIVFNKFHGPPAIPLLGNALQFKQDPNELV

EQVSSWVEEYRTKSKGLMRLWLGPFPVLFIYNPHEVEPGKEPTRHIYEESEGKRLGRQ

ITGKKWFHRRKMLTPTFHFAILHNFIEVFDEQSNTLCQKFGEHAQKQDLVNICPLVTL

CVLDIISETAMGKQLHAQDESNNDYVNAVMRMSDIIQQRQKKPWFWPDPLFDNTKSGK

EHAQCLQILHKMTNTLIDERSKEENQKVDSNDNDGSKRRIAFLDLLLKMQRDDPSFTK

SDIREEVDTFMFEGHDTTAALASWAMFLIGHHTRVQKKLHQELDSVFGDSDRPVTADD

LQKLPYLTCVLKETLRIFPSVPIVGRDLQEDCIIDGKLAPRGTLLIIAIGSLHRDPTQ

FPEPLMFDPDRFLPEFSSKRHPFSYVPFSAGPRNCIGQRFALMEDKVLLANVLRCFSL

ESTQSLKDTMPIAELILRPSEGIHMKITRRKGISNLDSKLLQRKQNNQRSRLVTV

 

>GLEAN3_28699

missing N-term

FVVLNDRESITQALVKSGEFFSDRFVPGHFNWGIPDPGKKATIAHSNGKPWVDLRRFS

LPALRSFGFGKQSLVPQINLEARYLSEEIRNLRGEPTDLLTTFSKATANIICQLLFSQ

RYEYSDGEMSRVLERMNETLTLIPETDLVNIFEPLIHTSIQRYKRYRESLFSDRDFIM

SHLKSHQETFQKDNIRDFTDAYLADDISEEFELEHFWRVLLDFFAGGTDTTAVVTSWA

ILFLSVHPEVQGKVQTELDTVVGRGRQPNTLDRPDLPYCNATLTEVMRIRPVLPVSVP

HMTSDNVSFRGFTIPKGSIIIPNLYAVHHDPKEWNEPDTFNPDRFLSADRKQFQMNDA

WMPFGVGRRDCVGSQLAKMEAFLLFTNLFQQFEFKLPPNQPTPSMCGANGITMKPKPY

KICAIER

 

>GLEAN3_28922

missing C-term

MGFTETEPGYWLLAGVLGLLVAWLVKHLITGPKNLPPGPTGLPLLGVAWQIYKGGSDP

LALFAAWSKKYGEMVSFFVGPKPIIILNSYPVIFEAFRHPDLQDRLGSRLLQEIIGLK

NSGIAFSNGDVWKEQRKFTHSVFRSLGVGKKSYEDTVSAEMTQLSSVIKEKKGSPFDP

STLFMQAVSNIICSIVFGTQYTYDDEAFKEIPFGKMKLMTNNIRSVNAFIQSQIESHE

KNRDPDNQRDFIDQYLNKLDETKVK

 

>GLEAN3_28934

MFFLSSTSVLLGVLGFLFCLWFVRRPKNLPPGPWSLPIIGYRFDSGLIHEAFVDLTRK

YGPVFSVRRGPFLFVVLNDRESMKQALVKSGEFFSERFVPGHIRWGIPDAEKNASISW

SNGKPWDDQRKFSLPALRSFGFGKKSLVPQINLEARYLADEIKALHGEPMDLSAVLNK

ATANIIVQLIFSRRYEYDDPEFIGALEAMADIFSLVSDTDPVNVFESLVHTPWYKPYR

DCTSKLRDFIMSHLNNHQETFQKDNIRDFVDAFLAHDISKDYSLDKFWRILLDFFAGG

TETTAVVTSWAILYLAVHPDVQKKVQTELDVVVGRGRQPNTNDRPNLPYCDATLMEIM

RIRPVIPVSLPHMTSADVSLGPYTLPKGTIVIPNLWAVHHDPKEWCDPHLFNPDRFLS

ADGQTVVKNEAWMPFSIGRRDCLGMQLAKMESFLLFANLFQQFDFKLPPDQPTHSMRG

HPGLSMPPESYKICAIER