ࡱ>  bjbjв 4~к_к_`````tttt|t||||||||HJJJJJJ$m~n`|||||n``||   |`|`|H |H   `ӔPvj 40  ` (|| |||||nn||||||||||||||||X &: >Cs4V5a MAVALFCIVVLLAVIAVLNRREIARLVKNRILLSSIPTMRERAYPIIGHSYLLGGDPAKFFEFI STGAKAMVTELKEQIAVIWLGPVPVIVVCHPAAAEVILRSSKHMEKSYLYEFLKPWLGTG LLTSGGEKWKQRRRLITPSFHFNILQDFLEVMNEQSATMIRTLEGKRTAGSAINVGKAIT MCALDIICETAMGQTVNAQENDDSDYVKSLYR (2) ISELIQLRQKTPTLWWDAAFSRMKVGKEQESLLRTLHGFTRDVIIERAKTKDSKKCRGDAF (0) NPKRLAFLDVLLHAETEDGKTLSLNDIQEEVDT FMFEGHDTTAAAMTWAIYALGRHPEIQQKVHEELDSVFEDDR (2) FGAITNAQLQKLSYLERVIKESLR MFPSVPMFARVLSEDCKLGGYSVPKGTQAIIFGYTIHHHPGIWEDEERFEPDRFLAENCV GRHPYAYIPFSAGPRNCIGQKFAMMEEKVVLCHLLRRYSVISHDKEEDLLINADLILRSSTPLNITLKPR* >intestinalis seq 134, 141 ortholog to Cs4v5a 75% MAVALLLIIVVTCVIALHYRKGIARHFKNRLMLSRLPSLPDPAYPLIGHAYLLRGDPS RFFEFVTETTNAMVTEVNEKIGILWLGPVPLMVVCHPEAAEVVLRSSKHMEKSYLYKFLHP WLGTGLLTSGGEKWKQRRRLITPSFHFNILQDFLEVMNEESGKLINN LKKKLKSGVKVDVGKAVTMCALDIICETAMGQTVNAQENDDSSYVRALYR (2?) ISELIQLRQKTPTLWWDPAFSRMKLGKEHENLLSILHGFTRDVITKRAKTRDQKIVE (0) NPRRMAFLDVLLHAETEDGKTLSLNDIQEEVDTFMFEGHDTTAAAMTWAIYVIGRHPDIQ KKIHEELDAVFGEDRGGTITNNQLQKLSYLERVIKECLRLYPSVPFYARVLSEDCKVGDY MVPKGTQTVIFAHTIHHHPYVWEDPEKFDPDRFLAENCVKRHPYAYIPFSAGPRNCIGQK FALMEEKVILSKLLHNYFVVSHDKKEDLVINGDLILRSSTPLNITLEARK* 730 >Cs4V5b MAVTLLIILGAFVVPF ALYWRKILKEISNRHRINKLPSVNSPAYPLVGHSYMFPRNPDEMYDLITSWLKWVMREAK QKLAVVWLGPVPLVAIVHPDAAEVVLRSSKHMEKSFVYRFLHPWLGTGLLTSGGEKWKQR RRLITPSFHFNILLDFLEVMNEQSTKM LRNLDAAVQSGSKVNVGRAITMCALDIICETAMGRTVNAQSDQESEYVKALYR (2) ISDIIQQRQKSPALWWDLAFSKMAMGREHDATIDVLHRFTMNVISDRAQSKTEIKVGWIF (0) DTRRMAFLDVLLRAESEDGRSLSLSDIQEEVDTFMFEGHDTTAAAMTWTIYLIGRHPAIQARIHEELDD 553 552 VFGGDMGTITNSHLQKLSLLERTIKESLRMFPSVPFIGRVTTEECSVGSHSIPAGTQVA 376 375 IFIDSLHHNPSVWPDVDRFDPDRFLPENCVGRHPYSFIPFSAGPRNCIGQKFALMEEKVL 196 195 LTQILRKYSIHSHDEEEDLRKQADLILRSSTPLNISLTPRA* 70 >seq 115 ortholog to Cs4v5b N-term 69% MALTLFILIGALAVTIAVYWKKIKEQFR KRQLINKLPTIMETAYPLVGHSYLFPKSAEELFVFITGRLNWVMTEAVEKIAVLWLGPIPL IGVVHPEAAEVILRSSKHIEKSFVYTFVHPWLGTGLLTSGGEKWKQRRRLITPSFHFNIL QEFLEVMNEQSKKMVDKLDASIASGSKIYVGKAITMCALDIICETAMGQTVNAQDHQDSE YVKALYRISDLVQFRQRTPALWWDAVFSRSKLGIEHDTILCTLHGFTRNVITERAQGKGK KEIENPRRLAFLDVLLNAETEDGKSLSLNDIQEEVDTFMFEGHDTTAAAMTWTVYLIGRYPD IQEKLHEEIDSVFHDDKEGVISNSQLQKLSYLERVIKESLRL YPSVPFIGRVTTEECVIADHVIPVGTQVALFIES MHRNPAVWPDAEKFDPDRFTAENCVGRHPYAYIPFSAGPRNCVGQKFAMMEEKVILAQIL RRFSLVSHDKEEDLKKQADLILRSSKPLNITLTPRV* >best matches to CYP11A1 G126P69032RA7.T0.seq 722 (07) 44% to mouse 2j6 112 ITDLFMAGTDTMATTVHWSLVFLAQNPEIQNKMAAEIIKVTGNDVINVSMMESMPYTRA 288 289 VMYESTRMRPVFPLSLGHQATSDVTVKDNVIPKGTLVVANLWAIQNDPKWWKNPESFRPE 468 469 RHITENGGFTKNEKIVPFSIGPRFCLGSQMATYHQFIFLANLVRTFKFRFHENQPEPDLT 648 649 GVMTSVLLP 675 > best matches to CYP11A1 G126P605845F.T0/G126P605845FG6.T0.seq 701 (27) 149 ITDLFMAGTDTMATTVHWSLVFLAQNPEIQNKMAAEIIKVTGNDVINVSMMESMPYTRA 325 326 VMYESTRMRPVFPLSLGHQATSDVTVKDNVIPKGTLVVANLWAIQNDPKWWKNPESFRPE 505 506 RHITENGGFTKNEKIVPFSIGPRFCLGSQMATYHQFIFLANLVRTFK 646 >best matches to 27A1 G126P601882R.T0/G126P601882RC8.T0.seq 734 75% to seq 115 49% to 4V2 human NFLPPQDTRRMAFLDVLLRAESEDGRS 696 LSLSDIQEEVDTFMFEGHDTTAAAMTWTIYLIGRHPAIQARIHEELDDVFGGDMGTITN 520 519 SHLQKLSLLERTIKESLRMFPSVPFIGRVTTEECSVGSHSIPAGTQVAIFIDSLHHNPSV 340 339 WPDVDRFDPDRFLPENCVGRHPYSFIPFSAGPRNCIGQKFALMEEKVLLTQILRKY 172 171 SIHSHDEEEDLRKQADLILRSSTPLNISLTPR 76 >SEQUENCE 34, 52, 71, 217 74% TO SEQ 29, 60% to seq 65, 39% TO 2C9 MLDILLSCIRAWFSTITLSIIVFYVTWHYWNKRTPGSPPGPRGFPFIGAITSIRKHPEHVMTKWNKEYGPVCMVRLGFKDVLVIGS YEAAHEAYVKSQDFLDRPSPFGLEILGGGYGLFPIAYGSFHQEQRRFGLNTLREHGMGRRVLESTILQYAEELCDRLETKMSAPVL LDQEVYIAVSSTIAHIVFGHNTMQDSPEFRDMILWMLRPNTATVLAGILVFAPYLKHLPFFRGVHNDSRALRLKLESLNLKEVKKH DKTRDPSDPRDFIDSFLNEMDK (1) HFKGKIDPNWETYSSFSDQQLVSM (0) VRDLFLAGTDTTSATTCWIILFLCKYPDVQRRMQKEIDDVIGENGIPKLALAERLPFTRAVIQ (0) EMGRIRPNVPLAVPHCASRDSTLMGFNIPKDTIIMTNIWGIHHDEKTWKDPYKFNPDRHLDAEGNFVKSNHVMQFNIGLRSCLGQQ LARMELFLITVTLFRKFFIELEPGCDIDMEGESLVSLRPYPFKVLLTKRI* >student assembly of seq 34 ortholog missing C-term (2 different genes) 68% to seq 65 MMEEVSIVQTIRGTFDTILLLAIVFYLTWHIWNRRSPLSPPGPRGIPLLGAITQLGKHPEHA MMKWNQQYGPICMVRLGHKDV LVLGSYEAAHEALVKNTHIADRPTHNIEVFYGGKGILMINSGSFHQEQRRFGLNTLREY GMGRRALEPTILIYVNDLCDRIEKYGSDPFVIDGEVYLTISSTISHIVFGHDVIHDNPKFKEIIFKMIEPN KLNVLAGILAFAPFLKHFPFFSTVHKSSKEFRLKLQSLIGEEVQEHKKTRDPTQSRDFIDSFLQAMEK (1?) VQDGKLDPNWEPYSSFTIDQLTAM (0?) VRDLFMAATDTTSSTICWIILFLCKNTDVQIKMQEEIDEVLGQNGILKYELATKMHYVRAVIQ (0) LFKEIARLRPSVPLSLPHASNRDTSIMGYRIPKDTIVLTNIWGIHHDEKIWKNPYEFNPE RHLDANGNFVKSKKVMQFNIGLRSCLGQQLANMELFLVTVSLFHRFSFSVVNPKIDMGGE SMVSLRPYPYSVVATTRG* >SEQUENCE 65, 78, 38 join with 68 check end MEISVQSISSIFDTILLVLIVSYLTWHFWNQRSAWA PPGPRGLPFIGPITSIRIHPEHAMMKWNQQYGPVCMVRFGFKDILLLGSYEAAHEAL VKNMDLADRPSNGIAVFKGGKGILMTKFGSYHQEQRRFSLNKLREYGMGRRALEPTI LLYSNELCERIEKFGSKPFYIDMEIYKAISSTICHIVFGHNVIEENEDFKEIINTLN KKSKLNVLSGILAFAPFLRFLPVFSTIHAKSVNFQQTLHALVRQEISEHEKTRDPKE PRDYIDSFLNEMDK DFRGQVDPNWKQYSSFTYEQLVAM (0) YEQLVAMCRDLFMAGTDTTSSTTSWIILFLCRYPEVQRKMQEEADQVLGSNGEPKMALAEKMPYTR (2?) FFFQEINRIRPNVPLSVPHYSSKDTMV MGYKIPKDTIVLTNIWGIHHDEKLWKNPYDFNPERHLDSNGKFIK SSKVIQFNIGLRSCLGQQLANMELFLVTVSIFRQFSFSFNPGDKIDM EGESVIALRPYSYRVIAEKR >sequence 29 9 accessions 33% to 1A1 in a cluster with 65 MEEYGIINSIKVAVDTVILVGLAIYLTWHFWNKRTPGGAPGPRGLPFIGAITSIRKHPEHAMMKWNQQYG PVCMVRLGFKDILLLGSYEAAHEALVKSPDFANRPPAHSIEVIAGGKGIVMIPFGPFHQEQRRFGLNALREH GMGRRALEPTVHLCAQELCERIEEYGSKPFNLDNEVY QSVSSIISRLVFGHDVVQENLEFRKMIFQMIEPNKLNILAGIL AFAPYLKHLPFFSEIHRKNKEFRLKMESSIRKEIEEHMETRDRKEPRDYIDSFLNEMEK (?) DEPNTKNEEWKQYSSFTQNQLIAM (0) CRDLFMAGTDTTSSTTSWIILFLCRYPEVQRKMQEEADQVLGSNGEPKMALAEKMPYTRAVIQ (0) EIARMRPTLPLSVPHCTTQDTMVMGYKIPKDTIVLTNIWGIHHDEKLWKNPYDFNPERHLDSNGNFV KSSNIIQFNIGLRSCLGQQLAKMELFLLTTSLCRHFSFSVVGEVDMEGESMVTLRPCSMEVVATKRA* >sequence 2,3,9,45 COMPLETE 81 accessions 94% to sequence 1 40% to 2U1 MVLQLLSDINVSSLVIFFTAFLALYYWYTRPKNFPPGPRGVPFLGVIPFLGNYPERVMRKWSKKYGPVMSVRMG REDWVVLGDYETIQQ (0) SLVKQGQCFSGRPDVPVLNQITNGHGLITVDYNEDWKTQRRFGITTLRG (2) FGVGKRSMEDRIVEEVAYLNDAIRSHNEKPFDIL (0) SILSNAVSNNICSVVMGRRFDYDDKRFMEIMARLSRS (2) FNDPTANFALNVVMFMPILVKIPPFSRINNQLMTDVRVI (1) LQMLREILSEHKSTFNKDDVRDFIDAFIAEQNSESKHSSYT (0) DLQLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVI (1) GQDRVPAMNDKAQMPYTCAFMQEVFRYRTLVPLSVVHMTNQDVVLNGYTIPKGTT (0) ISPNLWAVHNNPDVWDEPSKFKPERHLDDKGNFVQSKHVIPFSIGPRHCLGEQLARMEYFIYLVSMV QKFEFFPDPNEPDLPDVEDGSSGVVFVPLRFKQIAKIV* >Cs2a CIONA SAVIGNYI SEQUENCE 71% TO CIONA INTESTINALIS seq 2 70% TO SEQ 4 MLQRMLNEINVSTSFIFLTVFLGLYYWYRRPKNFPPGPRGIPFLGVLPFLGNYPERKMRK WSNKYGPVMSVRMGRQDWVVLGDHETIQQ (0) GLVKHGNAFSGRPSIATIDQITEGHGVLFIDYNDHWKRQRRFGLSTLRG (2) FGVGKRSMEDRITEEVAYLNDAIRTHDGKAFDIQ (0) SILSNAVSNNICSIVMGRRFDYDDERFKEIMGRLARG (2) FNDPEASFVLQILIFMPALINFPYFSRINGELMEDVRVI (1) PELLREIVEEHKASYEQDNHRDFIDAFLGEQKAENGAKTFT (0) DTQLLQYVRDLFVAGTETTTSTVRWSLLCLIHNPETQEKLRKEIFEVL (1) GPEKIPAFENKSKMPYTSAFIQEIYRYRTLVPLSVTHMTNEDAHISGYTIPKGTT (0) IAPNLWAVHNDPEEWDEPNKFKPERHLDAGGNFVQLKHVIPFSVGPRHCLGEQLARMEVF IFLVSLVQKFEFLPDPKATVLPDIENGASGAAYVPLPFKIVAKVV* >Cs2b Second savignyi whole gene ortholog to seq 2/4 MLQRVFNEINVSTGFVVLTVFLALYYWYRRPKNYPPGPRGIPFLGVLPFLGNYAERTMHK WSKKYGPVMSVRMGRQDWVFLGDHETIRQ (0) ALVNQGNSFSGRPLIATLDQITEGHGIVLLDYGDWWKRQRSFGFSTLRG (2) FGVGKRSMEDRITEEVAYLNDAIRSHDGKAFDIK (0) SILSNAVSNNISSIVMGRRFEYDDEHSKEIMARLAMG (2) FNDPDTSFFLQILIFMPACVHLPYFSRVNKKLMEDVHVTR (1) SELLREIIAEHKASYDQDNHRDFIDAFLGEHNAENGTDAFT (0) DKQLLHYVRDLFFAGSETSTSTLRWTLLCLIHHPEKQEKLRKEIFEVL (1) GQEKIPAVDNKAYMPYTCAFMQEVYRYRTLAPFGVAHMTNEDVNLNGYSIPNGTT (0) IYVNLWAVHNNPDVWDEPNKFKPERHLDDKGNFVQSNHVIPFSVGPRHCLGEQLARMEIFIYL VSLVQKFEFLPDPDATELPDIENGSYGPICAPKPFKMVAREV* >related savignyi 2/4 like exon sequences (probably two more closely related genes) TLVKQGSIFSGRPSIPILEEMTKGHGILLLDYGEKWKSQRKFGLMTLRG (2) GLVKQGTAFSGRPSIDIIERVSKGNGMIFLNYDEHWKKQRRFGLSTLRG (2) FGVGKRSMEDRITEEVAYLNDAIRTHDGKAFNIQ (0) = RB10 SILGNAISNNICSIVMGRRFDYDDERFKEIITRLARR (2) FNDPEVSLVRQILIFMPALVNAPYFSRINAELMENVRVI (1) FIDPEVSSALQIFVFMPALVNFPYFSRIYGELMKDVRVI (1) PESLREMVEEHKASYEQDNHRDFIDAFLGEQKAENGAKTFT (0) SELLREIVADHKALYDQDNHRDFIDAFLGEQKSENGSETFT (0) DMQLLHYVRDLFVAGTETTTSTLRWSLLCLIHDPEIQDKLRKEIFEVL (1) GQEKIPAFEDKSKMPYTRAFIQEIYRHRTLLPLSVTHMTNEDANICGYTIQKGTT (0) ISSNLWAVHNDPDVWNEPSKFKPERHLDDKGNFVQSSHVIPFSVGPRHCLGEQLARME VFIYLVSLVQKFEFLPDPDATELPDIKIGSNGPAYVPLPFNMVARVV* IAPNLWAVHNDPVVWDEPNKFKPERHLDADGNFVQLKHVIPFSVGPRHCLGEQLARMEVF IFLVSLVQKFEFLPDPKAKVLSDNENGASGAAYVPLPFKIVAKVV >sequence 4, 8, 92 (48 accessions) 83% to sequence 2 faces away from seq 220 on MLQHLLGDINVSSLVIFFTAFLALYYWYTRPKNFPPGPRGVPFLGVIPFLGNY PERVMRKWSKKYGPVMSVRMGREDWVILNDYENIQK (0) ALVKQGHSFSGRPDIPVFTQINNGMGLLTVDYNDHWKMQRRFGITTLRG (2) FGVGKRSMEDRIVEEVAYLNDAIRSHNDKPFDIL (0) GVLSNAVSNNICSIVMGRRFDYDDERYKEIMFRLSRS (2) TKDPEANFYIsVLMFVPFLVNFPPFSRINKEMMDDVKVIL (1) DHLDEIVSEHnSTFNKDDARDFIDAFIAEKNSQNKHSSFT (0) DpQLLHYVVDLFEAGTETTTSTLMWSILCMIHNPEKQEKLRKEICSVV (1) GQDRVPAMNDKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTT (0) VSPNLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVVAFSVGPRHCLG EQLARMEYFIYLVSMVQKFEFLPDPNEPNLPDIEKGSNGAAFVPLPFKQIARVV* >SEQUENCE 27, 220 SIMILAR TO SEQ 2 and seq 8 BUT NOT SAME SEQ Faces away from seq MLTNLIANPSISPVLAILSCVIAFYYWYKRPKNMPPGPRGIPFLGIIPFVGMN PEQAFMQWSKKYGPVITVRMGRKDWVVLCDHDTIHQ (0) VLVKQSTVCSGRPKIPIVSELSKGHGILFADYCEKWKSQRKFGMKTLRE (2) FGVGKKCTEDRVLEEVDFLCNEIRSKNGKPFDIQ (0) DIMCNAVSNIIMNIVIGRRCNYDENFFTDVISRFTKW (2) FNDPTAGAMFTGMMFLPQLKYVPPFSKYYKIFRDDIGALH (1) EFFEEVIKEHEKNFDGNNLRDFIDAFLLEMKKNESGSEFT (0) YIQLLNYIRDLFDGGTETTVSTSRWAILCMLHYPETQKKLRNEIMTII (1) GPNTPASMSHKSDMPYTCAFIQEVLRFRTLVPLSVPHKVNEDATVNGYTIPKGVT (0) VSPNLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSNHVIPFSVGPRHCLGEQLARMEIFIFL ISMVQKFEFLPDPNEPDLPEINHGTNGAAFVPLPYKIVANQI* >Sequence 209, 199 = A different C-term from seq 27 and seq 10 MIEAFLRQWTDTTTVIIFLFTLLFYYWYRRPLRFPPGPRGLPLV GVLPFLRKYTARTMHKWSNKYGPVMSVRMGNEDWVVMGNYEAVQH (0?) SFVKAGNVFSGRPVLKVANEIAEGKGIVMRDFNTTWKTHRKFGSITLRG (2?) FGVGKKSLEERIYEEVEVMNKEILSKEGKPFDIT (0?) EILTNAVMNVISIITTDQRFEYDDEHFRLLQQIFTKW (2?) FLEPENTAAFAQIIYVPLLCNIPPYRSKYLEVKRDMKRVSG (?) VFQQMVEQHRKTFDKNNLRDFIDAFICEGKKGTDESFT (0) DGQLVQYVREMFEAGTETMTGTVRWAMLCLIHYPDAQKKLREEIFEAI GNNNRPSISDRKAMPFTSAFIQEVFRFRTRVPLGVQHMTTETVNFANYVIPKGTT (0) ILANMWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSNHVIPFSVGPRHCLGEQLARME IFIFLVSMVQKFEFLPDPNEPDLPEINDGVKGNGFFPYPFNFVASEI* >sequence 42, 59, 218 36% TO 2U1 Fugu FATSVLLLLVIMDKTD KKMPGSPPGPRGLPILGMLPFMMSYPERLMADWSKKHGPVIM (0) VKMGPKNVVILGSSDAAHAAFVKNTHLANRPQDGLEIFAGGKGILFINS SMFHSEQRRFCLSALREFGMGRRTLEPKIVDCAALLCDQIDDVCGHDVST (1) GPIQIEQMIYVTMSNVISHLVFGHDSLNDNKGLCELLLRTIEPNKYNALAGILMFLP SLKNVPPFSTTTRRAIEFREDLHRYIKMEIDRHRASRDPNQPRDFIDKFLNEIEKIKVRHNSNNNNNNNLILNNNVK (1) GSKSRRKIAGASFSEDQLIPLVRDLFMAGTDTSSTTITWVVIFLASFPDVQTRLHEELDSVLGQNRTPSVSMEDKMPYMRAVIQ (0) ETFRMRPPLPLSVPHVAQCNTTLMGYRVPKDSIILTNTWGIQNNPKFWVDPDTFKPQRHIDDLGKFIKSPNVI PFNIGSRNCLGQQLAKMELFITIASLCRRFWFTLPTGETPDLKGESTFILRPFPFNVVATKRV* >sequence 6, 132, 225 ci0100133889 MEMETVLGLITIGNVTTILLAVVTVYITRCISKMI (?) FFPPGFPPGERGLPFLG (?) VLPYLGEHPHNVLKEWSYKHGDIMAIKLGTTNAVIVSSQSTLNEV (?) FVKRASAISDRFTNEFFTQILLHRGIFNSAYTERLKIQRKFVHQALHR (?) NLINNKTPEERINEEIQYFIREIHSQDGKPFHAH (?) HMLLSSVTNVICSIVLGKRFEYNDKWFTKIMSCIVE (?) SLAHSGPSFLLPISIYPWLIHFPPFRRINVRFRKSFATFA (?) ELIQQDIDEHKSSFDENNLRDLIDAFLFEMKKEGHHSSFD (?) ELQLCVIVRELLLAGNETTGSQIAWLLLALMHFPKWQDRIHREI (?) KVLTGENGVLKLDDRKKLPCTCAFIQETLRLHSIGSLNAPHKVSRDTEVKGYKIPAGTMV (?) MSNVFAIHTDPKVWTDPETFDPSRHLNEVGSFINSSK MSPFSLGPRQCIGKTIAENKMFLYFGNICQKFRIVSETPTPITLHEGQSGFVFSPKEHKVIAIAYSSK* >ci0100139346 seq 7a = seq 7, 100, 101 46% to 2J2 MLDYLTFTNLALFAIALLLFYIWLTPKYKNLPPGPMGVPFLGCLP FMETLAERTFTKWSKKYGPIITVQLGDHRTVILNSYEAIEE (0) AFIKKGEKFNGRFKTYFTEFASEGLGMVFIDGEKWKEHRKFGTRALAG (2) AGMYGKTIEQRVLEEAENICHVIRSKNEEPFILE (0) DDLILSVANVINGITIRERCDEEGNENLLKYAAIVVEG (2) FRKDGLIYSLLSVIPWCRFFPQTQKHIKSLKDGLKAGN (1) ALFDNLIEQHQQQHDRLHPRDLIDLYLNQMKDFS (0) KPQLRYFLKDLFAAGTETSSSTIRWALFYLIVNPDIQDKVHKEIDDVI (1) GHNGVVRYDTKLPYTKAVLQETYRIRTATPLGVPRRNTEDVTLMGYFIPKNTQ (0) IIPNLWWVHNDPEYWNEPDVFKPERHLDEEGNLIMSNRVIPFSIGARHCLGENLAR TEIFLFLVSILQKFTVLPNPEDPNPPLECSPGIVNSPHKYPLIMKER* >ci0100139379 seq 7b 97% to seq 7a above only 12 aa diffs MLDYLTFTNLALFAIALLLFYIWLTPKYKNLPPGPMGVPFLGCLPFMETLAERTFTKWSKKYGPIITVQL GDHRTVILNSYEAIEEAFTKKGEKFNGRFKTYFTEFASEGLGIFFIDGEKWKEHRKFGTRALAGAGMYGK TIEQRVLEEAENICHVIRSKNEEPFILEDYLILSVANVINGITIKERCDEEGNENLLKYVASVVEGFRKD GLIYSLLSVIPWCRFFPQTQKHMKSMKDGLRMGN ALFDNLIEQHQQQHDRLHPRDLIDLYLNEMKDFS KPQLRYFLKDLFAAGTETSSSTIRWALFYLIVNPDIQDKVHKEIDDVIGHNGVVRYDTKLPYTKAVLQETYR IRTATPLGVPRRNTEDVTLMGYFIPKNTQIIPNLWWVHNDPEYWNEPDVFKPERHLDEEGNLIMSNRVIP FSIGARHCLGENLARTEIFLFLVSILQKFTVLPNPEDPNPPLECSPGIVNSPHKYPLIMKER* >genewise.1257.28.1 seq 1, 50 and seq 10, 11, 22 similar to seq 27 and 209 cibd036p10 MIELVLPDLSIAPTILTLICL LLFYTWYRRPSRFPPGPRGFPIVGVLPFLEKYSERTMHKWSKKYGPVMSVRMGNDDWVVMGNYEAVYQ (?) SFVKAGSAFSGRPYMKLANEITECKGLVMRDFNPTWKTQRKFGHLTFRG (?) FGVGKKSLENRIYEEVEVMNKEILSKGGKPFDIS (0) EILTNAVMNVICNITADQRFEYDDKNFRLLQHILTKWY (2) FLEPENTAAFANMIFVPLLCNIPPYRSKYLEVKRDMMQMA (1) DVFQELVDQHRISFDKNNIRDFTDAFICEVDNGKDDSFTDRQLVQYLREIFEAGTQTTT GTLNWAILCLIHYPQAQKKLRNEILDVI (?) GNNNRPSISDRKSMPFTSAFIQEVFRFRTRVRTGVPHKTTETVNFANYVIPKGTTI (?) VANLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVIPFSVGPRHCLGEQLARMKIFIFLVSMVQKFE FLPDPNEPDLPEIDDGVIGNGFFPYPFNFVANEI* >1257c equivalent to seq 1 1257c and 1257d are probably parts of a single gene 7055 MIELVLTDLSIAPTILTLICLLLFYTWYRRPSRFPPGPRGFPIVGVLPFL 7104 7105 EKYSERTMHKWSKKYGPVMSVRMGNDDWVVMGNYEAVYQ 7154 7641 SFVKAGSAFSGRPYMKLANEITECKGLVMRDFNPTWKTQRKFGHLTFRG 7722 7982 FGVGKKSLENRIYEEVEVMNKEILSKGGKPFDIS 7995 8226 EILTNAVMNVICNITA 8241 seq gap missing part of exon 4 and all of exon 5 8894 ADVFQELVDQHRISFDKNNIRDFTDAFICEVDNGKDDSFT 8933 9153 DRQLVQYLREIFEAGTQTTTGTLNWAILCLIHYPQAQKKLRNEILDVI 9196 9656 GNNNRPSISDRKSMPFTSAFIQEVFRFRTRVRTGVPHKTTETVNFANYVIPKGTTV 9749 seq gap missing exon 9 >1257d 2 aa diffs with seq 1 1257c and 1257d are probably parts of a single gene seq gap missing exon 1 and part of exon 2 9992 ECKGLVMRDFNPTWKTQRKFGHLTFRG 10016 10267 FGVGKKSLENRIYEEVEVMNKEILSKGGKPFDIS 10280 10509 EILTNAVMNVICNITADQRFEYDDKNFRLLQHILTKWY 10546 10723 FLEPENTAAFANMIFVPLLCNIPPYRSKYLEVKRDMMQM 10759 10937 ADVFQELVDQHRISFDKNNIRDFIDAFICEVDNGKDDSFT 10976 11196 DRQLVQYLREIFEAGTQTTTGTLNWAILCLIHYPQAQKKLRDEILDVIG 11242 seq gap missing exon 8 11913 IVANLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVIPFSVGPRHCL 11962 11963 GEQLARMKIFIFLVSMVQKFEFLPDPNEPDLPEIDDGVIGNGFFPYPFNF 12012 12013 VANEI 12062 >sequence 55, 80, 13 ci0100138298 SCAF 106 MIEKFLPEWSNTGAVLMLVCTLIFYYWHQRPIHFPPGPRGFPLIGVLPYLGKYP ERTMQIWSRKYGPVMSVRVGSEDWVVMGSQESIYQ (0) SFVKGGNVFSGRPFVKVANEVTESKGLAMIDFNPTWKTQRKFGNVTLRG (2) FGVGKKSLEERIIEEVEAVKMNILSKEGKPFDIT (0) DILSNAIMNVICNITAGQRFEYDDQFFIQIRRHLW (2) FMDPDRTAAFSAMVFVPLLCHIPPYRKKYEQIKQDTKEMM (1) GLFQQLVDQQRKTFDKNNLRDFIDAFILENENGTDESFT (0) DRQLVHYVRELFKAGTETSTGTLRWAMLCLIHYPGAQEKIRKEIFDVL (1) GNSTFPSMSDRNAMPYTSAFIQEVFRFRTLAPLGVPHKTTDTVNFANYIIPKGTT (1) ILSNLWAVHNDPTVWNNPRQFKPERHIDDKGKYIQSNHVIPFSVGPRHCLGEQ LARMEIFIFLVSMVQKFEFLPDPNKTDLPELDDGVNGVAFVPYPFKLVAKEI* >sequence 14, 37, 69 25 accessions 50% to 2C8 missing seq in mid MLPSGYIFVFVFLCIYYLLQWRRRPKNFPPGPLGIPLFGIA PFAGVDMHKYLATYYAKYGGVMSFRLATKDWIVLNDIEAITQ (0) ALLKQGESFSGRPQSYLMNQLTEGCGIVFSTGPRWQAQRRFVLTALKT (2) LGMGTRSMVGIINEENQNFSSVIQSSGGQVNIL (0) (1) DIISEITEEHKNSFDENNLRDFIDIFLLEMKRRNSEYFT (0) ELQLLHLVRDLFVGAIDTTTATLGWGIICLLHYPECQVRIQEEIDDVI (1) GCAEPDMSHHESMPYLRAFIQEVHRFQTIAPLNIPHCVTEDCVLFGYHIPKS TPVMSNIWRVHNDPKYWENPEKFSPERHLDSEGRFVPSNRVLSFAVGHRSCLG VQLARVELFLFYASTLKKYEFQIDPEYGLPDWSNDRSGTVKTPKKFSVLLKSR* >sequence 16 and ci0100140915 they overlap at exon 6 MFCSIFEWLTSAGIITSLIGTFTFVVLVYYYWWKPPHPRYPPGVRGVPILGAIPLLSPFPHRVLLNWSKKYGAIFSARFGRSDA VVLNDYTSVYQ (0) AFVENYVSFQSRPDVQVVNEYTSGFGFAFSKGQYCLDVKKFSAQAIRS (2) IGIVQIEKWVQEVAEGTVKQLGEFEGNAIDVK (0) TCIGRKVADVMTMLLFGRFFGTKDQHYEQFLKDCFDN (2) VTDPAIYSQVNALMFVPLLRHIPWFGNALSVHQNIHFRLK (1) DFCRKEIEEHKKTLNVNEPRDYVDAFLIEMRNHSPRDSWFH (0) EENLVNCLVDIYTAGTDAPTNVILWAMV TLLHKPNLQHLLFHEINNKTSKFKNRYYSNEDLSNRWP LLKAFIQENFRFNPPAPLSIPHQTNQGVKLDKYFIPKDTM IISNIYAVNHDPKIWKNPYEFNIYRHIDTDGKFVPSTKVIAFGIGCRSCLGEKLGRLEIFYF LANMIKRFEILPDPGSKDLPPINEGGNSILHLPIHFKVVFKPRENEDYFSL* >ci0100141603 seq 16 like, scaf 2157 probably same as seq 16 on scaf 1266 EENLVNCLVDIYTAGTDAPTNVILWAIVTLLHKPNLQHLLFHEINNKTN LSNRWPLLKAIIQENLRFNPPAPLGIPHQTNQGVKLDKYFIPKDTM >sequence 18, 216 revised C-term MSMLVNLITNFDMSIVLTASCAFVLVFYYWYKRPRNFPPGPRGIPLLGVV PFLGKHTEKDFRKWSKKYGPMMSVRLGQNDWIVLGDHKTINE (0) CLVKGGNSFSGRPDHPVFKQFTKNHGVAMVDYGDHWKVQRRFGLTTLRG (2) FGVGKRSMEDRIVEEIEYLNNAIRSHNGKPFNIS (0) KLFANAVSNNISSIVMGERMDYSDKIFETLTQRSTES (2) IYNSDMAGKIIRFSIFVPSLWNLLLLFSTQASKQTSVFNSIRG (?) QAIVNWVSQHKSTYDEHNVRDFIDAFIGEQKKETDVSFT (0) DLQLIQYVRDLYVAGTETTTVTLRWAVRCLIHYPEKQNKLRKEIFDVI (1) GREKTPSMKDKAQMPYTCAFMQELFRFRTLVPLGVPHKISNDVQVEGWSIPKGTW (0) IIPNLWAVHNDPDVWDEPSKFKPERHLDEKGNFVQSKHVIPFSVGPRHCLGEQLARME IFIFLVSMVQKFEFLPDPNEPRLSEVPQGVAGMMCVPHPFKQIVKEV* >sequence 20, 87, 90, 107 ci0100147280 complete MSGVSVCLSVVFFTGFLALCCWYKRPKNFPPGPRGVPFLGVIPFLGNYPARVMQKWSRKYGPVMSVRMGS RDLVVLNNHENIQK (0) QALVMQGQIFSGRPNMPALTQVTDGLGLATIDYTDHWKTQRRFGQTTLRG (?) FGVGKRSMEDRIVEEVEYLNNAIRSHNGKPFDIL (?) GILGNAVSNNTCSVVMGRRFDYDNERFKKIIATLSGL (2?) FTNPKSNSTIILVLFMPFLMKIPPFSWINNKRVDGINEIKR (?) LLREIVHEHELRFDKDDVRDFMDAFIAEQKKNDKHSSFT (?) DLQLLQYVRDLFVAGTETTTSTLRWSILCLIHNPEKQEKLRKEIYKVT (1) GQDRVPAMSDKAQMPYTCAFMQEVFRYRTLVPLSLFHATNDDVSLNGYRIPKGTT (0) VLPNLWAVHNDPDVWDEPSN (2) KFKPERHLDEKGNFVQSKHVIPFSVGPRQCLGEQLARMEIFIFLVS MVQKFEFLPDPNEPNLPEIENGSCGTVFVPFRFKQIAKMK* >SEQUENCE 21, 85, 193 MVFDVTMLNISPFIVFIITFVFVWNWYRRPPRYPPGPLGFPVLGVLPFLSNRP ENKFLDWSKRYGPIISVSIGWQKWLVLNEYDVTME (0?) ALVKNRLAFHGRPFVEFFIRTTVPPGIAFAPGGVHWKTLRKFGTNGIKA (2) VGLENMEKKTNIEIQYFIEEIRGHKGKSFNIQ (0) EPLTHAVANVVSSFCLGQRFDRSNDSFNQLIRSLMHQ (2) GHVAANWRTMLLTFLPFLCDVWPFTKHMDESITNVRTLN (1?) GIIRHFVEEQSASFDGDEVRNMVDAFLMHSRNSVDENVITDD (0) KIIGFFSDMFLAGTKTITSQLRWGFLIMVKHQDCQNKVRREIDDVIG (1?) TVKLKHRKLKHRSIMPYTCAVVHELFRF RTVVPLSLPRMNVADVTVGGYRIPKGTN (0?) VIMNIWALHNDDCRWREPERFIPERHLNADGKFMKPKNVLPFGVGARSCIAEHLARNKIFLFLVSTLKNF TLSVPNGDSPSLDAGSNGTIFIPSDYNIVAQYRD* >SEQUENCE 106, 23, 221 very similar to seq 18 MSMLVNLVTNFDMSSIVLTASCAFVFILYYWYKRPRNFPPGPRGIPLLGMVPFLGKHTEK DFRKWSKKYGPMMSVRLGQNDWIVLGDHKTINE (0?) QCLVKGGNSFSGRPDHPVFKQVTKNHGVAMVDYGDHWKVQRRFGLTTLRG (?) FGVGKRGMEDRIVEEIAYLNDAIRSNNGKPFDIA (?) ELMANGVSNNISSIVMGQRMDYNDEIFKTLSKKAPNR VYNSDTIGKITKLVLYMPFLRNLLFLFLSKPKEMNKINTLIRG QAIVNWVSQHKSTYDEHNVRDFIDAFIGEQKKETDVSFT DLQLIQYVRDLYVAGTETTT GTLRWAVLCLVHHPEKQNKLRKEIFDVIGREKTPSMKDKAQMPYTCAFMQELFRF RTLAPLGVPHKISNDVQVQGWSIPKGTWLVCNLWAVHNDPDVWDEPS KFKPERHLDEKGNFVQSKYVMPFSVGPRHCLGEQLARMEIFIFLVSMVQKF EFLPDPNEPQLSEVQQGVSGFMCVPHPFKQIAKEV >SEQUENCE 26, 81 no introns 37% to 2U1 human 41% to seq 2 MEITTFLLCMLMLVLIMFMWRRSSTNFPPGIEGIPLFGCIPSLGMHKERTIAKWGREKG WPIFYLRMGITKIEVLNTFESIEEAYVKKGQYFIDRHLPIFLRHYSTGKGVVFLNGEK WKVQRRFGIHTLRDLGMGKSGMESKIVHEIDHLSEYLVNNNQESGLVDITPIITRAVSNIVC QLVFNRRFNLDEEFLFIDSVNNKLMKSVTTLKQIMLTLQHIPIFNRIPAIAKIK KYRTDLANNRLQR MRNYIKSHKETLDKNNPRDFIDAFLIASEEDKSEDTSFDDEQLLYYIQDLFFGGTDTTSF TLTWTFLYLAMYPDKQEQLHAEIDEVLGSNGIVSMSCLDKMPYAQAVMQEIFRLRPLVPL SMPRKTLSRITLMGYDIPKGTVFMPNIWSVHHNADRWPNPEDFMPERHIDSEGKFVKSKE IIPFSLGPRNCLGIRLAEMEHFMFMIGLLQKLHFSLPEDQPPPDVRGSSFVVLLLPPDFNMKITTR* >Ortholog to seq 26 in Ciona savignyi no introns MDFTSVLLFLLVVVLVFLWNRRGSGNFPPGFDGVPLLGCIPLFGAHRERSIAEWGREKG SWPIFYIKLGITKVVVLNTFESIEEAFVKKGQYFVERHIPIFLKRYSKGGKGIVFSNGEI WKHQRRFGIHTLRDLGMGKSGMENKIQSEIEHLSDYLVNSPNIVGKKSVDIVPIITKAVSNIIC QLVFNTRFNLDDKITFIDSAAGIKLVKNASWVRQVKMTLCYVPLLNRIPTIAKLK QEREALHNTRHDL LRKYIQMHKETLDKGSPRDFIDAFLIAGEQDKEENSSFNDEQLLFYIQDLFFGGTDTTSF TLTWTFLYLSLFPEKQKRLHDEIDEVIGSSGNISMSFLEKMPYAQAVMQEIFRVRPLVPL GMPRKCGQNISLMGHNLPKDTSVIPNIWSAHHNPDRWPEPDSFLPERHIDSNGNFIKSKD IIPFVLGPRNCLGIRLAEMEHFMFMVGLLQKLQFALPAGQGKVDTRGSSSIVLLLPPDVDLVITSR* >sequence 31, 98, 200 1 accession 49% to 2U1 ci0100148485 missing exon 5 MLLNNGTMLEVPGSFVSYLTVCKFLTCTAAYLLYRHWYQMPRSFPPGPRGIP ILGVLPLLGRHPERKLNEWKREYGPVMGARVGRKNWVVLGDCASIKMVIYLR (2) DIFSGRPSPHKKEDGIVFLDNGRSWTMHKTLGLLAFRG (2) FGITENKSLEKHVIEEVSYLKEAVRSKRGVAFSID (0) TILTNAVSNVLCSILLGQRFQYSDQRFKRIIHQLSRR (2) XXTLLGFINIHEKTYDSTNIRDFIDAFLLLKHRQLDESFN (0) DKQLIHYIYDLFLGGTETTTGTLRWAILCLLHYPEKQAKLRKEIIQVLG (2) DQEVTILKKHEMPYASAFIHEVYRFRTLFPLGLPHKTSHQVILDSYVIPQGTT (0) VCTNLWAVHNDPKVFDEPEEFKPERHLYGNGEFGRSPYVIPFSV GPRHCLGEQLASMMLFIYLVSLVRSFEFLP DPKLNGLPDIRSGASGPVFLPKSFNVVAREL* >sequence 33, 116 ci0100137396 75% to seq 64 complete MLPAFLGSLTSNYVTFCLTGFFTLVLFVYYYWWKLPHPRYPPGVRGIPVLGALPFLGKSAHKVIMRWSRE KYGPIMSVRFGQKDSVVLNDYESIYEALVKQGPAFQTRPRIKVIEAHSNGYGFGFAAGHKKFMQVRHFTG KALRGLGIGGLALEERVSEVAQELVLSLQKLDGKPTNLRMLVGTTVGNVIASIVIGKKFHPNDKEFQRYI QLIFESFGDKETKKYFLVLIYFSFLRHIPPFKNACKQFIADHREQYDFCRKEIEEHKNNLDVNKPKDYID AFLIEMKKYSPENSWFH EESLRVCVADLFYAGTETSTSTILWGMIALINYPEIQEKLHKEIVNATGEELP RLDHKDELPLLQAFIQELYRCMALIPLGIQHQTTKNVDICGYCIPKDTVVFTNIHAVHHDPNIWKNPSEF NIYRHIDEEGKFIPSRKVIPFGIGCRSCLGEKLARIEIFLFLANIIKRFEVRPDPDSEPLLPIDDGITGF GFLPFLFKAVVLPRTNKGTQ* >sequence 36 7 accessions + 4 ESTs LOW 30% RANGE TO 2 FAMILY MEMBERS, NO INTRONS MASVLTVVGDYLNIWSIVLGGSA LIFYMWSRKGNLPPGPRGYPILGAMPYLVVHPEKVITKWSREIYGPILSVPLL NRTIIYLNTYEAITEAFSKQGLKLAGRPYMFLFQQISSDLGITMKTYSEHFKVQRQFGMR SLKPSRIESLVIQESRYFIDQLHDQVGKVYDMKTDVYNFT ANIICSVVLGKRFDYDDPAFKLILESSEKALGEPEDANLMMAMILIPQLRFLP PFHGANQRFLKNQHRILSMLLGIIEEHQKNFDPNNVEDFIDAYIYEQKFGIGK NTKLFENDQLKVYVRDLFIAGVETTSNTIQWSILALLYNPK YMDLMYNEVYEALGSDGVPCMKDREKMPVTCAFIQEIMRFRTITPGAAPHSAMEDIKLNGYDIPSGT MVVANLWALHNDPDTWPEPEQFNPHRHIDSD GKFIHSPKILPFSIGPRYCMGEGIAKAEIFIFLTSLLQKFTV SRDPNAVYPEEIEQGIVSAMSCPHRYNIILTAR* >sequence 39, 60, 74 ci0100154493 complete MFCPFEFHEVLVGIITFILVWSCIRRPTRYPPGPRGVPILGVLPFLSKYPEREFLVWSKRYGSLLSVSIA WQRWIVLNDYDITMEVLVKNRMIFNMRPAIRFYQEALDPLGIVFSPGGAIWKAHRTFDFYILLFHSIGQE RMEKLIMTEIDYFIAELEKHNKKSFDIQDTLTHSIANVMCRFTLGERFDYKDVRFKKLVGCIIHVGKVTK DWKTMFLTLLSPLAHIWPFTRHVNEFKNAMKTFNDQIRIMVEEQQSSFNGIEVRNMIDAYLLKSQSFVSC NEKYTVE DIVGFFSDLFIAGTETLSSQLRWGFLIMMLNPECQENVRNEIHAVIGGNGRARLSDRYRMPYT CAVIQEIFRFRTLGALSLPRTNVVDVEIEGCKIPKGSKVILNTWAIHNDERNWENPQKFIPERHLDKDNF INSKKVLPFSVGARSCLGEQLARAEIFFFFVATLQQFELKFDPNAQVPSLDDASNGVIFSPNPYTMIVER INEDRSNNNNNSKVI* >sequence 40, 19, 88 missing exon 1 N-term not found in 11000bp ci0100138492 LPYPRGLPIIGNIHQMGNFPHVKLTEWSKQFGDFYRIKMGRYDALVVNGHENIR (2) NCLAKKSAAFAGRPPFETSKLIEEGLSISFSNYS (2) PEWERQKQCTIKALKLYTSGSDKRSTMEETVSSHAKQLAEDLINSADQQ (0) GLVGDLHDTVIYSTTSVSSTICFGRSFTRQDPELKEFLRNFQSFDKAMGASQIINFWPFLKYFPVLGKSFR (0) NLKTYMDQYWNFTLSMLEQHWDTYVPNNMRDLADCLWAQSNQ (0) NRQLTDQQRRIAYGASDAFGAGFDTISAMITWSIFYMAVFPEHQRK (0) IREEIDRLETSMFSLRHHGDVCPYTQAWLYEVLRH ISVSPLLVPHYTVKQVEVNGTMIPAGVVVLFNVAN (0) ADRDTRVWENPEQFEPERFLARDPTTGGARVVASETSKI LNWGAGKRRCPGAELSRHELFIYIANLVKLCYIE QAVEGIEPAIPWPCTPGISTKPKAFRVKVTQR* >savignyi ortholog of seq 40 56% identical N-term not found LPSPRGLPIIGNVHQLTTSPHVKLSEWAKEFGDLFRIKMGCFDTLVVTGYDNIR (2) TALVKHSVAFAGRPPYETSKLFSNGLSLAFNNY (2) SPAWEKQKRCTVKALKLYTAGPDLQKRNAMEDTASYQANLLVDQLLASVNK (0) DAITNPDEIVHHSATNVISNICFGRSFSKNDPELQKFVSINRAFDRAMGSAQIVNFWPFLKSVPVLGRSYQ NLKAHMDVFWDFVFPNLKEHWKTYNPSNIRDIADCLWYQSH TSSKRDLQRRIASAASDIFGAGYDTTHKVVLWSLFYMAAFPQYQQKV RDIFRVSEVKMY TLRHHGDECPYVQAWIYEVLRHTSLAPILLPHYTTKEVTLNGVRIPAGVV KKYHTIQAHKDPKIWKNPDEFDPGHFLEEDGSKLRSEAVHKLLSWGAGKRRCPGAELSRHE IFVFVTTLVRRAYIGQAVDGVEPAFPWNTTGGISISPDPFRVKITER >sequence 41, 77 MLRIFAFASQISMETLLTSAAILTCVLFVVYDTWWRFLGTSGTLRGRN GFPVFGCFPLLGKNKERVFMNWSRTSYGPIYYARLGASRVLVLNGYEAVREALVLRPQ 297 AFAGRCKTELMKLLYEGQHGIMQLDHGPMWKAQRKFGQNALGSLGMGKRSLQ (0) QQIVNEFNCLSSTLEXELLANGSKGAYLDEMLWLYAVTNISLLIFNESFDPDNEK (2) RERFLYLLMYVPFLKNLPLVRQQLNVTIQGRKTKLAKFKQFVENHRQSRDPDDPRDFIDCYLNTMDSKSANDS (2) FTEKQLLYFMNDLFIAGTETSSNTNSWALLLLLKNPEVMKRLRDEIDSLGREPTL (0) EDEYKMPYTRAVMQEIFRYRPALPCNI IPRKTTTSIELNGHSIPAGMPVIANLWSVHHDPVTWAPDPE TFRPERHLNRDGEFMTSDHVMPFSVGLRSCIGRNLARNELFIFLTSILRRF NLEFADGCYVDDMTGESGLLLHPPSYKVKGSLRA* >Cs41a Savignyi ortholog A of seq 41 74% to savignyi ortholog A of seq 41 MLYLSLFSSNFSIEVVLTCFCLFLLLVYNVRVRHGGTSGTLPGKDGFPIVGCFPMLGKHKE RVLMKWSTSSLGPMFYIRLGTSKVLVLNGYNAIKEALEQKPLPIAGRVESKFVKMLYDGD HGLIQLDYGPMWKEQRKFGQTTLGGLGMGKRSFQ (0) SKIRDEFNCLAATLESELSINGPKGVNFTSMLRLYAANITSSLVFNQTFDSTW (2) RERFLIQMMYMPWLKYLPIVWKQLALSKRCRRDKVKIFQALVDKHRKSRDPNDPRDFVDCYLNVMEKKQGKNS (2) SFTEKQLLYYIVDLFLAGTETSANTNAWAILLVLKNPQVKAKLRSELEAFGKSPTL (0) EDEAKMPYTRAVMQEVFRYRTSARLNIVARRTT SDLELNGYKVPANMPVIANLWSVHHDPLTWAPDPETFRPERHLDGDGNFIPNDHVMPFSV GARSCIGKNLAKNEVFIFITSIFSQFDLELADGCHVDNMDGDSGMLLHPPEYRVKGSIRV* >Cs41b Savignyi ortholog B of seq 41 74% to savignyi ortholog A of seq 41 1 MLFSSYLTFNFFIESLLAGLCLFLLLVYDVFIRLR 35 36 GTSGTLPGKDGFPLVGCFPMLGKHKERMLMKWSSSGLGPVFYARLGASRVLVLNGYDVIK 95 96 EALLQRPLAMAGRVESDLIQELYDGKHGLIQLDHGPLWKEQRKFGQTTLGGLGMGKRSFE 155 (0) 156 SKIVDEFNCFATTLENELNMNEPKGVNLTRMLRLYAANVISSLIFNQTFEISC 208 (2) 209 RERFLILLMYVPGLKNLPIVRQQLAITTRTRLDKVKIFRTLVEEHRKSRDPNEPRDFIDCYLNVLAKNKGENS (2) SFTEKQLLYYIADLFLAGTETSATTNAWALLLVLKNPEVKRKLRAELDALGKQPTL (0) EDECRMPYTRAVMQEVFRFRPAVPFNI IARKTTSELELKGYKIPAGMPVIANLWSVHH DPVTWAPDPEIFRPGRHLDDDGNFVPSNHVIPFSVGARSCIGRNLAKNELFVFITSVLRR FDLELADGCHVGDISGDSGMLLHPPKYKVKGFG* >Savignyi ortholog C of seq 41 RERFLILLMYVPGVKKLPFVRAQLAITTRTRLDKVKFSRPFGEKHRKSRGPNEPRDFHDCYLNVLAKIKGK (10) 209 RERFLILLMYMPGLKNLPIVRQQLAVTTRTRRNKIKIFQTLVEEHRKSRDPNEPRDFIDCYLNVLEGKKEENR 281 SFSEKQLLYYIADLYMAGTETSANTNSWALLLVLKNPEVKKKLKAELDALGKQPAL 338 EDECRMPYTRAVMQEVFRFRSALPFNIIARKTTSELKLNGYKIPAGMPVIANLWSVHH 395 396 DSVTWAPDPEIFRPERHLDDDGNFVPSNHVIPFSVGARSCIGRNLAKNELFVFITSILRR 455 456 FDLELADGCHVGDISGDSGMLLHPPKYKVKGSVLM* 490 >sequence 43 seq 43 shares an exact run with seq 24 ci0100132600 complete MFAKLFGWWGSSWVTTCLLGLLTLVILVYYYWWKLPHPRYPPGVRGIPVMGALPLLGKFAHKVIMRWSHE KYGPVMSVRFGPSDSVVLNDYESVYEALVKQGPAFLTRPNIELINSYSNGYGFGFAEGNKKYMEVRHFTL KALRGFGIGGRVMEERVSEVAQDLVQSLQELDGKPTNFRMIVGTTVSNVIASIVLGKKFHPKDEDFQRHV QLIFDSFGDEETASYILVLMYFPLLRHIPPFKNACKKFVDVHFKQFDFCRKEILEHKKNLDENEPGDYID AFLVEMKKHSPQDSWFHDESLMVCVADLFLAGTETSTSTIMWGVVVLINYPEIQEKLHQEITNTTGEALP SLNHRDDLPLLQAFIQEVYRCMTLVPLGVQHQTTKDVEISGYCIPKDTVVFTNIDAVHHDPNIWKNPSEF NIYRHIDKDGKFIPSKKVIPFGIGCRSCLGEKLARIEIFLFLANIIKRFKVLPDPESQDLPPMDDGITGF GFFPFPFKAVVLPRKNEEIPE* >sequence 231 80% to seq 64 may be in gap at end of scaf 311 close to ci0100140915 = cibd084l24 FYRKEIKEPKNNFNVNEPKDYVDAFLIEMKKHSLEDSWFH (0?) EEALLICVADLFVAGIETSTGTVMWGMTVLINYPEIQEKLHKEIVNAT (1?) ETPPNLDQMDELPLIQAFIQELYRYMTLAPLSLQHETTKDADIGGYCIPKETL (0?) VLTNIYAVHHDSNIWKNPSEFNIYRHIDKEGKFILSKKVIPF (?) GIGCRSCLGDKLARNEVFLLLANIIKGFEILPDPESKELPDYKDGVNGFLYVPYDYKLVAKPRIVNE* >sequence 46, 54 33% to CYP1A MMITAAILLDAGRSFAVPVAFTAVSVLTLYVCLRKRQGIPPGPTAWPLVGNL FSMGRQSHLILESMRKTYGDVFSVYFGSTLVVVVNGKAVEECLSTHSAR (2) YSMRPELHTAQYILEGKSFAFSHIAVSKHKRYRTLAVAVVKQLVNGGGEKTDVAV KHGLQNGTRHSSIEERIFMEAACMCDKLLETSDSPDLKDEILKVITKEL (2) LSEYELDEISRVVENLRNSNEAIMLVNFIPAVRMLWRNGLQKYIQLTQSLNR (2) FFERCIRNRKAQLATVSNGHTEDNGVRLTNGVDCTVKFWQKLKNDPQYEESRVMKV (0) VADLFGARVDTMTVALAWMIVYWSTYQAAQERAQKEIDHFVKNEKRLPR (2) YSERNQLPYTMALIMEVERHCSFVPFTLPHAPAQDTMLNGYLIPKGTMMLISMRSINHDTAVWDSPAQFR (2) PERFLLDQSGGFNSALAEQVMLFGAGRRRCAGEALGRMQIFLYSVLFLRKCTFRR SDKDGHVLPESLAGISLIPQTMCVSISRREADGSKNTEP* >sequence 49, 67 opp end of clone is seq 67 join together MLVQILTATFWTLIP NSFGDLLIYAILVLTIVIYVKSLKRDKEWLALPGPIPW PLVGNA PFLGAEPHKKLLELSL KYGPVYRLKMGGIKTVVLCNAEVVRSALIKQREAFSGRPKFSSYKAVS AGESVVFNDEET LPP WRSH KSKIVRHMHKYTTSIRTRDKVTDLINTECMMMVTELDRISRSKCVNPENVIRM ALANVMCAVCFGNRFEYDNE (0) EFQKLLSMNTEFGAVIELGPIIDAMPWIK (0) VIPKFKKAIADYLKINLQLDTWSRHR (2) VDGVLKTFDNDDVTNVVASMTSEVLEKKSAGESREITESETKTIAALSADILGA GQHTTSTTFFWVINLLLCFPKVLNKLTEEVRSKLGNRLPTLEDRTSLPYMDAVLTE VLRFSSPLSSTIPHSTLKDVKLAGHTIKRGTMVIISQYAVNHDPQNWKNPENFDPERFLTK NEGGEIIFNESLSEKVLAFSIGERKCPGSQLSRMLLFLATTLLVQVSDLSADLERPPT AAAEYGLILRPKHLSIKLTLREHWQRRDSIRA* >SEQUENCE 25, 35, 93, 89 end is long but = EST rcies001a09 MDSLVFVLVDTVLVMKYQILLLLVIVYAIKLLAASQSRRLNIPGPYPWPVIGNVIEMGGQPQFSLTNMAK (?) RYGPVYLMKLGTADVLVLNNYEVIKEALLRQRRIFGGRPIFDSFKKISQGLGVVFNSTMT QGDEWMKLKMTIVKHVHRFVSSEETKGYVAHHVQMEAVELVRILTEKCRS SPNEVIFPIEQINLAIANVVCAIMFGHRYQHGNK (0) EFQDLISLNEQFGDVIGSGSQVDVIPWMK (0) IFPKFRNALKVFDFLTNRLNNWMRLR (2) TKEHRLTYKHGVIRDIVDSFIAESIDHPEQSALNDDVIMALTTDVFGA GQDTMSTTMQWVFVYMMHFKECQRK IHAELDSVIGPGELPHISDRRRLPYLEAVMHEIFRHSTFTSTTIPHVTTQDTVLDGHFIP KGILVFINQFGANHDPNHWVDPDKFIPERFLDGKGNLISRPHDRYLLFSTGARKCPG DELSRMLILHFMATMFALCEVSSDPQKPATL DAVYNLSMRPKELRTIVRS RNLPFLKNSVAQMSEADSHVLTVPGETTSFLTSRVESTVPDNQESQFSDNDFEKVDTKIP KRKVFSRPTLTHDDINGNNVRKRGNLHQSAMYRIQLAT* >sequence 15, 63 N-term 43% to 2C9 C-term 46% to 1A1 WXXP to heme MESVWVVIKWVKETMMSNSSFETIVAVATLLLLLMFVSENWNWLKIPGPI PWPIIGNLGSLKGTKFLSIHEMYKIYGRIFRLKFGRVEAVVLCDVELIKE ALLDRGRSLSGRPQFASYRLVSGCKSVVTNDPRCLREWVNY KSTMVQTLCSISKNNEMKELMNERIGSVLVYMIQELEKGGDGQNFAEDIVTKTVANFLCT VCYGGTYDFNSK (0) EFNNLIEMSRHYTDNLSKSILRDMIPLAE (0) ILPSVNKGRADFAKTSYHLHLWFLKR (2) VEEVIQHFQPNKLNDLASVMVSDLTNDPTENISNITEKDRNSIAAIINDLVQG (?) GYHSLYSMALWVVTYMIKYPEEVKKIENELNEVLDDYLPTLHDQESLPHTMAFINE (0) VLRCRPSLPLAVPHSATEDTKLGGYDISKDTMVVASLYSANRDPKVWANPDQFDPSR FLAKDDLGVTVLDETKVEQVFTFSLGDRKCPGEDIGRSFLFLTTAYLAHTCKLKPDPAK PPTFQTKPGSITRPKDFGVQLNVKKCWLGVFKPDDNEE* >sequence 51, 219, 223 1 accession 40% to 2J2 MYDVIIVCVISAIAVVAASWLKQNGSKKFPPGSRGLPFLGSVPFVLGSLERTLA VWGWQNYGPVFFFTRGNTRTVVLNTYEAASEGFVKQGNH LSGRFIDANHNELFADYSPQWQEQRKLRVKTLKGFGKVNTEEIMTSECKKMICRIRETKGSF EAGYYLAVPTFNIVKSIIYGE RFENKMNLIIYTFIFVAGEPSEGMNKQDEVNLSKVLADPVYKAGNGFMEFMAGMFPQIRS SVAYERMLEKKGKERVRQVKE KMTGSKKLIEGMLQNLQDWETGEVYNNQQLSCYLTDLFMAGTDTMATTVHWSLVFLAQNP QVQEKMAAEVKSVAGNDVITTSMMDSMPYTRAVMHESSRLRPVFPLSLRHQATSDVTVKD YVIPTGTSVVANLWAIQNDPKWWKQPSEFRPERHVTEDGGFTKSEKIVPFSVGPRYCLGS QIAIYQQFIFLTNLVRGFRFRFDQAKVEPDLTGLMTSVLAPVNVHLVAEER* >sequence 57, 104 MFSTLLGSLALNYISTCLLGIFTLGVCMYYYWWKFPHPRYPPGVRGIPVLGALPF LSKFAHKDITRWSREKYGPIMSVRAGQTDSIFLNDYESIYE (?) AMIKQGPAFRSRPAMKLIT AYSNGYGLGFSESHKKHSDLRQFSLKTLRGFGIGGRQLEDRISDIAQELVQAFEMLDGKATDVK MIVGKTVGNVIASIVLGKKFHGDDKDFQRHINLIFDS (2?) FGDEETAKYVLAMMYFPSLRHIPPFKNAWDQTLKGHFEQL (1) DFCRKEIEELKKTLDVNDPRGYVDAFLIEMKKHSPEDSWFH (0) EESLIVCVVDLFFAGTDTSTSTVMWAMVALLNFPEIQEKLHKEIANAT (1?) QIYSGEALPSLDHRDELPLLNAFIQELYRHMTLGPFGVPHETTTDAYIGGYCIPKNTR (0) VITNIWAVHYDPNTWENPSEFNIYRHIDEDGKFVPSKKVIPFGIGCRSCLGEKLARLE VFHFLANIIKRFEILPDPESKELPDYRDGVNGFVYVPYRFKLVAKPRIDNK* >seq 64 cibd042f04_3 75% identical to seq 57 but not same seq GCiWno474_c04.b1 MLPAFLGSLTSNYVTFCLTGVCTLVLFVYYYWWKLPHPRYPPGVRGIPVLGALPFLGKFAHKDI MRWSREKYGPIMSVRFGQKDSVVLNDYESIYEAMVKQGPAFRSRPTMKLITSYSNGYGFG FAEGHTKYMELRHFTFKILRGFGIGGRYLEDRISDHAQELVQDFKKLDGKATDVR MIVATTIGNVIASIVLGKKYHPKDKDFQRHLQLILNN FGNEETASYILCMMYFPSLRHIPPFKNAWKTFMDEHFEQL GFCRKEIEEHKNNLNVNDPKDFVDAFLVEMKKHLPEGSWFH (0?) EESLILCVIDLFFGGTETSTSTVMWAMVVLINFPQIQEKLHKEIVTAT EALPNLDHRNELPLFQAFIQELYRCMTLAPLGVQHQTTKDADIGGYCIPKDTV VLTNIHAVHHDPKIWKNPSEFNIYRHIDQDGKFIPSRKMIPFGIGCRSCLG 687 EKLARSEVFLLLANIIKRFEILPDPESKELPDYKDGINGFLYVPYDYKLVDKPRTVNI* >sequence 58, 70 45% to 2R1 ci0100144015 complete MLTVSYVYVLALFLLCCLFRWRQRPKKFPPGPLGIPILGITPFVGSRMEKYLASYYAKYGGVMSFRLATKDWIILN DIEAITQAYSKQGESFSGRPQSYLFDKLTEGCGIAFANGPRWQEQRRFVLTALKTLGMGKKGMEEIIKTE NQHFISVIQNSGGKVHIL (0) KEIRRLEANIISKVLIGKRFDYTDERLNSIIDCRFR CTFTAFSSIYSTVSRIIQETGLQYSARNKQVMFIY (1) IIRKLTNEHETPDEANPKDLTEIFLSEIEAKKRRILLE (1) FLSNIFCTQKIQLYHLIRDIFVGGIDTTTATLGWGIICLLHYPECQMRIQEEIEDVI (1?) GNSEPDMTHQKHMPYLRAFVQEVHRYCTIAPFSIPHCVTEDCVLFGYHIPKSTPVMSNIWRVHNDPKY WKNPEKFSPERHLDLEGRFVPSNRVLPFAVGQRRCVGEQMAKVGLFLFWASFLQK CEIHIDSDFGLPSWKKEPSGTVRSPQEFTVKVRNRLQNN* >sequence 62, 83 8 accessions 42% to 2U1 ci0100151040 complete MLPIICLSFFLIIFLCWKPLRKRSNYPPGCDGMPFLGCLPLLGVHPANTIMKWSKRYGDVFYVKYAMQNV IMLNSFESIKEALVKQGHVFSG RQHTFITEFTSGNAGLGFLDYGAVWRSQRRFGLATLHEFGMGRRCMAP KILEEADLMCDEMLKHGDHAFDCMDHITPAVTNIISILVFGKRFEYDDPTLLDTLHLLFDFGHDFPFALF YFAPLVRFLPPWKAELEKIRQARKPVWDHIMSAIQQHQDTYNENDCRDFIDSFIAKMKSTNADGVKF EQLFHYVKELFLAGTDTTATTLRWAILLIHYHTDVHEKIHEEIDREASNPVKFEDMPNMPYTQAVIQEVFRSR PIANFGTMRKTTAVGKVGGYRIPKGSIVMPNIWAVHHNPQRWKNPHLFKPDRHIDQNGKFVKSEDIIPFN VGQRSCLGMQLAKMEIFIFLVRMMQRFNFKLATDKEKLDLNGQYIFLHYPPAFDVSVQLRS* >sequence 66, 75 1 accession 41% to 1A1 33% to 2C9 63% to seq 57 MFESFGYYFGESLTSNHVISGLIGVFTIVMCVYYYWWKFPHPRYPPGVRGVPIFGAIP FFGRYVQETLAKWSRSKYGPVMSARFGQDDAVVLNDFESIHE (0?) ALVKNIQHFNSRPSFYIVEQFTHGYGFGFADGHNKYLDVRNFSLSALRG (?) LGIGRRTMETRVSEVAQDLVKILEDLDQKPTDLK (0?) MMLGGTVANVLCSVVFGKTYDLSDPEYQYAVQCSFDC (2) FGDPENSEYLNFMFFYPKLRYIQPFKRALKKFIDVHLGLIAFNQKEINLHKERLDVN EPGDYIDAFLIEMKKHSPENSWFHETQLLHCLGDMFIAGTETTTNTILWALLALIHHPEI QEKLYQELLDNIGEQVLPSTDHRDKIPLFRAFTQEVYRFKTIVPLALQHRANKDVEIGGY VIPKGTKVFPNLHAVHHDPNIWKNPSEFNIYRHISKDGKFIPSKRVVAFGMGARSCLGEKLAITE VFLFLANIIKRFEILPDPESKELPILKDGVNSLLYVPYRFKVVVKPRIINE* >SEQUENCE 72, 86 ci0100136116 100% to seq 18 C-term 3 diffs to seq 72 100% to seq 105 MMEVLIGLNVSTLATFLVVFYAVYYWYKRPGSYPPGPRGYPLVGVLPRLGKYPERVMRKWSRKYGPVMSVRMGR NDWVSLNDYSSIKE (0) ALVKQGNVTSARPDFPAHQEVAHYHGIIFSSGDLWQNQRRFGITTLRE (2) CGFGKRSLEERIVEEVSYLNDAIRATMGKPFEIM (0) AKLGNAISNNICSVAMGRRYDYDNKRFKDLVAKVSWR (2) SMGVVRIRFYNFFKCFFCLLPNGTKTSIERC (1) LFREIIDEHRATFDKENIRDFIDAFLERMAGNDPHFTVTIYA (0) EFQLLVFLRDLFIAGTETTTSTLRWALLVLLHNPGVQDKLRNEIHKVLGF (0) STTVSMAHRESMPYTRAFIEEIYRYRTLTPLDLFHETSGELNISGYKIPKNTV (0) IIPNLWAVHNDPDVWDEPSKFKPERHLDEKGNFVQSKHVIP FSVGPRHCLGEQLARMEIFIFLVSMVQKFEFLPDPNEPNLPEINHGVNATAFVAYPFNIVANQV >SEQUENCE 157, 79, 215 60% TO 2C8 ci0100148852 MASNIALELCKGLYT WTSVAVIAVSLFYIWWRRPHNFPPGPRGLPLIGVLPFLGRFPGEVFKKWSLEYGP VMSIRMGRSDVVVV NTGDSFREAFVEQGANTSGRPPDIMFAQVTGDKGLVFKDYSANFKATKKWILRTLS QYGLGRRGMEEKIQEEADCLAGYLRTKVGKPVDFKVLYCLTSNVVSLIMSGRRYDYEDEEHQKRLEFGDP NDAPKLLLIDLFPALKYLPVFKTANDKLRSRMKAVLGFTRSNIEEHKKTFEKDDIRDFIDTFLAEMKFGN ETSRDNFTVH ELNVLFRDFFLAGSETTSTNILWILLLLVHYPEHTAKVIKEIDTIIGPNASASSTTHSSQMPF TCAFIQESMRYRTSVPLGAQHYCQKTFQLNGYTIPKGTLIYPNIWGVHNDPKSWPNPSKFDPYRHIDEKG NYRNSSNVIPFLIGPRNCVGEALARQEIFIFLVKILQKFEVKPVPGKTPSIADGVNSVVFSPHSFNLILQ ER* >ci0100144919 96% to seq 84 almost same as seq 222 N-term complete MIFEIVITILGAVLLHWLFIALFNPWRYPPGPAGLPLIGSLHLAAPEFHIKLPKLAKKYGPIFTIKAGCRRVVFLV GYDVIKEVITDRAKDFASRCPTIVARIVRGEGLDG IAASPYGPKWMANRKFFYSAMRTMGLGKRGIEKCVVDE IPYIVGELENICSNDGLFEPSSVFDSAVLNVLAYFTFGNRYSYQDEKFKELIHINNDFFQKSKFLNQPQF FLLTLIPGLDKYWLPKCGKDIKESFG KINKFVKAEIKQHRQNLDPENPRDYIDCYLNELNQMNDQSELSE LGLEMSIMDLFQAGTETTSTTLRWAILYLANNPHIQEKVQQEIDEVLGFDQLPQYEDRMRMPYCEATVLE VQRMATIAPI GLQHCSEDDQVLGGYRVPKRTSIVACYHSIHFDPKFWKNPNKFDPCNFLDSNGKVCIPDA FMPFGGGLRICVGMNIAKQELFLFFVAILQKFSLHLPDDMDHLDEKPGPGIILAPKIYRVQIKRR >SEQUENCE 17, 111 first intron is over 3000 bp MIFEIVITILAAVLLHWLVNFLFNPWRYPPGPAGIPVIGSL HLAAPDFHRSILIGFQISFAISVAFLQKQINRLR (1) NIVFSELPKLARKYGPIFTITAGYRRVVFLVGYDLIKE (?) VITDRAKDFASRCPNMPGRIVRGEGLD GIAAAPYGPKWMANRKFFYSAMRTMGLGKRGIEKCVVDEIPYIVEELEKLCTKNELFE PSSVFDSAVLNVLAYFTFGNR (?) YSYQDEKFKELIHINNDFFQKAKFLNQPLFFLLSLVPGLHKYWLPQCGKDLKESVG (?) KINKFVKAEIEQHRQNFDRKNPRDYIDCYLKELDQMNDQSELS (?) MSIMDLFQAGTETTSTTLRWAILYMVNNPNIQSK VQQEIDDVLGFDRLPQYEDRMRMPYCEATVLEVQRMATIAPIGLHCSEEDQ (?) MLGGYRIPKRTSIMACYHSIHFDPKFWKNPNEFDPCNFLDSNKKVCIPDAFMPFGGG (?) LRICVGMNIAKQELFLFFVAILQKFSLHLPDGVDHLDEESGSGITLAPKIYQVKIKRRA* >ci0100144128 90% to seq 222 scaffold 38 complete MLYGIVLTIIFVLCLPICHWIYVLLSKPRNYPPGPLGIPLFGSFHYAAPAIHI (1) NLPKLAKKYGPIFQLRAFNQAIVCVVGYDLIKE VMINKSKDFASRYASVPGKLVRGEGMD GIAVAPYGPKWMANRKFFYSAMRTMGLG KRGIEKCVIDEVPYIVEELEKLCSSDELFEPSSVFDPAVLNVLAYFTFGNRYSYQDEKFKELIHINNEFF QKAKFLNQPEFFLVTLIPGLHKYWLPQCGKDLKESVGKIHKFVKAEIKHHRQHLDPQNPRDYIDCYLNEL NQTSDQSELSELGLEMSIMDLFQAGTETTATTLRWAVLYMVNNPHIQENVQSEIDNVLGFDQLPKYEDRT RMPYCEATVLEVQRMASITPFGLLHCSEEDQKLGGYHIPKHTCIMPSYHTIHFDPKFWKNPNEFDPCNFI DPNGKVGTRDAFMPFGGGLRICVGMNIAKQELFLFFVAILQKFTLQLPDDVKSLDDEPIPGMTLAPKPFR VKIKIRS* >SEQUENCE 91 WXXP TO END 39% TO 2R1 56% to seq 36 487 WPQPDKFDPHRHLDSAKKFISSSEIVPFSVGARY CMGRETAQTEIFVFVVALLQKFKITRDPTLKWPIHMEDGIQSALCCPSRYKIILN* 216 >SEQUENCE 94, 189, 97, 96, 137, 191 41% to 26B1 fugu ci0100143924 revised MSFSVVSAYQKVAPMVVTYDLLRMPATTGRNTTTLWQHEFLSTM (0) ENATEGWETELEVETVAEATFQQGANGEHEYSGSDQFFEQLNGTLQSLFDSSAVAASPLMA FILLGLAAILAVCILVHPLKAWWLNYNMKSAVDPDSKLNHLPIPPGSFGLPFIGETLAWITQ (0) GPKFNSNRRKKYGNVFTTNAISLPIVKVSGHEYVKE (0) ILTGEHDKVTTIWPYTVRTILGSHGIVNSIGDIHKYKRKVAFKAFTRAALNDYVPIMRNHATRI (0) VRQMQESDQPLVYPNMLRLTFDVAVNALLGLEISDQVELDMLFKTFHQLVSNVFCLPYNVPGFGFNK (0) GMKARNQLLDLLQIHIEAKKQA (0) KLREESNLDNNNTKSDADNEFELGDLEIKEIAIELMFAGYYTSASALTSAILELARHPDVFSKL ENELLQHGILREDSSDEEHMPELNLQNIHKLTYLDQVLKETLRIRPPVLGAYRRAKKTFQIG (0) DYRIPKGWTVIYNIRDTHELEFEHMTEFDPEHFAPDANDKKFR (2) FIPFGGGPRVCIGQEYARIIMKVSLIEMIRCNSSWKLANKTLPKMVA IPTLHPKDGLPVHLQPRKTVGVVTSRSEPVATDSS* >SEQUENCE 102, 53, 150 SIMILAR TO SEQ 8 N-TERM 39% TO 2B6 MIDIFQTICASTGPYPFTVLVICVVWVFRWWRRPHDLPPGPRGFPLLGVIPFMGEFPER VIKKWSLENYGPIMCARMGMQDSVFLNTADAIKM (?) AFCDNSEYLSGRPTTTIFKQVTGDKGIGLKQYGENYKVIRKFIVRTLSK (?) YGVGKTMIENQIVEEAGLLAEFLKTKTNQLVDL (?) FYLYVLTSNIVSRVVCGRRYDFEDEKYHQVLINLQKM (?) LGDAKDAPFTLLIDFFPGLKHLPWFRGANNRMIERIKNWL (?) EHSKDHVEEHKKTFDKNNLRDLIDSFLHEQAYGKTENKN (?) EEELIVIVRDLFSAGSETSSNSILWILIVLLHHPHHAKKCIQEIDDVMG PNGAPSSQHREKMPFTCAVIQETFRYRTVAPIGLQHYAQATVELGGLRIPQGQG VYSNIWGVHNDPVAWPNPS EFDPYRHINKDGKFVLSNNIMTFSIGPRSCPGEALAKLEIFMFVTKILQKF NVRASPNNPPSLDGVNCLQFTPLSCEVILTER* >SEQUENCE 103 93% TO SEQ 102 (3 DIFFS) SIMILAR TO SEQ 8 N-TERM 37% TO 2R1 MFENMVYTLQYIFASIGPYPFTVLAICVAMLVRWWRRPHGLPLGPRGIPLLGVIPFMGE FPERVIKKWSLENYGPIMCARMGMEDGVFLNTAEAIKM AFCDKSEHLSGRPRTTIFKQVTKDKGLALKQYGKDYNIIRKFVVRTLSQ WGVGKTMIEDQIVEEAGLLADFLKNKTNQPIELK LYFYVMTSNIIGRVVCGRRHDFEDEEYHEILN NIQKVLGEADDAXLTLLIDFFPGLKHLPWFRGANNRMIERVKKTLEYCKGVIEEH KKTFDKNNLRDLIDALLYEQQFGKKSKRKVLL EDELVVIVRDLFTAGS GETSSLMISWVLIMLLHHPEHAKKVIEEIDDVMGPNGAPSTKHRDRMPFTCAVIQETFRC KIVAPIGIQHYAQATVELGGYIIPQGTMVYSNIWGVHNDPVAWPNPSKFDPYRHISKDGKFF TSNNIISFSTGPRSCPGEPLAKMEIFLFVTKILQKFEVKAPPDKLPSLVGVNCLLFSPES CEVILTER >best savignyi hit to CYP24 53% to seq 110 possible processed pseudogene with MFQQLWEPLRLILYNVAVLAILAVLFKFFKRISVICKIRWNLRTEWKGPPCHWFWGHLGQVT KGNKDPANFIPYFTKNEKLFPFGFAIYAGPLRTILTIHHPNLVKQVLNASTFDAPKPKEAYGGLLLKWL GLGLLSLNNREWFRHRKLLTSAFHHDILKPYVKISNSCTDILLDTWQKKVKQQPTGFSIEIFEQA SLLTLDIALQCLMSYESKCQERLDNDYISSIKNLSFLYVRRNRSKNLLLKYFPFLYKWS QEGKLYYKECDSVHKFSTSLINRRKQELGNPDIAPKREFLDFLDMLLQARDSNGLGLTTS EICAEVDTFLFGSHDTTASGISWFLYCISQNPEVQNKCF*EITQVLGDRENVRWSDLSSL PYLTRCIKETLRLYPPAPYISRTVNKDIYAEGKTILKGTTVLLSIIGVQRSVLFWKYPQK FNPDRFTPESASMQNNHAFIPFSAGPRNCIGQHFAMNEMRVVLSKMLRTFRFSIDPNHKI VPCHEIIYRSRGGIHLIVEPRTSVSI* >SEQUENCE 110, 121, 122 139 67% TO 4F3 MLFMFIKYAVYASFSLLALSILHSLFS (2) IQEFYTVTKYLKSNWPEPESHWLWGSFGAV (0) MLKMSADPSYYFTYMNDCMKKYPYGFRTQLGPFIKSLNTYHPTIAKAIL (1) NVPKLKRAYKLLFEWL (1) GHGLLVLEDKVWLRHRRLLTPSFHFEVLKPYVKTMNESAHVMV (0) ENWLSKTSDNKVAKVEIFHYA SLMTLDTTLRCLMSYQSNCQDEN (2) STNDYVAKIYELSETIVRRQRNLNILNRIDPIYNVT (1) KEGRKYLKLCDDVHKFSESVINRRKCDSEQPAHNETRKYFDFLDTLLKAR (0) DSDGKGLSDSEIRAEVDTFMFEGHDTTASGISWTFYCLAMHPEHQEKCFQEIEKVMADRTDIEW (2) NDLS NLPHLTLCIKESLRQYPPVPIIFRKLNKDIEVDGKTIVK (1) DT NVVLHIYALHHHEEFWKDPHIFDPSRFTQENMKSMNSYAYVPFSAGPR (2) NCIGQRFAMNEIKIAVAQVLSKFQLKPDLSKKIQHSVDVIYRATTGLYLKAERRV * >savignyi ortholog to seq 110 63% identical to seq 110 boundaries need checks MGIKPEYSIPSYIFLYFSIYEQYRARKYLKRNWPGPKGHWFWGYAGEL (0) MRKIRKDPGYYFSYIRGITKTYPYGFSSNLTPLFASLNIYHPNAVKAVL DAPKLKRAYRSLFEWL XHGLLVLEDKEWFRHRRLLTPSFHFEILKPYIKTMNECSITMV (0) ESWVKKTSEKAQIADDMDITPSISRMTLDTILKCLMSYESNCQDEX STNSYVSKIYEISASVAHRMRDVNIFKRINAIFYAT TSFRAHGKAFLKLCEAVHEFSDSVINCRKAELKQPLVKQNRKYLDFLDTLLKAR DADGKGLSDREIRAEVDTFMFEGHDTTASGISWILYCLAENADHQEMCFREIQEQMGDRTDIEW DDLAKLPCLTMCIKESMRLFPPVPIIFRKLNKDIEVDGKLIAK GTDIVLHIYALHHHEMFWKDPEKFDPNRFLPKNSSNVEGYTYVPFSAGPR NCIGQNFAMNEMKICLAQVIRNLRLYIDGDSPVPKMHPMLVLQSKSGIFIKFEKI >SEQUENCE 118, 140 N-TERM TO EXXR 381 AA 35% TO 3A5 no introns MYIPSFLSVETLVLLITSLVLVRIYIHKKWQFLKNINIPHIQPTILQLGTVRPFIGNSD WMMNSHEYCKRKLGKIYGAYLFLTPQITVADPDIIKQMYIKEFATFPTRQTSLTKINGK VMNTTILMTEGEQWKRIRNTLTPSFSTAKLKEMLGIMEDCCNHAINILIKTTQNDQ GKFEAKSVFGKLALDITCSAAFSVDTHPQELKDGQEPKIIKMVKQAFGFEISR SPLVMLCFMFPAFEEILCKLDYSIIPKDTVNYFANLTKSLIDQRKHSTVKPRVDLMQLMLNDQI SVEDVAKGKSKGLTQLEITSNSVLMVLAGYDTSANALTLLAYNLATHKHVHKKVHEEIDQ MLEKYGSLTYEALTSMKYLGMCLNETLRLYPPIAVNSRIPNRDITINEVFLPKGIQVIVP VYGMSRDEEIWEEPLKFKPERMEDMRSVDPMIFQPFGGGPRGCIGMRFAVLEIKLAIAKIL QDFDLDVCENTPKPPVQLEFKSTTLKPKGDIYLRVTPKSN* >SEQUENCE 112 15 exons 54% TO 3A5 note 112 identical to seq 119 up to VDSQ MITPLDILSLETWILILTSFILIR (2) LYIHKKWQVLKNLNIPHDQPTILGIGNLMDFIKNPEF (0) LFMGYFDLKKKYGPIYG (2) AYNWLTPTITITDPEILKKIFIKEFSTFPDRQ (0) IALLDVNGKEMNSALTVVSGRKWKRIR (2) NTLSPTFSSSKLKEMFGIVEDCADSFVQNIGTINSDVDGRFDVSL (2) AFGRLTLDGICSSAFGVKVDSQ (0) AGNNGEESQVAKMARELLAFEFFRNPMLYVF (1) LIFPWTEKIAEKFDYTIFPRKYVRYFARLIDSVVESKKEKKQ (0) RTDILQTMIDSQITEEDVKNGAVK (1) GVTKTEMKGNSIIMLLAGYETTSNAMVFLAYNLAVYKDAQHKCREEIEQVIAEH (0) GGLTYEAVQDLKYMTQCLNESMRLYSLVPA (2) NSRYCERDITINGVTIPKGTLVNIPVFGMGRDEEFWNEPLTFNPDR (2) MLDMNEIDPMIFQPFGAGPRNCIGMRFALLEIKITFAKLLQKFHLDVCEDTP (0) APPLKVGFKTSMKPKETLYLKVKALET* >revised seq 232 probable pseudogene of seq 112 ci0100132937 scaf 41 6288 MITPLDILSLETWILILTSFSLIR 6265 SYIHKKWQVLKDLN 5697 IPHIQPTILGLGNLGKFLKD 5678 GELTYETVQRLKHLTQCLNESLRL YNNVPV NLRYCERDITINGVTIPKGTSIDIPVYGMGRDEEFWEDPLVFKLES MLDMNEIDPMIFQPFGAGPRNCIGMRFALLEIKITFAKLLQKFYLDVCEDTP ASPLDVTFKMSMKPKDQIFLKVTKINE >revised seq 119, 131 = ci0100140050 scaf 75 exons 3-14 cigd043j06_1 rcigd043j06_6 Scaf 41 N-term exons 1 and 2 runs off end numbering from scaf 41 1002 MITPLDILSLETWILILTSFILVRL 928 115 YIHKKWQVLKNINIPHDPPTILGVGNMMGIIKDPNQ 7 MFLSYFKMKKKYGLVYG AYSWLTPSITIADPEILKQIFIKEFSTFPDRQ KALLDVNGKEMNTALTSVTGSQWKRIR NTLSPTFSSSKLKEMFGIVEDCADSFVQNIGTINSDGDGRFDVSL 3486 AFGRLTLDGICSSAFGVKVDSQ AANNGKESQVAKMAREFLAIEIFKNPMVFVF VVFPWLGKVAKLFDYSVFPKKYIIYFSKLIDSVVETKKKKKQ RTDILQTMIDSQITEEDVKNGAVK GVTRTEMKGNALITLIAAYETTSNAMVFLAYNLAVYKDAQHKCREEIKQVIAEH GGLTYEAVQDLKYMTQCLNESMRLYSLVPA NGRYCVRDITINGVTIPKGTLVNIPVFGMGRDEEFWNDPLTFNPDR MLDMNEIDPMIFQPFGAGPRNCIGMRFALLEIKITFAKLLQKFYLDVCEDTP APPLEVTFKSGMRPKEIIHLKVTAVE* >SEQUENCE 125, 113, 114, 213, 138 MITPLDILSLETWILILTSFILIRLYIHKRWQVLKDLNIPHDPPTILGVGNMSKFFKDS DAGYKRHLELKQKFGPIYGHYIGLSPHVYVGDPEILKQIMIKEFHIFPDRQKNFQNANGK EFNDNLTAISGKQWKRVRDTLSPTFTSSKLKEMFGIVEDCADSFVQNIGTINSDSNGRFQPA VVFSKVAIDSICSAAFGVNVDSQ SKPQGQEPEVVKMALELFNYPLFKNPFFLIF (1) LMFPWTENIAKVFDYSLLPKDCLRYFARLAKTVKKNKANVEQ RVDILQTMINSEITDNEVKNGAIK LTEIEVTANSLLMMFGGFETTASAMVFLAYNLAVYKDAQHKCREEIEQ VIAKHGGLTYEAMNDLKYIMQCLNESLRLYPPAPMNSRYCERDITIHGLTIKKGITVQIP VYGMGRDEEFWEDPLVFKPERMLDMNEIDPMIFQPFGAGPRNCIGMRFALLEIKITFAKL LQKFHLDVCEDTPAAPIDVSFKSGMRTKQDLFLKVTARE* >SEQUENCE 136, 203 I-HELIX 58% TO 4X1 missing region before I helix MKAIARFPFPALVRFVSTCPAHQRAAWSKKASQEDLSYAKPFEAVPEPNSY (?) PVIGTALEYTPLNNFNPSYVGEHWIERHKELGPIYKENILPG (?) VTETMVFTSSPQNTEIMFRNEERCPFRDPLGPVAKIRDERNEYHGLTNSNGEDWWR (?) VRQIVNQHFLQNTAVWKYAEAHRNVSKDFLKFIDRNMDEKNE (?) VPNFGKALNRWSFEGAGVFTLKRRL (1) GALDIKPCTEIEEMINCNERIFQSMGELLYALPFWKYVNTKDMKVYKTVFIYTCNKFCESYLVLID (2) YIHSFVNIICSEKVILMTEFLVAGIDTTSNSAAFLLYALASNQEAQERLREEVREVNRMDTIDGK (?) LLHNMKYLQACLRETQRMFPFVNSSPRRFRKDLVLSGYLIPAG (?) HPHSDTSNTVNSRNPEYYDDPNTFIPERWLETKS (?) FKPYIRFIMSGPFGLGMRMCPGRKFSIQELQLLLITLLSNY RVEYHHDPIKVAFRLVSFPSQAPQFKFTKLKN* >ci0100142705 LSYAKPFEAVPEPNSY PVIGTALEYTPLNNFNPSYVGEHWIERHKELGPIYKENILP VTETMVFTSSPQNTEIMFRNEERCPFRDPLGPVAKIRDERNEYHGLTNSNGEDWWR VRQIVNQHFLQNTAVWKYAEAHRNVSKDFLKFIDRNMDEKNE VPNFGKALNRWSFEGAGVFTLKRRLGAL ICSEKVILMTEFLVAGIDTTSNSAAFLLYALASNQEAQERLREEVREVNRMDTIDGK LLHNMKYLQACLRETQRMFPFVNSSPRRFRKDLVLSGYLIPAG THILNTSNTVNSRNPEYYDDPNTFIPERWLETKS VTERQRERFIMSGPFGLGMRMCPGRKFSIQELQLLLITLLSNYRVEYHHDPIKVAFRLVSFPSQAPQFKF >ci0100143467 complete seq 145, 120, 123, 197 MPTLIADLVLLFVLYLLGKAVVRNWKTIAAGRRMSKIIGSPPAQWIVGHAYS (0) LAPTPEGLLSNVKTAASYPGMMVLWLAHKWIVQVYHPDTVGKVLSTQ (1?) ATKDDMTYEYLKAWVGYGILTSNGERFQRQKKLLVTAFHQDLIKQ (2) YSPVFTKATDRLIEKWTEKQNQSVELFEDISLLTLDCILICAMSTDLDCQRIG (2) KQHPYIDAVHQLAYLAIKRFYNPWVRHDFIYNLTSEGRLWKKLLKVVHE MSESVVKARRKVVSISDKPKLSKTKYVDFLDVLLRAK DESGEGLTDQEIREQVDTFLFAGHDTTAS (1) AICWFLYNLACNPEYQDKCRKEVKEVMEGRKDVQM (2) FDIGDMTYLNMCLKESMRLHSPIPFIGRKVKNPINLRRPN (?) DNTEVTLCPGTTIIPHLLSLHRNPDVWDEPE (0) VYDPERFNTKNASRLHPYAYIPFSAGPR (2) ICIGQNFALVEMKLTVAKLLLNYEFYVDEDIPKPQPIPNVVLRSQNGLYVKFRKI* >CIONA SAVIGNYI BEST MATCHES TO ABOVE SEQUENCE TYQYLTGFVGYGLLTSNGVRWQRQKALLSTAFHQDLLK YSPVFTSATDTLIEKWGKKQNRSVELFEDISLLTLDCILICAMSTDLDCQRIG EDEKGLTDIEIKDEVDTFLFEGHDTTSSG IAWALYHLAKYSEFQAKCREELTEVLGDRKYVEW IREIMEDRKNVQL FDVGEMSYLTMCIKESMRLYSPIPFIGRKIKNKLNVHRPNGASDVTLDADVTVIPHLFTLHRNPEIWDDPEV >SEQUENCE 147, 149, 153, 163 MTLFEAASEVVFLNRGFVLSSLISEVVLLIAIVAVTKVYLV PAIRYYKRGLLYTKQGCSPPRHWLWGHLTVFPLSHDGMLKRLRTTAQYAGMFVSWAGPFT YWIETQHPTTCAVLANSSAPKAEFLYGCLRPWIGDGLLTSKGSKWKRNRRLLTPAFHFEI LKPYLNIFNKSSYAMLEKWKRHSGQ SYDVYGDVSNLTLDTMMQCAMSTDTDSGGEDGGYGYINAVHELTLLVMERVYNP LHMIDWIYHFSSNGRRFRKLVDFVHETSETIIKERKKELETLVQEDNDNASTATNNELSS FKKRGTKHLDFLDILLQTKDENGNGLTLKEIRDEVDTFLFEGHDTTASGIAWCLYNLAKH PEYQQRCREEVLEEVGEKEKIEWED 19 LSKLTYLAMCIKESLRLNPPVFCVGRELNEDLTFNDKYTPCKKMKVESGNTVLMQIMSMH 198 199 RNPHVWENPE 231 (0) 297 VYDPERFSAENMKNRHSMSFLAFSAGP 383 156 RNCIGRNFAMNELRVVIGQTLRQYKLWCDDDTPEPKMQPNIILRSANGIHIKFK >SEQUENCE 148 missing exon between HTK and GIA MAWTSVGGIVTTTAVGDFLLVACIVVVIKSFIIPAVRGYQDAKKRARSVGEVKKHWLYGSVKM (0) FPQNEEGLMKRVASSFEKPVWTVNWFGPYISEVVIHHADLAITLLSSS (1) APKNNLVYGFFHPWIGDGLLTSSGRKWQRNRRLLTPAFHFSILKP (2) YTNVSNACVRVMLDKWSKKVGTSMEIYLDVNLMTLDTILQCAMSTKSDCQNRS (2) KKNEYIEAVHDVSKYIMSRVHKPLLHIDWIYWLTAEGRKFKQLVKVLHDQSEK (0) VIRERRKTLENRKFEEESSGKKRLDFLDILLHTK (0) GIAWALYNLAVNVDCQDKCREELKSVVGDKENIEW (2) EDLSKLSYLTLCIKESLRLCPPVPFIGRELNEPLKF RSKLKEPNETTIDAHSNIALHIFTLHRNVHVWDSPE (?) EFIPERFKPENMKGRSPHAYLPFSAGPR (2?) NCIGQNFAMNEMKIAIGQTLRRFKVIPDESSPKPSITPQVVLRPKDGIFIKLMKI* >sequence 151, 152, 135 ci0100151682 scaf 357 ci0100147473 scaf 996 rcilv086f16 MSFALANGVLTSTVVGDIALIFATLFVILNYILPIAKLRQSSSNLEKQVGLGSFKKHWFFGHLRLFPPLE HGLLKRAQRTIQTPAWVTTWFGPLVSEATTHHSETAITLLASSAPKNEYVYKFLRPWLGDGLLVSNGKKW MRNRHLLTPAFHFSILKPYAEISNNCIKTMIEKWTTKTNVAFDIYKDVNSMTLDTMLQCAMSFK IEGLKER DENGIGLSDVEIENEVDTFLNEGHDTTA SGISWSLYMLAQHPEYQQKCRDELEEVLKDKEDIDWSNLTKLQYLTMCIKETLRI RPPVPAISRQLDQPLTFKSKTHNVPPTTLAKGSFAGVQIFSLHRNVHVWEEPE VFNPDRFTQDNIKKRSPHAFLPFSAGSRNCIGQNFAMNEMKIAVGQIIRKY ELYIDDDTPDTVMETALVLRSTNGIMIKIRKI >Cs151 73% to seq 151 (C-term) 69% to ci0100151682 (N-term) 40% to CYP4T5 MVWFGIQGVVTSSVFGDLLLLICMAIVIYNYVTPAVKTYQASNKTAKKIGKVKKHWFWGNLKM not linked yet GLLKRAQRTISSPAWVVNWFGPLLAEATFHHAETAITLLSSS AASKNEHLYKFLRPWLGDGLLVSNGKKWQRNRHLLTPAFHFNILKP file 33 YAQISNDCAKTMITKWTKKTGVAFDIYTDVNQMTLDTMLQCAMSFKIE file 9 EKNIYLQAVRDLTNLTMERITRPLLHIEPIFWLSSTGRKYKEALEIVQGQTNR VIQERRAELKTSKEVHESVEGLDQVKVTRKGKYIDFLDILLQTK DEDGVGLSMQEIRDEVETFLNEGHETTAI GSSWTIYMLAMNPECQKKCREELSDVIKDKEDIEW 63% SDLKKLQYLTMCVKETLRLRPPVPAISRLLDQPLTFKSKTHQVPETTLEKGSIAGVNIFALHRNVHVWEDPEV 83% FDPERFNPENTEQRSPHAFLPFSAGS 70% RNCIGRNFAMNEIKIAVGQVLRKFEFYIDDETPATKMATSMVLRAKDGIMLKLRAI 66% >sequence 155 complete 80% to seq 110 MLLFYLQYAVYATVCAFAVFLLPSVFT (?) SLKRFYTVRKNLNSQWPGPKGNWFW GSSAKVIAEVQKDSAYYFTYIKEMTKQFPQGFRGQLGPIFLSLNAYHPNMVKAVLCAPAS DAPKLNRAYKMLFEWLGHGLLVLEDKVWFRHRRLLTPSFHFEVLKPYVKTMNESAHVMVE NWLGKTSENKVVEIEIFHSASLMTLDTILKCLMSYQSNCQEEKSTNTYVAKIYQLSNTV VKRMRDLNLFKRIDPIYYLTK NGKLYKKLCDDVHKFSESVIDRRKCDIGQPAHNGTRKHFDFLDTLLKARDSDGKGLSDSE IRAEVDTFMFEGHDTTASGISWTFYCLAMHPEHQEKCFQEIEKVMADRTDIEW NDLSNLPHLTLCIKESLRQYPPVPIIFRKLNKDIEVDGKTIVK DTDVILHIYALHHHEEFWKDPHIFDPS RFTQENMKSMNSYAYVPFSAGPRNCIGQRFAMNEIKIAVAQVLRKFQLKPDLSKKIQHSV DVIYRATTGLYLKAERRV* >sequence 109, 124, 161 82% identical to 129, 156 MVAQSQLESILLQNGGGFIFQTLLADVLLFAALVFVTAKYVSPAVKHQFKMKKCGKEVGG PKGHWFYGDLLE (0) NGYDKTGLLNSGKRCEEFPAMCTQWLGPVQMNVQLHHPDVTKTVLSKS (1) HPKGYAYDKFLKPLLQDGLVTSKGDLWKRHRRLLTPIFHFDILKS (2) HVQIFNQNTTELLDWMEEASKSDVGGNLMIPCSNMAFNNVMECVMSQKCSAKEG (2) EISTYTGYVKSMSNMMFQRLSRPLLHSDTLFRVSKLGKRFSIAKKGIEDYASK (0) LIKERRAILSSNSPNSELTTTNIKDEVTGTSFNFTRRKGKMVDFLDILIQTK (0) DQDGVGLKNDEINDEVITFFSAGHETVSN (0) AISWGLYTLASNLDCQTRCREEIKSAVGDKETLEW (2) ADLSKLPYLTQCIKEAMRLFPTLHVISRQLSEDVTLSHRFNDYKPIIIPKGVCVAVNIFAMSRNPHIWEEPE (0) KFNPDRFTPENTSKRSPHAFIPFGAGPR (2) NCIGQNFGMQQIRVVLSKVLRKYEVYVDETLPKPEMSPGVTVQPKDGVFIKVRKLTTQ* >sequence 129, 156 this gene has 11 exons 36% to 4X1 (in the CYP4 clan) MVAQSQLESLLLQSGGGFIFQTLLADVLLFAALVFVTAKYVSPAVKHQVLLRKCAKEVGGPKGHWFYGDLLE (0) YGYDETGVWNSVKRCEEFPAMCSQWLGPAQMNVQLYHPDVTKTVLSKS (1) YPKSYAYNKFLKPLLRDGLVTSKGDLWKRHRRLLTPIFHFDILKS (2) HVQIFNQNSSRLLEWMEETSKSDVGRDVAVPDCDMAFSNVMECVMSKKSSDLTG (2) ETTRYTGYVQSVSAMMFQRLCQPILHSDSMFKVSGLGRRFAVAKKGIEDYANK (0) IIIERRAILSSNSPNSELTTTTVKDDVTGTSFNFNRRKGKMVDFLDILIQTK (0) DQDGVGLTNDEINDEVITFFSAGHETVSN (1) AITWSLYTLASNLDCQTRCREEIKSAVGDKETLEW (2) ADLSKLPYMTQCIKEAMRLFPTLHVVGRQLTEDVTLSHRFNDWKPVTLPKGSSIGVNIFAMNRNPHIWDEPE (0) KFDPERFSPDNTANRSPHAFIPFGAGPR (2) NCIGQNFGMQQIRVVLSKVLRKYEVYVDETLPKPVMVPGVAMQPKNGVYIKVRKLSKE* >scf/ciona01/G126/seq_dir/hrs/G126P60510F.T0/G126P60510FG11.T0.seq 722 exon 1 MVAQSQIELLLLQSGGGFVFQTLLADILLLVVLAVVSAKLVVPTVKHQIKLKKCADEVGGPKMHWFYGDLIK (0) YAYNEESVWNSIKRCDEFPAVCRQWLGPFQMNVQLYHPDVTKTILAKS (1?) >scf/ciona01/G126/seq_dir/hrs/G126P67089F.T0/G126P67089FA7.T0.seq 734 exon 8 AITWTLYTLASNLDCQTRCREEIQSALGEKSNVQW 309 >sequence 158, 146 almost = seq 130 MLLPDYLEFSLSLYSIGAVCVVVFLYKSRKLIGRFLKERRIRKELTKEIDGPQPHWLFGHLNL (0?) LQPNEETILKTCEQTQKYPKLGMVWFGPALAYLMVFHPHVVRPVLNVE (?) HPKDNLAYRFIKPWIGDGLLVSHGSKWRRNRHLLTKAFHFDILKQ (?) YTKVFDACCKTMLEKWSRKCDGSTVEVFRPVSLMTLDSMLQCAISCETECQQEG (?) NRNSNTYTDAVFTLTTIAMKRIFNPFCHVDWIFAMTKSGRQFKAAAKLIHEFDEK (?) VIRERRQENKKILGRSDDEDLTQEDLEKITSWRKSDGRYLDFLDMLILTR (?) DDDGKGLTEREIRDEVDTFLFEGHDTTASGIAWSLYCLAANEEYQEKCREEIEQVVGSKDALEW (?) QDLSKLPFVTMFIKEVLRLYPPVFAVARRTEQPVKFPRGFGRDQLNVGDEQ (?) LPKSDCSRVVXAPLNIGIPIMTLHRNEQVWEDAK (?) VFDPYRFSTENCAKRSPYAYLPFSAGP (?) RNCIGQNFAMAELKTVLVRTLLRYKLYVDNQCPDPKMSPMIVLKSLNGIHIKLKKI* >ci0100150610 seq 160 and seq 159 with N-term and C-term added complete MQSSGLSSIWDQLQINDWLPSSFLVETTLLILLIYLITHYGLLFRETYTKSRKISERVGVDEKNRHWFFGHLKH (0?) ITNNEEGLLKFVELAKAFPLMVSLWLGPALTTVQTYHPDSTTALLNSGAPKNDLGYRFIR TWIGDGLLTSKGKKWFRNRRLLTPAFHYKVLKP YTEVFNKNSLVMVNKLNKLCDQPVNIYTHVGLMTLDT MLQCAMTSKTNCQHVTDHPYLLAVKEISDLIIARVRQPLHFFDWM YNLSAAGKRNKEVTKILHEHTEKII SERKAVLAKTANIDPEPSDVEGFRKTEGKTLDFLDILLQCKDENGEGLSDIEIRDEVDTFLFAGHDTTSS GIAWAMFHLAKYPQFQTKCREELKEVLGNKDEIEWSDLPQLVYLTMFLKESMRLRPPVYAVGRYLEHPIT IKDKNTKRDTVIPAGTNVSMHIFTLHRHPDFWDEPSKFDPERFTKENIAKRPAFAYVPFSAGSRNCIGQN FAMNEMKISLAHVIRNLRLYIDDETPVPKMQPRLILQSSTGIFVKIEKLERQ* >SEQUENCE 190, 162 36% to 26A1 fugu ESTs rcicl022k09 rciad056p18 MVESSFLYPDGGGIRKAITNISSLEVPSNRTLINYLESEDFFYFLFFLIKYI GVPCFLMHVICCLWEIRCVRMKDHSCPLPLPRGNMGFPLVGEMFHLFVV (0) GANYQRQRRSKYGKIYRTHMFGSPSVAVIGEEHVKKIVLGE (1?) GTLVQTRWPSTTRRILGTDGIVNGDLATHCRIKRLALKAFSPKYIGTYAPVIQAGVAAHV (?) GCiWno735_j03.b1 QEWVDQGTVHILEKCKQAVSKTMMTV LLGIKSNDPDIQLYIQAADDIINSILSLPLDLPGFGFHK (?) GCiWno735_j03.g1 GLKARELIFNKLESQLKSKFNATDDELFGDSGGEFESVL LNILKFARREGETEIKLRNLQNLSLEFIFAGTQSLQCSCSLVTFHLSQNPQV = seq 162 NFECREEIKAAGLWDTPLTEINVERIQSLRFIELVAKETLRV TPTIPGAVRVALKTFELGGYQIPK GWQIIYSIRLTHEEAAGKVLQN GSKCEFEPGKHFTSTCKNTDVKKSGEDKSHECLKQQSPFAFLPFGKGPRMCAGKNYGMLFLKVLIFELVRTA DISLEGKCKIIEVPMTRTDKSVKIKFTSIKKLKPENSVNDASSSTAV* >SEQUENCE 164, 165, 171 53% to seq 169 SCSSTDSDASSPYLNTDGRKFFGQ extra MWFVMVASVEQYLSKLISCRVRNTSVAVIPTCVKT (2) RQGEPPLVNHTLPFIGAALDFGKDPLNYLRNLQSKFGDVFTIKLAGW (2) DTHVFMNPLDVRSLERAKFLGHQEIIGDFVTRMVGSPH (1) ILSPAADVQKKLTHSIEKNLRGSMQLNLLCSSCKPVVVNALHRTLGHTMNHHKP (0) IKLHEFSKKLIFDITFNLLFGSLPDYERASEETERTYDVFKTYFKGSAL LVDRIPIHLLPETKKARGIFLKMMEELDWSKRENVSKLIDDIVKVNPKL (1) EFLKQRARHLMVVLWAAQANTTPALFWTLFYLIANPDAKRAVLEEYEQL (1) RWPTMKDILQIERKDLDKLVIL (1) DGCLRETMRLTGASMSIRKATRDESIKTSDGKQYSVRRGDYTAFFAPVTHMDEDIFEKPQ (0) DFVHDRFLVSVDDTSS LKPRSRFTKNGQRVAVSRAIMTFGYGVTRCPGRHIALIEIKLAVLAMLGHF NVQFVDSNTKPPSFDLTHLGFGVMPPDTDIEVLIAPKCC* >SEQUENCE 169 53% to seq 164 had wrong exon 1, switched with seq 164 MWLLLVASVEQYLSKHTSCRVRNTSVAVTPTCVKA (2) RAGEPPLVSHWIPFIGSALDFGKAPLEYLQSLQKRLGDVFTIKLAGW (2) NVHVFMNPHDVPNLERAKNIDHKNVIGDFVSRMVGDEH MIHNPARHSDAVPKILESTLRSTMHLMTLCDSCTNVLPKTIKRTLKSEFNCKRETK (0) IGLYELCKKLVSDMTLNLFFGSLASDEKCNEETQQICEEFFVYFKSSALLVDPIPIHLLP ATKKSRGRLFELLSRFDFENRDNVSELIKKVAAV (1) GSSEEKVRHLLIILWSAQANTLPALFWTLFYILNDKTARIDVLEEFEKHLRKT (0) WPNLSDIQRMRKGDVDKLVLL (1?) DSCTKETMRMTGSSLAARRAHADTTVTLASGQEYKIRKGDYTSYFAPVTHQDADIYDKPE (0) EYRHKRFLQHDDKDPSMLKSRTQFSKNGKRISPSTSIMTFGYGAVRCPGRHIALMEIKFLTLVLLRHF DIKLVKNYSRQSEDYPQFDLTHLGFGVMPPSRDVDVTLSI* >SEQUENCE 166, 170, 172, 181 36% to 7B hum MILEALTTVLAALVSSYLIWLVAKRKQKDGEPPIVPFWIPWL (1) GSGLQYQKDPITFLRKQEKKYGPIFTVFM GGSYFTFVTDPFTHPIIAKESKVLNFRICTIPTIKKVFDFTRRGVDVPTMGGLMVKYMQ GHWLETLVSKMMGNLQKVLLHGSYNNTEWATTGLLHLVETVMCTAGFKSL FGEYPASSGVDNNKLHHKIVEDFVKFDKAFPLLVQGRPISSLK GVKESRANMWKFFTMDEIHKREMVYEMVEQYAKKLEEDQELEQLKKELYCVNWAAM SNTLLSAFWALYYILKHPEAMEAANVEIQRMLKETSQCVDGKSEINFTRDDMAKMTVL DSIINETLRLDISSSLLRQALQDYNLKVVSADRSYKIRKGDKLMTLFQLTHYDPEIYEQPDQFK FDRFLNSDGSPKTDFYKKGKVVKFFLQPFGSGV SKCPGRYWAVTELKQLVILLLLYFDFKLINPKQEIKMDQIRVAFGSLPPNIDPQVQFKRRVIV* >CYP7 like Ciona savignyi one intron shared with human CYP7A1 ortholog seq 166 MLVSIIYTVLLLLLSSYAIWFIAKRKQKRGEPPIVPFWIPWL (1) GSGLTYQKDPIAFLKKQTKKYGPIFTVFM (1) GGNYFTFVTDPFTHPLVAKEGKILNFRICTLPMIKHLFDYQRPAVEEPTMGGLFVKYLQ (0) GQRLNVMIEKMMKCLQRIMLAGKYTNTDWSEVGLLHFVESIMCTASFSTL (2) FGDYPKSSNKDIPAFEHQMFEDFRKFDEAFPYLVKGYPLNSFK (0) GLEESRKNFWKHLASDELNKREMVSDLIAEYARICDAEKDRTLPKKLFSTAWAAQ (0) SNTMVSAFWTIYFVTKHGDAKAAAEVEIQRLLKETSQTLDGKTPINFNRDDMDKMTVL (1) DSIINESLRLHVSASMLRQALQDYDLKVLSVGKTYRLRKGDRVMSLFELNHKDPEIYS NPEVFKYDRFLNEDGSLKTDFYKNGKVVRFALQPFGSGL (2) SKCPGRYWAVTELKQIVILLLLYFDFELINPNEEIKMNKNRVAFGSLPPTNDPKIRFPSTV* >SEQUENCE 167, 173, 174, 176 MLSVILAFFVLILATYFLWNSRKRQQRDGEPNIVPYSIPWL (?) GSSLPFYKDPVKFLKECEKKYGSVFTLFM (?) GGQYFTFVTDPFVQHIIVKEKKVLRFRNCLPPVLKHAFDYYPTAVDDEMMKLTHHYFQ (?) GKPLGLLVKQMNTNIQKWILDGNQGDAGDKWETVGLLSLVQRVMGSASFVSL (?) FGENPKASDTLDVQRQLLNFLTVFPSLLRGLPLSLIK (?) RSREYRQDLWNSLTKSKVTKRKEIFPIIDEI VTLHEQNGENKLLPKRLLPVLVASQ (?) VNTNPSSFWILYYLLKNSEALAAVKDEMEKVLQERSKNPGNNFVCFTKEEMETMMTL (?) ESAINESLRLSISGGMLRKAATEYRLEIPDYGQTYTIRENDMVLVWTQVNQMDPE IFEEPEVFKYDRFLNEDGSLKTNFYKDGKLVKYAFQPFGDGS (?) SKCPGRYWAVLELKQLVVNMLLYYDIQLIDSQEIKMDKGRVGLGALNPTKDALMSIRRKPLC* >Ciona savignyi ortholog to sequence 167 MLEIILSILVLLAVSFLIWNTRKRTAARGEPPIVPYSIPWL (?) GCALPFYKDPLQFLRDCQRKYGPFFTVFM (?) GGEYLTFVTDLSAQQTIAKEKTVLTFRKSLPPVLKQVFGYDSNSVDDEYVKVFPKHFQ (?) GKALENLVSKMNKCLQNFIINGKYYNEEWMTVDLRTFVESIICPASFVSM (?) FGEYSSSRSDKMTEVERNILKFTDLFPSLIRGYPISILK (?) GATACRQYLWKVLSKQEISGRKNVSGVVEETISLHEQYGESDVLPKRLLSFLTATQ (?) VNSVPSAIWLLYFLHKHPEANKVAREEIERVLKESKESTSKDVKIEVTNEDLKKMTVL (?) DSIINESFRLHLAPSMFRIATEDYFVEVAELGRTYKIRKNDKVLLWVQMNQMDPDIFE QPEVFKFDRFLNKDGSRKRDFYKNGRLVKCAFQPFGTGL (2?) SKCPGRYFAVNELKQLAITTLRFFDIELVNPQAEIKMDKNRIGLGALYPATDIKIRIRRKADC* >SEQUENCE 180 39A1 LIKE MKVPGTIGYPIIGDKSREFLSDPTKFITSRINEH SSRIFQCRILNNPTIFVCSPQAVKKLLSKEMANDIEHGYKSILHRLYGDNIVFENDVTAN ELHNCLRNCFQNMELYKETVNRLCE PLLSSLHLRTEPVYIYPEFKRVMTRMCLELFLGIKTESCLDQTEEITHLATQHWHGIISV PVSLKIPLAGESTYSLAVQAKEKL LSIIRSILLGQNGQGDASVAAKLRHADFSNSKSLEQHLLLFVSAVIPKAFASLITSFIIV MAGDDKEALRARARTDSKFLDRVLLEVERMWPPLFAGRRLVRRTTTLESYDLPKDYAIMY VTHAAHRDPEVFPEPNSFNPDRWITCNAGQEDLVLCFGSGPRCCVGTDLIRRILRDVASR LLQNYEWSLAPPNQDTSYKYLPVARPAVQPSIAFSSYLTNDEPFFDACG* >ci0100147340 seq 182 4 diffs blue = intron? MLSCLQKASRRRSFDYFKVICIVRLQHAQ (1) VQSKETAVKDFSEIPQPKSLPILGTFLDYTKFRGFNVANLSNLFKARFDELGPIYKETIIPGLSEPTVHCFSPQD NEKLFRLESRLANRDPIEFLKVVRERIG VPSGLVNIQGEEWYQ LRRIVNEHFLTNSSVWSYSKDIHEVSE DFVDYISRSLDSNNEVPNFALALNKWALEGAGVFSLDTRFGMFTGEVEEHLQEIVDNSNLMFQLMAELTV APPFWKYFETKQIRALKQCQAAQVAAIKKQRDILLKRIKEKDPEKLTKLEKLMTKKDLPQGEVDI LVSDMLGG GVDTTSNAAAFMLYIMAVNPDKQEILREEIKTALQSGKVDGKSLQRMKYLHACALEAQRMFPLTPGTSRR LPVDVVLSGYNVPAGTRLLLLSNVAHHKNPEYFDQPESFLPERWLERKHKSKFAMIGSFGMGARACPGRK FAMQEVHYLLISLLSRYKVEYHHKPIKPHFRLVTVPSEEPQFTFIPL* >ci0100131379 Correct order of fragments 57% to seq 182 new seq MIRKWQKRGFFTNFYQNAVTSNRRQHIE (1) AKTASNINAAKTKPFSDIPEPKGLFGTFLDYTALRGFKTEELYSHYQKRHKKYGPIFKENVLP (1) GLTSYDVFCSSAEANKAMFRLEDKYSNRGPLDFIQIARENAGVSSGLLDPSWEEWFK (0) FRKVANRYLLSNASVWSYSSALHEVCDDFVDYIADNLDENNE (0) FPNFALALNKWALEGAGVFCLDTRFG RNFSGQLDDNLKVIMDQVIESFTLMTDLFLKPPLYKYITTRKMK (0) ALTSVVKKQSEAIQKQRVKIMNQMKEKTPGSKTKLKTVLENSDLNEE (2) EIASLVSDMLGGGVDTTSNAAAFMLYIMAVNPDKQEILREEIKTALQSGKVDGR (0) SVQTMKYLHACVQETQRLFPFSFGVARMFEKDIEVCGYNVPA (1) GTRILLTGNVVNNKDAKYFKDPETFLPERWLERRNSESK (2) FATTGAFGRGVRACPGRKFAMQEVHYLLISLLSRYKVEYHHKPIKPHFRLLTVPSEKPQFTFIPLS* >SEQUENCE 184, 212 MNDVTITWWIVISALALLAVKMVDDNSKRKKRSQNFRSPPGPKGYPILGTFPSLIRKPA HIVFGEWSKKYGNVYQFTLGAGTSVVLGDTATVRLAMLGKSPDFDSRCQWRYCFN KMFRGLSVLPYSFKWVNTRRRTLTILRNIGMGRSLMEGKIHEETQCLLKCMLDSSQDGKV NFEPRDMLRRAPMNTIWSILFGKRYDYSNPYLLYVMNTVVWLVDFLAKASVFDVLPSSLR HLAGFSLETFHNKLETLHAFMTERVDDHAKSYESDVTRDFVDAWIKAKKEDKLKIQDSED SNDISSLPDVIQDLIVAGAHSTGIVFNWAVLLLVLNKEEQDL CYREIMDNVGPTREISMSDRSKLRRVDAVMHECMRIGTVLPLVAHATARDTTIHGYNIKAGTE VIYNNWFINRNEKVFPDAEKFNPDRWIKKDGSFRKELLQHFFPFGIGKRSCAGESLA RANLFIVVATLLQRVELLPPDETASPEMLPEVKHQFGLLAECKPFLMRVRCRPTFVPSQ* >SEQUENCE 186, 187 CYP20 ortholog 29% to fugu CYP20 CYS = HIS MLEVHVTITAVAVLAVILLLFLIGLLFYIYPSSN (?) KETTVPGPAVSDIR (2) DGNRSDIDRAGSLHQYLCNLHAEHGDLASFHFDKQLIISIADSKYFAPRSQVFQLP (1) DFLYEFWEPMIGKESFVFSQGVDVFKRRDNIDFKVDKKVKEFYAKIF (0) RLLAIELEEKWRKQPEDQHIPLTQHLKGLATKAVCQMMFGESFADDERVIKLYKAWDK (0) CWNEVMAASKMEPTKSESDEKK (1) SIKELRSLMEKAIEERWKKSKHLDGITLLDGLLASSNSSVSPLSSEQLLCDVITMFTMGVQVTNA (1) VLCWSVYFIAAHDDVQTKLRNKLNEVESLDIMPQSLNDVK (2) YMQSILKESIRCSGVFSYSGRVEDIDITVGDHLLPKG (0) TPIIEALMFPMQSDAFESPDM (2) FNPKNFSEATKSVNPLTYIPFAFGGSRHKPQDS (2) LVYLLVSITVASLFRNLRISLAGKEVDPQSANYESFVALPSTEEVWVTVGGKLDVETVGENN* >ciona savignyi sequence for CYP20 ortholog 72% to intestinalis seq 186 MLEVHVSITAVAILVLIILLFLIGLLVYIYPVI (2) KETTVPGPDVSDAK (2) DGNKSDIERAGSLHAYLCNLHADFKHEGAVSFHHGKQLVISISNPKFFAPRSQVFQLP (1) SFLYEYWRPLIGEESFLFAKDIDVFRRRDYLDFKVNRNVKEFYAKNF (0) LAVELETKWEKQPEDQHIPLTQHLKGLATKAVCQMMFGDNYADGTKVLELYRAWDK (0) CWNEVMAASKMEPTKSESVEKK (1) SISELRKLMENAIEERWNKSKSLDGITLLDGLLSSQTSDSKDQLTSEQLLCDVITMFTMGVQITTA (1) VLCWSCYFVASNKEVQRKLKNQLNESKSLDIMPETLMDME (2) YMQGILKESVRCSGMFSFAGRVEDIDITVQNHLIPKG (0) TPILEAIMTEMRSEAFSCPDV (2) FNPKNFTKDNNHLNPLTFIPFAFGGSRHKPEES (2) LVYLLAAVTVSSLFRNLQVSLAGKEIEPFAANFESFVAMPSTEEIWVTVSKEKEVENVG* >SEQUENCE 192, 210 CYP11 like sequence revised added exon 2 MGTFGIGWRLRSQWRVAAQNVKHQITQALSLG ETKTSVREVQSYSE IPGPRGLPFLGSVLEYTAIGKFSPKQFHKALRSRHG (2) RYGKIFRERLLQDEFVYVSDPKDIRVLVQNEGKAPHLEAFW (2?) GNDWLRSRTSINGGISCPRALKTYRAGQQVIAEDFVELIRNKRDATTGEVPQF EQWLNRWALESIAIAMLNTR LGLFDAGQSKKNRPAEKLRECMDIFFDYGARLTFGVPMWKIFNTREWKTFCQSQVDEFENAGYFVNK (2) KLKQLKEERGSQEASKQPPSCLLEHLISKNSE VSQMEATTICLELLAGGIDTTATAAVFTLFELSRNKEHQSKLRELANQETDEG KNNSQFSRYLKSCIWESMRLHPLTYANMRKTETDLVLSGYKVPAGTTVRYTSHLMNLKDE KYFPQADEFLPERWLDRKSENRCKEQFTFTPFGHGARQCPGRRIAEQELDLLFRQLIRNF NINYEHEDIKTKVRLFNAADRPARFSFTPISKPITEIQQETN* >Cs192like revised added exon 2 MLRTYAFGLEAIHRWGTWFQTKLGATEKFRETSTKVGKFTAIPGPMGLPLLGSVLEYTPLGRFSPRN FNKAIWSRHQ (2) RYGNIFRERILLDEFIYVSDPKDIRVLVQNEGRAPCRPHLEPVVE (1) SDVWLRSRTSVNGAIACPKALKQYRFNQQIIAKDF IELIRNKRDSKSGEVESFEQWLNRWALESIAVALLNNRYLV (2) LGLLNADPTNPNSPAEKLRLCMDIFFDYGARLTFGLPSWKLFNTRQWKIFCQSQVDEFENAGHFVTK (2) KLNQINNEQRLYKVPPSCLLEHLISNNNNLSQIEATTICLEMLAGGIDTTATAAVFTLYELARNI KHQETLRLLAKKEKCQHNSEFTKYLKACIWESMRLHPLTYANM RKTETDLILSGYHVPAGSVVRYTSHLINLKNEKYFPKATEFLPERWLDRKSEYR (2) CTEQFTFTPFGHGARQCPGRRVAEQELNLLLREIIVNFNVGY YHDDIKTKVRLFNAADRPARFSFQPI* >sequence 5, 12, 194 ci0100141208 complete possible fs at YES MEYCFWYFTIAVFSIYCLLSWQRRPKNFPPGPRGFPVLGIVPYFGKYPEKVLSKFADQYESRVMSFRMGR MDWIVLNDIETITQ (0) ALVKQGEDFAGRPKSPLFDMLTNGCGIVFVDGEKWKAQRKFGIMTLRS (2) LGMGRGSLDDIMNYEFEVLCHFLSSGGSQENVM (0) TLFRKAVTNVTCSVLMGKRYDYDDPVLNNITEHIR (2) PSKTVVMYSPILRFIPPFRWTYKKIVNSIEQVTG (1) VIRNISEMHKQDFDSSTKRDFVDHFIAEMNKEGEYFTVNYVIM (1) YLLVAAINTTTAAFGWCVVSLLRHPECQEKIWQEVDKVI (1) GNETPSCKHQENMPYMRSFIQEIHRHRSIATLNLPHRTVRSVRLCGYDIPKDTP (0) VLTNIWRVHNNQNVWKNPHEFRPDRHLDSNGNFVSSNNVIPFACGYRRCL GEQIAKAEIFLFIVNIVKRFHLVVDQVTGPPTLEKDPKGAVNTPAPFRFSVRARKN* >sequence 196, 32 ci0100152256 complete changed middle exon added N-term exon MFENVGYYFSW (2) SLLTSSHVISAFIGVFTLLMFVYYYWWKFPHPRYPPGVRGIPVLGATPFFGKYAQEKLAIWSRDTYGPV MSARFGQDEAVVLNDYESVHEALVKNLQYFNSRPFVYVAAQVTKGYGIAFADDHQNCMEVRSFSLSALRG LGIGRRTMETRVSEIAQDLVRTLEDLNEKPTDLKMAIGSTVANVICSVVFGKTYEADDQEYEYAVRCSFDC (2) YGDPENSEYLSFLFFYPALRHIQPFKRAMEKFVLVQAGLEG (1) DFCKEIEKHKKHLDENEPADYIDAFLVEMKKHSAQ DSWFHETELRHC LVDFFIAGTETSTNAILWSLLALIHYPNIQEEIHQELIDNIGEQVVPSIDHRDKLPLF RAFVQEIFRFKTLLPLSIQHRASHDVEIGGYVIPKGTKVFPNLHAVHHDPNTWENPSDFNIYRHVDKHGK FIPSKKVIPFGIGARSCLGEKLANVEVFLFLANIIKRFEILPDPESKELPDFKDGINGFIYVPYRYKVVT KPRIINE* >sequence 198, 61 complete seq stop upstream in ESTs MSSKEDVPPKSLPGPWGFPFVGSAFQLGLHKDKTLYEWTELYGPVYALNLGTRLAVALHGY SAIQEALVDKASSCSGRICAPIIDAMVNQQGLFFVDHDEHWKKWREIAHSGLSNSWKTTQ SGILTESERLKKYLSEADGKPINIEETLQITMANAVFQTCLSRRFDLDDTEAANFLVDLY ATNRDPKFGKILFLTFLRFLPSYKSYLIEGDTTRRKKVRKVLRGFIEERLEDDDQSRDLVS LFFKAGPTPSLENKTLVAGLIQDMLFAGSITTSSQLLWVILFVLKNPACMHKMQNELDAVL GDCENPTKSAISDKYDELNYTRAVVQEAMRLRCIAPTSMFHKATEDTTIAGYFVAKDTVI MPNLYSVHHDPITWAPDPESFRPERHLDSDGSFVNSDHVIPFSVGWRRCVGEKIARTEIF AHLVTIFKHFEVALHPDDVGVDMEGVPGPILRPVSYRTQFTSR* >sequence 202, 195, 24 ci0100132628 N-term longer complete MDVESVGCLWGLYSSWVTTCLMGLLTLVILVYYYWWKLPHPRYPPGVRGIPVMGATPFFGKYP QETLARWSRDKYGPIMSARFGQDEAVVLNDFETMHE (?) AFVKNLQYFNSRPTFQIVEQYTNGYGFGFADGHKKFLEVRTFSQSALRG (?) LGIGKRTMEARVSEVAQDLVRTLEDLDQKATDVK (0) MVIGTVVANVISSVAFGKTYDPGDAEFEHLVQCSFN (?) CYGDPDNSEHVNFLIFYPAMRHIQPFKRSLEKFYVVQNGIL (?) DFCRKEILEHKKNLDENEPGDYIDAFLVEMKKHSPQDSWFH (?) EKQLQHAVVDLFIAGTETSTNTILWALLALIHYPETQEKLYQELLENIG (?) EQVLPSIDQRDKLPLFRAFIQEIFRFKTLVPLSVTHRASDDVEIGGYVIPKGTK (0) VFPNIHAVHHDPNIWKNPSEFNIYRHIDKDGKFIPSKKVIPFGIGCRSCLG EKLARIEIFLFLANIIKRFKVLPDPKSQDLPEFKDGVNSIIYVPYRYKVVAKPRIVNE* >Ci204 204, 142, 127 MFVHEKMFLQPSSRLLWIQQLS (0) KIHKLN KSLKYMPEPKGLPFIGTLLDYTFLKKFDKTNLASHFIGRHQLLGPIYKEYVFPAFSK PQVFSTLPDDNEVLLRSEEKISHREPVEFVVSARKLLGWSIGLPFDVGEDWYK LRKVVNQHFLKNSIVWSHSKQQHEVAEEFVDYIGQNLDENNE VPHFQDLLQKWALESTAVFCFGVRL GVFDKSVDDDLNIIIDTNRKKFELIMKGIFSAPLWKFYNTKLMK (0) KFNKVQIEQLHAIRRQQIKAFTEFPVDQKLIESLQFYICS SEIEVLIIDLLSGGVDTTSNSAVFVLYLLSMNPDKQEVLRKEILEALKTGKTDGK DLMRMSYLHACVLESQRLLPFSIGTNRRYSKDIV 203 LSGYKIPAGTVVTMTSNIINNCSEEYFDDPNSFKPERWIQVDNKAKDRMRFAMAGSFG 377 HGTRACPGRKFALQEMKYLLISVLSRYKVEYHYKPMKINFLLVSTPSQPPKFTFKQL* 551 >Cs204like MGNHQITSRPPSSYRSYCS (0?) MTDFTSL KCLKEMPEPKGHPIFGTLLDYTSYGNFNFKNLAEHFRKRHQNLGSIYKEYIFPSKFL PYVQCTSAEDNRVLFQAEERIAYREAVESMVVSRRRMGLSIGLPFSQGEEWYR (0) MRQVVNRHFLKNSTVWSHAQEQYKVAEDLVKYINENLDHNKE VLHLNDALQKWAFESTGVFCFGMRL GIFDTDADEDLDIILETNRQKFLLIAETMFSPPLWKLFNTRLMK (0) KLISVQFNQLAAIQRQREKVMNNFSPSPELARVTRLFKNSKLTK (2) SEVDMLIMDLLAGGIDTTSNAAIFMLYTLAINPDKQELLRKEILEVLESGKTDGK (0) DLMKMTYLHACILETQRMFPFSIGTNRRYSKDVVLSGYRIPA (1) GTTVILTSNLVNGRNPSMFPNPDTFLPERWLNNQDNEAKDKMR RFAMAGSFGHGTRACPGRKFALQEIKYLLISLLSRYKVEYHYAPMTINFLLVSTPSEKPRFTFNPI* >ci0100130435 seq 48, 56, 230 seq 205 ADDED 10AA AT JOINT complete MIPSYLIFFEFPIVATVLVLVCLFYWYRRPKNSPPGPRGIPLLGVLPLLGRYPERTLHEWSKKYG PVMSVRMGRQDWLVLGDHGTIH QALVKQSHTFSGRPHVPTLHRITQGRGLAFVDHGPAWKAQRKFGAVTL RGFGVGKKGMEDKIVEEVHYLNDAIRQHEGKPFNIL AILSNAVSNNICSVILGRRFDYNDKDFQDIVHKM TRDFTDPKSTALTNASVFAPVLDNFPPFAQNTNQRVNDAMVVREILLKYVKEHKATFDREDARDFIDAFLKEAELSKEPS FNDTQLLHYLHDLFLGGTETTTSTLRWALLCLLHYPQTQTKLREEINEVIGRHLIVKVSYSCKVDMPYTC AFIQELYRYRTLLPLSLTHKTNEKAMLGEYTIPKSTIVSVNLWAVHNDPNTWEEPSMFKPERHLDESGHF VQSKHVIPFSVGPRHCVGEQLARMEVFIYLVSMVQKFEFLPDPNANELPDIEKGSNGPAFVPQMFDIVAK VI* >SEQUENCE 208 QKMSWTHGPIMGVKLLRKFTVVLSS VEAMQNALVKQGHAYAGRPDSIMFQAFTGGR (?) GVLAIDGPRWKENRRFGTVTLRKLGAGRA IMEERIVDEAMYFCEALQIIQKKSFNITKLITHSMANVICGLAFGERFNYDDKRFISTAR ALVDWFNDNSNVFMLLVFAPFLRHFWPFTNAVKSLKSAREKLYTMLIPIIDRHRETVDDY DYRDFIEAYLQKISQTRDDPTSFYSDKELLGNVRDLFVGGTDTTVNTLRWTIIYLINR PEVQVQKKKEIDRVIGPDARPKMSHLDDMHYTN (?) AFMHEMLRIRTPGPLGLAHRTTEPTKLMGYDIPEV FITNFWAVHHDPKLWDQPEEFIPERFLDKDGKFVKNANLVPFSM (1) GSRYCMGETVARMEFFLFVVTMLQRFHLLLPTDDSEKITEECLR (2) SGQFLIPPSHNMRVKERFTREQSIFEYNKQMWE* I* L {   M . l & ^ _ g x  H \P 5XK4xVr<deV* Kah~Ih~IOJQJ^J`I* L {   M . l & ^ _ g x  H gd~IH \P 5XK4xgd~IVr<deV* Ka/M  gd~I/M  Mr#^n Qfg1W%jBV 7l  ` !F!r!!!!"O"~"""""#I#p#####*$f$$$$$'%]%%%%%&D&r&&&&h~Ih~IOJQJ^J` Mr#^n Qfg1W%jBVgd~I 7l  ` !F!r!!!"O"~""""#I#p###gd~I##*$f$$$$'%]%%%%&D&r&&&'L'}''''<(r((((")O)gd~I&'L'}'''''<(r(((((")O)|))))"*Z*[**** +D+k+++++&,b,,,,,-;-o-----'.S.....-/./T/f/////80000-1n1o11111282_222223T3z33333!4O444445(5]5555h~Ih~IOJQJ^J`O)|)))"*Z*[*** +D+k++++&,b,,,,-;-o----'.S...gd~I..-/./T/f////8000-1n1o1111282_22223T3z3333gd~I3!4O44445(5]55566<666747W777'8(8~8889;9f99gd~I566<6666747W7777'8(8~88889;9f99999.:u:::::&;X;;;;; <@<{<<<<<'=R=z====>E>>>>>?*?+?W?????@;@h@@@@A;AqArAAAA BBBhBBBBC;CrCsCCCC DBDgDDDDh~Ih~IOJQJ^J`999.:u::::&;X;;;; <@<{<<<<'=R=z===>E>>>>?gd~I?*?+?W????@;@h@@@A;AqArAAA BBBhBBBC;CrCsCCC Dgd~I DBDgDDDD E0EYEEEEFJFyFzFFFF3GZGGGGHQHHHHHgd~IDD E0EYEEEEEFJFyFzFFFFF3GZGGGGGHQHHHHHH5IHIIIIII+J`JJJJJKK%K[KKKKK L6LeLLLLL MDMEMtMMMMN6N\NNNNNO1OfOOOOOPhQ۝BSמO{h~Ih~IOJQJ^J`˕JlmMpɗ8w LԙMǚ!gd~I!"Jjěڛ >hQ۝BSמO{&gd~I&bנ Nɡ12}ĢڢQlܣ:mäĤ ,@_S2f}ȧ7fۨTЩG/gӫ gݬ.Ph~Ih~IOJQJ^J`&bנ Nɡ12}ĢڢQlܣ:mäĤ ,gd~I,@_S2f}ȧ7fۨTgd~IЩG/gӫ gݬ.PQ_pӭgd~IPQ_pӭVT'Moİ BCrϱ4}~Ų?Yͳ $iU1Xж+,}ķ LTr89ӹ5mкh~Ih~IOJQJ^J`VT'Moİ BCrϱ4}~Ųgd~IŲ?Yͳ $iU1Xж+,}ķ Lgd~ILTr89ӹ5mкe»"_ؼ+bgd~Ie»"_ؼ+b0m~"\-N%[<_12X RE}Gd9opLh~Ih~IOJQJ^J`b0m~"\-N%[<_gd~I12X RE}Gdgd~I9opL]23y=o7Igd~I]23y=o7I~JqLgEt.gFG(_ Fr^=u<^=}h~Ih~IOJQJ^J`I~JqLgEt.gFGgd~I(_ Fr^=u<^gd~I=}&'?bL'I%p/0pgd~I&'?bL'I%p/0pB]-Z&cH%8uIw!"jVr,R%Th~Ih~IOJQJ^J`B]-Z&cH%8ugd~IIw!"jVr,R%Tgd~I>{IY&hU 3Z$^_J0w;x1nT}1_@AVqy._&efqh~Ih~IOJQJ^J`>{IY&hU 3Z$gd~I$^_J0w;x1nT}gd~I1_@AVqy._&efq1Kgd~Iq1K|DV6}3p3dh~Ih~IOJQJ^J*K|DV6}3p3dgd~Igd~I21h:ppH./ =!"#$% x666666666vvvvvvvvv666666>6666666666666666666666666666666666666666666666666hH66666666666666666666666666666666666666666666666666666666666666666p62&6FVfv2(&6FVfv&6FVfv&6FVfv&6FVfv&6FVfv&6FVfv8XV~ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@ 0@66666 OJPJQJ_HmH nH sH tH @`@ NormalCJ_HaJmH sH tH DA D Default Paragraph FontRiR 0 Table Normal4 l4a (k ( 0No List DZ@D _&0 Plain TextCJOJQJ^JaJJ/J _&0Plain Text CharCJOJQJ^JaJPK![Content_Types].xmlN0EH-J@%ǎǢ|ș$زULTB l,3;rØJB+$G]7O٭Vj\{cp/IDg6wZ0s=Dĵw %;r,qlEآyDQ"Q,=c8B,!gxMD&铁M./SAe^QשF½|SˌDإbj|E7C<bʼNpr8fnߧFrI.{1fVԅ$21(t}kJV1/ ÚQL×07#]fVIhcMZ6/Hߏ bW`Gv Ts'BCt!LQ#JxݴyJ] C:= ċ(tRQ;^e1/-/A_Y)^6(p[_&N}njzb\->;nVb*.7p]M|MMM# ud9c47=iV7̪~㦓ødfÕ 5j z'^9J{rJЃ3Ax| FU9…i3Q/B)LʾRPx)04N O'> agYeHj*kblC=hPW!alfpX OAXl:XVZbr Zy4Sw3?WӊhPxzSq]y ~&5DRa1rPqH  #O).39? DHMRhXL^[cQisoty~˕!&,ŲLbI$K"!J!O!w!|!!==>>IIII:WBW[[[[]]]^R`Z`dd1h4hĐ̐2:R5s5>D$doxLU33333333pH.~I1'G{@@UnknownG.[x Times New Roman5^Symbol3. *Cx Arial?= .Cx Courier New7.*{$ Calibri9=  @ ConsolasC.,*{$ Calibri LightA$BCambria Math"1h''$ }$ }%X20<<@@P $P1'2!xxD Microsoft Office UserMicrosoft Office User Oh+'0 ( H T `lt|'Microsoft Office User Normal.dotmMicrosoft Office User2Microsoft Office Word@@rƽP@rƽP$  ՜.+,0 hp|  '}<  Title  !"#$%&'()*+,-./0123456789:;<=>?@ABCDEFGHIJKLMNOPQRSTUVWXYZ[\]^_`abcdefghijklmnopqrstuvwxyz{|}~Root Entry F+ӔP1TableWordDocument4~SummaryInformation(DocumentSummaryInformation8CompObjr  F Microsoft Word 97-2003 Document MSWordDocWord.Document.89q