>Cs4V5a
MAVALFCIVVLLAVIAVLNRREIARLVKNRILLSSIPTMRERAYPIIGHSYLLGGDPAKFFEFI
STGAKAMVTELKEQIAVIWLGPVPVIVVCHPAAAEVILRSSKHMEKSYLYEFLKPWLGTG
LLTSGGEKWKQRRRLITPSFHFNILQDFLEVMNEQSATMIRTLEGKRTAGSAINVGKAIT
MCALDIICETAMGQTVNAQENDDSDYVKSLYR (2)
ISELIQLRQKTPTLWWDAAFSRMKVGKEQESLLRTLHGFTRDVIIERAKTKDSKKCRGDAF (0)
NPKRLAFLDVLLHAETEDGKTLSLNDIQEEVDT
FMFEGHDTTAAAMTWAIYALGRHPEIQQKVHEELDSVFEDDR (2)
FGAITNAQLQKLSYLERVIKESLR
MFPSVPMFARVLSEDCKLGGYSVPKGTQAIIFGYTIHHHPGIWEDEERFEPDRFLAENCV
GRHPYAYIPFSAGPRNCIGQKFAMMEEKVVLCHLLRRYSVISHDKEEDLLINADLILRSSTPLNITLKPR*

>intestinalis seq 134, 141 ortholog to Cs4v5a 75% 
MAVALLLIIVVTCVIALHYRKGIARHFKNRLMLSRLPSLPDPAYPLIGHAYLLRGDPS
RFFEFVTETTNAMVTEVNEKIGILWLGPVPLMVVCHPEAAEVVLRSSKHMEKSYLYKFLHP
WLGTGLLTSGGEKWKQRRRLITPSFHFNILQDFLEVMNEESGKLINN
LKKKLKSGVKVDVGKAVTMCALDIICETAMGQTVNAQENDDSSYVRALYR (2?)
ISELIQLRQKTPTLWWDPAFSRMKLGKEHENLLSILHGFTRDVITKRAKTRDQKIVE (0)
NPRRMAFLDVLLHAETEDGKTLSLNDIQEEVDTFMFEGHDTTAAAMTWAIYVIGRHPDIQ 
KKIHEELDAVFGEDRGGTITNNQLQKLSYLERVIKECLRLYPSVPFYARVLSEDCKVGDY 
MVPKGTQTVIFAHTIHHHPYVWEDPEKFDPDRFLAENCVKRHPYAYIPFSAGPRNCIGQK 
FALMEEKVILSKLLHNYFVVSHDKKEDLVINGDLILRSSTPLNITLEARK* 730

>Cs4V5b
MAVTLLIILGAFVVPF
ALYWRKILKEISNRHRINKLPSVNSPAYPLVGHSYMFPRNPDEMYDLITSWLKWVMREAK
QKLAVVWLGPVPLVAIVHPDAAEVVLRSSKHMEKSFVYRFLHPWLGTGLLTSGGEKWKQR
RRLITPSFHFNILLDFLEVMNEQSTKM
LRNLDAAVQSGSKVNVGRAITMCALDIICETAMGRTVNAQSDQESEYVKALYR (2)
ISDIIQQRQKSPALWWDLAFSKMAMGREHDATIDVLHRFTMNVISDRAQSKTEIKVGWIF (0)
DTRRMAFLDVLLRAESEDGRSLSLSDIQEEVDTFMFEGHDTTAAAMTWTIYLIGRHPAIQARIHEELDD 553
552 VFGGDMGTITNSHLQKLSLLERTIKESLRMFPSVPFIGRVTTEECSVGSHSIPAGTQVA 376
375 IFIDSLHHNPSVWPDVDRFDPDRFLPENCVGRHPYSFIPFSAGPRNCIGQKFALMEEKVL 196
195 LTQILRKYSIHSHDEEEDLRKQADLILRSSTPLNISLTPRA* 70

>seq 115 ortholog to Cs4v5b N-term 69%
MALTLFILIGALAVTIAVYWKKIKEQFR
KRQLINKLPTIMETAYPLVGHSYLFPKSAEELFVFITGRLNWVMTEAVEKIAVLWLGPIPL 
IGVVHPEAAEVILRSSKHIEKSFVYTFVHPWLGTGLLTSGGEKWKQRRRLITPSFHFNIL 
QEFLEVMNEQSKKMVDKLDASIASGSKIYVGKAITMCALDIICETAMGQTVNAQDHQDSE 
YVKALYRISDLVQFRQRTPALWWDAVFSRSKLGIEHDTILCTLHGFTRNVITERAQGKGK
KEIENPRRLAFLDVLLNAETEDGKSLSLNDIQEEVDTFMFEGHDTTAAAMTWTVYLIGRYPD
IQEKLHEEIDSVFHDDKEGVISNSQLQKLSYLERVIKESLRL
YPSVPFIGRVTTEECVIADHVIPVGTQVALFIES
MHRNPAVWPDAEKFDPDRFTAENCVGRHPYAYIPFSAGPRNCVGQKFAMMEEKVILAQIL
RRFSLVSHDKEEDLKKQADLILRSSKPLNITLTPRV*

>best matches to CYP11A1 G126P69032RA7.T0.seq    722 (07) 44% to mouse 2j6
112 ITDLFMAGTDTMATTVHWSLVFLAQNPEIQNKMAAEIIKVTGNDVINVSMMESMPYTRA 288
289 VMYESTRMRPVFPLSLGHQATSDVTVKDNVIPKGTLVVANLWAIQNDPKWWKNPESFRPE 468
469 RHITENGGFTKNEKIVPFSIGPRFCLGSQMATYHQFIFLANLVRTFKFRFHENQPEPDLT 648
649 GVMTSVLLP 675

>best matches to CYP11A1 G126P605845F.T0/G126P605845FG6.T0.seq    701 (27)
149 ITDLFMAGTDTMATTVHWSLVFLAQNPEIQNKMAAEIIKVTGNDVINVSMMESMPYTRA 325
326 VMYESTRMRPVFPLSLGHQATSDVTVKDNVIPKGTLVVANLWAIQNDPKWWKNPESFRPE 505
506 RHITENGGFTKNEKIVPFSIGPRFCLGSQMATYHQFIFLANLVRTFK 646

>best matches to 27A1 G126P601882R.T0/G126P601882RC8.T0.seq 734 75% to seq 115 49% to 4V2 human
NFLPPQDTRRMAFLDVLLRAESEDGRS
696 LSLSDIQEEVDTFMFEGHDTTAAAMTWTIYLIGRHPAIQARIHEELDDVFGGDMGTITN 520
519 SHLQKLSLLERTIKESLRMFPSVPFIGRVTTEECSVGSHSIPAGTQVAIFIDSLHHNPSV 340
339 WPDVDRFDPDRFLPENCVGRHPYSFIPFSAGPRNCIGQKFALMEEKVLLTQILRKY 172
171 SIHSHDEEEDLRKQADLILRSSTPLNISLTPR 76

>SEQUENCE 34, 52, 71, 217 74% TO SEQ 29, 60% to seq 65, 39% TO 2C9
MLDILLSCIRAWFSTITLSIIVFYVTWHYWNKRTPGSPPGPRGFPFIGAITSIRKHPEHVMTKWNKEYGPVCMVRLGFKDVLVIGS
YEAAHEAYVKSQDFLDRPSPFGLEILGGGYGLFPIAYGSFHQEQRRFGLNTLREHGMGRRVLESTILQYAEELCDRLETKMSAPVL
LDQEVYIAVSSTIAHIVFGHNTMQDSPEFRDMILWMLRPNTATVLAGILVFAPYLKHLPFFRGVHNDSRALRLKLESLNLKEVKKH
DKTRDPSDPRDFIDSFLNEMDK (1) 
HFKGKIDPNWETYSSFSDQQLVSM (0)
VRDLFLAGTDTTSATTCWIILFLCKYPDVQRRMQKEIDDVIGENGIPKLALAERLPFTRAVIQ (0)
EMGRIRPNVPLAVPHCASRDSTLMGFNIPKDTIIMTNIWGIHHDEKTWKDPYKFNPDRHLDAEGNFVKSNHVMQFNIGLRSCLGQQ
LARMELFLITVTLFRKFFIELEPGCDIDMEGESLVSLRPYPFKVLLTKRI*

>student assembly of seq 34 ortholog missing C-term (2 different genes) 68% to seq 65
MMEEVSIVQTIRGTFDTILLLAIVFYLTWHIWNRRSPLSPPGPRGIPLLGAITQLGKHPEHA
MMKWNQQYGPICMVRLGHKDV
LVLGSYEAAHEALVKNTHIADRPTHNIEVFYGGKGILMINSGSFHQEQRRFGLNTLREY
GMGRRALEPTILIYVNDLCDRIEKYGSDPFVIDGEVYLTISSTISHIVFGHDVIHDNPKFKEIIFKMIEPN
KLNVLAGILAFAPFLKHFPFFSTVHKSSKEFRLKLQSLIGEEVQEHKKTRDPTQSRDFIDSFLQAMEK (1?)
VQDGKLDPNWEPYSSFTIDQLTAM (0?)
VRDLFMAATDTTSSTICWIILFLCKNTDVQIKMQEEIDEVLGQNGILKYELATKMHYVRAVIQ (0)
LFKEIARLRPSVPLSLPHASNRDTSIMGYRIPKDTIVLTNIWGIHHDEKIWKNPYEFNPE
RHLDANGNFVKSKKVMQFNIGLRSCLGQQLANMELFLVTVSLFHRFSFSVVNPKIDMGGE
SMVSLRPYPYSVVATTRG*

>SEQUENCE 65, 78, 38  join with 68 check end
MEISVQSISSIFDTILLVLIVSYLTWHFWNQRSAWA
PPGPRGLPFIGPITSIRIHPEHAMMKWNQQYGPVCMVRFGFKDILLLGSYEAAHEAL 
VKNMDLADRPSNGIAVFKGGKGILMTKFGSYHQEQRRFSLNKLREYGMGRRALEPTI 
LLYSNELCERIEKFGSKPFYIDMEIYKAISSTICHIVFGHNVIEENEDFKEIINTLN 
KKSKLNVLSGILAFAPFLRFLPVFSTIHAKSVNFQQTLHALVRQEISEHEKTRDPKE 
PRDYIDSFLNEMDK 
DFRGQVDPNWKQYSSFTYEQLVAM (0) 
YEQLVAMCRDLFMAGTDTTSSTTSWIILFLCRYPEVQRKMQEEADQVLGSNGEPKMALAEKMPYTR (2?)
FFFQEINRIRPNVPLSVPHYSSKDTMV 
MGYKIPKDTIVLTNIWGIHHDEKLWKNPYDFNPERHLDSNGKFIK 
SSKVIQFNIGLRSCLGQQLANMELFLVTVSIFRQFSFSFNPGDKIDM 
EGESVIALRPYSYRVIAEKR

>sequence 29 9 accessions 33% to 1A1 in a cluster with 65
MEEYGIINSIKVAVDTVILVGLAIYLTWHFWNKRTPGGAPGPRGLPFIGAITSIRKHPEHAMMKWNQQYG
PVCMVRLGFKDILLLGSYEAAHEALVKSPDFANRPPAHSIEVIAGGKGIVMIPFGPFHQEQRRFGLNALREH
GMGRRALEPTVHLCAQELCERIEEYGSKPFNLDNEVY
QSVSSIISRLVFGHDVVQENLEFRKMIFQMIEPNKLNILAGIL
AFAPYLKHLPFFSEIHRKNKEFRLKMESSIRKEIEEHMETRDRKEPRDYIDSFLNEMEK (?)
DEPNTKNEEWKQYSSFTQNQLIAM (0)
CRDLFMAGTDTTSSTTSWIILFLCRYPEVQRKMQEEADQVLGSNGEPKMALAEKMPYTRAVIQ (0) 
EIARMRPTLPLSVPHCTTQDTMVMGYKIPKDTIVLTNIWGIHHDEKLWKNPYDFNPERHLDSNGNFV 
KSSNIIQFNIGLRSCLGQQLAKMELFLLTTSLCRHFSFSVVGEVDMEGESMVTLRPCSMEVVATKRA*

>sequence 2,3,9,45 COMPLETE 81 accessions 94% to sequence 1 40% to 2U1
MVLQLLSDINVSSLVIFFTAFLALYYWYTRPKNFPPGPRGVPFLGVIPFLGNYPERVMRKWSKKYGPVMSVRMG
REDWVVLGDYETIQQ (0)
SLVKQGQCFSGRPDVPVLNQITNGHGLITVDYNEDWKTQRRFGITTLRG (2)
FGVGKRSMEDRIVEEVAYLNDAIRSHNEKPFDIL (0)
SILSNAVSNNICSVVMGRRFDYDDKRFMEIMARLSRS (2)
FNDPTANFALNVVMFMPILVKIPPFSRINNQLMTDVRVI (1)
LQMLREILSEHKSTFNKDDVRDFIDAFIAEQNSESKHSSYT (0)
DLQLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVI (1)
GQDRVPAMNDKAQMPYTCAFMQEVFRYRTLVPLSVVHMTNQDVVLNGYTIPKGTT (0)
ISPNLWAVHNNPDVWDEPSKFKPERHLDDKGNFVQSKHVIPFSIGPRHCLGEQLARMEYFIYLVSMV
QKFEFFPDPNEPDLPDVEDGSSGVVFVPLRFKQIAKIV*

>Cs2a CIONA SAVIGNYI SEQUENCE 71% TO CIONA INTESTINALIS seq 2 70% TO SEQ 4
MLQRMLNEINVSTSFIFLTVFLGLYYWYRRPKNFPPGPRGIPFLGVLPFLGNYPERKMRK
WSNKYGPVMSVRMGRQDWVVLGDHETIQQ (0)
GLVKHGNAFSGRPSIATIDQITEGHGVLFIDYNDHWKRQRRFGLSTLRG (2)
FGVGKRSMEDRITEEVAYLNDAIRTHDGKAFDIQ (0)
SILSNAVSNNICSIVMGRRFDYDDERFKEIMGRLARG (2)
FNDPEASFVLQILIFMPALINFPYFSRINGELMEDVRVI (1)
PELLREIVEEHKASYEQDNHRDFIDAFLGEQKAENGAKTFT (0)
DTQLLQYVRDLFVAGTETTTSTVRWSLLCLIHNPETQEKLRKEIFEVL (1)
GPEKIPAFENKSKMPYTSAFIQEIYRYRTLVPLSVTHMTNEDAHISGYTIPKGTT (0)
IAPNLWAVHNDPEEWDEPNKFKPERHLDAGGNFVQLKHVIPFSVGPRHCLGEQLARMEVF 
IFLVSLVQKFEFLPDPKATVLPDIENGASGAAYVPLPFKIVAKVV*

>Cs2b Second savignyi whole gene ortholog to seq 2/4
MLQRVFNEINVSTGFVVLTVFLALYYWYRRPKNYPPGPRGIPFLGVLPFLGNYAERTMHK
WSKKYGPVMSVRMGRQDWVFLGDHETIRQ (0)
ALVNQGNSFSGRPLIATLDQITEGHGIVLLDYGDWWKRQRSFGFSTLRG (2)
FGVGKRSMEDRITEEVAYLNDAIRSHDGKAFDIK (0)
SILSNAVSNNISSIVMGRRFEYDDEHSKEIMARLAMG (2)
FNDPDTSFFLQILIFMPACVHLPYFSRVNKKLMEDVHVTR (1)
SELLREIIAEHKASYDQDNHRDFIDAFLGEHNAENGTDAFT (0)
DKQLLHYVRDLFFAGSETSTSTLRWTLLCLIHHPEKQEKLRKEIFEVL (1)
GQEKIPAVDNKAYMPYTCAFMQEVYRYRTLAPFGVAHMTNEDVNLNGYSIPNGTT (0)
IYVNLWAVHNNPDVWDEPNKFKPERHLDDKGNFVQSNHVIPFSVGPRHCLGEQLARMEIFIYL
VSLVQKFEFLPDPDATELPDIENGSYGPICAPKPFKMVAREV*

>related savignyi 2/4 like exon sequences (probably two more closely related genes)
TLVKQGSIFSGRPSIPILEEMTKGHGILLLDYGEKWKSQRKFGLMTLRG (2)
GLVKQGTAFSGRPSIDIIERVSKGNGMIFLNYDEHWKKQRRFGLSTLRG (2) 
FGVGKRSMEDRITEEVAYLNDAIRTHDGKAFNIQ (0) = RB10
SILGNAISNNICSIVMGRRFDYDDERFKEIITRLARR (2)
FNDPEVSLVRQILIFMPALVNAPYFSRINAELMENVRVI (1)
FIDPEVSSALQIFVFMPALVNFPYFSRIYGELMKDVRVI (1)
PESLREMVEEHKASYEQDNHRDFIDAFLGEQKAENGAKTFT (0)
SELLREIVADHKALYDQDNHRDFIDAFLGEQKSENGSETFT (0)
DMQLLHYVRDLFVAGTETTTSTLRWSLLCLIHDPEIQDKLRKEIFEVL (1)
GQEKIPAFEDKSKMPYTRAFIQEIYRHRTLLPLSVTHMTNEDANICGYTIQKGTT (0)
ISSNLWAVHNDPDVWNEPSKFKPERHLDDKGNFVQSSHVIPFSVGPRHCLGEQLARME
VFIYLVSLVQKFEFLPDPDATELPDIKIGSNGPAYVPLPFNMVARVV*
IAPNLWAVHNDPVVWDEPNKFKPERHLDADGNFVQLKHVIPFSVGPRHCLGEQLARMEVF
IFLVSLVQKFEFLPDPKAKVLSDNENGASGAAYVPLPFKIVAKVV

>sequence 4, 8, 92  (48 accessions)  83% to sequence 2 faces away from seq 220 on 
MLQHLLGDINVSSLVIFFTAFLALYYWYTRPKNFPPGPRGVPFLGVIPFLGNY
PERVMRKWSKKYGPVMSVRMGREDWVILNDYENIQK (0)
ALVKQGHSFSGRPDIPVFTQINNGMGLLTVDYNDHWKMQRRFGITTLRG (2)
FGVGKRSMEDRIVEEVAYLNDAIRSHNDKPFDIL (0)
GVLSNAVSNNICSIVMGRRFDYDDERYKEIMFRLSRS (2)
TKDPEANFYIsVLMFVPFLVNFPPFSRINKEMMDDVKVIL (1)
DHLDEIVSEHnSTFNKDDARDFIDAFIAEKNSQNKHSSFT (0)
DpQLLHYVVDLFEAGTETTTSTLMWSILCMIHNPEKQEKLRKEICSVV (1)
GQDRVPAMNDKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTT (0) 
VSPNLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVVAFSVGPRHCLG
EQLARMEYFIYLVSMVQKFEFLPDPNEPNLPDIEKGSNGAAFVPLPFKQIARVV*

>SEQUENCE 27, 220 SIMILAR TO SEQ 2 and seq 8 BUT NOT SAME SEQ Faces away from seq 
MLTNLIANPSISPVLAILSCVIAFYYWYKRPKNMPPGPRGIPFLGIIPFVGMN
PEQAFMQWSKKYGPVITVRMGRKDWVVLCDHDTIHQ (0)
VLVKQSTVCSGRPKIPIVSELSKGHGILFADYCEKWKSQRKFGMKTLRE (2) 
FGVGKKCTEDRVLEEVDFLCNEIRSKNGKPFDIQ (0)
DIMCNAVSNIIMNIVIGRRCNYDENFFTDVISRFTKW (2) 
FNDPTAGAMFTGMMFLPQLKYVPPFSKYYKIFRDDIGALH (1) 
EFFEEVIKEHEKNFDGNNLRDFIDAFLLEMKKNESGSEFT (0)
YIQLLNYIRDLFDGGTETTVSTSRWAILCMLHYPETQKKLRNEIMTII (1)
GPNTPASMSHKSDMPYTCAFIQEVLRFRTLVPLSVPHKVNEDATVNGYTIPKGVT (0)
VSPNLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSNHVIPFSVGPRHCLGEQLARMEIFIFL
ISMVQKFEFLPDPNEPDLPEINHGTNGAAFVPLPYKIVANQI*

>Sequence 209, 199 = A different C-term from seq 27 and seq 10
MIEAFLRQWTDTTTVIIFLFTLLFYYWYRRPLRFPPGPRGLPLV
GVLPFLRKYTARTMHKWSNKYGPVMSVRMGNEDWVVMGNYEAVQH (0?) 
SFVKAGNVFSGRPVLKVANEIAEGKGIVMRDFNTTWKTHRKFGSITLRG (2?)
FGVGKKSLEERIYEEVEVMNKEILSKEGKPFDIT (0?)
EILTNAVMNVISIITTDQRFEYDDEHFRLLQQIFTKW (2?)
FLEPENTAAFAQIIYVPLLCNIPPYRSKYLEVKRDMKRVSG (?)
VFQQMVEQHRKTFDKNNLRDFIDAFICEGKKGTDESFT (0) 
DGQLVQYVREMFEAGTETMTGTVRWAMLCLIHYPDAQKKLREEIFEAI 
GNNNRPSISDRKAMPFTSAFIQEVFRFRTRVPLGVQHMTTETVNFANYVIPKGTT (0)
ILANMWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSNHVIPFSVGPRHCLGEQLARME
IFIFLVSMVQKFEFLPDPNEPDLPEINDGVKGNGFFPYPFNFVASEI*

>sequence 42, 59, 218 36% TO 2U1 Fugu
FATSVLLLLVIMDKTD
KKMPGSPPGPRGLPILGMLPFMMSYPERLMADWSKKHGPVIM (0)
VKMGPKNVVILGSSDAAHAAFVKNTHLANRPQDGLEIFAGGKGILFINS
SMFHSEQRRFCLSALREFGMGRRTLEPKIVDCAALLCDQIDDVCGHDVST (1)
GPIQIEQMIYVTMSNVISHLVFGHDSLNDNKGLCELLLRTIEPNKYNALAGILMFLP
SLKNVPPFSTTTRRAIEFREDLHRYIKMEIDRHRASRDPNQPRDFIDKFLNEIEKIKVRHNSNNNNNNNLILNNNVK (1)
GSKSRRKIAGASFSEDQLIPLVRDLFMAGTDTSSTTITWVVIFLASFPDVQTRLHEELDSVLGQNRTPSVSMEDKMPYMRAVIQ (0)
ETFRMRPPLPLSVPHVAQCNTTLMGYRVPKDSIILTNTWGIQNNPKFWVDPDTFKPQRHIDDLGKFIKSPNVI
PFNIGSRNCLGQQLAKMELFITIASLCRRFWFTLPTGETPDLKGESTFILRPFPFNVVATKRV*

>sequence 6, 132, 225 ci0100133889 
MEMETVLGLITIGNVTTILLAVVTVYITRCISKMI (?)
FFPPGFPPGERGLPFLG (?)
VLPYLGEHPHNVLKEWSYKHGDIMAIKLGTTNAVIVSSQSTLNEV (?)
FVKRASAISDRFTNEFFTQILLHRGIFNSAYTERLKIQRKFVHQALHR (?)
NLINNKTPEERINEEIQYFIREIHSQDGKPFHAH (?)
HMLLSSVTNVICSIVLGKRFEYNDKWFTKIMSCIVE (?)
SLAHSGPSFLLPISIYPWLIHFPPFRRINVRFRKSFATFA (?)
ELIQQDIDEHKSSFDENNLRDLIDAFLFEMKKEGHHSSFD (?)
ELQLCVIVRELLLAGNETTGSQIAWLLLALMHFPKWQDRIHREI (?)
KVLTGENGVLKLDDRKKLPCTCAFIQETLRLHSIGSLNAPHKVSRDTEVKGYKIPAGTMV (?)
MSNVFAIHTDPKVWTDPETFDPSRHLNEVGSFINSSK
MSPFSLGPRQCIGKTIAENKMFLYFGNICQKFRIVSETPTPITLHEGQSGFVFSPKEHKVIAIAYSSK*

>ci0100139346 seq 7a = seq 7, 100, 101 46% to 2J2
MLDYLTFTNLALFAIALLLFYIWLTPKYKNLPPGPMGVPFLGCLP
FMETLAERTFTKWSKKYGPIITVQLGDHRTVILNSYEAIEE (0)
AFIKKGEKFNGRFKTYFTEFASEGLGMVFIDGEKWKEHRKFGTRALAG (2)
AGMYGKTIEQRVLEEAENICHVIRSKNEEPFILE (0)
DDLILSVANVINGITIRERCDEEGNENLLKYAAIVVEG (2)
FRKDGLIYSLLSVIPWCRFFPQTQKHIKSLKDGLKAGN (1)
ALFDNLIEQHQQQHDRLHPRDLIDLYLNQMKDFS (0)
KPQLRYFLKDLFAAGTETSSSTIRWALFYLIVNPDIQDKVHKEIDDVI (1)
GHNGVVRYDTKLPYTKAVLQETYRIRTATPLGVPRRNTEDVTLMGYFIPKNTQ (0)
IIPNLWWVHNDPEYWNEPDVFKPERHLDEEGNLIMSNRVIPFSIGARHCLGENLAR
TEIFLFLVSILQKFTVLPNPEDPNPPLECSPGIVNSPHKYPLIMKER*

>ci0100139379 seq 7b 97% to seq 7a above only 12 aa diffs
MLDYLTFTNLALFAIALLLFYIWLTPKYKNLPPGPMGVPFLGCLPFMETLAERTFTKWSKKYGPIITVQL
GDHRTVILNSYEAIEEAFTKKGEKFNGRFKTYFTEFASEGLGIFFIDGEKWKEHRKFGTRALAGAGMYGK
TIEQRVLEEAENICHVIRSKNEEPFILEDYLILSVANVINGITIKERCDEEGNENLLKYVASVVEGFRKD
GLIYSLLSVIPWCRFFPQTQKHMKSMKDGLRMGN
ALFDNLIEQHQQQHDRLHPRDLIDLYLNEMKDFS
KPQLRYFLKDLFAAGTETSSSTIRWALFYLIVNPDIQDKVHKEIDDVIGHNGVVRYDTKLPYTKAVLQETYR
IRTATPLGVPRRNTEDVTLMGYFIPKNTQIIPNLWWVHNDPEYWNEPDVFKPERHLDEEGNLIMSNRVIP
FSIGARHCLGENLARTEIFLFLVSILQKFTVLPNPEDPNPPLECSPGIVNSPHKYPLIMKER*

>genewise.1257.28.1 seq 1, 50 and seq 10, 11, 22 similar to seq 27 and 209 cibd036p10
MIELVLPDLSIAPTILTLICL
LLFYTWYRRPSRFPPGPRGFPIVGVLPFLEKYSERTMHKWSKKYGPVMSVRMGNDDWVVMGNYEAVYQ (?)
SFVKAGSAFSGRPYMKLANEITECKGLVMRDFNPTWKTQRKFGHLTFRG (?)
FGVGKKSLENRIYEEVEVMNKEILSKGGKPFDIS (0) 
EILTNAVMNVICNITADQRFEYDDKNFRLLQHILTKWY (2)
FLEPENTAAFANMIFVPLLCNIPPYRSKYLEVKRDMMQMA (1) 
DVFQELVDQHRISFDKNNIRDFTDAFICEVDNGKDDSFTDRQLVQYLREIFEAGTQTTT
GTLNWAILCLIHYPQAQKKLRNEILDVI (?)
GNNNRPSISDRKSMPFTSAFIQEVFRFRTRVRTGVPHKTTETVNFANYVIPKGTTI (?)
VANLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVIPFSVGPRHCLGEQLARMKIFIFLVSMVQKFE
FLPDPNEPDLPEIDDGVIGNGFFPYPFNFVANEI*

>1257c equivalent to seq 1; 1257c and 1257d are probably parts of a single gene
7055 MIELVLTDLSIAPTILTLICLLLFYTWYRRPSRFPPGPRGFPIVGVLPFL 7104
7105 EKYSERTMHKWSKKYGPVMSVRMGNDDWVVMGNYEAVYQ 7154
7641 SFVKAGSAFSGRPYMKLANEITECKGLVMRDFNPTWKTQRKFGHLTFRG 7722
7982 FGVGKKSLENRIYEEVEVMNKEILSKGGKPFDIS 7995
8226 EILTNAVMNVICNITA 8241
seq gap missing part of exon 4 and all of exon 5
8894 ADVFQELVDQHRISFDKNNIRDFTDAFICEVDNGKDDSFT 8933
9153 DRQLVQYLREIFEAGTQTTTGTLNWAILCLIHYPQAQKKLRNEILDVI 9196
9656 GNNNRPSISDRKSMPFTSAFIQEVFRFRTRVRTGVPHKTTETVNFANYVIPKGTTV 9749
seq gap missing exon 9

>1257d 2 aa diffs with seq 1; 1257c and 1257d are probably parts of a single gene
seq gap missing exon 1 and part of exon 2 
9992  ECKGLVMRDFNPTWKTQRKFGHLTFRG 10016
10267 FGVGKKSLENRIYEEVEVMNKEILSKGGKPFDIS 10280
10509 EILTNAVMNVICNITADQRFEYDDKNFRLLQHILTKWY 10546
10723 FLEPENTAAFANMIFVPLLCNIPPYRSKYLEVKRDMMQM 10759
10937 ADVFQELVDQHRISFDKNNIRDFIDAFICEVDNGKDDSFT 10976
11196 DRQLVQYLREIFEAGTQTTTGTLNWAILCLIHYPQAQKKLRDEILDVIG 11242
seq gap missing exon 8
11913 IVANLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVIPFSVGPRHCL 11962
11963 GEQLARMKIFIFLVSMVQKFEFLPDPNEPDLPEIDDGVIGNGFFPYPFNF 12012
12013 VANEI 12062

>sequence 55, 80, 13 ci0100138298 SCAF 106 
MIEKFLPEWSNTGAVLMLVCTLIFYYWHQRPIHFPPGPRGFPLIGVLPYLGKYP
ERTMQIWSRKYGPVMSVRVGSEDWVVMGSQESIYQ (0)
SFVKGGNVFSGRPFVKVANEVTESKGLAMIDFNPTWKTQRKFGNVTLRG (2)
FGVGKKSLEERIIEEVEAVKMNILSKEGKPFDIT (0)
DILSNAIMNVICNITAGQRFEYDDQFFIQIRRHLW (2)
FMDPDRTAAFSAMVFVPLLCHIPPYRKKYEQIKQDTKEMM (1)
GLFQQLVDQQRKTFDKNNLRDFIDAFILENENGTDESFT (0)
DRQLVHYVRELFKAGTETSTGTLRWAMLCLIHYPGAQEKIRKEIFDVL (1)
GNSTFPSMSDRNAMPYTSAFIQEVFRFRTLAPLGVPHKTTDTVNFANYIIPKGTT (1)
ILSNLWAVHNDPTVWNNPRQFKPERHIDDKGKYIQSNHVIPFSVGPRHCLGEQ
LARMEIFIFLVSMVQKFEFLPDPNKTDLPELDDGVNGVAFVPYPFKLVAKEI*

>sequence 14, 37, 69 25 accessions 50% to 2C8 missing seq in mid
MLPSGYIFVFVFLCIYYLLQWRRRPKNFPPGPLGIPLFGIA
PFAGVDMHKYLATYYAKYGGVMSFRLATKDWIVLNDIEAITQ (0)
ALLKQGESFSGRPQSYLMNQLTEGCGIVFSTGPRWQAQRRFVLTALKT (2) 
LGMGTRSMVGIINEENQNFSSVIQSSGGQVNIL (0)
 (1) DIISEITEEHKNSFDENNLRDFIDIFLLEMKRRNSEYFT (0) 
ELQLLHLVRDLFVGAIDTTTATLGWGIICLLHYPECQVRIQEEIDDVI (1) 
GCAEPDMSHHESMPYLRAFIQEVHRFQTIAPLNIPHCVTEDCVLFGYHIPKS
TPVMSNIWRVHNDPKYWENPEKFSPERHLDSEGRFVPSNRVLSFAVGHRSCLG
VQLARVELFLFYASTLKKYEFQIDPEYGLPDWSNDRSGTVKTPKKFSVLLKSR*

>sequence 16 and ci0100140915 they overlap at exon 6
MFCSIFEWLTSAGIITSLIGTFTFVVLVYYYWWKPPHPRYPPGVRGVPILGAIPLLSPFPHRVLLNWSKKYGAIFSARFGRSDA
VVLNDYTSVYQ (0)
AFVENYVSFQSRPDVQVVNEYTSGFGFAFSKGQYCLDVKKFSAQAIRS (2)
IGIVQIEKWVQEVAEGTVKQLGEFEGNAIDVK (0)
TCIGRKVADVMTMLLFGRFFGTKDQHYEQFLKDCFDN (2)
VTDPAIYSQVNALMFVPLLRHIPWFGNALSVHQNIHFRLK (1)
DFCRKEIEEHKKTLNVNEPRDYVDAFLIEMRNHSPRDSWFH (0)
EENLVNCLVDIYTAGTDAPTNVILWAMV
TLLHKPNLQHLLFHEINNKTSKFKNRYYSNEDLSNRWP
LLKAFIQENFRFNPPAPLSIPHQTNQGVKLDKYFIPKDTM
IISNIYAVNHDPKIWKNPYEFNIYRHIDTDGKFVPSTKVIAFGIGCRSCLGEKLGRLEIFYF
LANMIKRFEILPDPGSKDLPPINEGGNSILHLPIHFKVVFKPRENEDYFSL*

>ci0100141603 seq 16 like, scaf 2157 probably same as seq 16 on scaf 1266
EENLVNCLVDIYTAGTDAPTNVILWAIVTLLHKPNLQHLLFHEINNKTN
LSNRWPLLKAIIQENLRFNPPAPLGIPHQTNQGVKLDKYFIPKDTM

>sequence 18, 216 revised C-term
MSMLVNLITNFDMSIVLTASCAFVLVFYYWYKRPRNFPPGPRGIPLLGVV
PFLGKHTEKDFRKWSKKYGPMMSVRLGQNDWIVLGDHKTINE (0)
CLVKGGNSFSGRPDHPVFKQFTKNHGVAMVDYGDHWKVQRRFGLTTLRG (2)
FGVGKRSMEDRIVEEIEYLNNAIRSHNGKPFNIS (0)
KLFANAVSNNISSIVMGERMDYSDKIFETLTQRSTES (2)
IYNSDMAGKIIRFSIFVPSLWNLLLLFSTQASKQTSVFNSIRG (?)
QAIVNWVSQHKSTYDEHNVRDFIDAFIGEQKKETDVSFT (0)
DLQLIQYVRDLYVAGTETTTVTLRWAVRCLIHYPEKQNKLRKEIFDVI (1)
GREKTPSMKDKAQMPYTCAFMQELFRFRTLVPLGVPHKISNDVQVEGWSIPKGTW (0)
IIPNLWAVHNDPDVWDEPSKFKPERHLDEKGNFVQSKHVIPFSVGPRHCLGEQLARME
IFIFLVSMVQKFEFLPDPNEPRLSEVPQGVAGMMCVPHPFKQIVKEV*

>sequence 20, 87, 90, 107 ci0100147280 complete
MSGVSVCLSVVFFTGFLALCCWYKRPKNFPPGPRGVPFLGVIPFLGNYPARVMQKWSRKYGPVMSVRMGS
RDLVVLNNHENIQK (0)
QALVMQGQIFSGRPNMPALTQVTDGLGLATIDYTDHWKTQRRFGQTTLRG (?)
FGVGKRSMEDRIVEEVEYLNNAIRSHNGKPFDIL (?)
GILGNAVSNNTCSVVMGRRFDYDNERFKKIIATLSGL (2?)
FTNPKSNSTIILVLFMPFLMKIPPFSWINNKRVDGINEIKR (?)
LLREIVHEHELRFDKDDVRDFMDAFIAEQKKNDKHSSFT (?)
DLQLLQYVRDLFVAGTETTTSTLRWSILCLIHNPEKQEKLRKEIYKVT (1)
GQDRVPAMSDKAQMPYTCAFMQEVFRYRTLVPLSLFHATNDDVSLNGYRIPKGTT (0)
VLPNLWAVHNDPDVWDEPSN (2)
KFKPERHLDEKGNFVQSKHVIPFSVGPRQCLGEQLARMEIFIFLVS
MVQKFEFLPDPNEPNLPEIENGSCGTVFVPFRFKQIAKMK*

>SEQUENCE 21, 85, 193
MVFDVTMLNISPFIVFIITFVFVWNWYRRPPRYPPGPLGFPVLGVLPFLSNRP
ENKFLDWSKRYGPIISVSIGWQKWLVLNEYDVTME (0?)
ALVKNRLAFHGRPFVEFFIRTTVPPGIAFAPGGVHWKTLRKFGTNGIKA (2)
VGLENMEKKTNIEIQYFIEEIRGHKGKSFNIQ (0)
EPLTHAVANVVSSFCLGQRFDRSNDSFNQLIRSLMHQ (2)
GHVAANWRTMLLTFLPFLCDVWPFTKHMDESITNVRTLN (1?)
GIIRHFVEEQSASFDGDEVRNMVDAFLMHSRNSVDENVITDD (0)
KIIGFFSDMFLAGTKTITSQLRWGFLIMVKHQDCQNKVRREIDDVIG (1?)
TVKLKHRKLKHRSIMPYTCAVVHELFRF
RTVVPLSLPRMNVADVTVGGYRIPKGTN (0?)
VIMNIWALHNDDCRWREPERFIPERHLNADGKFMKPKNVLPFGVGARSCIAEHLARNKIFLFLVSTLKNF
TLSVPNGDSPSLDAGSNGTIFIPSDYNIVAQYRD*

>SEQUENCE 106, 23, 221 very similar to seq 18 
MSMLVNLVTNFDMSSIVLTASCAFVFILYYWYKRPRNFPPGPRGIPLLGMVPFLGKHTEK
DFRKWSKKYGPMMSVRLGQNDWIVLGDHKTINE (0?)
QCLVKGGNSFSGRPDHPVFKQVTKNHGVAMVDYGDHWKVQRRFGLTTLRG (?)
FGVGKRGMEDRIVEEIAYLNDAIRSNNGKPFDIA (?)
ELMANGVSNNISSIVMGQRMDYNDEIFKTLSKKAPNR
VYNSDTIGKITKLVLYMPFLRNLLFLFLSKPKEMNKINTLIRG
QAIVNWVSQHKSTYDEHNVRDFIDAFIGEQKKETDVSFT 
DLQLIQYVRDLYVAGTETTT 
GTLRWAVLCLVHHPEKQNKLRKEIFDVIGREKTPSMKDKAQMPYTCAFMQELFRF 
RTLAPLGVPHKISNDVQVQGWSIPKGTWLVCNLWAVHNDPDVWDEPS 
KFKPERHLDEKGNFVQSKYVMPFSVGPRHCLGEQLARMEIFIFLVSMVQKF 
EFLPDPNEPQLSEVQQGVSGFMCVPHPFKQIAKEV

>SEQUENCE 26, 81 no introns 37% to 2U1 human 41% to seq 2
MEITTFLLCMLMLVLIMFMWRRSSTNFPPGIEGIPLFGCIPSLGMHKERTIAKWGREKG
WPIFYLRMGITKIEVLNTFESIEEAYVKKGQYFIDRHLPIFLRHYSTGKGVVFLNGEK
WKVQRRFGIHTLRDLGMGKSGMESKIVHEIDHLSEYLVNNNQESGLVDITPIITRAVSNIVC
QLVFNRRFNLDEEFLFIDSVNNKLMKSVTTLKQIMLTLQHIPIFNRIPAIAKIK
KYRTDLANNRLQR
MRNYIKSHKETLDKNNPRDFIDAFLIASEEDKSEDTSFDDEQLLYYIQDLFFGGTDTTSF
TLTWTFLYLAMYPDKQEQLHAEIDEVLGSNGIVSMSCLDKMPYAQAVMQEIFRLRPLVPL
SMPRKTLSRITLMGYDIPKGTVFMPNIWSVHHNADRWPNPEDFMPERHIDSEGKFVKSKE
IIPFSLGPRNCLGIRLAEMEHFMFMIGLLQKLHFSLPEDQPPPDVRGSSFVVLLLPPDFNMKITTR*

>Ortholog to seq 26 in Ciona savignyi no introns
MDFTSVLLFLLVVVLVFLWNRRGSGNFPPGFDGVPLLGCIPLFGAHRERSIAEWGREKG
SWPIFYIKLGITKVVVLNTFESIEEAFVKKGQYFVERHIPIFLKRYSKGGKGIVFSNGEI
WKHQRRFGIHTLRDLGMGKSGMENKIQSEIEHLSDYLVNSPNIVGKKSVDIVPIITKAVSNIIC
QLVFNTRFNLDDKITFIDSAAGIKLVKNASWVRQVKMTLCYVPLLNRIPTIAKLK
QEREALHNTRHDL
LRKYIQMHKETLDKGSPRDFIDAFLIAGEQDKEENSSFNDEQLLFYIQDLFFGGTDTTSF
TLTWTFLYLSLFPEKQKRLHDEIDEVIGSSGNISMSFLEKMPYAQAVMQEIFRVRPLVPL
GMPRKCGQNISLMGHNLPKDTSVIPNIWSAHHNPDRWPEPDSFLPERHIDSNGNFIKSKD
IIPFVLGPRNCLGIRLAEMEHFMFMVGLLQKLQFALPAGQGKVDTRGSSSIVLLLPPDVDLVITSR*

>sequence 31, 98, 200 1 accession 49% to 2U1 ci0100148485 missing exon 5
MLLNNGTMLEVPGSFVSYLTVCKFLTCTAAYLLYRHWYQMPRSFPPGPRGIP
ILGVLPLLGRHPERKLNEWKREYGPVMGARVGRKNWVVLGDCASIKMVIYLR (2)
DIFSGRPSPHKKEDGIVFLDNGRSWTMHKTLGLLAFRG (2)
FGITENKSLEKHVIEEVSYLKEAVRSKRGVAFSID (0)
TILTNAVSNVLCSILLGQRFQYSDQRFKRIIHQLSRR (2)

XXTLLGFINIHEKTYDSTNIRDFIDAFLLLKHRQLDESFN (0)
DKQLIHYIYDLFLGGTETTTGTLRWAILCLLHYPEKQAKLRKEIIQVLG (2)
DQEVTILKKHEMPYASAFIHEVYRFRTLFPLGLPHKTSHQVILDSYVIPQGTT (0)
VCTNLWAVHNDPKVFDEPEEFKPERHLYGNGEFGRSPYVIPFSV
GPRHCLGEQLASMMLFIYLVSLVRSFEFLP
DPKLNGLPDIRSGASGPVFLPKSFNVVAREL*

>sequence 33, 116 ci0100137396 75% to seq 64 complete
MLPAFLGSLTSNYVTFCLTGFFTLVLFVYYYWWKLPHPRYPPGVRGIPVLGALPFLGKSAHKVIMRWSRE
KYGPIMSVRFGQKDSVVLNDYESIYEALVKQGPAFQTRPRIKVIEAHSNGYGFGFAAGHKKFMQVRHFTG
KALRGLGIGGLALEERVSEVAQELVLSLQKLDGKPTNLRMLVGTTVGNVIASIVIGKKFHPNDKEFQRYI
QLIFESFGDKETKKYFLVLIYFSFLRHIPPFKNACKQFIADHREQYDFCRKEIEEHKNNLDVNKPKDYID
AFLIEMKKYSPENSWFH EESLRVCVADLFYAGTETSTSTILWGMIALINYPEIQEKLHKEIVNATGEELP
RLDHKDELPLLQAFIQELYRCMALIPLGIQHQTTKNVDICGYCIPKDTVVFTNIHAVHHDPNIWKNPSEF
NIYRHIDEEGKFIPSRKVIPFGIGCRSCLGEKLARIEIFLFLANIIKRFEVRPDPDSEPLLPIDDGITGF
GFLPFLFKAVVLPRTNKGTQ*

>sequence 36 7 accessions + 4 ESTs LOW 30% RANGE TO 2 FAMILY MEMBERS, NO INTRONS
MASVLTVVGDYLNIWSIVLGGSA
LIFYMWSRKGNLPPGPRGYPILGAMPYLVVHPEKVITKWSREIYGPILSVPLL
NRTIIYLNTYEAITEAFSKQGLKLAGRPYMFLFQQISSDLGITMKTYSEHFKVQRQFGMR
SLKPSRIESLVIQESRYFIDQLHDQVGKVYDMKTDVYNFT
ANIICSVVLGKRFDYDDPAFKLILESSEKALGEPEDANLMMAMILIPQLRFLP
PFHGANQRFLKNQHRILSMLLGIIEEHQKNFDPNNVEDFIDAYIYEQKFGIGK 
NTKLFENDQLKVYVRDLFIAGVETTSNTIQWSILALLYNPK
YMDLMYNEVYEALGSDGVPCMKDREKMPVTCAFIQEIMRFRTITPGAAPHSAMEDIKLNGYDIPSGT 
MVVANLWALHNDPDTWPEPEQFNPHRHIDSD 
GKFIHSPKILPFSIGPRYCMGEGIAKAEIFIFLTSLLQKFTV 
SRDPNAVYPEEIEQGIVSAMSCPHRYNIILTAR*

>sequence 39, 60, 74 ci0100154493 complete
MFCPFEFHEVLVGIITFILVWSCIRRPTRYPPGPRGVPILGVLPFLSKYPEREFLVWSKRYGSLLSVSIA
WQRWIVLNDYDITMEVLVKNRMIFNMRPAIRFYQEALDPLGIVFSPGGAIWKAHRTFDFYILLFHSIGQE
RMEKLIMTEIDYFIAELEKHNKKSFDIQDTLTHSIANVMCRFTLGERFDYKDVRFKKLVGCIIHVGKVTK
DWKTMFLTLLSPLAHIWPFTRHVNEFKNAMKTFNDQIRIMVEEQQSSFNGIEVRNMIDAYLLKSQSFVSC
NEKYTVE DIVGFFSDLFIAGTETLSSQLRWGFLIMMLNPECQENVRNEIHAVIGGNGRARLSDRYRMPYT
CAVIQEIFRFRTLGALSLPRTNVVDVEIEGCKIPKGSKVILNTWAIHNDERNWENPQKFIPERHLDKDNF
INSKKVLPFSVGARSCLGEQLARAEIFFFFVATLQQFELKFDPNAQVPSLDDASNGVIFSPNPYTMIVER
INEDRSNNNNNSKVI*

>sequence 40, 19, 88   missing exon 1 N-term not found in 11000bp ci0100138492 
LPYPRGLPIIGNIHQMGNFPHVKLTEWSKQFGDFYRIKMGRYDALVVNGHENIR (2)
NCLAKKSAAFAGRPPFETSKLIEEGLSISFSNYS (2)
PEWERQKQCTIKALKLYTSGSDKRSTMEETVSSHAKQLAEDLINSADQQ (0)
GLVGDLHDTVIYSTTSVSSTICFGRSFTRQDPELKEFLRNFQSFDKAMGASQIINFWPFLKYFPVLGKSFR (0)
NLKTYMDQYWNFTLSMLEQHWDTYVPNNMRDLADCLWAQSNQ (0)
NRQLTDQQRRIAYGASDAFGAGFDTISAMITWSIFYMAVFPEHQRK (0)
IREEIDRLETSMFSLRHHGDVCPYTQAWLYEVLRH
ISVSPLLVPHYTVKQVEVNGTMIPAGVVVLFNVAN (0)
ADRDTRVWENPEQFEPERFLARDPTTGGARVVASETSKI
LNWGAGKRRCPGAELSRHELFIYIANLVKLCYIE
QAVEGIEPAIPWPCTPGISTKPKAFRVKVTQR*

>savignyi ortholog of seq 40 56% identical N-term not found 
LPSPRGLPIIGNVHQLTTSPHVKLSEWAKEFGDLFRIKMGCFDTLVVTGYDNIR
(2) TALVKHSVAFAGRPPYETSKLFSNGLSLAFNNY
(2) SPAWEKQKRCTVKALKLYTAGPDLQKRNAMEDTASYQANLLVDQLLASVNK (0)
DAITNPDEIVHHSATNVISNICFGRSFSKNDPELQKFVSINRAFDRAMGSAQIVNFWPFLKSVPVLGRSYQ
NLKAHMDVFWDFVFPNLKEHWKTYNPSNIRDIADCLWYQSH
TSSKRDLQRRIASAASDIFGAGYDTTHKVVLWSLFYMAAFPQYQQKV
RDIFRVSEVKMY
TLRHHGDECPYVQAWIYEVLRHTSLAPILLPHYTTKEVTLNGVRIPAGVV
KKYHTIQAHKDPKIWKNPDEFDPGHFLEEDGSKLRSEAVHKLLSWGAGKRRCPGAELSRHE
IFVFVTTLVRRAYIGQAVDGVEPAFPWNTTGGISISPDPFRVKITER

>sequence 41, 77 
MLRIFAFASQISMETLLTSAAILTCVLFVVYDTWWRFLGTSGTLRGRN
GFPVFGCFPLLGKNKERVFMNWSRTSYGPIYYARLGASRVLVLNGYEAVREALVLRPQ 297
AFAGRCKTELMKLLYEGQHGIMQLDHGPMWKAQRKFGQNALGSLGMGKRSLQ (0)
QQIVNEFNCLSSTLEXELLANGSKGAYLDEMLWLYAVTNISLLIFNESFDPDNEK (2)
RERFLYLLMYVPFLKNLPLVRQQLNVTIQGRKTKLAKFKQFVENHRQSRDPDDPRDFIDCYLNTMDSKSANDS (2)
FTEKQLLYFMNDLFIAGTETSSNTNSWALLLLLKNPEVMKRLRDEIDSLGREPTL (0)
EDEYKMPYTRAVMQEIFRYRPALPCNI
IPRKTTTSIELNGHSIPAGMPVIANLWSVHHDPVTWAPDPE
TFRPERHLNRDGEFMTSDHVMPFSVGLRSCIGRNLARNELFIFLTSILRRF
NLEFADGCYVDDMTGESGLLLHPPSYKVKGSLRA*

>Cs41a Savignyi ortholog A of seq 41 74% to savignyi ortholog B of seq 41
MLYLSLFSSNFSIEVVLTCFCLFLLLVYNVRVRHGGTSGTLPGKDGFPIVGCFPMLGKHKE
RVLMKWSTSSLGPMFYIRLGTSKVLVLNGYNAIKEALEQKPLPIAGRVESKFVKMLYDGD
HGLIQLDYGPMWKEQRKFGQTTLGGLGMGKRSFQ (0)
SKIRDEFNCLAATLESELSINGPKGVNFTSMLRLYAANITSSLVFNQTFDSTW (2)
RERFLIQMMYMPWLKYLPIVWKQLALSKRCRRDKVKIFQALVDKHRKSRDPNDPRDFVDCYLNVMEKKQGKNS (2)
SFTEKQLLYYIVDLFLAGTETSANTNAWAILLVLKNPQVKAKLRSELEAFGKSPTL (0)
EDEAKMPYTRAVMQEVFRYRTSARLNIVARRTT
SDLELNGYKVPANMPVIANLWSVHHDPLTWAPDPETFRPERHLDGDGNFIPNDHVMPFSV
GARSCIGKNLAKNEVFIFITSIFSQFDLELADGCHVDNMDGDSGMLLHPPEYRVKGSIRV*

>Cs41b Savignyi ortholog B of seq 41 74% to savignyi ortholog A of seq 41 
  1 MLFSSYLTFNFFIESLLAGLCLFLLLVYDVFIRLR 35
 36 GTSGTLPGKDGFPLVGCFPMLGKHKERMLMKWSSSGLGPVFYARLGASRVLVLNGYDVIK  95
 96 EALLQRPLAMAGRVESDLIQELYDGKHGLIQLDHGPLWKEQRKFGQTTLGGLGMGKRSFE 155 (0)
156 SKIVDEFNCFATTLENELNMNEPKGVNLTRMLRLYAANVISSLIFNQTFEISC 208 (2)
209 RERFLILLMYVPGLKNLPIVRQQLAITTRTRLDKVKIFRTLVEEHRKSRDPNEPRDFIDCYLNVLAKNKGENS (2)
    SFTEKQLLYYIADLFLAGTETSATTNAWALLLVLKNPEVKRKLRAELDALGKQPTL (0)
    EDECRMPYTRAVMQEVFRFRPAVPFNI IARKTTSELELKGYKIPAGMPVIANLWSVHH
    DPVTWAPDPEIFRPGRHLDDDGNFVPSNHVIPFSVGARSCIGRNLAKNELFVFITSVLRR
    FDLELADGCHVGDISGDSGMLLHPPKYKVKGFG*

>Savignyi ortholog C of seq 41

    RERFLILLMYVPGVKKLPFVRAQLAITTRTRLDKVKFSRPFGEKHRKSRGPNEPRDFHDCYLNVLAKIKGK (10)
209 RERFLILLMYMPGLKNLPIVRQQLAVTTRTRRNKIKIFQTLVEEHRKSRDPNEPRDFIDCYLNVLEGKKEENR 281
    SFSEKQLLYYIADLYMAGTETSANTNSWALLLVLKNPEVKKKLKAELDALGKQPAL
338 EDECRMPYTRAVMQEVFRFRSALPFNIIARKTTSELKLNGYKIPAGMPVIANLWSVHH 395
396 DSVTWAPDPEIFRPERHLDDDGNFVPSNHVIPFSVGARSCIGRNLAKNELFVFITSILRR 455
456 FDLELADGCHVGDISGDSGMLLHPPKYKVKGSVLM* 490

>sequence 43 shares an exact run with seq 24 ci0100132600 complete
MFAKLFGWWGSSWVTTCLLGLLTLVILVYYYWWKLPHPRYPPGVRGIPVMGALPLLGKFAHKVIMRWSHE
KYGPVMSVRFGPSDSVVLNDYESVYEALVKQGPAFLTRPNIELINSYSNGYGFGFAEGNKKYMEVRHFTL
KALRGFGIGGRVMEERVSEVAQDLVQSLQELDGKPTNFRMIVGTTVSNVIASIVLGKKFHPKDEDFQRHV
QLIFDSFGDEETASYILVLMYFPLLRHIPPFKNACKKFVDVHFKQFDFCRKEILEHKKNLDENEPGDYID
AFLVEMKKHSPQDSWFHDESLMVCVADLFLAGTETSTSTIMWGVVVLINYPEIQEKLHQEITNTTGEALP
SLNHRDDLPLLQAFIQEVYRCMTLVPLGVQHQTTKDVEISGYCIPKDTVVFTNIDAVHHDPNIWKNPSEF
NIYRHIDKDGKFIPSKKVIPFGIGCRSCLGEKLARIEIFLFLANIIKRFKVLPDPESQDLPPMDDGITGF
GFFPFPFKAVVLPRKNEEIPE*

>sequence 231 80% to seq 64 may be in gap at end of scaf 311 close to ci0100140915 = cibd084l24
FYRKEIKEPKNNFNVNEPKDYVDAFLIEMKKHSLEDSWFH (0?)
EEALLICVADLFVAGIETSTGTVMWGMTVLINYPEIQEKLHKEIVNAT (1?)
ETPPNLDQMDELPLIQAFIQELYRYMTLAPLSLQHETTKDADIGGYCIPKETL (0?)
VLTNIYAVHHDSNIWKNPSEFNIYRHIDKEGKFILSKKVIPF (?)
GIGCRSCLGDKLARNEVFLLLANIIKGFEILPDPESKELPDYKDGVNGFLYVPYDYKLVAKPRIVNE*

>sequence 46, 54   33% to CYP1A
MMITAAILLDAGRSFAVPVAFTAVSVLTLYVCLRKRQGIPPGPTAWPLVGNL
FSMGRQSHLILESMRKTYGDVFSVYFGSTLVVVVNGKAVEECLSTHSAR (2)
YSMRPELHTAQYILEGKSFAFSHIAVSKHKRYRTLAVAVVKQLVNGGGEKTDVAV
KHGLQNGTRHSSIEERIFMEAACMCDKLLETSDSPDLKDEILKVITKEL (2)
LSEYELDEISRVVENLRNSNEAIMLVNFIPAVRMLWRNGLQKYIQLTQSLNR (2)
FFERCIRNRKAQLATVSNGHTEDNGVRLTNGVDCTVKFWQKLKNDPQYEESRVMKV (0)
VADLFGARVDTMTVALAWMIVYWSTYQAAQERAQKEIDHFVKNEKRLPR (2)
YSERNQLPYTMALIMEVERHCSFVPFTLPHAPAQDTMLNGYLIPKGTMMLISMRSINHDTAVWDSPAQFR (2)
PERFLLDQSGGFNSALAEQVMLFGAGRRRCAGEALGRMQIFLYSVLFLRKCTFRR
SDKDGHVLPESLAGISLIPQTMCVSISRREADGSKNTEP*

>sequence 49, 67 opp end of clone is seq 67 join together
MLVQILTATFWTLIP
NSFGDLLIYAILVLTIVIYVKSLKRDKEWLALPGPIPW PLVGNA PFLGAEPHKKLLELSL
KYGPVYRLKMGGIKTVVLCNAEVVRSALIKQREAFSGRPKFSSYKAVS AGESVVFNDEET
LPP WRSH KSKIVRHMHKYTTSIRTRDKVTDLINTECMMMVTELDRISRSKCVNPENVIRM
ALANVMCAVCFGNRFEYDNE (0) 
EFQKLLSMNTEFGAVIELGPIIDAMPWIK (0)
VIPKFKKAIADYLKINLQLDTWSRHR (2)
VDGVLKTFDNDDVTNVVASMTSEVLEKKSAGESREITESETKTIAALSADILGA 
GQHTTSTTFFWVINLLLCFPKVLNKLTEEVRSKLGNRLPTLEDRTSLPYMDAVLTE 
VLRFSSPLSSTIPHSTLKDVKLAGHTIKRGTMVIISQYAVNHDPQNWKNPENFDPERFLTK
NEGGEIIFNESLSEKVLAFSIGERKCPGSQLSRMLLFLATTLLVQVSDLSADLERPPT
AAAEYGLILRPKHLSIKLTLREHWQRRDSIRA*

>SEQUENCE 25, 35, 93, 89 end is long but = EST rcies001a09
MDSLVFVLVDTVLVMKYQILLLLVIVYAIKLLAASQSRRLNIPGPYPWPVIGNVIEMGGQPQFSLTNMAK (?)
RYGPVYLMKLGTADVLVLNNYEVIKEALLRQRRIFGGRPIFDSFKKISQGLGVVFNSTMT
QGDEWMKLKMTIVKHVHRFVSSEETKGYVAHHVQMEAVELVRILTEKCRS
SPNEVIFPIEQINLAIANVVCAIMFGHRYQHGNK (0)
EFQDLISLNEQFGDVIGSGSQVDVIPWMK (0)
IFPKFRNALKVFDFLTNRLNNWMRLR (2)
TKEHRLTYKHGVIRDIVDSFIAESIDHPEQSALNDDVIMALTTDVFGA
GQDTMSTTMQWVFVYMMHFKECQRK
IHAELDSVIGPGELPHISDRRRLPYLEAVMHEIFRHSTFTSTTIPHVTTQDTVLDGHFIP 
KGILVFINQFGANHDPNHWVDPDKFIPERFLDGKGNLISRPHDRYLLFSTGARKCPG 
DELSRMLILHFMATMFALCEVSSDPQKPATL
DAVYNLSMRPKELRTIVRS
RNLPFLKNSVAQMSEADSHVLTVPGETTSFLTSRVESTVPDNQESQFSDNDFEKVDTKIP
KRKVFSRPTLTHDDINGNNVRKRGNLHQSAMYRIQLAT*

>sequence 15, 63 N-term 43% to 2C9 C-term 46% to 1A1 WXXP to heme
MESVWVVIKWVKETMMSNSSFETIVAVATLLLLLMFVSENWNWLKIPGPI
PWPIIGNLGSLKGTKFLSIHEMYKIYGRIFRLKFGRVEAVVLCDVELIKE
ALLDRGRSLSGRPQFASYRLVSGCKSVVTNDPRCLREWVNY
KSTMVQTLCSISKNNEMKELMNERIGSVLVYMIQELEKGGDGQNFAEDIVTKTVANFLCT
VCYGGTYDFNSK (0)
EFNNLIEMSRHYTDNLSKSILRDMIPLAE (0)
ILPSVNKGRADFAKTSYHLHLWFLKR (2)
VEEVIQHFQPNKLNDLASVMVSDLTNDPTENISNITEKDRNSIAAIINDLVQG (?)
GYHSLYSMALWVVTYMIKYPEEVKKIENELNEVLDDYLPTLHDQESLPHTMAFINE (0)
VLRCRPSLPLAVPHSATEDTKLGGYDISKDTMVVASLYSANRDPKVWANPDQFDPSR
FLAKDDLGVTVLDETKVEQVFTFSLGDRKCPGEDIGRSFLFLTTAYLAHTCKLKPDPAK
PPTFQTKPGSITRPKDFGVQLNVKKCWLGVFKPDDNEE*

>sequence 51, 219, 223 1 accession 40% to 2J2
MYDVIIVCVISAIAVVAASWLKQNGSKKFPPGSRGLPFLGSVPFVLGSLERTLA
VWGWQNYGPVFFFTRGNTRTVVLNTYEAASEGFVKQGNH
LSGRFIDANHNELFADYSPQWQEQRKLRVKTLKGFGKVNTEEIMTSECKKMICRIRETKGSF
EAGYYLAVPTFNIVKSIIYGE
RFENKMNLIIYTFIFVAGEPSEGMNKQDEVNLSKVLADPVYKAGNGFMEFMAGMFPQIRS
SVAYERMLEKKGKERVRQVKE
KMTGSKKLIEGMLQNLQDWETGEVYNNQQLSCYLTDLFMAGTDTMATTVHWSLVFLAQNP
QVQEKMAAEVKSVAGNDVITTSMMDSMPYTRAVMHESSRLRPVFPLSLRHQATSDVTVKD
YVIPTGTSVVANLWAIQNDPKWWKQPSEFRPERHVTEDGGFTKSEKIVPFSVGPRYCLGS
QIAIYQQFIFLTNLVRGFRFRFDQAKVEPDLTGLMTSVLAPVNVHLVAEER*

>sequence 57, 104
MFSTLLGSLALNYISTCLLGIFTLGVCMYYYWWKFPHPRYPPGVRGIPVLGALPF
LSKFAHKDITRWSREKYGPIMSVRAGQTDSIFLNDYESIYE (?)
AMIKQGPAFRSRPAMKLIT
AYSNGYGLGFSESHKKHSDLRQFSLKTLRGFGIGGRQLEDRISDIAQELVQAFEMLDGKATDVK
MIVGKTVGNVIASIVLGKKFHGDDKDFQRHINLIFDS (2?)
FGDEETAKYVLAMMYFPSLRHIPPFKNAWDQTLKGHFEQL (1)
DFCRKEIEELKKTLDVNDPRGYVDAFLIEMKKHSPEDSWFH (0)
EESLIVCVVDLFFAGTDTSTSTVMWAMVALLNFPEIQEKLHKEIANAT (1?)
QIYSGEALPSLDHRDELPLLNAFIQELYRHMTLGPFGVPHETTTDAYIGGYCIPKNTR (0) 
VITNIWAVHYDPNTWENPSEFNIYRHIDEDGKFVPSKKVIPFGIGCRSCLGEKLARLE
VFHFLANIIKRFEILPDPESKELPDYRDGVNGFVYVPYRFKLVAKPRIDNK*

>seq 64 cibd042f04_3 75% identical to seq 57 but not same seq GCiWno474_c04.b1
MLPAFLGSLTSNYVTFCLTGVCTLVLFVYYYWWKLPHPRYPPGVRGIPVLGALPFLGKFAHKDI
MRWSREKYGPIMSVRFGQKDSVVLNDYESIYEAMVKQGPAFRSRPTMKLITSYSNGYGFG
FAEGHTKYMELRHFTFKILRGFGIGGRYLEDRISDHAQELVQDFKKLDGKATDVR
MIVATTIGNVIASIVLGKKYHPKDKDFQRHLQLILNN 
FGNEETASYILCMMYFPSLRHIPPFKNAWKTFMDEHFEQL
GFCRKEIEEHKNNLNVNDPKDFVDAFLVEMKKHLPEGSWFH (0?)
EESLILCVIDLFFGGTETSTSTVMWAMVVLINFPQIQEKLHKEIVTAT
EALPNLDHRNELPLFQAFIQELYRCMTLAPLGVQHQTTKDADIGGYCIPKDTV 
VLTNIHAVHHDPKIWKNPSEFNIYRHIDQDGKFIPSRKMIPFGIGCRSCLG 687
EKLARSEVFLLLANIIKRFEILPDPESKELPDYKDGINGFLYVPYDYKLVDKPRTVNI*

>sequence 58, 70 45% to 2R1 ci0100144015 complete
MLTVSYVYVLALFLLCCLFRWRQRPKKFPPGPLGIPILGITPFVGSRMEKYLASYYAKYGGVMSFRLATKDWIILN
DIEAITQAYSKQGESFSGRPQSYLFDKLTEGCGIAFANGPRWQEQRRFVLTALKTLGMGKKGMEEIIKTE
NQHFISVIQNSGGKVHIL (0)
KEIRRLEANIISKVLIGKRFDYTDERLNSIIDCRFR
CTFTAFSSIYSTVSRIIQETGLQYSARNKQVMFIY (1)
IIRKLTNEHETPDEANPKDLTEIFLSEIEAKKRRILLE (1)
FLSNIFCTQKIQLYHLIRDIFVGGIDTTTATLGWGIICLLHYPECQMRIQEEIEDVI (1?)
GNSEPDMTHQKHMPYLRAFVQEVHRYCTIAPFSIPHCVTEDCVLFGYHIPKSTPVMSNIWRVHNDPKY
WKNPEKFSPERHLDLEGRFVPSNRVLPFAVGQRRCVGEQMAKVGLFLFWASFLQK
CEIHIDSDFGLPSWKKEPSGTVRSPQEFTVKVRNRLQNN*

>sequence 62, 83 8 accessions 42% to 2U1 ci0100151040 complete
MLPIICLSFFLIIFLCWKPLRKRSNYPPGCDGMPFLGCLPLLGVHPANTIMKWSKRYGDVFYVKYAMQNV
IMLNSFESIKEALVKQGHVFSG RQHTFITEFTSGNAGLGFLDYGAVWRSQRRFGLATLHEFGMGRRCMAP
KILEEADLMCDEMLKHGDHAFDCMDHITPAVTNIISILVFGKRFEYDDPTLLDTLHLLFDFGHDFPFALF
YFAPLVRFLPPWKAELEKIRQARKPVWDHIMSAIQQHQDTYNENDCRDFIDSFIAKMKSTNADGVKF
EQLFHYVKELFLAGTDTTATTLRWAILLIHYHTDVHEKIHEEIDREASNPVKFEDMPNMPYTQAVIQEVFRSR
PIANFGTMRKTTAVGKVGGYRIPKGSIVMPNIWAVHHNPQRWKNPHLFKPDRHIDQNGKFVKSEDIIPFN
VGQRSCLGMQLAKMEIFIFLVRMMQRFNFKLATDKEKLDLNGQYIFLHYPPAFDVSVQLRS*

>sequence 66, 75 1 accession 41% to 1A1 33% to 2C9 63% to seq 57 
MFESFGYYFGESLTSNHVISGLIGVFTIVMCVYYYWWKFPHPRYPPGVRGVPIFGAIP
FFGRYVQETLAKWSRSKYGPVMSARFGQDDAVVLNDFESIHE (0?)
ALVKNIQHFNSRPSFYIVEQFTHGYGFGFADGHNKYLDVRNFSLSALRG (?)
LGIGRRTMETRVSEVAQDLVKILEDLDQKPTDLK (0?)
MMLGGTVANVLCSVVFGKTYDLSDPEYQYAVQCSFDC (2)
FGDPENSEYLNFMFFYPKLRYIQPFKRALKKFIDVHLGLIAFNQKEINLHKERLDVN
EPGDYIDAFLIEMKKHSPENSWFHETQLLHCLGDMFIAGTETTTNTILWALLALIHHPEI
QEKLYQELLDNIGEQVLPSTDHRDKIPLFRAFTQEVYRFKTIVPLALQHRANKDVEIGGY
VIPKGTKVFPNLHAVHHDPNIWKNPSEFNIYRHISKDGKFIPSKRVVAFGMGARSCLGEKLAITE
VFLFLANIIKRFEILPDPESKELPILKDGVNSLLYVPYRFKVVVKPRIINE*

>SEQUENCE 72, 86 ci0100136116 100% to seq 18 C-term 3 diffs to seq 72 100% to seq 105
MMEVLIGLNVSTLATFLVVFYAVYYWYKRPGSYPPGPRGYPLVGVLPRLGKYPERVMRKWSRKYGPVMSVRMGR
NDWVSLNDYSSIKE (0)
ALVKQGNVTSARPDFPAHQEVAHYHGIIFSSGDLWQNQRRFGITTLRE (2)
CGFGKRSLEERIVEEVSYLNDAIRATMGKPFEIM (0)
AKLGNAISNNICSVAMGRRYDYDNKRFKDLVAKVSWR (2)
SMGVVRIRFYNFFKCFFCLLPNGTKTSIERC (1)
LFREIIDEHRATFDKENIRDFIDAFLERMAGNDPHFTVTIYA (0) 
EFQLLVFLRDLFIAGTETTTSTLRWALLVLLHNPGVQDKLRNEIHKVLGF (0)
STTVSMAHRESMPYTRAFIEEIYRYRTLTPLDLFHETSGELNISGYKIPKNTV (0)
IIPNLWAVHNDPDVWDEPSKFKPERHLDEKGNFVQSKHVIP
FSVGPRHCLGEQLARMEIFIFLVSMVQKFEFLPDPNEPNLPEINHGVNATAFVAYPFNIVANQV

>SEQUENCE 157, 79, 215 60% TO 2C8 ci0100148852 
MASNIALELCKGLYT WTSVAVIAVSLFYIWWRRPHNFPPGPRGLPLIGVLPFLGRFPGEVFKKWSLEYGP
VMSIRMGRSDVVVV NTGDSFREAFVEQGANTSGRPPDIMFAQVTGDKGLVFKDYSANFKATKKWILRTLS
QYGLGRRGMEEKIQEEADCLAGYLRTKVGKPVDFKVLYCLTSNVVSLIMSGRRYDYEDEEHQKRLEFGDP
NDAPKLLLIDLFPALKYLPVFKTANDKLRSRMKAVLGFTRSNIEEHKKTFEKDDIRDFIDTFLAEMKFGN
ETSRDNFTVH ELNVLFRDFFLAGSETTSTNILWILLLLVHYPEHTAKVIKEIDTIIGPNASASSTTHSSQMPF
TCAFIQESMRYRTSVPLGAQHYCQKTFQLNGYTIPKGTLIYPNIWGVHNDPKSWPNPSKFDPYRHIDEKG
NYRNSSNVIPFLIGPRNCVGEALARQEIFIFLVKILQKFEVKPVPGKTPSIADGVNSVVFSPHSFNLILQ
ER*

>ci0100144919 96% to seq 84 almost same as seq 222 N-term complete
MIFEIVITILGAVLLHWLFIALFNPWRYPPGPAGLPLIGSLHLAAPEFHIKLPKLAKKYGPIFTIKAGCRRVVFLV
GYDVIKEVITDRAKDFASRCPTIVARIVRGEGLDG IAASPYGPKWMANRKFFYSAMRTMGLGKRGIEKCVVDE
IPYIVGELENICSNDGLFEPSSVFDSAVLNVLAYFTFGNRYSYQDEKFKELIHINNDFFQKSKFLNQPQF
FLLTLIPGLDKYWLPKCGKDIKESFG KINKFVKAEIKQHRQNLDPENPRDYIDCYLNELNQMNDQSELSE
LGLEMSIMDLFQAGTETTSTTLRWAILYLANNPHIQEKVQQEIDEVLGFDQLPQYEDRMRMPYCEATVLE
VQRMATIAPI GLQHCSEDDQVLGGYRVPKRTSIVACYHSIHFDPKFWKNPNKFDPCNFLDSNGKVCIPDA
FMPFGGGLRICVGMNIAKQELFLFFVAILQKFSLHLPDDMDHLDEKPGPGIILAPKIYRVQIKRR

>SEQUENCE 17, 111 first intron is over 3000 bp
MIFEIVITILAAVLLHWLVNFLFNPWRYPPGPAGIPVIGSL
HLAAPDFHRSILIGFQISFAISVAFLQKQINRLR (1) 
NIVFSELPKLARKYGPIFTITAGYRRVVFLVGYDLIKE (?)
VITDRAKDFASRCPNMPGRIVRGEGLD
GIAAAPYGPKWMANRKFFYSAMRTMGLGKRGIEKCVVDEIPYIVEELEKLCTKNELFE
PSSVFDSAVLNVLAYFTFGNR (?)
YSYQDEKFKELIHINNDFFQKAKFLNQPLFFLLSLVPGLHKYWLPQCGKDLKESVG (?)
KINKFVKAEIEQHRQNFDRKNPRDYIDCYLKELDQMNDQSELS (?)
MSIMDLFQAGTETTSTTLRWAILYMVNNPNIQSK
VQQEIDDVLGFDRLPQYEDRMRMPYCEATVLEVQRMATIAPIGLHCSEEDQ (?)
MLGGYRIPKRTSIMACYHSIHFDPKFWKNPNEFDPCNFLDSNKKVCIPDAFMPFGGG (?)
LRICVGMNIAKQELFLFFVAILQKFSLHLPDGVDHLDEESGSGITLAPKIYQVKIKRRA*

>ci0100144128 90% to seq 222 scaffold 38 complete
MLYGIVLTIIFVLCLPICHWIYVLLSKPRNYPPGPLGIPLFGSFHYAAPAIHI (1) 
NLPKLAKKYGPIFQLRAFNQAIVCVVGYDLIKE
VMINKSKDFASRYASVPGKLVRGEGMD
GIAVAPYGPKWMANRKFFYSAMRTMGLG
KRGIEKCVIDEVPYIVEELEKLCSSDELFEPSSVFDPAVLNVLAYFTFGNRYSYQDEKFKELIHINNEFF
QKAKFLNQPEFFLVTLIPGLHKYWLPQCGKDLKESVGKIHKFVKAEIKHHRQHLDPQNPRDYIDCYLNEL
NQTSDQSELSELGLEMSIMDLFQAGTETTATTLRWAVLYMVNNPHIQENVQSEIDNVLGFDQLPKYEDRT
RMPYCEATVLEVQRMASITPFGLLHCSEEDQKLGGYHIPKHTCIMPSYHTIHFDPKFWKNPNEFDPCNFI
DPNGKVGTRDAFMPFGGGLRICVGMNIAKQELFLFFVAILQKFTLQLPDDVKSLDDEPIPGMTLAPKPFR
VKIKIRS*

>SEQUENCE 91 WXXP TO END 39% TO 2R1 56% to seq 36
487 WPQPDKFDPHRHLDSAKKFISSSEIVPFSVGARY
CMGRETAQTEIFVFVVALLQKFKITRDPTLKWPIHMEDGIQSALCCPSRYKIILN*  216

>SEQUENCE 94, 189, 97, 96, 137, 191 41% to 26B1 fugu ci0100143924 revised
MSFSVVSAYQKVAPMVVTYDLLRMPATTGRNTTTLWQHEFLSTM (0)
ENATEGWETELEVETVAEATFQQGANGEHEYSGSDQFFEQLNGTLQSLFDSSAVAASPLMA
FILLGLAAILAVCILVHPLKAWWLNYNMKSAVDPDSKLNHLPIPPGSFGLPFIGETLAWITQ (0)
GPKFNSNRRKKYGNVFTTNAISLPIVKVSGHEYVKE (0)
ILTGEHDKVTTIWPYTVRTILGSHGIVNSIGDIHKYKRKVAFKAFTRAALNDYVPIMRNHATRI (0)
VRQMQESDQPLVYPNMLRLTFDVAVNALLGLEISDQVELDMLFKTFHQLVSNVFCLPYNVPGFGFNK (0)
GMKARNQLLDLLQIHIEAKKQA (0)
KLREESNLDNNNTKSDADNEFELGDLEIKEIAIELMFAGYYTSASALTSAILELARHPDVFSKL
ENELLQHGILREDSSDEEHMPELNLQNIHKLTYLDQVLKETLRIRPPVLGAYRRAKKTFQIG (0)
DYRIPKGWTVIYNIRDTHELEFEHMTEFDPEHFAPDANDKKFR (2)
FIPFGGGPRVCIGQEYARIIMKVSLIEMIRCNSSWKLANKTLPKMVA
IPTLHPKDGLPVHLQPRKTVGVVTSRSEPVATDSS*

>SEQUENCE 102, 53, 150 SIMILAR TO SEQ 8 N-TERM 39% TO 2B6
MIDIFQTICASTGPYPFTVLVICVVWVFRWWRRPHDLPPGPRGFPLLGVIPFMGEFPER
VIKKWSLENYGPIMCARMGMQDSVFLNTADAIKM (?)
AFCDNSEYLSGRPTTTIFKQVTGDKGIGLKQYGENYKVIRKFIVRTLSK (?)
YGVGKTMIENQIVEEAGLLAEFLKTKTNQLVDL (?)
FYLYVLTSNIVSRVVCGRRYDFEDEKYHQVLINLQKM (?)
LGDAKDAPFTLLIDFFPGLKHLPWFRGANNRMIERIKNWL (?)
EHSKDHVEEHKKTFDKNNLRDLIDSFLHEQAYGKTENKN (?)
EEELIVIVRDLFSAGSETSSNSILWILIVLLHHPHHAKKCIQEIDDVMG
PNGAPSSQHREKMPFTCAVIQETFRYRTVAPIGLQHYAQATVELGGLRIPQGQG
VYSNIWGVHNDPVAWPNPS
EFDPYRHINKDGKFVLSNNIMTFSIGPRSCPGEALAKLEIFMFVTKILQKF
NVRASPNNPPSLDGVNCLQFTPLSCEVILTER*

>SEQUENCE 103 93% TO SEQ 102 (3 DIFFS) SIMILAR TO SEQ 8 N-TERM 37% TO 2R1
MFENMVYTLQYIFASIGPYPFTVLAICVAMLVRWWRRPHGLPLGPRGIPLLGVIPFMGE
FPERVIKKWSLENYGPIMCARMGMEDGVFLNTAEAIKM 
AFCDKSEHLSGRPRTTIFKQVTKDKGLALKQYGKDYNIIRKFVVRTLSQ
WGVGKTMIEDQIVEEAGLLADFLKNKTNQPIELK
LYFYVMTSNIIGRVVCGRRHDFEDEEYHEILN
NIQKVLGEADDAXLTLLIDFFPGLKHLPWFRGANNRMIERVKKTLEYCKGVIEEH
KKTFDKNNLRDLIDALLYEQQFGKKSKRKVLL
EDELVVIVRDLFTAGS
GETSSLMISWVLIMLLHHPEHAKKVIEEIDDVMGPNGAPSTKHRDRMPFTCAVIQETFRC
KIVAPIGIQHYAQATVELGGYIIPQGTMVYSNIWGVHNDPVAWPNPSKFDPYRHISKDGKFF
TSNNIISFSTGPRSCPGEPLAKMEIFLFVTKILQKFEVKAPPDKLPSLVGVNCLLFSPES
CEVILTER

>best savignyi hit to CYP24 53% to seq 110 possible processed pseudogene with 
MFQQLWEPLRLILYNVAVLAILAVLFKFFKRISVICKIRWNLRTEWKGPPCHWFWGHLGQVT
KGNKDPANFIPYFTKNEKLFPFGFAIYAGPLRTILTIHHPNLVKQVLNASTFDAPKPKEAYGGLLLKWL
GLGLLSLNNREWFRHRKLLTSAFHHDILKPYVKISNSCTDILLDTWQKKVKQQPTGFSIEIFEQA
SLLTLDIALQCLMSYESKCQERLDNDYISSIKNLSFLYVRRNRSKNLLLKYFPFLYKWS
QEGKLYYKECDSVHKFSTSLINRRKQELGNPDIAPKREFLDFLDMLLQARDSNGLGLTTS
EICAEVDTFLFGSHDTTASGISWFLYCISQNPEVQNKCF*EITQVLGDRENVRWSDLSSL
PYLTRCIKETLRLYPPAPYISRTVNKDIYAEGKTILKGTTVLLSIIGVQRSVLFWKYPQK
FNPDRFTPESASMQNNHAFIPFSAGPRNCIGQHFAMNEMRVVLSKMLRTFRFSIDPNHKI
VPCHEIIYRSRGGIHLIVEPRTSVSI* 

>SEQUENCE 110, 121, 122, 139 67% TO 4F3
MLFMFIKYAVYASFSLLALSILHSLFS (2)
IQEFYTVTKYLKSNWPEPESHWLWGSFGAV (0)
MLKMSADPSYYFTYMNDCMKKYPYGFRTQLGPFIKSLNTYHPTIAKAIL (1) 
NVPKLKRAYKLLFEWL (1) 
GHGLLVLEDKVWLRHRRLLTPSFHFEVLKPYVKTMNESAHVMV (0) 
ENWLSKTSDNKVAKVEIFHYA SLMTLDTTLRCLMSYQSNCQDEN (2) 
STNDYVAKIYELSETIVRRQRNLNILNRIDPIYNVT (1) 
KEGRKYLKLCDDVHKFSESVINRRKCDSEQPAHNETRKYFDFLDTLLKAR (0) 
DSDGKGLSDSEIRAEVDTFMFEGHDTTASGISWTFYCLAMHPEHQEKCFQEIEKVMADRTDIEW (2) 
NDLS NLPHLTLCIKESLRQYPPVPIIFRKLNKDIEVDGKTIVK (1) 
DT NVVLHIYALHHHEEFWKDPHIFDPSRFTQENMKSMNSYAYVPFSAGPR (2) 
NCIGQRFAMNEIKIAVAQVLSKFQLKPDLSKKIQHSVDVIYRATTGLYLKAERRV * 

>savignyi ortholog to seq 110 63% identical to seq 110 boundaries need checks
MGIKPEYSIPSYIFLYFSIYEQYRARKYLKRNWPGPKGHWFWGYAGEL (0)
MRKIRKDPGYYFSYIRGITKTYPYGFSSNLTPLFASLNIYHPNAVKAVL
DAPKLKRAYRSLFEWL
XHGLLVLEDKEWFRHRRLLTPSFHFEILKPYIKTMNECSITMV (0)
ESWVKKTSEKAQIADDMDITPSISRMTLDTILKCLMSYESNCQDEX
STNSYVSKIYEISASVAHRMRDVNIFKRINAIFYAT
TSFRAHGKAFLKLCEAVHEFSDSVINCRKAELKQPLVKQNRKYLDFLDTLLKAR
DADGKGLSDREIRAEVDTFMFEGHDTTASGISWILYCLAENADHQEMCFREIQEQMGDRTDIEW
DDLAKLPCLTMCIKESMRLFPPVPIIFRKLNKDIEVDGKLIAK
GTDIVLHIYALHHHEMFWKDPEKFDPNRFLPKNSSNVEGYTYVPFSAGPR
NCIGQNFAMNEMKICLAQVIRNLRLYIDGDSPVPKMHPMLVLQSKSGIFIKFEKI

>SEQUENCE 118, 140 N-TERM TO EXXR 381 AA 35% TO 3A5 no introns
MYIPSFLSVETLVLLITSLVLVRIYIHKKWQFLKNINIPHIQPTILQLGTVRPFIGNSD
WMMNSHEYCKRKLGKIYGAYLFLTPQITVADPDIIKQMYIKEFATFPTRQTSLTKINGK
VMNTTILMTEGEQWKRIRNTLTPSFSTAKLKEMLGIMEDCCNHAINILIKTTQNDQ
GKFEAKSVFGKLALDITCSAAFSVDTHPQELKDGQEPKIIKMVKQAFGFEISR
SPLVMLCFMFPAFEEILCKLDYSIIPKDTVNYFANLTKSLIDQRKHSTVKPRVDLMQLMLNDQI
SVEDVAKGKSKGLTQLEITSNSVLMVLAGYDTSANALTLLAYNLATHKHVHKKVHEEIDQ 
MLEKYGSLTYEALTSMKYLGMCLNETLRLYPPIAVNSRIPNRDITINEVFLPKGIQVIVP
VYGMSRDEEIWEEPLKFKPERMEDMRSVDPMIFQPFGGGPRGCIGMRFAVLEIKLAIAKIL
QDFDLDVCENTPKPPVQLEFKSTTLKPKGDIYLRVTPKSN*

>SEQUENCE 112 15 exons 54% TO 3A5 note 112 identical to seq 119 up to VDSQ
MITPLDILSLETWILILTSFILIR (2)
LYIHKKWQVLKNLNIPHDQPTILGIGNLMDFIKNPEF (0)
LFMGYFDLKKKYGPIYG (2)
AYNWLTPTITITDPEILKKIFIKEFSTFPDRQ (0)
IALLDVNGKEMNSALTVVSGRKWKRIR (2)
NTLSPTFSSSKLKEMFGIVEDCADSFVQNIGTINSDVDGRFDVSL (2)
AFGRLTLDGICSSAFGVKVDSQ (0)
AGNNGEESQVAKMARELLAFEFFRNPMLYVF (1)
LIFPWTEKIAEKFDYTIFPRKYVRYFARLIDSVVESKKEKKQ (0)
RTDILQTMIDSQITEEDVKNGAVK (1)
GVTKTEMKGNSIIMLLAGYETTSNAMVFLAYNLAVYKDAQHKCREEIEQVIAEH (0)
GGLTYEAVQDLKYMTQCLNESMRLYSLVPA (2)
NSRYCERDITINGVTIPKGTLVNIPVFGMGRDEEFWNEPLTFNPDR (2)
MLDMNEIDPMIFQPFGAGPRNCIGMRFALLEIKITFAKLLQKFHLDVCEDTP (0)
APPLKVGFKTSMKPKETLYLKVKALET*

>revised seq 232 probable pseudogene of seq 112 ci0100132937 scaf 41
6288 MITPLDILSLETWILILTSFSLIR 6265
     SYIHKKWQVLKDLN
5697 IPHIQPTILGLGNLGKFLKD 5678
GELTYETVQRLKHLTQCLNESLRL YNNVPV
NLRYCERDITINGVTIPKGTSIDIPVYGMGRDEEFWEDPLVFKLES
MLDMNEIDPMIFQPFGAGPRNCIGMRFALLEIKITFAKLLQKFYLDVCEDTP
ASPLDVTFKMSMKPKDQIFLKVTKINE

>revised seq 119, 131 = ci0100140050 scaf 75 exons 3-14 cigd043j06_1 rcigd043j06_6
Scaf 41 N-term exons 1 and 2 runs off end numbering from scaf 41
1002 MITPLDILSLETWILILTSFILVRL 928
115  YIHKKWQVLKNINIPHDPPTILGVGNMMGIIKDPNQ 7
MFLSYFKMKKKYGLVYG
AYSWLTPSITIADPEILKQIFIKEFSTFPDRQ
KALLDVNGKEMNTALTSVTGSQWKRIR
NTLSPTFSSSKLKEMFGIVEDCADSFVQNIGTINSDGDGRFDVSL 3486 
AFGRLTLDGICSSAFGVKVDSQ
AANNGKESQVAKMAREFLAIEIFKNPMVFVF
VVFPWLGKVAKLFDYSVFPKKYIIYFSKLIDSVVETKKKKKQ
RTDILQTMIDSQITEEDVKNGAVK
GVTRTEMKGNALITLIAAYETTSNAMVFLAYNLAVYKDAQHKCREEIKQVIAEH
GGLTYEAVQDLKYMTQCLNESMRLYSLVPA
NGRYCVRDITINGVTIPKGTLVNIPVFGMGRDEEFWNDPLTFNPDR
MLDMNEIDPMIFQPFGAGPRNCIGMRFALLEIKITFAKLLQKFYLDVCEDTP
APPLEVTFKSGMRPKEIIHLKVTAVE*

>SEQUENCE 125, 113, 114, 213, 138 
MITPLDILSLETWILILTSFILIRLYIHKRWQVLKDLNIPHDPPTILGVGNMSKFFKDS
DAGYKRHLELKQKFGPIYGHYIGLSPHVYVGDPEILKQIMIKEFHIFPDRQKNFQNANGK
EFNDNLTAISGKQWKRVRDTLSPTFTSSKLKEMFGIVEDCADSFVQNIGTINSDSNGRFQPA
VVFSKVAIDSICSAAFGVNVDSQ 
SKPQGQEPEVVKMALELFNYPLFKNPFFLIF (1)
LMFPWTENIAKVFDYSLLPKDCLRYFARLAKTVKKNKANVEQ 
RVDILQTMINSEITDNEVKNGAIK
LTEIEVTANSLLMMFGGFETTASAMVFLAYNLAVYKDAQHKCREEIEQ 
VIAKHGGLTYEAMNDLKYIMQCLNESLRLYPPAPMNSRYCERDITIHGLTIKKGITVQIP
VYGMGRDEEFWEDPLVFKPERMLDMNEIDPMIFQPFGAGPRNCIGMRFALLEIKITFAKL
LQKFHLDVCEDTPAAPIDVSFKSGMRTKQDLFLKVTARE*

>SEQUENCE 136, 203 I-HELIX 58% TO 4X1 missing region before I helix
MKAIARFPFPALVRFVSTCPAHQRAAWSKKASQEDLSYAKPFEAVPEPNSY (?)
PVIGTALEYTPLNNFNPSYVGEHWIERHKELGPIYKENILPG (?)
VTETMVFTSSPQNTEIMFRNEERCPFRDPLGPVAKIRDERNEYHGLTNSNGEDWWR (?)
VRQIVNQHFLQNTAVWKYAEAHRNVSKDFLKFIDRNMDEKNE (?)
VPNFGKALNRWSFEGAGVFTLKRRL (1)
GALDIKPCTEIEEMINCNERIFQSMGELLYALPFWKYVNTKDMKVYKTVFIYTCNKFCESYLVLID (2)
YIHSFVNIICSEKVILMTEFLVAGIDTTSNSAAFLLYALASNQEAQERLREEVREVNRMDTIDGK (?)
LLHNMKYLQACLRETQRMFPFVNSSPRRFRKDLVLSGYLIPAG (?)
HPHSDTSNTVNSRNPEYYDDPNTFIPERWLETKS (?)
FKPYIRFIMSGPFGLGMRMCPGRKFSIQELQLLLITLLSNY
RVEYHHDPIKVAFRLVSFPSQAPQFKFTKLKN*

>ci0100142705
LSYAKPFEAVPEPNSY
PVIGTALEYTPLNNFNPSYVGEHWIERHKELGPIYKENILP
VTETMVFTSSPQNTEIMFRNEERCPFRDPLGPVAKIRDERNEYHGLTNSNGEDWWR
VRQIVNQHFLQNTAVWKYAEAHRNVSKDFLKFIDRNMDEKNE

VPNFGKALNRWSFEGAGVFTLKRRLGAL
ICSEKVILMTEFLVAGIDTTSNSAAFLLYALASNQEAQERLREEVREVNRMDTIDGK
LLHNMKYLQACLRETQRMFPFVNSSPRRFRKDLVLSGYLIPAG
THILNTSNTVNSRNPEYYDDPNTFIPERWLETKS
VTERQRERFIMSGPFGLGMRMCPGRKFSIQELQLLLITLLSNYRVEYHHDPIKVAFRLVSFPSQAPQFKF

>ci0100143467 complete seq 145, 120, 123, 197
MPTLIADLVLLFVLYLLGKAVVRNWKTIAAGRRMSKIIGSPPAQWIVGHAYS (0)
LAPTPEGLLSNVKTAASYPGMMVLWLAHKWIVQVYHPDTVGKVLSTQ (1?)
ATKDDMTYEYLKAWVGYGILTSNGERFQRQKKLLVTAFHQDLIKQ (2)
YSPVFTKATDRLIEKWTEKQNQSVELFEDISLLTLDCILICAMSTDLDCQRIG (2)
KQHPYIDAVHQLAYLAIKRFYNPWVRHDFIYNLTSEGRLWKKLLKVVHE
MSESVVKARRKVVSISDKPKLSKTKYVDFLDVLLRAK
DESGEGLTDQEIREQVDTFLFAGHDTTAS (1)
AICWFLYNLACNPEYQDKCRKEVKEVMEGRKDVQM (2)
FDIGDMTYLNMCLKESMRLHSPIPFIGRKVKNPINLRRPN (?)
DNTEVTLCPGTTIIPHLLSLHRNPDVWDEPE (0)
VYDPERFNTKNASRLHPYAYIPFSAGPR (2)
ICIGQNFALVEMKLTVAKLLLNYEFYVDEDIPKPQPIPNVVLRSQNGLYVKFRKI*

>CIONA SAVIGNYI BEST MATCHES TO ABOVE SEQUENCE
TYQYLTGFVGYGLLTSNGVRWQRQKALLSTAFHQDLLK
YSPVFTSATDTLIEKWGKKQNRSVELFEDISLLTLDCILICAMSTDLDCQRIG
EDEKGLTDIEIKDEVDTFLFEGHDTTSSG
IAWALYHLAKYSEFQAKCREELTEVLGDRKYVEW 
                     IREIMEDRKNVQL
FDVGEMSYLTMCIKESMRLYSPIPFIGRKIKNKLNVHRPNGASDVTLDADVTVIPHLFTLHRNPEIWDDPEV

>SEQUENCE 147, 149, 153, 163
MTLFEAASEVVFLNRGFVLSSLISEVVLLIAIVAVTKVYLV
PAIRYYKRGLLYTKQGCSPPRHWLWGHLTVFPLSHDGMLKRLRTTAQYAGMFVSWAGPFT
YWIETQHPTTCAVLANSSAPKAEFLYGCLRPWIGDGLLTSKGSKWKRNRRLLTPAFHFEI
LKPYLNIFNKSSYAMLEKWKRHSGQ
SYDVYGDVSNLTLDTMMQCAMSTDTDSGGEDGGYGYINAVHELTLLVMERVYNP
LHMIDWIYHFSSNGRRFRKLVDFVHETSETIIKERKKELETLVQEDNDNASTATNNELSS
FKKRGTKHLDFLDILLQTKDENGNGLTLKEIRDEVDTFLFEGHDTTASGIAWCLYNLAKH
PEYQQRCREEVLEEVGEKEKIEWED
19  LSKLTYLAMCIKESLRLNPPVFCVGRELNEDLTFNDKYTPCKKMKVESGNTVLMQIMSMH 198
199 RNPHVWENPE 231 (0)
297 VYDPERFSAENMKNRHSMSFLAFSAGP 383 
156 RNCIGRNFAMNELRVVIGQTLRQYKLWCDDDTPEPKMQPNIILRSANGIHIKFK 

>SEQUENCE 148 missing exon between HTK and GIA
MAWTSVGGIVTTTAVGDFLLVACIVVVIKSFIIPAVRGYQDAKKRARSVGEVKKHWLYGSVKM (0)
FPQNEEGLMKRVASSFEKPVWTVNWFGPYISEVVIHHADLAITLLSSS (1)
APKNNLVYGFFHPWIGDGLLTSSGRKWQRNRRLLTPAFHFSILKP (2) 
YTNVSNACVRVMLDKWSKKVGTSMEIYLDVNLMTLDTILQCAMSTKSDCQNRS (2)
KKNEYIEAVHDVSKYIMSRVHKPLLHIDWIYWLTAEGRKFKQLVKVLHDQSEK (0)
VIRERRKTLENRKFEEESSGKKRLDFLDILLHTK (0)
GIAWALYNLAVNVDCQDKCREELKSVVGDKENIEW (2)
EDLSKLSYLTLCIKESLRLCPPVPFIGRELNEPLKF 
RSKLKEPNETTIDAHSNIALHIFTLHRNVHVWDSPE (?) 
EFIPERFKPENMKGRSPHAYLPFSAGPR (2?)
NCIGQNFAMNEMKIAIGQTLRRFKVIPDESSPKPSITPQVVLRPKDGIFIKLMKI*

>sequence 151, 152, 135 ci0100151682 scaf 357 ci0100147473 scaf 996 rcilv086f16 
MSFALANGVLTSTVVGDIALIFATLFVILNYILPIAKLRQSSSNLEKQVGLGSFKKHWFFGHLRLFPPLE
HGLLKRAQRTIQTPAWVTTWFGPLVSEATTHHSETAITLLASSAPKNEYVYKFLRPWLGDGLLVSNGKKW
MRNRHLLTPAFHFSILKPYAEISNNCIKTMIEKWTTKTNVAFDIYKDVNSMTLDTMLQCAMSFK
IEGLKER
DENGIGLSDVEIENEVDTFLNEGHDTTA 
SGISWSLYMLAQHPEYQQKCRDELEEVLKDKEDIDWSNLTKLQYLTMCIKETLRI 
RPPVPAISRQLDQPLTFKSKTHNVPPTTLAKGSFAGVQIFSLHRNVHVWEEPE 
VFNPDRFTQDNIKKRSPHAFLPFSAGSRNCIGQNFAMNEMKIAVGQIIRKY 
ELYIDDDTPDTVMETALVLRSTNGIMIKIRKI

>Cs151 73% to seq 151 (C-term) 69% to ci0100151682 (N-term)  40% to CYP4T5
MVWFGIQGVVTSSVFGDLLLLICMAIVIYNYVTPAVKTYQASNKTAKKIGKVKKHWFWGNLKM not linked yet
GLLKRAQRTISSPAWVVNWFGPLLAEATFHHAETAITLLSSS
AASKNEHLYKFLRPWLGDGLLVSNGKKWQRNRHLLTPAFHFNILKP file 33
YAQISNDCAKTMITKWTKKTGVAFDIYTDVNQMTLDTMLQCAMSFKIE file 9
EKNIYLQAVRDLTNLTMERITRPLLHIEPIFWLSSTGRKYKEALEIVQGQTNR
VIQERRAELKTSKEVHESVEGLDQVKVTRKGKYIDFLDILLQTK
DEDGVGLSMQEIRDEVETFLNEGHETTAI 
GSSWTIYMLAMNPECQKKCREELSDVIKDKEDIEW 63%
SDLKKLQYLTMCVKETLRLRPPVPAISRLLDQPLTFKSKTHQVPETTLEKGSIAGVNIFALHRNVHVWEDPEV 83%
FDPERFNPENTEQRSPHAFLPFSAGS 70%
RNCIGRNFAMNEIKIAVGQVLRKFEFYIDDETPATKMATSMVLRAKDGIMLKLRAI 66%

>sequence 155 complete 80% to seq 110
MLLFYLQYAVYATVCAFAVFLLPSVFT (?)
SLKRFYTVRKNLNSQWPGPKGNWFW
GSSAKVIAEVQKDSAYYFTYIKEMTKQFPQGFRGQLGPIFLSLNAYHPNMVKAVLCAPAS
DAPKLNRAYKMLFEWLGHGLLVLEDKVWFRHRRLLTPSFHFEVLKPYVKTMNESAHVMVE
NWLGKTSENKVVEIEIFHSASLMTLDTILKCLMSYQSNCQEEKSTNTYVAKIYQLSNTV
VKRMRDLNLFKRIDPIYYLTK
NGKLYKKLCDDVHKFSESVIDRRKCDIGQPAHNGTRKHFDFLDTLLKARDSDGKGLSDSE
IRAEVDTFMFEGHDTTASGISWTFYCLAMHPEHQEKCFQEIEKVMADRTDIEW 
NDLSNLPHLTLCIKESLRQYPPVPIIFRKLNKDIEVDGKTIVK
DTDVILHIYALHHHEEFWKDPHIFDPS
RFTQENMKSMNSYAYVPFSAGPRNCIGQRFAMNEIKIAVAQVLRKFQLKPDLSKKIQHSV
DVIYRATTGLYLKAERRV*

>sequence 109, 124, 161   82% identical to 129, 156
MVAQSQLESILLQNGGGFIFQTLLADVLLFAALVFVTAKYVSPAVKHQFKMKKCGKEVGG
PKGHWFYGDLLE (0)
NGYDKTGLLNSGKRCEEFPAMCTQWLGPVQMNVQLHHPDVTKTVLSKS (1) 
HPKGYAYDKFLKPLLQDGLVTSKGDLWKRHRRLLTPIFHFDILKS (2) 
HVQIFNQNTTELLDWMEEASKSDVGGNLMIPCSNMAFNNVMECVMSQKCSAKEG (2)
EISTYTGYVKSMSNMMFQRLSRPLLHSDTLFRVSKLGKRFSIAKKGIEDYASK (0)
LIKERRAILSSNSPNSELTTTNIKDEVTGTSFNFTRRKGKMVDFLDILIQTK (0)
DQDGVGLKNDEINDEVITFFSAGHETVSN (0) 
AISWGLYTLASNLDCQTRCREEIKSAVGDKETLEW (2)
ADLSKLPYLTQCIKEAMRLFPTLHVISRQLSEDVTLSHRFNDYKPIIIPKGVCVAVNIFAMSRNPHIWEEPE (0)
KFNPDRFTPENTSKRSPHAFIPFGAGPR (2)
NCIGQNFGMQQIRVVLSKVLRKYEVYVDETLPKPEMSPGVTVQPKDGVFIKVRKLTTQ* 

>sequence 129, 156 this gene has 11 exons 36% to 4X1 (in the CYP4 clan)
MVAQSQLESLLLQSGGGFIFQTLLADVLLFAALVFVTAKYVSPAVKHQVLLRKCAKEVGGPKGHWFYGDLLE (0)    
YGYDETGVWNSVKRCEEFPAMCSQWLGPAQMNVQLYHPDVTKTVLSKS (1) 
YPKSYAYNKFLKPLLRDGLVTSKGDLWKRHRRLLTPIFHFDILKS (2)
HVQIFNQNSSRLLEWMEETSKSDVGRDVAVPDCDMAFSNVMECVMSKKSSDLTG (2)
ETTRYTGYVQSVSAMMFQRLCQPILHSDSMFKVSGLGRRFAVAKKGIEDYANK (0)
IIIERRAILSSNSPNSELTTTTVKDDVTGTSFNFNRRKGKMVDFLDILIQTK (0) 
DQDGVGLTNDEINDEVITFFSAGHETVSN (1) 
AITWSLYTLASNLDCQTRCREEIKSAVGDKETLEW (2)
ADLSKLPYMTQCIKEAMRLFPTLHVVGRQLTEDVTLSHRFNDWKPVTLPKGSSIGVNIFAMNRNPHIWDEPE (0)
KFDPERFSPDNTANRSPHAFIPFGAGPR (2)
NCIGQNFGMQQIRVVLSKVLRKYEVYVDETLPKPVMVPGVAMQPKNGVYIKVRKLSKE*

>scf/ciona01/G126/seq_dir/hrs/G126P60510F.T0/G126P60510FG11.T0.seq    722 exon 1
MVAQSQIELLLLQSGGGFVFQTLLADILLLVVLAVVSAKLVVPTVKHQIKLKKCADEVGGPKMHWFYGDLIK (0)
YAYNEESVWNSIKRCDEFPAVCRQWLGPFQMNVQLYHPDVTKTILAKS (1?)

>scf/ciona01/G126/seq_dir/hrs/G126P67089F.T0/G126P67089FA7.T0.seq    734  exon 8
AITWTLYTLASNLDCQTRCREEIQSALGEKSNVQW 309

>sequence 158, 146 almost = seq 130 
MLLPDYLEFSLSLYSIGAVCVVVFLYKSRKLIGRFLKERRIRKELTKEIDGPQPHWLFGHLNL (0?)
LQPNEETILKTCEQTQKYPKLGMVWFGPALAYLMVFHPHVVRPVLNVE (?)
HPKDNLAYRFIKPWIGDGLLVSHGSKWRRNRHLLTKAFHFDILKQ (?)
YTKVFDACCKTMLEKWSRKCDGSTVEVFRPVSLMTLDSMLQCAISCETECQQEG (?)
NRNSNTYTDAVFTLTTIAMKRIFNPFCHVDWIFAMTKSGRQFKAAAKLIHEFDEK (?)
VIRERRQENKKILGRSDDEDLTQEDLEKITSWRKSDGRYLDFLDMLILTR (?)
DDDGKGLTEREIRDEVDTFLFEGHDTTASGIAWSLYCLAANEEYQEKCREEIEQVVGSKDALEW (?)
QDLSKLPFVTMFIKEVLRLYPPVFAVARRTEQPVKFPRGFGRDQLNVGDEQ (?)
LPKSDCSRVVXAPLNIGIPIMTLHRNEQVWEDAK (?)
VFDPYRFSTENCAKRSPYAYLPFSAGP (?)
RNCIGQNFAMAELKTVLVRTLLRYKLYVDNQCPDPKMSPMIVLKSLNGIHIKLKKI*

>ci0100150610 seq 160 and seq 159 with N-term and C-term added complete
MQSSGLSSIWDQLQINDWLPSSFLVETTLLILLIYLITHYGLLFRETYTKSRKISERVGVDEKNRHWFFGHLKH (0?)
ITNNEEGLLKFVELAKAFPLMVSLWLGPALTTVQTYHPDSTTALLNSGAPKNDLGYRFIR
TWIGDGLLTSKGKKWFRNRRLLTPAFHYKVLKP YTEVFNKNSLVMVNKLNKLCDQPVNIYTHVGLMTLDT
MLQCAMTSKTNCQHVTDHPYLLAVKEISDLIIARVRQPLHFFDWM YNLSAAGKRNKEVTKILHEHTEKII
SERKAVLAKTANIDPEPSDVEGFRKTEGKTLDFLDILLQCKDENGEGLSDIEIRDEVDTFLFAGHDTTSS
GIAWAMFHLAKYPQFQTKCREELKEVLGNKDEIEWSDLPQLVYLTMFLKESMRLRPPVYAVGRYLEHPIT
IKDKNTKRDTVIPAGTNVSMHIFTLHRHPDFWDEPSKFDPERFTKENIAKRPAFAYVPFSAGSRNCIGQN
FAMNEMKISLAHVIRNLRLYIDDETPVPKMQPRLILQSSTGIFVKIEKLERQ*

>SEQUENCE 190, 162 36% to 26A1 fugu ESTs rcicl022k09 rciad056p18
MVESSFLYPDGGGIRKAITNISSLEVPSNRTLINYLESEDFFYFLFFLIKYI
GVPCFLMHVICCLWEIRCVRMKDHSCPLPLPRGNMGFPLVGEMFHLFVV (0)
GANYQRQRRSKYGKIYRTHMFGSPSVAVIGEEHVKKIVLGE (1?) 
GTLVQTRWPSTTRRILGTDGIVNGDLATHCRIKRLALKAFSPKYIGTYAPVIQAGVAAHV (?) GCiWno735_j03.b1
QEWVDQGTVHILEKCKQAVSKTMMTV
LLGIKSNDPDIQLYIQAADDIINSILSLPLDLPGFGFHK (?) GCiWno735_j03.g1
GLKARELIFNKLESQLKSKFNATDDELFGDSGGEFESVL
LNILKFARREGETEIKLRNLQNLSLEFIFAGTQSLQCSCSLVTFHLSQNPQV = seq 162
NFECREEIKAAGLWDTPLTEINVERIQSLRFIELVAKETLRV
TPTIPGAVRVALKTFELGGYQIPK GWQIIYSIRLTHEEAAGKVLQN
GSKCEFEPGKHFTSTCKNTDVKKSGEDKSHECLKQQSPFAFLPFGKGPRMCAGKNYGMLFLKVLIFELVRTA
DISLEGKCKIIEVPMTRTDKSVKIKFTSIKKLKPENSVNDASSSTAV*

>SEQUENCE 164, 165, 171 53% to seq 169 SCSSTDSDASSPYLNTDGRKFFGQ extra
MWFVMVASVEQYLSKLISCRVRNTSVAVIPTCVKT (2)
RQGEPPLVNHTLPFIGAALDFGKDPLNYLRNLQSKFGDVFTIKLAGW (2) 
DTHVFMNPLDVRSLERAKFLGHQEIIGDFVTRMVGSPH (1)
ILSPAADVQKKLTHSIEKNLRGSMQLNLLCSSCKPVVVNALHRTLGHTMNHHKP (0) 
IKLHEFSKKLIFDITFNLLFGSLPDYERASEETERTYDVFKTYFKGSAL
LVDRIPIHLLPETKKARGIFLKMMEELDWSKRENVSKLIDDIVKVNPKL (1)
EFLKQRARHLMVVLWAAQANTTPALFWTLFYLIANPDAKRAVLEEYEQL (1)
RWPTMKDILQIERKDLDKLVIL (1)
DGCLRETMRLTGASMSIRKATRDESIKTSDGKQYSVRRGDYTAFFAPVTHMDEDIFEKPQ (0)
DFVHDRFLVSVDDTSS 
LKPRSRFTKNGQRVAVSRAIMTFGYGVTRCPGRHIALIEIKLAVLAMLGHF 
NVQFVDSNTKPPSFDLTHLGFGVMPPDTDIEVLIAPKCC*

>SEQUENCE 169 53% to seq 164 had wrong exon 1, switched with seq 164
MWLLLVASVEQYLSKHTSCRVRNTSVAVTPTCVKA (2) 
RAGEPPLVSHWIPFIGSALDFGKAPLEYLQSLQKRLGDVFTIKLAGW (2)
NVHVFMNPHDVPNLERAKNIDHKNVIGDFVSRMVGDEH
MIHNPARHSDAVPKILESTLRSTMHLMTLCDSCTNVLPKTIKRTLKSEFNCKRETK (0)
IGLYELCKKLVSDMTLNLFFGSLASDEKCNEETQQICEEFFVYFKSSALLVDPIPIHLLP
ATKKSRGRLFELLSRFDFENRDNVSELIKKVAAV (1)
GSSEEKVRHLLIILWSAQANTLPALFWTLFYILNDKTARIDVLEEFEKHLRKT (0)
WPNLSDIQRMRKGDVDKLVLL (1?)
DSCTKETMRMTGSSLAARRAHADTTVTLASGQEYKIRKGDYTSYFAPVTHQDADIYDKPE (0)
EYRHKRFLQHDDKDPSMLKSRTQFSKNGKRISPSTSIMTFGYGAVRCPGRHIALMEIKFLTLVLLRHF 
DIKLVKNYSRQSEDYPQFDLTHLGFGVMPPSRDVDVTLSI*

>SEQUENCE 166, 170, 172, 181 36% to 7B hum 
MILEALTTVLAALVSSYLIWLVAKRKQKDGEPPIVPFWIPWL (1)
GSGLQYQKDPITFLRKQEKKYGPIFTVFM
GGSYFTFVTDPFTHPIIAKESKVLNFRICTIPTIKKVFDFTRRGVDVPTMGGLMVKYMQ
GHWLETLVSKMMGNLQKVLLHGSYNNTEWATTGLLHLVETVMCTAGFKSL
FGEYPASSGVDNNKLHHKIVEDFVKFDKAFPLLVQGRPISSLK 
GVKESRANMWKFFTMDEIHKREMVYEMVEQYAKKLEEDQELEQLKKELYCVNWAAM
SNTLLSAFWALYYILKHPEAMEAANVEIQRMLKETSQCVDGKSEINFTRDDMAKMTVL
DSIINETLRLDISSSLLRQALQDYNLKVVSADRSYKIRKGDKLMTLFQLTHYDPEIYEQPDQFK
FDRFLNSDGSPKTDFYKKGKVVKFFLQPFGSGV
SKCPGRYWAVTELKQLVILLLLYFDFKLINPKQEIKMDQIRVAFGSLPPNIDPQVQFKRRVIV*

>CYP7 like Ciona savignyi one intron shared with human CYP7A1 ortholog seq 166
MLVSIIYTVLLLLLSSYAIWFIAKRKQKRGEPPIVPFWIPWL (1)
GSGLTYQKDPIAFLKKQTKKYGPIFTVFM (1) 
GGNYFTFVTDPFTHPLVAKEGKILNFRICTLPMIKHLFDYQRPAVEEPTMGGLFVKYLQ (0)
GQRLNVMIEKMMKCLQRIMLAGKYTNTDWSEVGLLHFVESIMCTASFSTL (2)
FGDYPKSSNKDIPAFEHQMFEDFRKFDEAFPYLVKGYPLNSFK (0)
GLEESRKNFWKHLASDELNKREMVSDLIAEYARICDAEKDRTLPKKLFSTAWAAQ (0) 
SNTMVSAFWTIYFVTKHGDAKAAAEVEIQRLLKETSQTLDGKTPINFNRDDMDKMTVL (1)
DSIINESLRLHVSASMLRQALQDYDLKVLSVGKTYRLRKGDRVMSLFELNHKDPEIYS
NPEVFKYDRFLNEDGSLKTDFYKNGKVVRFALQPFGSGL (2)
SKCPGRYWAVTELKQIVILLLLYFDFELINPNEEIKMNKNRVAFGSLPPTNDPKIRFPSTV*

>SEQUENCE 167, 173, 174, 176
MLSVILAFFVLILATYFLWNSRKRQQRDGEPNIVPYSIPWL (?)
GSSLPFYKDPVKFLKECEKKYGSVFTLFM (?)
GGQYFTFVTDPFVQHIIVKEKKVLRFRNCLPPVLKHAFDYYPTAVDDEMMKLTHHYFQ (?)
GKPLGLLVKQMNTNIQKWILDGNQGDAGDKWETVGLLSLVQRVMGSASFVSL (?)
FGENPKASDTLDVQRQLLNFLTVFPSLLRGLPLSLIK (?)
RSREYRQDLWNSLTKSKVTKRKEIFPIIDEI VTLHEQNGENKLLPKRLLPVLVASQ (?)
VNTNPSSFWILYYLLKNSEALAAVKDEMEKVLQERSKNPGNNFVCFTKEEMETMMTL (?)
ESAINESLRLSISGGMLRKAATEYRLEIPDYGQTYTIRENDMVLVWTQVNQMDPE
IFEEPEVFKYDRFLNEDGSLKTNFYKDGKLVKYAFQPFGDGS (?)
SKCPGRYWAVLELKQLVVNMLLYYDIQLIDSQEIKMDKGRVGLGALNPTKDALMSIRRKPLC*

>Ciona savignyi ortholog to sequence 167
MLEIILSILVLLAVSFLIWNTRKRTAARGEPPIVPYSIPWL (?)
GCALPFYKDPLQFLRDCQRKYGPFFTVFM (?)
GGEYLTFVTDLSAQQTIAKEKTVLTFRKSLPPVLKQVFGYDSNSVDDEYVKVFPKHFQ (?)
GKALENLVSKMNKCLQNFIINGKYYNEEWMTVDLRTFVESIICPASFVSM (?)
FGEYSSSRSDKMTEVERNILKFTDLFPSLIRGYPISILK (?)
GATACRQYLWKVLSKQEISGRKNVSGVVEETISLHEQYGESDVLPKRLLSFLTATQ (?)
VNSVPSAIWLLYFLHKHPEANKVAREEIERVLKESKESTSKDVKIEVTNEDLKKMTVL (?) 
DSIINESFRLHLAPSMFRIATEDYFVEVAELGRTYKIRKNDKVLLWVQMNQMDPDIFE
QPEVFKFDRFLNKDGSRKRDFYKNGRLVKCAFQPFGTGL (2?)
SKCPGRYFAVNELKQLAITTLRFFDIELVNPQAEIKMDKNRIGLGALYPATDIKIRIRRKADC*

>SEQUENCE 180 39A1 LIKE
MKVPGTIGYPIIGDKSREFLSDPTKFITSRINEH
SSRIFQCRILNNPTIFVCSPQAVKKLLSKEMANDIEHGYKSILHRLYGDNIVFENDVTAN
ELHNCLRNCFQNMELYKETVNRLCE
PLLSSLHLRTEPVYIYPEFKRVMTRMCLELFLGIKTESCLDQTEEITHLATQHWHGIISV
PVSLKIPLAGESTYSLAVQAKEKL
LSIIRSILLGQNGQGDASVAAKLRHADFSNSKSLEQHLLLFVSAVIPKAFASLITSFIIV
MAGDDKEALRARARTDSKFLDRVLLEVERMWPPLFAGRRLVRRTTTLESYDLPKDYAIMY
VTHAAHRDPEVFPEPNSFNPDRWITCNAGQEDLVLCFGSGPRCCVGTDLIRRILRDVASR
LLQNYEWSLAPPNQDTSYKYLPVARPAVQPSIAFSSYLTNDEPFFDACG*

>ci0100147340 seq 182 4 diffs blue = intron?
MLSCLQKASRRRSFDYFKVICIVRLQHAQ (1)
VQSKETAVKDFSEIPQPKSLPILGTFLDYTKFRGFNVANLSNLFKARFDELGPIYKETIIPGLSEPTVHCFSPQD
NEKLFRLESRLANRDPIEFLKVVRERIG VPSGLVNIQGEEWYQ LRRIVNEHFLTNSSVWSYSKDIHEVSE
DFVDYISRSLDSNNEVPNFALALNKWALEGAGVFSLDTRFGMFTGEVEEHLQEIVDNSNLMFQLMAELTV
APPFWKYFETKQIRALKQCQAAQVAAIKKQRDILLKRIKEKDPEKLTKLEKLMTKKDLPQGEVDI LVSDMLGG
GVDTTSNAAAFMLYIMAVNPDKQEILREEIKTALQSGKVDGKSLQRMKYLHACALEAQRMFPLTPGTSRR
LPVDVVLSGYNVPAGTRLLLLSNVAHHKNPEYFDQPESFLPERWLERKHKSKFAMIGSFGMGARACPGRK
FAMQEVHYLLISLLSRYKVEYHHKPIKPHFRLVTVPSEEPQFTFIPL*

>ci0100131379 Correct order of fragments 57% to seq 182 new seq
MIRKWQKRGFFTNFYQNAVTSNRRQHIE (1)
AKTASNINAAKTKPFSDIPEPKGLFGTFLDYTALRGFKTEELYSHYQKRHKKYGPIFKENVLP (1)
GLTSYDVFCSSAEANKAMFRLEDKYSNRGPLDFIQIARENAGVSSGLLDPSWEEWFK (0)
FRKVANRYLLSNASVWSYSSALHEVCDDFVDYIADNLDENNE (0)
FPNFALALNKWALEGAGVFCLDTRFG
RNFSGQLDDNLKVIMDQVIESFTLMTDLFLKPPLYKYITTRKMK (0)
ALTSVVKKQSEAIQKQRVKIMNQMKEKTPGSKTKLKTVLENSDLNEE (2)
EIASLVSDMLGGGVDTTSNAAAFMLYIMAVNPDKQEILREEIKTALQSGKVDGR (0)
SVQTMKYLHACVQETQRLFPFSFGVARMFEKDIEVCGYNVPA (1) 
GTRILLTGNVVNNKDAKYFKDPETFLPERWLERRNSESK (2) 
FATTGAFGRGVRACPGRKFAMQEVHYLLISLLSRYKVEYHHKPIKPHFRLLTVPSEKPQFTFIPLS*

>SEQUENCE 184, 212
MNDVTITWWIVISALALLAVKMVDDNSKRKKRSQNFRSPPGPKGYPILGTFPSLIRKPA
HIVFGEWSKKYGNVYQFTLGAGTSVVLGDTATVRLAMLGKSPDFDSRCQWRYCFN
KMFRGLSVLPYSFKWVNTRRRTLTILRNIGMGRSLMEGKIHEETQCLLKCMLDSSQDGKV
NFEPRDMLRRAPMNTIWSILFGKRYDYSNPYLLYVMNTVVWLVDFLAKASVFDVLPSSLR
HLAGFSLETFHNKLETLHAFMTERVDDHAKSYESDVTRDFVDAWIKAKKEDKLKIQDSED
SNDISSLPDVIQDLIVAGAHSTGIVFNWAVLLLVLNKEEQDL
CYREIMDNVGPTREISMSDRSKLRRVDAVMHECMRIGTVLPLVAHATARDTTIHGYNIKAGTE
VIYNNWFINRNEKVFPDAEKFNPDRWIKKDGSFRKELLQHFFPFGIGKRSCAGESLA
RANLFIVVATLLQRVELLPPDETASPEMLPEVKHQFGLLAECKPFLMRVRCRPTFVPSQ*

>SEQUENCE 186, 187 CYP20 ortholog 29% to fugu CYP20 CYS = HIS
MLEVHVTITAVAVLAVILLLFLIGLLFYIYPSSN (?)
KETTVPGPAVSDIR (2)
DGNRSDIDRAGSLHQYLCNLHAEHGDLASFHFDKQLIISIADSKYFAPRSQVFQLP (1)
DFLYEFWEPMIGKESFVFSQGVDVFKRRDNIDFKVDKKVKEFYAKIF (0)
RLLAIELEEKWRKQPEDQHIPLTQHLKGLATKAVCQMMFGESFADDERVIKLYKAWDK (0)
CWNEVMAASKMEPTKSESDEKK (1)
SIKELRSLMEKAIEERWKKSKHLDGITLLDGLLASSNSSVSPLSSEQLLCDVITMFTMGVQVTNA (1)
VLCWSVYFIAAHDDVQTKLRNKLNEVESLDIMPQSLNDVK (2) 
YMQSILKESIRCSGVFSYSGRVEDIDITVGDHLLPKG (0)
TPIIEALMFPMQSDAFESPDM (2)
FNPKNFSEATKSVNPLTYIPFAFGGSRHKPQDS (2)
LVYLLVSITVASLFRNLRISLAGKEVDPQSANYESFVALPSTEEVWVTVGGKLDVETVGENN*

>ciona savignyi sequence for CYP20 ortholog 72% to intestinalis seq 186
MLEVHVSITAVAILVLIILLFLIGLLVYIYPVI (2)
KETTVPGPDVSDAK (2) 
DGNKSDIERAGSLHAYLCNLHADFKHEGAVSFHHGKQLVISISNPKFFAPRSQVFQLP (1) 
SFLYEYWRPLIGEESFLFAKDIDVFRRRDYLDFKVNRNVKEFYAKNF (0)
LAVELETKWEKQPEDQHIPLTQHLKGLATKAVCQMMFGDNYADGTKVLELYRAWDK (0) 
CWNEVMAASKMEPTKSESVEKK (1) 
SISELRKLMENAIEERWNKSKSLDGITLLDGLLSSQTSDSKDQLTSEQLLCDVITMFTMGVQITTA (1) 
VLCWSCYFVASNKEVQRKLKNQLNESKSLDIMPETLMDME (2) 
YMQGILKESVRCSGMFSFAGRVEDIDITVQNHLIPKG (0)
TPILEAIMTEMRSEAFSCPDV (2)
FNPKNFTKDNNHLNPLTFIPFAFGGSRHKPEES (2)
LVYLLAAVTVSSLFRNLQVSLAGKEIEPFAANFESFVAMPSTEEIWVTVSKEKEVENVG*

>SEQUENCE 192, 210 CYP11 like sequence revised added exon 2
MGTFGIGWRLRSQWRVAAQNVKHQITQALSLG
ETKTSVREVQSYSE IPGPRGLPFLGSVLEYTAIGKFSPKQFHKALRSRHG (2)
RYGKIFRERLLQDEFVYVSDPKDIRVLVQNEGKAPHLEAFW (2?)
GNDWLRSRTSINGGISCPRALKTYRAGQQVIAEDFVELIRNKRDATTGEVPQF
EQWLNRWALESIAIAMLNTR
LGLFDAGQSKKNRPAEKLRECMDIFFDYGARLTFGVPMWKIFNTREWKTFCQSQVDEFENAGYFVNK (2)
KLKQLKEERGSQEASKQPPSCLLEHLISKNSE
VSQMEATTICLELLAGGIDTTATAAVFTLFELSRNKEHQSKLRELANQETDEG
KNNSQFSRYLKSCIWESMRLHPLTYANMRKTETDLVLSGYKVPAGTTVRYTSHLMNLKDE
KYFPQADEFLPERWLDRKSENRCKEQFTFTPFGHGARQCPGRRIAEQELDLLFRQLIRNF
NINYEHEDIKTKVRLFNAADRPARFSFTPISKPITEIQQETN*

>Cs192like revised added exon 2
MLRTYAFGLEAIHRWGTWFQTKLGATEKFRETSTKVGKFTAIPGPMGLPLLGSVLEYTPLGRFSPRN
FNKAIWSRHQ (2) 
RYGNIFRERILLDEFIYVSDPKDIRVLVQNEGRAPCRPHLEPVVE (1)
SDVWLRSRTSVNGAIACPKALKQYRFNQQIIAKDF 
IELIRNKRDSKSGEVESFEQWLNRWALESIAVALLNNRYLV (2)
LGLLNADPTNPNSPAEKLRLCMDIFFDYGARLTFGLPSWKLFNTRQWKIFCQSQVDEFENAGHFVTK (2)
KLNQINNEQRLYKVPPSCLLEHLISNNNNLSQIEATTICLEMLAGGIDTTATAAVFTLYELARNI
KHQETLRLLAKKEKCQHNSEFTKYLKACIWESMRLHPLTYANM
RKTETDLILSGYHVPAGSVVRYTSHLINLKNEKYFPKATEFLPERWLDRKSEYR (2)
CTEQFTFTPFGHGARQCPGRRVAEQELNLLLREIIVNFNVGY
YHDDIKTKVRLFNAADRPARFSFQPI*

>sequence 5, 12, 194 ci0100141208 complete possible fs at YES
MEYCFWYFTIAVFSIYCLLSWQRRPKNFPPGPRGFPVLGIVPYFGKYPEKVLSKFADQYESRVMSFRMGR
MDWIVLNDIETITQ (0)
ALVKQGEDFAGRPKSPLFDMLTNGCGIVFVDGEKWKAQRKFGIMTLRS (2)
LGMGRGSLDDIMNYEFEVLCHFLSSGGSQENVM (0)
TLFRKAVTNVTCSVLMGKRYDYDDPVLNNITEHIR (2) 
PSKTVVMYSPILRFIPPFRWTYKKIVNSIEQVTG (1)
VIRNISEMHKQDFDSSTKRDFVDHFIAEMNKEGEYFTVNYVIM (1)
YLLVAAINTTTAAFGWCVVSLLRHPECQEKIWQEVDKVI (1)
GNETPSCKHQENMPYMRSFIQEIHRHRSIATLNLPHRTVRSVRLCGYDIPKDTP (0)
VLTNIWRVHNNQNVWKNPHEFRPDRHLDSNGNFVSSNNVIPFACGYRRCL
GEQIAKAEIFLFIVNIVKRFHLVVDQVTGPPTLEKDPKGAVNTPAPFRFSVRARKN*

>sequence 196, 32 ci0100152256 complete changed middle exon added N-term exon
MFENVGYYFSW (2)
SLLTSSHVISAFIGVFTLLMFVYYYWWKFPHPRYPPGVRGIPVLGATPFFGKYAQEKLAIWSRDTYGPV
MSARFGQDEAVVLNDYESVHEALVKNLQYFNSRPFVYVAAQVTKGYGIAFADDHQNCMEVRSFSLSALRG
LGIGRRTMETRVSEIAQDLVRTLEDLNEKPTDLKMAIGSTVANVICSVVFGKTYEADDQEYEYAVRCSFDC (2)
YGDPENSEYLSFLFFYPALRHIQPFKRAMEKFVLVQAGLEG (1)
DFCKEIEKHKKHLDENEPADYIDAFLVEMKKHSAQ
DSWFHETELRHC LVDFFIAGTETSTNAILWSLLALIHYPNIQEEIHQELIDNIGEQVVPSIDHRDKLPLF
RAFVQEIFRFKTLLPLSIQHRASHDVEIGGYVIPKGTKVFPNLHAVHHDPNTWENPSDFNIYRHVDKHGK
FIPSKKVIPFGIGARSCLGEKLANVEVFLFLANIIKRFEILPDPESKELPDFKDGINGFIYVPYRYKVVT
KPRIINE*

>sequence 198, 61 complete seq stop upstream in ESTs
MSSKEDVPPKSLPGPWGFPFVGSAFQLGLHKDKTLYEWTELYGPVYALNLGTRLAVALHGY
SAIQEALVDKASSCSGRICAPIIDAMVNQQGLFFVDHDEHWKKWREIAHSGLSNSWKTTQ
SGILTESERLKKYLSEADGKPINIEETLQITMANAVFQTCLSRRFDLDDTEAANFLVDLY
ATNRDPKFGKILFLTFLRFLPSYKSYLIEGDTTRRKKVRKVLRGFIEERLEDDDQSRDLVS
LFFKAGPTPSLENKTLVAGLIQDMLFAGSITTSSQLLWVILFVLKNPACMHKMQNELDAVL
GDCENPTKSAISDKYDELNYTRAVVQEAMRLRCIAPTSMFHKATEDTTIAGYFVAKDTVI
MPNLYSVHHDPITWAPDPESFRPERHLDSDGSFVNSDHVIPFSVGWRRCVGEKIARTEIF
AHLVTIFKHFEVALHPDDVGVDMEGVPGPILRPVSYRTQFTSR*

>sequence 202, 195, 24 ci0100132628 N-term longer complete
MDVESVGCLWGLYSSWVTTCLMGLLTLVILVYYYWWKLPHPRYPPGVRGIPVMGATPFFGKYP
QETLARWSRDKYGPIMSARFGQDEAVVLNDFETMHE (?)
AFVKNLQYFNSRPTFQIVEQYTNGYGFGFADGHKKFLEVRTFSQSALRG (?)
LGIGKRTMEARVSEVAQDLVRTLEDLDQKATDVK (0)
MVIGTVVANVISSVAFGKTYDPGDAEFEHLVQCSFN (?)
CYGDPDNSEHVNFLIFYPAMRHIQPFKRSLEKFYVVQNGIL (?)
DFCRKEILEHKKNLDENEPGDYIDAFLVEMKKHSPQDSWFH (?)
EKQLQHAVVDLFIAGTETSTNTILWALLALIHYPETQEKLYQELLENIG (?)
EQVLPSIDQRDKLPLFRAFIQEIFRFKTLVPLSVTHRASDDVEIGGYVIPKGTK (0)
VFPNIHAVHHDPNIWKNPSEFNIYRHIDKDGKFIPSKKVIPFGIGCRSCLG
EKLARIEIFLFLANIIKRFKVLPDPKSQDLPEFKDGVNSIIYVPYRYKVVAKPRIVNE*

>Ci204 204, 142, 127
MFVHEKMFLQPSSRLLWIQQLS (0)
KIHKLN 
KSLKYMPEPKGLPFIGTLLDYTFLKKFDKTNLASHFIGRHQLLGPIYKEYVFPAFSK
PQVFSTLPDDNEVLLRSEEKISHREPVEFVVSARKLLGWSIGLPFDVGEDWYK
LRKVVNQHFLKNSIVWSHSKQQHEVAEEFVDYIGQNLDENNE
VPHFQDLLQKWALESTAVFCFGVRL
GVFDKSVDDDLNIIIDTNRKKFELIMKGIFSAPLWKFYNTKLMK (0)
KFNKVQIEQLHAIRRQQIKAFTEFPVDQKLIESLQFYICS
SEIEVLIIDLLSGGVDTTSNSAVFVLYLLSMNPDKQEVLRKEILEALKTGKTDGK
DLMRMSYLHACVLESQRLLPFSIGTNRRYSKDIV 203
LSGYKIPAGTVVTMTSNIINNCSEEYFDDPNSFKPERWIQVDNKAKDRMRFAMAGSFG 377
HGTRACPGRKFALQEMKYLLISVLSRYKVEYHYKPMKINFLLVSTPSQPPKFTFKQL* 551

>Cs204like
MGNHQITSRPPSSYRSYCS (0?)
MTDFTSL
KCLKEMPEPKGHPIFGTLLDYTSYGNFNFKNLAEHFRKRHQNLGSIYKEYIFPSKFL
PYVQCTSAEDNRVLFQAEERIAYREAVESMVVSRRRMGLSIGLPFSQGEEWYR (0)
MRQVVNRHFLKNSTVWSHAQEQYKVAEDLVKYINENLDHNKE
VLHLNDALQKWAFESTGVFCFGMRL
GIFDTDADEDLDIILETNRQKFLLIAETMFSPPLWKLFNTRLMK (0)
KLISVQFNQLAAIQRQREKVMNNFSPSPELARVTRLFKNSKLTK (2)
SEVDMLIMDLLAGGIDTTSNAAIFMLYTLAINPDKQELLRKEILEVLESGKTDGK (0)
DLMKMTYLHACILETQRMFPFSIGTNRRYSKDVVLSGYRIPA (1)
GTTVILTSNLVNGRNPSMFPNPDTFLPERWLNNQDNEAKDKMR
RFAMAGSFGHGTRACPGRKFALQEIKYLLISLLSRYKVEYHYAPMTINFLLVSTPSEKPRFTFNPI*

>ci0100130435 seq 48, 56, 230 seq 205 ADDED 10AA AT JOINT complete
MIPSYLIFFEFPIVATVLVLVCLFYWYRRPKNSPPGPRGIPLLGVLPLLGRYPERTLHEWSKKYG
PVMSVRMGRQDWLVLGDHGTIH QALVKQSHTFSGRPHVPTLHRITQGRGLAFVDHGPAWKAQRKFGAVTL
RGFGVGKKGMEDKIVEEVHYLNDAIRQHEGKPFNIL AILSNAVSNNICSVILGRRFDYNDKDFQDIVHKM
TRDFTDPKSTALTNASVFAPVLDNFPPFAQNTNQRVNDAMVVREILLKYVKEHKATFDREDARDFIDAFLKEAELSKEPS
FNDTQLLHYLHDLFLGGTETTTSTLRWALLCLLHYPQTQTKLREEINEVIGRHLIVKVSYSCKVDMPYTC
AFIQELYRYRTLLPLSLTHKTNEKAMLGEYTIPKSTIVSVNLWAVHNDPNTWEEPSMFKPERHLDESGHF
VQSKHVIPFSVGPRHCVGEQLARMEVFIYLVSMVQKFEFLPDPNANELPDIEKGSNGPAFVPQMFDIVAK
VI*

>SEQUENCE 208
QKMSWTHGPIMGVKLLRKFTVVLSS
VEAMQNALVKQGHAYAGRPDSIMFQAFTGGR (?)
GVLAIDGPRWKENRRFGTVTLRKLGAGRA
IMEERIVDEAMYFCEALQIIQKKSFNITKLITHSMANVICGLAFGERFNYDDKRFISTAR
ALVDWFNDNSNVFMLLVFAPFLRHFWPFTNAVKSLKSAREKLYTMLIPIIDRHRETVDDY
DYRDFIEAYLQKISQTRDDPTSFYSDKELLGNVRDLFVGGTDTTVNTLRWTIIYLINR
PEVQVQKKKEIDRVIGPDARPKMSHLDDMHYTN (?)
AFMHEMLRIRTPGPLGLAHRTTEPTKLMGYDIPEV 
FITNFWAVHHDPKLWDQPEEFIPERFLDKDGKFVKNANLVPFSM (1)
GSRYCMGETVARMEFFLFVVTMLQRFHLLLPTDDSEKITEECLR (2)
SGQFLIPPSHNMRVKERFTREQSIFEYNKQMWE*