>Cs4V5a
MAVALFCIVVLLAVIAVLNRREIARLVKNRILLSSIPTMRERAYPIIGHSYLLGGDPAKFFEFI
STGAKAMVTELKEQIAVIWLGPVPVIVVCHPAAAEVILRSSKHMEKSYLYEFLKPWLGTG
LLTSGGEKWKQRRRLITPSFHFNILQDFLEVMNEQSATMIRTLEGKRTAGSAINVGKAIT
MCALDIICETAMGQTVNAQENDDSDYVKSLYR (2)
ISELIQLRQKTPTLWWDAAFSRMKVGKEQESLLRTLHGFTRDVIIERAKTKDSKKCRGDAF (0)
NPKRLAFLDVLLHAETEDGKTLSLNDIQEEVDT
FMFEGHDTTAAAMTWAIYALGRHPEIQQKVHEELDSVFEDDR (2)
FGAITNAQLQKLSYLERVIKESLR
MFPSVPMFARVLSEDCKLGGYSVPKGTQAIIFGYTIHHHPGIWEDEERFEPDRFLAENCV
GRHPYAYIPFSAGPRNCIGQKFAMMEEKVVLCHLLRRYSVISHDKEEDLLINADLILRSSTPLNITLKPR*

>paired_scaffold_15 100% to Cs4V5a
403126 FLDVLLHAETEDGKTLSLNDIQEEVDTFMFEGHDTTAAAMTWAIYALGRHPEIQQKVH 402953
402952 EELDSVFEDDR
402654 AQLQKLSYLERVIKESLRMFPSVPMFARVLSEDCKLGGYSVPKGTQAIIFGYTIHHHPG 402478
402477 IWEDEERFEPDRFLAENCVGRHPYAYIPFSAGPRNCIGQKFAMMEEKVVLCHLLRRYSVI 402298

>intestinalis seq 134, 141 ortholog to Cs4v5a 75% 
MAVALLLIIVVTCVIALHYRKGIARHFKNRLMLSRLPSLPDPAYPLIGHAYLLRGDPS
RFFEFVTETTNAMVTEVNEKIGILWLGPVPLMVVCHPEAAEVVLRSSKHMEKSYLYKFLHP
WLGTGLLTSGGEKWKQRRRLITPSFHFNILQDFLEVMNEESGKLINN
LKKKLKSGVKVDVGKAVTMCALDIICETAMGQTVNAQENDDSSYVRALYR (2?)
ISELIQLRQKTPTLWWDPAFSRMKLGKEHENLLSILHGFTRDVITKRAKTRDQKIVE (0)
NPRRMAFLDVLLHAETEDGKTLSLNDIQEEVDTFMFEGHDTTAAAMTWAIYVIGRHPDIQ 
KKIHEELDAVFGEDRGGTITNNQLQKLSYLERVIKECLRLYPSVPFYARVLSEDCKVGDY 
MVPKGTQTVIFAHTIHHHPYVWEDPEKFDPDRFLAENCVKRHPYAYIPFSAGPRNCIGQK 
FALMEEKVILSKLLHNYFVVSHDKKEDLVINGDLILRSSTPLNITLEARK* 730

>Cs4V5b
MAVTLLIILGAFVVPF
ALYWRKILKEISNRHRINKLPSVNSPAYPLVGHSYMFPRNPDEMYDLITSWLKWVMREAK
QKLAVVWLGPVPLVAIVHPDAAEVVLRSSKHMEKSFVYRFLHPWLGTGLLTSGGEKWKQR
RRLITPSFHFNILLDFLEVMNEQSTKM
LRNLDAAVQSGSKVNVGRAITMCALDIICETAMGRTVNAQSDQESEYVKALYR (2)
ISDIIQQRQKSPALWWDLAFSKMAMGREHDATIDVLHRFTMNVISDRAQSKTEIKVGWIF (0)
DTRRMAFLDVLLRAESEDGRSLSLSDIQEEVDTFMFEGHDTTAAAMTWTIYLIGRHPAIQARIHEELDD 553
552 VFGGDMGTITNSHLQKLSLLERTIKESLRMFPSVPFIGRVTTEECSVGSHSIPAGTQVA 376
375 IFIDSLHHNPSVWPDVDRFDPDRFLPENCVGRHPYSFIPFSAGPRNCIGQKFALMEEKVL 196
195 LTQILRKYSIHSHDEEEDLRKQADLILRSSTPLNISLTPRA* 70

>paired_scaffold_181 Cs4V5b 
271174 DTRRMAFLDVLLRAESEDGR
271114 SLSLSDIQEEVDTFMFEGHDTTAAAMTWTIYLIGRHPAIQARIHEELDDVFGGDMGTITN 270935
270934 SHLQKLSLLERTIKESLRMFPSVPFIGRVTTEECSVGSHSIPAGTQVAIFIDSLHHNPS 270758
270757 VWPDVDRFDPDRFLPENCVGRHPYSFIPFSAGPRNCIGQKFALMEEKVLLTQILRKYSIH 270578
270577 SHDEEEDLRKQADLILRSSTPLNISLTPR 270491

>seq 115 ortholog to Cs4v5b N-term 69%
MALTLFILIGALAVTIAVYWKKIKEQFR
KRQLINKLPTIMETAYPLVGHSYLFPKSAEELFVFITGRLNWVMTEAVEKIAVLWLGPIPL 
IGVVHPEAAEVILRSSKHIEKSFVYTFVHPWLGTGLLTSGGEKWKQRRRLITPSFHFNIL 
QEFLEVMNEQSKKMVDKLDASIASGSKIYVGKAITMCALDIICETAMGQTVNAQDHQDSE 
YVKALYRISDLVQFRQRTPALWWDAVFSRSKLGIEHDTILCTLHGFTRNVITERAQGKGK
KEIENPRRLAFLDVLLNAETEDGKSLSLNDIQEEVDTFMFEGHDTTAAAMTWTVYLIGRYPD
IQEKLHEEIDSVFHDDKEGVISNSQLQKLSYLERVIKESLRL
YPSVPFIGRVTTEECVIADHVIPVGTQVALFIES
MHRNPAVWPDAEKFDPDRFTAENCVGRHPYAYIPFSAGPRNCVGQKFAMMEEKVILAQIL
RRFSLVSHDKEEDLKKQADLILRSSKPLNITLTPRV*

>SEQUENCE 34, 52, 71, 217 74% TO SEQ 29, 60% to seq 65, 39% TO 2C9
MLDILLSCIRAWFSTITLSIIVFYVTWHYWNKRTPGSPPGPRGFPFIGAITSIRKHPEHVMTKWNKEYGPVCMVRLGFKDVLVIGS
YEAAHEAYVKSQDFLDRPSPFGLEILGGGYGLFPIAYGSFHQEQRRFGLNTLREHGMGRRVLESTILQYAEELCDRLETKMSAPVL
LDQEVYIAVSSTIAHIVFGHNTMQDSPEFRDMILWMLRPNTATVLAGILVFAPYLKHLPFFRGVHNDSRALRLKLESLNLKEVKKH
DKTRDPSDPRDFIDSFLNEMDK (1) 
HFKGKIDPNWETYSSFSDQQLVSM (0)
VRDLFLAGTDTTSATTCWIILFLCKYPDVQRRMQKEIDDVIGENGIPKLALAERLPFTRAVIQ (0)
EMGRIRPNVPLAVPHCASRDSTLMGFNIPKDTIIMTNIWGIHHDEKTWKDPYKFNPDRHLDAEGNFVKSNHVMQFNIGLRSCLGQQ
LARMELFLITVTLFRKFFIELEPGCDIDMEGESLVSLRPYPFKVLLTKRI*

>paired_scaffold_278 Length = 731525 complete CYP2 clan Seq 34 ORTHOLOG revised 
58475 MEESLFLSIKSTVDIFILVAIALYVTWHLWNKRSPQSPPGPRGIPLLGAITEMRKHPEHA 58654
58655 MMKWNQQYGPVCMVRLGLKDVLVLGSYEAAHEALVKNTHIADRPTHNIEVFYGGKGILM 58831
58832 INSGSFHQEQRRFGLNTLREYGMGRRALEPTILIYVNDLCDRIEKYGSDPFVIDGEVYLT 59011
59012 ISSTISHIVFGHDVIHDNPKFKEIIFKMIEPNKLNVLAGILAFAPFLKHFPFFSTVHKSS 59191
59192 KEFRLKLQSLIGEEVQEHKKTRDPTQSRDFIDSFLQAMEKTER (1) 59320
59700 (1) SGNKSKLNQDGFADVQDGKLDPNWEPYSSFTIDQLTAM (0) 59768
60529 (0) VRDLFMAATDTTSSTICWIILFLCKNTDVQIKMQEEIDEVLGQNGILKYELATKMHYVRAVIQ (0) 60717
61567 (0) EIARLRPSVPLSLPHASNRDTSIMGYK 61644 frameshift
61649 VPKDTIVLTNIWGIHHDEKIWKNPYEFNPERHLDANGNFVKSKKVMQFNIGLRSCLGQQL 61828
61829 ANMELFLVTVSLFHRFSFSLVNPKIDMGGESMVSLRPYPYGVVATTRG* 61975

>SEQUENCE 65, 78, 38  join with 68 check end
MEISVQSISSIFDTILLVLIVSYLTWHFWNQRSAWA
PPGPRGLPFIGPITSIRIHPEHAMMKWNQQYGPVCMVRFGFKDILLLGSYEAAHEAL 
VKNMDLADRPSNGIAVFKGGKGILMTKFGSYHQEQRRFSLNKLREYGMGRRALEPTI 
LLYSNELCERIEKFGSKPFYIDMEIYKAISSTICHIVFGHNVIEENEDFKEIINTLN 
KKSKLNVLSGILAFAPFLRFLPVFSTIHAKSVNFQQTLHALVRQEISEHEKTRDPKE 
PRDYIDSFLNEMDK 
DFRGQVDPNWKQYSSFTYEQLVAM (0) 
YEQLVAMCRDLFMAGTDTTSSTTSWIILFLCRYPEVQRKMQEEADQVLGSNGEPKMALAEKMPYTR (2?)
FFFQEINRIRPNVPLSVPHYSSKDTMV 
MGYKIPKDTIVLTNIWGIHHDEKLWKNPYDFNPERHLDSNGKFIK 
SSKVIQFNIGLRSCLGQQLANMELFLVTVSIFRQFSFSFNPGDKIDM 
EGESVIALRPYSYRVI

>ci0100145536 = seq 65 CORRECTED complete CYP2 clan
MEISVQSISSIFDTILLVFIVSYLTWHFWNQRSAWAPPGPRGLPFIGAITSIRKHPEHAMMKWNQQYGPV
CMVRFGFKDILLLGSYEAAHEALVKNMDLADRPSNGIAVFKGGKGILMTKFGSYHQEQRRFSLNKLREYG
MGRRALEPTILLYSNELCERIEKFGSKPFYIDMEIYKAISSTICHIVFGHNVIEENEDFKEIINTLNKKS
KLNVLSGILAFAPFLRFLPVFSTIHAKSVNFQQTLHALVRQEISEHEKTRDPKEPRDYIDSFLNEMDKAKSTMQ (1) 
ESETTQTTGGESIEGFRGQVDPNWKQYSSFTYEQLVAM (0) 
CRDLFMAGTDTTSSTTSWIILFLCRYPEVQRKMQEEADQVLGSNGEPKMALAEKMPYTRAVIQ (0)
EINRIRPNVPLSVPHYSSKDTMVMGYKIPKDTIVLTNIWGIHHDEKLWKNPYDFNPERHLDSNGKFIKSSKVIQF
NIGLRSCLGQQLANMELFLVTVSIFRQFSFSFNPGDKIDMEGESVIALRPYSYRVIAEKR*

>New Cs300 seq 67% to seq 34 ortholog paired_scaffold_278 complete CYP2 clan
64594 MDIAADIIKEVCAVQTIRGAIDTILLLAIVFYLTRLFWNRRSPLSPPGPRGIPLIGAITKIGKHPEHAMMKWNQQY 64821
64822 GPVCMVRLGFKDVLVLGSYEAAHEALVKNTHIA 64911
64925 DRPPNGIPIFKSGQGIFMINSGSFHQEQRRFGLNTLREFGMGRRALEPTILIYVNDLCD 65101
65102 RIEKYGSDPFAIDNEIYTVIASTISHIVCGHNVTEDSEDFRYILRRLNDKSKLNALSGIL 65281
65282 AFAPFLRHFPLFRMLYRRTLELQELIHSVLQKEIDEHEVTRDPKEPRDYIDSFLTEMEKSRNQTTK (1) 65479
65763 (1) GEVSNDSPCQENKDFREQQDPNWKKYSSFSKEQLVAM  (0) 65873
66132 CRDLFIAGTDTTTSTICWIILYLCQYPEVQRKFQEEVDRVLGSSGVLKMAHAEKMPYARAVIQ  (0) 66314
66620 EINRMRPNVPLSVPHYSSTDTSIMGYRIPKATIVLTNIWGIHHDEKIWKNP 66772
66773 YEFNPERHLDVHGNFVKSKKVIHFNIGLRSCLGQQLANMEQFLVTVSLFHRFSISLKPGE 66952
66953 KADMAGE 66973
66975 SVVSLRPYSYNVVTKKRL* 67031

>New Cs301 seq 74% to seq 34 ortholog (this N-term matches student assembly) paired_scaffold_278 complete CYP2 clan
71129 MMEEVSIVQTIRGTFDTILLLAIVFYLTWHIWNRRSPLSPPGPRGIPLLGAITQLGKHPEHAMMKW 71326
71327 NQQYGPVCMVRLGPKDVLVLGSFEAAHEALVK 71422 
71424 NTHMADRPSNGIPLFKGGKGILMIHSGSFHQEQRRFGLNTLREF 71555
71556 GIGRRALEPTILIYVNDLCDRIEKYGSDPFAIDKEIYKAISSTICHIVFGHNVIEESEDF 71735
71736 RAILSRLNRKNKLSTLAGIVAFAPFLRHLPVFGMLYRRTLEFQQIIHKVIGKEIEEHVDT 71915
71916 RDPKEPRDYIDSFLNEMEKS (1) 71975
72227 YANRKMSNAEHKEDFRGQEDPNWIQYSSFSKEQLVAM (0) 72337
72469 CRDLFIAGTDTTTSTICWIILYLCQYPEVQRKFQEEVDRVLGSSGVLKMAHAEKMPYARAVIQ (0) 72654
72761 EINRMRPNVPLSVPHYSSADTTLMGYRIPKDTIVLTNIWGIHHDEKIWKN 72910
72911 PYEFNPERHLDANGNFVKSNKVMQFNIGLRSCLGQQLANMELFLVTVSLFHRLSFRLKPG 73090
73091 EKVDMAGESVISLRPFSYNVILEKRM* 73171

>New Cs302 seq 71% to seq 34 ortholog 89% to seq Cs301 paired_scaffold_278 complete
CYP2 clan
84029 MMEEVSIVQTIRGTFDTILLLFIVFYLTWIFWNRRSPLSPPGPRGIPLLGAITQLGKHPEHAMMKW 83832
83831 NQKYGPVCMVRLGHKDVLLLGSYEAAYEALVKNTHIADRPPNGIPLFKGGKGILMINSG 83655
83654 SFHQEQRRFGLNTLREYGMGSRALEPTILIYVNDLCDRIEKYGSDPFAIDNEIYTVISTT 83475
83474 ICHIIFGHNVIEESEDFRAILTTLNRKDKLSTLTGFVAIAPFLRHLPVFGMLYRKTMKFK 83295
83294 QDIHDVIEKEIKEHEDTRDPKEPRDYVDSFLNEMEKS (1) 83184
82857 YVPEKTSYEEHKEDFRGQEDPNWIQYSSFSKEQLVAM (0)
      CRDLFIAGTDTTTSTICWIILYLCQYPEVQRKFQEEVDRVLGSSGVLKMAHAEKMPYARAVIQ (0)
82335 ETNRVRPNVPFSVPHHSSADTTLMGYRIPKDTIVLTNIWAIHHDETIWKNPYEFNPERHLDA 82150
82149 NGNFVKSKKVMQFNIGLRSCLGQQLANMELFLVTVSLFHRFSFRLKPGEKVDMAGESVIS 81970
81969 LRPFSYNVISKKRI* 81925

>sequence 29 9 accessions 33% to 1A1 in a cluster with 65
MEEYGIINSIKVAVDTVILVGLAIYLTWHFWNKRTPGGAPGPRGLPFIGAITSIRKHPEHAMMKWNQQYG
PVCMVRLGFKDILLLGSYEAAHEALVKSPDFANRPPAHSIEVIAGGKGIVMIPFGPFHQEQRRFGLNALREH
GMGRRALEPTVHLCAQELCERIEEYGSKPFNLDNEVY
QSVSSIISRLVFGHDVVQENLEFRKMIFQMIEPNKLNILAGIL
AFAPYLKHLPFFSEIHRKNKEFRLKMESSIRKEIEEHMETRDRKEPRDYIDSFLNEMEK (?)
DEPNTKNEEWKQYSSFTQNQLIAM (0)
CRDLFMAGTDTTSSTTSWIILFLCRYPEVQRKMQEEADQVLGSNGEPKMALAEKMPYTRAVIQ (0) 
EIARMRPTLPLSVPHCTTQDTMVMGYKIPKDTIVLTNIWGIHHDEKLWKNPYDFNPERHLDSNGNFV 
KSSNIIQFNIGLRSCLGQQLAKMELFLLTTSLCRHFSFSVVGEVDMEGESMVTLRPCSMEVVATKRA*

>sequence 2,3,9,45 COMPLETE 81 accessions 94% to sequence 1 40% to 2U1
MVLQLLSDINVSSLVIFFTAFLALYYWYTRPKNFPPGPRGVPFLGVIPFLGNYPERVMRKWSKKYGPVMSVRMG
REDWVVLGDYETIQQ (0)
SLVKQGQCFSGRPDVPVLNQITNGHGLITVDYNEDWKTQRRFGITTLRG (2)
FGVGKRSMEDRIVEEVAYLNDAIRSHNEKPFDIL (0)
SILSNAVSNNICSVVMGRRFDYDDKRFMEIMARLSRS (2)
FNDPTANFALNVVMFMPILVKIPPFSRINNQLMTDVRVI (1)
LQMLREILSEHKSTFNKDDVRDFIDAFIAEQNSESKHSSYT (0)
DLQLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVI (1)
GQDRVPAMNDKAQMPYTCAFMQEVFRYRTLVPLSVVHMTNQDVVLNGYTIPKGTT (0)
ISPNLWAVHNNPDVWDEPSKFKPERHLDDKGNFVQSKHVIPFSIGPRHCLGEQLARMEYFIYLVSMV
QKFEFFPDPNEPDLPDVEDGSSGVVFVPLRFKQIAKIV*

>Cs2a CIONA SAVIGNYI SEQUENCE 71% TO CIONA INTESTINALIS seq 2 70% TO SEQ 4
MLQRMLNEINVSTSFIFLTVFLGLYYWYRRPKNFPPGPRGIPFLGVLPFLGNYPERKMRK
WSNKYGPVMSVRMGRQDWVVLGDHETIQQ (0)
GLVKHGNAFSGRPSIATIDQITEGHGVLFIDYNDHWKRQRRFGLSTLRG (2)
FGVGKRSMEDRITEEVAYLNDAIRTHDGKAFDIQ (0)
SILSNAVSNNICSIVMGRRFDYDDERFKEIMGRLARG (2)
FNDPEASFVLQILIFMPALINFPYFSRINGELMEDVRVI (1)
PELLREIVEEHKASYEQDNHRDFIDAFLGEQKAENGAKTFT (0)
DTQLLQYVRDLFVAGTETTTSTVRWSLLCLIHNPETQEKLRKEIFEVL (1)
GPEKIPAFENKSKMPYTSAFIQEIYRYRTLVPLSVTHMTNEDAHISGYTIPKGTT (0)
IAPNLWAVHNDPEEWDEPNKFKPERHLDAGGNFVQLKHVIPFSVGPRHCLGEQLARMEVF 
IFLVSLVQKFEFLPDPKATVLPDIENGASGAAYVPLPFKIVAKVV*

>paired_scaffold_10 Cs2a CYP2 clan complete
        MLQRMLNEINVSTSFIFLTVFLGLYYWYRRPKNFPPGPRGIPFL
1860946 GVLPFLGNYPERKMRKWSNKYGPVMSVRMGRQDWVVLGDHETIQQ (0) 1860812
        GLVKHGNAFSGRPSIATIDQITEGHGVLFIDYNDHWKRQRRFGLSTLRG (2) 1859654
        FGVGKRSMEDRITEEVAYLNDAIRTHDGKAFDIQ (0)
1858956 SILSNAVSNNICSIVMGRRFDYDDERFKEIMGRLARG (2)
        FNDPEASFVLQILIFMPALINFPYFSRINGELMEDVRVIL (1)
1858612 KLLREIVEEHKASYEQDNHRDFIDAFLCEQKAENGAKTFT (0) 1858493
1858247 DTQLLQYVRDLFVAGTETTTSTVRWSLLCLIHNPETQEKLRKEIFEVL (1) 1858101
1857724 GPEKIPAFENKSKMPYTSAFIQEIYRYRTLVPLSVTHMTNEDAHISGYTIPKGTT (0) 1857563
1857321 IAPNLWAVHNDPEEWDEPNKFKPERHLDAGGNFVQLKHVIPFSVGPRHCLGEQLARMEVF 1857142
1857141 IFLVSLVQKFEFLPDPKATVLPDIENGASGAAYVPLPFKIVAKVV* 1857004

>Cs2b Second savignyi whole gene ortholog to seq 2/4
MLQRVFNEINVSTGFVVLTVFLALYYWYRRPKNYPPGPRGIPFLGVLPFLGNYAERTMHK
WSKKYGPVMSVRMGRQDWVFLGDHETIRQ (0)
ALVNQGNSFSGRPLIATLDQITEGHGIVLLDYGDWWKRQRSFGFSTLRG (2)
FGVGKRSMEDRITEEVAYLNDAIRSHDGKAFDIK (0)
SILSNAVSNNISSIVMGRRFEYDDEHSKEIMARLAMG (2)
FNDPDTSFFLQILIFMPACVHLPYFSRVNKKLMEDVHVTR (1)
SELLREIIAEHKASYDQDNHRDFIDAFLGEHNAENGTDAFT (0)
DKQLLHYVRDLFFAGSETSTSTLRWTLLCLIHHPEKQEKLRKEIFEVL (1)
GQEKIPAVDNKAYMPYTCAFMQEVYRYRTLAPFGVAHMTNEDVNLNGYSIPNGTT (0)
IYVNLWAVHNNPDVWDEPNKFKPERHLDDKGNFVQSNHVIPFSVGPRHCLGEQLARMEIFIYL
VSLVQKFEFLPDPDATELPDIENGSYGPICAPKPFKMVAREV*

>paired_scaffold_76 Length = 1209077 79% to seq 48 CYP2 clan
889173 MMPDFLMYFELPILTLILVLVCIYYWYRRPKNSPPGPRGIPVLGVLPFMGLYPERTLNKWSKKYGPV 889370
889371 MSVRMGRHDWVILGDHATIQQ 
889551 ALVKQSNVFSGRPHVPTLHHITQGRGLAFVDYGPVWKSQRKFGAVTLR 889697
890144 FGVGKKTMEDKITEEVHYLNDAIRSRGGQPFEIL
       DVLSNAVSNNICNIILGRRFDYNDESFREIMHRLSRG 890452
890953 LTEMLLKYVEEHQATFTRDDVRDFIDAFIKESKEGTD 891063
891134 DTQLIHYVHDLFVGGTETTTSTLRWALLCLLHYPLTQDKLRKEIKEVIGK 891283
891423 MPYTCAFIQELYRFRTLLPLSLTHKTSEDAFLGKYRIPKSTTVS 
891603 VSVNLWAVHNDPNLWDEPSKFKPERHLDDKGNFVQSKHIIPFSVGPRHCVGEQLARM 891782
891783 EVFIYLVSMVQKFEFSPDPNAADLPDINEGSNGPAFVPKKFDIV 891914

>65% to seq 55 paired_scaffold_76 CYP2 clan
896229 MMVVETLLPAWSNAGAFFVSIIVILIYYWYQRPLLFPRGPRGFPFVGVVPYLGKNPETTLSK 896044
896043 WSRKYGPVMSVRMGREDWVVFGNYESVYE 895864
895863 SFVKGGHVFSGRPYIKLLSKLTESKGLAMIDYNPTWKSQRKFGLITLR 895735
895029 FGVGKKSLEERIVEEIDVLNRSILEKEGTPFDI 894931
894240 SGMFQVLVKEHRETFDENNIQDFIDAFLLEAKMHTDPSFKDVRDL 894106
894219 VKEHRETFDENNIQDFIDAFLLEAKMH 894139
894058 ERQLLQYVRELFEAGTETTVGTMLWAVLCLVRYPDAQSKLRKEIFDVIGK 893909
893142 ILGPVSIPSLSDRSAMPFTCAFMQEVFRYRTIIPLSVPHKTTDTVNFGGFIIPKGTT 892972
892901 VVSNLWAVHNDPDVWNEPSKFKPERHLDHKGNFVQSNHVIPFSIGPRRCLGEQLARME 892728
892727 IFIILVSLVQKFEFLPDSDETDLPEIDKGLNGAAFVPLSYKIIAKPI 892587

>70% to seq 27 paired_scaffold_76 CYP2 clan
900705 MLCCFYDCISVSTILPVAAILAAVYYWYKRPSNMPPGPRGIPFLGVVPFVEKQSERT 900535
900534 FFKWKKDYGSVLSVRMGREDWVVLNDYESIYQ 900439
900296 ALVKQGSSFSGRPSVSILDEITNGHGLAFLDYCEKWKSQRKFGLMTLRG 900150
899919 FGVGKKGMEERILEEVDFLCNKIRAKNGKPFDIQ 899818
898328 IFKRIIEDHENTYDDNNIRDFIDAFILEMKRNQSSSDFTVSSLI 898197
897978 QLMQYLRDLFEAGSETTTSTTRWALLCMLHYPETQEKLREEIMNVIGK 897835
897477 LGSSARPSMSHKTDMPYMCAFIQEVFRFRTLVPLNVPHKTTEKVNINGYTIPKGTA 897310
897251 VFTNLWAVHNDPDVWDEPNKFKPERHLDHKGNFVHSNHVIPFSVGPRHCLGEQLARMEVF 897072
897071 IFLVSLIQKFEFLPDPDAKGLPDIENGSHGTVFVPLPYKVL 896949

>N-TERM PSEUDOGENE
901883 MLNSMFSQINVSISIYILTFLTVLLALYYWYKRPTNFPPGRQGLPILGVLPFRGNYHKAE 902062
       KEVWS 902077
902538 PSIAIIDQISE*HSILFIDYDDHW 902609

>paired_scaffold_76 79% to cs2b CYP2 clan
910953 MMQCMFNEINVSTGFVVLTVFLALYYWYRRPKNYPPGPRGIPFLGVLPLLGNYPERTMHKW 911135
911136 SKKYGSVMSVRMGRQDWVVLGDYETIQQ 911219
911587 LVKQGNSFSGRPSIAILDQISEGHGLLFIDYDKQWKRQRRFGLATLRG 911730
911796 FGVGKRSMEDRITEEVAYLNDAIRSHDGKAFDIQ 911897
912224 VMGRRFEYDDKQFKEIMAIISRG 912292
912361 NDPETSFVLHILIFMPALVYIPHFSRIYEGLMEDARATL
912549 IDEHKASYDQNNQRDFIDAFLGEXXXXXXXXXXX 912617
912733 DKQLLHYVRDLFFAGTETTMSTLRWALLCLIHHPETQEKLRKEIFTVL 912876
912953 GHEKIPAVENKSNMPYTCAFMQEVYRYRTLVPLNLTHMTNEDVILNEYSIPKGTT 913117
913177 ISSNLWAVHNDPDVWNEPSKFKPERHLDDKGNFVQSSHVIPFSVGPRHCLGEQLARME 913350
913351 VFIYLVSLVQKFEFLPDPDATELPDIKIGSNGPAYVPLPFNMVARVV 913491

>75% to seq 27 paired_scaffold_76 CYP2 clan
927634 MLSCLNDCISVPTILSLATILAAVYYWYKRPSNMPPGPRGIPLLGVVPFVGKVPEQVFFK 927455
927454 SKKDYGSVMTVRMGREDWVVLNDYESIYQ
       TLVKQGSIFSGRPSIPILEEMTKGHGILLLDYGEKWKSQRKFGLMTLR 927107
926117 FGVGKKGMEERILEEVDFLCNKIRAENGKPFDIQ 926016
923794 KQVIEEHESTYDNNNIRDFIDAFILEMKRDQSNSDFTVSSLI 923669
923320 QLMQYVRDLFEAGTETTTSTIRWALLCMLHYPETQDKLREEIMKVLGE 923177
922703 PSMSQKVDMPYTCAFIQEVYRFRTLVPLNVPHKTNKSFDINGYTIPKGTT 922554
922480 ISTNIWAVHNNPDVWDEPNKFKPERHLDDKGNFVQSKHVIPFSVGPRHCLGEQVARMEVF 922301
922300 IFLVSLVQKFEFLPDPDATDLPDIDNGANGTAFVPFPYQLVAKEI 922166

>paired_scaffold_76 Cs2b CYP2 clan
946366 MLQRVFNEINVSAGFVVLTVFLALYYWYRRPKNYPPGPRGIPFLGVLPFLGNYAERTMYKWSKKYGPVMSVRMGRQ 946139
946138 DWVFLGDHETIRQ 946100
946025 ALVNQGNSFSGRPLIATLDQITEGHGIVLLDYGDWWKRQRSFGFSTLR 945894
945826 FGVGKRSMEDRITEEVAYLNDAIRSHDGKAFDIK 945725
944393 TFSELLREIIAEHKASYDQDNHRDFIDAFLGEHNAENGTDAFT
943365 RLLHYVRDLFFAGSETSTSTLRWTLLCLIHHPEKQEKLRKEIFEVLGK 943222
943165 VTGQEKIPAVDNKAYMPYTCAFMQEVYRYRTLAPFGVAHMTNEDVNLNEYSIPNGTT 942995
942583 DWQIYVNLWAVHNNPNVWDEPNKFKPERHLDDKGNFVQSNHVIPFSVGPRHCLGEQLARM 942404
942403 EIFIYLVSLVQKFEFLPDPDATDLPDIENGSYGPICAPKPFKMV 942272

>paired_scaffold_76 80% to Cs2b CYP2 clan
949817 VLGRRFEYDDEHSKEIMARLAMG
949676 PDTSFFLQILIFLPACVHLPYFSRIN 949599
       FNDPDTSFFLQILIFLPACVHLPYFSRINKKWMKDVRLTTSK 949560
949519 SELLREIVAEHKASYDQDNHRDFIDAFLGLQNAENGADGFT
949318 DNQLLYYVRDLFFTGTETTTSTLRWTLIFLIHHXXX 949226
949215 PEKQEKLRKEILEVLGK 949165
949079 PLAENKFNMPYTCAFMQEVYRYRTILPLSASHMTNEDANLNGYSI       
       ILDHDWQIYVNLWAVHNDPNAWDEPNKFKPERHLDDKGNFVQSNHVIPFSVGPRH 948696
948695 CLGEQLARMKVFIFLVSWV*KFKFLPDPDATDLPDI 948588
948585 GPLCAPKPFKMAAKVV 948538

>paired_scaffold_76 C-TERM PSEUDOGENE
950031 GPLCAPKPFKMAAKVV 950078

>paired_scaffold_76 84% to cs2b CYP2 clan
951649 MLQSMFNEINVSTAFVFLSVFLALYYWYRRPKNYPPGPRGIPLLGFLPFLGNYPERTMYKW 951831
951832 SKKYGPVMSVRMGRQDWVILGDHETIRQ 951915
951996 ALVNQGNSFSGRPLIATIDQITEGHGILLIDYGDWWKRQRRFGFSTLR 952127
952193 SFGVGKRSMEDRITEEVAYLNNAIRLHDGKAFNIQ 952297
953096 VLGRRFEYDDERFKEIMARLAMG 953164
953229 FNDPDTSFFLQILIFMPACVHLPYFSRVNKKLMEDVDVTISK 953354
953417 IVAEHKASYDQDNHRDFIDAFLGEQKAENGTDAFT 953521
953981 DKQLLHYVRDLFFA 954022 (FS)
954024 GSETTATTLRWALLCLIHYPKKQEKLRKEIFK 
954204 VTGQEKIPAVDNKSYMPYTCAFMQEVYRYRTLAPLGVAHMTNEDVNLNGYSIPNETM 954374
954778 IFVNLWAVHNNPDVWDEPSKFKPERHLDDKGNFVQSKHVIPFSVGPRHCLGEQLARM 954948
954949 EVFIFLVSLVQKFEFLPDPDEVNLPDIENGSYGPLCAPKPFKMVARIV 955092

>Mostly new 80% to Cs2b paired_scaffold_377 CYP2 clan
470117 MLQSMMEEINVSISFTFLIVFLALYYWYRRPNNFPPGPRGLPLLGVIPFLGNHPERRMRKWSKKYGPVMSV 470329
470330 RMGRQDWVVLGDYDTIHQ 470383
470631 IKKGLVKQGTSFSGRPRIATLEQITEGHGLVFADYGDRWKRQRRFGLATLRG 470786
470840 FGVGKRSMEDRITEEVAYLNDAIRSHDGNAFDIQ 
       SILSNAISNNICSIVMGQRFDYDDERFKEIMTKLSYG 471166
471229 FNDPEVSLVRQILIFMPALVNAPYFSRINAELMENVR 
       NSVSELLREIVADHKAL 471441
471442 YDQDNHRDFIDAFLGEQKSENGSETFT 
471622 DKQLLHYVRDLFIAGTETTTSTLRWSLLCLIHHPEIQEKLRKEILKVLGMIK 471786
471842 GQEKTPAMENKPIMPYTCAFMQEVYRYRTLVPLNLTHMTNNDVNLNGYSIPKGTTV 472009
472065 TQISSNIWAVHNDPALWKDPSKFKPERHLDDKGNFVQSNHVIPFSVGPRHCLGEQLARME 472244
472245 FFIFLVSLVQKFEFLPDPNATDLPDINKGANGVAFVPLPYNMVARVV 472385

>related savignyi 2/4 like exon sequences (probably two more closely related genes)
TLVKQGSIFSGRPSIPILEEMTKGHGILLLDYGEKWKSQRKFGLMTLRG (2)
GLVKQGTAFSGRPSIDIIERVSKGNGMIFLNYDEHWKKQRRFGLSTLRG (2) 
FGVGKRSMEDRITEEVAYLNDAIRTHDGKAFNIQ (0) = RB10
SILGNAISNNICSIVMGRRFDYDDERFKEIITRLARR (2)
FNDPEVSLVRQILIFMPALVNAPYFSRINAELMENVRVI (1)
FIDPEVSSALQIFVFMPALVNFPYFSRIYGELMKDVRVI (1)
PESLREMVEEHKASYEQDNHRDFIDAFLGEQKAENGAKTFT (0)
SELLREIVADHKALYDQDNHRDFIDAFLGEQKSENGSETFT (0)
DMQLLHYVRDLFVAGTETTTSTLRWSLLCLIHDPEIQDKLRKEIFEVL (1)
GQEKIPAFEDKSKMPYTRAFIQEIYRHRTLLPLSVTHMTNEDANICGYTIQKGTT (0)
ISSNLWAVHNDPDVWNEPSKFKPERHLDDKGNFVQSSHVIPFSVGPRHCLGEQLARME
VFIYLVSLVQKFEFLPDPDATELPDIKIGSNGPAYVPLPFNMVARVV*
IAPNLWAVHNDPVVWDEPNKFKPERHLDADGNFVQLKHVIPFSVGPRHCLGEQLARMEVF
IFLVSLVQKFEFLPDPKAKVLSDNENGASGAAYVPLPFKIVAKVV

>paired_scaffold_10 CYP2 clan
1855032 MLQRMFNEINVSTSFIFLTVFLAFFYWYRRPKNFPPGPRGVPFLGVLPFL 1854883
1854860 MRTWSKKYGPVMSVRMGRQDWVVLGDYETIQQ 1854765
1854250 GLVKQGTAFSGRPSIDIIERVSKGNGMIFLNYDEHWKKQRRFGLSTLRG 1854104
1854015 SFGVGKRSMEDRITEEVAYLNDAIRTHDGKAFNIQ
1853606 VMGRRFDYDDERFKEIITRLSRRYLTLRPAVCLR
        GSEWFNKRNNARKLIPESLREMVEEHKASYEQDNH
1853246 RDFIDAFLGEQKAENGAKTFT
1852892 DMQLLHYVRDLFVAGTETTTSTLRWSLLCLIHDPEIQDKLRKEIFEVL 1852749
1852345 GQEKIPAFEDKSKMPYTRAFIQEIYRHRTLLPLSVTHMTNEDANICGYTIQKGTT 1852181
1852018 IAPNLWAVHNDPVVWDEPNKFKPERHLDADGNFVQLKHVIPFSVGPRHCLGEQLARMEVF 1851839
1851838 IFLVSLVQKFEFLPDPKAKVLSDNENGSSGAAFVPLPFKIV 1851716

>paired_scaffold_10 CYP2 clan
1723846 KSFGVGKRSMEDRITEEVAYLNDAIRTHDGKSFDIQ 1723953
1727607 DERFKEIMSRLAKGYLTLRSVCFVGAVNHFNDIFRFNDPHASFVLQMLIFMPALINFPYF 1727786
1727787 SRINGELMEDVRVILSKAEVVR 1727852
1727884 ELLREIVEEHKDSYEQDNHRDFIDEFLGEQKAEIGTKTFT
1728188 DKQLLQYVRDLFVAGTETTTSTLRWSLLCLIHNPKTQEKLRKEIFEVL 1728331
1728672 GQEKIPAFENKSKMPYTSAFIQEIYRYRTLLPLSVTHMTNEDANICGYTIPKGTT 1728836
1733808 IAPNLWAVHNDPVVWDNPEKFKPERHLDTDGNFVQLKHVIPFSVGPRHCLGEQLARMEVF 1733987
1733988 IFLVSLVQKFEFLPDPKATVLYNLENGTSGAVHVPLPFKIVAKVV 1734122

>sequence 4, 8, 92  (48 accessions)  83% to sequence 2 faces away from seq 220 on 
MLQHLLGDINVSSLVIFFTAFLALYYWYTRPKNFPPGPRGVPFLGVIPFLGNY
PERVMRKWSKKYGPVMSVRMGREDWVILNDYENIQK (0)
ALVKQGHSFSGRPDIPVFTQINNGMGLLTVDYNDHWKMQRRFGITTLRG (2)
FGVGKRSMEDRIVEEVAYLNDAIRSHNDKPFDIL (0)
GVLSNAVSNNICSIVMGRRFDYDDERYKEIMFRLSRS (2)
TKDPEANFYIsVLMFVPFLVNFPPFSRINKEMMDDVKVIL (1)
DHLDEIVSEHnSTFNKDDARDFIDAFIAEKNSQNKHSSFT (0)
DpQLLHYVVDLFEAGTETTTSTLMWSILCMIHNPEKQEKLRKEICSVV (1)
GQDRVPAMNDKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTT (0) 
VSPNLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVVAFSVGPRHCLG
EQLARMEYFIYLVSMVQKFEFLPDPNEPNLPDIEKGSNGAAFVPLPFKQIARVV*

>SEQUENCE 27, 220 SIMILAR TO SEQ 2 and seq 8 BUT NOT SAME SEQ Faces away from seq 
MLTNLIANPSISPVLAILSCVIAFYYWYKRPKNMPPGPRGIPFLGIIPFVGMN
PEQAFMQWSKKYGPVITVRMGRKDWVVLCDHDTIHQ (0)
VLVKQSTVCSGRPKIPIVSELSKGHGILFADYCEKWKSQRKFGMKTLRE (2) 
FGVGKKCTEDRVLEEVDFLCNEIRSKNGKPFDIQ (0)
DIMCNAVSNIIMNIVIGRRCNYDENFFTDVISRFTKW (2) 
FNDPTAGAMFTGMMFLPQLKYVPPFSKYYKIFRDDIGALH (1) 
EFFEEVIKEHEKNFDGNNLRDFIDAFLLEMKKNESGSEFT (0)
YIQLLNYIRDLFDGGTETTVSTSRWAILCMLHYPETQKKLRNEIMTII (1)
GPNTPASMSHKSDMPYTCAFIQEVLRFRTLVPLSVPHKVNEDATVNGYTIPKGVT (0)
VSPNLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSNHVIPFSVGPRHCLGEQLARMEIFIFL
ISMVQKFEFLPDPNEPDLPEINHGTNGAAFVPLPYKIVANQI*

>Sequence 209, 199 = A different C-term from seq 27 and seq 10
MIEAFLRQWTDTTTVIIFLFTLLFYYWYRRPLRFPPGPRGLPLV
GVLPFLRKYTARTMHKWSNKYGPVMSVRMGNEDWVVMGNYEAVQH (0?) 
SFVKAGNVFSGRPVLKVANEIAEGKGIVMRDFNTTWKTHRKFGSITLRG (2?)
FGVGKKSLEERIYEEVEVMNKEILSKEGKPFDIT (0?)
EILTNAVMNVISIITTDQRFEYDDEHFRLLQQIFTKW (2?)
FLEPENTAAFAQIIYVPLLCNIPPYRSKYLEVKRDMKRVSG (?)
VFQQMVEQHRKTFDKNNLRDFIDAFICEGKKGTDESFT (0) 
DGQLVQYVREMFEAGTETMTGTVRWAMLCLIHYPDAQKKLREEIFEAI 
GNNNRPSISDRKAMPFTSAFIQEVFRFRTRVPLGVQHMTTETVNFANYVIPKGTT (0)
ILANMWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSNHVIPFSVGPRHCLGEQLARME
IFIFLVSMVQKFEFLPDPNEPDLPEINDGVKGNGFFPYPFNFVASEI*

>sequence 42, 59, 218 36% TO 2U1 Fugu
MLSLFFSSVSIFFRCFSFLFNSWTLTFATSVLLLLVIMDKTD
KKMPGSPPGPRGLPILGMLPFMMSYPERLMADWSKKHGPVIM (0)
VKMGPKNVVILGSSDAAHAAFVKNTHLANRPQDGLEIFAGGKGILFINS
SMFHSEQRRFCLSALREFGMGRRTLEPKIVDCAALLCDQIDDVCGHDVST (1)
GPIQIEQMIYVTMSNVISHLVFGHDSLNDNKGLCELLLRTIEPNKYNALAGILMFLP
SLKNVPPFSTTTRRAIEFREDLHRYIKMEIDRHRASRDPNQPRDFIDKFLNEIEKIKVRHNSNNNNNNNLILNNNVK (1)
GSKSRRKIAGASFSEDQLIPLVRDLFMAGTDTSSTTITWVVIFLASFPDVQTRLHEELDSVLGQNRTPSVSMEDKMPYMRAVIQ (0)
ETFRMRPPLPLSVPHVAQCNTTLMGYRVPKDSIILTNTWGIQNNPKFWVDPDTFKPQRHIDDLGKFIKSPNVI
PFNIGSRNCLGQQLAKMELFITIASLCRRFWFTLPTGETPDLKGESTFILRPFPFNVVATKRV*

>paired_scaffold_408 80% to seq 42 CYP2 clan 3 diffs to next seq
19162 MTFRIFNCWTILLTMFVILMMLLIIDKNPKKTVGAPPGPKGYPVLGVLPYLMKFPERKMA 19341
19342 EWNSQHGSIIMV 19377
19666 VKIGLTDVVVLGSSSAAHAAFVKNTHLSDRPQDGLEIFAGGKGLLFINASDFHTEQRRF 19842
19843 CLSALREFGMGKKTLEPKILDCAAMLCDHIIS 19938

>paired_scaffold_408 75% to seq 42 CYP2 clan
25679 MTFRIFNCWTILLTMFVILMLLIINKNPKKTVGAPPGPKGYPVLGVLPYLMKFPERKMAE 25858
25859 WSSQHGSIIMV 25891
26177 VKIGLTDVVVLGSSSAAHAAFVKNTHLSDRPQDGLEIFAGGKGLLFINASDFHTEQRRF 26353
26354 CLSALREFGMGKKTLEPKILDCAAMLCDHIIS 26449
      NLTFTGPIQIEELVYITMSSVISHLV 29469
29470 FGHDTLQENDDLRRLLLRTVEPNKFNALAGVLMFIPSLKNIPPFSITTSRAVQFREGLHH 29649
29650 HIRIEIDRHRASRDPNQPRDLIDKFLNQIDK 29742
30036 SFTEEQLVPLVRDLFMAGTDTSSTTITWAILFLASFPDVQGRIHEEIDTIIGYHQRPSIS 30215
30216 FEDKMPYVRAVLQ 30254
30624 LFKETFRLRPPLPLSVPHVAHCNTSLMGYRIPKGAIILTNTWGIQNNPKLWPEPEKFKPE 30803
30804 RHIDEFGKFIKSPDVIPFNIGTRHCLGQQLAKMELFITLTSLCQRFRFTLPLGKTPDLTG 30983
30984 ESTFILRPFPFHVIATKR 31037

>sequence 6, 132, 225 ci0100133889 
MEMETVLGLITIGNVTTILLAVVTVYITRCISKMI (?)
FFPPGFPPGERGLPFLG (?)
VLPYLGEHPHNVLKEWSYKHGDIMAIKLGTTNAVIVSSQSTLNEV (?)
FVKRASAISDRFTNEFFTQILLHRGIFNSAYTERLKIQRKFVHQALHR (?)
NLINNKTPEERINEEIQYFIREIHSQDGKPFHAH (?)
HMLLSSVTNVICSIVLGKRFEYNDKWFTKIMSCIVE (?)
SLAHSGPSFLLPISIYPWLIHFPPFRRINVRFRKSFATFA (?)
ELIQQDIDEHKSSFDENNLRDLIDAFLFEMKKEGHHSSFD (?)
ELQLCVIVRELLLAGNETTGSQIAWLLLALMHFPKWQDRIHREI (?)
KVLTGENGVLKLDDRKKLPCTCAFIQETLRLHSIGSLNAPHKVSRDTEVKGYKIPAGTMV (?)
MSNVFAIHTDPKVWTDPETFDPSRHLNEVGSFINSSK
MSPFSLGPRQCIGKTIAENKMFLYFGNICQKFRIVSETPTPITLHEGQSGFVFSPKEHKVIAIAYSSK*

>paired_scaffold_421 67% to seq 6 CYP2 clan
837856 STNFLTIVVGIATFYIARLLSKLIAFPADFPPGEKGLPFLGVLPFLGDHPQFALKDWSRK 837677
837676 HGDIMGVKFGSTNVVVLSSQSVLNEVNIKFL
835636 INNKNPEDRISEEIHYFLEELRSKNGKPFDL 835544
835434 VVGKRFDYDDAWFKRIM 835384
834904 LVQEAIDDHKASYDENNIRDLIDAFICEMKKENHHSSF 834791
834135 TICFFQDLQLCILVRELLLAGNETTGTQLSWLIVAAMNYPEWHDKIHKEIIDLIGKDSL 833959
833716 IGENGSLKLDHRRQLPLTCAFIQETLRLRPIGSLNAPHSASETVTVNGYTIPAGTM 833549
833029 VLHNVFAVHTDPKVWKDPERFDPNRHIDDSGNFVISKNLSAFSCGPRQC 832883
832882 IGKTLAENKLFLYFGNICQYFRVKSDCSKSPSMSAANSGFVFSPKRHNVIVQ 832727

>ci0100139346 seq 7a = seq 7, 100, 101 46% to 2J2
MLDYLTFTNLALFAIALLLFYIWLTPKYKNLPPGPMGVPFLGCLP
FMETLAERTFTKWSKKYGPIITVQLGDHRTVILNSYEAIEE (0)
AFIKKGEKFNGRFKTYFTEFASEGLGMVFIDGEKWKEHRKFGTRALAG (2)
AGMYGKTIEQRVLEEAENICHVIRSKNEEPFILE (0)
DDLILSVANVINGITIRERCDEEGNENLLKYAAIVVEG (2)
FRKDGLIYSLLSVIPWCRFFPQTQKHIKSLKDGLKAGN (1)
ALFDNLIEQHQQQHDRLHPRDLIDLYLNQMKDFS (0)
KPQLRYFLKDLFAAGTETSSSTIRWALFYLIVNPDIQDKVHKEIDDVI (1)
GHNGVVRYDTKLPYTKAVLQETYRIRTATPLGVPRRNTEDVTLMGYFIPKNTQ (0)
IIPNLWWVHNDPEYWNEPDVFKPERHLDEEGNLIMSNRVIPFSIGARHCLGENLAR
TEIFLFLVSILQKFTVLPNPEDPNPPLECSPGIVNSPHKYPLIMKER*

>paired_scaffold_347 74% to seq 7a CYP2 clan
917011 VVLLAVMMICRWHKQKYKNLPPGPTGVPILWLSSFYGYASR 916889
916968 RNTRTFHQDQRVYLFCGCLPFMDTLAELTFTKWSKKYGPIFTVNLGQ 916828
916449 QALINQGDKFNGRVHNFFTNLSSNGHGILFIEGEKWRIQRKFGTRALAG 916303
912520 NTFQKTQLLYFLKDIFVAGTETSTSTIIWALYYLIRHPEIQDKVHREIDAELGKAR 912353
911719 KLSLKTNLPYTKAVLQETYRIRTVTPLAIPRRTRGDVNLMGYFIPKHTEVI 911567
910776 IISNLWWVHNDPEHWDEPDVFRPERHLDTNGNLVMLNRVIPFSIGPRNCLGENLARIEIY 910597
910596 LFLVSILQKFVVLPNPPITCNPGAINAPSRYNIVLKE 910471 

>paired_scaffold_347 74% to seq 7a CYP2 clan
942634 VVLLAVMMIYRWHKQKVKNLPPGPTGVPIFGCLPFMDTLAELTFTKWSKKYGPIFTVHLG 942455
942454 QTRVVVLNSYEAIVEVRV 942401
942074 QALINQGDKFNGRVHNFFTNLSSNGHGILFIE-GEKWRIQRKFGTRALAG 941928
938126 SNTFQKTQLLYFLKDIFVAGTETSTSTIIWALYYLIRHPEIQDKVHREIDAELGKAR 937956
937358 KLSLKTNLPYTKAVLQETYRIRTVTPLAIPRRTRGDVNLMGYFIPKHTEVI 937206
936743 IISNLWWVHNDPEHWDEPDVFRPERHLDTNGNLVMLNRVIPFSIGPRNCLGENLARIEIY 936564
936563 LFLVSILQKFVVLPNPP 936498

>paired_scaffold_347 Like 7a CYP2 clan
957152 VVLLAVMMIYRWHKQKYKNLPPGPTGVPILGCLPFMDTLAELTFTKWSKKYGPIFTVYLG 956973
956972 QTRVVVLNSYEAIEEVRV 956919
956589 QALINQGDKFNGRVHNFFTNLSSNGHGILFVEGEKWRIQRKFGIRALAG 956443
952194 SNTFQKTQLLYFLKDIFVAGTETSTSTIIWALYYLIRHPEIQDKVHREIDAELGKAR 952024
950489 IISNLWWVHNDPEYWDEPDVFRPERHLDTNGNLVMLNRVVPFSIGPRNCLGENLARIEIY 950310
950309 LFLVSILQKFVVLPNPDDPNPPITCNPGAINAPSRYNIVLKE 950184

>paired_scaffold_347 75% to seq 7a CYP2 clan
969026 VVLLAVMMIYRWHKQKDKNLPPGPTGVPILGCLPFMDTLAELTFTKWSKKYGPIFTVYLG 968847
968846 QTRVVVLNSFEAIEEVRV 968793
968230 QALINQGDKFTGRVHNFFTNFSNNGHGLFFID-GEKWTIQRKFGTRALAG 968084
964663 SNSFQKTQLQYFLKDIFAAGTETSTSTIIWALYYLIRHPEIQDKVHREIDAELGKAR 964493
964219 KLSLKTNLPYTKAVLQETYRIRTATPLAIPRRTREDVNLMGYSIPKDTQVI 964067
963562 ILSNLWWVHNDPEHWDEPDVFRPERHLDMNGNLVMLNRVIPFSIGPRNCLGENLARIEIY 963383
963382 LFLVSILQKFVVLPNPPMTCNPGAVNAPSSYNIVLKE 963257

>ci0100139379 seq 7b 97% to seq 7a above only 12 aa diffs
MLDYLTFTNLALFAIALLLFYIWLTPKYKNLPPGPMGVPFLGCLPFMETLAERTFTKWSKKYGPIITVQL
GDHRTVILNSYEAIEEAFTKKGEKFNGRFKTYFTEFASEGLGIFFIDGEKWKEHRKFGTRALAGAGMYGK
TIEQRVLEEAENICHVIRSKNEEPFILEDYLILSVANVINGITIKERCDEEGNENLLKYVASVVEGFRKD
GLIYSLLSVIPWCRFFPQTQKHMKSMKDGLRMGN
ALFDNLIEQHQQQHDRLHPRDLIDLYLNEMKDFS
KPQLRYFLKDLFAAGTETSSSTIRWALFYLIVNPDIQDKVHKEIDDVIGHNGVVRYDTKLPYTKAVLQETYR
IRTATPLGVPRRNTEDVTLMGYFIPKNTQIIPNLWWVHNDPEYWNEPDVFKPERHLDEEGNLIMSNRVIP
FSIGARHCLGENLARTEIFLFLVSILQKFTVLPNPEDPNPPLECSPGIVNSPHKYPLIMKER*

>genewise.1257.28.1 seq 1, 50 and seq 10, 11, 22 similar to seq 27 and 209
MIELVLPDLSIAPTILTLICL
LLFYTWYRRPSRFPPGPRGFPIVGVLPFLEKYSERTMHKWSKKYGPVMSVRMGNDDWVVMGNYEAVYQSF
VKAGSAFSGRPYMKLANEITECKGLVMRDFNPT WKTQRKFGHLTFRGFGVGKKSLENRIYEEVEVMNKEI
LSKGGKPFDIS (0) 
EILTNAVMNVICNITADQRFEYDDKNFRLLQHILTKWY (2)
FLEPENTAAFANMIFVPLLCNIPPYRSKYLEVKRDMMQMA (1) 
DVFQELVDQHRISFDKNNIRDFTDAFICEVDNGKDDSFTDRQLVQYLREIFEAGTQTTTGTLNWAILCLIHYP
QAQKKLRNEILDVIGNNNRPSISDRKSMPFTSAFIQEVFRFRTRVRTGVPHKTTETVNFANYVIPKGTTI
VANLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVIPFSVGPRHCLGEQLARMKIFIFLVSMVQKFE
FLPDPNEPDLPEIDDGVIGNGFFPYPFNFVANEI*

>sequence 55, 80, 13 ci0100138298 SCAF 106 
MIEKFLPEWSNTGAVLMLVCTLIFYYWHQRPIHFPPGPRGFPLIGVLPYLGKYP
ERTMQIWSRKYGPVMSVRVGSEDWVVMGSQESIYQ (0)
SFVKGGNVFSGRPFVKVANEVTESKGLAMIDFNPTWKTQRKFGNVTLRG (2)
FGVGKKSLEERIIEEVEAVKMNILSKEGKPFDIT (0)
DILSNAIMNVICNITAGQRFEYDDQFFIQIRRHLW (2)
FMDPDRTAAFSAMVFVPLLCHIPPYRKKYEQIKQDTKEMM (1)
GLFQQLVDQQRKTFDKNNLRDFIDAFILENENGTDESFT (0)
DRQLVHYVRELFKAGTETSTGTLRWAMLCLIHYPGAQEKIRKEIFDVL (1)
GNSTFPSMSDRNAMPYTSAFIQEVFRFRTLAPLGVPHKTTDTVNFANYIIPKGTT (1)
ILSNLWAVHNDPTVWNNPRQFKPERHIDDKGKYIQSNHVIPFSVGPRHCLGEQ
LARMEIFIFLVSMVQKFEFLPDPNKTDLPELDDGVNGVAFVPYPFKLVAKEI*

>sequence 14, 37, 69 25 accessions 50% to 2C8 missing seq in mid
MLPSGYIFVFVFLCIYYLLQWRRRPKNFPPGPLGIPLFGIA
PFAGVDMHKYLATYYAKYGGVMSFRLATKDWIVLNDIEAITQ (0)
ALLKQGESFSGRPQSYLMNQLTEGCGIVFSTGPRWQAQRRFVLTALKT (2) 
LGMGTRSMVGIINEENQNFSSVIQSSGGQVNIL (0)
 (1) DIISEITEEHKNSFDENNLRDFIDIFLLEMKRRNSEYFT (0) 
ELQLLHLVRDLFVGAIDTTTATLGWGIICLLHYPECQVRIQEEIDDVI (1) 
GCAEPDMSHHESMPYLRAFIQEVHRFQTIAPLNIPHCVTEDCVLFGYHIPKS
TPVMSNIWRVHNDPKYWENPEKFSPERHLDSEGRFVPSNRVLSFAVGHRSCLG
VQLARVELFLFYASTLKKYEFQIDPEYGLPDWSNDRSGTVKTPKKFSVLLKSR*

>paired_scaffold_12 74% to seq 14 CYP2 clan
856107 VVFCLVCYLFRWKSRPKNFPPGPLGIPMLGVAPFAGSKLENYLASFYKKYGGVMSFRLGP 855928
855927 KDWVVLNDLEAITQ 855886
855465 AFTKQGESFSGRPKSVVFDQLTQGCGIVFANGDRWQHQRRFVLTALK 855325
852786 ADIISQIAENHKQSLDENNLCDFIDIFLSDMKKQKGDFFT 852667
852024 QLCHLIRDLFVGGIDTTTAALGWGIVCLLNFPECQDKIHQEIDQIIGK 851881
851686 SAVPNMSHQEKMPYLRAFIQEVHRFQTIAGLNIPHCVTEDCILYGYRIPKGTPVMSNIWF 851507
851506 VHNDPAHWQEPQKFRPERHLDGDGKFIPSNRVIPFSVGHRSCLGVQLAKAELFLFYAGVL 851327
851326 KHHEFQVDPNFGLPDWSRDDGGTLKTPKDFTVCIKE 851219

>sequence 16 and ci0100140915 they overlap at exon 6
MFCSIFEWLTSAGIITSLIGTFTFVVLVYYYWWKPPHPRYPPGVRGVPILGAIPLLSPFPHRVLLNWSKKYGAIFSARFGRSDA
VVLNDYTSVYQ (0)
AFVENYVSFQSRPDVQVVNEYTSGFGFAFSKGQYCLDVKKFSAQAIRS (2)
IGIVQIEKWVQEVAEGTVKQLGEFEGNAIDVK (0)
TCIGRKVADVMTMLLFGRFFGTKDQHYEQFLKDCFDN (2)
VTDPAIYSQVNALMFVPLLRHIPWFGNALSVHQNIHFRLK (1)
DFCRKEIEEHKKTLNVNEPRDYVDAFLIEMRNHSPRDSWFH (0)
EENLVNCLVDIYTAGTDAPTNVILWAMV
TLLHKPNLQHLLFHEINNKTSKFKNRYYSNEDLSNRWP
LLKAFIQENFRFNPPAPLSIPHQTNQGVKLDKYFIPKDTM
IISNIYAVNHDPKIWKNPYEFNIYRHIDTDGKFVPSTKVIAFGIGCRSCLGEKLGRLEIFYF
LANMIKRFEILPDPGSKDLPPINEGGNSILHLPIHFKVVFKPRENEDYFSL*

>ci0100141603 seq 16 like, scaf 2157 probably same as seq 16 on scaf 1266
EENLVNCLVDIYTAGTDAPTNVILWAIVTLLHKPNLQHLLFHEINNKTN
LSNRWPLLKAIIQENLRFNPPAPLGIPHQTNQGVKLDKYFIPKDTM

>sequence 18, 216 revised C-term
MSMLVNLITNFDMSIVLTASCAFVLVFYYWYKRPRNFPPGPRGIPLLGVV
PFLGKHTEKDFRKWSKKYGPMMSVRLGQNDWIVLGDHKTINE (0)
CLVKGGNSFSGRPDHPVFKQFTKNHGVAMVDYGDHWKVQRRFGLTTLRG (2)
FGVGKRSMEDRIVEEIEYLNNAIRSHNGKPFNIS (0)
KLFANAVSNNISSIVMGERMDYSDKIFETLTQRSTES (2)
IYNSDMAGKIIRFSIFVPSLWNLLLLFSTQASKQTSVFNSIRG (?)
QAIVNWVSQHKSTYDEHNVRDFIDAFIGEQKKETDVSFT (0)
DLQLIQYVRDLYVAGTETTTVTLRWAVRCLIHYPEKQNKLRKEIFDVI (1)
GREKTPSMKDKAQMPYTCAFMQELFRFRTLVPLGVPHKISNDVQVEGWSIPKGTW (0)
IIPNLWAVHNDPDVWDEPSKFKPERHLDEKGNFVQSKHVIPFSVGPRHCLGEQLARME
IFIFLVSMVQKFEFLPDPNEPRLSEVPQGVAGMMCVPHPFKQIVKEV*

>sequence 20, 87, 90, 107 ci0100147280 complete
MSGVSVCLSVVFFTGFLALCCWYKRPKNFPPGPRGVPFLGVIPFLGNYPARVMQKWSRKYGPVMSVRMGS
RDLVVLNNHENIQK (0)
QALVMQGQIFSGRPNMPALTQVTDGLGLATIDYTDHWKTQRRFGQTTLRG (?)
FGVGKRSMEDRIVEEVEYLNNAIRSHNGKPFDIL (?)
GILGNAVSNNTCSVVMGRRFDYDNERFKKIIATLSGL (2?)
FTNPKSNSTIILVLFMPFLMKIPPFSWINNKRVDGINEIKR (?)
LLREIVHEHELRFDKDDVRDFMDAFIAEQKKNDKHSSFT (?)
DLQLLQYVRDLFVAGTETTTSTLRWSILCLIHNPEKQEKLRKEIYKVT (1)
GQDRVPAMSDKAQMPYTCAFMQEVFRYRTLVPLSLFHATNDDVSLNGYRIPKGTT (0)
VLPNLWAVHNDPDVWDEPSN (2)
KFKPERHLDEKGNFVQSKHVIPFSVGPRQCLGEQLARMEIFIFLVS
MVQKFEFLPDPNEPNLPEIENGSCGTVFVPFRFKQIAKMK*

>SEQUENCE 21, 85, 193
MVFDVTMLNISPFIVFIITFVFVWNWYRRPPRYPPGPLGFPVLGVLPFLSNRP
ENKFLDWSKRYGPIISVSIGWQKWLVLNEYDVTME (0?)
ALVKNRLAFHGRPFVEFFIRTTVPPGIAFAPGGVHWKTLRKFGTNGIKA (2)
VGLENMEKKTNIEIQYFIEEIRGHKGKSFNIQ (0)
EPLTHAVANVVSSFCLGQRFDRSNDSFNQLIRSLMHQ (2)
GHVAANWRTMLLTFLPFLCDVWPFTKHMDESITNVRTLN (1?)
GIIRHFVEEQSASFDGDEVRNMVDAFLMHSRNSVDENVITDD (0)
KIIGFFSDMFLAGTKTITSQLRWGFLIMVKHQDCQNKVRREIDDVIG (1?)
TVKLKHRKLKHRSIMPYTCAVVHELFRF
RTVVPLSLPRMNVADVTVGGYRIPKGTN (0?)
VIMNIWALHNDDCRWREPERFIPERHLNADGKFMKPKNVLPFGVGARSCIAEHLARNKIFLFLVSTLKNF
TLSVPNGDSPSLDAGSNGTIFIPSDYNIVAQYRD*

>SEQUENCE 106, 23, 221 very similar to seq 18 
MSMLVNLVTNFDMSSIVLTASCAFVFILYYWYKRPRNFPPGPRGIPLLGMVPFLGKHTEK
DFRKWSKKYGPMMSVRLGQNDWIVLGDHKTINE (0?)
QCLVKGGNSFSGRPDHPVFKQVTKNHGVAMVDYGDHWKVQRRFGLTTLRG (?)
FGVGKRGMEDRIVEEIAYLNDAIRSNNGKPFDIA (?)
ELMANGVSNNISSIVMGQRMDYNDEIFKTLSKKAPNR
VYNSDTIGKITKLVLYMPFLRNLLFLFLSKPKEMNKINTLIRG
QAIVNWVSQHKSTYDEHNVRDFIDAFIGEQKKETDVSFT 
DLQLIQYVRDLYVAGTETTT 
GTLRWAVLCLVHHPEKQNKLRKEIFDVIGREKTPSMKDKAQMPYTCAFMQELFRF 
RTLAPLGVPHKISNDVQVQGWSIPKGTWLVCNLWAVHNDPDVWDEPS 
KFKPERHLDEKGNFVQSKYVMPFSVGPRHCLGEQLARMEIFIFLVSMVQKF 
EFLPDPNEPQLSEVQQGVSGFMCVPHPFKQIAKEV

>SEQUENCE 26, 81 no introns 37% to 2U1 human 41% to seq 2
MEITTFLLCMLMLVLIMFMWRRSSTNFPPGIEGIPLFGCIPSLGMHKERTIAKWGREKG
WPIFYLRMGITKIEVLNTFESIEEAYVKKGQYFIDRHLPIFLRHYSTGKGVVFLNGEK
WKVQRRFGIHTLRDLGMGKSGMESKIVHEIDHLSEYLVNNNQESGLVDITPIITRAVSNIVC
QLVFNRRFNLDEEFLFIDSVNNKLMKSVTTLKQIMLTLQHIPIFNRIPAIAKIK
KYRTDLANNRLQR
MRNYIKSHKETLDKNNPRDFIDAFLIASEEDKSEDTSFDDEQLLYYIQDLFFGGTDTTSF
TLTWTFLYLAMYPDKQEQLHAEIDEVLGSNGIVSMSCLDKMPYAQAVMQEIFRLRPLVPL
SMPRKTLSRITLMGYDIPKGTVFMPNIWSVHHNADRWPNPEDFMPERHIDSEGKFVKSKE
IIPFSLGPRNCLGIRLAEMEHFMFMIGLLQKLHFSLPEDQPPPDVRGSSFVVLLLPPDFNMKITTR*

>Ortholog to seq 26 in Ciona savignyi no introns paired_scaffold_207
MDFTSVLLFLLVVVLVFLWNRRGSGNFPPGFDGVPLLGCIPLFGA
1063884 HRERSIAEWGREKGWPIFYIKLGITKVVVLNTFESIEEAFVKKGQYFVERHIPIFLKRYS 1063705
1063704 KGGKGNVCFPMGRFGNTSARFGIHTLRDLGMGKSGMENKIQSEIEHLSDYLVNSPNI 1063534
1063533 VGKKSVDIVPIITKAVSNIICQLVFNTRFNLDDKITFIDSAAGIKLVKNASWVRQVKMTL 1063354
1063353 CYVPLLNRIPTIAKLKQEREALHNTRHDLLRKYIQMHKETLDKGNPRDFIDAFLIAGEQD 1063174
1063173 KEENSSFNDEQLLFYIQDLFFGGTDTTSFTLTWTFLYLSLFPEKQKRLHD 1063024
1063023 EIDEVIGSSGNISMSFLEKMPYAQAVMQEIFRVRPLVPLGMPRKCGQNISLMGHNLPKDT 1062844
1062843 SVIPNIWSAHHNPDRWPEPDSFLPERHIDSNGNFIKSKDIIPFMLGPRNCLGIRLAEMEH 1062664
1062663 FMFMVGLLQKLQFALPAGQGKVDTRGSSSIVLLLPPDVDLVITSR* 1062526

>Ortholog to seq 26 in Ciona savignyi no introns
MDFTSVLLFLLVVVLVFLWNRRGSGNFPPGFDGVPLLGCIPLFGAHRERSIAEWGREKG
SWPIFYIKLGITKVVVLNTFESIEEAFVKKGQYFVERHIPIFLKRYSKGGKGIVFSNGEI
WKHQRRFGIHTLRDLGMGKSGMENKIQSEIEHLSDYLVNSPNIVGKKSVDIVPIITKAVSNIIC
QLVFNTRFNLDDKITFIDSAAGIKLVKNASWVRQVKMTLCYVPLLNRIPTIAKLK
QEREALHNTRHDL
LRKYIQMHKETLDKGSPRDFIDAFLIAGEQDKEENSSFNDEQLLFYIQDLFFGGTDTTSF
TLTWTFLYLSLFPEKQKRLHDEIDEVIGSSGNISMSFLEKMPYAQAVMQEIFRVRPLVPL
GMPRKCGQNISLMGHNLPKDTSVIPNIWSAHHNPDRWPEPDSFLPERHIDSNGNFIKSKD
IIPFVLGPRNCLGIRLAEMEHFMFMVGLLQKLQFALPAGQGKVDTRGSSSIVLLLPPDVDLVITSR*

>sequence 31, 98, 200 1 accession 49% to 2U1 ci0100148485 missing exon 5
MLLNNGTMLEVPGSFVSYLTVCKFLTCTAAYLLYRHWYQMPRSFPPGPRGIP
ILGVLPLLGRHPERKLNEWKREYGPVMGARVGRKNWVVLGDCASIKMVIYLR (2)
DIFSGRPSPHKKEDGIVFLDNGRSWTMHKTLGLLAFRG (2)
FGITENKSLEKHVIEEVSYLKEAVRSKRGVAFSID (0)
TILTNAVSNVLCSILLGQRFQYSDQRFKRIIHQLSRR (2)

XXTLLGFINIHEKTYDSTNIRDFIDAFLLLKHRQLDESFN (0)
DKQLIHYIYDLFLGGTETTTGTLRWAILCLLHYPEKQAKLRKEIIQVLG (2)
DQEVTILKKHEMPYASAFIHEVYRFRTLFPLGLPHKTSHQVILDSYVIPQGTT (0)
VCTNLWAVHNDPKVFDEPEEFKPERHLYGNGEFGRSPYVIPFSV
GPRHCLGEQLASMMLFIYLVSLVRSFEFLP
DPKLNGLPDIRSGASGPVFLPKSFNVVAREL*

>paired_scaffold_395 70% to seq 31 CYP2 clan
215710 NFISISAFLFLVAMYAIYRYWSMKPRNFPPGPTGLPVFGVLPYLGMRPEQTFNKWKKTY 215886
215887 GPVMGVRMSKKDWVVLGDHSNIK 215955
216531 HEILSIQALLCEGGSFSGRPTFNNKGRKGIAFLDFGRKYNFQRSLGLLVFSG 216686
217183 GLGKNSLEQHVRNEAPHLIEAVRGQSNKPFDIE 217281
218541 QLIHYISDLFICGTITTAGALRWALLCLLHHPDKQDKMRREIIEVLGRKGIERLS 218705
219016 IFHISGDKDVTMLQKADLPYTSAFIHEVCRYRTIFPLGLPHKTSYDVLLDSYFIPQGTT 219192
219672 VCINLWGVHNDP 219707 (FS)
219707 SLFDEPGEFRPERHLYEDGKFGRSDYVIPFSVGPRHCLGH 219826
219827 QVASMMLFSHLVNLVRNFEFLPDPTDPNLPDITNGSSGILFLPRNFKLVAK 219979

>sequence 33, 116 ci0100137396 75% to seq 64 complete
MLPAFLGSLTSNYVTFCLTGFFTLVLFVYYYWWKLPHPRYPPGVRGIPVLGALPFLGKSAHKVIMRWSRE
KYGPIMSVRFGQKDSVVLNDYESIYEALVKQGPAFQTRPRIKVIEAHSNGYGFGFAAGHKKFMQVRHFTG
KALRGLGIGGLALEERVSEVAQELVLSLQKLDGKPTNLRMLVGTTVGNVIASIVIGKKFHPNDKEFQRYI
QLIFESFGDKETKKYFLVLIYFSFLRHIPPFKNACKQFIADHREQYDFCRKEIEEHKNNLDVNKPKDYID
AFLIEMKKYSPENSWFH EESLRVCVADLFYAGTETSTSTILWGMIALINYPEIQEKLHKEIVNATGEELP
RLDHKDELPLLQAFIQELYRCMALIPLGIQHQTTKNVDICGYCIPKDTVVFTNIHAVHHDPNIWKNPSEF
NIYRHIDEEGKFIPSRKVIPFGIGCRSCLGEKLARIEIFLFLANIIKRFEVRPDPDSEPLLPIDDGITGF
GFLPFLFKAVVLPRTNKGTQ*

>sequence 36 7 accessions + 4 ESTs LOW 30% RANGE TO 2 FAMILY MEMBERS, NO INTRONS
MASVLTVVGDYLNIWSIVLGGSA
LIFYMWSRKGNLPPGPRGYPILGAMPYLVVHPEKVITKWSREIYGPILSVPLL
NRTIIYLNTYEAITEAFSKQGLKLAGRPYMFLFQQISSDLGITMKTYSEHFKVQRQFGMR
SLKPSRIESLVIQESRYFIDQLHDQVGKVYDMKTDVYNFT
ANIICSVVLGKRFDYDDPAFKLILESSEKALGEPEDANLMMAMILIPQLRFLP
PFHGANQRFLKNQHRILSMLLGIIEEHQKNFDPNNVEDFIDAYIYEQKFGIGK 
NTKLFENDQLKVYVRDLFIAGVETTSNTIQWSILALLYNPK
YMDLMYNEVYEALGSDGVPCMKDREKMPVTCAFIQEIMRFRTITPGAAPHSAMEDIKLNGYDIPSGT 
MVVANLWALHNDPDTWPEPEQFNPHRHIDSD 
GKFIHSPKILPFSIGPRYCMGEGIAKAEIFIFLTSLLQKFTV 
SRDPNAVYPEEIEQGIVSAMSCPHRYNIILTAR*

>New Cs 74% to seq 36 complete no introns paired_scaffold_10 CYP2 clan
106197 MDLVLYVCSCVNVWTLVILGCAALFYLWGGKKKNLPPGPRGLPFLGSMLHLVNHPE 106364
106365 KRVGKWSREAYGPVVTVKLINRSVIYLNTYEALVEGFSKQGVKLAGRPNIMLFQQIAKGL 106544
106545 GITFKTYSEHFKVQRQFGMKFLKPSRIESLVVQESRYFVNQLAEHNGKVYDMKTD 106709
106710 VYNFTANIICSIVFGRRYDYNDPFFKMILESSEKALGDPADAAIMLPMILFPGLRHIP 106883
106884 PFRGANQRFLNHQSEILSMLLEVIEEHQKKFDPDNIEDFIDAYIYEQKFGLGK 107042
107043 NTKLFENDQLKYYVRDLFIAGIETTSNTIQWSVLAFLYHPEYVSLMYNEIYEVLGSDGVP 107222
107223 CMKDRDKMPKTCAFIQEIMRYRTLVPGGAPHCPTEDIKLNGYDIPAGTMVLANLWAIHND 107402
107403 PTTWPEPEKFDPRRHLDSDGKFVHSPKILPFSIGPRYCMGEGIAKTEIFIFLVSLLQKFT 107582
107583 ISRDPDSFYPDEMDEGIVAALSIPNRYNIILRAR* 107687

>New 53% to seq 36 paired_scaffold_178 CYP2 clan
718747 GYAFVSCCILFFIVYRGSKERRLRLPPGPSFLYIMVALPRLIQNPERTIRHWCAQYGPLVTVRLFNRN 718544
718543 IVYLNTQETIKAAFLKHKLAGRPQLFFFSQLTEGFGLSTRDYSDDFKVIKRFGMQVLK 718370
717875 ARQCMEHLRRNTGKPCDLKTDIYNFTGNITSKTLFNTRFEYDDCEFRKVLLASERSLGIP 717696
717695 EHSNAIKALLLFGWLRFVPPFSEVNKDFLKNQNSVLQFIEQHIQEHQIKFDADKLSDFI 717519
717518 DEYLYEKRSEGEKSISFEDKQLVMYLRDLFAAGIETTSNTIQWAILALLYYP 717363
717362 EYAELIYGEIYKAFGSTGVPSMKDRGKLPLTCAFIQEIMRFRTLAPSGAPHRAVERTELN 717183
717182 GYDIPPGTIVLANLWALHNDPEIWPEPQNFDPQRHIDIDGNFVPSHNILPFSVGSRYCMG 717003
717002 EKIAKTEIFIFIVSLLQNFTVHRDPSFQWPSEMEEGVMSALCCPHRYRVVLHNR 716841

>sequence 39, 60, 74 ci0100154493 complete
MFCPFEFHEVLVGIITFILVWSCIRRPTRYPPGPRGVPILGVLPFLSKYPEREFLVWSKRYGSLLSVSIA
WQRWIVLNDYDITMEVLVKNRMIFNMRPAIRFYQEALDPLGIVFSPGGAIWKAHRTFDFYILLFHSIGQE
RMEKLIMTEIDYFIAELEKHNKKSFDIQDTLTHSIANVMCRFTLGERFDYKDVRFKKLVGCIIHVGKVTK
DWKTMFLTLLSPLAHIWPFTRHVNEFKNAMKTFNDQIRIMVEEQQSSFNGIEVRNMIDAYLLKSQSFVSC
NEKYTVE DIVGFFSDLFIAGTETLSSQLRWGFLIMMLNPECQENVRNEIHAVIGGNGRARLSDRYRMPYT
CAVIQEIFRFRTLGALSLPRTNVVDVEIEGCKIPKGSKVILNTWAIHNDERNWENPQKFIPERHLDKDNF
INSKKVLPFSVGARSCLGEQLARAEIFFFFVATLQQFELKFDPNAQVPSLDDASNGVIFSPNPYTMIVER
INEDRSNNNNNSKVI*

>paired_scaffold_51 62% to seq 39 CYP2 clan
395028 FVAICVAIAILLRNWLKRPSNYPPGPLGIPGLGVIPFLSTFPQREFLKWSKKYGPMFSAR 395207
395208 FGLQKWVVLNDYDSVVQVI 395264
395383 LVKNSQSFHGRPSIRLFIETWTPLGIVFSTGGTIWKTHRNFGTRAVR 395523
399563 QDTVGFFTDLFIGGTETLTSQLRWGFLTMALHPEYQTRVRCEIDDVIGEN 399712
400560 GRNGTAKLSDQSKMPFTRAVIAELLRFRTLGPLSLPRAAVKDVTVQGYKIPKGT 400721
401044 VFLNAWGLHNDEKIWETPHLFIPDRHLDPDGNFVSSKYILPFGAGVRSCLGENLARAELF 401223
401224 LVFVATLQNFELSVDENNPPSLDEGRSGIIYTPNPFNLIINRR 401352

>sequence 40, 19, 88   missing exon 1 N-term not found in 11000bp ci0100138492 
LPYPRGLPIIGNIHQMGNFPHVKLTEWSKQFGDFYRIKMGRYDALVVNGHENIR (2)
NCLAKKSAAFAGRPPFETSKLIEEGLSISFSNYS (2)
PEWERQKQCTIKALKLYTSGSDKRSTMEETVSSHAKQLAEDLINSADQQ (0)
GLVGDLHDTVIYSTTSVSSTICFGRSFTRQDPELKEFLRNFQSFDKAMGASQIINFWPFLKYFPVLGKSFR (0)
NLKTYMDQYWNFTLSMLEQHWDTYVPNNMRDLADCLWAQSNQ (0)
NRQLTDQQRRIAYGASDAFGAGFDTISAMITWSIFYMAVFPEHQRK (0)
IREEIDRLETSMFSLRHHGDVCPYTQAWLYEVLRH
ISVSPLLVPHYTVKQVEVNGTMIPAGVVVLFNVAN (0)
ADRDTRVWENPEQFEPERFLARDPTTGGARVVASETSKI
LNWGAGKRRCPGAELSRHELFIYIANLVKLCYIE
QAVEGIEPAIPWPCTPGISTKPKAFRVKVTQR*

>savignyi ortholog of seq 40 56% identical N-term not found 
LPSPRGLPIIGNVHQLTTSPHVKLSEWAKEFGDLFRIKMGCFDTLVVTGYDNIR
(2) TALVKHSVAFAGRPPYETSKLFSNGLSLAFNNY
(2) SPAWEKQKRCTVKALKLYTAGPDLQKRNAMEDTASYQANLLVDQLLASVNK (0)
DAITNPDEIVHHSATNVISNICFGRSFSKNDPELQKFVSINRAFDRAMGSAQIVNFWPFLKSVPVLGRSYQ
NLKAHMDVFWDFVFPNLKEHWKTYNPSNIRDIADCLWYQSH
TSSKRDLQRRIASAASDIFGAGYDTTHKVVLWSLFYMAAFPQYQQKV
RDIFRVSEVKMY
TLRHHGDECPYVQAWIYEVLRHTSLAPILLPHYTTKEVTLNGVRIPAGVV
KKYHTIQAHKDPKIWKNPDEFDPGHFLEEDGSKLRSEAVHKLLSWGAGKRRCPGAELSRHE
IFVFVTTLVRRAYIGQAVDGVEPAFPWNTTGGISISPDPFRVKITER

>paired_scaffold_3 0rtholog of seq 40 100% match CYP2 clan
83836 PSPRGLPIIGNVHQLTTSPHVKLSEWAKEFGDLFRIKMGCFDTLVVTGYDNIR 83678
80325 PYVQAWIYEVLRHTSLAPILLPHYTTKEVTLNGVRIPAGVVVIFNAGQ 80182
79669 HKDPKIWKNPDEFDPGHFLEEDGSKLRSEAVHKLLSWGAGKRRCPGAELSRHEIFVFVTT 79490
79489 LVRRAYIGQAVDGVEPAFPWNTTGGISISPDPFRVKITER 79370

>sequence 41, 77 
MLRIFAFASQISMETLLTSAAILTCVLFVVYDTWWRFLGTSGTLRGRN
GFPVFGCFPLLGKNKERVFMNWSRTSYGPIYYARLGASRVLVLNGYEAVREALVLRPQ 297
AFAGRCKTELMKLLYEGQHGIMQLDHGPMWKAQRKFGQNALGSLGMGKRSLQ (0)
QQIVNEFNCLSSTLEXELLANGSKGAYLDEMLWLYAVTNISLLIFNESFDPDNEK (2)
RERFLYLLMYVPFLKNLPLVRQQLNVTIQGRKTKLAKFKQFVENHRQSRDPDDPRDFIDCYLNTMDSKSANDS (2)
FTEKQLLYFMNDLFIAGTETSSNTNSWALLLLLKNPEVMKRLRDEIDSLGREPTL (0)
EDEYKMPYTRAVMQEIFRYRPALPCNI
IPRKTTTSIELNGHSIPAGMPVIANLWSVHHDPVTWAPDPE
TFRPERHLNRDGEFMTSDHVMPFSVGLRSCIGRNLARNELFIFLTSILRRF
NLEFADGCYVDDMTGESGLLLHPPSYKVKGSLRA*

>Cs41a Savignyi ortholog A of seq 41 74% to savignyi ortholog A of seq 41
MLYLSLFSSNFSIEVVLTCFCLFLLLVYNVRVRHGGTSGTLPGKDGFPIVGCFPMLGKHKE
RVLMKWSTSSLGPMFYIRLGTSKVLVLNGYNAIKEALEQKPLPIAGRVESKFVKMLYDGD
HGLIQLDYGPMWKEQRKFGQTTLGGLGMGKRSFQ (0)
SKIRDEFNCLAATLESELSINGPKGVNFTSMLRLYAANITSSLVFNQTFDSTW (2)
RERFLIQMMYMPWLKYLPIVWKQLALSKRCRRDKVKIFQALVDKHRKSRDPNDPRDFVDCYLNVMEKKQGKNS (2)
SFTEKQLLYYIVDLFLAGTETSANTNAWAILLVLKNPQVKAKLRSELEAFGKSPTL (0)
EDEAKMPYTRAVMQEVFRYRTSARLNIVARRTT
SDLELNGYKVPANMPVIANLWSVHHDPLTWAPDPETFRPERHLDGDGNFIPNDHVMPFSV
GARSCIGKNLAKNEVFIFITSIFSQFDLELADGCHVDNMDGDSGMLLHPPEYRVKGSIRV*

>paired_scaffold_186 Length = 1694611 Cs41a CYP2 clan
846393 PGKDGFPIVGCFPMLGKHKERVLMKWSTSSLGPMFYIRLGTSKVLVLNGYNAIKEALEQKPLAIAGRVESKFVKML 846166
846165 YDGDHGLIQLDYGPMWKEQRKFGQTTLGGLGMGKRSFQ 846052
844429 MMYMPWLKYLPIVWKQLALSKRCRRDKVKIFQALVDKHRKSRDPNDPRDFVDCYLNVMEK 844250
843607 TFSSFTEKQLLYYIVDLFLAGTETSANTNAWAILLVLKNPRLRQ 843476
843085 EDEAKMPYTRAVMQEVFRYRTSAPLNIVARRTTSDLELNGYKVPANMPVIANLWSVHHDPLTWA 842896
842715 DGCHVDNMDGDSGMLLHPPEYRV 842647

>Cs41b Savignyi ortholog B of seq 41 74% to savignyi ortholog A of seq 41 
  1 MLFSSYLTFNFFIESLLAGLCLFLLLVYDVFIRLR 35
 36 GTSGTLPGKDGFPLVGCFPMLGKHKERMLMKWSSSGLGPVFYARLGASRVLVLNGYDVIK  95
 96 EALLQRPLAMAGRVESDLIQELYDGKHGLIQLDHGPLWKEQRKFGQTTLGGLGMGKRSFE 155 (0)
156 SKIVDEFNCFATTLENELNMNEPKGVNLTRMLRLYAANVISSLIFNQTFEISC 208 (2)
209 RERFLILLMYVPGLKNLPIVRQQLAITTRTRLDKVKIFRTLVEEHRKSRDPNEPRDFIDCYLNVLAKNKGENS (2)
    SFTEKQLLYYIADLFLAGTETSATTNAWALLLVLKNPEVKRKLRAELDALGKQPTL (0)
    EDECRMPYTRAVMQEVFRFRPAVPFNI IARKTTSELELKGYKIPAGMPVIANLWSVHH
    DPVTWAPDPEIFRPGRHLDDDGNFVPSNHVIPFSVGARSCIGRNLAKNELFVFITSVLRR
    FDLELADGCHVGDISGDSGMLLHPPKYKVKGFG*

>paired_scaffold_298 Length = 2411369 Cs41b CYP2 clan
122126 LCLFLLLVYDALIRLRGTSGTLPGKDGFPLVGCFPMLG 
122240 KHKERMLMKWSSSGLGPVFYARLGASRVLVLNGYDVIKEALVQRPLAMAGRVESDLIQEL 122419
122420 YDGKHGLIQLDHGSLWKEQRKFGQTTLGGLGMGKRSFEVTV 122542
122890 KIMARSNILFSRERFLI 
122941 LLMYVPGLKNLPIVRQQLAITTRTRLDKVKIFRTLVEEHRKSRDPNEPRDFIDCYLNVLAKN 123126
123516 TFSSFTEKQLLYYIADLFLAGTETSATTNAWALLLVLKNPEVKRKLRAELDVVGK 123680
124394 EDECRMPYTRAVMQEVFRFRPAVPFNIIARKTTSELELKGYKIPAGMPVIANLWSVHHDPVTWA 124585
124586 PDPEIFRPGRHLDDDGNFVPSNHVIPFSVGARSCIGRNLAKNELFVFITSVLRRFDLELA 124765
124766 DGCHVGDISGDSGMLLHPPKYKV 124834

>Savignyi ortholog C of seq 41
    RERFLILLMYVPGVKKLPFVRAQLAITTRTRLDKVKFSRPFGEKHRKSRGPNEPRDFHDCYLNVLAKIKGK (10)
209 RERFLILLMYMPGLKNLPIVRQQLAVTTRTRRNKIKIFQTLVEEHRKSRDPNEPRDFIDCYLNVLEGKKEENR 281
    SFSEKQLLYYIADLYMAGTETSANTNSWALLLVLKNPEVKKKLKAELDALGKQPAL
338 EDECRMPYTRAVMQEVFRFRSALPFNIIARKTTSELKLNGYKIPAGMPVIANLWSVHH 395
396 DSVTWAPDPEIFRPERHLDDDGNFVPSNHVIPFSVGARSCIGRNLAKNELFVFITSILRR 455
456 FDLELADGCHVGDISGDSGMLLHPPKYKVKGSVLM* 490

>paired_scaffold_103 85% -92% to Cs41b ortholog C of seq 41 CYP2 clan
391595 LVYDVFLRLRGTSGTLPGKDGFPLVGCFPMLG
391691 KHKERMLMKWSSSGLGPVFYTRLGASRVLVLNGYDVIKEALVQRPLAMAGRVETELIQEL 391870
391871 YDGKHGLIQLDHGPLWKEQRKFGQSTLGGLGMGKRSFEVTI 391993
392972 FLILLMYMPGLKNLPIVRQQLAVTTRTRRNKIKIFQTLVEEHRKSRDPNEPRDFIDCYLN 393151
393152 VLEGKKEEN 
393725 FSSFSEKQLLYYIADLYMAGTETSANTNSWALLLVLKNPEVKKKLKAELDALGKQ 393889
394143 EDECRMPYTRALMQEVFRFRSALPFNIIARKTISELKLNGYKIPAGMSVIANLWSVHHDPVTWA 394334
394335 PDPEIFRPERHLDDDGNFVPSNHVIPFSVGARSCIGRNLAKNELFVFITSILRRFDLELA 394514
394515 DGCHVGDISGDSGMLLHPPKYKV 394583

>sequence 43 seq 43 shares an exact run with seq 24 ci0100132600 complete
MFAKLFGWWGSSWVTTCLLGLLTLVILVYYYWWKLPHPRYPPGVRGIPVMGALPLLGKFAHKVIMRWSHE
KYGPVMSVRFGPSDSVVLNDYESVYEALVKQGPAFLTRPNIELINSYSNGYGFGFAEGNKKYMEVRHFTL
KALRGFGIGGRVMEERVSEVAQDLVQSLQELDGKPTNFRMIVGTTVSNVIASIVLGKKFHPKDEDFQRHV
QLIFDSFGDEETASYILVLMYFPLLRHIPPFKNACKKFVDVHFKQFDFCRKEILEHKKNLDENEPGDYID
AFLVEMKKHSPQDSWFHDESLMVCVADLFLAGTETSTSTIMWGVVVLINYPEIQEKLHQEITNTTGEALP
SLNHRDDLPLLQAFIQEVYRCMTLVPLGVQHQTTKDVEISGYCIPKDTVVFTNIDAVHHDPNIWKNPSEF
NIYRHIDKDGKFIPSKKVIPFGIGCRSCLGEKLARIEIFLFLANIIKRFKVLPDPESQDLPPMDDGITGF
GFFPFPFKAVVLPRKNEEIPE*

>paired_scaffold_129 60% to seq 43 CYP2 clan
740127 EAVLNNLGVSVLIGLVTFCMLLYQYWWKFPHPCYPPGVRGVPVLGALPFFGKFAHKTILQ 739948
739947 WSREKYGKIMSVRMGPNDAVVLNDYDTI 739864
738865 IDEHKKNLDENSPGDFIDAFLIEMKKHSPTDSW 738767
738553 FQEKQLLLSVNALFIAGTETTTSTILWAMIVLLNYPNLQYILHKEL 738416
738098 PSVKHGDNLPLLQAFIQELYRCITIAPLGLQRTNTEDVEIGGHFIPKNTVV 737946
737857 ILPNIYAVHNDPVIWPEPSKFDIYRHIDKDGKFIHSKKVIPFGVGARVCLGKKLAQLEVF 737678
737677 IFLANIIKKFEIIPDPETNEIPPIHDGLSGAGFFPYPFKVIVKSR 737543

>paired_scaffold_129 61% to seq 43 CYP2 clan
744715 LLDAVLNSLVVSVLIGLVTFCMLLYQYWWKFPHPCYPPGVRGVPVLGALPFFGKFAHKTI 744536
744535 LQWSREKYGKIMSVRMGPNEAVVLNDYDTI 744446
743445 IDEHKKNLDENSPGDFIDAFLIEMKKHSPTDSW 743347
743125 FQEKQLLLSVNALFIAGTETTTSTILWAMIVLLNYPNLQYILHKELMTETGMP 742967
742669 PSVKHGDNLPLLQAFIQELYRCITIAPLGLQRTNTEDVEIGGHFIPKNTVV 742517
742428 ILPNIYAVHNDPVIWTEPSKFDIYRHIDKDGKFIHSKKVIPFGVGARVCLGKKLAQLEVF 742249
742248 IFLANIIKKFEIIPDPETNEIPPIHDGLSGAGFFPYPFKVIVKSR 742114

>paired_scaffold_315 67% to seq 43 CYP2 clan
1356644 VSVVIGLVTLSILVYQNWWKLPHPRYPSGVRGIPVLGALPFLGEMPHTVLQRWSCEKYGP 1356823
1356824 VMSVRLGPNDAVVLNDYDTITEA 1356892
1356964 ALVKHRSIFQLRPQLKIITDYSKGYGFGFADGHKQYLEVRQFTLKALRG 1357110
1357173 GIGKRKMEVRVFDVAQELVQTLHDLKGKAIDLK 1357271
1357919 QIEEHQRNLDENEPTDFIDAFLIEMKKHSPEDSWF 
1358090 TDILFQNDELLQCVADLFLAGTDTSSNTISWAVSVLLHFPDLQTKLFEEISSKTG 1358254
1358325 EKPPAVEHRQELPLLQAFIQEILRCFSIAPLSLQHKASEDVEIGGCFIPKHTLV 1358486
1358836 ILTNLYAAHHDPKTWKDPSSFNIYRHIDKDGKFVPSKKVISFGIGARSCLGEKLGRIEIC 1359015
1359016 LFLANIIKTFEILPDPKSNRLPSIDRGIAGFLYIPYPFKVVMKSRI 1359153

>paired_scaffold_329 63% to seq 43 CYP2 clan
242020 LITLIMIVYHYWWKLPHLRYPPGVRGIPILGALPFLGRFPGKVIKEWSLKKYGPVMSVRF 241841
241840 GPNESVVLNDFESIHE 241793238896 LLQCVADLFLAGTETSTTTLMWCLAVLLNYPDLQSVLYEEISRETGKVMVDK 238726
238687 QKLPSVDHKENLPKLQAFIQEIYRNLTLLPLGLQHKTMADVELCGYHIPKDTV 238529
236483 IFTNLHAVHSDSKTWEDPYKFDIYRHIDKNGKFVASKKVIPFGIGARSCIGEKLAQTEIY 236304
236303 LFLANIIKRFEILPDSGSKHPPPLNDGVTGFAFTPFSFKVVLKIR 236169

>sequence 231 80% to seq 64 may be in gap at end of scaf 311 close to ci0100140915 = cibd084l24
FYRKEIKEPKNNFNVNEPKDYVDAFLIEMKKHSLEDSWFH (0?)
EEALLICVADLFVAGIETSTGTVMWGMTVLINYPEIQEKLHKEIVNAT (1?)
ETPPNLDQMDELPLIQAFIQELYRYMTLAPLSLQHETTKDADIGGYCIPKETL (0?)
VLTNIYAVHHDSNIWKNPSEFNIYRHIDKEGKFILSKKVIPF (?)
GIGCRSCLGDKLARNEVFLLLANIIKGFEILPDPESKELPDYKDGVNGFLYVPYDYKLVAKPRIVNE*

>sequence 46, 54   33% to CYP1A
MMITAAILLDAGRSFAVPVAFTAVSVLTLYVCLRKRQGIPPGPTAWPLVGNL
FSMGRQSHLILESMRKTYGDVFSVYFGSTLVVVVNGKAVEECLSTHSAR (2)
YSMRPELHTAQYILEGKSFAFSHIAVSKHKRYRTLAVAVVKQLVNGGGEKTDVAV
KHGLQNGTRHSSIEERIFMEAACMCDKLLETSDSPDLKDEILKVITKEL (2)
LSEYELDEISRVVENLRNSNEAIMLVNFIPAVRMLWRNGLQKYIQLTQSLNR (2)
FFERCIRNRKAQLATVSNGHTEDNGVRLTNGVDCTVKFWQKLKNDPQYEESRVMKV (0)
VADLFGARVDTMTVALAWMIVYWSTYQAAQERAQKEIDHFVKNEKRLPR (2)
YSERNQLPYTMALIMEVERHCSFVPFTLPHAPAQDTMLNGYLIPKGTMMLISMRSINHDTAVWDSPAQFR (2)
PERFLLDQSGGFNSALAEQVMLFGAGRRRCAGEALGRMQIFLYSVLFLRKCTFRR
SDKDGHVLPESLAGISLIPQTMCVSISRREADGSKNTEP*

>paired_scaffold_63 75% to seq 46 CYP2 clan
595236 SICLPITAFALSLIYLHRRKRDNLPPGPFAWPVLGNLLSLRSNSTAALEEIRRTYGDV 595063
595062 YSLYFGSRLVVVVNGKAVEECLSTRSAK 5949795
594724 RFSMRPELFTAQYVLGGKSFAFSHMDVETHRRYRKLAVGVVKELLVSTHERSQPTTMEEV 594545
594544 NRIPPQSIEDQIYAQAKRLCVGLFDIYASNSKSGQLDIRKEIMRRISFEM 594395
594161 LWEHELADLSELVEDLRNSNDATLILNFIPISRYLWKKGLRKYIKINQDLNK 
592629 FFSRCFDRRNPHVANGSDCCKSEETCDVLSGIDCVLKLWQQLKDDPQFEENRVMKLVRKLFKCN 592438
591699 VGDLFGANVDTMTVALAWMIVYWSTYHQAQTRAQEEIDRFVETNFHLPRY 591550
591042 RYSDRSQLPFVMALIWEVARHCSFVPFALPHAPVEDTTLNGYLIPSGTVMMISMRSVNHDQTLWDS 590845
590844 PGEFR 590830
590562 PERFISSETGVFNKGLADRVMLFGGGRRRCAGEALARMQLFLFSVSILRSCTIRRVDHS 590386
590385 DVLPD 590371

>sequence 49, 67 opp end of clone is seq 67 join together
MLVQILTATFWTLIP
NSFGDLLIYAILVLTIVIYVKSLKRDKEWLALPGPIPW PLVGNA PFLGAEPHKKLLELSL
KYGPVYRLKMGGIKTVVLCNAEVVRSALIKQREAFSGRPKFSSYKAVS AGESVVFNDEET
LPP WRSH KSKIVRHMHKYTTSIRTRDKVTDLINTECMMMVTELDRISRSKCVNPENVIRM
ALANVMCAVCFGNRFEYDNE (0) 
EFQKLLSMNTEFGAVIELGPIIDAMPWIK (0)
VIPKFKKAIADYLKINLQLDTWSRHR (2)
VDGVLKTFDNDDVTNVVASMTSEVLEKKSAGESREITESETKTIAALSADILGA 
GQHTTSTTFFWVINLLLCFPKVLNKLTEEVRSKLGNRLPTLEDRTSLPYMDAVLTE 
VLRFSSPLSSTIPHSTLKDVKLAGHTIKRGTMVIISQYAVNHDPQNWKNPENFDPERFLTK
NEGGEIIFNESLSEKVLAFSIGERKCPGSQLSRMLLFLATTLLVQVSDLSADLERPPT
AAAEYGLILRPKHLSIKLTLREHWQRRDSIRA*

>paired_scaffold_56 66% to seq 49 CYP2 clan
957040 VLIYISMVSIVVIYVKSVKRNKEFMALPGPTPWPIVGNAPFLGKQPHKTLLQLSQK 956873
956872 YGPIYRLKMGSVEAVILCDLDVIRCALIKQREVFSGRPKFESYKAVSAGESVVFNDSESL 956693
956692 APWKSHKSKILRHLHKFATSVRTKEKVNNIITTECMLMLQCLHRRSQDGFVDPEDVIRMT 956513
956512 IANVMCAVCYGNRFEYENE 956456
950636 GQHTTSGTFFWVINILLFYPKVLQRITNEVRSKIGERIPTLEDQADLPYVEAFLTEV 950466
949639 VLRFASPLSSTIPHSTTKDTTLKGYKIKRNTMVIISQYSVNHDPKIWRNPEVFDPERFLTRDENTNLVFND 949427
949426 ALAEKVLSFSVGERKCPGSRMSQMVLFLATCLLVHTGTLYPNPDRPPS 949283
949282 PVDDAQYGLILRPEYISMKFLLDKKW 949205

>SEQUENCE 25, 35, 93, 89 end is long but = EST rcies001a09
MDSLVFVLVDTVLVMKYQILLLLVIVYAIKLLAASQSRRLNIPGPYPWPVIGNVIEMGGQPQFSLTNMAK (?)
RYGPVYLMKLGTADVLVLNNYEVIKEALLRQRRIFGGRPIFDSFKKISQGLGVVFNSTMT
QGDEWMKLKMTIVKHVHRFVSSEETKGYVAHHVQMEAVELVRILTEKCRS
SPNEVIFPIEQINLAIANVVCAIMFGHRYQHGNK (0)
EFQDLISLNEQFGDVIGSGSQVDVIPWMK (0)
IFPKFRNALKVFDFLTNRLNNWMRLR (2)
TKEHRLTYKHGVIRDIVDSFIAESIDHPEQSALNDDVIMALTTDVFGA
GQDTMSTTMQWVFVYMMHFKECQRK
IHAELDSVIGPGELPHISDRRRLPYLEAVMHEIFRHSTFTSTTIPHVTTQDTVLDGHFIP 
KGILVFINQFGANHDPNHWVDPDKFIPERFLDGKGNLISRPHDRYLLFSTGARKCPG 
DELSRMLILHFMATMFALCEVSSDPQKPATL
DAVYNLSMRPKELRTIVRS
RNLPFLKNSVAQMSEADSHVLTVPGETTSFLTSRVESTVPDNQESQFSDNDFEKVDTKIP
KRKVFSRPTLTHDDINGNNVRKRGNLHQSAMYRIQLAT*

>paired_scaffold_142 77% to seq 25 CYP2 clan
222183 FRRYGPIYLIKLGTADVLILNNYDVIKEALIRQRGVFSGRPVFESFKKISQ 222031
222030 GRGIVFNSSLTQGAAWQRMKMTIVKHLHRFIASPQTKGFVAGHVQKETVQLVHILSEKCR 221851
221850 SSTNQAIEPVENINLAVANVVCSIMFGHRYQHGNK 221746
219363 LHRTREHRQSYKHGVIRDLVDSFIAESIDKPGQLLNDDVIMALTTDVFGAGQDT 219202
219201 MSTTLQWIFVYMMRFKECQKK 219139
218667 IHAELDSVLKPGSLPQIKDRARLPYLEAVMHEIFRHSTFTTTTIPHVTTEDTVLRGYHLPKET 218479
218478 LIFINQYAANHDPEHWVEPDKFIPERFLDEKGNLISRPHDRYLLFSTGSRKCPGDELSRM 218299
218298 LILYLMANIFTLCEISPDPNQPTTLDAVYTLSMRPKNVKTVVRVR 218164

>sequence 15, 63 N-term 43% to 2C9 C-term 46% to 1A1 WXXP to heme
MESVWVVIKWVKETMMSNSSFETIVAVATLLLLLMFVSENWNWLKIPGPI
PWPIIGNLGSLKGTKFLSIHEMYKIYGRIFRLKFGRVEAVVLCDVELIKE
ALLDRGRSLSGRPQFASYRLVSGCKSVVTNDPRCLREWVNY
KSTMVQTLCSISKNNEMKELMNERIGSVLVYMIQELEKGGDGQNFAEDIVTKTVANFLCT
VCYGGTYDFNSK (0)
EFNNLIEMSRHYTDNLSKSILRDMIPLAE (0)
ILPSVNKGRADFAKTSYHLHLWFLKR (2)
VEEVIQHFQPNKLNDLASVMVSDLTNDPTENISNITEKDRNSIAAIINDLVQG (?)
GYHSLYSMALWVVTYMIKYPEEVKKIENELNEVLDDYLPTLHDQESLPHTMAFINE (0)
VLRCRPSLPLAVPHSATEDTKLGGYDISKDTMVVASLYSANRDPKVWANPDQFDPSR
FLAKDDLGVTVLDETKVEQVFTFSLGDRKCPGEDIGRSFLFLTTAYLAHTCKLKPDPAK
PPTFQTKPGSITRPKDFGVQLNVKKCWLGVFKPDDNEE*

>sequence 51, 219, 223 1 accession 40% to 2J2
MYDVIIVCVISAIAVVAASWLKQNGSKKFPPGSRGLPFLGSVPFVLGSLERTLA
VWGWQNYGPVFFFTRGNTRTVVLNTYEAASEGFVKQGNH
LSGRFIDANHNELFADYSPQWQEQRKLRVKTLKGFGKVNTEEIMTSECKKMICRIRETKGSF
EAGYYLAVPTFNIVKSIIYGE
RFENKMNLIIYTFIFVAGEPSEGMNKQDEVNLSKVLADPVYKAGNGFMEFMAGMFPQIRS
SVAYERMLEKKGKERVRQVKE
KMTGSKKLIEGMLQNLQDWETGEVYNNQQLSCYLTDLFMAGTDTMATTVHWSLVFLAQNP
QVQEKMAAEVKSVAGNDVITTSMMDSMPYTRAVMHESSRLRPVFPLSLRHQATSDVTVKD
YVIPTGTSVVANLWAIQNDPKWWKQPSEFRPERHVTEDGGFTKSEKIVPFSVGPRYCLGS
QIAIYQQFIFLTNLVRGFRFRFDQAKVEPDLTGLMTSVLAPVNVHLVAEER*

>best matches to CYP11A1 paired_scaffold_94 71% to seq 51 in cyp2 clan
373693 GPVFFFTRGSCRTVVLNTYEAASEGFVKQGNHLSGRFDPKHKHKLFADYTEK 373848
373849 WQEQRKLRVMALKGLGKINLEEVISSECEKLMEKITS 373959
374504 RSSLTFERMVEKKVKERIDKVREQITGSQEFMDEMLQKIQDWENGECYNDKQLF 374665
374666 NYITDLFMAGTDTMATTVHWSLVFLAQNPEIQNKMAAEIIKVTGNDIINVSMMESMPYT 374842
374843 RAVMYESTRMRPVFPLSLGHQATSDVTVKDNVIPKGTLVVANLWAIQNDPKWWKNPESFR 375022
375023 PERHITENGGFTKNEKIVPFSIGPRFCLGSQMATYHQFIFLANLVRTFKFRFHENQPE 375196
375197 PDLTGVMTSVLAPVGIRMMATKR 375265

>sequence 57, 104
MFSTLLGSLALNYISTCLLGIFTLGVCMYYYWWKFPHPRYPPGVRGIPVLGALPF
LSKFAHKDITRWSREKYGPIMSVRAGQTDSIFLNDYESIYE (?)
AMIKQGPAFRSRPAMKLIT
AYSNGYGLGFSESHKKHSDLRQFSLKTLRGFGIGGRQLEDRISDIAQELVQAFEMLDGKATDVK
MIVGKTVGNVIASIVLGKKFHGDDKDFQRHINLIFDS (2?)
FGDEETAKYVLAMMYFPSLRHIPPFKNAWDQTLKGHFEQL (1)
DFCRKEIEELKKTLDVNDPRGYVDAFLIEMKKHSPEDSWFH (0)
EESLIVCVVDLFFAGTDTSTSTVMWAMVALLNFPEIQEKLHKEIANAT (1?)
QIYSGEALPSLDHRDELPLLNAFIQELYRHMTLGPFGVPHETTTDAYIGGYCIPKNTR (0) 
VITNIWAVHYDPNTWENPSEFNIYRHIDEDGKFVPSKKVIPFGIGCRSCLGEKLARLE
VFHFLANIIKRFEILPDPESKELPDYRDGVNGFVYVPYRFKLVAKPRIDNK*

>seq 64 cibd042f04_3 75% identical to seq 57 but not same seq GCiWno474_c04.b1
MLPAFLGSLTSNYVTFCLTGVCTLVLFVYYYWWKLPHPRYPPGVRGIPVLGALPFLGKFAHKDI
MRWSREKYGPIMSVRFGQKDSVVLNDYESIYEAMVKQGPAFRSRPTMKLITSYSNGYGFG
FAEGHTKYMELRHFTFKILRGFGIGGRYLEDRISDHAQELVQDFKKLDGKATDVR
MIVATTIGNVIASIVLGKKYHPKDKDFQRHLQLILNN 
FGNEETASYILCMMYFPSLRHIPPFKNAWKTFMDEHFEQL
GFCRKEIEEHKNNLNVNDPKDFVDAFLVEMKKHLPEGSWFH (0?)
EESLILCVIDLFFGGTETSTSTVMWAMVVLINFPQIQEKLHKEIVTAT
EALPNLDHRNELPLFQAFIQELYRCMTLAPLGVQHQTTKDADIGGYCIPKDTV 
VLTNIHAVHHDPKIWKNPSEFNIYRHIDQDGKFIPSRKMIPFGIGCRSCLG 687
EKLARSEVFLLLANIIKRFEILPDPESKELPDYKDGINGFLYVPYDYKLVDKPRTVNI*

>sequence 58, 70 45% to 2R1 ci0100144015 complete
MLTVSYVYVLALFLLCCLFRWRQRPKKFPPGPLGIPILGITPFVGSRMEKYLASYYAKYGGVMSFRLATKDWIILN
DIEAITQAYSKQGESFSGRPQSYLFDKLTEGCGIAFANGPRWQEQRRFVLTALKTLGMGKKGMEEIIKTE
NQHFISVIQNSGGKVHIL (0)
KEIRRLEANIISKVLIGKRFDYTDERLNSIIDCRFR
CTFTAFSSIYSTVSRIIQETGLQYSARNKQVMFIY (1)
IIRKLTNEHETPDEANPKDLTEIFLSEIEAKKRRILLE (1)
FLSNIFCTQKIQLYHLIRDIFVGGIDTTTATLGWGIICLLHYPECQMRIQEEIEDVI (1?)
GNSEPDMTHQKHMPYLRAFVQEVHRYCTIAPFSIPHCVTEDCVLFGYHIPKSTPVMSNIWRVHNDPKY
WKNPEKFSPERHLDLEGRFVPSNRVLPFAVGQRRCVGEQMAKVGLFLFWASFLQK
CEIHIDSDFGLPSWKKEPSGTVRSPQEFTVKVRNRLQNN*

>sequence 62, 83 8 accessions 42% to 2U1 ci0100151040 complete
MLPIICLSFFLIIFLCWKPLRKRSNYPPGCDGMPFLGCLPLLGVHPANTIMKWSKRYGDVFYVKYAMQNV
IMLNSFESIKEALVKQGHVFSG RQHTFITEFTSGNAGLGFLDYGAVWRSQRRFGLATLHEFGMGRRCMAP
KILEEADLMCDEMLKHGDHAFDCMDHITPAVTNIISILVFGKRFEYDDPTLLDTLHLLFDFGHDFPFALF
YFAPLVRFLPPWKAELEKIRQARKPVWDHIMSAIQQHQDTYNENDCRDFIDSFIAKMKSTNADGVKF
EQLFHYVKELFLAGTDTTATTLRWAILLIHYHTDVHEKIHEEIDREASNPVKFEDMPNMPYTQAVIQEVFRSR
PIANFGTMRKTTAVGKVGGYRIPKGSIVMPNIWAVHHNPQRWKNPHLFKPDRHIDQNGKFVKSEDIIPFN
VGQRSCLGMQLAKMEIFIFLVRMMQRFNFKLATDKEKLDLNGQYIFLHYPPAFDVSVQLRS*

>New seq 62 ortholog paired_scaffold_36 CYP2 clan
109093 FVLGCFILALLFWKPIRKPRNYPPGFDGVPLLGCLPFLGPQPAKTIMEWSKTYGDVFYVK 108914
108913 YATQNVVMINSLQAIKETMIKQGHIFSG 108830
107734 DFLTGRQHTFITKFTAGDAGLGFLDYGSVWRSQRRFGIITLHEFGVGRKCLGPQILEEAE 107555
107554 LMCAEMAKHGG 107522
106093 MITNAIEEHEATYNENSCRDFIDCFIAK 106010
105514 NDETASWWNKYQLFHYVKELFLGGTDTTATALRWALLLIHTNPEIAKKLKAEIDNNVT 105341
105340 GRVKLDDMINLPYTQAVIQEVFRYRPVVNFGTMRKTSAVGTATGYRIPKGTIVMPNIWSV 105161
105160 HHDPVRWKNPEVFRPERHLDENGKFKKSDAVIPFNVGQRSCLGRQLAQTEIFIFLVRMMQ 104981
104980 KFDFKIETPLEDLDMNGNHVFIHYPPDYKVSIQVR 104876

>sequence 66, 75 1 accession 41% to 1A1 33% to 2C9 63% to seq 57 
MFESFGYYFGESLTSNHVISGLIGVFTIVMCVYYYWWKFPHPRYPPGVRGVPIFGAIP
FFGRYVQETLAKWSRSKYGPVMSARFGQDDAVVLNDFESIHE (0?)
ALVKNIQHFNSRPSFYIVEQFTHGYGFGFADGHNKYLDVRNFSLSALRG (?)
LGIGRRTMETRVSEVAQDLVKILEDLDQKPTDLK (0?)
MMLGGTVANVLCSVVFGKTYDLSDPEYQYAVQCSFDC (2)
FGDPENSEYLNFMFFYPKLRYIQPFKRALKKFIDVHLGLIAFNQKEINLHKERLDVN
EPGDYIDAFLIEMKKHSPENSWFHETQLLHCLGDMFIAGTETTTNTILWALLALIHHPEI
QEKLYQELLDNIGEQVLPSTDHRDKIPLFRAFTQEVYRFKTIVPLALQHRANKDVEIGGY
VIPKGTKVFPNLHAVHHDPNIWKNPSEFNIYRHISKDGKFIPSKRVVAFGMGARSCLGEKLAITE
VFLFLANIIKRFEILPDPESKELPILKDGVNSLLYVPYRFKVVVKPRIINE*

>paired_scaffold_112 69% to seq 66 CYP2 clan
1462948 FSLAFEYLLSGSSIVSVLIGLLTFTWIVYQFWWKLPHPRYPSGVRGIPVLGALPYLGRFPHETIAKWSRDKYGP 1463169
1463170 VMSVRFGQEDAVVLNDYESITE 1463235
1463643 ALVKNINVFHTRPSFYVINCFTDGYGFGFADGHKKYLEVRNFSLSALRGYNTLK 1463804
        ISEFYQKEMEEHEKNLDENEPADFIDAFLIEMKKHSPDTSWFH 1466070
1466837 KQLLHCLGDLFNAGTETSTNVILWMLLALMHYPDIQEKLYQEIHDVIG 1466980
1467651 GEHVLPSVDHKDNLPLLRAFIQEVYRYQTIKPLSIKHRTSEDVEIGGYFIPKDTT 1467815
1468624 IFTNIDAVHKDEKTFKSPNEFNIYRHIVDGKFVSNKKVMPFGIGARSCLGEKLGKIEVF 1468800
1468801 LFLANIIKRFEILPDPQSKELPPFKDGISSFLYTPFAFKLVAKPRIVN 1468944

>paired_scaffold_401 72% TO SEQ 66 79% TO SEQ ABOVE
103021 MFSILFEWLNSWSGVVSGLIGLLTLCMVVYRYWWKLPHPRFPPGVRGIPILGATPFFGRY 102842
102841 PQETIAKWSREKYGPVMSVRFGQQDAVVLNDYESIHE 102732
102081 ALVKKINVFHTRPSCYIVECFTDGYGFGFADGHKKYLEVRNFSLSALRG 101935
101485 LGIGRHTMEMRVSGVAQDLVQRLEDCQGQPTDVKVTL 101375

>SEQUENCE 72, 86 ci0100136116 100% to seq 18 C-term 3 diffs to seq 72 100% to seq 105
MMEVLIGLNVSTLATFLVVFYAVYYWYKRPGSYPPGPRGYPLVGVLPRLGKYPERVMRKWSRKYGPVMSVRMGR
NDWVSLNDYSSIKE (0)
ALVKQGNVTSARPDFPAHQEVAHYHGIIFSSGDLWQNQRRFGITTLRE (2)
CGFGKRSLEERIVEEVSYLNDAIRATMGKPFEIM (0)
AKLGNAISNNICSVAMGRRYDYDNKRFKDLVAKVSWR (2)
SMGVVRIRFYNFFKCFFCLLPNGTKTSIERC (1)
LFREIIDEHRATFDKENIRDFIDAFLERMAGNDPHFTVTIYA (0) 
EFQLLVFLRDLFIAGTETTTSTLRWALLVLLHNPGVQDKLRNEIHKVLGF (0)
STTVSMAHRESMPYTRAFIEEIYRYRTLTPLDLFHETSGELNISGYKIPKNTV (0)
IIPNLWAVHNDPDVWDEPSKFKPERHLDEKGNFVQSKHVIP
FSVGPRHCLGEQLARMEIFIFLVSMVQKFEFLPDPNEPNLPEINHGVNATAFVAYPFNIVANQV

>SEQUENCE 157, 79, 215 60% TO 2C8 ci0100148852 
MASNIALELCKGLYT WTSVAVIAVSLFYIWWRRPHNFPPGPRGLPLIGVLPFLGRFPGEVFKKWSLEYGP
VMSIRMGRSDVVVV NTGDSFREAFVEQGANTSGRPPDIMFAQVTGDKGLVFKDYSANFKATKKWILRTLS
QYGLGRRGMEEKIQEEADCLAGYLRTKVGKPVDFKVLYCLTSNVVSLIMSGRRYDYEDEEHQKRLEFGDP
NDAPKLLLIDLFPALKYLPVFKTANDKLRSRMKAVLGFTRSNIEEHKKTFEKDDIRDFIDTFLAEMKFGN
ETSRDNFTVH ELNVLFRDFFLAGSETTSTNILWILLLLVHYPEHTAKVIKEIDTIIGPNASASSTTHSSQMPF
TCAFIQESMRYRTSVPLGAQHYCQKTFQLNGYTIPKGTLIYPNIWGVHNDPKSWPNPSKFDPYRHIDEKG
NYRNSSNVIPFLIGPRNCVGEALARQEIFIFLVKILQKFEVKPVPGKTPSIADGVNSVVFSPHSFNLILQ
ER*

>paired_scaffold_66 Length = 6019272 60% to seq 157 CYP2 clan
4762424 ILVTFAAVVYQYWWKRPNGFPPGPRGIPFLGVIPFLDNFPEKLLKKWSLEKYGAVMSARL 4762245
4762244 GTSDVVVLNTPESV 4762203
4761514 QVFTNQGSLTSGRPTENIFRQATGDKGIVLKDYSMKFKALKKWVIQSLSK
        LNTYHLKMRRHGFGRNGMEERITEEAVCFTEYLKCKVGKPVDIK 4761140
4760259 IEEHKKTFDENNLRDFVDNFIHEMKFRKETIRDYFT 4760152
4758908 EDELNLIIRDMFVAGSETTSNSLLWCFVLLMHHPKHKETIVKEIDEAMGK 4758759
4758714 IVGADGIPSSKLREKMPFTCAFIQESVRYITVVPMGSHHYCQEDIKINGYTIPKGTT 4758544
4758254 ILSNLWGVHNDPVTWPNPSQFDPHRHIDHQGKYRNSSKVIPFSIGPRSCLGEALARQEMF 4758075
4758074 IFLIAILQKYEINPVPGEPLPSLVGINSLTYTPANFKLILTER 4757946

>paired_scaffold_66 61% to seq 157 CYP2 clan
4768873 ILAIFAAVLYQYWWKRPNGFPPGPRGIPILGVIPFLDNFPEKLLKKWSLEKYGAVMSARL 4768694
4768693 GTSDVVVLNTSESV 4768652
4767607 RRHGFGRSGMEERITEEAVCFTEYLKCKVGKPVDIK 4767500
4767387 VCGRRYELDDQRQLKFLETMQM 4767322
4765841 EEELNAIIRDMILAGSETTSNTLLWCFVLLMHHSEHKETIVKEIDEAMGK 4765650
4765661 IVGADGIPSSKLREKMPFTCAFIQESVRYFTVVPMGSQHYCQENIKINEYTIPKGTT 4765473
4765169 ILSNLWGVHNDPVTWPNPSQFDPHRHIDDQGKYRNSRNVIPFSIGPRSCLGEALARQEMF 4764990
4764989 IFLIAILQKYEINPVPGEPLPSLVGINSLTYTPANFKLILTER 4764861

>paired_scaffold_66 61% to seq 157 CYP2 clan
4774403 NVSTYTLLSTILAIFAAVFYQYWWRRPNGFPPGPRGIPILGVIPFLDNFPEKLLKKWSL 4774227
4774226 EKYGAVMSARLGTSDVVVLNTSESV 4774152
4773195 LHRHGFGRSGMEERITEEAVCFTEYLKCKVGKPVDIK 4773085
4771431 EEELNAIIRDMIVAGSETTSNTLLWCFVLLMHHPEHKKTIVKEIDEAMGK 4771282
4771235 LGADGIPSSKLREKMPFTCAFIQESVRYFTVVPMGSQHYCQENIKIHGYTIPKGTT 4771068
4770784 ILSNLWGVHNDPVTWPNPSKFDPHRHIDDQGKYRNSSKVIPFSIGPRSCLGEALARQEMF 4770605
4770604 IFLIAILQKYEINPVPGKPLPSLVGINSLTYTPANFKLILTER 4770476

>paired_scaffold_142 59% to seq 157 CYP2 clan
120866 NVSTFTLLSTTLVIFAAVLYQYWWRRPNGFPPGPRGIPILGIIPFLDKFPEKLLKKWSL 120690
120689 EKYGAVMSARLGTSDVVVLNTSESVVEVGFIQKGN 120585
120034 KAFTNQGSLTSGRPIENIFRQATGDKGIVFKDYSMKFKALKKWVIQSL 119891
119771 LRRHGFGRNGMEERITEEAVCFTEYLKCKVGKPVDIK 119661
118807 DFARDHIEKHKKTFEENNLRDFVDNFIHEMKFGKETMKDYFT 118682
117520 EDELKSIIRDMIVAGSETTSNSLLWCFVILLHHPEHKKTLVKEIDEAMGKTLLPR 117356
117324 VGADGILSSILREKMPFTCAFIQESVRYCTVVPMGSQHYCQEDIKINDYTIPKGTT 117157
116815 IISNLWGVHNDPVTWPNPSHFDPHRHIDDKGKYRNSRNVIPFSIGPRSCLGEALARQEMF 116636
116635 IFLIAILQKYEINPVPGKPLPSLVGINSLTYTPANFKLILTER 116507

>paired_scaffold_433 63% to seq 157 CYP2 clan
107607 WTILVIFSALLYQYWWKRPNGFPPGPRGIPILGIIPFLDHFPEKLFKKWSLEKYGAVISA 107428
107427 RLGTSDVVVLNTSESV 107380
107011 LRRHGFGRKGMEERITEEALCFTEYLKCKVGKPVDIK 106901
105448 KTMISDIFCNIIDFARGHVAEHKKSFDENCLRDFIDNYIHEIKFGKENMRNQFT 105287
104926 EDELATIIQDMMFAGSESTSNSLLWCFVILLHHPEHAKTLVKERDMAMGK 104777
104726 GANGIPSSILREKMPFTCAFIQESMRYCTVVPMGSQHYCQEDIKINEYTISKGTTVWL 104553
102869 ILSNLWGVHHDPVTWPNPSEFNPNRHIDDQGKYRNSSKVIPFTIGPRSCLGEALAKQEIF 102690
102689 IFLTAILQKYEINPMPGEPLPSLVGVNSLTCTPANFKVIINER 102561

>paired_scaffold_64 61% to seq 157 CYP2 clan
256138 MLDIIKGVVPTCMTLWTVFVIFLAVLYQYWWRRPTGFPPGPRGIPVLGVMPFLDKFPEKI 256317
256318 FKNWSLEKYGPVMSVRLGTSDVVILNTADSITE 256416
257039 GFAKKGMEDRITEEADCFTEYLKGKVGKPVDLK 257137
258447 IKEHRETFDENYLRDFIDTFINEMNFETENVRDYFTVKDLIKQAIEL 258587
258960 DDELVEVIRDFLIAGSETTSNSLLWCFLILIQHPEHAKTIAKEIDAAVGK 
259125 LNKGTNKTPSTMLREKMPFTCAFIQESMRYRTNVPMGSPHYCQKDTKIN 259304
259305 GYSIPEGTT 259331
259697 VFPNLWGVHNDPVTWPNPSQFDPYRHIDDKGKYRNSSKVIPFSIGPRSCLGEVLAMQEMF 259876
259877 VFLTAILQKYEINPVPGEPLPSVVGINSLTCTPANFKLILKER 260005

>ci0100144919 96% to seq 84 almost same as seq 222 N-term complete
MIFEIVITILGAVLLHWLFIALFNPWRYPPGPAGLPLIGSLHLAAPEFHIKLPKLAKKYGPIFTIKAGCRRVVFLV
GYDVIKEVITDRAKDFASRCPTIVARIVRGEGLDG IAASPYGPKWMANRKFFYSAMRTMGLGKRGIEKCVVDE
IPYIVGELENICSNDGLFEPSSVFDSAVLNVLAYFTFGNRYSYQDEKFKELIHINNDFFQKSKFLNQPQF
FLLTLIPGLDKYWLPKCGKDIKESFG KINKFVKAEIKQHRQNLDPENPRDYIDCYLNELNQMNDQSELSE
LGLEMSIMDLFQAGTETTSTTLRWAILYLANNPHIQEKVQQEIDEVLGFDQLPQYEDRMRMPYCEATVLE
VQRMATIAPI GLQHCSEDDQVLGGYRVPKRTSIVACYHSIHFDPKFWKNPNKFDPCNFLDSNGKVCIPDA
FMPFGGGLRICVGMNIAKQELFLFFVAILQKFSLHLPDDMDHLDEKPGPGIILAPKIYRVQIKRR

>SEQUENCE 17, 111 first intron is over 3000 bp
MIFEIVITILAAVLLHWLVNFLFNPWRYPPGPAGIPVIGSL
HLAAPDFHRSILIGFQISFAISVAFLQKQINRLR (1) 
NIVFSELPKLARKYGPIFTITAGYRRVVFLVGYDLIKE (?)
VITDRAKDFASRCPNMPGRIVRGEGLD
GIAAAPYGPKWMANRKFFYSAMRTMGLGKRGIEKCVVDEIPYIVEELEKLCTKNELFE
PSSVFDSAVLNVLAYFTFGNR (?)
YSYQDEKFKELIHINNDFFQKAKFLNQPLFFLLSLVPGLHKYWLPQCGKDLKESVG (?)
KINKFVKAEIEQHRQNFDRKNPRDYIDCYLKELDQMNDQSELS (?)
MSIMDLFQAGTETTSTTLRWAILYMVNNPNIQSK
VQQEIDDVLGFDRLPQYEDRMRMPYCEATVLEVQRMATIAPIGLHCSEEDQ (?)
MLGGYRIPKRTSIMACYHSIHFDPKFWKNPNEFDPCNFLDSNKKVCIPDAFMPFGGG (?)
LRICVGMNIAKQELFLFFVAILQKFSLHLPDGVDHLDEESGSGITLAPKIYQVKIKRRA*

>paired_scaffold_103 74% to seq 17 CYP2 clan
955109 YTNLNLYICSKIHEFVKTEIKEHKERLDVKNPRDYIDCYLAELHFSKDNT 955258
956555 FTIFCVSGLEMSIMDLFQAGTETTSTTLSWAVLYMVNNPEIQ 956680
956745 KVQQEIDNILGLDQVPKYEDRLKMPYCEATILEVQRFATIAPIGIANVIYLKNSNLLCNSMSL 956933
957384 GLIHCSEEDQKVAGYEIPKGTSIMPCYHSIHFDPKFWKNPNQFDPTNFLENDGKVGTRTAFMP 957572
957573 FGGGNLLNLN*IKLKIFLGL 957632
959152 GLRICVGMNVAKQELFLFFVAMFQKFSMNVPDGKDFIDDAPLHGVTLAPKPFSVRIKRR 959328

>ci0100144128 90% to seq 222 scaffold 38 complete
MLYGIVLTIIFVLCLPICHWIYVLLSKPRNYPPGPLGIPLFGSFHYAAPAIHI (1) 
NLPKLAKKYGPIFQLRAFNQAIVCVVGYDLIKE
VMINKSKDFASRYASVPGKLVRGEGMD
GIAVAPYGPKWMANRKFFYSAMRTMGLG
KRGIEKCVIDEVPYIVEELEKLCSSDELFEPSSVFDPAVLNVLAYFTFGNRYSYQDEKFKELIHINNEFF
QKAKFLNQPEFFLVTLIPGLHKYWLPQCGKDLKESVGKIHKFVKAEIKHHRQHLDPQNPRDYIDCYLNEL
NQTSDQSELSELGLEMSIMDLFQAGTETTATTLRWAVLYMVNNPHIQENVQSEIDNVLGFDQLPKYEDRT
RMPYCEATVLEVQRMASITPFGLLHCSEEDQKLGGYHIPKHTCIMPSYHTIHFDPKFWKNPNEFDPCNFI
DPNGKVGTRDAFMPFGGGLRICVGMNIAKQELFLFFVAILQKFTLQLPDDVKSLDDEPIPGMTLAPKPFR
VKIKIRS*

>SEQUENCE 91 WXXP TO END 39% TO 2R1 56% to seq 36
487 WPQPDKFDPHRHLDSAKKFISSSEIVPFSVGARY
CMGRETAQTEIFVFVVALLQKFKITRDPTLKWPIHMEDGIQSALCCPSRYKIILN*  216

>SEQUENCE 94, 189, 97, 96, 137, 191 41% to 26B1 fugu ci0100143924 revised
MSFSVVSAYQKVAPMVVTYDLLRMPATTGRNTTTLWQHEFLSTM (0)
ENATEGWETELEVETVAEATFQQGANGEHEYSGSDQFFEQLNGTLQSLFDSSAVAASPLMA
FILLGLAAILAVCILVHPLKAWWLNYNMKSAVDPDSKLNHLPIPPGSFGLPFIGETLAWITQ (0)
GPKFNSNRRKKYGNVFTTNAISLPIVKVSGHEYVKE (0)
ILTGEHDKVTTIWPYTVRTILGSHGIVNSIGDIHKYKRKVAFKAFTRAALNDYVPIMRNHATRI (0)
VRQMQESDQPLVYPNMLRLTFDVAVNALLGLEISDQVELDMLFKTFHQLVSNVFCLPYNVPGFGFNK (0)
GMKARNQLLDLLQIHIEAKKQA (0)
KLREESNLDNNNTKSDADNEFELGDLEIKEIAIELMFAGYYTSASALTSAILELARHPDVFSKL
ENELLQHGILREDSSDEEHMPELNLQNIHKLTYLDQVLKETLRIRPPVLGAYRRAKKTFQIG (0)
DYRIPKGWTVIYNIRDTHELEFEHMTEFDPEHFAPDANDKKFR (2)
FIPFGGGPRVCIGQEYARIIMKVSLIEMIRCNSSWKLANKTLPKMVA
IPTLHPKDGLPVHLQPRKTVGVVTSRSEPVATDSS*

>paired_scaffold_422 ortholog og seq 94 66%
951017 FLLLQENVTEGWKSALSGNETDSV-LGRAENLEEIHSGANQSAELLNGTILVQDGGPAPS 950841
950840 ASPVMAIFLFVVTAFLVACVLWRPLNAWWVHYNMKKAFDPDSNLNHLPIPPGSFGLPFIG 950661
950660 ETIAWITQ 950637
       GANFNSSRRKKYGNVFTTSAISLPMVKVSGHEY 948365
948164 VLTGEHDLVTTIWPYTVRAILGMNGIVNSTGAIHKFKRKVAFKAFTRAMLNQYIPF 947997
947996 MRHHSAKI 947973
946422 VQQMQKSERPLVYPNMLRLTFDVGVNALLGLEITDQAKSDKLFQTFNQLVSNVFCIPHNI 946243
946242 PGLGFNK 946222
944534 KLRSEEQIDNNNTNNNQKPVAQDDDFELSDLAIKEMAIELMFAGYYTSASALTSTTLELA 944355
944354 RHPEVFAKVEKELIDHGILREERSSADAETPDEEERVIDLQALHKLTYLDQVFKETLRIR 944175
944174 PPVLGAYRRARKTFQIG 944124
943003 DYRIPKGWTVIYNIRDTHEMEFKHMSEFNPENFAPTGNETKYR 942875
942168 YIPFGGGPRMCIGQDYARTISKLAIIEMIRCNSGWKLANEKLPKMVSIPTLHPKDGLPVI 941989
941988 LEPRDRVGVVTSSSKRASCDNS 941923

>SEQUENCE 102, 53, 150 SIMILAR TO SEQ 8 N-TERM 39% TO 2B6
MIDIFQTICASTGPYPFTVLVICVVWVFRWWRRPHDLPPGPRGFPLLGVIPFMGEFPER
VIKKWSLENYGPIMCARMGMQDSVFLNTADAIKM (?)
AFCDNSEYLSGRPTTTIFKQVTGDKGIGLKQYGENYKVIRKFIVRTLSK (?)
YGVGKTMIENQIVEEAGLLAEFLKTKTNQLVDL (?)
FYLYVLTSNIVSRVVCGRRYDFEDEKYHQVLINLQKM (?)
LGDAKDAPFTLLIDFFPGLKHLPWFRGANNRMIERIKNWL (?)
EHSKDHVEEHKKTFDKNNLRDLIDSFLHEQAYGKTENKN (?)
EEELIVIVRDLFSAGSETSSNSILWILIVLLHHPHHAKKCIQEIDDVMG
PNGAPSSQHREKMPFTCAVIQETFRYRTVAPIGLQHYAQATVELGGLRIPQGQG
VYSNIWGVHNDPVAWPNPS
EFDPYRHINKDGKFVLSNNIMTFSIGPRSCPGEALAKLEIFMFVTKILQKF
NVRASPNNPPSLDGVNCLQFTPLSCEVILTER*

>paired_scaffold_323 69% to seq 102 CYP2 clan
55883 ITLFVVLVTVGFYWWKRPSGFPPGPRGIPFLGVISFMGKYPERTIKKWSLEKYGTIMCAR 55704
55703 MGSADCIFLNTAEVIKK
54828 GDKNISMQAFTGLGNHVSGRPAHTIFKQVTGERGIGLKQYGEDYKIMRNF 54679
51553 LNQVEKQ*LLIDKLLYMYSIYSSIYTQKEELVVIIRDLVSAGSETTSNTIAWILIALMHN 51374
51373 QDHLRKVVKEIDQVMGKYKL 51314
50896 KEPNSRHKIKMPFTCAVIQETMRYRTVTPYGIQHCVTSTVELGGYIIPKGTL 50741
50585 VYSNIWGAHNDPVVWPNPSKFDPYRHIDKAGNFISSNNVMPFSVGNRNCIGEQLAKQEIF 50406
50405 IFITKILQKFELSVPPGEPLPSLEGVNCLQFIPLSCELLLTER 50277

>paired_scaffold_64 67% to seq 102
504384 ITLTVALITAGYYWWKRPRGFPPGPRGIPVLGVIPWLGAYPERVIKKWSSEKYGRVMCAR 504205
504204 MGSEDRVFLNTAESIKK 504154
503558 AFVGNGKHISGRPTTTLFKQVTGENGIGLKQYGEDYKTFRS 503436
500958 DEELVVVLRDLLSAGSETSSNTITWMLILLLHNPKHMQRVVNEIDDVMGR 
500778 NGVPGSCHKDKMPFTCAVIQETMRCKTVVPTAVQHCATATLEVE 500599
500598 GYIIPKGTL 500572
500315 VYANIWGAHNDPVAWPDPSRFDPYRHIDKDGNCILSNNIIPFSVGLRSCPGEQLARQEIF 500136
500135 IFVTKILQKFKLSAPENKPLPSLEGVNCLQFTPLSCELVLTER 500007

>SEQUENCE 103 93% TO SEQ 102 (3 DIFFS) SIMILAR TO SEQ 8 N-TERM 37% TO 2R1
MFENMVYTLQYIFASIGPYPFTVLAICVAMLVRWWRRPHGLPLGPRGIPLLGVIPFMGE
FPERVIKKWSLENYGPIMCARMGMEDGVFLNTAEAIKM 
AFCDKSEHLSGRPRTTIFKQVTKDKGLALKQYGKDYNIIRKFVVRTLSQ
WGVGKTMIEDQIVEEAGLLADFLKNKTNQPIELK
LYFYVMTSNIIGRVVCGRRHDFEDEEYHEILN
NIQKVLGEADDAXLTLLIDFFPGLKHLPWFRGANNRMIERVKKTLEYCKGVIEEH
KKTFDKNNLRDLIDALLYEQQFGKKSKRKVLL
EDELVVIVRDLFTAGS
GETSSLMISWVLIMLLHHPEHAKKVIEEIDDVMGPNGAPSTKHRDRMPFTCAVIQETFRC
KIVAPIGIQHYAQATVELGGYIIPQGTMVYSNIWGVHNDPVAWPNPSKFDPYRHISKDGKFF
TSNNIISFSTGPRSCPGEPLAKMEIFLFVTKILQKFEVKAPPDKLPSLVGVNCLLFSPES
CEVILTER

>paired_scaffold_323 72% to seq 103 CYP2 clan
38065 ITLFVVLVTVGFYWWKRPPGFPPGPRGIPFLGVIPFMGKYPERTIKKWSLEKYGTIMCAR 38244
38245 MGSADCVFLNTAEVIKK 38295
39034 QAFTGLGNHVSGRPAHTIFKQVTGEKGIALKQYGEDYKIMRNF 39162
43713 EELVVIIRDLVSAGSETTSNTIAWLLIALMHNQDHLRKVVKEIDQVMGKYKLVYPDN 43883
44245 KEPNSRHKVKMPFTCAVIQETMRYRTITPYGIQHCVTSTVELGGYIIPKGTL 44400
44558 VYSNIWGAHNDPVVWPNPSKFDPYRHIDKAGNFIPSNNVIPFSVGNRNCIGEQLAKQEIF 44737
44738 IFVTKILQKFELTVPPGKPLPSLEGVNCLQFISLSCELMLTER 44866

>paired_scaffold_64 69% to seq 103 CYP2 clan
505738 LTVAVITAGYYWWKRPRGFPPGPRGIPVLGVIPWLGAYPERVIKKWSSEKYGRVMCARMG 505917
505918 SEDRVFLNTAESIKK 505962
506331 AFVGNGKNISGRPITTLFKQVTGENGIGLKQYGEDYKTFRS 506453
508713 DEELVVVLRDLLSAGSETSSNTITWMLILLLHNPKHMQRVVNEIDDVIGRCNLL 
508893 NGAPSTRHKGKMPFTCAVIQETMRCKTVAPIAIQHCATATLEVEGY 509072
509073 IIPKKTL 509093
509550 VYANIWGAHNDPVAWPDPSRFDPYRHIDKDGNCILSNNIIPFSVGLRSCPGEQLARQEIF 509729
509730 IFVTKILQKFKLSVPENKPLPSLKGVNCLQFTPLSCEVVLTER 509858

>best savignyi hit to CYP24 53% to seq 110 possible processed pseudogene with 
MFQQLWEPLRLILYNVAVLAILAVLFKFFKRISVICKIRWNLRTEWKGPPCHWFWGHLGQVT
KGNKDPANFIPYFTKNEKLFPFGFAIYAGPLRTILTIHHPNLVKQVLNASTFDAPKPKEAYGGLLLKWL
GLGLLSLNNREWFRHRKLLTSAFHHDILKPYVKISNSCTDILLDTWQKKVKQQPTGFSIEIFEQA
SLLTLDIALQCLMSYESKCQERLDNDYISSIKNLSFLYVRRNRSKNLLLKYFPFLYKWS
QEGKLYYKECDSVHKFSTSLINRRKQELGNPDIAPKREFLDFLDMLLQARDSNGLGLTTS
EICAEVDTFLFGSHDTTASGISWFLYCISQNPEVQNKCF*EITQVLGDRENVRWSDLSSL
PYLTRCIKETLRLYPPAPYISRTVNKDIYAEGKTILKGTTVLLSIIGVQRSVLFWKYPQK
FNPDRFTPESASMQNNHAFIPFSAGPRNCIGQHFAMNEMRVVLSKMLRTFRFSIDPNHKI
VPCHEIIYRSRGGIHLIVEPRTSVSI* 

>paired_scaffold_366 best savignyi hit to CYP24 CYP4 clan
       MFQQLWEPLRLILYNVAVLAILAVLFKFFKRISVICKIRWNLRTEWKGPPCHWFWGHLGQVTKGNKDPAN
451729 FIPYFTKNEKLFPFGFAIYAGPLRTILTIHHPNLVKQVLNASTFDAPKPKEAYGGLLLKW 451908
451909 LGLGLLSLNNREWFRHRKLLTSAFHHDILKPYVKISNSCTDILLDTWQKKVKQQPTGFSI 452088
452089 EIFEQASLLTLDIALQCLMSYESKCQERLDNDYISSIKNLSFLYVRRNRSKNLLLKYFP 452265
452266 FLYKWSQEGKLYYKECDSVHKFSTSLINRRKQELGNPDIAPKREFL 452403
452404 DFLDMLLQARDSNGLGLTTSEICAEVDTFLFGSHDTTASAISWFLYCI 452547
452548 SQNPEVQNKCFQEITQVLGDRKNVRWSDLSSLPYLTRCIKETLRLYPPAPYISRIVNKD 452724
452725 IYAEGKTILKGTTVLLSIIGVQRSVLFWKHPQKFNPDRFTPESASMQNNHAFIPFSAGPR 452904
452905 NCIGQHFAMNEMRVVLSKMLRTFRFSIDP 452991
       NHKIVPCHEIIYRSRGGIHLIVEPRTSVSI* 
 
>SEQUENCE 110, 121, 122 139 67% TO 4F3 
MLFMFIKYAVYASFSLLALSILHSLFR (2)
SIQEFYTVTKYLKSNWPEPESHWLWGSFGAV (0)
MLKMSADPSYYFTYMNDCMKKYPYGFRTQLGPFIKSLNTYHPTIAKAIL (1) 
NVPKLKRAYKLLFEWL (1) 
GHGLLVLEDKVWLRHRRLLTPSFHFEVLKPYVKTMNESAHVMV (0) 
ENWLSKTSDNKVAKVEIFHYA SLMTLDTTLRCLMSYQSNCQDEN (2) 
STNDYVAKIYELSETIVRRQRNLNILNRIDPIYNVT (1) 
KEGRKYLKLCDDVHKFSESVINRRKCDSEQPAHNETRKYFDFLDTLLKAR (0) 
DSDGKGLSDSEIRAEVDTFMFEGHDTTASGISWTFYCLAMHPEHQEKCFQEIEKVMADRTDIEW (2) 
NDLS NLPHLTLCIKESLRQYPPVPIIFRKLNKDIEVDGKTIVK (1) 
DT NVVLHIYALHHHEEFWKDPHIFDPSRFTQENMKSMNSYAYVPFSAGPR (2) 
NCIGQRFAMNEIKIAVAQVLSKFQLKPDLSKKIQHSVDVIYRATTGLYLKAERRV * 

>paired_scaffold_36 Cs110 63% identical to seq 110 boundaries need checks
MLAYIPYSLCAALVVLAGSFLPSIFRS (2)
IYEQYRARKYLKRNWPGPKGHWFWGYAGEL (0)
MRKIRKDPGYYFSYIRGITKTYPYGFSSNLTPLFASLNIYHPNAVKAVL
DAPKLKRAYRSLFEWL
189378 GHGLLVLEDKEWFRHRRLLTPSFHFEILKPYIKTMNECSITMV (0?) 189503
ESWVKKTSEKAQIADDMDITPSISRMTLDTILKCLMSYESNCQDEX
STNSYVSKIYEISASVAHRMRDVNIFKRINAIFYAT (2)
       AHGKAFLKLCEAVHEFSDSVINCRKAELKQPLVKQNRKYLDFLDTLLKAR
196424 DADGKGLSDREIRAEVDTFMFEGHDTTASGISWILYCLAENADHQEMCFREIQEQMGDRTDIEW 196615
197887 DDLAKLPCLTMCIKESMRLFPPVPIIFRKLNKDIEVDGKLIAK 198015
198732 GTDIVLHIYALHHHEMFWKDPEKFDPNRFLPKNSSNVEGYTYVPFSAGPR 198881
199251 NCIGQRFAMNELKIAVAQVLRKLRISPSSDRKIQHAVDVVYKTKNGIYLRFEQRNCLTSSP* 199436

>paired_scaffold_445a ortholog to seq 110 two different genes
516075 MQFITFLWENSKFFVFDRSFLETCLVILALYLFMHYGMVLRRLHNNAKHLDEVAGVDKRNKHWFYGHMKH 515866
515692 YINNETGLLRLIQRFKTFPLMISHWIGPVMTILLVYHPDSVHAVLTSG 515549
515205 APKNDLWYRFIRPWIGDGLLTSKGTKWFRNRRLLTSAFHYKVLKP 515071
514280 YLLAVKEVGNLLMERSRQPLHFFDWIYNLSANGRRNKVVVKVLHDHTEK 514134
514657 YSAIFNKNSLVMVDKFKKLCGQSIEVHTHVGLMTLDTMLQCAMSTNTNCQHVT (2) 
514289 DHPYLLAVKEVGNLLMERSRQPLHFFDWIYNLSANGRRNKVVVKVLHDHTEK (0) 514134
512620 IIAERKEVLAKAVNHDENYSEMKGFQKIKGKALDFLDILLQSK 512492
511792 DEDGKGLTDIEIRDEVDTFLFEGHDTTSS 511706
511348 GIAWALYHLAKYPEFQSKCREE*TEVLGDRKYVEW 511244
510498 NDLPKLEYLTKFIKESLRLKPPVYIMGRQLEEPLKIIEKCSERVSTIPAGTNVSTSIFMLHRHEN 510304
510303 FWENPT 510286
       VFNPERFTKENIAKRPAFAYVPFSAGS 509810
509704 RNCIGQNFAMNEMKICLAQVIRKLRLYIDGDSPVPKMLPRIILQSESGIFIKFEKI 509537

>paired_scaffold_445b ortholog to seq 110 two different genes
500316 MQFITFLWETSKSFVFGRSFIETCLVILALYLSMHYGMVLRRLHSNAKHLDEVAGVDKRTKHWFYGHMKH 500107
499931 YTNNETGLLRLIQRFKTFPLMISHWIGPVMTILQVYHPDSVHAVLTSG 499788
499452 APKNDLGYRFIRPWIGDGLLTSKGSKWFRNRRLLTPAFHYKVLKP
498498 YLMAVKEVGDLLMERSRQPFHFFDWMYNLSANGIRNKVVVKVLHDHTEKVSN 
498893 YSEIFNKNSLVMVDKFKKVCGQSVEVHTHVGLMTLDTMLQCAMSTNTNCQHVT (2) 498746
498507 NHPYLMAVKEVGDLLMERSRQPFHFFDWMYNLSANGIRNKVVVKVLHDHTEK (0) 498352
       IIAERKEVLAKAVNDEEKDSEMEGFQTIKGKVLDFLDILLQSK 497186
496826 DEDGKGLTDIEIRDEVDTFLFEGHDTTSS 496740
496527 GIAWALYHLAKYPEFQTKCRDELTEVLGDRKYVEW 496423
495663 NDLPKLEYLTKFIKESLRLKPPVYIMGRQLEEPLKITEKSSGRVSTIPVGTNVSTSIFMLHRHEN 495469
495468 FWDNPT 495451
495221 VFDPERFTKENIAKRPAFAYVPFSAGS 495141
494904 RNCIGQNFAMNEMKICLAQVIRNLRLYIDGDSPVPKMLPRIILQSESGIFIRLEKI 494737

>SEQUENCE 118, 140 N-TERM TO EXXR 381 AA 35% TO 3A5 no introns
MYIPSFLSVETLVLLITSLVLVRIYIHKKWQFLKNINIPHIQPTILQLGTVRPFIGNSD
WMMNSHEYCKRKLGKIYGAYLFLTPQITVADPDIIKQMYIKEFATFPTRQTSLTKINGK
VMNTTILMTEGEQWKRIRNTLTPSFSTAKLKEMLGIMEDCCNHAINILIKTTQNDQ
GKFEAKSVFGKLALDITCSAAFSVDTHPQELKDGQEPKIIKMVKQAFGFEISR
SPLVMLCFMFPAFEEILCKLDYSIIPKDTVNYFANLTKSLIDQRKHSTVKPRVDLMQLMLNDQI
SVEDVAKGKSKGLTQLEITSNSVLMVLAGYDTSANALTLLAYNLATHKHVHKKVHEEIDQ 
MLEKYGSLTYEALTSMKYLGMCLNETLRLYPPIAVNSRIPNRDITINEVFLPKGIQVIVP
VYGMSRDEEIWEEPLKFKPERMEDMRSVDPMIFQPFGGGPRGCIGMRFAVLEIKLAIAKIL
QDFDLDVCENTPKPPVQLEFKSTTLKPKGDIYLRVTPKSN*

>paired_scaffold_186 63% to seq 118 no introns complete
625691 MWFENVLCIETLILIVTLLILLRIYIYKKWQYFKKINIPHPEPSLWELGNVKEFLINPRWLLDS 625882
625883 YAVWKKKFGKNYGTYTFLTPQITISDPEIIKQIYIKEFSTFPDRQRTLRTVNGKVMNTTL 626062
626063 LVSDGERWKRIRNTLTPTFSAAKLKEMLEIMENCCERMVQRLKIASHENDGRFDTKLVFG 626242
626243 TAALDIICSSAFSVDTHPQGLKDGEESTIIKMAKKAFGFNLIGSPLIILCLFFPPFEKVL 626422
626423 SFFNYSIMPKDTVNYFSRLTESVINQRKVSKEKPRVDLMQLMVDAEISEKDACKENAKGL 626602
626603 TRMEITSNSVLMVLAGYETSANLMCFLAYNLAVYKEAQRMVQEEISHTLEEYGSFTYEAL 626782
626783 NSMKYLTSCIQETLRLYPPIAINSRFPIKDITINGIFFQKGIQVNIPVYGLSRDDEIWDE 626962
626963 PLNFKPERMSDMSKINSMVFQPFGGGPRGCIGMRFALLEIKLVFCKFLQQFNIDVCEDTP 627142
627143 EPPLAMLFRSTTLKPKENIFLKLSRREY* 627229

>SEQUENCE 112 15 exons 54% TO 3A5 note 112 identical to seq 119 up to VDSQ
MITPLDILSLETWILILTSFILIR (2)
LYIHKKWQVLKNLNIPHDQPTILGIGNLMDFIKNPEF (0)
LFMGYFDLKKKYGPIYG (2)
AYNWLTPTITITDPEILKKIFIKEFSTFPDRQ (0)
IALLDVNGKEMNSALTVVSGRKWKRIR (2)
NTLSPTFSSSKLKEMFGIVEDCADSFVQNIGTINSDVDGRFDVSL (2)
AFGRLTLDGICSSAFGVKVDSQ (0)
AGNNGEESQVAKMARELLAFEFFRNPMLYVF (1)
LIFPWTEKIAEKFDYTIFPRKYVRYFARLIDSVVESKKEKKQ (0)
RTDILQTMIDSQITEEDVKNGAVK (1)
GVTKTEMKGNSIIMLLAGYETTSNAMVFLAYNLAVYKDAQHKCREEIEQVIAEH (0)
GGLTYEAVQDLKYMTQCLNESMRLYSLVPA (2)
NSRYCERDITINGVTIPKGTLVNIPVFGMGRDEEFWNEPLTFNPDR (2)
MLDMNEIDPMIFQPFGAGPRNCIGMRFALLEIKITFAKLLQKFHLDVCEDTP (0)
APPLKVGFKTSMKPKETLYLKVKALET*

>paired_scaffold_126 69% to seq 112 Length = 13092 runs off end
11811 SKPSGQEPKIIQMAKGLSNFEFTKNPLLLVF
11661 VIFPWAEWIAEKVNYSLFPAKYLNYFYKLVDTVIEDKKSQ XXv 11542
10948 GMTRVEIKGNSLIFIFGGYETTSNAMLFLAYNLALHKDAQHHVREEVQTALHKH 10787
10465 GGLTYEAIQDLKYMTQCLNESLRLYPLTP X 10379
10130 NSRYCEQDITINGITIPRGTTVLTPMFGMMRDEDIWEDPLIFNPDR 9993
9601 MEDMSKIDPMIYQPFGAGPRNCIGMRFALLEIKITFAKLLLKFDLDVCEDTP 9446
9000 X PPLAVTFKTSMRPKETLYLKVVGREX 8926

>revised seq 232 probable pseudogene of seq 112 ci0100132937 scaf 41
GELTYETVQRLKHLTQCLNESLRL YNNVPV
NLRYCERDITINGVTIPKGTSIDIPVYGMGRDEEFWEDPLVFKLES
MLDMNEIDPMIFQPFGAGPRNCIGMRFALLEIKITFAKLLQKFYLDVCEDTP
ASPLDVTFKMSMKPKDQIFLKVTKINE

>paired_scaffold_181 N-term frag of a CYP3 seq
220990 ILILTSYLLLQLPIHRVW 220937

>paired_scaffold_238 similar to seq 112
2498 MFNGYFISKKKFGPIYG 2448
2167 SYQFLSPTITIADPEILKQVFIKEFSNFPDRQ 2072
1469 XXLLKANGREMNTGLTAVTGDQWKRIR 1395
1345 NTLSPTFSSSKLKQMSGILEECADTFVRNIGTINSDENGRFDASMX 1211
1009 FGKLSLDAICSSAFGVKVDSQ 947

>paired_scaffold_238 similar to seq 112
8147 SKPNGKEPKVAEMAQKMFNHPIVRDPMFL 8061
7788 LFPSVLFPSFEHVLAKFDYSIFPRKYTNYFSKLTDSVMEDKKSKKQ 7651
6018 GGITYEAAQELKYLTQCINESMRL
5688 YKVVPRNSRYCEEDITINGITIPRGATIDVPVYGMGRDGDIWEDPLVFNPDR 5533
5223 MEDMSKIDPMIYQPFGAGPRNCIGMRFALLEIKITFAKLLLKFDLDVCD 5077
4793 XXXXPPLEMTFKSSMKPKDQVLLKV 4731

>paired_scaffold_112 CYP3a like seq about 70% to seq 112 and seq 119
2172740 MITPLDILSLETWILIITSLILIR 2172669
2172005 LYIHKKWQVLSNLNIPHDPPTLMGMGNLMPLMKNPDL 2171895
Missing exon 3
2171412 SYQFLTPTITITDTEIIKQVFIKEFSNFPDRQ 2171317
Missing exon 5
2170987 NTLSPTFSSSKLKQMSGILEECADTFVRNIGTINSDENGRFDASM 2170853
2170622 VFGKLSLDSICSSALGVKVDSQ 2170557
2170291 SKPSGLEPKIVQMAKGLSNFEFTKNPLLLIF 2170199
2170137 VLFPWTEWIAEKFNYSLFPAKYLNYFYKLIDAVIEEKKSQXX 2170018
Missing exon 10
2168806 GMTRVEIKGNSLIFIFGGYENTSNAMLFLAYNLALNKDAQQKAREEAQAAIDKH 2168645
2168205 GGLTDEAIQSLKYMTQCLNESIRLYPLTPX 2168119
2167507 NTRYSEQDITINGITIPKGVQVLVPTFGMMRDEEYWEDPLTFNPDR 2167370
2167122 MEDMSKIDPMIYQPFGAGPRNCIGMKFALLEIKITFAKLLMKYDLDVCDDTP 2166967
2166487 XXPLAVAFKTSVKPKETLFLKV 2166428

>revised seq 119, 131 = ci0100140050 scaf 75 cigd043j06_1 rcigd043j06_6
MITPLDILSLETWILILTSFILIR
LYIQKKWQLLKNINIPHDPPTILGVGNMMGIIKDPNQ
MFLSYFKMKKKYGLVYG
AYSWLTPSITIADPEILKQIFIKEFSTFPDRQ
KALLDVNGKEMNTALTSVTGSQWKRIR
NTLSPTFSSSKLKEMFGIVEDCADSFVQNIGTINSDGDGRFDVSL 3486 
AFGRLTLDGICSSAFGVKVDSQ
AANNGKESQVAKMAREFLAIEIFKNPMVFVF
VVFPWLGKVAKLFDYSVFPKKYIIYFSKLIDSVVETKKKKKQ
RTDILQTMIDSQITEEDVKNGAVK
GVTRTEMKGNALITLIAAYETTSNAMVFLAYNLAVYKDAQHKCREEIKQVIAEH
GGLTYEAVQDLKYMTQCLNESMRLYSLVPA
NGRYCVRDITINGVTIPKGTLVNIPVFGMGRDEEFWNDPLTFNPDR
MLDMNEIDPMIFQPFGAGPRNCIGMRFALLEIKITFAKLLQKFYLDVCEDTP
APPLEVTFKSGMRPKEIIHLKVTAVE*

>SEQUENCE 125, 113, 114, 213, 138 
MITPLDILSLETWILILTSFILIRLYIHKRWQVLKDLNIPHDPPTILGVGNMSKFFKDS
DAGYKRHLELKQKFGPIYGHYIGLSPHVYVGDPEILKQIMIKEFHIFPDRQKNFQNANGK
EFNDNLTAISGKQWKRVRDTLSPTFTSSKLKEMFGIVEDCADSFVQNIGTINSDSNGRFQPA
VVFSKVAIDSICSAAFGVNVDSQ 
SKPQGQEPEVVKMALELFNYPLFKNPFFLIF (1)
LMFPWTENIAKVFDYSLLPKDCLRYFARLAKTVKKNKANVEQ 
RVDILQTMINSEITDNEVKNGAIK
LTEIEVTANSLLMMFGGFETTASAMVFLAYNLAVYKDAQHKCREEIEQ 
VIAKHGGLTYEAMNDLKYIMQCLNESLRLYPPAPMNSRYCERDITIHGLTIKKGITVQIP
VYGMGRDEEFWEDPLVFKPERMLDMNEIDPMIFQPFGAGPRNCIGMRFALLEIKITFAKL
LQKFHLDVCEDTPAAPIDVSFKSGMRTKQDLFLKVTARE*

>SEQUENCE 136, 203 I-HELIX 58% TO 4X1 missing region before I helix
MKAIARFPFPALVRFVSTCPAHQRAAWSKKASQEDLSYAKPFEAVPEPNSY (?)
PVIGTALEYTPLNNFNPSYVGEHWIERHKELGPIYKENILPG (?)
VTETMVFTSSPQNTEIMFRNEERCPFRDPLGPVAKIRDERNEYHGLTNSNGEDWWR (?)
VRQIVNQHFLQNTAVWKYAEAHRNVSKDFLKFIDRNMDEKNE (?)
VPNFGKALNRWSFEGAGVFTLKRRL (1)
GALDIKPCTEIEEMINCNERIFQSMGELLYALPFWKYVNTKDMKVYKTVFIYTCNKFCESYLVLID (2)
YIHSFVNIICSEKVILMTEFLVAGIDTTSNSAAFLLYALASNQEAQERLREEVREVNRMDTIDGK (?)
LLHNMKYLQACLRETQRMFPFVNSSPRRFRKDLVLSGYLIPAG (?)
HPHSDTSNTVNSRNPEYYDDPNTFIPERWLETKS (?)
FKPYIRFIMSGPFGLGMRMCPGRKFSIQELQLLLITLLSNY
RVEYHHDPIKVAFRLVSFPSQAPQFKFTKLKN*

>paired_scaffold_300 76% to seq 136
1065119 MKRITKLPFPAFVRFSSSCPVHQQAAWSQMIARE 1065018
1064602 FEEIPEPTNYPLIGTALEYTPLNNFNPSYVGEHWIQRHKELGPIFKENILP 
        GVTETMVFTSSPENTEIMFRNEEKCPFRD 1064243
1064242 PLGPVAKIRDERKEYHGLTNSNGNEWLR 1064159
1063148 VRQIVNQHFLQNAEVWEYTEQHRQVARDFQQFIERNMNKNNE 1063023
1062631 VPDFGKALNRWSFEGAGIFTLQTRF 1062557
1060031 GALDKRPGKEIEDMINCNERIFQSMGELLYSLPLWKYFNTKDMK 1050900
1058661 NIFFSEKVILMTEFLVAGIDTTSNSAAFFLYALATNPEAQEKLRQEVMPISKQDKIDGK 1058485
1058184 LHNLKYLQACLRETQRMFPFVSSSPRRFNKDLILSGYNVPA 1058062
1057327 GTHILNTSNTVNSRNPEYFTDPDTFIPERWLGEKS 1057223
1056497 RFIMSGPFGLGQRMCPGRKFSVQELQLLVLTLLSNYRIEYHGDPIKVAFRLVSFPS 1056330
1056329 RPPQFKFVNIK 1056297

>ci0100142705
LSYAKPFEAVPEPNSY
PVIGTALEYTPLNNFNPSYVGEHWIERHKELGPIYKENILP
VTETMVFTSSPQNTEIMFRNEERCPFRDPLGPVAKIRDERNEYHGLTNSNGEDWWR
VRQIVNQHFLQNTAVWKYAEAHRNVSKDFLKFIDRNMDEKNE

VPNFGKALNRWSFEGAGVFTLKRRLGAL
ICSEKVILMTEFLVAGIDTTSNSAAFLLYALASNQEAQERLREEVREVNRMDTIDGK
LLHNMKYLQACLRETQRMFPFVNSSPRRFRKDLVLSGYLIPAG
THILNTSNTVNSRNPEYYDDPNTFIPERWLETKS
VTERQRERFIMSGPFGLGMRMCPGRKFSIQELQLLLITLLSNYRVEYHHDPIKVAFRLVSFPSQAPQFKF

>ci0100143467 complete seq 145, 120, 123, 197
MPTLIADLVLLFVLYLLGKAVVRNWKTIAAGRRMSKIIGSPPAQWIVGHAYS (0)
LAPTPEGLLSNVKTAASYPGMMVLWLAHKWIVQVYHPDTVGKVLSTQ (1?)
ATKDDMTYEYLKAWVGYGILTSNGERFQRQKKLLVTAFHQDLIKQ (2)
YSPVFTKATDRLIEKWTEKQNQSVELFEDISLLTLDCILICAMSTDLDCQRIG (2)
KQHPYIDAVHQLAYLAIKRFYNPWVRHDFIYNLTSEGRLWKKLLKVVHE
MSESVVKARRKVVSISDKPKLSKTKYVDFLDVLLRAK
DESGEGLTDQEIREQVDTFLFAGHDTTAS (1)
AICWFLYNLACNPEYQDKCRKEVKEVMEGRKDVQM (2)
FDIGDMTYLNMCLKESMRLHSPIPFIGRKVKNPINLRRPN (?)
DNTEVTLCPGTTIIPHLLSLHRNPDVWDEPE (0)
VYDPERFNTKNASRLHPYAYIPFSAGPR (2)
ICIGQNFALVEMKLTVAKLLLNYEFYVDEDIPKPQPIPNVVLRSQNGLYVKFRKI*

>CIONA SAVIGNYI BEST MATCHES TO SEQUENCE 145
TYQYLTGFVGYGLLTSNGVRWQRQKALLSTAFHQDLLK
YSPVFTSATDTLIEKWGKKQNRSVELFEDISLLTLDCILICAMSTDLDCQRIG
EDEKGLTDIEIKDEVDTFLFEGHDTTSSG
IAWALYHLAKYSEFQAKCREELTEVLGDRKYVEW 
                     IREIMEDRKNVQL
FDVGEMSYLTMCIKESMRLYSPIPFIGRKIKNKLNVHRPNGASDVTLDADVTVIPHLFTLHRNPEIWDDPEV

>paired_scaffold_58a 63% to ci0100143467
178841 MPTLIADIVLLFVLQRFGKGLLRNWKTISAGKRMSRIVGSPPAQWLVGHAYN (0?) 178686
177840 LAPTTEGLLSNVKAAMKFPGMMVLWLAHKWIVQVYHPDTVGKILLTQ 177700
176599 ATKDDMTYQYLTGFVGYGLLTSNGVRWQRQKALLSTAFHQDLLKQ (2) 176465
176408 YSPVFTSATDTLIEKWGKKQNRSVELFEDISLLTLDCILICAMSTDLDCQRIG 176250
175694 KKHPYIEAVHHLSYLAIKRFYNPWVRHDFVYNLTADGRLWHKSLKVVHDLSESV 175533
174071 VVSARRKVVKIDDKPRLSKTKYVDFLDILLRTK 173973
173450 QDQSGEGLSDREIREQVDTFLFGGHDTTAS 173361
172356 AISWLLYNLARHTEYQEKCREEIREIMEDRKNVQL (2) 172252
171806 FDIAEMSYLTMCIKESMRLYSPIPFIGRKIKNKLNVHRPNGASDVTLDADVTVIPHLFTLHR 171620
171619 NPEIWDDPE 171593
171022 VFDPERFNVKNSAKMHPYAFVPFSAGPR 170939
170524 VCIGQNFAMIEMKLIVSKLLLNFEFWVDGETPEPKMIPNIVLRSENGVHVKFKDLQS 170354

>paired_scaffold_58b 57% to ci0100143467
153765 MPTLIADIVLLFVLQRVGKGLLRNWKTISAGKRMSRIVGSPPAQWLVGHAYN (0?) 153920
154799 LAPTTEGLLSNVKAAMKFPGMMVLWLAHKWIVQVYHPDTVGKILSTQ 154939
156215 ATKDDMTYQYLTGFVGYGLLTSNGVRWQRQKALLITAFHQDLLKQ 
156407 YSPVFTSATDTLIEKWGKKQNRSVELFEDISLLTLDCILICAMSTDLDCQRIG 156565
157943 KKHPYIEAVHHLSYLAIKRFYNPWVRHDFVYNLTADGRLWHKSLKVVHDLSESV 158104
159266 VVSARRKVVKIDDKPRLSKTKYVDFLDILLRTK 159364
159922 QDQSGEGLSDREIREQVDTFLFAGHDTTAS
160568 (1) aggc IVWLLYNLARHPEYQEKCREEIREIMEDRKNVQL 160669
161098 FDVGEMTYLTMCMKESMRLYAPVPFIGRKIKNKLNVHRPNGASDVTLDADVTVIPHLFTLHR 161283
161284 NPEIWDDPE 161310
       VFDPERFNVKNSAKMHPYAFVPFSAGPR 162162
162475 VCMGQNFAMIEMKLIVSKLLLNFEFWVDGETPEPKMIPNIVLRSENGVHVKFKDLQS 162645

>SEQUENCE 147, 149, 153, 163
MTLFEAASEVVFLNRGFVLSSLISEVVLLIAIVAVTKVYLV
PAIRYYKRGLLYTKQGCSPPRHWLWGHLTVFPLSHDGMLKRLRTTAQYAGMFVSWAGPFT
YWIETQHPTTCAVLANSSAPKAEFLYGCLRPWIGDGLLTSKGSKWKRNRRLLTPAFHFEI
LKPYLNIFNKSSYAMLEKWKRHSGQ
SYDVYGDVSNLTLDTMMQCAMSTDTDSGGEDGGYGYINAVHELTLLVMERVYNP
LHMIDWIYHFSSNGRRFRKLVDFVHETSET IIKERKKELETLVQEDNDNASTATNNELSS
FKKRGTKHLDFLDILLQTK
DENGNGLTLKEIRDEVDTFLFEGHDTTASGIAWCLYNLAKH
PEYQQRCREEVLEEVGEKEKIEWED
19  LSKLTYLAMCIKESLRLNPPVFCVGRELNEDLTFNDKYTPCKKMKVESGNTVLMQIMSMH 198
199 RNPHVWENPE 231 (0)
297 VYDPERFSAENMKNRHSMSFLAFSAGP 383 
156 RNCIGRNFAMNELRVVIGQTLRQYKLWCDDDTPEPKMQPNIILRSANGIHIKFKPV

>paired_scaffold_233 66% to seq 147
118145 MNVLDVLTYATFQKQGFLLPSIVSEIVLLILLAATTKLFVIPTVKRYNLGLKYSRQLSCP 118324
118325 PTHWLWGHLAR 118357
118783 YPLNHEGVLQRMRDTAQYAATFVCWLGPFKYWLEFQHPTSCAVLANSS 118926
119654 APKAPMLYEYLRPWIGDGLLLSSGNKWKRNRRLLTPAFHFEILKP (2)
120099 YTQIFNKTSTAMLNKWSKHAGQSYDIYGDVSNMAFSAMMQCTMSTELESNQEEG (2) 120260
120480 GDYISAVHELTEIIIKRINFPLLMVDWIFNLTPSGRRFWKLVDFVHKTSEA 120632
121152 IIEARRIELNQRNKSDNLEPLNITGVSQFKKRGTKHLDFLDILLQTR 121292
       DEDGNGLTQQEIRDEVDTFLFGGHDTIAS 122048
122374 GLSWCLYNLATHPDYQERCRQELVDVVGDNELIEW 122478
122810 DDLPKLNYLAMCIKESLRLFPPAFGVGRVLNEDLTIKDKYATCQNVTAVAGTTALL 122977
122978 QITALHRNPHVWENPE 123025
       VYDPERFSADNMKGRHPMAFLSFSAGPR 123564
124136 NCIGRTFAMNEMRVVIGQTLRKYELWCDRDTPEPRMKPNLILRSSNGIHIKFKPL* 124303

>SEQUENCE 148 scaffold 100
MAWTSVGGIVTTTAVGDFLLVACIVVVIKSFIIPAVRGYQDAKKRARSVGEVKKHWLYGSVKM (0)
FPQNEEGLMKRVASSFEKPVWTVNWFGPYISEVVIHHADLAITLLSSS (1)
APKNNLVYGFFHPWIGDGLLTSSGRKWQRNRRLLTPAFHFSILKP (2) 
YTNVSNACVRVMLDKWSKKVGTSMEIYLDVNLMTLDTILQCAMSTKSDCQNRS (2)
154552 KKNEYIEAVHDVSKYIMSRVHKPLLHIDWIYWLTAEGRKFKQLVKVLHDQSEK (0) 154453
VIRERRKTLENRKFEEESSGKKRLDFLDILLHTK (0)
DEDGKGLNDSEIRDEVDTFLFAGHDTTAS
GIAWALYNLAVNVDCQDKCREELKSVVGDKENIEW (2)
EDLSKLSYLTLCIKESLRLCPPVPFIGRELNEPLKF 
RSKLKEPNETTIDAHSNIALHIFTLHRNVHVWDSPE (?) 
EFIPERFKPENMKGRSPHAYLPFSAGPR (2?)
NCIGQNFAMNEMKIAIGQTLRRFKVIPDESSPKPSITPQVVLRPKDGIFIKLMKI*

>sequence 151, 152, 135 ci0100151682 scaf 357 ci0100147473 scaf 996 rcilv086f16 
MSFALANGVLTSTVVGDIALIFATLFVILNYILPIAKLRQSSSNLEKQVGLGSFKKHWFFGHLRL
FPPLEHGLLKRAQRTIQTPAWVTTWFGPLVSEATTHHSETAITLLASS
APKNEYVYKFLRPWLGDGLLVSNGKKWMRNRHLLTPAFHFSILKP
YAEISNNCIKTMIEKWTTKTNVAFDIYKDVNSMTLDTMLQCAMSFKIEGLKER
QKNTYLQAVRDLSTLTMARISNPLLHIEWIFWRSSTGRKYKAALEIVHQQTKQ
VIRERKEELKKASEVSETVTEGSETAQVTRKGKYIDFLDILLHTK
DENGIGLSDVEIENEVDTFLNEGHDTTA 
SGISWSLYMLAQHPEYQQKCRDELEEVLKDKEDIDWSNLTKLQYLTMCIKETLRI 
RPPVPAISRQLDQPLTFKSKTHNVPPTTLAKGSFAGVQIFSLHRNVHVWEEPE 
VFNPDRFTQDNIKKRSPHAFLPFSAGSRNCIGQNFAMNEMKIAVGQIIRKY 
ELYIDDDTPDTVMETALVLRSTNGIMIKIRKI

>Cs151 73% to seq 151 (C-term) 69% to ci0100151682 (N-term)  40% to CYP4T5
862563 MLASGLITSTLLADILLIFVTTFLIWNYVRPIVKLRQLSNKLESKIGPNKKHWFFGHLRM 862742
863300 FPPSEEGLLKRAQRTISSPAWVVNWFGPLLAEATFHHAETAITLLSSS 863443
AASKNEHLYKFLRPWLGDGLLVSNGKKWQRNRHLLTPAFHFNILKP 
YAQISNDCAKTMITKWTKKTGVAFDIYTDVNQMTLDTMLQCAMSFKIERQNEK (2) 
EKNIYLQAVRDLTNLTMERITRPLLHIEPIFWLSSTGRKYKEALEIVQGQTNR
VIQERRAELKTSKEVHESVEGLDQVKVTRKGKYIDFLDILLQTK
DEDGVGLSMQEIRDEVETFLNEGHETTAI 
GSSWTIYMLAMNPECQKKCREELSDVIKDKEDIEW 63%
SDLKKLQYLTMCVKETLRLRPPVPAISRLLDQPLTFKSKTHQVPETTLEKGSIAGVNIFALHRNVHVWEDPEV 83%
FDPERFNPENTEQRSPHAFLPFSAGS 70%
RNCIGRNFAMNEIKIAVGQVLRKFEFYIDDETPATKMATSMVLRAKDGIMLKLRAI 66%

>paired_scaffold_298b 100% to Cs151
862563 MLASGLITSTLLADILLIFVTTFLIWNYVRPIVKLRQLSNKLESKIGPNKKHWFFGHLRM 862742
863300 FPPSEEGLLKRAQRTISSPAWVVNWFGPLLAEATFHHAETAITLLSSS 863443
864149 ASKNEHLYKFLRPWLGDGLLVSNGKKWQRNRHLLTPAFHFNILKP 864283
864822 YAQISNDCAKTMITKWTKKTGVAFDIYTDVNQMTLDTMLQCAMSFKIERQ 864971
866223 DEDGVGLSMQEIRDEVETFLNEGHETTAI 866309
       GSSWTIYMLAMNPECQKKCREELSDVIKDKEDIEW 866663
866797 SDLKKLQYLTMCVKETLRLRPPVPAISRLLDQPLTFKSKTHQVPETTLEKGSIAGVN 866967
866968 IFALHRNVHVWEDPE 867012
867457 VFDPERFNPENTEQRSPHAFLPFSAGSR 867540
868027 NCIGRNFAMNEIKIAVGQVLRKFEFYIDDETPATKMATSMVLRAKDGIMLK 868179

>sequence 155 complete 80% to seq 110
MLLFYLQYAVYATVCAFAVFLLPSVFT (?)
SLKRFYTVRKNLNSQWPGPKGNWFW
GSSAKVIAEVQKDSAYYFTYIKEMTKQFPQGFRGQLGPIFLSLNAYHPNMVKAVLCAPAS
DAPKLNRAYKMLFEWLGHGLLVLEDKVWFRHRRLLTPSFHFEVLKPYVKTMNESAHVMVE
NWLGKTSENKVVEIEIFHSASLMTLDTILKCLMSYQSNCQEEKSTNTYVAKIYQLSNTV
VKRMRDLNLFKRIDPIYYLTK
NGKLYKKLCDDVHKFSESVIDRRKCDIGQPAHNGTRKHFDFLDTLLKARDSDGKGLSDSE
IRAEVDTFMFEGHDTTASGISWTFYCLAMHPEHQEKCFQEIEKVMADRTDIEW 
NDLSNLPHLTLCIKESLRQYPPVPIIFRKLNKDIEVDGKTIVK
DTDVILHIYALHHHEEFWKDPHIFDPS
RFTQENMKSMNSYAYVPFSAGPRNCIGQRFAMNEIKIAVAQVLRKFQLKPDLSKKIQHSV
DVIYRATTGLYLKAERRV*

>sequence 109, 124, 161   82% identical to 129, 156
MVAQSQLESILLQNGGGFIFQTLLADVLLFAALVFVTAKYVSPAVKHQFKMKKCGKEVGG
PKGHWFYGDLLE (0)
NGYDKTGLLNSGKRCEEFPAMCTQWLGPVQMNVQLHHPDVTKTVLSKS (1) 
HPKGYAYDKFLKPLLQDGLVTSKGDLWKRHRRLLTPIFHFDILKS (2) 
HVQIFNQNTTELLDWMEEASKSDVGGNLMIPCSNMAFNNVMECVMSQKCSAKEG (2)
EISTYTGYVKSMSNMMFQRLSRPLLHSDTLFRVSKLGKRFSIAKKGIEDYASK (0)
LIKERRAILSSNSPNSELTTTNIKDEVTGTSFNFTRRKGKMVDFLDILIQTK (0)
DQDGVGLKNDEINDEVITFFSAGHETVSN (0) 
AISWGLYTLASNLDCQTRCREEIKSAVGDKETLEW (2)
ADLSKLPYLTQCIKEAMRLFPTLHVISRQLSEDVTLSHRFNDYKPIIIPKGVCVAVNIFAMSRNPHIWEEPE (0)
KFNPDRFTPENTSKRSPHAFIPFGAGPR (2)
NCIGQNFGMQQIRVVLSKVLRKYEVYVDETLPKPEMSPGVTVQPKDGVFIKVRKLTTQ* 

>paired_scaffold_309 77% to seq 109
11655 MVAQSQIELLLLQSGGGFVFQTLLADILLLVVLAVVSAKLVVPTVKHQIKLKKCADEVGG 11476
11475 PKMHWFYGDLIK (0) 11440
11297 YAYNEESVWNSIKRCDEFPAVCRQWLGPFQMNVQLYHPDVTKTILAKS 11154
10372 VPKSYAYDKFLKPLLRDGLVTSSGDIWKRHRRLLTPIFHFEILKS 10238
10079 HVNIFNKNASILLNFCEKASENGERSDVSVASADMAFTNVMECVMSQRANDNTE (2) 9918
 9173 GAAKYVAYIRSASAIMFQRLCRPHLHADTLFNLSKLGRKFAKVKQGIEEYAFK 9015
 8761 IIVQRRQILSNTSSSGDLTNASVKDANGTVFNFSRRRGKIVDFLDILIQTK (0) 8609
 6943 DQDGVGLTNTEINDEVVTFFSGGQETVSN (1) ggt 6857
 5473 SAITWTLYSLANNMDYQQRCREDIKEMLGDKSDIEW 5366
 5146 GDLSKLPYLTQCIKEAMRLYPTVHVIGRYLTEDVTISHRFNDWKPLTIPQGVCVAINALAMHRNPHIWKDPE 4931
 4032 TFDPDRFTPENSSKRSPHAFIPFGAGPR 3949
 3595 NCIGQNFAMQQVRVSISKILLKYEVYIDKSCPKPVMVPGVTMQPKDGVFIKVRKL 3431

>paired_scaffold_81a 81% to seq 109 Frameshifted in exon 3 (lower case)
208078 MVAQSQIELLLLQSGGGFVFQTLLADILLLVVLAVVSAKLVVPTVKHQIKLKKCADEVGG 207899
207898 PKMHWFFGDLVK 207863
207803 YAYNEEGVWNSVKRCDEFPAVCRQWLGPFQMNVQLYHPDVTKTILAKS 207660
206667 VPKSYAYDKFLKPLLRDGLgdvirryignvigRLLTPIFHFEILKS 206530 
206449 HVNIFNENANILLDFCEKASESGSGSDVSIASADMAFTNVMECVMSQKATDNTE (2)
       GTAKYVDYVRSASAIMFERLCRPLLHTDTLFSLSNLGRNFAKVKLGIEDYAHK 205961
205684 LIVQRRAMFSNTSTSSDLTKATVKDASGTEFNFSRRRGKMVDFLDILIQTK 205508
205218 DQVGVGLTNTEINDEVITFFSAGHETVSN 205132
204780 (1) agca AITWTLYTLASNLDCQTRCREEIQSALGEKSNVEW 204679
204556 ENLSKLPYLTQCIKEAMRLFPTLHVIGRYLTEDVTISHRFNDWKPLTIPQGVCVGVNIFAMHRNPHVWEDPE 204341
204038 KFDPDRFTPENSAKRSPHAFIPFGAGPR 
203824 NCIGQNFGMQQIRVNVSKVLLKYEVYTDESCPKPIMVPGVAMQPKDGVYIKVRKL 203660

>sequence 129, 156 this gene has 11 exons 36% to 4X1 (in the CYP4 clan)
MVAQSQLESLLLQSGGGFIFQTLLADVLLFAALVFVTAKYVSPAVKHQVLLRKCAKEVGGPKGHWFYGDLLE (0)    
YGYDETGVWNSVKRCEEFPAMCSQWLGPAQMNVQLYHPDVTKTVLSKS (1) 
YPKSYAYNKFLKPLLRDGLVTSKGDLWKRHRRLLTPIFHFDILKS (2)
HVQIFNQNSSRLLEWMEETSKSDVGRDVAVPDCDMAFSNVMECVMSKKSSDLTG (2)
ETTRYTGYVQSVSAMMFQRLCQPILHSDSMFKVSGLGRRFAVAKKGIEDYANK (0)
IIIERRAILSSNSPNSELTTTTVKDDVTGTSFNFNRRKGKMVDFLDILIQTK (0) 
DQDGVGLTNDEINDEVITFFSAGHETVSN (1) 
AITWSLYTLASNLDCQTRCREEIKSAVGDKETLEW (2)
ADLSKLPYMTQCIKEAMRLFPTLHVVGRQLTEDVTLSHRFNDWKPVTLPKGSSIGVNIFAMNRNPHIWDEPE (0)
KFDPERFSPDNTANRSPHAFIPFGAGPR (2)
NCIGQNFGMQQIRVVLSKVLRKYEVYVDETLPKPVMVPGVAMQPKNGVYIKVRKLSKE*

>paired_scaffold_81b 78% to seq 109
215526 MVAQSQIELLLLQSGGGFVFQTLLADILLLVVLAVVSAKFVVPSVKFQIKLKKCSDLVGG 215347
215346 PKMHWIYGDLVK 215311
214235 YSYNEEGVWNSVKRCDEFPAVCRQWLGPFHMNVQLYHPDVTKTILAKS 214092
213556 VPKSYGYDKFLKPLLRDGLVTSSGDIWKRHRRLLTPIFHFEILKS
213179 IITRHVNIFNENANILLDFCEKASESGSGSDVSVASADMAFTNVMECVMSQKATDNTE (2) 213006
       TAKYVDYVRSASAIMFERLCRPLLHTDTLFSLSKLGRNFAKVKLGIEDYAQK 212714
212479 LIVQRRAMFSNTSASSDLTKATVKDASGTEFNFSRRRGKMVDFLDILIQTK 212303
       DQEGVGLTNAEIKDEVITFFSAGHETVSN 211905
211669 SAITWTLYTLASNLDCQTRCREEIQSALGEKSNVEW 211562
211234 ENLSKLPYLTQCIKEAMRLFPTLHVIGRYLTEDVTISHRFNDWKPLTIPRGVSVVVNIFAMHRNPNLWEDPE 211019
210423 KFDPDRFTPENSSKRSPHAFIPFGAGPR 210340
210229 NCIGQNFGMQQIRVTVSKILQKYEVYIDESCPKPIIVPGVTMHPKDGVFIKVRKL 210065

>sequence 158, 146 almost = seq 130 
MLLPDYLEFSLSLYSIGAVCVVVFLYKSRKLIGRFLKERRIRKELTKEIDGPQPHWLFGHLNL (0?)
LQPNEETILKTCEQTQKYPKLGMVWFGPALAYLMVFHPHVVRPVLNVE (?)
HPKDNLAYRFIKPWIGDGLLVSHGSKWRRNRHLLTKAFHFDILKQ (?)
YTKVFDACCKTMLEKWSRKCDGSTVEVFRPVSLMTLDSMLQCAISCETECQQEG (2)
NSNTYTDAVFTLTTIAMKRIFNPFCHVDWIFAMTKSGRQFKAAAKLIHEFDEK (?)
VIRERRQENKKILGRSDDEDLTQEDLEKITSWRKSDGRYLDFLDMLILTR (?)
DDDGKGLTEREIRDEVDTFLFEGHDTTASGIAWSLYCLAANEEYQEKCREEIEQVVGSKDALEW (2)
QDLSKLPFVTMFIKEVLRLYPPVFAVARRTEQPVKFPRGFGRDQLNVGDEQ (0)
LPKSDCSRVVDAPLNIGIPIMTLHRNEQVWEDAK (?)
VFDPYRFSTENCAKRSPYAYLPFSAGP (?)
RNCIGQNFAMAELKTVLVRTLLRYKLYVDNQCPDPKMSPMIVLKSLNGIHIKLKKI*

>paired_scaffold_315 76% to seq 158
1244324 MLYLDSLGYFFSTFSVVVFSCLTFLVVNEIGKLLIPHLSELRRSKALTEEIDGPKPHWLFGHLHL (0)
        LKPNQESILKTCEQTKMYPKMGMAW 1244653
1244654 IGPVIPYLMLIHPHVIRPLLTVE 1244722
1244806 EPKDNLAYRFIKPWIGDGLLVSHGKKWRRNRHLLTKAFHFEILKQ (2) 1244937
1245004 YSKVFESCCNTMLDKWTDKCDGNMVEVSSPISRMTLDSMMQCAISCETDCQLEG (2) 1245165
1245349 KSNEYTSAIYTLTTIAMKRIFNPFCHIDWIFGITKSGRQFKEAAKYIHDFDEK 1245507
1246267 VICERQFSNRSVLLQHLAENDKSDSEKIAAHRKSEGKHLDFLDILLLTR (0) 1246413
1246747 DEDGKGLTEREIRDEVDTFLFEGHDTTASGIAWSFYCLATNTEHQETCRREIKEVLGTKTSLNW 1246938
1247249 QDLSKLPFVTMFIKEVLRLYPPVFAIARRTQHAIQFPRGFGADQHCMED (0) 1247395
1247695 LDTKEGCGRKVEAPCNMAIPIMNVHRNEQLWDDPQ 1247799
1248144 VFNPMRFSAENCSKRSPYAYIPFSAGPR 1248227
1248880 NCIGQNFAMAELKTVIAQTLLRYKLYVDDECPTPVMSPMIVLKSTNGIHIKLEKI 1249044

>ci0100150610 seq 160 and seq 159 with N-term and C-term added complete
MQSSGLSSIWDQLQINDWLPSSFLVETTLLILLIYLITHYGLLFRETYTKSRKISERVGVDEKNRHWFFGHLKH (0?)
ITNNEEGLLKFVELAKAFPLMVSLWLGPALTTVQTYHPDSTTALLNSGAPKNDLGYRFIR
TWIGDGLLTSKGKKWFRNRRLLTPAFHYKVLKP YTEVFNKNSLVMVNKLNKLCDQPVNIYTHVGLMTLDT
MLQCAMTSKTNCQHVTDHPYLLAVKEISDLIIARVRQPLHFFDWM YNLSAAGKRNKEVTKILHEHTEKII
SERKAVLAKTANIDPEPSDVEGFRKTEGKTLDFLDILLQCKDENGEGLSDIEIRDEVDTFLFAGHDTTSS
GIAWAMFHLAKYPEFQTKCREELKEVLGNKDEIEWSDLPQLVYLTMFLKESMRLRPPVYAVGRYLEHPIT
IKDKNTKRDTVIPAGTNVSMHIFTLHRHPDFWDEPSKFDPERFTKENIAKRPAFAYVPFSAGSRNCIGQN
FAMNEMKISLAHVIRNLRLYIDDETPVPKMQPRLILQSSTGIFVKIEKLERQ*

>paired_scaffold_214 51% to seq 160
235361 MHLTTFLLRNYPFWFVVQSCLVILALYLFMHYGMVLRRLHNNAKHLDEIAGVEKRTKHWFYGHIKQ (0) 235164
234853 YTNNETGVQRVIGRTKYFPLMVSHWVGPVLSALHVYHPDSVHAVLTSG (1) 234710
234607 APKDDLVYRFLRPWIGDGLLTSKGTKWFRNRRLLTPAFHYKVLKP 234473
234171 YSEIFNKNSLVMVDKFKKLCGQSIEVHTHVGLMTLDTMLQCAMSTKTNCQHDT 234013
233744 DHPYLLAVKEVGQLVIERARQPLHFFDWIYNLSANGKRNKVVVKFLHDHTET 233589
233368 IIAERKEVLAKAVNDDENDSGMEGFQTIKGKALDFLDILLQSK 233240
231534 DEDGKGLTDIEIRDEVDTFLFEGHDTTSS 231448
231256 GIAWALYHLAKYSEFQAKCREELTEVLGDRKYVEW (2) 231152
230678 NDLPKLEYLTKFIKESMRLKPPVYSIGRELEEPLKIKEKVSGHESTIPTGTNLVTNIIA 230502
230501 LHNHVKLWKNPT 230466
230179 VFDPERFTKENIAKRPAFAYVPFSAGN 230099
229560 RNCIGQNFAMNEMKICLAQVIRNLRLYIDGDSPVPKMHPMLVLQSESGIFIKFEKI 229393

>paired_scaffold_65a1 83% to _214
218496 MKLITYLWENYNSFVFGRSFIETCLVILALYLFMHYGMVLRRLHSNAKHLDEVAGVDKRN 218317
218316 KHWFYGHMKH 218287
218106 YSYNETGLLRLIQRSKAFPLMVSHWIGPVMTILQVYHPDSVHAVLTSG 217963
217791 APKNDLAYRFIRTIVGDGLLTSKGSKWFRNRRLLTPAFHYKVLKP 217657
217173 YSEIFNKNSLVMVDKFKKLCGQSVEVHTHVGLMTLDTMLQCAMSANTNCQHVT (2) 217015
216853 NHPYQLAVREAGELVIKRLRQPLHYFDWIYNLSANGRRNKVVVKVLHDHTEK 216698
216370 IIAERKEVLAKAANDDENDSELEGFQKLKGKAVDFLDILLQSK 
215337 NDNGKGLTDIEIRDEVDTFLFAGHDTTSS 215251
214773 GISWALYHLAKYPEFQTKCRDELTEVVADREYVEW (2) 214669
214226 NDLPKLEYLTKFIKESMRLVTPVYSIGRQLEKSLKIKEKDSEHVYTIPAGINVSTSI 214059
214058 FALHRHESFWENPS 214017
213405 VFDPERFTKENIAKRPAFAYVPFSAGSR 
213213 NCIGQNFAMNEMKICLAQVIRNLRLYIDGDSPVPKMQPLLVMQSETGIFIKFEKI 213049

>paired_scaffold_65a2 92% to _214
288502 QDEDGKGLTDIEIRDEVDTFLFEGHDTTSS 288413
       GIAWTLYHLAKYPEFQTKCRDELTLVLGDGKNVEW 
       DLPKLEYLTKFIKESMRLKPPVYLIGRQLEEPLKITEKS 287897
287896 SGRESTIPAGTNVSTSIFMLHRHEHLWENPT 287804
287390 VFDPERFTKENIAKRPAFAYVPFSAGSR 
287202 NCIGQNFAMNEIKICLAQVIRNLRLYIDGDSPDPKMLPRIILQSESGIFIKFEKI 287038

>paired_scaffold_65b 92% to _214
331096 MHLTAYLLGNYPFWFVVQSCLVILALYLFMHYGMVLRRLHNNAKHLDEVAGVDKRTKHWFYGHFKH 331293
331599 YTNDETGVQRVIGRTKYFPLMVSHWIGPVMSALHVYHPDSVHAVLTSG 331742
331874 APKDDLGYRFLRSWIGDGLLTSKGSKWFRNRRLLTPAFHYKVLKP 332008
332308 YSEIFNKNSLIMVDKFKRLCGQSIDVYTPVGLMTLDIMLQCAMSTKTNCQHET (2) 332466
332769 DHPYLLAVKEVGELVIERTRQPLHFFDWIYNLSADGKRNKSVVKFLHDHTET 332924
333145 IIAERKEVLAKAINDDENDSGMEGFQTIKGKSLDFLDILLQSK 333273
333680 DEDGKGLTDIEIRDEVDTFLFEGHDTTSS 333766
333994 GIAWALYHLAKYPEFQTKCREELTEVLGDRKYVEW
334612 NDLPKLEYLTKFIKESMRLKPPVYAIGRELEEPLKIKEKVSGHESIIPTGTNLVTNIIALH 334794
334795 NHVKLWQNPK 334824
335111 VFDPERFTKENIAKRPAFAYVPFSAGSR 335194
335607 NCIGQNFAMNEMKICLAQVIRNLRLCIDGDSPVPKMHPMLVLQSKSGIFIKFKKI 335888

>paired_scaffold_65c 90% to _214
350975 MQFTAYLLGNYPFWFVVQSCLVILALYLSMHYGMVLRRLHNNAKHLDEIAGVDKRTKHWFYGHFKH 350778
350523 YTNDETGVQRVIGRTKYFPLMVSHWIGPVMSALHVYHPDSVHAVLTSG 350380
350140 APKDDLVYRFLRSWIGDGLLTSKGSKWFRNRRLLTPAFHYNVLKP 350006
349721 YSEIFNKNSLVMVNKFKKLCGQPVDVYTHVGLMTLDTMLQCAMSTKTNCQHET (2) 349563
349319 DHPYLLAVKEVGELVIKRLRQPLHYFDWIYNLSANGKRNKIVVKFLHDHTET 349164
348664 IIAERKEVLAKAVNDDENDSEMEGFQKLKGKALDFLDILLQSK 348536
348184 DEDGKGLTDIEIRDEVDTFLFEGHDTTAS 348098
347897 GIAWALYHLAKYPNFQSKCREELTEVLGDRKYVEW 347793
347356 NDLPKLEYLTKFIKESMRLKPPVHAIGRDLEEPLIIKEKVSGHESIIPTGTNLVTNIIALH 347174
347173 NHVKLWQNPK 347144
346858 VFDPERFTKENIAKRPAFAYVPFSAGSR 346775
345993 NCIGQNFAMNEMKICLAQVIRNLRLYIDGDSPVPKMHPMLVLQSKSGIFIKFEKI 345829

>paired_scaffold_65d 91% to _214
359984 MQFTAYLLGNYPFWFIVQSCLVILALYLFMHYGMVLRRLHSNAKHLDEIAGVETRTKHWFYGHFKH 359787
359532 YTNDETGVQRVIGRTKYFPLMVSHWIGPVMSALHVYHPDSVHAVLTSG 359389
359146 APKDDLVYRFLRSWIGDGLLTSKGSKWFRNRRLLTPAFHYKVLKP 359012
358731 YSEIFNKNSLVMVNKFKKLCGQSVDVHTPVGLMTLDTMLQCAMSTKTNCQHET (2) 358573
358300 DHPYLLAVKEVGELVIERLRQPLHYFDWIYNLSANGKRNKIVVKFLHDHTET 358145
357510 IIAERKEVLAKAVNDDENDSEMEGFQKLKGKALDFLDILLQSK 357382
357146 DEDGKGLTDIEIRDEVDTFLFEGHDTTAS 357060
356859 GIAWALYHLAKYPNFQSKCREELTEVLGDRKYVEW 356755
356318 NDLPKLEYLTKFIKESMRLKPPVHAIGRDLEEPLKIKEKVSRHESIIPTGTNLVTNIIALH 356136
356135 NHVKLWQNPK 356106
355819 VFDPERFTKENIAKRPAFAYVPFSAGSR 355736
       NCIGQNFAMNEMKICLAQVIRNLRLYIDGDSPVPKMHPMLVLQSKSGIFIKFEKI 355111

>paired_scaffold_65e 89% to _214
374711 MQFTAYLLGNYPFWFVVQSCLVILALYLFMHYGMVLRRLHNNAKHLDEIAGVEKTTKHWFYGHFKH 374514
374260 YTNDETGVQRVIGRTKYFPLMVSHWIGPVMSALHVYHPDSVHAVLTSG 374117
373872 APKDDLVYRFLRSWIGDGLLTSKGSKWFRNRRLLTPAFHYKVLKP 373738
373453 YSEIFNKNSLVMVNKFKKLCGQSVDVYTHVGLMTLDTMLQCAMSTKTNCQHET (2) 373295
373017 DHPYLLAVKEVGELVIERLRQPLHYFDWIYNLSANGKRNKVVVKFLHDNTET 372862
372231 IIAERKEVLQNAKAVNDEENDSKMEGFQTIKGKALDFLDILLQSK 372097
371690 DEDGKGLTDIEIRDEVDTFLFEGHDTTAS 371604
371403 GIAWALYHLAKYPNFQSKCREELTEVLGDRKYVEW 371299
370862 NDLPKLEYLTKFIKESMRLKPPVHAIGRDLEEPLKIKEKVSGHESIIPTGTNLVTNIIALH 370680
370679 NHVKLWQNPK 370650
370364 VFDPERFTKENIAKRPAFAYVPFSAGSR 370281
369814 NCIGQNFAMNEMKICLAQVIRNLRLYIDGDSPVPKMHPMLVLQSKSGIFIKFEKI 369653

>paired_scaffold_81c 59% to seq 160 ci0100150610
196431 MQFITYLWTNFQSVAVGRSFIETCLVILALYLFMHYGMVLRRLHNNAKHLDEVAGVEKRTKHWFYGHMKH 196640
196799 YTNDETGMLRIIQRFKAFPLMISHWIGPVLTILQVYHPDSVHAVLTSG 196942
197268 APKNDLGYRFIRSWIGDGLLTSKGSKWFRNRRLLTPAFHYKVLKP 197402
197868 YSEIFNKNSLVMVDKFKKLCGQSVEVHTHVGLMTLDTMLQCAMSANTNCQHVT (2) 198026
198233 NHPYLMAVKEVGDLVMKRIRQPLHFFDWIYNLSANGRRNKIVVKVLHDHTEK 198388
198652 IIAERKEVLAKAVKNDENDSEMEGFQKMKGKALDFLDILLQSK 198786
199686 DENGKGLTDIEIRDEVDTFLFEGHDTTAS 199772
200175 GIAWALYHLAKYPEFQTKCREELTEVLGDQKYVEW (2) 200279
200737 NDLPKLEYLTKFIKESMRLVPPVYAIGRKLEKPLNIKEKDSE 200862
200863 HESTIPAGTNIGTSIFALHRHVHFWENPT 200949
201269 VFDPERFTKENIAKRPAFAYVPFSAGSR
201413 NCIGQNFAMNEMKICLAQVIRNLRLHIDGDSPVPKMQPRLILQSESGIFVKFEKL 201577

>paired_scaffold_298c 79% to _81 69% TO Cs151
1131691 MVWFGIQGVVTSSVFGDLLLLICMAIVIYNYVTPAVKTYQASNKTAKKIGKVKKHWFWGNLKM 1131503
1131324 FPANKQGLMKRVALTLDNPAWVANWMGPIICEVVLHHADTAITMLSSA 1131181
1130910 APKNRTGYRFLRPWIGDGLLTSSGKKWQRNRRLLTPAFHFEILKP 1130776
1130381 YGDVFNECVKVMLDKWEAGPNNQVEIYKDISLMTLDTMLQCAMSNKSNCQKIG (2) 1130223
1129794 GRNEYIEAVHDLSIFTIQRAHSPLMHIDWIYWLSREGRKCKKALDIVHRQSEK (0) 1129633
1129023 VISERKKVLLMNTGESDRRRNKRKHLDFLDILLQTK 1128916
1128690 DEDGMGLSDKEIQEEVDTFLFEGHDTTAS 1128604
1128514 GIAWALYNLASNENCQDKCREELRRVVGEKENIDW 1128410
1128148 DDLSKLPYLTMCIKESLRLYPPVPFIGRELKEPLQFQSDFNKTKQTTIEAKSQVALHIFSL 1127966
1127965 HRNVHVWESPN 
        EYNPERFNPENIKGRSPHAYLPFSAGPR 1127765
1127515 NCIGQNFALNELKISIGQTIRRFKLYTDETTPTPIIKPALVLRPTNGIHIKFQPV 1127351

>SEQUENCE 190, 162 36% to 26A1 fugu ESTs rcicl022k09 rciad056p18
MVESSFLYPDGGGIRKAITNISSLEVPSNRTLINYLESEDFFYFLFFLIKYI
GVPCFLMHVICCLWEIRCVRMKDHSCPLPLPRGNMGFPLVGEMFHLFVV (0)
GANYQRQRRSKYGKIYRTHMFGSPSVAVIGEEHVKKIVLGE (1?) 
GTLVQTRWPSTTRRILGTDGIVNGDLATHCRIKRLALKAFSPKYIGTYAPVIQAGVAAHV (?) GCiWno735_j03.b1
QEWVDQGTVHILEKCKQAVSKTMMTV
LLGIKSNDPDIQLYIQAADDIINSILSLPLDLPGFGFHK (?) GCiWno735_j03.g1
GLKARELIFNKLESQLKSKFNATDDELFGDSGGEFESVL
LNILKFARREGETEIKLRNLQNLSLEFIFAGTQSLQCSCSLVTFHLSQNPQV = seq 162
NFECREEIKAAGLWDTPLTEINVERIQSLRFIELVAKETLRV
TPTIPGAVRVALKTFELGGYQIPKGWQIIYSIRLTHEEAAGKVLQN
GSKCEFEPGKHFTSTCKNTDVKKSGEDKSHECLKQQSPFAFLPFGKGPRMCAGKNYGMLFLKVLIFELVRTA
DISLEGKCKIIEVPMTRTDKSVKIKFTSIKKLKPENSVNDASSSTAV*

>paired_scaffold_422 ortholog of seq 190 80%
931985 MTNMSSWDNATNKTLVSYIGSNDFYFTIFFLIKYIGIPGFLLYLTCWLWEIRCVRMKDHS 932164
932165 CPLPLPGGNMGFPLVGEMFHLFVV 932236
934420 GANYQRQRRRKYGKIYRTHMFGSPSVAVIGDEYVKKIVLGE 934542
935404 GTLVQTRWPSTTRRILGTDGIVNGDLATHCRIKRLALKAFSPKYVGTFAPVI 935559
935560 QAGIAAHIK 935586
       EWLGRGTVPILEMCKQAVSKTMM 936281
936282 TVLLGIKSDDPNIQLYIQAADDIINSILSLPLDIPGFGFRR 936404
938997 GLKARELIFDKLETQLKSKFNASDEELFGDNGGEYESVLLNILKLARREGETDIKLRNL 939173
939174 QNLSLEFIFAGTQSLQCSCSLVTFHLSQNPKV 939269
939712 CREEIRTAGLWDIPLTEINVERLQGLKYVEMVAKETLRVTPTIPGAVRVALKTFELG 939882
       GYQIPRGWQIIYSIRLTHEEAAGKVLQNGGEC 940819
940820 EFNPGQYFDCSCDSDDSKSTKTAPPKHTCSKQNSPYSFLPFGKGARMCAGKNYGMLYLKV 940999
941000 LIFELVRNADISLVGKCKIVEVPMTRTDKSVKIKLTPIRK 941119

>SEQUENCE 164, 165, 171 53% to seq 169 SCSSTDSDASSPYLNTDGRKFFGQ extra
MWLLLVASVEQYLSKHTSCRVRNTSVAVTPTCVKA (2) 
RQGEPPLVNHTLPFIGAALDFGKDPLNYLRNLQSKFGDVFTIKLAGW (2) 
DTHVFMNPLDVRSLERAKFLGHQEIIGDFVTRMVGSPH (1)
ILSPAADVQKKLTHSIEKNLRGSMQLNLLCSSCKPVVVNALHRTLGHTMNHHKP (0) 
IKLHEFSKKLIFDITFNLLFGSLPDYERASEETERTYDVFKTYFKGSAL
LVDRIPIHLLPETKKARGIFLKMMEELDWSKRENVSKLIDDIVKVNPKL (1)
EFLKQRARHLMVVLWAAQANTTPALFWTLFYLIANPDAKRAVLEEYEQL (1)
RWPTMKDILQIERKDLDKLVIL (1)
DGCLRETMRLTGASMSIRKATRDESIKTSDGKQYSVRRGDYTAFFAPVTHMDEDIFEKPQ (0)
DFVHDRFLVSVDDTSS 
LKPRSRFTKNGQRVAVSRAIMTFGYGVTRCPGRHIALIEIKLAVLAMLGHF 
NVQFVDSNTKPPSFDLTHLGFGVMPPDTDIEVLIAPKCC*

>paired_scaffold_174 78% to seq 164 (ortholog)
35196 MDSTFCSIILVAVNLILLAAIFLRRRR (2) 35116
34415 KGEPPLVSHALPFIGAAIDFGKDPLNYLKNLQSKHGDVFTIKLAGW 34278
30946 DYHVFMNPLDVRSLERAKFLGHQEIIGDFVTRMVGNPHKIHSHDFIPELHTADGKSKKTL 30767
30766 SPGAEVQKKLTHSIEKNLRGSMQLNSLCESCKPGVVNALLRNLKPSMSHQDKP 30608
29143 IKLHDFCKRVIFDITFNLLFGSLPDYDESCKETSRMYDVFKLYFKRSALLVDRIPIHLLP 28964
28963 ETKKARTEFLKLMEGIEWSKRENVSRLIQDIVDVNPKL 28850
25705 EFLQQRARHLLVVLWAAQANTTPAFFWILFYLLSNPDAKRAVMEEYEQLKRHKHRNKQVK 25526
25525 ERGNDGWPTMQDILGIKRKDLDRLVIL 25445
25177 DGCLKETMRLTGASLSVRKASREEKIKTVDGRMFSIRKGDYTAYFASVTHMDEEIFENPQ 24998
      DFVHDRFLVSADEVSSX
23975 KPRSRFSKHGQRVAVSRAIMTFGFGATRCPGRHIAMIEIKLAVLALLRHFDVRFANPNEK 23796
23795 TPEFDLSHLGFGVMPPATDIDVLVSLK 23715

>paired_scaffold_323 78% to seq 164 96% to paired_scaffold_174
26637 MDSTFCSIILVAVNVILLAAIFLRRRR (2?) 26557
26068 KGEPPLVSHALPFIGSAIDFGKDPLNYLKNLQSKHGDVFTIKMAGL 25931
21750 DYHVFMNPLDVRSLERAKFLGHQEIIGDFVTRMVGNPHKIHSHDFIPELHPADGKSKKTL 21571
21570 SPGAEVQKKITHSIEKNLRGSMQLNSLCESCKPGVVNALLRNLKPSMSHQDKP 21412
20013 IKLHDFCKRVIFDITFNLLFGSLSDFDESCKETSRMYDVFKLYFKRSALLVDRIPIHLLP 19834
19833 ETKKARTEFLKLMEGIEWSKRENVSRLIQDIVDVNPKL 19720
16557 EFLQQRARHLLVVLWAAQANTTPAFFWILFYLLSNPDAKRAVMEEYEQFKRHKNRNKQVN 16378
16377 ERGNDGWPTMQDILGIKRKDLDRLVIL 16297
16057 DGCLKETMRLTGASLSLRKAAREKKIKTVDGRMFSIRKGDYTAYFAPVTHMDEEIFENPQ 15878
15103 DFVHDRFLVSAEEVSSGSSLDSDVSTPSFTGDNERFFGQLKPRSRFSKHGQRVAVSRA 14930
14929 IMTFGFGATRCPGRHIAMIEIKLAVLALFRHFDVRFAKPNEKTPEFDLTHLGFGVMPPET 14750
14749 DIDVLVS 14729

>SEQUENCE 169 53% to seq 164 2 possible exon 1s scaffold 544
34931 MWFVMVASVEQYLSKLISCRVRNTSVAVIPTCVKT 34827 
34666 MFWTLALVILNVVCYSIYRNRRRR (2?) 34595
34321 RAGEPPLVSHWIPFIGSALDFGKAPLEYLQSLQKRLGDVFTIKLAGW (2) 34181
NVHVFMNPHDVPNLERAKNIDHKNVIGDFVSRMVGDEH
MIHNPARHSDAVPKILESTLRSTMHLMTLCDSCTNVLPKTIKRTLKSEFNCKRETK (0)
IGLYELCKKLVSDMTLNLFFGSLASDEKCNEETQQICEEFFVYFKSSALLVDPIPIHLLP
ATKKSRGRLFELLSRFDFENRDNVSELIKKVAAV (1)
GSSEEKVRHLLIILWSAQANTLPALFWTLFYILNDKTARIDVLEEFEKHLRKT (0)
WPNLSDIQRMRKGDVDKLVLL (1?)
DSCTKETMRMTGSSLAARRAHADTTVTLASGQEYKIRKGDYTSYFAPVTHQDADIYDKPE (0)
EYRHKRFLQHDDKDPSMLKSRTQFSKNGKRISPSTSIMTFGYGAVRCPGRHIALMEIKFLTLVLLRHF 
DIKLVKNYSRQSEDYPQFDLTHLGFGVMPPSRDVDVTLSI*

>paired_scaffold_23 ortholog of seq 169 75%
358818 MIWTLSLIVLNLVLAFLYIGRRKR (2) 358747
358491 LPGEPPLVSHWIPFVGAALDIGKAPLDYMRSLQRRLGDVFTIKLAGW 358351
358105 DVHVFMNPLDVPSLERAKSIDHTQVIGQFVSQMVGDKHLIHNPAKHSEVIPK 357950
357949 TLEKTLRSAMHLITLCDSCTSILPKTIERTLKSEFSSKRKMK 357824
357454 IGLYKLCKKLVADMTLNLFFGSLASDKDCSEETRKICEEFFVYFDSSALLVDRIPIH 357284
357283 LLPATKRSRNKMFELLSDFDFENRENVSKLISQVAAV 357173
       ERVRHLLIILWSAQANTLPALFWTLFHI 356785
356784 LNDKQAKVDVLQEFEKHLGKSENFYDVTTVGENLQWPTMDDIQRIKKGDLDKMVVL 356617
356199 DSCLKETLRVSGSSMAVRRATQNTTITLHSGKVYNIRKGDYTSYFAPVTHQDTQIY 356032
356031 EKPE 356020
355768 EYRHERFLQRGEKDPNMLKSRTQFSKEGKRISPSTSIMTFGYGAIRCPGRHIAMIEIKV 355592
355591 LALTLLKHFNIDLGQIYDRSSKDYPQPDLTHLGFGVMPPDRDVGVTLS 355448

>SEQUENCE 166, 170, 172, 181 36% to 7B hum 
MILEALTTVLAALVSSYLIWLVAKRKQKDGEPPIVPFWIPWL (1)
GSGLQYQKDPITFLRKQEKKYGPIFTVFM
GGSYFTFVTDPFTHPIIAKESKVLNFRICTIPTIKKVFDFTRRGVDVPTMGGLMVKYMQ
GHWLETLVSKMMGNLQKVLLHGSYNNTEWATTGLLHLVETVMCTAGFKSL
FGEYPASSGVDNNKLHHKIVEDFVKFDKAFPLLVQGRPISSLK 
GVKESRANMWKFFTMDEIHKREMVYEMVEQYAKKLEEDQELEQLKKELYCVNWAAM
SNTLLSAFWALYYILKHPEAMEAANVEIQRMLKETSQCVDGKSEINFTRDDMAKMTVL
DSIINETLRLDISSSLLRQALQDYNLKVVSADRSYKIRKGDKLMTLFQLTHYDPEIYEQPDQFK
FDRFLNSDGSPKTDFYKKGKVVKFFLQPFGSGV
SKCPGRYWAVTELKQLVILLLLYFDFKLINPKQEIKMDQIRVAFGSLPPNIDPQVQFKRRVIV*

>CYP7 like Ciona savignyi one intron shared with human CYP7A1 ortholog seq 166
MLVSIIYTVLLLLLSSYAIWFIAKRKQKRGEPPIVPFWIPWL (1)
GSGLTYQKDPIAFLKKQTKKYGPIFTVFM (1) 
GGNYFTFVTDPFTHPLVAKEGKILNFRICTLPMIKHLFDYQRPAVEEPTMGGLFVKYLQ (0)
GQRLNVMIEKMMKCLQRIMLAGKYTNTDWSEVGLLHFVESIMCTASFSTL (2)
FGDYPKSSNKDIPAFEHQMFEDFRKFDEAFPYLVKGYPLNSFK (0)
GLEESRKNFWKHLASDELNKREMVSDLIAEYARICDAEKDRTLPKKLFSTAWAAQ (0) 
SNTMVSAFWTIYFVTKHGDAKAAAEVEIQRLLKETSQTLDGKTPINFNRDDMDKMTVL (1)
DSIINESLRLHVSASMLRQALQDYDLKVLSVGKTYRLRKGDRVMSLFELNHKDPEIYS
NPEVFKYDRFLNEDGSLKTDFYKNGKVVRFALQPFGSGL (2)
SKCPGRYWAVTELKQIVILLLLYFDFELINPNEEIKMNKNRVAFGSLPPTNDPKIRFPSTV*

>paired_scaffold_401 CYP7 like 68% to seq 166
121225 RKGESPLLFH 
       SLIILGSGLTYQKDPIAFLKKQTKKYGPIFT 121046
118749 QSNTMVSAFWTIYFVTKHGDAKAAAEVEIQRLLKETSQTLNGKTPINFNRNDMDKMTVLG 118570
118031 QITFVDSIINESLRLHVSASMLRQALQDYDLKVLSVGKTYRLRKGDRVMSLFELNHKDPE 117852
117851 IYSEPDVFKYDRFLNEDGSLKTDFYKNGKVVRFALQPFGSGL 117726
117553 SKCPGRYWAVTELKQIVILLLLYFDFELINPNEEIKMNKNRVAFGSLPPTNDPKI 117389

>SEQUENCE 167, 173, 174, 176
MLSVILAFFVLILATYFLWNSRKRQQRDGEPNIVPYSIPWL (?)
GSSLPFYKDPVKFLKECEKKYGSVFTLFM (?)
GGQYFTFVTDPFVQHIIVKEKKVLRFRNCLPPVLKHAFDYYPTAVDDEMMKLTHHYFQ (?)
GKPLGLLVKQMNTNIQKWILDGNQGDAGDKWETVGLLSLVQRVMGSASFVSL (?)
FGENPKASDTLDVQRQLLNFLTVFPSLLRGLPLSLIK (?)
RSREYRQDLWNSLTKSKVTKRKEIFPIIDEI VTLHEQNGENKLLPKRLLPVLVASQ (?)
VNTNPSSFWILYYLLKNSEALAAVKDEMEKVLQERSKNPGNNFVCFTKEEMETMMTL (?)
ESAINESLRLSISGGMLRKAATEYRLEIPDYGQTYTIRENDMVLVWTQVNQMDPE
IFEEPEVFKYDRFLNEDGSLKTNFYKDGKLVKYAFQPFGDGS (?)
SKCPGRYWAVLELKQLVVNMLLYYDIQLIDSQEIKMDKGRVGLGALNPTKDALMSIRRKPLC*

>Ciona savignyi ortholog to sequence 167
MLEIILSILVLLAVSFLIWNTRKRTAARGEPPIVPYSIPWL (?)
GCALPFYKDPLQFLRDCQRKYGPFFTVFM (?)
GGEYLTFVTDLSAQQTIAKEKTVLTFRKSLPPVLKQVFGYDSNSVDDEYVKVFPKHFQ (?)
GKALENLVSKMNKCLQNFIINGKYYNEEWMTVDLRTFVESIICPASFVSM (?)
FGEYSSSRSDKMTEVERNILKFTDLFPSLIRGYPISILK (?)
GATACRQYLWKVLSKQEISGRKNVSGVVEETISLHEQYGESDVLPKRLLSFLTATQ (?)
VNSVPSAIWLLYFLHKHPEANKVAREEIERVLKESKESTSKDVKIEVTNEDLKKMTVL (?) 
DSIINESFRLHLAPSMFRIATEDYFVEVAELGRTYKIRKNDKVLLWVQMNQMDPDIFE
QPEVFKFDRFLNKDGSRKRDFYKNGRLVKCAFQPFGTGL (2?)
SKCPGRYFAVNELKQLAITTLRFFDIELVNPQAEIKMDKNRIGLGALYPATDIKIRIRRKADC*

>SEQUENCE 180 39A1 LIKE
MKVPGTIGYPIIGDKSREFLSDPTKFITSRINEH
SSRIFQCRILNNPTIFVCSPQAVKKLLSKEMANDIEHGYKSILHRLYGDNIVFENDVTAN
ELHNCLRNCFQNMELYKETVNRLCE
PLLSSLHLRTEPVYIYPEFKRVMTRMCLELFLGIKTESCLDQTEEITHLATQHWHGIISV
PVSLKIPLAGESTYSLAVQAKEKL
LSIIRSILLGQNGQGDASVAAKLRHADFSNSKSLEQHLLLFVSAVIPKAFASLITSFIIV
MAGDDKEALRARARTDSKFLDRVLLEVERMWPPLFAGRRLVRRTTTLESYDLPKDYAIMY
VTHAAHRDPEVFPEPNSFNPDRWITCNAGQEDLVLCFGSGPRCCVGTDLIRRILRDVASR
LLQNYEWSLAPPNQDTSYKYLPVARPAVQPSIAFSSYLTNDEPFFDACG*

>paired_scaffold_186 ortholog of seq 180 70%
681879 MKVPGTIGYPVIGDKSMEFL 681820
681640 RNPSSFIQSRIEQHSSRIFQCRFLNNPTIFVCSAQGVKTLLSQ 681512
680673 DMIEDIEHGYKSILHRLYGDNIVFENGLDSIELHHCLTNCFKDVENYTETV 680521
       XXXXXXXXX
679207 NTLHRSAPVPVYSEFKKLMTRLCLKLFLDIEGEEASDEVTELATQHWH 679064
678445 GIISVPVFVKLPLAGQSTYSLAIQAKEKLLSLIRSLLQRQANSGX 678314
677760 ASVASKLRRADFPNGSTLEQHLLLFVSAVIPKAFASLITSFITALAGEDK 677611
676148 AEMRHRARNDAGYLDHVMLEVERMWPPLFAGRRLVKRHTNLLGYRLPK 676005
675441 DYAILYVSHAAHRDPDVFPEPDLFNPDRWDTX 675349
674535 NSGQEGLICCFGSGPRSCVGTSLIRRILRDVASRLLQDYDWSLSPPNQDTSYKYLPVARP 674356
674355 ARQPTLRFASCADDNDDHFYDACG 674284

>ci0100147340 seq 182 4 diffs blue = intron?
MLSCLQKASRRRSFDYFKVICIVRLQHAQ (1)
VQSKETAVKDFSEIPQPKSLPILGTFLDYTKFRGFNVANLSNLFKARFDELGPIYKETIIPGLSEPTVHCFSPQD
NEKLFRLESRLANRDPIEFLKVVRERIG VPSGLVNIQGEEWYQ LRRIVNEHFLTNSSVWSYSKDIHEVSE
DFVDYISRSLDSNNEVPNFALALNKWALEGAGVFSLDTRFGMFTGEVEEHLQEIVDNSNLMFQLMAELTV
APPFWKYFETKQIRALKQCQAAQVAAIKKQRDILLKRIKEKDPEKLTKLEKLMTKKDLPQGEVDI LVSDMLGG
GVDTTSNAAAFMLYIMAVNPDKQEILREEIKTALQSGKVDGKSLQRMKYLHACALEAQRMFPLTPGTSRR
LPVDVVLSGYNVPAGTRLLLLSNVAHHKNPEYFDQPESFLPERWLERKHKSKFAMIGSFGMGARACPGRK
FAMQEVHYLLISLLSRYKVEYHHKPIKPHFRLVTVPSEEPQFTFIPL*

>paired_scaffold_330 64% to ci0100147340 seq 182
1746363 KSLAEIPEPKSLPILGTILDYTKFNGFEIGNLAEFYKSRFEDLGPIFKENILP 1746521
1747454 GLTDMTVFCCSPEDNEIMFRTEGRLANRDPIDVLKFSREKLGIPSGLVNIQGEEWYT 1747624
1747948 LRKIANKHFLSNSSVWSYDKELLEVSNDFVEYISQNLDSNNE 1748073
1748780 VPNFSLALKKWALEGAGTFCLDTRF 1748854
1749456 GLFSEKTDENLQLIVDNSLMLLDLMTELIFGPPVWRYFETKQIK 1749587
        ALKECQAVQLKAIGKQRDKIMEKVKHKD 1750089
1750558 EVAILIMDMLGGGVDTTSNAAAFILYVLAINPEKQEILREEIINALQSDKVDGM 1750719
1750925 SLQRMKYLHACVLEAQRMFPMTVGISRRLLQDTVLSGYLVPK 1751050
1751429 GTKVTMLSNVTNGNNPAYFDDPETYLPERWLQKNPNTK 1751542
1753057 FAMTGSFGIGARACPGRKFAMQEVHYLLISVLSRFKLQYNYKPIKAKFRMGNVTSEEPQ 1753233
1753234 FTFISL 1753251

>paired_scaffold_129 similar to ci0100147340 seq 182 67%
287384 AQEKVIDNAMKSLAEIPKPKSLPILGTFLDYTKFKGFEVGNIAEFYKSRFEDLGPIFKEN 287205
287204 IFP 287196
286663 GLTDMTVYCSSPQDNETMFRTEGKLANRDPIAVLKFSREKLGIPTGLVNIQGEEWYT 286493
285764 LRKIVNKHFLSNSSVWSYDKELHEVSDDFVEYISQNLDSNNE 285639
284914 VPNFRLALNKWALEGAGTFCLDTRF 284840
284194 GMFSEKTDENLQLIVDNTALLFDLLTELNFAPPVWRYFETKKIK 284063
283676 ALKKCQAVQLEAISKQRDRMMEKVKHKN 283593
283264 EVAILISDMLGGGVDTTSNAAAFMLYVLAINPEKQEILREEIKNALQSDKVDG 283103
282921 SLQRMKYLHACVLEAQRMFPLTVGTTRRLLQDTILSGYLVPK 282796
282442 GTKVIMLSNVMNGQNPAYFEDPETYLPERWLQRNPNIK 282329
281024 FAMTGSFGIGARACPGRKFAMQEVHYLLISVLSRFKLQYNYKPIKAKFRMVTVTSEEPQ 280848
280847 FTFIPL 280830

>ci0100131379 Correct order of fragments 57% to seq 182 new seq
MIRKWQKRGFFTNFYQNAVTSNRRQHIE (1)
AKTASNINAAKTKPFSDIPEPKGLFGTFLDYTALRGFKTEELYSHYQKRHKKYGPIFKENVLP (1)
GLTSYDVFCSSAEANKAMFRLEDKYSNRGPLDFIQIARENAGVSSGLLDPSWEEWFK (0)
FRKVANRYLLSNASVWSYSSALHEVCDDFVDYIADNLDENNE (0)
FPNFALALNKWALEGAGVFCLDTRFG
RNFSGQLDDNLKVIMDQVIESFTLMTDLFLKPPLYKYITTRKMK (0)
ALTSVVKKQSEAIQKQRVKIMNQMKEKTPGSKTKLKTVLENSDLNEE (2)
EIASLVSDMLGGGVDTTSNAAAFMLYIMAVNPDKQEILREEIKTALQSGKVDGR (0)
SVQTMKYLHACVQETQRLFPFSFGVARMFEKDIEVCGYNVPA (1) 
GTRILLTGNVVNNKDAKYFKDPETFLPERWLERRNSESK (2) 
FATTGAFGRGVRACPGRKFAMQEVHYLLISLLSRYKVEYHHKPIKPHFRLLTVPSEKPQFTFIPLS*

>SEQUENCE 184, 212
MNDVTITWWIVISALALLAVKMVDDNSKRKKRSQNFRSPPGPKGYPILGTFPSLIRKPA
HIVFGEWSKKYGNVYQFTLGAGTSVVLGDTATVRLAMLGKSPDFDSRCQWRYCFN
KMFRGLSVLPYSFKWVNTRRRTLTILRNIGMGRSLMEGKIHEETQCLLKCMLDSSQDGKV
NFEPRDMLRRAPMNTIWSILFGKRYDYSNPYLLYVMNTVVWLVDFLAKASVFDVLPSSLR
HLAGFSLETFHNKLETLHAFMTERVDDHAKSYESDVTRDFVDAWIKAKKEDKLKIQDSED
SNDISSLPDVIQDLIVAGAHSTGIVFNWAVLLLVLNKEEQDL
CYREIMDNVGPTREISMSDRSKLRRVDAVMHECMRIGTVLPLVAHATARDTTIHGYNIKAGTE
VIYNNWFINRNEKVFPDAEKFNPDRWIKKDGSFRKELLQHFFPFGIGKRSCAGESLA
RANLFIVVATLLQRVELLPPDETASPEMLPEVKHQFGLLAECKPFLMRVRCRPTFVPSQ*

>paired_scaffold_315 85% to seq 184 CYP2 clan
1445963 MGDVTITWWIFLSALALLAVKMVD--TKSKKKNSNFRHPPGPKGYPILGTFPSLLREPAH 1445790
1445789 VVFG 1445778
        EWSKKYGNVYQFTLGGGRSIVLGDTATVRLAMLGKTPDFDSRCQWR 1445189
1445188 YCFNKMFR 1445165
1444620 GLSVLPYSRKWVNTRRRTLTILRNIGMGRSSMEGKIHEESKC 1444495
1444173 QILMECMLEASNDGAVNFEPRDMMRRAPMNTIWSIIFGKR 1444054
1443528 RYDYGNPYLLYVMNTVVWLVDFLAKASVFDVLPSSLRHMAGFSLETFHRKLDTLHGFMTD 1443349
1443348 RVAEHKKSYESDVTRDFVDAWME 1443280
1442899 ASKREKVNKTALDDSDEISCLPDVIQDIIIAGAHSTGVVFNWAILLLVLNKKEQDL
        CYREIMENVGSTREISMSDRQRLRRVDAVVH 1442540
1442539 ECMRIGTVLPLVAHATARDTEIGGYDIKAGTE 1442444
1442197 GFPVIYNNWYINRNEKVFPEPDKFNPDRWINEKGGFRKELLQHFFPFGI 
1442008 IPGKRSCAGESLARANLFIVVATLLQHLELLMPG 1441886
1441599 PADETAAANMQPEVKHKFGLIAECKPFLMRVKKRPT 1441492

>SEQUENCE 186, 187 CYP20 ortholog 29% to fugu CYP20 CYS = HIS
MLEVHVTITAVAVLAVILLLFLIGLLFYIYPSSN (?)
KETTVPGPAVSDIR (2)
DGNRSDIDRAGSLHQYLCNLHAEHGDLASFHFDKQLIISIADSKYFAPRSQVFQLP (1)
DFLYEFWEPMIGKESFVFSQGVDVFKRRDNIDFKVDKKVKEFYAKIF (0)
RLLAIELEEKWRKQPEDQHIPLTQHLKGLATKAVCQMMFGESFADDERVIKLYKAWDK (0)
CWNEVMAASKMEPTKSESDEKK (1)
SIKELRSLMEKAIEERWKKSKHLDGITLLDGLLASSNSSVSPLSSEQLLCDVITMFTMGVQVTNA (1)
VLCWSVYFIAAHDDVQTKLRNKLNEVESLDIMPQSLNDVK (2) 
YMQSILKESIRCSGVFSYSGRVEDIDITVGDHLLPKG (0)
TPIIEALMFPMQSDAFESPDM (2)
FNPKNFSEATKSVNPLTYIPFAFGGSRHKPQDS (2)
LVYLLVSITVASLFRNLRISLAGKEVDPQSANYESFVALPSTEEVWVTVGGKLDVETVGENN*

>ciona savignyi sequence for CYP20 ortholog 72% to intestinalis seq 186
MLEVHVSITAVAILVLIILLFLIGLLVYIYPVI (2)
KETTVPGPDVSDAK (2) 
DGNKSDIERAGSLHAYLCNLHADFKHEGAVSFHHGKQLVISISNPKFFAPRSQVFQLP (1) 
SFLYEYWRPLIGEESFLFAKDIDVFRRRDYLDFKVNRNVKEFYAKNF (0)
LAVELETKWEKQPEDQHIPLTQHLKGLATKAVCQMMFGDNYADGTKVLELYRAWDK (0) 
CWNEVMAASKMEPTKSESVEKK (1) 
SISELRKLMENAIEERWNKSKSLDGITLLDGLLSSQTSDSKDQLTSEQLLCDVITMFTMGVQITTA (1) 
VLCWSCYFVASNKEVQRKLKNQLNESKSLDIMPETLMDME (2) 
YMQGILKESVRCSGMFSFAGRVEDIDITVQNHLIPKG (0)
TPILEAIMTEMRSEAFSCPDV (2)
FNPKNFTKDNNHLNPLTFIPFAFGGSRHKPEES (2)
LVYLLAAVTVSSLFRNLQVSLAGKEIEPFAANFESFVAMPSTEEIWVTVSKEKEVENVG*

>SEQUENCE 192, 210 CYP11 like sequence revised added exon 2
MGTFGIGWRLRSQWRVAAQNVKHQITQALSLG
ETKTSVREVQSYSE IPGPRGLPFLGSVLEYTAIGKFSPKQFHKALRSRHG (2)
RYGKIFRERLLQDEFVYVSDPKDIRVLVQNEGKAPHLEAFW (2?)
GNDWLRSRTSINGGISCPRALKTYRAGQQVIAEDFVELIRNKRDATTGEVPQF
EQWLNRWALESIAIAMLNTR
LGLFDAGQSKKNRPAEKLRECMDIFFDYGARLTFGVPMWKIFNTREWKTFCQSQVDEFENAGYFVNK (2)
KLKQLKEERGSQEASKQPPSCLLEHLISKNSE
VSQMEATTICLELLAGGIDTTATAAVFTLFELSRNKEHQSKLRELANQETDEG
KNNSQFSRYLKSCIWESMRLHPLTYANMRKTETDLVLSGYKVPAGTTVRYTSHLMNLKDE
KYFPQADEFLPERWLDRKSENRCKEQFTFTPFGHGARQCPGRRIAEQELDLLFRQLIRNF
NINYEHEDIKTKVRLFNAADRPARFSFTPISKPITEIQQETN*

>Cs192like revised added exon 2
MLRTYAFGLEAIHRWGTWFQTKLGATEKFRETSTKVGKFTAIPGPMGLPLLGSVLEYTPLGRFSPRN
FNKAIWSRHQ (2) 
RYGNIFRERILLDEFIYVSDPKDIRVLVQNEGRAPCRPHLEPVVE (1)
SDVWLRSRTSVNGAIACPKALKQYRFNQQIIAKDF 
IELIRNKRDSKSGEVESFEQWLNRWALESIAVALLNNRYLV (2)
LGLLNADPTNPNSPAEKLRLCMDIFFDYGARLTFGLPSWKLFNTRQWKIFCQSQVDEFENAGHFVTK (2)
KLNQINNEQRLYKVPPSCLLEHLISNNNNLSQIEATTICLEMLAGGIDTTATAAVFTLYELARNI
KHQETLRLLAKKEKCQHNSEFTKYLKACIWESMRLHPLTYANM
RKTETDLILSGYHVPAGSVVRYTSHLINLKNEKYFPKATEFLPERWLDRKSEYR (2)
CTEQFTFTPFGHGARQCPGRRVAEQELNLLLREIIVNFNVGY
YHDDIKTKVRLFNAADRPARFSFQPI*

>paired_scaffold_167 Cs192 like 100% MATCH
581738 EATTICLEMLAGGIDTTATAAVFTLYELARNIKHQETLRLLAKKEKCQHNSEFT 581577
581576 KYLKACIWESMRLHPLTYANMRKTETDLILSGYHVPAGSVVRYTSHLINLKNEKYFPKA 581400
581399 TEFLPERWLDRKSEY
RCTEQFTFTPFGHGARQCPGR 581220
581219 RVAEQELNLLLREIIVNFNV 581160

>sequence 5, 12, 194 ci0100141208 complete possible fs at YES
MEYCFWYFTIAVFSIYCLLSWQRRPKNFPPGPRGFPVLGIVPYFGKYPEKVLSKFADQYESRVMSFRMGR
MDWIVLNDIETITQ (0)
ALVKQGEDFAGRPKSPLFDMLTNGCGIVFVDGEKWKAQRKFGIMTLRS (2)
LGMGRGSLDDIMNYEFEVLCHFLSSGGSQENVM (0)
TLFRKAVTNVTCSVLMGKRYDYDDPVLNNITEHIR (2) 
PSKTVVMYSPILRFIPPFRWTYKKIVNSIEQVTG (1)
VIRNISEMHKQDFDSSTKRDFVDHFIAEMNKEGEYFTVNYVIM (1)
YLLVAAINTTTAAFGWCVVSLLRHPECQEKIWQEVDKVI (1)
GNETPSCKHQENMPYMRSFIQEIHRHRSIATLNLPHRTVRSVRLCGYDIPKDTP (0)
VLTNIWRVHNNQNVWKNPHEFRPDRHLDSNGNFVSSNNVIPFACGYRRCL
GEQIAKAEIFLFIVNIVKRFHLVVDQVTGPPTLEKDPKGAVNTPAPFRFSVRARKN*

>New 66% to seq 5 paired_scaffold_377 CYP2 clan
2898483 LITVLLILIYLWLSWYRKPKNFPPGPRGLPILGIVPYLGKHPEKVLSKLADKYNPSVMSF 2898304
2898303 RMGRMDWIILNDIEAINQ 2898250
2897850 ALVKQGEDFSGRPKSPLFDMLTNGCGLVFVD-GERWKEQRKFGIMALRX 2897710
        missing exons 3-5
2894967 VIQEITESHRRNFDQNNKKDFVDYFIDEXXXXXXXXXXXXXXX 2894884
2894494 DEQLCHYTRDLLVAGINTTTXX 2894435
2894427 LGWGIVCLLLYRGHQTKIAKEIQQVMGKX 2894344
2893636 GDETPTLHHQDKMPHLRSFIQEVHRCRSIATLNLPHRTTRDVRLNGYDIPKNTP 2893475
2893017 VLTNIWRVHNDPRVWKDPHKFNPGRHLDSNGKFVHSAHIIPFACGY 2892880
2892879 RRCLGERIARAEMFIFFAKIIGRFELNKDDGEPLTLDNDPAGAVNSPKPFRFTVRRR 2892709

>sequence 196, 32 ci0100152256 complete changed middle exon added N-term exon
MFENVGYYFSW (2)
SLLTSSHVISAFIGVFTLLMFVYYYWWKFPHPRYPPGVRGIPVLGATPFFGKYAQEKLAIWSRDTYGPV
MSARFGQDEAVVLNDYESVHEALVKNLQYFNSRPFVYVAAQVTKGYGIAFADDHQNCMEVRSFSLSALRG
LGIGRRTMETRVSEIAQDLVRTLEDLNEKPTDLKMAIGSTVANVICSVVFGKTYEADDQEYEYAVRCSFDC (2)
YGDPENSEYLSFLFFYPALRHIQPFKRAMEKFVLVQAGLEG (1)
DFCKEIEKHKKHLDENEPADYIDAFLVEMKKHSAQ
DSWFHETELRHC LVDFFIAGTETSTNAILWSLLALIHYPNIQEEIHQELIDNIGEQVVPSIDHRDKLPLF
RAFVQEIFRFKTLLPLSIQHRASHDVEIGGYVIPKGTKVFPNLHAVHHDPNTWENPSDFNIYRHVDKHGK
FIPSKKVIPFGIGARSCLGEKLANVEVFLFLANIIKRFEILPDPESKELPDFKDGINGFIYVPYRYKVVT
KPRIINE*

>paired_scaffold_401 61% to seq 196 CYP2 clan
77517 DFNQKEIEEHEKNLDENEPADFIDAFLIEMKKHSPDTTWFH 77395
77027 QLLHCLCDMFVAGTETSTNTILWSMLALIHFPEVQQNIYKEIHKVLGE 76884
76749 GEHVIPSADHGDKLPLLRAFIQELYRYRTIVPMSLQHRASEDVEIGGYLITKDTTVNTMI 76570
76307 VSTNIDAIHQDEKLFKNPKEFNIYRHIEGGKFVFNKKIIPFGIGARSCLGEKLAKLEIF 76131
76130 IFLANIIRRFEILPDPKSKCLPPFKDGISGLLYVPFRFKMVAKPRLVN 75987

>sequence 198, 61 complete seq stop upstream in ESTs
MSSKEDVPPKSLPGPWGFPFVGSAFQLGLHKDKTLYEWTELYGPVYALNLGTRLAVALHGY
SAIQEALVDKASSCSGRICAPIIDAMVNQQGLFFVDHDEHWKKWREIAHSGLSNSWKTTQ
SGILTESERLKKYLSEADGKPINIEETLQITMANAVFQTCLSRRFDLDDTEAANFLVDLY
ATNRDPKFGKILFLTFLRFLPSYKSYLIEGDTTRRKKVRKVLRGFIEERLEDDDQSRDLVS
LFFKAGPTPSLENKTLVAGLIQDMLFAGSITTSSQLLWVILFVLKNPACMHKMQNELDAVL
GDCENPTKSAISDKYDELNYTRAVVQEAMRLRCIAPTSMFHKATEDTTIAGYFVAKDTVI
MPNLYSVHHDPITWAPDPESFRPERHLDSDGSFVNSDHVIPFSVGWRRCVGEKIARTEIF
AHLVTIFKHFEVALHPDDVGVDMEGVPGPILRPVSYRTQFTSR*

>paired_scaffold_66 76% to seq 198 CYP2 clan
5610751 PPKSLPGPWGFPFVGSAFQLGLQKEKTLYEWTEYYGPVYGLNLGSKAVVAL 5610903
5611297 YKAIHEALVTKASVCSGRPVSPVFDAMVNKQGLFFVDYDDFWTKWRDIAHSGL 5611455
5613843 TPSLANKTLLVALIQDMLFAGSITTSSEFLWVIIFVLKNPNCMHKLQAELDTVL 5614004
5614570 EELNYTRAVVQEVIRLRPIAPTTMFHKATEDTTIQGFFVAKDTVIMPNIYSVHHDPVTWA 5614749
5614750 PDPDDFRPERHLDDDGLFVNSDHVIPFSVGWRRCVGEKIARTEIFAHLVTTFKHFEVDLH 5614929
5614930 PDDVGIDLAGIPGPIIRPARFRVQFSAR 5615013

>sequence 202, 195, 24 ci0100132628 N-term longer complete
MDVESVGCLWGLYSSWVTTCLMGLLTLVILVYYYWWKLPHPRYPPGVRGIPVMGATPFFGKYP
QETLARWSRDKYGPIMSARFGQDEAVVLNDFETMHE (?)
AFVKNLQYFNSRPTFQIVEQYTNGYGFGFADGHKKFLEVRTFSQSALRG (?)
LGIGKRTMEARVSEVAQDLVRTLEDLDQKATDVK (0)
MVIGTVVANVISSVAFGKTYDPGDAEFEHLVQCSFN (?)
CYGDPDNSEHVNFLIFYPAMRHIQPFKRSLEKFYVVQNGIL (?)
DFCRKEILEHKKNLDENEPGDYIDAFLVEMKKHSPQDSWFH (?)
EKQLQHAVVDLFIAGTETSTNTILWALLALIHYPETQEKLYQELLENIG (?)
EQVLPSIDQRDKLPLFRAFIQEIFRFKTLVPLSVTHRASDDVEIGGYVIPKGTK (0)
VFPNIHAVHHDPNIWKNPSEFNIYRHIDKDGKFIPSKKVIPFGIGCRSCLG
EKLARIEIFLFLANIIKRFKVLPDPKSQDLPEFKDGVNSIIYVPYRYKVVAKPRIVNE*

>paired_scaffold_2 63% to seq 202 CYP2 clan
562173 GFLTLGILVYRYWWKLPHPRYPPGLRGVPILGVLPFIGRSAHVKFTKWSRDKFGPIMMVR 562352
562353 MGRDDVIILSTYDVIHE 562403
562501 KAFVKNMAAFQSRPYMHILAHVSGGYGLILPPFHSNFAQVRQFSLNTLRG 562650
565003 VSLQLLEIVYLYRYCLPFIEFNQKEVIEHQMKLGENEPADFIDAFLHEMKKHSPDDSWFH
       YKERQLVQSLVDLFLAGIETTTTFLLWSMITLLHYPDIQALLYQELLQETG 565428
565902 IGEQAFPSIGHREKLPLLEAFIQEIFRFKTLLLLGAQHLTTKDVEINGHFIPKST 566066
566131 VMPNIYGCHSDPNIWKNPSKFDIYRHINKDGKFVPSKKVIPFGIGARACIGEKLARIE 566304
566305 TYVFLANIVKRFEILSDPESKNLPPLDDGVIGLVQAPLPFKVVLKPRI 566448

>New 70% to seq 202 paired_scaffold_278 aa401-476 region last exon CYP2 clan
131488 LFTLGLAVYYFWWKFPHPRYPPGVRGIPILGALPKLGKDPHKVLAKWAAEKYGPIMSVRF 131667
131668 RKTDAVVLSDFDSI 131709
133191 QAFVTNAASFQSRPSVPIIDQMTGGYGFGFADGHAKCLAVRKFTVSALRRF 133343
138022 LMQYLFEGTTTEPISLEHRHKLPLLNAFIEECWRFYTIAPMALIHKTNQCVGIGKYVIP 138198
138199 ANTMV 138213
138782 YSICDSHQIFANLHAVHFDKNIWRHPTKFDFSRHIDKDGKFISSRKVIPFGIGARSCIGEKMARMEIF 138985
138986 LFVANIIKAFEILPDPLKKPHSPLENMAGSVLHVPEKFKVVFKPRK 139123

>New 68% to seq 202 paired_scaffold_278 aa401-476 region last exon CYP2 clan
143357 VGSTEQVSLEHRHKLPLLNAFIEECWRFYTIAPMALIHKTNQCVGLGKYVIPANTMV 143527
142378 ENLVFCLFDLYSAGVDVTTNALLWSISTLMNSPVIQESLHQEMSAHVG 142521
144042 IFANLHAVHFDKNIWRDPTKFDFSRHIDKNGKFISSRKVIPFGIGARSCIGEKMARMEIF 144221
144222 LFVANIIKAFEILPDPLKKPHSPLENMAGS 144311
144312 VLHVPEKFKVVFKPRK 144359

>paired_scaffold_124 65% to seq 202 CYP2 clan
1112775 FVFYHYWWKFPHPQFPPGVRGIPVMGALPYLGKSAHTVMAKWSRDKFGPIMSVRFGSDDV 1112596
1112595 VVLSTYDVIHE 1112563
1112065 QAFVKNMAAFQSRPHILLVNALTEGYGLAFPPYSPEFGQVRQFSLSTLRG 1111916
1109071 EKQLLHSISDLFIAGTETTTTTLLWCMIAMLHYPDLQARIYNELMQEIG
1108397 GEQVLPRVDHRDKLPLFRAFIQEILRYKTTVILGIQHLTTKDVDIKGYRIPKDT 1108236
1107885 VTANIDACHSDPNVWKDPSKFDIHRHIDKDGKFILSRKVIPFGIGARSCLGEKLARIEIY 1107706
1107705 LFLANIIKRFEILPDPKSEHLPPFDDGLSGLIHSPFSYKVSLKPRI 1107568

>paired_scaffold_401 65% to seq 202 CYP2 clan
88505 MFSILIEWLNSWSGVMSVLIGALTLCMVVYRYWWKLPHPRFPPGVRGIPILGATPFFGRY 88326
88325 PQETIAKWSREKYGPVMSVRFGQQDAVVLNDYESIHEVSI 88206
87577 QALVKNINVFHTRPSCSIVECFTEGYGFGFADGHKKYLEVRNFSLAALRG 87428

>paired_scaffold_401 2 PARTS OF GENE OUT OF ORDER
92287 DFNRKEIEEHEKNLDENEPADFIDAFLIEMKKHSPDTSWFH 92165
91829 QLLHCLGDMFIAGTETSTNTILWAMLALIHFPKVQQDLYKEIHKVLGE 91686
      GEHVIPSADHGEKLPLLRAFIQELYRYRTIVPMSLQHRASEDVEIGGHLITKDTT 91384
90780 VSPNIDAVHQDEKLFKNPKEFNIYRHIEGGKFFFNKKIIPFGIGARSCLGEKLARLEVF 90604
90603 LFLANIIRRFEILPDPKSKCLPPFKDGISGLLYVPFRFKMVAKPRLVN 90460

>Ci204 204, 142, 127
MFVHEKMFLQPSSRLLWIQQLS (0)
KIHKLN 
KSLKYMPEPKGLPFIGTLLDYTFLKKFDKTNLASHFIGRHQLLGPIYKEYVFPAFSK
PQVFSTLPDDNEVLLRSEEKISHREPVEFVVSARKLLGWSIGLPFDVGEDWYK
LRKVVNQHFLKNSIVWSHSKQQHEVAEEFVDYIGQNLDENNE
VPHFQDLLQKWALESTAVFCFGVRL
GVFDKSVDDDLNIIIDTNRKKFELIMKGIFSAPLWKFYNTKLMK (0)
KFNKVQIEQLHAIRRQQIKAFTEFPVDQKLIESLQFYICS
SEIEVLIIDLLSGGVDTTSNSAVFVLYLLSMNPDKQEVLRKEILEALKTGKTDGK
DLMRMSYLHACVLESQRLLPFSIGTNRRYSKDIV 203
LSGYKIPAGTVVTMTSNIINNCSEEYFDDPNSFKPERWIQVDNKAKDRMRFAMAGSFG 377
HGTRACPGRKFALQEMKYLLISVLSRYKVEYHYKPMKINFLLVSTPSQPPKFTFKQL* 551

>Cs204like
MGNHQITSRPPSSYRSYCS (0?)
MTDFTSL
KCLKEMPEPKGHPIFGTLLDYTSYGNFNFKNLAEHFRKRHQNLGSIYKEYIFPSKFL
PYVQCTSAEDNRVLFQAEERIAYREAVESMVVSRRRMGLSIGLPFSQGEEWYR (0)
MRQVVNRHFLKNSTVWSHAQEQYKVAEDLVKYINENLDHNKE
VLHLNDALQKWAFESTGVFCFGMRL
GIFDTDADEDLDIILETNRQKFLLIAETMFSPPLWKLFNTRLMK (0)
KLISVQFNQLAAIQRQREKVMNNFSPSPELARVTRLFKNSKLTK (2)
SEVDMLIMDLLAGGIDTTSNAAIFMLYTLAINPDKQELLRKEILEVLESGKTDGK (0)
DLMKMTYLHACILETQRMFPFSIGTNRRYSKDVVLSGYRIPA (1)
GTTVILTSNLVNGRNPSMFPNPDTFLPERWLNNQDNEAKDKMR
RFAMAGSFGHGTRACPGRKFALQEIKYLLISLLSRYKVEYHYAPMTINFLLVSTPSEKPRFTFNPI*

>paired_scaffold_129 Cs204 like
277323 TSLKCLKEMPEPKGHPIFGTLLDYTSYGNFNFKNLAEHFRKRHQTLGSIYKEYIFPX 277156
276471 XTSPYVQCTSAEDNRVLFQAEERIAYREAVESMVVSRRRMGLSIGLPFSQGEEWYR  276307
275741 MRQVVNRHFLKNSTVWSHAQEQYKVAEDLVKYINENLDHNKEV 275613
273237 QSEVDMLIMDLLAGGIDTTSNAAIFMLYTLAINPDKQELLRKEILEVLESGKTDGK 273070
272535 DLMKMTYLHACILETQRMFPFSIGTNRRYSKDVVLSGYRIPA 272410
272119 GTTVILTSNLVNGRNPSMFPNPDTFLPERWL 
271768 RFAMAGSFGHGTRACPGRKFALQEIKYLLISLLSRYKVEYHYAPMTINFLLVSTPSEKPR 271589
271588 FTFNPI 271571

>ci0100130435 seq 48, 56, 230 seq 205 ADDED 10AA AT JOINT complete
MIPSYLIFFEFPIVATVLVLVCLFYWYRRPKNSPPGPRGIPLLGVLPLLGRYPERTLHEWSKKYG
PVMSVRMGRQDWLVLGDHGTIH QALVKQSHTFSGRPHVPTLHRITQGRGLAFVDHGPAWKAQRKFGAVTL
RGFGVGKKGMEDKIVEEVHYLNDAIRQHEGKPFNIL AILSNAVSNNICSVILGRRFDYNDKDFQDIVHKM
TRDFTDPKSTALTNASVFAPVLDNFPPFAQNTNQRVNDAMVVREILLKYVKEHKATFDREDARDFIDAFLKEAELSKEPS
FNDTQLLHYLHDLFLGGTETTTSTLRWALLCLLHYPQTQTKLREEINEVIGRHLIVKVSYSCKVDMPYTC
AFIQELYRYRTLLPLSLTHKTNEKAMLGEYTIPKSTIVSVNLWAVHNDPNTWEEPSMFKPERHLDESGHF
VQSKHVIPFSVGPRHCVGEQLARMEVFIYLVSMVQKFEFLPDPNANELPDIEKGSNGPAFVPQMFDIVAK
VI*

>SEQUENCE 208
QKMSWTHGPIMGVKLLRKFTVVLSS
VEAMQNALVKQGHAYAGRPDSIMFQAFTGGR (?)
GVLAIDGPRWKENRRFGTVTLRKLGAGRA
IMEERIVDEAMYFCEALQIIQKKSFNITKLITHSMANVICGLAFGERFNYDDKRFISTAR
ALVDWFNDNSNVFMLLVFAPFLRHFWPFTNAVKSLKSAREKLYTMLIPIIDRHRETVDDY
DYRDFIEAYLQKISQTRDDPTSFYSDKELLGNVRDLFVGGTDTTVNTLRWTIIYLINR
PEVQVQKKKEIDRVIGPDARPKMSHLDDMHYTN (?)
AFMHEMLRIRTPGPLGLAHRTTEPTKLMGYDIPEV 
FITNFWAVHHDPKLWDQPEEFIPERFLDKDGKFVKNANLVPFSM (1)
GSRYCMGETVARMEFFLFVVTMLQRFHLLLPTDDSEKITEECLR (2)
SGQFLIPPSHNMRVKERFTREQSIFEYNKQMWE*

>paired_scaffold_65 79% to seq 208 CYP2 clan
186375 VSAQVTWAVLAAILIPYVIWQWVKRPKNFPPGPIGLPVLGIAGSMTNRAE 186226
185646 MHKTKKRNFSQKIGWKYGPIVGVKLLRKFTVVLNSVESINEALVKQGHIFSGRPDSIMFQ 185467
185466 AFTGGRGGVI 185437
182664 SDAELLGNVRDLFVGGTDTTVSTLRWSIVYLINRPQVQKKV 182542
182359 QKEIDKVIGPDARPKMTHLDDMPYTN 182238
182138 AFMTEMLRIRIPGPLGLAHRAMEATKLMGYDIPK 182010
181316 DTQVITNFWAVHHDPKIWGDPEEFRPERFLDADGKFVKPTHLVAFSM 181173
180909 GSRYCMGESIARMEFFIFVVVMLQRYHLL 180823