ࡱ > ] _ Z [ \ bjbjв 4r к_к_ ` ` ` ` ` t t t t P \ t i $ W ~ ` ` ` # ` ` `rUP 9 0 i ` ( i X & :
White rot FASTA file July 31, 2001 167 sequences, 126 with unique heme signatures Gene models are being determined. 47 scaffolds with 103 sequences have been analyzed in detail. 92 complete P450 genes are assembled now with all intron-exon boundaries tentatively identified. Three of these genes were split at the ends of adjacent scaffolds. Seven pseudogene fragments have been identified and two P450s run off the ends of scaffolds without continuations identified on other scaffolds yet. (x) indicates intron phase. ex follwed by a number indicates exon number THE SCAFFOLD NUMBERS ARE NOT CYP NAMES!!. ONLY CYP51, CYP61 AND CYP63 ARE NAMED NOW. >Scaffold_44 CYP51 gene model complete all boundaries checked 93217 MSLSQYGPIAGLVGQAYDALASMSTSRLVLFLLINIPILSV (0) LPKDKSLPPVVWHWFPWFGSAAAYGEDPIKFFFDCKEK (0) 92907 YGNVFTFILMGRKVTVALTPAGNNFIMGGKHTTFSAEEVYG (0) GLTTPVFGKDVVYDCPNELLMEQKKFVKFGLSTENFRQYVGMIEEEVLQFMRNDASFK IYQMNDINEWGAFDVLKVMSEITILTASRTLQGKEVRANITKDYAQVYNDLDGGFTPLHF MFPNLPLESYRKRDAAHKKISDFYISIIRKRRENPGQ (0) EEHDMIAALMNQKYRVGRPLKDHEIAHIMIALLMAGQHTSSATGSWALLHIADRPDVA (2) EALYEEQVKHFRQSDGSWRTPEYEELKELPVLDSVIRETLRIHPPIH SIMRAVREDVVVPPTLAAPSEDGRYVIPKGHVVLSSAAISQVDPMLWKNANDWDPSRWSD PEGVAAQAYKQYDDAEGAKVDFGFGLVSKGTDSPYQPFGAGRHRCIGEQFAYLQLGTIISTFVR HVEMRLPETGVPPPNYHVRSHTRILRLVCADVASADDDHTPEGASQHSLQAA* 91272 >Scaffold_75 CYP61 49% to 61A3 gene model complete all boundaries checked may be a small intron after TRKAL compared to other CYP61s 38607 MASSQAAFPSTLSDSSRHSTDSPAFIGLLPTGSWF 38711 ex1 38712 YTTAAILLSLLVIEQSVYRYKKRHLPGDKWTIPLIGKF ex 1 ADSMKPTMEGYMKQWNSGALSAISVFNV (2) 38909 ex1 38970 RFIVMASTTEYARKILNSPTYAEPCLVHSAKQIILPDNW (2) 39086 ex2 39142 VFLTGKEHVEYRRGLNLLFTRKAL GYVLQYCLVLYAMLTHRR SLYLGIQDVITRKHFAKW 39321 39322 LADAAKDPSAKPIMMTARELNMETSLRVFCGNHIPEHGAKEISDKYWMITVALELVNFPL 39501 39502 AIPGTKVYNAIQARKAAMKWLELAARKSKESVAAGNPPECMLEEWVTILNDPAYKGR 39672 39673 REFSDHEMAMVVFSFLFASQDAMSSGLIYGFQHLADHPEVLAKVREEQER 39822 39823 VRGGDYEKPLTLEMMDEMPYLRAMVKETLRVKPPVTMVPYKTTKAFPISQDYTVPSGSMV 40002 40003 IPSFYNSLHDPAVFPDPDRFMPERWLDPNGSANTNPRNYLVFGSGPHKCIGLEYAMMNIA 40182 40183 LVLANAAVLMNWEHELTPQSDKVQIIATLFPQDGCKLKFSPRQHA* 40320 >Scaffold_86a 6 different sequences 53% identical to 86f gene model complete all boundaries checked 60378 MLNLSLDSSSVLSLVWTASPWLLLSWILYTVLMAVYNLHFHPLAKFPGPKMAAASEWWLA 60557 60558 YVEVIKQESLSKKLWELHEQY (1) 60626 ex1 60672 GDVVRFGPNQ (0) ex2 60760 LHFSKPAAYNEIYNVKNRWDRDMKLYHIFADEVSTLTIPDYARAKKRRDLTTFLFSRKN 60936 60937 IVNMQHLVQQC (0) 60969 ex3 61021 LDTVCENIDKHIKEGKPVSIFKAFRCAAADVICTMCFARSMNATSEPGFNAQ 61176 61177 VVTAIHAAFPVIMVFKHFPLLQTLSRMVPPLLLSSLRPELNGLMKMRK (0) 61320 ex4 61378 MLTDQVKEVKAHPEILKESQQVTIYHELLKDPKNIPSDTSLRDEAVLYVTAGMDTSSDTLT 61560 61561 LATINVLSRPDVHARLMHELVEAWPHLEDAPPRYEQLEKLPYL (0) 61689 ex5 61749 TAVLKESLRLSHGVVQPMTRVVPREGAYISGHFIPGG (0) 61859 ex6 61913 SIVGMSSIFVHWNEEIFADARAFKPERWLDPEADLDPWLVAFSKGPRSCLGV (2) 62071 ex7 62137 NLGWCELYMSIAAIFRRYELKLNGIG (2) 62202 ex8 62257 PSDLVWRDVYLPLHIGPDLTVIAKRRTV* 62349 ex9 >Scaffold_86b 6 different sequences 498 aa gene model complete all boundaries checked 64671 MDDIVLVLLVACAVALYARTKRTRYRLPPGPKGLPILGNTYDIPAKYEWLAYEKWSRDF (1) GSDIICLKFVGTPVIVLNSIQAINDLLEKRSSIYSDR (2) VGLDRNFGFVPYGDVWREHRHLFHQYFRLDMVPKYHDRMLKHSKDLLQRLLVSPDRLMEHLRF (2) SVAGASILNISYGIDVQPENDHYIAVADEAIHALAVTGNAGSYL (1) VRYIPAWVPGAKFKRDAANWWEKTRLMIDEPFNYAKQRM (0) 65809 AQGKGMDCVTAVMLSAIGEDQDREHQELLIKQVLSVSYI 65925 (1) GGADT (0) 66052 TVSALATFVLAMMQNPKMQRIAQADIDRVVGGERLPSVEDRDSLPYVTAIVKEALR 66222 (2) WRPVIPL (1) AVPHRVTVDDEYKGYHIPAGSIIVGNVWY (2) 66487 AVLHDETRYPNPDVFDPTRFLTSDGQLDNNAPDPAEACFGFGRRICAGRYFALDAVWLSVACI 66666 LATFDIAKPLDENGNPIEPSGEYTTGLLS (2) HPVPFKVSFKPRSAAAEALVREPISRDL* 66903 >Scaffold_86c 6 different sequences 497 aa gene model complete all boundaries checked 67683 MDYLGISILLVAALALHYFLRKKRYRLPPGPKGLPILGNALDIPAKHEWLAYAKWGQEC 67859 (1) 67912 GSDIIYLNLAGTPVVVLNSAKAAKDLLEKRSSIYSDR 68022 (2) 68152 IGLGRNFGLKPYGDTWREHRRLVHQHFRTENVPRYHEFTSKQ IGRLLLHLLEDPSNFVRHLRI 68340 (2) MAGASILRICYGIDVQPDNDHYLSVADEAIESIAATGNAGSYL 68523 (1) VRYLPSWAPGAQFKRDAAKWKEKVDRMIAEPFDYAKRYMAS (0) 68826 TEGAATDYIAGLLLSAMDPGRDKTQQEIAIRDSLWAAYV 68939 (1) GGADT (0) SVSALATFTLAMVLYPDVQQTAQAELDRVLGKDTLPTIEDRDSLPYVTAVVKETLR (2) WHPVTPL (1) AVPHKVTTDDEYRGYHIPARSIVVGNVW (2) 69503 AILHDPDRYPNPESFEPSRYLTSDGLLDPAAPDPTEACFGFGRRICPGRHLAYDTIWTGIASIL 69688 SSFDISPPLDEQGKPVNPSEEYTTGMLS (2) 69836 HPVPFRANFKARSENVEALIRRITLCE* 69916 >Scaffold_86d 6 different sequences 499 aa gene model complete all boundaries checked 70591 MQTVVLAILFVFAIALPIYSRRKRYRLPPGPRGLPIIGNILDIPAGREWLTYAKWSREY (1) GSDIIYLNMAGTPVYVLNSIQATTDLLEKRSSTYSDS (2) VGWGKNFAFQPYGDFWREHRRMFHQHFHPEAVTKHHVHILKQAKDLLQRLLVDPDDFMQHLRF (2) MAGAAILRVSYGINVQPENDHYIGIAERAIHSLALTGNAGSYL (1) VKYLPSWAPGARFKRDGEIWRREVDQMFSEPFELVKRQMVG (0) ADGEPPDCVTASLLTTLDERKDRPREELEIAVKQAVGTSYV (1) GGADT (0) 71980 TVSSVATFILAMLQFPDVQRTAQAEIDRVVGSTRLPTIEERGSLPYVTAVMKETLR 72147 (2) WNQVTPL (1) AVPHKVTVDDEYKGYFIQAGSIVIGNSW (2) 72440 AVLHDETRYPNPEAFDPTRFLTPDGKLNPSAPDPVEAAFGFGRRICPGRHFAMD 72601 AIWMNLAFILATFNIEKPLDEAGRPIEPSGLYTPGLLS 72715 (2) HPEPFRVKFIPRSKAAEALIRETMFYD* >Scaffold_86e 6 different sequences 494 aa gene model complete all boundaries checked 73397 MENITFAVLFVLLLAVPLFFKRQRYRFPPGPKSLPLIGSVLEFPVQSSWLTYQRWGREL (1) ASDIIYLNVLGKHIYVLNSAQAVSDLLEKRSGTYSDR (2) VGWDKNFALQPYGEFWREHRKAFHQQFQPDMVPRYHVHMYKQAKDLVRRLIAEPNALKQHLRY (2) MAGALILRVSYGIDAEPNDDHYFEIIEQAVYSLTEVANTGAYL (1) VKYVPSWMPGAQFKRDATEWAPQVNQMFDEPFDVVKRAL (0) 74537 AEGNAPDSVCAALLSELDPRKDRAHQETVIKQAVGTAYIGGADT 74656 (0) intron lost TVSTLSTFVLAMMTHPDVQRTAQEHIDRVTGGDRLPTIEDRDALSYVTAIIKEALR (2) WRPVLPM (1) AVPHITTADDEYRGYHIPKGSIVMGNAW (2) probable seq error with extra G before AVL AVLHDEARYANPDAFDPTRFLTPAGTLDKDAPDALEAAFGYGRRVCAGLHFAL 75312 75313 DSMWVNVACV 75342 LATLDIRKPVDEHGVPVEPSMAYTTGLLE (2) QPEPFAVVFKPRSQAAEALIYEGHDD* 75568 >Scaffold_86f 6 different sequences 53% identical to 86a gene model complete all boundaries checked 77777 MLRLLVDNGLVSALAR 77730 77729 YGPAMLISIIVWTLGRVVYNLYFHPLAKYPGPRMAAATEWWQAWLEIFKAESLSLTLLELHAKHG (1) 77532 ex1 GDIVRIGPNE (0) ex2 77403 LHFSRPSAYHEIYTSKNKWAKNPAFYRYIVSPTESTFSTCEYDKAKKRRDITLPIFSRK 77221 77220 SILGMQHLVQEC (0) ex3 IDSMCENIDKHISEKKSVNILRAFRCCALDAVTSLCFARNTRATSEPEFRAPIEV 76969 76968 AMDFSLPLTPVLKHFPMVQVVMSWLPPDVLLWADARLGGFVQLRK (0) 76834 76780 MLDAQVEEILRDPDVLASAEHPTIYHAFLAHAPTPSVAELRDEALVYVHAGTDTSSDAL 76604 76603 AVGTLNVLGRPAVLARLRAELDTVWPRLDERPRYEALEALPYL (0) 76475 TAVVKESLRCSHGVVHPMTRIVPRGGARISGAHIPAGTIVAESNIFVHWNAD 76264 76263 VFPEPHEFRPERWLEGKTPSGESLDNWLVPFSKGPRSCIGI (2) NLGYCEIYMTFANLFRRYDLSLDGVK 76021 (2) PSDWKWRDCYLPHYLGPEMKVVATPRLS* 75874 >Scaffold_193a very similar to 252a gene model complete all boundaries checked 33472 MDDVNLFIRARTLLDSVLVLILTSIGYAVA ex1 33562 NAVYNVYFHPLSKFPGPRMAAASRWWKTYVEVYRDESIVDRLFHLHEKYG (1) 33711 ex1 NVVRIAPDE (0) ex2 33858 LHFSDPAVYNAIYSPKSRWNKDPLMYAPFGFGSHRSMFSTVEYQPAKKRRDLAAPHFSRK 34037 34038 SVLNLQGVIQAG (0) 34073 ex3 34127 IDELCDAMAQRAAEGKPTDIYSAFRCLNFDNVTSYCFGWSLHMVRSPDFSAEPVQNMQD 34303 34304 MHSSYQVWKHWLRTPMRLLLS (0) ex4 34473 KIKEQVDGYLERPEELDNAPHSLIFHSLMDPAQSTKLDKQSVVEEANLLIIAGTDTISN 34649 34650 ASALGTLFLLSDGGYMRDKLQAELKAIWPRLDDKPSLEVLESSAPYL (0) ex5 34841 KAACKESLRLSHGVMSPLLRVVPSQGATLGGHFVSGG (0) ex6 TKVGICNAFVHLNPALFPDPHVFRPERWPEPGAESLDTWLVAFSKGPRSCIGIK (2) 35165 ex7 35215 NLGWCELYMNLANLFR (2) ex8 35378 RRVRAYDLWYQDRFLPYYQGPDLLVHATPVSD* 35476 ex9 >Scaffold_193b very similar to 252a. gene model complete all boundaries checked 36645 MNVWARVSEGWTALEVILVALLTSIGYVVTTA 36741 LYNIYFHPLSKFPGPKLAASSWLWKAYVEVIKGESILDRLSKLHEEYG (1) 36884 ex1 PVVRIAPDE (0) ex2 37027 LHFNDPAVYNEIYTARSRWNKDDVMYAPFGKDTSIFTTREFREAKKRRDLSAPHFSRKTV 37206 37207 LSLQGLIQEG (0) 37236 ex3 37296 IDEFCEVITKRDADSKTTDIFRAFRCLDFDNVSSFCFGWSEHAIQAPDFNSAAVEELQH 37472 37473 SNKDFQFWKHFLRLPLPVLLLAS (0) ex4 RIKNQIATYIDK 37690 37691 PEELDKTPHPTVFHVLMDSSHGTRLSATAMAEEASLFLIAGTDTTSNASALGTIFALSDN 37870 37871 GYMRNKLKEELKSVWPRLEDKPSLEVLESLPYL (0) ex5 38020 KAVCKESLRLSHGAMSPLMRVVPQQGAVLGGHFVPGG (0) ex6 38188 TKVGMAHTFVHFNPTLFPEPHTFRPERWLEPGAEALDTWNVAFSKGPRSCLGIK (2) 38349 ex7 38403 SLAWCELYMNIAHIFR (2) ex8 38566 RPVRPYDLWHRDCFLPYLDGVDLLVYATPSTD* 38664 ex9 >Scaffold_252a 6 P450s on this sequence. gene model complete all boundaries checked 13839 MDVVRQALEGRTTKEYAGLALLAFAAYVVANI 13744 13743 IYNLYFHPLAKFPGPRAAAASRWWKAYVEVYKGESIVDRLFELHAEYG (1) ex1 DVVRITPDE (0) ex2 LHFSDPKVYNEIYNTRSRWDKDGEMYAPFGGNSTMFTALRYHDAKKRRDLTASLFSRK SVLSLQGSIQEG (0) ex3 LDELCDIISARSAAGKTTDLFRAFRCLNLDNVTSFCFGWSLHTVRAPDFRAPPLEEVQN SHGGYQFWKHLMLFRAVLLP (0) ex4 joint not conserved in scaffold 193a and b KLKEQVDALVAR PDELEAAPHPIIFHSLIDPAHGAKLSAQELMEEANMFIVAGIDTTSNATGAGVIGVLSN PSTYDKLKTELRTAWPRLDEKPTVEVFESLPYL (0) ex5 KAVCKEALRLSHGITSPMLRIVPPQGATLAERFVPGG (0) ex6 TQVGVSHLFVHLNPTLFPDPHAFRPERWLEPGAESLDTWLVAFSKGPRSCLGI (2) ex7 12062 NLGWCELYLNIANLFR (2) ex8 RRARASGDWKDCFLPCFEGPDMLIHTTPVAD* ex9 >Scaffold_252b 6 P450s on this sequence gene model complete all boundaries checked 27279 MWVVTCTCAAILYMVLVGLRKRHRFPPGPKGLPLIGNVLNAPAGSSAPMVYQQWSKLF (1) 27106 ex1 27051 GSNIIHLKIFGTHFFVLNDAKTASDLLEKRSANYSGR (2) 26941 exon 2 26800 TGWDRDWGLLDYGDSWKKHRHVHHRYFHPKVLEAYHPRM ex3 EKGVQMLLQLLHRSPADFNAHLR (2) 26615 ex3 FMTGHIIISIVYGIETKSADHPYIGLAEEGVKAFSATAVPGRFL (1) ex4 26287 VKHIPAWFPGADFKRQAALWKKDVDAMYETPFNDIKAAI (0) 26171 ex 5 26114 RRGETNTSIVGAALAELEGQADTEEAENIIMNVAGTAYP (1) ex6 TASDT (0) ex7 25872 TIITLEFFILAMLQHPEVQRRAQADLERVVGNARLPSIHDQALLPYITAIMHETLR (2) 25705 ex 7 WRPPFRT (1) ex9 25581 VSLPRKSLHDDEYEGYHIPAGSIMIANEW (2) 25492 ex10 25443 AILHDEARYPNPEEFDPSRFLNTDGSIDHTVPEPVEPFGHGRRLCPGRHFAMDVIWLTI 25267 ex11 25266 ANILHVYSIEKAVDQSGNVVEPSGKCIDGLL (2) 25174 ex11 25113 SAPEPFQAVFRPRSDAAIALLRSID* 25036 ex12 >Scaffold_252c 6 P450s on this sequence gene model complete all boundaries checked some differences with 252b 29836 MSDSTLLAVGLIFGLLVLARVSRKRPRFPPGPKGLPIVGNLFGIPRDHSWLTYAEWGRLY (1) 29675 ex1 29611 ISNIVHFEALGQHFFVLNDEKITKEIFEGRSQIYSDR (2) 29507 ex2 29377 TGWHRNWAFTPYGESWRQNRRLFHQFFRAQAIPDYHDHM ex3 AKGARGLVQLLLQTPENWMRHIR (2) 29192 ex3 29135 HASGSTVLDAVYAMDVDPNDNERLEGVERAVETLVEIAEAGGYL (1) 29004 ex4 28881 VKHIPTWFPGAGFKRQAMAWKRDIDALFERPYHEVKSAMGCRIFSE (0) 28765 ex 5 (ex5 GT boundary does not match 252b possible mutation from GTA to GGA) 28706 QRGKARSCVTSSLMSMFSEKLGDPDVEETIMGIAGTSY (1) ex6 AGSDT (0) ex7 28473 TVFTMFAFVQAMLLYPNVQRKAQEELDRVVGRDRLPEVADRQSLPYVSAIVKEILR (2) 28303 ex 8 WNPILPA (1) ex9 AVYHKSLADDWYEGYFIPAGSIVIGNTW (2) 28097 ex10 28033 AVLNDAERYPDPEPFKPERFLTADGQLDSQVPDPVEVFGYGRRVCAGRHFAQTALFLYI 27857 ex11 27856 AHVLALFTIDNPLDESGHVIKPRRDCLTRL (2) 27767 ex11 27706 VPKPFKAKFKPRFTGVEELVHMSSIPTQ* 27623 ex12 >Scaffold_252d 6 P450s on this sequence gene model complete all boundaries checked 32807 MQSNALVIACLVAGLLAVARASRKSRQRRYPPGPNGLP ex1 ILKNLFDIPRTYSWLTYEAWGREY (1) 32622 ex1 32563 NSDVVHFEALGLHFVVLNSTEAAKELLEGRSHIYSDR (2) 32456 exon 2 32315 TGWHRHWGIMAYGDAWRQRRRLFHQHFRPQAVPQYHEPMV ex3 RSARTLLQLLLESPDDWMRHVH (2) 32133 ex3 32076 HVSGGTVLKVLYAVDVDPHDDEGMDVVDKALQIFMKLPEPFV (1) 31966 ex4 31828 VKHLPAWFPGAGFKRRAMEWKVHVDRMFEEPYHKIKVAA (0) 31721 ex 5 31653 QSGTARPCIATSLLSAAWEDLENPDVEEQLISVLGTAY (1) 31534 ex6 ANSDT (0) ex7 31429 LEQTIFSVYSFVIAMMLYPDVQRKAQEELDQVVGRDRLPEIADRESLPYFSAVLKEVFR (2) 31253 ex 8 WHPVTPI (1) ex9 AAPHKSLEDDWYKGYFIPAGTIVFGNTW (2) 31034 ex10 30986 AILHEESRYAEPDVLRPERFLTPAGTLDPAVPDPDEVFGYGRRICPGRYFVQDALFLYA 30810 ex11 30809 SHLLAAFTLSKFVDDEGHVEEPSRETVSPIF (2) 30711 ex11 MIPRAFKANIKPRYEGAERLVQMAFTSAS* 30563 ex12 >Scaffold_252e 6 P450s on this sequence gene model complete all boundaries checked 35740 MFDNGVTIALLLIGVTLLVAEALKKRHRFPPGPKGLPIIGNL ex1 LDVPKDYHWLTYTAWSRQF (1) 35558 ex1 35502 DSDIIHLEALGQHYFVISSVDVAKDIFEGRSQLYSDR (2) 35392 ex2 35262 TGWERNFAMMAYGDSWRRHRRLFHQHFRLQNVPAYHD ex3 QIAKGARNLAQLLLQTPDKFGRHIR (2) 35079 ex3 35023 HVVGAVILDIMYGIEVAPDDDERMEHLERAVHIFMELGQAGGFL (1) 34892 ex4 34762 VKYLPTWFPGAAFKRQAMEWKPQVDAMYEISYNEVKDSM (0) 34646 ex 5 34580 QRDQAKPCITSALLTACWDDLDQSNMEETLIGVTGTGY (1) 34464 ex6 AGSDT (0) ex7 34346 SVFALNAFVLAMMLFPDVQRKAQEELDRVVGRERLPSADDRDSLPYISAVIKELLR (2) 34182 ex 8 WHPITPT (1) ex9 34061 AAPHKSIADDYYNGYFIPAGSIVIGNTW (2) 33978 ex10 33923 AMLHNEERYPDPEAFKPERFLTPEGTLDPHVPDPAEGFGFGRRICPGRHFAQASLFLNI 33747 ex11 33746 SNVLATCMIEKPVDEFGNVVEPTRECTSRFFW (2) ex11 33602 ALKPFEAKITPRFEGVENLVQTMSTYTN* 33513 ex12 >Scaffold_252f 6 P450s on this sequence this seq runs off end of scaffold gene model complete all boundaries checked 37365 QVAVQDELDMVLGRKHLPSIEDRDSLPRVTAMLHEVLR (2) 37252 ex 8 WHPVGPM (1) ex9 37123 GVPHRLTVDDEYRGYHIPAGTIVMINAW (2) 37040 ex10 36990 AILHDESVYPEPDIFRPERYLDSDGRLRTDMPYPVEGFGAGRRLCPGRHFAHDMLWLAI 36814 36813 AHVLTVFRIERAVDEDGREIVPEAKFEPWLI (2) 36721 ex11 SPPEPFQCQTKLRFPEAEGLVHLAAMDE* ex12 >Scaffold_169a 6 different P450s gene model complete all boundaries checked 14401 MDIPVPYAFIVVGALVLFFRLRKKPRFPPGPKGLPIVGNALDLPKA (1) exon 1 RE*LTYAKWSREC (1) ALTERNATIVE EXTENSION OF EXON 1 ALLOWING A STOP CODON THIS MATCHES BETTER WITH OTHER SEQUENCES (POSSIBLE SEQ ERROR) 14162 RSDIIHLRFFGTHVFVLNSVKVVNELMVKRSAIYSD (2) 14052 exon 2 13928 SRTGWQRNFTFVDLGDHWKARARMFQQNLGTSTISK ex3 HRPKLIEGNRKLLLNLLLSPDDFMKHIR (2) 13740 ex3 13689 YLSGSSILGIIYGIEVQQDHDPFVETAEKALQCLAAVINAGSYA (1) 13561 ex4 13439 VRFLPTWAPGAQFKRDAAEWYKYVTALIDGPYTYVKESL (0) 13323 ex5 ANGENNTSIVGTLLQELSDDEKRSEQEDTIREAFGTAYT (1) ex6 GGVDT (0) ex7 13033 TYSSVNSFILAMLKYPDVQRKAQEELDRVIGRDRLPSFDDRDALPYITAIVKETLR (2) 12869 ex 8 WGLVAPL (1) ex9 12736 AAPHQLRVDDEYEGYFLPAGSVIIGNAW (2) 12653 ex10 12603 AILNDEKRYPHPESFIPERYLTTDGTLDSSAPDPMEACFGFGRRMCLGRYFAFDSLWIAV 12424 12423 ASLAAAFHMEKAVDESGLVIEPSGEYT (2) 12343 ex11 12271 YPLPFKAVFRPRHEGVVALIKADVSESDSL* 12179 ex12 >Scaffold_169b 6 different P450s gene model complete all boundaries checked 26938 MFSALVLSLLLAAALFLRFRRKRYPLPPGPKGLPIIGNARDIPKSFPWYTYDRWSREY (1) exon 1 26705 NSEIIYLRLVGTDVIVINSEKAANELLNKRSTIYSD (2) 26598 exon 2 26446 SVGWGGRNFAFAHYGDLWRAHRRLFHQYFHPGAVPAYHAKS EX3 TLEVRRLLPRLLSHPDDFMQSIR (2) 26273 ex3 26219 TMTGAIILGITFGMELQPENDPFVALAEEALHAMAQVGNVGSYI (1) 26088 ex4 25957 VQYLPSWAPGAAFKRQAAKWNKIVLEMYEKPFQTIKQAL (0) 25841 ex5 25782 ARGEAPPSILTSMLETLDPEEDNAARESDMRHVTGTAYT (1) ex6 AGADT (0) 25547 TVSSLGTFILAMLLHPEVQRRAQEEIDRVVGSDRFPEYDDRDSLPYITAIMKETLR (2) 25383 ex 7 WRQVTPL (1) 25257 AVPHRLRVDDEYNGYHLPAGSVVVGNSW (2) 25171 ex9 25117 AMLHDEERYPNSDLFDPTRFLTPDGELDPDAPGPELAAFGFGRRICPGRYFAMDSMWIAM 24938 24937 AHILATVNIEKAVDDAGNILEPSGEYT (2) 24845 ex10 24786 YPVPFKVAFKPRSAAADALIQGGTPLA* 24703 ex11 >Scaffold_169c 6 different P450s pseudogene gene model complete all boundaries checked 28974 MSGHAVYVLATATRCNHGGSTEPSSPSYSS ex1 GPPGLPLTGNLLELPMRLGWLAFTEWSRKH (phase 1 GT missing) 29153 ex1 GSDIIQLEVFEKHVLVISSLHTVWELWAHPYTAHG (phase 2 GT missing) 29312 ex2 >Scaffold_169d 6 different P450s defective GT bound on ex3 gene model complete all boundaries checked 29901 MGWTLCLYALLGLTGLWV 29958 ARVRRPRQRYPPGPTGLPVLGNVFDVPLENGWLVFDQWARQY (1) 30083 ex1 30155 DSDVVHAEALGRHIYVVNSAKAARELFDGRPHVYSD (2) 30262 ex2 30408 KSGWWRSWVMLPYGDYWKEHRRLFHQHFRPQSLPQY ex3 HEKQAKAARRLVRLLLDSPQDYAKHIR (no GT here in stop codon) 30581 ex3 YATGSSILNVVYSFDAQPGDPRLELVEAAMGTANELMHTGVYL (1) ex4 30890 VKHLPMWFPGAHFKRQAARYKRLVDDMFEIPYAQLKSSM (0) 31006 ex5 31067 QEGTIEPCFAAALLSEAEDSASPERDDMFMNLAGTAYA (1) 31177 ex6 AGTDT (0) ex7 31315 IMMTLLTFILAMVLHPEEQAAVQEEIDRVVGRDRLPGLADRESLPRVTAVIQEVLR (2) 31479 ex8 WHPPLPL (1) ex9 31595 ATPHRAASDDEYNGYHIPAGALVLGNCW (2) 31678 ex10 31735 AMLHDARVYPDPDVFRPGRFLAAGGDAPRADVPLPAEAFGFGRRICPGRHFALDSLFLF 31911 ex11 31912 AHLLAAFRIEHAVDAEGNVVPVVAAFEPQA (2) ex11 PPKPFKARFTLRYCGAENLVRGGVRAG* ex12 >Scaffold_169e 6 different P450s gene model complete all boundaries checked 36710 MALLVYFCAGLVPVLLLVLGLRKRPRYPPGPRGVPIFGNI ex1 FDVPMKYAWLEYVKYGQQY (1) 36534 ex1 36476 KSDIVHFQVLGQHIVVLNSLQAVGDLLDKQSSIYSD (2) 36369 ex2 36225 RTGWSRSWVQMEYGDQWRMHRRLMHQHFRSTMIPQ ex 3 YHPKQTKAVRRLIQSLLEQPEHFMEHVH (2) 36046 ex3 35992 FLSGSLILDVVFSFDVRPGDAILALAERAVDTTKAIIAAGVWL (1) 35861 ex4 35733 VKYIPSWFPGAGFKRIAAKWKTDVNKMFDVPYAKFKDSM (0) 35617 ex5 35558 REGSATPCFASALLSGAEDDNGGVIDNQDEVFISLTGTAYV (1) 35439 ex6 AGSDT (0) ex7 35306 LSDALSTFLLAMIAFPEKQRAAHEALDCVLERKRLPGVEDRDALPHITALAYEVLR (2) 35145 ex8 WHPVVPL (1) ex9 35015 SIPHRTTADSYYKGYYIPAGSTIFPNSW (2) 34929 ex10 34882 AILHDEALYPEPHLFRPERFLNEDGSLHAHARDPIEAFGYGRRICPGRHFAHDALWLAI 34706 ex11 34705 AHILAVFKIERALDVDGNEIEPKLDFMPHFLS (2) ex11 MPKPFKCRFTPRFPDAANLALSASSDY* ex12 >Scaffold_169f 6 different P450s gene model complete all boundaries checked MAVLLAAGYY 39658 ILSVAVFILLYNASRRRQRLPPGPKGLPLIGNLFDVPNDYAWLRYKELGQQY (1) 39503 exon 1 39451 GSDIVHMQALGNHILVLNSMKAAVEILDKRADISSD (2) 39341 exon 2 39206 SSGLGRAWTQLGHNDSWRIHRRLFHQHFRPSAISQYHTKQT ex3 KAIHRMLSFLRESPAQYMDHIR (2) 39024 ex3 38973 FMAGSMILDVVYTLDVQPGDYRIKLAEMVAHVSTEVFTAGVWM (1) 38842 ex4 38718 VRHLPTWFPGAGFKIQAAKWKTTVDRSYDIPYEQFKASM (0) 38602 ex5 38540 HEGGGEPCLASALLSSAEDVEELERMDKVFSSLTGTAYI (1) 38427 ex6 AGTDT (0) ex7 38316 TVSTLASFVLAMTIFPEAQLAAQAEIDRVLGGTRLPDINDKANLPQVTAILYETLR (2) 38152 ex 8 WNPVLPL (1) ex9 38034 ALPHRTTADTSYDGYYIPAGTVVLGNSW (2) 37948 ex10 37900 AILQDETLFPEPQLFKPERYLNGDGSLNSSAHYPIETFGFGRRICPGRYFAQDAVWLAI 37724 37723 AHILAVFKIERARDGDGKEIVPTPD (2) ex11 MPKPFECKFKVRSPQAESLIESAALGG* ex12 >Scaffold_205a 13 different P450s on this sequence 59% to scaffold 4 gene model complete all boundaries checked 7562 MELPAHTKYLLACLAFAIFVLLHSKRRRPRYPPGQRGLPLVGNLWDIPTEYAWVKYREIGAQL (1) 7735 ex1 7804 GSDIIHFEVLGSHYVVLNSDKAVKEVLEKRSHNSSD (2) 7914 ex2 8041 RTGWHRNWALLEYGDYWKDLRRIFSQYFRPSAVPQYHSKQTKAVRRFLNLLLNSPDDFTKHIR (2) 8229 ex3 YLAASAILDVVYGFDVRPGDPRIELVERGVHTLTDISAGVYL (1) 8407 ex4 8527 VKYIPAWFPGASFKRKAAGWKVLVDAVYEVPYSQYKDAM (0) 8643 ex 5 8707 REGTAKPCFAGTLLSEANPDGDLDETFRCLTGTAYV (1) 8811 ex6 GGADT (0) 8935 VSSTLLTFMLAVTMFPETQDPAHEELDRVLGRKRLPDIRDRDDLPYITAMLHEVLR (2) 9096 ex7 WHPVAPL (1) 9227 ALPHRLTADDEYEGYHIPAGAVVFGNAW (2) 9310 ex9 9369 AILHDPATYGDPDVYAPARYLTADGRALRADVPYPLEGFGFGRRVCPGRPFAHDILWLA 9545 9546 LAHVLAVFRVGRARDAHECEVPPRGVFTPGLI (2) 9641 ex10 SVPEPFGCRFVPRFPGAEELIRQSAMPE* ex11 >Scaffold_205b 13 different P450s on this sequence defective GT boundary on ex10 gene model complete all boundaries checked MEDTSRSLVGPSLWAVFALGL 10923 LFAFCLRRQPRYPPGPRGLPIVGNVFDIPMNVGWKVFRDVSRCF (1) 11054 ex1 11111 ESDVIHYEALGSHLVVVNGAKAAKELFERRANNYSD (2) 11218 ex2 11354 STGWHRNWGQLEYGDRWRQHRRLFHQHFRPMAVSQYHPRQVKGVRVLLRALSESPEDFQRHIR (2) 11536 ex3 11593 FMAGATIMEIVYAYDAQPGDPRIKLVEDAVDTLTFVVNAGVYL (1) 11724 ex4 11847 VKYVPNWFPGASFKRQAAEWKKLVDALYEQPYQEFKATV (0) 11954 ex5 12023 KEGNAKPCFAATLLSSVENDEDIENLEELFMGLTGTAFV (1) 12136 ex6 AGSDTV (0) ex7 12260 TIASLNVFILAVTIFPEAQRSAQEEIDRVLERKRLPTMEDKVLLPHVTALVHETLR (2) 12424 ex 8 WHPPLPL (1) ex9 12547 AAPHRVIEDDEYEGYFIPAGTTIIGNAW (MISSING PHASE 2 GT) 12633 ex10 12682 AILRDEDLFPDGDSFKPERWLNEAGALRDDLPYPMETFGFGRRICPGRHFANDVLWLAI 12858 ex11 12859 ANILTVFSIERALGEDGQPIVPEAKFSPRLI (2) 12951 ex11 13015 SKPEPFKCAFKHRFSGAEDMIRLAAIVEE* 13098 ex12 >Scaffold_205c 13 different P450s on this sequence exon 11 extended by 10 amino acids (lower case ) due to missing GT boundary at the expected site gene model complete all boundaries checked MESLTLRNCASAFLAG 14073 LLVLACGLVLRPRRRPRYPPGPKGLPLVGNMLDIPTEYAWERYYELGKEY (1) 14222 ex1 14277 GSDVLFFRVLGSHFLVLNSAAAANELLEKRANVYSD (2) 14387 ex2 14522 RTGWDQNWAFWEYGEGWKQARKMFHQHFRPSAAPQYHLKQ ex3 TKAARRFVKLLLESPASFAQHAR (2) 14704 ex3 14752 FLAGSAILDAVYAFDVQADDPRIALVERGVHTLVEISRGVFL (1) ex4 15004 VKYIPSWFPGAGFKRQAAQWKDAVDATYSDPYRQFKTLL (0) 15111 ex 5 15177 RNGQAEPCVAASLLSSSGDEPSGALDDLLKSVAGTAYV (1) 15287 ex6 GGSDT (0) 15410 VSATLTTFILAMTMFPDAQAAAHAQLDEVLKRTRLPEMADRAALPYITAILYEVLR (2) 15571 ex7 WQPAGPL (1) 15698 GLPRRLMADDEYRGWHIPAGTVVLPNIW (2) 15778 ex9 15843 AMSHDPGTHAVPAQFVPARYLAADGTLREDVPCPADVFGFGRRVCPGRPFAQDVLWLAI 16019 16020 AHVLSVFRMEGPMGERGEIRHSRLFTPGLIrrvgrplerg (2) 16109 ex10 16168 RLPEPFACSFRPRFPGAENLVDAGVVG* ex11 >Scaffold_205d 13 different P450s on this sequence gene model complete all boundaries checked MELPPVPHPLIAYLCAGLLLAGLVVTRL 17227 RRRRHYPPGPKGLPLIGNLFDIPTDYAWKIYRAFGDQY (1) 17340 ex1 17399 GSDIIHFEIFGTHLVILNSAKAARDLFEKRSSIYSD (2) 17509 ex2 17641 RTRWGRSFGFMQHGDEWREHRRLFNMHFRPSAIAQYHAK ex3 QKSAVCTLLRSLLDAPEQFREHVH (2) 17826 ex3 17889 FMAGDVIMGIVYGFDVQPGDSRLQLVEKAVMTLNQIVNAGVYL (1) 18020 ex4 18140 VKYIPAWFPGAGFKRHAAEWKKLVDDMFEIPYRESMKSL (0) 18235 ex 5 18312 QEGKCESSFAASLLAQLEGQESPDNIERIAMDVLGTTYV (1) 18431 ex6 AGSDT (0) ex7 18545 IITATSTFLLAMILHPEVQITVQAELDALLEGARLPNISDKAALPSVTAVLQEVLR (2) 18712 ex 8 WNPGLPL (1) ex9 18841 VPHRVVADDEYKGYHIPAGAAVIGNTW (2) 18921 ex10 18984 AMLHDETTYPDPEPFKPQRFLNEDGTLNADVPYPTDVFGHGRRMCPGRHFAHDMLWLTI 19160 ex11 19161 ASILTVYKVERDVDEDGQEITPTASFTS (2) 19244 ex11 19304 RVPTPFRCRFTPRSASAESLIRSSGVSTE* 19393 ex12 >Scaffold_205e 13 different P450s on this sequence gene model complete all boundaries checked MEQSTLHILPYLCASIPVLVCL 20067 LVLRLRRPHYPPGPKGLPLVGNLFDVPLSHGWVAYRELAKQY (1) 20192 ex1 20254 GSDVIHLEILGSHIVIINSAKAARDLFDKRSNIYSD (2) 20364 ex2 20499 NTGWHRNWGFMAYGDYWRKHRRLFHRHFRPAAVPQYHSAQAKGVHNLLKLLMRSPERFREHIR (2) 20681 ex3 20733 FMAGSTILDVVYALDVQPGDSRIELVERAVHTSTEIVAAGVYL (1) 20864 ex4 20981 VKHIPSWVPGAAFHRKAAAWKALVDRMYEEPYNQFKASM (0) 21097 ex5 21154 KDGNAKACLTASLLMEAESTHQLDAIEDILISVTGTAYG (1) 21273 ex6 AGTDT (0) 21398 AVASLNTFMLGITMFPHTQLAAQDELDRITARQRLPTMEDRENLPHVTAILQEVLR (2) 21565 ex 8 WNPAAPL (1) 21695 GLPHRTVRSDEYNGYFIPQGATIIGNSW (2) 21778 ex10 21829 AMLHDEAIYPDPGSFKPERFLTEDSTLRSDVPYPIEAFGFGRRICPGRYFAHDLLWLTI 22005 ex11 22006 AGILAVFRIERARDEQGDEIVPAGDFSR (2) 22086 ex11 22147 SPEPFQCRIVPRFAGAEALIHGTGLLG* 22233 ex12 >Scaffold_205f 13 different P450s on this sequence GT boundary on exon 1 missing GGT = GGC gene model complete all boundaries checked MEWSLSLGYALLGLGIMW 22749 VVKYAQRPRRRYPPGPKGIPILGNVFNIPLENSWISFDQWSRQY (Phase 1 GT missing) 22880 ex1 22942 ASDIVHVEALGKHVYVVNSARAAKELFDGRANVYSD (2) 23049 ex2 23188 KCGWSRSWAMLPYGNYWREHHRLFHQHFRPQSMVRYHEKQRRGARRLLQLLLDTPEDYEKHMR (2) 23370 ex3 23422 YAAGSTILDVVYSFDVQPNDPRIELVEAALGTANDLMHAGIYL (1) 23553 ex4 23678 VKHIPTWFPGAQFKRLAAKYKRLVDNMYTVPYSQLKASV (0) 23785 ex5 23848 KLGTAQPCLVASLLSEADEHVTPERDEIFMNLAGTTYA (1) 23964 ex6 GGTDT (0) ex7 24083 IVIALSIFILAMILHPEQQVAVQKEIDRVVGRDRLPELADRESLPRVTAVIQEVLR (2) 24261 ex 8 WHSPLPL (1) ex9 24382 ATPHRATSDDEYNGYYIKAGSVVIGNAW (2) 24465 ex 10 24518 AMLHNENVYPDPASFKSERFLTPDGKLRDDVPFPIEAFGFGRRICPGRHFALDSLFLLV 24694 ex11 24695 SHILAVFTIEHAVDADGHIIPVEPEFEPQAFRWA (2) 24775 ex11 PPKPFKAQFKLRFLAAEDLIHGSALE* ex12 >Scaffold_205g 13 different P450s on this sequence GT BOUNDARY MISSING ON ex6 GGT = GGC gene model complete all boundaries checked 27389 MLTALSCVFAEALAVWAVSSWTRPRHEYPPGPKGLPFLGNMFDIPMKYGWVTFANWSRLY (1) 27226 ex1 27153 GSDIVHVQALGKHIYVINSAKVAKDLFDGRPHIYSD (2) 27043 ex2 26911 NSGWKRAWALSPYDDEWREYRKLFHQHFRPSAVQQYHHKQTKAVRRLLQLLLDTPEDFLAHLR (2) 26723 ex3 26667 YAAGSSLLDVVYSVDALPGDLRITLVEKAVHTFAKLLETGVYL (1) 26536 ex4 26416 VKHIPAWFPGADFKRQAAEYRQLVDDMFKVPYQQFKDAW (0) 26315 ex5 26239 RRGTAQPCFAASLLTDADPLDGSEHEELFINLTGTTYA (phase 1 GT missing) 26123 ex6 AGSDT (0) 25997 TVAAMSTFMLAMALHPEVQRWVQEELDRVVGRGRLPEMADQPALLRVMATVHEVLR (2) 25812 ex7 WHPPLPL (1) 25694 ATPHRAMADDVYAGFTIPAGSIVLGNSW (2) 25611 ex9 25556 AILHDDKTYTNPHTFDPRRFVGPNAQPFPEVVFGHGRRECPGRHFALDILFLAV 25395 25394 AHVLSVFAIERVDNSDPGIGDIQGLFTPHVL (2) 25344 ex10 25247 SYPKPFKASFKPRFPGVESLVRTAALSEI* 25158 ex11 >Scaffold_205h 13 different P450s on this sequence 65% to 205a gene model complete all boundaries checked MSRFLYDYST 30650 LLYLCAGITFVVLITLSSRPRRRYPPGPKGLPIVGNLFDVPTDHGWKRYQEIGKEY (1) 30483 ex1 30419 GSDIVHFQVFGSHIVVVNTAKAARELLDKRSNIYSD (2) 30309 ex2 30181 KTGWHRNFSLMPYGEGWRTRRRLFHQHFRPMAVPQYHTRQLKAVHGLVQSLFEAPQNYKEHIR (2) 29993 ex3 29936 FMAGSAILDIIFAFDIQPGDPRIEIVEKGVQTATEFMCSGVYL (1) ex4 29691 VKYLPSWFPGAGFKRQAAKWKALVDDMHEIPYYQFKQTM (0) 29575 ex5 29520 REGKAKPCFASTLLSSAAENDKDSLESLDEIFMSLTGTAYV (1) 29395 ex6 AGSDT (0) ex7 29280 TISALNTFVLAMTMFPETQAAAQEELDRVLGRKRLPDFDDRDSMPYLTAMVYELLR (2) 29110 ex8 WHSVLPL (1) ex9 28990 GLPHRTLADDEYNGYFIPAGTVIVGNCW (2) 28904 ex 10 28853 GMLHDDDLFPDPDIFRPERFLNADGTLNSDAHFPIETFGFGRRICPGRYFAQDLLWLTIA 28674 ex11 28673 NVLAACSIERVVDEKGFEVRPTGDMTPRVL (2) 28584 ex11 28524 SMPEPFECNIRPRFSGAEALVRSACLND* 28438 ex12 >Scaffold_205i 13 different P450s on this sequence gene model complete all boundaries checked 34928 MEALASRITVYLCAGVVLVYVFSRVFKRRPHYPPGPRGLPIIGNLLDVPSLYGWIAYKNLGDQC (1) 34755 ex1 34674 GSDIVHLEVLGSHYVVLNSAKAARDLLDKRSNNYSD (2) 34564 ex2 34440 RTGLERGLGMLPYGDYWRLHRRLTQQHFRAAAVPQYHARQAKVVRKLLHSLLDSPERFMDHIR (2) 34252 ex3 FMAGAIILDIVYALDVHPGDHMIEVVETAMGRINEIINAGVFL (1) 34073 ex4 33951 VKYLPSWFPGAGFKRRAAAWKVDIAPMFEAPYERFKQSL (0) 33835 ex5 33778 GGTTRPSFAGNLLSRVQNEEELSQLEDVFMNITGTAYG (1) 33662 ex6 AGSDT (0) 33543 TLATLTGFVLAMTIFPEKQLAAHAELDKVLERKRLPEVEDMQYLPSITALVYEVLR (2) 33373 ex7 WNPAAPL (1) 33241 GIPHQTIVDDEYNGYFIPAGTVVIGNAW (2) 33155 ex9 33111 AMLRDPNTYPDPDTFKPERFLAKDGSLRDDVPYPTEAFGHGRRICV 32974 32973 GRHFAQDVLWIAIAHILTVFRIERAVDEDGREIVPVPDYTPHFV (2) 32854 ex10 32788 TMPKPFKCRFTPRFPGAEGLIRSAADASNE* 32696 ex11 >Scaffold_205j 13 different P450s on this sequence 55% to 205b gene model complete all boundaries checked MLDRSVLLPCLVG 36862 IVAAIIIVSRRGRERYRFPPGPKPLPLIGNLLDAPTDLGWYTYAKWARQY (1) 37011 ex1 37076 HSDIIHFEVFGQHFYILNSVRVAKDLLERRSQVYAD (2) 37180 ex2 37320 RTGWHRVFSMKAYGESWRQQRRLFHQHFRQQAIPEYHAELTNGARMLLRSFLESPNHFLEHIR (2) 37508 ex3 HISGGTILAVLYGIDVDNYSAERMESIEKAIEIVTEIADGGVYL (1) 37692 ex4 37811 VKYLPTWFPGAGFKRRAAEWRVHVETMFEAPYGDVKRDM (0) 37927 ex5 37985 KLGKAKPCVATKLMSAFGDKAEDPEIEELLICVTGTAY (1) 38098 ex6 AASDT (0) 38216 IVFAMIAFVRAMMVFPEVQCKAQQELDRVVGRDRLPVISDQASLPYLAAVTKELLR 38383 ex 7 WHPITPI (1) 38513 AVPHKSTTDDWYDGYYVAAGSIVIANVW (2) 38599 ex9 38655 AMFRDEERYPDPEAFRPERFLTAEGTLDPAVPDPVEVFGFGRRMCAGRHYVDAALFLAI 38831 38832 AHVLHALTIEKPRDARIPVVDPPPGYALSRL (2) 38924 ex10 39002 APEPFEADIKPRFEGVERLMQMSSLHSF* 39088 ex11 >Scaffold_205k 13 different P450s on this sequence gene model complete all boundaries checked MLSSTILVFSS 41551 VCTLAAIVAIVRRFGGKRRHHFPPGPKGLPIVGNLFDVPTNFGWYTFAKWAQQY (1) 41390 ex1 41324 NSDIIHFEVLGKHFYVLHSAALAKELFERRSQTYSD (2) 41217 ex2 41068 RTGWHRVLTLTPYGEYWRQYRRLFHEHFRAQVIPQYEDKMLTSARNLLRLLLETPDRFLRHIR (2) 40880 ex3 40828 HASGRTMLDIVYALDTEAHNNAVILESVEKAIEIFAEVAEGGAYL (1) 40691 ex4 40579 VKYLPAWFPGASFKRQAAAWRVHVDTMYEAPYQDVNRRL (0) 40463 ex 5 LAGKAKPCITTSLISAFSDKCEDPDVEESLISFAGTTY (1) AGSDT (0) 40175 SVFKMTIFMRAMLLFPEVQVKAQEELDRVVGRDRLPELADKDSLPYISALYKELLR (2) 40008 ex7 WHPLFPL (1) 39869 AFPHKSTVDDWLDGYFIPAGSLIIGNAW (2) 39783 ex9 39734 ATLHDEERYPDPEAFRPERFLSDDGKLDPSVPDPVEAFGYGRRICPGRHYADASLFLYI 39558 39557 AHILFAFTIRKPLDERGNVIEPPPG (2) 39492 ex10 39394 VPEPFKASIKPRFEGVEELIQLSTQLATSD* 39302 ex11 >Scaffold_205L 13 different P450s on this sequence region between 41600 and 44300 searched for P450 exons and this was the only one. Pseudogene fragment, lone exon gene model complete 42834 RFLMAERALRTDVTFPIEVFGYSRRICLGRQFAKDVLFLAISNILAIFTIEKAVDEHSGSIEVQNAFLPHAIRCA 42610 ex10 >Scaffold_205m 13 different P450s on this sequence gene runs off end of scaffold frameshift in exon 2 gene model complete all boundaries checked end of this scaffold matches end of scaffold 180 but this P450 sequence does not continue there 44302 MPDCTTIGYLLASVALAYALASRHPRSSRYPPGPRGLPLLGNLFDMPRKHSWTKHQELSKTY (1) 44487 ex1 ESDVIHYQVLGLHIMALNSG (frame shift) EAVRDLLNKRSTIYSD (2) 44652 ex2 44788 RTGWHRNWALMRYGDAWKERRRHFHEHFRPQAVSQYNFKQVKAARILLNSLLESPRAFSEHIR (2) 44976 ex3 45034 FMASSLILDIVYALDVRPDDPEARRVERALETLAEISASVFM (1) 45162 ex4 45291 VKYLPSWFPGAGSKRQASIWKNIVDEMFETAYQVCKNSA (0) 45389 ex5 45479 QHEYVRPCFTTALLSEVSDPANMKEMDEIFMSLAGTTYI (1) 45583 ex6 GGSDT (0) 45650 ex 7 Scaffold 205 ends at 46031 >Scaffold_223a N-term gene model complete all boundaries checked sequence runs off end. Probably continues on scaffold 182 700 MALHPRQYRLRFLRDVLRAVVWPQLVLNTVLYAAGLRVGPVLRVLLSLLAVPLYGT 533 532 ARTLLAQQRNAAWAG (1) 488 ex1 416 PCVRGRWPGNVDVVLGFVRSLREDYLMQFLDDLFRAHGCRTLNMRLFWEDQ (0) 261 ex2 207 IWTIDEGHVRHMIAGAGVECFDKGYYWQERL (2) 115 ex3 58 ESFLGNGIFNRWAYMD (0) 11 ex4 >Scaffold_182 N-term exons 1 - 2 missing off the end of this seq. The missing part is probably on scaffold 223a gene model complete all boundaries checked 76 QRAIARPWFAKDRISDLNIFDRHTSTTLALIADFADRREAFDAQDLFARFTLDSASEFLFG 258 259 KCAETLHGTLPVAGRAKLGPKGSSVEDEFGSFAWAFE (0) ELFHDKTAKHRKVIQDWLQPIVREALHSKAAAARGEDTGE 609 GTFLSHLTKTTD (1) 645 702 DPQDIAYSILNMLLAGRDT (0) 758 808 TAAALSFTVYLLALHPEVVEKLRAEVVQAYGSDGRPSVEDMKSLKY (1) 969 997 LRAVLNETMRLFPPVPLNIRTSDDTPRVFPASAGAPKYYVPPRTPVVYSSVIIQR 1161 1162 RKDLWGADALDFRPERWLEPETARRLAENPFMFMPFHAGPRL (0) 1287 1338 CLGQNFAYNEMSFFVVRLLQRVAALELAPDAQPEGSLPPARWKNGEGRQAVEKIWPGSSV 1517 1518 TTYIKVSSTRSRPCG* >Scaffold_223b 603 amino acids gene model complete all boundaries checked 8694 MELHPRQYRLRFLLDVLRAIVWPQLVFNAALYLAGFHPGAFLRVVASV 8837 8838 LAVPLLGTVRTAISQRRNKIQAG (1) ex1 AALGAKEVPCVRGKWPGNLDIVLGFVRSLKEAYLMQFLDDLFREYDCKTLNMRLLWEDQ (0) ex2 IWTIDEAHVRYMLAGPGFEWFHKGYYWQERM (2) 9282 ex3 ESFLGNGIFNRWA (2) ex4a possible error here see 223a above YTI (0) ex4b required to maintain phase at joints might be false 9493 QRAIARPWFVKDRISDLNIFDRHTTTTLALISEFVDRREAFDAQDLFARFTLDSASEFLFG 9675 9676 RCLDTLHGTLPVAGRAKMGPKGTAIEDAFGSFARAFE (0) ex5 9850 DVQVQIARRTRIGKPWPLFELFTDKTAPSVAVIHDWLRPIVHEALAKKSAASAEKESGED 10029 10030 STFLSHLANSTD (1) 10065 ex6 10124 DPQDIAYSVLNMLLAGRDT (0) ex7 10227 TASVLSFVVYFLALHPHVTEKLRAEILQAYGPDGRPSVEDMKDLKY (1) ex8 10419 VRAVLNETMRLFPPVPMNLRLSDAHPRIFPASGSAPKYYVAPRTVILYSIFLVQR 10583 10584 RTDLWGADALEFRPERWLEPATARLLADHPFAFTPFHAGPRL (0) 10709 ex9 10771 CLGQNFAYNEMTFFIVRLLQRVSGFELAPDAQ 10866 10867 PEGSLPPARWKYGEGRQAVEKIWPASSVTTFIKVSLA 10977 10978 SMPCCGERWLKRRRQGGLWVRAVPA* 11055 ex10 >Scaffold_74 gene model complete all boundaries checked 60930 MPHPFSRYRLRVFGDFVR 60983 60984 IVLAPSFVFWSAVQILKLRLGLLSPAAWLTFLFAASYARVQYRGFLQRQEARRRG 61148 61149 GVLPPEVVGRWPGNIDILIKLGKASLTAYPGSFYLDLFEEYQSTTLNLKLLWSDLVRCLS 61328 61329 FCRLSAVLKTLSQIITMDEEHIKHILTTGFNHFWRGRRQKERM (2) 61457 ex1 61557 YAPSGASRRHDTDSQGDVSQEWKK (0) 61628 ex2 61691 HRALARPFFARDRISDFDLFEKYAGATLGILGGLAGRGAAVDVQDLYARFTLDAAAEFLFG 61873 61874 ERLDTLHGALPVAGQAKLGSKGAATDDAFGAFVRAFEASQDIITTRQVRGYFWPVREL 62047 62048 FQDKVAPHAAVIGAFLEPIVQRTLDRKAKMRAAGVSPTTEHDTFLDYLADHTE (1) 62224 ex3 62264 DPKVIRDQLLNILMAGRDTTACLLTYVTYVMAMYPDIMQKMRQEVLHV 62407 62408 CGHDAPNFEKLKALRY (1) 62455 ex4 62512 VHAVLNETLRVFPPVPMNVREVRARGVVLPHADPTYAAAPAPLYVPGGTVVMYLPVLTQRNT 62697 62698 ALWGDDADVFDPDRWLDARLRRFTENPMMYTPFSGGPRI (0) 62814 ex5 62879 CIGQNYARNEATYLLVRLLQQFDAVALAPEAQ 62974 62975 PAGSLPPPEWRHARGRAAEERIWPAYAITLYVKVRLSLQWLYC* 63106 ex6 >Scaffold_232 42% to scaffold 7 gene model complete all boundaries checked 22610 MALPPGLQYLLPQLPLLLAPPAAVLLAAHAARAFAGTAAPAWALALACVLSWPVALTALV 22789 22790 QLRAHRVAREAAARGARLPPAVEARYPGGVDLMRRNNSEVEEHIP (1) 22891 GYRLSEFGRQYGWTYNFRMLFQDRVRGRPRPRPPG (0) 23199 RILATDFTSYEKGAVFSAQMKSLLGTGVFNADGDLWK (2) 23318 two introns missing 23351 FHRAMTRPFFSRDRISHFDVFDR (2) 23437 HAEDALKLAKARLSEGVPIDWQ (0) DLVSRFTLDSATEFLFGQDVRSLSAPLPHPPTAPQA QHDTHDAEHPANRFAHAFLQAQLASARRSRYTAAWPLWEFWE NKVEKHTRVMDEFIQPLLRDALARKAKGADAQAEEAVADGETLLEHLVKLTD (1) DPQIIHDETLNILLAGRDT (0) TAITLTMAGYMLAEHPDILQRLRKEILDTVGTRRPTYDDIRDMKYLRAFINEVLRMYPPV (2) PFNVRFSTAPTVWPSPEGDFYVPAGTRCMYSVFVMHRRKDLWGPD (1) ADKFDPDRFLDERLGKYLTPNPFIFLPFNAGPRICLGQQ (0) FAYNETSFMLIRLLQRVSKIELHPEVSPQSVAPPGWAASSISD GKDKVVFKSHLTMYVQGGLWVTMQFENPEEH* 25024 >Scaffold_293a 98% to CYP63A1 SUBMITTED SEQUENCE PROBABLY SAME GENE 100% IDENTICAL TO AF005475 FROM BEGINNING OF EXON 9 (TAGT) gene model complete all boundaries checked 27142 MGLTQAQRLVLGQLARLVAPALAVCVLLAAARRTQLVRAPVWADALIALIAIPLFHVGRA 27321 27322 HWRYARLARKAARLGAALPPRWEGKLPGSVDVLQLVDEAYRHGFLS (1) 27459 27508 DYFYEKFGELGHTYNFYVLWDMDYCTEDAAVIK (0) 27607 27658 AVLATDFNNWVK (1) 27693 27747 GERFDSYMHSVLGTGVFNAD 27809 (1) 27860 GELWK (2) 27874 27923 FHRSMTRPFFARERITDFETFNRHAEEAILKMKERLREGFAVDFA 28057 (0) 28110 DLISRFTLDAATEFLFGACVHSLAGALPYPHGAPAHLHTTRARIPADDFAAAFRAAQDAV 28289 28290 SHRARLVWLWPWFELARSRTDAPMRTVDRYLTPIIERALAMSRAAKQAPQGEKEEVADGE 28469 28470 TLLDHLARYTT (1) 28502 28555 DPTILHDEILNIMIAGRDT (0) 28611 28663 TAGTLTFVVYFLTQHPNVLQRLRQ 28734 28735 EILDVVGPSNLPTYDDIKQMKYLRAVLN (1) 28818 no GT boundary here GC instead 28868 ETLRLYPPV (2) 28893 28953 PWNMR (2) 28967 29022 YAVKDGILPNSEPDGKPWFIPAGAS (2) 29096 29151 VSYSVHCMHRRKDYWGPD (1) 29203 29250 AEEFDPDRFLDERLHKYLTPNPFIFLPFNAGPRICLGQQFAYNEMSFFLVK 29303 29304 LLQTFEDISFERDAFEPNALPPAEWAKFPGRKGKEKFWPRAHLTLYSE (0) 29552 29597 GGMWVKMREAQAMGPQVA* 29652 >Scaffold_293b Length = 30818 sequence runs off end continues on 301b gene model complete all boundaries checked 29971 MLVSVDALALRTLVYELTYLLYPAVPTAAALVLLQRLGSVWLPTWTIVLLSLCSVPVAHRILVWL 30165 30166 KDWRAARKAASMGAILPPRLKGRWPGSIDLLRQLTKTFETGFLS (1) 30297 ELLWDYMHVLGQTFEVYILWDSNYVTSDANVIK (0) TILATDFDNFVK (1) 30589 GEKLDVCVRPVLGTGVFNSD (1) 30648 GEMWK (2) >Scaffold_301b = 92% TO SCAFFOLD 301A 100% IDENTICAL TO 63A2 EXCEPT AT ONE INTRON JOINT gene model complete all boundaries checked sequence runs off end sequence appears to continue on 293b YFVK (1) PROBABLE FRAME SHIFT AT BEGINNING OF EXON 4 28695 XEKLDVCVRPVLGTGVFNSD (1) GEMWK (2) FHRSMTRPFFTRERISHFDLFDRHA (1) 28444 28388 DATMAKMKARLAEGFAVDFQDLISRFTLDSATEFLFGQCVHSLASVLPYPHDAPAHLQ 28215 28214 TTGASRTEDFARAFAEAQDAVSFRLRMGWLWPWFELFGSRTKAPMAVVDAFLDPILRDAV 28035 28034 ARADKIKRENGGRVPEVKGEIEEDETLLDHLVKYTN (1) 27927 27871 DPKILHDEVLNIMIAGRDT (0) TGGTLTSAVYFLSQYPEVLRR 27695 27694 LRKEILEKVGPTRRPTYDDIREMKYLRAFIN (1) 27584 ETLRLYPAV (2) PWNVR (2) YPVKDTTIPGPHPDKPYFIPANTP (2) 27319 VSYSVHCMHRRTDYWGPDAEAFDPD 27193 RFLDARVQRYLTPNPFIFLPFNAGPRICLGQQFAYNEMSFFVIRLLQHFDEVQLCEDALA 27014 27013 PDCRVPDAWRGAPGRKGVERFWPKAHLTLYAK (0) GGLWVKMREASTSEAVV* 26810 >Scaffold_301a CYP63A3 85% TO PC-2 CYP63A2 BUT ONLY 4 DIFFS FROM KILH ON gene model complete all boundaries checked 26344 MPSSIDFPDRLVLRVIAYELVFLFYPAVPAAAGLVLLRRLTDIWLPTWAIVLLSVCSLPVVHGLSIWR 26138 26137 NHWRAARKAARMGAVLPPRLKGRWPGSIDLLMRLTDAFETGFMS (1) 26006 25949 DLLWEYMHTIGQTFEVYVLWDSNYVTSDANVVK (0) 25851 25697 AILATDFTSFVK (1) 25662 25698 GKKFDVCMRSVLGTGVFNSD (1) 25639 GDMWK (2) FHRTMTRPFFTRERISHFDLFDRHA (1) 25433 (EXTRA INTRON HERE) 25379 DDAMAKMKARFAEGYAVDFQ (MISSING INTRON HERE) DLISRFTLDSATEFLFGQCVHSLASVLPYPHNAPAHLQ 25206 25205 TTSASAAEDFARAFAEAQTVLNFRIRMGWLWPWFELFGSRTKAPMAVVDAFLDPILKAAV 25026 25025 ERADQIKHENGGKVPEAKEEIDEDETLLDHLVKYTN (1) 24918 24855 DPKILHDEVLNIMIAGRDT (0) 24805 24748 TAGTLTSAVYFLSQYPEVLRRLREEILEKVGPTRRPTYDDIREMKYLRAFIN (1) ETLRLYPAV (2) 24515 PWNVR (2) 24380 YPVKDTTIPGPEPDKPYFIPANTP (2) 24318 24258 VSYSVHCMHRRTDYWGPD (MISSING INTRON HERE) AEAFDPDRFLDARVQRYLTPNPFIFLPFNAGPRICLGQ 24091 24090 QFAYNEMSFFVIRLLQHFDEVQLCEDALAPDCRVPDAWRGAPGRKGVERFWAKAHLTLYAK (0) 23908 GGLWVKMREAPTSEAV* 23800 >Scaffold_115a GENE MODEL COMPLETE exon boundaries all checked 17359 MTPSHSFPDLVASCISAWTLCFALGFSIAAASFYSLFGPFN (0) 17237 ex1 17188 LHHIPTVGGSSIPLISHRGARKYMRDAKGVLQDGYK (0) 17084 ex2 17018 HKGKAFKVALTDRWLVVITGKRLVDELQKMPEDVASFVGAVAD (0) 16890 ex3 16837 FQGLRYIFGQKVLDDPFHVNIIRSHLTKHLSSVFGDICDEIYVAFSELIPQQDEE (1) 16673 ex4 16616 WVPVHAIQVVRTIVARASNRVFVGLPVC (1) 16533 ex5 16483 RNAGYLSLAVNFIVDVAKARDFIALFPPVLKPLAAKMTSDIGTRVQEGMQYLEPLINERL 16304 ex6,7 16303 RLMEKFGKDWTDKP (0) 16262 ex6,7 16206 NDTLQWMMDGIMERDGTIEQLVRIVLLENFSSIHSSSN (0) 16093 ex8 16037 TFTHALYHLAANPEYITPLREEVETAISEEGWTKAAMSRLRKVDSFLRESLRLNGINP (1) 15864 ex9 15807 VSMQRKALISFTFSD ex10 15762 GTYIPKGTILVTPALATHHDEDNYEDATTFKPFRFVGENPEDDVPLVTTSADFVPF 15595 ex10 15594 GHGRQA (2) 15577 ex10 15514 CPGRFFAAHQMKAMMAYLVLNYDVKFE 15434 ex11 15433 NEGVRPQNVHGVLSVQPDPKARVLCRRRKSSYT* 15332 ex11 >Scaffold_115b 48% to scaffold 297 GENE MODEL COMPLETE exon boundaries all checked 20897 MSLHQVYDAVVAHGDMGTLLVYMAYSVPLVMFLYSLFSPAS (0) 20775 ex1 20722 LRHIPTEGGPSFPLLSYKAARAYLRDATGILQRGYDK (0) 20612 ex2 20557 HKGKPFKVAMPDRWVVVLTGKKLVDELQRLPDDAVSFIKGASD (0) 20429 ex3 20374 LSGTEHMFGRQVIDDPFHVPIIRTHLTKNLAPMFSDVFDEVSIAFQELIPACDGE (1) 20216 ex4 20160 WVPVHAIKVARSVVARTSNRIFAGLPIC (1) 20077 ex5 20024 RHPEYLNLVINFTVDVAKGRYALLLFPPALKGIAAKILTNIDGRIKEGLKYLGPLIEQRM 19845 ex6,7 19844 ALAEKFGNDSSEKP (0) 19803 ex6,7 19746 DDMLQWIIDEVRARNQSVFEVVRTVLLVNFAAIHTSSN (0) 19633 ex8 19582 SFTHALYHLAANPEFIAPLREEIETIVSEEGWSKAAIGKMWKLDSFMRESQRYNGINS (1) 19409 ex9 19353 VSVKRKALKPLTLSDGTFIPKGTVLVTPTVATHFDDDNYKNPTVFDPFRY 19204 ex10 19203 YREKEQDMSAVKHQFVTTSPDYVSFGHGKHA (2) 19111 ex10 19051 CPGRFFAANELKAMMAYVVVNYDVKFEK 18962 ex11 18961 EGVRPENIYAAMGISPDPNARVLFRKRESIVSV* 18860 ex11 >Scaffold_115c pseudogene fragment GENE MODEL COMPLETE exon boundaries all checked 46242 NDLLQWIMDEAVARDKS (frame shift) QEEIARMVLFLN (frame shift) FGAIQTTSC (0) ex8 46591 VSMFPNILTPVTLSDGTFLPAGTTVVTPVLATHYNEDNYTNAALFDPF*SCKDNRS 46758 ex10 46759 GGQQFVRTSADCVTLGHGRML 46815 ex10 (phase 2 GT missing) 46882 CPGRFFAAAELKTLVAYVLVNYDLRPETEGVRPVNIYKGLTV*PSETAKVLFKKRQTDE* 47061 ex11 >Scaffold_115d GENE MODEL COMPLETE exon boundaries all checked exon 3 GT boundary wrong 47583 MVLTTDFGGISTTHVVIGAFVTWLLLRYFSAKN (0) 47681 47736 LRHIPTVGGPSVPILSIVALYNFLANGKKVVLDGYQK (0) 47846 ex2 47898 YKGKAFKVALLDRWLVVLCNPKLVEELQKLPEALVGYSLLEAX (0) 47993 ex3 probable frame shift to GT 48098 IFETKHIFGADLLTDPVHLVLLRTLTRNLGQVFGDMYQEVETSFQELVPANEKE (1) 48241 ex4 48303 WLPVHASPIMRTIVTRAANRVFVGVPVC (1) 48386 ex 5 48441 RDEGYLHLMVHFAEDVNKAFGLYTVVPSFAQGFVARKAKAVMDDCIERCLGYTRPTIKDR 48620 ex6,7 48621 TTMMDSFGDNWADKP (0) 48665 ex6,7 48723 NDMIQWTIEETKARGQGEYDMARMLMFINSGAVETTSQ (0) 48833 ex8 48893 AVIHALYDISVRPELADELREEVERAIAEDGWTKDATNKMRKLDSFLRESQRINGPMI (1) 49066 ex9 49123 VSMFRLVREPVTLSD ex10 49168 GTFLPAGTTIASPTLGAHFDDSIYPNASTFDPLRFYKAEAAGQPQFVTTSPEYLTFGHGKHA (2) 49353 ex10 49408 CPGRFFAVNELKAILAYMLMHYDIKPEQDGVRP 49506 ex11 49507 ENKSMGLGVLPNPDAKVMFRKRHAG* 49584 ex11 >Scaffold_115e GENE MODEL COMPLETE exon boundaries all checked 50547 MSSMNAPALPATHIVAGAILVWLLVRTFGTQN (0) 50642 ex1 50697 LRHIPTEGGPSLPIISFLGLHAFLTRSREILEDGYQT (0) 50807 ex2 50857 HKGRAFKVALIDRWLVVLSGKKLVEELQKMPDDTVESATTE (0) 50979 ex3 51041 MFNMQHVFASNWHKDPVHSSLLRSLTRNLGVVFSDMFDELDTAFRECVPANAER (1) 51202 ex4 51255 WLPVQAHTTMASIVTRAANRIFVGLPVC (1) 51338 ex5 51397 RDAGYIHMMIHVAEDVSDAVRTLSMLPTFMKPFVARRATVIDQRIQQCLDYLRPAIADRM 51576 ex6,7 51577 SMLERFGKDWEEKP (0) 51618 ex6,7 51670 NDVLQWIIDEVTARNQGEDEVARIVLFINFGAIETTSF (0) 51780 ex8 51835 AVTHALYDIVSRPGLADVLREEVEAAVATEGWTKAATNKMRKLDSVLRESQRLNGPTT (1) 51996 ex9 52075 ASMFRRVLQPVTLSD ex10 52116 52117 GTYLPAGTTVVTPTLATHFDDTNYADAQTFDPLRFYKPDGVQAQLVTTSADFVTFGHGKHA (2) 52299 ex10 52363 CPGRFFAANELKAMMAYILMHYDIRPEREGVRP 52461 ex11 52462 ENVYRGLNVLPDANARVFFRRRQTD* 52539 ex11 >Scaffold_137a GENE MODEL COMPLETE exon boundaries all checked may be seq error at the end of exon 4 GGT replaced by GGG no GT boundary for exon 4 MARSDILDALSLGRTELSTTYLVFGLFLALFLYSLFSPAS (0) ex1 LRHIPTVGSSSLPLLSYKGAYDFLRDGRSLLQRGYNQ (0) ex2 YKGKVFKVAFTDRWLAVVTGRKLVEEVQRLSDDVISFPDASGE (0) ex3 VTGFKYIFTKCALSRDPFHVNMIQRQLTKHMSVAFDDLHDEFETAFKELLPHNETA (1?) ex4 bad GT boundary WVPVHAIEVARKVVARASNRIFVGLPVC (1) ex5 RDKAFLDLMVNFTLDVARARDLLALFPPALKPFVAKLVVKLDSRIEEGMQVL ex6,7 RPIIQERMEIIEKFGKDSPEKP (0) ex6,7 DDMLQWIIDALVERNEPMEQVVLITLFVNIAAINTSSN (0) ex8 SFTHALYHLAARPEWIAPLREEAEAVIGNEGWTKNAMGKLVKIDSFMRESQRYNSIVP (1) ex9 LTCMRKALQPFTLSD ex10 30210 GTHIPRGTILVTPAIATHFDDEHYADAASFDPSRY ex10 VPVADAKQGGAPKQYVTTTAEYVPFGHGKYA (2) ex10 CPGRFFAGTELKAMMAYLVLNYDVKF 29873 ex11 AQEGVRPPNAATTLSTRPHQEARVLFRKRNSSVQ* ex11 >Scaffold_137b GENE MODEL COMPLETE exon boundaries all checked MAFSDVVATVGAGPWVAYMMCAVLLALLLYSLFSPAS (0) ex1 LRHIPTEGGSSLPLVSYLGAYNVLRNLQSVLQRGYDK (0) ex2 HKGKAFKIALPDRWVVVLTGKTLVDELQRMPEESASFIDATTE (0) ex3 LTGFGYIWGPRMRKDPCHVPIIRNQLTRQLSSAFGDIYEEIELSFQGLMPACEKD (1) ex4 WTPVHVIEVARDVVARASNRVFVGLRVC (1) ex5 RNPDYLDMLVDCAVSVASARNTLMLFPFVLKTFAAKNVVNMDRRIRRGMQHLGPIIEE ex6,7 RMSLLRSLGNDWPDKP (0) ex6,7 DDMLQWIIDEVAARQMPKEDVVRNIMFLNFAAIHTSSN (0) ex8 SFTHAIYHLAANPDYLGPLREEVEAVTAKEGWSKTAMGRMWRIDSFLRESQRVNSINP (1) ex9 44032 LTVIRRTRTSLTLSD ex10 44077 GTFIPEGTVVAAPAYPTHFDDENYVGGDTFDPWRY ex10 VREKEQDLSPSKHQYVTLSPEYVPFGLGKRA (2) 44274 ex10 CPGRFFAANELKAMLAYLVVNYDVKFEK 44418 ex11 EGVRPENMHVGLTISPDPAAKVLFRKRRS* 44508 ex11 >Scaffold_137c GENE MODEL COMPLETE exon boundaries all checked MASNHLFSGVLPLDRAASTLGYLVCGALLALLLQNILTTIS (0) ex1 LRHIPTVGTSTLPLLSYKGAYDFTRDIKGVFQQGYAK (0) ex2 YKGRAFKIAFTDRWFVVLTGRKLLEELHRLPDSTTSFNHASGS (0) ex3 ITGSTYIYGRGWLSDPWHIPIIRDRLTKHLAASFGDMYDELETAFRELMPSCEEE (1) ex4 WVPVHFITMARTVVARTSNRVFVGLPAC (1) ex5 RNLGYHTLLVNFALDVSKARNRLAWLPPALKRVAARALTRIDSRIEEGMQYL ex6,7 GPTIRARIVEMERYKGDWPDKP (0) ex6,7 NDILQWIMEELIARKMPMEEAVRIILRINSSAVQTTAN (0) ex8 SLTHAIYHLAANPDLIAPLREEVDAVITDEGWTKLAMSKLSRLDSFMRESLRLNIVNP (1) ex9 LSVRRMALKSFTFSD ex10 46572 GTFIPKGTLMVTPAHATHLDEANYEHASVFDPWRF ex10 VHQKEEDLSPTKHQFITTSPEFVAWGHGKHA (2) ex10 CPGRFFASNELKAMMAYIILNYDVKFA 46910 ex11 RAGVRPDNVYSGLTVAPNQEANVLFRRRQTQ* 46956 ex11 >Scaffold_52a GENE MODEL COMPLETE exon boundaries all checked 10827 MLVLLGFVSLVLFLVYRRSVRASRGRLPPGPPADPIIGHMRVFPRANHGEVFHQWSKQYG (1) 10651 ex1 10594 DVLHLDVLGKSIIVLNSQEAANDLLDKRSANYSDRPEFPAFNL (2) ex2 10414 LGWDSMLVFLRYGPAFLRQRRLMQQPLTRTGVVVFRPVQLQQCHVLLKNLLASPKDFDAHLRR (2) ex3 RFASAITLEMTYGHKVSSDDDAYLDIADKVNVVLTKMSKAAILDLFPRAKH ex4,5 fused LPSWFPGAWFIRYAN (1) ex4,5 fused DHRHLIWEMASKPFEQVEQQL (0) ex6 9796 AAGTAQPSFVSMHLEEMHRQNTHDADNVSALKTAAAHMWTGGEET (0) ex7,8 fused 9605 STLLIFVLAAVRNRDAVRRAQAELDRVLGPGRLPTFEDMDALPYVEAFIKETIRFHSALPL 9423 ex9,10,11 fused GIPHRAMADDVYRDMLIPKDATVLVNST (2) ex9,10,11 fused 9285 ALARDPAAYSTPERFWPERFLPPHSEPPPVGLGFGWGRRVCPGRHLAEASLWIVV 9121 ex12,13 fused 9120 ASMLAAFDIAPVQDAHGRDAPPELRFTQAIT (2) 9031 ex12, 13 fused SHPEPFECSITPRSEKVAQLIMQL* ex14 >Scaffold_52b GENE MODEL COMPLETE exon boundaries all checked MGFLTALLLVFVLLCVALVRAVRRRRARPPYPPGPPADPLIGHIRIMPSTDNAHEVFHDWAQQYG 18706 18707 DVMSLDVLGTRYVILNSAEAATDLLEKRNSKYADRPTFPMYER (2) ex1,2 fused 18891 VGWKDTLVFLPYGPYFRKQRKMLQLPLEKERVTDFRHIEEQETCVMLYNILSDPDNTDAFVHR (2) ex3 19129 RYTTAVTMELAYGHRVVSNDDEYLKAADMVIDVLRSVTRPSLLDVSPI 19269 (1) ex4 FEYLPAWFPGAWFVKCIK (1) ex5 EIKPVVLREIQHPVSVVQQEL (0) ex6 MAGTAKPSFVSQQLEDLSRENGLSQEDLYTVSMVAHQIF (1) 19663 ex7 GGSET (0) ex8 19790 GWHTIMTFIACMLTNPDVQRKGQEELDSVVGRGRLPDFTDRDSLPYIDCIVKETMR (2) 19945 ex9 20000 WQPVVPL 20020 ex 10,11 fused 20021 SVPHKAMEDDEYRGMHIPKGATIIPNAR (GT boundary missing AGGT = AGGC) 20098 ex10,11 fused GITWDERHFHEPRTYKPERFLPRPQGAGEVFPQGAVFGWGRR (2) ex12 20341 LCPGRYLADDVVWLAIARILAVFDIQKAVDADGNVVEPHIEFTTVLT (2) 20478 ex13 20541 SHPKPFPCSLRPRSEKAAELVRQAYDMHMANVAV* ex14 >Scaffold_52c GENE MODEL COMPLETE exon boundaries all checked MGFLTALLLALVLLCAIWVRAVRRRRGRLPYPPGPPADPLIGHIRIMPSTDVAHEVFHGWAQQYG ex1,2 fused 21628 DVMSFSVLGTRYVILNSAEAATDLLEKRNSKYADRPTFPMYER (2) ex1,2 fused 21810 VGWKDALVFLPYGSYFRKQRKMVQLPFEKEKVTDFRHIEEQESCVMLYNIFSDPDNRDAFVHR (2) ex3 22044 RYTTGVTMELTYGHRVVSDEDEYLKAADMIIDVLRSVTRPSLLDVSPL (1) 22184 ex4 22247 FEKLPAWFPGAWFVKCIK (1) 22282 ex5 KTKPVVLREIQRPVSVVQQKL (0) 22406 ex6 MAGTAKSSFVSQHLEELSREKGLSEEDLYTVSMAAHQVF (GT boundary missing) 22579 ex7 GGTET (0) ex8 22710 AWHTIMTFIACMLTNPDVQRKGQEELDRVVGSGRLPDFTDRDSLPYVDCIVKETMR (2) 22865 ex9 22918 WQPVVPL 22938 ex 10, 11 fused 22939 SVPHRAMEDDEYRGMYIPKGATIIPNAR (GT boundary missing AGGT = AGGC) 23016 ex10,11 fused GITWDERHFHEPRTFKPERFLPMPQGAGEVFPQSAVYGWGRR (2) ex12 23254 ICPGRYFADDMVWLAVARILAVFDIRKAVDADGNVVEPRIEFATVL (2) 23391 ex13 23456 SHPKPFPCSLQPRSEKAAELIRQAYEMHMANVEA* ex14 >Scaffold_52d pseudogene GENE MODEL COMPLETE exon boundaries all checked ICPGH beginning of exon 13 25670 SHPKPFPCELRPRSARAAALINEAHEMQLTNAEA* ex14 >Scaffold_52e GENE MODEL COMPLETE exon boundaries all checked MGPTAFVAILLCAVLLVQAVRRRRTPLPYPPGPPADPLIGHLRIMPDTSTAPEVWHSWSRKYG ex1,2 fused 26391 DVMSLSILGKRVVILNSEEAVTELFEKRGAKYADRPSYPLYER (2) 26495 ex1,2 fused 26566 VGWKDALILLPYGTYYRKLRKMLQLPFEKDKAPNYRHIQEQEACVLLHNFLRDPSSVESPIHR (2) ex3 26808 RYTAAIIIEIAFGHRVLSDDDEHLKAADMFVEVQHGAGRPSLLDVSPI (1) 26948 ex4 FEKLPSWFPGAWHVRYIK (1) ex5 EKRPMILHAIQHPVSVIRQQL (0) 27169 ex6 27229 VDGTANPSFVSQQLNDLIREGGLTPENQYDLSIVAHMIFG (1) 27342 ex7 GGSET (0) ex8 27459 TWNTLTTFIACMLMNPEVQRKGQEELDRVVGRGRLPDFTDRDSLPYIECVMKETMR (2) 27620 ex9 WHPVAPL 27691 ex10,11 fused 27692 AMPRRAIEDDEYHGMYIPKGAMVIANI (GT boundary missing CGGT = CGGC) 27772 ex10,11 fused SLTWDERRFHDARSFKPERFLPKPEGAGEVLPPSFAFGWGRR (2) ex12 28004 ICPGRYLADDVVWIAVARILATLDIRKPKAADGSIIEPRIEFEAALT (2) 28141 ex13 28202 SHPKPFPCEIRPRSDKAAELIKQAYDMHMASVET* ex14 >Scaffold_180 similar to AT002920 Pleurotus ostreatus cDNA. seq contines on scaffold 519. Joint bewtween ex2 and ex3 not well supported gene model complete all boundaries checked MLLGTTYL 1241 ITLLTLLSVGIAPLFTLWRRKRTSEAYLPPGPPALPFVGNLFQLPRKSVPRTFATMSQQY (1) exon 1 GPLYYMRVINRHFVIVNDLDLARILFDKRGAIYSHRPRL (0) ex2 PMAQEVVKRDTMLFMNYGPEFRKSRKLVSTFLNQRNASKYWPAQEIESLKFVLAVQRNPSDWLKLTRW (2) ex3 TATSLVIRLLYGIEVQDKDDALVGLAEDFARLTTETTEPGRWLVDAFPI (1) ex4 LRHVPAWLPGAGFKRWAKRAKARMDEFATLPYVMAKDKI (0) 266 ex5 EKGDITPCWTAEKLLETTEPLTEQDEKEIRHTATSMYS (1) ex6 GGSDT (0) ex7 >Scaffold_519 sequence runs off end. seq continues on scaffold 180 gene model complete all boundaries checked GGSDT (0) ex7 216 TNAMVATFILLMLHYPEVQKKAQEEIDSVTGGTWVPGMRDRERFPYINCLVKELFR 392 ex8 FSPAVPLVPHSLHEDDVVEGYLIPKGSWVMANMW (2) ex8 532 AFMHDEARYPDPETFTPERFEARPGVEPQDDPLDIVFGFGRR (2) 681 ex9 713 ACPGYLLGVASVYLNIVHLLFAFDIAPVKDAAGASVLPPIEFSDGHVA (2) 847 ex10 HAKPFECDMRERSAERIALIEHTAHSMQ* ex11 >Scaffold_15a 35% to CYP61 missing top of gene in run of NNNNNNNNNN gene model complete all boundaries checked this might be a pseudogene 3229 GMLIVIALPQTSGIIQW 3280 FLALIGKHPDIQARAHHELDAVVGRDRWPMAEDEKDLPFIRAIIKEVLRVHAPFWN 3448 ATPHSSTEDFVYNGMYIPKGAAVILNCFTLHHNEARYPDPY (0) TFNPDRFLGDDLSSAESAHQANAMERDHWSFGAG (2) (AG boundary missing near TFN) RRICPGINVAERILFLAISRLLWAFTVHELPTEPISLEEYEGTSGRTPVPYRV HLLPRHERVHAALEAEQEIAACEL* >Scaffold_15b N-term first exon not too well supported similar to 249 gene model complete all boundaries checked MITVNRPSTQDSVIATVACAI (0) 54042 IVYFVCKRWEPWRLAVVVPLTMLPPLVLSLPLAASVGFLNAVITTSSTFASALVAMLTF YRLSPLHPLSQFPGPLQCKISGFWMAWIVSRGKRHVYIQSLHEKYGDYVRI (1) 54380 GPNEVSINDPTAIPQILGSYGWPKASGMSGRALHQDPLPLISLLDDAEHSRRRKSWN RAFSSAAVREYQPVVAQRATQLVQALQEQRSVVDLSQWMSYFT (2) 54727 54859 FGGGTEMLREGDKAGLWQLLKDGM (2) 54930 SAIFEHVPWLSHYILHFPALIRSLTDLRALAFGRSKLRYERGSTTKDLFYYL (0) SNEDHADAEMPPMKVVISDAVLAIVAGSDTTAVTLSNIFYYLISHPDTYRRLQEEVDKYY PPGEDALNPKHYVKMSYLDAVL (2) 55502 NEAMRLYPALPSGSMRTPAKGSGGQVVGQR (2) 55585 55653 FIPEGTQVRVHPYTIQRDPRNFSYPNRFWAERWMIASGNQSHHEKIAHNPDALIPFS 55823 55824 TGPRNCVGKNLALLEMKMVTCHVTQRLSLCFADGWDPSRWWDDLEDVFVSRMGQLPVIVRSR* 56012 >Scaffold_15c gene model complete all boundaries checked 162800 MPLLLVDIAALVAGLVLLFMLDSWRRKGQHLPPGPPGLPFLGNILQIPRKKEFIVFRDLGNIY (1) 162633 ex1 162550 GDIVTLRVPGQIFVILNSRKAVADLLDARSQIYSDRPRTIMCKDL (2) ex2 162324 FRDCRKLLRKGLGPSAVQSFIPFLNRQSAFYLENLQKRPEAFVDIFK (2) ex3 RNAAAISMKIAYGYDGIQDDEELYGIGAMANHYFAETAVVGVWPVDMLPI (1) ex4 161929 LRHVPQWFPFAYFKKYAAQAKPIVLESVNRPFEETKRHM (0) 161813 ex5 KRGTAGGSFTSMLLEDAKGDPETEDCIKWSGTGIFLGQMDT (0) ex6 161576 TTSALSWFFLSMVLHPEVQAKAQAEIDKVVGNERLPRFEDKESLPYVSAVMQEVFRWHPVVPM (1) 161394 ex7 161342 IPHALSKDDEYRGYFIPAKTSLIGNIW (2) 161262 ex9 161201 AIMHDESLYPDAEDFRPERFTEDGAPDCLNVAFGFGRR (2) ex10 VCPGILIAQAHVFVSIATTLATFNITKARDAQGNIIEPVVEDTPGAI (2) 160893 ex11 160833 NFPQPFKVSLEPRSAAAADLIRRSAEHSKTLPERLEIFSLDA* ex12 >Scaffold_15d gene model complete all boundaries checked 170601 MLLQVLAAIAALFVLSGLLNSRRRNMHVPPGPKALPLLGNVLDIPKKDVHVTFRDWANIY (1) 170771 exon 1 GKDLMKVEMPGETLYVISNKKVMVDLFEARSAIYSSKPTMTMAD (2) SSGFKNSIPLLPYNARLKTSRRLLKQGLSPAAVRSYFPYINNRTALFLEALLKDPDDFVRHFT (2) 171205 ex3 171268 RTAAHTALKIAYGYEGVTEDHHLLHTAIETMEIFATVVNPGRWLVDTLPI (1) 171435 ex4 LDRIPVSFPFANFKRVQEESRPVVFETVSKPFEEVKKHL (0) AEGTADGSFSSYLLQSEKPDPETEDCIRWAATSIFLGQFDT (0) 171825 TTATLSWFTHAMVKFPEVQKKAQEEIDRVIGNDRLPEIQDRDSLPYVNAIMKEIFRWQPIISM (1) 172007 ex7 172057 LPRSVVQDDEYNGYFVPAGTYLLANIW (2) 172137 ex9 172184 AVLHDPEVYPEPEKFMPERHLKEGVPNPLDVTFGFGRR (2) 172297 ex10 172344 VCPGMQVAQSQTFGTMAAMLATLHLKPQKDEHGRDIIPETRTVDGLI (2) 172490 ex10 172544 RFPVPFKCAFVPRSEAALKLIERGAEHARSVPDRLERWSD* ex11 >Scaffold_15e gene model complete all boundaries checked MVQASDTLLSV 173099 VVCAALITLTSFLLSGYRKRNAHLPPGPKPLPIIGNLHQLPDLKADRAVAFRDMSLAY (1) ex1 173328 GSDILCVKVPGMLMYILNSKESMFDILVTRSAKSSSKPPQVMAD (2) ex2 173522 LSGWKYTVPSLPYGQRIKTSRRLLHKGLGPSAVQSYIPYLERESAFFLEKLLDQQDAYKKHVT (2) 173701 ex3 173777 HTAARIALKIAYGYEGVTDDAHLIDTAVKAMNIFCVTATPGIWLVDSLPF (1) ex4 173974 LQHMPSWFPGTGFKKQASQWSQTVLYAINHPFEELKRQM (0) 174090 ex5 AAGTAGASFAGRLLEVEDLSDPEVEDCIKHCSTGIFAGQFDT (0) ex6 174324 TTAVMSWFAVVMAFYPEIQKKAQDEINKVVGHERMPVVADRDSLPYVNAILKELLRWRPVLPL (1) 174506 ex7 174556 IAHSVNEEDEYKGYYVPKDTVILANVW (2) 174642 ex9 174692 AVLHDESNYDEPEKYKPERFLRDGILDPSVLDPATLAFGFGRR (2) 174820 ex10 174885 ICPGMHIGQTLLFILMSRTLQNFDIAPAKDAHGREIAIDTSAVPGLI (2) 175025 GFPKPFKVSLVPRSNAHATHIRHAAEHARSLPDKLAIFEL* >Scaffold_15f gene model complete all boundaries checked MLDVGATAAAGLLLVFLLGLGLMKTQH 178217 LPPGPRGLPLLGNVLQIPRRLPHVAFRDMGHKYG (1) ex1 178374 GDIVTLRVPGYNLLVLNSRQAIYDLLDSRSAVYSDRPQGT (2) 178499 ex2 178602 IYRKLLRKGLGASAVQSFIPFLNRQSALYLENLQSRPEAFVEITK (2) 178730 ex3 178796 RNAAAISMKIAYGYDGIADDEELYRLAHQTTIYFAETAVLGAWPVDMFPI (1) 178954 178997 LRFIPSWFPLAYFRRYAARARPVVVECINKPFEETKRHM (0) 179113 ex5 RLGSAGASFTSMLLGDANGDPDTEDYIKWSSAAIFLGQMDT (0) 179343 TTAVLSWFYLAMALHPEVQAKAQAEIDQVVGNERLPHIEDRDSLPYVCAVMREVFRWHPVANL (1) 179525 ex7 179577 VPHATDKDDHYRDYFVPAQTVAIANVW (2) 179657 ex9 179709 AVLHDEDVYHDADKFIPERFSEEGAPDSLEIAFGFGRR (2) 179822 ex10 ACPGKVVGQAHVFASIATVLATFNITKARDAQGNVIEPEVMDTPGAV (2) 180010 ex10 180073 NTPQPFKVNIEPRSEAAVDLIRRSAEHSRTLPERLEIFSLDA* ex11 >Scaffold_33c ex1 joint questionable gene model complete all boundaries checked 53267 MSDLITAHVLLPSL (0) IILIILATISYRLSPLHPLARYPGPILDKSTSLRLAYLAFIGQRAQYVTELHERYGKIVRI (1) GPNKLSINSLDVVHPIYGSSQAYDKSESYRPGLSAEGSIFFARKK (phase 0? boundary = GNNNNNNN) (missing exon 4 in a run of NNNNNNN) ELMRDGDPEGVVESGHKAILMFETY (0) CDSFGEVPALFDILSVLPTGEAYQLVEKRAATHLKDRIKVHPHDGWDMCSFF (0) LAQREGHNYPPMNEVDLNANTVVAFEA (1) GGDTTAGFIIITMFHLLRYRQAYDKLKEELDGAFPTGSVSVEEYSH 51364 LAELPYLGAVINEGLRLGAAFPSFPRVVPKGGAMLAGEFIPEGTMVGVPIYTQHYSPDNF 51185 51184 WPEPREFRPERWFEDGLGPGTITRQAAFMPFQFG (1) PFGCPGKALGLRLMSVVISNL (frameshift) 50965 LVLSYDLSFPPDFDPEAFLNGWINTRTNIFRIPLRVEAKRRPW* 50834 >Scaffold_33b ex1 joint questionable cannot join ex2 and 3 GT boundary missing see scaffold 3 for this joint (similar seq) gene model complete all boundaries checked 50259 MFNSFGPAHVLLPPL (0) ALSVIIAVAAYRLSPLHPLAHFPGSWVDKVTSLRVAYFALTGHRAEHVTSLHDKYGVVVRI (1?) GPNRVSINSSDVIYPIYASPQAFDRAASYRPGLIHDGSLLFSRKR (0) HWDGALKDRVDQLIDCIARRQDLRGVVDLG (0) EFMRNGDPHNICRSAKDSIVLFEVY (0) SSSLGEIPALFDIASVLPVTAEYRKVERHFQKHINERMQIKSHSDWDFCSFF (0) MAQREDAQYPPLSKPDLNADAMVAFEA (1) 48741 GGDTLAGFLSIIIFYILKHQPVYQKLRAELEQAFPLGEIAQDQYASLTEIPYLVAVINEGL 48565 48564 RLGANFAGFQRVVPQGGAVLAGQFIPAGTVVGVPAHLQHIHPDNFWPTPLEFRPERWFKDGL 48379 GPGTITRQSAFMAFQFG (1) PFGCVGKTFAYRQLNVVLSRLLLAYDLTFALDFDSKAFVEGWLN 48139 IRTTIFNYPLKVQASRRQW* 48080 >Scaffold_33a pseudogene fragment exons 3 to 6 91% identical to scaffold 112b gene model complete all boundaries checked VAAFIGSNTNFYVADADIIK (0) ex3 EITTHRSRLPKPIEQYKVLTFFGGNIVASVGEHWKRYRKIAAPTFSE (0) ex4 RNNKLVWDETRLTMQDLFTNVWGRHAKIHVDHAVDITLP (0) ex5 IALFVMS (frameshift) VAGFGRRIPWQDEDVVPAGHTMTFK (0) ex6 >Scaffold_388a very similar to 77 417 112 129 gene model complete all boundaries checked 3583 MISDTFALAISSGLSLFLCLKAFIDYRAGLRSI (2) 3684 ex1 3732 NHSYLPGFRALISSFGILGLFFKEPKRGLWGGRRRFWLRKHLDFEEAGVDIISH (0) 3893 ex2 3954 IAFLPSVSTYLLLADAAAIK (0) 4013 ex3 4069 EVTGHRARFPKPTYKTLRIFGGNVLASEGEEWKRHRKVVGPAFSE (0) 4203 c-helix ex4 4255 HNNRLVWNETVKIVNDLFANVWGSQSEVYVDNVVQSVTLP (0) 4374 ex5 4423 MALYVISIAGFGKRALWQADGNLPPGHKLSFQ (0) 4521 ex6 4576 DALHILGTDLWIKAATPTLLMNWAPTTRIANVKLAFDEVK (0) 4692 ex7 4747 QYMLELIQERRNSEKRDERYDLFSSLLDANDLNEDGNGNVTLTNDELL (1) 4890 ex9 GNIFIFMLA (1) 4973 ex9 split GHE (0) ex9 split 5087 TTAHTLAFTFGLLALHPDYQETVYQQIKSIVPDNRPP (0) 5197 ex10 MYEEMNSLTECMA (2) ex11 5351 YETLRLFPP (0) 5380 ex12 5436 TATIPKIAAEDTYLVTIDRAGNRVVVPVPCGTALHLNVIALHHN (1) 5564 ex13 5614 PRYWDNPSAFKPERFRGDWPRDAFIPFSTGSRSCIGRR (2) 5730 ex14 5780 FFETESIAILTMILSRYKIELRNDPRFADETYEERWQRVLRVKDGLTPA* 5932 ex15 >Scaffold_388b very similar to 77b 417 112a 112b 129 large intron between ex5 and ex6 There are no AG GT boundaries on exon 6 so this might be a pseudogene. gene model complete all boundaries checked MGTLAWVVLSFCLFYCVQKYLEFRAVVRSIH (2) ex1 DHPGFRTLLPPYGIFGFLFKRPIPGITRGGMSQWRGKYRDFEAFGMDIISA (0) 17296 ex2 TSVIPTARNAFLVADPAAIK (0) ex3 17133 EITSSRTRFPKPVAQYRVLTFFGANIVTAEGDEWKRFRKITAPAFSE 16993 ex4,5 fused 16992 RNNRLVWDETVKIMLDLFENEWAGKDTVVVDHAVEVTLP (0) 16876 ex4,5 fused 14857 WIALFVIGVAGFGRKMTWQEDSKLPPGHQLSFK 14759 (no AG boundary at WIA no GT at SFK) ex6 14700 EALHYVSTAVFVKLATPAWLLTWAPTERMRRTNLAFKELE (0) 14584 ex7 14518 QYMLEMIQTRRNSEKKEERYDLFSNLLDASEDGSDGHARLADEELL (1) 14381 ex8 GNIFIFLLAGHE (0) 14307 ex9 14247 TTAHTLAFTFGLLALYPEQQDKLYKHIKHVIPDGRIP (0) 14137 ex10 AYEEMNLLHESIA (2) ex11 13982 VFYETLRLFPP (0) 13953 ex12 13895 VTGIPKVAAEDTTLVTTDHSGNKVVVPVTKGTGISLHVPGLHYN (1) 13770 ex13 13710 PRYWDDPYEFKPERFHGDWPRDAFLPFSSGARSCLGRR (2) 13594 ex14 13549 FFETEGIAILTMLVSRYKIEVKEEPEFAGETFEQRKERILAAR 13418 ex15 GGLTLTYVCSPPHLRNLLNLPLR* ex15 >Scaffold_129 very similar to 77b 388b 417 112a 112b gene model complete all boundaries checked 23646 MFSNTFALAITSGLLLSCLKAYMDYRAALRSIN (2) 23747 ex1 23789 YHPGSCALIPSFGMLGLLFKEPRRGLWGGWRRFWRRKYLDFQEAGVDIISH (0) 23950 ex2 24011 IAFVPSVTTYLVLADAAAIK (0) 24070 ex3 24124 EVTGHRARFPKPSYEFFRIFGGNIIASEGDEWKRHRKIAAPAFSE (0) 24258 c-helix ex4 24306 HNNRLVWNETVKIVCGFFENVWGSQAEVYVDDVVQSLTLP (0) 24425 ex5 24479 MALHVISIAGFGKQTVWRADGTLPPKHKLSFQ (0) 24574 DALHVVSTDLWIKFVMPTMLLDLAPTKRIAKVKLAFEEVE (0)24750 24806 QYMLELIQERRDAEKRDERHDLFSNLLDANDSDENGDGSVKLTDEELL (1) 24949 GNIFIFMLA (1) ex9 split GHE (0) ex9 split 25150 TTAHTLAFTFGLLALHSDYQEKVHQQIKSIMPDNRLP (0) 25260 ex10 TYEEMHLFTECTA (2) ex11 25420 VFYETLRLFPP (0) 25446 ex12 25509 VTTIPKISAEDTSLVTTDRAGNRVVVPVPCGTSLHLSVVALHYN (1) 25643 ex13 25690 PRYWDDPYAFKPERFHGDWPREAFIPFSAGARSCLGRR (2) 25803 ex14 25869 FFETEGIAILTMILSRYKIELKDDPRYAHETYEERWQRVLDVKDGLTTT* 26018 ex15 >Scaffold_77a most like 52b frameshift in exon 2 gene model complete all boundaries checked 1298 MDPATVAVAVVCALAVMHVLTRRARTRLPYPPGPPEDPIIGHLRQMPNNDEAAEVWYRWAKQYGDVMSLNV frameshift 1512 LGKRLVILSSEEAATELLEKRSSKYADRPRFPIFER (2) 1619 ex 1,2 fused with frameshift 1672 IGWKDMVLLMPYGPYHKTLRKMIQVPFEKDKAFQFRDIQERATSIMLHNFLADPKGIEHHTHC (2) 1857 ex3 1920 RYVVSIIVEIVFGHRILSEDDEHLKIADVFVKIQHEASQPSLLDVSPL (1) 2063 ex4 2122 FAKLPSWFPGAWFVKYIE (1) ex5 DTKAILSHAIHHPVSIVQEQL (0) ex6 ASGIAKPSFVADELERLIKAGQLTPQNKYDVSIAAHMIFG (1) 2467 ex7 GGTET (0) ex8 2590 TWNTLTTFIACMLLNPEAQRKAQEEIDKVVGHGRLPDFTDRDSLPYVECVVKETMR (2) 2751 ex9 2810 WHPVAPVAVPHKATEDDVYRGMYIPKGAIVIANAR (GT boundary missing AGGT = AGGC) 2914 ex 10,11 fused 2980 SITWDERRFHDPHAFKPERFLPRPLGAGEDFVQGAVYGWGRR (2) 3102 ex12 3155 ICPGRHLAGDMVWIAIARVLAVFDIQKARDADGNAIEPNIEFTTAV (2) 3292 ex13 3362 HPKPFPCELRPRSEKAASLIKESYELHSID* 3457 ex14 >Scaffold_77b very similar to 388 417 112 129 gene model complete all boundaries checked 81731 MNSVLVILLSTILLLCLKTYVDLRTALRAV (2) 81876 NYHPGFKSFISCFGVFGFAFKEPRRGLIGGSLRFWHRKHLDFDEAGVDVIHH (0) 82031 82080 VSFFPRVSTCLILADPAVIK (0) 82139 82189 EVTSHRALFPKPLYHELRLWGGNIIASEGDEWKRHRKVGAPAFSE (0) 82323 c-helix 82380 PNNRLVWNETVKIMVDLFDNVWGSQDTIIVDHVVDAFTLP (0) 82496 82549 VALFVISVAGFGKNASWQSDLLPPSGHKLSFK (0) 82650 82697 DAIHVVSVDMFIQVVTPTFLWKLAPTKRIADVKLGFEELE (0) 82813 82869 KYMLEMVEERRNAPKKEERYDLFSSLLDASDSDADGGARLTDRELL (1) 83006 GNIFIFLLA (1) 83081 GHE (0) 83197 TTAHSLAFTFGLLAMHQDYQEKLYQHVKSVIPDGRLP (0) 83307 TYEEMNKLTECMA (2) 83459 VFYETLRLFPP (0) 83488 83553 VVGVPKVVAENTTLVATDFTGKRRAIPVAAGSDIHISILALHYN (2) 83678 83730 PRCWDEPHAFKPERFHGNWPRDAFLPFMAGPRACLGRR (2) 83840 83903 FFETEGIAILTMLVSRYKIELKDEPAFAHETYEERWDRLFTVKQGITLA* 84052 >Scaffold_417 gene model complete all boundaries checked MQQHLLFAAGLICLFLVKRCIEYRRAIRAI (2) 14022 HNYPGVRAVLSNSSGLGYLCKRSIPGLAVGGARLWVKRYSDFCRYGADIVSC (0) 14177 VAVLPRTEILLFVADPAAIK (0) 14341 EISSDKTRFSKPTELYELVNIFGRNIVTTEGDEWKRHRKIVAPAFSE (0) 14481 c-helix 14546 RNCELVWEETLHVMIGLFNDVWGSDSIITLDNAFDITMP (0) 14662 14715 ISLFVVAASAFGRRIPWTEGGLAPPGHHMSFK (0) 14813 14874 EALHIVSTGTVIKAVLPKWLLNLGPSQYIREVRDAFREME (0) 14990 15048 AYMREMVTENMLDDTKSRRDLFSSLVHAGQDSPGQEALLTDAELL (1) GNVFMFLLA (1) GHE (0) 15382 TAASTLCFALGLLALHKDEQDKLYDHIRFTLGEKDVP (0) 15492 AYSDLTSLSYCSA (2) 15635 VLYETLRLFPP (0) 15664 15726 VIGIPKKATEDTVLSTVDRDGNHIAVPVPVGSSVAIHVPGVHYN (1) 15851 intron shifted one base 15905 PRYWKDPAAFRPSRFFGNWPRDAFLPFGAGSRACIGRR (2) 16015 16069 FFETEAITALTMLVVRYEISVTDEPQFRDETAEQRRERVLSATQELTLT* >Scaffold_112a 3 different P450s on this sequence 42% to scaffold 154 very similar to 77 388 417 112 129 gene model complete all boundaries checked MLLILWALLAAVVYHAAARLVRLRRLLVKIR (2) ex1 FHPGQRAATSIYGAATFLFPWRIPNLTPGANLLFDEKHALLARHGLDVVTS (0) ex2 VSTHPMRAVFVVADPAVLR (0) ex3 35143 DMAAARSRYPKPVELYGSLSLYGPNIVASENDAWKRYRRICSPSFSE (0) 35003 c-helix ex4 34950 RNNKLVWEETVRVVTELFDTWEGRQEIDMEDALTMTLS (0) 34837 ex5 34788 ITLFVISSAGFGKPITWKGGDERPEGYAMSFK (0) 34687 ex6 DVIYHMSTGVFIKIATPQWLLNLGLTEKMRNTNVAFKELG (0) ex7 34458 MYMSDMIRERRESQQREDRGDLFNGLLDAGEEDEKLKLTDEELM (1) 34297 ex8 34281 GNIFIFLIA (1) 34252 ex9 split GHE (0) ex9 split 34143 TTGHTLCYALALLALYPDEQEKLYQHIRTLCPAGELPVYDDLRNYTYALA (2) 34033 ex10,11 fused VLYETLRMFPSVVGIPKVASEDTCVQTMNDAGQLVEVFIPEGSDIVFDTPGLHYN (2) 33775 ex12,13 fused 33720 PKYWTDPYTFSPSRFMAPDWPRDAFLPFSGGPRACLGRRFA 33598 ex14,15 fused 33606 RFAEIESIAVLVLFVSQYTIHLKEDPKYAGETEQQRRERVLKSVPGLTLT 33457 ex14,15 fused >Scaffold_112b 3 different P450s on this sequence 65% to 112a very similar to 77 388 417 112 129 gene model complete all boundaries checked MASRLLVLLAALVLFALRAFARFRRAVHAVSYVSSALRSV (expected phase 2 GT missing) ex1 38510 NHPGYRTLLNTLGPIENFFPRIPGVAPGAFHMWKRKHRDFEEHGWDVITY (0) 38361 ex2 VAAFIGSSTNFYVADADVIK (0) ex3 38197 EITTHRSRFPKPIEQYKVLTFFGGNIVASEGEHWKRYRKIAAPAFSE (0) 38057 ex4 c helix 38000 RNNKLVWDETRLIMQDLFTNVWGERAEIYVDHAVDITLP (0) 37884 ex5 37726 IALFVIGVAGFGRRIPWQDEDVVPAGHTMTFK (expected GT missing) 37631 ex6 37559 TALHTVSENVFTRLLIPDWLLRAAPTARLARIRDAFAELE 37440 ex7,8 fused 37436 QYMREMIRARRERPAREERHDLFSSLLDASKDADVRLQDSELI (1) ex7,8 fused GNMFIFMLA (1) 37278 ex9 split GHE (0) ex 9 split 37116 TTAHTLCYMLAMLAMHPEVQDKMYESIRGVTQNGRLPEYEDMRSLSYCEA (2) ex10,11 fused 36913 VLYETLRMFPPVNSIPKSVAEDTAITITNADGERTTVPMPKGSSISIHTPGLHYN (2) 36749 ex12,13 fused 36696 PRYWPDPHTFRPERFLAADWPRDAFLPFSAGPRACLGRR 36574 ex14,15 fused 36576 RFSETESVAAAAMLVLRYRIAVADEPRFAGEGARARFERVTASRPGVTMT ex14,15 fused CVFCAPLWVVVGTDELCCRPTRVPLVFRRR* >Scaffold_57 Length = 127761 N-terminal exon not certain gene model complete all boundaries checked MDALDPRIAIP (0) 127279 IVYFIYKKYEPSSPRSAFFLLLALPGVLAVALRVCERFNSYATAVP 127147 TVYLAYWSLLSTFVVAYRISPFHPLARYPGPLLCKISKGWLAYVAG KGGKAHLYVQDLHMRYGEVVRI (1) 126941 126894 GPNELSITHQDFTRVVLGAKGLPRGP (1) 126817 YYDSRQHEAGMSLDGMRDQALHAIRRRPWARGMNTAAMKYYEELIRNTLSDLIAG 126604 126603 LKQRTNRPVDISEWTNYFGFDFMGQMA (2) 126523 126474 FTRDYGMLKNGYDKEGLLDLIEHAMQ (2) 126394 126341 DSAWISHITWSIPFLRYVPGASKYWDQMKAVGERAVADRVALGSNHRDLFHYLMDEDGH 126162 126161 EAVRPTKALVAVDGQLAIIAGGDTTATTLSHIVYFLLRYPIYLDRLRKEIDETFPDGADS 125982 125981 TLDFTKQTNMPFLNACI (2) 125931 125877 NEALRLYPPVLAGLQRRVEPGTGGKMIGP (2) 125791 125734 YFVPEETQVSLFAYSIHRDPQHFSPLTNTFWPDRWLSQEKYTLPSGDVISADEVVTNRDVF 125552 125551 IPFSQGPMVCAGKNVALTEMRSVMCALLQHFDLKIADQSFLDSW 125420 EDKIEETFTTKRGTLPVILSLRA* >Scaffold_284 run of NNNNNN upstream of N-term 21900 and higher missing exon 1 in run of NNNNNNNNN gene model complete all boundaries checked TVYVSYWALLVTYTIAYRLSPFHPLARYPGPLLCRISKAWLAYIVATSGKMHLYIQNLHMRYGDVVRI (1) 21607 21557 GPNEVSVTHRDFIGVIGPKGLPKGP (1) 21483 YYETRAHKAGTSLNAIRDQALHSVRRRPWARGMNSAAMKYYEELVEETVG DLVASLKRRTTKAIDFSEWMTYFGFDFMGHMA (2) FTHDYGLLKNGYDKEGLTPLIEHAMQ 21054 (2) DVAWVSHVPWSIDYLRHIPGVYKHWLRIKALGVQTVRKRIALGSTRRDLFYHLM 20842 20841 DEDNRERVKPDINVVAIDGQLAIIAGGDTTATALSHLFYCLLRHPQYLERLRKEIDDAVPIACGSF 20644 20643 KLDFSKLPAMPFLNAFI (2) 20593 20540 NETLRLYPPVLSGLQRRVEAGTGGRFIGP (2) 20454 20396 HFIPEQTQVSFSAYTIHRDPRYFSPLTDTFWPDRWLVQEQYVLPSGGVIPAAEVVTDRDVF 20214 20213 FPFSQGPTVCAGKTLALTEIRSVACALLQTFDISNADEASFDAWEDNLEEMFTTKRSTLPVFLSLRT* 20010 >Scaffold_154 35% to 164a gene model complete all boundaries checked 26877 MARCNQACYLIYKHSEPNNIPAHAALLLGVPALLVHQLGTHWSILQQGGAFVA YWALILAFTGLYRLSPIHPLARYPGPTLGKLSKIYLSYLSARGDIYRVIKGWHDKYGDVVRI (1) GPNELSFRHVDALQPIMGTKYTVKGP (1) 26404 26348 YYDTRTTPEQITQMDGIRDYSVHGQRRKPWLRAMSSAGLKGFEPIVKMKALELVEELSKK 26169 26168 VGEIIDMSEWMNLF (2) GFDFMGHLAFGREFGLLKSGNDHDDMIRTVEDGVY (2) 25901 GAGVISHIPWILPFLVHFPPAMKGLRAMQQMAATFARERTQKGSTTKDIYYFLV (2) 25752 25710 NEGAAQSGATHDEVIADGMLALIAGSDTTSIALSHVCYFLLRHPACAARLRAEVD RAFPPGEDVLDFARHADMPYLNACI (2) 25459 25376 NEALRLLPPGLGGLQRMVRRGTGGAMIGP (2) 25290 HFVPEDTKLSVHLFSLMRDAREFAPLPDAFWPERWLAQDTYVLPTGD 25082 25081 AVSKEHVTTNRAAFIPFSVGPQNCAGKALALVELRAVTCALVSKFELH 24938 KPKDYDLDQWEGDLLDLYISIRGKLPVILQARQGR* 24839 >Scaffold_78 N-terminal exon is a best guess gene model complete all boundaries checked MSAIFAASS (0) 61873 TTYIIFKRFEYSKPVPALVLLIGVPMT 61957 LASVLRDHFAGLATAMCATAAVHWVLLTLFVAAYRISPFHPLARYPGPLPCKLSKCWMAY 62136 62137 LAGSGGKTHVYIDRLHRLYGDVVRI (1) 62253 GPNELSVRHKDACTTVLGAKGLPRGP (1) 62387 YYDTREHDNGVSLDGIRDPALHAVRRRPWARAMNSASTQYFEELIQHTVSDLTGGLK 62566 62567 ERAGESIDLTEWMSFF (2) 62742 FSYDYNMLKEGRDTQGLRRMIDQSLV (2) 62819 62870 DLRWISHIPWSIPYLKMIPGASKNWDDMKAAGDKVARHRVSLGSSRPDLFHHL (0) 63034 RDEEGHEAVRPQLEVVSVDGALAII (0) 63159 63210 AGADTAATTLAHFWMFMLRHPACFERLRKEVDATFARDDGPDFVKQARMPYLNACL (2) 63371 63431 NETLRLFPPVLAGLQRRVGRGTGGRMIGT (2) 63514 63571 HFIPEDTQVSLVAYTVHRNPDCFSPFPDTFWPDRWLTQETYTLPTGEVIPSSDVLTRRDA 63750 63751 FMAFSQGPMACAGKNVALAEMRAAVCAVVQRFDLVLAYERALDEWEEVLQECFVSKLGK 63927 63928 LPVQVVPRN* 63951 >Scaffold_249a 59% to 247 exon 7 GT boundary missing gene model complete all boundaries checked MAGDLSTRDALGIIVVSAL (0) ex1 14922 GTHAIFKRWEIKHILVVSTLLLFLPAALSTLLIPHLGSFKGIAAGFSVYFITLLSSI 14743 ex2 14742 TLYRISPFHPLAHYPGPLLPKISKIYHIAKVSSGKQHLYLQELHNQYGDIVRF (1) 14566 ex2 14527 GPNEVSIRDASCIMPVLGAQGMPKGP (1) ex3 MWQGRHFWTEVHTLIG 14347 FRDPKAHQRRRRPWNRAFNTAAMKEYTPLMQNRVRELGDALVARQGQVVDLAEWIGFFT (2) ex4 YGFMGDVYV (2) ex5 FGGWTNMVREGGDRERLWEVVKSGLK (2) 13973 ex6 13932 IEFIYDNIPWLSYYTRNIPGAGNAELRAMAIGQTEKRYSRTSTSKDLFYYL (GT phase 0 missing GTA = GCA) ex7 13699 GNEDGAEKEDPPKHTVVVDGGLALVAGSDTTSSVLSSIFYCLLRHPDTYDRLQAEVDKFYP 13520 13519 PGEDSLDPSHLSDMNYLEA (0) ex8 13409 GLRLFPAVPSGSQRAPEIGTGGKLVGP (2) 13323 ex9 YYIPEGTQTRIHFWSVHRDPRYFSRPEAFWPDRWLIA 13160 13159 EGLQAHAAGDEPFVHNPNAWTPFSFGPSNCVGKNLALQEMRMVLVHLMHRLIVRLADGWD 12980 12979 PAQYEREMEDRFVFSIGRLPVVVERRD* 12896 ex10 >Scaffold_249b gene model complete all boundaries checked MAAREAILIIALFAVVST (0) 18453 ISHVIFRQWEVMHCSVVLGLLVVIPAVLSTPLVSDFGIPCGLALGFTTYFAVLLLSITL 18277 18276 YRISPFHPIARYPGPLLAKISKIYHVSKIWSGKQHLYLQRLHEKYGDIVRI (1) 18121 18073 GPNELSIRDVSCITPALGAQGMPKGP (1) 17933 MFNGRHLWPETHSLIGFRDPKEHQRRRRPWNRAFNTASVKEFNPIIQARVQELGDAFAAREGQ 17754 17753 VVDLAEWIGFFT (2) 17718 YDFMGDMV (2) 17595 FGGWTHMVREGADHNGLWQLIKSGMK (2) 17521 17470 VSFVYEHIPWLSYYVKKLPGAGSDLKMMRAMAFGQTEKRYATTTTTRDLFYYL (0) 17247 TNEDGSEKVDPPKAVVISDGALALIAGSDTTSTVLTSTFYCLLRNPETYKRLQEEVDMFY 17071 17070 PAGEGSLDPKHLPEMHYLEA (0) 16953 GLRLFPAVPSGTQRAPEVGKGGKAIGP (2) 16864 16807 YYIPEGTQTRLHFWSIHRDPRNFSHPEMFWPDRWLIAEGLQECVGEKLVHNPNAWLPFS 16631 16630 FGPSNCVGKNLAMQEMRMLVCHLVQRFNFRFADGYDPAQYERDWQDRFVVMIGQLPVTIERRA* 16439 >Scaffold_141 73% to scaffold 249 gene model complete all boundaries checked MPSGILGQLQSLPAKLTAQDATL (0) 12018 VAHVIFKIWEPMQARIVSLLVIIAPLLLSTLFIPHYGT 12126 VSGVFRSFAIYLTTLVSSIVVYRLSPWHPLARYPGPLLAKVTKLYHALMVSKGKQHVYIK 12305 12306 ALHDQYGDIVRI GPNEVSIRDAACIQPLMGAQGLAKGP (1) fused 12494 SWSGRSMFPPISPLIGIRDPAEHARRRRPWNRAFNTNGIKEFMPTIQTRVQQLAEHLGERH 12673 12674 GQALDLAEWFSFFTYDFMGDMI (2) FGGWTEMMRDGGDLQGLWTRVKAGLQ (2) 12868 12942 HAGMVPEHVPWVAYYAKKIPSVVRKVSEMRGMGISRAKMRYQQGSTSKDLFYYL (0) 13085 13142 SNEDGSEKVTPPPEVVTSDGALALVAGSDTTSSVLSNLFYCLLRDPVSYKRLQEEVDKFYP 13321 13322 PGENSLDPRHINNMPFLEAVI (2) 13384 NEAMRLYPVVPSGSQRSPEIGKGGRAVGP (2) 13528 13583 YYIPEGNQARVHFWSVFRDSRNFSHPETFWPDRWLIAEGLQESPEKITHNANAFVPFS 13756 13757 FGPANCVGKNLAIQEMRLAVTHLMHKLNFRFADGFNPDEWDSQIQDVTVMQLGKLMVVVERRD* 13942 >Scaffold_55 61% to scaffold 141 top of gene missing in run of NNNNNNNs gene model complete all boundaries checked AEWIAFFA (2) YDFMGDMA (2) FGGWTEMMRDGVDKEGFWDMIHMTLK (2) 89160 AGAVFEQVPWLAYYAKMLPAISQRILKMRKLTVRHATRRYNSGSFSRDLFHYL (0) SGEDLPEGSARPPQHIVAADGLLAVVAGSDTTASALSNLFYCIMRHRDVYKRLQQE 89519 89520 VDQFSPLGDDSLDPQHLNNMPYLNAVI (2) 89600 89652 NETLRYLPSVLSGSQRAPLIGGGGVSVGP (2) 89738 89797 YYVPEGNQVRVHFYSVHRDPRYFSDPDRFWPERWLIADDRQPSSEKIVHDDRAFIPFS 89970 89971 YGPSNCVGKGLALQQMKSTVCHVMAKLEMRFADGYDPDTWEEQVQDEGVMIVGSLPVVIKRRL* 90159 >Scaffold_3 gene model complete all boundaries checked MALLQLAQDVFRRSHPAS (0) ACYALYHKYLNKPVHPLYHFCLLLVVPAALLALLQAIHRISVAQECGYALF YWLVMASATVVYRISPFHPLANYPGPLAAKISKLYLAYLTAKGRAHEDVRALHSKYGDVVRI (1) GPNELSFNRSDAIQTIYADKTMPKGP (1) 264083 YYVARTNLAGVVQLDGVRDFKEHARRRRPWNKAMNSAAIKSYEPIVSSTASQLLGQL 264259 264260 SKRIHNDVNISDWMSFYG (2) FDFMGRMV (2) 264385 FGREWGMLEEGRDVNDYWHTMDKCLT (2) 264569 IVSWSSQIPWSVPIIRLMKPPPEVLKMQKISDDSAMARLTSEGSGVKDLYYYL (0) 264727 264786 LNEDDSSKSELTRDECISEGVLAIVAGSDTAATALTHLCYYLLTHPDSLQRLRQEIEEAYP 264965 TLGSELDDLSRQAEMPYLNACI (2) 265031 265081 NETLRLLPPVLTGLQRSVTAGSGAIIAG (2) 265164 YFVPEGVDVSVHHYSVHRNLQDFSPIPDTFWPDRWL 265325 265326 EQDAYVLPDGDVIGKGEVRTNRGAFMPFSVGPQQCAGKNLAMVELRAVACGLFRRFDLS 265502 265503 LSERMNICDYEKGLRDAYTTVRGPLYVKLKPRKE* 265601 >Scaffold_9 73% to scaffold 144 gene model complete all boundaries checked MADTSLLSRRLKSFFDPQGSPTLLSLPDSF (0) 56886 VVHLIFKRWEPMKLPIVAFLLFLVPACLSLLFASHLSLTKGLATAFATFYTVLVSSIVI 57062 57063 YRISPFHPLARYPGPLAAKITKWWHAYHVHTGKQHLYVRRLHDQYGDIVRI (1) GPNDVSIRDASCISSGLGSQGLPKGP (1) 57353 57409 MWDGRFMYSPIPAMVGARDHAYHMQRRRPWNRAFSATALKEYEPLIYGRVHQLVSALADRQGQ 57588 VVDIAKWIGYFT (2) YDFMGDMV (2) YGGWTEMLRDGKDEDGLWDVVHRGLE (2) 57908 DVSAVYGEVPWVSYYASMLPNVGKDLKRMRKMAFDRAKQRYDSGSKARDLFYYL (0) 58057 58102 SNEDGAEKVTPPRPIVVSDGVLALIAGSDTTAIVTATILYSLLCNPTTYHRLQQEVDKFYPRGEDPLNP 58308 58309 KHYKDMHYLEATI (2) 58341 58398 NEGLRLFPATPSGTQRAPAPGKGDRLIGK (2) 58487 58541 YYIPEGTATKFHFWSIQRDPRNFSHPDTFWPERWLVAEGLEHADEPLTHNANAFVPFSFGPYNCV 58735 58736 GKNVAMQEMRMLLCHLMHTLDLRFPEGYVPRAFEDALEDQFGFKVGELPVIVQRRE* 58900 >Scaffold_271 stop codon in middle of last exon (seq. after stop matches 247, 144) gene model complete all boundaries checked 28395 MGAGYVPQLPRSYALTSFTGTSQTHALTPVIGAAL (0) 28288 28229 ITHLNFKRWEPLNLPIVIFLPLVAPLGLSALFAPGPSYGSLVAPFITLFLYHAILLASIA 28053 28052 LYRISPWHDLYHYPGPLPAKPSKWWMVWKERHGKQHLYVKALHDRYGDVVRT GPNEISIRDVAAVVPLMGTKGLPKGT (2) 27822 (fused exons) 27756 APWGEPIVPSVVPLIGIRDSVEHARLRRSWARAFTAGALKGYEPVLTARITQLIAKLGSQTGE 27577 27576 XIDLALVLSYFSYDFMGDML (2) 27475 YYGGGSELLAEGDNDGVWAL (2) 27407 27356 GQMTCHHLPWLAKYKDLLPASPGLQKMRSVALQRTMARYKNGGLGKDLFYHL (0) SNEDSAEKTSPPTEQLVSDGVLAVFTASDTIATVLSNVFWSILRFPRYYEQ 26997 26996 LQAEVDKFYPARADAFDTAHYGEMVWLDAIT (2) 26851 NEALRLYPIVPSGSQRAPAPGDGARVVGP (2) 26765 YVIPAGTNARVHTWSLQRDPRCFSRPDAFWPERWL AAAGLGDSEATCEGPEGAAGAAEDFVHDARAFEPYFVGPLDCIGRALA 26456 QLELRMVLCALLQRLAFAPARG*DPLERERTLQDQFVVMMRGPVMVSVSWRA* 26304 >Scaffold_144 84% to scaffold 247 gene model complete all boundaries checked most similar to 247a 35619 MGSEVAMQLLRANISKPFPRLDQNDALAVVVGAALV (0) 35509 35460 VCHLIFKHLEPTYIPAVLFLLVLVPLYLSALLLPHFGPLLAPVIAFTTYHTSLLTSIGLY RISPWHPLAKYPGPLPAKLSKWWMVWKERDCKQHFYLEDLHKRYGDVVRIGPNELSICNV DAVLPLMGPDGLTKGPC (1) STIGQGLEQPIPSLVSIRDPAEHARRRRPWTRAFSTA ALKEYEPILAKRISELCEQLAQQKGSLNLATWISWFTYDVMGDL (2) VFGGGNEMLATGDQDGIWAMFEKSGE (2) GQMVYHHVPWLAHYAKRLPMSPALKKMREFALGRALERYKRGAVTKDLFYYL (0) SNEDGSEKIPPPPAQVIGDGVLATVAASDTTSTTIANTFWHILRYPHYY KRVQAEVDKYYPPGESAFDTKHHNKMTFLEAVI (2) HETLRLYPVLPSGSQRSPVPGKGDRVVGP (2) YYLPDGTQARVHTWSLHRDPRYFSRPDTFWPERWLIAEGLEPAPAGEPFVHNANAFIPFS FGPANCVGKNLAYLEMRMVFCHLLQNLDFELDKRWNPAERSRSAEDQFVLYMRCPLPVTVRRRV* 33556 >Scaffold_247a 51% to 154 gene model complete all boundaries checked 17448 MGTEYVLPLLRTNILKLPTNMTRNDALLAVGGAAV (0) 17558 LCHLIFKKWEPTYIPAVVTLLLVVPLGLSALLVPHYGQLLAPLVALATYHTILLTSIALY RLSPWHPLAQYPGPLPAKLSKWWMVWQERDCKQHLYIKQLHDRYGDIVRIGPNELSIRNV DAVAPLMGTNGLPKGPS (1) LRGQGLEPPITGLIAIRDPAEHARRRRPWTRAFSTA ALKEYEPILVKRISQLCEQLASQKGTLDLATWFSWFTYDFMGDMV (2) RFGGGSEMLAHGDQDGIWTMFKEGGE (2) GQMMYHHIPWLAHYAKRLPMSPALKKMRGFALGRTAERYKKGASTKDLFYYL (0) SNEDGVEKTPPPAAQVISDGVLATIAASDTTSTTLSNAFWNILRHPHYY KRLQAEVDKFYPVGENAFDTKHHSKMTFLDAVL (2) NETLRMYPVLPSGSQRAPFPGNGDRVVGP (2) YYIPDGTQARIHFWSLQRDPRYFSHPDTFWPERWLIAEGLEPAPAGEKFVHNPNAFIPFS FGPSNCVGKNLAQMEMRMVFCYLLQNLDFELDKSWNPAERENATEDQFVLLMRSPVQVTVRRRV* 19517 >Scaffold_247b gene model complete all boundaries checked MMPYILGVTLLLLTVIVLRVLRARASRAAPYPPGPPAYPVIGSVGPFPAHEPHLGLAELAKKYG (1) ex1 20516 DVMYFEIFGKPLVVLSSLEAASDLLEKRSAIYSSRPRFAVHEM (2) 20620 ex2 20700 IGWTDMVSFLPYGEQFNKQRKFFLHTFSKQGCLVFRSSQVAQTHLLLKNILQCPTRYIEYLR 20885 ex3 RFSTAVIMEIAYGHKVSSEDDPYVKIAEDTNNVLMAAGHSLALVDFLPW (1) ex4 21133 LRHLPAWFPGNWFARVAQ (1) 21183 ex5 partial ESRPVIQRMRNFPFDQVVQQM (0) 21363 ASGTASPSFVSMQIEELERDGGASPENLHILKIAASQMYGAGAET (0) 21561 TWSTIMNVIAFLLLHPTAQRKAQDELDSVLRGERMPDFDDRKSLPYLDALLLEVMRLQPTAPL (1) 21743 ex7 21808 GVPHSSTTDDIYRGMFIPKDSIVLTNTTY (0) 21891 ex9 ALAMDDRVYRNPTEFRPERFLPPHAEPNPNGIVFGWGRR (2) 22114 ICPGRYLADTSVWIVMASFLTVFEIVPERDRNGQDIIPEIQWCS (1) 22257 22330 APFPCVIRPRSEKEVKLVSRL* 22398 >Scaffold_104 63182 VLLWVLWKVFRNYVVSSPLDNIPGPPRSSFWS 63283 ex1 63714 GEQHRRQRRVLNPVFSGAHMRHMAPVFYDVAHRV 63815 65118 ILAATDTTSSALARILHILAERQDIQDKVRAELVEAAGEGEDIPYDQLVNLPWLDAIC 65291 65292 RETMRLQVP MTAASHRARTDVIMPLSEPVQGRDGTLIHEIAVPKGTLVTISVRGCNRNKAIWGEDALEWKPERWL 65636 >Scaffold_203 39023 GQQHRKHRRVLNPVFSINHMRNLAPLF 38943 c-helix 38894 TSTGGEVEILGWMGRAALELVGQGGFC 38814 38261 FAATDTTSNAMARILHLLAEHQHVQDKMRLELFEAGMDGEDIPYDRLVELPYLDAVCRETLRL 38073 YPPVLFMNRE (small exon?) TRQDAVLPLSKPIQGLDGDVLTEIMVPKGTLLMVSIVACNRNKALWGEDVLEWKPERWLSPLPESIREAHVPGVYSHLYVY 37689 >Scaffold_76 I-helix to EXXR 41033 GNLKEMFNRHGWGFHDTIIRQYGPIATVHSMLG 40935 ex2 40879 LYVYDPKALNHIILKDQYTY 40820 ex3 40647 GAHHRKQRKLLNAIFSIARMRDTAPIFYNVAHR 40549 c-helix 39822 ILGATDTTSNALARTFQLLAEHQDVQDKMRAELADAAPDGEDIPYDQLVHLPLLDAVC 39649 39648 RETLRL 39631 NPPIGLLARE ARDDIVLPFLQPVHGRDGSLINEVPVPKGSTVFISVRACNRNPL 39369 IWGEDAAEWKPERWLQPTPKSLSEARVPGVYAN 39271 MTFLGGDRACL >Scaffold_97 36% to CYP51 53419 AAALVAYLLYLCVVYPLRNPIRQLPGPPSKWFL 53517 ex1 53731 RLFTLDPVTMNHVLQHTAIYEKPWPSRRLISGLIGAGMLSAEGQMHKRQRR 53883 c-helix 53854 EGQMHKRQRRVATPAFSLNEMRALIPLVFSKGT 53952 c-helix 54010 GHVVNVCSWASRATFDVMGSAGFDYEFNAIQNEDNELLRAYVDMFE 54147 54509 NSAVDLPPRLSDQDLLNNINTFMFAGTDTTSLALTWTLYVLALYPHVQDRLRAELLSVAPIAPIDTLTP 54688 54689 EEVQSLYAEIAALPFLENVIRETLRLIPPVHSSIREATRDDVVP 54820 WGETA 54964 55072 GLYQHTLTFSAGPR (0) 55165 ACIGMRMSVIELKSFLFTLVTNFKFAPDPTQKIGKA 55275 55331 SILTRPYVAGKQNEGSALPLIVTPY 55405 >Scaffold_112c 3 different P450s on this sequence very similar to 77 388 417 112 129 59100 GEQHKKQRKMLNPVFSVSNLRELLPVIQPIANK 59002 c-helix 58886 WLSRGAQEYMSQACFGWTFNALDLNKRNTYSEAARKY 58776 58384 VKANASSNEADRMTDSEMIGQM (?) 58270 STLLFAGFETTTYAISRILWVLASHPDAQARIRSEVKAAKNKYLEDGRSTASTWEDVSLSY 58088 58087 DDLMALPYLDAVIKETLRVYPPSSVHFR 58004 57810 NVWGPDASEWKPERWLNSDDKAVPK 57736 >Scaffold_12a gene model complete all boundaries checked 53199 MNNLTAALILVALALWFACRRFTRTTLRDIPGPKPVSFWL (1) 53321 ex1 53375 GNLEQYFLGQAGEGDFHLQERYGRIARLHGSIG (0) 53473 ex2 53532 GEYLWISDPNALRYIFQTSGYRYAKQPERRALSRLHSGHGLVWAD (1) 53660 ex3 53715 GEVHKRQRKVMLPAFGAPESKALLPHFARAAEA (0) 53819 c-helix 53864 VSVKWKDILTTAPSLSKELNVSTWLSRATMDAIGE (1) 53977 AAFDYHFGALENTDTDIVRAYNNLM (2) 54149 PIVFGAPTADAIFKRDALRIFRSSRIVEWIYDRQRNPAVEKARECEELTLKIARELVENK 54328 54329 AEALEQGKGSKDIFSLL (1) 54370 54432 VKANMTEDAKSRLSEEEMYAEMR (2) 54552 TILFAGHETTSTTISWVLLE 54612 LARHLPVQERLREEILAHKRGGELSATDLDGMPFLQAVVREALRLHPVLNQTF 54770 54771 RQAEQNDVLPLAHPLTDRTGTVLTALPISKGTRVILSIAAYNR (2) 54899 54963 DTELWGSDAHAFDPDRWLDGRVKKVQTLGMYGNL (2) 55055 55102 LTFAAGVRGCIGWRFA (2) 55195 VYEIQTFLVELLANFEFRPTEDLKRLRREPCGVMVPTLEGDRGTVQLPLRVSLLDHKI* 55362 >Scaffold_12b gene model complete all boundaries checked, many fused exons 184563 MDTVLVGLFVALALYAWSRSSKRSALPVPPGPKPVPLLGNIFDLTAKELWLRVTGWSKQY (1) 184384 ex1 184326 DIVYIHLLGQGLVFCNTYEVAQDLLEKKGSIYSDK (GT boundary missing)184222 ex2 CGCQNMVAFTRYGDFARRQRKLMNTAFGISAVKRYRPLLANESVLLLKRILADPQDYMG YIRRYAGGLTLQSVYGYRVETNDDPLLELGTECVDILSNKIASGGGIWPVDIFPF (1) ex3 183744 LQHLPTWFPGAGFKRKAAVWRAKMEEFVDKPYEMVLERM (0) 183628 ex4 RSGATVPCFVTTLLEEARDEKGGAVDAQRDFDIRWTANSMYSASMDT TITVVQLFLLAMILHPEVLRKAQAELDAVVGPARLPTFADRPALPYLDAVMSEVLRWGVPVP LGLPHRLMEDDVYRGTHLRAGTLVFANIWNMLRNEAIWAQPDVFRPERFLEPVDEATAK RRDPRPYVFGFGRRRCPGLHLIEESLWIVMATLLATTDILAEKDESGKPVMPHVDFTNS (0) ex5 fused 182807 LVVPFSTPAPFKCDIRPRSEQALQLVRLAE* 182736 ex6 >Scaffold_206 MTSAPLLAFGAALIAIIWTLFQGYLVKSPLDNIPGPERSSFWL (1) 36949 ex1 36488 GEHHRKQRKLLNPVFSINHMRHMAPIFYQTTHR 36390 c-helix 35800 VKANMEASDEDRLPEDELIGQMT 35680 TLVFAATDTTSNALSRILELLAKNQDVQDKMRTELIAASPDGEDIPYDTLVALPYMDA 35507 35506 VCRETLRL 35483 35214 IWGEDADEWKPERWL 35170 >Scaffold_187 21928 GDHHRKQRKLLNPVFSINHMRHMTPIFYNVVHDVR 22032 22756 (0) TTSNALARIFQLLAEHPDVQDRLRAELTDAAPDGADIPYDALVVLPYLDA 22905 22906 VCRETLRL 23079 ARADAVLPLAEPLRGADGAPITEIAVPRGTPLIIAIRASNR 23201 >Scaffold_173 3475 PSGDHHRRQRKLLNPVFSIAHLRRVTPVFYEVMNRV 3582 c-helix 4318 TFIFAGTDTTSNALARILHLLCLHPDVQEKLRAEIIEARAQNGGGDLDYDALVALPYLEAVCRETLRL 4521 HPTHSTRRARSDALLPLSAPLTLRDGSRVSALHVPQGTSVLVAIHSVNR 4782 4786 AALWGADAHVWRPERWLEKLPDAVAEARVPGVYSNL >Scaffold_88 64% to scaffold 12 gene model complete all boundaries checked MHDIFPLAVLLGALLWIVRRILSRSSIRDICGPEPESFWL (1) ex1 44607 GNLKQFFMRQAGEGDFELQERYGRIARLHGSIG (0) 44711 ex2 44761 GEYLWVADPKALQHIYQASGYNYAKQPERRALSRLHSGHGLVWAE (1) ex3 44954 GEVHRRQRKIMLPAFGAPESKALLPHFIHIAES (0) 45058 ex4 45105 LSMRWKDILLASRDFAKELDVTEWLSRATMDAIGE (1) ex5 45258 AAFDCQFGALDNGGSEVLRAYNDLL (2) 45335 ex6 PMVLGVPTTDGIWKRDAMRIFNSSAIVEWIHDRQTNDVLQRARECEQMVMK VAKELVSSKAEALVQGKGSRAYF (probable frameshift here) SLL (1) ex7 45685 VKANAAEDAASRLSDEEMYAEMR (2) 45750 ex9 45809 TSSLAGHETTAMALSWALLELAQHPEVQSRLREEVRGCKRGEELSAAVLDSMPYLQAVLREVLRVHP 46009 46010 PAIHNFRQAVRDDVLPLAHPITTKSGSVLTELPIQKGTRLILSIAAYNR (2) 46156 ex10 46197 DPDLWGSDPHMFDPDRWLDGRVKKGQVVGMYGNL (2) ex11 LSFSAGVRGCIGWRFA (2) 46400 ex12 IYEMQAFLVELVSNFEFGPTEDLKRLRREPCGVVAPMLEGEQGVQLPLRVSLANYDV* 46616 ex13 >Scaffold_101 48% to scaffold 7 first exons to C-helix fused 507 amino acids end of exon 5, exon 6 not well supported. gene model complete all boundaries checked MPLLSAVPAAALPLLG 53410 AALYVLWTFLALLVRQARSPLRHLRGPPSPSFL 53508 VGNLREMHDQENTALFARWEHRYGSTFVYHGFLG GARLLTTDPVAVAHILAHGYDFPKPEFIRDALASMAAGHEGLLVVEGEDHRRQVRA (0) c-helix ex1 SPAFATPHIKSLSPIIWSKATQ (0) ex2 LRDVWIDLASSPSLTPAATP (2) ex3 54377 PGTKVDVLAWLARATLDVIGEA (1) 54439 ex4 GFGYAFNSVRAAACPGDAAEDELARA FAVIFSTARKFRLITVLQVWFPFLRRFVSIPPRCFLALP (1) ex5 LKSSLSSDPSQQLSTNAMLCQIATFLAA (1) ex6 55067 GHETSASALSWALYALARAPACQHTLRRELRALTLPADPSAADLQAVLALPYLDAVVRET 55261 55262 LRVHAPVTSTMRVAAHDAAVPVGTPFRDAHGAQHAAIRLRAGDIVTLPLQAMNK 55423 55436 WGADAACFRPERWLAHGDAPREPRGLWGGVMTFGTGVVANGNRSCIGYRFAVND (2) 55597 VVTRPCVKSEPHLGNQMPLRLRRVAVEETVGDSSGDGAPRTVS* >Scaffold_7 47% to scaffold 97 sequence is short exon 5 not well supported only 481 amino acids. gene model complete all boundaries checked 74171 MGYPLAVYAVGALVALIVYSVGPTVWHVLTSPLRHLPGPPNDSLLW 74033 GNMAAIQNEEISVPQARWVKQYGHTISYRGVFG (0) 73935 ex1 73876 MWRLWTVDTRALNHILTHHLIYQRPLPSRYQLSRLVGPGVLVTE (1) 73748 ex2 EERHKHQ (0) ex3 part of c-helix 73635 RRVMNPAFGPAQVRELTEIFTEKANEMRDVWYNEITKAGGASAQ 73549 ex4 part of c-helix 73497 VDALSWLSRATLDIIGRA 73438 ex4 73437 GFGYDFEALTGASNELNQAFSTLFARPIARHRFFA (2) ex4 RIGMQLIEQRKAAILAEKGKDVERKDLTGRDLLTLLIRANMATDIPEDQRLSDEEVLAQ (1) ex5 VPTFIVAGHETTSTATTWALFSLAQMPEIQRKLRNEMLTIDTDTPSMDQLNSLPYLDAVIR ETLRFHSPVPVTTREAMADDVIPLGTPTVDRYGRTIDHINIKKGDLVFVPILAINRSKEIWGEDVDD (2) ex6 72585 72529 RPERFENVPEAASTVPGVWGNVLSFLGGPRACIGYRFSLVD (2) 72404 ex7 IVTRPVMTGPDGKTRGALPLIIRPYRP* ex8 this EST supports first half of exon 5 >gi|6433584|emb|AJ274211.1|AJ274211 AJ274211 Metarhizium anisopliae ARSEF 2575 Metarhizium anisopliae Query: 16 SRIGMQLIEQRKAAILAEKGKDVERKDLTG 45 +R+G L+ R+AA+ E G D R D+TG Sbjct: 442 NRVGSTLVPGRRAALAHEVGDDGIRVDITG 353 >Scaffold_311a gene model complete all boundaries checked MSSLLVLVAISLALSQLIRFYRWLFHHSISYLRGPVADSFIL (1) ex1 GNVREFTYQESVGDLDFRYMNEYGTAWRMKSILG (0) 1636 SDVLMICDPKALQHVLHKSGYHYPKNTEARIGSFNVTGRSILWAP (1) ex2 NGDIHSRHRKIMNPAFTAQQLRSFLPLFRRGSNK (0) C-helix MCQLWKDEVLAQAPTGMTIAVNQWLARTTLDVIGE (1) AAFDFSFGALDDADNEVSKAYHNML (2) FADSLLYPSAWSTIFRGLWRFIPDQLLSYVRYLPTREYTRFRYTLNIINKVSKSLIDQKS EDLLSGDKSSKDVMSVL (1) VRANSSENPRSQLSEEEMVSQMATLTLAGHETTANTITW LLYELAKHPEYQQKMREEIAVKRAEINARGDADFTMDDLESMQYLHAALK (0) ETLRYHPIVYHLAREASKDDVIPLAYPVTTIKGETVSEIPIAAGQIIMPNIAAYNR (2) LPQVWGDDAHEWNPLRFIDDSPEVQVRLGMFGNL (2) MSF (1) FAGVRGCIG (2) WRFS (2) LIEMQAIVADLVENFQFSIPPEKPEIIRVPAGIMGPMVKGKMHEGLQMPLHVTPL* >gi|6934623|dbj|AT002896.1|AT002896 AT002896 POSLM01 Pleurotus ostreatus cDNA clone 355LM. Length = 615 This EST from another basidiomycete supports the small exons in this gene Query: 1 LPQVWGDDAHEWNPLRFIDDSPEVQVRLGMFGNLMSFFAGVRGCIGWRFSLIEMQAIVAD 60 L +WG+DA E+ P R+ V G++ N+M+F G R CIGWRFS++EM+A++ Sbjct: 177 LVSIWGEDAFEFKPERWQSPPEAASVVPGIWSNMMTFLGGPRACIGWRFSIVEMKALLFT 356 >Scaffold_311b gene model complete all boundaries checked MAAAALLIICWLVVNLRRLLTHNSIRHLRGPPSASTLF (1) 7990 ex1 7939 GNVTDTLYQASVGDVEFRWLKEYGGAWRLRGLLG (0) 7838 7785 ANILALADPK (0) 7716 ALQHVLQKSGYNYPKTRQLSVTLFNLTGRSILWAP (1) 7594 ex3 TGEIHARHRKVMNPAFSVPQLRSFIPLFRQSAKK (0) 7436 c-helix 7388 LTQIWKDQVNAGHPDGVTLPVDRWLARATLDIIGE (1) 7278 AAFDFDFGALDNTENEVSKAYHRML (2) 7150 ex 7097 FADSQLYPSVWNLLFQATWSLLPEPLLYYIRYLPTREYKTYRSTLSVMDKIAAQLIEERT 6918 6917 REFGAGDPDKSRKDVMSVL (1) 6858 6810 VRANMSENPSTRLSDEEMRSQMFAMTLAGHETTANTVT 6697 frame shift 6694 MLWELAKHPDIQEQLRQEIAEKRVEVTANGSYDFALDDLETMPLLQAVIK 6545 (0) 6487 ETLRYHPISSFLWRVAAKDDVIPLEKPIVTTTGETITEIPVAAGQVIMPSLCSYNR (2) 6317 6262 LAHVWGEDAHDWNPMRFLEGDTEKQTKVGMLSN (2) 6161 ITF (1) SAGVRSCIG (2) WRFS (2) 5885 VLEMQAIVVELVENFRFSLPDNKPEIIRAPTMTMGPMVKGKLHEGFQMPLRVVPV* 5715 >Scaffold_51 gene model complete all boundaries checked MFHGYLSATFQAQRRP (1) 59426 GNIKDFTYQQNVGDLDFQWVKQFGRVWRMQSPFG (0) 59313 59268 TDILALADPK (0) ex2 AMQHCFHKADDQYNKRVESTVGSRMMMGKGLVWAS (1) 59080 ex3 GTTHERQRKIMSPAFTTAQIRSFLPFFRAGAAK (0) 58924 c-helix KWRDELFNHSTDGAAVPVNKWFSRATLDILGE (1) 58772 58730 TAFDFNFGAVDDKDNEVTLAFHTML (2) 58647 ex4 58600 FANSCLRPPKWDLLFKRIWYFLPNPLLELVQYVPTKEQNRFRRCRLVVEKVSQQLIQEKR 58421 58420 EALLAEAKSSRDIFSVL (1)58361 58325 VRANVSENPNSRLSDEELIAQMGTLVLAGHVTTATTLSWMLYELARRQDYQDKM 58161 58160 REEIVAARARLQERGQQDFSMEDLENMHYVSSCLK (0) ETLRFHPPVYHLFRQANTDDVIPLEQPVRTTSGKYVTEIPVAAGQQVLFSVCAYQR (2) LPEVWGEDAGIWNPMRFIDGNVDKQSKLGLYSNL (2) MTFSAGSRGCLG (2) WRFT (2) 57475 IVETLAIIVELLEHFKFEPTEDTAKVIRVPTGIMSAFTAGKEREGPQMLLKVVPIL* 57311 >Scaffold_4a 47837 SSPLVLLACTVCVAVLAFRWYSSSTHGSIAHIRGPP 47730 first exon 47659 GNIRDFSYQENVGDLDFAYMKEYGTAWRLKSSLG (0) 47555 47500 KSVLMVADPK (0) ex2 ALQHIFHKSGYLYPKTTPSTVRSFLVTGKSILWAP 47318 ex3 DGNTHSRHRKIMNPAFSAPQLRSFLTLFRKSSSK C-helix 47119 LCQLWRDEISPEGSTVLVNKWLARTTLDVIGE (1) 47018 46975 LAAFDFDFGAMQDNQNELSVAYDNML 46898 ex4 46857 YLLSTDATLHTSPWNAIFEALWDYIPDGILKQVQHIPTREYARFKQTLGVFAKYSKRLIA 46678 46677 QKSADLVSDTHSKDVMSVLG VRANAAEDAGRKLNDEEMVSQMSALTLAGHETTANTISWLLYELA 46438 46437 KHPDFQEKMHAEIVAKRAEIVARGDEDFTMEDLESLEYLQAAIK (0) 46306 ETLRYHPIAFHLNRMASQDDVLPLAYPVMTTAGEKVTEIPVRKGQAIMPNLAAYNR 46087 46030 IPEIWGADAHEWNPMRYIENRTDAQVRVGMYANL 45929 45819 PAAGVRGCIG 45790 45671 SLIEMQAIISDLVENFRFGLPKDRPEVLRVPAAVMAPMIKGRMEEGAKLPLHVT 45510 last exon >Scaffold_4b 55457 SGACVLCLAWLAYRWYRWTTRLNISYIRGPPVKSWILG 55344 ex1 55296 GNVRDFAFQENVGDLDFKYVQEYGLVWRMQQPLG 55192 AQVLMVADPK 55117 GDIHARHRKAMNPAFNNAQLRSYYPCFRRTSSK c-helix VCQLWKDQILSQGPNGATIRVDRWMARAALDIIGE (1) 54655 AAFDFDFGALDDSANELSAAYHNML 54522 ex4 54464 SADSTLRPSAAQAVFQGLWTHAPLRVLERVRHLPLRDIARFQHAMRVFNTYAARLMARGA 54285 AGAAHGRDVMSVLG 54228 TAHANASADPRTRLSAEEVRAQMCALTFAGHETTANTTTWLLWELARHP 54035 frame shift 54036 PAHQDSVRADTVRRRAHVAARGDADFGVEDLDALPCLEAAIR (0) 53908 ETLRCHCIVFHLNRVASQDDVIPLSRPLTTATGKTVTEIPVAAGQVVMPNIAVYNR 53681 TRTKWDPTRFLDGRVDNPEVRLGVYGN LRTF AGGVRGCIG 53275 RHRMIEMQAIVADLIGHFRFSIPDDKPEIVRAPSMLMAPMIKGKEHEGSQMPLHV 53096 last exon >Scaffold_4c 74181 IFTAVLAVTVVSLLTRRKNGRHVPPGPKPLPFIGNALQIPPQHEWIKFKEW 74333 exon 1 74622 YGPRLRESRKMLKKGMGPAAVKTYYPYINREVPFFLENMLRKPDSFVEH 74768 ex3 LKIAYGYEGVTEDEKIIRGGVDAMEVFSAVAAPGVWV 74961 ex4 75028 VKYVPSWFPFAKFKKFAERGKKITDEALDTPFYEVKRRV 75144 ex5 75378 TTATLSWFTLAMAKYPEVQKKAQAEIDRVIGKDRLPEVGDRDSLPYVWAIMQETFRWH 75551 75552 PTITMSGY(?) ex7,8 75626 VPHTAIQDDEYRGYFIPSGTTLMANIW 75706 ex9 RERRSWPISGACPLRLMF 75731 75732 EVIYHVHRG 75759 ILHDEKLYHDADKFIPERFCDEGAPDSLSVAFGFGRR 75869 75913 RRVCPGLVIAQTHVFVTMASMLATFNITKARDSTGAVIEPREDAKSGVI 76059 ex10 76131 PKPFVVSITPRSDEAVTLIRRSV 76199 >Scaffold_4d 77219 VLALLLASVLTKARRKARNLPPGPKPLPFIGNAHQIPPENEWIKFKEWGDEY 77374 exon 1 77654 PYGQRLRESRKLLKKGTSPAAVKTYHPYINRDLPFFLENMLSTPDKFVEH 77803 ex3 LKIAYGYEGITEDERIIQGGVKAMEVFSATAVPGVW 77995 ex4 78078 VRHLPSWAPFSSFKGFAERCKRITDEALNTPFYEVKQRL 78194 ex5 78431 TTATLSWFTMAMAKYPDVQEKAQAEIDRVVGRDRLPEVGDRDSLPYTWAILQETMRWH 78604 78605 PTVAL 78619 ex7,8 78678 VPHTAIQDDTYGGYFVPAGTTVIANVW 78758 ex9 78809 AMMHDENVYHDADKFMPERFYEEGAPDSLSVVFGFGRR (?) ICPGLVVAQTHMFVTIASILATLNISKARDNAGNVIEPREDAKSGVI 79126 ex10 79190 PKPFQVSITPRSDAAVQLIRRSV 79258 ex11 >Scaffold_4e 39% to CYP61 199360 LFAGVLLVCLFVSYARKRPRYPPGPKGLPLIRNLLDIPADYPWITYRDLAEKYS 199199 exon 1 199149 SDILHFEVFGSHLVVLNSAEATRQILEKQSSITSDR 199042 exon 2 198893 DRTVTFMEYGESWRRHRKLFQEHFRQQAIPRYHHAQTKGVNRLLKSLLDTPEKFSAHIR 198717 ex3 198422 VRYVPSWFPGASFKRLAGKWKKAVDDMYTIPYNNYK 198315 ex5 198012 TALGIFIMVMAVHPEAQISIHEELDRVLHRDRLPTMEDRKELPRTTALMYETFR 197851 ex7 197726 VPHQTTAAIHYNGYFIPQGANIVANSW 197646 ex9 197594 AILRDEGLYPDPETFKPERWLDADGSLRDDMRFPVEMFGYGRRICV 197457 197456 GRHFAEDIVWLAIASILSVYKIEPPVDENGTVRALEADFTPRL 197328 197267 PKPFKCRFTPRFPGAEGLIRASL 197199 >Scaffold_4f 202911 VSCASLLILHTRRRPRYPPGPKGLPIVGNLLDVPTHNAWIKYKQLGKKY 202765 exon 1 202710 GSDIIHFEVFGSHIVVLNSTTVARDILEKRSQISSDR 202600 exon 2 202451 ERNFGFMRYGDGWRRQRRLFQQHFRRKAVTQYHAIQSKSVHSLLNALLDRPERFIANLR 202275 ex3 201827 VKHIPSWMPGAGFKRKAAEWKVLVDDMYEVPYN 201729 ex5 201418 VGTLSIFFLAMTIFPSVQVAAQEEIDRVLGRKRLPSIEDRNALPRTTAIVYEVLRY 201251 ex 7 201128 VPHRTIADDEYNGYFIPAGTTIIANAWY 201045 ex9 200996 AMLHDEERYPNLETFIPERFLNKDGSLRSDACIPLEPFGFGRRICPGRYFAEDIVWLAI 200820 200819 ASILSVFRVEPPVDEHGEPLKQTATFGTRFL 200727 ex10 >Scaffold_4g I-helix to heme (others in 4 do not have this intron) 276990 XXGRPFRVASPLRWQYIVSSPELIDELRRAPD 277079 ex3 277376 WHPVVAHDTNVRIVTQTSSRIFVGLPLC 277459 ex5 277508 RDPELLKITMSYTPTVMKTGLLLKVLPRPLQ 277600 (?) ex6 277653 RFVQRGSGSIDALIDRAHRLLLPTIEERRRMMDKYGAEWLDKP 277781 ex7 278020 GFTVALYRLATHPEYVQPLREEVEAVIAQDGWTREAFRKMPKVDSFLKECMRLQGPXX 278166 ex9 278240 VLLQRKAMQDFTFSDGTFVPKGSHVATSIVATHCDSAYYSDPLTFNPWRFVGAEDDAQDS 278419 ex10 278420 KHRFATTSPEYLLFGYGRHA (2) 278479 ex10 278542 CXGRFFAEIQLEMMMAYVVTTYDVRMEKPGVLPEPIEFGSMSLPSMTAKVCFRKR 278700 ex11 >Scaffold_10a intron in heme CYS (10b does not have this intron) 85346 LQDIPTVGGPNLPFFSYFGAIHFLVRANKIISDGYSK 85236 ex2 85184 YKGGSFKIAQVNRWLVFVTDPALNEELRKAPEDQMSXXXXXXX 85077 ex3 84951 YIEVLRDQLLRHVNPSPAMLYDEMQASFQDFIPXXXXX 84853 ex4 84790 WTPIPALSTSLKLFLRMSNRAFVGTPLC 84707 ex5 84265 SFTSVLFLLASRPAWQAELRAEAAAALAHGYTRDALARLRKLDSFVQESLRFNXXXX 84107 ex9 84029 XXXXKLALTDFALSN 83997 ex10 83996 GTVIPKGTLVSAPLRALHLDDEVYPDGASFQPWRFVRAGGEAAPRQSLASTSPTYLPFGHGKSA 83805 ex10 83742 CPGRFFAALELKMVTAYLVLNYDLKLE 83662 ex11 83661 GDATEVPPVSWFITARVPNYKANVLVRRR 83575 ex11 >Scaffold_10b 30% to CYP51 123796 SAVVAVWVMYELASRPTYIPAIREELLAVAELQADGSHYLSYDSLRNARLLDSFIREVMR 123975 123976 LKGDTLGVCRQTVQDTPMGQYVIPK 124050 124170 EVFDGFRWVERNLPAVMVGPTYFPFGMNRWACPGRVLAVS 124289 >Scaffold_151a 17010 LPPGPRPLPLIGNLLDAPSAYHWETFAEWN 17099 exon 1 17166 DVSSITILGQRMVILNSLDAAIDLLEKKSSIYSDR 17270 exon 2 17564 AGAIILNMGYGYQVKEGHDPLVDLVDRAVNGFVAASTPGSFL 17689 ex4 17759 VRWVPAWFPGAGWKRKALTWRADTRATCDVPFEFAKQ 17869 ex5 18105 VSAITTFFLAMTLYPEVQKRAQQELDAVLGAGRLPTLDDREQLPYTRALVSEVFRWNPIGPL 18290 ex7,8 18343 VPHVSTEDDEYRGYFLPKGSIFIANIWY 18426 ex9 18895 HPQPFKCSIKPRSQRAEELIR 18957 ex11 >Scaffold_151b 19497 DVVFALLALYLIVRFLQKDRTLPFPPGPKPLPLIGNLLDMPSTYQWVTFADW 19652 exon 1 19727 DISSVTVLGQRIVILNSLDAAVELLEKRSAIYSSR 19831 exon 2 19898 GELFRDIRRFLHRYIGSRGQLERVAPFYELIETSTQDFLQRTLADPVRFVEHIR 20059 ex3 20119 AGAIILNMTYGYKVQEGYDPLVDLVDRAVDGFVAASTPGSY 20241 ex4 20326 VQWIPSWFPGAGWKRRAEAWRADTQAMCDVPFEFAKQ 20436 ex5 20677 VSAITTFFLAMTLYPEVQKRAQEELDAVIGTDRLPTLDDRERLPYTRALVSEVLRWNPIGPL 20862 ex7,8 20931 VPHVSTEDDVYRGYFLPKGSMFIANIW 21011 ex9 21089 ILRNPDTYSEPLRFKPERYLGEQPEMDPRCAVFGFGRRICPG 21214 ex10 21447 HPQPFKCSIKPRSKKAEELIR 21509 ex11 >Scaffold_23 40506 VENPMVVLDTAGTVNDLFEKRGTSYSSR 40423 exon 2 40323 FTTMQYTT*WRKHRHLFHQHFNTSAMHVSRPVVLREAHTFLHNVSRTP 40180 ex3 39883 VKHVPAWFPGAAFKKQALQWREANRMMLNVPLEKVQQ 39773 ex5 >Scaffold_69a 73% to scaffold 247 6369 GAHLVFKRWEPKKLRVVTFLLFLVPACLSTLLLPHFGTALG 6246 LTVGFLTYWTALTLSIVFYRVGPLHPLYQYPGPLPAKISKWWHVWHVQQGKQHLYLQQ 6073 6072 LHDKYGDIVRIGAS 6031 5984 GPNEVSIRDPACITPVLGAQGMPK 5913 5841 GRNMWPETAPLIGYRDPAEHMKRRKPWNRAFSSASVKEFEPIIQHRVHQLVEALSDRQGQVVD 5653 LAEWISFFTY (?) DFMGDM (?) 5311 XXXXXXXVPWVSYYAKKLPWTAQDNKAMRVMAFSRTEQRYASGSASKDLFYYLV 5171 5117 SNEDGSEKVSPPRNIVIGDGLLALVAGSDTTATVVANTMYELLRHPAAYRRLQEEVDKFY 4938 4937 PRGEDSLDPKHIKDMHYLEAVM SNEGLRMYPAVPSGSVRAPEVGKGGKIAGP 4716 4663 SYIPEGTQTRIHFWSVQRDARNFSFPETFWPERWLIAEGIEPAPAGEKLVHNPNAFTPFS 4484 4483 FGPYNCVGKNIALAEMKQLLCHLVHKLDVRFADGVDPDAFDRASEDRFIYVVGELPVVV 4307 4306 ERR 4298 >Scaffold_69b 9838 MQAAHLVFNRWEPTNIAVVAALLLGVPCAAASLLFSGRGFVPRLALTAALYYACLGTSVV 9659 9658 LYRLSPWHPLARYPGPWLLKTSKLWMVRRVKRGGQWRYIRELHQRFGDVVRIG 9447 GPNELSFCDAAMVVPVLGTQGLPKGPG 9367 LIALRDPLEHQRRRRTWNRAFKPAALQEYLPLIQKRTAQLLDALSKCEEQDV 9119 9118 VDLGRWIRFCKY 9083 8817 AHMPWLAQFTKHIPRVAAKLKELRAAARSRAAARYKAGANRKDLFYYLVGD 8665 8623 SNEDGGEKEQPTEDVILSDALLAIIAGSDTTSTILTSAVYCLLTHPDVHKRLVEEVDKFY 8444 8443 PPGADWCNTEHHADMHYLNA 8384 8325 NETLRLFPVLRDGSLRAPWVGHGDRALGP (?) SSFIPEGTQVRVHTY 8146 8145 SLQRDPRCFSQPDTFWPERWLVAGGLQHAEPGFVHEPGAFLPFSRGPSDCVGKGLALQ 7972 7971 DMRIVLCALLQHLELAPPRNRPFDEWKAEVDRRFADSSAVLPVSVRVRRRV 7819 >Scaffold_90 91112 XXTTHGSRFPKSIEQYKILTFFGGNIVASEGEHWKRSRKIAAPAFSE (0) 91246 c-helix 91305 XNNKLVWDETRLILQDLFTHVWGKRAEIHVDYAVDITLLX (0) 91418 91505 KIALFVVGVTCFGRRIPWQNEDVVPAGHTMTXXX 91597 >Scaffold_99a 46% to 164a 4124 LLSLYDTLDRSTTPLSVLLPWLPSPSMLAKLRASKQVYDIVDGAIRARVASGVSRDDTLQIL 4309 4782 IPSGAYVVYPFSDIHLDPRLYPDPWRFDPSRPESKSNIGYVGWGGGE*SFVACPSIDTFL 4961 4962 THFLLPKGRTVCLGQRIAKLQIKLVLSMFLLHYDFDL 5072 >Scaffold_99b similar to 313 14609 FGFGMRACIGRPFAWQEAIIALAVLLQKFDFVLDDPSYELE 14487 ex10 >Scaffold_95 73% to scaffold 249 67473 KSVHLIFKRYEPMHILVVSTLLLLLPAILTVPLIDQLGIAKGFLVAFATYFATLLSSITLYRI 67285 67284 SPFHPLARYPGPIIAKVSKIYHVAQVWSGKQHLYLQRLHDRYGDIVRFGGFSTCSLR 67114 67090 GPNEVSIRDVSCIAPMLGTQGMPKGP (?) GKHCWPEIHTLI 66911 66910 GCSDIKEHQRRRRPWNRAFSTAAMKEYNPIIQKRLQELGDALAARQGEVVNLADWISFFT 66731 66456 HVPWLSYYTRNIPGATNDEFRGMVFGQTVKRYACTGTTKDLFYYL 66322 66260 NEDGAEKEDPPKPIVVVDGAVALIAGSDTTSTVLASTFYCLLRNADTYKRLQAEVDRFYP 66081 66080 PGADSLAPDHLPEMHYLEA 65962 EALRLFPAVPSGSQRTPERGSGGKTIGP 65879 SYIPEGTQTRIHFWSVHRDPRNFSRPETFWPDRWLIADGLQKDEGVE 65688 65687 FVHNPNAWIPFSLGPANCVGKNLALQEMRMVLVHLLHRFSFRFAGDYNP 65540 EQYEWDIEDRFVVAVGRLPVIVERR 65466 >Scaffold_204 4685 DLGHLPYGDEWKLARKFFTQHVR 4753 exon 3 partial 4930 SVKYGIDVLSADDPYIVNARQALNAINAAGNVGSYL 5037 ex4 5877 ALPHDRARYPAPDTFCAERSLTADGALDPAMPEPVAGHRFGRHAHGAGHALD 6032 ex10 6001 GMHMAQDTLWIVAASLQRAFRMERAVDVSGAPVPVTGEHTVGVV 6132 ex10 >Scaffold_59 56% to 205b 8666 IVLAVIVACSIALLLFQRRPHLPLPPGPKRWPVVGNAFRFPKEREWLTFMRWSREF 8499 exon 1 8433 GSDLLYYEMWGRPFVVINSHRAAVELFERKSALYADR 8323 exon 2 8162 DLAFMPYDETWKLARKLFTQHFRAGAAGRYRDEETRCARELLADILRDDTQLFEHAR 7992 ex3 7932 GKLIMSVTYGIDVRSADDKYITNAQKALYAITATGNVGTYL 7810 ex4 7733 VKYIPEWFPGAKFQREAREWREAAEAMSQRPVEDVKIAM 7617 ex5 7329 SATLTVVLAMLLFPEIQERAHAELDRVVGMDRFPVFEDQPSLPYITAICKE 7177 ex 7 7043 AVMHRVTQDDVYNGCHIPGGATVVLNSWY 6957 ex9 6910 AILFDPDQYPNPEPFAPERFLAPSGELAPDVPEPTAAFGYGRRACAGTAMALDTLWIVV 6734 6733 ASLLWAFDIRRAVDEMGNEIDVAGEYTFGVV 6641 ex10 >Scaffold_82a 66881 AVVKESLRMVIGVVHPMTRIVPPQGAVLCDMFIPGG 66953 GAVLCDMFIPGGVCTLLHL (?) YFLHHNEDVFPQPRTFKPERWLARE 67132 67133 SDKEHMLVSFSRGPHSCLGVKCVSPFPNMANTDPKSSMAYCELYLAFAYFFRRYDVELN 67309 >Scaffold_82b 58% to 144 94829 QATHLVFNRFEPTELPVVAAALVGLPAVLSALLVGHFGLLAGAALAFATFHATLAASIVLYR 94644 94643 LSPFHPLARYPGPLPARITRWYWARVACGGRQHLELKRLHDVYGDIVRIGE 94491 IVGPNELSFRDISVVQPMMGAQGMPKGP 94077 RTFKPPILPLVGMRDVADHTRRRRPWIRAFTPA 93919 93918 ALKEYEPVVAKRGAQLIEILAQKKHTDFVHWIHLFT 93811 93580 HIPWLGYWMRHFPQLATATKQYRAMCFQRGMQRYQDGSSQKDLFYYL 93440 93389 SNEDGAEKQTPDKATVIADSALAIIAGSDTTASVFSNMVYCLLKNPHAYKRLRAEVDEFY 93210 93209 PPEENSLDPKHHSKMPYLEAVM 93093 NETLRLYPVVPSGSQRAPELGTGGTLMG 93010 92950 SYIPENTSVRVHFWSVHRDPRNFSQPESFLPERWLAAEGLEA 92770 GTFVHNANAYMPFSFGPWNCVGKALALLELRCVATHMMQRLDVRFADGWDPAEWDAAMED 92591 92590 KFVIKTGRLPVVVERR 92543 >Scaffold_311c 50% to 205b gene model complete all boundaries checked 16862 MDTLLLAGLVAVAVVAAGCLAHSRRQRFPPGPKGLPLLQNLLDVPRHRPQWEAYRDWGLKY (1) 17032 exon 1 17095 NSDIVHLRLFGVSFVIVNTADAVTELFSRRSSVYSD (2) 17202 exon 2 17336 SIGWEKAIVMKNYGEDWREHSRLFHQSFQPKVIQEYYPRLYEEARKLLPRLLKGDDFVASLRV (2) ex3 17573 MTASAILGVTFGMEINDSNDPYVTIADRAIQSLVEAGMPGSYM (1) 17701 ex4 17825 VRYIPSWAPGGKFKRDAAEWRTLVSDMFTKPFEHIKSAI (0) 17932 ex5 17998 RHGVARPSIATSLLMGLDDKQDNARREHVIRNVTGTAYV (1) 18111 GAADT (0) 18236 TVAALRTFILAMVMHPAIQKAAQAELDRVVGRDCLPTFADREHLPYLTAIQYEVLR (2) 18400 ex7 18546 RANADDEYHGYHIPKDAIVLGNIW (2) 18623 ex9 18683 AILHDPARYRDPAAFDPARWLTPDGALRADAGDAMLAFGFGRRICPGRHYAVANMWINM 18859 18860 AYMLAAFDIAPPRDAAGRAVPPAGEHTTGLLT (2) 18955 ex10 19024 YPKPFGAVFTPRSAAALRLIVADADD* 19104 ex11 >Scaffold_490 MOST LIKE 311C gene model complete all boundaries checked 4818 MGVLILCAIAILAALLCYHHFRARRFRLPPGPKGLPIVGNVLDVPKDGPGWLTYERWSHEY (1) 4636 ex1 4582 GSDVIYLNLLGSSIVILNSSKATTDLLDKRSPIYSD (2) 4472 exon 2 4334 SVKGDRAFAFLGYGDEWRQHRGIFHKYFHGQAVHNFRPKMLEEARKVLVRLQSTDDYDRCFRV (2) 4179 exon 3 4102 MSAASILGVTFGMDIEDINDPYVVLAEEAINYVLSAAIPGSFV (1) 3971 ex4 3850 VKYLPAWAPGAGFKCKGAEWHDLVSRMILTPFETLKQKM (0) 3734 ex 5 3680 AEGTAKPCIATAIVQKLEESKGGDAQRETQIAQYVTGTAYT (1) 3555 AAADT (0) 3428 TVSSLCTFILAMVLNPEVQALAQEEIDRVIGTTSLPDYTYRDSLPYVSAIMYEVLRWRPVAPL (1) 3243 ex7 3185 GVPHRLMEDDEYEGYHIPGGSLVVGNIW (2) 3102 ex9 3049 AITHDPVRYLNPDAFDPTRWLTTDGQLGDVTDALVAFGFGRRVCPGRHYALEALWITL 2876 2875 VHVLAAYRIEPPVDAHGRARPPSGEYLPGFIA (2) 2780 ex10 2712 FPAPFKAVFKPRSPVALGLIQTALG* 2638 ex11 >Scaffold_453 3219 VQWGDMWRETRRVYANQLRECMIP 3148 exon 3 partial >Scaffold_32 67132 WGRMWEETRTVYAQRLDEC 67076 >Scaffold_117 perf-heme 8472 IWGADAEVFNPLRHQQRTPTQEKALLGFGAGRVMCVASRWAPHAAGIIVASISE 8311 >Scaffold_431 50% to scaffold 205b 791 LVIILLVRWAARQRQDRKQGPHPPGPPGLPLLGNLLDMPDNPSWMTYIRWSEKY 952 exon 1 1008 SDILRLNVLGSNIIIVNSLDAANDLLDKRSAIYSDR 1115 exon 2 1254 GWNVAFRRYDDTWRNGRRVFQHELGPQVVKRFRALEEHATHQLLRNLL ex3 1479 SMSAFEILRIAYGIEVTGREDPYVDTAEHAVGAVVATCSPGSYL 1610 ex4 1679 VKHIPEWVPGAKFKQDAKVWRRYVTELRDKPFSVVKERI 1795 ex5 2096 VSALGSFILGTVLDPAIQARAHADLDRVCPGRLPTFDDQPELPYIDAIVKEALRWNPVLPI 2278 ex 7,8 2330 VAHCSIADDVYRGYFIPKGSLVLANSW 2410 ex9 2411 AILHDEAAYADPLRFHPDRFMAGDALDGRVREPDAAFGFGRRICPGRYMAYDAIWIAV 2584 2585 ACMLAVFRIDKAKDAQGREITPSGEYNVG 2671 ex10 2742 YPKPFPCDIRPRSSAHEALIR 2804 ex11 >Scaffold_400a 12645 SDVVTFDVLGTTVVILNSLKAATELLEARSAIYSDR 12538 ex2 12169 SLAGRQILHIAYGLEIRDRADPWITAAEHGVEIAVKCIIPGSYL 12038 ex4 11971 VKYVPEWFPGAGFKRQARIWKKEVTSIADAP 11879 ex5 11561 STLATFFHALSRHPAVLYEAQRAVDRVCAGRLPTFADYDALPYIHALLRECLRWRPVVPL 11382 ex7,8 11312 HRASKEDVYEGYRIPAGALVLANNWY 11235 ex9 11175 AIMHDPAAYTDPEDFNPRRFLRARAGAGANGSGADDSLELDPGVRDPGVAAFGFGRRACP 10996 10995 GRYMAYESLWIVMASVLTVFDVLPAEGDEGEVEYTDGFLS 10876 ex10 >Scaffold_400b 14939 PAGATFGFGRRIRSASAFSHVRIASVLDTFTI 14844 ex10 >Scaffold_219 14119 KSPSSVPGSHMAPPETHEIHTAGSAWRARDYRY 14021 >Scaffold_60 7597 REIHHPLPLPSRSYVPPSPRALLA 7668 >Scaffold_45a most like 205e gene model complete all boundaries checked 107776 MLDTTPFTLALLTVGAVCLLGLIKGASRRRRLPPGPKGVPLLGNIFDAPKEHEWRTFKEWGRTY (1) 107585 ex1 107530 ESDVVHFGILGTHYVVLNSAWAALDLMDKRSHNYSD (2) 107423 ex2 107260 KTGWDRNWGFMKYGDYWRAHRRMFHQHFRPNAVSAYHSTSQQAVRELLRLLYAKPEHFKEHIQ (2) 107078 ex3 HMTGYNIIKLMFGVAVSPEDDPILARMENALRILGKIANPGVYL (1) 106755 VRFIPSWVPGAKFKRDAEAWKPVIDKTYTQVYEEIKTSY (0) ex5 106582 ANGSPVPCLVTEMLEDVLKVDNEAYRDMLEDVIINGGGTAYV (1) 106460 ex6 AAYDT (0) ex7 106326 SSSVLSTFVLAMLLYPDVQRTAQEELDQIVGPDRLPTMEDQPSLPYVTALATEVLR (2)106165 ex8 WRPALPI (1) ex9 106037 GVAHKSVVDDEYRGYHIPGGSVIIPNVQ (2) 105954 ex10 105903 AILHDEGSFPDPDTFNPRRFLDSEGQQLEELTGIVSAAFGFGRRICPGRYFAKDVVWLTI 105724 ex11 105723 ASVLSTFNIEKHFNEAGNAVEPSGEYTPGIIS (2) 105628 ex11 105574 YPAPFKAAFKPRSESAVELVR* 105509 ex12 >Scaffold_45b pseudogene of 45a N-term missing, beginning of exon 2 missing end of exon 5 missing gene model complete all boundaries checked 110266 PGPQGLPILRSLFHAPAQFQQLAFQDWGHRY (1) 110174 exon 1 110084 IVLNCAQPASDLSDNRSTVYSD (2) 110016 exon 2 109872 KGGW (frameshift) DRNMGPLAHSAYWREHRRL (deletion) APHHFKDHVR (2) ex3 109459 VRHIPAWLPGAQFKRAAARWRSIVEEVF 109376 ex5 (AG boundary missing, GT end missing) (0) DSGNPEPCL (frameshift) AACLPTSHRDREDTLMLENVVINTSGTAYA (1) ex6 AASGT (GT boundary missing) ex7 109046 (0) TTTTLLAFILTTMLYPDVQKAVRQELDSVVGQGRLPEMADRNALSSIPALMKECLR (2) 108885 ex 8 WRPPLPL (GT boundary missing) ex9 108739 (1) GVPHRSTAEDE*RGRRIQVPAGSTAIGNAW (GT boundary missing) 108653 ex10 108464 XXXXXXXXYSDPEAVKPWRFFDAAG (frameshift) ex11 AG end missing FGYGRRVCPGRHCLLDFVWLATANVLVVCSIEEP 108363 ex11 (frameshift) FDGQWDTVEPSEQHTTGTII (2) 108305 ex11 FLAPFEAVFKXXXXAHVECVR (stop codon missing) ex12 >Scaffold_313 I-helix to EXXR similar to 99b 3814 SAGMLTFILYYLLKNPEAMRKLREEVDTMIGSRTMTVDDVHKLPYLIGGKTPFLSS*RRG 3635 3634 ADFSPAVMRETLRLGPPAPARGTAPFEDTLLKGKYPVAKDGRIYCGIYMVHRDPKVW 3464 3463 GE 3458 >Scaffold_603 PKG to PERF 681 YALPPGTIVATQAWSMHRDEDVFPSAETFLPERWL 785 898 VPFGVGTRQCGGQNLAHLMIRIVVAVVVRNCEVRADVRETNERSMSMRDAFV 1053 >Scaffold_258a PERF 21435 TSRILRLLCSLWGPNIFAANGEEWRKHRRIINPAFS 21328 c-helix 20563 (0) TVAHTLDAAFALLALHQDFQEEVYQELLEVMPXXXXX 20468 20098 HNPRIFPDPEAFKPERWYNAHENDMSMFSFGARAC 19994 >Scaffold_258b PERF 25641 IFAANGGEWKKHRRVINPAFS 25703 part of c-helix 26448 (0) XXXXXXXXXXXMLALHPDFQEECYREILKVMHTNXXX 26516 26879 HNPKLYSDPEKFLPERWYNTHENDRTMFSIGAQAC 26983 >Scaffold_242a EXXR to PKG Sbjct: 18658 DSIDLTSAVMREALRLGPPAPMRGAASFEDTLLKGKYPVAKDVPIYCGVYMVHRDPK 18828 Sbjct: 18829 VWGE 18840 >Scaffold_242b EXXR to PKG 28756 AVMRETLRLSPSAPARIVQAMEATTLGGGKYAIAKDDTLLIATYVSQRDP 28607 >Scaffold_242c EXXR to PKG Sbjct: 33958 AVMREALRLGPPASARGASPYEDTTIGGGRFAVPKDTFIMCSLYNIHRDTKVWGE 33794 >Scaffold_361a Sbjct: 8693 AVMRESLRLGPSVPGRMIESLKDQTLKNGKYAVAKGEILVVCNFIAQRDSKVFGD 8857 >Scaffold_361b Sbjct: 13979 AVMRETLRLTPVAPGRVIEAIEATTLKGGQYAIDKGQDILVAVHSSHRDPKVWGD 14143 >Scaffold_356 similar to 193 HPLAKFPGPKLAAATHWYSAYYEVWRDGALVEHLQELHKQYG 7683 QAVVGMGATFMHYNPEVFPQPYTFDPDRWLQPDDFRRGLG 9040 The following P450s all have in common a phase 2 intron in the heme CYS >Scaffold_152 intron in heme CYS 55928 YKGRPFKIAQFDRWLVVLTTPHHIEELRKAPETGLSSRDAXXX 55809 ex3 54871 XXTFAIFHAAADPGVADALRAEVASVVAEHGWTPTALTKMQRLDSFVREVQRMHXXXX 54716 ex9 54642 ALVMKIARTDYAFADGHVVPRGALVTAPATTVHRDDEHYPDAHTFRPWRF 54483 ex10 54482 VGTPDESEADSARRKATSTSPTFLAWGHGKHS (2) 54399 ex10 54344 CPGRFFAVRELKMLLGSVLLRYDVRLK 54264 ex11 54263 TPGVLPQDQWYLTFRVPDPTACVLLKRRTTA 54171 ex11 >Scaffold_21 Length = 172089 MISIDSISLISGLISFAFIAYYLRQDKLQ (0) ex1 HIPSPGPTGPISSWYAAYKYIRGDAPQIIEEGYRK (0) ex2 154290 YKGRIFRIADLNRWTVVVTSPSLVEELRKAPEDVLSFHEGIRY (0) 154400 ex3 154482 SLQLDYTFGREAVEHEYHIPVIRTQLTRHLTPLFADIHDEIVQSFTDLVPPSDT ex4 154629 WTSVRVVPTVMQVVSRTSNRVFVGLPYC 154712 ex4 154713 RDPAFCALAVRFATDVVKTGVALHLTPRALKP (2) ex4 fused LGVRLVSPVSKRVEEGKGDHRQAGGRPAPRQPGGRPARAAQGACGNWPDKP (0) ex5 155085 EDMIQWLIEEANEDERTVEGLVLRILIVNFAAIHTSSM (0) 155192 ex6 155247 SFTHAVNMLAAHPECIAPLREEIEEVVREEGWTKAAVQRMRKLDSFMKECQRLHGLGA (1) 155408 ex7 155477 VTMSRVALQDYTFSD 155521 ex8 155522 GTRIPRGTLVMAASRPIHHDAALYVPDADAFDPWRFARLRAADADASIKHQMVHTSAEYL ex9 155702 AFGHGKHA (2) 155725 ex10 155785 CPGRFFAVNELKLMMAHVLHTYDIR 155859 ex11 155860 PQTSVPPGRWIRHSLLANPIATVDLKKRQT* 155949 ex11 >Scaffold_314 Length = 25333 ex 1 not supported by blast match gene model complete all boundaries checked MQSSVGEVFASAPTLAKLLAGAAFVLFLNSLWNIWK (0) ex1 5060 LRHIPTVGGPAIPILCYIGTFRYLQDPQKILQEGYEK (0) 5170 ex2 5243 YKAKPGMFKIAAPDRWLVVVGHPNLIDELQKHSDEQVSFMDAATE (0) 5356 ex3 FVGTRYALPGTIADDPWHIPPLKQHLTHAIGSFFGDMLEELRVSIEERIPSNEKDD (1) 5569 ex4 EWVAIPALDTFFWVFTRVIDRIIVGLPI (1) 5799 RDTEFIKLMVEFTMSIGIARFFIGLVPPMLKP (2) 5894 ex6 5948 AMAKLAARGVHKATAEAEKMLAPVIADRVRHLDEFGEKWADKP (0) 6073 ex7 NDLLQINIEEARAQGRPLDEIVIRTIVSIFVGVSTSAA (0) 6306 SFVHVLYHLAADPELQAALRTEVEGAIARDGWTKAALVGMHRVDSVLRESQRVNGINS (1) 6476 ex9 6530 VSVMRTALQDITLTSAGAPVCLPAGTLCVAPERALHADKEHYPDPDAFVPFRFAELRATA ex10 DARGGAQHQFVSTSTRYVPFGHGKHA (2) 6787 ex10 6844 CPGRFFAGNEMKAAVAHLVSNYDVRLPDGASTRPPNELFGLAIVPNRSAKVMSRRRQPVV* 7023 ex11 >Scaffold_210 intron in heme CYS () similar to 137 GT boundary missing at ex 7 gene model complete all boundaries checked MSDYSSLLAYIFISLATLAYLKRLLWPDRQQ (0) ex1 27371 LEHIPAIGPTAPILSYWGAFRWLSHGTEITQKGYAK (0) 27270 ex2 27199 YKGRPFKVANFNRWLVVVSGPKLIDDIRKAAEHELSFEEAAHE (0) 27071 ex3 26994 NLEVRYTAGPCIAENSYHVPIVRGQLTRNLPFLFNDVRDEVAKAFGDHIPPTD (1) 26860 ex4 26785 DWTPVAAHPVIMQIVARATNRVFVGAPKC (1) 26702 ex5 RDPDWLDLSIQFTADLILGAHIITQFPQFLKP (2) ex6 26500 LAARFFTRVPAAIRRGRRHLERTIEHRKACLEQYGADWPDKP (expected phase 0 GT missing) 26372 ex7 26312 NDLISWLLDEAKGEERTIHNLVTRVLTLEFAAIHTTSN (0) 26199 ex8 26148 SFVHALYQLAAHPEWAEPLRDEIEQVVKREGWSKSSLDKMHRLDSFLKESQRYYALGG (1) 25993 ex9 25909 VTMDRRAMKDFTFSDGT 25871 ex10 25870 VIPEGTFVGVAVLATQHDPQYYDDPDTFNPWRFSDLREESDESGRHLLVST 25718 ex10 25717 GIEYFPFGHGRHA (2) 25679 ex10 25624 CPGRFFAAIELKLMLAHIVMNYDVKAE 25544 ex11 25543 LDGVVPPILEFGQNLAPNMKAKVLFKNRQRS* 25448 ex11 >Scaffold_297 48% to scaffold 210 intron in heme CYS 27263 XXGSPFRIATRRYWQYIVSSPKLIDELRRAPDDELSFLDAVNE 27385 ex3 27473 YTMGAATANNLYHVPVIRNTLTRNLGNLSSEIYDEISNAFADCIPAXXXX 27610 ex4 27681 WMAVPALQSIMQIVARTSSRIFVGLPLC 27764 ex5 29066 RYEGLGMRTLPLSPFRSIY*PCS 29138 VFLTRKAVKDFTFSDGTFIPKGSYVSTSRAATHG 29239 ex10 29240 QSEYYRDPYVFDPWRFANLRDETGEGVKHQMVNTSIEYLPFGLGKHA (2) 29380 ex10 29439 CPGRFFAANELKSMMAHLVVTYDVALD 29519 ex11 29520 MPGEVPRSVHFGPINSPNRTAKVLFRKR 29603 ex11 The following P450s have in common an intron in the heme signature before the CYS >Scaffold_5 23027 LVALLLILTLFVLVTRSSSRHYPPGPRPLPLLGNIHNAPAQRQWKTFAAWKSTY 23188 exon 1 23250 DVISLTIFGQRIVVLNSLESAIDLLEKRSAVYSDR 23354 exon 2 23382 LGWAQQLVFAPYGEHFRNMRKILHKYLGARGQLDKIEPYHEIIEAATAKFLVRAL 23546 23847 VKWVPTWCPGTSWKRQAQEWKDLFVRMTEGPYAMAQQQ 23960 ex5 24177 VAIILSFILAMTLHPEAQKKAQQEIDALTNGERLPVIR ex7,8 DREELPYVRALISEVLRWNPLVPL 24362 ex7,8 24416 VPHCAIADDVYRGYFIPEGSTVVVNMW 24496 ex9 24568 DPEVYSDPEVFKPERFLGPDPERDIRAFVFGFGRR 24672 ex10 24980 YPLPFKCTMKPRSERAETLI 25039 ex11 >Scaffold_24 149683 DCAAVLLALWVIRKAFSGSSTSRHYPPGPKGLPIVGNLFDVPTSQEWQTYSQWA 149844 exon 1 149912 DLISMNVLGQRIAIISSVKVAFDLFEKRSVIYSDR 150016 exon 2 150484 VRHLPSWVPGTAWRKTAEKYRRHVADAVAEPFAFVKQQM 150600 ex5 150963 SALTSLFLAMTLYPEVQRRAQAELDGVVGTDRLPTFEDRDRLPYITAICAEVLRWMPVGPL 151145 ex7 151213 LPHRLTEDDVYEGYALPKGTIFFVNNWY 151296 ex9 151374 LLHDPDTYRDPMAFMPERFLGAAPELDPSKIAFGYGRR (?) 151542 ICPGILVAEATIFITVAATLAAFSIRPAQNGGAPSLPPVRQTSGII 151679 ex10 151753 HPAPFQCDVVPRSKKAEALV 151812 ex11 >Scaffold_54 similar to 15 87020 VIAVLAYGLFTVLRRLRLGVLGKSLPPGPIGIPVFG 87127 exon 1 88156 IARFVAEIAEHPEVQVRAHAELDQVVGRDRIPGAEHKSELPYIGAIVKVGLRFQ 88317 ex7 88732 PRTCLGKPLAQTEVFLAVACMLWAFEVQPFDTWEQEAEEDGEDEHPSPWPDPFLVRLRPR 88911 ex10 >Scaffold_66a 19677 TAIAFVIMAAATHPKAQAEVQAQLDSVVGRDR 19582 19591 KGQRYVCAQTGAVTSGSPGLVPSFDDESLLPLVTAFYLEAYRWRPV 19454 ex 7 19328 AHPHNQFMQGEYVIPADAIVIGNHW 19254 ex9 19204 SIARDPDVFPEPEEFRPSRWLDESGKLREDLSSFNFGFGRR 19076 ex10 >Scaffold_66b 32359 TAISFVMMAAATHPQAQAQVQAQLDSVVGRDR (?) VPTFDDEKLLPLVVAFYLETFRWRPI 32129 ex 7 31872 AIGHDETVFSDPDVFRPSRWLDEAGKLRDDISPFTYGFGRR 31750 ex10 >Scaffold_164a 5 P450s on this sequence 28% to CYP61 10606 SWLLLIPALAIVAHILIWLLDPHGIRSYPGPLLAKFSDAWLGYVAAQGHRSEVVHDLHKQYG 10421 9282 SLGSSSCAITYYLAKYPDAQRKLQQELDEALGSDDEPVSTFDQVKRLPYLQAVIDEALRI 9103 ex 7 9102 HSTSGIGLPRLVPKGGMTVCGRFFPEGTVLSVPTYTIHRDEEVWGKDPEVFRPERW 8935 8934 FEQDKNAVQKTYNPFSFGPR (?) 8817 SCIGRNLANMELLIIVSSILRRYDFVLEDPDKPVSGFLVI 8698 >Scaffold_164b 5 P450s on this sequence 22646 DPAVIGFGFGRR 22681 (?) 22735 RICPGMHFAHNSIFIAIARMLYVFNFTKAVDKNGNEITPEVEYS 22866 ex10 22929 HPSPFVCSISPRSIAAAELV 22988 ex11 >Scaffold_164c 5 P450s on this sequence 38255 LDIALATLAVVLLKTIIARSKQRARYPPGPKGLPVIGNVLQMPKDREWLTFAQAER 38088 exon 1 37958 NIVYLSLLGQPMIILNSAKDAVALLDKRSSIYSDR 37854 exon 2 37796 PYGDRFREYRRLMAKFIGGKTQVERHFPVMEQEATSFLKRILRRPD 37659 ex3 37560 LKLAYGYTIREDEDPFVTLADRAMAQFTEATTPGAFL 37450 ex4 37431 LRHMPAWFPGASFKRTAQEWSDTLNSMADVPHAFVKEQM 37315 ex5 37040 VSSIHTFFLTMLLFPHVQKRAQAEIDSVVGTDRLPTFEDRAKLPYVEGVLKEVLRWHPIGPL 36855 ex7,8 36793 LPHRLAQDDSYEGHLFPKGAIVIANIW 36713 ex9 36655 LHDPDVYPNPSDFDPTRHLSENGRSPQPDPRDYCFGFGRR 36536 ex10 36228 HPKPFRCSIKPRSTRAEALI 36169 ex11 >Scaffold_164d 5 P450s on this sequence 45556 AFLSRTRRQGPYPPGPKGLTIVGNALEMPTSREWLTFSEW 45675 exon 1 45745 DIIYLSLLGQPMVILNSAKHAIALLDKRSNIYSDR 45849 exon 2 45877 IGWKYTLALTPYGQRFREYRRFIAKLIGGPTQMQTHLPLEEHETRRFLKRLLNEPERVADHIR 46065 ex3 46276 LRYVPAWVPGARFQKTAREWRKVLERMADEPHDFVKQRM 46392 ex5 46623 VSAIYSFFLAMTLFPHVAKRAQMEVDAVVGSDRLPTCEDRPNLPYVEALVKEVFRWNPVAPL 46808 ex7,8 46861 LPHRLIEDDIYEGYFIPKGSFVIPNIW 46941 ex9 46995 ILHDPNHYPNPFEFDPTRFLSDEGRTPQPDPRDYCFGFGR (?) 47169 RICPGLHLADVSVFLSCAMVLATFDISKAVENGKVIEPEVEYTSGTI 47309 ex10 47369 HPKPFKCTIKPRSTKAEALI 47428 ex11 >Scaffold_164e 5 P450s on this sequence 48060 LVLLSAILTVVLIRTAIARRKRWARLPPGPKGLPIVGNVLQMPKSQEWLTFSRWAEQY 48233 exon 1 48291 DIVYLNILGQPLIILNSAEDAVALLDKGGSIYANR 48395 exon 2 48822 LRHVPAWLPGASFKATAKRWGETLEQMADVPHNYVKEQM 48938 ex5 49234 VSSIHSFVLAMVLHPHVQRRAQAEIDAIIGPERLPTFEDRAALPYVEALFKEVLRWNPVGPL 49419 ex7 49474 LPHRLSQDDVYKGYLLPKGSIIIANIW 49554 ex9 49613 LRDHNLYPNPSDFDPTRHLPKNTEAASQPDPRNYCFGFGRR 49735 ex10 49855 VLAGQHLADASVWLACATMLATFDIENLVASDGTVIGVEPEYTSG 49989 ex10 50056 HPKPFKCSIKPRSALANALI 50115 ex11 >Scaffold_165 53515 ILVLICLLLRVVRRNTTRRLEQLPPGPIGLPILGET 53408 exon 1 52432 PEVQAQAQMELDRVVGRDRLPQAEDAKDLPYVRAIVKVSISIR 52304 ex7 52236 PHMSTEDFSYRGYKIPKDTAVILN 52165 ex9 52097 HDSQRYPNPEKFD (0) PDRYIDDERSSAESAKLADPYQRDHWTFGAG (?) RRICPAIALAEHEIFLSVAGLLWAFDMRQLSD 51756 >Scaffold_245 3382 LLATAALLALLSARRTQQPTPPGPCELPIIGNVAELSGGFEWTRF 3516 exon 1 3707 IGVFMMAMALHPDKQARAQEEIDSAVGTDRLPTMDDKARLPYVYAVVQEAMRWHPMLPL 3883 ex7 3936 LPRRAEVDDEYDGYHIAKDTTVCANLWY 4019 ex9 4080 MEPNVKYPPEQFIPERFLDAEHPTPNPNTWAFGFGRR 4190 ex10 X M D E y - r 2 t u . ` a / x a M V T P % M B W h^ h^ OJ QJ ^J ` X M D E y - r 2 t u . ` gd^ a / x a M V T P gd^ P % M B W ( O C gd^ ( O C 9 m 3 v ' m O E ( S p ! '! B! ! ! " " H" " " " # # # b# # # # $ 3$ T$ $ $ % % F% z% % % &