149 named intact gene sequences
10 named pseudogenes
D. Nelson
August 16, 2007
>CYP51F1 ug.2.6.1(CYP51) MSLSQYGPIAGLVGQAYDALASMSTSRLVLFLLINIPILSVVCNVIYQLLPKDK SLPPVVWHWFPWFGSAAAYGEDPIKFFFDCKEKYGNVFTFILMGRKVTVALT PAGNNFIMGGKHTTFSAEEVYGGLTTPVFGKDVVYDCPNELLMEQKKFVKF GLSTENFRQYVGMIEEEVLQFMRNDASFKIYQMNDINEWGAFDVLKVMSEIT ILTASRTLQGKEVRANITKDYAQVYNDLDGGFTPLHFMFPNLPLESYRKRDA AHKKISDFYISIIRKRRENPGQEEHDMIAALMNQKYRVGRPLKDHEIAHIMIAL LMAGQHTSSATGSWALLHIADRPDVAEALYEEQVKHFRQSDGSWRTPEYEE LKELPVLDSVIRETLRIHPPIHSIMRAVREDVVVPPTLAAPSEDGRYVIPKGHV VLSSAAISQVDPMLWKNANDWDPSRWSDPEGVAAQAYKQYDDAEGAKVDF GFGLVSKGTDSPYQPFGAGRHRCIGEQFAYLQLGTIISTFVRHVEMRLPETGV PPPNYHAMITLPKAPRNILYRRRNFD >CYP53C2 ug.1.19.1= pc.1.261.1 MAVIEALTQLDLKSWLLLIPALAIVAHILIWLLDPHGIRSYPGPLLAKFSDAWL GYVAAQGHRSEVVHDLHKQYGTFVRIAPNHLSIADPDALQVVYGHGTGTLK SNFYDAFVSIQRGLFNTRSRSEHARKRKIVSHIFSQKSVLEFEPHVRLYVKQLI QQWDRLYEAGAKGLVWLDCLPWYNYLAFDIIGDLAFGAPFGMLLAARDAA PVAVDHEQAMASYGKEKSEVQYIPAVQVINDRGTYSASLGVLPPWMRPIVKL FPWFRRGQKAVKQLAGIAVAAVAQRLTTPTDRVDLLGKLQEGRDDDGNLM GKEELTAEALTQLIAGSDTTSNSSCAITYYLAKYPDAQRKLQQELDEALGSDD EPVSTFDQVKRLPYLQAVIDEALRIHSTSGIGLPRLVPKGGMTVCGRFFPEGTV LSVPTYTIHRDEEVWGKDPEVFRPERWFEQDKNAVQKTYNPFSFGPRSCIGRN LANMELLIIVSSILRRYDF VLEDPDKPFDTMEGFLRKPVECVVGIRRRTL >CYP61A1 ug.78.18.1(CYP61) MASSQAAFPSTLSDSSRHSTDSPAFIGLLPTGSWFYTTAAILLSLLVIEQSVYRY KKRHLPGDKWTIPLIGKFADSMKPTMEGYMKQWNSGALSAISVFNVRFIVM ASTTEYARKILNSPTYAEPCLVHSAKQIILPDNWVFLTGKEHVEYRRGLNLLF TRKALGYVLQYCLVLYAMLTHRRSLYLGIQDVITRKHFAKWLADAAKDPSA KPIMMTARELNMETSLRVFCGNHIPEHGAKEISDKYWMITVALELVNFPLAIP GTKVYNAIQARKAAMKWLELAARKSKESVAAGNPPECMLEEWVTILNDPAY KGRREFSDHEMAMVVFSFLFASQDAMSSGLIYGFQHLADHPEVLAKVREEQE RVRGGDYEKPLTLEMMDEMPYLRAMVKETLRVKPPVTMVPYKTTKAFPISQ DYTVPSGSMVIPSFYNSLHDPAVFPDPDRFMPERWLDPNGSANTNPRNYLVF GSGPHKCIGLEYAMMNIALVLANAAVLMNWEHELTPQSDKVQIIATLFPQDG CKLKFSPRQHA >CYP63A1 PC-1(ug.20.36.1) MGLTQAQRLVLGQLARLVAPALAVCVLLAAARRTQLVRAPVWADALIALIA IPLFHVGRAHWRYARLARKAARLGAALPPRWEGKLPGSVDVLQLVDEAYRR GFLSDYFYEKFGELGHTYNFYVLWDMDYCTEDAAVIKAVLATDFNNWVKG ERFDSYMHSVLGTGVFNADGELWKFHRSMTRPFFARERITDFETFNRHAEEAI 
LKMKERLREGFAVDFADLISRFTLDAATEFLFGACVHSLAGALPYPHGAPAH LHTTRARIPADDFAAAFRAAQDAVSHRARLVWLWPWFELARSRTDTPMRTV DRYLTPIIERAL AMSRAAKQAPQGEKEEVADGETLLDHLARYTTDPTILHDEILNIMIAGRDTTG GTLTFVIYFLTQHPDVLQRLRQEILDVVGPSNLPTYDDIKQMKYLRAVLNETQ RLYPPVPWNMRYAVEDSIVPNSEPEGKPWFIPAGASVSYSVHCMHRRKDYW GPDAEEFDPDRFLDERLHKYLTPNPFIFLPFNAGPRICLGQQFAYNEMSFFLVK LLQTFEDISFERDAFEPNALPPAEWAKFPGRKGKEKFWPRAHLTLYSEGGMW VKMREAQAMGQVA >CYP63A2 PC-2(ug.20.35.1) MLVSVDALALRTLVYELTYLLYPAVPTAAALILLQRFGNVWLPTWTIVLLSL CNVPVAHRILVWLKDGRAARKAASMGAILPPRLKGRWPGSIDLLRQLTQTFE TGFLSEMLWGYMHVLGQTFEVYILWDSNYVTSDANVIKTILATDFDNFVKGE KLDVCVRPVLGTGVFNSDGEMWKFHRSMTRPFFTRERISHFDLFDRHADAT MAKMKARLAEGFAVDFQDLISRFTLDSATEFLFGQCVHSLASVLPYPHDAPA HLQTTGASRTEDFARAFAEAQDAVSFRLRMGWLWPWFELFGSRTKAPMAV VDAFLDPILRDAVARADKIKRENGGRVPEVKGEIEEDETLLDHLVNVVQTKIL HDEVLNIMIAGRDTTGGTLTSAVYFLSQYPEVLRRLREEILEKVGPTRRPTYD DIREMKYLRAFINETLRLYPAVPWNVRYPVKDTTIPGPHPDKPYFIPANTPVSY SVHCMHRRTDYWGPDAEAFDPDRFLDARVQRYLTPNPFIFLPFNAGPRICLG QQFAYNEMSFFVIRLLQHFDEVQLCEDALAPDCRVPDAWRGAPGRKGVERF WPKAHLTLYAKGGLWVKMREASTSEAVV >CYP63A3 PC-3(ug.20.34.1) MPSSIDFPDRLVLRVIAYELVFLFYPAVPAAAGLVLLRRLTDIWLPTWAIVLLS VCSLPVVHGLSIWRNHWRAARKAARMGAVLPPRLKGRWPGSIDLLMRLTDA FETGFMSDLLWEYMHTIGQTFEVYVLWDSNYVTSDANVVKAILATDFTSFVK GKK FDVCMRSVLGTGVFNSDGDMWKFHRTMTRPFFTRERISHFDLFDRHADDAM AKMKARFAEGYAVDFQDLISRFTLDSATEFLFGQCVHSLASVLPYPHNAPAH LQTTSASAAEDFARAFAEAQTVLNFRIRMGWLWPWFELFGSRTKAPMAVVD AFLDPI LKAAVERADQIKHENGGKVPEAKEEIDEDETLLDHLVKYTNDPKILHDEVLNI MIAGRDTTAGTLTSAVYFLSQYPEVLRRLREEILEKVGPTRRPTYDDIREMKY LRAFINETLRLYPAVPWNVRYPVKDTTIPGPEPDKPYFIPANTPVSYSVHCMH RRTDYWGPDAEAFDPDRFLDARVQRYLTPNPFIFLPFNAGPRICLGQQFAYNE MSFFVIRLLQHFDEVQLCEDALAPDCRVPDAWRGAPGRKGVERFWAKAHLT LYAKGGLWVKMREAPTSEAV >CYP63A4 PC-4(pc.151.16.1) MALPPGLQYLLPQLPLLLAPPAAVLLAAHAARAFAGTAAPAWALALACVLS WPVALTALVQLRAHRVAREAAARGARLPPAVEARYPGGVDLMRRNNSEVE EHIPGYRLSEFGRQYGWTYNFRMLFQDRVRGRPRPRPPGRILATDFTSYEKGA VFSAQMKSLLGTGVFNADGDLWKFHRAMTRPFFSRDRISHFDVFDRHAEDA LKLAKARLSEGVPIDWQDLVSRFTLDSATEFLFGQDVRSLSAPLPHPPTAPQA 
QHDTHDAEHPANRFAHAFLQAQLASARRSRYTAAWPLWEFWENKVEKHTR VMDEFIQPLLRDALARKAKGADAQAEEAVADGETLLEHLVKLTDDPQIIHDE TLNILLAGRDTTAITLTMAGYMLAEHPDILQRLRKEILDTVGTRRPTYDDIRD MKYLRAFINEVLRMYPPVPFNVRFSTAPTVWPSPEGDFYVPAGTRCMYSVFV MHRRKDLWGPDADKFDPDRFLDERLGKYLTPNPFIFLPFNAGPRICLGQQFA YNETSFMLIRLLQRVSKIELHPEVSPQSVAPPGWAASSISDGKDKVVFKSHLT MYVQGGLWVTMQFENPEEH >CYP63B1 PC-7(genscan.57.18.1) (genewise.57.16.1) MPHPFSRYRLRVFGDFVRIVLAPSFVFWSAVQILKLRLGLLSPAAWLTFLFAA SYARVQYRGFLQRQEARRRGGVLPPEVVGRWPGNIDILIKLGKASLTAYPGSF YLDLFEEYQSTTLNLKLLWSDLVRCLSFCRLSAVLKTLSQIITMDEEHIKHILT TGFNHFWRGRRQKERMYAPSGASRRHDTDSQGDVSQEWKKHRALARPFFA RDRISDFDLFEKYAGATLGILGGLAGRGAAVDVQDLYARFTLDAAAEFLFGE RLDTLHGALPVAGQAKLGSKGAATDDAFGAFVRAFEASQDIITTRQVRGYFW PVRELFQDKVAPHAAVIGAFLEPIVQRTLDRKAKMRAAGVSPTTEHDTFLDY LADHTEDPKVIRDQLLNILMAGRDTTACLLTYVTYVMAMYPDIMQKMRQEV LHVCGHDAPNFEKLKALRYVHAVLNETLRVFPPVPMNVREVRARGVVLPHA DPTYAAAPAPLYVPGGTVVMYLPVLTQRNTALWGDDADVFDPDRWLDARL RRFTENPMMYTPFSGGPRICIGQNYARNEATYLLVRLLQQFDAVALAPEAQP AGSLPPPEWRHARGRAAEERIWPAYAITLYVKVRLSLQWLYC >CYP63C1 PC-5(pc.101.32.1) MELHPRQYRLRFLLDVLRAIVWPQLVFNAALYLAGFHPGAFLRVVASVLAVP LLGTVRTAISQRRNKIQAGAALGAKEVPCVRGKWPGNLDIVLGFVRSLKEAY LMQFLDDLFREYDCKTLNMRLLWEDQIWTIDEAHVRYMLAGPGFEWFHKG YYWQERMESFLGNGIFNRWAQRAIARPWFVKDRISDLNIFDRHTTTTLALISE FVDRREAFDAQDLFARFTLDSASEFLFGRCLDTLHGTLPVAGRAKMGPKGTA IEDAFGSFARAFEDVQVQIARRTRIGKPWPLFELFTDKTAPSVAVIHDWLRPIV HEALAKKSAASAEKESGEDSTFLSHLANSTDDPQDIAYSVLNMLLAGRDTTA SVLSFVVYFLALHPHVTEKLRAEILQAYGPDGRPSVEDMKDLKYVRAVLNET MRLFPPVPMNLRLSDAHPRIFPASGSAPKYYVAPRTVILYSIFLVQRRTDLWG ADALEFRPERWLEPATARLLADHPFAFTPFHAGPRLCLGQNFAYNEMTFFIVR LLQRVSGFELAPDAQPEGSLPPARWKYGEGRQAVEKIWPASSVTTFIKVSLAS MPCCGERWLKRRRQGGLWVRAVPA >CYP63C2 PC-6(pc.101.28.1) QRAIARPWFAKDRISDLNIFDRHTSTTLALIADFADRREAFDAQDLFARFTLDS ASEFLFGKCAETLHGTLPVAGRAKLGPKGSSVEDEFGSFAWAFEELFHDKTA KHRKVIQDWLQPIVREALHSKAAAARGEDTGEGTFLSHLTKTTDDPQDIAYSI LNMLLAGRDTTAAALSFTVYLLALHPEVVEKLRAEVVQAYGSDGRPSVEDM KSLKYLRAVLNETMRLFPPVPLNIRTSDDTPRVFPASAGAPKYYVPPRTPVVY SSVIIQRRKDLWGADALDFRPERWLEPETARRLAENPFMFMPFHAGPRLCLG 
QNFAYNEMSFFVVRLLQRVAALELAPDAQPEGSLPPARWKNGEGRQAVEKI WPGSSVTTYIKVSSTRSRPCG >CYP502B1 pc.5.187.1 MDTVLVGLFVALALYAWSRSSKRSALPVPPGPKPVPLLGNIFDLTAKELWLR VTGWSKQYDIVYIHLLGQGLVFCNTYEVAQDLLEKKGSIYSDKCGCQNMVA FTRYGDFARRQRKLMNTAFGISAVKRYRPLLANESVLLLKRILADPQDYMGY IRRYAGGLTLQSVYGYRVETNDDPLLELGTECVDILSNKIASGGGIWPVDIFPF LQHLPTWFPGAGFKRKAAVWRAKMEEFVDKPYEMVLERMRSGATVPCFVT TLLEEARDEKGGAVDAQRDFDIRWTANSMYSASMDTTITVVQLFLLAMILHP EVLRKAQAELDAVVGPARLPTFADRPALPYLDAVMSEVLRWGVPVPLGLPH RLMEDDVYRGTHLRAGTLVFANIWNMLRNEAIWAQPDVFRPERFLEPVDEA TAKRRDPRPYVFGFGRRRCPGLHLIEESLWIVMATLLATTDILAEKDESGKPV MPHVDFTNSLVVPFSTPAPFKCDIRPRSEQALQLVRLAE >CYP505D1 ug.73.17.1 MTHEIPCPPAWPFLGHMTSIDPEYPTLSLHLFTKQYGEIYRLRLPGRDLVVVNS QELVHEVSDDKRFKKSPKGGLQELRPLIGDSLLTADYPREENWGIAHRVISPS FNPIGLRGFFDDMVDVISQLVLKWERFGPHYKIDIAEDFTAATFEVIALCCASY RMNTFYTGGTHPVATAVVDYGVEGFARGKRGRLLSWLMRSATAKFEQDKE TLLQYADELLEERKAHPTDRKDVLWAMMNRADPVTGKKMTDLSVKQNLLT LLTAGHETTSAFMSIIIYYLIKYPEAMRKLREEIDTVLGDRQMTADDLARLPYL LAVMRETLRLTPVAPGRVIEAIEATTLKGGQYAIDKGQDILVAVHSSHRDPKV WGDDVDDFRPERMLDGKFEALPPDSWQPFSAGLRACIGRAIAWQEAQIMITF LVQHFTFTLADPQYELRIKQAFTLRVHDLYVHARRRTDRRGCVTLLPPAPAP GVGLAEAKGAPHDGGEGALPMHVFYGSNMGTCEAFAQRIVADAGRHGFKA SLAALDAAVANLPTDGPVVIVTASYEGQPPDNAAHFVEWATNMRGSGAPAL AGVVYALFGCGNRDWVQTYQRVPTLVDGALAAAGAERLLPRAEGDAGSGG FFEAFARWEGALWAALETRYATMKSGSAEGAVDVEVLDAGVSRADVLRQP DTMMGTVLENRVLTAQGAPVKRHIEFKLPEQVTYKAGDYLTVLPMNPPRDV RRAMARFGLLPDQEVTIRTKTPSSLPTGRPISVYTLLSAYVELSQPATTRDLRF LSEAAKSEAEKLVFKELAENYTECVLTGRLSVLDILEAHPNVDVPFGAFLQLL PSMRARQYSISSSPLCDPTRASLTIRVFEAPTSPGRKDPLLGVASTYLGGLHPG DRVQLAVRPCKTAFRLPADPAVPLVLVCAGAGLAPMRGFLQERALQKEGGR DVGKSLLFFGCRHPEEDYLYRDEDLKKWVELGIVDVRVAFSRAQDQSLGCK HVQDRLWHDRTDVMDACDKGAKLYLCGSAKMAAGVKDKLVLVVQDAMQ LEHAAAVEQFNTMMAGRFATDVFE >CYP505D2 pc.73.4.1 MTEPIPTPPSVPFLGHIPLLDREVPMLSLALLAEQYGDIYRLIFPGRSSIAIASQE LVHEVSDDKRFRKTVQGPLGEVRAVAGDGLFTADVPGEENWDIAHRILMPA FSFMKIRDMFDDMVDVVAQMVVKWERFGPRFRIDPAVDFTALTLEAISLTTM SYRMNAFYTFVQNGIHPFAKAMNEFLQESGGRSRRGRVLSAFMRGATAKWE QNRDLMMKYVDDNARSSARKDVLDLMMNEKDPVTGRKMTELSIKQNLLTF 
LIAGHETTSGMLTFTIYYLLKYPAVMRKLREEIDTMIGDRPMTVDDVNKMPY LTAVMRESLRLGPSVPGRMIESLKDQTLKNGKYAVAKGEILVVCNFIAQRDS KVFGDDADEFKPERMMDGKFEALPPDAWQPFGAGVRGCIGRAFAWQEVQIV LVYLLQHFNLAFADPNYDLRLKQTLTLKPNEFYIHAIPRAERRRAIPLLGPRAG PTSAPVNGTNGIADEGGHPMYVYYGSNMGTCEAFAQRIAGDAGRYGFSAAV ASLDSATENLPTDGPAVVITASYEGQPPDNAAHFVEWLGALGDADSPLAGVA YAVFGCGNHDWVQTYQRVPTRVDEGLAAAGAERLLPRGEGDAGAGDFFEA FTRWEAALWEALGKKYETAKGSGKEAGVQIKVTNATVSRADALRQADTMM GTVIENRVLTAPGAPEKRHLDIRLPEGTTYNAGDYLAILPTNPSRDVRRALAR FGLLPDQEITIESASPTSLPTGRPISAHTLLSGYVELAQPATTRDLRLLSEAATS DAEKLVFQKLADNYAEEVLAARLSVLDILEAHPDVNIPLGAFLQLLPTMRVR QYSISSSPLADPTQASLTIRVFEAPCTAGRKAPLLGVASTYLGGLHAGDRVAL AVRPCKTAFRLPADPALPLVMVCAGAGLAPMRGFLQERAAQKRAGRDVAKS LLFFGCRDPAEDYLYRDGDLAEWTALGIVDVRAAFSRARDQSLGCKYVQDR LWHDRADVMAAWDKGAKLYLCGSAKMAAGVKDKLVLVVQDAMQLEHAA AVEKFNMMMAGRFATDVFE >CYP505D3 ug.73.15.1 MSQPIPMPPSVPFLGHVTTIDAELPVMSFRLLAKQYGEIYELNMLGRCILWML VINTQELLHEVSDEKRFRKIVSGGLNEVRNAAGDGLFTAHADKEQNWAIAHR ILMPSFSAMNMRNMFDDMVDVVSQLVLKWERFGPYHKINPADDFTALTLEA ISFCAMSYRWVIFYSIYVRNDVHPFARAMSDFLLESGARARRPGIIAPFMRSA NAKYQQDIDVLMNFVDEIIADRRAHPTDKKDILNVMLHAKDKETGLGMTED NIRRNLLTFLIAGHETTSGMLTFIMYYLLKHPEAMRKLREEVDTVIGERPMTV DDVNKLPYLIAVMREALRLGPPASARGASPYEDTTIGGGRFAVPKDTFIMCSL YNIHRDTKVWGEDAEEFRPERMLDGKFEAMPPDSWQPFGYGMRGCIGRPFA WQEAQIALVYLMQRFTFAMADPGYDLRLKQTLTIKPHEFFIHAIPRADRAHG APLFSTPSPLRPRAASSAQPPADTAGRTPVYVLYGSNTGTSEGFAQRIASAAA GKGMYSRSTIGTLDSAAAHLPTDGPVVIVTASYEGQPADNAAHFVEWLSSLQ GTELEGVRHAVFGCGNRDWQATYQRVPTLVDDALTARGSIPLVLRGAGDAA ASDFFEAFEKWETGLWGALREAYGVATGANAESGISIETLDTGKGRASILRQP DAALGTVVENRVLTAPGAPEKRHIEFKLPEGMTYQTGDYLAILPVNPQRDVH RALARFGLLPDQEITIRSAGPTTLPTDRPVNVSTLLSGYVELGQPATTRDLRLL SEHAKSDSTKAALQALLDNYASDVLGARLSVLDILEAHADIALPFAAFLDTLP SMRVRQYSISSSPLADAAHASLTISVLAAPARSGRPERFLGVASTFLGGLRAG DRVPLAVRPSAAAFHPPADPSVPLLLVGAGAGLAPLRGFLQERALQKKAGRD VAKSILFFGCRRPDEDLLYGDAELKEWQELGVVDVRPAFSRAPEHSFGCKYV QDRVWHDRAEAVATFKAGAKLYICGSSRMAAGVKEQIVLIVQEDSKLEYPE AVEKFEKIMVGRFATDVFE >CYP505D4 pc.73.11.1 (ug.73.16.1) 
MTHPIPTPPTVPLLGHATLIDHDFPMGTNALWAREYGEIFRMCFPGRTVYVVS SYELVHEASNDKLFRKSVGGPLAELRSSVGDGLVTANVPGEENWGIAHRVL MPCFSTISLRNMFDDMVDVVSQLVLKWERFGPHYRIDPAEDFTALTFEAISLC SMSYRMNPFYNSAMHPFAAAVVDFQVECMARSRRGKLLNALIRSAKTKFEQ DRDLLMQYADETVLEDRKAHPIEKKDVLWTMINRADPVTGKKMTDLSVKQ NLLTLLMAGHETTSGMLTFAMYHLLKNPEAMRKLREEVDTIIGDRAMTADD LSRLPYLVAVMRETLRLSPSAPARIVQAMEATTLGGGKYAIAKDDTLLIATYV SQRDPAIWGPDAEEFRPERMLDGKFEALPPDAWQPFGAGIRSCIGRPFAWQE VQIVLVSLMQRFTFAFADGHYDLRMKQTLTMKPHDFYIHAIPRTDRARVPPL LGVRAAPAQSTDGEKGKVEAGEGAPPMYVYFGSNMGTAESFAQRIAGDAGR HGFKATVAPLDAAVEKLASDGPVVVITASYEGKPPDNAGHFVEWLSNLGDES ALAGVSFAVFGCGNRDWARTFQRIPTLVDDALGAHGGARIIPRGVGDASTGS FFESFANWEEGLWAALAEKYETAKPTSVGGLELVVTDAGPGRADALRQPDT TMGTVVENRVLTAPGAPVKRHIEIQLPEGTSYTAGDYLAVLPTNPPRDVRRV LKRFALLADQEITIQSADPTSLPTGRPVNVYALLSGYVELAQPATTRDLRLLIE ASSTDAEKQVFKELADNHAERVLKPRLSVLDIVEAHPSVHVPFAAFLQLLPA MRVRQYSISSSPLVDPARATLTIRVFELPGAPARRPHLGVGSTFLARLAPGDR VQLAVRPCKPAFRLPADPTVPLVLCCAGAGLAPMRGFLQERAMQKQAGRDV GKSLLFFGCRDPQEDYLYKDDDLKAWVDLGIVDVRVAFSRAPDQSLGCKYV QDRIWHDRADVLAAWNQGAKLYLCGSAKMATGVKDKLVHVVRDATGVDE ASASDKFNEMMDRFRYRYFRVRYEHVVSAAFTYCIWNYLRDHSTRFNCPIA MACTFRLGAIARGSICATWLRWREE >CYP505D5 pc.73.14.1 MTTPIPSPPSIPFLGHVTIIDREVAIYSYNLLAKQYGEIYQLNMMGALRIIVICSQ ELLHEVSDEKRFRKIPRSALEQVRNAVGDGLFTANGDDPNWHLAHRILMPAF STMNTRNMFDDMVDVVNQLVQKWERFGPRHKIDPAQDFTALTFEAITFCAM SYRELTLPQEGVHPFARAMADFLVESGNRALRPGIVQPFMRSTNSKYEEDIKI MEHYVNDIYEQRKANPTDKKDILNLMMYGKDTQTGEGLSEKTIKDNLLTFLI AGHETTSGMLTFIIYYLLKNPEAMRKLREEVDTIIGSRPMTVDDVHKLPYLIA VMREALRLGPPAPMRGAASFEDTLLKGKYPVAKDVPIYCGVYMVHRDPKV WGEDAEEFRPERMLDGRFEALPPEAWQPFGFGVRACIGRPFAWQEAQITVVY LMQRFTFVMHDPSYDLQLKQTLTIKPHEFFIHAIPRTDRPSIVPIPTPSSTLLRD QTAPAAQPPVTTPGEGGGHRMYVLYGSNTGTCEAFAQRVASDATVHGEVSV FIGTLDSAAGHLPSDGPVVVVTASFEGQPADNAAHFVSWLTALNGSALADVS FAVFGCGNRDWASTYQRIPTLCDDTMAARGGKRLVPRGEGDAGSSDLFESFE HWEAGLWEALQKTYGTTKVEGRQEAIKVSTVDAGTARATALRQPDTMLGT VVENRLLTSPGVAEKHHIEFQLPDGLTYRTGDYLAILPMNPSRDVQRVLAHFS LLPDQEVTISAAGPSPLPTGRPVNVSSLLSGYVELSQAATTRDLRILMSAAKSE 
DTKAALSELLDGYAEKMQAARLSVLDILEAHPGLDISFALFLQLLPSMRVRQ YSISSSPLADPTRASLTVSVLSAAPTAGRREPFLGVASTYLASLRAGDCVQLA VRPSAAAFHPPADPAVPLVLFCAGAGLAPMRGFLQERALQKQAGRDVAKSIL FFGCRSPQHDFLYADSDLRTWTELGVVDVRPAFSRDTEHSAGCKYVQDRVW ADREDVVKVWKAGAKMYVCGSGRMATAVKQKLVEIIAAQLNVDSEKATET FNNIIKGRFATDVFE >CYP505D6 pc.17.40.1 MTSTIPTPPSIPFLGHVASIEREVPLRSFRLLSEQYGEIYELNILGRKLLVVSSAK LMSDVSDDKKFYKNMSGPLMQVRNAVGDGLFTAYGEEPNWGIAHRLLMPA FGTASIRDMFPDMLDLASQLVLKWERFGPKHRIDPAEDFTRLTLDTIALCAMS YRLNSFYRDSSHPFVQSMVDFLVECNLRANRPGLLTSVMVQTNAKYEEDIKT MTELADEIIAERRRNPTDKKDLLNIMLYSKDPKTGQSLSDVNIRNNLLTFLIAG HETTSGLLTFALYYLIKNPEAMRKAHEEVDEVLGDQQIQLTDIGKLKYIDAVL RETMRLSPTAPMRTVRPFEDITIGDGKYFVPKDYTVVINTIVAQRDPTVWGED SNEFHPERMLDGKFEALPPNAWQPFGFGMRACIGRPFAWQEAIIALAVLLQK FDFVLDDPSYELELKQSLTIKPAHFYVHALPREGKPQLLATPSAAPFSSHARET TNASLPASPGTEAKQPMYVLYGSNTGTSESFAQRIANGAAAHGFRATLGTLD SVADHLPTDGPIVIVCASFEGEPADNAAHFVERLTSLQDKPLQNLRFAVFGCG HHDWFRTYQRIPKLIDQTLEDRGAQRLVPRGEGDAGSSEFFEAFEAWETKLW EVLPEEYNTVVKQDITSGLKVETVGEGATRAVDLRQHDAALGTVIENRVLTA PGAPQKRHIEFELPEGVTSRAGDYLAILPSNPPQDVHRVLARFGMLPEQQIVIS SSGPSSLPTGRQISAFDLLSGYVELSQPATARDVRTLLNIDSSDATKESLKALL ESYSDAVLGRRLSVLDLLEQYPDIKLPFAAYLALLPSMRIRQYSISSSPLWNAQ RVTLTVSVLEAPALSGRKEPFLGVASTYLANLRPGDKVQMAVRASNAAFHLP QDPRTPLVLFAAGSGLAPMRGFLQERALQKKAGREVGRAVLFFGCRRPDED YLYSDSDLKEWEELGVVELRPAFSRAPEKSEGCKYVQDRVWHDRRALDGLY EAGAKWFVCGSGKVARGVKEVLTAMIKESRGYSDEEAAAAFERATVGRFAT DIFE >CYP505D7 gx.187.5.1 ATPIPSPPSVPFLGHVTIIDREVAIYSYNLLAKQYGEIYQLNMMGAKVVVICSQ ELLHEVSDEKRFRKVPSSALDQVGNAAGEGLFTAHGDNPNWHLAHRILMPA FSTMNTRNMFDDMVDVVNQLVQKWERFGPRYKIDPSQDFTALTLEAITFCA MSYRYGRIXVHPFARAMADFLVESGNRALRPGIVQPLMRATNSKYEENIKIM QKYVDDVYNQRKENPTDKKDILNLMMYGKDPKTGERLSEKTIKENLLTFLIA GHETTSGMLTFILYYLLKNPEAMRKLREEVDTMIGSRTMTVDDVHKLPYLIA VMRETLRLGPPAPARGTAPFEDTLLKGKYPVAKDGRIYCGIYMVHRDPKVW GEDAEEFRPERMLDGRFEALPPEAWQPFGFGVRACIGRPFAWQEAQITVVYL MQRFTFVMHDPSYDLQLKQTLTIKPHEFFIHAIPRTDRPSIVPIPTPSSTLLRDQ TAPTAQPGPVTTPGEGGGHRMYVLYGSNTGTCEAFAQRVASDATVHGFKAV IGTLDSAAGHLPSDGPVVVVTASFEGQPADNAAHFVSWLTALNGSALADVSF 
AVFGCGNRDWASTYQRIPTLCDDTMAARGGKRLVHRGEGDAGSSDLFESFE >CYP512B1 pc.30.92.1(genewise2nd.30.46.1) MSLHQVYDAVVAHGDMGTLLVYMAYSVPLVMFLYSLFSPASLRHIPTEGGP SFPLLSYKAARAYLRDATGILQRGYDKHKGKPFKVAMPDRWVVVLTGKKLV DELQRLPDDAVSFIKGASDLSGTEHMFGRQVIDDPFHVPIIRTHLTKNLAPMFS DVFDEVSIAFQELIPACDGEWVPVHAIKVARSVVARTSNRIFAGLPICRHPEYL NLVINFTVDVAKGRYALLLFPPALKGIAAKILTNIDGRIKEGLKYLGPLIEQRM ALAEKFGNDSSEKPDDMLQWIIDEVRARNQSVFEVVRTVLLVNFAAIHTSSNS FTHALYHLAANPEFIAPLREEIETIVSEEGWSKAAIGKMWKLDSFMRESQRYN GINSVSVKRKALKPLTLSDGTFIPKGTVLVTPTVATHFDDDNYKNPTVFDPFR YYREKEQDMSAVKHQFVTTSPDYVSFGHGKHACPGRFFAANELKAMMAYV VVNYDVKFEKEGVRPENIYAAMGISPDPNARVLFRKRESIVSV >CYP512B2 pc.30.93.1 MTPSHSFPDLVASCISAWTLCFALGFSIAAASFYSLFGPFNLHHIPTVGGSSIPLI SHRGARKYMRDAKGVLQDGYKHKGKAFKVALTDRWLVVITGKRLVDELQK MPEDVASFVGAVADFQGLRYIFGQKVLDDPFHVNIIRSHLTKHLSSVFGDICD EIYVAFSELIPQQDEEWVPVHAIQVVRTIVARASNRVFVGLPVCRNAGYLSLA VNFIVDVAKARDFIALFPPVLKPLAAKMTSDIGTRVQEGMQYLEPLINERLRL MEKFGKDWTDKPNDTLQWMMDGIMERDGTIEQLVRIVLLENFSSIHSSSNTF THALYHLAANPEYITPLREEVETAISEEGWTKAAMSRLRKVDSFLRESLRLNG INPVSMQRKALISFTFSDGTYIPKGTILVTPALATHHDEDNYEDATTFKPFRFV GENPEDDVPLVTTSADFVPFGHGRQACPGRFFAAHQMKAMMAYLVLNYDV KFENEGVRPQNVHGVLSVQPDPKARVLCRRRKSSYT >CYP512B3 pc.30.113.1 MASNHLFSGVLPLDRAASTLGYLVCGALLALLLQNILTTISLRHIPTVGTSTLP LLSYKGAYDFTRDIKGVFQQGYAKYKGRAFKIAFTDRWFVVLTGRKLLEELH RLPDSTTSFNHASGSITGSTYIYGRGWLSDPWHIPIIRDRLTKHLAASFGDMYD ELETAFRELMPSCEEEWVPVHFITMARTVVARTSNRVFVGLPACRNLGYHTL LVNFALDVSKARNRLAWLPPALKRVAARALTRIDSRIEEGMQYLGPTIRARIV EMERYKGDWPDKPNDILQWIMEELIARKMPMEEAVRIILRINSSAVQTTANSL THAIYHLAANPDLIAPLREEVDAVITDEGWTKLAMSKLSRLDSFMRESLRLNI VNPLSVRRMALKSFTFSDGTFIPKGTLMVTPAHATHLDEANYEHASVFDPWR FVHQKEEDLSPTKHQFITTSPEFVAWGHGKHACPGRFFASNELKAMMAYIILN YDVKFARAGVRPDNVYSGLTVAPNQEANVLFRRRQTQ >CYP512B4 pc.30.114.1 MAFSDVVATVGAGPWVAYMMCAVLLALLLYSLFSPASLRHIPTEGGSSLPLV SYLGAYNVLRNLQSVLQRGYDKHKGKAFKIALPDRWVVVLTGKTLVDELQR MPEESASFIDATTELTGFGYIWGPRMRKDPCHVPIIRNQLTRQLSSAFGDIYEEI ELSFQGLMPACEKDWTPVHVIEVARDVVARASNRVFVGLRVCRNPDYLDML VDCAVSVASARNTLMLFPFVLKTFAAKNVVNMDRRIRRGMQHLGPIIEERMS 
LLRSLGNDWPDKPDDMLQWIIDEVAARQMPKEDVVRNIMFLNFAAIHTSSNS FTHAIYHLAANPDYLGPLREEVEAVTAKEGWSKTAMGRMWRIDSFLRESQR VNSINPLTVIRRTRTSLTLSDGTFIPEGTVVAAPAYPTHFDDENYVGGDTFDPW RYVREKEQDLSPSKHQYVTLSPEYVPFGLGKRACPGRFFAANELKAMLAYLV VNYDVKFEKEGVRPENMHVGLTISPDPAAKVLFRKRRS >CYP512B5 pc.30.118.1 MARSDILDALSLGRTELSTTYLVFGLFLALFLYSLFSPASLRHIPTVGSSSLPLL SYKGAYDFLRDGRSLLQRGYNQYKGKVFKVAFTDRWLAVVTGRKLVEEVQ RLSDDVISFPDASGEVTGFKYIFTKCALSRDPFHVNMIQRQLTKHMSVAFDDL HDEFETAFKELLPHNETAWVPVHAIEVARKVVARASNRIFVGLPVCRDKAFL DLMVNFTLDVARARDLLALFPPALKPFVAKLVVKLDSRIEEGMQVLRPIIQER MEIIEKFGKDSPEKPDDMLQWIIDALVERNEPMEQVVLITLFVNIAAINTSSNS FTHALYHLAARPEWIAPLREEAEAVIGNEGWTKNAMGKLVKIDSFMRESQRY NSIVPLTCMRKALQPFTLSDGTHIPRGTILVTPAIATHFDDEHYADAASFDPSR YVPVADAKQGGAPKQYVTTTAEYVPFGHGKYACPGRFFAGTELKAMMAYL VLNYDVKFAQEGVRPPNAATTLSTRPHQEARVLFRKRNSSVQ >CYP512C1 pc.30.76.1 MSSMNAPALPATHIVAGAILVWLLVRTFGTQNLRHIPTEGGPSLPIISFLGLHA FLTRSREILEDGYQTHKGRAFKVALIDRWLVVLSGKKLVEELQKMPDDTVES ATTEMFNMQHVFASNWHKDPVHSSLLRSLTRNLGVVFSDMFDELDTAFREC VPANAERWLPVQAHTTMASIVTRAANRIFVGLPVCRDAGYIHMMIHVAEDV SDAVRTLSMLPTFMKPFVARRATVIDQRIQQCLDYLRPAIADRMSMLERFGK DWEEKPNDVLQWIIDEVTARNQGEDEVARIVLFINFGAIETTSFAVTHALYDI VSRPGLADVLREEVEAAVATEGWTKAATNKMRKLDSVLRESQRLNGPTTAS MFRRVLQPVTLSDGTYLPAGTTVVTPTLATHFDDTNYADAQTFDPLRFYKPD GVQAQLVTTSADFVTFGHGKHACPGRFFAANELKAMMAYILMHYDIRPERE GVRPENVYRGLNVLPDANARVFFRRRQTD >CYP512C2 pc.30.77.1 MVLTTDFGGISTTHVVIGAFVTWLLLRYFSAKNLRHIPTVGGPSVPILSIVALY NFLANGKKVVLDGYQKYKGKAFKVALLDRWLVVLCNPKLVEELQKLPEAL VGYSLLEAXGTIFETKHIFGADLLTDPVHLVLLRTLTRNLGQVFGDMYQEVET SFQELVPANEKEWLPVHASPIMRTIVTRAANRVFVGVPVCRDEGYLHLMVHF AEDVNKAFGLYTVVPSFAQGFVARKAKAVMDDCIERCLGYTRPTIKDRTTM MDSFGDNWADKPNDMIQWTIEETKARGQGEYDMARMLMFINSGAVETTSQ AVIHALYDISVRPELADELREEVERAIAEDGWTKDATNKMRKLDSFLRESQRI NGPMIVSMFRLVREPVTLSDGTFLPAGTTIASPTLGAHFDDSIYPNASTFDPLR FYKAEAAGQPQFVTTSPEYLTFGHGKHACPGRFFAVNELKAILAYMLMHYDI KPEQDGVRPENKSMGLGVLPNPDAKVMFRKRHAG >CYP512C3P gx.30.36.1 60% TO 512C1 PSEUDOGENE NDLLQWIMDEAVARDKS (FRAMESHIFT) QEEIARMVLFLN (FRAMESHIFT) FGAIQTTSCV (1) 
LSMFPNILTPVTLSDGTFLPAGTTVVTPVLATHYNEDNYTNAALFDPFSCKDN RSGGQQFVRTSADCVTLGHGRML (1) CPGRFFAAAELKTLVAYVLVNYDLRPETEGVRPVNIYKGLT V*PSETAKVLFKKRQTDE* >CYP512D1 pc.27.9.1 MQSSVGEVFASAPTLAKLLAGAAFVLFLNSLWNIWKLRHIPTVGGPAIPILCYI GTFRYLQDPQKILQEGYEKYKAKPGMFKIAAPDRWLVVVGHPNLIDELQKHS DEQVSFMDAATEFVGTRYALPGTIADDPWHIPPLKQHLTHAIGSFFGDMLEEL RVSIEERIPSNEKDDEWVAIPALDTFFWVFTRVIDRIIVGLPIRDTEFIKLMVEFT MSIGIARFFIGLVPPMLKPAMAKLAARGVHKATAEAEKMLAPVIADRVRHLD EFGEKWADKPNDLLQINIEEARAQGRPLDEIVIRTIVSIFVGVSTSAASFVHVL YHLAADPELQAALRTEVEGAIARDGWTKAALVGMHRVDSVLRESQRVNGIN SVSVMRTALQDITLTSAGAPVCLPAGTLCVAPERALHADKEHYPDPDAFVPF RFAELRATADARGGAQHQFVSTSTRYVPFGHGKHACPGRFFAGNEMKAAVA HLVSNYDVRLPDGASTRPPNELFGLAIVPNRSAKVMSRRRQPVV >CYP512E1 pc.154.15.1 MSDYSSLLAYIFISLATLAYLKRLLWPDRQQLEHIPAIGPTAPILSYWGAFRWL SHGTEITQKGYAKYKGRPFKVANFNRWLVVVSGPKLIDDIRKAAEHELSFEE AAHENLEVRYTAGPCIAENSYHVPIVRGQLTRNLPFLFNDVRDEVAKAFGDHI PPTDDWTPVAAHPVIMQIVARATNRVFVGAPKCRDPDWLDLSIQFTADLILG AHIITQFPQFLKPLAARFFTRVPAAIRRGRRHLERTIEHRKACLEQYGADWPD KPNDLISWLLDEAKGEERTIHNLVTRVLTLEFAAIHTTSNSFVHALYQLAAHP EWAEPLRDEIEQVVKREGWSKSSLDKMHRLDSFLKESQRYYALGGVTMDRR AMKDFTFSDGTVIPEGTFVGVAVLATQHDPQYYDDPDTFNPWRFSDLREESD ESGRHLLVSTGIEYFPFGHGRHACPGRFFAAIELKLMLAHIVMNYDVKAELD GVVPPILEFGQNLAPNMKAKVLFKNRQRS >CYP512F1 pc.15.28.1 MISIDSISLISGLISFAFIAYYLRQDKLQHIPSPGPTGPISSWYAAYKYIRGDAPQI IEEGYRKYKGRIFRIADLNRWTVVVTSPSLVEELRKAPEDVLSFHEGIRYSLQL DYTFGREAVEHEYHIPVIRTQLTRHLTPLFADIHDEIVQSFTDLVPPSDTWTSV RVVPTVMQVVSRTSNRVFVGLPYCRDPAFCALAVRFATDVVKTGVALHLTP RALKPLGVRLVSPVSKRVEEGKGDHRQAGGRPAPRQPGGRPARAAQGACGN WPDKPEDMIQWLIEEANEDERTVEGLVLRILIVNFAAIHTSSMSFTHAVNMLA AHPECIAPLREEIEEVVREEGWTKAAVQRMRKLDSFMKECQRLHGLGAVTM SRVALQDYTFSDGTRIPRGTLVMAASRPIHHDAALYVPDADAFDPWRFARLR AADADASIKHQMVHTSAEYLAFGHGKHACPGRFFAVNELKLMMAHVLHTY DIRPQTSVPPGRWIRHSLLANPIATVDLKKRQT >CYP512G1 pc.16.37.1 SVPTMGPTAPLLSYWGVLRYMTRPRDVLREGHVKFGGRPFRVASPLRWQYI VSSPELIDELRRAPDAELDPLAAADDILHFVKALGTKFASNTYHVPIVRTTLTK NIGTLLPSVLDEMRVAFARYIPADKEWHPVVAHDTNVRIVTQTSSRIFVGLPL CRDPELLKITMSYTPTVMKTGLLLKVLPRPLQSFVQRGSGSIDALIDRAHRLLL 
PTIEERRRMMDKYGAEWLDKPNDMLQWLMDSAEGEERTPRGLAARMLAVY FAATDTAALGFTVALYRLATHPEYVQPLREEVEAVIAQDGWTREAFRKMPK VDSFLKECMRLQGPSTLLLQRKAMQDFTFSDGTFVPKGSHVATSIVATHCDS AYYSDPLTFNPWRFVGAEDDAQDSKHRFATTSPEYLLFGYGRHACAGRFFAE IQLEMMMAYVVTTYDVRMEKPGVLPEPIEFGSMSLPSMTAKVCFRKRATE >CYP512G2 pc.15.22.1 (genewise2nd.15.10.1) MFADSPTSLYALVLLGTVVYLLNWLKGSKYKSVPALGPTAPLLSYWGAIRFF LDAQGMLQEGQLKYGGSPFRIATRRYWQYIVSSPKLIDELRRAPDDELSFLDA VNEALELEYTMGAATANNLYHVPVIRNTLTRNLGNLSSEIYDEISNAFADCIP ARDEWMAVPALQSIMQIVARTSSRIFVGLPLCRNREFLEISMTYTTDVVKTGL LLNMVPGPLKPIVNRLFSKVEQHIDRTHALLRPIIEERQRMMEQYGDDWPDKP NDMLQWLMDAAEGQEREPRALALRILIVGFAAIHTSSMSFTQALYYLAAHPE YMQPMRDEVEAVLAAEGGWSKGALQKMRKVDSFLKECQRYEGLGMLFLTR KAVKDFTFSDGTFIPKGSYVSTSRAATHGQSEYYRDPYVFDPWRFANLRDET GEGVKHQMVNTSIEYLPFGLGKHACPGRFFAANELKSMMAHLVVTYDVALD MPGEVPRSVHFGPINSPNRTAKVLFRKRRG >CYP512H1 pc.21.108.1 LQDIPTVGGPNLPFFSYFGAIHFLVRANKIISDGYSKYKGGSFKIAQVNRWLVF VTDPALNEELRKAPEDQMSSPAALHQYVQGLYTMGFGLDGMLYYIEVLRDQ LLRHVNPSPAMLYDEMQASFQDFIPQNTEWTPIPALSTSLKLFLRMSNRAFVG TPLCSNSEYLDLVTEFMNNVFKGAFFYNCLPAFVRPILARWLDLVNPCLDRA VRLLTPIYATRVAELEAAGKAEWAGASDDLLSCLVASHYSAARDVRELARIL LVVNLAAVHTVSQSFTSVLFLLASRPAWQAELRAEAAAALAHGYTRDALAR LRKLDSFVQESLRFNGLGALASTKLALTDFALSNGTVIPKGTLVSAPLRALHL DDEVYPDGASFQPWRFVRAGGEAAPRQSLASTSPTYLPFGHGKSACPGRFFA ALELKMVTAYLVLNYDLKLEGDATEVPPVSWFITARVPNYKANVLVRRRQE KA >CYP512J1 pc.37.85.1 NIPTLGTEMPVLSLWGALRYVMNSHNVIQEGYLKYKGRPFKIAQFDRWLVVL TTPHHIEELRKAPETGLSSRDAVDSLLKAKYTLGMDVDTVQRYLDLFREKLA NKLGGLTSEVHEEMELTLNESLPKSEDWEEFCVLPSILKILFRTTNRAFVGAPL CRSEEYVALGEEYTTNVFKGAVIYNALPKALLPLLSKVLDFIGPTSRRCEQLFR PEMDKRAAYLEEHGADARGKYDDLLTWLIATHGAGDEINYSELTRIVLIANL AAVHSTAMVFTFAIFHAAADPGVADALRAEVASVVAEHGWTPTALTKMQRL DSFVREVQRMHTLGAALVMKIARTDYAFADGHVVPRGALVTAPATTVHRD DEHYPDAHTFRPWRFVGTPDESEADSARRKATSTSPTFLAWGHGKHSCPGRF FAVRELKMLLGSVLLRYDVRLKTPGVLPQDQWYLTFRVPDPTACVLLKRRTT A >CYP5035A1 PC-hn-2(pc.12.112.1) MADTSLLSRRLKSFFDPQGSPTLLSLPDSFVHLIFKRWEPMKLPIVAFLLFLVP ACLSLLFASHLSLTKGLATAFATFYTVLVSSIVIYRISPFHPLARYPGPLAAKIT 
KWWHAYHVHTGKQHLYVRRLHDQYGDIVRIGPNDVSIRDASCISSGLGSQGL PKGPMWDGRFMYSPIPAMVGARDHAYHMQRRRPWNRAFSATALKEYEPLIY GRVHQLVSALADRQGQVVDIAKWIGYFTYDFMGDMVYGGWTEMLRDGKD EDGLWDVVHRGLEDVSAVYGEVPWVSYYASMLPNVGKDLKRMRKMAFDR AKQRYDSGSKARDLFYYLSNEDGAEKVTPPRPIVVSDGVLALIAGSDTTAIVT ATILYSLLCNPTTYHRLQQEVDKFYPRGEDPLNPKHYKDMHYLEATINEGLR LFPATPSGTQRAPAPGKGDRLIGKYYIPEGTATKFHFWSIQRDPRNFSHPDTF WPERWLVAEGLEHADEPLTHNANAFVPFSFGPYNCVGKNVAMQEMRMLLC HLMHTLDLRFPEGYVPRAFEDALEDQFGFKVGELPVIVQRRE >CYP5035A2 ug.97.52.1 MGSDAQLPVLSPRDAFAIIVLSAVGAHLVFKRWEPKKLRVVTFLLFLVPACLS TLLLPHFGTALGLTVGFLTYWTALTLSIVFYRVGPLHPLYQYPGPLPAKISKW WHVWHVQQGKQHLYLQQLHDKYGDIVRIGPNEVSIRDPACITPVLGAQGMP KSDMFLGRNMWPETAPLIGYRDPAEHMKRRKPWNRAFSSASVKEFEPIIQHR VHQLVEALSDRQGQVVDLAEWISFFTYDFMGDMVFGGWTEMMRDGADKG GLWDLLRRGLTVSALWGEVPWVSYYAKKLPWTAQDNKAMRVMAFSRTEQ RYASGSASKDLFYYLSNEDGSEKVSPPRNIVIGDGLLALVAGSDTTATVVANT MYELLRHPAAYRRLQEEVDKFYPRGEDSLDPKHIKDMHYLEAVINEGLRMY PAVPSGSVRAPEVGKGGKIAGPYYIPEGTQTRIHFWSVQRDARNFSFPETFWP ERWLIAEGIEPAPAGEKLVHNPNAFTPFSFGPYNCVGKNIALAEMKQLLCHLV HKLDVRFADGVDPDAFDRASEDRFIYVVGELPVVVERRD >CYP5035A3 ug.53.54.1 MAGDLSTRDALGIIVVSALGTHAIFKRWEIKHILVVSTLLLFLPAALSTLLIPHL GSFKGIAAGFSVYFITLLSSITLYRISPFHPLAHYPGPLLPKISKIYHIAKVSSGK QHLYLQELHNQYGDIVRFGPNEVSIRDASCIMPVLGAQGMPKGPMWQGRHF WTEVHTLIGFRDPKAHQRRRRPWNRAFNTAAMKEYTPLMQNRVRELGDAL VARQGQVVDLAEWIGFFTYDFMGDMVFGGWTNMVREGGDRERLWEVVKS GLKIEFIYDNIPWLSYYTRNIPGAGNAELRAMAIGQTEKRYSRTSTSKDLFYYL ANGAEKEDPPKHTVVVDGGLALVAGSDTTSSVLSSIFYCLLRHPDTYDRLQA EVDKFYPPGEDSLDPSHLSDMNYLEAVISEGLRLFPAVPSGSQRAPEIGTGGKL VGPYYIPEGTQTRIHFWSVHRDPRYFSRPEAFWPDRWLIAEGLQAHAAGDEPF VHNPNAWTPFSFGPSNCVGKNLALQEMRMVLVHLMHRLIVRLADGWDPAQ YEREMEDRFVFSIGRLPVVVERRD >CYP5035A4 pc.53.86.1 MAAREAILIIALFAVVSTISHVIFRQWEVMHCSVVLGLLVVIPAVLSTPLVSDF GIPCGLALGFTTYFAVLLLSITLYRISPFHPIARYPGPLLAKISKIYHVSKIWSGK QHLYLQRLHEKYGDIVRIGPNELSIRDVSCITPALGAQGMPKGPMFNGRHLW PETHSLIGFRDPKEHQRRRRPWNRAFNTASVKEFNPIIQARVQELGDAFAARE GQVVDLAEWIGFFTYDFMGDMVFGGWTHMVREGADHNGLWQLIKSGMKV SFVYEHIPWLSYYVKKLPGAGSDLKMMRAMAFGQTEKRYATTTTTRDLFYY 
LTNEDGSEKVDPPKAVVISDGALALIAGSDTTSTVLTSTFYCLLRNPETYKRL QEEVDMFYPAGEGSLDPKHLPEMHYLEAGLRLFPAVPSGTQRAPEVGKGGK AIGPYYIPEGTQTRLHFWSIHRDPRNFSHPEMFWPDRWLIAEGLQECVGEKLV HNPNAWLPFSFGPSNCVGKNLAMQEMRMLVCHLVQRFNFRFADGYDPAQY ERDWQDRFVVMIGQLPVTIERRA >CYP5035A5 pc.1.6.1 MPSGILGQLQSLPAKLTAQDATLVAHVIFKIWEPMQARIVSLLVIIAPLLLSTLF IPHYGTVSGVFRSFAIYLTTLVSSIVVYRLSPWHPLARYPGPLLAKVTKLYHAL MVSKGKQHVYIKALHDQYGDIVRIGPNEVSIRDAACIQPLMGAQGLAKGPSW SGRSMFPPISPLIGIRDPAEHARRRRPWNRAFNTNGIKEFMPTIQTRVQQLAEH LGERHGQALDLAEWFSFFTYDFMGDMIFGGWTEMMRDGGDLQGLWTRVK AGLQHAGMVPEHVPWVAYYAKKIPSVVRKVSEMRGMGISRAKMRYQQGST SKDLFYYLSNEDGSEKVTPPPEVVTSDGALALVAGSDTTSSVLSNLFYCLLRD PVSYKRLQEEVDKFYPPGENSLDPRHINNMPFLEAVINEAMRLYPVVPSGSQR SPEIGKGGRAVGPYYIPEGNQARVHFWSVFRDSRNFSHPETFWPDRWLIAEGL QESPEKITHNANAFVPFSFGPANCVGKNLAIQEMRLAVTHLMHKLNFRFADG FNPDEWDSQIQDVTVMQLGKLMVVVERRD >CYP5035A6 pc.42.19.1 (genewise.42.13.1) HLIYKRWEPLRLSVTLTLLMGVPAALSVLLIPHLGLLRGALATFSLYLSTLISSI VAYRLSPWHPLARYPGPLPARVTQLWHTWQAHKGQQHLYLKQMHDKYGD VVRMGPNEISIRDADCIVPLYGPHGLPKGPSAGRQMHPQELSLIGYRDPARHS VRRKPWARGLGTAAVREYMPALRSRVSQLVDALGARSGHPVDLAEWIAFFA YVFVSDCSLSXLANCQSGMTLWATWRECHGAGAVFEQVPWLAYYAKMLPA ISQRILKMRKLTVRHATRRYNSGSFSRDLFHYLSGEDLPEGSARPPQHIVAAD GLLAVVAGSDTTASALSNLFYCIMRHRDVYKRLQQEVDQFSPLGDDSLDPQH LNNMPYLNAVINETLRYLPSVLSGSQRAPLIGGGGVSVGPYYVPEGNQVRVH FYSVHRDPRYFSDPDRFWPERWLIADDRQPSSEKIVHDDRAFIPFSYGPSNCV GKGLALQQMKSTVCHVMAKLEMRFADGYDPDTWEEQVQDEGVMIV >CYP5035A7 gw.54.121.1 (genewise.54.12.1) KSVHLIFKRYEPMHILVVSTLLLLLPAILTVPLIDQLGIAKGFLVAFATYFATLL SSITLYRISPFHPLARYPGPIIAKVSKIYHVAQVWSGKQHLYLQRLHDRYGDIV RFGGFSTCSLRGPNEVSIRDVSCIAPMLGTQGMPKGPGKHCWPEIHTLIGCSDI KEHQRRRRPWNRAFSTAAMKEYNPIIQKRLQELGDALAARQGEVVNLADWI SFFTHVPWLSYYTRNIPGATNDEFRGMVFGQTVKRYACTGTTKDLFYYLNED GAEKEDPPKPIVVVDGAVALIAGSDTTSTVLASTFYCLLRNADTYKRLQAEV DRFYPPGADSLAPDHLPEMHYLEAEALRLFPAVPSGSQRTPERGSGGKTIGPS YIPEGTQTRIHFWSVHRDPRNFSRPETFWPDRWLIADGLQKDEGVEFVHNPN AWIPFSLGPANCVGKNLALQEMRMVLVHLLHRFSFRFAGDYNPEQYEWDIE DRFVVAVGRLPVIVERR >CYP5035B1 Phanerochaete chrysosporium scaffold_247a 
MGTEYVLPLLRTNILKLPTNMTRNDALLAVGGAAV LCHLIFKKWEPTYIPAVVTLLLVVPLGLSALLVPHYGQLLAPLVALATYHTILL TSIALY RLSPWHPLAQYPGPLPAKLSKWWMVWQERDCKQHLYIKQLHDRYGDIVRIG PNELSIRNV DAVAPLMGTNGLPKGPS LRGQGLEPPITGLIAIRDPAEHARRRRPWTRAFSTA ALKEYEPILVKRISQLCEQLASQKGTLDLATWFSWFTYDFMGDMV RFGGGSEMLAHGDQDGIWTMFKEGGE GQMMYHHIPWLAHYAKRLPMSPALKKMRGFALGRTAERYKKGASTKDLFYYL SNEDGVEKTPPPAAQVISDGVLATIAASDTTSTTLSNAFWNILRHPHYY KRLQAEVDKFYPVGENAFDTKHHSKMTFLDAVL NETLRMYPVLPSGSQRAPFPGNGDRVVGP YYIPDGTQARIHFWSLQRDPRYFSHPDTFWPERWLIAEGLEPAPAGEKFVHNP NAFIPFS FGPSNCVGKNLAQMEMRMVFCYLLQNLDFELDKSWNPAERENATEDQFVLL MRSPVQVTVRRRV* >CYP5035B2 pc.54.66.1 MGSEVAMQLLRANISKPFPRLDQNDALAVVVGAALVVCHLIFKHLEPTYIPA VLFLLVLVPLYLSALLLPHFGPLLAPVIAFTTYHTSLLTSIGLYRISPWHPLAKY PGPLPAKLSKWWMVWKERDCKQHFYLEDLHKRYGDVVRIGPNELSICNVDA VLPLMGPDGLTKGPCSTIGQGLEQPIPSLVSIRDPAEHARRRRPWTRAFSTAAL KEYEPILAKRISELCEQLAQQKGSLNLATWISWFTYDVMGDLVFGGGNEMLA TGDQDGIWAMFEKSGEGQMVYHHVPWLAHYAKRLPMSPALKKMREFALGR ALERYKRGAVTKDLFYYLSNEDGSEKIPPPPAQVIGDGVLATVAASDTTSTTI ANTFWHILRYPHYYKRVQAEVDKYYPPGESAFDTKHHNKMTFLEAVIHETLR LYPVLPSGSQRSPVPGKGDRVVGPYYLPDGTQARVHTWSLHRDPRYFSRPDT FWPERWLIAEGLEPAPAGEPFVHNANAFIPFSFGPANCVGKNLAYLEMRMVF CHLLQNLDFELDKRWNPAERSRSAEDQFVLYMRCPLPVTVRRRV >CYP5035B3 PFF_271(pc.92.54.1) MGAGYVPQLPRSYALTSFTGTSQTHALTPVIGAALITHLNFKRWEPLNLPIVIFL PLVAPLGLSALFAPGPSYGSLVAPFITLFLYHAILLASIALYRISPWHDLYHYPGP LPAKPSKWWMVWKERHGKQHLYVKALHDRYGDVVRTGPNEISIRDVAAVV PLMGTKGLPKGTAPWGEPIVPSVVPLIGIRDSVEHARLRRSWARAFTAGALKG YEPVLTARITQLIAKLGSQTGEXIDLALVLSYFSYDFMGDMLYYGGGSELLAEG DNDGVWALGQMTCHHLPWLAKYKDLLPASPGLQKMRSVALQRTMARYKNG GLGKDLFYHLSNEDSAEKTSPPTEQLVSDGVLAVFTASDTIATVLSNVFWSILR FPRYYEQLQAEVDKFYPARADAFDTAHYGEMVWLDAITNEALRLYPIVPSGSQ RAPAPGDGARVVGPYVIPAGTNARVHTWSLQRDPRCFSRPDAFWPERWLAAA GLGDSEATCEGPEGAAGAAEDFVHDARAFEPYFVGPLDCIGRALAQLELRMVL CALLQRLAFAPARGDPLERERTLQDQFVVMMRGPVMVSVSWRA >CYP5035C1 ug.43.44.1 MSAVLNSLSPTESVLAVVTCALATHLVFNRFEPTELPVVAAALVGLPAVLSAL LVGHFGLLAGAALAFATFHATLAASIVLYRLSPFHPLARYPGPLPARITRWYW ARVACGGRQHLELKRLHDVYGDIVRIGPNELSFRDISVVQPMMGAQGMPKG 
PMWDGRTFKPPILPLVGMRDVADHTRRRRPWIRAFTPAALKEYEPVVAKRG AQLIEILAQKKHTDFVHWIHLFTFDIMSDALFGGNPGAEMMSHEDKDGIMHS MKTGFEAGQLFEHIPWLGYWMRHFPQLATATKQYRAMCFQRGMQRYQDGS SQKDLFYYLVNEDGAEKQTPDKATVIADSALAIIAGSDTTASVFSNMVYCLL KNPHAYKRLRAEVDEFYPPEENSLDPKHHSKMPYLEAVINETLRLYPVVPSGS QRAPELGTGGTLMGTHYIPENTSVRVHFWSVHRDPRNFSQPESFLPERWLAA EGLEAPPAGLSGAPQDADSAAARGTFVHNANAYMPFSFGPWNCVGKALALL ELRCVATHMMQRLDVRFADGWDPAEWDAAMEDKFVIKTGRLPVVVERRF >CYP5035D1 ug.50.50.1 MITVNRPSTQDSVIATVACAIIVYFVCKRWEPWRLAVVVPLTMLPPLVLSLPL AASVGFLNAVITTSSTFASALVAMLTFYRLSPLHPLSQFPGPLQCKISGFWMA WIVSRGKRHVYIQSLHEKYGDYVRIGPNEVSINDPTAIPQILGSYGWPKASGM SGRALHQDPLPLISLLDDAEHSRRRKSWNRAFSSAAVREYQPVVAQRATQLV QALQEQRSVVDLSQWMRLPSDIAFVNLHDFLYSFGGGTEMLREGDKAGLWQ LLKDGMSTAVPFEHVPWLSHYILHFPALIRSLTDLRALAFGRSKLRYERGSTT KDLFYYLVNEDHADAEMPPMKVVISDAVLAIVAGSDTTAVTLSNIFYYLISHP DTYRRLQEEVDKYYPPGEDALNPKHYVKMSYLDAVLNEAMRLYPALPSGSM RTPAKGSGGQVVGQRFIPEGTQVRVHPYTIQRDPRNFSYPNRFWAERWMIAS GNQSHHEKIAHNPDALIPFSTGPRNCVGKNLALLEMKMVTCHVTQRLSLCFA DGWDPSRWWDDLEDVFVSRMGQLPVIVRSR >CYP5035E1 pc.97.5.1 (genewise.97.3.1) MQAAHLVFNRWEPTNIAVVAALLLGVPCAAASLLFSGRGFVPRLALTAALY YACLGTSVVLYRLSPWHPLARYPGPWLLKTSKLWMVRRVKRGGQWRYIRE LHQRFGDVVRIGGPNELSFCDAAMVVPVLGTQGLPKGPGLIALRDPLEHQRR RRTWNRAFKPAALQEYLPLIQKRTAQLLDALSKCEEQDVVDLGRWIRFCKYA HMPWLAQFTKHIPRVAAKLKELRAAARSRAAARYKAGANRKDLFYYLVGD SNEDGGEKEQPTEDVILSDALLAIIAGSDTTSTILTSAVYCLLTHPDVHKRLVE EVDKFYPPGADWCNTEHHADMHYLNANETLRLFPVLRDGSLRAPWVGHGD RALGPSSFIPEGTQVRVHTYSLQRDPRCFSQPDTFWPERWLVAGGLQHAEPGF VHEPGAFLPFSRGPSDCVGKGLALQDMRIVLCALLQHLELAPPRNRPFDEWK AEVDRRFADSSAVLPVSVRVRRRV >CYP5036A1 PC-hn-1(pc.15.127.1) MDALDPRIAIPIVYFIYKKYEPSSPRSAFFLLLALPGVLAVALRVCERFNSYAT AVPTVYLAYWSLLSTFVVAYRISPFHPLARYPGPLLCKISKGWLAYVAGKGG KAHLYVQDLHMRYGEVVRIGPNELSITHQDFTRVVLGAKGLPRGPYYDSRQ HEAGMSLDGMRDQALHAIRRRPWARGMNTAAMKYYEELIRNTLSDLIAGLK QRTNRPVDISEWTNYFGFDFMGQMAFTRDYGMLKNGYDKEGLLDLIEHAM QDSAWISHITWSIPFLRYVPGASKYWDQMKAVGERAVADRVALGSNHRDLF HYLMDEDGHEAVRPTKALVAVDGQLAIIAGGDTTATTLSHIVYFLLRYPIYLD RLRKEIDETFPDGADSTLDFTKQTNMPFLNACINEALRLYPPVLAGLQRRVEP 
GTGGKMIGPYFVPEETQVSLFAYSIHRDPQHFSPLTNTFWPDRWLSQEKYTLP SGDVISADEVVTNRDVFIPFSQGPMVCAGKNVALTEMRSVMCALLQHFDLKI ADQSFLDSWEDKIEETFTTKRGTLPVILSLRA >CYP5036A2 ug.170.56.1 MFFPLDSWVAVPVTSAVTYFIYKTLEPSSPVSVLLLLGIAPGALSWTLYGGSG SVIAHIFTVYVSYWALLVTYTIAYRLSPFHPLARYPGPLLCRISKAWLAYIVAT SGKMHLYIQNLHMRYGDVVRIGPNEVSVTHRDFIGVIGPKGLPKGPYYETRA HKAGTSLNAIRDQALHSVRRRPWARGMNSAAMKYYEELVEETVGDLVASL KRRTTKAIDFSEWMTYFGFDFMGHMAFTHDYGLLKNGYDKEGLTPLIEHAM QDVAWVSHVPWSIDYLRHIPGVYKHWLRIKALGVQTVRKRIALGSTRRDLFY HLMDEDNRERVKPDINVVAIDGQLAIIAGGDTTATALSHLFYCLLRHPQYLER LRKEIDDAVPIACGSFKLDFSKLPAMPFLNAFINETLRLYPPVLSGLQRRVEAG TGGRFIGPHFIPEQTQVSFSAYTIHRDPRYFSPLTDTFWPDRWLVQEQYVLPSG GVIPAAEVVTDRDVFFP FSQGPTVCAGKTLALTEIRSVACALLQTFDISNADEASFDAWEDNLEEMFTTK RSTLPVFLSLRT >CYP5036A3 ug.128.46.1 MSAIFAASSTTYIIFKRFEYSKPVPALVLLIGVPMTLASVLRDHFAGLATAMCA TAAVHWVLLTLFVAAYRISPFHPLARYPGPLPCKLSKCWMAYLAGSGGKTH VYIDRLHRLYGDVVRIGPNELSVRHKDACTTVLGAKGLPRGPYYDTREHDNG VSLDGIRDPALHAVRRRPWARAMNSASTQYFEELIQHTVSDLTGGLKERAGE SIDLTEWMSFFFSYDYNMLKEGRDTQGLRRMIDQSLVDLRWISHIPWSIPYLK MIPGASKNWDDMKAAGDKVARHRVSLGSSRPDLFHHLRDEEGHEAVRPQLE VVSVDGALAIIAGADTAATTLAHFWMFMLRHPACFERLRKEVDATFARDDG PDFVKQARMPYLNACLNETLRLFPPVLAGLQRRVGRGTGGRMIGTHFIPEDT QVSLVAYTVHRNPDCFSPFPDTFWPDRWLTQETYTLPTGEVIPSSDVLTRRDA FMAFSQGPMACAGKNVALAEMRAAVCAVVQRFDLVLAYERALDEWEEVLQ ECFVSKLGKLPVQVVPRN >CYP5036B1 genewise.22.99.1 MALLQLAQDVFRRSHPASACYALYHKYLNKPVHPLYHFCLLLVVPAALLAL LQAIHRISVAQECGYALFYWLVMASATVVYRISPFHPLANYPGPLAAKISKLY LAYLTAKGRAHEDVRALHSKYGDVVRIGPNELSFNRSDAIQTIYADKTMPKG PYYVARTNLAGVVQLDGVRDFKEHARRRRPWNKAMNSAAIKSYEPIVSSTA SQLLGQLSKRIHNDVNISDWMSFYGFDFMGRMVFGREWGMLEEGRDVNDY WHTMDKCLTIVSWSSQIPWSVPIIRLMKPPPEVLKMQKISDDSAMARLTSEGS GVKDLYYYLLNEDDSSKSELTRDECISEGVLAIVAGSDTAATALTHLCYYLLT HPDSLQRLRQEIEEAYPTLGSELDDLSRQAEMPYLNACINETLRLLPPVLTGLQ RSVTAGSGAIIAGYFVPEGVDVSVHHYSVHRNLQDFSPIPDTFWPDRWLEQD AYVLPDGDVIGKGEVRTNRGAFMPFSVGPQQCAGKNLAMVELRAVACGLFR RFDLSLSERMNICDYEKGLRDAYTTVRGPLYVKLKPRKE >CYP5036C1 ug.36.48.1 = genewise2nd.119.1.1|whiterot1 = scaf154 47% to 5036B1, 45% to 5036A1, 46% to
5036A2, 45% to 5036A3 not in tree MVDRVDPRLILGSSSLVSVACYLIYKHSEPNNIPAHAALLLGVPALLVHQLGT HWSILQQGGAFVAYWALILAFTGLYRLSPIHPLARYPGPTLGKLSKIYLSYLSA RGDIYRVIKGWHDKYGDVVRIGPNELSFRHVDALQPIMGTKYTVKGPYYDTR TTPEQITQMDGIRDYSVHGQRRKPWLRAMSSAGLKGFEPIVKMKALELVEEL SKKVGEIIDMSEWMNLFGFDFMGHLAFGREFGLLKSGNDHDDMIRTVEDGV YGAGVISHIPWIAFLVHFPPAMKGLRAMQQMAATFARERTQKGSTTKDIYYF LTEDEGAAQSGATHDEVIADGMLALIAGSDTTSIALSHVCYFLLRHPACAARL RAEVDRAFPPGEDVLDFARHADMPYLNACINEALRLLPPGLGGLQRMVRRG TGGAMIGPHFVPEDTKLSVHLFSLMRDAREFAPLPDAFWPERWLAQDTYVLP TGDAVSKEHVTTNRAAFIPFSVGPQNCAGKALALVELRAVTCALVSKFELHK PKDYDLDQWEGDLLDLYISIRGKLPVILQARQGR >CYP5037A1 PC-ln-1(gx.1.22.1) PPGPPGLPFVGNAYQIPHDKQWLRFDEWIRRYGDLVHISVMGQPTVIIGSAQT ASELLDARGSIYSDRPQAVMAGELVGWDQGLGYAPGPHSPRFREFRRLFQQF MGPRAAQDSSMLAAQEKSATRLLSRLLSTPEEFITHVRQVTGALILYLTYGYE VDEDGFKDPLVNIAEEAMLGFARASDPGAYLVDTMPWLKYIPEWFPGASFK QDVKAMRQARERLYDVPYNFVQKAMAEGPVPRSFVSTYVEEKATPAFADEE LIKAAAASLYSGGADTTPSSLASFILAMTLHPDVQRRAQVELDSVIGESWQRL PTFADRPNLPYIDAIVLEVLRWHPAVPLGLAHRLSQDDVYRGYYFSQGTVFW ANIWTMLHDEIIFPDPSRFMPERYLDEHGRLKSMSRFEDPAVIGFGFGRRICPG MHFAHNSIFIAIARMLYVFNFTKAVDKNGNEITPEVEYSGFISHPSPFVCSISPR SIAAAELVVQ >CYP5037B2 ug.1.25.1 MIRPLTLLDIALATLAVVLLKTIIARSKQRARYPPGPKGLPVIGNVLQMPKDRE WLTFAQWGEQFGNIVYLSLLGQPMIILNSAKDAVALLDKRSSIYSDRPILYMG GELIGWKYILGLTPYGDRFREYRRLMAKFIGGKTQVERHFPVMEQEATSFLK RILRRPDDLGANIRTHAGAIILKLAYGYTIREDEDPFVTLADRAMAQFTEATTP GAFLVDVFPLLRHMPAWFPGASFKRTAQEWSDTLNSMADVPHAFVKEQMA KDTEVPSFTSELLRDEKLQEGQEFNIKWSAASLYAGGADTTVSSIHTFFLTML LFPHVQKRAQAEIDSVVGTDRLPTFEDRAKLPYVEGVLKEVLRWHPIGPLGA FLVLFFSLGLPHRLAQDDSYEGHLFPKGAIVIANIWCVSISTSWQCLYSPCCRK CLHDPDVYPNPSDFDPTRHLSENGRSPQPDPRDYCFGFGRRSYPGLHLADTSI WITCATVLAAFNIENVVENGRVIDIVPEYTSGTISHPKPFRCSIKPRSTRAEALIF SD >CYP5037B3 pc.1.248.1 MPSALSLLDFAFAALGLIIVKAFLSRTRRQGPYPPGPKGLTIVGNALEMPTSRE WLTFSEWGGRYGDIIYLSLLGQPMVILNSAKHAIALLDKRSNIYSDRPVLVMG GEMIGWKYTLALTPYGQRFREYRRFIAKLIGGPTQMQTHLPLEEHETRRFLKR LLNEPERVADHIRKTAGCIILKLSHGYDVREGHDPIVDLVDTATEQFSLATSPG AFLVDVFPLLRYVPAWVPGARFQKTAREWRKVLERMADEPHDFVKQRMAE 
NTNVPNYTSELLQNERLDGDKEFNIKWSAASLYSGGADTTVSAIYSFFLAMTL FPHVAKRAQMEVDAVVGSDRLPTCEDRPNLPYVEALVKEVFRWNPVAPLGL PHRLIEDDIYEGYFIPKGSFVIPNIWSYSTSHILHDPNHYPNPFEFDPTRFLSDEG RTPQPDPRDYCFGFGRRICPGLHLADVSVFLSCAMVLATFDISKAVENGKVIE PEVEYTSGTISHPKPFKCTIKPRSTKAEALILSADD >CYP5037B4 ug.1.26.1 MSTPLTYLVLLSAILTVVLIRTAIARRKRWARLPPGPKGLPIVGNVLQMPKSQ EWLTFSRWAEQYGDIVYLNILGQPLIILNSAEDAVALLDKGGSIYANRPILAM GGELVGWNRTLALTQYGERFREYRRLIARFIGGKAQMARHLPLVERETRRLL QRILNNPEDLAGNIRKTAGAIILTLSHGYRIREDDDPVVAHVGRALEQFTEAST PGAFLVDVFPILRHVPAWLPGASFKATAKRWGETLEQMADVPHNYVKEQMA SNKDIPNFTSELLRDEKLGDIDSKEFNIKWAAASMYSGLGPQTVSSIHSFVLA MVLHPHVQRRAQAEIDAIIGPERLPTFEDRAALPYVEALFKEVLRWNPVGPLG LPHRLSQDDVYKGYLLPKGSIIIANIWSFLRDHNLYPNPSDFDPTRHLPKNTEA ASQPDPRNYCFGFGPDVLAGQHLADASVWLACATMLATFDIENLVASDGTVI GVEPEYTSGTVSHPKPFKCSIKPRSALANALINAGMPE >CYP5037C1 pc.2.111.1 (genewise2nd.2.36.1) MTSTNILLISHGHRVKDSNDRFLKLSDAVTDEFSEAVAPGAFLVDQFPLLRHL PSWVPGTAWRKTAEKYRRHVADAVAEPFAFVKQQMAAGTAIPSFVSRNLDD GAAPSPDHEHTVKYAAMALGCADAADAAWQTASALTSLFLAMTLYPEVQR RAQAELDGVVGTDRLPTFEDRDRLPYITAICAEVLRWMPVGPLGLPHRLTED DVYEGYALPKGTIFFVNNWKLLHDPDTYRDPMAFMPERFLGAAPELDPSKIA FGYGRRICPGILVAEATIFITVAATLAAFSIRPAQNGGAPSLPPVRQTSGIISHPA PFQCDVVPRSKKAEALVVAAVENR >CYP5037-un1 pseudogene genewise.25.75.1 scaffold_11 (1449124 bp) : 785586:787348 (1763 bp) (-) strand 32% TO 5037B3, C-term half = 40% to 5037B2 gray X = frameshift MLLRIWSRSQSFHPNVSALLFVTTRACHLMSC RLPLPPGPRARWYGTIGMPTKSQWLNCHGRKCTVRYGSHYDAPGLLMYFHIVENPMVV LDTAGTVNDLFEKRGTSYSSRPVTTMVNELYAYAQYSLGGYSLAH (?) 
WRKHRHLFHQHFNTSAMHVSRPVVLREAHTFLHNVSRTPVDDWSIVFGGH (2) ADAIVTMLSYGHQIAPEGEMYVDTRTRLSQ (0) AYLLLLLAKTRTSL IYDVVKHVPAWFPGAAFKKQALQWREANRMMLNVPLEKVQ 67 amino acid deletion here QIDRIFASDKLPTF ADREDLX YVDCIVWEYLRWNP (1) VTPLGLPCQVTEDDTYCGX YIPKGATIVSNTWY (0) MYPYLLLFDPDRFADASRNASLGIHELPNAAFER (2) MCPGRVLAFETI WITIATTLAGFHLSEPRDEHNEVIQLDTPNTPKLLS (2) HPKPHQCAVSPRSERALFLVVESLDG* >CYP5136A1 PFF_311a MSSLLVLVAISLALSQLIRFYRWLFHHSISYLRGPVADSFILGNVREFTYQESV GDLDFRYMNEYGTAWRMKSILGSDVLMICDPKALQHVLHKSGYHYPKNTEA RIGSFNVTGRSILWAPNGDIHSRHRKIMNPAFTAQQLRSFLPLFRRGSNKMCQ LWKDEVLAQAPTGMTIAVNQWLARTTLDVIGEAAFDFSFGALDDADNEVSK AYHNMLFADSLLYPSAWSTIFRGLWRFIPDQLLSYVRYLPTREYTRFRYTLNII NKVSKSLIDQKSEDLLSGDKSSKDVMSVLVRANSSENPRSQLSEEEMVSQMA TLTLAGHETTANTITWLLYELAKHPEYQQKMREEIAVKRAEINARGDADFTM DDLESMQYLHAALKETLRYHPIVYHLAREASKDDVIPLAYPVTTIKGETVSEI PIAAGQIIMPNIAAYNRLPQVWGDDAHEWNPLRFIDDSPEVQVRLGMFGNLM SFFAGVRGCIGWRFSLIEMQAIVADLVENFQFSIPPEKPEIIRVPAGIMGPMVK GKMHEGLQMPLHVTPL >CYP5136A2 pc.142.11.1 MAAAALLIICWLVVNLRRLLTHNSIRHLRGPPSASTLFGNVTDTLYQASVGDV EFRWLKEYGGAWRLRGLLGANILALADPKALQHVLQKSGYNYPKTRQLSVT LFNLTGRSILWAPTGEIHARHRKVMNPAFSVPQLRSFIPLFRQSAKKLTQIWKDQV NAGHPDGVTLPVDRWLARATLDIIGEAAFDFDFGALDNTENEVSKAYHRMF ADSQLYPSVWNLLFQATWSLLPEPLLYYIRYLPTREYKTYRSTLSVMDKIAAQ LIEERTREFGAGDPDKSRKDVMSVLVRANMSENPSTRLSDEEMRSQMFAMTL AGHE TTANTVTWMLWELAKHPDIQEQLRQEIAEKRMEVTANGSYEFALDDLESMP LLQAVIKETLRYHPISSFLWRVAAKDDVIPLEKPIVTTTGETITEIPVAAGQVIM PSLCSYNRLAHVWGEDAHDWNPMRFLQGDTEKQTKVGMLSNLITFSAGVRS CIGWRFSVLEMQAIVVELVENFRFSLPDNKPEIIRAPTMTMGPMVKGKLHEGF QMPLRVVPV >CYP5136A3 pc.16.161.1 MAVIDYTLHASSPLVLLACTVCVAVLAFRWYSSSTHGSIAHIRGPPVKNPILG NIRDFSYQENVGDLDFAYMKEYGTAWRLKSSLGKSVLMVADPKALQHIFHK SGYLYPKTTPSTVRSFLVTGKSILWAPDGNTHSRHRKIMNPAFSAPQLRSFLTL FRKSSSKLCQLWRDEISPEGSTVLVNKWLARTTLDVIGEAAFDFDFGAMQDN QNELSVAYDNMFTDATLHTSPWNAIFEALWDYIPDGILKQVQHIPTREYARF KQTLGVFAKYSKRLIAQKSADLVSDTHSKDVMSVLVRANAAEDAGRKLNDE EMVSQMSALTLAGHETTANTISWLLYELAKHPDFQEKMHAEIVAKRAEIVAR GDEDFTMEDLESLEYLQAAIKETLRYHPIAFHLNRMASQDDVLPLAYPVMTT 
AGEKVTEIPVRKGQAIMPNLAAYNRIPEIWGADAHEWNPMRYIENRTDAQVR VGMYANLMTFSAGVRGCIGWRFSLIEMQAIISDLVENFRFGLPKDRPEVLRVP AAVMAPMIKGRMEEGAKLPLHVTVY >CYP5136A4 pc.16.153.1 SGACVLCLAWLAYRWYRWTTRLNISYIRGPPVKSWILGGNVRDFAFQENVG DLDFKYVQEYGLVWRMQQPLGAQVLMVADPKGDIHARHRKAMNPAFNNA QLRSYYPCFRRTSSKVCQLWKDQILSQGPNGATIRVDRWMARAALDIIGEAA FDFDFGALDDSANELSAAYHNMLSADSTLRPSAAQAVFQGLWTHAPLRVLE RVRHLPLRDIARFQHAMRVFNTYAARLMARGAAGAAHGRDVMSVLGTAHA NASADPRTRLSAEEVRAQMCALTFAGHETTANTTTWLLWELARHPPAHQDS VRADTVRRRAHVAARGDADFGVEDLDALPCLEAAIRETLRCHCIVFHLNRVA SQDDVIPLSRPLTTATGKTVTEIPVAAGQVVMPNIAVYNRTRTKWDPTRFLDG RVDNPEVRLGVYGNLRTFAGGVRGCIGRHRMIEMQAIVADLIGHFRFSIPDDK PEIVRAPSMLMAPMIKGKEHEGSQMPLHV >CYP5136A5 pc.14.209.1 MFHGYLSATFQAQRRPGNIKDFTYQQNVGDLDFQWVKQFGRVWRMQSPFG TDILALADPKAMQHCFHKADDQYNKRVESTVGSRMMMGKGLVWASGTTHE RQRKIMSPAFTTAQIRSFLPFFRAGAAKKWRDELFNHSTDGAAVPVNKWFSR ATLDILGETAFDFNFGAVDDKDNEVTLAFHTMLFANSCLRPPKWDLLFKRIW YFLPNPLLELVQYVPTKEQNRFRRCRLVVEKVSQQLIQEKREALLAEAKSSRD IFSVLVRANVSENPNSRLSDEELIAQMGTLVLAGHVTTATTLSWMLYELARR QDYQDKMREEIVAARARLQERGQQDFSMEDLENMHYVSSCLKETLRFHPPV YHLFRQANTDDVIPLEQPVRTTSGKYVTEIPVAAGQQVLFSVCAYQRLPEVW GEDAGIWNPMRFIDGNVDKQSKLGLYSNLMTFSAGSRGCLGWRFTIVETLAII VELLEHFKFEPTEDTAKVIRVPTGIMSAFTAGKEREGPQMLLKVVPIL >CYP5137A1 pc.5.122.1 (SEQ ON OPP STRAND FROM THIS MODEL #) MNNLTAALILVALALWFACRRFTRTTLRDIPGPKPVSFWLGNLEQYFLGQAG EGDFHLQERYGRIARLHGSIGGEYLWISDPNALRYIFQTSGYRYAKQPERRAL SRLHSGHGLVWADGEVHKRQRKVMLPAFGAPESKALLPHFARAAEAVSVK WKDILTTAPSLSKELNVSTWLSRATMDAIGEAAFDYHFGALENTDTDIVRAY NNLMPIVFGAPTADAIFKRDALRIFRSSRIVEWIYDRQRNPAVEKARECEELTL KIARELVENKAEALEQGKGSKDIFSLLVKANMTEDAKSRLSEEEMYAEMRTI LFAGHETTSTTISWVLLELARHLPVQERLREEILAHKRGGELSATDLDGMPFL QAVVREALRLHPVLNQTFRQAEQNDVLPLAHPLTDRTGTVLTALPISKGTRVI LSIAAYNRDTELWGSDAHAFDPDRWLDGRVKKVQTLGMYGNLLTFAAGVR GCIGWRFAVYEIQTFLVELLANFEFRPTEDLKRLRREPCGVMVPTLEGDRGTV QLPLRVSLLDHKI >CYP5137A2 PFF_88 NOT IN TREE MHDIFPLAVLLGALLWIVRRILSRSSIRDICGPEPESFWLGNLKQFFMRQAGEG DFELQERYGRIARLHGSIGGEYLWVADPKALQHIYQASGYNYAKQPERRALS 
RLHSGHGLVWAEGEVHRRQRKIMLPAFGAPESKALLPHFIHIAESLSMRWKDI LLASRDFAKELDVTEWLSRATMDAIGEAAFDCQFGALDNGGSEVLRAYNDL LPMVLGVPTTDGIWKRDAMRIFNSSAIVEWIHDRQTNDVLQRARECEQMVM KVAKELVSSKAEALVQGKGSRAYFSLLVKANAAEDAASRLSDEEMYAEMRT SSLAGHETTAMALSWALLELAQHPEVQSRLREEVRGCKRGEELSAAVLDSMP YLQAVLREVLRVHPPAIHNFRQAVRDDVLPLAHPITTKSGSVLTELPIQKGTR LILSIAAYNRDPDLWGSDPHMFDPDRWLDGRVKKGQVVGMYGNLLSFSAGV RGCIGWRFAIYEMQAFLVELVSNFEFGPTEDLKRLRREPCGVVAPMLEGEQG VQLPLRVSLANYDV >CYP5033A1 Ustilago maydis 36% to CYP5034A1 GenEMBL XM_399595.1 37% to white rot Scaffold_7 C-term MAISTSSRLVIHQDVLSWLQHRPFASAFTLLVVYITYKLAIKPILFPSPYRHLPRPERASYILGQRI VEANGLTYIDASTNQRVKVSGPGEVCKHYARTLDTSVFVFPEPFGGETLFISDPFALNAILADVDKF QSDLLRTTIIEFIVGKGIVARFGDAHRKQRKLMAPAFTPAHIKGLTPIFAKYAQLMCHKIALADDES VDFAEYLDCTMLDIIGEAGFGYRCSALERGRGGSELSSAFNSVNQAAIDFGPARAIHLGLSAMLYPR ASIWPLSEANRRIAKVNRVMDRITMQIVREAKSRVEKEGEDLGDKKDLLSLLIKSNLDARIGERMTD KEISGQIQTFMFAGYETSSVTTSWTLYFLARHPEVQNKLRNILTATLSERKGIPLEELDVSTLEYDD VWCQDLEYFDWILAETLRLCPPLSGNDRQAMQDSVLPLMTPVKMTNGENVSQLMVKKGSRLTIGIKT VNCDRKLFGDDADEFRPERFAELPQRHAEAKLPPYATYSFFGGPKSCIGSKFALTEMKVIIIAVLSR FQLSPEP GVTIKQHQALIVRPRVETSTGGPAAGMPLRIKRLPHQVSV >CYP5138A1 pc.65.27.1 MPLLSAVPAAALPLLGAALYVLWTFLALLVRQARSPLRHLRGPPSPSFLVGNL REMHDQENTALFARWEHRYGSTFVYHGFLGGARLLTTDPVAVAHILAHGYD FPKPEFIRDALASMAAGHEGLLVVEGEDHRRQVRASPAFATPHIKSLSPIIWSK ATQLRDVWIDLASSPSLTPAATPPGTKVDVLAWLARATLDVIGEAGFGYAFN SVRAAACPGDAAEDELARAFAVIFSTARKFRLITVLQVWFPFLRRFVSIPPRCF LALPLKSSLSSDPSQQLSTNAMLCQIATFLAAGHETSASALSWALYALARAPA CQHTLRRELRALTLPADPSAADLQAVLALPYLDAVVRETLRVHAPVTSTMRV AAHDAAVPVGTPFRDAHGAQHAAIRLRAGDIVTLPLQAMNKWGADAACFRP ERWLAHGDAPREPRGLWGGVMTFGTGVVANGNRSCIGYRFAVNDVVTRPC VKSEPHLGNQMPLRLRRVAVEETVGDSSGDGAPRTVS >CYP5139A1 gx.38.22.1 MGYPLAVYAVGALVALIVYSVGPTVWHVLTSPLRHLPGPPNDSLLWGNMAA IQNEEISVPQARWVKQYGHTISYRGVFGMWRLWTVDTRALNHILTHHLIYQR PLPSRYQLSRLVGPGVLVTEEERHKHQRRVMNPAFGPAQVRELTEIFTEKANE MRDVWYNEITKAGGASAQVDALSWLSRATLDIIGRAGFGYDFEALTGASNEL NQAFSTLFARPIARHRFFARIGMQLIEQRKAAILAEKGKDVERKDLTGRDLLT 
LLIRANMATDIPEDQRLSDEEVLAQVPTFIVAGHETTSTATTWALFSLAQMPEI QRKLRNEMLTIDTDTPSMDQLNSLPYLDAVIRETLRFHSPVPVTTREAMADD VIPLGTPTVDRYGRTIDHINIKKGDLVFVPILAINRSKEIWGEDVDDRPERFEN VPEAASTVPGVWGNVLSFLGGPRACIGYRFSLVDIVTRPVMTGPDGKTRGAL PLIIRPYRP >CYP5140A1 pc.96.21.1 MNASSIDFFPRNLATSPVFSAKPFLLALSLISTYLVSVAFYRLFFSPLASIPGPW YAAVSDLWITTHVLRMQQCRVVQDLFDTYGPIVRIGPNKVAFCDAGTMRSV YCVHKFDKSAYYKSLLTNNNDHAMTTLPHAEHAIRKKTYAPHYTPANLALF QPELNDLALKLTDILSIRSSSVDVLDLFRHLMVDVIACTVFGSRSGSLDNWNK GVRDPLSIAVYDFPKRGIMVRLCLSSPVTASDNHIRSGAQCLLGPGSFIAGVDT SSTSLSYMFWELSRRRDVMQRLQAEIDEIMPDPRVIPDATVLNRSEYLNAFVK ECEYACHPCPAEILRDPIALHFDMMGYALPPGTIVATQAWSMHRDEDVFPSA ETFLPERWLVDPHADREVEEERLARMHLHLVPFGVGTRQCGGQNLAHLMIRI VVAVVVRNCEVRADVRETNERSMSMRDAFVSPLLWLLLGSERS >CYP5141A1 pc.181.9.1 MISDTFALAISSGLSLFLCLKAFIDYRAGLRSINHSYLPGFRALISSFGILGLFFK EPKRGLWGGRRRFWLRKHLDFEEAGVDIISHIAFLPSVSTYLLLADAAAIKEV TGHRARFPKPTYKTLRIFGGNVLASEGEEWKRHRKVVGPAFSEHNNRLVWN ETVKIVNDLFANVWGSQSEVYVDNVVQSVTLPMALYVISIAGFGKRALWQA DGNLPPGHKLSFQDALHILGTDLWIKAATPTLLMNWAPTTRIANVKLAFDEV KQYMLELIQERRNSEKRDERYDLFSSLLDANDLNEDGNGNVTLTNDELLGNI FIFMLAGHETTAHTLAFTFGLLALHPDYQETVYQQIKSIVPDNRPPMYEEMNS LTECMAYETLRLFPPTATIPKIAAEDTYLVTIDRAGNRVVVPVPCGTALHLNVI ALHHNPRYWDNPSAFKPERFRGDWPRDAFIPFSTGSRSCIGRRFFETESIAILT MILSRYKIELRNDPRFADETYEERWQRVLRVKDGLTPA >CYP5141A2 gx.37.18.1 MFSNTFALAITSGLLLSCLKAYMDYRAALRSINYHPGSCALIPSFGMLGLLFK EPRRGLWGGWRRFWRRKYLDFQEAGVDIISHIAFVPSVTTYLVLADAAAIKE VTGHRARFPKPSYEFFRIFGGNIIASEGDEWKRHRKIAAPAFSEHNNRLVWNE TVKIVCGFFENVWGSQAEVYVDDVVQSLTLPMALHVISIAGFGKQTVWRAD GTLPPKHKLSFQDALHVVSTDLWIKFVMPTMLLDLAPTKRIAKVKLAFEEVE QYMLELIQERRDAEKRDERHDLFSNLLDANDSDENGDGSVKLTDEELLGNIFI FMLAGHETTAHTLAFTFGLLALHSDYQEKVHQQIKSIMPDNRLPTYEEMHLF TECTAVFYETLRLFPPVTTIPKISAEDTSLVTTDRAGNRVVVPVPCGTSLHLSV VALHYNPRYWDDPYAFKPERFHGDWPREAFIPFSAGARSCLGRRFFETEGIAI LTMILSRYKIELKDDPRYAHETYEERWQRVLDVKDGLTTT >CYP5141A3 PFF_77b MNSVLVILLSTILLLCLKTYVDLRTALRAVNYHPGFKSFISCFGVFGFAFKEPR RGLIGGSLRFWHRKHLDFDEAGVDVIHHVSFFPRVSTCLILADPAVIKEVTSH RALFPKPLYHELRLWGGNIIASEGDEWKRHRKVGAPAFSEPNNRLVWNETVK 
IMVDLFDNVWGSQDTIIVDHVVDAFTLPVALFVISVAGFGKNASWQSDLLPPS GHKLSFKDAIHVVSVDMFIQVVTPTFLWKLAPTKRIADVKLGFEELEKYMLE MVEERRNAPKKEERYDLFSSLLDASDSDADGGARLTDRELLGNIFIFLLAGHE TTAHSLAFTFGLLAMHQDYQEKLYQHVKSVIPDGRLPTYEEMNKLTECMAV FYETLRLFPPVVGVPKVVAENTTLVATDFTGKRRAIPVAAGSDIHISILALHYN PRCWDEPHAFKPERFHGNWPRDAFLPFMAGPRACLGRRFFETEGIAILTMLVS RYKIELKDEPAFAHETYEERWDRLFTVKQGITLA >CYP5141A4 pc.181.12.1 MGTLAWVVLSFCLFYCVQKYLEFRAVVRSIHDHPGFRTLLPPYGIFGFLFKRPI PGITRGGMSQWRGKYRDFEAFGMDIISATSVIPTARNAFLVADPAAIKEITSSR TRFPKPVAQYRVLTFFGANIVTAEGDEWKRFRKITAPAFSERNNRLVWDETV KIMLDLFENEWAGKDTVVVDHAVEVTLPWIALFVIGVAGFGRKMTWQEDSK LPPGHQLSFKEALHYVSTAVFVKLATPAWLLTWAPTERMRRTNLAFKELEQY MLEMIQTRRNSEKKEERYDLFSNLLDASEDGSDGHARLADEELLGNIFIFLLA GHETTAHTLAFTFGLLALYPEQQDKLYKHIKHVIPDGRIPAYEEMNLLHESIA VFYETLRLFPPVTGIPKVAAEDTTLVTTDHSGNKVVVPVTKGTGISLHVPGLH YNPRYWDDPYEFKPERFHGDWPRDAFLPFSSGARSCLGRRFFETEGIAILTML VSRYKIEVKEEPEFAGETFEQRKERILAARGGLTLTYVCSPPHLRNLLNLPLR >CYP5141B1 pc.37.84.1 MQQHLLFAAGLICLFLVKRCIEYRRAIRAIHNYPGVRAVLSNSSGLGYLCKRSI PGLAVGGARLWVKRYSDFCRYGADIVSCVAVLPRTEILLFVADPAAIKEISSD KTRFSKPTELYELVNIFGRNIVTTEGDEWKRHRKIVAPAFSERNCELVWEETL HVMIGLFNDVWGSDSIITLDNAFDITMPISLFVVAASAFGRRIPWTEGGLAPPG HHMSFKEALHIVSTGTVIKAVLPKWLLNLGPSQYIREVRDAFREMEAYMREM VTENMLDDTKSRRDLFSSLVHAGQDSPGQEALLTDAELLGNVFMFLLAGHET AASTLCFALGLLALHKDEQDKLYDHIRFTLGEKDVPAYSDLTSLSYCSAVLYE TLRLFPPVIGIPKKATEDTVLSTVDRDGNHIAVPVPVGSSVAIHVPGVHYNPRY WKDPAAFRPSRFFGNWPRDAFLPFGAGSRACIGRRFFETEAITALTMLVVRYE ISVTDEPQFRDETAEQRRERVLSATQELTLT >CYP5141C1 pc.81.19.1 MASRLLVLLAALVLFALRAFARFRRAVHAVSYVSSALRSVNHPGYRTLLNTL GPIENFFPRIPGVAPGAFHMWKRKHRDFEEHGWDVITY VAAFIGSSTNFYVADADVIK () EITTHRSRFPKPIEQYKVLTFFGGNIVASEGEHWKRYRKIAAPAFSE () RNNKLVWDETRLIMQDLFTNVWGERAEIYVDHAVDITLP () IALFVIGVAGFGRRIPWQDEDVVPAGHTMTFK () TALHTVSENVFTRLLIPDWLLRAAPTARLARIRDAFAELEQYMREMIRARRER PAREERHDLFSSLLDASKDADVRLQDSELIGNMFIFMLAGHETTAHTLCYML AMLAMHPEVQDKMYESIRGVTQNGRLPEYEDMRSLSYCEAVLYETLRMFPP VNSIPKSVAEDTAITITNADGERTTVPMPKGSSISIHTPGLHYNPRYWPDPHTF RPERFLAADWPRDAFLPFSAGPRACLGRRRFSETESVAAAAMLVLRYRIAVA 
DEPRFAGEGARARFERVTASRPGVTMTCVFCAPLWVVVGTDELCCRPTRVPL VFRRR >CYP5141C2P pseudogene FRAGMENT pc.167.26.1 91% TO 5141C1 genewise.20.89.1 [whiterot1:25284] VAAFIGSNTNFYVADADIIK (0) EITTHRSRLPKPIEQYKVLTFFGGNIVASVGEHWKRYRKIAAPTFSE (0) RNNKLVWDETRLTMQDLFTNVWGRHAKIHVDHAVDITLP (0) VAGFGRRIPWQDEDVVPAGHTMTFK () TALHTVSENVFTRLLILD >CYP5141C3P pseudogene FRAGMENT pc.67.69.1 84% TO 5141C1 VASFIGSNANFYVADADVIM (0) GNTTHGSRFPKSIEQYKILTFFGGNIVASEGEHWKRSRKIAAPAFSE (0) RNNKLVWDETRLILQDLFTHVWGKRAEIHVDYAVDITLL (0) IALFVVGVTCFGRRIPWQNEDVVPAGHTMT*LK (0) TVLHTVSENVFMRLLIPD >CYP5141D1 pc.81.21.1 MLLILWALLAAVVYHAAARLVRLRRLLVKIRFHPGQRAATSIYGAATFLFPW RIPNLTPGANLLFDEKHALLARHGLDVVTSVSTHPMRAVFVVADPAVLRDM AAARSRYPKPVELYGSLSLYGPNIVASENDAWKRYRRICSPSFSERNNKLVW EETVRVVTELFDTWEGRQEIDMEDALTMTLSITLFVISSAGFGKPITWKGGDE RPEGYAMSFKDVIYHMSTGVFIKIATPQWLLNLGLTEKMRNTNVAFKELGM YMSDMIRERRESQQREDRGDLFNGLLDAGEEDEKLKLTDEELMGNIFIFLIAG HETTGHTLCYALALLALYPDEQEKLYQHIRTLCPAGELPVYDDLRNYTYALA VLYETLRMFPSVVGIPKVASEDTCVQTMNDAGQLVEVFIPEGSDIVFDTPGLH YNPKYWTDPYTFSPSRFMAPDWPRDAFLPFSGGPRACLGRRFARFAEIESIAV LVLFVSQYTIHLKEDPKYAGETEQQRRERVLKSVPGLTLT >CYP5142A1 ug.79.41.1 MDDVNLFIRARTLLDSVLVLILTSIGYAVANAVYNVYFHPLSKFPGPRMAAAS RWWKTYVEVYRDESIVDRLFHLHEKYGNVVRIAPDELHFSDPAVYNAIYSPK SRWNKDPLMYAPFGFGSHRSMFSTVEYQPAKKRRDLAAPHFSRKSVLNLQG VIQAGVSNLCDAMAQRAAEGKPTDIYSAFRCLNFDNVTSYCFGWSLHMVRS PDFSAEPVQNMQDMHSSYQVWKHFLWLRTPMRLLLSVLGKRPMPYFRVIME QVDGYLERPEELDNAPHSLIFHSLMDPAQSTKLDKQSVVEEANLLIIAGTDTIS NASALGTLFLLSDGGYMRDKLQAELKAIWPRLDDKPSLEVLESSAPYLKAAC KESLRLSHGVMSPLLRVVPSQGATLGGHFVSGGTKVGICNAFVHLNPALFPDP HVFRPERWPEPGAESLDTWLVAFSKGPRSCIGINLGWCELYMNLANLFRRFD LKLDHRVQVLFLGGS >CYP5142A2 pc.79.57.1 MNVWARVSEGWTALEVILVALLTSIGYVVTTALYNIYFHPLSKFPGPKLAASS WLWKAYVEVIKGESILDRLSKLHEEYGPVVRIAPDELHFNDPAVYNEIYTARS RWNKDDVMYAPFGKDTSIFTTREFREAKKRRDLSAPHFSRKTVLSLQGLIQE GIDEFCEVITKRDADSKTTDIFRAFRCLDFDNVSSFCFGWSEHAIQAPDFNSAA VEELQHSNKDFQFWKHFLRLPLPVLLLASRIKNQIATYIDKPEELDKTPHPTVF HVLMDSSHGTRLSATAMAEEASLFLIAGTDTTSNASALGTIFALSDNGYMRN 
KLKEELKSVWPRLEDKPSLEVLESLPYLKAVCKESLRLSHGAMSPLMRVVPQ QGAVLGGHFVPGGTKVGMAHTFVHFNPTLFPEPHTFRPERWLEPGAEALDT WNVAFSKGPRSCLGIKSLAWCELYMNIAHIFRRPVRPYDLWHRDCFLPYLDG VDLLVYATPSTD >CYP5142A3 pc.24.27.1 MDVVRQALEGRTTKEYAGLALLAFAAYVVANIIYNLYFHPLAKFPGPRAAA ASRWWKAYVEVYKGESIVDRLFELHAEYGDVVRITPDELHFSDPKVYNEIYN TRSRWDKDGEMYAPFGGNSTMFTALRYHDAKKRRDLTASLFSRKSVLSLQGSIQEGL DELCDIISARSAAGKTTDLFRAFRCLNLDNVTSFCFGWSLHTVRAPDFRAPPL EEVQNSHGGYQFWKHLMLFRAVLLPKLKEQVDALVARPDELEAAPHPIIFHS LIDPAHGAKLSAQELMEEANMFIVAGIDTTSNATGAGVIGVLSNPSTYDKLKT ELRTAWPRLDEKPTVEVFESLPYLKAVCKEALRLSHGITSPMLRIVPPQGATL AERFVPGGTQVGVSHLFVHLNPTLFPDPHAFRPERWLEPGAESLDTWLVAFS KGPRSCLGINLGWCELYLNIANLFRRFDLKLEGRAKAFLDGPRARASGDWKD CFLPCFEGPDMLIHTTPVAD >CYP5142B1 ug.20.42.1 MLNLSLDSSSVLSLVWTASPWLLLSWILYTVLMAVYNLHFHPLAKFPGPKMA AASEWWLAYVEVIKQESLSKKLWELHEQYGANATQLHFSKPAAYNEIYNVK NRWDRDMKLYHIFADEVSTLTIPDYARAKKRRDLTTFLFLARILLRQLDTVCE NIDKHIKEGKPVSIFKAFRCAAADVICTMCFARSMNATSEPGFNAQVVTAIHA AFPVIMVFKHFPLLQTLSRMVPPLLLSSLRPELNGLMKMRKMLTDQVKEVKA HPEILKESQQVTIYHELLKDPKNIPSDTSLRDEAVLYVTAGMDTSSDTLTLATI NVLSRPDVHARLMHELVEAWPHLEDAPPRYEQLEKLPYLTAVLKESLRLSHG VVQPMTRVVPREGAYISGHFIPGGSIVGMSSIFVHWNEEIFADARAFKPERWL DPEADLDPWLVAFSKGPRSCLGVNLGWCELYMSIAAIFRRYELKLNGIG >CYP5142C1 pc.20.56.1 MLRLLVDNGLVSALARYGPAMLISIIVWTLGRVVYNLYFHPLAKYPGPRMAA ATEWWQAWLEIFKAESLSLTLLELHAKHGGDIVRIGPNELHFSRPSAYHEIYT SKNKWAKNPAFYRYIVSPTESTFSTCEYDKAKKRRDITLPIFSRKSILGMQHLV QECIDSMCENIDKHISEKKSVNILRAFRCCALDAVTSLCFARNTRATSEPEFRA PIEVAMDFSLPLTPVLKHFPMVQVVMSWLPPDVLLWADARLGGFVQLRKML DAQVEEILRDPDVLASAEHPTIYHAFLAHAPTPSVAELRDEALVYVHAGTDTS SDALAVGTLNVLGRPAVLARLRAELDTVWPRLDERPRYEALEALPYLTAVV KESLRCSHGVVHPMTRIVPRGGARISGAHIPAGTIVAESNIFVHWNADVFPEP HEFRPERWLEGKTPSGESLDNWLVPFSKGPRSCIGINLGYCEIYMTFANLFRR YDLSLDGVKPSDWKWRDCYLPHYLGPEMKVVATPRLS >CYP5142D1 ug.43.40.1 MAGQLMAHSLDVLSTMFTLLSLYAITRCIYNLYLHPLSRFPGPKLAAATTWW RAIGEVFMWENLTDKLVELHNTYGPCEIVRIGPNELHFSRPSVYHEIHNPRNK WNKDPAVYNVFADTESTVSICNYEAAKRRREMTLPLFSRRSIVDAHDLIRSCL DKMCTNIDSIASSGEPVHFFRAFRCFALDAISLMCFGVSPEASLAPHFRSTLDG 
AMHVALHDALLVKQFPLLKYLMAYSPQWLVTYTRPALRSYFEMRRVRLSLP TLRQLVYLRSTIAAERSSAGSQEGTPLSEPVLRDEAFVFVNAGADTVSNAITV GVLNVVDNRDVYTKLKHELRCAWPNLKVSPRWEELERLPYLRAVVKESLR MVIGVVHPMTRIVPPQGAVLCDMFIPGGTSVGISHYFLHHNEDVFPQPRTFKP ERWLARESDKEHMLVSFSRGPHSCLGVNMAYCELYLAFAYFFRRYDVELNG VRYVHVSSRETELTN >CYP5142E1 pc.167.13.1 MDSWLSWHGVAAAVVAAALLLVVYRVYFHPLAKFPGPKLAAATHWYSAY YEVWRDGALVEHLQELHKQYGPVVRITPDEACISYTDIYVRGTRFTKDPGFY GFMHGDRSSFWMLDPQKSKARRDVLLPLFSRRAVLSLEDVVQKKVRALVTA VLTQGADDTSVNMHRAYRSATLDTILAYAFAQERGMLDVPGFAHPLVREFE RAFPLALILKHLPWLHRVSTAVRAVKYMLVRTDPDDIVRDTAAQIDGLLADP DRLAELPHETVFHRFLAPHAKGAGGEPPSRRDIVDEAINIFAAGSDSTGHTCA MGTAFVLAYPEVHKRLVRELEEAWPDRDAEIRLAQLEKLPYLTAVIKESLRM SHGVVMPLPRVVRPNEAIIDGISVPAGAVVGMGATFMHYNPEVFPQPYTFDP DRWLQPDVSRLEQHLVPFSKGLRSCIGLTLAWCEMYLVFGYIFRLLDMQLDN MTLEDIKVKYHFTPTVREKDMLRCMVRARES >CYP5143A1 gx.20.61.1 IILIILATISYRLSPLHPLARYPGPILDKSTSLRLAYLAFIGQRAQYVTELHERYG KIVRIGPNKLSINSLDVVHPIYGSSQAYDKSESYRPGLSAEGSIFFARKKELMR DGDPEGVVESGHKAILMFETYCDSFGEVPALFDILSVLPTGEAYQLVEKRAAT HLKDRIKVHPHDGWDMCSFFLAQREGHNYPPMNEVDLNANTVVAFEAGGD TTAGFIIITMFHLLRYRQAYDKLKEELDGAFPTGSVSVEEYSHLAELPYLGAVI NEGLRLGAAFPSFPRVVPKGGAMLAGEFIPEGTMVGVPIYTQHYSPDNFWPEP REFRPERWFEDGLGPGTITRQAAFMPFQFGPFGCPGKALGLRLMSVVISNLLV LSYDLSFPPDFDPEAFLNGWINTRTNIFRIPLRVEAKRRPW >CYP5143A2 PFF_33b(pc.20.120.1) MFNSFGPAHVLLPPLALSVIIAVAAYRLSPLHPLAHFPGSWVDKVTSLRVAYF ALTGHRAEHVTSLHDKYGVVVRIGPNRVSINSSDVIYPIYASPQAFDRAASYR PGLIHDGSLLFSRKRHWDGALKDRVDQLIDCIARRQDLRGVVDLGEFMRNGD PHNICRSAKDSIVLFEVYSSSLGEIPALFDIASVLPVTAEYRKVERHFQKHINER MQIKSHSDWDFCSFFMAQREDAQYPPLSKPDLNADAMVAFEAGGDTLAGFL SIIIFYILKHQPVYQKLRAELEQAFPLGEIAQDQYASLTEIPYLVAVINEGLRLG ANFAGFQRVVPQGGAVLAGQFIPAGTVVGVPAHLQHIHPDNFWPTPLEFRPE RWFKDGLGPGTITRQSAFMAFQFGPFGCVGKTFAYRQLNVVLSRLLLAYDLT FALDFDSKAFVEGWLNIRTTIFNYPLKVQASRRQW >CYP5144A1 ug.24.32.1 revised 8/15/2007 at I-helix micro exon, also removed one small intron after AGVYL and added a micro exon VDVIPA MELPAHTKYLLACLAFAIFVLLHSKRRRPRYPPGQRGLPLVGNLWDIPTEYA WVKYREIGAQLGSDIIHFEVLGSHYVVLNSDKAVKEVLEKRSHNSSDRPQTV 
MLQELTGWHRNWALLEYGDYWKDLRRIFSQYFRPSAVPQYHSKQTKAVRRF LNLLLNSPDDFTKHIRYLAASAILDVVYGFDVRPGDPRIELVERGVHTLTDISA GVYL (1) VDVIPA (1) LKYIPAWFPGASFKRKAAGWKVLVDAVYEVPYSQYKDAMREGTAKPC 2461961 FAGTLLSEANPDGDLDETFRCLTGTAYV (1) GGADT (0) 2462165 VSSTLLTFMLAVTMFPETQDPAHEELDRVLGRKRLPDIRDRDDLPYITAM 2462314 LHEVLRWHPVAPLTLPHRLTADDEYEGYHIPAGAVVFGNAWAILHDPATYGD PDVYAPARYLTADGRALRADVPYPLEGFGFGRRVCPGRPFAHDILWLALAHVL AVFRVGRARDAHECEVPPRGVFTPGLISVPEPFGCRFVPRFPGAEELIRQSAMP E >CYP5144A2 ug.24.29.1 revised 8/15/2007 at I-helix micro exon and at VDVFPI, AND KQSIMVNEL, AILRDED MEDTSRSLVGPSLWAVFALGLLFAFCLRRQPRYPPGPRGLPIVGNVFDIPMNV GWKVFRDVSRCFESDVIHYEALGSHLVVVNGAKAAKELFERRANNYSDR (2) KQSIMVNEL (2) TGWHRNWGQLEYGDRWRQHRRLFHQHFRPMAVSQYHPRQVKGVRVLLRA LSESPEDFQRHIRFMAGATIMEIVYAYDAQPGDPRIKLVEDAVDTLTFVVNAG VYL (1) VDVFPI (1) LKYVPNWFPGASFKRQAAEWKKLVDALYEQPYQEFKATVKEGN 2465265 AKPCFAATLLSSVENDEDIENLEELFMGLTGTAFV (1) 2465369 AGSDT (0) 2465493 TIASLNVFILAVTIFPEAQRSAQEEIDRVLERKRLPTMEDKVLLP 2465627 HVTALVHETLRWHPPLPLAAPHRVIEDDEYEGYFIPAGTTIIGNAW (2) AILRDEDLFPDGDSFKPERWLNEAGALRDDLPYPMETFGFGRRICPGRHFAND VLWLAIANILTVFSIERALGEDGQPIVPEAKFSPRLISK PEPFKCAFKHRFSGAEDMIRLAAIVEE >CYP5144A3 genewise2nd.24.5.1 REVISED AT PRSVMLHEL 8/16/2007 MELPPVPHPLIAYLCAGLLLAGLVVTRLRRRRHYPPGPKGLPLIGNLFDIPTDYAWKI YRAFGDQYGSDIIHFEIFGTHLVILNSAKAARDLFEKRSSIYSDRPRSVMLHEL TRWGRSFGFMQHGDEWREHRRLFNMHFRPSAIAQYHAKQKSAVCTLLRSLLDA PEQFREHVHFMAGDVIMGIVYGFDVQPGDSRLQLVEKAVMTLNQIVNAGVYL (1) VDVIPA (1) LKYIPAWFPGAGFKRHAAEWKKLVDDMFEIPYRESMKSLQEGKCESSFAASLLAQ LEGQESPDNIERIAMDVLGTTYVAGSDTIITATSTFLLAMILHPEVQITVQAELDAL LEGARLPNISDKAALPSVTAVLQEVLRWNPGLPLVPHRVVADDEYKGYHIPAGA AVIGNTWAMLHDETTYPDPEPFKPQRFLNEDGTLNADVPYPTDVFGHGRRMCP GRHFAHDMLWLTIASILTVYKVERDVDEDGQEITPTASFTSRVPTPFRCRFTPRSA SAESLIRSSGVSTE >CYP5144A4 pc.24.8.1(gx.24.6.1) revised at micro exon 8/15/2007 ADDED KQQSVMIHEL AND REVISED LAST INTRON BOUNDARY 8/16/2007 NOTE THIS GENE IS COMBINED WITH CYP5144A5 IN JGI BROWSER (e_gww2.1.398.1) MEQSTLHILPYLCASIPVLVCLLVLRLRRPHYPPGPKGLPLVGNLFDVPLSHGW 
VAYRELAKQYGSDVIHLEILGSHIVIINSAKAARDLFDKRSNIYSDKQQSVMIHELTGWHRN WGFMAYGDYWRKHRRLFHRHFRPAAVPQYHSAQAKGVHNLLKLLMRSPERFREHIRF (2) 2473972 MAGSTILDVVYALDVQPGDSRIELVERAVHTSTEIVAAGVYL (1) 2474097 VDLFPI (1) 2474217 LKHIPSWVPGAAFHRKAAAWKALVDRMYEEPYNQFKASM 2474333 KDGNAKACLTASLLMEAESTHQLDAIEDILISVTGTAYGAGTDTAVASLNTFMLGITMFPH TQLAAQDELDRITARQRLPTMEDRENLPHVTAILQEVLRWNPAAPLGLPHRTVRSDEYNG YFIPQGATIIGNSWAMLHDEAIYPDPGSFKPERFLTEDSTLRSDVPYPIEAFGFGRRICPGRYF AHDLLWLTIAGILAVFRIERARDEQGDEIVPAGDFSPRFIS (2) SPEPFQCRIVPRFAGAEALIHGTGLLG* >CYP5144A5 pc.24.9.1(genewise2nd.24.7.1) NOTE THIS GENE IS COMBINED WITH CYP5144A4 IN JGI BROWSER (e_gww2.1.398.1) MEWSLSLGYALLGLGIMWVVKYAQRPRRRYPPGPKGIPILGNVFNIPLENSWISFDQ WSRQYASDIVHVEALGKHVYVVNSARAAKELFDGRANVYSDKEQSVMMLELCGW SRSWAMLPYGNYWREHHRLFHQHFRPQSMVRYHEKQRRGARRLLQLLLDTPEDYE KHMRYAAGSTILDVVYSFDVQPNDPRIELVEAALGTANDLMHAGIYL (1) VDIFPL (1) LKHIPTWFPGAQFKRLAAKYKRLVDNMYTVPYSQLKASVKLGTAQPCLVASLLS EADEHVTPERDEIFMNLAGTTYAGGTDTIVIALSIFILAMILHPEQQVAVQKEIDR VVGRDRLPELADRESLPRVTAVIQEVLRWHSPLPLATPHRATSDDEYNGYYIKA GSVVIGNAWAMLHNENVYPDPASFKSERFLTPDGKLRDDVPFPIEAFGFGRRICP GRHFALDSLFLLVSHILAVFTIEHAVDADGHIIPVEPEFEPQAFSPPKPFKAQFKLR FLAAEDLIDGSALE >CYP5144A6 pc.24.10.1 revised at micro exons 8/16/2007 MLTALSCVFAEALAVWAVSSWTRPRHEYPPGPKGLPFLGNMFDIPMKYGWVTF ANWSRLYGSDIVHVQALGKHIYVINSAKVAKDLFDGRPHIYSDK (2) EQSVMTQEL (2) SGWKRAWALSPYDDEWREYRKLFHQHFRPSAVQQYHHKQTKAVRRLLQLLLD TPEDFLAHLRYAAGSSLLDVVYSVDALPGDLRITLVEKAVHTFAKLLETGVYL (1) VDVAPI (1) LKHIPAWFPGADFKRQAAEYRQLVDDMFKVPYQQFKDAWRRGTAQPCFAASLLT DADPLDGSEHEELFINLTGTTYAAGSDTTVAAMSTFMLAMALHPEVQRWVQEEL DRVVGRGRLPEMADQPALLRVMATVHEVLRWHPPLPLATPHRAMADDVYAGF TIPAGSIVLGNSWAILHDDKTYTNPHTFDPRRFVGPNAQPFPEVVFGHGRRECPG RHFALDILFLAVAHVLSVFAIERVDNSDPGIGDIQGLFTPHVLSYPKPFKASFKPRF PGVESLVRTAALSEI >CYP5144A7 pc.24.11.1 This is hybrid with genewise2nd.24.9.1|whiterot1 in second half (GENE IS SPLIT IN GENOME VIEWER) MSRFLYDYSTLLYLCAGITFVVLITLSSRPRRRYPPGPKGLPIVGNLFDVPTDHGW KRYQEIGKEYGSDIVHFQVFGSHIVVVNTAKAARELLDKRSNIYSDKQRSVMIHE 
LTGWHRNFSLMPYGEGWRTRRRLFHQHFRPMAVPQYHTRQLKAVHGLVQSLFE APQNYKEHIRFMAGSAILDIIFAFDIQPGDPRIEIVEKGVQTATEFMCSGVYL (1) VDVFPI (1) LKYLPSWFPGAGFKRQAAKWKALVDDMHEIPYYQFKQTMREGKAKPCFASTLL SSAAENDKDSLESLDEIFMSLTGTAYVAGSDTTISALNTFVLAMTMFPETQAAAQ EELDRVLGRKRLPDFDDRDSMPYLTAMVYELLRWHSVLPLGLPHRTLADDEYN GYFIPAGTVIVGNCWGMLHDDDLFPDPDIFRPERFLNADGTLNSDAHFPIETFGFG RRICPGRYFAQDLLWLTIANVLAACSIERVVDEKGFEVRPTGDMTPRVLSMPEPF ECNIRPRFSGAEALVRSACLND >CYP5144A8 genewise2nd.24.9.1 = Scaffold_205i revised at micro exon 8/15/2007 AND AT QQTVMIHEL fgenesh1_pg.C_scaffold_1000787 [Phchr1:787] MEALASRITVYLCAGVVLVYVFSRVFKRRPHYPPGPRGLPIIGNLLDVPSLYGWI AYKNLGDQCGSDIVHLEVLGSHYVVLNSAKAARDLLDKRSNNYSDR (2) QQTVMIHEL (2) TGLERGLGMLPYGDYWRLHRRLTQQHFRAAAVPQYHARQAKVVRKLLHSLLD SPERFMDHIRFMAGAIILDIVYALDVHPGDHMIEVVETAMGRINEIINAGVFL (1) VDVIPV (1) LKYLPSWFPGAGFKRRAAAWKVDIAPMFEAPYERFKQSLGGTTRPSFAGNLLSR VQNEEELSQLEDVFMNITGTAYGAGSDTTLATLTGFVLAMTIFPEKQLAAHAEL DKVLERKRLPEVEDMQYLPSITALVYEVLRWNPAAPLGIPHQTIVDDEYNGYFIP AGTVVIGNAWAMLRDPNTYPDPDTFKPERFLAKDGSLRDDVPYPTEAFGHGRRI CVGRHFAQDVLWIAIAHILTVFRIERAVDEDGREIVPVPDYTPHFVTMPKPFKCRF TPRFPGAEGLIRSAADASNE >CYP5144A9 pc.83.7.1 revised at micro exon 8/15/2007, revised at RQSTMMLDEL and EFTARIVS 8/16/2007, EST = DV758101.1 MAVLLAAGYYILSVAVFILLYNASRRRQRLPPGPKGLPLIGNLFDVPNDYAWLRY KELGQQYGSDIVHMQALGNHILVLNSMKAAVEILDKRADISSDRQSTMMLDEL SGLGRAWTQLGHNDSWRIHRRLFHQHFRPSAISQYHTKQTKAIHRMLSFLRESPA QYMDHIRFMAGSMILDVVYTLDVQPGDYRIKLAEMVAHVSTEVFTAGVWM (1) VDMIPM (1) LRHLPTWFPGAGFKIQAAKWKTTVDRSYDIPYEQFKASMHEGGGEPCLASALLS SAEDVEELERMDKVFSSLTGTAYIAGTDTTVSTLASFVLAMTIFPEAQLAAQAEI DRVLGGTRLPDINDKANLPQVTAILYETLRWNPVLPLALPHRTTADTSYDGYYIP AGTVVLGNSWAILQDETLFPEPQLFKPERYLNGDGSLNSSAHYPIETFGFGRRICP GRYFAQDAVWLAIAHILAVFKIERARDGDGKEIVPTPEFTARIVS (2) MPKPFECKFKVRSPQAESLIESAALGG >CYP5144A10 pc.83.8.1 revised at micro exons 8/16/2007 MALLVYFCAGLVPVLLLVLGLRKRPRYPPGPRGVPIFGNIFDVPMKYAWLEYVK YGQQYKSDIVHFQVLGQHIVVLNSLQAVGDLLDKQSSIYSDR (2) VPSVMLNEL (2) TGWSRSWVQMEYGDQWRMHRRLMHQHFRSTMIPQYHPKQTKAVRRLIQSLLE 
QPEHFMEHVHFLSGSLILDVVFSFDVRPGDAILALAERAVDTTKAIIAAGVWL (1) VDVVPI (1) LKYIPSWFPGAGFKRIAAKWKTDVNKMFDVPYAKFKDSMREGSATPCFASALLS GAEDDNGGVIDNQDEVFISLTGTAYVAGSDTLSDALSTFLLAMIAFPEKQRAAHE ALDCVLERKRLPGVEDRDALPHITALAYEVLRWHPVVPLSIPHRTTADSYYKGY YIPAGSTIFPNSWAILHDEALYPEPHLFRPERFLNEDGSLHAHARDPIEAFGYGRRI CPGRHFAHDALWLAIAHILAVFKIERALDVDGNEIEPKLDFMPHFLSMPKPFKCR FTPRFPDAANLALSASSDY >CYP5144A11 ug.83.30.1 revised at micro exons and after EXXR motif 8/16/2007 GC boundary at HIRY, added C-term MGWTLCLYALLGLTGLWVAARVRRPRQRYPPGPTGLPVLGNVFDVPLENGWL VFDQWARQYDSDVVHAEALGRHIYVVNSAKAARELFDGRPHVYSDK (2) DQSVMLLEL (2) SGWWRSWVMLPYGDYWKEHRRLFHQHFRPQSLPQYHEKQAKAARRLVRLLLD SPQDYAKHIRY (2) ATGSSILNVVYSFDAQPGDPRLELVEAAMGTANELMHTGVYL (1) VDIFPV (1) LKHLPMWFPGAHFKRQAARYKRLVDDMFEIPYAQLKSSMQEGTIEPCFAAALLS EAEDSASPERDDMFMNLAGTAYAAGTDTIMMTLLTFILAMVLHPEEQAAVQEEI DRVVGRDRLPGLADRESLPRVTAVIQEVLRWHPPLPLA TPHRAASDDEYNGYHIPAGALVLGNCWAMLHDARVYPDPDVFRPGRFLAAGGD APRADVPLPAEAFGFGRRICPGRHFALDSLFLFVAHLLAAFRIEHAVDAEGNVVP VVAAFEPQAFR (2) SPPKPFKARFTLRYCGAENLVRGGVRAG* >CYP5144A12 pc.16.82.1 revised at micro exon and added one exon before it. 
8/15/2007 fgenesh1_pg.C_scaffold_8000230 [Phchr1:5055], added N-term 8/16/2007 added RTHSVMLNELSGWAE and WYTPFPLG and C-term exon 8/16/2007 MELWFDFPSVVTYVLAA VSCASLLILHTRRRPRYPPGPKGLPIVGNLLDVPTHNAWIKYKQLGKKYGSDIIHF EVFGSHIVVLNSTTVARDILEKRSQISSD RTHSVMLNELSGWAE ERNFGFMRYGDGWRRQRRLFQQHFRRKAVTQYHAIQSKSVHSLLNALLDRPERF IANLRF (2) MAGSMILRIVYGTDIQPGDSRLTLVEKAVGTLVEVMNAGVFL (1) VDVFPI (1) LKHIPSWMPGAGFKRKAAEWKVLVDDMYEVPYNVGTLSIFFLAMTIFPSVQVAA QEEIDRVLGRKRLPSIEDRNALPRTTAIVYEVLRWYTPFPLGVPHRTIADDEYNGY FIPAGTTIIANAWYAMLHDEERYPNLETFIPERFLNKDGSLRSDACIPLEPFGFGRRI CPGRYFAEDIVWLAIASILSVFRVEPPVDEHGEPLKQTATFGTRFLS (2) PAPFKCCFTLRYPEAEGLIRASATSTA* >CYP5144A13 pc.16.83.1 = GX.16.34.1 (USE THIS MODEL) = SCAFFOLD 4e fgenesh1_pg.C_scaffold_8000229 [Phchr1:5054] ver2, ESTs = DV761651.1, DV753979.1 revised at micro exon 8/15/2007, revised at AHSIMLNELS and C-term 8/16/2007 REVISED SEQ = 50% TO CYP5144A8, IT SITS NEXT TO 5144A12 MTTLALASSLTYLFAGVLLVCLFVSYARKRPRYPPGPKGLPLIRNLLDIPADYPWIT YRDLAEKYNSDILHFEVFGSHLVVLNSAEATRQILEKQSSITSDRAHSIMLNELSG WDTDRTVTFMEYGESWRRHRKLFQEHFRQQAIPRYHHAQTKGVNRLLKSLLDTP EKFSAHIRFMSAYTITEVVFGKEVGPDDPSIEVVDDGMHTLNELLNAGVFL (1) VDIFPL (1) LRYVPSWFPGASFKRLAGKWKKAVDDMYTIPYNNYKATLGEGDANTCLLATAMADKVD QDDARVVDHDLMCLAGTTFGAGYDTTATALGIFIMVMAVHPEAQISIHEELDRVLHRDRL PTMEDRKELPRTTALMYETFRWHLPLPLG VPHQTTAAIHYNGYFIPQGANIVANSWAILRDEGLYPDPETFKPERWLDADGSLR DDMRFPVEMFGYGRRICVGRHFAEDIVWLAIASILSVYKIEPPVDENGTVRALEAD FTPRLFSAPKPFKCRFTPRFPGAEGLIRASL* >CYP5144A14P pc.24.16.1 = gx.24.13.1 PSEUDOGENE MISSING C-TERM revised at micro exons 8/16/2007 MPDCTTIGYLLASVALAYALASRHPRSSRYPPGPRGLPLLGNLFDMPRKHSWTK HQELSKTYESDVIHYQVLGLHIMALNSGEAVRDLLNKRSTIYSDR (2) QETVMLHEL (2) TGWHRNWALMRYGDAWKERRRHFHEHFRPQAVSQYNFKQVKAARILLNSLLE SPRAFSEHIRFMASSLILDIVYALDVRPDDPEARRVERALETLAEISASVFM (1) VDLIPV (1) LKYLPSWFPGAGSKRQASIWKNIVDEMFETAYQVCKNSAQHEYVRPCFTTALLS EVSDPANMKEMDEIFMSLAGTTYI (1) GGSDT (0) >CYP5144A15P PSEUDOGENE pc.24.17.1 64% TO 5144A1 IT APPEARS THAT 5144A14P AND 5144A15P WERE PARTS OF A SINGLE GENE 
DISRUPTED BY A TRANSPOSON CONTAINING A RETROVIRUS-RELATED POL POLYPROTEIN THAT NOW LIES BETWEEN THEM AQSAARSSTASTEQINATLNTWMLAMTLFPDTQVAVQDELDMVLGRKHLPSIED RDSLPRVTAMLHEVLRWHPVGPMGVPHRLTVDDEYRGYHIPAGTIVMINAWAIL HDESVYPEPDIFRPERYLDSDGRLRTDMPYPVEGFGAGRRLCPGRHFAHDMLWL AIAHVLTVFRIERAVDEDGREIVPEAKFEPWLIRCASLCVGVPCTAHLAVSPPEPF QCQTKLRFPEAEGLVHLAAMDE >CYP5144A16 NEW SEQ 2588519-2588442 REGION (-) STRAND SCAF_1 68% TO 5144A5 AA 311-336 e_gww2.1.906.1 [Phchr1:132380] MEGIPILSYVTLGLVTVWTILSLRKPRRRYPPGPKGLPVLGNV FDIPLENGWLIFDKWARQY (1) GSDIVHVEALGKHIYVINSAKVAREIFDGRPHNYSDK (2) EQSTMLLEL (2) SGWGRSWVMFPYGDYWRQHRRLFHQHFRAQSIPQYHQKQAAAA RRLLQLLLDTPADFAKHIRY (2) ATGSSIVDVVYSFDTPPGDPRLEIVEAAMGTASELLHSGIYL (1) VDVFPI (1) LKYVPAGFPGAQFKRKAAHYNKLVKDMFTIPYTQVKTAM KEGSVQPCFTTALLSESDDLDTPERDKIFQSLVGTAYAAGTDTLMISMLTFMLAMVLH PEAQT AAQNEIDSVVGRDRLPGMTDRDSLPRVTALIQEVLRWHCPMPLATPHRAIVNDEYNGY HIAAGSVVIGNA WAMLHDEDVYPDPHSFKPDRFLTTDGRLRDDIPFPIEAFGFGRRICPGRYFAMDALFL FVSHVLAVFRIE HSVDAHGNVVGVEAEFQPQGFRCA* >CYP5144A17P gx.24.12.1 49% TO pc.24.17.1, 57% TO 5144A5 scaffold_1:2496394-2495844 (-) strand 2496394 IADREPLPPVTA 2496356 2496230 HTPRYERRHSGYYMSAGLLVIGDTWY (0) 2496153 2496068 RFLMAERALRTDVTFPIEVFGYSRRICLGRQFAKDVLFLAISNILAIFTI EKAVDEHSGSIEVQNAFLPHAIRCA 2495844 >CYP5144A18 fgenesh1_pg.C_scaffold_1000782 [Phchr1:782] same as Scaffold_205c gene model complete, revised at micro exons VDLIPI (1) and PQTVMLHEL 8/16/2007, 62% to 5144A1 GC boundary at PGLIR MESLTLRNCASAFLAG LLVLACGLVLRPRRRPRYPPGPKGLPLVGNMLDIPTEYAWERYYELGKEY GSDVLFFRVLGSHFLVLNSAAAANELLEKRANVYSDR (2) PQTVMLHEL (2) TGWDQNWAFWEYGEGWKQARKMFHQHFRPSAAPQYHLKQ TKAARRFVKLLLESPASFAQHAR FLAGSAILDAVYAFDVQADDPRIALVERGVHTLVEISRGVFL VDLIPI (1) LKYIPSWFPGAGFKRQAAQWKDAVDATYSDPYRQFKTLL RNGQAEPCVAASLLSSSGDEPSGALDDLLKSVAGTAYV GGSDT VSATLTTFILAMTMFPDAQAAAHAQLDEVLKRTRLPEMADRAALPYITAILYEVLR WQPAGPL GLPRRLMADDEYRGWHIPAGTVVLPNIW AMSHDPGTHAVPAQFVPARYLAADGTLREDVPCPADVFGFGRRVCPGRPFAQDV LWLAI AHVLSVFRMEGPMGERGEIRHSRLFTPGLIR (2) LPEPFACSFRPRFPGAENLVDAGVVG* 
>CYP5144B1 pc.23.12.1 revised at micro exon AND PHAPIVSTIL 8/16/2007 e_gww2.8.230.1 [Phchr1:138753] MLDTTPFTLALLTVGAVCLLGLIKGASRRRRLPPGPKGVPLLGNIFDAPKEHEWR TFKEWGRTYESDVVHFGILGTHYVVLNSAWAALDLMDKRSHNYSDN (2) PHAPIVSTIL (2) TGWDRNWGFMKYGDYWRAHRRMFHQHFRPNAVSAYHSTSQQAVRELLRLLY AKPEHFKEHIQHMTGYNIIKLMFGVAVSPEDDPILARMENALRILGKIANPGVYL (1) VDSFPL (1) LRFIPSWVPGAKFKRDAEAWKPVIDKTYTQVYEEIKTSYANGSPVPCLVTEMLED VLKVDNEAYRDMLEDVIINGGGTAYVAAYDTSSSVLSTFVLAMLLYPDVQRTA QEELDQIVGPDRLPTMEDQPSLPYVTALATEVLRWRPALPIGVAHKSVVDDEYR GYHIPGGSVIIPNVQAILHDEGSFPDPDTFNPRRFLDSEGQQLEELTGIVSAAFGFG RRICPGRYFAKDVVWLTIASVLSTFNIEKHFNEAGNAVEPSGEYTPGIISYPAPFK AAFKPRSESAVELVR >CYP5144C1 ug.83.31.1 revised at micro exons 8/16/2007 fgenesh1_pg.C_scaffold_1000766 [Phchr1:766] MFSALVLSLLLAAALFLRFRRKRYPLPPGPKGLPIIGNARDIPKSFPWYTYDRWSR EYNSEIIYLRLVGTDVIVINSEKAANELLNKRSTIYSDR (2) EHMTMLLDL (2) VGWGGRNFAFAHYGDLWRAHRRLFHQYFHPGAVPAYHAKSTLEVRRLLPRLLS HPDDFMQSIRTMTGAIILGITFGMELQPENDPFVALAEEALHAMAQVGNVGSYI (1) VDYLPW (1) LQYLPSWAPGAAFKRQAAKWNKIVLEMYEKPFQTIKQALARGEAPPSILTSMLE TLDPEEDNAARESDMRHVTGTAYTAGADTTVSSLGTFILAMLLHPEVQRRAQEE IDRVVGSDRFPEYDDRDSLPYITAIMKETLRWRQVTPLAVPHRLRVDDEYNGYH LPAGSVVVGNSWAMLHDEERYPNSDLFDPTRFLTPDGELDPDAPGPELAAFGFG RRICPGRYFAMDSMWIAMAHILATVNIEKAVDDAGNILEPSGEYTYPVPFKVAFK PRSAAADALIQGGTPLA >CYP5144C2 pc.83.19.1 revised at micro exon 8/15/2007 fgenesh1_pg.C_scaffold_1000761 [Phchr1:761] REVISED EXXR REGION AND PSHTLLVTV, and E*LTYAKWSREC (1) Added C-term 8/16/2007 Note: there is a stop codon in a conserved region, possible pseudogene MDIPVPYAFIVVGALVLFFRLRKKPRFPPGPKGLPIVGNALDLPKAR E*LTYAKWSREC (1) GSDIIHLRFFGTHVFVLNSVKVVNELMVKRSAIYSDR (2) PSHTLLVTV (2) TGWQRNFTFVDLGDHWKARARMFQQNLGTSTISKHRPKLIEGNRKLLLNLLLSP DDFMKHIRYLSGSSILGIIYGIEVQQDHDPFVETAEKALQCLAAVINAGSYA (1) VNYVPI (1) LRFLPTWAPGAQFKRDAAEWYKYVTALIDGPYTYVKESLANGENNTSIVGTLLQ ELSDDEKRSEQEDTIREAFGTAYTGGVDTTYSSVNSFILAMLKYPDVQRKAQEEL DRVIGRDRLPSFDDRDALPYITAIVKETLRWGLVAPLAAPHQLRVDDEYEGYFLPA GSVIIGNAWAILNDEKRYPHPESFIPERYLTTDGTLDSSAPDPMEACFGFGRRMCL 
GRYFAFDSLWIAVASLAAAFHMEKAVDESGLVIEPSGEYTSGTAC (2) YPLPFKAVFRPRHEGVVALIKADVSESDSL* >CYP5144C3 pc.20.52.1 revised at micro exons 8/16/2007 MDDIVLVLLVACAVALYARTKRTRYRLPPGPKGLPILGNTYDIPAKYEWLAYEK WSRDFGSDIICLKFVGTPVIVLNSIQAINDLLEKRSSIYSDR (2) PVTVMAYEM (2) VGLDRNFGFVPYGDVWREHRHLFHQYFRLDMVPKYHDRMLKHSKDLLQRLLV SPDRLMEHLRFSVAGASILNISYGIDVQPENDHYIAVADEAIHALAVTGNAGSYL (1) VDYLPL (1) LRYIPAWVPGAKFKRDAANWWEKTRLMIDEPFNYAKQRMAQGKGMDCVTAV MLSAIGEDQDREHQELLIKQVLSVSYIGGADTTVSALATFVLAMMQNPKMQRIA QADIDRVVGGERLPSVEDRDSLPYVTAIVKEALRWRPVIPLAVPHRVTVDDEYK GYHIPAGSIIVGNVWAVLHDETRYPNPDVFDPTRFLTSDGQLDNNAPDPAEACFG FGRRICAGRYFALDAVWLSVACILATFDIAKPLDENGNPIEPSGEYTTGLLSHPVP FKVSFKPRSAAAEALVREPISRDL >CYP5144C4 gx.20.26.1 revised at micro exons 8/16/2007 MDYLGISILLVAALALHYFLRKKRYRLPPGPKGLPILGNALDIPAKHEWLAYAK WGQECGSDIIYLNLAGTPVVVLNSAKAAKDLLEKRSSIYSDR (2) PVTVMAHEL (2) IGLGRNFGLKPYGDTWREHRRLVHQHFRTENVPRYHEFTSKQIGRLLLHLLEDPS NFVRHLRIMAGASILRICYGIDVQPDNDHYLSVADEAIESIAATGNAGSYL (1) VDSLPI (1) LRYLPSWAPGAQFKRDAAKWKEKVDRMIAEPFDYAKRYMASTEGAATDYIAGL LLSAMDPGRDKTQQEIAIRDSLWAAYVGGADTSVSALATFTLAMVLYPDVQQT AQAELDRVLGKDTLPTIEDRDSLPYVTAVVKETLRWHPVTPLAVPHKVTTDDEY RGYHIPARSIVVGNVWAILHDPDRYPNPESFEPSRYLTSDGLLDPAAPDPTEACFG FGRRICPGRHLAYDTIWTGIASILSSFDISPPLDEQGKPVNPSEEYTTGMLSHPVPF RANFKARSENVEALIRRITLCE >CYP5144C5 pc.20.54.1 revised at micro exon 8/15/2007 and EIMVMCHEM MQTVVLAILFVFAIALPIYSRRKRYRLPPGPRGLPIIGNILDIPAGREWLTYAKWSR EYGSDIIYLNMAGTPVYVLNSIQATTDLLEKRSSTYSDR (2) EIMVMCHEI (2) VGWGKNFAFQPYGDFWREHRRMFHQHFHPEAVTKHHVHILKQAKDLLQRLLVDPDD FMQHLRFMAGAAILRVSYGINVQPENDHYIGIAERAIHSLALTGNAGSYL (1) VDNLPI (1) LKYLPSWAPGARFKRDGEIWRREVDQMFSEPFELVKRQMVGADGEPPDCVTAS LLTTLDERKDRPREELEIAVKQAVGTSYVGGADTTVSSVATFILAMLQFPDVQRT AQAEIDRVVGSTRLPTIEERGSLPYVTAVMKETLRWNQVTPLAVPHKVTVDDEY KGYFIQAGSIVIGNSWAVLHDETRYPNPEAFDPTRFLTPDGKLNPSAPDPVEAAF GFGRRICPGRHFAMDAIWMNLAFILATFNIEKPLDEAGRPIEPSGLYTPGLLSHPEP FRVKFIPRSKAAEALIRETMFYD >CYP5144C6 pc.20.55.1 revised at micro exon 8/16/2007 and VVTTMAHEM 
MENITFAVLFVLLLAVPLFFKRQRYRFPPGPKSLPLIGSVLEFPVQSSWLTYQRWG RELASDIIYLNVLGKHIYVLNSAQAVSDLLEKRSGTYSDR (2) VVTTMAHEM (2) VGWDKNFALQPYGEFWREHRKAFHQQFQPDMVPRYHVHMYKQAKDLVRRLIA EPNALKQHLRYMAGALILRVSYGIDAEPNDDHYFEIIEQAVYSLTEVANTGAYL (1) VDFLPF (1) VKYVPSWMPGAQFKRDATEWAPQVNQMFDEPFDVVKRALAEGNAPDSVCAAL LSELDPRKDRAHQETVIKQAVGTAYIGGADTTVSTLSTFVLAMMTHPDVQRTAQ EHIDRVTGGDRLPTIEDRDALSYVTAIIKEALRWRPVLPMAVPHITTADDEYRGY HIPKGSIVMGNAWAVLHDEARYANPDAFDPTRFLTPAGTLDKDAPDALEAAFGY GRRVCAGLHFALDSMWVNVACVLATLDIRKPVDEHGVPVEPSMAYTTGLLEQP EPFAVVFKPRSQAAEALIYEGHDD >CYP5144C7 pc.142.5.1 , e_gww2.9.179.1 revised at micro exon 8/15/2007 revised at RLHSTMLHDY 8/16/2007, ESTs = DV760024.1, DV759090.1 revised region after EVLR WRPVAPL MDTLLLAGLVAVAVVAAGCLAHSRRQRFPPGPKGLPLLQNLLDVPRHRPQWEA YRDWGLKYNSDIVHLRLFGVSFVIVNTADAVTELFSRRSSVYSDR (2) LHSTMLHDY (2) IGWEKAIVMKNYGEDWREHSRLFHQSFQPKVIQEYYPRLYEEARKLLPRLLKGD DFVASLRVMTASAILGVTFGMEINDSNDPYVTIADRAIQSLVEAGMPGSYM (1) VEYIPL (1) MRYIPSWAPGGKFKRDAAEWRTLVSDMFTKPFEHIKSAIRHGVARPSIATSLLMG LDDKQDNARREHVIRNVTGTAYVGAADTTVAALRTFILAMVMHPAIQKAAQAE LDRVVGRDCLPTFADREHLPYLTAIQYEVLR (2) WRPVAPL (1) GFPRRANADDEYHGYHIPKDAIVLGNIWAILHDPARYRDPAAFDPARWLTPDGA LRADAGDAMLAFGFGRRICPGRHYAVANMWINMAYMLAAFDIAPPRDAAGRA VPPAGEHTTGLLTYPKPFGAVFTPRSAAALRLIVADADD >CYP5144C8 genewise.35.22.1 = GX.35.9.1 revised at micro exons 8/16/2007 e_gww2.4.188.1 MGVLILCAIAILAALLCYHHFRARRFRLPPGPKGLPIVGNVLDVPKDGPGWLTYE RWSHEYGSDVIYLNLLGSSIVILNSSKATTDLLDKRSPIYSDR (2) QRLTVLHDF (2) VKGDRAFAFLGYGDEWRQHRGIFHKYFHGQAVHNFRPKMLEEARKVLVRLQST DDYDRCFRVMSAASILGVTFGMDIEDINDPYVVLAEEAINYVLSAAIPGSFV (1) VDSLPL (1) LKYLPAWAPGAGFKCKGAEWHDLVSRMILTPFETLKQKMAEGTAKPCIATAIVQ KLEESKGGDAQRETQIAQYVTGTAYTAAADTTVSSLCTFILAMVLNPEVQALAQ EEIDRVIGTTSLPDYTYRDSLPYVSAIMYEVLRWRPVAPLGVPHRLMEDDEYEGY HIPGGSLVVGNIWAITHDPVRYLNPDAFDPTRWLTTDGQLGDVTDALVAFGFGR RVCPGRHYALEALWITLVHVLAAYRIEPPVDAHGRARPPSGEYLPGFIAFPAPFK AVFKPRSPVALGLIQTALG >CYP5144D1 pc.24.13.1 (genewise2nd.24.10.1) revised at micro exons 8/16/2007 
MLDRSVLLPCLVGIVAAIIIVSRRGRERYRFPPGPKPLPLIGNLLDAPTDLGWYTY AKWARQYHSDIIHFEVFGQHFYILNSVRVAKDLLERRSQVYADR (2) QQSVMVQEL (2) TGWHRVFSMKAYGESWRQQRRLFHQHFRQQAIPEYHAELTNGARMLLRSFLES PNHFLEHIRHISGGTILAVLYGIDVDNYSAERMESIEKAIEIVTEIADGGVYL (1) VDFIPL (1) LKYLPTWFPGAGFKRRAAEWRVHVETMFEAPYGDVKRDMKLGKAKPCVATKL MSAFGDKAEDPEIEELLICVTGTAYAASDTIVFAMIAFVRAMMVFPEVQCKAQQ ELDRVVGRDRLPVISDQASLPYLAAVTKELLRWHPITPIAVPHKSTTDDWYDGY YVAAGSIVIANVWAMFRDEERYPDPEAFRPERFLTAEGTLDPAVPDPVEVFGFGR RMCAGRHYVDAALFLAIAHVLHALTIEKPRDARIPVVDPPPGYALSRLFWAPEPF EADIKPRFEGVERLMQMSSLHSF >CYP5144D2 pc.24.14.1 revised at micro exon 8/15/2007 and at QLSVMACEL MLSSTILVFSSVCTLAAIVAIVRRFGGKRRHHFPPGPKGLPIVGNLFDVPTNFGWY TFAKWAQQYNSDIIHFEVLGKHFYVLHSAALAKELFERRSQTYSDR (2) QLSVMACEL (2) TGWHRVLTLTPYGEYWRQYRRLFHEHFRAQVIPQYEDKMLTSARNLLRLLLETP DRFLRHIRHASGRTMLDIVYALDTEAHNNAVILESVEKAIEIFAEVAEGGAYL (1) VDHIPI (1) LKYLPAWFPGASFKRQAAAWRVHVDTMYEAPYQDVNRRLLAGKAKPCITTSLI SAFSDKCEDPDVEESLISFAGTTYAGSDTSVFKMTIFMRAMLLFPEVQVKAQEEL DRVVGRDRLPELADKDSLPYISALYKELLRWHPLFPLAFPHKSTVDDWLDGYFIP AGSLIIGNAWATLHDEERYPDPEAFRPERFLSDDGKLDPSVPDPVEAFGYGRRICP GRHYADASLFLYIAHILFAFTIRKPLDERGNVIEPPPGVPEPFKASIKPRFEGVEELI QLSTQLATSD >CYP5144D3 pc.24.18.1 revised at micro exons 8/16/2007 e_gww2.1.419.1 [Phchr1:133291] ver2 MFDNGVTIALLLIGVTLLVAEALKKRHRFPPGPKGLPIIGNLLDVPKDYHWLTYT AWSRQFDSDIIHLEALGQHYFVISSVDVAKDIFEGRSQLYSDR (2) PQTVMLHEL (2) TGWERNFAMMAYGDSWRRHRRLFHQHFRLQNVPAYHDQIAKGARNLAQLLLQ TPDKFGRHIRHVVGAVILDIMYGIEVAPDDDERMEHLERAVHIFMELGQAGGFL (1) VDFIPA (1) LKYLPTWFPGAAFKRQAMEWKPQVDAMYEISYNEVKDSMQRDQAKPCITSALL TACWDDLDQSNMEETLIGVTGTGYAGSDTSVFALNAFVLAMMLFPDVQRKAQE ELDRVVGRERLPSADDRDSLPYISAVIKELLRWHPITPTAAPHKSIADDYYNGYFI PAGSIVIGNTWAMLHNEERYPDPEAFKPERFLTPEGTLDPHVPDPAEGFGFGRRIC PGRHFAQASLFLNISNVLATCMIEKPVDEFGNVVEPTRECTSRFFWALKPFEAKIT PRFEGVENLVQTMSTYTN >CYP5144D4 Scaffold_252d seq next to CYP5144D5 e_gww2.1.429.1 [Phchr1:132914] CYP55% to CYP5144D5 revised at HETVMARDL 8/16.2007, added C-term MQSNALVIACLVAGLLAVARASRKSRQRRYPPGPNGLPILKNLFDIPRTYSWLTYE AWGREYNSDVV 
HFEALGLHFVVLNSTEAAKELLEGRSHIYSDR (2) HETVMARDL (2) TGWHRHWGIMAYGDAWRQRRRLFHQHFRP QAVPQYHEPMVRSARTLLQLLLESPDDWMRHVHHVSGGTVLKVLYAVDVDPHD DEGMDVVDKALQIFMKLPEPFV (1) VNFIPP (1) LKHLPAWFPGAGFKRRAMEWKVHVDRMFEEPYHKIKVAAVTRPCIATSLLSAAW EDLEN PDVEEQLISVLGTAY GEYNLALEQTIFSVYSFVIAMMLYPDVQRKAQEELDQVVGRDRLPEIADRESLPY FSAVLKEVFRWHPVTPIAAPHKSLEDDWYKGYFIPAGTIVFGNTWAILHEESRYAE PDVLRPERFLTPAG TLDPAVPDPDEVFGYGRRICPGRYFVQDALFLYASHLLAAFTLSKFVDDEGHVEEP (2) RLTCIFVTLRIPRAFKANIKPRYEGAERLVQMAFTSAS* >CYP5144D5 PFF_252c = GX.24.18.1 revised at micro exon 8/15/2007 revised at HKTVMLHEL 8/16/2007, EST = DV762757.1 MSDSTLLAVGLIFGLLVLARVSRKRPRFPPGPKGLPIVGNLFGIPRDHSWLTYAE WGRLYISNIVHFEALGQHFFVLNDEKITKEIFEGRSQIYSDR HKTVMLHEL TGWHRNWAFTPYGESWRQNRRLFHQFFRAQAIPDYHDHMAKGARGLVQLLLQ TPENWMRHIRHASGSTVLDAVYAMDVDPNDNERLEGVERAVETLVEIAEAGGY L (1) VDFIPA (1) LKHIPTWFPGAGFKRQAMAWKRDIDALFERPYHEVKSAMGCRIFSEQRGKARSC VTSSLMSMFSEKLGDPDVEETIMGIAGTSY (1) AGSDTTVFTMFAFVQAMLLYPNVQRKAQEELDRVVGRDRLPEVADRQSLPYVS AIVKEILRWNPILPAAVYHKSLADDWYEGYFIPAGSIVIGNTWAVLNDAERYPDP EPFKPERFLTADGQLDSQVPDPVEVFGYGRRVCAGRHFAQTALFLYIAHVLALFT IDNPLDESGHVIKPRRDCLTRLVPKPFKAKFKPRFTGVEELVHMSSIPTQ >CYP5144E1 pc.8.2.1 (genewise.8.1.1) REVISED AT PRMPMLLDL 8/16/2007 fgenesh1_pg.C_scaffold_17000192 ADDED N-TERM AND COMPLETED C-TERM revised seq at EXXR MGDTLALNGGIVLAVIVACSIALLLFQRRPHLP LPPGPKRWPVVGNAFRFPKEREWLTFMRWSREFGALXDLLYYEMWGRPFVVIN SHRAAVELFERKSALYADR (2) PRMPMLLDL (2) CGWAWDLAFMPYDETWKLARKLFTQHFRAGAAGRYRDEETRCARELLADILR DDTQLFEHARVTFGKLIMSVTYGIDVRSADDKYITNAQKALYAITATGNVGTYL VDSIPLLKYIPEWFPGAKFQREAREWREAAEAMSQRPVEDVKIAMAEGTARPSV LRSLLEDYGETMSAQEAYAILSATGTAYEAAASETTWSATLTVVLAMLLFPEIQE RAHAELDRVVGMDRFPVFEDQPSLPYITAICKE (0) ALRWRTPLPL (1) AVMHRVTQDDVYNGCHIPGGATVVLNSWAILFDPDQYPNPEPFAPERFLAPSGE LAPDVPEPTAAFGYGRRACAGTAMALDTLWIVVASLLWAFDIRRAVDEMGNEI DVAGEYTFGVVCYPAPFRCALRPRSEGIRSLISLPE* >CYP5144F1 pc.240.2.1, e_gww2.5.175.1 [Phchr1:131322] REVISED AT PEMPMLNDL AND MICRO EXON 8/16/2007 MVSAVLQGLAGLVIILLVRWAARQRQDRKQGPHPPGPPGLPLLGNLLDMPDNPS 
WMTYIRWSEKYNSDILRLNVLGSNIIIVNSLDAANDLLDKRSAIYSDR (2) PEMPMLNDL (2) CGFGWNVAFRRYDDTWRNGRRVFQHELGPQVVKRFRALEEHATHQLLRNLLRE PAEFMGHLRHMSAFEILRIAYGIEVTGREDPYVDTAEHAVGAVVATCSPGSYLV NIMPFLKHIPEWVPGAKFKQDAKVWRRYVTELRDKPFSVVKERILRGDAPDCAA KSLLESLESGEDTAGYTEDDIKYALGSMYA (1) GGSDT (0) TVSALGSFILGTVLDPAIQARAHADLDRVCPGRLPTFDDQPELPYIDAIVKEALRW NPVLPIDVAHCSIADDVYRGYFIPKGSLVLANSWAILHDEAAYADPLRFHPDRFM AGDALDGRVREPDAAFGFGRRICPGRYMAYDAIWIAVACMLAVFRIDKAKDAQ GREITPSGEYNVGFAYPKPFPCDIRPRSSAHEALIRATAEDA >CYP5144G1 pc.119.17.1 , gww2.5.320.1 [Phchr1:40563] ver2 revised TVCSISYEL and I-helix region 8/16/2007 MSSLLRVADAVLLCAALTIIYKLCLRQAKPSRLPYPPGPPGYPLIGHLSGPEGPGG RSWVTFRDWSLQYGSDVIHLNMAGTHLIVLNTLGACSDLLEKRSTIYSDR (2) TVCSISYEL (2) CGLGWSFGLQRYGPEWRDGRKCFESQFNAHAVRKYRPALSREVSRFLHNLCTD PVAWEYHVHHMAGALIMSVGYAIDVQAKDDPYLNAAEHAGECVQKTLVPGAF LVDILPFLKYLPDWFPGVGFKQKARSWRKSIMYIRDAPYDVTKKRVVSAVVPDC VAKDLMEKMVNNAKDPVYMERVARSAVGSMYL (1) AGADT (0) THSVLSACVLTLVLNPNFLPRAQASIDEVCQGRLPDFSDYEALPHVHAIVREAMR WNPVVALNLPHRCTTDDIYEGHLIPAGSIVIANIWAILHDPAVYPDPESCNPMRYL RCSPDGTVTLDPAVPNPADVAFGFGRRICPGRFMGYQTVWLALARMLAAFDIQC ATDADGVPIVPRGEYDRPKPFECSIKPRSAAHAALVMQDVDAGEL >CYP5144H1 ug.119.22.1, gww2.5.193.1 REVISED AT FGFTMLREL, FGF END DOES NOT HAVE AN AG BOUNDARY POSSIBLE FRAMESHIFT IN THIS REGION MIGHT CREATE PRLTMLREL COMPARE TO LENTINULA EST EB011290.1 WITH PRMTMLNEL REVISED MICRO EXON REGION AT I-HELIX MASTSQTAFNAALGAACLVYLFAKIYWFVAAGRNLHGLKKLPGPRGWPFVGYLKAAE RPAWLTYWRWSDEYDSDVVTFDVLGTTVVILNSLKAATELLEARSAIYSDR (2) FGFTMLREL (2) VGFDWNITVSEYGPYWRDSRRAFAHAFHPHAVARYRPAELKATHQFLRDLLNE PEDFHGRIRYLAGRQILHIAYGLEIRDRADPWITAAEHGVEIAVKCIIPGSYLVDLI PILKYVPEWFPGAGFKRQARIWKKEVTSIADAPLAAMEACDSLPDDSAAKPLLER MLDSPDDPAYARHVLRGTLASMYI (1) AGADT (0) TTSTLATFFHALSRHPAVLYEAQRAVDRVCAGRLPTFADYDALPYIHALLRECLR WRPVVPLNFAHRASKEDVYEGYRIPAGALVLANNWAIMHDPAAYTDPEDFNPR RFLRARAGAGANGSGADDSLELDPGVRDPGVAAFGFGRRACPGRYMAYESLWI VMASVLTVFDVLPAEGDEGEVEYTDGFLSTPKPFRCTIRPRSAAHAKLVYLALEL DA >CYP5144J1 genewise.24.21.1 = scaf252b = pc.24.20.1 = 
genewise2nd.24.19.1 46% to CYP205b (CYP5144A2), revised at micro exons 8/16/2007 added N-term MWVVTCTCAAILYMVLVGLRKRHRFPPGPKGLPLIGNVLNAPAGSSAP MVYQQWSKLFGSNIIHLKIFGTHFFVLNDAKTASDLLEKRSANYSGR (2) SQTVMLFDL(2) TGWDRDWGLLDYGDSWKKHRHVHHRYFHPKVLEAYHPRMEKGVQMLLQLLH RSPADFNAHLRF 2537737 MTGHIIISIVYGIETKSADHPYIGLAEEGVKAFSATAVPGRFL (1) 2537865 VDSLPF (1) 2538014 LKHIPAWFPGADFKRQAALWKKDVDAMYETPFNDIKAAI 2538130 RRGETNTSIVGAALAELEGQADTEEAENIIMNVAGTAYPTASDTTIITLEFFILAML QHPEVQRRAQADLERVVGNARLPSIHDQALLPYITAIMHETLRWRPPFRTVSLPR KSLHDDEYEGYHIPAGSIMIANEWAILHDEARYPNPEEFDPSRFLNTDGSIDHTVP EPVEPFGHGRRLCPGRHFAMDVIWLTIANILHVYSIEKAVDQSGNVVEPSGKCID GLLSAPEPFQAVFRPRSDAAIALLRSID >CYP5144-un1 PFF_45b pseudogene PGPQGLPILRSLFHAPAQFQQLAFQDWGHRYIVLNCAQPASDLSDNRSTVYSDKG GWDRNMGPLAHSAYWREHRRLAPHHFKDHVR FTTAATPGMYL (1) VDSFPM (1) LRHIPAWLPGAQFKRAAARWRSIVEEVFDSGNPEPCLAACLPTSHRDREDTLMLE NVVINTSGTAYAAASGTTTTTLLAFILTTMLYPDVQKAVRQELDSVVGQGRLPE MADRNALSSIPALMKECLRWRPPLPLGVPHRSTAEDERGRRIQVPAGSTAIGNA WXXXXXXXXYSDPEAVKPWRFFDAAGFGYGRRVCPGRHCLLDFVWLATANVL VVCSIEEPFDGQWDTVEPSEQHTTGTIIFLAPFEAVFKXXXXAHVECVR >CYP5145A1 ug.82.21.1 ESTs = DV765434.1, DV765432.1, DV759462.1, DV757969.1 MFYPPLVAVDVVFALLALYLIVRFLQKDRTLPFPPGPKPLPLIGNLLDMPSTYQW VTFADWHDRYGDISSVTVLGQRIVILNSLDAAVELLEKRSAIYSSRPYMRMAGEI LLWARTLVLSTYPGELFRDIRRFLHRYIGSRGQLERVAPFYELIETSTQDFLQRTL ADPVRFVEHIRKNAGAIILNMTYGYKVQEGYDPLVDLVDRAVDGFVAASTPGSY YVDIFPALQWIPSWFPGAGWKRRAEAWRADTQAMCDVPFEFAKQERLHGDNNS KNFVSDNLASMETAQQEHHLKMAAGSLYSGGADTTVSAITTFFLAMTLYPEVQ KRAQEELDAVIGTDRLPTLDDRERLPYTRALVSEVLRWNPIGPLGVPHVSTEDDV YRGYFLPKGSMFIANIWCVPPSTYTYSEPLRFKPERYLGEQPEMDPRCAVFGFGR RICPGACLNLAEASIFAVSAMALAVFDISKAVEDGVEITPKVEYTTGTISHPQPFK CSIKPRSKKAEELIRG >CYP5145A2 pc.82.61.1 (genewise2nd.82.23.1) revised near RYG 8/16/2007 MPSTHITALDIAVAIFALYIIRRLLQRGRTLPLPPGPRPLPLIGNLLDAPSAYHWETF AEWNTRYGDVSSITILGQRMVILNSLDAAIDLLEKKSSIYSDRPVMPMAGEILLW SQTLVLSPYPSDRFRDIRRYLHRYVGSRGQLERVAQTHQLIEDETRGFLQQTLRN PLQFIAHIRKTAGAIILNMGYGYQVKEGHDPLVDLVDRAVNGFVAASTPGSFLVD 
IIPALRWVPAWFPGAGWKRKALTWRADTRATCDVPFEFAKQEALRGSTSNNFVS ANIQDIENAEQEYHLKMAAASLYSGGADTTVSAITTFFLAMTLYPEVQKRAQQE LDAVLGAGRLPTLDDREQLPYTRALVSEVFRWNPIGPLGVPHVSTEDDEYRGYF LPKGSIFIANIWYILRDPHTYTEPLRFKPERFLGEQPEQDPRAAVFGFGRRICPGAC NARVA >CYP5145A3 pc.11.258.1 scaf5 partial seq NOT ON TREE 54% to gx.82.23.1 (5145A2) MQACLLEVFAHFPPVMAVLLVVLLLILTLFVLVTRKSSRHYPPGPRPLPLLGNIHN APAQRQWKTFAAWKSTYGDVISLTIFGQRIVVLNSLESAIDLLEKRSAVYSDRPR MVMVGELLGWAQQLVFAPYGEHFRNMRKILHKYLGARGQLDKIEPYHEIIEAAT AKFLVRALRDDSDFHLEHNVHMTSGTINLRIGYGHNIAEGDDELVQMMDDALV GFNRAAVPGAFLVDIIPALKWVPTWCPGTSWKRQAQEWKDLFVRMTEGPYAM AQQQAERGGHDNIVSMSLSEDMSPEDHYDLKMAVGSLYGGGTDTTVAIILSFIL AMTLHPEAQKKAQKEIDALTNGERLPVIRDREELPYVRALISEVLRWNPLVPLGV PHRAIADDVYRGYFIPEGSTIVVNMWQLAQDPEVYADPEVFKPERFLGPDSERDI RTFVFGFGRRICPGLNLAEASTFAICARILAVFEIGKVVEDGKRITPDFAFRDGAV RFV >CYP5146A1 pc.59.8.1 , fgenesh1_pg.C_scaffold_1000663 [Phchr1:663] Phchr1/scaffold_1:2066931-2069520 MDPATVAVAVVCALAVMHVLTRRARTRLPYPPGPPEDPIIGHLRQMPNNDEAAE VWYRWAKQYGDVMSLNVLGKRLVILSSEEAATELLEKRSSKYADRPRFPIFERIG WKDMVLLMPYGPYHKTLRKMIQVPFEKDKAFQFRDIQERATSIMLHNFLADPK GIEHHTHCRYVVSIIVEIVFGHRILSEDDEHLKIADVFVKIQHEASQPSLLDVSPLF AKLPSWFPGAWFVKYIEDTKAILSHAIHHPVSIVQEQLASGIAKPSFVADELERLI KAGQLTPQNKYDVSIAAHMIFGGGTETTWNTLTTFIACMLLNPEAQRKAQEEID KVVGHGRLPDFTDRDSLPYVECVVKETMRWHPVAPVAVPHKATEDDVYRGMY IPKGAIVIANARSITWDERRFHDPHAFKPERFLPRPLGAGEDFVQGAVYGWGRRI CPGRHLAGDMVWIAIARVLAVFDIQKARDADGNAIEPNIEFTTAVHPKPFPCELR PRSEKAASLIKESYELHSID >CYP5146A2 pc.59.23.1(gw.59.18.1) gww2.1.504.1 [Phchr1:38849] Phchr1/scaffold_1:2095089-2097090 adjacent to 5146A4 note by a typo this was called 5148A2, but it is really 5146A2 MGFLTALLLVFVLLCVALVRAVRRRRARPPYPPGPPADPLIGHIRIMPSTDNAHE VFHDWAQQYGDVMSLDVLGTRYVILNSAEAATDLLEKRNSKYADRPTFPMYER VGWKDTLVFLPYGPYFRKQRKMLQLPLEKERVTDFRHIEEQETCVMLYNILSDP DNTDAFVHRRYTTAVTMELAYGHRVVSNDDEYLKAADMVIDVLRSVTRPSLLD VSPIFEYLPAWFPGAWFVKCIKEIKPVVLREIQHPVSVVQQELMAGTAKPSFVSQ QLEDLSRENGLSQEDLYTVSMVAHQIFGGSETGWHTIMTFIACMLTNPDVQRKG QEELDSVVGRGRLPDFTDRDSLPYIDCIVKETMRWQPVVPLSVPHKAMEDDEYR 
GMHIPKGATIIPNARGITWDERHFHEPRTYKPERFLPRPQGAGEVFPQGAVFGWG RRLCPGRYLADDVVWLAIARILAVFDIQKAVDADGNVVEPHIEFTTVLTSHPKPF PCSLRPRSEKAAELVRQAYDMHMANVAV >CYP5146A3 pc.59.24.1(ug.59.33.1) gww2.1.413.1 [Phchr1:38101] Phchr1/scaffold_1:2097942-2100000 MGFLTALLLALVLLCAIWVRAVRRRRGRLPYPPGPPADPLIGHIRIMPSTDVAHE VFHGWAQQYGDVMSFSVLGTRYVILNSAEAATDLLEKRNSKYADRPTFPMYER VGWKDALVFLPYGSYFRKQRKMVQLPFEKEKVTDFRHIEEQESCVMLYNIFSDP DNRDAFVHRRYTTGVTMELTYGHRVVSDEDEYLKAADMIIDVLRSVTRPSLLDV SPLFEKLPAWFPGAWFVKCIKKTKPVVLREIQRPVSVVQQKLMAGTAKSSFVSQ HLEELSREKGLSEEDLYTVSMAAHQVFGGTETAWHTIMTFIACMLTNPDVQRKG QEELDRVVGSGRLPDFTDRDSLPYVDCIVKETMRWQPVVPLSVPHRAMEDDEY RGMYIPKGATIIPNARGITWDERHFHEPRTFKPERFLPMPQGAGEVFPQSAVYGW GRRICPGRYFADDMVWLAVARILAVFDIRKAVDADGNVVEPRIEFATVLSHPKPF PCSLQPRSEKAAELIRQAYEMHMANVEA >CYP5146A4 genewise.59.20.1 e_gww2.1.326.1 [Phchr1:133267] Phchr1/scaffold_1:2102681-2104782 MGPTAFVAILLCAVLLVQAVRRRRTPLPYPPGPPADPLIGHLRIMPDTSTAPEVW HSWSRKYGDVMSLSILGKRVVILNSEEAVTELFEKRGAKYADRPSYPLYERVGW KDALILLPYGTYYRKLRKMLQLPFEKDKAPNYRHIQEQEACVLLHNFLRDPSSVE SPIHRRYTAAIIIEIAFGHRVLSDDDEHLKAADMFVEVQHGAGRPSLLDVSPIFEKL PSWFPGAWHVRYIKEKRPMILHAIQHPVSVIRQQLVDGTANPSFVSQQLNDLIRE GGLTPENQYDLSIVAHMIFGGGSETTWNTLTTFIACMLMNPEVQRKGQEELDRV VGRGRLPDFTDRDSLPYIECVMKETMRWHPVAPLAMPRRAIEDDEYHGMYIPKG AMVIANISLTWDERRFHDARSFKPERFLPKPEGAGEVLPPSFAFGWGRRICPGRY LADDVVWIAVARILATLDIRKPKAADGSIIEPRIEFEAALTSHPKPFPCEIRPRSDK AAELIKQAYDMHMASVET >CYP5146B1 pc.59.19.1 , e_gww2.1.427.1 [Phchr1:132579] Phchr1/scaffold_1:2085390-2087317 MLVLLGFVSLVLFLVYRRSVRASRGRLPPGPPADPIIGHMRVFPRANHGEVFHQ WSKQYGDVLHLDVLGKSIIVLNSQEAANDLLDKRSANYSDRPEFPAFNLLGWDS MLVFLRYGPAFLRQRRLMQQPLTRTGVVVFRPVQLQQCHVLLKNLLASPKDFD AHLRRRFASAITLEMTYGHKVSSDDDAYLDIADKVNVVLTKMSKAAILDLFPRA KHLPSWFPGAWFIRYANDHRHLIWEMASKPFEQVEQQLAAGTAQPSFVSMHLEE MHRQNTHDADNVSALKTAAAHMWTGGEETSTLLIFVLAAVRNRDAVRRAQAE LDRVLGPGRLPTFEDMDALPYVEAFIKETIRFHSALPLGIPHRAMADDVYRDMLI PKDATVLVNSTALARDPAAYSTPERFWPERFLPPHSEPPPVGLGFGWGRRVCPGR HLAEASLWIVVASMLAAFDIAPVQDAHGRDAPPELRFTQAITFSEDSHPEPFECSI TPRSEKVAQLIMQL >CYP5146C1 
pc.175.6.1 (genewise2nd.175.5.1) scaffold_6:685560-687500 region MMPYILGVTLLLLTVIVLRVLRARASRAAPYPPGPPAYPVIGSVGPFPAHEPHLGL AELAKKYGDVMYFEIFGKPLVVLSSLEAASDLLEKRSAIYSSRPRFAVHEMIGWT DMVSFLPYGEQFNKQRKFFLHTFSKQGCLVFRSSQVAQTHLLLKNILQCPTRYIE YLRRFSTAVIMEIAYGHKVSSEDDPYVKIAEDTNNVLMAAGHSLALVDFLPWLR HLPAWFPGNWFARVAQESRPVIQRMRNFPFDQVVQQMASGTASPSFVSMQIEEL ERDGGASPENLHILKIAASQMYGAGAETTWSTIMNVIAFLLLHPTAQRKAQDEL DSVLRGERMPDFDDRKSLPYLDALLLEVMRLQPTAPLGVPHSSTTDDIYRGMFIP KDSIVLTNTTYALAMDDRVYRNPTEFRPERFLPPHAEPNPNGIVFGWGRRICPGR YLADTSVWIVMASFLTVFEIVPERDRNGQDIIPEIQWCSAPFPCVIRPRSEKEVKLV SRL >CYP5147A1 ug.50.27.1 MPLLLVDIAALVAGLVLLFMLDSWRRKGQHLPPGPPGLPFLGNILQIPRKKEFIVF RDLGNIYGDIVTLRVPGQIFVILNSRKAVADLLDARSQIYSDRPRTIMCKDLIGWD GSVVLSNNTPRFRDCRKLLRKGLGPSAVQSFIPFLNRQSAFYLENLQKRPEAFVDI FKRNAAAISMKIAYGYDGIQDDEELYGIGAMANHYFAETAVVGVWPVDMLPIL RHVPQWFPFAYFKKYAAQAKPIVLESVNRPFEETKRHMKRGTAGGSFTSMLLED AKGDPETEDCIKWSGTGIFLGQMDTTTSALSWFFLSMVLHPEVQAKAQAEIDKV VGNERLPRFEDKESLPYVSAVMQEVFRWHPVVPMIPHALSKDDEYRGYFIPAKT SLIGNIWAIMHDESLYPDAEDFRPERFTEDGAPDCLNVAFGFGRRVCPGILIAQAH VFVSIATTLATFNITKARDAQGNIIEPVVEDTPGAINFPQPFKVSLEPRSAAAADLI RRSAEHSKTLPERLEIFSLDA >CYP5147A2 ug.50.57.1 this seq matches an unannotated region scaf 50 178-179kb ug.50-57 is from the first 5 kb of this scaffold, about 170 kb away from this seq. this matches my scaffold_15f. The ug # is wrong. 
MLDVGATAAAGLLLVFLLGLGLMKTQHLPPGPRGLPLLGNVLQIPRRLPHVAFR DMGHKYGGDIVTLRVPGYNLLVLNSRQAIYDLLDSRSAVYSDRPQGTIYRKLLR KGLGASAVQSFIPFLNRQSALYLENLQSRPEAFVEITK RNAAAISMKIAYGYDGIADDEELYRLAHQTTIYFAETAVLGAWPVDMFPILRFIP SWFPLAYFRRYAARARPVVVECINKPFEETKRHMRLGSAGASFTSMLLGDANGD PDTEDYIKWSSAAIFLGQMDTTTAVLSWFYLAMALHPEVQAKAQAEIDQVVGN ERLPHIEDRDSLPYVCAVMREVFRWHPVANLVPHATDKDDHYRDYFVPAQTVA IANVWAVLHDEDVYHDADKFIPERFSEEGAPDSLEIAFGFGRRACPGKVVGQAH VFASIATVLATFNITKARDAQGN VIEPEVMDTPGAVNTPQPFKVNIEPRSEAAVDLIRRSAEHSRTLPERLEIFSLDA >CYP5147A3 pc.16.140.1 MLLFAAVSTVLALLLASVLTKARRKARNLPPGPKPLPFIGNAHQIPPENEWIKFKE WGDEYGDLVKIKIPGSMLYIVNKRKVVDELFEARSAVYCSRPNFIMASLSGWDH SIPTLPYGQRLRESRKLLKKGTSPAAVKTYHPYINRDLPFFLENMLSTPDKFVEHY NRNAARIALKIAYGYEGITEDERIIQGGVKAMEVFSATAVPGVWAVDTFPFLRHL PSWAPFSSFKGFAERCKRITDEALNTPFYEVKQRLEKGTADGSFTSVMLSTEKLD PETEEIIKWCATGIFTGQFDTTTATLSWFTMAMAKYPDVQEKAQAEIDRVVGRD RLPEVGDRDSLPYTWAILQETMRWHPTVALVPHTAIQDDTYGGYFVPAGTTVIA NVWAMMHDENVYHDADKFMPERFYEEGAPDSLSVVFGFGRRICPGLVVAQTH MFVTIASILATLNISKARDNAGNVIEPREDAKSGVINFPKPFQVSITPRSDAAVQLI RRSVEHSKTLPDKLELFSP >CYP5147A4 pc.16.141.1 IFTAVLAVTVVSLLTRRKNGRHVPPGPKPLPFIGNALQIPPQHEWIKFKEWYGPRL RESRKMLKKGMGPAAVKTYYPYINREVPFFLENMLRKPDSFVEHLKIAYGYEGV TEDEKIIRGGVDAMEVFSAVAAPGVWVVKYVPSWFPFAKFKKFAERGKKITDEA LDTPFYEVKRRVTTATLSWFTLAMAKYPEVQKKAQAEIDRVIGKDRLPEVGDRD SLPYVWAIMQETFRWHPTITMSGYVPHTAIQDDEYRGYFIPSGTTLMANIWRERR SWPISGACPLRLMFEVIYHVHRGILHDEKLYHDADKFIPERFCDEGAPDSLSVAFG FGRRRRVCPGLVIAQTHVFVTMASMLATFNITKARDSTGAVIEPREDAKSGVIPK PFVVSITPRSDEAVTLIRRSV >CYP5147B1 pc.50.95.1 MLLQVLAAIAALFVLSGLLNSRRRNMHVPPGPKALPLLGNVLDIPKKDVHVTFR DWANIYGKDLMKVEMPGETLYVISNKKVMVDLFEARSAIYSSKPTMTMADSSG FKNSIPLLPYNARLKTSRRLLKQGLSPAAVRSYFPYINNRTALFLEALLKDPDDFV RHFTRTAAHTALKIAYGYEGVTEDHHLLHTAIETMEIFATVVNPGRWLVDTLPIL DRIPVSFPFANFKRVQEESRPVVFETVSKPFEEVKKHLAEGTADGSFSSYLLQSEK PDPETEDCIRWAATSIFLGQFDTTTATLSWFTHAMVKFPEVQKKAQEEIDRVIGN DRLPEIQDRDSLPYVNAIMKEIFRWQPIISMLPRSVVQDDEYNGYFVPAGTYLLA NIWAVLHDPEVYPEPEKFMPERHLKEGVPNPLDVTFGFGRRVCPGMQVAQSQTF 
GTMAAMLATLHLKPQKDEHGRDIIPETRTVDGLIRFPVPFKCAFVPRSEAALKLIE RGAEHARSVPDRLERWSD >CYP5147C1 pc.50.96.1 MVQASDTLLSVVVCAALITLTSFLLSGYRKRNAHLPPGPKPLPIIGNLHQLPDLKA DRAVAFRDMSLAYGSDILCVKVPGMLMYILNSKESMFDILVTRSAKSSSKPPQV MADEXLSGWKYTVPSLPYGQRIKTSRRLLHKGLGPSAVQSYIPYLERESAFFLEK LLDQQDAYKKHVTHTAARIALKIAYGYEGVTDDAHLIDTAVKAMNIFCVTATPG IWLVDSLPFLQHMPSWFPGTGFKKQASQWSQTVLYAINHPFEELKRQMAAGTA GASFAGRLLEVEDLSDPEVEDCIKHCSTGIFAGQFDTTTAVMSWFAVVMAFYPEI QKKAQDEINKVVGHERMPVVADRDSLPYVNAILKELLRWRPVLPLIAHSVNEED EYKGYYVPKDTVILANVWAVLHDESNYDEPEKYKPERFLRDGILDPSVLDPATL AFGFGRRICPGMHIGQTLLFILMSRTLQNFDIAPAKDAHGREIAIDTSAVPGLIGFP KPFKVSLVPRSNAHATHIRHAAEHARSLPDKLAIFEL >CYP5148A1 pc.79.37.1 RWAKTFGPLYSMWIGNQLFVVISDPQIVKDLVITNGAIFSSRKDMYVKSQIIFRGR GITTTPYGDTWRKHRRLASQFLGNRVVSGYLSGLEYEVQDMLCGLLTDGQAGF VPVSPQAYLGRLALNNIMTIVFGTRTGSIDDPFIHHWLTLSREFMNCTGPVSNWV DFVPFXCMAKFLLDVKDKERLDDLDIILLCCGFLVGGVESTAAIKQWFAAHISVL PEVQAQAQMELDRVVGRDRLPQAEDAKDLPYVRAIVKEIERVHNPFWLGTPHM STEDFSYRGYKIPKDTAVILNTYTMHHDSQRYPNPEKFDPDRYIDDERSSAESAK LADPYQRDHWTFGAGRRICPAIALAEHEIFLSVAGLLWAFDMRQLSDAPIDLKEY DGLSGRSPVPFCIRLVPRHERVAAVVGASCHTATCRNG >CYP5148A2 ug.50.57.1|whiterot1 not same as Yadav’s = pc.50.1.1 + pc.50.2.1 (This is the correct ug number) this whole seq not in Yadavs collection Note: a typo named another seq 5148A2, but that seq was really 5146A2. 
This is the correct 5148A2 seq TFVLLVAGILYIVLPFFFRKNLVDKNGNSIPPGPLLRLPYLPDYPERTLHAWAQKFGP LYSFFIGNQLYVVVSDANVARE LLVNNGAIFSSRKQYFTKNQTILRGRAITASPYGETWRQHRKIAAQLLTPKAIQSYNN VLDYEARIMIRSMYKESMQGAV PINPAHYTGRYTLNNMLTISFAMRTESTQDPLIQRILAMAMEFNDLTGPFSNLVDFIE PLQWLPTKTHARAAKLHDDFIE VYGSMVMAVKERMDAGENVPHCLAKVLIEGQQQEKLDWEDVCMLSAAFALGGVHSVSG LRWFLALIGKHPDIQARAHHEL DAVVGRDRWPMAEDEKDLPFIRAIIKEVLRVHAPFWNATPHSSTEDFVYNGMYIPKGA AVILNCFTLHHNEARYPDPYVF PHCASYCLCANAMERDHWSFGAGRRICPGINVAERILFLAISRLLWAFTVH >CYP5149A1 gx.62.25.1 MORE COMPLETE VERSION OF pc.62.64.1 SEAYLPPGPPALPFVGNLFQLPRKSVPRTFATMSQQYGPLYY MRVINRHFVIVNDLDLARILFDKRGAIYSHRPRLPMAQEVVKRDTMLFMNYGPE FRKSRKLVSTFLNQRNASKYWPAQEIESLKFVLAVQRNPSDWLKLTRWTATSLV IRLLYGIEVQDKDDALVGLAEDFARLTTETTEPGRWLVDAFPILRHVPAWLPGAG FKRWAKRAKARMDEFATLPYVMAKDKIEKGDITPCWTAEKLLETTEPLTEQDE KEIRHTATSMYSGTNAMVATFILLMLHYPEVQKKAQEEIDSVTGGTWVPGMRD RERFPYINCLVKELFRFSPAVPLVPHSLHEDDVVEGYLIPKGSWVMANMWAFMH DEARYPDPETFTPERFEARPGVEPQDDPLDIVFGFGRRACPGYLLGVASVYLNIV HLLFAFDIAPVKDAAGASVLPPIEFSDGHVA HAKPFECDMRERSAERIALIEHTA >CYP5150A1 pc.24.95.1 red is new POSSIBLE GC BOUNDARY AT ETMRL scaffold_1 (3273479 bp) : 2653416:2656338 (2923 bp) MAFSLPLISATVLLWVLWKVFRNYVVSSPLDNIPGPPRSSFWS (1) GDSAIMYQRHGWSFHDNISEKYGPVSTTHTLLG (0) ARGLYVYDQPKALNSIMITDQDSYEEPAWLLQ (2) SSHAIFGPAVFIAQ (1) GEQHRRQRRVLNPVFSGAHMRHMAPVFYDVAHR (0) LRx (frameshift) AVSAQIHDPASSEIDVLEWTSRAALELIGQGGLGCSLDPL VADSNNDFGTALKTLI (2) PLISGLHFYRMLMPYITPFVPLSVRRLFMRWVPHKNAQ QLRAATEDLWALSRQIYEEKLAAVAKGDNDEVFEGQDLISIL (1) IRSNTAAAAEDRLPEEEIIAQVA (2) GLILAATDTTSSALARILHILAERQDIQDKVRAELVEAAGEGEDI PYDQLVNLPWLDAICRETMRL (2) HPPANIVNRE (2) ARTDVIMPLSEPVQGRDGTLIHEIAVPKGTLVTISVRGCN RNKAIWGEDALEWKPERWLKSLPETVSGAHIPGVYAH (2) MTFIGGARACL (2) GFRFAQLEM (1) KVVLAVMLRSFRFQLSDKEVYWNFAGVVFPSIGRDGKTASMPMKIETL* >CYP5150A2 fgenesh1_pg.C_scaffold_2000178 [Phchr1:1200] = gx.121.14.1 LOCATED ON VER 2 SCAFF 2, 527-529KB 58% TO GW.95.20.1, 53% TO PC.24.95.1, 51% TO PC.10.108.1 55% to 153.5, 49% to 66.11 
MTSAPLLAFGAALIAIIWTLFQGYLVKSPLDNIPGPERSSFWLGNHGDIFNRHAWNFH DRARQLFGPVFR FWGPFASRGLFVYDPKALNSIIVKDQLIYEESRWFISWNKYAFGLGLLSTLGEHHRKQ RKLLNPVFSINH MRHMAPIFYQTTHRLRTAITAELEASSADVDVLNWMGRLALELIGQGGLGYSFDTLVA HTHNEFGDAIKG YVPAILNVIILRNIIYPYMDEYIPAKVRRFILDILPFRSVRRIQKIIDDMHSHSRRIF NEKKAALEKGDG AVLHQVGEGKDIMSILLKANMEASDEDRLPEDELIGQMT (2) TLVFAATDTTSNALSRILELLAKNQ DVQDKMRTELIAASPDGEDIPYDTLVALPYMDAVCRETLRLHPPVNMMSRETREDVMM PLSEPIQGVNGE TISEIFVPKDTSVIVSIRACNRNKAIWGEDADEWKPERWLSPLPEAVGNAHVPGVYSH LMTFLGGGRACM (2) GFKFSQLEM (1) KVVLAVMLRTFKFFPGKNEIYWNMGGVNYPTAGKDSNKACMYLRLERIAS* >CYP5150A3 pc.153.5.1 LOCATED ON VER 2 SCAFF 2 scaffold_2 (3043971 bp) : 1623404:1625693 (-) strand (whole gene range) 57% to 95.20, 56% to 121.14, 51% to 10.108, 51% to 24.95, 47% to 66.11 MVPPSTFVLCALGGWVFWKLFRGYFTRSPLDNIPGPRRASLLK (1) GNAHQLFNRHAWGFHERISQEYGQVVKFHAPFG (0) GRGLYVFDPKALYHMIVKDIATFDEPRWFLQ (2) MADFTFGPGLFSTS (1) GQQHRKHRRVLNPVFSINHMRNLAPLFYTVAHRLRDGLSTQLTTSTGGE VEILGWMGRAALELVGQGGFCHSFDQLDKNVPNAYRDVLKEVM (2) PSQIALHFWRILLPYAVEYVPARIRRFLAPW LPHPVMQKYRNICITMDEQARAI YHAKKVALEQGDKLVEHQACEGRDILSVL (1) VTANKQESVEDRLSEEEVIALIS (2) TLAFFAATDTTSNAMARILHLLAEHQHVQDKMRLELFEAGM DGEDIPYDRLVELPYLDAVCRETLRL (2) YPPVLFMNRE (2) TRQDAVLPLSKPIQGLDGDVLTEIMVPKGTLLMVSIVACNRNKALWGEDVLEWKPERW LSPLPESIREAHVPGVYSHL (2) LTFLGGGRACM (2) GSKFAQLEM (1) KVLLCTLLRSYRFMPGTKGVYWNVGAISYPTPNEIDEKAAMYLTMDHIQAPQ* >CYP5150A4 genewise.95.20.1 57% TO pc.153.5.1, 55% TO pc.10.108.1, 54% TO pc.24.95.1, 51% TO gx.66.11.1, 58% TO gx.121.14.1, fgenesh1_pg.C_scaffold_2000729 [Phchr1:1751] = gw.95.20.1 scaffold_2(3043971 bp):2185130:2187482 (2353 bp) whole gene range MATTTHGLALYALCGLCVWALWRVLRAYVVKSPLDNVPGPERTSFLK (1) GNTHQIFSRHGLDYLQELGERYGQVVRYYAPLG (0) ARGLYVFDPKALNHIVVKDQAIYEEPRWFIR (2) LNRLLFGPGLLSTL (1) GDHHRKQRKLLNPVFSINHMRHMTPIFYNVVHD (0) LRDAVAEQVKDTPT EVNVLDWMLRTALELVGQGGLGYSFDALSAQKRNVYGEALKELL (2) PTVFALHFWRVLLPYVGAVVPAWVCRAAAPFLPHAAMQKLRRVV GAMDAHSRRIYEMKKGLLEKGDAAVVHQVSEGRDILSIL (1) MKANREVDEEDRLPEDEIIAQMS (2) 
TLVFAATDTTSNALARIFQLLAEHPDVQDRLRAELT DAAPDGADIPYDALVVLPYLDAVCRETLRL (2) HPPASFMNRE (2) ARADAVLPLAEPLRGADGAPITEIAVPRGTPLIIAIRASNRN SALWGADALAWRPERWLAPLPDALAKAHVPGIYANL (2) MTFLGGGRACM (2) GFKFSQLEM (1) KVVLAVMLRSFRFLPGDKEIYWNLAPVAYPTVG KTSTKSELYLKLEPLKT* >CYP5150A5 AADS01000067.1 = pc.10.108.1 54% TO pc.24.95.1 scaffold_8 (1906386 bp) : 331157:333480 (2324 bp) whole gene range MGPLQLVLVATAA 5407 WALWRLFRHYILRSPLDNIPGPASSSFIY (1) 5499 5550 GNLKEMFNRHGWGFHDTIIRQYGPIATVHSMLG (0) 5648 5713 ARALYVYDPKALNHIILKDQYTYDEPGWFLE (2) 5787 WHRMIFGPTLIATT (1) 5936 GAHHRKQRKLLNAIFSIARMRDTAPIFYNVAHR (0) LRDAISADLN 6116 QGSGEINMIEWFSRAALELAGRGGVAYSFDALEANSENSEF GMTVKQYR (2) 6345 PTTIALHFWRILSPYASLYIPRVVRLALGRLVPHKDLQ 6443 RIQMIAHAISAQSKKIYDFRMAAFQRGDEDAVREISEGRDVLSYI (1) IRAGLNSEEERLPEEEILAHMS 6752 SGLILGATDTTSNALARTFQLLAEHQDVQDKMRAELA DAAPDGEDIPYDQLVHLPLLDAVCRETLRL (2) NPPIGLLARE (2) 7091 ARDDIVLPFLQPVHGRDGSLINEVPVPKGSTVFISVRACN RNPLIWGEDAAEWKPERWLQPTPKSLSEARVPGVYANQ (2) 7312 7377 MTFLGGDRACL (2) GFKFSQLEM (1) KVVLAVLL 7556 7557 RSFRFLPCNKDVYWNLAGITYPTLGKDSDKLELPIKLEIIGGQY* 7676 >CYP5150B1 fgenesh1_pg.C_scaffold_2000954 [Phchr1:1976] = gx.66.11.1 LOCATED ON VER 2 SCAFF 2, 2927193 51% TO 95.20, 45% TO 24.95, 44% TO PC.10.108 49% to 121.14, 47% to 153.5 MSLFALAIPWVACGLVLMLVYRLVRDYVVSSSLEDVQGPTPRSLIY (1) GNLPELQNRGAWPFLDHLTNDYDRVVRMRGMFG (0) KRILWVADPKALHHIVVKDQDIYEEAPSAITF (2) GRKLGMGPGLLSTL (1) GDHHRRQRKLLNPVFSIAHLRRVTPVFYEVMNR (0) LSKGIEKQLDTTSNNEVDLLAWMGRAALELIGQGGFG HSFDPLVEQTPNPYADAVKSLV (2) PAITGLIFYRMVIHLVDPLVEACAAHPALGAFIKRWFW LVPNARMQHVKTIFDVLHDTSTAIYTEKKIALDSDDPELKMRVLEGRDLM (1) SVMLRENMNADAADRLPEREIIAQIT (2) TFIFAGTDTTSNALARILHLLCLHPDVQEKLRAEIIEARAQNGGGDLDYDALVALPYL EAVCRETLRL (2) YAPVPFVSRQ (2) ARSDALLPLSAPLTLRDGSRVSALHVPQGTSVLVAIHSVNRSAALWGADAHVWRPERW LEKLPDAVAEARVPGVYSNL (2) MTFIGGGRACM (2) GFKFSQLEM (1) EVVLATLLSSFRFSLCDGKNADIVWNRAGIAYPTVGNDGGHPSLPLRVERLKC* >CYP5150C1 EB077269.1 Trametes versicolor cDNA clone 52% TO CYP5150A2, 50% TO CYP5150B1 
SHVMAQVLQLLSEHPDAQAKVRREILEAGDGYIPYEKLHSLPYLDAICKETLRMYPPTPI VLREAFRDTTLPLSQPMRGSDGSMLSHIPIPKGTNVLVGVRACNRNKALWGEDAEEWKPE RWLAPLPKAVEDASIPGVYSNLMTFVGGGRSCVGFTFSQLEMKVVLSSLLANFTFQLSEK PIFWHISAITFPSAAKDSLKPEMWLKVGRYTGEAA* >CYP5151A1 pc.8.82.1 (name is not right pc.8.80.1 is correct (-) strand) C-term is 37% to CYP5033A1 Ustilago maydis 30% overall name error, CYP5144E1name assigned twice see pc.8.2.1 Phanerochaete chrysosporium cDNA clone DV765344 MESFTLAALSLTTACAAALVAYLLYLCV VYPLRNPIRQLPGPPSKWFLELRHMYMTMD (2) PRRSPHTAAEFVEKYGRNVYIRGPVPWDQRLFTLDPVTMNHVLQHT AIYEKPWPSRRLISGLIGAGMLSAEGQMHKRQRR VATPAFSLNEMRALIPLVFSKGT ELQKKWMEIMRDAGVKPGQ GHVVNVCSWA SRATFDVMGSAGFDYEFNAIQNEDNELLRAYVDMF ETAVSKQKAGLRSVLVMYLPIIDKIF (0) 159857 PNETTRFVSKCQTVIERVAGTLIQEKKRKMADAAVKGQVY 159738 159737 QGKDLLSLMRKY (1) 159702 159661 VKSNSAVDLPPDQRLS159572 DQDLLNNINTFMFAGTDTTSLALTWTLYVLALYPHVQDRLRAELLS VAPIAPIDTLTPEE VQSLYAEIAALPFLENVIRETLRLIPPVHSSIREATRDDVV PVSAPLKRTTPNGRVVEEQVHQIVVPKGTFIHVPIEGFNLDKGL WGETAWKFD (2) PDRWDNLPETIKELPGLYQHTLTFSAGPR (0) ACIGMRMSVIELKSFLFTLVTNFKFAPDPTQKIGKANV (2) ILTRPYVAGKQNEGSALPLIVTPYVREDAS* COMPARE TO >CYP5151A2 AACS01000049.1 Coprinopsis cinerea strain okayama7#130 3476 SCIGMRFSMIEIKTFLYILVTRFVFKPTKDKIIKSNV 3586 3642 VLTRPYISGKYREGSQLPLIVTPYI 3716 WHOLE PREDICTED SEQ 55% TO PHANEROCHAETE MNPILLSGLAAASTVLTWVVYRVVIEPRFNPLLKLAGPPSPGLF GTNLAPVLSPTVSPRLHEVYAENYGRSMRIRGVGPWDERLLTLDPVSVAYVLKNSTIY EKPWQSRALITSLIGCGMLAAEGQVHKRQRRVGTPAFSIQNLRGIVPLVFKKGTELKD KWMEMIETTGEVRGSDEKEKSMVVDVCHWVSRATFDVIGVAGFDYQFNAIQNESNELF NAYKEMFEIAISQGDGIITLISIYAPWIHKIFPNQVSRTVERCQEVIRRVAGQIIQEK KRKIAEGEASGKPYQGRDLLSLLLKSNVAVDLPEDQRISDEDILNNVNTFMFAGSDTS SLTLTWTLWLLANNPEIQDRLRAELLAAIPDTELTADISSLNEDEIQTLYGIIAELPL LNNVTRESIRLIPPVHSSIRVATQDDEIPTRYPVKLADGTIDTKQSVKIAKGSFVHVA VEGFNLDKEFWGADAWDFNPDRWDDQPETARQLPGLYNNTLTFSAGPR(0) SCIGMRFSMIEIKTFLYILVTRFVFKPTKDKIIKSNVVLTRPYISGKYRE GSQLPLIVTPYIPSAEH EB072414.1| TverSEQ10129 Trametes versicolor pBluescript (EcoRI-XhoI) Trametes versicolor cDNA clone 
TverSEQ10129, mRNA sequence. Length=897 PNELNAAFQEVFNPGANFTIFTILKNVFPALDIFPDERAKRLDHAQDVMRRIGLQLIEEK KAQIAREMSEGKSGGVERKDVQGRDLLTLLMKANMATDIPDNQRLSDEDVLAQVPTFLVA GHETTSTATMWCLYALTQAPDVQKKLRDELFTLQTEAPTMDELSSLPYLDAVVRETLRIH APVPTTMRVATKDDVIPVSEPFVDRRGKVQDSIHISKGSPIIIPVLSLNRSTELWGADAL EFRPERWINPPETISSIPGVWGHILSFLAGPRACIGYRFSLVEMKALLFELVRAFEFE Score = 169 bits (428), Expect = 9e-42 Identities = 89/197 (45%), Positives = 129/197 (65%), Gaps = 9/197 (4%) Frame = +3 Query 1 DQDLLNNINTFMFAGTDTTSLALTWTLYVLALYPHVQDRLRAELLSVQS---LYAEIAAL 57 D+D+L + TF+ AG +TTS A W LY L P VQ +LR EL ++Q+ E+++L Sbjct 321 DEDVLAQVPTFLVAGHETTSTATMWCLYALTQAPDVQKKLRDELFTLQTEAPTMDELSSL 500 Query 58 PFLENVIRETLRLIPPVHSSIREATRDDVVPVSAPLKRTTPNGRVVEEQVHQIVVPKGTF 117 P+L+ V+RETLR+ PV +++R AT+DDV+PVS P G+V ++ +H + KG+ Sbjct 501 PYLDAVVRETLRIHAPVPTTMRVATKDDVIPVSEPF--VDRRGKV-QDSIH---ISKGSP 662 Query 118 IHVPIEGFNLDKGLWGETAWKFDPDRWDNLPETIKELPGLYQHTLTFSAGPRACIGMRMS 177 I +P+ N LWG A +F P+RW N PETI +PG++ H L+F AGPRACIG R S Sbjct 663 IIIPVLSLNRSTELWGADALEFRPERWINPPETISSIPGVWGHILSFLAGPRACIGYRFS 842 Query 178 VIELKSFLFTLVTNFKF 194 ++E+K+ LF LV F+F Sbjct 843 LVEMKALLFELVRAFEF 893 >pc.8.80.1 [whiterot1:77774] MFAGTDTTSLALTWTLYVLALYPHVQDRLRAELLSVAPIAPIDTLTPEEVQSLYAEIA ALPFLENVIRET LRLIPPVHSSIREATRDDVV PVSAPLKRTTPNGRVVEEQVHQIVVPKGTFIHVPIEGFNLDKGL WGETAW KFDPDRWDNLPETIKELPGLYQHT LTFSAGPRVRSPSVCRLSQG* >CYP5152A1 pc.24.121.1 65% TO pc.24.126.1, 42% TO CYP530A1 N. 
crassa NOTE: 5144C1 IS AT 2402616-2402515 REGION SCAF_1 5144A6 IS AT 2479233-2479132 REGION (-) STRAND SCAF_1 THIS SEQ IS AT 2699101-2698997 (-) STRAND SCAF_1 pc.24.126.1 IS AT 2713991-2713887 REGION (-) STRAND SCAF_1 same as model fgenesh1_pg.C_scaffold_1000845 [Phchr1:845] N-term is in a seq gap possible frameshift after FSFK FKFAQLTEMYGPVFSFKQGTRVVCVVGRHQ (0) AAVEIMQKH GADLADRPRSIAAGELLSGGKRTLLVGAGDRLRKLRK (2) ALHSHLQPSVAVQYRPMQLKHALNVILDILRDPEHHIDHARR (2) YAASVVMT MTYGKTEPTYYTDPEVQEILLHGTRLGSVIPLDYHKVDRFPILKHVPFVTSTLRQWHKEELALFS DLVDGARARL (0) RDGAPPSFATYLIDQQQQFGLSDDEIAYLAGSMFGAGSDTSATAIAFVIMAAATHPKAQAEVQAQ LDSVVGRDR (1) VPSFDDESLLPLVTAFYLEAYRWRPVSYG (1) GFAHKATADIRW (0) GEYVIPADAIVIGNHWSIARDPDVFPEPEEFRPSRWLDESGKLREDLSSF NFGFGRR (2) VCVGQHVANN (2) SLFINTALILWAFSVGEDPAQPIDTMAFTDTANVRVHPFKAVYEPRIPRLREVVETYLD* >CYP5152A2 pc.24.126.1 35% to CYP5065A2 yellow IS AT 2713991-2713887 REGION (-) STRAND SCAF_1 possible GC boundary at PISWG Same as model fgenesh1_pg.C_scaffold_1000848 [Phchr1:848] MATGLLADVLARVQVPALALLFALLLLRAALRIVQRQRVPLPPGPPG RWFEPGPKAPLRYAELAKTYGPVFAFRRGGQLVCIINSYK (0) DAVEIMQKRGADLADRPDFIAAGDFLSGGMRTLLVGAGERVRRLRRC RALHSQLQPTAAVQHKPVQFRAALDLVLDVLHDPADHLNHTKR (2) FAASLILTMTYGKTTPTRYSDAEVREINVHTTRLGTVVPAGLHAVDRHPVLR HVPPATATLRRWHREELALFTRMVDGVRKDV (0) HVARPSFTTYLLEHQEEYGLSDDELAYLAGSMFGAGSDS (0) TATAISFVMMAAATHPQAQAQVQAQLDSVVGRDR (1) VPTFDDEKLLPLVVAFYLETFRWRPISWG (1) GFAHRATSDIVWNDYVIPAGATVFGNHWAIGHDETVFSDPDVFRPSRW LDEAGKLRDDISPFTYGFGRR (2) VCVGQHVANN (2) SLFINTALLLWAFNIREDPKVSIDTMGFTDSGTVRVLPFHV QFHPRIEHLREIVESSMPEDVYSAA* >CYP5153A1 gx.27.66.1 = pc.27.122 fgenesh1_pg.C_scaffold_1000958 Phchr1/scaffold_1:3068242-3069378 32% TO CYP5116B1 Aspergillus nidulans The unbroken reading frame starting with FNMYQ is preceded by a conserved non-P450 seq upstream that probably ends 460 bp before FNMYQ. The true P450 N-terminal must reside in between, or this is a pseudogene missing the true N-terminal. 
The two short exons shown below are my best guess for the N-terminal, but they do not resemble other P450s, and the total length is only 443 aa, a little short, so the pseudogene alternative may be correct. MVHDSGGAIFDTAGPALVR (1) RTITVYPSDSLVQRGIQTNLELILKHIYTCEVSPIHPHLAPLHQ (0) FNMYQTIVWYFTAVLFLLGAVVARARKRPVTSSLERIQLTDLRDIRELLAPISTSLPIL LEQRSVPNARLVRAFGITNTFVSSSVDVHATFSREARALIAGNDWDRFAQCAQLAVDK CIEERTDAGTVMRYDTLMQNAALLSILMGLFEVALDDVPIADLGVVARGINDLWKLSK TADTLPPHMLPEINTRLRAWLPAHQNPVDLIIPAFETMWRVVATTVAYTHADPLAHAV LETFLADPTSTAFASARSGTPSVDALVTEAIRLHPPTSRISRHVVSNSTKGAVLVADV GALHRDPTIWGADAEVFNPLRHQQRTPTQEKALLGFGAGRVMCVASRWAPHAAGIIVA SISERIGREIQVREGKAKGGRDWDGWFVECI* >AACS01000290.1 Coprinopsis cinerea strain okayama7#130, whole genome shotgun Score = 57.8 bits (138), Expect = 5e-06 Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 3/79 (3%) Frame = +1 Query 103 TDLRDIRELLAPISTSLPILLEQRSVPNARLVRAFGITNTFVSSSVDVHATFSREARALI 162 + L IR+L +P + LL R+ PN RLVRAFGITNTFVS VH +F AR+L+ Sbjct 167305 SSLGGIRQLFSPDGADVGTLLADRARPNQRLVRAFGITNTFVSPHPSVHRSFVTAARSLL 167484 Query 163 A---GNDWDRFAQCAQLAV 178 + W F + A+ Sbjct 167485 SRANKRGWGTFRDISTQAI 167541 >CYP5159A1 Coprinopsis cinerea strain okayama7#130, ACCESSION AACS01000290 REGION: 167200..168432 Yellow region not in pc model. look for it 39% TO CYP5153A1 MLDWQSTSIVIAAFLLASLLCLIAISLNQESAVGHSSLGGIRQL FSPDGADVGTLLADRARPNQRLVRAFGITNTFVSPHPSVHRSFVTAARSLLSRANKRG WGTFRDISTQAIHVELSLARSGSVNYGIFIQAVTLRVILVGLLGANVPMEDFSPDDIY TAASHINKLWSLSKDPSPIPSHLLPELNDALRRLLPDITTFPNPVDIVIPAWETFWRV VATTVAYSHNSKAITQLFLDFYAYPTDNAFREANADANISPKNVVEESMRLQPPSKHI ARKTIRPSLSKLPKPIANLLVRFLPRISWVKHYADVQAVLRSPAIWGSNSLEFNPWRH NQDPSSTLPSRAEALGYIFGGGNLRCIGSSWAPVAAAVVASAVFDAVDRGVCSIVPGR AVGGRNGWEDWSVTDTKN >CYP5154A1 PFF_258a (gx. 18.4.1) 63% TO PFF_258b (gx. 
18.4.1), C-term 38% to 5141A4 Scaf_6 ver2 (-) strand 503798-505808 505808 MQDLGYLPGLRSLVTPMSPLGFALPPSQYNPGRDWQW 505728 505655 VYRDAGTETISAIPYVFGPP (1) 505596 LAKQVVSTKGQ (2) 505458 ILRLLCSLWGPNIFAANGEEWRKHRRIINPAFSNAT (2) 505363 FASVWEQTSRVFDEMEVGEGWAGKHTVNLPVVNGLTNK (0) 505139 LALILISTCGFGNPLKWQFTNSASGGMSFEKALSIVSSNHI ALLLIPDWMYRLPNK (2) 504972 IRELKTAVDRMNLFMRELIEKRRAEMAQKAPERTDILSAMIK (2) 504735 LSWLSRFDNDCDKVGNTFLLLSAGH (1) 504661 504611 DTVAHTLDAAFALLALHQDFQEEVYQELLEVMPTEADFV 504494 504443 TYENSARLVKTRACFLEASRMFRI (2) 504372 SGFMLIRDTAEDVVLQNVGPNNDDVLPLKRGTRVVVDMIGL (1) 504143 HHNPRIFPDPEAFKPERWYNAHENDMSMFSFGARA (1) 504042 503989 CIGRKFAVAEGICFLAKLLRRWRVEPLAKEGETKEQWKQRVV RGVVVLNLGIGEVPVRLVRRNC* 503798 >CYP5154A2P PFF_258b (gx. 18.4.1) This one is looking like a pseudogene, no EXXR motif, No N-term Met Scaf_6 ver2 (+) strand 509278-511252 62% to PFF_258a in overlapping regions 509278 YLPGIRSLVSPMSALGASIPTSRWNPGRHWQW 509373 509448 VYRDAGTETIAAVPYLFGPPII 509513 509662 GPNIFAANGGEWKKHRRVINPAFSQET 509742 509963 LALILISTCGFSNPLSWQ 510016 510040 MPFADALWIVSTRIIARLLLPRWVYWLP 510123 510371 DYNMHQLGDTFLLLTAGHGT 510430 510478 MLALHPDFQEECYREILKVMHTNDDFV 510558 510606 TFGNSTHLIKTRSCFLEASSLYRM 510686 510727 AAGEILVQDVAEDTILRGAAPDGGDPPVPRETPIVVDMLGLR 510852 510906 DHNPKLYSDPEKFLPERWYNTHENDRTMFSIGAQA (1) 511010 511067 CLGRHFALVEGTCFLARLLRIWRVVPLLRPG ETVEQWRAKIGVEALFNFGIGNVPLKFVRR* 511252 >CYP5155A1 fgenesh1_pg.C_scaffold_7000377 [Phchr1:4564] Same gene as pc.81.7.1 whole seq is 40% to CYP5150A4 REMOVED POSSIBLE SHORT INTRON SEQ VKAAKNKYLEDGRSTASTWE MAVGLPEVVLAAIFAYLLYRETWGKKTALSDVAGPPRESWMK (1) GNTQRLFRDALDYNLWLSRTYGTAVKMYSLY (1) ALYLSDPLALHHVFVKDQNSFDVSDAFIH (2) NLLMFGEGLTGTL (1) GEQHKKQRKMLNPVFSVSNLRELLPVIQPIANKMASVFVEQIPAD (1) AREIDVMPWLSRGAQEYMSQACFGWTFNALDLNKRNTYSEAARKYT (2) PAALRVSWLRPYLPFIVRTIPLTLRTKMLDWYPGSDMKDFLYILDVM HQTSKRIFEQKKKALDSTVLEKADAETSERSEGDLGP (0) GKDIMSILLK ANASSNEADRMTDSEMIGQMSTLLFAGFETTTYAISRILWVLASHPDAQARIRSE(0) 
DVSLSYDDLMALPYLDAVIKETLRVYPPSSVHFRVALQNTTLPLQYPVKSVNDTPITTIPVEKGT QILVSIIASNHNTNVWGPDASEWKPERWLNSDDKAVPKATTDSAKYPGVYSGMMTFLG GPRGCIGFKFSE MEAKQVLATLLPRLHFALPSAVDEQGRRKEVYWMMSGPQIPVVRPPFGDGMTAQVPLD VRLVREEDFAYGFEDEDLIKL* >CYP5156A1 gx.17.28.1 34% TO CYP602B1 Fusarium graminearum scaffold_7 (2051558 bp) : 939436:940306 region SAME AS fgenesh1_pg.C_scaffold_7000309 [Phchr1:4496] N-TERM in model SEEMS TOO LONG. Remove it. 29% to 609A1 MYPLGLLTIVQHFETRDAALKLAIALPSIVLLALLFSWVSARKDQD EGPPFLPLSIWETVWPFFTSRHDFLRRGFELTRHPAFRFKLLQ (0) HTVVVVSQEHARADFFACRGLDIHEGFKVLSGA (0) IPMLPGVTSDLQTRRINLIHRRLAAAQGGDHLQR (1) LVPFIVQDIRQGFCSWGSTESLIDPFTRIPA (0) LLFQTTVRCLGSHELADDGATVARLLSLYDTLDRSTTPLSVLLPWLP SPSMLAKLRASKQVYDIVDGAIRARVASGVSRDDTLQILLDHGDEKMVIVG (0) FIMGLLVAGARSTGTT (1) ASWIVTFLAGHPEWRRKVREEIHALLSLYAATTAHCANDPAELLATVPLEAFEQCMP ATDAVIRETLRIAQPHTAMRRNVGPDTYIAGTRIPSGAYVVYPFSDIHLDPRLYPD PWRFDPSRPESKSNIGYVGWGG (1) GRTVCLGQRIAKLQIKLVLSMFLLHYDFDLVDQDEHPLGNVPR PNWNDHLTCKPPSGSCLVRLAKQ* >CYP5156A2 EB008493.1, EB007318.1 Gloeophyllum trabeum ESTs 61% TO CYP5156A1 LADIPLSAWEGSTPVLDAVIRETLRLAQPHTAMRRNVGPDLVIDGKTVPSGAYVVY PFSDVHLDADIYPDPWRFDPGRPADAKRAHAWVGWGGGKTVCLGQRLAKLEMKIIA AMFLVGFDYAVVDKAGKPADPLPKPNWNDALMCRPEAGSCYVK YERAASSTSSSSSSSPSSPSSPL* >CYP5156B1 EB016637.1 Lentinula edodes 50% to CYP5156A1 N-TERM ONLY MPYSVGLQAAGAALLSAGLPVLFTTAIIVLLLIISINSSLQKDVADAPARLPI YSFFTIIPFFRRRFDFLNWGFQATGQSTFQFDLLRNKVIVVSGESARQAFFTAKGLDLTE GFKILSGAIPMVRGVTSDLQTKRISLIHKRLAAVQKNEQLSMLIRPMLEDSRRLMESWGN SGCFDPFDNIYELVFQLTVRSLSCTEISDDPCLVSRLKKLYDTLDVGTTPATVLLPWLPT PAMVKKLWATKEIYDIVVAAITAR >CYP5157A1 genewise.21.71.1 scaffold_9 (1898532 bp) : 1276291:1276784 (494 bp) fgenesh1_pg.C_scaffold_9000353 [Phchr1:5749] Phchr1/scaffold_9:1276084-1278363 30% to 608A1 Magnaporthe grisea over 412 aa MSLLPPLTNLASGHISFFLGIPRGELFLLFLPFVVLIAFVLFKRIIQATKRKHASTRPL CVLVSTDRPIDQLPPYHYRPTSKRFATMDLLSALRGREREYTKYVFADDKSLSFEEGAA TILNLRFLLKIRGGRFYKDVDKLITSGIIPRIEAITNKIYPIFMRHARRLVEDGQKNNG 
CVDFFAHTNHSIAESMLTVVMGEVGIYENLSWFSRTFPRLSKFSRLRDGLSDNGCPRLR LLFGTLIYRYFFVLGPYVWRELRNNKFEPLARSEKEHDANESVLRYLGRMFAREDGTVS AVDTCWCMCLMLSLIFASVHQTAVVAVWVMYELASRPTYIPAIREELLAVAELQADGSH YLSYDSLRNARLLDSFIREVMRLKGDTLGVCRQTVQDTPMGQYVIPKGHLVIPMASLSH RSREYHGQDAEVFDGFRWVERNLPAVMVGPTYFPFGMNRWACPGRVLAVSEMKMIALTI LALADPTLEGGKYTVVDPLNTTSVQPAGKLYLTPLARPLI* >CYP5158A1 39% to 5037B2, 36% to 5037B3, 36% to 5144A5 scaffold_2 2356566-2358785 2356566 MLTSQVSIAKLTTLPTSYYA 2356626 LLATIALLALLFARRTQQPTPPGPRGLPIIGNVAELSGGFEWIRFGTTLRKQF (1) 2356784 GDVLGFKVLNNRILVLNTAKAAKEFMDKRASKYSSRPVLTVIGELMGLDQ (0) AMPLIPYGAEWRACRKLEHVALNQSAVKQYRPVIEHHAAQLALDILQEPDKFLTHTRL (2) IILAVTYGLSARVTATE (0) YISLAEEVMRIVTIYLRPFAHLCDVMPV (1) LKHLPSWIPFRREAEYGRRLFESFVSTPYERTKQAF (0) AKGDAEPSLIRDILASMPQDSLTPEVEHRVKWTAGFALLA (1) SGGES (0) 2357970 TFGTIGVFMMAMALHPDKQARAQEEVDSAVGTDRLPTMDDKARLPYVYAVVQEA 2358131 2358132 MRWHPMLPL (1) 2358158 2358208 SLPRRAEVDDEYDGYYIAKDTTVCANLW (2) 2358291 2358346 AMGMEPNVKYPPEQFIPERFLDAEHPTPNPNTWAFGFGRR (2) 2358465 ICPGKALAEESLFVLMSTLLAMFEICAPPEGIKPEFESRVVR (2) 2358702 LPKPFKCIFRLRSPEKADMLRAVVATQ* 2358785 >CYP5158A2P gx.89.1.1 pseudogene 92% to CYP5158A1 scaffold_2 2396927-2398113 2396927 MLTSQVLIAKLTALPTSCYA 2396987 LLATAALLALLSARRTQQPTPPGPCELPIIGNVAELSGGFEWTRFGTTLRKQF (1) 2397145 GDVLGF large deletion here, missing 6 exons SGGES (0) 2397300 TFGTIGVFMMAMALHPDKQARAQEEIDSAVGTDRLPTMDDKARLPYVYAVVQEA 2397461 2397462 MRWHPMLPL 2397488 2397541 LPRRAEVDDEYDGYHIAKDTTVCANLWY 2397624 2397685 MEPNVKYPPEQFIPERFLDAEHPTPNPNTWAFGFGRR 2397795 2397854 ICPGKALAEESLLVLMSTLLAMFEICAPPEGIKPEFESRVVR 2397979 2398033 LPKSFKCIFRLRSPEKADPPRAIAAAQ 2398113