Phanerochaete chrysosporium cytochrome P450s

149 named intact gene sequences
10 named pseudogenes

D. Nelson
August 16, 2007

 
>CYP51F1 ug.2.6.1(CYP51)
MSLSQYGPIAGLVGQAYDALASMSTSRLVLFLLINIPILSVVCNVIYQLLPKDK
SLPPVVWHWFPWFGSAAAYGEDPIKFFFDCKEKYGNVFTFILMGRKVTVALT
PAGNNFIMGGKHTTFSAEEVYGGLTTPVFGKDVVYDCPNELLMEQKKFVKF
GLSTENFRQYVGMIEEEVLQFMRNDASFKIYQMNDINEWGAFDVLKVMSEIT
ILTASRTLQGKEVRANITKDYAQVYNDLDGGFTPLHFMFPNLPLESYRKRDA
AHKKISDFYISIIRKRRENPGQEEHDMIAALMNQKYRVGRPLKDHEIAHIMIAL
LMAGQHTSSATGSWALLHIADRPDVAEALYEEQVKHFRQSDGSWRTPEYEE
LKELPVLDSVIRETLRIHPPIHSIMRAVREDVVVPPTLAAPSEDGRYVIPKGHV
VLSSAAISQVDPMLWKNANDWDPSRWSDPEGVAAQAYKQYDDAEGAKVDF
GFGLVSKGTDSPYQPFGAGRHRCIGEQFAYLQLGTIISTFVRHVEMRLPETGV
PPPNYHAMITLPKAPRNILYRRRNFD
 
>CYP53C2 ug.1.19.1= pc.1.261.1
MAVIEALTQLDLKSWLLLIPALAIVAHILIWLLDPHGIRSYPGPLLAKFSDAWL
GYVAAQGHRSEVVHDLHKQYGTFVRIAPNHLSIADPDALQVVYGHGTGTLK
SNFYDAFVSIQRGLFNTRSRSEHARKRKIVSHIFSQKSVLEFEPHVRLYVKQLI
QQWDRLYEAGAKGLVWLDCLPWYNYLAFDIIGDLAFGAPFGMLLAARDAA
PVAVDHEQAMASYGKEKSEVQYIPAVQVINDRGTYSASLGVLPPWMRPIVKL
FPWFRRGQKAVKQLAGIAVAAVAQRLTTPTDRVDLLGKLQEGRDDDGNLM
GKEELTAEALTQLIAGSDTTSNSSCAITYYLAKYPDAQRKLQQELDEALGSDD
EPVSTFDQVKRLPYLQAVIDEALRIHSTSGIGLPRLVPKGGMTVCGRFFPEGTV
LSVPTYTIHRDEEVWGKDPEVFRPERWFEQDKNAVQKTYNPFSFGPRSCIGRN
LANMELLIIVSSILRRYDF
VLEDPDKPFDTMEGFLRKPVECVVGIRRRTL
 
>CYP61A1 ug.78.18.1(CYP61)
MASSQAAFPSTLSDSSRHSTDSPAFIGLLPTGSWFYTTAAILLSLLVIEQSVYRY
KKRHLPGDKWTIPLIGKFADSMKPTMEGYMKQWNSGALSAISVFNVRFIVM
ASTTEYARKILNSPTYAEPCLVHSAKQIILPDNWVFLTGKEHVEYRRGLNLLF
TRKALGYVLQYCLVLYAMLTHRRSLYLGIQDVITRKHFAKWLADAAKDPSA
KPIMMTARELNMETSLRVFCGNHIPEHGAKEISDKYWMITVALELVNFPLAIP
GTKVYNAIQARKAAMKWLELAARKSKESVAAGNPPECMLEEWVTILNDPAY
KGRREFSDHEMAMVVFSFLFASQDAMSSGLIYGFQHLADHPEVLAKVREEQE
RVRGGDYEKPLTLEMMDEMPYLRAMVKETLRVKPPVTMVPYKTTKAFPISQ
DYTVPSGSMVIPSFYNSLHDPAVFPDPDRFMPERWLDPNGSANTNPRNYLVF
GSGPHKCIGLEYAMMNIALVLANAAVLMNWEHELTPQSDKVQIIATLFPQDG
CKLKFSPRQHA
 
>CYP63A1 PC-1(ug.20.36.1)
MGLTQAQRLVLGQLARLVAPALAVCVLLAAARRTQLVRAPVWADALIALIA
IPLFHVGRAHWRYARLARKAARLGAALPPRWEGKLPGSVDVLQLVDEAYRR
GFLSDYFYEKFGELGHTYNFYVLWDMDYCTEDAAVIKAVLATDFNNWVKG
ERFDSYMHSVLGTGVFNADGELWKFHRSMTRPFFARERITDFETFNRHAEEAI
LKMKERLREGFAVDFADLISRFTLDAATEFLFGACVHSLAGALPYPHGAPAH
LHTTRARIPADDFAAAFRAAQDAVSHRARLVWLWPWFELARSRTDTPMRTV
DRYLTPIIERAL
AMSRAAKQAPQGEKEEVADGETLLDHLARYTTDPTILHDEILNIMIAGRDTTG
GTLTFVIYFLTQHPDVLQRLRQEILDVVGPSNLPTYDDIKQMKYLRAVLNETQ
RLYPPVPWNMRYAVEDSIVPNSEPEGKPWFIPAGASVSYSVHCMHRRKDYW
GPDAEEFDPDRFLDERLHKYLTPNPFIFLPFNAGPRICLGQQFAYNEMSFFLVK
LLQTFEDISFERDAFEPNALPPAEWAKFPGRKGKEKFWPRAHLTLYSEGGMW
VKMREAQAMGQVA
 
>CYP63A2 PC-2(ug.20.35.1)
MLVSVDALALRTLVYELTYLLYPAVPTAAALILLQRFGNVWLPTWTIVLLSL
CNVPVAHRILVWLKDGRAARKAASMGAILPPRLKGRWPGSIDLLRQLTQTFE
TGFLSEMLWGYMHVLGQTFEVYILWDSNYVTSDANVIKTILATDFDNFVKGE
KLDVCVRPVLGTGVFNSDGEMWKFHRSMTRPFFTRERISHFDLFDRHADAT
MAKMKARLAEGFAVDFQDLISRFTLDSATEFLFGQCVHSLASVLPYPHDAPA
HLQTTGASRTEDFARAFAEAQDAVSFRLRMGWLWPWFELFGSRTKAPMAV
VDAFLDPILRDAVARADKIKRENGGRVPEVKGEIEEDETLLDHLVNVVQTKIL
HDEVLNIMIAGRDTTGGTLTSAVYFLSQYPEVLRRLREEILEKVGPTRRPTYD
DIREMKYLRAFINETLRLYPAVPWNVRYPVKDTTIPGPHPDKPYFIPANTPVSY
SVHCMHRRTDYWGPDAEAFDPDRFLDARVQRYLTPNPFIFLPFNAGPRICLG
QQFAYNEMSFFVIRLLQHFDEVQLCEDALAPDCRVPDAWRGAPGRKGVERF
WPKAHLTLYAKGGLWVKMREASTSEAVV
 
>CYP63A3 PC-3(ug.20.34.1)
MPSSIDFPDRLVLRVIAYELVFLFYPAVPAAAGLVLLRRLTDIWLPTWAIVLLS
VCSLPVVHGLSIWRNHWRAARKAARMGAVLPPRLKGRWPGSIDLLMRLTDA
FETGFMSDLLWEYMHTIGQTFEVYVLWDSNYVTSDANVVKAILATDFTSFVK
GKK
FDVCMRSVLGTGVFNSDGDMWKFHRTMTRPFFTRERISHFDLFDRHADDAM
AKMKARFAEGYAVDFQDLISRFTLDSATEFLFGQCVHSLASVLPYPHNAPAH
LQTTSASAAEDFARAFAEAQTVLNFRIRMGWLWPWFELFGSRTKAPMAVVD
AFLDPI
LKAAVERADQIKHENGGKVPEAKEEIDEDETLLDHLVKYTNDPKILHDEVLNI
MIAGRDTTAGTLTSAVYFLSQYPEVLRRLREEILEKVGPTRRPTYDDIREMKY
LRAFINETLRLYPAVPWNVRYPVKDTTIPGPEPDKPYFIPANTPVSYSVHCMH
RRTDYWGPDAEAFDPDRFLDARVQRYLTPNPFIFLPFNAGPRICLGQQFAYNE
MSFFVIRLLQHFDEVQLCEDALAPDCRVPDAWRGAPGRKGVERFWAKAHLT
LYAKGGLWVKMREAPTSEAV
 
>CYP63A4 PC-4(pc.151.16.1)
MALPPGLQYLLPQLPLLLAPPAAVLLAAHAARAFAGTAAPAWALALACVLS
WPVALTALVQLRAHRVAREAAARGARLPPAVEARYPGGVDLMRRNNSEVE
EHIPGYRLSEFGRQYGWTYNFRMLFQDRVRGRPRPRPPGRILATDFTSYEKGA
VFSAQMKSLLGTGVFNADGDLWKFHRAMTRPFFSRDRISHFDVFDRHAEDA
LKLAKARLSEGVPIDWQDLVSRFTLDSATEFLFGQDVRSLSAPLPHPPTAPQA
QHDTHDAEHPANRFAHAFLQAQLASARRSRYTAAWPLWEFWENKVEKHTR
VMDEFIQPLLRDALARKAKGADAQAEEAVADGETLLEHLVKLTDDPQIIHDE
TLNILLAGRDTTAITLTMAGYMLAEHPDILQRLRKEILDTVGTRRPTYDDIRD
MKYLRAFINEVLRMYPPVPFNVRFSTAPTVWPSPEGDFYVPAGTRCMYSVFV
MHRRKDLWGPDADKFDPDRFLDERLGKYLTPNPFIFLPFNAGPRICLGQQFA
YNETSFMLIRLLQRVSKIELHPEVSPQSVAPPGWAASSISDGKDKVVFKSHLT
MYVQGGLWVTMQFENPEEH
 
>CYP63B1 PC-7(genscan.57.18.1) (genewise.57.16.1)
MPHPFSRYRLRVFGDFVRIVLAPSFVFWSAVQILKLRLGLLSPAAWLTFLFAA
SYARVQYRGFLQRQEARRRGGVLPPEVVGRWPGNIDILIKLGKASLTAYPGSF
YLDLFEEYQSTTLNLKLLWSDLVRCLSFCRLSAVLKTLSQIITMDEEHIKHILT
TGFNHFWRGRRQKERMYAPSGASRRHDTDSQGDVSQEWKKHRALARPFFA
RDRISDFDLFEKYAGATLGILGGLAGRGAAVDVQDLYARFTLDAAAEFLFGE
RLDTLHGALPVAGQAKLGSKGAATDDAFGAFVRAFEASQDIITTRQVRGYFW
PVRELFQDKVAPHAAVIGAFLEPIVQRTLDRKAKMRAAGVSPTTEHDTFLDY
LADHTEDPKVIRDQLLNILMAGRDTTACLLTYVTYVMAMYPDIMQKMRQEV
LHVCGHDAPNFEKLKALRYVHAVLNETLRVFPPVPMNVREVRARGVVLPHA
DPTYAAAPAPLYVPGGTVVMYLPVLTQRNTALWGDDADVFDPDRWLDARL
RRFTENPMMYTPFSGGPRICIGQNYARNEATYLLVRLLQQFDAVALAPEAQP
AGSLPPPEWRHARGRAAEERIWPAYAITLYVKVRLSLQWLYC
 
>CYP63C1 PC-5(pc.101.32.1)
MELHPRQYRLRFLLDVLRAIVWPQLVFNAALYLAGFHPGAFLRVVASVLAVP
LLGTVRTAISQRRNKIQAGAALGAKEVPCVRGKWPGNLDIVLGFVRSLKEAY
LMQFLDDLFREYDCKTLNMRLLWEDQIWTIDEAHVRYMLAGPGFEWFHKG
YYWQERMESFLGNGIFNRWAQRAIARPWFVKDRISDLNIFDRHTTTTLALISE
FVDRREAFDAQDLFARFTLDSASEFLFGRCLDTLHGTLPVAGRAKMGPKGTA
IEDAFGSFARAFEDVQVQIARRTRIGKPWPLFELFTDKTAPSVAVIHDWLRPIV
HEALAKKSAASAEKESGEDSTFLSHLANSTDDPQDIAYSVLNMLLAGRDTTA
SVLSFVVYFLALHPHVTEKLRAEILQAYGPDGRPSVEDMKDLKYVRAVLNET
MRLFPPVPMNLRLSDAHPRIFPASGSAPKYYVAPRTVILYSIFLVQRRTDLWG
ADALEFRPERWLEPATARLLADHPFAFTPFHAGPRLCLGQNFAYNEMTFFIVR
LLQRVSGFELAPDAQPEGSLPPARWKYGEGRQAVEKIWPASSVTTFIKVSLAS
MPCCGERWLKRRRQGGLWVRAVPA
 
>CYP63C2 PC-6(pc.101.28.1)

QRAIARPWFAKDRISDLNIFDRHTSTTLALIADFADRREAFDAQDLFARFTLDS
ASEFLFGKCAETLHGTLPVAGRAKLGPKGSSVEDEFGSFAWAFEELFHDKTA
KHRKVIQDWLQPIVREALHSKAAAARGEDTGEGTFLSHLTKTTDDPQDIAYSI
LNMLLAGRDTTAAALSFTVYLLALHPEVVEKLRAEVVQAYGSDGRPSVEDM
KSLKYLRAVLNETMRLFPPVPLNIRTSDDTPRVFPASAGAPKYYVPPRTPVVY
SSVIIQRRKDLWGADALDFRPERWLEPETARRLAENPFMFMPFHAGPRLCLG
QNFAYNEMSFFVVRLLQRVAALELAPDAQPEGSLPPARWKNGEGRQAVEKI
WPGSSVTTYIKVSSTRSRPCG
 
>CYP502B1 pc.5.187.1
MDTVLVGLFVALALYAWSRSSKRSALPVPPGPKPVPLLGNIFDLTAKELWLR
VTGWSKQYDIVYIHLLGQGLVFCNTYEVAQDLLEKKGSIYSDKCGCQNMVA
FTRYGDFARRQRKLMNTAFGISAVKRYRPLLANESVLLLKRILADPQDYMGY
IRRYAGGLTLQSVYGYRVETNDDPLLELGTECVDILSNKIASGGGIWPVDIFPF
LQHLPTWFPGAGFKRKAAVWRAKMEEFVDKPYEMVLERMRSGATVPCFVT
TLLEEARDEKGGAVDAQRDFDIRWTANSMYSASMDTTITVVQLFLLAMILHP
EVLRKAQAELDAVVGPARLPTFADRPALPYLDAVMSEVLRWGVPVPLGLPH
RLMEDDVYRGTHLRAGTLVFANIWNMLRNEAIWAQPDVFRPERFLEPVDEA
TAKRRDPRPYVFGFGRRRCPGLHLIEESLWIVMATLLATTDILAEKDESGKPV
MPHVDFTNSLVVPFSTPAPFKCDIRPRSEQALQLVRLAE
 
>CYP505D1 ug.73.17.1
 
MTHEIPCPPAWPFLGHMTSIDPEYPTLSLHLFTKQYGEIYRLRLPGRDLVVVNS
QELVHEVSDDKRFKKSPKGGLQELRPLIGDSLLTADYPREENWGIAHRVISPS
FNPIGLRGFFDDMVDVISQLVLKWERFGPHYKIDIAEDFTAATFEVIALCCASY
RMNTFYTGGTHPVATAVVDYGVEGFARGKRGRLLSWLMRSATAKFEQDKE
TLLQYADELLEERKAHPTDRKDVLWAMMNRADPVTGKKMTDLSVKQNLLT
LLTAGHETTSAFMSIIIYYLIKYPEAMRKLREEIDTVLGDRQMTADDLARLPYL
LAVMRETLRLTPVAPGRVIEAIEATTLKGGQYAIDKGQDILVAVHSSHRDPKV
WGDDVDDFRPERMLDGKFEALPPDSWQPFSAGLRACIGRAIAWQEAQIMITF
LVQHFTFTLADPQYELRIKQAFTLRVHDLYVHARRRTDRRGCVTLLPPAPAP
GVGLAEAKGAPHDGGEGALPMHVFYGSNMGTCEAFAQRIVADAGRHGFKA
SLAALDAAVANLPTDGPVVIVTASYEGQPPDNAAHFVEWATNMRGSGAPAL
AGVVYALFGCGNRDWVQTYQRVPTLVDGALAAAGAERLLPRAEGDAGSGG
FFEAFARWEGALWAALETRYATMKSGSAEGAVDVEVLDAGVSRADVLRQP
DTMMGTVLENRVLTAQGAPVKRHIEFKLPEQVTYKAGDYLTVLPMNPPRDV
RRAMARFGLLPDQEVTIRTKTPSSLPTGRPISVYTLLSAYVELSQPATTRDLRF
LSEAAKSEAEKLVFKELAENYTECVLTGRLSVLDILEAHPNVDVPFGAFLQLL
PSMRARQYSISSSPLCDPTRASLTIRVFEAPTSPGRKDPLLGVASTYLGGLHPG
DRVQLAVRPCKTAFRLPADPAVPLVLVCAGAGLAPMRGFLQERALQKEGGR
DVGKSLLFFGCRHPEEDYLYRDEDLKKWVELGIVDVRVAFSRAQDQSLGCK
HVQDRLWHDRTDVMDACDKGAKLYLCGSAKMAAGVKDKLVLVVQDAMQ
LEHAAAVEQFNTMMAGRFATDVFE
 
>CYP505D2 pc.73.4.1
MTEPIPTPPSVPFLGHIPLLDREVPMLSLALLAEQYGDIYRLIFPGRSSIAIASQE
LVHEVSDDKRFRKTVQGPLGEVRAVAGDGLFTADVPGEENWDIAHRILMPA
FSFMKIRDMFDDMVDVVAQMVVKWERFGPRFRIDPAVDFTALTLEAISLTTM
SYRMNAFYTFVQNGIHPFAKAMNEFLQESGGRSRRGRVLSAFMRGATAKWE
QNRDLMMKYVDDNARSSARKDVLDLMMNEKDPVTGRKMTELSIKQNLLTF
LIAGHETTSGMLTFTIYYLLKYPAVMRKLREEIDTMIGDRPMTVDDVNKMPY
LTAVMRESLRLGPSVPGRMIESLKDQTLKNGKYAVAKGEILVVCNFIAQRDS
KVFGDDADEFKPERMMDGKFEALPPDAWQPFGAGVRGCIGRAFAWQEVQIV
LVYLLQHFNLAFADPNYDLRLKQTLTLKPNEFYIHAIPRAERRRAIPLLGPRAG
PTSAPVNGTNGIADEGGHPMYVYYGSNMGTCEAFAQRIAGDAGRYGFSAAV
ASLDSATENLPTDGPAVVITASYEGQPPDNAAHFVEWLGALGDADSPLAGVA
YAVFGCGNHDWVQTYQRVPTRVDEGLAAAGAERLLPRGEGDAGAGDFFEA
FTRWEAALWEALGKKYETAKGSGKEAGVQIKVTNATVSRADALRQADTMM
GTVIENRVLTAPGAPEKRHLDIRLPEGTTYNAGDYLAILPTNPSRDVRRALAR
FGLLPDQEITIESASPTSLPTGRPISAHTLLSGYVELAQPATTRDLRLLSEAATS
DAEKLVFQKLADNYAEEVLAARLSVLDILEAHPDVNIPLGAFLQLLPTMRVR
QYSISSSPLADPTQASLTIRVFEAPCTAGRKAPLLGVASTYLGGLHAGDRVAL
AVRPCKTAFRLPADPALPLVMVCAGAGLAPMRGFLQERAAQKRAGRDVAKS
LLFFGCRDPAEDYLYRDGDLAEWTALGIVDVRAAFSRARDQSLGCKYVQDR
LWHDRADVMAAWDKGAKLYLCGSAKMAAGVKDKLVLVVQDAMQLEHAA
AVEKFNMMMAGRFATDVFE
 
>CYP505D3 ug.73.15.1
 
MSQPIPMPPSVPFLGHVTTIDAELPVMSFRLLAKQYGEIYELNMLGRCILWML
VINTQELLHEVSDEKRFRKIVSGGLNEVRNAAGDGLFTAHADKEQNWAIAHR
ILMPSFSAMNMRNMFDDMVDVVSQLVLKWERFGPYHKINPADDFTALTLEA
ISFCAMSYRWVIFYSIYVRNDVHPFARAMSDFLLESGARARRPGIIAPFMRSA
NAKYQQDIDVLMNFVDEIIADRRAHPTDKKDILNVMLHAKDKETGLGMTED
NIRRNLLTFLIAGHETTSGMLTFIMYYLLKHPEAMRKLREEVDTVIGERPMTV
DDVNKLPYLIAVMREALRLGPPASARGASPYEDTTIGGGRFAVPKDTFIMCSL
YNIHRDTKVWGEDAEEFRPERMLDGKFEAMPPDSWQPFGYGMRGCIGRPFA
WQEAQIALVYLMQRFTFAMADPGYDLRLKQTLTIKPHEFFIHAIPRADRAHG
APLFSTPSPLRPRAASSAQPPADTAGRTPVYVLYGSNTGTSEGFAQRIASAAA
GKGMYSRSTIGTLDSAAAHLPTDGPVVIVTASYEGQPADNAAHFVEWLSSLQ
GTELEGVRHAVFGCGNRDWQATYQRVPTLVDDALTARGSIPLVLRGAGDAA
ASDFFEAFEKWETGLWGALREAYGVATGANAESGISIETLDTGKGRASILRQP
DAALGTVVENRVLTAPGAPEKRHIEFKLPEGMTYQTGDYLAILPVNPQRDVH
RALARFGLLPDQEITIRSAGPTTLPTDRPVNVSTLLSGYVELGQPATTRDLRLL
SEHAKSDSTKAALQALLDNYASDVLGARLSVLDILEAHADIALPFAAFLDTLP
SMRVRQYSISSSPLADAAHASLTISVLAAPARSGRPERFLGVASTFLGGLRAG
DRVPLAVRPSAAAFHPPADPSVPLLLVGAGAGLAPLRGFLQERALQKKAGRD
VAKSILFFGCRRPDEDLLYGDAELKEWQELGVVDVRPAFSRAPEHSFGCKYV
QDRVWHDRAEAVATFKAGAKLYICGSSRMAAGVKEQIVLIVQEDSKLEYPE
AVEKFEKIMVGRFATDVFE
 
>CYP505D4 pc.73.11.1 (ug.73.16.1)
MTHPIPTPPTVPLLGHATLIDHDFPMGTNALWAREYGEIFRMCFPGRTVYVVS
SYELVHEASNDKLFRKSVGGPLAELRSSVGDGLVTANVPGEENWGIAHRVL
MPCFSTISLRNMFDDMVDVVSQLVLKWERFGPHYRIDPAEDFTALTFEAISLC
SMSYRMNPFYNSAMHPFAAAVVDFQVECMARSRRGKLLNALIRSAKTKFEQ
DRDLLMQYADETVLEDRKAHPIEKKDVLWTMINRADPVTGKKMTDLSVKQ
NLLTLLMAGHETTSGMLTFAMYHLLKNPEAMRKLREEVDTIIGDRAMTADD
LSRLPYLVAVMRETLRLSPSAPARIVQAMEATTLGGGKYAIAKDDTLLIATYV
SQRDPAIWGPDAEEFRPERMLDGKFEALPPDAWQPFGAGIRSCIGRPFAWQE
VQIVLVSLMQRFTFAFADGHYDLRMKQTLTMKPHDFYIHAIPRTDRARVPPL
LGVRAAPAQSTDGEKGKVEAGEGAPPMYVYFGSNMGTAESFAQRIAGDAGR
HGFKATVAPLDAAVEKLASDGPVVVITASYEGKPPDNAGHFVEWLSNLGDES
ALAGVSFAVFGCGNRDWARTFQRIPTLVDDALGAHGGARIIPRGVGDASTGS
FFESFANWEEGLWAALAEKYETAKPTSVGGLELVVTDAGPGRADALRQPDT
TMGTVVENRVLTAPGAPVKRHIEIQLPEGTSYTAGDYLAVLPTNPPRDVRRV
LKRFALLADQEITIQSADPTSLPTGRPVNVYALLSGYVELAQPATTRDLRLLIE
ASSTDAEKQVFKELADNHAERVLKPRLSVLDIVEAHPSVHVPFAAFLQLLPA
MRVRQYSISSSPLVDPARATLTIRVFELPGAPARRPHLGVGSTFLARLAPGDR
VQLAVRPCKPAFRLPADPTVPLVLCCAGAGLAPMRGFLQERAMQKQAGRDV
GKSLLFFGCRDPQEDYLYKDDDLKAWVDLGIVDVRVAFSRAPDQSLGCKYV
QDRIWHDRADVLAAWNQGAKLYLCGSAKMATGVKDKLVHVVRDATGVDE
ASASDKFNEMMDRFRYRYFRVRYEHVVSAAFTYCIWNYLRDHSTRFNCPIA
MACTFRLGAIARGSICATWLRWREE
 
 
>CYP505D5 pc.73.14.1
MTTPIPSPPSIPFLGHVTIIDREVAIYSYNLLAKQYGEIYQLNMMGALRIIVICSQ
ELLHEVSDEKRFRKIPRSALEQVRNAVGDGLFTANGDDPNWHLAHRILMPAF
STMNTRNMFDDMVDVVNQLVQKWERFGPRHKIDPAQDFTALTFEAITFCAM
SYRELTLPQEGVHPFARAMADFLVESGNRALRPGIVQPFMRSTNSKYEEDIKI
MEHYVNDIYEQRKANPTDKKDILNLMMYGKDTQTGEGLSEKTIKDNLLTFLI
AGHETTSGMLTFIIYYLLKNPEAMRKLREEVDTIIGSRPMTVDDVHKLPYLIA
VMREALRLGPPAPMRGAASFEDTLLKGKYPVAKDVPIYCGVYMVHRDPKV
WGEDAEEFRPERMLDGRFEALPPEAWQPFGFGVRACIGRPFAWQEAQITVVY
LMQRFTFVMHDPSYDLQLKQTLTIKPHEFFIHAIPRTDRPSIVPIPTPSSTLLRD
QTAPAAQPPVTTPGEGGGHRMYVLYGSNTGTCEAFAQRVASDATVHGEVSV
FIGTLDSAAGHLPSDGPVVVVTASFEGQPADNAAHFVSWLTALNGSALADVS
FAVFGCGNRDWASTYQRIPTLCDDTMAARGGKRLVPRGEGDAGSSDLFESFE
HWEAGLWEALQKTYGTTKVEGRQEAIKVSTVDAGTARATALRQPDTMLGT
VVENRLLTSPGVAEKHHIEFQLPDGLTYRTGDYLAILPMNPSRDVQRVLAHFS
LLPDQEVTISAAGPSPLPTGRPVNVSSLLSGYVELSQAATTRDLRILMSAAKSE
DTKAALSELLDGYAEKMQAARLSVLDILEAHPGLDISFALFLQLLPSMRVRQ
YSISSSPLADPTRASLTVSVLSAAPTAGRREPFLGVASTYLASLRAGDCVQLA
VRPSAAAFHPPADPAVPLVLFCAGAGLAPMRGFLQERALQKQAGRDVAKSIL
FFGCRSPQHDFLYADSDLRTWTELGVVDVRPAFSRDTEHSAGCKYVQDRVW
ADREDVVKVWKAGAKMYVCGSGRMATAVKQKLVEIIAAQLNVDSEKATET
FNNIIKGRFATDVFE
 
>CYP505D6  pc.17.40.1

MTSTIPTPPSIPFLGHVASIEREVPLRSFRLLSEQYGEIYELNILGRKLLVVSSAK
LMSDVSDDKKFYKNMSGPLMQVRNAVGDGLFTAYGEEPNWGIAHRLLMPA
FGTASIRDMFPDMLDLASQLVLKWERFGPKHRIDPAEDFTRLTLDTIALCAMS
YRLNSFYRDSSHPFVQSMVDFLVECNLRANRPGLLTSVMVQTNAKYEEDIKT
MTELADEIIAERRRNPTDKKDLLNIMLYSKDPKTGQSLSDVNIRNNLLTFLIAG
HETTSGLLTFALYYLIKNPEAMRKAHEEVDEVLGDQQIQLTDIGKLKYIDAVL
RETMRLSPTAPMRTVRPFEDITIGDGKYFVPKDYTVVINTIVAQRDPTVWGED
SNEFHPERMLDGKFEALPPNAWQPFGFGMRACIGRPFAWQEAIIALAVLLQK
FDFVLDDPSYELELKQSLTIKPAHFYVHALPREGKPQLLATPSAAPFSSHARET
TNASLPASPGTEAKQPMYVLYGSNTGTSESFAQRIANGAAAHGFRATLGTLD
SVADHLPTDGPIVIVCASFEGEPADNAAHFVERLTSLQDKPLQNLRFAVFGCG
HHDWFRTYQRIPKLIDQTLEDRGAQRLVPRGEGDAGSSEFFEAFEAWETKLW
EVLPEEYNTVVKQDITSGLKVETVGEGATRAVDLRQHDAALGTVIENRVLTA
PGAPQKRHIEFELPEGVTSRAGDYLAILPSNPPQDVHRVLARFGMLPEQQIVIS
SSGPSSLPTGRQISAFDLLSGYVELSQPATARDVRTLLNIDSSDATKESLKALL
ESYSDAVLGRRLSVLDLLEQYPDIKLPFAAYLALLPSMRIRQYSISSSPLWNAQ
RVTLTVSVLEAPALSGRKEPFLGVASTYLANLRPGDKVQMAVRASNAAFHLP
QDPRTPLVLFAAGSGLAPMRGFLQERALQKKAGREVGRAVLFFGCRRPDED
YLYSDSDLKEWEELGVVELRPAFSRAPEKSEGCKYVQDRVWHDRRALDGLY
EAGAKWFVCGSGKVARGVKEVLTAMIKESRGYSDEEAAAAFERATVGRFAT
DIFE
 
>CYP505D7 gx.187.5.1
ATPIPSPPSVPFLGHVTIIDREVAIYSYNLLAKQYGEIYQLNMMGAKVVVICSQ
ELLHEVSDEKRFRKVPSSALDQVGNAAGEGLFTAHGDNPNWHLAHRILMPA
FSTMNTRNMFDDMVDVVNQLVQKWERFGPRYKIDPSQDFTALTLEAITFCA
MSYRYGRIXVHPFARAMADFLVESGNRALRPGIVQPLMRATNSKYEENIKIM
QKYVDDVYNQRKENPTDKKDILNLMMYGKDPKTGERLSEKTIKENLLTFLIA
GHETTSGMLTFILYYLLKNPEAMRKLREEVDTMIGSRTMTVDDVHKLPYLIA
VMRETLRLGPPAPARGTAPFEDTLLKGKYPVAKDGRIYCGIYMVHRDPKVW
GEDAEEFRPERMLDGRFEALPPEAWQPFGFGVRACIGRPFAWQEAQITVVYL
MQRFTFVMHDPSYDLQLKQTLTIKPHEFFIHAIPRTDRPSIVPIPTPSSTLLRDQ
TAPTAQPGPVTTPGEGGGHRMYVLYGSNTGTCEAFAQRVASDATVHGFKAV
IGTLDSAAGHLPSDGPVVVVTASFEGQPADNAAHFVSWLTALNGSALADVSF
AVFGCGNRDWASTYQRIPTLCDDTMAARGGKRLVHRGEGDAGSSDLFESFE
 
>CYP512B1 pc.30.92.1(genewise2nd.30.46.1)
 
MSLHQVYDAVVAHGDMGTLLVYMAYSVPLVMFLYSLFSPASLRHIPTEGGP
SFPLLSYKAARAYLRDATGILQRGYDKHKGKPFKVAMPDRWVVVLTGKKLV
DELQRLPDDAVSFIKGASDLSGTEHMFGRQVIDDPFHVPIIRTHLTKNLAPMFS
DVFDEVSIAFQELIPACDGEWVPVHAIKVARSVVARTSNRIFAGLPICRHPEYL
NLVINFTVDVAKGRYALLLFPPALKGIAAKILTNIDGRIKEGLKYLGPLIEQRM
ALAEKFGNDSSEKPDDMLQWIIDEVRARNQSVFEVVRTVLLVNFAAIHTSSNS
FTHALYHLAANPEFIAPLREEIETIVSEEGWSKAAIGKMWKLDSFMRESQRYN
GINSVSVKRKALKPLTLSDGTFIPKGTVLVTPTVATHFDDDNYKNPTVFDPFR
YYREKEQDMSAVKHQFVTTSPDYVSFGHGKHACPGRFFAANELKAMMAYV
VVNYDVKFEKEGVRPENIYAAMGISPDPNARVLFRKRESIVSV
 
>CYP512B2 pc.30.93.1
 
MTPSHSFPDLVASCISAWTLCFALGFSIAAASFYSLFGPFNLHHIPTVGGSSIPLI
SHRGARKYMRDAKGVLQDGYKHKGKAFKVALTDRWLVVITGKRLVDELQK
MPEDVASFVGAVADFQGLRYIFGQKVLDDPFHVNIIRSHLTKHLSSVFGDICD
EIYVAFSELIPQQDEEWVPVHAIQVVRTIVARASNRVFVGLPVCRNAGYLSLA
VNFIVDVAKARDFIALFPPVLKPLAAKMTSDIGTRVQEGMQYLEPLINERLRL
MEKFGKDWTDKPNDTLQWMMDGIMERDGTIEQLVRIVLLENFSSIHSSSNTF
THALYHLAANPEYITPLREEVETAISEEGWTKAAMSRLRKVDSFLRESLRLNG
INPVSMQRKALISFTFSDGTYIPKGTILVTPALATHHDEDNYEDATTFKPFRFV
GENPEDDVPLVTTSADFVPFGHGRQACPGRFFAAHQMKAMMAYLVLNYDV
KFENEGVRPQNVHGVLSVQPDPKARVLCRRRKSSYT
 
>CYP512B3 pc.30.113.1
 
MASNHLFSGVLPLDRAASTLGYLVCGALLALLLQNILTTISLRHIPTVGTSTLP
LLSYKGAYDFTRDIKGVFQQGYAKYKGRAFKIAFTDRWFVVLTGRKLLEELH
RLPDSTTSFNHASGSITGSTYIYGRGWLSDPWHIPIIRDRLTKHLAASFGDMYD
ELETAFRELMPSCEEEWVPVHFITMARTVVARTSNRVFVGLPACRNLGYHTL
LVNFALDVSKARNRLAWLPPALKRVAARALTRIDSRIEEGMQYLGPTIRARIV
EMERYKGDWPDKPNDILQWIMEELIARKMPMEEAVRIILRINSSAVQTTANSL
THAIYHLAANPDLIAPLREEVDAVITDEGWTKLAMSKLSRLDSFMRESLRLNI
VNPLSVRRMALKSFTFSDGTFIPKGTLMVTPAHATHLDEANYEHASVFDPWR
FVHQKEEDLSPTKHQFITTSPEFVAWGHGKHACPGRFFASNELKAMMAYIILN
YDVKFARAGVRPDNVYSGLTVAPNQEANVLFRRRQTQ
 
>CYP512B4 pc.30.114.1
MAFSDVVATVGAGPWVAYMMCAVLLALLLYSLFSPASLRHIPTEGGSSLPLV
SYLGAYNVLRNLQSVLQRGYDKHKGKAFKIALPDRWVVVLTGKTLVDELQR
MPEESASFIDATTELTGFGYIWGPRMRKDPCHVPIIRNQLTRQLSSAFGDIYEEI
ELSFQGLMPACEKDWTPVHVIEVARDVVARASNRVFVGLRVCRNPDYLDML
VDCAVSVASARNTLMLFPFVLKTFAAKNVVNMDRRIRRGMQHLGPIIEERMS
LLRSLGNDWPDKPDDMLQWIIDEVAARQMPKEDVVRNIMFLNFAAIHTSSNS
FTHAIYHLAANPDYLGPLREEVEAVTAKEGWSKTAMGRMWRIDSFLRESQR
VNSINPLTVIRRTRTSLTLSDGTFIPEGTVVAAPAYPTHFDDENYVGGDTFDPW
RYVREKEQDLSPSKHQYVTLSPEYVPFGLGKRACPGRFFAANELKAMLAYLV
VNYDVKFEKEGVRPENMHVGLTISPDPAAKVLFRKRRS
 
>CYP512B5 pc.30.118.1
MARSDILDALSLGRTELSTTYLVFGLFLALFLYSLFSPASLRHIPTVGSSSLPLL
SYKGAYDFLRDGRSLLQRGYNQYKGKVFKVAFTDRWLAVVTGRKLVEEVQ
RLSDDVISFPDASGEVTGFKYIFTKCALSRDPFHVNMIQRQLTKHMSVAFDDL
HDEFETAFKELLPHNETAWVPVHAIEVARKVVARASNRIFVGLPVCRDKAFL
DLMVNFTLDVARARDLLALFPPALKPFVAKLVVKLDSRIEEGMQVLRPIIQER
MEIIEKFGKDSPEKPDDMLQWIIDALVERNEPMEQVVLITLFVNIAAINTSSNS
FTHALYHLAARPEWIAPLREEAEAVIGNEGWTKNAMGKLVKIDSFMRESQRY
NSIVPLTCMRKALQPFTLSDGTHIPRGTILVTPAIATHFDDEHYADAASFDPSR
YVPVADAKQGGAPKQYVTTTAEYVPFGHGKYACPGRFFAGTELKAMMAYL
VLNYDVKFAQEGVRPPNAATTLSTRPHQEARVLFRKRNSSVQ
 
>CYP512C1 pc.30.76.1
 
MSSMNAPALPATHIVAGAILVWLLVRTFGTQNLRHIPTEGGPSLPIISFLGLHA
FLTRSREILEDGYQTHKGRAFKVALIDRWLVVLSGKKLVEELQKMPDDTVES
ATTEMFNMQHVFASNWHKDPVHSSLLRSLTRNLGVVFSDMFDELDTAFREC
VPANAERWLPVQAHTTMASIVTRAANRIFVGLPVCRDAGYIHMMIHVAEDV
SDAVRTLSMLPTFMKPFVARRATVIDQRIQQCLDYLRPAIADRMSMLERFGK
DWEEKPNDVLQWIIDEVTARNQGEDEVARIVLFINFGAIETTSFAVTHALYDI
VSRPGLADVLREEVEAAVATEGWTKAATNKMRKLDSVLRESQRLNGPTTAS
MFRRVLQPVTLSDGTYLPAGTTVVTPTLATHFDDTNYADAQTFDPLRFYKPD
GVQAQLVTTSADFVTFGHGKHACPGRFFAANELKAMMAYILMHYDIRPERE
GVRPENVYRGLNVLPDANARVFFRRRQTD
 
>CYP512C2 pc.30.77.1
 
MVLTTDFGGISTTHVVIGAFVTWLLLRYFSAKNLRHIPTVGGPSVPILSIVALY
NFLANGKKVVLDGYQKYKGKAFKVALLDRWLVVLCNPKLVEELQKLPEAL
VGYSLLEAXGTIFETKHIFGADLLTDPVHLVLLRTLTRNLGQVFGDMYQEVET
SFQELVPANEKEWLPVHASPIMRTIVTRAANRVFVGVPVCRDEGYLHLMVHF
AEDVNKAFGLYTVVPSFAQGFVARKAKAVMDDCIERCLGYTRPTIKDRTTM
MDSFGDNWADKPNDMIQWTIEETKARGQGEYDMARMLMFINSGAVETTSQ
AVIHALYDISVRPELADELREEVERAIAEDGWTKDATNKMRKLDSFLRESQRI
NGPMIVSMFRLVREPVTLSDGTFLPAGTTIASPTLGAHFDDSIYPNASTFDPLR
FYKAEAAGQPQFVTTSPEYLTFGHGKHACPGRFFAVNELKAILAYMLMHYDI
KPEQDGVRPENKSMGLGVLPNPDAKVMFRKRHAG
 
>CYP512C3P gx.30.36.1 60% TO 512C1 PSEUDOGENE
NDLLQWIMDEAVARDKS (FRAMESHIFT)
QEEIARMVLFLN (FRAMESHIFT)
FGAIQTTSCV (1)
LSMFPNILTPVTLSDGTFLPAGTTVVTPVLATHYNEDNYTNAALFDPFSCKDN
RSGGQQFVRTSADCVTLGHGRML (1)
CPGRFFAAAELKTLVAYVLVNYDLRPETEGVRPVNIYKGLT
V*PSETAKVLFKKRQTDE*
 
>CYP512D1 pc.27.9.1
MQSSVGEVFASAPTLAKLLAGAAFVLFLNSLWNIWKLRHIPTVGGPAIPILCYI
GTFRYLQDPQKILQEGYEKYKAKPGMFKIAAPDRWLVVVGHPNLIDELQKHS
DEQVSFMDAATEFVGTRYALPGTIADDPWHIPPLKQHLTHAIGSFFGDMLEEL
RVSIEERIPSNEKDDEWVAIPALDTFFWVFTRVIDRIIVGLPIRDTEFIKLMVEFT
MSIGIARFFIGLVPPMLKPAMAKLAARGVHKATAEAEKMLAPVIADRVRHLD
EFGEKWADKPNDLLQINIEEARAQGRPLDEIVIRTIVSIFVGVSTSAASFVHVL
YHLAADPELQAALRTEVEGAIARDGWTKAALVGMHRVDSVLRESQRVNGIN
SVSVMRTALQDITLTSAGAPVCLPAGTLCVAPERALHADKEHYPDPDAFVPF
RFAELRATADARGGAQHQFVSTSTRYVPFGHGKHACPGRFFAGNEMKAAVA
HLVSNYDVRLPDGASTRPPNELFGLAIVPNRSAKVMSRRRQPVV
 
>CYP512E1 pc.154.15.1
MSDYSSLLAYIFISLATLAYLKRLLWPDRQQLEHIPAIGPTAPILSYWGAFRWL
SHGTEITQKGYAKYKGRPFKVANFNRWLVVVSGPKLIDDIRKAAEHELSFEE
AAHENLEVRYTAGPCIAENSYHVPIVRGQLTRNLPFLFNDVRDEVAKAFGDHI
PPTDDWTPVAAHPVIMQIVARATNRVFVGAPKCRDPDWLDLSIQFTADLILG
AHIITQFPQFLKPLAARFFTRVPAAIRRGRRHLERTIEHRKACLEQYGADWPD
KPNDLISWLLDEAKGEERTIHNLVTRVLTLEFAAIHTTSNSFVHALYQLAAHP
EWAEPLRDEIEQVVKREGWSKSSLDKMHRLDSFLKESQRYYALGGVTMDRR
AMKDFTFSDGTVIPEGTFVGVAVLATQHDPQYYDDPDTFNPWRFSDLREESD
ESGRHLLVSTGIEYFPFGHGRHACPGRFFAAIELKLMLAHIVMNYDVKAELD
GVVPPILEFGQNLAPNMKAKVLFKNRQRS
 
>CYP512F1 pc.15.28.1
 
MISIDSISLISGLISFAFIAYYLRQDKLQHIPSPGPTGPISSWYAAYKYIRGDAPQI
IEEGYRKYKGRIFRIADLNRWTVVVTSPSLVEELRKAPEDVLSFHEGIRYSLQL
DYTFGREAVEHEYHIPVIRTQLTRHLTPLFADIHDEIVQSFTDLVPPSDTWTSV
RVVPTVMQVVSRTSNRVFVGLPYCRDPAFCALAVRFATDVVKTGVALHLTP
RALKPLGVRLVSPVSKRVEEGKGDHRQAGGRPAPRQPGGRPARAAQGACGN
WPDKPEDMIQWLIEEANEDERTVEGLVLRILIVNFAAIHTSSMSFTHAVNMLA
AHPECIAPLREEIEEVVREEGWTKAAVQRMRKLDSFMKECQRLHGLGAVTM
SRVALQDYTFSDGTRIPRGTLVMAASRPIHHDAALYVPDADAFDPWRFARLR
AADADASIKHQMVHTSAEYLAFGHGKHACPGRFFAVNELKLMMAHVLHTY
DIRPQTSVPPGRWIRHSLLANPIATVDLKKRQT
 
>CYP512G1 pc.16.37.1
SVPTMGPTAPLLSYWGVLRYMTRPRDVLREGHVKFGGRPFRVASPLRWQYI
VSSPELIDELRRAPDAELDPLAAADDILHFVKALGTKFASNTYHVPIVRTTLTK
NIGTLLPSVLDEMRVAFARYIPADKEWHPVVAHDTNVRIVTQTSSRIFVGLPL
CRDPELLKITMSYTPTVMKTGLLLKVLPRPLQSFVQRGSGSIDALIDRAHRLLL
PTIEERRRMMDKYGAEWLDKPNDMLQWLMDSAEGEERTPRGLAARMLAVY
FAATDTAALGFTVALYRLATHPEYVQPLREEVEAVIAQDGWTREAFRKMPK
VDSFLKECMRLQGPSTLLLQRKAMQDFTFSDGTFVPKGSHVATSIVATHCDS
AYYSDPLTFNPWRFVGAEDDAQDSKHRFATTSPEYLLFGYGRHACAGRFFAE
IQLEMMMAYVVTTYDVRMEKPGVLPEPIEFGSMSLPSMTAKVCFRKRATE
 
>CYP512G2 pc.15.22.1 (genewise2nd.15.10.1)
MFADSPTSLYALVLLGTVVYLLNWLKGSKYKSVPALGPTAPLLSYWGAIRFF
LDAQGMLQEGQLKYGGSPFRIATRRYWQYIVSSPKLIDELRRAPDDELSFLDA
VNEALELEYTMGAATANNLYHVPVIRNTLTRNLGNLSSEIYDEISNAFADCIP
ARDEWMAVPALQSIMQIVARTSSRIFVGLPLCRNREFLEISMTYTTDVVKTGL
LLNMVPGPLKPIVNRLFSKVEQHIDRTHALLRPIIEERQRMMEQYGDDWPDKP
NDMLQWLMDAAEGQEREPRALALRILIVGFAAIHTSSMSFTQALYYLAAHPE
YMQPMRDEVEAVLAAEGGWSKGALQKMRKVDSFLKECQRYEGLGMLFLTR
KAVKDFTFSDGTFIPKGSYVSTSRAATHGQSEYYRDPYVFDPWRFANLRDET
GEGVKHQMVNTSIEYLPFGLGKHACPGRFFAANELKSMMAHLVVTYDVALD
MPGEVPRSVHFGPINSPNRTAKVLFRKRRG
 
>CYP512H1 pc.21.108.1
LQDIPTVGGPNLPFFSYFGAIHFLVRANKIISDGYSKYKGGSFKIAQVNRWLVF
VTDPALNEELRKAPEDQMSSPAALHQYVQGLYTMGFGLDGMLYYIEVLRDQ
LLRHVNPSPAMLYDEMQASFQDFIPQNTEWTPIPALSTSLKLFLRMSNRAFVG
TPLCSNSEYLDLVTEFMNNVFKGAFFYNCLPAFVRPILARWLDLVNPCLDRA
VRLLTPIYATRVAELEAAGKAEWAGASDDLLSCLVASHYSAARDVRELARIL
LVVNLAAVHTVSQSFTSVLFLLASRPAWQAELRAEAAAALAHGYTRDALAR
LRKLDSFVQESLRFNGLGALASTKLALTDFALSNGTVIPKGTLVSAPLRALHL
DDEVYPDGASFQPWRFVRAGGEAAPRQSLASTSPTYLPFGHGKSACPGRFFA
ALELKMVTAYLVLNYDLKLEGDATEVPPVSWFITARVPNYKANVLVRRRQE
KA
 
>CYP512J1 pc.37.85.1
NIPTLGTEMPVLSLWGALRYVMNSHNVIQEGYLKYKGRPFKIAQFDRWLVVL
TTPHHIEELRKAPETGLSSRDAVDSLLKAKYTLGMDVDTVQRYLDLFREKLA
NKLGGLTSEVHEEMELTLNESLPKSEDWEEFCVLPSILKILFRTTNRAFVGAPL
CRSEEYVALGEEYTTNVFKGAVIYNALPKALLPLLSKVLDFIGPTSRRCEQLFR
PEMDKRAAYLEEHGADARGKYDDLLTWLIATHGAGDEINYSELTRIVLIANL
AAVHSTAMVFTFAIFHAAADPGVADALRAEVASVVAEHGWTPTALTKMQRL
DSFVREVQRMHTLGAALVMKIARTDYAFADGHVVPRGALVTAPATTVHRD
DEHYPDAHTFRPWRFVGTPDESEADSARRKATSTSPTFLAWGHGKHSCPGRF
FAVRELKMLLGSVLLRYDVRLKTPGVLPQDQWYLTFRVPDPTACVLLKRRTT
A
 
>CYP5035A1 PC-hn-2(pc.12.112.1)
MADTSLLSRRLKSFFDPQGSPTLLSLPDSFVHLIFKRWEPMKLPIVAFLLFLVP
ACLSLLFASHLSLTKGLATAFATFYTVLVSSIVIYRISPFHPLARYPGPLAAKIT
KWWHAYHVHTGKQHLYVRRLHDQYGDIVRIGPNDVSIRDASCISSGLGSQGL
PKGPMWDGRFMYSPIPAMVGARDHAYHMQRRRPWNRAFSATALKEYEPLIY
GRVHQLVSALADRQGQVVDIAKWIGYFTYDFMGDMVYGGWTEMLRDGKD
EDGLWDVVHRGLEDVSAVYGEVPWVSYYASMLPNVGKDLKRMRKMAFDR
AKQRYDSGSKARDLFYYLSNEDGAEKVTPPRPIVVSDGVLALIAGSDTTAIVT
ATILYSLLCNPTTYHRLQQEVDKFYPRGEDPLNPKHYKDMHYLEATINEGLR
LFPATPSGTQRAPAPGKGDRLIGKYYIPEGTATKFHFWSIQRDPRNFSHPDTF
WPERWLVAEGLEHADEPLTHNANAFVPFSFGPYNCVGKNVAMQEMRMLLC
HLMHTLDLRFPEGYVPRAFEDALEDQFGFKVGELPVIVQRRE
 
>CYP5035A2 ug.97.52.1
MGSDAQLPVLSPRDAFAIIVLSAVGAHLVFKRWEPKKLRVVTFLLFLVPACLS
TLLLPHFGTALGLTVGFLTYWTALTLSIVFYRVGPLHPLYQYPGPLPAKISKW
WHVWHVQQGKQHLYLQQLHDKYGDIVRIGPNEVSIRDPACITPVLGAQGMP
KSDMFLGRNMWPETAPLIGYRDPAEHMKRRKPWNRAFSSASVKEFEPIIQHR
VHQLVEALSDRQGQVVDLAEWISFFTYDFMGDMVFGGWTEMMRDGADKG
GLWDLLRRGLTVSALWGEVPWVSYYAKKLPWTAQDNKAMRVMAFSRTEQ
RYASGSASKDLFYYLSNEDGSEKVSPPRNIVIGDGLLALVAGSDTTATVVANT
MYELLRHPAAYRRLQEEVDKFYPRGEDSLDPKHIKDMHYLEAVINEGLRMY
PAVPSGSVRAPEVGKGGKIAGPYYIPEGTQTRIHFWSVQRDARNFSFPETFWP
ERWLIAEGIEPAPAGEKLVHNPNAFTPFSFGPYNCVGKNIALAEMKQLLCHLV
HKLDVRFADGVDPDAFDRASEDRFIYVVGELPVVVERRD
 
>CYP5035A3 ug.53.54.1
MAGDLSTRDALGIIVVSALGTHAIFKRWEIKHILVVSTLLLFLPAALSTLLIPHL
GSFKGIAAGFSVYFITLLSSITLYRISPFHPLAHYPGPLLPKISKIYHIAKVSSGK
QHLYLQELHNQYGDIVRFGPNEVSIRDASCIMPVLGAQGMPKGPMWQGRHF
WTEVHTLIGFRDPKAHQRRRRPWNRAFNTAAMKEYTPLMQNRVRELGDAL
VARQGQVVDLAEWIGFFTYDFMGDMVFGGWTNMVREGGDRERLWEVVKS
GLKIEFIYDNIPWLSYYTRNIPGAGNAELRAMAIGQTEKRYSRTSTSKDLFYYL
ANGAEKEDPPKHTVVVDGGLALVAGSDTTSSVLSSIFYCLLRHPDTYDRLQA
EVDKFYPPGEDSLDPSHLSDMNYLEAVISEGLRLFPAVPSGSQRAPEIGTGGKL
VGPYYIPEGTQTRIHFWSVHRDPRYFSRPEAFWPDRWLIAEGLQAHAAGDEPF
VHNPNAWTPFSFGPSNCVGKNLALQEMRMVLVHLMHRLIVRLADGWDPAQ
YEREMEDRFVFSIGRLPVVVERRD
 
>CYP5035A4 pc.53.86.1
 
MAAREAILIIALFAVVSTISHVIFRQWEVMHCSVVLGLLVVIPAVLSTPLVSDF
GIPCGLALGFTTYFAVLLLSITLYRISPFHPIARYPGPLLAKISKIYHVSKIWSGK
QHLYLQRLHEKYGDIVRIGPNELSIRDVSCITPALGAQGMPKGPMFNGRHLW
PETHSLIGFRDPKEHQRRRRPWNRAFNTASVKEFNPIIQARVQELGDAFAARE
GQVVDLAEWIGFFTYDFMGDMVFGGWTHMVREGADHNGLWQLIKSGMKV
SFVYEHIPWLSYYVKKLPGAGSDLKMMRAMAFGQTEKRYATTTTTRDLFYY
LTNEDGSEKVDPPKAVVISDGALALIAGSDTTSTVLTSTFYCLLRNPETYKRL
QEEVDMFYPAGEGSLDPKHLPEMHYLEAGLRLFPAVPSGTQRAPEVGKGGK
AIGPYYIPEGTQTRLHFWSIHRDPRNFSHPEMFWPDRWLIAEGLQECVGEKLV
HNPNAWLPFSFGPSNCVGKNLAMQEMRMLVCHLVQRFNFRFADGYDPAQY
ERDWQDRFVVMIGQLPVTIERRA
 
>CYP5035A5 pc.1.6.1
MPSGILGQLQSLPAKLTAQDATLVAHVIFKIWEPMQARIVSLLVIIAPLLLSTLF
IPHYGTVSGVFRSFAIYLTTLVSSIVVYRLSPWHPLARYPGPLLAKVTKLYHAL
MVSKGKQHVYIKALHDQYGDIVRIGPNEVSIRDAACIQPLMGAQGLAKGPSW
SGRSMFPPISPLIGIRDPAEHARRRRPWNRAFNTNGIKEFMPTIQTRVQQLAEH
LGERHGQALDLAEWFSFFTYDFMGDMIFGGWTEMMRDGGDLQGLWTRVK
AGLQHAGMVPEHVPWVAYYAKKIPSVVRKVSEMRGMGISRAKMRYQQGST
SKDLFYYLSNEDGSEKVTPPPEVVTSDGALALVAGSDTTSSVLSNLFYCLLRD
PVSYKRLQEEVDKFYPPGENSLDPRHINNMPFLEAVINEAMRLYPVVPSGSQR
SPEIGKGGRAVGPYYIPEGNQARVHFWSVFRDSRNFSHPETFWPDRWLIAEGL
QESPEKITHNANAFVPFSFGPANCVGKNLAIQEMRLAVTHLMHKLNFRFADG
FNPDEWDSQIQDVTVMQLGKLMVVVERRD
 
>CYP5035A6 pc.42.19.1 (genewise.42.13.1)
HLIYKRWEPLRLSVTLTLLMGVPAALSVLLIPHLGLLRGALATFSLYLSTLISSI
VAYRLSPWHPLARYPGPLPARVTQLWHTWQAHKGQQHLYLKQMHDKYGD
VVRMGPNEISIRDADCIVPLYGPHGLPKGPSAGRQMHPQELSLIGYRDPARHS
VRRKPWARGLGTAAVREYMPALRSRVSQLVDALGARSGHPVDLAEWIAFFA
YVFVSDCSLSXLANCQSGMTLWATWRECHGAGAVFEQVPWLAYYAKMLPA
ISQRILKMRKLTVRHATRRYNSGSFSRDLFHYLSGEDLPEGSARPPQHIVAAD
GLLAVVAGSDTTASALSNLFYCIMRHRDVYKRLQQEVDQFSPLGDDSLDPQH
LNNMPYLNAVINETLRYLPSVLSGSQRAPLIGGGGVSVGPYYVPEGNQVRVH
FYSVHRDPRYFSDPDRFWPERWLIADDRQPSSEKIVHDDRAFIPFSYGPSNCV
GKGLALQQMKSTVCHVMAKLEMRFADGYDPDTWEEQVQDEGVMIV
 
>CYP5035A7 gw.54.121.1 (genewise.54.12.1)

KSVHLIFKRYEPMHILVVSTLLLLLPAILTVPLIDQLGIAKGFLVAFATYFATLL
SSITLYRISPFHPLARYPGPIIAKVSKIYHVAQVWSGKQHLYLQRLHDRYGDIV
RFGGFSTCSLRGPNEVSIRDVSCIAPMLGTQGMPKGPGKHCWPEIHTLIGCSDI
KEHQRRRRPWNRAFSTAAMKEYNPIIQKRLQELGDALAARQGEVVNLADWI
SFFTHVPWLSYYTRNIPGATNDEFRGMVFGQTVKRYACTGTTKDLFYYLNED
GAEKEDPPKPIVVVDGAVALIAGSDTTSTVLASTFYCLLRNADTYKRLQAEV
DRFYPPGADSLAPDHLPEMHYLEAEALRLFPAVPSGSQRTPERGSGGKTIGPS
YIPEGTQTRIHFWSVHRDPRNFSRPETFWPDRWLIADGLQKDEGVEFVHNPN
AWIPFSLGPANCVGKNLALQEMRMVLVHLLHRFSFRFAGDYNPEQYEWDIE
DRFVVAVGRLPVIVERR
 
>CYP5035B1 Phanerochaete chrysosporium
scaffold_247a
MGTEYVLPLLRTNILKLPTNMTRNDALLAVGGAAV
LCHLIFKKWEPTYIPAVVTLLLVVPLGLSALLVPHYGQLLAPLVALATYHTILL
TSIALY
RLSPWHPLAQYPGPLPAKLSKWWMVWQERDCKQHLYIKQLHDRYGDIVRIG
PNELSIRNV
DAVAPLMGTNGLPKGPS
LRGQGLEPPITGLIAIRDPAEHARRRRPWTRAFSTA
ALKEYEPILVKRISQLCEQLASQKGTLDLATWFSWFTYDFMGDMV
RFGGGSEMLAHGDQDGIWTMFKEGGE
GQMMYHHIPWLAHYAKRLPMSPALKKMRGFALGRTAERYKKGASTKDLFYYL
SNEDGVEKTPPPAAQVISDGVLATIAASDTTSTTLSNAFWNILRHPHYY
KRLQAEVDKFYPVGENAFDTKHHSKMTFLDAVL
NETLRMYPVLPSGSQRAPFPGNGDRVVGP
YYIPDGTQARIHFWSLQRDPRYFSHPDTFWPERWLIAEGLEPAPAGEKFVHNP
NAFIPFS
FGPSNCVGKNLAQMEMRMVFCYLLQNLDFELDKSWNPAERENATEDQFVLL
MRSPVQVTVRRRV*
 
>CYP5035B2 pc.54.66.1
MGSEVAMQLLRANISKPFPRLDQNDALAVVVGAALVVCHLIFKHLEPTYIPA
VLFLLVLVPLYLSALLLPHFGPLLAPVIAFTTYHTSLLTSIGLYRISPWHPLAKY
PGPLPAKLSKWWMVWKERDCKQHFYLEDLHKRYGDVVRIGPNELSICNVDA
VLPLMGPDGLTKGPCSTIGQGLEQPIPSLVSIRDPAEHARRRRPWTRAFSTAAL
KEYEPILAKRISELCEQLAQQKGSLNLATWISWFTYDVMGDLVFGGGNEMLA
TGDQDGIWAMFEKSGEGQMVYHHVPWLAHYAKRLPMSPALKKMREFALGR
ALERYKRGAVTKDLFYYLSNEDGSEKIPPPPAQVIGDGVLATVAASDTTSTTI
ANTFWHILRYPHYYKRVQAEVDKYYPPGESAFDTKHHNKMTFLEAVIHETLR
LYPVLPSGSQRSPVPGKGDRVVGPYYLPDGTQARVHTWSLHRDPRYFSRPDT
FWPERWLIAEGLEPAPAGEPFVHNANAFIPFSFGPANCVGKNLAYLEMRMVF
CHLLQNLDFELDKRWNPAERSRSAEDQFVLYMRCPLPVTVRRRV
 
>CYP5035B3 PFF_271(pc.92.54.1)
MGAGYVPQLPRSYALTSFTGTSQTHALTPVIGAALITHLNFKRWEPLNLPIVIFL
PLVAPLGLSALFAPGPSYGSLVAPFITLFLYHAILLASIALYRISPWHDLYHYPGP
LPAKPSKWWMVWKERHGKQHLYVKALHDRYGDVVRTGPNEISIRDVAAVV
PLMGTKGLPKGTAPWGEPIVPSVVPLIGIRDSVEHARLRRSWARAFTAGALKG
YEPVLTARITQLIAKLGSQTGEXIDLALVLSYFSYDFMGDMLYYGGGSELLAEG
DNDGVWALGQMTCHHLPWLAKYKDLLPASPGLQKMRSVALQRTMARYKNG
GLGKDLFYHLSNEDSAEKTSPPTEQLVSDGVLAVFTASDTIATVLSNVFWSILR
FPRYYEQLQAEVDKFYPARADAFDTAHYGEMVWLDAITNEALRLYPIVPSGSQ
RAPAPGDGARVVGPYVIPAGTNARVHTWSLQRDPRCFSRPDAFWPERWLAAA
GLGDSEATCEGPEGAAGAAEDFVHDARAFEPYFVGPLDCIGRALAQLELRMVL
CALLQRLAFAPARGDPLERERTLQDQFVVMMRGPVMVSVSWRA
 
>CYP5035C1 ug.43.44.1
 
MSAVLNSLSPTESVLAVVTCALATHLVFNRFEPTELPVVAAALVGLPAVLSAL
LVGHFGLLAGAALAFATFHATLAASIVLYRLSPFHPLARYPGPLPARITRWYW
ARVACGGRQHLELKRLHDVYGDIVRIGPNELSFRDISVVQPMMGAQGMPKG
PMWDGRTFKPPILPLVGMRDVADHTRRRRPWIRAFTPAALKEYEPVVAKRG
AQLIEILAQKKHTDFVHWIHLFTFDIMSDALFGGNPGAEMMSHEDKDGIMHS
MKTGFEAGQLFEHIPWLGYWMRHFPQLATATKQYRAMCFQRGMQRYQDGS
SQKDLFYYLVNEDGAEKQTPDKATVIADSALAIIAGSDTTASVFSNMVYCLL
KNPHAYKRLRAEVDEFYPPEENSLDPKHHSKMPYLEAVINETLRLYPVVPSGS
QRAPELGTGGTLMGTHYIPENTSVRVHFWSVHRDPRNFSQPESFLPERWLAA
EGLEAPPAGLSGAPQDADSAAARGTFVHNANAYMPFSFGPWNCVGKALALL
ELRCVATHMMQRLDVRFADGWDPAEWDAAMEDKFVIKTGRLPVVVERRF
 
>CYP5035D1 ug.50.50.1
MITVNRPSTQDSVIATVACAIIVYFVCKRWEPWRLAVVVPLTMLPPLVLSLPL
AASVGFLNAVITTSSTFASALVAMLTFYRLSPLHPLSQFPGPLQCKISGFWMA
WIVSRGKRHVYIQSLHEKYGDYVRIGPNEVSINDPTAIPQILGSYGWPKASGM
SGRALHQDPLPLISLLDDAEHSRRRKSWNRAFSSAAVREYQPVVAQRATQLV
QALQEQRSVVDLSQWMRLPSDIAFVNLHDFLYSFGGGTEMLREGDKAGLWQ
LLKDGMSTAVPFEHVPWLSHYILHFPALIRSLTDLRALAFGRSKLRYERGSTT
KDLFYYLVNEDHADAEMPPMKVVISDAVLAIVAGSDTTAVTLSNIFYYLISHP
DTYRRLQEEVDKYYPPGEDALNPKHYVKMSYLDAVLNEAMRLYPALPSGSM
RTPAKGSGGQVVGQRFIPEGTQVRVHPYTIQRDPRNFSYPNRFWAERWMIAS
GNQSHHEKIAHNPDALIPFSTGPRNCVGKNLALLEMKMVTCHVTQRLSLCFA
DGWDPSRWWDDLEDVFVSRMGQLPVIVRSR
 
>CYP5035E1 pc.97.5.1  (genewise.97.3.1)
MQAAHLVFNRWEPTNIAVVAALLLGVPCAAASLLFSGRGFVPRLALTAALY
YACLGTSVVLYRLSPWHPLARYPGPWLLKTSKLWMVRRVKRGGQWRYIRE
LHQRFGDVVRIGGPNELSFCDAAMVVPVLGTQGLPKGPGLIALRDPLEHQRR
RRTWNRAFKPAALQEYLPLIQKRTAQLLDALSKCEEQDVVDLGRWIRFCKYA
HMPWLAQFTKHIPRVAAKLKELRAAARSRAAARYKAGANRKDLFYYLVGD
SNEDGGEKEQPTEDVILSDALLAIIAGSDTTSTILTSAVYCLLTHPDVHKRLVE
EVDKFYPPGADWCNTEHHADMHYLNANETLRLFPVLRDGSLRAPWVGHGD
RALGPSSFIPEGTQVRVHTYSLQRDPRCFSQPDTFWPERWLVAGGLQHAEPGF
VHEPGAFLPFSRGPSDCVGKGLALQDMRIVLCALLQHLELAPPRNRPFDEWK
AEVDRRFADSSAVLPVSVRVRRRV
 
>CYP5036A1 PC-hn-1(pc.15.127.1)
MDALDPRIAIPIVYFIYKKYEPSSPRSAFFLLLALPGVLAVALRVCERFNSYAT
AVPTVYLAYWSLLSTFVVAYRISPFHPLARYPGPLLCKISKGWLAYVAGKGG
KAHLYVQDLHMRYGEVVRIGPNELSITHQDFTRVVLGAKGLPRGPYYDSRQ
HEAGMSLDGMRDQALHAIRRRPWARGMNTAAMKYYEELIRNTLSDLIAGLK
QRTNRPVDISEWTNYFGFDFMGQMAFTRDYGMLKNGYDKEGLLDLIEHAM
QDSAWISHITWSIPFLRYVPGASKYWDQMKAVGERAVADRVALGSNHRDLF
HYLMDEDGHEAVRPTKALVAVDGQLAIIAGGDTTATTLSHIVYFLLRYPIYLD
RLRKEIDETFPDGADSTLDFTKQTNMPFLNACINEALRLYPPVLAGLQRRVEP
GTGGKMIGPYFVPEETQVSLFAYSIHRDPQHFSPLTNTFWPDRWLSQEKYTLP
SGDVISADEVVTNRDVFIPFSQGPMVCAGKNVALTEMRSVMCALLQHFDLKI
ADQSFLDSWEDKIEETFTTKRGTLPVILSLRA
 
>CYP5036A2 ug.170.56.1
MFFPLDSWVAVPVTSAVTYFIYKTLEPSSPVSVLLLLGIAPGALSWTLYGGSG
SVIAHIFTVYVSYWALLVTYTIAYRLSPFHPLARYPGPLLCRISKAWLAYIVAT
SGKMHLYIQNLHMRYGDVVRIGPNEVSVTHRDFIGVIGPKGLPKGPYYETRA
HKAGTSLNAIRDQALHSVRRRPWARGMNSAAMKYYEELVEETVGDLVASL
KRRTTKAIDFSEWMTYFGFDFMGHMAFTHDYGLLKNGYDKEGLTPLIEHAM
QDVAWVSHVPWSIDYLRHIPGVYKHWLRIKALGVQTVRKRIALGSTRRDLFY
HLMDEDNRERVKPDINVVAIDGQLAIIAGGDTTATALSHLFYCLLRHPQYLER
LRKEIDDAVPIACGSFKLDFSKLPAMPFLNAFINETLRLYPPVLSGLQRRVEAG
TGGRFIGPHFIPEQTQVSFSAYTIHRDPRYFSPLTDTFWPDRWLVQEQYVLPSG
GVIPAAEVVTDRDVFFP
FSQGPTVCAGKTLALTEIRSVACALLQTFDISNADEASFDAWEDNLEEMFTTK
RSTLPVFLSLRT
 
>CYP5036A3 ug.128.46.1
MSAIFAASSTTYIIFKRFEYSKPVPALVLLIGVPMTLASVLRDHFAGLATAMCA
TAAVHWVLLTLFVAAYRISPFHPLARYPGPLPCKLSKCWMAYLAGSGGKTH
VYIDRLHRLYGDVVRIGPNELSVRHKDACTTVLGAKGLPRGPYYDTREHDNG
VSLDGIRDPALHAVRRRPWARAMNSASTQYFEELIQHTVSDLTGGLKERAGE
SIDLTEWMSFFFSYDYNMLKEGRDTQGLRRMIDQSLVDLRWISHIPWSIPYLK
MIPGASKNWDDMKAAGDKVARHRVSLGSSRPDLFHHLRDEEGHEAVRPQLE
VVSVDGALAIIAGADTAATTLAHFWMFMLRHPACFERLRKEVDATFARDDG
PDFVKQARMPYLNACLNETLRLFPPVLAGLQRRVGRGTGGRMIGTHFIPEDT
QVSLVAYTVHRNPDCFSPFPDTFWPDRWLTQETYTLPTGEVIPSSDVLTRRDA
FMAFSQGPMACAGKNVALAEMRAAVCAVVQRFDLVLAYERALDEWEEVLQ
ECFVSKLGKLPVQVVPRN
 
>CYP5036B1 genewise.22.99.1
MALLQLAQDVFRRSHPASACYALYHKYLNKPVHPLYHFCLLLVVPAALLAL
LQAIHRISVAQECGYALFYWLVMASATVVYRISPFHPLANYPGPLAAKISKLY
LAYLTAKGRAHEDVRALHSKYGDVVRIGPNELSFNRSDAIQTIYADKTMPKG
PYYVARTNLAGVVQLDGVRDFKEHARRRRPWNKAMNSAAIKSYEPIVSSTA
SQLLGQLSKRIHNDVNISDWMSFYGFDFMGRMVFGREWGMLEEGRDVNDY
WHTMDKCLTIVSWSSQIPWSVPIIRLMKPPPEVLKMQKISDDSAMARLTSEGS
GVKDLYYYLLNEDDSSKSELTRDECISEGVLAIVAGSDTAATALTHLCYYLLT
HPDSLQRLRQEIEEAYPTLGSELDDLSRQAEMPYLNACINETLRLLPPVLTGLQ
RSVTAGSGAIIAGYFVPEGVDVSVHHYSVHRNLQDFSPIPDTFWPDRWLEQD
AYVLPDGDVIGKGEVRTNRGAFMPFSVGPQQCAGKNLAMVELRAVACGLFR
RFDLSLSERMNICDYEKGLRDAYTTVRGPLYVKLKPRKE
 
>CYP5036C1 ug.36.48.1 = genewise2nd.119.1.1|whiterot1 = scaf154
47% t0 5036B1, 45% to 5036A1, 46% to 5036A2, 45% to 5036A3
not in tree
MVDRVDPRLILGSSSLVSVACYLIYKHSEPNNIPAHAALLLGVPALLVHQLGT
HWSILQQGGAFVAYWALILAFTGLYRLSPIHPLARYPGPTLGKLSKIYLSYLSA
RGDIYRVIKGWHDKYGDVVRIGPNELSFRHVDALQPIMGTKYTVKGPYYDTR
TTPEQITQMDGIRDYSVHGQRRKPWLRAMSSAGLKGFEPIVKMKALELVEEL
SKKVGEIIDMSEWMNLFGFDFMGHLAFGREFGLLKSGNDHDDMIRTVEDGV
YGAGVISHIPWIAFLVHFPPAMKGLRAMQQMAATFARERTQKGSTTKDIYYF
LTEDEGAAQSGATHDEVIADGMLALIAGSDTTSIALSHVCYFLLRHPACAARL
RAEVDRAFPPGEDVLDFARHADMPYLNACINEALRLLPPGLGGLQRMVRRG
TGGAMIGPHFVPEDTKLSVHLFSLMRDAREFAPLPDAFWPERWLAQDTYVLP
TGDAVSKEHVTTNRAAFIPFSVGPQNCAGKALALVELRAVTCALVSKFELHK
PKDYDLDQWEGDLLDLYISIRGKLPVILQARQGR
 
>CYP5037A1 PC-ln-1(gx.1.22.1)
PPGPPGLPFVGNAYQIPHDKQWLRFDEWIRRYGDLVHISVMGQPTVIIGSAQT
ASELLDARGSIYSDRPQAVMAGELVGWDQGLGYAPGPHSPRFREFRRLFQQF
MGPRAAQDSSMLAAQEKSATRLLSRLLSTPEEFITHVRQVTGALILYLTYGYE
VDEDGFKDPLVNIAEEAMLGFARASDPGAYLVDTMPWLKYIPEWFPGASFK
QDVKAMRQARERLYDVPYNFVQKAMAEGPVPRSFVSTYVEEKATPAFADEE
LIKAAAASLYSGGADTTPSSLASFILAMTLHPDVQRRAQVELDSVIGESWQRL
PTFADRPNLPYIDAIVLEVLRWHPAVPLGLAHRLSQDDVYRGYYFSQGTVFW
ANIWTMLHDEIIFPDPSRFMPERYLDEHGRLKSMSRFEDPAVIGFGFGRRICPG
MHFAHNSIFIAIARMLYVFNFTKAVDKNGNEITPEVEYSGFISHPSPFVCSISPR
SIAAAELVVQ
 
>CYP5037B2 ug.1.25.1
MIRPLTLLDIALATLAVVLLKTIIARSKQRARYPPGPKGLPVIGNVLQMPKDRE
WLTFAQWGEQFGNIVYLSLLGQPMIILNSAKDAVALLDKRSSIYSDRPILYMG
GELIGWKYILGLTPYGDRFREYRRLMAKFIGGKTQVERHFPVMEQEATSFLK
RILRRPDDLGANIRTHAGAIILKLAYGYTIREDEDPFVTLADRAMAQFTEATTP
GAFLVDVFPLLRHMPAWFPGASFKRTAQEWSDTLNSMADVPHAFVKEQMA
KDTEVPSFTSELLRDEKLQEGQEFNIKWSAASLYAGGADTTVSSIHTFFLTML
LFPHVQKRAQAEIDSVVGTDRLPTFEDRAKLPYVEGVLKEVLRWHPIGPLGA
FLVLFFSLGLPHRLAQDDSYEGHLFPKGAIVIANIWCVSISTSWQCLYSPCCRK
CLHDPDVYPNPSDFDPTRHLSENGRSPQPDPRDYCFGFGRRSYPGLHLADTSI
WITCATVLAAFNIENVVENGRVIDIVPEYTSGTISHPKPFRCSIKPRSTRAEALIF
SD
 
>CYP5037B3 pc.1.248.1
MPSALSLLDFAFAALGLIIVKAFLSRTRRQGPYPPGPKGLTIVGNALEMPTSRE
WLTFSEWGGRYGDIIYLSLLGQPMVILNSAKHAIALLDKRSNIYSDRPVLVMG
GEMIGWKYTLALTPYGQRFREYRRFIAKLIGGPTQMQTHLPLEEHETRRFLKR
LLNEPERVADHIRKTAGCIILKLSHGYDVREGHDPIVDLVDTATEQFSLATSPG
AFLVDVFPLLRYVPAWVPGARFQKTAREWRKVLERMADEPHDFVKQRMAE
NTNVPNYTSELLQNERLDGDKEFNIKWSAASLYSGGADTTVSAIYSFFLAMTL
FPHVAKRAQMEVDAVVGSDRLPTCEDRPNLPYVEALVKEVFRWNPVAPLGL
PHRLIEDDIYEGYFIPKGSFVIPNIWSYSTSHILHDPNHYPNPFEFDPTRFLSDEG
RTPQPDPRDYCFGFGRRICPGLHLADVSVFLSCAMVLATFDISKAVENGKVIE
PEVEYTSGTISHPKPFKCTIKPRSTKAEALILSADD
 
>CYP5037B4 ug.1.26.1
MSTPLTYLVLLSAILTVVLIRTAIARRKRWARLPPGPKGLPIVGNVLQMPKSQ
EWLTFSRWAEQYGDIVYLNILGQPLIILNSAEDAVALLDKGGSIYANRPILAM
GGELVGWNRTLALTQYGERFREYRRLIARFIGGKAQMARHLPLVERETRRLL
QRILNNPEDLAGNIRKTAGAIILTLSHGYRIREDDDPVVAHVGRALEQFTEAST
PGAFLVDVFPILRHVPAWLPGASFKATAKRWGETLEQMADVPHNYVKEQMA
SNKDIPNFTSELLRDEKLGDIDSKEFNIKWAAASMYSGLGPQTVSSIHSFVLA
MVLHPHVQRRAQAEIDAIIGPERLPTFEDRAALPYVEALFKEVLRWNPVGPLG
LPHRLSQDDVYKGYLLPKGSIIIANIWSFLRDHNLYPNPSDFDPTRHLPKNTEA
ASQPDPRNYCFGFGPDVLAGQHLADASVWLACATMLATFDIENLVASDGTVI
GVEPEYTSGTVSHPKPFKCSIKPRSALANALINAGMPE
 
>CYP5037C1 pc.2.111.1 (genewise2nd.2.36.1)
MTSTNILLISHGHRVKDSNDRFLKLSDAVTDEFSEAVAPGAFLVDQFPLLRHL
PSWVPGTAWRKTAEKYRRHVADAVAEPFAFVKQQMAAGTAIPSFVSRNLDD
GAAPSPDHEHTVKYAAMALGCADAADAAWQTASALTSLFLAMTLYPEVQR
RAQAELDGVVGTDRLPTFEDRDRLPYITAICAEVLRWMPVGPLGLPHRLTED
DVYEGYALPKGTIFFVNNWKLLHDPDTYRDPMAFMPERFLGAAPELDPSKIA
FGYGRRICPGILVAEATIFITVAATLAAFSIRPAQNGGAPSLPPVRQTSGIISHPA
PFQCDVVPRSKKAEALVVAAVENR
 
>CYP5037-un1 pseudogene genewise.25.75.1
scaffold_11 (1449124 bp) : 785586:787348 (1763 bp) (-) strand
32% TO 5037B3, C-term half = 40% to 5037B2
gray X = frameshift
MLLRIWSRSQSFHPNVSALLFVTTRACHLMSC
RLPLPPGPRARWYGTIGMPTKSQWLNCHGRKCTVRYGSHYDAPGLLMYFHIVENPMVV
LDTAGTVNDLFEKRGTSYSSRPVTTMVNELYAYAQYSLGGYSLAH (?)
WRKHRHLFHQHFNTSAMHVSRPVVLREAHTFLHNVSRTPVDDWSIVFGGH (2)
ADAIVTMLSYGHQIAPEGEMYVDTRTRLSQ (0)
AYLLLLLAKTRTSL
IYDVVKHVPAWFPGAAFKKQALQWREANRMMLNVPLEKVQ
67 amino acid deletion here
QIDRIFASDKLPTF
ADREDLX
YVDCIVWEYLRWNP (1)
VTPLGLPCQVTEDDTYCGX
YIPKGATIVSNTWY (0)
MYPYLLLFDPDRFADASRNASLGIHELPNAAFER (2)
MCPGRVLAFETI
WITIATTLAGFHLSEPRDEHNEVIQLDTPNTPKLLS (2)
HPKPHQCAVSPRSERALFLVVESLDG*
 
>CYP5136A1 PFF_311a
 
MSSLLVLVAISLALSQLIRFYRWLFHHSISYLRGPVADSFILGNVREFTYQESV
GDLDFRYMNEYGTAWRMKSILGSDVLMICDPKALQHVLHKSGYHYPKNTEA
RIGSFNVTGRSILWAPNGDIHSRHRKIMNPAFTAQQLRSFLPLFRRGSNKMCQ
LWKDEVLAQAPTGMTIAVNQWLARTTLDVIGEAAFDFSFGALDDADNEVSK
AYHNMLFADSLLYPSAWSTIFRGLWRFIPDQLLSYVRYLPTREYTRFRYTLNII
NKVSKSLIDQKSEDLLSGDKSSKDVMSVLVRANSSENPRSQLSEEEMVSQMA
TLTLAGHETTANTITWLLYELAKHPEYQQKMREEIAVKRAEINARGDADFTM
DDLESMQYLHAALKETLRYHPIVYHLAREASKDDVIPLAYPVTTIKGETVSEI
PIAAGQIIMPNIAAYNRLPQVWGDDAHEWNPLRFIDDSPEVQVRLGMFGNLM
SFFAGVRGCIGWRFSLIEMQAIVADLVENFQFSIPPEKPEIIRVPAGIMGPMVK
GKMHEGLQMPLHVTPL
 
>CYP5136A2 pc.142.11.1
 
MAAAALLIICWLVVNLRRLLTHNSIRHLRGPPSASTLFGNVTDTLYQASVGDV
EFRWLKEYGGAWRLRGLLGANILALADPKALQHVLQKSGYNYPKTRQLSVT
LFNLTGRSILWAPTGEIHARHRKVMNPAFSVPQLRSFIPLFRQSAKKLTQIWKDQV
NAGHPDGVTLPVDRWLARATLDIIGEAAFDFDFGALDNTENEVSKAYHRMF
ADSQLYPSVWNLLFQATWSLLPEPLLYYIRYLPTREYKTYRSTLSVMDKIAAQ
LIEERTREFGAGDPDKSRKDVMSVLVRANMSENPSTRLSDEEMRSQMFAMTL
AGHE
TTANTVTWMLWELAKHPDIQEQLRQEIAEKRMEVTANGSYEFALDDLESMP
LLQAVIKETLRYHPISSFLWRVAAKDDVIPLEKPIVTTTGETITEIPVAAGQVIM
PSLCSYNRLAHVWGEDAHDWNPMRFLQGDTEKQTKVGMLSNLITFSAGVRS
CIGWRFSVLEMQAIVVELVENFRFSLPDNKPEIIRAPTMTMGPMVKGKLHEGF
QMPLRVVPV
 
>CYP5136A3 pc.16.161.1
MAVIDYTLHASSPLVLLACTVCVAVLAFRWYSSSTHGSIAHIRGPPVKNPILG
NIRDFSYQENVGDLDFAYMKEYGTAWRLKSSLGKSVLMVADPKALQHIFHK
SGYLYPKTTPSTVRSFLVTGKSILWAPDGNTHSRHRKIMNPAFSAPQLRSFLTL
FRKSSSKLCQLWRDEISPEGSTVLVNKWLARTTLDVIGEAAFDFDFGAMQDN
QNELSVAYDNMFTDATLHTSPWNAIFEALWDYIPDGILKQVQHIPTREYARF
KQTLGVFAKYSKRLIAQKSADLVSDTHSKDVMSVLVRANAAEDAGRKLNDE
EMVSQMSALTLAGHETTANTISWLLYELAKHPDFQEKMHAEIVAKRAEIVAR
GDEDFTMEDLESLEYLQAAIKETLRYHPIAFHLNRMASQDDVLPLAYPVMTT
AGEKVTEIPVRKGQAIMPNLAAYNRIPEIWGADAHEWNPMRYIENRTDAQVR
VGMYANLMTFSAGVRGCIGWRFSLIEMQAIISDLVENFRFGLPKDRPEVLRVP
AAVMAPMIKGRMEEGAKLPLHVTVY
 
>CYP5136A4 pc.16.153.1
SGACVLCLAWLAYRWYRWTTRLNISYIRGPPVKSWILGGNVRDFAFQENVG
DLDFKYVQEYGLVWRMQQPLGAQVLMVADPKGDIHARHRKAMNPAFNNA
QLRSYYPCFRRTSSKVCQLWKDQILSQGPNGATIRVDRWMARAALDIIGEAA
FDFDFGALDDSANELSAAYHNMLSADSTLRPSAAQAVFQGLWTHAPLRVLE
RVRHLPLRDIARFQHAMRVFNTYAARLMARGAAGAAHGRDVMSVLGTAHA
NASADPRTRLSAEEVRAQMCALTFAGHETTANTTTWLLWELARHPPAHQDS
VRADTVRRRAHVAARGDADFGVEDLDALPCLEAAIRETLRCHCIVFHLNRVA
SQDDVIPLSRPLTTATGKTVTEIPVAAGQVVMPNIAVYNRTRTKWDPTRFLDG
RVDNPEVRLGVYGNLRTFAGGVRGCIGRHRMIEMQAIVADLIGHFRFSIPDDK
PEIVRAPSMLMAPMIKGKEHEGSQMPLHV
 
>CYP5136A5 pc.14.209.1
 
MFHGYLSATFQAQRRPGNIKDFTYQQNVGDLDFQWVKQFGRVWRMQSPFG
TDILALADPKAMQHCFHKADDQYNKRVESTVGSRMMMGKGLVWASGTTHE
RQRKIMSPAFTTAQIRSFLPFFRAGAAKKWRDELFNHSTDGAAVPVNKWFSR
ATLDILGETAFDFNFGAVDDKDNEVTLAFHTMLFANSCLRPPKWDLLFKRIW
YFLPNPLLELVQYVPTKEQNRFRRCRLVVEKVSQQLIQEKREALLAEAKSSRD
IFSVLVRANVSENPNSRLSDEELIAQMGTLVLAGHVTTATTLSWMLYELARR
QDYQDKMREEIVAARARLQERGQQDFSMEDLENMHYVSSCLKETLRFHPPV
YHLFRQANTDDVIPLEQPVRTTSGKYVTEIPVAAGQQVLFSVCAYQRLPEVW
GEDAGIWNPMRFIDGNVDKQSKLGLYSNLMTFSAGSRGCLGWRFTIVETLAII
VELLEHFKFEPTEDTAKVIRVPTGIMSAFTAGKEREGPQMLLKVVPIL
 
>CYP5137A1 pc.5.122.1   (SEQ ON OPP STRAND FROM THIS MODEL #)
 
MNNLTAALILVALALWFACRRFTRTTLRDIPGPKPVSFWLGNLEQYFLGQAG
EGDFHLQERYGRIARLHGSIGGEYLWISDPNALRYIFQTSGYRYAKQPERRAL
SRLHSGHGLVWADGEVHKRQRKVMLPAFGAPESKALLPHFARAAEAVSVK
WKDILTTAPSLSKELNVSTWLSRATMDAIGEAAFDYHFGALENTDTDIVRAY
NNLMPIVFGAPTADAIFKRDALRIFRSSRIVEWIYDRQRNPAVEKARECEELTL
KIARELVENKAEALEQGKGSKDIFSLLVKANMTEDAKSRLSEEEMYAEMRTI
LFAGHETTSTTISWVLLELARHLPVQERLREEILAHKRGGELSATDLDGMPFL
QAVVREALRLHPVLNQTFRQAEQNDVLPLAHPLTDRTGTVLTALPISKGTRVI
LSIAAYNRDTELWGSDAHAFDPDRWLDGRVKKVQTLGMYGNLLTFAAGVR
GCIGWRFAVYEIQTFLVELLANFEFRPTEDLKRLRREPCGVMVPTLEGDRGTV
QLPLRVSLLDHKI
 
>CYP5137A2  PFF_88 NOT IN TREE
MHDIFPLAVLLGALLWIVRRILSRSSIRDICGPEPESFWLGNLKQFFMRQAGEG
DFELQERYGRIARLHGSIGGEYLWVADPKALQHIYQASGYNYAKQPERRALS
RLHSGHGLVWAEGEVHRRQRKIMLPAFGAPESKALLPHFIHIAESLSMRWKDI
LLASRDFAKELDVTEWLSRATMDAIGEAAFDCQFGALDNGGSEVLRAYNDL
LPMVLGVPTTDGIWKRDAMRIFNSSAIVEWIHDRQTNDVLQRARECEQMVM
KVAKELVSSKAEALVQGKGSRAYFSLLVKANAAEDAASRLSDEEMYAEMRT
SSLAGHETTAMALSWALLELAQHPEVQSRLREEVRGCKRGEELSAAVLDSMP
YLQAVLREVLRVHPPAIHNFRQAVRDDVLPLAHPITTKSGSVLTELPIQKGTR
LILSIAAYNRDPDLWGSDPHMFDPDRWLDGRVKKGQVVGMYGNLLSFSAGV
RGCIGWRFAIYEMQAFLVELVSNFEFGPTEDLKRLRREPCGVVAPMLEGEQG
VQLPLRVSLANYDV
 
>CYP5033A1 Ustilago maydis
36% to CYP5034A1 GenEMBL XM_399595.1 37% to white rot Scaffold_7 C-term
MAISTSSRLVIHQDVLSWLQHRPFASAFTLLVVYITYKLAIKPILFPSPYRHLPRPERASYILGQRI
VEANGLTYIDASTNQRVKVSGPGEVCKHYARTLDTSVFVFPEPFGGETLFISDPFALNAILADVDKF
QSDLLRTTIIEFIVGKGIVARFGDAHRKQRKLMAPAFTPAHIKGLTPIFAKYAQLMCHKIALADDES
VDFAEYLDCTMLDIIGEAGFGYRCSALERGRGGSELSSAFNSVNQAAIDFGPARAIHLGLSAMLYPR
ASIWPLSEANRRIAKVNRVMDRITMQIVREAKSRVEKEGEDLGDKKDLLSLLIKSNLDARIGERMTD
KEISGQIQTFMFAGYETSSVTTSWTLYFLARHPEVQNKLRNILTATLSERKGIPLEELDVSTLEYDD
VWCQDLEYFDWILAETLRLCPPLSGNDRQAMQDSVLPLMTPVKMTNGENVSQLMVKKGSRLTIGIKT
VNCDRKLFGDDADEFRPERFAELPQRHAEAKLPPYATYSFFGGPKSCIGSKFALTEMKVIIIAVLSR
FQLSPEP GVTIKQHQALIVRPRVETSTGGPAAGMPLRIKRLPHQVSV
 
>CYP5138A1 pc.65.27.1

MPLLSAVPAAALPLLGAALYVLWTFLALLVRQARSPLRHLRGPPSPSFLVGNL
REMHDQENTALFARWEHRYGSTFVYHGFLGGARLLTTDPVAVAHILAHGYD
FPKPEFIRDALASMAAGHEGLLVVEGEDHRRQVRASPAFATPHIKSLSPIIWSK
ATQLRDVWIDLASSPSLTPAATPPGTKVDVLAWLARATLDVIGEAGFGYAFN
SVRAAACPGDAAEDELARAFAVIFSTARKFRLITVLQVWFPFLRRFVSIPPRCF
LALPLKSSLSSDPSQQLSTNAMLCQIATFLAAGHETSASALSWALYALARAPA
CQHTLRRELRALTLPADPSAADLQAVLALPYLDAVVRETLRVHAPVTSTMRV
AAHDAAVPVGTPFRDAHGAQHAAIRLRAGDIVTLPLQAMNKWGADAACFRP
ERWLAHGDAPREPRGLWGGVMTFGTGVVANGNRSCIGYRFAVNDVVTRPC
VKSEPHLGNQMPLRLRRVAVEETVGDSSGDGAPRTVS
 
>CYP5139A1 gx.38.22.1

MGYPLAVYAVGALVALIVYSVGPTVWHVLTSPLRHLPGPPNDSLLWGNMAA
IQNEEISVPQARWVKQYGHTISYRGVFGMWRLWTVDTRALNHILTHHLIYQR
PLPSRYQLSRLVGPGVLVTEEERHKHQRRVMNPAFGPAQVRELTEIFTEKANE
MRDVWYNEITKAGGASAQVDALSWLSRATLDIIGRAGFGYDFEALTGASNEL
NQAFSTLFARPIARHRFFARIGMQLIEQRKAAILAEKGKDVERKDLTGRDLLT
LLIRANMATDIPEDQRLSDEEVLAQVPTFIVAGHETTSTATTWALFSLAQMPEI
QRKLRNEMLTIDTDTPSMDQLNSLPYLDAVIRETLRFHSPVPVTTREAMADD
VIPLGTPTVDRYGRTIDHINIKKGDLVFVPILAINRSKEIWGEDVDDRPERFEN
VPEAASTVPGVWGNVLSFLGGPRACIGYRFSLVDIVTRPVMTGPDGKTRGAL
PLIIRPYRP
 
>CYP5140A1  pc.96.21.1
 
MNASSIDFFPRNLATSPVFSAKPFLLALSLISTYLVSVAFYRLFFSPLASIPGPW
YAAVSDLWITTHVLRMQQCRVVQDLFDTYGPIVRIGPNKVAFCDAGTMRSV
YCVHKFDKSAYYKSLLTNNNDHAMTTLPHAEHAIRKKTYAPHYTPANLALF
QPELNDLALKLTDILSIRSSSVDVLDLFRHLMVDVIACTVFGSRSGSLDNWNK
GVRDPLSIAVYDFPKRGIMVRLCLSSPVTASDNHIRSGAQCLLGPGSFIAGVDT
SSTSLSYMFWELSRRRDVMQRLQAEIDEIMPDPRVIPDATVLNRSEYLNAFVK
ECEYACHPCPAEILRDPIALHFDMMGYALPPGTIVATQAWSMHRDEDVFPSA
ETFLPERWLVDPHADREVEEERLARMHLHLVPFGVGTRQCGGQNLAHLMIRI
VVAVVVRNCEVRADVRETNERSMSMRDAFVSPLLWLLLGSERS
 
>CYP5141A1 pc.181.9.1
 
MISDTFALAISSGLSLFLCLKAFIDYRAGLRSINHSYLPGFRALISSFGILGLFFK
EPKRGLWGGRRRFWLRKHLDFEEAGVDIISHIAFLPSVSTYLLLADAAAIKEV
TGHRARFPKPTYKTLRIFGGNVLASEGEEWKRHRKVVGPAFSEHNNRLVWN
ETVKIVNDLFANVWGSQSEVYVDNVVQSVTLPMALYVISIAGFGKRALWQA
DGNLPPGHKLSFQDALHILGTDLWIKAATPTLLMNWAPTTRIANVKLAFDEV
KQYMLELIQERRNSEKRDERYDLFSSLLDANDLNEDGNGNVTLTNDELLGNI
FIFMLAGHETTAHTLAFTFGLLALHPDYQETVYQQIKSIVPDNRPPMYEEMNS
LTECMAYETLRLFPPTATIPKIAAEDTYLVTIDRAGNRVVVPVPCGTALHLNVI
ALHHNPRYWDNPSAFKPERFRGDWPRDAFIPFSTGSRSCIGRRFFETESIAILT
MILSRYKIELRNDPRFADETYEERWQRVLRVKDGLTPA
 
>CYP5141A2 gx.37.18.1
MFSNTFALAITSGLLLSCLKAYMDYRAALRSINYHPGSCALIPSFGMLGLLFK
EPRRGLWGGWRRFWRRKYLDFQEAGVDIISHIAFVPSVTTYLVLADAAAIKE
VTGHRARFPKPSYEFFRIFGGNIIASEGDEWKRHRKIAAPAFSEHNNRLVWNE
TVKIVCGFFENVWGSQAEVYVDDVVQSLTLPMALHVISIAGFGKQTVWRAD
GTLPPKHKLSFQDALHVVSTDLWIKFVMPTMLLDLAPTKRIAKVKLAFEEVE
QYMLELIQERRDAEKRDERHDLFSNLLDANDSDENGDGSVKLTDEELLGNIFI
FMLAGHETTAHTLAFTFGLLALHSDYQEKVHQQIKSIMPDNRLPTYEEMHLF
TECTAVFYETLRLFPPVTTIPKISAEDTSLVTTDRAGNRVVVPVPCGTSLHLSV
VALHYNPRYWDDPYAFKPERFHGDWPREAFIPFSAGARSCLGRRFFETEGIAI
LTMILSRYKIELKDDPRYAHETYEERWQRVLDVKDGLTTT
 
>CYP5141A3 PFF_77b
 
MNSVLVILLSTILLLCLKTYVDLRTALRAVNYHPGFKSFISCFGVFGFAFKEPR
RGLIGGSLRFWHRKHLDFDEAGVDVIHHVSFFPRVSTCLILADPAVIKEVTSH
RALFPKPLYHELRLWGGNIIASEGDEWKRHRKVGAPAFSEPNNRLVWNETVK
IMVDLFDNVWGSQDTIIVDHVVDAFTLPVALFVISVAGFGKNASWQSDLLPPS
GHKLSFKDAIHVVSVDMFIQVVTPTFLWKLAPTKRIADVKLGFEELEKYMLE
MVEERRNAPKKEERYDLFSSLLDASDSDADGGARLTDRELLGNIFIFLLAGHE
TTAHSLAFTFGLLAMHQDYQEKLYQHVKSVIPDGRLPTYEEMNKLTECMAV
FYETLRLFPPVVGVPKVVAENTTLVATDFTGKRRAIPVAAGSDIHISILALHYN
PRCWDEPHAFKPERFHGNWPRDAFLPFMAGPRACLGRRFFETEGIAILTMLVS
RYKIELKDEPAFAHETYEERWDRLFTVKQGITLA
 
>CYP5141A4 pc.181.12.1
MGTLAWVVLSFCLFYCVQKYLEFRAVVRSIHDHPGFRTLLPPYGIFGFLFKRPI
PGITRGGMSQWRGKYRDFEAFGMDIISATSVIPTARNAFLVADPAAIKEITSSR
TRFPKPVAQYRVLTFFGANIVTAEGDEWKRFRKITAPAFSERNNRLVWDETV
KIMLDLFENEWAGKDTVVVDHAVEVTLPWIALFVIGVAGFGRKMTWQEDSK
LPPGHQLSFKEALHYVSTAVFVKLATPAWLLTWAPTERMRRTNLAFKELEQY
MLEMIQTRRNSEKKEERYDLFSNLLDASEDGSDGHARLADEELLGNIFIFLLA
GHETTAHTLAFTFGLLALYPEQQDKLYKHIKHVIPDGRIPAYEEMNLLHESIA
VFYETLRLFPPVTGIPKVAAEDTTLVTTDHSGNKVVVPVTKGTGISLHVPGLH
YNPRYWDDPYEFKPERFHGDWPRDAFLPFSSGARSCLGRRFFETEGIAILTML
VSRYKIEVKEEPEFAGETFEQRKERILAARGGLTLTYVCSPPHLRNLLNLPLR
 
>CYP5141B1 pc.37.84.1
MQQHLLFAAGLICLFLVKRCIEYRRAIRAIHNYPGVRAVLSNSSGLGYLCKRSI
PGLAVGGARLWVKRYSDFCRYGADIVSCVAVLPRTEILLFVADPAAIKEISSD
KTRFSKPTELYELVNIFGRNIVTTEGDEWKRHRKIVAPAFSERNCELVWEETL
HVMIGLFNDVWGSDSIITLDNAFDITMPISLFVVAASAFGRRIPWTEGGLAPPG
HHMSFKEALHIVSTGTVIKAVLPKWLLNLGPSQYIREVRDAFREMEAYMREM
VTENMLDDTKSRRDLFSSLVHAGQDSPGQEALLTDAELLGNVFMFLLAGHET
AASTLCFALGLLALHKDEQDKLYDHIRFTLGEKDVPAYSDLTSLSYCSAVLYE
TLRLFPPVIGIPKKATEDTVLSTVDRDGNHIAVPVPVGSSVAIHVPGVHYNPRY
WKDPAAFRPSRFFGNWPRDAFLPFGAGSRACIGRRFFETEAITALTMLVVRYE
ISVTDEPQFRDETAEQRRERVLSATQELTLT
 
>CYP5141C1 pc.81.19.1
MASRLLVLLAALVLFALRAFARFRRAVHAVSYVSSALRSVNHPGYRTLLNTL
GPIENFFPRIPGVAPGAFHMWKRKHRDFEEHGWDVITY
VAAFIGSSTNFYVADADVIK ()
EITTHRSRFPKPIEQYKVLTFFGGNIVASEGEHWKRYRKIAAPAFSE ()
RNNKLVWDETRLIMQDLFTNVWGERAEIYVDHAVDITLP ()
IALFVIGVAGFGRRIPWQDEDVVPAGHTMTFK ()
TALHTVSENVFTRLLIPDWLLRAAPTARLARIRDAFAELEQYMREMIRARRER
PAREERHDLFSSLLDASKDADVRLQDSELIGNMFIFMLAGHETTAHTLCYML
AMLAMHPEVQDKMYESIRGVTQNGRLPEYEDMRSLSYCEAVLYETLRMFPP
VNSIPKSVAEDTAITITNADGERTTVPMPKGSSISIHTPGLHYNPRYWPDPHTF
RPERFLAADWPRDAFLPFSAGPRACLGRRRFSETESVAAAAMLVLRYRIAVA
DEPRFAGEGARARFERVTASRPGVTMTCVFCAPLWVVVGTDELCCRPTRVPL
VFRRR
 
>CYP5141C2P pseudogene FRAGMENT pc.167.26.1 91% TO 5141C1
genewise.20.89.1 [whiterot1:25284]
VAAFIGSNTNFYVADADIIK (0)
EITTHRSRLPKPIEQYKVLTFFGGNIVASVGEHWKRYRKIAAPTFSE (0)
RNNKLVWDETRLTMQDLFTNVWGRHAKIHVDHAVDITLP (0)
VAGFGRRIPWQDEDVVPAGHTMTFK ()
TALHTVSENVFTRLLILD
 
>CYP5141C3P pseudogene FRAGMENT pc.67.69.1 84% TO 5141C1
VASFIGSNANFYVADADVIM (0)
GNTTHGSRFPKSIEQYKILTFFGGNIVASEGEHWKRSRKIAAPAFSE (0)
RNNKLVWDETRLILQDLFTHVWGKRAEIHVDYAVDITLL (0)
IALFVVGVTCFGRRIPWQNEDVVPAGHTMT*LK (0)
TVLHTVSENVFMRLLIPD
 
>CYP5141D1 pc.81.21.1

MLLILWALLAAVVYHAAARLVRLRRLLVKIRFHPGQRAATSIYGAATFLFPW
RIPNLTPGANLLFDEKHALLARHGLDVVTSVSTHPMRAVFVVADPAVLRDM
AAARSRYPKPVELYGSLSLYGPNIVASENDAWKRYRRICSPSFSERNNKLVW
EETVRVVTELFDTWEGRQEIDMEDALTMTLSITLFVISSAGFGKPITWKGGDE
RPEGYAMSFKDVIYHMSTGVFIKIATPQWLLNLGLTEKMRNTNVAFKELGM
YMSDMIRERRESQQREDRGDLFNGLLDAGEEDEKLKLTDEELMGNIFIFLIAG
HETTGHTLCYALALLALYPDEQEKLYQHIRTLCPAGELPVYDDLRNYTYALA
VLYETLRMFPSVVGIPKVASEDTCVQTMNDAGQLVEVFIPEGSDIVFDTPGLH
YNPKYWTDPYTFSPSRFMAPDWPRDAFLPFSGGPRACLGRRFARFAEIESIAV
LVLFVSQYTIHLKEDPKYAGETEQQRRERVLKSVPGLTLT
 
>CYP5142A1 ug.79.41.1
MDDVNLFIRARTLLDSVLVLILTSIGYAVANAVYNVYFHPLSKFPGPRMAAAS
RWWKTYVEVYRDESIVDRLFHLHEKYGNVVRIAPDELHFSDPAVYNAIYSPK
SRWNKDPLMYAPFGFGSHRSMFSTVEYQPAKKRRDLAAPHFSRKSVLNLQG
VIQAGVSNLCDAMAQRAAEGKPTDIYSAFRCLNFDNVTSYCFGWSLHMVRS
PDFSAEPVQNMQDMHSSYQVWKHFLWLRTPMRLLLSVLGKRPMPYFRVIME
QVDGYLERPEELDNAPHSLIFHSLMDPAQSTKLDKQSVVEEANLLIIAGTDTIS
NASALGTLFLLSDGGYMRDKLQAELKAIWPRLDDKPSLEVLESSAPYLKAAC
KESLRLSHGVMSPLLRVVPSQGATLGGHFVSGGTKVGICNAFVHLNPALFPDP
HVFRPERWPEPGAESLDTWLVAFSKGPRSCIGINLGWCELYMNLANLFRRFD
LKLDHRVQVLFLGGS
 
>CYP5142A2 pc.79.57.1
 
MNVWARVSEGWTALEVILVALLTSIGYVVTTALYNIYFHPLSKFPGPKLAASS
WLWKAYVEVIKGESILDRLSKLHEEYGPVVRIAPDELHFNDPAVYNEIYTARS
RWNKDDVMYAPFGKDTSIFTTREFREAKKRRDLSAPHFSRKTVLSLQGLIQE
GIDEFCEVITKRDADSKTTDIFRAFRCLDFDNVSSFCFGWSEHAIQAPDFNSAA
VEELQHSNKDFQFWKHFLRLPLPVLLLASRIKNQIATYIDKPEELDKTPHPTVF
HVLMDSSHGTRLSATAMAEEASLFLIAGTDTTSNASALGTIFALSDNGYMRN
KLKEELKSVWPRLEDKPSLEVLESLPYLKAVCKESLRLSHGAMSPLMRVVPQ
QGAVLGGHFVPGGTKVGMAHTFVHFNPTLFPEPHTFRPERWLEPGAEALDT
WNVAFSKGPRSCLGIKSLAWCELYMNIAHIFRRPVRPYDLWHRDCFLPYLDG
VDLLVYATPSTD
 
>CYP5142A3 pc.24.27.1
 
MDVVRQALEGRTTKEYAGLALLAFAAYVVANIIYNLYFHPLAKFPGPRAAA
ASRWWKAYVEVYKGESIVDRLFELHAEYGDVVRITPDELHFSDPKVYNEIYN
TRSRWDKDGEMYAPFGGNSTMFTALRYHDAKKRRDLTASLFSRKSVLSLQGSIQEGL
DELCDIISARSAAGKTTDLFRAFRCLNLDNVTSFCFGWSLHTVRAPDFRAPPL
EEVQNSHGGYQFWKHLMLFRAVLLPKLKEQVDALVARPDELEAAPHPIIFHS
LIDPAHGAKLSAQELMEEANMFIVAGIDTTSNATGAGVIGVLSNPSTYDKLKT
ELRTAWPRLDEKPTVEVFESLPYLKAVCKEALRLSHGITSPMLRIVPPQGATL
AERFVPGGTQVGVSHLFVHLNPTLFPDPHAFRPERWLEPGAESLDTWLVAFS
KGPRSCLGINLGWCELYLNIANLFRRFDLKLEGRAKAFLDGPRARASGDWKD
CFLPCFEGPDMLIHTTPVAD
 
>CYP5142B1 ug.20.42.1
MLNLSLDSSSVLSLVWTASPWLLLSWILYTVLMAVYNLHFHPLAKFPGPKMA
AASEWWLAYVEVIKQESLSKKLWELHEQYGANATQLHFSKPAAYNEIYNVK
NRWDRDMKLYHIFADEVSTLTIPDYARAKKRRDLTTFLFLARILLRQLDTVCE
NIDKHIKEGKPVSIFKAFRCAAADVICTMCFARSMNATSEPGFNAQVVTAIHA
AFPVIMVFKHFPLLQTLSRMVPPLLLSSLRPELNGLMKMRKMLTDQVKEVKA
HPEILKESQQVTIYHELLKDPKNIPSDTSLRDEAVLYVTAGMDTSSDTLTLATI
NVLSRPDVHARLMHELVEAWPHLEDAPPRYEQLEKLPYLTAVLKESLRLSHG
VVQPMTRVVPREGAYISGHFIPGGSIVGMSSIFVHWNEEIFADARAFKPERWL
DPEADLDPWLVAFSKGPRSCLGVNLGWCELYMSIAAIFRRYELKLNGIG
 
>CYP5142C1 pc.20.56.1
MLRLLVDNGLVSALARYGPAMLISIIVWTLGRVVYNLYFHPLAKYPGPRMAA
ATEWWQAWLEIFKAESLSLTLLELHAKHGGDIVRIGPNELHFSRPSAYHEIYT
SKNKWAKNPAFYRYIVSPTESTFSTCEYDKAKKRRDITLPIFSRKSILGMQHLV
QECIDSMCENIDKHISEKKSVNILRAFRCCALDAVTSLCFARNTRATSEPEFRA
PIEVAMDFSLPLTPVLKHFPMVQVVMSWLPPDVLLWADARLGGFVQLRKML
DAQVEEILRDPDVLASAEHPTIYHAFLAHAPTPSVAELRDEALVYVHAGTDTS
SDALAVGTLNVLGRPAVLARLRAELDTVWPRLDERPRYEALEALPYLTAVV
KESLRCSHGVVHPMTRIVPRGGARISGAHIPAGTIVAESNIFVHWNADVFPEP
HEFRPERWLEGKTPSGESLDNWLVPFSKGPRSCIGINLGYCEIYMTFANLFRR
YDLSLDGVKPSDWKWRDCYLPHYLGPEMKVVATPRLS
 
>CYP5142D1 ug.43.40.1
MAGQLMAHSLDVLSTMFTLLSLYAITRCIYNLYLHPLSRFPGPKLAAATTWW
RAIGEVFMWENLTDKLVELHNTYGPCEIVRIGPNELHFSRPSVYHEIHNPRNK
WNKDPAVYNVFADTESTVSICNYEAAKRRREMTLPLFSRRSIVDAHDLIRSCL
DKMCTNIDSIASSGEPVHFFRAFRCFALDAISLMCFGVSPEASLAPHFRSTLDG
AMHVALHDALLVKQFPLLKYLMAYSPQWLVTYTRPALRSYFEMRRVRLSLP
TLRQLVYLRSTIAAERSSAGSQEGTPLSEPVLRDEAFVFVNAGADTVSNAITV
GVLNVVDNRDVYTKLKHELRCAWPNLKVSPRWEELERLPYLRAVVKESLR
MVIGVVHPMTRIVPPQGAVLCDMFIPGGTSVGISHYFLHHNEDVFPQPRTFKP
ERWLARESDKEHMLVSFSRGPHSCLGVNMAYCELYLAFAYFFRRYDVELNG
VRYVHVSSRETELTN
 
>CYP5142E1 pc.167.13.1
MDSWLSWHGVAAAVVAAALLLVVYRVYFHPLAKFPGPKLAAATHWYSAY
YEVWRDGALVEHLQELHKQYGPVVRITPDEACISYTDIYVRGTRFTKDPGFY
GFMHGDRSSFWMLDPQKSKARRDVLLPLFSRRAVLSLEDVVQKKVRALVTA
VLTQGADDTSVNMHRAYRSATLDTILAYAFAQERGMLDVPGFAHPLVREFE
RAFPLALILKHLPWLHRVSTAVRAVKYMLVRTDPDDIVRDTAAQIDGLLADP
DRLAELPHETVFHRFLAPHAKGAGGEPPSRRDIVDEAINIFAAGSDSTGHTCA
MGTAFVLAYPEVHKRLVRELEEAWPDRDAEIRLAQLEKLPYLTAVIKESLRM
SHGVVMPLPRVVRPNEAIIDGISVPAGAVVGMGATFMHYNPEVFPQPYTFDP
DRWLQPDVSRLEQHLVPFSKGLRSCIGLTLAWCEMYLVFGYIFRLLDMQLDN
MTLEDIKVKYHFTPTVREKDMLRCMVRARES
 
>CYP5143A1 gx.20.61.1
IILIILATISYRLSPLHPLARYPGPILDKSTSLRLAYLAFIGQRAQYVTELHERYG
KIVRIGPNKLSINSLDVVHPIYGSSQAYDKSESYRPGLSAEGSIFFARKKELMR
DGDPEGVVESGHKAILMFETYCDSFGEVPALFDILSVLPTGEAYQLVEKRAAT
HLKDRIKVHPHDGWDMCSFFLAQREGHNYPPMNEVDLNANTVVAFEAGGD
TTAGFIIITMFHLLRYRQAYDKLKEELDGAFPTGSVSVEEYSHLAELPYLGAVI
NEGLRLGAAFPSFPRVVPKGGAMLAGEFIPEGTMVGVPIYTQHYSPDNFWPEP
REFRPERWFEDGLGPGTITRQAAFMPFQFGPFGCPGKALGLRLMSVVISNLLV
LSYDLSFPPDFDPEAFLNGWINTRTNIFRIPLRVEAKRRPW
 
>CYP5143A2 PFF_33b(pc.20.120.1)
MFNSFGPAHVLLPPLALSVIIAVAAYRLSPLHPLAHFPGSWVDKVTSLRVAYF
ALTGHRAEHVTSLHDKYGVVVRIGPNRVSINSSDVIYPIYASPQAFDRAASYR
PGLIHDGSLLFSRKRHWDGALKDRVDQLIDCIARRQDLRGVVDLGEFMRNGD
PHNICRSAKDSIVLFEVYSSSLGEIPALFDIASVLPVTAEYRKVERHFQKHINER
MQIKSHSDWDFCSFFMAQREDAQYPPLSKPDLNADAMVAFEAGGDTLAGFL
SIIIFYILKHQPVYQKLRAELEQAFPLGEIAQDQYASLTEIPYLVAVINEGLRLG
ANFAGFQRVVPQGGAVLAGQFIPAGTVVGVPAHLQHIHPDNFWPTPLEFRPE
RWFKDGLGPGTITRQSAFMAFQFGPFGCVGKTFAYRQLNVVLSRLLLAYDLT
FALDFDSKAFVEGWLNIRTTIFNYPLKVQASRRQW
 
>CYP5144A1 ug.24.32.1     revised 8/15/2007 at I-helix micro exon, also removed one
small intron after AGVYL and added a micro exon VDVIPA
 
MELPAHTKYLLACLAFAIFVLLHSKRRRPRYPPGQRGLPLVGNLWDIPTEYA
WVKYREIGAQLGSDIIHFEVLGSHYVVLNSDKAVKEVLEKRSHNSSDRPQTV
MLQELTGWHRNWALLEYGDYWKDLRRIFSQYFRPSAVPQYHSKQTKAVRRF
LNLLLNSPDDFTKHIRYLAASAILDVVYGFDVRPGDPRIELVERGVHTLTDISA
GVYL (1)
VDVIPA (1)
LKYIPAWFPGASFKRKAAGWKVLVDAVYEVPYSQYKDAMREGTAKPC
2461961 FAGTLLSEANPDGDLDETFRCLTGTAYV (1)
GGADT (0)
2462165 VSSTLLTFMLAVTMFPETQDPAHEELDRVLGRKRLPDIRDRDDLPYITAM 2462314
LHEVLRWHPVAPLTLPHRLTADDEYEGYHIPAGAVVFGNAWAILHDPATYGD
PDVYAPARYLTADGRALRADVPYPLEGFGFGRRVCPGRPFAHDILWLALAHVL
AVFRVGRARDAHECEVPPRGVFTPGLISVPEPFGCRFVPRFPGAEELIRQSAMP
E
 
>CYP5144A2 ug.24.29.1 revised 8/15/2007 at I-helix micro exon and at VDVFPI,
AND  KQSIMVNEL, AILRDED
 
MEDTSRSLVGPSLWAVFALGLLFAFCLRRQPRYPPGPRGLPIVGNVFDIPMNV
GWKVFRDVSRCFESDVIHYEALGSHLVVVNGAKAAKELFERRANNYSDR (2)
KQSIMVNEL (2)
TGWHRNWGQLEYGDRWRQHRRLFHQHFRPMAVSQYHPRQVKGVRVLLRA
LSESPEDFQRHIRFMAGATIMEIVYAYDAQPGDPRIKLVEDAVDTLTFVVNAG
VYL (1)
VDVFPI (1)
LKYVPNWFPGASFKRQAAEWKKLVDALYEQPYQEFKATVKEGN
2465265 AKPCFAATLLSSVENDEDIENLEELFMGLTGTAFV (1) 2465369
AGSDT (0)
2465493 TIASLNVFILAVTIFPEAQRSAQEEIDRVLERKRLPTMEDKVLLP 2465627
HVTALVHETLRWHPPLPLAAPHRVIEDDEYEGYFIPAGTTIIGNAW (2)
AILRDEDLFPDGDSFKPERWLNEAGALRDDLPYPMETFGFGRRICPGRHFAND
VLWLAIANILTVFSIERALGEDGQPIVPEAKFSPRLISK
PEPFKCAFKHRFSGAEDMIRLAAIVEE
 
>CYP5144A3 genewise2nd.24.5.1 REVISED AT PRSVMLHEL 8/16/2007
 
MELPPVPHPLIAYLCAGLLLAGLVVTRLRRRRHYPPGPKGLPLIGNLFDIPTDYAWKI
YRAFGDQYGSDIIHFEIFGTHLVILNSAKAARDLFEKRSSIYSDRPRSVMLHEL
TRWGRSFGFMQHGDEWREHRRLFNMHFRPSAIAQYHAKQKSAVCTLLRSLLDA
PEQFREHVHFMAGDVIMGIVYGFDVQPGDSRLQLVEKAVMTLNQIVNAGVYL (1)
VDVIPA (1)
LKYIPAWFPGAGFKRHAAEWKKLVDDMFEIPYRESMKSLQEGKCESSFAASLLAQ
LEGQESPDNIERIAMDVLGTTYVAGSDTIITATSTFLLAMILHPEVQITVQAELDAL
LEGARLPNISDKAALPSVTAVLQEVLRWNPGLPLVPHRVVADDEYKGYHIPAGA
AVIGNTWAMLHDETTYPDPEPFKPQRFLNEDGTLNADVPYPTDVFGHGRRMCP
GRHFAHDMLWLTIASILTVYKVERDVDEDGQEITPTASFTSRVPTPFRCRFTPRSA
SAESLIRSSGVSTE
 
>CYP5144A4 pc.24.8.1(gx.24.6.1) revised at micro exon 8/15/2007
ADDED KQQSVMIHEL AND REVISED LAST INTRON BOUNDARY 8/16/2007
NOTE THIS GENE IS COMBINED WITH CYP5144A5 IN JGI BROWSER (e_gww2.1.398.1)
 
MEQSTLHILPYLCASIPVLVCLLVLRLRRPHYPPGPKGLPLVGNLFDVPLSHGW
VAYRELAKQYGSDVIHLEILGSHIVIINSAKAARDLFDKRSNIYSDKQQSVMIHELTGWHRN
WGFMAYGDYWRKHRRLFHRHFRPAAVPQYHSAQAKGVHNLLKLLMRSPERFREHIRF (2)
2473972 MAGSTILDVVYALDVQPGDSRIELVERAVHTSTEIVAAGVYL (1) 2474097
VDLFPI (1)
2474217 LKHIPSWVPGAAFHRKAAAWKALVDRMYEEPYNQFKASM 2474333
KDGNAKACLTASLLMEAESTHQLDAIEDILISVTGTAYGAGTDTAVASLNTFMLGITMFPH
TQLAAQDELDRITARQRLPTMEDRENLPHVTAILQEVLRWNPAAPLGLPHRTVRSDEYNG
YFIPQGATIIGNSWAMLHDEAIYPDPGSFKPERFLTEDSTLRSDVPYPIEAFGFGRRICPGRYF
AHDLLWLTIAGILAVFRIERARDEQGDEIVPAGDFSPRFIS (2)
SPEPFQCRIVPRFAGAEALIHGTGLLG*
>CYP5144A5 pc.24.9.1(genewise2nd.24.7.1)
NOTE THIS GENE IS COMBINED WITH CYP5144A5 IN JGI BROWSER (e_gww2.1.398.1)
 
MEWSLSLGYALLGLGIMWVVKYAQRPRRRYPPGPKGIPILGNVFNIPLENSWISFDQ
WSRQYASDIVHVEALGKHVYVVNSARAAKELFDGRANVYSDKEQSVMMLELCGW
SRSWAMLPYGNYWREHHRLFHQHFRPQSMVRYHEKQRRGARRLLQLLLDTPEDYE
KHMRYAAGSTILDVVYSFDVQPNDPRIELVEAALGTANDLMHAGIYL (1)
VDIFPL (1)
LKHIPTWFPGAQFKRLAAKYKRLVDNMYTVPYSQLKASVKLGTAQPCLVASLLS
EADEHVTPERDEIFMNLAGTTYAGGTDTIVIALSIFILAMILHPEQQVAVQKEIDR
VVGRDRLPELADRESLPRVTAVIQEVLRWHSPLPLATPHRATSDDEYNGYYIKA
GSVVIGNAWAMLHNENVYPDPASFKSERFLTPDGKLRDDVPFPIEAFGFGRRICP
GRHFALDSLFLLVSHILAVFTIEHAVDADGHIIPVEPEFEPQAFSPPKPFKAQFKLR
FLAAEDLIDGSALE
 
>CYP5144A6 pc.24.10.1  revised at micro exons 8/16/2007
 
MLTALSCVFAEALAVWAVSSWTRPRHEYPPGPKGLPFLGNMFDIPMKYGWVTF
ANWSRLYGSDIVHVQALGKHIYVINSAKVAKDLFDGRPHIYSDK (2)
EQSVMTQEL (2)
SGWKRAWALSPYDDEWREYRKLFHQHFRPSAVQQYHHKQTKAVRRLLQLLLD
TPEDFLAHLRYAAGSSLLDVVYSVDALPGDLRITLVEKAVHTFAKLLETGVYL (1)
VDVAPI (1) LKHIPAWFPGADFKRQAAEYRQLVDDMFKVPYQQFKDAWRRGTAQPCFAASLLT
DADPLDGSEHEELFINLTGTTYAAGSDTTVAAMSTFMLAMALHPEVQRWVQEEL
DRVVGRGRLPEMADQPALLRVMATVHEVLRWHPPLPLATPHRAMADDVYAGF
TIPAGSIVLGNSWAILHDDKTYTNPHTFDPRRFVGPNAQPFPEVVFGHGRRECPG
RHFALDILFLAVAHVLSVFAIERVDNSDPGIGDIQGLFTPHVLSYPKPFKASFKPRF
PGVESLVRTAALSEI
 
>CYP5144A7 pc.24.11.1 This is hybrid with genewise2nd.24.9.1|whiterot1 in second half (GENE IS SPLIT IN GENOME VIEWER)
MSRFLYDYSTLLYLCAGITFVVLITLSSRPRRRYPPGPKGLPIVGNLFDVPTDHGW
KRYQEIGKEYGSDIVHFQVFGSHIVVVNTAKAARELLDKRSNIYSDKQRSVMIHE
LTGWHRNFSLMPYGEGWRTRRRLFHQHFRPMAVPQYHTRQLKAVHGLVQSLFE
APQNYKEHIRFMAGSAILDIIFAFDIQPGDPRIEIVEKGVQTATEFMCSGVYL (1)
VDVFPI (1)
LKYLPSWFPGAGFKRQAAKWKALVDDMHEIPYYQFKQTMREGKAKPCFASTLL
SSAAENDKDSLESLDEIFMSLTGTAYVAGSDTTISALNTFVLAMTMFPETQAAAQ
EELDRVLGRKRLPDFDDRDSMPYLTAMVYELLRWHSVLPLGLPHRTLADDEYN
GYFIPAGTVIVGNCWGMLHDDDLFPDPDIFRPERFLNADGTLNSDAHFPIETFGFG
RRICPGRYFAQDLLWLTIANVLAACSIERVVDEKGFEVRPTGDMTPRVLSMPEPF
ECNIRPRFSGAEALVRSACLND
 
>CYP5144A8 genewise2nd.24.9.1 = Scaffold_205i revised at micro exon 8/15/2007
AND AT QQTVMIHEL
fgenesh1_pg.C_scaffold_1000787 [Phchr1:787]
MEALASRITVYLCAGVVLVYVFSRVFKRRPHYPPGPRGLPIIGNLLDVPSLYGWI
AYKNLGDQCGSDIVHLEVLGSHYVVLNSAKAARDLLDKRSNNYSDR (2)
QQTVMIHEL (2)
TGLERGLGMLPYGDYWRLHRRLTQQHFRAAAVPQYHARQAKVVRKLLHSLLD
SPERFMDHIRFMAGAIILDIVYALDVHPGDHMIEVVETAMGRINEIINAGVFL (1)
VDVIPV (1)
LKYLPSWFPGAGFKRRAAAWKVDIAPMFEAPYERFKQSLGGTTRPSFAGNLLSR
VQNEEELSQLEDVFMNITGTAYGAGSDTTLATLTGFVLAMTIFPEKQLAAHAEL
DKVLERKRLPEVEDMQYLPSITALVYEVLRWNPAAPLGIPHQTIVDDEYNGYFIP
AGTVVIGNAWAMLRDPNTYPDPDTFKPERFLAKDGSLRDDVPYPTEAFGHGRRI
CVGRHFAQDVLWIAIAHILTVFRIERAVDEDGREIVPVPDYTPHFVTMPKPFKCRF
TPRFPGAEGLIRSAADASNE
 
>CYP5144A9 pc.83.7.1 revised at micro exon 8/15/2007,
revised at RQSTMMLDEL and EFTARIVS 8/16/2007, EST = DV758101.1
MAVLLAAGYYILSVAVFILLYNASRRRQRLPPGPKGLPLIGNLFDVPNDYAWLRY
KELGQQYGSDIVHMQALGNHILVLNSMKAAVEILDKRADISSDRQSTMMLDEL
SGLGRAWTQLGHNDSWRIHRRLFHQHFRPSAISQYHTKQTKAIHRMLSFLRESPA
QYMDHIRFMAGSMILDVVYTLDVQPGDYRIKLAEMVAHVSTEVFTAGVWM (1)
VDMIPM (1)
LRHLPTWFPGAGFKIQAAKWKTTVDRSYDIPYEQFKASMHEGGGEPCLASALLS
SAEDVEELERMDKVFSSLTGTAYIAGTDTTVSTLASFVLAMTIFPEAQLAAQAEI
DRVLGGTRLPDINDKANLPQVTAILYETLRWNPVLPLALPHRTTADTSYDGYYIP
AGTVVLGNSWAILQDETLFPEPQLFKPERYLNGDGSLNSSAHYPIETFGFGRRICP
GRYFAQDAVWLAIAHILAVFKIERARDGDGKEIVPTPEFTARIVS (2)
MPKPFECKFKVRSPQAESLIESAALGG
 
>CYP5144A10 pc.83.8.1 revised at micro exons 8/16/2007
MALLVYFCAGLVPVLLLVLGLRKRPRYPPGPRGVPIFGNIFDVPMKYAWLEYVK
YGQQYKSDIVHFQVLGQHIVVLNSLQAVGDLLDKQSSIYSDR (2)
VPSVMLNEL (2)
TGWSRSWVQMEYGDQWRMHRRLMHQHFRSTMIPQYHPKQTKAVRRLIQSLLE
QPEHFMEHVHFLSGSLILDVVFSFDVRPGDAILALAERAVDTTKAIIAAGVWL (1)
VDVVPI (1)
LKYIPSWFPGAGFKRIAAKWKTDVNKMFDVPYAKFKDSMREGSATPCFASALLS
GAEDDNGGVIDNQDEVFISLTGTAYVAGSDTLSDALSTFLLAMIAFPEKQRAAHE
ALDCVLERKRLPGVEDRDALPHITALAYEVLRWHPVVPLSIPHRTTADSYYKGY
YIPAGSTIFPNSWAILHDEALYPEPHLFRPERFLNEDGSLHAHARDPIEAFGYGRRI
CPGRHFAHDALWLAIAHILAVFKIERALDVDGNEIEPKLDFMPHFLSMPKPFKCR
FTPRFPDAANLALSASSDY
 
>CYP5144A11 ug.83.30.1 revised at micro exons and after EXXR motif 8/16/2007
GC boundary at HIRY, added C-term
MGWTLCLYALLGLTGLWVAARVRRPRQRYPPGPTGLPVLGNVFDVPLENGWL
VFDQWARQYDSDVVHAEALGRHIYVVNSAKAARELFDGRPHVYSDK (2)
DQSVMLLEL (2)
SGWWRSWVMLPYGDYWKEHRRLFHQHFRPQSLPQYHEKQAKAARRLVRLLLD
SPQDYAKHIRY (2)
ATGSSILNVVYSFDAQPGDPRLELVEAAMGTANELMHTGVYL (1)
VDIFPV (1)
LKHLPMWFPGAHFKRQAARYKRLVDDMFEIPYAQLKSSMQEGTIEPCFAAALLS
EAEDSASPERDDMFMNLAGTAYAAGTDTIMMTLLTFILAMVLHPEEQAAVQEEI
DRVVGRDRLPGLADRESLPRVTAVIQEVLRWHPPLPLA
TPHRAASDDEYNGYHIPAGALVLGNCWAMLHDARVYPDPDVFRPGRFLAAGGD
APRADVPLPAEAFGFGRRICPGRHFALDSLFLFVAHLLAAFRIEHAVDAEGNVVP
VVAAFEPQAFR (2)
SPPKPFKARFTLRYCGAENLVRGGVRAG*
 
>CYP5144A12 pc.16.82.1 revised at micro exon and added one exon before it. 8/15/2007
fgenesh1_pg.C_scaffold_8000230 [Phchr1:5055], added N-term 8/16/2007
added RTHSVMLNELSGWAE and WYTPFPLG and C-term exon 8/16/2007
MELWFDFPSVVTYVLAA
VSCASLLILHTRRRPRYPPGPKGLPIVGNLLDVPTHNAWIKYKQLGKKYGSDIIHF
EVFGSHIVVLNSTTVARDILEKRSQISSD
RTHSVMLNELSGWAE
ERNFGFMRYGDGWRRQRRLFQQHFRRKAVTQYHAIQSKSVHSLLNALLDRPERF
IANLRF (2)
MAGSMILRIVYGTDIQPGDSRLTLVEKAVGTLVEVMNAGVFL (1)
VDVFPI (1)
LKHIPSWMPGAGFKRKAAEWKVLVDDMYEVPYNVGTLSIFFLAMTIFPSVQVAA
QEEIDRVLGRKRLPSIEDRNALPRTTAIVYEVLRWYTPFPLGVPHRTIADDEYNGY
FIPAGTTIIANAWYAMLHDEERYPNLETFIPERFLNKDGSLRSDACIPLEPFGFGRRI
CPGRYFAEDIVWLAIASILSVFRVEPPVDEHGEPLKQTATFGTRFLS (2)
PAPFKCCFTLRYPEAEGLIRASATSTA*
 
>CYP5144A13 pc.16.83.1 = GX.16.34.1 (USE THIS MODEL)  = SCAFFOLD 4e
fgenesh1_pg.C_scaffold_8000229 [Phchr1:5054] ver2,
ESTs = DV761651.1, DV753979.1
revised at micro exon  8/15/2007, revised at AHSIMLNELS and C-term  8/16/2007
REVISED SEQ = 50% TO CYP5144A8, IT SITS NEXT TO 5144A12
MTTLALASSLTYLFAGVLLVCLFVSYARKRPRYPPGPKGLPLIRNLLDIPADYPWIT
YRDLAEKYNSDILHFEVFGSHLVVLNSAEATRQILEKQSSITSDRAHSIMLNELSG
WDTDRTVTFMEYGESWRRHRKLFQEHFRQQAIPRYHHAQTKGVNRLLKSLLDTP
EKFSAHIRFMSAYTITEVVFGKEVGPDDPSIEVVDDGMHTLNELLNAGVFL (1)
VDIFPL (1)
LRYVPSWFPGASFKRLAGKWKKAVDDMYTIPYNNYKATLGEGDANTCLLATAMADKVD
QDDARVVDHDLMCLAGTTFGAGYDTTATALGIFIMVMAVHPEAQISIHEELDRVLHRDRL
PTMEDRKELPRTTALMYETFRWHLPLPLG
VPHQTTAAIHYNGYFIPQGANIVANSWAILRDEGLYPDPETFKPERWLDADGSLR
DDMRFPVEMFGYGRRICVGRHFAEDIVWLAIASILSVYKIEPPVDENGTVRALEAD
FTPRLFSAPKPFKCRFTPRFPGAEGLIRASL*
 
>CYP5144A14P pc.24.16.1 = gx.24.13.1 PSEUDOGENE MISSING C-TERM
revised at micro exons  8/16/2007
MPDCTTIGYLLASVALAYALASRHPRSSRYPPGPRGLPLLGNLFDMPRKHSWTK
HQELSKTYESDVIHYQVLGLHIMALNSGEAVRDLLNKRSTIYSDR (2)
QETVMLHEL (2)
TGWHRNWALMRYGDAWKERRRHFHEHFRPQAVSQYNFKQVKAARILLNSLLE
SPRAFSEHIRFMASSLILDIVYALDVRPDDPEARRVERALETLAEISASVFM (1)
VDLIPV (1) LKYLPSWFPGAGSKRQASIWKNIVDEMFETAYQVCKNSAQHEYVRPCFTTALLS
EVSDPANMKEMDEIFMSLAGTTYI (1)
GGSDT (0)
 
>CYP5144A15P PSEUDOGENE pc.24.17.1 64% TO 5144A1
IT APPEARS THAT 5144A14P AND 5144A15P WERE PARTS OF A SINGLE
GENE DISRUPTED BY A TRANSPOSON CONTAINING A RETROVIRUS-
RELATED POL POLYPROTEIN THAT NOW LIES BETWEEN THEM
AQSAARSSTASTEQINATLNTWMLAMTLFPDTQVAVQDELDMVLGRKHLPSIED
RDSLPRVTAMLHEVLRWHPVGPMGVPHRLTVDDEYRGYHIPAGTIVMINAWAIL
HDESVYPEPDIFRPERYLDSDGRLRTDMPYPVEGFGAGRRLCPGRHFAHDMLWL
AIAHVLTVFRIERAVDEDGREIVPEAKFEPWLIRCASLCVGVPCTAHLAVSPPEPF
QCQTKLRFPEAEGLVHLAAMDE
 
>CYP5144A16 NEW SEQ 2588519-2588442 REGION (-) STRAND SCAF_1
68% TO 5144A5 AA 311-336
e_gww2.1.906.1 [Phchr1:132380]
MEGIPILSYVTLGLVTVWTILSLRKPRRRYPPGPKGLPVLGNV
FDIPLENGWLIFDKWARQY (1)
GSDIVHVEALGKHIYVINSAKVAREIFDGRPHNYSDK (2)
EQSTMLLEL (2)
SGWGRSWVMFPYGDYWRQHRRLFHQHFRAQSIPQYHQKQAAAA
RRLLQLLLDTPADFAKHIRY (2)
ATGSSIVDVVYSFDTPPGDPRLEIVEAAMGTASELLHSGIYL (1)
VDVFPI (1)
LKYVPAGFPGAQFKRKAAHYNKLVKDMFTIPYTQVKTAM
KEGSVQPCFTTALLSESDDLDTPERDKIFQSLVGTAYAAGTDTLMISMLTFMLAMVLH
PEAQT
AAQNEIDSVVGRDRLPGMTDRDSLPRVTALIQEVLRWHCPMPLATPHRAIVNDEYNGY
HIAAGSVVIGNA
WAMLHDEDVYPDPHSFKPDRFLTTDGRLRDDIPFPIEAFGFGRRICPGRYFAMDALFL
FVSHVLAVFRIE
HSVDAHGNVVGVEAEFQPQGFRCA*
 
>CYP5144A17P gx.24.12.1 49% TO pc.24.17.1, 57% TO 5144A5
scaffold_1:2496394-2495844 (-) strand
2496394 IADREPLPPVTA 2496356
2496230 HTPRYERRHSGYYMSAGLLVIGDTWY (0) 2496153
2496068 RFLMAERALRTDVTFPIEVFGYSRRICLGRQFAKDVLFLAISNILAIFTI
EKAVDEHSGSIEVQNAFLPHAIRCA 2495844
 
>CYP5144A18 fgenesh1_pg.C_scaffold_1000782 [Phchr1:782]
same as Scaffold_205c gene model complete, revised at micro exons VDLIPI (1) and
PQTVMLHEL 8/16/2007, 62% to 5144A1
GC boundary at PGLIR
 
MESLTLRNCASAFLAG
LLVLACGLVLRPRRRPRYPPGPKGLPLVGNMLDIPTEYAWERYYELGKEY
GSDVLFFRVLGSHFLVLNSAAAANELLEKRANVYSDR (2)
PQTVMLHEL (2)
TGWDQNWAFWEYGEGWKQARKMFHQHFRPSAAPQYHLKQ
TKAARRFVKLLLESPASFAQHAR
FLAGSAILDAVYAFDVQADDPRIALVERGVHTLVEISRGVFL
VDLIPI (1)
LKYIPSWFPGAGFKRQAAQWKDAVDATYSDPYRQFKTLL
RNGQAEPCVAASLLSSSGDEPSGALDDLLKSVAGTAYV
GGSDT
VSATLTTFILAMTMFPDAQAAAHAQLDEVLKRTRLPEMADRAALPYITAILYEVLR
WQPAGPL
GLPRRLMADDEYRGWHIPAGTVVLPNIW
AMSHDPGTHAVPAQFVPARYLAADGTLREDVPCPADVFGFGRRVCPGRPFAQDV
LWLAI
AHVLSVFRMEGPMGERGEIRHSRLFTPGLIR (2)
LPEPFACSFRPRFPGAENLVDAGVVG*
 
>CYP5144B1 pc.23.12.1   revised at micro exon AND PHAPIVSTIL  8/16/2007
e_gww2.8.230.1 [Phchr1:138753]
MLDTTPFTLALLTVGAVCLLGLIKGASRRRRLPPGPKGVPLLGNIFDAPKEHEWR
TFKEWGRTYESDVVHFGILGTHYVVLNSAWAALDLMDKRSHNYSDN (2)
PHAPIVSTIL (2)
TGWDRNWGFMKYGDYWRAHRRMFHQHFRPNAVSAYHSTSQQAVRELLRLLY
AKPEHFKEHIQHMTGYNIIKLMFGVAVSPEDDPILARMENALRILGKIANPGVYL
(1)
VDSFPL (1)
LRFIPSWVPGAKFKRDAEAWKPVIDKTYTQVYEEIKTSYANGSPVPCLVTEMLED
VLKVDNEAYRDMLEDVIINGGGTAYVAAYDTSSSVLSTFVLAMLLYPDVQRTA
QEELDQIVGPDRLPTMEDQPSLPYVTALATEVLRWRPALPIGVAHKSVVDDEYR
GYHIPGGSVIIPNVQAILHDEGSFPDPDTFNPRRFLDSEGQQLEELTGIVSAAFGFG
RRICPGRYFAKDVVWLTIASVLSTFNIEKHFNEAGNAVEPSGEYTPGIISYPAPFK
AAFKPRSESAVELVR
 
>CYP5144C1 ug.83.31.1 revised at micro exons 8/16/2007
fgenesh1_pg.C_scaffold_1000766 [Phchr1:766]
MFSALVLSLLLAAALFLRFRRKRYPLPPGPKGLPIIGNARDIPKSFPWYTYDRWSR
EYNSEIIYLRLVGTDVIVINSEKAANELLNKRSTIYSDR (2)
EHMTMLLDL (2)
VGWGGRNFAFAHYGDLWRAHRRLFHQYFHPGAVPAYHAKSTLEVRRLLPRLLS
HPDDFMQSIRTMTGAIILGITFGMELQPENDPFVALAEEALHAMAQVGNVGSYI
(1)
VDYLPW (1) LQYLPSWAPGAAFKRQAAKWNKIVLEMYEKPFQTIKQALARGEAPPSILTSMLE
TLDPEEDNAARESDMRHVTGTAYTAGADTTVSSLGTFILAMLLHPEVQRRAQEE
IDRVVGSDRFPEYDDRDSLPYITAIMKETLRWRQVTPLAVPHRLRVDDEYNGYH
LPAGSVVVGNSWAMLHDEERYPNSDLFDPTRFLTPDGELDPDAPGPELAAFGFG
RRICPGRYFAMDSMWIAMAHILATVNIEKAVDDAGNILEPSGEYTYPVPFKVAFK
PRSAAADALIQGGTPLA
 
>CYP5144C2 pc.83.19.1 revised at micro exon 8/15/2007
fgenesh1_pg.C_scaffold_1000761 [Phchr1:761]
REVISED EXXR REGION AND PSHTLLVTV, and E*LTYAKWSREC (1)
Added C-term
8/16/2007 Note: there is a stop codon in a conserved region, possible pseudogene
 
MDIPVPYAFIVVGALVLFFRLRKKPRFPPGPKGLPIVGNALDLPKAR
E*LTYAKWSREC (1)
GSDIIHLRFFGTHVFVLNSVKVVNELMVKRSAIYSDR (2)
PSHTLLVTV (2)
TGWQRNFTFVDLGDHWKARARMFQQNLGTSTISKHRPKLIEGNRKLLLNLLLSP
DDFMKHIRYLSGSSILGIIYGIEVQQDHDPFVETAEKALQCLAAVINAGSYA (1)
VNYVPI  (1)
LRFLPTWAPGAQFKRDAAEWYKYVTALIDGPYTYVKESLANGENNTSIVGTLLQ
ELSDDEKRSEQEDTIREAFGTAYTGGVDTTYSSVNSFILAMLKYPDVQRKAQEEL
DRVIGRDRLPSFDDRDALPYITAIVKETLRWGLVAPLAAPHQLRVDDEYEGYFLPA
GSVIIGNAWAILNDEKRYPHPESFIPERYLTTDGTLDSSAPDPMEACFGFGRRMCL
GRYFAFDSLWIAVASLAAAFHMEKAVDESGLVIEPSGEYTSGTAC (2)
YPLPFKAVFRPRHEGVVALIKADVSESDSL*
 
>CYP5144C3 pc.20.52.1 revised at micro exons 8/16/2007
 
MDDIVLVLLVACAVALYARTKRTRYRLPPGPKGLPILGNTYDIPAKYEWLAYEK
WSRDFGSDIICLKFVGTPVIVLNSIQAINDLLEKRSSIYSDR (2)
PVTVMAYEM (2)
VGLDRNFGFVPYGDVWREHRHLFHQYFRLDMVPKYHDRMLKHSKDLLQRLLV
SPDRLMEHLRFSVAGASILNISYGIDVQPENDHYIAVADEAIHALAVTGNAGSYL
(1)
VDYLPL (1)
LRYIPAWVPGAKFKRDAANWWEKTRLMIDEPFNYAKQRMAQGKGMDCVTAV
MLSAIGEDQDREHQELLIKQVLSVSYIGGADTTVSALATFVLAMMQNPKMQRIA
QADIDRVVGGERLPSVEDRDSLPYVTAIVKEALRWRPVIPLAVPHRVTVDDEYK
GYHIPAGSIIVGNVWAVLHDETRYPNPDVFDPTRFLTSDGQLDNNAPDPAEACFG
FGRRICAGRYFALDAVWLSVACILATFDIAKPLDENGNPIEPSGEYTTGLLSHPVP
FKVSFKPRSAAAEALVREPISRDL
 
 
>CYP5144C4 gx.20.26.1 revised at micro exons 8/16/2007
MDYLGISILLVAALALHYFLRKKRYRLPPGPKGLPILGNALDIPAKHEWLAYAK
WGQECGSDIIYLNLAGTPVVVLNSAKAAKDLLEKRSSIYSDR (2)
PVTVMAHEL (2)
IGLGRNFGLKPYGDTWREHRRLVHQHFRTENVPRYHEFTSKQIGRLLLHLLEDPS
NFVRHLRIMAGASILRICYGIDVQPDNDHYLSVADEAIESIAATGNAGSYL (1)
VDSLPI (1)
LRYLPSWAPGAQFKRDAAKWKEKVDRMIAEPFDYAKRYMASTEGAATDYIAGL
LLSAMDPGRDKTQQEIAIRDSLWAAYVGGADTSVSALATFTLAMVLYPDVQQT
AQAELDRVLGKDTLPTIEDRDSLPYVTAVVKETLRWHPVTPLAVPHKVTTDDEY
RGYHIPARSIVVGNVWAILHDPDRYPNPESFEPSRYLTSDGLLDPAAPDPTEACFG
FGRRICPGRHLAYDTIWTGIASILSSFDISPPLDEQGKPVNPSEEYTTGMLSHPVPF
RANFKARSENVEALIRRITLCE
 
>CYP5144C5 pc.20.54.1 revised at micro exon 8/15/2007 and EIMVMCHEM
MQTVVLAILFVFAIALPIYSRRKRYRLPPGPRGLPIIGNILDIPAGREWLTYAKWSR
EYGSDIIYLNMAGTPVYVLNSIQATTDLLEKRSSTYSDR (2)
EIMVMCHEI (2)
VGWGKNFAFQPYGDFWREHRRMFHQHFHPEAVTKHHVHILKQAKDLLQRLLVDPDD
FMQHLRFMAGAAILRVSYGINVQPENDHYIGIAERAIHSLALTGNAGSYL (1)
VDNLPI (1)
LKYLPSWAPGARFKRDGEIWRREVDQMFSEPFELVKRQMVGADGEPPDCVTAS
LLTTLDERKDRPREELEIAVKQAVGTSYVGGADTTVSSVATFILAMLQFPDVQRT
AQAEIDRVVGSTRLPTIEERGSLPYVTAVMKETLRWNQVTPLAVPHKVTVDDEY
KGYFIQAGSIVIGNSWAVLHDETRYPNPEAFDPTRFLTPDGKLNPSAPDPVEAAF
GFGRRICPGRHFAMDAIWMNLAFILATFNIEKPLDEAGRPIEPSGLYTPGLLSHPEP
FRVKFIPRSKAAEALIRETMFYD
 
>CYP5144C6 pc.20.55.1 revised at micro exon 8/16/2007 and VVTTMAHEM
MENITFAVLFVLLLAVPLFFKRQRYRFPPGPKSLPLIGSVLEFPVQSSWLTYQRWG
RELASDIIYLNVLGKHIYVLNSAQAVSDLLEKRSGTYSDR (2)
VVTTMAHEM (2)
VGWDKNFALQPYGEFWREHRKAFHQQFQPDMVPRYHVHMYKQAKDLVRRLIA
EPNALKQHLRYMAGALILRVSYGIDAEPNDDHYFEIIEQAVYSLTEVANTGAYL
(1)
VDFLPF (1)
VKYVPSWMPGAQFKRDATEWAPQVNQMFDEPFDVVKRALAEGNAPDSVCAAL
LSELDPRKDRAHQETVIKQAVGTAYIGGADTTVSTLSTFVLAMMTHPDVQRTAQ
EHIDRVTGGDRLPTIEDRDALSYVTAIIKEALRWRPVLPMAVPHITTADDEYRGY
HIPKGSIVMGNAWAVLHDEARYANPDAFDPTRFLTPAGTLDKDAPDALEAAFGY
GRRVCAGLHFALDSMWVNVACVLATLDIRKPVDEHGVPVEPSMAYTTGLLEQP
EPFAVVFKPRSQAAEALIYEGHDD
 
>CYP5144C7 pc.142.5.1 , e_gww2.9.179.1 revised at micro exon 8/15/2007
revised at RLHSTMLHDY 8/16/2007, ESTs = DV760024.1, DV759090.1
revised region after EVLR WRPVAPL
MDTLLLAGLVAVAVVAAGCLAHSRRQRFPPGPKGLPLLQNLLDVPRHRPQWEA
YRDWGLKYNSDIVHLRLFGVSFVIVNTADAVTELFSRRSSVYSDR (2)
LHSTMLHDY (2)
IGWEKAIVMKNYGEDWREHSRLFHQSFQPKVIQEYYPRLYEEARKLLPRLLKGD
DFVASLRVMTASAILGVTFGMEINDSNDPYVTIADRAIQSLVEAGMPGSYM (1)
VEYIPL (1)
MRYIPSWAPGGKFKRDAAEWRTLVSDMFTKPFEHIKSAIRHGVARPSIATSLLMG
LDDKQDNARREHVIRNVTGTAYVGAADTTVAALRTFILAMVMHPAIQKAAQAE
LDRVVGRDCLPTFADREHLPYLTAIQYEVLR (2)
WRPVAPL (1) GFPRRANADDEYHGYHIPKDAIVLGNIWAILHDPARYRDPAAFDPARWLTPDGA
LRADAGDAMLAFGFGRRICPGRHYAVANMWINMAYMLAAFDIAPPRDAAGRA
VPPAGEHTTGLLTYPKPFGAVFTPRSAAALRLIVADADD
 
>CYP5144C8 genewise.35.22.1 = GX.35.9.1 revised at micro exons 8/16/2007
e_gww2.4.188.1
MGVLILCAIAILAALLCYHHFRARRFRLPPGPKGLPIVGNVLDVPKDGPGWLTYE
RWSHEYGSDVIYLNLLGSSIVILNSSKATTDLLDKRSPIYSDR (2)
QRLTVLHDF (2)
VKGDRAFAFLGYGDEWRQHRGIFHKYFHGQAVHNFRPKMLEEARKVLVRLQST
DDYDRCFRVMSAASILGVTFGMDIEDINDPYVVLAEEAINYVLSAAIPGSFV (1)
VDSLPL (1)
LKYLPAWAPGAGFKCKGAEWHDLVSRMILTPFETLKQKMAEGTAKPCIATAIVQ
KLEESKGGDAQRETQIAQYVTGTAYTAAADTTVSSLCTFILAMVLNPEVQALAQ
EEIDRVIGTTSLPDYTYRDSLPYVSAIMYEVLRWRPVAPLGVPHRLMEDDEYEGY
HIPGGSLVVGNIWAITHDPVRYLNPDAFDPTRWLTTDGQLGDVTDALVAFGFGR
RVCPGRHYALEALWITLVHVLAAYRIEPPVDAHGRARPPSGEYLPGFIAFPAPFK
AVFKPRSPVALGLIQTALG
 
>CYP5144D1 pc.24.13.1 (genewise2nd.24.10.1) revised at micro exons 8/16/2007
 
MLDRSVLLPCLVGIVAAIIIVSRRGRERYRFPPGPKPLPLIGNLLDAPTDLGWYTY
AKWARQYHSDIIHFEVFGQHFYILNSVRVAKDLLERRSQVYADR (2)
QQSVMVQEL (2)
TGWHRVFSMKAYGESWRQQRRLFHQHFRQQAIPEYHAELTNGARMLLRSFLES
PNHFLEHIRHISGGTILAVLYGIDVDNYSAERMESIEKAIEIVTEIADGGVYL (1)
VDFIPL (1)
LKYLPTWFPGAGFKRRAAEWRVHVETMFEAPYGDVKRDMKLGKAKPCVATKL
MSAFGDKAEDPEIEELLICVTGTAYAASDTIVFAMIAFVRAMMVFPEVQCKAQQ
ELDRVVGRDRLPVISDQASLPYLAAVTKELLRWHPITPIAVPHKSTTDDWYDGY
YVAAGSIVIANVWAMFRDEERYPDPEAFRPERFLTAEGTLDPAVPDPVEVFGFGR
RMCAGRHYVDAALFLAIAHVLHALTIEKPRDARIPVVDPPPGYALSRLFWAPEPF
EADIKPRFEGVERLMQMSSLHSF
 
>CYP5144D2 pc.24.14.1 revised at micro exon 8/15/2007 and at QLSVMACEL
MLSSTILVFSSVCTLAAIVAIVRRFGGKRRHHFPPGPKGLPIVGNLFDVPTNFGWY
TFAKWAQQYNSDIIHFEVLGKHFYVLHSAALAKELFERRSQTYSDR (2)
QLSVMACEL (2)
TGWHRVLTLTPYGEYWRQYRRLFHEHFRAQVIPQYEDKMLTSARNLLRLLLETP
DRFLRHIRHASGRTMLDIVYALDTEAHNNAVILESVEKAIEIFAEVAEGGAYL (1)
VDHIPI (1)
LKYLPAWFPGASFKRQAAAWRVHVDTMYEAPYQDVNRRLLAGKAKPCITTSLI
SAFSDKCEDPDVEESLISFAGTTYAGSDTSVFKMTIFMRAMLLFPEVQVKAQEEL
DRVVGRDRLPELADKDSLPYISALYKELLRWHPLFPLAFPHKSTVDDWLDGYFIP
AGSLIIGNAWATLHDEERYPDPEAFRPERFLSDDGKLDPSVPDPVEAFGYGRRICP
GRHYADASLFLYIAHILFAFTIRKPLDERGNVIEPPPGVPEPFKASIKPRFEGVEELI
QLSTQLATSD
 
>CYP5144D3 pc.24.18.1 revised at micro exons 8/16/2007
e_gww2.1.419.1 [Phchr1:133291] ver2
MFDNGVTIALLLIGVTLLVAEALKKRHRFPPGPKGLPIIGNLLDVPKDYHWLTYT
AWSRQFDSDIIHLEALGQHYFVISSVDVAKDIFEGRSQLYSDR (2)
PQTVMLHEL (2)
TGWERNFAMMAYGDSWRRHRRLFHQHFRLQNVPAYHDQIAKGARNLAQLLLQ
TPDKFGRHIRHVVGAVILDIMYGIEVAPDDDERMEHLERAVHIFMELGQAGGFL
(1)
VDFIPA (1)
LKYLPTWFPGAAFKRQAMEWKPQVDAMYEISYNEVKDSMQRDQAKPCITSALL
TACWDDLDQSNMEETLIGVTGTGYAGSDTSVFALNAFVLAMMLFPDVQRKAQE
ELDRVVGRERLPSADDRDSLPYISAVIKELLRWHPITPTAAPHKSIADDYYNGYFI
PAGSIVIGNTWAMLHNEERYPDPEAFKPERFLTPEGTLDPHVPDPAEGFGFGRRIC
PGRHFAQASLFLNISNVLATCMIEKPVDEFGNVVEPTRECTSRFFWALKPFEAKIT
PRFEGVENLVQTMSTYTN
 
>CYP5144D4 Scaffold_252d seq next to CYP5144D5
e_gww2.1.429.1 [Phchr1:132914] CYP55% to CYP5144D5
revised at HETVMARDL 8/16.2007, added C-term
MQSNALVIACLVAGLLAVARASRKSRQRRYPPGPNGLPILKNLFDIPRTYSWLTYE
AWGREYNSDVV
HFEALGLHFVVLNSTEAAKELLEGRSHIYSDR (2)
HETVMARDL (2)
TGWHRHWGIMAYGDAWRQRRRLFHQHFRP
QAVPQYHEPMVRSARTLLQLLLESPDDWMRHVHHVSGGTVLKVLYAVDVDPHD
DEGMDVVDKALQIFMKLPEPFV (1)
VNFIPP (1)
LKHLPAWFPGAGFKRRAMEWKVHVDRMFEEPYHKIKVAAVTRPCIATSLLSAAW
EDLEN
PDVEEQLISVLGTAY
GEYNLALEQTIFSVYSFVIAMMLYPDVQRKAQEELDQVVGRDRLPEIADRESLPY
FSAVLKEVFRWHPVTPIAAPHKSLEDDWYKGYFIPAGTIVFGNTWAILHEESRYAE
PDVLRPERFLTPAG
TLDPAVPDPDEVFGYGRRICPGRYFVQDALFLYASHLLAAFTLSKFVDDEGHVEEP (2)
RLTCIFVTLRIPRAFKANIKPRYEGAERLVQMAFTSAS*
 
>CYP5144D5 PFF_252c = GX.24.18.1 revised at micro exon 8/15/2007
revised at HKTVMLHEL 8/16.2007, EST = DV762757.1
MSDSTLLAVGLIFGLLVLARVSRKRPRFPPGPKGLPIVGNLFGIPRDHSWLTYAE
WGRLYISNIVHFEALGQHFFVLNDEKITKEIFEGRSQIYSDR
HKTVMLHEL
TGWHRNWAFTPYGESWRQNRRLFHQFFRAQAIPDYHDHMAKGARGLVQLLLQ
TPENWMRHIRHASGSTVLDAVYAMDVDPNDNERLEGVERAVETLVEIAEAGGY
L (1)
VDFIPA (1) LKHIPTWFPGAGFKRQAMAWKRDIDALFERPYHEVKSAMGCRIFSEQRGKARSC
VTSSLMSMFSEKLGDPDVEETIMGIAGTSY (1)
AGSDTTVFTMFAFVQAMLLYPNVQRKAQEELDRVVGRDRLPEVADRQSLPYVS
AIVKEILRWNPILPAAVYHKSLADDWYEGYFIPAGSIVIGNTWAVLNDAERYPDP
EPFKPERFLTADGQLDSQVPDPVEVFGYGRRVCAGRHFAQTALFLYIAHVLALFT
IDNPLDESGHVIKPRRDCLTRLVPKPFKAKFKPRFTGVEELVHMSSIPTQ
>CYP5144E1 pc.8.2.1 (genewise.8.1.1) REVISED AT PRMPMLLDL 8/16/2007
fgenesh1_pg.C_scaffold_17000192
ADDED N-TERM AND COMPLETED C-TERM revised seq at EXXR
MGDTLALNGGIVLAVIVACSIALLLFQRRPHLP
LPPGPKRWPVVGNAFRFPKEREWLTFMRWSREFGALXDLLYYEMWGRPFVVIN
SHRAAVELFERKSALYADR (2)
PRMPMLLDL (2)
CGWAWDLAFMPYDETWKLARKLFTQHFRAGAAGRYRDEETRCARELLADILR
DDTQLFEHARVTFGKLIMSVTYGIDVRSADDKYITNAQKALYAITATGNVGTYL
VDSIPLLKYIPEWFPGAKFQREAREWREAAEAMSQRPVEDVKIAMAEGTARPSV
LRSLLEDYGETMSAQEAYAILSATGTAYEAAASETTWSATLTVVLAMLLFPEIQE
RAHAELDRVVGMDRFPVFEDQPSLPYITAICKE (0)
ALRWRTPLPL (1)
AVMHRVTQDDVYNGCHIPGGATVVLNSWAILFDPDQYPNPEPFAPERFLAPSGE
LAPDVPEPTAAFGYGRRACAGTAMALDTLWIVVASLLWAFDIRRAVDEMGNEI
DVAGEYTFGVVCYPAPFRCALRPRSEGIRSLISLPE*
 
>CYP5144F1 pc.240.2.1, e_gww2.5.175.1 [Phchr1:131322]
REVISED AT PEMPMLNDL AND MICRO EXON 8/16/2007
MVSAVLQGLAGLVIILLVRWAARQRQDRKQGPHPPGPPGLPLLGNLLDMPDNPS
WMTYIRWSEKYNSDILRLNVLGSNIIIVNSLDAANDLLDKRSAIYSDR (2)
PEMPMLNDL (2)
CGFGWNVAFRRYDDTWRNGRRVFQHELGPQVVKRFRALEEHATHQLLRNLLRE
PAEFMGHLRHMSAFEILRIAYGIEVTGREDPYVDTAEHAVGAVVATCSPGSYLV
NIMPFLKHIPEWVPGAKFKQDAKVWRRYVTELRDKPFSVVKERILRGDAPDCAA
KSLLESLESGEDTAGYTEDDIKYALGSMYA (1)
GGSDT (0)
TVSALGSFILGTVLDPAIQARAHADLDRVCPGRLPTFDDQPELPYIDAIVKEALRW
NPVLPIDVAHCSIADDVYRGYFIPKGSLVLANSWAILHDEAAYADPLRFHPDRFM
AGDALDGRVREPDAAFGFGRRICPGRYMAYDAIWIAVACMLAVFRIDKAKDAQ
GREITPSGEYNVGFAYPKPFPCDIRPRSSAHEALIRATAEDA
 
>CYP5144G1 pc.119.17.1 , gww2.5.320.1 [Phchr1:40563] ver2
revised TVCSISYEL and I-helix region 8/16/2007
MSSLLRVADAVLLCAALTIIYKLCLRQAKPSRLPYPPGPPGYPLIGHLSGPEGPGG
RSWVTFRDWSLQYGSDVIHLNMAGTHLIVLNTLGACSDLLEKRSTIYSDR (2)
TVCSISYEL (2)
CGLGWSFGLQRYGPEWRDGRKCFESQFNAHAVRKYRPALSREVSRFLHNLCTD
PVAWEYHVHHMAGALIMSVGYAIDVQAKDDPYLNAAEHAGECVQKTLVPGAF
LVDILPFLKYLPDWFPGVGFKQKARSWRKSIMYIRDAPYDVTKKRVVSAVVPDC
VAKDLMEKMVNNAKDPVYMERVARSAVGSMYL (1)
AGADT (0)
THSVLSACVLTLVLNPNFLPRAQASIDEVCQGRLPDFSDYEALPHVHAIVREAMR
WNPVVALNLPHRCTTDDIYEGHLIPAGSIVIANIWAILHDPAVYPDPESCNPMRYL
RCSPDGTVTLDPAVPNPADVAFGFGRRICPGRFMGYQTVWLALARMLAAFDIQC
ATDADGVPIVPRGEYDRPKPFECSIKPRSAAHAALVMQDVDAGEL
 
>CYP5144H1 ug.119.22.1, gww2.5.193.1
REVISED AT FGFTMLREL, FGF END DOES NOT HAVE AN AG BOUNDARY
POSSIBLE FRAMESHIFT IN THIS REGION MIGHT CREATE  PRLTMLREL
COMPARE TO LENTINULA EST EB011290.1 WITH PRMTMLNEL
REVISED MICRO EXON REGION AT I-HELIX
 
MASTSQTAFNAALGAACLVYLFAKIYWFVAAGRNLHGLKKLPGPRGWPFVGYLKAAE
RPAWLTYWRWSDEYDSDVVTFDVLGTTVVILNSLKAATELLEARSAIYSDR (2)
FGFTMLREL (2)
VGFDWNITVSEYGPYWRDSRRAFAHAFHPHAVARYRPAELKATHQFLRDLLNE
PEDFHGRIRYLAGRQILHIAYGLEIRDRADPWITAAEHGVEIAVKCIIPGSYLVDLI
PILKYVPEWFPGAGFKRQARIWKKEVTSIADAPLAAMEACDSLPDDSAAKPLLER
MLDSPDDPAYARHVLRGTLASMYI (1)
AGADT (0)
TTSTLATFFHALSRHPAVLYEAQRAVDRVCAGRLPTFADYDALPYIHALLRECLR
WRPVVPLNFAHRASKEDVYEGYRIPAGALVLANNWAIMHDPAAYTDPEDFNPR
RFLRARAGAGANGSGADDSLELDPGVRDPGVAAFGFGRRACPGRYMAYESLWI
VMASVLTVFDVLPAEGDEGEVEYTDGFLSTPKPFRCTIRPRSAAHAKLVYLALEL
DA
 
>CYP5144J1 genewise.24.21.1 = scaf252b = pc.24.20.1 = genewise2nd.24.19.1
46% to CYP205b (CYP5144A2), revised at micro exons 8/16/2007 added N-term
MWVVTCTCAAILYMVLVGLRKRHRFPPGPKGLPLIGNVLNAPAGSSAP
MVYQQWSKLFGSNIIHLKIFGTHFFVLNDAKTASDLLEKRSANYSGR (2)
SQTVMLFDL(2)
TGWDRDWGLLDYGDSWKKHRHVHHRYFHPKVLEAYHPRMEKGVQMLLQLLH
RSPADFNAHLRF
2537737 MTGHIIISIVYGIETKSADHPYIGLAEEGVKAFSATAVPGRFL (1) 2537865
VDSLPF (1)
2538014 LKHIPAWFPGADFKRQAALWKKDVDAMYETPFNDIKAAI 2538130
RRGETNTSIVGAALAELEGQADTEEAENIIMNVAGTAYPTASDTTIITLEFFILAML
QHPEVQRRAQADLERVVGNARLPSIHDQALLPYITAIMHETLRWRPPFRTVSLPR
KSLHDDEYEGYHIPAGSIMIANEWAILHDEARYPNPEEFDPSRFLNTDGSIDHTVP
EPVEPFGHGRRLCPGRHFAMDVIWLTIANILHVYSIEKAVDQSGNVVEPSGKCID
GLLSAPEPFQAVFRPRSDAAIALLRSID
 
>CYP5144-un1 PFF_45b pseudogene
 
PGPQGLPILRSLFHAPAQFQQLAFQDWGHRYIVLNCAQPASDLSDNRSTVYSDKG
GWDRNMGPLAHSAYWREHRRLAPHHFKDHVR
 
FTTAATPGMYL (1)
VDSFPM (1)
LRHIPAWLPGAQFKRAAARWRSIVEEVFDSGNPEPCLAACLPTSHRDREDTLMLE
NVVINTSGTAYAAASGTTTTTLLAFILTTMLYPDVQKAVRQELDSVVGQGRLPE
MADRNALSSIPALMKECLRWRPPLPLGVPHRSTAEDERGRRIQVPAGSTAIGNA
WXXXXXXXXYSDPEAVKPWRFFDAAGFGYGRRVCPGRHCLLDFVWLATANVL
VVCSIEEPFDGQWDTVEPSEQHTTGTIIFLAPFEAVFKXXXXAHVECVR
 
>CYP5145A1 ug.82.21.1 ESTs = DV765434.1, DV765432.1, DV759462.1, DV757969.1
MFYPPLVAVDVVFALLALYLIVRFLQKDRTLPFPPGPKPLPLIGNLLDMPSTYQW
VTFADWHDRYGDISSVTVLGQRIVILNSLDAAVELLEKRSAIYSSRPYMRMAGEI
LLWARTLVLSTYPGELFRDIRRFLHRYIGSRGQLERVAPFYELIETSTQDFLQRTL
ADPVRFVEHIRKNAGAIILNMTYGYKVQEGYDPLVDLVDRAVDGFVAASTPGSY
YVDIFPALQWIPSWFPGAGWKRRAEAWRADTQAMCDVPFEFAKQERLHGDNNS
KNFVSDNLASMETAQQEHHLKMAAGSLYSGGADTTVSAITTFFLAMTLYPEVQ
KRAQEELDAVIGTDRLPTLDDRERLPYTRALVSEVLRWNPIGPLGVPHVSTEDDV
YRGYFLPKGSMFIANIWCVPPSTYTYSEPLRFKPERYLGEQPEMDPRCAVFGFGR
RICPGACLNLAEASIFAVSAMALAVFDISKAVEDGVEITPKVEYTTGTISHPQPFK
CSIKPRSKKAEELIRG
 
>CYP5145A2 pc.82.61.1 (genewise2nd.82.23.1) revised near RYG 8/16/2007
MPSTHITALDIAVAIFALYIIRRLLQRGRTLPLPPGPRPLPLIGNLLDAPSAYHWETF
AEWNTRYGDVSSITILGQRMVILNSLDAAIDLLEKKSSIYSDRPVMPMAGEILLW
SQTLVLSPYPSDRFRDIRRYLHRYVGSRGQLERVAQTHQLIEDETRGFLQQTLRN
PLQFIAHIRKTAGAIILNMGYGYQVKEGHDPLVDLVDRAVNGFVAASTPGSFLVD
IIPALRWVPAWFPGAGWKRKALTWRADTRATCDVPFEFAKQEALRGSTSNNFVS
ANIQDIENAEQEYHLKMAAASLYSGGADTTVSAITTFFLAMTLYPEVQKRAQQE
LDAVLGAGRLPTLDDREQLPYTRALVSEVFRWNPIGPLGVPHVSTEDDEYRGYF
LPKGSIFIANIWYILRDPHTYTEPLRFKPERFLGEQPEQDPRAAVFGFGRRICPGAC
NARVA
 
>CYP5145A3 pc.11.258.1 scaf5 partial seq NOT ON TREE
54% to gx.82.23.1 (5145A2)
MQACLLEVFAHFPPVMAVLLVVLLLILTLFVLVTRKSSRHYPPGPRPLPLLGNIHN
APAQRQWKTFAAWKSTYGDVISLTIFGQRIVVLNSLESAIDLLEKRSAVYSDRPR
MVMVGELLGWAQQLVFAPYGEHFRNMRKILHKYLGARGQLDKIEPYHEIIEAAT
AKFLVRALRDDSDFHLEHNVHMTSGTINLRIGYGHNIAEGDDELVQMMDDALV
GFNRAAVPGAFLVDIIPALKWVPTWCPGTSWKRQAQEWKDLFVRMTEGPYAM
AQQQAERGGHDNIVSMSLSEDMSPEDHYDLKMAVGSLYGGGTDTTVAIILSFIL
AMTLHPEAQKKAQKEIDALTNGERLPVIRDREELPYVRALISEVLRWNPLVPLGV
PHRAIADDVYRGYFIPEGSTIVVNMWQLAQDPEVYADPEVFKPERFLGPDSERDI
RTFVFGFGRRICPGLNLAEASTFAICARILAVFEIGKVVEDGKRITPDFAFRDGAV
RFV
 
>CYP5146A1 pc.59.8.1 , fgenesh1_pg.C_scaffold_1000663 [Phchr1:663]
Phchr1/scaffold_1:2066931-2069520
MDPATVAVAVVCALAVMHVLTRRARTRLPYPPGPPEDPIIGHLRQMPNNDEAAE
VWYRWAKQYGDVMSLNVLGKRLVILSSEEAATELLEKRSSKYADRPRFPIFERIG
WKDMVLLMPYGPYHKTLRKMIQVPFEKDKAFQFRDIQERATSIMLHNFLADPK
GIEHHTHCRYVVSIIVEIVFGHRILSEDDEHLKIADVFVKIQHEASQPSLLDVSPLF
AKLPSWFPGAWFVKYIEDTKAILSHAIHHPVSIVQEQLASGIAKPSFVADELERLI
KAGQLTPQNKYDVSIAAHMIFGGGTETTWNTLTTFIACMLLNPEAQRKAQEEID
KVVGHGRLPDFTDRDSLPYVECVVKETMRWHPVAPVAVPHKATEDDVYRGMY
IPKGAIVIANARSITWDERRFHDPHAFKPERFLPRPLGAGEDFVQGAVYGWGRRI
CPGRHLAGDMVWIAIARVLAVFDIQKARDADGNAIEPNIEFTTAVHPKPFPCELR
PRSEKAASLIKESYELHSID
 
>CYP5146A2 pc.59.23.1(gw.59.18.1) gww2.1.504.1 [Phchr1:38849]
Phchr1/scaffold_1:2095089-2097090 adjacent to 5146A4
note by a typo this was called 5148A2, but it is really 5146A2
MGFLTALLLVFVLLCVALVRAVRRRRARPPYPPGPPADPLIGHIRIMPSTDNAHE
VFHDWAQQYGDVMSLDVLGTRYVILNSAEAATDLLEKRNSKYADRPTFPMYER
VGWKDTLVFLPYGPYFRKQRKMLQLPLEKERVTDFRHIEEQETCVMLYNILSDP
DNTDAFVHRRYTTAVTMELAYGHRVVSNDDEYLKAADMVIDVLRSVTRPSLLD
VSPIFEYLPAWFPGAWFVKCIKEIKPVVLREIQHPVSVVQQELMAGTAKPSFVSQ
QLEDLSRENGLSQEDLYTVSMVAHQIFGGSETGWHTIMTFIACMLTNPDVQRKG
QEELDSVVGRGRLPDFTDRDSLPYIDCIVKETMRWQPVVPLSVPHKAMEDDEYR
GMHIPKGATIIPNARGITWDERHFHEPRTYKPERFLPRPQGAGEVFPQGAVFGWG
RRLCPGRYLADDVVWLAIARILAVFDIQKAVDADGNVVEPHIEFTTVLTSHPKPF
PCSLRPRSEKAAELVRQAYDMHMANVAV
 
>CYP5146A3 pc.59.24.1(ug.59.33.1) gww2.1.413.1 [Phchr1:38101]
Phchr1/scaffold_1:2097942-2100000
MGFLTALLLALVLLCAIWVRAVRRRRGRLPYPPGPPADPLIGHIRIMPSTDVAHE
VFHGWAQQYGDVMSFSVLGTRYVILNSAEAATDLLEKRNSKYADRPTFPMYER
VGWKDALVFLPYGSYFRKQRKMVQLPFEKEKVTDFRHIEEQESCVMLYNIFSDP
DNRDAFVHRRYTTGVTMELTYGHRVVSDEDEYLKAADMIIDVLRSVTRPSLLDV
SPLFEKLPAWFPGAWFVKCIKKTKPVVLREIQRPVSVVQQKLMAGTAKSSFVSQ
HLEELSREKGLSEEDLYTVSMAAHQVFGGTETAWHTIMTFIACMLTNPDVQRKG
QEELDRVVGSGRLPDFTDRDSLPYVDCIVKETMRWQPVVPLSVPHRAMEDDEY
RGMYIPKGATIIPNARGITWDERHFHEPRTFKPERFLPMPQGAGEVFPQSAVYGW
GRRICPGRYFADDMVWLAVARILAVFDIRKAVDADGNVVEPRIEFATVLSHPKPF
PCSLQPRSEKAAELIRQAYEMHMANVEA
 
>CYP5146A4 genewise.59.20.1 e_gww2.1.326.1 [Phchr1:133267]
Phchr1/scaffold_1:2102681-2104782
MGPTAFVAILLCAVLLVQAVRRRRTPLPYPPGPPADPLIGHLRIMPDTSTAPEVW
HSWSRKYGDVMSLSILGKRVVILNSEEAVTELFEKRGAKYADRPSYPLYERVGW
KDALILLPYGTYYRKLRKMLQLPFEKDKAPNYRHIQEQEACVLLHNFLRDPSSVE
SPIHRRYTAAIIIEIAFGHRVLSDDDEHLKAADMFVEVQHGAGRPSLLDVSPIFEKL
PSWFPGAWHVRYIKEKRPMILHAIQHPVSVIRQQLVDGTANPSFVSQQLNDLIRE
GGLTPENQYDLSIVAHMIFGGGSETTWNTLTTFIACMLMNPEVQRKGQEELDRV
VGRGRLPDFTDRDSLPYIECVMKETMRWHPVAPLAMPRRAIEDDEYHGMYIPKG
AMVIANISLTWDERRFHDARSFKPERFLPKPEGAGEVLPPSFAFGWGRRICPGRY
LADDVVWIAVARILATLDIRKPKAADGSIIEPRIEFEAALTSHPKPFPCEIRPRSDK
AAELIKQAYDMHMASVET
 
>CYP5146B1 pc.59.19.1 , e_gww2.1.427.1 [Phchr1:132579]
Phchr1/scaffold_1:2085390-2087317
MLVLLGFVSLVLFLVYRRSVRASRGRLPPGPPADPIIGHMRVFPRANHGEVFHQ
WSKQYGDVLHLDVLGKSIIVLNSQEAANDLLDKRSANYSDRPEFPAFNLLGWDS
MLVFLRYGPAFLRQRRLMQQPLTRTGVVVFRPVQLQQCHVLLKNLLASPKDFD
AHLRRRFASAITLEMTYGHKVSSDDDAYLDIADKVNVVLTKMSKAAILDLFPRA
KHLPSWFPGAWFIRYANDHRHLIWEMASKPFEQVEQQLAAGTAQPSFVSMHLEE
MHRQNTHDADNVSALKTAAAHMWTGGEETSTLLIFVLAAVRNRDAVRRAQAE
LDRVLGPGRLPTFEDMDALPYVEAFIKETIRFHSALPLGIPHRAMADDVYRDMLI
PKDATVLVNSTALARDPAAYSTPERFWPERFLPPHSEPPPVGLGFGWGRRVCPGR
HLAEASLWIVVASMLAAFDIAPVQDAHGRDAPPELRFTQAITFSEDSHPEPFECSI
TPRSEKVAQLIMQL
 
>CYP5146C1 pc.175.6.1 (genewise2nd.175.5.1)
scaffold_6:685560-687500 region
MMPYILGVTLLLLTVIVLRVLRARASRAAPYPPGPPAYPVIGSVGPFPAHEPHLGL
AELAKKYGDVMYFEIFGKPLVVLSSLEAASDLLEKRSAIYSSRPRFAVHEMIGWT
DMVSFLPYGEQFNKQRKFFLHTFSKQGCLVFRSSQVAQTHLLLKNILQCPTRYIE
YLRRFSTAVIMEIAYGHKVSSEDDPYVKIAEDTNNVLMAAGHSLALVDFLPWLR
HLPAWFPGNWFARVAQESRPVIQRMRNFPFDQVVQQMASGTASPSFVSMQIEEL
ERDGGASPENLHILKIAASQMYGAGAETTWSTIMNVIAFLLLHPTAQRKAQDEL
DSVLRGERMPDFDDRKSLPYLDALLLEVMRLQPTAPLGVPHSSTTDDIYRGMFIP
KDSIVLTNTTYALAMDDRVYRNPTEFRPERFLPPHAEPNPNGIVFGWGRRICPGR
YLADTSVWIVMASFLTVFEIVPERDRNGQDIIPEIQWCSAPFPCVIRPRSEKEVKLV
SRL
 
>CYP5147A1 ug.50.27.1
MPLLLVDIAALVAGLVLLFMLDSWRRKGQHLPPGPPGLPFLGNILQIPRKKEFIVF
RDLGNIYGDIVTLRVPGQIFVILNSRKAVADLLDARSQIYSDRPRTIMCKDLIGWD
GSVVLSNNTPRFRDCRKLLRKGLGPSAVQSFIPFLNRQSAFYLENLQKRPEAFVDI
FKRNAAAISMKIAYGYDGIQDDEELYGIGAMANHYFAETAVVGVWPVDMLPIL
RHVPQWFPFAYFKKYAAQAKPIVLESVNRPFEETKRHMKRGTAGGSFTSMLLED
AKGDPETEDCIKWSGTGIFLGQMDTTTSALSWFFLSMVLHPEVQAKAQAEIDKV
VGNERLPRFEDKESLPYVSAVMQEVFRWHPVVPMIPHALSKDDEYRGYFIPAKT
SLIGNIWAIMHDESLYPDAEDFRPERFTEDGAPDCLNVAFGFGRRVCPGILIAQAH
VFVSIATTLATFNITKARDAQGNIIEPVVEDTPGAINFPQPFKVSLEPRSAAAADLI
RRSAEHSKTLPERLEIFSLDA
 
>CYP5147A2 ug.50.57.1 this seq matches an unannotated region scaf 50 178-179kb
ug.50-57 is from the first 5 kb of this scaffold, about 170 kb away from this seq.
this matches my scaffold_15f.  The ug # is wrong.

MLDVGATAAAGLLLVFLLGLGLMKTQHLPPGPRGLPLLGNVLQIPRRLPHVAFR
DMGHKYGGDIVTLRVPGYNLLVLNSRQAIYDLLDSRSAVYSDRPQGTIYRKLLR
KGLGASAVQSFIPFLNRQSALYLENLQSRPEAFVEITK
RNAAAISMKIAYGYDGIADDEELYRLAHQTTIYFAETAVLGAWPVDMFPILRFIP
SWFPLAYFRRYAARARPVVVECINKPFEETKRHMRLGSAGASFTSMLLGDANGD
PDTEDYIKWSSAAIFLGQMDTTTAVLSWFYLAMALHPEVQAKAQAEIDQVVGN
ERLPHIEDRDSLPYVCAVMREVFRWHPVANLVPHATDKDDHYRDYFVPAQTVA
IANVWAVLHDEDVYHDADKFIPERFSEEGAPDSLEIAFGFGRRACPGKVVGQAH
VFASIATVLATFNITKARDAQGN
VIEPEVMDTPGAVNTPQPFKVNIEPRSEAAVDLIRRSAEHSRTLPERLEIFSLDA
 
>CYP5147A3 pc.16.140.1
MLLFAAVSTVLALLLASVLTKARRKARNLPPGPKPLPFIGNAHQIPPENEWIKFKE
WGDEYGDLVKIKIPGSMLYIVNKRKVVDELFEARSAVYCSRPNFIMASLSGWDH
SIPTLPYGQRLRESRKLLKKGTSPAAVKTYHPYINRDLPFFLENMLSTPDKFVEHY
NRNAARIALKIAYGYEGITEDERIIQGGVKAMEVFSATAVPGVWAVDTFPFLRHL
PSWAPFSSFKGFAERCKRITDEALNTPFYEVKQRLEKGTADGSFTSVMLSTEKLD
PETEEIIKWCATGIFTGQFDTTTATLSWFTMAMAKYPDVQEKAQAEIDRVVGRD
RLPEVGDRDSLPYTWAILQETMRWHPTVALVPHTAIQDDTYGGYFVPAGTTVIA
NVWAMMHDENVYHDADKFMPERFYEEGAPDSLSVVFGFGRRICPGLVVAQTH
MFVTIASILATLNISKARDNAGNVIEPREDAKSGVINFPKPFQVSITPRSDAAVQLI
RRSVEHSKTLPDKLELFSP
 
>CYP5147A4 pc.16.141.1
 
IFTAVLAVTVVSLLTRRKNGRHVPPGPKPLPFIGNALQIPPQHEWIKFKEWYGPRL
RESRKMLKKGMGPAAVKTYYPYINREVPFFLENMLRKPDSFVEHLKIAYGYEGV
TEDEKIIRGGVDAMEVFSAVAAPGVWVVKYVPSWFPFAKFKKFAERGKKITDEA
LDTPFYEVKRRVTTATLSWFTLAMAKYPEVQKKAQAEIDRVIGKDRLPEVGDRD
SLPYVWAIMQETFRWHPTITMSGYVPHTAIQDDEYRGYFIPSGTTLMANIWRERR
SWPISGACPLRLMFEVIYHVHRGILHDEKLYHDADKFIPERFCDEGAPDSLSVAFG
FGRRRRVCPGLVIAQTHVFVTMASMLATFNITKARDSTGAVIEPREDAKSGVIPK
PFVVSITPRSDEAVTLIRRSV
 
>CYP5147B1 pc.50.95.1
MLLQVLAAIAALFVLSGLLNSRRRNMHVPPGPKALPLLGNVLDIPKKDVHVTFR
DWANIYGKDLMKVEMPGETLYVISNKKVMVDLFEARSAIYSSKPTMTMADSSG
FKNSIPLLPYNARLKTSRRLLKQGLSPAAVRSYFPYINNRTALFLEALLKDPDDFV
RHFTRTAAHTALKIAYGYEGVTEDHHLLHTAIETMEIFATVVNPGRWLVDTLPIL
DRIPVSFPFANFKRVQEESRPVVFETVSKPFEEVKKHLAEGTADGSFSSYLLQSEK
PDPETEDCIRWAATSIFLGQFDTTTATLSWFTHAMVKFPEVQKKAQEEIDRVIGN
DRLPEIQDRDSLPYVNAIMKEIFRWQPIISMLPRSVVQDDEYNGYFVPAGTYLLA
NIWAVLHDPEVYPEPEKFMPERHLKEGVPNPLDVTFGFGRRVCPGMQVAQSQTF
GTMAAMLATLHLKPQKDEHGRDIIPETRTVDGLIRFPVPFKCAFVPRSEAALKLIE
RGAEHARSVPDRLERWSD
 
>CYP5147C1 pc.50.96.1

MVQASDTLLSVVVCAALITLTSFLLSGYRKRNAHLPPGPKPLPIIGNLHQLPDLKA
DRAVAFRDMSLAYGSDILCVKVPGMLMYILNSKESMFDILVTRSAKSSSKPPQV
MADEXLSGWKYTVPSLPYGQRIKTSRRLLHKGLGPSAVQSYIPYLERESAFFLEK
LLDQQDAYKKHVTHTAARIALKIAYGYEGVTDDAHLIDTAVKAMNIFCVTATPG
IWLVDSLPFLQHMPSWFPGTGFKKQASQWSQTVLYAINHPFEELKRQMAAGTA
GASFAGRLLEVEDLSDPEVEDCIKHCSTGIFAGQFDTTTAVMSWFAVVMAFYPEI
QKKAQDEINKVVGHERMPVVADRDSLPYVNAILKELLRWRPVLPLIAHSVNEED
EYKGYYVPKDTVILANVWAVLHDESNYDEPEKYKPERFLRDGILDPSVLDPATL
AFGFGRRICPGMHIGQTLLFILMSRTLQNFDIAPAKDAHGREIAIDTSAVPGLIGFP
KPFKVSLVPRSNAHATHIRHAAEHARSLPDKLAIFEL
 
>CYP5148A1 pc.79.37.1
RWAKTFGPLYSMWIGNQLFVVISDPQIVKDLVITNGAIFSSRKDMYVKSQIIFRGR
GITTTPYGDTWRKHRRLASQFLGNRVVSGYLSGLEYEVQDMLCGLLTDGQAGF
VPVSPQAYLGRLALNNIMTIVFGTRTGSIDDPFIHHWLTLSREFMNCTGPVSNWV
DFVPFXCMAKFLLDVKDKERLDDLDIILLCCGFLVGGVESTAAIKQWFAAHISVL
PEVQAQAQMELDRVVGRDRLPQAEDAKDLPYVRAIVKEIERVHNPFWLGTPHM
STEDFSYRGYKIPKDTAVILNTYTMHHDSQRYPNPEKFDPDRYIDDERSSAESAK
LADPYQRDHWTFGAGRRICPAIALAEHEIFLSVAGLLWAFDMRQLSDAPIDLKEY
DGLSGRSPVPFCIRLVPRHERVAAVVGASCHTATCRNG
 
>CYP5148A2 ug.50.57.1|whiterot1 not same as Yadav’s = pc.50.1.1 + pc.50.2.1 (This is the correct ug number)
this whole seq not in Yadavs collection
Note: a typo named another seq 5148A2, but that seq was really 5146A2.
This is the correct 5148A2 seq
TFVLLVAGILYIVLPFFFRKNLVDKNGNSIPPGPLLRLPYLPDYPERTLHAWAQKFGP
LYSFFIGNQLYVVVSDANVARE
LLVNNGAIFSSRKQYFTKNQTILRGRAITASPYGETWRQHRKIAAQLLTPKAIQSYNN
VLDYEARIMIRSMYKESMQGAV
PINPAHYTGRYTLNNMLTISFAMRTESTQDPLIQRILAMAMEFNDLTGPFSNLVDFIE
PLQWLPTKTHARAAKLHDDFIE
VYGSMVMAVKERMDAGENVPHCLAKVLIEGQQQEKLDWEDVCMLSAAFALGGVHSVSG
LRWFLALIGKHPDIQARAHHEL
DAVVGRDRWPMAEDEKDLPFIRAIIKEVLRVHAPFWNATPHSSTEDFVYNGMYIPKGA
AVILNCFTLHHNEARYPDPYVF
PHCASYCLCANAMERDHWSFGAGRRICPGINVAERILFLAISRLLWAFTVH
 
>CYP5149A1  gx.62.25.1 MORE COMPLETE VERSION OF pc.62.64.1
SEAYLPPGPPALPFVGNLFQLPRKSVPRTFATMSQQYGPLYY
MRVINRHFVIVNDLDLARILFDKRGAIYSHRPRLPMAQEVVKRDTMLFMNYGPE
FRKSRKLVSTFLNQRNASKYWPAQEIESLKFVLAVQRNPSDWLKLTRWTATSLV
IRLLYGIEVQDKDDALVGLAEDFARLTTETTEPGRWLVDAFPILRHVPAWLPGAG
FKRWAKRAKARMDEFATLPYVMAKDKIEKGDITPCWTAEKLLETTEPLTEQDE
KEIRHTATSMYSGTNAMVATFILLMLHYPEVQKKAQEEIDSVTGGTWVPGMRD
RERFPYINCLVKELFRFSPAVPLVPHSLHEDDVVEGYLIPKGSWVMANMWAFMH
DEARYPDPETFTPERFEARPGVEPQDDPLDIVFGFGRRACPGYLLGVASVYLNIV
HLLFAFDIAPVKDAAGASVLPPIEFSDGHVA
HAKPFECDMRERSAERIALIEHTA
 
>CYP5150A1 pc.24.95.1 red is new POSSIBLE GC BOUNDARY AT ETMRL
scaffold_1 (3273479 bp) : 2653416:2656338 (2923 bp)
MAFSLPLISATVLLWVLWKVFRNYVVSSPLDNIPGPPRSSFWS (1)
GDSAIMYQRHGWSFHDNISEKYGPVSTTHTLLG (0)
ARGLYVYDQPKALNSIMITDQDSYEEPAWLLQ (2)
SSHAIFGPAVFIAQ (1)
GEQHRRQRRVLNPVFSGAHMRHMAPVFYDVAHR (0)
LRx (frameshift)
AVSAQIHDPASSEIDVLEWTSRAALELIGQGGLGCSLDPL
VADSNNDFGTALKTLI (2)
PLISGLHFYRMLMPYITPFVPLSVRRLFMRWVPHKNAQ
QLRAATEDLWALSRQIYEEKLAAVAKGDNDEVFEGQDLISIL (1)
IRSNTAAAAEDRLPEEEIIAQVA (2)
GLILAATDTTSSALARILHILAERQDIQDKVRAELVEAAGEGEDI
PYDQLVNLPWLDAICRETMRL (2)
HPPANIVNRE (2)
ARTDVIMPLSEPVQGRDGTLIHEIAVPKGTLVTISVRGCN
RNKAIWGEDALEWKPERWLKSLPETVSGAHIPGVYAH (2)
MTFIGGARACL (2)
GFRFAQLEM (1)
KVVLAVMLRSFRFQLSDKEVYWNFAGVVFPSIGRDGKTASMPMKIETL*
 
>CYP5150A2 fgenesh1_pg.C_scaffold_2000178 [Phchr1:1200]
= gx.121.14.1  LOCATED ON VER 2 SCAFF 2, 527-529KB
58% TO GW.95.20.1, 53% TO PC.24.95.1, 51% TO PC.10.108.1
55% to 153.5, 49% to 66.11
MTSAPLLAFGAALIAIIWTLFQGYLVKSPLDNIPGPERSSFWLGNHGDIFNRHAWNFH
DRARQLFGPVFR
FWGPFASRGLFVYDPKALNSIIVKDQLIYEESRWFISWNKYAFGLGLLSTLGEHHRKQ
RKLLNPVFSINH
MRHMAPIFYQTTHRLRTAITAELEASSADVDVLNWMGRLALELIGQGGLGYSFDTLVA
HTHNEFGDAIKG
YVPAILNVIILRNIIYPYMDEYIPAKVRRFILDILPFRSVRRIQKIIDDMHSHSRRIF
NEKKAALEKGDG
AVLHQVGEGKDIMSILLKANMEASDEDRLPEDELIGQMT (2)
TLVFAATDTTSNALSRILELLAKNQ
DVQDKMRTELIAASPDGEDIPYDTLVALPYMDAVCRETLRLHPPVNMMSRETREDVMM
PLSEPIQGVNGE
TISEIFVPKDTSVIVSIRACNRNKAIWGEDADEWKPERWLSPLPEAVGNAHVPGVYSH
LMTFLGGGRACM (2)
GFKFSQLEM (1)
KVVLAVMLRTFKFFPGKNEIYWNMGGVNYPTAGKDSNKACMYLRLERIAS*
 
>CYP5150A3 pc.153.5.1 LOCATED ON VER 2 SCAFF 2
scaffold_2 (3043971 bp) : 1623404:1625693  (-) strand
(whole gene range)
57% to 95.20, 56% to 121.14, 51% to 10.108, 51% to 24.95,
47% to 66.11
MVPPSTFVLCALGGWVFWKLFRGYFTRSPLDNIPGPRRASLLK (1)
GNAHQLFNRHAWGFHERISQEYGQVVKFHAPFG (0)
GRGLYVFDPKALYHMIVKDIATFDEPRWFLQ (2)
MADFTFGPGLFSTS (1)
GQQHRKHRRVLNPVFSINHMRNLAPLFYTVAHRLRDGLSTQLTTSTGGE
VEILGWMGRAALELVGQGGFCHSFDQLDKNVPNAYRDVLKEVM (2)
PSQIALHFWRILLPYAVEYVPARIRRFLAPW
LPHPVMQKYRNICITMDEQARAI
YHAKKVALEQGDKLVEHQACEGRDILSVL (1)
VTANKQESVEDRLSEEEVIALIS (2)
TLAFFAATDTTSNAMARILHLLAEHQHVQDKMRLELFEAGM
DGEDIPYDRLVELPYLDAVCRETLRL (2)
YPPVLFMNRE (2)
TRQDAVLPLSKPIQGLDGDVLTEIMVPKGTLLMVSIVACNRNKALWGEDVLEWKPERW
LSPLPESIREAHVPGVYSHL (2)
LTFLGGGRACM (2)
GSKFAQLEM (1)
KVLLCTLLRSYRFMPGTKGVYWNVGAISYPTPNEIDEKAAMYLTMDHIQAPQ*
 
>CYP5150A4 genewise.95.20.1 57% TO pc.153.5.1, 55% TO
pc.10.108.1, 54% TO pc.24.95.1, 51% TO gx.66.11.1, 58% TO
gx.121.14.1,
fgenesh1_pg.C_scaffold_2000729 [Phchr1:1751] = gw.95.20.1
scaffold_2(3043971 bp):2185130:2187482 (2353 bp)
whole gene range
MATTTHGLALYALCGLCVWALWRVLRAYVVKSPLDNVPGPERTSFLK (1)
GNTHQIFSRHGLDYLQELGERYGQVVRYYAPLG (0)
ARGLYVFDPKALNHIVVKDQAIYEEPRWFIR (2)
LNRLLFGPGLLSTL (1)
GDHHRKQRKLLNPVFSINHMRHMTPIFYNVVHD (0)
LRDAVAEQVKDTPT
EVNVLDWMLRTALELVGQGGLGYSFDALSAQKRNVYGEALKELL (2)
PTVFALHFWRVLLPYVGAVVPAWVCRAAAPFLPHAAMQKLRRVV
GAMDAHSRRIYEMKKGLLEKGDAAVVHQVSEGRDILSIL (1)
MKANREVDEEDRLPEDEIIAQMS (2)
TLVFAATDTTSNALARIFQLLAEHPDVQDRLRAELT
DAAPDGADIPYDALVVLPYLDAVCRETLRL (2)
HPPASFMNRE (2)
ARADAVLPLAEPLRGADGAPITEIAVPRGTPLIIAIRASNRN
SALWGADALAWRPERWLAPLPDALAKAHVPGIYANL (2)
MTFLGGGRACM (2)
GFKFSQLEM (1)
KVVLAVMLRSFRFLPGDKEIYWNLAPVAYPTVG
KTSTKSELYLKLEPLKT*
 
>CYP5150A5 AADS01000067.1 = pc.10.108.1 54% TO pc.24.95.1
scaffold_8 (1906386 bp) : 331157:333480 (2324 bp)
whole gene range
MGPLQLVLVATAA
5407 WALWRLFRHYILRSPLDNIPGPASSSFIY (1) 5499
5550 GNLKEMFNRHGWGFHDTIIRQYGPIATVHSMLG (0) 5648
5713 ARALYVYDPKALNHIILKDQYTYDEPGWFLE (2) 5787
WHRMIFGPTLIATT (1)
5936 GAHHRKQRKLLNAIFSIARMRDTAPIFYNVAHR (0)
LRDAISADLN
6116 QGSGEINMIEWFSRAALELAGRGGVAYSFDALEANSENSEF
GMTVKQYR (2)
6345 PTTIALHFWRILSPYASLYIPRVVRLALGRLVPHKDLQ 6443
RIQMIAHAISAQSKKIYDFRMAAFQRGDEDAVREISEGRDVLSYI (1)
IRAGLNSEEERLPEEEILAHMS
6752 SGLILGATDTTSNALARTFQLLAEHQDVQDKMRAELA
DAAPDGEDIPYDQLVHLPLLDAVCRETLRL (2)
NPPIGLLARE (2)
7091 ARDDIVLPFLQPVHGRDGSLINEVPVPKGSTVFISVRACN
RNPLIWGEDAAEWKPERWLQPTPKSLSEARVPGVYANQ (2) 7312
7377 MTFLGGDRACL (2)
GFKFSQLEM (1)
KVVLAVLL 7556
7557 RSFRFLPCNKDVYWNLAGITYPTLGKDSDKLELPIKLEIIGGQY* 7676
 
>CYP5150B1 fgenesh1_pg.C_scaffold_2000954 [Phchr1:1976] =
gx.66.11.1
LOCATED ON VER 2 SCAFF 2, 2927193
51% TO 95.20, 45% TO 24.95, 44% TO PC.10.108
49% to 121.14, 47% to 153.5
MSLFALAIPWVACGLVLMLVYRLVRDYVVSSSLEDVQGPTPRSLIY (1)
GNLPELQNRGAWPFLDHLTNDYDRVVRMRGMFG (0)
KRILWVADPKALHHIVVKDQDIYEEAPSAITF (2)
GRKLGMGPGLLSTL (1)
GDHHRRQRKLLNPVFSIAHLRRVTPVFYEVMNR (0)
LSKGIEKQLDTTSNNEVDLLAWMGRAALELIGQGGFG
HSFDPLVEQTPNPYADAVKSLV (2)
PAITGLIFYRMVIHLVDPLVEACAAHPALGAFIKRWFW
LVPNARMQHVKTIFDVLHDTSTAIYTEKKIALDSDDPELKMRVLEGRDLM (1)
SVMLRENMNADAADRLPEREIIAQIT (2)
TFIFAGTDTTSNALARILHLLCLHPDVQEKLRAEIIEARAQNGGGDLDYDALVALPYL
EAVCRETLRL (2)
YAPVPFVSRQ (2)
ARSDALLPLSAPLTLRDGSRVSALHVPQGTSVLVAIHSVNRSAALWGADAHVWRPERW
LEKLPDAVAEARVPGVYSNL (2)
MTFIGGGRACM (2)
GFKFSQLEM (1)
EVVLATLLSSFRFSLCDGKNADIVWNRAGIAYPTVGNDGGHPSLPLRVERLKC*
 
>CYP5150C1 EB077269.1 Trametes versicolor cDNA clone
52% TO CYP5150A2, 50% TO CYP5150B1
SHVMAQVLQLLSEHPDAQAKVRREILEAGDGYIPYEKLHSLPYLDAICKETLRMYPPTPI
VLREAFRDTTLPLSQPMRGSDGSMLSHIPIPKGTNVLVGVRACNRNKALWGEDAEEWKPE
RWLAPLPKAVEDASIPGVYSNLMTFVGGGRSCVGFTFSQLEMKVVLSSLLANFTFQLSEK
PIFWHISAITFPSAAKDSLKPEMWLKVGRYTGEAA*
 
 
>CYP5151A1 pc.8.82.1 (name is not right pc.8.80.1 is correct (-) strand)
C-term is 37% to CYP5033A1 Ustilago maydis 30% overall
name error, CYP5144E1name assigned twice see pc.8.2.1
Phanerochaete chrysosporium cDNA clone DV765344
MESFTLAALSLTTACAAALVAYLLYLCV
VYPLRNPIRQLPGPPSKWFLELRHMYMTMD (2)
PRRSPHTAAEFVEKYGRNVYIRGPVPWDQRLFTLDPVTMNHVLQHT
AIYEKPWPSRRLISGLIGAGMLSAEGQMHKRQRR
VATPAFSLNEMRALIPLVFSKGT
ELQKKWMEIMRDAGVKPGQ
GHVVNVCSWA
SRATFDVMGSAGFDYEFNAIQNEDNELLRAYVDMF
ETAVSKQKAGLRSVLVMYLPIIDKIF (0)
159857 PNETTRFVSKCQTVIERVAGTLIQEKKRKMADAAVKGQVY 159738
159737 QGKDLLSLMRKY (1) 159702
159661 VKSNSAVDLPPDQRLS159572
DQDLLNNINTFMFAGTDTTSLALTWTLYVLALYPHVQDRLRAELLS
VAPIAPIDTLTPEE
VQSLYAEIAALPFLENVIRETLRLIPPVHSSIREATRDDVV
PVSAPLKRTTPNGRVVEEQVHQIVVPKGTFIHVPIEGFNLDKGL
WGETAWKFD (2)
PDRWDNLPETIKELPGLYQHTLTFSAGPR (0)
ACIGMRMSVIELKSFLFTLVTNFKFAPDPTQKIGKANV (2)
ILTRPYVAGKQNEGSALPLIVTPYVREDAS*
 
COMPARE TO
>CYP5151A2 AACS01000049.1 Coprinopsis cinerea strain okayama7#130
3476 SCIGMRFSMIEIKTFLYILVTRFVFKPTKDKIIKSNV 3586
3642 VLTRPYISGKYREGSQLPLIVTPYI 3716
 
 
WHOLE PREDICTED SEQ 55% TO PHANEROCHAETE
MNPILLSGLAAASTVLTWVVYRVVIEPRFNPLLKLAGPPSPGLF
GTNLAPVLSPTVSPRLHEVYAENYGRSMRIRGVGPWDERLLTLDPVSVAYVLKNSTIY
EKPWQSRALITSLIGCGMLAAEGQVHKRQRRVGTPAFSIQNLRGIVPLVFKKGTELKD
KWMEMIETTGEVRGSDEKEKSMVVDVCHWVSRATFDVIGVAGFDYQFNAIQNESNELF
NAYKEMFEIAISQGDGIITLISIYAPWIHKIFPNQVSRTVERCQEVIRRVAGQIIQEK
KRKIAEGEASGKPYQGRDLLSLLLKSNVAVDLPEDQRISDEDILNNVNTFMFAGSDTS
SLTLTWTLWLLANNPEIQDRLRAELLAAIPDTELTADISSLNEDEIQTLYGIIAELPL
LNNVTRESIRLIPPVHSSIRVATQDDEIPTRYPVKLADGTIDTKQSVKIAKGSFVHVA
VEGFNLDKEFWGADAWDFNPDRWDDQPETARQLPGLYNNTLTFSAGPR(0)
SCIGMRFSMIEIKTFLYILVTRFVFKPTKDKIIKSNVVLTRPYISGKYRE
GSQLPLIVTPYIPSAEH
 
 
EB072414.1| TverSEQ10129 Trametes versicolor pBluescript (EcoRI-XhoI) Trametes
versicolor cDNA clone TverSEQ10129, mRNA sequence.
Length=897
 
PNELNAAFQEVFNPGANFTIFTILKNVFPALDIFPDERAKRLDHAQDVMRRIGLQLIEEK
KAQIAREMSEGKSGGVERKDVQGRDLLTLLMKANMATDIPDNQRLSDEDVLAQVPTFLVA
GHETTSTATMWCLYALTQAPDVQKKLRDELFTLQTEAPTMDELSSLPYLDAVVRETLRIH
APVPTTMRVATKDDVIPVSEPFVDRRGKVQDSIHISKGSPIIIPVLSLNRSTELWGADAL
EFRPERWINPPETISSIPGVWGHILSFLAGPRACIGYRFSLVEMKALLFELVRAFEFE
 
Score = 169 bits (428), Expect = 9e-42
Identities = 89/197 (45%), Positives = 129/197 (65%), Gaps = 9/197 (4%)
Frame = +3
 
Query 1 DQDLLNNINTFMFAGTDTTSLALTWTLYVLALYPHVQDRLRAELLSVQS---LYAEIAAL 57
D+D+L + TF+ AG +TTS A W LY L P VQ +LR EL ++Q+ E+++L
Sbjct 321 DEDVLAQVPTFLVAGHETTSTATMWCLYALTQAPDVQKKLRDELFTLQTEAPTMDELSSL 500
 
Query 58 PFLENVIRETLRLIPPVHSSIREATRDDVVPVSAPLKRTTPNGRVVEEQVHQIVVPKGTF 117
P+L+ V+RETLR+ PV +++R AT+DDV+PVS P G+V ++ +H + KG+
Sbjct 501 PYLDAVVRETLRIHAPVPTTMRVATKDDVIPVSEPF--VDRRGKV-QDSIH---ISKGSP 662
 
Query 118 IHVPIEGFNLDKGLWGETAWKFDPDRWDNLPETIKELPGLYQHTLTFSAGPRACIGMRMS 177
I +P+ N LWG A +F P+RW N PETI +PG++ H L+F AGPRACIG R S
Sbjct 663 IIIPVLSLNRSTELWGADALEFRPERWINPPETISSIPGVWGHILSFLAGPRACIGYRFS 842
 
Query 178 VIELKSFLFTLVTNFKF 194
++E+K+ LF LV F+F
Sbjct 843 LVEMKALLFELVRAFEF 893
 
>pc.8.80.1 [whiterot1:77774]
MFAGTDTTSLALTWTLYVLALYPHVQDRLRAELLSVAPIAPIDTLTPEEVQSLYAEIA
ALPFLENVIRET
LRLIPPVHSSIREATRDDVV
PVSAPLKRTTPNGRVVEEQVHQIVVPKGTFIHVPIEGFNLDKGL
WGETAW
KFDPDRWDNLPETIKELPGLYQHT
LTFSAGPRVRSPSVCRLSQG*
 
>CYP5152A1 pc.24.121.1 65% TO pc.24.126.1, 42% TO CYP530A1
N. crassa NOTE: 5144C1 IS AT 2402616-2402515 REGION SCAF_1
5144A6 IS 2479233-2479132 REGION (-) STRAND SCAF_1
THIS SEQ IS AT 2699101-2698997 (-) STRAND SCAF_1
pc.24.126.1 IS AT 2713991-2713887 REGION (-) STRAND SCAF_1
same as model fgenesh1_pg.C_scaffold_1000845 [Phchr1:845]
N-term is in a seq gap possible frameshift after FSFK
FKFAQLTEMYGPVFSFKQGTRVVCVVGRHQ (0)
AAVEIMQKH GADLADRPRSIAAGELLSGGKRTLLVGAGDRLRKLRK (2)
ALHSHLQPSVAVQYRPMQLKHALNVILDILRDPEHHIDHARR (2)
YAASVVMT
MTYGKTEPTYYTDPEVQEILLHGTRLGSVIPLDYHKVDRFPILKHVPFVTSTLRQWHKEELALFS
DLVDGARARL (0)
RDGAPPSFATYLIDQQQQFGLSDDEIAYLAGSMFGAGSDTSATAIAFVIMAAATHPKAQAEVQAQ
LDSVVGRDR (1)
VPSFDDESLLPLVTAFYLEAYRWRPVSYG (1)
GFAHKATADIRW (0)
GEYVIPADAIVIGNHWSIARDPDVFPEPEEFRPSRWLDESGKLREDLSSF
NFGFGRR (2)
VCVGQHVANN  (2)
SLFINTALILWAFSVGEDPAQPIDTMAFTDTANVRVHPFKAVYEPRIPRLREVVETYLD*
 
>CYP5152A2 pc.24.126.1 35% to CYP5065A2
yellow IS AT 2713991-2713887 REGION (-) STRAND SCAF_1
possible GC boundary at PISWG
Same as model fgenesh1_pg.C_scaffold_1000848 [Phchr1:848]
MATGLLADVLARVQVPALALLFALLLLRAALRIVQRQRVPLPPGPPG
RWFEPGPKAPLRYAELAKTYGPVFAFRRGGQLVCIINSYK (0)
DAVEIMQKRGADLADRPDFIAAGDFLSGGMRTLLVGAGERVRRLRRC
RALHSQLQPTAAVQHKPVQFRAALDLVLDVLHDPADHLNHTKR (2)
FAASLILTMTYGKTTPTRYSDAEVREINVHTTRLGTVVPAGLHAVDRHPVLR
HVPPATATLRRWHREELALFTRMVDGVRKDV (0)
HVARPSFTTYLLEHQEEYGLSDDELAYLAGSMFGAGSDS (0)
TATAISFVMMAAATHPQAQAQVQAQLDSVVGRDR (1)
VPTFDDEKLLPLVVAFYLETFRWRPISWG (1)
GFAHRATSDIVWNDYVIPAGATVFGNHWAIGHDETVFSDPDVFRPSRW
LDEAGKLRDDISPFTYGFGRR (2)
VCVGQHVANN  (2)
SLFINTALLLWAFNIREDPKVSIDTMGFTDSGTVRVLPFHV
QFHPRIEHLREIVESSMPEDVYSAA*
 
>CYP5153A1 gx.27.66.1 = pc.27.122
         fgenesh1_pg.C_scaffold_1000958
         Phchr1/scaffold_1:3068242-3069378
32% TO CYP5116B1  Aspergillus nidulans
The unbroken reading frame starting with FNMYQ is preceded
by a conserved non-p450 seq upstream that probably ends
460 bp before FNMYQ.  The true P450 N-terminal must reside
in between or this is a pseudogene missing the true N-
terminal.  The two short exons shown below are my best
guess for the N-terminal, but they do not resemble other
P450s, and the total length is only 443 aa, a little
short, so the pseudogene alternative may be correct.
MVHDSGGAIFDTAGPALVR (1)
RTITVYPSDSLVQRGIQTNLELILKHIYTCEVSPIHPHLAPLHQ (0)
FNMYQTIVWYFTAVLFLLGAVVARARKRPVTSSLERIQLTDLRDIRELLAPISTSLPIL
LEQRSVPNARLVRAFGITNTFVSSSVDVHATFSREARALIAGNDWDRFAQCAQLAVDK
CIEERTDAGTVMRYDTLMQNAALLSILMGLFEVALDDVPIADLGVVARGINDLWKLSK
TADTLPPHMLPEINTRLRAWLPAHQNPVDLIIPAFETMWRVVATTVAYTHADPLAHAV
LETFLADPTSTAFASARSGTPSVDALVTEAIRLHPPTSRISRHVVSNSTKGAVLVADV
GALHRDPTIWGADAEVFNPLRHQQRTPTQEKALLGFGAGRVMCVASRWAPHAAGIIVA
SISERIGREIQVREGKAKGGRDWDGWFVECI*
 
 
>AACS01000290.1 Coprinopsis cinerea strain okayama7#130, whole genome shotgun
 
Score = 57.8 bits (138), Expect = 5e-06
Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Frame = +1
 
Query 103 TDLRDIRELLAPISTSLPILLEQRSVPNARLVRAFGITNTFVSSSVDVHATFSREARALI 162
+ L IR+L +P + LL R+ PN RLVRAFGITNTFVS VH +F AR+L+
Sbjct 167305 SSLGGIRQLFSPDGADVGTLLADRARPNQRLVRAFGITNTFVSPHPSVHRSFVTAARSLL 167484
 
Query 163 A---GNDWDRFAQCAQLAV 178
+ W F + A+
Sbjct 167485 SRANKRGWGTFRDISTQAI 167541
 
>CYP5159A1 Coprinopsis cinerea strain okayama7#130,
ACCESSION AACS01000290 REGION: 167200..168432
Yellow region not in pc model. look for it
39% TO CYP5153A1
MLDWQSTSIVIAAFLLASLLCLIAISLNQESAVGHSSLGGIRQL
FSPDGADVGTLLADRARPNQRLVRAFGITNTFVSPHPSVHRSFVTAARSLLSRANKRG
WGTFRDISTQAIHVELSLARSGSVNYGIFIQAVTLRVILVGLLGANVPMEDFSPDDIY
TAASHINKLWSLSKDPSPIPSHLLPELNDALRRLLPDITTFPNPVDIVIPAWETFWRV
VATTVAYSHNSKAITQLFLDFYAYPTDNAFREANADANISPKNVVEESMRLQPPSKHI
ARKTIRPSLSKLPKPIANLLVRFLPRISWVKHYADVQAVLRSPAIWGSNSLEFNPWRH
NQDPSSTLPSRAEALGYIFGGGNLRCIGSSWAPVAAAVVASAVFDAVDRGVCSIVPGR
AVGGRNGWEDWSVTDTKN
 
 
>CYP5154A1 PFF_258a (gx. 18.4.1) 63% TO PFF_258b
(gx. 18.4.1), C-term 38% to 5141A4
Scaf_6 ver2 (-) strand 503798-505808
505808 MQDLGYLPGLRSLVTPMSPLGFALPPSQYNPGRDWQW 505728
505655 VYRDAGTETISAIPYVFGPP (1) 505596
LAKQVVSTKGQ (2)
505458 ILRLLCSLWGPNIFAANGEEWRKHRRIINPAFSNAT (2) 505363
FASVWEQTSRVFDEMEVGEGWAGKHTVNLPVVNGLTNK (0)
505139 LALILISTCGFGNPLKWQFTNSASGGMSFEKALSIVSSNHI
ALLLIPDWMYRLPNK (2) 504972
IRELKTAVDRMNLFMRELIEKRRAEMAQKAPERTDILSAMIK (2)
504735 LSWLSRFDNDCDKVGNTFLLLSAGH (1) 504661
504611 DTVAHTLDAAFALLALHQDFQEEVYQELLEVMPTEADFV 504494
504443 TYENSARLVKTRACFLEASRMFRI (2) 504372
SGFMLIRDTAEDVVLQNVGPNNDDVLPLKRGTRVVVDMIGL (1)
504143 HHNPRIFPDPEAFKPERWYNAHENDMSMFSFGARA (1) 504042
503989 CIGRKFAVAEGICFLAKLLRRWRVEPLAKEGETKEQWKQRVV
RGVVVLNLGIGEVPVRLVRRNC* 503798
 
>CYP5154A2P PFF_258b (gx. 18.4.1)
This one is looking like a pseudogene, no EXXR motif,
No N-term Met
Scaf_6 ver2 (+) strand 509278-511252
62% to PFF_258a in overlapping regions
509278 YLPGIRSLVSPMSALGASIPTSRWNPGRHWQW 509373
509448 VYRDAGTETIAAVPYLFGPPII 509513
509662 GPNIFAANGGEWKKHRRVINPAFSQET 509742
509963 LALILISTCGFSNPLSWQ 510016
510040 MPFADALWIVSTRIIARLLLPRWVYWLP 510123
510371 DYNMHQLGDTFLLLTAGHGT 510430
510478 MLALHPDFQEECYREILKVMHTNDDFV 510558
510606 TFGNSTHLIKTRSCFLEASSLYRM  510686
510727 AAGEILVQDVAEDTILRGAAPDGGDPPVPRETPIVVDMLGLR 510852
510906 DHNPKLYSDPEKFLPERWYNTHENDRTMFSIGAQA (1) 511010
511067 CLGRHFALVEGTCFLARLLRIWRVVPLLRPG
ETVEQWRAKIGVEALFNFGIGNVPLKFVRR* 511252
 
 
>CYP5155A1 fgenesh1_pg.C_scaffold_7000377 [Phchr1:4564]
Same gene as pc.81.7.1
whole seq is 40% to CYP5150A4
REMOVED POSSIBLE SHORT INTRON SEQ VKAAKNKYLEDGRSTASTWE
 
MAVGLPEVVLAAIFAYLLYRETWGKKTALSDVAGPPRESWMK (1)
GNTQRLFRDALDYNLWLSRTYGTAVKMYSLY (1)
ALYLSDPLALHHVFVKDQNSFDVSDAFIH (2)
NLLMFGEGLTGTL (1)
GEQHKKQRKMLNPVFSVSNLRELLPVIQPIANKMASVFVEQIPAD (1)
AREIDVMPWLSRGAQEYMSQACFGWTFNALDLNKRNTYSEAARKYT (2)
PAALRVSWLRPYLPFIVRTIPLTLRTKMLDWYPGSDMKDFLYILDVM
HQTSKRIFEQKKKALDSTVLEKADAETSERSEGDLGP (0)
GKDIMSILLK
ANASSNEADRMTDSEMIGQMSTLLFAGFETTTYAISRILWVLASHPDAQARIRSE(0)
DVSLSYDDLMALPYLDAVIKETLRVYPPSSVHFRVALQNTTLPLQYPVKSVNDTPITTIPVEKGT
QILVSIIASNHNTNVWGPDASEWKPERWLNSDDKAVPKATTDSAKYPGVYSGMMTFLG
GPRGCIGFKFSE
MEAKQVLATLLPRLHFALPSAVDEQGRRKEVYWMMSGPQIPVVRPPFGDGMTAQVPLD
VRLVREEDFAYGFEDEDLIKL*
 
>CYP5156A1 gx.17.28.1 34% TO CYP602B1   Fusarium graminearum
scaffold_7 (2051558 bp) : 939436:940306 region
SAME AS
fgenesh1_pg.C_scaffold_7000309 [Phchr1:4496]
N-TERM in model SEEMS TOO LONG. Remove it.
29% o 609A1
MYPLGLLTIVQHFETRDAALKLAIALPSIVLLALLFSWVSARKDQD
EGPPFLPLSIWETVWPFFTSRHDFLRRGFELTRHPAFRFKLLQ (0)
HTVVVVSQEHARADFFACRGLDIHEGFKVLSGA (0)
IPMLPGVTSDLQTRRINLIHRRLAAAQGGDHLQR (1)
LVPFIVQDIRQGFCSWGSTESLIDPFTRIPA (0)
LLFQTTVRCLGSHELADDGATVARLLSLYDTLDRSTTPLSVLLPWLP
SPSMLAKLRASKQVYDIVDGAIRARVASGVSRDDTLQILLDHGDEKMVIVG (0)
FIMGLLVAGARSTGTT (1)
ASWIVTFLAGHPEWRRKVREEIHALLSLYAATTAHCANDPAELLATVPLEAFEQCMP
ATDAVIRETLRIAQPHTAMRRNVGPDTYIAGTRIPSGAYVVYPFSDIHLDPRLYPD
PWRFDPSRPESKSNIGYVGWGG (1)
GRTVCLGQRIAKLQIKLVLSMFLLHYDFDLVDQDEHPLGNVPR
PNWNDHLTCKPPSGSCLVRLAKQ*
 
>CYP5156A2 EB008493.1, EB007318.1 Gloeophyllum trabeum ESTs 61% TO CYP5156A1
LADIPLSAWEGSTPVLDAVIRETLRLAQPHTAMRRNVGPDLVIDGKTVPSGAYVVY
PFSDVHLDADIYPDPWRFDPGRPADAKRAHAWVGWGGGKTVCLGQRLAKLEMKIIA
AMFLVGFDYAVVDKAGKPADPLPKPNWNDALMCRPEAGSCYVK
YERAASSTSSSSSSSPSSPSSPL*
 
 
>CYP5156B1 EB016637.1 Lentinula edodes
50% to CYP5156A1 N-TERM ONLY
MPYSVGLQAAGAALLSAGLPVLFTTAIIVLLLIISINSSLQKDVADAPARLPI
YSFFTIIPFFRRRFDFLNWGFQATGQSTFQFDLLRNKVIVVSGESARQAFFTAKGLDLTE
GFKILSGAIPMVRGVTSDLQTKRISLIHKRLAAVQKNEQLSMLIRPMLEDSRRLMESWGN
SGCFDPFDNIYELVFQLTVRSLSCTEISDDPCLVSRLKKLYDTLDVGTTPATVLLPWLPT
PAMVKKLWATKEIYDIVVAAITAR
 
>CYP5157A1 genewise.21.71.1
scaffold_9 (1898532 bp) : 1276291:1276784 (494 bp)
fgenesh1_pg.C_scaffold_9000353 [Phchr1:5749]
Phchr1/scaffold_9:1276084-1278363
30% to 608A1 Magnaporthe grisea over 412 aa
MSLLPPLTNLASGHISFFLGIPRGELFLLFLPFVVLIAFVLFKRIIQATKRKHASTRPL
CVLVSTDRPIDQLPPYHYRPTSKRFATMDLLSALRGREREYTKYVFADDKSLSFEEGAA
TILNLRFLLKIRGGRFYKDVDKLITSGIIPRIEAITNKIYPIFMRHARRLVEDGQKNNG
CVDFFAHTNHSIAESMLTVVMGEVGIYENLSWFSRTFPRLSKFSRLRDGLSDNGCPRLR
LLFGTLIYRYFFVLGPYVWRELRNNKFEPLARSEKEHDANESVLRYLGRMFAREDGTVS
AVDTCWCMCLMLSLIFASVHQTAVVAVWVMYELASRPTYIPAIREELLAVAELQADGSH
YLSYDSLRNARLLDSFIREVMRLKGDTLGVCRQTVQDTPMGQYVIPKGHLVIPMASLSH
RSREYHGQDAEVFDGFRWVERNLPAVMVGPTYFPFGMNRWACPGRVLAVSEMKMIALTI
LALADPTLEGGKYTVVDPLNTTSVQPAGKLYLTPLARPLI*
 
>CYP5158A1 39% to 5037B2, 36% to 5037B3, 36% to 5144A5
scaffold_2 2356566-2358785
2356566 MLTSQVSIAKLTTLPTSYYA
2356626 LLATIALLALLFARRTQQPTPPGPRGLPIIGNVAELSGGFEWIRFGTTLRKQF (1) 2356784
GDVLGFKVLNNRILVLNTAKAAKEFMDKRASKYSSRPVLTVIGELMGLDQ (0)
AMPLIPYGAEWRACRKLEHVALNQSAVKQYRPVIEHHAAQLALDILQEPDKFLTHTRL (2)
IILAVTYGLSARVTATE (0)
YISLAEEVMRIVTIYLRPFAHLCDVMPV (1)
LKHLPSWIPFRREAEYGRRLFESFVSTPYERTKQAF (0)
AKGDAEPSLIRDILASMPQDSLTPEVEHRVKWTAGFALLA (1)
SGGES (0)
2357970 TFGTIGVFMMAMALHPDKQARAQEEVDSAVGTDRLPTMDDKARLPYVYAVVQEA 2358131
2358132 MRWHPMLPL (1) 2358158
2358208 SLPRRAEVDDEYDGYYIAKDTTVCANLW (2) 2358291
2358346 AMGMEPNVKYPPEQFIPERFLDAEHPTPNPNTWAFGFGRR (2) 2358465
ICPGKALAEESLFVLMSTLLAMFEICAPPEGIKPEFESRVVR (2)
2358702 LPKPFKCIFRLRSPEKADMLRAVVATQ* 2358785
 
>CYP5158A2P gx.89.1.1 pseudogene
92% to CYP5158A1
scaffold_2 2396927-2398113
2396927 MLTSQVLIAKLTALPTSCYA
2396987 LLATAALLALLSARRTQQPTPPGPCELPIIGNVAELSGGFEWTRFGTTLRKQF (1) 2397145
GDVLGF
large deletion here, missing 6 exons
SGGES (0)
2397300 TFGTIGVFMMAMALHPDKQARAQEEIDSAVGTDRLPTMDDKARLPYVYAVVQEA 2397461
2397462 MRWHPMLPL 2397488
2397541 LPRRAEVDDEYDGYHIAKDTTVCANLWY 2397624
2397685 MEPNVKYPPEQFIPERFLDAEHPTPNPNTWAFGFGRR 2397795
2397854 ICPGKALAEESLLVLMSTLLAMFEICAPPEGIKPEFESRVVR 2397979
2398033 LPKSFKCIFRLRSPEKADPPRAIAAAQ 2398113