Three new P450 sequences have been added: CYP31A5P, CYP33C12P

and CYP32B1. 

 

D. Nelson March 23, 2004

 

>CYP13A1 T10B9.8   Z48717 498 aa

MGYFWFPWFSAIFVAVFSYYIWQWTFWRRRGVVGPMGFPVLGVFLNSLD

NNFPFPLQCREWTKKFGKIYGFTEGTLKTLVISDPELVHEVFVTQYDNFYGRKR

NPIQGDSEKEKRTNLFAAQGFRWKRLRAISSPTFSNSSLRKLYQTVEDSALELL

RHIEKQSAGGKQIDMLKFYQEFTLDVIGRIAMGQTDSQMFKNPIMPIVSK (0)

LFQGNFAKLFLIGGIFPTFLVEIIRQILLKNLKVGSFRKINEITL

DAIHNRIKQREEDQKNGIEIGEPA

DFIDLFLDAKAEDVEHFGENNGDFSKSTTY (0)

TNRQLTTEEIVGQCTVFLIAGFDTTALSLSYAT

YLLATHPEIQKKLQEEVNRECPNPEVTIDQLSKLKYMECVFKEALRLYPLGAF

ANSRRCMRNTKLGNMKVEVGTMIQVDTWTLHTDPNIWGDDAEDFKPERWQ

TPNSDQIYQKSGYIPFGLGPRQCIGMRLAYMEEKILLVHILRKFTFETGAKTEI

PLKLIGRATTQPESVWMHLNPRN*

 

>CYP13A2 T10B9.7    Z48717 515 aa revised 3/23/04

MSLGFVLAVTFSIFLGILTYYLWIWTYWMRKGVKGPRGRPFVGVLDVLLE

HETPGLIKLGEWTKKYGKVYGYTDGTQRTLVVADPAMVHEIFVKQFDNFYGR

KLNPIQGNPEKEQRVHLLAAQGYRWKRLRTISSQSFSNASLKKMKRTVEDSA

LELLRHIEKQTAGGEQIDMLRFYQEYTMDVIGRFAMGQTDSMMFKNPIVNV

VREIFCGSRKNLMLICQVFPPIGQFIRDLTFKFPRIPAFKLYSIM

QDVVAARIAQREREKGAESGE

PQDFIDLFLDARSDDVDFSAEAREDFSKRNLKITKELSADEVVGQCFLFLIGGFDTTALSLSY

VTYLLAVNPKIQEKVIEEIAREFGTSEVEFEKLGRLKYMDCVIKEALRLYPLA

SISNSRKCMKTTTVNGVKIEAGVYVQMDTWSLHYDPELWGEDVKEFKPERW

STDEPLEHKGAYLPFGLGPRQCIGMRLAIMEQKILLTHLLKNYTFETGNKTRIP

LKLVGSATTSPEDVFVHLRPRIW*

 

>CYP13A3 T10B9.5   Z48717 520 aa revised 3/23/04

MSLSILIAIALFIGVFTYYLWIWSFWMRKGIKGPRGLPFFGIINAFQSYEKP

WILRLGDWTKEYGPMYGFTDGVEKTLVVSDPEFVHEVFVKQFDNFYARKQN

PLQGDPDKDPRIHLVTSQGHRWKRLRTLASPTFSNKSLRKIFSTVEESVAEM

MRHLEKGTAGGKTIDILEYYQEFTMDIIGKIAMGQSGSMMFENPWLDKIRAI

FNTRGNIIFIICGIVPFTGSIFRWFFSKVPTAQTVTSLMHTLEIAL

TKRVEQRAADEKAGIESSGEPQD

FIDLFLDVQADTDFLEDETKNGFARSQIVKVDKHLTFDEIIGQLFVFLLAGYDTTALSLSYSSY

LLARHPEIQKKLQEEVDRECPDPEVTFDQLSKLKYMECVIKETLRLYPLASIVH

NRKCMKSTTVLGMKIEEGTNVQADTWTLHYDPKFWGENANEFKPERWESGD

EQAVAKGAYLPFGLGPRICIGMRLAYMEEKMLLAQILKKYSLETTFETHIPLK

LVGIATTAPTNVHLKLKPRHSD*

 

>CYP13A4 T10B9.1   Z48717 520aa

MSLSLLIAGALFIGFLTYYIWIWSFWIRKGVKGPRGFPFFGVILKFHDYENPGLLKL

GEWTKKYGSIYGITEGVEKTLVVSNPEFVHEVFVKQFDNFYGRKTNPIQGDPNKN

KRAHLVLAQGHRWKRLRTLASPTFSNKSLRKIMSTVEETVVELMRHLDEASAKG

KAVDLLDYYQEFTLDIIGRIAMGQTESLMFRNPMLPKVKEIFKKGGKMPFLIAGVF

PIAGTLMRQLFMKFPKFSPAFGIMNTMEKALNKRLEQRAADKKAGIEPSGEPQDFI

DLFLDARANVDFIEEESTLGFAKSEVLKVDKHLTFDEIIGQLFVFLLAGYDTTALSLS

YSSYLLATHPEIQKKLQEEVDRECPDPEVTFDQISKLKYMECVVKEALRMYPLASL

VHNRKCMKKTNVLGVEIDEGTNVQVDTWTLHYDPKVWGDDASEFKPERWETGD

ELFYAKGGYLPFGMGPRICIGMRLAMMEEKLLLTHILKKYTFDTSTETEIPLKLVG

SATIAPRNVMLKLTPRHSN

 

>CYP13A5 T10B9.2   Z48717 520 aa aa revised 3/23/04

MSLSILIAGASFIGLLTYYIWIWSFWIRKGVKGPRGFPFFGVIHEFQDYENPG

LLKLGEWTKEYGPIYGITEGVEKTLIVSNPEFVHEVFVKQFDNFYGRKTNPIQG

DPNKNKRAHLVSAQGHRWKRLRTLSSPTFSNKNLRKIMSTVEETVVELMRH

LDDASAKGKAVDLLDYYQEFTLDIIGRIAMGQTESLMFRNPMLPKVKGIFKD

GRKLPFLVSGIFPIAGTMFREFFMRFPSIQPAFDIMSTVEK

ALNKRLEQRAADEKAGIEPSGEPQDFI

DLFLDARANVDFFEEESALGFAKTEIAKVDKQ

LTFDEIIGQLFVFLLAGYDTTALSLSYSSYLL

ARHPEIQKKLQEEVDRECPNPEVTFDQISKLKYMECVVKEALRMYPLASIVH

NRKCMKETNVLGVQIEKGTNVQVDTWTLHYDPKVWGEDANEFRPERWESG

DELFYAKGGYLPFGMGPRICIGMRLAMMEKKMLLTHILKKYTFETSTQTEIPL

KLVGSATTAPRSVMLKLTPRHSN*

 

>CYP13A6 T10B9.3    Z48717 518 aa revised 3/23/04

MIFVLLSAVLLGVFTYSVWIWSYFIRKGIKGPRGFPGIGMLIQTIDHENPPFL

KYRDWTKQYGPVYGFTEGPQQTMIISEPEMVNEIFKKQFDNFYGRKLRPIIGDP

EKDKRVNIFSTQGKRWKRLRTLSSPSFSNNSLRKVRNSVQECGTEILWNIEQK

VRKNEDIDMLIVYQEYTLGVISRIALGQSESNMFKNPLLPKVQAIFNGSWHV

FLITGIFPPLAGVFRKMSKMLPASFIPAFKIFDLIEV

AVQARIDQRAKDEIKGVEPGEPQDFIDLFL

DARVPDVKILSGEANEDFAKSSVVKINKELTFDEIIAQCFVFLAAGFDTTALSLSYATYLLATH

PEIQTKLQEEVDRECPDPEIFFDHLSKLKYLECVMKETLRLYPLGTTANTRKCM

RETTINGVNFDEGMNIQVDTWTLHHNPRIWGEDVEDFKPERWENGACEHLE

HNGSYIPFGSGPRQCIGMRLAQMEQKILLAQILKEYSFRTTKNTQIPVKLVGK

LTLSPESVIVKLEPRDS*

 

>CYP13A7 T10B9.10    Z48717 518 aa

MSFSILIAIAIFVGIISYYLWIWSFWIRKGVKGPRGLPFLGVIHKFTNYENPG

ALKFSEWTKKYGPVYGITEGVEKTLVISDPEFVHEVFVKQFDNFYGRKLTAIQ

GDPNKNKRVPLVAAQGHRWKRLRTLASPTFSNKSLRKIMGTVEESVTELVR

SLEKASAEGKTLDMLEYYQEFTMDIIGKMAMGQEKSLMFRNPMLDKVKTIFK

EGRNNVFMISGIFPFVGIALRNIFAKFPSLQMATDIQSILEK

ALNKRLEQREADEKAGIEPSGEPQDF

IDLFLDARSTVDFFEGEAEQDFAKSEVLKVDKH

LTFDEIIGQLFVFLLAGYDTTALSLSYSSYLL

ATHPEIQKKLQEEVDRECPDPEVTFDQLSKLKYLECVVKEALRLYPLASLVHN

RKCLKTTNVLGMEIEAGTNINVDTWSLHHDPKVWGDDVNEFKPERWESGDE

LFFAKGGYLPFGMGPRICIGMRLAMMEMKMLLTNILKNYTFETTPETVIPLK

LVGTATIAPSSVLLKLKSRF*

 

>CYP13A8 T10B9.4  Z48717 (C-terminal) and Y53C12 whole 13A subfamily 510 aa

MIFELILISIVTYYFWHWTFWKRRGLPGPWGVPIFGKAGAMLEDSFPPGYT

LQKWTKEYGKIYGFTEGMQKVMVISDPDLVQEILVKQYDNFYGRKHNPVQGD

PDKDKDIHIVGAQGFRWKRLRTITAPAFSNGSIKKVLTTMEDSTQELMKKLR

EESENGKAVNMhlFYQEYTFDVISRVAMGQPDSQMFKNPLLKDVKG

FFEHNRWQIWMFSGGFPFAVSFLKWLFIKVGKFGAGPFIVVQKSVTDAVMSRIA

QREADKKHGVEPGEAADYIDMFLNARAEVEHFGESNDEFHKSSSY (0)

NNRQLTTQEIISQCFV

FLVAGFDTTAISLSYVTYFLALNPKIQSKLQDEVDKECPNDEITFDQLSKLKYM

DNVIKESLRLFPFASFANSRRCMRNTVIGEQIVEAGVDVMIDTWTLHHDKN

VWGNDVEEFKPERWDSPLTPQQAYLSFGAGPRVCLGMRFALLEQKGLLSHI

LKKYTFETNAKTQLPIKLVGRATARPENLFLSLKPRV*

 

>CYP13A9P T10B9.6    Z48717 488 aa X = missing amino acid

frameshift at FIDLF

MILTVLFFGIL TYYLWIWHYWIRKGIRGPRGLPFFGVIHKFQNYDYPGYLKL

RDWTKKNGSVYGIXXXXXXTLIISNPEFVDEVFVKQFDNFYGRKQNPLQGDPN

KNPRVTLFSAEGHRWKRLXXLSSPTFSNKSLRIIMGTVEESVVEVMKYLEKE

VADGQHVNMLEYYQEYTLDVIGKIAMGQTESLLFKNPLMSGVKG (0)

VFQNKRKYGSLVAGVFPIVGVLFRTLSMLEIVVCERLKQRKDDENNGFVKVGQPQDFIDLF

LDSRADLDFFEQEQTLEFTKKEIAKIEKQ

LTFDEIIGQLFVFFLAGYDTTALSLSYSSYLLATH

PEIQKKLQEEVDRECPDPEITFDQISKLRYMECVIKESLRMYPLATAAHNRKC

MKSINIRGVDIEKGTNVQVDTWSLHYDPKVWGDDANEFKPERWETGDELFY

AKGGYIPFGIGPRICIGMRLAMMEKKMLLVPVLKQYTFKTCSETEVPLKLVG

LSTIAPTSVILTLKPKMSRDQ*

 

>CYP13A10 ZK1320.4    Z46934 505 aa

MSVILLAIPTLFIGFISYYLWIWTYWRRRGIPGPLGYPLVG

SFPKTLKSEYPQYLQIRDWTKLYGPIYGYTEGTIKTLIVSDIDIVRQIFVEQYD

NFYGRKLNPIQGDPEKDERTNLFSAQGFRWKRLRAISSPTFSNNSLRKINVTV

EDSAMELLRHIEEQTSEGQQIDMLQFYQEFTMDTIGRIAMGQTDSQMFKNPLL

KFVRAIFGDNRKHIPLIGGVFPTLAQVFRFFMLKFPLLGAANFIHVNKTV

VTAVQNRIDQRENDRKN

GIEIGEPQDFIDLFLEARADDVEHFQENNGDFSKTSSY (0)

GNRQLTTQEIVGQCLVFLIAGFDTT

ALSLSYTTFLLATHPEVQKKLQEEIERECIEPSISFDHLSKLKYMDCIIKETLRL

YPLGTMANSRRCMRATKLGNVEVEVGTMVQVDTWSLHTDTKIWGDDAKEF

KPERWLDPNCDQVFQKGGYISFGLGPRQCVGMRLAYMEEKMLLAHILRKYTF

EVGTKTEIPLKLVGRATTQPETVWMHLKQRI*

 

>CYP13A11 F14F7.2  Z81503 517 aa also Y39E4 Z94158 102000-104000 region

MIILLFVVSSVVGIVSYYFWTWTYWRRCGIPGPQGYPFLGSALDMMDHENP

PFLQLKKWTSQYGKVYGITEGLLRTLVISDTNLIHEVFVKQYDNFYGRNLNPI

QGDPNREKRVTLFSAQGHRWKRLRTIANPTFSSNNLRKIQVTVEDSALELLR

HIEQHTAGGKAIDVLEYFQEFTMDVIGRIAMGQPDSLMFKNPLLPCARDVFG

KPRTALFLSGILFPWIGPMIRKVVFYFTNVFNNPAAQIMYQTANAVEQRIK

QRMAEEKAGIDPGEPQDFIDLFLDAKSDDMELENNEDFTKAGVKVTRQLTKDE

VVGQCFVFLIAGFDTTALSLSYSSFLLATHPKAQKKLQEEIDRECADPEVTFDQ

LSKLKYMECVIKETLRMYPLGALANSRRCMRSTKIGNYEIEKGVDILCDTWT

LHYDKSIWGEDAEEFKPERWESGDEHFYQKGGYIPFGLGPRQCIGMRLAYMEE

KLLLSHILRKYTLEVCNKTQIPLKLIGSRTTQPESVWLNLTPRDDN*

 

>CYP13A12 F14F7.3 Z81503 518 aa also Y39E4 Z94158 105000-108000 region

MAIIFLAILTSIIGVLSFYLWTWSYWKRRGIAGPSGYPILGSALEMLSSENP

PYLQLKEWTKQYGKVYGITEGLSRTLVISDPDLVQEVFVKQYDNFFGRKLNPI

QGDPNKDKRVNLFSSQGHRWKRLRTISSPTFSNNSLRKLKTTVEECAVELLRH

IEQHTDGGQPIDLLDFYQEFTLDVIGRIAMGQTDSQMFKNPLLPYVRAVFGEP

RKGLFLSGSLAPWIGPILRMVMFSLPNIVKNPAVHVIRHTSNAVEQRVKLR

MADEKAGIDPGEPQDFIDLFLDAKSDDVELENNEDFTKAGVKVTRQLTTEEIV

GQCFVFLIAGFDTTALSLSYSSFLLATHPKVQKKLQEEIDRECADPEVTFDQLS

KLKYMECVIKETLRMYPLGALANSRCCMRATKIGNYEIDEGTNILCDTWTLH

SDKSIWGEDAEEFKPERWESGDEHFYQKGGYIPFGLGPRQCIGMRLAYMEEKL

LLSHILRKYTLEVCNKTQIPLKLIGSRTTQPESVWLNLTPRDDN*

 

>CYP13B1 F02C12.5   Z54269  (C-terminal) C29F7 Z92827 N-terminal

510 aa revised 3/21/04

MGAIIVLVVLFATIAGYFKWIHTYWRRRGISGPEGLPFIGNYYDLADVNKPRGYLI

HKWTQkFGKVFGYYEGAVPVLVVSDMDMLQELFLKKFDNFYARKSTNHIHGNL

ECSKSEPRINLFTSRGARWKRLRALASPGFSVKALKQVHDVMEDSAINMVDL

MAKHEDGKPFNIHaYFQEFTYDVISRLAMGQPNSELFNNSGVEIVKsIFMRTHRV

LPWYFTVLFPQFEHLVKRMFYNHAAVQGGDIEKLLLICKKTVESRIQEReENAKLG

FENAENDFIDMFLNYYSEQVEDIEFGSTVEKKVTAEDVIGACFVFLLAGFDTTANSL

AYASYLLAKHPEKMKLAQEEVDTVVGSENVSYDDMTKLKYLDAVVRESLRLYPVA

WFACSRECVKPTTLGDIYIDKGVKIEADVMSLHRSKEIWGENADDFVPERWLEPSS

RHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYDLVAGVETEKELNILGCT

TTSPEAVTLYLKPRI*

 

>CYP13B2 K06G5   Z81565 511 aa

MGVIVCLVAIIGVVAGYFKWIHSYWKRRGIEGPVGLPFIGSFYDLADREKPRGFIIN

KWTKmFGKVFGYYEGVIPVLVVSDLDMLQEMFIKKFDCFYARKTTNLIHGNLECS

QEEPRVNLFAARGARWKRLRALASPAFSVKALKQIHETMEDSVFSMVDHMSKQV

NGEAFNIHEYYQEFTYDVISRLAMGQTYSEQFNNEGVDIVKKIFLRKNRVYPWYLA

VMFPGFENTIKNTFFNHEAVRGGDVGKLLKFCETAVHDRLKERADNVEQGIENP

HNDFIDMFLDYYTDTNIEDNAFGIKVEKKVTSEDVIGACFVFLLAGFDTTANTLAY

ASYLLAKHPQEMRKVQDEIDRICTSEHISYDDIGKLRYMDAVIREALRMYPVAWFA

CSRECVQATTLGNYYIEKGVRIEADVRALHYSEEIWGKNANEFVPERWLESSPRH

NMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRFSILAGPKTEKELHLQGCTTT

SPEKVTVHLMTRD*

 

>CYP14A1 K09A11.2  Z50742 491 aa

MSVFILAFVIFIIFYVFHFYWKVSKYPKGPLPLPFIGNIHQFPPDNVQKYFDVLSKT

YGPCFTIWIPFPAIVLTDYEHIKDAFVNQGDTFTYRAHRSPETLLPVHDHTGILASD

GDHWRLQRRTSLKILRDFGLGRNLMEEQVIRSVHEMLIQLENINDKKNVDVFWPI

QLCVGNVINETLFGFHYKYEDSEKFKTFVKIVDKHLRHLQGKMPLLVSAFPWLKH

VPIVGDIGYHNIKNNISSYHTFIEEEVATQVKKYDGESEPENFVHAYMQQMKQTGN

PGLDITNLCATVLDFWLAGMETTSNSLRWHLAFMMKYPEVQDKVRKEILDNVGT

ARLPSMSDKPNMPYTQAVIHEVQRCSNMIPFLGSHQCKEETEIHGNRVPAGSLVFA

QLWSVMRNDTVFEDPETFNPSRYLQSDGKTFDKAVLEKTIPFSIGKRNCVGEGLA

RMELFLIFSALIQKYEFVATSNIDLTPDWGVVLTAKPYTCNIIPQF*

 

>CYP14A2 K09A11.3    Z50742 491 aa

MSILIVAFlVSITVYIAYFYWKVSKYPKGPFPLPFIGNILQIPSENIQEYLDDLSKTYGP

CFTLWTPLPAIVLTDYEHVKEAFVTQGDAFIYRADRPPETLLQPHLNTGVLFSNGD

NWRFQRRTALKILRDFGLGRNLMEEQVMRSVHEMLAQLERIADKKNVDMFWPIQ

LCVGNVINESLFSYHYKYEDSKKFETFVKVLDKHLKTVQGKTIFLMSAFPWLKHF

PVIGELGYHRIKKNIQSYQEFINDEVTSQIKHYDGESEPENFVHAYMQQMKQSGNP

NLDMNNLCASViDFWLAGMETTSNSLRWHLAfMMKYPEIQDKVRKEIlDNVGTA

RLPSMSDKPNMPYTQAVIHEVQRCSNMIPILGTHTNRDDILLKGKKIPTGTLVFAQI

WSVLKNDPVFEESSKFNPDRYLMADGKTINKSVLERTIPFSVGKRNCVGEGLARM

ELFLIFSALIQKYEFIPKTNIDVKPVCGAVLTTKPYICELVPQAA*

 

>CYP14A3 K09A11.4    Z50742 498 aa

MSVFIIAFFTFITTYVAYFYWKVSKYPKGPLPLPFFGNLLQFPAENIHLYFDELSQTY

GPCFTLWTPLPAVVLTDYDHVKEAFVTQGDAFINRANRPPETLLQPHLNTGVLGSS

GDNWRLQRRTALKILRDFGLGRNLMEEQVMRSVQEMLAQLDHIPDKQNVDMYWP

IQLCVGNVINESLFGYHYKYEDSDKFETFVKVINKHLKIAQGKPQLLVSAFPWLRY

VPIIGELGYHKIQRNIQSYQKFIDEEVVSQIKQYDGESEPENFVHAYLQQMKQSGNP

NLDMNNLCASVLDFWLAGMETTSNSLRWHLAAMMKYPEIQDKVRKEIFDNVGTA

RLPSMSDKPNMPYTQAVIHEVQRFSNMIPILGTHTNKEDILLKGKNVPTGTVIFAQI

WSVLKNDSVFEDSHKFNPDRYLLTDGKTFDKTILERTIPFSVGKRNCVGEGLARM

ELFLIFTALIQKYEFIPNGSIDLSPVPGAVLTTKPYTCRLIPQSARKFQLF*

 

>CYP14A4 K09A11    Z50742   end of cosmid = R04D3.1  Z70212 486 aa

MSPLLILAFFVATVGYLVHFYLKVRKYPKGPFPLPVIGNLHQIPGGNLEIWFDELA

KIYGPCYTVWSPLPCVVLTDFDHIKNAFVTQGEFNNRAHKPPESLLHVHDNTGIL

NSSGENWQLQRRTSLKILRDFGMGKNLMEEQIVKSVQEMMTQLEKTDDKKKADI

FWPVQLCVGNVINQVLFGYHFKYDDCERFQKFVSVIDFHLRTLLGKCSLLVSAFP

MLRHVPIIGEYGYQRIKRNIKSYQYFIEEEVATQMKSFEEENEPENFVHAYMQQMK

QNGHPGLDVKNLRACALDFWLAGMETTSNSLRWHIAYMMKHPDIQDKVRKEILD

VVGNSRFPSMSDKPNMPYTQAVIHEVQRHSNMVPFLGTHqsNKDTNVLGQNIPA

GTNVLVQAWSVMRNDPIFINQLSFNPDRYLMADGKSFDKNILEKTIPFSVGKRSCIGEGL

ARMELFLIFSALIQKYEFIANGPVDMSYDFGAVLTIKPYTCQLKAQF*

 

>CYP14A5 F09F3.7 492 aa

MSVFIVALSVFIISYVISFYWKVRKYPKGPFPLPFFGNLLQFPADNIQEHLDK

LSKTYGPCFTVWTPLPAVVLTDYEHIKEAFVTQGDAFVNRAQRLPEILFQPH

PNTGVVFSSGDNWKIQRRTALKILRDFGLGRNLMEEQVMRSVHEMLAQLEH

ISDKKNVDMYWPIQLCVGNVINESLFGYHYKYEDAGRFEKFVKVVDRHLKIA

QGNASLLVSAFPWLRHLPVIGNLGYHSIKNNIKSYQQFIEEEVTSQLKNYDGE

SEPENFVHAYMQQMKQTGNPNLDMTNLCASVLDFWLAGMETASNSLRWH

LAFMMKYPEVQDKVRNEIFENIGTARLPSMSDKQNMPYTQAVIHEVQRCSN

MVPILATHMNTEDVLVKEHNIPTGTLLFAQIWSVLKNDPVFEENSKFNPDR

YLMPDGKTLNKTVLERTIPFSVGKRNCVGEGLARMELFLIFSALIQKYEFIPK

TNVDLKPVYGGVITVKPYLCELVPQNA*

 

>CYP22A1 T13C5.1 U39648 17939-21555 572 aa first 85aa may not be part of this gene

MHLENRVLSSVLDYASKFYKRMSSLVFFSANSIQVQMNISQSSW

IKCRDWMAFALSHHIIMGIYLLILRNFLPQVVPDFEWQHYFMRVFIVHIIYIIISYFI

RITRYPPGPPPMAVFGNSPFVNILTPEQTFLEYREIYGPIFTLHLSQPTIILAEYKTI

QEALVKNGQQTSGRSSAESFVLFTGDRLNGDGVILAMRQKWKDMRHEISRFMNKWYGA

PMDELVLHHTRCLEQELAKIAETKSLIDLRDPLAGAIANVIQQITIGRNYMYQDQEFQ

TQLRDINAVVKEIMTAEVFFVNCYPWLRYLPEGILRKWTNYKRSGFRLQQWFRTILEE

HHVNRHQGDFMSHMIDLQESKQEQFRDLSIILTCGDMWTGGMETTVTTLRWGIIYLLN

NPEVQAKCQMEILDVFGNDIPDMGKMNQTPYVRATLSEIQRLANVLPWAIPHKTIEEC

NIGGYDIPVNTEIIPALGAVLFDPNVFESPKQFKPERFLDEEGKYRVMEEFRPFGLGP

RVCLGERIARTELYLIFASLLQNFRFYLNRGDPIPVAERVIGGITAPPKPYATRVEYL

GNRLIN*

 

>CYP23A1 B0304.3    U39472 529 aa (revised 3/19/2003)

MPSLRNAILLTIATTFLYFRIMRTLNFDNYM (0)

ATYIYFLTCITLYAIYELNYKRRRL

PNGPVPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVS

DLPTIKKYFIQHADSFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHT

LRDFGVGKPLMEQMITLEVTSLMNHMEKSCGLDGKELHLCPSIAVCVGNIIN

NMLFGLRFNQDNSYMHRLHQLLDDQSHTVMQPIMGAYIAFPVTSKIPIINGE

WNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPEDLTYAYMIEVEKRKRN

GEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQKNVQAELDSIG

QPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFKKGDLII

PQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLARA

ELYLVFANLIQNFNFEVADDVTTE

RVLGLTVSPVEYSCKITRRGLDHNQNSVK

 

>CYP25A1 C36A4.1    Z66495 496 aa corrected 3/19/04

MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQTAERKEKLGYD

DANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVFIKNFSNFSDRSVPSI

YEANQLTASLLMNSYSSGWKHTRSAIAPIFSTGKMKAMQETINSKVDLFLDI

LREKASSGQKWDIYDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFY

VNAKKYISNIDIRHSKIIAASVLLPELSTFWKALYKYTPLADAEIPLVEGLSNV

YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTA

MTYCSYLLSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDCVYKETLRFY

PPHFSFIRRLCREDITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFE

NWEEKSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIVKLLDTFELKQFEGEAD

LIPDCNGVIMRPNDPVRLHLKPRN*

 

>CYP25A2 C36A4.2    Z66495 496 aa

MALLILTSILILLVSFIIYILFARREQFKLREKIGLTGPEPHWFMGNLKQIVDR

KEKLGYDDSNKWFNELHKQYGETFGIYFGAQMNIVLSNEEDIKEVFIKNFSNFSDRIVPP

IFDSNQLNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMQETIHSKVDLFLD

ILREKASSGQKWDIYEDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYVNARKFI

ANIDIRHSKVIAASFILPELAPLWRALYKYTPLADAEIPLVEGLSNVYERRR

GGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCS

YLLSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIH

FINRRCLADITIRGQFYPKGSVVTCLPHTVHLNPENWDSPEEFHPERFENWEE

KSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIVKLLDTFELKQFKGEADLIPDC

NGVIMRPKDPVRLLLKPRN*

 

>CYP25A3 C36A4.3    Z66495 496 aa

MAFLILTSILVSLVSFIIYVILARKERFRLRGKIGLSGPEPHWLMGNLKQIIER

KAKLGYDDSYDWYNKLHKQFGETFGIYFGTQLNINITNEEDIKEVFIKNFSNFSDRTPPPI

IEDNKLKESLLQNTYESGWKHTRSAIAPIFSTGKMKAMHETIHSKVDLFLEIL

KEKASSGQKWDIYDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYVNARKFITN

IDIRHSKIISTSFLFPELSKLWKVLYRFTDLAKAEIPLVEGLADVYERRRGGEG

SDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYLLS

KYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFSNR

RCLKDITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLK

WIPFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELKQFEGEADLIPDCNGVIMRPKD

PVRLHLKPRN*

 

>CYP25A4 C36A4.6    Z66495 495 aa revised 3/20/04

MAILIISTIFFTIITFISYSIWRRHAIFKLRSSIGIPGPPVHWLWGNLNIIKDR

VSRLGYNDTTQWHPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPD

IFGMNQLNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMHETLVSKIDIFLE

VLKEKSSSGQKWDIFENFQSLSLDIIGKCAFAIDSNCQRDRTDLFYVQARKFVG

AVDLKKSWILPVSLILPELSWLWRFLYKFSDLSAAELPLVKGLVDLYDRRR

AGEGGNDSTDLLNLLIRRETIGKMTQREVIENCFAFLIAGYETTSTAMMFSAY

LLAEYPIVQQKLYEEIKKTKENAGLNYDSIHNMKYLDCVYKESLRFYPPTTHF

TNRVCLNDMTIRGQIYPEDSTLKVQPYTIHRNPANWESPDEFQPERFENWEEK

SSSLKWIPFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELSLVPGDPPMIPET

NGVIFRPRSPVRLNLKLRI*

 

>CYP25A5 F42A6 495 aa

MAILIISTIFFTIITFISYIIWRRHSIFKLRSSIGIPGPP VNWL*GNLNIIKEWG

DTTQWHPTLHTKYGPIFG LYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPNIF

GMNqLNQSLLQNTYATGWKHTRSAVAPIFSTGKMKAMHETLVSKIDIFLEV

LKEKSSSGQKWDIFdNFQSLSLDIIGKCAFAIDSNCQRDRIDLFYVQARKFVGA

VALKKSWILPVSlILPELSWLWRFLYKFSDLSAAELPLVKGLVDLYDRRRA

GEGGNDSTDLLNLLIRRETIGKMTKREVIENCFAFLIAGYETTSTAMMFSAYL

LAEYPIVQQKLYEEIKKTKESAGLNYDSIHNMKYSDCVYKESLRFYPPTThFTN

RVCLNDMTIQGQIYPKDSTLKVQPHTIHRNPANWESPDEFQPERFENWEEKSS

SLKWIPFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELSLVPGDPPMIPETNG

VIFRPRCPVRLNLKLRI*

 

>CYP25A6P K06B9.1    PSEUDOGENE 250 aa

MAFLILTSILVSLVSFIIYVILARKERFRLRGKIGLSGPEPHWLMGNLKQIIER

KAKLGYDDSYDWYNKLHKQFGETFGIYFGTQLNINITNEEDIKEVFIKNFSNFS

DRTPPPIIEDNKLKESLLQNTYESGWKHTRSAIAPIFSTGKMKAMHETIHSKV

DLFLEILKEKASSGQKWDIYELVQKLLNDSDIYYFSDFQGLTLDVIGKCAFAID

SNCQRDRNDIFYVNARKFITNIDIRHRQNFKKGSQ*

 

>CYP29A1 C44C10.2    Z69787 502 aa also Y102F5 AL022276 61000-63000 region

MALILIIILICLLSFKPWSWKTIQLIXKYRQYDDKIPGPPTHPIFGNTETFKNKSQTEI

TEIFRQTADETRSQGKSVMKYHILGKLYVWPLDGKTIAKLVESTTELNKGDDYNFF

LPWLGGGVLVEGFGERWRTHRKLLTPTFHFAKLEGYLEVFYSETKIMIEHLEKFA

DNEETVDMFPYIKRCALDIICGAAIGTKINAQMHHNHPYVKAVEGFNSMAISHAIN

PSYQIPAIYWALGLKKQKDAHLNTMKTFTVNVIADRKAAIASGEVEKETSKRKMN

FLDILLNSEESNSLTSEDIRQEVDTFMFAGHDTTTTSVSWACWNLAHNPDIQEKVY

EEIVHIFGEEPNDEVTSEGISXLEYTERVLNESKRIIAPVPSLQRKLINDMEFGGMTIP

SGANVSIAPLALHSNAQVFPNPNKFDPDRFLPDEIAKRNAYDFMPFSAGLRNCIGE

KFALLNEKVMMIHILKNFRLEPMGGFYSTKPMFEAVARPSNGISVKLIRRQF*

 

>CYP29A2 T19B10.1    Z74043 503 aa

MTIFIPISIAIILAYLATWIPTLLKYKRHWQYGSKLPGPPAHPIFGNLGPIVGKKTED

LPSVFINWAAEQRDQGHSVMRVMILGKVYAWPLNGKAAAAIIDSTTETNKGDDYR

FFDPWLGGGLLLEGYGERWKSHRKMLTPAFHFAKLGGYFEVFNNESKILIDLLSD

FSASGETVDIFPYVKRCALDIISETAMGIKIDAQINHDHKYVQAVEGYNKIGVLVSF

NPHLKNQFIFWATGYKAQYDDYLSTLKSMTEKVIKERRAAHDSGEVEKETSKRM

MNFLDLMLSMEESNQLTSEDIRQEVDTFMFAGHDTTTSSTSWACWNLAHNPNVQE

KVYKEMIEVFGDDPNTDITLENVNNLNYLDIVLKESKRIIAPVPALQRKLTNDLEID

GYIVPAGGNVTISPMVLHSNHHVFKNPTEFNPDRFLPDEVSKRHPYDFMPFLAGP

RNCIGQKFAQLNEKVMISHIVRNFKIEPTLKYNDTKPCLEVVTKPSNGIPVRLIRRN

*

>CYP29A3 Y108G3 contig 541 504 aa

MSLILPCLLIILLLFIVSFWKIMKNILKYRKYDDlLPGPPAHPIFGNTKTFSNKTTE

EIFQALRDLFSEAVEKGQSLIRHRILGTFYVWPLDGKTVSKILESTTELDKGGPYE

FFNDWLGGGTLLEGYGERWRSHRKMLTPTFHFAKLEGYFEVFNTESRVVVDCLD

KFAKSGETVDLFPFFKRCTLDTICKTAMGAKVDAQLQNSHPYITAIEQALQLGVlY

AMNPHHQIPAIYWALGHQKKKDEYFNIMKTFTRNVIAERRTARESGEVEKETSKR

NMNFLDILLSNEESSVLSPEDLRQEVDTFMFAGHDTTTTSVSWVCWNLAHHPDIQQ

NVYEEIVSVFGEDPNEDVTTEGIKKLEYTERMLKESKRICPTVPAVLRQLISDMEIG

GVLIPAGANVAIAPMAIHKNANIYQNPDIFDPDRFLPEETAKRHAYDFIPFSAGLRN

CIGQKFAQLNEKVMVIHLLKNFKIEPMGGYYSTKQVFEPVGKPSNGIPVRLVRR

 

>CYP29A4 B0331 501 aa revised 3/19/04

MSILIPVALALLFVYLLSFYDTIRLMRKFWIYGGKMPGPPAHPIFGNASLFKNKTTK

DFVELFVQLAHEARSKGANLMRTQVMNRIYVWPLNGKTAATILESSTEVNKGDD

YAFLVPWLGGGLLMEKGEKWKSHRRILTPAFHFAKLEGYLDVFNSESKILIDCLEKI

AETQETVDLFPFFKRCTLDIICGTAMGIKLDAQNVHNLGYVQAVEGFNKLTVEYSL

NPFLWNRFVYWALGYQKMHDDFLYTLKKFTNDAIVERRTVIASGEIEKETSKRKM

NFLDILLNSEESNELTSDEIRKEVDTFMFAGHDTTSTSLSWLCWNIAHNPEVQENV

YKEIISIFGEDPNQDVTSENINRLEYTERVLKESKRMFPPVPGFQRKLTKDIVIDGITI

PSEGNITISPTVLHCNPFVYQNPEKFDPDRFLPEECAKRHSYDYIPFSAGLRNCIGQ

KFSILNEKVMLIHILRNFKLEPKLEFYETKPLFEVVAKPSHGIPVKLIKR

 

>CYP31A1P C01F6.3 Z68213 missing 25 aa at C-helix plus stop at ERK 477 aa

BJ104281 an EST for this pseudogene revised 3/23/04

MGVIILAVLLASATVIAWLLYKHLRMRQALKHLNQPRSYPIIGHGLITKPD

PEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSGDLVEAIFSSTKHLNRGFAY

VLLEP

WLGISILTSQKEQWRLXXXXXXXXXXXXXXXXXXXXXXXXXKILVQKLCCLG

ADERVDVLSVIALCTLDIICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPL

MWNSFIYNL

TEDGRTHEKCLHIFHDFTKKLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSGQM

DETDVQAEGNTFMLEGHDTTSTGLMWAVHLLGNHPDVQRKVQAELDEVMGDDE

DVTIEHLSRMKYLECALKEALRLFPSVLIITRELSDDQVIGGFNIPKGVTFLLNLYLV

HRDPAQWKDPDVFDPDRFHPENSIGRKSFAFIPFSAGSRNCIGQRFALMEEKVIMA

HLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTRRRPIASP*

 

>CYP31A2 F22B3 Z68336 487 aa revised 3/23/04

MGVIIPAVLLAMATVIAWLLYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFM

NQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEP

WLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKLCCLGADEEV

DVLSVITLCTLDIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL

MWNSFIYNL (2)

TEDGRTHEKCLRILHDFTKKVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETD

VQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEH

LSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPA

QWKDPDVFDPDRFLPENSIGRKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRN

FNIKAVELMHEVRPKMEIIVRPVTPIHMKLTRRRPIVSP*

 

>CYP31A3 T16C6 495 aa this sequence revised according to Y17G9.contig 61

97% to CYP31A2 10 aa diffs CE24183 is too long.

MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKP

DPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLN

KGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKIL

VQKMCSLGAEEEVDVLSVITLCTLDIICETSMGKAIGAQLAENNEYVWAVH

TINKLISKRTNNPLMWNSFIYNLTEDGRTHEKCLRILHDFTKKVIVERKEALQENDYKMEGR

LAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPEVQR

KVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGV

NIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSRNCI

GQRFALMEEKVIMAHLLRNFN VKAVELMHEVRPKMEIIVRPVTPIHMKLTRRRPIV

SP*

 

>CYP31A4P Pseudogene related to CYP31A sequences 44849-45028 of Y17G9.contig61

MTHLLRNFSVKDVELVHEVRPKMEIIVCPVSPIHMKLTRRRPIISP*

 

>CYP31A5P Y62E10A.15 CE28717.

MGVIIPAVLLASATVIAWLIYKHLRMRQVLK (0)

HLNQPRSYPIVGHGLITKP  50

DPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHL 100

NKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNE 150

QSKILIQKLCCLGVAD (2)

EEVDVLSVITLCTLDIICETSMGKAIGAQLAE (0)

NNEYVWAVHTINKLISKRTNNPLMWNSFIYNL

TEDGRTHEKCLHILHDFTKK 250

VIVERKEALQDSDYKMEGRLAF

(sequence gap)

VRPKMEIIVRPVTPIHMKLTRRRPIVSP*

 

>CYP32A1 C26F1.2    U53148 508 aa  also Y97E10 contig266 revised 3/19/04

MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKW

NPYAFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECI

RPVLESNTNISKPSQYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQD

YFPVFVRNAEVLADAVELHVDGDYFDAFPYFKRCTLDIICETAMGIQVNAQL

GHNNEYVHAVKRISEIVWNHMKFPWLWLKPIWYLTGLGFEFDRNVRMTNN

FVRK (0?)

VIQERKELLNEDGNEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDTFMFEGHDT

TSSGIGFTILWLGFYPECQKKLQKELDEVFGFETNQPPSMDDIKKCSYLEKCIKESLRMFPSVPLIAR

RLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWPDPDTYNPDNFDID

AIAGRDPYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQTEENLRP

VPELILRPYNGMKIKIKRREAADYVVL*

 

>CYP32B1 Y5H2B.5; CE26222

MLAAALVLLLTYFAYLIFRQKDDILQFLHVRKICTREFKKLRGPPAVLIFGT

LWYFKKDPVEMVYQAQAWFSEYTLAPDNCGVLKVAVNVEVS (0)

LGFQLWLGPIPAVNIARGEIAK ()

IVLDSSVNISKSSQYNKLKEWIGDGLLI ()

STGDKWRSRRKMLTQTFHF ()

AVLKEYQKIFGAQGKILVEVLQLRANNKFSFDIMPYIKRCALDIICETAM

GCSISSQRGANDEYVNSVRRLSEIVWNYEKAPQFWLKPIWYLFGDGFEFN

RHVKLTTDFTRDVIENRKKELKTHNSEQNETKKLAFLDYLLKSQEEHPDI

LTDEGIREEVDTFMFEGHDTTSSGITFAVWFLGQFPEYQQRVHDELDEIF

GEDFERIPNSEDIQKMVYLEQCIKETLRMTPPVPFVSRKLTEDVKI ()

PHATKPDLLLPAGINCMINIITIMKDARYFERPYEFFPEHFSPERVAAREPFAFVPFSAGPRNCI ()

GQKFALLEEKVLLSWIFRNFTVTSMTKFPEEMPIPELILKPQFGTQVLLRNRRKL

 

>CYP33A1 C12D5.7    U55365 492 aa

MIFFLLLTAFVLYLFNEFYWKRRGLPPGPIPLPIIGNMIELFSYPPPTAAYDA

WTKKFGKIYTVWMGTDPAVIITGYKELKDTFVNDGHSYLDKMIYSKLNTSLR

GGDYGVIDTNGNTWKEHRKFALHTLRDFGMGKEAMEASIQLEVDKIDEELKK

VEGKEVNIQEHFDLAIGNIINQFLFGNRFKDSSKFNELKKLLDLFFEVQGSLRV

YFAYTVDFLPQWMVELLTPDVSRVRDGIYQFFDEQIEEHRQEIDFETSDSKDY

VETFMKEQKKREAEGDFASFSNEQLKNMCFDMWVAGMHTTTNTMGFLTAF

ALNNMDAQRKMQKELTEVIGDRVVTMKDKLNLPYTNAFINEAQRCANLVP

MNLPHAVTRDVQLLGCTIPKGTTVIHQISSVMSDPEIFEEPERFVPERYLDES

GNLKKIEELVPFSIGKRVCLGEGLARMELFLFTANSFNRYEFNCGSNGVPSLER

TFAFIAKAQDYTCQVKPRYSS*

 

>CYP33B1 C25E10.2   U50311 495 aa

MILIFFALCTLFFLIHQYLWRRRGLPPGPTPIPIFGNLFQLSGSEAPGISIFQKWKDQ

YGPIFTFYMGPVPFVVLTDYQDIKETVIKDGDTYADKYLSPEFNKYFRGGEYGIMDIS

GDRWKEHRKFAVLQLRELGVGKPLMESKILIEAEELIKKLKTAEILNEDFLLQSELD

VAVGSVINQFLFGYRFDRSKLFEFTRIKTLVNNFMEEVGKPLGVLAFTCHGIPSFL

VKLMVSGIEEQKRELFRFLRKQIDGAKSQINYEEEHNEDFVEAYLRKKFQREQKND

FDSYCDSQLENVCFDIWAAGFDTLTNTVGFLIAYAINYPEMQMLIHQEIDNYLAHH

SRLLTLADKNALVYFNAFANEAQRVSNILPMNLPHALTRDVKLKGYHLKKGTGVI

HQIANVMTDETIFKDSQRFDPNRFIDENGKLKKIEELCPFSMGKRQCIGEGLARMEI

FLLAANLFNYFEFLPASDGLPSLYKDFSLVSHVIPYKCRQYIP*

 

>CYP33C1 C45H4.2 502 aa revised 3/20/04 small deletion

MIIILLLTFLTIYFVYELYWKRRNFPPGPCPLPVFGNLLSIANPPPGYKAFER

WTKKYGDVYTFWIGNTPHIMINTWDKIKETFIRDADTYTNKVVLPMVTLSR

GGEYGIIDSNGAMWREHRRFALSTMRDFGLGKNLMQENILMEVQDVFARLD

AKLGSETDVPEVFDHAVANVVNQLLFGYRFMGPKENEYQELKHIIDSPAEIF

GKLHIFLAMNIPIFAKLLPESLYEGPIKTFRDTTLAFFNKQIEAHRHRIDFEDL

NSESTDFVETFLKEQKRRESEGDSETFSNIQLLNVCIDLWFAGLNT

TTNTITWAISYVLHHPEVQDKIHEELDKVIGSDRLITTADKNDLPYFNASINE

SQRGINILPLNLQHATTRDTVIDGFKIPKGTGVVAQISTVMNNEEVFPDPYTF

NPDRFIDENGKLKKVDELAPFSVGKRSCPGEGLARMELFLFIANFLNRYQ IHPS

KEGLPSMAKGSGPVVAPRLFTAILTRRF*

 

>CYP33C2 C45H4.17 AF039053.2 39539-42553 495aa

MLIILLLTVLTVYLVYELYWKRRNFPPGPCPLPVFGNLLSIANP

PPGYEAFERWTKKYGDIYTFWIGNTPHIMINTWDKIKETYIRDAETYTNKVRLPALDL

FRGGEYGIIDSNGATWREHRRFALSTMRDFGLGKNLMQENILMEVQDVFARLDAKLGS

ETDIPEVFDHAVANVVNQLLFGYRFVGPKESEYQELKRIIEAPTDLFGKPHMFLAMNI

PVIAKLMPEFMYEGPFKDFRDTAFAFFNKQIEAHRQEVDLNDLNSESTDFVETFLKEQ

KRRESEGDSETFSNVQLSNVCMDLWFAGLNTTTNTITWAISYILHHPEVQDKVHEEMD

KVIGSDRLITTADKNDLPYFNAFINEAQRGINIVPLNLQHATTRDTVINGFKIPKGTG

VVAQISTVMNNEEVFPDPHTFNPDRFIDEHGKLKRVEELAPFSVGKRSCPGEGLARME

LFLFIANFLNRYKIYPSKEGPPSMDKSNGALIAPRLFSATLKRRS

 

>CYP33C3 F41B5.4 500 aa

MILLLVFTAFLAFLFHELYWKRRNLPPGPIPLPYMGNILTMLMNKPGYECFRNWT

KKYGDVYTFWMGKTPYVMISSYDLLKDTFVRDGDTYKDKYPQPFNQKIRGGNYG

IVESNGHLWNTHRRFALTTLRDFGLGKDLMHEKILIEVENIFKKFDAQLGLEQDVS

VVMNNAIANVINQNIFGYRFEGDKEEDFKKLRELMEYQETAFATFKVYVEAFIPKV

GALLPGRSLDVLLEEWRDNFYSFFDTQIENHRKKIDFDSQESLDYAEAYLKEQRKQ

EALGEFELFSNKQLSNTCLDLWSAGLSTTYTTVTWALAYVLNSPEVLEKMRSELDE

VVGKDRFVTTADKNDLPYMNAAINEVQRCANLLPVNVAHATTRDTVINGYAIKKG

TGVIAQISTVMLDEKVFPDPYKFNPDRFIDEHGKLIKVEQLIPFSIGRRQCPGEGL

ARMEIFLFVANFFNHYQISPSSEGFPSIDKSDRVGVFPKKFKAILNRRFYGPSKEI*

 

>CYP33C4 F44C8 493 aa revised 3/22/04

MIILLVFTAFLAFLFHELYWKRRNLPPGPIPLPFMGNILTMLLNKPGYECFRNCTK

KYGDVYTFWMGKTPFVMISSYDLLKDTFVRDGDTYKDKYPQPLNQKIRGGNYGI

VESNGHLWSTHRRFALTTLRDFGLGKDLMQEKILIEVEDIFKKFDAQLGLEQDVSV

VMNNAIANVINQNIFGYRFEGDKEEDFKKLRELMEYQETAFATFKVYVEAFIPKVG

ALLPGRSLNELLEEWRDNFYTFFDTQIENHRKKIDFDSQESLDYAEAYLKEQRKQE

ALGEFELFSNKQLSNTCLDLWLAGLSTTNTTVNWTICYVLNHPDVLQKMNEEFDQ

VVGSDRLVTMGDKNNLPYFNAVLNESQRCANIVPINLFHATTKDTVINGYPVKKG

TGVIAQISTVMLDEKiFPDPYKFNPDRFIDENGKPIKIEQLIPFSIGKRQCPGEGL

ARMEMFLFLANFFNRYKISPSSKGFPNLDKKDNVGVFPKDLHAILNKRN*

 

>CYP33C5 F41B5.3 494 aa revised 3/22/04

MILLLLSAAVCLFLFHELYWKRRSLPPGPTPLPFMGNTLAMLLEKPGYECFRRWT

KQFGGVYTFWMGNIPYVIIGSYDLLKETFVRDGDTYKDKYPQPFNEKLRGGMYGI

VESNGHMWSTHRRFALSTLRDFGLGKDLMQEKILIEVEDIFRKYDAQLGKEQDIQV

VLNNAIANVINQTIFGYRFDETNQEEFERMRHLVEYQEKQFATVKVYVEAFVPTIG

KFLPGKSLDQLLDDWRNSFYTFFDTQIENHRKKIDFDSEESQDYAEAYLKEQKKQE

ALGEFELFSNTQFSNTCLDLWLAGVSTTNTTVNWTICYVLNHPDVLQKMNEEFDQ

VVGSDRLVTMGDKNNLPYFNAVLNESQRCANIVPINLFHATTKDTVINGYPVKKG

TGVIAQISTVMLDEKiFPEPYKFNPDRFIDENGKPIKIEQLIPFSIGKRQCPGEGLA

RMEIFLFLANFFNRYkISPSSKVFPNLDKKDNVGVFPKDLHAILNRRNC*

 

>CYP33C6 F41B5.7 496 aa

MIILLLSAAVCVFLFHELFWKRRHLPPGPTPLPFMGNILAVILEKPGYECFRRWTK

QFGDVYTFWMGNTPYIIIGTYDLLKETYIRDGDTYKDKYPHPFNEQLRGGMYGIVE

SNGHLWSTHRKFALSTLRNFGLGKDLMQENILIEVEDIFRKFDAQLGKEQEIQVVL

KNAIANVINQIIFGYRFDATNQDKLEKMRYLVEYQEKAFTTVKASIQAFAPRIGSILP

GKSLDQLLNEFKDNFYTFFNAEIENHRRKIDVDSEESQDYAEAYLKEQRKHEARGE

FELFSNTQFANTCLDLWLAGVSTTNTTVNWTICYVLNHPEVLQKMNEEFDQVVGS

DRLVSMGDKNNLPYFNAVLNESQRCANIVTINLFHATTKDTVINGYPIKKGTGVIA

QISTVMLDEKiFPEPYKFNPDRFIGENGKPIKIEQLIPFSIGKRQCPGEGLARMEI

FLLVANLFNRYQISPFSEGFPNLEKNSVVEIFAKGLRVILTKRHC*

 

>CYP33C7 F41B5.2 494 aa

MILLLLFTVLIAFLFHEMYWKRRNLPPGPIPLPLIGNLQSMLLEKPGYECFRRWNK

QFGDLYTFWMGNTPYVIVSSYELMKETFIRDGDTYKDKFPQPFNEKFRGGIFGIIET

NGHLWNTHRRFALSTLRDFGLGKDLMQEKILIEVEDIFRKYDAQIDKDMEISTILH

NAIANVINQTIFGYRFDESNQEEYKKLKHLIEFQENVFTSAKVTVQVFAPKLGKILP

GESLEDLMKDWKNSFYDFFNTQIENHRQKIDFDSEESQDYAEAYLKEQKKYEALG

DTELFSNKQLSNTCLDLWFAGLSTTNTTANWTICYVMNTPGVLEKIHDELDKVVG

SDRLVTTADKNNLPYMNAVINESQRCTNIVPINLFHATTRDTVINGYPVKKGTGVI

AQISTVMLDEKVFPEPYKFKPERFIDESGKLIKVDELVPFSIGKRQCPGEGLARM

ELFLFIANFFNRYQISPSSEGLPSIDKSERVGVFPRKFNAILKKRHV*

 

>CYP33C8 R08F11 494 aa also Y39H10 contig 180 revised 3/22/04

MFFILVLVLTGISIYLFHLLYWKRRNLPPGPTPLPLFGNTLPTLFDIPGYKTFGKLK

EQYGDAYTFWMGKDPYVMICSYDLLKETFIRDADTYKDKDYNPIDEKIRGGIYGI

LQTNGHVWNTHRRFALTTFRDFGLGKDLMQQKILIEVEDMFRKLDENIGEEQDVP

TVLYNAVANVINQIIFGYRFEGVKQEEFTKLKELMEYLETAFATLKIYVEIFVPWIG

KLLPGKSLDDVLNYWKDTSYEFFNSQIENHRLKIDFDTEESLDYAEAYLKEQKKQ

KAQGEFELFCDKQLSNNCLDLWTAGLSTTIITINWTICYIMNTPGVQEKMQEEMDK

VVGGGRLVTTADKNDLPYMNAVINEAQRCGNIVPLNLLHCTTKDTVINGYSVKK

GTGVVAQISTVMYDEKiFPDPYTFNPDRFIDDNGKLVKIEQLIPFSIGKRQCLGE

GLARMELFLFIANFLNRYHISPGSSGLPSLDKSKGITPRKFEPVLSRRHA*

 

>CYP33C9 C50H11 495 aa revised 3/21/04

MVLNIILVVVVAFLFHHLYWKRRNwpaGPTPLPLIGNLLSLRNPAPGYKAF

ARWTAKYGDIYTFWLGTRPYILVSSYEALKETFIKDGETYADKKPMAFQESF

RGGSYGVVETNGPFWREHRRFAIHQFRDFGLGKDRMEQRIMLEVEDIFNNCDK

TIGEGVDLTDIFDRAVGNVINQMLFGYRFDETRADEFRTIRAFFNFNSGEFAS

FSMRVQFFLPWMGYIMPGPTILDRFKKYQKGFTEFFGTQIENHKKEIDFELEEN

SDYVEAFLKEQRKREASGDFESfS (2)

TKQLSNMCLDLWFAALMTTSN

TMTWCFAYTLNYLDAQQKLHEELDRVIGSERHINTADKPNLPYTNAYINEIQ

RTANLVPLNLLHMTTRDTVLKGYNIPKGTGVVAQISTVMYDENVFPEPYIF

KPERFLDDDGKLKKVEQLVPFSVGKRQCLGEGLARMELFLFIANFFNRYRVV

PDANGPPIIDKAVLGGMHTKEFKAILQRRHVSE*

 

>CYP33C10P F41B5.5    PSEUDOGENE 162 aa

MILILLFTACLAVVFHEIYWKRRHFPPVPIPLPLIGNLHSLILEKPRYESFRKWKKG

FGDVFTVWMG*TPYIMITSYDLLKETFIRDGDTYKDNYPQPFNEKFRGGLYGLIDS

NGLLWSTHRRFALSTLRDFGLGKDLMQEKILIEVEDIFQKCDAQLGQEQD*

 

>CYP33C11 Y49C4A.9 CONTIG103 AC024799 comp(22028-24307)

MIAFSIFIAIFFWLAHCFYFKRRNLPPGPTPLPFLGNSLSMIWE

NPGYECFRRWTRIYGDVFTFWLGDTPYVVISSLEKMKETFIKDGETYQDKIQFSFTEK

FRGGRFGVIETTGHMWSTHRRFAISTFRDLGLGKNLMQEKILAEVEAVFQILDANLGA

EQDVPEVIYNAVANVINQMIFGYRFDESNQNEFEKLKDLIDIQEKSFASFKLCVQAFV

PIMNKLLPGKSLEDLIAEKRHDYYNFFYAQIQDHRAKINYQAEEYLDYAEAYLKEQKN

QEQGGLGSDLFSDQQLANMCFDLWFAGLTTTNTTINWTISYVLNHPEVQDKMQEELDK

VVGSDRLVTISDKSSLPYMNAVINESQRCVNLLPCNLFHATTRDTVINGNIIPKGTGV

IAQISTVMLDEKIFPDPYKFQPERFIDENGALKKIEQLIPFSVGKRQCLGEGLARMEL

FLIVANLFNRYFITPSSAGPPSLERTEMVGVFPRKFQAILTRRYE*

 

>CYP33C12P  Y5H2B.6; CE21315. AC006810 one in frame stop codon = pseudogene

MLLLLFLSVLFLALFYEFHWKRRNYPAGPLPLPVIGNMWSMMRNNSGVEC

FRQWTKDFGDVYTFWFGTKPYIVVSSYKRLKEAFILDGDTFADKIRQPFQ

DQFRGGNYGVVDTNGHVWSTHRRFALSSFRDFGLGKNLLQEKMLIEVQDM

FAKFDANLGKEQNLPVVLYNAAANVINQLIFGYRFDKEREGELKKLKALM

EFQETAFTTFKVYVQFFAPAIGKHLPGKSVEDLLAEFTVDFYKFFNHQIE

EHRSKIDFDSEESLDYAEAYLKEQRKQEAQGEFELFSTKQLSNTCFDLWF

AGLSTTHITLTWIVGHVLNYPDVQRKLHKELDEVIGSDRLITNDDKNNLP

YLNAVINESQRCANIVPINQIHSTSRDTVINGITVKKGTGVIPQISAIML

DDKVFPDPYAFNPERFLDANGKLRKI*EFVPFSVGKRQCLGEGLARMEMF

IFMSNFFNRYQ (0) 17014

VSPASSGPPSLVKESLLNVAPRKFDAILKKRHV* 16469

 

>CYP33D1 K05D4.4 492 aa revised 3/22/04

MLLLLISSLIFLFFAYHFSWKRRNYPPGPTPLPLVGNLLQLQKFGYDIFHKWK

KEFGPVFTFWLadRPFIFITSYEVMKETFVKDGDTFADKQLNQIDKKKLQRNY

GVLDTNGEMWREHRRFTLSQLRDLGLGKDLMQEKILLEIEEQFKDINSHLGEEIDLPSVL

DRGVGNVINLTLFNKRFETNQRDEFTYLKSLIDGLRDVASEFRYFIQFLVPWS

SKIIPGPTLGDKTKGMKEELDVFFVKQVEEHRKEIDFDTVESYDYVEAYLKEQ

KKREADGDHETFCNKQLYAMCFDLWMAGLQTTTVTLTWGFSFYLHNADVQ

LKIREELDRVIGNDRLITTADKNCLPYLTAFINETQRCANIIPFNLLHVATRDT

VIEGYPVKKGTGVIAQIGTVMSDEQIFPDAHCFNPGRFIENGKLKKVDEVIPF

SIGKRQCLGEGLARMELFLFFANIFNRYDVQLDFSGNLPDLDKSKDNFVTPRKF

NAVLNKRYV*

 

>CYP33D3 C54E10 Y69H2 contigs 04354 and 04878 and

Y80D3 contig 02849 496 aa revised 3/22/04

MLLLLIFSSIFLYFFYHFHWKRRNLPPGPRPLPFLGNLLSLKTLKPGYEAFSNWKK

EYGPIFTFWMaNKPFVIIASYEKMKETFVKDGDTYVDKQLTHTEKERLGENYGVL

DTNGHMWKEHRRFTLTQLRDLGLGKDLMQEKILMEVEELFKELDAHGGEEIDLPK

LIDRSVGNVINLTLFNKRFDMDKRDEFAHLKSLIDGMRNVTSQfRYLIQYLVPWTS

TVLPGPTLSEKVRAKREELDDFFYSQIDEHRNEIDFDNTENLDFVEAYLKEQKKREE

DGDFktfcNKQLCAMLFDLWIAGLMTTTMTMTWGLSYYLYNPEVQRKIREELDKVI

GNDRLISTADKNDLPYLQAFVTETQRTANIIPLNLIHMTTRDTVIDGFPIQKGTGVIA

QISTVMYDEkVFPEPYKFKPERFIENGKFKKVDEVIPFSIGKRQCLGEGLARIELFLF

FANIFNRYDVMPDFSGSLPDLDKSKDNFVIPRKFKAVLKRRYMD*

 

>CYP33E1 C49C8.4    U61945 494 aa revised 3/20/04

MILLILTSILIIYLFNHFYWKRRKLPPGPIPLPIIGNLYLMTEDVKPGYKMYE

KLKDKYGPVFTFWLANLPMVTVTDWKLIKQHFIKDGANFVGRPEFPISMEM

RQGPYGIIESHGDRWIQQRRFALHILRDFGLGKNLMEEKVLGEVTAMIDSIRK

SMEDVDMQNIFDASVGSVINNMLFGYRYDETNIEEFLELKNRMNKHFKLAAE

PMGGLIGMNPWLGHLPFFKGYKNVIMHNWMGLMEMFRKQATDRLASIDY

DSDEYSDYVEAFLKERKKHENEQDFGGFeMEQLDSVCFDLWVAGMETTSNTLNW

ALLYVLLNPEVRQKVYEELEREIGSDRIITTTDRPKLNYINATVNESQRLANLLPMN

LSRSTNADVEIAGYRIPKDTVITPQISSVMYDPEIFPEPYEFKPERFLESDGSLKKVEE

LVPFSIGKRQCLGEGLAKMELFLYFANLFNKFDIKFHESNPNPSIKKEVGVTMKA

KNYRVSMKERY*

 

>CYP33E2 F42A9.5    U61952 509 aa

MILFIVLSIVSIYLFDLFYWKRRNLPPGPLPLPLVGNLHMMSDDVKPGYSLF

SNLKEQYGHVYTFWMASLPIVHVTDWNLIKQHFIKDGGNFVGRPEFPMSIEL

RQGPYGIIESHGDRWVQQRRFALHVLRDFGLGKNLMEEKVLGEVTAMLDRL

RKTMDEVDMQSVFDASVGSVINNLLFGYRYDETNIAEFLDLKEKLNNHFQLA

AKPIGGLIGMNPWLGYFPYFSGYKNGLFYELVFIFKMCFIVMIQNWMGLVE

MFRKQANEKLATIDYESDDYSDYVEAFLKERKKHENEKDFGGFEMEQLDSVCF

DLWVAGMETTSNTLNWALLYVLRNPEVRQKVYEELDREIGSDRIITTSDKPK

LNYINATINESQRLANLLPMNLARSTTKDVEIAGYHIKKNTVIIPQISLVLYN

PEIFPEPYEFKPERFLESDGSLKKVEELVPFSIGKRQCPGEGLAKMELLLFFANL

FNRFDIQLHQSNPNPSVEKEFGVTMKAKSYRLKLKDRH*

 

>CYP33E3P F42A9.4    U61952 PSEUDOGENE 236 aa

MILALSLALLFLYLFHFIYWKRRNLPPGPAPLPFVGNLLMLTEKVKPGYKL

WDSLTQQYGSVFTFWMAGLPMVFVTDWKLIKQYFIKDGGSYVGRPEFPLNT

EVKKGPYGIIDAHGNRWIQQRRFALHVLRDFGLGKNIMEEKILTEVTVMIER

LRKTIACVDMQNVFDTSIGSIINSLIFGYRFDESEFGWKFEKSRRACSIFNWKT

TVSRRRTCQNGIISVFCKLVQYIRYSTS*

 

>CYP34A1 T10H4.10   Z81119 504 aa

MFLVLLFFTIISLALAHQWRARKQLPRGPYPLPLIGNLHQLLYYCWKNGGLVEGY

AEIQKSFGKVYTLWIGPLPTVFIADFEVAHETHVKRAHEFGTRYAPGLMNYNRYE

RGVVASNGEFFQEHSRFVLSTFRNFASGRNIMEERIMDEYRYRFEDFASSNGICNK

ERKIFETWARPFFDLLTGSVINKILINERFEQNDPEFEKLVSRLAKGFENTGFLDIFC

PVRILESRFLKWRQDTIFEPFNYILELNKKSIARRVAQLKADEHVLSDDPDDFLDAY

LLKMQKDKNEGLETTFTLENLAVDMYDLWLGGQETTSTTLNWACACLLNRPDVIT

TAREELIRVTGGHRSLSLIDRRETPYLSAVISEVQRFASILNMNLFRIIKEDTVVDGQ

PLRAGTAVTAHIAMIHVDEDLFKNHTEFRPERFLENDDLDKKLIPFGIGRRSCLGES

LAKAELYLVLGNLLLDYDLEPVGEIPKLKTLAPFGLLKQSPEFRIRFVEVEKH*

 

>CYP34A2 T10H4.11  Z81119 502 aa

MFLILLFSTILAIALVHQWRARKQLPRGPYPLPLIGNLHQLFYYCWKNGGLVEGYA

EIEKSFGKVYTVWIGPMPTVFISDYDVAQETHVKRANVFGTRYAPGIMNYVRFDK

GVVASNGEFWQEHRRFALTTLRNFGFGRNIMEERIMDEYRYRFKDFATTGNNNM

AKSFETCARSFFDLLTGSVINKVLINERFEQDDADFEKLKTNLSRGLENTGFLDIFC

PVNILQSRFLKWRQDSIFQPLDWILELTKRNIAKRVAQLKSGEHVLHDEPDDFLDA

YLMKMYKDEKEGLDSTFTLDNLAIDMYDLWIAGQETTSTTLAWACACLLNKPEVV

LKAREELVHVTGGHRSLSLTDKKVTPYLSAVISEVQRVASILNVNLFRILEEDTYVG

GQPIRAGTAVTAHISMIHVDETLFKNHTEFNPERFLEVEGLDKKMIPFGIGKRACL

GESLAKAELYLVLGNMLLDYNLEPVGDVPKIQTITPFGLLKRPPPFKVRFVEVEKS*

 

>CYP34A3 C41G6.1     Z81047 494 aa also T13C10 Z81591

This gene has a small exon RIQDFSKTHG

MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGGIV

AGFQVFKKQYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFTTGVM

NYIREGRGIIGSNGAFWQEHRRFALTTLRNFGLGKNIMEDRIMDEYRY

RIQDFSKTHG

KNGVIEVNAATMFDLLVGSIINRMLVSKRFEQGNLEFETMKVYLTKA

LEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPVEYIYSLVQKEIQERTTSIENG

SHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAIDLFDLWLAGQDTSSTT

LLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDRARTPYLMAVLNEIQR

IASILNINIFRELREDTEIDGQPIAAGTVVANQLSMIHTDEELFEDHTRFDPER

FIENSTLEKKLIPFNLGKRSCPGESLARAELYLIIGNLVLDFDFEAVGIKPEIKT

STPFGIMKRPPNYISD*

 

>CYP34A4 T09H2.1 499 aa revised 3/22/04

MLLILLFTTVLAILIIHQWIKRRNLPNGPTPIPIIGNLHQLFYLYWKHGGAVSAYRQL

EQTYGKVFTVWIGPLPTVYISDYDLAHETHIKKSNIFGARFAVGALNYIREGRGIVA

SNGEFWQEHRRFALTTLRNFGLGRNLMEEKIMEEYRYRFAQTGNGNNNKSSIET

NSSMFFDLLIGSIINQLLISERFEQGDPEFEKLKESLSIGLEKFGVLDIFLPDWIMNAWWM

KWRMDDILGPFSWIHRLSQRNVQRRMEQIESGEHVIDGDGTDFMDTYINKIEKDK

REGVDSTFTLETLAIDMYDLWIAGQETTSTTLSWACACLLNHPEVVKIAREELVHLTGGHRSISLT

DKTSTPYLNAVInEVQRIASILNVNIFRQTSEDTTVNGQPIAAGTALTTHLSLIHTDEN

LFQNHTEFRPERFLENNNLEKKLIPFGIGKRACLGESLARAELYLVTGNMILDYDL

QPIGEVPQIKTTSPCGIMKRPPVYSLRFVPVHHA*

 

>CYP34A5 B0213.10 499 aa

MFLILILTSILTYFSINLWRGRRRNPKGPLPLPLIGNLHQLIYNSWKTGGMVAGFQE

FKKHYGKFFTLWFGPIPIVFIADYDIAYETHVKRANVFGHRFTIGGMDYIREGRGI

VGSNGDFWQEHRRFALTTLRNFGVGRNIMEDKIMDEYRYRIQDFSKTHGKNGVIE

VNATTMFDLLVGSIINRMLVSERFEQGDQDFEKLKMYLTKALEELSIFDSFTPLWLL

KSRILRWRTKVTMAPFDFVYELVKNGIQKRTSEIKNGSHVISEEGDDFVDAFLIKIE

KDKKEGIDSTFTLETLAIDLFDLWLAGQETTSTTLTWAGTCLLNHPEVVEKLRKELI

QITGGTRSVSLTDRAQTPYLTAVVNEVQRISSILNVNIFRQLQEDSHIDGQPIASGSV

VTTQLAMIHTDEDLFKDHTKFDPERFIENNNLDKKLIPFGIGKRSCPGESLARAELY

LIIGNLVLDFNLEAVGVKPEIKTSTPFGLMKRPPNYNIRLIPVSH

 

>CYP34A6 B0213.11 498 aa

MLFILILTATVLILTVNIWRAWRKLPNGPLPIPLIGNFHQLFYYAWRFGGIVEGIKE

MKKQYGKVFTLWMGPLPMVNICDYDIAYETHVKRANIFVNRYIHGATEYIREGRG

IIGSNGDFWLEHRRFALTTFRNFGIGRNIIEGKIMEEYNYRFEDFHETHFKNGAIQV

SASMFFDLLVGSIINQLLVSERFEQDDKEFEELKTKLTMALENSSIIEGVMPLWLLK

SKFMKWRTKTTFAPFDFIYEVGQKGIQRRVAAIENGTHVLSEEGDDFVDAFIIKIEK

DSKEEVESTFTLETLAVDLFDLWVAGQETTSTTLCWAFVCLMNYPEVVEKLRNELT

EVTGGMRSVSLADRSSTLYLNATINEIQRIASILNVNLPRVLEEDALIDGVLVPAGTA

FATQLSAMHTDDETFKNHKEFNPERFMENNNLEKKLIPFGIGKRSCPGESLARAE

LYLIIANIVMEYEIEPVGATPKMKTPTPFSLLKRPPSYEIRFVKRS*

 

>CYP34A7 B0213.12 499 aa

MLIILILVAIAAVLTVNLWRARQKLPNGPTPLPIIGNFHQLFYNGWKYGGLVAGFD

QFRKQYGKVFTVWMGPIPAVQICDFDVAHETHVKKAHTFGHRYTFGAMEYIREG

KGIIGSNGDFWLEHRRFALMTLRNFGLGRNIIEDKIMEEYRYRFEDFKKTNFKDG

AIQVNASSLFDLLVGSIINQLLVSERFEQDDEEFEELKTNLAMALENGSIIEGVLPLW

MLKSRFMKWRTKTTFAPFDFVFEVGKKGIQRRVAAIENGTHTLSEEGDDFVDAFI

VKMEKDKKDGIDSSFTLETLAVDLFDLWQAGQETTSTTLTWACACLLNHPEVVEK

LRKELTEVTGGARGVSLTDRTKTPYLNATINEVQRISSILNVNLLRILEEDAVIDGHP

VPAGTAFTTQLALLHTDEETFKNHKEFIPERFLENNNLEKRLIPFGIGKRSCPGESL

AKAELYLIIGNLVIDFDLKPVGAIPKIESPTPFSPVKRPPVYDIRFISRAH

 

>CYP34A8 B0213.14 499 aa

MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFN

ELKKQYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGR

GIIGSNGDFWLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGA

IQVNASSIFDLLVGSIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVL

KSPLMKWRTKITFAPFDFIFELGNRGIQRRVAAIENGTHTLSEEGDDFVDAFIVKME

KDKKEGIDSSFTLETLAIDLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKEL

TEVTGGTRGVSLTDRTKTPYLNANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAG

ALVTTNLSMLHTDEEIFKNPQEFRPERFMENNNLEKRLIPFGIGKRACPGESLARA

ELFLITGNMILDYDLEPVGTLPKIETTTPFAPMKRPPVYDIRFVPRSQ

 

>CYP34A9 B0213.15 499 aa

MILILILSAIVAIFTVNLWRARQKLPKGPTPLPLIGNLHQLIYNAWKNGGIVQGFNE

FKKQYGNVFTIWMGPIPSVHIADFDVAHETHVKRANTFGHRFSNGGMDYIREGR

GIIASNGEFWQEHRRFALTTLRNFGLGRNIMEDKIMEEYRYRFKDFKNTHFKNGG

IQVNASSCFDLLVGSIINQLLVSERFEQDNKEFERLKTSLAKGLEKVSIFEAFMPVW

VLKSKLMRWRTKTTFEPFDFIFGLVERGIQKRVSAIKNGTHIPSEEGEDFVDAFLIK

MEKDKNEGIQSTFDLETLAVDLYDLWLAGQETTSTTLTWACACLLNYPEVVKEIRK

ELTEVTGGTRSLSLTDRPKTPYLGATINEIQRIASILNVNIFRLMEEDSIIEGQPISAGA

VLTTQLAMLHTDEKTFKNHTEFRPERFLENNNLEKRLIPFGIGKRSCLGESLARAE

LYLITGNMILDFDFESVGAAPRIQTPTPFALMKRPPSYDIRFIPRSK

 

>CYP34A10 B0213.16 499 aa

MLIILLLVTFFAVATAYQWRKCQRLPKGPTPLPLIGNFHQFIYYGIKLGSAVAVYHE

FEKTYGKVFTIWMGPLPSVYISDYDVAHESHIKRANIFSTRFAHGTTNYIREGRGII

AANGDFWQDHRRFALTTLRNFGLGRNLMEEKIMEEYNYRISDYKRTHLKNGAIEI

HAGTFFDLLVGSIINQLLISERFEQGDQEFEKIKTSLALSLENFGIIDIFFPTAILDSPL

MKWRQKKIFEPFDFIYEVSKRNLTKRMEAVKSGDHVIEDGGRDFVDAYILKIQKD

ENSGAPSTFNLETMIIDVFDLWQAGQETTSTTLTWAFCCLLNHPNVVKKLRAELM

KLTGGNRHVGLNDRADTPYLNAVCNEVQRIASILNINLFRKIEQDTEIAGQPLASG

CVVTTQMSMLHTDVEVYKNPTEFRPERFLENNTLEKKLIPFGIGKRSCPGESLARA

ELYLILSNVILDVEIEAVGSIPKILTSNPFGLLRRPPSYDIRFKPLKV

 

>CYP35A1 C03G6.14    C13B3.B 502 aa Revised 3/19/04

MFLVLIFLALSCWLIIRQYQKVSRLPPGPVSFPIIGNLPHIIYYLWATGGIVS

TLDLFRKKYGNIFTLWVGPVPHVSICDYETSHEVFVKGANKYADIAHAPLFR

ELRQEMGVLVTNGSHWSTMKRFALHTFRDMGVGKDLMETRIMEELDARC

ADTDKSATDGVTVAQAGDFFDLTVGSIINSILVGKRFEEHNKDDFLKIKEAMG

AAFEVFSPFDMAVPVWFLRTFFRSRYDMMMTTQNTAKRFAAAEAVKRIED

IKSGAYEIDESNIEDYTDAFLLKIQKDGEDLDFNIETLKTMIIDLWMTGQETTTT

TLISGFTQLLLHPEVMVKAREEILKITENGSRHLSLTDRTSTPYLNAMIGEIQRHASI

LNVSFWKINKELTYMGGHPVDAGALVTAQLSALHVNDTIFKNPQEFDPERFIRDEE

LLQKVIPFGVGKRSCIGESLARAELYL

IIGNLLLRYKFEPHGTLSTTELLPYSAGKRPFKLEMKFVKI*

 

>CYP35A2 C03G6.15    C13B3.A 494 aa Revised 3/19/04

MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVST

LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMR

DVRndIGVLITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD

IDKLATNGVTITHASEFFDLTVGSIINSILVGKRFEEDTKHEFLKIKETMDASF

ETFSPFDMTAPVWFLKTFFKHRYDKIWSAQETAKNFAAAEAIKRVESIKSGKYVID

ENNLQDYTDAFLLKIQKEGESKDFNIETLKTMIIDLWMTGQETTTTTLISGFNQL

LLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAVIGEIQRHASILNVSFWKIN

KEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPERFIRDGKLLQKVIPF

GVGKRNCLGESLAKAELYL

IFGNLLLRYKFEQHGKLSTTELMPYSAGKRPFKLEMKFVKI*

 

>CYP35A3 AC006673 K09D9 494 aa Revised 3/22/04

MFFVLFIAACLSWLIVRQYQKVSRHPPGPISFPLIGNLPQICYYLWSTGGIVSALDLL

RKKYGNIFTLWVGPVPYVSIADFETSHEVFVKNGGKYADKLHAPIMRDIRkdK

GIAFTNGDHWQEMRRFSLQTFRNMGVGKDIMETRILEELDARCSDIDKSAKNGV

TVAQASEFFDLTVGSVINSILVGKRFEEDTKHVFLRIKNTIDESFKLITPFNTTVPVW

ILKTFFKDRYDKMADAQEIAKNFVAAEALKRiEDIKSGKYVIDENNLQDYTDA

FLLKMQKEGENLDFNTETLKTMLVDLWLTGQETTTTTLTSGFNQLL

LHPEVMVKAREELMNVTENGSRSLSLTDRSSTPYLNAMIGEIQRHASILNLSFWKV

NKEFTYIGGHPVDAGALVTAQLSALHVNETYFTNPQVFEPERYSQDEKLLQKIIPFG

VGKRSCLGESLAKAELYL

IFGNLLLRYKFEPHGKLSATDLMPYSAGRRPFKLEMKFVKI

 

>CYP35A4 C49G7 494 aa Revised 3/20/04

MFVILFISAILSWLIVRQYQKVSRLPPGPVSLPLIGNLPQIFYYLWTTGSIVS

TLDLFRKRYGNIFTLWVGPLPHVSIADYEISHEVFVKNGSKYADKFHAPVMQ

DIRkdMGIMVTNGDHWQHMRRFSLQTFRNMGVGKDIMETKIMQELDARC

AEIDTSAMNGVTVTQASEFFELTVGSIINSILVGKRFDATTKHEFLKIIETMD

ASIETASFFDMMVPVWILKTFFKHRYDNLSDAFEVSKAFSAAEAIKRVDQI

KSGRYFIDENNLQDYTDAFLLKIEKEGQCQDFNMETLKTMIGDLWITGQE

TTTTTLISGFNQLLLHPEVMVKAREELMKVTENGSRSLSLTDRASTPYLNAM

IGEIQRHASILNVNFWKINKEFTYMGGHPVDAGALVTAQLGTLHVNETVFE

NPLKFDPERYIRDENLLQKVIPFGVGKRSCLGESLARAELYL

IFGNLLLRYKFESSSKLSTTELAPYSIGKRPFKLEMKFVKI 1482

 

>CYP35A5 K07C6.5 495 aa Revised 3/19/04

MMFILIVSLILTWLIVRQYQKVSRLPPGPVSLPLIGNLPQIVYYLWTTGGIVSTLDLF

RKRYGNIFTLWVGPVPHVSIADYETSYEVFVKQGNKLADVAHSPVLKEFSRGVGL

IRCNGEHWQVMRRFALQTFRNMGFGKEPMEIKIMEELDARCAEIDQASINGVTVTS

ASEFFDLTVGSIINSMLVGTRFEDHNKQEFLDIKNTMEESIGMFSPFDLTMPVWIM

KSFFRKRYDFLVGAQESARVLVASEALKRYDDIKSGKYVIDENNLQDYTDAFLLKI

VKGEDEKNFNIQALKTMLYDLWLTGQETTTVTLLCGFNHLLAHPEVMDKAREELL

KITENRARSLSLSDRPTTPYLNAMIGEIQRHASILNVTFWKINKELTYMGGHPVDS

GAVVTAQLGALHVNDTIFKNPREFDPERFIRDDTLLQKVIPFGVGKRNCLGESLAR

SELYLILGNLLLRYKFEPHGKLSTEETMPYSIAKRAFNLEMKFVKI

 

>CYP35B1 K07C6.4 499 aa

MIFVLLLTTFLAVLIIRQYQKARKLPPGPVSLPLIGNIHQLVYHIWNEKGVVPAFDLF

RKKYGDVFTIWLGPIPHVSICNYETSQEVFVKNGNKYKDRFLPPVYLHVSNNLGL

FTANGPVCAEMRKFTLLAFRNMGVGRDLMEQRILDELNTRCAEIDADAVNGKTIV

HTTEFFDLTVGSVINSLLVGKRFEAEDKDDFLEIKRLMAEAADLFTMFDLVVPVWF

LKSFFPSRFERVVAIFEKALNYLSREALARYEKLKTGEYSVNAEDPQDFVEAYLAK

MEQERLRDSDTPYTMECLKFVIGDLWLAGQDTTSTTLVAGFNHLVKNPEIVRKCR

EEILRITENGSRPLQLKDRAESHYLNATIAEIQRHASILNVNFWRLVHEPTTVQGHP

VDSGSVICAQLGALHVNDDLFKNPDKFYPERFIENEQLLSQIIPFGIGKRSCVGENI

AKSELYLMIGNLILRYDIKPHGSEPSTEDKLPYSAGKTPDKSVKLEFVKL

 

>CYP35B2 K07C6.3 499 aa

MIVILLLTTLIAFLVARQYQKAKKLPPGPVSLPLIGNIPQLVYYIWREKGVVPAFDLF

RQIYGDVFTIWLGPIPHVSICDYDTSQEVFVKNGSRLIDRFLPPVFLHLSNNLGLVS

ANGEVWAEMRRFTLLAFRNIGVGRDSMEERILEELDARCEEIDADAVNGKTVVQTS

DFFDLAVGSIINNTLIGKRFNAHNKDEFMKLKKMMDSASDLFTIFDLTAPVWMLKI

LVPKRCERILGVQEEILNFVSREANERYEQLKSGEYTVNAENPHDFVEAYFAKMEE

EKRHGGPTPYTMKCLKHVIGDLWLAGQDTTTTTLVSGFNQLVNHPEVIRKCREEI

LRITENGSRPLQLKDRAESHYLNATITEIQRHASILNVNFWRLINEPTNVKGYEIDS

GTVMTAQLGALHVNNDIFKNADKFYPERFIENGKLLNQVIPFGIGKRSCVGENLA

RSELYLLTGNLLLRYNIRPHGSLPSTADQLPYSAGKMPDKNVHLEFVKL

 

>CYP35B3 K07C6.2 499 aa

MIILIFITTLLTFLIVRQNQLSKKLPPGPIAFPIIGNIPQVAYYVWKTEGMVPALDLFR

KTYGNVFTIWLGPIPHVSICDYETSQEVFVKNGNKFKDRLLPPIFIHVSENLGLLTA

NGDSWAEMRGFTLLAFRNMGVGKDLMEKRILDELNARCAEIDADAVNGKTIVKIS

DFFDLTVGSVINSTLVGKRFEAHNKDEFLHLKKIIHVSSDIFTTFDMTVPVWVLKTF

FPGRFARSVEIQEEVLKYVSREASKRYDKMKLGEYTPSSEDPQDFVEAFLAKIDQEE

QTGGPTHFTMRCLNQVIADLWLAGQDTTATTLVSGFNQLVNNPEVMRKCREEIMK

ITENGYRPLQLKDRAETHYLNATIAEIQRHASILNVNFWRLVHEPTIVNGYEIDAGA

VITSQLGALHVNNDIFKDPSKFYPERFIENGKLLNQIIPFGIGKRSCVGENIARSELY

LMIGNLLLRYDIKPHGSLPSLDDKLPYSVGKMPDKTVELEFVKL

 

>CYP35C1 C06B3.3    Z77652 508 aa revised 3/19/04

MFFILLFVSIISFLTARQFLKAKRLPPGPFSLPLIGNAHQVGYQLWRTGGVTN

MLNHFRKEYGDIFTLWLGPIPHVNITNYELSHEVFVKNSTKYADKHVSPMID

YVRKGNGVFFSNGDKWQELRRFSMLTMRNMGMGRDLMEEKILSELDARCAE

INEKSIDGTTVLQVNEFFDLTVGSIINNMLLGFRFDERTKSRFLTMKHMFDEG

MDKMTPLFFTLPVWALQKFLAKDFNSIIKDQFEIIDYVSVDAIKRSRDFMNG

DYEIDPNNVEDIVDAFLLKMKQNPKSDVYDENNLKMLITDLWITGQETTTTTLV

SAFIQFLNNPQVMDTVQKELIKVTNGGSRQLSLRDKTETPYLNATIAEVQRHASILN

INFWRINNEPTVIGGHPVDSGCLIASQLSALHTNEKIFENPEKFNPERFIRNENLMQ

QTIPFGIGKRSCLGESLARAELYL

IIGNLLLRYNFESSGKMPSTRETVPFGFAKRCEAFDMKVTKI

 

>CYP35D1 F14H3.10 498 aa revised 3/21/04

MILILLFLTAITVITVRLYRKVLRFPPGPFPLPLIGNAHQIAYQAWRRGGILPALDYY

RKkYGNAYTLWLGPKASVSITDFETSQEVFVKQGKKCYNRQLAPILEHVTGGVGL

LIANGENWAEMRRFTLLTFRQMGVGTNIMEKRIMDELNGRCLEIDAQIARNDRAIV

DVKFFDLTVGSVINSFLIGKRFEDEEEFLKIKKLFDESSETFNIFDLNVPVWFLKTFL

PSRFKLTWDSRHQIMDHVMKGVEERIRDIESGAYKIDPKKPNDVVDAFLSKMKKE

EEIAGGQHPYYNLKSLKLVLHDLWLAGQGTTATTLYVGFMKLVNHPGIIQNIQKE

LLDITENGARDLTLKDRPNTPYLNATIAEIQRHASILNVNFWRINHETIHFNDYQVD

PGTMIAAQVGVLHVNEELFDNPKDFNVEKYLKNPKLLQQVIPFGIGKRSCVGEQIA

KSELYLVFGNILLRYNVKKHGSTPTNEDVYPYSSAKLPDVSGKLEFVKL*

 

>CYP35D2P F14H3.A   PSEUDOGENE 128 aa not annotated in genbank

MILIALFFTIFTVITVRLYRKVLRFPPGPFPLPFIGNAHQIVYQAWSRGGTLP

ALDYYR*NGNAYTLWLGPKAPVSLTDFETSQEVFVKQGKKCYNRQLAPILEH

VTGGVGLLTANGENWAEMRRFTLR*

 

>CYP36A1 C34B7.3     Z83220 493 aa

MLFAQLVILVIIVMLFLCRFANKLRGLPPGPTPWPFIGNTFQVPEDRIDLIIN

EFKKKYGGIFTLWLPFPTIVICDYDMLKRNIVKNGESFSGRPDTFIMDMLVQG

NYGLFFMENNWWKAQRRFTTHIFRSLGVGQAGTQDTIASLASGLVEKIDGQK

DKPIELRPLLVHVVGNVIHKHLFGFTREWNETEILDFHVAINDVLEHFTSPK

TQLLDAWPWLAYLDKPLSLGIPRTTRANDAIIQNLEQALAKHKTGINYDEEP

SSYMDAFLKEMKIRAAENALEDGFTEKQLIVAIYDLYSAGMETIIIVLRFAFL

YLVNNPETQKRIHEELDRNVGRERQVVMDDQKHLPYTCAFLQEVYRLGYVLP

VNFLRCTLTDVEDCEGYRLNAGTRVIAQFQSVHVDKKHFPDPEHFNPDRFLNS

RGEYIRDDRVNPFSMGKRSCLGENLARMEVFLYFCTLMQNFEWHTDGPYAPP

IDVITSSLRAPKPFTVRASRR*

 

>CYP37A1 F01D5    Z81493 524 aa also Y39G8 Z92851 intron?

MGIAVYLLALVVIYVVFNLSKILKFVKERMRLYHLMSKIDGPLALPLLGT

TFQFKMDPVEFALQLYNWGLEYSTKGSSLAAFWMGPYPMVIVLTPEANKV

QTRNMSGAQNFHIFQKVLESNALINKSSEYDIFLPWLGTGLLLASGEKWR

GRRKMMTPSFHFNVLIDFQVVFNSQSMILLEQIENAAKKTDDSTIDAFPY

IKRCALDIICETAMGTTVSAQTNHTHPYVVAVNEMNSLAFKYQRMPWLWI

KPIRQLIGYEADFQRNLDIVTSFTKKVIDRKLREHDETDGMVVVEEESKK

KAFLDMLIEKKEEGGLGYEDIREEVDTFMFEGHDTTSAGIGWSLWCLANC

PEYQKKCHEELDEIFEGTSRECSVEDLKKMKYLEKCVKEALRMRPSVPQM

ARSVEEEVEIDGKILPKGCSVMISPAFIQNNPRTFPNHEVFDPERFNEDE

ISKRHAYAYIPFSAGPRNCIGQKFAMQEEKTVISWVLRRFHIHTDIGLLE

NMPLPETITRPSLGFPLKFTVRQQ*

 

 

>CYP37B1 F28G4 and T13F3 515 aa revised 3/21/04

MHFLNIATIFLITIFLFYYKLIYNFIRDRLRIYKYMRKFDGPYSFPIIGTLYMV

NIFDISKFTTQSMELAQYYCQKGCGTIGLWLGPVPLIAVINPQHAKEILESNE

VITKAEEYDILFPWLGTGLLTSTGSKWRQRRKMLT PAFHFKVLNDFLSVHDY

QAKVFLEQIKPYADSGKEVDLFPYIKRLALDVICDTSMGVTIDAQNNHDHQYVE (2)

SVRLLSEYAFEWILRPWLRLKPLWY LTGPGHEYDRHLKIVTDFTKTVIKEKWEEF

QKFHVDPVVKTDKRSMAFLDLLLELRNEGLMNEDDIREEVDTFMFE

GHDTTSASMGWTLWCLAHNPEFQEKVIQEVDGIFGTSDRDCTNDDLKQM

KYLEKCLKESLRMYPSVPFFGRTVEQDVVInGDFFPKGVRIIVMPLLLQRNPLIFD

NPNQYNPENFSEDKIGSRHAYSDIPFSAGPRNCIGQKFAMMEEKAVISWFFRKYR

VTASQPFGMNKILPELILKSSLGFPLTVHHRTDNK*

 

>CYP42A1 CM08B12  M89401, AL020988 Y80D3 revised 3/20/04 504 aa

MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQ

SQGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSA

WIGDGLLISKPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGT

GEYSDVFHTITLCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHY

FSDTIFNLIGPGKEHDECVKILHEFTSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLML

DMNSKGELPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDE

VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTA

VVMVPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCI (1)

GRFAMME

EKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSIY*

 

>CYP43A1 E03E2.1 506 aa revised 3/21/04

MILLILLPVAIFTYVFLR ()

NFKLYYTLHKCGLNPRFDIFGIKGLLWLDSSAAHENFTR

MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI

PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGL

STVKLKLLFPTMDTCVSEFMDHVNSLSDGQSVVINHSHSLFQNHTSYVLARKAFG

NFAELQKSTAEKIAYIFPETKLIFKNSFVGHFLKSATQQKFLDYLLHLISNFQSRKN

VDNNNGICCTENDHYSLLGFFFEHHNEKKLIEKAEGQIDMKKVKVEKSISYEEITA

QCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEAEIKNQPEQISFETVSSLRLLQ

NCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIVVNPWGPHHDREIWGNDV

DCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMRFALLEMKTTAFRMLQKYSVF

TNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLVLKQR

 

>CYP44A1 CELZK177  cosmid ZK177 U21321 Probable mitochondrial P450 489 aa

MRRSIRNLAENVEKCPYSPTSSPNTPPRTFSEIPGPREIPVIGNIGYFKYAVKS

DAKTIENYNQHLEEMYKKYGKIVKENLGFGRKYVVHIFDPADVQTVLAADGKTPFIVPL

QETTQKYREMKGMNPGLGNLNGPEWYRLRSSVQHAMMRPQSVQTYLPFSQIVSN

DLVCHVADQQKRFGLVDMQKVAGRWSLESAGQILFEKSLGSLGNRSEWADGLIEL

NKKIFQLSAKMRLGLPIFRLFSTPSWRKMVDLEDQFYSEVDRLMDDALDKLKVNDSDS

KDMRFASYLINRKELNRRDVKVILLSMFSDGLSTTAPMLIYNLYNLATHPEALKEIQKE

IKEDPASSKLTFLRACIKETFRMFPIGTEVSRVTQKNLILSGYEVPAGTAVDINTNVL

MRHEVLFSDSPREFKPQRWLEKSKEVHPFAYLPFGFGPRMCAGRRFAEQDLLTSL

AKLCGNYDIRHRGDPITQIYETLLLPRGDCTFEFKKL

 

>CYP13A.a cb25.fpc0022 5 exons Minus Strand

1361701 MLISAILLGVFTYSVWIWSYFIRKGIKGPRGFPGIGMLLQTLDHENPPFLKYRQWTK 1361531

1361527 QYGPVWGFTEGPQQTMVISEPEMVNEIFNKQFDNFYGRKLRPIIGDPEKDKRV 1361327

1361326 NMFAAQGKRWKRLRTMSSPSFSNNSLRKVKSTVQECGSEILWNIERKVATGEEIDMLL (2) 1361153

1361103 FYQEYTLGVISRIALGQSESKMFQNPLLLKVKS (0)

1360959 IFNGSWHLFLLTGLFPPLASVIRKMSKSLPADFVPAFKIFDLIETAVEARMAQRESDIKNGIEPGE 1360762

1360761 PQDFIDLFLDARTDADFLGEKNEDFSKTAVMKVNKELTFDEIVGQCFVFLAAGFDTTAL 1360585

1360584 SLSYATFLLATHPDVQKRLQEEVDNECPDPEISFDQLSKLKYMECVMKEALRLYPLGVT (2) 1360402

1360356 ANTRKCMRETVIDGVIFEEGMNIQVDTWTLHHNKSIWGDDVEDFKPER (2) 1360213

1360137 WENSACEHLEHNGSYIPFGSGPRQCIGMRLAQIEQKIILAQILKEYSFRTCKNTQIPLK 1359961

1359960 LVGKLTLSPENVIVKLEPRKDHL* 1359889

 

>CYP13A.b cb25.fpc0022 6 exons Minus Strand

1363792 MFFFLIFVAISTYYIYHWTFWKRRGVPGPWGIPILGKSGVMLDDRSAPPLLIQEWTKKYGK 1363610

1363609 VYGFTEGMQKVMVISDPDMVQEVFVKQYDNFFGRKVNPIQGDPDKDKDVPLVGAQGFRWK 1363430

1363429 RLRTISAPAFTNSSIKKVMPRIENSVQEFLKHLEQNGENGKPVNMHL (2) 1363295

1363274 YFQEYTMDVIMRIAMGQPESRMFKNPMLYDCKG (0) 1363140

1363093 FFENSRWPVWMFSGAFPFAVKGLRYLFLKLGKFGAGPFIRVQKQVTDAVNDRIAQREADRQQEMEPGEPQDFID 1362872

1362871 FFLDAQADVDFFGETNDEYQKSTIY (0) 1362797

1362747 NNRKLTMQEIITNCFLFVVAGFDTTAISLSYVAYFLSLNPRVQRKLQEEVDRECRDSEITF 1362565

1362564 EQLSKLKYMENTIKEALRVFPMAAF (2) 1362490

1362447 ANSRRCMRTTKIGDTVVEEGVDILVDTWTLHHDKKVWGEDADEFKPER (2) 1362304

1362250 WESPLNPHQSFMAFGAGPRVCLGVRLAYLEQKVLLAHVLRKYSFVANTQTQV 1362095

1362094 PMKLVGRATSRPQSLILSVEPRLPDPRV* 1362008

 

>CYP13A.c cb25.fpc0022 81% to 13A12 5 exons Plus Strand

1365401 MACIFSFLIFSYYLWTWTYWRRRGISGPPGYPLIGCALEMLDSENPPYLQLKEWTKEYG 1365577

1365578 KVYGITEGMMKILVISDPALVEEVFVKQYDNFYGRKLYPIQGDPNKDKLVNLFASQGHRW 1365757

1365758 KRLRTISSPTFSNNSLR (2) 1365808

1365856 KLKTTVEDCALELLRHIETQTAGGKPIDLL (2) 1365945

1365993 TFYQEFTLDVIGRIAMGQTDSQMFQNPM 1366076

1366077 LKYVKAVFGEPRKFIFLSGAIVPWTGPLLRKFFMSLPNIFKNPAIHIIRQVAQAVE 1366244

1366245 NRIAQRATDEKAGIEPGEPQDFIDLFLDAK 1366334

1366335 SDDVEIENNEDFTKAGIKVTRQLTKDEIVGQCFVFLIAGFDTTALSLSYSSFLLATHP 1366508

1366509 EVQKKLQEEIDRECPDPEITFDQLSKLKYLECVVKETLRKYPLGAM (2) 1366646

1366692 ANARRCMRSTKIGNYEIEEGIDILCDTWTLHADKNIWGEDAKEFKPER (2) 1366835

1366880 WESGDEQFFQKGGYIPFGLGPRQCIGMRLAYMEEKLLLCHILRKYTLETCSKTQIPLK 1367053

1367054 LIGSRTTSPETVWLNLKPREDI* 1367122

 

>CYP13A.d cb25.fpc0022 8 exons Plus Strand

        MVFFLFHWFIIVFVAVF (2)

1370232 SYYLWQWTYWVRRGVPGPLGYPVIGSFLNSLDHSFPGPLQVKEWTK (0) 1370369

1370425 KYGSTFGFTEGLMKTLVISDPELVDEVFVKQYDNFYGRKRNPISGDSEKEKRTNIFSAQG 1370604

1370605 FRWKRLRTISSPTFSNNNLRKLNLTVEECALEIIKHTEEKTAGGAQIDMLR (2) 1370757

1370800 FYQEFTLDVIGRISMGQSESLMFNNPIFPIVRK (0) 1370898

1370942 LFQGSGYYFRLMLIGGVLPPFLVEIVRQIVRRLKMGTFGKINGITLEAVQKRIKQREEDE 1371121

1371122 KAGVEPGEPEDFIDLFLDAKAGDVEHFEEDNGDFKKSTSY (0) 1371238

1371295 TNRQLTTDEIVGQCTVFLIAGFDTTALSLSYSTYLLATHPEVQKKLQEEVDRECPGSGITFDQL 1371486

1371487 SKLKYLECVMKETLRLYPLGTF (2) 1371552

1371597 ANSRKCMRTTKLGDIEVEAGTMVQVDTWTLQTNSKIWGEDAGEFKPER (2) 1371737

1371777 WENGDEQFIQKGAYIPFGLGPRQCIGMRLAYMEEKMLLAHILRRYTFQTGTKTEIPLM 1371956

1371957 LIGTTTTQPASVWMHLTPRL* 1372019

 

>CYP13A.e cb25.fpc0022 6 exons Minus Strand

1378847 MFLLFSKLFTLAVLGIIGVVTYYLWIWTYWMRRGVKGPRGKPFVGVLNVLLDHEKPGLLKLGEWTE (0)1378650

1378011 EYGKVYGYTDGNQRTLVVSDPSMVHEIFVKQFDNFYGRKLNPIQGNPDQDERVHLLSAQG 1377832

1377831 FRWKRLRTISSVSFSNGSLRKIMSTVEDSALDFMKHMEEKSANGEIDMLD (2) 1377682

1377636 AYQEYTMDVIARVAIGQRDNKMFSDQNELLKIVRQ (0) 1377532

1377484 IFGGSRKYLMLICQVFPVVGQAIRSLTFKYPKIAPFK LYLMIKKAVMERKEQRDRE 1377317

1377316 LENGVEPGEPQDFIDMFLDAKSDDVDFSKEAREDFSKRNMKITKELSADEA (2) 1377164

1377164 VGQCFVFVIGGFDTTALALSYVTYLLTVNPEVQKKVREEVDAEFGDSEIEFEKLGRLKYM 1376985

1376984 DCVIKEALRLYPLASI (2) 1376940

1376794 SNSRKCMHTTTVNGVEIEAGVYVQMDTWTLQRDPAIWGDDAMDFKPERWESAKQSEYKG 1376618

1376617 AYIPFGLGPRQCIGMRLAVMEQKVLLTHILKKYSFETGPSTSIPLKLVGSATTSPENVFV 1376438

1376437 QLKPRNQ* 1376414

 

>CYP13A.f cb25.fpc0022 6 exons Minus Strand

1382628 MSFSLLLISTIFIGFLSYYVWIWSYWIRKGVKGPRGIPFSGIMHRFQDYDYPFWRVMMDWTK (0) 1382443

1382309 EYGPIYGITEGVEKTLVVSDPEFVHEVFVTQFDNFYGRKLNPLQGDPNKNKRAHLVSTQ 1382133

1382132 GHRWKRLRTLSSPTFSNKNLKKIMKTVDDSVAELMRHLEKGCAGGKTVDMLE (2) 1381980

1381930 YYQEFTIDIIGKIAMGQTQSLMFENPMLPKVKS (0) 1381832

        IFVDGRKHAFFASAMFPFLGAMLRLNV

1381698 FPSIQPASDIMNIIEKAVKARIDQRKTDEKLGIEVSGEPQDFIDLFLDARADVDFFEQEAKDDFGKTQ 1381495

1381494 ITKVDKHLTFDETVGQLFVFLLAGYDTTALSLSYSSFLLATHPEVQRKCQEEVDRECSDP 1381315

1381314 EVTFDHLSKLKYMDWVIKEALRFYPLASI (2) 1381228

1380725 VHNRKCMKATTVCGVEIEEGTNVQVDTWTMHKDPNIWGDDVNEFKPER (2) 1380582

1380529 WESGDEQFIQKGGYLPFGMGPRICIGMRLALMEEKMLLTQILKKYTFETTSETVI 1380365

1380364 PLKLVGRSTTAPSSVNLKLRPRHSD* 1380287

 

>CYP13A.g cb25.fpc0022 8 exons Plus Strand

        MSIILLVITALLFGVL (2)

1484515 SYYLWIWIYWRRRGVPGPLGYPVVGSFLSTIEHTNPQYLQIREWTKQYGSVYGYTEGTM 1484691

1484692 KTLIVSEMDLVHQLFVTQYDNFYGRKLNPIQGDPEKDDRANLFSAQGYRWKRLRAIASPT 1484871

1484872 FSNNSLR (2) 1484892

1484945 KINLTVEDCAIELLRHIEEQTTGGEPINMLQ (2) 1485037

1485081 FYQEFTMDVIGRIAMGQIDSQMFRNPLLNFVKA (0) 1485185

1485224 IFADNRKHIPLIGGVFPVLAQIFRYFMLKFPFLGAANFIHVHKTM

1485359 VSAVHNRIAQREEDKNNGIEPGEPQDFIDLFLDARAEDVEHFGESNADFSKSSSY (0) 1485523

1485573 NNRQLTTMEVVGQCTVFLIAGFDTTALSLSYSTFLLATHPEIQKKLQEEVDIECPDSVITFDQL 1485764

1485765 SKLKYMDCVIKETLRLYPLGTL (2) 1485824

1485875 ANSRRCMRATKLGNVEVEVGTMVQVDTWSLHTDTKVWGDDAKEFKPER (2) 1486018

1486064 WLLPETDKISQKGGYIPFGLGPRQCIGMRLANMEEKILLSHILRKYTFKTGKKTDIP 1486234

1486235 LKLHGRATTQPESVWMHLVSRN* 1486303

 

>CYP13A.h cb25.fpc0023 5 exons  Plus Strand

       MITISLAFLTLFVGVFS (2)

864459 YYLWTWTYWKRRGIPGPMGYPIFGSSLELFDSENPPPAQLEKWTKKYGKVYGMTQGLSK 864635

864636 TLVISNADLVQEVLVNQYDNFYGRNLNPIQGNPNQEKRVHLFAAQGHRWKRLRTISSPTFSNNSLR (2) 864833

864944 KLKTTVEDSCVELLRHMEEQTAGGKPIELLN (2) 865036

865751 FYQEFTLDVIGRIALGQTDSQIFKNPMLPYVRAVFGEPRKALFMSGFLVPWIAPILRKIM 865930

865931 FSFPSILKNPAVHIIRQTADAVEKRIAQRVADERVGIEPGEPQDFIDLFLDAKSDDVDIQ 866110

866111 RNEEFSKTEKVTKKLTTDEVVGQCFVFLVGGFDTTALSLSFSSYLLATHPDVQKKLQEE 866287

866288 VDRECKDPEITFDKLSKLKYMECVIKETLRLYPLAAL (2) 866398

866442 ANARRCMRPTTIGGVEIEKGVDILCDTWTIHKDPKIWGVDAEEFKPERWESGDDHILSKG 866621

866622 SFISFGLGPRQCIGMRLAYMEEKLLLCHILRKYTLKTCKKTQIPMKLVGVRTTQPESVWM 866801

866802 NLEPREDN* 866828

 

>CYP35B.a cb25.fpc0023 5 exons plus strand

1165597 MIFILLLTTFLAILIVRQYQKAKKFPPGPTSLPFIGNIHQLIYYCWKHKGVVPAFDYFRQ (0) 1165776

1165823 YGDVFTLWLGPIPHVSICDYETSQEVFVKNGNKYKDRFLPPIFEHVS (1) 1165963

1166015 ENLGLVSANGESWAEMRRFALGAFRTMGVGRDLMEERILAELDAR (2) 1166149

1166196 CAKIDADAVNGKTIVHTTEFFDLTVGSVINSVLIGKRFDSHNKDEFLKLKGLIDSAADL 1166372

1166373 FTIFDLTVPIWLLKKLCPERYAKIVESQVEIINFVSREANERYEKVKSGEFTINSEDPQD 1166552

1166553 FVEAYLAKMEQEKEKGTTNLYTMECLKHVIGDLWLAGQDTTATTLVSGFNQFVNHPEVVK 1166732

1166733 KCREELMKLTDNGSRPLSLKDRAESHYLNATIAEIQRHASILNVNFWRYNHEATTIKGYP 1166912

1166913 VDSGSVITAQLGALHVNNDIFKNAEKFYPERFIEDPKLLNQVIPFGIGKRSCVGENIARSEIYL (0) 1167104

1167154 MIGNLLLRYDIKPFGDVPSTEDKLPYSAGKLPDKTVKLEFCKL* 1167285

 

>CYP33C.a cb25.fpc0023 minus strand 4 exons

688955 MIFILVLLSTLLLIYLFHLLYWKRRNLPPGPTPLPKFGNWLSMVFPLSGYECFRRWTKKYGDVYTFWL (1) 688752

688326 ESEPYIIIGSYERMKETFIRDGDTYVNKKFNEVIASFRDGQYGVVDTNGEFWSTHRRFTLSNF 688138

688137 RDLGFGKDLMQQKILIEVQEIIRKFDENVGEEQNVPKVFNKAVANVINQLIFGNRFDDEK 687958

687957 EKEFQKLMDLLEYQQKAFTSWKIIVECVVPFLSRFMPKPTVADL (2) 687826

687783 IEEYKTSFYGFFNRQIEEHRLKIDFDSEENESYAESYLKEQRKREADGDKETFSTQQLSNMC 687598

687597 MDLWLAGLGTTTLTLGWSMAYVMNTPGVQEKMHEELDRVIGSDRLISTSDKNDLPYMQAV 687418

687417 INECQRCANIVPINLFHETTRDTVIAGYPIEKGTGVIAQISTVMLDETIFPDPYKFNPDR 687238

687237 FIDEHGKLKKVEELCPFSVGKRQCLGEGLARMELFLMLANFFHRYE (0) 687100

687052 ISPSSTGPPSIDKTQALRVTPREFDAVLTKRH* 686954

 

>CYP33C.d solo exon 2 aa diffs to CYP33C.a cb25.fpc0023 about 7380bp upstream of 33C.a

697541 MIFILVLLSTLLLIYLFHLLYWKRRNLPPGPTPLPKFGNWLSMVFPMPGYECFR 697380

697379 RWTKKYGDVYTFWLG (1) 697335

 

>CYP33C.b cb25.fpc0023 4 exons

723577 MILLLLFTTLSLWLFHELYWKRRNYPNGPLPLPVIGNMVPIMKAKPGYEAFRQWTK 723744

723745 EFGDVFTFWL (1) 723774

724349 GTKPYIVVSSYKRLKETFILDGDTYADKAYQP 724444

724445 FQEQFRGGQYGVIDTNGHVWSTHRRFALTTFRDFGLGKDLMQQKILIEVDEIFRKFDENI 724624

724625 GKEQEIPGVFYNAGANVINQLIFGYRFDEEKQEELKKLKALMEFQETAFTTFKVQVQFFA 724804

724805 PIVGRMLPGKSVEEL (2) 724849

724904 LAEFTVDFYKFFDHQIEEHRSKIDFDSEENLDYAEA 725009

725010 YLKEQKKKESEGDMELFGNRQLSNMCFDLWVAGLSTTHTTLSWIIAYVLNHSEVQKTMQI 725189

725190 ELDEVIGSDRLISTGDKNNLPFMNAVINESQRCANIVPLNQIHCVSKDTMINGVLVKKGT 725369

725370 GIIPQISTVLLDETTFPDPYKFNPERFIDEHGKLKKVEELCAFSVGKRQCLGEGLARMEM 725549

725550 FLFISNFFNRYQ (0) 725585

725633 VTPGSSGPPSLEKETMFNATPRKIRAVLTKRYL* 725734

 

>CYP33C.c cb25.fpc0023 62% to 33C7 with one stop codon and one frameshift (possible pseudogene)

723082 MIFLLLFTTFGSWLFHELYWKRRTLPPGPTPLPLFGNILALSAEKPGYEAFRK 722924

722923 WTKVYGDVFTFWM (1) 722885

722211 GTKPYIIISSYSRLKETFIRDGETYMNKVQLHFQEFIRDGYYGV (Frameshift) 722080

722078 IAETNGEFWSTHRRFALSTLRNFGMGRDLIQEKILIEVEDMFKKLDEDIEKEQEINPVFN 721899

721898 NAVANINNQLIFGYRFEKEKLKELEK*TELMEYQEKFFTTFRTNFQIFVPQLAWLIPGKT 721719

721718 LEEGLVDWKTDFFDFFHKQIKAHREKIDLDSDESQDYAEAYLKEQKRRIKDGDKETFSDK 721539

721538 QLSNMCLDLWFAGLSTTSITLTWTITYILHNPDVQKKLHEELDRVVGNNRLISTSDKPEL 721359

721358 QYMNAVINESHQCVNLVPVNLFHCTSRDTVLNGYQVPKDTGVIAQISTVMLDEEVFPDPY 721179

721178 KFNPDRFIDENGKLKKVEVLCPFSVGKRQCLGEGLASMELFLLVANFVNRYK (0) 721023

720979 IKPYVHGLPSLDKSKNVTVVPRKVVAKLIRRHVQIDGLFVKNYFE* 720842

 

>CYP14A.a cb25.fpc0023 6 exons

953860 MSVFIVAFFSFIAAYIVHFYWKVSKYPKGPLPLPFVGNLLQ (0) 953982

954025 FPATHIQTFYDDLAKTYGPCFTLWTPTPSVVITDYDHLKEAFVTQ (1) 954159

954220 GDAFITRGLRPPETLLHPDVDTGIVFSNADTWRVQRRTSLKILRDFGLGRNLMEEQ 954387

954388 VTRSVHEMLQQLDNIKDKTDVDMYWPIQLCVGNVINESLFGYHYSYKDTEKFQKFVMIVD 954567

954568 RHLKNLSGRPQLLIAAWPWLRHVPIIGEMGYHSIKRNIKT (2)

954733 (0) YHKFIEDEVASQIKEYDEHSEPENFVHAYMKQINQAGNPELN (2) 954858

954923 MTQLCASVLDFWIAGMETTSNSLRWHLTYMMKYPEIQDKVRKEIFDVVGTNQLPSMSDKP 955102

955103 NMPYTQAVIHEVQRHSNMIPMLGTHF (1) 955180

955225 NTKEINIKGYKVPTGTNCIGQLWSIQRNDPVFIESHKFNPSRYFQADGRTMDKAVLEKTIPF 955410

955411 SIGKRNCLGEGLARMELFLIFSALIQKYEFVPKSSIDLSPVWGGVLTSKPYRCQLVPQTA* 955593

 

>CYP14A.h pseudogene solo exon cb25.fpc0023

958034 AVLEKIIPFTIGKRNCFGEGLARMELFLIFSALIQKYEFCA 958156

 

>CYP14A.b cb25.fpc0106 7 exons

251869 MSVFILAVLVFLVLYVVHFYWKVSKYPKGPFPLPLIGNIHQ (0) 251747

251700 FPPDHVQLYFDEMSKIYGPCFTVWIPYPAIVLNDYEHVKEAFVTQ (1) 251560

251510 GDTFTYRAHRSPETLLPVHDHTGILASDGDHWRLQRRTSLKILRDFGLGRNLMEEQVVRS 251331

251330 VQEMMVQIENITDKNKVDMFWPIQLCVGNVINETLFGFHYKYEDSEKFKTFVRIVDKHLR 251151

251150 HLQGTMPLLVSAFPWLRHLPIIGDIGYHNIKNNISS (0) 251043

250767 YQTFIEEEVTTQVKQYDGVSEPENFVHAYMQQMKQTGNPGLE (2) 250642

250594 IKNLCASVLDFWLAGMETTSNSLRWHLAYMMKYPEIQDKVRKEIFEVVGTSRLPSMSDKP 250415

250414 NMPYTQAVIHEVQRCSNMLPFLGSHQ (1) 250337

250282 CVEETSIKGKHVPAGTLVFAQIWSVMKNDPVFKDASEFNPDRYLQSDGKTFDK (0) 250124

250078 AVLEKTIPFSIGKRNCVGEGLARMELFLIFSALIQKYEFVPTSNIDLSPDWGVV 249917

249916 MTSKPYTCQLLPQY* 249872

 

>CYP14A.c pseudogene cb25.fpc0106 exon 2 and part of exon 3

270521 FPGQNIHIYFDDLSKTYGPCFTLWTPLPAVVITDYDYLKGAFVNQ (1) 270381

270314 GEAFIMRAERPPETLLQAHLNTGILQSSGENWRL*RRTSLKILRDFGLGRNLMEEQVMRS 270135

270134 VHEMLYQIGNITDKKKV 270084

 

>CYP14A.d cb25.fpc0106 7 exons

272501 MSVLIVAFLVFVITYIAHFYWKVSKYPKGPFPLPLIGNVLQ (0) 272379

272332 FPDKNIHIYFDELAKTYGPCFTLWTPLPAVVITDYDYLKDAFVTQ (1) 272198

272131 GEAFIMRAERPPETLLQPHINTGVLASSGENWRLQRRTSLKILRDFGLGRNLMEEQVMRS 271952

271951 VHEMLHQIENITDKKKVDMFWPIQLCVGNVINESLFGYHYKYEDADRFKTFVQVVDRHLR 271772

271771 AITGKIPLLMGAFPWLRHVPIIGEHGYHKIARNIKQ (?) 271664

271628 (0) YQGFIEEEVASQIKEYDGESEPDNFVHAYMQQMKQTGNPGLEF (2)

271445 MPNLCASVLDFWLAGMETTSNSLRWHLAYMMKYPEIQDNVRKEIFEVVGASRLPSMSDKPNMP 271257

271256 YTQSVIHEVQRHSNMIPMLGSHT (1) 271191

271143 NTEDIIVKGQNVPTGTVIFAQVWSILKNDSVFDESHKFNPSRYLQADGKTLNK (0) 270985

270937 AVLERTIPFSIGKRNCVGEGLARMELFLIFSALIQKYEFVPN 270812

270811 DNVDLTPVWGGVLTSKPYTCQLIPQAV* 270728

 

>CYP14A.e pseudogene cb25.fpc0106

253655 KVDMFWPIQLCVGNQ

       INESLFGYHHKY

       CDADRFKTFVQDLDRHLRLIQARLPLLMGAFPWL 253479

253478 RQVPIIGELGYHKIERNVKQ 253419

253372 YHKFIEEEVTSQMKEYDGESEPDNFVHA*QHIKQTGNPGXXXXX 253256

253200 NLCASVLDFWLAGMETTSNSLRWHLAYMMKYPEIQDNXXXXXXX

       VVGTSRLP*MSDKPNMPYTQSVIHEVQRQSDMIPLLGTHK

       XXXXXXXXXXXXXXXXXXXXX

252852 QIWSIMNDDSVFEENHKFNPSRYLQADGKMLNK 252754

252712 ALLERTIPFSIGKRNCVGEGLARMELFLIFSAVIQTYAFIPNSNIDLSPDWGAVLTSKP 252530

252529 YTYQLIPQTLYYFVK* 252484

 

>CYP14A.f pseudogene cb25.fpc0106

275213 MSVLIIAFLVFIIAYIAHFYWKVSKYPKEPFPLPFIGNFLQ (0) 275091

       XXXXXXXXXXXXXX

275020 KMYGLCFTLWAPLPAIVITHYDYLKDAFVTQ (1) 274928

274860 GDAFITRINRPPETLPQAHNNTGVLVSSRENWRLQRRTSLKILRDFGLSRNLMEEQVMRS 274681

274680 VHEMLQQIENINDKKKVDMFWPIRLCVGNVINESLFGYHYKYDDADRFKNFVQIADRHLR 274501

274500 AALGKLQLLMTAFPWLRHVPIIRELGYHKIGTXXXX (0?)

       YQGFIEKEVANQIKEYDGESEPENFVHAYXX

       QMKQTGNPGLE (2?) 274228

       XMPNLCS

274172 SVLDFWLAGMETTSNSLLWHIAYMM*YPEIQDKVREKIFEVVGTSRLPSMSDKPNMPYTQ 273993

273992 AVIHEVQRHSNMIPVMLFHT (1?) 273933

273882 XXEDVVVKGQNVPTGTLVFGQIWSIMKNDSMFDESQKFNPSRYLQADGKTLNN (0) 273730

273682 AVIEKKIPFGIGKKACAGEGLARMELFLIFSAFIQKYEFIANSNIDL 273542

273541 SPDWGGVLTSKSYTCQLIPQTV* 273473

 

>CYP14A.g cb25.fpc0106 77% to 14A4

278391 MSLLIISFVLITVVYIINFYWEVRKYPKGPFPLPLIGNIHQ (0)

278209 IPDDKLEVWFDDLSKTYGPIFTVWSPLPCVVITDYAHIKDAFVTQ (1) 278075

277628 GETYTYRAHRPPESLLQPHDNTGILNSSGENWRLQRRTSLKILRDFGMGRNLMEEQIVKS 277449

277448 IQEMMEQLAKSMDKKKAEIFWPIQLCVGNVINEFLFGFHYKYDECERFKKFVNVVDYHLR 277269

277268 ILLGKCSLSVSAFPWLRHVPIIGELGYHRIKRNIKT (0)

277115 YQSFIEEEVEKQLEKFDTESEPENFVHAYLQQMKQTGHPGLD (2) 276990

276932 MKNLCACVLDFWLAGMETTSNALRWHIAYMMKHPVIQENVRKEILDVVGTSRLPS 276768

276767 MSDKPKMPYTQAVIHEVQRHSNMVPFLGTHQ (1) 276675

276624 SVQETEILGKKIPAGTNVIAQTWSVMKNDPIFVNHLEFNPSRYLLADGKTFDK (0) 276466

276420 VVLERTIPFSVGKRSCVGEGLARMELFLIFTALIQKYEFIANGPVDMSYNAGAV 276259

276258 LTIKPYTCEMRKVF* 276214

 

>CYP33C.c cb25.fpc0023 possible pseudogene

723082 MIFLLLFTTFGSWLFHELYWKRRTLPPGPTPLPLFGNILALSAEKPGYEAFRK 722924

722923 WTKVYGDVFTFWM (1) 722885

722211 GTKPYIIISSYSRLKETFIRDGETYMNKVQLHFQEFIRDGYYGV (Frameshift) 722080

722078 IAETNGEFWSTHRRFALSTLRNFGMGRDLIQEKILIEVEDMFKKLDEDIEKEQEINPVFN 721899

721898 NAVANINNQLIFGYRFEKEKLKELEK*TELMEYQEKFFTTFRTNFQIFVPQLAWLIPGKT 721719

721718 LEEGLVDWKTDFFDFFHKQIKAHREKIDLDSDESQDYAEAYLKEQKRRIKDGDKETFSDK 721539

721538 QLSNMCLDLWFAGLSTTSITLTWTITYILHNPDVQKKLHEELDRVVGNNRLISTSDKPEL 721359

721358 QYMNAVINESHQCVNLVPVNLFHCTSRDTVLNGYQVPKDTGVIAQISTVMLDEEVFPDPY 721179

721178 KFNPDRFIDENGKLKKVEVLCPFSVGKRQCLGEGLASMELFLLVANFVNRYK (0) 721023

720979 IKPYVHGLPSLDKSKNVTVVPRKVVAKLIRRHVQIDGLFVKNYFE* 720842

 

>CYP34A.a cb25.fpc0023

1168376 MLFLLIFVTTLAIWTVYLWRATQRLPK (1) 1168456

1168506 GPFPLPLIGNFHQLAYTCWKAGGTVAGFHEFKKQYGKVFTIWLGPQPTVHITDIEIAQET 1168685

1168686 HVKRANVFGHRYSNGGTDYIREGRGIVASNGEFWQEHRRFALKTLRDFGLGRNLMEEKIMEEYRYR (2) 1168883

1168934 FQDFKKTNFKNGGIEVHSDSCFNLLVGSIINTLLVSERFEQNDADFEKLLETGTVALE 1169107

1169108 KMSVLDSFVPLWLMKSRAWQWRTKKVFGPFEFVHGLVERKIQQR (2)

1169417 RVKEIESGEHTMSETGEDFVDAFLIKMEKDKKDGVKDSTFT (2) 1169539

1169589 LENLAIDLYDLWLAGQETTSTTMVWACACLLNHPEVVGELKRELVGVTGGTRALSLTDKP 1169768

1169769 NTPFLNATIN (0) 1169798

1169844 EVQRIASILNSNLFRQLEEDTVIDGQPVSAGALVT 1169948

1169949 VQLSALHTDEAVFKNHTKFDPKSLMKNRNLEKNLIPFGIGKRSCPGESLARAELYL (0) 1170116

1170246 ITGNLILDFDLEPVGPAPEIKTTAPFGLMKRPPSYDIRFVPIEK* 1170380

 

>CYP35A.b cb25.fpc0023

1162983 MFFILVISFVLTWLVVRQYQKVSRLPPGPVSLPLIGNLPQIVYYLWSTGSVVSTLDLFRK (0) 1163162

1163209 RYGNIFTLWVGPIPHVSIADYETAHEVFVKNGNKYADKFHAPLFQEIK (1) 1163352

1163396 NDRGVVVTNGDHWQELRRFALQTLRNMGVGKDVMELKIVEELNAR (2) 1163521

1163607 CADIDEAAVNGVTTTQASEFFDLTVGSIINNILFGKRFDKNTKGEFLKIKNALDSAFEL 1163786

1163787 FTPFDMTVPIWVLKTFFSKRYNNLITMTDETTSFVAKEAETR (2)

1163953 YAQTKSGQYFIDENNIQDYTDAFLLKIQKDGENKDFN (2)

1164109 IETLKTMIIDLWITGQETTTTTLISGFNQLLLHPEVMEKARAELMNITENESRDLSLSDRPKTPYLN 1164309

1164310 AMIAEIQRHASILNINFWRINHEPTYMGGHMVDSGTFVTAQLSALHINETVFENPQKFDP 1164489

1164490 ERFIRDESLLQKVIPFGVGKRACLGESLARSELYL (0)

1164643 IFGNLLLRYNFKPHGKLSTVEVLPYSFVKRPFKLEMQFVKI* 1164768

 

>CYP34A.c cb25.fpc0023 86% to 34A4 (3A4 needs revision 3 small segments are missing)

761983 MFLILFFSTIITVLVVHQWMKRRRLPK (1) 761900

761852 GPLPIPFFGNLHQLFYFYWKYGGAVAAYRHLEQEYGKVFTVWIGPLPTVYISDYDIAHET 761673

761672 HIKRGNVFGARFAVGALNYIREGRGIVASNGEFWQEHRRFALKTLRDFGLGRNLMEEKIM 761493

761492 EEYRYR (2) 761475

761370 FSQTSNGINNKSSIETNSSMFFDLLIGSIINQLLISERFEQ (0) 761248

761207 GDPEFEKLKKSLSVGLEKFGVLDIFLPDWIMNAWWMKWRMDDILGPFSWIH 761047

761046 RLSARNVNRRMEMIESGEHVIDGDGTDFMDAYINKSEKDKREGVNSTFT (2) 760900

760855 LENLAIDMYDLWIAGQETTSTTLSWACACLLNHPEVVQKARDELVHLTGGHRSVSLTDKT 760676

760675 QTPYLNAVIN (0)

       EVQRIASILNVNIFRQTSEDTIVDGQPIAAGTALT 760496

760495 THLALIHTDETLFKDHTKFNPERFLENSCLEKKLIPFGIGKRSCLGESLARAELYLVLGN 760316

760315 MILDYDLEPIGEIPQIKTTSPFGIMKRPPVYSLRFVPVHHG* 760190

 

>CYP34A.d 74% to 34A5 cb25.fpc0023

765448 MFLFLIFFSTLTYFCLNLYRRRQKNPN (1)

765324 GPIPLPLIGNLHQIAYYAKKTGGIVPAFNEFKKQYGNVFTLWMGPVPSVYIADFDIAYET 765145

765144 HVKRVNVFGHRYTIGGMDYIREGRGIVGSNGDFWLEHRRFALTTLRNFGLGRNIMEEKIM 764965

764964 DEYRYR (2)

763513 IQDFSKTHAIGKESFEVSAATFFDILVGSVINQMLVSERFEQGDEDFQKLKVSLAK 763346

763345 SLEELSPIDMFTPLSILKSPLWRWRTKVTLAPFDFVLEMVSKGIKRR (2) 763205

763161 MAAIENGTHVVNEEGDDFVDAFLVKINKDQKDKILDSSFT (2) 763045

762995 LDTLAIDLYDLWLAGQETTSTTLTWAVVCLLNHPEVVEELRKELDGVTGGTRGVSLTDRS 762816

762815 KTPFLNATIN (0) 762786

762741 EVQRIASILNINPFRHLQEDTVIDGQPIAAGTCVTANLAQFHSDEELFKEHTKFKPE 762571

762570 RFLENNNLEKKLIPFGIGKRSCPGESLARAELYL (0) 762469

762419 ILGNLVMEYNLESVGLVPEIKSPTPFALMKRPPIYDVRFVP 762297

 

>CYP34A.e 7 exons cb25.fpc0023 76% to CYP34A.a

769026 MFFVLILSAVFAVLAVNLWRARQRLPK (1) 768946

768901 GPSPLPLIGNLHQLVYTCWKAGGTVGGFNEFRREFGNVFTIWMGPIPT 768758

768757 VHITDFDVAHETHVKRANTFGVRWANGGMNYIREGRGIIASNGEFWQEHRRFALKTLRDF 768578

768577 GLGRNIMEEKIMEEYNYR (2) 768524

768336 FQDYKKNNFKNGGIEVHASSCFDLLVGSIINTLLVSERFEQDDEDFEKMKLNLAK 768172

768171 TLEKVSIFDAFTPLAVFKSDLWKWRTKKIFGPFDFIFGMVQKGIQKR (2) 768031

767982 RVDAIASGKHIISEEGEDFVDAFLIKMEKDKKDGVKDSTFT (2) 767860

767810 LETLAIDLYDLWLAGQETTSTTLTWACACLLNHPEVVAELRSELVGITGGTRGLSLTDKP 767631

767630 NTPYLNATVN (0)

767531 EIQRIASILNTNLFRELEEDTVIDGQPVSAGAVVTTQMSLLHTNEAVFKNHTVFDPSR 767358

767357 FIENNNLEKTLIPFGIGKRACLGESLARAELYL (0) 767259

767214 ITGNLVLDYNFEPVGTKPEIKTLSPFGLMKRPPNYNIRFVPVNN* 767080

 

>CYP34A.e-de7c detritus exon 7 (partial) 72% to 34A9

766359 IKTPTPFAIMRRPPLYNIRFVP 766294

 

>CYP34A.e-de6b7b detritus exon 6 (Partial) and exon 7

766856 TLSFSNTLTDETLFKDHTKFNPSRFLENNNLEKKIIPFGIGKRSCLGKSLARAELYL (0) 766686

766638 ILGNLVMEYDLELVGEKPEIKTPTPFALMNRPPLYDIRFVPRRN* 766504

 

>CYP34A.b pseudogene cb25.fpc0023 missing N-term, 65% to N-term 71% to C-term of CYP34A.a

770607 PVPVXXXXHTIGYYAWKAGGIVAGFREIERIYGKIFTLWMGPIPTVPIADFEIAHETHVK 770440

770439 RAHTFXXXXXXX 770425

770417 GPMNFIRERRGIAASNGEFWQEHRRFALKTLRDFGLG 770307

770225 LKTLAIDLFDLYVVGQETTSTTLTWAYACLLNHPEVVEELRKELVGVTGGTRGVSLTDKP 770046

770045 NTPYLNATVNXXXXXX 770016

769930 SILNINLFRQLQEDTVIDGQPVSAGTVVTTQLSLLHTNEAVFKNHTVFNPNQYLENSYLE 769751

769750 KTLIPFGIGKRACPGESLARAELYL (0) 769676

769632 IIGNLVLEFNFEPVGMKPEIKTASPFGSMKRPPNYHIRFVPVNN* 769498

 

>CYP34A.f cb25.fpc0023 73% to CYP34A.a

772489 MLLFFVLTFVIGVLTINLWRARRKLPK (1) 772409

772367 GPTPLPLIGNFHQLAYTCWKAGGTVAGYNEFRKEFGKVFTIWLGPLPTVHIADIEVAHET 772188

772187 HVKRANIFGVRYSNGGINYIREDKGVISSNGEYWQEHRRFALTTLRNFGLGRNIMEEKIM 772008

772007 EEYRYRFQDYKRTNFKNGGIEVHASSCFDLLVGSIINTMLVSERFEQGDEEFDYMLESVE 771828

771827 RSLGRLSIFDSFTPPWILKSDWWQWRTKYVFGGLEFLLGMAKRQIERR (2) 771684

771630 VEAISNGEHAIGEDAEDFLDAFLIKMDKDKRDGIVDSSFN (2) 771514

771464 LDSLAIDFFDLWLAGQETTSTTMVWACVCLLNHPEVVAELRRELLEVTGGTRCLTLRDKP 771285

771284 NTPYLNATIN (0)

       EIQRIASIFNTNIFRILEEDALIDGQIVSAGAVFTA 771105

771104 QISLLHTDEAVFKDNTKFDPGRFMENNNLEKNLIAFGLGKRSCLGESLAKAELYL (0) 770940

770855 ITGNLILDFDLEPIGPVPEIKTTTPFGLMKRPPSYNIRFVPIEK* 770721

 

>CYP34A.g cb25.fpc0023 80% to 34A10

776708 MLIILLLVTLLAVATAYQWKKCQKLPK (1) 776628

776582 GPTPLPIIGNFHQFVYYG 776529

776528 LKLGSAVAVYHHFEKVHGKVFTLWMGLLPTVYISDYDIAHETHIKRANIFSNRFATGTTQ 776349

776348 HIRHGRGIIASNGEFWQEHRRFALTTLRNFGLGRNIMEEKIMEEYHYR (2)

776159 VADYTKTHTKNGAIEIHAGTFFDLLVGSIINQLLISERFEQDDQEFEKIKSALAESL 775989

775988 ENFGIVDIFFPCWFLETPLMAWRQKKIFKPFDFVYEVSVRNIAKRVAAVESGEHVIEDGG 775809

775808 NDFADAYLLKIQKDQRDGVKSTFD (2) 775740

775693 YETMAIDVFDLWQAGQETTSTTLTWAFACLLNHPEVAKKLRAELMKLTGGNRTISLTDRPD 775511

775510 TPYLNATCNEVQRIASILNINLFRHIEEDTVIAGQPLAAGTALTTQMSMLHTDEAVFKNA 775331

775330 TEFKPERFLENSNLEKTLIPFGIGKRSCLGESLARAELYL (0)

775162 ILSNLILDFDIEPVGTVPPIKTLNPFGLLRRPPTYNIRFKPVKH* 775028

 

>CYP34A.h cb25.fpc0023 7 exons 67% to 34A9

972276 MLTVLILISIFSIFVVRTWRASRKLPR (1) 972196

972150 GPTPLPLIGNIHQLVYTC 972097

972096 WKAGGVVAGLNEFKKQYGKVFTVWIGPVPTVHISDFEIAHETHIKRANMFGARYLPEMID 971917

971916 YIREGRGIVASNGDFWQEHRRFALRTLRDFGLGRNIMEAKIMEEYRYR (2) 971773

971537 FDDFRKTNFKNGGIEVDAANCFDLLVGSIINTLLISERFEQGDKDFDELNTSLAEGFE 971364

971363 SASILDAFTPSWLMKSDFWKWRSKTICRPFDLIYAQVDKGIKKR (2) 971232

971178 LEDVANGKHIIGEEGDDFVDAFLIKMEKDKMDGVTDSTFN (2) 971062

971008 PETMTIDMYDLWIGGQETTSATLTWACACFLQHPEVVQELREELLGVTGGTRGISLTDKPN 970826

970825 TPFLNVTIN (0) 970799

970744 EIQRIASILSMNLSRQVKEDTVIDGQPISAGTIVTTQMSMIHTDETVF 970601

970600 KNHTEFNPKRFLENNNLDKSLIPFGIGKRVCLGESLARAELYL (0) 970472

970427 ITGNLVLDFNFQPVGQKPEIKTRSPFAMMKRPPHYGIRFVPLN* 970296

 

>CYP35A.c cb25.fpc0023

1184170 MFLILLVSVFLAWLVVRQYQKVSRLPPGPVSLPLIGNLPQIVYYLWSTGGIVPTFDLFRK (0) 1184349

1184393 RYGNIFTLWLGPIPYVSIADYDTAHEVFVKNSNKYADKFIAPLFQAIR (1)

1184585 NDKGVAGTNGEHWQEMKRFVLQTFRNMGVGKDVMELKIIEELDAR (2) 1184710

1184807 CADIDKAAVNGVTITQASDFFDLTVGSIINNILVGRRFDEDNKEEFFKVKHSADSAIEI 1184983

1184984 VSLFDMMLPVWVLKTFFKAHYDTVINSNEETRDFVGKEAEIR (2)

1185152 DNQIKSGKYSIDENNVKDYTDAFLLKIQKDGENKDFN (2)

1185318 IETLKTMIVDLWTTGQETTTTTLISGFNQLLLHPEVMHKARSELMEITENGSRNLSLTDRPKT 1185496

1185497 PYLNAMIGEIQRLASILNINFWRINHEPTYMGGHMVDSGALVAAQLSALHINETVFENPE 1185676

1185677 KFDPERYIRDEPLLQKIIPFGVGKRACLGESLARAELYL (0) 1185793

1185842 IFGNLLLRYNFKPHGKLSTADVFPYSFAKRPFKLEMQFVKIRS* 1185973

 

>CYP35A.d cb25.fpc0023

1156433 MFFVLIISFFLTWLVVRQYQKVSRLPPGPVSLPLIGNLPQIVYYLWSTGSVVSTLDLFRK (0) 1156254

1156211 RYGKIFTLWVGPIPHVSIVDYETAHEVFVKNGNKYADKFHAPLFQEIK (1) 1156068

1156024 KDQGVLSTNGDHWQEMRRFALQTFRNMGVGKDVMELKIIEELNAR (2) 1155890

1155774 CADIDKAAVNGVTTTQASEFFDLTVGSIINNILVGKRFDKNTKGEFLKIKN 1155622

1155621 ALDSAFELFTPFDMTVPIWVLKTFFSKRYNNLISLNDETTGFVAKEAEAR (2)

1155429 YAQMKSGQYSVDENNIQDYSDAFLLKIQKDGENKNFK (2)

1155273 IETLKTMIIDLWITGQETTTTTLISGFNQLLLHPEVMEKARVELMKITENGSRDLSLSDRPKTPYLN 1155073

1155072 AMIAEIQRHASILNINFWRINHEPTNMGGHMVDSGAFVTAQLSALHINETVFEDPQKFDP 1154893

1154892 ERFVRDESLLQKVIPFGVGKRACLGESLARSELYL (0)

1154739 IFGNLLLRYNFKPHGKLSTQEVMPYSFAKRPFKLEMQFVKI* 1154614

 

>CYP35A.f pseudogene cb25.fpc0023 last 2 exons + strand

        ICFWR*NREPTYMGGHMMDSGALVTAQL (frameshift)

        DVLHVNETVFEHPEKIDPQRFIRDEKLLQKVIPFGIGKRTCLGESLSRFELHL (0)

1157187 IFGNLLLRYKLQPHGELSTKNLMPYSAGKRPFKLEMQFINI 1157233

 

>CYP35A.e cb25.fpc0023 76% to CYP35A.b

1188571 MFFVLIVFTFLTWLVVRQYQKVSRLPPGPVSLPLIGNLPQIVYYLYTTGGVVSTLDFFRK (0) 1188392

1188348 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNANKYADKFHSPIFREMRSKR (1)

1188150 GILTANGDHWQEMRRFALFTFRNMGVGKDLMETRIMEELNAR (2) 1188025

1187926 CADIDKAAVNGVTTTQAAEFFDLTVGSIINSILVGKRFEEHNKNDFLKIKEVMDA 1187762

1187761 SFETFSPFDMTMPVWILKNFFPRRFEKMRNGQESSKQFAAKEALKR (2) 1187624

1187563 IDEIKAGRYVIDENNFQDYTDAFLWKMQKDGENEDYK (2)

1187407 VESLKTMILDLWITGQETTTTTLISGFNQLLLHSKFMEKARAELFEITENGSRNVSLSDRPKTPYLN 1187207

1187206 AVIGEIQRHASILNVNFWRWNNEPTYMGGHMVDSGALVTAQLSALHVNETVFEHPEKFDP 1187027

1187026 ERFIRDEKLLQKVIPFGLGKRSCLGESLARSELYL (0) 1186922

        IFGNLLLRYKFQPHGELSTREIMPYSAGKRPFKLEMQFIKV*

 

>CYP35A.g pseudogene solo exon?

1181077 ISFGVGKHTCLGESSARSELYW (0) 1181012

 

>CYP35a.h last exon partial cb25.fpc0023

1176589 RPFKLEMQILKIHHL* 1176542

 

>CYP35B.b cb25.fpc0023 91% to CYP35B.a

1153670 MIFILLLTFFLAILIVRQFQKASNFPPGPTSLPFIGNIHQLIYYCWKHKGVVPAFDYFRQ (0) 1153491

1153444 YGDVFTLWLGPIPHVSICDYETSQEVFIKNGNKYKDRFLPPIFEHVS (1)

1153260 ENMGLISANGESWAEMRRFALLAFRTMGVGRDLMEERILAELDAR (2) 1153123

1153076 CAKIDADAVNGKTIVHTSEFFDLTVGSVINSVLIGKRFDSHNKDEFLKLKGLLDSADELFTMFELTVPV 1152870

1152869 WLLKKCFPERYARIVEVQEEIINFVSREANERYEKVKSGEYTINADDPQDFVEAYLAKI 1152693

1152692 EEEKEKGTTNFYTMQCLKHVIGDLWLAGQGTTATTLVSGFNQLVNHPEVIMKCREELMRL 1152513

1152512 TNNGSRTLSLKDRADSHYLNATIAEIQRHASILNVNFWRYNHEATTIKGYPVDSGSVITA 1152333

1152332 QLGALHVNNDIFKNAGNFYPERFIEDPKLLNQVIPFGIGKRSCVGENIARSEIYL (0) 1152168

1152120 MIGNLLLRYDIKPHGALPSTEDKLPYTAGKLPDKTVKLEFCKL*

 

>CYP35B.c cb25.fpc0023 60% to CYP35B.a

1148938 MILVLIFTTLLAILIVRHYQKVRKLPPGPITLPLIGNNPQIVYYSWKNNGIVPAFEYFRK (0)

1148712 YGNVFTLWLGPFPHVSIADFETNHEVFVKNGNRYKDRFLSPIFEELQ (1) 1148569

1146885 DHGLVFANGEKWAELRKFTMVTFRNMGVGNEIMEEKIMEELDAR (2) 1146754

1146589 CADLDNEAIDGKTVVEHGKLFELMVGSVINSLLVGNRFGEDNKEEFLHIRHLIETFNEV

        FTTFDMIMPVWVLKTFFPTRYAIAADGMNAIKDYVGRVAEKRFEEVKSGKYVLEETNPKD

        YVDAFLIKIQKEKDHPAYT (2) 1146176

1146129 MKSLKYVLLDLWSAGHDTTATTLISGFNQFMHHPEVVRKCREELLRLTDGGSRPLSLKDR

        ADSRYLNATIAEIQRHASILNVNFWKTSDAPTKVNGYELDAGEILTAQMSALHVNEDIFK

        HAGEFMPERFLENEKLIHQLIPFGIGKRSCVGENLARAELFM (0) 1145644

1144837 VFGNLLLRYNIQPHGKLPSTADQLPFSAVKIADTSAKLEFIKI* 1144706

 

>cb25.fpc2310 ACC=CAAC01000047 64% to CYP22A1 only 2 exons

349058 MRISRSTEIQIRDWCYFAISHLAIISFYYMLMLIPVISISPLCFFAAIAL 348909

348908 HLAYFVRQHRYYATEYPPGPPPMAIFGNGVSVNTRAPEKTFLEYRAIYGPVFTLHLSQPT 348729

348728 VILADYDSIHEALIHNGQKTSGRSNSESFVLFTGDRNHGDGVILSSGQKSKNMRQETQKF 348549

348548 LRNWPGKRMDNLVLHHTASLEAALIKQAESKGLSDIKDPISGAIANIIQQMTIGRNYEYQ 348369

348368 DAEFQTQLSNINAVVKELMTFGVFFVNSFPISRFLLQWLPSSWMAYKNSGFKLQQWFRDI 348189

348188 LNEHKVNRHQDDFMSHMLDLQETRPEEFKDLSIILNCGDMWTGGMETTTTTLRWGLIYLL 348009

348008 QNPEVQAKCHQEIISAFGNDAPETSKMHLTPYVRATLSEIQRLANVLPWAIPHK (2) 347847

347799 SFEDCQIGGYKIPANTEIVPALGALLCDPTVFKNPEEFKPERFLDQYGEYSPMEEFRPF 347623

347622 GMGPRACLGEKLARTELYLIFSSLVQNFRFYLNECKLYGTYTV* 347491

 

>cb25.fpc3857 ACC=CAAC01000068 CYP22A3 9 exons cb25.fpc3857

4054766 MQMSNSLYIQIRDWVAFVVSHHVIMAMYFVLSY 4054864

4054865 SVIPYFVPSVFWEKCFIRFFIVHLFLVVFLHYQRVTKYPPGPPPMAILGNSPFINVLSPE 4055044

4055045 KTFLEYREIYGPVFTLHLS (1) 4055101

4055149 QPTVILADYKSIEEALVAN (1) 4055205

4055460 GQKTSGRSSAESFVLFTGDRQDGDGVILAMRQKWKNMRQEILRFMSKWYGKP 4055615

4055616 MDELVLHHTRSLESELVKMAESK (0) 4055684

4055728 CLIDLREPISGAIANVIQQITIGRNYLYQDTEFQAQLKDINAVVKE (0) 4055865

4055909 IMTAEVFFVNCYPFLRFLPEGILRKWTNYKRSGFRLQQW (2) 4056025

4056079 FRTILDEHHINRHQGDFMSHMVDLQESRPDEFN (2) 4056177

4056227 DLSIILTCGDMWTGGMETTTTTLRWGIIYLLNNPEVQTKCHNELVKVFGTDEPEMSKMNQTPYVRAT 4056427

4056428 LSEIQRLANVLPWAIPHK (2) 4056481

4056526 TFEECQVGEHKIPVNTEVIPALGALLCDPTLFENPKEFKPERFLDKEGKYYVMEEFRPF 4056702

4056703 GMGPRVCLGERVARTELYLIFASLLQNFRFYLNR (1)

4056853 TDPIPVAERIIGGITAPPKPYSTRVEYHGQRIVH* 4056957

 

>cb25.fpc0058 ACC=CAAC01000012 85% to CYP44A1

587351 MRRSIKILAENLENCPYTSATTTAPPRRFSEIPGPREIPVIGNSGNLKYAIGT (1) 587503

587554 DAETIENYNKHLEEMYNKYGKIVKENLGFGRKYVVHVFDP (1) 587673

587719 ADAQTVFAADGKTPFIVPLQETTQKYREMKGMNPGLGNL (2) 587835

587882 NGPEWYRLRSSIQHAMMRPQSVQT (2) 587953

588008 YLPFSQKVSDDLVRHVAEEQIRFGNANMQKVAGRWSLESAGQILFEKSLESLGNRSEWAD 588187

588188 GLIELNKKIFQLSAK (2) 588232

588279 MRLGLPIFRLFSTPSWRKMVKLEDEFYAEVDRLMDDALDNLKISESDS (2) 588422

588467 QNMRFASYLINRKELNRRDVKVILLSMFSDGLST (0) 588568

588612 TAPMLIYNLYNIATHPEALRKIQEEIKEDPTSSKLPYLRACIKETFRMFPIGTEVSRITQ 588791

588792 KDLTLSGYLIPAGTAVDINTNILMR (2) 588866

588908 NMVLFSDSPHEFKPERWLEKSKSVHPFAFLPFGFGPR 589018 frameshift

589018 MCAGRRFAEQDLLTSLAKLCANFDIRHRGEPITQIYETLLLPRGNCEFEFKKL* 589179

 

>cb25.fpc0071 ACC=CAAC01000016 CYP36A 94% to CYP36A1

1735798 MLFAQLVILVVIVMLLLCRFANKIRGLPPGPTPWPFI (1) 1735908

1735959 GNTFQVPEDRIDIIINEFKKKYGGIFTLWLPFPTVVICDYD (0) 1736081

1736951 MLKRNIVRNGEAFSGRPDTFIMDMLVQGNYGLFFMENNWWKAQRRFTTHIFRSLGVG 1737121

1737122 QAGTQDTIASLASGLVEKIDNQKDSSIELRPLLV (0) 1737223

1737275 HVVGNVIHKHLFGFTREWNETEILDFHVAINDVLEHFTSPKTQLLDAWPWLAYLDKPLSL 1737454

1737455 GIPRTTRANDAIIQNLEQ (0)

1737561 ALAKHKSGINYDEEPSSYMDAFLKEMKIRAAETALEDGFTEKQLIVAIYDLYSAGMETI 1737737

1737738 IIVLRFAFLYLINNPETQKRIHDELDKNVGRDR (0)

        QVVMDDQKQLPY 1737917

1737918 TCAFLQEVYRLGYVLPVNFLRCTLVDVEDCEGYRLPAGTRVIAQFQSVHVDKKHFPDPEH 1738097

1738098 FNPDRFLN (0) 1738121

1738174 ERGEYIRDDRINPFSMGKRSCLGENLARMEVFLYFCTLMQKFEWHTDGAFAPPIDVI 1738344

1738345 TSSLRAPKPFTIRASRR* 1738398

 

>cb25.fpc3052 ACC=CAAC01000064 86% to CYP23A1

       MPSIKYAILLSVVTSFLYFRIMKSFEMDSYT (0)

703199 TTYIYLFFCTISYIFYESNYKRRRLPNGPMPWLVAGNMPSFIN (1) 703068

703013 VKNVDDLFLYWKQRYGGIFTVWIGPIPLVM (0) 702810

702156 VSDLPTIKKYFIQHADAFSNRWRNFVTDSIM (1) 702064

701895 EGSNGIVQIDGDKWREQRRFALHTLRDFGVGRPLMEQMITLE (0) 701755

700650 VTTLMNHMAKSCGLSTKEVNLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHSLLDDQ 700468

700467 SHTVMQPIMGAYIAFPVTTKVPFINGEWNRLMGIKKELLEFLEGQIQKHRENWKEEMMEQ 700288

700287 EPEDLTYAYMIEVEKRKRAGEDVGFFD (2) 700207

699397 DQQLKMLLLDLFFAGMETTVTTLKWAFLLMSKNPRVQRKVQEELDSIAQPMVEIQHRTRL 699218

699217 PYIQATINEIQRIANILPINLLRTVAEDIDIDGYQFKKGDLTIPQISILMNDPEIFKNPK 699038

699037 DFCPERFLDENLNVKKIDEFLPFSIGRRQCLGESLARAELYLIFANLMQNFKFEVSEDVT 698858

698857 TE (0)

       RVLGLTVSPVQYTCKISRRGLDHNENCVK

 

>cb25.fpc3857 ACC=CAAC01000068 CYP43

3469593 MCSVVGDKTFSMLRGATPVIVTNDVNLIHAISTESFDCFHSRI (0) 3469721

3469790 PEALSDDPITAENIHMFAAKGERWKRLRTITSYGLSTVKLKL (0) 3469915

3469968 LFPTIETCVSEFLDHVNSLSDGQSVVIDHSHS (2) 3470063

3470107 LFQNHTSYVLARCAYGHKEQNHRVNNFLSVFSDAFGALSDFQKSMTEKIT (1) 3470256

3470304 YFFPEMKSIFKNNLFAQFLQSTNQQKFLDYLLNLISKFQSRRVIENNNNEESSDPEAGK 3470480

3470481 YSLLEFFFEHHHEKKIVEKAEGRIDMKKVKVEKSISYQ (0) 3470594

3471312 EITAQCKFISVAGFDTTANTLTLLFNFLAHNPAIQEQIYNSEIKGNKSSMNFE 3471470

3471471 TVCSLPILQNCIFETLRLFPHASP (2) 3471542

3471589 LQMRICTAPITIGQYKFDENMQVVINPWGPHRDRVIWGDDVNCFKPSR (2) 3471732

3471779 FESLSEQQRKAFMPFGVGPRQCVGMRFALLELKTTAFRMLQKYVVRSTQPVIDRHGKL (0) 3471955

3471997 VNMTVRDTGTVWPTDKLGLVLTKRDI* 3472077

 

>cb25.fpc4470 ACC=CAAC01000141 CYP29A.f

299641 MTIIIPIIFGMVLIYLSTWIPTLRKFRRHWRYGNK (0) 299745

300059 MPGPPAHPIYGNLGPIVGLNTE (1) 300124

300323 DLPAVFINWAAEQRDKGNSIMRVMLLGTVYVWPLNGKAAAAIVDSTTETNKGDDYEFFSP 300502

300503 WLGGGLLLEGYGERWRSHRKMLTPAFHFAKLGGYFEVFNNEAK (0) 300634

       IMVDLLSDFCDSGKTVDIFPYVKRCALDI 300764

300765 ISETAMGIKIDAQMNHDHKYVQAVEGFNKIGVLVSFNPHLKNPFIFWATGYKAQYDDYLH 300944

300945 TLKNFTEKVIKERRAAHESGEIEVEKSKRMMNFLDLMLSMEEANQLTSEDIRQEVDTFMF 301124

301125 AGHDTTTSSTSWACWNMAHHPDVQEKVYKEMMEVFGDDPSTDITLENLGKLSYLDRVLKE 301304

301305 SKRIIPPVPALQRKLTNDLEI (1)

       SDGYTVPAGGNVTISPMVLHSNHLV 301484

301485 FDNPEKFDPDRFLPDEVSKRHPYDFMPFLAGPRNCIGQKFAQLNEKVMLCHIIRNFKIEP 301664

301665 TLGYKDTKQCLEVVTKPSKGIPVRLVRKI* 301754

 

>cb25.fpc4126 ACC=CAAC01000098 CYP29A.a

1752205 MALIVPVFIALVVIYILSFYKQIRNYFRLNHYGSKLPGPPLERIMGNANILKNKTSD (1) 1752375

        KMGLVFKEEARKARERGDSIVRYLLPGKLIVWPLNGKAVSQVLDSTT 1752561

1752562 EIHKGSDYEMFKPWIGGLLLLL (2)

1752675 VGDTWKFHRKLITPTFHFAKLEGYFNVFNSESK (0) 1752773

1752860 ILTELLEKFADSGETVDIFPYINRCLLDIICEAGMGTKVDAQFNHDHPYLKAVKGYVELM (0) 1753039

        MKTSTRPLLWNPFLFWALGYKRQQMDYLKTLKKFTAD (0) 1753195

1753246 VIAERKQARELEGQTGASVSKRNMNFLDLMLSTKESNGLTEEELRNEVDTFMFGGHDTT 1753422

1753423 TTSCSWMVWSLAHHQDIQQKVHEEIVSNCGEYPNEEITYEQANKLYYLELVMKESKRFFP 1753602

1753603 PVAAVQRHLQEPMVI (1) 1753647

1753697 DGHKIPAGTNIAIAPLVLHSNPEVFKNPEVFDPNRFLPEECAKRHAYDYIPFSAGVKNCI 1753876

1753877 GQKFSVLNEKVLISHLVRNFKIEPMLELDGTRPCFEVVSRPSKGIPVKLTRRL* 1754038

 

>cb25.fpc4126 ACC=CAAC01000098 CYP29A.b

1755258 MALLVPVFVSFLVLILFWLFQKYQKIQKVNYYGSQIPGPKGNWLTGNLSMFQKSNT (1) 1755425

        GLIEMFQEEAKKFREQGHSVMRYIMPGKLLVFPLTGKTVSKILESTTEI 1755617

1755618 EKGEDYDFFEPWIGGGLLVS (2)

        VGERWKSHRKLITPSFHFAKLEGYF 1755797

1755798 DVFNQESK (0) 1755821

1756083 ILVECLEKFSDSGEQVDLHPFINRCTLDIICETAMGTKVGAQFNHDNSYLKAVDGYSTMMV (1) 1756262

1756313 KYAYNPIMWNPFLFWILGHKKRQNDLLYSLKKFTGD (0) 1756420

1756468 IIAERKAALESGEAEKFTSKRSMNFLDLMLSMKESNVLSEEDLRQEVDTFMFGGHDTTTT 1756647

1756648 SCSWTCWNLAHNPEVQQKVCEELAEVCGEDPNGDISYEQANQLHYLDRVLKESKRLIAPV 1756827

1756828 ALVDRKLQKEMEI (1) 1756866

1757132 SGYIIPPGSSVSIAPVILHNNHDVWKNPEVFDPDRFLPEECAKRHPYDFVPFSAGIKNCIG 1757314

1757315 QKFSVLNEKVMVSHLVRNFKIEPMAKFHETLPCFESVSKPSNGIPVKLIRRV* 1757473

 

>cb25.fpc4126 ACC=CAAC01000098 CYP29A.c

1759597 MSILIPVAVLLFLVYILSFYDHVRLMRKFWIYGGK (0) 1759701

1760434 MPGPPAHPIFGNASLFKNKTTE (1) 1760499

1760647 DFVEIFVNLANEARAKGANLMRTQVMNRIYVWPLNGKTAA (0) 1760766

1760814 FILESSTEMNKGDDYSFLVPWLGGGLLMAQGEKWKNHRKMLTPAFHFAKLEGYLDVFNQESK (0) 1760999

1761050 VLIDCIEKAMETQEMIDLFPFFKRCTLDIICGTAMGIKFGAQLGKNHEYVKAVEGFNKLT (0) 1761229

1761467 VEYSLNPFLWNKFVYWALGYQKMHDDFLLTLKKFTNDAIVERRAAIASGEVEKETS 1761634

1761635 KRKMNFLDILLSSEESNELTSEDIRKEVDTFLFAGHDTTSTSLSWLCWNLAHNPDVQENV 1761814

1761815 YREILTVFGEDPNEDVTSEKINRMEYTERVMKESKRMFAPVPGVQRKLTKDIVI (1) 1761976

1762311 DGITIPSEGNITISPTVLHCNPYIYEKPEKFDPDRFLPEECAKRHSYDYIPFSAGLRNCI 1762490

1762491 GQKFSILNEKVMLIHILKHFRLEPKLGFYETKPLFE (0) 1762601

1762660 VVAKPSHGIPMKLIRRY* 1762713

 

>cb25.fpc4023 ACC=CAAC01000072 CYP29A.h

527778 MAFLIPALVALLLVIIATLWKKISNLQKFRKCDDT (0) 527882

527930 IPGPPAHPIFGNVSAFSGKNTA (1) 527995

528062 EIYEIFRKTADEFRSKGKNVIRYRFLDKLYVWPLTGKALA (0) 528181

528248 KVLESTTELNKGDDYKFYYSWFGGGVLVEGFGERWRTHRKLLTPTFHFAKLEGYLEVFNEESK (0) 528439

528483 IMIECLDKYAESGKTVDMFEYIKRCALDVICGAAMGTKVNAQYYHEHPYVKAVEGFNTLA (0) 528662

528710 IQHALNPLYEFDFIYWALGLKKQKDEYLHTMKKFTGDVIAERQAAIASGEV 528862

528863 EKETSKRKMNFLDILLSSGEANILSDEDIRQEVDTFMFAGHDTTTTALSWMCWNMAHHPD 529042

529043 IQQKVYEELVSIFGEDPHTEVTTEGLSKLDYTERVLKESKRQTIPVPALQRKLINDMEI (1) 529219

530770 DGYTIPSGANVAIAPMVLHKNAEAFPDPDKFDPDRFLPDEIAKRYAYDYIP 530922

530923 FSAGLRNCIGQKFAQMNEKVMLIYILKNFRLEPMAGFNSTKPVFE (0) 531057

531101 QAVARPSHGIPVKLIRRN* 531157

 

>cb25.fpc2220 ACC=CAAC01000044 CYP29A.d pseudogene

1861693 MALLIPILLGVLAIIFVMCSKRIKWILKVFEFSDKLPGPTTQPFIGNISSFAKMSKI (1) 1861863

1861911 EMLDYMIKTANELRDSGESIMRFQFIGKLLVFPLDGKSAQTLLQSTTELDKGDDYEFARAWLGRSV 1862108

1862109 LLDGYGERWKSHRRLVTPTFHFAKLEGYLGVFNRETK (0)

        VMVELLDEF 1862288

1862289 AKSGETVDLFHYIKRCTLDIICGTAMGTTIDAQHNPTHSYVLAVERFNKLS

        VDHSMKAHLQIPFLFWLLGYQKKKDDYVYTMKKFTRDVTAGRMTGL 1862621

1862622 KLGHIENHASKRDMNFLDILLCSKETKDWSE 1862714 (frameshift)

1862714 EDIREEVDTFMFAGHDTTAASFSWMCWNMAHNQDIQEKVYKEIIEIFGDDPNGDVDSEGI 1862893

1862894 KKLDYTERVLKESKRRIAPVPTLQRKLREDMEI (1) 1862992

        GGHMIPAGVNISVA missing 29 aa

1863083 RNAYDYIPFSAGLRNCVGQKFAQLNEKVLLIHMLRNYRIEPMLDFNGTQPSLE (0)

        IISRPSNGIPVMLTRR* 1863334