Rat cytochrome P450s

 

108 Rat P450 sequences.  This is the beginning of a revision of the rat P450s.

I am currently looking for more members in the 7 gene clusters seen in mouse.

The April 1, 2004 Nature issue on the rat genome had a figure showing 84 P450s

in the rat on a tree diagram.  I am looking for these and more.  There are some

major nomenclature problems because rat genes have been named for the closest match in

the database, usually a mouse gene.  This does not work when the closest match is not

the true ortholog.  Many of the names in GenBank will need to be changed.

 

The 4F gene cluster appears to be conserved with all 9 functional genes occurring

in the same order and orientation as in the mouse 4f cluster.  4F5 is the ortholog of

4f16, 4F4 is the ortholog of 4f15, 4F1 is the ortholog of 4f14 and 4F6 is the ortholog

of 4f13.  The other new rat genes (4F39, 4F17, 4F37, 4F40 and 4F18) will be named

for their orthologs in the mouse.  The pseudogenes are not conserved.

Gene order and orientation (+/-) are: 4F39+, 4F17+, 4F5/4f16+, 4F37+, 4F40+,

4F4/4f15+, 4F1/4f14-, 4F6/4f13-, 4F18+
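
The order string above can be split into per-gene records. The following is a minimal Python sketch, assuming each entry is a rat gene name, an optional "/mouse" ortholog name, and a trailing + or - sign. The names parse_cluster and ORDER_4F are illustrative and not part of the original page.

```python
import re

# Gene order string copied from the text above.
ORDER_4F = ("4F39+, 4F17+, 4F5/4f16+, 4F37+, 4F40+, "
            "4F4/4f15+, 4F1/4f14-, 4F6/4f13-, 4F18+")

# Rat name, optional "/mouse" name, then the strand sign.
ENTRY = re.compile(
    r"^(?P<rat>[0-9A-Za-z]+)(?:/(?P<mouse>[0-9A-Za-z]+))?(?P<strand>[+-])$"
)

def parse_cluster(order):
    """Return a list of (rat_gene, mouse_ortholog_or_None, strand) tuples."""
    genes = []
    for item in order.split(","):
        m = ENTRY.match(item.strip())
        if m is None:
            raise ValueError(f"unrecognised entry: {item!r}")
        genes.append((m["rat"], m["mouse"], m["strand"]))
    return genes

genes = parse_cluster(ORDER_4F)
# Genes on the minus strand, in cluster order.
minus = [rat for rat, _, strand in genes if strand == "-"]
print(len(genes), minus)
```

Entries without a slash are the new rat genes whose mouse partner is not spelled out in the order string, so their ortholog field is left as None.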

 

The CYP2ABFGST cluster has 14 full length genes, one complete pseudogene with

a few splice site errors (2B16P) and 9 small pseudogene fragments. 

The gene order is:

2S1-, 2B1+, 2B2+, 2B3+, 2B16P+, 2B14P+, 2B21-, 2B12+, 2BNEW+, 2B15+, 2G1+,

2A3+, 2ANEW+, 2A2+, 2F4+, 2T1+

 

Only 2S1 and 2B21 are oriented opposite to the cluster major orientation (+).

2b23 in mouse is also (-) and these two appear to be in orthologous locations, so

the orientation may be preserved.  In the mouse 2a22 is oriented opposite to the

other genes, but it was on a small contig that might be incorrectly oriented.
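
The orientation statement for the CYP2ABFGST cluster can be checked directly against the gene order list. This is a small Python sketch, assuming the last character of each entry is the strand sign; split_strands is a hypothetical helper name.

```python
from collections import Counter

# Gene order of the CYP2ABFGST cluster, copied from the text above.
ORDER_2ABFGST = ("2S1-, 2B1+, 2B2+, 2B3+, 2B16P+, 2B14P+, 2B21-, 2B12+, "
                 "2BNEW+, 2B15+, 2G1+, 2A3+, 2ANEW+, 2A2+, 2F4+, 2T1+")

def split_strands(order):
    """Map each gene name to its strand sign ('+' or '-'), keeping list order."""
    strands = {}
    for item in order.split(","):
        item = item.strip()
        strands[item[:-1]] = item[-1]
    return strands

strands = split_strands(ORDER_2ABFGST)
# The most common sign is taken as the cluster's major orientation.
major = Counter(strands.values()).most_common(1)[0][0]
opposite = [gene for gene, sign in strands.items() if sign != major]
print(major, opposite)
```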

 

The rat has three genes between 2B21 and 2G1.  The mouse has 2b19 in this

location, so the rat may have expanded the 2b19 gene to three genes.  If we assume

this is correct, there is a reasonable orthologous relationship of genes in the rat and mouse clusters.

 

2S1/2s1, 2B1/2b10, 2B2/2b13, 2B3/2b9, 2B21/2b23, 2B12/2b19, 2BNEW/2b19,

2B15/2b19, 2G1/2g1, 2A3/2a5, 2ANEW/2a22, 2A2/2a12, 2F4/2f2, 2T1/2t4.
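
The proposed expansion of 2b19 shows up as a one-to-many entry when the pairs are grouped by mouse gene. This is a minimal Python sketch, assuming each pair is written rat/mouse as in the list above; group_by_mouse is an illustrative name.

```python
from collections import defaultdict

# Rat/mouse pairs copied from the list above.
PAIRS = ("2S1/2s1, 2B1/2b10, 2B2/2b13, 2B3/2b9, 2B21/2b23, 2B12/2b19, "
         "2BNEW/2b19, 2B15/2b19, 2G1/2g1, 2A3/2a5, 2ANEW/2a22, 2A2/2a12, "
         "2F4/2f2, 2T1/2t4")

def group_by_mouse(pairs):
    """Return {mouse_gene: [rat_genes, ...]} in the order the pairs are listed."""
    groups = defaultdict(list)
    for pair in pairs.split(","):
        rat, mouse = pair.strip().rstrip(".").split("/")
        groups[mouse].append(rat)
    return dict(groups)

groups = group_by_mouse(PAIRS)
# Mouse genes matched by more than one rat gene.
expanded = {mouse: rats for mouse, rats in groups.items() if len(rats) > 1}
print(expanded)
```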

 

Last modified Oct. 12, 2004

 

D. Nelson

 

>CYP1A1 X00469

MPSVYGFPAFTSATELLLAVTTFCLGFWVVRVTRTWVPKGLKSP

PGPWGLPFMGHVLTLGKNPHLSLTKLSQQYGDVLQIRIGSTPVVVLSGLNTIKQALVK

QGDDFKGRPDLYSFTLIANGQSMTFNPDSGPLWAARRRLAQNALKSFSIASDPTLASS

CYLEEHVSKEAEYLISKFQKLMAEVGHFDPFKYLVVSVANVICAICFGRRYDHDDQEL

LSIVNLSNEFGEVTGSGYPADFIPILRYLPNSSLDAFKDLNKKFYSFMKKLIKEHYRT

FEKGHIRDITDSLIEHCQDRRLDENANVQLSDDKVITIVFDLFGAGFDTITTAISWSL

MYLVTNPRIQRKIQEELDTVIGRDRQPRLSDRPQLPYLEAFILETFRHSSFVPFTIPH

STIRDTSLNGFYIPKGHCVFVNQWQVNHDQELWGDPNEFRPERFLTSSGTLDKHLSEK

VILFGLGKRKCIGETIGRLEVFLFLAILLQQMEFNVSPGEKVDMTPAYGLTLKHARCE

HFQVQMRSSGPQHLQA

 

>CYP1A2 K02422

MAFSQYISLAPELLLATAIFCLVFWVLRGTRTQVPKGLKSPPGP

WGLPFIGHMLTLGKNPHLSLTKLSQQYGDVLQIRIGSTPVVVLSGLNTIKQALVKQGD

DFKGRPDLYSFTLITNGKSMTFNPDSGPVWAARRRLAQDALKSFSIASDPTSVSSCYL

EEHVSKEANHLISKFQKLMAEVGHFEPVNQVVESVANVIGAMCFGKNFPRKSEEMLNL

VKSSKDFVENVTSGNAVDFFPVLRYLPNPALKRFKNFNDNFVLSLQKTVQEHYQDFNK

NSIQDITGALFKHSENYKDNGGLIPQEKIVNIVNDIFGAGFETVTTAIFWSILLLVTE

PKVQRKIHEELDTVIGRDRQPRLSDRPQLPYLEAFILEIYRYTSFVPFTIPHSTTRDT

SLNGFHIPKECCIFINQWQVNHDEKQWKDPFVFRPERFLTNDNTAIDKTLSEKVMLFG

LGKRRCIGEIPAKWEVFLFLAILLHQLEFTVPPGVKVDLTPSYGLTMKPRTCEHVQAW

PRFSK

 

>CYP1B1 U09540

MATSLSADSPQQLSSLSTQQTILLLLVSVLAIVHLGQWLLRQWR

RKPWSSPPGPFPWPLIGNAASVGRASHLYFARLARRYGDVFQIRLGSCPVVVLNGESA

IHQALVQQGGVFADRPPFASFRVVSGGRSLAFGHYSERWKERRRAAYGTMRAFSTRHP

RSRGLLEGHALGEARELVAVLVRRCAGGACLDPTQPIIVAVANVMSAVCFGCRYNHDD

AEFLELLSHNEEFGRTVGAGSLVDVMPWLQLFPNPVRTIFREFEQINRNFSNFVLDKF

LRHRESLVPGAAPRDMMDAFILSAEKKATGDPGDSPSGLDLEDVPATITDIFGASQDT

LSTALLWLLILFTRYPDVQARVQAELDQVVGRDRLPCMSDQPNLPYVMAFLYESMRFT

SFLPVTLPHATTANTFVLGYYIPKNTVVFVNQWSVNHDPAKWSNPEDFDPARFLDKDG

FINKALASSVMIFSVGKRRCIGEELSKTLLFLFISILAHQCNFKANQNEPSNMSFSYG

LSIKPKSFKIHVSLRESMKLLDSAVEKLQAEEACQ

 

>CYP2A1-de2b exon 2 pseudogene Chr1 (-) only 240bp from Cyp2a22 ortholog start Met

82084718 YNAVKEALVDQAEGFSGQGEQA 82084653

 

>CYP2A1 NP_036824 88% to 2A2 chr1 (+) Cyp2a22 ortholog

82084958 MLDTGLLLVVILASLSVMLLVSLWQQKIRGRLPPGPTPLPFIGNYLQLNTKDVYSSITQ 82085134

82085434 LSERYGPVFTIHLGPRRVVVLYGYDAVKEALVDQAEEFSGRGEQATYNTLFKGY 82085595

82088031 GVAFSSGERAKQLRRLSIATLRDFGVGKRGVEERILEEAGYLIKMLQGTC 82088180

82088398 GAPIDPTIYLSKTVSNVISSIVFGERFDYEDTEFLSLLQMMGQMNRFAASPTG 82088556

82089778 QLYDMFHSVMKYLPGPQQQIIKVTQKLEDFMIEKVRQNHSTLDPNSPRNFIDSFLIRMQE 82089957

82093158 EKNGNSEFHMKNLVMTTLSLFFAGSETVSSTLRYGFLLLMKHPDVE 82093295

82093737 AKVHEEIEQVIGRNRQPQYEDHMKMPYTQAVINEIQRFSNLAPLGIPRRIIKNTTFRGFFLPK 82093925

82094440 ATDVFPILGSLMTDPKFFPSPKDFDPQNFLDDKGQLKKNAAFLPFST 82094580

82098022 GKRFCLGDGLAKMELFLLLTTILQNFRFKFPMKLEDINESPKPLGFTRIIPKYTMSFMPI 82098201
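
Each exon line above pairs a peptide with its genomic start and end coordinates. This is a hedged Python sketch of a parser for the plain "start SEQUENCE end" form (no parenthesised marks), assuming the coordinates are inclusive and that a descending pair means the minus strand. It also checks whether the span equals three bases per residue, which holds for the two lines used here but need not hold for lines with frameshifts or partial codons. The function names are illustrative.

```python
import re

# "start  SEQUENCE  end" as used in the exon lines above.
EXON = re.compile(r"^\s*(\d+)\s+([A-Za-z*]+)\s+(\d+)\s*$")

def parse_exon(line):
    """Return (start, end, strand, peptide) for one coordinate-annotated line."""
    m = EXON.match(line)
    if m is None:
        raise ValueError(f"not an exon line: {line!r}")
    start, pep, end = int(m.group(1)), m.group(2), int(m.group(3))
    # Ascending coordinates are read as plus strand, descending as minus.
    strand = "+" if end >= start else "-"
    return start, end, strand, pep

def span_matches(start, end, pep):
    """True if the inclusive genomic span is exactly three bases per residue."""
    return abs(end - start) + 1 == 3 * len(pep)

# First two exon lines of CYP2A1, copied from the record above.
lines = [
    "82084958 MLDTGLLLVVILASLSVMLLVSLWQQKIRGRLPPGPTPLPFIGNYLQLNTKDVYSSITQ 82085134",
    "82085434 LSERYGPVFTIHLGPRRVVVLYGYDAVKEALVDQAEEFSGRGEQATYNTLFKGY 82085595",
]
for line in lines:
    start, end, strand, pep = parse_exon(line)
    print(strand, len(pep), span_matches(start, end, pep))
```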

 

>CYP2A2-de2b exon 2 pseudogene Chr1 (-)

82115528 LKPHWVVVLYEWDAVKEALGDQAEELSG*GEQANL 82115445

 

>CYP2A2 J04187 Cyp2a12 ortholog

82117349 MLDTGLLLVVILASLSVMFLVSLWQQKIRERLPPGPTPLPFIGNYLQLNMKDVYSSITQ 82117525

82117991 LSERYGPVFTIHLGPRRIVVLYGYDAVKEALVDQAEEFSGRGELPTFNILFKGY 82118152

82123228 GFSLSNVEQAKRIRRFTIATLRDFGVGKRDVQECILEEAGYLIKTLQGTC 82123377

82123595 GAPIDPSIYLSKTVSNVINSIVFGNRFDYEDKEFLSLLEMIDEMNIFAASATG 82123753

82124978 QLYDMFHSVMKYLPGPQQQIIKVTQKLEDFMIEKVRQNHSTLDPNSPRNFIDSFLIRMQE 82125157

82139054 EKYVNSEFHMNNLVMSSLGLLFAGTGSVSSTLYHGFLLLMKHPDVE 82139191

82139607 AKVHEEIERVIGRNRQPQYEDHMKMPYTQAVINEIQRFSNLAPLGIPRRIIKNTTFRGFFLPK 82139795

82140311 GTDVFPIIGSLMTEPKFFPNHKDFNPQHFLDDKGQLKKNAAFLPFSI 82140451

82141451 GKRFCLGDSLAKMELFLLLTTILQNFRFKFPMNLEDINEYPSPIGFTRIIPNYTMSFMPI  82141630

 

>CYP2A3 J02852 NM_012542 exon 4 in a seq gap in genome seq chr1 (+) Cyp2a5 ortholog

82023007 MLASGLLLVASVAFLSVLVLMSVWKQRKLSGKLPPGPTPLPFIGNYLQLNTEKMYSSLMK 82023186

82023453 ISQRYGPVFTIHLGPRRVVVLCGQEAVKEALVDQAEEFSGRGEQATFDWLFKGY 82023614

82024296 GVAFSSGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIESFRKTN 82024445

         GALIDPTFYLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSTG

82026488 QLYEMFSSVMKHLPGPQQQAFKELQGLEDFITKKVEQNQRTLDPNSPRDFIDSFLIRMLE 82026667

82028068 EKKNPNTEFYMKNLVLTTLNLFFAGTETVSTTLRYGFLLLMKHPDIE 82028208

82028659 AKVHEEIDRVIGRNRQAKYEDRMKMPYTEAVIHEIQRFADMIPMGLARRVTKDTKFREFLLPK 82028847

82029417 GTEVFPMLGSVLKDPKFFSNPNDFNPKHFLDDKGQFKKSDAFVPFSI 82029557

82030741 GKRYCFGEGLARMELFLFLTNIMQNFCFKSPQAPQDIDVSPRLVGFATIPPNYTMSFLSR 82030920

 

>CYP2A3-de1b exon 1 pseudogene Chr1 (+)

82052140 MLGSRLLLVAVLSCLCVMVFMPVWQQQYRDTIPPG 82052244

 

>CYP2B3-se1[9] exon 9 100% match to 2B3 chr1 (+)

81263180 GKRMCLGEGIARSELFLFFTTILQNYSVSSPVDPNTIDMTPKESGLAKVAPVYKICFVAR* 81263362

 

>CYP2B3-se2[1] duplicate exon 1 100% match Chr1 (-)

81308557 MDTSVLLLLAVLLSFLLFLVRGHAKVHGHLPPGPRPLPLLGNLLQMDRGGFRKSFIQ 81308387

 

>CYP2B1 J00719 Rn.91353 chr1 (+) 1 aa diff to CYP2B1

81344956 MEPSILLLLALLVGFLLLLVRGHPKSRGNFPPGPRPLPLLGNLLQLDRGGLLNSFMQ 81345126

81357886 LREKYGDVFTVHLGPRPVVMLCGTDTIKEALVGQAEDFSGRGTIAVIEPIFKEY 81358047

81358200 GVIFANGERWKALRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKSQ 81358349

81360925 GAPLDPTFLFQCITANIICSIVFGERFDYTDRQFLRLLELFYRTFSLLSSFSS 81361083

81361768 QVFEFFSGFLKYFPGAHRQISKNLQEILDYIGHIVEKHRATLDPSAPRDFIDTYLLRMEK 81361947

81362389 EKSNHHTVFHHENLMISLLSLFFAGTETSSTTLRYGFLLMLKYPHVA (1) 81362529

81363958 EKVQKEIDQVIGSHRLPTLDDRSKMPYTDAVIHEIQRFSDLVPIGVPHRVTKDTMFRGYLLPK 81364146

81364315 NTEVYPILSSALHDPQYFDHPDSFNPEHFLDANGALKKSEAFMPFST 81364455

81368014 GKRICLGEGIARNELFLFFTTILQNFSVSSHLAPKDIDLTPKESGIGKIPPTYQICFSAR 81368193

 

>CYP2B2 J00720 Rn.91353 chr1 (+) 4aa diffs with CYP2B2 14aa diffs to CYP2B1

81423536 MEPSILLLLALLVGFLLLLVRGHPKSRGNFPPGPRPLPLLGNLLQLDRGGLLNSFMQ (0) 81423706

81426789 FREKYGDVFTVHLGPRPVVMLCGTDTIKEALVGQAEDFSGRGTIAVIEPIFKEY (1) 81426950

81427104 GVFFANGERWKALRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKSQ (1) 81427253

81429793 GAPLDPTFLFQCITANIICSIVFGERFDYTDRQFLRLLELFYRTFSLLSSFSSQ 81429954

81430659 VFEFFSGFLKYFPGAHRQISKNLQEILDYIGHIVEKHRATLDPSAPRDFIDTYLLRMEK 81430835

81431274 EKSNHHTEFHHENLMISLLSLFFAGTETGSTTLRYGFLLMLKYPHVT (1) 81431414

81432829 EKVQKEIDQVIGSHRPPSLDDRTKMPYTDAVIHEIQRFADLAPIGLPHRVTKDTMFRGYLLPK 81433017

81433190 NTEVYPILSSALHDPQYFDHPDTFNPEHFLDADGTLKKSEAFMPFST (1) 81433330

81436959 GKRICLGEGIARNELFLFFTTILQNFSVSSHLAPKDIDLTPKESGIAKIPPTYQICFSAR 81437138
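
Counts such as "14aa diffs to CYP2B1" can be reproduced exon by exon when the two peptides have the same length. This is a minimal Python sketch that lists mismatched positions; it does no alignment, so it only applies to equal-length exons, and aa_diffs is a hypothetical helper name.

```python
def aa_diffs(a, b):
    """List (position, residue_a, residue_b) for mismatches; positions are 1-based."""
    if len(a) != len(b):
        raise ValueError("sequences must be the same length (no gap handling)")
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b), start=1) if x != y]

# Exon 2 of CYP2B1 and of CYP2B2, copied from the records above.
exon2_2b1 = "LREKYGDVFTVHLGPRPVVMLCGTDTIKEALVGQAEDFSGRGTIAVIEPIFKEY"
exon2_2b2 = "FREKYGDVFTVHLGPRPVVMLCGTDTIKEALVGQAEDFSGRGTIAVIEPIFKEY"
print(aa_diffs(exon2_2b1, exon2_2b2))
```

Summing the list lengths over all nine exons gives the per-gene difference count, provided every exon pair is the same length.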

 

>CYP2B3 M20406 chr1 (+) exon 9 not adjacent to this gene. Found at 81263180-81263359

81486567 MDTSVLLLLAVLLSFLLFLVRGHAKVHGHLPPGPRPLPLLGNLLQMDRGGFRKSFIQ (0) 81486737

81514647 LQEKHGDVFTVYFGPRPVVMLCGTQTIREALVDHAEAFSGRGIIAVLQPIMQEY (1) 81514808

81514950 GVSFVNEERWKILRRLFVATMRDFGIGKQSVEDQIKEEAKCLVEELKNHQ (1) 81515099

81516395 GVSLDPTFLFQCVTGNIICSIVFGERFDYRDRQFLRLLDLLYRTFSLISSFSSQ (0) 81516556

81530756 MFEVYSDFLKYFPGVHREIYKNLKEVLDYIDHSVENHRATLDPNAPRDFIDTFLLHMEK (0) 81530932

81531383 EKLNHYTEFHHWNLMISVLFLFLAGTESTSNTLCYGFLLMLKYPHVA (1) 81531523

81536877 EKVQKEIDQVIGSQRVPTLDDRSKMPYTEAVIHEIQRFSDVSPMGLPCRITKDTLFRGYLLPK (0) 81537065

81537233 NTEVYFILSSALHDPQYFEQPDTFNPEHFLDANGALKKCEAFMPFSI (1) 81537373

         GKRMCLGEGIARSELFLFFTTILQNYSVSSPVDPNTIDMTPKESGLAKVAPVYKICFVAR

 

>CYP2B32P pseudogene partial Chr1 (+)

81806528 VLLLLTLIVGFLLFLVSQSQPKTHGHLPPGLCPLPFLGNLLQIKRRGLLNSFMQ 81806689

81808348 AQEKYGDVLTVHPGPRPVVRLCGTDTIREFLFDQAGTFSGQGTVAVLNPVVHGY 81808509

exon 3 missing

81809871 GVPLIPTSFFQRIAANIICSIVFGECFDYKDHQFLHLLDLIYQTFALMAPCPARS 81810035

81810759 VFQLFSGFLKYFPGVHKQISKNLQEILNYIGHSVEKHMATLDPSAPRDFINTYLLHMEN 81810935

81811666 EKSNHHTEFHHQTSVLSHFFDGTETTSTTLCCSFLIMLKYHHVK 81811797

 

>CYP2B12-de9b exon 9 Chr1 (-)

81829155 GKFICLGEGIG*NESFIFFTGILQNLSLASPVAPENIDLTPIKSGAGKIPSTYQIHILSR 81829012

 

>CYP2B12 X63545, S48369, NM_017156 Rn.108913 chr1 (+) 87% to 2b19 possible ortholog

81858238 MEFGVLLLLTLTVGFLLFLVSQSQPKTHGHLPPGPRPLPFLGNLLQMNRRGFLNSFMQ  81858411

81860089 LQEKYGDVFTVHLGPRPVVILCGTDTIREALVDQAEAFSGRGTVAVLHPVVQGY 81860250

81860393 GVIFATGERWKTLRRFSLVTMKEFGMGKRSVDERIKEEAQCLVEELKKYK 81860542

81860739 GAPLNPTFLFQSIAANTICSIVFGERFDYKDHQFLHLLDLVYKTSVLMGSLSSQ 81860900

81861646 VFELYSGFLKYFPGAHKQIFKNLQEMLNYIGHIVEKHRATLDPSAPRDFIDTYLLRMEK 81861822

81862554 EKSNHHTEFNHQNLVISVLSLFFAGTETTSTTLRCTFLIMLKYPHVA 81862694

81864745 EKVQKEIDQVIGSHRLPTPDDRTKMPYTDAVIHEIQRFADLTPIGLPHRVTKDTVFRGYLLPK 81864933

81865086 NTEVYPILSSALHDPRYFEQPDTFNPEHFLDANGALKKSEAFLPFST 81865226

81868929 GKRICLGEGIARNELFIFFTAILQNFTLASPVAPEDIDLTPINIGVGKIPSPYQINFLSR 81869108

 

>CYP2B14P U33540 exon 1 add Chr1 (+) exons 7,8,9 72% to 2B21 to this pseudogene

81706300 MKPNVLLLLAILLSFLLFLVRGHAKVHGHLPPGPRPLPILGNLLQMDRGGLLQSF 81706464

81728276 EKVQKEIGEVTGSHWFPILYSSKIPNTEAVIPEIQR 81728383

81728385 FSDLSSVVLPQRVTKDTFFQGFLLHK 81728462

81728634 NTEVYPILSSVLHDPQ 81728681

81728681 VLEYPVTFNPEHFLDANGALKKNEAFTPFSR 81728773

 

>CYP2B21 AF159245 Chr1 (-)

81765226 MDPSVLLLFALFTGFLLLLIRGQGNGYGHLPPGPCPLPLLGNVLQMDRRGLLKSFIQ  81765056

81759108 LRDKYGDVVTVHLGPRPIVMLYGTETIREALVDHAEAFSGRGTVAVVQPIIQDY  81758947

81758804 GMIFANGERWKILRRFSLATMRDFGMGKRSVEERIKEEAQCLVEELKKYK 81758655

81757889 GAPLDPTFHLQCITANIICSIVFGERFDYTDHQFLHLLDLFYEILSLVSSFSSQ 81757728

81749057 VFELFPGFLKYFPGTHRHISKNIEEILNFIGHCVEKHRATLDPSTPRDFIDTYLLRMEK 81748881

81748412 EKLNHHTEFHHQNLMMSVLSLFFAGTETSSTTLRYGFLLMLKYPHVA 81748272

81747109 EKVQKEIDQVIGSHRVPTLDDRIKMPYTDAVIHEIQRFSDLVPIGLPHRVTKDTLFRGYLLPK 81746921

81746748 NIEVYPILSSALHDPQYFEHPDTFNPEHFLDANGALKKNEAFLPFST 81746608

81736831 GKRVCLGEGIARNELFLFFTTILQNFSVSSPVSPKDIDLTPKESGFAKIPPTYQICFLSRQLG 81736643
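
For a minus-strand gene such as CYP2B21 the coordinates descend, but intron sizes can still be estimated from consecutive exon lines. This is a small Python sketch, assuming inclusive coordinates listed in transcript order. The values are approximate, since the exon lines appear to be written in whole codons rather than at exact splice sites. intron_lengths is an illustrative name.

```python
def intron_lengths(exons):
    """Bases between consecutive exons, given (start, end) pairs in transcript order.

    Only the distance between the end of one exon and the start of the next
    is used, so the same code serves plus-strand and minus-strand genes.
    """
    return [abs(next_start - prev_end) - 1
            for (_, prev_end), (next_start, _) in zip(exons, exons[1:])]

# First three exons of CYP2B21 (minus strand), coordinates from the record above.
cyp2b21 = [(81765226, 81765056), (81759108, 81758947), (81758804, 81758655)]
print(intron_lengths(cyp2b21))
```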

 

>CYP2B31 86% to 2b19 possible ortholog

81918041 MELGVFLLLTFTVGFLLLLASQNRPKTHGHLPPGPRPLPFLGNLLQMNRRGLLRSFMQ 81918214

81919826 LQEKYGDVFTVHLGPRPVVILCGTDTMREALVDQAEAFSGRGTVAVLHPVVQGY 81919987

81920130 GVIFANGERWKILRRFSLVTMRNFGMGKRSVEERIKEEAQCLVEELKKYK 81920279

81922129 GALLNPTSIFQSIAANIICSIVFGERFDYKDHQFLRLLDLIYQTFSLMGSLSSQ 81922290

81923031 VFELFSGFLKYFPGVHKQISKNLQEILNYIDHSVEKHRATLDPNTPRDFIDTYLLHMEK 81923207

81923977 EKSNHHTEFHHQNLVISVLSLFFAGTETTSTTLRYSFLIMLKYPHVA 81924117

81926113 EKVQKEIDQVISSHRLPTLDDRIKMPYTDAVIHEIQRFADLAPIGLPHRVTKDTMFRGYLLPK 81926301

81926476 NTEVYPILSSALHDPRYFDHPDTFNPEHFLDANGTLKKSEAFLPFST 81926616

81930286 GKRTCLGEGIARNELFIFFTALLQNFSLASPVAPEDIDLTPINSGAGKIPSPYQINFLSR 81930465

 

>CYP2B15 D17343 to D17349 86% to 2b19 exons 2-4 in a seq gap in the genome seq Chr1 (+)

81945068 MELGVLLLLTFTVGFLLLLASQNRPKTHGHLPPGPRPLPFLGNLLQMNRRGLLRSFMQ 81945241

         LQEKYGDVFTVHLGPRPVVILCGTDTIREALVDQAEAFSGRGTVAVLHPVVQGY

         GVIFANGERWKILRRFSLVTMRNFGMGKRSVEERIKEEAQCLVEELKKYK

         ALLNPTSIFQSIAANIICSIVFGERFDYKDHQFLRLLDLIYQTFSLMGSLSSQ

81950148 VFELFSGFLKYFPGVHKQISKNLQEILNYIDHSVEKHRATLDPNTPRDFINTYLLRMEK 81950324

81951073 EKSNHHTEFHHQNLVISVLSLFFTGTETTSTTLRYSFLIMLKYPHVA 81951213

81953132 EKVQKEIDQVIGSHRLPTLDDRTKMPYTDAVIHEIQRFADLIPIGLPHRVTNDTMFLGYLLPK 81953320

81953491 NTEVYPILSSALHDPRYFDHPDTFNPEHFLDVNGTLKKSEAFLPFST 81953631

81957185 GKRICLGEGIAQNELFIFFTAILQNFSLASPVAPEDIDLSPINSGISKIPSPYQIHFLSRCVG 81957373

 

>CYP2B16P U33541 to U33546 bad boundary introns 1,5,7 chr1 (+)

81633949 MEPSVLLLLAVLLSFLLLLVRGHAKIHGRLPPGPCPVPLLGNLLQMDRRGLLKSFIQ (?) 81634119

81641847 LQEKYGDVFTVHLGLRPVVVLCGTQTIREALVDHAEAFSGRGTIAGLEPVFQDY (1) 81642008

81642149 GIFFSSGEQWKTLRRFSMATMRDFGMRKKSVEERIKEESQCLVEELKKYQ (1) 81642298

81642886 GAPLDPTFLFQCITSNIICSIVFGECFDYTDHQFLHLLDLMYQTFSLLSSIFSQ (0) 81643047

81645234 VFELFPGVLKYFPGAHRQISRNLHEILDFIGQSVEKHRATLDPNAPRDFIYTYLLHMEK (?) 81645410

81645864 QKSNHYTEFHHWNLLSSVLSLFFAGTETSSTTLRYGFLIMLKYPHIT (1) 81646004

81654168 EKVQKEIDCVIGSHRLPTLDDRSKMPYTEAVIHEIQRFSDLAPIGTPHRVIKDTIFRGYLLPK (?) 81654356

81654524 QNTEVFPILSSVLHDPQYFEQPDIFNLQHFLDANGALKIIEAFLPFST (1) 81654667

81659608 GKRICLGESIARNELFLFFTTILQNFSVSSPVAPKDIDLTPKESGIGRIPQVYQICFLAH*  81659781
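
The parenthesised marks on the CYP2B16P lines can be read by program. This is a hedged Python sketch, assuming the mark after exon n describes the intron that follows it and that "(?)" flags an uncertain boundary. Reading the digits as intron phases is an assumption, since the page does not define them. uncertain_introns is a hypothetical helper name.

```python
import re

# Optional "(0)", "(1)", "(2)" or "(?)" between the peptide and the end coordinate.
LINE = re.compile(
    r"^\s*\d+\s+[A-Za-z*]+\s*(?:\((?P<mark>[012?])\))?\s*\d+\s*$"
)

def uncertain_introns(exon_lines):
    """Return 1-based numbers of the exon lines whose mark is '(?)'."""
    flagged = []
    for number, line in enumerate(exon_lines, start=1):
        m = LINE.match(line)
        if m is None:
            raise ValueError(f"not an exon line: {line!r}")
        if m["mark"] == "?":
            flagged.append(number)
    return flagged

# The nine CYP2B16P exon lines, copied from the record above.
CYP2B16P = """\
81633949 MEPSVLLLLAVLLSFLLLLVRGHAKIHGRLPPGPCPVPLLGNLLQMDRRGLLKSFIQ (?) 81634119
81641847 LQEKYGDVFTVHLGLRPVVVLCGTQTIREALVDHAEAFSGRGTIAGLEPVFQDY (1) 81642008
81642149 GIFFSSGEQWKTLRRFSMATMRDFGMRKKSVEERIKEESQCLVEELKKYQ (1) 81642298
81642886 GAPLDPTFLFQCITSNIICSIVFGECFDYTDHQFLHLLDLMYQTFSLLSSIFSQ (0) 81643047
81645234 VFELFPGVLKYFPGAHRQISRNLHEILDFIGQSVEKHRATLDPNAPRDFIYTYLLHMEK (?) 81645410
81645864 QKSNHYTEFHHWNLLSSVLSLFFAGTETSSTTLRYGFLIMLKYPHIT (1) 81646004
81654168 EKVQKEIDCVIGSHRLPTLDDRSKMPYTEAVIHEIQRFSDLAPIGTPHRVIKDTIFRGYLLPK (?) 81654356
81654524 QNTEVFPILSSVLHDPQYFEQPDIFNLQHFLDANGALKIIEAFLPFST (1) 81654667
81659608 GKRICLGESIARNELFLFFTTILQNFSVSSPVAPKDIDLTPKESGIGRIPQVYQICFLAH*  81659781
""".splitlines()

print(uncertain_introns(CYP2B16P))
```

Under these assumptions the flagged lines agree with the "bad boundary introns 1,5,7" note in the header.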

 

>CYP2C6v1_v1-de1b2b3b4b5b upstream pseudogene 96% identical to seq c

93% identical to seq upstream of CYP2C6v2 allele (temp name = CYP2Cnewb)

243935799 MDLVMLLVLTLSCLIFLSIWRQSSGRGKLP 243935888

243935888 SGPTPLPIIGNFFHLDLKNITQSLTN 243935965

243937699 FSKVNGSVFTLYFGMKPIVILHGYEAIKEGLIDHGEEFTERGSFPVAEKINKGL 243937860

243938035 GIAFSHGNRWKEIRRFTLMTLQNLGMGKKSIEDRVQEESRCLV 243938163

243939079 GSPCDPTFILGCAPCNVICSIIFQNCFDYKDQDFLSLVEKLNENIKIVSSPWI* 243939231

243940291 FCSSFPVFIDYCPGSHMTLAKNVYHTRNYILKKIKEHQESLDVTNPHDFIDYYLINWKQ 243940467

 

>CYP2C6_v1 M13711 two aa changes to match many ESTs (lower case mi) due to frameshift

97% to 2C77 and 2C6v2

243955584 MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS 243955751

243964779 FSKVYGPVFTLYFGTKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKD 243964937

243965112 LGIVFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEELRKTN 243965264

243966104 GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQ 243966265

243967336 FCSFFPVLIDYCPGSHTTLAKNVYHIRNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQ 243967512

243984646 ENHNPHSEFTLENLSITVTDLFGAGTETTSTTLRYALLLLLKCPEVT 243984786

243989157 AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAmiHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK 243989345

243990948 GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 243991088

243992245 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLHPKDIDTTPVFNGFASLPPFYELCFIPL 243992424

 

>CYP2C6P M18336 J03509 M18774 an alternate splice version of 2C6

exon 8 is skipped and replaced by a cryptic exon just past the true exon 8

The GT boundary of the true exon 8 are the first two nucleotides of CYP2C6_v3

Cryptic exon 8

     MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTSFSKV 200

 201 YGPVFTLYFGTKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKDLGIVFSHGNRW 380

 381 KEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEELRKTNGSPCDPTFILGCAPCNVICS 560

 561 IIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQFCSFFPVLIDYCPGSHTTLAKNVYHI 740

 741 RNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQENHNPHSEFTLENLSITVTDLFGAGTE 920

 921 TTSTTLRYALLLLLKCPEVTAKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFID 1100

1101 LIPTNLPHAVTCDIKFRNYLIPK 1169

 

CYP2C6_v2 CK224594.1 CK224593.1 note: the _v2 means alternative splice version 2

CYP2C6_v3 CK224595.1 CK224596.1 (3 nuc shorter at the joint uses the second AG)

               Beginning of exon 7 AGCTAAAG TCCAGGAAGA GATTGATCGT  243989183

GTGGTTGGCA AACATCGCAG CCCTTGCATG CAGGACAGGA GCCGCATGCC CTACACAGAT  243989243

GCCATGATTC ATGAGGTCCA GAGGTTCATT GACCTCATTC CTACCAACCT GCCACATGCG  243989303

GTGACCTGTG ACATTAAGTT CAGGAACTAC CTAATACCCA AG GT end of exon 7

Beginning of cryptic exon out of frame       agcaggtaa tagaaactca  243991103

tttccatggt tccagtgaca tgcagaaccg tggggactta gagtgtgact ctacatgtgc  243991163

tgatagcttg catctgcatg ataaggagca taattttcat tgtgtatgca ctgtcctgga  243991223

tatgaccacc ttctttatca gggt    end of cryptic exon

normal exon 9

1328 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLHPKDIDTTPVFNGFASLPPFYELCFIPL

 

>CYP2C6v2-de1b2b3b4b4c5b upstream pseudogene

EST CK224599.1 = 100% match (with 4 frameshifts) so this is a real gene

clone_lib="RALIUNN03 Sprague-Dawley rat female liver

The CYP2C6_v1 sequence is also seen in this same mRNA library

This GNOMON prediction adds two upstream exons that do not belong to this gene

58596732 MDLVMLLVLTLSCLILLSIWRQSSGRGKHP 58596643 exon 1 frameshift

58596643 SGPTPLPIIGNFFHLDLNNITQSLTS (0) 58596566 exon 1

58594823 FSKVNGSVFTLYFGMKLIVILHGYAATKEGLIDHGEEFTKRGSFPVAEKINKGL (1) exon 2 58594662

58594487 GIAFSHGNRWKEIRRFTLMTLQNLGMGKESIEDRVQEETQCLV*ELRKTN (1) exon 3 58594338

58593451 GSPCDPTFILGCAPCNVICSIIFQNCFDYKDQDFLSLMEKLNENIKIVSSPW 58593296

58592013 GSPCDPTFILGCAPCNVICSIIFQNCFDYKDQDFLSLMEKLNENIKIVSSPW 58591858

58590797 FCSSFPVFIDYCLGSHMTLA 58590738

58590736 NVYHTRNYILKKIKEHQESLDVTNPHDFIDYDLIKWKQ 58590620

AVSIKRNS

 

>CYP2C6v2 allele not in figure, 13 aa diffs to CYP2C6_v1 XM_215255 NW_047916

we are assigning this allele status but it may be a separate gene

(temp name = CYP2Cnewb)

58578624 MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS (0) 58578457

58576741 FSKVYGPVFTLYFGLKPTVILHGYEAVKEALIDHGEEFAERGSFPVVEKINKDL (1) 58576583

58576405 GIAFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDHVQEEARCLVEELRKTN 58576256

58575415 GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKVLSSPWTQ 58575254

58574189 FCSFFPVLIDYCPGSHTTLAKNIYYIRNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQ 58574013

58554666 ESHNPHLEFTLENLSVTVTDLFGAGTETTSTTLRYALLLLLKYPEVT 58554526

58534931 AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK 58534743

58533131 GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 58532991

58531833 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLQPKDIDTTPVFHGFASLPPFYELCFIPL 58531654

 

>CYP2C7 M18335 exons 1,2,3 and 6 are in sequence gaps 93% to 2C7 variant and 2C81

the yellow labels are from a random Chr1 piece that is similar to the CYP2C7 N-term

differences with the published 2C7 sequence M18335 are in cyan

          MDLVTFLVLTLSSLILLSLWRQSSRRRKLPPGPTPLPIIGNFLQIDVKNISQSLTK 

          FSKTYGPVFTLYLGSQPTVILHGYEAIKEALIDNGEKFSGRGSYPMNENVTKGF  

          GIVFSNGNRWKEMRRFTIMNFRNLGIGKRNIEDRVQEEAQCLVEELRKTK      

243849546 GSPCDPSLILNCAPCNVICSITFQNYFDYKDKEMLTFMEKVNENLKIMSSPWMQ 243849385

243847566 VCNSFPSLIDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVDYYLIKQKQ 243847390

243829444 GSPCDPSLILNCAPCNVICSITFQNHFDYKDKEMLTFMEKVNENLKIMSSPWMQ 243829283

this duplicate exon 4 is not in the right sequence order

          ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT

243803857 AKVQEEIDRVIGRHRSPCMQDRKHMPYTDAMIHEVQRFINFVPTNLPHAVTCDIKFRNYLIPK 243803669

243800623 GTKVLTSLTSVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFLPFSA 243800483

243799465 GKRACVGEGLARMQLFLFLTTILQNFNLKSLVHPKDIDTMPVLNGFASLPPTYQLCFIPS 243799286

 

>CYP2C7 variant unmapped 93% to 2C7 88% to 2C81

3463873 MDLVTFLVLTLSSLILLSLWRQSSRRRKLPPGPTPLPIIGNFLQIDVKNISQSLTK 3464040

3479907 FSKTYGPVFTLYLGSQPTVILHGYEAIKEALIDNGEKFSGRGSYPMIENVTKGF   3480068

3480234 GIVFSNGNRWKEMRRFTIMTFRNLGIGKRNIEDRVQEEAQCLVEELRKTK       3480383

3489182 GSPCDPSLILNCAPCNVICSITFQSHFDYKDKEMLTFMEKVNENLKIMSSPWMQ   3489343

3491162 VCNSFPSLVDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVDYYLIKQKQ 3491338

3505354 ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT 3505494

3406504 AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK 3406692

3408304 GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 3408444

3409602 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLQPKDIDTTPVFPGFASLPPFYELCFIPS 3409778

 

 

New frags on the plus strand between 2C7 and 2C6

 

>CYP2C79-se1[9] frag q Exon 9 100% to 2C79

243885148 GKRICVGEGLARTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIPI* 243885330

>CYP2C-se6[9] frag p exon 9 100% to CYP2C82P-de9b

243895387 GKWICVREDLAQMTLFLFCPTILKNFNLKSQVNPKEL 243895497

 

>seq upstream of 2C11

 

>CYP26A1 AF439720, NM_130408 Chr1 1Mb upstream of CYP2C cluster

242138769 MGLPALLASALCTFVLPLLLFLAALKLWDLYCVSSRDRSCALPLPPG

          TMGFPFFGETLQMVLQ (0) 242138581

242138389 RRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL

          GEHRLVSVHWPASVRTILGAGCLSNLHDSSHKQRKK (0) 242138165

242137906 VIMQAFNREALQCYVPVIAEEVSGCLEQWLSCGERGLLVYPEV

          KRLMFRIAMRILLGCEPGPAGGGEDEQQLVEAFEEMTRNLFSLPIDVPFSGLYR (0) 242137616

242137537 GVKPRNLIHARIEENIRAKIRRLQAAERNAGCKDALQLLIEHSWERGERLDMQ (0) 242137379

242136717 ALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREEIKSK (0) 242136583

242136000 GLLCKSHHEDKLDMETLEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELN (0) 242135848

242135595 GYQIPKGWNVIYSICDTHDVADSFTNKEEFNPDRFTSLHPEDTSRFSFIPFGGGLRSCRSKEFAKI

          LLKIFTVELARRCDWQLLNGPPTMKTSPTVYPVDNLPARFTHFQGDI* 242135254

 

>CYP26C1 XM_217935 94% TO 26C1 MOUSE Chr1 1Mb upstream of CYP2C cluster

242151281 MFSWGLSCLSMLGAAGTALLCAGLLLGLAQQLWTLRWTLSRDWASTLPLPKG

          SMGWPFFGETLHWLVQ (0) 242151079

242150553 GSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRL (0) 242150422

242149883 VLARVFSRPALEQFVPRLQEALRREVRSWCAAQRPVAVYQAAKALTFRMAAR

          ILLGLQLDEARCTELAQTFERLVENLFSLPLDVPFSGLRK (0) 242149608

242148160 GIRARDQLYQHLDEVIAEKLREELTAEPGDALHLIINSARELGRELSVQELK (0) 242148005

242146368 ELAVELLFAAFFTTASASTSLILLLLQHPAAIAKIQQELSAQGLGSPCSCAPRASGSRP

          DCSCEPDLSLAVLGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD (0) 242146051

242144220 GYQIPKGWSVMYSIRDTHETAAVYRSPPEGFDPERFGVESEDARGSGGRFHYI

          PFGGGARSCLGQELAQAVLQLLAVELVRTARWELATPAFPVMQTVPIVHPVD

          GLLLLFHPLPTLGAGDGSPF* 242143843

 

>CYP2C11 J02657 72% to CYP2C6_v1

243377899 MDPVLVLVLTLSSLLLLSLWRQSFGRGKLPPGPTPLPIIGNTLQIYMKDIGQSIKK 243378066

243379842 FSKVYGPIFTLYLGMKPFVVLHGYEAVKEALVDLGEEFSGRGSFPVSERVNKGL 243380003

243380160 GVIFSNGMQWKEIRRFSIMTLRTFGMGKRTIEDRIQEEAQCLVEELRKSK 243380309

GAPFDPTFILGCAPCNVICSIIFQNRFDYKDPTFLNLMHRFNENFRLFSSPWLQVCNT

FPAIIDYFPGSHNQVLKNFFYIKNYVLEKVKEHQESLDKDNPRDFIDCFLNKMEQEKH

NPQSEFTLESLVATVTDMFGAGTETTSTTLRYGLLLLLKHVDVTAKVQEEIERVIGRN

RSPCMKDRSQMPYTDAVVHEIQRYIDLVPTNLPHLVTRDIKFRNYFIPKGTNVIVSLS

SILHDDKEFPNPEKFDPGHFLDERGNFKKSDYFMPFSA

243416959 GKRICAGEALARTELFLFFTTILQNFNLKSLVDVKDIDTTPAISGFGHLPPFYEACFIPVQRADSLSSHL* 243417171

 

>CYP2C24 92% to 2C80, M86678 has alternative splice first exon

no ESTs have this splice

CK481568.1 matches exons 1,2,3,4

CO565602.1 matched the end of the gene sequence and extends it a little 6 aa

Used this EST to blast the trace files to find the end of exon 7

    MDPVLVLVLTLSCLLLLSLWRQSSGRGKLPPGPTPLPIIGNILQIDVKDISKSFTN CK481568.1 exon 1

    QLSCSRKFGLTCGPEAQ

243522306 FTDKLTAKCHSSVSLHIDLPGNLL 243522235 yellow region not P450 seq.

243522073 FSKIYGPVFTLYFGPKPTVVVHGYEAVKEALDDLGEEFSGRGSFPIVERMNNGL 243521912

243521366 GVIFSNGTKWKELRHFSLMTLRNFGMGKRSIEDRIQEEASCLVEELRKTN 243521217

243518830 GSLCDPTFILSCAPSNVICSVVFHNRFDYKDENFLNLMEKLNENFKILNSPWMQ 243518669

    VCNALPAFIDYLPGSHNRVIKNFAEI 676

677 KSYILRRVKEHQETLDMDNPRDFIDCFLIKMEQEKHNPRTEFTIEILMATVSDVFVAGSE 856

857 TTSTTLRYGLLLLLKHIEVT

    AKVQEEIDHVIGRHRRPCMQDRTRMPYTDAMVHEIQRY 1030

gnl|ti|132779224 rts18e73.g from trace files for exon 7

    AKVQEEIDHVIGRHRRPCMQDRTRMPYTDAMVHEIQRYINLIPNNVPHAATCNVRFRNYVIPK

 

>CYP2C80 XM_217906.2 GNOMON exon 2 on AC109577.4 in HTGS 92% to 2C24, 73% to 2C11

MGWLSDP wrong N-term from GNOMON prediction (temp name = CYP2CNEWC)

Correct N-term possibly in a sequence gap

244632544 FSEVYGPVFTLYFGLKPTVVVYGYEVVKEVLDGEEFSGRGVFPIVTKVNNDL 244632389

          this exon 2 does not match 2C24

244632205 GVIFSNGTKWKELRRFSLMTLRNFGMGKRSIEDRIQEEASCLVEELRKTN 244632056

244628281 GSLCDPTFILSCAPSNVICSVIFHNRFDYKDENFLNLMEKFNENFKILNSPWMQ 244628120

244624041 VCNAIPAFIDYLPGSHNKVIKNFAEIKSYILRRVKEHQETLDMDNPRDFIDCFLIKIE 244623868

244620080 QEKHNPCTEFTIQSLVATVTDVFVAGSETTSTTLRYGLLLLLKHTEVT 244619937

244619006 AKVQEEIDHVIGRHRRPCMQDRTRMPYTDAMVHEIQRYINLIPNNVPHAATCNVRFRNYVIPK 244618818

244616897 GTDLITSLTSVLHDDKEFPNPEVFDPGHFLDEHGNFKRSDYFMPFSS 244616757

244614348 GKRMCVGEALARMELFLLLTTIVQNFNLKSFVATKDIDTTPLTNTFGCVPPSYQLYFTPR* 244614166

 

>2C80 EST no ESTs have this splice

CK481568.1 matches second exon

MDPVLVLVLTLSCLLLLSLWRQSSGRGKLPPGPTPLPIIGNILQIDVKDISKSFTN

FSKIYGPVFTLYFGPKPTVVVHGYEAVKEALDDLGEEFSGRGSFPIVERMNNGL

GVIFSNGTKWKELRHFSLMTLRNFGMGKRSIEDRIQEEASCLVEELRKTN

GSLCDPTFILSCAPS

 

>CYP2C79 XM_219933 minus strand 72% to 2C6_v1 95% to seq e, 100% to seq q (exon 9),

93% to seq z (exon 5) (temp name = CYP2CNEWD)

244590183 MILGVFLGLFLTCLLLLSLWKQNFQRRNLPPGPTPLPIIGNILQIDLKDISKSLRN 244590016

244575990 FSKVYGPVFTLYFGRKPAVVLHGYEAVKEALIDHGEEFAGRGIFPVAEKFNKNC 244575829

244575612 GVVFSSGRTWKEMRRFSLMTLRNFGMGKRSIEDRVQEEARCLVDELRKTN 244575463

244553851 GVPCDPTFILGCAPCNVICSIVFQNRFDYKDQEFLALIDILNENVEILSSPWIQ 244553690

244525726 ICNNFPAIIDYLPGRHRKLLKNFAFAKHYFLAKVIQHQESLDINNPRDFIDCFLIKMEQ 244525550

244524359 EKHNPKTEFTCENLIFTASDLFAAGTETTSTTLRYSLLLLLKYPEVT 244524219

244517844 AKVQEEIDHVIGRHRSPCMQDRHHMPYTDAVLHEIQRYIDLLPTSLPHALTCDMKFRDYFIPK 244517656

244516177 GTTVIASLTSVLYDDKEFPNPEKFDPSHFLDENGKVKKSDYFFPFST 244516037

244496745 GKRICVGEGLARTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIPI 244496566

 

>CYP2C79-de9b exon 9 62% to 2C79 2 aa diffs to seq d and seq p minus strand

244491372 G*WICVREDLAQMTLFLFCPTILKNFNLNSQVNPKEL 244491262

 

interval between 2C79 and 2C6

 

>CYP2C6-se1[1:2:3:2:3] frag n exons 1,2,3 2C6 like pseudogene plus strand exon 2,3 100% to seq m

244044941 MDHTTGTYTLSLILLSL*RQSSGRGKIPPGPTPLPIIDNLLQLDIKNVTQYLAN (0) 244045102

244050420 LSKVHGPVLTLYFWMKSNVVLHVDEAVNEDLIDHGE*FAVRRSIPLAEKLIKAL 244050581

244050793 XXXXXXXXXXXXXKTFTLMTLQNLRMGKGNIEDHVQE*AQ 244050873

frag m Exons 2,3 2C6 like pseudogene 100% to seq n

244052306 LSKVHGPVLTLYFWMKSNVVLHVDEAVNEDLIDHGE*FAVRRSIPLAEKLIKAL 244052467

244052679 XXXXXXXXXXXXXKTFTLMTLQNLRMGKGNIEDHVQE*AQ 244052759
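
Pseudogene fragments such as frag m carry "*" and "X" characters inside the translation. This is a minimal Python sketch that tallies them, assuming the usual convention that "*" is an in-frame stop and "X" stands for residues that could not be called. The page itself does not spell this out, and disruption_summary is an illustrative name.

```python
def disruption_summary(peptide):
    """Count in-frame stops ('*') and unknown residues ('X') in a translation."""
    return {
        "stops": peptide.count("*"),
        "unknown": peptide.count("X"),
        "length": len(peptide),
    }

# The two frag m lines, copied from above (coordinates omitted).
frag_m = [
    "LSKVHGPVLTLYFWMKSNVVLHVDEAVNEDLIDHGE*FAVRRSIPLAEKLIKAL",
    "XXXXXXXXXXXXXKTFTLMTLQNLRMGKGNIEDHVQE*AQ",
]
for pep in frag_m:
    print(disruption_summary(pep))
```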

 

>CYP2C7-se2[2:3] frag k exons 2,3 = 100% to 2C7 variant, 2 aa diffs to 2C7

exons 2,3,6,7,9 (6,7 and 9 have 1 aa diff to 2C7)

244064158 FSKTYGPVFTLYLGSQPTVILHGYEAIKEALIDNGEKFSGRGSYPMIENVTKGF 244064319

244064485 GIVFSNGNRWKEMRRFTIMTFRNLGIGKRNIEDRVQEEAQCLVEELRKTK 244064634

 

>CYP2C7-se1[6:7:9] frag j exons 6,7,9 (6,7 and 9 have 1 aa diff to 2C7)

244103321 ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT 244103461

244120225 AKVQEEIDRVIGRHRSPCMQDRKHMPYTDAMIHEVQRFIDFVPTNLPHAVTCDIKFRNYLIPK 244120413

244124319 FLXXXLQNFNLKSLXHPKDIDTMPVLNXXASLPPTYQLCFIPS 244124447

 

>CYP2C13-se1[6] frag h 72% to 2C13 exon 6 plus strand 100% to seq s

70% to 2C12 exon 6  h

244165142 ENGNQQMNYTQEHLATMVTDLL 244165207

244165209 FGGRETLNSTMRFAFLFLMKYPYTT 244165284

 

>CYP2C22-se1[8] frag g exon 8 72% to 2C22  minus strand

244201638 KFDHGNFLDDR 244201606

244201606 GNFK*NDYFMAFLA 244201565

 

>CYP2C13-se3[1:2:3:2:3] frag f Exons 1,2,3,2,3  exon 1 = 66% to 2C13 Minus Strand

exons 2,3 = 57% to 2C13

two identical copies of exons 2,3 100% to seq v exons 2,3

244215468 SQSFLLLLSLSSQISSKGKLPLDPTSLPILGYFF*VLMKDICQSLIN 244215328

244214467 FLKTSGPLYTQHFSLQPAVVFCGYAAVKGAFVDHSR*FS*RGWFSIFGKFSKVQ 244214306

244214137 GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 244213988

244213484                                    R*FS*RGWFSIFGKFSKVQ 244213428

244213259 GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 244213110

 

>CYP2C82P frag e Exons 1,4,4,5,6,7,8,9 almost an exact duplicate of seqs w,x,y,z,

exons 6-9 of the wxyz cluster in a seq gap Plus Strand

244218695 MDPVVVLMPSFSSLLLLSLWRQNSWRRKLPPGPNPLPIIGSFLQIDLNDLCQSLINE (0) 244218865

244233879        LILSYASCNVICSITFQNRFDYKDKEILTLMEKVNENVKIMSSPWIQ 244234019

244240189 GVPCDPTFILGCAPCNVICSIVFQNHFNYKGQEFLALIDTLNENVEILSSPWIQ 244240350

244265531 ICNNFPAIIDYLPGRHRKLLKKFAFAKHYFLAKVIQHKESLDINNPRDFIDCFLIKMEQ 244265707

244266904 KHNPKTEFTCKNLIFTASDLFAAGTETTSPTLRYSLLLLPKYPEV 244267038

244273480 AKVQEEIDHVIGRHRSPCMQDRHHMPYTDAVLHEIQ*YIDLLPTSLPHALTCDMKFRDYFIPK 244273668

244275197 GTTVIASLTSVLYDDKEFPNPEKFDLSHFLDENGKFKKSDYFFPFST 244275337

244286429 GKRICVGEGLAQTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIP 244286605

 

>CYP2C82P-de9b frag d Exon 9 identical to seq p

244289962 GKWICVREDLAQMTLFLFCPTILKNFNLKSQVNPKEL 244290072

 

>CYP2C77-de1b2b3b4b5b frag c Pseudogene 96% to 2C6_v1 exons 1-5 with partial deletion of exon 3 Plus Strand

244337898 MDLVMLLVLTLSCLILLSIWSQSSGRGKLP 244337987

244337987 SGPTPLPIIGNFFHLDLKNITQSLTS 244338064

244339793 FSKVNGSVFTLYFGMKPIVILHGYEAIK*GLIDHREEFTERGSFPVAEKINKGL 244339954

244340129 GIAFSHGNRWKEIRRFTLMTLQNLGMGK 244340212

244341157 GSPCDPTFILGCAPCNVICSIIFQNSFDYKDQDFLSLMEKLNENIKIVSSPWI* 244341318

244342872 FCSSFPVFIDYCPGIHMTLA 244342931

244342933 KNVYHTRNYILKKIKEHQESLDVTNPHDFIDYYLIKWKQ 244343049

 

>CYP2C77 variant of 2C6 13 aa diffs to CYP2C6_v1, 16 aa diffs to 2C6v2

This gene has three frameshifts

244357850 MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS (0) 244358017

244359760 FSKVYGPVFTLYFGMKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKDL (1) 244359921

244360096 GIIFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEE 244360230

244360232 MRKTN 244360246

244361085 GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQ 244361246

244362321 FCSFFPVLIDYCPGSHTTLAKNVYHIRNYL 244362410

244362412 LKKIKEHQESLDVTNPQDFIDYYLIKWKQ 244362498

244381928 ESHNPHSEFTLENLSITVTDLFGAGTETTSTTLRYALLLLLKYPEIT 244382068

244392235 AKVQEEIDRVFGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPM 244392423

244394012 GTTIITSLSSVLHDSKEFPNPEIFDPGHFLDGNGKFKKSDYFMPFSA 244394152

244395307 GKRMFAGEGLA 244395339

244395341 RMELFLFLTTILQNFKLKSVLQPKDIDTTPVFHGFASLPPFYELCFIPL 244395487

 

>CYP2C82P-se[1:4:4:5] frag z Exon 5 minus strand 1 aa diff to seq e

243632036 ICDNFPAIIDYLPGRHRKLLKKFAFAKHYFLAKVIQHKESLDINNPRDFIDCFLIKMEQ (0) 243631860

frag y Exon 4 minus strand 92% to seq e

243654367 GVPCDPTFILGCAPCNVICSIVFQNHFNYKDQEFLALIE 243654251

243654249 LNENVEILSSP*IQ 243654208

frag x exon 4 minus strand 100% to seq e short exon 4

243659542 LILSYASCNVICSITFQNRFDYKDKEILTLMEKVNENVKIMSSPWIQ 243659402

frag w Exon 1 minus strand 100% to seq e

243675609 MDPVVVLMPSFSSLLLLSLWRQNSWRRKLPPGPNPLPIIGSFLQIDLNDLCQSLIN 243675442

 

>CYP2C13-se4[1:2:3] frag v Exon 1 (+) 59% to 2C13

243678671 FLLLLSLSSQISSKGKLPLDPTSLPILGYFF*VLMKDICQSLIN 243678802

Exon 2 (+) 48% to 2C79

243679647 FLKTSGPLYTQHFSLQPAVVFCGYAAVKGAFVDHSR*FS*RGWFSIFGKFSKVQ 243679808

Exon 3 (+) 100% to seq f

243679977 GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 243680126

 

>CYP2C7-se4[8:9] frag u Exon 8 minus strand exon 8 = 87% to frag 2, 8+9 = 63% to 2C7

243726168 GVMVITSLSSALHDNKEFPNPKRFDPG*FLDRNGNFKKTDYFILFSA 243726028

Exon 9 minus strand 60% to 2C7

243723025 CVGEGLTPIELFLFLTRILQNFNLKHLTHTEAVDTTPVLSRLTSVSPALKLFFIP 243722861

 

>CYP2C7-se3[8] frag t Exon 8 minus strand 82% to 2C7

243749788 TIVIT*LTSVLHDSKKFPNPEMLDSGHFLDENGNFKKSEYFMPFSA 243749651

 

>CYP2C13-se2[6:7] frag s Exons 6-7 minus strand 72% to 2C12 exon 6 100% to seq h

243766431 ENGNQQMNYTQEHLATMVTDLL 243766366

243766364 FGGRETLNSTMRFAFLFLMKYPYTT 243766290

243760156 XQINEEIGQVIWRHHSPSMLDWSHMIYTNAMVHEVQRYIDLAPNGVVCEVNCDTKYPRDYFIPK 243759968

 

>CYP2C7-de7b frag r Exon 7 (+) 100% to seq a CYP2C81-de7b

243792966 RVQEEIDQVIGRNPSPCMQDRSHMPYTNAMVHEVQR*SNIVPNNIVYEVTCDTKFRNYFIPK 243793151

 

>CYP2C81 93% to 2C7 28 aa diffs missing exon 1 Plus Strand, 91% to seq j (exons 6,7)

93% to seq k (exons 2,3)

244672079 FSKTYGPVFTLYLGSQPTVILHGYEAIKKALIDHGEKFSGRGSYPMIENVTKGF 244672240

244672408 GIAFSNGNRWKEIRRFTIMTLRNLGMGKRNIEDRVQEEAQRLVEELRKTK 244672557

244681144 GSPCDPSFILNCAPCNVICSITFQNHFDYKDKEILTFMEKVNENVKIMSSPRMQ 244681305

244683123 VCNSFPSLIDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVEYYLIKQKQ 244683299

244699290 ANHIEQSEYSHENLACSIMDLIGAGTETMSSTLRYALLLLMKYPHVP 244699430

244713313 AKVQEEIDHVIGRYRSPCMQDRSHMPYTDAMIHEVQRFINFVPTNLLHAVTCDIKFRNYLIPK 244713501

244717457 GTKVLTSLTSVLHGSKEFPNPEMFDPGHFLDENGNFKKSDYFLPFSA 244717597

244718606 GKRACVGEGLARMELFLFLTTILQNFKLKSLVHPKDIDTRPVLNGFASLPPTYQFCFIPS 244718785

 

>CYP2C81-de7b frag a Exon 7 minus Strand 100% to seq r, 80% to 2C13

244724629 LRVQEEIDQVIGRNPSPCMQDRSHMPYTNAMVHEVQR*SNIVPNNIVYEVTCDTKFRNYFIPK 244724441

 

>CYP2C81-de8b frag 1 Exon 8 93% to 2C7 Plus Strand

244737232 GTTVLTSLTSVLHDSKEFPNPEMFDPGHFLDENRNFKKSDYFMPFSA 244737372

 

>CYP2C81-de8c frag 2 Exon 8 76% to 2C13 Plus Strand 87% to seq u

244764239 GMMVITSLSSVLHYNKEFPNPERFDPGYFLDGNGNFKKTDYFILFSA 244764379

 

>CYP2C81-de1d frag 3 Exon 1 with frameshift Plus Strand 85% to seq e, 83% to seq w

244783632 MDLVVVL 244783652

244783654 CSVSSLLLFSLWRQSSWRRKLPPGPNPLPIIGNFLQIDLNNLCQSLNN (0) 244783797

 

>CYP2C81-de6e7e frag 4 exon 6 70% to 2C13 Plus Strand

244799349 ELELEHLGSMVTDLFFAGIESIRTTMIFALLFLLNTHTSQ 244799468

exon 7 82% to 2C13, 86% to seq r and seq a

244801583 LQNRSHMPYTNAMVHEVQRYSDIVPNNIVHEVTSDTKFRNYFIPK (0) 244801717

 

>CYP2C81-de1f2f3f frag 5 Exons 1,2,3 84% to 2C7 variant Minus Strand

244826982 MDLVTFLVLTLSSLILLSLWR*NSRRRKLPPGPTPLLIIGNFLQLDVKNVSQSLTM (0) 244826815

244813456 FSKAYGPVFTLYLGSQPTVILHGYEAVKETLIDHGEEFSGRGSFPMVEKAFKCF 244813295

244813129 GIVFSNGNR*KEIRQFIIMTLQNLGMGKRNIEDHVQEEAQCLVEELRKTK 244812980

 

very large gap 244845025-245223024 378kb

 

>CYP2C13v1 100% first 5 exons

Note this seq also on 100.0%    Un  ++   17276272  17282257

Exons 6-9 are on       99.1%    Un  ++   17323193  17358099 2 aa diffs to 2C13 J02861

CYP2C12 is also on this same contig 99.6%    Un  ++   17388090  17446950 2 aa diffs

Minus Strand HSPs:

 

245246208 MDPVVVLLLSLFFLLFLSLWRLSSGRGKLPPGPTPLPIIGNFFQVDMKDIRQSLTN (0) 245246041

245244920 FSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPICEKVAKGQ (1) 245244759

245244599 GIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTN 245244450

245240888 GSPCDPQFIMGCAPGNVICCIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQ (0) 245240727

245239607 VFNIFPILLDYCPGNHNIYLKNYTWVKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQ 245239431

 

>CYP2C13-de1b2b frag 7 Exon 1 76% to 2C13 Minus Strand

245307855 MDPIVVLVLSLSCLLFLSLWRNNSRRGKLPPGPTPLPIIRNYLQLDMKDIC*SLTK (0) 245307688

frag 6 Exon 2 83% to 2C13 Minus Strand

245292652 FSKTYGPVYTLYFGSQPTVLLYGYEALKEALIDYGEAFSGRGRIPIHEKVSKGQ 245292491

 

>CYP2C22-se2[1:2] frag 9 Exon 1 61% to 2C22 Minus Strand

245347583 MDLFIILWICFACLSLFFLWNQLHYKEKLPPGPVPLPIVGNILQVNIKSIIKSLNI (0) 245347416

frag 8 Exon 2 79% to 2C22 Minus Strand

245334622 LAKEYGPVFTVYLGMKPTVVLHGHKALKEALIDRANEFSVKMQSSLLSKESQGL (1) 245334461

 

>CYP2C12 J03786 80% to 2C13

MDPFVVLVLSLSFLLLLYLWRPSPGRGKLPPGPTPLPIFGNFLQ

IDMKDIRQSISNFSKTYGPVFTLYFGSQPTVVLHGYEAVKEALIDYGEEFSGRGRMPV

FEKATKGLGISFSRGNVWRATRHFTVNTLRSLGMGKRTIEIKVQEEAEWLVMELKKTK

GSPCDPKFIIGCAPCNVICSIIFQNRFDYKDKDFLSLIENVNEYIKIVSTPAFQVFNA

FPILLDYCPGNHKTHSKHFAAIKSYLLKKIKEHEESLDVSNPRDFIDYFLIQRCQENG

NQQMNYTQEHLAILVTNLFIGGTETSSLTLRFALLLLMKYPHITDKVQEEIGQVIGRH

RSPCMLDRIHMPYTNAMIHEVQRYIDLAPNGLLHEVTCDTKFRDYFIPKGTAVLTSLT

SVLHARKEFPNPEMFDPGHFLDENGNFKKSDYFMPFSAGKRKCVGEGLASMELFLFLT

TILQNFKLKSLSDPKDIDINSIRSEFSSIPPTFQLCFIPV

 

>CYP2C13v1 J02861 80% to 2C12

MDPVVVLLLSLFFLLFLSLWRPSSGRGKLPPGPTPLPIIGNFFQVDMKDIRQSLTN

FSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPICEKVAKGQ

GIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTN

GSPCDPQFIMGCAPGNVICSIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQ

VFNIFPILLDYCPGNHNIYFKNHTWLKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQ

ENANQWMNYTLEHLAIMVTDLFFAGIETVSSTMRFALLLLMKYPHVT

AKVQEEIDHVIGRHRSPCMQDRSHMPYTNAMVHEVQRYIDIGPNGLLHEVTCDTKFRNYFIPK

GTAVLTSLTSVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFIPFSA

GKRMCLGESLARMELFLFLTTILQNFKLKSLVDPKDINTTPICSSLSSVPPTFQMRFIPL

 

>CYP2C13v2 Not in figure, probable 2C13 allele NM_138514, 7 aa diffs to 2C13v1 (98%)

80% to 2C12 (temp name = CYP2CNEWA)

MDPVVVLLLSLFFLLFLSLWRLSSGRGKLPPGPTPLPIIGNFFQ

VDMKDIRQSLTNFSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPI

CEKVAKGQGIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTN

GSPCDPQFIMGCAPGNVICCIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQVFNI

FPILLDYCPGNHNIYLKNYTWVKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQENA

NQWMNYTLEHLAIMVTDLFFAGIETVSSTMRFALLLLMKYPHVTAKVQEEIDHVIGRH

RSPSMQDRSHMPYTNAMVHEVQRYIDIGPNGLLHDVTCDTKFRNYFIPKGTAVLTSLT

SVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFIPFSAGKRMCLGESLARMELFLFLT

TILQNFKLKSLVDPKDINTTPICSSLSSVPPTFQMRFIPL
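
The "aa diffs" and percent figures quoted in the headers can be reproduced for two ungapped sequences of equal length by counting mismatched positions. The sketch below is only an illustration under that assumption (no gaps, same length). The function names count_diffs and percent_identity are hypothetical, and the example uses only the exon 1 residues of CYP2C13v1 and CYP2C13v2 as listed here, not the full proteins.

```python
def count_diffs(a, b):
    """Count mismatched positions between two equal-length, ungapped sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be the same length (no gaps handled)")
    return sum(x != y for x, y in zip(a, b))

def percent_identity(a, b):
    """Percent of positions that are identical between two equal-length sequences."""
    return 100.0 * (len(a) - count_diffs(a, b)) / len(a)

# Exon 1 residues copied from the two entries above.
v1_exon1 = "MDPVVVLLLSLFFLLFLSLWRPSSGRGKLPPGPTPLPIIGNFFQVDMKDIRQSLTN"
v2_exon1 = "MDPVVVLLLSLFFLLFLSLWRLSSGRGKLPPGPTPLPIIGNFFQVDMKDIRQSLTN"

print(count_diffs(v1_exon1, v2_exon1))       # differences within exon 1 only
print(percent_identity(v1_exon1, v2_exon1))
```

Sequences with insertions or deletions would need a real alignment first; this position-by-position count is only valid when the two sequences line up residue for residue.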

 

>CYP2C22 M58041 61% to 2C79

245425985 MALFIFLGIWLSCLVFLFLWNQHHVRRKLPPGPTPLPIFGNILQVGVKNMSKSMCM 245425818

LAKEYGPVFTMYLGMKPTVVLYGYEVLKEALIDRGEEFSDKMHSSMLSKVSQGL

GIVFSNGEIWKQTRRFSLMVLRSMGMGKRTIENRIQEEVVYLLEALRKTN

GSPCDPSFLLACVPCNLISSVIFQHRFDYSDEKFQKFIENFHTKIEILASPWAQ

LCSAYPVLYYLPGIHNKFLKDVTEQKKFILMEINRHRASLNLSNPQDFIDYFLIKMEKEKHN

EKSEFTMDNLIVTIGDLFGAGTETTSSTIKYGLLLLLKYPEVTAKIQEEITRVIGRHR

RPCMQDRNHMPYTDAVLHEIQRYIDFVPIPLPRKTTQDVEFRGYHIPK

GTSVMACLTSALHDDKEFPNPEKFDPGHFLDEKGNFKKSDYFMAFSA

GRRACIGEGLARMEMFLILTSILQHFILKPLVNPEDIDTTPVQPGLLSLPPPFQLCFIPV

 

>CYP2C23 X55446 59% to 2C11

MELLGFTTLALVVSVTCLSLLSVWTKLRTRGRLPPGPHPPSHYW

ESTATEPQGHPASLSKLAKEYGPVYTLYFGTSPTVVLHGYDVVKEALLQQGDEFLGRG

PLPIIEDTHKGYGLIFSNGERWKVMRRFSLMTLRNFGMGKRSLEERVQEEAWCLVEEL

QKTKAQPFDPTFILACAPCNVICSILFNDRFQYNDKTFLNLMDLLNKNFQQVNSVWCQ

MYNLWPTIIKYLPGKHIEFAKRIDDVKNFILEKVKEHQKSLDPANPRDYIDCFLSKIE

EEKDNLKSEFHLENLAVCGSNLFTAGTETTSTTLRFGLLLLMKYPEVQAKVHEELDRV

IGRHQPPSMKDKMKLPYTDAVLHEIQRYITLVGSSLPHAVVQDTKFRDYVIPKGTTVL

PMLSSVMLDQKEFANPEKFDPGHFLDKNGCFKKTDYFVPFSLGKRACVGESLARMELF

LFFTTLLQKFSLKTLVEPKDLDIKPITTGIINLPPPYKLCLVPR

 

>CYP2D1 J02867

MELLNGTGLWSMAIFTVIFILLVDLMHRRHRWTSRYPPGPVPWP

VLGNLLQVDLSNMPYSLYKLQHRYGDVFSLQKGWKPMVIVNRLKAVQEVLVTHGEDTA

DRPPVPIFKCLGVKPRSQGVILASYGPEWREQRRFSVSTLRTFGMGKKSLEEWVTKEA

GHLCDAFTAQAGQSINPKAMLNKALCNVIASLIFARRFEYEDPYLIRMVKLVEESLTE

VSGFIPEVLNTFPALLRIPGLADKVFQGQKTFMALLDNLLAENRTTWDPAQPPRNLTD

AFLAEVEKAKGNPESSFNDENLRMVVVDLFTAGMVTTATTLTWALLLMILYPDVQRRV

QQEIDEVIGQVRCPEMTDQAHMPYTNAVIHEVQRFGDIAPLNLPRFTSCDIEVQDFVI

PKGTTLIINLSSVLKDETVWEKPHRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP

LARMELFLFFTCLLQRFSFSVPVGQPRPSTHGFFAFPVAPLPYQLCAVVREQGL

 

>CYP2D pseudogene Chr 7  ++  120811066 120811206 2 aa diffs to 2D2/2D3 exon 8

between 2D1 and 2D3

GTTLIPNLSSLLNDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSA

 

>CYP2D2 X52027 X52455

MGLLIGDDLWAVVIFTAIFLLLVDLVHRHKFWTAHYPPGPVPLP

GLGNLLQVDFENMPYSLYKLRSRYGDVFSLQIAWKPVVVINGLKAVRELLVTYGEDTA

DRPLLPIYNHLGYGNKSKGVVLAPYGPEWREQRRFSVSTLRDFGVGKKSLEQWVTEEA

GHLCDTFAKEAEHPFNPSILLSKAVSNVIASLVYARRFEYEDPFFNRMLKTLKESFGE

DTGFMAEVLNAIPILLQIPGLPGKVFPKLNSFIALVDKMLIEHKKSWDPAQPPRDMTD

AFLAEMQKAKGNPESSFNDENLRLVVIDLFMAGMVTTSTTLSWALLLMILHPDVQRRV

HEEIDEVIGQVRRPEMADQARMPFTNAVIHEVQRFADIVPTNIPHMTSRDIKFQGFLI

PKGTTLIPNLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP

LARMELFLFFTCLLQRFSFSVLAGRPRPSTHGVYALPVTPQPYQLCAVAR

 

>CYP2D3 X52028

MELLAGTGLWPMAIFTVIFILLVDLMHRRQRWTSRYPPGPVPWP

VLGNLLQVDLCNMPYSMYKLQNRYGDVFSLQMGWKPVVVINGLKAVQELLVTCGEDTA

DRPEMPIFQHIGYGHKAKGVVLCTYGPEWREQRRFSVSTLRNFGVGKKSLEQWVTDEA

SHLCDALTAEAGRPLDPYTLLNKAVCNVIASLIYARRFDYGDPDFIKVLKILKESMGE

QTGLFPEVLNMFPVLLRIPGLADKVFPGQKTFLTMVDNLVTEHKKTWDPDQPPRDLTD

AFLAEIEKAKGNPESSFNDANLRLVVNDLFGAGMVTTSITLTWALLLMILHPDVQCRV

QQEIDEVIGQVRHPEMADQAHMPFTNAVIHEVQRFADIVPMNLPHKTSRDIEVQGFLI

PKGTTLIPNLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP

LARMELFLFFTCLLQRFSFSVPTGQPRPSDYGVFAFLLSPSPYQLCAFKR

 

>CYP2D4 M22331, X52029, X52457 I,T,P seen in ESTs TDI or ANV seen in ESTs same lib

MRMPTGSELWPIAIFTIIFLLLVDLMHRRQRWTSRYPPGPVPWP

VLGNLLQIDFQNMPAGFQKLRCRFGDLFSLQLAFESVVVLNGLPALREALVKYSEDTA

DRPPLHFNDQSGFGPRSQGVVLARYGPAWRQQRRFSVSTFRHFGLGKKSLEQWVTEEA

RCLCAAFADHSGFPFSPNTLLDKAVCNVIASLLFACRFEYNDPRFIRLLDLLKDTLEE

ESGFLPMLLNVFPMLLHIPGLLGKVFSGKKAFVAMLDELLTEHKVTWDPAQPPRDLTD

AFLAEVEKAKGNPESSFNDENLRVVVADLFMAGMVTTSTTLTWALLFMILHPDVQCRV

QQEIDEVIGQVRRPEMADQARMPFTNAVIHEVQRFADILPLGVPHKTSRDIEVQGFLI

PKGTTLITNLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP

LARMELFLFFTCLLQRFSFSVPTGQPRPSDYGIFGALTTPRPYQLCASPR

 

>CYP2D18 U48219 S77859 only 5 aa diffs, probably = 2D4

MRMPTGSELWPIAIFTIIFLLLVDLMHRRQRWTSRYPPGPVPWP

VLGNLLQIDFQNMPAGFQKLRCRFGDLFSLQLAFESVVVLNGLPALREALVKYSEDTA

DRPPLHFNDQSGFGPRSQGVVLARYGPAWRQQRRFSVSTFRHFGLGKKSLEQWVTEEA

RCLCAAFADHSGFPFSPNTLLDKAVCNVIASLLFACRFEYNDPRFIRLLDLLKDTLEE

ESGFLPMLLNVFPMLLHIPGLLGKVFSGKKAFVAMLDELLTEHKVTWDPAQPPRDLTD

AFLAEVEKAKGNPESSFNDENLRVVVADLFMAGMVTTSTTLTWALLFMILRPDVQCRV

QQEIDEVIGQVRRPEMADQARMPFTNAVIHEVQRFADILPLGVPHKTSRDIEVQGFLI

PKGTTLIINLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP

LARMELFLFFTCLLQRFSFSVPAGQPRPSNYGVFGALTTPRPYQLCASPR

 

>CYP2D5 X52030 X52458

MELLNGTGLWPMAIFTVIFILLVDLMHRHQRWTSRYPPGPVPWP

VLGNLLQVDPSNMPYSMYKLQHRYGDVFSLQMGWKPMVIVNRLKAVQEVLVTHGEDTA

DRPPVPIFKCLGVKPRSQGVVFASYGPEWREQRRFSVSTLRTFGMGKKSLEEWVTKEA

GHLCDAFTAQNGRSINPKAMLNKALCNVIASLIFARRFEYEDPYLIRMLTLVEESLIE

VSGFIPEVLNTFPALLRIPGLADKVFQGQKTFMAFLDNLLAENRTTWDPAQPPRNLTD

AFLAEVEKAKGNPESSFNDENLRMVVVDLFTAGMVTTATTLTWALLLMILYPDVQRRV

QQEIDEVIGQVRCPEMTDQAHMPYTNAVIHEVQRFGDIAPLNLPRITSCDIEVQDFVI

PKGTTLIINLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP

LARMELFLFFTCLLQHFSFSVPAGQPRPSTLGNFAISVAPLPYQLCAAVREQGH

 

>CYP2D pseudogene z chr7:120386407-120386565  exon 9 (+ strand) 73% to 2D3

ACLGEPLTCMELFLFFICLLQSFSFSVKAGQPRPSNHGIFEMPISPSSYQLCA

 

>CYP2E1 J02627

MAVLGITIALLVWVATLLVISIWKKIYNSWNLPPGPFPLPILGN

IFQLDLKDIPKSFTKLAKRFGPVFTLHLGSRRIVVLHGYKAVKEVLLNHKNEFSGRGD

IPVFQEYKNKGIIFNNGPTWKDVRRFSLSILRDWGMGKQGNEARIQREAQFLVEELKK

TKGQPFDPTFLIGCAPCNVIADILFNKRFDYNDKKCLRLMSLFNENFYLLSTPWIQLY

NNFADYLRYLPGSHRKIMKNVSEIKQYTLEKAKEHLQSLDINCARDVTDCLLIEMEKE

KHSQEPMYTMENVSVTLADLFFAGTETTSTTLRYGLLILMKYPEIEEKLHEEIDRVIG

PSRVPAVRDRLDMPYMDAVVHEIQRFINLVPSNLPHEATRDTVFQGYVIPKGTVVIPT

LDSLLYDSHEFPDPEKFKPEHFLNENGKFKYSDYFKAFSAGKRVCVGEGLARMELFLL

LSAILQHFNLKSLVDPKDIDLSPVTVGFGSIPPQFKLCVIPRS

 

>CYP2F4 AF017393 end of exon 5 and exon 6 in seq gap in genome seq chr1 (+)

82269864 MDGVSTAILLLLLAVISLSLTFTSWGKGQLPPGPKPLPILGNLLQLRSQDLLTSLTK 82270034

82270123 LSKDYGSVFTVYLGPRRVIVLSGYQTVKEALVDKGEEFSGRGSYPIFFNFTKGN 82270284

82272477 GIAFSDGERWKILRRFSVQILRNFGMGKRSIEERILEEGSFLLDVLRKTE 82272626

82276791 GKPFDPVFILSRSVSNIICSVIFGSRFDYDDERLLTIIHFINDNFQIMSSPWGE 82276952

82277413 MYNIFPSLLDWVPGPHRRVFRNFGGMKD 82277496

         LIARSVREHQDSLDPNSPRDFIDCFLTKMV

         QEKQDPLSHFNMDTLLMTTHNLLFGGTETVGTTLRHAFLILMKYPKVQ

82279507 ARVQEEIDCVVGRSRMPTLEDRASMPYTDAVIHEVQRFADVIPMNLPHRVIRDTPFRGFLIPK 82279695

82281147 GTDVITLLNTVHYDSDQFKTPQEFNPEHFLDANQSFKKSPAFMPFSA 82281287

82282297 GRRLCLGEPLARMELFIYLTSILQNFTLHPLVEPEDIDLTPLSSGLGNLPRPFQLCMRIR 82282476

 

>CYP2G1 M33296 J04715 M34444 chr1 (+)

81996311 MALGGAFSIFMTLCLSCLLILIAWKRTSRGGKLPPGPTPIPFLGNLLQVRIDATFQSFLK 81996490

81997087 LQKKYGSVFTVYFGPRPVVILCGHEAVKEALVDQADDFSGRGEMPTLEKNFQGY 81997248

81998826 GLALSNGERWKILRRFSLTVLRNFGMGKRSIEERIQEEAGYLLEELHKVK 81998975

82001483 GAPIDPTFYLSRTVSNVICSVVFGKRFDYEDQRFRSLMKMINESFVEMSMPWA 82001641

82002014 QLYDMYWGVIQYFPGRHNRLYNLIEELKDFIASRVKINEASFDPSNPRDFIDCFLIKMY 82002190

82004063 QDKSDPHSEFNLKNLVLTTLNLFFAGTETVSSTLRYGFLLLMKYPEVE 82004206

82005533 AKIHEEINQVIGTHRTPRVDDRAKMPYTDAVIHEIQRLTDIVPLGVPHNVIRDTHFRGYFLPK 82005721

82005841 GTDVYPLIGSVLKDPKYFRYPEAFYPQHFLDEQGRFKKNDAFVAFSS 82005981

82007190 GKRICVGEALARMELFLYFTSILQRFSLRSLVPPADIDIAHKISGFGNIPPTYELCFMAR 82007369

 

>CYP2J3 U39943

MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYP

PGPWRLPLVGCLFHLDPKQPHLSLQQFVKKYGNVLSLDFANIPSVVVTGMPLIKEIFT

QMEHNFLNRPVTLLRKHLFNKNGLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQ

EEAYHLVEAIKDEGGLPFDPHFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEA

MCLESSMMCQLYNIFPRILQYLPGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEPRD

FIDAFLKEMAKYPDKTTTSFNEENLICSTLDLFFAGTETTSTTLRWALLCMALYPEVQ

EKMQAEIDRVIGQGRQPNLADRDSMPYTNAVIHEVQRIGNIIPFNVPREVAVDTYLAG

FNLPKGTMILTNLTALHRDPKEWATPDTFNPEHFLENGQFKKRESFLPFSMGKRACLG

EQLARSELFIFITSLIQKFTFKPPVNEKLSLQFRMSVTISPVSHRLCAIPRL

 

>CYP2J3 91% to mouse 2j9 exon 8 in a seq gap

116772039 MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYPPGPWRLPLVGCLFHLDPKQPHLSLQQ 116771830

116767788 FVKKYGNVLSLDFANIPSVVVTGMPLIKEIFTQMEHNFLNRPVTLLRKHLFNKN 116767791

116766010 GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAYHLVEAIKDEG 116765861

116765445 GLPFDPHFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEAMCLESSMMCQ 116765284

116760602 LYNIFPRILQYLPGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEPRDFIDAFLKEMAK 116760426

116758387 YPDKTTTSFNEENLICSTLDLFFAGTETTSTTLRWALLCMALYPEVQ 116758247

116754923 EKMQAEIDRVIGQGRQPNLADRDSMPYTNAVIHEVQRIGNIIPFNVPREVAVDTYLAGFNLPK 116754735

          GTMILTNLTALHRDPKEWATPDTFNPEHFLENGQFKKRESFLPFSM

116749991 GKRACLGEQLARSELFIFITSLIQKFTFKPPVNEKLSLQFRMSVTISPVSHRLCAIPRL 116749815

 

>CYP2J3P1 U40000

  24 MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYPPGPWRLPLVGCLFHLD 203

 204 PKQPHLSLQQFVKKYGNVLSLDFANIPSVVVTGMPLIKEIFTQMEHNFLNRPVTLLRKHL 383

 384 FNKNGLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQGEAYHLVEAIKDEGGLPFDP 563

 564 HFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEAMCLESSMMCQLYNIFPRILQYL 743

 744 PGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEGRDFIDAFLKEMAKYPDKTTTSFNEEN 923

 924 LICSTLDLFFAGTETTSTTLRWALLCMALYPEVQEKMQAEIDRVIGQGRQPNLADRDSMP 1103

1104 YTNAVIHEVQRIGNIIPFNVPRKVAVDTYLAGFNLPKGTMILTNLTALHRDPKEWATPDT 1283

1284 FNPEHFLENGQFKKRESFLPFSM 1352

1415 GKRACLGEQLARSELFIFITSLIQKFTFKPPVNEKLSLQFRMSVTISPVSHRLCAIPRL 1591

 

>CYP2J3P2 U40004

  13 MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYPPGPWRLPLVGCLFHLD 192

 193 PKQPHLSLQQFVKKYGNVLSLDFANIPSVVVTGMPLIKEIFTQMEHNFLNRPVTLLRKHL 372

 373 FNKNGLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAYHLVEAIKDEGGLPFDP 552

 553 HFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEAMCLESSMMCQLYNIFPRILQYL 732

 733 PGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEPRDFIDAFLKEMAKYPDKTTTSFNEEN 912

 913 LICSTLDLFFAGTETTSTTLRWALLCMALYPEVQEKMQAEIDRVIGQGRQPNLADRDSMP 1092

1093 YTNAVIHEVQRIGNIIPFNVPREVAVDTYLAGFNLPKG 1206

     RDPKEWATPDTFNPEHFLENGQFKKRESFLPFSMGKRACLGEQLARSEL 1348

1349 FIFITSLIQKFTFKPPVNEKVSLQFRMSVTISPVSHRLCAIPRL 1480

 

>CYP2J5P exons 1-4 69% to 2j5 mouse now a pseudogene ortholog

116785102 MITSLSSLVTSSWAALLLRTLLLAAVTFLFLAGILRRHRPKDYQPGPWRLPFVGNFFQIDFEQSHLVLQK 116784893

116784415 FAKKYGNVFSLELDRPSVVVVTGQPLIKTKMFTHLEQNFANHFVTSVRKRAIGNN 116784251

116781318 GLITSNGQTWKEKRRFALMTLKNFGLGKKSLEQRMHE*AFHLVEARREEG 116781169

116780474 GQPVDLHLINNAVANVICSITFGGRFEYEDCQFQEMPTLLDEALHV 116780337

 

>CYP2J4

116734902 MLATAGSLIATIWAALHLRTLLVAALTFLLLADYFKTRRPKNYPPGPWGLPFVGNIFQLDFGQPHLSIQP 116734693

116725983 FVKKYGNIFSLNLGDITSVVITGLPLIKETFTHIEQNILNRPLSVMQERITNKN 116725822

116723426 GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRMQEEAHYLVEAIREEK 116723277

116722875 GKPFNPHFSINNAVSNIICSVTFGERFEYHDSRFQEMLRLLDEVMYLETTMISQ 116722714

116718583 LYNIFPWIMKYIPGSHQTVFRNWEKLKLFVSSMIDDHRKDWNPEEPRDFIDAFLKEMSK 116718407

116716306 YPEKTTSFNEENLICSTLDLFFAGTETTSTTLRWALLYMALYPEVQ 116716169

116713582 EKVQAEIDRVIGQKRAASLADRESMPYTNAVIHEVQRMGNIIPLNVPREVAMDTTLNGFHLPK 116713394

116711364 GTMVLTNLTALHRDPKEWATPDVFNPEHFLENGQFKKRESFLPFSM 116711227

116708412 GKRACLGEQLARSELFIFFTSLMQKFTFKPPTNEKLSLKFRNGLTLSPVTHRICAVPRE* 116708233

 

>CYP2J4-de6b  w

116706163 XXXXXXSFCEENLTCRTLDFLYAGIDTISNRLHWVLLLTCVNPEXX 116706053 exon 6

 

>CYP2J16-de2b5b9b x

116691748 KKYGNIFGLNLGDLTSEVITGLLLSKE 116691668 exon 2

116684743 FYDIFPYLMKYIPGITSNCFQKLGKLKLFVSCMTDEHRRDWNPEDPRNFTDALLKEMMK 116684567 exon 5

116677505 GKRACPGEQLARSKLFIFFTALIQKFTF 116677422

116677420 RLGMKSILGLTLSPVTHHI*ALSKQ 116677346 exon 9

 

>CYP2J16

116664772 MLATVGSLLAKIWSAINFWTLLLTLLTFLLLADYLKNRRPNNYPPGPWRLPFVGNLFQFDLNISHLHLRIQQ 116664557

116654396 FVKKYGNLISLDFGNISVVVITGLPLIKEALINNEQNFLKRPIVPSRYRVFKDN 116654235

116651622 GIFFANVHKWKEQRRFALTMLKNFGLGKKSLEQCIQEEAHHLVEVIGEEK 116651473

116650955 GQPFDPHFRINNAVSNIICSITFGERFEYDDSQFQELLKLADEVICSEASMTSV 116650794

116640170 LYNVFPLIFKYLPGPHQTVFKNWEKLKSIVANMIDRHRKDWNPDEPRDFVDAFLTEMTK 116639994

116638624 YPDKTTTSFNEENLIATTLDLFFAGTETTSTTLRWALLYITLNPEVQ 116638484

116627938 EKVHSEIDRVIGHGRLPSTDDQDAMPYTNAVIHEVLRMGNIIPLNVPREVTADSTLAGFHLPK 116627750

116624337 GKMILTNLTALHRDPKEWATPDTFNPEHFLENGQFKKRDSFLPFSV 116624200

116612610 GKRACPGEKLAKSELFIFFTALMQNFTFKAPTNEKLSLKLRKGLSLYPVSYRICAVPR 116612437

 

>CYP2J16-de5c6c9c y

72% to 2j6 mouse

116604392 LYNIFPWIMNYGPGSHQ 116604342 116604222 exon 5

116604345 SVFRNWEKLKLFVSCMIDNKQRWVP 116604271 exon 5

116602255 YPEKSTSFSQGHLFCSTLNLFRAGSET 116602175 exon 6

116591992 GKRACPGEQMAISELFSFFAAFMQ 116591921 exon 9

116591919 KFTFHLAINEKLRMKFRNGLTLP*SSHLYC 116591830 exon 9

 

>CYP2J17P

116584536 MLATASCLVANVCSAIPLWTLLLAALSWLPQKQAPQKQPSRALAPAIFGNLFQFDLDVSQLHSGI*PSKK 116584327 exon 1

116581102 FVTKYGNLISLDFGNTSSVIISGLPLIKEALTDM 116581001 exon 2

116580637 EQNLLKCIVLASREHVFKNN 116580578 exon 2 last half

116570454 LYNVFPFIIKYL 116570419 exon 5

116570408 NQTFFRNWENLNLFVSHMMESHRKDWNPVEPRDFIDAFLTYMTKEDD 116570268 exon 5 last half

116566151 KVHSEIDGVTGHGRPPSTGDRDSMPYTNAVIYEVLRMDNINPLKVPREVTADSTLDEFCLSK 116565966 exon 7

116563406 GTMVLINLTALYRESKEWTTQDTFNPEHFLENGMFKKRESF 116563284 exon 8

116559748 KFTFKPPISEKLSLKFRTGLTLSHVSCRI*SIHR 116559647 exon 9

 

>CYP2J18P

63% to 2j6 mouse

116551335 MLGTQDILEAGIWALLH 116551285 exon 1

116551282 RTLLLAAVTFLLLADYLKTGNK 116551217 exon 1

116551217 KKYPWGPCNPPVMNNLFQLDLEQ 116551149 exon 1

116537661 LYNAFLSIMKYHPGSHQ 116537611 exon 5

116537614 SVFRNWEKLIWRMSHIAENHCKG*NPAEL 116537528 exon 5

116537523 REFIDAFLTKMTK 116537485 exon 5

116534551 YPDKTTTNFNEENLICA 116534501 exon 6

116534498 LEFLFARTEITSTTLSWVLLYLSANPGVQ 116534412 exon 6

116529361 LFIFFTSLMQKFTFKPPISEKLILKFRMGLILSPVCH*ICVVPRQ* 116529224 exon 9

 

>CYP2J10 XM_233199  ortholog of mouse Cyp2j12

Predicted GNOMON 86% to 2j12 mouse (LOC313373), mRNA.

2J10 seq specific rev primer matches 116499966-116499989

forward primer 1 = 116515946 116515968

116516004 MLSTEDTLEAAIRALLHFRTLLLAAVTFLFLANYLKTRRPKNYPPGPWRLPFVGNLFQLDVKQPHVVIQK 116515795

116508667 FVKKYGNLTSLDFGTIPSVVITGLPLIKEAFTNTEQNFLNRPVTPLRKRVFNNN 116508506

116505791 GLIMSNGQTWKEQRRFTMTTLKNFGLGKRSLEQRIQEEANYLVEAIGADK 116505642

116505144 GQPFDPHFKINSAVSNIICSITFGERFEYEDSLFQELLRLLDEASCLESSMMCQ 116504983

116500081 LYNVFPTIIKYLPGSHQTVLRNWEKLKLFISCMMDSHQKDWNPDEPRDFIDAFLTEMAK 116499905

116496152 YRDKTTTSFNKENLIYSTLDLFFAGSETTSNILRWSLLYITTNPEVQ 116496012

116489147 EKVHSEIDRVIGHRRQPSTGDRDAMPYTNAVIHEVLRMGNIIPLNVPREMTADSTLAGFHLPK 116488959

116488244 GTTILTNLTGLHRDPKEWATPDTFNPEHFLENGQFKKRDSFLPFSM 116488107

116479687 GKRACPGEQLARTELFIFFTALMQNFTFKPPVNETLSLKFRNGLTLAPVSHRICAVPRQ 116479511

 

>CYP2J13 XM_233198 1455 bp ortholog of mouse Cyp2j13

Predicted GNOMON Rattus norvegicus similar to CYP2J4 (LOC313372), mRNA.

Missing exon 1 74% to XM_233199, 79% to 2J4 78% to 2J3 90% to 2j13 mouse

116449294 FVKKYGNVISLDLGIMSSVIISSLPLIKEAFSHLDENFINRPIFPLQKHIFNDN 116449133

116446157 GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAHHLVEAIGEEE 116446008

116445630 GQPFDPHFKINNAVSNIICSITFGERFEYHDSQFQELLKLLDKAMYLGTPMMIH 116445469

116440971 LYNMFPWIIKHLPGQHQTLLATWGKLKSYIADIIENHREDWNPAEPRDFIDAFLNEMAK 116440795

116428766 YPDKTTTSFNEENLICSTLDLFLAGTETTSTTLRWAVLYMALYPEVQ 116428626

116426881 EKVQAEIDQVIGQEKHPSLADRDSMPYTNAVVHEIQRMGNIVPLNVPREVAVDTTLAGFHLPK 116426693

116426568 GSVVMTNLTALHMDPKEWATPDVFNPEHFLENGQFKKRDSFLPFSM 116426431

116423270 GKRACLGEQLARSELFIFFTALMQKFTFKPPTNEKLSLKFRLGITISPVSHRICAVPRL 116423094

 

>CYP2R1 XM_341909

MFQLPGVQTCAGALAGAFLLLLLVLVVRQLLRQRRPAGFPPGPP

RLPFIGNICSLALSADLPHVYMRKQSRVFGEIFSLDLGGISTVVLNGYDVVKECLVHQ

SEIFADRPCLPLFMKMTKMGGLLNSRYGRGWIDHRRLAVNSFHYFGSGQKSFESKILE

ETWSLIDAIETYKGRPFDLKQLITNAVSNITNLILFGERFTYEDTDFQHMIELFSENV

ELAASAPVFLYNAFPWIGILPFGKHQRLFRNADVVYDFLSKLIEKAAVNRKPHLPQNF

VDAYLDEMDKGQNDPLSTFSKENLIFSVGELIIAGTETTTNVLRWAVLFMALYPNIQG

QVHKEIDLIMGHDRRPSWEDKCKMPYTEAVLHEVLRFCNIVPLGIFHATSEDAVVRGY

SIPKGTTVITNLYSVHFDEKYWKDPDMFYPERFLDSSGYFTKKEALIPFSLGRRHCLG

EQLARMEMFLFFTSLLQQFHLHFPHELVPNLKPRLGMTLQPQAYLICAERR

 

>CYP2S1 (XM_218347 N-term incorrect) CK473647.1 EST with N-term chr1 (-)

no duplicate exon 4 as in the mouse

81101539 MEAASTWALLLALLLLLLALTLPRTPARGQLPPGPTPLPLLGNLLQLRPGALYSGFLR 81101366

81100365 LSKKYGPVFTVHLGPWRRVVVLVGHDAIREALGGQAEEFSGRGTLATLDKTFDGH 81100201

81097187 GVFFANGERWKQLRKFTLLALRDLGMGKREGEELIQEEVQNLVKAFQRTE 81097038

81095195 GRPFNPSMLLAQATSNVVCSLIFGIRLPYEDKEFQAVIQAASGTLLGISSPWG 81095037

81094957 QAYEMFSWLLQPLPGPHTQLQHHLGTLAAFTIQQVQRHQGRSHTSGPARDVVDAFLQKMA 81094778

81093521 QEKEDPGTEFTEKNLLMTVTYLLFAGTMTIGATIRYALLLLLKYPQVQK 81093375

81090341 KRVREELIQELGPSRTPSLSDRVRLPYTDAVLHEAQRLLALVPMGMPHTVTKTTSFRGYTLPK 81090156

81088138 GTEVFPLIGSVLHDPAVFRNPEEFHPSRFLDDDGRIRKHEAFLPYSL 81087998

81087846 GKRVCLGEGLARAELWLFFTSILQAFSLDTPCPPGDLSLKPAVRGLFNIPPDFQLQVWPTGDQSR 81087652

 

>CYP2T1 AF368269 (Genbank translation is incorrect in green region) chr1 (+)

82307700 MVTCEATLLLLLILTLMLMSWGWLAHQARARMQKDLPPGPAPLPLLGNLLQLQSGHLDRVLME 82307888

82308155 LSSRWGPVFTVWLGPRPAVVLSGYAALRDALVLQADAFSGRGSMAVFERFTHGN 82308316

82308634 GIVFSNGPRWRTLRNFALGALKEFGVGTSTIEERILEETACVLDEFQATM 82308783

82309030 GAPFDPRRLLDNAVSNVICTVVFGKRYNYGDPEFLRLLDLFSDNFRIMSSRWGE 82309191

82309932 TYNMFPSFMDWIPGPHHRIFKNFQELRLFISEQIQWHRQSRQTGEPRDFIDCFLEQMDK 82310108

82310183 EHQDPESHFQDETLVMTTHNLFFGGTETTSTTLRYGLLIMLKYPEVA 82310323

82310430 AKVQEELDATVGRTRAPSLADRAHLPYTNAVLHEIQRFISVLPLGLPRALIRDVNLRNHFLHK 82310618

82310840 GTFVIPLLVSAHRDPTQFKDPDHFNPTNFLDDQGEFQNNDAFMPFAP 82310840

82311073 GKRMCLGAGLARSEIFLFLTAILQKFSLLPVGSPADIDLTPQCTGLGNVPPAFQLRLVAR 82311252

 

>CYP2U1 XM_227677 gc boundary caused misassembly 90% to mouse 2U1

MSSIGGLRPAAGEQPGVGPHLQAVGGALLLCGLAVLLDWVWLQR

QRAGGIPPGPKPRPLVGNFGYLLLPRFLRLHFWLGSGSQTDTVGRHVYLARLARVYGN

IFSFFIGHRLVVVLSDFQSVREALVQQAEVFSDRPRMPLISILTKEKGIVFAHYGPIW

KQQRRFSHSTLRHFGLGKLSLEPRIIEEFAYVKAEMQKHGEAPFSPFPVISNAVSNII

CSLCFGQRFDYTNKEFKKVLDFMSRGLEICLHSQLFLINLCPWFYYLPFGPFKELRQI

ERDITCFLKNIIKEHQESLDANNPQDFIDMYLLHTQEEKDKCKGTNFDEDYLFYIIGD

LFIAGTDTTTNSLLWCLLYMSLNPGVQKKVHEEIERVIGRDRAPSLTDKAQMPYTEAT

IMEVQRLSMVVPLAIPHMTSEKT (1)

GYSIPKG

TVVLPNLWSIHRDPVIWEKPDDFCPHRFLDDQGQLLKRETFIPFGIGKRVCMGEQLAK

MELFLMFVSLMQSFTFALPEGSEKPIMTGRFGLTLAPHPFNVTVSKR

 

>CYP2W1 XM_221971 92% to mouse 2W1

MELLVLCVWGILLLLGLWGLLRGCAQDPSMTRQWPPGPRPLPFL

GNLHLLGVTHQDRALMELSERYGPMFTIHLGSQKTVVLSGYEVVREALVGTGHELADR

PPIPIFQLIQRGGGIFFSSGVHWKVARQFTVRTLQSLGIRQPPMVGKVLQELVCLKGQ

LDSYGGQPFPLTLLGWAPCNITFTLLFGQRFDYQDPVFVSLLSLIDQVMVLLGSPGIQ

LFNTFPRLGALLRLHRPVLSKIEEVRTILRTLLEAQRPPLPNGSPARSYVEALLQQGQ

DDPEDMFSEANVLACTLDMVMAGTETTAATLQWAVFLMVKHPHVQGRVQEELDRVLGP

GQLPQPEDQRALPYTSAVLHEVQRYITLLPHVPRCTAADIQLGGYLLPKGTPVIPLLT

SVLLDKTQWETPSQFNPNHFLDAKGCFMKRGAFLPFSTGRRVCVGESLARTELFLLFA

GLLQQYHLLPPPGLSPADLDLRPAPAFTMRPPAQTLRVVPRS

 

>CYP2AB1 (XM_221297 N-terminal incorrect) AC107471.6 N-term 92% to mouse

189790 MFSLFGGMAFLAGSFLLLKLAALCWRRSHLPPGPFPFPLLGNLWQLNFRLHPNMLFQ (0) 189620

LAQTHGNVFTVWLGSTPIVVLNGFRAVKEALVSNSEQFSGRPLTPFFRD

LFGEKGVICSNGLTWRQQRRFCLTTLRELGLGKQALELQLQHEAAELAEVFHQEQGRA

FDPQVPIIRSTTRVIGALVFGHHFLSEEPIFLELIRAINLGLAFASTTWRRLYDMFPW

ALRYLSGPHQKIFQYHEAVRGFIHHEIIRHKLRTPEAPKDFISCYLSQITKAMDDPVS

TFSEENLIQVVIDLFLGGTDTTATTLHWAIIYLVHHRAIQERVQQELDEVLGTAQAVC

YEDRERLPYTRAVLHEVQRLSSVVAVGAVRQCVTPTWMHGYYVSKGTIILPNLASVLC

DPECWETPHQFNPGHFLDKDGDFVTNEAFLPFSAGHRVCPGEQLARMELFLMFATLLR

TFRFQLPEGSQGLRLEYVFGGTLQPQPQKICAVPRLSSLSPREP

 

>CYP2AC1 NW_044163.1|Rn9_1523 chromosome 9

3425457 MSGFDFSAILALLGLILILILNIKDFMAKASKRQCPPGPKPWPVIGNLHILNLKRPYQTMLE 3425272

3423187 LSKKYGPIYSIQMGPRKVVVLSGYETVKDALVNYGNQFGERSQVPIFERLFDGK 3423026

3415443 GIAFAHGETWKTMRRFSLSTLRDFGMGKRTIEDTIVVECQHLIQSFESHK 3415294

3412018 GKPFEIKRVLNASVANVIVSMLLGKRFDYEDPQFLRLLTLIGENIKLIGNPSIV 3411857

3410639 LFNIFPILGFLLRSHKKVLRNRDELFSFIRRTFLEHCHNLDKNDPRSFIDAFLVKQQ 3410469

3410029 ENNKSADYFNEENLLALVSNLFTAGTETTAATLRWGIILMMRYPEVQS 3409886

3408812 KVHDEIHKVVGSAQPRIEHRTQMPYTDAVIHEIQRVANILPTSLPHETSTDVVFKNYYIPK 3408627

3406238 GTEVITLLTSVLRDQTQWETPDAFNPAHFLSSKGRFVKKEAFMPFSV 3406098

3402907 GRRMCAGEPLAKMELFLFFTSLMQKFTFQPPPGVSYLDLDLTPDIGFTIQPLPHKICALLRTSAL* 3402710

 

>CYP3A2 M13646

MDLLSALTLETWVLLAVILVLLYRLGTHRHGIFKKQGIPGPKPL

PFLGTVLNYYKGLGRFDMECYKKYGKIWGLFDGQTPVFAIMDTEMIKNVLVKECFSVF

TNRRDFGPVGIMGKAVSVAKDEEWKRYRALLSPTFTSGRLKEMFPIIEQYGDILVKYL

KQEAETGKPVTMKKVFGAYSMDVITSTSFGVNVDSLNNPKDPFVEKTKKLLRFDFFDP

LFLSVVLFPFLTPIYEMLNICMFPKDSIAFFQKFVHRIKETRLDSKHKHRVDFLQLML

NAHNNSKDEVSHKALSDVEIIAQSVIFIFAGYETTSSTLSFVLYFLATHPDIQKKLQE

EIDGALPSKAPPTYDIVMEMEYLDMVLNETLRLYPIGNRLERVCKKDIELDGLFIPKG

SVVTIPTYALHHDPQHWPKPEEFHPERFSKENKGSIHPYVYLPFGNGPRNCIDMRFAL

MNMKLALTKVLQNFSFQPCKETQIPLKLSRQAILEPEKPIVLKVLPRDAVINGA

 

>CYP3A pseudogene 200kb from 3A9

17387311 LYEGRQPVLAITDPDIIKTVLVKECYSTFTNRR 17387213 exon 4

 

>CYP3A9 U46118

MDLIPNFSMETWLLLVISLVLLYLYGTHSHGIFKKLGIPGPKPL

PFLGTILAYRKGFWEFDKYCHKKYGKLWGLYDGRQPVLAITDPDIIKTVLVKECYSTF

TNRRNFGPVGILKKAISISEDEEWKRIRALLSPTFTSGKLKEMFPIINQYTDMLVRNM

RQGSEEGKPTSMKDIFGAYSMDVITATSFGVNVDSLNNPQDPFVEKVKKLLKFDIFDP

LFLSVTLFPFLTPLFEALNVSMFPRDVIDFFKTSVERMKENRMKEKEKQRMDFLQLMI

NSQNSKVKDSHKALSDVEIVAQSVIFIFAGYETTSSALSFVLYLLAIHPDIQKKLQDE

IDAALPNKAHATYDTLLQMEYLDMVVNETLRLYPIAGRLERVCKTDVEINGVFIPKGT

VVMIPTFALHKDPHYWPEPEEFRPERFSKKNQDNINPYMYLPFGNGPRNCIGMRFALM

NMKVALVRVLQNFSFQPCKETQIPLKLSKQGLLQPEKPLLLKVVSRDETVNGA

 

>CYP3A pseudogene z 64% to 3A18

9216052 FGPVGFMKKAVTISEDDEGKRLRPLLSPVFTSGK 9216153

9216386 LWLLHFGLMCSPSSGSVMSVKHLRQEEKGEPIHMKE 9216493

9216722 FSGAYSMNGIAGASFGVNVDSLNN 9216781

XXXXXXXXXXXXXXXXXXXXXXXX

VVLFPFLTQI

 

>CYP3A18 X79991

MEIIPNLSIETWVLLATSLMLFYIYGTYSHGLFKKLGIPGPKPV

PLFGTIFNYGDGMWKFDDDCYKKYGKIWGFYEGPQPFLAIMDPEIIKMVLVKECYSVF

TNRRCFGPMGFMKKAITMSEDEEWKRLRTILSPTFTSGKLKEMFPLMRQYGDTLLKNL

RREEAKGEPINMKDIFGAYSMDVITGTSFGVNVDSLNNPQDPFVQKAKKILKFQIFDP

FLLSVVLFPFLTPIYEMLNFSIFPRQSMNFFKKFVKTMKKNRLDSNQKNRVDFLQLMM

NTQNSKGQESQKALSDLEMAAQAIIFIFGGYDATSTSISFIMYELATRPNVQKKLQNE

IDRALPNKAPVTYDALMEMEYLDMVVNESLRLYPIATRLDRVSKKDVEINGVFIPKGT

VVTIPIYPLHRNPEYWLEPEEFNPERFSKENKGSIDPYVYLPFGNGPRNCIGMRFALI

SMKLAVIGVLQNFNIQPCEKTQIPLKISRQPIFQPEGPIILKLVSRD

 

>CYP3A23/CYP3A1 D13912

MDLLSALTLETWVLLAVVLVLLYGFGTRTHGLFKKQGIPGPKPL

PFFGTVLNYYMGLWKFDVECHKKYGKIWGLFDGQMPLFAITDTEMIKNVLVKECFSVF

TNRRDFGPVGIMGKAISVSKDEEWKRYRALLSPTFTSGRLKEMFPVIEQYGDILVKYL

RQEKGKPVPVKEVFGAYSMDVITSTSFGVNVDSLNNPKDPFVEKAKKLLRIDFFDPLF

LSVVLFPFLTPVYEMLNICMFPKDSIEFFKKFVYRMKETRLDSVQKHRVDFLQLMMNA

HNDSKDKESHTALSDMEITAQSIIFIFAGYEPTSSTLSFVLHSLATHPDTQKKLQEEI

DRALPNKAPPTYDTVMEMEYLDMVLNETLRLYPIGNRLERVCKKDVEINGVFMPKGSV

VMIPSYALHRDPQHWPEPEEFRPERFSKENKGSIDPYVYLPFGNGPRNCIGMRFALMN

MKLALTKVLQNFSFQPCKETQIPLKLSRQGLLQPTKPIILKVVPRDEIITGS

 

>CYP3A62 AB084894 80% to CYP3A9, 78% to Cyp3a13

MDLIPNISLETWMLLATILVLLYLYGTSTHGNFKKLGISGPKPL

PFVGNILAYRHGFWEFDRHCHKKYGDIWGFYEGRQPILAITDPDIIKTVLVKECYSTF

TNRRSFGPAGILKKAITLSEDEEWKRLRTLLSPTFTSGKLKEMFPIINQYADLLVKNV

KHEAEKGNPITMKDIFGAYSMDVITGTSFGVNVDSLNNPQNPFVQKVKKLLKFNFLDP

FFLSVILFPFLTPVFEAFDITVFPKDVMKFFRTSVERMKENRMQEKVKQRLDFLQLMI

NSQSSGDKESHQGLTDVEIVAQSIFFIFAGYETTSSALSFALYLLATHPDLQKKLQDE

IDAALPNKAPVTYDVLVEMEYLDMVLNETLRLFPVGGRLERVCKKDVEINGVFIPKGT

VVMVPTFALHKDPKCWPEPEEFCPERFRKKNQDSINPYIYLPFGNGPRNCIGMRFALM

NMKIALVRVLQNFSFGLCKETQIPLKLRKKGFFQPEKPIILRAVSRD

 

>CYP3A71P new pseudogene 78% to 3A2

9497641 LFEWHTPVFAITDREMIKNVLVKECFSVFTNWR 9497543 exon 4

9468424 DLGPMGIMNKSIAF*KDEEWKRYRALLSPMFTSGKLKV 9468311

9468234 MFPIIKLYGDILVKYLRQEAEKGKPVSVKE 9468145

9467886 IFGAYSMDVITSTSFGVNVDSLNNPKDPFVEKTKKFLRLDYFDPLFISV 9467737

9464756 GLFPFLKPIYDMLNISVFPKDSIAFFKNFVYSMKESHLDSKQK 9464631

9453844 YQVDFFQLMMNAHNNSSESHK 9453782

9451670 FPALSDIEIIAQSIIFTFGGYDTTSSTLSFVLYSLATHSDVQKKLQEEIDHALPNK 9451503

9449744 ASPTYDIVMEMEYLDMVFNETLRLYPVTGRLHRMCKKDIELDGVFIPKG

        SMVMIPLYPLQHDPQHWPEPEEFRPE 9449520

9447351 RFSKENKCRTGHYVYLPFGNGPRNCLGMRFALMSMKLAVTKVLQNFSFHPCKET 9447190

9444444 QIPLKLSKQVILKPEKPIVLKVVPRDGVING 9444352

 

>CYP3A73 chr12_random_1.5 (from UCSC browser)

MDLVSALSLETWLLLAIILVLFYR (2)

FGTRTHGIFKKQGIPGPKPLPFLGTVLNYYR (0)

GLWKFDMECYKKCGKIWG (2)

LFDGQTPVFAIMDTEMIKSVLVKECFSVFTNRR (0)

NIGPVGIMSKSISVAKDEEWKRYRAFLSPTFTSGRLKE (0)

MFPIIEHYGDILVKYLKQKVEKGKPLAMKE (2)

VFGAYSMDVITSTSFEVNINSINNPKDPFVEKVKKFQRFDFFDPLFLSV (1)

VLFPFLTPIYEMLNICLFPKDSVAFFQKFVYRMKQTRLDSKHK (0)

HRVDFLQLMMNAHNNSKDKVSHK (1)

ALSDIEIVAQAIIFIFASYETTSSTLSFVLYSLATHPDSQKKLQEEIDRALPNK (0)

APPTYDTVMEMEYLDMVLNETPRLYPIGYRLERVCKKDIKLDGVFIPKGSVVMIPFYTLQHDPQHWPEPEEFLPER (2)

FSKENKGSIDPYVYLPFGNGPRNCIGMRFALMNMKLALTKVLQNFSFQLCEETQ (0)

IPLKLSRQRLFGPEKPIVLKVVPRDAVITGA*

 

>CYP4A1 M14972 NM_175837  1 AA DIFF

MSVSALSSTRFTGSISGFLQVASVLGLLLLLVKAVQFYLQRQWL

LKAFQQFPSPPFHWFFGHKQFQGDKELQQIMTCVENFPSAFPRWFWGSKAYLIVYDPD

YMKVILGRSDPKANGVYRLLAPWIGYGLLLLNGQPWFQHRRMLTPAFHYDILKPYVKN

MADSIRLMLDKWEQLAGQDSSIEIFQHISLMTLDTVMKCAFSHNGSVQVDGNYKSYIQ

AIGNLNDLFHSRVRNIFHQNDTIYNFSSNGHLFNRACQLAHDHTDGVIKLRKDQLQNA

GELEKVKKKRRLDFLDILLLARMENGDSLSDKDLRAEVDTFMFEGHDTTASGVSWIFY

ALATHPKHQQRCREEVQSVLGDGSSITWDHLDQIPYTTMCIKEALRLYPPVPGIVREL

STSVTFPDGRSLPKGIQVTLSIYGLHHNPKVWPNPEVFDPSRFAPDSPRHSHSFLPFS

GGARNCIGKQFAMSEMKVIVALTLLRFELLPDPTKVPIPLPRLVLKSKNGIYLYLKKL

H

 

>CYP4A2 M57719 M33938

MGFSVFSPTRSLDGVSGFFQGAFLLSLFLVLFKAVQFYLRRQWL

LKALEKFPSTPSHWLWGHNLKDREFQQVLTWVEKFPGACLQWLSGSTARVLLYDPDYV

KVVLGRSDPKPYQSLAPWIGYGLLLLNGKKWFQHRRMLTPAFHYDILKPYVKIMADSV

SIMLDKWEKLDDQDHPLEIFHYVSLMTLDTVMKCAFSHQGSVQLDVNSRSYTKAVEDL

NNLIFFRVRSAFYGNSIIYNMSSDGRLSRRACQIAHEHTDGVIKTRKAQLQNEEELQK

ARKKRHLDFLDILLFAKMEDGKSLSDEDLRAEVDTFMFEGHDTTASGISWVFYALATH

PEHQERCREEVQSILGDGTSVTWDHLDQMPYTTMCIKEALRLYSPVPSVSRELSSPVT

FPDGRSIPKGIRVTILIYGLHHNPSYWPNPKVFDPSRFSPDSPRHSHAYLPFSGGARN

CIGKQFAMNELKVAVALTLLRFELLPDPTRIPVPMPRLVLKSKNGIHLRLKKLR

 

>CYP4A8v1 M37828

MSGSALSFTIFPGSILGFLQIATVLTVLLLLLKTAQFYLHRRWL

LRATQQFPSPPSHWFFGHKIPKDQDFQDILTRVKNFPSACPQWLWGSNVRIQVYDPEY

MKLILGRSDPKAHGSYRFLAPWIGYGLLLLNGQTWFQHRRMLTPAFHYDTLKPYVGIM

ADSVRIMLDKWEQIVGQDSTLEIFQHITLMTLDTIMKCAFSQEGSVQLDRKYKSYIKA

VEDLNNLFFFRVQNMFHQNDFIYSLSSNGRKAHNAWQLAHDYTDQVIKSRKAQLQDEE

ELQKVKQKRRLDFLDILLFARIENGSSLSDKDLRAEVDTFMFEGHDTTASGISWIFYA

LATNPEHQQGCRKEIQSLLGDGASITWDDLDKMPYTTMCIKEALRIYPPVTAVSRMLS

TPVTFPDGRSLPKGITVMLSFYGLHHNPTVWPNPEVFDPYRFAPESSRHSHSFLPFSG

GARNCIGKQFAMNELKVAVALTLLRFELLPDPTRIPIPIPRLVLKSKNGIYLRLKKLQ

 

>CYP4A8v2 97% to 4A8 BC081771

MSGSALSFTIFPGSILGFLQIATVLTVLLLLFKTAQFYLHRRWL

LRATQQFPSPPSHWFFGHKIPKDQEFQDILTRVKNFPSACPQWLWGSNVRIQVYDPDY

MKLILGRSDPKSHHSYRFLAPWIGYGLLLLNGQTWFQHRRMLTPAFHYDTLKPYVGIM

ADSVRIMLDKWEQIVGQDSTLEIFQHITLMTLDTIMKCAFSQEGSVQLDRKYKSYIKA

VEDLNNLSFFRIRNIFHQNDIIYSLSSNGRKARSAWQLAHEHTDQVIKSRKAQLQDEE

ELQKVKQKRRLDFLDILLFARIENGSSLSDKDLRAEVDTFMFEGHDTTASGISWIFYA

LATNPEHQQGCRKEIQSLLGDGASITWDDLDKMPYTTMCIKEALRIYPPVTAVSRMLS

TPVTFPDGRSLPKGITVMLSFYGLHHNPTVWPNPEVFDPYRFAPESSRHSHSFLPFSG

GARNCIGKQFAMNELKVAVALTLLRFELLPDPTRIPIPIPRLVLKSKNGIYLRLKKLQ

 

>CYP4A8v2-de1b  rat

            UCSC browser a in fig

CYP4A exon 1 pseudogene chr5:135545150-135545338 (- strand)

135545338 MSIFELSHITTGFGISGLLQMVSWLGLLLLLLFKAAQYYLHRQWIIKSVQHFPSPPSHWFFGN 135545150

 

>CYP4A8v2-de5c6c12c  rat

            UCSC browser b in fig

135596695-135596537 (- strand) exon 5

SVLLPQNKWE*TISDSLEIFQCASLITLATILMCVFSY*DNVHLN

135596212-135596066 (- strand) exon 6

HSQTYTQVVGILNNLRNAFPQSDIFYRMTADGHRTKNAFLIAHKHSDFV

135594009-135593884 (- strand) exon 12

KQFIMNEMKVVITLTLLCFEWLLDPTRVSVSISGFLLNPRMG

 

>CYP4A8v2-de4d12d  rat

            UCSC browser c in fig

135628319-135628215 (- strand) A exon 4

51% to 4A8

LSNDQTWFQHY*HI*TPLLHCDILKSNVRIVADCI

135617441-135617274 (- strand) B exon 12

75% to 4A2 76% to 4A8

RICIGKQLAMNAQKLAVALTLLQFELLPDPTRVPIPTEKLVLKSKNGIHLHLRKLQ

 

>CYP4A34P new pseudogene seq between 4A3 and 4A2 T

65% to 4A2 135837702-135845966 (+ strand)

MGIFELSHITTVFGISRLLQMVFWLGLLLLLFKAAQYYLRRQWIIKSFQQFPFPPSHWLFGNFLK

135838759 KDQDLQQIRLWVEKFPTACVRWFWGNHACVLIYDPD*MKVILG*S 135838887 aa 65-107

(seq gap)

GYSLLLLNGKKWFQHRQMLTPAFHSDILKPYVGIMA

FSIFLLQDKWEELVGQDCPLEIYQDISLMTMETLINCAFSYQGSVQLE

NSRS*IKAVEDLTHLIHFRVRNGFH*SNIIYNLSSNGGSFHCACQIAHKHKG

DRVIRRRKVQLQSGVELEKIWKKWHLDLLDILLFAQ

EDGKSLSDEDLHAEVDTFMFEGHDTAARGISWIFYALPTHPEHQERCKEEVQSILGDGTSVTW

DHLDQMPYTTMCIKKALRLYPPGPAVSRELSTPVTFPDGCSSPKNSRISVV

IFGLHHNPRL*PNPE

VLDPFRFAPDVPQHTHAFLPFSAGAR

NCIRKLFAMNELKVAVTLTLL*LELLPDPTRVPFLVARTVLKSKIRIYLHLKKLK

 

>CYP4A34P-de12b C-term aa 453-508 with one frameshift

135831318 RNCIGEHFAMNELKVAMALTLL 135831383

135831383 QFELLPDPTRIPIPIPRLVLKSKNGIYLHLKKLQ 135831484

 

>CYP4A3 M33936

MGFSVFTPTRSLDGVSGFFQGAFLLSLFLVLFKAVQFYLRRQWL

LKALEKFPSTPSHWLWGHDLKDREFQQVLTWVEKFPGACLQWLSGSKTRVLLYDPDYV

KVVLGRSDPKASGIYQFLAPWIGYGLLLLNGKKWFQHRRMLTPAFHYGILKPYVKIMA

DSVNIMLDKWEKLDDQDHPLEIFHYVSLMTLDTVMKCAFSHQGSVQLDVNSRSYTKAV

EDLNNLTFFRVRSAFYGNSIIYNMSSDGRLSRRACQIAHEHTDGVIKMRKAQLQNEEE

LQKARKKRHLDFLDILLFAKMEDGKSLSDEDLRAEVDTFMFEGHDTTASGISWVFYAL

ATHPEHQERCREEVQSILGDGTSVTWDHLDQIPYTTMCIKEALRLYPPVPSVSRELSS

PVTFPDGRSIPKGITTTILIYGLHHNPSYWPNPKVFDPSRFSPDSPRHSHAYLPFSGG

ARNCIGKQFAMNELKVAVALTLLRFELLPDPTRIPVPMARLVLKSKNGIHLRLKKLR

 

>CYP4A33P 135689641-135700825 (+) between 4A8 and 4A2

DKWEQIVGQDSTLEIVQHNTLMTLDTIMKCAFSQEGSVQLDR

KYKSYIKAVGDLNNLSFFRIWNIFHQNDIIYSLSSNGCQANSAC*LAHEHT

DQVIKSRKAQLQDEEELQKVKQKRRLDFLDILLFAR

IENGSSLSDKDLRAEVDTFMFEGHDTTASGISWIFYALATNPEHQQGCRKEIQSLLGDGASITW

DDLDKMPYTTMCIKEALSIYPPVPSVSRMLSTPVTFPDGCSLPK

GITAVLSFYGHHHNPTL*PNPE

VFDPYRVFPXSSQHSHLFLPFSGGAR

NCIGKQFAMNELKVAIALTLLCLRLLPDPTRIPIPIPRLVLKSKNGIYLHLKKLQ

 

>CYP4A33P-de4b5b12b  rat

            UCSC browser

135662099-135662209 (+ strand) F exon 4

LSNDQTWFQH*HILTPLFHYGILKTNVRIIVDSVHEM

135671387-135671512 (+ strand) G exon 5

DISDSLEIFQCASLIALATIMMCAFSYQDNVHLNRSVTSQSF

135676914-135677039 (+ strand) S exon 12

KQFIMNEMKVVITLTLLCFEWLLDPTRVSVSISGFLLNPRMG

 

>CYP4A33P-de10c11c  rat

            UCSC browser

135712750-135712827 (+ strand) exon 10

GVLISFSICGLHHNPRLWPNAE (0)

135713048-135713125 (+ strand) R exon 11

VFDPFRFAPDVLRHTHAFLPFSAGAR

 

>CYP4B1 M29853

MVLNFLSPSLSRLGLWASVVILMVIVLKLFSLLLRRQKLARAMD

SFPGPPTHWLFGHALEIQKLGSLDKVVSWAQQFPHAHPLWFGQFVGFLNIYEPDYAKA

VYSRGDPKAADVYDFFLQWIGKGLLVLDGPKWFQHRKLLTPGFHYDVLKPYVAIFAES

TRMMLDKWEKKASENKSFDIFCDVGHMALDTLMKCTFGKGDSGLGHRDNSYYLAVSDL

TLLMQQRIDSFQYHNDFIYWLTPHGRRFLRACKIAHDHTDEVIRQRKAALQDEKERKK

IQQRRHLDFLDILLGVRDESGIKLSDAELRAEVDTFMFEGHDTTTSGISWFLYCMALY

PEHQQLCREEVRGILGDQDSFQWDDLAKMTYLTMCMKECFRLYPPVPQVYRQLNKPVT

FVDGRSLPAGSLISLHIYALHRNSTVWPDPEVFDPLRFSPENAAGRHPFAFMPFSAGP

RNCIGQQFAMNEMKVVTALCLLRFEFSLDPSKMPIKVPQLILRSKNGIHLYLKPLASR

SGK

 

>CYP4F39 UPSTREAM OF 4F5 chr7 (+) 94% to mouse 4f39 = ortholog

13051717 MLPITDYLLYLLGLEKTAFRVYVLSALLLFLLFLLFRLLLQAFKLFS

         DFRITCRRLSCFPEPPGRHWLLGHMSM 13051938

13054230 YLPNEKGLQNEKKVLDTMHHIILAWVGPFLPLLVLVHPDYIKPVLGAS 13054373

13064279 AAIAPKDEFFYSFLKPWL 13064332

13064958 GDGLLISKGNKWSRHRRLLTPAFHFDILKPYMKIFNQSVNIMH 13065086

13065271 AKWRRHLAEGSVTSFDMFEHVSLMTLDSLQKCVFSYSSDCQE 13065396

13067956 KLSDYISSIIELSALVVRRQYRLHHYLDFIYYLTADGRRFRQACDTVHNFTTEVIQQRRR

         ALRELGAEAWLKAKQGKTLDFIDVLLLAK 13068222

13072211 DEEGKELSDEDIRAEADTFMFE 13072276

13072388 GHDTTSSGLSWALFNLAKYPEYQDKCREEIQEVMKGRELEELDW 13072519

13076560 DDLTQLPFTTMCIKESLRQFPPVTLISRRCTEDIKLPDGRIIPK 13076691

13078724 GIICLVSIYGTHYNPLVWPDSK 13078789

13079451 VYNPYRFDPDIPQQRSPLAFVPFSAGP 13079531

13079947 RNCIGQSFAMAEMRVVVALTLLRFRLSVDRTRKVRRKPELILRTENGLWLNV

         EPLPSRAGVPRGPTEPEVQAPPAQA* 13080177

 

>CYP4F17 = CYP4F19temp AI030199 EST CHR7 13095557  13103056 chr7 (+)

90% to 4f17 next closest 82%, probable ortholog of 4f17

13095557 MLQLSLSWLGRGPVTVSPWQLLLVVGTSLLLARILAWISAFYDN

         YCRLRCFPQPPSRHWFWGHLNL 13095754

13102916 VKNNEEGLQLLAEMSHQFQDIHLCWIGIFYPILRLIHPKFIGPILQA 13103056

13103866 AAAVAPKEMIFYGFLKPWL 13103922

13104011 GDGLLVSAGEKWSRQRRLLTPAFHFDILKPYVKNFNKSVNIMH 13104139

13105431 AKWQRLTAKGSARLDMFEHISLMTLDSLQKCVFSFDSNCQE 13105553

13106281 SPSEYIAAIQELSSLIVKRHHQPFLYMDFLYYLTADGRRFRKACDLVHNFTDAVIRERRR

         TLSSQSVDEFLKSKTKSKTLDFIDVLLLAK 13106550

13106863 DEHGKELSDEDIRAEADTFMFG 13106928

13107118 GHDTTASALSWILYNLARHPEHQERCRQEVRELLRDREPEEIEW 13107249

13110034 DDLTQLPFLTMCIKESLRLHPPVTVISRCCTQDVVLPDGRVIPK 13110165

13110226 GNDCIISIFGVHHNPSVWPDPE 13110303

13110457 VYDPFRFDSENPQKRSPLAFIPFSAGP 13110537

13110878 RNCIGQTFAMNEMKVAVALTLLRFRLLPDDKEPRRKPELILRAEGGLWLRVEPLSTGAQ 13111054

 

>CYP4F5/4f16 13119940  13133265 chr7 (+) 3 aa diffs to mRNA U39207 90% to 4f16 89% to 4f37

13119940 MPWLTVSGLDLGSVVTSTWHLLLLGAASWILARILAWTYSFCENCSRLRCFPQSPKRNWFLGHLGT 13120137

13122954 IQSNEEGMRLVTEMGQTFRDIHLCWLGPVIPVLRLVDPAFVAPLLQAP 13123097

13125947 ALVAPKDTTFLRFLKPWL 13126000

13126086 GDGLFLSSGDKWSRHRRLLTPAFHFDILKPYVKIFNQSVNIMH 13126214

13227610 VKWKHLCVEGSAHLEMFENISLMTLDSLQKCLFGFDSNCQE 13227732

13128135 SPSEYISAILELSSLIIKRSQQLFLYLDFLYYRTADGRRFRKACDLVHNFTDAVIRERRR

         LLSSQGTDEFLESKTKSKSKTLDFIDVLLLAK 13128410

13129071 DEHGKELSDEDIRAEADTFMFG 13129136

13129327 GHDTTASALSWILYNLARHPEYQERCRQEVWELLRDREPEEIEW 13129458

13132295 DDLAQLPFLTMCIKESLRLHPPAIDLLRRCTQDIVLPDGRVIPK 13132426

13132538 GNICVISIFGIHHNPSVWPDPE 13132603

13132764 VFDPFRFDSENRQKRSPLSFIPFSAGP 13132844

13133089 RNCIGQTFAMNEMKVVVALTLLRFRVLPDDKEPRRKPEIILRAEGGLWLRMEPLSTDTQ 13133265

 

>CYP4F5? AF288818 7aa diffs to 4F5 probably same gene

MPWLTVSGLDLGSVVTSTWHLLLLGAASWILARILAWTYSFCEN

CSRLRCFPQSPKRNWFLGHLGT

IQSNEEGMRLVTEMGQTFRDIHLCWLGPVIPVLRLVDPAFVAPLLQAP

ALVAPKDPTFLHFLKPWL

GDGLFLSSGDKWSRHRRLLTPAFHFDILKPYVKIFNQSVNIMH

AKWKHLCLEGSVRLEMFENISLMTLDSLQKCLFGFDSNCQE

SPSEYISAILELSSLIIKRSQQLFLYLDFLYYRTADGRRFRKACDLLHNFTDAVIRERRR

LLSSQGVDEFLESKTKSKSKTLDFIDVLLLAK

DEHGKELSDEDIRAEADTFMFG

GHDTTASALSWILYNLARHPEYQERCRQEVWELLRDREPEEIEW

DDLAQLPFLTMCIKESLRLHPPAIDLLRRCTRHIVLPDGRVIPK

GNICVISIFGIHHNPSVWPDPE

VFDPFRFDSENRQKRSPLSFIPFSAGP

RNCIGQTFAMNEMKVVVALTLLRFRVLPDDKEPRRKPEIILRAEGGLWLRMEPLSTDTQ
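
A count such as "7aa diffs to 4F5" can be reproduced by comparing the two translations position by position. The sketch below is only an illustration: the function name count_diffs is hypothetical, and it assumes the two peptides are already aligned and of equal length (no gaps). It compares the third exon segment of the genomic CYP4F5 translation with the matching segment of AF288818.

```python
def count_diffs(a: str, b: str) -> int:
    """Count mismatched positions between two aligned, equal-length peptides."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for x, y in zip(a, b) if x != y)


# Third exon segment: genomic CYP4F5 translation vs. the AF288818 entry.
genomic = "ALVAPKDTTFLRFLKPWL"
af288818 = "ALVAPKDPTFLHFLKPWL"
print(count_diffs(genomic, af288818))  # prints 2
```

Summing the per-segment counts over all exons gives the total number of differences between two entries.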

 

>CYP4F37 94% to 4F5 chr7 (+) 89% to 4f16 88% to 4f37

13149326 MPWLTVSGLDLGSVVTSTWHLLLLGAASWILARILAWTYSFCENCSRLRCFPQSPKRNWFLGHLGV 13149523

13159662 IQSNEEGMQLVTEMGQTFRDVHLIWLGPVSPVLRLVDPAFVAPLLQAP 13159805

13162623 ALVAPKDPTFLHFLKPWL 13162676

13162768 GDGLFLSSGDKWSRHRRLLTPAFHFDILKPYVKTFNQSVNIMH 13162896

13164072 AKWKHLCLEGSARLEMFENISLMTLDSLQKCLFGFDSNCQE 13164194

13164825 SPSEYISATLELSSLTRKRSYKLFLYLDFLYYRTADGQRFRKACDLVHSFTDAVIRERRR

         LLSSQGVDEFLESKTKSKSKTLDFIDVLLLAK 13165100

13165761 DEHGKELSDEDIRAEADTFMFG 13165826

13165992 GHDTTASALSWILYNLASHPEYQERCRQEVWELLRDREPEEIEW 13166123

13168834 DDLAQLPFLTMCIKESLRLHPPAVDLLRRCTQDIVLPDGRVIPK 13168965

13169077 GNICVISIFGIHHNPSVWPDPE 13169142

13169302 VYDPFRFDPENRQKRSPLSFIPFSAGP 13169382

13169627 RNCIGQTFAMNEVKVAVGLTLLRFRFLPDDKEPRRKPELILRAEGGLWLRVELLSRDTQ 13169803

 

>CYP4F43P pseudogene chr7 (+) strand exons 4, 5, 9, 10, 11, 12

13179984 RDGVFLISFDKWNHHHCLLTPAFHFDNLVL 13180073

         *VKIFNQSVNIIH

13181413 VSFLKAKWKCLFSEGSACLEIFENLTTLDSLQKCLFSLDSNCQE 13181544

 

13207589 NDLAQLPFLTMCIKASLQLYPQDTNLICSCT 13207681

         *DILLPDG*VIPK

         XXXXXXXXXGVHHSPSVWTDPX

13208327 VYYPFPFDSKNPQKISPLAFMPFSVGP 13208407

13208782 RNCKRQTYPMSERKVALVLKLLHFHTIPGEIDPPRQPELILSLEGRLWLLKESLSVG 13208952

 

>CYP4F44P pseudogene MISSING EXON 1 AND HALF OF EXON 2 90% to 4f16

13215861 LGPVIPVLRLVDPAFVAPLLQAP 13215929

13219942 ALVAPKDMNFYGFLKPWL 13219995

13220082 GDGLLLSSGDKWNRHRXLTPAFHFDILKPYVKIFNQSVNIMH 13220206

13227610 VKWKHLCVEGSAHLEMFENISLMTLDSLQKCLFGFDSNCQE 13227732

13228571 SPSEYISAILELSSLTIKRSYQLFLYLDFLYYRTADGRRFRKAC

         DLVHSFTDAVIRERRRLLSSQGVDEFLESKTKSKSKTLDFIDVLLLAK 13228846

13230087 DEHGKELSDEDIRAEADTFMFG 13230152

13230344 GHDTTASTLSWILYNLARHPEYQESCLQEVWELLRDREPEEIEW 13230475

13239583 DDLAQLPFLTMCIKESLRLHPPAVDLLRRCTQDIVLPDGRVIPK 13239714

13239826 GNICVISIFGIHHNPSVWPDPE 13239891

13240051 VYDPFRFDPESRQKRSPLSFIPFSAGP 13240131

13240378 RNCIGQTFAMNEMKVAVALTLLRFRLLPDDKEPRRKPEIILRAEGGLRLLVEPLSGGA* 13240554

 

>CYP4F40 91% to 4f40 next closest = 82% probable ortholog of 4f40

13268033 MRHLDLSWLGLGPMSASPWLLLSLVGVSWFLTRCLTQIYTLYAK

         CQRLCGFPQPPKRSWFWGHLGM 13268230

13270621 SPPTEEGMKQMTELVATYPQGFMTWLGPIVPLITLCHPDIIRSVLSAS 13270764

13273375 AAVAPKDGIFYSFLKPWL 13273428

13273519 GDGLLVSASDKWSRHRSMLTPAFHFNILKPYVKIFNDSTNIMH 13273647

13275412 AKWLRLASGGSAHLDMFENISLMTLDTLQKCVFSFNSNCQE 13275534

13276605 KPSEYIAAILELSALVVKRNEQLLLHMDLLYRLTPDGRRFYKACHLVHDF

         TYAVIQERRRTLPKHGGDDVIKAKAKSKTLDFIDVLLLSK 13276874

13279547 DEDGKELSDEDIRAEADTFMFEG 13279615

13281122 GHDTTASGLSWILYNLAKHPEYQERCRQEVQELLRDRDSEEIEW 13281253

13281122 DDLAQLPFLTMCIKESLRLHPPVTMVSRCCTQDISLPDGRVIPK 13281253

13281331 GIICIINIFATHHNPTVWQDPE 13281396

13281524 VYDPFRFDPENIQARSPLAFIPFSAGP 13281604

13281923 RNCIGQTFAMNEMKVAVALTLLRFRVLPDDKEPRRKPELILRAEDGLWLRVEPLSAQA 13282096

 

>CYP4F4/4f15 U39206 chr7 (+) strand 92% to 4f15 next closest 83% probable ortholog of 4f15

13293478 MPQLDLSWLGLGPMSASPWLLLLLVGASWLLVRVLTQTYIFYRT

         YQHLCDFPQPPKWNWFLGHLGM 13293675

13296360 ITPTEQGLKQVTKLVATYPQGFMTWLGPILPIITLCHPDVIRSVLSA 13296500

13298373 SASVALKEVIFYSFLKPWL 13298429

13298517 GDGLLLSDGDKWSCHRRMLTPAFHFNILKPYVKIFNDSTNIMH 13298645

13301094 AKWQDLASGGSARLDMFKNISLMTLDSLQKCVFSFDSNCQE 13301216

13303748 KPSEYISAILELSALVAKRYQQLLLHTDSLYQLTHNGRRFHKACKLVHNFTDAVIQGRRR

         ALPSQHEDDILKAKARSKTLDFIDVLLLTK 13304017

13305908 DEDGKELSDEDIRAEADTFMFE 13305973

13306182 GHDTTASGLSWILYNLARHPEYQERCRQEVRELLRDRESTEIEW 13306313

13307962 DDLAQLPFLTMCIKESLRLHPPVTVISRRCTQDIVLPDGRVIPK 13308093

13308178 GVICIINIFATHHNPTVWPDPE 13308243

13308394 VYDPFRFDPENIKDRSPLAFIPFSAGP 13308474

13308851 RNCIGQTFAMNEMKVALALTLLRFRVLPDDKEPRRKPELILRAEGGLWLRVEPLSTQ 13309021

 

>CYP4F1/4f14 M94548 chr7 12 exons (-) strand 95% to 4f14 probable ortholog

13600726 MSQLSLSWLGLGPEVAFPWQTLLLFGASWILAQILTQIYAAYRN

         FRRLRGFPQPPKRNWLMGHVGM 13600529

13598616 VTPTEQGLKELTRLVGTYPQGFLMWIGPMVPVITLCHSDIVRSILNAS 13598473

13595808 AAVALKDVIFYTILKPWL 13595755

13595666 GDGLLVSAGDKWSRHRRMLTPAFHFNILKPYVKIFNDSTNIMH 13595538

13595386 AKWKRLISEGSSRLDMFEHVSLMTLDSLQKCVFSFDSNCQE 13595264

13593683 KSSEYIAAILELSALVAKRHQQPLLFMDLLYNLTPDGMRFHKACNLVHEFTDAVIRERRR

         TLPDQGLDEFLKSKAKSKTLDFIDVLLLTK 13593414

13592552 DEDGKELSDEDIRAEADTFMFE 13592552

13592326 GHDTTASGLSWILYNLANDPEYQERCRQEVQELLRDRDPEEIEW 13592195

13591103 DDLAQLPFLTMCIKESLRLHPPVTVISRCCTQDILLPDGRTIPK 13590972

13590900 GIICLISIFGIHHNPSVWPDPE 13590835

13590677 VYNPFRFDPENIKDSSPLAFIPFSAGP 13590597

13590229 RNCIGQTFAMSEMKVALALTLLRFRLLPDDKEPRRQPELILRAEGGLWLRVEPLTAGAQ 13590053
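
The coordinates flanking each exon line can be checked against the peptide between them, since an inclusive span of n nucleotides should encode about n/3 residues. The following is a minimal sketch under stated assumptions: the function name span_matches and the slack of 2 nt (to allow for codons split across an intron) are hypothetical choices, not part of the original annotation. It is applied to two exon lines from the CYP4F1 entry; the second, as listed, has identical start and end coordinates and is therefore flagged.

```python
def span_matches(start: int, peptide: str, end: int, slack: int = 2) -> bool:
    """True if the inclusive nucleotide span is within `slack` nt of 3 * len(peptide)."""
    span = abs(end - start) + 1  # abs() handles both (+) and (-) strand listings
    return abs(span - 3 * len(peptide)) <= slack


# Two exon lines from the CYP4F1 entry (minus strand, so coordinates descend).
exons = [
    (13590900, "GIICLISIFGIHHNPSVWPDPE", 13590835),
    (13592552, "DEDGKELSDEDIRAEADTFMFE", 13592552),
]
for start, peptide, end in exons:
    status = "ok" if span_matches(start, peptide, end) else "check coordinates"
    print(start, end, status)
```

A check of this kind only flags lines for review; it cannot say which of the two coordinates is wrong.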

 

>CYP4F6/4f13 U39208 chr7 (-) strand 91% to 4f13 probable ortholog

13635825 MLQLSLSRLGMGSLTASPWHLLLLGGASWILARILAWIYTFYDN

         CCRLRCFPQPPKPSWFWGHLTL 13635628

13629996 MKNNEEGMQFIAHLGRNFRDIHLSWVGPVYPILRLVHPNVIAPLLQA 13629856

13620199 SAAVAPKEMTLYGFLKPWL 13620143

13620054 GDGLLMSAGEKWNHHRRLLTPAFHFDILKSYVKIFNKSVNTMH 13619926

13618162 AKWQRLTAKGSARLDMFEHISLMTLDSLQKCIFSFDSNCQE 13618040

13616381 SNSEYIAAILELSSLIVKRQRQPFLYLDFLYYLTADGRRFRKACDVVHNFTDAVIRERRS

         TLNTQGVDEFLKARAKTKTLDFIDVLLLAK 13616112

13615792 DEHGKGLSDVDIRAEADTFMFG 13615727

13615538 GHDTTASALSWILYNLARHPEYQERCRQEVRELLRDREPEEIEW 13615407

13610183 DDLAQLPFLTMCIKESLRLHPPVLLISRCCSQDIVLPDGRVIPK 13610052

13609958 GNICVISIFGVHHNPSVWPDPE 13609893

13609762 VYNPFRFDPENPQKRSPLAFIPFSAGP 13609682

13609348 RNCIGQTFAMSEIKVALALTLLRFCVLPDDKEPRRKPELILRAEGGLWL

         RVEPLSTVTSQLPWDLLAHPPTS 13609133

 

>CYP4F18 XM_224708 ASSEMBLY MODIFIED from Genbank entry 77% to 4F1 chr7 (+) strand

92% to 4f18 probable ortholog, 4f18 is also distant from the 4f cluster in mouse

18197903 MPLLSLSWLGLGHTAASPWLLLLLVGASCLLAYILPQVYAVFEN

         SRRLRRFPQPPTRNWLFGHLGL 18198100

18202714 IQSSEEGLLYIQSLSRTFRDVCCWWVGPWHPVIRIFHPAFIKPVILA 18202854

18203794 PASVAPKDRVFYRFLKPWL 18203850

18203937 GDGLLLSTGDKWSRHRHMLTPAFHFNILKPYVKIFNDSTNIMH 18204065

18206788 AKWQRLASQGSARLDMFEHISLMTLDSLQKCVFSFDSNCQE 18206910

18208877 KPSEYITAILELSALVARRHQSLLLYVDLFYHLTRDGMRFRKACRLVHDFTDAVIRERRR

         TLPDQGGDDALKAKAKAKTLDFIDVLLLSK 18209146

18211298 DEHGEALSDEDIRAEADTFMFG 18211363

18211538 GHDTTASGLSWILYNLAKHPEYQERCRQEVRELLRDREPEEIEW 18211669

18222463 DDLAQLPFLTMCIKESLRLHPPATAISRCCTQDIMLPDGRVIPK 18222594

18222676 GVICRISIFGTHHNPAVWPDPE 18222741

18223428 VYNPFRFDADNGEGRSPLAFIPFSAGP 18223508

18223827 RNCIGQTFAMSEMKVALALTLLRFRVLPDDKEPRRKPELILRAEGGLWLRVEPLSAGAH 18224003

 

>CYP4V XM_341440 extra intron in the middle removed, CK366141.1 EST at boundary

there is a gc-at boundary. 92% to mouse 4v3

MLWLWLGLSGQKLLLWGAASAVSVAGATVLLNILQMLVSYARKW

QQMRPIPSVARAYPLVGHALFMKPNNTEFFQQIIQYTEEFRHLPIIKLWIGPVPLVAL

YKAENVEVILTSSKQIDKSFMYKFLQPWLGLGLLTS (2)

TGSKWRARRKMLTPSFHFTILEDFLDVM

NEQANILVNKLEKHVNQEAFNCFFPITLCALDIICETAMGKNIGAQSNGDSEYVRTVY

RMSDMIYRRMKMPWFWFDLWYLMFKEGRDHKKGLKSLHTFTNNVIAERVNARKAEQDC

IGAGRGPLPSKTKRKAFLDLLLSVTDEEGNKLSHEDIREEVDTFMFEGHDTTAAAINW

SLYLLGSNPEVQRKVDKELDDVFGRSHRPVTLEDLKKLKYLDCVIKETLRVFPSVPLF

ARSLSEDCEVAGYKISKGTEAVIIPYALHRDPRYFPDPEEFQPERFFPENSQGRHPYA

YVPFSAGPRNCIGQKFAVMEEKTILACILREFWIESNQKREELGLAGDLILRPNNGIW

IKLKRRHEDDP

 

>CYP4X1 AF439343, NM_145675

MEASWLENRWARPLHLALVFCLALVLMQAVKLYLRRQRLLRDLR

PFPGPTAHWLLGHQKFLQEDNMEKLDEIVKEYPCAFPCWVGPFQAFFYIYDPDYAKIF

LSRTDPKTQYLHQLMTPFLGRGLLNLDGPRWFQHRCLLTPAFHQDILKPCVDMMAHSV

NMMLDKWEKTWTTQETTIEVFEHINLMTLDIIMKCAFGQETNCQINGTYESYVKATFE

LGEIISSRLYNFWHHHDIIFKLSPKGHCFQELGKVIHQCTEKIIQDRKKTLKDQVNQD

DTQTSQNFLDIVLSAQAGDEKAFSDADLRSEVNTFMWAGHDASAASISWLLYCLALNP

EHQDRCRTEIRSILGDGSSITWEQLDEIPYTTMCIKETLRLIPPIPSISRELSKPLTL

PDGHSLPAGMTVVLSIWGLHHNPAVWKDPKVFDPLRFTKENSEQRHPCAFLPFSSGPR

NCIGQQFAMLELKVAIALTLLRFRVAADLTRPPAFSSHTVLRPKHGIYLHLKKLPEC

 

>CYP5A1 D28773

MEVLGLLKFEVSGTVVTVTLSVVLLALLKWYSTSAFSRLRKLGI

RHPEPSPFVGNLMFFRQGFWESHLELRERYGPLCGYYLGRRMYIVISDPDMIKEVLVE

NFSNFSNRMASGLEPKLIADSVLMLRDRRWEEVRGALMSAFSPEKLNEMTPLISQACE

LLLSHLKHSAASGDAFDIQRCYCCFTTNVVASVAFGIEVNSQDAPEDPFVQHCQRVFA

FSTPRPLLALILSFPSIMVPLARILPNKNRDELNGFFNTLIRNVIALRDKQTAEERRG

DFLQMVLDAQRSMSSVGVEAFDMVTEALSSAECMGDPPQRCHPTSTAKPLTVDEIAGQ

AFLFLIAGHEITTNTLSFITYLLATHPECQERLLKEVDLFMEKHPAPEYCNLQEGLPY

LDMVVAETLRMYPPAFRFTREAAQDCEVLGQHIPAGSVLEIAVGALHHDPEHWPNPET

FDPERFTAEARLQQKPFTYLPFGAGPRSCLGVRLGLLVVKLTLLQVLHKFRFEACPET

QVPLQLESKSALCPKNGVYVKIVSR

 

>CYP7A1 J05460

MMTISLIWGIAVLVSCCIWFIVGIRRRKAGEPPLENGLIPYLGC

ALKFGSNPLEFLRANQRKHGHVFTCKLMGKYVHFITNSLSYHKVLCHGKYFDWKKFHY

TTSAKAFGHRSIDPNDGNTTENINNTFTKTLQGDALCSLSEAMMQNLQSVMRPPGLPK

SKSNAWVTEGMYAFCYRVMFEAGYLTLFGRDISKTDTQKALILNNLDNFKQFDQVFPA

LVAGLPIHLFKTAHKAREKLAEGLKHKNLCVRDQVSELIRLRMFLNDTLSTFDDMEKA

KTHLAILWASQANTIPATFWSLFQMIRSPEAMKAASEEVSGALQSAGQELSSGGSAIY

LDQVQLNDLPVLDSIIKEALRLSSASLNIRTAKEDFTLHLEDGSYNIRKDDMIALYPQ

LMHLDPEIYPDPLTFKYDRYLDESGKAKTTFYSNGNKLKCFYMPFGSGATICPGRLFA

VQEIKQFLILMLSCFELEFVESQVKCPPLDQSRAGLGILPPLHDIEFKYKLKH

 

>CYP7B1 XM_342218, U36992

MEGATTPDAASPGPLSLLGLLFAVTLLLPVLFLLTRRTRRPCEP

PLIKGWIPYLGMALKFWKDPLAFLQTLQRQYGDTFTVLLGGKYITFVLNPFQYQYVMK

NPKQLSFEKFSRRLSAKAFSVKKLLTDDDLSNDIHRGYLLLQGKSLDGLLETMIQEVK

EIFESRLLKLTDWNTARVFDFCSSLVFEITFTTIYGKILAANKKQIISELRDDFLKFD

DHFPYLVSDIPIQLLRNAEFMQKKIIKCLTPEKVAQMQRRSEIVQERQEMLKKYYGHE

EFEIGAHHLGLLWASLANTIPAMFWAMYYLLQHPEAMEVLRDEIDSFLQSTGQKKGPG

ISVHFTREQLDSLVCLESAILEVLRLCSYSSIIREVQEDMDFSSESRSYRLRKGDFVA

VFPPMIHNDPEVFDAPKDFRFDRFVEDGKKKTTFFKGGKKLKSYIIPFGLGTSKCPGR

YFAINEMKLLVIILLTYFDLEVIDTKPIGLNHSRMFLGIQHPDSDISFRYKAKSWRS

 

>CYP8A1 U53855 Rn.73051

MSWAALLGLLAVLLLLLLLLSRRRARRPGEPPLDLGSIPWLGHA

LEFGKDAASFLTRMKEKHGDIFTVLVGGRYVTVLLDPHSYDTVVWDLRTRLDFHPYAI

FLMERIFDLQLPNFNPSEEKARMKPTLMHKDLQALTEAMYTNLRTVLLGDSTEGGSGW

QEKGLLEFSYSSLLSAGYLTLYGVEASPRTHESQALDRDHSADVFRTFRQLDLMLPKL

ARGSLSVGDKDHACSVKSRLWKLLSPAGLASRADRSSWLESYLRHLEEMGVSEDMQAR

ALVLQLWATQGNMGPTAFWLLLFLLKNPEALDAVHAELKRIVWQAEKPVLQMTALPQK

ILDSMPVLDSVLNETLRLTAAPFITREVMADLALPMADRREFSLRRGDRLLLFPFLSP

QKDPEIYTEPEVFKYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNQCLGKSYAIN

SIKQFVVLLLTHFDLELVSEDTEVPEFDLSRYGFGLMQPEEDVPIRYRTRL

 

>CYP8B1 NM_031241, AB009686

MLWGSVLGALLMAVGCLCLSLLPRHRRPWEPPLDKGFVPWLGHT

MAFRKNMFEFLKGMRAKHGDVFTLQLGGQYFTFVMDPLSFGPIIKSTQKVLDFVTYAR

ELVFKVFGYQSMDEDHQMLHVASTKHLMGQGLEDLNRAMLDSLSLVMLGPKGRSLGAR

SWCEDGLFHFCYSILFKAGFLSLFGCTKDKEQDLDEADELFRKFRRFDLLFPRFVYSL

LGPLEWVEVSQLQRLFHQRLSVEQNLEKDGISNWLGFMLRFLRERGMASSMQDKFNFM

MLWASQGNTGPTCFWALLFLLKHQDAMKAVREEATRVLGEARLEAETSFAFTLSALKC

TPVLDSVMEETLRLCATPTLLGVVQEDYVLKMASGQEYQIRRGDKVALFPYLSVHMDP

DIHPEPTTFKYNRFLNPDGTRKVDFYKSGKKIHHYNMPWGSGVSICPGRFFAPSEMKT

FVLLMVMYFDFELVDPDMPVPPIDPRRWGFGTSQPSHEVRFRYRLKPMQ

 

>CYP11A1 J05156

MLAKGLCLRSVLVKSCQPFLSPVWQGPGLATGNGAGISSTNSPR

SFNEIPSPGDNGWINLYHFLRENGTHRIHYHHMQNFQKYGPIYREKLGNMESVYILDP

KDAATLFSCEGPNPERYLVPPWVAYHQYYQRPIGVLFKSSDAWRKDRIVLNQEVMAPD

SIKNFVPLLEGVAQDFIKVLHRRIKQQNSGKFSGDISDDLFRFAFESITSVVFGERLG

MLEEIVDPESQRFIDAVYQMFHTSVPMLNMPPDLFRLFRTKTWKDHAAAWDVIFSKAD

EYTQNFYWDLRQKRDFSKYPGVLYSLLGGNKLPFKNIQANITEMLAGGVDTTSMTLQW

NLYEMAHNLKVQEMLRAEVLAARRQAQGDMAKMVQLVPLLKASIKETLRLHPISVTLQ

RYIVNDLVLRNYKIPAKTLVQVASYAMGRESSFFPNPNKFDPTRWLEKSQNTTHFRYL

GFGWGVRQCLGRRIAELEMTIFLINVLENFRIEVQSIRDVGTKFNLILMPEKPIFFNF

QPLKQDLGSTMPRKGDTV

 

>CYP11B1 pseudogene? XM_343261

MALRVTADVWLARPWQCLHRTRALGTTAKVAPKTLKPFEAIPQY

SRNKWLKMIQILREQGQENLHLEMHQAFQELGPIFRHSAGGAQIVSVMLPEDAEKLHQ

VESILPHRMPLEPWVAHRELRGLRRGVFLL

(seq gap, then two copies of the next exon)

GHDLYPESLKFTHALHSMFTSTTQLILLPKSLTRWTSTQVWKGHFESWDIISEY

GHDLYPESLKFTHALHSMFTSTTQLILLPKSLTRWTSTQVWKGHFESWDIISEY

SHKCIKNVYRELAEGRQKSWSVISEMVAQSTLSMDAIHANSMEIIAEVLTR

TAISLVMTLFELARNPDVQQALQQESLAAEASIAANPQKAISDLPLLRAALKETLR

LYPVGSYLERILNSDLVLQNYHVPAGTFVIIYLYSMGRNPAVFPRPERYMPQRWLERKRS

FQHLAFGFGVRQCLGRRLAEVEMLLLLHH

MPKSFQVETQEKEDVQMAYRFILMPSSIPLLTFRPVS

 

>CYP11B1 X15431

MALRVTADVWLARPWQCLHRTRALGTTAKVAPKTLKPFEAIPQY

SRNKWLKMIQILREQGQENLHLEMHQAFQELGPIFRHSAGGAQIVSVMLPEDAEKLHQ

VESILPHRMPLEPWVAHRELRGLRRGVFLL NGADWRFNRLQLNPNMLSPKAIQSFVPF

VDVVARDFVENLKKRMLENVHGSMSINIQSNMFNYTMEASHFVISGERLGLT GHDLKP

ESVTFTHALHSMFKSTTQLMFLPKSLTRWTSTRVWKEHFDSWDIISEY VTKCIKNVYR

ELAEGRQQSWSVISEMVAQSTLSMDAIHANSMELIA GSVDT TAISLVMTLFELARNPD

VQQALRQESLAAEASIVANPQKAMSDLPLLRAALKETLRLYPVGSFVERIVHSDLVLQ

NYHVPAGTFVIIYLYSMGRNPAVFPRPERYMPQRWLERKRSFQHLAFGFGVRQCLGRR

LAEVEMLLLLHH MLKTFQVETLRQEDMQMVFRFLLMPSSSPFLTFRPVS

 

>CYP11B2 D00567

MGACDNDFIELHSRVTADVWLARPWQCLHRTRALGTTATLAPKT

LKPFEAIPQYSRNKWLKMIQILREQGQENLHLEMHQAFQELGPIFRHSAGGAQIVSVM

LPEDAEKLHQVESILPRRMHLEPWVAHRELRGLRRGVFLLNGAEWRFNRLKLNPNVLS

PKAVQNFVPMVDEVARDFLEALKKKVRQNARGSLTMDVQQSLFNYTIEASNFALFGER

LGLLGHDLNPGSLKFIHALHSMFKSTTQLLFLPRSLTRWTSTQVWKEHFDAWDVISEY

ANRCIWKVHQELRLGSSQTYSGIVAALITQGALPLDAIKANSMELTAGSVDTTAIPLV

MTLFELARNPDVQQALRQETLAAEASIAANPQKAMSDLPLLRAALKETLRLYPVGGFL

ERILNSDLVLQNYHVPAGTLVLLYLYSMGRNPAVFPRPERYMPQRWLERKRSFQHLAF

GFGVRQCLGRRLAEVEMLLLLHHMLKTFQVETLRQEDVQMAYRFVLMPSSSPVLTFRP

IS

 

>CYP11B3 U14907

MALRVTADVWARPWQCLHRTRALGSTATQAPKTLKPFEAIPQYS

RNKWLKMIQILREQSQENLHLEMHQAFQELGPIFRHSAGGAQIVSVMLPEDAEKLHQV

ESILPRRMTLESWVAHRELRGLRRGVFLLNGADWRFNRLQLNPNMLSPKAVQSFVPFV

DVVARDFVENLKKRMLENVHGSMSMDIQSNVFNYTMEASHFVISGERLGLTGHDLNPE

SLKFIHALHSMFKSTTQLMFLPKNLTRWTSTQVWKGHFESWDIISEYVTKCIKNVYRE

LAEGRQQSWSVISEMVAQSTLSMDAIHANSMELIAGSVDTTAISLVMTLFELARNPDV

QQALRQESLAAEASIAANPQKAMSDLPLLRAALKETLRLYPIGSSLERIVDSDLVLQN

YHVPAGTLVIIYLYSMGRNPAVFPRPERYMPQRWLERKRSFQHLAFGFGVRQCLGRRL

AEVEVLLLLHHMLKIFQVETLRQEDVQMAYRFVLMPNPRLVLTIRPVS

 

>CYP17A1 M31681

MWELVGLLLLILAYFFWVKSKTPGAKLPRSLPSLPLVGSLPFLP

RRGHMHVNFFKLQEKYGPIYSLRLGTTTTVIIGHYQLAREVLIKKGKEFSGRPQMVTQ

SLLSDQGKGVAFADAGSSWHLHRKLVFSTFSLFKDGQKLEKLICQEAKSLCDMMLAHD

KESIDLSTPIFMSVTNIICAICFNISYEKNDPKLTAIKTFTEGIVDATGDRNLVDIFP

WLTIFPNKGLEVIKGYAKVRNEVLTGIFEKCREKFDSQSISSLTDILIQAKMNSDNNN

SCEGRDPDVFSDRHILATVGDIFGAGIETTTTVLKWILAFLVHNPEVKKKIQKEIDQY

VGFSRTPTFNDRSHLLMLEATIREVLRIRPVAPMLIPHKANVDSSIGEFTVPKDTHVV

VNLWALHHDENEWDQPDQFMPERFLDPTGSHLITPTQSYLPFGAGPRSCIGEALARQE

LFVFTALLLQRFDLDVSDDKQLPRLEGDPKVVFLIDPFKVKITVRQAWMDAQAEVST

 

>CYP19A1 M33986

MFLEMLNPMHYNVTIMVPETVPVSAMPLLLIMGLLLLIRNCESS

SSIPGPGYCLGIGPLISHGRFLWMGIGSACNYYNKMYGEFMRVWISGEETLIISKSSS

MVHVMKHSNYISRFGSKRGLQCIGMHENGIIFNNNPSLWRTVRPFFMKALTGPGLIRM

VEVCVESIKQHLDRLGDVTDNSGYVDVVTLMRHIMLDTSNTLFLGIPLDESSIVKKIQ

GYFNAWQALLIKPNIFFKISWLYRKYERSVKDLKDEIEILVEKKRQKVSSAEKLEDCM

DFATDLIFAERRGDLTKENVNQCILEMLIAAPDTMSVTLYVMLLLIAEYPEVETAILK

EIHTVVGDRDIRIGDVQNLKVVENFINESLRYQPVVDLVMRRALEDDVIDGYPVKKGT

NIILNIGRMHRLEYFPKPNEFTLENFEKNVPYRYFQPFGFGPRSCAGKYIAMVMMKVV

LVTLLKRFHVKTLQKRCIENMPKNNDLSLHLDEDSPIVEIIFRHIFNTPFLQCLYISL

 

>CYP20A1 NM_199401 XM_237189 BC061716.1

MLDFAIFAVTFLLALVGAVLYLYPASRQASGIPGLTPTEEKDGN

LPDIVNSGSLHEFLVNLHGRYGPVVSFWFGRRLVVSLGTADALKQHFNPNKTLDPFET

MLKSLLGYRSGAGSGSEDHVRRRLYGDAVTAALQSNFPLLLKLSEELLDKWLSYPETQ

HIPLSQHMLGFALKFVTRMVLGDTFEGEQEVIRFQKIHGTVWSEIGKGFLDGSLDKNT

TRKNQYQEALMQLEAILKKIIKERKGGDFSQHTFIDSLVQRNLNEQQILEDSVVFSLA

GCIVTARLCTWAIHFLTTAEEVQKKLHKEVDHVLGKGPITSEKIEQLRYCQQVLCETV

RTAKLTPVSAQLQDIEGKVGPFIIPKETLVLYALGVVLQDASTWPSPHKFDPDRFADE

PVMKVFSSLGFSGTWECPELRFAYVVTTVLVSVLLKKLHLLAVDRQVFEMKYELVTSC

REETWITVSERH

 

>CYP21 U56853

MLLPGLLLLLLLLLLAGTRWLWGQWKLWKLRLPPLAPGFLHFLQ

PNLPVYLFGLAQKLGPIYRIRLGLQDVVVLNSNKTIEEALIQKWVDFAGRPQILDGKM

NFDLSMGDYSLTWKAHKKLSRSALVLGMRDSMEPLVEQLTQEFCERMRAQAGASVAIH

KEFSLLTCSIISCLTFGDKQDSTLLNATHSCVRDLLKAWNHWSVQILDIIPFLRFFPN

PGLWKLKQFQESRDHIVMQELKRHKDSLVAGQWKDMIDYMLQGVEKQRDARDPGQLHE

RHVHMSVVDLFVGGTETTAATLSWAVAFLLHHPEIQKRLQEELDLKLAPSSQLLYKNR

MQLPLLMATIAEVLRLRPVVPMALPHRATKASSISGYDIPKDTIIIPNIQGANLDEMV

WELPSKFWPDRFLESGKSPRIPTFGCGARVCLGEPLARLEFFVVLARLLQTFTLLPPP

DGTLPSLQPLPYTGINLLIPPFQVRLQPRNLAPQDQGQKSSTG

 

>CYP21-ps pseudogene fragment AY091789

LFGLAQKLGPIYRIAWG

DAVVLNSNKTIEEALIQKWVDFTG*PQILDGK

 

>CYP24A1 X59506

MSCPIDKRRTLIAFLRRLRDLGQPPRSVTSKASASRAPKEVPLC

PLMTDGETRNVTSLPGPTNWPLLGSLLEIFWKGGLKKQHDTLAEYHKKYGQIFRMKLG

SFDSVHLGSPSLLEALYRTESAHPQRLEIKPWKAYRDHRNEAYGLMILEGQEWQRVRS

AFQKKLMKPVEIMKLDKKINEVLADFLERMDELCDERGRIPDLYSELNKWSFESICLV

LYEKRFGLLQKETEEEALTFITAIKTMMSTFGKMMVTPVELHKRLNTKVWQAHTLAWD

TIFKSVKPCIDNRLQRYSQQPGADFLCDIYQQDHLSKKELYAAVTELQLAAVETTANS

LMWILYNLSRNPQAQRRLLQEVQSVLPDNQTPRAEDLRNMPYLKACLKESMRLTPSVP

FTTRTLDKPTVLGEYALPKGTVLTLNTQVLGSSEDNFEDSHKFRPERWLQKEKKINPF

AHLPFGIGKRMCIGRRLAELQLHLALCWIIQKYDIVATDNEPVEMLHLGILVPSRELP

IAFRPR

 

>CYP26A1 AF439720, NM_130408 Chr1 1Mb upstream of CYP2C cluster

242138769 MGLPALLASALCTFVLPLLLFLAALKLWDLYCVSSRDRSCALPLPPG

          TMGFPFFGETLQMVLQ (0) 242138581

242138389 RRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL

          GEHRLVSVHWPASVRTILGAGCLSNLHDSSHKQRKK (0) 242138165

242137906 VIMQAFNREALQCYVPVIAEEVSGCLEQWLSCGERGLLVYPEV

          KRLMFRIAMRILLGCEPGPAGGGEDEQQLVEAFEEMTRNLFSLPIDVPFSGLYR (0) 242137616

242137537 GVKPRNLIHARIEENIRAKIRRLQAAERNAGCKDALQLLIEHSWERGERLDMQ (0) 242137379

242136717 ALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREEIKSK (0) 242136583

242136000 GLLCKSHHEDKLDMETLEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELN (0) 242135848

242135595 GYQIPKGWNVIYSICDTHDVADSFTNKEEFNPDRFTSLHPEDTSRFSFIPFGGGLRSCRSKEFAKI

          LLKIFTVELARRCDWQLLNGPPTMKTSPTVYPVDNLPARFTHFQGDI* 242135254
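
To recover a continuous protein sequence from a coordinate-annotated entry like this one, the flanking numbers and the parenthesized digits can be stripped and the exon lines joined. The sketch below is an illustration only: the function name join_exons is hypothetical, and it assumes that markers such as (0) are intron phase annotations and that residues are written as uppercase letters with * for a stop. It is applied to the first exon of CYP26A1.

```python
import re


def join_exons(lines):
    """Concatenate peptide text from exon lines, dropping coordinates and markers."""
    parts = []
    for line in lines:
        line = re.sub(r"\(\d\)", "", line)  # drop markers such as (0), (1), (2)
        parts.append(re.sub(r"[^A-Z*]", "", line))  # keep residues and stop marks
    return "".join(parts)


# First exon of the CYP26A1 entry, as listed over two lines.
exon1 = [
    "242138769 MGLPALLASALCTFVLPLLLFLAALKLWDLYCVSSRDRSCALPLPPG",
    "          TMGFPFFGETLQMVLQ (0) 242138581",
]
protein = join_exons(exon1)
print(len(protein), protein)
```

Header lines that begin with > contain uppercase letters as well, so they should be skipped before calling the function.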

 

>CYP26B1 AY245532, NM_181087

MLFEGLELVSALATLAACLVSVTLLLAVSQQLWQLRWAATRDKS

CKLPIPKGSMGFPLIGETGHWLLQGSGFQSSRREKYGNVFKTHLLGRPLIRVTGAENV

RKILLGEHQLVSTEWPRSARVLLGPNTVANSIGDIHRNKRKVFSKIFSHEALESYLPK

IQLVIQDTLRAWSSQPEAINVYQEAQRLTFRMAVRVLLGFSIPEEDLGNLFEVYQQFV

ENVFSLPVDLPFSGYRRGIQARQILQKGLEKAVREKLQCTQGKDYSDALDILIESSKE

HGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPAVLEKLREELRAQGLLHG

GGCPCEGTLRLDMLSGLRYLDCVIKEVMRLFTPVSGGYRTVLQTFELDGFQIPKGWSV

MYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLGKHLAKLF

LKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNEILPETEAML

SATV

 

>CYP26C1 XM_217935 94% TO 26C1 MOUSE

 718 MFSWGLSCLSMLGAAGTALLCAGLLLGLAQQLWTLRWTLSRDWASTLPLPKGSMGWPFFG 897

 898 ETLHWLVQGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRL (0)

     VLARVFSRPALEQFVPRLQEALRREVRSWCAAQRPVA 1257

1258 VYQAAKALTFRMAARILLGLQLDEARCTELAQTFERLVENLFSLPLDVPFSGLRK 1422

1705 GIRARDQLYQHLDEVIAEKLREELTAEPGDALHLIINSARELGRELSVQELK (0)

1888 ELAVELLFAAFFTTASASTSLILLLLQHPAAIAKIQQELSAQGLGSPCSCAPRASGSRP 2064

2065 DCSCEPDLSLAVLGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELDGYQIPKGWSVMYS 2244

2245 IRDTHETAAVYRSPPEGFDPERFGVESEDARGSGGRFHYIPFGGGARSCLGQELAQAVLQ 2424

2425 LLAVELVRTARWELATPAFPVMQTVPIVHPVDGLLLLFHPLPTLGAGDGSPF* 2583

 

>CYP26C1 XM_217935 94% TO 26C1 MOUSE Chr1 1Mb upstream of CYP2C cluster

242151281 MFSWGLSCLSMLGAAGTALLCAGLLLGLAQQLWTLRWTLSRDWASTLPLPKG

          SMGWPFFGETLHWLVQ (0) 242151079

242150553 GSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRL (0) 242150422

242149883 VLARVFSRPALEQFVPRLQEALRREVRSWCAAQRPVAVYQAAKALTFRMAAR

          ILLGLQLDEARCTELAQTFERLVENLFSLPLDVPFSGLRK (0) 242149608

242148160 GIRARDQLYQHLDEVIAEKLREELTAEPGDALHLIINSARELGRELSVQELK (0) 242148005

242146368 ELAVELLFAAFFTTASASTSLILLLLQHPAAIAKIQQELSAQGLGSPCSCAPRASGSRP

          DCSCEPDLSLAVLGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD (0) 242146051

242144220 GYQIPKGWSVMYSIRDTHETAAVYRSPPEGFDPERFGVESEDARGSGGRFHYI

          PFGGGARSCLGQELAQAVLQLLAVELVRTARWELATPAFPVMQTVPIVHPVD

          GLLLLFHPLPTLGAGDGSPF* 242143843

 

>CYP27A1 M38566

MAVLSRMRLRWALLDTRVMGHGLCPQGARAKAAIPAALRDHEST

EGPGTGQDRPRLRSLAELPGPGTLRFLFQLFLRGYVLHLHELQALNKAKYGPMWTTTF

GTRTNVNLASAPLLEQVMRQEGKYPIRDSMEQWKEHRDHKGLSYGIFITQGQQWYHLR

HSLNQRMLKPAEAALYTDALNEVISDFIARLDQVRTESASGDQVPDVAHLLYHLALEA

ICYILFEKRVGCLEPSIPEDTATFIRSVGLMFKNSVYVTFLPKWSRPLLPFWKRYMNN

WDNIFSFGEKMIHQKVQEIEAQLQAAGPDGVQVSGYLHFLLTKELLSPQETVGTFPEL

ILAGVDTTSNTLTWALYHLSKNPEIQEALHKEVTGVVPFGKVPQNKDFAHMPLLKAVI

KETLRLYPVVPTNSRIITEKETEINGFLFPKNTQFVLCTYVVSRDPSVFPEPESFQPH

RWLRKREDDNSGIQHPFGSVPFGYGVRSCLGRRIAELEMQLLLSRLIQKYEVVLSPGM

GEVKSVSRIVLVPSKKVSLRFLQRQ

 

>CYP27B1 AB001992

MTQAVKLASRVFHRVQLPSQLGSDSVLRSLSDIPGPSTPSFLAE

LFCKGGLSRLHELQVHGAARYGPIWSGSFGTLRTVYVADPALVEQLLRQESHCPERCS

FSSWSEHRRRHQRACGLLTADGEEWQRLRSLLAPLLLRPQAAAGYAGTLDSVVSDLVR

RLRRQRGRGSGLPDLVLDVAGEFYKFGLEGIGAVLLGSRLGCLEAEVPPDTETFIEAV

GSVFVSTLLTMAMPSWLHRLIPGPWARLCRDWDQMFAFAQKHVEQREGEAAVRNQGKP

EEDLPTGHHLTHFLFREKVSVQSIVGNVTELLLAGVDTVSNTLSWALYELSRHPEVQS

ALHSEITGAVNPGSYAHLQATALSQLPLLKAVIKEVLRLYPVVPGNSRVPDRDICVGN

YVIPQDTLVSLCHYATSRDPAQFREPNSFNPARWLGEGPAPHPFASLPFGFGKRSCIG

RRLAELELQMALAQILTHFEVLPEPGALPVKPMTRTVLVPERSIHLQFVDR

 

>CYP39A1 XM_236983 (INCORRECT END)  AC107523.4

MGIMELFSPIAIAVLGSCVLFLFSRWKNLRGPPCIQGWIPWIGA

GFEFGKAPLEFIEKARIKYGPVFTVFAVGKRMTFVTEEEGINVLLKSKHVDFELAVQR

PLYHTAWIPKNIFFALHEKLYVLMKGKMGTFNTHHFTGQLTEEFHDQLEGLGTHGTMD

LNDFVRYLLYPATLNTLFMKGLFLTDKRKIKEFYQHFKTYDEGFEYGSQLPEWLLRNW

SKSKRWLLALFEKNIGDIKTHGSAGHSETLLQAVLGMVETETRLHSPNYGLVMLWASL

ANAAPIAFWTLAYILSHPDLHRTIVESISSVFGTAGKDKIQVSENDLKKLLLIKWCIL

ESIRLRAPGVITRKVVKPVKILNHTVPSGDLLMLSPFWLHRNPKYFPEPESFKPERWK

EANLDKYIFLDYFMAFGGGKFQCPGR   

147499 WFALLEIQLCIILVLYKYECSLLDPLPKQA (1) 147410

146814 SLHLVGVPQPAGKCRIEYKQRV* 146746

 

>CYP46A1 XM_343108

MSPGLLLLGSAVLLAFGLCCTFVHRARSRYEHIPGPPRPSFLLG

HLPYFWKKDEACGRVLQDVFLDWAKKYGPVVRVNVFHKTSVIVTSPESVKKFLMSTKY

NKDSKMYRAIQTVFGERLFGQGLVSECDYGRWYKQRRVMDLAFSRSSLVSLMGTFNEK

AEQLMEILEAKADGQTPVSMQDMLTCATIDILAKAAFGMETSMLLGAQKPLSQAVKVM

LEGISASRNTLAKFMPGKRKQLREIRESIRLLRQVGKDWVQRRREALKRGEDVPADIL

TQILKAEEGAQDDEVLLDNFVTFFIAGHETSANHLAFTVMELSRQPEIVARLQAEVDE

VVGSKRHLDYEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVPGNTPLL

FSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQMEVKVV

MAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPAPPPPPC

 

>CYP51A1 U17697

MEQVTGGNLLSTLLIACAFTLSLVYLFRLAVGHMVQLPAGAKSP

PYIYSPIPFLGHAIAFGKSPIEFLENAYEKYGPVFSFTMVGKTFTYLLGSDAAALLFN

SKNEDLNAEEVYGRLTTPVFGKGVAYDVPNAVFLEQKKILKSGLNIAHFKQYVSIIEK

EAKKYFKSWGESGERNVFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFS

HAAWLLPGWLPLPSFRRRDRAHREIKNIFYKAIQKRRLSKEPAEDILQTLLDSTYKDG

RPLTDDEIAGMLIGLLLAGQHTSSTTSAWMGFFLARDKPLQDKCYLEQKTVCGEDLPP

LTYEQLKDLNLLDRCIKETLRLRPPI MTMMRMAKTPQTV AGYTIPPGHQVCVSPTVNQ

RLKDSWVERLDFNPDRYLQDNPASGEKFAYVPFGAGRHRCIGENFAYVQIKTIWSTML

RLYEFDLINGYFPSVNYTTMIHTPENPVIRYKRR SK

 

>CYP51P1  pseudogene D87997 92% to CYP51A1 rat

 398 MEQVTGGNLLSTLLIACAFTLSLVYLFRLAVGHMVQLPAGAESPPCIYSPIPFLGHAI

     FGKSPIEFLENAYEKSGPVFSFTMVSKTFTYLLGSDAAALL 696

 697 FNSKNEDLNAEEVYGRLTTPVFGKXXXXXXX 768

 788 NAVFLEHKKILKSGLNIAHFKQYVSITEKEAKEYFKSWGESGERNVFEALSELIILTASH 967

 968 CLHGKEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWLPLPSFRRRDRAHREIKNIFYKAI 1147

1148 QKRRLSKEPAEDILQTLLDSTYKDGRPLTDDVIAGMLIGLLLAGXXXXXX

1281 TSAWMGFFLVRDKPLQGKCYLEQKAVCGEDLPPLTYE*LKDLNLLDRCIKE 1433

     TLRLRPPIMTMMRMAKTPQ 1489

1490 NVAGCTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQGNPASGEKFAYVPFGAGRHH 1669

1670 CIGENFAYVQIKTIWSTMLHLYEFDLINGYFLSVNYTTMIHTPENPVIRYKMK 1828

 

>CYP51P1  pseudogene XM_234202

MEQVTGGNLLSTLLIACAFTLSLVYLF

RLAVGHMVQLPAGAESPPCIYSPIPFLGHAIAFGKSPIEFLENAYEKSGPVFSFTMVG

KTFTYLLGSDAAALLFNSKNEDLNAEEVYGRLTTPVFGKGVAYDVPNAVFLEHKKILK

SGLNIAHFKQYVSITEKEAKEYFKSWGESGERNVFEALSELIILTASHCLHGKEIRSQ

LNEKVAQLYADLDGGFSHAAWLLPGWLPLPSFRRRDRAHREIKNIFYKAIQKRRLSKE

PAEDILQTLLDSTYKDGRPLTDDVIAGMLIGLLLAGHSAWMGFFLVRDKPLQGKCYLE

QKAVCGEDLPPLTYE*LKDLNLLDRCIKETLRLRPPIMTMMRMAKTPQNVAGCTIPPG

HQVCVSPTVNQRLKDSWVERLDFNPDRYLQGNPASGEKFAYVPFGAGRHHCIGENFAY

VQIKTIWSTMLHLYEFDLINGYFLSVNYTTMIHTPENPVIRYKMK

 

>CYP51P2  pseudogene D78370

MEQVTGGNLLSTLLIACAFTLSLV (fs)

NLFRLAVGHMVQLHAGAESPPCIY

SPIPFLGHRIAFGKSPIEFLENAYEKSGPVFSFTMVDKTFTYLLGSDAAALLFNSKNEDL

NAEEVYGRLTTPVFGKGVAYDVPNADFLEHKKLLKSGLNIAHFKQYVSITEKEAKEYFKS

WGESGERNVFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWL

PLPSFRRRDRAHREIKNIFYKAIQKRRLSKEPAEDILQTLLDSTYKDGRPLTDDVIAGML

IGLLLAGHSAWMGFFLVRDKPLQGKCYLEQKAVCGEALHPLTYE*LKDLNLLDRCIKETL

RLESPPI (fs)

VTM (fs)

VRIGQAPSRMCAGCTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQ

GNPASGEKFAYVPFGAGRHHCIGENFAYVQIKTIWSTMLHLYEFDLINGYFLSVNYTTMI

HTPENPVIRYKMK