>CYP1A1 NM_000499
MLFPISMSATEFLLASVIFCLVFWVIRASRPQVPKGLKNPPGPW
GWPLIGHMLTLGKNPHLALSRMSQQYGDVLQIRIGSTPVVVLSGLDTIRQALVRQGDD
FKGRPDLYTFTLISNGQSMSFSPDSGPVWAARRRLAQNGLKSFSIASDPASSTSCYLE
EHVSKEAEVLISTLQELMAGPGHFNPYRYVVVSVTNVICAICFGRRYDHNHQELLSLV
NLNNNFGEVVGSGNPADFIPILRYLPNPSLNAFKDLNEKFYSFMQKMVKEHYKTFEKG
HIRDITDSLIEHCQEKQLDENANVQLSDEKIINIVLDLFGAGFDTVTTAISWSLMYLV
MNPRVQRKIQEELDTVIGRSRRPRLSDRSHLPYMEAFILETFRHSSFVPFTIPHSTTR
DTSLKGFYIPKGRCVFVNQWQINHDQKLWVNPSEFLPERFLTPDGAIDKVLSEKVIIF
GMGKRKCIGETIARWEVFLFLAILLQRVEFSVPLGVKVDMTPIYGLTMKHACCEHFQM
QLRS

>CYP1A2 NM_000761
MALSQSVPFSATELLLASAIFCLVFWVLKGLRPRVPKGLKSPPE
PWGWPLLGHVLTLGKNPHLALSRMSQRYGDVLQIRIGSTPVLVLSRLDTIRQALVRQG
DDFKGRPDLYTSTLITDGQSLTFSTDSGPVWAARRRLAQNALNTFSIASDPASSSSCY
LEEHVSKEAKALISRLQELMAGPGHFDPYNQVVVSVANVIGAMCFGQHFPESSDEMLS
LVKNTHEFVETASSGNPLDFFPILRYLPNPALQRFKAFNQRFLWFLQKTVQEHYQDFD
KNSVRDITGALFKHSKKGPRASGNLIPQEKIVNLVNDIFGAGFDTVTTAISWSLMYLV
TKPEIQRKIQKELDTVIGRERRPRLSDRPQLPYLEAFILETFRHSSFLPFTIPHSTTR
DTTLNGFYIPKKCCVFVNQWQVNHDPELWEDPSEFRPERFLTADGTAINKPLSEKMML
FGMGKRRCIGEVLAKWEIFLFLAILLQQLEFSVPPGVKVDLTPIYGLTMKHARCEHVQ
ARRFSIN

>CYP1A8P NT_008580.9|Hs9_8737 chromosome 9 Pseudogene 43% to 1A2
Chr9q21.12 69227672-69241560 + strand build 33
4822084 MILDLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQVSPPGP*SFPIIENLLQLGDHPY 4822260
4822261 LTLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVLHKDGEHFAGRPNMHTFSFLAEGKS 4822440
4822441 LSFSVNYGESWKLHKKIASKAL*TFSNAEAKSSTCSCSLEEHVTEEISELVTVFVELTSK 4822620
4822621 NGSFDPRNAITCVVANIVCALCFGKR*DHSDEEFLRIVKTNDDLLKASSAANPADFIPCL 4822800
4822801 HYLPLKIINAPLEFYQALNGFIALHVQDHLATYGK 4822905 (0)
4824790 DHIRDITDALINVCHNKYAATKTDTLNDSEIISTVSDLFGA 4824912 (1)
4829424 GFETVSTCLCWSFLYLIHYPEIQARIQEEI 4829513 (1)
4829611 RPPRFEDRKILPYTEAFVSEVFRHASFLPFTIPHS 4829715 (2)
4832677 TTADTTLNGYFIPRKTCTFINMYQVNHDE 4832763 (2)
4835676 TIWDNHSLFRPDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFITTVLQQFK 4835858
4835859 LKK*PRAKLDLTPTYGLVMRPKLYQLQAELHPSGSSSA* 4835975

>CYP1B1 NM_000104
MGTSLSPNDPWPLNPLSIQQTTLLLLLSVLATVHVGQRLLRQRR
RQLRSAPPGPFAWPLIGNAAAVGQAAHLSFARLARRYGDVFQIRLGSCPIVVLNGERA
IHQALVQQGSAFADRPAFASFRVVSGGRSMAFGHYSEHWKVQRRAAHSMMRNFFTRQP
RSRQVLEGHVLSEARELVALLVRGSADGAFLDPRPLTVVAVANVMSAVCFGCRYSHDD
PEFRELLSHNEEFGRTVGAGSLVDVMPWLQYFPNPVRTVFREFEQLNRNFSNFILDKF
LRHCESLRPGAAPRDMMDAFILSAEKKAAGDSHGGGARLDLENVPATITDIFGASQDT
LSTALQWLLLLFTRYPDVQTRVQAELDQVVGRDRLPCMGDQPNLPYVLAFLYEAMRFS
SFVPVTIPHATTANTSVLGYHIPKDTVVFVNQWSVNHDPVKWPNPENFDPARFLDKDG
LINKDLTSRVMIFSVGKRRCIGEELSKMQLFLFISILAHQCDFRANPNEPAKMNFSYG
LTIKPKSFKVNVTLRESMELLDSAVQNLQAKETCQ

>CYP2A6 NM_000762
MLASGMLLVALLVCLTVMVLMSVWQQRKSKGKLPPGPTPLPFIGNYLQLNTEQMYNSLMK
ISERYGPVFTIHLGPRRVVVLCGHDAVREALVDQAEEFSGRGEQATFDWVFKGY
GVVFSNGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIDALRGTG
GANIDPTFFLSRTVSNVISSIVFGDRFDYKDKEFLSLLRMMLGIFQFTSTSTGQ
LYEMFSSVMKHLPGPQQQAFQLLQGLEDFIAKKVEHNQRTLDPNSPRDFIDSFLIRMQE
EEKNPNTEFYLKNLVMTTLNLFIGGTETVSTTLRYGFLLLMKHPEVE
AKVHEEIDRVIGKNRQPKFEDRAKMPYMEAVIHEIQRFGDVIPMSLARRVKKDTKFRDFFLPK
GTEVYPMLGSVLRDPSFFSNPQDFNPQHFLNEKGQFKKSDAFVPFSIG
KRNCFGEGLARMELFLFFTTVMQNFRLKSSQSPKDIDVSPKHVGFATIPRNYTMSFLPR

>CYP2A6 NT_011109.13 - strand
3511847 MLASGMLLVALLVCLTVMVLMSVWQQRKSKGKLPPGPTPLPFIGNYLQLNTEQMYNSLMK 3511668
3511401 ISERYGPVFTIHLGPRRVVVLCGHDAVREALVDQAEEFSGRGEQATFDWVFKGY 3511240
3510185 GVVFSNGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIDALRGTG 3510036
3509801 GANIDPTFFLSRTVSNVISSIVFGDRFDYKDKEFLSLLRMMLGIFQFTSTSTGQ 3509640
3508472 LYEMFSSVMKHLPGPQQQAFQLLQGLEDFIAKKVEHNQRTLDPNSPRDFIDSFLIRMQE 3508296
3507518 EEKNPNTEFYLKNLVMTTLNLFIGGTETVSTTLRYGFLLLMKHPEVE 3507378
3506903 AKVHEEIDRVIGKNRQPKFEDRAKMPYMEAVIHEIQRFGDVIPMSLARRVKKDTKFRDFFLPK 3506715
3506193 GTEVFPMLGSVLRDPSFFSNPQDFNPQHFLNEKGQFKKSDAFVPFSIG 3506050
3505396 KRNCFGEGLARMELFLFFTTVMQNFRLKSSQSPKDIDVSPKHVGFATIPRNYTMSFLPR 3505220

>CYP2A7v1 related AC008537.3 Exon 8 is different from 2A7 sequence U22029
but nearly identical to M33317.  This is the 2A7 wt gene sequence
3 diffs to 2A6, 6 diffs to 2A7, other exons 100% to 2A7
MLASGLLLVALLACLTVMVLMSVWQQRKSRGKLPPGPTPLPFIGNYLQLNTEHICDSIMK
FSECYGPVFTIHLGPRRVVVLCGHDAVREALVDQAEEFSGRGEQATFDWVFKGY
GVAFSNGERAKQLLRFAIATLRDFGVGKRGIEERIQEESGFLIEAIRSTH
GANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLSMMLGIFQFTSTSTGQ
LYEMFSSVMKHLPGPQQQAFKLLQGLEDFIAKKVEHNQRTLDPNSPQDFIDSFLIHMQE
EEKNPNTEFYLKNLMMSTLNLFIAGTETVSTTLRYGFLLLMKHPEVE
AKVHEEIDRVIGKNRQPKFEDRTKMPYMEAVIHEIQRFGDVIPMSLARRVKKDTKFRDFFLPK
GTEVFPMLGSVLRDPSFFSNPQDFNPQHFLDDKGQFKKSDAFVPFSIG 
KRNCFGEGLARMELFLFFTTVMQNFRLKSSQSPKDIDVSPKHVVFATIPRNYTMSFLPR

>Another 2A Separate C-term on AC008537.5 minus strand missing exons 6, 8
105289 MLASGLLLVALLACLTVMVLMSVWQQRKSRGKLPPGPTPLPFIGNYLQLNTEHICDSIMK 105110
104830 FSECYGPVFTIHLGPRRVVVLCGHDAVREALVDQAEEFSGRGEQATFDWVFKGY 104669
103708 GVAFSNGERAKQLLRFAIATLRDFGVGKRGIEERIQEESGFLIEAIRSTH 103559
103324 GANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLSMMLGIFQFTSTSTGQ 103163
102015 LYEMFSSVMKHLPGPQQQAFKLLQGLEDFIAKKVEHNQRTLDPNSPQDFIDSFLIHMQE 101839
100457 AKVHEEIDRVIGKNRQPKFEDRTKMPYMEAVIHEIQRFGDVIPMSLARRVKKDTKFRDFFLPK 100269

 98951 KRNCFGEGLARMELFLFFTTVMQNFRLKSSQSPKDIDVSPKHVVFATIPRNYTMSFLP 98778

>CYP2A7v1 NT_011109.13 -strand, 2A7 wt gene sequence, same as AC008537.3 
Exon 8 is different from U22029 sequence at 6 aa
3543631 MLASGLLLVALLACLTVMVLMSVWQQRKSRGKLPPGPTPLPFIGNYLQLNTEHICDSIMK 3543452
3543172 FSECYGPVFTIHLGPRRVVVLCGHDAVREALVDQAEEFSGRGEQATFDWVFKGY 3543011
3542050 GVAFSNGERAKQLLRFAIATLRDFGVGKRGIEERIQEESGFLIEAIRSTH 3541901
3541666 GANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLSMMLGIFQFTSTSTGQ 3541505
3540357 LYEMFSSVMKHLPGPQQQAFKLLQGLEDFIAKKVEHNQRTLDPNSPQDFIDSFLIHMQE 3540181
3539414 EEKNPNTEFYLKNLMMSTLNLFIAGTETVSTTLRYGFLLLMKHPEVE 3539274
3538799 AKVHEEIDRVIGKNRQPKFEDRTKMPYMEAVIHEIQRFGDVIPMSLARRVKKDTKFRDFFLPK 3538611
3538089 GTEVFPMLGSVLRDPSFFSNPQDFNPQHFLDDKGQFKKSDAFVPFSIG 3537946
3537293 KRNCFGEGLARMELFLFFTTVMQNFRLKSSQSPKDIDVSPKHVVFATIPRNYTMSFLPR 3537117

>CYP2A7v1 M33317 4 aa diffs with 2A7 related AC008537.3
This is probably the wt 2A7 sequence with 4 nucleotide sequence errors
MLASGLLLVALLACLTVMVLMSVWQQRKSRGKLPPGPTPLPFIG
NYLQLNTEHICDSIMKFSECYGPVFTIHLGPRRVVVLCGHDAVREALVDQAEEFSGRG
EQATFDWVFKGYGVAFSNGERAKQLLRFAIATLRDFGVGKRGIEERIQEESGFLIEAI
RSSHGANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLSMMLGIFQFTSTSTGQ
LYEMFSSVMKHLPGPQQQAFKLLQGLEDFIAKKVEHNQRTLDPNSPQDFIDSFLIHMQ
EEEKNPNTEFYLKNLMMSTLNLFIAGTETVSTTLRYGFLLLMKHPEVEAKVHEEIDRV
IGKNRQPKFEDRTKMPYMEAVIHEIQRFGDVIPMSLARRVKKDTKFRDFFLPK
GTEVFPMLGSVLRDPSFFSNPQDFNPQHFLDDKGQFKKSDAFVPFSI
GKRYCFGEGLARMELFLFFTTVMQNFRLKSSQSPKDIDVSSKHVGFATIPRNYTMSFLPR

>CYP2A7v2 NM_000764 U22029 possible gene conversion with 2A18PC at exon 8
This represents an allele of CYP2A7
MLASGLLLVALLACLTVMVLMSVWQQRKSRGKLPPGPTPLPFIG
NYLQLNTEHICDSIMKFSECYGPVFTIHLGPRRVVVLCGHDAVREALVDQAEEFSGRG
EQATFEWVFKGYGVAFSNGERAKQLLRFAIATLRDFGVGKRGIEERIQEESGFLIEAI
RSTHGANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLSMMLGIFQFTSTSTGQ
LYEMFSSLMKHLPGPQQQAFKLLLGLEDFIAKKVEHNQRTLDPNSPQDFIDSFLIHMQ
EEEKNPNTEFYLKNLMMSTLNLFIAGTETVSTTLRYGFLLLMKHPEVEAKVHEEIDRV
IGKNRQPKFEDRTKMPYMEAVIHEIQRFGDVIPMSLARRVKKDTKFRDFFLPK
GTEVFPMLGSVLRDLRFFSNPRDFNPQHFLGEKGQFKKRDAFVPFSIR
KRNCFGEGLARMELFLFFTTVMQNFRLKSSQSPKDIDVSPKHVVFATIPRNYTMSFLPR

>CYP2A7_v1axD2a NM_030589 family 2, subfamily A, polypeptide 7 isoform 2 
the deletion transcript, protein_id="NP_085079.2
Ding S, Lake BG, Friedberg T, Wolf CR. Expression and alternative splicing of the cytochrome P-450 CYP2A7.
Biochem J. 1995 Feb 15;306 ( Pt 1):161-6. alternative splice variant 
MLASGLLLVALLACLTVMVLMSVWQQRKSRGKLPPGPTPLPFIGNYLQLNTEHICDSIMKVSQ
(deleted exon 2)
GVAFSNGERAKQLLRFAIATLRDFGVGKRGIEERIQEES
GFLIEAIRSTHGANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLSMMLGIFQF
TSTSTGQLYEMFSSVMKHLPGPQQQAFKLLQGLEDFIAKKVEHNQRTLDPNSPQDFID
SFLIHMQEEEKNPNTEFYLKNLMMSTLNLFIAGTETVSTTLRYGFLLLMKHPEVEAKV
HEEIDRVIGKNRQPKFEDRTKMPYMEAVIHEIQRFGDVIPMSLARRVKKDTKFRDFFL
PKGTEVFPMLGSVLRDPSFFSNPQDFNPQHFLDDKGQFKKSDAFVPFSIGKRNCFGEG
LARMELFLFFTTVMQNFRLKSSQSPKDIDVSPKHVVFATIPRNYTMSFLPR

>2A7 related Separate C-term on AC008537.3 minus strand missing exons 3, 6, 8
MLASGLLLVALLACLTVMVLMSVWQQRKSRGKLPPGPTPLPFIGNYLQLNTEHICDSIMK 
FSECYGPVFTIHLGPRRVVVLCGHDAVREALVDQAEEFSGRGEQATFDWVFKGY 
GANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLSMMLGIFQFTSTSTGQ
LYEMFSSVMKHLPGPQQQAFKLLQGLEDFIAKKVEHNQRTLDPNSPQDFIDSFLIHMQE 
AKVHEEIDRVIGKNRQPKFEDRTKMPYMEAVIHEIQRFGDVIPMSLARRVKKDTKFRDFFLPK
KRNCFGEGLARMELFLFFTTVMQNFRLKSSQSPKDIDVSPKHVVFATIPRNYTMSFLP 

>CYP2A13 U22028
MLASGLLLVTLLACLTVMVLMSVWRQRKSRGKLPPGPTPLPFIGNYLQLNTEQMYNSLMK
ISERYGPVFTIHLGPRRVVVLCGHDAVKEALVDQAEEFSGRGEQATFDWLFKGY
GVAFSNGERAKQLRRFSIATLRGFGVGKRGIEERIQEEAGFLIDALRGTH
GANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGRFQFTGTSTGQ
LYEMFSSVMKHLPGPQQQAFKELQGLEDFIAKKVEHNQRTLDPNSPRDFIDSFLIRMQ
EEEKNPNTEFYLKNLVMTTLNLFFAGTETVSTTLRYGFLLLMKHPEVE
AKVHEEIDRVIGKNRQPKFEDRAKMPYTEAVIHEIQRFGDMLPMGLAHRVNKDTKFRDFFLPK
GTEVFPMLGSELRDPRFFSNPQDCSPQHFLDEKGQFKKSDAFVPFSI
GKRYCFGEGLARMELFLFFTTIMQNFRFKSPQSPKDIDVSPKHVGFATIPRNYTMSFLPR

>CYP2A13 NT_011109.13 + strand
3749893 MLASGLLLVTLLACLTVMVLMSVWRQRKSRGKLPPGPTPLPFIGNYLQLNTEQMYNSLMK 3750072
3750350 ISERYGPVFTIHLGPRRVVVLCGHDAVKEALVDQAEEFSGRGEQATFDWLFKGY 3750511
3751467 GVAFSNGERAKQLRRFSIATLRGFGVGKRGIEERIQEEAGFLIDALRGTH 3751616
3751824 GANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSTGQ 3751985
3753153 LYEMFSSVMKHLPGPQQQAFKELQGLEDFIAKKVEHNQRTLDPNSPRDFIDSFLIRMQ 3753326 
3755048 EEEKNPNTEFYLKNLVMTTLNLFFAGTETVSTTLRYGFLLLMKHPEVE 3755191
3755665 AKVHEEIDRVIGKNRQPKFEDRAKMPYTEAVIHEIQRFGDMLPMGLAHRVNKDTKFRDFFLPK 3755853
3756380 GTEVFPMLGSVLRDPRFFSNPRDFNPQHFLDKKGQFKKSDAFVPFSI 3756520
3757180 GKRYCFGEGLARMELFLFFTTIMQNFRFKSPQSPKDIDVSPKHVGFATIPRNYTMSFLPR 3757359

>CYP2A13 related gene AC058798.1   6 diffs with 2A13 runs off end of contig
may be seq errors [not found in build 33 of human genome, probably 2A13 with errors]
MLASGLLLVTLLACLTVMVLMSVWRQRKSRGKLPPGPTPLPFIGNYLQLNTEQMYNSLMK
ISERYGPVFTIHLGPRRVVVLCGHDAVKEALVDQAEEFSGRGEQATFDWLFKGYG
VAFNNGERAKHLPRFSIATLRGFGVGKRGIEEHIQEEAGFLIHSLRGTHG

>CYP2A18PC split pseudogene AC008537.2 first line = 2G1 seq.
EEKNPNTEFYLKNLVLTTLNLFVGGTETVSTTLHYGFLLLMKHPEVE
AKVHEIDRVIGKNQQPKFEDRAKTLYTEAVIHEIQRFGDLLPMGVSRRVKKDTKFRDFFLSK
GIEVFPMLGSVLRDLRFFSNPRDFNPQHFLGEKGQFKKRDAFVPFSI
GRRNCFREGLARMELFLYLTTIMQNFRFKSRQSPKDIVVSPKHVGFVTIPRNYTKCYLP

>CYP2A18PC NT_011109.13 - strand split pseudogene
3572108 EEKNPNTEFYLKNLVLTTLNLFVGGTETVSTTLHYGFLLLMKHPEVE 3571968
3571499 AKVHEIDRVIGKNQQPKFEDRAKTLYTEAVIHEIQRFGDLLPMGVSRRVKKDTKFRDFFLSK 3571314
3570793 GIEVFPMLGSVLRDLRFFSNPRDFNPQHFLGEKGQFKKRDAFVPFSI 3570653
3570075 GRRNCFREGLARMELFLYLTTIMQNFRFKSRQSPKDIVVSPKHVGFVTIPRNYTKCYLP 3569899

>CYP2A18PN = CYP2A7PT U22030 and 2A7PC U22044 split pseudogene
MLASGLLLVALLASLTVMVLMSVWQQRKSRGKLPLGPTPLLFIGNYLQLNTEHICDSIMK 
ISERYGPVFTIHLGPRRVVVLCGHDAVKEALVDQAEEFSGRGEQATFDWLFKGY
GVTCRTWERTKPLRRFSIATLRDFGVGKRGIKE 
IQEKAGFLIKAV*GTRG
SSIYPTFFLSRTTSNVISSIVFGDRFDYEDK 
KFLSLLCMMLESFQFTAPSTGE
LYEMFSSVMKHLPGPQQQAFKELQGLEDFIAKKVEHNQCTLDPNSPRDFIDSFLIRMQ 

>CYP2A18PN NT_011109.13 - strand split pseudogene
3689143 MLASGLLLVALLASLTVMVLMSVWQQRKSRGKLPLGPTPLLFIGNYLQLNTEHICDSIMK 3688964
3688685 ISERYGPVFTIHLGPRRVVVLCGHDAVKEALVDQAEEFSGRGEQATFDWLFKGY 3688524
3687579 GVTCRTWERTKPLRRFSIATLRDFGVGKRGIKE 3687481
3687478 IQEKAGFLIKAV*GTRG 3687428
3687181 SSIYPTFFLSRTTSNVISSIVFGDRFDYEDK 3687089
3687089 KFLSLLCMMLESFQFTAPSTGQ 3687024
3685861 LYEMFSSVMKHLPGPQQQAFKELQGLEDFIAKKVEHNQCTLDPNSPRDFIDSFLIRMQ 3685688

>CYP2B6 AC023172.1 CDS (hIIB1) cryptic exon 3A = 18813-18856 (hIIB2)
MELSVLLFLALLTGLLLLLVQRHPNTHDRLPPGPRPLPLLGNLLQMDRRGLLKSFLR (0)
FREKYGDVFTVHLGPRPVVMLCGVEAIREALVDKAEAFSGRGKIAMVDPFFRGY (1)
GVIFANGNRWKVLRRFSVTTMRDFGMGKRSVEERIQEEAQCLIEELRKSK (1)
GALMDPTFLFQSITANIICSIVFGKRFHYQDQEFLKMLNLFYQTFSLISSVFGQ (0)
LFELFSGFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPKDLIDTYLLHMEK (0)
EKSNAHSEFSHQNLNLNTLSLFFAGTETTSTTLRYGFLLMLKYPHVA (1)
ERVYREIEQVIGPHRPPELHDRAKMPYTEAVIYEIQRFSDLLPMGVPHIVTQHTSFRGYIIPK (0)
DTEVFLILSTALHDPHYFEKPDAFNPDHFLDANGALKKTEAFIPFSL (1)
GKRICLGEGIARAELFLFFTTILQNFSMASPVAPEDIDLTPQECGVGKIPPTYQIRFLPR*

>CYP2B6 NT_011109.13 + strand
3652727 MELSVLLFLALLTGLLLLLVQRHPNTHDRLPPGPRPLPLLGNLLQMDRRGLLKSFLR 3652897
3665422 FREKYGDVFTVHLGPRPVVMLCGVEAIREALVDKAEAFSGRGKIAMVDPFFRGY 3665583
3665717 GVIFANGNRWKVLRRFSVTTMRDFGMGKRSVEERIQEEAQCLIEELRKSK 3665866
3668325 GALMDPTFLFQSITANIICSIVFGKRFHYQDQEFLKMLNLFYQTFSLISSVFGQ 3668486
3670640 LFELFSGFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPKDLIDTYLLHMEK 3670816
3671415 EKSNAHSEFSHQNLNLNTLSLFFAGTETTSTTLRYGFLLMLKYPHVA 3671555
3673718 ERVYREIEQVIGPHRPPELHDRAKMPYTEAVIYEIQRFSDLLPMGVPHIVTQHTSFRGYIIPK 3673906
3674095 DTEVFLILSTALHDPHYFEKPDAFNPDHFLDANGALKKTEAFIPFSLGK 3674241
3678066 GKRICLGEGIARAELFLFFTTILQNFSMASPVAPEDIDLTPQECGVGKIPPTYQIRFLPR* 3678248

>AF182277 2B6 allele, 5 amino acid differences Nov 29 1999 mRNA 
shares two amino acid changes with CYP2B6*6 allele
MELSVLLFLALLTGLLLLLVQRHPNTHDRLPPGPRPLPLLGNLL
QMDRRGLLKSFLRFREKYGDVFTVHLGPRPVVMLCGVEAIREALVDKAEAFSGRGKIA
MVDPFFRGYGVIFANGNRWKVLRRFSVTTMRDFGMGKRSVEERTQEEAQCLIEELRKS
KGALMDATFLFHSITANIICSIVFGKRFHYQDQEFLKMLNLFYQTFSLISSVFGQLFE
LFSGFLKYFPGAHRQVYKNPQEINAYIGHSVEKHRETLDPSAPRDLIDTYLLHMEKEK
SNAHSEFSHQNLNLNTLSLFFAGTETTSTTLRYGFLLMLKYPHVAERVYREIEQVIGP
HRPPELHDRAKMPYTEAVIYEIQRFSDLLPMGVPHIVTQHTSFRGYIIPKDTEVFLIL
STALHDPHYFEKPDAFNPDHFLDANGALKKTEAFIPFSLGKRICLGEGIARAELFLFF
TTILQNFSMASPVAPEDIDLTPQECGVGKIPPTYQIRFLPR

>CYP2B7P1 = M29873 (hIIB3) AC008537.2 91% to 2B6 missing last exon
MELSVLLFLALLTGLLLLLVQRHPNSHGTLPPGPRPLPLLGNLLQMDRRGLLKSFLR
FREKYGDVFTVHLGPRPVVMLCGVEAIREALVDNAEAFSGRGKIVIMDPVYQGY
GMLFANGNRWKVLRRFSVTTMRDFGMGKRSVEERIQDEAQCLIEELRKSKG
ALVDPTFLFHSITANIICSIIFGKRFHYQDQEFLKTLNLFCQSFLLISSISSQ
LFELFSGFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPRDLIDTYLLHMEK
EKSNPHSEFSHQNLIINTLSLFFAGTETTSTTLRYGFLLMLKYPHVA
ERVYKEIEQVVGPHRPPALDDRAKMPYTEAVIREIQRFADLLPMGVPHIVTQHTSF*GYTIPK
DTEVFLILSTALRDPHYFEKPDAFNPDHFLDANGALKKNEAFIPFSLGK

>CYP2B7P1 NT_011109.13 + strand
3585694 MELSVLLFLALLTGLLLLLVQRHPNSHGTLPPGPRPLPLLGNLLQMDRRGLLKSFLR 3585864
3597518 FREKYGDVFTVHLGPRPVVMLCGVEAIREALVDNAEAFSGRGKIVIMDPVYQGY 3597679
3597812 GMLFANGNRWKVLRRFSVTTMRDFGMGKRSVEERIQDEAQCLIEELRKSKG 3597964
3600392 ALVDPTFLFHSITANIICSIIFGKRFHYQDQEFLKTLNLFCQSFLLISSISSQ 3600550
3602717 LFELFSGFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPRDLIDTYLLHMEK 3602893
3603486 EKSNPHSEFSHQNLIINTLSLFFAGTETTSTTLRYGFLLMLKYPHVA 3603626
3605751 ERVYKEIEQVVGPHRPPALDDRAKMPYTEAVIREIQRFADLLPMGVPHIVTQHTSF*GYTIPK 3605939
3606132 DTEVFLILSTALRDPHYFEKPDAFNPDHFLDANGALKKNEAFIPFSLGK 3606278
3610473 GKRICLGEGIARAELFLFFTTILQNFSVASPVAPEDIDLTPQECGVGKIPPTYQICFLPR* 3610655

>CYP2C8 M17397
MEPFVVLVLCLSFMLLFSLWRQSCRRRKLPPGPTPLPIIGNMLQIDVKDICKSFTN
FSKVYGPVFTVYFGMNPIVVFHGYEAVKEALIDNGEEFSGRGNSPISQRITKG
LGIISSNGKRWKEIRRFSLTNLRNFGMGKRSIEDRVQEEAHCLVEELRKTK
ASPCDPTFILGCAPCNVICSVVFQKRFDYKDQNFLTLMKRFNENFRILNSPWIQ
VCNNFPLLIDCFPGTHNKVLKNVALTRSYIREKVKEHQASLDVNNPRDFMDCFLIKMEQ
EKDNQKSEFNIENLVGTVADLFVAGTETTSTTLRYGLLLLLKHPEVT
AKVQEEIDHVIGRHRSPCMQDRSHMPYTDAVVHEIQRYSDLVPTGVPHAVTTDTKFRNYLIPK
GTTIMALLTSVLHDDKEFPNPNIFDPGHFLDKNGNFKKSDYFMPFSA
GKRICAGEGLARMELFLFLTTILQNFNLKSVDDLKNLNTTAVTKGIVSLPPSYQICFIPV

>CYP2C8 NT_030059.8 - strand
2093794 MEPFVVLVLCLSFMLLFSLWRQSCRRRKLPPGPTPLPIIGNMLQIDVKDICKSFTN 2093627
2092083 FSKVYGPVFTVYFGMNPIVVFHGYEAVKEALIDNGEEFSGRGNSPISQRITKG 2091925
2091753 LGIISSNGKRWKEIRRFSLTTLRNFGMGKRSIEDRVQEEAHCLVEELRKTK 2091601
2089353 ASPCDPTFILGCAPCNVICSVVFQKRFDYKDQNFLTLMKRFNENFRILNSPWIQ 2089192
2082903 VCNNFPLLIDCFPGTHNKVLKNVALTRSYIREKVKEHQASLDVNNPRDFIDCFLIKMEQ 2082727
2070343 EKDNQKSEFNIENLVGTVADLFVAGTETTSTTLRYGLLLLLKHPEVT 2070203
2067470 AKVQEEIDHVIGRHRSPCMQDRSHMPYTDAVVHEIQRYSDLVPTGVPHAVTTDTKFRNYLIPK 2067282
2063430 GTTIMALLTSVLHDDKEFPNPNIFDPGHFLDKNGNFKKSDYFMPFSA 2063290
2061702 GKRICAGEGLARMELFLFLTTILQNFNLKSVDDLKNLNTTAVTKGIVSLPPSYQICFIPV* 2061520

>CYP2C8-de6b old name 2C60P NT_008769.11|Hs10_8926 lone exon 6 between 2C9 and 2C8
8439669 EKDNQPLKFTIENLVGNVPDLFVAGTEMTSTTLRYGLLLLLKHPELTGR 8439815

>CYP2C8-de6b old name CYP2C60P NT_030059.8 + strand exon 6
2034065 EKDNQPLKFTIENLVGNVPDLFVAGTEMTSTTLRYGLLLLLKHPELT 2034205

>CYP2C9 M61857
MDSLVVLVLCLSCLLLLSLWRQSSGRGKLPPGPTPLPVIGNILQIGIKDISKSLTN
LSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAERANRG
FGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTK
ASPCDPTFILGCAPCNVICSIIFHKRFDYKDQQFLNLMEKLNENIKILSSPWIQ
ICNNFSPIIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEK
EKHNQPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVT
AKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLPTSLPHAVTCDIKFRNYLIPK
GTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSA
GKRICVGEALAGMELFLFLTSILQNFNLKSLVDPKNLDTTPVVNGFASVPPFYQLCFIPV

>CYP2C9 NT_030059.8 + strand
1963075 MDSLVVLVLCLSCLLLLSLWRQSSGRGKLPPGPTPLPVIGNILQIGIKDISKSLTN 1963242
1966250 LSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAERANRG 1966408
1966580 FGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTK 1966732
1972170 ASPCDPTFILGCAPCNVICSIIFHKRFDYKDQQFLNLMEKLNENIKILSSPWIQ 1972331
1973500 ICNNFSPIIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEK 1973676
1996496 EKHNQPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVT 1996636
2005574 AKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLPTSLPHAVTCDIKFRNYLIPK 2005762
2010425 GTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSA 2010565
2013238 GKRICVGEALAGMELFLFLTSILQNFNLKSLVDPKNLDTTPVVNGFASVPPFYQLCFIPV* 2013420

>CYP2C9-de1b AL133513.12 3 diffs with 2C18 lone exon 1 plus strand downstream of 2C19
114929 MDPAVALVLCLSCLFLLSLWRQSSGRGRLLFGPTPLLIIGNILQLDVKDMSKSLTNVSMLYAPL 115120

>CYP2C9-de1b NT_008769.11|Hs10_8926 lone exon 1 32kb upstream of 2C9 
same as AL133513.12, might work for alt splice
8335895 MDPAVALVLCLSCLFLLSLWRQSSGRGRLLFGPTPLLIIGNILQLDVKDMSKSLTNVSMLYAPL 8336086

>CYP2C9-de1b NT_030059.8 + strand
1930291 MDPAVALVLCLSCLFLLSLWRQSSGRGRLLFGPTPLLIIGNILQLDVKDMSKSLTN 1930458

>CYP2C9-de2c3c old name CYP2C59P NT_030059.8 - strand exon 2,3
2031948 FSKVYVPVFTVYFDIKLVLELHGYEVVKEALIDHGEEFSGKGIFPVSKKS**G 2031790
2031601 XXIIFSNGKRCKDIWLFLLMTLWNCRMVKRS 2031515
2031516 MEKHVQGEAQCLRQELRRTK 2031454

>CYP2C9-de2c3c old name 2C59P NT_008769.11|Hs10_8926 lone exons 2,3 
between 2C9 and 2C8
8437561 LSQFSKVYVPVFTVYFDIKLVLELHGYEVVKEALIDHGEEFSGKGIFPVSKKS**G 8437394
8437211 FRIIFSNGKRCKDIWLFLLMTLWNCRMVKRS 8437119
8437115 MEKHVQGEAQCLRQELRRTK 8437058

>CYP2C18 M61856
MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTN
FSKVYGPVFTVYFGLKPIVVLHGYEAVKEALIDHGEEFSGRGSFPVAEKVNKG
LGILFSNGKRWKEIRRFCLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTN
ASPCDPTFILGCAPCNVICSVIFHDRFDYKDQRFLNLMEKFNENLRILSSPWIQ
VCNNFPALIDYLPGSHNKIAENFAYIKSYVLERIKEHQESLDMNSARDFIDCFLIKMEQ
EKHNQQSEFTVESLIATVTDMFGAGTETTSTTLRYGLLLLLKYPEVT
AKVQEEIECVVGRNRSPCMQDRSHMPYTDAVVHEIQRYIDLLPTNLPHAVTCDVKFKNYLIPK
GTTIITSLTSVLHNDKEFPNPEMFDPGHFLDKSGNFKKSDYFMPFSA
GKRMCMGEGLARMELFLFLTTILQNFNLKSQVDPKDIDITPIANAFGRVPPLYQLCFIPV

>CYP2C18 NT_030059.8 + strand
1708213 MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTN 1708380
1712163 FSKVYGPVFTVYFGLKPIVVLHGYEAVKEALIDHGEEFSGRGSFPVAEKVNKG 1712321
1712514 LGILFSNGKRWKEIRRFCLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTN 1712666
1719309 ASPCDPTFILGCAPCNVICSVIFHDRFDYKDQRFLNLMEKFNENLRILSSPWIQ 1719470
1731177 VCNNFPALIDYLPGSHNKIAENFAYIKSYVLERIKEHQESLDMNSARDFIDCFLIKMEQ 1731353
1744789 EKHNQQSEFTVESLIATVTDMFGAGTETTSTTLRYGLLLLLKYPEVT 1744929
1748738 AKVQEEIECVVGRNRSPCMQDRSHMPYTDAVVHEIQRYIDLLPTNLPHAVTCDVKFKNYLIPK 1748926
1757690 GTTIITSLTSVLHNDKEFPNPEMFDPGHFLDKSGNFKKSDYFMPFSA 1757830
1759655 GKRMCMGEGLARMELFLFLTTILQNFNLKSQVDPKDIDITPIANAFGRVPPLYQLCFIPV* 1759837

>CYP2C19 M61854
MDPFVVLVLCLSCLLLLSIWRQSSGRGKLPPGPTPLPVIGNILQIDIKDVSKSLTN
LSKIYGPVFTLYFGLERMVVLHGYEVVKEALIDLGEEFSGRGHFPLAERANRG
FGIVFSNGKRWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTK
ASPCDPTFILGCAPCNVICSIIFQKRFDYKDQQFLNLMEKLNENIRIVSTPWIQ
ICNNFPTIIDYFPGTHNKLLKNLAFMESDILEKVKEHQESMDINNPRDFIDCFLIKMEK
EKQNQQSEFTIENLVITAADLLGAGTETTSTTLRYALLLLLKHPEVT
AKVQEEIERVIGRNRSPCMQDRGHMPYTDAVVHEVQRYIDLIPTSLPHAVTCDVKFRNYLIPK
GTTILTSLTSVLHDNKEFPNPEMFDPRHFLDEGGNFKKSNYFMPFSA
GKRICVGEGLARMELFLFLTFILQNFNLKSLIDPKDLDTTPVVNGFASVPPFYQLCFIPV

>CYP2C19 NT_030059.8 + strand
1787099 MDPFVVLVLCLSCLLLLSIWRQSSGRGKLPPGPTPLPVIGNILQIDIKDVSKSLTN 1787266
1799451 LSKIYGPVFTLYFGLERMVVLHGYEVVKEALIDLGEEFSGRGHFPLAERANRG 1799609
1799779 FGIVFSNGKRWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTK 1799931
1804891 ASPCDPTFILGCAPCNVICSIIFQKRFDYKDQQFLNLMEKLNENIRIVSTPWIQ 1805052
1806214 ICNNFPTIIDYFPGTHNKLLKNLAFMESDILEKVKEHQESMDINNPRDFIDCFLIKMEK 1806390
1844889 EKQNQQSEFTIENLVITAADLLGAGTETTSTTLRYALLLLLKHPEVT 1845029
1867229 AKVQEEIERVVGRNRSPCMQDRGHMPYTDAVVHEVQRYIDLIPTSLPHAVTCDVKFRNYLIPK 1867417
1874310 GTTILTSLTSVLHDNKEFPNPEMFDPRHFLDEGGNFKKSNYFMPFSA 1874450
1877125 GKRICVGEGLARMELFLFLTFILQNFNLKSLIDPKDLDTTPVVNGFASVPPFYQLCFIPV* 1877307

>CYP2C58P AL133513.12 2C pseudogene N-terminal first 160 aa minus strand
82160 LDLAAVLMLCLSCLLLLSL*TQISGRGKLPSDSTPLQVIESILQMADKDICKSSSNLSTL 81981
75345 SLYFDMKLVLVLHGYEVLKKALIHHGEEFSGKGIFPVSKK 75226
75033 IIFSNRKPCKEIWPFLLMTLWNCGVVKRS 74947
74945 LGKHVQVEAHCIVWELRRTK 74886

>CYP2C58P NT_030059.8 - strand exons 1,2,3
1897522 LDLAAVLMLCLSCLLLLSL*TQISGRGKLPSDSTPLQVIESILQMADKDICKSSSNLSTL 1897343
1890755 SLYFDMKLVLVLHGYEVLKKALIHHGEEFSGKGIFPVSKK 1890588
1890413 IIFSNRKPCKEIWPFLLMTLWNCGVVKRS 1890309
1890307 LGKHVQVEAHCIVWELRRTK 1890248

>CYP2C58P NT_008769.11|Hs10_8926 lone exons 1,2,3 between 2C19 and 2C9 
same as AL133513.12
8303126 LDLAAVLMLCLSCLLLLSL*TQISGRGKLPSDSTPLQVIESILQMADKDICKSSSNLSTLY 8302944 
8296311 SLYFDMKLVLVLHGYEVLKKALIHHGEEFSGKGIFPVSKK 8296192
8295999 IIFSNRKPCKEIWPFLLMTLWNCGVVKRS 8295913
8295911 LGKHVQVEAHCIVWELRRTK 8295852

>CYP2C62P AL138921 NT_030059 chromosome 10 50% to 2C8
Chr10q24.31 101999343-102031105 - strand build 33
LAMCVTCLIFFLVWKKSPSPTPLPTIGNRLQRNPKD
CISFQLAKEYSSVYTLYFGSWPTMVFHGYKAVKEALIDQGDKSLGRGHIPIIDDAQKRY
TSAQPFDSTFILASAPCNL
CSFLFKECFQYKNETFLSLMGLLNENVK
TTVLPLLSLVLFSYKQFPXXXXXXXGHFLDKNGCFNKTDYFLPFSLGK

>CYP2C-se1[7] 2C56P NT_022154.9|Hs2_22310 2C pseudogene fragment chr 2 exon 7
Chr2q24.3 165142570-165142755 + strand Build 33
1768955 SKVQEETDHAVGRHWRPCMQDRSHMPYTEAMVHEVQRH*PHPTNVPHALTSDIKFRNYLLPK 1769140

>CYP2C-se2[1:2] old name 2C61P NT_008583.11|Hs10_8740 
chromosome 10 pseudogene frag parts of ex 1 and 2, 
Chr10q21.3 66415290-66415135 - strand, Build 33 in MER1_type repeat
1832658 KGKLPHDLTSFLFVGNILQLNSKNLSKSITMLAKDYGPGFTVYFGIKPTVVV 1832813

>CYP2C-se3[1] old name 2C63P NT_011512.5|Hs21_11669 chromosome 21 
51% to 2C9 exon 1, chr21q21.2 25740563-25740423 build 33 - strand
bracketed by L1 repeats
12398358 CPSCLILLFLWNGSYAKGKLLPGPIPLPIV*NILPLRSMNTSKSISM 12398218

>CYP2C-se4[1] old name 2C64P NT_011602.7|HsX_11759 2C pseudogene fragment chr X 57% to 2C8 exon 1, ChrXq28 147659303-147659476 + strand Build 33
inside MTMR1 intron 3 (myotubularin-related protein 1)
435396 ASVDLAAVLVLFLSHFLFLSLWKQSSEREKLLPGPTPIRIIGNILELDLKDICKSLSD 435569

>CYP2D6 NM_000106
MGLEALVPLAVIVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFQNTPYCFDQ
LRRRFGDVFSLQLAWTPVVVLNGLAAVREALVTHGEDTADRPPVPITQILGFGPRSQ
GVFLARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFANHS
GRPFRPNGLLDKAVSNVIASLTCGRRFEYDDPRFLRLLDLAQEGLKEESGFLRE
VLNAVPVLLHIPALAGKVLRFQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEK
AKGNPESSFNDENLRIVVADLFSAGMVTTSTTLAWGLLLMILHPDVQ
RRVQQEIDDVIGQVRRPEMGDQAHMPYTTAVIHEVQRFGDIVPLGMTHMTSRDIEVQGFRIPKG
TTLITNLSSVLKDEAVWEKPFRFHPEHFLDAQGHFVKPEAFLPFSAGRRACLGEPLAR
MELFLFFTSLLQHFSFSVPTGQPRPSHHGVFAFLVSPSPYELCAVPR

>CYP2D7P1 = CYP2D7P M33387 
MGLEALVPLAMIVAIFLLLVDLMHRHQRWAARYPPGPLPLPGLGNL (fs) LHVDFQNTPYCFDQ (0)
LRRRFGDVFSLQLAWTPVVVLNGLAAVREAMVTRGEDTADRPPAPIYQVLGFGPRSQ bad boundary
(1) GVFLARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFADQA (1)
GPPFRPNGLLDKAVSNVIASLTCGRRFEYDDPRFLRLLDLAQEGLKEESGFLRE (0)
VLNAVPVLLHIPALAGKVLRFQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEK (0)
AKGSPESSFNDENLRIVVGNLFLAGMVTTLTTLAWGLLLMILHLDVQ (1)
VRVQQEIDDVIGQVRRPEMGDQAHMPYTTAVIHEVQRFGDIIPLSVTHMTSHDIEVQGFRIPK (0)
GTTLITNLSSVLKDEAVWKKPFRFHPEHFLDAQGHFVKPEAFLPFSA (1)
GRRACLGEPLARMELFLFFTSLLQHFSFSVAAGQPRPSHSRVVSFLVTPSPYELCAVPR*

>CYP2D7P2 = CYP2D7AP X58467 assembled to best match 2D6 
AL021878 comp(46171-50354)
MGLEALVPLAMIVAIFLLLVDLMHRHQRWAARYPPGPLPLPGLGNL (fs) LHVDFQNTPYCFDQ (0)
LRRRFGDVFSLQLAWTPVVVLNGLAAVREAMVTRGEDTADRPPAPIYQVLGFGPRSQ bad boundary
(1) GVILSRYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFADQA (1)
GRPFRPNGLLDKAVSNVIASLTCGRRFEYDDPRFLRLLDLAQEGLKEESGFLRE (0)
VLNAVPVLPHIPALAGKVLRFQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAKKEK (0)
AKGSPESSFNDENLRIVVGNLFLAGMVTTLTTLAWGLLLMILHLDVQ (1)
LRVQQEIDDVIGQVRRPEMGDQAHMPCTTAVIHEVQRFGDIIPLSVTHMTSRDIEVQGFRIPK (0)
GTTLITNLSSVLKDEAVWKKPFRFHPEHFLDAQGHFVKPEAFLPFSA (1)
GRRACLGEPLARMELFLFFTSLLQHFSFSVAAGQPRPSHSRVVSFLVTPSPYELCAVPR*

>CYP2D8P1 = CYP2D8P AL021878 comp(55779-60892)
MGLDALVPLAVTVAIFLLLVDLMQQHQRWTARYPPGPLPLPGLGNLLHVDFQNIYTFNQ (0)
LRHRFGDVFSLQLAWMPVVVLNGLAAVREALVTCGEDTADRPPAPIYQVLGIGPRSQ bad boundary
(1) GVFLAHYGHAWREQRRFSVSTLRNLGLGKKSLERWVTEEAACLCAAFADQA (1)
RRPFHPNGLLNKAASNVIASLTCGCRFEYDDPRFLRLLDLAQKGLKEELGFL*E bad boundary
(0) MLNVVPLLLRIPGLAGKVLRSQKAFLTQLDELLTEHRMIWDPA*PPRDLTEAFLAEKEK (0)
AKGNPESSFNDENLRMVVADLFFAGMVTTSITLAWGLLLMILRPDVQ (1)
ARVQQIDNVIGQVW*PEMGDQARMPCTTAVIHEVQRFGDIVPLGVTHMTSRDIEVQGFRIPK (0)
GMMLFTNLSSVLKDEAVWEKPFRFHPEHFLDAQGHFVKLEAFLPFSA (1)
GRRACLGEPLARIELFLFFTSLLQHFSFSVPTGQPRPSHSRVVGFLVTPSPYELCAVPR

>CYP2D8P2 = CYP2D7BP or CYP2D8BP X58468
1622 MGLEALVPLAVIVAIFLLLVDLMHRRQRWAARYSPGPLPLPGLGNLLHVDFQNTPYCFDQ 1801
2504 LRRRFGDVFSLQLAWTPVVVLNGLAAVREAMVTRGEDTADRPPAPIYQVLGFGPRSQ 2674
3202 GVILSRYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFADQA 3354
3443 GRPFRPNGLLDKAVSNVIASLTCGRRFEYDDPRFLRLLDLAQ 3568 (fs) 3570 EGSKEESGFLRE 3605
4031 VLNAVPVLPHIPALAGKVLRFQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAKKEK 4207
4400 AKGSPESSFNDENLRIVVGNLFLAGMVTTLTTLAW 4504 (fs) 4504 GLLLMILHLDVQ 4539
4734 VRVQQEIDDVIGQVRRPEMGDQAHMPYTTAVIHEVQHFGDIVPLGVTHMTSRDIEVQGFRIPK 4922
5377 GTTLITNLSSVLKDEAVWEKPFRFHPEHFLDAQGHFVKPEAFLPFSA 5517
5616 GRRACLGEPLARMELFLFFTSLLQHFSFSVAAGQPRPSHSRVVSFLVTPSPYELCAVPR* 5795

>CYP2E1 J02843
MSALGVTVALLVWAAFLLLVSMWRQVHSSWNLPPGPFPLPIIGN
LFQLELKNIPKSFTRLAQRFGPVFTLYVGSQRMVVMHGYKAVKEALLDYKDEFSGRGD
LPAFHAHRDRGIIFNNGPTWKDIRRFSLTTLRNYGMGKQGNESRIQREAHFLLEALRK
TQGQPFDPTFLIGCAPCNVIADILFRKHFDYNDEKFLRLMYLFNENFHLLSTPWLQLY
NNFPSFLHYLPGSHRKVIKNVAEVKEYVSERVKEHHQSLDPNCPRDLTDCLLVEMEKE
KHSAERLYTMDGITVTVADLFFAGTETTSTTLRYGLLILMKYPEIEEKLHEEIDRVIG
PSRIPAIKDRQEMPYMDAVVHEIQRFITLVPSNLPHEATRDTIFRGYLIPKGTVVVPT
LDSVLYDNQEFPDPEKFKPEHFLNENGKFKYSDYFKPFSTGKRVCAGEGLARMELFLL
LCAILQHFNLKPLVDPKDIDLSPIHIGFGCIPPRYKLCVIPRS

>CYP2F1 J02906
MDSISTAILLLLLALVCLLLTLSSRDKGKLPPGPRPLSILGNLLLLCSQDMLTSLTK
LSKEYGSMYTVHLGPRRVVVLSGYQAVKEALVDQGEEFSGRGDYPAFFNFTKGN
GIAFSSGDRWKVLRQFSIQILRNFGMGKRSIEERILEEGSFLLADVRKTE
GEPFDPTFVLSRSVSNIICSVLFGSRFDYDDERLLTIIRLINDNFQIMSSPWGE
LYDILDPRFPSLLDWVPGPHQRIFQNFKCLRDLIAHSVHDHQASSPRDFIQCFLTKMA
EEKEDPLSHFHMDTLLMTTHNLLFGGTKTVSTTLHHAFLALMKYPKVQ
ARVQEEIDLVVGRARLPALKDRAAMPYTDAVIHEVQRFADIIPMNLPHRVTRDTAFRGFLIPK
GTDVITLLNTVHYDPSQFLTPQEFNPEHFLDANQSFKKSPAFMPFSA
GRRLCLGELLARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPFQLCLRPR

>CYP2F1 NT_011109.13 + strand
3777610 MDSISTAILLLLLALVCLLLTLSSRDKGKLPPGPRPLSILGNLLLLCSQDMLTSLTK 3777780
3777876 LSKEYGSMYTVHLGPRRVVVLSGYQAVKEALVDQGEEFSGRGDYPAFFNFTKGN 3778037
3781767 GIAFSSGDRWKVLRQFSIQILRNFGMGKRSIEERILEEGSFLLAELRKTE 3781916
3782878 GEPFDPTFVLSRSVSNIICSVLFGSRFDYDDERLLTIIRLINDNFQIMSSPWGE 3783039
3783378 LYDIFPSLLDWVPGPHQRIFQNFKCLRDLIAHSVHDHQASLDPRSPRDFIQCFLTKMA 3783551
3784240 EEKEDPLSHFHMDTLLMTTHNLLFGGTKTVSTTLHHAFLALMKYPKVQ 3784383
3786139 ARVQEEIDLVVGRARLPALKDRAAMPYTDAVIHEVQRFADIIPMNLPHRVTRDTAFRGFLIPK 3786327
3786914 GTDVITLLNTVHYDPSQFLTPQEFNPEHFLDANQSFKKSPAFMPFSA 3787054
3789321 GRRLCLGESLARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPFQLCLRPR 3789500

>CYP2F1P AC008537.3 93% identical to 2F1
GEPFDPTFVLSRSRSNIICSVLFGSRFDYDDERLLTIIRLINDNFQIMSSPWGE
LYDIFPSLLNWVPGPHQRIFQNFKCLRDLIAHSVHDHQASLDPRSPRDFIHCFLTKMAE
KKEDPLSHFHMDTLLMTTHNLLFGGTETVGTTLRHAFLAFMKYPKVQ
AHVQEEINLVVGHVRLPALKDRAAMPYTDMVIHEVQRFADIIPMNLPHRITRDTAFHGFLIPK
GTDVITLLNTVHYDPSQFLTPQEFNPEHFLDANQSFKKSPAFMPFSAG
HRLCLGESLARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPFQLCLHPR

>CYP2F1P NT_011109.13 - strand 93% identical to 2F1
3488193 GEPFDPTFVLSRSRSNIICSVLFGSRFDYDDERLLTIIRLINDNFQIMSSPWGE 3488032
3487723 LYDIFPSLLNWVPGPHQRIFQNFKCLRDLIAHSVHDHQASLDPRSPRDFIHCFLTKMAE 3487517
3486682 KKEDPLSHFHMDTLLMTTHNLLFGGTETVGTTLRHAFLAFMKYPKVQ 3486542
3483821 AHVQEEINLVVGHVRLPALKDRAAMPYTDMVIHEVQRFADIIPMNLPHRITRDTAFHGFLIPK 3483633
3483046 GTDVITLLNTVHYDPSQFLTPQEFNPEHFLDANQSFKKSPAFMPFSAG 3482903
3480318 HRLCLGESLARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPFQLCLHPR 3480142

>CYP2G1P AC008537 missing exons 4, 5 and 6 
MELGGAVTIFLALRLSCLLILIAWKRMDKAGKLPPGPTPILFLGHLLQVRTDATFQSFMK
LREKYSPVFTVYMGPRPVVVLCGHEAVKEALIDQADEFSGRGELASIKQNFQGHG
VALANGERWRILRRFSLTILRDFGMGKQSIKERIQEEASYLLEEFQKTK
AKIHEEINQVIGPHRLPRVDDRVKMPYTDVVIHEIQRLVDIVPMGVPHNIIQDTQFRGYLLPK
GTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQGRFKKNEAFVPFSS
GRRICLGEAMARMELFLYFTSTLQNFSLCSLVPLVDIDITPKLSGFGNITPTYELCLVAR

>CYP2G1P NT_011109.13 + strand missing exons 4, 5 and 6
3552640 MELGGAVTIFLALRLSCLLILIAWKRMDKAGKLPPGPTPILFLGHLLQVRTDATFQSFMK 3552819
3553412 LREKYSPVFTVYMGP 3553456
3553456 RPVVVLCGHEAVKEALIDQADEFSGRGELASIKQNFQGH 3553572
3555264 GVALANGERWRILRRFSLTILRDFGMGKQSIKERIQEEASYLLEEFQKTK 3555413
3559558 AKIHEEINQVIGPHRLPRVDDRVKMPYTDVVIHEIQRLVDIVPMGVPHNIIQDTQFRGYLLPK 3559746
3559858 GTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQGRFKKNEAFVPFSS 3559098
3561394 GKRICLGEAMARMELFLYFTSTLQNFSLCSLVPLVDIDITPKLSGFGNITPTYELCLVAR 3561573

>CYP2G2P AC008962 comp(28700-40696) seq of gene has two in frame stop codons
MEMGGAVTIFLALCLSCLLILIAWK*MNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK (0)
LREKYSPVFTVYMGPRPVVVLCGHEAVKEALVDQADEFSGRGELASIKQNFQGHG (1)
VALANGERWRIL*RFPLTILRDFGMGKRSIEERIQEEASYLLEEFRKTK (1)
GAPIDPIFLLSRTVSNVISSVVFRSRFDYEDKQFLNLLRLINESFIEMSTPWAQ (0)
LYDMYSGIMQYLPGRHNLIYYLVEELKDFIASRVKINEASFDPQNPRDFIDCFLIKMHQ (0)
DKNNPRTEFNLKNLVLTTLNLFFAGTETVSSTLRYGFLLLMKHPEVE (1)
AKIHEEINQVIGPHRLPRVDDRVKMPYTDAVIHEIQRLVDIVPMGVPHNLIRDTQFRGYLLPK (0)
GTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQGRFKKNEAFVPFSS (1)
GKRICLGEAMDRMELFLYFTSTLQNFSLHSLVPPVDIDITPKLSGFGNIPPTYELCLVAR*

>CYP2G2P NT_011109.13 - strand
3723865 MEMGGAVTIFLALCLSCLLILIAWK*MNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK 3723686
3723093 LREKYSPVFTVYMGPRPVVVLCGHEAVKEALVDQADEFSGRGELASIKQNFQGH 3722932
3720860 GVALANGERWRIL*RFPLTILRDFGMGKRSIEERIQEEASYLLEEFRKTK 3720711
3719028 GAPIDPIFLLSRTVSNVISSVVFRSRFDYEDKQFLNLLRLINESFIEMSTPWAQ 3718867
3718287 LYDMYSGIMQYLPGRHNLIYYLVEELKDFIASRVKINEASFDPQNPRDFIDCFLIKMH 3718114
3716767 QDKNNPRTEFNLKNLVLTTLNLFFAGTETVSSTLRYGFLLLMKHPEVE 3716624
3713886 AKIHEEINQVIGPHRLPRVDDRVKMPYTDAVIHEIQRLVDIVPMGVPHNLIRDTQFRGYLLPK 3713698
3713586 GTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQGRFKKNEAFVPFSS 3713446
3712052 GKRICLGEAMDRMELFLYFTSTLQNFSLHSLVPPVDIDITPKLSGFGNIPPTYELCLVAR* 3711870

>CYP2J2 NM_000775 chr 1 NT_032977.5 (build 33)
12841329 MLAAMGSLAAALWAVVHPRTLLLGTVAFLLAADFLKRRRPKNYPPGPWRLPFLGNFFLVDFEQSHLEVQL 12841120
12830683 FVKKYGNLFSLELGDISAVLITGLPLIKEALIHMDQNFGNRPVTPMREHIFKKN 12830522
12826895 GLIMSSGQAWKEQRRFTLTALRNFGLGKKSLEERIQEEAQHLTEAIKEEN 12826746
12826352 GQPFDPHFKINNAVSNIICSITFGERFEYQDSWFQQLLKLLDEVTYLEASKTCQ 12826191
12824543 LYNVFPWIMKFLPGPHQTLFSNWKKLKLFVSHMIDKHRKDWNPAETRDFIDAYLKEMSK 12824367
12822510 HTGNPTSSFHEENLICSTLDLFFAGTETTSTTLRWALLYMALYPEIQ 12822370
12819642 EKVQVEIDRVIGQGQQPSTAARESMPYTNAVIHEVQRMGNIIPQNVPREVTVDTTLAGYHLPK 12819454
12815686 GTMILTNLTALHRDPTEWATPDTFNPDHFLENGQFKKREAFMPFSI 12815549
12808413 GKRACLGEQLARTELFIFFTSLMQKFTFRPPNNEKLSLKFRMGITISPVSHRLC 12808252

>Chr 8 CYP2J pseudogene 97% to 2J2 AC009678.4 two stop codons missing C-term
probably error in seq but may be real pseudogene
Note this seq not found in build 33 and intron length is same as 2J2
Conclusion is this is a mismapped seq error version of 2J2
78907 MLAAMGSLAAALWAVVHPRTLLLGTVAFLLAADFLKRRRPKNYPPGPWRLPFLGNFFLVD 79086
79087 FEQSHLEVQL 79116
47234 FVKKYGNLFSLELGDISAVLITGLPLIKEALIHMDQNFGNRPVTPMREHIFKKN 47395
51022 GLIMSSGQAWKEQRRFTLTALRNFGL*KKSLXERIQEEGPHLP*SIKKEN 51171
71202 GQPFDPHFKINNAVSNIICSITFGERFEYQDSWFQQLLKLLDEVTYLEASKTCQ 71041
69393 LYNVFPWIMKFLPGPHQTLFSNWKKLKLFVSHMIDKHRKDWNPAETRDFIDAYLKEMSK 69217
67360 HTGNPTSSFHEENLICSTLDLFFAGTETTSTTLRWALLYMALYPEIQ 67220
64491 EKVQAEIDRVIGQGQQPSTAARESMPYTNAVIHEVQRMGNIIPLNVPREVTVDTTL 64324
64323 AGYHLPK 64303

>CYP2R1 Mikael Oscarson AC018795.4 also AC025730 AC025748 EST AA663042
5 exons (not the usual 9 seen in most CYP2s) last three introns in same place as 
other CYP2s, intron 2 is one amino acid farther upstream of WXXXR motif 
This represents insertion of one aa between the intron and the WXXXR motif.
Sequence shown is from AC090835.6 Homo sapiens chromosome 11
Intron 1 boundary supported by a mouse EST BB614913
127663 MWKLWRAEEGAAALGGALFLLLFALGVRQLLKQRRPMGFPPGPPGLPFIGNIYSLAASSE 127842
127843 LPHVYMRKQSQVYGE 127887 (0)
133951 IFSLDLGGISTVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMG 134091 (1)
139099 GLLNSRYGRGWVDHRRLAVNSFRYFGYGQKSFESKILEETKFFNDAIETYKGRPFDFKQL 139278
139279 ITNAVSNITNLIIFGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGK 139458
139459 HQQLFRNAAVVYDFLSRLIEKASVNRKPQLPQHFVDAYLDEMDQGKNDPSSTFSKENLIF 139638
139639 SVGELIIAGTETTTNVLRWAILFMALYPNIQ 139731 (1)
140424 GQVQKEIDLIMGPNGKPSWDDKCKMPYTEAVLHEVLRFCNIVPLGIFHATSEDAVVR 140594
140595 GYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDSSGYFAKKEALVPFSL 140753 (1)
141570 GRRHCLGEHLARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMTLQPQPYLICAERR* 141746

>CYP2S1 AC011510 one exon per line 78% to mouse 2s1 49% to 2B6 47% to 2A13
MEATGTWALLLALALLLLLTLALSGTRARGHLPPGPTPLPLLGNLLQLRPGALYSGLMR
LSKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGH
GVFFSNGERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTE
GRPFDPSLLLAQATSNVVCSLLFGLRFSYEDKEFQAVVRAAGGTLLGVSSQGGQ
TYEMFSWFLRPLPGPHKQLLHHVSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQ
EEQNPGTEFTNKNMLMTVIYLLFAGTMTVSTTVGYTLLLLMKYPHVQ
KWVREELNRELGAGQAPSLGDRTRLPYTDAVLHEAQRLLALVPMGIPRTLMRTTRFRGYTLPQ
GTEVFPLLGSILHEPNIFKHPEEFNPDRFLDADGRFRKHEAFLPFSL
GKRVCLGEGLAKAEVFLFFTTILQAFSLESPCPPDTLSLKPTVSGLFNIPPAFQLQVRPTDLHSTTQTR

>CYP2S1 NT_011109.13 + strand
3854686 MEATGTWALLLALALLLLLTLALSGTRARGHLPPGPTPLPLLGNLLQLRPGALYSGLMR 3854862
3855965 LSKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGH 3856129
3859199 GVFFSNGERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTE 3859348
3859882 GRPFDPSLLLAQATSNVVCSLLFGLRFSYEDKEFQAVVRAAGGTLLGVSSQGGQ 3860043
3860130 TYEMFSWFLRPLPGPHKQLLHHVSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQ 3860309
3862652 EEQNPGTEFTNKNMLMTVIYLLFAGTMTVSTTVGYTLLLLMKYPHVQ 3862792
3864870 KWVREELNRELGAGQAPSLGDRTRLPYTDAVLHEAQRLLALVPMGIPRTLMRTTRFRGYTLPQ 3865058
3867379 GTEVFPLLGSILHDPNIFKHPEEFNPDRFLDADGRFRKHEAFLPFSL 3867519
3867700 GKRVCLGEGLAKAELFLFFTTILQAFSLESPCPPDTLSLKPTVSGLFNIPPAFQLQVRPTDLHSTTQTR 3867906

>CYP2T2P AC008537
MXAGIAALLLWLLVLAPAWWG*GGC
RAQMRGSLPPRPRPLPLLGNLQLQSGGLDRALHS
LSGRWGRVFTVRLGPRPAVGLCGYAALRDALVLQADAVSGRGSMAVFERFTRGN
GILFSNRPCWWTLRNFALGALKKFGLGTRTVEARVLEEAACLLDEFQATI
GAPFDPVRLLDNAVSNVICS
LVFGNRYRYGDPEFLRLLNLFSDNFCIISSRWGE
SLMDWLPGPHHRIFRNFSELRVISEQIQRHWQMRQPAEPRDFIDCLTRWVRHG
QQDPESHFQE*TSVMTTHFFFGVTETTSTTLCYGLLILLKYLEVA
AKVQELDPVVGWRPAPSLDYRVCLPYANAVLLEIQCFISVVPLGLPRTLTLDTHLHSHCLPK
GTFVIPLLVTAHRDPTQFKDPDCFNPTNFLDKGKFQGNDAFMPFA
PAKQMCLGTGLAHSGIFLFLTATLQRFCLLPVVRPGTINLT
QCTGLGSVPPDFQLQPVAC 

>CYP2T2P NT_011109.13 - strand
3473517 MXAGIAALLLWLLVLAPAWWG*GGCRAQMRGSLPPRPRPLPLLGNLQLQSGGLDRALHS 3473342
3472811 LSGRWGRVFTVRLGPRPAVGLCGYAALRDALVLQADAVSGRGSMAVFERFTRGN 3472650
3472232 GILFSNRPCWWTLRNFALGALKKFGLGTRTVEARVLEEAACLLDEFQATI 3472083
3471917 GAPFDPVRLLDNAVSNVICS 3471858
3471859 LVFGNRYRYGDPEFLRLLNLFSDNFCIISSRWGE 3471758
3471525 SLMDWLPGPHHRIFRNFSELRVISEQIQRHWQMRQPAEPRDFIDCLTRWVRHG 3471367
3471285 QQDPESHFQE*TSVMTTHFFFGVTETTSTTLCYGLLILLKYLEVA 3471151
3471054 AKVQELDPVVGWRPAPSLDYRVCLPYANAVLLEIQCFISVVPLGLPRTLTLDTHLHSHCLPK 3470869
3470345 GTFVIPLLVTAHRDPTQFKDPDCFNPTNFLDKGKFQGNDAFMPFA 3470211
3470132 PAKQMCLGTGLAHSGIFLFLTATLQRFCLLPVVRPGTINLT 3470010
3470007 QCTGLGSVPPDFQLQPVAC 3469951

>CYP2T3P AC008962 C-terminal missing
RAQMRGSLPPRPRLLPLLGNLQLQSGGLDRALHS
LSGRWGRVFTVRLGPRPAVVLCGYAALRDALVLQADAVSGRGSMAVFERFTRGN
GILFSNRPCWWTLRNFALGAPKKFGLDTRTIEARVLDEAACLLDEFQATI
GAPFDPVRLLGNAVSSVTCSCLREPLWLRGPGVPEVLNLLSDNFRIMSSKWGE probable frameshift here
SLMDWLPGPHHQIFQNFSELQVFISEQIQQHWHMRQPAEPRDFIDCLARWVRHG
QQDPESHFQEETLVMTMQLFFFFFFGGTETTSTTLC
AKGQELDPVVGQRPVPSPD
DHVQWPYTNAVLLEIQRFISVVPRTLTLDTHLHSHCLAKG

>CYP2T3P NT_011109.13 + strand
3796143 MIAGIAALLLWLLVLVPAWWG*GGCRAQMRGSLPPRPRLLPLLGNLQLQSGGLDRALHS 3796319
3796847 LSGRWGRVFTVRLGPRPAVVLCGYAALRDALVLQADAVSGRGSMAVFERFTRGN 3797008
3797426 GILFSNRPCWWTLRNFALGAPKKFGLDTRTIEARVLDEAACLLDEFQATI 3797575
3797741 GAPFDPVRLLGNAVSSVTCSCLREPLWLRGPGVPEVLNLLSDNFRIMSSKWGE 3797899
3798106 SLMDWLPGPHHQIFQNFSELQVFISEQIQQHWHMRQPAEPRDFIDCLARWVRHG 3798267
3798346 KQQDPESHFQEETLVMTMQLFFFFFFGGTETTSTTLC 3798456
3798461 YGLLILLKYPEVA 3798499
3798589 AKGQELDPVVGQRPVPSPDDHVQWPYTNAVLLEIQRFISVVPRTLTLDTHLHSHCLAKG 3798765

>CYP2U1 AC096564.3 missing last exon off end of clone 58% to 2U1 fugu, 78% to 2U1 mouse
last exon from AC000016.1 (AC025090 old number deleted, replaced by AC096564), 
only 5 exons not like other CYP2s that have 9 exons
intron 1 same as CYP2s intron 2, intron 2 like CYP2s intron 6
intron 3 is in a new location, intron 4 is the same as CYP2s intron 8
introns 1, 2 and 4 in same location as CYP2R1, intron 3 is at a unique site with a gc boundary (verified by ESTs BX354123, BX498753)
145089 MSSPGPSQPPAEDPPWPARLLRAPLGLLRLDPSGGALLLCGLVALLGWSWLRRRRARGI
       PPGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVIGPQVLLAHLARVYGSI 
       FSFFIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVTKEK (1) 145577
158414 GVVFAHYGPVWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPF 
       SIISNAVSNIICSLCFGQRFDYTNSEFKKMLGFMSRGLEICLNSQVLLVNICPWLYYLPF 
       GPFKELRQIEKDITSFLKKIIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEE 
       YLFYIIGDLFIAGTDTTTNSLLWCLLYMSLNPDVQ (1) 159049
160820 EKVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLAIPHMTSENT (1 gc boundary) 160981
162974 VLQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETFIPFGI (1) 162961
142740 GKRVCMGEQLAKMELFLMFVSLMQSFAFALPEDSKKPLLTGRFGLTLAPHPFNITISRR* 142561

>CYP2W1 AC073957.7 chromosome 7 clone RP11-449P15 40% to 2F1
104272 MALLLLLFLGLLGLWGLLCACAQDPSPAARWPPGPRPLPLVGNLHLLRLSQQDRSLME 104445 (0)
105472 LSERYGPVFTVHLGRQKTVVLTGFEAVKEALAGPGQELADRPPIAIFQLIQRGG 105633 (1)
106009 GIFFSSGARWRAARQFTVRALHSLGVGREPVADKILQELKCLSGQLDGYR 106158 (1)
106225 GRPFPLALLGWAPSNITFALLFGRRFDYRDPVFVSLLGLIDEVMVLLGSPGLQ 106383 (0)
107684 LFNVYPWLGALLQLHRPVLRKIEEVRAILRTLLEARRPHVCPGDPVCSYVDALIQQGQ 107857 (0)
108167 GDDPEGLFAEANAVACTLDMVMAGTETTSATLQWAALLMGRHPDVQ 108304 (1)
108406 GRVQEELDRVLGPGRTPRLEDQQALPYTSAVLHEVQRFITLLPHVPRCTAADTQLGGFLLPK 108591 (0)
109337 GTPVIPLLTSVLLDETQWQTPGQFNPGHFLDANGHFVKREAFLPFSA 109477 (1)
109694 GRRVCVGERLARTELFLLFAGLLQRYRLLPPPGVSPASLDTTPARAFTMRPRAQALCAVPRP* 109882

>CYP2AB1P 
2D31P NT_022676.10|Hs3_22832 chromosome 3 2D6 pseudogene fragment I-helix
chr3q27.1 185030751-185015757 - strand build 33
899650 NQENLV*VVIDLFLGGTDTTATTLCWALIHMIQHGAVQ 899537
NT_005962.297 (genescan predicted protein has errors) 75% to 2ab1 mouse
MLSLLSGLALLAISFLLLKLGTFCWDRSCLPPGPLPFPILGNLWQLCFQLHPETLLQ
LAQSVFTVWVGPIPVAVLSGFQVVKEALVSNSEQFSGRSLTPLFQDLFGER
GIICSSGHTWRQKRRFCLVMI*GLGL
GKLALEVQLQKEAAELAEAFRQEQGRPFDPQVSIVRST
VRVIGALVFGHHFLLEDPIFQELTQAIDFGLAFVSTVWRRLYDVFPWALC
HLPGPHQEIFRYQEVVLSLIHQEITRHKLRAPEAPRDFISCYLAQISK 
AMDDPVSTFNQENLV*VVIDLFLGGTDTTATTLCWALIHMIQHGAVQG
TVQLELDEVLGAAPVVCYEDRKRLPYTX
AVLHDVQRLSSVMAMGAVRQCVTSTRVCSYPVSK
GTIILPNLASVLYDPECWETPRQFNPGHFSDKDGNFVANEAFLPFSAGHRVYPAD
QLAQMELFLMFATLLRTFRFQLPEGSPGLKLEYIFGGTWQPQPQEICAVPR

>CYP2AC1P human = old CYP2C57P AC022650 41% to 2C9 possible pseudogene 3 in frame stops 
chr6p12.3 49549276-49539652 including 5 exons build 33 - strand
62% to 2ac1 mouse
note exon 1 in humans is on chr 13, but exon 3-9 are on chr 6
in chimp exon 1 is on chr 14, but exon 3 is on chr5
cannot find exon 2 in human, chimp or macaque.  Probably lost in chromosome rearrangement
1 MNELDASAILPILVLILIFILNIKKFMTKASK*LSPPGPRPLLVIGNLYFLNLKRPYQTMLE

3 GIAFSHGETWKTMRRFSLTTLRNFGMGEWIIEDTIIEECQNLIQNFEFHR
4 GKPFEIKTIMNASVAKIIVLVLLGKWFDYQDSQFLRLLALTGENVKFIGGLRIAVT 
5 LFNMFLVLGFLLKSHKTILRNRDELFSFIRMAFLDHHHKLDKNDPRNFTDVFLVTQQE
6 ENDTFADYFSDKKLVTLVNNLFTTGTETTASTLHWGILLVMRYPEVQ
7 SKVHNEITKVVVSAQS*LAHRTQMTHTDAVI*EVQRFANILPTSLSHATTTNIFKNYCIPK
8 GTEVIILLASVARDQAQWEKPDTFNPEHFLNSKEKFIKREAFLPFQW
9 GRRMCAGESFARKELFLFFTSLLQKFTFQPLPGVSHLDLDLSLDVGFTT

>CYP3A4 J04449
MAVIPDLAMETWLLLAVSLVLLYL
YGTHSHGLFKKLGIPGPTPLPFLGNILSYHK
GFCMFDMECHKKYGKVCG
FYDGQQPVLAITDPDMIKTVLVKECYSVFTNRR
PFGPVGFMKSAISIAEDEEWKRLRSLLSPTFTSGKLKE
MVPIIAQYGDVLVRNLRREAETGKPVTLKD
VFGAYSMDVITSTSFGVNIDSLNNPQDPFVENTKKLLRFDFLDPFFLS
IIFPFLIPILEVLNICVFPREVTNFLRKSVKRMKESRLEDTQK
HRVDFLQLMIDSQNSKETESHK
ALSDLELVAQSIIFIFAGYETTSSVLSFIMYELATHPDVQQKLQEEIDAVLPNK
APPTYDTVLQMEYLDMVVNETLRLFPIAMRLERVCKKDVEINGMFIPKGWVVMIPSYALHRDPKYWTEPEKFLPER
FSKKNKDNIDPYTYTPFGSGPRNCIGMRFALMNMKLALIRVLQNFSFKPCKETQ
IPLKLSLGGLLQPEKPVVLKVESRDGTVSGA

>CYP3A4 NT_007933.10|Hs7_8090
24615206 MALIPDLAMETWLLLAVSLVLLYL 24615135
24614522   LIPNLAVETWLLL 24614484 alt exon 1 (partial) CYP3A4-ie1b
24611209 YGTHSHGLFKKLGIPGPTPLPFLGNILSYHK 24611117
24609205 GFCMFDMECHKKYGKVWG 24609152
24603813 FYDGQQPVLAITDPDMIKTVLVKECYSVFTNRR 24603715
24601360 PFGPVGFMKSAISIAEDEEWKRLRSLLSPTFTSGKLKE 24601247
24600981 MVPIIAQYGDVLVRNLRREAETGKPVTLKD 24600892
24599626 VFGAYSMDVITSTSFGVNIDSLNNPQDPFVENTKKLLRFDFLDPFFLS 24599483
24598387 TAVFPFLIPILEVLNICVFPREVTNFLRKSVKRMKESRLEDTQK 24598256
24597568 HRVDFLQLMIDSQNSKETESHK 24597503
24595141 ALSDLELVAQSIIFIFAGYETTSSVLSFIMYELATHPDVQQKLQEEIDAVLPNK 24594980
24593392 APPTYDTVLQMEYLDMVVNETLRLFPIAMRLERVCKKDVEINGMFIPKGVVVMIPSYALHRDPKYWTEPEKFLPER 24593165
24592105 FSKKNKDNIDPYIYTPFGSGPRNCIGMRFALMNMKLALIRVLQNFSFKPCKETQ 24591944
24589353 IPLKLSLGGLLQPEKPVVLKVESRDGTVSGA 24589261

>CYP3A4-ie1b NT_007933.10|Hs7_8090
24614522   LIPNLAVETWLLL 24614484 alt exon 1 (partial)

>CYP3A5-de1b2b NT_007933.10|Hs7_8090  ALT EXONS 1 AND 2 FOR CYP3A5
24531159 MDLIPNLAVETWLLLAVSLVLLYL 24531088
24526963 YGAHSHGLFKKLGIPGPTPLLFLGTTLSYHQ 24526871

>CYP3A5-de1b2b NT_033964.1|Hs7_34119 Homo sapiens chromosome 7 
alt exon 1 and 2 upstream of 3A5
239409 MDLIPNLAVETWLLLAVSLVLLYL + ex 1
243605 YGAHSHGLFKKLGIPGPTPLLFLGTTLSYHQ 243697 + ex2

>CYP3A5_v1 NM_000777
MDLIPNLAVETWLLLAVSLVLLYL
YGTRTHGLFKRLGIPGPTPLPLLGNVLSYRQ
GLWKFDTECYKKYGKMWG
TYEGQLPVLAITDPDVIRTVLVKECYSVFTNRR
SLGPVGFMKSAISLAEDEEWKRIRSLLSPTFTSGKLKE
MFPIIAQYGDVLVRNLRREAEKGKPVTLKD
IFGAYSMDVITGTSFGVNIDSLNNPQDPFVESTKKFLKFGFLDPLFLS
IILFPFLTPVFEALNVSLFPKDTINFLSKSVNRMKKSRLNDKQK
HRLDFLQLMIDSQNSKETESHK
ALSDLELAAQSIIFIFAGYETTSSVLSFTLYELATHPDVQQKLQKEIDAVLPNK
APPTYDAVVQMEYLDMVVNETLRLFPVAIRLERTCKKDVEINGVFIPKGSMVVIPTYALHHDPKYWTEPEEFRPER
FSKKKDSIDPYIYTPFGTGPRNCIGMRFALMNMKLALIRVLQNFSFKPCKETQ
IPLKLDTQGLLQPEKPIVLKVDSRDGTLSGE

>CYP3A5-de13c 3A51P NT_033964.1|Hs7_34119 chromosome 7 solo exon 13 upstream of 3A5
254205 QIPLKLRLGGLLQPEKPIVLKVESRDGTVSG 254297 ex13 

>CYP3A5-de13c NT_007933.10|Hs7_8090  EXON 13 
24516360 IPLKLRLGGLLQPEKPIVLKVESRDGTVSG 24516271

>CYP3A5_v1 NT_007933.10|Hs7_8090
24511021 MDLIPNLAVETWLLLAVSLVLLYL 24510950
24507332 YGTRTHGLFKRLGIPGPTPLPLLGNVLSYRQ 24507240
24505710 GLWKFDTECYKKYGKMWG 24505657
24503803 TYEGQLPVLAITDPDVIRTVLVKECYSVFTNRR 24503705
24498184 XXGPVGFMKSAISLAEDEEWKRIRSLLSPTFTSGKLKE 24498077
24497814 MFPIIAQYGDVLVRNLRREAEKGKPVTLKD 24497725
24496438 IFGAYSMDVITGTSFGVNIDSLNNPQDPFVESTKKFLKFGFLDPLFLS 24496295
24495221 TVLFPFLTPVFEALNVSLFPKDTINFLSKSVNRMKKSRLNDKQK 24495093
24494007 HRLDFLQLMIDSQNSKETESHK 24493942
24491785 ALSDLELAAQSIIFIFAGYETTSSVLSFTLYELATHPDVQQKLQKEIDAVLPNK 24491624
24483904 APPTYDAVVQMEYLDMVVNETLRLFPVAIRLERTCKKDVEINGVFIPKGSMVVIPTYALHHDPKYWTEPEEFRPER 24483677
24481356 FSKKKDSIDPYIYTPFGTGPRNCIGMRFALMNMKLALIRVLQNFSFKPCKETQ 24481198
24479525 IPLKLDTQGLLQPEKPIVLKVDSRDGTLSGE 24479433

>CYP3A5_v2 = CYP3A5P1 L26985 cDNA these are derived from the 3A5 gene by 
incomplete processing exons 6, 11, 12, 13 deleted, three new exons present
gene model exons 1a2a3a14n4a15n5a16n delta6a 7a8a9a10ax delta11a delta12a delta13a
14n, 15n and 16n represent new cryptic exons 10ax means exon 10a is extended into 
the intron. This is an alternative transcript not a pseudogene
MDLIPNLAVETWLLLAVSLVLLYLYGTRTHGLFKRLGIPGPTPLPLLGNVLSYRQGLWKFDTECYKKYGKMW
SSLFGPHYPSSYEALGGSCVRLLLCVTP**TRT*GCCVSYN* cryptic exon 14n
GTYEGQLPVLAITDPDVIRTVLVKECYSVFTNRR 
ICATTSTIKMQTHSVTMWLPPAVLQSQHGVCLFL*Q cryptic exon 15n
SLGPVGFMKSAISLAEDEEWKRIRSLLSPTFTSGKLKE
KRHHKIHYKMSLTAPCWRKPYPSGT*VCTFNYS cryptic exon 16n, normal exon 6 missing
IFGAYSMDVITGTSFGVNIDSLNNPQDPFVESTKKFLKFGFLDPLFLS
IILFPFLTPVFEALNVSLFPKDTINFLSKSVNRMKKSRLNDKQKHRLDFLQLMIDSQNSK
ETESHKALSDLELAAQSIIFIFAGYETTSSVLSFTLYELATHPDVQQKLQKEIDAVLPNK
Intron after exon 10 is continued in this mRNA. Exons 11,12,13 missing

Query:    72 MDLIPNLAVETWLLLAVSLVLLYLYGTRTHGLFKRLGIPGPTPLPLLGNVLSYRQGLWKF 251
             MDLIPNLAVETWLLLAVSLVLLYLYGTRTHGLFKRLGIPGPTPLPLLGNVLSYRQGLWKF
Sbjct:     1 MDLIPNLAVETWLLLAVSLVLLYLYGTRTHGLFKRLGIPGPTPLPLLGNVLSYRQGLWKF 60

Query:   252 DTECYKKYGKMW 287 ex1,2,3
             DTECYKKYGKMW
Sbjct:    61 DTECYKKYGKMW 72

Retains part of intron here 133bp new exon

Query:   420 GTYEGQLPVLAITDPDVIRTVLVKECYSVFTNRR 521 ex4
             GTYEGQLPVLAITDPDVIRTVLVKECYSVFTNRR
Sbjct:    73 GTYEGQLPVLAITDPDVIRTVLVKECYSVFTNRR 106

Retains part of intron here 109bp new exon

Query:   630 SLGPVGFMKSAISLAEDEEWKRIRSLLSPTFTSGKLKEKRHHKIHYKMSLTAPCWRKPY 806 ex5-10 + intron
             SLGPVGFMKSAISLAEDEEWKRIRSLLSPTFTSGKLKEKRHHKIHYKMSLTAPCWRKPY
Sbjct:   107 SLGPVGFMKSAISLAEDEEWKRIRSLLSPTFTSGKLKEKRHHKIHYKMSLTAPCWRKPY 165

Query:   807 PSGT*VCTFNYSIFGAYSMDVITGTSFGVNIDSLNNPQDPFVESTKKFLKFGFLDPLFLS 986
             PSGT*VCTFNYSIFGAYSMDVITGTSFGVNIDSLNNPQDPFVESTKKFLKFGFLDPLFLS
Sbjct:   166 PSGT*VCTFNYSIFGAYSMDVITGTSFGVNIDSLNNPQDPFVESTKKFLKFGFLDPLFLS 225

Query:   987 IILFPFLTPVFEALNVSLFPKDTINFLSKSVNRMKKSRLNDKQKHRLDFLQLMIDSQNSK 1166
             IILFPFLTPVFEALNVSLFPKDTINFLSKSVNRMKKSRLNDKQKHRLDFLQLMIDSQNSK
Sbjct:   226 IILFPFLTPVFEALNVSLFPKDTINFLSKSVNRMKKSRLNDKQKHRLDFLQLMIDSQNSK 285

Query:  1167 ETESHKALSDLELAAQSIIFIFAGYETTSSVLSFTLYELATHPDVQQKLQKEIDAVLPNK 1346
             ETESHKALSDLELAAQSIIFIFAGYETTSSVLSFTLYELATHPDVQQKLQKEIDAVLPNK
Sbjct:   286 ETESHKALSDLELAAQSIIFIFAGYETTSSVLSFTLYELATHPDVQQKLQKEIDAVLPNK 345

>_1
PAQQTAALS*KEDSQNTVEEGKWRWTSSQIWRWKPGFSWLSAWCSSIYMGPVHMDFLRDW
EFQGPHLCLCWEMFCPIVRVSGNLTQSAIKSMEKCGVRLPCLDHITLHHMKPWVAPV*DS
CCVSHPNELEPKVAVCRTTRERMKVNSLCWPSQIPT*SEQC**KNVILSSQIEGFVQRPA
PSRCRPIPSPCGSLLLSYSHNMEFVFFSDSL*AQWDL*KVPSL*LRMKNGREYGHCCLQP
SPAENSRRKDITKFITKCHLLLHAGESHILLGLESAHLTTASLGPTAWM*LLAHHLE*TS
TLSTIHKTPLWRALRSS*NLVS*IHYFSQ*YSFHSLPQFLKH*MSLCFQKIP*IF*VNL*
TE*RKVASTTNKSTD*ISFS**LTPRIRKKLSPTKLCLIWSSQPSQ*SSFLLAMKPPAVF
FPSLYMNWPLTLMSSRNCKRRLMQFCPIR*GDDPWR*REEVKP*QKCLLTTPQENFYKKH
NH*FLH*HNVGSL*GEKQREKHRERLLLAEA*DLCTILLALVHLFTVITIMLS
>_2
QPSKQQHSAKRKTHRTQLKKESGDGPHPKFGGGNLASPGCQPGAPLSIWDPYTWTF*ETG
NSRAHTSAFVGKCFVLSSGSLEI*HRVL*KVWKNVGYVFPVWTTLPFII*SLGWLLCETL
AVCHTLMN*NLRLLCVVQLGNV*RSTPCAGHHRSRRDQNSASERMLFCLHKSKDLCNDQH
HQDADPFRHHVAPSCCPTVTTWSLSFSLTVFRPSGIYEKCHLFS*G*RMEENTVIAVSNL
HQRKTQGEKTSQNSLQNVTYCSMLEKAISFWDLSLHI*LQHLWGLQHGCDYWHIIWSEHR
LSQQSTRPLCGEH*EVPKIWFLRSIISLNNTLSIPYPSF*SIKCLSVSKRYHKFFK*ICK
QNEEKSPQRQTKAPTRFPSADD*LPEFERN*VPQSSV*SGARSPVNNLHFCWL*NHQQCS
FLHFI*TGHSP*CPAETAKGD*CSFAQ*GEGMTPGDEGKR*SLSKNASSPLPRRIFIKSI
ITDSFTDIM*EASEEKNKGRNIENGCYWQKHKIFVQYCWPWFTCLLLSQ*C*
>_3
SPANSSTQLKGRLTEHS*RRKVAMDLIPNLAVETWLLLAVSLVLLYLYGTRTHGLFKRLG
IPGPTPLPLLGNVLSYRQGLWKFDTECYKKYGKMWGTSSLFGPHYPSSYEALGGSCVRLL
LCVTP**TRT*GCCVSYN*GTYEGQLPVLAITDPDVIRTVLVKECYSVFTNRRICATTST
IKMQTHSVTMWLPPAVLQSQHGVCLFL*QSLGPVGFMKSAISLAEDEEWKRIRSLLSPTF
TSGKLKEKRHHKIHYKMSLTAPCWRKPYPSGT*VCTFNYSIFGAYSMDVITGTSFGVNID
SLNNPQDPFVESTKKFLKFGFLDPLFLSIILFPFLTPVFEALNVSLFPKDTINFLSKSVN
RMKKSRLNDKQKHRLDFLQLMIDSQNSKETESHKALSDLELAAQSIIFIFAGYETTSSVL
SFTLYELATHPDVQQKLQKEIDAVLPNKVRG*PLEMKGRGEALAKMPPHHSPGEFL*KA*
SLIPSLT*CRKPLRRKTKGET*RTVATGRSIRSLYNIAGPGSPVYCYHNNAK

NT_007933.11|Hs7_8090

Query: 1        GLWKFDTECYKKYGKMWG 18
                GLWKFDTECYKKYGKMWG
Sbjct: 24505710 GLWKFDTECYKKYGKMWG 24505657

Query: 20       SSLFGPHYPSSYEALGGSCVRLLLCVTP**TRT*GCCVSYN*G 62 extra exon
                SSLFGPHYPSSYEALGGSCVRLLLCVTP**TRT*GCCVSYN*G
Sbjct: 24504036 SSLFGPHYPSSYEALGGSCVRLLLCVTP**TRT*GCCVSYN*G 24503908

Query: 63       TYEGQLPVLAITDPDVIRTVLVKECYSVFTNRRI 96
                TYEGQLPVLAITDPDVIRTVLVKECYSVFTNRR+
Sbjct: 24503803 TYEGQLPVLAITDPDVIRTVLVKECYSVFTNRRV 24503702

Query: 89       SVFTNRRICATTSTIKMQTHSVTMWLPPAVLQSQHGVCLFL*QSL 133 extra exon
                S F++  ICATTSTIKMQTHSVTMWLPPAVLQSQHGVCLFL*Q L
Sbjct: 24503024 SNFSHS*ICATTSTIKMQTHSVTMWLPPAVLQSQHGVCLFL*QVL 24502890

Query: 132      SLGPVGFMKSAISLAEDEEWKRIRSLLSPTFS 163
                SLGPVGFMKSAISLAEDEEWKRIRSLLSPTF+
Sbjct: 24498190 SLGPVGFMKSAISLAEDEEWKRIRSLLSPTFT 24498095

Query: 30       SLGPVGFMKSAISLAEDEEWKRIRSLLSPTFTSGKLKE---------------------- 67
                SLGPVGFMKSAISLAEDEEWKRIRSLLSPTFTSGKLKE                      
Sbjct: 24498190 SLGPVGFMKSAISLAEDEEWKRIRSLLSPTFTSGKLKEV*K*DES*LEM*RMNLGTGRK* 24498011

Query: 68       -------------------------KRHHKIHYKMSLTAPCWRKPYPSGT*VCTFNYSIF 102 extra exon
                                         KRHHKIHYKMSLTAPCWRKPYPSGT*VCTFNY   
Sbjct: 24498010 DHSPFPRGSPLSSSFLKMVFYLYVQKRHHKIHYKMSLTAPCWRKPYPSGT*VCTFNYRY* 24497831

Query: 1        MFPIIAQYGDVLVRNLRREAEKGKPVTLKD 30 exon 6 missing in 3a5p1 and 3a5p2
                MFPIIAQYGDVLVRNLRREAEKGKPVTLK+
Sbjct: 24497814 MFPIIAQYGDVLVRNLRREAEKGKPVTLKE 24497725

Query: 100      SIFGAYSMDVITGTSFGVNID 120
                SIFGAYSMDVITGTSFGVNID
Sbjct: 24496441 SIFGAYSMDVITGTSFGVNID 24496379

Extension of exon 10 into intron
Query: 159      SPTFSFTLYELATHPDVQQKLQKEIDAVLPNKVRG*PLEMKGRGEALAKMPPHHSPGEFL 218
                S   SFTLYELATHPDVQQKLQKEIDAVLPNKVRG*PLEMKGRGEALAKMPPHHSPGEFL
Sbjct: 24491719 SSVLSFTLYELATHPDVQQKLQKEIDAVLPNKVRG*PLEMKGRGEALAKMPPHHSPGEFL 24491540

Query: 219      *KA*SLIPSLT*CRKPLRRKTKGET*RTVATGRSIRSLYNIAGPGSPVYCYHNNAK 274
                *KA*SLIPSLT*CRKPLRRKTKGET*RTVATGRSIRSLYNIAGPGSPVYCYHNNAK
Sbjct: 24491539 *KA*SLIPSLT*CRKPLRRKTKGET*RTVATGRSIRSLYNIAGPGSPVYCYHNNAK 24491372


>CYP3A5_v3 = CYP3A5P2 X90579 these are derived from the 3A5 gene by incomplete processing 
gene model exons 1a2a3a14n4a5a16n delta6a 7a8a9a10ax delta11a delta12a delta13a
same as 3a5p1 except missing exon 15n.  
This is an alternative transcript not a pseudogene

MDLIPNLAVETWLLLAVSLVLLYLYGTRTHGLFKRLGIPGPTPLPLLGNVLSYRQGLWKF
DTECYKKYGKMWG 
TYEGQLPVLAITDPDVIRTVLVKECYSVFTNRRSLGPVGFMKSAISLAEDEEWKRIRSL 
LSPTFTSGKLKEKRHHKIHYKMSLTAPCWRKPYPSGT*VCTFNYSIFGAYSMDVITGTSF
GVNIDSLNNPQDPFVESTKKFLKFGFLDPLFLSIILFPFLTPVFEALNVSLFPKDTINFL
SKSVNRMKKSRLNDKQKHRLDFLQLMIDSQNSKETESHKALSDLELAAQSIIFIFAGYET
TSSVLSFTLYELATHPDVQQKLQKEIDAVLPNK 

>_1
                              GIPSPANSSTQLKGRLTEHS*RRKVAMDLIPNLAVETWLLLAVSLVLLYLYGTRTHGLFK
                              RLGIPGPTPLPLLGNVLSYRQGLWKFDTECYKKYGKMWGISSLFGPHYPSSYEALGGSCV
                              RLLLCVTP**TRT*GCCVSYN*GTYEGQLPVLAITDPDVIRTVLVKECYSVFTNRRSLGP
                              VGFMKSAISLAEDEEWKRIRSLLSPTFTSGKLKEKRHHKIHYKMSLTAPCWRKPYPSGT*
                              VCTFNYSIFGAYSMDVITGTSFGVNIDSLNNPQDPFVESTKKFLKFGFLDPLFLSIILFP
                              FLTPVFEALNVSLFPKDTINFLSKSVNRMKKSRLNDKQKHRLDFLQLMIDSQNSKETESH
                              KALSDLELAAQSIIFIFAGYETTSSVLSFTLYELATHPDVQQKLQKEIDAVLPNKVRG*P
                              LEMKGRGEALAKMPPHHSPGEFL*KA*SLIPSLT*CRKPLRRKTKGET*RTVATGRSIRS
                              LYNIAGPGSPVYCYHNNAK*KKKKKKKKX
                              >_2
                              EFPAQQTAALS*KEDSQNTAEEGKWRWTSSQIWRWKPGFSWLSAWCSSIYMGPVHMDFLR
                              DWEFQGPHLCLCWEMFCPIVRVSGNLTQSAIKSMEKCGVSLPCLDHITLHHMKPWVAPV*
                              DSCCVSHPNELEPKVAVCRTTRERMKVNSLCWPSQIPT*SEQC**KNVILSSQIEGL*AQ
                              WDL*KVPSL*LRMKNGREYGHCCLQPSPAENSRRKDITKFITKCHLLLHAGESHILLGLE
                              SAHLTTASLGPTAWM*LLAHHLE*TSTLSTIHKTPLWRALRSS*NLVS*IHYFSQ*YSFH
                              SLPQFLKH*MSLCFQKIP*IF*VNL*TE*RKVASTTNKSTD*ISFS**LTPRIRKKLSPT
                              KLCLIWSSQPSQ*SSFLLAMKPPAVFFPSLYMNWPLTLMSSRNCKRRLMQFCPIR*GDDP
                              WR*REEVKP*QKCLLTTPQENFYKKHNH*FLH*HNVGSL*GEKQREKHRERLLLAEA*DL
                              CTILLALVHLFTVITIMLSKKKKKKKKK
                              >_3
                              NSQPSKQQHSAKRKTHRTQLKKESGDGPHPKFGGGNLASPGCQPGAPLSIWDPYTWTF*E
                              TGNSRAHTSAFVGKCFVLSSGSLEI*HRVL*KVWKNVGYLFPVWTTLPFII*SLGWLLCE
                              TLAVCHTLMN*NLRLLCVVQLGNV*RSTPCAGHHRSRRDQNSASERMLFCLHKSKVFRPS
                              GIYEKCHLFS*G*RMEENTVIAVSNLHQRKTQGEKTSQNSLQNVTYCSMLEKAISFWDLS
                              LHI*LQHLWGLQHGCDYWHIIWSEHRLSQQSTRPLCGEH*EVPKIWFLRSIISLNNTLSI
                              PYPSF*SIKCLSVSKRYHKFFK*ICKQNEEKSPQRQTKAPTRFPSADD*LPEFERN*VPQ
                              SSV*SGARSPVNNLHFCWL*NHQQCSFLHFI*TGHSP*CPAETAKGD*CSFAQ*GEGMTP
                              GDEGKR*SLSKNASSPLPRRIFIKSIITDSFTDIM*EASEEKNKGRNIENGCYWQKHKIF
                              VQYCWPWFTCLLLSQ*C*VKKKKKKKK

Query:    79 MDLIPNLAVETWLLLAVSLVLLYLYGTRTHGLFKRLGIPGPTPLPLLGNVLSYRQGLWKF 258
             MDLIPNLAVETWLLLAVSLVLLYLYGTRTHGLFKRLGIPGPTPLPLLGNVLSYRQGLWKF
Sbjct:     1 MDLIPNLAVETWLLLAVSLVLLYLYGTRTHGLFKRLGIPGPTPLPLLGNVLSYRQGLWKF 60

Query:   259 DTECYKKYGKMWG 297
             DTECYKKYGKMWG
Sbjct:    61 DTECYKKYGKMWG 73

Query:   427 GTYEGQLPVLAITDPDVIRTVLVKECYSVFTNRRSLGPVGFMKSAISLAEDEEWKRIRSL 606
             GTYEGQLPVLAITDPDVIRTVLVKECYSVFTNRRSLGPVGFMKSAISLAEDEEWKRIRSL
Sbjct:    73 GTYEGQLPVLAITDPDVIRTVLVKECYSVFTNRRSLGPVGFMKSAISLAEDEEWKRIRSL 132

Query:   607 LSPTFTSGKLKEKRHHKIHYKMSLTAPCWRKPYPSGT*VCTFNYSIFGAYSMDVITGTSF 786
             LSPTFTSGKLKE       Y   L     R+    G  V T    IFGAYSMDVITGTSF
Sbjct:   133 LSPTFTSGKLKEMFPIIAQYGDVLVRNL-RREAEKGKPV-TLK-DIFGAYSMDVITGTSF 189

Query:   787 GVNIDSLNNPQDPFVESTKKFLKFGFLDPLFLSIILFPFLTPVFEALNVSLFPKDTINFL 966
             GVNIDSLNNPQDPFVESTKKFLKFGFLDPLFLSIILFPFLTPVFEALNVSLFPKDTINFL
Sbjct:   190 GVNIDSLNNPQDPFVESTKKFLKFGFLDPLFLSIILFPFLTPVFEALNVSLFPKDTINFL 249

Query:   967 SKSVNRMKKSRLNDKQKHRLDFLQLMIDSQNSKETESHKALSDLELAAQSIIFIFAGYET 1146
             SKSVNRMKKSRLNDKQKHRLDFLQLMIDSQNSKETESHKALSDLELAAQSIIFIFAGYET
Sbjct:   250 SKSVNRMKKSRLNDKQKHRLDFLQLMIDSQNSKETESHKALSDLELAAQSIIFIFAGYET 309

Query:  1147 TSSVLSFTLYELATHPDVQQKLQKEIDAVLPNK 1245
             TSSVLSFTLYELATHPDVQQKLQKEIDAVLPNK
Sbjct:   310 TSSVLSFTLYELATHPDVQQKLQKEIDAVLPNK 342

>CYP3A7 NM_000765
MDLIPNLAVETWLLLAVSLILLYL
YGTRTHGLFKKLGIPGPTPLPFLGNALSFRK
GYWTFDMECYKKYRKVWG
IYDCQQPMLAITDPDMIKTVLVKECYSVFTNRR
PFGPVGFMKNAISIAEDEEWKRIRSLLSPTFTSGKLKE
MVPIIAQYGDVLVRNLRREAETGKPVTLKH
VFGAYSMDVITSTSFGVSIDSLNNPQDPFVENTKKLLRFNPLDPFVLS
IKVFPFLTPILEALNITVFPRKVISFLTKSVKQIKEGRLKETQK
HRVDFLQLMIDSQNSKDSETHK
ALSDLELMAQSIIFIFAGYETTSSVLSFIIYELATHPDVQQKVQKEIDTVLPNK
APPTYDTVLQLEYLDMVVNETLRLFPVAMRLERVCKKDVEINGMFIPKGVVVMIPSYVLHHDPKYWTEPEKFLPER
FSKKNKDNIDPYIYTPFGSGPRNCIGMRFALVNMKLALVRVLQNFSFKPCKETQ
IPLKLRFGGLLLTEKPIVLKAESRDETVSGA

>CYP3A7-de1b2b NT_007933.10|Hs7_8090 alt exons 1 and 2 for 3A7
24583613 XDLIPNLAVETWLLLAVSLVLHYL 24583545
24579466 YETHSHGLFKNLGIPGPRPLLFLGTTLSYHQ 24579374

>CYP3A7-de1b2b NT_033964.1|Hs7_34119 chromosome 7 alt exon 1 and 2 upstream of 3A7
186958 xxLIPNLAVETWLLLAVSLVLHYL + ex 1
191087 VSLCRYETHSHGLFKNLGIPGPRPLLFLGTTLSYHQ 191194 + ex 2

>CYP3A7 NT_007933.10|Hs7_8090
24566218 MDLIPNLAVETWLLLAVSLILLYL 24566147
24562276 YGTRTHGLFKKLGIPGPTPLPFLGNALSFRK 24562184
24553473 GYWTFDMECYKKYRKVWG 24553420
24551536 IYDCQQPMLAITDPDMIKTVLVKECYSVFTNRR 24551438
24548764 PFGPVGFMKNAISIAEDEEWKRIRSLLSPTFTSGKLKE 24548651
24548390 MVPIIAQYGDVLVRNLRREAETGKPVTLKH 24548301
24547030 VFGAYSMDVITSTSFGVSIDSLNNPQDPFVENTKKLLRFNPLDPFVLS 24546887
24545808 TEVFPFLTPILEALNITVFPRKVISFLTKSVKQIKEGRLKETQK 24545680
24544660 HRVDFLQLMIDSQNSKDSETHK 24544595
24542018 ALSDLELMAQSIIFIFAGYETTSSVLSFIIYELATHPDVQQKVQKEIDTVLPNK 24541857
24540386 APPTYDTVLQLEYLDMVVNETLRLFPVAMRLERVCKKDVEINGMFIPKGVVVMIPSYVLHHDPKYWREPEKFLPER 24540159
24539098 FSKKNKDNIDPYIYTPFGSGPRNCIGMRFALVNMKLALVRVLQNFSFKPCKETQ 24538937
24536720 IPLKLRFGGLLLTEKPIVLKAESRDETVSGA 24536628

>CYP3A43 AC011904 one exon per line
MDLIPNFAMETWVLVATSLVLLYI
YGTHSHKLFKKLGIPGPTPLPFLGTILFYLR
GLWNFDRECNEKYGEMWG
LYEGQQPMLVIMDPDMIKTVLVKECYSVFTNQM
PLGPMGFLKSALSFAEDEEWKRIRTLLSPAFTSVKFKE
MVPIISQCGDMLVRSLRQEAENSKSINLKE
FFGAYTMDVITGTLFGVNLDSLNNPQDPFLKNMKKLLKLDFLDPFLLLI
SLFPFLTPVFEALNIGLFPKDVTHFLKNSIERMKESRLKDKQK
HRVDFFQQMIDSQNSKETKSHK
ALSDLELVAQSIIIIFAAYDTTSTTLPFIMYELATHPDVQQKLQEEIDAVLPNK
APVTYDALVQMEYLDMVVNETLRLFPVVSRVTRVCKKDIEINGVFIPKGLAVMVPIYALHHDPKYWTEPEKFCPER
FSKKNKDSIDLYRYIPFGAGPRNCIGMRFALTNIKLAVIRALQNFSFKPCKETQ
IPLKLDNLPILQPEKPIVLKVHLRDGITSGP

>CYP3A43-de1b AC011904 alt exon 1 83% to 3A43 
MDLIPNFAMEMCVLLATSLVLLYL

>CYP3A43-de1b NT_007933.10|Hs7_8090  + strand alternate exon 1
24651143 MDLIPNFAMEMCVLLATSLVLLYL 24651214

>CYP3A43 NT_007933.10|Hs7_8090  + strand
24659241 MDLIPNFAMETWVLVATSLVLLYI 24659312
24667579 YGTHSHKLFKKLGIPGPTPLPFLGTILFYR 24667671
24670245 GLWNFDRECNEKYGEMWG 24670298
24675269 LYEGQQPMLVIMDPDMIKTVLVKECYSVFTNQM 24675367
24678613 PLGPMGFLKSALSFAEDEEWKRIRTLLSPAFTSVKFKE 24678726
24679291 MVPIISQCGDMLVRSLRQEAENSKSINLKD 24679380
24680672 FFGAYTMDVITGTLFGVNLDSLNNPQDPFLKNMKKLLKLDFLDPFLLLI 24680818
24686715 SLFPFLTPVFEALNIGLFPKDVTHFLKNSIERMKESRLKDKQK 24686843
24687958 HRVDFFQQMIDSQNSKETKSHK 24688023
24690954 ALSDLELVAQSIIIIFAAYDTTSTTLPFIMYELATHPDVQQKLQEEIDAVLPNK 24691115
24692738 APVTYDALVQMEYLDMVVNETLRLFPVVSRVTRVCKKDIEINGVFIPKGLAVMVPIYALHHDPKYWTEPEKFCPER 24692965
24694663 FSKKNKDSIDLYRYIPFGAGPRNCIGMRFALTNIKLAVIRALQNFSFKPCKETQ 24694824
24697025 IPLKLDNLPILQPEKPIVLKVHLRDGITSGP 24697123

>CYP3A43-de4c6c NT_007933.12|Hs7_8090 Homo sapiens chromosome 7 genomic contig
          Length = 64383103
Last exon of 3A43 in build 33 of human genome
24697113 IPLKLDNLPILQPEKPIVLKVHLRDGITSGP 24697205
CYP3A43-de4c6c = s in Fig. 3A, old 3A52P, 3A53P 
Exon 4 pseudogene fragment
Sbjct: 24701048 MIKTVLVKECYSVFPNWR 24701101
exon 6 pseudogene fragment
Sbjct: 24703375 EMLLIIL*YGDVFLRNLRQKVEKGKPLTLR 24703464

>CYP3A43-de4c6c NT_007933.10|Hs7_8090 [older version of seq]  + strand exon 4
24700966 MIKTVLVKECYSVFPN 247001013

>CYP3A43-de4c6c 3A52P AC011904 part of exon 4 downstream of CYP3A43, 88% to 3A4 
MIKTVLVKECYSVFPNWR

>CYP3A43-de4c6c 3A53P NT_033964.1|Hs7_34119 Homo sapiens chromosome 7 solo exon 6 downstream of 3A43
67275 EMLLIIL*YGDVFLRNLRQKVEKGKPLTLR 67186 ex 6

>CYP3A43-de4c6c 3A53P NT_033966.1|Hs7_34121 chromosome 7 
Exon 6 same as NT_033964.1|Hs7_34119 chromosome 7 lone exon 6 downstream of 3A43
1097550 IIL*YGDVFLRNLRQKVEKGKPLTLR 1097473

>CYP3A-se1[2] 3A54P NT_033967.1|Hs7_34122 chromosome 7 
chr7p22.2  4261928-4262002 + strand build 33 
2460622 LYRYGTCSHGLFKKMAIPGLTLLPF 2460548

>CYP3A-se2[5] 3A55P NT_033967.1|Hs7_34122 chromosome 7 
chr7p22.2 4047864-4047959 + strand build 33 214000bp from 3A54P
2246484 LESGLIITKHEKWKRLRSVISPAFPSGELQEV 2246579

>CYP4A11 NM_000778 12 exons MSVSVLSPSRLLGDVSGILQAASLLILLLLLIKAVQLYLHRQWLLKALQQFPCPPSHWLFGHIQE
LQQDQELQRIQKWVETFPSACPHWLWGGKVRVQLYDPDYMKVILGRS 
DPKSHGSYRFLAPWI 
GYGLLLLNGQTWFQHRRMLTPAFHYDILKPYVGLMADSVRVML 
DKWEELLGQDSPLEVFQHVSLMTLDTIMKCAFSHQGSIQVDR 
NSQSYIQAISDLNNLVFSRVRNAFHQNDTIYSLTSAGRWTHRACQLAHQHT 
DQVIQLRKAQLQKEGELEKIKRKRHLDFLDILLLAK 
MENGSILSDKDLRAEVDTFMFEGHDTTASGISWILYALATHPKHQERCREEIHSLLGDGASITW 
NHLDQMPYTTMCIKEALRLYPPVPGIGRELSTPVTFPDGRSLPKG 
IMVLLSIYGLHHNPKVWPNPEV 
FDPSRFAPGSAQHSHAFLPFSGGSR 
NCIGKQFAMNELKVATALTLLRFELLPDPTRIPIPIARLVLKSKNGIHLRLRRLPNPCEDKDQL*

>CYP4A11 NT_04852.13 - strand PFRF seq is polymorphic PSRF also found, both in ESTs
7941864 MSVSVLSPSRLLGDVSGILQAASLLILLLLLIKAVQLYLHRQWLLKALQQFPCPPSHWLFGHIQE 7941670
7938568 LQQDQELQRIQKWVETFPSACPHWLWGGKVRVQLYDPDYMKVILGRS 7938428
7937778 DPKSHGSYRFLAPWI 7937734
7937223 GYGLLLLNGQTWFQHRRMLTPAFHYDILKPYVGLMADSVRVML 7937095
7936078 DKWEELLGQDSPLEVFQHVSLMTLDTIMKCAFSHQGSIQVDR 7935953
7935584 NSQSYIQAISDLNNLVFSRVRNAFHQNDTIYSLTSAGRWTHRACQLAHQHT 7935432
7934991 DQVIQLRKAQLQKEGELEKIKRKRHLDFLDILLLAK 7934884
7934797 MENGSILSDKDLRAEVDTFMFEGHDTTASGISWILYALATHPKHQERCREEIHSLLGDGASITW 7934606
7934509 NHLDQMPYTTMCIKEALRLYPPVPGIGRELSTPVTFPDGRSLPK 7934378
7933479 GIMVLLSIYGLHHNPKVWPNPE 7933414
7933268 VFDPFRFAPGSAQHSHAFLPFSGGSR 7933191
7930740 NCIGKQFAMNELKVATALTLLRFELLPDPTRIPIPIARLVLKSKNGIHLRLRRLPNPCEDKDQL* 7930546

>25. CYP4A22 new 4A11 like sequence AL390073.5 95% identical to 4A11 
82163 MSVSVLSPSRRLGGVSGILQVTSLLILLLLLIKAAQLYLHRQWLLKALQQFPCPPSHWLFGHIQE 82357
85457 FQHDQELQRIQERVKTFPSACPYWIWGGKVRVQLYDPDYMKVILGRS 85597
86247 DPKSHGSYKFLAPRI 86291
86784 GYGLLLLNGQTWFQHRRMLTPAFHNDILKPYVGLMADSVRVML 86912
87946 DKWEELLGQDSPLEVFQHVSLMTLDTIMKSAFSHQGSIQVDR 88071
88440 NSQSYIQAISDLNSLVFCCMRNAFHENDTIYSLTSAGRWTHRACQLAHQHT 88592
89033 DQVIQLRKAQLQKEGELEKIKRKRHLDFLDILLLAK 89140
89227 MENGSILSDKDLRAEVDTFMFEGHDTTASGISWILYALATHPKHQERCREEIHGLLGDGASITW 89418
89515 NHLDQMPYTTMCIKEALRLYPPVPGIGRELSTPVTFPDGRSLPKG 89649
90545 IMVLLSIYGLHHNPKVWPNLE 90607
90754 VFDPSRFAPGSAQHSHAFLPFSGGSR 90831
93280 NCIGKQFAMNQLKVARALTLLRFELLPDPTRIPIPMARLVLKSKNGIHLRLRRLPNPCEDKDQL* 93474

>CYP4A22 NT_032977.5 + strand Build 33
52894 MSVSVLSPSRRLGGVSGILQVTSLLILLLLLIKAAQLYLHRQWLLKALQQFPCPPSHWLFGHIQE 53088
56188 FQHDQELQRIQERVKTFPSACPYWIWGGKVRVQLYDPDYMKVILGRS 56328
56978 DPKSHGSYKFLAPRI 57022
57515 GYGLLLLNGQTWFQHRRMLTPAFHNDILKPYVGLMADSVRVML 57643
58677 DKWEELLGQDSPLEVFQHVSLMTLDTIMKSAFSHQGSIQVDR 58802
59171 NSQSYIQAISDLNSLVFCCMRNAFHENDTIYSLTSAGRWTHRACQLAHQHT 59323
59764 DQVIQLRKAQLQKEGELEKIKRKRHLDFLDILLLAK 59871
59958 MENGSILSDKDLRAEVDTFMFEGHDTTASGISWILYALATHPKHQERCREEIHGLLGDGASITW 60149
60246 NHLDQMPYTTMCIKEALRLYPPVPGIGRELSTPVTFPDGRSLPKG 60380
61276 IMVLLSIYGLHHNPKVWPNLE 61338
61485 VFDPSRFAPGSAQHSHAFLPFSGGSR 61562
64011 NCIGKQFAMNQLKVARALTLLRFELLPDPTRIPIPMARLVLKSKNGIHLRLRRLPNPCEDKDQL* 64205

>CYP4A-se1[12] CYP4A26P AL731892.5 76% to 4A11 C-term upstream of 4A11
       TCIGKKFAMNEL 135092
135091 KVAMALTLLRFELLPDPTRIPVGIP*ILLKFKSGIHLHLRRV 134966

>CYP4A-se1[12] CYP4A26P NT_04852.13 - strand exon 12
7968299 TCIGKKFAMNELKVAMALTLLRFELLPDPTRIPVGIP*ILLKFKSGIHLHLRRVPNHCGDKDQL* 7968105

>CYP4A-se2[1] NT_04852.13 - strand exon 1 new
7994933 MSVSVLNPNRLPDGVSGLLQGASLLSLLLLLLKAAQP*LHR 7994811

>CYP4A-se3[12] CYP4A27P AL731892.5 67% to 4a11 C-term upstream of 4A26P
168338 NCIGKQFAMNELKVALTLLNFKLFSDPASIPILLPQMVLKFNNGIHLHVKRL 168183

>CYP4A-se3[12] CYP4A27P NT_04852.13 - strand exon 12
8001510 NCIGKQFAMNELKVALTLLNFKLFSDPASIPILLPQMVLKFNNGIHLHVKRLPNTCEDKNQI* 8001322

>CYP4A-se4[2] NT_04852.13 exon 2 - strand
8003576 XXXDQELKQFQKWVDF*IYPKRDFCDPVLTCCGGRKVHFLLYDPDKMEVILGRS 8003424

>CYP4B1 NM_000779
MVPSFLSLSFSSLGLWASGLILVLGFLKLIHLLLRRRTLAKAMDKFPGPPTHWLFGHALE
IQETGSLDKVVSWAHQFPYAHPLWFGQFIGFLNIYEPDYAKAVYSRG
DPKAPDVYDFFLQWIG
RGLLVLEGPKWLQHRKLLTPGFHYDVLKPYVAVFTESTRIML
DKWEEKAREGKSFDIFCDVGHMALNTLMKCTFGRGDTGLGHR
DSSYYLAVSDLTLLMQQRLVSFQYHNDFIYWLTPHGRRFLRACQVAHDHT
DQVIRERKAALQDEKVRKKIQNRRHLDFLDILLGAR
DEDDIKLSDADLRAEVDTFMFEGHDTTTSGISWFLYCMALYPEHQHRCREEVREILGDQDFFQW
DDLGKMTYLTMCIKESFRLYPPVPQVYRQLSKPVTFVDGRSLPA
GSLISMHIYALHRNSAVWPDPE
VFDSLRFSTENASKRHPFAFMPFSAGPR
NCIGQQFAMSEMKVVTAMCLLRFEFSLDPSRLPIKMPQLVLRSKNGFHLHLKPLGPGSGK

>CYP4B1 NT_04852.13 + strand
7799513 MVPSFLSLSFSSLGLWASGLILVLGFLKLIHLLLRRQTLAKAMDKFPGPPTHWLFGHALE 7799692
7811239 IQETGSLDKVVSWAHQFPYAHPLWFGQFIGFLNIYEPDYAKAVYSRG 7811379
7811570 DPKAPDVYDFFLQWIG 7811617
7812929 RGLLVLEGPKWLQHRKLLTPGFHYDVLKPYVAVFTESTRIML 7813054
7813913 DKWEEKAREGKSFDIFCDVGHMALNTLMKCTFGRGDTGLGHR 7814038
7814344 DSSYYLAVSDLTLLMQQRLVSFQYHNDFIYWLTPHGRRFLRACQVAHDHT 7814493
7814639 DQVIRERKAALQDEKVRKKIQNRRHLDFLDILLGAR 7814746
7815505 DEDDIKLSDADLRAEVDTFMFEGHDTTTSGISWFLYCMALYPEHQHRCREEVREILGDQDFFQW 7815696
7817480 DDLGKMTYLTMCIKESFRLYPPVPQVYRQLSKPVTFVDGRSLPA 7817611
7818395 GSLISMHIYALHRNSAVWPDPE 7818460
7818562 VFDSLRFSTENASKRHPFAFMPFSAGPR 7818645
7819063 NCIGQQFAMSEMKVVTAMCLLRFEFSLDPSRLPIKMPQLVLRSKNGFHLHLKPLGPGSGK 7819242

>CYP4F2 NM_001082 revised exon 2 and C-terminal
MSQLSLSWLGLWPVAASPWLLLLLVGASWLLAHVLAWTYAFYDNCRRLRCFPQPPRRNWFWGHQGM
IHSSEEGLLYTQSLACTFGDMGCWWVGPWQAVIHIFLPTCIKPVLFAS
AAIAPKDKFFYSFLEPWLG
DGLLLSAGDKWSRHRRMLTPAFHFNILKPYMKIFNESVNIMH
AKWQLLASEGSACLDMFEHISLMTLDSLQKCVFSFDSHCQE
KPSEYIAAILELSALVSKRHHEILLHIDFLYYLTPDGQRFRRACRLVHDFTDAVIQERRRTLPSQGV
DDFLQAKAKSKTLDFIDVLLLSK
DEDGKKLSDEDIRAEADTFMFE
GHDTTASGLSWVLYHLAKHPEYQERCRQEVQELLKDREPKEIEW
DDLAHLPFLTMCMKESLRLHPPVPVISRHVTQDIVLPDGRVIPK
GIICLISVFGTHHNPAVWPDPE
VYDPFRFDPENIKERSPLAFIPFSAGPR
NCIGQTFAMAEMKVVLALTLLAFRVLPDHTEPRRKPELVLRAEGGLWLRVEPLS

>CYP4F2 NT_011295.8 - strand
1290503 MSQLSLSWLGLWPVAASPWLLLLLVGASWLLAHVLAWTYAFYDNCRRLRCFPQPPRRNWFWGHQGM 1290306
1286773 IHSSEEGLLYTQSLACTFGDMGCWWVGPWQAVIHIFLPTCIKPVLFA 1286633
1285475 AAIAPKDKFFYSFLEPWLG 1285419
1285326 DGLLLSAGDKWSRHRRMLTPAFHFNILKPYMKIFNESVNIMH 1285201
1283325 AKWQLLASEGSACLDMFEHISLMTLDSLQKCVFSFDSHCQE 1283203
1282584 KPSEYIAAILELSALVSKRHHEILLHIDFLYYLTPDGQRFRRACRLVHDFTDAVIQERRRTLPSQGVDDFLQAKAKSKTLDFIDVLLLSK 1282315
1279200 DEDGKKLSDEDIRAEADTFMFE 1279135
1278946 GHDTTASGLSWVLYHLAKHPEYQERCRQEVQELLKDREPKEIEW 1278815
1272788 DDLAHLPFLTMCMKESLRLHPPVPVISRHVTQDIVLPDGRVIPK 1272657
1272561 GIICLISVFGTHHNPAVWPDPE 1272496
1272320 VYDPFRFDPENIKERSPLAFIPFSAGPR 1272237
1271827 NCIGQTFAMAEMKVVLALTLLRFRVLPDHTEPRRKPELVLRAEGGLWLRVEPLS* 1271663

>CYP4F2-de12b CYP4F36P NT_011281.4|Hs19_11438 chromosome lone last exon 75% TO 4F11
66609 ICMGQVFAVAEMKVVLALTLLGFRFLRHHMEPGRKPELIMHAEGGL*LLVEPLS 66448

>CYP4F2-de12b CYP4F36P NT_011295.8 EXON 12 - strand
1264730 ICMGQVFAVAEMKVVLALTLLGFRFLRHHMEPGRKPELIMHAEGGL*LLVEPLS 1264569

>CYP4F3 NM_000896
MPQLSLSSLGLWPMAASPWLLLLLVGASWLLARILAWTYTFYDNCCRLRCFPQPPKRNWFLGHLGL
IHSSEEGLLYTQSLACTFGDMCCWWVGPWHAIVRIFHPTYIKPVLFAP
AAIVPKDKVFYSFLKPWLG
DGLLLSAGEKWSRHRRMLTPAFHFNIL
KPYMKIFNESVNIMHAKWQLLASEGSARLDMFEHISLMTLDSLQKCVFSFDSHCQE
KPSEYIAAILELSALVTKRHQQILLYIDFLYYLTPDGQRFRRACRLVHDFTDDVIQERRR
TLPSQGVDDFLQAKAKSKTLDFIDVLLLSK
DEDGKKLSDEDIRAEADTFMFE
GHDTTASGLSWVLYHLAKHPEYQERCRQEVQELLKDREPKEIEW
DDLAQLPFLTMCIKESLRLHPPVPAVSRCCTQDIVLPDGRVIPK
GIICLISVFGTHHNPAVWPDPEVYDPFRFDPKNIKERSPLAFIPFSAGPR
NCIGQAFAMAEMKVVLGLTLLAFRVLPDHTEPRRKPELVLRAEGGLWLRVEPLS

>CYP4F3 NT_011295.8 + strand
1034308 MPQLSLSSLGLWPMAASPWLLLLLVGASWLLARILAWTYTFYDNCCRLRCFPQPPKRNWFLGHLGL 1034505
1036799 VTPTEQGMRVLTQLVATYPQGFKVWMGPIFPVIRFCHPNIIRSVINAS 1036942 NEW ALT EXON 2
1038611 IHSSEEGLLYTQSLACTFGDMCCWWVGPWHAIVRIFHPTYIKPVLFAP 1038754 4F3
1039943 AAIVPKDKVFYSFLKPWLG 1039999
1040091 DGLLLSAGEKWSRHRRMLTPAFHFNILKPYMKIFNESVNIMH 1040216
1042052 AKWQLLASEGSARLDMFEHISLMTLDSLQKCVFSFDSHCQE 1042174
1042806 KPSEYIAAILELSALVTKRHQQILLYIDFLYYLTPDGQRFRRACRLVHDFTDAVIQERRRTLPSQGVDDFLQAKAKSKTLDFIDVLLLSK 1043075
1045461 DEDGKKLSDEDIRAEADTFMFE 1045526
1045714 GHDTTASGLSWVLYHLAKHPEYQERCRQEVQELLKDREPKEIEW 1045845
1051157 DDLAQLPFLTMCIKESLRLHPPVPAVSRCCTQDIVLPDGRVIPK 1051288
1051382 GIICLISVFGTHHNPAVWPDPE 1051447
1051619 VYDPFRFDPKNIKERSPLAFIPFSAGPR 1051702
1052113 NCIGQAFAMAEMKVVLGLTLLRFRVLPDHTEPRRKPELVLRAEGGLWLRVEPL 1052271

>CYP4F8 NM_007253
MSLLSLSWLGLRPVAASPWLLLLVVGASWLLARILAWTYAFYHNGRRLRCFPQPRKQNWFLGHLGL
VTPTEEGLRVLTQLVATYPQGFVRWLGPITPIINLCHPDIVRSVINTS
DAITDKDIVFYKTLKPWLG
DGLLLSVGDKWRHHRRLLTPAFHFNILKPYIKIFSKSANIMH
AKWQRLAMEGSTCLDVFEHISLMTLDSLQKCIFSFDSNCQE
KPSEYITAIMELSALVVKRNNQFFRYKDFLYFLTPCGRRFHRACRLVHDFTDAVIQERRRTLTSQGVDDFLQAKAKSKTLDFIDVLLLSE
DKNGKELSDEDIRAEADTFMFG
GHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDREPKEIEW
DDLAQLPFLTMCLKESLRLHPPIPTFARGCTQDVVLPDSRVIPK
GNVCNINIFAIHHNPSVWPDPE
VYDPFRFDPENAQKRSPMAFIPFSAGPR
NCIGQKFAMAEMKVVLALTLLRFRILPDHREPRRTPEIVLRAEDGLWLRVEPLG

>CYP4F8 NT_011295.8 + strand
1008510 MSLLSLSWLGLRPVAASPWLLLLVVGASWLLARILAWTYAFYHNGRRLRCFPQPRKQNWFLGHLGL 1008707 
1010893 VTPTEEGLRVLTQLVATYPQGFVRWLGPITPIINLCHPDIVRSVINTS 1011036
1012385 AITDKDIVFYKTLKPWLG 1012438
1012530 DGLLLSVGDKWRHHRRL 1012580 (FRAMESHIFT)
1012580 VTPAFHFNILKPYIKIFSKSANIMH 1012654
1015111 AKWQRLAMEGSTCLDVFEHISLMTLDSLQKCIFSFDSNCQE 1015233
1015998 KPSEYITAIMELSALVVKRNNQFFRYKDFLYFLTPCGRRFHRACRLVHDFTDAVIQERRRTLTSQGVDDFLQAKAKSKTLDFIDVLLLSE 1016267
1016591 DKNGKELSDEDIRAEADTFMFG 1016656
1016856 GHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDREPKEIEW 1016987
1021195 DDLAQLPFLTMCLKESLRLHPPIPTFARGCTQDVVLPDSRVIPK 1021326
1021422 GNVCNINIFAIHHNPSVWPDPE 1021487
1021653 VYDPFRFDPENAQKRSPMAFIPFSAGPR 1021736
1022086 NCIGQKFAMAEMKVVLALTLLRFRILPDHREPRRTPEIVLRAEDGLWLRVEPLG 1022247

>27P. CYP4F9P     HUMAN AC004609 complement(13903-33104 region)
EGMRVLTQLVATYPQGFKIWMNPITPIIRLCHPNIIWSVINAS
ATIAPKDEAFYKFLKPWLG
DGLLVNASDKWSCHRQMLMPAFHFNMLKPYMKFFTDSVNIMH
AKWQLLASGGSAHLDMFEHTSLMTLDSVQKCVFSFDSHCQE
KPSQYIATILELSSLFSXXXXXXXLCMDFLYYLIPSGWRFRRACCLVHDFTEAIIQEQRH
TLTSQGVDYFHEVKAKSKTLDFTDVLLLSK
XXXXXXXXXXXXXXXXXXXXXX
GHNTTASGLSWVLYYLARHPEYQEHCWQEVQKLLKDHEPKEIE*
DDLAQLPFLTMYIKDSLWLHPPVPVISRCCTQDIVLPGG*VIPK
GIVCLFSNFETHHNPTVWLDPE
VYDPFRFDPENSKERSPLAFIPFSAGSX
NCIGQAFAMAEMKVVLALTLLCFRVCPDHMEPRRKPEVIMYAEGGLWLWVKPLS

>CYP4F9P NT_011295.8 missing exon 1, deletion in exon 5 - strand
1391916 KEGMRVLTQLVATYPQGFKIWMNPITPIIRLCHPNIIWSVINAS 1391785
1390474 ATIAPKDEAFYKFLKPWLG 1390418
1390320 DGLLVNASDKWSCHRQMLMPAFHFNMLKPYMKFFTDSVNIMH 1390195
1386105 AKWQLLASGGSAHLDMFEHTSLMTLDSVQKCVFSFDSHCQE 1385983
1385347 KPSQYIATILELSSLVXXXXXXXXXXX 1385298
1385194 DFLYYLIPSGWRFRRACCLVHDFTEAIIQEQRHTLTSQGVDYFHEVKAKSKTLDFTDVLLLSK 1385006
1383205 DEDWKELSDEEIRAEADTFMFE 1383140
1382939 GHNTTASGLSWVLYYLARHPEYQEHCWQEVQKLLKDHEPKEIE* 1382808
1374358 DDLAQLPFLTMYIKDSLWLHPPVPVISRCCTQDIVLPGG*VIPK 1374227
1374131 GIVCLFSNFETHHNPTVWLDPE 1374066
1373883 VYDPFRFDPENSKERSPLAFIPFSAGSR 1373800
1372873 NCIGQAFAMAEMKVVLALTLLCFRVCPDHMEPRRKPEVIMYAEGGLWLWVKPLSADPQ* 1372697

>CYP4F10P AD000685 exons 3-6 (cosmid ends in middle of exon 6) 80% to 4F3
ATIAPKDKVFYSFLKPWLG
DGFLLSAGDKWSCHRGMLMPAFHFNILKPYMKIFDESVNIMH
AKWHLLTLEHNACLDMFEHMNLMTLDSLQKCVFSFDSQCQE
KPSEYIASVLELSSLVAKRNQQILLHMDFLYYLTHDGRHFHRACRLGHDFADAAIQEKRH
TLPSQGVDDFLQAKAKSKTLDFIDVLLISK
DEDGKELSDEDIRV*ADTFMFE
GYDTRTSGLSWVQYNLAMHPECQERCGQERQEFLKVWEPKEIEW

>CYP4F10P NT_011295.8 + strand exons 3-8
1057001 ATIAPKDKVFYSFLKPWLG 1057057
1057150 DGFLLSAGDKWSCHRGMLMPAFHFNILKPYMKIFDESVNIMH 1057275
1059122 AKWHLLTLEHNACLDMFEHMNLMTLDSLQKCVFSFDSQCQE 1059244
1059864 KPSEYIASVLELSSLVAKRNQQILLHMDFLYYLTHDGRHFHRACRLGHDFADAAIQEKRHTLPSQGVDDFLQAKAKSKTLDFIDVLLISK 1060133
1062340 DEDGKELSDEDIRV*ADTFMF 1062402
1062587 GYDTRTSGLSWVQYNLAMHPECQERCGQERQEFLKVWEPKEIEW 1062718

>CYP4F11 N-terminal is on AC011517 rest is on AC020950
MPQLSLSWLGLGPVAASPWLLLLLVGGSWLLARVLAWTYTFYDNCRRLQCFPQPPKQNWFWGHQGL
VTPTEEGMKTLTQLVTTYPQGFKLWLGPTFPLLILCHPDIIRPITSAS
AAVAPKDMIFYGFLKPWLG
DGLLLSGGDKWSRHRRMLTPAFHFNLKPYMKIFNKSVNIMH
DKWQRLASEGSARLDMFEHISLMTLDSLQKCVFSFESNCQE
KPSEYIAAILELSAFVEKRNQQILLHTDFLYYLTPDGQRFRRACHLVHDFTDAVIQERRRTLPTQGIDDFLKNKAKSKTLDFIDVLLLSK
DEDGKELSDEDIRAEADTFMFE
GHDTTASGLSWVLYHLAKHPEYQEQCRQEVQELLKDREPIEIEW
DDLAQLPFLTMCIKESLRLHPPVPVISRCCTQDFVLPDGRVIPK
GIVCLINIIGIHYNPTVWPDPE
VYDPFRFNQENIKERSPLAFIPFSAGPR
NCIGQAFAMAEMKVVLALTLLHFRILPTHIEPRRKPELILRAEGGLWLRVEPLGANSQ

>CYP4F11 NT_011295.8 - strand
1327300 MPQLSLSWLGLGPVAASPWLLLLLVGGSWLLARVLAWTYTFYDNCRRLQCFPQPPKQNWF 1327119
1322493 VTPTEEGMKTLTQLVTTYPQGFKLWLGPTFPLLILCHPDIIRPITSAS 1322350
1320376 AAVAPKDMIFYGFLKPWLG 1320320
1320229 DGLLLSGGDKWSRHRRMLTPAFHFNILKPYMKIFNKSVNIMH 1320104
1317774 DKWQRLASEGSARLDMFEHISLMTLDSLQKCVFSFESNCQE 1317652
1316973 KPSEYIAAILELSAFVEKRNQQILLHTDFLYYLTPDGQRFRRACHLVHDFTDAVIQERRCTLPTQGIDDFLKNKAKSKTLDFIDVLLLSK 1316704
1315322 DEDGKELSDEDIRAEADTFMFE 1315257
1315059 GHDTTASGLSWVLYHLAKHPEYQEQCRQEVQELLKDREPIEIEW 1314928
1307786 DDLAQLPFLTMCIKESLRLHPPVPVISRCCTQDFVLPDGRVIPK 1307655
1307559 GIVCLINIIGIHYNPTVWPDPE 1307494
1307279 VYDPFRFDQENIKERSPLAFIPFSAGPR 1307196
1306800 NCIGQAFAMAEMKVVLALTLLHFRILPTHTEPRRKPELILRAEGGLWLRVEPLGANSQ* 1306624

>CYP4F12 GenEMBL AC004523  missing N-terminal
ITPTEEGLKNSTQMSATYSQGFTIWLGPIIPFIVLCHPDTIRSI
TNASAAIAPKDNLFIRFLKPWLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFN
KSANIMLDKWQHLASEGSSCLDMFEHISLMTLDSLQKCIFSFDSHCQERPSEYIATIL
ELSALVEKRSQHILQHMDFLYYLSHDGRRFHRACRLVHDFTDAVIRERRRTLPTQGID
DFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTFMFGGHDTTASGLSWVLY
NLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLRLHPPAPFISR
CCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSPLAF
IPFSAGPRNCIGQAFAMAEMKVVLALMLLHFRFLPDHTEPRRKLELIMRAEGGLWLRV
EPLNVSLQ

>CYP4F12 mRNA for cytochrome P450, complete cds. AB035130
MSLLSLPWLGLRPVAMSPWLLLLLVVGSWLLARILAWTYAFYNNCRRLQCFPQPPKRNWFWGHLGL
ITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNAS
AAIAPKDNLFIRFLKPWLG
EGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIML
DKWQHLASEGSSRLDMFEHISLMTLDSLQKCIFSFDSHCQERP
SEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRACRLVHDFTDAVIRERRR
TLPTQGIDDFFKDKAKSKTLDFIDVLLLSK
DEDGKALSDEDIRAEADTFMFG
GHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEW
DDLAQLPFLTMCVKESLRLHPPAPFISRCCTQDIVLPDGRVIPK
GITCLIDIIGVHHNPTVWPDPE
VYDPFRFDPENSKGRSPLAFIPFSAGPR
NCIGQAFAMAEMKVVLALMLLHFRFLPDHTEPRRKLELIMRAEGGLWLRVEPLNVGLQ

>CYP4F12 NT_011295.8 + strand
1066422 MSLLSLPWLGLRPVATSPWLLLLLVVGSWLLARILAWTYAFYNNCRRLQCFPQPPKRNWFWGHLGL 1066619
1071153 ITPTEEGLKNSTQMSATYSQGFTIWLGPIIPFIVLCHPDTIRSITNAS 1071296
1073135 AAIAPKDNLFIRFLKPWLG 1073191
1073286 EGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIML 1073411
1075281 DKWQHLASEGSSCLDMFEHISLMTLDSLQKCIFSFDSHCQE 1075403
1076386 RPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRACRLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSK 1076655
1077708 DEDGKALSDEDIRAEADTFMFG 1077773
1077959 GHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEW 1078090
1088829 DDLAQLPFLTMCVKESLRLHPPAPFISRCCTQDIVLPDGRVIPK 1088960
1089052 GITCLIDIIGVHHNPTVWPDPE 1089117
1089322 VYDPFRFDPENSKGRSPLAFIPFSAGPR 1089405
1089801 NCIGQAFAMAEMKVVLALMLLHFRFLPDHTEPRRKLELIMRAEGGLWLRVEPL 1089959

>CYP4F22 AC011492 assembled gene 13 exons 114537-140651 66% to 4F3, 65% to 4F11, 
MLPITDRLLHLLGLEKTAFRIYAVSTLLLFLLFFLFRLLLRFLRLCRSFYITCRRLRCFPQPPRRNWLLGHLGMVS
PNEAGLQDEKKVLDNMHHVLLVWMGPVLPLLVLVHPDYIKPLLGAS
AAIAPKDDLFYGFLKPWLG
DGLLLSKGDKWSRHRRLLTPAFHFDILKPYMKIFNQSADIMH
AKWRHLAEGSAVSLDMFEHISLMTLDSLQKCVFSYNSNCQE
KMSDYISAIIELSALSVRRQYRLHHYLDFIYYRSADGRRFRQACDMVHHFTTEVIQERRR
ALRQQGAEAWLKAKQGKTLDFIDVLLLAR
DEDGKELSDEDIRAEADTFMFE
GHDTTSSGISWMLFNLAKYPEYQEKCREEIQEVMKGRELEELEW
DDLTQLPFTTMCIKESLRQYPPVTLVSRQCTEDIKLPDGRIIPK
GIICLVSIYGTHHNPTVWPDSK
VYNPYRFDPDNPQQRSPLAYVPFSAGPR
NCIGQSFAMAELRVVVALTLLRFRLSVDRTRKVRRKPELILRTENGLWLKVEPLPPRA*

>CYP4F22 NT_011295.8 + strand
918230 MLPITDRLLHLLGLEKTAFRIYAVSTLLLFLLFFLFRLLLRFLRLCRSFYITCRRLRCFPQPPRRNWLLGHLGMVS 918457
922608 PNEAGLQDEKKVLDNMHHVLLVWMGPVLPLLVLVHPDYIKPLLGAS 922745
930253 AAIAPKDDLFYGFLKPWLG 930309
930430 DGLLLSKGDKWSRHRRLLTPAFHFDILKPYMKIFNQSADIMH 930555
930765 AKWRHLAEGSAVSLDMFEHISLMTLDSLQKCVFSYNSNCQE 930887
933344 KMSDYISAIIELSALSVRRQYRLHHYLDFIYYRSADGRRFRQACDMVHHFTTEVIQERRRALRQQGAEAWLKAKQGKTLDFIDVLLLAR 933610
936864 DEDGKELSDEDIRAEADTFMFEG 936932
937042 GHDTTSSGISWMLFNLAKYPEYQEKCREEIQEVMKGRELEELEW 937173
941002 DDLTQLPFTTMCIKESLRQYPPVTLVSRQCTEDIKLPDGRIIPK 941133
942030 GIICLVSIYGTHHNPTVWPDSK 942095
943567 VYNPYRFDPDNPQQRSPLAYVPFSAGPR 943650
944188 NCIGQSFAMAELRVVVALTLLRFRLSVDRTRKVRRKPELILRTENGLWLKVEPLPPRA* 944364

>XM_065069 incorrectly assembled 4F23P
MRGNEEDMRLMEDLGHYFRDVQLWWLGSFYPVLHLVHPTFTAPV
LQASAAVALKDMSFYGFLKPWLGDGLLISAGDKWRWHRHLLTPAFHFKILKPYVKIFN
ESTNIMHAKWQRLALEGSVRLEMFEHISLMTLDSLQKCIFSFDSNCQDEYIDAILELS
ALSLKRHQHIFLLTDFLYFLTPNGRRFCRACDIVHNFTDAVIQERRRTLTSQGVDDFL
QAKAKSKTLDFIDVLLLAKDENGKKLSDENIRAEADTFMSGGHDTTASGLSWVLYNLA
RYPEYQEHCRQEVQELLKNGDPKEIEWDDLAQLPFLTMCLKESLRLHSPVSRIHRCCP
QDGVLPDGRVIPKGNTCTISIFGIHHNPSVWPDPEVLPLPPSPSRGLVYDPFRFDPEN
LQKTSPLAFIPFSAVPRRGSRRDGAGMGVLGTVRVPTPSPGNCIGQTFAMAEMKVVLA
LTLLRFRVLPDHAEPRRKLELIVRAEDGLWLRVEPLSADLQ

>CYP4F23P AC011492 assembled gene 76% to 4F3, 76% to 4F8, 76% to 4F11, 73% to 
MSLLSLSWLGLGPVAASPWLLLLLVGASWLLARVLAWTYAFYDNCHRLQCFQQPPKRNCF*GHLSLVS
GNEEDMRLMEDLGHYFRDVQLWWLGSFYPVLHLVHPTFTAPVLQAS 
AAVALKDMSFYGFLKPWLG
DGLLISAGDKWRWHRHLLTPAFHFKILKPYVKIFNESTNIMH
AKWQRLALEGSVRLEMFEHISLMTLDSLQKCIFSFDSNCQE
KPSEYIDAILELSALSLKRHQHIFLLTDFLYFLTPNGRRFCRACDIVHNFTDAVIQERRR
TLTSQGVDDFLQAKAKSKTLDFIDVLLLAK 
DENGKKLSDENIRAEADTFMSG
GHDTTASGLSWVLYNLARYPEYQEHCRQEVQELLKNGDPKEIEW
DDLAQLPFLTMCLKESLRLHSPVSRIHRCCPQDGVLPDGRVIPK 
GNTCTISIFGIHHNPSVWPDPEV
YDPFRFDPENLQKTSPLAFIPFSAVPR
NCIGQTFAMAEMKVVLALTLLRFRVLPDHAEPRRKLELIVRAEDGLWLRVEPLSADLQ* 

>CYP4F23P NT_011295.8 + strand
956966 MSLLSLSWLGLGPVAASPWLLLLLVGASWLLARVLAWTYAFYDNCHRLQCFQQPPKRNCF*GHLSLVS 957169
964633 GNEEDMRLMEDLGHYFRDVQLWWLGSFYPVLHLVHPTFTAPVLQAS 964770
966485 AAVALKDMSFYGFLKPWLG 966541
966633 DGLLISAGDKWRWHRHLLTPAFHFKILKPYVKIFNESTNIMH 966758
968607 AKWQRLALEGSVRLEMFEHISLMTLDSLQKCIFSFDSNCQE 968729
969459 KPSEYIDAILELSALSLKRHQHIFLLTDFLYFLTPNGRRFCRACDIVHNFTDAVIQERRRTLTSQGVDDFLQAKAKSKTLDFIDVLLLAK 969728
970066 DENGKKLSDENIRAEADTFMSG 970131
970323 GHDTSASGLSWVLYNLARYPEYQEHCRQEVQELLKNGDPKEIEW 970454
976244 DDLAQLPFLTMCLKESLRLHSPVSRIHRCCPQDGVLPDGRVIPK 976375
976472 GNTCTISIFGIHHNPSVWPDPE 976537
977017 VYDPFRFDPENLQKTSPLAFIPFSAVPR 977100
977453 NCIGQTFAMAEMKVVLALTLLRFRVLPDHAEPRRKLELIVRAEDGLWLRVEPLSADLQ* 977629

>CYP4F24P AC011537 77% to 4F11
MPQLSLSWLGLGQVAAFPWLLLLLAGASRLLAGFLAWTYAFYDNCRRLQYFPQPPKQKWFWGQPGP 
IIATEEGLKNLTQMSATYPQGFRIWLGPIFPFIVLCHPDIVRSITNAS 
AAIAPKDDLSIRFLKPWLG
EGILLSGGDKWSRHRRMLTPAFHFNILKPYIKIFNRSVNIMH 
DKWQHLASEGSSRLDMFEHISLMTLDSLQKCIFSFDSHCQE 
RPSEYIATILELSALVEKRNQHILQHMDFLYYLSHDGWRFRRACRLVHDFTDAVIQERRH 
TLPTQGI 
CDFLKNKAKS*TLDFIDVLLLSK 
DEDGKVLSDEDVRAEADTFMAG
GHDTTASGLSWVLYNLARHPEYQEHCRQEVQELLKDRDPKEIEW 
YDLAQLPFLTMCVKESLRLHPPVPYTSRHRIWDIVLPDGRVIPK 
GIICIINIIGIHHNPTV*PDPE 
VYNPFRFNSENSKERSPLAFIPFSAGPR 

>CYP4F24P XP_065068 this seq deletes a bad exon and makes a shortened protein
  1 MPQLSLSWLG LGQVAAFPWL LLLLAGASRL LAGFLAWTYA FYDNCRRLQY FPQPPKQKWF
 61 WGQPGPQEGL KNLTQMSATY PQGFRIWLGP IFPFIVLCHP DIVRSITNAS AAIAPKDDLS
121 IRFLKPWLGE GILLSGGDKW SRHRRMLTPA FHFNILKPYI KIFNRSVNIM HDKWQHLASE
181 GSSRLDMFEH ISLMTLDSLQ KCIFSFDSHC QERPSEYIAT ILELSALVEK RNQHILQHMD
241 FLYYLSHDGW RFRRACRLTL DFIDVLLLSK DEDGKVLSDE DVRAEADTFM FAGHDTTASG
301 LSWVLYNLAR HPEYQEHCRQ EVQELLKDRD PKEIEWYDLA QLPFLTMCVK ESLRLHPPVP
361 YTSRHRIWDI VLPDGRVYNP FRFNSENSKE RSPLAFIPFS AGPSLWENVH RMGADLGAVE
421 DPGPGFRRDG AQMRILGAVT VPTPTRRNCI GQAFAMAKMK VVLALTLLRF RFLLDHTEPR
481 RKPELIMRAE GGLWLRVEPL NAGLQ

>CYP4F24P NT_011295.8 FRAMESHIFT IN EXON 6 - strand
1172803 MPQLSLSWLGLGQVAAFPWLLLLLAGASRLLAGFLAWTYAFYDNCRRLQYFPQPPKQKWFWGQPGP 1172606
1169434 IIATEEGLKNLTQMSATYPQGFRIWLGPIFPFIVLCHPDIVRSITNAS 1169291
1167230 AAIAPKDDLSIRFLKPWLG 1167174
1167080 EGILLSGGDKWSRHRRMLTPAFHFNILKPYIKIFNRSVNIMH 1166955
1165049 DKWQHLASEGSSRLDMFEHISLMTLDSLQKCIFSFDSHCQE 1164927
1164146 RPSEYIATILELSALVEKRNQHILQHMDFLYYLSHDGWRFRRACRLVHDFTDAVIQERRHTLPTQGI 1163946
1163946 CDFLKNKAKS*TLDFIDVLLLSK 1163878
1162871 DEDGKVLSDEDVRAEADTFMFA 1162806
1162607 GHDTTASGLSWVLYNLARHPEYQEHCRQEVQELLKDRDPKEIEW 1162476
1154266 YDLAQLPFLTMCVKESLRLHPPVPYTSRHRIWDIVLPDGRVIPK 1154135
1154042 GIICIINIIGIHHNPTV*PDPE 1153977
1153789 VYNPFRFNSENSKERSPLAFIPFSAGPR 1153706
1153311 NCIGQAFAMAKMKVVLALTLLRFRFLLDHTEPRRKPELIMRAEGGLWLRVEPLNAGLQ 1153138

>AC018949 62% to 4F3 pseudogene
found in two places 100% match
CYP4F-se1[6:8]  chr9p11.2 43742181-43742487 (4F25P) + strand build 33
CYP4F-se13[6:8] chr9q13   63753376-63753682 (4F45P) + strand build 33
LWRCFLSPTSPCRNPSEYIATILELSALIV*QHQQICLCTDFLYYLTPEGRCFCRACDLVHNF 
DTIILERHCTLTSQGVDDFLNAKATFKIFDFSDAFVLSK 

>CYP4F-se1[6:8] NT_078056.1|Hs9_78125 Homo sapiens chromosome 9 genomic contig
          Length = 79560 = 4F25P CYP4F-se1[6:8]

Query: 1     PSEYIATLFELSALIV*WHQQICLCMDFLYYPFPEG*CFCRACDLVHNF 49
             PSEYIAT+ ELSALIV* HQQICLC DFLYY  PEG CFCRACDLVHNF
Sbjct: 65960 PSEYIATILELSALIV*QHQQICLCTDFLYYLTPEGRCFCRACDLVHNF 66106

Query: 50    DTIILEQPCTLTSQGVHDFLKTKATFKTWDFSDALVLGK 88
             DTIILE+ CTLTSQGV DFL  KATFK +DFSDA VL K
Sbjct: 66108 DTIILERHCTLTSQGVDDFLNAKATFKIFDFSDAFVLSK 66224

Query: 111   GYDSTATGLY*ILCNLRRHP 130
             GYDSTA+GLY IL  LRRHP
Sbjct: 66811 GYDSTASGLYWILVPLRRHP 66870

Query: 131   GVMPSCGARVLRDHNPEDIEW 151
             GV+P  GARVLRD N EDIEW
Sbjct: 66878 GVLPPRGARVLRDGNSEDIEW 66940

>CYP4F-se2[6] CYP4F26P AL139008 Homo sapiens chromosome 9 clone RP11-255A11 81% TO 4F25P
chr9p13.3 33593291-33593549 + strand build 33 
SDYVATILELHALIV*WHQQICLCMDFLYYLTPEGRCFCRAYDLVHNF 
DTIILEQHCTLTSQSVDILKARATFKTLDFIDALVLSK

>CYP4F-se2[6] NT_008413.15|Hs9_8570 Homo sapiens chromosome 9 genomic contig
          Length = 39435727

Query: 2        SEYIATLFELSALIV*WHQQICLCMDFLYYPFPEG*CFCRACDLVHNF 49
                S+Y+AT+ EL ALIV*WHQQICLCMDFLYY  PEG CFCRA DLVHNF
Sbjct: 33593291 SDYVATILELHALIV*WHQQICLCMDFLYYLTPEGRCFCRAYDLVHNF 33593434

Query: 50       DTIILEQPCTLTSQGVHDFLKTKATFKTWDFSDALVLGK 88
                DTIILEQ CTLTSQ V D LK +ATFKT DF DALVL K
Sbjct: 33593436 DTIILEQHCTLTSQSV-DILKARATFKTLDFIDALVLSK 33593549

>CYP4F-se3[6:7:8] CYP4F27P AC018804.2 Homo sapiens clone RP11-397H17 62% TO CYPF25P
chr2q21.1 130711641-130711907 - strand build 33
21476 LRSCFLSLASPCRNASEHTADILELSTLIV*RRQ*ICLCLDFLYYPFLRGDASAGPVTW 21300
21299 CTTSDTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSK 21171

>CYP4F-se3[6:7:8] NT_005079.12|Hs2_5236 Homo sapiens chromosome 2 genomic contig
          Length = 4419942 = 4F27
chr 2 130899265-130900232 July 2003 freeze
3868414 SEHTADILELSTLIV*RRQ*ICLCLDFLYYPFLRGDASAGPVTWCTTSDTIILQQHRTLT 3868235
3868234 SQGVDDFLKAKATFKASDFIDALVLSK 3868154
3867831 QDENGKELSDEDI*MEAGIFMSAG 3867760
3867566 GYDSRASGLY*ILYNLTKHPDI-GSAASL*CKSPNPEDIEW 3867447

>CYP4F-se3[6:8] chr2q21.1 130711641-130711937
4 DIFFS TO 27P 2 diffs to CYP4F-se10
Query:     1 CFLSLASPCRNASELIADILELSTLIV*RRQQICLCWDFLYYPFLRGDASAGPVTWCTTS 180
             CFLSLASPCRNASE  ADILELSTLIV*RRQ ICLC DFLYYPFLRGDASAGPVTWCTTS
Sbjct:     4 CFLSLASPCRNASEHTADILELSTLIV*RRQ*ICLCLDFLYYPFLRGDASAGPVTWCTTS 63

Query:   181 DTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSK 297
             DTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSK
Sbjct:    64 DTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSK 102

>CYP4F-se4[6:7:8] CYP4F29P chr 21 AL109748 designated CYP4F3LP pseudogene Nature 405, 311-319 2000 chr21q11.2 14140950-14141905 + strand build 33
PSEYIATLFELSALIV*WHQQICLCMDFLYYPF 
PEG*CFCRACDLVHNF
DTIILEQPCTLTSQGVHDFLKTKATFKTWDFSDALVLGK 
DENGKELSDEDI*MEAGIFMST
GYDSTATGLY*ILCNLRRHPGVMPSCGARVLRDHNPEDIEW

>CYP4F-se5[6:8] 4F32P NT_033006.1|Hs2_33182 chromosome 2 4F pseudgene 88% to 4F25P
chr2q11.1 94893656-94893866 + strand build 33
chr 2 94909781-94910782 July 2003 freeze
      CFLSPGSPCRNPSEYIATILELSALLV*W
22206 HQQICLCTDFLYYLTPEGRCFCRAYDLVHNF 22114
22112 DTIILEQHLTLTCRGVDDFLKAKATFKILDFSDAFLLSK 21996

>CYP4F-se5[6:8] 4F32P ref|NT_022300.8|Hs2_22456 chromosome 2 88% to 4F25P
740635 HQQICLCTDFLYYLTPEGRCFCRAYDLVHNF 740727
740729 DTIILEQHLTLTCRGVDDFLKAKATFKILDFSDAFLLSK 740845

>CYP4F-se5[6:8] NT_026970.9|Hs2_27130 Homo sapiens chromosome 2 genomic contig
          Length = 2594449

Query: 1     PSEYIATLFELSALIV*WHQQICLCMDFLYYPFPEG*CFCRACDLVHNF 49
             PSEYIAT+ ELSAL+V*WHQQICLC DFLYY  PEG CFCRA DLVHNF
Sbjct: 99822 PSEYIATILELSALLV*WHQQICLCTDFLYYLTPEGRCFCRAYDLVHNF 99676

Query: 50    DTIILEQPCTLTSQGVHDFLKTKATFKTWDFSDALVLGK 88
             DTIILEQ  TLT +GV DFLK KATFK  DFSDA +L K
Sbjct: 99674 DTIILEQHLTLTCRGVDDFLKAKATFKILDFSDAFLLSK 99558

Query: 110   TGYDSTATGLY*ILCNLRRH 129
             +GYDSTA+GLY IL NLRRH
Sbjct: 98986 SGYDSTASGLYWILYNLRRH 98927

Query: 131   GVMPSCGARVLRDHNPEDIEW 151
             GV+P  GARVLRD N EDIEW
Sbjct: 98916 GVLPPRGARVLRDRNSEDIEW 98854

>CYP4F-se6[6] 4F33P NT_030040.6|Hs9_30295 chromosome 9 78% TO 4F26P
chr9p13.1 38532310-38532516 + strand build 33
6944743 HQQICLCTDFLYYLTPEGRCFCGPATWCSTLDTIILERHCTLTSQGVDVLKAKATFKTL 6944567
6944566 DFIDALVPSK 6944537

>CYP4F-se6[6] NT_008413.15|Hs9_8570 Homo sapiens chromosome 9 genomic contig
          Length = 39435727

Query: 2        SEYIATLFELSALIV*WHQQICLCMDFLYYPFPEG*CFCRACDLVHNFDTIILEQPCTLT 61
                S+Y+AT+ EL ALIV*WHQQICLC DFLYY  PEG CFC         DTIILE+ CTLT
Sbjct: 38532567 SDYVATILELHALIV*WHQQICLCTDFLYYLTPEGRCFCGPATWCSTLDTIILERHCTLT 38532388

Query: 62       SQGVHDFLKTKATFKTWDFSDALVLGK 88
                SQGV D LK KATFKT DF DALV  K
Sbjct: 38532387 SQGV-DVLKAKATFKTLDFIDALVPSK 38532310

>CYP4F-se7[6:7:8] 4F34P NT_009799.9|Hs13_9956 chromosome 13 4F pseudogene 78% to 4F27P
343270 HQQICLCMDFLYYXX 343232
343226 PDGDASAGPVTWCTTLDTIILEQHCTLTSQGVHDFLKAKATFKT*DFSDALMLGK 343062

>CYP4F-se7[6:7:8] NT_009799.12|Hs13_9956 Homo sapiens chromosome 13 genomic contig
          Length = 12392145 = 4F34P

Query: 1      PSEYIATLFELSALIV*WHQQICLCMDFLYYPF 33
              PSEYIATLFELSALIV*WHQQICLCMDFLYYPF
Sbjct: 296823 PSEYIATLFELSALIV*WHQQICLCMDFLYYPF 296725

Query: 32     PFPEG*CFCRACDLVHNFDTIILEQPCTLTSQGVHDFLKTKATFKTWDFSDALVLGK 88
              P P+G             DTIILEQ CTLTSQGVHDFLK KATFKT DFSDAL+LGK
Sbjct: 296731 PIPDGDASAGPVTWCTTLDTIILEQHCTLTSQGVHDFLKAKATFKT*DFSDALMLGK 296561

Query: 88     KDENGKELSDEDI*MEAGIFMST 110
              +DENGKELSDEDI*MEA IFMST
Sbjct: 296237 QDENGKELSDEDI*MEADIFMST 296169

Query: 111    GYDSTATGLY*ILCNLRRHPGVMPSCGARVLRDHNPEDIEW 151
              GYDSTATGLY*ILCNLRRHPGVMPSCGARVLRD NPEDIEW
Sbjct: 295991 GYDSTATGLY*ILCNLRRHPGVMPSCGARVLRDRNPEDIEW 295869

>CYP4F-se8[6:7:8] 4F35P NT_010859.8|Hs18_11016 chromosome 18 4F pseudogene 84% to 4F12
chr18p11.21 14328070-14328279 + strand build 33
623264 HQQICLCMDFLYYPFLRGDASAGPMTWCTTLDTIILEQHCTLTSQGVHDFLKAKATFKTW 623085
623084 DFSDALVLGK 623055

>CYP4F-se8[6:7:8] NT_010859.12|Hs18_11016 Homo sapiens chromosome 18 genomic contig
          Length = 14719514

Query: 1        PSEYIATLFELSALIV*WHQQICLCMDFLYYPFPEG*CFCRACDLVHNFDTIILEQPCTL 60
                PSEYIATLFELSALIV*WHQQICLCMDFLYYPF  G             DTIILEQ CTL
Sbjct: 14328016 PSEYIATLFELSALIV*WHQQICLCMDFLYYPFLRGDASAGPMTWCTTLDTIILEQHCTL 14328195

Query: 61       TSQGVHDFLKTKATFKTWDFSDALVLGK 88
                TSQGVHDFLK KATFKTWDFSDALVLGK
Sbjct: 14328196 TSQGVHDFLKAKATFKTWDFSDALVLGK 14328279

Query: 88       KDENGKELSDEDI*MEAGIFMST 110
                +DENGKELS+EDI*MEAGIFMST
Sbjct: 14328531 QDENGKELSEEDI*MEAGIFMST 14328599

Query: 111      GYDSTATGLY*ILCNLRRHPGVMPSCGARVLRDHNPEDIEW 151
                GY+STATGLY*ILCNLRRHPGVMPS GARVLRD NPEDIEW
Sbjct: 14328777 GYESTATGLY*ILCNLRRHPGVMPSRGARVLRDRNPEDIEW 14328899


>CYP4F-se9[6:7:8] CYP4F30P chr2q21.1 131102500-131102796 4F30P 2 DIFFS TO 27P 
1 aa diff to CYP4F-se11
Query:     1 CFLSLASPCRNASEHIADILELSTLIV*RRQ*ICLCWDFLYYPFLRGDASAGPVTWCTTS 180
             CFLSLASPCRNASEH ADILELSTLIV*RRQ*ICLC DFLYYPFLRGDASAGPVTWCTTS
Sbjct:     4 CFLSLASPCRNASEHTADILELSTLIV*RRQ*ICLCLDFLYYPFLRGDASAGPVTWCTTS 63

Query:   181 DTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSK 297
             DTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSK
Sbjct:    64 DTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSK 102

>CYP4F-se9[6:7:8] NT_005079.12|Hs2_5236 Homo sapiens chromosome 2 genomic contig
          Length = 4419942 = 4F30P
chr 2 131290124-131291091 July 2003 freeze
4259273 SELIADILELSTLIV*RRQQICLCWDFLYYPFLRGDASAGPVTWCTTSDTIILQQHRTLT 4259094
4259093 SQGVDDFLKAKATFKASDFIDALVLSK 4259013
4258690 QDENGKELSDEDI*MEAGIFMSAG 4258619
4258425 GYDSRASGLY*ILYNLTKHPDIGSAASL*CKSPNPEDIEW 4258306

>CYP4F-se10[6:7:8] CYP4F31P chr2q21.1 132067957-132068253 4F31P 2 DIFFS TO 27P IDENTICAL TO 4F30P
Query:     1 CFLSLASPCRNASEHIADILELSTLIV*RRQ*ICLCWDFLYYPFLRGDASAGPVTWCTTS 180
             CFLSLASPCRNASEH ADILELSTLIV*RRQ*ICLC DFLYYPFLRGDASAGPVTWCTTS
Sbjct:     4 CFLSLASPCRNASEHTADILELSTLIV*RRQ*ICLCLDFLYYPFLRGDASAGPVTWCTTS 63

Query:   181 DTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSK 297
             DTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSK
Sbjct:    64 DTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSK 102

>CYP4F-se10[6:7:8] NT_005058.13|Hs2_5215 Homo sapiens chromosome 2 genomic contig
          Length = 18291257 = 4F31P
chr 2 132256324-132257291 July 2003 freeze
654564 SEHIADILELSTLIV*RRQ*ICLCWDFLYYPFLRGDASAGPVTWCTTSDTIILQQHRTLT 654743
654744 SQGVDDFLKAKATFKASDFIDALVLSK 654824
655147 QDENGKELSDEDI*MEAGIFMSAG 655218
655412 GYDSRASGLY*ILYNLTKHPDIGSAASL*CKSPNPEDIEW 655531

>CYP4F-se11[6:7:8] CYP4F37P chr2q21.1 131461272 131461568 5 DIFFS TO 27P 1 DIFF TO 4F32P
CFLSLASPCRNASELIADILELSTLIV*RRQQICLCWDFLYYPFLRGDASAGSVTWCTTS
DTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSK

>CYP4F-se11[6:7:8] NT_005058.13|Hs2_5215 Homo sapiens chromosome 2 genomic contig
          Length = 18291257 = 4F43P
chr 2 131649639-131650606 July 2003 freeze
47879 SELIADILELSTLIV*RRQQICLCWDFLYYPFLRGDASAGSVTWCTTSDTIILQQHRTLT 48058
48059 SQGVDDFLKAKATFKASDFIDALVLSK 48139
48462 QDENGKELSDEDI*MEAGIFMSAG 48533
48727 GYDSRASGLY*ILYNLTKHPDIGSAASL*CKSPNPEDIEW 48846

Query:     1 CFLSLASPCRNASELIADILELSTLIV*RRQQICLCWDFLYYPFLRGDASAGSVTWCTTS 180
             CFLSLASPCRNASE  ADILELSTLIV*RRQ ICLC DFLYYPFLRGDASAG VTWCTTS
Sbjct:     4 CFLSLASPCRNASEHTADILELSTLIV*RRQ*ICLCLDFLYYPFLRGDASAGPVTWCTTS 63

Query:   181 DTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSK 297
             DTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSK
Sbjct:    64 DTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSK 102

CYP4F-se12[6:8] CYP4F38P chr8P11.1 43132635-43132941 + STRAND BUILD 33
LWGCFLSPASPCRNPSDYIATILELSALIM*RQQQIFLHRDFLYYVTAEGWCFCRACVVVHNF
DTIILEQHCTLTSQGVNNFLKVTATFKTLDFIDVLVLDK

>CYP4F-se12[6:8] NT_008251.13|Hs8_8408 Homo sapiens chromosome 8 genomic contig
          Length = 5847381 = 4F44P

Query: 1       PSEYIATLFELSALIV*WHQQICLCMDFLYYPFPEG*CFCRACDLVHNF 49
               PS+YIAT+ ELSALI+*  QQI L  DFLYY   EG CFCRAC +VHNF
Sbjct: 5439042 PSDYIATILELSALIM*RQQQIFLHRDFLYYVTAEGWCFCRACVVVHNF 5439188

Query: 50      DTIILEQPCTLTSQGVHDFLKTKATFKTWDFSDALVLGK 88
               DTIILEQ CTLTSQGV++FLK  ATFKT DF D LVL K
Sbjct: 5439190 DTIILEQHCTLTSQGVNNFLKVTATFKTLDFIDVLVLDK 5439306

Query: 111     GYDSTATGLY*IL 123
               GYDSTA+GL+ +L
Sbjct: 5439892 GYDSTASGLFWVL 5439930

Query: 131     GVMPSCGARVLRDHNPEDIEW 151
               GV+P+ GA VLRDH+PEDIEW
Sbjct: 5439959 GVLPALGAGVLRDHSPEDIEW 5440021

>AC018949 62% to 4F3 pseudogene
found in two places 100% match
CYP4F-se1[6:8]  chr9p11.2 43742181-43742487 (4F25P) + strand build 33
CYP4F-se13[6:8] chr9q13   63753376-63753682 (4F45P) + strand build 33
LWRCFLSPTSPCRNPSEYIATILELSALIV*QHQQICLCTDFLYYLTPEGRCFCRACDLVHNF 
DTIILERHCTLTSQGVDDFLNAKATFKIFDFSDAFVLSK 

>CYP4F-se13[6:8] NT_078070.1|Hs9_78139 Homo sapiens chromosome 9 genomic contig
          Length = 423701 = 4F45P CYP4F-se13[6:8]

Query: 1     PSEYIATLFELSALIV*WHQQICLCMDFLYYPFPEG*CFCRACDLVHNF 49
             PSEYIAT+ ELSALIV* HQQICLC DFLYY  PEG CFCRACDLVHNF
Sbjct: 34719 PSEYIATILELSALIV*QHQQICLCTDFLYYLTPEGRCFCRACDLVHNF 34865

Query: 50    DTIILEQPCTLTSQGVHDFLKTKATFKTWDFSDALVLGK 88
             DTIILE+ CTLTSQGV DFL  KATFK +DFSDA VL K
Sbjct: 34867 DTIILERHCTLTSQGVDDFLNAKATFKIFDFSDAFVLSK 34983

Query: 111   GYDSTATGLY*ILCNLRRHP 130
             GYDSTA+GLY IL  LRRHP
Sbjct: 35569 GYDSTASGLYWILVPLRRHP 35628

Query: 131   GVMPSCGARVLRDHNPEDIEW 151
             GV+P  GARVLRD N EDIEW
Sbjct: 35636 GVLPPRGARVLRDGNSEDIEW 35698

421     1   151   151  96.7%     2  +-  130899265 130900232 #3
409     1   151   151  95.4%     2  ++  132256324 132257291 #10
403     1   151   151  94.8%     2  +-  131290124 131291091 #9
397     1   151   151  94.1%     2  ++  131649639 131650606 #11
 99    33    87   151  80.0%     2  +-   94910485  94910649 #5

>CYP4V2 formerly CYP4AH1 AC012525 Homo sapiens chromosome 4
at the boundary of 4q35.1 and 4q35.2. 187697428-187716242 + strand build 33
MAGLWLGLVWQKLLLWGAASAVSLAGASLVLSLLQRVASYARKWQQMRPIPTVARAYPLVGHALLMKPDGR 
EFFQQIIEYTEEYRHMPLLKLWVGPVPMVALYNAENVEG 
ILTSSKQIDKSSMYKFLEPWLGLGLLT 
STGNKWRSRRKMLTPTFHFTILEDFLDIMNEQANILVKKLEKHINQEAFNCFFYITLCALDIIC 
ETAMGKNIGAQSNDDSEYVRAVYR 
MSEMIFRRIKMPWLWLDLWYLMFKEGWEHKKSLQILHTFTNSV 
IAERANEMNANEDCRGDGRGSAPSKNKRRAFLDLLLSVTDDEGNRLSHEDIREEVDTFMFE 
GHDTTAAAINWSLYLLGSNPEVQKKVDHELDDV 
KSDRPATVEDLKKLRYLECVIKETLRLFPSVPLFARSVSED
YFLTAGYRVLKGTEAVIIPYALHRDPRYFPNPEEFQPERFFPENAQG 
RHPYAYVPFSAGPRNCIG 
QKFAVMEEKTILSCILRHFWIESNQKREELGLEGQLILRPSNGIWIKLKRRNADER* 

>34. CYP4X1 R56515, R53456, AA652746, AC026935.2 contig 141552-178731
MEFSWLETRWARPFYLAFVFCLALGLLQAIKLYLRRQRLLRDLRPFPAPPTHWFLGHQK
FIQDDNMEKLEEIIEKYPRAFPFWIGPFQAFFCIYDPDYAKTLLSRT
DPKSQYLQKFSPPLLG
KGLAALDGPKWFQHRRLLTPGFHFNILKAYIEVMAHSVKMML
DKWEKICSTQDTSVEVYEHINSMSLDIIMKCAFSKETNCQTN
STHDPYAKAIFELSKIIFHRLYSLLYHSDIIFKLSPQGYRFQ
KLSRVLNQYTDTIIQERKKSLQAGVKQDNTPKRKYQDFLDIVLSAK
DESGSSFSDIDVHSEVSTFLLAGHDTLAASISWILYCLALNPEHQERCREEVRGILGDGSSITW
DQLGEMSYTTMCIKETCRLIPAVPSISRDLSKPLTFPDGCTLPA
143044 GITVVLSIWGLHHNPAVWKNPK 143109
143902 VFDPLRFSQENSDQRHPYAYLPFSAGSR 143985
144478 NCIGQEFAMIELKVTIALILLHFRVTPDPTRPLTFPNHFILKPKNGMYLHLKKL 144642

>CYP4Z1 AJ131016 AC026935 161971-176942 52% to 4A11 52% to 4X1
AC026935.2 contig 141552-178731 intact mRNA = AY262056
161971 MEPSWLQELMAHPFLLLILLCMSLLLFQVIRLYQRRRWMIRALHLFPAPPAHWFYGHKE (0) 162147
163102 FYPVKEFEVYHKLMEKYPCAVPLWVGPFTMFFSVHDPDYAKILLKRQ (1) 163242
175103 DPKSAVSHKILESWV (1) 175147
176814 GRGLVTLDGSKWKKHRQIVKPGFNISILKIFITMMSESVRMML (0) 176942
NKWEEHIAQNSRLELFQHVSLMTLDSIMKCAFSHQGSIQLD (1)
STLDSYLKAVFNLSKISNQRMNNFLHHNDLVFKFSSQGQIFSKFNQELHQFT (1)
EKVIQDRKESLKDKLKQDTTQKRRWDFLDILLSAK (0)
SENTKDFSEADLQAEVKTFMFAGHDTTSSAISWILYCLAKYPEHQQRCRDEIRELLGDGSSITW (2)
EHLSQMPYTTMCIKECLRLYAPVVNISRLLDKPITFPDGRSLPA (1)
GITVFINIWALHHNPYFWEDPQ (0)
VFNPLRFSRENSEKIHPYAFIPFSAGL (2)
RNCIGQHFAIIECKVAVALTLLRFKLAPDHSRPPQPVRQVVLKSKNGIHVFAKKVC*

>CYP4Z1 NT_032977.5 + strand Build 33 N-term in seq gap
   28 NKWEEHIAQNSRLELFQHVSLMTLDSIMKCAFSHQGSIQLD 156
  242 STLDSYLKAVFNLSKISNQRMNNFLHHNDLVFKFSSQGQIFSKFNQELHQFT 385
 9967 EKVIQDRKESLKDKLKQDTTQKRRWDFLDILLSAK 10080
14505 SENTKDFSEADLQAEVKTFMFAGHDTTSSAISWILYCLAKYPEHQQRCRDEIRELLGDGSSITW 14693
21537 EHLSQMPYTTMCIKECLRLYAPVVNISRLLDKPITFPDGRSLPA 21668
30936 GITVFINIWALHHNPYFWEDPQ 31001
32058 VFNPLRFSRENSEKIHPYAFIPFSAGL 32135
33169 RNCIGQHFAIIECKVAVALTLLRFKLAPDHSRPPQPVRQVVLKSKNGIHVFAKKVC* 33342

>CYP4Z2P NT_004386.11|Hs1_4543 92% to 4Z1 full length version on AL731892.5
652145 TKDFSEADLQAEVKTFMFAGHDTTTTAISWIFYCLAKYPEHQQRC*DEIRELLGDG 651978

>CYP4Z2P AL731892.5 94% to 4Z1 1 stop codon in exon 8
67693 MEPSWLQELMAHPFLLLILLCMSLLLFQVIRLYQRRRWTIRAMHLFPAPPAHWFYGHKEX 67517
66240 YPVKEFEVYPELMEKYPCAVPLWVGPFTMFFNIHDPDYVKILLKRQ 66103
54555 DPKSAVSHKILESWV 54511
52843 GRGLVTLDGSKWKKHRQIVKPGFNISILKIFITMMSKSVRMML 52715
50555 NKWEEHIAQNSRLELFQHVSLMTLDSIMKCAFSHQGSIQLDRS 50427
50341 SYLKAVFNLSKISNQRMNNFLHHNDLVFKFSSQGQIFSKFNQELHQFT 50198
40615 HLEKVIQDRKESLKDKLKQDTTQKRRQDFLDILLSAKV 40502
35393 ENTKDFSEADLQAEVKTFMFAGHDTTTTAISWIFYCLAKYPEHQQRC*DEIRELLGDGSSITW 35205
27034 EHLSQMPYATMCIKECLRLYAPVVNISQLLDKPITFPDGRSLPA 26903
12988 GITVFINIWALHHNPYFWENPQ 12923
11873 VFNPLRFSRESSEKMHPYAFIPFSAG 11796
10762 PRNCIGQHFAIIGCKVAVALTLLCFELAPDYSRPPQPVRQMLLKSKNGIHVFAKKV 10595

>CYP4Z2P NT_04852.13 - strand
7900865 MEPSWLQELMAHPFLLLILLCMSLLLFQVIRLYQRRRWTIRAMHLFPAPPAHWFYGHKE 7900689
7899415 SYPVKEFEVYPELMEKYPCAVPLWVGPFTMFFNIHDPDYVKILLKRQ 99275
7887727 DPKSAVSHKILESWV 7887683
7886015 GRGLVTLDGSKWKKHRQIVKPGFNISILKIFITMMSKSVRMML 7885887
7883727 NKWEEHIAQNSRLELFQHVSLMTLDSIMKCAFSHQGSIQLDRS 7883599
7883513 SYLKAVFNLSKISNQRMNNFLHHNDLVFKFSSQGQIFSKFNQELHQFT 7883370
7873785 HLEKVIQDRKESLKDKLKQDTTQKRRQDFLDILLSAK 7873675
7868568 SENTKDFSEADLQAEVKTFMFAGHDTTTTAISWIFYCLAKYPEHQQRC*DEIRELLGDGSSITW 7868377
7860217 EHLSQMPYATMCIKECLRLYAPVVNISRLLDKPITFPDGRSLPA 7860086
7846160 GITVFINIWALHHNPYFWENPQ 7846095
7845045 VFNPLRFSRESSEKMHPYAFIPFSAG 7844968
7843934 PRNCIGQHFAIIGCKVAVALTLLCFELAPDYSRPPQPVRQMLLKSKNGIHVFAK 7843773

>CYP5A1 NM_001061 this gene is 197000 bases long
MMEALGFLKLEVNGPMVTVALSVALLALLKWYSTSAFSRLEKLG
LRHPKPSPFIGNLTFFRQGFWESQMELRKLYGPLCGYYLGRRMFIVISEPDMIKQVLV
ENFSNFTNRMASGLEFKSVADSVLFLRDKRWEEVRGALMSAFSPEKLNEMVPLISQAC
DLLLAHLKRYAESGDAFDIQRCYCNYTTDVVASVPFGTPVDSWQAPEDPFVKHCKRFF
EFCIPRPILVLLLSFPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQAAEERR
RDFLQMVLDARHSASPMGVQDFDIVRDVFSSTGCKPNPSRQHQPSPMARPLTVDEIVG
QAFIFLIAGYEIITNTLSFATYLLATNPDCQEKLLREVDVFKEKHMAPEFCSLEEGLP
YLDMVIAETLRMYPPAFRFTREAAQDCEVLGQRIPAGAVLEMAVGALHHDPEHWPSPE
TFNPERFTAEARQQHRPFTYLPFGAGPRSCLGVRLGLLEVKLTLLHVLHKFRFQACPE
TQVPLQLESKSALGPKNGVYIKIVSR

>CYP7A1 NM_000780  AC009927.10 chromosome 8
MMTTSLIWGIAIAACCCLWLILGIRRR (2)
QTGEPPLENGLIPYLGCALQFGANPLEFLRANQRKHGHVFTCKLMGKYVHFITNPLSYHKVLCHGKYFDWKKFHFATSAK (0)
AFGHRSIDPMDGNTTENINDTFIKTLQGHALNSLTESMMENLQRIMRPPVSSN
SKTAAWVTEGMYSFCYRVMFEAGYLTIFGRDLTRRDTQKAHILNNLDNFKQFDKVFPA
LVAGLPIHMFRTAHNAREKLAESLRHENLQKRESISELISLRMFLNDTLSTFDDLEKA
KTHLVVLWASQANTIPATFWSLFQMIR (2)
NPEAMKAATEEVKRTLENAGQKVSLEGNPICLSQAELNDLPVL (1)
DSIIKESLRLSSASLNIRTAKEDFTLHLEDGSYNIRKDSIIALYPQLMHLDPEIYPDPL (0)
TFKYDRYLDENGKTKTTFYCNGLKLKYYYMPFGSGATICPGRLFA
IHEIKQFLILMLSYFELELIEGQAKCPPLDQSRAGLGILPPLNDIEFKYKFKHL*

>CYP7B1 NM_004820 AC104939.7 chr 8
MAGEVSAATGRFSLERLGLPGLALAAALLLLALCLLVRRTR (2)
RPGEPPLIKGWLPYLGVVLNLRKDPLRFMKTLQKQHGDTFTVLLG (1) 
GKYITFILDPFQYQLVIKNHKQLSFRVFSNKLLEKAFSISQLQKNHDMNDELHLCYQFLQGKSLDILLESMMQN
LKQVFEPQLLKTTSWDTAELYPFCSSIIFEITFTTIYGKVIVCDNNKFISELRDDFLK
FDDKFAYLVSNIPIELLGNVKSIREKIIKCFSSEKLAKMQGWSEVFQSRQDVLEKYYVHEDLEIG (1)
AHHLGFLWASVANTIPTMFWAMYYLLRHPEAMAAVRDEIDRLLQSTGQKKGSGFPIHLTREQLDSLICL (1)
ESSIFEALRLSSYSTTIRFVEEDLTLSSETGDYCVRKGDLVAIFPPVLHGDPEIFEAPE (0)
EFRYDRFIEDGKKKTTFFKRGKKLKCYLMPFGTGTSKCP
GRFFALMEIKQLLVILLTYFDLEIIDDKPIGLNYSRLLFGIQYPDSDVLFRYKVKS*

>CYP8A1 D83402 AL118525.17 chr 20
MAWAALLGLLVALLLLLLLSRRRTR (2) 
RPGEPPLDLGSIPWLGYALDFGKDAASFLTRMKEKHGDIFT (0)
ILVGGRYVTVLLDPHSYDAVVWEPRTRLDFHAYAIFLMERIFDVQLPHYSPSDEKARMKL (2) 
TLLHRELQALTEAMYTNLHAVLLGDATEAGSGWHEMGLLDFSYSFLLR (2) 
AGYLTLYGIEALPRTHESQAQDRVHSADVFHTFRQLDRLLPKLARGSLSV (1)
GDKDHMCSVKSRLWKLLSPARLARRAHRSKWLESYLLHLEEMGVSEEMQARALVLQLWATQ (0)
GNMGPAAFWLLLFLLKNPEALAAVRGELESILWQAEQPVSQTTTLPQKVLDSTPVL (1)
DSVLSESLRLTAAPFITREVVVDLAMPMADGREFNLRRGDRLLLFPFLSPQRDPEIYTDPE (0)
VFKYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNHCLGRSYAVNSIKQ (2) 
FVFLVLVHLDLELINADVEIPEFDLSRYGFGLMQPEHDVPVRYRIRP*

>CYP8B1 AF090318 AC010192
MVLWGPVLGALLVVIAGYLCLPGMLRQRRPWEPPLDKGT
VPWLGHAMAFRKNMFEFLKRMRTKHGDVFTVQLGGQYFTFVMDP
LSFGPILKDTQRKLDFGQYAKKLVLKVFGYRSVQGDHEMIHSASTKHLRGDGLKDLNE
TMLDSLSFVMLTSKGWSLDASCWHEDSLFRFCYYILFTAGYLSLFGYTKDKEQDLLQA
GELFMEFRKFDLLFPRFVYSLLWPREWLEVGRLQHLFHKMLSVSHSQEKEGISNWLGN
MLQFLREQGVPSAMQDKFNFMMLWASQGNTGPTSFWALLYLLKHPEAIRAVREEATQV
LGEARLETKQSFAFKLGALQHTPVLDSVVEETLRLRAAPTLLRLVHEDYTLKMSSGQE
YLFRHGDILALFPYLSVHMDPDIHPEPTVFKYDRFLNPNGSRKVDFFKTGKKIHHYTM
PWGSGVSICPGRFFALSEVKLFILLMVTHFDLELVDPDTPLPHVDPQRWGFGTMQPSH
DVRFRYRLHPTE

>CYP11A1 NM_000781
MLAKGLPPRSVLVKGYQTFLSAPREGLGRLRVPTGEGAGISTRS
PRPFNEIPSPGDNGWLNLYHFWRETGTHKVHLHHVQNFQKYGPIYREKLGNVESVYVI
DPEDVALLFKSEGPNPERFLIPPWVAYHQYYQRPIGVLLKKSAAWKKDRVALNQEVMA
PEATKNFLPLLDAVSRDFVSVLHRRIKKAGSGNYSGDISDDLFRFAFESITNVIFGER
QGMLEEVVNPEAQRFIDAIYQMFHTSVPMLNLPPDLFRLFRTKTWKDHVAAWDVIFSK
ADIYTQNFYWELRQKGSVHHDYRGMLYRLLGDSKMSFEDIKANVTEMLAGGVDTTSMT
LQWHLYEMARNLKVQDMLRAEVLAARHQAQGDMATMLQLVPLLKASIKETLRLHPISV
TLQRYLVNDLVLRDYMIPAKTLVQVAIYALGREPTFFFDPENFDPTRWLSKDKNITYF
RNLGFGWGVRQCLGRRIAELEMTIFLINMLENFRVEIQHLSDVGTTFNLILMPEKPIS
FTFWPFNQEATQQ

>CYP11B1 NM_000497
MALRAKAEVCMAVPWLSLQRAQALGTRAARVPRTVLPFEAMPRR
PGNRWLRLLQIWREQGYEDLHLEVHQTFQELGPIFRYDLGGAGMVCVMLPEDVEKLQQ
VDSLHPHRMSLEPWVAYRQHRGHKCGVFLLNGPEWRFNRLRLNPEVLSPNAVQRFLPM
VDAVARDFSQALKKKVLQNARGSLTLDVQPSIFHYTIEASNLALFGERLGLVGHSPSS
ASLNFLHALEVMFKSTVQLMFMPRSLSRWTSPKVWKEHFEAWDCIFQYGDNCIQKIYQ
ELAFSRPQQYTSIVAELLLNAELSPDAIKANSMELTAGSVDTTVFPLLMTLFELARNP
NVQQALRQESLAAAASISEHPQKATTELPLLRAALKETLRLYPVGLFLERVASSDLVL
QNYHIPAGTLVRVFLYSLGRNPALFPRPERYNPQRWLDIRGSGRNFYHVPFGFGMRQC
LGRRLAEAEMLLLLHHVLKHLQVETLTQEDIKMVYSFILRPSMCPLLTFRAIN

>CYP11B2 NM_000498
MALRAKAEVCVAAPWLSLQRARALGTRAARAPRTVLPFEAMPQH
PGNRWLRLLQIWREQGYEHLHLEMHQTFQELGPIFRYNLGGPRMVCVMLPEDVEKLQQ
VDSLHPCRMILEPWVAYRQHRGHKCGVFLLNGPEWRFNRLRLNPDVLSPKAVQRFLPM
VDAVARDFSQALKKKVLQNARGSLTLDVQPSIFHYTIEASNLALFGERLGLVGHSPSS
ASLNFLHALEVMFKSTVQLMFMPRSLSRWISPKVWKEHFEAWDCIFQYGDNCIQKIYQ
ELAFNRPQHYTGIVAELLLKAELSLEAIKANSMELTAGSVDTTAFPLLMTLFELARNP
DVQQILRQESLAAAASISEHPQKATTELPLLRAALKETLRLYPVGLFLERVVSSDLVL
QNYHIPAGTLVQVFLYSLGRNAALFPRPERYNPQRWLDIRGSGRNFHHVPFGFGMRQC
LGRRLAEAEMLLLLHHVLKHFLVETLTQEDIKMVYSFILRPGTSPLLTFRAIN

>CYP17 NM_000102
MWELVALLLLTLAYLFWPKRRCPGAKYPKSLLSLPLVGSLPFLP
RHGHMHNNFFKLQKKYGPIYSVRMGTKTTVIVGHHQLAKEVLIKKGKDFSGRPQMATL
DIASNNRKGIAFADSGAHWQLHRRLAMATFALFKDGDQKLEKIICQEISTLCDMLATH
NGQSIDISFPVFVAVTNVISLICFNTSYKNGDPELNVIQNYNEGIIDNLSKDSLVDLV
PWLKIFPNKTLEKLKSHVKIRNDLLNKILENYKEKFRSDSITNMLDTLMQAKMNSDNG
NAGPDQDSELLSDNHILTTIGDIFGAGVETTTSVVKWTLAFLLHNPQVKKKLYEEIDQ
NVGFSRTPTISDRNRLLLLEATIREVLRLRPVAPMLIPHKANVDSSIGEFAVDKGTEV
IINLWALHHNEKEWHQPDQFMPERFLNPAGTQLISPSVSYLPFGAGPRSCIGEILARQ
ELFLIMAWLLQRFDLEVPDDGQLPSLEGIPKVVFLIDSFKVKIKVRQAWREAQAEGST

>CYP19 NM_000103
MVLEMLNPIHYNITSIVPEAMPAATMPVLLLTGLFLLVWNYEGT
SSIPGPGYCMGIGPLISHGRFLWMGIGSACNYYNRVYGEFMRVWISGEETLIISKSSS
MFHIMKHNHYSSRFGSKLGLQCIGMHEKGIIFNNNPELWKTTRPFFMKALSGPGLVRM
VTVCAESLKTHLDRLEEVTNESGYVDVLTLLRRVMLDTSNTLFLRIPLDESAIVVKIQ
GYFDAWQALLIKPDIFFKISWLYKKYEKSVKDLKDAIEVLIAEKRCRISTEEKLEECM
DFATELILAEKRGDLTRENVNQCILEMLIAAPDTMSVSLFFMLFLIAKHPNVEEAIIK
EIQTVIGERDIKIDDIQKLKVMENFIYESMRYQPVVDLVMRKALEDDVIDGYPVKKGT
NIILNIGRMHRLEFFPKPNEFTLENFAKNVPYRYFQPFGFGPRGCAGKYIAMVMMKAI
LVTLLRRFHVKTLQGQCVESIQKIHDLSLHPDETKNMLEMIFTPRNSDRCLEH

>CYP20A1 AC011737.10 chr 2 (exons 1-11) AC080075.7 (exons 12, 13)
121155 MLDFAIFAVTFLLALVGAVLYLYP 121226
127938 ASRQAAGIPGITPTEEK 127988
128848 DGNLPDIVNSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTVDVLKQHINPNKTS 129015
134061 DPFETMLKSLLRYQSGGGSVSENHMRKKLYENGVTDSLKSNFALLLK 134201
148606 LSEELLDKWLSYPETQHVPLSQHMLGFAMKSVTQMVMGSTFEDDQEVIRFQKNHGT 148773
154762 VWSEIGKGFLDGSLDKNMTRKKQYED 154839
160664 ALMQLESVLRNIIKERKGRNFSQHIFIDSLVQGNLNDQQ 160780
162151 ILEDSMIFSLASCIITAK 162204
167703 LCTWAICFLTTSEEVQKKLYEEINQVFGNGPVTPEKIEQLR 167825
171858 YCQHVLCETVRTAKLTPVSAQLQDIEGKIDRFIIPRE 171968
174354 TLVLYALGVVLQDPNTWPSPHK 174419
  3858 FDPDRFDDELVMKTFSSLGFSGTQECPELR 3947
  4142 FAYMVTTVLLSVLVKRLHLLSVEGQVIETKYELVTSSREEAWITVSKRY* 4291

>CYP21A1P 97% to 21A2 NT_033167.1|Hs6_33343
329415 MLLLGLLLLLPLLAGARLLWNWWKLRSLHLLPLAPGFLHLLQPDLPIYLLGLTQKFGPIYRLHLGLQ 329215
329117 DVVVLNSKRTIEEAMVKKWADFAGRPEPLTCK 329022
       LVSKNYPDLSL
328702 XXWSLLWKAHKKLTRSALLLGIRDSMEPVVEQLTQEFCE 328592
       RMRAQPGTPVAIEEEFSLLTCSINCYLTFGDKIK 328383
328294 EDNLMPAYYKCIQEVLKTWSHWSIQIVDVIPFLR 328193
328091 FFPNPGLRRLKQAIEKRDHNEEKQLRQHK
327835 ESLVAGQWRDMMDYMLQGVAQPSMEEGSGQLLEGHLHMAAVDLLIGGTETTANTLSWAVV
327654 FLLHHPE 327634
       IQQRL*EELDHELGPGASSSRVPYKDRARLPLLNATIAEVLRLWPVV
327292 PLALPHRTTRPS 327257
       SISGYDIPEGTVIIPNLQGAHLDETVWERPHEFWP 327069
326971 DRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLPSGDALPSLQPLPH
326791 CSVILKMQPFQVRLQPRGMGAHSPGQNQ 326708

>CYP21A2 M26856
MLLLGLLLLLPLLAGARLLWNWWKLRSLHLPPLAPGFLHLLQPD
LPIYLLGLTQKFGPIYRLHLGLQDVVVLNSKRTIEEAMVKKWADFAGRPEPLTYKLVS
RNYPDLSLGDYSLLWKAHKKLTRSALLLGIRDSMEPVVEQLTQEFCERMRAQPGTPVA
IEEEFSLLTCSIICYLTFGDKIKDDNLMPAYYKCIQEVLKTWSHWSIQIVDVIPFLRF
FPNPGLRRLKQAIEKRDHIVEMQLRQHKESLVAGQWRDMMDYMLQGVAQPSMEEGSGQ
LLEGHVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGPGASSSR
VPYKDRARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGA
HLDETVWERPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAF
TLLPSGDALPSLQPLPHCSVILKMQPFQVRLQPRGMGAHSPGQNQ

>CYP24 NM_000782
MSSPISKSRSLAAFLQQLRSPRQPPRLVTSTAYTSPQPREVPVC
PLTAGGETQNAAALPGPTSWPLLASLLQILWKGGLKKQHDTLVEYHKKYGKIFRMKLG
SFESVHLGSPCLLEALYRTESVPQRLEIKPWKAYRDYRKEGYGLLILEGEDWQRVRSA
FQKKLMKPGEVMKLDNKINEVLADFMGRIDELCDERGHVEDLYSELNKWSFESICLVL
YEKRFGLLQKNAGDEAVNFIMAIKTMMSTFGRMMVTPVELHKSLNTKVWQGHTLAWDT
IFKSVKACIDNRLEKYSQQPSADFLCDIYHQNRLSKKELYAAVTELQLAAVETTANSL
MWILYNLSRNPQVQQKLLKEIQSVLPENQRPREEDLRNMPYLKACLKESMRLTPGVPF
TTRTLDKATVLGEYALPKGTVLMLNTQVLGSSEDNFEDSSQFRPERWLQEKEKINPFA
HLPFGVGKRMCIGRRLAELQLHLALCWIVRKYDIQATDNEPVEMLHSGTLVPSRELPI
AFCQR

>CYP26A1 NM_000783 
MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPL
PPGTMGFPFFGETLQMVLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL
GDDRLVSVHWPASVRTILGSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEV
GSSLEQWLSCGERGLLVYPEVKRLMFRIAMRILLGCEPQLAGDGDSEQQLVEAFEEMT
RNLFSLPIDVPFSGLYRGMKARNLIHARIEQNIRAKICGLRASEAGQGCKDALQLLIE
HSWERGERLDMQALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREELKSKG
LLCKSNQDNKLDMEILEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELNGYQIPKGW
NVIYSICDTHDVAEIFTNKEEFNPDRFMLPHPEDASRFSFIPFGGGLRSCVGKEFAKI
LLKIFTVELARHCDWQLLNGPPTMKTSPTVYPVDNLPARFTHFHGEI

>CYP26B1 AC007002
MLFEGLDLVSALATLAACLVSVTLLLAVSQQLWQLRWAATRDKSCKLPIPKGSMGFPLIGETGHWLLQ
GSGFQSSRREKYGNVFKTHLLGRPLIRVTGAENVRKILMGEHHLVSTEWPRSTRMLLGPNTVSNS
IGDIHRNKRKVFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQ
KLTFRMAIRVLLGFSIPEEDLGHLFEVYQQFVDNVFSLPVDLPFSGYRR
GIQARQILQKGLEKAIREKLQCTQGKDYLDALDLLIESSKEHGKEMTMQELKDGTLELIF
AAYATTASASTSLIMQLLKHPTVLEKLRDELRAHGILHSGGCPCEGTLRLDTLSGLRYLD
CVIKEVMRLFTPISGGYRTVLQTFELDGFQIPKGWSVMYSIRDTHDTAPVFKDVNVFDP
DRFSQARSEDKDGRFHYLPFGGGVRTCLGKHLAKLFLKVLAVELASTSRFELATRTFPRI
TLVPVLHPVDGLSVKFFGLDSNQNEILPETEAMLSATV

>CYP26C1 AL358613.16 522 amino acids, 6 exons 
10896 MFPWGLSCLSVLGAAGTALLCAGLLLSLAQHLWTLRWMLSRDRASTLPLPKGSMGWPFFGETLHWLVQ (0) 11099
11642 GSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRLVRSQWPQSAHILLGSHTLLGAVGEPHRRRRK (0) 11866
12352 VLARVFSRAALERYVPRLQGALRHEVRSWCAAGGPVSVYDASKALTFRMAARILLGLRL
      DEAQCATLARTFEQLVENLFSLPLDVPFSGLRK (0) 12627
14013 GIRARDQLHRHLEGAISEKLHEDKAAEPGDALDLIIHSARELGHEPSMQELK (0) 14168
15588 ESAVELLFAAFFTTASASTSLVLLLLQHPAAIAKIREELVAQGLGRACGCAPGAAGGSEGPPPD
      CGCEPDLSLAALGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD (0) 15917
17952 GYQIPKGWSVMYSIRDTHETAAVYRSPPEGFDPERFGAAREDSRGASSRLHYIPFGGGARSCLG
      QELAQAVLQLLAVELVRTARWELATPAFPAMQTVPIVHPVDGLRLFFHPLTPSVAGNGLCL* 18329

>CYP27A1 NM_000784
MAALGCARLRWALRGAGRGLCPHGARAKAAIPAALPSDKATGAP
GAGPGVRRRQRSLEEIPRLGQLRFFFQLFVQGYALQLHQLQVLYKAKYGPMWMSYLGP
QMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRDQHDLTYGPFTTEGHHWYQLRQA
LNQRLLKPAEAALYTDAFNEVIDDFMTRLDQLRAESASGNQVSDMAQLFYYFALEAIC
YILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPVLPFWKRYLDGWN
AIFSFGKKLIDEKLEDMEAQLQAAGPDGIQVSGYLHFLLASGQLSPREAMGSLPELLM
AGVDTTSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAHMPLLKAVLKE
TLRLYPVVPTNSRIIEKEIEVDGFLFPKNTQFVFCHYVVSRDPTAFSEPESFQPHRWL
RNSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLARLIQKYKVVLAPETGEL
KSVARIVLVPNKKVGLQFLQRQC

>CYP27B1 NM_000785
MTQTLKYASRVFHRVRWAPELGASLGYREYHSARRSLADIPGPS
TPSFLAELFCKGGLSRLHELQVQGAAHFGPVWLASFGTVRTVYVAAPALVEELLRQEG
PRPERCSFSPWTEHRRCRQRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAGTLNN
VVCDLVRRLRRQRGRGTGPPALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDT
ETFIRAVGSVFVSTLLTMAMPHWLRHLVPGPWGRLCRDWDQMFAFAQRHVE RREAEAA
MRNGGQPEKDLESGAHLTHFLFREELPAQSILGNVTELLLAGVD TVSNTLSWALYELS
RHPEVQTALHSEITAALSPGSSAYPSATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPD
KDIHVGDYIIPKNTLVTLCHYATSRDPAQFPEPNSFRPARWLGEGPTPHPFASLPFGF
GKRSCMGRRLAELELQMALAQ 
ILTHFEVQPEPGAAPVRPKTRTVLVPERSINLQFLDR

>CYP27C1 AC027142 43% identical to 27A1 assembled gene
MQTSAMALLARILRAGLRPAPERGGLLGGGAPRRPQPAGARLPAGARAEDKGAGRPGSPPG 
GGRAEGPRSLAAMPGPRTLANLAEFFCRDGFSRIHEIQ 
QKHTREYGKIFKSHFGPQFVVSIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISA 
EGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLFFKYSME 
GVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREFC 
RSWDGLFKFS 
QIHVDNKLRDIQYQMDRGRRVSGGLLTYLFLSQALTLQEIYANVTEMLLAGVDT 
TSFTLSWTVYLLARHPEVQQTVYREIVKNLGERHVPTAADVPKVPLVRALLKETLR 
LFPVLPGNGRVTQEDLVIGGYLIPKG 
TQLALCHYATSYQDENFPRAKEFRPERWLRKGDLDRVDNFGSIPFGHGVRSCIGRRIAELEIHLVVIQ 
LLQHFEIKTSSQTNAVHAKTHGLLTPGGPIHVRFVNRK*

>CYP39A1 AC008104 AL035670 note heme region exon corrected 1/18/02
MELISPTVIIILGCLALFLLLQRKNLRRPPCIKGWIPWIGVGFEFGKAPLEFIEKARIK
YGPIFTVFAMGNRMTFVTEEEGINVFLKSKKVDFELAVQNIVYRT
ASIPKNVFLALHEKLYIMLKGKMGTVNLHQFTGQLTEELHEQLENLGTHGTMDLNNLVR
HLLYPVTVNMLFNKSLFSTNKKKIKEFHQYFQVYDEDFEYGSQLPECLLR 
NWSKSKKWFLELFEKNIPDIKACKSAKDNSM 
TLLQATLDIVETETSKENSPNYGLLLLWASLSNAVP
VAFWTLAYVLSHPDIHKAIMEGISSVFGKAG
KDKIKVSEDDLENLLLIKWCVLETIRLKAPGVITRKVVKPVEIL
NYIIPSGDLLMLSPFWLHRNPKYFPEPELFKPERW
KKANLEKHSFLDCFMAFGSGKFQCPARW
FALLEVQMCIILILYKYDCSLLDPLPKQ
SYLHLVGVPQPEGQCRIEYKQRI

>CYP46 NM_006668
MSPGLLLLGSAVLLAFGLCCTFVHRARSRYEHIPGPPRPS 
FLLGHLPCFWKKDEVGGRVLQDVFLDW 
AKKYGPVVRVNVFHKTSVIVTSPESVK 
KFLMSTKYNKDSKMYRALQTVFGER 
LFGQGLVSECNYERWHKQRRVIDLAFSRSSLVSLMETFNEKAEQLVEILEAKADGQTPVSMQDMLTYTAMDILAK 
AAFGMETSMLLGAQKPLSQAVKLMLEGITASRNTLAK 
FLPGKRKQLREVRESIRFLRQVGRDWVQRRREALKRGEEVPADILTQILK 
AEEGAQDDEGLLDNFVTFFIA 
GHETSANHLAFTVMELSRQPEIVAR 
LQAEVDEVIGSKRYLDFEDLGRLQYLSQ 
VLKESLRLYPPAWGTFRLLEEETLIDGVRVPGNTPLL 
FSTYVMGRMDTYFEDPLTFNPDRFGPGAPK 
PRFTYFPFSLGHRSCIGQQFAQ 
MEVKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPAPPPPPC

>CYP46A-se1[12:13:14] 46A4P NT_004424.11|Hs1_4581 chromosome 1 CYP46 pseudogene fragment
chr1p33 47913558-47913839 + strand build 33 surrounded by LINE L1 and L2 repeats
2405597 DPLTFNPYRFGPGAPKPRFTYFPFSLGHHSCIGQQFAQMEVKVVMAKLLQRLEFQLVPGP 2405776
2405777 RFGLQ*QATLKPLDPELCTLRPRGWQPAAPPPRC 2405878

>CYP51 NM_000786
MAAAAGMLLLGLLQAGGSVLGQAMEKVTGGNLLSMLLIACAFTL
SLVYLIRLAAGHLVQLPAGVKSPPYIFSPIPFLGHAIAFGKSPIEFLENAYEKYGPVF
SFTMVGKTFTYLLGSDAAALLFNSKNEDLNAEDVYSRLTTPVFGKGVAYDVPNPVFLE
QKKMLKSGLNIAHFKQHVSIIEKETKEYFESWGESGEKNVFEALSELIILTASHCLHG
KEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWLPLPSFRRRDRAHREIKDIFYKAIQK
RRQSQEKIDDILQTLLDATYKDGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLA
RDKTLQKKCYLEQKTVCGENLPPLTYDQLKDLNLLDRCIKETLRLRPPIMIMMRMART
PQTVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQDNPASGEKFAYVPFGA
GRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPENPVIRYKRRS
K

>CYP51P1 processed pseudogene U36926 5 in frame stops
chr3p12.2 82661285-82662808 + strand build 33 
MAAAAGMMLLGLLQAGG*VLGQAMEEVAGGNLLSMLLIACAFTLSLVYLFRLAAGHLVQL
TAGAKSPPYIFSPVPFLGHAIAFGKSPTEFLENAYGNYGPVFSFIMVGKAFTYLLGSDAA
ALLFNSKNEVLNAEDVYSRLTTPVFG*GVAYDVPNPVFLEQKKTLKSGLNIAHFK*HVSI
IEKETKEYFESWGESGEKNVFAALSELIILTASHYLHGKEIRSQHNEKVAQLYADLDGGF
SHAAWLLPGWLPLPCFRRRDRAHQEIKDIFYKAIQKRRQSQEKIDDILQTLLDATYKDGR
PLTDDEVAGMLTGLLLAEQHTSSTSA*MGFFLARDKTLQEKCYLEQKTVCGENLPPLTY 
DQLKDLNLLDRCIKETLRLRHPVMIMMRMARIPKTVAGYTIPPGHQVCVSPTVNQRLKDS
WVEHLDFNPDRYL*DNPASREKFAYVPFGAGHHGCTGENFAYVQIKTIWSTMLRLYEFDL
IDGYFPTVNYTTMIHTPENPVIHYK*RSK 

>CYP51P2 processed pseudogene U40053 
chr13q12.3 28226522-28228176 + strand build 33
MAAAAGMMLLGLLQAG
GSVLGQAMEEVTGGNLLSMLLIACTFTLSLVYLFRLAAGHLVQLPAGAKSPPYVFSPVP 
FPGHAIAFGKSPVEFLE 
NAYEKYGPVFSFTMVGKTFTYL 
LGSDAAALLFNSKNEDQNAEDVYSHLTTPVFGKGVAYDVPNPVFLEQKKMLKSGLNKAHF
KQHVSL 
EKETKEYFQSWGESGEKNVFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFS
HAAWLLPGWLPLPSFRCRDRAHWEIKDIFYKAIQKRRQSQEKIDDILQTLLDATYKDGRP
LTDDEVAGMLIGLLLAGQHSSSTTSAWMDFFLARDKTLQEKCYLEQKTVCGENLPPLTYD
QLKGLNLLDRCIKETLRLRPPIMIMMRMARTPQTVVGYTIPPGHQVCVSPTVNQRPKDSW
VERLDFNPDCYLQDNPASGEKFAYVPFGAGCHR*IGENFAYVQIKTIWSTMLRLYEFDLI
DGYFPIVNYTTMIHTPENPLIHYKRRSK 

>CYP51P3 NT_025741.8|Hs6_25897 chromosome 6 CYP51P3
chr6q24.3 148734956-148737020 + strand build 33
20447678 MLGLVQMSRSALGQLVEWVAGESDSLLSMLLISCAFILSLVCFATIVTTWPSCQLVQNAH 20447857
20447858 HMFSLPLHSLGMPYIWEKLN*ISRKCI*EVWAYEKYGPVCSFSVVSKTFT 20448007
20447859 ICFLSHYIPWACHTFGKS*IEFLESAYEKYGHMRSMDLYVVFLW*ARHLLLERDETAL 20448032
20448347 LFNSKTKDAEDVYSHLRTPAFGKGVECDMPNPAFLGQEKMLKSSLKVAHFRQQVSI 20448514
20448515 TEKERNTFKLGRKQRKKLCEALSELII 20448595
         FDS*PWFTWKGNQKSTQ*EVAQLCADVSGGFKPLA 20448698
20448699 WLRPRWLPLRGCRSSYKAH*EHNYCL*GNPETQKPEEKIQGILQTLLDTTDKGEHLLAD 20448875
20448876 EEVTGLLIRLFSGQHTSSTTGA*MSCFVARDETLHEKCYLKQKTVRGEDLPSLTYDLLK 20449052
20449426 VIGRTIPAGHQMGLSLTVNQGFQDTWVELVDPDQ*LQDISTGEK 20449557
20449566 GGGRHHIGKNFAHVQIKTVWSTLFHLYEFDFIDGYFPTVNYKTVVHHTPKNPVITYK*R 20449742