This is a collection of mouse P450s.  There are 88 mouse P450 sequences known.  
1 of the 88 is confidential and the sequence 2c51 is not shown here.  There is also 1 sequence known in humans that has not been found 
yet in mice, but it probably will be found (27C1). In this 
case the human sequence is given.  It is marked by a run of ********
4Z1 is not found yet in mice, so that might be one more.
There are only 4 pseudogenes 2c52p, 2c53p, 4a28p, 4f38p.  That leaves 85 known
Full length mouse P450s and possibly 27c1 and 4z1 for 87.
Last modified July 16, 2002

>1. Cyp1a1        Mm.14089 genbank X01681
MPSMYGLPAFVSATELLLAVTVFCLGFWVVRATRTWVPKGLKTP
PGPWGLPFIGHMLTVGKNPHLSLTRLSQQYGDVLQIRIGSTPVVVLSGLNTIKQALVR
QGDDFKGRPDLYSFTLITNGKSMTFNPDSGPVWAARRRLAQNALKSFSIASDPTSASS
CYLEEHVSKEANYLVSKLQKVMAEVGHFDPYKYLVVSVANVICAICFGQRYDHDDQEL
LSIVNLSNEFGEVTGSGYPADFIPVLRYLPNSSLDAFKDLNDKFYSFMKKLIKEHYRT
FEKGHIRDITDSLIEHCQDRKLDENANVQLSDDKVITIVLDLFGAGFDTVTTAISWSL
MYLVTNPRVQRKIQEELDTVIGRDRQPRLSDRPQLPYLEAFILETFRHSSFVPFTIPH
STTRDTSLNGFYIPKGCCVFVNQWQVNHDRELWGDPNEFRPERFLTPSGTLDKRLSEK
VTLFGLGKRKCIGETIGRSEVFLFLAILLQQIEFKVSPGEKVDMTPTYGLTLKHARCE
HFQVQMRSSGPQHLQA

>2. Cyp1a2        Mm.15537 genbank X00479
MAFSQYISLAPELLLATAIFCLVFWMVRASRTQVPKGLKNPPGP
WGLPFIGHMLTVGKNPHLSLTRLSQQYGDVLQIRIGSTPVVVLSGLNTIKQALVRQGD
DFKGRPDLYSFTLITNGKSMTFNPDSGPVWAARRRLAQDALKSFSIASDPTSASSCYL
EEHVSKEANHLVSKLQKAMAEVGHFEPVSQVVESVANVIGAMCFGKNFPRKSEEMLNI
VNNSKDFVENVTSGNAVDFFPVLRYLPNPALKRFKTFNDNFVLFLQKTVQEHYQDFNK
NSIQDITSALFKHSENYKDNGGLIPEEKIVNIVNDIFGAGFDTVTTAITWSILLLVTW
PNVQRKIHEELDTVVGRDRQPRLSDRPQLPYLEAFILEIYRYTSFVPFTIPHSTTRDT
SLNGFHIPKERCIYINQWQVNHDEKQWKDPFVFRPERFLTNNNSAIDKTQSEKVMLFG
LGKRRCIGEIPAKWEVFLFLAILLQHLEFSVPPGVKVDLTPNYGLTMKPGTCEHVQAW
PRFSK

>3. Cyp1b1        Mm.4443  genbank U03283
MATSLSADSPQQLSSLSTQQTTLLLLFSVLAAVHLGQWLLRQWQ
RKPWSSPPGPFPWPLIGNAAAVGQASHLYFARLARRYGDVFQIRLGSCPVVVLNGESA
IHQALVQQGSIFADRPPFASFRVVSGGRSLAFGHYSEHWKTQRRSAYSTMRAFSTRHP
RSRGLLEGHALAEARELVAVLVRRCAGGAFLDPTQPVIVAVANVMSAVCFGCRYNHDD
AEFLELLSHNEEFGRTVGAGSLVDVLPWLQLFPNPVRTTFRKFEQLNRNFSNFVLDKF
LRHRESLVPGAAPRDMTDAFILSAEKKASGAPGDDSSGLDLEDVPATITDIFGASQDT
LSTALLWLLILFTRYPDVQARVQAELDQVVGRDRLPCMSDQPNLPYVMAFLYESMRFS
SFLPVTIPHATTANTFVLGYYIPKNTVVFVNQWSVNHDPAKWPNPEDFDPARFLDKDG
FINKALASSVMIFSVGKRRCIGEELSKMLLFLFISILAHQCNFKANQNESSNMSFSYG
LTIKPKSFRIHVSLRESMELLDNAVKKLQTEEGCK

>4. Cyp2a4        Mm.14781 genbank J03549
MLTSGLLLVAAVAFLSVLVLMSVWKQRKLSGKLPPGPTPLPFVG
NFLQLNTEQMYNSLMKISQRYGPVFTIYLGSRRIVVLCGQEAVKEALVDQAEEFSGRG
EQATFDWLFKGYGIAFSSGERAKQLRSFSIATLRDFGVGKRGIEERIQEEAGFLIDSF
RKTNGAFIDPTFYLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSLQFTATSMGQ
VYEMFSSVMKHLPGPQQQAFKELQGLEDFITKKVEHNQRTLDPNSPRDFIDSFLIRML
EEKKNPNTEFYMKNLVLTTQNLFFAGTETGSTTLRYGFLLLMKYPDIEAKVHEEIDRV
IGRNRQPKYEDRMKMPYTEAVIHEIQRFADLIPMGLARRVTKDTKFRDFLLPKGTEVF
PMLGSVLKDPKFFSNPKDFNPKHFLDDKGQFKKSDAFVPFSIGKRYCFGEGLARMELF
LFLTNIMQNFHFKSTQAPQDIDVSPRLVGFVTIPPTYTMSFLSR

>5. Cyp2a5        Mm.4112 (retired) genbank X89864
MLTSGLLLVAAVAFLSVLVLMSVWKQRKLSGKLPPGPTPLPFIG
NFLQLNTEQMYNSLMKISQRYGPVFTIYLGPRRIVVLCGQEAVKEALVDQAEEFSGRG
EQATFDWLFKGYGVVFSSGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIDSF
RKTNGAFIDPTFYLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSMGQ
LYEMFSSVMKHLPGPQQQAFKELQGLEDFITKKVEHNQRTLDPNSPRDFIDSFLIRML
EEKKNPNTEFYMKNLVLTTLNLFFAGTETVSTTLRYGFLLLMKHPDIEAKVHEEIDRV
IGRNRQPKYEDRMKMPYTEAVIHEIQRFADMIPMGLARRVTKDTKFRDFLLPKGTEVF
PMLGSVLKDPKFFSNPKDFNPKHFLDDKGQFKKNDAFVPFSIGKRYCFGEGLARMELF
LFLTNIMQNFHFKSTQAPQDIDVSPRLVGFATIPPTYTMSFLSR

>6. Cyp2a12       Mm.20770 genbank L06463
MLGSGLLLLAILAFLSVMVLVSVWQQKIRGKLPPGPIPLPFIGNYLQLNRKDVYSSITQLQEHYGPVFTIHLGPRRVVVLYGYDAV
KEALVDHAEEFSGRGEQATFNTLFKGYGVAFSNGERAKQLRRFSIATLRDFGMGKRGVEEHIQEEAGCLIKMLQGTCGAPIDPTIY
LSKTASNVISSIVFGDRFNYEDKEFLSLLQMMGQVNKFAASPTGQLYDMFHSVMKYLPGPQQQIIKDSHKLEDFMIQKVKQNQSTL
DPNSPRDFIDSFLIHMQKEKYVNSEFHMKNLVMTSLNLFFAGSETVSSTLRYGFLLLMKHPDVEAKVHEEIDRVIGRNRQPQYEDH
MKMPYTQAVINEIQRFSNFAPLGIPRRITKNTSFRGFFLPKATEVFPILGSLMTDPKFFSSPKDFNPQHFLDDKGQLKKIPAFLPF
STGKRFCLGDSLAKMELFFLFTTILQNFRFKFPRKLEDINESPTPEGFTRIIPKYTMSFVPI*

>7. Cyp2b9        Mm.876   genbank M60273 AH000038
MDPSVLLLLAVLLSLFLLLVRGHAKIHGHLPPGPHPLPLLGNLLQMDRGGLLKCFIQ
LQEKHGDVFTVHLGPRPVVVLCGTQTIREALVDHAEAFSGRGTIAAAQLVMQDY
GIFFASGQRWKTLRRFSLATMKEFGMGKRSVEERIKEEAQCLVEELKKYQ
GVPLDPTFLFQCITANIICSIVFGERFDYTDDQFLHLLNLMYKIFSLLSSFSGQ
MFELFSGFLKYFPGVHRQIVKKQQELLDYIAHSVEKHKATLDPSAPRDYIDTYLLRMEK
EKSNHNTEFHHQNLMMSVLSLFFAGTETTSATLHYGVLLMLKYPHVT
EKVQKEIDQVIGSHRLPTLDDRTKMPYTDAVIHEIQRFSDLVPIGLPHKVIKDTLFRGYLLPKNTEVYPVLSSALHDPQYFEQPDK
FNPEHFLDANGALKKCEAFLPFST
GKRICLGESIARNELFIFFTTILQNFSVASPVAPKDIDLTPKESGIGKIPPAHQIYFLAR*

>8. Cyp2b10       Mm.14177 genbank M21856
MEPSVLLLLALLVGFLLLLARGHPKSRGNFPPGPRPLPLLGNLLQMDRGGLLKSLIQ
LREKYGDVFTVHLGPRPVVMLCGTDTIREALVGQAEAFSGRGTVAVVEPTFKEY
GVIFANGERWKTLRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKS
QGAPLDPTFLFQCITANVICSIVFGERFEYTDRQFLRLLELFYQTFSLISSFSSQ
MFELFSGFLKYFPGAHRQISKNLQELLDYIGHSVERHKATLDPSVPRDFIDIYLLRMEK
EKSNQNAEFHHQNLMMSVLSLFFVGTETSSTTLHYGFLLMLKYPHVTEKVQKEIDQVIGS
HRLPTLDDRTKMPYSDAVIHEIQRFSDLIPIGVPHRVTKDTLFRGYLLPKNTEVYPIL
SSALHDPQYFEQPDSFNPDQFLDANGALKKSEAFLPFSTGQIFDQKSVGKRICLGESI
ARSELFLFFTSILQNFSVASHVAPKDIDLTPKESGIGKIPPTYQICFLAR

>9. Cyp2b13       Mm.14413 genbank M60358 AH000037
MDPSVLLLLAVLLSLFLLLVRGHAKIHGHLPPGPRPLPLLGNLLQMDRGGLLKCFIHLQEKHGDVFTVHLGPRPVVVLCGTQTIRE
ALVNNAEAFSGRGTIAAAQLVMQDYGIFFSSGERWKTLRRFSLATMKEFGMGKRSVEERIKEEAQCLVEELKKYQGAPLDPTFFFQ
CVTANIICSIVFGERFDYTDEQFLRLLNLMYQIYSLLRSFSCQMFELFSGLLKYFPGVHRQIAKNQQEILNFITHRVEKHRASLDP
SEPRDFIDTYLLRMEKEKSNHNTEFHHQNLMMSVLSLFFAGTETTSTTLCCGFLLMLMYPHVAGKVQKEIDQVIGSHRLPTLDDRT
KMPYTDAVIHEIQRFSDIVPTGAPHRVTKDTMFRGYLLPKNTEVYPILSSALHDPQYFEQPDSFNPDHFLDANGALKKSEAFLPFS
TGKRICLGESIARNELFLFFTSILQNFSVASPVASKDIDLTPKESGIGKIPPTYQICFLAR*

>10. Cyp2b19       Mm.14098 genbank AF047529
MEFSVLLLLALTTGFLIFLVSQSQPKTHGHFPPGPRPLPFLGNL
LQMDRRGLLSSFIQLQEKYGDVFTVHLGPRPVVMLCGTDTIREALVNQAEAFSGRGTV
AVLDPIVQGYGVIFSSGERWKTLRRFSLATMRDFGMGKRSVEERIKEEAQCLVEELKK
YKGAPLNPTFYFQCIVANIICSIVFGERFDYKDHQFLHLLNLIYQTFSLMSSLSSQVF
ELFSAILKYFPGAHRQISKNLQEILDYIGHSVEKHRATLDPSAPRDFIDTYLLRMEKE
KSNHHTEFHHQNLVISVLSLFFAGTETTSTTLRYSFLIMLKYPHVAEKVQKEIDQVIG
SHRLPTLDDRTKMPYTDAVIHEIQRFTDLAPIGLPHKVTKDTLFRGYLIPKNTEVYPI
LSSALHDPRYFEQPDSFNPEHFLDANGALKTNEAFMPFSTGKRICLGEGIARNELFLF
FTTILQNFSLASPVAPENIDLIPNNSGATKTPPQYQIHFLSR

>11. Cyp2b20       no UNIGENE entry genbank X99715 no ESTs
              fusion of 2b10 and P450 reductase
MEPSVLLLLALLVGFLLLLARGHPKSRGNFPPGPRPLPLLGNL
LQMDRGGLLKSFIQLREKYGDVFTVHLGPRPVVMLCGTDTIREALVGQAEAFSGRGTV
AVVEPTFKEYGVIFANGERWKTLRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRK
SQGAPLDPTFLFQCITANIICSIVFGERFEYTDRQFLRLLELFYQTFSLISSFSSQMF
ELFSGFLKYFPGAHRQISKNLQELLDYIGHSVEKHRATLDPSVPRDFIDIYLLRMEKE
KSNQHTEFHHQNLMMSVLSLFFAGTETSSTTLRYGFLLMLKYPHVAEKVQKEIDQVIG
SHRLPTLDDRTKMPYTDAVIHEIQRFSDLIPIGVPHRVTKDTMFRGYLLPKNTEVYPI
LSSALHDPQYFEQPDSFNPDHFLDANGALKKSEAFLPFSTGKRICLGESIARPWMKPM
KGATPVPGPTMITGVVGPEFE

>12. Cyp2c29       Mm.20764 genbank D17674
MDLVVFLALTLSCLILLSLWRQSSGRGKLPPGPTPLPIIGNFLQIDVKNISQSFTNFSKAYGPVFTLYLGSKPTVILHGYEAVKEA
LIDRGEEFAGRGSFPMAEKMIKGFGVVFSNGNRWKEMRRFTLMTLRNLGMGKRNIEDRVQEEAQCLVEELRKTKGSPCDPTFILSC
APCNVICSIIFQNRFDYKDKEFLILMDKINENVKILSSPWLQVCNSFPSLIDYCPGSHHKIVKNFNYLKSYLLEKIKEHKESLDVT
NPRDFIDYYLIKQKQVNHIEQSEFSLENLASTINDLFGAGTETTSTTLRYALLLLLKYPDVTAKVQEEIDRVVGRHRSPCMQDRSH
MPYTDAMIHEVQRFIDLLPTSLPHAVTCDIKFRKYLIPKGTTVITSLSSVLHDSKEFPNPEMFDPGHFLNGNGNFKKSDYFMPFST
GKRICAGEGLARMELFLILTTILQNFKLKSLVHPKEIDITPVMNGFASLPPPYQLCFIPL*

>13. Cyp2c37       Mm.28533 genbank AF047542 ESTs AI255551
              AI255553, AI315987, AI119584
MDPILVLVLTLSCLFLLSLWRQSSERGKLPPGPTPLPIIGNILQIDVKDICQSFTNLSKVYGPVYTLYLGRKPTVVLHGYEAVKEA
LVDHGEEFAGRGRFPVFDKATNGMGLAFSKGNVWKNTRRFSLMTLRNLGMGKRSIEDRVQEEARCLVEELRKTNGSPCDPTFILGC
APCNVICSIIFQDRFDYKDRDFLNLMEKLNEITKILSSPWLQICNTYPALLDYCPGSHKQFFKNYACIKNFLLEKIKEHEESLDVT
IPRDFIDYFLINGGQEDGNQPLQNRLEHLAITVTDLFSAGTETTSTTLRYALLLLLKYPHVTAKVQKEIEHVIGKHRSPCMQDRSR
MPYTDAMIHEVQRFIDLIPNSLPHEVTCDIKFRNYFIPKGTTVITSLSSVLHDSTEFPNPEVFDPGHFLDGNGKFKKSDYFMPFST
GKRICAGEGLARMELFLFLTSILQSFNLKPLVHPKDIDVTPMLIGLGSVPPAFQLCFIPS*

>14. Cyp2c38       no UNIGENE entry genbank AF047725 no ESTs
DLVTFLVLTLSSLILLSLWRQRSRRGRLPPGPTPFPIIGNFLQIDVKNFNQSLTNFSKAYGPVFTLYLGSRPIVVLHGYEAVKEAL
IDHGEEFSGRGNIPMSEKINNGLGITFSNGNSWKETRHFTLMTLRNLGMGKRNIEDRVREEAQCLVEELRKTKGSPCDPTFILSCA
PCNVICSIIFQDRFHYKDKDFLMLMKKLNENVKILSSPWLQVCNNFPLLIDYCPGSHHKVLKNFKYIRSYLLEKVKEHQASLDFTN
PRDFIDYFLIKQKQANHIEQAEYSLENLVCTINNLFAAGTETTSTTLRYALLLLMKYPDVTAKVQEEIDHVVGRHRSPCMQDRSHM
PYTDAMIHEVQRFINLVPNNLPHAVTCDIKFRNYIIPKGTTVVTSLTSVLHDSKEFPNPEMFDPGHFLDANGNFKKSDYFMTFSAG
KRVCAGEGLARMELFLILTTILQNFKLKSLVHPKDIDMMPFVNGLIALPPHYQLCFIPL*

>15. Cyp2c39       no UNIGENE entry genbank AF047726 EST AA882179
DLVTFLVLTLSSLILLSLWRQSCGRGSLPPGPTPFPIIGNFLQIDMKNFSQSLTNFSKAYGPVFTLYLGSRPTVVLHGYEAVKEAL
IDHGEEFSDRGSIPMVEKINNGLGIVFSNGNRWKEIRRFTLTTLRNLGMGKRNIEDRVQEEAQCLVEELRKTKGSPCDPTFILSCA
PCNVICSIIFQDRFDYKDKDFLMLMEKLNENVKILSSPWLQVCNNFPLLIDYCPGSHHKVLKNVKYIRSYLLEKIKEHQESLDVTN
PRDFIDYYLIKQKQANHIQQAEFSLENLACTINNLFAAGTETTSTTLRYALLLLMKYPDVTAKVQEEIDHVIGRHRSPCMQDRNHM
PYTDAMIHEVQRFINLVPNNIPRAVTCDIKFRNYLIPKGTTVVTSLTSVLHDSKEFPNPELFDPGHFLDANGNFKKSDHFMPFSAG
KRVCAGEGLARMELFLFLTTILQNFKLKPLVHPKDIDMIPFVNGLIALPPHYQVCIIPR*

>16. Cyp2c40       Mm.29973 genbank AF047727 86 ESTs follow AI255979
MDPFVVLVLCLSFLLVLSLWRQRSARGNLPPGPTPLPIIGNYHLIDMKDIGQCLTNFSKTYGPVFTLYFGSQPIVVLHGYEAIKEA
LIDHGEEFSGRGRIPVFDKVSTGKGIGFSHGNVWKATRVFTVNTLRNLGMGKRTIENKVQEEAQWLMKELKKTNGSPCDPQFIIGC
APCNVICSIVFQNRFDYKDKDFLSLIGKVNECTEILSSPGCQIFNAVPILIDYCPGSHNKLFKNHTWIKSYLLGKIKEHEESLDVT
NPRDFIDYFLIQRRQKNGIEHMDYTIEHLATLVTDLVFGGTETLSSTMRFALLLLMKHTHITAKVQEEIDNVIGRHRSPCMQDRNH
MPYTNAMVHEVQRYIDLGPNGVVHEVTCDTKFRNYFIPKGTQVMTSLTSVLHDSTEFPNPEVFDPGHFLDDNGNFKKSDYFMPFSA
GKRICVGESLARMELFLFLTTILQNFKLKPLVDPKDIDMTPKHSGFSKIPPNFQMCFIPVE*

>17. Cyp2c44       Mm.26457 23 ESTs follow AI317668
MELLGLPTLALLVLVMSLSLLSVWTKMRTGGRLPPGPTPLPIIGNILQLDLKDIPASLSK
LAKEYGPVYTLYFGSWPTVVLHGYDVVKEALLNQGDEFLGRGPLPIIEDSQKGHGIVFSE
GERWKLLRRFSLMTLKNFGMGKRSLEERVQEEARCLVEELHKTEAQPFDPTFILACAPCN
VICSILFNERFPYNDKTFLNLMDLLNKNFYQLNSIWIQMYNLWPTIMKYIPGKHREFSKR
LGGVKNFILEKVKEHQEFLDPANPRDYIDCFLSKIEEEKHSLKSDFNLENLAICGSNLFT
AGTETTSTTLRFGLLLLVKHPEVQAKVHEELDRVIGRHQPPSM
KDKMKLPYTDAVLHEIQRYITLLPSSLPHAVVQDTKFRHYVIPKGTAVFPFLSSILLDQK
EFPNPEKFDPGHFLDKNGCFKKTDYFVPFSLGKRSCVGEGLARMELFLFFTTILQKFSLK
ALVEPKDLDIKPVTTGLFNLPPPYKLRLVPR

>18. Cyp2c50 BC011222.1 94% 2c37; 75% 2c39,2c29v2; 74% 2c38; 68% 2c40; 53% 2c44
The whole sequence can be assembled from ESTs
ESTs AI097740 ue39c07.y1 aa 1-253 (3 aa diffs)
AI117011 ue30b06.y1 aa 1-232 (2 aa diffs), AI119501 uf04f05.y1 1-214 (2 aa diffs)
AI529832 ui83a03.y1 95%, AI314482 uj47e03.y1 98%, BF385641 1-186 1 aa diff
AI528254 ui95c04.y1 98%, AA968308 uc71h08.y1 99%, AI876138 uj53h03.y1 99%
AI097678 ue35a08.y1 99%, AI226027 uj08d09.y1, BF384486, BF659471 maa30e07.y1
AI529923 ui87b03.y1, AW012429 uo04g09.y1 aa 311-484 2 aa diffs
AI266900 uj08d09.x1 337-490 1 frameshift, AI226462 uj06g04.y1 1-157 deletion 289-376
AI118193 ue34e02.x1 180-292 1 aa diff
GSS AZ589908 aa 430-477 
MDPILVLVFTLSCLFLLSLWRQSSERGKLPPGPTPLPIIGNILQ
INVKDICQSFTNLSKVYGPVYTLYLGNKPTVVLHGYEAVKEALVDHGEEFAGRGRLPV
FDKATNGMGIIFSKGNVWKNTRRFSLTTLRNLGMGKRSIEDRVQEEARCLVEELRKTN
GSPCDPTFILGCAPCNVICSIIFQDRFDYKDRDFLNLMEKLNEITKIMSTPWLQVCNT
FPVLLDYCPGSHNKVFKNYACIKNFLLEKIKEHEESLDVTIPRDFIDYFLINGGQEDG
NYPLKNRLEHLAITVTDLFSAGTETTSTTLRYALLLLLKYPHVTAKVQEEIEHVIGKH
RSPCMQDRSRMPYTDAMIHEVQRFIDLVPNSLPHEVTCDIKFRNYFIPKGTNVITSLS
SVLRDSKEFPNPEKFDPGHFLDENGKFKKSDYFMPFSTGKRICAGEGLARMELFLFLT
SILQNFNLKPLVHPKDIDVTPMLIGLASVPPAFQLCFIPS

>19. cyp2c51 confidential 69% 2c29v2; 69% 3c37; partial seq

>20. Cyp2c52p sequence shown is from Ensembl mouse version 3
19.38000001-39000000 chromosome 19 add 38 million to numbers for global location
Cyp2c52p one stop codon two frameshifts and a deletion
XM_140720 missing first 70 aa, insertion of 33 aa before WXXR motif,
missing 11 aa after WXXR motif, missing 14 aa before EXXR motif 
628318 MDPVLVLVLTLSCLLLLS*WRQNSGRGKLPPGPTPLPIIGNILQIDVKNTGQSVGK 628367
630645 FSKVYGPVFTLYFGMKPSVVLHGYEAVKEALVDLGEGFSGRGSFPVAEKASKGL 630806
630954 GIIFSNGMKWKEIRRFSVMT 631013  frameshift                        
631012 LRNFGMGKRSVEDRVQEEARCLVEELRNGK 631101                        
636385 XAPCDPTFILGCAPCNVICSIIFQKRFDYKDQTFLNLMDKFNENFRILSTPWIQ 636425
639913 VCNTFPAIIDYFPGSHNQVLKNFSYIKKNYVLEKVKKHQESLDMENPRDFIDCFLIKMKQ 639972
710041 EKHSLQSEFTHESLVATVTDMFGAGTETTSNTLRYGLLLLLKHVDIT 710181     
713060 AKVQEEIERVVGRHRSPCVQDRSHM 713134  4 aa deletion and f.s.         
713136 AVVHETQRYIVLIPTNLPHSVTCDAKFRNYFIPK 713237                        
715864 GTTVITSLTSMLHDDKEFPNPEKFDPGYFLDERGNVKKSDYFVPFSA 716004       
717828 GKRMCAGEGLTGMELFLFFTIILQNFNLKPLVDVKDIDTTPVVSGFGHVPPLYQARFIPV* 718010

>21. cyp2c53p AC078913.6 seq b assembled from parts 76% to 2c39 lowercase is from 2c39 (other seq on this clone = 2c29)
MDLITFLVLTLSSLILLLLWRQRSGRGRLPPGPTPFPIIGNFLQIDGKNFSQSLTN
FSKVYGPVFTLYLGMKPTVVLHGYEAVKEALIDHGEEFAVKGIFPLAEKNSKGLG
VVFSNGNRWKEMRRFTLMTLKNLGMGKRNIEDRVQEEAQCLVEELRKTNG
SLCDPTFILGCAPCNVTCSIIFQNHFDYKDQDFLSLMEKINENTKIVSTPWIQVK*NSF
62460 PVLIVYCPGSHKTVPENAYYIEIYILKKIKEHQESLDVTNPLDFIDYYLIKCKQ 62299
anhiqqaefslenlactinnlf 153332 GAGTETSTTLRYALLLLMTYPEVT 153403
154074 TAKVQAEIDCVIGRHRSPCMQDRHSMPYTDAVLHEIQRYIDLLPTSLPHAVTRDVKFREYLIPK 154265
gttvvtsltsvlhdskefpnpelfdpghfldangnfkksdhfm
91023 FFSGKCICVRDNLAQMKLFLFLTTIL*NFNLKSVDPQELD 90904
mipfvnglialpphyqvciipr

>22. Cyp2c54     mouse 92% to 2c50 from mouse Ensembl version 3
19.39000001-40000000 chr 19 add 39 million to these numbers for global location
777086 MDPILVLVLTLSCLFLLSLWRQSYERGKLPPGPTPLPIIGNILQIDVKDICQSFTN 777039
775804 LSRVYGPVYTLYLGRKPTVVLHGYEAVKEALVDHGDVFAGRGRLPVFDKATNGM 775643   
775480 GIGFSNGSVWKNTRHFSLMTLRNLGMGKRSIEDRVQEEARCLVEELRKTN 775331      
774882 GSPCDPTFILGCAPCNVICSIIFQDRFDYKDRDFLNLLEKLDEISKILSTPWLQ 774721
773617 VCNTFPALLDYCPGSHNQFFKNYAYIKNFLLEKIREHKESLDVTIPRDFIDYFLIKGAQ 773441 
751132 EDDNHPLKNNFEHLAITVTDLFIGGTESMSTTLRYALLLLLKYPHVT 750992   
749751 AKVQEEIEHVIGKHRRPCMQDRSHMPYTNAMIHEVQRFIDLVPNNLPHEVTCDIKFRNYFIPK 749563 
743820 GTTVITSLSSVLRDSKEFPNPEKFDPGHFLDENGKFKKSDYFMPFST 743680         
741906 GKRICAGEGLARMELFLFLTSILQNFNLKPLVHPKDIDITPMLIGLGSVPPAFQLCFIPS 741727 

>23. Cyp2c55     mouse 19.38000001-39000000 from mouse Ensembl version 3
chr 19 add 38 million to numbers to get global position
721844 MDPVLVLVLTLSCLLLLSLWRQNSGRGKLPPGPTPFPIIGNILQIDIKNISKSFNY 722011
725818 FSKVYGPVFTLYFGSKPTVVVHGYEAVKEALDDLGEEFSGRGSFQIFERINNDL 725979
726487 GVIFSNGTKWKELRRFSIMTLRSFGMGKRSIEDRIQEEASCLVEELRKAN 726636
733440 GSLCDPTFILSCAPSNVICSVIFHNRFDYKDEKFLNLMERLNENFKILNSPWMQ 733601
746116 VYNALPTLINYLPGSHNKVIKNFTEIKSYILGRVKEHQETLDMDNPRDFIDCFLIKMEQ 746292
749093 EKHNPHSEFTIESLMATVTDIFVAGTETTNITLRYGLLLLLKHTEVT 749233
750298 AKVQAEIDHVIGRHRSPCMQDRTRMPYTDAMVHEIQRYIDLIPNNVPHAATCNVRFRSYFIPK 750367
753216 GTELVTSLTSVLHDDKEFPNPEVFDPGHFLDENGNFKKSDYFMPFSI 753356 
757132 GKRMCVGEALARTELFLILTTILQNFNLKSLVDTKDIDTTPVANTFGRVPPSYQLYFIPR 757311

>24. Cyp2d9    Mm.3164  genbank gene ca M23998 J04471 M24262 (846bp) M24267 (3367bp)
MELLTGTDLWPVAIFTVIFILLVDLTHQRQRWTSRYPPGPVPWP
VLGNLLQVDLGNMPYSLYKLQNRYGDVFSLQMAWKPMVVINGLKAMKEMLLTCGEDTA
DRPPVPIFEYLGVKPGSQGVVLAPYGPEWREQRRFSVSTLRNFGLGKKSLEDWVTKEA
NHLCDAFTAQAGQPINPNPMLNKSTCNVIASLIFARRFEYEDPFLIRMLKVLEQSLTE
VSGLIPEVLNAFPILLRIPRLADKALQGQKSFIAILDNLLTENRTTWDPVQAPRNLTD
AFLAQIEKAKGNPESSFNDENLLMVVRDLFGAGMLTTSTTLSWALMLMILHPDVQRRV
QQEIDEVIGQVRHPEMADQAHMPYTNAVIHEVQRFGDIVPVNLPRITSHDIEVQDFLI
PKGTILLPNMSSMLKDESVWEKPLRFHPEHFLDAQGHFVKPEAFMPFSAGRRSCLGEA
LARMELFLFFTCLLQRFSFSVPDGQPQPSNSGVYGILVAPSPYQLCAVVRDQGH

>25. Cyp2d10 Mm.27803 genbank gene cb M24263
MELLTGAGLWSVAIFTVIFILLVDLMHRHQRWTSRYPPGPVPWP
VLGNLLQVDLDNMPYSLYKLQNRYGDVFSLQMGWKPMVVINGLKAMKEVLLTCGEDTA
DRPQVPIFEYLGVKPGSQGVVLAPYGPEWREQRRFSVSTLRNFGLGKKSLEDWVTKEA
RHLCDAFTAQAGQPINPNTMLNNAVCNVIASLIFARRFEYEDPYLIRMQKVLEDSLTE
ISLAYSEVLNMFPILLRIPGLPGKVFQGQKSLLAIVENLLTENRNTWDPDQPPRNLTD
AFLAEIEKVKGNAESSFNDENLRMVVLDLFTAGMVTTSTTLSWALLLMILHPDVQRRV
QQEIDAVIGQVRHPEMADQARMPYTNAVIHEVQRFGDIAPLNLPRITSRDIEVQDFLI
PKGSILIPNMSSVLKDETVWEKPLRFHPEHFLDAQGHFVKPEAFMPFSAGRRSCLGEP
LARMELFLFFTCLLQHFSFSVPNGQPRPRNLGVFPFPVAPYPYQLCAVMREQGH

>26. cyp2d11 M24264 gene cc
MELLTGAGLWSVAIFTVIFILLVDLMHRHQHWTSRCPPGPVPWP
VLGNLLQVDLDNMPYSLYKLQNRYGDVFSLQMAWKPMVVINGLKAMKEMLLTCGEDTA
DRPPVPIFEYLGVKPGSQGVVLAPYGPEWREQRRFSVSTLRNFGLGKKSLEEWVTKEA
RHLCDAFTAQAGQPINPNPMLNKSTCNVIASLIFARRFEYEDPFLIRMLKMLKECFTE
ISGFIPGVLNEFPIFLRIPGLADMVFQGQKSFMAILDNLLTENRTTWDPDQPPRNLTD
AFLAEIEKAKGNAESSFNDENLRMVVLDLFTAGMVTTSTTLSWALLLMILHPDVQRRV
QQEIDAVIGQVQHPEMADQARMPYTNAVIHEVQRFGTLLHCLCHASQVVTFTQVQDFL
VTKGSTLIPNLSSVLKGETVWEKPLRFHPEHFLDAQGHFVKPEAFMPFSAGHRSCLGE
ALARMELFLFFTCLLQRFSISVPDGQPQPSNYRVHAIPVAPFPYQLCAVMHEQGH

>27. Cyp2d12 no UNIGENE entry EST AA114047 gene cd 
AC118710.1 GSS AZ417526.1
MELLTGTDLWSVAIFTVIFILLVDLMYRRQSWTSCYPPGPVPWPVLGNLLQVDLNNMPYSL
YKLQNRYGDVFSLQMAWKPMVVINRMKAMKEVLLTCGEDTADRPPVPIFEHLGFKPRSQGMIFAPYGPEWREQ
RRFSLSSLRNFGLGRKSLEEWVIKEAGHLCDAFTTQAGQYINPNTMLKKATCNVIASLIFARRFEYED
PYLIRMLKVLEDSLTELSGLIPEVINTFPILLHIPRLAD 
KFLQSQKSFIAIVDNLLTENRTTWDPAQSPRNLTDAFLAEIEKAKGNPESSFND
ENLRMVVIDLFTAGILTTSTTLSWALLLMILHPDVQRRVQQEIDEVIGQVRHPEMADQAHMPYTNAVIHEVQRFGDIVPLHLPRIT
SRDIEVQDFLIPKGTILLPNMSSVHMDDTVWEKPLRFHPEHFLDAQGHFVKHEAFITFSAGR 
RSCLGEPLARMELFLFFTCLLQRFSFSVPDGQPQPSDHRVFFIMVAPSPYQLCAVIREQGH*

>28. Cyp2d13 AC087902.4 17 April 2001 no UNIGENE entry  EST BF533324 gene ce 
MELLTGTGLWPVAIFTVIFILLVDLMHRRQRWTSRYPPGPVPWPVLGNLLQVDLDNMPYSLYKL
QNRYGDVFSLQMAWKPVVVISGLKAVREVLVTCGEDTADRPEMPIFQHLGYGEKAKGVVF
APYGPEWRELRRFSVSTLRNLGLGKKSLEQWVTEEAGHLCDAFTAQAGSPLDPYTLLNKAVCNV
IASLIYARRFEYGDPDFIKMLKILKENMGENTGLFPE (0)
VLNTFPILLHIPGLAD KVFPGQKTFLTLVNKLVTEHKRTWDPDQPPRDLTDAFLAEMEK (0)
AKGNPKSSFNE ANLRLVVFDLFGAGIVTSSITLTWALLLMILHPDVQRRVQEEIDEVIGQVRCPEMADQAHMPYTNAVIH
EVQRFADIVPMNLPHKTSHDIEVQGFLIPK
GTTLIPNLSSALKDETVWEKPLRFHPEHFL
DAQGHFVKPEAFMPFSAGRRACLGEPLARMELFLFFTCLLQRFSFLVPAGQPQPSDYGIF
TFLVSPSPYQLCAFTRDQATN

>29. Cyp2d22 AF221525 NM_019823
MRLPTGAELWPIAIFTVIFLILVNLMHWRQRWTAHYPPGPMPWP
VLGNLLHMDFQNMPAGFQKLRGRYGDLFSLQLASESVVVLNGLTALREALVKHSEDTA
DRPPLHFNDLLGFGPRSQGIVLARYGPAWRQQRRFSVSTMHHFGLGKKSLEQWVTEEA
RCLCAAFADHTGHPFSPNTLLDKAVCNVIASLLYACRFEYDDPRFIRLLGLLKETLKE
EAGFLPMFLNVFPMLLRIPGLVGKVFPGKRAFVTMLDELLAEHKTTWDPTQPPRDLTD
AFLAEVEKAKGNPESSFNDENLRTVVGDLFSAGMVTTSTTLSWALMLMILHPDVQRRV
QQEIDEVIGQVQCPEMADQARMPYTNAVIHEVQRFADILPLGVPHKTSRDIELQGFLI
PKGTTLITNLSSALKDETVWEKPLCFHPEHFLDAQGHFVKPEAFMPFSAGRRSCLGEP
LARMELFLFFTCLLQRFSISVPDGQPQPSDHGVFRALTTPCPYQLCALPR

>30. Cyp2d26       Mm.29064 68ESTs follow AA982292
MGLLVGDDLWAVVIFTAIFLLLVDLVHRRQRWTACYPPGPVPFPGLGNLLQVDFENIPYS
FYKLQNRYGNVFSLQMAWKPVVVVNGLKAVRELLVTYGEDTSDRPLMPIYNHIGYGHKSK
GVILAPYGPEWREQRRFSVSTLRDFGLSKKSLEQWVTEEAGHLCDAFTKEAEH 
PFNPSPLLSKAVSNVIASLIYARRFEYEDPFFNR
MLKTLKESLGEDTGFVGEVLNAIPMLLHIPGLPDKAFPKLNSFIALVNKMLIEHDL
TWDPAQPPRDLTDAFLAEVEKAKGNPESSFNDKNLRIVVIDLFMAGMVTTSTTLSWALLL
MILHPDVQRRVHQEIDEVIGHVRHPEMADQARMPYTNAVIHEVQRFADIVPTNLPHMTSR
DIKFQDFFIPKGTTLIPNLSSVLKDETVWEKPLRFYPEHFLDAQGHFVKHEAFMPFSAGR
RSCLGEPLARMELFLFFTCLLQRFSFSVPDGQPRPSDYGIYTMPVTPEPYQLCAVAR

>31. Cyp2e1        Mm.13020 genbank L11650
MAVLGITVALLVWIATLLLVSIWKQIYRSWNLPPGPFPIPFFGN
IFQLDLKDIPKSLTKLAKRFGPVFTLHLGQRRIVVLHGYKAVKEVLLNHKNEFSGRGD
IPVFQEYKNKGIIFNNGPTWKDVRRFSLSILRDWGMGKQGNEARIQREAHFLVEELKK
TKGQPFDPTFLIGCAPCNVIADILFNKRFDYDDKKCLELMSLFNENFYLLSTPWIQAY
NYFSDYLQYLPGSHRKVMKNVSEIRQYTLGKAKEHLKSLDINCPRDVTDCLLIEMEKE
KHSQEPMYTMENISVTLADLFFAGTETTSTTLRYGLLILMKYPEIEEKLHEEIDRVIG
PSRAPAVRDRMNMPYMDAVVHEIQRFINLVPSNLPHEATRDTVFRGYVIPKGTVVIPT
LDSLLFDNYEFPDPETFKPEHFLNENGKFKYSDYFKAFSAGKRVCVGEGLARMELFLL
LSAILQHFNLKSLVDPKDIDLSPVTIGFGSIPREFKLCVIPRS

>32. Cyp2f2        Mm.4515  genbank M77497
MDGVSTAILLLLLAVISLSLTFSSRGKGQLPPGPKPLPILGNLL
QLRSQDLLTSLTKLSKEYGSVFTVYLGSRPVIVLSGYQTVKEALVDKGEEFSGRGAYP
VFFNFTRGNGIAFSDGERWKILRRFSVQILRNFGMGKRSIEERILEEGSFLLEVLRKM
EGKPFDPVFILSRSVSNIICSVVFGSRFDYDDERLLTIXHFINDNFKIMSSPWGEMYN
IFPSVLDWIPGPHKRLFRNFGGMKDLIARSVREHQDSLDPNSPRDFIDCFLTKMAQEK
QDPLSHFNMDTLLMTTHNLLFGGTETVGTTLRHAFLILMKYPKVQARVQEEIDRVVGR
SRMPTLEDRTSMPYTDAVIHEVQRFADVIPMNLPHRVTRDTPFRGFLIPKGTDVITLL
NTVHYDSDQFKTPQEFNPEHFLDDNHSFKKSPAFMPFSAGRRLCLGEPLARMELFIYF
TSILQNFTLQPLVDPEDIDLTPLSSGLGNLPRPFQLCMHIR

>33. Cyp2g1 NM_013809 no UNIGENE entry
MMLGGAFSIFMALCLSCLLILIAWKRTSKGGKLPPGPTPIPFLG
NFLQVRTDATFQSFQKLQKKYGSVFTVYFGPRPVVVLCGHEAVKEALVDQADDFSGRG
EMPTLEKNFQGYGLALSNGERWKILRRFSLTVLRNFGMGKRSIEERIQEEAGYLLEEL
HKVKGAPIDPTLYLSRTVSNVICSVVFGKRFDYQDQRFQSLMRMINESFVEMSKPWAQ
LYDMYWKVMQYFPGRHNYLYNLIEDLKDFIASRVKINEASFDPSNPRDFIDCFLIKMH
QDKSDPHTEFNLKNLVLTTLNLFFAGTETVSSTLRYGFLLLLKYPEVEAKIHEEINQV
IGTHRTPRVDDRAKMPYTDAVIHEIQRLTDIVPLGVPHNVTRDTHFRGYLLPKGTDVY
PLFGSVLKDPKYFRYPDAFYPQHFLDEQGRFKKNDAFVVFSSGKRICVGEALARMELF
LYFTSILQRFSLRSLVPPADIDIAHKISGFGNIPPVYELCFMAR

>34. Cyp2j5        Mm.12838 genbank U62294
MIMFLSSLVTTFWEALHLKTLVLAVVTFLFLINILRSRHPKNYP
PGPWRLPFVGNFFQIDTKQTHLVLQQFVKKYGNVFSLELGQSPVVVVSGLPLIKEMFT
HLDQNFVNRFMTPVRERITGKNGLVVSNGQTWKEQRRLALMALRNFGLGKKSLEERIQ
EETHHLVEAIREEGGQPFNPHLKLINAVSNIICSVTFGERFDYEDCQFQELLQLLDET
MHLMGSSAGQLYNGFPCIMKYLPGPHQKIFRNWGKLKLFVSHIVKKHEKDWNPDEPRD
FIDAFLIEMQKDPDRTTSFNEENLISTTLDLFLGGTETTSSTLRWALLYMSSYPEIQE
NVQAEIDRVIGHKRQVSLSDRESMPYTNAVIHEVQRMGNIVPLNSSREVTVDTKFNGF
HLPKGTMILTNLTALHRDPKEWATPEVFNPEHFLENGQFKKRESFLPFSMGKRACLGE
QLAKSELFIFFSALMQKFTFKPPINEKLSLKFRMGLILSPASYRICAIPRV

>35. Cyp2j6        Mm.6477  genbank U62295
MLAATGSLLATIWAALHPRTLLVAAVTFLLLADYFKNRRPKNYP
PGPWGLPFVGNIFQLDFGQPHLSIQPLVKKYGNIFSLNLGDITSVVITGLPLIKEALT
QMEQNIMNRPLSVMQERISNKNGLIFSSGQIWKEQRRFALMTLRNFGLGKKSLEERMQ
EEASHLVEAIREEEGKPFNPHFSINNAVSNIICSVTFGERFDYHDSRFQEMLRLLDEV
MYLETTMISQLYNIFPWIMKYIPGSHQKVFRNWEKLKLFVSCMIDDHRKDWNPDEPRD
FIDAFLKEMTKYPEKTTSFNEENLICSTLDLFFAGTETTSTTLRWALLYMALYPEVQE
KVQAEIDRVIGQKRAARLADRESMPYTNAVIHEVRRMGNIIPLNVPREVAMDTNLNGF
HLPKGTMVLTNLTALHRDPKEWATPDVFNPEHFLENGQFKKRESFLPFSMGKRACLGE
QLARSELFIFFTSLMQKFTFNPPINEKLSPKFRNGLTLSPVSHRICAVPRQ

>36. Cyp2j7 AF218856 Darryl Zeldin exons 7-9
XM_143894.1 complete sequence chromosome 4
MLSTVGSLVARIWSAIHLWTLLLTLLVFRLLADYLKNRRPKNYP
PGPWRLPFVGNLFQFDLDVSQLHLRIQLFVKKYGNLISIDFGNISSVVITGLPLIKEA
FTDMEQNLLKRPIIPSRKHVFKDNGIIFSSGQKWKEQRRFALMILKSFGLGKKSLEQR
IQEEAYHLVKAMEEEKGQPFDPHFRINSAVSNIICSINFGERFEYGDNKFQELLKITN
ETLSAETSMMLVLYNALPSIMKYLPGPHQTVFRNWERLKMIVYHMMESHRKDWNPDEP
RDFIDAFLTEMTKYPDKTTTSFNEENLICSCLDLFIAGTETSSTALRWALLYMTLNPE
VQERVHSEIDGVIGQWRQPSTGDRDSMPYTNAVIHEVLRMGNIVPLNVPREVTADSTL
AGFHLPKGKMILTNLTALHRDPKEWATPDIFNPEHFLENGQFKKRESFLPFSMGKRAC
PGEQLARTELFIFFTALMQKFTFKAPVNEKLSLKFRMSVPISPVSYCICAVPRQ*

>37. Cyp2j8 AF218857 WQ 4-1 AI429871 vv77f02.y1 69-184 (EST),
AA760476 vv77f02.r1 69-227 (EST), AZ393698 283-329 (GSS), AI606765 vv77f02.x1 330-476 (EST)
AZ057726 422-463 (GSS), XM_131520.1 (from nr)
AL772157.1 htgs AC102925.1  sequence below is from mouse Ensembl version 3
4.94000001-95000000 chr 4 add 94 million to numbers to get global location
263773 MFATIGCLVSNICSEIHLWTLLLAALTLLLLVDYIKNRHPKNYPPGPWRLPFVGNLFQFDLDVSHLHLGIQP 263558
260386 FVKKYGNLISLDFGTIPSVIISGEPLIKEALTCMGQNFLKRPILAARQHIFKNNG 260222
IISSSGQTWKEQRRFTLMILKNFGLGKKSLEQRIQDEAHHLVEAIAEEKGRPFDPHFMIN
256048 NAVSNIICSITIGERFEYEDNQFQELLKLADETLCLEASKVLM 255920
226315 LYNVFPSIFKYLPGPHQKLFSNWEKLKLFFS 226223 frameshift
h this codon comes from CGAT at the frameshift (delete G)
226218 VMDSHRKDWNPSAPRDFIDAFLTEMAK 226138
222579 YSDKTTTSFNEENLICTTLDLFFAGTETTSTALRWALLYITVNPEVQ 222439
217694 EKVHSEIDRVIGQGRHPTIDDRDSMPYTNAVIHEVLRMGNIIPLNVPREVEADITLAGFHLPK 217506
194144 GTMVLTNLTALHRDPKEWATPDVFNPEHFLENGQFKKRESFLPFSV 194010
191558 GKRGCPGEQLARSELFTFFTALMQKFTFKPPINEKLSLNFKMGVALSPVSYCICAVPR* 191382

>38. Cyp2j9        AK018422 08-FEB-2001 lung also AF336850 from D. Zeldin
MLAAAGSLVAAIWAALH
LRILLLSAVTFLFLADFLKNRRPKNYPPWPMRLPFVGCLFHLDPKQPHLSLQQFVKKY
GNVLSLDFANIPSVVITGMPLIKEIFTQMEHNFMNRPVTLLRKHLFNKNGLIFSSDQT
WKEQRRFALMTLRNFGLGKRSLEERIQEEAHYLVEAIRDEGGQPFDPHFNISNAVSNI
ICSITFGERFDYHDSQFQEMLRLLDEAMCLESSMMCQLYNIFPRILQYLPGSHQTLFR
NWRKLQLFVSDIVNNHRRDWDPDEPRDFIDAFLTEMTKYPDKTTTSFNEENLICSTLD
LFFAGTETTSTTLRWALLCMALYPEVQEKMQAEIDRVIGQARQPNLADRDSMPYTNAV
IHEVQRMGNIIPFNVPREVAVDTQLAGFNLPKGIMVLSNLTALHRDPSEWTTPDTFNP
EHFLENGQFKKRESFLPFSMGKRACLGEQLARSELFIFITTLFQKFTFKPPVNEKLSL
QFRMAATVSPVSHRLCAIPRL*

>39. CYP2R1 XM_146091.1 Mus musculus 16 MAY 2002
1   MLELPGARACAGALAGALLLLLFVLVVRQLLRQRRPAGFPPGPPRLPFVGNICSLALSAD 180
181 LPHVYMRKQSRVYGE 
    IFSLDLGGISTVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMGGLL 540
541 NSRYGRGWIDHRRLAVNSFHYFGSGQKSFESKILEETWSLIDAIETYKGGPFDLKQLITN 720
721 AVSNITNLILFGERFTYEDTDFQHMIELFSENVELAASAPVFLYNAFPWIGILPFGKHQR 900
901 LFRNADVVYDFLSRLIEKAAVNRKPHLPHHFVDAYLDEMDQGQNDPLSTFSKENLIFSVG 1080
1081ELIIAGTETTTNVLRWAILFMALYPNIQGQVHKEIDLIVGHNRRPSWEYKCKMPYTEAVL 1260
1261HEVLRFCNIVPLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWKDPDMFYPERF 1440
1441LDSNGYFTKKEALIPFSLGRRHCLGEQLARMEMFLFFTSLLQQFHLHFPHELVPNLKPRL 1620
1621GMTLQPQPYLICAERR 1668

>40. Cyp2s1   mouse AC073725.2, AC087155.1 ESTS AA562979, AA543966, AA967201, AA472776
BF584822, AA546445 vk02f11.r1
3'UTR  Mm.23710 15 ESTs from the 3 prime untranslated region AI585412 vk02f11.y1
MEAASTWALLLALLLLLLLLSLTLFRTPARGYLPPGPTPLPLLGNLLQLRPGALYSGLLR (0)
LSKKYGPVFTVYLGPWRRVVVLVGHDAVREALGGQAEEFSGRGTLATLDKTFDGHG (1)
VFFANGERWKQLRKFTLLALRDLGMGKREGEELIQAEVQSLVEAFQKTE (1)
GRPFNPSMLLAQATSNVVCSLVFGIRLPYDDKEFQAVIQAASGTLLGISSPWGQ (0)
AYEMFSWLLQPLPGPHTQLQHHLGTLAAFTIQQVQKHQGRFQTSGPARDVVDAFLLKMAQ (0)
EKQDPGTEFTEKNLLMTVTYLLFAGTMTIGATIRYALLLLLRYPQVQ (1)
QRVREELIQELGPGRAPSLSDRVRLPYTDAVLHEAQRLLALVPMGMPHTITRTTCFRGYTLPK (0)
GTEVFPLIGSILHDPAVFQNPGEFHPGRFLDEDGRLRKHEAFLPYSL (1)
GKRVCLGEGLARAELWLFFTSILQAFSLETPCPPGDLSLKPAISGLFNIPPDFQLRVWPTGDQSR*

>41. Cyp2t4 AC073713.2 71% to CYP2T2P
 426 MVTCLALLLLLLILMLLLWWGGVVRRQAQMQKDLPPGPAPLPLLGNLLQLQSGDLDRVLME 608 (0)
 938 LSSHWGPVFTVWLGPLPAVVLCGYEALRDALVLQADAFSGRGAMAVFDRFTCGN 1100 (1)
     GIVFSNGPRWHSLRNFALGVLRELGVGRSTIEDRILEEAACVLDEFQATM 1610 (1)
     GAPFDPQQLLDSAVSNVICTVVFGKRYDYGDPEFRRLLNLFSDNFCIMSSRWAE (0)
     IYNMFPSFMDWIPGPHNRIFKNFQELRLFISEQIQWHWQSRQTGEPRDFIDCFLDQMDK 2698 (0)
2778 EQQDLESHFQDETLVMTTHDLFFGGTETTSTTLRYGLLIMLKYPEVA 2915 (1)
3017 AKVQEELDATVGRTWAPRIEDRARLPYTNAVLHEIQRFISVLPLGLPRALTRDVNLKNHFLHK 3205 (0)
3456 GTFVIPLLVSAHRDPTQFKDPDHFNPTNFLDDHGEFQNNDAFMPFAL 3596 (1)
     GKRMCLGAGLARSEIFLFLTAILQKFSLLPVGSPANINLNPQCTGLGNVPPAFQLRLVAR* 3868

>42. 2U1 mouse AK018458 Mus musculus 16 days embryo lung cDNA about 78% 
identical probable error at C-term some small diffs at intron boundaries
MSSLG DQRPAAGEQPGARLHVRA        TGGALLLCLLAVLLGWVWLRRQRACGI
      PPGPKPRPLVGNFGHLLVPRFLRPQFWLGS     GSQTDTVGQHVYLARMARVYGNI
      FSFFIGHRLVVVLSDFHSVREALVQQAEVFSDRPRMPLISIMT
      KEKGIVFAHY
      GPIWKQQRRFSHSTLRHFGLGKLSLEPRIIEEFAYVKEAMQKHGEAPFSPF
      PIISNAVSNIICSLCFGQRFDYTNKEFKKVLDFMSRGLEICLHSQLFLINICPWFYYLPF
      GPFKELRQIERDISCFLKNIIREHQESLDASNPQDFIDMYLLHMEEEQGASRRSSFDED
      YLFYIIGDLFIAGTDTTTNSLLWCLLYMSLNPDVQ
     KKVHEEIERVIGCDRAPSLTDKAQMPYTEATIMEVQRLSMVVPLAIPHMTSEKT
      VLQGFTIPKGTVVLINLWSVHRDPAIWEKPDDFCPHRFLDDQGQLLKRETFIPFGIG
(probable error here, missing last exon)QLKLGFNLFFTLSLVCVCVCVCVCVYRHV

>43. CYP2W1 MOUSE XM_144624 WHOLE mRNA May 16, 2002
PARTS from GSS AZ515172 AZ329864 AZ983190 BH076787
MALLLLGVWGILLLLGLWGLLQGCTRSPSLAPRWPPGPRPLPFL
GNLHLLGVTQQDRALMELSERYGPMFTIHLGSQKTVVLSGYEVVREALVGTGHELADR
PPIPIFQHIQRGGGIFFSSGARWRAGRQFTVRTLQSLGVQQPSMVGKVLQELACLKGQ
LDSYGGQPLPLALLGWAPCNITFTLLFGQRFDYQDPVFVSLLSLIDQVMVLLGSPGIQ
LFNTFPRLGAFLRLHRPVLSKIEEVRTILRTLLETRRPPLPTGGPAQSYVEALLQQGQ
EDDPEDMFGEANVLACTLDMVMAGTETTAATLQWAVFLMVKHPHVQGRVQEELDRVLG
PGQLPQPEHQRALPYTSAVLHEVQRYITLLPHVPRCTAADIQLGGYLLPKGTPVIPLL
TSVLLDKTQWETPSQFNPNHFLDAKGRFMKRGAFLPFSAGRRVCVGKSLARTELFLLF
AGLLQRYRLLPPPGLSPADLDLRPAPAFTMRP (end may be frameshifted)
PAQTFSYDSVYSGAKAAYPYVEVGSWPFIWHHGAEGVSAQCSGPTLS

>44. Cyp3a11       Mm.21193 genbank X60452
MDLVSALSLETWVLLAISLVLLYRYGTRKHELFKKQGIPGPKPL
PFLGTVLNYYKGLWKFDMECYKKYGKTWGLFDGQTPLLAVTDPETIKNVLVKECFSVF
TNRRDFGPVGIMSKAISISKDDEWKRYRALLSPTFTSGKLKEMFPVIEQYGDILVKYL
RQKAKKGKPVTMKDVLGAYSMDVITSTSFGVNVDSLNNPEDPFVEKAKKLLRFDFFDP
LLFSVVLFPFLTPVYEMLNICMFPKDSIEFFKKFVDRMKESRLDSKQKHRVDFLQLMM
NSHNNSKDKVSHKALSDMEITAQSIIFIFAGYETTSSTLSFTLHSLATHPDIQKKLQD
EIDEALPNKAPPTYDTVMEMEYLDMVLNETLRLYPIANRLERVCKKDVELNGVYIPKG
STVMIPSYALHHDPQHWSEPEEFQPERFSKENKGSIDPYVYLPFGNGPRNCLGMRFAL
MNMKLALTKIMQNFSFQPCKETQIPLKLSRQGLLQPEKPIVLKVVPRDAVITGA

>45. Cyp3a13       Mm.4094  genbank X63023
MDLIPNFSMETWMLLATSLVLLYLYGTHSHGIFKKLGIPGPKPL
PFLGTILAYQKGFWECDIQCHKKYGKMWGLYDGRQPVLAITDPDIIKTVLVKECYSTF
TNRRRFGPVGILKKAISISENEEWKRIRALLSPTFTSGRLKEMFPIINQFTDVLVRNM
RQGLGEGKPTSMKDIFGAYSMDVITATSFGVNIDSLNNPQDPFVEKIKKLLKFDIFDP
LFLSVTLFPFLTPVFDALNVSLFPRDVISFFTTSVERMKENRMKEKEKQRVDFLQLMI
NSQNYKTKESHKALSDVEIVAQSVIFIFAGYETTSSALSFALYLLAIHPDVQKKLQDE
IDAALPNKAPATYDTLLQMEYLDMVVNETLRLYPIAGRLERVCKTDVEINGLFIPKGT
VVMIPTFALHKDPKYWPEPEEFRPERFSKKNQDSINPYMYLPFGSGPRNCIGMRFALI
NMKVALVRVLQNFTVQPCKETEIPLKLSKQGLLQPENPLLLKVVSRDETVSDE

>46. Cyp3a16       Mm.30303 genbank D26137 no ESTs
MNLFSALSLDTLVLLAIILVLLYRYGTYTHGLFKKQGIPGPKPL
PFLGTVLNYYKGLWKFDMECYEKYGKTWGLFDGQIPLFVITDPETIKNVLVKECFSVF
TNRQDFFPVGIMSKSISLAKDEEWKRYRALLSPTFTSGNLKEMFPVIEQYGDILVKYL
RQEAEKGKPVAVKDVLGAYSMDVIISTTFGVNIDSLNNPEDPFVENAKKVLRFDYFDP
LSLSVALFPFLTPIYEMLNICMFPKDSIEFFKKFVDRMTENRLDSKQKHRVDFIYLMM
EAYNKSKDKDSHKALSEIEITAQSIIFIFAGYETTSSILSFTVYSLATHPDIQKKLQE
EIDEALPNKAPPTYDTVMAMEYLDMVLNETLRLYPITNRLQRVCKKDVEINGIYIPKG
STVIIPSYVLHHDPQHWPEPEEFQPERFSKENKGSIDPYVYLPFGNGPRNCIGMRFAL
MNMKLALIKVLQNFSFQPCKETQIPLKLSRELLLQPVKPIVLKVVPRDAVITGA

>47. Cyp3a25  AF204959
MELIPNLSIETWVLLVTSLVLFYIYGTYSHGLFKKLGIPGPKPL
PLLGTIFNYYDGMWKFDEDCYKKYGKIWGFYEGPQPILAIMDPEIIKIVLVKECYSVF
TNRRFFGPVGFMKKAITISEDEEWKRLRTLLSPTFTSGKLKEMFPIMRQYGDILVRNL
RREEEKGEPISMKDIFGAYSMDVITGTSFGVNVDSLNNPQDPFVQKAKKILKFKVFDP
FLLSIILFPFLTPIYEMLNFSIFPRDSMNFFKKFVKRMKKERLASNQKNRVDFLQLMM
NTQNSKGQESQKALSDLEMAAQAVIFIFGGYDATSTSISLIMYELATHPDVQKKLQDE
IDRTLPNKAPVTYDALMDMEYLDMVVNESLRLYPIAIRLERVSKKDVEINGVFIPKGT
VVMIPIYPLHRNPEYWPEPQEFCPERFSKENKGNIDPYIYMPFGNGPRNCIGMRFALI
SIKLAVIGVLQNFTVQPCEETQIPLKISREPIFQPEKPIILKVVSRDKPRTGS

>48. Cyp3a41     mouse NM_017396 AB033414
MNLFSALSLDTWVLLAIILVLLYRYGTRTHGLFKKQGIPGPTPL
PFLGTVLNYYKGLWKFDMECYEKYGKTWGLFDGQMPLFVITDPEMIKNVLVKECFSVF
TNRREFGPVGIMSKAISISKDEEWKRYRALLSPTFTSGKLKEMFPVIEQYGDILVKYL
MQEAEKGKPVTMKDVLGAYSIDVITSTSFGVNVDSLNNPEDPFVEKAKGILRVDFFDP
LVFSVVLFPFLTPVYEMLNICMFPKDSIEFFKKFVNRMKESRLDSKQKHRVDFLQLMM
NAHNNSKDKDSHKALSDMEITAQSIVFIFAGYETTSSTLSFTLYCLATHPDIQKKLQE
EIDETLPNKAPPTYDTVMEMEYLDMVLNETLRLYPIGNRLERFCKKDVELNGVYIPKG
STVMIPSYALHHDPQHWPEPEEFQPERFSKENKGSIDPYLYMPFGIGPRNCIGMRFAF
MTMKLALTKVMQNFSFQPCQETQIPLKLSRQGLLQPEKPIVLKVVPRDVVITGA

>49. Cyp3a44     mouse AB039380 Tsutomu Sakuma 10-JAN-2001
MNLFSALSLDTLVLLAIILVLLYRYGTRTHGLFKKQGIPGPKPLPFLGTVLNYYTGIWKFDMECYEKYGKTWGLFD
GQTPLLVITDPETIKNVLVKDCLSVFTNRREFGPVGIMSKAISISKDEEWKRYRALLSPTFTSGRLKEMFPVIEQY
GDILVKYLRQEAEKGMPVAMKDVLGAYSMDVITSTSFGVNVDSLNNPEDPFVEEAKKFLRVDFFDPLLFSVVLFPL
LTPVYEMLNICMFPNDSIEFFKKFVDRMQESRLDSNQKHRVDFLQLMMNSHNNSKDKDSHKAFSNMEITVQSIIFI
SAGYETTSSTLSFTLYCLATHPDIQKKLQAEIDKALPNKATPTCDTVMEMEYLDMVLNETLRLYPIVTRLERVCKK
DVELNGVYIPKGSMVMIPSYALHHDPQHWPDPEEFQPERFSKENKGSIDPYVYLPFGIGPRNCIGMRFALMNMKLA
VTKVLQNFSFQPCQETQIPLKLSRQGILQPEKPIVLKVVPRDAVITGA

>50. Cyp4a10       Mm.10742 genbank AB018421
MSVSALSPTRFADSLSGFLQVASVLGLLLLLVKAVQFYLHRQWL
LKAFQQFPSPPFHWFFGHEQFKGDHELQEIVSCIENFPSAFPRWFWGSKAYLTVYDPD
YMKVILGRSDPKANGAYRLLAPWIGYGLLLLNGQPWFQHRRMLTPAFHYDILKPYVKN
MADSIRLMLDKWERLADQDSSIEIFQHISLMTLDTVMKCAFSHKGSVQVDGNYRTYLQ
AIGDLNNLFHSRVRNIFHQNDTIYKLSSNGRLAKQACQLAHDHTDGVIKLRKDQLQDE
GELEKIKKKRRLDFLDILLFARMENGDSMSDKDLRAEVDTFMFEGHDTTASGVSWIFY
ALATHPDHQQRCREEVQSLLGDGSSITWDHLDQIPYTTMCIKEALRLYPPVPGIVREL
STSVTFPDGRSLPKGVQVTLSIYGLHHNPKVWPNPEVFDPSRFAPDSPRHSHSFLPFS
GGARNCIGKQFAMSELKVIVALTLLRFELLPDPTRVPMPLARLVLKSKNGIYLHLKKL
H

>51. Cyp4a12       no UNIGENE entry genbank X71479, Y10222 EST AA882070 vx36a01.r1
whole seq Jorge Capdevila sequence can be assembled from public ESTs
AA882070 mouse lung 937.
AI466445 mouse lung 937.
BF383974  NCI_CGAP_Li9 
BF533291  NCI_CGAP_Li9 
BF532415  NCI_CGAP_Li9 
BF385653  NCI_CGAP_Li9 
BF533939  NCI_CGAP_Li9 
BF532706  NCI_CGAP_Li9 
BF780461  NCI_CGAP_Kid14
BF780798  NCI_CGAP_Kid14
BF783550  NCI_CGAP_Kid14
BF789061  NCI_CGAP_Kid14
BF786791  NCI_CGAP_Kid14
BF783236  NCI_CGAP_Kid14
BF785997  NCI_CGAP_Kid14
BF783565  NCI_CGAP_Kid14
BF785444  NCI_CGAP_Kid14
BF782104  NCI_CGAP_Kid14
BF788511  NCI_CGAP_Kid14
BF789304  NCI_CGAP_Kid14
MSASALSSIRFPGSISEYLQVASVLSLLLLLFKTAQLYLHRQWLLSSTQQFPSPPSHWLF
GHKILKDQDLQDILTRIKNFPSACPQWLWGSKVRIQVYDPDYMKLILGRSDPKANGSYRF
LAPWIGRGLLLLDGQTWFQHRRMLTPAFHYDILKPYTEIMADSVRVMLDKWEQIVGQDST
LEIFRHITLMTLDTIMKCAFSHEGSVQLDRKYKSYIQAVEDLNDLVF
SRVRNIFH(Q)NDIIYRVSSNGCKANSACKLAHDHTDQVIKSRRIQLQDEEELEKLKKKRR
LDFLDILLFARMENGKSLSDKDLRAEVD
TFMFEGHDTTASGISWIFYALATNPEHQQRCRKEIQSLLGDGTS
ITWNDLDKMPYTTMCIKEALRIYPPVPSVSRELSSPVTFPDGRSLPKGIHVMLSFYGL
HHNPTVWPNPEVFDPSRFAPGSSRHSHSFLPFSGGARNC
IGKQFAMNELKVAVALTLLRFELLPDPTRVPIPIPRIVLKSKNGIHLHLKKLQ*

>52. Cyp4a14       Mm.7459  genbank Y11638 NM_007822
MGFFLFSPTRYLDGISGFFQWAFLLSLFLVLFKAVQFYLRRQWL
LKTLQHFPCMPSHWLWGHHLKDKELQQILIWVEKFPSACLQCLSGSNIRVLLYDPDYV
KVVLGRSDPKASGIYQFFAPWIGYGLLLLNGKKWFQHRRMLTPAFHYDILKPYVKIMA
DSVNIMLDKWEKLDGQDHPLEIFHCVSLMTLDTVMKCAFSYQGSVQLDENSKLYTKAV
EDLNNLTFFRLRNAFYKYNIIYNMSSDGRLSHHACQIAHEHTDGVIKMRKSQLQNEEE
LQKARKKRHLDFLDILLFARMEDRNSLSDEDLRAEVDTFMFEGHDTTASGISWIFYAL
ATHPEHQQRCREEVQSILGDGTSVTWDHLGQMPYTTMCIKEALRLYPPVISVSRELSS
PVTFPDGRSIPKGITATISIYGLHHNPRFWPNPKVFDPSRFAPDSSHHSHAYLPFSGG
SRNCIGKQFAMNELKVAVALTLLRFELLPDPTRIPVPIARLVLKSKNGIHLCLKKLR

>53. Cyp4a28p AJ297131, lower case = 4a10, same contig as 4x1
4.112000001-113000000 from Ensembl mouse version 3 chr 4 add 112 million to numbers
943832 MSISALSPTRFLECVSGFLQVVSLIGLLLLLFKAAQHYLRRQWLLRALYQFPSPPSHWLFGH 943647 
939707 QFKEDEELRWVLRYVEKFPSAFLHWIWGN 939621 frameshift
939621 *VFIKIYDPDYMKVILGRA 939565                                   
939009 SDPKAGLYRFLTPWL 938965                                          
938111 GHGLFILNGHKWFQHRRMLTSAFHYDILKPYVGTMADSVQVIL 937983               
930960 DKWEQIACQDITLEIFHPISLMMLETIMKCAFSYQGSVQLDRS 930832               
930514 RSSQAYLQAVSDLNNLVTSRMTNVFLHNDIIYNLTSHSRRTNSACQIAHEHT 930359      
929925 DRVVKLRKAQLQDNKSMEKLRGKRCLDFLDILLLSRVSS 929809        
       DDGSSLSDKALRAEVDSLMFGDHDTPASGISWVFYALATHPNHQQRCTEEVQSLLGDGSPITW 929542 
       EHLHQMPYTTLCVKEAQRLYPTVPSVGRKLNTPVTFPDGRSLPKG 929424
(missing exon 10)
927997 VFDPSRFAMNSVQHSHAFLPFSGGS 927923                               
926387 RNCIGKQLAMNALKVAVALTLLRFELLPDPT*VPIPTHQLILKSTNGIHL 926238        

exact duplicate of exons 6-12
925899 RSSQAYLQAVSDLNNLVTSRMTNVFLHNDIIYNLTSHSRRTNSACQIAHEHT
       DRVVKLRKAQLQDNKSMEKLRGKRCLDFLDILLLSRVSS 925195
       DDGSSLSDKALRAEVDSLMFGDHDTPASGISWVFYALATHPNHQQRCTEEVQSLLGDGSPITW 924927 
       EHLHQMPYTTLCVKEAQRLYPTVPSVGRKLNTPVTFPDGRSLPKG 924809
923831 IESTVSCFIFSSAWKAFDKPE
923372 VFDPSRFAMNSVQHSHAFLPFSGGS 923298                               
922002 RNCIGKQLAMNALKVAVALTLLRFELLPDPT*VPIPTHQLILKSTNGIHLHLRKL 921838   

>Cyp4a29 4.113000001-114000000 chr 4 from mouse Ensembl version 3 add 113 million to numbers
91% to 4A28p
47095 MSISALSPIRFLECVSGFLQVIFLIGLLLLLFKAAQHYLRRQWLLRALYQFPSPPSHWLFGH 47280 
53123 QFKVDEELRWILRYLEKFPGAFLHWIWGSHVFIKVCDPDYMKVILGRA 53266          
53824 SDPKASLYRFLTPWL 53868                                           
54675 GHGLFFLNGDEWFQHRRLLTPAFHYDILKPYVGIMADSVQVML 54803
55849 EKWEQIACQDITLEIFHPISLMMLDTIMKYAFSYQGSVQLDRS 55977                 
56295 RSSQAYLQAVSDLNNLVTSRMKNVFLHNDIIYNLTSHSRRTNSACQIAHEHT 56450        
56880 DRVVKLRKAQLQDNKSMKKLRGKRCLDFLDILLLSRVSS 56996
57075 DDGSSLSDKALRAEVDSLMFGGHDTPASGISWVFYALATHPDHQQRCREEVQSLLGDGSPITW 57263 
57362 DHLHQMPYTTMCIKEALRLYPPIPSVGRKLSTPVTFPDGRSLPKG 57378
58567 IESTVSCFIFFSAWKAFDKPE
59045 VFDPSRFAMNSVQHSHAFLPFSGGS 59119                                 
60585 RNCIGKHLAMNVLKVAVALTLLRFELLPDPSRVPIPTQQLVLKSKNGIYLHLRKL 60749     

>54. Cyp4b1        Mm.1840  genbank D50834
MALSFLSPSLSRLGLWASVVILMVTVLKLLSLLFRRQKLARALD
SFPGPPKHWLFGHALEIQKTGGLDKVVTWTEQFPYAHPLWLGQFIVFLNIYEPDYAKA
VYSRGDPKAAYVYDFFLQWIGKGLLVLEGPKWFQHRKLLTPGFHYDVLKPYVAIFAES
TRVMLDKWEKKASENKSFDIFCDVGHMALDTLMKCTFGKGDSGLSHSDNSYYLAVSDL
TLLMQQRIDSFQYHNDFIYWLTPHGRRFLRACQIAHDHTDHVIRQRKAALQDEKEQKK
LQERRHLDFLDILLGARDESGIKLSDADLRAEVDTFMFEGHDTTTSGISWFLYCMALY
PMHQQRCREEVREILGDRDSFQWDDLAQMTYLTMCMKECFRLYPPVPQVYRQLSKPVT
FVDGRSLPAGSLISLHIYALHRNSAVWPDPEVFDPLRFSPENMTGRHPFAFMPFSAGP
RNCIGQQFAMNEMKVVTALCLLRFEFSPDPSKIPIKVPQLILRSKNGIHLYLKPLGPG
SGK

>55. Cyp4f13       Mm.22045 15 ESTs follow AI323999 
BF533850 BC003954 AF233643
MLQLCLSWLGMGSLTASPWHLLLLGGASWILARILAWIYAFYDNCSRLRCFPQPPKPSWF
WGHLALMKNNEESMQFITHLGHDFHDVHLSWVGPVYPILRLVHPNFIAPLLQASAAVAPKEMTLYGFLKP 
WLGDGLLMSAGDKWSHHRRLLTPAFHFDILKSYVKIFNKSVNIMHAKWQCLASKGTSRLD 
MFEHISLMTLDSLQKCIFSVDSNCQESDSKYIAAILELSSLVVKRHRQP 
FLYLDLLYYLTADGRRFRKACDLVHNFTDAVIKERRSTLNTQGVEFLKAKAKTKTLD 
FIDVLLMAEDEHGKGLSNEDIRAEADTFMFGGHDTTTSALSWILYNLARHPEYQERCRQ 
EVQELLRDRDSEEIEWDDLAQLPFLTMCIKESLRLHPPVLLISRCCTQDVLLPDGRAI
PKGNICVISIFGVHHNPSVWPDPEVYNPFRFDPENPQKRSPLAFIPFSAGTRNCIGQTFA
MSEIKVALALTLLRFRILPDDKEPRRKPELILRAEGGLWLRVEPLSAGAQ* 

>56. Cyp4f14      Mm.10976 C-term only. AF233644 AK018676 AC073720 91k-108k 
              3ESTs follow AI046446
              The N-terminal is not in    
              UNIGENE but consists of AI098768, AI047403, 
              AI099192, AA237519, AA793220, AA869706, 
              AI118365 (chimeric clone), AA880205 BE135615
MSQLSLSWLGLGPEVAFPWKTLLLLGASWILARILIQIYAAYRN
YRHLHGFPQPPKRNWLMGHVGMVTPTEQGLKELTRLVGTYPQGFLMWIGPMVPVITLC
HSDIVRSILNASAAVALKDVIFYSILKPWLGDGLLVSAGDKWSRHRRMLTPAFHFNIL
KPYVKIFNDSTNIMHAKWQRLISDGSARLDMFEHVSLMTLDSLQKCVFSFDSNCQEKS
SEYIAAILELSALVAKRHQQPLMFMDLLYNLTPDGMRFRKACNVVHEFTDAVIRERHR
TLPDQGLDDFLKSKAKSKTLDFIDVLLLSKDEDGKELSDEDIRAEADTFMFEGHDTTA
SGLSWILYNLARHPEYQERCRQEVQELLRGREPEEIEWDDLAQLPFLTMCIKESLRLH
PPVTVISRCCTQDILLPDGRTIPKGIICLISIFGIHHNPSVWPDPEVYDPFRFDPENI
KDSSPLAFIPFSAGPRNCIGQTFAMSEMKVALALTLLRFRLLPDDKEPRRQPELILRA
EGGLWLRVEPLSAGAH

>57. Cyp4f15      Mm.26539 AF233645 1EST AI121221 
              AK018904
              no UNIGENE entry for N-terminal 
              ESTs AI315840 uj47c01.y1
              AI046615 ud70e01.y1, AI048594 ud61g02.y1
              Mm.26539 AI121221 matches C-term of Cyp4f13,  
              Cyp4f14 and Cyp4f15 exactly, but its opposite
              end is really Cyp4f15 this may indicate gene
              conversion occurred among these three genes.
MGFFRMPQLDLSWLGLRLEASSPWLLLLLIGASWLLARVLTQTY
IFYRTYHHLCDFPQPPKWNWFLGHLGMITPTEHGLKEVTNLVATYPQGFMTWLGPIIP
IITLCHPDIIRSVLNASASVALKEVVFYSFLKPWLGDGLLLSDGDKWSSHRRMLTPAF
HFNILKPYVKIFNDSTNIMHAKWQHLASGGSARLDVFENISLMTLDSLQKCVFSFDSN
CQENPSEYISAILELSALVTKRYHQLLLHIDSLYQLTCSGRRFHKACHLVHSFTDAVI
QDRRRTLPSKHEDDVLKAKAKSKTLDFIDVLLLSKDEDGKELSDEDIRAEADTFMFEG
HDTTASGLSWILYNLARHPEYQERCRQEVQELLRDRESTEIECSCAVFLRDDLAQLPF
LTMCIKESLRLHPPVTVISRRCTQDIVLPDGRVIPKGVICIINIFATHHNPTVWPDPE
VYDPFRFDPENIKDRSPLAFIPFSAGPRNCIGQTFAMNEMKVALALTLLRFRVLPDDK
EPRRKPELILRAEGGLWLRVEPLSTQ

>58. Cyp4f16 AF233646 AK009445 no UNIGENE entry ESTs AA059667 mj77a08.r1 
              AA863889 vx15d11.r1, AA673923 vo83a09.r1
              AA798644 vw34b12.r1, AA896484 vx63d12.r1
AA671839 vl02c09.r1, 
              AA755081 vu02b08.r1
AC073754 Mm.30504 ESTs AA688499 vq53e04.r1
              AA791143 vv91d10.r1, AI036012 vz68a09.r1
              AA794692 vu63c06.r1
MLRLSVSGLDLGSVVTSSWHLLLLGVASWILARILAWTYSFYENCSRLSCFPQPPKKNWFSGHLGM
IQSNEEGMQLVTEMGQAFQDVHLFWLGPVIPVLRIVDPAFVAPLLQAPALVAPKDMTFLRFLKPWL 
GDGLFLSSGDKWSRHRRLLTPAFHFDILKPYVKIFNQSVNI
MHAKWKHLSSEGSARLEMFEHISLMTLDSLQKCLFGFDSNCQES
PSEYISAILELSSLIIKRSLQLFLFVDFLYYHTADGRRFRKACDLVHNFTDAVIRERR
HTLSSQNHDEFLKSKTKSKTLDFIDVLLLAKDEHGKELSDEDIRAEADTFMFGGHDTT
ASALSWILYNLARHPEYQERCRQEVQELLRDREPEEIEWDDLAQLPFLTMCIKESLRL
HSPVIDLLRRCTRDIVLPDGRVIPK
GNICVISIFGIHHNPSVWPDPE
VYDPFRFDPENPQKRSPLAFIPFSAGP
RNCIGQTFAMSEMKVALALTLLRFRILPDDKEPRRKPEIILRAEGGLWLRVEPLSKGAQ* 128853

>59. Cyp4f17       AC073754 (5 exon 9s) no UNIGENE entry ESTs AA387122 vc22e06.r1
              AA386914 vc23b12.r1, AA387149 vc22f06.r1
MLQLGLSWLGLGPGAAFPWQLLQLV
GASLFLARILTWICAFYDNYCRLRCFPEPPSRHWFWGHMSM
VKNNEEGLQLLTERSHQFHDVHLCWIGPFYPILRLIHPKFIGSILQAS
AAVAPKEMIFYGFLKPWLG
DGLLVSAGEKWSRHRHLLTPAFHFDILKPYMKNFNKSVNIMH
AKWQRLTTKGTACLDMLEHISLMTLDSLQNCVFSFDSNCQE
SPSEYIAAIQELSSLIVKRHHQPFLYLDFLYYCTADGRRFRKACDLVH
NFTDAVIRERRRTLSSQNLDEFLKSKTKSKTLDFIDVLLLAK
DEHGKELSDEDIRAEADTFMFG
GHDTTASALSWILYNLARHPEYQERCRQEVQELLRGREPQEIEW
DDLAQLPFLTMCIKESLRLHPPVTVISRCCTQDVVLPDGRVIPK
GTDCVISIFGVHHNPEVWPDPE
VYDPFRFDPENPQKRSPLAFIPFSAGPR
NCIGQTFAMREMKVALALTLLRFRVLPGDKEPRRKPELILRAEGGLWLRVEPLSKGTQ* 152356

>60. Cyp4f18 AF233647 AK007863 frag.b no UNIGENE entry EST AA716998 vu61b04.r1
Cyp4f18 frag.d no UNIGENE entry EST AI006742 ua82h10.r1
AA716998 vu61b04.r1
MSQLSLSWMGLGHTAASPWLLLLLAGASCLLAYILTQIYGVFEN
SLRLRCFPQPPKRNWILGHLGLIQSSEEGLLYIQSLVRTFRDACCWWVGPLHPVIRIF
HPAFIKPVVLAPALVAPKDTVFYRFLKPWLGDGLLMSTGDKWSRHRRMLTPAFHFNIL
KPYVKVFNDSTNIMHAKWQRLASKGSAYLNMFEHISLMTLDSLQKCVFSFDSNCQEKP
SEYITAILELSALVARRHQRLLLHVDLFYYLTHDGMRFRKACRLVHDFTDAVIRERRR
TLLDQGGDDVLKAKAKAKTLDFIDVLLLSKDEHGKALSDEDIRAEADTFMFGGHDTTA
SGLSWILYNLARHPEYQERCRQEVRELLRDREPEEIEWDDLAQLPFLTMCIKESLRLH
PPVTAISRCCTQDIVLPDGRVIPKGVISRISIFGTHHNPAVWPDPEVYDPFRFDADNV
KGRSPLAFIPFSAGPRNCIGQTFAMSEMKVALALTLLRFRVLPDDKEPRRKPELILRA
EGGLWLKVEPLSAGAQ

>61. Cyp4f37 AC073754 comp(28012-47256) 93% identical to 4f16  
MLRLSVSGLDLGSVVTLSWHLLLLGAASWILARILAWTYSFCENCSRLRCFPQPPKRNWFLGHLGM
IEHNEEGMQLMTEAGQTYRDVHLCWLGPVIPVLRIVDPAFVAPLLLA TALVAPKDTTFHTFVKPWL
GDGLFLNSGDKWSRHRRLLTPAFHFDILKPYVKIFNQSVNIMH
AKWKHLSSEGSARLEMFEHISLMTLDSLQKCLFGFDSNCQE
SPSEYISAILELSSLVIKRSHQLFLFVDFLYYHTADGRRFRKACDLVHNFTDAVIR
ERRHTLSSQNHDEFLKSKTKSKTLDFIDVLLLAK
DEHGKELSDEDIRAEADTFMFGG
HDTTASALSWILYNLARHPEYQERCRQEVQELLRGREPQEIEW
DDLAQLPFLTMCIKESLRLHPPVIDLLRRCTRDIVLPDGRVIPK
GNICVISIFGIHHNPSVWPDPE
VYDPFRFDPENAHKRPPLAFIPFSAGP
RNCIGQTFAMNEMMVALALTLLRFRILPDDKEPRRKPEIILRAEGGLWLRVEPLS

>62. Cyp4f38p new AC073754 comp(9555-20475) partial pseudogene 52% to 4f14
AHILIQVYAAYGNYPHIPKWNWLMGHVSM
PQRRARRMLLNW*GNYP*GFIL*MIPLVSILKPCHPGIAKSV
VSAGDKWSHHCHMLTPAFHFNILKLYVKIFNDSTNIVH
LNISDHVILMTLGNFQKCIFSFDSNCQD
AIMELSMLISRRNKEPLLHMDLLGHLTPDGMCCCKACSMMHGLDDAVI
QERGHILSRQGLDGFFKDKVKFKPLNFIAVLLLTK
DDMSQLPFLTMCISLWLHPPDLLISYYCI*DVVLPDGWIIPKG

>63. Cyp4f39 AC073754 range 185296-209241 minus strand 62% to 4F14
MLPLTDYLLQLLGLEKTAFRVYV
VSALLLLVLFFFFRLLVRAFKLFSDFRITCRKLSCFPEPPGRHWLLGHMSMVS
PNEKGLQNEKKVLDTMHHIILAWVGPFLPLLVLVHPDYIKPVLGAS
AAIAPKDEFFYSFLKPWLG
DGLLISKGNKWSRHRRLLTPAFHFDILKPYMKIFNQCTNIMH
AKWRRHLAEGSVTSFDMFEHISLMTLDSLQKCVFSYNSDCQE
SMSDYISSIIELSALVVRRQYRLHHYLDFMYYLTADGRRFRQACDTVHNFTTEVI
QERRQALRQQGAEAWLKAKQGKTLDFIDVLLLAK
DEEGKELSDEDIRAEADTFMFEG
HDTTSSGLSWALFNLAKYPEYQEKCREEIQEVMKGRELEELDW
DDLTQLPFTTMCIKESLRQFPPVTLISRRCTEDIKLPDGRVIPK
GIICLVSIYGTHHNPIVWPDSK
VYNPYRFDPDTPQQRSPLAFVPFSAGP
RNCIGQSFAMAEMRVVVALTLLRFRLSVDRTHKVRRKPELILRTENGLWLNVEPLPSRA*

>64. Cyp4f40 AC073754 range <1-3688 minus strand two N-terminal exons no ESTs
continues on AC073720 comp(118300-137027) 79% to 4F14
MRYLDLSWLGLGPMSASPWLLFFLVGISWFLARFLTQLYTLYAKCQRLCGFPQPPKKNWFWGHLGMVS
PTEEGMKQVTELVTTYPQGFMTWLGPIVPLITLCHPDIIRSVLSAS
SAAVAPKDDIFYSFLKPWL
GDGLLVSAGDKWNRHRRMLTPAFHFNILKPYVKIFNDSTNIMH
AKWLRLASGGSTRLNMFENISLMTLDTLQKCVFSFNSNCQE
KPSQYIAAILELSTLAVKRNEQLLMHVDLLYRLTPDGMRFYKACRLVHDF
TDAVIQERRRTLLKHGGDDIIKAKAKSKTLDFIDVLLLTK
DEDGKELSDEDIRAEADTFMFRG
HDTTASGLSWILYNLARHPEHQERCRQEVQELLRDRDPKEIEW
DDLAQLPFLTMCIKESLRLHPPVTMVSRCCTQDISLPDGRIIPK
GVICIINIFGTHHNPTVWRDPE
VYDPFRFDPENIQARSPLSFIPFSAGP
RNCIGQTFAMSEMKVALALTLLRFRILPDDKEPRRKPELILRAEGGLWLRVEPLSTQA

>65. cyp4v3 AB056457 28 feb 2001 81% to 4v2
MLWLWLGLSGQKLLLWGAASAVSLAGATILISIFPMLVSYARKW
QQMRSIPSVARAYPLVGHALYMKPNNAEFFQQLIYYTEEFRHLPIIKLWIGPVPLVAL
YKAENVEVILTSSKQIDKSFLYKFLQPWLGLGLLTSTGSKWRTRRKMLTPTFHFTILE
NFLDVMNEQANILVNKLEKHVNQEAFNCFFYITLCALDIICETAMGKNIGAQSNNDSE
YVRTVYRMSDMIYRRMKMPWLWFDLWYLVFKEGRDHKRGLKCLHTFTNNVIAERVKER
KAEEDWTGAGRGPIPSKNKRKAFLDLLLSVTDEEGNRLSQEDIREEVDTFMFEGHDTT
AAAINWSLYLLGTNPEVQRKVDQELDEVFGRSHRPVTLEDLKKLKYLDCVIKETLRVF
PSVPLFARSLSEDCEVGGYKVTKGTEAIIIPYALHRDPRYFPDPEEFRPERFFPENSQ
GRHPYAYVPFSAGPRNCIGQKFAVMEEKTILACILRQFWVESNQKREELGLAGDLILR
PNNGIWIKLKRRHEDDP

>66. Cyp4x1 AJ297131 runs from 112000 to 138000 on the minus strand
mRNA            complement(join(112347..112511,116446..116595,
                     123737..123925,125922..126074,126161..126286,
                     128649..128777,131748..131888))
                     /gene="CYP_a"
MEASWLETRWARPLHLALVFCLALVLMQAMKLYLRRQRLLRDLSPFPGPPAHWLLGHQK
FLQEDNMETLDEIVKKHPCAFPCWVGPFQAFFYIYDPDYAKIFLSRT
GRGLLNLDGPRWFQHRCLLTPAFHQDILKPCVDTMAHSVKVML
DKWEKMWTTQETTIEVFEHINLMTLDIIMKCAFGQETNCQIN (?)
STYESYVKATFELGEIISSRLYNFWHHHDIIFKLSPKGHCFQELGKVIHQYT (?)
EKIIQDRKKILKNQVKQDDTQTSQIFLDIVLSAQ (?)
GKEDERAFSDADLRAEVNTFMWAGHDASAASISWLLYCLALNPEHQDRCRTEIRSILGDGSSITW
EQLDEMSYTTMCIKETLRLIPPVPSISRELSKPLTLPDGHSLPAG (?)
MTVVLSIWGLHHNPAVWNDPK (?)
VFDPLRFTKENSDQRHPCAFLPFSSGP (?)
RNCIGQQFAMLELKVAIALILLHFQVAPDLTRPPAFSSHTVLRPKHGIYLHLKKL

>67. Cyp5a1 Mm.4054  genbank L18868 AC073390.1
MEVLGLLKFEVSGTIVTVTLLVALLALLKWYSMSAFSRLEKLGI
RHPKPSPFVGNLMFFRQGFWESQLELRERYGPLCGYYLGRRMHVVISEPDMIKQVLVE
NFSNFSNRMASGLEPKMVADSVLLLRDRRWEEVRGALMSSFSPEKLDEMTPLISQACE
LLVAHLKRYAASRDAFNIQRCYCCYTIDVVASVAFGTQVDSQNSPEDPFVQHCRRAST
FCIPRPLLVLILSFPSIMVPLARILPNKNRDELNGFFNTLIRNVIALRDQQAAEERRR
DFLQMVLDAQHSMNSVGVEGFDMVPESLSSSECTKEPPQRCHPTSTSKPFTVDEIVGQ
AFLFLIAGHEVITNTLSFITYLLATHPDCQERLLKEVDLFMGKHPAPEYHSLQEGLPY
LDMVISETLRMYPPAFRFTREAAQDCEVLGQRIPAGTVLEIAVGALHHDPEHWPNPET
FDPERFTAEARLQRRPFTYLPFGAGPRSCLGVRLGLLVVKLTILQVLHKFRFEASPET
QVPLQLESKSALGPKNGVYIKIVSR

>68. Cyp7a1        no UNIGENE entry genbank L23754 EST AA254999
MMSISLIWGIAVVVSCCIWFIIGIRRRKVGEPPLDNGLIPYLGC
ALKFGSNPLEFLRAKQRKHGHVFTCKLMGKYVHFITNSLSYHKVLCHGKYFDWKKFHY
TTSAKAFGHRSIDPSDGNTTENINKTFNKTLQGDALCSLSEAMMQNLQSVMRPPGLPK
SKSAVWVTEGMYAFCYRVMFEAGYLTLFGKDISKTDTQRAFIQNNLDSFKQFDQVFPA
LVAGVPIHLLKTAHKARERLAESLKHKNLYMRDQVSELIRLRMFLNDTLSTFDDMEKA
KTHLVILWASQANTIPATFWSLFQMIRSPEAMKAASEEVNGSLQSAGQELSSGGNAIY
LDQEQLNNLPVLDSIIKEALRLSSASLNIRTAKEDFTLHLEDGSYNIRKDDIIALYPQ
LMHLDPEIYPDPLTFKYDRYLDESGKAKTTFYRNGNKLKYFYMPFGSGATICPGRLFA
VQEIKQFLILMLSYFELELVESHTKCPPLDQSRAGLGILPPLNDIEFKYKLKH

>69. Cyp7b1        Mm.4781  genbank U36993
MQGATTLDAASPGPLALLGLLFAATLLLSALFLLTRRTRRPREP
PLIKGWLPYLGMALKFFKDPLTFLKTLQRQHGDTFTVFLVGKYITFVLNPFQYQYVTK
NPKQLSFQKFSSRLSAKAFSVKKLLTDDDLNEDVHRAYLLLQGKPLDALLETMIQEVK
ELFESQLLKITDWNTERIFAFCGSLVFEITFATLYGKILAGNKKQIISELRDDFFKFD
DMFPYLVSDIPIQLLRNEESMQKKIIKCLTSEKVAQMQGQSKIVQESQDLLKRYYRHD
DSEIGAHHLGFLWASLANTIPAMFWAMYYILRHPEAMEALRDEIDSFLQSTGQKKGPG
ISVHFTREQLDSLVCLESTILEVLRLCSYSSIIREVQEDMNLSLESKSFSLRKGDFVA
LFPPLIHNDPEIFDAPKEFRFDRFIEDGKKKSTFFKGGKRLKTYVMPFGLGTSKCPGR
YFAVNEMKLLLIELLTYFDLEIIDRKPIGLNHSRMFLGIQHPDSAVSFRYKAKSWRS

>70. Cyp8a1        Mm.2339  genbank AB001607
MSWAALLGLLAVLLLLLLLLSRRRARRPGEPPLDLGSIPWLGHA
LEFGRDAASFLTRMKEKHGDIFTVLVGGRYVTVLLDPHSYDTVVWELRTRLDFHPYAI
FLMERIFDLQLPNFNPSEEKARMKPTLMHRDLQALTEAMYTNLRTVLLGDSTEAGSGW
QETGLLEFSYNALLSAGYLTLYGVEASPRTHESQAQDRVHSADVFHTFRQLDLLLPKL
ARGSLSAGDKDHACSVKNRLWKLLSPARLASRADRSSWLESYLRHLEEMGVSEEMQAR
ALVLQLWATQGNMGPTAFWLLLFLLKNPEALAAVRAELKHTVWQAEQPVSQMTTLPQK
ILDSMPVLDSVLNETLRLTAAPFITREVMADLALPMADGREFSLRRGDRLLLFPFLSP
QKDPEIYTEPEVFKYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNQCLGKSYAIN
SIKQFVVLLLTHFDLELGSEDTEVPEFDLSRYGFGLMQPEEDVPIRYRARL

>71. Cyp8b1        Mm.20889 genbank AF090317
MTLWCTVLGALLTVVGCLCLSLLLRHRRPWEPPLDKGFVPWLGH
SMAFRKNMFEFLKGMRAKHGDVFTVQLGGQYFTFVMDPLSFGPIIKNTEKALDFQSYA
KELVLKVFGYQSVDGDHRMIHLASTKHLMGQGLEELNQAMLDSLSLVMLGPKGSSLGA
SSWCEDGLFHFCYRILFKAGFLSLFGYTKDKQQDLDEADELFRKFRRFDFLFPRFVYS
LLGPREWVEVSQLQRLFHQRLSVEQNLEKDGISCWLGYMLQFLREQGIASSMQDKFNF
MMLWASQGNTGPTCFWVLLFLLKHQDAMKAVREEATRVMGKARLEAKKSFTFTPSALK
HTPVLDSVMEESLRLCATPTLLRVVQEDYVLKMASGQEYQIRRGDKVALFPYLSVHMD
PDIHPEPTAFKYDRFLNPDGTRKVDFYKSGKKIHHYSMPWGSGVSKCPGRFFALSEMK
TFVLLMIMYFDFKLVDPDIPVPPIDPRRWGFGTSQPSHEVRFLYRLKPVQ

>72. Cyp11a1       Mm.28748 genbank J05511, NM_019779, AF195119
              ESTs AA288657
              AA107913, AA288671, AA684394, AA920968, 
              AI324751, AA920915, C85519, C87334, D19277,
              AU021754, C87804, C85447, C87406, AA389569,
              C88026, AU108054, C86435, C86758, W18146
MLAKGLSLRSVLVKGCQPFLSPTWQGPVLSTGKGAGTSTSSPRS
FNEIPSPGDNGWLNLYHFWRESGTQKIHYHQMQSFQKYGPIYREKLGTLESVYIVDPK
DASILFSCEGPNPERFLVPPWVAYHQYYQRPIGVLFKSSDAWKKDRIVLNQEVMAPGA
IKNFVPLLEGVAQDFIKVLHRRIKQQNSGNFSGVISDDLFRFSFESISSVIFGERMGM
LEEIVDPEAQRFINAVYQMFHTSVPMLNLPPDFFRLLRTKTWKDHAAAWDVIFNKADE
YTQNFYWDLRQKRDFSQYPGVLYSLLGGNKLPFKNIQANITEMLAGGVDTTSMTLQWN
LYEMAHNLKVQEMLRAEVLAARRQAQGDMAKMVQLVPLLKASIKETLRLHPISVTLQR
YTVNDLVLRNYKIPAKTLVQVASFAMGRDPGFFPNPNKFDPTRWLEKSQNTTHFRYLG
FGWGVRQCLGRRIAELEMTILLINLLENFRIEVQNLRDVGTKFSLILMPENPILFNFQ
PLKQDLGPAVTRKDNTVN

>73. Cyp11b1 80% to mouse 11B2 64% to human 11B1 and 11B2 80% to 11B1 rat 78% to 11B2 rat. GenBank J04451, sequence below is from Ensembl mouse version 3 
15.75000001-76000000 mouse chr 15 add 75 million to get global location
306415 MALRVTTDVWLARPWQCLHRTRALGTTATLAPKTLQPFEAIPQYSRNKWLKMIQILREQG 306356
306355 QENLHLEMHQVFRELGPIFR 306176 (?)
305773 HSVGKTQIVFVTLPEDVEKLYQVESTHPCRMPLESWIVHRELRGLGRGVFLL 305618 (?)
304234 NGPEWYFNRLQLNPNVLSPKAVQKFVPLVDGIARDFVDNLKKKMLESVHGSFSMDFQSSV 304175
304174 FNYTIE 304157 (?)
303918 ASHFVLFGERLGLIGRDLSPDSLKFLHTLHSMFKTTTQLLYLPRSLTRWTSTRVWKENLE 303859
303858 SWDFISEYG 303826 (?)
303446 TKCIKNVYRELAEGRPQSWSVTAELVAERTLSMDAIQANSMELIAGSTDT 303413 (0?)
301935 TSTPLVMTFFELARNPDVQQALRQESLAAEASIAANPQKAMSDLPLLRAALKETLR 301768
301535 LYPVGTFLERILSSDLVLQNYHVPAG 301458
301370 TVLNVNLYSMGRNPAVFPRPERYMPQRWLERKRSFKHLAFGFGVRQCLGRRLAEAEM 301314
301313 MLLLHH 301182 (?0)
300923 VLKSFHVETQEK 300888
300887 EDVRMAYRFVLMPSSSPLLTFRPVN 300705

>74. Cyp11b2 genbank S85260 sequence below is from Ensembl mouse version 3
15.75000001-76000000 mouse chr 15 add 75million to get global location
68% to human 11B2 67% to human 11b1 94% to 11B2 rat 81% to 11B1 rat
321034 MALRVTADVWLARPWQCLHRTRALGTTATLAPKTLQPFEAIPQYSRNKWLKMIQILREQG 320975
320974 QENLHLEMHQVFRELGPIFR 320795 (?)
320391 HSVGKTQIVSVMLPEDAEKLHQVESMLPRRMHLEPWVAHRELRGLRRGVFLL 320238 (?)
318803 NGPEWRLNRLRLNRNVLSPKAVQKFVPMVDMVARDFLETLKEKVLQNARGSLTM 318762
318761 DVQQSLFNYTIE 318606 (?)
318477 ASNFALFGERLGLLGHDLSPGSLKFIHALHSMFKSTS 318487
318486 QLLFLPKSLTRWTSTRVWKEHFDAWDVISEYG (?) 318391
318000 NRCIWKVHQELRLGSSQTYSGIVAELISQGSLPLDAIKA 318004
318003 NSMELTAGSVDT 317848 (?)
316981 TAIPLVMTLFELARNPDVQKALRQESLAAEASIAANPQKAMSDLPLLRAALKETLR 316814 (?)
316581 LYPVGGFLERILSSDLVLQNYHVPAG 316504 (0?)
316416 TLVLLYLYSMGRNPAVFPRPERYMPQRWLERKRSFQHLAFGFG 316396
316395 VRQCLGRRLAEVEMMLLLHH 316228 (0?)
315961 ILKTFQVETLRQEDVQMAYRFVLMPSSSPVLTFRPVS 315851

>75. Cyp17         Mm.1262  genbank M64863 AC079419.1
MWELVGLLLLILAYFFWPKSKTPNAKFPRSLPFLPLVGSLPFLP
RRGHMHANFFKLQEKYGPIYSLRLGTTTAVIVGHYQLAREVLVKKGKEFSGRPQMVTL
GLLSDQGKGVAFADSSSSWQLHRKLVFSTFSLFRDDQKLEKMICQEANSLCDLILTYD
GESRDLSTLIFKSVINIICTICFNISFENKDPILTTIQTFTEGIVDVLGHSDLVDIFP
WLKIFPNKNLEMIKEHTKIREKTLVEMFEKCKEKFNSESLSSLTDILIQAKMNAENNN
TGEGQDPSVFSDKHILVTVGDIFGAGIETTSSVLNWILAFLVHNPEVKRKIQKEIDQY
VGFSRTPSFNDRTHLLMLEATIREVLRIRPVAPLLIPHKANIDSSIGEFAIPKDTHVI
INLWALHHDKNEWDQPDRFMPERFLDPTGSHLITPTPSYLPFGAGPRSCIGEALARQE
LFIFMALLLQRFDFDVSDDKQLPCLVGDPKVVFLIDPFKVKITVRQAWKDAQVEVST

>76. Cyp19         Mm.5199  genbank D00659
MFLEMLNPMQYNVTIMVPETVTVSAMPLLLIMGLLLLIWNCESS
SSIPGPGYCLGIGPLISHGRFLWMGIGSACNYYNKMYGEFMRVWISGEETLIISKSSS
MFHVMKHSHYISRFGSKRGLQCIGMHENGIIFNNNPSLWRTIRPFFMKALTGPGLVRM
VEVCVESIKQHLDRLGEVTDTSGYVDVLTLMRHIMLDTSNMLFLGIPLDESAIVKKIQ
GYFNAWQALLIKPNIFFKISWLYRKYERSVKDLKDEIAVLVEKKRHKVSTAEKLEDCM
DFATDLIFAERRGDLTKENVNQCILEMLIAAPDTMSVTLYFMLLLVAEYPEVEAAILK
EIHTVVGDRDIKIEDIQNLKVVENFINESMRYQPVVDLVMRRALEDDVIDGYPVKKGT
NIILNIGRMHRLEYFPKPNEFTLENFEKNVPYRYFQPFGFGPRGCAGKYIAMVMMKVV
LVTLLRRFQVKTLQKRCIENIPKKNDLSLHPNEDRHLVEIIFSPRNSDKYLQQ

>77. Cyp20 AK020848 Mus musculus Cyp20 adult retina cDNA plus ESTs for C-term 
MLDFAIFAVTFLLALVGAVLYLYPASRQASGIPGLTPTEEKDGN
LPDIVNSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTTDVLKQHFNPNKTSDPFET
MLKSLLGYQSGGGSAGEDHVRRKLYGDAVTASLHSNFPLLLQLSEELLDKWLSYPETQ
HIPLSQHMLGFALKFVTRMVLGSTFEDEQEVIRFQKIHG
TVWSEIGKGFLDGSLDKNTTRKKQYQEALMQLESTLKKIIKERKGGNFRQHT
FIDSLTQGKLNEQQILEDCVVFSLASCIITAR LCTWTIHFLTTTGEVQKK
LCKEIDQVLGEGPITSEKIEQLSYCQQVLFETVRTAKLTPVSARLQDIEGKVGPFVIPKE 360
TLVLYALGVVLQDPSTWPLPHRFDPDRFADEPVMKVFSSLGFSGTWECPELXFAYMVTAV 540
LVSVLLEKLRLLAVDRQVVEMKYELVTSAREEAWITVSKRH*

>78. Cyp21         Mm.18845 genbank AF049850
MLLPGLLLLLLLLAGTRWLWGQWKLRKLHLPPLAPGFLHFLQPN
LPIYLLGLTQKLGPIYRIRLGMQDVVVLNSNRTIEEALIQKWVDFAGRPHMLNGKMDL
DLSLGDYSLMWKAHKKLSRSALMLGMRDSMEPLIEQLTQEFCERMRAQAGTPVAIHKE
FSFLTCSIISCLTFGDKDSTLVQTLHDCVQDLLQAWNHWSIQILTIIPLLRFLPNPGL
QKLKQIQESRDHIVKQQLKRHKDSLVAGQWKDMIDYMLQGVEKQRDGKDEERLHEGHV
HMSVVDLFIGGTETTATTLSWAVAFLLHHPEIQKRLQEELDLKLGPGSQLLYRNRMQL
PLLMATIAEVLRLRPVVPLALPHRATRASSISGYDIPKDMVIIPNIQGANLDEMVWEL
PSKFWPDRFLEPGKNPRTPSFGCGARVCLGEPLARLELFVVLARLLQAFTLLPPPDGT
LPSLQPQPYAGINLPIPPFQVRLQPRNLAPQDQGERP

>79. Cyp24         Mm.6575  genbank D89669 AC084066.1
MSCPIDKRRPLIAFLRRLRDLGQPPRSVTSKAHVKRAPKEVPLC
PLMTDGETRNVTSLPGPTNWPLLGSLLEIFWKGGLKKQHDTLAEYHKKYGQIFRMKLG
SFDSVHLGSPSLLEALYRTESAHPQRLEIKPWKAYRDHRNEAYGLMILEGQEWQRVRS
AFQKKLMKPVEIMKLDKKINEVLADFMGQIDELRDERGRIQDLYSELNKWSFESICLV
LYEKRFGLLQKDTEEEALTFIAAIKTMMSTFGKMMVTPVELHKRLNTKVWQAHTLAWD
TIFKSVKPCIDHRLERYSQQPGADFLCDIYQQDHLSKKELYAAVTELQLAAVETTANS
LMWILYNLSRNPQVQQRLLREIQSVLPDNQTPRAEDVRNMPYLKACLKESMRLTPSVP
FTTRTLDKPTVLGEYTLPKGTVLTLNTQVLGSSEDNFEDADKFRPERWLEKEKKINPF
AHLPFGVGKRMCIGRRLAELQLHLALCWIIQKYNIVATDSEPVEMLHLGILVPSRELP
IAFCPR

>80. Cyp26a1 no UNIGENE entry genbank Y12657 NM_007811 EST AA239785
MGLPALLASALCTFVLPLLLFLAALKLWDLYCVSSRDRSCALPL
PPGTMGFPFFGETLQMVLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL
GEHRLVSVHWPASVRTILGAGCLSNLHDSSHKQRKKVIMQAFSREALQCYVLVIAEEV
SSCLEQWLSCGERGLLVYPEVKRLMFRIAMRILLGCEPGPAGGGEDEQQLVEAFEEMT
RNLFSLPIDVPFSGLYRGVKARNLIHARIEENIRAKIRRLQATEPDGGCKDALQLLIE
HSWERGERLDMQALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREEIKSKG
LLCKSNQDNKLDMETLEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELNGYQIPKGW
NVIYSICDTHDVADIFTNKEEFNPDRFIVPHPEDASRFSFIPFGGGLRSCVGKEFAKI
LLKIFTVELARHCDWQLLNGPPTMKTSPTVYPVDNLPARFTYFQGDI

>81. Cyp26b1 mouse AC022779.3 AW047279 GSS AZ741670, AZ369731, BM936625.1
MLFEGLELVSALATLAACLVSVTLLLAVSQQLWQLRWAATRDKSCKLPIPKGSMGFPLIGETGHWLLQ (0)
GSGFQSSRREKYGNVFKTHLLGRPLIRVTGAENVRKILLGEHQLVSTEWPRSARVLLGPNTVANSIGDIHRNKRK (0)
VFSKIFSHEALESYLPKIQLVIQDTLRAWSSQPEAINVYQEAQRLTFRMAVRVLLGFSI 
PEEDLGHLFEVYQQFVENVFSLPVDLPFSGYRR (?)
GIQARQILQKGLEKAIREKLQCTQGKDYSDALDILIESSKEHGKEMTMQELK 
DGTLELIFAAYATTASASTSLIMQLLK
HPAVLEKLREELRAQGLLHGGGCPCEGTLRLDTLSSLRYLDCVIKEVMRLFTPVSGGYRTVLQTFELD (?)
GFQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLP
FGGGVRTCLGKHLAKLFLKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNEILPETEAMLSATV

>82. CYP26C1 mouse XM_140712.1 May 17, 2002 also AC110212.4|
1   MISWGLSCLSVLGAAGTTLLCAGLLLGLAQQLWTLRWTLSRDWASTLPLPKGSMGWPFFG 180
181 ETLHWLVQGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRL 338
339 VLARVFSRSSLEQFVPRLQGALRREVRSWCAAQRPVA 447
448 VYQAAKALTFRMAARILLGLQLDEARCTELAHTFEQLVENLFSLPLDVPFSGLRKGIRAR 627
628 DQLYEHLDEAVAEKLQEKQTAEPGDALLLIINSARELGHEPSVQELK 
    ELAVELL 987
988 FAAFFTTASASTSLILLLLQHPAAITKIQQELSAQGLGRACTCTPRASGSPPDCGC 1155
1156EPDLSLAMLGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELDGYQIPKGWSVMYSIRDT 1335
1336HETAAVYRSPPEGFDPERFGVESGDARGSGGRFHYIPFGGGARSCLGQELAQAVLQLLAV 1515
1516ELVRTARWELATPAFPVMQTVPIVHPVDGLLLFFHPLPTSGAGDGLPF*

>83. Cyp27a1       Mm.26793 8 ESTs follow AI286988
AK004977.1 adult male liver cDNA, RIKEN, BC002183 71% to hum 27a1
MAAWSRTRLRWTLLDPRVVGRGLCPQGARAKATIPAALQAQESTEGPGTGQDRPRLRSP 
AELPGTGTLQFLFQLFLQGYVLHLPDLQVLNKTKYGPMWTTSFGTYTNVNLASAPLLEQV
MRQEGKYPIRDHMDQWKDHRDHKGLTYGIFIAQGEQWYHLRQALKQRLLKPDEAALYTDA
LNEVISDFITRLDQVRAESESGDQVPDMAHLLYHLALEAITYILFEKRIGCLKPSIPEDT
AAFIRSVAIMFQNSVYITFLPKWTRPLLPFWKRYLNGWDNIFSFGKKLID EKVQELKAQL
QETGPDGVRVSGYLHFLLTNELLSTQETIGTFPELLLAGVDT TSNTLTWALYHLSKSPEI
QEALHKEVTGVVPFGKVPQHKDFAHMPLLKAVIKETLRLYPVVPTNSRIITEKETEINGF
LFPKNTQFVLCHYVVSRDPSVFPEPNSFQPHRWLRKKEADNPGILHPFGSVPFGYGVRSC
LGRRIAELEMQLMLSRLVQKYEIALAPGMGEVKTVSRIVLVPSKKVRLHFLQRQ

>84. Cyp27b1       Mm.6216  genbank AB006034 AC084293.2
LEADSPNTDMTQAVKLASRVFHRIHLPLQLDASLGSRGSESVLR
SLSDIPGPSTLSFLAELFCKGGLSRLHELQVHGAARYGPIWSGSFGTLRTVYVADPTL
VEQLLRQESHCPERCSFSSWAEHRRRHQRACGLLTADGEEWQRLRSLLAPLLLRPQAA
AGYAGTLDNVVRDLVRRLRRQRGRGSGLPGLVLDVAGEFYKFGLESIGAVLLGSRLGC
LEAEVPPDTETFIHAVGSVFVSTLLTMAMPNWLHHLIPGPWARLCRDWDQMFAFAQRH
VELREGEAAMRNQGKPEEDMPSGHHLTHFLFREKVSVQSIVGNVTELLLAGVDTVSNT
LSWTLYELSRHPDVQTALHSEITAGTRGSCAHPHGTALSQLPLLKAVIKEVLRLYPVV
PGNSRVPDRDIRVGNYVIPQDTLVSLCHYATSRDPTQFPDPNSFNPARWLGEGPTPHP
FASLPFGFGKRSCIGRRLAELELQMALSQILTHFEVLPEPGALPIKPMTRTVLVPERS
INLQFVDR

>gb|AC009974.7|AC009974 Homo sapiens chromosome 2 clone RP11-459I19, WORKING DRAFT 
SEQUENCE, 7
              unordered pieces
          Length = 208692

 Score = 41.6 bits (96), Expect = 0.010
 Identities = 17/39 (43%), Positives = 29/39 (73%)
 Frame = +3

Query: 7      RIIKMFKVIPDPDLGEVGAINRATLVPNRPVNLQFIERQ 45
              ++I+ +KV+  P+ GE+ ++ R  LVPN+ V LQF++RQ
Sbjct: 196731 QLIQKYKVVLAPETGELKSVARIVLVPNKKVGLQFLQRQ 196847

>gb|AC027416.2|AC027416 Homo sapiens clone RP11-504G11, WORKING DRAFT SEQUENCE, 32 
unordered
              pieces
          Length = 152129

 Score = 41.6 bits (96), Expect = 0.010
 Identities = 17/39 (43%), Positives = 29/39 (73%)
 Frame = -1

Query: 7      RIIKMFKVIPDPDLGEVGAINRATLVPNRPVNLQFIERQ 45
              ++I+ +KV+  P+ GE+ ++ R  LVPN+ V LQF++RQ
Sbjct: 148691 QLIQKYKVVLAPETGELKSVARIVLVPNKKVGLQFLQRQ 148575
27a1 human
TQFVFCHYVVSRDPTAFSEPESFQPHRWLRNSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLARVSWERLVGCVGREGW
RSPGRRGREAQGRSVQSGEWMANTQSTQPHVLFTPQLIQKYKVVLAPETGELKSVARIVLVPNKKVGLQFLQRQC*

181 - TGTGCCCTTTGGCTATGGGGTCCGGGCCTGCCTGGGCCGCAGGATTGCAGAGCTGGAGAT - 
    - C  A  L  W  L  W  G  P  G  L  P  G  P  Q  D  C  R  A  G  D   
    -  V  P  F  G  Y  G  V  R  A  C  L  G  R  R  I  A  E  L  E  M   
    -   C  P  L  A  M  G  S  G  P  A  W  A  A  G  L  Q  S  W  R  C   
241 - GCAGCTACTCCTCGCAAGGGTGAGCTGGGAGAGGCTAGTAGGGTGTGTGGGCAGGGAGGG - 
    - A  A  T  P  R  K  G  E  L  G  E  A  S  R  V  C  G  Q  G  G   
    -  Q  L  L  L  A  R  V  S  W  E  R  L  V  G  C  V  G  R  E  G   
    -   S  Y  S  S  Q  G  *  A  G  R  G  *  *  G  V  W  A  G  R  G   
301 - GTGGAGGAGTCCTGGGAGGAGAGGAAGGGAGGCACAGGGTAGGAGTGTGCAGAGCGGGGA - 
    - V  E  E  S  W  E  E  R  K  G  G  T  G  *  E  C  A  E  R  G   
    -  W  R  S  P  G  R  R  G  R  E  A  Q  G  R  S  V  Q  S  G  E   
    -   G  G  V  L  G  G  E  E  G  R  H  R  V  G  V  C  R  A  G  S   
361 - GTGGATGGCAAACACACAATCCACCCAACCACATGTGCTCTTTACCCCCCAGCTGATCCA - 
    - V  D  G  K  H  T  I  H  P  T  T  C  A  L  Y  P  P  A  D  P   
    -  W  M  A  N  T  Q  S  T  Q  P  H  V  L  F  T  P  Q  L  I  Q   
    -   G  W  Q  T  H  N  P  P  N  H  M  C  S  L  P  P  S  *  S  R   
421 - GAAGTACAAGGTGGTCCTGGCCCCGGAGACGGGGGAGTTGAAGAGTGTGGCCCGCATTGT - 
    - E  V  Q  G  G  P  G  P  G  D  G  G  V  E  E  C  G  P  H  C   
    -  K  Y  K  V  V  L  A  P  E  T  G  E  L  K  S  V  A  R  I  V   
    -   S  T  R  W  S  W  P  R  R  R  G  S  *  R  V  W  P  A  L  S   
481 - CCTGGTTCCCAATAAGAAAGTGGGCCTGCAGTTCCTGCAGAGACAGTGCTGAGCTGAGTC - 
    - P  G  S  Q  *  E  S  G  P  A  V  P  A  E  T  V  L  S  *  V   
    -  L  V  P  N  K  K  V  G  L  Q  F  L  Q  R  Q  C  *  A  E  S   
    -   W  F  P  I  R  K  W  A  C  S  S  C  R  D  S  A  E  L  S  L   




>gb|AC073128.3|AC073128 Homo sapiens chromosome 2 clone RP11-647O5, WORKING DRAFT 
SEQUENCE, 17
             unordered pieces
          Length = 196554

 Score = 41.6 bits (96), Expect = 0.010
 Identities = 17/39 (43%), Positives = 29/39 (73%)
 Frame = -3

Query: 7     RIIKMFKVIPDPDLGEVGAINRATLVPNRPVNLQFIERQ 45
             ++I+ +KV+  P+ GE+ ++ R  LVPN+ V LQF++RQ
Sbjct: 46303 QLIQKYKVVLAPETGELKSVARIVLVPNKKVGLQFLQRQ 46187

>CYP27C1 ********* AC027142 43% identical to 27A1 assembled gene
intron starting with QIH ending in VDT is from Celera's public data
CRA_Gene|hCG42613 /len=10487.  This Celera sequence is still missing the C-terminal.
Probable last exon is now found in AC027142.  AG Intron boundary is in the same
Location as CYP26B1.  Stop codon is one codon away from 26B1s stop codon.  
Length is preserved from cys to intron. (n) = intron phase, 9 exons

  1  85452 MQTSAMALLARILRAGLRPAPERGGLLGGGAPRRPQPAGARLPAGARAEDKGAGRPGSPPG 85634 61
 62  85635 GGRAEGPRSLAAMPGPRTLANLAEFFCRDGFSRIHEIQ (0) 85748 99
100  39574 QKHTREYGKIFKSHFGPQFVVSIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISA (2) 39371 163
164  43984 EGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLFFKYSME (1) 43787 229 
230  41743 GVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREFC 41564 290
291  41563 RSWDGLFKFS 41534 300 (1)
301        QIHVDNKLRDIQYQMDRGRRVSGGLLTYLFLSQALTLQEIYANVTEMLLAGVDT (0) 354 
355 110201 TSFTLSWTVYLLARHPEVQQTVYREIVKNLGERHVPTAADVPKVPLVRALLKETLR (2) 110034 410
411 108566 LFPVLPGNGRVTQEDLVIGGYLIPKG (0) 108489 436
437 108006 TQLALCHYATSYQDENFPRAKEFRPERWLRKGDLDRVDNFGSIPFGHGVRSCIGRRIAELEIHLVVIQ (0) 107794 504
505 102503 LLQHFEIKTSSQTNAVHAKTHGLLTPGGPIHVRFVNRK* 102619 542

extension of seq to stop
RRIAELEIHLVVIQVGRGGQTLAFGFGLFLLGTTG*
alu repeat follows this

3 phase at end of gene
DPAGPLPLCHIVPG*ELPSGQGVPA*ALAAERRLR*S*QFWIHPLWSWGSQLHRAENCRTGDSPRRDPGRAWRPNVGLWVRPLSTG
HNRIKTSRLKSKDGFGNLRSRAVRHCGSHLWSQLLRRLRREAHLNPGVQGCSEP*SHHGTPAWATE*DPIQK*ELCMISFENASPS
PVPTLSSNRVATL*ETLEV*ARPCGDWGPRLHL*WPCLQLGAGLSGTVSSSPPPRERHSGALAGLLCLTDPTIYTEQWFSSFLGCE
TSGSPGCCYRTPYM*RYIILGLHRIWIQAIAGTK*HLKEIL*K*SQWRRKFHPLASPASWRRLFQRRETLARSLGAPSLPEHLENV
GSAW*PHLPILLMRKPRLRKQQPLTKSHSGRCRAGSRCLDFSPTSHAGAPSRLTGRPELGRRESKGPGKPGWLGGGVEGFWMVRLR
VGTRAGEGKGLLPREEAQNEVWGEPPLAQGWLWRHRWPMRLCWGGAGGTKWPSHLCVDVAAAPVPGSM*VAARCVGGAQGKLASPS
VLPLAGSGTLAGFSAT*AEPVTSFTKGAKSLPDRCVRSKNSPRTESVPKKWKLFSSSEPCTKSAVGSVFRKVSAP*NAAVPPAATA
PSRRASVSWEGCLLPPGSGCLKNRGRHLSKRPAVKWYLELVMGSCLLLSTYEYNC*SFCLKSPHFYLGKCRKGASRFLFYFIIFFE
SCSVARLECSGAIGSLQPPPPGVMQFSCLSLWSS*DYRCPSPRPANFCIFK*RWGFIMLARLVPNY*PQVIHLPWPPKVLGLQV*A
TVPGP*FFFFFFRVSPCHPGWSAVAQSRVAATLSPGFKRFSCLSLWSS

RPSWPFATMPHRTRMRTSLGPRSSGLSAGCGKET*IELTILDPSPLVMGFAAA*GGELQNWRFTSS*SR*GVEAKRWPLGSASFYW
AQQDKDFQVEKQRRFWKLKIKSRQALWLTPVVPATQEAEAGGSLEPRSSRLQ*AVITPRHSSLGDRVRPYPKIRALHDIL*KCLPI
SRAHPVLQPRGYTLRNPGGLSEALWGLGTPPPPLVAVSAVRSWPLRHRQQQPPPPGKAQWCSCWSPLFN*PNHLHRAVVLKLLRL*
NIRKPWVLLQNTLHVKIHNSGFAPYLDSGYSRN*ITPEGNFIKVKSMAEKIPPISLSCKLEKTVSATGNLGPKFGSSIVA*TFGEC
GECLVTSSPHFTDEETEAQKATATYQVPQWQVQGRIQVS*LFPYKSCRCPLQAHWETGAGQKGE*GTREAWLAGRRGRGILDGQAQ
GGHQSWRR*GSTAQGGGTERGVGRTPTGSGLALEAQVAHETLLGWGWWDQVAIPPLCGCGRCPCPWEYVSGSSMCRGSSGQAGFTL
SSATCWLRDLGRFLSHLS*ACHVIYKRGQIPA**VCEE*KQSTYRVCAQEVEAVLIF*ALH*VCCGECFQEGLSSLKCCCASRSHC
TLPKGIGELGGMLVAPRLWLPEEPG*TPEQEASGEMVFGARDGVLFAAVNI*I*LLKFLLKKSSFLPGQV*KGSITVFILFYYFF*
VLLCRQARVQWCNRLTATSASWGHAILLPQPLE*LRLQVPITTPS*FLYF*VEMGIHHVGQAGPELLTSSDPPALASQSAGITGVS
HCAWPMIFFFFFQSLTLSPRLECSGAISGRCNSVSRVQAILLPQPLE*L

QTQLALCHYATSYQDENFPRAKEFRPERWLRKGDLDRVDNFGSIPFGHGVRSCIGRRIAELEIHLVVIQVGRGGQTLAFGFGLFLL
GTTG*RLPG*KAKTVLET*DQEPSGTVAHTCGPSYSGG*GGRLT*TQEFKAAVSRDHTTALQPGRQSETLSKNKSSA*YPLKMPPH
LPCPPCPPTAWLHSEKPWRSKRGPVGIGDPASTSSGRVCS*ELASQAPSAAAPPPGKGTVVLLLVSFV*LTQPFTQSSGSQAS*VV
KHQEALGVVTEHLTCEDT*FWVCTVSGFRL*QELNNT*RKFYKSKVNGGENSTH*PLLQVGEDCFSDGKPWPEVWELHRCLNIWRM
WGVPGDLISPFY**GNRGSESNSHLPSPTVAGAGQDPGVLTFPLQVMQVPPPGSLGDRSWAEGRVRDQGSLVGWEEG*RDSGWSGS
GWAPELEKVRVYCPGRRHRTRCGENPHWLRAGSGGTGGP*DSAGVGLVGPSGHPTFVWMWPLPLSLGVCEWQLDV*GELRASWLHP
QFCHLLAPGPWQVSQPPELSLSRHLQKGPNPCLIGV*GVKTVHVQSLCPRSGSCSHLLSPALSLLWGVFSGRSQLPKMLLCLPQPL
HPPEGHR*VGRDACCPQALAA*RTGVDT*ARGQR*NGIWSS*WGLVCCCQHMNITAKVFA*KVLISTWASVEREHHGFYFILLFFL
SLALSPG*SAVVQSAHCNLRLLGSCNSPASASGVAEITGAHHHAQLIFVFLSRDGDSSCWPGWSRTIDLK*STCLGLPKCWDYRCE
PLCLAHDFFFFFSESHPVTQAGVQWRNLGSLQLCLPGSSDSPASASGVA

Celera genomics hit
hCP51916 /version_id=113000036823627 /len=222
/trans=hCT33895 /gene=hCG42613 /gene_sym=na
/gene_name=na /hit_def=sterol 27-monooxygenase (EC1.14.14.-) cytochrome P450 27, 
precursor - human
/hit_id=4503211
Length = 222

Score =  295 bits (748), Expect = 3e-80
Identities = 144/148 (97%), Positives = 146/148 (98%)

Query: 164 EGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLF 223
           EGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLF
Sbjct: 1   EGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLF 60

Query: 224 FKYSMEGVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIP 283
           FKYSMEGVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIP
Sbjct: 61  FKYSMEGVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIP 120

Query: 284 KPWREFCRSWDGLFKFSKRRIETSFTLS 311
           KPWREFCRSWDGLFKFSKRRIE++F  S
Sbjct: 121 KPWREFCRSWDGLFKFSKRRIESTFCRS 148

Query: 388 TQLALCHYATSYQDENFPRAKEFRPERWLRKGDLDRVDNFGSIPFGHGVRSCIGRRIAEL 447
           TQLALCHYATSYQDENFPRAKEFRPERWLRKGDLDRVDNFGSIPFGHGVRSCIGRRIAEL
Sbjct: 155 TQLALCHYATSYQDENFPRAKEFRPERWLRKGDLDRVDNFGSIPFGHGVRSCIGRRIAEL 214

Query: 448 EIHLVVIQ 455
           EIHLVVIQ
Sbjct: 215 EIHLVVIQ 222

Tblastn of celera nucleotide sequence for this gene
>CRA_Gene|hCG42613   /len=10487 /symbol=na /name=na
            Length = 10,487 end of amino acid seq runs of end of contig at 10487

  Plus Strand HSPs: N-terminal part is not in this contig

Query:   164 EGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLF 223
             EGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLF
Sbjct:     2 EGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLF 181

Query:   224 FKYSME 229
             FKYSME
Sbjct:   182 FKYSME 199

Query:   230 GVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREF 289
             GVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREF
Sbjct:  2240 GVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREF 2419

Query:   290 CRSWDGLFKFSKRRIETSFTLS 311
             CRSWDGLFKFSKRRIE++F  S
Sbjct:  2420 CRSWDGLFKFSKRRIESTFCRS 2485

Query:   297 FKFSKRRIEEKVQELKAQLQETGPDGVRVSG-YLHFLLTNELLSTQETIGTFPELLLAGV 355
             F+  +  ++ K+++++ Q+      G RVSG  L +L  ++ L+ QE      E+LLAGV
Sbjct:  4023 FQTGQIHVDNKLRDIQYQMDR----GRRVSGGLLTYLFLSQALTLQEIYANVTEMLLAGV 4190

Query:   356 DTTS 359
             DT S
Sbjct:  4191 DTVS 4202

Query:   306 TSFTLSWTVYLLARHPEVQQTVYREIVKNLGERHVPTAADVPKVPLVRALLKETLR 361
             TSFTLSWTVYLLARHPEVQQTVYREIVKNLGERHVPTAADVPKVPLVRALLKETLR
Sbjct:  8080 TSFTLSWTVYLLARHPEVQQTVYREIVKNLGERHVPTAADVPKVPLVRALLKETLR 8247

Query:   361 RLFPVLPGNGRVTQEDLVIGGYLIPKGTQL 390
             RLFPVLPGNGRVTQEDLVIGGYLIPKG  L
Sbjct:  9712 RLFPVLPGNGRVTQEDLVIGGYLIPKGVSL 9801

Query:   388 TQLALCHYATSYQDENFPRAKEFRPERWLRKGDLDRVDNFGSIPFGHGVRSCIGRRIAEL 447
             TQLALCHYATSYQDENFPRAKEFRPERWLRKGDLDRVDNFGSIPFGHGVRSCIGRRIAEL
Sbjct: 10284 TQLALCHYATSYQDENFPRAKEFRPERWLRKGDLDRVDNFGSIPFGHGVRSCIGRRIAEL 10463

Query:   448 EIHLVVIQ 455
             EIHLVVIQ
Sbjct: 10464 EIHLVVIQ 10487



>85. Cyp39         no UNIGENE entry ESTs AA272844, AA606237
NM_018887.1 AF237981 39a1 (oxysterol 7alpha-hydroxylase) 72% to human
MGIMELFSPIAIAVLGSCVLFLFSRLKNLLGPPCIQGWIPWIGAGLEFGKAPLEFIEKARIKY
GPVFTIFAMGNRMTFVSEEEGINVLLKSEHVDFESAVQSPVYHTAWIPKNVFSALHERLY
ALMKGKMGTFNTHHFTGPLTEELHEQLEGLGTHGTMDLNDFVRYLLYPATLNTLFKKGLF
LTDKRTIKEFYQQFKTYDEGFEYGSQLPEWLLRNWSKSKRWLLALFEKNIGNIKAHGSA
GHSGTLLQAILEVVETETRQYSPNYGLVVLWAALANAPPIAFWTLGYILSHPDIHRTVL
ESISSVFGTAGKDKIKVSEDDLKKLLIIKWCILESVRLRAPGVITRKVVKPVKILNHTVP
SGDLLMLSPFWLHRNPKYFPEPESFKPERWKEANLDKYIFLDYFMAFGGGKFQCPGRW
FALLEIQLCIILVLYKYECSLLDPLPKQSSRHLVGVPQPAGKCRIEYKQRA*

>86. Cyp46         no UNIGENE entry AF094479 NM_010010 ESTs AA096922, R75217 
MSPGLLLLGSAVLLAFGLCCTFVHRARSRYEHIPGPPRPSFLLG
HLPYFWKKDEDCGRVLQDVFLDWAKKYGPVVRVNVFYKTSVIVTSPESVKKFLMSTKY
NKDSKMYRALQTVFGERLFGQGLVSECDYGRWYKQRKVMDLAFSRSSLVSLMETFNEK
AEQLVEILEAKADGQTPVSMQDMLTCATIDILAKAAFGMETSMLLGAQKPLSQAVKVM
LEGISASRNTLAKFMPGKRKQLREIRESIRLLRQVGKDWVQRRREALKRGEDMPADIL
TQILKAEEGAQDDEVLLDNFVTFFIAGHETSANHLAFTVMELSRQPEIVARLQAEVDE
VVGSKRHLDYEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVPGNTPLL
FSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQMEVKVV
MAKLLQRIEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPAPPPPPC

>87. Cyp51         Mm.24155 NM_020010 AF166266
ESTs AA123049, AI007126, W58959, 
              AA144311, AA288897, AA118292, AA537618, 
              AA980997, AA107943, AI314330, AA107043, 
              AI226612, AA105775, AA105777, AA288896, 
              AA162461, AA755063
MEQVTGGNLLSTLLIACAFTLSLVYLFRLAVGHMVQLPAGAKSP
PHIYSPIPFLGHAIAFGKSPIEFLENAYEKYGPVFSLTMVGKTFTYLLGSDAAALLFN
SKNEDLNAEEVYGRLTTPVFGKGVAYDVPNAIFLEQKKIIKSGLNIAHFKQYVPIIEK
EAKEYFQSWGESGERNVFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFT
HAAWLLPAWLPLPSFRRRDRAHREIKNIFYKAIQKRRLSKEPAEDILQTLLDSTYKDG
RPLTDEEISGMLIGLLLAGQHTSSTTSAWMGFFLAKDKPLQEKCYLEQKAVCGEDLPP
LTYDQLKDLNLLDRCIKETLRLRPPIMTMMRMAKTPQTVAGYTIPPGHQVCVSPTVNQ
RLKDSWAERLDFNPDRYLQDNPASGEKFAYVPFGAGRHRCVGENFAYVQIKTIWSTML
RLYEFDLINGYFPTVNYTTMIHTPENPVIRYKRRSK