April 25, 2000

Searches of the human genomic DNA sequences being deposited in GenBank
have identified several new human P450s. These include the five shown
below, among others. One is 43% identical to CYP27A1 and has been named CYP27C1.
See the human section of the homepage for detailed PDF tables sorted by
chromosome, CYP name, or accession number.

>AC027142 CYP27C1, 43% identical to 27A1, partially assembled gene

1 85452 MQTSAMALLARILRAGLRPAPERGGLLGGGAPRRPQPAGARLPAGARAEDKGAGRPGSPPG
48 85635 GGRAEGPRSLAAMPGPRTLANLAEFFCRDGFSRIHE 85742 83
84 39568 LQQKHTREYGKIFKSHFGPQFVVSIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISA 39371 149
150 43984 EGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLFFKYSME 43787 215
(GGT) intron; G amino acid at boundary; (AGGA) at other end of intron
216 41743 GVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREFC 41564
41563 RSWDGLFKFSKRRIE 41519 287
gap of 51 amino acids
340 110201 TSFTLSWTVYLLARHPEVQQTVYREIVKNLGERHVPTAADVPKVPLVRALLKETLR 110034 395
intron LFPVLPGNGRVTQEDLVIGGYLIPKG intron
418 108006 TQLALCHYATSYQDENFPRAKEFRPERWLRKGDLDRVDNFGSIPFGHGVRSCIGRRIAELEIHLVVIQV 107791 493
missing about 30 aa at end
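
The fully annotated lines in the CYP27C1 listing appear to follow the layout "first residue number, first nucleotide coordinate, peptide, last nucleotide coordinate, last residue number". Assuming that layout (it is inferred from the listing, not stated by the author), a minimal sketch can check that the residue numbering and the nucleotide span agree with the peptide length, at three nucleotides per residue. The function name `check_segment` and the returned field names are hypothetical.

```python
import re

# Assumed layout, inferred from the listing above:
#   aa_start nt_start PEPTIDE nt_end aa_end
SEGMENT = re.compile(r"^(\d+)\s+(\d+)\s+([A-Z*]+)\s+(\d+)\s+(\d+)$")

def check_segment(line):
    """Compare peptide length with residue numbering and nucleotide span."""
    m = SEGMENT.match(line.strip())
    if m is None:
        raise ValueError("line does not match the assumed five-field layout")
    aa_start, nt_start = int(m.group(1)), int(m.group(2))
    peptide = m.group(3)
    nt_end, aa_end = int(m.group(4)), int(m.group(5))
    aa_span = aa_end - aa_start + 1          # residues implied by numbering
    nt_span = abs(nt_end - nt_start) + 1     # inclusive nucleotide span
    return {
        "residues": len(peptide),
        "aa_span": aa_span,
        "nt_span": nt_span,
        # Descending coordinates suggest the reverse strand of the accession.
        "strand": "+" if nt_end >= nt_start else "-",
        "consistent": len(peptide) == aa_span and nt_span == 3 * len(peptide),
    }

example = ("84 39568 LQQKHTREYGKIFKSHFGPQFVVSIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISA"
           " 39371 149")
result = check_segment(example)
print(result["aa_span"], result["nt_span"], result["strand"])  # 66 198 -
```

A segment that fails the check is only a candidate for re-inspection; in a partially assembled gene, the numbering may be provisional.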

>AC012525 Homo sapiens chromosome 4. There is a mouse ortholog for this sequence.
Identity in the low 40% range with other mammalian CYP4 family members; new subfamily of CYP4

223491 MAGLWLGLVWQKLLLWGAASAVSLAGASLVLSLLQRVASYARKWQQMRPIPTVARAYPLVGHALLMKPDGR 223279
220816 EFFQQIIEYTEEYRHMPLLKLWVGPVPMVALYNAENVEG 220700
219309 ILTSSKQIDKSSMYKFLEPWLGLGLLT 219232
218377 STGNKWRSRRKMLTPTFHFTILEDFLDIMNEQANILVKKLEKHINQEAFNCFFYITLCALDIIC 218186
217783 ETAMGKNIGAQSNDDSEYVRAVYR 217712
216357 MSEMIFRRIKMPWLWLDLWYLMFKEGWEHKKSLQILHTFTNSV 216229
214155 IAERANEMNANEDCRGDGRGSAPSKNKRRAFLDLLLSVTDDEGNRLSHEDIREEVDTFMFE 213973
210091 GHDTTAAAINWSLYLLGSNPEVQKKVDHELDDV 209993
206422 KSDRPATVEDLKKLRYLECVIKETLRLFPSVPLFARSVSED
206248 YFLTAGYRVLKGTEAVIIPYALHRDPRYFPNPEEFQPERFFPENAQG 206069
206068 RHPYAYVPFSAGPRNCIG 206015
204818 QKFAVMEEKTILSCILRHFWIESNQKREELGLEGQLILRPSNGIWIKLKRRNADER* 204648

>AC025090 CYP2U1 AC000016 has C-term 41% to 2N1 new CYP2 subfamily
intron joints not yet defined

MSSPGPSQPPAEDPPWPARLLRAPLGLLRLDPSGGALLLCGLVALLGWSWLRRRRARGI
77036 PPGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVIGPQVLLAHLARVYGSI 76863
76862 FSFFIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVT 76734
105008 GPVWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPF 105160
105161 SIISNAVSNIICSLCFGQRFDYTNSEFKKMLGFMSRGLEICLNSQVLLVNICPWLYYLPF 105340
105341 GPFKELRQIEKDITSFLKKIIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEE 105517
105518 YLFYIIGDLFIAGTDTTTNSLLWCLLYMSLNPDVQ 105622
107396 KVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLAIPHMTSENT 107554
109370 LQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETFIPFGIG 109540
KRVCMGEQLAKMELFLMFVSLMQSFAFALPEDSKKPLLTGRFGLTLAPHPFNITISRR

>CYP4F22 AC011492 assembled gene, 13 exons, 114537-140651; 66% to 4F3, 65% to 4F11, 63% to 4F2,
59% to 4F8, 64% to 4F12, 57% to AC011537; exact intron boundaries need checking; no ESTs
MLPITDRLLHLLGLEKTAFRIYAVSTLLLFLLFFLFRLLLRFLRLCRSFYITCRRLRCFPQPPRRNWLLGHLGMVS
PNEAGLQDEKKVLDNMHHVLLVWMGPVLPLLVLVHPDYIKPLLGAS
AAIAPKDDLFYGFLKPWLG
DGLLLSKGDKWSRHRRLLTPAFHFDILKPYMKIFNQSADIMH
AKWRHLAEGSAVSLDMFEHISLMTLDSLQKCVFSYNSNCQE
KMSDYISAIIELSALSVRRQYRLHHYLDFIYYRSADGRRFRQACDMVHHFTTEVIQERRR
ALRQQGAEAWLKAKQGKTLDFIDVLLLAR
DEDGKELSDEDIRAEADTFMFEG
HDTTSSGISWMLFNLAKYPEYQEKCREEIQEVMKGRELEELEW
DDLTQLPFTTMCIKESLRQYPPVTLVSRQCTEDIKLPDGRIIPK
GIICLVSIYGTHHNPTVWPDSK
VYNPYRFDPDNPQQRSPLAYVPFSAGPR
NCIGQSFAMAELRVVVALTLLRFRLSVDRTRKVRPELILRTENGLWLKVEPLPPRA*

>CYP4F23P AC011492 assembled gene; 76% to 4F3, 76% to 4F8, 76% to 4F11, 73% to 4F2,
75% to 4F12, 77% to 4F11, 60% to other 4F on this accession; no ESTs
MSLLSLSWLGLGPVAASPWLLLLLVGASWLLARVLAWTYAFYDNCHRLQCFQQPPKRNCF*GHLSLVS
GNEEDMRLMEDLGHYFRDVQLWWLGSFYPVLHLVHPTFTAPVLQAS
AAVALKDMSFYGFLKPWLG
DGLLISAGDKWRWHRHLLTPAFHFKILKPYVKIFNESTNIMH
AKWQRLALEGSVRLEMFEHISLMTLDSLQKCIFSFDSNCQE
KPSEYIDAILELSALSLKRHQHIFLLTDFLYFLTPNGRRFCRACDIVHNFTDAVIQERRR
TLTSQGVDDFLQAKAKSKTLDFIDVLLLAK
DENGKKLSDENIRAEADTFMSG
GHDTTASGLSWVLYNLARYPEYQEHCRQEVQELLKNGDPKEIEW
DDLAQLPFLTMCLKESLRLHSPVSRIHRCCPQDGVLPDGRVIPK
GNTCTISIFGIHHNPSVWPDPEV
YDPFRFDPENLQKTSPLAFIPFSAVPR
NCIGQTFAMAEMKVVLALTLLRFRVLPDHAEPRRKLELIVRAEDGLWLRVEPLSADLQ*
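
The percent identities quoted in the headers above were presumably taken from full-length alignments. As a rough illustration only, the sketch below computes ungapped percent identity for one short segment that has the same length in the CYP4F22 and CYP4F23P listings. Treating these two lines as corresponding exons is an assumption, and the helper name `percent_identity` is hypothetical.

```python
def percent_identity(a, b):
    """Ungapped percent identity between two equal-length peptide strings."""
    if len(a) != len(b):
        raise ValueError("sequences must be the same length (no gaps handled)")
    if not a:
        raise ValueError("sequences must not be empty")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)

# Third line of each listing above (19 residues each).
segment_4f22 = "AAIAPKDDLFYGFLKPWLG"
segment_4f23p = "AAVALKDMSFYGFLKPWLG"

print(round(percent_identity(segment_4f22, segment_4f23p), 1))  # 78.9
```

A single short segment can differ noticeably from the whole-protein figure, and a real comparison would need a gapped alignment, so this value is not expected to reproduce the header percentages.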