Bos taurus Cytochrome P450s

 

Newly updated by David Drane (Ridgeway High School, Memphis)

Last revised July 21, 2005

 

All cytochrome P450 genes that could be assembled from EST data,

mRNA sequences in the nr section of GenBank, and WGS (whole genome

shotgun) sequences at NCBI were assembled and compared to the human

P450 collection.

 

59 complete genes were assembled; this number includes three pseudogenes.

6 partial genes are lacking small regions, and there are several other small

pieces and a few shorter pseudogenes.

 

There are a minimum of 62 CYP genes in the cow. 

 

The old Bos taurus sequence page has been preserved here.

 

 

CYP1 family

 

CYP1A1

AB060696.1 = NP338778, TC208445 XM_588298 CC766779.1 Bac end

MFSVFGLPIPISATELLLASAVFCL

VFWVVRTWRPRVPQGLKSPPEPWGWPLLGHMLMLGKNPHVVLSQLSQRYGDVLQIRIG

CTPVLVLSGLDTVRQALVRQGDDFKGRPDLYSFTLITNGQSMTFNPDSGPVWAARRRL

AQNALKSFSTASDPASSSSCYLEEHVNKEAKYLLGKFQELMSGPGRFDPYRYIVVSVA

NVICAICFGRRYDHNDQEFLSLVNLSNEFGEITASGNPSDFIPVLRYLPNTALDLFKD

LNQRFYVFVQKIVKEHYKTFEKGHIRDITDSLIEHCQDKRLDENANIQLSDEKIINVV

IDLFGAGFDTVTTALSWSLLYLVTSPRVQKKIQEELDTVIGRARRPRLSDRPQLPYLE

AFILETFRHSSFVPFTIPHSTTRDSNLNGFYIPKGRCVFVNQWQINHDQKLWEDPSEF

RPERFLTADGTINKVLSEKVIIFGLGKRKCIGETIARLEVFLFLAILLHQVEFCVTPG

VKVDMTPVYGLTMKYARCEHFQAHMRS

 

CYP1A2

82% to human 1A2

Database location  : AAFC01721680   742 to 1116 (-)

Genomic location   : SCAFFOLD261464   3 to  377 (+)

trace file gnl|ti|669459617

MALSQLSPFSAMELLLASAIFCLVFWVVRTWRPRVPQGLKSPPEPWGWPLLGHMLTLG

KNPHVVLSQLSQRYGDVLQIRIGCTPVLVLSGLDTVRQALVRQGDDFKGRPDLYSFTLVT

DGQSMTFNPDSGPVWAARRRLAQNALNTFSVASD

PSSSSSCYLEDHVSKEAEALLGKFQELMSGPGRFDPYGHVVASV

ANVIGAMCFGQHFPQSSKEMLSLVESSHDFVESASSGNPVDFFPILKYLPNPALQRFK

SFNQRFLQFVRKTVQEHYQDFDKNSIQDIIGALFKHSEDNSRASSRLISQEKTVNLVN

DLFAAGFDTITTAISWSLMYLVTNPKIQRKIQEELD

RVVGRARRPRLSDRPQLPYLES

FILETFRHSSFVPFTIPHSTTRDTTLNGFFIPKERCVFINQWQVNHDPKLWGDPSVFR

PERFLTSDGTTIDKTASEKVLLFGMGKRRCIGEVMARWEVFLFLAILLQRLEFSVPPG

VKVDLTPTYGLTMKHARCEHMQARLRFPIK
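The "Database location" and "Genomic location" lines above give a coordinate range and a strand. As a minimal sketch (the function name and the returned field names are illustrative assumptions, not part of the original page), such a line can be parsed and its span converted to a codon count:

```python
import re

# Matches lines such as
# "Database location  : AAFC01721680   742 to 1116 (-)".
LOCATION_RE = re.compile(
    r"^(?P<kind>Database|Genomic) location\s*:\s*(?P<acc>\S+)\s+"
    r"(?P<start>\d+)\s+to\s+(?P<end>\d+)\s+\((?P<strand>[+-])\)\s*$"
)


def parse_location(line):
    """Parse one location line into a dict (hypothetical helper)."""
    m = LOCATION_RE.match(line.strip())
    if m is None:
        raise ValueError(f"not a location line: {line!r}")
    start, end = int(m["start"]), int(m["end"])
    span = end - start + 1  # coordinates are inclusive
    return {
        "kind": m["kind"],
        "accession": m["acc"],
        "start": start,
        "end": end,
        "strand": m["strand"],
        "span_nt": span,
        "codons": span // 3,  # whole codons covered by the range
    }


db = parse_location("Database location  : AAFC01721680   742 to 1116 (-)")
gen = parse_location("Genomic location   : SCAFFOLD261464   3 to  377 (+)")
print(db["span_nt"], db["codons"], gen["span_nt"])
```

Both ranges for this CYP1A2 hit cover the same number of nucleotides, which is a quick consistency check on the two coordinate systems.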

 

CYP1B1

82% to human 1B1

MATGLSPDDHLSPTLLSVQQTMLLLLLSVLAAVHVGQWLLRQRRRQPGSAPPGPFAWPLI

GNAASMGSAPHLLFARLARRYGDVFQIHLGSCRVVVLNGERAIRQALVHQSAAFADRPPF

ASFRLVSGGRSLAFGQYSESWKAQRRAAHSTMRAFSTRQPRGRRVLEGHVVGEVRELVEL

LVRRSAGGAFLDPRPLTLVAVANVMSALCFGCRYSHDDAEFLELLSHNEEFGRTVGAGSL

VDVLPWLQRFPNPVRTAFREFEQLNRNFSNFVLDKFLRHRESLRPGAAPRDMMDAFIHSA

GADSGDGGPRLDVDYVPATVTDIFGASQDTLSTALQWLLVLFTR (2)

YSEVQARVQAELDQVVGRHRLPTLEDQPRLPYVMAFLYEAMRFSSFVPVTIPHATTANAS

VLGYHIPKDTVVFVNQWSVNHDPVKWSNPEDFDPTRFLDKDGLINKDLTGSVMVFSVGKR

RCIGEEISKMQLFLFISILAHQCNFKANPDEPSKMDFNYGLTIKPKSFKINVTLRESMEL

LDSAVQKLQVEKECQ*

 

CYP1A8P ortholog (probably a new subfamily since it is functional in opossum)

68% to 1A8P 40% to CYP1A1

BM25891 SCAFFOLD 61062 SCAFFOLD 129297 SCAFFOLD 245449

AAFC02085398.1

AAFC02099349.1

MIFGMAVTSGEVTTSRIILVMVFVFVRELGNKGRKEVFPPGPWSLPIVENLLQLG

DHLYFTFMEMRKKYGDVFLIKLGMVPVLVVNGMEMVKEVLLRNGEHFAA*PNV

LTFSFLAQ*KSLTFS

NYGENWTLHKKIASNALRTFPKAETKSSTRSCLLEKHVIEEVSELVKV

FTELTSRSGSFEPRGAITCAMANVV

CTLCFGKRYDHSDEEFLRIVKTDHDLLKASSAANPADFIPYF*YLPLRIINAPQEFYHARNQ

FIALHIRDHLTT

CPQDHIQDITDALINACHNKYAVAKITILNDDEIISTVSDLVGAG

FEIISTCIYWSFLYLIYYPEIQVKIQEEI

DGNTGMKSPRFENRKILP

YTEAFINEIFRHTSFLPFTIPHC (2)

TTADTTLNGYFIPRKTCTFINMYQVNHDE (2)

TIWDNPNLLRPERFLNENRELNKNLIEKIFIFGMGIQKCL

REEVAQNEVFVFITTVLQQLTLKKCPVVKLDLTPTYGLVMKPKPYQLPAEPRSMGSSCS*

 

CYP2 family

 

CYP2A13

90% to 2A13 86% to 2A7

CB434432.1 CB463229 TC193989

MLASGLLLVALLACLTIMVLMSVWRQRNLKGKLPPGPTPLPFIGNYLQLNTEQMCNSLMK

ISEHYGPVFTV

HLGTRQIVVLCGYDAVKEALVDQAEEFSGRGKQATFDWLFKGYGVAFSNGERAKQLRRFS

ITTLRDFGVGKRGIEERIQEEAGFLIEAFRGTRS

AFIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSTGQ

1 LYEMFYSVMKYLPGPQQQAFKELQGLEDFIAKKVEQNQRTLDPNSPRDFIDSFLIRMQEEKENPNTEFYRK 177

178 NLVMTTLNLFFAGTETVSTTMRDGFLLLMKHPDVEAKIHEEIDRVIGKNRQPKFEDRAKM 357

358 PYTEAVIHEIQRFGDMIPMGLARRVTKDTKFRDFLLPKGTEVFPMLGSVLRDPKFFSNPR 537

538 DFNPQHFLDEKGQFKKSDAFVPFSIGKRYCFGESLARMELFLFFTTIMQNFRFKSPQS 711

712 PQDINVSPKLVGFATIPPNYTMSFLPR*

 

CYP2B6

76% to 2B6 CN790156.1 CN787808.1 CB222090 TC193614

40 MELSMLLLFALLTGLLVLLARGRPKAHGRLPPGPRPLPFLGNLLQMDRKGLLKSFLR

FQQKYGDVFTVYLGPRPVVIICGTEAIREALVDQAEVFSGRAKIAVVDPIFQGY

GVIFANGERWKALRRFSLATMRDFGMGKRSVEERIQDEAQCLVEELRKSQ

GALQDPVFYFHSITANIICSIVFGKRFDYRDPEFLRLLELLFQSFVLISSLSSQ

LFELYSSFLKYFPGSHRQIYKNLQEINVFIGRSVEQHRETLDPNAPRDFIDCYLLRMEKDKSNPQSQFDHQN

LIMSVLSLFFAGTETTSTTLRYGFLLMLKYPHITERIQKEIDQVIGSYR

PALDDRAQMPYTDAVIHEIQRFADLIPIGVPHMVTKDTHFRGYILPK

GTEVYPVLSSALHESCYFEKPDDFNPDHFLDANGVVKKNDAFMPFSI

GKRICLGEGIARIELFLFFTTILQNFSVASPVAPEDIDLTPQESGVGNVPPNYRIQFLPRQRG*

 

CYP2B pseudogene fragment

SCAFFOLD 393933 stop codon same as in human 2B7

PALDDRAQMPYTDTVIHEIQRFADLISIGVSHMDAKDAHF*GYILPK
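In these listings an asterisk inside a sequence marks an in-frame stop codon, and a trailing asterisk marks the normal end of the protein. The sketch below (the helper name is an assumption made for illustration) reports the 1-based positions of internal stops, using the pseudogene fragment above and the matching CYP2B6 exon for comparison:

```python
def internal_stops(seq):
    """Return 1-based positions of '*' that are not the last character."""
    seq = seq.strip()
    last = len(seq) - 1
    return [i + 1 for i, aa in enumerate(seq) if aa == "*" and i != last]


# CYP2B pseudogene fragment (SCAFFOLD 393933), copied from the entry above.
pseudo_exon = "PALDDRAQMPYTDTVIHEIQRFADLISIGVSHMDAKDAHF*GYILPK"
# Corresponding exon of the CYP2B6 entry above.
cyp2b6_exon = "PALDDRAQMPYTDAVIHEIQRFADLIPIGVPHMVTKDTHFRGYILPK"

stops = internal_stops(pseudo_exon)
print(stops)
# Residue found in CYP2B6 at the position of the first stop.
print(cyp2b6_exon[stops[0] - 1])
```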

 

CYP2C85 missing exons 8,9

76% to 2C18 CB422177 AAFC02018840.1 TC198839 SCAFFOLD251527 SCAFFOLD131911

SCAFFOLD45421

Last two exons from XM_612374, see controversy below about exon 6

MDLPVVLVLCLCCLLLISLWKQSSGKGKLPPGPTPLPILGNILQLDVKDISKSVSN

LSKVYGPVFTLYFGMNPLVVLHGYEAVKEALIGLGEEFSGRGSCPVIQRASKGY

GVIFSNGKIWKETRRFSLMTLRDFGMGKRSMEDRVQQEACCLVEELRKTD

GLPCDPTFILGCAPCNVICSIIFQNHFDYKDQIFLDLMERLNENARILGSPWIQ

LCSSFPALIDYVPGKHKKFFENYACMKSYVLEKTREHQASLDMNNPRDFIDCFLTKMEQ

EKHNQELEYTVENLAHTVLDLFVAGTETTSTTLRYGLLLLLKHPEVT

AKVQEEIDHVIGRHRSPCMQDKSHMPYTDAVVHEIQRYIDLVPTNLPHAVTCDIKFRNYLIPK

GTGILTSLTSVLYDDKEFPNPEVFDPGHFLDESGNFRKSDHFMAFST

GKRICVGEGLARMELFLFLTTILQNFTLKSVVDPKDLDTTPVVNGLLSVPPFYQLCFIPV*

 

duplicate sequence on same contig AAFC02018840.1

EST CB422177.1 matches yellow region

PCDPTFILGCAPCNVICSIIFQNHFDYKDQIFLDLMERLNENARILGSPWIQ

LCSSFPALIDYVPGKHKKFFENYACMKSYVLEKTREHQASLDMNNPRDFIDCFLTKMEQ

EKHNQELEYTVENLAHTVLDLFVAGTETTSTTLRYGLLLLLKHPE

TAKVQEEIDHVIGRHQSPCMQDKSHMPYTDAVVHEIQRYIDLVPTNLPHAVTCDIKFRNYLIPK

 

CYP2C fragment C-term exons 8,9

CB421823 AAFC02018839 (exons 8,9) 88% to 2C87 81% to 2C18

Matches XM_612374 predicted from NW_616319

Same as 2C85 except yellow area in exon 6

EST CK949679.1 matches this yellow region; this EST is from 2C87

There might be an assembly error joining an exon from 2C87 into the 2C85 gene

Or there might be two exon 6 sequences in one gene.

SCAFFOLD54892 matches exon 6

SCAFFOLD131911 matches exon 5

LPCDPTFILGCAPCNVICSIIFQNHFDYKDQIFLDLMERLNENARILGSPWIQ

LCSSFPALIDYVPGKHKKFFENYACMKSYVLEKTREHQASLDMNNPRDFIDCFLTKMEQ

EKHNHQSEYTFENLTITVSDLFGAGTETTSTTLRYGLLLLLKHPEVT

AKVQEEIDHVIGRHRSPCMQDKSHMPYTDAVVHEIQRYIDLVPTNLPHAVTCDIKFRNYLIPK

GTGILTSLTSVLYDDKEFPNPEVFDPGHFLDESGNFRKSDHFMAFST

GKRICVGEGLARMELFLFLTTILQNFTLKSVVDPKDLDTTPVVNGLLSVPPFYQLCFIPV*

 

CYP2C86

58% to 2C19 CB428210 AAFC02035479.1 complete

MERLEITTLALVICVTCLVFLFVWKKSHKGLGKLPPGPTPLPIIGNLMQLNLKDIPASLSK

LAKQYGPVYTLHLGSQTTVVLHGYEVVKEALIDQGDEFLGRAHFPIIDDTQRGY

GLIFSNGDTWKQMRRFSSLMTLRDFGMGKRSLEERIQEEAQFLVEEFRKSE

AQPFNPAVTLSCATCNIICSILFNERFHYQDKTLHSLLDLLNENFNRISSLWNQ

IYNLWPKLIKPLPGEHRAFSKRLKDVHYFVLEKVKEHQKSLNHNNPRDYIDCFLSRMEQ

EKQNPESQFHLENLATCGSNLFSAGVETTTATLSYGFLLLMKYPEVQ

AKVHEEIDRVIGRTRSPCMKDKMKLPYTEAVLHEIQRYVTLVPSNLPHAVVQDTKFRQYVIPK

GTTVLPLLSSILYDCKEFPNPEKFDPGHFLDKNGSFRKTKYFVAFSI

GKRACVGEGLAQMELFLFFTTILQNFVLKPLGETKDIETKPIVIGLINMPPPFKLCLIPR*

 

CYP2C87

78% to 2C19

CN791444 CK951564 SCAFFOLD 53069 CN791418.1 CK949679.1 CB441113

MDLAVVLVLCLSCLLLLSLWKQSSGKGKLPPGPTPLPILGNIFQLDVKNISKSLTS

LSKVYGPVFTVYFGMKPTVVLHGYEAVKEALIDLGEEFSRRGSFPVIERNVKGH

GIVFSNGKTWKETRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTN

GLPCDPTFILGCAPCNVICSIIFQNRFDYKDQTFLNLMKTINENIKILGSPWIQ

VLNIFPVLLDFFPWSYSYKKLYTNTAYVKNYVLEKTREHQASLDINNPRDFIDCFLIKMEQ

EKHNHQSEYTFENLTITVSDLFGAGTETTSTTLRYGLLLLLKHPEVT

AKIQEEIDRVIGRHRSPCMQDRTHMPYMDAVLHEIQRYIDLAPTSVPHAVNCDVKFRNYLIPK

GTDILTSLTSVLHDDKEFPNPEVFDPGHFLDENGNFRKSDYFMAFSAGKRVCVGEGLA

RMELFLFLTTILQTFTLKSVVDPKDLDTTPAVTGIANVPPPYQLCFIPV*

 

CYP2C87-de2b fragment exon 2

6kb downstream of 2C87 without an intervening exon 1, same orientation

LSKVCGPVFTVYFGMKPTVVLHGYEALQEALIDLGEEFSGRYSFPVNEKTRRGH

 

CYP2C fragment exons 7,8,9

BZ845111 SCAFFOLD133344 XM_612392 predicted from NW_616354 81% to 2C89

AAFC02088219

Note: exon 7 has the same sequence as in 2C87, but exons 8,9 differ.

Possible alternative splice variant of 2C87.  EST CK949679.1 supports

exon 7 joining to exons 8 and 9 that differ from these,

so these exons may be skipped over.

Gene order might be 2C87 exons 1-7, then these exons 8,9,

followed by CYP2C87-de2b and the exons 8,9 of 2C87 above.

AKIQEEIDRVIGRHRSPCMQDRTHMPYMDAVLHEIQRYIDLAPTSVPHAVNCDVKFRNYLIPK

GTTILTSLTSVLHDGKEFPNPEQFDPGHFLDKSGNFKKSDYFMVFSA

GKRVCVGEGLARMELFLLLVSILQKFTLKPLVDPKNIDTAPLLKGVGSIPHFYEVCFIPV*

 

CYP2C88

78% to 2C19

CN786649 CK956312 CN787335.1 CN786468.1 CC503360

MDLAVVLVLCLSCLLLLSLWKQSSGKGKLPPGPTPLPILGNILQLDVKNISKSLTN

LSKVYGPVFTVYFGMKPIVVLHGYEAVKEALIDLGEEFSGRGMFPLAERANIVN

GILFSNGKTWKEIRRFSLMTLRNFGMGKRSIEDRVQEEACCLVEELRKTN

GLPCDPTFILGCAPCNVICSIIFQNRFDYKDPVFLDLMERLNEILRILSSPWVQ

VCNNFPALFDYLPGSHNKVLKNVANLKSFVLEKAMEHKASLDINNPRDYIDCFLIRMEQ

EKQNQQLEFTLENLTTTVFDLFGAGTETMSTTLRYGLLLLLKHPEVT

AKVQEEIDRVIGRHRSPCMQDRSHMPYTDAVVHEIQRYIDLVPSSLPHMVTHDIELRNYIIPK

GTGVLVSLTSVLYDDKVFPNPEMFDPGHFLDDSGNFKKSDHFMPFSA

GKRICAGESLARMEVFLFLTVILQKFTLKSVVDPKDIDTTPIANGFASVPPPYKLCFIPL

 

CYP2C89 missing exon 1 and part of exon 2

69% to 2C18 TC211258 CB531366 AY265992

     XXXXX

   3 XXXXXGPVFTLYFGMKPTVVLHGYEAVKQVLIDQSEEFSGRGSLPVADNINQGL

     GIVFSNGEIWKQTRRFSLMVLRNMGMGKRTIEHRIQEEALCLVEALKKTN

     GSPCDPTLLLSCAPCNVICSIIFRNRFEYNDERLLTLIKYFNENSRLVSTPWVE

     LYNTFPSLLHYFPGSHNTIFKNMTEQRKFILEEIKKHQESLDLNNPQDFIDYFLIKMEK

     EKHNKHSEFTMDNLITTVWDVFSAGTETTSLTLRYGLLLLLKHPEVT

     AKVQEEIDRVVGRNRSPCMQDKSCMPYTDAVLHEIQRYIDLVPSSMPHAATQDVKFREYLIPK

     GTVILTSLTSVLHDDNEFSNPGQFDPGHFLDESGNFKKTDHFMAFSA

     GKRVCVGEGLARMELFLLLVSILQHFTLKSVVDPKHIDTAPSFKGLISIPPFCEMCFIPV* 1292

 

CYP2C90 partial seq.  missing exon 1

CB222086 82% to 2C18 77% to 2C87 SCAFFOLD35528 AAFC02073759

DR713168

LSNTYGPVFTVYFGLRPTVVLHGYEAVKEALIDQGEEFSGRGNIPMSQRVNKGY

GIIFSNGKRWKEIRRFSLMTLRNFGMGKRSIEDRVQEEAHCLVEELRKTN

GSPCDPTFILGCAPCNVICSIIFQNRFDYTDQNFLNLLDKFNENLQVVSSPWMQ

VCNTFPILIDYFPGSHNKLFKNFAYIRSYVLEKVKEHQATLDINNPRDFIDCFLIKMEQ

EKHNQEMEFTFENLIASVSDLFGAGTETTSTTLRYGLLMLLKHPEVT

AKVQEEIDRVIGRHRSPCMQDRSHMPYMDAVVHEIQRYIDLVPTNLPHAVTRDIKFRNYLIPK

GTTVVTSLSSVLHDEKEFPNPKVFDPAHFLDESGNFKKSDYFMAFSA

GKRSCVGEGLARMELFLFLTTILQKFTLKSVVDPKDLDTTPVSSGFGHVPPPYQLCFTPL*

 

CYP2C exon 1

SCAFFOLD291268

MDLFVVLVICLSILIFLFLWNQRHAKGKLPPGPTPLPIVGNILQINIKNVSKSISK

 

CYP2C partial pseudogene sequence with frameshift and stop codon exons 1-3

SCAFFOLD113483

MDLAVVLVLYFSCLLLLSLWKQSSGKGKLLPGPTPLPILGNTLQLDVNDISKSLSD

LSEVCGPVFTVYFGMKPTVVLYGYEAVKETLI

DLGEEFSGRGSLPLPERISKGH

GILFSSGKRWKETRRFSLVTLRNFRMGE*SVEDRVQEEARCLVEELRKTNG

 

CYP2C pseudogene fragments

SCAFFOLD51208

EKHNQELEFTVENLMITVSDLFGAGTEMTSTTLRYGLLLLLKHPEVT

AKFQEEIDHMIGRDQSPCMQDRSHMPYVDAVVHEIQRYIDL (frameshift)

IPTNLPLAVTHDVKFRNYLIPK

Gap missing exon 8

XXXXXXXEGLTCVELFLFLTTILQ (frameshift)

KFTLKSVVDPKDIDTTPIVN

 

CYP2C fragments exons 5-7, two copies, 94% identical

BZ878104 78% to 2C18 87% to 2C89 SCAFFOLD40567 no ESTs

AAFC02042798.1

LYNAFPSLLHYLPGSHNTLFKNMTEQRKFILEKIKEHQESLDLNNPQDFIDYFLIKMEK

EKHNKQSEFTMDNLITTVWDVFTAGIETTSISLKYGLLLLLKHPEVT

AKVQEEINRVVGRNRSPCMQDRSRMPYTDAVIHEIQRYIDIVPNNLPHAAAQDIKFREYLIPK

 

Second partial gene on same contig AAFC02042798.1

Note: exons 8,9 not found between these.

LYNAFPSLLHYLPGSHNTLFKNMTEQRKFILEKIKEHQESLDLNNPQDFIDYFLIKMEK

EKHNKQSEFTMDNLITTIWDVLTAGIETTSLTLRYGLLLLLKHPEVT

AKVQEEINHVVGRNRSPCMEDRSRMPYTDAVIHEIQRFIDLVPNNLPHAAAQDIKFREYLIPK
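The two copies above are described as "94% identical". A minimal sketch of an ungapped percent-identity calculation over the two listed protein fragments follows. It assumes the copies align position by position with no gaps, and the function name is illustrative; the figure on this page may have been obtained differently (for example from a BLAST alignment or at the nucleotide level), so small differences are to be expected.

```python
def percent_identity(a, b):
    """Percent of identical positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length (no gaps handled)")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# Exons 5-7 of the first copy on AAFC02042798.1, concatenated.
COPY_1 = (
    "LYNAFPSLLHYLPGSHNTLFKNMTEQRKFILEKIKEHQESLDLNNPQDFIDYFLIKMEK"
    "EKHNKQSEFTMDNLITTVWDVFTAGIETTSISLKYGLLLLLKHPEVT"
    "AKVQEEINRVVGRNRSPCMQDRSRMPYTDAVIHEIQRYIDIVPNNLPHAAAQDIKFREYLIPK"
)
# Exons 5-7 of the second copy on the same contig, concatenated.
COPY_2 = (
    "LYNAFPSLLHYLPGSHNTLFKNMTEQRKFILEKIKEHQESLDLNNPQDFIDYFLIKMEK"
    "EKHNKQSEFTMDNLITTIWDVLTAGIETTSLTLRYGLLLLLKHPEVT"
    "AKVQEEINHVVGRNRSPCMEDRSRMPYTDAVIHEIQRFIDLVPNNLPHAAAQDIKFREYLIPK"
)

print(f"{percent_identity(COPY_1, COPY_2):.1f}% identical")
```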

 

CYP2D14

TC205271 78% to 2D6

MGLLSGDTLGPLAVALLIFLLLLDLMHRRSRWAPRYPPGPTPLPVLGNLLQVDFEDPRPSFNQ

LRRRFGNVFSLQQVWTPVVVLNGLAAVREALVYRSQDTADRPPPAVYEHLGYGPRAEG

VILARYGDAWREQRRFSLTTLRNFGLGKKSLEQWVTEEASCLCAAFADQA

GRPFSPMDLLNKAVSNVIASLTFGCRFEYNDPRIIKLLDLTEDGLKEEFNLVRKV

VEAVPVLLSIPGLAARVFPAQKAFMALIDELIAEQKMTRDPTQPPRHLTDAFLDEVKE

AKGNPESSFNDENLRLVVADLFSAGMVTTSTTLAWALLLMILHPDVQR

RVQQEIDEVIGQVRRPEMGDQALMPFTVAVVHEVQRFADIVPLGLPHMTSRDIEVQGFHIPK

GTTLITNLSSVLKDETVWEKPFRFHPEHFLDAQGRFVKQEAFIPFSA

GRRACLGEPLARMELFLFFTSLLQHFSFSVPAGQPRPSEHGVFAFLVTPAPYQLCAVPR*

 

CYP2D43

SCAFFOLD 102164

5681 MGLLSGDTLGPLAVALLIFLLLLDLMHRRSRWAPRYPPGPMPLPVLGNLLQVDFEDPRPSFNQ

     LRRRFGNVFSLQQVWTPVVVLNGLAAVREALVYRSQDTADRPPQALYKHLGFGPRAEG 6760

7291 VILARYGNAWREQRRFSLSTLRNFGLGKKSLEQWVTEEASCLCAAFADQA 7449

7550 GHPFSPMDLLNKAVSNVIASLTFGCRFEYNDPRIVKLLDVMEDGLKEEMKIMRQV 7714

8109 VEAVPVLLSIPGLAAKVVPGQKAFMTLVDELIAEQKMTRDPTQPPRHLTDAFLDEVKE 8288

     AKGNPESSFSDENLRLVVADLFSAGMVTTSTTLAWALLLMILHPDVQR 8591

8806 RVQQEIDEVIGQVRRPEMGDQALMPFTVAVVHEVQRFADIVPLGLPHMTSRDIEVQGFHIPK 8985

9424 GTTLITNLSSVLKDETVWEKPFRFHPEHFLDAQGRFVKQEAFIPFSA 9603

     GRRACLGEPLARMELFLFFTSLLQHFSFSVPAGQPRPSDHGVFVALVTPAPYQLCAVPR 9843

 

CYP2E1

AJ001715 TC189658 79% to human 2E1

MAALGITVALLVWMATLLFISIWKHIYSSWKLPPGPFPLPIIGNLLQLDIKNIPKSFTR

LAERYGPVFTLYLGSQRAVVVHGYKPVKEVLLDYKNEFSGRGENPGFQMHKNN

GIIFNNGSTWRDTRRFSLTTLRDLGMGKQGNEQRIQREAHFLLEVLRKTQ

GQPFDPTFVVGFAPYNVISDILFHKRFDYKDQTSLRLMSLFNENFYLLSSPWIQ

LYNNFPDYLQYLPGSHRKLLKNVSEVKSYALERVKDHQKSLEPSCPRGFLDTMLIEM

AKERHSVDPMYTLENIAVTVADLLFAGTETTSTTLRYGLLILMKYPEVE

EKLHEEIDRVIGPSRIPAVKDRLDMPYLDAVVHEIQRFIDLLPSNLLHEATQDTVFRGYVIPK

GTVVIPTLDSVLHDRQEFPEPEKFKPEHFLNENGKFKYSDHFKAFSA

GKRVCVGEGLARMELFLLLAAILQHFNLKSLVDPKDIDLSPIAIGFGKIPPRYKLCLIPRSKV*

 

CYP2F1 missing exon 1

AW312559.1 AAFC02178016.1 AAFC02183743.1 AJ459276 CC540963 BAC end

XXXXX

LSKEFGAVYTVYLGPRRVVVLSGYQAVKEALVDQAEEFGGRGDYPVFFNFTKGN

GIAFSNGDRWKVLRKYSVQILRNFGMGKRTIEERILEEGHFLLEELRKTQ

GKPFDPTFVVSRSVSNIICSVIFGSRFDYDDDRPLSIIHLINENFQIMSSPWGE

MYNIFPNLLDWVPGPHRRLFKNYGRIKDIIARSVREHQASLDPNSPRDFIDCFLTRWH

QEKQDPLSHFFMDTLLMTTHNLLFGGTETVGTTLRHAFRLLMKYPEVQ

VRVQEEIDRVVGHERLPTVEDRAAMPYTDAVIHEVQRFADVIPMSLPHRVTRDTNFRGFTIPR

GTDVITLLNTVHYDPSQFLKPKEFNPEHFLDANMSFKKSPAFMPFSA

GRRLCLGEALARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPYQLCVLAR

 

CYP2G1

83% to mouse (88% to human pseudogene)

Database location  : AAFC01082069   3860 to 4039 (+)

Genomic location   : SCAFFOLD219575 3860 to 4039 (+)

Database location  : AAFC01082069   4854 to 5015 (+)

Genomic location   : SCAFFOLD219575 4854 to 5015 (+)

Database location  : AAFC01082069   6748 to 6897 (+)

Genomic location   : SCAFFOLD219575 6748 to 6897 (+)

Database location  : AAFC01082069   8738 to 8899 (+)

Genomic location   : SCAFFOLD219575 8738 to 8899 (+)

Database location  : AAFC01082069   9151 to 9327 (+)

Genomic location   : SCAFFOLD219575 9151 to 9327 (+)

Database location  : AAFC01202105   145 to 300 (-)

Genomic location   : SCAFFOLD199473 145 to 300 (-)

Database location  : AAFC01421374    997 to 1185 (+)

Genomic location   : SCAFFOLD281678 3294 to 3482 (-)

Database location  : AAFC01421374   1314 to 1454 (+)

Genomic location   : SCAFFOLD281678 3025 to 3165 (-)

Database location  : AAFC01603617    586 to  765 (+)

Genomic location   : SCAFFOLD281678 1315 to 1494 (-)

3860 MELGGAFTIFLALCLSCLLILIAWKRMSKGGKLPPGPTPIPFLGNVLQVRTDATFQSFMK(0) 4039

4854 LKEKYGPVFTVYMGPRPVVVLCGHEAVKEALVDRADEFSGRGELASVERNFQGH(1)5015

6748 GVALANGERWRILRRFSLTILRDFGMGKRSIEERIQEEAGFLLVELRKTR(1)6897

8738 GARIEPTFFLSRTVSNVISSVVFGSRFDYEDQQFLKLLQMINQSFIEMSTSWAQ (0) 8899

9151 LYDMYSGIMQYLPGRHNRIYYLIEELKDFIASKVKINEASLDPQNPRDFIDCFLIKMHQ(0) 9327

300  DKNNPHTEFNLKNLVLTTLNLFFAGTETVSSTLRYGLLLMMKHPEVE(1)145

997  AKIHEEIDQVIGPHRIPSVDDRAKMPYTDAVIHEIQRLTDIVPMGVPHNVIRDTHFRGYLLPK(0) 1185

1314 GTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQGHFKKNEAFVPFSS(1) 1454

586  GKRICLGEAMARMELFLYFTSILQNFSLRSLVPPADIDITPKVSGFGNIPPTYELCFMVR(1) 765
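Each CYP2G1 exon line above carries a start coordinate, the translated sequence, a number in parentheses, and an end coordinate. The sketch below parses such a line. It assumes the parenthesised number is the intron phase at the end of the exon (the page does not define it), and the field names are illustrative. The coordinates are hit ranges, so the span need not equal exactly three times the residue count.

```python
import re

# start, sequence, "(phase)", end; spacing around the phase varies on the page.
EXON_RE = re.compile(
    r"^(?P<start>\d+)\s+(?P<seq>[A-Z*]+)\s*\((?P<phase>[012])\)\s*(?P<end>\d+)\s*$"
)


def parse_exon(line):
    """Parse one coordinate-annotated exon line (hypothetical helper)."""
    m = EXON_RE.match(line.strip())
    if m is None:
        raise ValueError(f"not an exon line: {line!r}")
    start, end = int(m["start"]), int(m["end"])
    return {
        "seq": m["seq"],
        "phase": int(m["phase"]),  # assumed to be the intron phase
        "residues": len(m["seq"]),
        "span_nt": abs(end - start) + 1,  # inclusive coordinates
        # Descending coordinates are taken to mean the minus strand.
        "strand": "+" if end >= start else "-",
    }


first = parse_exon(
    "3860 MELGGAFTIFLALCLSCLLILIAWKRMSKGGKLPPGPTPIPFLGNVLQVRTDATFQSFMK(0) 4039"
)
minus = parse_exon("300  DKNNPHTEFNLKNLVLTTLNLFFAGTETVSSTLRYGLLLMMKHPEVE(1)145")
print(first["residues"], first["span_nt"], first["strand"])
print(minus["span_nt"], minus["strand"])
```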

 

CYP2J26

complete 2JD

CB465835.1 SCAFFOLD215002 CB462842.1 SCAFFOLD278853

SCAFFOLD107814 CB465835.1 SCAFFOLD215002 CB462842.1 11497

CB532345.1

MLEALGSLVAALWTTLRPGIVLLGAFVFLLFADFLKRQHPKNYPPGPLRLPFIGNFFHLDLGKGILVPQQ

VVKKYGNIIRLDFGVIHFIVITGLPYIKEALVNQEQNFVNRPMIPLQKHIFNNK

GLVRSNGQVWKEQRRFTLTTLRNFGLGRKSLEERIQEEVTYLIQAIGEEN

GQPFDPHFIINNAVSNIICSITFGERFDYKDDQFQELLRLLDEILCIQASVCCQ

LYNAFPRIMNFLPGSHHTLFRKWEKLKMFVANVIENHRKDWNPAEARDFIDAYLQEIEK 11676

HKGNATSSFDDENLICSTLDLFLAGTETTSTTLRWGLLFMALNPEIQ 14705

EKVQAEIDRVLGQSQKVSTASRESMPYTNAVIHEVQRMGNIVPMNVPREVTVDTVLAGYH 15236

LVKGTMVLTNLTALHRDPAEWATPDTFNPEHFLENGQFKKRESFLPFSI

GKRMCLGEQLARTELFIFFTSLLQKFTFRPPENEKLSLKFRESLTSSPASYRLCAIPRA* 25310

 

CYP2J27

SCAFFOLD215002 complete  2JC

BM258720.1 CK950217.1

SCAFFOLD388989

This gene has an extra exon 5

MLEALGSLAAALWAALRPGTVLLGAVVFLFLDDFLKRRRPKNYPPGPPPLPEVGNFFQLDFDKAHLSLQR

FVKKYGNVFSVDFGIFRSVLITGLPLIKEALVHQDQNFANRPLIPIEKRIFNNK 37352

GLIMSNGHVWKEQRRFALTTLRNFGLGKKSLEERIQEEAAYLIQEIGEEN 39667

GQPFDPHFTINNAVSNIICSITFGERFDYQDDQFQELLRLFDEMMHLRTSTCCQ 40221

LYNIFPRIMSFLPGPQHALFSKWEKLKMFIAGVVENHKRDWNPAEARDFIDAYLQEIEK 42145

HKGNATSCFHEENLIYNTLDLFFAGTETTSTTLRWGLLYMALYPEIQ 43949

EKVQAEIDRVLGQSQKPSMAARESMPYTNAVIHEVLRMGNILPLNVPREVTVDTVLAGYRLPK

GTMVTTNLTALHRDPAEWATPDTFNPEHFLENGQFKKRESFLPFSI

GKRMCLGEQLARTELFIFFTSLLQKFTFRPPENEKLSLKFRMSMTLSPLSHRLCAIPRA*

 

CYP2J27-ie5b

SCAFFOLD215002  BM258720.1 CK950217.1

extra exon 5

LSNVFPRIMNFLPGPQHTLFSKWEKLKMFIAGVIENHKRDWNPAEARDFVDAY 41591

 

CYP2J28

complete  2JA

SCAFFOLD256921 CK945989.1 CK945989.1 CO259427.1

SCAFFOLD295402 CO259427.1 SCAFFOLD309869 CO893955 CO259427.1

SCAFFOLD213132 CO259427.1 SCAFFOLD247979 CO893955 CO259427.1

BE664093.1 SCAFFOLD65540 CO893955 CK770461.1

MLEALGSLAAALWAALRPGTVLLGAIVFLLLTDLLNRRRPKNYPPGPPRLPFVGNFFQLDFEQGHLSLQR

FVKKYGNLFSLEFGDLPSVVITGLPLIKEVLVYQDQNFVNRPISPIRERVFKKN

GLIMSNGHIWKEQRRFSLTALRNFGLGRKSLEERIQEEVAYLIQAIGEEK

GQPFNPHFKINNAVSNIICSITFGERFDYQDDQFQELLRLLDEVTYLETTVWCQ

LYNVFPRIMNFLPGPHQMLFSNWRKLKMFVARVIENHKRDWNPAEARDFIDAYLQETEK

HKGNAASSFHEENLIYNTLDLFFAGTETTSTTLRWGLLYMALYPEIQ 716

EKVQAEIDKVLDESQQPSMATRESMPYTNAVIHEVQRMGNILPLNVPREVTVDTVLAGYHLPK

GTMVLTNLTALHRDPAEWATPDTFNPEHFLENGQFKKREAFLPFSI

GKRMCLGEQLARTELFIFFTSLLQKFTFRPPEHEELSLKFRMGLTLSPVSHCLCAVPRA*

 

CYP2J29

complete 2JB

SCAFFOLD25620 DN825584.1 DN825609.1 CN793001.1

SCAFFOLD126970 SCAFFOLD87184 CK957631.1 SCAFFOLD166503

CK834395 CK945908.1 SCAFFOLD166503 CK960783.1 CK957063.1

SCAFFOLD245478 CK945645.1

MLSSLAAALWAALRPGTVLLGAVAFLFFADFLKRRRPKNFPPGPAGLPFVGNSFQLDPEKVHLTLQQ

FVKKYGNVFSLDFGTFPSILITGLPLIKEALVHQGENFSKRPVMPLQERIFNTK

GLIMSSGHIWKEQRRFALTTLRNFGLGKKSLEERIQEEASYLIQMIREEN

GKPFDPHFIINNAVSNIICSITFGERFDYQDSQFRELLRLLDEVLNLHTSLCCQ

LYSVFPRIMNFVPGPHQTLFSNLEKLKMFVAEMIENHKRDWNPAEARDFIDAYLQEIEK 8435

HKGGDASSFREENLIYSTLDLFLAGTETTSTSLRWGLLYMALNPEIQ 5634    

EKVQAEIDRVLGQSQQPSTAARESMPYTNAVIHEVLRMGNIIPLNVPREVAVDTTLAGYHLPK 5455

GTVVVTNLTALHRDPAEWATPDTFNPEHFLENGQFKKRESFLPFSI 2548

GKRMCLGEQLARAELFIFFTSLLQKFTFRPPENEKLSLKFRVSLTLAPISHRLCAVPRG*

 

CYP2J30

complete 2JH

DN825200.1 CK950246.1 SCAFFOLD281439

SCAFFOLD298803 CK940975.1

SCAFFOLD170755 CK945247.1 CN793929.1

MLEALSSLATALWAALRPDTVLLGTLAFLLFVDFLKRRHPKNYPPGPPGLPFVGNLFQLDPEKVPLVLHQ

FVKKYGNVFSLDFGTVPSVLITGLPLIKEVLVHQGQIFSNRPIVPLQEHIINNK

GLIMSSGQLWKEQRRFALTTLRNFGLGKKSLEERIQEEASYLIQTIREEN

GQPFDPHLTINNAVSNIICSITFGERFDYQDDQFQELLRMLDEILNLQTSMCCQ

LYNVFPRIMNFLPGPHQALFSNMEKMKMFVARMIENHKRDWNPAEARDFIDAYLQEIEK

HKGDATSSFQEENLIYNTLDLFLAGTETTSTSLRWGLLFMALNPEIQ

EKVQAEIDRVLGQSQQPSMAARESMPYTNAVIHEVLRMGNIIPLNVPREVAVDTTLAGYHLPK 15084

GTMVMTNLTALHRDPTEWATPDTFNPEHFLENGQFKKRESFLPFSI 12265

GKRMCLGEQLARTELFIFFTSLLQKFTFRPPENEQLSLKFRVSLTLAPVSHRLCAVPRG*

 

CYP2J31P pseudogene

CYP2JF SCAFFOLD295104 no ESTs possible pseudogene (N-term is short)

some frameshifts present,  exons 6-8 missing

MGAAAFLFVVHLKRRRGKNYPPGPPGLPFLGNFFHLDLKQLHLSLQQ

IVKKYGNMISLEMGGFSTVFFKWIAQNQRSPCLPGPKLVNHPIQRIQENIFKKH 5343

GLIMSNGHIWKEQRRSALTTLRNFGLGRKILEECIQEEAAYLIQTVGEEN 8001

XQPFDPHFTINNAVSNIVCSIAFGELFDYQDSXXQELLRLMDEAMYLQTSVRCRV 8538

LYNFFARIMNFLPGPHQTLFIKWEKLNMFIDSVIENHRRDWNPAEPRDFTDA

15856 GMWMCPGEQLARTELFIFFTSLLQKFTFRPPGDEKLSLQFRVSLTISSVSHWLC 16020

 

CYP2R1

scaf.96150, 81935, 91707 CK969495.1

MWEPHSAEAFVAALGGVFFLLLFALGVRQLLKQRRPSGFPPGPSGLPFIGNIYSLAASAELPHVYMKKQSQVYGE (0)

IFSLDLGGISAVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMG (1)

GLLNSRYGRGWVDHRKLAVNSFRCFGYGQKSFESKILEETKFFIDAVETYNGSPFDLKQLV

TNAVSNITNLVIFGERFTYEDTDFQHMIELFSENVELAASATVFLYNAFPWIGILPFGKH

QQLFRNAAVVYDFLSRLIEKASINRKPQLPQHFVDAYLDEMERSKNDPSSTFSKENLIFS

VGELIIAGTETTTNVLRWAVLFMALYPNIQ (1)

GQVQKEIDLIIGPSGKPSWDEKCKMPYTEAVLHEVLRFCNIVPLGIFHATSEDAVVRGYSI

PKGTTVITNLYSVHFDEKYWRDPEIFYPERFLDSSGHFAKKEALIPFSL (1)

GRRHCLGEQLARMEMFLFFTALLQRFHLHFPHELVPNLKPRLGMTLQPQPYLICAERR*

 

CYP2S1

AAFC02159945.1

MEAAGTWALLLLLLLLVVTLVLPATWDRGHLPPGPTPLPLLGNLLQLRPGALYLGLLR

LSKKYGPVFTVYLGPWRRVVVLVGHEAVQEALGGQAEEFSGRGTVATLDGTFDSH

GVFFSNGERWRQLRKFTTLALRDLGMGKREGEELIQAEARCLVEALQGTK

GRPFDPSLLLAQATCNIICSLVFDLRLPYDNEEFQAVVRAAGGIAVGVSSPWGQ

TYEMFSRFLQRLPGPHTQLLRHLGTVAAFAAQQVWQHKGSLGTSGPVRDLVDAFLLKMAK

EKQDPNTEFTAKNLLMTVVYLLFAGTVTVSTTIRYTLLLLLKYPQVQ

ERVQEELMRELGAGQRPSLGDRARLPYTDAVLHEAQRLLALVPMGIPRALTKTTRFRGYTLPQ

GTEVFPLLGSILHDPAVFEEPKEFNPGRFLDADGKFKKHEAFLPFSL

GKRVCLGEGLARTELFLLFTAILQAFSLEGPCPLGALSLQPAISGLFNIPQAFQLQFRPR*

 

CYP2T2P ortholog

AAFC02046938 65% to rat 2T4 66% to 2T2P human

MMISGIIALSLLVLLLAPARWGWGARSTQRQGALPPRATPLRLLGSLLQLRIWRPGPCTHG

LSGRCGPVFTVCLGQCPVVVLCRYAALRDALVLQADAFSGRGAMAVFKRFTRGN

GIAFSKGPRWPTLRNFALGALKEFGLGTQTIEERVLEEAACLLGDFQATGG

GAPFDPQRLLDNAVSNVICSVVLGNHYGYEDMEFLRLLDLFNDNFRIMSSRWGE

XXXXXSLLDWLPGLHH*IFRNFAXLRVFISQQIQLHQQTR*SGKPHDFIDXXXXXXX

GTENPESHFQAETLAMTMHNLFFGXXETTSTTLRYGLILLKYSFVA

AKVQAELDDMVGRMCAPTLEDREHLPYTNTVLHEIQCFISVVPFGLPSALTCDTHLRGYFLPK

GTFVIPLLVSTHWVPTQFKNPECFNPTNFLNDQGEFQSNAFTPFAL

GTCLGAGLAPTDIFLFLTSILLRFFLLPVGSHSDTDLTPQCTGLGNVPPAFQLRLVAR*

 

CYP2U1

XM_615255 AAFC02193700 gnl|ti|665258407

MASPGLPQPPTEDAAWPLRLLHAPPGLLRLDPTGGALLLLVLAALLGWSW

LWRLPERGIPPGPAPWPVVGNFGFVLLPRFLRRKSWPYRRARNGGMNASGQGVQLLLADL

GRVYGNIFSFFIGHYLVVVLNDFHSVREALVQQAEVFSDRPRVPLTSIMTKGKGIVFAHY

GPVWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFRYVKEEMQKHGDAPFNPFPIVNNAVSN

IICSLCFGRRFDYTNSEFKQMLTFMSRALEVCLNTQLLLVNICSWLYNLPFGPFKELRQI

EKDLTLFLKKIIKDHRESLDVENPQDFIDMYLLHVEEEKKNNSNSGFDEDYLFYIIGDLF

IAGTDTTTNSLLWCLLYMSLHPNIQEKIHEEIARVIGADRAPSLTDKAQMPYTEATIMEV

QRLSTVVPLSIPHMTSEKT

VLQGFTIPKGTIILPNLWSVHRDPAIWE

KPNDFYPDRFLDDQGQLIKKETFIPFGI

GKRVCMGEQLAKMELFLMFVSLMQSFTFVLPKDSKPILTGKYGLTLAPHPFNIIISKR

 

CYP2W1 partial sequence

scaf.279079,  426471 CB440236.1 AAFC02007621.1

XXXXX

LGKQYGPVFTVHLGHQKTVVLTGYEAVKEALVGTGQELAGRPPIAIFQLINGGG (1)

GVFFSSGPRWRAARQLTVRALHGLGVGRAPVANKVLQELRCLTAQLDSYE (1)

GRPFPLALLRWAPSNITFTLLFGQRFDYRDPVFLSLLGLVDEVMVLLGKPSVQ (0)

LFNLYPRLVALLQLHRPVLRKIEEVRAILRALLEARRHRTPPRGPQQSYLDALIQQGQ (0)

XXXXX

XXXXXXXXXXXXXXXXPRPEDVHALPYTNAVLHEVQRFITLLPHAPRCTVANTQLGPYLLPK

GTPVLALLNSVLLDETQWKTPRQFNPGHFLDANGRFVKRPAFLPFSA

XXXXX

CYP2AB1P pseudogene

scaf.15709 BI535102

MCPLLIWLGLLAASFLLLKFSIIYWERNHLPPDPFPFPILGNPWQLSFQLHPATLLQ

LAQTHGHVFTVWVGPTPVVVLCSFQA

KEALVSHSEQLSGWPLTPLFQDLAGERG

GVICSSGRTRRQ*RRFCLAALQGLG*GPLALELRLQEEAAGLVEAFHWEQ

GGPFDPQAPIVRSTARVTGALVFGRHFLSEDPFFQELI*ATNFGLAFXXXXXX

QLNDLFPWAFRCLPGPYREMFRYQKAVRGYIHREIMRHKLRTSEAPKDFISCYLAQIIK

ATDDPVSTFNEENLIQVVVGLFLGGTDTTGTTLYWVLIYMIQYGAIQS

ERVQQELVTVLGTSGAICYKDHEQLPHICTLLHEAQRLSSVA*V

AVCQCVTSTHVHGHPVPK

GTIILPNLAAVLCDPECWRTSRQFNPGHFLDKDGNFVVRDIFPPFSA

GHQMCLGD*LAQMKLLLMFATLLGTFSFQLPGRSPGLRLEYNFGGTRKPLPQKIYAVSRLNCPHPGPREEVL*

 

CYP2AC1P pseudogene

67% to rat 2AC1

SCAFFOLD151603, SCAFFOLD251118 AAFC02071354 pseudogene

BZ940825 CC910297

MSGFESSFILPILSLILIFILNIKIVMTKASKQHFPPVPRPLPIIGNLHILNLKRPYQTMLE (0)

LSQKYGSIYSIQIGPRKVAVLxGYETVKDVLVNHTDQFGEWFHVPISERLFEGK

GIFFSHSDTSKIIRFTLTTSQNFGMGKKALEDTIIGESQHLIRNFETDKG

GKPFEVKTLTNASVANINVSVLLGKGFDYQNTPFLRLLTLIDQSVKLIVSPPTA

LFNMFPVLRFLLKTYKNILRNKDELFSFIRMTFLHHHHKLDKNDPRSLTDAFLVRQQE

DTSTDYFNDDTLVVLVNNLFAAGTESMVSTLCWGILFMSRYPEIQS

KVHDEIAKVMGSTQP*MAH*TQMPYTDAVILEVQRFADILPTGLPRATTTNTIFKNNYIPK

GTEVIFLLTSVL*DQTQWENPATFNPEHFLDSIEKFIKKEAFISFSV (1)

SPL*CAGESLAKMELLLFFMSLLQKFTFQPPPGVSHLDLDPTRDTGVVIQPMPHKIRALPRA

 

CYP3 family

 

CYP3A28

TC211564 Y10214 73% to 3A5 SCAFFOLD259874

MELIPSFSMETWVLLATSLVLLYI (2)

YGTYSYGLFKKLGIPGPRPVPYFGSTMAYHK (0)

GIPEFDNQCFKKYGKMWG (2)

FYEGRQPMLAITDPDIIKTVLVKECYSVFTNRR (0)

IFGPMGIMKYAISLAWDEQWKRIRTLLSPAFTSGKLKE (0)

MFPIIGQYGDMLVRNLRKEAEKGNPVNMKD (2)

MFGAYSMDVITGTAFGVNIDSLNNPHDPFVEHSKNLLRFRPFDPFILSI (1)

ILFPFLNPVFEILNITLFPKSTVDFFTKSVKKIKESRLTDKQM (0)

NRVDLLQLMINSQNSKEIDNHK (1)

ALSDIELVAQSTIFIFGGYETTSSTLSFIIYELTTHPHVQQKLQEEIDATFPNK (0)

APPTYDALVQMEYLDMVVNETLRMFPIAGRLERVCKKDVEIHGVTIPKGTTVLVPLFVLHNNPELWPEPEEFRPER (2)

FSKNNKDSINPYVYLPFGTGPRNCLGMRFAIMNIKLALVRILQNFSFKPCKETQ (0)

IPLKLYTQGLTQPEQPVILKVVPRGLGPQVEPDFL*
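The CYP3A28 entry above is listed exon by exon, with a number in parentheses after each segment. A minimal sketch for joining such segments into one protein string is given below. It assumes the parenthesised numbers are intron phases, which the page does not state, and the function name is illustrative.

```python
import re

# A sequence segment, optionally followed by "(0)", "(1)" or "(2)".
SEGMENT_RE = re.compile(r"^(?P<seq>[A-Z*]+)\s*(?:\((?P<phase>[012])\))?\s*$")


def assemble(lines):
    """Join exon segments; return (protein, list of phase annotations)."""
    parts, phases = [], []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # the page separates segments with blank lines
        m = SEGMENT_RE.match(line)
        if m is None:
            raise ValueError(f"unexpected line: {line!r}")
        parts.append(m["seq"])
        if m["phase"] is not None:
            phases.append(int(m["phase"]))
    return "".join(parts), phases


# First two segments of CYP3A28, as listed above.
protein, phases = assemble([
    "MELIPSFSMETWVLLATSLVLLYI (2)",
    "",
    "YGTYSYGLFKKLGIPGPRPVPYFGSTMAYHK (0)",
])
print(len(protein), phases)
```

The last segment of an entry has no phase annotation and may end in an asterisk; the pattern accepts both.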

 

CYP3A74

TC219554 76% to 3A4 Length = 563 N-term CK957598.1

MELILSFSTETWVLLATGLVLLYL (2)

YGTYSYGLFKKLGVPGPRPLPYFGNILSYRK (0)

GVCEFNEECFKKYGKIWG (2)

IFEGKQPLLVITDPDMIKTVLVKECYSVFTNRR (0)

VFGPSGVMKNAISVAEDEQWKRIRTLLSPTFTSGKLKE (0)

MFPIIGKYGDVLVRNLRKEAEKGTSVDIKD (2)

IFGAYSMDVITSTSFGVNIDSLGNPQDPFVENVKKLLRFSILDPFLLAV(1)

VLFPFLVPILDVLNITIFPKSAVNFFTKSVKRIKESRLKDNQK(0)

PRVDFLQLMINSQNSKETDNHK  (1)

ALSDQELIAQSIIFIFAGYETTSSTLSFLLYILATHPDVQQKLQEEIDATFPNK (0)

APPTYDVLAQMEYLDMVVNETLRMFPIAIRLERLCKKDVEIHGVSIPKGTTVMVPISVLHKDPQLWPEPEEFRPER (2)

FSKKNKDSINPYVYLPFGTGPRNCIGMRFAIMNMKLAIVRVLQNFSFKPCKETQ (0)

IPLKISSQGVLRPEKPVVLKVVLRDGTISGA*

 

CYP3A75 missing exons 3-6

SCAFFOLD20102 DN530970.1 CK954349.1 DN532344.1 AAFC02132381 SCAFFOLD237847

MELIPSFSMETWVLLSISLVLLYL

YGTYSHGLFKKLGVPGPRPLPYFGNVLSYRK

 

VFGAYSMDVITSTSFGVNIDSLGNPQDPFVENAKKLLRFDILNPFLLSV

VLFPFLVPIFEVLNITMFPKSAVNFLAKSVKRIKESRLKDNQK

PRVDFLQLMINSQNSKETDNHK

ALSDQELMAQSVIFIFAGYETTSNTLSFLLYILATHPDVQQKLQEEIDVTFPNK

APPTYDVLAQMEYLDMVVNETLRMFPITVRLDRLCKKDVKIHGVSIPKGTTVTVPISVLHRDPQLWPEPEEFRPER

FSKKNKDTISPYVYLPFGTGPRNCIGMRFAIMNMKLAVVRVLQNFSFKSCKETQ

IPLKINSQGLIRPEKPIFLKVVLRDETISGA*

CYP3A76

TC192141 75% to 3A4 SCAFFOLD300381 AM015519.1 AAFC02005660.1 DR712863.1

MELIPNFSVETWVLLAISLVLLYL(2)

YGTYSHGLFKKLGVPGPRPLPLFGNVLSYRK (0)

GVCEFDEECFKKYGKMWG (2)

IFEGKHPLLVITDPDMIKTVLVKECYSVFTNRR (0)

VFGPMGVMKNAVSVAEDEQWKRIRTLLSPTITSGKLKE (0)

MFPIIGKYGDVLVRNLRKEAEKGTSVDMKE (2)

VFGAYSMDVITSTSFGVNIDSLGNPQDPFVENAKKLLRFDILDPFLLSV (1)

VLFPFLIPIFEVLNISIFPKSAVNFLTTSVKKIKESRLKDTQK (0)

PRVDFLQLMINSQNSKETDNHK (1)

ALSDQELMAQSIIFIFGGYETTSTSLSFIIYELATHPDVQQKLQEEIDATFPNK (0)

APPTYDVLAQMEYLDMVVNETLRMFPIAVRLERFCKKDVEIHGVSIPKGTTVTVPISVLHRDPQLWPELEEFRLER (2)

FSKKNKDSISPYVYLPFGTGPRNCIGMRFAIMNMKLAVVRVLQNFSFKPCKETQ (0)

IPLKIKSQGLLRPEKPIFLKVVLRDETISGA*

 

CYP3A76-ie6b

extra exon 6 SCAFFOLD300381 other exon 6 at 7654

9898 MFPIIGKYGDVLVRNLRKEAEKGKSVNMKE
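To see how the extra exon 6 differs from the exon 6 used in the CYP3A76 entry above, the two segments can be compared position by position. This is a sketch only: the helper name is an assumption, and the comparison is ungapped, which works here because both segments have the same length.

```python
def substitutions(ref, alt):
    """List differences as e.g. 'T24K' (reference residue, 1-based position, new residue)."""
    if len(ref) != len(alt):
        raise ValueError("segments must have the same length")
    return [
        f"{r}{i}{a}"
        for i, (r, a) in enumerate(zip(ref, alt), start=1)
        if r != a
    ]


EXON6 = "MFPIIGKYGDVLVRNLRKEAEKGTSVDMKE"   # exon 6 in the CYP3A76 entry
EXON6B = "MFPIIGKYGDVLVRNLRKEAEKGKSVNMKE"  # CYP3A76-ie6b

print(substitutions(EXON6, EXON6B))  # ['T24K', 'D27N']
```

The positions are counted within the exon segment, not within the full protein.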

 

CYP4 family

 

CYP4A40

Scaffold 211417 AAFC02026180 AAFC02091356.1 CB417720.1 BE750809.1 CYP4Ab

MSVSALSPSRALGGVSGLLQVVSLLGLVLLLIKAAQLYLRRQWLLKALHHFPSPPSHWFCGHKWE

FQEEGELPHLLKRVEKYPRACVRWMWGTRALLLVYDPDYMKMVLGRS

DPKAQIIHRFVKPWI

GTGLLLLEGQTWFQHRRMLTPAFHYDILKPYVGIMADSVRVML

DKWEELVSQDSHLEIFGHVSLMTLDTIMKCAFSQQGSVQTDR 369

SSQSYIQAIKDVSHLIISRLRNAFHQNDLIYRLTPEGHWNHRACQLAHQHT 895

DAVIKERKVRLQKEGELEKVRSRRHLDFLDILLFAR 1445

MENGSSLSDEDLRAEVDTFMFEGHDTTASGISWILYALASHPEHQQRCREEIQS 1690

LLADGASITW

DHLDQMPYTTMCIKEAMRLYPPVPVISRELSKPITFPDGHSLPA 1952

GILVSLSIYGLHHNPKVWPNPE

VFDPTRFAPGSTRHSHAFLPFSGGSR 3927

NCIGKQFAMNELKVAVALTLLRFELSPDSSRVPVPMPVIVLRSKNGIHLQLRKLSDPGGDK

 

CYP4A41

77% to 4A11 XM_594222

MSVSVLSPTRALGGVSGLLQVVSLLGLVLLLLKAAQLYLRRQWLLKALHQFPSPPSHWFYGHKKE

FQKESELPPLLERVEKYPKACVRWLWGTKALVLVYDPDYMKVFLGRS

DPKPHRTYKYLAPWI

GTGLLLLEGQKWFQHRRMLTPAFRYDILKAYVGIMADSVRVML

DKWEELVSQDSHLEIFGHVSLMTLDTIMKCAFSHQGSVQMDR

SSQSYIQAIRDLSHLIVSRLRNAFHQNDLIYRLTPEGRWNHRACQLTHQHT

DAVIKERKAHLQKEGELEKVRSRRHLDFLDILLLAR

MENGSSLSDEDLRAEVDTFMFEGHDTTASGISWILYALASHPEHQQRCREEIQSLLGDGASITW

DHLDQMRYTTMCIKEAMRL

YPPVPFIGRELRKPITFPDGRSLPA

GILVSLSFYGLHHNPNVWPNPE

VFDPTRFSPGSTQHSYAFLPFSGGSR

NCIGKQFAMNELKVAVALTLLRFELSPDPSRVPVPTPIMVLRSKNGIHLQLRKLSDPGGDKDKL

 

CYP4A42P pseudogene N-terminal

XM_606400 SCAFFOLD236059 72% to 4A40

KEKELEELLKRVEEFPYAYPCWMWG

DNVHLVVYDPDYMKVVLRRS

NPKSEDIYRFLAPWI

GLLLLEEQTWFQHRQMLTPAFHYDILKPYLGLMADSVQGML

DKWEELVSQDSHLEILGDISLMTLDTIMKCVFSH*GSIRNDR

XXXXXXXXXXXXXXXXXXXLRNVFYQNDLIYRLTPEGQWSY

XXXXXXXXX

TDAVIEERKTCLRKEGELEKVRSRRHLDFLDILLFSRV

NGSSLSDEDLRAEVDTFIFGGHDTTASGISWILYALASHPEHQQRCREEIQSLLGDGASITW

 

CYP4A pseudogene C-terminal

Scaf. 42211 pseudogene  

GNGTFLLEGQMWFQYWGMLTSAFHHDILRPSMGHMAYSVQLM

TDSIIKGRKSHLHKEGELEKERKWRPLDFLDILLFAR

MKTGSSLSDKDLHAEVDKVMFKGQETTTSGISWILYSLVSHPVHQQR

EEIQRLLGDGASITW

DHLDQMPYTTXCI

QEALRFYPPVPVIGRELSKPITFPVGHSLPA

XXXXXXXXXXXXXXXXXXXXXX

VFDSS*SAPGPAGHNHAFLPFSGGS

 

CYP4A pseudogene C-terminal

Scaf. 268824 c-term 4A pseudogene

GIEVALSFYGLHHNPKV*LNPE

VFEPSRFALGSNQHSHAFLPFSGGSR

KCIGKQFVMNEMKVAMALTLLHFELSPDPSRVPCLTP

EIVLRSKNGIHLHLRSL

 

CYP4A C-terminal possible pseudogene

Scaf. 305588 83% to CYP4A41 differs from 4A40 and 4A41

no ESTs

LSFYGLHHNPKVWPNPE

VFDPSWFTLGSTRHSHAFLPFSGGSR

NCIRKQSAMNDAKVAVALTLLRFELSSDPFRVLVPAPVMVQRSKNGIHFQLRKLSDPGGDKDK

 

CYP4B1

AW347853 89% to 4B2 3 aa diffs to mouse 4b1, 5 aa diffs to 4B2 cow

This seq is 100% match to pig CJ022616.1

DGRSLPA

GSLISLHIYALHRNSAVWPDPE

VFDPLRFSLENMAGRHPFAFLPFSAGPRNC

 

Pig CYP4B1 CJ022616.1 BP435822.1 CF179484.1 AW480803.1 CJ026219.1

BP440482.1 80% to human 4B1, 81% to cow 4B2

MVPALLSLSLSHLSLWALGLILALGVLKLTSLLLRRQMLARAMDK

FPGPPTHWIFGHALEIQQTGSLDKMVSWAHQFPHAHPLWLGPFLGFLNIYEPEYAKAVYS

RGDPKASDLYDFFLQWIGKGLLVLHGPKWFQHRKLLTPGFHHDVLKPYVAVMANSARVML

DKWEEKAREDKSFDIYHDVGHMALELLMKC

TFGKGTSGLNYSDNAYHLAVRDLTLLMQQRLSSFQYHNDFIYWLTPHGRRFLRACRVAHD

HTDQVIRERKAALQDKEEQERIQSRRHLDFLDILLGAQDEDRVKLPDADLRAEVDTFMFG

GHDTTTRGISWFLYCMSLNPEHQHRCRDEIREILGDRDSIQGEDLGKMTYLTMCIRESFRLYPP

GPQVYRQLSEPVSFVDGRSLPAGSLISLHIYALHRNSAVWPDPEVFDPLRFSLENMAGRH

PFAFLPFSAGPRNCIGQQFAMNEMKVVAALSLLRFEFAPDPSRPPIPMPQLTLCSKTGIH

LRLTSLGQGSGK*

 

CYP4B2

TC209863 BF602740 AAFC02041217.1 94% to goat 4B2 only 82% to human 4B1

MVPVLLSLSLSHLSLWAFALVSVLVFLKLTRLLLRRQMLARAMDRFSGPPTHWLFGHALE

IQQTGSLDKVVSWTHEFPYAHQLWVGQFLGFLNIYDPDYAKAVYSR

GDPKAPDFYDFFLQWI

GKGLLVLQGPKWFQHRKLLTPGFHYDVLKPYVALFAESTRVML

DKWEKKAREQKSFDIYSDVGHMALDSLMKCTFGKGTSGLNDR

DNNYYLSVKELTLLMQQRIDSFQYHNDFIYFLTPHGRRFLRACQVAHDHT

DQVIRERKAALQDEKERERIQSKRHLDFLDILLGAW

DEEGIKLSDEDLRAEVDTFMFEGHDTTTSAISWVLYCMSLYPEHQRRCREEIQEILGDRDTLKW

DDLAEMTYLTMCIKESFRL

YPPVPQVYRQLSQPVNFVDGRSLPE

GSLISLHIYALHRNSTVWPDPE

VFDPLRFSPENVAGRHSFAFIPFSAGPR

NCIGQQFAMAEVKVVTALCLLRFEFSPDPSRLPIKMPQLVLRSKNGIHLHLKPLGPGSEK*

 

CYP4X1

CF763940.1 CF766034.1 CF765861.1 CF768878.1

24776 MESSWLETHWARPFYLTLVFCLALGLLQAIKLYLQRQRLLRDLRLFPSPPTHWLYGHSK 24600

4386 LFKDGKMEILEKIVEKYPCAFPRWIGPFQAFLYIYDPDYAKTFLSRT (1)

DPKSNFLYKFMTASV

1563 GKGLVNLSGPKWSQHRRLLTPGFHFNTLKSFVEVMAQSVNIML 1435

NKWEKICGSQNTLLDIYEHINLMTLDVLMKCIFSWETNCQTDSSHDNYVKATSEGSNILMQR

LFNYFYHYDLIFKLSPLGHRLREVNKILHHHTDKIVKNRRNSLKDENKQTNTQKRKYRDF

LDIVLSAQAENDSFTDADLWSEVNTFMVAGHD

SVSAGISWLLYHLALYPEHQEKCREEIRAILGDGSSITWDQLSEM

SYTTMCIKESLRLAPPAVSISRQLSKPITFPDGRSLPAGMTVVLSIWGLHHNPAVWENPE

VFDPLRFSQECNKRHSHAYLPFSSGPRNCIGQNFAIAEIKVIVALI

LLRFQMRVEPTKPV

VFVPYIFIKPKKGIYLHLKKLP*

 

CYP4F22 ortholog

MLPITEQLLHLLGLEKTAFRLYAVSGLLLTLLFFLFRLLLQFLRLCWGFYITCRRLRCFPQPPRRNWLLGHLGM (0)

YLPNERGLQDEKKVLDDMHHVILVWLGPVLPLVVLVHPDYIKPLVGAS (1)

AAVAPKDDLFYGFLKPWL (1)

GDGLLLSRGDKWSRHRRLLTPAFHFDILKPYMKIFNQCADTMH (0)

AKWRGLAEGSEVSLDMFEHVSLMTLDCLQKCVFSYNSNCQE (2)

KMSDYISAIIELSALVVRRQYRLHHYLDFIYYRTADGRRFRQACDTVHGFTTEVIQERRR ALRQQGAEAWFKAKQGKTLDFIDVLLLAK (0)

DEDGKELSDEDIRAEADTFMFE (1)

GHDTTSSGLSWVLFNLAKYPEYQEKCREEIQEVMKGRELEELEWSGQ (2)

DDLTQLPFTTMCIKESLRQFPPVTLVSRRCTEDIKLPDGRIIPK (1)

GIICLVSIYGTHYNPAVWPDSK (0)

VYNPYRFDPDNPQQRSPLAYVPFSAGPR (2)

NCIGQSFAMAEMRVAVALTLLRFRLFLDRTRKVRRKPELILRTENGIWLKVEPLPPRA*

 

CYP4F46

SCAFFOLD265009 AAFC02033475 XM_591523 74% to human 4F11

MLELSMSRLGLGLVAASPWLLLLVVGASWLLAHILAWTYTFFNNCRLLQCFPQPPKLNWFFAHLYL (0)

VPMTEQGLRKLTQLAANYSHGYMIWFGPITLMIVFCHPDMLRSITNAS

AAVAPKDMDFYNFLKPWLG

DGLLLSAGDKWSRHRCMLTPTFHFNILKPYMKIFTKSTDIMH

TNWECLVTQGHTRLDMFEHVSLLTLDSLQKCVFSFDSNCQE

KPSEYITTIFELCELAAKRNKQICLHTDFLYFLTLDGWHFRRACRLVHNFTDSIIQEQRCTLPSENI

DDFFKAKVKTKTLDFIDVLLLSK

DEDGKGLSDEE

IRAEADTFMFE

GHDTTASGLSWILYNLAKHPEYQERCGQEVKELLRDCESKEIEW

DDLAHLPFLTMCIKESPRLHPPVTFIFRYCTQDIVLPDGRVIPK

GVVCLIDIFGTHHNPSVWPDPE

VYDPFRFDPENIKGRSPLAFIPFSAGSR

NCIGQTFAMTEMNVVLALTLLRFRFRPDKEEPRRKPELILRAEGGLWLQVELLSAGPQ*

 

CYP4F47

XM_586481 CN792379.1 80% to 4F11  bottom identical to 615008

MLELSLSWLGLRPVAASPWLLLVLVGASWILAHVLAWTYTFYNNSRHLQCFPQPPKRNWFLGHMGQYPP

TEKGLRDFTQLVANYPQGFRICLGPAIPIIIFCHPDMIRIVANAS

AAAAPKDVIFYDFLKPWLG

DGLLLSAGDKWSRHRRMLTPAFHFNILKPYMKIFTKSADIMH

AKWQRLITEGHTHLDMFEHISLMTLDSLQKCVFSYDSNCQE

KPNEYIASILELSALVAKRNQQIFLHMDFLYFLTPDWRRFRRACRLVHDFTDAVIQERRHTLPSEGT

DDFLKAKVKTKTLDFIDVLLLTK

DEDGKGLSDEDIRAEADTFMFE

GHDTTASGLSWVLYNLAKHPEYQERCRQEVQDLLRDRESKEIEW

DDLAQLPFLTMCIKESLRLHPPVSVISRRYAQDTLLPDGRVIPK

GVICLINIIGTHHNPSVWPDPE

VYDPFRFEPENIKGRSPLAFIPFSVGPR

NCIGQTFAMTEMKVVLALTLLRFRVLPGEEPRRKPELILRAEGGLWLRVEPLSADQ

 

CYP4F48

XM_584482 CN793339 CK836265 AAFC02027122 CK946303 AAFC02217855.1

MLELSLSRLGLGPLAASPWLLPLLAGVSWILARVLAWTYTFYNNSRRLRCFLQPPKPNWFLGHMNL

VPSTEQGLIYFTQMAANYPRGYLIWFGPIIPMVIFCHPDMLRSITNAS

AAIAPKDMQFYGTLKPWLG

DGLLLSAGDKWSSHRRMLTPAFHFNILKPYIKIFTKSAEIMH

AKWEHLITEGHTHLDMFEHISLLTLDNLQKCVFSFDSNCQE

KPSEYIAAILELSALVSKRNQQLFLYMDFLYYLTSDGQRFRNACRLVHDFTDAVIQERRRTLPKENI

DDFLKAKAKTKTLDFIDVLLLTK

DEDGKGLSDEDIRAEADTFMFE

GHDTTASGLSWILYNLAKHPEYQERCRQEVQDLLRDRESKEIEW

DDLAQLPFLTMCIKESLRLHPPVTSISRRCTQDIVLPDGRVIPK

GVVCLIDIFGTHHNPSVWQDPE

VYDPFRFDPENIKGRSPLAFIPFSAGPR

NCIGQTFAMTEMKVILALTLLRFRVLPDKEPCRKPELILRTEGGLWLRVEPLSAGQQ*

 

CYP4F49

XM_614912 CK959664 CN793709 AAFC02090500.1 80% to 4F11 XM_583556

MLELSLSWLGLGPVAASPWLLLLFTGASWLLARILAWTYTFYNNSRRLQCFPQPPKRNWFLGHLGL

VPPTEQGMSKLTELVAKYSQGFRIWMGPITPIIVFCHPDLIRIVANAS

AAVAPKDVIFYEVLKPWLG

DGLLLSAGDKWSRHRRMLTPAFHFNILKPYMKIFTKSTDIMH

AKWQHLIKEGHTHLDMFEHISLMTLDSLQKCIFSYDSNCQE

KPSEYIAAILELSALVAKRHQEIFLHMDFLYYLTPDGRRFRRACRLVHDFTDAVIQERHRTLPSESI

DDFLKAKAKTKTLDFIDVLLLTK

DEDGKGLSDEDIRAEADTFMFE

GHDTTASGLSWILYNLAKHKEYQERCRQEVQELLKDREPKNIEW

DDLAQLPFLTMCIKESLRLHPPVSVISRRCTQDTVLPDGRVIPK

GVICLISIFGTHHNPSVWPDPE

VFDPFRFDPENIKGRSPVAFVPFSVGPR

NCIGQTFAMTEMKVVLALTLLRFRVLPDKEEPRRKPELILRAEGGLWLRVEPLSTGQQ*

 

CYP4F50

XM_583211 XM_583616 AAFC02021075.1 AAFC02239800.1 CK772605.1 CK960170.1

MLELSLSWLGLGPVAASPWLLLLLVGASWILARILAWIYAFYDNCCRLRCFPQPPKRSWFWGHLGL

AQSNEESMRLVEELGHYFRDVHLWWMGPFFPILRLVHPNFVAPLLQAS

ATIIPKDMFFYSFLKPWLG

DGLLLSAGDKWSSHRRLLTPAFHFEILKPYMKIFNKSADIMH

AKWQRLALEGSTRLDMFEHISLMTLDSLQKCVFSYDSNCQE

KPSEYIAAILELSALVMKRIKHIFLHVDFLYYLTRDGQRFYRACRLVHDFTDAIIQKRRRTLISQGS

QEFLKTKTKAKTLDFIDVLLLAK

DEDGKGLPDEDIRAEADTFMFE

GHDTTASGLSWILYNLAKHPEYQERCRQEVQELLRDREPKEIEW

EDLAQLPFLTMCIKESLRLHPPVAVISRLCTHDVVLPDGRVIPK

GNICVISIFGIHHNPSVWPDPE

VFNPFRFDPEAPKRSPLAFIPFSAGPR

NCIGQTFAMNEMKVALALTLLRFRILPDEEEPRRKPELILRAEGGLWLQVEPLN

TGQ*

 

CYP4V2 ortholog

TC207398 TC207399 TC207400 TC189439 CO895969.1  EST  

MLAPWLLSVGPKLLLWSGLC

AVSLAGATLTLNLLKMVASYARKWRQMRPVPTIGDPYPLVGHALMMKSDARDFFQQIIDFTEECR

HLPLLKLWLGPVPLVALYNAETVEVILSSSKHIEKSYMYKFLEPWLGLGLLTSTGNKW

RSRRKMLTPTFHFTILEDFLDV

MNEQANILVTKLEKHVNQEAFNCFFYVTLCTLDIICETAMGKNIGAQRNDDSEYVRAVYR

MSDSIHQRMKMPWLWLDLIFYMFKNGREHRRSLKIVHDFTNNVITERANEMKRHEEGTSN

DKEKDFPPRKTKCRAFLDLLLNVT

DDQGNKLSHEDIREEVDTFMFEGHDTTAAAINWSLYLLGWYPEVQQKVDTELEEVFGKS

DRPVTLEDLKKLKYLDCVIKESLRLFPSVPFFARNLTEDCEVAGHKMVQGCQVIIVPYAL

GSSRDPKYFPDPE

EFKPERVFPENLKGRHTYAYVPFSAGPRNCIGQKFAIMEEKTILSCILRHFWVESNQKRE

ELGLAGELILRPSNGIWIKLKRRNTDES*

 

CYP5A1

TC191301 76% to CYP5A1 Length = 1926

  94 MEVLGLFRLEVSGPMVTVALSVVFLALLKWYSTSAFSRLEKLGIRHPKPSPFIGNLAFFRQ 276

 277 GFWESHMELRKQYGPLSGYYLGRLMFIVISEPDMIEQVLVEKFSNFTNRMAT 432

 433 GLEPKPVADSVLFLRDKRWEEVRSVLTVAFSPEKLSEMTPLISRACDVLLAHLERH 600

 601 AQSGEAFDIQKTYCCYTTDVVASVAFGTEVNSQEAPEHPF 720

     VEHCRRFFASSIPKPLLVLLLSFPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQ

     AAEERRRDFLQMVQDVRHSAATVGVENFDIVRQVFSATKCPANPPRRHSPRPLSKP

1069 LSVDEVVGQAFIFLIAGYEIVTNTLSFATYLLATNPECQEKLLEEVDCFSKEHLAPEYCS 1248

1249 LQEGLPYLDMVIKETLRMYPPAFRFTRVAAQDCEVLGQRIPAGAVLETAVGALHYDPE 1422

1423 HWPNPENFNPERFTAEAQQRRRPYTYLPFGAGPRSCLGVRLGLLELKLTLLHILRKFR 1596

1597 FEACPETQVPLQLESKSALGPKNGVYIRIVSR*

 

CYP7A1

just one gene CB421103 92% to 7A1 Length = 707

MMSLSLIWGIVIAVCCCLYLLGMRRR (2) 5259

QMGEPPLENGLIPYLGCALQFGANPLEFLRANQRKHGHVFTCRLMGNYVHFITNPLSYH 3625

KVLCHGKYFDWKKFHFTASAK (0)

AFGHRSIDPSDGNTTDTISKTIIKTLQGDALSSLTEAMMGNLQLVLRPQGPPQPPTPTW 2193

VTEGMYSFCYRVMFEAGYLTLFGRDLAGQDAQKALILNSLDNFKQFDKIFPALVAGFPIH 2013      

VFKTGHYAREKLTEGLRLQKFRERDHISELVRFLNDTFATLDDTERAKSLLAVLWAS 1842

QANTIPATFWSLFQMIR

RNPEAMKAATEEVNKTLENAGQKVSFEDSPIHLNQTQLDNMPVL (1)

60 DSIIKESLRLSSASLNIRTAKEDFTLHLQDGSYNIRKDDIIALYPQLMHLDPEIYP 239

240 DPLTFKYDRYLDENGKTKTTFYSNGLKLKYYYMPFGSGVTICPGRLFAVQEIKQFLILML 419

420 SYFELELVESCVKCPPLDQSRAGLGILPPLYDTEFRYKFKHS* 548

 

CYP7B

just one gene

MNAGMGGFWPEPWLTRSLGYPGLALAAALLLLVLCLNARRTR (2)

RPGEPPLIKGWLPYFGQALKLQKDPLGFMTSLQKQYGDIFTLLLG (1)

GKYITFILDPFHYTSVAKNQKLSFQIFTNKFLKRVFSIKKMLTDSDLIDEIHSTYQ 56

  3 FLQGKHLDILMESTMQNLKQVFEPQLLKTTSWSTEYLLPFCNSVIFEMTFTTIYGNILAD 182

183 DKKTFITELKDDFLKFDEKFTRLASGIPIELLGNIKSVRTKLIKDLTIESLAKLQGMSEV 362

363 VQRRNDILEKYYTPKDTEIG (1)

AHHLGLLWASVTNTVPTMFWAMYYLLRNPKAMAVLRDEIDHLLQSTGQKKGPGFSIYLTR

EQLDSLVYL (1)

ESTILEVLRLCSFSGIFRFVQEDLTLHLESQDCCLRKGDFVVIFPPILHHDPEIFEAPD

EFRFDRFTENGKKKTTFFKRGKKLKYYHLPFGLGVSKCP

GRFLAMVEIKQLLVVLLTYFDLEIIDNKPLELNYSRFLFGIPYPDSDVLFRYKIKS

 

CYP8A1

TC208596 = D30718 89% to 8A1 Length = 2495

  30 MSWAVVFGLLAALLLLLLLTRRRTRRPGEPPLDLGSIPWLGHALEFGKDAAGFLTRMKEK 209

 210 HGDIFTVLVGGRHVTVLLDPHSYDAVVWEPRSRLDFHAYAVFLMERIFDVQLPHYNPGDE 389

 390 KSKMKPTLLHKELQALTDAMYTNLRTVLLGDTVEAGSGWHEMGLLEFSYGFLLRAGYLTQ 569

 570 YGVEAPPHTQESQAQDRVHSADVFHTFRQLDLLLPKLARGSLSAGDKDRVGKVKGRLWKL 749

 750 LSPTRLASRAHRSRWLESYLLHLEEMGVSEEMQARALVLQLWATQGNMGPAAFWLLLFLL 929

 930 KNPEALAAVRGELETVLLGAEQPISQMTTLPQKVLDSMPVLDSVLSESLRLTAAPFITRE 1109

1110 VVADLALPMADGREFSLRRGDRLLLFPFLSPQKDPEIYTDPEVFKYNRFLNPDGSEKKDF 1289

1290 YKDGKRLKNYSLPWGAGHNQCLGKGYAVNSIKQFVFLVLTQFDLELITPDVDIPEFDLSR 1469

1470 YGFGLMQPEHDVPVRYRIRP* 1532

 

CYP8B1

just one gene

MVLWGPALGALLVAIVGYLCLRGLLRQRRPKEPPLDKGPVPWLGHAMAFRKNMFEFLRHM

QAKHGDIFTVQLGGQYFTFVMDPLSFGPILKDAQRKLDFVEYAQKLVLKVFGYRSVQGDY

RMIHSASTKHLMGEGLEELNKVMLDTLSLVMLGPIGPSLGTHHWREDGLFHFCYNILFKA

GYLSLFGYTKDKEQDLLQAEELFLEFRKYDRLFPRFVYSLLGPREWLEVGRLQRLFHKEL  

SVEHNLEKYGVSNWITYMLQFLREQGVSPAMQDKFNFMMLWASQGNTGPTSFWALLFLLK

HPEAMRAVREEATRLLGEVRLEAKQSFKFGALHHMPVLDSVMEETLRLRASPTLLRVV

NDDQILQMASGQEYMLRNGDILALFPYLSVHMDPDIXXXXXXXXXXXXXXXXXLL

KAGKKIRHYTMPWGSGVSICPGRFLALNEMKLFILLMVMNFDLELVDPATPVPPVDPQRW

GFGTTQPSHEVRFRYRLRPAE

 

CYP11A1

TC189227 78% to 11A1 Length = 1959

MLARGLPLRSALVKACPPI

LSTVGEGWGHHRVGTGEGAGISTKTPRPYSEIPSPGDNGWLNLYHFWREKGSQRIHFRHI

ENFQKYGPIYREKLGNLESVYIIHPEDVAHLFKFEGSYPERYDIPPWLAYHRYYQKPIGV

LFKKSGTWKKDRVVLNTEVMAPEAIKNFIPLLNPVSQDFVSLLHKRIKQQGSGKFVGDIK

EDLFHFAFESITNVMFGERLGMLEETVNPEAQKFIDAVYKMFHTSVPLLNVPPELYRLFR

TKTWRDHVAAWDTIFNKAEKYTEIFYQDLRRKTEFRNYPGILYCLLKSEKMLLEDVKANI

TEMLAGGVNTTSMTLQWHLYEMARSLNVQEMLREEVLNARRQAEGDISKMLQMVPLLKAS

IKETLRLHPISVTLQRYPESDLVLQDYLIPAKTLVQVAIYAMGRDPAFFSSPDKFDPTRW

LSKDKDLIHFRNLGFGWGVRQCVGRRIAELEMTLFLIHILENFKVEMQHIGDVDTIFNLI

LTPDKPIFLVFRPFNQDPPQA*

 

CYP11B1

NM_174638

MALWAKARVRMAGPWLSLHEARLLGTRGAAAPKAVLPFEAMPRC

PGNKWMRMLQIWKEQSSENMHLDMHQTFQELGPIFRYDVGGRHMVFVMLPEDVERLQQ

ADSHHPQRMILEPWLAYRQARGHKCGVFLLNGPQWRLDRLRLNPDVLSLPALQKYTPL

VDGVARDFSQTLKARVLQNARGSLTLHIAPSVFRYTIEASTLVLYGERLGLLTQQPNP

DSLNFIHALEAMLKSTVQLMFVPRRLSRWMSTNMWREHFEAWDYIFQYANRAIQRIYQ

ELALGHPWHYSGIVAELLMRADMTLDTIKANTIDLTAGSVDTTAFPLLMTLFELARNP

EVQQAVRQESLVAEARISENPQRAITELPLLRAALKETLRLYPVGITLEREVSSDLVL

QNYHIPAGTLVKVLLYSLGRNPAVFARPESYHPQRWLDRQGSGSRFPHLAFGFGVRQC

LGRRVAEVEMLLLLHHVLKNFLVETLEQEDIKMVYRFILMPSTLPLFTFRAIQ

 

CYP11B2

MALWAKARVRMAGPWLSLHEARLLGTRGAAAPKAVLPFEAMPRCPGNKWMRMLQIWKE

QSSENMHLDMHQTFQELGPIFRYDVGGRHMVFVMLPEDVERLQQADSHHPQRMILEPWLA

YRQARGHKCGVFLLNGPQWRLDRLRLNPDVLSLPALQKYTPLVDGVARDFSQTLKARVLQ

NARGSLTLDIAPSVFRYTIEASTLVLYGERLGLLTQQPNPDSLNFIHALEAMLKSTVQLM

FVPRRLSRWMSTNMWREHFEAWDYIFQYANRAIQRIYQELALGHPWHYSGIVAELLMRAD

MTLDTIKANTIDLTAGSVDTTAFPLLMTLFELARNPEVQQAVRQESLVAEARISENPQRA

ITELPLLRAALKETLRLYPVGITLEREVSSDLVLQNYHIPAGTLVKVLLYSLGRNPAVFA

RPESYHPQRWLDRQGSGSRFPHLAFGFGVRQCLGRRVAEVEMLLLLHHVLKNFLVETLEQ

EDIKMVYRFILMPSTLPLFTFRAIQ*

 

CYP17A1

TC190136 71% to steroid 17alpha-hydroxylase/17,20 lyase Length = 1725

     MWLLLAVFLLTLAYLFWPKTKHS

 116 GAKYPRSLPSLPLVGSLPFLPRRGQQHKNFFKLQEKYGPIYSFRLGSKTTVMIGHHQLAR 295

 296 EVLLKKGKEFSGRPKVATLDILSDNQKGIAFADHGAHWQLHRKLALNAFALFKDG 460

 461 NLKLEKIINQEANVLCDFLATQHGEAIDLSEPLSLAVTNIISFICFNFSFKNEDP 625

 626 ALKAIQNVNDGILEVLSKEVLLDIFPVLKIFPSKAMEKMKGCVQTRNELLNEILEKCQEN 805

 806 FSSDSITNLLHILIQAKVNADNNNAGPDQDSKLLSNRHMLATIGDIFGAGVETTTSVIK 982

 983 WIVAYLLHHPSLKKRIQDDIDQIIGFNRTPTISDRNRLVLLEATIREVLRIRPVAPTLIP 1162

1163 HKAVIDSSIGDLTIDKGTDVVVNLWALHHSEKEWQHPDLFMPERFLDPTGTQLISPSL 1336

1337 SYLPFGAGPRSCVGEMLARQELFLFMSRLLQRFNLEIP 1450

     DDGKLPSLEGHASLVLQIKPFKVKIEVRQAWKEAQAEGSTP*

 

CYP19A1

TC194849 88% to 19A1 Length = 2057

MLLEVLNPRHYNVTSMVSEVVPIASIAILLLTGFLLLVWNYEDTSSIPGPSYFLGIGPLI

SHCRFLWMGIGSACNYYNKMYGEFMRVWVCGEETLIISKSSSMFHVMKHSHYISRFGSKL

GLQFIGMHEKGIIFNNNPALWKAVRPFFTKALSGPGLVRMVTICADSITKHLDRLEEVCN

DLGYVDVLTLMRRIMLDTSNMLFLGIPLDESAIVVKIQGYFDAWQALLLKPDIFFKISWLCRKYEKSV

 927 KDLKDAMEILIEEKRHRISTAEKLEDSIDFATELIFAEKRGELTRENVNQC 1079

1080 ILEMLIAAPDTMSVSVFFMLFLIAKHPQVEEAIIREIQTVVGERDIRIDDMQKLKVVEN 1256

1257 FINESMRYQPVVDLVMRKALEDDVIDGYPVKKGTNIILNLGRMHRLEFFPKPNEFTLE 1430

1431 NFAKNVPYRYFQPFGFGPRACAGKYITMVMMKVVLVTLLRR 1553

     FHVQTLQGRCVEKMQKKNDLSLHPDETRDRLEMIFTPRNSDKCLER*

 

CYP20A1

NM_001015644.1

MLDFAIFAVTFLLALVGAVLYLYPASRQAAGIPGITPTEEKDGN

LPDIVNSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTVDVLKQHINPNKTLDPFET

MLKSLLRYQSDSGNVSENHMRKKLYENGVTNCLRSNFALLIKLSEELLDKWLSYPESQ

HVPLCQHMLGFAMKSVTQMVMGSTFEDEQEVIRFQKNHGTVWSEIGKGFLDGSLDKST

TRKKQYEDALMQLESILKKIIKERKGRNFSQHIFIDSLVQGNLNDQQILEDTMIFSLA

SCMITAKLCTWAVCFLTTYEEIQKKLYEEIDQVLGKGPITSEKIEELRYCRQVLCETV

RTAKLTPVSARLQDIEGKIDKFIIPRETLVLYALGVVLQDPGTWSSPYKFDPERFDDE

SVMKTFSLLGFSGTRECPELRFAYMVTAVLLSVLLRRLHLLSVEGQVIETKYELVTSS

KEEAWITVSKRY

 

CYP21A1

TC209496 79% to CYP21A2 Length = 2158

     MVLAGLLLLLTLLSGAHLLWGRWKLRNLHLP

 234 PLVPGFLHLLQPNLPIHLLSLTQKLGPVYRLRLGLQEVVVLNSKRTIEEAMIRKWVDFAG 413

 414 RPQIPSYKLVSQRCQDISLGDYSLLWKAHKKLTRSALLLGTRSSMEPWVD 563

 564 QLTQEFCERMRVQAGAPVTIQKEFSLLTCSIICYLTFGNKEDTLVHAFHDCVQDLMKT 737

 738 WDHWSIQILDMVPFLRFFPNPGLWRLKQAIENRDHMVEKQLTRHKESMVAGQWRD 902

 903 MTDYMLQGVGRQRVEEGPGQLLEGHVHMSVVDLFIGGTETTASTLSWAVAFLLHHPEIQ 1079

1080 RRLQEELDRELGPGASCSRVTYKDRARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSIF 1259

1260 GYDIPEGMVVIPNLQGAHLDETVWEQPHEFRPDRFLEPGANPSALAFGCGARVC 1421

1422 LGESLARLELFVVLLRLLQAFTLLPPPVGALPSLQPDPYCGVNLKVQPFQVRLQ 1583

     PRGVEAGAWESASAQ*

 

CYP24A1

86% to human

6035 MSSPISKSRSLAAFLQQLRSLGQPPRPVTSTACSPRRPREVPLCPVMEPGEAQDAATLP 6211

6212 GPTKWPLLGSLLEILWKGGLKKQHDTL (0)

6511 (0) VEYHKKYGKIFRMKLGSFDSVHLGSPCLLEALYRTESAHPQRLEIKPWKAYRDYREEGYGLLIL (2) 6702

7484 (2) EGEDWQRVRSAFPKKLMKPVEIMKLDDKINE (0) 7573

4559 (0) VLADFMGRIDELCDERGRIEDLYTELNKWSFE (1)

2100 (1) SICLVLYEKRFGLLQKNAGEEALNFIMAVKT (0)

1035 (0) MMSMFGKMMVTPVELHRSLNTRVWQAHTQAWDTIFRS (1) 925

1303 (1) VKSCVDNRLEKYSEQPSMDFLCDIYHHNQLSKKELYAAVTELQLAAVET (0) 1449

TANSLMWILYNLSRNPHVQQKLFKEIQSVLPENQLPRAEDLRNMPYLKACLKESMR

LNPTVPFTTRTLDKAMVLGEYALPKG

5393 TVLVLNTHVLGSSEENFEESSQFRPERWLQDKKNLNPFAHLPFGVGKRMCVGRRLAELQL 5572

5573 HLALCW (0) 5602

IVRKYDVVATDLEPVETLHLGNLVPGRQLPVAFCQR

 

CYP26A1

TC196927 98% to 26A1 Length = 654 C-term AAFC02074175.1

MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSSRDRSCALPL

PPGTMGFPFFGETLQMVLQREKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL

GEHRLVSVHWPASVRTILGSGCLSNLHDSSHKQRKK (0)

VIMQAFSREALQCYVPVIAEEVGNYLEQWLSCGERGLLVYPQVKRLMFRIAMRILLGCE

SRLASGGEDEQQLVEAFEEMTRNLFSLPIDVPFSGLYR

GLKARDLIHARIEENIRAKIRRLPAAEAGGGCKDALQLLIEHSWERGERLDMQ

ALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREELKNK

GLLCKGNQDNKLDVEI

LEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDVADIFT 182

183 NKEEFNPDRFLLPHPEDASRFSFIPFGG

GLRSCVGKEFAKILLKIFTVELARHCDWQLLNGPPTMKTSPTVYPVDDLPARFTRFQGEI* 449

 

CYP26B1

TC200808 95% to 26B1 Length = 762

MLFEGLELVSALATLAACLVSVTLLLAVSQQLWQLRWAATRDKSCKLPIPKGSMGFPLIGETGHWLLQ

GSGFQSSRREKYGNVFKTHLLGRPLIRVTGAENVRKILMGEHHLVSTEWPRSTRML

LGPNTVSNSIGDIHRNKR

VFSKIFSHEALESYMPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLG

     FSLPVDLPFSGYRRGIQARQTLQKG

  77 LEKAIREKLQCTQGKDYSDALDIFIESSKEHGKEMTMQELKDGTLELIFAAYAT 238

 239 TASASTSLIMQLLKHPAVLEKLREELRAKGLLHSGGCPCEGTLRLDTLSGLHYLDCVIKE 418

 419 VMRLFTPVSGGYRTVLQTFELDGFQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFG 592

 593 QARSEDKDGRFHYLPFGGGVRTCLGKHLAKLFLKVL 700

     AVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNKILPETEAMLSATV*

 

CYP26C1

BE749195 CC773722 95% to 26C1 Length = 465 N-term

67  MLPWGLSCLSALGAVGTALLGAGLLLSLAQHLWTLRWTLSRDRASALPLPKGSMGWPFFGETLHWLVQ (0)

GSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTVLLGEHRLVRSQWPQSAHILLGSHTLLGAVGESHRQRRK (0) 

5955 ILARAFSRAALECYVPRLQRALRREVRSWCAARGPVAVYEAAKALTFRMAARILLGLR 6134

6135 LDEAQCSELARTFEQFVENLFSLPLDVPFSGLRK (0) 6251

GIRARDQLHRHLEEAIAQKLLEDKTAVEPGDALDGIIHSTRELGHELSVQELK (0)

46  ESAVELLFAAFSTTASASTSLVLLLLQHPAAIAKIQQELAAQGLGRACGCAPAASGGG 225

226 AGPPPDCGCEPDLSLAALGHLRYVGCVVKEVLRLLPPVSGGYRTALRTFELD (0) 381

GYQIPKGWNVMYSIRDTHETAAVYRSPPEGF

DPERFGTAGDDALGAAGRFHYIPFGGGARSCLGQELAQTVLQLLAVELVRTARWELATPA 2322

2321 FPAMQTVPIVHPVDGLRLFFHPLASPAAQDGQRS*

 

CYP27A1

TC206580 82% to 27A1 Length = 1614 missing about 58 aa at N-term

MNRMGALGSARLRWALLGRRAALPGLGSFGARAKAAIPSALPAAQAAEAPGTGPGDRRLRSLD

ELSGPGQLRLLFQLLVQGYVLHLHQLQVLNKAKYGPIWINRVGPQMHVHLASAPLLEQVM

RQEGKYPVRDDMKLWKEHRDQQGLSYGPFTTMGEQWYRLRQTLNQRMLKPAEAALYTDAL

NEVINDFMDQLKQLRAESASGDHVPDIAHQFYFFALEAISYILFEKRIGCLERSIPKDTE

TFVRSVGLMFHNSLFVTFLPTWTRPLLPFWKRYLDGWNTIFSFGKKLIDQKLEEIEAQLK

TENPEKTQISGYLHFLLTSG

QLSPREAEGSLPELLLAGVDTTSNTLTWALYHLSKNPEIQAALHKEVVGVVPAGQVPQHK 961

DLARMPLLKAVLKETLRLYPVVPVNSRVVVDKEIEVGGFLFPKNTQFVLCHYVISRDPDI 1141

YPEPDSFQPQRWLRKNQPDALKTQHPFGSVPFGYGVRACLGRRIAELEMQLLLTRLIQHYE 1324

VVLAPETGEVTSVARIVLVPNKKVGLRFLQRQS*

 

CYP27B1

MTQTLKFASRVFHRVRCPELGASLGSRGSESAPRVLADIPGPSTPGFLAELFCKGGLSRLHELQ (0)

VQGAARFGPVWLASFGTVRTVYLAAPTLVEQLLRQEGPRPERCSFSPWTEHRRRRQRACGLLTA (2)

EGEEWQRLRSLLAPLLLRPQAAARYAGTLHGVVRDLVRRLRRQRGLGAGPPSLVRDVAGEFYKFGLE (1)

GIAAVLLGSRLGCLEAEVPPDTETFIRAVGSVFVSTLLTMAMPSWLHRVVPGPWDRLCRDWDQMFAF (1)

AQQHVEQREAEVAMRNQSEKSEEDMGPGAHLTYFLLQKELPAASILGNVTELLLAGVDT (0)

VSNTLSWALYELSRHPEIQTALHAEITAALGPGSSTQPSATALSQLPLLKAVVKEVLR (2)

LYPVVPGNSRVPDRDICVGEYIIPKN (0)

TLVTLCHYATSRDPAQFPEPNSFRPARWLGEGPAPHPFASLPFGFGKRSCVGRRLAELELQMALAQ (0)

ILIHFEVQPEPGSAPVRPMTRTVLVPERSINLQFVDR*

 

CYP27C1

BE723057 86% to 27C1 Length = 519 DN538841.1 AAFC02145184.1

MALLARILKARPSPAAGRTGLLGGWDAGQPRLASSGLSARARAEDKGAGRPGASQAGGR

AEGPRSLAAMPGPRTLANLVEFFGKDGFSRIHEIQ

297 QKHTREYGKIFKPHFGPQFVVSVADRDLVAQVLRAEGASPQRANMGSWQEYRDLRXRS 476

477 TGLISA 494

EGEQWLKMRSVLRQRILKPKDVAIFAGEINQVIADLIKRIYFLKSQAEDGDTVTNINDLFFKYSME

GVATILYESRLGCLGNSIPQLTADYIEALALMFCTFKTSMYAGAIPRWLRLLIPKPWQEFCRSWDGLFEFS

QIHVDNKLRDIRCQMERGERVRGGLLTYLFLSQELTLEEIYANMTEMLLAGVDT

TSFTLSWAVYLLARHPEVQQALYREIVRNLGERHVPTAADVPKVPLVRALLKETLR

LFPVLPGNGRVTQEDLIVGGYLIPRG

TQLALCHYATSYEDENFPRAKEFRPERWLRQGNLRRVDNFGSIPFGYGARSCIGRRIAELEIHLLVIQ

LLQHFEIKTSPWTKTVHAKTHGLLMPGEPIHVRFVNRK

 

CYP39A1

TC197782 CB448362 79% to 39A1 Length = 911

192 MEFISPTVIIILSCVAVLLFLQWKNLRRPPCIRGWIPWIGAGFEFGKTPLEFIEKARIK

    YGPVFTVIVMGTRMTFVTEEEGINVFLKSKEINFELAVQNPVYHT

ASIAKNIFLKLHEKLYITVKGKMGIFNLYKFTGQLTEELQEQLQNLGTHGTTDLNKFMR

HLLYPVTVNILFKKGLFPTDERKIREFHQHFQAYDEGFEYGSQLPECLLR

NWSKSKKWLLALFEKNIPDIKTHKSAKENYP

TVMQAVLDLLEMEANEQKSPNYGLLLLWASLSHTVP

VAFWTFAFVLSHPNIHRTIMEGISSVFGTAG

KDKIKVSEDDLKKLPLIKWCILETIRLRAPGVIARKVLKPVKIL

DYTVPSGDLLMLSPFWLHRNPKYFPEPDLFKPERW

KEANLEKHAFLDCFMAFGSGKYQCPGRW

LALLEIQICIILIFYYYDCSLLDPLPKQ

SSLHLVGVQQPEGRCRIQFKQRK*

 

CYP46A1

AV663682 94% to CYP46 Length = 506 AV663681 CC523468 Bac end C-term

MSPGLLMLLGSAVLVVFGLCCTFVHRARSRYEHIPGPPRPS (2)

FLLGHLPYFWKKDEVCGRVLQDVFLDW (2)

AKKYGPVVRVNVFHKTSVIVTSPESVK (0)

KFLMSTKYNKDSKMYHAIQTVFGER (1)

LFGQGLVSECXYERWHKQRRIMDLAFSRSSLVGLMGTFNEKAEQLVEILEAQ 198

199 ADGQTPVSMQDMLTCATMDILAKAAFGMETSMLLGAQKPLSRKVKLILEGISASRNTLAK 378

379 FMPGKWKQLXETRESVRFLRQVGKEWVXRXRXALQRGEDVPADILTQILKAEEGAQD

367 DEILLDNFVTFFIAGHETSANHLAFTVMELSRQPEILARLQAEVDEVIGSKRHLDCEDLG 188

187 RLQYLSQVLKESLR 146

LYPPAWGTFRLLEEETLIDGVRVPGNTPLL

FSTYVMGRMDTYFEDPLTNPDRFGPGAPK

PKFTYFPFSLGPRSCIGQQFAQ

MEVKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLQPRGWQPAPPPPPC

 

CYP51A1

AC108175.2 CYP51 93% to human

22188 MLDLLQAGGSVLGQAMEQVTGGNLASMLLIACAFTLSLVYLFRLAVGHLAPPLPTGA (0) 22018

20194 KSPPYIVSPIPFLGHAIAFGKSPIEFLEDAYEK (0) 20096

17795 YGPVFSFTMVGKTFTYLLGSEAAALLFNSKNEDLNAEEVYSRLTTPVFGKGVAYDVPNT 17619

16395 VFLEQKKMLKSGLNIAHFRQHVSIIEKETKEYFKSWGESGEK 16270

14644 NLFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWLPLPSFR 14468

13586 RRDRAHREIKNIFYKAIQKRRESGEKIDDILQTLLESTYK 13467

13076 DGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLARDKTLQEKCFLEQKTVCGENLPPLTYDQ (0) 12882

11489 LKDLNLLDRCIKETLRLRPPIMTMMRLAKTP 11397

10597 QTVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLEDSPASGEKFAYVPFGA (1?) 10427

 7020 GRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPEKPIIRYKRRSK* 6841