Atlantic Salmon cytochrome P450s assembled from ESTs

 

The CYP2008 Bioinformatics class found and assembled P450s from

over 400,000 Salmo salar ESTs in the NCBI ESTdb.

This was their first assignment in the course.

 

54 CYP Proteins assembled from heterozygous, polymorphic Salmo salar ESTs

 

CYP1A   whole TRANSLATION plus partial paralog Fazle Chowdhury

CYP1B   partial seq. Found by Ali Ellebedy

CYP1C.a partial seq Found by Ali Ellebedy

CYP1C.b partial seq

CYP1C.c partial seq

CYP1D   not found in ESTs

CYP2K.a whole TRANSLATION Brandon Hale, Frank Zhang, Rubi Mahato

CYP2K.b partial seq.

CYP2K.c partial seq. Brandon Hale

CYP2M1  whole TRANSLATION ortholog to trout 2M1 Yanhua Qu, Brandon Hale, Rubi Mahato, Mekel Richardson, Julie Philippart

CYP2M2  missing 27 aa in middle 68% to 2M1 Yanhua Qu

CYP2P   partial seq. Julie Philippart

CYP2R1  not found in ESTs

CYP2U1  missing 60 aa in middle found in 2D6 search Frank Zhang

CYP2X.a partial seq

CYP2X.b partial seq

CYP2X.c partial seq

CYP2X.d C-term

CYP2Y.a partial N-term Brandon Hale

CYP2Y.b partial seq. Brandon Hale

CYP2AD  partial seq. Mekel Richardson

CYP2AE  partial seq. N-term

CYP3A.a whole TRANSLATION Fazle Chowdhury

CYP3A.b partial 91% to CYP3A.a Fazle Chowdhury

CYP3A.c whole TRANSLATION 63% to 3A48

CYP4F   whole TRANSLATION plus partial paralog? Mitzi Dunagan, Rubi Mahato, Julie Philippart

CYP4T   whole TRANSLATION plus paralog Mitzi Dunagan, Akshata R. Udyavar, Brandon Hale, Julie Philippart, Ali Ellebedy

CYP4V.a partial seq. Akshata R. Udyavar

CYP4V.b partial seq. Ryoko Tsukahara

CYP5A   partial seq with 3 missing pieces Yanhua Qu

CYP7A   partial seq. Mekel Richardson

CYP7C.a whole TRANSLATION Ryoko Tsukahara, Mekel Richardson

CYP7C.b partial seq. 88% to CYP7C.a

CYP8A1  partial seq.

CYP8A2  partial seq.

CYP8B   partial seq.

CYP11A  partial seq.

CYP11B  partial seq.

CYP17A  not found in ESTs

CYP19A  brain partial seq

CYP19A  ovary partial seq

CYP20   whole TRANSLATION

CYP21   not found in ESTs

CYP24   partial seq. missing middle region Ryoko Tsukahara

CYP26A1 partial seq. C-term Frank Zhang

CYP26B  not found in ESTs

CYP26C1 partial seq. N-term mine

CYP27A.a partial seq Akshata R. Udyavar

CYP27A.b partial seq

CYP27B  not found in ESTs

CYP27C  partial seq Akshata R. Udyavar

CYP39A1 partial seq Ryoko Tsukahara

CYP46   whole TRANSLATION plus partial paralog Mitzi Dunagan, Brandon Hale

CYP51   partial seq.

 

DNA translator

http://ca.expasy.org/tools/dna.html
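The translation step that a DNA translator performs on an EST can be sketched in a few lines of Python. This is a minimal illustration that assumes the standard genetic code; the names `translate` and `three_frames` and the 12-base example string are hypothetical and are not taken from any EST on this page.

```python
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, frame=0):
    """Translate one forward reading frame (0, 1 or 2); unknown codons become X."""
    dna = dna.upper().replace("U", "T")
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def three_frames(dna):
    """Return the translations of the three forward reading frames."""
    return [translate(dna, frame) for frame in range(3)]


# Hypothetical 12-base example, not a real EST.
for frame, protein in enumerate(three_frames("ATGGTGCTGATG")):
    print(frame, protein)
```

A single inserted or deleted base moves every downstream residue into another frame, which is why an EST with a sequencing error may need to be read in more than one frame.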

 

 

 

CYP1A1      Salmo salar (salmon)

            AF361643

            Christopher Rees, Weiming Li

            submitted to nomenclature committee Nov. 9, 2001

            a second gene is being isolated so this is called 1A1

            rather than just CYP1A.  This does not imply orthology to the

            mammalian 1A1, 1A2.  The CYP1A gene duplications in fish and mammals

            occurred independently.

 

>CYP1A AF361643       

Salmo salar cytochrome P450 1A (CYP1A) mRNA, complete cds

MVLMILPIIGSVSVSEGLVAMVTLCLVYMFMKYKHTEIPEGLKR

LPGPKPLPIIGNVLEVHNNPHLSLTAMSERYGSVFQIQIGMRPVVVLSGSETVRQALI

KQGEDFAGRPDLYSFKFINDGKSLAFSTDKAGVWRARRKLAMSALRSFATLEGSTPEY

SCALEEHVCKEGEYLVKQLTSVMDVSGSFDPFRHIVVSVANVICGMCFGRRYSHDDQE

LLSLVNLSDEFGQVVGSGNPADFIPILRYLPNRTMKRFMDINDRFNAFVQKIVSEHYE

SYDKDNIRDITDSLIDHCEDRKLDENANIQVSDEKIVGIVNDLFGAGFDTISTALSWA

VVYLVAYPEIQERLHQELTEKVGLNRTPRLSDKTNLPLLEAFILEIFRHSSFLPFTIP

HCTIKDTSLNGYFIPKDTCVFINQWQVNHDPELWKEPSLFNPDRFLSADGTELNKLEG

EKVLVFGMGKRRCIGEAIGRNEVYLFLAILLQRLRFQEKPGHPLDMTPEYGLTMKHKR

CQLKASLRPWGQEE

 

>CYP1A AF364076 has an extra C base after NGYF that causes a frameshift

Salmo salar cytochrome P450 1A mRNA, complete cds

Differs from AF361643 at blue and green

matches DR696646.1, DY719050.1 at blue

matches C-term at blue C CB505556.1, DY719520.1,

CA044359.1, DY692330.1, DY692329.1, CK878675.1,

EG852844.1, BQ036391.1

AM402919.1 matches grayed aa.

MVLMILPIIGSVSVSEGLVVMVTLCLVYMIMKYMHTEIPEGLKR

LPGPKPLPIIGNVLEVHNNPHLSLTAMSERYGSVFQIQIGMRPVVVLSGSETVRKALI

KQGEDFAGRPDLYSFKFINDGKSLAFSTDKAGVWRARRKLAMSALRSFATLEGSTPEY

SCALEEHVCKEGEYLVKQLTSVMDVSGSFDPFRHIVVSVANVICGMCFGRRYSHDDQE

LLSLVNLSDEFGQVVGSGNPADFIPILRYLPNRTMKRFMDINDRFNAFVQKIVSEHYE

SYDKDNIRDITDSLIDHCEDRKLDENANIQVSDEKIVGIVNDLFGAGFDTISTALSWA

VVYLVAYPEIQERLHQELTEKVGLNRTPRLSDKTNLPLLEAFILEIFRHSSFLPFTIP

HCTIKDTSLNGYFHSQGHLCLHQPVAGQPPGAVEGAFFIQPYRFLSADGTELNKLEGE

KVLVFGMGKRRCIGEAIGRNEVYLFLAILLQRLCFQEKPGHPLDMTPEYGLTMKHKRC

QLKASLRPWGQEE

 

>CYP1A revised AF364076 Corrected seq with frameshift removed

found by Fazle Chowdhury

MVLMILPIIGSVSVSEGLVVMVTLCLVYMIMKYMHTEIPEGLKR

LPGPKPLPIIGNVLEVHNNPHLSLTAMSERYGSVFQIQIGMRPVVVLSGSETVRKALI

KQGEDFAGRPDLYSFKFINDGKSLAFSTDKAGVWRARRKLAMSALRSFATLEGSTPEY

SCALEEHVCKEGEYLVKQLTSVMDVSGSFDPFRHIVVSVANVICGMCFGRRYSHDDQE

LLSLVNLSDEFGQVVGSGNPADFIPILRYLPNRTMKRFMDINDRFNAFVQKIVSEHYE

SYDKDNIRDITDSLIDHCEDRKLDENANIQVSDEKIVGIVNDLFGAGFDTISTALSWA

VVYLVAYPEIQERLHQELTEKVGLNRTPRLSDKTNLPLLEAFILEIFRHSSFLPFTIP

HCTIKDTSLNGYFIPKDTCVFINQWQVNHDPELWKEPSSFNPDRFLSADGTELNKLEGE

KVLVFGMGKRRCIGEAIGRNEVYLFLAILLQRLCFQEKPGHPLDMTPEYGLTMKHKRC

QLKASLRPWGQEE
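One way to see where the frameshift in AF364076 starts and ends is to count the residues that the deposited and corrected translations share at each end of the affected line. The sketch below is only an illustration: the helper name `shared_ends` is hypothetical, and the two strings are the lines containing NGYF copied from the two AF364076 records above.

```python
def shared_ends(a, b):
    """Return (prefix_len, suffix_len) of the residues two sequences share at each end."""
    limit = min(len(a), len(b))
    prefix = 0
    while prefix < limit and a[prefix] == b[prefix]:
        prefix += 1
    suffix = 0
    # Stop before the suffix can overlap the shared prefix.
    while suffix < limit - prefix and a[-1 - suffix] == b[-1 - suffix]:
        suffix += 1
    return prefix, suffix


deposited = "HCTIKDTSLNGYFHSQGHLCLHQPVAGQPPGAVEGAFFIQPYRFLSADGTELNKLEGE"
corrected = "HCTIKDTSLNGYFIPKDTCVFINQWQVNHDPELWKEPSSFNPDRFLSADGTELNKLEGE"

prefix, suffix = shared_ends(deposited, corrected)
print("shared N-terminal residues:", prefix)
print("shared C-terminal residues:", suffix)
print("out-of-frame stretch:", deposited[prefix:len(deposited) - suffix])
```

The stretch between the shared ends is the part of the deposited translation that was read in the wrong frame.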

 

DY692329.1, CB505556.1, DY719520.1, CK878675.1, CB504968.1,

DY692330.1

HCTVKDTSLNGYFIPKDTCVFINQWQVNHDPELWKEPSSFNPDRFLSADGTELNKLEG

 

>CYP1A There seem to be two very similar sequences with only minor changes

DY692329.1, CB504968.1, CB505556.1

GRRYSHDDQELLGLVNLSDEFGQVVGSGNPADFIPILRYLPNRTMKRFMDINDRFNTFVQ

KIVSEHYESYDKDNIRDITDSLIDHCEDRKLDENANVQVSDEKIVGIVNDLFGAGFDTIS

TALSWAVVYLVAYPEIQERLHQELKEKVGMTRTPRLSDKTNLPLLEAFILEIFRHSSFLP

FTIP

 

>CYP1B EG855910.1 opp end = EG855909

70% to 1B1 fugu

EG877309.1 opp end = EG877308, DY700468.1, EG856153.1

Found by Ali Ellebedy

GTDAFIMALDHSQDSSPGVSPGKDYVPPTIGDIFGASQDTLSTALQWIILILVRFPHIQL

RLQEEVDKVVYRSRLPTIEDQSQLPYVMAFIYEVMRFTSFVPLTIPHSTITDTTIMGYTI

LKDTVIFLNQWSINHDPARWTQPETFDPLRFLDQDSSLNKDLASSVLIFSLGKRRCIGEE

LSKMQLFLFTALLAHQAHFSPDPDKLPTIDYTYGLTLKPNNFSIAVNLRDSMDVLEEASQ

KPLYGETQEDTGNSRSD*

 

>CYP1C.a 81% to CYP1C2

EG933863.1, EG762705.1, DY701849.1, EG806773, EG890885.1

DW472969.1, EG765700.1, EG759216.1

Found by Ali Ellebedy

GVVLNGDASIREALVQHSTEFAGRPNFVSFQSVSGGNSMTFTNYSKQWRTHRKIAQSTIR

AFSSANSQTKKAFEQHIVAEATELIEAFLKLKGQFFNPAHELTVAAANVICALCFGKRYG

HDDIEFRTLLGSVDKFGETVGAGSLVDVMPWLQYFPNPVRRVYQNF

KDLNKEFFTFVRDKVVEHRETFDPEVTRDMSDAIIGVIDKADSDTGLTEAHTEGTVS

DLIGAGLDTVSTCLHWMLLLLVKYPNIQTKLQEQIDKVVGRDRLPCIEDKASLAYLDAVI

YETMRYTSFVPLTIPHSTTSDVTIEGFHIPKDTVVFINQWSVNHDPLQWKDPHLFDPSRF

LDENGALDKDLTSSVMIFSAGKRRCIGDQIAKVEVFLFSAILIHQCTFENNPSQDLSLDC

SYGLTLKPLNYKISAQLRGELLTGA*

 

>CYP1C.b DW541384.1 DY721531.1

MALLDTEFGVKGSSIIREWSGQVQPALVASFVFLFCLEACLWVRNLRLKRRLPGPFAW

PVVGNAMQLGQMPHITFSKLAKKYGNVYQIRLGCNNIVVLNGDTAIREALVQHSTEFAGR

PNFISFQMISGGRSLTFTNYSKQWKMHRKIAQSTIRAFSSANSQTKKAFEHHVLGECMDL

VQVFLRRSADGRYFNPAHEFTVAAANVICALCFGKRYGHDDIEFRTLLGRMDRFGETVGA

GSLVDVMPW

 

>CYP1C.c CX353507 C-term 86% to 1C.a might be the opp end of CYP1C.b

EG763264.1 2aa diffs. EG885248.1 CX154036.1

STIFQWILLLLIKYPNIQTKLQEQIDKVVGRDRLPSIEDKASLAYLDAVIYETMRYTSFV

PLTIPHSTTSDVTIEGFHIPKDTVVFINQWSVNHNPLKWKDPHLFDPSRFLDENGALDKD

LTNSVMIFSTGKRRCIGNQIAKVETFLFTAVLLHQCTFESNPSEALTLDCSYGLTLKPLH

YTITTKLRGKLLALVSPA*

 

>CYP2K.a DY730660.1 47% to 2K16 found by Brandon Hale, Frank Zhang,

Rubi Mahato

EG806893.1, DW567947.1, CK889346.1, EG853617.1, DY741080.1

EG929755.1, DY741081.1, EG851124.1, EG779526, BG935485.1

DN047609.1, CB510582.1, EG851123.1, EG853616.1

MSVLELFSLSGMSVMFITLNLIILIFIINRNTNPKNFPPGPRGLPLLGNTLNLDLKKPYQTMMEMKDKFGPVFSI

QMGLRKIVVLCGYEMVKEALVTQADQFAERPDIPLFKQITRGNGIIFGHGDSWRTIRRFT

LTVLRDLGMGKRNIEEKIIEESENLVKSFAAHNGDGFQTTIPLNAAASNIIVSLLMGHRM

EYDDPIFIKLLEMNYESFRLASGPFIQLYNMYPVIHPLPGPHHKVLAYQDNLKAFFRKSF

IQHRQILDENDSRSYIDAFLKKQQE

EKDNPSSHFHEWNLLCSVTNMFVAGTETTSSTLAW

ALVIMIKYPEIQSKVHEEIDKVISGSTPRIQHRQMMPFTDAVIHETQRFADILPMG

LPHETTADISFKGFFIPKGTYIIPLLRSVHRDKAHWEKPDDFYPHHFLNVD

DKFVKREAFMPFSAGRRVCVGETLARMELFLFYTSLMQRFSFLPP

IGMTADDVDISTCGGLGLAAPPVKVRALPRFHDT*

 

>CYP2K.b DV106223 75% to 2K10, 59% to 2K.a

CK889250.1, CA037986.1

EG843507.1

PWLGPWINNLTRLKKNIADMKMDVTELVRGLKETLNPQMCRG

FVDSFLVRKQTLEESGNMDSLYHDDNLVISVT

NMFGAGTDTTGTTLRWCLLLMAKYPHIQDQVQEEISRVIGSRQPLVEDRKNLPYTDAVI

HETQRLASILPIAIPHTTSRDITFQGYFIKKGTSVIPLLTSVLQDNSEWESPHTFTPSHF

LDEHGGFVKRDAFMAFSAGHRVCLGEGLARMELFLFFTSLLQRFRFTPPPGVTEDDLDLT

PFVGFTLNPSPHQLCAVSRL*

 

>CYP2K.c EG832531.1 opp end = EG832542, EG883217.1, EG842866.1

EG883216.1, EG842865.1, DN162836.1, CA061778.1

Found by Brandon Hale  

MSLIEGLLQTSSTVTLLGTVLFLLVLYLRSSGSSSEEQGKEP

PGPRPLPLLGNMLQLDFKKPYCT

LCELSKKYGSIFTVHFGPKKVVVLAGYKTVKQALVNQAEDFGDRDITPV

FYEFSQGHGILFANGDSWKEMRRFALTNLRNFGMGKKGSEEKILEEIHYLIEVLEKHEGK

AFDTAQPVIYAVSNIISAIVYGSRFEYTDPLFTGMADRSNESIHLTGSASIQIYNMF

(157 aa gap)

GIFHQKKGQSVIPLLT

SVLQDDSEWESPHSFTPSHFLDEHGRFVKRDAFMAFSAGRRVCLGEGLARMELFLFFTSL

LQRFRFTPPPGVTEDDLDLTPSVGFTLNPSPHQLCAVSRL*

 

>CYP2M1 ortholog to trout CYP2M1 95% to 2M1

CX353908 DW539128.1 DW562631.1 DY725141.1 DY728977.1

DY732623.1 DW541770.1 DY706581.1, DY701550.1 DW549411.1,

CK896372.1, BQ036463.1, EG355231.2, CA053315.1, DY725142.1

AM397486.1

found by Yanhua Qu, Brandon Hale, Rubi Mahato, Mekel Richardson,

Julie Philippart

MDVLHILQTNFVSI

VIGVVVIILLWMNRGKQSNSRLPPGPAPIPLLGNLLGMDVKAPYKLYMELSKKYGSVFTV

WLGSKPVVVISGYQAIKDAFVTQGEEFSGRANYPVIMTVSKGYGVLVSSGKRSKDLRRFS

LMTLKTLGMGRRSIEERVQEEAKMLV

KAFSEYGDSVVNPKELLCNCVGNVICSIVFGH

RFENDDPMFQLIQKAVDAYFNVLSSPIGAMYN

MFPRIIWYFPDKHHEMFAVVNKAIAYIQEQAEIRLKTLDTSEPQDFIEAFLVKMLEEKDD

PNTEFNNDNMVMTAWSLFAAGTETTSSTLRQSFLMMIKYPHIQASVQKEIDEVIGSRVPT

VDDRVKMPYTDAVIHEVQRYMDLSPTSVPHKVIRDTEFYNYHIPEGTMVLPLLSSVLADP

KLFKNPDEFDPENFLDENGVFKKNDGFFAFGVGKRACLGE

ALARVELFLFFTSLLQRFTFTGTKPPEEINIEPACSSFCRMPRSYDCYIKLRTEE*

 

>CYP2M1 paralog CK888131

95% to 2M1 above, paralog seq

E*LCNCVGNVICSIVFGHRFENDDPMFQLIQKAVDAYFNVLSSPIGAMYNMCPRIIWCFP

DKHHEMFAVVNKGIAYIQEQADIRLKTLDTSEPQDFIEAFLVNMLAEKDDPNTEFNNDNM

VMTAWSLFAAGTESTSSTLRQSFLMMITYPHIQASVQKEIDEVIGSRVPTVDDRVKMPYTD

 

>CYP2M2  68% to CYP2M1 found by Yanhua Qu

EG828494.1, opp end = EG828495 not CDS

DY732623.1, EG830130.1 opp end = EG830129

MDLLHVLQTNILSIIMAIVVIILLWKYMGKQSSYGRLPPGPSPIPLLGNLLQMDLKRPDMSYMEFS

KKYGSVFTVWLGSKPVVVISGYQAIKDAFVTQGEDFNGRANSAITPKLINEHGVGLSNGQ

RWKTLRSFSLMSLKNLGKGCRSLEERVQVEAKSLVKAFSEYGDSTVNPKELLFHSIINLF

WSIVFGRRFEYNDPEFQILYKPVYTYFDMLKSKVAMLYNISPRIVECFPGKHHELF

KAIDKAKAYIREE

(27 aa gap)

KGQPETEFNYDNLFPCVWDLFAAGTETQSSTLSHACLMMIKYPDIQEKVQKEIDEVIGSN

RVPTVDDRHKMPYTDAVIHEIQRSMSLAPIAFPHQMTRDTTFHNYHIPKGTTVFPLLSSV

LFDPKLFKNPDEFDPENFLDENGVFKENNGFLVFGLGKRYCLGDGMGIPRTVLFLFFTSL

LQRFTFRGTKPPEEIDASVVSYFHGRLARPYTCYVKLRTPNI*
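Records such as CYP2M2 mix sequence lines with gap markers like "(27 aa gap)". A rough estimate of how much of a protein is covered can be made by summing the residues present and the residues declared missing. This is a hedged sketch: the function name `assembled_length` is hypothetical, runs of X are treated as missing residues, and markers written in other ways (for example "missing 42 aa at end") are ignored.

```python
import re

# Matches "(27 aa gap)" and "(about 120 aa gap)".
GAP = re.compile(r"\((?:about\s+)?(\d+)\s*aa gap\)")


def assembled_length(lines):
    """Return (residues_present, residues_in_gaps) for one record's body lines."""
    present = gaps = 0
    for line in lines:
        line = line.strip()
        match = GAP.fullmatch(line)
        if match:
            gaps += int(match.group(1))
        elif re.fullmatch(r"[A-Z]+\*?", line):
            unknown = line.count("X")  # X placeholders stand for missing residues
            present += len(line.rstrip("*")) - unknown
            gaps += unknown
    return present, gaps


present, gaps = assembled_length(["KAIDKAKAYIREE", "(27 aa gap)", "MDLLH*"])
print(present, "residues present,", gaps, "residues in gaps")
```

Header and accession lines contain digits or lower-case letters, so they are skipped by the sequence pattern; only the body lines of a record should be passed in.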

 

>CYP2P.a fragment  Atlantic salmon

                GenEMBL BI468047 EST00457

                77% to CYP2P10

EG901171.1 opp end = EG901174, CA044176.1

found by Julie Philippart

1   DPSSPRDFIDCFLNEIEKCEDDTRAGFNLENLSFCTLDLFVAGTETTSTTLYWGLLFMIN 180

181 YPEIQAKVQAEIDAVVRSSRQPSMEDRDSMPYTDAVIHETQRMGNIIPLNVSRMATKDTE 360

361 VGGYTIPKNTIVLGTLQSILFDESEWETPHTFNPGHFLDQEGKFRKRDAFLPFSLGKRVC 540

541 PGEQLAKMELFLFFTSLLQRFTFFSPPGVEPSLDFQMGATHSPKPYQLCATPR*
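The numbers that flank the rows of this record appear to be nucleotide coordinates on the EST, since each span is three times the residue count (an inference from the rows, not something the page states). A small sketch for stripping the coordinates and checking that relationship follows; the names `parse_row` and `coordinates_consistent` are hypothetical.

```python
import re

# "start SEQUENCE end", where the end coordinate may be absent on the last row.
ROW = re.compile(r"^\s*(\d+)\s+([A-Z*]+)(?:\s+(\d+))?\s*$")


def parse_row(line):
    """Split a coordinate row into (start, residues, end); end may be None."""
    match = ROW.match(line)
    if match is None:
        return None
    start, residues, end = match.groups()
    return int(start), residues, int(end) if end else None


def coordinates_consistent(start, residues, end):
    """True when the nucleotide span equals three bases per residue."""
    return end - start + 1 == 3 * len(residues)


row = parse_row("1   DPSSPRDFIDCFLNEIEKCEDDTRAGFNLENLSFCTLDLFVAGTETTSTTLYWGLLFMIN 180")
print(row[0], len(row[1]), row[2], coordinates_consistent(*row))
```

Rows that carry no end coordinate, such as the last row of the record, parse with `end` set to `None` and should not be passed to `coordinates_consistent`.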

 

>CYP2U1 DW576911.1 found by Frank Zhang

Top seq is 59% to CYP2U fugu

131  MELWHELLGTSALSHVCILALTVFVAVYYIMHTFRKHQDFSNIPPGPKPLPIVGNFGGFL  310

311  VPNFIWRRLRREDDPKSKTRALISPPVIITEQAKVYGNIYSMWVGSQLVVVLNGYEVVR  487

488  DALSNRADVFSDRPEIPTVTIMTKRKGIVFAPYGPVWRRQRKFCHTTLRNFGLGKLSLEP  667

668  CILEGLAVVKSELLRLSEQDTEGSGVDLTPLITNSV  775

(60 aa gap)

DY739225, DY738492.1, 75% to CYP2U fugu

QVERDITAFLKQIITRHRETLDPANPRDLIDMYLVEMLAQEAAGETDSSFSEDYLF

YIIGDLFIAGTDTTTNTVLWMILYMAVFPDIQERVQAEMDAVVGPDRVPSLTDKGSLPFT

EATIMEVQRMTVVVPLAIPHMASETTEFRGYTIPKGTVIIPNLWSVHRDPTVWEQPDDFN

PSRFLDDQGNLLRKECFIPFGIGRRVCMGEQLAKMELFLMFTSLLQAFSFSLPEGLAPPP

MHGRFGLTLAPCPYTVAVRPRR*

 

>CYP2X.a CA042231 80% to 2X8 danio C-term, CK887325.1 

GEISHIIAPSAKSVGKSMNPQVLFHNAASNIICLVLFGSRYDYNDEFLKTFVKLYTENAK

IANGPWAMLYDTVPMLRYLPLPFQKAFKNATCVKQMSVGLITQHKETRNPGAPRDFIDCY

LDELDKRGDDGSSFSEAQLIMYVLDLHFAGTDTTSNT

LLTAFLYLMTHPEIQERCQQEIDTVLEGKEHASFEDRHRMPYTQAVTHESQRIASTVPLS

VFHSTTRDTEVMGYSIPKGTLIIPNLTSVLSEEGQWKFPHEFNPLNFLNEQGEFEKPEAF

MPFSAGPRMCLGEGLARMELFLIMVT

 

>CYP2X.b DW532476 57% to CYP2X10, 58% to CYP2X.a mid region

CB504239

HEGIQDLTKIAIGPWAMFYDIIPALRALPLPFKKAFHIYDEIKEHAQKAVTNHKSSRVSG

EPRDLIDCYLDQIDMTGDGGSTFNDVQMVFLLIDLFLAGTDTTSNTLRCAVLHLMTNQHI

QERCHREIEDVLEGRSCALFEDRHAMPYVQAMIHESQRMADTVPLSVFHMTSCNTQLQGY

QLP

 

>CYP2X.c N-term 2 frameshifts EG647534 65% to 2X11 danio

LILLLFFFIRVRRPKNFPPGPRPLPILGNLLQLDPANPLKDLERLKRRYGNVFSLYIGSR

PAVVLNGLEVVREALVTRAAEYAGRPTHLMISHLFKGKGVVMANYGSSWRDHRRFALTTL

RNFGLGKRS

MEERILEE

VSHICTELESSAGSSMDPQHLFHLAASNIICS

 

>CYP2X.d C-term CA042805  70% to CYP2X1

DW532475.1, CA767912.1, CA038044.1

TRMIHESQRMADTVPLSVFHMTSCNTQLQGYQLXQGTMVIPNLSSVLHEEGQWKFPQEFN

PDNFLNEDGEFVKPEAFLPFSAGSRVCLGEGLARTELFLILVTLLRRFQFVWPEQEGAPD

FRKVFGIAQAQKPFRLGVRLRSSQ*

 

>CYP2Y.a  N-term Found by Brandon Hale  

Top = 68% to 2Y1

compiled from:

salmon EST sequence DY725570.1 opp end = DY725569 no CDS

 

MEMCSSVVLGGLVLLLLLWLFRLRRQRHVHLPPGPCALPLLGNL

HQIDKQAPFKTLTKWSGVYGPVMTVYLGPQRAVVLVGYEAVKEALVDQAEDFTGRAPVPFLVLVTR

GYGLAISNGERWRQLRRFTLTTLRDFGMGRKRMEEWIQEESQHLIDSLDATKAVPFDPHPFLSRTV

SNVICSLVFGQRFGYDDDNFLHLLNILSAVLRFGSSPCEQLYNIFPWLMERLPGRQ

 

>CYP2Y.b  DY717079 77% to CYP2Y4 danio C-term Found by Brandon Hale  

QRVPQMEDRKSLPFTEAVIHEVQRFLDIVPLNLPHYATKNISFRGYTIPQGTVILPMLHS

VLRDQNHWATPTTFNPNHFLDQNGNFQTNPAFLAFSAGKRACVGESLARMEIFLFLVSLL

QHFSFSCPGGPDSIDLSPEFSSFTNVPRHYQLIATPR*

 

>CYP2AD.a AM402774 53% to CYP2AD2 danio aa 112-193

VTFKGIGITLSNGHMWKNQRKFAHTHLRYFGEGKKGLEHYIQLESNFLCEAFREEQGGGF

NPHYILNNAVGNIISCVVFGVTFKY

 

>CYP2AD.b AM397512 C-term 68% to CYP2AD2 danio found by Mekel Richardson

ETQRIGNILPLGFPKMASKDTKLGEYFIPKGTVVNTNLSSVLFDKDEWETPNTFNPEHFL

DSEGQFRRRDAFLPFSAGKRACVGEHLARMELFLFFSSLLQRFSLSPVSGAMPSLDGVLG

FTHSPAEFLVRALPR

 

>CYP2AE N-term CB510051 61% to CYP2AE1 danio

MLEALVCLVGQWIDTKGVLLFLLVLLVTKYIHDLPPKNYPPGPFPLPFVGNMLNISIK

DYIGSFKKFVESYGDVTTLDLGGGNRCVLLSGLRGFKEAFVDQADTFTDRPSYPLNDRIS

RGLGLISSNGHMWRQQRRFAVSTLKYFGVGKKTLETSILQESHFLCDVYLA

 

>CYP3A.a found by Fazle Chowdhury

DW557151.1, EG828682.1 EG885620.1, EG885621.1, DQ361036

59% to 3A48, 60% to CYP3A.c

MSFLPYFSAETWTLLALLITLIVVYRYWPYGVFTKMGIPGPKPLPYIGTMMEYKKGFTNFDTECFQKYGRIWGIYDGRQ

PVLCIMDKSMIKTILIKECYNIFTNRRNFTFLSGELFDAVTIAEDDTWRRIRSVLSPSFTSGRLKEMFGI

MKRHSANLLNGMKKQADKDQAIEVKEFFGPYSMDVVTSTAFSVDIDSLNNPSDPFVYNIKKMLKFDMFNP

LLLLIVLFPFIGPILDKMKFTFVPTEVTDFFYASLAKIKSGRDTGNSTSQVDFLQLMIDSQKGNDTKTGQEQTK

GLTDHEILSQAMVFIFAGYETSSSTMSFLAYNLATNPHTMTKLQEEIDTVFPNKAPIQYEALMQMDYLDCVLN

ESLRLFPVSPRLERVAKKTVEINGIVIPKDCVVLVPTWTLHRDPEIW

SDPEEFKPERFSKENKESIDPYTYMPFGAGPRNCIGMRFALIMIKLAMVEILQSFTFSVCDETEIPLEMDKQ

GLLMPKRPIKLRLEPRSNTPSNTTAISF*

 

>CYP3A.b  second gene 91% to first gene, found by Fazle Chowdhury

EG895305.1 opp end = EG895306

EG902122.1, opp end = EG902123

MSFLPYLSAETWTLLALLITFIVVYGYWPYGVFTKMGVPGPKPLPYFGTMME

YRKGFTNFDTECFQKYGRIWGIYDGRQPVLCTMDKSMIKTILIKECYNIFTNRRNFMFLN

GELFDALSFAEDDTWRRIRSVLSPSFTSGRLKEMFGIMKRHSANLLNGMKKQADKDQTIE

VKEFFGPYSMDVVTSTAFSVDIDSLNNPSDPFVSNVKKMIKFDMFNPLLLLFV

LFPFIGPILEKMKFSFFPSAVTDFFYASLAKIKSGRD

 

>CYP3A.c third gene

EG943017.1 DY734442.1, EG943028.1

63% to CYP3A48, 60% to CYP3A.a

MWYFLSVSTETWTLIAILFALFMWYGYAPYGFFKKLGIQGPKPLPFIGTFLEY

KRGLFIFDNDCYQKYGDVWGLFDGRLPVMGVMDTAMIKTILVKECYSVFTNRRDFGLNGE

LHDAVTTVEDNEWKRIRSTLSPSFTSGRLKDMFKIMTQHSRNLVKFLQKKVDSDEVLEVK

EIFGAYSMDVVASSAFSVDIDSINQPNDPLVVNIKKLLKFNLLSPLLILVVLFPFMRPLL

EKCKVSFLPAGAMKFFYSFLRKIKAERSKNVHNT

RVDLMQLMVDSQIPQDHSSKEAAHK

GLTDHEILSQALTYIFGGYETSSSTLGYLSYNLATNPDVQAALQDEIDKIFPDKAPPTYE

GLMQMEYLDMVINETLRVYPIISRLERVAKATVEVNGVTIPKGTVVMIPIMVLHHHPTHW

PKPEVFRPERFSKENRENIDPYTFLPFGMGPRNCIGMRFALQTIKLVIVEILQNFSFVTC

KETEVPLELFDNGFVAPTKPIKLKLVPRVLAPS*

 

>CYP4F found by Mitzi Dunagan, Rubi Mahato, Julie Philippart

DW540843.1, DY716972.1, DW540844.1, DW573410.1, DW584204.1

CA057516.1, DW573720.1, DY717493.1

61% to 4F43 Danio, 62% to 4F28 Fugu, 59% to 4F3 human

MAVIDAVLDRLLDGLLSLGRLLSPLY

SLFVLLQLSALVVLLLLSLRVVCLLWSHAQFTRRLQCFSKPPTQNWLMGHLGEMRSTEEG

LQAVDQMVRTYSHSCSWFLGPFYSLVRLFHPDYIKPLLLAPASITVKDELFYGFLRPWLG

QSLLLSNGEDWSRRRRLLTPAFHFDILKNYVKIFNHSSDIMHF

KWRRLVAEGESRQDMFSHISLMTLDSLLRCTFSYNSNCQESSSEYIAAIFELSTLVIE

RRGRILHHWDWLYWRSPEGQRFKQACNVVHRFTRTVVQERRAQLLHQGEPESHTDTTGGE

EKRKRVADFIDLLLLSKDEEGHGLTDEGIKAEADTFMFGG

HDTTASGISWVLYNLSQHQDYQDRCRAEVNDLLQDRETEDLDWEDLSSLPFTTMCIKESL

RLHSPVSAVTRRYTKDITVPGGRVIPQGSICLVSIYGTHHNPEIWPDPDVYNPMRFDPEN

SKDRSSHAFIPFSSGPRNCIGQKFAMAELRVVVALTLRRFHLTPGGVEVRRLPQLVLRA

EGGLWLTLETLDTPQD*

 

>CYP4F paralog? 94% DW535744.1 C-term (only one, poor seq?)

opp end = DW535743.1

XXXXXXXXXXXXXX

LLSLGRLLSPLYSLFVLLQLSALVVLLLLSLRVVCLLWSHAQFTRRLQCFSKPPTQNWLM

GHLGEMRSTEEGLQAVDQMVRTYSHSCSWFLGPFYSLVRLFHPDYIKPLLLAPASITVKD

ELFYGFLRPWLGQSLLLSNGEDWSRRRRLLTPAFHFDILKNYVKIFNHSSDIMHFKWRRL

VAEGESRQDMFSHISLMTLDSLLRCTFSYNSNCQESSSEYIAAIFELSTLVIERRGRILH

HWDWLYWMSPEGQRFKQACNVVHRFTRTVVQERRAQLLHQGEP

(70 aa gap)

YQDRCRAEVNDLLQDRETEDLDWEDLSSLPFTPMCIKEFLRLHSPVFAVTRQYTKDITVP

GGRVIPQGSICLVSIYGTHHNPEIWPDPDVYNPMRFDPENSKDRSSHAFIPFFSGPRNCI

GQKFAMGELRVVVGLTLRRFRLTPGGVEVRGLPQLVLRAEGGLWLTLETLDTPQD*

 

>CYP4T salmon found by Mitzi Dunagan, Akshata R. Udyavar, Brandon Hale,

Julie Philippart, Ali Ellebedy

DY733068.1, DY712589.1, DY740889.1, DW555897.1, EG891086.1, DY713326.1

DY739259.1, EG891086.1

MELFETLKKVTLDSYRIHHLVAIFSLVYVILKISKLIVK

RNEWIRALETFPGPPKHWLFGHVREFKQDGNDMYKVVKWGESYPLAFQMWFGPFVSILNIHHPD

YVKTILASTEPKDDLSYRFLIPWIGDGLLVSGGQKWFRHRRLLTPGFHYDVLKPYVKMMSDSAK

TMLDKWETHSKSDESFELFEHVSLMTLDSIMKCAFSSNTNCQTVRGGESGTNSYIKAVYELSDL

VNVRLRTFPYHSEWIFQLSPHGYKYRKACNVAHSHTEEIIRKRKEALKDEKELGRIQAK

RNLDFLDILLCARDEDQQGLSDEAIRAEVDTFMFEGHDTTASGISWTLYSLACNPEHQQI

CRDEVISALEGRDTME

WEDLSKIPYTTMCIKESLRLYPPVPGMSRKLTKPMTFFD

GRTVPKGCLVGTSIFGIHRNATVWENPNAFDPLRFLPKNSAKRSPHAFVPFSAGPRNCIG

QNFAMNELKVVVAQTLKRYQLTEDPMKKPKMI

PRLVLRSLNGIHVKIKPVDLVP*

 

>CYP4T paralog 95% to other salmon seq, found by Brandon Hale, Julie Philippart

EG839199.1, DW548509.1, CX352982.1,

Similar paralog seq 95%

EG839200 = opp. end of clone DY705018.1, CA055070

51   MELFETLKKVTLDSYRIRHLVAIFSLVYVVLKISKLIVKRNEWIRALETFPGPPKHWLFG  230

231  HVREFKEDGTDMYKVVKWGESYPPAFQMWFGPFVSFLNIHHPDYVKTILASTEPKDDLLY  410

411  RFLIPWIGDGLLVSEGLKWFRHRRLLTPGFHYDVLKPYVKLMADSAKTMLDKWETHSKFD  590

591  ESFELFEHVSLMTLDSIMKCAFSSNTNCQTVQGGESGTNSYIKAVYELSDLVNVRFRIFP  770

771  YHSEWIFQLSPHGYKYRKACNVAHSH  848

TEEIIRKRKEALKDEKELGRIQAKRNLDFLDILLCARDEDQQGLSDEAIRAEVD

TFMFEGHDTTASGISWTLYSLACNLEHQQICRDEVISALEGRD

TMEWEDLSKIPYTTMCIKESLRLYPPVPGMSRKITKPITFFDGRTVPEGCLVGTSIF

GIHRNATVWENPNAFDPLRFLPENSAKRSPHAFVPFSAGPRNCIGQNFAMNEMKVVVAQT

LKRYQLTEDPMKKPKMIPRLVLRSLNGIHLKIKPVNLEP*

 

>CYP4V.a

BQ036296 N-term 70% to CYP4V5 found by Akshata R. Udyavar

DY733547 C-term 72% to 4V5

CB506320 C-term

HPLKKYFQDWNELRPIPGVDGAYPIIGNALLFSTNAGDFFNQIIEGTKEFRHLPLLKVWV

GPLPLVVLFHAETVEGILHSSKHIDKAYFYRFMQPWLGTGLLTSTGDKWRGRRKMLTPTF

HFSILAEFLEVMNEQSEVLTQKLEKQAGGDPFNCFSYITLCALDIICETAMGKNIYAQSN

SESE

(33 aa gap)

KDHDNRLRILHSFTQSVIKERAESMENAGSDSESDHGIKSRRLAFLDMLLKATDEEGNYL

SHSDIQEEVDTFMFEGHDTTAASMNWTLHLLGSYPEVQTKVQEELQVVFGSSNRSVTVDD

LKRLRYLECVIKETLRLFPAVPMFARTVSDDCHINGFKIPKGVNALIIPFALHRDPRYFP

DPEEFRPERFLIENSTGRHPYAYIPFSAGPRNCIGQRFAMMEEKVVLSSVLRHFSVRACQ

SREELRPLGDLILRPEKGIWITLEKRQC*

 

>CYP4V.b CK896567.1 84% to 4V5 found by Ryoko Tsukahara

75% to 4V.a seq above

FLDMLLKTTDEEGNKLTHQDIQEEVDTFMFRGHDTTAAAMNWAIHLLGSHPEVQRKVQQE

LQEVFGVSDRPINTEDLKKLRYLECVIKESLRLFPSVPFFARSICEDCHINGFKVPKGAN

AIIMPYSLHRDPRHFPQPEEFRPERFMPENCVGRHPYAFIPFSAGLRNCIGQRFAVMEEK

VILASILRYFNVEACQK

 

>CYP5A1 DW567936 65% to 5A1 fugu even with gaps

DY702743.1, AJ425528 found by Yanhua Qu

MNILKMPVPSGVSVSVGLFMIFLALLYWYATFPYSALARCGIRHPKPSPFFGNMFLFRQG

FFGVHTDLIHKYGRVCGYYLGRRPVVVVADPDMLRQIMVKDFSTFPNRMTIRSATKPMSD

CLLMLRNEHWKRVRSILTPSFSAAKMKEMGPLINMATDTLLTNLLGHVESAESFDIHRCF

GCFTMDVIASVAFGTQVDSQKDPDDPFVHHAQKFFSFSFFRPIMFVFIAFPFLAPLARVIPF

(20 aa gap)

RDEQPVEERRRDFLQLMLDTRSTKECVPLEHFDVVNHADELAHTHDSGEQENGGAGSHES

PNRRSVQTQKRMMSEDEIVGQAFVFFLAGYETSSNTLAFTCYLLALHPECQSKLQAEVDD

FFTRYDSPDYTNVQDLKYLDMVISEALRLY

(16 aa gap)

NGQFLPKGATLEIPAGYLHYDPEYWPEPEKFIPDRFTAEAKASRHPFVYLPFGA

RTCVGMMLAQLKIKMALVHVFR

(missing 42 aa at end)

 

>CYP11A CA063876 86% to 11A fugu

GDMLQMLKMIPLVKGALKETLRLHPVAVSLQRYITEDIVIQNYHIPCGTLVQLGLYAMGR

DPDVFPRPEKYLPSRWLRTENQYFRSLGFGFGPRQCLGRRIAETEMQLFLIHMLENFRVD

KQRQVEVHSTFELILLPEKPILLTLKPLKSSQ*

 

>CYP11B1 DQ352841    

Salmo salar testicular cytochrome P45011beta mRNA, complete cds

Duplicated seq 75% to 11B1 fugu

PAVRRFLPLLDEVARDFCRLLVTRVEKEGGEEERGHSLTFDPSP

DLFRFALEASCHVLYGERIGLFSTSPSQESQKFIFAVERMLATTPPLLYLPPRLLWRL

GAPLWTQHATAWDHIFSHAEKRIQRGVQRLRSTQAAGGGSGGTEGEFTGILGQLMDKG

QLSLELIRANITELMAGGVDTTAVPLQFALYELGRNPLVPLQFALYELGRNPAVQEQV

RGQVRAAWARAGGDAHKALQGAPLLKGLVKETLRLYPVGITVQRYPVRDIIIQNYHIP

AGTCVQACLYPL

 

>CYP7A CA042205.1 79% to CYP7A1 fugu aa 128-310

found by Mekel Richardson

QTFLKTLQGEALPSLIETMMENLQSVMLQSDTLSPSKDRWDVDGIFAFCYKVMFESGYLT

LFGKDLGNNKNAARQEAQKALVLNALENFKEFDKIFPALVAGLPIHVFKSAHSARENLAK

TMLAENLSKRQNISDLISLRMLLNDTLSTFNDLSKARTHVALLWASQANTLPATFWSLLY

MIR

 

>CYP7A DY692254 83% to 7A1 fugu C-term found by Mekel Richardson

TRYRIRKDDVIALYPQMLHFDPKIYEDPLTYKYDRYLDDNGQEKTTFYREGRKLRYYYMP

FGSGVTKCPGRFFAVHEIKQFLALVLSYFNMELLDSAVKVPPLDQSRAGLGILQPTYDVD

FRYKLKTQ*

 

>CYP7C.a DW542316.1 56% to 7C1, 88% to DW550033

DW546163.1, DW574259.1 found by Ryoko Tsukahara, Mekel Richardson

MLEFVLPLFLGFLALYLLSVRFGRTRRDGEPPLINGWIPFLGKAMEFGKNAHGFL

AAHKEKHGDVFTVLIAGKYMTFIMNPLLYPYVIKHQKQLDFHEFSDQVAPLTFGYPPVRS

FFFSGMEEHIQRSFRLLQGDNLNNLTESMMGNLMFVFRQDYLTGESEWRTESVYQLCNSI

MFEATFLTLFGKPAHTSRHSGMVTLREDFVKFDTMFPLLIARIPI

SLLGGTKAIRDKLINYFHPQRMSPWSNTSGF

IKERAALFEQYDSMRDVDKAAHHFATLWASVGNTVPATFWAMYYLVTHPEALAVVREEIHG

VLQVSGIETDHNRDIAFTREQLDSLLYLESSINESLRLSSASMNIRVAQEDFSLRLEGE

RSIGVRKGDIVSLYPQSMHMDPGIYKNPEIYKFDRYIENGKEKTDFYKDGQKLKYYRMSF

GSGSTKCPGRYFAVNEIKQFLSLLLLYFDVEVLEGQEPC

TLDPSRAGLGILLPASDVQIRYRLR*

 

>CYP7C.b DW550033.1 58% to 7C1, 88% to CYP7C.a

opp end = DW550032

DY731800.1

KEKHGDVFTVLIAGKYMTFIMNPLLYPYVIKHGKQLDFHEFSDQVAPVTFGYPPVRSGKF

PGMDEHIQRSFRLLQGDNLDNLTESMMGNLMLVFHKDYLDGESEWRTESMYQFCNSVMFE

ATFLTMYGKPAHTNRHSGMVTLRQDFVKFDNMFPLLIARIPISLLGGTKTIRDKLINYFH

PQRMSTWSNTSGFIKERAALFEQYDCMGDVDKAAHHFAILWASVGNTVPATFWAMYYLLT

HPEALAVVREEIHCVLKVSGLEADHNQDA

TFTREQLDSLLYLESSINESLRLSSASMNIRVAQEDFSLRLEGERSIGVRKGDIISLYPQ

SMHMDPEIYENPEMYKFDRYVEDGKEKTDFFKDGQKLKYYRMPFGSGSTKCPGRYFAVNE

IKQFLSLLLLHFDMEVVEGQEPC

SLDFSRAGLGILLPATEVQIHYRPRQARGEE*

 

>CYP7C DW542061.1 90% to CYP7C.a, only 3 aa diffs to CYP7C.b

EG769244

QLDSLLYLESSINESLRLSSASMNIRVAQEDFSLRLEGERSIGVRKGDIISLYPQSMHMD

PEIYENPEMYKFDRYVEDGKEKTDFFKDGQKLKYYRMPFGSGSTKCPGRYFAVNEIKQFL

SLLLLHFDMEVVEGQEPC

SLDSSRAGLGILLPATDVQIHYRPRQAREEE*
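Statements such as "only 3 aa diffs" can be checked by comparing two fragments position by position. The sketch below handles only ungapped fragments of equal length and counts the stop symbol as a position; the function names are hypothetical, and the percentages quoted in the record headers were presumably taken from BLAST output rather than computed this way. The two strings are the final lines of the CYP7C.b record and of the DW542061.1 fragment above.

```python
def differences(a, b):
    """List (position, residue_a, residue_b) for mismatches; positions are 1-based."""
    if len(a) != len(b):
        raise ValueError("sequences must already be aligned to the same length")
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b), start=1) if x != y]


def percent_identity(a, b):
    """Percentage of positions with identical symbols in two equal-length strings."""
    return 100.0 * (len(a) - len(differences(a, b))) / len(a)


cyp7c_b = "SLDFSRAGLGILLPATEVQIHYRPRQARGEE*"
dw542061 = "SLDSSRAGLGILLPATDVQIHYRPRQAREEE*"

for position, x, y in differences(cyp7c_b, dw542061):
    print(position, x, "->", y)
print(round(percent_identity(cyp7c_b, dw542061), 1), "% identical over this line")
```

For fragments that differ in length or contain internal gaps, a proper pairwise alignment is needed before counting mismatches.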

 

>CYP8A2 DV107567.1 71% to 8A2 fugu, 66% to 8A1 fugu

FERPPGQVKKFYKGGERLKYYTMPWGAGDNMCVGRHFAVSGIKQFVFMVLSRLDLELCDP

TAIVPPVNPSRYGFGMLQPDGDLEVRYRLKTLH*

 

>CYP8A1 EG779368.1 71% to 8A1 fugu 60% to 8A2 fugu

GQEGSVKKDFFKGGRRLKYYTMPWGAGTNGCVGKRFAISSIRQFVYLVLSHLELELCDPE

AQMPEVNTSRYGFGMLQPEGDLAIRYKPRRSR

 

>CYP8B DW582478.1 76% to 8B.a danio

FPQMSSNKHLMGDGLVVMTQAMMSNLQNLMLHSVGTGDNNHMTWNEDGLFSYSYNIVFRA

GYLALFGNETPKSTGSMEKAREVDRVESQKLFLEFRKYDQLFPDLAYGVLLPGEKREAER

LKRLFWNMLSVQKVKTKENISVWVREQQQEREEHGMEDFMQDRYMFLLLWASQGNTGPSA

FWLLLYLMKHPDAMGAVKKEVEEVLKETGQEGRHKGPFIDLTRDMLQKTPILDSAVEETL

RLTTAPVLTRAVLQDMSLKM

 

>CYP19A2 DW178748 N-term 59% to 19A2 brain form

53% to 19A1 ovarian form aa 1-81

MATSIVDDASMSEALLLLLLLSLLLITTWCLTNTSHIPGPFFWAGLGPIL

SYSRFIWSGIGSACNYYNKRYGSMVRVWING

 

>CYP19A1 DQ361037  104 bp    mRNA    linear   VRT 07-FEB-2006

DEFINITION  Salmo salar aromatase alpha (CYP19A1) mRNA, partial cds.

82% to 19A1 ovarian fugu, 67% to 19A2 brain form fugu, aa 397-430

same as AF436885

ALSDDVISGYRVPKGTNIILNMGRMHRSEFFLKP

 

>CYP19A1 AF436885    

Salmo salar putative aromatase CYP19A1 mRNA, partial cds

81% to 19A1 ovarian, 70% to 19A2 brain fugu aa 324-460

VIAAPDTLSISLFFMLLLLKQNPDVELQLLEEIDTAIGERELHN

SDLQNLRVLEGFINESLRFHPVVDFTMRRALSDDVISGYRVPKGTNIILNMGRMHRSE

FFLKPNEFSLDNFEKNIPNRFFQPFGSGPRSCVGK

 

>CYP19A1 AY049958    

Salmo salar cytochrome P450-19 aromatase (CYP19) mRNA, partial cds

83% to 19A1 ovarian fugu, 61% to 19A2 brain aa 343-415

2 aa diffs to AF436885

KQNPDVELQLLQEIDTAIGERELHNSDLQNLRVLESFINESLRF

HPVVDFTMRRALSDDVISGYRVPKGTNII

 

>CYP19A2 DQ361038    

Salmo salar aromatase beta (CYP19B) mRNA, partial cds

87% to 19A2 brain form fugu, 76% to 19A1 ovarian form fugu aa 181-211

DPSGHVDVLNLLRCIVVDISNKLFLRVPLNE

 

>CYP19A1 trout AY427786     

Salmo trutta (brown trout) ovarian cytochrome P-450 aromatase mRNA, partial cds

74% to 19A1 ovary form fugu, 62% to 19A2 brain form

MAAVCLDTVIADLLVSESRNATATRSEGVSLATGSLLLL

LCLVVATWRHTDNNSVPGPFFCLGVGPLLSYLRFIWTGIGTASNYYNSKYGDIVRVWI

NGEETLILSSSSAVHHVLRQGRYTSRFGSKQGLSCIGMDERGIIFNSNMALWKKTRTY

FAKALTGPGLQRTVDVCVSSTQTHLDALQGLDGLMGGQVDVLSLLRCTVVDISNRLFL

GVPLNEKELLQKIQKYFDTWQTVLIKPDIYFKLDWIQQKHRRAAQELQDAIESLVDQK

RRGLQEADKLDHINFTADLIFAQSHGELSAENVRQCVLEMVIAAPDTLSISLFFMLLL

LKQNPDVELQLLEEIDTAIGERELHNSDLQNLRVLESFINESLRFHPVVDFTMRRALS

DDVISGYRVPKGTNIILNMGRMHRSEFFLKPNEFSLDNFEKNIPNRFFQPFGSGPRSC

VGKHIAMVMMKSILVTLLSRYSVCPHEGLTLDRLPQTNNLSQQPVEEKGEPHTMKFLP

RHQARK

 

>CYP20 EG857411.1, DY708511.1, CA063128.1

MLDFAIFAVTFVIFLVGAVLYLYPSSRSASGIPGLNPTEEKDGNLQDIVNRGSLHEFLAS

LHGQFGPVASFWFGGRPVVSLGSVDQLRQHINPNRTTDSFETMLKSLLGYQSGTGGGATE

AVMRKKLYESAVNNTPEKNFPMLLKLVEELVGKWQSFPKDQHTPLCAHLQGLAMKAVTQL

ALGDRFRNDAEVIGFRKNHEAIWSEIGKGYLDGSMEKSSIRK

EHYESALAEMETVLMSVAKDRKGQRSQTAFVDTLLQSNLTERQVMED

SMVFTLAGCVITANLCIWAVHFLSTSEDVQEKLHQELEDVLGSEPVS

LDKIPQLRYFQQVLNETVRTAKLTPIAARLQXNEGKV

DQHIIPKETLVIYALGVVLQDADTWSCPYKFDPDRFTEDSARKSFSLLGFSGNQACPELR

FAYTVATVVLSTVVRQLKLYQVKGQVVEARSELVSTPKDDTWITVSRRS*

 

>CYP24 EG801370.1, EG801369.1 found by Ryoko Tsukahara

MRAQIQKVPQIVELLKKKTVGLQHFK

PTSSVCVLDPKDATLVAPCRHSLPSKTQSLDSIPGPTNWPLFGSLIELLRKGGLQRQHNA

LIDYHKKFGKIFRMKLGSFESVHIGAPCLLEALYRKESAYPQRLEIKPWKAYRDLRDEAY

GLLILEGEDWQRVRSAFQQKLMKPTEVAKLDGKINQVLADFVSRIGQVTDNGQFEDLYFE

LNKWSMETICLVLYDKRFGLLHDNVNEEAMTFITSIKTMMSTFGAMMVTPVELHKN

(about 120 aa gap)

KESMRLSPSVPFTSRTLDKDTVLGDYSIPKGTVLMINSHALGANEEYFDDSSRFKPERWL

KESSTINPFAHVPFGIGKRMCIGRRLAELQLQLALCWVVRDYEIVATDSEPVDTIHSGAL

VPKRELPVAFIRR*

 

>CYP26A1 found by Frank Zhang

DW339542.1, DW340242.1, CK890206.1, CB508870.1, DW340364.1

79% to 26A1 danio, N-term absent in ESTdb

DEQELVEAFEEMIKNLFSLPVDVPFSGLYRGLKARNLIHSKIEENIQKKIDGGEVKH

RDALQQLIEISRNSDEPFSMQAIKESATELLFGGHETTASTATSLVMFLGLNPDAVNK

LRQELHQQEELGVDLLGQNVNIEVLEQLKYTGCVIKETLRINPPVPGGFRVVLKTFQLNG

YQIPKGWTVIYSICDTHDVADVFPNKEDFNPERFMDKSSED

SSRFNYIPFGGGTRMCVGKEFAKVLLKIFLIELTLRCNWTLSNGPPTMKTGPTVYPVDNL

PTVFSSYSLPC*

 

>CYP26C1 N-term EG871857 76% to fugu 26C1

MFLLELSHLSALVT

ALTSVLSALILLAVSRQLWTFRWTITRDTESKLPLPNGSMGWPLVGETFHWLFQGSNFHI

SRREKHGNVFKTHLLGKPVIRVTGAENIRKILLGEHNLVCTQWPQSTRIILGPDTLVNSI

GDLHKKKRKILAKVFSRGALETYLPRLQDVVKSEIAKWCSEPGPVNVYVSAKSLTFRIAV

RVLLGLKMEEKRIVYLSKIFEQLM

 

>CYP27A.a DW563635 opp end = DW563636.1 found by Akshata R. Udyavar

missing 31 aa at N-term

EG862435.1

GRRFKSGASRQEGQKLQTTIETTPEKYKTIDDLHGPSLATTVYWLFVKGYADKSHAMQVE

HKKLYGPIWRSRFGPFDVVNVASPEFISQVIRQEGRYPVRTELPHWKEYRDMRGQAYGLH

VDKGPQWYRIRSVLNPKMLKLAEVSAYAPVIHQVVGDLLQRIETLRLRSQDHTTVPDLAA

ELYKFGFEGISSILFETRLGCLQEEIPKDTLRFIAAANDMLTLSETVLFLPIWTRNVLPY

WKRFIQAWDDLVNVAQRLIDHKLAQMDAQVLAGK

(85 aa gap)

GIKETLRMYPVVPGNGRLTVDNEVVVDNYWFPKKTQFHLCHYAASHDEAEFVDAECFRPE

RWLRGDPESYQHHPYSFIPFGVGV

RACVGKRVAELEMYFALSRLMQHYEIQPEDGAPTTEPKTRTLLIPSKPINLRFLPRD*

 

>CYP27A.b EG833069.1 93% to CYP27A.a

54% to ctg11505 CYP27A3 LIKE FRAGMENT EST = CD283382 danio

MCSPAAVTRTLRLTVHHGQTLSVTLDGFVSGGRRFKSGASRHVGQHFQTTIE

TAPEKYKTIDDLRGPSLATTVYWLFVKGYADKSHAMQVEHKKLYGPIWRSRFGPFDVVNV

ASPELISQVIRQEGRYPVRTELPHWKEYRDMREQAYGLHVDKGLQWYRIRSVLNPKMLKL

AEVSVYAPVIHQVVGDLLQSIERLRLRSQDHTTVPDLTAELYKFGFEGISSILFETRLGC

LEEEIPKDTL

 

>CYP27A.c  75% to CYP27A.d danio C-term

EG856974.1, EG852854.1, CK876953.1, EG883050.1

GIDMKMEAIQKRVDTDQEVEGEYLTYLLSNTKMTNKEVYGSVAELLLAGVDTTSNTMMWA

MHLLSRDPNAQDTLYQEVSHCIPGDKIPSAQDVNRMPYLKAVIKETLRMFPVVPMNARIM

VENDIIIGGHFFTKKTSFTMCHYAISQDEKTFPEPSKFKPERWLRDGRVRPNPFGSIPFG

FGVRGCCVGRRIAELEMYLALSRIIKLFEIRPDPSIGEVKALNRTVLVADRQVN

LHFMERTNKAAL*

 

>CYP27A.d C-term removed probable intron

70% to 27A.b danio CK876940.1 

MLPKWSRNILPFYGRYIAGWEGIFKF ()

SAMKFIEMKMEAIQKRVDTNQEVEGEYLTYLLSNTKMTNKEVYGSVAE

LLLAGVDTTSNTMMWAMYLLSRDSNVQNTLYQEVSRCIPGDNIPSAQDVNCMPYLKAVIK

ETLRMYPVVPLNSRIMA

 

>CYP27B not found in ESTs

 

>CYP27C EG934966 found by Akshata R. Udyavar

DW556281.1 DY729189 79% to 27C1 danio, DW562388.1

RDLRGRSTGLISAEGEDWLKMRSVLRQLIMRPKDVAHFSDDVNAVVGDLVKRVVTLRSQQ

TDGLTVLNINDLFFKYAMEGMAAILYEDRLGCLENEVPQKSQDYIAALHLMFSSFKMTMY

AGAIPKWLRTIIPRPWDEFCSSWDGLFSFSQIHVDKRLGEIQDLVSRGETV

KGGLLTHMLVTREMNLEEIYANMTEMLLAGVDTTSFTLSWACYLLARHPEVQQEIYEEV

KMTLGHGTIPTADDVPRLPLIRGLVKETLRMFPVLPGNGRVTQDELVIGGYSIPKGTQLA

LCHYSTSMDEENFPGADDFRPDRWIRKDATDRVDNFGSIPFGYGIRSCIGRRIAELEMHL

ALTRLIQRFQIGLCPVTEEIKAKTHGLLCPAAPINLQFADRQP*

 

>CYP39A1 CB512602.1, DW563555.1 found by Ryoko Tsukahara

EVVTLVLSLVILAISAHLLFAGNYPNAPPCIKGWIPWFGVAFEFGKSPLTFISQARDKYG

PVFTVVAAGKRLTFVTLHEDFRTFFMSKDVDFEQAVQEPVHNTASISKDSFYKFHPACNT

LIKGRLTPGNVAQLTDHLCEEFNDHLET

LGDQGSGGLSELVRAVMYPAVMSNLLGKYNSPGSPFTMEQFKEKFAIYDEGFEYGSQLPD

MFLREWASSKCWLLSLLGNMVVKAEDDETSSESGNRTLLQHLATLITDKFLPNYGLLMLW

ASLANAIPITFWAVAFILSNPTVYQTAMEQINAALKDQDTRKTKVTAEELQQMPYVKWCI

LEAIRLRAPGAITRRVVRPLRIQNYIIPPGDLLMVSPYWAHRNPHYFPEPEEFKPERWEK

ADLVKNV

 

>complete CYP46 seq probably with polymorphisms

found by Mitzi Dunagan and Brandon Hale

EG798382.1, DW537530.1, DW537529.1, DY711165.1 (3 aa diffs)

EG927763.1, CX355054.1

MTLFHLIASWIGYLLMCLLGLILIAFLIFCLYIKYIHLKYDHIPGPPRDSFLFGHSPTLLREMS

NDRVVHDKFLEWAETYGPVYRINGMHVVVLCVTCPDTTKEVLMSPKYPKDSLVYKHLFN

FGQRFLGSGLVTAQNHDEWYKQRRIMDPAFSSLYLRGQMGVFNERAEKLMDKLADMADS

NTAANMHHMLNCVTLDVIAKVAFGVELDLLNESDSPFPKAIEMCLKG

MVHYLRDSTFQLYPKNKKFINEVKKSCRLLRSTGRQWINE

RKMAVQNGEDVPKDILTQILKTAGKEENIANDDEELMLDNFVTFFIAGQETTANQLAFAV

MELGRLPEILTKVKQEVDDVIGMKQEISFDDLGKMTYLSQVLKETLRLYPTAPGTSRFIP

NDIVINGIPIPAGVTCFFSSYVCGRLDKFFEDPLTFDPERFHPDAAKPYYCYYPFALGPR

SCLGQSFAQMEAKVVMAKLLQRFDFSLLPGQSFDILDTGTLRPKSGVVCNIRHRGQTPAA*

 

>CYP51

DY741343.1, DY731118.1 small 12 aa gap and missing 34 aa at N-term

QKYPPYIPSSIPFLGHAIAFGKSPIEFLENAYEKYGPVVSFTMVGKTFTYLLGSEAATLM

FNSKNEDLNAEDVYSRLTTPVFGKGVAYDVPNHIFLEQKKMFKTGLNIAHFKQHVEIIEE

ETKEYFSRWGDSGEENLFEALSELIILTASACLHGKEIRSMLDEKVAQLYADLDGGFSHA

AWLLPGWLPLPSFRRRDRAHREIKNIFYKVTQKRRSSGEKVDDMLQTLIDATYKDGRPLN

DDEIAGMLIGLLLAGQHTSL

XXXXXXXXXXXX

DKALQDRCYAEQKTACGEDLPPLNFDQLKDLTLLDRCLKETLRLRPPIMTMMRMARSPQT

AAGYTIPAGHQVCVSPTVNHRLQDTWTERMEFRPDRYLNDNPAAVEKFAYVPFGAGRHRC

IGENFAYVQIKTIWSTMLRLFEFDLVDGYFPTINYTTMIHTPHNPVIRYKRRQH*

 

100 ESTs found and identified starting from a CYP2K BLAST search of ESTdb

limited to Salmo

 

AM397486.1 CYP2M1

AM397512.1 CYP2AD.b

AM402774.1 CYP2AD.a

AM402919.1 CYP1A

BG935485.1 CYP2K.a

BI468047.1 CYP2P.a

BQ036463.1 CYP2M1

CA037986.1 CYP2K.b

CA038044.1 CYP2X.d

CA042231.1 CYP2X.a

CA042805.1 CYP2X.d

CA053315.1 CYP2M1

CA061778.1 CYP2K.c

CA767912.1 CYP2X.d

CB504239.1 CYP2X.b

CB504968.1 CYP1A

CB505556.1 CYP1A

CB510051.1 CYP2AE

CK876940.1 CYP27A.d

CK876953.1 CYP27A.c

CK878675.1 CYP1A

CK887325.1 CYP2X.a

CK888131.1 CYP2M1 paralog

CK889250.1 CYP2K.b

CK889346.1 CYP2K.a

CK896372.1 CYP2M1

CX353507.1 CYP1C.c

CX353908.1 CYP2M1

DN047609.1 CYP2K.a

DN162836.1 CYP2K.c

DV106223.1 CYP2K.b

DW472969.1 CYP1C.a

DW532475.1 CYP2X.d

DW532476.1 CYP2X.b

DW537529.1 CYP46

DW537530.1 CYP46

DW539128.1 CYP2M1

DW541384.1 CYP1C.b

DW541770.1 CYP2M1

DW549411.1 CYP2M1

DW562388.1 CYP27C

DW562631.1 CYP2M1

DW567947.1 CYP2K.a

DW576911.1 CYP2U1

DY692329.1 CYP1A

DY700468.1 CYP1B

DY701550.1 CYP2M1

DY701849.1 CYP1C.a

DY705018.1 CYP4T

DY706581.1 CYP2M1

DY711165.1 CYP46

DY717079.1 CYP2Y.b

DY719050.1 CYP1A

DY719520.1 CYP1A

DY721531.1 CYP1C.b

DY725141.1 CYP2M1

DY725142.1 CYP2M1

DY725570.1 CYP2Y.a

DY728977.1 CYP2M1

DY729189.1 CYP27C

DY730660.1 CYP2K.a

DY732623.1 CYP2M2 CYP2M1 ?

DY733547.1 CYP4V.a

DY738492.1 CYP2U1

DY739225.1 CYP2U1

DY741080.1 CYP2K.a

DY741081.1 CYP2K.a

EG355231.2 CYP2M1

EG647534.1 CYP2X.c

EG759216.1 CYP1C.a

EG762705.1 CYP1C.a

EG765700.1 CYP1C.a

EG779526.1 CYP2K.a

EG798382.1 CYP46

EG806773.1 CYP1C.a

EG806893.1 CYP2K.a

EG828494.1 CYP2M2

EG828495.1 CYP2M2

EG830129.1 CYP2M2

EG830130.1 CYP2M2

EG832531.1 CYP2K.c

EG832542.1 CYP2K.c

EG842865.1 CYP2K.c

EG842866.1 CYP2K.c

EG843507.1 CYP2K.b

EG851124.1 CYP2K.a

EG852854.1 CYP27A.c

EG853617.1 CYP2K.a

EG855910.1 CYP1B

EG856153.1 CYP1B

EG856974.1 CYP27A.c

EG883050.1 CYP27A.c

EG883216.1 CYP2K.c

EG883217.1 CYP2K.c

EG890885.1 CYP1C.a

EG891086.1 CYP4T

EG901171.1 CYP2P.a

EG929755.1 CYP2K.a

EG933863.1 CYP1C.a

EG934966.1 CYP27C
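The accession list above is easier to use when grouped by gene. The following is a minimal sketch, assuming each non-blank line has the form "accession gene-name"; the function name `group_by_gene` is hypothetical, and the sample holds only a few lines copied from the list.

```python
from collections import defaultdict


def group_by_gene(lines):
    """Map each gene name to the list of EST accessions assigned to it."""
    genes = defaultdict(list)
    for line in lines:
        parts = line.split(maxsplit=1)
        if len(parts) == 2:  # skip blank or single-token lines
            accession, gene = parts
            genes[gene.strip()].append(accession)
    return dict(genes)


sample = [
    "AM397486.1 CYP2M1",
    "BQ036463.1 CYP2M1",
    "CK888131.1 CYP2M1 paralog",
    "",
    "EG934966.1 CYP27C",
]

for gene, accessions in sorted(group_by_gene(sample).items()):
    print(gene, len(accessions), ", ".join(accessions))
```

Everything after the accession is kept as the gene label, so entries such as "CYP2M1 paralog" form their own group rather than being merged with CYP2M1.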