D. Nelson
Zebrafish Sequences (New)
Zebrafish contig numbers (New)
Note: some non-zebrafish sequences are included for use in comparison tofragments. There are numerous P450s in clusters that still need to be assembled, such as CYP2P, CYP2AD etc.
All families and subfamilies are now represented. This is in progress. To Blast search these zebrafish sequences go to P450 Blast server
zebrafish ESTs (54 ESTs) 31 CONTIGS This is old data
last modified Oct. 14, 1999 and August 29, 2001 Zebrafish has only two complete P450 sequences known CYP19 and CYP26A1. A third has now been assembled from ESTs see below AI544967 43% identical to 2B19 This will form a new subfamily. The ESTs make up partial sequences in 10 different families (CYP1, 2, 3, 4, 8, 17, 19, 26A1, 26B1, 27A, 51). Note there are two different Aromatase genes in zebrafish and also in goldfish. One is called brain aromatase, the other is from ovary. For an abstract on this topic see Tchoudakova A. and Callard GV A CYP51 contig is the first CYP51 from any fish. Began adding more P450s to this file August 27, 2001 This file is in progress. To Blast search these zebrafish sequences go to P450 Blast server >CYP1A AF057713 amino acids 321-514 AI497216 fb63b04.y1 cyp1A 84% to trout 1A1 BG883127 fp22g02.y1 AW421695 fi84c06.y1 BF938760 fm81e04.y1 Lower case = CYP1A3 Oncorhynchus mykiss S69277 40 MALTILPILGPISVSESLVAIITICLVYLLMRLNRTKIPDGLQKLPGPKPLPIIGNVLEI 219 220 GNNPHLSLTAMSKCYGPVFQIQIGMRPVVVLSGNDVIRQALLKQGEEFSGRPALYSTKFI 399 400 SDGKSLAFSTDQVGVWRARRKLALNALPTFSTVQGKSP 513 eyscaleehvckegeylvkqltsvmdvsgsfdpfrhivvsvanvicgmcfgrryshddqe llglvnmsdefgqvvgsgnpadfipilrylpnrtmkrfmdindrfntfvqkivsehyd sydkdnirditdslidhcedrkldenanvqvsdekivgivndlf GAGFDTISTALSWAVVYLVHYPEVQERLQRELDEKIGKDRTPLLSDRANLPLLE SFILEIFRHSSFLPFTIPHCTSKDTSLNGYFIPKDT CVFVNQWQVNHDPELWKDPSSFIPDRFLTADGTELNKLEGEKVLVFGLGKRRCIGESIGR AEVFLFLAILLQRLKFTXMPGEMLDMTPEYGLTMKHKRCLLRVTPQP ggrkseghghiydsqhhyn >CYP1B Danio rerio (zebrafish) No accession number Celine Godard, Maya Said and John Stegeman Submitted to nomenclature committee Feb. 16, 2000 >CYP1B 51-169 213-484 Z35723-a191c08.p1c zfishC-a1626a06.q1c zfishC-a2428c09.p1c zfishC-a2047d05.q1c lower case = Pleuronectes platessa CYP1B AJ249074 mflqdppamdvtlegidpvtlravllacvtllfslhlwrwlggqpsvpgp 241 PGPFAWPLVGNAMQLGQMPHITFSKLAKKYGNVYQIRLGCSDIVVLNGDAAIRKALVQHS 420 421 TEFAGRPNFVSFQMISGGRSLTFTNYSKQWKTHRKVAQSTLRAFSMANSQTRKTFEQHVV 600 cefrellqlfvgkteqqrffqpmtylvvstanimsavcfgkry AYEDEEFLQVVGRNDQFTQTVGAGSIVDVMPWLQY 248 FPNPIRTIFDNFKKLNLEFGQFIRDKVIEHRKTIQSSTTRDMTDALIVALDKLGDKSELT 307 308 GGKDYVSPTMGDIFGASQDTLSTALQWIVLILVKYPEMQLRIQQEVDKVVDRTRLPSIED 367 368 QLQLPYIMAFVYEVMRFTSFVPLTIPHSTVTDTSIMGYTIPKNTVIFINQWSINHDPALW 427 428 SHPETFDPQRFLDQNGALNKDLTSSVLIFSLGKRRCIGEELSKMQLFLFTALIAHQC 484 ispdparppkldytygltlkpcafsiavalrghdmslldeatrssaeevkgepssdsqtkn > zfishC-a2047d05.q1c Length = 1060 24-162 52% to 1B 205 LWVRNLTFKKRLPGPFAWPLVGNAMQLGQMPHITFSKLAKKYGNVYQIRLGCSDIVV 375 376 LNGDAAIRKALVQHSTEFAGRPNFVSFQMISGGRSLTFTNYSKQWKTHRKVAQSTLRAFS 555 556 MANSQTRKTFEQHVV 627 > Z35723-a191c08.p1c Length = 740 224-426 52% to 1B1 zfishC-a2428c09.p1c Length = 751 341-471 68% to 1B1 635 NLFPNPVRSVYQNFXTINKEFFNYVKDKVLQHRDTYDRDVTRDMSDAIIGVIEHGK 468 467 ESTLTKDFVESTVTDLIGAGQDTVSTAMQWMLLLLVKYPSIQSKLQEQIDKVVGRDR 297 296 LPSIEDRCNLAYLDAFIYETMRFTSFVPVTIPHSTTSDVTIEGLHIPKDTVVFINQWSVN 117 116 HDPQKWSDPHIFNPSRFLDENGALNKDLTSSVMIFSTGKRRCIGEQIAKVEVFLFSAIL 64 63 LHQCKCGGSSREE 25 > zfishC-a1626a06.q1c Length = 584 297-448 63% to 1B 528 LTKDFVESTVTDLIGAGQDTVSTAMQWMLLLLVKYPSIQSKLQEQIDKVVGRDRLPSIEDRCNLAYMD 325 324 AFIYETMRFTSFVPVTIPHSTTSDVTIEGLHIPKDTVVFINQWSVNHDPQKWSDPHIFNP 145 144 SRFMDENGALNKDLTSSVMIFSTDKRRCIGEQVFR 40 >CYP2D-like AI497150 fb62g02.y1 C-HELIX REGION Length = 273 123-203 COMPLETE TRANSLATION 53% TO 2D3 FQMIFFVSLGVIMADYGESWREHRRFALTTLRNFGLGKKSMEQRILEEVKHICLLLEESAGKPIDPQHL 207 YHQAAANIIASIIFRSRFDYQD 273 >CYP2 AI658337 fc21h01.y1 N-term Length = 552 1-179 44% to chicken M25469 2H2 complete translation MLAALLLLDLSSVGLSLFLGLIFLVLFEIFRIHSCKGRFPPGPTPLPFVGSIPHFLNNPMGFI KSLSQYGEMTTVYPGRKPAIILNTLQLMKEALVQNGSSFSGRPPVPVFNWVTDGYGIVM ATFGHSWRQQRRFALHTLRNFGLGKKSVEERVTEESGYLISEFLXAEGKPFNPQHAI CYP2D-like AI641273 fc21h01.x1 PERF TO END opposite end of AI658337 Length = 553 61% TO 2D6 no exact match in this P450 set 550 FNPENFLDDKGHFFKPEAFLPFSLGPRACLGETLAKAELFLFVTSLLQRIRFSWPTGE 377 376 KLPDMNGIFGIVRSPKPFNIICHSRGSKH* >CYP2 AI584934 fb93b10.y1 AI878189 fc58e08.y1 AI436897 fb34d01.y1 84% to AI658337 above N-TERM COMPLETE TRANSLATION 44% to 2H2 chicken 40% TO 2C11 MLVGLLKLDLSSVGLSLFLGLFCLALFEICRIRIYKGRYPPGPTPLPFVGTIPHFLKNP MGFIRSLSQYGEMTTVYLGRKPAIILNALQLMKEAFVQNGSSFSGRPPVPVLTWVNQGY GIIMAMDGHSWRQQRRFALHTLRNFGLGKKSVEERVTEESGYLICE >CYP2 AI497370 fb64f09.y1 Length = 544 121-301 complete translation 41% to 2F4 90% to AI497150 (CYP2D ABOVE) GKGVIMADYGECWREHRRFAVTTLRNFGLGKKSMEQRILEEVKHICLLLEESAGKSIDPQ HLYHQAASNIIASVIFGSRFNYKDAYFQTLITSVEDLTKITIGPWAMLYEIAPVLRIFPL PFQKAFQYFEQITKHVLKVVEEHKTSRVAGEPRDLIDCYLEEMENKSDHRTSFDESQMVT >CYP2K7 60% identical to AF043296 Oncorhynchus mykiss cytochrome P450 (CYP2K4) AI722087 fd19b07.y1 amino acids 1-206 zfishC-a843a04.p1c MALVGALLPGLSFTVGMVVAFLLLFLVISYFSSSKDQGKYPPGPKPLPLLGNLHILDLK NTYMSLWKLSKQYGPVYTVHMGPRTVVVLSGYKAVKEALVNLSEEFGERDISPIFQDFNE GYGIVFSNGENWKEMRRFALSNLRDFGMGKKRSEELITEEIKYLKEEIERFGGKPFETKL PLAMAISNVIALIVYSIRFEYNSPKFHRAIVRANENAKLVGSPSVQ missing 226-303 AI722500 fd19b07.x1 amino acids 304-504 opposite end of AI722087 64% to AF043296 63% to AF045052 2K1v2 728 INNLFGAGXDTTVTTLRWGLLLXAKYPEIQAKVHDEIDSVIGERQPVPDDRKNLPYTDA 555 554 VIHEIQRFADILPIGLLRQTSCDVHLNGYLIKKGTSVFPLIASVLRDENEWETPDSFNPK 375 374 HFLNKQGQFVKKDAFMPFGAGRRLCIGESLARMELFLFFTSLLQHFCFTPPPGVSEDELD 195 194 LTPVVGFTLSPMPHKLCAVKRF* 132 >CYP2K7 Danio rerio (zebrafish) public version No accession number Donald R. Buhler EST AI722087 fd19b07.y1, AI722500 fd19b07.x1, BF157099 fl60g01.y1 zfishC-a843a04.p1c Submitted to nomenclature committee 2/10/2001 503 amino acids, 76% to 2K6, 59% to CYP2K4, CYP2K5 MALVGALLPGLSFTVGMVVAFLLLFLVISYFSSSKDQGKYPPGPKPLPLL GNLHILDLKNTYMSLWKLSKQYGPVYTVHMGPRTVVVLSGYKAVKEALVN LSEEFGERDISPIFQDFNEGYGIVFSNGENWKEMRRFALSNLRDFGMGKK RSEELITEEIKYLKEEIERFRGKPFETKLPLAMAISNVIALIVYSIRFEY NSPKFHRAIVRANENAKLVGSPSVQXXXXXXXXXXXXXXXXXXXXXXXXX XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX XXINNLFGAGXDTTVTTLRWGLLLXAKYPEIQAKVHDEIDSVIGERQPVP DDRKNLPYTDAVIHEIQRFADILPIGLLRQTSCDVHLNGYLIKKGTSVFP LIASVLRDENEWETPDSFNPKHFLNKQGQFVKKDAFMPFGAGRRLCIGES LARMELFLFFTSLLQHFCFTPPPGVSEDELDLTPVVGFTLSPMPHKLCAV KRF* >gnl|ti|25473928 zfishC-a843a04.p1c Length = 608 Score = 183 bits (424), Expect = 3e-45 Identities = 57/59 (96%), Positives = 57/59 (96%) Frame = -1 Query: 19 RFRGKPFETKLPLAMAISNVIALIVYSIRFEYNSPKFHRAIVRANENAKLVGSPSVQLY 77 RF GKPFETKLPLAMAISNVIALIVYSIRFEYNSPKFHRAIVRANENAKLVGSPSVQ Y Sbjct: 530 RFPGKPFETKLPLAMAISNVIALIVYSIRFEYNSPKFHRAIVRANENAKLVGSPSVQVY 354 >CYP2K AI882922 fc47c09.y1 65% to 2K1 105-197 C-HELIX REGION EFGERDITPIFQDCNQGQGIVFANGERWRTMRRFALSTLRDXGMGKKLSEEKIVDETTLS AGVFMKFEGQPFDTTQPVNYAVXNIISAIVMGT >CYP2K BI427723 fq91e02.y3 1-161 MAVVESLLHFSSAGTLLGTLLLLLVFYRLSRDSEFQKKRKDPPGPKPIPLLGNLLTLDLSRPFDSLCELS 349 350 KTYGNVYQVFLGPKKVVVLIGHKTVKEALVNFADEFGERDITPIFRILANDHGILFSNGE 529 530 SWKEMRRFAISNLRDFGMGKRRSEEKIIEE 619 >CYP2K BG892136 fq91e02.y1 1-126 51% to 2K 185 MAVVESLLHFSSARTLLGTLLLLLVFYRLSRDSEFQKKRKDPPGPKPIPLLGNLLTLDLSRPFDSLCELSKTYG 361 362 NVYQVFLGPKKVVVLIGHKTVKESLVNSADEFGERDITPIFRILANDHGIPF 517 >CYP2K6 Danio rerio (zebrafish) public version No accession number Wang-Buhler, J.L., Yang, Y.H., Lee, S.J. and Buhler, D.R. Submitted to nomenclature committee 6/16/2000 zfishC-a370c11.q1c zfishC-a1090h03.q1c zfishG-a1670f05.q1c zfishK-a196c11.p1c AW422084 XXLIEAFLLQGSPTGAILGALLLFLVIYLFSSSSSSQDKEKYPPGPKPLPLLGNLHILDLKKTYLSLLELSKKYGPIYTVYLGPKKVVILSGYKIX KEALVNLSEEFGDRDISPIFXXXXRGYGIXXXXGENWREMRRFALSTLRDFGMGRKRSEELIIEEIKYVKEEFXXFGGNPFETKLPLALAISNIIA SIVFSVRFEYSNTKLHRMVGRAYENMKLTGSPSVQIYNMFPWLRPIVANRNQIVKNLRDTFKQNEELINGVMKTLDPFNPRGIVDSFLIRQQKDEE SGKTDSLYNSNNLYCTVNNLFGAGTDTTVTTLRWGLLLMAKYPEIQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQGQFVKKDAFMPFGAGRRVCIGESLARMELFLFFTSLLQYFRFTPPPGVSED DLDLTPVVGFTLNPKPHQLCAVKRS >gnl|ti|25528409 zfishC-a1090h03.q1c Length = 585 Score = 142 bits (328), Expect(2) = 1e-62 Identities = 42/45 (93%), Positives = 43/45 (95%) Frame = +3 Query: 130 GENWREMRRFALSTLRDFGMGRKRSEELIIEEIKYVKEEFEKFGG 174 GENWREMRRFAL TLRDFGMGRKRSEELIIEEIKYVKEEF+ FGG Sbjct: 90 GENWREMRRFALXTLRDFGMGRKRSEELIIEEIKYVKEEFDMFGG 224 Score = 121 bits (279), Expect(2) = 1e-62 Identities = 44/57 (77%), Positives = 44/57 (77%), Gaps = 2/57 (3%) Frame = +1 Query: 174 GNPFETKLPLALAISNIIASIVFSVRFEYSNTKLHRMVGRAYENMKLTGSP--SVQI 228 GNPFETKL LALAI N IV SVRFEY NTKLHRMVGRAYENMKLTG VQI Sbjct: 304 GNPFETKLXLALAIXNXXXXIVXSVRFEYXNTKLHRMVGRAYENMKLTGXXXVXVQI 474 >gi|6950016|gb|AW422084.1|AW422084 fi54e12.y1 Sugano Kawakami zebrafish DRA Danio rerio cDNA clone 2641486 5' similar to SW:CPK1_ONCMY Q92090 CYTOCHROME P450 2K1 ;. Length = 426 Score = 231 bits (589), Expect = 5e-61 Identities = 117/124 (94%), Positives = 120/124 (96%) Frame = +1 Query: 2 ALIEAFLLQGSPTGAILGALLLFLVIYLFSSSSSSQDKEKYPPGPKPLPLLGNLHILDLK 61 +LIEAFLLQGSPTGAILGALLLFLVIYLFSSSSSSQDKEKYPPGPKPLPLLGNLHILDLK Sbjct: 55 SLIEAFLLQGSPTGAILGALLLFLVIYLFSSSSSSQDKEKYPPGPKPLPLLGNLHILDLK 234 Query: 62 KTYLSLLELSKKYGPIYTVYLGPKKVVILSGYKIVKEALVNLSEEFGDRDISPIFMISTR 121 KTYLSLLELSK+YGPIYTVYLGPKKVVILSGYKI+KEALVNLSEEFGDRDISPIF R Sbjct: 235 KTYLSLLELSKRYGPIYTVYLGPKKVVILSGYKIIKEALVNLSEEFGDRDISPIFHDFNR 414 Query: 122 GYGI 125 GYGI Sbjct: 415 GYGI 426 >gnl|ti|25475035 zfishC-a370c11.q1c Length = 1471 Score = 47.3 bits (110), Expect = 5e-04 Identities = 23/38 (60%), Positives = 29/38 (75%) Frame = +3 Query: 437 GKRACPGEALARVELFLFFTSVLQRFTFTGTKPPEEIN 474 G+R C GE+LAR+ELFLFFTS+LQ F FT PP ++ Sbjct: 267 GRRVCIGESLARMELFLFFTSLLQYFRFT---PPPGVS 371 >gnl|ti|25528409 zfishC-a1090h03.q1c Length = 585 Score = 43.4 bits (100), Expect = 0.008 Identities = 20/49 (40%), Positives = 31/49 (62%) Frame = +3 Query: 124 GKRSKDLRRFSLMTLKTFGMGRRSIEERVQEEAKMLVKAFGEYRDSVVN 172 G+ +++RRF+L TL+ FGMGR+ EE + EE K + + F + VN Sbjct: 90 GENWREMRRFALXTLRDFGMGRKRSEELIIEEIKYVKEEFDMFGGKTVN 236 >gnl|ti|15655323 zfishG-a1670f05.q1c CYP2K6 Length = 565 Score = 50.0 bits (117), Expect = 8e-05 Identities = 37/144 (25%), Positives = 61/144 (41%), Gaps = 38/144 (26%) Frame = +3 Query: 222 MYNMFPRIVWCFPGNHHEMFAIVNKAKVYIQEQAEIR--LKTLNISEPQDFIEAFLVKM- 278 +YNMFP W P + + N + Q + I +KTL+ P+ +++FL++ Sbjct: 60 IYNMFP---WLRPIVANRNQIVKNLRDTFKQNEELINGVMKTLDPFNPRGIVDSFLIRQQ 230 Query: 279 -----------------------------------LEEKDDPNTEFNNGNMVMTAWSLFA 303 ++E ++ +N+ N+ T +LF Sbjct: 231 KDEVLFHHFY*LLCTNNHTVFAHLLF*CHRVLAITMQESGKTDSLYNSNNLYCTVNNLFG 410 Query: 304 AGTETTSSTLRQSFLMMIKYPHIQ 327 AGT+TT +TLR L+M KYP IQ Sbjct: 411 AGTDTTVTTLRWGLLLMAKYPEIQ 482 >CYP2M1 Onchorhynchus mykiss (rainbow trout) GenEMBL U16657 Yang,Y.H., Wang,J.L. and Buhler,D.R. cDNA cloning and characterization of a novel cytochrome P450 from rainbow trout. Abstracts of the VII International Congress of Toxicology, Vol. 7, No. 1, 10-P-2 (1995) MDVLHILQTNFVSIIIGFVVIILLWMNRGKQSNSRLPPGPAPIP LLGNLLRMDVKAPYKLYMELSKKYGSVFTVWLGSKPVVVISGYQAIKDAFVTQGEEFS GRANYPVIMTVSKGYGVLVSSGKRSKDLRRFSLMTLKTFGMGRRSIEERVQEEAKMLV KAFGEYRDSVVNPKELLCNCVGNVICSIVFGHRFENDDPMFQLIQKAVDAYFNVLSSP IGAMYNMFPRIVWCFPGNHHEMFAIVNKAKVYIQEQAEIRLKTLNISEPQDFIEAFLV KMLEEKDDPNTEFNNGNMVMTAWSLFAAGTETTSSTLRQSFLMMIKYPHIQESVQKEI DEVIGSRVPTVDDRVKMPYTDAVIHEVQRYMDLSPTSVPHKVMRDTEFYNYHIPEGTM VLPLLSSVLVDPKLFKNPDEFDPENFLDENGVFKKNDGFFAFGVGKRACPGEALARVE LFLFFTSVLQRFTFTGTKPPEEINIEPACSSFGRLPRSYDCYIKLRTEK >CYP2N1 Fundulus heteroclitus (killifish) AF090434 John Stegeman >CYP2N2 Fundulus heteroclitus (killifish) AF090435 John Stegeman MWFYNLLLSLDVKGLFLFIFLFLLIADFYKSRKPANFPPGPKAL PFVGNFFSLDSKHPHVYFQKLAEIYGNVFSFRLGRDSIVFLNGYKAVREALVTQAENF VDRPFNAITDRFYTEPSAGIFMSNGEKWKKQRRFALSTLRNFGLGKNSLEQSVSEEIQ HLQEEMEIEKGKPFNPSGLFTNAVSNIICQLVMGKRYDYTDHRFQMMLRCMSEAVLLE GNVWGQLYMAFPSVMRYMPGPHNKIFSHFSSVEQFLYEEVEQHKKDLDRDNPRDYIDT FLIEMENHKESDLGFTEANLVYCAIDLFLAGTETTATTLLWALVFLVKYPEVQEKVQA EIDSVIEQARLPSMADRSSMPYTDAVIHEIQRIGNILPLNGMRVAAKDTTLGGYFIPK GTSLMPVLTSVLFDKAEWACPDTFNPGHFLDDNGKFVKRDAFLPFSAGKRACIGESLA KMELFLFLVALLQKFTFSVPEGVELSTEGITGTTRVPHPYKVSAKIR >CYP2N3 Stenotomus chrysops (scup) No accession number Agnes Knorr, Andrew McArthur John Stegeman Submitted to nomenclature committee Nov. 3, 2000 73% to 2N1 >CYP2N4 Chaetodon mertensii (butterfly fish) No accession number Bryan DeBusk Submitted to nomenclature committee July 19, 2001 >CYP2N5 Chaetodon punctatofasciatus (butterfly fish) No accession number Bryan DeBusk Submitted to nomenclature committee July 19, 2001 >CYP2N6 Chaetodon auriga (butterfly fish) No accession number Bryan DeBusk Submitted to nomenclature committee July 19, 2001 >CYP2N7 Chaetodon xanthurus (butterfly fish) No accession number Bryan DeBusk Submitted to nomenclature committee July 19, 2001 >CYP2N8 Chaetodon plebius (butterfly fish) No accession number Bryan DeBusk Submitted to nomenclature committee July 19, 2001 >CYP2P1 Fundulus heteroclitus (killifish) John Stegeman submitted to nomenclature committee >CYP2P2 Fundulus heteroclitus (killifish) GenEMBL AF117342 John Stegeman submitted to nomenclature committee >CYP2P3 Fundulus heteroclitus (killifish) GenEMBL AF117343 John Stegeman submitted to nomenclature committee MEAIRSVLGLEWIDARGVLLFFFVFLLLSDVLRNRKPKNFPPGP LALPFIRDLHRIRPARLHLQLTEFAETYGDIYSLHLFGGRAVIINGYKHVKEALVQKG EDFMDRPNIPLFADFFNNKGLVMSNGYQWKVQRRFALHTLRNFGLGKKAMERYIQQEC QYLNEAFSEQQGKPFNGQALINNAVSNIICCLVFGNRYEYNDKQYQTILQYFNEAVRL QGDLSVQIYNSIPGLMRWLPGSHKKIFMILQKLVDFVEIRIKEHRENLDPSSPRDYID SFLIEMGEKEDKDSGFELSNLCACTLDLFGAGTETTTTTLHWGLLYMIYYPQIQERVQ AEIDAVIGPSRQPSVADRENMPYTDAVIHEIQRMGNIIPLNLPRMANKDTTLDKYSIP KGTIIIPTLHSVLQDKSIWETPQTFNPQHFLDQDGQFRKRDAFMPFSTGKRVCLGEQL ARMELFLFFTSLLQRFTFSAPAGEEPSLEFKLGATRSPKPYRLCATPR >CYP2V1 AB026158 45% to CYP2J2 This sequence has two places where frameshifts are probable. This sequence has been corrected Also found as an EST F4R from Yea-Huey Yang, Jun-Lan Wang-Buhler and Donald R. Buhler Submitted to nomenclature committee 7/1/2000 MALENILLHLNSKVWTDAGTILLLFILFLLVSVKLRNRNKPTKT FLLGPTPLPFIGNVFNLDTSQPHICLTKMSDHYGNIFSLRLGSLNTVVVNTYSMVKKV LNDQGNSFMYRPSNDITERILSKCQGLTFNNGYSWKQHRRFTLSTLKFFGVGKRSLEF IIMEEYKFLHQSILDTNGLPFNPHYIINNGVSNIICSMVFGRRFEYTDQRFLNMLSLI SKALKLQTSVFIQLYAAFPRLMDLLPGPHKELFSCFHQVRAFIKEEVDKHRADWDPSS PRDFIDCYLTEIEKMKDDLEAGFHDEGLQYAVLDLFVAGTETTSTTLLWAFVYMMKYQ KSKKVQAEIDKVVGRYRRPSMDDRPCMPYTDAVIHEIQRMGNVVPLSVPRMTNEDTLL EGYXIPKGTQIIPNLTSVLFDQTKWKTQHSFDPQNFLNAQGKXEKPEAFIPFSLGKR SCPGESLARMELFLFFTSFLQSFSLSAPDETQTSLDFKCGMTLSPKPFKICFTPR >gi|6979902|gb|AF221128.1|AF221128 Danio rerio cytochrome P450 monooxygenase mRNA, partial cds Length = 257 Query: 370 ADILPIGLLRQTSCDVHLNGYLIKKGTSVFPLIASVLRDENEWETPDSFNPKHFLNKQGQ 429 A+I+P+ L +T+ D+ NGY IKKGT+V PL+ SVL+DE+ WE P+SF P+HFL+++GQ Sbjct: 5 ANIVPLSLPHKTTSDITFNGYFIKKGTTVVPLLTSVLKDESAWEKPNSFYPEHFLDEKGQ 184 Query: 430 FVKKDAFMPFGAGRRLCIGESLAR 453 FVK+DAF+PF AGRR+C+GESLAR Sbjct: 185 FVKRDAFIPFSAGRRVCLGESLAR 256 >CYP2V1 BG884010 fp29g09.y1 8-146 93% to AB026158 probably same gene 185 LTSKVWTDAGTILLLFILFLLVSVKLRNRNKPHKNLPPGPTPLPFIGNVFNLDTSQPH 358 359 IDLTKMSDHYGNIFSLRLGSLNTVVVNTYSMVKKVLNDQGNSFMYRPSNDITERISKCQG 538 539 LTFNNGYSWKQHKRXTLSTLK 601 TDAVIHEIQRMGNVVPLSVPRMTNEDTILEGYFIPKGTQIIPNLTSVLFDQTKWKTQHSFDPQNFLNAQGKFEKPEAFIPFSLGKRSCPGESLARM ELFLLL >CYP2X1 Ictalurus punctatus (catfish) GenEMBL AF315346.1 Schlenk,D., Furnes,B. and Zhou,X. Isolation and cloning of a new P450 2 family gene from Ictalurus Punctatus. Unpublished 42% to 2N2 MLGSWLLVVFCVCLLFLFIRIQRPKNFPPGPRPIPIFGNLFQFN IKNPLKDFEKLAEQYGNICSLYIGTKPAVVLNGLKVIREALVTKSADFSGRPQNMLKD VTEGKGIVIADYGRQWKEHRRFALMTLRNFGLGKQSMENRILGEIEHLVAKLEKYAGS SMYPQTLFHDAASNIIYLVLFGTRYDYGDETLKVYVRLYSENAKLANGTWSIIYDALP ILRSLPLPFKKAFDNYSTLKILTANMINKHRTTRVPGKPRDLVDCYLDEIDKKDNEST FSEEQLVVYIMNLHIAGTDTTSNTLLTSILYLMAHPDIQKRCQKEIDVVLEGNSQPSF EDRHNMPYTQAVVHECQRIARTVPLSVFHCTTRDTELMGYTIPKGTMIIPNLSSVLSE EGQWKFPHDFNPSNFLNEQGQFEKPEAFIPFSAGPRVCLGEGLARMELFLFLVTLLRR FQFIWPEDAGEPDFTPIFGLTLTPKPYRMGVKLRQPAREK >CYP2 AI496898 fb63f09.x1 Length = 486 50% to CYP2K1 no exact match in this P450 set 486 GTIIIPYXSSSLREESQWKFPHEFNPQNFLNEKGKFVKNDAFMPFSAGPRVCLGENLARM 307 306 ELFLILVTVLRRFRLVWPKDAGEPDFTYIYGGTQSVKPYRVIVEPR 169 >CYP2 (may be new subfamily) AI964243 EST269357 zebrafish, clone RZBDA33 AI964242 EST269356 zebrafish, clone RZBDA33 47% to 2B P450s 48% to 2J1 amino acids 241-504 LVHTXKIKQNASELLAFIQGEVKEHRKTLDPDSPRDFIDAYLLEIEKQQSNKDSTFHEGNLVISTA DLFLAGTDTTSTTIRWGLLFLIQNPDVQERCHEEIVQVLGYDRLPSMDDRDRLPYTLATV HEIQRCANLVPFGVIHETIQPTKLRGYDIPQGTVVMTNLAAILSDKEHWKHPDTFNPKNF LDENGHFSKPEVFIPFSLGPXFCXGETLAKMELLLFITSLLQRIRXSSPPDAXPINLE CLMGIIRYPXPFSIICCSRDTKE* >CYP2 may be in new subfamily AI657973 fc19c11.y1 AI958603 fc94a10.y1 AI544967 fb69h12.y1 48% to 2J1 78% to AI964243 above 138-494 part of this fragment nearly identical to AI883438 fc64b09.y1 below 43% to 2B19 complete sequence MFGALVKLDLSSVGLTLLLGLILLVLFEIFRIHSYKSRFPPGPTPLPFVGNLPHLLRDPMGFNRLMA QYGEMSTMYLGKKPAIVLNTIQVA KEALVQEAFAGRPCLPAIDWTSNGCGIIMATFNNSWKQQRRFALHTLRN FGLGKKSLESRVLEESQYLIAEMLKDEGRPMNPQHAVQNALSNIICSIVFGDRFDYNNKRFEYLLKILNE SIILTGSAAGQIFNFAPIIKHFPGPHQMINENANEVYSFVRHEVEEHRKTLDPGSPRDFIDGYLLEMEKQKSNKDSTFHEDNLITT TVDLFLAGSDSTSSSIRWGLLFLIQNPDVQERCHEEIVQVLGYDRLPCMDDRDRLPYTLATVHEIQRCGNIAPFGLFHETVQPTKL RGYNIPQGTIIMTNYTAIFSNKEHWKHPDTFNPENFL DENGQFSKPKCFIAFGVGPRICLGDTLAKTALFLFITSLLQRIRFSLPPDAKPMDMDGILSIIRYPELSIICCG* >CYP2 AI723168 fc33e01.x1 39% TO 2D18 no exact match in this P450 set PATFLYRRGPFEKPRAFYPFSPGPWVGLGKSLAPKDLFFIFAPLFGGFPFVWPQKAGKP 50 NFPPVFGVPLPPHP 8 >CYP2 AI883438 fc64b09.y1 n-term 45% to 2C sequences AI444248 fb40e01.y1 1-159 42% TO 2C23 This sequence looks like it could join with AI544967 above MFGALVKLDLSSVGLTLLLGLILLVLFEIFRIHSYKSRFPPGPTPLPFVGNLPHLLRDPMGFN RLMAQYGEMSTMYLGKKPAIVLNTIQVAKEALVQEAFAGRPCLPAIDWTSNGC GIIMATFNNSWKQQRRFALHTLRNXGLGKKSIESRVLEESQYL AI884142 fc74c07.x1 C-term 44% to 2J3 similar to 2J and 2D seqs. AI878677 fc64b09.x1 opposite end of AI883438 AI461305 fb40e01.x1 opposite end of AI444248 TPLVDXITRFAETVPSGVFHEXIWATKLRGFDIPQXTMIMTNLXAISAVK DHXKHPDTLNPENFLDENGHFSKPESYIPFSLRLRACIGESLVRTELFLFA TVLLQRIHFSWPPDAKPIDMDGIVGIVRYPQTFSIICCSRDSKK* >CYP2 AI497151 fb62g03.y1 Zebrafish WashU MPIMG EST Danio rerio cDNA 5' similar to Length = 479 13-160 COMPLETE TRANSLATION 49% TO 2J6 no exact match in this P450 set MAYTAMLETLDVKGILLFMVAFLLVADYLKNKNPPKYPPSPFSVPLLGNIFNVDSKEPHLYLTKLGHA 206 YNNIFSLRLGSDKTVFITGYKMVKEALVTQAENFVDRPNSPVLARVYSGNAGLFFSNGEM 386 WKKQRRFALSTLRNFGLGKKTMELAICEESR 473 >CYP2 AI959373 fd08g05.y1 43% TO 2B10 44% TO J00719 2B1 Length = 538 amino acids N-TERM no exact match in this P450 set MLEVSVLILLCIFFVFFLIRIKRPKNFPPGPPPVPIFGNLLQINMVDPLKEFERLAEKYGNIFSLYTGSKPAVFLNNFEVIKEALV TKAQDFSGRPQDLMISHLTGNKGVVLADYGPLWKDHRRFALMTLRNFGLGKQSMEERILGEISHIVDFLDKNTAKTVDPQIMFHNI ASNVINL > zfishC-a2684d06.q1c Length = 743 4-104 = AI959373 426 MLEVSVLILICIFLVFFLIRIKRPKNFPPGPPPVPIFGNLLQINMVDPLKEFER 271 181 LAEKYGNIFSLYTGSKPAVFLNNFEVIKEALVTKAQDFSGR 59 >CYP2 AI497156 fb62g08.y1 Length = 409 6-148 41% to 2J2 no exact match in this P450 set ILHLIYDSFDFKSWIIFFVVFLIIAEMIKNRTPSNYPPGPWPLPFLGTVFTKMDFKNINKLAKVYGKVFSL 214 RVGSEKMIIVSGYKMVKEALVTQNDSXRLRPPVPLFHKVYKGIGLTMSNGYIWRSHRRFASHLR 394 ASHLR 409 >CYP2 AI545969 fb66e09.y1 Zebrafish WashU MPIMG EST Danio rerio cDNA 5' similar 56% TO 2J3 no exact match in this P450 set I-HELIX TO PKG EDWDPANPRDFIDNYLTEMEKKNSDPQAGFNIESLIISCLDIVEAGTETGATTLRWGLLFMI KFPEIQKKVQAEIDRVIGQSRQPCLDDRVNMPYTEAVLHEIXRFGDVVPLGFPKQAAVDT KIGNYFIPKGTSITTNLSSVLHDPNEWETPD >CYP2 AI545376 fb74e03.y1 N-term complete translation 46% to 2J6 AA542498 fa07c01.r1 1-82 MALENILLHLTSKVWTDAVTILLMVILFLLVSVKLRNRNKPHKNLPPGP TPLPFIGDVFNLDTSQPHIDLT*MSDHYGNIFS >CYP2 AI544925 fb69d04.y1 54% to 2F1 AA545718 fa06e09.r1 57-196 48% to 2G1 RWRRKENGLSLPPVPLALPLIGNLLTLDKSAPFKSFMKWRKTYGSVMTVHLGPQRMVVLVGYETVKEALV DQADDFAPRAPIPFMNRIVKGYGLAISNGEGWRQXXXFTSPHLGDFGVGRNRLEQWIQXEIRYLL*SFEK >CYP2 AI958763 fc96g05.y1 1 diff with 2a5 or 2a12 GKLPPGPTPLPFIGNY >CYP3A different from other 3A below AI332015 fa96d12.y1 zebrafish fin day3 regeneration Danio rerio cDNA 5' Length = 447 58% T0 3A27 1-121 MIGHT BE SAME GENE AS below AI497054 fb59e11.y1 Length = 489 72% TO 3A27 129-282 BG727940 fo79h04.y1 MFDLSSLSVTWTLVVLVITLLLIYGVWPHGFFKKLGIPGPRPLPFVGTALSYSKGICNFDIECSKKYGK VWGIYDGRLPLLLVTDLEMIKTILVKDCYSTFTNRRNMNPDLVGPFADGITLVKDERWR 447 RIRSSLSPSFTCGRLKEMFGIMNKHSHILVDSMGKTAKRGESADIKEFFGAYSMDVVTST 181 AFSVDIDSLNNPKDPFVTNIKKMLKFDLLNPLFLLIAFFPFMAPVLEKMDFALFPTSVTD 361 FFYAALQKIKSDRDTKTLAKDNTKKRVDXLQLMVDSQTGXXXXXXXXXXXX 478 289 LSDLELVAQSIIFIFAGYETTSSVLSFIMYELATHPDVQQKLQEEIDAVLPNKAPPTYDT 348 349 VLQMEYLDMVVNETLRLFPIAMRLERVCKKDVEINGMFIPKGWVVMIPSYALHRDPKYWT 408 409 EPEKFLPERFSKKNKDNIDPYTYTPFGSGPRNCIGMRFALMNMKLALIRVLQNFSFKPCK 468 469 ETQIPLKLSLGGLLQPEKPVVLKVESR 495 combined fragments show 63% identity to 3A27 AI330590 fa96d12.x1 85% to AI722355 Length = 442 449-501 full translation GMRFALMMMKLPVGKLLQKYPVETCKKTQIPGQLNFFFQPKVPLPLKLIPRSPKEKQ* > AW018919 fd61g06.y1 CYP3A 1 YNLATNPETMKKLQEESDETFPNQAPVDYETLMSYYYLDAALSESLRLYPVAARLERVCK 180 181 KTVEINGLLIPKDLVVMVPTYALHRDPDYWSEPESFKPERFTKGNKESIDPYMYMPFGLG 360 361 PRNCIGMRFAQLTMKLAIVEILQRFDVSACDETQVPLELXFNGLLSPKDPIKLKLQPR 534 >AW076890 fj03b10.y1 CYP3A 238 LSSLSVTWTLVVLVITLLLXYGVWPHGFFKKLGIPGPRPLPFVGTALSYSKGICNFDIE 414 415 CSKKYGKVWGIYDGRLPLLLVTDLEMIKTIXVKDCYSTFTNRRNMNPDXVGPXADGITLV 594 595 KDERWRXNR 621 >BF717373 fd45a03.y1 CYP3A 94 LSSLSVTWTLVVLVITLLLIYGVWPHGFFKKLGIPGPRPLPFVGTALSYSKGICNFDIE 270 271 CSKKYGKVWGIYDGRLPLLLVTDLEMIKTILVKDCYSTFTNRRNMNPDLVGPFADGITLV 450 451 KDERWRRIRSSLSPYFTSGRLKEIFPIAMTHADRFIENMEKKDPNLPLKIKDVVAPYSV 627 628 DVVASSSFQRDFRLINTPDDALATSIKRFL 717 >AW202769 fj22a02.y1 CYP3A 91 MCDLSSLSVTWTLVVLVITLLLIYGVWPHGFFKKLGIPGPRPLPFVGTALSYSKGICNFDIE 267 268 CSKKYGKVWGIYDGRLPLLLVTDLEMIKTILVKDCYSTFTNRRNMNPDLVGPFADGITLV 447 448 KDERWRRIRSSLSPYFTSGRLKEIFPIAMTHADRFIENMEKKDPNLPLKIEDVVAPYSL 624 625 DVVRSSSFSVDYDYIDNPDDTLVTSHKSLYNINP 726 AW232617 fj22a02.x1 LDMAIHESMRVFPAGPRLERVCTKPVEIHGITHTKTTLIGIPLYV ISRDPDLWESPNEFKPERFSPESETEINQCPFMPFELGPRNCIGMRFALMMMKLLVVKLLHKY TVETCKETQIPVQLNFFFQPKVPITLKLIPRSHKEKQ* >CYP3A AI883503 fc65a12.y1 like 3A27 opposite end = AI878727 Length = 541 225-362 PROBABLY HAS RETAINED INTRON zfishC-a1991g07.q1c IYDGRLPILMVTDLEMIKTIMVKECYSTFTNRR TRPAFCPSAANFLAKMGISLFSRSTTDFYYKALRKIKDEHNESN 134 GRVDFLKLMIQNQIPDDQVKDTASEQPVKGLTDHEILSQSFIFILGGYETTSTTLSYLLYN 427 LATNXXXXXKLVEEIDKNFPLDIPITYDALMRMDYLEM 537 The LEM at the end of this fragment may overlap with LDM at the beginning of the next AI878727 fc65a12.x1 AI722355 fc26b04.x1 AI959602 fd11g02.x1 AW232617 fj22a02.x1 LDMAIHESMRVFPAGPRLERVCTKPVEIHGITHTKTTLIGIPLYV ISRDPDLWESPNEFKPERFSPESETEINQCPFMPFELGPRNCIGMRFALMMMKLLVVKLLHKY TVETCKETQIPVQLNFFFQPKVPITLKLIPRSHKEKQ* >CYP3A27 trout U96077 MMSFLPYFSAETWTLLALLITLIVVYGYWPYGVFTKMGIPGPKP LPYFGTMLEYKKGFTNFDTECFQKYGRIWGIYDGRQPVLCIMDKSMIKTVLIKECYNI FTNRRNFHLNGELFDALSVAEDDTWRRIRSVLSPSFTSGRLKEMFGIMKQHSSTLLSG MKKQADKDQTIEVKEFFGPYSMDVVTSTAFSVDIDSLNNPSDPFVSNVKKMLKFDLFN PLFLLVALFPFTGPILEKMKFSFFPTAVTDFFYASLAKIKSGRDTGNSTNRVDFLQLM IDSQKGSDTKTGEEQTKGLTDHEILSQAMIFIFAGYETSSSTMSFLAYNLATNHHVMT KLQEEIDTVFPNKAPIQYEALMQMDYLDCVLNESLRLYPIAPRLERVAKKTVEINGIV IPKDCIVLVPTWTLHRDPEIWSDPEEFKPERFSKENKESIDPYTYMPFGAGPRNCIGM RFALIMIKLAMVEILQSF TFSVCDETEIPLEMDNQGLLMPKRPIKLRLEARRNTPSNT TATTLKSPTT >CYP4T AI959112 fd24f08.y1 no exact matches in this P450 set Length = 569 16-167 complete translation N-term 63% to CYP4T2 MLLYGISPFVLSVNHVFALIFLACLLTVVKLLIVRRKGVKTMERFPGPPAHWLFGHVKEFRQDGHDLEKIVKWMELYQFAFPLWFG PSLAVLNIHHPSYVKTILTTTEPKDDYAYKFFIPWLGDGLLVSTGQKWFRHRRLLTPGFHYDVLKPYVKLISDSTKVMLDKWKFTP DQKSPLSCLS >CYP4T AI558878 fb67e05.y1 no exact matches in this P450 set Length = 414 257-369 complete translation I-K helix 72% to CYP4T2 AF045468 Dicentrarchus labrax FHLSPHGYRFRKAASIAHNHTAEVIRKRKEVLKMEEEQGIVKNRRYLDFLDILLSARYEHQQGLSDEDIRAEVDTFMFEGHDTTAS GISWIFYNLACNPEHQEKCRQEIQQALDGKATLEWEDLNKIPYTTMCIKESL >CYP5A BG728082 BF718045 AW595283 MQLLVDGLKWFGVEASGLSLTVCLFLLSLSLLYWYSISPFSNLERCGIKHPKPLPFIGN 2 LMMFRNGFFKSQADLINKYGRICGYYIGRRSTVIIADPDMLRQVMVKEFNKFPNRMTA 181 182 RGITKPMSDSLIMLKGEQWKRVRSILTPTFSAAKMKEMVPLINTATETLLRNLKSHAES 358 359 ENSFNIHKCFGCFTMDVIASVAFGTQVDSQNNPDDPFVHQASKFFAFSFFRPIMIIFMAF 538 539 PCLLRPLAGLLPNKSKD 589 >BF938740 36% to CYP5A possible pseudogene of CYP5A C*WTCLKWFGVEASGLSLTVCLFLLFLSLLYCYSISPFSNLKQYNIKHPNPLPFIRNLIIFQNHF 264 FXSHTNLINKYGLIYIYYIGPLSTVIIIYPNIL 363 >CYP7A BG884003 fp29f09.y1 33-199 68 *LCGYAVTXRKRKPGEPPLVTGWLPFIGVTKDYVANPLGFLTETQKKHGNVFTCLIAGKYF 247 248 TFVTDPFSFSSVVRQGKNLDFQKFAINFSQRVFGHADFNSPQLSGNYREIHSIFRQT 418 419 LQGPSLQHLTQSMLCNLQTVLERCLPHEHDWLEEGLQSFTNRIMFEAGFLTLFGKE 583 gap of 20 aa >CYP7A BF718169 fd57d07.y1 212-375 24 FDKIFPALIAGLPIHVFKSAYSAREKLAKTMLHENLSRRANVSDLISLRMLLNDTLSTFN 203 204 ELSKARTHVAILWASQA 256 NTLPATFWTLFHMIRCPAAMKAASEEVRQTFESSNQKVDPTNSRLVLTREQLDNMPVLDS 435 436 IIKEAMRLSSASLNVRMAKSDFLLQLDXXXXX 516 >CYP7 BG883457 fp29f09.x1 381-502 534 IKKEDYIALYPQLLHLDPEIYPNPTEFKYNRFLDENDQPKTNFFKSGRRLRNFLMPFGSG 355 354 ASECPGRFFAVYEIKLFLALTLWHYDLQLRDPDVPVVQDSARAGLGIMPPSQDVLLRFR 178 177 KK 172 >CYP7 BF717505 fd45f11.y1 114-208 MDPSQGYTTENLHQTFLKTLQGDALSSLIETMMENLQGTMLQSGMLKATTSEWQSDGIYA FCYKVMFEAGYLTLFXKELDGDQSIARQQAQKALVLXCLDN >CYP7A BI473289 fp42a01.y3 415-500 10 EDGKKKTDFYKSGQKVRYYRMPFGSGATQCPGRFFAMNELKQFVCVTLLMCEMQLLDGQQ 189 190 EASMDNSRAGLGILPPANPIPFKYK 264 >CYP8A like BE605774 341-498 BE605271 DSVLWETLRLTAAALITRDVTQDKKIRLSNGQEYHLRRGDRLCVFTPISPQMD PQIHQQPEMFQFDRFLNADRTEKKDFFKNGARVKYPSVPWGTEDNLCPGRHFAVHAIKEL VFTILTRFDVELCHKNATVPLVDPSRYGFGILQPAGDLEIRYRIR CYP8A like BE558156 169-391 11 YSLLFKTGYLTVFGA*NNDSAPLTQIYEEFRRFDKLLPKLARTTINNEEKQ 163 164 IASAAREKLWKWLTPSGLYRKPREQSWLGSYVKQLQDEGIDAEMQRRAMLLQLWVTQGNA 343 344 RPAAFWVMGYLLTHPEALRAVREEIQGGKHLRLEERQKNTPVFDSVLWET 493 494 LRLTAAALITXDVTHDKKIRLSYGQEYHLRLGDQLCVFLFISP 622 >CYP8A like BE558055 169-383 11 YSLLFKTGYLTVFGAENNDSAALTQIYEEFRRFDKLLPKLARTTINKEEKQ 163 164 IASAAREKLWKWLTPSGLDRKPREQSWLGSYVKQLQDEGIDAEMQRRAMLLQLWVTQGNA 343 344 GPAAFWVMGYLLTHPEALRAVREEIQGGKHLRLEERQKNTPVFDSVLWET 493 494 LRLTAXALITXDVTXDKKIRLSNGQ*YHLQRGDQL 598 >CYP8B AI558624 fb68e08.y1 Zebrafish WashU MPIMG EST Danio rerio cDNA 5' Length = 599 COMPLETE TRANSLATION 57% TO RAT CYP8B I HELIX TO HEME no exact match in this P450 set SQGNTGPSAFWLLLYLMKHPEAMSAVRKEVEEILKEAGQEVKPGGPLIDLSRDMLLKTPILDSAVEETLRLTAAPILTRA VMQDMTISMANGQEYKIREGDRVAVFPYVVHVDPEVHPDPLTFKYDRFLNADGSRKTDFYKGGKKLKYYSMPWGAGTTMC PGRFFATNELKQFVFLMLSYSDCELTIQMNRFQVLISDD CYP11A1 Oncorhynchus mykiss (rainbow trout) GenEMBL S57305 (1789bp) Swiss Q07217 (514 amino acids) PIR S32197 (514 amino acids) Takahashi,M., Tanaka,M., Sakai,N., Adachi,S., Miller,W.L. and Nagahama,Y. Rainbow trout ovarian cholesterol side-chain cleavage cytochrome P450 (P450scc). cDNA cloning and mRNA expression during oogenesis. FEBS Lett. 319, 45-48 (1993) MMVSWSVCRSSLALPACGLPSARHNSSMPVVRQALSPDNSSTVQ NFSEIPGLWRNGLANLYSFWKLDGFRNIHRVMVHNFNTFGPIYREKIGYYDSVNIIKP EMPAILFKAEGHYPKRLTVEAWTSYRDYRNRKYGVLLKNGEDWRSNRVILNREVISPK VLGNFVPLLDEVGQDFVARVHKKIERSGQDKWTTDLSQELFKYALESVGSVLYGERLG LMLDYINPEAQHFIDCISLMFKTTSPMLYIPPAMLRRVGAKIWRDHVEAWDGIFNQAD RCIQNIYRTMRQDTNTHGKYPGVLASLLMLDKLSIEDIKASVTELMAGGVDTTSITLL WTLYELARHPDLQEELRAEVAVARQSTQGDMLQMLKMIPLVKGALKETLRLHPVAVSL QRYITEEIVIQNYHIPCGTLVQLGLYAMGRDPDVFPRPEKYLPSRWLRTENQYFRSLG FGFGPRQCLGRRIAETEMQLFLIHMLENFRVDKQRQVEVHSTFELILLPEKPILLTLK PLKSGQ zfishB-a659e05.p1c zfishG-a1981b11.q1c zfishG-a1983f12.p1c zfishG-a2015f11.p1c zfishG-a2015f11.q1c 88 REKVGFYESVNIIKPEDAAILFKAEGHYPKRLTIDAWTAYMDYRNRKYGVL 138 211 VLYGERLGLLLDNIDPEFQHFIDCVSVMFKTTSPMLYLPPGLLRSIRSNIWKNHVEAWDGIFNQ 275 ADRCIPEYLSNIGKKIQKHGKVPGVLA 301 329 TAITLLWTLYELARNPDLQEEIRAEISAARIASKGDMVQMLKMIPLVKGTLKETLR 384 >CYP17 AI883222 fd17f05.y1 N-terminal of CYP17 61% to Ictalurus punctatus CYP17 MAEALILPWLLCLSLFSAVTLAALYLKQKMNGFVPAGNRSPPSLPSLPIIGSLLSLVSDSPPHIFFQDLQ CYP17 Oryzias latipes (medaka) GenEMBL D87121(2421bp) Kobayashi,D., Matsuyama,M., Tanaka,M., Fukada,S. and Nagahama,Y. Structural analysis of medaka P-450c17 and expression in the ovarian follicle. unpublished (1996) MAWFLCLSVLVVLVLALAALLWRVRTRDRPQEAPSLPYLPVLGS LLSLRSPHPPHVLFKELQQKYGQTYSLKMGSHQVIIVNHHAHAREVLLKRGRTFAGRP RTVTTDVLTRDGKDIAFGDYSATWRFHRKIVHGALCMFGEGSASLQRIICTEAQSLCS TLSEAAATGLALDLSPELTRAVTNVICSLCFNSSYSRGDPEFEAMLRYSQGIVDTVAK DSLVDIFPWLQIFPNKDLRLLKQCVAVRDQLLQKKFEEHKSDYSDHVQRDLLDALLRA KRSAENNNTAAEFSAEAVGLSDDHLLMTVGDIFGAGVETTTTVLKWAITYLIHYPEVQ KQIQEELDRKVGVDRPPQLSDRGSLPFLEATIREVLRIRPVAPLLIPHVALSDTSLGD FTVRKGTRVVINLWSLHHDEKEWTNPDLFNPGRFLSADGSSLTLPSSSYLPFGAGLRV CLGEALAKMELFLFLSWILQRFTLSVPPSQSLPSLEGKFGVVLQPVKYAVKATPRPGC HSGLFPANP CYP17 Oryzias latipes (medaka) GenEMBL D87122(2302bp) Kobayashi,D., Tanaka,M., Fukada,S. and Nagahama,Y. Presence of a Novel Cytochrome P-450c17 Transcripts in Medaka Gonads unpublished (1996) >CYP19a AF183906 AF004521 AF226620 AF183907 (alt splice) ovarian MAGDLLQPCGMKPVRLGEAVVDLLIQRAHNGTERAQDNACGATA TILLLLLCLLLAIRHHRPHKSHIPGPSFFFGLGPIVSYCRFIWSGIGTASNYYNSKYG DIVRVWINGEETLILNRSSAVYHVLRKSLYTSRFGSKLGLQCIGMHEQGIIFNSNVAL WKKVRAFYAKALTGPGLQRTMEICTTSTNSHLDDLSQLTDAQGQLDILNLLRCIVVDV SNRLFLGVPLNEHDLLQKIHKYFDTWQTVLIKPDVYFRLDWLHKKHKRDAQELQDAIT ALIEQKKVQLAHAEKLDHLDFTAELIFAQSHGELSAENVRQCVLEMVIAAPDTLSISL FFMLLLLKQNPDVELKILQEMDSVLAGQSLQHSHLSKLQILESFINESLRFHPVVDFT MRRALDDDVIEGYNVKKGTNIILNVGRMHRSEFFSKPNQFSLDNFQKNVPSRFFQPFG SGPRSCVGKHIAMVMMKSILVALLSRFSVCPMKACTVENIPQTNNLSQQPVEEPSSLS VQLILRNTL >AF120031 68% to AF004521 CYP19 2 different CYP19s in zebrafish amino acids 326-467 APDTLSISLFFMLLLLKQNSAVEEQIVQEIQSQIGSRDVESADL QKLNVLERFIKESLRYHPVVDFIMRQSLEDDYIDGYRVAKGTNLILNIGRMHKTEFFK KPNEFSLENFENTVPSRYFQPFGCGPRACVGKHIAMVMMK >CYP19b AF183908 AF226619 brain MMEHVVKDAVNIGAVVQGTLLLLTGTLMLILLHRIFGVKNWRNQ SALPGPGWWLGLGPVLSYSRFLWMGIGTACNYYNEKYGSIARVWINGEETVILSKSSA VYHVLKSNNYTGRFASAKGLQCIGMFKQGIIFNSNIAKWKKVRTYFTRALTGPGLQKS VEVCVSATNRQLDVLQEFTDASGHVDVLNLLRCIVVDVSNRLFLRIPLNEKELLIKIH RYFSTWQTVLIQPDIFFKLDFVYRKYHLAAKELQDEMGKLVEQKRQAINNTEKLDEMD FATELIFAQNHDELSVDDVRQCVLEMVIAAPDTLSISLFFMLLLLKQNSAVEEQIVQE IQSQIGSRDVESADLQKLNVLERFIKESLRYHPVVDFIMRQSLEDDYIDGYRVAKGTN LILNIGRMHKTEFFKKPNEFSLENFENTVPSRYFQPFGCGPRACVGKHIAMVMTKAIL VTMLSRFTVCPRHGCTISTIRQTNNLSMQPVEEDPDCLAMRFIPRAQNSNGETADNRT SKE >AF248042 49% to CYP2J2 MILHLIYDSFDFKSWIIFFVVFLIIAEMIKNRTPSNYPPGPWPL PFLGTVFTKMDFKNINKLAKVYGKVFSLRVGSEKMIIVSGYKMVKEALVTQNDSFVLR PPVPLFHKVYKGIGLTMSNGYIWRSHRRFAASHLRTFGEGKKNLELGIQQECVYLCDA FKAEKEPFNPIFILHGAVSNTVACLTFGQRFDYNDEWYQEILRLDNQCVQLAGSPRVQ LYNAFPKLLDYLPGPHQKVFSNYKKITQSLKDEIIKHREDWDPANPRDFIDNYLTEME KKKSDPQAGFNIESLIISCLDIVEAGTETGATTLRWGLLFMIKFPEIQKKVQAEIDRV IGQSRQPCLDDRVNMPYTEAVLHEIQRFGDVVPLGFPKQAAVDTKIGNYFIPKGTSIT TNLSSVLHDPNEWETPDTFNPGHFLDKNGQFRKRDAFLPFSAGKRACVGELLARNVLF LFFTSLLQQFTLSKCPGEEPSLEGEIWFTYAPAPFRISVSVR > zfishC-a510d08.q1c Length = 778 222-366 = AF248042 182 LYNAFPKLLDYLPGPHQKVFSNYKKITQSLKDEIIKHREDWDPANPRDFIDNYLTEME QKKSDPQAGFNIESLIISCLDIVEAGTETGATTLRWGLLFMIKFPEIQ 571 EKVXAEIDRVIGQSRQPCLDDRVNMPYTEAALHEIQRLGD 777 >CYP26A1 U68234 AI545454 fb81e05.x1 AI584636 fb81e05.y1 AI626269 fc12e02.y1 AI667038 fc24h03.y1 MGLYTLMVTFLCTIVLPVLLFLAAVKLWEMLMIRRVDPNCRSPL PPGTMGLPFIGETLQLILQRRKFLRMKRQKYGCIYKTHLFGNPTVRVMGADNVRQILL GEHKLVSVQWPASVRTILGSDTLSNVHGVQHKNKKKAIMRAFSRDALEHYIPVIQQEV KSAIQEWLQKDSCVLVYPEMKKLMFRIAMRILLGFEPEQIKTDEQELVEAFEEMIKNL FSLPIDVPFSGLYRGLRARNFIHSKIEENIRKKIQDDDNENEQKYKDALQLLIENSRR SDEPFSLQAMKEAATELLFGGHETTASTATSLVMFLGLNTEVVQKVREEVQEKVEMGM YTPGKGLSMELLDQLKYTGCVIKETLRINPPVPGGFRVALKTFELNGYQIPKGWNVIY SICDTHDVADVFPNKEEFQPERFMSKGLEDGSRFNYIPFGGGSRMCVGKEFAKVLLKI FLVELTQHCNWILSNGPPTMKTGPTIYPVDNLPTKFTSYVRN >CYP26B1 AI721901 fc26h10.y1 lower case = fugu zfishG-a2365c08.p1c zfishG-a2403c08.q1ca MFGHDFCLVSALLSVADAVLPTVLLLAVSRLLWEFRWSITRDKTCKLPLPQGSMGWPLVGETFHWLFQ GAGFHASRRQKYGNVFKTHLLGRPLIRVTGAENVRK VLMGEHSLVTVDCPQSTSTLLGRNSLANSIGDIHRKRRKIFAKVFSHEALESYLPKIQQV IQETLRVWSSNPDPINVYRESQRLSFNMAVRVLLGFRIPEEEMHCLFSTFQEFVENVFSL PIDLPFSGYRKGIRARDSLQKSIEKAIREKPLHTQGKDYTDALDVLL esakengseltmqelk eatxelixaaxattarastslirqllrhppvlerlreelrargllhngclcpegelrldtivslk yldcvikevlrlftpvsgayrtamqtfeld gvqipkgwsvmysirdthdtstvfkdvdvfdp drfsqergedkegrfhylpfgggvrsclgkqlatlflrilaie lastsrfelatrqfprvitvpvvhpvdglkvkfygl dsnqneimakseellgaav >gnl|ti|15963120 zfishG-a606d09.q1c Length = 565 Score = 61.3 bits (146), Expect = 3e-08 Identities = 24/40 (60%), Positives = 32/40 (80%) Frame = +2 Query: 27 QLWSFRWSLTRDRRCELPLPKGSMGWPLVGETFQWLFQGS 66 QL + RW+ TRD+ C+LP+PKGSMG+P++GET W FQ S Sbjct: 26 QLSTLRWTATRDKSCKLPMPKGSMGFPIIGETCHWFFQVS 145 >gnl|ti|25932150 Z35723-a97d11.q1c Length = 728 Score = 136 bits (338), Expect = 9e-31 Identities = 64/121 (52%), Positives = 87/121 (71%), Gaps = 4/121 (3%) Frame = -3 Query: 379 GVQIPKGWSVMYSIRDTHDTSTVFKDVDVFDPDRFSQERGEDKEGRFHYLPFGGGVRSCL 438 G QIPKGWSVMYSIRDTH+ + +++ + FDPDRF R E K RF Y+PFGGGV C+ Sbjct: 606 GYQIPKGWSVMYSIRDTHERAEAYQNPEFFDPDRFCVGREESKSERFSYVPFGGGVXRCI 427 Query: 439 GKQLATLFLRILAIELASTSRFELATRQFPRVITVPVVHPVDGLKVKF----YGLDSNQN 494 G++LA + L+ LA+EL +T+ LAT+ +PR+ TVP+VHPV+GL V F G++ N+ Sbjct: 426 GRELALIVLKTLAVELLATADCTLATQTYPRMQTVPIVHPVNGLHVFFNYRTQGIERNRR 247 Query: 495 E 495 E Sbjct: 246 E 244 >gnl|ti|15814082 zfishK-a167e03.q1c Length = 597 Score = 71.4 bits (172), Expect = 3e-11 Identities = 47/124 (37%), Positives = 57/124 (45%), Gaps = 47/124 (37%) Frame = -3 Query: 302 TSLIRQLLRHPPVLERLREELRARGLLHNG-----------CLCPEGE------------ 338 TSLI QLLRHP V ER R EL++ GL+ +G + EGE Sbjct: 595 TSLIMQLLRHPDVSERARAELKSEGLITDGHGHCRSRCXGNAISEEGEAAEKSTSDRRSA 416 Query: 339 ------------------------LRLDTIVSLKYLDCVIKEVLRLFTPVSGAYRTAMQT 374 L L+ + L YLDCV+KEVLR PVSG YRT +QT Sbjct: 415 INKATYFEAGDKEEGRRSRTHVPYLSLEKLSQLSYLDCVVKEVLRFLPPVSGGYRTVLQT 236 Query: 375 FELD 378 FEL+ Sbjct: 235 FELN 224 >CYP27A 58% to rat CYP27A amino acids 342-537 AI477533 fb58e08.y1 AI477651 fb58e08.x1 TSNTLLWALYLLSKDPAAQETLHQEVTKVLKDDRIPTAEEVNSMPFLKAVIKETLRLYPVVPVNSRLIAES EVIIGEYLFPKKTTFNLFHYAISHDEKVFPEPQKFKPERWLRDGRTRPNPFGSIPFGFGV RACVGRRIAELEMHLALARXXXXXXMRPDPTVGEVKANFRSVLVPNKKVNLHFVERQKTET* >CYP51 81% identical to pig CYP51 AB009988 Sus scrofa amino acids 162-369 lower case = human AI522712 fb61b10.x1 Length = 461 AI496776 fb61b10.y1 162-369 AI545923 fb66a09.y1 Length = 301 AI545486 fb66a09.x1 3'UTR Z35723-a344c03.p1c zfishC-a2025a06.q1c zfishB-a467g01.q1c zfishB-a347f12.q1cz zfishB-a347f12.q1ca GLNIAQFXQHVEIIEEETKDYFRRWGESGERN 199 LFDALSELIILTASRCLHGCEIRSLLDE RVAQLYADLDGGITHAAWLLPGWLPLPSFR 256 RRDRAHLEIKXIFYNVIKKRREDTEKHDDILQTLIDATYKDGR PLSDDEIAGMLIGLLLAGQHTSSTTSAWMGFFLARDRALQERCYSEQKSVCGEELPPLHY DQ 361 LKDLSLLDRCLKE tlrlrppimimmrmartpqtvvgytippghqvcvsptvnqrpkdsw verldfnpdcylqdnpasgekfayvpfga 450 GRHRCIGENFAYVQIKTIWSTLLRMFDFELVDGHFPPVNYTTMIHTPHNPIIRYTRRN 507 >CYP? AI617721 zehn2020.seq.F 34% TO 2J3 MID 30% RANGE WITH MANY FAMILIES, possible pseudogene GIEIKKGTIVAPFAISIQRNPTIYKDPHTYNPERWLTNDNNTK*MDSFSFIPFSAG*RTC 182 IG*HLA*LEARIILNLFVKNFEFSCP 260 Database: zebrafish query = 2M1 1,062,731 sequences; 697,619,668 total letters gnl|ti|25636005 zfishC-a1846d04.p1c 99 2e-29 gnl|ti|25607053 zfishC-a1626a06.q1c 113 5e-24 gnl|ti|15612925 zfishG-a1246a05.q1c 94 5e-24 gnl|ti|15612962 zfishG-a1246e05.q1c 94 5e-24 gnl|ti|25890966 Z35723-a191c08.p1c 108 2e-22 gnl|ti|25385670 zfishC-a34h06.q1c 69 2e-20 gnl|ti|15895455 zfishG-a144a09.p1c 69 2e-20 gnl|ti|25772938 zfishC-a2916d05.p1c 100 4e-20 gnl|ti|25440012 zfishC-a510d08.q1c 62 6e-18 gnl|ti|25804313 zfishI-a238d12.q1c 88 2e-16 gnl|ti|25508682 zfishC-a921f03.q1c 85 2e-15 gnl|ti|25719092 zfishC-a2428c09.p1c 82 2e-14 gnl|ti|25684989 zfishC-a2047d05.q1c 72 2e-11 gnl|ti|25730905 zfishC-a2684d06.q1c 51 3e-11 gnl|ti|15997243 zfishG-a1026b07.q1c 71 5e-11 gnl|ti|25876603 zfishB-a619a12.q1c 69 2e-10 gnl|ti|15851063 zfishK-a820d04.p1c 68 3e-10 gnl|ti|25470781 zfishC-a628g07.p1c 63 8e-09 gnl|ti|25772939 zfishC-a2916d05.q1c 62 1e-08 gnl|ti|25866408 zfishB-a687d05.p1c 48 2e-08 gnl|ti|25650333 zfishC-a1912g05.q1c 62 2e-08 gnl|ti|15604226 zfishG-a1404f04.q1c 61 4e-08 gnl|ti|25406044 zfishC-a25h04.p1c 61 4e-08 gnl|ti|25399272 zfishC-a18b11.q1c 47 5e-08 gnl|ti|25527789 zfishC-a1083h03.p1c 61 5e-08 gnl|ti|25476058 zfishC-a603d07.p1c 59 2e-07 gnl|ti|15638066 zfishG-a1551g08.q1c 57 5e-07 gnl|ti|25421782 zfishC-a250a11.q1c 57 5e-07 gnl|ti|15814754 zfishK-a181a04.p1c 57 6e-07 gnl|ti|15632927 zfishG-a1551g08.p1c 57 6e-07 gnl|ti|25640036 zfishC-a1699d01.q1c 57 6e-07 gnl|ti|15948681 zfishG-a606c02.p1c 57 6e-07 gnl|ti|25586593 zfishC-a1385b03.q1c 57 8e-07 gnl|ti|15700515 zfishG-a1901c03.p1c 56 1e-06 gnl|ti|25933922 Z35723-a339f11.p1c 56 1e-06 gnl|ti|25703415 zfishC-a2172h09.q1c 56 1e-06 gnl|ti|25495275 zfishC-a678c11.p1c 55 2e-06 gnl|ti|15990930 zfishG-a899b03.p1c 45 7e-06 gnl|ti|25516014 zfishC-a953b08.p1c 38 7e-06 gnl|ti|25613235 zfishC-a1646d09.q1c 53 1e-05 gnl|ti|25379567 zfishB-a33e04.q1c 52 2e-05 gnl|ti|25770169 zfishC-a2848h06.p1c 52 2e-05 gnl|ti|25475284 zfishC-a402h10.p1c 52 2e-05 gnl|ti|25544638 zfishC-a1177h12.q1c 52 3e-05 gnl|ti|25598301 zfishC-a1468a05.q1c 52 3e-05 gnl|ti|25745756 zfishC-a2539b10.q1c 52 3e-05 gnl|ti|25811022 zfishB-a46b05.q1c 51 3e-05 gnl|ti|25532179 zfishC-a1101c09.q1c 50 8e-05 gnl|ti|15655323 zfishG-a1670f05.q1c 50 8e-05 gnl|ti|25446857 zfishC-a534g09.p1c 49 2e-04 gnl|ti|25450699 zfishC-a534g09.q1c 49 2e-04 gnl|ti|25961691 Z35723-a848d07.q1c 49 2e-04 gnl|ti|25433798 zfishC-a440h04.p1c 48 4e-04 gnl|ti|15747951 zfishG-a2684h06.p1c 47 5e-04 gnl|ti|15965749 zfishG-a835b06.p1c 47 5e-04 gnl|ti|25475035 zfishC-a370c11.q1c 47 5e-04 gnl|ti|25549031 zfishC-a1146b02.p1c 47 7e-04 gnl|ti|25390674 zfishC-a123d10.q1c 46 9e-04 gnl|ti|15871446 zfishK-a1004a03.p1c 46 0.001 gnl|ti|25724163 zfishC-a2474d08.p1c 46 0.002 gnl|ti|25960774 Z35723-a834b06.q1c 46 0.002 gnl|ti|25763645 zfishC-a2756d10.q1c 46 0.002 gnl|ti|25716769 zfishC-a2378h05.q1c 45 0.003 gnl|ti|25825644 zfishB-a130g09.q1c 45 0.003 gnl|ti|25949565 Z35723-a631b05.p1c 45 0.003 gnl|ti|25564097 zfishC-a463g12.q1c 44 0.006 gnl|ti|15969489 zfishG-a789f12.p1c 44 0.006 gnl|ti|25528409 zfishC-a1090h03.q1c 43 0.008 gnl|ti|25680599 zfishC-a2052h02.p1c 43 0.013 gnl|ti|25947981 Z35723-a384f04.q1c 43 0.013 gnl|ti|15806150 zfishK-a149h03.q1c 42 0.017 gnl|ti|15834587 zfishK-a583c07.p1c 42 0.017 gnl|ti|15887108 zfishG-a34c11.p1c 42 0.022 gnl|ti|25772474 zfishC-a2904a10.p1c 41 0.029 gnl|ti|15760668 zfishG-a2826f10.q1c 41 0.038 gnl|ti|15745699 zfishG-a2632g08.q1c 41 0.038 gnl|ti|25505571 zfishC-a961a08.q1c 41 0.050 gnl|ti|25975946 Z35723-a873a04.p1c 40 0.066 gnl|ti|15960408 zfishG-a661a12.p1c 40 0.066 gnl|ti|15960450 zfishG-a661c12.p1c 40 0.066 gnl|ti|15889700 zfishG-a147a09.q1c 40 0.066 gnl|ti|25873742 zfishB-a496h01.q1c 40 0.066 gnl|ti|15712946 zfishG-a1901c03.q1c 40 0.066 gnl|ti|15785710 zfishI-a76h10.q1c 40 0.086 gnl|ti|25695685 zfishC-a1218e09.p1ca 39 0.11 gnl|ti|25948744 Z35723-a508e08.p1c 39 0.15 gnl|ti|25512419 zfishC-a1007a08.p1c 39 0.15 gnl|ti|16016320 zfishG-a1000a05.p1c 39 0.15 gnl|ti|25665158 zfishC-a1921f08.q1c 38 0.25 gnl|ti|15855129 zfishK-a1019e01.q1c 38 0.33 gnl|ti|25783206 zfishC-a2901c10.p1c 38 0.44 gnl|ti|25464622 zfishC-a643a08.p1c 38 0.44 gnl|ti|15798053 zfishK-a102g06.q1c 37 0.57 gnl|ti|15910554 zfishG-a67c10.q1c 37 0.75 gnl|ti|25653721 zfishC-a1785a09.p1c 36 0.98 gnl|ti|15746184 zfishG-a2651b11.q1c 36 1.3 gnl|ti|15958948 zfishG-a628h11.q1cz 35 2.9 gnl|ti|15688193 zfishG-a1843b03.p1c 35 2.9 gnl|ti|15784815 zfishI-a36g12.q1c 34 3.8 gnl|ti|25932150 Z35723-a97d11.q1c 32 15 > zfishC-a1846d04.p1c Length = 705 56% to 2D6 60% to 2K 68 FHAENLLMTVGNLFAAGTDTTGTTLRWGLMLMAKYPQIQ DRVQEEIDRVIGGRQPVVEDRKKLPYTDAVIHEIQRLANIVPMNLPHVTSCDV 427 428 TFNGYFIXX 448 537 GTTVIPLLTSVLKDESEWEKPNSFYPEHFLDEKGQFVKRDAFMPFSAG 680 > zfishI-a238d12.q1c Length = 603 132-311 68% to 2R1 zfishG-a1246a05.q1c Length = 601 319-434 70% to 2R1 zfishG-a1246e05.q1c Length = 591 284-434 70% to 2R1 67 KLAVNCFRYFGTGQRMFERISEECLYFLDAIDQHQGKPFNPKHLVTNAVSNITNLIIFG 243 244 QRFTYDDGDFQHMIEIFSENVELAASSWAFLYNAFPWMEYLPFGKHQRLFRNANEVYKFL 423 424 LQIIRRFSQGRVPQSPQHYIDAYLDEMEQSTPDKATSFSQDNLIFSVGELIIAGTETTTN 603 CLRWAMLYMALYPRIQ 457 EKVQMEIDSVLNGRQPAFEDRQRMPYVEAVLHEVLRLCNIVPLGIFRATSQ 206 205 DAVVRGYTIPKGTMVITNLYSVHFDEKYWSDPSIFCPERFLDCNGKFIRHEAFLRY 38 > zfishC-a34h06.q1c Length = 742 275-387 52% to 2D6 583 LFYFMQETGEKDSSFNDQNLLITVSNLFAAGTDTTGTTLRWGLMLMAKYPQIQ 287 DRVQEEIDQVLGGREPVAEDRKNLPYTDAVIHETQRLANILPLNLPHKTSCDVTFNGYFI 108 > zfishG-a144a09.p1c Length = 589 275-387 52% to 2D6 580 LFYFMQETGEKDSSFNDQNLLITVSNLFAAGTDTTGTTLRWGLMLMAKYPQIQ 407 284 DRVQEEIDQVLGGREPVAEDRKNLPYTDAVIHETQRLANILPLNLPHKTSCDVTFNGYFI 105 > zfishC-a2916d05.p1c Length = 732 330-438 55% to 2D6 74 MQKEIDCVIGQNRIPTMEDRKSLPFTDAVIHEVQRCLDIAPLNVPHYAL*DITFRGYKIP 253 254 KVTMIIPMLHSVLRDEG 433 434 HWETPWTFNPEHFLDSNGNFKKNPAFMPFSAGK 532 > zfishC-a921f03.q1c Length = 608 328-392 60% to 2C9 249 EQMQREIDRVIGQNRIPTMEDRKSLPFTDAVIHEVQRYMDIVPLSLPHYAMKDITFRGYKIPKVTM 52 > zfishG-a1026b07.q1c Length = 601 278-387 49% to 2J2 121 IITKKNDTEAGFTVGSLEWSMVDLFEGGTETTTNSLRWALLFLIKYPDIQ QKVQAEIDEVIGSRLPSMSDKANMH 480 481 YLNAFIHEVLTRANLVPLNMARVAKKDTTLGGYFI 585 > zfishB-a619a12.q1c Length = 738 328-437 134 DRVQEEIDRVIGGRQPAVEDRKKLPYTYAVIHEIQRFANIVPLNLPHTTSCDITFNGYFI 313 530 GTTVIPLLTSVLKDESEWEKPNSFYPEHFLDEKGQFVKRDAFMPFSAG 673 > zfishK-a820d04.p1c Length = 763 328-387 101 DRVQEEIDQVLGGREPVAEDRKNLPYTDAVIHETQRLVIILPLNLPHKTSCDVTFNGYFI 280 > zfishC-a628g07.p1c Length = 570 305-394 52 GPETTSTTLYWGLLYMMKYPEIQ SKVQQEIDAIVGGSRQPSVSDRDNMPYTNAVIHEIQ 311 312 RMGNIIPINLARTTSEDTQIEKYSIPKVGIVV 407 > zfishC-a2916d05.q1c Length = 727 431-490 518 FYFVSTGKRVCVGQSLARMEIFLFIVSLLQKFSFSSPNGPDSIDPSLELSSFGNMPRLYE 339 > zfishB-a687d05.p1c Length = 760 117-207 42 FGIVFANGERWRTMRRFALSTLRDFGMGKKLSEEKIVDETRYLREVFMKFEGTV 203 320 VSNIISAIVYGKRFEYEDPAFQDMVNKA 403 > zfishC-a1912g05.q1c Length = 728 222-327 526 LYNAFPKLLDYLPGPHQKVFSNYKKITQSLKDEIIKHREDWDPANPRDFIDNYLTEME QKKSDPQAGFNIESLIKNSLDIVEAGTETGATTLRWGLLFMIKFPEIQ 137 > zfishG-a1404f04.q1c Length = 581 381-437 177 EFRGFTIPKGTVIIPNLWSVHRDPTVWENPDDFNPSRFLDDQGKILRKDCFIPFGLG 347 > zfishC-a25h04.p1c Length = 635 304-374 490 AGNDTTGSALRWGLMLMAKYPQIQDRVPE*ID 311 310 RVIGVRHPVV*GRKKLPYADVVIHKIQRLANIVPMNIPH 194 > zfishC-a18b11.q1c Length = 703 118-207 680 GIVFANGERWRTMRRFALSTLRDFGMGKKLSEEKIVDETRYLREVFMKFEGTV 522 405 VSNIISAIVYGKRFEYEDPAFQDMVNKA 322 > zfishC-a1083h03.p1c Length = 705 244-327 692 VENIRTFIRSKVKEHEQRLDFSDPSDFIDCFLIRLTQ EKDKLDTEFHKDNLMATVLNLFVAGTETTSTTLRYALMLLIKHPQIQ 318 > zfishC-a603d07.p1c Length = 580 60-118 96 VFLQFAPHYGSIYGIYIGSKPAVVLTGQKMIKEALITQAAEFAGRPNHMMISHITRSEG 272 > zfishG-a1551g08.q1c Length = 567 318-389 24 LIRARYPEMNIFCVFVFSEKVQSEIDQVIGQTRQPLMDDRTNLPYTYAVIHEIQRFANIV 203 204 TFTPPRVANKDTTVGGQLIPK 266 > zfishC-a250a11.q1c Length = 574 62-118 113 LQLIEKYGNIFSVRIGSDKIVYVSGFKMVKDVLITQGENFTDRPVSPLFDTLYKGRG 283 > zfishK-a181a04.p1c Length = 637 328-394 286 EQCQREIDEVLGARDHVTYEDRNAMHFVQAVIHEGQRVADIAPLSMFHSAKTDTQLRGYSIPKVTFTV 83 > zfishG-a1551g08.p1c Length = 611 375-438 586 KIHKSNTSQSVWLLQGVIVLPMLKPILLDKKEYSTPYDFNPDHFLDQNGKFLKKENFIPFSIGK 395 > zfishC-a1699d01.q1c Length = 707 289-345 462 FHEENLMLTVANLFAAGTDTTGTTLRWGLMLMAKYPHIQ DRVQEEIDRVIGGRQPVV 707 > zfishG-a606c02.p1c Length = 595 233-327 524 FPGPHQKIKQNANELLAFIQGEVKEHKKTLDPDSPRDFIDAYLLE LKQQKSNKDSTFHEGNLAISTADLFLAGTHTTSTTIRWGLLFLTQNPDVQ 138 > zfishC-a1385b03.q1c Length = 713 325-390 469 HNTERCHEEIVQVLGYDRLPCMDDRDRLPYTLATVHEIQRCGNIAPFGLFHETVQPTKLRGYNIPPG 669 > zfishG-a1901c03.p1c Length = 622 325-388 260 HNTERCHKEIVQVLGYDRLPSMEDRDRLPYTLATVHEIQRCANLAPFGLIHETIQPTKLQGYDLP 66 > Z35723-a339f11.p1c Length = 731 325-388 320 HNTERCHKEIVQVLGYDRLPSMEDRDRLPYTLATVHEIQRCANLAPFGLIHETIQPTKLQGYDLP 126 > zfishC-a2172h09.q1c Length = 713 343-438 711 PCMDDRDRLPYTLATVHEIQRCGNIAPFGLFHETVQPTKLRGYNIPQ 571 442 GTIIMTNYTAIFSNKEHWKHPDTFNPENFLDENGQFSKPKCFIAFGVGK 293 > zfishC-a678c11.p1c Length = 594 116-207 503 GSGILFSNGENWKEMRRFALSNLRDFGMGKRGSEEKIIEEIHHLKGEFDKFEGT PFDTTQPVK*DVSNIISSIVYGSRFEYTDPQFTEMVDRA 147 > zfishG-a899b03.p1c Length = 622 222-327 578 IYEAFPAIMKHLPGPHNDIFSNYDLLKSFVHEVIVKHKAKLDPSEPRDYIDTFLIEMKEV 399 398 K*CLNDRRSLFN 363 292 TALRNQTLLHVFLTCLRQGTESTSNTLCWGLIYLIMYPDVQ 170 > zfishC-a953b08.p1c Length = 570 13-93 256 SSVLVLICILLVFLLIRIQRPKNFPPGPSPLPIFGNLLHFNLANPLK 396 471 HFKFAEKYGNIFSLYTGSRPAVFLNSFAVIKEA 569 > zfishC-a1646d09.q1c Length = 573 389-437 318 KGTIIIPYLSSSLREESQWKFPHEFNPQNFLNEKGEFVKNEAFMPFSAG 464 > zfishB-a33e04.q1c Length = 674 222-323 420 IFNLAPFIKHFPGPHQKIKKNSNELYSFIEDEVEEHRKTLDPVSPRDFIDAYLLE LKQQKSNKDSTFQEENLIGSAIDLFFAGTDSTATSIRWGLLFLFRY 34 > zfishC-a2848h06.p1c Length = 703 233-327 510 FPGPHQKIKKNSNELYSFIEDEVEEHRKTVDPVSPRDFIDAYLLE LKQQKSNKDSTFQEENLIGSAIDLLFAGTDSTATSIRWGLLFLIQNPDVQ > zfishC-a402h10.p1c Length = 598 226-327 199 FAPIIKHFPGPHQKIKRNADELSGFFQHEVKEHKKTLDPGSPRDYIDAYLLEM KQQKSNKDSTFHDENLIGSTTDLFVAGSDSTATTFRWG 558 559 LLXLIQNPDVQ 591 > zfishC-a1177h12.q1c Length = 697 338-389 31 LGTRLPSMDDRDRLPYTLATVHEIQRYGNIIPILIHETILPTKLQGYSIPQ 183 > zfishC-a1468a05.q1c Length = 729 63-118 322 QLSKKYGPVFKVHLRRKKVVVLAGYKTVKQALVNQAQEFGERDITPIFHDCYHGQG 489 > zfishC-a2539b10.q1c Length = 715 366-444 13 DYNSSSVPYMICVSFVDGFKVPKGTNIVVITYALHRDPRFFPDPEEFRPERFLPENCV 186 187 GRHPYAYIPFSAGLRNCIGE 246 > zfishB-a46b05.q1c Length = 718 222-322 429 IFNLAPFIKHFPGPHQKIKKNSNELYSFIEDEVEEHRKTLDPVSPRDFIDAYLLE LKQQKSNKDSTFQEENLIGSAIDLFFAGTDSTATSI 73 72 RWGLLFLIR 46 > zfishC-a1101c09.q1c Length = 578 289-327 436 FHEENLMLTVANLFAAGTDTTGTTLRWGLMLMAKYPHIQ 552 > zfishC-a534g09.p1c Length = 709 222-281 217 LYNIFPKVMEILPGRHHTMFGEIDDLKSSIMTIIKEHEENLDPSDPKDFIDCFFIRLKQD 38 > zfishC-a534g09.q1c Length = 581 7-64 299 LSSSLLLVLVLTVLMLIRWRRKENGLSLPPGPLALPLIGNLLTVETSAPFKSFMKV 466 > Z35723-a848d07.q1c Length = 735 18-108 258 FLCVLLLVKHFRDVYSKNMPPGPFPLPFVGNLTNIGFSDPLGSFQRVS 401 589 LVCQIAEKYGDVCTLYLXTKPCILMTGYDTLKEAFVEQADIFTDRPYFP 735 > zfishC-a440h04.p1c Length = 730 226-322 193 FAPIIKHFPGPHQKIKKNADELSGFFQHEVKEHKKTLDPGSPRDYIDAYLLEM KQQKSNKDSTFHDENLIGSTTDLFVAGSDSTATTFRWGLLFLYK 570 > zfishG-a2684h06.p1c Length = 751 116-277 715 GSGISFPKAQSWKDMPRFAISNLADFGMGKRGSEETIIEEIHHLKAEFDKFEGT PFDTTQPVNYAVSNIISSIVYGSRFEYTDPRFTE 356 355 MVDRAINVLMHIFLCMYFRIYNIFPWLGLFLN 179 178 SKRTVVRNMLKNRAEFMKLITGLQETLNIHDRRGFVDSFLIR 53 > zfishG-a835b06.p1c Length = 602 432-471 516 FSFSPGPRACLGETLAKVELFLLVTSLLQRIRFSWPTGEKLPD 388 > zfishC-a1146b02.p1c Length = 703 390-465 79 GTTVIPLLTSVLKDESECGETKQLLPRTLP**EGPVRQERCF FAVWSGRRVCLGESLARMELFLFFASLLQSYRFT 384 > zfishC-a123d10.q1c Length = 570 222-280 344 LYNIFPQVMERFSSRHHAILKDVENIRTFIRNKVKEHEQRLDFSDPSDFIDCFLIRLTQ 168 > zfishK-a1004a03.p1c Length = 536 437-465 228 GRRVCLGESLARMELFLFFTSLLQSYRFT 142 > zfishC-a2474d08.p1c Length = 714 1-65 407 MDLLHIYEWIDIKAVLFFACVFLLLSNYIQNKTPKNFPPGPWPLPIIGNLYHIDFNKIHLEVEKK 213 > Z35723-a834b06.q1c Length = 695 437-476 469 GKRVCLGEQIARIELFLFFVSLFRKFRFSATEGEKLNMD 353 > zfishC-a2756d10.q1c Length = 605 437-476 60 GKRVCLGEQIARIELFLFFVSLFRKFRFSATE-GEKLNMD 176 > zfishC-a2378h05.q1c Length = 724 282-327 242 KSDHRTSFDESQMVTLLFDLFIAGTETTFNTLRTLTLYLMTYTHIQ 379 > zfishB-a130g09.q1c Length = 698 119-160 328 VLATFGHSWRQQRRFALHTLRNFGLGKKSVEERVTEESGYLV 203 > Z35723-a631b05.p1c Length = 728 249-332 47 IFLGDEVEEHRKTLDPGSPRDFIDAYLLEIEKVRSSFNT 163 TFHEGNLLASAGDLFMAGTDTTETTIRWGLLFLIQNPDVQGTKAK 395 > zfishC-a463g12.q1c Length = 605 325-383 144 HITERCHKEIVQVLGYDRSPSMEDRDRLLFTLFFVHEIHLCSNLAPLGLIHETIYPTKLQ 323 > zfishG-a789f12.p1c Length = 593 343-434 470 PSVSDRDNMPYTNSVIHEIQSIGNIGPLNV 381 252 KGTLVIGSLTSVLFDESEWETPHSFNPGHFLDAEGKFRRRDAFLPF 115 > zfishC-a2052h02.p1c Length = 719 119-160 350 VMVTFNHSWRQQRRFALHTLRNFGLGRKSVESRVLEESQYLI 225 > Z35723-a384f04.q1c Length = 685 328-372 210 EQCQREIDEVLGARDHVTYEDRNDMHFVQAVIHEGQRVADIVPLNV 73 > zfishK-a149h03.q1c Length = 578 118-180 134 GIIMATFNNSWKQQRRFALHTLRNFGLGKKSIESRVLEESQYLFAELLKDEGMTCSC 304 > zfishK-a583c07.p1c Length = 581 172-278 50 DPRLLLNNAVSNVICVLVFGNRFDYSDHHFQTLLKHINEAIYLERGICAQVNIFTLFNTCSALFAIV 250 363 LYNMFPWLMQRLPGSHKKVITLWKKVIDFIRQKVNEHKVDHDPLNPRDYIDCFLAEM 533 > zfishG-a34c11.p1c Length = 493 309-364 288 TSNTMLWALYLLSKDPAAQETLHQEVNKVLKGDRIPTAEEVNSMPFLKAVIKETLRW 118 > zfishC-a2904a10.p1c Length = 721 284-327 574 DKEAEFTEENLIHCVLDLFGAGTESTAKTLVWALLYMAKYPDVQ 443 > zfishG-a2826f10.q1c Length = 609 390-417 410 GTVVFPL*SSVLIDPKMWKNSNCIDPEN 327 > zfishG-a2632g08.q1c Length = 570 172-202 81 NPRLLLNNAVSNVICVLVFGNRFEYSDHHFQ 173 > zfishC-a961a08.q1c Length = 576 437-465 88 GPRSCLGETLAKTELFLFITSLLQRIRFS 174 > Z35723-a873a04.p1c Length = 726 6-68 257 ILETLDVKGILLFMVAFLLVADYLKNRNPPKYPPSPFSVPLLGNIFNVDSKEPHLYLTKVSR*Y 66 > zfishG-a147a09.q1c Length = 556 118-170 301 GLIFSGGHMCRQQRRFALATLKYFGVGKKTLENSILQECRSVCESVQSERGTV 459 > zfishG-a1901c03.q1c Length = 625 60-160 57 LFSKQLSQYGEMTTIYLGRKPTIMLNTVQLAKEVLIQDAFAGKPSLPVLDWVSNGLG 227 465 VMVTFNHSWRQQRRFALHTLRNXGLGRKSVESRVLEESQYLI 590 > zfishG-a661a12.p1c Length = 607 171-220 310 MSPHHPIQNAVSNIICSIVFGDRFDYDNKRFEYLLELLNENFVLTGSAAG 161 > zfishG-a661c12.p1c Length = 613 171-220 316 MSPHHPIQNAVSNIICSIVFGDRFDYDNKRFEYLLELLNENFVLTGSAAG 167 > zfishB-a496h01.q1c Length = 748 171-220 565 VNPHHALQNAVSNIFCSIMFGERFDYDNKRLGYLLKILNENMMLTGSAIG 416 > zfishI-a76h10.q1c Length = 772 171-220 334 VDPHHPIQNAVSNIICSIVFGDRFEYNNKRFEYLLKMLNETIMLAGSASG 185 > zfishC-a1218e09.p1ca Length = 710 58-113 269 YNHIFQFAERYGNIFSLRIFGPRIVVLNGYNLVKEVYIKQGDNLADRPVLPLFYEI 102 > Z35723-a508e08.p1c Length = 724 216-325 19 SSLVLQLYEIAPVLRIFPLPFWKAFHYFEKITRHSLKVVEEHKKSFVAGEPKDLIDCYL 195 196 EEMKKVRADQRTTFDEAQMVTLLFDLFLAGTETT 375 376 SNTLRTLTLFDDSYSY 423 > zfishC-a1007a08.p1c Length = 719 9-58 276 TDAGTILLLFILFLLVSKKMRNRNKPHKNLPPGPTPLPFIGNVFNLDTSQPH 121 > zfishG-a1000a05.p1c Length = 567 444-477 43 ESLARMELFLFFTSLLQHFCFTPPPGVSEDELDLTP 150 > zfishK-a1019e01.q1c Length = 609 118-159 309 GLTFNNGYSWKQHRRFTLSTLKFFGVGKRRMEFIIMEEDKFL 184 > zfishC-a2901c10.p1c Length = 723 222-278 397 IFNLAPFIKHFPGPHQKIKKNSNELYSFIEDEVEEHRKTLDPVSPRDFIDAYLLEI 230 > zfishC-a643a08.p1c Length = 721 12-66 272 VSTILAFLLLFLVISYFFSSKDKGKYPPGPKPLPVLGNLHILDLKNTYMSLWKVRK 105 > zfishK-a102g06.q1c Length = 587 163-208 361 FLPFLGAAFDPTILLYNAVSNIICQMVFGQRFDYADHQFKTMLKYI 498 > zfishG-a67c10.q1c Length = 618 437-471 473 GPRICLGDTLAKTALFLFITSLLQRIRFSLPPD 375 > zfishC-a1921f08.q1c Length = 736 437-471 404 GLRACIGESLVRTELFLFATVLLQRIHFSWPPD 306 > zfishC-a1785a09.p1c Length = 700 12-65 395 LTLFLGLIFLVVFEISRIYSYKCRFPPGPTPLPFVGNLPHLLKKPMEFIRSVS 553 > zfishG-a2651b11.q1c Length = 706 119-165 522 VLADYGPLWKDHGRFALMTLRNFGLGKQFMVDRILGEIAHVVGGLGK 662 > zfishG-a628h11.q1cz Length = 608 71-115 405 VFSLDMGGIRTVILNGYDAIKECLYHQSEVFADRPSLPLFQKMTK 271 > zfishG-a1843b03.p1cLength = 594 13-53 170 SSVLVLICILLVFLLIRIQRPKNFPPGPSPLPIFGNLLHFN 48 > zfishI-a36g12.q1c Length = 590 9-59 263 TMLTALVLLCLGAFLLYLQVRIRRPKDFPPGPAPVPFFGNLLQLNRINPIK 111 >Z35723-a97d11.q1c Length = 728 385-451 603 YQIPKGWSVMYSIRDTHERAEAYQNPEFFDPDRFCVGREESKSERFSYVPFGGGVXRCIG 424 423 RELALIVL 400