CYP4C3 complete AL072439 AI457023 AI296945 AI106780 AL051651
MAESILLSKVGQVISGYSPITVFLLGSILIFLVVYNKRRSRLVKYIEKIPGPAAMPFLGNAIE*
MNVDHDELFNRVIGMQKLWGTRIGINRVWQGTAPRVLLFEPETVE*
PILNSQKFVNKSHDYDYLHPWLGEGLLTSTDRKWHSRRK*
ILTPAFHFKILDDFIDVFNEQSAVLARKLAVEVGSEAFNLFPYVTLCTLDIVC*
ETAMGRRIYAQSNSESEYVKAVY
GIGSIVQSRQAKIWLQSD
FIFSLTAEYKLHQSYINTLHGFSNMVIRERKAELAILQENNNNNNNN
APDAYDDVGKKKRLAFLDLLIDASKEGTVLSN*
EDIREEVDTFMFEGHDTTSAAISWTLFLLGCHPEYQERVVEELDSIFGDDKETPATMKNLMDMRYLE
CCIKDSLRLFPSVPMMAKMVGEDVN*
IGGKIVPAGTQAIIMTYALHRNPRVFPKPEQFNPDNFLPENCAGRHPFAYI
PFSAGPRNCIGQKFAILEEKAVISTVLRKYKIEAVDRREDLTLLGELILRPKDGLRVKITPRD*

AA952162 AI388993 AA538617 AA439290 AA940746 AA941522 AI106700 45% to 4c3
RLFPSSPCLAFAHADGVDLNFGGQLVACRLEAI 370

AC007928 Drosophila melanogaster chromosome 3 clone BACR04O02
This seq is improved over the old seq. At KYG region
MLTINLLLAVGALFWIYFLWSRRSKLYFLMLKIPGPIGLPIL
GSSLENIITYKRKLSFRTKYLNK*YGSTILTWMGPVPFIVTRDPKVVEDIF
SSPDCHNKSQHIVNAITSCMGNGLLGKIDPHWLDRRKHFNPSFKQDLLLS
FFHIFDAETKVLMNLLDTYVDKGEIDVVPEMLRWSFKIAA*
ETTMGSEVKHDEHFKNGSLV
NRKRNFKQTETNFIYLKCDNPIHFYRLISHSTLNILMPLVQNRMIS
KICGYDKLRADNFSRIQKMLDNV
VNPLPKTDSDPESNIVINRAMELYRKGDITYM
DVKSECCIMIAAGYDTSALTVYHALFLLANHPEHQEAVFEELNGVFPDAGHFGITYPDMQ
KLDYLERVIKETLRLIPAIPITARETKNDVRLSNGVLIPKGVVIGIDMFHTHRNPEVWGP
DADNFNPDNFLAENMEQKHPYAYIPFARGKRNCIG
SKYAMMSSKFALCRILRNYKISTSTLYKDLVYVDNMTMKLAEYPRLKLQRRG*

AC007724 Drosophila melanogaster chromosome 3 clone BACR30N15 C-helix
PMGFPFIGLNVFTQVRYACVPFYLKYME
KR*YGKTVLTWIGLTPVLVTCEPKILEDIFTSPNCSNRSSVVDKAISSCLGLGLL
*VSIDNHWNERRKLLLPSFKNNAVLSFVPVLN
NEANFLVTLLAEFVDGGDINLLPELNKWSFKIAARK*

AC017371
Query: 1    PMGFPFIGL----------------------NVFTQVRYACVPFYLKYMEKRY 31
            PMGFPFIGL                      NVFTQVRYACVPFYLKYMEKRY
Sbjct: 6454 PMGFPFIGLAFEYIRLKRK*LYFRI*LGKTKNVFTQVRYACVPFYLKYMEKRY 6612

AL076582 AL074717 AL072640 N-term to C-helix also AC007724 71656-71049
MDTFQLLLAVGVCFWIYFLWSRRRLYMMHFKIPGPMGLPILGIAFEYLITYKR*
YGSTCLVWVGPTPFVITRDPKIAEEIFLSPECLNRSSIFSKPVNSCTGDGLLSLEGN
KWVDRRKNLNPAFKQNVLLSFLPIFNSEAKTLVAFLDSLVGQGEKKVRDDIVRWSFRIAT*
ETTVGTDVKKDASFKNDSVL

Cyp4d1 AF016992 see also AF016993 to AF017004,
MWLLLSLVLLLAIIALEMRRFLRNMRTIPGPLPLPLLGN
AHIFLGLTPAEACLKIGELAERHGDTFGLFLGPSYSVMLFNPRDVERVLG
SSQLLTKSQEYSFLGRWLNEGLLVSNGRKWHRRRKIITPAFHFRILEPYV
EIFDRQSLRLVEELALRISRGQERINLGEAIHLCALDAICETA*MGVSIN
AQSNADSEYVQAVKTISMVLHKRMFNILYRFDLTYMLTPLARAEKKALNV
LHQFTEKIIVQRREELIREGSSQESSNDDADVGAKRKMAFLDILLQSTVD
ERPLSNLDIREEVDTFMFEGHDTTSSALMFFFYNIATHPEAQKKCFEEIR
SVVGNDKSTPVSYELLNQLHYVDLCVKETLRMYPSVPLLGRKVLEDCEIN
GKLIPAGTNIGISPLYLGRREELFSEPNSFKPERFDVVTTAEKLNPYAYI
PFSAGPRNCIGQKFAMLEIKAIVANVLRHYEVDFVGDSSEPPVLIAELIL
RTKEPLMFKVRERVY

CLOSEST FIRST 4D1 EXON (NOT USED, SKIPPED IN 6/7 ESTS) AI062205
MFLVIGAILASALFVGLLLYHLKFKRLIDLISYMPGPPVLPLVGHGHHFI
GKPPHEMVKKIFEFMETYSKDQVLKVWLGPELNVLMGNPKDVEVVLGTLR
FNDKAGEYKALEPWLKEGLLVSRGRKWHKRRKIITPAFHFKILDQFVEVF
EKGSRDLLRNMEQDRLKHGESGFSLYDWINLCTMDTIC

MORE DISTANT FIRST CYP4D1 EXON (THIS ONE IS USED) 6 ESTS MATCH THIS ONE
MWLLLSLVLLLAIIALEMRRFLRNMRTIPGPLPLPLLGN
AHIFLGLTPAEACLKIGELAERHGDTFGLFLGPSYSVMLFNPRDVERVLG
SSQLLTKSQEYSFLGRWLNEGLLVSNGRKWHRRRKIITPAFHFRILEPYV
EIFDRQSLRLVEELALRISRGQERINLGEAIHLCALDAICETA*

Cyp4d2 Al009194 14092-15781 also X75955, AL023401 Cyp4d2 STS
MLGVVGVLLLVAFATLLLWDFLWRRRGNGILPGPRPLPFLGNLLMYRGLD
PEQIMDFVKKNQRKYGRLYRVWILHQLAVFSTDPRDIEFVLSSQQHITKN
NLYKLLNCWLGDGLLMSTGRKWHGRRKIITPTFHFKILEQFVEIFDQQSA
VMVEQLQSRADGMTPINIFPVICLTALDIIAETAMGTKINAQKNPNLPYV
QAVNDVTNILIKRFIHAWQRVDWIFRLTQPTEAKRQDKAIKVMHDFTENI
IRERRETLVNNSKETTPEEEVNFLGQKRRMALLDVLLQSTIDGAPLSDED
IREEVDTFMFEGHDTTTSAISFCLYEISRHPEVQQRLQQEIRDVLGEDRK
SPVTLRDLGELKFMENVIKESLRLHPPVPMIGRWFAEDVEIRGKHIPAGT
NFTMGIFVLLRDPEYFESPDEFRPERFDADVPQIHPYAYIPFSAGPRNCI
GQKFAMLEMKSTVSKLLRHFELLPLGPEPRHSMNIVLRSANGVHLGLKPRA

AC015418 = AI261013 in ordered fragments 46% to 4D1
ELREKHGPVFRIWFGKDLMVMFTDPEDIKQLLGNNQLLTKSRNYELLEPWLGKGLLTNGGESWHRRRK
LLTPGFHFRILSEFKEPMEENCRILVRRLRTKANGESFDIYPYITLFALDAICETAMGI
KKHAQLQSDSEYVQAV
HSICRVMHKQSFSFWQRLNVFFKHTKPGKEREAALKVLHDETNRVIRLRREQL
IQERNEWKPEAEQDDVGAKRRLAFLDMLLLTQMEGGAELSDTDIREEVDTFMFEGH
DTTSSAIAFALSLLSKNPDVQQRAFEEASELEGREKESMPYLEAVIKETLR
IYPSVPFFSRKVLEDLEVGKLTVPKGASISCLIYMLHRDPKNFPDPERFDPDRFLVNE
KQMHPFAFAAFSAGPRNCIG
QKFAMLELKTSLAMLLRSYRFLPDKDHQPKPLAELVTKSGNGIRLRILPRDENGTTA*

AI402187 GH09810 94% identical to Cyp4d2 7 diffs probably is 4d2
PPVPMIGRWFAEDVEKRGKHIPAGTNLTMGIFVLPRDPEYFESPDEFRPE
RFDADVPQIHPYAYIPFSAGPRNCIGQKFAMLEMKSTVSKLLRQFELLPL
APEPRQLMNIVLRSANGVHLGLKPRA

Cyp4d8
MLLFLLVVLLFGAGWIIHLGQADRRRKVGNLPGPICPPLIGAMQLMLRLNPK
TFIKVGREYVLKFGHLQRVWIFNRLLIMSGDAELNEQLLSSQEHLVKHPVYKVLGQWLGN
GLLLKDGKVWHQRRKIITPTFHFSILEQFVEVFDQQSNICVQRLAQKANGNTFDV
YRSICAAALDIIAETAMGTKIYAQANESTPYAEAVNE*
CTALLSWRFMSVYLQVELLFTLTHPHLKW
RQTQLIRTMQEFTIKVIEKRRQALEDQQSKLMDTADEDVGSKR
RMALLDVLLMSTVDGRPLTNDEIREAVDTFM
FEGHDTTTSALSFCLHELSRHPEVQAKMLEEIVQVLGTTRSRP
VSIRDLGELKYMECVIKESLRMYPPVPIVGRKLQTDFKYS*
DGVIPAGSEIIIGIFGVHRQPETFPNPDEFIPERHE
NGSRVAPFKMIPFSAGPRNCIGQKFAQLEMKMMLAKIVREYELLPMGQRVECIVNIVLR
SETGFQLGMRKRKHN*

AC004717 DS01529 also AC005834 comp(100737-101282) P1s DS01589 DS02501
MWILLGIAVLIMTLVWDNSRKQWRVNTFEKSRILGPFTIPIVGNGLQALT
LRPENFIQRFGDYFN*KYGKTFRLWILGECLIYTKDLKYFESILSSSTLL
KKAHLYRFLRDFLGDGLLLSTGNKWTSRRKVLAPAFHFKCLENFVEIMDR
NSGIMVEKLKNYADGKTCVDLFKFVSLEALDVTT*ETAMGVQVNAQNEPN
FPYTKALK*SVVYIESKRLASVSMRYNWLFPLAAPLVYRRLQKDIAIMQD
FTDKVIRERRAILERARADGTYKPLSTLT*DDIGGKAKMTLLDILLQATI
DNKPLSDVDIREEVDVFIFAGDDTTTSGMSHALHAISRHPKVQECISEHV
VSVLVPDPDASVTQTKLLEVKYLECVIKQTMRLHPPVPILGRYIPEDLII
GEIAIPGNTSI*LMPYYVYRDPEYFPDPLVFKPERWMDMKTTSNTPPLAY
IPFSSGPKNCIGQKFANLQMKALISKVIRHYELLPLGADLKATYTFILSS
STGNNVGLKPRTRVK*

AI106625 AI517120 N-term 65% to AC005834
MWLTLITGALILLLTWDFGRKRQRVLAFEKSAIPGPISIPILGCGLQALHLGAENIIGWV
GEKFDKYGKTFRFWILGESLIYTKDLQYFETILSSTTLLEKGQLYEYLRPFLNDGLLVST
GRKWHARRKIFTHAFHFKVLEHYVEIMDRHSNVMVDNL
RKVADGKTAVDMLKYVSLAALDVIT*
EAAMGVQVNAQNDPDFPYIKALKSVVYIQPDRMFRFSRRYNWLFPLAAPLLHRQLLSDIR
VMHDFTDKVISERRETVRRAKADGTYRPLSGCT
AEIGSKSQMALLDILLQSSINNQPLSDADIREEVDTFMFEGDDTTSSG
VSHALYAIARHPEVQQRIFEELQRVLGPDASASGTQAQLQDLKYLDCVIKETMRLY
PPVPAIGRHAQKELEIGDKTIPANTSIYLVLYYAHRDANYFPDPLSFRPERFLEDQEQGH
NTFAYVPFSAGPKNCIGQKFAVLEMKVLISKVLRFYELLPLGEELKPMLTFILRSASGIN
VGLRPRKALR

AC017388 = AC010113 AC010557

35538 MILTATFICFCLASAFNYFRARRQRSLIKNLKGPFTWPLMGAMHKLLFLTPI
35744 NFFQRSTEYLTKYGTFSRCWVFHRLFIPLADLELSRQLLENDTHLETGYELMKDWLV 35913
35914 GGVLMCQSEQWQKRHSLISGLFDKGNLEQLIDLSRHQTEQLLQKLAKQADQKVFDIWYTV 36093
36094 SPIVLDLMVMTTCGAKPSEEYSKNLKD
      LSEIYRKRFLSLQ 36273
36274 SANRFNYWLSSPFMRKRQNRLIKRLNDEHNNLMAMHQSQNQLKIENGLDIYQLRPIP 36444
36445 LKDHKSLLEILLESKDPQLTGEEICGELNTCNYLGYQLCSPALCFCLVTIARNPSVQQK 36621
36622 CLDELNLAQIKDQGWDLEKLNYLDAVLHETMRLYPPQVIVGRQLKKDFPYN 36774
      AELPCGSEIYINLYELQRNEVRYPKANHFDAQR 36948
36949 FLDSPPELLSYSLGPRCCPARKFSMQLLKTLLAPILANFEVL 37074
      PYGDEVRLDLRLVLGSSNGFQLALKPR*

Cyp4d10 U91634 Drosophila mettleri
MSLSLPPLIAVACLVVALARISWLPLRSWLRRRRRTHQLAAQLPGPRNLP
LLGNFHMFFGLEPWQVPHLINQLAKKYDGTFKLKMGSNFSLMMFQPRDIE
VVLGSSQLLDKAVEYSFLRGWLNDGLLLSGGRKWHRRRKIITPAFHFRIL
ESYDEIFDRQTRLLIHKWQQTLGHSFDLGHDVHLFTLDVICETAMGVSTN
AQTNADSDYVRAVKTISTVLHKRMFNIFYRFDLTYMLTPLAWAERRALNV
LHKFTEKIIVQRREELLRGGVTQTTDGADVGAKSKMVFLDILLQSNIDDK
PLTNLDIREEVDTFMFEGHDTTSSGITFFFYNIALYPECQRKCVEEIVSV
LGKDTETPVTYDLLNNLNYMDLCIKETLRMYPSVPLLGRKVLQECEINGK
IIPAGTNIGISPLFLGRSEDISSEPNTFKPERFDVVTSAEKLNPHAYIPF
SAGPRNCIGQKFAMLEIKAIAANVLRHYEIEFVGNAEESPVLIAELILRT
KDPLMFKLKKRVI

AC010003 chromosome 3L/75B13 clone RPCI98-6L11, = AC015424 AC009369 AC007573
MFWLGFGLLLLALSLYLLYVFERQSRIDRLTHKWPAPPALPFIGHLHILAKLVGPHPLRRATEMINEHLHDHRAKLWMGT
KLYLVDCNPKDIQALCSAQQLLQKTNDYRVFENWLCEGLFTSGFEKWSHRRKIVMPAFNYTMIKQFVAVFEKQSRILLTN
VAKFAESGDQIDFLQLISCFTLDTICETALGVSVGSQSSAKSEYLDAVKS*ILVIIDKRLKNIFYRNSFIFKRTSHYKR
EQELIKTLSGFTEGIIQKRIDEINQDAENRNYQSSDAELDGVKRTLCFLDTLLLSKGPDG
MFLTVKDIREEVDS
FLFGGFDLTATTLNSLMYNMTLHPEHQQRCREEVWSVCGKDKSEPISIEQVRQLEFLEAC
IKETLRMYPSGPLTARKATANCTISKYIIPKGSDVIISPI
YMGRCKDFFPDPMVFKPDRWAIGAEPKLRHHF
FIPFMAGARSCMGQRYAMVMLKMVLAHLLRNFLFEPLXERQVELKLNFVITLHTVDPIYVELK

Cyp4d14 AL009194 152A3.2 comp(10682..12421) 55% identical to Cyp4d2
MYLELFAILLATALAWDYMRKRRHNKMYAEAGIRGPKSYPLVGNAPLLIN
ESPKTIFDMQFRLIAEFGKNIKTQMLGESGFMTADSKMIEAIMSSQQTIQ
KNNLYSLLVNWLGDGLLISQGKKWFRRRKIITPAFHFKILEDFVEVFDQQ
SATMVQKLYDRADGKTVINMFPVACLCAMDIIAETAMGVKINAQLQPQFT
YVQSVTTASAMLAERFMNPLQRLDFTMKLFYPKLLDKLNDAVKNMHDFTN
SVITERRELLQKAIADGGDADAALLNDVGQKRRMALLDVLLKSTIDGAPL
SNDDIREEVDTFMFEGHDTTTSSIAFTCYLLARHPEVQARVFQEVRDVIG
DDKSAPVTMKLLGELKYLECVIKESLRLFPSVPIIGRYISQDTVLDGKLI
PADSNVIILIYHAQRDPDYFPDPEKFIPDRFSMERKGEISPFAYTPFSAG
PRNCIGQKFAMLEMKSTISKMVRHFELLPLGEEVQPVLNVILRSTTGINC
GLKPRVY

AC015418 2797- = AI261013 in ordered fragments 46% to 4D1
MSTLALVAFVLWAAFLRYLPKILNFLRLQRFAKTLPGPTIGELIANVKKGGEFVQRELIK
ELREKHGPVFRIWFGKDLMVMFTDPEDIKQLLGNNQLLTKSRNYELLEPWLGKGLLTNGGESWHRRRK
LLTPGFHFRILSEFKEPMEENCRILVRRLRTKANGESFDIYPYITLFALDAICETAMGI
KKHAQLQSDSEYVQAV
HSICRVMHKQSFSFWQRLNVFFKHTKPGKEREAALKVLHDETNRVIRLRREQL
IQERNEWKPEAEQDDVGAKRRLAFLDMLLLTQMEGGAELSDTDIREEVDTFMFEGH
DTTSSAIAFALSLLSKNPDVQQRAFEEASELEGREKESMPYLEAVIKETLR
IYPSVPFFSRKVLEDLEVGKLTVPKGASISCLIYMLHRDPKNFPDPERFDPDRFLVNE
KQMHPFAFAAFSAGPRNCIG
QKFAMLELKTSLAMLLRSYRFLPDKDHQPKPLAELVTKSGNGIRLRILPRDENGTTA*

Cyp4e1 K00045, AC005451 22741-24838 AI135768 AI293313
MWIVLCAFLALPLFLVTYFELGLLRRKRMLNKFQGPSMLPLVGNAHQMGN
TP*TEILNRFFGWWHEYGKDNFRYWIGYYSNIMVTNPKYMEFILSSQTLI
SKSDVYDLTHPWLGLGLLTSTGSKWHKHRKMITPAFHFNILQDFHEVMNE
NSTKFIDQLKKVADGGNIFDFQEEAHYLTLDVICDTAMGVSINAMENRSS
SVVQAFKD*ITYTFNMRAFSPWKRNKYLFHFAPEYPEYSKTLKTLQDFTN
EIIAKRIEVRKSGLEVGIKADEFSRKKMAFLDTLLSSKVDGRPLTSQELY
EEVSTFMFEGHDTTTSGVGFAVYLLSRHPDEQEKLFNEQCDVMGASGLGR
DATFQEISTMKHLDLFIKEAQRLYPSVPFIGRFTEKDYVIDGDIVPKGTT
LNLGLLMLGYNDRVFKDPHKFQPERFDREKPGPFEYVPFSAGPRNCIGQK
FALLEIKTVVSKIIRNFEVLPALDELYDPILSASMTLKSENGLHLRMKQR
LVCDST*

Cyp4e2 U56957, X86076, U34332 AC005451
MWFVLYIFLALPLLLVAYLELSTFRRRRVLNKFNGPRGLPLMGN
AHQMGKNPSEILDTVFSWWHQYGKDNFVFWIGTYSNVLVTSSKYLEF
ILSSQTLITKSDIYQLTHPWLGLGLLTSTGSKWHKHRKMITPAFHFNILQ
DFHEVMNENSTKFIKHLKTVAAGDNIFDFQEQAHYLTLDVICDTAMGVSI
NAMENRSSSIVQAFKDMCYNINMRAFHPLKRNELLYRLAPDYPAYSRTLK
TLQDFTNEIIAKRIEAHKSGAVSTNAGDEFTRKKMAFLDTLLSSTIDGRP
LNSKELYEEVSTFMFEGHDTTTSGVSFAVYLLSRHQDEQRKLFKEQREVM
GNSELGRDATFQEISQMKYLDLFIKEAQRVYPSVPFIGRFTEKDYVIDGD
LVPKGTTLNLGLVMLGYNEKVFKDPHKFRPERFELEKPGPFEYVPFSAGP
RNCIGQKFALLEIKTVVSKIIRNFEVLPALDELVSKDGYISTTIGLPDAE
RKKRDPYRHKYDPILSAVLTLKSENGLYIRLKERH

AC007291 Cyp4e3 BACR02I05 comp(115634-117642) also U34330
MWLAVLALLVLPLITLVYFERKASQRRQLLKEFNGPTPVPILGNANRIGK
NP*TEILSTFFDWWYDYGKDNFLFWIGYSSHIVMTNPKQLE*YILNSQQL
IQKSTIYDLLHPWLGHGLLTSFGSKWHKHRKMITPSFHFNILQDFHEVMN
ENSAKFMTQLKKASAGDTIIDFQEHANYLTLDVICDTAMGVPINAMEQRD
SSIVQAFREMCYNINMRAFHPFKRSNRVFSLTPEFSAYQKTLKTLQDFTY
DIIEKRVYALQNGGSKEDHDPSLPRKKMAFLDTLLSSTIDGRPLTRQEIY
EEVSTFMFEGHDTTTSGVSFSVYLLSRHPDVQRKLYREQCEVMGHDMNRS
VSFQEIAKMKYLDLFIKEAQRVYPSVPFIGRYCDKDYDLDGSIVPKGTTL
NLALILLGYNDRIFKDPHHFRPERFEEEKPAPFEYLPFSAGPRNCIGQKF
ALLELKTVISKVVRSFEVLPAVDELVSTDGRLNTYLGLAPDEKLKREAGR
HKYDPILSAVLTLKSDNGLHLRLRERRS*

Cyp4e4 AL009194 152A3.6 16210-17867 also U34331
MLVVLLVALLVTRLVASLFRLALKELRHPLQGVVPSVSRVPLLGAAWQMR
SFQPDNLHDKFAEYVKRFGRSFMGTVLGHVVMVTAEPRHIDALLQGQHQL
KKGTMYFALRGWLGDGLLLSRGKEWHTMRKIITPTFHFSILEQFVEVFDR
QSSILVERLRTLSYGNEVVNIYPLVGLAALDIITETAMGVNVDAQGADSE
VVHAVKDLTNILATRFMRPHLLFPHLFRLCWPSGFRKQQAGVICLHEFTN
GIIEQRRRLLAREANQDKPTKPHALLDTLLRATVDGQPLTDKQIRDEVNT
FIFEGHDTTTSAVSFCLYLLSRHEAVQQKLFEELRMHYGQDLFRGVILSD
FATLPYLSCVVKESLRLYPPIPAVARCLEKDLVIDEGYIPVGTNVVVLLW
QLLRDEAIFTDPLVFQPERHLGEEAPRLSPYSYIPFSAGPRNCIGQKFAL
LEMKTMVTKVIRHYQLLPMGADVEPSIKIVLRSKSGVNVGLRPRLY

AC005451 comp(14729-16905) AC005415 ESTs AA202364 (LD02646) AI297753
MFLIAIAIILATILVFKGVRIFNYIDHMAGIMEMIPGPTPYPFVGNLFQF
GLKPAEYPKKVLQYCRKYDFQGFRSLVFLQYHMMLSDPAEIQNILSSSSL
LYKEHLYSFLRPWLGDGLLTSSGARWLKHQKLYAPAFERSAIEGYLRVVH
RTGGQFVQKLDVLSDTQEVFDAQELVAKCTLDIVCENATGQDSSSLNGET
SDLHGAIKDLCDIVCENAVVQERTFSIVKRFDALFKFSQYRARINKTPKL
ITSQIISQRRHQLAAENTCQQGQPINKPFLDVLLTAKLDGKVLKEREIIE
EVSTFIFTGHDPIAAAISFTLYTLSRHSEIQQKAAEEQRRIFGENFAGEA
DLARLDQMHYLELIIRETLRLYPSVPLIARTNRNPIDLDGTKVAKCTTVI
MCLIAMGYNEKYFDDPCTFRPERFENPTGNVGIEAFKSVPFSAGPRRCIG
EKFAMYQMKALLSQLLRRFEILPAVDGLPPGINDHSREDCVPQSEYDPVL
NIRVTLKSENGIQIRLRKR*

Cyp4e5 U78486 Drosophila mettleri
MWFIVYILLALPIMLFVFLSCEWPKRNDAEQIEWSSGVPFLGNAHQMGKT
PAEILNTFFEFWHKYNKDNFRIWIGYYANILVSNPKHLEVIMNSTTLIEK
LDIYDMLHPWLGEGLLTSKGSKWHKHRKMITPTFHFNILQDFHQVMNENS
AKFIKRLKEVSAGDNIIDFQDETHYLTLDAICDTAMGVTINAIEKRDTVD
VVKAFKDMCHIINMRAFRPLQRSDFLYRFSPEYATYAKTLKTLKDFTNDI
IAKRIKVHRTAAAKTNQEGSEFSRKKMLPDTLLSATIDGRPLNQQEIYEE
VSTFMFEGHDTTTSGVAFAGYILSRFPEEQRKLYEEQQAVMGNELNRDAT
FQEISAMKYLDLFIKEAQRVYPSVPFIGRYTDKDYNIHGTIMPKGTTLNL
GIIVLGYDDRVFEEPHRFYPERFEKQKPGPFEYVPFSAGPRNCIGQKFAL
LELKTVISKLVRTFEVLPAVDELVSKDGNLNTYVGLPKEEKERKERMGYK
YDPILSAVLTLKSENGLHLRLR

Cyp4g1 AL009188 also U34328,
MAVEVVQETLQQAASSSSTTVLGFSPMLTTLVGTLVAMALYEYWRRNSRE
YRMVANIPSPPELPILGQAHVAAGLSNAEILAVGLGYLNKYGETMKAWLG
NVLLVFLTNPSDIELILSGHQHLTKAEEYRYFKPWFGDGLLISNGHHWRH
HRKMIAPTFHQSILKSFVPTFVDHSKAVVARMGLEAGKSFDVHDYMSQTT
VDILLSTAMGVKKLPEGNKSFEYAQAVVDMCDIIHKRQVKLLYRLDSIYK
FTKLREKGDRMMNIILGMTSKVVKDRKENFQEESRAIVEEISTPVASTPA
SKKEGLRDDLDDIDENDVGAKRRLALLDAMVEMAKNPDIEWNEKDIMDEV
NTIMFEGHDTTSAGSSFALCMMGIHKDIQAKVFAEQKAIFGDNMLRDCTF
ADTMEMKYLERVILETLRLYPPVPLIARRLDYDLKLASGPYTVPKGTTVI
VLQYCVHRRPDIYPNPTKFDPDNFLPERMANRHYYSFIPFSAGPRSCVGR
KYAMLKLKVLLSTIVRNYIVHSTDTEADFKLQADIILKLENGFNVSLEKR
QYATVA

AL067059 (BACR015N20), AL057969 (BACR024M08) Drosophila melanogaster
MSLMFLGAECSSMVLAAFETSAHSVFFALVLLAMFPEHQXLVFXXFKEHFLLAKG
IEVTHTVLQX 425
LXFXVRXLNETLRLMPSVPFSSRETLEDLRXSXGVVIPKGMTFSIDIFNTQRNTXX
WG 251
SEAAQFNPENFLPEKIHDRHPYAFIPFSKGKXNCIG 143

Cyp4g15 AC013897 AI403684 AI388987 N-term AI134510 Cyp4g15 C-term
MEVLKKDAALGSPSSVFYFLLLPTLVLWYIYWRLSRAHLYRLAGRLPGPRGLPIVGHLFD
VIGPASSVFRTVIRKSAPFEHIAKMWIGPKLVVFIYDPRDVELLLSSHVYIDKASEYKFF
KPWLGDGLLISTGEKWRSHRKLIAPTFHLNVLKSFIELFNENSRNVVRKLRAEDGRTFDC
HDYMSEATVEILLGE*TAMGVSKKTQDKSGFEYAMAVMRMCDILHARHRSIFLRNEFGFT
LTRYYKEQGRLLNIIHGLTTKVIRSKKAAFEQGTRGSLAQCELKAAALEREREQNGGVDQ
TPSTAGSDEKDREKDKEKASPVAGLSYGQSAGLKDDLDVEDNDIGEKKRLAFLDLMLESA
QNGALITDTEIKEQVDTIMFEGHDTTAAGSSFFLSLMGIHQDIQDRVLAELDSIFGDSQR
PATFQDTLEMKYLERCLMETLRMYPPVPLIARELQEDLKLNSGNYVIPRGATVTVATVLL
HRNPKVYANPNVFDPDNFLPERQANRHYYAFVPFSAGPRSCVG*RKYAMLKLKILLSTIL
RNYRVYSDLTESDFKLQADIILKREEGFRVRLQPRTS*

Cyp4p1 U34327, AC008186 comp(2-88, 148-465) EST AI293255
TSIGLIFGLMNMSLNPDKQELCYQEIQEHIDDDLSNLDVGQLNKLKYLEY
FMKETTRLFPSVPIMGREAVQETELANGLILPKGAQITIHVFDIHRNAKY
WDSPEEFRPERFLPENVQDRHTYAYVPFSAGQRNCIGKKYAMQEMKTLMV
VLLKQFKVLKAIDPQKIVFHTGITLRTQDKIRVKLVRRT*

Cyp4p2 AL055774 BACR021H19 80% identical to Cyp4p1
ETAMGIKLDEMAEKGDRYRANFHIIDEGLTRRIVNPLYWDDCVYNMFTG
HKYNAALKVVHEFSREIIAKRRVLLEEELENRRATQTADDDM*KETFAML
DTLICAEKDGLIDDIGISEEVDTLMAEGYDTTSIGLVFGLMNMSLYAAEQ
ELCYQEIQEHIXDDLSNLNLSQLSKLNYLGYFIKETMRLYPSIPIMGRQT
LQETELENGLILPKRSQINIHVFDI

Cyp4p3 AC008186 48361-48564, 48626-49174 AL063060
FAMLDTLILAEKDGLIDHIGICEEVDTLMFEGYDTTSIGVVLGKRNMSLYAAEQNLC
SQEIQEHIXDDLSNLTLSQLSKLNYLGYFIKETRRRYLSIPIMGRQTLQEPELENGLFLPKRSQINI
HVFDIHRNPKYWESPEEFRPERFLPQNCLKRHPYAYIPFSAGQRNCIGK
KYAMQEMKTLMVVILKHFKILPVIDPKSIVFQVGITLRFKNKIKVKLVRRNCV*

N-term A AI389883 56% to 4p Nterm about 15 aa short to overlap with AL055774
MMICLLWISVAILVVIHWIYKVNKDYNILAFFARRVQTKDGKP
LDSLVPMIKGRTVFANCFDLLGKDTDQVFTHLRQLAKNSGDSYLQYSMGF
SNFNVIDAHNAANILNHPNLITKGVIYNFLHPFLRTGVLTATEKKWHTRR
SMLTRTFHLDILNQFQEIFIAESLKFVSQFQGQNEV

N-term B follows Cyp4p3 sequence on AC008186
MLILWLVGAFIVLIQWIYRLNRDYCILGFFAKRIRTKNGQNPESIAPLVKGSTIFANSFD
LYGKDHAGVFEHSRDCAKKLGKSYAEYAMGTAIYNVIDADSAERVLNDPNLINKGTIYDFLHPFLRTGLLTSTG

N-term C AI405947 GH26104, AI109737 GH09012 AL063060
MIILWLILALSALLYWLHRANKDYHILSFFTKRIRLKDG
TPVEIIAPIAKGKTIFGNTLDLYGRDHAGVFNYSRERAKEMGTSYIEYVF
GKAIYNIIDADSAENVLNHPNLITKGLVYNFLHPFLRTGLLTSTGKKWHA
RRKMLTPTFHFNILNQFQEIFKTESQKFLLQFEGQDEVTITLHDVIPRFT
LNSICETAMGVKLDEMAEKGDRYR

AC018129 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered

Cyp4p1 and N-term C intact
69662 MIILWLILALSALLYWLHRANKDYHILSFFTKRIRLKDGTPVEIIAPIAKGKTIFGNTLD 69483
69482 LYGRDH 69465
69405 AGVFNYSRERAKEMGTSYIEYVFGKAIYNIIDADSAENVLNHPNLITKGLVYNFLHPFLR 69226
69225 TGLLTSTGKKWHARRKMLTPTFHFNILNQFQEIFK 69058
69010 TESQKFLLQFEGQDEVTITLHDVIPRFTLNSICET 68831
68830 AMGVKLDEMAEKGDRYRENFSQIEECFIRRLSNPLLWGDKLFEMFAAKDFASALDVVHRF 68651
68650 SSEIIAKRRDLLKDELDKSSSTADDDXX 68573
68501 KKRFAMLDTLIYAEKDGLIDHIGICEEVDTLMFEGYDTTSIGLIFGLMNMSLNPDKQELC 68322
68321 YQEIQEHISDDLSNLDVGQLNKLKYLEYFMKETTRLFPSVPI 68142
68141 MGREAVQETELANG 68103
68100 LILPKGAQITIHVFDIHRNAKYWDSPEEFRPERFLPENVQDRHTYAYVPFSAGQRNCIGK 67921
67864 KYAMQEMKTLMVVLLKQFKVLKAIDPQKIVFHTGITLRTQDKIRVKLVRRT* 67706

Cyp4p3 N-term B intact matches 1st part of 4p3 but this was a chimera
67385 MLILWLVGAFIVLIQWIYRLNRDYCILGFFAKRIRTKNGQNPESIAPLVKGSTIFANSFD 67206
67205 LYGKDH
67105 AGVFEHSRDCAKKLGKSYAEYAMGTAIYNVIDADSAERVLNDPNLINKGTIYDFLHPFLR 66926
66925 TGLLTSTG
66869 KKWHARRKMLSPTFHFNILNQFQEIF 66768
66708 RTESLKFLEQFKGNDEAIISLNEVIPRFTLNSIC 66607
66565 ETAMGVKLDEMAEKGDRYRENFRQIEECFIRRMSNPLLWSDTLFKMFAEKDYAS 66386
66385 ALDVVHGFSSEIIAKRRDQLNDEIDSRGNTQTAEDEL 66272
66213 KKRFAMLDTLILAEKDGLIDHIGICEEVDTLMFEGYDTTSIGLMFGLMNMSLYPEEQEKC 66034
66033 YQEIQANISDELNILNIGQLNKLKNLEYFIKETMRLFPSVP 65854
65853 AMGRETTRETELSNGLILPKGSQIFVHVFDIHRNPEYWDSPEEFRPERFLPENSQNRHTY 65674
65673 AYIPFSAGQRNCIGM
65578 KFAMQEMKTLMVALLKQFQILPEIDPKTIVFQTGLTLRTKNQIHVFDI 65435

Cyp4p2 (plus N-term A)
72180 MMICLLWISVAILVVIHWIYKVNKDYNILAFFARRVQTKDGKPLDSLVPMIKGRTVFANCFDLL 72001
72000 GKDTDQ 71992
71902 VFTHLRQLAKNSGDSYLQYSMGFSNFNVIDAHNAANILNHPNLITKGVIYNFLHPFLRTG 71723
71722 VLTATGKKWHTRRSMLTRTFHLDILNQFQEIF 71573
71448 RAESLKFVSQFQGQNEVVVSLKDRISRFTLNSIC 71308
71264 ETAMGIKLDEMAEKGDRYRANFHIIDEGLTRRIVNPLYWDDCVYNMFTGHKYNAALK 71085
71084 VVHEFSREIIAKRRVLLEEELENRRATQTADDDM 70980
70912 KKRFAMLDTLICAEKDGLIDDIGISEEVDTLMAEGYDTTSIGLVFGLMNMSLYAAEQELC 70733
70732 YQEIQEHIL
70647 DDLSNLNLSQLSKLNYLGYFIKETMRLYPSIPIMGRQTLQETELENGLILPKRSQINI 70468
70467 HVFDIHRNPKYWESPEEFRPERFLPQNCLKRHPYAYIPFSAGQRNCIG
70287 KKYAMQEMKTLMVVILKHFKILPVIDPKSIVFQVGITLRFKNKIKVKL 70108
70107 VRRNCV* 70099

AC008324 chromosome 2 clone BACR25K01 (D854)Gene 1 76245-78135
MFLEVLFAAPLVIFIFRKLWAHLNRTYFILSLCKRIRTEDGSLLESKIYVAPSKTRFGNNFDLVNFT
TESIFNFMRDASAKAKGRNYLWYFFHAP
MYNIVRAEEAEEILQSSKLITKNMIYELLKPFLGEGLLISTDQKWHSRRKALTPAFHFKVLQSFLIIFK
EECNKLVKVLHQSVNMELELNQVIPQFTLNNVC
ETALGVKLDDLSEGIRYRQSIHAIEEVMQQRLCNPFFYNIVYFFLFGDYRKQV
NNLKIAHEFSSNIIEKRRSLFKSNQLGQEDEFGKKQRYAMLDTLLAAEADGQI
DHQGICDEVNTFMFEGYDTTSTCLIFTLLMLALHEDVQKKCYEEIKYLPDDSDDISVFQ
FNELVYMECVIKESLRLFPSVPFIGRQCVEETVVNGMVMPKDTQISIHLYEIMRDARHFS
NPDLFQPDRFFPENTVNRHPFAFVPFSAGQRNCIGQKFAILEIKVLLAAVIRNFKILPVT
LLDDLTFENGIVLRTKQNIKVKLVHRENK*

AC017771 = AC008324 Gene 2 78359-79584 and AC008003 AA567719 AA697618 AI517752, AL108342
21996 MWIALLGSSLLIGALWLLLRQLNKTYFILSLCKRVRTADGSPLESKVFVVPGKTRFGNNLDLNLTP 21811
21743 ANIFSYIRESTAKANGQNYIWNFLFAPEYNIVRAEDAEEIFQSTKITTKNMSYELIRPFL 21564
21563 GDGLLISI
DQKWHTRRKTLTPAFHFNILQSFLSIF
REESKKFIKILDKNVGFELELNQIIPQFTLNNIC
21203 ETALGVKLDDMSEGNEYRKAIHDFEIVFNQRMCNPLMFFNWYFFLFGDYKKY 21027
21026 SRILRTIHGFSSGIIQRKRQQFKQKQLGQVDEFGKKQRYAMLDTLLAAEAEGKID 20862
20861 HQGICDEVNTFMFGGYDTTSTSLIFTLLLLALHADVQERCYEELQDLPEDIDEVSMFQF 20685
20684 NELIHLECVIKESLRLFPSAPIIGRTCIEESVMNGLVLPKNAQISIHIYDIMRDARHFP 20508
20507 KPNQFLPERFLPENSVNRHPFAFVPFSAGPRNCIGQKFGVLEIKVLLAAVIRNFKLLPAT 20328
20327 QLEDLTFENGIVLRTQQNIKVKFEAR 20250

AC008325 Drosophila melanogaster chromosome 2 clone BACR05M06 (D855) RPCI-98
MWIALLGIPILLAVLTLLLKHINKTYFILSLTKRVRTEDGSPLESKVAIMPGKTRFGNNL
DILNFTPASVFNFVRESTAKAKGQNYLWYFLYAPMYNVVRPEEAEEVFQSTKLITKNVVY
ELIRPFLGDGLLISTDHKWHSRRKALTPAFHFNVLQSFLGIFK
EECKKFLNVLEKNLDAELELNQVIPPFTLNNIC
ETALGVKLDDMSEGNEYRKAIH

Cyp6a2 M88009, S51248, U78088, AC007549
MFVLIYLLIAISSLLAYLYHRNFNYWNRRGLPHDAPHPLYGNMVGFRKNR
VMHDFFYDYYNKYRKSGFPFVGFYFLHKPAAFIVDTQLAKNILIKDFSNF
ADRGQFHNGRDDPLTQHLFNLDGKKWKDMRQRLTPTFTSGKMKFMFPTVI
KVSEEFVKVITEQVPAAQNGAVLEIKELMARFTTDVIGTCRFGIECNTLR
TPVSDFRTMGQKVFTDMRHGKLLTMFVFSFPKLASRLRMRMMPEDVHQFF
MRLVNDTIALRERENFKRNDFMNLLIELKQKGRVTLDNGEVIEGMDIGE
LAAQVFVFYVAGFETSSSTMSYCLYELAQNQDIQDRLRNEIQTVLEEQEG
QLTYESIKAMTYLNQVISETLRLYTLVPHLERKALNDYVVPGHEKLVIEK
GTQVIIPACAYHRDEDLYPNPETFDPERFSPEKVAARESVEWLPFGDGPR
NCIGMRFGQMQARIGLAQIISRFRVSVCDTTEIPLKYSPMSIVLGTVGGI
YLRVERI

AL062684.1|CNS002GP Drosophila melanogaster genome survey sequence TET3 end of
LAGXXXSTTMGFTLYELACNQDVQDKLRAEIXSVLERYNGKLEYDSMQDLFYME
KVINESLRKHPVXAHLARIATKPYQHSNPKYFIEAGTGVLVSTLGIHHDPEXY
PEPEKFIPERFDEEQVKRASHLRFPTFGAGPRNCIGLRFGRMQVIIGLALLIHNXRF
EXHPKTPVPMKYTINNLLLGSEGGIHLNITKVVRD*

AC010578 21500-20681 AA696094, AA803578, AA202305, AI546241
MAILLGLVVGVLTLVAWWVLQNYTYWKRRGIPHDPPNIPLGNTGELWRTMP
LAGILKRTYLKFRKQTDGPFAGFYLYAMKYIVITDVDFVKTVLIRDFDKF
HDRGVYHNDKDDPLTNNLATIEGQKWKNLRQKLTHTFTSAKMKSMFSTVL
NVGDEMIRVVDEKISSSSQTLEVTDIVSRFTSDVIGICAFGLKCNSLRDP
KAEFVQMGYSALRERRHGWLVDLLIFGMPKLAGELGFQFLLPSVQKFYMK
IVQDTIDYRMKRKVTRNDFMDTLIDMKQQYDKGDKENGLAFNEVAAQAFV
FFLAG 20681

AI063421 GH03219 AI532649 SD04231, 63% identical to AA696094
LIVLLIGVITFVAWYVHQHFNYWKRRGIPHDEPKIPYGNTSELMKTVHFA
DIFKRTYNKLRNKTDGPFVGFYMYFKRMVVVTDIDFAKTVLIREFDKFHD
RGVFHNERDDPLSANLVNIDGQKWKTLRQKLTPTFTSGKMKTMFPTILTV
GDELIRVFGETASADSDSMEITNVVARFTADVIGSCAFGLDCHSLSDPKAKF

AA951440 LD31895, AA816508 LD01943, AA201305 LD04267
MLDVVALLLIALAVGFWFVRTRYSYWTRRGIGSEPARFPVGNMEGFRKNKHFI
DIVTPIYEKFKGNGAPFAGFFMMLRPVVLVTDLELAKQILIQDFANFEDR
GMYHNERDDPLTGHLFRIDGPKWRPLRQKMSPTFTSAKMKYMFPTVCEVG
EELTQVCGELADNAMCGILEIGDLMARYTSDVIGRCAFGVECNGLRNPEA
EFAIMGRRAFSERRHCKLVDGFIESFPEVARFLRMRQIHQDITDFYVGIV
RETVKQREEQGIVRSDFMNLLIEMKQRGELTIEEMAAQAFIFFAAGFDTS
ASTLGFAXYELAKQP

AC010578 Drosophila melanogaster chromosome 2 clone BACR03K23 (D1086) RPCI-98
EFLAQAIIFLGAGFETSSTTMGFGIYELGRNQDVQDKLREEIGNVFGKHNKEFTYEGIKEMKYLEQVVMETLRKYPVLAHLTRMTD
TDFSPEDPKYFIAKGTIVVIPALGIHYDPDIYPEPEIFKPERFTDEEIAARPSCTWLPFGEGPRNCIGLRFGMMQTCVGLAYLIRG
YKFSVSPETQIPMKIVVKNILISAENGIHLKVEKLAK*

L46858, AL061295 (BACR004K24) 52% identical to cyp6a5
EFTYDSMQELRYMELVIAETLRKYPILPQLTRISRHLYAAKGDRHFYIEP
GQMLLIPVYGIHHDPALYPEPHKFIPERFLADQLAQRPTAAWLPFGDGPR
NCIGMRFGKMQTTIGLVSLLRNFHFSVCPRTDPKIEFLKSNILLCPAHGI
YLKVQQLSQMSS*

AL061650.1|CNS00613 Drosophila melanogaster genome survey sequence TET3 end of
TLRGXPLLPRLTRFSGLLYAARGVRLFXFGPGLLLLXPVYGIXXVPALXPXPHRFI
PERX 539
LAGRLAPRPAAAWLPFGVGPRXCVGMGFGRVPAAVGLVGLLRIFRFGVCPRPGP
GVAFLR 359
SPFLLCPAXGFCLGVPRL 305

AA801503 HL02667 I-helix and before, 53% identical to Cyp12a2
TVQEYRSPNGFLLRLGRETSLYRYIPTPTYKKFSRAMDEIFDT
CSMYVNQAIERIDRKSSQGDSNDHKSVLEQLLQIDRKLAVVMAMDMLMGG
VDTTSTAISGILLNLAKNPEKQQRLREEVLSKLTSLHSEFTVEDMK

AC007137.10|AC007137 Drosophila melanogaster chromosome 2 clone BACR25C02
MWLLLPILLYSAVFLSVRHIYSHWRRRGFPSEKAGITWSFLQKAYRREFR
HVEAICEAYQSGKDRLLGIYCFFRPVLLVRNVELAQTILQQSNGHFSELK
WDYISGYRRFNLLEKLAPMFGTKRLSEMFGQVQKVGDHLIHHLLDRQGQG
CPQEVDIQQKLRV*YSVNIIANLIYGLDINNFEHEDHILTSYLSHSQASIQS*
FTLGRLPQKSSYTYRLRDLIKQSVELREDHGLIRKDILQLLVRFRN*NRLM
VSFEVCIVKSCTNSLFLDADKLLSIKRLAKVAEDLLKVSLDAVASTVTFT
LLEILQEPLIVEKLRAEIKELSNENGQLKFEELNGLRYMDMCLK*ETLRK
YPPLPIIERVCRKSYSLPNSKFTIDEGKTLMVPLLAMHRDEKYFSEPMKY
KPLRFLQTANDVGQCEDKTKSNVFIGFGIGGSQCVGTYRIQIILAMFRQY
FSHRNYRTELCKVGN*

Cyp6a8 L46859, AL054065, AI258590 LP01819, AI107730 GH05558
MALAYILFQVAVALLAILTYYIHRKLTYFKRRGIPFVAPHLIRGNMEELQ
KTKNIHEIFQDHYNKFRESKAPFVGFFFFQSPAAFVIDLELAKQILIKDF
SNFSNKGIFYNEKDDPISAHLFNLDGAQWRLLRNKLSSTFTSGKMKLMYP
TVVSVANEFMTVMHEKVPKNSVLEIRDLVARFTVDVIGTCAFAIQCNSLR
DEKAEFLYFGKRSLVDKRHGTLLNGFMRSYPKLARKLGMVRTAPHIQEFY
SRIVTETVAVREKEHIKRNDFMDMLIELKNQKEMTLENGDVVRGLTMEEV
LAQAFVFFIAGFETSSSTMGFALYELAKNPDIEQHDQNFTYECTKDLKYL
NQVLDETLRLYTIVPNLDRMAAKRYVVPGHPNFVIEAGQSVIIPSSAIHH
DPSIYPEPFEFRPERFSPEESAGRPSVAWLPFGDGPRNCIGLRFGQMQAR
IGPALLIRNFKFSTCSKTPNPLVYDPKSFVLGVKDGIYLKVETV

AC008285 AC017567 Drosophila melanogaster chromosome 3 clone BACR31P16 (D1002)
MQLTYFLFQVAVALLAIVTYILHRKLTYFKRRGIPYDKPHPLRGNMEGYKKTRTVHEIHQ
EYYNKYRNSKAPFVGFYLFQKPAAFVIDLELAKQILIKNFSNFTDKGIYYNEKDDPMSAH
LFNLDGPQWRLLRSKLSSTFTSGKMKFMYPTVVSVAEEFMAVMHEKVSENSILDVRDLVA
RFTVDVIGTCAFGIKCNSLRDEKAEFLHFGRRALLDSRHGNLVSGLMRSYPNLARRLGLC
RNTAQIQEFYQRIVKETVTLREKENIKRNDFMDMLIGLKNQKNMTLENGEVVKGLTMDEI
VAQAFVFFIAGFDTSSSTMGFALYELAKNPSIEQHDQKFTYECIKDLKYLDQVLSETLRH
YTIVPNVDRVAAKRFVVPGNPKFVIEAGQSVIIPSSAIHHDPSIYPEPNEFRPERFSPEE
SAKRPSVAWLPFGEGPRNCIGLRFGQMQARIGLAMLIKNFTFSPCSATPDPLTFDPHSAI
LLGIKGGIQLKVEAI*

AC004721 AC017455 Cyp6a16 DS03308 complete 2-455 comp(2956-5070) 47% to 6a8
MDFTLLLLTSLLSFLLGYLRYRFTYWELRGIPQLRPHFLFGHFFRLQSVHYSELLQETYDAF
RGSAKVAGTYVFLRPMAVVLDLDLVKAVLIRDFNNFVDRRSFHGDPLTANLFNLQGEEWRNL
RTKLSPTFTSGKMKYMFGTVSTVAQQLGGTFDELVAVLELHDLMARYTTDVI
GSCAFGTECSSLREPQAEFRQVGRRIFRNSNRSIRWRIFKMTYLSSLAKLGLPVRI
LHPDITKFFNRIVRETVELRERENIRRNDFMDLLLDLRR
KGLTMEQMAAQAFVFFVAGFETSSSNMSYALFELAKNQDVQQKLRMEINDSIGKHG
KLTYEAMMEMPYLDQTI
PETLRKYPALSSLTRLASEDYEIPSPDGGDPVVLEKGTSVHIPVLAIHYDPEVYPEP
HEFRPERFAPDACRERHPTAFLGFGDGPRNCIGLRFGRMQVKVGLITLLR
RFRFSLPPGSPTQLKVTKRNLILLPSDGVRLQVDPVESRLM*

Cyp6a9 L46860, AL054861, AL053264, AL072094, AL055555
MGVYSVLLAIVVVLVGYLLLKWRRALHYWQNLDIPCE
EPHILMGSLTGVQTSRSFSAIWMDYYNKFRGTGPFAGFYWFQRPGILVLD
ISLAKLILIKEFNKFTDRGFYHNTEDDPLSGQLFLLDGQKWKSMRSKLSS
TFTSGKMKYMFPTVVKVGHEFIEVFGQAMEKSPIVEVRDILARFTTDVIG
TCAFGIECSSLKDPEAEFRVMGRRAIFEQRHGPIGIAFINSFQNLARRLH
MKITLEEAEHFFLRIVRETVAFREKNNIRRNDFMDQLIDLKNSPLTKSES
GESVNLTIEEMAAQAFVFFGAGFETSSTTMGFALYELAQHQDIQDRVRKE
CQEVIGKYNGEITYESMKDMVYLDQVISETLRLYTVLPDLNRECLEDYEV
PGHPKYVIKKGMPVLIPCGAMHRDEKLYANPNTFNPIFFARTSEGSDSVE
WLPFGDGPRLCIGMRFGQMQARSGLALLINRFKFSVCEQTTIPIVYSKKT
FLISSETGIFLKVERV

AL072844 Drosophila melanogaster genome survey sequence TET3 end
LLIPTAAIHMDPGIYENPQRFYPERFXEQAXRSRPAAAFLPXGDGLRGCIAARFAEQ
QLLVGLVALLRQHRYAPSAETSIPVEYDNRRLLLMPKSDIKLSVERVDXL*

AC008288 = AL072844 AC009342 AC017706
MDLMHRTLLTALGELSVVYALVKFSLGYWKRRGILHEKPKFL
WGNIKGVVSGKRHAQDALQDIY
TAYKGRAPFVGFYACLKPFILALDLKLVHQIIFTDAGHFTSRGLYSNPSGEPLSHNLLQLNGHKWRSLHAKSAEVFTPANMQKLLV
RLSQISSRIQRDLGEKSLQTINISELVGAYNTDVMASMAFGLVGQDNVEFAKWTRNYWADFRMWQAYLALEFPLIARLLQYKSYAE
PATAYFQKVALSQLQLHRRRDRQPLQTFLQLYSNAEKPLTDIEIAGQAFGFVLAGLGPLNATLAFCLYELARQPEVQDRTRLEINK
ALEEHGGQVTPECLRELRYTKQVLNETLRLHTPHPFLLRRATKEFEVPGSVFVIAKGNNVLIPTAAI
HMDPGIYENPQRFYPERFEEQARRSRPAAAFLPFGDGLRGCIAARFAEQQLLVGLVALLRQHRYAPSAETSIPVEYDNRRLLLMPK
SDIKLSVERVDKL*

AL069964 Drosophila melanogaster genome survey sequence T7 end of
MSVGTVLLTALLALVGYLLMKWRSXMRHWQDLGIPCEEPHILMG
SMKGVRTARSFNEIWTSYYNKFRGSGPFAGFYWFRRPAVFVLEK
SLXKQILIKEFNKFTDRGXFHNPEDDPLSGQLFLLDGQKWRTMR
NSTSSTFTSGKMKY

Cyp6a9 HOMOLOG Drosophila grimshawi U87164
DPEAEFRIMGRKSLTDQRHGNLGNALLNGFPNFSRRIHMKLTPEHIEKFF
MRIVKETVDYREKNNVRRNDFMDQLIDLKNKPLMKSETGESMNLTIEEIS
AQALVFFAAGFETSSTTMGFALYELARAEDVQNRLRKECNEVLARHNGDL
TYESIKDMKYLDQVISETLRLYTVLPILNRQCLEDYVVP

Cyp6a13 Drosophila melanogaster AC005457, DS08616, AA941155 LD25139
MLTLLVLVFTVGLLLYVKLRWHYSYWSRRGV
AGERPVYFRGNMSGLGRDLHWTDINLRIYRKFRGVERYCGYFTFMTKSLFIMDLELIRDI
MIRDFSSFADRGLFHNVRDDPLT
GNLLFLDGPEWRWLRQNLTQVFTSGKMKFMFPNMVEVGEKLTQACRLQVGEIEAKD
LCARFTTDVIGSCAFGLECNSLQDPESQFRRMGRSVTQEPLHSVLVQAFMFAQPELARKL
RFRLFRPEVSEFFLDTVRQTLDYRRRENIHRNDLIQLLMELGEEGVKDALSFEQIAAQALV
FFLAGFDTSSTTMSFCLYELALNPDVQERLRVEVLAVLKRNNQKLTYDSVQEMPYLDQ
VVAETLRKYPILPHLLRRSTKEYQIPNSNLILEPGSKIIIPVHSIHHDPELYPDPEKFDPSRFE
PEEIKARHPFAYLPFGEGPRNCIGERFGKLQVKVGLVYLLRDFKFSRSEKTQIPLKFS
SRNFLISTQEGVHLRME

AC009844 65863-64467 45% to 6a2 same as AL097801, AL057750
MAVMIVLLIGVITFLAWYVHQHFNYWKRRGI
FPR*APKLPTVIPAYLMKTRPFCGYFSRDPPTKLRTKPAGPFVGFLYVFQEDW*L*PNIDSAKPE
LIREFDKFPVGGVFHNERED
PLSATLVNIDGQKWKPLRQKLTPTFTSGKMKTMFPTILTVGDELIRVFGETASADSDSME
ITNVVARFTADVIGSCAFGLDCHSLSDPKAKFVQMGTTAITERRHGKSMDLLLFGAPELA
AKLRMKATVQEVEDFYMNIIRDTVDYRVKNNVKRHDFVDMLIEMKLKFDNGDKENGLTFN
EIAAQAFIFFLAGFETSSTTMGFALYELACHQDIQDKLRTEINTVLKQHNGKLDYDSMRE
MTYLEKVITETMRKRPVVGHLIRVATQHYQHTNPKYNIEKGTGVIVPTLAIHHDPEFYPE
PEKFIPERFDEDQVQQRPXCTFLPFGDGPRNCIGLRFGRMQVIVGXALLIHNFKF

AC009844 28950-28363 and 42575-43341 same sequence
VLEIVDLVARYTPDVIGNCAFGLNCNSLQNPNAEFVTIGKRAIIERRYGGLLDFLIFGFP 28771
KLSRRLRLKLNVQDVEDFYTSIVRNTIDYRLRTNEKRHDFMDSLIEMYEKEQAGNT 28603
EDGLSFNEILAQAFIFFVAGFETSSTTMGFALYELALDQDIQKHNNEF 28423
TYEGIKEMKYLEQVVMETLRKYPVLAHLTRMTQTDFSPEDPKYFIPKGTTGVIPALGIHYDPEIYPEP
GEVKPERLTDEAIAARPSCTWL

AC009844 44542-44881
MLLLALIVVILSLLVFAARRRHGYWQRRGIPHDVPHPIYGNIKDWPKKRHIAMIFRDYYFK
YKRSVYPFAGFYFFFTRSAVITDLELVKRVLIKDFNHFENRGIFYNEIDDPLS

AC009844 35993-35161
IERFFMRIVRETVAFREQNNIRRNDFMDQLIDLKNKPLMVSQSGESVNLTIEEIAAQAF 35817
VFFAAGFETSSTTMGFALYELAQNQDIEKCNGELNYESMKDLVYLDQ 35637
ETLRLYTVLPVLNRECLEDYEVPGHPKYVIKKGMPVLIPCGAMHRD 35440
EKLYANPNTFNPDNFSPERVKERDSVEWLPFGDGPRNCIGMRFGQMQARIGLALLIKDFK 35260
FSVCEKTTIPMTYNKEMFLIASNSGIYLKAERV 35161

AC009844 82008-82526
KGLYCNQKSDPLSGDLYALRGESWKEMRQKLDPSLEGDRMSLLYDCLYEEAEQLLLTVNS 82187
TLMSQPHSTVHIQKIMRRYVLSSLAKCVFGLNAEQRKTYPLEDFEQMTELALNSHKHGYL 82367
MNLMMIRVPNFCRMLRMRRTPKQAEEYFIKLLTSIVEQRETSGKPQKDYLQLL 82526

AC009844 90483-90241 = AL069964
MSVGTVLLTALLALVGYLLMKWRSTMRHWQDLGIPCEEPHILMGSMKGVRTARSFNEIWTSYYNKFRGSGPFAGFYWFRRP
AVFVLEKSLGKQILIKEFNKFTDRGFFHNPEDDPLSGQLFLLDGQKWRTMRNSTSSTFTSGK

6a9
MGVYSVLLAIVVVLVGYLLLKWRRALHYWQNLDIPCE
EPHILMGSLTGVQTSRSFSAIWMDYYNKFRGTGPFAGFYWFQRPGILVLD
ISLAKLILIKEFNKFTDRGFYHNTEDDPLSGQLFLLDGQKWKSMRSKLSS
TFTSGKMKYMFPTVVKVGHEFIEVFGQAMEKSPIVEVRDILARFTTDVIG
TCAFGIECSSLKDPEAEFRVMGRRAIFEQRHGPIGIAFINSFQNLARRLH
MKITLEEAEHFFLRIVRETVAFREKNNIRRNDFMDQLIDLKNSPLTKSES
GESVNLTIEEMAAQAFVFFGAGFETSSTTMGFALYELAQHQDIQDRVRKE
CQEVIGKYNGEITYESMKDMVYLDQVISETLRLYTVLPDLNRECLEDYEV
PGHPKYVIKKGMPVLIPCGAMHRDEKLYANPNTFNPIFFARTSEGSDSVE
WLPFGDGPRLCIGMRFGQMQARSGLALLINRFKFSVCEQTTIPIVYSKKT
FLISSETGIFLKVERV

AC009844 694-846
WFGFGVGARSCIGIQFAQLQLRLALALLLSEYEFSLNTRKPLINLEDGIAL

AL057750.1|CNS00162 Drosophila melanogaster genome survey sequence TET3 end of
YVIGKFSXGLDCHSXSDPKAKFVQMGTTAITERRHGKSMXLLLFGAPELAAXX
RMKATVQEVEDFYMNIIRDTVDYRVKNNVKRHDFVDMLIEMKLKFDN
GDKENGLTFNEIAAQAFIFFLAGFETSSTTMGFALYELACHQDIQDKLRTEINTVLKQHN
GKLDYDSMREMTYLEKVITETMRKRPVVGHLIRVATQHYQHTNPKYNIEKGTGVIVPTLA
IHHDPEFYPEPEKFIPERFDEDQVQQRPXCTFLPFGDGPRNCIGLRFGRMQVIVGXALLI
HNFKF

Cyp6a14 AC005457 P1 clone DS08616 AC007085 (BACR21H10) 16539-17204
MLFTIALVGVVLGLAYSLHIKIFSYWKRKGVPHETPLPIVGNMX 13699
RSTTSAISTKEFIRSLRGRVPSPECTFFKRTALITDLDFIKQVMIKDFSYFQ 13868
DRGAFTNPRDDPLTGHLFALEGEEWRAMRHKLTPVFTSGKIKQMSKVIVDVGLRLGDAM 14045
DKAVKEAKVEEGNVEIKDLCARFTTDVIGSCAFGLECNSLQDPSAEFRQKGREIFTRR 14219
RHSTLVQSFIFTNARLARKLRIKVLPDDLTQFFMSTVKNTVDYRLKNGIKRNDFIEQMIE 14399
LRAEDQEAAKKGQGIDLSHGLTLEQMAAQAFVFFVAGFETSSSTMSLCLYELALQPDIQQ 14579
RLREEIESVLANVDGGELNYDVLAQMTYLDQVLSETLRKHP 14759
LLPHLIRETTKDYQIPNSDIVLDKGILALIPVHNIHHDPEIYPEPEKFDPSRFDPEEVKN 14939
RHPMAYLPFGDGPRNCIGLRFGKIQAKIGLVSLLRRFKFSVSNRTDVPLIFS 15095
KKSFLLTTNDGIYLKVE

Cyp6a15p AC007085 Drosophila melanogaster GenEMBL AC005457 P1 clone DS08616
AFTVTNSKLAKKLKMKILRDDLTDFFLSVVKPALSGMTLWTSPPSGGRSSRQGGSKFDLS 9960
HNWTLEQMAAQAIVFFLAGFETSSSTMSSCKYELALQPEI*NQIRDEIERVLEGNAITYDALAK 9768
INYPEQVLSETLRKHPIQLIKFLLETQES 9681
FRVRNTELIVEKGTSLLIPVHSVHYDPHLYPHPKLFDSSRLKAYKSNSRHPFAYLPFGTF 9467
GPRSCIGLRFGKMQAKIGIVSLCQRFKFGDSDLTDIPLASDTRSAIVERI* 9322

Cyp6a17 AL052842 (BACR001L16), AL074108 (BACR035C16) 49% with 6a2 AA699131
MLLLALIVVILSLLVFAARRRHGYWQRRGIPHDEVHPLFGNIKDWPNKRHIAEIFR
DYYFKYKNSDYPFAGFYFFFTRTAVVTDMELLKRVLIKDFNHFENRGVFYNEIDDPLSATL
FSIEGQKWRHLRHKLTPTFTSGKMKNMFPIVVKVGEEMDKVFRSKTAADRGQVLEVV
DLVARYTADVIGNCAFGLNCNSLYDPKAEFVSIGKRAITEHRYGNMLDIFLF
GFPKLSRRLRLKLNIQEAEDFYTKIVRETIDYRLRTKEKRND 338
FMDSLIEMYKNEQSGNSEDGLTFNELLAQAFIFFVAGFETSSTTMGFALYELARNQD 509
VQDKLREEIGNVFGKHNKEFTYEGIKEMKYLEQVVMETLRKVPVLAHLTR
MTDTDFSPEDPKYFIAKGTIVVIPALGIHYDPDIYPEPEIFKPE 263
RFTDEEIAARPSCTWLPFGEGPRNCIGLRFGMMQTCVGLAYLIRGYKFSVSPETQIPMK 86
IVVKNILISAENGIHLKVEKLA

AC007594 sequence A AI257340 AI259899 one small segment is from seq. B
MLWEFFALFAIADALLYRWASANNDFFKDRGIAYEKPELYFGNMAGMFLRKRAMFDIVCD 205
LYTKGGSKKFFGIFEQRQPLLMVRDPDLIKQITIKDFDHFINHRNEFATSSDDDPHDMSN 385
LFGSSLVSMRDARWKDMRSTLSPAFTGSKMRHMVQLMNHEA 508
KEAVDCLKQDDSRVQENELDMKDYCTRFTNDVIASTAFGLQVNSFKDRENTFY
QMGKKLTTFTFLQSMKFMLFFALKGLNKILKVELFDRKSTQYFVRLVLDAMKYRQEHNIV 180
RPDMINMLMEARGIIQTEKTKASAVREWS
DRDIVAQCFVFFFAGFETSAVLMCFTAHELMENQDVQQRLYEEVQQVDQDLEGKELTYEA 37807
IMGMKYLDQVVNEVLRKWPAAIAVDRECNKDITFDVDGQKVEVKKGDVIWLPTCGFHRDP 37987
KYFENPMKFDPERFSDENKESIQPFTYFPFGLGQRNCIGSR 38167
FALLEAKAVIYYLLKDYRFAPANKSCIPLKLITSGFQLSPKGGFWIKLVQR 38320

AC007594 sequence B (pseudogene) AA567377 AA698035
MLWEFFALFAIAAALFYRWASANNDFFKDRGIAYEKPVLYFGNMAGMFLRKRAMFDIVCD 39306
LYTKGGSK 39330
KFFGIFEQRQPLLMVRDPDLIKQITIKDFDHFINHRNVFATSSDDDPHDMSNLFGSSLFS 32428
MRDARWKDMRSTLSPAFTGSKMRQMFQLMNQVAKEAVDCLKQDDSRVQENELDMKDYCTR 32248
FTNDVIASTAFGLQVNSFKDRENTFYQMGKKLTTFTFLQSMKFMLFFALKGLNKI
LKVELFDRKSTQYFVRLVLDAMKYRQEHNIVRPDMINMLMEARGIIQTEKTKASAVREWS 31842
DRDIVAQCFVFFFAGFETSAVLMCFTAHELMENQDVQQRLYEEVQQVDQDLEGKEL 31674
KYLDQVVSEVLRKWPPAIAFDRECNKE 31593
EAKAVIYYLLKDYRFAPAKKSCIPLELISSGFQLSPKGGFWIKLVQR 31455

AC009741 Drosophila melanogaster chromosome 3 clone BACR44K17 (D976) RPCI-98
QRQPLLMVRDPDLIKQITIKDFDHFINHRNEFDTSSDDDPHDMSNLFSSSLFSMRDARWK 146378
DMRSTLSPAFTGSKMRQMFQLMNQVAKEAVDCLKQDDSRVQENELDMKDYCTRFTNDVIA 146558
STAFGLQVNSFKDRENTFYQMGKKLTTFTFLQNMKFILLFALKSLNK 146699
ILKVEIFDRKSTQYFVRLVLDAMKYRQEHNIGRPDMINML 146887

Cyp6d2 AC004377 DS00837
MWTILLTILIAGLLYRYVKRHYTHWQRLGVDEEPAKIPFGVMDTVMKQER
SLGMALADIYARHEGKIVGIYMLNKRSILIRDAQLARQIMTSDFASFHDR
GVYVDEDKDPLSANLFNLRGASWGSVIFGLEIDSFRNPKNEFREISSSTS
RDESLLLKIHNMSMFICPPIAKLMNRLGYESRILTSLRDMMKRTIEFREE
HNVVRKDMLQLLIRLRNTGKIGEDDDQVWDMETAQEQLKSMSIEKIAAQA
FLFYVAGSESTAAASAFTLYELSMYPELLKEAQEEVDAVLMKHNLKPKDR
FTYEAVQDLKFLDICIMETIRKYPGLPFLNRECTEDYPVPGTNHIIAKGT
PILISLFGMQRDPVYFPNPNGYDPHRFDSNNMNYDQAAYMPFGEGPRHCI
GKALRMGKVNSKVAVAKILANFDLVQSPRKEVEFRFDAAPVLVTKEPLKL
RLTKRK*

AC007440 comp(89081-89665) region also on same fragment Cyp6g1 and one other
MLLIWLLLLTIVTLNFWLRHKYDYFRSRGIPHLPPSSWSPMGNLGQLLFL
RISFGDLFRQLYADPRNGQAKIVGFFIFQTPALMVRDPELIRQVLIKNFN
NFLNRFESADAGDPMGALTLPLAKYHHWKESRQCMSQLFTSGRMRDVMYS
QMLDVASDLEQYLNRKLGDRLERVLPLGRMCQLYTTDVTGNLFYSLNVGG
LRRGRSELITKTK
ELFNTNPRKVLDFMSVFFLPKWTGVLKPKVFTEDYARYMRHLVDDHHEPTKGDLINQLQHFQLSRSSNHYSQHP
DFVASQAGI
ILLAGFETSSALMGFTLYELAKAPDIQERLRSELREAFISTATLSYDTLMTLPYLKMVCLEALRLYPAAAFVNRECTSSASEG
FSLQPHVDFI*VMIIPTYLCQSKF
QFWPEPGVFDPKRFGPERSRHIHPMTYIPFGAGPHGCIGSRLGVLQLKLGIVHILKQYW
VETCERTVSEIRFNPKSFMLESENEIYLRFCRSSL*

AC007441 AC018176 AI403829 chromosome 3 clone BACR10E03 (D690) RPCI-98
50% to 6D1
MIGIYLLIAAVTLLYVYLKWTFSYWDRKGFPSTGVSIPFGALESVTKGK
RSFGMAIYDMYKSTKEPVIGLYLTLRPALLVRDAQLAHDVLVKDFASFHD
RGVYVDEKNDPMSASLFQMEGASWRALRNKLTPSFTSGKLKAMFETSDSV
GDKLVDSIRKQLPANGAKELELKKLMATYAIDIIATTIFGLDVDSFADPN
NEFQIISKKVNRNNIEDIIRGTSSFL
LEKFFVKIGWKQEATERMRELSNRTVDL
REQNNIVRKDLLQLLLQLRNQGKINTDDNIWSAESTKNGVKSMSKDLIAGQLFLFYVAGYETTASTTSFT
LYELTQNPEVMEKAKEDVRSAIEKHGGKLTYDAISDMKYLEACILETARKYPALPLLNRICTKDYPVPDSKLVIQKGTPIIISLIG
MHRDEEYFPDPLAYKPERYLENGKDYTQAAYLPFGEGPRMCIGARMGKVNVKIAIAKVLSNFDLEIRKEKCEIEFGVYGIPLMPKS
GVPVRLSLKK*

AC015208 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered
MELVLLILVASLIGIAFLALQQHYSYWRRMGVREIRPKWIVGNLMGLLNMRMSPAEFISQLY
NHPDAENEPFVGIHVFHKPALLLRDPEMVRNILVKDFAGFSNRYSSSDPKGDPLGSQNIF
FLKNPAWKEVRLKLSPFFTGNRLKQMFPLIEEVGASLDAHLRQQPLHNERMRCFDLEAK
ELCALYTTDVIATVAYGVSANSFTDPKCEFRRHGRSVFEFNLLRAAEFTLVFFLPHLVPF
VRFKVVPAEATRFLRKTINYVMSEREKSGQKRNDLIDILIEFRRSTQLAKASGIKDQF
VFEGDILVAQAVLFFTAGFESSSSTMAFAMYELAKDTDVQQRLREEIKDALVESGGQVTL
KMIESLEFMQMILLEVLRMYPPLPFLDRECTSGRDYSLAPFHKKFVVPKGMPVYIPC
YALHMDPQYFPQPRKFLPERFSPENRKLHTPYTYMPFGLGPHG
CIGERFGYLQAKVGLVNLLRNHMITTSERTPHRMQLDPKAIITQAKGGIHLRLVRDALGV*

AF083946 Cyp6g1 = AC007440, AL065705, AA698945, AI402390 GH21606, AI403823
MVLTEVLFVVVAALVALYTWFQRNHSYWQRkgipYIPPtpiigNTKVVFK
MENSFGMHlSEIYNDPRLKDEAVVGIYSMNKPGLIIRDIELIKSILIKDF
NRFHNRYarcdphgdplgynNLFFVRDAhwkgiRTKLTPVFTSGKVKQMY
TLmQEIGKDLELALQRRGEKNSGSFITEIKEICAQFSTDSIATIAFGIRA
NSLENPNAEFRNYGRKMFTFTVARAKDFFVAFFLPKLVSLMRIQFFTADF
SHFMRSTIGHVMEERERSGLLRNSLIDVLVSLRKEAAAEPSKPHYAKNQD
FLVVSAGVFFTAGFETSSSTMSFALYEMAKHPEMQKRLRDEINEALVEGG
GSLSYEKIQSLEYLAMVVDEVLRMYPVLPFLDREYESVEGQPDLSLKPFY
DYTLENGTPVFIPIYALHHDPKYWTNPSQFDPERFSPANRKNIVAMAYQP
FGSGPHNCIGSRIGLLQSKLGLVSLLKNHSVRNCEATMKDMKFDPKGFVH
QADGGIHLEIVNDRLYDQSAPSLQ

AC008257 AC014226 AL062352 AL054245 AI108091 AI113367 AI064259 AI064268
36% with 6a2 43% identical to 6g1 possible frameshifts after EKDRRKA and GEDV
MLLLLLLGSLTIVFYIWQRRTLSFWERHGVKYIRPFPVVGCTREFLTAKVPFFEQIQKFH
EAPGFENEPFVGVYMTHRPALVIRDLELIKTVMIKKFQYFNNRVLQTDPHNDALGYKNLF
FARSPGWRELRTKISPVFTSGKIKQMYPLMVK*IGKNLQDSAERLGSGTEVQVKDLCSRF
TTDLIATIAFGVEANALQDAKSEFFYHNRAIFSLTLSRGIDFAIIFMIPALASLARVKLF
SRETTKFIRSSVNYVLKEREKDRRKA*RNDLIDILLALKREAAANPGEDV*KEVDLDYLV
AQAAVFQTAGFETSASTMTMTLYELAKNEALQDRLRQEIVDFFGDEDHISYERIQEMPYL
SQVVNETLRKYPIVGYIERECSQPAEGERFTLEPFHNMELPHGMSIYMSTVAVHRDPQYW
PDPEKYDPERFNSSNRDNLNMDAYMPFGVGPRNCIGMRLGLLQSKLGLVHILRNHRFHTC
DKTIKKIEWAPTSPETFSTRRIISRFEAITGPAN*

AC015002 = Dm0590 STS from DS07966-Sp6 in vector ad10sacBII, AL058810
MVYSTNILLAIVTILTGVFIWSRRTYVYWQRRRVKFVQPTHLLGNLSRVLRLEESFALQLRRFY 11344
FDERFRNEPVVGIYLFHQPALLIRDLQLVRTVLVEDFVSFSNRFAKCDGRSDKMGALSLF 11164
LAKQPEWREIRTRLAPAFAGAKLKQMFSLMEE
IGCDLEWYLKRLTRDLRRGDAERGAIVSIKDVCDLYNTDMIASIAFGLRSYSLRNTQSE 9947
IGSHCQDLFRPNVRRIIDLFVIFYLPKLVPLLRPKLFTEPHAEFLRRVIQLVIEERERGG 9767
DLRNDLIEMLLTLKKEADLQQDKSHFTHHRDFLAAQAASFEVAGIETCSASMSFALYELA 9587
KQPLMQSRLRREIREAFASNPNGRLTYEAVARMEFLDMVVEETLRKYPIVPLLERECTPI 9407
NKKRFYSLRPHAECYTRRGMPVFISNLAIHHDPK 9305
YWPDPDRFDPERFSAANKALQAPMSYMPFGAGPRNCIGMQIGLLQIK 8830
LGLVYFLHQHRVEICDRTVERIQFDAKFALLASEQRIYLKVDCL* 8695

AC006469 Cyp9b1 chromosome 2R DS02730 and DS07472 also U34324
MSFVEICLVLATIGLLLFKWSTGTFKAFEGRNLYFEKPYPFLGNMAASAL
QKASFQKQISEFYNRTRHH*KLVGLFNLRTPMIQINDPQLIKKICVKDFD
HFPNHQTLNIPNERLVNDMLNVMRDQHWRNMRSVLTPVFTSAKMRNMFTL
MNESFAQCLEHLKSSQPIAAGENAFELDMKVLCNKLSNDVIATTAFGLKV
NSFDDPENEFHTIGKTLAFSRGLPFLKFMMCLLAPKVFNFFKLTIFDSTN
VEYFVRLVVDAMQYREKHNITRPDMIQLLMEAKKESKDNWTDDEIVAQCF
IFFFAAFENNSNLICTTAYELLRNLDIQERLYEEVKETQEALKGAPLTYD
AAQEMTYMDMVISESLRKWTLSAAADRLCAKDYTLTDDEGTKLFEFKAGD
NINIPICGLHWDERFFPQPQRFDPERFSERRKKDLIPYTYLPFGVGPRSC
IGNRYAVMQAKGMLYNLMLNYKIEASPRTTRDMWESARGFNIIPTTGFWM
QLVSRK*

AC006469 Cyp9b2 DS02730 and DS07472 also U34325, AI402118
MALIEICLALVVIGYLIYKWSTATFKTFEERKLYFEKPYPFVGNMAAAAL
QKSSFQRQLTEFYERTRQH*KLVGFFNMRTPMITLNDPELIKKVCVKDFD
HFPNHQPFITSNDRLFNDMLSVMRDQRWKHMRNTLTPVFTAAKMRNMFTL
MNESFAECLQHLDSSSKTLPGRKGFEVDMKVMCNKLSNDIIATTAFGLKV
NSYDNPKNEFYEIGQSLVFSRGLQFFKFMLSTLVPKLFSLLKLTIFDSAK
VDYFARLVVEAMQYREKHNITRPDMIQLLMEAKNESEDKWTDDEIVAQCF
IFFFAAFENNSNLICTTTYELLYNPDVQERLYEEIVETKKALNGAPLTYD
AVQKMTYMDMVISESLRKWTLAAATDRLCSKDYTLTDDDGTKLFDFKVGD
RINIPISGLHLDDRYFPEPRKFDPDRFSEERKGDMVPYTYLPFGVGPRNC
IGNRYALMQVKGMLFNLLLHYKIEASPRTIKDLWGSASGFNFTPRSGFWM
HLVPRK*

Cyp9b3 Drosophila mettleri AF083945
MDLILLLSIVGLIYFVYKWATARHNEFELRGLPFEKPLPIFGNN
AAVVTGRASFQKSSPSSIARTRQHKMVGFFNFRTPMIQLNDPEIIKKITVKDFEYFP
NHQLFFTTEERLINDMLSVMKDQRWKHMRNTLTPVFTSAKMRSMFSLMNESFAEC
MDHLDQMSKTAVKPGGSFELELKEVCNRLSNDLIATTAFGLKVSSYKKPDNDFYEIGKSI
VFFRGKALYKLFACPTTLPAVFKLLGFKIFDAQKTDFFIRLVVDAMKYREENNIVRPD
MIQLLMEAKKESTEHWSDDELVAQCFIFFFAAFENNASLICTTAYELLNNPDVQQRLY
EEVQETYDALKGEMLTYDAVTKMKYMDLVASESLRKWTLAASTDRECAKDYTLYDD
DASKLFEFKAGDRINIPIVGLHLDDKFFPEPHKFIPERFSDGNKDQIVPYTYLPFGAGPRN
CIGNRYALMQAKAMLYNLVLKYKIERSPKTVKDLLSDSRGFQLTPQSGYWVHLVPRK

Cyp9c1 U34326, AL063862 (BACR007C22), AL058497 (BACR024A05)
MVFVELSIFVAFIGLLLYKWSVYTFGYFSKRGVAHEKPIPLLGNIPWSV
LMGKESYIKHSIDLHLRLKQHKVYGVFN*LRDPLYYLSDPELIRQVGIK
NFDTFTNHRKGITEGFNDTSVISKSLLSLRDRRWKQMRSTLTPTFTSLK
IRQMFELIHFCNVEAVDFVQRQLDAGTSELELKDFFTRYTNDV
IATAAFGIQVNSFKDPNNEFFSIGQRISEFTFWGGLKVMLYILMPKLMK
VKTSPLHSFF
NVDYFKKLVFGAMKYRKEQSIVRPDMIHLLMEAQRQFKAEQEGSAESAAQ
QDKAEFNDDDLLAQCLLFFSAGFETVATCLSFTSYELMMNPEVQEKLLAE
ILAVKEQLGEKPLDYDTLMGMKYLNCVVSESLRKWPPAFIVDRMCGSDFQ
LKDEEGEVVVNLREDDLVHINVGALHHDPDNFPEPEQFRPERFDEEHKHE
IRQFTYLPFGVGQRSCIGNRLALMEVKSLIFQLVLRYHLKPTDRTPA
DMMSSISGFRLLPRELFWCKLESRGPA*

Cyp9f1 Drosophila mettleri AF083947
MLVEFLALSVVVLLLAYRWATANYNFFKERGIPYHKPYPFVGNMGKMLLR
QKSMFDLIVELYNRGDSKVFGIFEQRKPLLMIRDPELVKQITIKDFDHFI
NHRNIFGVDNNDPHDMDNLFGSSLFSMRDARWKDMRRPLSPAFTGSKMRQ
MFQLMDIVANEAVECLKRDDIPENGIELDMKDYCTRFTNDVIASTAFGLQ
VNSFKDRENQFYMMGKKLTPLQPLTNLKFLLFTSAQKIFKALKISLFDRQ
STNYFVRLVLDAMKYRQENNIIRPDMINMLLEARGLINSDKLKSSVVRDW
SDRDIVAQCFVFFFAGFETSAVLMCFTAQELLENEDVQEKLYEEVAQVDS
DLQGGQLTYEAIMGMKYLDQVVSEVLRKWPAAIAVDRECNKDITYEVDGK
SVQIKKGEAVWLPTCGFHRDPKYFENPNKFDPDRFSEENKDKIQPFTYYP
FGVGPRNCIGSRFALLEAKAVIYYLLREFRLVPAKKTCIPLVLSSSGFQL
APKTGFWVKLIPRK

Cyp9f2 AA735946 78% IDENTICAL TO Cyp9f1 This appears to be 
derived from a full length version of the pseudogene sequence
DRENTFYQMGKKLTTFTFLQNMKFILLFALKSLN
KILKVEIFDRKSTQYFVRLVLDAMKYRQEHNIVRPDMINMLMEARGIIQT
EKTKASAVREWSDRSIVAQCFAFFFAGFETSAVLMCFTAHELMENQDVQQ
RLYEEVQQVDQDLEGKELTYEAIMGMKYLDQVVSEVLRKWPPAIAFDREC
NKDITFDVDGQKVEVKKGDVIWLPTCGFHRDPK

AC017170 Drosophila melanogaster, AI259899 and parts of AC007594 
N-terminal (39127-39333) and comp(32169-32604) up to DRENTFY

MLWEFFALFAIAAALFYRWASANNDFFKDRGIAYEKPVLYFGNMAGMFLRKRAMFDIVCD
LYTKGGSKKFFGIFEQRQPLLMVRDPDLIKQITIKDFDHFINHRNVFATSSDDDPHDMSN
LFGSSLFSMRDARWKDMRSTLSPAFTGSKMRQMFQLMNQVAKEAVDCLKQDDSRVQENEL
DMKDYCTRFTNDVIASTAFGLQVNSFKDRENTFYQMGKKLTTFTFLQSMKFMLFFALKGLNK
ILKVELFDRKSTQYFVRLVLDAMKYRQEHNIVRPDMINMLMEARGIIQTEKTKASAVRE
WSDRDIVAQCFVFFFAGFETSAVLMCFTAHELMENQDVQQRLYEEVQQVDQDLEGKELTY
EAIMGMKYLDQVVNEVLRKWPLAIAVDRECNKDITFDVDGQKVEVKKGDVIWLPTCGFHR
DPKYFENPMKFDPERFSDENKESIQPFTYFPFGLGQRNCIGSRFALLEAKAVIYYLLKDYRFAPAK
KSCIPLELITSGFQLSPKGGFWIKLVQR

9f pseudogene AI113499 AL105104 AC017240 AC007594 comp(31455-32006)

NSFKDRENTFYQMGKKLTTFTFLQNMKFILLFALKSLNK
ILKVELFDRKSTQYFVRLVLDAMKYRQEHNIVRPDMINMLMEARGIIQTEKTKASAVRE
WSDRDIVAQCFVFFFAGFETSAVLMCFTAHELMENQDVQQRLYEEVQQVDQDLEGKELTY
EAIMGMKYLDQVVSEVLRKWPAAIAFDRECNK
EAKAVIYYLLKDYRFAPAKKSCIPLELISSGFQLSPKGGFWIKLVQR

There are at least three 9f genes.  One is a pseudogene.
The pseudogene has an EST AI113499 and two genomic sequences AL105014 and AC017240
One of the other two sequences is represented by the EST AA735946 that is nearly identical to 
the pseudogene except it does not have the two deletions seen in the pseudogene.  The other 
sequence is represented by EST AI259899 and genomic sequence AC017170.  The genomic sequence 
AC007594 has three gene fragments.  The first comp(31455-32607) and the last (39127-39333) seem 
to belong together.  The middle sequence (37628-38320) matches AC017170 and probably is the same 
gene.  The first sequence may to be chimeric since the first part of this sequence up to ILKVEL 
matches AC017170 and AI259899, but the second part matches the pseudogene and does not match 
AC017170 and AI259899.  
For an alignment of 9f related Drosophila sequences see 9f alignment


AC005450 Cyp9h1 comp(17940-19601) = AC007453 comp(140861-142543) = AC005472
MDQSMIALALFIILLVLLYKWSVAKYDVFSERGVSHEKPWPLIGNIPLKAMIGGMP
VLKKMIELHTK
HTGSPVYGIYALRDAVFFVRDPELIKLIGIKEFDHFVNHNSMHNNIQESILSKS
LISLRDGRWKEMRNILTPAFTGSKMRIMYDLIQSCSEEGVIHIQEQLELSQDASIELE
MK
DYFTRFANDVIATVAFGISINSFRRKDNEFFRIGQAMSRISAWSVVKAMLYALFPRL
MK
VLRIQVLDTKNIDYFSSL
VTAAMRYRQEHKVVRPDMIHLLMEAKQQRLADLSDKSKDELYYSEFTADDLLAQC
LLFFF
AGFEIISSSLCFLTHELCLNPTVQDRLYEEIISVHEELKGQPLTYDKLTKMKYLDMV
VLE
ALRKWPPSISTDRECRQDIDLFDENGQKLFSARKGDVLQIPIFSLHHDPENFEDPEF
FNP
ERFADGHALESRVYMPFGVGPRNCIGNRMALMELKSIVYQLLLNFKLLPAKRTSR
DLL
NDIRGHGLKPKNGFWLKFEARQ*

cyp12a4 AC006091 Drosophila melanogaster chromosome 3 clone BACR48G05 (D475)
MLKVRSALSLIQSQKATLSLATQK*
RWQTNVATAEAREDSEWLQAKPFEQIPRLNMWALSMKMSMPGGKYKNMELME
MFEAMRQDY
GDIFFMPGIMGNPPFLSTHNPQDFEVVFRNEGVWPNRPGNYTLLYHREEYRKDFY
QGVMG
VIPTQGKPWGDFRTVVNPVLMQPKNVRLYYKKMSQVNQEF
ILELRDPDTLEAPDDFIDTINRWTLESVSVVALDKQLGLLKNSNKESEALKLFHYL
DEFFIVSIDLEMKPSPWRYIKTPKLKRLMRALDGIQEVTLAYVDEAIERLDKEAKEG
VVR
PENEQSVLEKLLKVDRKVATVMAMDMLMAGVDT
TSSTFTALLLCLAKNPEKQARLREEVMKVLPNKNSEFTEASMKNVPYLRACIKESQ
RLHP
LIVGNARVLARDAVLSGYRVPAGTYVNIVPLNALTRDEYFPQASEFLPERWLRSPK
DSE
SKCPANELKSTNPFVFLPFGFGPRMCVGKRIVEMELELGTARLIRNFNVEFNYPTE
NAFR
SALINLPNIPLKFKFIDLPN*

Cyp12a5 AC006091 Drosophila melanogaster chromosome 3 clone BACR48G05 (D475)
MLKGRIALNILQSQKPIVFSASQQ*RWQTNVPTAEIRNDPEWLQAKPFEE
IPKANILSLFAKSALPGGKYKNLEMMEMIDALRQDYGNIIFLPGMMGRDG
LVMTHNPKDFEVVFRNEGVWPFRPGSDILRYHRTVYRKDFFDGVQGIIPS
QGKSWGDFRSIVNPVLMQPKNVRLYFKKMSQVNQEFIKEIRDASTQEVPG
NFLETINRWTLESVSVVALDKQLGLLRESGKNSEATKLFKYLDEFFLHSA
DLEMKPSLWRYFKTPLLKKMLRTMDSVQEVTLKYVDEAIERLEKEAKEGV
VRPEHEQSVLEKLLKVDKKVATVMAMDMLMAGVDTTSSTFTALLLCLAKN
PEKQARLREEVMKVLPNKDSEFTEASMKNVPYLRACIKESQRVYPLVIGN
ARGLTRDSVISGYRVPAGTIVSMIPINSLYSEEYFPKPTEFLPERWLRNA
SDSAGKCPANDLKTKNPFVFLPFGFGPRMCVGKRIVEMELELGTARLIRN
FNVEFNHSTKNAFRSALINLPNIPLKFKFKFTDVPN*

AC012807 Drosophila melanogaster, AC009385 AA801503
MLRLTVKHGLRANSQLAATRNPDASSYVQQL
ESEWEGAKPFTELPGPTRWQLFRGFQKGGEYHQLGMDDVMRLYKKQFGDICLIPGLFGM 53300
PSTVFTFNVETFEKVYRTEGQWPV 53228
RGGAEPVIHYRNKRKDEFFKNCMGLFGN
GAEWGKNRSAVNPVLMQHRNVAIYLKPMQRVNRQFVNRIREIRDKESQEVPGDFMNTINH 52788
LTFESVATVALDRELGLLREANPPPEASKLFKNIEVLMDSFFDLGVRPSLYRYIPTPTYK 52608
KFSRAMDEIFDTCSMYVNQAIERIDRKSSQGDSNDHKSVLEQLLQIDRKLAVVMAMDM 52434
LMGGVDTTSTAISGILLNLAKNPEKQQRLREEVLSKLTSLHSEFTVEDMKSLPYLRAVIK 52254
ESLRLYPVTFGNARSAGADVVLDGYRIPKGTKLLMTNSFLLKDDRLYPRAKEFIPERWLR 52074
RKDDDKSDVLMNKDLNAFIYLPFGFGPRMCVGKRIVDLEMELTVANLVRNFHIEYNYS 51900
TEKPYKCRFLYKPNIPLKFKFTDLKY* 51819

AL063519 T7 end of BAC BACR07K20 STS G01307 is exact match
AC006496 AC012699
MAVILLLALALVLGCYCALHRHKLADIYLRPLLKNTLLEDFYHAELIQPEAPKRRRRGI
WDIPGPKRIPFLGTKWIFLLFFRRYKMTKLHE
YGDIVLEVMPSNVPIVHLYNRDDLEKVLKYPSKYPFRPPTEIIVMYRQSRPDR
YASVGIVNEQGPMWQRLRSSLTSSITSPRVLQNFLPALNAVCDDFIELLRARRDPDTLV
VPNFEELANLMGLEAVCTLMLGRRMGFLAIDTKQPQKISQLAAAVKQLFISQRDSYYGLG
LWKYFPTKTYRDFARAEDLIYE
Orf with I helix
SQSSVISEIIDHELEELKKSAACEDDEAAGLRSIFLNI
LELKDLDIRDKKSAIIDFIAAGIET
Orf with EXXR motif
LANTLLFVLSSVTGDPGAMPRILSEFCEYRDTNILQDALTNA
TYTKACIQESYRLRPTAFCLARILEEDMELSGYSLNAG
Orf with PERW and heme motifs
TVVLCQNMIACHKDSNFQGAKQFTPERWIDPATENFTVNVDNASIVV
PFGVGRRSCPGKRFVEMEVVLLLAK

AC008187 Drosophila melanogaster chromosome 2 clone BACR13D20 (D604) RPCI-98
A cyp12 family member 45% to 12a5
LQLITKRNRMNTLSSARSVAIYVGPVRSSRSASVLAHEQAKSS*ITEEH
KTYDEIPRPNKFKFMRAFMPGGEFQNASITEYTSAMRKRYGDIYVMPGMFGRKDWVTTF 85761
NTKDIEMVFRNEGIWPRRDGLDSIVYFREHVRPDVYGEVQGLVAS
QNEAWGKLRSAINPIFMQPRGLRMYYEPLSNINNEF 85452
IKEIRDPKTLEVPEDFTDEISRLVFESLGLVAFDRQMGLIRKNRDNSDALTLFQTSRDIF 85211
RLTFKLDIQPSMWKIISTPTYRKMKRTLNDSLNVAQKMLKENQDALEKRRQAGEKINS 85037
NSMLERLMEIDPKVAVIMSLDILFAGVDATATLLSAVLLCLSKHPDKQAKLREELLSIMP 84857
TKDSLLNEENMKDMPYLRAVIKETLRYYPNGLGTMRTCQNDVILSGYRVPKGTTVLLGS 84680
NVLMKEATYYPRPDEFLPERWLRDPETGKKMQVSPFTFLPFGFGPRMCIGKR 84524
VVDLEMETTVAKLIRNFHVEFNRDASRPFKTMFVMEPAITFPFKFTDIEQ* 84371

AL063519 BACR07K20 STS G01307 is exact match
MXILXLNRQYGDIVLEVMPSNVPIVHLYNRDDLEKVLKYPSKYPFRPPTEIIVMYRQSRPDR
YASVGIVNEQGPMWQRLRSSLTSSITSPRVLQNFLPALNAVCDDFIELLRARRDPDTLV
VPNFEELANLMGLEAVCTLMLGRRMGFLAIDTKQPQKISQLAAAVKQLFISQRDSYYGLG
LWKYFPTKTYRDFARAEDLIYE*VHR

AL074984 note this sequence has the end of an early exon at its beginning
PWLXXXDRPMXLISRNRDDPDALT
LNIQPSMWRXISTPNFRMMMRLLDDILMFSQKMIKDTEDSVEKRRQ*
LQLIPKRNRMNTLSSARSVAIYVGPVRSXXXASVLAHEQAKSS

AC007398 AC007418 mito clan best matches = 37-38% to 12a sequences
MQRLRTGESSNPKKLNV*
SQQPVTSVATTRTTASSLPA
ETTSSPAAAVRPYSEVPGPYPLPLIGNSWRFAPLIG*
TYKISDLDKVMNELHVNYGKMAKVGGLIGHPDLLFVFDG
DEIRNI*
THFLFAMELRPSMPSLRHYKGDLRRDFFGDVAGLIGV*
HGPKWEAFRQEVQHILLQPQTAKKYIPPLNDIASEFMGR*
IELMRDEKDELPANFLHELYKWALE*
SVGRVSLDTRLGCLSPEGSEEAQQIIEAINTFFWAVPELELRMPLWRIY
PTKAYRSFVKALDQFT*EITLHPRICMKNIGKTM
DKADADEARGLSKSEADISIVERIVRKTGNRKLAAILALDLFLVGVDT
TSVAASSTIYQLAKNPDKQKKLFDELQKVFPHREADINQNVLEQM
PYLRACVKETLRMRPVVIANGRSLQSDAVINGYHVPKGVSFREMVTIW*
DPAYFPEPKRFLPERWLKQSTDXXX
SAGCPHANQKIHPFVSLPFGFGRRMCVGRRFAEIELHTLLAKV
GFNHVLHLALPELPPFNLPYLIPGTDLPQIQGLVQFRRVC
VPCELHVHSAVASEFQTNAEGRVSCLHQREPEAHHGAQSCSYTRRMCDIY
SQSSAAVAMLHFN*

AC007356 Drosophila melanogaster chromosome 2 clone BACR24H09 (D595) RPCI-98
MNNLSLKAWRSTVSCGPNLRQCVPRISGA
GSRRAQCRESSTGVATCPHLADSEEASAPRIHSTSEWQNALPYNQIPGPK
PIPILGNTWR
LMPIIGQYTISDVAKISSLLHDRYGRIVRFGGLIGRPDLLFIYDADEIEK
CYRSEGPTPFRPSMPSLVKYKSVVRKDFFGDLGGVVG
RHGEPWREFRSRVQKPVLQLSTIRRYLQPLE
VITEDFLVRCENLLDENQELPEDFDNEIHKWSLE
GIGRV
ALDTRLGCLESNLKPDSEPQQIIDAAKYALRNVATLELKAPYWRYFPTPLWTRYVK
NMNFFV
SVCMKYIQSATERLKTQDPSLRAGEPSLVEKVILSQK
DEKIATIMALDLILVGIDTVSMAVCSM
LYQLATRPVDQQKVHEELKRLLPDPNTPLTIPLLDQMHHLKGFIKEVFRMYSTVIG
NG
RTLMEDSVICGYQVPKGVQAVFPTIVTGNMEEYVTDAATFRPERWLKPQHGGTPG
KLHPF
ASLPYGYGARMCLGRRFADLEMQILLAKLLRNYKLEYNHKPLDYAVTFMYAPDG
PLRFKMTRV*

AC017275 = AC007356 in ordered
36160 SMAVCSMLYQLATRPVDQQKVHEELKRLLPDPNTPLTIPLLDQMHHLKGFIKEVFRMYST 35981
35980 VIGNGRTLMEDSVICGYQVPKGVQAVFPTIVTGNMEEYVTDAATFRPERWLKPQHGGTPG 35801
35800 KLHPFASLPYGYGARMCLGRRFADLEMQILLAKLLRNYKLEYNHKPLD 35657

Cyp12b1 U78485 Drosophila acanthoptera
MWKFAIHSQQPFCWQQLCNRRHLYVGNVQQQTHLELLDAAPTRSDDEWLQAKPY
EKVPGPGTWQVLSYFLPG
GKQYNTNLIQMNRRMREWYGDIYRFPGLMGKQDVIFTYNPNDFELTYRNEGVWP
IRIGLESFTYYRKVHRPE
VFGSIGGLVSEQGKDWAHIRNKVNPVQMRVQNVRQNLPQIDQISREFVDKLDTLR
DPVTHILNDNFHEQLKM
WAFESISFVALNTRMGLLSDRPDPNAARLAEHMTDFFNYSFKYDVQPSIWPYYKT
PGFKKFLQTYDKITEIT
TAYIDEAIKRFEIEKDSGNECVLQQLLSLNKKVAVVMAMYMLMAGIDTTSSAFVTIL
YHLARNPHKQRQLHR
ERRRILPDSDEPLTPENTKNMPYLRACIKECMRITSITPGNFRIATKDLVLSGYRVPR
GEGVLMGVLELSNS
EKYFGQSGQFMPERWLKADTDPDVKACPAARSRNPFVYLAFGFGPRTCIGKRIAE
LEMETLLTRLLRRYQVS
WLAEMPLQYESNIILSPHGIYVQVRAAC

cyp12b2 AC018326 <7400-9141 AC004657 AC004345
MWKYSNKIIYRNVSGNQLWFNRNSSVGGTLSQQVRSWQKEQELLKSRNLFTNNGYICSQT
QLELADSRIDEKWQQARSFGEIPGPSLLRMLSFFMPGGKLF*NTNLIQMNRLMREMYGDI
YCIPGMMGKPNAVFTYNPDDFEMTYRNEGVWPIRIGLESLNYYRKIHRPDVFKGVGGLAS
E*QGQEWADIRNKVNPVLMKVQNVRQNLPQLDQISKEFIDK*LETQRNPETHTLTTDFHN
QLKMWAFESISFVALNTRMGLLSDNPDPNADRLAKHMRDFFNYSFQFDVQPSIWTFYKTA
GFKKFLKTYDNITDITSNYIETAMRGFGKNDDGKTKCVLEQLLEHNKKVAVTMVMDMLMA
GIDT*TSSACLTILYHLARNPSKQEKLRRELLRILPTTKDSLTDQNTKNMPYLRACIKEG
LRITSITPGNFRITPKDLVLSGYQVPRGTGVLMGVLELSNDDKYFAQSSEFIPERWLKSD
LAPDIQACPAARTRNPFVYLPFGFGPRTCIGKRIAELEIETLLVRLLRSYKVSWLPETPI
EYESTIILSPCGDIRFKLEPVGDLM*

Cyp18 U44753 AL062343 AI294030 (3 diffs with cyp18)
MLADSYLIKFVLRQLQVQEDGDAQHLLMVFLGLLALVTLLQWLVRNYREL
RKLPPGPWGLPVIGYLLFMGSEKHTRFMELAKQYGSLFSTRLGSQLTVVM
SDYKMIRECFRREEFTGRPDTPFMQTLNGYGIINSTGKLWKDQRRFLHDK
LRQFGMTYMGNGKQQMQKRIMTEVHEFIGHLHASDGQPVDMSPVISVAVS
NVICSLMMSTRFSIDDPKFRRFNFLIEEGMRLFGEIHTVDYIPTMQCFPS
ISTAKNKIAQNRAEMQRFYQDVIDDHKRSFDPNNIRDLVDFYLCEIEKAK
AEGTDAELFDGKNHEEQLVQVIIDLFSAGMETIKTTLLWINVFMLRNPKE
MRRVQDELDQVVGRHRLPTIEDLQYLPITESTILESMRRSSIVPLATTHS
PTRDVELNGYTIPAGSHVIPLINSVHMDPNLWEKPEEFRPSRFIDTEGKV
RKPEYFIPFGVGRRMCLGDVLARMELFLFFASFMHCFDIALPEGQPLPSL
KGNVGATITPESFKVCLKRSPLGPTAADPHHMRNVGAN

AI294030 3 DIFFS WITH CYP18
MLADSYLIKFVLRQLQVQQDGDAQHLLMGFLGLLALDT

AL067521 similar to CYP18 I-helix to K-helix region
LLWXSVFHCXIRGCCARVXXALARVVGRLRLPXFXXXXXLPLPASPFLASXRRSXXVPLAPTPSPXR 515

AL077732.1|CNS00KLP similar to CYP18 C-terminal
IDTEGKXATXSTSYPSXXAXXXSXAXFWRGVELFLFFASFIHCFXIXLPXGQPLPSLKGN 183
VGXTITPESFNVCLXXRPLWPTSADPHHMRNVGAN 288

AC012164 293-399 27006-27380
RGLGKEELAGHATTLLLEGYETSAMLLAFALYELALNEDAQRHAGNLI 27185
DPGALGELRYSEAALLEALRLHPAMQALQKRCTKTFTLPDQKSGASSELKVHLGTELVLP 27365
VHAIH 27380
DSALYPAPNQFRPERF 27492

AC012164 477-537 109248-109066 not on AC015216 or AC012373
FLFFAPLKAWFGIGPPEGLPLARFKGKVGAPLPPGLVKVCPKGPPLGAPSPHSPPMGKVGA 109066

AC015216 Drosophila melanogaster, in ordered fragments
MSADIVDIGHTGWMPSVQSLSILLVPGALVLVILYLCERQCNDLMGAP
PPGPWGLPFLGYLPFLDARAPHKSLQKLAKRYGGIFELKMGRVPTVVLSDAALVRDFFRR 49011
DVMTGRAPLYLTH
GIICAQEDIWRHARRETIDWLKALGMTRRPGELRARLERRIARGVDECV 49301
VNPLPALHHSLGNIINDLVFGITYKRDDPDWLYLQRLQEEGVKLIGVSGVVNFLPWLRHL 49902
PANVRNIRFLLEGKAKTHAIYDRIVEACGQRLKEKQKVFKELQEQKRLQRQLEKEQLRQS 50082
KEADPSQEQSEADEDDEESDEEDTYEPECILEHFLAVRDTDSQLYCDDQLRHLLAD 50250
LFGAGVDTSLATLRWFLLYLAREQRCQRRLHELLLPLGPSPTLEELEPLAYLRACIS 50421
ETMRIRSVVPLGIPHGCKENFVVGDYFIKGGSMIVCSEWAIHMDPVAFPEPEEFRPERFL 50601
TADGAYQAPPQFIPFSSGYRMCPGEEMARMILTLFTGRILRRFHLELPSGTEVDMAGES 50778
GITLTPTPHMLRFTKLPAVEMRHAPDGAVVQD 50877

AC015216 Assembled sequence 72624-74273 also = AC012164 and GSS AL098201
MLPLVLFILLAATLLFWKWQGNHWRRLGLEAPFGWPLVGNMLDFALGRRSYGEIYQEIYT*
RNPGLKYVGFYRLFNEPAILVRDQELLRQILVGRNFADCADNAVYVDHQRDVLASHNPFIANGDRWRVLRADLVP
LFTPSRVRQTLPHVARACQLLRDQVPLGRFEAKDLATRYTLQVVASAIFGLDAHCLGIHMRVAHEPSRWLEWLAPLFQPS
VWSLLETMSLLHTPRLGRLIGHR*YVPLPLQHWFRELVEARSGGDNLLQWLAESKRGLGKEELAGHATTLLLEGYETSA
MLLAFALYELALNEDAQRRLHIELDEVAQRHAGNLI DPVALGELRYSEAALLEALRLHPAMQALQKRCTKTFTLPDQKSGASSELKVHLGTVLVLPVQAIH
LDPALYPAPNQFXPERFLNQPPMGCRFLGFGAGPRMCPGMRLGLLQTKAALTTLLQDHCVQLADEDQCRVEVSPLTFLTASRNGIWLSFKRRTRRY*

AC014292 AC009393 AC008359 AC014072 AI257867
MITETLLTICAAVFLCLSYRYAVGRPSGFPPGPPKIPLFGSYLFMLIINF
KYLHKAALTLSRWYKSDIIGLHVGPFPVAVVHSADGVREILNNQVFDGRP
QLFVAAMRDPGQDVRGIFFQDGPLWKEQRRFILRYLRDFGFGRRFDQLEL
VIQEQLNDMLDLIRNGPKYPHEHEMVKSGGYRVLLPLLFNPFSANAHFYI
VYNECLSREEMGKLVKLCQMGIQFQRNADD
YGKMLSIIPWIRHIWPEWSGYNKLNESNLFVRQFFADFVDKYLDSYEEGVE
RNFMDVYIAEMRRGPGYGFNRDQLIMGLVDFSFPAFTAIGVQLSLLVQYLMLYPAVLRRVQNEIDEVVGCGRLPNLEDRKNLPFTE
ATIREGLRIETLVPSDVPHKALEDTELLGYRIPKDTIVVPSLYAFHSDARIWSDPEQFRPERFLDADGKLCLKLDVSLPFGAGKRL
CAGETFARNMLFLVTATMCQHFDFVLGPNDRLPDLSQNLNGLIISPPDFWLQLQDRH*

Cyp28a1 U89746 Drosophila mettleri
MLEITLILVLLLLGLFYVFMTWNFGYWRKRKVPGPKPHCFTGNY
PHMYNMKRHSVYDLNDIYSEYEHKFDAVGIYGARSPQLLVISPQVARRVFVSDFR
HFH
DNEISLMVDEKSDFIMANNPFSQIGDEWKQRRADITPGLTMGRIKTVYPVTQEVCQ
KM
TDWLRKQIRLPPSGGIDAKDMSLRFTSEMVTDCVLGLKAESFSDKPTPIMGYIKDL
FA
QSWTFIIYFVLVSTLPALRHVFKLRFVPLRIENFFVNLMQTAIDARRQQLAAGKQFE
R
VDFLDYILHLGKKKNLDTRHLTAHTMTFLLDGFETTALYDSCALVLSRDQEAQQKL
RE
ELEAHLDDKGIIDFEKLNELPFLDACVQESLRIFPPAFMSNKLCTEPIELPNKTGENF
TVERGTTVVVPHYCFMMDEEFFPDPQAFKPERFMQPDAAKMYREQGVFMAFGDG
PRVC
IGMRFALTQIKGALVELLTKFIIRVNPKTRSDNEYDPTTFIGTCKGGIWLDFELRQ

Cyp28a2 U89747 Drosophila mettleri
MLVTLILLGLVVFLGYKFLIWNYDYWRKRKVPGPKPALFTGNYP
HLFTGKQHPVYAVNEIYRKYKNDYDAVGIYISRMPQLLIVNPDLAHRVFVSNFKN
FHD
NEISALVVEKSDYIFANNIFSMTGDAWKERRSDITPGLTISRIKSVYPVTNQVCKKM
T
EYIKKQIRIAPKDGLNGKDLSLCFTTEMVTDCVLGLGAQSFTDNPTPVMAKMRNL
FRQ
DLPFLINTIAMALFPPLRRIIRLRFLSKTIVEFFVRFMETPLEERQKHISAGANINRV
DMLGYIIQLSPKRNMDSLKITACTMSFLLDGDDTPPSLLSNTLLLLGRNPQGHQRL
RE
ELSEHLCDQGFIDFDKLVDLPYLNACVHESIRIFLTAVSSKLCTESIELSNRNGPNFT
VEKGTVVLVPITCFMYDDDHFPNANEYNPERFLKPDSIKKYRDQGLFLGFGDGPRI
CIGMRFGLAQAKAALVEILVNFDVSVNARTRKDNLYDPKNLLSTLEGGIWLDFAARS

AI403094 AI403665 AA698711 AA697564 50% to 28A2 AC008324 AI062825
MCPISTALFVIAAILALIYVFLTWNFSYWKKRGIPTAKSWPFVGSFPSVF
TQKRNVVYDIDEIYEQYKNTDSIVGVFQTRIPQLMVTTPEYAHKIYVSDF
RSFHDNEMAKFTDSKTDPILANNPFVLTGEAWKERRAEVTPGLSANRVKA
AYPVSLRVCKKFVEYIRRQSLMAPAQGLNAKDLCLCYTTEVISDCVLGIS
AQSFTDNPTPMVGMTKRVFEQSFGFIFYTVVANLWPPITKFYSVSLFAKDVAAFFYDL
MQKCIQVRRESPAAQQRDDFLNYMLQLQEKKGLNAAELTSHTMTFLTDGFETTAQVL
THTLLFLARNPKEQMKLREEIGTAELTFEQISELPFTEACIH
ETLRIFSPVLAARKVVTEPCELTNKNGVSVKLRPGDVVIIPVNALHHDPQYYEEPQSFKP
ERFLNINGGAKKYRDQGLFFGFGDGPRICPGMRFSLTQIKAALVEIVRNFDIKVNPKTRK
DNEIDDTYFMPALKGGVWLDFVERN*

AC008324 = AL070820 AC017780
114532 MCPVTTFLVLVLTLLVLVYVFLTWNFNYWRKRGIKTAPTWPFVGSFPSIFTRKRNIAYDI 114353
114352 DDIYEKYKDTDNMVGVFTTRVPQLLVMCPEYIHKIYA
29560 TDFRSFHNNEWRNFVST 29613
29666 KTDMILGNNPFVLTGDEWKERRSEIMPALSPNRVKAVYP 29845
29846 VSQSVCKKFVEYIRRQQQMATSEGLDAMDLSLCYTTEVVSDCGLGVSAQSFTDTPTPLLK 30025
30026 MIKRVFNTSFEFIFYSVVTNLWQKVRKFYSVPFFNKETEVFFLDIIRRCITLRLEKPEQ 30202
30203 QRDDFLNYMLQLQEKKGLHTDNILINTMTFILDGFETTALVLAHIMLMLGRNPEEQDKVR 30382
30383 KEIGSADLTFDQMSELPHLDAC 30454
30519 ILETLRLFSPQVAARKLVTEPFEFANKNGRTVHLKPGDVVTIPVKALHHDPQYYEDPLTF 30698
30699 KPERFLESNGGGMKSYRDRGVYLAFGDGPRHCPG 30800
30860 MRFALTQLKAALVEILRNFEIKVNPKTRSDNQIDDTFFMATLKGGIYLDFKDL* 31021

Cyp28a3 U91565 Drosophila nigrospiracula

MRFALIQIKAAVVEVITKFNVRVNPKTRKDNEYEPTAFITSLKGGIWLDFESRP

Cyp28a4 U91565 Drosophila hydei

MRFAMTQIKGALVEVLTKFNVRVNPKTRTDNEYEPTRFITTLKGGIWLDFEPRQ

Cyp28a5 DS00180_1 : contig 1 (85065 bases) of 1 for P1 d29 (AC001660)
MVLITLTLVSLVVGLLYAVLVWNYDYWRKRGVPGPKPKLLCGNYPNMFTMKRH
AIYDLDDI
YRQYKNKYDAVGIFGSRSPQLLVINPALARRVFVSNFKNFHDNEIAKNIDEKTDFI
FANNP
FSLTGEKWKTRRADVTPGLTMGRQIKTVYPVTNKVCQKLTEWVEKQLRLGSKDG
IDAKQMS
LCFTTEMVTDCVLGLGAESFSDKPTPIMSKINDLFNQPWTFVLFFILTSSFPSLSHLI
KLR
FVPVDVERFFVDLMGSAVETRRAQLAAGKQFERSDFLDYILQLGEKRNLDNRQLLA
YSMTF
LLDGFETTATVLAHILLNLGRNKEAQNLLREEIRSHLQDGTIAFEKLSDLPYLDACV
QETI
RLFPPGFMSNKLCTESIEIPNKEGPNFVVEKGTTVVVPHYCFMLDEEFFPNPQSFQ
PERFL
EPDAAKTFRERGVFMGFGDGPRVCIGMRFATVQIKAAIVELISKFNVKINDKTRKD
NDYEP
GQIITGLRGGIWLDLEKL*

AC014191 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered
MFGSLLLGIATLLGAIYAFLVSNFGHWRRRGVTEPRALPLFGSFPNMIWPRQHFTMDMRDIY 9811
RHYRNTHSYVGCYLLRAPKLLVLEPRLVYEIYVSAFSHFENN 9631
DASKMVDIAKDRLVALNPFVLEGEEWRHQRAVFSTLLTNGRIRTTHAIMQRVCLDLCQFI 9451
AIKSAGGKDLDCID
LGLRFTGESLFDCVLGIQARTFTDNPLPVVRQNHEMSAENRGLAIAGAVHGLFPNLPRWL 9171
RPKVFPRSHDRFYGQMISEALRLRRSKHQERNDFINHLLEMQRELDLSEEDMASHAMT 8997
FMFDGLDTTSNSIAHCLLLLGRNPDCQRRLYEELQLVNPGGY 8817
LPDLDALIDLPYLSACFN 8763
*ESLRIYPAGGWASKTCTKEYELRGS 8629
HHSEPLKLRPGDHVMVPIYALHNDPDLYPEPDVFRPERFLDGGLKNCKQQG 8476
IFLGFGNGPRQCVGMRLGLAMAKAALAAIVQRFEVVVSPRTLNGTELDPLIFVGVHKGGI 8296
WLQFVPRKNVTTK* 8254

CYP4AA1 AC004516 68725-68267 AC004426 AC005556 comp(849-1502)
MHLRLLSPPQLERTTNLELCSILILLVISLSIYTFYATLNTYLRSVLLSLRLTGPPSLPF
L
GNCMLVTDKDCK*YGSLVRIWVLLFPFFAVLEPEDLQVILSSKKHTNKVFFYRLM
HNFLGD
GLITSSGSKWSNHRRLIQPAFHHNLLEKFIDTFVDASQSLYENLDAEAVGTEINIAK
YVNN
CVLDILN*EAVLGVPIKKRGQDVAMMEDSPFRQGKIMMPARFTQPWLLLDGIYHW
TKMAND
ELNQKKRLNDFTRKMIQRRRQIQNNNNGNSERKCLLDHMIEISESNRDFTEEDIVN
EACTF
MLAGQDSVGAAVAFTLFLLTQNPECQDRCVLELATIFEDSNRAPTMTDLHEMRYM
EMCIKE
ALRLYPSVPLIARKLGEEVRLAKHTLPAGSNVFICPYATHRLAHIYPDPEKFQPERF
SPEN
SENRHPYAFLPFSAGPRYCIGNRFAIMEIKTIVSRLLRSYQLLPVTGKTTIAATFRITL
RA
SGGLWVRLKERDHPLIAH*

L49408 DS02740, AI405120
MFYTVIWIFCATLLAILFGGVRKPKRFPPGPAWYPIVGSALQVSQLRCRLGMFCKV
IDVFA
RQYVNPYGFYGLKIGKDKVVIAYTNDAISEMMTNEDIDGRPDGIFYRLRTFNSRLG
VLLTD
GEMWVEQRRFILRHLKNFGFARSGMMDIVHNEATCLLQDLKDKVLKSGGKQTRI
EMHDLTS
VYVLNTLWCMLSGRRYEPGSPEITQLLETFFELFKNIDMVGALFSHFPLLRFIAPNF
SGYN
GFVESHRSLYTFMSKEIELHRLTYKNYDEPRDLMDSYLRAQDEGNDEKGMFSDQS
LLAICL
DMFLAGSETTNKSLGFCFMHLVLQPEIQERAFQEIKEVVGLERIPEWSRDRTKLPY
CEAIT
LEAVRMFMLHTFGIPHRAVCDTRLSGYEIPKDTMVIACFRGMLINPVDFPDPESFN
PDRYL
FDGHLKLPEAFNPFGFGRHRCMGDLLGRQNLFMFTTTVLQNFKMVAIPGQVPEEV
PLEGAT
AAVKPYDIMLVAREQ*

AC003055 (P1 DS06332) second gene complete 5801-7437
MFTLVGLCLTIVHVAFAVVYFYLTWYHKYWDKRGVVTAEPLTILGSYPGILINKSR
SLILDVQDVYSKYKDKYRTVGTFITRQPQLLVLDPALAHEILVDKFSHFRDTITSSFVGHNP
DDKYVAGSPFFSAGDKWKRLRSENVGGLTPSRLKMAYSIWEQSGRKLVEYIERARREQG
DIIETRDLAYRFTANAMADFIWGIDAGSLSGKVGEIGDFQKTSTDWSAHAFSSMIRFNKTL
VAIFVRKLFSMRFFTKATDEFFLRLTQDAVNLRQGGSGEGRTDYLSHLIQLQQRGNSIHDSV
GHALTVHLDGFETSGAVLYNAVSYQLSEHHEEQEKLRSEILEALASEGQISYDQINNLPYLD
QCFNESLRLTTPIGFFMRICTKPTQINLGDDKTLDLEPGVTVMVPAYQYHHDNDIYPEASE
FRPDRFENGAASVLTKRGCFLPFGDGPRICLGMRVGQLSVKTAIVHILSNYQVEQMKKV
PLGADSGMGIFLNGDVELKYTKLQK

AC009909, AC002444, AC003055
MYILASLALILLHLLVLPIYLYLTWHHKYWRKRGLVTARPLTLLGTYPGLLTRKSNLVFD
VQKIYSKYKGKHRAVGVFVTRQPQILVLDPELAHEVLVSNFRCYKDSLQSSYLRHAKWDK
YARLNPFWASGQSWRRLRTDAQAGISGSRLRQAYNIWEQGGQMLTEYMTQQVAEKNNILE
TRDVCFRYTAHVMADFIWGIDAGTLTRPMEQPNKVQEMASKWTSYAFYMLTLFMATIVAP
CSRLLLRFRFYPKETDEFFSNLTKESIELRLKAGDSTRTDYLSHLLQLRDQKQATHDDLV
GHALTVMLDGYDTSGTALLHALYYVLAENPAVQQKLRVEILSCMASEKSLDFEKLSSLQY
LEQCFNESLRLSSLIPQYTKVCTLPTVIRLSESKSLDVEVGMTIMIPNYQFHHDKQYFPE
PEAFKPERFDNGAYQELMRKGIFLPFSDGPRICMGVPLAMLTLKSALVHILSNFQVVRGR
DRLIPKGDSGFGVVLQGDVNLEYRRFFR*

AC007571
MLASIILSGWLLLAWLYFLWS
RRRYYKVAWQLRGPIGWPLIGMGLQMMNPESKSWTGF*
YMDGLSRQFKAPFISWMGTSCFLYINDPHSVEQILNSTHCTNKGDFYR
FMSSAIGDGLFTSSSPRWHKHRRLINPAFGRQILSNFLPIFNAEAEVLLQ
KLELEGVQHGKRLEIYQILKKIVLEAAC*
QTTMGKKMNFQHDGSLCIFKAYNGLVQMLRHLIMVNNIC
WLITFAFSLTEVCVKRMLSPWLYPDLIYRRSGLFRLQQKVVGILFGFIEQVSYWSDQSCTH
*LISQTFETTSTALYFTILCLAMHPCYQ
EKLHKELVTELPPSGDINLEQLQRLEYTEMVINEAMRLFAPVPMVLRSAD
QDIQLKRGDGEFLIPRGTQIGIDIYNMQRDERVWGPLSRTYNPDAHFGLD
SPQRHAFAFVPFTKGLRMCIGYRYAQMLMKLLLARIFRSYRISTEARLEE
LLVKGNISLKLKDYPLCRVERR*

AC017336 153877-155807 = AC007752 62176-61546 and 120713-121149
MLTWTLWCGLLFLLWIYFLWSRRRFYLLTLKIPGPLGYPILGMAHWLMRRE*
DILNAFGCFLDKHGPTIFSWLGPIPFMIVSDPQVVQDIFTSPHCVNKGIIYKAVDDGAGVGLFSLKD*
PRWSIHRKLLNPAFGHKVLLSFLPIFNRETALLLDQLEPLQDDGEKDLIPLLQSFTLGIAT*
QTTMGSDVKDEESFRSNSLLGRYQC*
ILETMTDMCFSPWLNSRFCRQLAGKESHYYQAKTEIRQFIRK*
DEMGALPSIQSNDKNLFLNLVTDLMRRGVFTLKNVEDESNI
IVFGAFETTANAVYYTLMLLAMFPEYQERAFEEIKTIFPNTGDFDVSYADTQQMVYLDLI
LNESMRVIPPVPVVSRQTSQDLKLSNGIVVPKGVQIAIDIYHMHRSKKIWGPDAETF
NPDHFLPHNIQDKHPYAYIPFTKGIRNCIG*
WRYALISAKVTLAKLLRNYRFKTSFPFENLYFVEDITMKLKSVPLLELQKRT*

AL068269 AA141181 might be same as AC005130
AAQAXFLXLAGFDTSSSTYHFLRCTSWPRTPXFRTPXNGXSGCLAV
*PRSATELRHFTGLXYLRQVVDEVLRXYPPTXFLDRCCNSRTGYDL
SPWNGGSPFKLRAGTXVYISVLGIHRDAQYWPNPESFDPERFSAEQ
RQQHHPMTYLPFGAGPRXCIGXXLGQLEIKVGLLHILNXFRVEXCE
RTLPEMRFDPKAFVLTAHNGTYLRFVKNXL*

AC012831 comp(42975-44564) = AL068269 AA141181 similar to AC014742
MIAVFSLIAAALAVGSLVLLPVVLRGGCLLVVTIVWLWQILHFWHWRRLGVPFVPAAPFVGNVWNLLRGACCFGDQFRELYESKEA
AGRAFVGIDVLHNHALLLRDPALIKRIMVEDFAQFSSRFETTDPTCDTMGSQNLFFSKYETWRETHKIFAPFFAAGKVRNMYGLLE
NIGQKLEEHMEQKLSGRDSMELEVKQLCALFTTDIIASLAFGIEAHSLQNPEAEFRRMCIEVNDPRPKRLLHLFTMFFFPRLSHRV
GTHLYSEEYERFMRKSMDYVLSQRAESGENRHDLIDIFLQLKRTEPAESIIHRPDFFAAQAAFLLLAGFDTSSSTITFALYELAKN
TTIQDRLRTELRAALQSSQDRQLSCDTVTGLVYLRQVVDEVLRLYPPTAFLDRCCNSRTGYDLSPWNGGSPFKLRAGTPVYISVLG
IHRDAQYWPNPEVFDPERFSAEQRQQHHPMTYLPFGAGPRGCIGTLLGQLEIKVGLLHILNHFRVEVCERTLPEMRFDPKAFVLTA
HNGTYLRFVKNSL*

AC014742 Drosophila melanogaster, in ordered pieces = AC005130
MVIAFFIFL*CAALAVGSVVLLPLIALLAVWLWQRRHFRIWRRLGVPYLPAAPVLG 1204
NVLNVETAACCFGDQFRELYERKEAAGRAIVGINVLHSHALLLRDPALIRRILVEDFPEFSSSFKST 1398
DAIRDTMGSGNLLFTKYKTWWETHKIFAIRLGGRRIRSLLYGLLERI*QNLEAHMAQK 1557
LNGAESVELEVKQLCALFTTDIFAKFALQSLQNPEAEFRPMCIEVNDPKPKRLSHHLFTGF 1710
TPPIYRVRTHLYSEEYERFMRKSMNYVLAQRAEN*EKRYDLIDMFLQMHRT 1841
ETAEGIIHRPDFYVAQAAFLLLAGFDTSSSFALYELAKNPTIEHRLQAELRVDLQSSHNHQLSYDTLTGLVYLRQVLED 2077
LPFGAGPRGCIGTLLGQLGIKVGLLHTLKHFRVELCERTLPEMRFDP*ASVLTAHNGTFLRFVRNSL* 2504

AC005130 DS01560, complete sequence AL062712 (BACR006M08), AL075733 (BACR037C16), = AC014742
CAALAVGSVVLLPLIALLAVWLWQRRHFRIWRRLGVPYLPAAPVL
GNVLNVETAA
*FGDQFRELYERKEAAGRAIVGINVLHSHALLLRDPALIRR
ILVEDFPEFSSSFKSTDAIRDTMGSGNLLFTKYKTWWETHKIFAVRTSGK*
*YVFADAPYRNCGRHNPSARLLRSPGC
FLLLAGFDTSSSFALYELA
KNPTIEHRLQAELRVDLQSSHNHQLSYDTLTGLVYLRQVLEDDPQFHDEI
SYPPRELQRIASV*
*SDTCLAVYFIIFKIRNPKPRIL
HQIDLTRATLTASLHALPFGAGPRGCIGTLLGQLGIKVGLLHTLKHFRVE
LCERTLPEMRFDP*

AL070820.1|CNS00FNQ Drosophila melanogaster genome survey sequence TET3 end
LSLCYTTEVVSDCGLGXSAXSFTDTPTPLLKXIKRVFNTSFEFIFYSVVTNLWQKV
RKFY
SVPFFNKETEVFFLDIIRRCITLRLEKPKQQRDDFLNYILQLQEKKGLHTDNILIN
TMTFILDGFETTALVLAHIMLMLGRNPESK
LTFDQMSELPHLDACI*LETLRLFSPQVAARKLVTEPFEFANKNGRTVHLKPGDVV
TIPVKAL 794
HHDPQYYEDPLTFKPERF 848

AL062352 AL054245 AI108091 AI113367 AI064259 AI064268 36% with 6a2
FRPPGXITXXPXMTXTXYXLAKNEXLQDRLRQEIVDFFGDEDHISY
ERIQEMPYLSQXVNETLRKYPIVGYIEREAIHPRTIPQHGVPHGMS
IYMSTVAVHRDPQYWPDPEKYDPERFNSSNRDNLNMDAYMPFGVGP
RNCIGMRLGLLQSKLGLVHILRNHRFHTCDKTIKKIEWAPTSPETF
CTRRIISRFEAITGPAN*

AC014186 AA803931 AA821188 AA803220 AI404794
MCCLQSTTLDRKKNASWNRSAIVGSDTSDAPEHCPL
RCGALLAVLLAWQQRKCWRLIWQLNGWRGVIQQPVLWLLLCINLHPNS
ILEKVSQYRVHFQRPLRVLVGTRVLLYIDDPAGMECVLNAPECLDKTFLQDGFFVRRGLLHAR
GQKWKLRRKQLNPAFSHNIVASFFDVFNSVGNQMVEQFQTQTNLHGQAVKFTAAEDLLSRAVLE*
DTSMGAQLDTQSVDHSPIIQAFHLSSKLLFKRMINPLLSSDWIFQRTQLWRDLDEQLQ
VIHSQMESVIEKRAKELLDMGEPAGRAHNLLDTLLLAKFEGQSLSRREI
RDEINTFVFXGVDTTTAAMSFVLYALAKFPETQTRLRKELQDVALDETTDLDALNGLPYLEALIKE
VLRLYTIVPTTGRQTTQSTEIGGRTYCAGVTLWINMYGLAHDKEYYPDPYAFKPERWLPE
DGAVAPPAFSYIPFSGGPHVCIGRRYSLLLMKLLTARLVREFQMELSPEQAPLRLEAQM
VLKAQQGINVSFLKQ*

AI404831 GH24669 52% to 6d2 N-term AC008197, AC009846, AA141002 C-term
MFSLILLAVTLLTLAWFYLKRHYEYWERRGFPFEKHSGIPFGCLDSVWRQEKSMGLAIYDVYV
KSKERVLGIYLLFRPAVLIRDADLARRVLAQDFASFHDRGVYVDEERDPLSANIF
SLRGQSWRSMRHMLSPCFTSGKLKSMFSTSEDIGDKMVAHLQKELPEEGFKEVDIKKVMQ
NYAIDIIASTIFGLDVNSFENPDNKFRKLVSLARANNRFNAMFG
VSNGCSCRIAQFLFRIGFKNPVGLAMLQIVKETVEYREKHGIVRKDLLQLLIQLRNTGKIDENDEKSFSIQKTPDD
IKTISLEAITAQAFIFYIAGQETTGSTAAFTIYELAQYPELLKRLQDEVDETLAKNDGKITYDSLNKMEFLDLCVQETIRKYPGLP
ILNRECTQDYTVPDTNHVIPKGTPVVISLYGIHHDAEYFPDPETYDPERFSEESRNYNPTAFMPFGEGPRICIAQRMGRINSKLAI
IKILQNFNVEVMSRSEIEFENSGIALIPKHGVRVRLSKRVPKLS*

AC008197 chromosome 3 clone BACR02L12 (D753) RPCI-98 AC009846 AA141002
MFSLILLAVTLLTLAWFYLKRHYEYWERRGFPFEKHSGIPFGCLDSVWRQEKSMGLAIYD 68158
VYVKSKERVLGIYLLFRPAVLIRDADLARRVLAQDFASFHDRGVYVDEERDPLSANIFSL 68338
RGQSWRSMRHMLSPCFTSGKLKSMFSTSEDIGDKMVAHLQKELPEEGFKEVDIKKVMQNY 68518
AIDIIASTIFGLDVNSFENPDNKFRKL 68599
VISESEFCYGVSNGCSCRIAQFLFRIGFKNPVGLAMLQIVKETVEYREKHGIVRKDLLQLLIQLRNTGKIDENDEKSFSIQKTPDD
IKTISLEAITAQAFIFYIAGQETTGSTAAFTIYELAQYPELLKRLQDEVDETLAKNDGKITYDSLNKMEFLDLCVQETIRKYPGLP
ILNRECTQDYTVPDTNHVIPKGTPVVISLYGIHHDAEYFPDPETYDPERFSEESRNYNPTAFMPFGEGPRI

AI064680 N-TERM TO C-HELIX AL061315 AI404867 AI406053 AI1516839 AC009382
MSALIFLCAILIGFVIYSLISSARRPKNFPPGPRFVPWLGNTLQFRKEASAVGGQHILFERWA
KDFRSDLVGLKLGREYVVVALGHEMVKEVQLQEVFEGRPDNFFLRLRTMGTRKGITCTDG
QLWYEHRHFAMKQMRNVGYGRSQMEHHIELEAEELLGQLERTEEQPIEPVTWLAQ
SVLNVLWCLIAGKR

AL078186
MLTSVFYVLFAIAITIILISYVFLLLKCKQKAFVVIGLLYQEKKY
QCFDQAPGPHPWPIIGNINLLGRFQYNPFYGFGTLTKKYGDIYSLSLGHT
RCIVVNNVDLIKEVLNKNGKYFGGRPDFFRYHKLFGGDRNNCKFIXXLRF

AC014810 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered
MLAALIYTILAILLSVLATSYICIIYGVKRRVLQPVKTKNSTEINHNAYQKYTQAPGPRP
WPIIGNLHLLDRYRDSPFAGFTALAQQYGDIYSLTFGHTRCLVVNNLELIREVLNQNGKV
MSGRPDFIRYHKLFGGERSNSLALCDWSQLQQKRRNLARRHCSPREFSCFYM
KMSQIVARKWSTGIRELGNQLVP
GEPINIKPLILKACANMFSQYMCSLRFDYDDVDFQQIVQYFDEIFWEINQGHPLDFLPWL
YPFYQRHLNKIINWSSTIRGFIMERIIRHRELSVDLDEPDRDFTD
ALLKSLLEDKDVSRNTIIFMLEDFIGGHSAVGNLVMLVLAYIAKNVDIGRRIQEEIDAII
EEENRSINLLDMNAMPYTMATIFEVLRYSSSPIVPHVATEDTVISGYGVTKGTIVF
INNYVLNTSEKFWVNPKEFNPLRFLEPSKEQSPKHF
LPFSIGKRTCIGQNLVRGFGFLVVVNVMQRYNISSHNPSTIKISPESLALPADCFPLVLTPREKIGPL*

AC008307 AC015141 AC007725 chromosome 3 clone BACR03D22 (D709) RPCI-98
TAFSSQWALFALSKEPRLQQRLAKERATNDSRLMHGLIKESLRLY 45293
PVAPFIGRYLPQDAQLGGHFIXXX 45230
TMVLLSLNTAGRDPSHFEQPERVLPERWCIGETEQVHKSHGSLPFAIGQRSCIGRRVALK 44986
QLHSLLGRCAAQFEMSCLNEMPVDSVLRMVTVPDRTLRLALRPR 44854

SRCRWLPPAATVTYTDDREEGAAGPAALAETPARPAPGANP*PEPLPFALRSAAFAAFSRNGTTACRRRQIRAHSKGEGT
SGSWDTCGSYSRRRSHAVNP*SLTSKSRRESNRVENA*YILVSENSIKLAYAGLVQC*TIQP*LATF*KK*LSIGFSFKK
VLNPTKLPKSAKRNRVFYI*NAS*DTKLY*NLTNRNI*IEISKLQRRKIC*NIQGRN*KTNHNLFDLREFNYNTYVLIYF
RRKDP*FNPYILYLFSLHKYIDARHKQYGPIFRERLGGTQDAVFVSSANLMRGVFQHEGQYPQHPLPDAWTLYNQQHACQ
RGLFFM*VEWTK*VR*IANVEPTKTISCLFFDGNREGAEWLHNRRILNRLLLNGNLNWMDVHIESCTRRMVDQWKRRTAE
AAAIPLAESGEIRSYELPLLEQQLYRWSIEGTRRHALRADPHTK*FHPNSIHL*FCAASCLAPACSPAPRSSPRWTTSRR
LCTRCLSIARD**HSRLAWPRFCACPSGGISRPMWMRCCVRELP*SITASECRRTKGDRTMRRFTIASRRRMCQAI*SSG
YL*TWSLQQVTR*AIQLNCKAHY*L*LHLI*TAFSSQWALFALSKEPRLQQRLAKERAT

EQVQVVTPRCHRDVHR*PRRGSGRARCAG*DTCSTSSWCESLA*ASSVRAAIRRLCSVFPQRNYRLPSPPNTCPFQG*RD
FR*LGHLWIL*PPAEPRSKSVKFNLEKSERKQSC*KCIIYSCVRKLYKIGLCRTSSMLNNSTLIGNLLKKVIINRFFF*E
SA*PY*ITKKCKT**SFLYLKCFMRY*IILKSHKSKYLNRNIKAAKKKNMLKYTRTKLKN*SQSLRLKRVQLQYICINIF
SPKGSIV*PVYPLSV*PS*VHRCEAQAVWSHFPGAIGRYPGCSVRIVRKSHARSLPARGSVSAASAAGCLDAV*PATCLP
TGTVLHVSGVDQMSPLNCKCGADENDLLSLLRWE*GGRRVAAQPTHT*STAAQRKFELDGRAY*ELYQTNGGSVEKTHCG
GGGDSASGEW*NTKLRTAPVGTTALPLVHRRYKTSCIKG*SPYKMISSQLYPLVVLCCIMFGTSVLTCPKIQSSLDYFTQ
IVHKVFEHSSRLMTFPPRLAQILRLPIWRDFEANVDEVLREGAAIIDHCIRVQEDQRRPHDEALYHRLQAADVPGDMIKR
IFVDLVIAAGDTVSNPIELQSPLLIVVTPHLDRIQQSVGFVCPFKGAEAPATTGQGASY

RAGAGGYPPLPP*RTPMTEKRERPGPLRWLRHLLDQLLVRILSLSLFRSRCDPPPLQRFPATELPPAVAAKYVPIPRVKG
LPVVGTLVDLIAAGGATQ*IRKV*PRKVGEKAIVLKMHNIFLCPKTL*NWLMPD*FNVKQFNLNWQPFEKSDYQ*VFLLR
KCLTLLNYQKVQNVIEFFISKMLHEILNYIKISQIEIFKSKYQSCKEEKYVEIYKDETKKLITISST*ESSTTIHMY*YI
FAERIHSLTRISFICLAFISTSMRGTSSMVPFSGSDWAVPRMQCSYRPQISCAESSSTRVSIRSIRCRMPGRCITSNMPA
NGDCSSCKWSGPNESVELQMWSRRKRSLVSSSMGIGRAPSGCTTDAYLIDCCSTEI*IGWTCILRAVPDEWWISGKDALR
RRRRFR*RRVVKYEATNCPCWNNSSTVGP*KVQDVMH*GLIPIQNDFIPTLSTCSSVLHHVWHQRAHLPQDPVLAGLLHA
DCAQGV*A*LATDDIPASLGPDFAPAHLAGFRGQCG*GAA*GSCHNRSLHQSAGGPKETAR*GALPSPPGGGCARRYDQA
DICRLGHCSR*HGEQSN*IAKPITNCSYTSFRPHSAVSGLCLPFQRSRGSSNDWPRSELP

AC015141 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered
           pieces
           Length = 21229

 Score =  124 bits (308), Expect = 5e-29
 Identities = 63/63 (100%), Positives = 63/63 (100%), Gaps = 13/63 (20%)
 Frame = +1

Query: 1   MTEKRERPGPLRWL-------------LSLFRSRCDPPPLQRFPATELPPAVAAKYVPIP 47
           MTEKRERPGPLRWL             LSLFRSRCDPPPLQRFPATELPPAVAAKYVPIP
Sbjct: 229 MTEKRERPGPLRWLRHLLDQLLVRILSLSLFRSRCDPPPLQRFPATELPPAVAAKYVPIP 408

Query: 48  RVKGLPVVGTLVDLIA 63
           RVKGLPVVGTLVDLIA
Sbjct: 409 RVKGLPVVGTLVDLIA 456

AC007648 Drosophila melanogaster chromosome 3 clone BACR13A02 (D705) RPCI-98
              13.A.2 map 88E-88F strain y; cn bw sp, WORKING DRAFT
              SEQUENCE, 84 unordered pieces
              Length = 128775

 Score =  123 bits (306), Expect = 9e-29
 Identities = 62/63 (98%), Positives = 63/63 (99%), Gaps = 13/63 (20%)
 Frame = -1

Query: 1      MTEKRERPGPLRWL-------------LSLFRSRCDPPPLQRFPATELPPAVAAKYVPIP 47
              MTEKRERPGPLRW+             LSLFRSRCDPPPLQRFPATELPPAVAAKYVPIP
Sbjct: 115974 MTEKRERPGPLRWMRHLLDQLLVRILSLSLFRSRCDPPPLQRFPATELPPAVAAKYVPIP 115795

Query: 48     RVKGLPVVGTLVDLIA 63
              RVKGLPVVGTLVDLIA
Sbjct: 115794 RVKGLPVVGTLVDLIA 115747

AC008307 Drosophila melanogaster chromosome 3 clone BACR03D22 (D709) RPCI-98
             03.D.22 map 86F-87A strain y; cn bw sp, *** SEQUENCING IN
             PROGRESS ***, 94 unordered pieces
             Length = 91546

 Score =  102 bits (253), Expect = 1e-22
 Identities = 49/49 (100%), Positives = 49/49 (100%)
 Frame = +2

Query: 15    LSLFRSRCDPPPLQRFPATELPPAVAAKYVPIPRVKGLPVVGTLVDLIA 63
             LSLFRSRCDPPPLQRFPATELPPAVAAKYVPIPRVKGLPVVGTLVDLIA
Sbjct: 28868 LSLFRSRCDPPPLQRFPATELPPAVAAKYVPIPRVKGLPVVGTLVDLIA 29014

AC015396 AL099182 Cyp302a1 dib1 gene
MLTKLLKISCTSRQCTF
AKPYQAIPGPRGPFGMGNLYNYLPGIGSYSWLRLHQAGQDKYEKYGAIVRETIVPGQDI 32181
VWLYDPKDIALLLNERDCPQRRSHLALAQYRKSRPDVYKTTGLLPTNGPEWWRIRAQ 32352
VQKELSAPKSVRNFVRQVDGVTKEFIRFLQESRNGGAIDMLPKLTRLNLE 32502
32643 VTCLLTFGARLQSFTA
QEQDPRSRSTRLMDAAETTNSCILPTDQGLQLWRFLETPSFRKLSQAQSYMESVALELVE 32870
ENVRNGSVGSSLISAYVKNPELDRSDVVGTAADLLLAGIDTTSYASAFLLYHIA 33032
RNPEVQQKLHEEARRVLPSAKDELSMDALRTDITYTRAVLKESLRLNPIAVGVGR 33197
GRGQDLNQDAIFSGYFVPKG
TTVVTQNMVACRLEQHFQDPLRFQPDRWLQHRSALNPYLVLPFGHG 33441
MRACIARRLAEQNMHI 33489
LLLRLLREYELIWSGSDDEMGVKTLLINKPDAPVLIDLRLRRE* 33705

AC017770 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered
             pieces
             Length = 67513

Cyp4e4 75-203
31598 EYVKRFGRSFMGTVLGHVVMVTAEPRHIDALLQGQHQLKKGTMYFALRGWLGDGLLLSRG 31419
31418 KEWHTMRKIITPTFHFSILEQFVEVFDRQSSILVERLRTLSYGNEVVNIYPLVGLAALDI 31239
31238 ITETAMGVNVD 31206

Cyp4d2 67-206
33788 AEIMDFVKKNQRKYGRLYRVWILHQLAVFSTDPRDIEFVLSSQQHITKNNLYKLLNCWLG 33609
33608 DGLLMSTGRKWHGRRKIITPTFHFKILEQFVEIFDQQSAVMVEQLQSRADGMTPINIFP 33432
33431 VICLTALDIIAGEFA 33387

Cyp4d2 245-483
33126 RQDKAIKVMHDFTENIIRERRETLVNNSKETTPEEEVNFLGQKRRMALLDVLLQST 32959
32958 IDGAPLSDEDIREEVDTFMFEGHDTTTSAISFCLYEISRHPEVQQRLQQEIRDVLGEDRK 32779
32778 SPVTLRDLGELKFMENVIKESLRLHPPVPMIGRWFAEDVE 32656
32605 LSGGKHIPAGTNFTMGIFVLLRDPEYFESPDEFRPERFDADVPQIHPYAYIPFSAGPRN 32429
32428 CIGQKFAMLEMKSTVSKLLRHFELL 32354

Cyp4d14 67-483
36011 AAIFDMQFRLIAEFGKNIKTQMLGESGFMTADSKMIEAIMSSQQTIQKNNLYSLLVNWLG 36190
36191 DGLLISQGKKWFRRRKIITPAFHFKILEDFVEVFDQQSATMVQKLYDRADGKTVINMFPV 36370
36371 ACLCAMDIIAETAMGVKINAQLQPQFTYVQSAHLFSASAML 36550
36551 AERFMNPLQRLDFTMKLFYPKLLDKLNDAVKNMHDFTNSVITERRELLQKAIADGGDADA 36730
36731 ALLNDVGQKRRMALLDVLLKSTIDGAPLSNDDIREEVDTFMFEGHDTTTSSIAFTCYLLA 36910
36911 RHPEVQARVFQEVRDVIGDDKSAPVTMKLLGELKYLECVIKESLRLFPSVPIIGRYISQD 37090
37091 TVLDGKLIPADSNVIILIYHAQRDPDYFPDPEKFIPDRFSMERKGEISPFAYTPFSAGP 37267
37268 RNCIGQKFAMLEMKSTISKMVRHFELL 37348

more distant 4d1 exon 1 103-194 usedin most ESTs of 4d1
49069 ERVLGSSQLLTKSQEYSFLGRWLNEGLLVSNGRKWHRRRKIITPAFHFRILEPYVEIFDR 49248
49249 QSLRLVEELALRISRGQERINLGEAIHLCALDAIC 49353

closest 4d1 exon 1 118-200
50309 YKALEPWLKEGLLVSRGRKWHKRRKIITPAFHFKILDQFVEVFEKGSRDLLRNME-QDRL 50485
50486 KHGDSGFSLYDWINLCTMDTICGRDLGL 50569

Cyp4d1 192-512
50677 SIPETAMGVSINAQSNADSEYVQAVKTISMVLHKRMFNILYRFDLTYMLTPLARAEKKAL 50856
50857 NVLHQFTEKIIVQRREELIREGSSQESSNDDADVGAKRKMAFLDILLQSTVDERPLSNLD 51036
51037 IREEVDTFMFEGHDTTSSALMFFFYNIATHPEAQKKCFEEIRSVVGNDKSTPVSYELLNQ 51216
51217 LHYVDLCVKETLRMYPSVPLLGRKVLEDCEISDGKLIPAGT 51396
51397 NIGISPLYLGRREELFSEPNIFKPERFDVVTTAEKLNPYAYIPFSAGPRNCIGQKFAMLE 51576
51577 IKAIVANVLRHYEVDFVGDSSEPPVLIAELILRTKEPLMFKVRER 51711

AC017388 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered

Cyp4d8 74-212
38766 REYVLKFGHLQRVWIFNRLLIMSGDAELNEQLLSSQEHLVKHPVYKVLGQWLGNGLLLSD 38587
38586 GKVWHQRRKIITPTFHFSILEQFVEVFDQQSNICVQRLAQKANGNTFDVYRSICAAA 38416
38415 LDIIAETAMGTKIYAQANESTPY 38347

Cyp4d8 250-512
38155 IRTMQEFTIKVIEKRRQALEDQQSKLMDTADEDVGSKRRMALLDVLLMSTVDGRPLTN 37982
37981 DEIREEVDTFMFEGHDTTTSALSFCLHELSRHPEVQAKMLEEIVQVLGTDRSRPVSIRDL 37802
37801 GELKYMECVIKESLRMYPPVPIVGRKLQTDFKVHGD 37622
37621 GVIPAGSEIIIGIFGVHRQPETFPNPDEFIPERHENGSRVAPFKMIPFSAGPRNCIGQ 37448
37447 KFAQLEMKMMLAKIVREYELLPMGQRVECIVNIVLRSETGFQLGMRKR 37304

AC017371 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered
1st gene with annotation

MDTFQLLLAVGVCFWIYFLWSRRRLYMMHFKIPGPMGLPILGIAFEYLITYK*   R at intron = CGT
RKMSIRTKYMDIYGSTCLVWVGPTPFVITRDPKIAEEIFLSPECLNRSSIFSKPVNSCTGDGLLSLE*
G at intron = GGT
ASKWVDRRKNLNPAFKQNVLLSFLPIFNSEAKTLVAFLDSLVGQGEKKVRDDIVRWSFRIATRK*
RK at intron = CGTAAGT
Intron = ACAGAA

QTTVGTDVKKDASFKNDSVLKSYET  T = ACGT

FMKIIVMNVLLPFTHNKIFSTLGGFETQKALAKSNVNKMIGTV
1         R = AGG  RFMKIIVMNVLLPFTHNKIFSTLGGFETQKALAKSNVNKMIGTV
2         R = AGG  RMLRLTIINIFVPFVQNKIVSKLFGLEWLRRRDASAINKMINNV
3         S = AGC  SLNNLIPIGVVMPWLRNKYLGKLFSYEKRRLEAATQSNAFIKDV

IVDKKLMTKPESGSQPEITSVINKAIELHRNGEMSREEVQSECCSFVVAAFETTGDTVYHALILLAMFPEHQDTVYQELK
ELFPVAGDFEVTYDDLQRMVFLERVVNETLRLIPSVPFTPRETIRDFRLSSGVVIPKGVGIGIDIFATHRNRDHWGTDPS
SFNPDHFLPDNVRDRHPYAYIPFSKGRRNCI

GWKYGLMSSKLALSKILRNCKVSTSFRYEDLEFVDNIGMELAQSPGLEFHRRT*

AC017371 1st gene 1890-3720 AC007724 AL072640 AL074717 AL076582
MDTFQLLLAVGVCFWIYFLWSRRRLYMMHFKIPGPMGLPILGIAFEYLITYK
RKMSIRTKYMDIYGSTCLVWVGPTPFVITRDPKIAEEIFLSPECLNRSSIFSKPVNSCTGDGLLSLE
ASKWVDRRKNLNPAFKQNVLLSFLPIFNSEAKTLVAFLDSLVGQGEKKVRDDIVRWSFRIATR
QTTVGTDVKKDASFKNDSVLKSYET
FMKIIVMNVLLPFTHNKIFSTLGGFETQKALAKSNVNKMIGTV
IVDKKLMTKPESGSQPEITSVINKAIELHRNGEMSREEVQSECCSFVVAAFETTGDTVYHALILLAMFPEHQDTVYQELK
ELFPVAGDFEVTYDDLQRMVFLERVVNETLRLIPSVPFTPRETIRDFRLSSGVVIPKGVGIGIDIFATHRNRDHWGTDPS
SFNPDHFLPDNVRDRHPYAYIPFSKGRRNCI
GWKYGLMSSKLALSKILRNCKVSTSFRYEDLEFVDNIGMELAQSPGLEFHRRT*

2nd gene with annotation
MIVIQLLIAASLILWIRFLWSRRKLYMLMMQLPGRMGLPLLGNSVRYLIISR
G at intron = GGT next joint = AGGC so joint =GGC = G
GRMSSRTTYMDKHGSTYMAWIGTTPIVITRDPKIAEKVLTSPFCINRSSQTTNALALSMGYGLLTLQ
G at intron = GGT
AGSKWMARRKHMNPAFKHSVLLSFLPIFNAETDLLVSVFDSFVGQGEKDVLSDLIRWSFAIATRK*

QTTLGTDVTKDDNFENDAILKTYQS   S = TCGT

2 MLRLTIINIFVPFVQNKIVSKLFGLEWLRRRDASAINKMINNV

ILDKKLNSNPENYCESELKTVIHRAIELFRNDEMSLMELGAECSSMVLAAFETSAHTVYYALVLLAMFPEHQEMVFNEIK
EHFPLAKGIEVTHTDLQQLVYLDRVLNETLRLMPSVPFSSRETLEDLRLSNGVVIPKGMTISIDIFNTQRNTDYWGSEAA
QFNPENFLPEKIHDRHPYAFIPFSKGKRNCI

GWRYGLMSSKLALVKILRNYKLKTSFPYENLEFVDHMVIKLAQSPQLAFERRTL*

AC017371 2nd gene 4100-5929 AC007724 AL057969 AL067059
MIVIQLLIAASLILWIRFLWSRRKLYMLMMQLPGRMGLPLLGNSVRYLIISR
GRMSSRTTYMDKHGSTYMAWIGTTPIVITRDPKIAEKVLTSPFCINRSSQTTNALALSMGYGLLTLQ
AGSKWMARRKHMNPAFKHSVLLSFLPIFNAETDLLVSVFDSFVGQGEKDVLSDLIRWSFAIATR
QTTLGTDVTKDDNFENDAILKTYQS
MLRLTIINIFVPFVQNKIVSKLFGLEWLRRRDASAINKMINNV
ILDKKLNSNPENYCESELKTVIHRAIELFRNDEMSLMELGAECSSMVLAAFETSAHTVYYALVLLAMFPEHQEMVFNEIK
EHFPLAKGIEVTHTDLQQLVYLDRVLNETLRLMPSVPFSSRETLEDLRLSNGVVIPKGMTISIDIFNTQRNTDYWGSEAA
QFNPENFLPEKIHDRHPYAFIPFSKGKRNCI
GWRYGLMSSKLALVKILRNYKLKTSFPYENLEFVDHMVIKLAQSPQLAFERRTL*

3rd gene with annotations
MLTLQIFEAFAIILCVYFLWSRRRFYIMMLKLPGPMGFPFIGLAFEYIRLKRK R at intron = CGT
G at other end of intron is AGGT so joint = CGT = R
RKIRLRTILFKIYGKTVLTWIGLTPVLVTCEPKILEDIFTSPNCSNRSSVVDKAISSCLGLGLLTLK
S at intron = AGT D at intron = AGAT so joint = AAT = N
NNHWNERRKLLLPSFKNNAVLSFVPVLNNEANFLVTLLAEFVDGGDINLLPELNKWSFKIAARK*

QITMGDEVRNQANYQNGNLLESYKA   A = GCGT

LNNLIPIGVVMPWLRNKYLGKLFSYEKRRLEAATQSNAFIKDV

IIDKKLSSTDNSSEPALIDRILNLVRIGELSYDDVMGEFSNIIFAASDTLSITVNNVLILMAMFPKYQDNVFEELAEVFP
SGGEFEASHADLEKLVKLDRVLHETMRLIPAVPLLIRQTSHSIQLSNGFYIPEGVTLMIDIFHTHRNKDIWGPQANAFNP
DNFLPENKRARPPYSYLPFSKGKKTCL

GWKLSLISAKLALAKILRNYMLSTTFLYKDLRFIDNTTMKLAEQPLLAVKRRI*

AC017371 6352-8145 3rd gene AC007724 AL066602
MLTLQIFEAFAIILCVYFLWSRRRFYIMMLKLPGPMGFPFIGLAFEYIRLKRK
RKIRLRTILFKIYGKTVLTWIGLTPVLVTCEPKILEDIFTSPNCSNRSSVVDKAISSCLGLGLLTLK
NNHWNERRKLLLPSFKNNAVLSFVPVLNNEANFLVTLLAEFVDGGDINLLPELNKWSFKIAAR
QITMGDEVRNQANYQNGNLLESYKA
LNNLIPIGVVMPWLRNKYLGKLFSYEKRRLEAATQSNAFIKDV
IIDKKLSSTDNSSEPALIDRILNLVRIGELSYDDVMGEFSNIIFAASDTLSITVNNVLILMAMFPKYQDNVFEELAEVFP
SGGEFEASHADLEKLVKLDRVLHETMRLIPAVPLLIRQTSHSIQLSNGFYIPEGVTLMIDIFHTHRNKDIWGPQANAFNP
DNFLPENKRARPPYSYLPFSKGKKTCL
GWKLSLISAKLALAKILRNYMLSTTFLYKDLRFIDNTTMKLAEQPLLAVKRRI*

AC017740 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered

271-461 = AC007928
168661 QVVNKKVNPLPKTDSDPESNIVINRAMELYRKGDITYMDVKSECCIMIAAGYDTSALTVY 168482
168481 HALFLLANHPEHQEAVFEELNGVFPDAGHFGITYPDMQKLDYLERVIKETLRLIPAIPI 168305
168304 TARETKNDVRLSNGVLIPKGVVIGIDMFHTHRNPEVWGPDADNFNPDNFLAENMEQKHPY 168125
168124 AYIPFARGKRNCIG 168083

AC017250

87-512 = AI404831
38209 YLLFRPAVLIRDADLARRVLAQDFASFHDRGVYVDEERDPLSANIFSLRGQSWRSMRHML 38388
38389 SPCFTSGKLKSMFSTSEDIGDKMVAHLQKELPEEGFKEVDIKKVMQNYAIDIIASTIF 38562
38563 GLDVNSFENPDNKFRKLVSLVSNGCSCRI 38742
38743 AQFLFRIGFKNPVGLAMLQIVKETVEYREKHGIVRKDLLQLLIQLRNTGKIDENDE 38910
38911 KSFSIQKTLTFSGHIKTISLEAITAQAFISYIAGQETTGST 39087
39088 AAFTIYELAQYPELLKRLQDEVDETLAKNDGKITYDSLNKMEFLDLCVQETIRKYPGLPI 39267
39268 LNRECTQDYTVPDTNHVIPKGTPVVISLYGIHHDAEYFPDPETYDPERFSEESRNYNPT 39444
39445 AFMPFGEGPRICIAQRMGRINSKLAIIKILQNFNVEVMSRSEIEFENSGIALIPKHGVR 39621
39622 VRLSKR 39639

AC017648 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered
Cyp9h1 306-485
9674 DELYYSEFTADDLLAQCLLFFFAGFEIISSSLCFLTHELCLNPTVQDRLYEEIISVHEE 9850
9851 LKGQPLTYDKLTKMKYLDMVVLEALRKWPPSISTDRECRQDIDLFDENGQKLFSARKGDV 10030
10031LQIPIFSLHHDPENFEDPEFFNPERFADGHALESRVYMPFGVGPRNCIGNRMALMELK 10204
10205SIVYQLLLNFKLLPA 10249

AC018013 = AC009382 EST AI064680

37224 MSALIFLCAILIGFVIYSLISSARRPKNFPP 37132
      GPRFVPWLGNTLQ
35956 FRKEASAVGGQHILFERWAKDFRSDLVGLKLGREYVVVALGHEMVKEVQLQEVFEGR 35786
35785 PDNFFLRLRTMGTRKGITCTDGQLWYEHRHFAMKQMRNVGYGRSQMEHHIELEAEELLGQ 35606
35605 LERTEEQPIEPVTWLAQSVLNVLWCLIAGKRIARQEDGTLRRLLDLMNRRSKLFDIC 35435
35434 GGLLAQFPWLRHVAPDRTGYNLIQQLNTELYGFFMDTIEEHRRQLAKDPSPAESDLIYA 35258
35257 YLQEMKDRSAGGESSTFNETQLVMTILDFFIAGSQTTSNTINLALMVLAMRPDVQEKLFS 35078
35077 QVTASVAAASTDAFPHLSRREAFDYMDAFIMEVQRFFHITPITGPRRALWATKLGGYDIP 34898
34897 KNATILISLRSVHLDKEHWKDPLEFRPERFIDSAGKCFKDEYFMPFGMGRRRCLGDALAR 34718
34717 ACIFSFLVRIVQHFSVVLPAGESPSMVLLPGITLTPKPYKVQFVKRT* 34574

AC017306 = L49408 CYP2 like
12933 MFYTVIWIFCATLLAILFGG
12993 VRKPKRFPPGPAWYPIVGSALQVSQ LRCRLGMFCKVIDVFARQYVNPYGFYGLKIGKDKV 13172
13173 VIAYTNDAISEMMTNEDIDGRPDGIFYRLRTFNSRLGVLLTDGEMWVEQRRFILRHLKNF 13352
13353 GFARSGMMDIVHNEATCLLQDLKDKVLKSGGKQTRIEMHDLTSVYVLNTLWCMLSGRRYE 13532
13533 PGSPEITQLLETFFELFKNIDMVGALFSHFPLLRFIAPNFSGYNGFVESHRSLYTFMSKE 13712
13713 IELHRLTYKNYDEPRDLMDSYLRAQDEGNDEKGMFSDQSLLAICLDMFLAGSETTNKSL 13889
13890 GFCFMHLVLQPEIQERAFQEIKEVVGLERIPEWSRDRTKLPYCEAITLEAVRMFMLHTFG 14069
14070 IPHRAVCDTRLSGYEIPKDTMVIACFRGMLINPVDFPDPESFNPDRYLFDGHLKLPEAF 14246
14247 NPFGFGRHRCMGDLLGRQNLFMFTTTVLQNFKMVAIPGQVPEEVPLEGATAAVKPYDIM 14423
14424 LVAREQ* 14444

AC017771 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered
= AC008325a
26507 MWIALLGIPILLAVLTLLLKHINKTYFILSLTKRVRTEDGSPLESKVAIMPGKTRFGNNLDIL 26319
26318 NFTTASVFNFVRESTAKAKGQNYLWYFLYAPMYNVVRPEE 26139
26138 AEEVFQSTKLITKNVVYELIRPFLGDGLLISTGK 25959
25958 GFNARLSLQCPFREECKKFLNVLEKNLDAELELNQ 25779
25778 VIPPFTLNNIC 25746
25686 ETALGVKLDDMSEGNEYRKAIHAIEEVLIQRVCNPLMYYNWYFFVYGDYRKHLQNLR 25516
25515 IVHDFSSRIIERKRQQFQQKQLGEVDEFGRKQRYAMLDTLLAAEADGQIDHQGIC 25351
25350 DEVNTFMFEGYDTTSTCLIFTLLMLALHEDVQKKCYEEVENLPEDSDDISMFQFNKLVY 25174
25173 LECVIKESLRMFPSVPFIGRQCVEETVVNGMVMPKDTQISIHIYDIMRDPRHFPKPDLF 24997
24996 QPDRFLPENTVNRHPFAYVPFSAGQRNCIGQKFAILEMKVLLAAVIRNFKLLPATQLEDL 24817
24816 TFENGIVLRTQENIKVKLSKRVK* 24745

16-194
24052 WAHLNRTYFILSLCKRIRTEDGSLLESKIYVAPSKTRFGNNFDLVNFTSG
23698 YDLRAFETVFGRRTAD*HRYFYIFILNSYYFKEF*TNLDQKWHSREKP*LLLFTLRCCNL 23519
23518 FLSIFREECNKLVKVLHQSVNMELELNQVIPQFTLNNVC 23348

69-134 = AC008324 gene 1
23841 IFNFMRDASAKAKGRNYLWYFFHAPMYNIVRAEEAEEILQSSKLITKNMIYELLKPFLGE 23662
23661 GLLISTG 23641

190-512 = AC008324 gene 1
23179 INVIAETALGVKLDDLSEGIRYRQSIHAIEEVMQQRLCNPFFYNIVYFFLFGDYRKQVN 23003
23002 NLKIAHEFSSNIIEKRRSLFKSNQLGQEDEFGKKQRYAMLDTLLAAEADGQIDHQ 22838
22837 GICDEVNTFMFEGYDTTSTCLIFTLLMLALHEDVQKKCYEEIKYLPDDSDDISVFQFNE 22661
22660 LVYMECVIKESLRLFPSVPFIGRRCVEEGVVNGLIMPKNTQINIHLYEIMRDARHFSNP 22484
22483 KMFQPDRFFPENTVNRHPFAFVPFSAGQRNCIGQKFAILEIKVLLAAVIRNFKILPVTLL 22304
22303 DDLTFENGIVLRTKQNIKVKLVHR 22232

AC017771 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered
             pieces
             Length = 49626

 Score =  250 bits (632), Expect = 1e-65
 Identities = 133/321 (41%), Positives = 204/321 (63%)
 Frame = -1

Query: 186   ETAMGIQMNAQEESESEYVKAVYEISELTMQRSVRPWLHPKVIFDLTTMGKRYAECLRIL 245
             ETA+G++++   E  +EY KA++ I E+ +QR   P ++    F +    +++ + LRI+
Sbjct: 25686 ETALGVKLDDMSEG-NEYRKAIHAIEEVLIQRVCNPLMYYNWYFFVYGDYRKHLQNLRIV 25510

Query: 246   HGFTNKVIQERKSLRQMTGMKPTISNEEDELLGKKKRLAFLDLLLEASENGTKMSDTDIR 305
             H F++++I+ ++   Q   +      E DE  G+K+R A LD LL A  +G ++    I
Sbjct: 25509 HDFSSRIIERKRQQFQQKQL-----GEVDEF-GRKQRYAMLDTLLAAEADG-QIDHQGIC 25351

Query: 306   EEVDTFMFEGHDTTSAGICWALFLLGSHPEIQDKVYEELDHIFQGSDRSTTMRDLADMKY 365
             +EV+TFMFEG+DTTS  + + L +L  H ++Q K YEE++++ + SD   +M     + Y
Sbjct: 25350 DEVNTFMFEGYDTTSTCLIFTLLMLALHEDVQKKCYEEVENLPEDSD-DISMFQFNKLVY 25174

Query: 366   LERVIKESLRLFPSVPFIGRVLKEDTKIGDYLVPAGCMMNLQIYHVHRNQDQYPNPEAFN 425
             LE VIKESLR+FPSVPFIGR   E+T +   ++P    +++ IY + R+   +P P+ F
Sbjct: 25173 LECVIKESLRMFPSVPFIGRQCVEETVVNGMVMPKDTQISIHIYDIMRDPRHFPKPDLFQ 24994

Query: 426   PDNFLPERVAKRHPYAYVPFSAGPRNCIGQKFATLEEKTVLSSILRNFKVRSIEKREDLT 485
             PD FLPE    RHP+AYVPFSAG RNCIGQKFA LE K +L++++RNFK+    + EDLT
Sbjct: 24993 PDRFLPENTVNRHPFAYVPFSAGQRNCIGQKFAILEMKVLLAAVIRNFKLLPATQLEDLT 24814

Query: 486   LMNELILRPESGIKVELIPRL 506
               N ++LR +  IKV+L  R+
Sbjct: 24813 FENGIVLRTQENIKVKLSKRV 24751

 Score =  249 bits (628), Expect = 3e-65
 Identities = 132/326 (40%), Positives = 203/326 (61%)
 Frame = -3

Query: 180   ALDIICETAMGIQMNAQEESESEYVKAVYEISELTMQRSVRPWLHPKVIFDLTTMGKRYA 239
             A+++I ETA+G++++   E    Y ++++ I E+  QR   P+ +  V F L    ++
Sbjct: 23182 AINVIAETALGVKLDDLSEG-IRYRQSIHAIEEVMQQRLCNPFFYNIVYFFLFGDYRKQV 23006

Query: 240   ECLRILHGFTNKVIQERKSLRQMTGMKPTISNEEDELLGKKKRLAFLDLLLEASENGTKM 299
               L+I H F++ +I++R+SL      K     +EDE  GKK+R A LD LL A  +G ++
Sbjct: 23005 NNLKIAHEFSSNIIEKRRSL-----FKSNQLGQEDEF-GKKQRYAMLDTLLAAEADG-QI 22847

Query: 300   SDTDIREEVDTFMFEGHDTTSAGICWALFLLGSHPEIQDKVYEELDHIFQGSDRSTTMRD 359
                 I +EV+TFMFEG+DTTS  + + L +L  H ++Q K YEE+ ++   SD  +  +
Sbjct: 22846 DHQGICDEVNTFMFEGYDTTSTCLIFTLLMLALHEDVQKKCYEEIKYLPDDSDDISVFQ- 22670

Query: 360   LADMKYLERVIKESLRLFPSVPFIGRVLKEDTKIGDYLVPAGCMMNLQIYHVHRNQDQYP 419
               ++ Y+E VIKESLRLFPSVPFIGR   E+  +   ++P    +N+ +Y + R+   +
Sbjct: 22669 FNELVYMECVIKESLRLFPSVPFIGRRCVEEGVVNGLIMPKNTQINIHLYEIMRDARHFS 22490

Query: 420   NPEAFNPDNFLPERVAKRHPYAYVPFSAGPRNCIGQKFATLEEKTVLSSILRNFKVRSIE 479
             NP+ F PD F PE    RHP+A+VPFSAG RNCIGQKFA LE K +L++++RNFK+  +
Sbjct: 22489 NPKMFQPDRFFPENTVNRHPFAFVPFSAGQRNCIGQKFAILEIKVLLAAVIRNFKILPVT 22310

Query: 480   KREDLTLMNELILRPESGIKVELIPR 505
               +DLT  N ++LR +  IKV+L+ R
Sbjct: 22309 LLDDLTFENGIVLRTKQNIKVKLVHR 22232

 Score =  247 bits (624), Expect = 9e-65
 Identities = 167/453 (36%), Positives = 253/453 (54%), Gaps = 59/453 (13%)
 Frame = -2

Query: 54    VPRNKLFQVFDRRAKLYGPLYRIWAGPIA-QVGLTRPEHVELILRDTKHIDKSLVYSFIR 112
             +  N    + +  AK  G  Y IW    A +  + R E  E I + TK   K++ Y  IR
Sbjct: 21749 ITANIFSYIRESTAKANGQNY-IWNFLFAPEYNIVRAEDAEEIFQSTKITTKNMSYELIR 21573

Query: 113   PWLGEGLL--------------------TGTGAKWHSHRKMITPTFHFKILDIFVD---- 148
             P+LG+GLL                    T  G +       +       +  +F+
Sbjct: 21572 PFLGDGLLISIGE*SYIIVNI**SHSF*TRNGTQEEKL*PQLFILIFCSLFSLFLSK*E* 21393

Query: 149   --------------VFVEKSEILVKKLQSKVGGKDFDIYPFITHCALDIIC--------- 185
                           +F E+S+  +K L   VG  + ++   I    L+ IC
Sbjct: 21392 RFLTYYLFK*SIILIFREESKKFIKILDKNVGF-ELELNQIIPQFTLNNICGKHNHLFIY 21216

Query: 186   -----------ETAMGIQMNAQEESESEYVKAVYEISELTMQRSVRPWLHPKVIFDLTTM 234
                        ETA+G++++   E  +EY KA+++   +  QR   P +     F L
Sbjct: 21215 EIVLLTIFSSIETALGVKLDDMSEG-NEYRKAIHDFEIVFNQRMCNPLMFFNWYFFLFGD 21039

Query: 235   GKRYAECLRILHGFTNKVIQERKSLRQMTGMKPTISNEEDELLGKKKRLAFLDLLLEASE 294
              K+Y+  LR +HGF++ +IQ ++        K     + DE  GKK+R A LD LL A
Sbjct: 21038 YKKYSRILRTIHGFSSGIIQRKRQQ-----FKQKQLGQVDEF-GKKQRYAMLDTLLAAEA 20877

Query: 295   NGTKMSDTDIREEVDTFMFEGHDTTSAGICWALFLLGSHPEIQDKVYEELDHIFQGSDRS 354
              G K+    I +EV+TFMF G+DTTS  + + L LL  H ++Q++ YEEL  + +  D
Sbjct: 20876 EG-KIDHQGICDEVNTFMFGGYDTTSTSLIFTLLLLALHADVQERCYEELQDLPEDIDE- 20703

Query: 355   TTMRDLADMKYLERVIKESLRLFPSVPFIGRVLKEDTKIGDYLVPAGCMMNLQIYHVHRN 414
              +M    ++ +LE VIKESLRLFPS P IGR   E++ +   ++P    +++ IY + R+
Sbjct: 20702 VSMFQFNELIHLECVIKESLRLFPSAPIIGRTCIEESVMNGLVLPKNAQISIHIYDIMRD 20523

Query: 415   QDQYPNPEAFNPDNFLPERVAKRHPYAYVPFSAGPRNCIGQKFATLEEKTVLSSILRNFK 474
                +P P  F P+ FLPE    RHP+A+VPFSAGPRNCIGQKF  LE K +L++++RNFK
Sbjct: 20522 ARHFPKPNQFLPERFLPENSVNRHPFAFVPFSAGPRNCIGQKFGVLEIKVLLAAVIRNFK 20343

Query: 475   VRSIEKREDLTLMNELILRPESGIKVELIPRL 506
             +    + EDLT  N ++LR +  IKV+   R+
Sbjct: 20342 LLPATQLEDLTFENGIVLRTQQNIKVKFEARV 20247

21749 ITANIFSYIRESTAKANGQNYIWNFLFAPEYNIVRAEDAEEIFQSTKITTKNMSYELIR 21573
21572 PFLGDGLLISIGTQEEKL*PQLFILIFCSLFSLFLS
      IFREESKKFIKILDKNVGFELELNQIIPQFTLNNIC
      ETALGVKLDDMSEGNEYRKAIHDFEIVFNQRMCNPLMFFNWYFFLFGD 21039
21038 YKKYSRILRTIHGFSSGIIQRKRQQFKQKQLGQVDEFGKKQRYAMLDTLLAAEA 20877
20876 EGKIDHQGICDEVNTFMFGGYDTTSTSLIFTLLLLALHADVQERCYEELQDLPEDIDE 20703
20702 VSMFQFNELIHLECVIKESLRLFPSAPIIGRTCIEESVMNGLVLPKNAQISIHIYDIMRD 20523
20522 ARHFPKPNQFLPERFLPENSVNRHPFAFVPFSAGPRNCIGQKFGVLEIKVLLAAVIRNFK 20343
20342 LLPATQLEDLTFENGIVLRTQQNIKVKFEARV 20247

 Score = 48.8 bits (114), Expect(2) = 7e-11
 Identities = 31/92 (33%), Positives = 46/92 (49%), Gaps = 3/92 (3%)
 Frame = -2

Query: 33    LVNKLPGPTAYPV---VGNAIEAIVPRNKLFQVFDRRAKLYGPLYRIWAGPIAQVGLTRP 89
             ++N  PG   Y +   V N   + +  +    V +  AK  G  Y  +        + RP
Sbjct: 26324 ILNFTPGSFCYSMRL*VYNYFSSFITASVFNFVRESTAKAKGQNYLWYFLYAPMYNVVRP 26145

Query: 90    EHVELILRDTKHIDKSLVYSFIRPWLGEGLLTGTG 124
             E  E + + TK I K++VY  IRP+LG+GLL  TG
Sbjct: 26144 EEAEEVFQSTKLITKNVVYELIRPFLGDGLLISTG 26040

 Score = 44.1 bits (102), Expect(2) = 1e-06
 Identities = 23/61 (37%), Positives = 33/61 (53%)
 Frame = -1

Query: 64    DRRAKLYGPLYRIWAGPIAQVGLTRPEHVELILRDTKHIDKSLVYSFIRPWLGEGLLTGT 123
             D  AK  G  Y  +        + R E  E IL+ +K I K+++Y  ++P+LGEGLL  T
Sbjct: 23823 DASAKAKGRNYLWYFFHAPMYNIVRAEEAEEILQSSKLITKNMIYELLKPFLGEGLLIST 23644

Query: 124   G 124
             G
Sbjct: 23643 G 23641

 Score = 39.9 bits (91), Expect(2) = 7e-11
 Identities = 14/25 (56%), Positives = 18/25 (72%)
 Frame = -3

Query: 126   KWHSHRKMITPTFHFKILDIFVDVF 150
             KWHS RK +TP FHF +L  F+ +F
Sbjct: 25978 KWHSRRKALTPAFHFNVLQSFLGIF 25904

 Score = 30.1 bits (66), Expect(2) = 1e-06
 Identities = 11/20 (55%), Positives = 15/20 (75%)
 Frame = -2

Query: 131   RKMITPTFHFKILDIFVDVF 150
             RK +TP FHFK+L  F+ +F
Sbjct: 23564 RKALTPAFHFKVLQSFLIIF 23505

AC018294 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered
            pieces
            Length = 96075

36-529  44% to 12A2 probable mito seq.
1755 MYHTLMLNIGFICRMSTQK* 1814  CANNOT IDENTIFY THE N-TERMINAL
2205 ATAVNLEEAKPYADIPGPSKLQLIRAFLPGGK
2369 GLYKNLPVHEMFLDMNRQYGSIFRMPSVAGTDLVLTMNPQDYEVIFRNEGQYPYRRSFE 2545
2546 VMDYFKRVHRREVFDGYDGLTSGNGPAWGKMRTAVNPILLQPRN 2725
2726 AKLYMTNLVQVSDEFLER* 2776
2838 IRIIRDPVTQEMPDDFAVDIRHLVIESICSVALNTHLGLLGEQRNNKDIQKLVLALQDVV 3017
3018 ELGFQLDIMPAFWKYLPMPNFKKLMRSLDTITDFCYFHIGNALKRIEEDAKAGTLNEIGL 3197
3198 ETSLLEKLARFDRQTAVIIAMDLLFAGADPVSLTLGGILFS 3377
3378 LSKSPDKQARLLEEIRGILPNKDSSLTIENMRNLPYLRACIKEGIRMYPIGPGTLRRMPH 3557
3558 DVVLSGYRVVAGTDVGIAANYQMANMEQFVPKVREFIPERWLRDESNSHLVGE 3716
3717 TATPFMYLPFGFGPRSCAGKRIVDMMLEIAISRLVRNFKIGFDYPIENAFKAQFFVQPNI 3896
3897 PFKFKFIERNE* 3932

MLKVRSALSLIQSQKATLSLATQK*    RWQTNVATAEAREDSEWLQAKPFEQIPRLNMWALSMKMSMPGGK
                                                |||   ||      |      ||||
2070 MLSTQWNANKQISRQIYQLCRGLAQKVRKIYFKYIFIL*NLEEAKPYADIPGPSKLQL IRAFLPGGK
         MNTLSSARSVAIYVGPVRSSRSASVLAHEQAKSS

an EST like this seq
NTLSSARSVAIYVGPVRSSRSASVLAHEQAKSSITEEHKTYDEIPRPNKFKFMRAFMPGGEFQNASITEYTSAMRKRYGD
IYVMPGMFGRKDWVTTFNTKDIEMVFRNEGIWPRRDGLDSIVYFREHVRPDVYGEVQGLVASQNEAWGKLRSAINPIFMQ
PRGLRMYYEPLSNINNEFIERIKEIRDPKTLEVPEDFTDEISRLVF

54F 12a4       MLKVRSALSLIQSQKATLSLATQKRWQTNVATAEAREDSEWLQAKPFEQIPRLNMWALSMKMSMPGGK
55F 12a5       MLKGRIALNILQSQKPIVFSASQQRWQTNVPTAEIRNDPEWLQAKPFEEIPKANILSLFAKSALPGGK
56F 12b2     MWKYSNKI(36aa)KSRNLFTNNGYICSQTQLELADSRIDEKWQQARSFGEIPGPSLLRMLSFF-MPGGK
57F AC008187 ??TKRNRMNTLSSARSVAIYVGPVRSSRSASVLAHEQAKSSITEEHKTYDEIPRPNKFKFMRAF-MPGGE
    AC018294                                          NLEEAKPYADIPGPSKLQLIRAF-LPGGK