CYP34A8

 

>CYP34A8 B0213.14 499 aa

         Length = 499

 

 Score = 2442 (859.6 bits), Expect = 4.5e-257, P = 4.5e-257

 Identities = 484/499 (96%), Positives = 484/499 (96%)

 

Query:     1 MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK 60

             MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK

Sbjct:     1 MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK 60

 

Query:    61 QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF 120

             QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF

Sbjct:    61 QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF 120

 

Query:   121 WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG 180

             WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG

Sbjct:   121 WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG 180

 

Query:   181 SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP 240

             SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP

Sbjct:   181 SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP 240

 

Query:   241 ---------------VAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFTYVLSII 285

                            VAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFT     

Sbjct:   241 FDFIFELGNRGIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFT------ 294

 

Query:   286 RKFSIACFSLETLAIDLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGT 345

                      LETLAIDLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGT

Sbjct:   295 ---------LETLAIDLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGT 345

 

Query:   346 RGVSLTDRTKTPYLNANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSML 405

             RGVSLTDRTKTPYLNANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSML

Sbjct:   346 RGVSLTDRTKTPYLNANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSML 405

 

Query:   406 HTDEEIFKNPQEFRPERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYD 465

             HTDEEIFKNPQEFRPERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYD

Sbjct:   406 HTDEEIFKNPQEFRPERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYD 465

 

Query:   466 LEPVGTLPKIETTTPFAPMKRPPVYDIRFVPRSQ 499

             LEPVGTLPKIETTTPFAPMKRPPVYDIRFVPRSQ

Sbjct:   466 LEPVGTLPKIETTTPFAPMKRPPVYDIRFVPRSQ 499

 

Compare to seq below

 

 

>CYP34A7 B0213.12 499 aa

         Length = 499

 

 Score = 2193 (772.0 bits), Expect = 1.1e-230, P = 1.1e-230

 Identities = 405/498 (81%), Positives = 456/498 (91%)

 

Query:     1 MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK 60

             ML+ILIL+A+AAVLT NLWRARQKLP GPTPLP+IGNFHQLFY  WK G LVA F++ +K

Sbjct:     1 MLIILILVAIAAVLTVNLWRARQKLPNGPTPLPIIGNFHQLFYNGWKYGGLVAGFDQFRK 60

 

Query:    61 QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF 120

             QYGKVFTVWMGP P+V ICD+D+AHETHVK+A+ FG RY+ G MEYIREG+GIIGSNGDF

Sbjct:    61 QYGKVFTVWMGPIPAVQICDFDVAHETHVKKAHTFGHRYTFGAMEYIREGKGIIGSNGDF 120

 

Query:   121 WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG 180

             WLEHRRFALMTLRNFG+GR I+EDKIM+EYRYRF+DFK+T+FK+GAIQVNASS+FDLLVG

Sbjct:   121 WLEHRRFALMTLRNFGLGRNIIEDKIMEEYRYRFEDFKKTNFKDGAIQVNASSLFDLLVG 180

 

Query:   181 SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP 240

             SIINQLLVSERFEQDD+EFE+LKT+LA ALEN SIIEG LPLW+LKS  MKWRTK TFAP

Sbjct:   181 SIINQLLVSERFEQDDEEFEELKTNLAMALENGSIIEGVLPLWMLKSRFMKWRTKTTFAP 240

 

Query:   241 FDFIFELGNRGIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFTLETLAI 300

             FDF+FE+G +GIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKK+GIDSSFTLETLA+

Sbjct:   241 FDFVFEVGKKGIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKKDGIDSSFTLETLAV 300

 

Query:   301 DLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGTRGVSLTDRTKTPYLN 360

             DLFDLWQAGQETTSTTLTWAC CLLNHPEVVEKLRKELTEVTGG RGVSLTDRTKTPYLN

Sbjct:   301 DLFDLWQAGQETTSTTLTWACACLLNHPEVVEKLRKELTEVTGGARGVSLTDRTKTPYLN 360

 

Query:   361 ANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSMLHTDEEIFKNPQEFRP 420

             A INE QRI+SILNVNL R+LEED  ID  PVPAG   TT L++LHTDEE FKN +EF P

Sbjct:   361 ATINEVQRISSILNVNLLRILEEDAVIDGHPVPAGTAFTTQLALLHTDEETFKNHKEFIP 420

 

Query:   421 ERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYDLEPVGTLPKIETTTP 480

             ERF+ENNNLEKRLIPFGIGKR+CPGESLA+AEL+LI GN+++D+DL+PVG +PKIE+ TP

Sbjct:   421 ERFLENNNLEKRLIPFGIGKRSCPGESLAKAELYLIIGNLVIDFDLKPVGAIPKIESPTP 480

 

Query:   481 FAPMKRPPVYDIRFVPRS 498

             F+P+KRPPVYDIRF+ R+

Sbjct:   481 FSPVKRPPVYDIRFISRA 498

 

CYP23A1

 

>CYP23A1 B0304.3    U39472 529 aa (revised 12/8/2003)

         Length = 529

 

 Score = 2756 (970.2 bits), Expect = 2.4e-290, P = 2.4e-290

 Identities = 529/534 (99%), Positives = 529/534 (99%)

 

Query:     1 MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP 60

             MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP

Sbjct:     1 MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP 60

 

Query:    61 VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120

             VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD

Sbjct:    61 VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120

 

Query:   121 SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS 180

             SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS

Sbjct:   121 SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS 180

 

Query:   181 LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV 240

             LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV

Sbjct:   181 LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV 240

 

Query:   241 MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED 300

             MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED

Sbjct:   241 MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED 300

 

Query:   301 LTYAYMIEVEKRKRNGEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ 360

             LTYAYMIEVEKRKRNG     FDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ

Sbjct:   301 LTYAYMIEVEKRKRNG-----FDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ 355

 

Query:   361 KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK 420

             KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK

Sbjct:   356 KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK 415

 

Query:   421 KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR 480

             KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR

Sbjct:   416 KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR 475

 

Query:   481 AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK 534

             AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK

Sbjct:   476 AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK 529

 

Compare to the sequence below:

 

>cb25.fpc3052 ACC=CAAC01000064 86% to CYP23A1

            Length = 534

 

 Score = 2430 (855.4 bits), Expect = 8.4e-256, P = 8.4e-256

 Identities = 454/534 (85%), Positives = 493/534 (92%)

 

Query:     1 MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP 60

             MPS++ AILL++ T+FLYFRIM++   D+Y  TYIY   C   Y  YE NYKRRRLPNGP

Sbjct:     1 MPSIKYAILLSVVTSFLYFRIMKSFEMDSYTTTYIYLFFCTISYIFYESNYKRRRLPNGP 60

 

Query:    61 VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120

             +PWLVAGNMPSFINV NVD LF  WKQ+YGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD

Sbjct:    61 MPWLVAGNMPSFINVKNVDDLFLYWKQRYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120

 

Query:   121 SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS 180

             +FSNRWRNFVTDSIMEGSNGIVQIDG+KWREQRRFALHTLRDFGVG+PLMEQMITLEVT+

Sbjct:   121 AFSNRWRNFVTDSIMEGSNGIVQIDGDKWREQRRFALHTLRDFGVGRPLMEQMITLEVTT 180

 

Query:   181 LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV 240

             LMNHM KSCGL  KE++LCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLH LLDDQSHTV

Sbjct:   181 LMNHMAKSCGLSTKEVNLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHSLLDDQSHTV 240

 

Query:   241 MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED 300

             MQPIMGAYIAFPVT+K+P INGEWNRLMGIK ELLEFLE QI+ HR NWK+EM+EQEPED

Sbjct:   241 MQPIMGAYIAFPVTTKVPFINGEWNRLMGIKKELLEFLEGQIQKHRENWKEEMMEQEPED 300

 

Query:   301 LTYAYMIEVEKRKRNGEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ 360

             LTYAYMIEVEKRKR GEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLM+KN +VQ

Sbjct:   301 LTYAYMIEVEKRKRAGEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMSKNPRVQ 360

 

Query:   361 KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK 420

             + VQ ELDSI QPM+EIQHRTRLPY+QATINEIQRIANILPINLLRTVAEDI+IDGY FK

Sbjct:   361 RKVQEELDSIAQPMVEIQHRTRLPYIQATINEIQRIANILPINLLRTVAEDIDIDGYQFK 420

 

Query:   421 KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR 480

             KGDL IPQISILMNDPEIF+NP++F P RFLDE+ NVKKIDEFLPFSIGRRQCLGESLAR

Sbjct:   421 KGDLTIPQISILMNDPEIFKNPKDFCPERFLDENLNVKKIDEFLPFSIGRRQCLGESLAR 480

 

Query:   481 AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK 534

             AELYL+FANL+QNF FEV++DVTTERVLGLTVSPV+Y+CKI+RRGLDHN+N VK

Sbjct:   481 AELYLIFANLMQNFKFEVSEDVTTERVLGLTVSPVQYTCKISRRGLDHNENCVK 534

 

CYP29A4

 

>CYP29A4 B0331 500 aa 4 small changes, probably errors in my old sequence

         Length = 500

 

 Score = 2532 (891.3 bits), Expect = 1.3e-266, P = 1.3e-266

 Identities = 481/486 (98%), Positives = 481/486 (98%)

 

Query:     1 MSILIPVALALLFVYLLSFYDTIRLMRKFWIYGGKMPGPPAHPIFGNASLFKNKTTKDFV 60

             MSILIPVALALLFVYLLSFYDTIRLMRKFWIYGGKMPGPPAHPIFGNASLFKNKT  DFV

Sbjct:     1 MSILIPVALALLFVYLLSFYDTIRLMRKFWIYGGKMPGPPAHPIFGNASLFKNKTXSDFV 60

 

Query:    61 ELFVQLAHEARSKGANLMRTQVMNRIYVWPLNGKTAATILESSTEVNKGDDYAFLVPWLG 120

             ELFVQLAHEARSKGANLMRTQVMNRIYVWPLNGKTAATILESSTEVNKGDDYAFLVPWLG

Sbjct:    61 ELFVQLAHEARSKGANLMRTQVMNRIYVWPLNGKTAATILESSTEVNKGDDYAFLVPWLG 120

 

Query:   121 GGLLMEKGEKWKSHRRILTPAFHFAKLEGYLDVFNSESKILIDCLEKIAETQETVDLFPF 180

             GGLLMEKGEKWKSHRRILTPAFHFAKLEGYLDVFNSES ILIDCLEKIAETQETVDLFPF

Sbjct:   121 GGLLMEKGEKWKSHRRILTPAFHFAKLEGYLDVFNSES-ILIDCLEKIAETQETVDLFPF 179

 

Query:   181 FKRCTLDIICGTAMGIKLDAQNVHNLGYVQAVEGFNKLTVEYSLNPFLWNRFVYWALGYQ 240

             FKRCTLDIICGTAMGIKLDAQNVHNLGYVQAVEGFNKLTV YSLNPFLWNRFVYWALGYQ

Sbjct:   180 FKRCTLDIICGTAMGIKLDAQNVHNLGYVQAVEGFNKLTVSYSLNPFLWNRFVYWALGYQ 239

 

Query:   241 KMHDDFLYTLKKFTNDAIVERRTVIASGEIEKETSKRKMNFLDILLNSEESNELTSDEIR 300

             KMHDDFLYTLKKFTNDAIVERRTVIASGEIEKETSKRKMNFLDILLNSEESNELTSDEIR

Sbjct:   240 KMHDDFLYTLKKFTNDAIVERRTVIASGEIEKETSKRKMNFLDILLNSEESNELTSDEIR 299

 

Query:   301 KEVDTFMFAGHDTTSTSLSWLCWNIAHNPEVQENVYKEIISIFGEDPNQDVTSENINRLE 360

             KEVDTFMFAGHDTTSTSLSWLCWNIAHNPEVQENVYKEIISIFGEDPNQDVTSENINRLE

Sbjct:   300 KEVDTFMFAGHDTTSTSLSWLCWNIAHNPEVQENVYKEIISIFGEDPNQDVTSENINRLE 359

 

Query:   361 YTERVLKESKRMFPPVPGFQRKLTKDIVIDGITIPSEGNITISPTVLHCNPFVYQNPEKF 420

             YTERVLKESKRMFPPVPGFQRKLTKDIVI GITIPSEGNITISPTVLHCNPFVYQNPEKF

Sbjct:   360 YTERVLKESKRMFPPVPGFQRKLTKDIVIGGITIPSEGNITISPTVLHCNPFVYQNPEKF 419

 

Query:   421 DPDRFLPEECAKRHSYDYIPFSAGLRNCIGQKFSILNEKVMLIHILRNFKLEPKLEFYET 480

             DPDRFLPEECAKRHSYDYIPFSAGLRNCIGQKFSILNEKVMLIHILRNFKLEPKLEFYET

Sbjct:   420 DPDRFLPEECAKRHSYDYIPFSAGLRNCIGQKFSILNEKVMLIHILRNFKLEPKLEFYET 479

 

Query:   481 KPLFEV 486

             KPLFEV

Sbjct:   480 KPLFEV 485

 

CYP35A1

 

C03G6.14; CE07888. This gene is badly  misassembled

MFLVLIFLALSCWLIIRQYQKVSRLPPGP skipped sequence here

KYGNIFTLWVGPVPHVSICDY  50

ETSHEVFVKGANKYADIAHAPLFRELRQEMGVLVTNGSHWSTMKRFALHT 100

FRDMGVGKDLMETRIMEELDARCADTDKSATDGVTVAQAGDFFDLTVGSI 150

INSILVGKRFEEHNKDDFLKIKEAMGAAFEVFSPFDMAVPVWFLRTFFRS 200

RYDMMMTTQNTAKRFAAAEAVK skipped seq here

SIETLKTMIIDLWMTGQETTTTTLISGF 250

TQLLLHPEVMVKAREEILKITENGSRHLSLTDRTSTPYLNAMIGEIQRHA 300

SILNVSFWKINKELTYMGGHPVDAGALVTAQLSALHVNDTIFKNPQEFDP 350

ERFIRDEELLQKVIPFGVGKRSCIGESLARAELYL

C-term is wrong seq, but my C-term may also be wrong

ISLLCICGTSNKSFLHKYLPLWNSVALLLPVKLSRPAFAALISSLPLDKLLPHI

 

>C03G6.14    C13B3.B    CYP35A1 502 aa

yellow is at an intron boundary

MFLVLIFLALSCWLIIRQYQKVSRLPPGPVSFPIIGNLPHIIYYLWATGGIVS

TLDLFRKKYGNIFTLWVGPVPHVSICDYETSHEVFVKGANKYADIAHAPLFR

ELRRW*MGVLVTNGSHWSTMKRFALHTFRDMGVGKDLMETRIMEELDARC

ADTDKSATDGVTVAQAGDFFDLTVGSIINSILVGKRFEEHNKDDFLKIKEAMG

AAFEVFSPFDMAVPVWFLRTFFRSRYDMMMTTQNTAKRFAAAEAVKRIED

IKSGAYEIDESNIEDYTDAFLLKIQKDGEDLDFNIETLKTMIIDLWMTGQETTTT

TLISGFTQLLLHPEVMVKAREEILKITENGSRHLSLTDRTSTPYLNAMIGEIQRHASI

LNVSFWKINKELTYMGGHPVDAGALVTAQLSALHVNDTIFKNPQEFDPERFIRDEE

LLQKVIPFGVGKRSCIGESLARAELYL VRHLQYRISNFLFRLLGTFCSAINSNRMEH

CRQQSCCRIAPEKDHSSWK*

18318 IIGNLLLRYKFEPHGTLSTTELLPYSAGKRPFKLEMKFVKI 18196 correct C-term

 

Compare my 35A1 assembly to 35A2 below

>CYP35A2 C03G6.15    C13B3.A 495 aa

         Length = 496

 

 Score = 2005 (705.8 bits), Expect = 9.2e-211, P = 9.2e-211

 Identities = 386/505 (76%), Positives = 431/505 (85%)

 

Query:     1 MFLVLIFLALSCWLIIRQYQKVSRLPPGPVSFPIIGNLPHIIYYLWATGGIVSTLDLFRK 60

             MF VL F  L  +LI+RQYQKVSRLPPGP+S P+IGNLP IIYYLW+TGGIVSTLDLFRK

Sbjct:     1 MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK 60

 

Query:    61 KYGNIFTLWVGPVPHVSICDYETSHEVFVKGANKYADIAHAPLFRELRRW*MGVLVTNGS 120

             +YGNIFTLWVGP+PHVSI DYETSHEVFVK A KYAD  HAP+ R++R   +GVL+TNG

Sbjct:    61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRSK-IGVLITNGD 119

 

Query:   121 HWSTMKRFALHTFRDMGVGKDLMETRIMEELDARCADTDKSATDGVTVAQAGDFFDLTVG 180

             HW  M+RF+L  FR+MGVGKD+METRIMEELDARC+D DK AT+GVT+  A +FFDLTVG

Sbjct:   120 HWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVG 179

 

Query:   181 SIINSILVGKRFEEHNKDDFLKIKEAMGAAFEVFSPFDMAVPVWFLRTFFRSRYDMMMTT 240

             SIINSILVGKRFEE  K +FLKIKE M A+FE FSPFDM  PVWFL+TFF+ RYD + +

Sbjct:   180 SIINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSA 239

 

Query:   241 QNTAKRFAAAEAVKRIEDIKSGAYEIDESNIEDYTDAFLLKIQKDGEDLDFNIETLKTM 300

             Q TAK FAAAEA+KR+E IKSG Y IDE+N++DYTDAFLLKIQK+GE  DFNIETLKTM

Sbjct:   240 QETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTM 298

 

Query:   301 IIDLWMTGQETTTTTLISGFTQLLLHPEVMVKAREEILKITENGSRHLSLTDRTSTPYLN 360

             IIDLWMTGQETTTTTLISGF QLLLHPEVM+KAREEILKITENGSRHLSLTDRTSTPY+N

Sbjct:   299 IIDLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVN 358

 

Query:   361 AMIGEIQRHASILNVSFWKINKELTYMGGHPVDAGALVTAQLSALHVNDTIFKNPQEFDP 420

             A+IGEIQRHASILNVSFWKINKE TYMGGHPVDAGALVT+QLSALHVN+T+FKNPQEF+P

Sbjct:   359 AVIGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNP 418

 

Query:   421 ERFIRDEELLQKVIPFGVGKRSCIGESLARAELYLVRHLQYRISNFLFRLLGTFCSAINS 480

             ERFIRD +LLQKVIPFGVGKR+C+GESLA+AELYLVR + YR S        TF  A NS

Sbjct:   419 ERFIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLVRFVFYRSSV-------TFSFATNS 471

 

Query:   481 NRMEHCRQQSCCRIAPEKDHSSWK* 505

             N M +    S C  A EKDHSSWK*

Sbjct:   472 NNMANYPLPSLCHTALEKDHSSWK* 496

 

 

CYP35A2

 

>CYP35A2 C03G6.15    C13B3.A 495 aa

         Length = 496

 

 Score = 2342 (824.4 bits), Expect = 1.8e-246, P = 1.8e-246

 Identities = 451/454 (99%), Positives = 453/454 (99%)

 

My C-term may be wrong see below.  By comparison to the briggsae seqs

It looks like all my C-term exons for elegans 35As are wrong.  They may be in an alternative reading frame from the correct sequences. 

 

Query:     1 MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK 60

             MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK

Sbjct:     1 MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK 60

 

Query:    61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRNDIGVLITNGDH 120

             RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVR+ IGVLITNGDH

Sbjct:    61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRSKIGVLITNGDH 120

 

Query:   121 WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS 180

             WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS

Sbjct:   121 WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS 180

 

Query:   181 IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ 240

             IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ

Sbjct:   181 IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ 240

 

Query:   241 ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII 300

             ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII

Sbjct:   241 ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII 300

 

Query:   301 DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV 360

             DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV

Sbjct:   301 DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV 360

 

Query:   361 IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER 420

             IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER

Sbjct:   361 IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER 420

 

Query:   421 FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLI 454

             FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYL+

Sbjct:   421 FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLV 454

 

>C03G6.15    C13B3.A     CYP35A2  495 aa

yellow is at intron boundary

MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVST

LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMR

DVRSKIGVLITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD

IDKLATNGVTITHASEFFDLTVGSIINSILVGKRFEEDTKHEFLKIKETMDASF

ETFSPFDMTAPVWFLKTFFKHRYDKIWSAQETAKNFAAAEAIKRVESIKSGKYVID

ENNLQDYTDAFLLKIQKEGESKDFNIETLKTMIIDLWMTGQETTTTTLISGFNQL

LLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAVIGEIQRHASILNVSFWKIN

KEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPERFIRDGKLLQKVIPF

GVGKRNCLGESLAKAELYL

VRFVFYRSSVTFSFATNSNNMANYPLPSLCHTALEKDHSSWK*

Probable correct C-term =

IFGNLLLRYKFEQHGKLSTTELMPYSAGKRPFKLEMKFVKI*

 

cDNA evidence

>gi|30742705|gb|CB400978.1|CB400978  UniGene info OSTF185G5_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 544

 

 Score =  210 bits (534), Expect = 7e-56

 Identities = 100/102 (98%), Positives = 101/102 (99%)

 Frame = +3

 

Query: 1   LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRSKIGVL 60

           LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVR+ IGVL

Sbjct: 135 LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRNDIGVL 314

 

Query: 61  ITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD 102

           ITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD

Sbjct: 315 ITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD 440

 

>CYP35A.e cb25.fpc0023 76% to CYP35A.b briggsae seq on bottom

          Length = 495

 

 Score = 2054 (723.0 bits), Expect = 5.9e-216, P = 5.9e-216

 Identities = 379/494 (76%), Positives = 445/494 (90%)

 

Query:     1 MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK 60

             MFFVL     L +L+VRQYQKVSRLPPGP+SLPLIGNLPQI+YYL++TGG+VSTLD FRK

Sbjct:     1 MFFVLIVFTFLTWLVVRQYQKVSRLPPGPVSLPLIGNLPQIVYYLYTTGGVVSTLDFFRK 60

 

Query:    61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRNDIGVLITNGDH 120

             RYGNIFTLWVGPIPHVSIADYETSHEVFVKNA KYADKFH+P+ R++R+  G+L  NGDH

Sbjct:    61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNANKYADKFHSPIFREMRSKRGILTANGDH 120

 

Query:   121 WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS 180

             WQEMRRF+L  FRNMGVGKD+METRIMEEL+ARC+DIDK A NGVT T A+EFFDLTVGS

Sbjct:   121 WQEMRRFALFTFRNMGVGKDLMETRIMEELNARCADIDKAAVNGVTTTQAAEFFDLTVGS 180

 

Query:   181 IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ 240

             IINSILVGKRFEE  K++FLKIKE MDASFETFSPFDMT PVW LK FF  R++K+ + Q

Sbjct:   181 IINSILVGKRFEEHNKNDFLKIKEVMDASFETFSPFDMTMPVWILKNFFPRRFEKMRNGQ 240

 

Query:   241 ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII 300

             E++K FAA EA+KR++ IK+G+YVIDENN QDYTDAFL K+QK+GE++D+ +E+LKTMI+

Sbjct:   241 ESSKQFAAKEALKRIDEIKAGRYVIDENNFQDYTDAFLWKMQKDGENEDYKVESLKTMIL 300

 

Query:   301 DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV 360

             DLW+TGQETTTTTLISGFNQLLLH + M KAR E+ +ITENGSR++SL+DR  TPY+NAV

Sbjct:   301 DLWITGQETTTTTLISGFNQLLLHSKFMEKARAELFEITENGSRNVSLSDRPKTPYLNAV 360

 

Query:   361 IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER 420

             IGEIQRHASILNV+FW+ N E TYMGGH VD+GALVT+QLSALHVNETVF++P++F+PER

Sbjct:   361 IGEIQRHASILNVNFWRWNNEPTYMGGHMVDSGALVTAQLSALHVNETVFEHPEKFDPER 420

 

Query:   421 FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLIFGNLLLRYKFEQHGKLSTTELMPYSA 480

             FIRD KLLQKVIPFG+GKR+CLGESLA++ELYLIFGNLLLRYKF+ HG+LST E+MPYSA

Sbjct:   421 FIRDEKLLQKVIPFGLGKRSCLGESLARSELYLIFGNLLLRYKFQPHGELSTREIMPYSA 480

 

Query:   481 GKRPFKLEMKFVKI 494

             GKRPFKLEM+F+K+

Sbjct:   481 GKRPFKLEMQFIKV 494

 

C03G6.15; CE35217. = correct seq

MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGG  50

                            IVSTLDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFH 100

                            APVMRDVRNDIGVLITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEEL 150

                            DARCSDIDKLATNGVTITHASEFFDLTVGSIINSILVGKRFEEDTKHEFL 200

                            KIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQETAKNFAAAE 250

                            AIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII 300

                            DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTD 350

                            RTSTPYVNAVIGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQL 400

                            SALHVNETVFKNPQEFNPERFIRDGKLLQKVIPFGVGKRNCLGESLAKAE 450

                            LYL IFGNLLLRYKFEQHGKLSTTELMPYSAGKRPFKLEMKFVKI

 

CYP33B1

 

>CYP33B1 C25E10.2   U50311 495 aa

         Length = 496

 

 Score = 2536 (892.7 bits), Expect = 4.9e-267, P = 4.9e-267

 Identities = 485/496 (97%), Positives = 489/496 (98%)

 

My seq wrong at green region yellow is intron boundary

 

Query:     1 MILIFFALCTLFFLIHQYLWRRRGLPPGPTPIPIFGNLFQLSGSEAPGISIFQKWKDQYG 60

             MILIFFALCTLFFLIHQYLWRRRGLPPGPTPIPIFGNLFQLSGSEAPGISIFQKWKDQYG

Sbjct:     1 MILIFFALCTLFFLIHQYLWRRRGLPPGPTPIPIFGNLFQLSGSEAPGISIFQKWKDQYG 60

 

Query:    61 PIFTFYMGPVPFVVLTDYQDIKETVIKDGDTYADKYLSPEFNKYFRGGEYGIMDISGDRW 120

             PIFTFYMG  PFVVLTDYQDIKETVIKDGDTYADKYLSPEFNKYFR  ++  + +SGDRW

Sbjct:    61 PIFTFYMGIAPFVVLTDYQDIKETVIKDGDTYADKYLSPEFNKYFREMQFTKL-LSGDRW 119

 

Query:   121 KEHRKFAVLQLRELGVGKPLMESKILIEAEELIKKLKTAEILNEDFLLQSELDVAVGSVI 180

             KEHRKFAVLQLRELGVGKPLMESKILIEAEELIKKLKTAEILNEDFLLQSELDVAVGSVI

Sbjct:   120 KEHRKFAVLQLRELGVGKPLMESKILIEAEELIKKLKTAEILNEDFLLQSELDVAVGSVI 179

 

Query:   181 NQFLFGYRFDRSKLFEFTRIKTLVNNFMEEVGKPLGVLAFTCHGIPSFLVKLMVSGIEEQ 240

             NQFLFGYRFDRSKLFEFTRIKTLVNNFMEEVGKPLGVLAFTCHGIPSFLVKLMVSGIEEQ

Sbjct:   180 NQFLFGYRFDRSKLFEFTRIKTLVNNFMEEVGKPLGVLAFTCHGIPSFLVKLMVSGIEEQ 239

 

Query:   241 KRELFRFLRKQIDGAKSQINYEEEHNEDFVEAYLRKKFQREQKNDFDSYCDSQLENVCFD 300

             KRELFRFLRKQIDGAKSQINYEEEHNEDFVEAYLRKKFQREQKNDFDSYCDSQLENVCFD

Sbjct:   240 KRELFRFLRKQIDGAKSQINYEEEHNEDFVEAYLRKKFQREQKNDFDSYCDSQLENVCFD 299

 

Query:   301 IWAAGFDTLTNTVGFLIAYAINYPEMQMLIHQEIDNYLAHHSRLLTLADKNALVYFNAFA 360

             IWAAGFDTLTNTVGFLIAYAINYPEMQMLIHQEIDNYLAHHSRLLTLADKNALVYFNAFA

Sbjct:   300 IWAAGFDTLTNTVGFLIAYAINYPEMQMLIHQEIDNYLAHHSRLLTLADKNALVYFNAFA 359

 

Query:   361 NEAQRVSNILPMNLPHALTRDVKLKGYHLKKGTGVIHQIANVMTDETIFKDSQRFDPNRF 420

             NEAQRVSNILPMNLPHALTRDVKLKGYHLKKGTGVIHQIANVMTDETIFKDSQRFDPNRF

Sbjct:   360 NEAQRVSNILPMNLPHALTRDVKLKGYHLKKGTGVIHQIANVMTDETIFKDSQRFDPNRF 419

 

Query:   421 IDENGKLKKIEELCPFSMGKRQCIGEGLARMEIFLLAANLFNYFEFLPASDGLPSLYKDF 480

             IDENGKLKKIEELCPFSMGKRQCIGEGLARMEIFLLAANLFNYFEFLPASDGLPSLYKDF

Sbjct:   420 IDENGKLKKIEELCPFSMGKRQCIGEGLARMEIFLLAANLFNYFEFLPASDGLPSLYKDF 479

 

Query:   481 SLVSHVIPYKCRQYIP 496

             SLVSHVIPYKCRQYIP

Sbjct:   480 SLVSHVIPYKCRQYIP 495

 

CYP32A1

 

>CYP32A1 C26F1.2    U53148 508 aa  also Y97E10 contig266

         Length = 522

 

 Score = 1508 (530.8 bits), Expect = 4.3e-158, P = 4.3e-158

 Identities = 283/321 (88%), Positives = 297/321 (92%)

 

Query:     1 MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKWNPY 60

             MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKWNPY

Sbjct:     1 MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKWNPY 60

 

Query:    61 AFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECIRPVLESNT 120

             AFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECIRPVLESNT

Sbjct:    61 AFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECIRPVLESNT 120

 

Query:   121 NISKPSQYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQDYFPVFVRNAEVLADA 180

             NISKPSQYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQDYFPVFVRNAEVLADA

Sbjct:   121 NISKPSQYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQDYFPVFVRNAEVLADA 180

 

Query:   181 VELHVDGDYFDAFPYFKRCTLDIICETAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKF 240

             VELHVDGDYFDAFPYFKRCTLDIICETAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKF

Sbjct:   181 VELHVDGDYFDAFPYFKRCTLDIICETAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKF 240

 

Query:   241 PWLWLKPIWYLTGLGFEFDRNVRMTNNFVRKVDAADFKIENQKNYYYLGIVLP---EAFK 297

             PWLWLKPIWYLTGLGFEFDRNVRMTNNFVRKVDAAD +   +K   +L ++L    E  

Sbjct:   241 PWLWLKPIWYLTGLGFEFDRNVRMTNNFVRKVDAADNEASEKKRKAFLDLLLTIQKEEGT 300

 

Query:   298 INFQVIQERKELLNEDGNEAS 318

             ++ + I+E  +    +G++ +

Sbjct:   301 LSDEDIREEVDTFMFEGHDTT 321

 

 Score = 1184 (416.8 bits), Expect = 9.2e-124, P = 9.2e-124

 Identities = 241/307 (78%), Positives = 254/307 (82%)

 

Query:   252 TGLGFEFDRNVRMTNNFVRKVDAADFKIENQKNYYYLGIV-------LPEAFKINFQVIQ 304

             T +G + +  +   N +V  V      + N   + +L +        L   F  N ++ 

Sbjct:   207 TAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKFPWLWLKPIWYLTGLGFEFDRNVRMTN 266

 

Query:   305 ERKELLNEDGNEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDTFMFEGHDTTSSGIG 364

                  ++   NEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDTFMFEGHDTTSSGIG

Sbjct:   267 NFVRKVDAADNEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDTFMFEGHDTTSSGIG 326

 

Query:   365 FTILWLGFYPECQKKLQKELDEVFGFETNQPPSMDDIKKCSYLEKCIKESLRMFPSVPLI 424

             FTILWLGFYPEC             FETNQPPSMDDIKKCSYLEKCIKESLRMFPSVPLI

Sbjct:   327 FTILWLGFYPEC-------------FETNQPPSMDDIKKCSYLEKCIKESLRMFPSVPLI 373

 

Query:   425 ARRLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWPDPDTYNPDNFDIDAIAGRD 484

             ARRLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWPDPDTYNPDNFDIDAIAGRD

Sbjct:   374 ARRLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWPDPDTYNPDNFDIDAIAGRD 433

 

Query:   485 PYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQTEENLRPVPELILRPYNGM 544

             PYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQTEENLRPVPELILRPYNGM

Sbjct:   434 PYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQTEENLRPVPELILRPYNGM 493

 

Query:   545 KIKIKRREAADYVV 558

             KIKIKRREAADYVV

Sbjct:   494 KIKIKRREAADYVV 507

 

Cyan not in my seq (intron?) C-terms do not agree

Is my seq frameshifted at the end?

C26F1.2; CE32809.

MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVG  50

SAHLFKWNPYAFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGP 100

VPIVFLGTSECIRPVLESNTNISKPSQYDKMSEWIGTGLLTSTHEKWFHR 150

RKMLTPTFHFTIIQDYFPVFVRNAEVLADAVELHVDGDYFDAFPYFKRCT 200

LDIICETAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKFPWLWLKPIWY 250

LTGLGFEFDRNVRMTNNFVRKVDAAD

FKIENQKNYYYLGIVLPEAFKINFQVIQERKELLNEDG

NEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDT 350

FMFEGHDTTSSGIGFTILWLGFYPECQKKLQKELDEVFGFETNQPPSMDD 400

IKKCSYLEKCIKESLRMFPSVPLIARRLSEDVTINHPSGQKIVLPAGLAA 450

CVSPIAAARDPRAWPDPDTYNPDNFDIDAIAGRDPYAYIPFSAGPRNCIG 500

QKFALLEQKTILSTFFRKYEVESLQTEENLRPVPELILRPYNGMKIKIKR 550

REAADYVVL

 

>C26F1.2    U53148    CYP32A1 508 aa  also Y97E10 contig266

 

MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKW

NPYAFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECI

RPVLESNTNISKPSQYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQD

YFPVFVRNAEVLADAVELHVDGDYFDAFPYFKRCTLDIICETAMGIQVNAQL

GHNNEYVHAVKRISEIVWNHMKFPWLWLKPIWYLTGLGFEFDRNVRMTNN

FVRKVDAAD  NEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDTFMFEGHDT

TSSGIGFTILWLGFYPECFETNQPPSMDDIKKCSYLEKCIKESLRMFPSVPLIAR

RLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWPDPDTYNPDNFDID

AIAGRDPYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQTEENLRP

VPELILRPYNGMKIKIKRREAADYVV FTTFFRMKLDECEK*

 

CYP25A1

 

>CYP25A1 C36A4.1    Z66495 496 aa corrected 9/14/98

         Length = 497

 

Score = 1060 (373.1 bits), Expect = 4.0e-244, Sum P(2) = 4.0e-244

 Identities = 212/229 (92%), Positives = 213/229 (93%)

 

Query:     1 MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQTAERKEKLGY 60

             MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQTAERK    

Sbjct:     1 MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQTAERK----- 55

 

Query:    61 DDANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVFIKNFSNFSDRSVPSIYEANQ 120

              DANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVFIKNFSNFSDRSVPSIYEANQ

Sbjct:    56 -DANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVFIKNFSNFSDRSVPSIYEANQ 114

 

Query:   121 LTASLLMNSYSSGWKHTRSAIAPIFSTGKMKAMQETINSKVDLFLDILREKASSGQKWDI 180

             LTASLLMNSYSSGWKHTRSAIAPIFSTGKMKAMQETINSKVDLFLDILREKASSGQKWDI

Sbjct:   115 LTASLLMNSYSSGWKHTRSAIAPIFSTGKMKAMQETINSKVDLFLDILREKASSGQKWDI 174

 

Query:   181 YDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYHPVTVKITINNFTYFHS 229

             YDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFY  V  K  I+N    HS

Sbjct:   175 YDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFY--VNAKKYISNIDIRHS 221

 

 Score = 1283 (451.6 bits), Expect = 4.0e-244, Sum P(2) = 4.0e-244

 Identities = 236/236 (100%), Positives = 236/236 (100%)

 

Query:   275 YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL 334

             YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL

Sbjct:   261 YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL 320

 

Query:   335 LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDCVYKETLRFYPPHFSFIRRLCRED 394

             LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDCVYKETLRFYPPHFSFIRRLCRED

Sbjct:   321 LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDCVYKETLRFYPPHFSFIRRLCRED 380

 

Query:   395 ITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY 454

             ITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY

Sbjct:   381 ITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY 440

 

Query:   455 CVGMRFAEMEFKTTIVKLLDTFELKQFEGEADLIPDCNGVIMRPNDPVRLHLKPRN 510

             CVGMRFAEMEFKTTIVKLLDTFELKQFEGEADLIPDCNGVIMRPNDPVRLHLKPRN

Sbjct:   441 CVGMRFAEMEFKTTIVKLLDTFELKQFEGEADLIPDCNGVIMRPNDPVRLHLKPRN 496

 

C36A4.1; CE03070. cyan not in my seq, green instead

MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQ  50

TAERKEKLGYDDANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVF 100

IKNFSNFSDRSVPSIYEANQLTASLLMNSYSSGWKHTRSAIAPIFSTGKM 150

KAMQETINSKVDLFLDILREKASSGQKWDIYDDFQGLTLDVIGKCAFAID 200

SNCQRDRNDVFY

HPVTVKITINNFTYFHSSSPGTFHFLESTLQIHTTGRCRNSTCRRTVKCVGFRQDKAKFCSD

YERRRGGEGSDSVDLLKLLLNREDDK 300

SKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYLLSKYPNVQQKLYEEIM 350

EAKENGGLTYDSIHNMKYLDCVYKETLRFYPPHFSFIRRLCREDITIRGQ 400

FYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLKWIPFGV 450

GPRYCVGMRFAEMEFKTTIVKLLDTFELKQFEGEADLIPDCNGVIMRPND 500

PVRLHLKPRN                                         510

 

>C36A4.1    Z66495   CYP25A1 496 aa corrected 9/14/98

 

MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQTAERK

DANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVFIKNFSNFSDRSVPSI

YEANQLTASLLMNSYSSGWKHTRSAIAPIFSTGKMKAMQETINSKVDLFLDI

LREKASSGQKWDIYDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFY

VNAKKYISNIDIRHSKIIAASVLLPELSTFWKALYKYTPLADAEIPLVEGLSNV

YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTA

MTYCSYLLSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDCVYKETLRFY

PPHFSFIRRLCREDITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFE

NWEEKSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIVKLLDTFELKQFEGEAD

LIPDCNGVIMRPNDPVRLHLKPRN*

 

My green seq matches other CYP25As see below

 

 

>CYP25A2 C36A4.2    Z66495 496 aa

         Length = 497

 

 Score = 234 (82.4 bits), Expect = 2.3e-22, P = 2.3e-22

 Identities = 44/54 (81%), Positives = 51/54 (94%)

 

Query:     1 VNAKKYISNIDIRHSKIIAASVLLPELSTFWKALYKYTPLADAEIPLVEGLSNV 54

             VNA+K+I+NIDIRHSK+IAAS +LPEL+  W+ALYKYTPLADAEIPLVEGLSNV

Sbjct:   207 VNARKFIANIDIRHSKVIAASFILPELAPLWRALYKYTPLADAEIPLVEGLSNV 260

 

CYP25A2

 

>CYP25A2 C36A4.2    Z66495 496 aa

         Length = 497

 

 Score = 1327 (467.1 bits), Expect = 9.8e-272, Sum P(2) = 9.8e-272

 Identities = 260/266 (97%), Positives = 260/266 (97%)

 

Query:     1 MALLILTSILILLVSFIIYILFARREQFKLREKIGLTGPEPHWFMGNLKQIVDRKEKLGY 60

             MALLILTSILILLVSFIIYILFARREQFKLREKIGLTGPEPHWFMGNLKQIVDRK    

Sbjct:     1 MALLILTSILILLVSFIIYILFARREQFKLREKIGLTGPEPHWFMGNLKQIVDRK----- 55

 

Query:    61 DDSNKWFNELHKQYGETFGIYFGAQMNIVLSNEEDIKEVFIKNFSNFSDRIVPPIFDSNQ 120

              DSNKWFNELHKQYGETFGIYFGAQMNIVLSNEEDIKEVFIKNFSNFSDRIVPPIFDSNQ

Sbjct:    56 -DSNKWFNELHKQYGETFGIYFGAQMNIVLSNEEDIKEVFIKNFSNFSDRIVPPIFDSNQ 114

 

Query:   121 LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMQETIHSKVDLFLDILREKASSGQKWDI 180

             LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMQETIHSKVDLFLDILREKASSGQKWDI

Sbjct:   115 LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMQETIHSKVDLFLDILREKASSGQKWDI 174

 

Query:   181 YEDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYVNARKFIANIDIRHSKVIAASFILPELA 240

             YEDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYVNARKFIANIDIRHSKVIAASFILPELA

Sbjct:   175 YEDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYVNARKFIANIDIRHSKVIAASFILPELA 234

 

Query:   241 PLWRALYKYTPLADAEIPLVEGLSNV 266

             PLWRALYKYTPLADAEIPLVEGLSNV

Sbjct:   235 PLWRALYKYTPLADAEIPLVEGLSNV 260

 

 Score = 1277 (449.5 bits), Expect = 9.8e-272, Sum P(2) = 9.8e-272

 Identities = 236/236 (100%), Positives = 236/236 (100%)

 

Query:   287 YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL 346

             YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL

Sbjct:   261 YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL 320

 

Query:   347 LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFINRRCLAD 406

             LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFINRRCLAD

Sbjct:   321 LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFINRRCLAD 380

 

Query:   407 ITIRGQFYPKGSVVTCLPHTVHLNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY 466

             ITIRGQFYPKGSVVTCLPHTVHLNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY

Sbjct:   381 ITIRGQFYPKGSVVTCLPHTVHLNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY 440

 

Query:   467 CVGMRFAEMEFKTTIVKLLDTFELKQFKGEADLIPDCNGVIMRPKDPVRLLLKPRN 522

             CVGMRFAEMEFKTTIVKLLDTFELKQFKGEADLIPDCNGVIMRPKDPVRLLLKPRN

Sbjct:   441 CVGMRFAEMEFKTTIVKLLDTFELKQFKGEADLIPDCNGVIMRPKDPVRLLLKPRN 496

 

C36A4.2; CE03071. cyan not in my seq extra intron seq?

MALLILTSILILLVSFIIYILFARREQFKLREKIGLTGPEPHWFMGNLKQ  50

IVDRKEKLGYDDSNKWFNELHKQYGETFGIYFGAQMNIVLSNEEDIKEVF 100

IKNFSNFSDRIVPPIFDSNQLNQSLLQNTYATGWKHTRSAIAPIFSTGKM 150

KAMQETIHSKVDLFLDILREKASSGQKWDIYEDFQGLTLDVIGKCAFAID 200

SNCQRDRNDVFYVNARKFIANIDIRHSKVIAASFILPELAPLWRALYKYT 250

PLADAEIPLVEGLSNV

TPFLLENFCSFSLILTKIFS

YERRRGGEGSDSVD 300

LLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYLLSKY 350

PNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFIN 400

RRCLADITIRGQFYPKGSVVTCLPHTVHLNPENWDSPEEFHPERFENWEE 450

KSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIVKLLDTFELKQFKGEADLI 500

PDCNGVIMRPKDPVRLLLKPRN

 

 

CYP35C1

 

C06B3.3; CE27662. CYP35C1

MFFILLFVSIISFLTARQFLKAKRLPPGPFSLPLIGNAHQVGYQLWRTGG  50

VTNMLNHFRKEYGDIFTLWLGPIPHVNITNYELSHEVFVKNSTKYADKHV 100

SPMIDYVRKGNGVFFSNGDKWQELRRFSMLTMRNMGMGRDLMEEKILSEL 150

DARCAEINEKSIDGTTVLQVNEFFDLTVGSIINNMLLGFRFDERTKSRFL 200

TMKHMFDEGMDKMTPLFFTLPVWALQKFLAKDFNSIIKDQFEIIDYVSVD 250

AIKRSRDFMNGDYEIDPNNVEDIVDAFLLKMKQNPKSDVYDENNLKMLIT 300

DLWITGQETTTTTLVSAFIQFLNNPQVMDTVQKELIKVTNGGSRQLSLRD 350

KTETPYLNATIAEVQRHASILNINFWRINNEPTVIGGHPVDSGCLIASQL 400

SALHTNEKIFENPEKFNPERFIRNENLMQQTIPFGIGKRSCLGESLARAE 450

LYLIIGNLLLRYNFESSGKMPSTRETVPFGFAKRCEAFDMKVTKI

 

>CYP35C1 C06B3.3    Z77652 508 aa

         Length = 509

 

 Score = 2351 (827.6 bits), Expect = 3.3e-249, Sum P(2) = 3.3e-249

 Identities = 452/457 (98%), Positives = 454/457 (99%)

 

M C-term is wrong

 

Query:     1 MFFILLFVSIISFLTARQFLKAKRLPPGPFSLPLIGNAHQVGYQLWRTGGVTNMLNHFRK 60

             MFFILLFVSIISFLTARQFLKAKRLPPGPFSLPLIGNAHQVGYQLWRTGGVTNMLNHFRK

Sbjct:     1 MFFILLFVSIISFLTARQFLKAKRLPPGPFSLPLIGNAHQVGYQLWRTGGVTNMLNHFRK 60

 

Query:    61 EYGDIFTLWLGPIPHVNITNYELSHEVFVKNSTKYADKHVSPMIDYVRKGNGVFFSNGDK 120

             EYGDIFTLWLGPIPHVNITNYELSHEVFVKNSTKYADKHVSPMIDYVRKGNGVFFSNGDK

Sbjct:    61 EYGDIFTLWLGPIPHVNITNYELSHEVFVKNSTKYADKHVSPMIDYVRKGNGVFFSNGDK 120

 

Query:   121 WQELRRFSMLTMRNMGMGRDLMEEKILSELDARCAEINEKSIDGTTVLQVNEFFDLTVGS 180

             WQELRRFSMLTMRNMGMGRDLMEEKILSELDARCAEINEKSIDGTTVLQVNEFFDLTVGS

Sbjct:   121 WQELRRFSMLTMRNMGMGRDLMEEKILSELDARCAEINEKSIDGTTVLQVNEFFDLTVGS 180

 

Query:   181 IINNMLLGFRFDERTKSRFLTMKHMFDEGMDKMTPLFFTLPVWALQKFLAKDFNSIIKDQ 240

             IINNMLLGFRFDERTKSRFLTMKHMFDEGMDKMTPLFFTLPVWALQKFLAKDFNSIIKDQ

Sbjct:   181 IINNMLLGFRFDERTKSRFLTMKHMFDEGMDKMTPLFFTLPVWALQKFLAKDFNSIIKDQ 240

 

Query:   241 FEIIDYVSVDAIKRSRDFMNGDYEIDPNNVEDIVDAFLLKMKQNPKSDVYD---ENNLKM 297

             FEIIDYVSVDAIKRSRDFMNGDYEIDPNNVEDIVDAFLLKMKQNPKSDVY+   ENNLKM

Sbjct:   241 FEIIDYVSVDAIKRSRDFMNGDYEIDPNNVEDIVDAFLLKMKQNPKSDVYESFSENNLKM 300

 

Query:   298 LITDLWITGQETTTTTLVSAFIQFLNNPQVMDTVQKELIKVTNGGSRQLSLRDKTETPYL 357

             LITDLWITGQETTTTTLVSAFIQFLNNPQVMDTVQKELIKVTNGGSRQLSLRDKTETPYL

Sbjct:   301 LITDLWITGQETTTTTLVSAFIQFLNNPQVMDTVQKELIKVTNGGSRQLSLRDKTETPYL 360

 

Query:   358 NATIAEVQRHASILNINFWRINNEPTVIGGHPVDSGCLIASQLSALHTNEKIFENPEKFN 417

             NATIAEVQRHASILNINFWRINNEPTVIGGHPVDSGCLIASQLSALHTNEKIFENPEKFN

Sbjct:   361 NATIAEVQRHASILNINFWRINNEPTVIGGHPVDSGCLIASQLSALHTNEKIFENPEKFN 420

 

Query:   418 PERFIRNENLMQQTIPFGIGKRSCLGESLARAELYLI 454

             PERFIRNENLMQQTIPFGIGKRSCLGESLARAELYL+

Sbjct:   421 PERFIRNENLMQQTIPFGIGKRSCLGESLARAELYLV 457

 

CYP31A1P

 

>gi|17538369|ref|NM_069185.1| Caenorhabditis elegans cytochrome P450 (ccp-31A1), mRNA

          Length = 1269

 

 Score =  780 bits (2014), Expect = 0.0

 Identities = 407/487 (83%), Positives = 412/487 (84%), Gaps = 1/487 (0%)

 Frame = +1

 

By attempting to make this into a functional gene without gaps or stop codons the actual sequence has been asssembled incorrectly as an mRNA

 

Query: 1   MGVIILAVLLASATVIAWLLYKHLRMRQALKHLNQPRSYPIIGHGLITKPDPEGFMNQVI 60

           MGVIILAVLLASATVIAWLLYKHLRMRQALKHLNQPRSYPIIGHGLITKPDPEGFMNQVI

Sbjct: 1   MGVIILAVLLASATVIAWLLYKHLRMRQALKHLNQPRSYPIIGHGLITKPDPEGFMNQVI 180

 

Query: 61  GMGYLYPDPRMCLLWIGPFPCLMLYSGDLVEAIFSSTKHLNRGFAYVLLEPWLGISILTS 120

           GMGYLYPDPRMCLLWIGPFPCLMLYSGDLVEAIFSSTKHLNR                 

Sbjct: 181 GMGYLYPDPRMCLLWIGPFPCLMLYSGDLVEAIFSSTKHLNR------------------ 306

 

Query: 121 QKEQWRLXXXXXXXXXXXXXXXXXXXXXXXXXKILVQKLCCLGADERVDVLSVIALCTLD 180

                                            ILVQKLCCLGADERVDVLSVIALCTLD

Sbjct: 307 ---------------------------------ILVQKLCCLGADERVDVLSVIALCTLD 387

 

Query: 181 IICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLMTEDGRTHEKCLHIFHDFTK 240

           IICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLM            I++ + 

Sbjct: 388 IICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNS--------FIYNLYGS 543

 

Query: 241 KLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSG-MDETDVQAEGNTFMLEGHDTTSTGL 299

            +I +      ENDYKMEGRLAFLDLLLEMVNSG MDETDVQAEGNTFMLEGHDTTSTGL

Sbjct: 544 FIINK------ENDYKMEGRLAFLDLLLEMVNSGQMDETDVQAEGNTFMLEGHDTTSTGL 705

 

Query: 300 MWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVLIIT 359

           MWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVLIIT

Sbjct: 706 MWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVLIIT 885

 

Query: 360 RELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFHPENSIGRKSFAFIP 419

           RELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFHPENSIGRKSFAFIP

Sbjct: 886 RELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFHPENSIGRKSFAFIP 1065

 

Query: 420 FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR 479

           FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR

Sbjct: 1066FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR 1245

 

Query: 480 RRPIASP 486

           RRPIASP

Sbjct: 1246RRPIASP 1266

 

Compare to 31A2

 

>CYP31A2 F22B3 Z68336 487 aa

         Length = 488

 

 Score = 2236 (787.1 bits), Expect = 3.0e-235, P = 3.0e-235

 Identities = 435/488 (89%), Positives = 442/488 (90%)

 

Query:     1 MGVIILAVLLASATVIAWLLYKHLRMRQALKHLNQPRSYPIIGHGLITKPDPEGFMNQVI 60

             MGVII AVLLA ATVIAWLLYKHLRMRQ LKHLNQPRSYPI+GHGLITKPDPEGFMNQVI

Sbjct:     1 MGVIIPAVLLAMATVIAWLLYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFMNQVI 60

 

Query:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSGDLVEAIFSSTKHLNRGFAYVLLEPWLGISILTS 120

             GMGYLYPDPRMCLLWIGPFPCLMLYS DLVE IFSSTKHLN+GFAYVLLEPWLGISILTS

Sbjct:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

 

Query:   121 QKEQWRLXXXXXXXXXXXXXXXXXXXXXXXXXKILVQKLCCLGADERVDVLSVIALCTLD 180

             QKEQWR                          KILVQKLCCLGADE VDVLSVI LCTLD

Sbjct:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKLCCLGADEEVDVLSVITLCTLD 180

 

Query:   181 IICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLMTEDGRTHEKCLHIFHDFTK 240

             IICETSMG AIGAQLAENNEYVWAVHTINKLISKRTNNPL+TEDGRTHEKCL I HDFTK

Sbjct:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK 240

 

Query:   241 KLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSG-MDETDVQAEGNTFMLEGHDTTSTGL 299

             K+I ERK ALQENDYKMEGRLAFLDLLLEMV SG MDETDVQAE +TFM EGHDTTSTGL

Sbjct:   241 KVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGL 300

 

Query:   300 MWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVLIIT 359

             MWA+HLLGNHP+VQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSV IIT

Sbjct:   301 MWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIIT 360

 

Query:   360 RELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFHPENSIGRKSFAFIP 419

             RELSDDQVIGG NIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRF PENSIGRKSFAFIP

Sbjct:   361 RELSDDQVIGGVNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFLPENSIGRKSFAFIP 420

 

Query:   420 FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR 479

             FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR

Sbjct:   421 FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR 480

 

Query:   480 RRPIASP* 487

             RRPI SP*

Sbjct:   481 RRPIVSP* 488

 

compare to cDNA

>gi|18246951|dbj|BJ104281.1|BJ104281  UniGene info BJ104281 unpublished oligo-capped cDNA library, C. elegans L1 stage

           Caenorhabditis elegans cDNA clone yk1059h10 5'.

          Length = 601

 

 Score =  164 bits (415), Expect = 4e-42

 Identities = 84/100 (84%), Positives = 86/100 (86%), Gaps = 8/100 (8%)

 Frame = +1

 

Query: 14  DVLSVITLCTLDIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL-------- 65

           DVLSVI LCTLDIICETSMG AIGAQLAENNEYVWAVHTINKLISKRTNNPL       

Sbjct: 1   DVLSVIALCTLDIICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNSFIYN 180

 

Query: 66  ITEDGRTHEKCLRILHDFTKKVIVERKEALQENDYKMEGR 105

           +TEDGRTHEKCL I HDFTKK+I ERK ALQENDYKMEGR

Sbjct: 181 LTEDGRTHEKCLHIFHDFTKKLIGERK*ALQENDYKMEGR 300

 

 

>gi|1122764|emb|Z68213.1|CEC01F6 Caenorhabditis elegans cosmid C01F6, complete sequence

          Length = 31281

 

 

Score = 61.6 bits (148), Expect(5) = 0.0

 Identities = 31/31 (100%), Positives = 31/31 (100%)

 Frame = -2

 

Query: 1    MGVIILAVLLASATVIAWLLYKHLRMRQALK 31

            MGVIILAVLLASATVIAWLLYKHLRMRQALK

Sbjct: 8876 MGVIILAVLLASATVIAWLLYKHLRMRQALK 8784

 

 

 Score =  162 bits (410), Expect(5) = 0.0

 Identities = 85/135 (62%), Positives = 89/135 (65%)

 Frame = -3

 

Query: 31   KHLNQPRSYPIIGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSGDLV 90

            +HLNQPRSYPIIGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSGDLV

Sbjct: 8740 QHLNQPRSYPIIGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSGDLV 8561

 

Query: 91   EAIFSSTKHLNRGFAYVLLEPWLGISILTSQKEQWRLXXXXXXXXXXXXXXXXXXXXXXX 150

            EAIFSSTKHLNR            + I  +    W                        

Sbjct: 8560 EAIFSSTKHLNR------------VRIRFA*TMAWN-------QYFDEPERTMASFFDFL 8438

 

Query: 151  XXKILVQKLCCLGAD 165

              KILVQKLCCLGA+

Sbjct: 8437 LLKILVQKLCCLGAE 8393

 

Score = 55.8 bits (133), Expect = 6e-09

 Identities = 25/25 (100%), Positives = 25/25 (100%)

 Frame = -1

 

Query: 103  GFAYVLLEPWLGISILTSQKEQWRL 127

            GFAYVLLEPWLGISILTSQKEQWRL

Sbjct: 8526 GFAYVLLEPWLGISILTSQKEQWRL 8452

 

 Score = 63.9 bits (154), Expect(5) = 0.0

 Identities = 32/32 (100%), Positives = 32/32 (100%)

 Frame = -2

 

Query: 166  ERVDVLSVIALCTLDIICETSMGIAIGAQLAE 197

            ERVDVLSVIALCTLDIICETSMGIAIGAQLAE

Sbjct: 8345 ERVDVLSVIALCTLDIICETSMGIAIGAQLAE 8250

 

Score = 56.2 bits (134), Expect(5) = 0.0

 Identities = 25/27 (92%), Positives = 26/27 (96%)

 Frame = -1

 

Query: 195  LAENNEYVWAVHTINKLISKRTNNPLM 221

            L +NNEYVWAVHTINKLISKRTNNPLM

Sbjct: 8211 LFQNNEYVWAVHTINKLISKRTNNPLM 8131

 

Score =  480 bits (1236), Expect(5) = 0.0

 Identities = 238/240 (99%), Positives = 239/240 (99%), Gaps = 1/240 (0%)

 Frame = -3

 

Query: 222  TEDGRTHEKCLHIFHDFTKKLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSG-MDETDV 280

            TEDGRTHEKCLHIFHDFTKKLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSG MDETDV

Sbjct: 8059 TEDGRTHEKCLHIFHDFTKKLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSGQMDETDV 7880

 

Query: 281  QAEGNTFMLEGHDTTSTGLMWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKY 340

            QAEGNTFMLEGHDTTSTGLMWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKY

Sbjct: 7879 QAEGNTFMLEGHDTTSTGLMWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKY 7700

 

Query: 341  LECALKEALRLFPSVLIITRELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFD 400

            LECALKEALRLFPSVLIITRELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFD

Sbjct: 7699 LECALKEALRLFPSVLIITRELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFD 7520

 

Query: 401  PDRFHPENSIGRKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVR 460

            PDRFHPENSIGRKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEV+

Sbjct: 7519 PDRFHPENSIGRKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVK 7340

 

 Score = 60.5 bits (145), Expect = 4e-09

 Identities = 28/29 (96%), Positives = 29/29 (100%)

 Frame = -2

 

Query: 458  EVRPKMEIIVRPVTPIHMKLTRRRPIASP 486

            +VRPKMEIIVRPVTPIHMKLTRRRPIASP

Sbjct: 6887 QVRPKMEIIVRPVTPIHMKLTRRRPIASP 6801

 

F23A7.3 fusion to CYP29A1 (bogus)

 

Sequence F23A7.3 matches this sequence NM_077558.  This seq is a fusion protein with P450 CYP29A1 on the bottom and some unknown sequence on the top. F23A7.3 matches the top only (CYAN).  I think this may be an artifactual fusion.  This sequence should be unfused and the annotation of F23A7.3 as a P450 should be stopped.  Note that the first exon of 29A1 has been skipped in this fusion.

 

These genes are not linked in C. briggsae

 

 

NM_077558               2199 bp    mRNA    linear   INV 21-NOV-2003

DEFINITION  Caenorhabditis elegans cytochrome 450 family member (XM328), mRNA.

MVRKSFKRNHSSTAEDDNKDFDIKKVSDGLSSSISEPFVVANFP

KNKQWDIDEMTTNEKYIAYLPTVVKVTPAHKKRLLMNAELNNLFENLRTHKTGPLDTE

IDGANFVAIEKDVEPHLIRKIQLLPNGMLKLIWPVLPRNEYESNSNNQEEIDATKKEL

SETKKQLETERQKSIESQRFLEKSCNEKVSKLAVENQKLKNKMAQKSDETDSAYRMIL

ELKKKHEQELQMKNSTIMKLEDNV KFNNSSNEQFSKMTTENQQLKDWMATKTKEVEHA

AKMMLELKQKHIEELQMKTTQITKLENDVKLNGNFMDHISELTEENHNLKEQLKQKTD

AETKMIAEMKKKHRGERHYMNSRITKLEDALEYKDNQLSKREIAHNKLLDQLS

EITEIFRQTADETRSQGKSVMKYHILGKLYVWPLDGKTIAKLVESTTELNKGDDYNFFLPWLG

GGVLVEGFGERWRTHRKLLTPTFHFAKLEGYLEVFYSETKIMIEHLEKFADNEETVDM

FPYIKRCALDIICGAAIGTKINAQMHHNHPYVKAVEGFNSMAISHAINPSYQIPAIYW

ALGLKKQKDAHLNTMKTFTVNVIADRKAAIASGEVEKETSKRKMNFLDILLN N

Missing sequence of CYP29A1 here.

GMTIP

SGANVSIAPLALHSNAQVFPNPNKFDPDRFLPDEIAKRNAYDFMPFSAGLRNCIGEKF

ALLNEKVMMIHILKNFRLEPMGGFYSTKPMFEAVARPSNGISVKLIRRQF"

 

>CYP29A1 C44C10.2 CE05409    Z69787 502 aa also Y102F5 AL022276 61000-63000 region

MALILIIILICLLSFKPWSWKTIQLIXKYRQYDDKIPGPPTHPIFGNTETFKNKSQT

EITEIFRQTADETRSQGKSVMKYHILGKLYVWPLDGKTIAKLVESTTELNKGDDYNFF

LPWLGGGVLVEGFGERWRTHRKLLTPTFHFAKLEGYLEVFYSETKIMIEHLEKFA

DNEETVDMFPYIKRCALDIICGAAIGTKINAQMHHNHPYVKAVEGFNSMAISHAIN

PSYQIPAIYWALGLKKQKDAHLNTMKTFTVNVIADRKAAIASGEVEKETSKRKMN

FLDILLN SEESNSLTSEDIRQEVDTFMFAGHDTTTTSVSWACWNLAHNPDIQEKVY

EEIVHIFGEEPNDEVTSEGISXLEYTERVLNESKRIIAPVPSLQRKLINDMEFG GMTIP

SGANVSIAPLALHSNAQVFPNPNKFDPDRFLPDEIAKRNAYDFMPFSAGLRNCIGE

KFALLNEKVMMIHILKNFRLEPMGGFYSTKPMFEAVARPSNGISVKLIRRQF*

 

F23A7.3; CE09577.

MVRIICKRKTAEDKPNAFEFKKVNKGLTSPSDGPFVIAHFPKNMIWEIDQ  50

ETSNEKYIAYFPTIVSSPDTKRLMENDQLKMLYEKLRDHKSGPLKTQIDG 100

TNFVAVEQDAEPHQIESIQLLPNGNLKLYWPVICRDELEEDTNKEFNEMK 150

KDLYETKRELTEVRISLMNLEAEHRTLQTNCNRKDTQLFERHEEIQQLKD 200

HIQWKSKDTEVAAKMVFQMQNKHKEEVEKYTSTISKLKEEV VAKDKEIVE 250

NKTANDNMVSALLGLINKQRRVGT

 

>gi|17550997|ref|NM_077558.1| Caenorhabditis elegans cytochrome 450 family member (XM328), mRNA

          Length = 2199

 

 Score =  208 bits (530), Expect = 2e-52

 Identities = 117/248 (47%), Positives = 167/248 (67%), Gaps = 7/248 (2%)

 Frame = +1

 

Query: 1   MVRIICKRK---TAEDKPNAFEFKKVNKGLTSPSDGPFVIAHFPKNMIWEIDQETSNEKY 57

           MVR   KR    TAED    F+ KKV+ GL+S    PFV+A+FPKN  W+ID+ T+NEKY

Sbjct: 1   MVRKSFKRNHSSTAEDDNKDFDIKKVSDGLSSSISEPFVVANFPKNKQWDIDEMTTNEKY 180

 

Query: 58  IAYFPTIV--SSPDTKRLMENDQLKMLYEKLRDHKSGPLKTQIDGTNFVAVEQDAEPHQI 115

           IAY PT+V  +    KRL+ N +L  L+E LR HK+GPL T+IDG NFVA+E+D EPH I

Sbjct: 181 IAYLPTVVKVTPAHKKRLLMNAELNNLFENLRTHKTGPLDTEIDGANFVAIEKDVEPHLI 360

 

Query: 116 ESIQLLPNGNLKLYWPVICRDELEEDTN--KEFNEMKKDLYETKRELTEVRISLMNLEAE 173

             IQLLPNG LKL WPV+ R+E E ++N  +E +  KK+L ETK++L   R    ++E++

Sbjct: 361 RKIQLLPNGMLKLIWPVLPRNEYESNSNNQEEIDATKKELSETKKQLETER--QKSIESQ 534

 

Query: 174 HRTLQTNCNRKDTQLFERHEEIQQLKDHIQWKSKDTEVAAKMVFQMQNKHKEEVEKYTST 233

            R L+ +CN K ++L     E Q+LK+ +  KS +T+ A +M+ +++ KH++E++   ST

Sbjct: 535 -RFLEKSCNEKVSKL---AVENQKLKNKMAQKSDETDSAYRMILELKKKHEQELQMKNST 702

 

Query: 234 ISKLKEEV 241

           I KL++ V

Sbjct: 703 IMKLEDNV 726

 

CYP31A2

 

>gi|4263218|gb|AC006720.1|  Download subject sequence spanning the HSP Caenorhabditis elegans cosmid Y17G9B, complete sequence

          Length = 19620

 

 Score =  223 bits (567), Expect = 3e-59

 Identities = 119/152 (78%), Positives = 120/152 (78%), Gaps = 31/152 (20%)

 Frame = +3

 

Query: 1     DVLSVITLCTLDIICETSMGKAIGAQLAE----------------NNEYVWAVHTINKLI 44

             DVLSVITLCTLDIICETSMGKAIGAQLAE                NNEYVWAVHTINKLI

Sbjct: 12522 DVLSVITLCTLDIICETSMGKAIGAQLAEVREILQSQHF*TFFFQNNEYVWAVHTINKLI 12701

 

Query: 45    SKRTNNPLMWNSFIYNII---------------TEDGRTHEKCLRILHDFTKKVIVERKE 89

             SKRTNNPLMWNSFIYN+                TEDGRTHEKCLRILHDFTKKVIVERKE

Sbjct: 12702 SKRTNNPLMWNSFIYNLYDSFIIKKVNSILFFRTEDGRTHEKCLRILHDFTKKVIVERKE 12881

 

Y62E10A.15 = CYP31A5P (a new pseudogene)

 

This seq is almost identical to CYP31A2 (only 4 amino acid differences, plus some minor errors at intron boundaries.  There is a large fragment missing in the middle

My conclusion is that this is a very recent pseudogene of CYP31A2 = CYP31A5P

 

>CYP31A2 F22B3 Z68336 487 aa

         Length = 488

 

 Score = 1239 (436.1 bits), Expect = 4.8e-143, Sum P(2) = 4.8e-143

 Identities = 237/251 (94%), Positives = 240/251 (95%)

 

Query:     1 MGVIIPAVLLASATVIAWLIYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFMNQVI 60

             MGVIIPAVLLA ATVIAWL+YKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFMNQVI

Sbjct:     1 MGVIIPAVLLAMATVIAWLLYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFMNQVI 60

 

Query:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

             GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS

Sbjct:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

 

Query:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILIQKLCCLGVADEEVDVLSVITLCTL 180

             QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKIL+QKLCCLG ADEEVDVLSVITLCTL

Sbjct:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKLCCLG-ADEEVDVLSVITLCTL 179

 

Query:   181 DIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNSFIYNLTEDGRTHEKC 240

             DIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL        +TEDGRTHEKC

Sbjct:   180 DIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL--------ITEDGRTHEKC 231

 

Query:   241 LHILHDFTKKV 251

             L ILHDFTKKV

Sbjct:   232 LRILHDFTKKV 242

 

 Score = 148 (52.1 bits), Expect = 4.8e-143, Sum P(2) = 4.8e-143

 Identities = 30/40 (75%), Positives = 34/40 (85%)

 

Query:   239 KCLHILHDFTKKVRPKMEIIVRPVTPIHMKLTRRRPIVSP 278

             K + ++H+    VRPKMEIIVRPVTPIHMKLTRRRPIVSP

Sbjct:   452 KAVELMHE----VRPKMEIIVRPVTPIHMKLTRRRPIVSP 487

 

>CYP31A2 F22B3 Z68336 487 aa

MGVIIPAVLLAMATVIAWLLYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFM

NQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEP

WLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKLCCLGADEEV

DVLSVITLCTLDIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDG

RTHEKCLRILHDFTKK VIVERKEALQENDYKMEGRLAFL DLLLEMVKSGQMDETD

VQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEH

LSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPA

QWKDPDVFDPDRFLPENSIGRKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRN

FNIKAVELMHE VRPKMEIIVRPVTPIHMKLTRRRPIVSP*

 

>gi|6425490|emb|AL132865.1|CEY62E10A Caenorhabditis elegans YAC Y62E10A, complete sequence

          Length = 57237

 

Score = 60.1 bits (144), Expect(3) = e-112

 Identities = 29/31 (93%), Positives = 30/31 (96%)

 Frame = +3

 

Query: 1     MGVIIPAVLLAMATVIAWLLYKHLRMRQVLK 31

             MGVIIPAVLLA ATVIAWL+YKHLRMRQVLK

Sbjct: 42477 MGVIIPAVLLASATVIAWLIYKHLRMRQVLK 42569

 

 Score =  285 bits (730), Expect(3) = e-112

 Identities = 134/145 (92%), Positives = 139/145 (95%), Gaps = 1/145 (0%)

 Frame = +1

 

Query: 23    HLRMRQV-LKHLNQPRSYPIVGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPC 81

             HL++  +  +HLNQPRSYPIVGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPC

Sbjct: 42586 HLKIN*INFQHLNQPRSYPIVGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPC 42765

 

Query: 82    LMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDIL 141

             LMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDIL

Sbjct: 42766 LMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDIL 42945

 

Query: 142   KDFLPIFNEQSKILVQKLCCLGADE 166

             KDFLPIFNEQSKIL+QKLCCLG  E

Sbjct: 42946 KDFLPIFNEQSKILIQKLCCLGVAE 43020

 

 

 Score =  102 bits (254), Expect(3) = e-112

 Identities = 55/73 (75%), Positives = 56/73 (76%), Gaps = 17/73 (23%)

 Frame = +3

 

Query: 166   EEVDVLSVITLCTLDIICETSMGKAIGAQLAE-----------------NNEYVWAVHTI 208

             EEVDVLSVITLCTLDIICETSMGKAIGAQLAE                 NNEYVWAVHTI

Sbjct: 43068 EEVDVLSVITLCTLDIICETSMGKAIGAQLAEVREILPSI*ISKCFFFQNNEYVWAVHTI 43247

 

Query: 209   NKLISKRTNNPLI 221

             NKLISKRTNNPL+

Sbjct: 43248 NKLISKRTNNPLM 43286

 

 

 Score = 84.3 bits (207), Expect = 2e-16

 Identities = 39/42 (92%), Positives = 41/42 (97%)

 Frame = +1

 

Query: 222   TEDGRTHEKCLRILHDFTKKVIVERKEALQENDYKMEGRLAF 263

             TEDGRTHEKCL ILHDFTKKVIVERKEALQ++DYKMEGRLAF

Sbjct: 43885 TEDGRTHEKCLHILHDFTKKVIVERKEALQDSDYKMEGRLAF 44010

 

This gene missing 195 amino acids here

 

 

 Score = 60.5 bits (145), Expect = 4e-09

 Identities = 28/29 (96%), Positives = 29/29 (100%)

 Frame = +1

 

Query: 459   EVRPKMEIIVRPVTPIHMKLTRRRPIVSP 487

             +VRPKMEIIVRPVTPIHMKLTRRRPIVSP

Sbjct: 44824 QVRPKMEIIVRPVTPIHMKLTRRRPIVSP 44910

 

Y62E10A.15; CE28717.

MGVIIPAVLLASATVIAWLIYKHLRMRQVLK (0)

HLNQPRSYPIVGHGLITKP  50

DPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHL 100

NKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNE 150

QSKILIQKLCCLGVAD (2)

EEVDVLSVITLCTLDIICETSMGKAIGAQLAE (0)

NNEYVWAVHTINKLISKRTNNPI (2)

(LMWNSFIYNL) delete this seq

TEDGRTHEKCLHILHDFTKK 250

VIVERKEALQDSDYKMEGRLAF (add this seq)

(sequence gap)

VRPKMEIIVRPVTPIHMKLTRRRPIVSP*

 

New pseudogene seq = CYP31A5P

MGVIIPAVLLASATVIAWLIYKHLRMRQVLK (0)

HLNQPRSYPIVGHGLITKP

DPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHL

NKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNE

QSKILIQKLCCLGVAD (2)

EEVDVLSVITLCTLDIICETSMGKAIGAQLAE (0)

NNEYVWAVHTINKLISKRTNNPI (2)

TEDGRTHEKCLHILHDFTKK

VIVERKEALQDSDYKMEGRLAF

(sequence gap)

VRPKMEIIVRPVTPIHMKLTRRRPIVSP*

 

Y5H2B.5 = CYP32B1

 

Y5H2B.5; CE26222.

MLAAALVLLLTYFAYLIFRQKDDILQFLHVRKICTREFKKLRGPPAVLIFGT

LWYFKKDPVEMVYQAQAWFSEYTLAPDNCGVLKVAVNVEVS (0)

LGFQLWLGPIPAVNIARGEIAK ()

IVLDSSVNISKSSQYNKLKEWIGDGLLI ()

STGDKWRSRRKMLTQTFHF ()

AVLKEYQKIFGAQGKILVEVLQLRANNKFSFDIMPYIKRCALDIICETAM

GCSISSQRGANDEYVNSVRRLSEIVWNYEKAPQFWLKPIWYLFGDGFEFN

RHVKLTTDFTRDVIENRKKELKTHNSEQNETKKLAFLDYLLKSQEEHPDI

LTDEGIREEVDTFMFEGHDTTSSGITFAVWFLGQFPEYQQRVHDELDEIF

GEDFERIPNSEDIQKMVYLEQCIKETLRMTPPVPFVSRKLTEDVKI ()

PHATKPDLLLPAGINCMINIITIMKDARYFERPYEFFPEHFSPERVAAREPFAFVPFSAGPRNCI ()

GQKFALLEEKVLLSWIFRNFTVTSMTKFPEEMPIPELILKPQFGTQVLLRNRRKL

 

>EMBOSS_001_4

EHFGISKRILLVGTNFYILNTFKYFRNGIPSPGMVQRVHTGSG*LWCVEGSGKC*SFSVK

LGFQLWLGPIPAVNIARGEI

>EMBOSS_001_5

TLWYFKKDPV GRN*FLYT*HF*IF*KWYTKPRHGSASTHWLRITVVC*R*R*MLKFQCEI

GFSVMVGTNTSCQHCERGDX

>EMBOSS_001_6

NTLVFQKGSCW*ELIFIYLTLLNILEMVYQAQAWFSEYTLAPDNCGVLKVAVNVEVSV*N

WVFSYGWDQYQLSTLREGRX

 

 

46% to 32A1 (below) 42% to 37A1 39% to 37B1

 

>CYP32A1 C26F1.2    U53148 508 aa  also Y97E10 contig266 revised 3/19/04

         Length = 509

 

 Score = 1169 (411.5 bits), Expect = 3.6e-122, P = 3.6e-122

 Identities = 228/495 (46%), Positives = 329/495 (66%)

 

Query:     8 LLLTYFAYLIFRQKDDILQFLHVRKICTREFKKLRGPPAVLIFGTLWYFKKDPVEMVYQA 67

             +++ Y  YL+      IL+   + + C +    + GPPA+ + G+   FK +P    +Q

Sbjct:     7 IVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKWNPYAFTFQM 66

 

Query:    68 QAWFSEYTL---------APDN--CGVLKLWLGPIPAVNIARGEIAKIVLDSSVNISKSS 116

             + W  +Y           AP+N   G++ LW+GP+P V +   E  + VL+S+ NISK S

Sbjct:    67 EGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECIRPVLESNTNISKPS 126

 

Query:   117 QYNKLKEWIGDGLLISTGDKWRSRRKMLTQTFHFAVLKEYQKIFGAQGKILVEVLQLRAN 176

             QY+K+ EWIG GLL ST +KW  RRKMLT TFHF ++++Y  +F    ++L + ++L  +

Sbjct:   127 QYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQDYFPVFVRNAEVLADAVELHVD 186

 

Query:   177 NKFSFDIMPYIKRCALDIICETAMGCSISSQRGANDEYVNSVRRLSEIVWNYEKAPQFWL 236

               + FD  PY KRC LDIICETAMG  +++Q G N+EYV++V+R+SEIVWN+ K P  WL

Sbjct:   187 GDY-FDAFPYFKRCTLDIICETAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKFPWLWL 245

 

Query:   237 KPIWYLFGDGFEFNRHVKLTTDFTRDVIENRKKELKTHNSEQNETKKLAFLDYLLKSQEE 296

             KPIWYL G GFEF+R+V++T +F R V           ++E +E K+ AFLD LL  Q+E

Sbjct:   246 KPIWYLTGLGFEFDRNVRMTNNFVRKV--------DAADNEASEKKRKAFLDLLLTIQKE 297

 

Query:   297 HPDILTDEGIREEVDTFMFEGHDTTSSGITFAVWFLGQFPEYFGEDFERIPNSEDIQKMV 356

                 L+DE IREEVDTFMFEGHDTTSSGI F + +LG +PE F  +  + P+ +DI+K 

Sbjct:   298 E-GTLSDEDIREEVDTFMFEGHDTTSSGIGFTILWLGFYPECF--ETNQPPSMDDIKKCS 354

 

Query:   357 YLEQCIKETLRMTPPVPFVSRKLTEDVKIPHATKPDLLLPAGINCMINIITIMKDARYFE 416

             YLE+CIKE+LRM P VP ++R+L+EDV I H +   ++LPAG+   ++ I   +D R +

Sbjct:   355 YLEKCIKESLRMFPSVPLIARRLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWP 414

 

Query:   417 RPYEFFPEHFSPERVAAREPFAFVPFSAGPRNCIGQKFALLEEKVLLSWIFRNFTVTSMT 476

              P  + P++F  + +A R+P+A++PFSAGPRNCIGQKFALLE+K +LS  FR + V S+

Sbjct:   415 DPDTYNPDNFDIDAIAGRDPYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQ 474

 

Query:   477 KFPEEMPIPELILKPQFGTQVLLRNR 502

                   P+PELIL+P  G ++ ++ R

Sbjct:   475 TEENLRPVPELILRPYNGMKIKIKRR 500

 

>gi|22417476|emb|CAAC01000007.1|  Download subject sequence spanning the HSP Caenorhabditis briggsae contig cb25.fpc0023 from assembly cb25.agp8,

              whole genome shotgun sequence

          Length = 1407032

 

726375 MIAAILVALLTYFLFSIFQKRHEIQKFLYARRICIQEFEKVKGPPAVPIFGTTWYFKSDP 726554

726555 IEMIKQAQKWFVEYTLAPDSNGLLK (0) 726629

726676 FWMGPVPVVSICRGEVAK ()

726977 IFDSSTNIPKSSQYRKLKEWIGDGLLI () 727057

727102 STGPKWRSRRKMLTQTFHFAVLKEYHKVFASQGKILVDVLRLRANNTYPFDIMPYIKR 727275

727276 CTLDIIC () 727296

727682 ETAMGCSISSQMGSNDKYVESVKRLSELVWNYEK () 727783

727827 APLYWLKPIWYLFGNGFEFDRLVKLTTDFTRDVIDKRKEELKLHESEPSDKKLAFLDY 728000

728001 LLKSQTDHPEILTDEGIREEVDTFMFEGHDTTSSGIKFAIWFLGQYPEYQQQVQDEMDEI 728180

728181 FGDDYERYPNSEDIQRMIYLEQCIKETLRLTPPVPFISRQLEEDVLI () 728321

728362 AHATKPPVLLPAGMNIMINIITIMKDARYFEKPYEFFPEHFEPERVNSREAFAYVPFSA 728538

728539 GPRNCI ()

728820 GQKFALLEEKVVLSWIFRNFTVTSMSKYPEEHPIPELILKPQFGTQVLLKNRRK 728981

 

 

Score =  119 bits (299), Expect(2) = 8e-30

 Identities = 53/86 (61%), Positives = 71/86 (82%)

 Frame = +3

 

Query: 1      MLAAALVLLLTYFAYLIFRQKDDILQFLHVRKICTREFKKLRGPPAVLIFGTLWYFKKDP 60

              M+AA LV LLTYF + IF+++ +I +FL+ R+IC +EF+K++GPPAV IFGT WYFK DP

Sbjct: 726375 MIAAILVALLTYFLFSIFQKRHEIQKFLYARRICIQEFEKVKGPPAVPIFGTTWYFKSDP 726554

 

Query: 61     VEMVYQAQAWFSEYTLAPDNCGVLKL 86

              +EM+ QAQ WF EYTLAPD+ G+LK+

Sbjct: 726555 IEMIKQAQKWFVEYTLAPDSNGLLKV 726632

 

Score = 37.7 bits (86), Expect(2) = 8e-30

 Identities = 17/50 (34%), Positives = 30/50 (60%), Gaps = 2/50 (4%)

 Frame = +1

 

Query: 85     KLWLGPIPAVNIARGEIAKI--VLDSSVNISKSSQYNKLKEWIGDGLLIS 132

              + W+GP+P V+I RGE+AK+    DS  ++    +Y K+   + D  +I+

Sbjct: 726673 QFWMGPVPVVSICRGEVAKVGQKFDSIEHLGSFDRYQKVIN*LNDHHIIN 726822

 

 Score = 48.9 bits (115), Expect(2) = 2e-32

 Identities = 22/27 (81%), Positives = 23/27 (85%)

 Frame = +2

 

Query: 105    VLDSSVNISKSSQYNKLKEWIGDGLLI 131

              + DSS NI KSSQY KLKEWIGDGLLI

Sbjct: 726977 IFDSSTNIPKSSQYRKLKEWIGDGLLI 727057

 

 Score =  117 bits (293), Expect(2) = 2e-32

 Identities = 54/67 (80%), Positives = 60/67 (89%)

 Frame = +1

 

Query: 130    LISTGDKWRSRRKMLTQTFHFAVLKEYQKIFGAQGKILVEVLQLRANNKFSFDIMPYIKR 189

              + STG KWRSRRKMLTQTFHFAVLKEY K+F +QGKILV+VL+LRANN + FDIMPYIKR

Sbjct: 727096 IYSTGPKWRSRRKMLTQTFHFAVLKEYHKVFASQGKILVDVLRLRANNTYPFDIMPYIKR 727275

 

Query: 190    CALDIIC 196

              C LDIIC

Sbjct: 727276 CTLDIIC 727296

 

Score = 63.5 bits (153), Expect(3) = e-117

 Identities = 28/34 (82%), Positives = 32/34 (94%)

 Frame = +2

 

Query: 197    ETAMGCSISSQRGANDEYVNSVRRLSEIVWNYEK 230

              ETAMGCSISSQ G+ND+YV SV+RLSE+VWNYEK

Sbjct: 727682 ETAMGCSISSQMGSNDKYVESVKRLSELVWNYEK 727783

 

 Score =  285 bits (730), Expect(3) = e-117

 Identities = 133/167 (79%), Positives = 154/167 (92%)

 Frame = +3

 

Query: 230    KAPQFWLKPIWYLFGDGFEFNRHVKLTTDFTRDVIENRKKELKTHNSEQNETKKLAFLDY 289

              +AP +WLKPIWYLFG+GFEF+R VKLTTDFTRDVI+ RK+ELK H SE ++ KKLAFLDY

Sbjct: 727824 RAPLYWLKPIWYLFGNGFEFDRLVKLTTDFTRDVIDKRKEELKLHESEPSD-KKLAFLDY 728000

 

Query: 290    LLKSQEEHPDILTDEGIREEVDTFMFEGHDTTSSGITFAVWFLGQFPEYQQRVHDELDEI 349

              LLKSQ +HP+ILTDEGIREEVDTFMFEGHDTTSSGI FA+WFLGQ+PEYQQ+V DE+DEI

Sbjct: 728001 LLKSQTDHPEILTDEGIREEVDTFMFEGHDTTSSGIKFAIWFLGQYPEYQQQVQDEMDEI 728180

 

Query: 350    FGEDFERIPNSEDIQKMVYLEQCIKETLRMTPPVPFVSRKLTEDVKI 396

              FG+D+ER PNSEDIQ+M+YLEQCIKETLR+TPPVPF+SR+L EDV I

Sbjct: 728181 FGDDYERYPNSEDIQRMIYLEQCIKETLRLTPPVPFISRQLEEDVLI 728321

 

 Score =  122 bits (307), Expect(3) = e-117

 Identities = 58/89 (65%), Positives = 70/89 (78%)

 Frame = +1

 

Query: 396    IPHATKPDLLLPAGINCMINIITIMKDARYFERPYEFFPEHFSPERVAAREPFAFVPFSA 455

              + HATKP +LLPAG+N MINIITIMKDARYFE+PYEFFPEHF PERV +RE FA+VPFSA

Sbjct: 728359 LAHATKPPVLLPAGMNIMINIITIMKDARYFEKPYEFFPEHFEPERVNSREAFAYVPFSA 728538

 

Query: 456    GPRNCIGQKFALLEEKVLLSWIFRNFTVT 484

              GPRNCIG+ +   E  + + + F    V+

Sbjct: 728539 GPRNCIGK*YDKCEPILYVHFKFNYLDVS 728625

 

 Score =  104 bits (259), Expect = 4e-20

 Identities = 49/55 (89%), Positives = 54/55 (98%)

 Frame = +3

 

Query: 461    IGQKFALLEEKVLLSWIFRNFTVTSMTKFPEEMPIPELILKPQFGTQVLLRNRRK 515

              +GQKFALLEEKV+LSWIFRNFTVTSM+K+PEE PIPELILKPQFGTQVLL+NRRK

Sbjct: 728817 VGQKFALLEEKVVLSWIFRNFTVTSMSKYPEEHPIPELILKPQFGTQVLLKNRRK 728981

 

 

 

Y5H2B.6 = CYP33C12P

 

Y5H2B.6; CE21315. AC006810 one in frame stop codon = pseudogene

MLLLLFLSVLFLALFYEFHWKRRNYPAGPLPLPVIGNMWSMMRNNSGVEC

FRQWTKDFGDVYTFWFGTKPYIVVSSYKRLKEAFILDGDTFADKIRQPFQ

DQFRGGNYGVVDTNGHVWSTHRRFALSSFRDFGLGKNLLQEKMLIEVQDM

FAKFDANLGKEQNLPVVLYNAAANVINQLIFGYRFDKEREGELKKLKALM

EFQETAFTTFKVYVQFFAPAIGKHLPGKSVEDLLAEFTVDFYKFFNHQIE

EHRSKIDFDSEESLDYAEAYLKEQRKQEAQGEFELFSTKQLSNTCFDLWF

AGLSTTHITLTWIVGHVLNYPDVQRKLHKELDEVIGSDRLITNDDKNNLP

YLNAVINESQRCANIVPINQIHSTSRDTVINGITVKKGTGVIPQISAIML

DDKVFPDPYAFNPERFLDANGKLRKI*EFVPFSVGKRQCLGEGLARMEMF

IFMSNFFNRYQ (0) 17014

VSPASSGPPSLVKESLLNVAPRKFDAILKKRHV* 16469

 

This gene is a CYP33C but it is missing its C-terminal

Including the heme signature of p450s.  The missing seq has a stop codon.

 

Compare to briggsae

 

>CYP33C.b cb25.fpc0023 4 exons

          Length = 495

 

 Score = 1761 (619.9 bits), Expect = 6.6e-185, P = 6.6e-185

 Identities = 318/426 (74%), Positives = 375/426 (88%)

 

Query:     1 MLLLLFLSVLFLALFYEFHWKRRNYPAGPLPLPVIGNMWSMMRNNSGVECFRQWTKDFGD 60

             M+LLL  + L L LF+E +WKRRNYP GPLPLPVIGNM  +M+   G E FRQWTK+FGD

Sbjct:     1 MILLLLFTTLSLWLFHELYWKRRNYPNGPLPLPVIGNMVPIMKAKPGYEAFRQWTKEFGD 60

 

Query:    61 VYTFWFGTKPYIVVSSYKRLKEAFILDGDTFADKIRQPFQDQFRGGNYGVVDTNGHVWST 120

             V+TFW GTKPYIVVSSYKRLKE FILDGDT+ADK  QPFQ+QFRGG YGV+DTNGHVWST

Sbjct:    61 VFTFWLGTKPYIVVSSYKRLKETFILDGDTYADKAYQPFQEQFRGGQYGVIDTNGHVWST 120

 

Query:   121 HRRFALSSFRDFGLGKNLLQEKMLIEVQDMFAKFDANLGKEQNLPVVLYNAAANVINQLI 180

             HRRFAL++FRDFGLGK+L+Q+K+LIEV ++F KFD N+GKEQ +P V YNA ANVINQLI

Sbjct:   121 HRRFALTTFRDFGLGKDLMQQKILIEVDEIFRKFDENIGKEQEIPGVFYNAGANVINQLI 180

 

Query:   181 FGYRFDKEREGELKKLKALMEFQETAFTTFKVYVQFFAPAIGKHLPGKSVEDLLAEFTVD 240

             FGYRFD+E++ ELKKLKALMEFQETAFTTFKV VQFFAP +G+ LPGKSVE+LLAEFTVD

Sbjct:   181 FGYRFDEEKQEELKKLKALMEFQETAFTTFKVQVQFFAPIVGRMLPGKSVEELLAEFTVD 240

 

Query:   241 FYKFFNHQIEEHRSKIDFDSEESLDYAEAYLKEQRKQEAQGEFELFSTKQLSNTCFDLWF 300

             FYKFF+HQIEEHRSKIDFDSEE+LDYAEAYLKEQ+K+E++G+ ELF  +QLSN CFDLW

Sbjct:   241 FYKFFDHQIEEHRSKIDFDSEENLDYAEAYLKEQKKKESEGDMELFGNRQLSNMCFDLWV 300

 

Query:   301 AGLSTTHITLTWIVGHVLNYPDVQRKLHKELDEVIGSDRLITNDDKNNLPYLNAVINESQ 360

             AGLSTTH TL+WI+ +VLN+ +VQ+ +  ELDEVIGSDRLI+  DKNNLP++NAVINESQ

Sbjct:   301 AGLSTTHTTLSWIIAYVLNHSEVQKTMQIELDEVIGSDRLISTGDKNNLPFMNAVINESQ 360

 

Query:   361 RCANIVPINQIHSTSRDTVINGITVKKGTGVIPQISAIMLDDKVFPDPYAFNPERFLDAN 420

             RCANIVP+NQIH  S+DT+ING+ VKKGTG+IPQIS ++LD+  FPDPY FNPERF+D +

Sbjct:   361 RCANIVPLNQIHCVSKDTMINGVLVKKGTGIIPQISTVLLDETTFPDPYKFNPERFIDEH 420

 

Query:   421 GKLRKI 426

             GKL+K+

Sbjct:   421 GKLKKV 426

 

>CYP33C.b cb25.fpc0023 4 exons

723577 MILLLLFTTLSLWLFHELYWKRRNYPNGPLPLPVIGNMVPIMKAKPGYEAFRQWTK 723744

723745 EFGDVFTFWL (1) 723774

724349 GTKPYIVVSSYKRLKETFILDGDTYADKAYQP 724444

724445 FQEQFRGGQYGVIDTNGHVWSTHRRFALTTFRDFGLGKDLMQQKILIEVDEIFRKFDENI 724624

724625 GKEQEIPGVFYNAGANVINQLIFGYRFDEEKQEELKKLKALMEFQETAFTTFKVQVQFFA 724804

724805 PIVGRMLPGKSVEEL (2) 724849

724904 LAEFTVDFYKFFDHQIEEHRSKIDFDSEENLDYAEA 725009

725010 YLKEQKKKESEGDMELFGNRQLSNMCFDLWVAGLSTTHTTLSWIIAYVLNHSEVQKTMQI 725189

725190 ELDEVIGSDRLISTGDKNNLPFMNAVINESQRCANIVPLNQIHCVSKDTMINGVLVKKGT 725369

725370 GIIPQISTVLLDETTFPDPYKFNPERFIDEHGKLKKV EELCAFSVGKRQCLGEGLARMEM 725549

725550 FLFISNFFNRYQ (0) 725585

725633 VTPGSSGPPSLEKETMFNATPRKIRAVLTKRYL* 725734

 

 

CYP31A3 and pseudogene CYP31A4P

 

>CYP31A3 T16C6 486 aa this sequence revised according to Y17G9.contig 61

         Length = 488

 

Score = 1164 (409.7 bits), Expect = 1.2e-121, P = 1.2e-121

 Identities = 242/332 (72%), Positives = 263/332 (79%)

 

Query:     1 MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60

             MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI

Sbjct:     1 MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60

 

Query:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

             GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS

Sbjct:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

 

Query:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180

             QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD

Sbjct:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180

 

Query:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNS---------FIYNLYD 231

             IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL+             +++  

Sbjct:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK 240

 

Query:   232 SFIIKKVNSILF--FRTEDGRTHEKCLRILHDFTKKVIVERKEALQEND-YKMEGR-LAF 287

               I+++  ++    ++ E GR     L +L +  K   ++  +   E D +  EG    

Sbjct:   241 KVIVERKEALQENDYKME-GRL--AFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTS 297

 

Query:   288 LDLLLEMVKSGQMDETD--VQAEVDTFMFEGHDTT 320

               L+  +   G   E    VQAE+D  M +  D T

Sbjct:   298 TGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVT 332

 

 Score = 1394 (490.7 bits), Expect = 5.1e-146, P = 5.1e-146

 Identities = 274/295 (92%), Positives = 277/295 (93%)

 

Query:   217 NNPLMWNSFIYNLYDSFIIKKVNSILFFRTEDGRTHEKCLRILHDFTKKVIVERKEALQE 276

             NN  +W     N     I K+ N+ L   TEDGRTHEKCLRILHDFTKKVIVERKEALQE

Sbjct:   198 NNEYVWAVHTIN---KLISKRTNNPLI--TEDGRTHEKCLRILHDFTKKVIVERKEALQE 252

 

Query:   277 NDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPE 336

             NDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPE

Sbjct:   253 NDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPE 312

 

Query:   337 VQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGV 396

             VQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGV

Sbjct:   313 VQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGV 372

 

Query:   397 NIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSRNCIGQR 456

             NIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSRNCIGQR

Sbjct:   373 NIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSRNCIGQR 432

 

Query:   457 FALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHMKLTRRRPIVSP 511

             FALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHMKLTRRRPIVSP

Sbjct:   433 FALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHMKLTRRRPIVSP 487

 

Y17G9B.3; CE24183. remove cyan seq

MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKP  50

DPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHL 100

NKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNE 150

QSKILVQKMCSLGAEEEVDVLSVITLCTLDIICETSMGKAIGAQLAENNE 200

YVWAVHTINKLISKRTNNPLMWNSFIYNLYDSFIIKKVNSILFFRTEDGR 250

THEKCLRILHDFTKKVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQM 300

DETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMG 350

DDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGVNIPK 400

GVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSR 450

NCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHM 500

KLTRRRPIVSP                                        511

 

with cyan seq removed we get

 

>CYP31A3 T16C6 486 aa this sequence revised according to Y17G9.contig 61

         Length = 488

 

add the green seq to the model

 Score = 2521 (887.4 bits), Expect = 1.9e-265, P = 1.9e-265

 Identities = 486/495 (98%), Positives = 487/495 (98%)

 

Query:     1 MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60

             MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI

Sbjct:     1 MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60

 

Query:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

             GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS

Sbjct:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

 

Query:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180

             QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD

Sbjct:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180

 

Query:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNSFIYNLTEDGRTHEKCL 240

             IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL        +TEDGRTHEKCL

Sbjct:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL--------ITEDGRTHEKCL 232

 

Query:   241 RILHDFTKKVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEG 300

             RILHDFTKKVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEG

Sbjct:   233 RILHDFTKKVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEG 292

 

Query:   301 HDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRL 360

             HDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRL

Sbjct:   293 HDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRL 352

 

Query:   361 FPSVPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIA 420

             FPSVPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIA

Sbjct:   353 FPSVPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIA 412

 

Query:   421 RKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVT 480

             RKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVT

Sbjct:   413 RKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVT 472

 

Query:   481 PIHMKLTRRRPIVSP 495

             PIHMKLTRRRPIVSP

Sbjct:   473 PIHMKLTRRRPIVSP 487

 

Compare to 31A2

 

>CYP31A2 F22B3 Z68336 487 aa

         Length = 488

 

 Score = 2517 (886.0 bits), Expect = 5.1e-265, P = 5.1e-265

 Identities = 478/488 (97%), Positives = 484/488 (99%)

 

Query:     1 MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60

             MGVIIPAVLLA AT+IAWLLYKHLRMRQ LKHLNQPRSYPIVGHGL+TKPDPEGFMNQVI

Sbjct:     1 MGVIIPAVLLAMATVIAWLLYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFMNQVI 60

 

Query:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

             GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS

Sbjct:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

 

Query:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180

             QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQK+C LGA+EEVDVLSVITLCTLD

Sbjct:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKLCCLGADEEVDVLSVITLCTLD 180

 

Query:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK 240

             IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK

Sbjct:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK 240

 

Query:   241 KVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGL 300

             KVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGL

Sbjct:   241 KVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGL 300

 

Query:   301 MWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIIT 360

             MWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIIT

Sbjct:   301 MWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIIT 360

 

Query:   361 RELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIP 420

             RELSDDQVIGGVNIPKGVTFLLNLYLVHRDP+QWKDPDVFDPDRFLPENSI RKSFAFIP

Sbjct:   361 RELSDDQVIGGVNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFLPENSIGRKSFAFIP 420

 

Query:   421 FSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHMKLTR 480

             FSAGSRNCIGQRFALMEEKVIMAHLLRNFN+KAVELMHEVRPKMEIIVRPVTPIHMKLTR

Sbjct:   421 FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR 480

 

Query:   481 RRPIVSP* 488

             RRPIVSP*

Sbjct:   481 RRPIVSP* 488

 

I had the ends of my genes mixed up.  The pseudogene should be downstream of CYP31A3

Not in the last intron.

 

>CYP31A3 T16C6 486 aa this sequence revised according to Y17G9.contig 61

         Length = 488

 

  Plus Strand HSPs:

 

 Score = 551 (194.0 bits), Expect = 1.9e-69, Sum P(2) = 1.9e-69

 Identities = 107/113 (94%), Positives = 111/113 (98%), Frame = +3

 

Query: 13203 VPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKS 13382

             VPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKS

Sbjct:   356 VPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKS 415

 

Query: 13383 FAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVKH*KLKLI 13541

             FAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEV+  K+++I

Sbjct:   416 FAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRP-KMEII 467

 

Score = 146 (51.4 bits), Expect = 1.9e-69, Sum P(2) = 1.9e-69

 Identities = 29/30 (96%), Positives = 30/30 (100%), Frame = +1

 

Query: 13567 QVRPKMEIIVRPVTPIHMKLTRRRPIVSP* 13656 end of CYP31A3

             +VRPKMEIIVRPVTPIHMKLTRRRPIVSP*

Sbjct:   459 EVRPKMEIIVRPVTPIHMKLTRRRPIVSP* 488

 

Score = 165 (58.1 bits), Expect = 2.3e-11, P = 2.3e-11

 Identities = 40/63 (63%), Positives = 44/63 (69%), Frame = +2

 

This is the pseudogene CYP31A4P

 

Query: 15233 MTHLLRNFSVKDVELVHEMIFESL*IKFLIKLVQVRPKMEIIVCPVSPIHMKLTRRRPII 15412

             M HLLRNF+VK VEL+HE                VRPKMEIIV PV+PIHMKLTRRRPI+

Sbjct:   442 MAHLLRNFNVKAVELMHE----------------VRPKMEIIVRPVTPIHMKLTRRRPIV 485

 

Query: 15413 SP* 15421

             SP*

Sbjct:   486 SP* 488

 

AC006720               19620 bp    DNA     linear   INV 07-JUL-2003

DEFINITION  Caenorhabditis elegans cosmid Y17G9B, complete sequence.

 

    13201 ctgttccaat tattacaaga gaattgagtg atgatcaagt gattggaggt gttaatattc

    13261 caaaaggagt cacttttctg ctcaacctgt accttgttca tcgtgatcct tcgcaatgga

    13321 aggatcctga tgtattcgat ccggatcgtt tccttcctga aaattcaatt gctcgcaagt

    13381 ctttcgcctt cattccattc tcagctggaa gtcgtaactg cattggccaa cgttttgcac

    13441 tcatggagga gaaggtcatc atggctcatt tgttgagaaa tttcaatgta aaagctgtcg

    13501 agttgatgca tgaggtgaaa cattgaaaat taaaattgat aaattaaatt ctaactgaaa

    13561 attgttcagg ttcgtccgaa aatggagatt atcgttcgcc cagtcactcc aattcacatg

    13621 aagcttacca gacgccgtcc aatcgtctct ccataatttc aaaatagcca attttattct

    13681 tgatgattat atatttttaa tgcatgataa attttaattt tcatacttta ccttgcaaat

    13741 gtgatgaaca aattcttatt tcggggagtg aactattcct ggaacaaaaa cgatttgatt

    13801 ttttaagctc tataggctct acactaaatt ctgtgaatac gtactgcaaa tatatcaaaa

    13861 ctaaattttt ctcattgcaa aaagtctgaa tttattgatt tttgaatgaa gttatagcta

    13921 gtaatattga ttctcatgga acagctcata ttgacaaatg tgacaaattt tgatacggta

    13981 tttttaaagg cgcagcaaaa aattttctct atggtcccgc aacgtacaca gtttcactag

    14041 attttgacat gccgctttaa aggcacgtgg atttatttaa atgggtaaca cggcccggca

    14101 agtggtacat ccatgcaaat gcgctctact gataatttga gtgtagacca ggtttgggcg

    14161 cgtgataacg aaaaaagctt tggtccaaaa aatttagaat ttagtttcgg acatttttta

    14221 tatgcatcac aaaaaaactg gaccaaccgt ttttgagata cacgcgccca aacgtccagg

    14281 tatacggtag acaaattgcg tacaggtacc acttctcggg ccgtgggtaa tatgggaaca

    14341 aacaattcta agaatgcgta ctgggcgaca tatttggcgc gcaaaatatc ttgtagcgaa

    14401 aactacagta attcttttaa tgaccactgt agctcttgtg tcgatttacg ggcccacttt

    14461 cgaaatgagt ttctttttga atagtgataa caaatttctc attttccttc gttattttgt

    14521 attattttct cacgttttgc ttaattttaa tattccataa atggttttaa ttcattcaga

    14581 aattcaggcc cgtaaatcga caaaagagct acagtagtca tctatagaat tactgtattt

    14641 ttcgctccga gatattttgc gcgtcaaata tattgcgtag tacgcattct cagaaattaa

    14701 tgttcccgta ataccaaagt catacaggaa aagtgaattt ttcgaaattc aaaatttgaa

    14761 ccgttgatat tttcttgttg atgttttgca aacaacactc cagcgcttga gcatattgcc

    14821 atcaaacacc aatgagaacg aaaaaggttg aatcgaacaa ttcgaagaat tttctatttt

    14881 aagagaaaag atctctaacc agtgccttct cgctccatta tcctttggaa catgtgcgtg

    14941 tctctgcacg ctttagacaa agaaaaatat gtataatatc tctaagatga gaattagaga

    15001 attggggaag gggggtcaat tggatcggac cacccaatct tcccatctgc tcattctcca

    15061 tttgtgacca tggacaacgg tgtgtctatt ccattccaac tgctcaaatc acctcaaaaa

    15121 tcgattagtt atgttcttca aagaatgact atcggagatc tgtaagtaaa atgtgaccaa

    15181 atataattat ttacagacct gcaagtggaa ctattcgcta gacattcact agatgactca

    15241 tttgttgaga aatttcagtg taaaagatgt tgagttggtg catgaaatga tatttgaaag

    15301 tttataaatt aaattcttaa taaaacttgt tcaggttcgt ccaaaaatgg aaataatcgt

    15361 ttgcccggtc tctccaatcc acatgaagct caccagacgc cgtccaatca tatctccatg

    15421 atttgaaatt aatttgaaat ttcgaacgat tttatatttt tattgcttga taaatttgag

    15481 ttttctgtaa aaagtttgaa acttcaacct tgaaattccc gttactggtc gagtccaaat

 

 

Y80D3A.5

 

>CYP42A1 CM08B12  M89401, AL020988 Y80D3 contig 01341 and 00427 504 aa

         Length = 505

 

 Score = 2290 (806.1 bits), Expect = 5.8e-241, P = 5.8e-241

 Identities = 440/450 (97%), Positives = 440/450 (97%)

 

Query:     1 MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQS 60

             MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQS

Sbjct:     1 MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQS 60

 

Query:    61 QGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSAWIGDGL 120

             QGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSAWIGDGL

Sbjct:    61 QGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSAWIGDGL 120

 

Query:   121 LIS-KPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGTGEYSDVFHTIT 179

             LIS KPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGTGEYSDVFHTIT

Sbjct:   121 LISRKPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGTGEYSDVFHTIT 180

 

Query:   180 LCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKE 239

             LCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKE

Sbjct:   181 LCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKE 240

 

Query:   240 HDECVK--ILHEFTSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGE 297

             HDECVK  ILHEFTSKAIYARK          QLLAQETAEGRRRMAFLDLMLDMNSKGE

Sbjct:   241 HDECVKVIILHEFTSKAIYARK----------QLLAQETAEGRRRMAFLDLMLDMNSKGE 290

 

Query:   298 LPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDEVLGEADRPVSYE 357

             LPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDEVLGEADRPVSYE

Sbjct:   291 LPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDEVLGEADRPVSYE 350

 

Query:   358 DLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVMVPSMVHKDPRYW 417

             DLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVMVPSMVHKDPRYW

Sbjct:   351 DLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVMVPSMVHKDPRYW 410

 

Query:   418 DDPEIFNPERFITGELKHPYAYIPFSAGSRNCI 450

             DDPEIFNPERFITGELKHPYAYIPFSAGSRNCI

Sbjct:   411 DDPEIFNPERFITGELKHPYAYIPFSAGSRNCI 443

 

Y80D3A.5; CE23108.

MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFH  50

FSPEEFFEQSQGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSP 100

KMLNKPFLYGFLSAWIGDGLLISKPDKWRPRRKLLTPTFHYDILKDFVEV 150

YNRHGRTLLSKFEAQAGTGEYSDVFHTITLCTLDVICEAALGTSINAQKD 200

PHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKEHDECVK  ILHEF 250

TSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGELPM 300

EGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDEVLGEA 350

DRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSG 400

TAVVMVPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCI 450

ANIDNMGVKLTEIT end wrong

 

>gi|25815101|emb|AL132853.2|CEY80D3A  Download subject sequence spanning the HSP Caenorhabditis elegans YAC Y80D3A, complete sequence

          Length = 102495

 

The genomic seq supports the yellow regions for Y80D3A.5 so my seq is wrong there

 

 Score =  102 bits (253), Expect = 7e-23

 Identities = 45/47 (95%), Positives = 47/47 (100%)

 Frame = +1

 

Query: 1     PHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKEHDECVKIL 47

             PHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKEHDECVK++

Sbjct: 58066 PHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKEHDECVKVI 58206

 

 

 Score =  109 bits (272), Expect = 5e-25

 Identities = 55/56 (98%), Positives = 56/56 (100%)

 Frame = +3

 

Query: 45    KILHEFTSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGELPM 100

             +ILHEFTSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGELPM

Sbjct: 58620 QILHEFTSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGELPM 58787

 

 

 

>gi|275296|gb|M89401.1|M89401  UniGene info CEL08B12 Chris Martin sorted cDNA library Caenorhabditis elegans

           cDNA clone cm08b12 5' similar to cytochrome P450

           homologous peptide.

          Length = 528

 

 

 Score = 26.6 bits (57), Expect(2) = 2e-11

 Identities = 12/12 (100%), Positives = 12/12 (100%)

 Frame = +2

 

Query: 60 AKVDAAGGVEQL 71

          AKVDAAGGVEQL

Sbjct: 2  AKVDAAGGVEQL 37

 

 

 Score = 56.2 bits (134), Expect(2) = 2e-11

 Identities = 27/28 (96%), Positives = 27/28 (96%)

 Frame = +3

 

Query: 73  AQETAEGRRRMAFLDLMLDMNSKGELPM 100

           AQETA GRRRMAFLDLMLDMNSKGELPM

Sbjct: 42  AQETAXGRRRMAFLDLMLDMNSKGELPM 125

 

 

 

 

>gi|25815101|emb|AL132853.2|CEY80D3A  Download subject sequence spanning the HSP Caenorhabditis elegans YAC Y80D3A, complete sequence

          Length = 102495

 

 Score =  226 bits (577), Expect = 5e-60

 Identities = 107/107 (100%), Positives = 107/107 (100%)

 Frame = +3

 

Query: 1     VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVM 60

             VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVM

Sbjct: 60489 VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVM 60668

 

Query: 61    VPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCIGE 107

             VPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCIGE

Sbjct: 60669 VPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCIGE 60809

 

 

 

 

 Score =  119 bits (299), Expect = 9e-28

 Identities = 60/61 (98%), Positives = 60/61 (98%)

 Frame = +3

 

Query: 106   GERFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSI 165

             G RFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSI

Sbjct: 61968 GMRFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSI 62147

 

Query: 166   Y 166

             Y

Sbjct: 62148 Y 62150

 

>gi|30733169|gb|CB391459.1|CB391459  UniGene info OSTF152C3_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 153

 

 Score = 99.4 bits (246), Expect = 4e-22

 Identities = 50/50 (100%), Positives = 50/50 (100%)

 Frame = +3

 

cDNAs for the end of the protein

 

Query: 117 ILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSIY 166

           ILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSIY

Sbjct: 3   ILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSIY 152

 

>gi|30733181|gb|CB391471.1|CB391471  UniGene info OSTF152C3_2 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 142

 

 Score = 89.7 bits (221), Expect = 3e-19

 Identities = 45/46 (97%), Positives = 46/46 (100%)

 Frame = +3

 

Query: 108 RFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELK 153

           RFAMM+EKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELK

Sbjct: 3   RFAMMDEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELK 140

 

>CYP42A1 CM08B12  M89401, AL020988 Y80D3 contig 01341 and 00427 504 aa

MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQ

SQGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSA

WIGDGLLISRKPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGT

GEYSDVFHTITLCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHY

FSDTIFNLIGPGKEHDECVKVIILHEFTSKAIYARKQLLAQETAEGRRRMAFLDLML

DMNSKGELPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDE

VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTA

VVMVPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCIG (1)

RFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSIY*

 

>gi|22417578|emb|CAAC01000109.1|  Download subject sequence spanning the HSP Caenorhabditis briggsae contig cb25.fpc4175 from assembly cb25.agp8,

             whole genome shotgun sequence

          Length = 116228

 

 Score =  223 bits (567), Expect = 5e-57

 Identities = 103/107 (96%), Positives = 107/107 (100%)

 Frame = +2

 

Query: 1     VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVM 60

             VLGEADRP+SYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQ+RGHTLPSGTAVVM

Sbjct: 66992 VLGEADRPISYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQIRGHTLPSGTAVVM 67171

 

Query: 61    VPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCIGE 107

             VPSMVHKDPRYW+DPEIFNPERFI+GELKHPYAYIPFSAGSRNCIGE

Sbjct: 67172 VPSMVHKDPRYWEDPEIFNPERFISGELKHPYAYIPFSAGSRNCIGE 67312

 

 

 

 

 Score =  110 bits (275), Expect = 4e-23

 Identities = 55/66 (83%), Positives = 60/66 (90%)

 Frame = +2

 

Query: 97    FSAGSRNCIGERFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEK 156

             FS  + N  G RFAMMEEKCILAI+LKNLKVKAKLRTD+MRVAAELIIRPL+GNELKFE+

Sbjct: 68207 FSPKTSNFPGMRFAMMEEKCILAILLKNLKVKAKLRTDQMRVAAELIIRPLFGNELKFER 68386

 

Query: 157   REFGDY 162

             REFGDY

Sbjct: 68387 REFGDY 68404

 

CYP25A4

 

C36A4.6; CE03074. use this seq

MAILIISTIFFTIITFISYSIWRRHAIFKLRSSIGIPGPPVHWLWGNLNI  50

IKDRVSRLGYNDTTQWHPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIF 100

IQNFSNFSDRMTPDIFGMNQLNQSLLQNTYATGWKHTRSAIAPIFSTGKM 150

KAMHETLVSKIDIFLEVLKEKSSSGQKWDIFENFQSLSLDIIGKCAFAID 200

SNCQRDRTDLFYVQARKFVGAVDLKKSWILPVSLILPELSWLWRFLYKFS 250

DLSAAELPLVKGLVDLYDRRRAGEGGNDSTDLLNLLIRRETIGKMTQREV 300

IENCFAFLIAGYETTSTAMMFSAYLLAEYPIVQQKLYEEIKKTKENAGLN 350

YDSIHNMKYLDCVYKESLRFYPPTTHFTNRVCLNDMTIRGQIYPEDSTLK 400

VQPYTIHRNPANWESPDEFQPERFENWEEKSSSLKWIPFGVGPRYCVGMR 450

FAEMEFKTTIAKLLDTFELSLVPGDPPMIPETNGVIFRPRSPVRLNLKLR 500

I

 

>CYP25A4 C36A4.6    Z66495 495 aa

         Length = 496

 

 Score = 2587 (910.7 bits), Expect = 1.9e-272, P = 1.9e-272

 Identities = 493/501 (98%), Positives = 493/501 (98%)

 

Query:     1 MAILIISTIFFTIITFISYSIWRRHAIFKLRSSIGIPGPPVHWLWGNLNIIKDRVSRLGY 60

             MAILIISTIFFTIITFISYSIWRRHAIFKLRSSIGIPGPPVHWLWGNLNIIKDRV    

Sbjct:     1 MAILIISTIFFTIITFISYSIWRRHAIFKLRSSIGIPGPPVHWLWGNLNIIKDRV----- 55

 

Query:    61 NDTTQWHPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPDIFGMNQ 120

              DTTQWHPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPDIFGMNQ

Sbjct:    56 -DTTQWHPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPDIFGMNQ 114

 

Query:   121 LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMHETLVSKIDIFLEVLKEKSSSGQKWDI 180

             LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMHETLVSKIDIFLEVLKEKSSSGQKWDI

Sbjct:   115 LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMHETLVSKIDIFLEVLKEKSSSGQKWDI 174

 

Query:   181 FENFQSLSLDIIGKCAFAIDSNCQRDRTDLFYVQARKFVGAVDLKKSWILPVSLILPELS 240

             FENFQSLSLDIIGKCAFAIDSNCQRDRTDLFYVQARKFVGAVDLKKSWILPVSLILPELS

Sbjct:   175 FENFQSLSLDIIGKCAFAIDSNCQRDRTDLFYVQARKFVGAVDLKKSWILPVSLILPELS 234

 

Query:   241 WLWRFLYKFSDLSAAELPLVKGLVDLYDRRRAGEGGNDSTDLLNLLIRRETIGKMTQREV 300

             WLWRFLYKFSDLSAAELPLVKGLVDLYDRRRAGEGGNDSTDLLNLLIRRETIGKMTQREV

Sbjct:   235 WLWRFLYKFSDLSAAELPLVKGLVDLYDRRRAGEGGNDSTDLLNLLIRRETIGKMTQREV 294

 

Query:   301 IENCFAFLIAGYETTSTAMMFSAYLLAEYPIVQQKLYEEIKKTKENAGLNYDSIHNMKYL 360

             IENCFAFLIAGYETTSTAMMFSAYLLAEYPIVQQKLYEEIKKTKENAGL  DSIHNMKYL

Sbjct:   295 IENCFAFLIAGYETTSTAMMFSAYLLAEYPIVQQKLYEEIKKTKENAGLYNDSIHNMKYL 354

 

Query:   361 DCVYKESLRFYPPTTHFTNRVCLNDMTIRGQIYPEDSTLKVQPYTIHRNPANWESPDEFQ 420

             DCVYKESLRFYPPTTHFTNRVCLNDMTIRGQIYPEDSTLKVQPYTIHRNPANWESPDEFQ

Sbjct:   355 DCVYKESLRFYPPTTHFTNRVCLNDMTIRGQIYPEDSTLKVQPYTIHRNPANWESPDEFQ 414

 

Query:   421 PERFENWEEKSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELSLVPGDPPMIP 480

             PERFENWEEKSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELSLVPGDPPMIP

Sbjct:   415 PERFENWEEKSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELSLVPGDPPMIP 474

 

Query:   481 ETNGVIFRPRSPVRLNLKLRI 501

             ETNGVIFRPRSPVRLNLKLRI

Sbjct:   475 ETNGVIFRPRSPVRLNLKLRI 495

 

CYP25A3

 

added 7 aa to the seq

 

>CYP25A3 C36A4.3    Z66495 496 aa

MAFLILTSILVSLVSFIIYVILARKERFRLRGKIGLSGPEPHWLMGNLKQIIER

KAKLGYDDSYDWYNKLHKQFGETFGIYFGTQLNINITNEEDIKEVFIKNFSNFSDRTPPPI

IEDNKLKESLLQNTYESGWKHTRSAIAPIFSTGKMKAMHETIHSKVDLFLEIL

KEKASSGQKWDIYDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYVNARKFITN

IDIRHSKIISTSFLFPELSKLWKVLYRFTDLAKAEIPLVEGLADVYERRRGGEG

SDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYLLS

KYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFSNR

RCLKDITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLK

WIPFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELKQFEGEADLIPDCNGVIMRPKD

PVRLHLKPRN*

 

CYP34A3

 

C41G6.1; CE15699.

MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGG  50

IVAGFQVFKKQYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFT 100

TGVMNYIREGRGIIGSNGAFWQEHRRFALTTLRNFGLGKNIMEDRIMDEY 150

RYR LLKNSR incorrect seq here

KNGVIEVNAATMFDLLVGSIINRMLVSKRFEQGNLEFETMK 200

VYLTKALEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPVEYIYSLVQKEIQ 250

ERTTSIENGSHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAIDLF 300

DLWLAGQDTSSTTLLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDR 350

ARTPYLMAVLNEIQRIASILNINIFRELREDTEIDGQPIAAGTVVANQLS 400

MIHTDEELFEDHTRFDPERFIENSTLEKKLIPFNLGKRSCPGESLARAEL 450

YLIIGNLVLDFDFEAVGIKPEIKTSTPFGIMKRPPNYISD           490

 

>CYP34A3 C41G6.1     Z81047 492 aa also T13C10 Z81591

         Length = 493

 

 Score = 2526 (889.2 bits), Expect = 5.7e-266, P = 5.7e-266

 Identities = 490/492 (99%), Positives = 490/492 (99%)

 

Query:     1 MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGGIVAGFQVFKK 60

             MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGGIVAGFQVFKK

Sbjct:     1 MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGGIVAGFQVFKK 60

 

Query:    61 QYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFTTGVMNYIREGRGIIGSNGAF 120

             QYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFTTGVMNYIREGRGIIGSNGAF

Sbjct:    61 QYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFTTGVMNYIREGRGIIGSNGAF 120

 

Query:   121 WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRY--RLLKNSRKNGVIEVNAATMFDLLVGS 178

             WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRY  RLLKNSRKNGVIEVNAATMFDLLVGS

Sbjct:   121 WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYNPRLLKNSRKNGVIEVNAATMFDLLVGS 180

 

Query:   179 IINRMLVSKRFEQGNLEFETMKVYLTKALEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPV 238

             IINRMLVSKRFEQGNLEFETMKVYLTKALEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPV

Sbjct:   181 IINRMLVSKRFEQGNLEFETMKVYLTKALEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPV 240

 

Query:   239 EYIYSLVQKEIQERTTSIENGSHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAID 298

             EYIYSLVQKEIQERTTSIENGSHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAID

Sbjct:   241 EYIYSLVQKEIQERTTSIENGSHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAID 300

 

Query:   299 LFDLWLAGQDTSSTTLLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDRARTPYLMA 358

             LFDLWLAGQDTSSTTLLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDRARTPYLMA

Sbjct:   301 LFDLWLAGQDTSSTTLLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDRARTPYLMA 360

 

Query:   359 VLNEIQRIASILNINIFRELREDTEIDGQPIAAGTVVANQLSMIHTDEELFEDHTRFDPE 418

             VLNEIQRIASILNINIFRELREDTEIDGQPIAAGTVVANQLSMIHTDEELFEDHTRFDPE

Sbjct:   361 VLNEIQRIASILNINIFRELREDTEIDGQPIAAGTVVANQLSMIHTDEELFEDHTRFDPE 420

 

Query:   419 RFIENSTLEKKLIPFNLGKRSCPGESLARAELYLIIGNLVLDFDFEAVGIKPEIKTSTPF 478

             RFIENSTLEKKLIPFNLGKRSCPGESLARAELYLIIGNLVLDFDFEAVGIKPEIKTSTPF

Sbjct:   421 RFIENSTLEKKLIPFNLGKRSCPGESLARAELYLIIGNLVLDFDFEAVGIKPEIKTSTPF 480

 

Query:   479 GIMKRPPNYISD 490

             GIMKRPPNYISD

Sbjct:   481 GIMKRPPNYISD 492

 

>CYP34A3 C41G6.1     Z81047 494 aa also T13C10 Z81591

This gene has a frameshift IQDFSKTHG revised 3/20/04

MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGGIV

AGFQVFKKQYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFTTGVM

NYIREGRGIIGSNGAFWQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYR (2)

IQDFSKTHG (frameshift)

KNGVIEVNAATMFDLLVGSIINRMLVSKRFEQGNLEFETMKVYLTKA

LEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPVEYIYSLVQKEIQERTTSIENG

SHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAIDLFDLWLAGQDTSSTT

LLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDRARTPYLMAVLNEIQR

IASILNINIFRELREDTEIDGQPIAAGTVVANQLSMIHTDEELFEDHTRFDPER

FIENSTLEKKLIPFNLGKRSCPGESLARAELYLIIGNLVLDFDFEAVGIKPEIKT

STPFGIMKRPPNYISD*

 

compare to CYP34A5 B0213.10

>gi|17557253|ref|NM_071698.1|  LocusLink infoUniGene info Caenorhabditis elegans cytochrome p450 2c2 family member (5E430),

           mRNA

          Length = 1500

 

 Score =  192 bits (488), Expect = 4e-50

 Identities = 91/105 (86%), Positives = 101/105 (96%)

 Frame = +1

 

Query: 1   NYIREGRGIIGSNGAFWQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYRIQDFSKTHGKN 60

           +YIREGRGI+GSNG FWQEHRRFALTTLRNFG+G+NIMED+IMDEYRYRIQDFSKTHGKN

Sbjct: 313 DYIREGRGIVGSNGDFWQEHRRFALTTLRNFGVGRNIMEDKIMDEYRYRIQDFSKTHGKN 492

 

Query: 61  GVIEVNAATMFDLLVGSIINRMLVSKRFEQGNLEFETMKVYLTKA 105

           GVIEVNA TMFDLLVGSIINRMLVS+RFEQG+ +FE +K+YLTKA

Sbjct: 493 GVIEVNATTMFDLLVGSIINRMLVSERFEQGDQDFEKLKMYLTKA 627

 

 

>gi|6435485|emb|Z81047.2|CEC41G6  Download subject sequence spanning the HSP Caenorhabditis elegans cosmid C41G6, complete sequence

          Length = 31437

 

 Score = 74.7 bits (182), Expect = 1e-14

 Identities = 36/41 (87%), Positives = 37/41 (90%), Gaps = 1/41 (2%)

 Frame = +3

 

Query: 1     WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYR-IQDFSKT 40

             WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYR  + F KT

Sbjct: 21507 WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYRCFKIFQKT 21629

 

Score = 28.5 bits (62), Expect(2) = 4e-07

 Identities = 11/12 (91%), Positives = 12/12 (100%)

 Frame = +1

 

Query: 32    YRIQDFSKTHGK 43

             +RIQDFSKTHGK

Sbjct: 21844 FRIQDFSKTHGK 21879

 

 Score = 40.8 bits (94), Expect(2) = 4e-07

 Identities = 20/23 (86%), Positives = 20/23 (86%)

 Frame = +3

 

Query: 39    KTHGKNGVIEVNAATMFDLLVGS 61

             K   KNGVIEVNAATMFDLLVGS

Sbjct: 21864 KNSRKNGVIEVNAATMFDLLVGS 21932

 

CYP33C1

 

>CYP33C1 C45H4.2 502 aa

         Length = 503

 

 Score = 2603 (916.3 bits), Expect = 3.9e-274, P = 3.9e-274

 Identities = 496/502 (98%), Positives = 497/502 (99%)

 

Query:     1 MIIILLLTFLTIYFVYELYWKRRNFPPGPCPLPVFGNLLSIANPPPGYKAFERWTKKYGD 60

             MIIILLLTFLTIYFVYELYWKRRNFPPGPCPLPVFGNLLSIANPPPGYKAFERWTKKYGD

Sbjct:     1 MIIILLLTFLTIYFVYELYWKRRNFPPGPCPLPVFGNLLSIANPPPGYKAFERWTKKYGD 60

 

Query:    61 VYTFWIGNTPHIMINTWDKIKETFIRDADTYTNKVVLPMVTLSRGGEYGIIDSNGAMWRE 120

             VYTFWIGNTPHIMINTWDKIKETFIRDADTYTNKVVLPMVTLSRGGEYGIIDSNGAMWRE

Sbjct:    61 VYTFWIGNTPHIMINTWDKIKETFIRDADTYTNKVVLPMVTLSRGGEYGIIDSNGAMWRE 120

 

Query:   121 HRRFALSTMRDFGLGKNLMQENILMEVQDVFARLDAKLGSETDVPEVFDHAVANVVNQLL 180

             HRRFALSTMRDFGLGKNLMQENILMEVQDVFARLDAKLGSETDVPEVFDHAVANVVNQLL

Sbjct:   121 HRRFALSTMRDFGLGKNLMQENILMEVQDVFARLDAKLGSETDVPEVFDHAVANVVNQLL 180

 

Query:   181 FGYRFMGPKENEYQELKHIIDSPAEIFGKLHIFLAMNIPIFAKLLPESLYEGPIKTFRDT 240

             FGYRFMGPKENEYQELKHIIDSPAEIFGKLHIFLAMNIPIFAKLLPESLYEGPIKTFRDT

Sbjct:   181 FGYRFMGPKENEYQELKHIIDSPAEIFGKLHIFLAMNIPIFAKLLPESLYEGPIKTFRDT 240

 

Query:   241 TLAFFNKQIEAHRHRIDFEDLNSESTDFVETFLKEQKRRESE-----ANSKNLNFSNIQL 295

             TLAFFNKQIEAHRHRIDFEDLNSESTDFVETFLKEQKRRESE     ANSKNLNFSNIQL

Sbjct:   241 TLAFFNKQIEAHRHRIDFEDLNSESTDFVETFLKEQKRRESEGDSETANSKNLNFSNIQL 300

 

Query:   296 LNVCIDLWFAGLNTTTNTITWAISYVLHHPEVQDKIHEELDKVIGSDRLITTADKNDLPY 355

             LNVCIDLWFAGLNTTTNTITWAISYVLHHPEVQDKIHEELDKVIGSDRLITTADKNDLPY

Sbjct:   301 LNVCIDLWFAGLNTTTNTITWAISYVLHHPEVQDKIHEELDKVIGSDRLITTADKNDLPY 360

 

Query:   356 FNASINESQRGINILPLNLQHATTRDTVIDGFKIPKGTGVVAQISTVMNNEEVFPDPYTF 415

             FNASINESQRGINILPLNLQHATTRDTVIDGFKIPKGTGVVAQISTVMNNEEVFPDPYTF

Sbjct:   361 FNASINESQRGINILPLNLQHATTRDTVIDGFKIPKGTGVVAQISTVMNNEEVFPDPYTF 420

 

Query:   416 NPDRFIDENGKLKKVDELAPFSVGKRSCPGEGLARMELFLFIANFLNRYKIHPSKEGLPS 475

             NPDRFIDENGKLKKVDELAPFSVGKRSCPGEGLARMELFLFIANFLNRY+IHPSKEGLPS

Sbjct:   421 NPDRFIDENGKLKKVDELAPFSVGKRSCPGEGLARMELFLFIANFLNRYQIHPSKEGLPS 480

 

Query:   476 MAKGSGPVVAPRLFTAILTRRF 497

             MAKGSGPVVAPRLFTAILTRRF

Sbjct:   481 MAKGSGPVVAPRLFTAILTRRF 502

 

compare to cDNA

 

>gi|30738516|gb|CB396805.1|CB396805  UniGene info OSTR179D10_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 592

 

 Score = 85.5 bits (210), Expect = 4e-18

 Identities = 43/53 (81%), Positives = 45/53 (84%)

 Frame = -3

 

remove the ANSKNLN fragment Keep GDSET

 

Query: 1   NSESTDFVETFLKEQKRRESEGDSETANSKNLNFSNIQLLNVCIDLWFAGLNT 53

           NSESTDFVETFLKEQKRRESEGDSET       FSN+QL NVC+DLWFAGLNT

Sbjct: 320 NSESTDFVETFLKEQKRRESEGDSET-------FSNVQLSNVCMDLWFAGLNT 183

 

>CYP33C1 C45H4.2 502 aa revised 3/20/04 small deletion

MIIILLLTFLTIYFVYELYWKRRNFPPGPCPLPVFGNLLSIANPPPGYKAFER

WTKKYGDVYTFWIGNTPHIMINTWDKIKETFIRDADTYTNKVVLPMVTLSR

GGEYGIIDSNGAMWREHRRFALSTMRDFGLGKNLMQENILMEVQDVFARLD

AKLGSETDVPEVFDHAVANVVNQLLFGYRFMGPKENEYQELKHIIDSPAEIF

GKLHIFLAMNIPIFAKLLPESLYEGPIKTFRDTTLAFFNKQIEAHRHRIDFEDL

NSESTDFVETFLKEQKRRESEGDSETFSNIQLLNVCIDLWFAGLNT

TTNTITWAISYVLHHPEVQDKIHEELDKVIGSDRLITTADKNDLPYFNASINE

SQRGINILPLNLQHATTRDTVIDGFKIPKGTGVVAQISTVMNNEEVFPDPYTF

NPDRFIDENGKLKKVDELAPFSVGKRSCPGEGLARMELFLFIANFLNRYQ IHPS

KEGLPSMAKGSGPVVAPRLFTAILTRRF*

 

CYP33E1

 

>CYP33E1 C49C8.4    U61945 494 aa

         Length = 495

 

 Score = 2605 (917.0 bits), Expect = 2.4e-274, P = 2.4e-274

 Identities = 493/494 (99%), Positives = 493/494 (99%)

 

Query:     1 MILLILTSILIIYLFNHFYWKRRKLPPGPIPLPIIGNLYLMTEDVKPGYKMYEKLKDKYG 60

             MILLILTSILIIYLFNHFYWKRRKLPPGPIPLPIIGNLYLMTEDVKPGYKMYEKLKDKYG

Sbjct:     1 MILLILTSILIIYLFNHFYWKRRKLPPGPIPLPIIGNLYLMTEDVKPGYKMYEKLKDKYG 60

 

Query:    61 PVFTFWLANLPMVTVTDWKLIKQHFIKDGANFVGRPEFPISMEMRQGPYGIIESHGDRWI 120

             PVFTFWLANLPMVTVTDWKLIKQHFIKDGANFVGRPEFPISMEMRQGPYGIIESHGDRWI

Sbjct:    61 PVFTFWLANLPMVTVTDWKLIKQHFIKDGANFVGRPEFPISMEMRQGPYGIIESHGDRWI 120

 

Query:   121 QQRRFALHILRDFGLGKNLMEEKVLGEVTAMIDSIRKSMEDVDMQNIFDASVGSVINNML 180

             QQRRFALHILRDFGLGKNLMEEKVLGEVTAMIDSIRKSMEDVDMQNIFDASVGSVINNML

Sbjct:   121 QQRRFALHILRDFGLGKNLMEEKVLGEVTAMIDSIRKSMEDVDMQNIFDASVGSVINNML 180

 

Query:   181 FGYRYDETNIEEFLELKNRMNKHFKLAAEPMGGLIGMNPWLGHLPFFKGYKNVIMHNWMG 240

             FGYRYDETNIEEFLELKNRMNKHFKLAAEPMGGLIGMNPWLGHLPFFKGYKNVIMHNWMG

Sbjct:   181 FGYRYDETNIEEFLELKNRMNKHFKLAAEPMGGLIGMNPWLGHLPFFKGYKNVIMHNWMG 240

 

Query:   241 LMEMFRKQATDRLASIDYDSDEYSDYVEAFLKERKKHENEQDFGGFEMEQLDSVCFDLWV 300

             LMEMFRKQATDRLASIDYDSDEYSDYVEAFLKERKKHENEQDFGGF MEQLDSVCFDLWV

Sbjct:   241 LMEMFRKQATDRLASIDYDSDEYSDYVEAFLKERKKHENEQDFGGFRMEQLDSVCFDLWV 300

 

Query:   301 AGMETTSNTLNWALLYVLLNPEVRQKVYEELEREIGSDRIITTTDRPKLNYINATVNESQ 360

             AGMETTSNTLNWALLYVLLNPEVRQKVYEELEREIGSDRIITTTDRPKLNYINATVNESQ

Sbjct:   301 AGMETTSNTLNWALLYVLLNPEVRQKVYEELEREIGSDRIITTTDRPKLNYINATVNESQ 360

 

Query:   361 RLANLLPMNLSRSTNADVEIAGYRIPKDTVITPQISSVMYDPEIFPEPYEFKPERFLESD 420

             RLANLLPMNLSRSTNADVEIAGYRIPKDTVITPQISSVMYDPEIFPEPYEFKPERFLESD

Sbjct:   361 RLANLLPMNLSRSTNADVEIAGYRIPKDTVITPQISSVMYDPEIFPEPYEFKPERFLESD 420

 

Query:   421 GSLKKVEELVPFSIGKRQCLGEGLAKMELFLYFANLFNKFDIKFHESNPNPSIKKEVGVT 480

             GSLKKVEELVPFSIGKRQCLGEGLAKMELFLYFANLFNKFDIKFHESNPNPSIKKEVGVT

Sbjct:   421 GSLKKVEELVPFSIGKRQCLGEGLAKMELFLYFANLFNKFDIKFHESNPNPSIKKEVGVT 480

 

Query:   481 MKAKNYRVSMKERY 494

             MKAKNYRVSMKERY

Sbjct:   481 MKAKNYRVSMKERY 494

 

E is created at intron joint

 

CYP35A4

 

>CYP35A4 C49G7 494 aa Revised 3/19/04

         Length = 494

 

 Score = 2535 (892.4 bits), Expect = 6.3e-267, P = 6.3e-267

 Identities = 492/494 (99%), Positives = 492/494 (99%)

 

Query:     1 MFVILFISAILSWLIVRQYQKVSRLPPGPVSLPLIGNLPQIFYYLWTTGSIVSTLDLFRK 60

             MFVILFISAILSWLIVRQYQKVSRLPPGPVSLPLIGNLPQIFYYLWTTGSIVSTLDLFRK

Sbjct:     1 MFVILFISAILSWLIVRQYQKVSRLPPGPVSLPLIGNLPQIFYYLWTTGSIVSTLDLFRK 60

 

Query:    61 RYGNIFTLWVGPLPHVSIADYEISHEVFVKNGSKYADKFHAPVMQDIRKDMGIMVTNGDH 120

             RYGNIFTLWVGPLPHVSIADYEISHEVFVKNGSKYADKFHAPVMQDIR  MGIMVTNGDH

Sbjct:    61 RYGNIFTLWVGPLPHVSIADYEISHEVFVKNGSKYADKFHAPVMQDIRSIMGIMVTNGDH 120

 

Query:   121 WQHMRRFSLQTFRNMGVGKDIMETKIMQELDARCAEIDTSAMNGVTVTQASEFFELTVGS 180

             WQHMRRFSLQTFRNMGVGKDIMETKIMQELDARCAEIDTSAMNGVTVTQASEFFELTVGS

Sbjct:   121 WQHMRRFSLQTFRNMGVGKDIMETKIMQELDARCAEIDTSAMNGVTVTQASEFFELTVGS 180

 

Query:   181 IINSILVGKRFDATTKHEFLKIIETMDASIETASFFDMMVPVWILKTFFKHRYDNLSDAF 240

             IINSILVGKRFDATTKHEFLKIIETMDASIETASFFDMMVPVWILKTFFKHRYDNLSDAF

Sbjct:   181 IINSILVGKRFDATTKHEFLKIIETMDASIETASFFDMMVPVWILKTFFKHRYDNLSDAF 240

 

Query:   241 EVSKAFSAAEAIKRVDQIKSGRYFIDENNLQDYTDAFLLKIEKEGQCQDFNMETLKTMIG 300

             EVSKAFSAAEAIKRVDQIKSGRYFIDENNLQDYTDAFLLKIEKEGQCQDFNMETLKTMIG

Sbjct:   241 EVSKAFSAAEAIKRVDQIKSGRYFIDENNLQDYTDAFLLKIEKEGQCQDFNMETLKTMIG 300

 

Query:   301 DLWITGQETTTTTLISGFNQLLLHPEVMVKAREELMKVTENGSRSLSLTDRASTPYLNAM 360

             DLWITGQETTTTTLISGFNQLLLHPEVMVKAREELMKVTENGSRSLSLTDRASTPYLNAM

Sbjct:   301 DLWITGQETTTTTLISGFNQLLLHPEVMVKAREELMKVTENGSRSLSLTDRASTPYLNAM 360

 

Query:   361 IGEIQRHASILNVNFWKINKEFTYMGGHPVDAGALVTAQLGTLHVNETVFENPLKFDPER 420

             IGEIQRHASILNVNFWKINKEFTYMGGHPVDAGALVTAQLGTLHVNETVFENPLKFDPER

Sbjct:   361 IGEIQRHASILNVNFWKINKEFTYMGGHPVDAGALVTAQLGTLHVNETVFENPLKFDPER 420

 

Query:   421 YIRDENLLQKVIPFGVGKRSCLGESLARAELYLIFGNLLLRYKFESSSKLSTTELAPYSI 480

             YIRDENLLQKVIPFGVGKRSCLGESLARAELYLIFGNLLLRYKFESSSKLSTTELAPYSI

Sbjct:   421 YIRDENLLQKVIPFGVGKRSCLGESLARAELYLIFGNLLLRYKFESSSKLSTTELAPYSI 480

 

Query:   481 GKRPFKLEMKFVKI 494

             GKRPFKLEMKFVKI

Sbjct:   481 GKRPFKLEMKFVKI 494

 

KD forms at intron joint

 

CYP33C9

 

>CYP33C9 C50H11 495 aa

         Length = 497

 

 Score = 2609 (918.4 bits), Expect = 9.1e-275, P = 9.1e-275

 Identities = 492/496 (99%), Positives = 493/496 (99%)

 

Query:     1 MVLNIILVVVVAFLFHHLYWKRRNWPAGPTPLPLIGNLLSLRNPAPGYKAFARWTAKYGD 60

             MVLNIILVVVVAFLFHHLYWKRRN+  GPTPLPLIGNLLSLRNPAPGYKAFARWTAKYGD

Sbjct:     1 MVLNIILVVVVAFLFHHLYWKRRNFFPGPTPLPLIGNLLSLRNPAPGYKAFARWTAKYGD 60

 

Query:    61 IYTFWLGTRPYILVSSYEALKETFIKDGETYADKKPMAFQESFRGGSYGVVETNGPFWRE 120

             IYTFWLGTRPYILVSSYEALKETFIKDGETYADKKPMAFQESFRGGSYGVVETNGPFWRE

Sbjct:    61 IYTFWLGTRPYILVSSYEALKETFIKDGETYADKKPMAFQESFRGGSYGVVETNGPFWRE 120

 

Query:   121 HRRFAIHQFRDFGLGKDRMEQRIMLEVEDIFNNCDKTIGEGVDLTDIFDRAVGNVINQML 180

             HRRFAIHQFRDFGLGKDRMEQRIMLEVEDIFNNCDKTIGEGVDLTDIFDRAVGNVINQML

Sbjct:   121 HRRFAIHQFRDFGLGKDRMEQRIMLEVEDIFNNCDKTIGEGVDLTDIFDRAVGNVINQML 180

 

Query:   181 FGYRFDETRADEFRTIRAFFNFNSGEFASFSMRVQFFLPWMGYIMPGPTILDRFKKYQKG 240

             FGYRFDETRADEFRTIRAFFNFNSGEFASFSMRVQFFLPWMGYIMPGPTILDRFKKYQKG

Sbjct:   181 FGYRFDETRADEFRTIRAFFNFNSGEFASFSMRVQFFLPWMGYIMPGPTILDRFKKYQKG 240

 

Query:   241 FTEFFGTQIENHKKEIDFELEENSDYVEAFLKEQRKREASGDFESFSTKQLSNMCLDLWF 300

             FTEFFGTQIENHKKEIDFELEENSDYVEAFLKEQRKREASGDFES STKQLSNMCLDLWF

Sbjct:   241 FTEFFGTQIENHKKEIDFELEENSDYVEAFLKEQRKREASGDFESSSTKQLSNMCLDLWF 300

 

Query:   301 AALMTTSNTMTWCFAYTLNYLDAQQKLHEELDRVIGSERHINTADKPNLPYTNAYINEIQ 360

             AALMTTSNTMTWCFAYTLNYLDAQQKLHEELDRVIGSERHINTADKPNLPYTNAYINEIQ

Sbjct:   301 AALMTTSNTMTWCFAYTLNYLDAQQKLHEELDRVIGSERHINTADKPNLPYTNAYINEIQ 360

 

Query:   361 RTANLVPLNLLHMTTRDTVLKGYNIPKGTGVVAQISTVMYDENVFPEPYIFKPERFLDDD 420

             RTANLVPLNLLHMTTRDTVLKGYNIPKGTGVVAQISTVMYDENVFPEPYIFKPERFLDDD

Sbjct:   361 RTANLVPLNLLHMTTRDTVLKGYNIPKGTGVVAQISTVMYDENVFPEPYIFKPERFLDDD 420

 

Query:   421 GKLKKVEQLVPFSVGKRQCLGEGLARMELFLFIANFFNRYRVVPDANGPPIIDKAVLGGM 480

             GKLKKVEQLVPFSVGKRQCLGEGLARMELFLFIANFFNRYRVVPDANGPPIIDKAVLGGM

Sbjct:   421 GKLKKVEQLVPFSVGKRQCLGEGLARMELFLFIANFFNRYRVVPDANGPPIIDKAVLGGM 480

 

Query:   481 HTKEFKAILQRRHVSE 496

             HTKEFKAILQRRHVSE

Sbjct:   481 HTKEFKAILQRRHVSE 496

 

correct seq at two exon boundaries

 

CYP43A1

 

>CYP43A1 E03E2.1 485 aa

         Length = 485

 

 Score = 2143 (754.4 bits), Expect = 9.1e-247, Sum P(2) = 9.1e-247

 Identities = 406/406 (100%), Positives = 406/406 (100%)

 

Query:   101 PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGLSTVKLKLLFPTMDTCVSEFMDHVNS 160

             PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGLSTVKLKLLFPTMDTCVSEFMDHVNS

Sbjct:    80 PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGLSTVKLKLLFPTMDTCVSEFMDHVNS 139

 

Query:   161 LSDGQSVVINHSHSLFQNHTSYVLARKAFGNFAELQKSTAEKIAYIFPETKLIFKNSFVG 220

             LSDGQSVVINHSHSLFQNHTSYVLARKAFGNFAELQKSTAEKIAYIFPETKLIFKNSFVG

Sbjct:   140 LSDGQSVVINHSHSLFQNHTSYVLARKAFGNFAELQKSTAEKIAYIFPETKLIFKNSFVG 199

 

Query:   221 HFLKSATQQKFLDYLLHLISNFQSRKNVDNNNGICCTENDHYSLLGFFFEHHNEKKLIEK 280

             HFLKSATQQKFLDYLLHLISNFQSRKNVDNNNGICCTENDHYSLLGFFFEHHNEKKLIEK

Sbjct:   200 HFLKSATQQKFLDYLLHLISNFQSRKNVDNNNGICCTENDHYSLLGFFFEHHNEKKLIEK 259

 

Query:   281 AEGQIDMKKVKVEKSISYEEITAQCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEA 340

             AEGQIDMKKVKVEKSISYEEITAQCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEA

Sbjct:   260 AEGQIDMKKVKVEKSISYEEITAQCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEA 319

 

Query:   341 EIKNQPEQISFETVSSLRLLQNCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIV 400

             EIKNQPEQISFETVSSLRLLQNCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIV

Sbjct:   320 EIKNQPEQISFETVSSLRLLQNCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIV 379

 

Query:   401 VNPWGPHHDREIWGNDVDCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMRFALLEMKTTA 460

             VNPWGPHHDREIWGNDVDCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMRFALLEMKTTA

Sbjct:   380 VNPWGPHHDREIWGNDVDCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMRFALLEMKTTA 439

 

Query:   461 FRMLQKYSVFTNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLVLKQR 506

             FRMLQKYSVFTNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLVLKQR

Sbjct:   440 FRMLQKYSVFTNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLVLKQR 485

 

 Score = 225 (79.2 bits), Expect = 9.1e-247, Sum P(2) = 9.1e-247

 Identities = 43/43 (100%), Positives = 43/43 (100%)

 

Query:    58 MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI 100

             MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI

Sbjct:     1 MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI 43

 

 

>cb25.fpc3857 ACC=CAAC01000068 CYP43

            Length = 474

 

 Score = 1243 (437.6 bits), Expect = 3.8e-193, Sum P(2) = 3.8e-193

 Identities = 243/363 (66%), Positives = 290/363 (79%)

 

Query:   152 SEFMDHVNSLSDGQ-SVVIN---HSHSLFQNH--TSYV-LARKAFGNFAELQKSTAEKIA 204

             S  +DH +SL     S V+    + H   QNH   +++ +   AFG  ++ QKS  EKI

Sbjct:   109 SVVIDHSHSLFQNHTSYVLARCAYGHKE-QNHRVNNFLSVFSDAFGALSDFQKSMTEKIT 167

 

Query:   205 YIFPETKLIFKNSFVGHFLKSATQQKFLDYLLHLISNFQSRKNVDNNNGICCTEND--HY 262

             Y FPE K IFKN+    FL+S  QQKFLDYLL+LIS FQSR+ ++NNN    ++ +   Y

Sbjct:   168 YFFPEMKSIFKNNLFAQFLQSTNQQKFLDYLLNLISKFQSRRVIENNNNEESSDPEAGKY 227

 

Query:   263 SLLGFFFEHHNEKKLIEKAEGQIDMKKVKVEKSISYEEITAQCKFISVAGFDTTSNTLTL 322

             SLL FFFEHH+EKK++EKAEG+IDMKKVKVEKSISY+EITAQCKFISVAGFDTT+NTLTL

Sbjct:   228 SLLEFFFEHHHEKKIVEKAEGRIDMKKVKVEKSISYQEITAQCKFISVAGFDTTANTLTL 287

 

Query:   323 LFNFLANNPDVQDKIYEAEIKNQPEQISFETVSSLRLLQNCIFETLRLFPHASPLQTRIC 382

             LFNFLA+NP +Q++IY +EIK     ++FETV SL +LQNCIFETLRLFPHASPLQ RIC

Sbjct:   288 LFNFLAHNPAIQEQIYNSEIKGNKSSMNFETVCSLPILQNCIFETLRLFPHASPLQMRIC 347

 

Query:   383 TEPFKIGKYQFLENVQIVVNPWGPHHDREIWGNDVDCFRPSRFENLTEQQRKAFMPFGVG 442

             T P  IG+Y+F EN+Q+V+NPWGPH DR IWG+DV+CF+PSRFE+L+EQQRKAFMPFGVG

Sbjct:   348 TAPITIGQYKFDENMQVVINPWGPHRDRVIWGDDVNCFKPSRFESLSEQQRKAFMPFGVG 407

 

Query:   443 PRQCVGMRFALLEMKTTAFRMLQKYSVFTNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLV 502

             PRQCVGMRFALLE+KTTAFRMLQKY V +  PV DRHGK V MTVRDTGT+WPTDKLGLV

Sbjct:   408 PRQCVGMRFALLELKTTAFRMLQKYVVRSTQPVIDRHGKLVNMTVRDTGTVWPTDKLGLV 467

 

Query:   503 LKQR 506

             L +R

Sbjct:   468 LTKR 471

 

 Score = 618 (217.5 bits), Expect = 3.8e-193, Sum P(2) = 3.8e-193

 Identities = 113/134 (84%), Positives = 131/134 (97%)

 

Query:    58 MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRIPEILSDDPITSDNIHMF 117

             MCS++GD+TFS+LRGATPV++T++V+LIHAISTE FDCFHSRIPE LSDDPIT++NIHMF

Sbjct:     1 MCSVVGDKTFSMLRGATPVIVTNDVNLIHAISTESFDCFHSRIPEALSDDPITAENIHMF 60

 

Query:   118 AAKGERWKRLRTLTSYGLSTVKLKLLFPTMDTCVSEFMDHVNSLSDGQSVVINHSHSLFQ 177

             AAKGERWKRLRT+TSYGLSTVKLKLLFPT++TCVSEF+DHVNSLSDGQSVVI+HSHSLFQ

Sbjct:    61 AAKGERWKRLRTITSYGLSTVKLKLLFPTIETCVSEFLDHVNSLSDGQSVVIDHSHSLFQ 120

 

Query:   178 NHTSYVLARKAFGN 191

             NHTSYVLAR A+G+

Sbjct:   121 NHTSYVLARCAYGH 134

 

>CYP43A1 E03E2.1 485 aa green missed in CE29747

MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI CICCSILIVHQGSKM

KNYCNTAKILKLCIFYNNYLQ PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGL

STVKLKLLFPTMDTCVSEFMDHVNSLSDGQSVVINHSHSLFQNHTSYVLARKAFG

NFAELQKSTAEKIAYIFPETKLIFKNSFVGHFLKSATQQKFLDYLLHLISNFQSRKN

VDNNNGICCTENDHYSLLGFFFEHHNEKKLIEKAEGQIDMKKVKVEKSISYEEITA

QCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEAEIKNQPEQISFETVSSLRLLQ

NCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIVVNPWGPHHDREIWGNDV

DCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMRFALLEMKTTAFRMLQKYSVF

TNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLVLKQR

 

E03E2.1; CE29747. extra seq

MILLILLPVAIFTYVFLR ()

NFKLYYTLHKCGLNPRFDIFGIKGLLWLDSSA  50

AHENFTR MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI 100

 

PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGLSTVKLKL

LFPTMDTCVSEFMDHVNSLSDGQSVVINHSHSLFQNHTSYVLARKAFGNFAELQKSTA

EKIAYIFPETKLIFKNSFVGHFLKSATQQKFLDYLLHLISNFQSRKNVDN 250

NNGICCTENDHYSLLGFFFEHHNEKKLIEKAEGQIDMKKVKVEKSISYEE 300

ITAQCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEAEIKNQPEQIS 350

FETVSSLRLLQNCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIV 400

VNPWGPHHDREIWGNDVDCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMR 450

FALLEMKTTAFRMLQKYSVFTNSPVHDRHGKTVRMTVRDTGTIWPTDKLG 500

LVLKQR                                             506

 

 

TGAATCAACAACTGTACTTGTCTGCAATTTTTTCAGTTCAATGATTCTACTTTTTCTTCT

GCCGACTGCATTATTAATTTATGTTTTAATAAGGTAAAGCTAACTAAATGTCAAGACACG

GCTCTCAGAATAGATTTTCAGAAATCTCAAAATATATTTCACACTCTATAGAAATGGACT

GAACCCCCAATTTGACATATTTGGAATCAAGGGGCTCCGATGGTTGGATAGTTCTAATGC

TCATGAAAACTTTACACAAATGTGTTCTGTTGTTGGTGATAAAACATTTTCTATGCTCCG

CGGTGCAACACCGGTCATTGTGACCAATGATGTTAATTTGATACATGCAATTTCGACGGA

AAGTTTTGATTGTTTCCATTCGCGAATTGTGAGTTTTATTTAGAATAGAAAAACAATACT

GGAAAATTCATTAAATTGGAAAATATATAATTCCAGCCAGAAGCTTTATCTGATGACCCG

ATCACAGCCGAAAATATCCACATGTTTGCTGCCAAAGGTGAGCGCTGGAAACGTCTTCGA

ACAATAACGAGTTATGGATTATCCACAGTTAAGTTAAAATTGGTATGTCCCATGACAGTT

TTGTTGTCCTTTTTATCTAATCATGTTTGTTCAGTTGTTCCCAACCATTGAGACTTGTGT

TTCTGAATTTTTGGATCACGTGAACAGCCTGTCCGATGGTCAAAGTGTTGTCATCGATCA

TTCTCATAGGTTTGTATAATCATCTTCTATTTATAGATTTGACTTTTTTCAGTCTCTTTC

AAAATCATACATCTTACGTTCTAGCTCGATGTGCTTACGGTCACAAAGAGCAAAATCATC

 

>EMBOSS_001_1

*INNCTCLQFFQFNDSTFSSADCIINLCFNKVKLTKCQDTALRIDFQKSQNIFHTL*KWT

EPPI*HIWNQGAPMVG*F*CS*KLYTNVFCCW**NIFYAPRCNTGHCDQ*C*FDTCNFDG

KF*LFPFANCEFYLE*KNNTGKFIKLENI*FQPEALSDDPITAENIHMFAAKGERWKRLR

TITSYGLSTVKLKLVCPMTVLLSFLSNHVCSVVPNH*DLCF*IFGSREQPVRWSKCCHRS

FS*VCIIIFYL*I*LFSVSFKIIHLTF*LDVLTVTKSKII

>EMBOSS_001_2

ESTTVLVCNFFSSMILLFLLPTALLIYVLIR*S*LNVKTRLSE*IFRNLKIYFTLYRNGL

NPQFDIFGIKGLRWLDSSNAHENFTQMCSVVGDKTFSMLRGATPVIVTNDVNLIHAISTE

SFDCFHSRIVSFI*NRKTILENSLNWKIYNSSQKLYLMTRSQPKISTCLLPKVSAGNVFE

Q*RVMDYPQLS*NWYVP*QFCCPFYLIMFVQLFPTIETCVSEFLDHVNSLSDGQSVVIDH

SHRFV*SSSIYRFDFFQSLSKSYILRSSSMCLRSQRAKS

>EMBOSS_001_3

NQQLYLSAIFSVQ*FYFFFCRLHY*FMF**GKAN*MSRHGSQNRFSEISKYISHSIEMD*

TPNLTYLESRGSDGWIVLMLMKTLHKCVLLLVIKHFLCSAVQHRSL*PMMLI*YMQFRRK

VLIVSIREL*VLFRIEKQYWKIH*IGKYIIPARSFI**PDHSRKYPHVCCQR*ALETSSN

NNELWIIHS*VKIGMSHDSFVVLFI*SCLFSCSQPLRLVFLNFWIT*TACPMVKVLSSII

LIGLYNHLLFIDLTFFSLFQNHTSYVLARCAYGHKEQNH

 

 

GAGTCAACAATTTTCTTTCGGTGTTTTCTGATGCGTTCGGGGCATTGTCAGATTTTCAAA

AATCAATGACTGAGAAAATTACTTGTGAGTCACTAAAACATAATTCATCTAATTGAGAAA

TCTATTTGCAGATTTTTTCCCAGAAATGAAGTCAATTTTCAAAAACAACTTATTTGCCCA

ATTTTTACAAAGTACAAATCAACAAAAATTCTTGGATTATTTACTGAATCTGATCTCAAA

ATTTCAATCCCGAAGAGTTATCGAAAACAATAATAATGAAGAATCAAGCGACCCCGAAGC

TGGCAAGTACAGTTTATTGGAGTTTTTCTTCGAACATCATCATGAAAAAAAGATTGTGGA

AAAGGCGGAAGGGCGGATTGACATGAAGAAAGTGAAAGTCGAGAAAAGTATTTCTTATCA

GGTGTGAGTCTCGGTTATCTTGTAGTCTAAAAAACGCAGTACATTAGAGTTTCTGCTGGC

TACAATGTTCATTTGTTTTGCAAATATTGAAAATTGTTTTGCAAAATATTCAATGAAAGA

AAACCGAAACTAATTTATTGCAATACTGGAGAATGTAAACATTTACTATCGTTTGTTTTT

TTCAATAGAAAAATAGACATTGGAAACTTTGGAGTCCCAAAAAGTGATCCAAATATTTTT

GAATATCGGAAAATGTTAGAGCATACTAAACAAAATTGGAGCTTCGGTTTTATCGTCGAG

AACCAAACTCTTATTACAGCTTCAGAAAAGTTAAATAAATCGAATTAGAAAAGGTTTTTT

TGAGTGATACCGGTACACTACTGGACAAAAATTTATTCTCGATTTTTTTAAAAGAATATT

GAGTTGGCAGGTCTAAAAATTATTTTACTGCTCTGAAAACTTTGTTTTTCAATGTAGACA

TAAAAAGAGATTAACTTAGGTAATGAAATTTGTCTTTAGTACATTCTGAAATTTTCAGAC

AAATCTGAAATATTTAGACTA

 

>CYP37A1 F01D5    Z81493 516 aa also Y39G8 Z92851

         Length = 517

 

 Score = 2596 (913.8 bits), Expect = 2.2e-273, P = 2.2e-273

 Identities = 503/524 (95%), Positives = 505/524 (96%)

 

Query:     1 MGIAVYLLALVVIYVVFNLSKILKFVKERMRLYHLMSKIDGPLALPLLGTTFQFKMDPVE 60

             MGIAVYLLALVVIYVVFNLSKILKFVKERMRLYHLMSKIDGPLALPLLGTTFQFKMDPVE

Sbjct:     1 MGIAVYLLALVVIYVVFNLSKILKFVKERMRLYHLMSKIDGPLALPLLGTTFQFKMDPVE 60

 

Query:    61 FALQLYNWGLEYSTKGSSLAAFWMGPYPMVIVLTPEANKVQTRNMSGAQNFHIFQKVLES 120

             FALQLYNWGLEYSTKGSSLAAFWMGPYPMVIVLTPEANKVQTRNMSGAQNFHIFQKVLES

Sbjct:    61 FALQLYNWGLEYSTKGSSLAAFWMGPYPMVIVLTPEANKVQTRNMSGAQNFHIFQKVLES 120

 

Query:   121 NALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLIDFQVVFNSQSMILL 180

             NALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLID    ++ Q  ILL

Sbjct:   121 NALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLIDEFANWDFQ--ILL 178

 

Query:   181 EQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTVSAQTNHTHPYVVAVNEMNSLAF 240

             EQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTVSAQTNHTHPYVVAVNEMNSLAF

Sbjct:   179 EQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTVSAQTNHTHPYVVAVNEMNSLAF 238

 

Query:   241 KYQRMPWLWIKPIRQLIGYEADFQRNLDIVTSFTKKVIDRKLREHDETDGMVVVEEESKK 300

             KYQRMPWLWIKPIRQLIGYEADFQRNLDIVTSFTKKVIDRKLREHDETDGMVVVEEESKK

Sbjct:   239 KYQRMPWLWIKPIRQLIGYEADFQRNLDIVTSFTKKVIDRKLREHDETDGMVVVEEESKK 298

 

Query:   301 KAFLDMLIEKKEEGGLGYEDIREEVDTFMFEGHDTTSAGIGWSLWCLANCPEYQKKCHEE 360

             KAFLDMLIE   EGGLGYEDIREEVDTFMFEGHDTTSAGIGWSLWCLANCPEYQKKCHEE

Sbjct:   299 KAFLDMLIE---EGGLGYEDIREEVDTFMFEGHDTTSAGIGWSLWCLANCPEYQKKCHEE 355

 

Query:   361 LDEIFEGTSRECSVEDLKKMKYLEKCVKEALRMRPSVPQMARSVEEEVEIDGKILPKGCS 420

             LDEIF     E    DLKKMKYLEKCVKEALRMRPSVPQMARSVEEEVEIDGKILPKGCS

Sbjct:   356 LDEIF---GEEFDSFDLKKMKYLEKCVKEALRMRPSVPQMARSVEEEVEIDGKILPKGCS 412

 

Query:   421 VMISPAFIQNNPRTFPNHEVFDPERFNEDEISKRHAYAYIPFSAGPRNCIGQKFAMQEEK 480

             VMISPAFIQNNPRTFPNHEVFDPERFNEDEISKRHAYAYIPFSAGPRNCIGQKFAMQEEK

Sbjct:   413 VMISPAFIQNNPRTFPNHEVFDPERFNEDEISKRHAYAYIPFSAGPRNCIGQKFAMQEEK 472

 

Query:   481 TVISWVLRRFHIHTDIGLLENMPLPETITRPSLGFPLKFTVRQQ 524

             TVISWVLRRFHIHTDIGLLENMPLPETITRPSLGFPLKFTVRQQ

Sbjct:   473 TVISWVLRRFHIHTDIGLLENMPLPETITRPSLGFPLKFTVRQQ 516

 

compare to EST

 

>gi|14837585|dbj|AU205355.1|AU205355  UniGene info AU205355 unpublished oligo-capped cDNA library, stage L4

           Caenorhabditis elegans cDNA clone yk852g07 5'.

          Length = 603

 

 Score =  183 bits (465), Expect = 6e-48

 Identities = 94/104 (90%), Positives = 95/104 (91%), Gaps = 2/104 (1%)

 Frame = +1

 

Query: 11  QKVLESNALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLID--EFANW 68

           +KVLESNALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLID     N

Sbjct: 208 KKVLESNALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLIDFQVVFNS 387

 

Query: 69  DFQILLEQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTV 112

              ILLEQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTV

Sbjct: 388 QSMILLEQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTV 519

 

>gi|18321101|dbj|BJ153116.1|BJ153116   BJ153116 unpublished oligo-capped cDNA library, C. elegans L1 stage

           Caenorhabditis elegans cDNA clone yk1315e09 3'.

          Length = 725

 

 Score =  243 bits (620), Expect = 2e-65

 Identities = 124/136 (91%), Positives = 124/136 (91%), Gaps = 6/136 (4%)

 Frame = -1

 

Query: 1   KKVIDRKLREHDETDGMVVVEEESKKKAFLDMLIE---EGGLGYEDIREEVDTFMFEGHD 57

           KKVIDRKLREHDETDGMVVVEEESKKKAFLDMLIE   EGGLGYEDIREEVDTFMFEGHD

Sbjct: 623 KKVIDRKLREHDETDGMVVVEEESKKKAFLDMLIEKKEEGGLGYEDIREEVDTFMFEGHD 444

 

Query: 58  TTSAGIGWSLWCLANCPEYQKKCHEELDEIF---GEEFDSFDLKKMKYLEKCVKEALRMR 114

           TTSAGIGWSLWCLANCPEYQKKCHEELDEIF     E    DLKKMKYLEKCVKEALRMR

Sbjct: 443 TTSAGIGWSLWCLANCPEYQKKCHEELDEIFEGTSRECSVEDLKKMKYLEKCVKEALRMR 264

 

Query: 115 PSVPQMARSVEEEVEI 130

           PSVPQMARSVEEEVEI

Sbjct: 263 PSVPQMARSVEEEVEI 216

 

This is the correct seq UNIPROT

MGIAVYLLALVVIYVVFNLSKILKFVKERMRLYHLMSKIDGPLALPLLGT  50

TFQFKMDPVEFALQLYNWGLEYSTKGSSLAAFWMGPYPMVIVLTPEANKV 100

QTRNMSGAQNFHIFQKVLESNALINKSSEYDIFLPWLGTGLLLASGEKWR 150

GRRKMMTPSFHFNVLIDFQVVFNSQSMILLEQIENAAKKTDDSTIDAFPY 200

IKRCALDIICETAMGTTVSAQTNHTHPYVVAVNEMNSLAFKYQRMPWLWI 250

KPIRQLIGYEADFQRNLDIVTSFTKKVIDRKLREHDETDGMVVVEEESKK 300

KAFLDMLIEKKEEGGLGYEDIREEVDTFMFEGHDTTSAGIGWSLWCLANC 350

PEYQKKCHEELDEIFEGTSRECSVEDLKKMKYLEKCVKEALRMRPSVPQM 400

ARSVEEEVEIDGKILPKGCSVMISPAFIQNNPRTFPNHEVFDPERFNEDE 450

ISKRHAYAYIPFSAGPRNCIGQKFAMQEEKTVISWVLRRFHIHTDIGLLE 500

NMPLPETITRPSLGFPLKFTVRQQ

 

CYP13B1

 

F02C12.5a; CE09177.

F02C12.5b; CE09178.

F02C12.5c; CE23627.

MGAIIVLVVLFATIAGYFKWIHTYWRRRGISGPEGLPFIGNYYDLADVNK  50

PRGYLIHKWTQKFGKVFGYYEGAVPVLVVSDMDMLQELFLKKFDNFYARK 100

STNHIHGNLECSKSEPRINLFTSRGARWKRLRALASPGFSVKALKQVHDV 150

MEDSAINMVDLMAKHEDGKPFNIHAYFQEFTYDVISRLAMGQPNSELFNN 200

SGVEIVKSIFMRTHRVLPWYFTVLFPQFEHLVKRMFYNHAAVQGGDIEKL 250

LLICKKTVESRIQEREENAKLGFENAENDFIDMFLNYYSEQVEDIEFGST 300

VEKKVTAEDVIGACFVFLLAGFDTTANSLAYASYLLAKHPEKMKLAQEEV 350

DTVVGSENVSYDDMTKLKYLDAVVRESLRLYPVAWFACSRECVKPTTLGD 400

IYIDKGVKIEADVMSLHRSKEIWGENADDFVPERWLEPSSRHTMSWIPFG 450

AGPRQCVGMRLGLSEAKTALAHLLRRYDLVAGVETEVRLTLRFFFIFKSS 500

QIQKELNILGCTTTSPEAVTLYLKPRI                        527

 

>CYP13B1 F02C12.5   Z54269  (C-terminal) C29F7 Z92827 N-terminal 511 aa

         Length = 511

 

 Score = 2512 (884.3 bits), Expect = 7.3e-276, Sum P(2) = 7.3e-276

 Identities = 483/491 (98%), Positives = 484/491 (98%)

 

Query:     1 MGAIIVLVVLFATIAGYFKWIHTYWRRRGISGPEGLPFIGNYYDLADVNKPRGYLIHKWT 60

             MGAIIVLVVLFATIAGYFKWIHTYWRRRGISGPEGLPFIGNYYDLADVNKPRGYLIHKWT

Sbjct:     1 MGAIIVLVVLFATIAGYFKWIHTYWRRRGISGPEGLPFIGNYYDLADVNKPRGYLIHKWT 60

 

Query:    61 QKFGKVFGYYEGAVPVLVVSDMDMLQELFLKKFDNFYARKSTNHIHGNLECSKSEPRINL 120

             Q FGKVFGYYEGAVPVLVVSDMDMLQELFLKKFDNFYARKSTNHIHGNLECSKSEPRINL

Sbjct:    61 QVFGKVFGYYEGAVPVLVVSDMDMLQELFLKKFDNFYARKSTNHIHGNLECSKSEPRINL 120

 

Query:   121 FTSRGARWKRLRALASPGFSVKALKQV--HDVMEDSAINMVDLMAKHEDGKPFNIHAYFQ 178

             FTSRGARWKRLRALASPGFSVKALKQV  HDVMEDSAINMVDLMAKHEDGKPFNIH YFQ

Sbjct:   121 FTSRGARWKRLRALASPGFSVKALKQVYVHDVMEDSAINMVDLMAKHEDGKPFNIHRYFQ 180

 

Query:   179 EFTYDVISRLAMGQPNSELFNNSGVEIVKSIFMRTHRVLPWYFTVLFPQFEHLVKRMFYN 238

             EFTYDVISRLAMGQPNSELFNNSGVEIVK IFMRTHRVLPWYFTVLFPQFEHLVKRMFYN

Sbjct:   181 EFTYDVISRLAMGQPNSELFNNSGVEIVK-IFMRTHRVLPWYFTVLFPQFEHLVKRMFYN 239

 

Query:   239 HAAVQGGDIEKLLLICKKTVESRIQEREENAKLGFENAENDFIDMFLNYYSEQVEDIEFG 298

             HAAVQGGDIEKLLLICKKTVESRIQERE NAKLGFENAENDFIDMFLNYYSEQVEDIEFG

Sbjct:   240 HAAVQGGDIEKLLLICKKTVESRIQERE-NAKLGFENAENDFIDMFLNYYSEQVEDIEFG 298

 

Query:   299 STVEKKVTAEDVIGACFVFLLAGFDTTANSLAYASYLLAKHPEKMKLAQEEVDTVVGSEN 358

             STVEKKVTAEDVIGACFVFLLAGFDTTANSLAYASYLLAKHPEKMKLAQEEVDTVVGSEN

Sbjct:   299 STVEKKVTAEDVIGACFVFLLAGFDTTANSLAYASYLLAKHPEKMKLAQEEVDTVVGSEN 358

 

Query:   359 VSYDDMTKLKYLDAVVRESLRLYPVAWFACSRECVKPTTLGDIYIDKGVKIEADVMSLHR 418

             VSYDDMTKLKYLDAVVRESLRLYPVAWFACSRECVKPTTLGDIYIDKGVKIEADVMSLHR

Sbjct:   359 VSYDDMTKLKYLDAVVRESLRLYPVAWFACSRECVKPTTLGDIYIDKGVKIEADVMSLHR 418

 

Query:   419 SKEIWGENADDFVPERWLEPSSRHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYD 478

             SKEIWGENADDFVPERWLEPSSRHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYD

Sbjct:   419 SKEIWGENADDFVPERWLEPSSRHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYD 478

 

Query:   479 LVAGVETEVRLTL 491

             LVAGVETE  L +

Sbjct:   479 LVAGVETEKELNI 491

 

 Score = 131 (46.1 bits), Expect = 7.3e-276, Sum P(2) = 7.3e-276

 Identities = 27/44 (61%), Positives = 33/44 (75%)

 

Query:   484 ETEVRLTLRFFFIFKSSQIQKELNILGCTTTSPEAVTLYLKPRI 527

             +T +   LR + +    + +KELNILGCTTTSPEAVTLYLKPRI

Sbjct:   467 KTALAHLLRRYDLVAGVETEKELNILGCTTTSPEAVTLYLKPRI 510

 

compare to ests

 

>gi|30744632|gb|CB402905.1|CB402905  UniGene info OSTF221B8_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 556

 

 Score =  192 bits (489), Expect = 4e-50

 Identities = 97/100 (97%), Positives = 97/100 (97%)

 Frame = +1

 

Query: 1   ECSKSEPRINLFTSRGARWKRLRALASPGFSVKALKQVYVHDVMEDSAINMVDLMAKHED 60

           ECSKSEPRINLFTSRGARWKRLRALASPGFSVKALKQV  HDVMEDSAINMVDLMAKHED

Sbjct: 262 ECSKSEPRINLFTSRGARWKRLRALASPGFSVKALKQV--HDVMEDSAINMVDLMAKHED 435

 

Query: 61  GKPFNIHRYFQEFTYDVISRLAMGQPNSELFNNSGVEIVK 100

           GKPFNIH YFQEFTYDVISRLAMGQPNSELFNNSGVEIVK

Sbjct: 436 GKPFNIHAYFQEFTYDVISRLAMGQPNSELFNNSGVEIVK 555

 

 

>gi|14853552|dbj|AU215395.1|AU215395  UniGene info AU215395 unpublished oligo-capped cDNA library, stage L1

           Caenorhabditis elegans cDNA clone yk825b05 3'.

          Length = 623

 

 Score =  145 bits (365), Expect = 4e-36

 Identities = 70/70 (100%), Positives = 70/70 (100%)

 Frame = -3

 

F02C12.5a; CE09177.

F02C12.5b; CE09178.

F02C12.5c; CE23627.

These sequences fail to remove a short intron see below.

 

Query: 1   RHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYDLVAGVETEKELNILGCTTTSPE 60

           RHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYDLVAGVETEKELNILGCTTTSPE

Sbjct: 291 RHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYDLVAGVETEKELNILGCTTTSPE 112

 

Query: 61  AVTLYLKPRI 70

           AVTLYLKPRI

Sbjct: 111 AVTLYLKPRI 82

 

CYP14A5

 

F08F3.7; CE09262. 100%

MSVFIVALSVFIISYVISFYWKVRKYPKGPFPLPFFGNLLQFPADNIQEH  50

LDKLSKTYGPCFTVWTPLPAVVLTDYEHIKEAFVTQGDAFVNRAQRLPEI 100

LFQPHPNTGVVFSSGDNWKIQRRTALKILRDFGLGRNLMEEQVMRSVHEM 150

LAQLEHISDKKNVDMYWPIQLCVGNVINESLFGYHYKYEDAGRFEKFVKV 200

VDRHLKIAQGNASLLVSAFPWLRHLPVIGNLGYHSIKNNIKSYQQFIEEE 250

VTSQLKNYDGESEPENFVHAYMQQMKQTGNPNLDMTNLCASVLDFWLAGM 300

ETASNSLRWHLAFMMKYPEVQDKVRNEIFENIGTARLPSMSDKQNMPYTQ 350

AVIHEVQRCSNMVPILATHMNTEDVLVKEHNIPTGTLLFAQIWSVLKNDP 400

VFEENSKFNPDRYLMPDGKTLNKTVLERTIPFSVGKRNCVGEGLARMELF 450

LIFSALIQKYEFIPKTNVDLKPVYGGVITVKPYLCELVPQNA

 

CYP35D1

 

>CYP35D1 F14H3.10 498 aa

         Length = 499

 

 Score = 2595 (913.5 bits), Expect = 2.8e-273, P = 2.8e-273

 Identities = 498/499 (99%), Positives = 498/499 (99%)

 

add the extra K

 

Query:     1 MILILLFLTAITVITVRLYRKVLRFPPGPFPLPLIGNAHQIAYQAWRRGGILPALDYYRK 60

             MILILLFLTAITVITVRLYRKVLRFPPGPFPLPLIGNAHQIAYQAWRRGGILPALDYYRK

Sbjct:     1 MILILLFLTAITVITVRLYRKVLRFPPGPFPLPLIGNAHQIAYQAWRRGGILPALDYYRK 60

 

Query:    61 KYGNAYTLWLGPKASVSITDFETSQEVFVKQGKKCYNRQLAPILEHVTGGVGLLIANGEN 120

              YGNAYTLWLGPKASVSITDFETSQEVFVKQGKKCYNRQLAPILEHVTGGVGLLIANGEN

Sbjct:    61 -YGNAYTLWLGPKASVSITDFETSQEVFVKQGKKCYNRQLAPILEHVTGGVGLLIANGEN 119

 

Query:   121 WAEMRRFTLLTFRQMGVGTNIMEKRIMDELNGRCLEIDAQIARNDRAIVDVKFFDLTVGS 180

             WAEMRRFTLLTFRQMGVGTNIMEKRIMDELNGRCLEIDAQIARNDRAIVDVKFFDLTVGS

Sbjct:   120 WAEMRRFTLLTFRQMGVGTNIMEKRIMDELNGRCLEIDAQIARNDRAIVDVKFFDLTVGS 179

 

Query:   181 VINSFLIGKRFEDEEEFLKIKKLFDESSETFNIFDLNVPVWFLKTFLPSRFKLTWDSRHQ 240

             VINSFLIGKRFEDEEEFLKIKKLFDESSETFNIFDLNVPVWFLKTFLPSRFKLTWDSRHQ

Sbjct:   180 VINSFLIGKRFEDEEEFLKIKKLFDESSETFNIFDLNVPVWFLKTFLPSRFKLTWDSRHQ 239

 

Query:   241 IMDHVMKGVEERIRDIESGAYKIDPKKPNDVVDAFLSKMKKEEEIAGGQHPYYNLKSLKL 300

             IMDHVMKGVEERIRDIESGAYKIDPKKPNDVVDAFLSKMKKEEEIAGGQHPYYNLKSLKL

Sbjct:   240 IMDHVMKGVEERIRDIESGAYKIDPKKPNDVVDAFLSKMKKEEEIAGGQHPYYNLKSLKL 299

 

Query:   301 VLHDLWLAGQGTTATTLYVGFMKLVNHPGIIQNIQKELLDITENGARDLTLKDRPNTPYL 360

             VLHDLWLAGQGTTATTLYVGFMKLVNHPGIIQNIQKELLDITENGARDLTLKDRPNTPYL

Sbjct:   300 VLHDLWLAGQGTTATTLYVGFMKLVNHPGIIQNIQKELLDITENGARDLTLKDRPNTPYL 359

 

Query:   361 NATIAEIQRHASILNVNFWRINHETIHFNDYQVDPGTMIAAQVGVLHVNEELFDNPKDFN 420

             NATIAEIQRHASILNVNFWRINHETIHFNDYQVDPGTMIAAQVGVLHVNEELFDNPKDFN

Sbjct:   360 NATIAEIQRHASILNVNFWRINHETIHFNDYQVDPGTMIAAQVGVLHVNEELFDNPKDFN 419

 

Query:   421 VEKYLKNPKLLQQVIPFGIGKRSCVGEQIAKSELYLVFGNILLRYNVKKHGSTPTNEDVY 480

             VEKYLKNPKLLQQVIPFGIGKRSCVGEQIAKSELYLVFGNILLRYNVKKHGSTPTNEDVY

Sbjct:   420 VEKYLKNPKLLQQVIPFGIGKRSCVGEQIAKSELYLVFGNILLRYNVKKHGSTPTNEDVY 479

 

Query:   481 PYSSAKLPDVSGKLEFVKL 499

             PYSSAKLPDVSGKLEFVKL

Sbjct:   480 PYSSAKLPDVSGKLEFVKL 498

 

CYP37B1

 

>CYP37B1 F28G4 and T13F3 515 aa

         Length = 516

 

 Score = 2547 (896.6 bits), Expect = 3.4e-268, P = 3.4e-268

 Identities = 485/506 (95%), Positives = 488/506 (96%)

 

Query:     1 MHFLNIATIFLITIFLFYYKLIYNFIRDRLRIYKYMRKFDGPYSFPIIGTLYMVNIFDIS 60

             MHFLNIATIFLITIFLFYYKLIYNFIRDRLRIYKYMRKFDGPYSFPIIGTLYMVNIFDIS

Sbjct:     1 MHFLNIATIFLITIFLFYYKLIYNFIRDRLRIYKYMRKFDGPYSFPIIGTLYMVNIFDIS 60

 

Query:    61 KFTTQSMELAQYYCQKGCGTIGLWLGPVPLIAVINPQHAKEILESNEVITKAEEYDILFP 120

             KFTTQSMELAQYYCQKGCGTIGLWLGPVPLIAVINPQHAKEILESNEVITKAEEYDILFP

Sbjct:    61 KFTTQSMELAQYYCQKGCGTIGLWLGPVPLIAVINPQHAKEILESNEVITKAEEYDILFP 120

 

Query:   121 WLGTGLLTSTGSKWRQRRKMLTPAFHFKVLNDFLSVHDYQAKV-FLEQIKPYADSGKEVD 179

             WLGTGLLTSTGSKWRQRRKMLTPAFHFKVLNDFLSVHDYQAKV F  QIKPYADSGKEVD

Sbjct:   121 WLGTGLLTSTGSKWRQRRKMLTPAFHFKVLNDFLSVHDYQAKVRFFFQIKPYADSGKEVD 180

 

Query:   180 LFPYIKRLALDVICDTSMGVTIDAQNNHDHQYVESVRLLSEYAFEWILRPWLRLKPLWYL 239

             LFPYIKRLALDVICDTSMGVTIDAQNNHDHQYVE V LLSEYAFEWILRPWLRLKPLWYL

Sbjct:   181 LFPYIKRLALDVICDTSMGVTIDAQNNHDHQYVEWVFLLSEYAFEWILRPWLRLKPLWYL 240

 

Query:   240 TGPGHEYDRHLKIVTDFTKTVIKEKWEEFQKFHVDPVVKTDKRSMAFLDLLLELRNEGLM 299

             TGPGHEYDRHLKIVTDFTKTVIKEKWEEFQKFHVDPVVKTDKRSMAFLDLLLELRNEGLM

Sbjct:   241 TGPGHEYDRHLKIVTDFTKTVIKEKWEEFQKFHVDPVVKTDKRSMAFLDLLLELRNEGLM 300

 

Query:   300 NEDDIRE---EV-----DTFMFEGHDTTSASMGWTLWCLAHNPEFQEKVIQEVDGIFGTS 351

             NEDDIRE   E+     D     GHDTTSASMGWTLWCLAHNPEFQEKVIQEVDGIFGTS

Sbjct:   301 NEDDIREFEGELRFSAADLSQVTGHDTTSASMGWTLWCLAHNPEFQEKVIQEVDGIFGTS 360

 

Query:   352 DRDCTNDDLKQMKYLEKCLKESLRMYPSVPFFGRTVEQDVVINGDFFPKGVRIIVMPLLL 411

             DRDCTNDDLKQMKYLEKCLKESLRMYPSVPFFGRTVEQDVVI+GDFFPKGVRIIVMPLLL

Sbjct:   361 DRDCTNDDLKQMKYLEKCLKESLRMYPSVPFFGRTVEQDVVIDGDFFPKGVRIIVMPLLL 420

 

Query:   412 QRNPLIFDNPNQYNPENFSEDKIGSRHAYSDIPFSAGPRNCIGQKFAMMEEKAVISWFFR 471

             QRNPLIFDNPNQYNPENFSEDKIGSRHAYSDIPFSAGPRNCIGQKFAMMEEKAVISWFFR

Sbjct:   421 QRNPLIFDNPNQYNPENFSEDKIGSRHAYSDIPFSAGPRNCIGQKFAMMEEKAVISWFFR 480

 

Query:   472 KYRVTASQPFGMNKILPELILKSSLG 497

             KYRVTASQPFGMNKILPELILKSS+G

Sbjct:   481 KYRVTASQPFGMNKILPELILKSSVG 506

 

>CYP37B1 F28G4 and T13F3 515 aa

MHFLNIATIFLITIFLFYYKLIYNFIRDRLRIYKYMRKFDGPYSFPIIGTLYMV

NIFDISKFTTQSMELAQYYCQKGCGTIGLWLGPVPLIAVINPQHAKEILESNE

VITKAEEYDILFPWLGTGLLTSTGSKWRQRRKMLTPAFHFKVLNDFLSVHDY

QAKVRFFFQIKPYADSGKEVDLFPYIKRLALDVICDTSMGVTIDAQNNHDHQYVE

WVFLLSEYAFEWILRPWLRLKPLWYLTGPGHEYDRHLKIVTDFTKTVIKEKWEEF

QKFHVDPVVKTDKRSMAFLDLLLELRNEGLMNEDDIREFEGELRFSAAD

LSQVTGHDTTSASMGWTLWCLAHNPEFQEKVIQEVDGIFGTSDRDCTNDDLKQM

KYLEKCLKESLRMYPSVPFFGRTVEQDVVIDGDFFPKGVRIIVMPLLLQRNPLIFD

NPNQYNPENFSEDKIGSRHAYSDIPFSAGPRNCIGQKFAMMEEKAVISWFFRKYR

VTASQPFGMNKILPELILKSSVGISVDSASSD*

 

F28G4.1; CE15919.

MHFLNIATIFLITIFLFYYKLIYNFIRDRLRIYKYMRKFDGPYSFPIIGT  50

LYMVNIFDISKFTTQSMELAQYYCQKGCGTIGLWLGPVPLIAVINPQHAK 100

EILESNEVITKAEEYDILFPWLGTGLLTSTGSKWRQRRKMLTPAFHFKVL 150

NDFLSVHDYQAKVFLEQIKPYADSGKEVDLFPYIKRLALDVICDTSMGVT 200

IDAQNNHDHQYVESVRLLSEYAFEWILRPWLRLKPLWYLTGPGHEYDRHL 250

KIVTDFTKTVIKEKWEEFQKFHVDPVVKTDKRSMAFLDLLLELRNEGLMN 300

EDDIREEVDTFMFEGHDTTSASMGWTLWCLAHNPEFQEKVIQEVDGIFGT 350

SDRDCTNDDLKQMKYLEKCLKESLRMYPSVPFFGRTVEQDVVINGDFFPK 400

GVRIIVMPLLLQRNPLIFDNPNQYNPENFSEDKIGSRHAYSDIPFSAGPR 450

NCIGQKFAMMEEKAVISWFFRKYRVTASQPFGMNKILPELILKSSLGFPL 500

TVHHRTDNK

 

BJ150948

>EMBOSS_001_6

MAFLDLLLELRNEGLMNEDDIREEVDTFMFEGHDTTSASMGWTLWCLAHNPEFQEKVIQE

VDGIFGTSDRDCTNDDLKQMKYLEKCLKESLRMYPSVPFFGRTVEQDVVINGDFFPKGVR

IIVMPLLLQRNPLIFDNPNQYNPENFSEDKIGSRHAYSDIPFSAGPRNCIGQKFAMMEEK

AVISWFFRKYRVTASQPFGMNKILPELILKSSLGFPLTVHHRTDNK*

 

CYP33C7

 

F41B5.2; CE10218. C-term end wrong

MILLLLFTVLIAFLFHEMYWKRRNLPPGPIPLPLIGNLQSMLLEKPGYEC  50

FRRWNKQFGDLYTFWMGNTPYVIVSSYELMKETFIRDGDTYKDKFPQPFN 100

EKFRGGIFGIIETNGHLWNTHRRFALSTLRDFGLGKDLMQEKILIEVEDI 150

FRKYDAQIDKDMEISTILHNAIANVINQTIFGYRFDESNQEEYKKLKHLI 200

EFQENVFTSAKVTVQVFAPKLGKILPGESLEDLMKDWKNSFYDFFNTQIE 250

NHRQKIDFDSEESQDYAEAYLKEQKKYEALGDTELFSNKQLSNTCLDLWF 300

AGLSTTNTTANWTICYVMNTPGVLEKIHDELDKVVGSDRLVTTADKNNLP 350

YMNAVINESQRCTNIVPINLFHATTRDTVINGYPVKKGTGVIAQISTVML 400

DEKVFPEPYKFKPERFIDESGKLIKVDELVPFSIGKRQCPGEGLARMELF 450

LFIANFFNRYQ VHTHLSFKITFF

 

 

>CYP33C7 F41B5.2 494 aa

MILLLLFTVLIAFLFHEMYWKRRNLPPGPIPLPLIGNLQSMLLEKPGYECFRRWNK

QFGDLYTFWMGNTPYVIVSSYELMKETFIRDGDTYKDKFPQPFNEKFRGGIFGIIET

NGHLWNTHRRFALSTLRDFGLGKDLMQEKILIEVEDIFRKYDAQIDKDMEISTILH

NAIANVINQTIFGYRFDESNQEEYKKLKHLIEFQENVFTSAKVTVQVFAPKLGKILP

GESLEDLMKDWKNSFYDFFNTQIENHRQKIDFDSEESQDYAEAYLKEQKKYEALG

DTELFSNKQLSNTCLDLWFAGLSTTNTTANWTICYVMNTPGVLEKIHDELDKVVG

SDRLVTTADKNNLPYMNAVINESQRCTNIVPINLFHATTRDTVINGYPVKKGTGVI

AQISTVMLDEKVFPEPYKFKPERFIDESGKLIKVDELVPFSIGKRQCPGEGLARM

ELFLFIANFFNRYQISPSSEGLPSIDKSERVGVFPRKFNAILKKRHV*

 

compare to cDNA

>gi|30738882|gb|CB397171.1|CB397171  UniGene info OSTR184G11_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 532

 

 Score =  124 bits (312), Expect(2) = 1e-31

 Identities = 58/60 (96%), Positives = 60/60 (100%)

 Frame = -2

 

Query: 11  KVFPEPYKFKPERFIDESGKLIKVDELVPFSIGKRQCPGEGLARMELFLFIANFFNRYQI 70

           +VFPEPYKFKPERFIDESGKLIKVDELVPFSIGKRQCPGEGLARMELFLFIANFFNRYQ+

Sbjct: 213 QVFPEPYKFKPERFIDESGKLIKVDELVPFSIGKRQCPGEGLARMELFLFIANFFNRYQV 34

 

1 aacaaaaagt atcttgaaac taagatgtgt atgtacctga tatctgttaa aaaaattagc

       61 aataaaaagg aaaagctcca tcctggcaag tccttcgcct gggcattgcc gctttccaat

      121 tgagaatggg acaagttcat ctacttttat aagttttcca ctttcatcta taaatctttc

      181 aggcttgaat ttataaggtt ctgggaagac ctgcaatgga aaatttttta aattaaaatt

      241 tttcaagttt tcaatttttg tcttaccttc tcatcaagca taactgtact aatctgggcg

      301 atgactccag ttcccttttt aactggatat ccatttatta ctgtatccct tgtggttgca

      361 tggaataaat tgatcggaac aatattcgtg catctttgag attcattgat cactgcattc

      421 atataaggaa gattattctt atcagcagtt gtcaccaatc tatcactccc aaccacctta

      481 tcaagctcgt cgtgaatttt tcctaaaact cctggagtat tcataacata gc

 

>EMBOSS_001_4

LCYEYSRSFRKNSRRA**GGWE**IGDNC**E*SSLYECSDQ*ISKMHEYCSDQFIPCNH

KGYSNKWISS*KGNWSHRPD*YSYA**EGKTKIENLKNFNLKNFPLQVFPEPYKFKPERF

IDESGKLIKVDELVPFSIGKRQCPGEGLARMELFLFIANFFNRYQVHTHLSFKILFV

 

 

compare to trlated EST

>gi|18292112|dbj|BJ131955.1|BJ131955  UniGene info BJ131955 unpublished oligo-capped cDNA library, C. elegans L1 stage

           Caenorhabditis elegans cDNA clone yk1060e10 3'.

          Length = 794

 

 Score =  179 bits (454), Expect = 1e-46

 Identities = 83/100 (83%), Positives = 94/100 (94%)

 Frame = -3

 

Query: 1   AQISTVMLDEKVFPEPYKFKPERFIDESGKLIKVDELVPFSIGKRQCPGEGLARMELFLF 60

           AQISTVMLDEKVFP+PYKF P+RFIDE GKLIKV++L+PFSIG+RQCPGEGLARME+FLF

Sbjct: 408 AQISTVMLDEKVFPDPYKFNPDRFIDEHGKLIKVEQLIPFSIGRRQCPGEGLARMEIFLF 229

 

Query: 61  IANFFNRYQISPSSEGLPSIDKSERVGVFPRKFNAILKKR 100

           +ANFFN YQISPSSEG PSIDKS+RVGVFP+KF AIL +R

Sbjct: 228 VANFFNHYQISPSSEGFPSIDKSDRVGVFPKKFKAILNRR 109

 

 

CYP33C5

 

>CYP33C5 F41B5.3 494 aa

         Length = 495

 

 Score = 2628 (925.1 bits), Expect = 8.8e-277, P = 8.8e-277

 Identities = 492/494 (99%), Positives = 494/494 (100%)

 

Query:     1 MILLLLSAAVCLFLFHELYWKRRSLPPGPTPLPFMGNTLAMLLEKPGYECFRRWTKQFGG 60

             MILLLLSAAVCLFLFHELYWKRRSLPPGPTPLPFMGNTLAMLLEKPGYECFRRWTKQFGG

Sbjct:     1 MILLLLSAAVCLFLFHELYWKRRSLPPGPTPLPFMGNTLAMLLEKPGYECFRRWTKQFGG 60

 

Query:    61 VYTFWMGNIPYVIIGSYDLLKETFVRDGDTYKDKYPQPFNEKLRGGMYGIVESNGHMWST 120

             VYTFWMGNIPYVIIGSYDLLKETFVRDGDTYKDKYPQPFNEKLRGGMYGIVESNGHMWST

Sbjct:    61 VYTFWMGNIPYVIIGSYDLLKETFVRDGDTYKDKYPQPFNEKLRGGMYGIVESNGHMWST 120

 

Query:   121 HRRFALSTLRDFGLGKDLMQEKILIEVEDIFRKYDAQLGKEQDIQVVLNNAIANVINQTI 180

             HRRFALSTLRDFGLGKDLMQEKILIEVEDIFRKYDAQLGKEQDIQVVLNNAIANVINQTI

Sbjct:   121 HRRFALSTLRDFGLGKDLMQEKILIEVEDIFRKYDAQLGKEQDIQVVLNNAIANVINQTI 180

 

Query:   181 FGYRFDETNQEEFERMRHLVEYQEKQFATVKVYVEAFVPTIGKFLPGKSLDQLLDDWRNS 240

             FGYRFDETNQEEFERMRHLVEYQEKQFATVKVYVEAFVPTIGKFLPGKSLDQLLDDWRNS

Sbjct:   181 FGYRFDETNQEEFERMRHLVEYQEKQFATVKVYVEAFVPTIGKFLPGKSLDQLLDDWRNS 240

 

Query:   241 FYTFFDTQIENHRKKIDFDSEESQDYAEAYLKEQKKQEALGEFELFSNTQFSNTCLDLWL 300

             FYTFFDTQIENHRKKIDFDSEESQDYAEAYLKEQKKQEALGEFELFSNTQFSNTCLDLWL

Sbjct:   241 FYTFFDTQIENHRKKIDFDSEESQDYAEAYLKEQKKQEALGEFELFSNTQFSNTCLDLWL 300

 

Query:   301 AGVSTTNTTVNWTICYVLNHPDVLQKMNEEFDQVVGSDRLVTMGDKNNLPYFNAVLNESQ 360

             AGVSTTNTTVNWTICYVLNHPDVLQKMNEEFDQVVGSDRLVTMGDKNNLPYFNAVLNESQ

Sbjct:   301 AGVSTTNTTVNWTICYVLNHPDVLQKMNEEFDQVVGSDRLVTMGDKNNLPYFNAVLNESQ 360

 

Query:   361 RCANIVPINLFHATTKDTVINGYPVKKGTGVIAQISTVMLDEKIFPEPYKFNPDRFIDEN 420

             RCANIVPINLFHATTKDTVINGYPVKKGTGVIAQISTVMLDEK+FPEPYKFNPDRFIDEN

Sbjct:   361 RCANIVPINLFHATTKDTVINGYPVKKGTGVIAQISTVMLDEKVFPEPYKFNPDRFIDEN 420

 

Query:   421 GKPIKIEQLIPFSIGKRQCPGEGLARMEIFLFLANFFNRYKISPSSKVFPNLDKKDNVGV 480

             GKPIKIEQLIPFSIGKRQCPGEGLARMEIFLFLANFFNRY+ISPSSKVFPNLDKKDNVGV

Sbjct:   421 GKPIKIEQLIPFSIGKRQCPGEGLARMEIFLFLANFFNRYEISPSSKVFPNLDKKDNVGV 480

 

Query:   481 FPKDLHAILNRRNC 494

             FPKDLHAILNRRNC

Sbjct:   481 FPKDLHAILNRRNC 494

 

compare to EST change my seq to match EST

 

>gi|18319978|dbj|BJ151993.1|BJ151993   BJ151993 unpublished oligo-capped cDNA library, C. elegans L1 stage

           Caenorhabditis elegans cDNA clone yk1302a10 3'.

          Length = 597

 

 Score =  327 bits (839), Expect = 7e-91

 Identities = 158/161 (98%), Positives = 160/161 (99%)

 Frame = -1

 

Query: 1   VVGSDRLVTMGDKNNLPYFNAVLNESQRCANIVPINLFHATTKDTVINGYPVKKGTGVIA 60

           VVGSDRLVTMGDKNNLPYFNAVLNESQRCANIVPINLFHATTKDTVINGYPVKKGTGVIA

Sbjct: 495 VVGSDRLVTMGDKNNLPYFNAVLNESQRCANIVPINLFHATTKDTVINGYPVKKGTGVIA 316

 

Query: 61  QISTVMLDEKVFPEPYKFNPDRFIDENGKPIKIEQLIPFSIGKRQCPGEGLARMEIFLFL 120

           QISTVM DEK+FPEPYKFNPDRFIDENGKPIKIEQLIPFSIGKRQCPGEGLARMEIFLFL

Sbjct: 315 QISTVMXDEKIFPEPYKFNPDRFIDENGKPIKIEQLIPFSIGKRQCPGEGLARMEIFLFL 136

 

Query: 121 ANFFNRYEISPSSKVFPNLDKKDNVGVFPKDLHAILNRRNC 161

           ANFFNRY+ISPSSKVFPNLDKKDNVGVFPKDLHAILNRRNC

Sbjct: 135 ANFFNRYKISPSSKVFPNLDKKDNVGVFPKDLHAILNRRNC 13

 

 

CYP33C6

 

>CYP33C6 F41B5.7 496 aa

         Length = 495

 

 Score = 2438 (858.2 bits), Expect = 1.2e-256, P = 1.2e-256

 Identities = 459/462 (99%), Positives = 462/462 (100%)

 

Query:     1 MIILLLSAAVCVFLFHELFWKRRHLPPGPTPLPFMGNILAVILEKPGYECFRRWTKQFGD 60

             MIILLLSAAVCVFLFHELFWKRRHLPPGPTPLPFMGNILAVILEKPGYECFRRWTKQFGD

Sbjct:     1 MIILLLSAAVCVFLFHELFWKRRHLPPGPTPLPFMGNILAVILEKPGYECFRRWTKQFGD 60

 

Query:    61 VYTFWMGNTPYIIIGTYDLLKETYIRDGDTYKDKYPHPFNEQLRGGMYGIVESNGHLWST 120

             VYTFWMGNTPYIIIGTYDLLKETYIRDGDTYKDKYPHPFNEQLRGGMYGIVESNGHLWST

Sbjct:    61 VYTFWMGNTPYIIIGTYDLLKETYIRDGDTYKDKYPHPFNEQLRGGMYGIVESNGHLWST 120

 

Query:   121 HRKFALSTLRNFGLGKDLMQENILIEVEDIFRKFDAQLGKEQEIQVVLKNAIANVINQII 180

             HRKFALSTLRNFGLGKDLMQENILIEVEDIFRKFDAQLGKEQEIQVVLKNAIANVINQII

Sbjct:   121 HRKFALSTLRNFGLGKDLMQENILIEVEDIFRKFDAQLGKEQEIQVVLKNAIANVINQII 180

 

Query:   181 FGYRFDATNQDKLEKMRYLVEYQEKAFTTVKASIQAFAPRIGSILPGKSLDQLLNEFKDN 240

             FGYRFDATNQDKLEKMRYLVEYQEKAFTTVKASIQAFAPRIGSILPGKSLDQLLNEFKDN

Sbjct:   181 FGYRFDATNQDKLEKMRYLVEYQEKAFTTVKASIQAFAPRIGSILPGKSLDQLLNEFKDN 240

 

Query:   241 FYTFFNAEIENHRRKIDVDSEESQDYAEAYLKEQRKHEARGEFELFSNTQFANTCLDLWL 300

             FYTFFNAEIENHRRKIDVDSEESQDYAEAYLKEQRKHEARGEFELFSNTQFANTCLDLWL

Sbjct:   241 FYTFFNAEIENHRRKIDVDSEESQDYAEAYLKEQRKHEARGEFELFSNTQFANTCLDLWL 300

 

Query:   301 AGVSTTNTTVNWTICYVLNHPEVLQKMNEEFDQVVGSDRLVSMGDKNNLPYFNAVLNESQ 360

             AGVSTTNTTVNWTICYVLNHPEVLQKMNEEFDQVVGSDRLVSMGDKNNLPYFNAVLNESQ

Sbjct:   301 AGVSTTNTTVNWTICYVLNHPEVLQKMNEEFDQVVGSDRLVSMGDKNNLPYFNAVLNESQ 360

 

Query:   361 RCANIVTINLFHATTKDTVINGYPIKKGTGVIAQISTVMLDEKIFPEPYKFNPDRFIGEN 420

             RCANIVTINLFHATTKDTVINGYPIKKGTGVIAQISTVMLDEK+FPEPYKFNPDRFIGEN

Sbjct:   361 RCANIVTINLFHATTKDTVINGYPIKKGTGVIAQISTVMLDEKVFPEPYKFNPDRFIGEN 420

 

Query:   421 GKPIKIEQLIPFSIGKRQCPGEGLARMEIFLLVANLFNRYKV 462

             GKPIKIEQLIPFSIGKRQCPGEGLARMEIFLLVANLFNRY++

Sbjct:   421 GKPIKIEQLIPFSIGKRQCPGEGLARMEIFLLVANLFNRYQI 462

 

END IS WRONG

 

compare to 33C5

>CYP33C5 F41B5.3 494 aa

         Length = 495

 

 Score = 421 (148.2 bits), Expect = 6.5e-43, P = 6.5e-43

 Identities = 80/102 (78%), Positives = 88/102 (86%)

 

Query:     1 QISTVMLDEKIFPEPYKFNPDRFIGENGKPIKIEQLIPFSIGKRQCPGEGLARMEIFLLV 60

             QISTVMLDEK+FPEPYKFNPDRFI ENGKPIKIEQLIPFSIGKRQCPGEGLARMEIFL +

Sbjct:   394 QISTVMLDEKVFPEPYKFNPDRFIDENGKPIKIEQLIPFSIGKRQCPGEGLARMEIFLFL 453

 

Query:    61 ANLFNRYQISPFSEGFPNLEKNSVVEIFAKGLRVILTKRHC* 102

             AN FNRY+ISP S+ FPNL+K   V +F K L  IL +R+C*

Sbjct:   454 ANFFNRYEISPSSKVFPNLDKKDNVGVFPKDLHAILNRRNC* 495

 

>gi|30738878|gb|CB397167.1|CB397167   OSTR184F6_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 561

 

 Score =  253 bits (646), Expect = 2e-68

 Identities = 122/125 (97%), Positives = 125/125 (100%)

 Frame = -3

 

Query: 1   DRLVSMGDKNNLPYFNAVLNESQRCANIVTINLFHATTKDTVINGYPIKKGTGVIAQIST 60

           DRLVSMGDKNNLPYFNAVLNESQRCANIVTINLFHATTKDTVINGYPIKKGTGVIAQIST

Sbjct: 379 DRLVSMGDKNNLPYFNAVLNESQRCANIVTINLFHATTKDTVINGYPIKKGTGVIAQIST 200

 

Query: 61  VMLDEKVFPEPYKFNPDRFIGENGKPIKIEQLIPFSIGKRQCPGEGLARMEIFLLVANLF 120

           VMLDEK+FPEPYKFNPDRFIGENGKPIKIEQLIPFSIGKRQCPGEGLARMEIFLLVANLF

Sbjct: 199 VMLDEKIFPEPYKFNPDRFIGENGKPIKIEQLIPFSIGKRQCPGEGLARMEIFLLVANLF 20

 

Query: 121 NRYQI 125

           NRY++

Sbjct: 19  NRYKV 5

 

 

 

CYP25A5

 

>CYP25A5 F42A6 495 aa

         Length = 496

 

Score = 203 (71.5 bits), Expect = 2.0e-251, Sum P(2) = 2.0e-251

 Identities = 40/40 (100%), Positives = 40/40 (100%)

 

Query:     1 MAILIISTIFFTIITFISYIIWRRHSIFKLRSSIGIPGPP 40

             MAILIISTIFFTIITFISYIIWRRHSIFKLRSSIGIPGPP

Sbjct:     1 MAILIISTIFFTIITFISYIIWRRHSIFKLRSSIGIPGPP 40

 

this seq is missing the KYG motif

 

 Score = 2209 (777.6 bits), Expect = 2.0e-251, Sum P(2) = 2.0e-251

 Identities = 419/424 (98%), Positives = 421/424 (99%)

 

Query:    54 FSLYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPNIFGMNQLNQSLLQNTYATGWKHT 113

             F LYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPNIFGMN+LNQSLLQNTYATGWKHT

Sbjct:    72 FGLYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPNIFGMNRLNQSLLQNTYATGWKHT 131

 

Query:   114 RSAVAPIFSTGKMKAMHETLVSKIDIFLEVLKEKSSSGQKWDIFDNFQSLSLDIIGKCAF 173

             RSAVAPIFSTGKMKAMHETLVSKIDIFLEVLKEKSSSGQKWDIF NFQSLSLDIIGKCAF

Sbjct:   132 RSAVAPIFSTGKMKAMHETLVSKIDIFLEVLKEKSSSGQKWDIFSNFQSLSLDIIGKCAF 191

 

Query:   174 AIDSNCQRDRIDLFYVQARKFVGAVALKKSWILPVSLILPELSWLWRFLYKFSDLSAAEL 233

             AIDSNCQRDRIDLFYVQARKFVGAVALKKSWILPVS+ILPELSWLWRFLYKFSDLSAAEL

Sbjct:   192 AIDSNCQRDRIDLFYVQARKFVGAVALKKSWILPVSVILPELSWLWRFLYKFSDLSAAEL 251

 

Query:   234 PLVKGLVDLYDRRRAGEGGNDSTDLLNLLIRRETIGKMTKREVIENCFAFLIAGYETTST 293

             PLVKGLVDLYDRRRAGEGGNDSTDLLNLLIRRETIGKMTKREVIENCFAFLIAGYETTST

Sbjct:   252 PLVKGLVDLYDRRRAGEGGNDSTDLLNLLIRRETIGKMTKREVIENCFAFLIAGYETTST 311

 

Query:   294 AMMFSAYLLAEYPIVQQKLYEEIKKTKESAGLNYDSIHNMKYSDCVYKESLRFYPPTTHF 353

             AMMFSAYLLAEYPIVQQKLYEEIKKTKESAGLNYDSIHNMKYSDCVYKESLRFYPPTT F

Sbjct:   312 AMMFSAYLLAEYPIVQQKLYEEIKKTKESAGLNYDSIHNMKYSDCVYKESLRFYPPTTEF 371

 

Query:   354 TNRVCLNDMTIQGQIYPKDSTLKVQPHTIHRNPANWESPDEFQPERFENWEEKSSSLKWI 413

             TNRVCLNDMTIQGQIYPKDSTLKVQPHTIHRNPANWESPDEFQPERFENWEEKSSSLKWI

Sbjct:   372 TNRVCLNDMTIQGQIYPKDSTLKVQPHTIHRNPANWESPDEFQPERFENWEEKSSSLKWI 431

 

Query:   414 PFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELSLVPGDPPMIPETNGVIFRPRCPVRLNL 473

             PFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELSLVPGDPPMIPETNGVIFRPRCPVRLNL

Sbjct:   432 PFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELSLVPGDPPMIPETNGVIFRPRCPVRLNL 491

 

Query:   474 KLRI 477

             KLRI

Sbjct:   492 KLRI 495

 

compare to

>CYP25A4 C36A4.6    Z66495 495 aa

         Length = 496

 

 Score = 543 (191.1 bits), Expect = 7.7e-56, P = 7.7e-56

 Identities = 102/110 (92%), Positives = 106/110 (96%)

 

Query:     1 MAILIISTIFFTIITFISYIIWRRHSIFKLRSSIGIPGPPVNWLXGNLNIIKEWGDTTQW 60

             MAILIISTIFFTIITFISY IWRRH+IFKLRSSIGIPGPPV+WL GNLNIIK+  DTTQW

Sbjct:     1 MAILIISTIFFTIITFISYSIWRRHAIFKLRSSIGIPGPPVHWLWGNLNIIKDRVDTTQW 60

 

Query:    61 HPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPNIF 110

             HPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTP+IF

Sbjct:    61 HPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPDIF 110

 

The X is a stop codon in AF038613.2 so this might be a pseudogene or seq .error

>gi|14578220|gb|AF038613.2|  Download subject sequence spanning the HSP Caenorhabditis elegans cosmid F42A6, complete sequence

          Length = 38222

 

 Score =  113 bits (283), Expect(2) = 2e-34

 Identities = 54/57 (94%), Positives = 56/57 (98%)

 Frame = -3

 

Query: 1     MAILIISTIFFTIITFISYIIWRRHSIFKLRSSIGIPGPPVNWLXGNLNIIKEWGDT 57

             MAILIISTIFFTIITFISYIIWRRHSIFKLRSSIGIPGPPVNWL GNLNIIKEWG++

Sbjct: 10272 MAILIISTIFFTIITFISYIIWRRHSIFKLRSSIGIPGPPVNWL*GNLNIIKEWGNS 10102

 

 

CYP33C4

 

>CYP33C4 F44C8 493 aa

         Length = 494

 

 Score = 2619 (921.9 bits), Expect = 7.9e-276, P = 7.9e-276

 Identities = 492/493 (99%), Positives = 493/493 (100%)

 

Query:     1 MIILLVFTAFLAFLFHELYWKRRNLPPGPIPLPFMGNILTMLLNKPGYECFRNCTKKYGD 60

             MIILLVFTAFLAFLFHELYWKRRNLPPGPIPLPFMGNILTMLLNKPGYECFRNCTKKYGD

Sbjct:     1 MIILLVFTAFLAFLFHELYWKRRNLPPGPIPLPFMGNILTMLLNKPGYECFRNCTKKYGD 60

 

Query:    61 VYTFWMGKTPFVMISSYDLLKDTFVRDGDTYKDKYPQPLNQKIRGGNYGIVESNGHLWST 120

             VYTFWMGKTPFVMISSYDLLKDTFVRDGDTYKDKYPQPLNQKIRGGNYGIVESNGHLWST

Sbjct:    61 VYTFWMGKTPFVMISSYDLLKDTFVRDGDTYKDKYPQPLNQKIRGGNYGIVESNGHLWST 120

 

Query:   121 HRRFALTTLRDFGLGKDLMQEKILIEVEDIFKKFDAQLGLEQDVSVVMNNAIANVINQNI 180

             HRRFALTTLRDFGLGKDLMQEKILIEVEDIFKKFDAQLGLEQDVSVVMNNAIANVINQNI

Sbjct:   121 HRRFALTTLRDFGLGKDLMQEKILIEVEDIFKKFDAQLGLEQDVSVVMNNAIANVINQNI 180

 

Query:   181 FGYRFEGDKEEDFKKLRELMEYQETAFATFKVYVEAFIPKVGALLPGRSLNELLEEWRDN 240

             FGYRFEGDKEEDFKKLRELMEYQETAFATFKVYVEAFIPKVGALLPGRSLNELLEEWRDN

Sbjct:   181 FGYRFEGDKEEDFKKLRELMEYQETAFATFKVYVEAFIPKVGALLPGRSLNELLEEWRDN 240

 

Query:   241 FYTFFDTQIENHRKKIDFDSQESLDYAEAYLKEQRKQEALGEFELFSNKQLSNTCLDLWL 300

             FYTFFDTQIENHRKKIDFDSQESLDYAEAYLKEQRKQEALGEFELFSNKQLSNTCLDLWL

Sbjct:   241 FYTFFDTQIENHRKKIDFDSQESLDYAEAYLKEQRKQEALGEFELFSNKQLSNTCLDLWL 300

 

Query:   301 AGLSTTNTTVNWTICYVLNHPDVLQKMNEEFDQVVGSDRLVTMGDKNNLPYFNAVLNESQ 360

             AGLSTTNTTVNWTICYVLNHPDVLQKMNEEFDQVVGSDRLVTMGDKNNLPYFNAVLNESQ

Sbjct:   301 AGLSTTNTTVNWTICYVLNHPDVLQKMNEEFDQVVGSDRLVTMGDKNNLPYFNAVLNESQ 360

 

Query:   361 RCANIVPINLFHATTKDTVINGYPVKKGTGVIAQISTVMLDEKIFPDPYKFNPDRFIDEN 420

             RCANIVPINLFHATTKDTVINGYPVKKGTGVIAQISTVMLDEK+FPDPYKFNPDRFIDEN

Sbjct:   361 RCANIVPINLFHATTKDTVINGYPVKKGTGVIAQISTVMLDEKVFPDPYKFNPDRFIDEN 420

 

Query:   421 GKPIKIEQLIPFSIGKRQCPGEGLARMEMFLFLANFFNRYKISPSSKGFPNLDKKDNVGV 480

             GKPIKIEQLIPFSIGKRQCPGEGLARMEMFLFLANFFNRYKISPSSKGFPNLDKKDNVGV

Sbjct:   421 GKPIKIEQLIPFSIGKRQCPGEGLARMEMFLFLANFFNRYKISPSSKGFPNLDKKDNVGV 480

 

Query:   481 FPKDLHAILNKRN 493

             FPKDLHAILNKRN

Sbjct:   481 FPKDLHAILNKRN 493

 

 

CYP33D1

 

>CYP33D1 K05D4.4 485 aa

         Length = 485

 

 Score = 2257 (794.5 bits), Expect = 1.8e-237, P = 1.8e-237

 Identities = 442/492 (89%), Positives = 450/492 (91%)

 

Query:     1 MLLLLISSLIFLFFAYHFSWKRRNYPPGPTPLPLVGNLLQLQKFGYDIFHKWKKEFGPVF 60

             MLLLLISSLIFLFFAYHFSWKRRNYPPGPTPLPLVGNLLQLQKFGYDIFHKWKKEFGPVF

Sbjct:     1 MLLLLISSLIFLFFAYHFSWKRRNYPPGPTPLPLVGNLLQLQKFGYDIFHKWKKEFGPVF 60

 

Query:    61 TFWLADRPFIFITSYEVMKETFVKDGDTFADKQLNQIDKKKLQRNYGVLDTNGEMWREHR 120

             TFWL  RPFIFITSYEVMKETFVKDGDTFADKQLNQIDKKKLQRNYGVLDTNGEMWREHR

Sbjct:    61 TFWLGKRPFIFITSYEVMKETFVKDGDTFADKQLNQIDKKKLQRNYGVLDTNGEMWREHR 120

 

Query:   121 RFTLSQLRDLGLGKDLMQEKILLEIEEQFKDINSHLGEEIDLPSVLDRGVGNVINLTLFN 180

             RFTLSQLRDLGLGKDLMQEK+   +      ++  +    DL          V+NL+  

Sbjct:   121 RFTLSQLRDLGLGKDLMQEKVFW-VGTCRPTLHCSIK---DLKRYFFPDP--VLNLSQIF 174

 

Query:   181 KRFETNQRDEFTYLKSLIDGLRDVASEFRYFIQFLVPWSSKIIPGPTLGDKTKGMKEELD 240

             K F+ NQRDEFTYLKSLIDGLRDVASEFRYFIQFLVPWSSKIIPGPTLGDKTKGMKEELD

Sbjct:   175 K-FQ-NQRDEFTYLKSLIDGLRDVASEFRYFIQFLVPWSSKIIPGPTLGDKTKGMKEELD 232

 

Query:   241 VFFVKQVEEHRKEIDFDTVESYDYVEAYLKEQKKREADGDHETFCNKQLYAMCFDLWMAG 300

             VFFVKQVEEHRKEIDFDTVESYDYVEAYLKEQKK         F NKQLYAMCFDLWMAG

Sbjct:   233 VFFVKQVEEHRKEIDFDTVESYDYVEAYLKEQKKFRGLLKCFLFSNKQLYAMCFDLWMAG 292

 

Query:   301 LQTTTVTLTWGFSFYLHNADVQLKIREELDRVIGNDRLITTADKNCLPYLTAFINETQRC 360

             LQTTTVTLTWGFSFYLHNADVQLKIREELDRVIGNDRLITTADKNCLPYLTAFINETQRC

Sbjct:   293 LQTTTVTLTWGFSFYLHNADVQLKIREELDRVIGNDRLITTADKNCLPYLTAFINETQRC 352

 

Query:   361 ANIIPFNLLHVATRDTVIEGYPVKKGTGVIAQIGTVMSDEQIFPDAHCFNPGRFIENGKL 420

             ANIIPFNLLHVATRDTVIEGYPVKKGTGVIAQIGTVMSDEQIFPDAHCFNPGRFIENGKL

Sbjct:   353 ANIIPFNLLHVATRDTVIEGYPVKKGTGVIAQIGTVMSDEQIFPDAHCFNPGRFIENGKL 412

 

Query:   421 KKVDEVIPFSIGKRQCLGEGLARMELFLFFANIFNRYDVQLDFSGNLPDLDKSKDNFVTP 480

             KKVDEVIPFSIGKRQCLGEGLARMELFLFFANIFNRYDVQLDFSGNLPDLDKSKDNFVTP

Sbjct:   413 KKVDEVIPFSIGKRQCLGEGLARMELFLFFANIFNRYDVQLDFSGNLPDLDKSKDNFVTP 472

 

Query:   481 RKFNAVLNKRYV 492

             RKFNAVLNKRYV

Sbjct:   473 RKFNAVLNKRYV 484

 

CYP13B2

 

>CYP13B2 K06G5   Z81565 511 aa

         Length = 512

 

 Score = 2634 (927.2 bits), Expect = 2.0e-277, P = 2.0e-277

 Identities = 494/495 (99%), Positives = 494/495 (99%)

 

Query:     5 YFKWIHSYWKRRGIEGPVGLPFIGSFYDLADREKPRGFIINKWTKMFGKVFGYYEGVIPV 64

             YFKWIHSYWKRRGIEGPVGLPFIGSFYDLADREKPRGFIINKWTK FGKVFGYYEGVIPV

Sbjct:    17 YFKWIHSYWKRRGIEGPVGLPFIGSFYDLADREKPRGFIINKWTKXFGKVFGYYEGVIPV 76

 

Query:    65 LVVSDLDMLQEMFIKKFDCFYARKTTNLIHGNLECSQEEPRVNLFAARGARWKRLRALAS 124

             LVVSDLDMLQEMFIKKFDCFYARKTTNLIHGNLECSQEEPRVNLFAARGARWKRLRALAS

Sbjct:    77 LVVSDLDMLQEMFIKKFDCFYARKTTNLIHGNLECSQEEPRVNLFAARGARWKRLRALAS 136

 

Query:   125 PAFSVKALKQIHETMEDSVFSMVDHMSKQVNGEAFNIHEYYQEFTYDVISRLAMGQTYSE 184

             PAFSVKALKQIHETMEDSVFSMVDHMSKQVNGEAFNIHEYYQEFTYDVISRLAMGQTYSE

Sbjct:   137 PAFSVKALKQIHETMEDSVFSMVDHMSKQVNGEAFNIHEYYQEFTYDVISRLAMGQTYSE 196

 

Query:   185 QFNNEGVDIVKKIFLRKNRVYPWYLAVMFPGFENTIKNTFFNHEAVRGGDVGKLLKFCET 244

             QFNNEGVDIVKKIFLRKNRVYPWYLAVMFPGFENTIKNTFFNHEAVRGGDVGKLLKFCET

Sbjct:   197 QFNNEGVDIVKKIFLRKNRVYPWYLAVMFPGFENTIKNTFFNHEAVRGGDVGKLLKFCET 256

 

Query:   245 AVHDRLKERADNVEQGIENPHNDFIDMFLDYYTDTNIEDNAFGIKVEKKVTSEDVIGACF 304

             AVHDRLKERADNVEQGIENPHNDFIDMFLDYYTDTNIEDNAFGIKVEKKVTSEDVIGACF

Sbjct:   257 AVHDRLKERADNVEQGIENPHNDFIDMFLDYYTDTNIEDNAFGIKVEKKVTSEDVIGACF 316

 

Query:   305 VFLLAGFDTTANTLAYASYLLAKHPQEMRKVQDEIDRICTSEHISYDDIGKLRYMDAVIR 364

             VFLLAGFDTTANTLAYASYLLAKHPQEMRKVQDEIDRICTSEHISYDDIGKLRYMDAVIR

Sbjct:   317 VFLLAGFDTTANTLAYASYLLAKHPQEMRKVQDEIDRICTSEHISYDDIGKLRYMDAVIR 376

 

Query:   365 EALRMYPVAWFACSRECVQATTLGNYYIEKGVRIEADVRALHYSEEIWGKNANEFVPERW 424

             EALRMYPVAWFACSRECVQATTLGNYYIEKGVRIEADVRALHYSEEIWGKNANEFVPERW

Sbjct:   377 EALRMYPVAWFACSRECVQATTLGNYYIEKGVRIEADVRALHYSEEIWGKNANEFVPERW 436

 

Query:   425 LESSPRHNMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRFSILAGPKTEKELHLQGCT 484

             LESSPRHNMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRFSILAGPKTEKELHLQGCT

Sbjct:   437 LESSPRHNMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRFSILAGPKTEKELHLQGCT 496

 

Query:   485 TTSPEKVTVHLMTRD 499

             TTSPEKVTVHLMTRD

Sbjct:   497 TTSPEKVTVHLMTRD 511

 

CYP35B1

 

>CYP35B1 K07C6.4 499 aa

         Length = 499

 

Score = 810 (285.1 bits), Expect = 3.3e-271, Sum P(2) = 3.3e-271

 Identities = 153/153 (100%), Positives = 153/153 (100%)

 

Query:     1 MIFVLLLTTFLAVLIIRQYQKARKLPPGPVSLPLIGNIHQLVYHIWNEKGVVPAFDLFRK 60

             MIFVLLLTTFLAVLIIRQYQKARKLPPGPVSLPLIGNIHQLVYHIWNEKGVVPAFDLFRK

Sbjct:     1 MIFVLLLTTFLAVLIIRQYQKARKLPPGPVSLPLIGNIHQLVYHIWNEKGVVPAFDLFRK 60

 

Query:    61 KYGDVFTIWLGPIPHVSICNYETSQEVFVKNGNKYKDRFLPPVYLHVSNNLGLFTANGPV 120

             KYGDVFTIWLGPIPHVSICNYETSQEVFVKNGNKYKDRFLPPVYLHVSNNLGLFTANGPV

Sbjct:    61 KYGDVFTIWLGPIPHVSICNYETSQEVFVKNGNKYKDRFLPPVYLHVSNNLGLFTANGPV 120

 

Query:   121 CAEMRKFTLLAFRNMGVGRDLMEQRILDELNTR 153

             CAEMRKFTLLAFRNMGVGRDLMEQRILDELNTR

Sbjct:   121 CAEMRKFTLLAFRNMGVGRDLMEQRILDELNTR 153

 

 

 Score = 1789 (629.8 bits), Expect = 3.3e-271, Sum P(2) = 3.3e-271

 Identities = 346/346 (100%), Positives = 346/346 (100%)

 

Query:   203 CAEIDADAVNGKTIVHTTEFFDLTVGSVINSLLVGKRFEAEDKDDFLEIKRLMAEAADLF 262

             CAEIDADAVNGKTIVHTTEFFDLTVGSVINSLLVGKRFEAEDKDDFLEIKRLMAEAADLF

Sbjct:   154 CAEIDADAVNGKTIVHTTEFFDLTVGSVINSLLVGKRFEAEDKDDFLEIKRLMAEAADLF 213

 

Query:   263 TMFDLVVPVWFLKSFFPSRFERVVAIFEKALNYLSREALARYEKLKTGEYSVNAEDPQDF 322

             TMFDLVVPVWFLKSFFPSRFERVVAIFEKALNYLSREALARYEKLKTGEYSVNAEDPQDF

Sbjct:   214 TMFDLVVPVWFLKSFFPSRFERVVAIFEKALNYLSREALARYEKLKTGEYSVNAEDPQDF 273

 

Query:   323 VEAYLAKMEQERLRDSDTPYTMECLKFVIGDLWLAGQDTTSTTLVAGFNHLVKNPEIVRK 382

             VEAYLAKMEQERLRDSDTPYTMECLKFVIGDLWLAGQDTTSTTLVAGFNHLVKNPEIVRK

Sbjct:   274 VEAYLAKMEQERLRDSDTPYTMECLKFVIGDLWLAGQDTTSTTLVAGFNHLVKNPEIVRK 333

 

Query:   383 CREEILRITENGSRPLQLKDRAESHYLNATIAEIQRHASILNVNFWRLVHEPTTVQGHPV 442

             CREEILRITENGSRPLQLKDRAESHYLNATIAEIQRHASILNVNFWRLVHEPTTVQGHPV

Sbjct:   334 CREEILRITENGSRPLQLKDRAESHYLNATIAEIQRHASILNVNFWRLVHEPTTVQGHPV 393

 

Query:   443 DSGSVICAQLGALHVNDDLFKNPDKFYPERFIENEQLLSQIIPFGIGKRSCVGENIAKSE 502

             DSGSVICAQLGALHVNDDLFKNPDKFYPERFIENEQLLSQIIPFGIGKRSCVGENIAKSE

Sbjct:   394 DSGSVICAQLGALHVNDDLFKNPDKFYPERFIENEQLLSQIIPFGIGKRSCVGENIAKSE 453

 

Query:   503 LYLMIGNLILRYDIKPHGSEPSTEDKLPYSAGKTPDKSVKLEFVKL 548

             LYLMIGNLILRYDIKPHGSEPSTEDKLPYSAGKTPDKSVKLEFVKL

Sbjct:   454 LYLMIGNLILRYDIKPHGSEPSTEDKLPYSAGKTPDKSVKLEFVKL 499

 

K07C6.4; CE17175. extra seq remove it

MIFVLLLTTFLAVLIIRQYQKARKLPPGPVSLPLIGNIHQLVYHIWNEKG  50

VVPAFDLFRKKYGDVFTIWLGPIPHVSICNYETSQEVFVKNGNKYKDRFL 100

PPVYLHVSNNLGLFTANGPVCAEMRKFTLLAFRNMGVGRDLMEQRILDEL 150

NTR YLWGFELFLVQKKFCRTFRRIFKYREKFLNSQFPVQNFEKSVSSFRR 200

FS CAEIDADAVNGKTIVHTTEFFDLTVGSVINSLLVGKRFEAEDKDDFLE 250

IKRLMAEAADLFTMFDLVVPVWFLKSFFPSRFERVVAIFEKALNYLSREA 300

LARYEKLKTGEYSVNAEDPQDFVEAYLAKMEQERLRDSDTPYTMECLKFV 350

IGDLWLAGQDTTSTTLVAGFNHLVKNPEIVRKCREEILRITENGSRPLQL 400

KDRAESHYLNATIAEIQRHASILNVNFWRLVHEPTTVQGHPVDSGSVICA 450

QLGALHVNDDLFKNPDKFYPERFIENEQLLSQIIPFGIGKRSCVGENIAK 500

SELYLMIGNLILRYDIKPHGSEPSTEDKLPYSAGKTPDKSVKLEFVKL   548

 

compare to related cDNA

>gi|30742706|gb|CB400979.1|CB400979  UniGene info OSTF185G7_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 603

 

 Score =  140 bits (352), Expect = 2e-34

 Identities = 80/149 (53%), Positives = 89/149 (59%)

 Frame = +3

 

Query: 1   PPVYLHVSNNLGLFTANGPVCAEMRKFTLLAFRNMGVGRDLMEQRILDELNTRYLWGFEL 60

           PP+++HVS NLGL TANG   AEMR FTLLAFRNMGVG+DLME+RILDELN R      

Sbjct: 300 PPIFIHVSENLGLLTANGDSWAEMRGFTLLAFRNMGVGKDLMEKRILDELNAR------- 458

 

Query: 61  FLVQKKFCRTFRRIFKYREKFLNSQFPVQNFEKSVSSFRRFSCAEIDADAVNGKTIVHTT 120

                                                     CAEIDADAVNGKTIV  +

Sbjct: 459 ------------------------------------------CAEIDADAVNGKTIVKIS 512

 

Query: 121 EFFDLTVGSVINSLLVGKRFEAEDKDDFL 149

           +FFDLTVGSVINS LVGKRFEA +KD+FL

Sbjct: 513 DFFDLTVGSVINSTLVGKRFEAHNKDEFL 599

 

 

CYP14A2

 

>CYP14A2 K09A11.3    Z50742 491 aa

         Length = 492

 

 Score = 2579 (907.9 bits), Expect = 1.4e-271, P = 1.4e-271

 Identities = 488/492 (99%), Positives = 489/492 (99%)

 

Query:     1 MSILIVAFLVSITVYIAYFYWKVSKYPKGPFPLPFIGNILQIPSENIQEYLDDLSKTYGP 60

             MSILIVAF VSITVYIAYFYWKVSKYPKGPFPLPFIGNILQIPSENIQEYLDDLSKTYGP

Sbjct:     1 MSILIVAF-VSITVYIAYFYWKVSKYPKGPFPLPFIGNILQIPSENIQEYLDDLSKTYGP 59

 

Query:    61 CFTLWTPLPAIVLTDYEHVKEAFVTQGDAFIYRADRPPETLLQPHLNTGVLFSNGDNWRF 120

             CFTLWTPLPAIVLTDYEHVKEAFVTQGDAFIYRADRPPETLLQPHLNTGVLFSNGDNWRF

Sbjct:    60 CFTLWTPLPAIVLTDYEHVKEAFVTQGDAFIYRADRPPETLLQPHLNTGVLFSNGDNWRF 119

 

Query:   121 QRRTALKILRDFGLGRNLMEEQVMRSVHEMLAQLERIADKKNVDMFWPIQLCVGNVINES 180

             QRRTALKILRDFGLGRNLMEEQVMRSVHEMLAQLERIADKKNVDMFWPIQLCVGNVINES

Sbjct:   120 QRRTALKILRDFGLGRNLMEEQVMRSVHEMLAQLERIADKKNVDMFWPIQLCVGNVINES 179

 

Query:   181 LFSYHYKYEDSKKFETFVKVLDKHLKTVQGKTIFLMSAFPWLKHFPVIGELGYHRIKKNI 240

             LFSYHYKYEDSKKFETFVKVLDKHLKTVQGKTIFLMSAFPWLKHFPVIGELGYHRIKKNI

Sbjct:   180 LFSYHYKYEDSKKFETFVKVLDKHLKTVQGKTIFLMSAFPWLKHFPVIGELGYHRIKKNI 239

 

Query:   241 QSYQEFINDEVTSQIKHYDGESEPENFVHAYMQQMKQSGNPNLDMNNLCASVIDFWLAGM 300

             QSYQEFINDEVTSQIKHYDGESEPENFVHAYMQQMKQSGNPNLDMNNLCASV+DFWLAGM

Sbjct:   240 QSYQEFINDEVTSQIKHYDGESEPENFVHAYMQQMKQSGNPNLDMNNLCASVLDFWLAGM 299

 

Query:   301 ETTSNSLRWHLAFMMKYPEIQDKVRKEILDNVGTARLPSMSDKPNMPYTQAVIHEVQRCS 360

             ETTSNSLRWHLA MMKYPEIQDKVRKEI DNVGTARLPSMSDKPNMPYTQAVIHEVQRCS

Sbjct:   300 ETTSNSLRWHLAAMMKYPEIQDKVRKEIFDNVGTARLPSMSDKPNMPYTQAVIHEVQRCS 359

 

Query:   361 NMIPILGTHTNRDDILLKGKKIPTGTLVFAQIWSVLKNDPVFEESSKFNPDRYLMADGKT 420

             NMIPILGTHTNRDDILLKGKKIPTGTLVFAQIWSVLKNDPVFEESSKFNPDRYLMADGKT

Sbjct:   360 NMIPILGTHTNRDDILLKGKKIPTGTLVFAQIWSVLKNDPVFEESSKFNPDRYLMADGKT 419

 

Query:   421 INKSVLERTIPFSVGKRNCVGEGLARMELFLIFSALIQKYEFIPKTNIDVKPVCGAVLTT 480

             INKSVLERTIPFSVGKRNCVGEGLARMELFLIFSALIQKYEFIPKTNIDVKPVCGAVLTT

Sbjct:   420 INKSVLERTIPFSVGKRNCVGEGLARMELFLIFSALIQKYEFIPKTNIDVKPVCGAVLTT 479

 

Query:   481 KPYICELVPQAA 492

             KPYICELVPQAA

Sbjct:   480 KPYICELVPQAA 491

 

CYP35A3

 

>CYP35A3 AC006673 K09D9 494 aa Revised 3/19/04

         Length = 494

 

 Score = 2546 (896.2 bits), Expect = 4.3e-268, P = 4.3e-268

 Identities = 491/494 (99%), Positives = 491/494 (99%)

 

Query:     1 MFFVLFIAACLSWLIVRQYQKVSRHPPGPISFPLIGNLPQICYYLWSTGGIVSALDLLRK 60

             MFFVLFIAACLSWLIVRQYQKVSRHPPGPISFPLIGNLPQICYYLWSTGGIVSALDLLRK

Sbjct:     1 MFFVLFIAACLSWLIVRQYQKVSRHPPGPISFPLIGNLPQICYYLWSTGGIVSALDLLRK 60

 

Query:    61 KYGNIFTLWVGPVPYVSIADFETSHEVFVKNGGKYADKLHAPIMRDIRKDKGIAFTNGDH 120

             KYGNIFTLWVGPVPYVSIADFETSHEVFVKNGGKYADKLHAPIMRDIR  KGIAFTNGDH

Sbjct:    61 KYGNIFTLWVGPVPYVSIADFETSHEVFVKNGGKYADKLHAPIMRDIRSMKGIAFTNGDH 120

 

Query:   121 WQEMRRFSLQTFRNMGVGKDIMETRILEELDARCSDIDKSAKNGVTVAQASEFFDLTVGS 180

             WQEMRRFSLQTFRNMGVGKDIMETRILEELDARCSDIDKSAKNGVTVAQASEFFDLTVGS

Sbjct:   121 WQEMRRFSLQTFRNMGVGKDIMETRILEELDARCSDIDKSAKNGVTVAQASEFFDLTVGS 180

 

Query:   181 VINSILVGKRFEEDTKHVFLRIKNTIDESFKLITPFNTTVPVWILKTFFKDRYDKMADAQ 240

             VINSILVGKRFEEDTKHVFLRIKNTIDESFKLITPFNTTVPVWILKTFFKDRYDKMADAQ

Sbjct:   181 VINSILVGKRFEEDTKHVFLRIKNTIDESFKLITPFNTTVPVWILKTFFKDRYDKMADAQ 240

 

Query:   241 EIAKNFVAAEALKRIEDIKSGKYVIDENNLQDYTDAFLLKMQKEGENLDFNTETLKTMLV 300

             EIAKNFVAAEALKR EDIKSGKYVIDENNLQDYTDAFLLKMQKEGENLDFNTETLKTMLV

Sbjct:   241 EIAKNFVAAEALKRFEDIKSGKYVIDENNLQDYTDAFLLKMQKEGENLDFNTETLKTMLV 300

 

Query:   301 DLWLTGQETTTTTLTSGFNQLLLHPEVMVKAREELMNVTENGSRSLSLTDRSSTPYLNAM 360

             DLWLTGQETTTTTLTSGFNQLLLHPEVMVKAREELMNVTENGSRSLSLTDRSSTPYLNAM

Sbjct:   301 DLWLTGQETTTTTLTSGFNQLLLHPEVMVKAREELMNVTENGSRSLSLTDRSSTPYLNAM 360

 

Query:   361 IGEIQRHASILNLSFWKVNKEFTYIGGHPVDAGALVTAQLSALHVNETYFTNPQVFEPER 420

             IGEIQRHASILNLSFWKVNKEFTYIGGHPVDAGALVTAQLSALHVNETYFTNPQVFEPER

Sbjct:   361 IGEIQRHASILNLSFWKVNKEFTYIGGHPVDAGALVTAQLSALHVNETYFTNPQVFEPER 420

 

Query:   421 YSQDEKLLQKIIPFGVGKRSCLGESLAKAELYLIFGNLLLRYKFEPHGKLSATDLMPYSA 480

             YSQDEKLLQKIIPFGVGKRSCLGESLAKAELYLIFGNLLLRYKFEPHGKLSATDLMPYSA

Sbjct:   421 YSQDEKLLQKIIPFGVGKRSCLGESLAKAELYLIFGNLLLRYKFEPHGKLSATDLMPYSA 480

 

Query:   481 GRRPFKLEMKFVKI 494

             GRRPFKLEMKFVKI

Sbjct:   481 GRRPFKLEMKFVKI 494

 

cDNA support

>gi|30742693|gb|CB400966.1|CB400966  UniGene info OSTF185F11_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 574

 

 Score =  217 bits (553), Expect = 4e-58

 Identities = 104/107 (97%), Positives = 105/107 (98%)

 Frame = +3

 

Query: 1   RKKYGNIFTLWVGPVPYVSIADFETSHEVFVKNGGKYADKLHAPIMRDIRSMKGIAFTNG 60

           +KKYGNIFTLWVGPVPYVSIADFETSHEVFVKNGGKYADKLHAPIMRDIR  KGIAFTNG

Sbjct: 174 KKKYGNIFTLWVGPVPYVSIADFETSHEVFVKNGGKYADKLHAPIMRDIRKDKGIAFTNG 353

 

Query: 61  DHWQEMRRFSLQTFRNMGVGKDIMETRILEELDARCSDIDKSAKNGV 107

           DHWQEMRRFSLQTFRNMGVGKDIMETRILEELDARCSDIDKSAKNGV

Sbjct: 354 DHWQEMRRFSLQTFRNMGVGKDIMETRILEELDARCSDIDKSAKNGV 494

 

>gi|4263265|gb|AC006673.1|  Download subject sequence spanning the HSP Caenorhabditis elegans cosmid K09D9, complete sequence

          Length = 37242

 

 Score = 72.8 bits (177), Expect = 5e-14

 Identities = 41/58 (70%), Positives = 41/58 (70%), Gaps = 17/58 (29%)

 Frame = +3

 

Query: 1     EIAKNFVAAEALKRF-----------------EDIKSGKYVIDENNLQDYTDAFLLKM 41

             EIAKNFVAAEALKRF                 EDIKSGKYVIDENNLQDYTDAFLLKM

Sbjct: 32793 EIAKNFVAAEALKRFVFAFFHNHIIILFV*RIEDIKSGKYVIDENNLQDYTDAFLLKM 32966

 

CYP34A4

 

>CYP34A4 T09H2.1 485 aa

         Length = 487

 

 Score = 2512 (884.3 bits), Expect = 1.7e-264, P = 1.7e-264

 Identities = 486/500 (97%), Positives = 486/500 (97%)

 

Query:     1 MLLILLFTTVLAILIIHQWIKRRNLPNGPTPIPIIGNLHQLFYLYWKHGGAVSAYRQLEQ 60

             MLLILLFTTVLAILIIHQWIKRRNLPNGPTPIPIIGNLHQLFYLYWKHGGAVSAYRQLEQ

Sbjct:     1 MLLILLFTTVLAILIIHQWIKRRNLPNGPTPIPIIGNLHQLFYLYWKHGGAVSAYRQLEQ 60

 

Query:    61 TYGKVFTVWIGPLPTVYISDYDLAHETHIKKSNIFGARFAVGALNYIREGRGIVASNGEF 120

             TYGKVFTVWIGPLPTVYISDYDLAHETHIKKSNIFGARFAVGALNYIREGRGIVASNGEF

Sbjct:    61 TYGKVFTVWIGPLPTVYISDYDLAHETHIKKSNIFGARFAVGALNYIREGRGIVASNGEF 120

 

Query:   121 WQEHRRFALTTLRNFGLGRNLMEEKIMEEYRYRFAQTGNGNNNKSSIETNSSMFFDLLIG 180

             WQEHRRFALTTLRNFGLGRNLMEEKIMEEYRYRFAQTGNGNNNKSSIETNSSMFFDLLIG

Sbjct:   121 WQEHRRFALTTLRNFGLGRNLMEEKIMEEYRYRFAQTGNGNNNKSSIETNSSMFFDLLIG 180

 

Query:   181 SIINQLLISERFEQGDPEFEKLKESLSIGLEKFGVLDIFLPDWIMNAWWMKWRMDDILGP 240

             SIINQLLISERFE    EFEKLKESLSIGLEKFGVLDIFLPDWIMNAWWMKWRMDDILGP

Sbjct:   181 SIINQLLISERFE----EFEKLKESLSIGLEKFGVLDIFLPDWIMNAWWMKWRMDDILGP 236

 

Query:   241 FSWIHRLSQRNVQRRMEQIESGEHVIDGDGTDFMDTYINKIEKDKREGVDSTFTLETLAI 300

             FSWIHRLSQRNVQRRMEQIESGEHVIDGDGTDFMDTYINKIEKDKREGVDSTFT     

Sbjct:   237 FSWIHRLSQRNVQRRMEQIESGEHVIDGDGTDFMDTYINKIEKDKREGVDSTFT------ 290

 

Query:   301 DMYDLWIAGQETTSTTLSWACACLLNHPEVVKIAREELVHLTGGHRSISLTDKTSTPYLN 360

                DLWIAGQETTSTTLSWACACLLNHPEVVKIAREELVHLTGGHRSISLTDKTSTPYLN

Sbjct:   291 ---DLWIAGQETTSTTLSWACACLLNHPEVVKIAREELVHLTGGHRSISLTDKTSTPYLN 347

 

Query:   361 AVINEVQRIASILNVNIFRQTSEDTTVNGQPIAAGTALTTHLSLIHTDENLFQNHTEFRP 420

             AVI EVQRIASILNVNIFRQTSEDTTVNGQPIAAGTALTTHLSLIHTDENLFQNHTEFRP

Sbjct:   348 AVI-EVQRIASILNVNIFRQTSEDTTVNGQPIAAGTALTTHLSLIHTDENLFQNHTEFRP 406

 

Query:   421 ERFLENNNLEKKLIPFGIGKRACLGESLARAELYLVTGNMILDYDLQPIGEVPQIKTTSP 480

             ERFLENNNLEKKLIPFGIGKRACLGESLARAELYLVTGNMILDYDLQPIGEVPQIKTTSP

Sbjct:   407 ERFLENNNLEKKLIPFGIGKRACLGESLARAELYLVTGNMILDYDLQPIGEVPQIKTTSP 466

 

Query:   481 CGIMKRPPVYSLRFVPVHHA 500

             CGIMKRPPVYSLRFVPVHHA

Sbjct:   467 CGIMKRPPVYSLRFVPVHHA 486

 

>gi|3168923|gb|AF068712.1|   Caenorhabditis elegans cosmid T09H2, complete sequence

          Length = 3304

 

 Score =  180 bits (457), Expect = 3e-46

 Identities = 101/166 (60%), Positives = 110/166 (66%), Gaps = 33/166 (19%)

 Frame = +3

 

Query: 15   GDPEFEKLKESLSIGLEKFGVLDIFLPDWIMNAWWMKWRMDDILGPFSWIHRLSQRNVQR 74

            GDPEFEKLKESLSIGLEKFGVLDIFLPDWIMNAWWMKWRMDDILGPFSWIHRLSQRNVQR

Sbjct: 1923 GDPEFEKLKESLSIGLEKFGVLDIFLPDWIMNAWWMKWRMDDILGPFSWIHRLSQRNVQR 2102

 

Query: 75   R----------------MEQIESGEHVIDGDGTDFMDTYINKIE----KDKREGVDSTF- 113

            R                +E  +S    +    T  +   +  I+    K+K   + S  

Sbjct: 2103 R*GKKTCYLKNFFIIIFLEWNKSKVVSMSSMVTVLILWTLISIKSRKIKEKASILHSRM* 2282

 

Query: 114  ------------TLETLAIDMYDLWIAGQETTSTTLSWACACLLNH 147

                        +LETLAIDMYDLWIAGQETTSTTLSWACACLLNH

Sbjct: 2283 FFCKSFIKNYISSLETLAIDMYDLWIAGQETTSTTLSWACACLLNH 2420

 

 

 

 

 Score = 87.8 bits (216), Expect = 3e-18

 Identities = 64/147 (43%), Positives = 80/147 (54%), Gaps = 33/147 (22%)

 Frame = +2

 

Query: 1    SIINQLLISERFEQGDPEFEKLK-ESLSIGLEKFGVLDIFLPDWIMNAW---WMKWRMDD 56

            SIINQLLISERFEQ       LK +++ +G  +    +    DW    W   ++  R+D+

Sbjct: 1835 SIINQLLISERFEQV*QFNVMLKLKNMFVG*PRI*KTEREPVDWSGKVWSFGYIFARLDN 2014

 

Query: 57   --ILGPF------------------SWIHRLSQRNVQ---------RRMEQIESGEHVID 87

              ++                         ++ ++NV           RMEQIESGEHVID

Sbjct: 2015 ECVVDEMENG*HFRSI*LDS*AFAKKCAEKVRKKNVLPKKFFYYNFSRMEQIESGEHVID 2194

 

Query: 88   GDGTDFMDTYINKIEKDKREGVDSTFT 114

            GDGTDFMDTYINKIEKDKREGVDSTFT

Sbjct: 2195 GDGTDFMDTYINKIEKDKREGVDSTFT 2275

 

 

CYP14A4

 

>CYP14A4 K09A11    Z50742   end of cosmid = R04D3.1  Z70212 486 aa

         Length = 487

 

 Score = 2571 (905.0 bits), Expect = 9.7e-271, P = 9.7e-271

 Identities = 484/491 (98%), Positives = 484/491 (98%)

 

Query:     1 MSPLLILAFFVATVGYLVHFYLKVRKYPKGPFPLPVIGNLHQIPGGNLEIWFDELAKIYG 60

             MSPLLILAFFVATVGYLVHFYLKVRKYPKGPFPLPVIGNLHQIPGGNLEIWFDELAKIYG

Sbjct:     1 MSPLLILAFFVATVGYLVHFYLKVRKYPKGPFPLPVIGNLHQIPGGNLEIWFDELAKIYG 60

 

Query:    61 PCYTVWSPLPCVVLTDFDHIKNAFVTQGEFNNRAHKPPESLLHVHDNTGILNSSGENWQL 120

             PCYTVWSPLPCVVLTDFDHIKNAFVTQGEFNNRAHKPPESLLHVHDNTGILNSSGENWQL

Sbjct:    61 PCYTVWSPLPCVVLTDFDHIKNAFVTQGEFNNRAHKPPESLLHVHDNTGILNSSGENWQL 120

 

Query:   121 QRRTSLKILRDFGMGKNLMEEQIVKSVQEMMTQLEKTDDKKKADIFWPVQLCVGNVINQV 180

             QRRTSLKILRDFGMGKNLMEEQIVKSVQEMMTQLEKTDDKKKADIFWPVQLCVGNVINQV

Sbjct:   121 QRRTSLKILRDFGMGKNLMEEQIVKSVQEMMTQLEKTDDKKKADIFWPVQLCVGNVINQV 180

 

Query:   181 LFGYHFKYDDCERFQKFVSVIDFHLRTLLGKCSLLVSAFPMLRHVPIIGEYGYQRIKRNI 240

             LFGYHFKYDDCERFQKFVSVIDFHLRTLLGKCSLLVSAFPMLRHVPIIGEYGYQRIKRNI

Sbjct:   181 LFGYHFKYDDCERFQKFVSVIDFHLRTLLGKCSLLVSAFPMLRHVPIIGEYGYQRIKRNI 240

 

Query:   241 KSYQYFIEEEVATQMKSFEEENEPENFVHAYMQQMKQNGHPGLDVKNLRACALDFWLAGM 300

             KSYQYFIEEEVATQMKSFEEENEPENFVHAYMQQMKQNGHPGLDVKNLRACALDFWLAGM

Sbjct:   241 KSYQYFIEEEVATQMKSFEEENEPENFVHAYMQQMKQNGHPGLDVKNLRACALDFWLAGM 300

 

Query:   301 ETTSNSLRWHIAYMMKHPDIQDKVRKEILDVVGNSRFPSMSDKPNMPYTQAVIHEVQRHS 360

             ETTSNSLRWHIAYMMKHPDIQDKVRKEILDVVGNSRFPSMSDKPNMPYTQAVIHEVQRHS

Sbjct:   301 ETTSNSLRWHIAYMMKHPDIQDKVRKEILDVVGNSRFPSMSDKPNMPYTQAVIHEVQRHS 360

 

Query:   361 NMVPFLGTHQSNKDTNVLGQNIPAGTNVLVQAWSVMRNDPIFINQLSFNPDRYLMADGKS 420

             NMVPFLGTH  NKDTNVLGQNIPAGTNVLVQAWSVMRNDPIF     FNPDRYLMADGKS

Sbjct:   361 NMVPFLGTHSGNKDTNVLGQNIPAGTNVLVQAWSVMRNDPIF-----FNPDRYLMADGKS 415

 

Query:   421 FDKNILEKTIPFSVGKRSCIGEGLARMELFLIFSALIQKYEFIANGPVDMSYDFGAVLTI 480

             FDKNILEKTIPFSVGKRSCIGEGLARMELFLIFSALIQKYEFIANGPVDMSYDFGAVLTI

Sbjct:   416 FDKNILEKTIPFSVGKRSCIGEGLARMELFLIFSALIQKYEFIANGPVDMSYDFGAVLTI 475

 

Query:   481 KPYTCQLKAQF 491

             KPYTCQLKAQF

Sbjct:   476 KPYTCQLKAQF 486

 

compare to cDNA

>gi|30740757|gb|CB399030.1|CB399030   OSTR214C8_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 534

 

 Score =  233 bits (593), Expect = 9e-63

 Identities = 112/114 (98%), Positives = 112/114 (98%)

 Frame = -1

 

Query: 1   VVGNSRFPSMSDKPNMPYTQAVIHEVQRHSNMVPFLGTHSGNKDTNVLGQNIPAGTNVLV 60

           VVGNSRFPSMSDKPNMPYTQAVIHEVQRHSNMVPFLGTH  NKDTNVLGQNIPAGTNVLV

Sbjct: 459 VVGNSRFPSMSDKPNMPYTQAVIHEVQRHSNMVPFLGTHQSNKDTNVLGQNIPAGTNVLV 280

 

Query: 61  QAWSVMRNDPIFINQLSFNPDRYLMADGKSFDKNILEKTIPFSVGKRSCIGEGL 114

           QAWSVMRNDPIFINQLSFNPDRYLMADGKSFDKNILEKTIPFSVGKRSCIGEGL

Sbjct: 279 QAWSVMRNDPIFINQLSFNPDRYLMADGKSFDKNILEKTIPFSVGKRSCIGEGL 118

 

 

CYP33C8

 

>CYP33C8 R08F11 494 aa also Y39H10 contig 180

         Length = 495

 

 Score = 2626 (924.4 bits), Expect = 1.4e-276, P = 1.4e-276

 Identities = 493/494 (99%), Positives = 494/494 (100%)

 

Query:     1 MFFILVLVLTGISIYLFHLLYWKRRNLPPGPTPLPLFGNTLPTLFDIPGYKTFGKLKEQY 60

             MFFILVLVLTGISIYLFHLLYWKRRNLPPGPTPLPLFGNTLPTLFDIPGYKTFGKLKEQY

Sbjct:     1 MFFILVLVLTGISIYLFHLLYWKRRNLPPGPTPLPLFGNTLPTLFDIPGYKTFGKLKEQY 60

 

Query:    61 GDAYTFWMGKDPYVMICSYDLLKETFIRDADTYKDKDYNPIDEKIRGGIYGILQTNGHVW 120

             GDAYTFWMGKDPYVMICSYDLLKETFIRDADTYKDKDYNPIDEKIRGGIYGILQTNGHVW

Sbjct:    61 GDAYTFWMGKDPYVMICSYDLLKETFIRDADTYKDKDYNPIDEKIRGGIYGILQTNGHVW 120

 

Query:   121 NTHRRFALTTFRDFGLGKDLMQQKILIEVEDMFRKLDENIGEEQDVPTVLYNAVANVINQ 180

             NTHRRFALTTFRDFGLGKDLMQQKILIEVEDMFRKLDENIGEEQDVPTVLYNAVANVINQ

Sbjct:   121 NTHRRFALTTFRDFGLGKDLMQQKILIEVEDMFRKLDENIGEEQDVPTVLYNAVANVINQ 180

 

Query:   181 IIFGYRFEGVKQEEFTKLKELMEYLETAFATLKIYVEIFVPWIGKLLPGKSLDDVLNYWK 240

             IIFGYRFEGVKQEEFTKLKELMEYLETAFATLKIYVEIFVPWIGKLLPGKSLDDVLNYWK

Sbjct:   181 IIFGYRFEGVKQEEFTKLKELMEYLETAFATLKIYVEIFVPWIGKLLPGKSLDDVLNYWK 240

 

Query:   241 DTSYEFFNSQIENHRLKIDFDTEESLDYAEAYLKEQKKQKAQGEFELFCDKQLSNNCLDL 300

             DTSYEFFNSQIENHRLKIDFDTEESLDYAEAYLKEQKKQKAQGEFELFCDKQLSNNCLDL

Sbjct:   241 DTSYEFFNSQIENHRLKIDFDTEESLDYAEAYLKEQKKQKAQGEFELFCDKQLSNNCLDL 300

 

Query:   301 WTAGLSTTIITINWTICYIMNTPGVQEKMQEEMDKVVGGGRLVTTADKNDLPYMNAVINE 360

             WTAGLSTTIITINWTICYIMNTPGVQEKMQEEMDKVVGGGRLVTTADKNDLPYMNAVINE

Sbjct:   301 WTAGLSTTIITINWTICYIMNTPGVQEKMQEEMDKVVGGGRLVTTADKNDLPYMNAVINE 360

 

Query:   361 AQRCGNIVPLNLLHCTTKDTVINGYSVKKGTGVVAQISTVMYDEKIFPDPYTFNPDRFID 420

             AQRCGNIVPLNLLHCTTKDTVINGYSVKKGTGVVAQISTVMYDEK+FPDPYTFNPDRFID

Sbjct:   361 AQRCGNIVPLNLLHCTTKDTVINGYSVKKGTGVVAQISTVMYDEKVFPDPYTFNPDRFID 420

 

Query:   421 DNGKLVKIEQLIPFSIGKRQCLGEGLARMELFLFIANFLNRYHISPGSSGLPSLDKSKGI 480

             DNGKLVKIEQLIPFSIGKRQCLGEGLARMELFLFIANFLNRYHISPGSSGLPSLDKSKGI

Sbjct:   421 DNGKLVKIEQLIPFSIGKRQCLGEGLARMELFLFIANFLNRYHISPGSSGLPSLDKSKGI 480

 

Query:   481 TPRKFEPVLSRRHA 494

             TPRKFEPVLSRRHA

Sbjct:   481 TPRKFEPVLSRRHA 494

 

CYP13A5

 

>CYP13A5 T10B9.2   Z48717 496 aa

         Length = 497

 

 Score = 1470 (517.5 bits), Expect = 7.7e-292, Sum P(2) = 7.7e-292

 Identities = 287/302 (95%), Positives = 287/302 (95%)

 

Query:     1 MSLSILIAGASFIGLLTYYIWIWSFWIRKGVKGPRGFPFFGVIHEFQDYENPGLLKLGEW 60

             MSLSILIAGASFIGLLTYYIWIWSFWIRKGVKGPRGFPFFGVIHEFQDYENPGLLKLGEW

Sbjct:     1 MSLSILIAGASFIGLLTYYIWIWSFWIRKGVKGPRGFPFFGVIHEFQDYENPGLLKLGEW 60

 

Query:    61 TKEYGPIYGITEGVEKTLIVSNPEFVHEVFVKQFDNFYGRKTNPIQGDPNKNKRAHLVSA 120

             TKEYGPIYGITEGVEKTLIVSNPEFVHEVFVKQFDNFYGRKTNPIQGDPNKNKRAHLVSA

Sbjct:    61 TKEYGPIYGITEGVEKTLIVSNPEFVHEVFVKQFDNFYGRKTNPIQGDPNKNKRAHLVSA 120

 

Query:   121 QGHRWKRLRTLSSPTFSNKNLRKIMSTVEETVVELMRHLDDASAKGKAVDLLDYYQEFTL 180

             QGHRWKRLRTLSSPTFSNKNLRKIMSTVEETVVELMRHLDDASAKGKAVDLLDYYQEFTL

Sbjct:   121 QGHRWKRLRTLSSPTFSNKNLRKIMSTVEETVVELMRHLDDASAKGKAVDLLDYYQEFTL 180

 

Query:   181 DIIGRIAMGQTESLMFRNPMLPKVKGIFKDGRKLPFLVSGIFPIAGTMFREFFMRFPSIQ 240

             DIIGRIAMGQTESLMFRNPMLPKVKGIFKDGRKLPFLVSGIFPIAGTMFREFFMRFP  

Sbjct:   181 DIIGRIAMGQTESLMFRNPMLPKVKGIFKDGRKLPFLVSGIFPIAGTMFREFFMRFP--- 237

 

Query:   241 PAFDIMSTVEKALNKRLEQRAADEKAGIEPSGEPQDFIDLFLDARANVDFFEEESALGFA 300

                        ALNKRLEQRAADEKAGIEPSGEPQDFIDLFLDARANVDFFEEESALGFA

Sbjct:   238 -----------ALNKRLEQRAADEKAGIEPSGEPQDFIDLFLDARANVDFFEEESALGFA 286

 

Query:   301 KT 302

              T

Sbjct:   287 LT 288

 

 Score = 1324 (466.1 bits), Expect = 7.7e-292, Sum P(2) = 7.7e-292

 Identities = 259/269 (96%), Positives = 259/269 (96%)

 

Query:   252 ALNKRLEQRAADEKAGIEPSGEPQDFIDLFLDARANVDFFEEESALGFAKTEIAKVDKQL 311

             ALNKRLEQRAADEKAGIEPSGEPQDFIDLFLDARANVDFFEEESALGFA          L

Sbjct:   238 ALNKRLEQRAADEKAGIEPSGEPQDFIDLFLDARANVDFFEEESALGFA----------L 287

 

Query:   312 TFDEIIGQLFVFLLAGYDTTALSLSYSSYLLARHPEIQKKLQEEVDRECPNPEVTFDQIS 371

             TFDEIIGQLFVFLLAGYDTTALSLSYSSYLLARHPEIQKKLQEEVDRECPNPEVTFDQIS

Sbjct:   288 TFDEIIGQLFVFLLAGYDTTALSLSYSSYLLARHPEIQKKLQEEVDRECPNPEVTFDQIS 347

 

Query:   372 KLKYMECVVKEALRMYPLASIVHNRKCMKETNVLGVQIEKGTNVQVDTWTLHYDPKVWGE 431

             KLKYMECVVKEALRMYPLASIVHNRKCMKETNVLGVQIEKGTNVQVDTWTLHYDPKVWGE

Sbjct:   348 KLKYMECVVKEALRMYPLASIVHNRKCMKETNVLGVQIEKGTNVQVDTWTLHYDPKVWGE 407

 

Query:   432 DANEFRPERWESGDELFYAKGGYLPFGMGPRICIGMRLAMMEKKMLLTHILKKYTFETST 491

             DANEFRPERWESGDELFYAKGGYLPFGMGPRICIGMRLAMMEKKMLLTHILKKYTFETST

Sbjct:   408 DANEFRPERWESGDELFYAKGGYLPFGMGPRICIGMRLAMMEKKMLLTHILKKYTFETST 467

 

Query:   492 QTEIPLKLVGSATTAPRSVMLKLTPRHSN 520

             QTEIPLKLVGSATTAPRSVMLKLTPRHSN

Sbjct:   468 QTEIPLKLVGSATTAPRSVMLKLTPRHSN 496

 

T10B9.2; CE01656. CYP13A5

MSLSILIAGASFIGLLTYYIWIWSFWIRKGVKGPRGFPFFGVIHEFQDYE  50

NPGLLKLGEWTKEYGPIYGITEGVEKTLIVSNPEFVHEVFVKQFDNFYGR 100

KTNPIQGDPNKNKRAHLVSAQGHRWKRLRTLSSPTFSNKNLRKIMSTVEE 150

TVVELMRHLDDASAKGKAVDLLDYYQEFTLDIIGRIAMGQTESLMFRNPM 200

 

LPKVKGIFKDGRKLPFLVSGIFPIAGTMFREFFMRFPSIQPAFDIMSTVE 250

KALNKRLEQRAADEKAGIEPSGEPQDFIDLFLDARANVDFFEEESALGFA 300

KTEIAKVDKQLTFDEIIGQLFVFLLAGYDTTALSLSYSSYLLARHPEIQK 350

 

KLQEEVDRECPNPEVTFDQISKLKYMECVVKEALRMYPLASIVHNRKCMK 400

ETNVLGVQIEKGTNVQVDTWTLHYDPKVWGEDANEFRPERWESGDELFYA 450

KGGYLPFGMGPRICIGMRLAMMEKKMLLTHILKKYTFETSTQTEIPLKLV 500

GSATTAPRSVMLKLTPRHSN                               520

 

best match in briggsae

>gi|22417475|emb|CAAC01000006.1|  Download subject sequence spanning the HSP Caenorhabditis briggsae contig cb25.fpc0022 from assembly cb25.agp8, whole

               genome shotgun sequence

          Length = 1502397

 

Score = 29.6 bits (65), Expect(2) = 9e-46

 Identities = 13/24 (54%), Positives = 14/24 (58%)

 Frame = -3

 

Query: 7       IFKDGRKLPFLVSGIFPIAGTMFR 30

               IF DGRK  F  S +FP  G M R

Sbjct: 1381777 IFVDGRKHAFFASAMFPFLGAMLR 1381706

 

 Score =  177 bits (449), Expect(2) = 9e-46

 Identities = 87/119 (73%), Positives = 100/119 (84%)

 Frame = -1

 

Query: 32      FFMRFPSIQPAFDIMSTVEKALNKRLEQRAADEKAGIEPSGEPQDFIDLFLDARANVDFF 91

               F   FPSIQPA DIM+ +EKA+  R++QR  DEK GIE SGEPQDFIDLFLDARA+VDFF

Sbjct: 1381710 FVSMFPSIQPASDIMNIIEKAVKARIDQRKTDEKLGIEVSGEPQDFIDLFLDARADVDFF 1381531

 

Query: 92      EEESALGFAKTEIAKVDKQLTFDEIIGQLFVFLLAGYDTTALSLSYSSYLLARHPEIQK 150

               E+E+   F KT+I KVDK LTFDE +GQLFVFLLAGYDTTALSLSYSS+LLA HPE+Q+

Sbjct: 1381530 EQEAKDDFGKTQITKVDKHLTFDETVGQLFVFLLAGYDTTALSLSYSSFLLATHPEVQR 1381354

 

 

 

 

 

 

CYP13A6 add to blast file (missing)

 

>CYP13A6   T10B9.3    Z48717 493 aa

           Length = 494

 

 Score = 1451 (510.8 bits), Expect = 3.5e-154, P = 3.5e-154

 Identities = 283/297 (95%), Positives = 283/297 (95%)

 

Query:     1 MIFVLLSAVLLGVFTYSVWIWSYFIRKGIKGPRGFPGIGMLIQTIDHENPPFLKYRDWTK 60

             MIFVLLSAVLLGVFTYSVWIWSYFIRKGIKGPRGFPGIGMLIQTIDHENPPFLKYRDWTK

Sbjct:     1 MIFVLLSAVLLGVFTYSVWIWSYFIRKGIKGPRGFPGIGMLIQTIDHENPPFLKYRDWTK 60

 

Query:    61 QYGPVYGFTEGPQQTMIISEPEMVNEIFKKQFDNFYGRKLRPIIGDPEKDKRVNIFSTQG 120

             QYGPVYGFTEGPQQTMIISEPEMVNEIFKKQFDNFYGRKLRPIIGDPEKDKRVNIFSTQG

Sbjct:    61 QYGPVYGFTEGPQQTMIISEPEMVNEIFKKQFDNFYGRKLRPIIGDPEKDKRVNIFSTQG 120

 

Query:   121 KRWKRLRTLSSPSFSNNSLRKVRNSVQECGTEILWNIEQKVRKNEDIDMLIVYQEYTLGV 180

             KRWKRLRTLSSPSFSNNSLRKVRNSVQECGTEILWNIEQKVRKNEDIDMLIVYQEYTLGV

Sbjct:   121 KRWKRLRTLSSPSFSNNSLRKVRNSVQECGTEILWNIEQKVRKNEDIDMLIVYQEYTLGV 180

 

Query:   181 ISRIALGQSESNMFKNPLLPKVQAIFNGSWHVFLITGIFPPLAGVFRKMSKMLPASFIPA 240

             ISRIALGQSESNMFKNPLLPKVQAIFNGSWHVFLITGIFPPLAGVFRKMSKMLPA    

Sbjct:   181 ISRIALGQSESNMFKNPLLPKVQAIFNGSWHVFLITGIFPPLAGVFRKMSKMLPA----- 235

 

Query:   241 FKIFDLIEVAVQARIDQRAKDEIKGVEPGEPQDFIDLFLDARVPDVKILSGEANEDF 297

                      AVQARIDQRAKDEIKGVEPGEPQDFIDLFLDARVPDVKILSGEANEDF

Sbjct:   236 ---------AVQARIDQRAKDEIKGVEPGEPQDFIDLFLDARVPDVKILSGEANEDF 283

 

 Score = 1338 (471.0 bits), Expect = 3.3e-142, P = 3.3e-142

 Identities = 259/277 (93%), Positives = 262/277 (94%)

 

Query:   242 KIFDLIEVAVQARIDQRAKDEIKGVEPGEPQDFIDLFLDARVPDVKILSGEANEDFAKSS 301

             K+  ++  AVQARIDQRAKDEIKGVEPGEPQDFIDLFLDARVPDVKILSGEANEDF   

Sbjct:   228 KMSKMLPAAVQARIDQRAKDEIKGVEPGEPQDFIDLFLDARVPDVKILSGEANEDF---- 283

 

Query:   302 VVKINKELTFDEIIAQCFVFLAAGFDTTALSLSYATYLLATHPEIQTKLQEEVDRECPDP 361

                    LTFDEIIAQCFVFLAAGFDTTALSLSYATYLLATHPEIQTKLQEEVDRECPDP

Sbjct:   284 -------LTFDEIIAQCFVFLAAGFDTTALSLSYATYLLATHPEIQTKLQEEVDRECPDP 336

 

Query:   362 EIFFDHLSKLKYLECVMKETLRLYPLGTTANTRKCMRETTINGVNFDEGMNIQVDTWTLH 421

             EIFFDHLSKLKYLECVMKETLRLYPLGTTANTRKCMRETTINGVNFDEGMNIQVDTWTLH

Sbjct:   337 EIFFDHLSKLKYLECVMKETLRLYPLGTTANTRKCMRETTINGVNFDEGMNIQVDTWTLH 396

 

Query:   422 HNPRIWGEDVEDFKPERWENGACEHLEHNGSYIPFGSGPRQCIGMRLAQMEQKILLAQIL 481

             HNPRIWGEDVEDFKPERWENGACEHLEHNGSYIPFGSGPRQCIGMRLAQMEQKILLAQIL

Sbjct:   397 HNPRIWGEDVEDFKPERWENGACEHLEHNGSYIPFGSGPRQCIGMRLAQMEQKILLAQIL 456

 

Query:   482 KEYSFRTTKNTQIPVKLVGKLTLSPESVIVKLEPRDS 518

             KEYSFRTTKNTQIPVKLVGKLTLSPESVIVKLEPRDS

Sbjct:   457 KEYSFRTTKNTQIPVKLVGKLTLSPESVIVKLEPRDS 493

 

> T10B9.3          CE01657

MIFVLLSAVLLGVFTYSVWIWSYFIRKGIKGPRGFPGIGMLIQTIDHENP  50

PFLKYRDWTKQYGPVYGFTEGPQQTMIISEPEMVNEIFKKQFDNFYGRKL 100

RPIIGDPEKDKRVNIFSTQGKRWKRLRTLSSPSFSNNSLRKVRNSVQECG 150

TEILWNIEQKVRKNEDIDMLIVYQEYTLGVISRIALGQSESNMFKNPLLP 200

KVQAIFNGSWHVFLITGIFPPLAGVFRKMSKMLPASFIPAFKIFDLIEVA 250

VQARIDQRAKDEIKGVEPGEPQDFIDLFLDARVPDVKILSGEANEDFAKS 300

SVVKINKELTFDEIIAQCFVFLAAGFDTTALSLSYATYLLATHPEIQTKL 350

QEEVDRECPDPEIFFDHLSKLKYLECVMKETLRLYPLGTTANTRKCMRET 400

TINGVNFDEGMNIQVDTWTLHHNPRIWGEDVEDFKPERWENGACEHLEHN 450

GSYIPFGSGPRQCIGMRLAQMEQKILLAQILKEYSFRTTKNTQIPVKLVG 500

KLTLSPESVIVKLEPRDS                                 518

 

CYP13A8

 

>CYP13A8 T10B9.4  Z48717 (C-terminal) and Y53C12 whole 13A subfamily 510 aa

         Length = 511

 

 Score = 2592 (912.4 bits), Expect = 5.7e-273, P = 5.7e-273

 Identities = 499/510 (97%), Positives = 499/510 (97%)

 

Query:     1 MIFELILISIVTYYFWHWTFWKRRGLPGPWGVPIFGKAGAMLEDSFPPGYTLQKWTKEYG 60

             MIFELILISIVTYYFWHWTFWKRRGLPGPWGVPIFGKAGAMLEDSFPPGYTLQKWTKEYG

Sbjct:     1 MIFELILISIVTYYFWHWTFWKRRGLPGPWGVPIFGKAGAMLEDSFPPGYTLQKWTKEYG 60

 

Query:    61 KIYGFTEGMQKVMVISDPDLVQEILVKQYDNFYGRKHNPVQGDPDKDKDIHIVGAQGFRW 120

             KIYGFTEGMQKVMVISDPDLVQEILVKQYDNFYGRKHNPVQGDPDKDKDIHIVGAQGFRW

Sbjct:    61 KIYGFTEGMQKVMVISDPDLVQEILVKQYDNFYGRKHNPVQGDPDKDKDIHIVGAQGFRW 120

 

Query:   121 KRLRTITAPAFSNGSIKKVLTTMEDSTQELMKKLREESENGKAVNMHLFYQEYTFDVISR 180

             KRLRTITAPAFSNGSIKKVLTTMEDSTQELMKKLREESENGKAVNM  FYQEYTFDVISR

Sbjct:   121 KRLRTITAPAFSNGSIKKVLTTMEDSTQELMKKLREESENGKAVNMFSFYQEYTFDVISR 180

 

Query:   181 VAMGQPDSQMFKNPLLKDVKG--------FFEHNRWQIWMFSGGFPFAVSFLKWLFIKVG 232

             VAMGQPDSQMFKNPLLKDVKG        FFEHNRWQIWMFSGGFPFAVSFLKWLFIKVG

Sbjct:   181 VAMGQPDSQMFKNPLLKDVKGVNKNVFLQFFEHNRWQIWMFSGGFPFAVSFLKWLFIKVG 240

 

Query:   233 KFGAGPFIVVQKSVTDAVMSRIAQREADKKHGVEPGEAADYIDMFLNARAEVEHFGESND 292

             KFGAGPFIVVQKSVTDAVMSRIAQREADKKHGVEPGEAADYIDMFLNARAEVEHFGESND

Sbjct:   241 KFGAGPFIVVQKSVTDAVMSRIAQREADKKHGVEPGEAADYIDMFLNARAEVEHFGESND 300

 

Query:   293 EFHKSSSYNNRQLTTQEIISQCFVFLVAGFDTTAISLSYVTYFLALNPKIQSKLQDEVDK 352

             EFHK        LTTQEIISQCFVFLVAGFDTTAISLSYVTYFLALNPKIQSKLQDEVDK

Sbjct:   301 EFHK--------LTTQEIISQCFVFLVAGFDTTAISLSYVTYFLALNPKIQSKLQDEVDK 352

 

Query:   353 ECPNDEITFDQLSKLKYMDNVIKESLRLFPFASFANSRRCMRNTVIGEQIVEAGVDVMID 412

             ECPNDEITFDQLSKLKYMDNVIKESLRLFPFASFANSRRCMRNTVIGEQIVEAGVDVMID

Sbjct:   353 ECPNDEITFDQLSKLKYMDNVIKESLRLFPFASFANSRRCMRNTVIGEQIVEAGVDVMID 412

 

Query:   413 TWTLHHDKNVWGNDVEEFKPERWDSPLTPQQAYLS-FGAGPRVCLGMRFALLEQKGLLSH 471

             TWTLHHDKNVWGNDVEEFKPERWDSPLTPQQAYLS FGAGPRVCLGMRFALLEQKGLLSH

Sbjct:   413 TWTLHHDKNVWGNDVEEFKPERWDSPLTPQQAYLSPFGAGPRVCLGMRFALLEQKGLLSH 472

 

Query:   472 ILKKYTFETNAKTQLPIKLVGRATARPENLFLSLKPRV 509

             ILKKYTFETNAKTQLPIKLVGRATARPENLFLSLKPRV

Sbjct:   473 ILKKYTFETNAKTQLPIKLVGRATARPENLFLSLKPRV 510

 

T10B9.4; CE13541.

MIFELILISIVTYYFWHWTFWKRRGLPGPWGVPIFGKAGAMLEDSFPPGY  50

TLQKWTKEYGKIYGFTEGMQKVMVISDPDLVQEILVKQYDNFYGRKHNPV 100

QGDPDKDKDIHIVGAQGFRWKRLRTITAPAFSNGSIKKVLTTMEDSTQEL 150

MKKLREESENGKAVNMHLFYQEYTFDVISRVAMGQPDSQMFKNPLLKDVK 200

G  FFEHNRWQIWMFSGGFPFAVSFLKWLFIKVGKFGAGPFIVVQKSVTDAV 250

MSRIAQREADKKHGVEPGEAADYIDMFLNARAEVEHFGESNDEFHKSSSY 300

NNRQLTTQEIISQCFVFLVAGFDTTAISLSYVTYFLALNPKIQSKLQDEV 350

DKECPNDEITFDQLSKLKYMDNVIKESLRLFPFASFANSRRCMRNTVIGE 400

QIVEAGVDVMIDTWTLHHDKNVWGNDVEEFKPERWDSPLTPQQAYLS  FGA 450

GPRVCLGMRFALLEQKGLLSHILKKYTFETNAKTQLPIKLVGRATARPEN 500

LFLSLKPRV                                          509

 

CYP13A3

 

>CYP13A3 T10B9.5   Z48717 496 aa

         Length = 497

 

 Score = 1483 (522.0 bits), Expect = 2.9e-292, Sum P(2) = 2.9e-292

 Identities = 286/300 (95%), Positives = 286/300 (95%)

 

Query:     1 MSLSILIAIALFIGVFTYYLWIWSFWMRKGIKGPRGLPFFGIINAFQSYEKPWILRLGDW 60

             MSLSILIAIALFIGVFTYYLWIWSFWMRKGIKGPRGLPFFGIINAFQSYEKPWILRLGDW

Sbjct:     1 MSLSILIAIALFIGVFTYYLWIWSFWMRKGIKGPRGLPFFGIINAFQSYEKPWILRLGDW 60

 

Query:    61 TKEYGPMYGFTDGVEKTLVVSDPEFVHEVFVKQFDNFYARKQNPLQGDPDKDPRIHLVTS 120

             TKEYGPMYGFTDGVEKTLVVSDPEFVHEVFVKQFDNFYARKQNPLQGDPDKDPRIHLVTS

Sbjct:    61 TKEYGPMYGFTDGVEKTLVVSDPEFVHEVFVKQFDNFYARKQNPLQGDPDKDPRIHLVTS 120

 

Query:   121 QGHRWKRLRTLASPTFSNKSLRKIFSTVEESVAEMMRHLEKGTAGGKTIDILEYYQEFTM 180

             QGHRWKRLRTLASPTFSNKSLRKIFSTVEESVAEMMRHLEKGTAGGKTIDILEYYQEFTM

Sbjct:   121 QGHRWKRLRTLASPTFSNKSLRKIFSTVEESVAEMMRHLEKGTAGGKTIDILEYYQEFTM 180

 

Query:   181 DIIGKIAMGQSGSMMFENPWLDKIRAIFNTRGNIIFIICGIVPFTGSIFRWFFSKVPTAQ 240

             DIIGKIAMGQSGSMMFENPWLDKIRAIFNTRGNIIFIICGIVPFTGSIFRWFFSKVPTA

Sbjct:   181 DIIGKIAMGQSGSMMFENPWLDKIRAIFNTRGNIIFIICGIVPFTGSIFRWFFSKVPTA- 239

 

Query:   241 TVTSLMHTLEIALTKRVEQRAADEKAGIESSGEPQDFIDLFLDVQADTDFLEDETKNGFA 300

                          TKRVEQRAADEKAGIESSGEPQDFIDLFLDVQADTDFLEDETKNGFA

Sbjct:   240 -------------TKRVEQRAADEKAGIESSGEPQDFIDLFLDVQADTDFLEDETKNGFA 286

 

 Score = 1315 (462.9 bits), Expect = 2.9e-292, Sum P(2) = 2.9e-292

 Identities = 257/267 (96%), Positives = 257/267 (96%)

 

Query:   254 TKRVEQRAADEKAGIESSGEPQDFIDLFLDVQADTDFLEDETKNGFARSQIVKVDKHLTF 313

             TKRVEQRAADEKAGIESSGEPQDFIDLFLDVQADTDFLEDETKNGFA          LTF

Sbjct:   240 TKRVEQRAADEKAGIESSGEPQDFIDLFLDVQADTDFLEDETKNGFA----------LTF 289

 

Query:   314 DEIIGQLFVFLLAGYDTTALSLSYSSYLLARHPEIQKKLQEEVDRECPDPEVTFDQLSKL 373

             DEIIGQLFVFLLAGYDTTALSLSYSSYLLARHPEIQKKLQEEVDRECPDPEVTFDQLSKL

Sbjct:   290 DEIIGQLFVFLLAGYDTTALSLSYSSYLLARHPEIQKKLQEEVDRECPDPEVTFDQLSKL 349

 

Query:   374 KYMECVIKETLRLYPLASIVHNRKCMKSTTVLGMKIEEGTNVQADTWTLHYDPKFWGENA 433

             KYMECVIKETLRLYPLASIVHNRKCMKSTTVLGMKIEEGTNVQADTWTLHYDPKFWGENA

Sbjct:   350 KYMECVIKETLRLYPLASIVHNRKCMKSTTVLGMKIEEGTNVQADTWTLHYDPKFWGENA 409

 

Query:   434 NEFKPERWESGDEQAVAKGAYLPFGLGPRICIGMRLAYMEEKMLLAQILKKYSLETTFET 493

             NEFKPERWESGDEQAVAKGAYLPFGLGPRICIGMRLAYMEEKMLLAQILKKYSLETTFET

Sbjct:   410 NEFKPERWESGDEQAVAKGAYLPFGLGPRICIGMRLAYMEEKMLLAQILKKYSLETTFET 469

 

Query:   494 HIPLKLVGIATTAPTNVHLKLKPRHSD 520

             HIPLKLVGIATTAPTNVHLKLKPRHSD

Sbjct:   470 HIPLKLVGIATTAPTNVHLKLKPRHSD 496

 

T10B9.5; CE01658.

MSLSILIAIALFIGVFTYYLWIWSFWMRKGIKGPRGLPFFGIINAFQSYE  50

KPWILRLGDWTKEYGPMYGFTDGVEKTLVVSDPEFVHEVFVKQFDNFYAR 100

KQNPLQGDPDKDPRIHLVTSQGHRWKRLRTLASPTFSNKSLRKIFSTVEE 150

SVAEMMRHLEKGTAGGKTIDILEYYQEFTMDIIGKIAMGQSGSMMFENPW 200

LDKIRAIFNTRGNIIFIICGIVPFTGSIFRWFFSKVPTAQTVTSLMHTLE 250

IALTKRVEQRAADEKAGIESSGEPQDFIDLFLDVQADTDFLEDETKNGFA 300

RSQIVKVDKHLTFDEIIGQLFVFLLAGYDTTALSLSYSSYLLARHPEIQK 350

KLQEEVDRECPDPEVTFDQLSKLKYMECVIKETLRLYPLASIVHNRKCMK 400

STTVLGMKIEEGTNVQADTWTLHYDPKFWGENANEFKPERWESGDEQAVA 450

KGAYLPFGLGPRICIGMRLAYMEEKMLLAQILKKYSLETTFETHIPLKLV 500

GIATTAPTNVHLKLKPRHSD                               520

 

CYP13A2 add to blast file (missing)

 

>CYP13A2   T10B9.7    Z48717 495 aa

           Length = 496

 

 Score = 1453 (511.5 bits), Expect = 2.1e-154, P = 2.1e-154

 Identities = 286/298 (95%), Positives = 286/298 (95%)

 

Query:     1 MSLGFVLAVTFSIFLGILTYYLWIWTYWMRKGVKGPRGRPFVGVLDV-LLEHETPGLIKL 59

             MSLGFVLAVTFSIFLGILTYYLWIWTYWMRKGVKGPRGRPFVGVLDV LLEHETPGLIKL

Sbjct:     1 MSLGFVLAVTFSIFLGILTYYLWIWTYWMRKGVKGPRGRPFVGVLDVELLEHETPGLIKL 60

 

Query:    60 GEWTKKYGKVYGYTDGTQRTLVVADPAMVHEIFVKQFDNFYGRKLNPIQGNPEKEQRVHL 119

             GEWTKKYGKVYGYTDGTQRTLVVADPAMVHEIFVKQFDNFYGRKLNPIQGNPEKEQRVHL

Sbjct:    61 GEWTKKYGKVYGYTDGTQRTLVVADPAMVHEIFVKQFDNFYGRKLNPIQGNPEKEQRVHL 120

 

Query:   120 LAAQGYRWKRLRTISSQSFSNASLKKMKRTVEDSALELLRHIEKQTAGGEQIDMLRFYQE 179

             LAAQGYRWKRLRTISSQSFSNASLKKMKRTVEDSALELLRHIEKQTAGGEQIDMLRFYQE

Sbjct:   121 LAAQGYRWKRLRTISSQSFSNASLKKMKRTVEDSALELLRHIEKQTAGGEQIDMLRFYQE 180

 

Query:   180 YTMDVIGRFAMGQTDSMMFKNPIVNVVREIFCGSRKNLMLICQVFPPIGQFIRDLTFKFP 239

             YTMDVIGRFAMGQTDSMMFKNPIVNVVREIFCGSRKNLMLICQVFPPIGQFIRDLTFKF

Sbjct:   181 YTMDVIGRFAMGQTDSMMFKNPIVNVVREIFCGSRKNLMLICQVFPPIGQFIRDLTFKF- 239

 

Query:   240 RIPAFKLYSIMQDVVAARIAQREREKGAESGEPQDFIDLFLDARSDDVDFSAEAREDFS 298

                        QDVVAARIAQREREKGAESGEPQDFIDLFLDARSDDVDFSAEAREDFS

Sbjct:   240 -----------QDVVAARIAQREREKGAESGEPQDFIDLFLDARSDDVDFSAEAREDFS 287

 

 Score = 1307 (460.1 bits), Expect = 6.3e-139, P = 6.3e-139

 Identities = 261/278 (93%), Positives = 261/278 (93%)

 

Query:   238 FPRIPAF--KLYSIMQDVVAARIAQREREKGAESGEPQDFIDLFLDARSDDVDFSAEARE 295

             FP I  F   L    QDVVAARIAQREREKGAESGEPQDFIDLFLDARSDDVDFSAEARE

Sbjct:   225 FPPIGQFIRDLTFKFQDVVAARIAQREREKGAESGEPQDFIDLFLDARSDDVDFSAEARE 284

 

Query:   296 DFSKRNLKITKELSADEVVGQCFLFLIGGFDTTALSLSYVTYLLAVNPKIQEKVIEEIAR 355

             DFS         LSADEVVGQCFLFLIGGFDTTALSLSYVTYLLAVNPKIQEKVIEEIAR

Sbjct:   285 DFS---------LSADEVVGQCFLFLIGGFDTTALSLSYVTYLLAVNPKIQEKVIEEIAR 335

 

Query:   356 EFGTSEVEFEKLGRLKYMDCVIKEALRLYPLASISNSRKCMKTTTVNGVKIEAGVYVQMD 415

             EFGTSEVEFEKLGRLKYMDCVIKEALRLYPLASISNSRKCMKTTTVNGVKIEAGVYVQMD

Sbjct:   336 EFGTSEVEFEKLGRLKYMDCVIKEALRLYPLASISNSRKCMKTTTVNGVKIEAGVYVQMD 395

 

Query:   416 TWSLHYDPELWGEDVKEFKPERWSTDEPLEHKGAYLPFGLGPRQCIGMRLAIMEQKILLT 475

             TWSLHYDPELWGEDVKEFKPERWSTDEPLEHKGAYLPFGLGPRQCIGMRLAIMEQKILLT

Sbjct:   396 TWSLHYDPELWGEDVKEFKPERWSTDEPLEHKGAYLPFGLGPRQCIGMRLAIMEQKILLT 455

 

Query:   476 HLLKNYTFETGNKTRIPLKLVGSATTSPEDVFVHLRPRIW 515

             HLLKNYTFETGNKTRIPLKLVGSATTSPEDVFVHLRPRIW

Sbjct:   456 HLLKNYTFETGNKTRIPLKLVGSATTSPEDVFVHLRPRIW 495

 

T10B9.7; CE01659.

MSLGFVLAVTFSIFLGILTYYLWIWTYWMRKGVKGPRGRPFVGVLDV  LLE  50

HETPGLIKLGEWTKKYGKVYGYTDGTQRTLVVADPAMVHEIFVKQFDNFY 100

GRKLNPIQGNPEKEQRVHLLAAQGYRWKRLRTISSQSFSNASLKKMKRTV 150

EDSALELLRHIEKQTAGGEQIDMLRFYQEYTMDVIGRFAMGQTDSMMFKN 200

PIVNVVREIFCGSRKNLMLICQVFPPIGQFIRDLTFKFPRIPAFKLYSIM 250

QDVVAARIAQREREKGAESGEPQDFIDLFLDARSDDVDFSAEAREDFSKR 300

NLKITKELSADEVVGQCFLFLIGGFDTTALSLSYVTYLLAVNPKIQEKVI 350

EEIAREFGTSEVEFEKLGRLKYMDCVIKEALRLYPLASISNSRKCMKTTT 400

VNGVKIEAGVYVQMDTWSLHYDPELWGEDVKEFKPERWSTDEPLEHKGAY 450

LPFGLGPRQCIGMRLAIMEQKILLTHLLKNYTFETGNKTRIPLKLVGSAT 500

TSPEDVFVHLRPRIW                                    515

 

CYP13A4

 

compare to briggsae

 

>CYP13A.f cb25.fpc0022 6 exons Minus Strand

          Length = 519

 

 Score = 2033 (715.7 bits), Expect = 9.9e-214, P = 9.9e-214

 Identities = 374/520 (71%), Positives = 437/520 (84%)

 

Query:     1 MSLSLLIAGALFIGFLTYYIWIWSFWIRKGVKGPRGFPFFGVILKFHDYENPGLLKLGEW 60

             MS SLL+   +FIGFL+YY+WIWS+WIRKGVKGPRG PF G++ +F DY+ P    + +W

Sbjct:     1 MSFSLLLISTIFIGFLSYYVWIWSYWIRKGVKGPRGIPFSGIMHRFQDYDYPFWRVMMDW 60

 

Query:    61 TKKYGSIYGITEGVEKTLVVSNPEFVHEVFVKQFDNFYGRKTNPIQGDPNKNKRAHLVLA 120

             TK+YG IYGITEGVEKTLVVS+PEFVHEVFV QFDNFYGRK NP+QGDPNKNKRAHLV 

Sbjct:    61 TKEYGPIYGITEGVEKTLVVSDPEFVHEVFVTQFDNFYGRKLNPLQGDPNKNKRAHLVST 120

 

Query:   121 QGHRWKRLRTLASPTFSNKSLRKIMSTVEETVVELMRHLDEASAKGKAVDLLDYYQEFTL 180

             QGHRWKRLRTL+SPTFSNK+L+KIM TV+++V ELMRHL++  A GK VD+L+YYQEFT+

Sbjct:   121 QGHRWKRLRTLSSPTFSNKNLKKIMKTVDDSVAELMRHLEKGCAGGKTVDMLEYYQEFTI 180

 

Query:   181 DIIGRIAMGQTESLMFRNPMLPKVKEIFKKGGKMPFLIAGVFPIAGTLMRQLFMKFPKFS 240

             DIIG+IAMGQT+SLMF NPMLPKVK IF  G K  F  + +FP  G ++R     FP  

Sbjct:   181 DIIGKIAMGQTQSLMFENPMLPKVKSIFVDGRKHAFFASAMFPFLGAMLR--LNVFPSIQ 238

 

Query:   241 PAFGIMNTMEKALNKRLEQRAADKKAGIEPSGEPQDFIDLFLDARANVDFIEEESTLGFA 300

             PA  IMN +EKA+  R++QR  D+K GIE SGEPQDFIDLFLDARA+VDF E+E+   F

Sbjct:   239 PASDIMNIIEKAVKARIDQRKTDEKLGIEVSGEPQDFIDLFLDARADVDFFEQEAKDDFG 298

 

Query:   301 KSEVLKVDKHLTFDEIIGQLFVFLLAGYDTTALSLSYSSYLLATHPEIQKKLQEEVDREC 360

             K+++ KVDKHLTFDE +GQLFVFLLAGYDTTALSLSYSS+LLATHPE+Q+K QEEVDREC

Sbjct:   299 KTQITKVDKHLTFDETVGQLFVFLLAGYDTTALSLSYSSFLLATHPEVQRKCQEEVDREC 358

 

Query:   361 PDPEVTFDQISKLKYMECVVKEALRMYPLASLVHNRKCMKKTNVLGVEIDEGTNVQVDTW 420

              DPEVTFD +SKLKYM+ V+KEALR YPLAS+VHNRKCMK T V GVEI+EGTNVQVDTW

Sbjct:   359 SDPEVTFDHLSKLKYMDWVIKEALRFYPLASIVHNRKCMKATTVCGVEIEEGTNVQVDTW 418

 

Query:   421 TLHYDPKVWGDDASEFKPERWETGDELFYAKGGYLPFGMGPRICIGMRLAMMEEKLLLTH 480

             T+H DP +WGDD +EFKPERWE+GDE F  KGGYLPFGMGPRICIGMRLA+MEEK+LLT

Sbjct:   419 TMHKDPNIWGDDVNEFKPERWESGDEQFIQKGGYLPFGMGPRICIGMRLALMEEKMLLTQ 478

 

Query:   481 ILKKYTFDTSTETEIPLKLVGSATIAPRNVMLKLTPRHSN 520

             ILKKYTF+T++ET IPLKLVG +T AP +V LKL PRHS+

Sbjct:   479 ILKKYTFETTSETVIPLKLVGRSTTAPSSVNLKLRPRHSD 518

 

CYP13A1 add to blast file (missing)

 

>CYP13A1   T10B9.8   Z48717 498 aa

           Length = 499

 

 Score = 1483 (522.0 bits), Expect = 1.4e-157, P = 1.4e-157

 Identities = 296/334 (88%), Positives = 299/334 (89%)

 

Query:     1 MGYFWFPWFSAIFVAVFSYYIWQWTFWRRRGVVGPMGFPVLGVFLNSLDNNFPFPLQCRE 60

             MGYFWFPWFSAIFVAVFSYYIWQWTFWRRRGVVGPMGFPVLGVFLNSLDNN PFPLQCRE

Sbjct:     1 MGYFWFPWFSAIFVAVFSYYIWQWTFWRRRGVVGPMGFPVLGVFLNSLDNN-PFPLQCRE 59

 

Query:    61 WTKKFGKIYGFTEGTLKTLVISDPELVHEVFVTQYDNFYGRKRNPIQGDSEKEKRTNLFA 120

             WTKKFGKIYGFTEGTLKTLVISDPELVHEVFVTQYDNFYGRKRNPIQGDSEKEKRTNLFA

Sbjct:    60 WTKKFGKIYGFTEGTLKTLVISDPELVHEVFVTQYDNFYGRKRNPIQGDSEKEKRTNLFA 119

 

Query:   121 AQGFRWKRLRAISSPTFSNSSLRKLYQTVEDSALELLRHIEKQSAGGKQIDMLKFYQEFT 180

             AQGFRWKRLRAISSPTFSNSSLRKLYQTVEDSALELLRHIEKQSAGGKQIDMLKFYQEFT

Sbjct:   120 AQGFRWKRLRAISSPTFSNSSLRKLYQTVEDSALELLRHIEKQSAGGKQIDMLKFYQEFT 179

 

Query:   181 LDVIGRIAMGQTDSQMFKNP--IMPIVSKLFQGNFAKLFLIGGIFPTFLVEIIRQILLKN 238

             LDVIGRIAMGQTDSQMFKNP  IMPIVSKLFQGNFAKLFLIGGIFPTFLVEIIRQILLKN

Sbjct:   180 LDVIGRIAMGQTDSQMFKNPNPIMPIVSKLFQGNFAKLFLIGGIFPTFLVEIIRQILLKN 239

 

Query:   239 LKVGSFRKINEITLDAIHNRIKQREEDQKNGIEIGEPADFIDLFLDAKAEDVEHFGENNG 298

             L             DAIHNRIKQREEDQKNGIEIGEPADFIDLFLDAKAEDVEHFGENNG

Sbjct:   240 L-------------DAIHNRIKQREEDQKNGIEIGEPADFIDLFLDAKAEDVEHFGENNG 286

 

Query:   299 DFSKSTTYTNRQLTTEEIVGQCTVFLIAGFDTTALS 334

             DFS +T     Q T   I G  T  L   + T  L+

Sbjct:   287 DFSLTTEEIVGQCTVFLIAGFDTTALSLSYATYLLA 322

 

 Score = 1354 (476.6 bits), Expect = 6.6e-144, P = 6.6e-144

 Identities = 269/306 (87%), Positives = 273/306 (89%)

 

Query:   214 AKLFLIGGIFPTFLVEIIRQILLKNLKVGSFRKINEITLDAIHNRIKQREEDQKNGIEIG 273

             +KLF  G     FL+  I    L  +     R+I    LDAIHNRIKQREEDQKNGIEIG

Sbjct:   207 SKLFQ-GNFAKLFLIGGIFPTFLVEI----IRQILLKNLDAIHNRIKQREEDQKNGIEIG 261

 

Query:   274 EPADFIDLFLDAKAEDVEHFGENNGDFSKSTTYTNRQLTTEEIVGQCTVFLIAGFDTTAL 333

             EPADFIDLFLDAKAEDVEHFGENNGDFS         LTTEEIVGQCTVFLIAGFDTTAL

Sbjct:   262 EPADFIDLFLDAKAEDVEHFGENNGDFS---------LTTEEIVGQCTVFLIAGFDTTAL 312

 

Query:   334 SLSYATYLLATHPEIQKKLQEEVNRECPNPEVTIDQLSKLKYMECVFKEALRLYPLGAFA 393

             SLSYATYLLATHPEIQKKLQEEVNRECPNPEVTIDQLSKLKYMECVFKEALRLYPLGAFA

Sbjct:   313 SLSYATYLLATHPEIQKKLQEEVNRECPNPEVTIDQLSKLKYMECVFKEALRLYPLGAFA 372

 

Query:   394 NSRRCMRNTKLGNMKVEVGTMIQVDTWTLHTDPNIWGDDAEDFKPERWQTPNSDQIYQKS 453

             NSRRCMRNTKLGNMKVEVGTMIQVDTWTLHTDPNIWGDDAEDFKPERWQTPNSDQIYQKS

Sbjct:   373 NSRRCMRNTKLGNMKVEVGTMIQVDTWTLHTDPNIWGDDAEDFKPERWQTPNSDQIYQKS 432

 

Query:   454 GYIPFGLGPRQCIGMRLAYMEEKILLVHILRKFTFETGAKTEIPLKLIGRATTQPESVWM 513

             GYIPFGLGPRQCIGMRLAYMEEKILLVHILRKFTFETGAKTEIPLKLIGRATTQPESVWM

Sbjct:   433 GYIPFGLGPRQCIGMRLAYMEEKILLVHILRKFTFETGAKTEIPLKLIGRATTQPESVWM 492

 

Query:   514 HLNPRN 519

             HLNPRN

Sbjct:   493 HLNPRN 498

 

T10B9.8; CE01660.

MGYFWFPWFSAIFVAVFSYYIWQWTFWRRRGVVGPMGFPVLGVFLNSLDN  50

NFPFPLQCREWTKKFGKIYGFTEGTLKTLVISDPELVHEVFVTQYDNFYG 100

RKRNPIQGDSEKEKRTNLFAAQGFRWKRLRAISSPTFSNSSLRKLYQTVE 150

DSALELLRHIEKQSAGGKQIDMLKFYQEFTLDVIGRIAMGQTDSQMFKNP (xx) 200

IMPIVSKLFQGNFAKLFLIGGIFPTFLVEIIRQILLKNLKVGSFRKINEI 250

TLDAIHNRIKQREEDQKNGIEIGEPADFIDLFLDAKAEDVEHFGENNGDF 300

SKSTTYTNRQLTTEEIVGQCTVFLIAGFDTTALSLSYATYLLATHPEIQK 350

KLQEEVNRECPNPEVTIDQLSKLKYMECVFKEALRLYPLGAFANSRRCMR 400

NTKLGNMKVEVGTMIQVDTWTLHTDPNIWGDDAEDFKPERWQTPNSDQIY 450

QKSGYIPFGLGPRQCIGMRLAYMEEKILLVHILRKFTFETGAKTEIPLKL 500

IGRATTQPESVWMHLNPRN                                519

 

CYP13A7 add to blast file (missing)

 

>CYP13A7   T10B9.10    Z48717 494 aa

           Length = 495

 

 Score = 1449 (510.1 bits), Expect = 5.7e-154, P = 5.7e-154

 Identities = 286/300 (95%), Positives = 286/300 (95%)

 

Query:     1 MSFSILIAIAIFVGIISYYLWIWSFWIRKGVKGPRGLPFLGVIHKFTNYENPGALKFSEW 60

             MSFSILIAIAIFVGIISYYLWIWSFWIRKGVKGPRGLPFLGVIHKFTNYENPGALKFSEW

Sbjct:     1 MSFSILIAIAIFVGIISYYLWIWSFWIRKGVKGPRGLPFLGVIHKFTNYENPGALKFSEW 60

 

Query:    61 TKKYGPVYGITEGVEKTLVISDPEFVHEVFVKQFDNFYGRKLTAIQGDPNKNKRVPLVAA 120

             TKKYGPVYGITEGVEKTLVISDPEFVHEVFVKQFDNFYGRKLTAIQGDPNKNKRVPLVAA

Sbjct:    61 TKKYGPVYGITEGVEKTLVISDPEFVHEVFVKQFDNFYGRKLTAIQGDPNKNKRVPLVAA 120

 

Query:   121 QGHRWKRLRTLASPTFSNKSLRKIMGTVEESVTELVRSLEKASAEGKTLDMLEYYQEFTM 180

             QGHRWKRLRTLASPTFSNKSLRKIMGTVEESVTELVRSLEKASAEGKTLDMLEYYQEFTM

Sbjct:   121 QGHRWKRLRTLASPTFSNKSLRKIMGTVEESVTELVRSLEKASAEGKTLDMLEYYQEFTM 180

 

Query:   181 DIIGKMAMGQEKSLMFRNPMLDKVKTIFKEGRNNVFMISGIFPFVGIALRNIFAKFPSLQ 240

             DIIGKMAMGQEKSLMFRNPMLDKVKTIFKEGRNNVFMISGIFPFVGIALRNIFAKFP  

Sbjct:   181 DIIGKMAMGQEKSLMFRNPMLDKVKTIFKEGRNNVFMISGIFPFVGIALRNIFAKFP--- 237

 

Query:   241 MATDIQSILEKALNKRLEQREADEKAGIEPSGEPQDFIDLFLDARSTVDFFEGEAEQDFA 300

                        ALNKRLEQREADEKAGIEPSGEPQDFIDLFLDARSTVDFFEGEAEQDFA

Sbjct:   238 -----------ALNKRLEQREADEKAGIEPSGEPQDFIDLFLDARSTVDFFEGEAEQDFA 286

 

 Score = 1314 (462.6 bits), Expect = 1.1e-139, P = 1.1e-139

 Identities = 262/283 (92%), Positives = 267/283 (94%)

 

Query:   236 FPSLQMATDIQSILEK--ALNKRLEQREADEKAGIEPSGEPQDFIDLFLDARSTVDFFEG 293

             FP + +A  +++I  K  ALNKRLEQREADEKAGIEPSGEPQDFIDLFLDARSTVDFFEG

Sbjct:   222 FPFVGIA--LRNIFAKFPALNKRLEQREADEKAGIEPSGEPQDFIDLFLDARSTVDFFEG 279

 

Query:   294 EAEQDFAKSEVLKVDKHLTFDEIIGQLFVFLLAGYDTTALSLSYSSYLLATHPEIQKKLQ 353

             EAEQDFA          LTFDEIIGQLFVFLLAGYDTTALSLSYSSYLLATHPEIQKKLQ

Sbjct:   280 EAEQDFA----------LTFDEIIGQLFVFLLAGYDTTALSLSYSSYLLATHPEIQKKLQ 329

 

Query:   354 EEVDRECPDPEVTFDQLSKLKYLECVVKEALRLYPLASLVHNRKCLKTTNVLGMEIEAGT 413

             EEVDRECPDPEVTFDQLSKLKYLECVVKEALRLYPLASLVHNRKCLKTTNVLGMEIEAGT

Sbjct:   330 EEVDRECPDPEVTFDQLSKLKYLECVVKEALRLYPLASLVHNRKCLKTTNVLGMEIEAGT 389

 

Query:   414 NINVDTWSLHHDPKVWGDDVNEFKPERWESGDELFFAKGGYLPFGMGPRICIGMRLAMME 473

             NINVDTWSLHHDPKVWGDDVNEFKPERWESGDELFFAKGGYLPFGMGPRICIGMRLAMME

Sbjct:   390 NINVDTWSLHHDPKVWGDDVNEFKPERWESGDELFFAKGGYLPFGMGPRICIGMRLAMME 449

 

Query:   474 MKMLLTNILKNYTFETTPETVIPLKLVGTATIAPSSVLLKLKSRF 518

             MKMLLTNILKNYTFETTPETVIPLKLVGTATIAPSSVLLKLKSRF

Sbjct:   450 MKMLLTNILKNYTFETTPETVIPLKLVGTATIAPSSVLLKLKSRF 494

 

T10B9.10; CE01655.

MSFSILIAIAIFVGIISYYLWIWSFWIRKGVKGPRGLPFLGVIHKFTNYE  50

NPGALKFSEWTKKYGPVYGITEGVEKTLVISDPEFVHEVFVKQFDNFYGR 100

KLTAIQGDPNKNKRVPLVAAQGHRWKRLRTLASPTFSNKSLRKIMGTVEE 150

SVTELVRSLEKASAEGKTLDMLEYYQEFTMDIIGKMAMGQEKSLMFRNPM 200

LDKVKTIFKEGRNNVFMISGIFPFVGIALRNIFAKFPSLQMATDIQSILE 250

KALNKRLEQREADEKAGIEPSGEPQDFIDLFLDARSTVDFFEGEAEQDFA 300

KSEVLKVDKHLTFDEIIGQLFVFLLAGYDTTALSLSYSSYLLATHPEIQK 350

KLQEEVDRECPDPEVTFDQLSKLKYLECVVKEALRLYPLASLVHNRKCLK 400

TTNVLGMEIEAGTNINVDTWSLHHDPKVWGDDVNEFKPERWESGDELFFA 450

KGGYLPFGMGPRICIGMRLAMMEMKMLLTNILKNYTFETTPETVIPLKLV 500

GTATIAPSSVLLKLKSRF                                 518

 

CYP22A1 N-terms differ in forms a and b

 

>CYP22A1 T13C5.1 U39648 17939-21555 572 aa first 85aa may not be part of this

         gene

         Length = 573

 

 Score = 2862 (1007.5 bits), Expect = 1.4e-301, P = 1.4e-301

 Identities = 543/552 (98%), Positives = 543/552 (98%)

 

Query:     7 RFSSLSCERA-SIWCNMNISQSSWIKCRDWMAFALSHHIIMGIYLLILRNFLPQVVPDFE 65

             R SSL    A SI   MNISQSSWIKCRDWMAFALSHHIIMGIYLLILRNFLPQVVPDFE

Sbjct:    21 RMSSLVFFSANSIQVQMNISQSSWIKCRDWMAFALSHHIIMGIYLLILRNFLPQVVPDFE 80

 

Query:    66 WQHYFMRVFIVHIIYIIISYFIRITRYPPGPPPMAVFGNSPFVNILTPEQTFLEYREIYG 125

             WQHYFMRVFIVHIIYIIISYFIRITRYPPGPPPMAVFGNSPFVNILTPEQTFLEYREIYG

Sbjct:    81 WQHYFMRVFIVHIIYIIISYFIRITRYPPGPPPMAVFGNSPFVNILTPEQTFLEYREIYG 140

 

Query:   126 PIFTLHLSQPTIILAEYKTIQEALVKNGQQTSGRSSAESFVLFTGDRLNGDGVILAMRQK 185

             PIFTLHLSQPTIILAEYKTIQEALVKNGQQTSGRSSAESFVLFTGDRLNGDGVILAMRQK

Sbjct:   141 PIFTLHLSQPTIILAEYKTIQEALVKNGQQTSGRSSAESFVLFTGDRLNGDGVILAMRQK 200

 

Query:   186 WKDMRHEISRFMNKWYGAPMDELVLHHTRCLEQELAKIAETKSLIDLRDPLAGAIANVIQ 245

             WKDMRHEISRFMNKWYGAPMDELVLHHTRCLEQELAKIAETKSLIDLRDPLAGAIANVIQ

Sbjct:   201 WKDMRHEISRFMNKWYGAPMDELVLHHTRCLEQELAKIAETKSLIDLRDPLAGAIANVIQ 260

 

Query:   246 QITIGRNYMYQDQEFQTQLRDINAVVKEIMTAEVFFVNCYPWLRYLPEGILRKWTNYKRS 305

             QITIGRNYMYQDQEFQTQLRDINAVVKEIMTAEVFFVNCYPWLRYLPEGILRKWTNYKRS

Sbjct:   261 QITIGRNYMYQDQEFQTQLRDINAVVKEIMTAEVFFVNCYPWLRYLPEGILRKWTNYKRS 320

 

Query:   306 GFRLQQWFRTILEEHHVNRHQGDFMSHMIDLQESKQEQFRDLSIILTCGDMWTGGMETTV 365

             GFRLQQWFRTILEEHHVNRHQGDFMSHMIDLQESKQEQFRDLSIILTCGDMWTGGMETTV

Sbjct:   321 GFRLQQWFRTILEEHHVNRHQGDFMSHMIDLQESKQEQFRDLSIILTCGDMWTGGMETTV 380

 

Query:   366 TTLRWGIIYLLNNPEVQAKCQMEILDVFGNDIPDMGKMNQTPYVRATLSEIQRLANVLPW 425

             TTLRWGIIYLLNNPEVQAKCQMEILDVFGNDIPDMGKMNQTPYVRATLSEIQRLANVLPW

Sbjct:   381 TTLRWGIIYLLNNPEVQAKCQMEILDVFGNDIPDMGKMNQTPYVRATLSEIQRLANVLPW 440

 

Query:   426 AIPHKTIEECNIGGYDIPVNTEIIPALGAVLFDPNVFESPKQFKPERFLDEEGKYRVMEE 485

             AIPHKTIEECNIGGYDIPVNTEIIPALGAVLFDPNVFESPKQFKPERFLDEEGKYRVMEE

Sbjct:   441 AIPHKTIEECNIGGYDIPVNTEIIPALGAVLFDPNVFESPKQFKPERFLDEEGKYRVMEE 500

 

Query:   486 FRPFGLGPRVCLGERIARTELYLIFASLLQNFRFYLNRGDPIPVAERVIGGITAPPKPYA 545

             FRPFGLGPRVCLGERIARTELYLIFASLLQNFRFYLNRGDPIPVAERVIGGITAPPKPYA

Sbjct:   501 FRPFGLGPRVCLGERIARTELYLIFASLLQNFRFYLNRGDPIPVAERVIGGITAPPKPYA 560

 

Query:   546 TRVEYLGNRLIN 557

             TRVEYLGNRLIN

Sbjct:   561 TRVEYLGNRLIN 572

 

T13C5.1b         CE30451      daf-9

this version of daf-9 (isoform A) does not match mine (alt splice?)

but the other version does (T13C5.1a         CE27206)

MPLVIGRFSSLSCERASIWCNMNISQSSWIKCRDWMAFALSHHIIMGIYL  50

LILRNFLPQVVPDFEWQHYFMRVFIVHIIYIIISYFIRITRYPPGPPPMA 100

VFGNSPFVNILTPEQTFLEYREIYGPIFTLHLSQPTIILAEYKTIQEALV 150

KNGQQTSGRSSAESFVLFTGDRLNGDGVILAMRQKWKDMRHEISRFMNKW 200

YGAPMDELVLHHTRCLEQELAKIAETKSLIDLRDPLAGAIANVIQQITIG 250

RNYMYQDQEFQTQLRDINAVVKEIMTAEVFFVNCYPWLRYLPEGILRKWT 300

NYKRSGFRLQQWFRTILEEHHVNRHQGDFMSHMIDLQESKQEQFRDLSII 350

LTCGDMWTGGMETTVTTLRWGIIYLLNNPEVQAKCQMEILDVFGNDIPDM 400

GKMNQTPYVRATLSEIQRLANVLPWAIPHKTIEECNIGGYDIPVNTEIIP 450

ALGAVLFDPNVFESPKQFKPERFLDEEGKYRVMEEFRPFGLGPRVCLGER 500

IARTELYLIFASLLQNFRFYLNRGDPIPVAERVIGGITAPPKPYATRVEY 550

LGNRLIN

 

cannot find similar N-term in C. briggsae

 

T13C5.1a; CE27206.

MHLENRVLSSVLDYASKFYKRMSSLVFFSANSIQVQMNISQSSWIKCRDW  50

MAFALSHHIIMGIYLLILRNFLPQVVPDFEWQHYFMRVFIVHIIYIIISY 100

FIRITRYPPGPPPMAVFGNSPFVNILTPEQTFLEYREIYGPIFTLHLSQP 150

TIILAEYKTIQEALVKNGQQTSGRSSAESFVLFTGDRLNGDGVILAMRQK 200

WKDMRHEISRFMNKWYGAPMDELVLHHTRCLEQELAKIAETKSLIDLRDP 250

LAGAIANVIQQITIGRNYMYQDQEFQTQLRDINAVVKEIMTAEVFFVNCY 300

PWLRYLPEGILRKWTNYKRSGFRLQQWFRTILEEHHVNRHQGDFMSHMID 350

LQESKQEQFRDLSIILTCGDMWTGGMETTVTTLRWGIIYLLNNPEVQAKC 400

QMEILDVFGNDIPDMGKMNQTPYVRATLSEIQRLANVLPWAIPHKTIEEC 450

NIGGYDIPVNTEIIPALGAVLFDPNVFESPKQFKPERFLDEEGKYRVMEE 500

FRPFGLGPRVCLGERIARTELYLIFASLLQNFRFYLNRGDPIPVAERVIG 550

GITAPPKPYATRVEYLGNRLIN

 

this form has a cDNA

>gi|31236156|dbj|AU111101.2|AU111101  UniGene info AU111101 unpublished oligo-capped cDNA library Caenorhabditis

           elegans cDNA clone yk723h3 5'.

          Length = 339

 

 Score =  206 bits (523), Expect = 1e-54

 Identities = 100/100 (100%), Positives = 100/100 (100%)

 Frame = +2

 

Query: 1   MHLENRVLSSVLDYASKFYKRMSSLVFFSANSIQVQMNISQSSWIKCRDWMAFALSHHII 60

           MHLENRVLSSVLDYASKFYKRMSSLVFFSANSIQVQMNISQSSWIKCRDWMAFALSHHII

Sbjct: 23  MHLENRVLSSVLDYASKFYKRMSSLVFFSANSIQVQMNISQSSWIKCRDWMAFALSHHII 202

 

Query: 61  MGIYLLILRNFLPQVVPDFEWQHYFMRVFIVHIIYIIISY 100

           MGIYLLILRNFLPQVVPDFEWQHYFMRVFIVHIIYIIISY

Sbjct: 203 MGIYLLILRNFLPQVVPDFEWQHYFMRVFIVHIIYIIISY 322

 

 

CYP33D3 two versions

 

Y17D7A.4 with extra exon remove extra seq

MLLLLIFSSIFLYFFYHFHWKRRNLPPGPRPLPFLGNLLSLKTLKPGYEA  50

FSNWKKEYGPIFTFWMANKPFVIIASYEKMKETFVKDGDTYVDKQLTHTE 100

KERLGENYGVLDTNGHMWKEHRRFTLTQLRDLGLGKDLMQEKADLGRNRL 150

KKAKKLHVLTSTVYSENILMEVEELFKELDAHGGEEIDLPKLIDRSVGNV 200

INLTLFNKRFDMDKRDEFAHLKSLIDGMRNVTSQFRYLIQYLVPWTSTVL 250

PGPTLSEKVRAKREELDDFFYSQIDEHRNEIDFDNTENLDFVEAYLKEQK 300

KREEDGDFKTFCNKQLCAMLFDLWIAGLMTTTMTMTWGLSYYLYNPEVQR 350

KIREELDKVIGNDRLISTADKNDLPYLQAFVTETQRTANIIPLNLIHMTT 400

RDTVIDGFPIQKGTGVIAQISTVMYDEKVFPEPYKFKPERFIENGKFKKV 450

DEVIPFSIGKRQCLGEGLARIELFLFFANIFNRYDVMPDFSGSLPDLDKS 500

KDNFVIPRKFKAVLKRRYMD                               520

 

>CYP33D3 C54E10 Y69H2 contigs 04354 and 04878 and Y80D3 contig 02849 488 aa

         Length = 489

 

Score = 773 (272.1 bits), Expect = 3.6e-263, Sum P(2) = 3.6e-263

 Identities = 144/157 (91%), Positives = 145/157 (92%)

 

Query:     1 MLLLLIFSSIFLYFFYHFHWKRRNLPPGPRPLPFLGNLLSLKTLKPGYEAFSNWKKEYGP 60

             MLLLLIFSSIFLYFFYHFHWKRRNLPPGPRPLPFLGNLLSLKTLKPGYEAFSNWKKEYGP

Sbjct:     1 MLLLLIFSSIFLYFFYHFHWKRRNLPPGPRPLPFLGNLLSLKTLKPGYEAFSNWKKEYGP 60

 

Query:    61 IFTFWMANKPFVIIASYEKMKETFVKDGDTYVDKQLTHTEKERLGENYGVLDTNGHMWKE 120

             IFTFWM NKPFVIIASYEKMKETFVKDGDTYVDKQLTHTEKERLGENYGVLDTNGHMWKE

Sbjct:    61 IFTFWMGNKPFVIIASYEKMKETFVKDGDTYVDKQLTHTEKERLGENYGVLDTNGHMWKE 120

 

Query:   121 HRRFTLTQLRDLGLGKDLMQEKADLGRNRLKKAKKLH 157

             HRRFTLTQLRDLGLGKDLMQEK  +    L K    H

Sbjct:   121 HRRFTLTQLRDLGLGKDLMQEKILMEVEELFKELDAH 157

 

 Score = 1750 (616.0 bits), Expect = 3.6e-263, Sum P(2) = 3.6e-263

 Identities = 342/362 (94%), Positives = 344/362 (95%)

 

Query:   159 LTSTVYSENILMEVEELFKELDAHGGEEIDLPKLIDRSVGNVINLTLFNKRFDMDKRDEF 218

             L   +  E ILMEVEELFKELDAHGGEEIDLPKLIDRSVGNVINLTLF       KRDEF

Sbjct:   134 LGKDLMQEKILMEVEELFKELDAHGGEEIDLPKLIDRSVGNVINLTLF-------KRDEF 186

 

Query:   219 AHLKSLIDGMRNVTSQFRYLIQYLVPWTSTVLPGPTLSEKVRAKREELDDFFYSQIDEHR 278

             AHLKSLIDGMRNVTSQ RYLIQYLVPWTSTVLPGPTLSEKVRAKREELDDFFYSQIDEHR

Sbjct:   187 AHLKSLIDGMRNVTSQSRYLIQYLVPWTSTVLPGPTLSEKVRAKREELDDFFYSQIDEHR 246

 

Query:   279 NEIDFDNTENLDFVEAYLKEQKKREEDGDFKTFCNKQLCAMLFDLWIAGLMTTTMTMTWG 338

             NEIDFDNTENLDFVEAYLKEQKKREEDGDF    NKQLCAMLFDLWIAGLMTTTMTMTWG

Sbjct:   247 NEIDFDNTENLDFVEAYLKEQKKREEDGDFSISSNKQLCAMLFDLWIAGLMTTTMTMTWG 306

 

Query:   339 LSYYLYNPEVQRKIREELDKVIGNDRLISTADKNDLPYLQAFVTETQRTANIIPLNLIHM 398

             LSYYLYNPEVQRKIREELDKVIGNDRLISTADKNDLPYLQAFVTETQRTANIIPLNLIHM

Sbjct:   307 LSYYLYNPEVQRKIREELDKVIGNDRLISTADKNDLPYLQAFVTETQRTANIIPLNLIHM 366

 

Query:   399 TTRDTVIDGFPIQKGTGVIAQISTVMYDEKVFPEPYKFKPERFIENGKFKKVDEVIPFSI 458

             TTRDTVIDGFPIQKGTGVIAQISTVMYDE+VFPEPYKFKPERFIENGKFKKVDEVIPFSI

Sbjct:   367 TTRDTVIDGFPIQKGTGVIAQISTVMYDEQVFPEPYKFKPERFIENGKFKKVDEVIPFSI 426

 

Query:   459 GKRQCLGEGLARIELFLFFANIFNRYDVMPDFSGSLPDLDKSKDNFVIPRKFKAVLKRRY 518

             GKRQCLGEGLARIELFLFFANIFNRYDVMPDFSGSLPDLDKSKDNFVIPRKFKAVLKRRY

Sbjct:   427 GKRQCLGEGLARIELFLFFANIFNRYDVMPDFSGSLPDLDKSKDNFVIPRKFKAVLKRRY 486

 

Query:   519 MD 520

             MD

Sbjct:   487 MD 488

 

the extra sequence seems to be an error in assembly.  see below for comparisons

 

>gi|17562203|ref|NM_074675.1|  LocusLink infoUniGene info Caenorhabditis elegans cytochrome 450 family member (5Q654), mRNA

          Length = 1479

 

 Score =  110 bits (274), Expect = 3e-25

 Identities = 56/100 (56%), Positives = 68/100 (68%)

 Frame = +1

 

Query: 1   KERLGENYGVLDTNGHMWKEHRRFTLTQLRDLGLGKDLMQEKADLGRNRLKKAKKLHVLT 60

           K++L  NYGVLDTNG MW+EHRRFTL+QLRDLGLGKDLMQEK                 

Sbjct: 295 KKKLQRNYGVLDTNGEMWREHRRFTLSQLRDLGLGKDLMQEK------------------ 420

 

Query: 61  STVYSENILMEVEELFKELDAHGGEEIDLPKLIDRSVGNV 100

                  IL+E+EE FK++++H GEEIDLP ++DR VGNV

Sbjct: 421 -------ILLEIEEQFKDINSHLGEEIDLPSVLDRGVGNV 519

 

 

 

 

>gi|1914398|emb|Z92804.1|CEK05D4  Download subject sequence spanning the HSP Caenorhabditis elegans cosmid K05D4, complete sequence

          Length = 19000

 

 Score =  101 bits (252), Expect = 1e-22

 Identities = 63/143 (44%), Positives = 81/143 (56%), Gaps = 43/143 (30%)

 Frame = -2

 

Query: 1    KERLGENYGVLDTNGHMWKEHRRFTLTQLRDLGLGKDLMQEK------------------ 42

            K++L  NYGVLDTNG MW+EHRRFTL+QLRDLGLGKDLMQEK                 

Sbjct: 4173 KKKLQRNYGVLDTNGEMWREHRRFTLSQLRDLGLGKDLMQEKVFWVGTCRPTL*LTRAVS 3994

 

Query: 43   ---ADL----------GRNRLKKAKKL--------HVLTSTVYSEN----ILMEVEELFK 77

               ADL           R      ++L        ++L +  +  N    IL+E+EE FK

Sbjct: 3993 LK*ADL*IFFGYEQNWFRGTFVNGRRLGGLGRL*KYLLPTPKFFLNFIFQILLEIEEQFK 3814

 

Query: 78   ELDAHGGEEIDLPKLIDRSVGNV 100

            ++++H GEEIDLP ++DR VGNV

Sbjct: 3813 DINSHLGEEIDLPSVLDRGVGNV 3745

 

 

Z81487; CAA16294.2,JOINED. also = CYP33D3

this is closer to my version

MLLLLIFSSIFLYFFYHFHWKRRNLPPGPRPLPFLGNLLSLKTLKPGYEA  50

FSNWKKEYGPIFTFWMANKPFVIIASYEKMKETFVKDGDTYVDKQLTHTE 100

KERLGENYGVLDTNGHMWKEHRRFTLTQLRDLGLGKDLMQEKILMEVEEL 150

FKELDAHGGEEIDLPKLIDRSVGNVINLTLFNKRFDMDKRDEFAHLKSLI 200

DGMRNVTSQFRYLIQYLVPWTSTVLPGPTLSEKVRAKREELDDFFYSQID 250

EHRNEIDFDNTENLDFVEAYLKEQKKREEDGDFKTFCNKQLCAMLFDLWI 300

AGLMTTTMTMTWGLSYYLYNPEVQRKIREELDKVIGNDRLISTADKNDLP 350

YLQAFVTETQRTANIIPLNLIHMTTRDTVIDGFPIQKGTGVIAQISTVMY 400

DEKVFPEPYKFKPERFIENGKFKKVDEVIPFSIGKRQCLGEGLARIELFL 450

FFANIFNRYDVMPDFSGSLPDLDKSKDNFVIPRKFKAVLKRRYMD      495

 

>CYP33D3 C54E10 Y69H2 contigs 04354 and 04878 and Y80D3 contig 02849 488 aa

         Length = 489

 

 Score = 2516 (885.7 bits), Expect = 6.5e-265, P = 6.5e-265

 Identities = 481/495 (97%), Positives = 482/495 (97%)

 

Query:     1 MLLLLIFSSIFLYFFYHFHWKRRNLPPGPRPLPFLGNLLSLKTLKPGYEAFSNWKKEYGP 60

             MLLLLIFSSIFLYFFYHFHWKRRNLPPGPRPLPFLGNLLSLKTLKPGYEAFSNWKKEYGP

Sbjct:     1 MLLLLIFSSIFLYFFYHFHWKRRNLPPGPRPLPFLGNLLSLKTLKPGYEAFSNWKKEYGP 60

 

Query:    61 IFTFWMANKPFVIIASYEKMKETFVKDGDTYVDKQLTHTEKERLGENYGVLDTNGHMWKE 120

             IFTFWM NKPFVIIASYEKMKETFVKDGDTYVDKQLTHTEKERLGENYGVLDTNGHMWKE

Sbjct:    61 IFTFWMGNKPFVIIASYEKMKETFVKDGDTYVDKQLTHTEKERLGENYGVLDTNGHMWKE 120

 

Query:   121 HRRFTLTQLRDLGLGKDLMQEKILMEVEELFKELDAHGGEEIDLPKLIDRSVGNVINLTL 180

             HRRFTLTQLRDLGLGKDLMQEKILMEVEELFKELDAHGGEEIDLPKLIDRSVGNVINLTL

Sbjct:   121 HRRFTLTQLRDLGLGKDLMQEKILMEVEELFKELDAHGGEEIDLPKLIDRSVGNVINLTL 180

 

Query:   181 FNKRFDMDKRDEFAHLKSLIDGMRNVTSQFRYLIQYLVPWTSTVLPGPTLSEKVRAKREE 240

             F       KRDEFAHLKSLIDGMRNVTSQ RYLIQYLVPWTSTVLPGPTLSEKVRAKREE

Sbjct:   181 F-------KRDEFAHLKSLIDGMRNVTSQSRYLIQYLVPWTSTVLPGPTLSEKVRAKREE 233

 

Query:   241 LDDFFYSQIDEHRNEIDFDNTENLDFVEAYLKEQKKREEDGDFKTFCNKQLCAMLFDLWI 300

             LDDFFYSQIDEHRNEIDFDNTENLDFVEAYLKEQKKREEDGDF    NKQLCAMLFDLWI

Sbjct:   234 LDDFFYSQIDEHRNEIDFDNTENLDFVEAYLKEQKKREEDGDFSISSNKQLCAMLFDLWI 293

 

Query:   301 AGLMTTTMTMTWGLSYYLYNPEVQRKIREELDKVIGNDRLISTADKNDLPYLQAFVTETQ 360

             AGLMTTTMTMTWGLSYYLYNPEVQRKIREELDKVIGNDRLISTADKNDLPYLQAFVTETQ

Sbjct:   294 AGLMTTTMTMTWGLSYYLYNPEVQRKIREELDKVIGNDRLISTADKNDLPYLQAFVTETQ 353

 

Query:   361 RTANIIPLNLIHMTTRDTVIDGFPIQKGTGVIAQISTVMYDEKVFPEPYKFKPERFIENG 420

             RTANIIPLNLIHMTTRDTVIDGFPIQKGTGVIAQISTVMYDE+VFPEPYKFKPERFIENG

Sbjct:   354 RTANIIPLNLIHMTTRDTVIDGFPIQKGTGVIAQISTVMYDEQVFPEPYKFKPERFIENG 413

 

Query:   421 KFKKVDEVIPFSIGKRQCLGEGLARIELFLFFANIFNRYDVMPDFSGSLPDLDKSKDNFV 480

             KFKKVDEVIPFSIGKRQCLGEGLARIELFLFFANIFNRYDVMPDFSGSLPDLDKSKDNFV

Sbjct:   414 KFKKVDEVIPFSIGKRQCLGEGLARIELFLFFANIFNRYDVMPDFSGSLPDLDKSKDNFV 473

 

Query:   481 IPRKFKAVLKRRYMD 495

             IPRKFKAVLKRRYMD

Sbjct:   474 IPRKFKAVLKRRYMD 488

 

compare to cDNA

>EMBOSS_001_1 CB401028

FLYFFYHFHWKRRNLPPGPRPLPFLGNLLSLKTLKPGYEAFSNWKKEYGPIFTFWMANKP

FVIIASYEKMKETFVKDGDTYVDKQLTHTEKERLGENYGVLDTNGHMWKEHKRFTLTQLR

DLGLGKDFMQEKILMEVEKLFKELDAHGGKKIDLPKLIHRSGGNVINL

 

>gi|30739054|gb|CB397327.1|CB397327  UniGene info OSTR186F2_2 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 552

 

 Score =  303 bits (775), Expect = 5e-83

 Identities = 149/150 (99%), Positives = 150/150 (100%)

 Frame = -1

 

Query: 184 GLMTTTMTMTWGLSYYLYNPEVQRKIREELDKVIGNDRLISTADKNDLPYLQAFVTETQR 243

           GLMTTTMTMTWGLSYYLYNPEVQRKIREELDKVIGNDRLISTADKNDLPYLQAFVTETQR

Sbjct: 552 GLMTTTMTMTWGLSYYLYNPEVQRKIREELDKVIGNDRLISTADKNDLPYLQAFVTETQR 373

 

Query: 244 TANIIPLNLIHMTTRDTVIDGFPIQKGTGVIAQISTVMYDEQVFPEPYKFKPERFIENGK 303

           TANIIPLNLIHMTTRDTVIDGFPIQKGTGVIAQISTVMYDE+VFPEPYKFKPERFIENGK

Sbjct: 372 TANIIPLNLIHMTTRDTVIDGFPIQKGTGVIAQISTVMYDEKVFPEPYKFKPERFIENGK 193

 

Query: 304 FKKVDEVIPFSIGKRQCLGEGLARIELFLF 333

           FKKVDEVIPFSIGKRQCLGEGLARIELFLF

Sbjct: 192 FKKVDEVIPFSIGKRQCLGEGLARIELFLF 103

 

>gi|3873427|emb|AL021470.1|CEY17D7A  Download subject sequence spanning the HSP Caenorhabditis elegans YAC Y17D7A, complete sequence

          Length = 21494

 

 Score =  193 bits (491), Expect = 8e-50

 Identities = 94/95 (98%), Positives = 94/95 (98%)

 Frame = -2

 

Query: 71  KRDEFAHLKSLIDGMRNVTSQSRYLIQYLVPWTSTVLPGPTLSEKVRAKREELDDFFYSQ 130

           KRDEFAHLKSLIDGMRNVTSQ RYLIQYLVPWTSTVLPGPTLSEKVRAKREELDDFFYSQ

Sbjct: 637 KRDEFAHLKSLIDGMRNVTSQFRYLIQYLVPWTSTVLPGPTLSEKVRAKREELDDFFYSQ 458

 

Query: 131 IDEHRNEIDFDNTENLDFVEAYLKEQKKREEDGDF 165

           IDEHRNEIDFDNTENLDFVEAYLKEQKKREEDGDF

Sbjct: 457 IDEHRNEIDFDNTENLDFVEAYLKEQKKREEDGDF 353

 

 

 

 

 Score = 80.5 bits (197), Expect = 9e-16

 Identities = 39/42 (92%), Positives = 41/42 (97%)

 Frame = -1

 

Query: 31   KILMEVEELFKELDAHGGEEIDLPKLIDRSVGNVINLTLFKR 72

            +ILMEVEELFKELDAHGGEEIDLPKLIDRSVGNVINLTLF +

Sbjct: 1406 QILMEVEELFKELDAHGGEEIDLPKLIDRSVGNVINLTLFNK 1281

 

 

 

 

 Score = 72.0 bits (175), Expect = 3e-13

 Identities = 31/33 (93%), Positives = 33/33 (100%)

 Frame = -1

 

Query: 1    DTNGHMWKEHRRFTLTQLRDLGLGKDLMQEKIL 33

            DTNGHMWKEHRRFTLTQLRDLGLGKDLMQEK++

Sbjct: 2453 DTNGHMWKEHRRFTLTQLRDLGLGKDLMQEKVI 2355

 

CYP31A3

 

>CYP31A3 T16C6 486 aa this sequence revised according to Y17G9.contig 61

         Length = 488

 

Score = 1164 (409.7 bits), Expect = 1.2e-121, P = 1.2e-121

 Identities = 242/332 (72%), Positives = 263/332 (79%)

 

Query:     1 MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60

             MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI

Sbjct:     1 MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60

 

Query:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

             GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS

Sbjct:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

 

Query:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180

             QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD

Sbjct:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180

 

Query:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNS---------FIYNLYD 231

             IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL+             +++  

Sbjct:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK 240

 

Query:   232 SFIIKKVNSILF--FRTEDGRTHEKCLRILHDFTKKVIVERKEALQEND-YKMEGR-LAF 287

               I+++  ++    ++ E GR     L +L +  K   ++  +   E D +  EG    

Sbjct:   241 KVIVERKEALQENDYKME-GRL--AFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTS 297

 

Query:   288 LDLLLEMVKSGQMDETD--VQAEVDTFMFEGHDTT 320

               L+  +   G   E    VQAE+D  M +  D T

Sbjct:   298 TGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVT 332

 

 

 Score = 1394 (490.7 bits), Expect = 5.1e-146, P = 5.1e-146

 Identities = 274/295 (92%), Positives = 277/295 (93%)

 

Query:   217 NNPLMWNSFIYNLYDSFIIKKVNSILFFRTEDGRTHEKCLRILHDFTKKVIVERKEALQE 276

             NN  +W     N     I K+ N+ L   TEDGRTHEKCLRILHDFTKKVIVERKEALQE

Sbjct:   198 NNEYVWAVHTIN---KLISKRTNNPLI--TEDGRTHEKCLRILHDFTKKVIVERKEALQE 252

 

Query:   277 NDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPE 336

             NDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPE

Sbjct:   253 NDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPE 312

 

Query:   337 VQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGV 396

             VQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGV

Sbjct:   313 VQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGV 372

 

Query:   397 NIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSRNCIGQR 456

             NIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSRNCIGQR

Sbjct:   373 NIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSRNCIGQR 432

 

Query:   457 FALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHMKLTRRRPIVSP 511

             FALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHMKLTRRRPIVSP

Sbjct:   433 FALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHMKLTRRRPIVSP 487

 

Y17G9B.3; CE24183. with extra seq

MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKP  50

DPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHL 100

NKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNE 150

QSKILVQKMCSLGAEEEVDVLSVITLCTLDIICETSMGKAIGAQLAENNE 200

YVWAVHTINKLISKRTNNPLMWNSFIYNLYDSFIIKKVNSILFFRTEDGR 250

THEKCLRILHDFTKKVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQM 300

DETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMG 350

DDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGVNIPK 400

GVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSR 450

NCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHM 500

KLTRRRPIVSP                                        511

 

compare to a related EST

>gi|18246951|dbj|BJ104281.1|BJ104281  UniGene info BJ104281 unpublished oligo-capped cDNA library, C. elegans L1 stage

           Caenorhabditis elegans cDNA clone yk1059h10 5'.

          Length = 601

 

 Score =  164 bits (415), Expect = 4e-42

 Identities = 84/100 (84%), Positives = 86/100 (86%), Gaps = 8/100 (8%)

 Frame = +1

 

Query: 14  DVLSVITLCTLDIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL-------- 65

           DVLSVI LCTLDIICETSMG AIGAQLAENNEYVWAVHTINKLISKRTNNPL       

Sbjct: 1   DVLSVIALCTLDIICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNSFIYN 180

 

Query: 66  ITEDGRTHEKCLRILHDFTKKVIVERKEALQENDYKMEGR 105

           +TEDGRTHEKCL I HDFTKK+I ERK ALQENDYKMEGR

Sbjct: 181 LTEDGRTHEKCLHIFHDFTKKLIGERK*ALQENDYKMEGR 300

 

 

 

 

 

 

CYP29A3

 

>CYP29A3 Y108G3 contig 541 504 aa

         Length = 504

 

 Score = 2570 (904.7 bits), Expect = 1.2e-270, P = 1.2e-270

 Identities = 491/504 (97%), Positives = 493/504 (97%)

 

Query:     1 MSLILPCLLIILLLFIVSFWKIMKNILKYRKYDDLLPGPPAHPIFGNTKTFSNKTTE--E 58

             MSLILPCLLIILLLFIVSFWKIMKNILKYRKYDD LPGPPAHPIFGNTKTFSNKTTE  E

Sbjct:     1 MSLILPCLLIILLLFIVSFWKIMKNILKYRKYDDQLPGPPAHPIFGNTKTFSNKTTEISE 60

 

Query:    59 IFQALRDLFSEAVEKGQSLIRHRILGTFYVWPLDGKTVSKILESTTELDKGGPYEFFNDW 118

             IFQALRDLFSEAVEKGQSLIRHRILGTFYVWPLDGKTVSKILESTTELDKGGPYEFFNDW

Sbjct:    61 IFQALRDLFSEAVEKGQSLIRHRILGTFYVWPLDGKTVSKILESTTELDKGGPYEFFNDW 120

 

Query:   119 LGGGTLLEGYGERWRSHRKMLTPTFHFAKLEGYFEVFNTESRVVVDCLDKFAKSGETVDL 178

             LGGGTLLEGYGERWRSHRKMLTPTF FAK+ GY EV   ESRV +DCLDKFAKSGETVDL

Sbjct:   121 LGGGTLLEGYGERWRSHRKMLTPTFSFAKMGGYLEVSIPESRVSLDCLDKFAKSGETVDL 180

 

Query:   179 FPFFKRCTLDTICKTAMGAKVDAQLQNSHPYITAIEQALQLGVLYAMNPHHQIPAIYWAL 238

             FPFFKRCTLDTICKTAMGAKVDAQLQNSHPYITAIEQALQLGV YAMNPHHQIPAIYWAL

Sbjct:   181 FPFFKRCTLDTICKTAMGAKVDAQLQNSHPYITAIEQALQLGVSYAMNPHHQIPAIYWAL 240

 

Query:   239 GHQKKKDEYFNIMKTFTRNVIAERRTARESGEVEKETSKRNMNFLDILLSNEESSVLSPE 298

             GHQKKKDEYFNIMKTFTRNVIAERRTARESGEVEKETSKRNMNFLDILLSNEESSVLSPE

Sbjct:   241 GHQKKKDEYFNIMKTFTRNVIAERRTARESGEVEKETSKRNMNFLDILLSNEESSVLSPE 300

 

Query:   299 DLRQEVDTFMFAGHDTTTTSVSWVCWNLAHHPDIQQNVYEEIVSVFGEDPNEDVTTEGIK 358

             DLRQEVDTFMFAGHDTTTTSVSWVCWNLAHHPDIQQNVYEEIVSVFGEDPNEDVTTEGIK

Sbjct:   301 DLRQEVDTFMFAGHDTTTTSVSWVCWNLAHHPDIQQNVYEEIVSVFGEDPNEDVTTEGIK 360

 

Query:   359 KLEYTERMLKESKRICPTVPAVLRQLISDMEIGGVLIPAGANVAIAPMAIHKNANIYQNP 418

             KLEYTERMLKESKRICPTVPAVLRQLISDMEIGGVLIPAGANVAIAPMAIHKNANIYQNP

Sbjct:   361 KLEYTERMLKESKRICPTVPAVLRQLISDMEIGGVLIPAGANVAIAPMAIHKNANIYQNP 420

 

Query:   419 DIFDPDRFLPEETAKRHAYDFIPFSAGLRNCIGQKFAQLNEKVMVIHLLKNFKIEPMGGY 478

             DIFDPDRFLPEETAKRHAYDFIPFSAGLRNCIGQKFAQLNEKVMVIHLLKNFKIEPMGGY

Sbjct:   421 DIFDPDRFLPEETAKRHAYDFIPFSAGLRNCIGQKFAQLNEKVMVIHLLKNFKIEPMGGY 480

 

Query:   479 YSTKQVFEPVGKPSNGIPVRLVRR 502

             YSTKQVFEPVGKPSNGIPVRLVRR

Sbjct:   481 YSTKQVFEPVGKPSNGIPVRLVRR 504

 

Y38C9B.1; CE35365.

MSLILPCLLIILLLFIVSFWKIMKNILKYRKYDDLLPGPPAHPIFGNTKT  50

FSNKTTEEIFQALRDLFSEAVEKGQSLIRHRILGTFYVWPLDGKTVSKIL 100

ESTTELDKGGPYEFFNDWLGGGTLLEGYGERWRSHRKMLTPTFHFAKLEG 150

YFEVFNTESRVVVDCLDKFAKSGETVDLFPFFKRCTLDTICKTAMGAKVD 200

AQLQNSHPYITAIEQALQLGVLYAMNPHHQIPAIYWALGHQKKKDEYFNI 250

MKTFTRNVIAERRTARESGEVEKETSKRNMNFLDILLSNEESSVLSPEDL 300

RQEVDTFMFAGHDTTTTSVSWVCWNLAHHPDIQQNVYEEIVSVFGEDPNE 350

DVTTEGIKKLEYTERMLKESKRICPTVPAVLRQLISDMEIGGVLIPAGAN 400

VAIAPMAIHKNANIYQNPDIFDPDRFLPEETAKRHAYDFIPFSAGLRNCI 450

GQKFAQLNEKVMVIHLLKNFKIEPMGGYYSTKQVFEPVGKPSNGIPVRLV 500

RRL                                                503

 

cDNA supports seq

 

>gi|30742793|gb|CB401066.1|CB401066  UniGene info OSTF187E12_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 563

 

 Score =  130 bits (326), Expect = 7e-32

 Identities = 63/63 (100%), Positives = 63/63 (100%)

 Frame = +1

 

Query: 8  LLIILLLFIVSFWKIMKNILKYRKYDDLLPGPPAHPIFGNTKTFSNKTTEEIFQALRDLF 67

          LLIILLLFIVSFWKIMKNILKYRKYDDLLPGPPAHPIFGNTKTFSNKTTEEIFQALRDLF

Sbjct: 1  LLIILLLFIVSFWKIMKNILKYRKYDDLLPGPPAHPIFGNTKTFSNKTTEEIFQALRDLF 180

 

 

>gi|30742793|gb|CB401066.1|CB401066  UniGene info OSTF187E12_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 563

 

 Score =  128 bits (321), Expect = 6e-31

 Identities = 60/60 (100%), Positives = 60/60 (100%)

 Frame = +1

 

Query: 1   LGGGTLLEGYGERWRSHRKMLTPTFHFAKLEGYFEVFNTESRVVVDCLDKFAKSGETVDL 60

           LGGGTLLEGYGERWRSHRKMLTPTFHFAKLEGYFEVFNTESRVVVDCLDKFAKSGETVDL

Sbjct: 334 LGGGTLLEGYGERWRSHRKMLTPTFHFAKLEGYFEVFNTESRVVVDCLDKFAKSGETVDL 513

 

>gi|4263194|gb|AC006744.1|  Download subject sequence spanning the HSP Caenorhabditis elegans cosmid Y38C9B, complete sequence

          Length = 19368

 

 Score = 60.1 bits (144), Expect = 3e-10

 Identities = 29/31 (93%), Positives = 30/31 (96%)

 Frame = +3

 

Query: 1     TAMGAKVDAQLQNSHPYITAIEQALQLGVLY 31

             TAMGAKVDAQLQNSHPYITAIEQALQLGV +

Sbjct: 10719 TAMGAKVDAQLQNSHPYITAIEQALQLGVSF 10811

 

 

 

 

 Score = 45.4 bits (106), Expect = 9e-06

 Identities = 20/32 (62%), Positives = 23/32 (71%)

 Frame = +2

 

Query: 15    HPYITAIEQALQLGVLYAMNPHHQIPAIYWAL 46

             HP++        + VLYAMNPHHQIPAIYWAL

Sbjct: 11486 HPFL-----CFYIQVLYAMNPHHQIPAIYWAL 11566

 

 

CYP13A10

 

>CYP13A10 ZK1320.4    Z46934 505 aa

          Length = 506

 

 Score = 1405 (494.6 bits), Expect = 1.4e-263, Sum P(2) = 1.4e-263

 Identities = 273/305 (89%), Positives = 280/305 (91%)

 

Query:     3 VILLAIPTLFIGFISYYLWIWTYWRRRGIPGPLGYPLVGSFPKTLKSEYPQYLQIRDWTK 62

             +  L +   +I ++SYYLWIWTYWRRRGIPGPLGYPLVGSFPKTLKSEYPQYLQIRDWTK

Sbjct:    14 IFYLFLKAKYI-YLSYYLWIWTYWRRRGIPGPLGYPLVGSFPKTLKSEYPQYLQIRDWTK 72

 

Query:    63 LYGPIYGYTEGTIKTLIVSDIDIVRQIFVEQYDNFYGRKLNPIQGDPEKDERTNLFSAQG 122

             LYGPIYGYTEGTIKTLIVSDIDIVRQIFVEQYDNFYGRKLNPIQGDPEKDERTNLFSAQG

Sbjct:    73 LYGPIYGYTEGTIKTLIVSDIDIVRQIFVEQYDNFYGRKLNPIQGDPEKDERTNLFSAQG 132

 

Query:   123 FRWKRLRAISSPTFSNNSLRKINVTVEDSAMELLRHIEEQTSEGQQIDMLQFYQEFTMDT 182

             FRWKRLRAISSPTFSNNSLRKINVTVEDSAMELLRHIEEQTSEGQQIDMLQFYQEFTMDT

Sbjct:   133 FRWKRLRAISSPTFSNNSLRKINVTVEDSAMELLRHIEEQTSEGQQIDMLQFYQEFTMDT 192

 

Query:   183 IGRIAMGQTDSQMFKNPLLKFVRAIFGDNRKHIPLIGGVFPTLAQVFRFFMLKFPLLGAA 242

             IGRIAMGQTDSQMFKNPLLKFVRAIFGDNRKHIPLIGGVFPTLAQVFRFFMLKF     

Sbjct:   193 IGRIAMGQTDSQMFKNPLLKFVRAIFGDNRKHIPLIGGVFPTLAQVFRFFMLKF------ 246

 

Query:   243 NFIHVNKTVVTAVQNRIDQRENDRKNGIEIGEPQDFIDLFLEARADDVEHFQENNGDFSK 302

                      VTAVQNRIDQRENDRKNGIEIGEPQDFIDLFLEARADDVEHFQENNGDFS

Sbjct:   247 ---------VTAVQNRIDQRENDRKNGIEIGEPQDFIDLFLEARADDVEHFQENNGDFSL 297

 

Query:   303 TSSYV 307

             T+  +

Sbjct:   298 TTQEI 302

 

 Score = 1122 (395.0 bits), Expect = 1.4e-263, Sum P(2) = 1.4e-263

 Identities = 216/256 (84%), Positives = 229/256 (89%)

 

Query:   283 LEARADDVEHFQENNGDFSKTSSYVSF----RRKIIPEFKKLQGNRQLTTQEIVGQCLVF 338

             ++ R D  E+ ++N  +  +   ++      R   +  F++  G+  LTTQEIVGQCLVF

Sbjct:   250 VQNRIDQRENDRKNGIEIGEPQDFIDLFLEARADDVEHFQENNGDFSLTTQEIVGQCLVF 309

 

Query:   339 LIAGFDTTALSLSYTTFLLATHPEVQKKLQEEIERECIEPSISFDHLSKLKYMDCIIKET 398

             LIAGFDTTALSLSYTTFLLATHPEVQKKLQEEIERECIEPSISFDHLSKLKYMDCIIKET

Sbjct:   310 LIAGFDTTALSLSYTTFLLATHPEVQKKLQEEIERECIEPSISFDHLSKLKYMDCIIKET 369

 

Query:   399 LRLYPLGTMANSRRCMRATKLGNVEVEVGTMVQVDTWSLHTDTKIWGDDAKEFKPERWLD 458

             LRLYPLGTMANSRRCMRATKLGNVEVEVGTMVQVDTWSLHTDTKIWGDDAKEFKPERWLD

Sbjct:   370 LRLYPLGTMANSRRCMRATKLGNVEVEVGTMVQVDTWSLHTDTKIWGDDAKEFKPERWLD 429

 

Query:   459 PNCDQVFQKGGYISFGLGPRQCVGMRLAYMEEKMLLAHILRKYTFEVGTKTEIPLKLVGR 518

             PNCDQVFQKGGYISFGLGPRQCVGMRLAYMEEKMLLAHILRKYTFEVGTKTEIPLKLVGR

Sbjct:   430 PNCDQVFQKGGYISFGLGPRQCVGMRLAYMEEKMLLAHILRKYTFEVGTKTEIPLKLVGR 489

 

Query:   519 ATTQPETVWMHLKQRI 534

             ATTQPETVWMHLKQRI

Sbjct:   490 ATTQPETVWMHLKQRI 505

 

ZK1320.4; CE30715. remove cyan seq

MSVILLAIPTLFIGFISYYLWIWTYWRRRGIPGPLGYPLVGSFPKTLKSE  50

YPQYLQIRDWTKLYGPIYGYTEGTIKTLIVSDIDIVRQIFVEQYDNFYGR 100

KLNPIQGDPEKDERTNLFSAQGFRWKRLRAISSPTFSNNSLRKINVTVED 150

SAMELLRHIEEQTSEGQQIDMLQFYQEFTMDTIGRIAMGQTDSQMFKNPL 200

LKFVRAIFGDNRKHIPLIGGVFPTLAQVFRFFMLKFPLLGAANFIHVNKT 250

VVTAVQNRIDQRENDRKNGIEIGEPQDFIDLFLEARADDVEHFQENNGDF 300

SKTSSYVSFRRKIIPEFKKLQGNRQLTTQEIVGQCLVFLIAGFDTTALSL 350

SYTTFLLATHPEVQKKLQEEIERECIEPSISFDHLSKLKYMDCIIKETLR 400

LYPLGTMANSRRCMRATKLGNVEVEVGTMVQVDTWSLHTDTKIWGDDAKE 450

FKPERWLDPNCDQVFQKGGYISFGLGPRQCVGMRLAYMEEKMLLAHILRK 500

YTFEVGTKTEIPLKLVGRATTQPETVWMHLKQRI                 534

 

>CYP42A1 CM08B12  M89401, AL020988 Y80D3 contig 01341 and 00427 504 aa

         Length = 505

 

 Score = 2290 (806.1 bits), Expect = 5.8e-241, P = 5.8e-241

 Identities = 440/450 (97%), Positives = 440/450 (97%)

 

Query:     1 MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQS 60

             MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQS

Sbjct:     1 MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQS 60

 

Query:    61 QGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSAWIGDGL 120

             QGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSAWIGDGL

Sbjct:    61 QGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSAWIGDGL 120

 

Query:   121 LIS-KPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGTGEYSDVFHTIT 179

             LIS KPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGTGEYSDVFHTIT

Sbjct:   121 LISRKPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGTGEYSDVFHTIT 180

 

Query:   180 LCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKE 239

             LCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKE

Sbjct:   181 LCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKE 240

 

Query:   240 HDECVK--ILHEFTSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGE 297

             HDECVK  ILHEFTSKAIYARK          QLLAQETAEGRRRMAFLDLMLDMNSKGE

Sbjct:   241 HDECVKVIILHEFTSKAIYARK----------QLLAQETAEGRRRMAFLDLMLDMNSKGE 290

 

Query:   298 LPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDEVLGEADRPVSYE 357

             LPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDEVLGEADRPVSYE

Sbjct:   291 LPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDEVLGEADRPVSYE 350

 

Query:   358 DLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVMVPSMVHKDPRYW 417

             DLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVMVPSMVHKDPRYW

Sbjct:   351 DLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVMVPSMVHKDPRYW 410

 

Query:   418 DDPEIFNPERFITGELKHPYAYIPFSAGSRNCI 450

             DDPEIFNPERFITGELKHPYAYIPFSAGSRNCI

Sbjct:   411 DDPEIFNPERFITGELKHPYAYIPFSAGSRNCI 443