Comparison of Dictyoselium assemblies from genome annotation and my annotation.

Attempts are made to reconcile the differences in the assemblies.

D. Nelson

May 13, 2003

 

I have 55 sequences identified

 

51     = seq 19             ng7190 use ng7190

508A1  = seq 1+2            ng3018 missing N-term exon use my seq

508A2  = seq 22+45+69+86+88 ng5262 N-term incorrect use my seq

508A3  = seq 21+67+72+82+87 ng10949 correct the two diffs and add my N-term

508A4  = seq 66+70          ng4360 remove K from mine at boundary then use mine

508B1  = seq 15+77          ng6400 missing most of N-term use my seq

508B2P = seq 60

508C1  = seq 4+23+24+83     ng1934 missing three parts use my seq

508D1  = seq 59+81          ng1935 missing N-terminal exon use my seq

508E1  = seq 71+85          ng8897 use my seq correct two other defects

513A1  = seq 8+53           ng6889 n-term missing use my seq

513A2P = seq 38             ng12344 correct 1 aa diff use my seq

513A3  = seq 7+50           ng4838 add missing intron back use my seq

513B1  = seq 49             ng627 2aa diffs use ng627 seq

513C1  = seq 5+56           ng6446 100% match

513D1  = seq 54+55+78       ng11186

513E1  = seq 51+52+90       ng3101 use N instead of D

513E2P = seq 37a complete except for missing N-term exon 55% to 513E1

         (old name 513C1P)  ng1289 missing first 13 aa use my seq

513E3P = seq 37b upstream of 513E2P only 429 bp between them (old name 513C2P)

513F1  = seq 57

513G1P = seq 41 37% to CYP513A1

514A1  = seq 11             ng1282 add 7aa to exon 1 and change 1 aa L to F in ng1282

514A2  = seq 65             ng1006 joint at N-term exon does not agree, 2aa diffs

      inside, use ng1006 seq

514A3P = seq 89

514A4  = new seq            ng2792 7aa difs to 514A1

515A1  = seq 79b            ng4137 some parts of both seqs have support

515A2P = seq 79a

515B1  = seq 6              ng2647 N-terms do not match use ng2647 but extend back

516A1  = seq 9+43           ng9107 use ng9107

516A2P = seq 64

516B1  = seq 10+42          ng9290 use mine  but add insertion

516B2P = seq 48 partial, might be a pseudogene

517A1  = seq 3+61           ng4501 use this seq with my N-terminal

517A2  = seq 74             ng10310 correct 3aa diffs and use my seq

517A3P = seq 36             ng682 missing some seq. Use my seq add new C-term piece

517A4  = new seq            ng5440 most like 517A1, 13 aa diffs

518A1  = seq 20

518A2P = seq 25+27+29+30

518B1  = seq 26+28

519A1  = seq 12             ng8085 add N-term exon then use ng8085

519B1  = seq 35+44+75       ng6320 USE ng6320

519C1  = seq 31+32+33       ng5581 use my seq (N-term missing)

519C2P = seq 46

519D1  = seq 14+34          ng12432 use my seq

519E1  = seq 17+84          ng6352 add two short pieces use ng6352

520A1  = seq 47+68+80       ng8864 use this seq with my N-term

520B1  = seq 58             ng1210 correct stop codon. use my seq.

521A1  = seq 13             ng2405 100% match

522A1  = seq 18

519H1P = seq 39+73          ng11335 old 523A1 correct three aa diffs add my N-term

524A1  = seq 91             ng752 N-terms do not agree use my seq

525A1  = seq 62

554A1  = seq 40             ng1193 small diffs use ng1193 seq

555A1  = seq 76             ng10297 use ng10297

556A1  = seq 92             ng10943 use my seq

 

Dicty database did not identify the following as P450s 508B2P, 513E2P, 513F1, 513G1P,

515A2P, 516A2P, 516B2, 518A1, 518A2P, 518B1, 519C2P, 522A1, 525A1,

note 518A1, 518B1 and 522A1 have non-standard Heme signatures

(13 seqs)

Dicty database found two new genes I had missed 514A4 and 517A4 both very similar to existing genes.

 

>514A2

>aa-ng1006        Contig_0356     pred cdna   pORF 5256         3526 strand r start y stop y ph 0

MNLIYTIILTIIILVLIISIKDLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVATCF

NDFYKQYGKVYRLRLGSVETVVLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILH

SNGDYHFLLKGVLSSQVTVRKLNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIK

KLILKVVLNFTLGIEENDDINLSLFQNGSNIFKAAGLFIYSDYLPFLFPLDIKSMAKSNM

ISSYVFVRDYLAKKLEEVKKKEYIINGDDDGGVDTSQTPLIESYYKLYLQGLIGYDSILL

SIVDIIIASVDTTSNSISFIIARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPY

IISIMNETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLISKEFWGEDALE

FKPERFKTQTLNQKGLLHFGAGPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDR

GDVGLSMQCKPYDALFIKRN

 

JC1c209h10.r1 has FL not CF

JC3a120c12.s1 has ?F not FL

Contig_0356   has CF not FL use CF

 

Query:     1 MNLIYTIILTIIILVLIISIKDLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVATCF 60

             MNLIYTIILTIIIL       DLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVAT 

Sbjct:     1 MNLIYTIILTIIIL-------DLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVATFL 53

 

Query:    61 NDFYKQYGKVYRLRLGSVETVVLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILH 120

             NDFYKQYGKVYRLRLGSVETVVLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILH

Sbjct:    54 NDFYKQYGKVYRLRLGSVETVVLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILH 113

 

Query:   121 SNGDYHFLLKGVLSSQVTVRKLNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIK 180

             SNGDYHFLLKGVLSSQVTVRKLNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIK

Sbjct:   114 SNGDYHFLLKGVLSSQVTVRKLNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIK 173

 

Query:   181 KLILKVVLNFTLGIEENDDINLSLFQNGSNIFKAAGLFIYSDYLPFLFPLDIKSMAKSNM 240

             KLILKVVLNFTLGIEENDDINLSLFQNGSNIFKAAGLFIYSDYLPFLFPLDIKSMAKSNM

Sbjct:   174 KLILKVVLNFTLGIEENDDINLSLFQNGSNIFKAAGLFIYSDYLPFLFPLDIKSMAKSNM 233

 

Query:   241 ISSYVFVRDYLAKKLEEVKKKEYIINGDDDGGVDTSQTPLIESYYKLYLQGLIGYDSILL 300

             ISSYVFVRDYLAKKLEEVKKKEYIINGDDDGGVDTSQTPLIESYYKLYLQGLIGYDSILL

Sbjct:   234 ISSYVFVRDYLAKKLEEVKKKEYIINGDDDGGVDTSQTPLIESYYKLYLQGLIGYDSILL 293

 

Query:   301 SIVDIIIASVDTTSNSISFIIARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPY 360

             SIVDIIIASVDTTSNSISFIIARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPY

Sbjct:   294 SIVDIIIASVDTTSNSISFIIARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPY 353

 

Query:   361 IISIMNETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLISKEFWGEDALE 420

             IISIMNETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLISKEFWGEDALE

Sbjct:   354 IISIMNETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLISKEFWGEDALE 413

 

Query:   421 FKPERFKTQTLNQKGLLHFGAGPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDR 480

             FKPERFKTQTLNQKGLLHFGAGPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDR

Sbjct:   414 FKPERFKTQTLNQKGLLHFGAGPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDR 473

 

Query:   481 GDVGLSMQCKPYDALFIKRN 500

             GDVGLSMQCKPYDALFIKRN

Sbjct:   474 GDVGLSMQCKPYDALFIKRN 493

 

>CYP513B1

>aa-ng627        Contig_0257     pred cdna   pORF 5998         4427 strand r start y stop y ph 0

MNLLVLSVILAIIIYLIFKRNYKYSPSKINSKIPGPIGLPIFGNILSLDNKNGIHTTFQK

WFKIYGPIYSVNMGNKSAVVLTGFPIIKKAFIDNSEAFAPHYTFESRYKLNKCSDITQEN

GKNQSALKRIFLSELTVTRIKKQESHIQNEIVKLMKVLDKHSEDGKPFLLNNYFSMFSIN

IISRFLFGIDFPYQDFEETSDLMVGIRDLLIASGEIVLSDFLPIPHSKRSKLYTSYQALV

VQIETLVKSHKYKEDDECMLSKLMIEHDKGNIPWDAVISNCNTIITAGSDSTSSTALFFL

IEMMNNPTIQTKVYNDIVVSFEQNQQADDYMNESMVILKYSKYRSLIPYLSLALKENYRK

HPAAPFGAPHETTQETVIEGYTIAKGTMIFQNIYATQRSDTFYSQPDEFIPERWNGDENS

QTLISFGTGIRDCIGKSLAYNEIFTIIASVLNRYEFINPNPSIPFDDNGIPGLTTQCKNT

VVQIKKR

 

Note only one seq has L C.  The others all have S W so correct my seq

 

>CYP513B1 seq 49 complete seq 45% to seq 7, 50

          Length = 488

 

 Score = 2504 (881.5 bits), Expect = 4.8e-264, P = 4.8e-264

 Identities = 485/487 (99%), Positives = 485/487 (99%)

 

Query:     1 MNLLVLSVILAIIIYLIFKRNYKYSPSKINSKIPGPIGLPIFGNILSLDNKNGIHTTFQK 60

             MNLLVLSVILAIIIYLIFKRNYKYSPSKINSKIPGPIGLPIFGNILSLDNKNGIHTTFQK

Sbjct:     1 MNLLVLSVILAIIIYLIFKRNYKYSPSKINSKIPGPIGLPIFGNILSLDNKNGIHTTFQK 60

 

Query:    61 WFKIYGPIYSVNMGNKSAVVLTGFPIIKKAFIDNSEAFAPHYTFESRYKLNKCSDITQEN 120

             WFKIYGPIYSVNMGNKSAVVLTGFPIIKKAFIDNSEAFAPHYTFESRYKLNKCSDITQEN

Sbjct:    61 WFKIYGPIYSVNMGNKSAVVLTGFPIIKKAFIDNSEAFAPHYTFESRYKLNKCSDITQEN 120

 

Query:   121 GKNQSALKRIFLSELTVTRIKKQESHIQNEIVKLMKVLDKHSEDGKPFLLNNYFSMFSIN 180

             GKNQSALKRIFLSELTVTRIKKQESHIQNEIVKLMKVLDKHSEDGKPFLLNNYFSMFSIN

Sbjct:   121 GKNQSALKRIFLSELTVTRIKKQESHIQNEIVKLMKVLDKHSEDGKPFLLNNYFSMFSIN 180

 

Query:   181 IISRFLFGIDFPYQDFEETSDLMVGIRDLLIASGEIVLSDFLPIPHSKRSKLYTSYQALV 240

             IISRFLFGIDFPYQDFEETSDLMVGIRDLLIASGEIVLSDFLPIPHSKRSKLYTSYQALV

Sbjct:   181 IISRFLFGIDFPYQDFEETSDLMVGIRDLLIASGEIVLSDFLPIPHSKRSKLYTSYQALV 240

 

Query:   241 VQIETLVKSHKYKEDDECMLSKLMIEHDKGNIPWDAVISNCNTIITAGSDSTSSTALFFL 300

             VQIETLVKSHKYKEDDECML KLMIEHDKGNIP DAVISNCNTIITAGSDSTSSTALFFL

Sbjct:   241 VQIETLVKSHKYKEDDECMLLKLMIEHDKGNIPCDAVISNCNTIITAGSDSTSSTALFFL 300

 

Query:   301 IEMMNNPTIQTKVYNDIVVSFEQNQQADDYMNESMVILKYSKYRSLIPYLSLALKENYRK 360

             IEMMNNPTIQTKVYNDIVVSFEQNQQADDYMNESMVILKYSKYRSLIPYLSLALKENYRK

Sbjct:   301 IEMMNNPTIQTKVYNDIVVSFEQNQQADDYMNESMVILKYSKYRSLIPYLSLALKENYRK 360

 

Query:   361 HPAAPFGAPHETTQETVIEGYTIAKGTMIFQNIYATQRSDTFYSQPDEFIPERWNGDENS 420

             HPAAPFGAPHETTQETVIEGYTIAKGTMIFQNIYATQRSDTFYSQPDEFIPERWNGDENS

Sbjct:   361 HPAAPFGAPHETTQETVIEGYTIAKGTMIFQNIYATQRSDTFYSQPDEFIPERWNGDENS 420

 

Query:   421 QTLISFGTGIRDCIGKSLAYNEIFTIIASVLNRYEFINPNPSIPFDDNGIPGLTTQCKNT 480

             QTLISFGTGIRDCIGKSLAYNEIFTIIASVLNRYEFINPNPSIPFDDNGIPGLTTQCKNT

Sbjct:   421 QTLISFGTGIRDCIGKSLAYNEIFTIIASVLNRYEFINPNPSIPFDDNGIPGLTTQCKNT 480

 

Query:   481 VVQIKKR 487

             VVQIKKR

Sbjct:   481 VVQIKKR 487

 

>517A3P

MEIVNV (frameshift)

FIILIILFLVKDF (0)

VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHEL (10 aa deletion)

IVRAWIGE RLFLFVSNYDVKYFQKDENFLYKPSLLVPGWRYASSNGLGVMSSSDDEWKRAKSS

VS**LRVHTSKKLMEEKAIEFIDSLEKISNNNEI (0)

FYPKGHIQGYACSMLFKYMFNQDLSVESGLSRTIGNAVEHVFGNLSKLTAFDCFE

IFSPLYDWFFTRRL*DCDIV*QIINILTENH

 

This is a pseudogene, so the sequences in green need to be added.

Ng682 has an extension that I did not have that needs to be added to my seq.

The seq similarity stops after LCSNG so I removed that last two aa.

 

>aa-ng682        Contig_0274     pred cdna   pORF 17004       15801 strand r start y stop y ph 0

MEIVNVKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELIVRAWIGERLFLFVS

NYDVKYFQKDENFLYKPSLLVPGWRYASSNGLGVMSSSDDEWKRAKSS

FYPKGHIQGYACSMLFKYMFNQDLSVESGLSRTIGNAVEHVFGNLSKLTAFDCFEIFSPLYDWFFTRRL

KGCDIVRQIISSQNENHFKSIDPSKPRDLMDDLLIEYGLNEITKEDTMKINQICFDIFGPAIG

TVTITMNWVILQLCNRPEPQEIAYLEIKKAVKDDEYVNFNHKQNAPYIVAFIKETMRLCS

NGPN

 

>CYP517A3P Seq 36 pseudogene 77% to seq 3 77% to seq 61

           Length = 242

 

 Score = 546 (192.2 bits), Expect = 8.8e-99, Sum P(2) = 8.8e-99

 Identities = 103/103 (100%), Positives = 103/103 (100%)

 

Query:     6 VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELIVRAWIGERLFLFVSNYDVK 65

             VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELIVRAWIGERLFLFVSNYDVK

Sbjct:    20 VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELIVRAWIGERLFLFVSNYDVK 79

 

Query:    66 YFQKDENFLYKPSLLVPGWRYASSNGLGVMSSSDDEWKRAKSS 108

             YFQKDENFLYKPSLLVPGWRYASSNGLGVMSSSDDEWKRAKSS

Sbjct:    80 YFQKDENFLYKPSLLVPGWRYASSNGLGVMSSSDDEWKRAKSS 122

 

 Score = 418 (147.1 bits), Expect = 8.8e-99, Sum P(2) = 8.8e-99

 Identities = 79/86 (91%), Positives = 80/86 (93%)

 

Query:   109 FYPKGHIQGYACSMLFKYMFNQDLSVESGLSRTIGNAVEHVFGNLSKLTAFDCFEIFSPL 168

             FYPKGHIQGYACSMLFKYMFNQDLSVESGLSRTIGNAVEHVFGNLSKLTAFDCFEIFSPL

Sbjct:   157 FYPKGHIQGYACSMLFKYMFNQDLSVESGLSRTIGNAVEHVFGNLSKLTAFDCFEIFSPL 216

 

Query:   169 YDWFFTRRLKGCDIVRQIISSQNENH 194

             YDWFFTRRL  CDIV QII+   ENH

Sbjct:   217 YDWFFTRRL*DCDIV*QIINILTENH 242

 

>CYP524A1

MKTPTKYFIIFILLAALAVF (0)

KGSLPGPSFVPPFFGMLFQLIFTPFSFYEKQEKYGPISWTSIMNKF

VLFVTDAEINRQVFKEENAKLYLSLGAKKILTEK

AIPFIEGAP HRQLRKQLLPLFTIRALSSYLPIQESIVDEHIAMWIKNGKADINARNNCRDLNMAISTGVFV

GNNTPESVRDDIAKNFFVMNEGFLCLPIDLPGTTLRKAINARVRLVEIFTDIIAKSRKRMGDGE

KPQSLIDLWVEHFLNCPKEERDELSNDTIIFTLLSFMFASQDALTS

SLVWTVQLMAEHPDILAKVRAEQASLRPNNEKLDLDTMRQATYTRMVVSEILRFRPPAVMVPHE

NIEDIVIGDNVHVPKGTMILPSIWSAHFQEGGYSDPYKFDPQRFDSVRKEDVTCAKNSLVFGAG

PHFCIGKELAKNQIEVFLTKLAMSTEWTHNKTPGGDEIIFGPTIFPKDGCNITIKARN*

 

Note: two exon extensions occur in ng752. VSEAT to KLIVA extends exon 1 downstream

VGKFT to ITQK extends exon 2 upstream.  This makes the N-term too long.

The N-terminal up to the PFFGML motif is usually 30-50 aa long, but ng752 is 96aa long.  I include the sequence VLFV to LTEK, which ng752 removes as an intron.

I think my solution is better, since it preserves the length between the PFFGML motif and the C-helix motif (HRQLRK) as compared to the Arabidopsis CYP710A1 seq and

The CYP26B1 FROM FUGU (see aligns below, deleted intron shown in red)

 

>aa-ng752        Contig_0287     pred cdna   pORF 3086         1351 strand r start y stop y ph 0

MKTPTKYFIIFILLAALAVF

VSEATSKVGQQTTTTTQQVKCSGLQCTLNKLIVA VGKFTIKQILVGIVIVLATIALHQQYVITQK

KGSLPGPSFVP PFFGML FQLIFTPFSFYEKQEKYGPISWTSIMNKF

AIPFIEGAP HRQLRKQLLPLFTIRALSSYLPIQESIVDEHIAMWIKNGK

ADINARNNCRDLNMAISTGVFVGNNTPESVRDDIAKNFFVMNEGFLCLPIDLPGTTLRKA

INARVRLVEIFTDIIAKSRKRMGDGEKPQSLIDLWVEHFLNCPEEERDELSNDTIIFTLL

SFMFASQDALTSSLVWTVQLMAEHPDILAKVRAEQASLRPNNEKLDLDTMRQATYTRMVV

SEILRFRPPAVMVPHENIEDIVIGDNVHVPKGTMILPSIWSAHFQEGGYSDPYKFDPQRF

DSVRKEDVTCAKNSLVFGAGPHFCIGKELAKNQIEVFLTKLAMSTEWTHNKTPGGDEIIF

GPTIFPKDGCNITIKARN

 

>CYP710A1 (subject) CYP524A1 = query

         Length = 496

 

 Score = 780 (274.6 bits), Expect = 1.3e-80, P = 1.3e-80

 Identities = 167/469 (35%), Positives = 267/469 (56%)

 

Query:     8 FIIFILLAALA-VFKG-SLPGPSFVPPFFGMLFQLIFTPFSFYEKQEKYGPISWTS---I 62

             F++F+L+  L+ +FK  ++PGP FVPP  G    L+  P SF++KQ     IS  S   +

Sbjct:    19 FLLFLLVEQLSYLFKKRNIPGPFFVPPIIGNAVALVRDPTSFWDKQSSTANISGLSANYL 78

 

Query:    63 MNKFVLFVTDAEINRQVFKEENAKLYLSLG---AKKILTEKAIPFIEGAPHRQLRKQLLP 119

             + KF++++ D E++ Q+F       +  +G    KK+  +  + ++ G  H+ +R+QL P

Sbjct:    79 IGKFIVYIRDTELSHQIFSNVRPDAFHLIGHPFGKKLFGDHNLIYMFGEDHKSVRRQLAP 138

 

Query:   120 LFTIRALSSYLPIQESIVDEHIAMW---IKNGKADINARNNCRDLNMAISTGVFVGNNTP 176

              FT +ALS+Y  +Q+ ++  H+  W      G   ++ R   R+LN+  S  VFVG   

Sbjct:   139 NFTPKALSTYSALQQLVILRHLRQWEGSTSGGSRPVSLRQLVRELNLETSQTVFVGPYLD 198

 

Query:   177 ESVRDDIAKNFFVMNEGFLCLPIDLPGTTLRKAINARVRLVEIFTDIIAKSRKRMGDGEK 236

             +  ++    ++ + N G + LPIDLPG    +A  A  RL E       KS+ RM  GE+

Sbjct:   199 KEAKNRFRTDYNLFNLGSMALPIDLPGFAFGEARRAVKRLGETLGICAGKSKARMAAGEE 258

 

Query:   237 PQSLIDLWVEHFLNCPKEERDELSNDTIIFTLL-SFMFASQDALTSSLVWTVQLMAEHPD 295

             P  LID W++  +   +  +   S D  I  LL  F+FA+QDA TSSL+W V L+   P+

Sbjct:   259 PACLIDFWMQAIV--AENPQPPHSGDEEIGGLLFDFLFAAQDASTSSLLWAVTLLDSEPE 316

 

Query:   296 ILAKVRAEQASL-RP-NNEKLDLDTMRQATYTRMVVSEILRFRPPAVMVPHENIEDIVIG 353

             +L +VR E A +  P +N  + +D + +  YTR V  E++R+RPPA MVPH    D  +

Sbjct:   317 VLNRVREEVAKIWSPESNALITVDQLAEMKYTRSVAREVIRYRPPATMVPHVAAIDFPLT 376

 

Query:   354 DNVHVPKGTMILPSIWSAHFQEGGYSDPYKFDPQRFDSVRKEDVTCAKNSLVFGAGPHFC 413

             +   +PKGT++ PS++ + FQ  G+++P +FDP RF   R+ED    +N L FG GPH C

Sbjct:   377 ETYTIPKGTIVFPSVFDSSFQ--GFTEPDRFDPDRFSETRQEDQVFKRNFLAFGWGPHQC 434

 

Query:   414 IGKELAKNQIEVFLTKLAMSTEWTHNKTPGGDEIIFGPTIFPKDGCNITIKAR 466

             +G+  A N + +F+   +   ++   ++ G DEI++ PTI PKDGC + +  R

Sbjct:   435 VGQRYALNHLVLFIAMFSSLLDFKRLRSDGCDEIVYCPTISPKDGCTVFLSRR 487

 

Fugu CYP26B1

>CYP26B1 Scaffold_4267 Scaffold_49 partial sequence 74% to 26B1

         Length = 513

 

 Score = 355 (125.0 bits), Expect = 3.4e-36, P = 3.4e-36

 Identities = 122/440 (27%), Positives = 195/440 (44%)

 

Query:    24 LPGPSFVPPFFGMLFQLIFTPFSFY-EKQEKYGPISWTSIMNKFVLFVTDAEINRQVFKE 82

             +P  S   PF G     +     F+  +++KYG +  T ++ + ++ VT AE  R+V  

Sbjct:    49 MPKGSMGFPFIGETCHWLLQGSGFHASRRQKYGNVFKTHLLGRPLIRVTGAENIRKVLMG 108

 

Query:    83 ENAKLYLSL--GAKKILTEKAIPFIEGAPHRQLRKQLLPLFTIRALSSYLPIQESIVDEH 140

             E+  + +        +L   ++    G  HR+ RK    +F+  AL SYLP  + ++ E

Sbjct:   109 EHTLVTVDWPQSTSTLLGPNSLANSIGDIHRKKRKVFAKVFSHEALESYLPKIQQVIQES 168

 

 

>CYP524A1 Seq 91 complete seq 25% to 10, 42 468 aa

          Length = 468

 

 Score = 1926 (678.0 bits), Expect = 1.1e-236, Sum P(3) = 1.1e-236

 Identities = 371/388 (95%), Positives = 376/388 (96%)

 

Query:     1 MKTPTKYFIIFILLAALAVF 20

             MKTPTKYFIIFILLAALAVF

Sbjct:     1 MKTPTKYFIIFILLAALAVF 20

 

Query:    86 KGSLPGPSFVPPFFGMLFQLIFTPFSFYEKQEKYGPISWTSIMNKFAIPFIEGAPHRQLR 145

             KGSLPGPSFVPPFFGMLFQLIFTPFSFYEKQEKYGPISWTSIMNKF +   +   +RQ+

Sbjct:    21 KGSLPGPSFVPPFFGMLFQLIFTPFSFYEKQEKYGPISWTSIMNKFVLFVTDAEINRQVF 80

 

Query:   146 KQ 147

             K+

Sbjct:    81 KE 82

 

Query:   112 FYEKQEK-YGPISWTSIMNKFAIPFIEGAPHRQLRKQLLPLFTIRALSSYLPIQESIVDE 170

             F E+  K Y  +    I+ + AIPFIEGAPHRQLRKQLLPLFTIRALSSYLPIQESIVDE

Sbjct:    80 FKEENAKLYLSLGAKKILTEKAIPFIEGAPHRQLRKQLLPLFTIRALSSYLPIQESIVDE 139

 

Query:   171 HIAMWIKNGKADINARNNCRDLNMAISTGVFVGNNTPESVRDDIAKNFFVMNEGFLCLPI 230

             HIAMWIKNGKADINARNNCRDLNMAISTGVFVGNNTPESVRDDIAKNFFVMNEGFLCLPI

Sbjct:   140 HIAMWIKNGKADINARNNCRDLNMAISTGVFVGNNTPESVRDDIAKNFFVMNEGFLCLPI 199

 

Query:   231 DLPGTTLRKAINARVRLVEIFTDIIAKSRKRMGDGEKPQSLIDLWVEHFLNCPEEERDEL 290

             DLPGTTLRKAINARVRLVEIFTDIIAKSRKRMGDGEKPQSLIDLWVEHFLNCP+EERDEL

Sbjct:   200 DLPGTTLRKAINARVRLVEIFTDIIAKSRKRMGDGEKPQSLIDLWVEHFLNCPKEERDEL 259

 

Query:   291 SNDTIIFTLLSFMFASQDALTSSLVWTVQLMAEHPDILAKVRAEQASLRPNNEKLDLDTM 350

             SNDTIIFTLLSFMFASQDALTSSLVWTVQLMAEHPDILAKVRAEQASLRPNNEKLDLDTM

Sbjct:   260 SNDTIIFTLLSFMFASQDALTSSLVWTVQLMAEHPDILAKVRAEQASLRPNNEKLDLDTM 319

 

Query:   351 RQATYTRMVVSEILRFRPPAVMVPHENIEDIVIGDNVHVPKGTMILPSIWSAHFQEGGYS 410

             RQATYTRMVVSEILRFRPPAVMVPHENIEDIVIGDNVHVPKGTMILPSIWSAHFQEGGYS

Sbjct:   320 RQATYTRMVVSEILRFRPPAVMVPHENIEDIVIGDNVHVPKGTMILPSIWSAHFQEGGYS 379

 

Query:   411 DPYKFDPQRFDSVRKEDVTCAKNSLVFGAGPHFCIGKELAKNQIEVFLTKLAMSTEWTHN 470

             DPYKFDPQRFDSVRKEDVTCAKNSLVFGAGPHFCIGKELAKNQIEVFLTKLAMSTEWTHN

Sbjct:   380 DPYKFDPQRFDSVRKEDVTCAKNSLVFGAGPHFCIGKELAKNQIEVFLTKLAMSTEWTHN 439

 

Query:   471 KTPGGDEIIFGPTIFPKDGCNITIKARN 498

             KTPGGDEIIFGPTIFPKDGCNITIKARN

Sbjct:   440 KTPGGDEIIFGPTIFPKDGCNITIKARN 467

 

>aa-ng1193        Contig_0430     pred cdna   pORF 4315         2561 strand r start y stop y ph 0

MDLLLFIFFLILFYYSVKYYKADNQNSLSLSGPTPVPILGNIHQVGKDAHLTIPIISKKY

HGIFRMWLGGTYYVVVSDYKLIREMYIENFENFKNRIATFKTMTGDDSRGIIGCNGDIWD

SNKELIMKSYKKVLNKDMNDFILLKSKELFNFFEKNGIKNEEEEDDDDDGNKSIIINNTR

FYFQSLTLTVMFKMIFNENKSFQLYSDSTEFKLIFKTILNLLNSLNVYNVIYDFLGIFQP

ILLKFTKILDKNSFLSKIATEKFNSRIKEIDFTSDDFKANDLLDSLIMTINEDENGLNEK

QIENIKSICIDFLMAGTDTTGSTIEWIILKLVNNPEFQELIFQELKKLNKSEITANDKIN

TPFLNSFIKETNRLYPIAPLSLPRKSINEMIIGDNKYYIPANTNILMDVKGFSLDENNYK

DPNEFKPDRFLNSKVSDTLNFGIGPRNCIGQTIAMNQIHIFLSNLILNYRMFSIDCLPLP

ENLILSVSVRPTEYSLKLIKRV

 

>CYP554A1 SEQ 40 30% to 508 469 aa complete

MDLLLFIFFLILFYYS (0)

YYKADNQNSLSLSGPTPVPILGNIHQVGKDAHLTIPIISKKYHGLFRMWFGGTYYVVV

SDYKLIREMYIENFENFKNRIA

TFKTMTGDDSRGIIGCNGDIWDSNKELIMKSYKKVLNKDMNDFILLKSKELFNFFEKNGIKNE (INTRON)

IINNTRFYFQSLTLT

VMFKMIFNENKSFQLYSDST

EFKLIFKTILNLLNSLNVYNVI

YDFLGIFQPILLKFTKILDKNSFLSKIA

TEKFNSRIKEIDFTSDDFKANDLLDSLIMTINEDEN

GLNEKQIENIKSICIDFLMAGTDTTGSTIEWIILKLVNNPEFQELIFQELK

KLNKSEITANDKINTPFLNSFIKETNRLY (0)

KSNNNFFFRKSINEMIIGDNKYYIPANTNILMDVKGFSLDENNYKDPNEFKPDRFLNSKVSDTLNFGI

GPRNCIGQTIAMNQIHIFLSNLILNYRMFSIDCLPLPENLF*

 

1. VK there are two possible GTs for this exon boundary

2. IIACP2E4184 is only seq with LFRMWF so change this to IFRMWL

3. EEEDDDDDGNKS no good GT AG pairs in frame so leave this in

4. PIAPLSLP matches other related P450s in this region so this is correct

5. ILSVSVRPTEYSLKLIKRV adds correct C-terminal

use the ng1193 seq as the correct seq.

 

>CYP554A1 SEQ 40 30% to 508 469 aa complete

          Length = 470

 

 Score = 2323 (817.7 bits), Expect = 7.3e-245, P = 7.3e-245

 Identities = 458/483 (94%), Positives = 459/483 (95%)

 

Query:     1 MDLLLFIFFLILFYYSVKYYKADNQNSLSLSGPTPVPILGNIHQVGKDAHLTIPIISKKY 60

             MDLLLFIFFLILFYYS  YYKADNQNSLSLSGPTPVPILGNIHQVGKDAHLTIPIISKKY

Sbjct:     1 MDLLLFIFFLILFYYS--YYKADNQNSLSLSGPTPVPILGNIHQVGKDAHLTIPIISKKY 58

 

Query:    61 HGIFRMWLGGTYYVVVSDYKLIREMYIENFENFKNRIATFKTMTGDDSRGIIGCNGDIWD 120

             HG+FRMW GGTYYVVVSDYKLIREMYIENFENFKNRIATFKTMTGDDSRGIIGCNGDIWD

Sbjct:    59 HGLFRMWFGGTYYVVVSDYKLIREMYIENFENFKNRIATFKTMTGDDSRGIIGCNGDIWD 118

 

Query:   121 SNKELIMKSYKKVLNKDMNDFILLKSKELFNFFEKNGIKNEEEEDDDDDGNKSIIINNTR 180

             SNKELIMKSYKKVLNKDMNDFILLKSKELFNFFEKNGIKNE             IINNTR

Sbjct:   119 SNKELIMKSYKKVLNKDMNDFILLKSKELFNFFEKNGIKNE-------------IINNTR 165

 

Query:   181 FYFQSLTLTVMFKMIFNENKSFQLYSDSTEFKLIFKTILNLLNSLNVYNVIYDFLGIFQP 240

             FYFQSLTLTVMFKMIFNENKSFQLYSDSTEFKLIFKTILNLLNSLNVYNVIYDFLGIFQP

Sbjct:   166 FYFQSLTLTVMFKMIFNENKSFQLYSDSTEFKLIFKTILNLLNSLNVYNVIYDFLGIFQP 225

 

Query:   241 ILLKFTKILDKNSFLSKIATEKFNSRIKEIDFTSDDFKANDLLDSLIMTINEDENGLNEK 300

             ILLKFTKILDKNSFLSKIATEKFNSRIKEIDFTSDDFKANDLLDSLIMTINEDENGLNEK

Sbjct:   226 ILLKFTKILDKNSFLSKIATEKFNSRIKEIDFTSDDFKANDLLDSLIMTINEDENGLNEK 285

 

Query:   301 QIENIKSICIDFLMAGTDTTGSTIEWIILKLVNNPEFQELIFQELKKLNKSEITANDKIN 360

             QIENIKSICIDFLMAGTDTTGSTIEWIILKLVNNPEFQELIFQELKKLNKSEITANDKIN

Sbjct:   286 QIENIKSICIDFLMAGTDTTGSTIEWIILKLVNNPEFQELIFQELKKLNKSEITANDKIN 345

 

Query:   361 TPFLNSFIKETNRLYPIAPLSLPRKSINEMIIGDNKYYIPANTNILMDVKGFSLDENNYK 420

             TPFLNSFIKETNRLY        RKSINEMIIGDNKYYIPANTNILMDVKGFSLDENNYK

Sbjct:   346 TPFLNSFIKETNRLYKSNNNFFFRKSINEMIIGDNKYYIPANTNILMDVKGFSLDENNYK 405

 

Query:   421 DPNEFKPDRFLNSKVSDTLNFGIGPRNCIGQTIAMNQIHIFLSNLILNYRMFSIDCLPLP 480

             DPNEFKPDRFLNSKVSDTLNFGIGPRNCIGQTIAMNQIHIFLSNLILNYRMFSIDCLPLP

Sbjct:   406 DPNEFKPDRFLNSKVSDTLNFGIGPRNCIGQTIAMNQIHIFLSNLILNYRMFSIDCLPLP 465

 

Query:   481 ENL 483

             ENL

Sbjct:   466 ENL 468

 

>aa-ng1210        Contig_0437     pred cdna   pORF 7346         5011 strand r start y stop y ph 0

MDVNGPWSLPIIGGIYLINDNPNRALTKLSKKYGGIYKIWLGESFSMVVSDPEIVNEIWV

KQHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKTKLLSSKVTSVVEK

KLNKLIETMEYHSINKLPFDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIF

KMLSTNSPQYSFPYLKFFFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSF

ISDLQSNDIDILLQICIDIVVAGTDTVANLLQWFVLFCINYPEIQEKLYNEIIEVVGKDC

KVLKYEHISKMPYLYGCFRESLRIRPVTPLSLPRVAKCDTYIKDDIFIPKGATIIQNIFG

MGNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYFNDLDKISIPFGVGKRQCLS

PAMAEQESLLSIATVVLNYKLKSNGQKKLNEKEVYSITIKPQPFKLFLEKRFVRKRV

 

Note: ng1210 missing N-terminal exon and some exon 2. compare to 520A1

One amino acid diff. Y vs. *

 

Stop codon in JC1c13f09.s1 Y in Contig_0437 correct seq to Y

Use my sequence for N-terminal

 

>CYP520B1 Seq 58 47% to seq 47 317 aa

MNYLLIIICIIFFSLFFDF

KIRKNWNLNFKRLFKKDVNGPWSLPIIGGIYLINDNPNRALTKLSKKYGGIYKIWLGESFSM

VVSDPEIVNEIWVKQHDNFINRPKNITHK

MFSSNYRSLNFGDNPNWKFNRSMASSHFTKTKLLSSK

VTSVVEKKLNKLIETMEYHSINKLP

FDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFKMLSTNSPQYSFP*LKF

FFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSF

ISDLQSNDIDILLQICIDIVVAGTDTVANLLQWFVLFCINY

PEIQEKLYNEIIEVVGKDCKVLKYEHISKMPYLYGCFRESLRIRPVTPLS

LPRVAKCDTYIKDDIFIPKG

ATIIQNIFGMGNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYF

NDLDKISIPFGVGKRQCLSPAMAEQESLLSIATVVLNYKLKSNGQKKLNE

KEVYSITIKPQPFKLFLEKRV*

 

>CYP520A1 seq (47+68+80) complete 47% to seq 58 517 aa

          Length = 517

 

 Score = 1247 (439.0 bits), Expect = 7.6e-131, P = 7.6e-131

 Identities = 248/516 (48%), Positives = 358/516 (69%)

 

Query:520B1 1 MNYLLIIICIIFFSLFFDFKIRK-NWNLNFKRLFKKDVNGPWSLPIIGGIYLINDNPNRA 59

              M  L  II +I F + FDFK +K   N  + +   K+ NGPWSLPIIGG++LI D PNR+

Sbjct:520A1 1 MEILTFIIYLITFFILFDFKKKKFKKNKRYSKSPNKEANGPWSLPIIGGLHLIGDRPNRS 60

 

>CYP520B1 Seq 58 47% to seq 47 317 aa

          Length = 507

 

 Score = 2460 (866.0 bits), Expect = 2.2e-259, P = 2.2e-259

 Identities = 469/470 (99%), Positives = 469/470 (99%)

 

Query:     2 DVNGPWSLPIIGGIYLINDNPNRALTKLSKKYGGIYKIWLGESFSMVVSDPEIVNEIWVK 61

             DVNGPWSLPIIGGIYLINDNPNRALTKLSKKYGGIYKIWLGESFSMVVSDPEIVNEIWVK

Sbjct:    36 DVNGPWSLPIIGGIYLINDNPNRALTKLSKKYGGIYKIWLGESFSMVVSDPEIVNEIWVK 95

 

Query:    62 QHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKTKLLSSKVTSVVEKK 121

             QHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKTKLLSSKVTSVVEKK

Sbjct:    96 QHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKTKLLSSKVTSVVEKK 155

 

Query:   122 LNKLIETMEYHSINKLPFDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFK 181

             LNKLIETMEYHSINKLPFDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFK

Sbjct:   156 LNKLIETMEYHSINKLPFDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFK 215

 

Query:   182 MLSTNSPQYSFPYLKFFFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSFI 241

             MLSTNSPQYSFP LKFFFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSFI

Sbjct:   216 MLSTNSPQYSFP*LKFFFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSFI 275

 

Query:   242 SDLQSNDIDILLQICIDIVVAGTDTVANLLQWFVLFCINYPEIQEKLYNEIIEVVGKDCK 301

             SDLQSNDIDILLQICIDIVVAGTDTVANLLQWFVLFCINYPEIQEKLYNEIIEVVGKDCK

Sbjct:   276 SDLQSNDIDILLQICIDIVVAGTDTVANLLQWFVLFCINYPEIQEKLYNEIIEVVGKDCK 335

 

Query:   302 VLKYEHISKMPYLYGCFRESLRIRPVTPLSLPRVAKCDTYIKDDIFIPKGATIIQNIFGM 361

             VLKYEHISKMPYLYGCFRESLRIRPVTPLSLPRVAKCDTYIKDDIFIPKGATIIQNIFGM

Sbjct:   336 VLKYEHISKMPYLYGCFRESLRIRPVTPLSLPRVAKCDTYIKDDIFIPKGATIIQNIFGM 395

 

Query:   362 GNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYFNDLDKISIPFGVGKRQCLSP 421

             GNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYFNDLDKISIPFGVGKRQCLSP

Sbjct:   396 GNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYFNDLDKISIPFGVGKRQCLSP 455

 

Query:   422 AMAEQESLLSIATVVLNYKLKSNGQKKLNEKEVYSITIKPQPFKLFLEKR 471

             AMAEQESLLSIATVVLNYKLKSNGQKKLNEKEVYSITIKPQPFKLFLEKR

Sbjct:   456 AMAEQESLLSIATVVLNYKLKSNGQKKLNEKEVYSITIKPQPFKLFLEKR 505

 

>aa-ng1282        Contig_0470     pred cdna   pORF 2724         4834 strand f start y stop y ph 0

MNTIFTIILTITILVLSLILKDLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV

NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIIS

SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA

VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS

YFFLKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI

LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY

PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA

LEFKPERFKNQPLYQKGLFHFGAGPRGCPGGRFTESLTFTFLVIMLKNFKIVNPTDIPID

VEGEVGLAMQCKPFDALFIKHTNRNDFSKINKGLNQQCFTSSSQFPTCYQTEFFEFGFNG

NFQLNDGIDCSSGVQQLSFFEKGFVFRVDHFLFEF

 

Note exon 1 longer in ng1282 by 7 aa better match to 514A2

1 aa diff L vs. F

 

SSM757 has F

IIAFP1D28194 has F

JAX4a185c11.s1 has F  Note: there appear to be two genes use L in this gene

 

>CYP514A1 Seq. 11 complete seq 35% to seq 7

          Length = 496

 

 Score = 2525 (888.8 bits), Expect = 2.9e-266, P = 2.9e-266

 Identities = 492/500 (98%), Positives = 492/500 (98%)

 

Query:     1 MNTIFTIILTITILVLSLILKDLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV 60

             MNTIFTIILTITIL       DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV

Sbjct:     1 MNTIFTIILTITIL-------DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV 53

 

Query:    61 NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIIS 120

             NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIIS

Sbjct:    54 NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIIS 113

 

Query:   121 SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA 180

             SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA

Sbjct:   114 SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA 173

 

Query:   181 VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS 240

             VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS

Sbjct:   174 VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS 233

 

Query:   241 YFFLKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI 300

             YFFLKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI

Sbjct:   234 YFFLKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI 293

 

Query:   301 LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY 360

             LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY

Sbjct:   294 LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY 353

 

Query:   361 PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA 420

             PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA

Sbjct:   354 PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA 413

 

Query:   421 LEFKPERFKNQPLYQKGLFHFGAGPRGCPGGRFTESLTFTFLVIMLKNFKIVNPTDIPID 480

             LEFKPERFKNQPLYQKGLFHFGAGPRGCPGGRFTES TFTFLVIMLKNFKIVNPTDIPID

Sbjct:   414 LEFKPERFKNQPLYQKGLFHFGAGPRGCPGGRFTESFTFTFLVIMLKNFKIVNPTDIPID 473

 

Query:   481 VEGEVGLAMQCKPFDALFIK 500

             VEGEVGLAMQCKPFDALFIK

Sbjct:   474 VEGEVGLAMQCKPFDALFIK 493

 

>aa-ng1289        Contig_0472     pred cdna   pORF 124          1538 strand f start n stop y ph 1

PIGLPIIGNLHQLKNNPNKVFKKWYKKYGPIYRIRMGNIETIILTGYPIIKKSFIDNSNV

FQKRYEYRSKIQFNNCSDLTFTNGETHSNLKKIILSEMTLTKLKKQENYIILECEKLCKH

LDNHCESGLPFKFKFYGKLFSLNIIFRFLFGLNYQYDNEKIIEIINLILELVEGAGDPII

SDFIPLVKYFENINQNKFLKIYKKLIILIENIINNLKLKFIENNYDDDNDENNITIFLKL

FKEFKNGKITWDNLVGTCVDLTVGGTDTVSDSLNFSLIALINNPNCQEKLYNEIKNELLK

NNNNNNIDDLIILKHSLYRLSIPYLSMVMKETYRNFPVALLGLPIITTQDMEINGYKISK

GTQVFKNIFLSHKSKDFFKSPNDFNPERFSDSGNNFMFGGGTTNLVQFGAGSRDCIGKNV

VDCELFTVLGTLINRYQFLNPVLSKPLNENGIIGLANQSPDNYFIIKKRK

 

>513E2P Seq 37a 45% to seq 5

KNRINKINSKIPGPIGLPIIGNLHQLKNNPNK

VFKKWYKKYGPIYRIRMGNIETIILTGYPIIKKSFIDNSNVFQKRYEYRSKIQFNNCSDL

TFTNGETHSNLKKIILSEMTLTKLKKQENYIILECEKLCKHLDNHCESGLPFKFKFYGKL

FSLNIIFRFLFGLNYQYDNEKIIEIINLILELVEGAGDPIISDFIPLVKYFENINQNKFL

KIYKKLIILIENIINNLKLKFIENNYDDDNDENNITIFLKLFKEFKNGKITWDNLVGTCV

DLTVGGTDTVSDSLNFSLIALINNPNCQEKLYNEIKNELLKNNNNNNIDDLIILKHSLYR

LSIPYLSMVMKETYRNFPVALLGLPIITTQDMEINGYKISKGTQVFKNIFLSHKSKDFFK

SPNDFNPERFSDSGNNFMFGGGTTNLVQFGAGSRDCIGKNVVDCELFTVLGTLINRYQFL

NPVLSKPLNENGIIGLANQSPDNYFIIKKRK*

 

Note ng1289 is missing first part of exon 2

There is no exon 1 since this is a pseudogene

 

Ng1679 = NADPH cytochrome P450 reductase not a P450

 

>aa-ng1681        Contig_0635     pred cdna   pORF 3022          992 strand r start y stop y ph 0

MLTNIIILIILYLFYDFCYKNFKYRNYGSPWALPVIGHFIHVINQPHLVVHNDRMKYNNG

RFVNYWFGDYLSIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWD

FHHGILSKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGENFDTRSNFLYFYSNILFDY

ILGKRVENIYENELRKDRKKFMVSIQEVMDSVGLIKFLNYLILSYPFLSIYLRYFTYTTF

NLKKILKQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIECDKSFVAIAIELLAAGTDTN

SSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATLKEV

LRLIPATPFSVPRMSNEGFEVDGIKIPKGTYLFPSMYSIFRDEKYWGENANKFYPERFLT

DSHSNNYFPYGVGKRMCLGSNFSQHELYICLTNIVLNFKIKSIDGKPLNEIPNYGITFRP

NIFEVKLENR

 

 

Two intron boundaries do not agree use ng1681 as correct seq

 

>CYP518B1 Seq (26+28) 50% to seq 20

          Length = 491

 

 Score = 2574 (906.1 bits), Expect = 1.8e-271, P = 1.8e-271

 Identities = 488/490 (99%), Positives = 489/490 (99%)

 

Query:     1 MLTNIIILIILYLFYDFCYKNFKYRNYGSPWALPVIGHFIHVINQPHLVVHNDRMKYNNG 60

             MLTNIIILIILYLFYDFCYKNFKYRNYGSPWALPVIGHFIHVINQPHLVVHNDRMKYNNG

Sbjct:     1 MLTNIIILIILYLFYDFCYKNFKYRNYGSPWALPVIGHFIHVINQPHLVVHNDRMKYNNG 60

 

Query:    61 RFVNYWFGDYLSIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWD 120

             RFVNYWFGDYLSIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWD

Sbjct:    61 RFVNYWFGDYLSIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWD 120

 

Query:   121 FHHGILSKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGENFDTRSNFLYFYSNILFDY 180

             FHHGILSKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGENFDTRSNFLYFYSNILFDY

Sbjct:   121 FHHGILSKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGENFDTRSNFLYFYSNILFDY 180

 

Query:   181 ILGKRVENIYENELRKDRKKFMVSIQEVMDSVGLIKFLNYLILSYPFLSIYLRYFTYTTF 240

             ILGKRVENIYENELRKDRKKFMVSIQEVMDSVGLIKFLNYLILSYPFLSIYLRYFTYTTF

Sbjct:   181 ILGKRVENIYENELRKDRKKFMVSIQEVMDSVGLIKFLNYLILSYPFLSIYLRYFTYTTF 240

 

Query:   241 NLKKILKQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIECDKSFVAIAIELLAAGTDTN 300

             NLKKILKQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIECDKSFVAIAIELLAAG DTN

Sbjct:   241 NLKKILKQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIECDKSFVAIAIELLAAG-DTN 299

 

Query:   301 SSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATLKEV 360

             SSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATLKEV

Sbjct:   300 SSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATLKEV 359

 

Query:   361 LRLIPATPFSVPRMSNEGFEVDGIKIPKGTYLFPSMYSIFRDEKYWGENANKFYPERFLT 420

             LRLIPATPFSVPRMSNEGFEVDGIKIPKGTYLFPSMYSIFRDEKYWGENANKFYPERFLT

Sbjct:   360 LRLIPATPFSVPRMSNEGFEVDGIKIPKGTYLFPSMYSIFRDEKYWGENANKFYPERFLT 419

 

Query:   421 DSHSNNYFPYGVGKRMCLGSNFSQHELYICLTNIVLNFKIKSIDGKPLNEIPN-YGITFR 479

             DSHSNNYFPYGVGKRMCLGSNFSQHELYICLTNIVLNFKIKSIDGKPLNEIP+ YGITFR

Sbjct:   420 DSHSNNYFPYGVGKRMCLGSNFSQHELYICLTNIVLNFKIKSIDGKPLNEIPSDYGITFR 479

 

Query:   480 PNIFEVKLENR 490

             PNIFEVKLENR

Sbjct:   480 PNIFEVKLENR 490

 

>CYP508C1

>aa-ng1934        Contig_0725     pred cdna   pORF 555          2164 strand f start y stop y ph 0

MIQIGKTKPHIELMKLEKIYNQRILKIWLGDYYSVFLSDIDLIKDIFINKFENFSSRPKS

PLTRLGTNDFRGINGSSGETWFKNKNIIVNAMKRANTKTIYTLLDNQ

 

FNPKYYFRKFVLSTMFKYIFNEDVPYDENLENGKLSELTMEMENIFKTLKVGKLAN

SIEILETPYYYYLQKTDKVFKNIKKLIIEKYKNHNLSINPEKPRDLLDILINEY

 

AANTLEWIIIKLCNSPIHQEIAY

NELKKVVSSKVIIDDSIKREITLSDRPNTPYIQAIIKETMRMHPVVVFGLPRYCENDIFI

GDENYFIPKGCKVFINFHSIGYNEKYFKDPYKFEPNRFLENSNNSMDSFFPFGLGNRVCL

GRQLANDQLYLVIANLILKYKLKTIDENKINEDGIFGLTVSPNKYKINLESR

 

>CYP508C1 seq 4, 23, 24, 83 complete seq 42% to seq 66 486 aa

MGPWGYIKNKRIHKNEAKGPIGFPLIGN

MIQIGKTKPHIELMKLEKIYNQRILKIWLGDYYSVFLSD

IDLIKDIFINKFENFSSRPKSPLTRLGTNDFRGINGSSGETWFKNKNIIVNAMKRANTKTIYTLLDNQ

VNDLIKEISKFESQNKS (0)

FNPKYYFRKFVLSTMFKYIFNEDVPYDENLENGKLSELTMEMENIFKTLKVGKLAN

SIEILETPYYYYLQKTDKVFKNIKKLIIEKYKNHNLSINPEKPRDLLDILINEY

GTTDDDVLNITQVTLDMFMAGTDT (1)

TANTLEWIIIKLCNSPIHQEIAYNELKKVVSSKVIIDDSIKREITLSDRPNTPYIQA

IIKETMRMHPVVVFGLPRYCENDIFIGDENYFIPKGCKVFINFHSIGYNEKYFKDPYKFE

PNRFLENSNNSMDSFFPFGLGNRVCLGRQLANDQLYLVIANLILKYKLKTIDENKINEDG

IFGLTVSPNKYKINLESR*

 

Ng1934 is missing three pieces of sequence that need to be present

My N-term may still be too short

 

Compare with 508B1

 

>CYP508B1 Seq. 15, 77  50% identical to 508A1

          Length = 485

 

 Score = 1028 (361.9 bits), Expect = 1.2e-107, P = 1.2e-107

 Identities = 207/476 (43%), Positives = 307/476 (64%)

 

Query:     5 GYIKNKRIHKNEAKGPIGFPLIGNMIQIGKTKPHIELMKLEKIYNQRILKIWLGDYYSVF 64

             G  K K+IHKNE  GP   PL+GN+ Q+GK +PH  L K+  +Y + I ++  GD Y+V

Sbjct:    18 GIKKYKKIHKNELSGPFPIPLLGNLHQLGK-EPHYTLTKMHNVYGE-IFRLHFGDVYTVV 75

 

Query:    65 LSDIDLIKDIFINKFENFSSRPKSPLTRLGTNDFRGINGSSGETWFKNKNIIVNAMKRAN 124

             +SD  LI+++F++  ENF  RP  P  + G     G++  S E W + + ++ NAMKR +

Sbjct:    76 VSDPILIREMFVDNHENFKYRPLLPTFKFGAGGDHGLS-LSNERWERTRELVQNAMKRTS 134

 

Query:   125 TKTIYTLLDNQVNDLIKEISKFESQNKSFNPKYYFRKFVLSTMFKYIFNEDVPYDENLEN 184

              K IY +LD+QV +LIK + +++     F    Y ++F LS MFKYIFNED+ YDE++ 

Sbjct:   135 IKKIYDMLDSQVYELIKSMKQYQVTGNPFEIHLYAQRFTLSIMFKYIFNEDISYDEDITK 194

 

Query:   185 GKLSELTMEMENIFKTLKVGKLANSIEILETPYYYYLQKTDEVFKN----IKKLIIEDK- 239

             GK++EL   M+ IFK+L  GKL + I I +  YY +L+ +D+ F      +KK I +  

Sbjct:   195 GKIAELVQPMDQIFKSLGSGKLGDFISIAQPFYYQWLKFSDKQFNGPFSTVKKFIYKRYL 254

 

Query:   240 NHNLSINPEKPRDLLDILINEYGTTDDDVLNITQVTLDMFMAGTDTTANTLEWIIIKLCN 299

              H  +I+ + PRDL+D+LINE+    + +  I Q +LDMF+AG DTTA ++ W ++++ 

Sbjct:   255 EHINTIDHDNPRDLMDLLINEFSNDKNLIPTILQTSLDMFLAGNDTTAASIIWFVLRMQE 314

 

Query:   300 SPIHQEIAYNELKKVVSSKVIIDDSIKREITLSDRPNTPYIQAIIKETMRMHPVVVFGLP 359

              P  Q  AYNE+K  V  +    D     + LSDRP TPY+ AIIKE +R++PV  FGLP

Sbjct:   315 HPEIQLKAYNEIKDAVGDR----D----RVLLSDRPKTPYLNAIIKEILRLNPVGPFGLP 366

 

Query:   360 RYCENDIFIGDENYFIPKGCKVFINFHSIGYNEKYFKDPYKFEPNRFLENSNNSMDSFFP 419

                 NDI IG+  YFIPK  ++ +N+  +G+NEKYF++P +F+P+RFL N NN  D++ P

Sbjct:   367 HRSSNDIVIGNGKYFIPKDSQILVNYRGLGFNEKYFENPSQFDPSRFL-NKNN--DAYMP 423

 

Query:   420 FGLGNRVCLGRQLANDQLYLVIANLILKYKLKTIDENKINEDGIFGLTVSPNKYKINLES 479

             FG+G+R C+G QLA D+ YL  +N++L + LK I    +++   FGLT+ PNK+K+ LES

Sbjct:   424 FGVGDRKCVGLQLAGDEQYLSFSNILLNFNLKNIG-TPVSDYEEFGLTLKPNKFKVLLES 482

 

Query:   480 R 480

             R

Sbjct:   483 R 483

 

>Contig_0725, D. discoideum, sequenced by the D. discoideum Sequencing

            Consortium, assembled with Phusion

        Length = 4136

 

  Plus Strand HSPs:

 

 Score = 764 (274.0 bits), Expect = 1.3e-172, Sum P(3) = 1.3e-172

 Identities = 147/147 (100%), Positives = 147/147 (100%), Frame = +3

 

Query:     6 YIKNKRIHKNEAKGPIGFPLIGNMIQIGKTKPHIELMKLEKIYNQRILKIWLGDYYSVFL 65

             YIKNKRIHKNEAKGPIGFPLIGNMIQIGKTKPHIELMKLEKIYNQRILKIWLGDYYSVFL

Sbjct:   486 YIKNKRIHKNEAKGPIGFPLIGNMIQIGKTKPHIELMKLEKIYNQRILKIWLGDYYSVFL 665

 

Query:    66 SDIDLIKDIFINKFENFSSRPKSPLTRLGTNDFRGINGSSGETWFKNKNIIVNAMKRANT 125

             SDIDLIKDIFINKFENFSSRPKSPLTRLGTNDFRGINGSSGETWFKNKNIIVNAMKRANT

Sbjct:   666 SDIDLIKDIFINKFENFSSRPKSPLTRLGTNDFRGINGSSGETWFKNKNIIVNAMKRANT 845

 

Query:   126 KTIYTLLDNQVNDLIKEISKFESQNKS 152

             KTIYTLLDNQVNDLIKEISKFESQNKS

Sbjct:   846 KTIYTLLDNQVNDLIKEISKFESQNKS 926

 

 Score = 681 (244.8 bits), Expect = 1.3e-172, Sum P(3) = 1.3e-172

 Identities = 136/147 (92%), Positives = 140/147 (95%), Frame = +1

 

Query:   153 FNPKYYFRKFVLSTMFKYIFNEDVPYDENLENGKLSELTMEMENIFKTLKVGKLANSIEI 212

             FNPKYYFRKFVLSTMFKYIFNEDVPYDENLENGKLSELTMEMENIFKTLKVGKLANSIEI

Sbjct:  1003 FNPKYYFRKFVLSTMFKYIFNEDVPYDENLENGKLSELTMEMENIFKTLKVGKLANSIEI 1182

 

Query:   213 LETPYYYYLQKTDEVFKNIKKLIIED-KNHNLSINPEKPRDLLDILINEYGTTDDDVLNI 271

             LETPYYYYLQKTD+VFKNIKKLIIE  KNHNLSINPEKPRDLLDILINEYGTTDDDVLNI

Sbjct:  1183 LETPYYYYLQKTDKVFKNIKKLIIEKYKNHNLSINPEKPRDLLDILINEYGTTDDDVLNI 1362

 

Query:   272 TQVTLDMFMAGTDTTANTLEW-IIIKL 297

             TQVTLDMFMAGTDT+  T+ + I IKL

Sbjct:  1363 TQVTLDMFMAGTDTSNYTILY*IKIKL 1443

 

sdic6B16c2.p1c has DEV not DKV

JC2b120c09.r1  has DKV not DEV

Contig_0725    has DKV not DEV

 

 Score = 290 (107.1 bits), Expect = 1.3e-172, Sum P(3) = 1.3e-172

 Identities = 57/58 (98%), Positives = 57/58 (98%), Frame = +3

 

Query:   285 TTANTLEWIIIKLCNSPIHQEIAYNELKKVVSSKVIIDDSIKREITLSDRPNTPYIQA 342

             T ANTLEWIIIKLCNSPIHQEIAYNELKKVVSSKVIIDDSIKREITLSDRPNTPYIQA

Sbjct:  1494 TAANTLEWIIIKLCNSPIHQEIAYNELKKVVSSKVIIDDSIKREITLSDRPNTPYIQA 1667

 

>aa-ng1935        Contig_0725     pred cdna   pORF 4043         2373 strand r start n stop y ph 0

VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMDFYHKMYGGIYRLWFGDYFVVSLN

DPEIIREIFIKNYSNFSSRPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNAMKKSNLK

QTYDNLSDSVNSLINLMKEFQSSNESFQPDMYLRKYALTTMFKYVFNETVSFENKIVQGE

EAELIDNINEAFNFMTLGNAGDFFKILQPLYYQYLLYRGGCFNRIRTLIRNRYIEHRKTI

DIENPRDLLDLLIIEYGDHSDENMISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQ

DTVYNELKETVVGPVVTLNDRPSTPYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFI

PKDSMVLINFYSLGRNPKDFPDPLKFDPNRFIGSTPDSFMPFGTGPRNCIGQALGMDQIY

LLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR

 

CYP508D1 Seq (59+81) 42% to seq 4  483 aa complete

MVYLNNIFIFFIIFFIYSFVKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMD

FYHKMYGGIYRLWFGDYFVVSLNDPEIIREIFIKNYSNFSS (phase 2)

RPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNAMKKSNLKQTYDNLSDSVNSLINLMKEFQSSNES

FQPDMYLRKYALTTMFKYVFNETVSFENKIVQGEEAELIDNINEAFNFMTLGNAGDFFKILQPLYYQYLLYRGGCFNRIRTLIRNR

YIEHRKTIDIENPRDLLDLLIIEYGDHSDENMISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQDTVYNELKETVVGPVVTL

NDRPSTPYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFI (intron)

PKDSMVLINFYSLGRNPKDFPDPLKFDPNRFIGSTPDSFMPFGTGPRNCI (intron)

GQALGMDQIYLLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR*

 

Ng1935 is missing n-terminal exon

1 aa diff inside change KKY to RKY then use my seq

 

>CYP508D1 Seq (59+81) 42% to seq 4  483 aa complete

          Length = 483

 

 Score = 2443 (860.0 bits), Expect = 1.4e-257, P = 1.4e-257

 Identities = 462/463 (99%), Positives = 463/463 (100%)

 

Query:     1 VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMDFYHKMYGGIYRLWFGDYFVVSLN 60

             VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMDFYHKMYGGIYRLWFGDYFVVSLN

Sbjct:    20 VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMDFYHKMYGGIYRLWFGDYFVVSLN 79

 

Query:    61 DPEIIREIFIKNYSNFSSRPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNAMKKSNLK 120

             DPEIIREIFIKNYSNFSSRPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNAMKKSNLK

Sbjct:    80 DPEIIREIFIKNYSNFSSRPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNAMKKSNLK 139

 

Query:   121 QTYDNLSDSVNSLINLMKEFQSSNESFQPDMYLRKYALTTMFKYVFNETVSFENKIVQGE 180

             QTYDNLSDSVNSLINLMKEFQSSNESFQPDMYL+KYALTTMFKYVFNETVSFENKIVQGE

Sbjct:   140 QTYDNLSDSVNSLINLMKEFQSSNESFQPDMYLKKYALTTMFKYVFNETVSFENKIVQGE 199

 

Query:   181 EAELIDNINEAFNFMTLGNAGDFFKILQPLYYQYLLYRGGCFNRIRTLIRNRYIEHRKTI 240

             EAELIDNINEAFNFMTLGNAGDFFKILQPLYYQYLLYRGGCFNRIRTLIRNRYIEHRKTI

Sbjct:   200 EAELIDNINEAFNFMTLGNAGDFFKILQPLYYQYLLYRGGCFNRIRTLIRNRYIEHRKTI 259

 

Query:   241 DIENPRDLLDLLIIEYGDHSDENMISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQ 300

             DIENPRDLLDLLIIEYGDHSDENMISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQ

Sbjct:   260 DIENPRDLLDLLIIEYGDHSDENMISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQ 319

 

Query:   301 DTVYNELKETVVGPVVTLNDRPSTPYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFI 360

             DTVYNELKETVVGPVVTLNDRPSTPYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFI

Sbjct:   320 DTVYNELKETVVGPVVTLNDRPSTPYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFI 379

 

Query:   361 PKDSMVLINFYSLGRNPKDFPDPLKFDPNRFIGSTPDSFMPFGTGPRNCIGQALGMDQIY 420

             PKDSMVLINFYSLGRNPKDFPDPLKFDPNRFIGSTPDSFMPFGTGPRNCIGQALGMDQIY

Sbjct:   380 PKDSMVLINFYSLGRNPKDFPDPLKFDPNRFIGSTPDSFMPFGTGPRNCIGQALGMDQIY 439

 

Query:   421 LLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR 463

             LLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR

Sbjct:   440 LLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR 482

 

.CYP521A1 100% match

>aa-ng2405        Contig_0941     pred cdna   pORF 14409       12827 strand r start y stop y ph 0

MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRTDPYKTLAKASKK

TEHGILKCWNGEHLMVVVDNPSIIKQMYVNTNNFTDRPQTKVFEIISRNYKNSGFANGEK

WKHLRGLYAPSFTKIKSRPHENIILKYVNFEIKSLKNHAITNSIYNPFLIENINSFGTKV

ITEIIFGREFSENEVYSLIGPMNKLFGILDTPFPSESISFLKPFYRRSYKECDKQCEELF

KLVEKVYDDHLLNLDKDNPKDVMDVMIVETDFKEKDHVICICCDLLMGTKDTFNTIVLWF

FVLMINYQDVQLKGYQEIIKVLECTGRDHVTIEDIDKLPYIDGIIKEISRIHPAGPLSVP

RTAINDIMINGYFIPKGCHVFQNTYGAVYNYMKESDEPCKMKPERWIENEKLRKDGKLDP

TNDLALISLPFSSGIRNCPGVGFAEYELFLLFSNIILNFHLSSPNNLKLNESGHFGLTMK

PFPFLVDLKLR

 

>aa-ng2647        Contig_1035     pred cdna   pORF 453          2253 strand f start y stop y ph 0

MITFIIYKNFDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGKVFSMKFGS

YDTVILNEPDVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNISL

TKSKTRLLEGLFNQEYFRFDQFIKKQLKSKSDTMFIRPYLKRLSFNIIYSYLFSETIPYE

DELIPSDILDFIHASEELLVVLSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKK

VELLDQDNPKCFIDYLILQIRSDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLS

NHESIQEKLYNEICKNTNTNTTATTTENDNESYFPKLIEKNNYPLFNACVKETLRRSPPV

PLGLPHLCSEDTEIGGYLIPKGTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTVT

FSTGPRVCPGKNLSEDELFSFGTKLFKTFKFSRPNKNQLYDEVAILGITLEPKPFITKVS

LRS

 

>CYP515B1 Seq. 6 34% to seq 5

          Length = 498

 

 Score = 2437 (857.9 bits), Expect = 6.0e-257, P = 6.0e-257

 Identities = 468/472 (99%), Positives = 470/472 (99%)

 

Query:    13 KRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYG-KVFSMKFGSYDTVILNEPDV 71

             KRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYG KVFSMKFGSYDTVILNEPDV

Sbjct:    26 KRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGRKVFSMKFGSYDTVILNEPDV 85

 

Query:    72 IVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNISLTKSKTRLLEGL 131

             IVEAFHLNSTSFMDRILLPSFEVVGK +NIGFTQTEYWKKIRGILNISLTKSKTRLLEGL

Sbjct:    86 IVEAFHLNSTSFMDRILLPSFEVVGKKKNIGFTQTEYWKKIRGILNISLTKSKTRLLEGL 145

 

Query:   132 FNQEYFRFDQFIKKQLKSKSDTMFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDF 191

             FNQEYFRFDQFIKKQLKSKSDTMFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDF

Sbjct:   146 FNQEYFRFDQFIKKQLKSKSDTMFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDF 205

 

Query:   192 IHASEELLVVLSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKKVELLDQDNPKC 251

             IHASEELLVVLSRMPSDYIKVLRPFESHKK+KEICDRMSKFIKPRVDKKVELLDQDNPKC

Sbjct:   206 IHASEELLVVLSRMPSDYIKVLRPFESHKKIKEICDRMSKFIKPRVDKKVELLDQDNPKC 265

 

Query:   252 FIDYLILQIRSDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYN 311

             FIDYLILQIRSDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYN

Sbjct:   266 FIDYLILQIRSDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYN 325

 

Query:   312 EICKNTNTNTTATTTENDNESYFPKLIEKNNYPLFNACVKETLRRSPPVPLGLPHLCSED 371

             EICKNTNTNTTATTTENDNESYFPKLIEKNNYPLFNACVKETLRRSPPVPLGLPHLCSED

Sbjct:   326 EICKNTNTNTTATTTENDNESYFPKLIEKNNYPLFNACVKETLRRSPPVPLGLPHLCSED 385

 

Query:   372 TEIGGYLIPKGTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTVTFSTGPRVCPGK 431

             TEIGGYLIPKGTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTVTFSTGPRVCPGK

Sbjct:   386 TEIGGYLIPKGTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTVTFSTGPRVCPGK 445

 

Query:   432 NLSEDELFSFGTKLFKTFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS 483

             NLSEDELFSFGTKLFKTFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS

Sbjct:   446 NLSEDELFSFGTKLFKTFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS 497

 

JC3e13g02.r1 has KLK not KIK

JC3a84h08.r1 has KLK not KIK fix this in my seq

 

JC2b306h10.s1 has KNQNIG not KKKNIG

JC3e86b07.r1 has KNQNIG not KKKNIG fix this in my seq

 

JAX4a105a03.r1 has KYGKV not KYGRKV

JC3a40f05.s1 has KYGKV not KYGRKV fix in my seq

 

CYP515B1 Seq. 6 34% to seq 5

MATLLLVIIFMITFIIYKN (0)

FDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYG

KVFSMKFGSYDTVILNEPDVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFT

QTEYWKKIRGILNISLTKSKTRLLEGLFNQEYFRFDQFIKKQLKSKSDT (0)

MFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVVLSRMPS

DYIKVLRPFESHKKLKEICDRMSKFIKPRVDKKVELLDQD

NPKCFIDYLILQIRSDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYNEIC

KNTNTNTTATTTENDNESYFPKLIEKNNYPLFNACV

KETLRRSPPVPLGLPHLCSED TEIGGYLIPKGTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTV

TFSTGPRVCPGKNLSEDELFSFGTKLFK

TFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS*

 

>IIAFP1D83520 45769 letters

        Length = 45,769

 

  Plus Strand HSPs:

 

 Score = 754 (270.5 bits), Expect = 7.1e-118, Sum P(3) = 7.1e-118

 Identities = 144/145 (99%), Positives = 145/145 (100%), Frame = +1

 

Query:    10 FDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGKVFSMKFGSYDTVILNEP 69

             FDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGKVFSMKFGSYDTVILNEP

Sbjct:   727 FDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGKVFSMKFGSYDTVILNEP 906

 

Query:    70 DVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNISLTKSKTRLLE 129

             DVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNISLTKSKTRLLE

Sbjct:   907 DVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNISLTKSKTRLLE 1086

 

Query:   130 GLFNQEYFRFDQFIKKQLKSKSDTM 154

             GLFNQEYFRFDQFIKKQLKSKSDT+

Sbjct:  1087 GLFNQEYFRFDQFIKKQLKSKSDTV 1161

 

 Score = 450 (163.5 bits), Expect = 7.1e-118, Sum P(3) = 7.1e-118

 Identities = 91/99 (91%), Positives = 92/99 (92%), Frame = +1

 

Query:   142 FIKKQLKSKSDTMFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVV 201

             FI   +K K   MFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVV

Sbjct:  1243 FIFLNIKKK---MFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVV 1413

 

Query:   202 LSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKK 240

             LSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKK

Sbjct:  1414 LSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKK 1530

 

 Score = 46 (21.3 bits), Expect = 7.1e-118, Sum P(3) = 7.1e-118

 Identities = 9/9 (100%), Positives = 9/9 (100%), Frame = +3

 

Query:     1 MITFIIYKN 9

             MITFIIYKN

Sbjct:   462 MITFIIYKN 488

 

>514A4

>aa-ng2792        Contig_1093     pred cdna   pORF 7469         9223 strand f start y stop y ph 0

MNTIFTIILTITILVLSLILKDLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV

NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFLGKDYNIFS

SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA

VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS

YFFIKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI

LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY

PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA

LEFKPERFKNQPLNQKGLFHFGAGARACPGGRFTESFTFTFLVIMLKNFKIVNPTDIPID

VEGEVGLAMQCKPFDALFIKRN

 

Note only 7aa diffs to ng1282 are there two genes?

 

CYP514A1 Seq. 11 complete seq 35% to seq 7

MNTIFTIILTITILVLSLILK (0)

DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVF

RLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIISS

NGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKN

TPNQIRILAVKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYL

PFLFPLDIKSMSKNDIISSYFFLKDFIGIKLDAIKIKYEKENELKNETT

DETSSKLNNIPIIENYYKNYLDGSIHYDS

ILFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMN (1)

ESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEF

WGEDALEFKPERFKNQPLYQKGLFHFGA (1)

GPRGCPGGRFTESFTFTFLVIMLKNFKIVNPTDIPI

DVEGEVGLAMQCKPFDALFIKRN*

 

6aa diffs

 

JC2b365b08.s1  FGKDYNIIS

JC3c23c03.s1   FGKDYNIIS

JC3a164c10.s1  FGKDYNIIS

IIAGP1D6746    FGKDYNIIS

IIAGP1D4567    FGKDYNIIS

JAX4a225h05.s1 LGKDYNIFS

IIAGP1D1903    LGKDYNIFS

JAX4a225h05.r1 LGKDYNIFX

SFH636         LGKDYNIFS

 

Two different genes present

 

JC3a164c10.s1 YFFLKD

JC3f62d07.s1  YFFLKD

JC1a54g04.r1  YFFLKD use

 

SSM757 PLN and GARA

IIAFP1D28194   GARA

JC3e102f08.s1 PLY

JC3c13d09.r1  PLY

JC3c13f09.r1  PLY

 

GGTGAAGATGCTTTAGAATTTAAACCAGAAAGATTTAAAAATCAACCTTTATATCAAAAA

GGACTTTTTCATTTTGGTGCAGGTATGTTTTGATACCATTTTAAAAAGGAATTAAAATTT

            F  G  A  G 

 

 

TTTATTTAAACAGGAGCTAGAGCTTGTCCAGGAGGAAGATT

            G  A  R  A

 

>CYP514A1 Seq. 11 complete seq 35% to seq 7

          Length = 503

 

 Score = 2557 (900.1 bits), Expect = 1.2e-269, P = 1.2e-269

 Identities = 496/502 (98%), Positives = 497/502 (99%)

 

Query:     1 MNTIFTIILTITILVLSLILKDLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV 60

             MNTIFTIILTITILVLSLILKDLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV

Sbjct:     1 MNTIFTIILTITILVLSLILKDLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV 60

 

Query:    61 NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFLGKDYNIFS 120

             NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRF GKDYNI S

Sbjct:    61 NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIIS 120

 

Query:   121 SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA 180

             SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA

Sbjct:   121 SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA 180

 

Query:   181 VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS 240

             VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS

Sbjct:   181 VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS 240

 

Query:   241 YFFIKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI 300

             YFF+KDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI

Sbjct:   241 YFFLKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI 300

 

Query:   301 LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY 360

             LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY

Sbjct:   301 LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY 360

 

Query:   361 PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA 420

             PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA

Sbjct:   361 PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA 420

 

Query:   421 LEFKPERFKNQPLNQKGLFHFGAGARACPGGRFTESFTFTFLVIMLKNFKIVNPTDIPID 480

             LEFKPERFKNQPL QKGLFHFGAG R CPGGRFTESFTFTFLVIMLKNFKIVNPTDIPID

Sbjct:   421 LEFKPERFKNQPLYQKGLFHFGAGPRGCPGGRFTESFTFTFLVIMLKNFKIVNPTDIPID 480

 

Query:   481 VEGEVGLAMQCKPFDALFIKRN 502

             VEGEVGLAMQCKPFDALFIKRN

Sbjct:   481 VEGEVGLAMQCKPFDALFIKRN 502

 

>CYP508A1 missing N-terminal 100% match

>aa-ng3018        Contig_1182     pred cdna   pORF 2352          535 strand r start n stop y ph 0

ISKYKKIHVNELCGPTPIPILGNLHQFGELPHRVLTKMTKKYGHILRVYMADMYTVVVSD

PLLIREMYVDNSDIFTDRVKKPSVEHGTFYHGTVTSYGEHWKNNREIVGKAMRKTNLKHI

YELLDKQVDVLIRSMKSIETSGKTFDTRYYITKFTMSAMFKFLFNHDIPEDEDINKGDTQ

KLMGPMSEVFQNAGRGSLFDVINITQPLYLLYLEMFDQSFKDIMKYHREKYNEHLKTFDP

DVERDLLDILIKEYGTDNDDKILSILATINDFFLAGVDTSSTALESMVLMLTNYPEIQEK

AFDEIKTVVNGRSKVNLSDRQSTPYLVAVIKETLRYKPMSPFGLPRSSSKDCMIGGHFIP

KNAQILINYQALGMNEEYYENPEQFDPSRFLKVESNVAFLPFSIGIRSCVGQSFAQDELY

ICISNILLNFKLKSIDGKKIDETEEYGLTLKTKNRFNVTLEKRII

 

>CYP508A1 complete (old seqs. 1 and 2) 28% to seq 8

MALFEIIISLFVVYIIHNAISKYKKIHVNELCGPTPIPILGNLHQFGELPHRVLTKMTKK

YGHILRVYMADMYTVVVSDPLLIREMYVDNSDIFTDRVKKPSVEHGTFYHGTVTSYGEHW

KNNREIVGKAMRKTNLKHIYELLDKQVDVLIRSMKSIETSGKTFDTRYYITKFTMSAMFK

FLFNHDIPEDEDINKGDTQKLMGPMSEVFQNAGRGSLFDVINITQPLYLLYLEMFDQSFK

DIMKYHREKYNEHLKTFDPDVERDLLDILIKEYGTDNDDKILSILATINDFFLAGVDTSS

TALESMVLMLTNYPEIQEKAFDEIKTVVNGRSKVNLSDRQSTPYLVAVIKETLRYKPMSP

FGLPRSSSKDCMIGGHFIPKNAQILINYQALGMNEEYYENPEQFDPSRFLKVESNVAFLP

FSIGIRSCVGQSFAQDELYICISNILLNFKLKSIDGKKIDETEEYGLTLKTKNRFNVTLE

KRII*

 

Add N-term exon

 

 

>513E1

>aa-ng3101        Contig_1216     pred cdna   pORF 338          1958 strand f start y stop y ph 0

MDLYISILILIISLIIFFKNNNRISKINSKIPGPIRLPIFGNLLQINKDPHIQFQKWYEK

YGVIYSIRLGNIETVVFTGYPIFKKAFIENSQIFAPRFQHLSRFEANGCKNLIGSNDEIH

STLKKLILTEITSNKIKKMENHIVLECENLCKQLDKHCQDGLPFSLNMYFKLFSLNIILR

FLFGTINNSYQDKSNQDIVDVIIEFLHYGGNPIMSDFIPILKPFYKQNKFFKFYPILCDH

LNKLIENYKNNKQLQKQQKQQQNEDDDDDDDGTIIGKLLKEYHNGKISWTSVVSTCVDVF

LAGVDSTSNSTIFTLIALVNNSNCQEKLFNEIKNNLKKSDDGHDEIVIRHSLYRSSIPYL

SLVMKEVYRLYSVILIGLPHITSEDVEIEGYKIAKGTQIIQNVFSTHLCEKTFPMSKSFI

PERFIETGSNNMFGGGQTNLVHFGTGVRDCVGKSLADCEIFTVLATLINRYQFINPTLEP

LNDIGSFGIAYQPPINNFIIKKRL

 

Contig 1216 shows D here

JC3e44c07.r1 shows N.  N is more like other seqs so use N

 

>513E1 seq 90+52+51 39% to seq 5+56; 42% to seq 8; 47% to seq 37 shorter match

       Length = 505

 

 Score = 2635 (927.6 bits), Expect = 6.4e-278, P = 6.4e-278

 Identities = 503/504 (99%), Positives = 504/504 (100%)

 

Query:     1 MDLYISILILIISLIIFFKNNNRISKINSKIPGPIRLPIFGNLLQINKDPHIQFQKWYEK 60

             M+LYISILILIISLIIFFKNNNRISKINSKIPGPIRLPIFGNLLQINKDPHIQFQKWYEK

Sbjct:     1 MNLYISILILIISLIIFFKNNNRISKINSKIPGPIRLPIFGNLLQINKDPHIQFQKWYEK 60

 

Note 1 aa diff at aa2

 

>aa-ng4137        Contig_1639     pred cdna   pORF 484          2497 strand f start y stop y ph 0

MILGIILGLFIYIYLINIKFFNRAVPSSLLVGENKLKCKFRSGPLILPIIGSLYKLSLKY

PHLSFKQLSDKYGKVFSLKMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGKY

QGIIFGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKKINQDNNIDMGPI

FKKILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILKPFNNY

KEIEKEYNNCLNFFQPLIDNILKNISDDDDDGNEPKCFLEYFISEIRKDTSNLIKITDLP

YICFDIIVAGIVTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSII

KETLRISPPAPFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDHF

LSKDELNIGQCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTRS

LSPFEFKSKLIIRK

 

>CYP515A1 Seq 79b  4 diffs

          Length = 495

 

 Score = 2569 (904.3 bits), Expect = 6.3e-271, P = 6.3e-271

 Identities = 491/494 (99%), Positives = 492/494 (99%)

 

Query:     1 MILGIILGLFIYIYLINIKFFNRAV-PSSLLVGENKLKCKFRSGPLILPIIGSLYKLSLK 59

             MILGIILGLFIYIYLINIKFFNRAV PSSLLVGENKLKCKF SGPLILPIIGSLYKLSLK

Sbjct:     1 MILGIILGLFIYIYLINIKFFNRAVLPSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLK 60

 

Query:    60 YPHLSFKQLSDKYGKVFSLKMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGK 119

             YPHLSFKQLSDKYGKVFSLKMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGK

Sbjct:    61 YPHLSFKQLSDKYGKVFSLKMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGK 120

 

Query:   120 YQGIIFGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKKINQDNNIDMGP 179

             YQGIIFGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKKINQDNNIDMGP

Sbjct:   121 YQGIIFGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKKINQDNNIDMGP 180

 

Query:   180 IFKKILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILKPFNN 239

             IFK+ILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILKPFNN

Sbjct:   181 IFKRILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILKPFNN 240

 

Query:   240 YKEIEKEYNNCLNFFQPLIDNILKNISDDDDDGNEPKCFLEYFISEIRKDTSNLIKITDL 299

             YKEIEKEYNNCLNFFQPLIDNILKNISDDDDDGNEPKCFLEYFISEIRKDTSNLIKITDL

Sbjct:   241 YKEIEKEYNNCLNFFQPLIDNILKNISDDDDDGNEPKCFLEYFISEIRKDTSNLIKITDL 300

 

Query:   300 PYICFDIIVAGIVTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSI 359

             PYICFDIIVAGIVTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSI

Sbjct:   301 PYICFDIIVAGIVTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSI 360

 

Query:   360 IKETLRISPPAPFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDH 419

             IKETLRISPP PFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDH

Sbjct:   361 IKETLRISPP-PFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDH 419

 

Query:   420 FLSKDELNIGQCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTR 479

             FLSKDELNIGQCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTR

Sbjct:   420 FLSKDELNIGQCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTR 479

 

Query:   480 SLSPFEFKSKLIIRK 494

             SLSPFEFKSKLIIRK

Sbjct:   480 SLSPFEFKSKLIIRK 494

 

>JC3a177b07.3764 183690 letters

        Length = 183,692

 

  Minus Strand HSPs:

 

Does not have the L does have the P remove the L keep P

 

Query:    19 KFFNRAVLPSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKY 61

             KFFNRAV PSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKY

Sbjct: 49578 KFFNRAV-PSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKY 49453

 

R present in both RILLN  514A1 and 514A2P use R here

 

Query:    67 NQDNNI-DMGPIFKRILLNILYRFLFGVSFEYDD 99

              +     DMGPIFKRILLNILYRFLFGVSFEYDD

Sbjct: 46162 KKKKKKKDMGPIFKRILLNILYRFLFGVSFEYDD 46061

 

 Score = 142 (55.0 bits), Expect = 1.6e-34, Sum P(4) = 1.6e-34

 Identities = 27/27 (100%), Positives = 27/27 (100%), Frame = -1

 

Query:    73 DMGPIFKRILLNILYRFLFGVSFEYDD 99

             DMGPIFKRILLNILYRFLFGVSFEYDD

Sbjct: 48929 DMGPIFKRILLNILYRFLFGVSFEYDD 48849

 

>JC2c172c05.r1 Clone JC2c172c05, reverse read, bases 57 through 651, from

            2000-01-17

        Length = 593

 

  Minus Strand HSPs:

 

Query:   100 IKETLRISP 108

             IKETLRISP

Sbjct:   474 IKETLRISPcgt 448 coud be either PPP or PPAP use PPAP

 

Query:   110 PFALPHICTDDIVIDDIFIPKNTQVIPNIY 139

             PFALPHICTDDIVIDDIFIPKNTQVIPNIY

agcagca  348 PFALPHICTDDIVIDDIFIPKNTQVIPNIY 259

 

 

>aa-ng4360        Contig_1738     pred cdna   pORF 283          2061 strand f start y stop y ph 0

MIMLIKVFVLLLVYKKYKKLDKNELKGPTPIPVLGNLHQLSSLPHRDLSKMTKDYGDIFR

VWYTVVISDPVLIRKIYVENHESFRDRPKIPSMKYGTYYHGTAASMGEDWVRNRGIVSSA

MRKSNIKHIYEVINNQVDVLMSTMKKYEKRSEPFEPRYYMTKYTMAAMFKYIFNEDIGED

EDIHTGEIQKIMGPMNQVMEDFGTGSLFDVLEISQTFYLKWLELTEKNFPLLLKFFNGRY

EQHLETIKPESPRDLLDILINEYGTNTHDDYLNIASTVLDFFFAGTDTSSTTLEYLFLMM

ANYPEIQDKVHQEVKSYLKQIGKDKVELNDRQSLPYVVAVIKETLRFKPVTPFGVPRSCV

NEITIDEKYFIPKGAQVIINYPSIFENEKYFKNANQFDPSRFLQTTTTNTASNEESSFNS

NLAFIPFSIGPRNCVGMQFAQDELFLAFANIVLNFTIKSVDGKKIDETISYGVTLKPKTR

FKVLLEKRLI

 

>CYP508A4 seq (66+70) complete 55% to CYP508 3 indel diffs

          Length = 502

 

 Score = 2507 (882.5 bits), Expect = 2.4e-264, P = 2.4e-264

 Identities = 490/502 (97%), Positives = 490/502 (97%)

 

Query:     1 MIMLIKVFVLLLV-------YKKYKKLDKNELKGPTPIPVLGNLHQLSSLPHRDLSKMTK 53

             MIMLIKVFVLLLV       YKKYKKLDKNELKGPTPIPVLGNLHQLSSLPHRDLSKMTK

Sbjct:     1 MIMLIKVFVLLLVVYILHNSYKKYKKLDKNELKGPTPIPVLGNLHQLSSLPHRDLSKMTK 60

 

Query:    54 DYGDIFRVW----YTVVISDPVLIRKIYVENHESFRDRPKIP-SMKYGTYYHGTAASMGE 108

             DYGDIFRVW    YTVVISDPVLIRKIYVENHESFRDRPKIP SMKYGTYYHGTAASMGE

Sbjct:    61 DYGDIFRVWFADLYTVVISDPVLIRKIYVENHESFRDRPKIPKSMKYGTYYHGTAASMGE 120

 

Query:   109 DWVRNRGIVSSAMRKSNIKHIYEVINNQVDVLMSTMKKYEKRSEPFEPRYYMTKYTMAAM 168

             DWVRNRGIVSSAMRKSNIKHIYEVINNQVDVLMSTMKKYEKRSEPFEPRYYMTKYTMAAM

Sbjct:   121 DWVRNRGIVSSAMRKSNIKHIYEVINNQVDVLMSTMKKYEKRSEPFEPRYYMTKYTMAAM 180

 

Query:   169 FKYIFNEDIGEDEDIHTGEIQKIMGPMNQVMEDFGTGSLFDVLEISQTFYLKWLELTEKN 228

             FKYIFNEDIGEDEDIHTGEIQKIMGPMNQVMEDFGTGSLFDVLEISQTFYLKWLELTEKN

Sbjct:   181 FKYIFNEDIGEDEDIHTGEIQKIMGPMNQVMEDFGTGSLFDVLEISQTFYLKWLELTEKN 240

 

Query:   229 FPLLLKFFNGRYEQHLETIKPESPRDLLDILINEYGTNTHDDYLNIASTVLDFFFAGTDT 288

             FPLLLKFFNGRYEQHLETIKPESPRDLLDILINEYGTNTHDDYLNIASTVLDFFFAGTDT

Sbjct:   241 FPLLLKFFNGRYEQHLETIKPESPRDLLDILINEYGTNTHDDYLNIASTVLDFFFAGTDT 300

 

Query:   289 SSTTLEYLFLMMANYPEIQDKVHQEVKSYLKQIGKDKVELNDRQSLPYVVAVIKETLRFK 348

             SSTTLEYLFLMMANYPEIQDKVHQEVKSYLKQIGKDKVELNDRQSLPYVVAVIKETLRFK

Sbjct:   301 SSTTLEYLFLMMANYPEIQDKVHQEVKSYLKQIGKDKVELNDRQSLPYVVAVIKETLRFK 360

 

Query:   349 PVTPFGVPRSCVNEITIDEKYFIPKGAQVIINYPSIFENEKYFKNANQFDPSRFLQTTTT 408

             PVTPFGVPRSCVNEITIDEKYFIPKGAQVIINYPSIFENEKYFKNANQFDPSRFLQTTTT

Sbjct:   361 PVTPFGVPRSCVNEITIDEKYFIPKGAQVIINYPSIFENEKYFKNANQFDPSRFLQTTTT 420

 

Query:   409 NTASNEESSFNSNLAFIPFSIGPRNCVGMQFAQDELFLAFANIVLNFTIKSVDGKKIDET 468

             NTASNEESSFNSNLAFIPFSIGPRNCVGMQFAQDELFLAFANIVLNFTIKSVDGKKIDET

Sbjct:   421 NTASNEESSFNSNLAFIPFSIGPRNCVGMQFAQDELFLAFANIVLNFTIKSVDGKKIDET 480

 

Query:   469 ISYGVTLKPKTRFKVLLEKRLI 490

             ISYGVTLKPKTRFKVLLEKRLI

Sbjct:   481 ISYGVTLKPKTRFKVLLEKRLI 502

 

>VFK340 (VFK340Q) /pub/dna_csm/LIBRARY/VF/VFK3-B/VFK340Q.Seq.d/

        Length = 681

 

  Plus Strand HSPs:

 

 Score = 310 (114.2 bits), Expect = 9.3e-27, P = 9.3e-27

 Identities = 59/60 (98%), Positives = 59/60 (98%), Frame = +3

 

Query:     1 DYGDIFRVWFADLYTVVISDPVLIRKIYVENHESFRDRPKIPKSMKYGTYYHGTAASMGE 60

             DYGDIFRVWFADLYTVVISDPVLIRKIYVENHESFRDRPKIP SMKYGTYYHGTAASMGE

Sbjct:   204 DYGDIFRVWFADLYTVVISDPVLIRKIYVENHESFRDRPKIP-SMKYGTYYHGTAASMGE 380

 

Remove K but keep FADL

 

 

>aa-ng4501        Contig_1803     pred cdna   pORF 4169         5254 strand f start y stop y ph 0

MEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVE

HVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLM

DDLLIEYGLNEITKEDTMQINQICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEI

KKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSNGFGLPRTAKNDQICGDFFIPKDAIIF

INYLEISQNEEIFKNAKEFNPTRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNL

LLKYKLKSQNGEKIDDTIHFSVSLKAKDYGIKLEKRI

 

9 seqs support the ng4501 ending

 

12 seqs support my ending

 

clearly there are two genes probably 517A1 and 517A4

 

my ending seems to be from 517A4 swap ends

 

>CYP517A1 Seq. 3 complete 37% to 508 484 aa short, 3 diffs at end

          Length = 484

 

 Score = 1756 (618.1 bits), Expect = 9.0e-185, P = 9.0e-185

 Identities = 334/337 (99%), Positives = 336/337 (99%)

 

Query:     1 MEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVE 60

             MEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVE

Sbjct:   147 MEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVE 206

 

Query:    61 HVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLM 120

             HVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLM

Sbjct:   207 HVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLM 266

 

Query:   121 DDLLIEYGLNEITKEDTMQINQICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEI 180

             DDLLIEYGLNEITKEDTMQINQICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEI

Sbjct:   267 DDLLIEYGLNEITKEDTMQINQICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEI 326

 

Query:   181 KKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSNGFGLPRTAKNDQICGDFFIPKDAIIF 240

             KKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSNGFGLPRTAKNDQICGDFFIPKDAIIF

Sbjct:   327 KKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSNGFGLPRTAKNDQICGDFFIPKDAIIF 386

 

Query:   241 INYLEISQNEEIFKNAKEFNPTRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNL 300

             INYLEISQNEEIFKNAKEFNPTRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNL

Sbjct:   387 INYLEISQNEEIFKNAKEFNPTRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNL 446

 

Query:   301 LLKYKLKSQNGEKIDDTIHFSVSLKAKDYGIKLEKRI 337

             LLKYKLK+QNGEKIDD+I FSVSLKAKDYGIKLEKRI

Sbjct:   447 LLKYKLKTQNGEKIDDSIQFSVSLKAKDYGIKLEKRI 483

 

>aa-ng5440        Contig_2215     pred cdna   pORF 8556         6953 strand r start y stop y ph 0

MEIVNVLLFLIILFLVKDFVKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELI

GNYSKKYNGVVRAWIGERLFLFVSNYDVVKYFQKDENFLYRPSLLVPGWRYASSNGLGVM

SSSDDEWKRAKSSVSQSLRVHTSKKLMEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACS

MLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCD

IVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQINQICFDIFGPAVGT

VTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSN

GFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNPTRYLDESLPVPNK

HFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKTQNGEKIDDSIQFSVSLKAKDYGIKLE

KRI

 

This is 517A4

 

>CYP517A1 Seq. 3 complete 37% to 508 484 aa 13 diffs second gene???

          Length = 484

 

 Score = 2460 (866.0 bits), Expect = 2.3e-259, P = 2.3e-259

 Identities = 470/483 (97%), Positives = 474/483 (98%)

 

Query:     1 MEIVNVLLFLIILFLVKDFVKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELI 60

             MEI+NV LFLIILFLVKDFVKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHEL

Sbjct:     1 MEIINVFLFLIILFLVKDFVKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELF 60

 

Query:    61 GNYSKKYNGVVRAWIGERLFLFVSNYDVVKYFQKDENFLYRPSLLVPGWRYASSNGLGVM 120

             G YSKKYNGVVRAW GERLF FVSNYDVVKYFQKDENF  RPS+LVPGWRYASSNGLGVM

Sbjct:    61 GGYSKKYNGVVRAWFGERLFFFVSNYDVVKYFQKDENFHNRPSVLVPGWRYASSNGLGVM 120

 

Query:   121 SSSDDEWKRAKSSVSQSLRVHTSKKLMEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACS 180

             SSSDD+WKRAKSSVSQSLRV T+KKLMEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACS

Sbjct:   121 SSSDDKWKRAKSSVSQSLRVRTTKKLMEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACS 180

 

Query:   181 MLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCD 240

             MLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCD

Sbjct:   181 MLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCD 240

 

Query:   241 IVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQINQICFDIFGPAVGT 300

             IVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQINQICFDIFGPAVGT

Sbjct:   241 IVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQINQICFDIFGPAVGT 300

 

Query:   301 VTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSN 360

             VTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSN

Sbjct:   301 VTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSN 360

 

Query:   361 GFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNPTRYLDESLPVPNK 420

             GFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNPTRYLDESLPVPN

Sbjct:   361 GFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNPTRYLDESLPVPNI 420

 

Query:   421 HFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKTQNGEKIDDSIQFSVSLKAKDYGIKLE 480

             HFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKTQNGEKIDDSIQFSVSLKAKDYGIKLE

Sbjct:   421 HFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKTQNGEKIDDSIQFSVSLKAKDYGIKLE 480

 

Query:   481 KRI 483

             KRI

Sbjct:   481 KRI 483

 

 

>aa-ng4838        Contig_1954     pred cdna   pORF 3150         1572 strand r start y stop y ph 0

MTSLTLYLIIFSIILYLFVNRNKRKNLKIPGPNGIPIFGNLLSLSGEMHLTLQEWYKTYG

SVFSIRMGNIDTVVLTEYPTIRKAFVDNSLAFASRYQLKSRVVLTGAKDLAIQNGEIHSL

LKKVVLSEMTTTKIKRMEIHIIKETEKILKILDKHAERGEPFIINNYLNMFSMNVILRFL

LGIDYPYENVDETVGYVKSIKSFFAVAGLPILSDFIPIPLKKSGVFFDSYKELEIETDKL

IEKFKKSRNEKIENGTYNEEEDESILSKLLKEYEHGNITWECVSHTCIDIISAGTDTSAN

TLVMALIELINNQEIQSKAFSSIRSSsTFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYI

IQNIFSSHRSDKIFQSPNEFIPERFFESSQNQGLIHFGLGVRDCVGKSLAECEIFTLIAT

LLNRYQFINPNNSKKLNDIGTFGLAQVCPDTNIILKKRI

 

>CYP513A3 Seq. (7+50)complete seq 49% to seq 8 2 indels 2 aa diffs

          Length = 491

 

Add the VN to exon 1

 

This seq missing part of the K-helix in cDNA SSI868

(removed as intron?)

Restore this seq

 

Change the two aa diffs to match ng4838

 

 

>SSI868 (SSI868Q) /pub/dna_csm/LIBRARY/SS/SSI8-C/SSI868Q.Seq.d/

        Length = 669

 

  Plus Strand HSPs:

 

 Score = 594 (214.2 bits), Expect = 7.6e-57, P = 7.6e-57

 Identities = 118/118 (100%), Positives = 118/118 (100%), Frame = +3

 

Query:     9 DIISAGTDTSANTLVMALIELINNQEIQSKAFSSIRSSCLNDSNDDDDDDEIVITHSKYR 68

             DIISAGTDTSANTLVMALIELINNQEIQSKAFSSIRSSCLNDSNDDDDDDEIVITHSKYR

Sbjct:    12 DIISAGTDTSANTLVMALIELINNQEIQSKAFSSIRSSCLNDSNDDDDDDEIVITHSKYR 191

 

Query:    69 SLLPYISMIIKETFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYIIQNIFSSHRSDKI 126

             SLLPYISMIIKETFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYIIQNIFSSHRSDKI

Sbjct:   192 SLLPYISMIIKETFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYIIQNIFSSHRSDKI 365

 

 

 

Query:     1 MTSLTLYLIIFSIILYLFVNRNKRKNLKIPGPNGIPIFGNLLSLSGEMHLTLQEWYKTYG 60

             MTSLTLYLIIFSIILYLF  RNKRKNLKIPGPNGIPIFGNLLSLSGEMHLTLQEWYKTYG

Sbjct:     1 MTSLTLYLIIFSIILYLF--RNKRKNLKIPGPNGIPIFGNLLSLSGEMHLTLQEWYKTYG 58

 

Query:    61 SVFSIRMGNIDTVVLTEYPTIRKAFVDNSLAFASRYQLKSRVVLTGAKDLAIQNGEIHSL 120

             SVFSIRMGNIDTVVLTEYPTIRKAFVDNSLAFASRYQLKSRVVLTGAKDLAIQNGEIHSL

Sbjct:    59 SVFSIRMGNIDTVVLTEYPTIRKAFVDNSLAFASRYQLKSRVVLTGAKDLAIQNGEIHSL 118

 

Query:   121 LKKVVLSEMTTTKIKRMEIHIIKETEKILKILDKHAERGEPFIINNYLNMFSMNVILRFL 180

             LKKVVLSEMTTTKIKRMEIHIIKETEKILKILDKHAERGEPFIINNYLNMFSMN ILRFL

Sbjct:   119 LKKVVLSEMTTTKIKRMEIHIIKETEKILKILDKHAERGEPFIINNYLNMFSMNGILRFL 178

 

Query:   181 LGIDYPYENVDETVGYVKSIKSFFAVAGLPILSDFIPIPLKKSGVFFDSYKELEIETDKL 240

             LGIDYPYENVDETVGYVKSIKSFFAVAGLPILSDFIPIPLKKSG FFDSYKELEIETDKL

Sbjct:   179 LGIDYPYENVDETVGYVKSIKSFFAVAGLPILSDFIPIPLKKSGGFFDSYKELEIETDKL 238

 

Query:   241 IEKFKKSRNEKIENGTYNEEEDESILSKLLKEYEHGNITWECVSHTCIDIISAGTDTSAN 300

             IEKFKKSRNEKIENGTYNEEEDESILSKLLKEYEHGNITWECVSHTCIDIISAGTDTSAN

Sbjct:   239 IEKFKKSRNEKIENGTYNEEEDESILSKLLKEYEHGNITWECVSHTCIDIISAGTDTSAN 298

 

Query:   301 TLVMALIELINNQEIQSKAFSSIRSS 326

             TLVMALIELINNQEIQSKAFSSIRSS

Sbjct:   299 TLVMALIELINNQEIQSKAFSSIRSS 324

 

 Score = 691 (243.2 bits), Expect = 4.5e-240, Sum P(2) = 4.5e-240

 Identities = 133/140 (95%), Positives = 135/140 (96%)

 

Query:   320 FSSIRSSSTFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYIIQNIFSSHRSDKIFQSPNE 379

             + S+    TFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYIIQNIFSSHRSDKIFQSPNE

Sbjct:   351 YISMIIKETFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYIIQNIFSSHRSDKIFQSPNE 410

 

Query:   380 FIPERFFESSQNQGLIHFGLGVRDCVGKSLAECEIFTLIATLLNRYQFINPNNSKKLNDI 439

             FIPERFFESSQNQGLIHFGLGVRDCVGKSLAECEIFTLIATLLNRYQFINPNNSKKLNDI

Sbjct:   411 FIPERFFESSQNQGLIHFGLGVRDCVGKSLAECEIFTLIATLLNRYQFINPNNSKKLNDI 470

 

Query:   440 GTFGLAQVCPDTNIILKKRI 459

             GTFGLAQVCPDTNIILKKRI

Sbjct:   471 GTFGLAQVCPDTNIILKKRI 490

 

>aa-ng5261        Contig_2128     pred cdna   pORF 246          5321 strand f start y stop y ph 0 not a P450

MVISSPITKIKKLKEDNIDNSSSSNSNSNSNNNNNEINNFGNHHHNDDNISFSSNNNNNN

NNSNKGSNINNSSTTTTTTSTPLIGGDGGGDSCEVKRSRNILPHEVLLTRVTFKVETPIG

YVRENNVLAIIGDSAEIGNWKSKDAPQMVNSECTDTHLIWTIILSFPKATRINYKYIIKS

NSNNIEWEASKGLSRTLSIEGIDMFVNDGLFGVLESGSKPWIGRGWLPEDEYQLRIQLQC

DDNQSPVHLYDHLIEQNQLVLRLHDPFGLSQDFHLPMTGFTELVLHTHSLSSFSFSVTLL

LKDGLNLPITTTTNNTTTTTTTTTTATANSNIGSNGASSPTSYPSSPNGYKVLGKSYICS

NQVSTDLWGKLYTPIIGEKYFPIGEFRCNYLVVTPFSHPQNSLSNLWHSFVEQPQTLIGH

RGNGKNNFGINANAVTENTILSFLTAAQFGAKMVEFDLQLTFDGVPVIFHDYEIEIETNE

GFTMKETINRLTLEQFLKVKPTTNKSDLISQKLKRLKSLRISKSTSDLLSLSSCPSSTSF

DPSSFLENVTQCNSNLVVSDNNNNNNNNNNNNNNNNNNNNNKNNDNNNNNNNKNNNTIRP

SSIIHDRFSTLQDAFHLVPQEIGFMIEIKYPNLAMQNLRKFKAPERNEFIDIILNIVFNE

TNDRRIAFLTFDPDIAILLRTKQFRYPVLFLVCCDTPTFFEEFDPDVNINDNRGNSITNA

ISFVKTVNLDGIVCDSQTILNNHSFVNLIHSDNLLLFTYGSKNVDPVNVKIQKELGVDGI

IADNITKLNKTVIELHTCNCTTYCGFILDGLISAVEGDAFFNCSNSNNCVFSQNVINQML

PVIPMTCMAGECINNNSSGSQNNNNTTSDNGSRNGSENSHHNSDSSNNDNIKCPTCDCNK

STTHFMIAIIIFLVLKKKKKMAINYEINGNEIKNLQTITYNYKEPIEIDKELEYCLTNWS

IEKGQKGCPYCQQSFNIKSVSLPPKFVTNTLDSLLVYCNTDGCEFNSNSDRNQLIRRCEY

RNHFERICKVICIDCNQQFTREQLKLHSNECTSKSVKCNASSLMCSWVGPIKELQSHESN

CHYIALSPILNKLYDKIQQLDISNQKTTSFLLEKINYLESIISSSSALSTPNLSSPIYYQ

PIPTAPLPDSTTMYDNRNTTTTTTTTTTISLDKITPVEIKKVASIEIFKNNSNKYVIGKV

ESGVKSVSIDGDYADTWPIVKGNLPNSVDTLILLDGFKELVYSVPKSIKVLHIGDIKKSH

AFYIQCSSVNEIHFHDNCKIKFTKNVIPDSAKTVHFYRINEPLTTESIGKNVKCLHIHDG

FSKSLGYGVLPSSIQELHLYDIKRQALSIPSSIKILYLHNQFNNESIELPDQLETLYLDN

IIHPIKNFPSSLKTIHLINYKFEDKVPFPPKVKIIMGTK

 

 

>aa-ng5262        Contig_2128     pred cdna   pORF 7224         5425 strand r start y stop y ph 0

MIFGIIGLNENQLPGPFPIPILGNIYQLTNLPHFDLTKMSEKYGKIFRIYLADLYTVIVC

DPIIARELFVDKFDNFIDRPKIPSVKHGTFYHGTVASMGDNWKNNKEIVGKAMRKTNLKH

IYQLLDDQVDVLIESMRTIESSGETFDPRYYLTKFTMSAMFKYIFNEDISKDEDVHNGQL

AQLMKPMQKVFKDFGTGSLFDVLEITRPLYFLYLEWFTSHYYQVINFGKMKIYKHLETYK

PDVQRDLMDLLIKEYGTETDDQILSISATVSDFFLAGVDTSATSLELIVMMLINYPEYQE

KAYNEIKSALSSNGGGGGGGLTQRNKVLLSDRQSTPFVVSLFKETLRYKPISPFGLPRST

TSDIILNNGQFIPKNAQILINYHALSRNEEYFENPNQFDPTRFLNSDSNPAFMPFSIGPR

NCVGSNFAQDEIYIALSNMILNFKFKSIDGKPVDETQTYGLTLKPNPFKVILEKRK

 

>CYP508A2 Seq (22+45+69+86+88) Complete sequence 62% to CYP508

          Length = 493 5 diffs N-term missing

 

 Score = 2429 (855.1 bits), Expect = 4.3e-256, P = 4.3e-256

 Identities = 464/469 (98%), Positives = 467/469 (99%)

 

Query:     8 LNENQLPGPFPIPILGNIYQLTNLPHFDLTKMSEKYGKIFRIYLADLYTVIVCDPIIARE 67

             LNENQLPGPFPIPILGNIYQLTNLPHFDLTKMSEKYGKIFRIYLADLYTVIVCDPIIARE

Sbjct:    25 LNENQLPGPFPIPILGNIYQLTNLPHFDLTKMSEKYGKIFRIYLADLYTVIVCDPIIARE 84

 

Query:    68 LFVDKFDNFIDRPKIPSVKHGTFYHGTVASMGDNWKNNKEIVGKAMRKTNLKHIYQLLDD 127

             LFVDKFDNFIDRPKIPSV +GTFYHGTVASMGDNWK +KEIVGKAMRKTNL+HIYQLLDD

Sbjct:    85 LFVDKFDNFIDRPKIPSVNNGTFYHGTVASMGDNWKKHKEIVGKAMRKTNLEHIYQLLDD 144

 

Query:   128 QVDVLIESMRTIESSGETFDPRYYLTKFTMSAMFKYIFNEDISKDEDVHNGQLAQLMKPM 187

             QVDVLIESMRTIESSGETFDPRYYLTKFTMSAMFKYIFNEDISKDEDVHNGQLAQLMKPM

Sbjct:   145 QVDVLIESMRTIESSGETFDPRYYLTKFTMSAMFKYIFNEDISKDEDVHNGQLAQLMKPM 204

 

Query:   188 QKVFKDFGTGSLFDVLEITRPLYFLYLEWFTSHYYQVINFGKMKIYKHLETYKPDVQRDL 247

             QKVFKDFGTGSLFDVLEITRPLYFLYLEWFTSHYYQVINFGKMKIYKHLETYKPDVQRDL

Sbjct:   205 QKVFKDFGTGSLFDVLEITRPLYFLYLEWFTSHYYQVINFGKMKIYKHLETYKPDVQRDL 264

 

Query:   248 MDLLIKEYGTETDDQILSISATVSDFFLAGVDTSATSLELIVMMLINYPEYQEKAYNEIK 307

             MDLLIKEYGTETDDQILSISATVSDFFLAGVDTSATSLELIVMMLINYPEYQEKAYNEIK

Sbjct:   265 MDLLIKEYGTETDDQILSISATVSDFFLAGVDTSATSLELIVMMLINYPEYQEKAYNEIK 324

 

Query:   308 SALSSNGGGGGGGLTQRNKVLLSDRQSTPFVVSLFKETLRYKPISPFGLPRSTTSDIILN 367

             SALSSNGGGGGGGLTQRNKVLLSDRQSTPFVVSLFKETLRYKPISPFGLPRSTTSDIILN

Sbjct:   325 SALSSNGGGGGGGLTQRNKVLLSDRQSTPFVVSLFKETLRYKPISPFGLPRSTTSDIILN 384

 

Query:   368 NGQFIPKNAQILINYHALSRNEEYFENPNQFDPTRFLNSDSNPAFMPFSIGPRNCVGSNF 427

             NGQFIPKNAQILINYHALSRNEEYFENPNQFDPTRFLNSDSNPAFMPFSIGPRNCVGSNF

Sbjct:   385 NGQFIPKNAQILINYHALSRNEEYFENPNQFDPTRFLNSDSNPAFMPFSIGPRNCVGSNF 444

 

Query:   428 AQDEIYIALSNMILNFKFKSIDGKPVDETQTYGLTLKPNPFKVILEKRK 476

             AQDEIYIALSNMILNFKFKSIDGKPVDETQTYGLTLKPNPFKVILEKRK

Sbjct:   445 AQDEIYIALSNMILNFKFKSIDGKPVDETQTYGLTLKPNPFKVILEKRK 493

 

>VFH640 (VFH640Q) /pub/dna_csm/LIBRARY/VF/VFH6-B/VFH640Q.Seq.d/

        Length = 1335

 

  Plus Strand HSPs:

 

This cDNA matches ng5262 so change these aa

 

Query:     1 LFVDKFDNFIDRPKIPSVKHGTFYHGTVASMGDNWKNNKEIVGKAMRKTNLKHIYQLLDD 60

             LFVDKFDNFIDRPKIPSVKHGTFYHGTVASMGDNWKNNKEIVGKAMRKTNLKHIYQLLDD

Sbjct:   291 LFVDKFDNFIDRPKIPSVKHGTFYHGTVASMGDNWKNNKEIVGKAMRKTNLKHIYQLLDD 470

 

>aa-ng5581        Contig_2278     pred cdna   pORF 5550         6959 strand f start y stop y ph 0

MVKQFTNFSDRPRLKSFESFTGGGVNLIFIDYNEKWPVIRKIVSSSITKTKIISNYKEVI

ENQTKILINSMRTHSKINEPFKSKKYFGKFSISIVLGIMFKQDNDEKQININDNNIDNDP

ITKLTEPIQQVFLLLGTGNISDFIKILRPFFKNEYKKLNNSASKVFKFMEEIYDQHLLKF

DKSNPRDLMDYFIEYEFTNSPNTTLEEKKISIIKGCMSFVFAGDDTVAATLEWVCLYLIN

NPAIQEKCYNELISVLGDNNNESKIKFISLKERDNCQYLINVIKEVLRIRTPLPLSVPRI

ATQNCEINGFFIEKGTQILSNAFGMSHLYVDEPNVFNPDRWINYYNQKQQQQQQQQQPQP

IQNNNYFNDLDRVCLPFSTGPRNCVGISIAELNLFSVCANIILNFQIKSIDGMQLKDIEV

SGISIHPIPFSIKLISRN

 

>CYP519C1 Seq (31+32+33) complete sequence 39% to seq 12

          Length = 533 4 diffs N-term missing

 

Change all 4 to ng5581 seq add my N-term

 

 Score = 2258 (794.9 bits), Expect = 5.7e-238, P = 5.7e-238

 Identities = 434/438 (99%), Positives = 434/438 (99%)

 

Query:     1 MVKQFTNFSDRPRLKSFESFTGGGVNLIFIDYNEKWPVIRKIVSSSITKTKIISNYKEVI 60

             MVKQFTNFSDRPRLKSFESFTGGGVNLIFIDYNEKWPVIRKIVSSSITKTKIISNYKEVI

Sbjct:    96 MVKQFTNFSDRPRLKSFESFTGGGVNLIFIDYNEKWPVIRKIVSSSITKTKIISNYKEVI 155

 

Query:    61 ENQTKILINSMRTHSKINEPFKSKKYFGKFSISIVLGIMFKQDNDEKQININDNNIDNDP 120

             ENQTKILINSMRTHSKINEPFKSKKYFGKFSISIVLGIMFKQDNDEKQININDNNIDNDP

Sbjct:   156 ENQTKILINSMRTHSKINEPFKSKKYFGKFSISIVLGIMFKQDNDEKQININDNNIDNDP 215

 

Query:   121 ITKLTEPIQQVFLLLGTGNISDFIKILRPFFKNEYKKLNNSASKVFKFMEEIYDQHLLKF 180

             ITKLTEPIQQVFLLLGTGNISDFIKILRPFFKNEYKKLNNS SKVFK MEEIYDQHLLKF

Sbjct:   216 ITKLTEPIQQVFLLLGTGNISDFIKILRPFFKNEYKKLNNSDSKVFKIMEEIYDQHLLKF 275

 

Query:   181 DKSNPRDLMDYFIEYEFTNSPNTTLEEKKISIIKGCMSFVFAGDDTVAATLEWVCLYLIN 240

             DKSNPRDLMDYFIEYEFTNSPNTTLEEKKISIIK CMSFVFAGDDTVAATLEWVCLYLIN

Sbjct:   276 DKSNPRDLMDYFIEYEFTNSPNTTLEEKKISIIKFCMSFVFAGDDTVAATLEWVCLYLIN 335

 

Query:   241 NPAIQEKCYNELISVLGDNNNESKIKFISLKERDNCQYLINVIKEVLRIRTPLPLSVPRI 300

             NPAIQEKCYNELISVLGDNNNESKIKFISLKERDNCQYLINVIKEVLRIRTPLPLSVPRI

Sbjct:   336 NPAIQEKCYNELISVLGDNNNESKIKFISLKERDNCQYLINVIKEVLRIRTPLPLSVPRI 395

 

Query:   301 ATQNCEINGFFIEKGTQILSNAFGMSHLYVDEPNVFNPDRWINYYNQKQQQQQQQQQPQP 360

             ATQNCEINGFFIEKGTQILSNAFGMSHLYVDEPNVFNPDRWINYYNQKQQQQQQQQQPQP

Sbjct:   396 ATQNCEINGFFIEKGTQILSNAFGMSHLYVDEPNVFNPDRWINYYNQKQQQQQQQQQPQP 455

 

Query:   361 IQNNNYFNDLDRVCLPFSTGPRNCVGISIAELNLFSVCANIILNFQIKSIDGMQLKDIEV 420

             IQNNNYFNDLDRVCLPFSTGPRNCVGISIAELNLF VCANIILNFQIKSIDGMQLKDIEV

Sbjct:   456 IQNNNYFNDLDRVCLPFSTGPRNCVGISIAELNLFRVCANIILNFQIKSIDGMQLKDIEV 515

 

Query:   421 SGISIHPIPFSIKLISRN 438

             SGISIHPIPFSIKLISRN

Sbjct:   516 SGISIHPIPFSIKLISRN 533

 

 

>aa-ng6320        Contig_2573     pred cdna   pORF 634          2249 strand f start y stop y ph 0

MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSLNPHRSLTELAKV

YGGVYSLHIGDSKTVVITDVSAFKDVTIKQFKNFANRPQPKSIRVITNFKGLAFADYDQW

QKTRKLVSSALTKTKIKTFNNLIEKQTENLIESMNEFSNKNELFHPRKYLTKYSLNIILS

MLFSKEIGKNESINKGTMERLTIPFNEAFKKVGKVDDFLWFLSPFFYFSNKQYKKYIFDI

YYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKETVILVGMDFLLAGSDTQKATQEW

FCLYLINNPDVQKKAYQELISVVGKDCKFVTSNHIENCPYFISIIKEVFRIRSPGPLGLP

RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLVSEPDQFKPERWINYKNQQQQKQQQQQQ

QVNNKNSIDSSESSNLEFFDDLEKVSNPFSLGPRNCVGMAIAKSSIYSVCSNILLNFELS

SINNQIIDDNEVFGVSINPKEFSIKLTKR

 

>JC1b162b04.s1 Clone JC1b162b04, standard read, bases 71 through 684, from

            2000-09-07

        Length = 612

 

  Plus Strand HSPs:

 

Supports ng6320 seq

 

Query:    78 LTKYSLNIILSMLFSKEIGKNESINKGTMERLTIPFNEAFKKVGKVDDFLWFLSPFFYFS 137

             LTKYSLNIILSMLFSKEIGKNESINKGTMERLTIPFNEAFKKVGKVDDFLWFLSPFFYFS

Sbjct:     1 LTKYSLNIILSMLFSKEIGKNESINKGTMERLTIPFNEAFKKVGKVDDFLWFLSPFFYFS 180

 

Query:   138 NKQYKKYIFDIYYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKETVILVGMDFLLA 197

             NKQYKKYIFDIYYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKETVILVGMDFLLA

Sbjct:   181 NKQYKKYIFDIYYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKETVILVGMDFLLA 360

 

Query:   198 GSDTQKATQEW 208

             GSDTQKATQEW

Sbjct:   361 GSDTQKATQEW 393

 

>JAX4a216e09.r1 Clone JAX4a216e09, reverse read, bases 164 through 715, from

            1999-02-08

        Length = 550

 

  Minus Strand HSPs:

 

 Score = 446 (162.1 bits), Expect = 6.7e-57, Sum P(2) = 6.7e-57

 Identities = 86/87 (98%), Positives = 86/87 (98%), Frame = -3

 

Query:   209 RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLVSEPDQFKPERWINYKNQQQQKQQQQQQ 268

             RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLV EPDQFKPERWINYKNQQQQKQQQQQQ

Sbjct:   263 RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLVXEPDQFKPERWINYKNQQQQKQQQQQQ 84

 

Query:   269 QVNNKNSIDSSESSNLEFFDDLEKVSN 295

             QVNNKNSIDSSESSNLEFFDDLEKVSN

Sbjct:    83 QVNNKNSIDSSESSNLEFFDDLEKVSN 3

 

>JAX4a216e09.r1 Clone JAX4a216e09, reverse read, bases 164 through 715, from 1999-02-08 translate frame +1 translate plus frames translate all frames

GGATTTGATACTTTTTCTAAATCATCAAAAAATTCTAAATTACTTGATTCACTACTATCA

ATTGAATTTTTATTATTTACTTGTTGTTGTTGTTGTTGTTGTTTTTGTTGTTGTTGATTT

TTATAATTTATCCATCTTTCTGGTTTAAATTGATCAGGTTCANAAACTAATAAATTGCCC

ATTCCAAATATATTTAAAAGTATTTGAGTACCTTTTGGTATGAACATACCATTACTTAAA

TAAGTATCATCAATTGAAATTCTTGGTAAACCAAGTGGTCCAGGTGAACGAATACGGAAA

ACTTNTTTAATAATTGAAATAAAGNATGGACAATTTTCAATATGATTTGANGTTACAAAT

TTACAATCTTTACCAACCACACCTTATTAATTCTTGATATGCTTTCTTTTGAACATCTGG

ATTATTAATTAAATAAAGGCAAAACCATTCTTGAGTTGCTTTTTGTGTATCTGAACCTGC

CAATAGGAAATCCATTCCGACCAATATTACTGGCTCTTTTTCTCTACCTTGACTTGCAAT

TATTAGTTGA

 

>JC3f115e15.s1 Clone JC3f115e15, standard read, bases 27 through 739, from

            2002-12-10

        Length = 711

 

  Minus Strand HSPs:

 

Supports ng6320 seq

 

Query:     1 NFANRPQPKSIRVITNFKGLAFADYDQWQKTRKLVSSALTKTKIKTFNNLIEKQTENLIE 60

             NFANRPQPKSIRVITNFKGLAFADYDQWQKTRKLVSSALTKTKIKTFNNLIEKQTENLIE

Sbjct:   367 NFANRPQPKSIRVITNFKGLAFADYDQWQKTRKLVSSALTKTKIKTFNNLIEKQTENLIE 188

 

Query:    61 SMNEFSNKNELFHPRKYLTKY-SLNIIL 87

             SMNEFSNKNEL +   +   Y  +NI++

Sbjct:   187 SMNEFSNKNELVNNYYFYYYY*HINILI 104

 

>IIAFP1D53328

        Length = 865

 

  Minus Strand HSPs:

 

Supports ng6320 seq

 

Query:   273 KNSIDSSESSNLEFFDDLEKVSNPFSLGPRNCVGMAIAKSSIYSVCSNILLNFELS 328

             K  +  SESSNLEFFDDL+KVSNPFSLGPRNCVGMAIAKSSIYSVCSNILLNFELS

Sbjct:   567 KIQLIGSESSNLEFFDDLKKVSNPFSLGPRNCVGMAIAKSSIYSVCSNILLNFELS 400

 

>CYP519B1 Seq (35+44+75) complete seq 51% to seq 12 496 aa

          Length = 488

 

 Score = 2147 (755.8 bits), Expect = 1.2e-257, Sum P(2) = 1.2e-257

 Identities = 412/431 (95%), Positives = 420/431 (97%)

 

Query:     1 MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSLNPHRSLTELAKV 60

             MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSLNPHRSLTELAKV

Sbjct:     1 MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSLNPHRSLTELAKV 60

 

Query:    61 YGGVYSLHIGDSKTVVITDVSAFKDVTIKQFKNFANRPQPKSIRVITNFKGLAFADYDQW 120

             YGGVYSLHIGDSKTVVITDVSAFKDVTIKQFKNFANRPQPKSIRVITNFKGLAF + DQW

Sbjct:    61 YGGVYSLHIGDSKTVVITDVSAFKDVTIKQFKNFANRPQPKSIRVITNFKGLAFGEGDQW 120

 

Query:   121 QKTRKLVSSALTKTKIKTFNNLIEKQTENLIESMNEFSNKNELFHPRKYLTKYSLNIILS 180

             QKT+KLVSSALTKTKIKTFNNLIEKQ+ENLIESMNEFSNKNELFHPRKYLTKYSLNIILS

Sbjct:   121 QKTKKLVSSALTKTKIKTFNNLIEKQSENLIESMNEFSNKNELFHPRKYLTKYSLNIILS 180

 

Query:   181 MLFSKEIGKNESINKGTMERLTIPFNEAFKKVGKVDDFLWFLSPFFYFSNKQYKKYIFDI 240

             MLFSKEIGKNESINKG+MERLTIPFNEAFKKVGKVDDFLWFLSPFFYFSNKQYKKYIFDI

Sbjct:   181 MLFSKEIGKNESINKGSMERLTIPFNEAFKKVGKVDDFLWFLSPFFYFSNKQYKKYIFDI 240

 

Query:   241 YYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKETVILVGMDFLLAGSDTQKATQEW 300

             YYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKE VILVGMDFLLAGSDTQKATQEW

Sbjct:   241 YYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKEPVILVGMDFLLAGSDTQKATQEW 300

 

Query:   301 FCLYLINNPDVQKKAYQELISVVGKDCKFVTSNHIENCPYFISIIKEVFRIRSPGPLGLP 360

             FCLYLINNPDVQKKAYQELISVVGKDCKFVTSNHIENCPYFISIIKEVFRIRSPGPLGLP

Sbjct:   301 FCLYLINNPDVQKKAYQELISVVGKDCKFVTSNHIENCPYFISIIKEVFRIRSPGPLGLP 360

 

Query:   361 RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLVSEPDQFKPERWINYKNQQQQKQQQQQ- 419

             RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLVSEPDQFKPERWINYKNQQ QKQQ Q 

Sbjct:   361 RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLVSEPDQFKPERWINYKNQQHQKQQPQPT 420

 

Query:   420 --QQVNNKNSI 428

               ++V+N  S+

Sbjct:   421 TLKKVSNPFSL 431

 

 Score = 320 (112.6 bits), Expect = 1.2e-257, Sum P(2) = 1.2e-257

 Identities = 65/68 (95%), Positives = 66/68 (97%)

 

Query:   442 LEKVSNPFSLGPRNCVGMAIAKSSIYSVCSNILLNFELSSINNQIIDDNEVFGVSINPKE 501

             L+KVSNPFSLGPRNC GMAIAKSSIYSVCSNILLNFELSSINNQIIDDNEVFGVSINPKE

Sbjct:   422 LKKVSNPFSLGPRNC-GMAIAKSSIYSVCSNILLNFELSSINNQIIDDNEVFGVSINPKE 480

 

Query:   502 FSIKLTKR 509

             FSIKL KR

Sbjct:   481 FSIKLAKR 488

 

 

>aa-ng6352        Contig_2589     pred cdna   pORF 2441          615 strand r start y stop y ph 0

MGIGLIILYLLIGLLAYDFTKKNKKISKNDPKQPLAIPVLGHLHLFGSQPHRSLTELAKK

FGGIFTLWMGDERSMVITDPNILRELYVKNHLNFYNRASSESIRIYSGNLVDISFSVGES

WKNRRRYVSAALTKTKVLNVITLIEEQANFLINSMQYYAKSGEPFFPHKYYNKYTMNIVM

SIGFSKTISENESVEEGPISQLIIPFYNILENLGSGNLGDYVWYTQPFFYFKNKKLEQDT

KKVYTFLEEIYNEHIKNLDESNPRDLMDQLIISTGGKEKDMVIHVSTDFLLAGSDTNAST

LEWFCIFLANNPEIQKKAYEELISVVGKDCKAVTTKYRDDCPYLVGAIKETLRMRTPAPL

SLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYKPERWVEYYKNKTPTREM

EATTETKSNITTEILPNDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGG

KKIDETEVFGITIHPKDFSIQLKKRE

 

>CYP519E1 seq 84+17 41% to seq 14

          Length = 467

 

Note add to exon 2 as shown below

Check between PERW and heme

EST supports extra piece here add as above

 

 Score = 2039 (717.8 bits), Expect = 4.5e-251, Sum P(2) = 4.5e-251

 Identities = 401/434 (92%), Positives = 403/434 (92%)

 

Query:     1 MGIGLIILYLLIGLLAYDFTKKNKKISKNDPKQPLAIPVLGHLHLFGSQPHRSLTELAKK 60

             MGIGLIILYLLIGLLAYDF             QPLAIPVLGHLHLFGSQPHRSLTELAKK

Sbjct:     1 MGIGLIILYLLIGLLAYDF-------------QPLAIPVLGHLHLFGSQPHRSLTELAKK 47

 

Query:    61 FGGIFTLWMGDERSMVITDPNILRELYVKNHLNFYNRASSESIRIYSGNLVDISFSVGES 120

             FGGIFTLWMGDERSMVITDPNILRELYVKNHLNFYNRASSESIRIYSGNLVDISFSVGES

Sbjct:    48 FGGIFTLWMGDERSMVITDPNILRELYVKNHLNFYNRASSESIRIYSGNLVDISFSVGES 107

 

Query:   121 WKNRRRYVSAALTKTKVLNVITLIEEQANFLINSMQYYAKSGEPFFPHKYYNKYTMNIVM 180

             WKNRRRYVSAALTKTKVLNVITLIEEQANFLINSMQYYAKSGEPFFPHKYYNKYTMNIVM

Sbjct:   108 WKNRRRYVSAALTKTKVLNVITLIEEQANFLINSMQYYAKSGEPFFPHKYYNKYTMNIVM 167

 

Query:   181 SIGFSKTISENESVEEGPISQLIIPFYNILENLGSGNLGDYVWYTQPFFYFKNKKLEQDT 240

             SIGFSKTISENESVEEGPISQLIIPFYNILENLGSGNLGDYVWYTQPFFYFKNKKLEQDT

Sbjct:   168 SIGFSKTISENESVEEGPISQLIIPFYNILENLGSGNLGDYVWYTQPFFYFKNKKLEQDT 227

 

Query:   241 KKVYTFLEEIYNEHIKNLDESNPRDLMDQLIISTGGKEKDMVIHVSTDFLLAGSDTNAST 300

             KKVYTFLEEIYNEHIKNLDESNPRDLMDQLIISTGGKEKDMVIHVSTDFLLAGSDTNAST

Sbjct:   228 KKVYTFLEEIYNEHIKNLDESNPRDLMDQLIISTGGKEKDMVIHVSTDFLLAGSDTNAST 287

 

Query:   301 LEWFCIFLANNPEIQKKAYEELISVVGKDCKAVTTKYRDDCPYLVGAIKETLRMRTPAPL 360

             LEWFCIFLANNPEIQKKAYEELISVVGKDCKAVTTKYRDDCPYLVGAIKETLRMRTPAPL

Sbjct:   288 LEWFCIFLANNPEIQKKAYEELISVVGKDCKAVTTKYRDDCPYLVGAIKETLRMRTPAPL 347

 

Query:   361 SLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYKPERWVEYYKNKTPTREM 420

             SLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYKPERWVE   +K     

Sbjct:   348 SLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYKPERWVENDLDKVVLPFS 407

 

Query:   421 EATTETKSNITTEI 434

                     NI +EI

Sbjct:   408 IGPRNCPGNIISEI 421

 

 Score = 366 (128.8 bits), Expect = 4.5e-251, Sum P(2) = 4.5e-251

 Identities = 70/70 (100%), Positives = 70/70 (100%)

 

Query:   437 NDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGGKKIDETEVFGITIHPK 496

             NDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGGKKIDETEVFGITIHPK

Sbjct:   397 NDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGGKKIDETEVFGITIHPK 456

 

Query:   497 DFSIQLKKRE 506

             DFSIQLKKRE

Sbjct:   457 DFSIQLKKRE 466

 

 

>aa-ng6400        Contig_2611     pred cdna   pORF 90           1077 strand f start n stop y ph 0

DISYDEDITKGKIAELVQPMDQIFKSLGSGKLGDFISIAQPFYYQWLKFSDKQFNGPFST

VKKFIYKRYLEHINTIDHDNPRDLMDLLINEFSNDKNLIPTILQTSLDMFLAGNDTTAAS

IIWFVLRMQEHPEIQLKAYNEIKDAVGDRDRVLLSDRPKTPYLNAIIKEILRLNPVGPFG

LPHRSSNDIVIGNGKYFIPKDSQILVNYRGLGFNEKYFENPSQFDPSRFLNKNNDAYMPF

GVGDRKCVGLQLAGDEQYLSFSNILLNFNLKNIGTPVSDYEEFGLTLKPNKFKVLLESRK

 

>CYP508B1 Seq. 15, 77  50% identical to 508A1

          Length = 485 missing N-term

 

Use my seq

 

 Score = 1576 (554.8 bits), Expect = 1.1e-165, P = 1.1e-165

 Identities = 300/300 (100%), Positives = 300/300 (100%)

 

Query:     1 DISYDEDITKGKIAELVQPMDQIFKSLGSGKLGDFISIAQPFYYQWLKFSDKQFNGPFST 60

             DISYDEDITKGKIAELVQPMDQIFKSLGSGKLGDFISIAQPFYYQWLKFSDKQFNGPFST

Sbjct:   185 DISYDEDITKGKIAELVQPMDQIFKSLGSGKLGDFISIAQPFYYQWLKFSDKQFNGPFST 244

 

Query:    61 VKKFIYKRYLEHINTIDHDNPRDLMDLLINEFSNDKNLIPTILQTSLDMFLAGNDTTAAS 120

             VKKFIYKRYLEHINTIDHDNPRDLMDLLINEFSNDKNLIPTILQTSLDMFLAGNDTTAAS

Sbjct:   245 VKKFIYKRYLEHINTIDHDNPRDLMDLLINEFSNDKNLIPTILQTSLDMFLAGNDTTAAS 304

 

Query:   121 IIWFVLRMQEHPEIQLKAYNEIKDAVGDRDRVLLSDRPKTPYLNAIIKEILRLNPVGPFG 180

             IIWFVLRMQEHPEIQLKAYNEIKDAVGDRDRVLLSDRPKTPYLNAIIKEILRLNPVGPFG

Sbjct:   305 IIWFVLRMQEHPEIQLKAYNEIKDAVGDRDRVLLSDRPKTPYLNAIIKEILRLNPVGPFG 364

 

Query:   181 LPHRSSNDIVIGNGKYFIPKDSQILVNYRGLGFNEKYFENPSQFDPSRFLNKNNDAYMPF 240

             LPHRSSNDIVIGNGKYFIPKDSQILVNYRGLGFNEKYFENPSQFDPSRFLNKNNDAYMPF

Sbjct:   365 LPHRSSNDIVIGNGKYFIPKDSQILVNYRGLGFNEKYFENPSQFDPSRFLNKNNDAYMPF 424

 

Query:   241 GVGDRKCVGLQLAGDEQYLSFSNILLNFNLKNIGTPVSDYEEFGLTLKPNKFKVLLESRK 300

             GVGDRKCVGLQLAGDEQYLSFSNILLNFNLKNIGTPVSDYEEFGLTLKPNKFKVLLESRK

Sbjct:   425 GVGDRKCVGLQLAGDEQYLSFSNILLNFNLKNIGTPVSDYEEFGLTLKPNKFKVLLESRK 484

 

>aa-ng6418        Contig_2617     pred cdna   pORF 14697        9940 strand r start y stop y ph 0 not a P450

MKFQKHLKNVTVWNHHYLPYQKLKRKLRTFFSPPQSPSDSTPPPPTTTTTTASTTTVNNN

NRAISISDGGNGNLNFLDILENEIKKVNQFILDEKLTIVMNRFEDLKLKFKSIDFSSPFS

VYLSVGLMTPQQIEDNNNNYLNTIQSNISNNNNNNNNNSNNNNNNNNSSNNNNSNINNST

DNNKDLSTPLDQSSSSLYFFSEQQQQPPGQGSNDVDTVTAFQIAFEDILKQLAEIDQFSQ

LNIQAIQKLLNTYSKKKIKTLISDRKKRKVSQTETTLPKMFHLEINNNHNSNNSNNNNNN

NNNNNNNNNNNNNNNNNNNNSNNTLSPATNTISILSKEQLIEDVKKDIDNYFKEKVEVLE

LYHHTQIKKLTLEIELEYQTICQTNVHLLNNNYYDILNIKEKELSNKVESLCRNVVNNNI

SIVKSLIEDIKNLMRDELKLEQVNNTMWIETIHSCIHKSCYLGNVEMVKFLFNTFNDLKM

VNIKYCDEKNKQFIHLAAENGRLSVMEFLIENGADIECLDFLSRRPLHLSAKNGKYDCVK

LLLEKGAKQDVLDREGCTPLYLACRQPPSANNIINSSGDISLSVVDLLLLSGSPLTRGPY

GRHILHEASSKGNCNNINSILKIDPDSANSTDSFGRTPLFESIKHGQLQAISTLLEKNAQ

IVSVFDDDKRTPLHEAAANGQLDSMHLLINNILENLKEDQVSKVVNQQDDDGWSPLHDAC

YLNYIHCVKCLLENKANPNLLDKGEWSPIVHSLYRGNIQAAITIIEFIKKNPDLFQDTTT

TTTLSSSNTIPQTTLKDSKNLSCSSSGVPLITASTVGTTVLPPKLDIVDLNKSISNLNLS

SNIILKSPESELLSPTPSSSSNNSSSISVTPSTSPSTSKKILKSLMSTVTFRIQLVSTTI

KPGQKVGIVGNRKAIGSWNPSAALMFREVDNIGSGGWIGKVQVPVNVQLEYKYVIFEGSR

LDTWEALPENRKFTPQEEEEFVDDGVFGQITFDEDSVTAEQSKQSLFMERGWLVQDTQIR

IRFGETSPTDLQRKIKQPITLYNGKEQAGKIVVTLRDRSSNIQSGTGLSQVNLPLTKDHS

VQFQTNQLKPNSYHSIQFDIYRRGMSSVLIGRSIVLVSELTSTNPYKKVVPIINSSLLSV

GELYFFPLIVSPFTHSKISDVLSNTYWKSTLLIGHRGGGAENARNVGKYKRTHIKENTIL

SFVTAASLGAQYIEFDVQLSRDNIPVIYHDFEILGQDGIKIPINKINLEQFKSLQQEKKN

SSNNNNNNNNNNSNNNNNNNNKNNSNRINNNNSNQHDENNKNNISKTLPSLKTLRGRSKS

MQDIFNDFGTSSSNILANLRNSGIINVNNTPSSPLASSSSSANSSQPISDSFTTLEETFK

NVPIATGFNIEIKYPNQEKEMEIQMNNMNRNAYVDTILKVVFEHAGNRSVMFSSFDADIC

LLCSLKQPKYPVFFLNNAGFTQHADPRANSISEAIRFSKSAHLLGIVTNSKILCEAPPII

GQVKSAGLMLCSWGSENNDPALVDLQEQLGCDGVISDHIAYISKHYYTNNNTIINNQNNQ

 

 

>aa-ng6446        Contig_2629     pred cdna   pORF 2447          885 strand r start y stop y ph 0

MNYLVLILVSLVSIYFLFIKNQDRKKKINSKIPGPKGFLFGNLIELARTKKNLHLKYLEW

FEKYGPVFRVSIGSLETVVLTGYPILREAFIDNSDTFTSRFQRENARSINGYKGLINSNG

DYHKNLKSVILSEMTATKIKKMESHINQESKRLCELLDQHAKQGTPFTMNKYLNLFSINI

ILRFLFGVNYPYTELDDGSSSIIQVIQQFLKLVSQPSITTYFPILSPFMNDRSKEFYDIH

KLLSNHIINLIERYKDSKQHQQQEQEPIDGATEPTVTILDKLLIEVENNRITQNALISIC

IDVLIAGTDTVGQTLSFAIVALVNNAEIQEKLSRNIIDSMEKGDNHYSYSKYRNGIPYLA

LVVKEVFRMYPAGILGLPHMTSEDCEIQGHKIAKGTQIIQNIYSTHRSESFWPNPNNFIP

ERHIQNDVSKSVHFAVGTRNCMGMSLSEAEVHTAMAELFGNFKFTNPSNIPLNDQGTFSV

ALNCPDFFVKIERRN

 

>CYP513C1 Seq 5+56 complete seq 43% to seq 7 486 aa

          Length = 496

 

 Score = 2559 (900.8 bits), Expect = 7.3e-270, P = 7.3e-270

 Identities = 495/495 (100%), Positives = 495/495 (100%)

 

>aa-ng6889        Contig_2828     pred cdna   pORF 197          2177 strand f start y stop y ph 0

MKNDKNMNSKIPGPKGIPILGNLLSMKGDLHLKLQEWYKQYGVIYRIKMGNVETVVLTEY

PIIREAFIGNSNSFVNRFQRKSRLKLNNGENLVIVNGDIHNKLKTLVLSEMTNQRIKKYE

TSFIDNEIKKLFKVLDEHADTGKPIILNNHIKMFSMNIVLCFTFGLNYSYPYDEFEKASE

FIKLMVEFFNIAGQPIISDFIPSLEPFIDTSNYLNTYKRIFNYTSDLITKFKNENEIHNN

INDNNKSLADKPILSKLLQSFENGEISWDSVVSTCIDLQTAGADTSANTILYCLLELINN

PNIQSKVYDDIKQAIIQSKENENQNDNENQEQTEEIITLSFNKYRTLAPYLSMVVKETFR

KYPSGTIGLPHVTSEDVELNGYKICAGTQIIQNIWATHRNEKQFSEPDSFIPERFISQQQ

SANSNLIHFGCGVRDCIGKSLADSEIFTMLASLINRYEFTNPNPSTPLNEIGKFGITYSC

PENKIIIKKRF

 

>CYP513A1 Seq. 8+53 complete same as 8b

          Length = 510 1aa diff N-term missing

 

Most seqs have T at this site so change seq to T add N-term

 

 Score = 2551 (898.0 bits), Expect = 5.1e-269, P = 5.1e-269

 Identities = 489/490 (99%), Positives = 490/490 (100%)

 

Query:     2 KNDKNMNSKIPGPKGIPILGNLLSMKGDLHLKLQEWYKQYGVIYRIKMGNVETVVLTEYP 61

             KNDKNMNSKIPGPKGIPILGNLLSMKGDLHLKLQEWYKQYGVIYRIKMGNVETVVLTEYP

Sbjct:    20 KNDKNMNSKIPGPKGIPILGNLLSMKGDLHLKLQEWYKQYGVIYRIKMGNVETVVLTEYP 79

 

Query:    62 IIREAFIGNSNSFVNRFQRKSRLKLNNGENLVIVNGDIHNKLKTLVLSEMTNQRIKKYET 121

             IIREAFIGNSNSFVNRFQRKSRLKLNNGENLVIVNGDIHNKLKTLVLSEMTNQRIKKYET

Sbjct:    80 IIREAFIGNSNSFVNRFQRKSRLKLNNGENLVIVNGDIHNKLKTLVLSEMTNQRIKKYET 139

 

Query:   122 SFIDNEIKKLFKVLDEHADTGKPIILNNHIKMFSMNIVLCFTFGLNYSYPYDEFEKASEF 181

             SFIDNEIKKLFKVLDEHADTGKPIILNNHIKMFSMNIVLCFTFGLNYSYPYDEFEKASEF

Sbjct:   140 SFIDNEIKKLFKVLDEHADTGKPIILNNHIKMFSMNIVLCFTFGLNYSYPYDEFEKASEF 199

 

Query:   182 IKLMVEFFNIAGQPIISDFIPSLEPFIDTSNYLNTYKRIFNYTSDLITKFKNENEIHNNI 241

             IKLMVEFFNIAGQPIISDFIPSLEPFIDTSNYLNTYKRIFNYTSDLI+KFKNENEIHNNI

Sbjct:   200 IKLMVEFFNIAGQPIISDFIPSLEPFIDTSNYLNTYKRIFNYTSDLISKFKNENEIHNNI 259

 

Query:   242 NDNNKSLADKPILSKLLQSFENGEISWDSVVSTCIDLQTAGADTSANTILYCLLELINNP 301

             NDNNKSLADKPILSKLLQSFENGEISWDSVVSTCIDLQTAGADTSANTILYCLLELINNP

Sbjct:   260 NDNNKSLADKPILSKLLQSFENGEISWDSVVSTCIDLQTAGADTSANTILYCLLELINNP 319

 

Query:   302 NIQSKVYDDIKQAIIQSKENENQNDNENQEQTEEIITLSFNKYRTLAPYLSMVVKETFRK 361

             NIQSKVYDDIKQAIIQSKENENQNDNENQEQTEEIITLSFNKYRTLAPYLSMVVKETFRK

Sbjct:   320 NIQSKVYDDIKQAIIQSKENENQNDNENQEQTEEIITLSFNKYRTLAPYLSMVVKETFRK 379

 

Query:   362 YPSGTIGLPHVTSEDVELNGYKICAGTQIIQNIWATHRNEKQFSEPDSFIPERFISQQQS 421

             YPSGTIGLPHVTSEDVELNGYKICAGTQIIQNIWATHRNEKQFSEPDSFIPERFISQQQS

Sbjct:   380 YPSGTIGLPHVTSEDVELNGYKICAGTQIIQNIWATHRNEKQFSEPDSFIPERFISQQQS 439

 

Query:   422 ANSNLIHFGCGVRDCIGKSLADSEIFTMLASLINRYEFTNPNPSTPLNEIGKFGITYSCP 481

             ANSNLIHFGCGVRDCIGKSLADSEIFTMLASLINRYEFTNPNPSTPLNEIGKFGITYSCP

Sbjct:   440 ANSNLIHFGCGVRDCIGKSLADSEIFTMLASLINRYEFTNPNPSTPLNEIGKFGITYSCP 499

 

Query:   482 ENKIIIKKRF 491

             ENKIIIKKRF

Sbjct:   500 ENKIIIKKRF 509

 

>aa-ng7190        Contig_2949     pred cdna   pORF 4620         3115 strand r start y stop y ph 0

MIGTIAVLVIAILSPSNIPPIVETIPFIGCFYQFAKNPLQLVRNSYDRLGEIFTLHLMGF

KMTFVLGPEAQALFFRGTDEELSPKEAYRFVTPVFGKGVVYDSETEIMYEQLRFVKNGLV

LSQLKKAVGIIQEETEKYFETKWGDSGEIDLLYEMNKLTILTASRCLMGKSINKSLGQSG

QLADLYHELEEGLNPISFFFPNLPLPSFKKRDAARAKVAAIFHSIIQERRRSTDDSVDDV

LYTLMNSKYKDGSVLEDEQIVGLMIGLLFAGQHTSSITLTYTIFYLLNNLEYFDETQKDI

NDIVQKENQGEINFDGLKRMNRLETVIREVLRLHPPLIFLMRKVMTPMEYKGKTIPAGHI

LAVSPQVGMRLPTVYKNPDSFEPKRFDVEDKTPFSFIAFGGGKHGCPGENFGILQIKTIW

TVLSTKYNLEVGPVPPTDFTSLVAGPKGPCMVKYSKKQK

 

>VFN761 (VFN761Q) /pub/dna_csm/LIBRARY/VF/VFN7-C/VFN761Q.Seq.d/

        Length = 1049

 

  Plus Strand HSPs:

 

EST supports ng7190

 

Query:     1 QLADLYHELEEGLNPISFFFPNLPLPSFKKRDAARAKVAAIFHSIIQERRRSTDDSVDDV 60

             QLADLYHELEEGLNPISFFFPNLPLPSFKKRDAARAKVAAIFHSIIQERRRSTDDSVDDV

Sbjct:   358 QLADLYHELEEGLNPISFFFPNLPLPSFKKRDAARAKVAAIFHSIIQERRRSTDDSVDDV 537

 

>CYP51 Seq 19 complete 26% to seq 8

       Length = 466 mid and N-term diffs

 

 Score = 2149 (756.5 bits), Expect = 2.0e-226, P = 2.0e-226

 Identities = 428/465 (92%), Positives = 433/465 (93%)

 

Query:     1 MIGTIAVLVIAIL-------SPSNIPPIVETIPFIGCFYQFAKNPLQLVRNSYDRLGEIF 53

             MIGTIAVLVIAIL       SPSNIPPIVETIPFIGCFYQFAKNPLQLVRNSYDRLGEIF

Sbjct:     1 MIGTIAVLVIAILVIFAFKKSPSNIPPIVETIPFIGCFYQFAKNPLQLVRNSYDRLGEIF 60

 

Query:    54 TLHLMGFKMTFVLGPEAQALFFRGTDEELSPKEAYRFVTPVFGKGVVYDSETEIMYEQLR 113

             TLHLMGFKMTFVLGPEAQALFFRGTDEELSPKEAYRFVTPVFGKGVVYDSETEIMYEQLR

Sbjct:    61 TLHLMGFKMTFVLGPEAQALFFRGTDEELSPKEAYRFVTPVFGKGVVYDSETEIMYEQLR 120

 

Query:   114 FVKNGLVLSQLKKAVGIIQEETEKYFETKWGDSGEIDLLYEMNKLTILTASRCLMGKSIN 173

             FVKNGLVLSQLKKAVGIIQEETEKYFETKWGDSGEIDLLYEMNKLTILTASRCLMGKSIN

Sbjct:   121 FVKNGLVLSQLKKAVGIIQEETEKYFETKWGDSGEIDLLYEMNKLTILTASRCLMGKSIN 180

 

Query:   174 KSLGQSGQLADLYHELEEGLNPISFF-FPNLPLPSFKKRDAARAKVAAIFHSIIQERRRS 232

             KSLGQSGQLADLY   ++    I F  F  +      KR+      +  F SIIQERRRS

Sbjct:   181 KSLGQSGQLADLYPNWKKV--SIQFHSFSQISHYHHSKRETLPVLKSRHFPSIIQERRRS 238

 

Query:   233 TDDSVDDVLYTLMNSKYKDGSVLEDEQIVGLMIGLLFAGQHTSSITLTYTIFYLLNNLEY 292

             TDDSVDDVLYTLMNSKYKDGSVLEDEQIVGLMIGLLFAGQHTSSITLTYTIFYLLNNLEY

Sbjct:   239 TDDSVDDVLYTLMNSKYKDGSVLEDEQIVGLMIGLLFAGQHTSSITLTYTIFYLLNNLEY 298

 

Query:   293 FDETQKDINDIVQKENQGEINFDGLKRMNRLETVIREVLRLHPPLIFLMRKVMTPMEYKG 352

             FDETQKDINDIVQKENQGEINFDGLKRMNRLETVIREVLRLHPPLIFLMRKVMTPMEYKG

Sbjct:   299 FDETQKDINDIVQKENQGEINFDGLKRMNRLETVIREVLRLHPPLIFLMRKVMTPMEYKG 358

 

Query:   353 KTIPAGHILAVSPQVGMRLPTVYKNPDSFEPKRFDVEDKTPFSFIAFGGGKHGCPGENFG 412

             KTIPAGHILAVSPQVGMRLPTVYKNPDSFEPKRFDVEDKTPFSFIAFGGGKHGCPGENFG

Sbjct:   359 KTIPAGHILAVSPQVGMRLPTVYKNPDSFEPKRFDVEDKTPFSFIAFGGGKHGCPGENFG 418

 

Query:   413 ILQIKTIWTVLSTKYNLEVGPVPPTDFTSLVAGPKGPCMVKYSKKQK 459

             ILQIKTIWTVLSTKYNLEVGPVPPTDFTSLVAGPKGPCMVKYSKKQK

Sbjct:   419 ILQIKTIWTVLSTKYNLEVGPVPPTDFTSLVAGPKGPCMVKYSKKQK 465

 

>aa-ng7890        Contig_3262     pred cdna   pORF 3080         5244 strand f start y stop y ph 0 reductase

MEILESIDFIEVLILDNLGAIIIVAVIVGTYLYMNKPPPPPPVFNKPNNKINKEAQKPKK

TITKNEDGKKVMKIFFGTQTRTAEDFSRIIEKECKKIGIPCEVVDLESYEHEQELHSESF

VMFLVATHGEGDPTDNAKEFYLWLTNDERPTDLLNGVPFTVFGLGNKTYEHYNAVARVID

RRMEELGGKRVFERGEGDDDATLEEDFNRWKKDMWPVVCKFLGYELKSTEDDKFVPRFRM

VTLNQDSKDINDPFIKIVSTPLKPKLSTDNKVIYDMKNPYYAEVLENRELHSNESDRSCR

HIEFKLGDEVSYTTGDHLGVFPINDSKLVEQLIKRLGVNGDDMIALVPIDQEGSVIKASF

GPMTIRRAFSEHLDITNPVRKSVLRALAESTTNEEEKKRLLYLATEEANEEYNKYIKNDF

RGVVDLLESFPGLQPLIAHFLEFTPRLPARMYSISSSPHNKNGVVSITSVVVNFTTGNQR

AHNGVASTWLSHLKVGDKVPLFVRESHFKLPSAATEQKPVIMVGPGTGLAPFRGFLQELQ

HRNHSQQQQSLLFFGCRSDTVDYIYREELEQYHQSSVLGDLVVAFSRKTSQKVYVQNKLL

EHKEKVWELLNKGAYFYVCGDGRNMSKAVQQALLSIIKEFGSKDDNSAQQFIDDMSSHGR

YLQDVWF

 

>aa-ng8085        Contig_3339     pred cdna   pORF 1808           75 strand r start n stop y ph 0

LKKNISFRKNEPPRGGVAFPIFGDLPKLGENPHRYLTNLAMKKGGIYSVWLGDEKVFILT

DPEAVRDAWVKQFRNFSDHPKTKSVRIFSGNFNDMAFAEYSQWKINSKWVSSAFTKTKLK

TIGDLIEKESNYFIEHLKAYSNSGQPIFPKPYISKFGINVISGMMFSQVISKDESVDKGA

MEKLTVPIQAVFKRLGADNLDDFISILQPVFYFQNEKFKRQVQEIYDYLEGIYNQHDTNL

DTENPKDLMDLLIISTEGKERDMIIHIGMDCLLAGSDSTSATCEWFCLFMINNPDVQKKA

YQELINALKDEDNKKFIPISKKDNCPYMLSIFKEVLRLRPVGVLGIPRVALEETTIMGYT

IPKGSQIFQNVYGMSHLFVSDPYKFKPERWIEYK KQKDLLKEKEMQQLQEGADVVIDNKN

NIKNNNSSNKPNSKTNSIFDD LDKVSIPYSVGNRKCPGASLSELALFSLCSNILLNFELK

SIDGKPIDDTEVYGLTIHTKVHPISLTLRP

 

Add large insert and N-terminal seq

 

>CYP519A1 Seq. 12 32% to 508 complete sequence

          Length = 484 joint at end differs N-term missing

 

 Score = 2085 (734.0 bits), Expect = 8.8e-255, Sum P(2) = 8.8e-255

 Identities = 395/397 (99%), Positives = 395/397 (99%)

 

Query:     1 LKKNISFRKNEPPRGGVAFPIFGDLPKLGENPHRYLTNLAMKKGGIYSVWLGDEKVFILT 60

             LKKNISFRKNEPPRGGVAFPIFGDLPKLGENPHRYLTNLAMKKGGIYSVWLGDEKVFILT

Sbjct:    21 LKKNISFRKNEPPRGGVAFPIFGDLPKLGENPHRYLTNLAMKKGGIYSVWLGDEKVFILT 80

 

Query:    61 DPEAVRDAWVKQFRNFSDHPKTKSVRIFSGNFNDMAFAEYSQWKINSKWVSSAFTKTKLK 120

             DPEAVRDAWVKQFRNFSDHPKTKSVRIFSGNFNDMAFAEYSQWKINSKWVSSAFTKTKLK

Sbjct:    81 DPEAVRDAWVKQFRNFSDHPKTKSVRIFSGNFNDMAFAEYSQWKINSKWVSSAFTKTKLK 140

 

Query:   121 TIGDLIEKESNYFIEHLKAYSNSGQPIFPKPYISKFGINVISGMMFSQVISKDESVDKGA 180

             TIGDLIEKESNYFIEHLKAYSNSGQPIFPKPYISKFGINVISGMMFSQVISKDESVDKGA

Sbjct:   141 TIGDLIEKESNYFIEHLKAYSNSGQPIFPKPYISKFGINVISGMMFSQVISKDESVDKGA 200

 

Query:   181 MEKLTVPIQAVFKRLGADNLDDFISILQPVFYFQNEKFKRQVQEIYDYLEGIYNQHDTNL 240

             MEKLTVPIQAVFKRLGADNLDDFISILQPVFYFQNEKFKRQVQEIYDYLEGIYNQHDTNL

Sbjct:   201 MEKLTVPIQAVFKRLGADNLDDFISILQPVFYFQNEKFKRQVQEIYDYLEGIYNQHDTNL 260

 

Query:   241 DTENPKDLMDLLIISTEGKERDMIIHIGMDCLLAGSDSTSATCEWFCLFMINNPDVQKKA 300

             DTENPKDLMDLLIISTEGKERDMIIHIGMDCLLAGSDSTSATCEWFCLFMINNPDVQKKA

Sbjct:   261 DTENPKDLMDLLIISTEGKERDMIIHIGMDCLLAGSDSTSATCEWFCLFMINNPDVQKKA 320

 

Query:   301 YQELINALKDEDNKKFIPISKKDNCPYMLSIFKEVLRLRPVGVLGIPRVALEETTIMGYT 360

             YQELINALKDEDNKKFIPISKKDNCPYMLSIFKEVLRLRPVGVLGIPRVALEETTIMGYT

Sbjct:   321 YQELINALKDEDNKKFIPISKKDNCPYMLSIFKEVLRLRPVGVLGIPRVALEETTIMGYT 380

 

Query:   361 IPKGSQIFQNVYGMSHLFVSDPYKFKPERWIEYKKQK 397

             IPKGSQIFQNVYGMSHLFVSDPYKFKPERWIEYK  K

Sbjct:   381 IPKGSQIFQNVYGMSHLFVSDPYKFKPERWIEYKLDK 417

 

 Score = 355 (125.0 bits), Expect = 8.8e-255, Sum P(2) = 8.8e-255

 Identities = 69/69 (100%), Positives = 69/69 (100%)

 

Query:   442 LDKVSIPYSVGNRKCPGASLSELALFSLCSNILLNFELKSIDGKPIDDTEVYGLTIHTKV 501

             LDKVSIPYSVGNRKCPGASLSELALFSLCSNILLNFELKSIDGKPIDDTEVYGLTIHTKV

Sbjct:   415 LDKVSIPYSVGNRKCPGASLSELALFSLCSNILLNFELKSIDGKPIDDTEVYGLTIHTKV 474

 

Query:   502 HPISLTLRP 510

             HPISLTLRP

Sbjct:   475 HPISLTLRP 483

 

>aa-ng8864        Contig_3673     pred cdna   pORF 2489          854 strand r start n stop y ph 0

YKKNKRYSKSPNKEANGPWSLPIIGGLHLIGDRPNRSFSELSKIYGGIYKIWLAERMLMI

VTDPEIIQDIWIKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDEWNKVRPKMTCHFTKIK

LNSTKPKQIVNDQLKKMLKIMTTHSLDSKPFNQYVYLNTYSMNIILGLMLSIELPHSNSN

DKDGQFSKVLHSIDEIFKSIGTNGPEDIFPTLLPFFKNRISTFTNHLNVIKDFIRSIYKQ

QIKTFDINIEPRNIMDCLISEYYEDDDQEDEVAKQELIIQLCIDMLVAATDTSASTLEWF

MLFMINNPNLQEDLYEEVVNVVGKDCPYVTFDDVPKLALIKACYFEILRIRPVTSLSLPR

VSMEDTTTLNDIFIPKDTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLN QFGNR

DDSIDTTNTTTNTTLNGTTS KYYNDLERVSIPFGVGKRRCMAPSMADHNVLIAMANIVLN

FTMKSSDPKQMPLSEEEQYAITIKPKYPFKVLFEKRS

 

>CYP520A1 seq (47+68+80) complete 47% to seq 58 517 aa

          Length = 517 n_term missing Joint at end differs 6aa diffs

 

 Score = 2164 (761.8 bits), Expect = 3.8e-265, Sum P(2) = 3.8e-265

 Identities = 412/415 (99%), Positives = 414/415 (99%)

 

Query:     1 YKKNKRYSKSPNKEANGPWSLPIIGGLHLIGDRPNRSFSELSKIYGGIYKIWLAERMLMI 60

             +KKNKRYSKSPNKEANGPWSLPIIGGLHLIGDRPNRSFSELSKIYGGIYKIWLAERMLMI

Sbjct:    24 FKKNKRYSKSPNKEANGPWSLPIIGGLHLIGDRPNRSFSELSKIYGGIYKIWLAERMLMI 83

 

Query:    61 VTDPEIIQDIWIKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDEWNKVRPKMTCHFTKIK 120

             VTDP+IIQDI IKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDEWNKVRPKMTCHFTKIK

Sbjct:    84 VTDPKIIQDIGIKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDEWNKVRPKMTCHFTKIK 143

 

Query:   121 LNSTKPKQIVNDQLKKMLKIMTTHSLDSKPFNQYVYLNTYSMNIILGLMLSIELPHSNSN 180

             LNSTKPKQIVNDQLKKMLKIMTTHSLDSKPFNQYVYLNTYSMNIILGLMLSIELPHSNSN

Sbjct:   144 LNSTKPKQIVNDQLKKMLKIMTTHSLDSKPFNQYVYLNTYSMNIILGLMLSIELPHSNSN 203

 

Query:   181 DKDGQFSKVLHSIDEIFKSIGTNGPEDIFPTLLPFFKNRISTFTNHLNVIKDFIRSIYKQ 240

             DKDGQFSKVLHSIDEIFKSIGTNGPEDIFPTLLPFFKNRISTFTNHLNVIKDFIRSIYKQ

Sbjct:   204 DKDGQFSKVLHSIDEIFKSIGTNGPEDIFPTLLPFFKNRISTFTNHLNVIKDFIRSIYKQ 263

 

Query:   241 QIKTFDINIEPRNIMDCLISEYYEDDDQEDEVAKQELIIQLCIDMLVAATDTSASTLEWF 300

             QIKTFDINIEPRNIMDCLISEYYEDDDQEDEVAKQELIIQLCIDMLVAATDTSASTLEWF

Sbjct:   264 QIKTFDINIEPRNIMDCLISEYYEDDDQEDEVAKQELIIQLCIDMLVAATDTSASTLEWF 323

 

Query:   301 MLFMINNPNLQEDLYEEVVNVVGKDCPYVTFDDVPKLALIKACYFEILRIRPVTSLSLPR 360

             MLFMINNPNLQEDLYEEVVNVVGKDCPYVTFDDVPKLALIKACYFEILRIRPVTSLSLPR

Sbjct:   324 MLFMINNPNLQEDLYEEVVNVVGKDCPYVTFDDVPKLALIKACYFEILRIRPVTSLSLPR 383

 

Query:   361 VSMEDTTTLNDIFIPKDTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLN 415

             VSMEDTTTLNDIFIPKDTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLN

Sbjct:   384 VSMEDTTTLNDIFIPKDTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLN 438

 

 Score = 374 (131.7 bits), Expect = 3.8e-265, Sum P(2) = 3.8e-265

 Identities = 74/78 (94%), Positives = 74/78 (94%)

 

Query:   441 KYYNDLERV-SIPFGVGKRRCMAPSMADHNVLIAMANIVLNFTMKSSDPKQMPLSEEEQY 499

             KYYNDLERV SIPFGVGKRRCMAPSMADHNV IAMANIVLNF MKSSDPKQMPLSEE QY

Sbjct:   440 KYYNDLERVISIPFGVGKRRCMAPSMADHNVFIAMANIVLNFAMKSSDPKQMPLSEEGQY 499

 

Query:   500 AITIKPKYPFKVLFEKRS 517

             AITIKPKYPFKVLFEKRS

Sbjct:   500 AITIKPKYPFKVLFEKRS 517

 

>IIAFP1D30759

        Length = 967

 

  Plus Strand HSPs:

 

 Score = 248 (92.4 bits), Expect = 2.4e-20, P = 2.4e-20

 Identities = 51/62 (82%), Positives = 52/62 (83%), Frame = +3

 

Query:     1 VTDPEIIQDIWIKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDEWN-KVRPKMTCHFT-K 58

             VTDPEIIQDIWIKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDE      PKMT  F  K

Sbjct:   600 VTDPEIIQDIWIKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDEME*S*DPKMTLSFPLK 779

 

>JC1a198c01.s1 Clone JC1a198c01, standard read, bases 84 through 533, from

            2000-05-12

        Length = 448

 

  Minus Strand HSPs:

 

 Score = 729 (261.7 bits), Expect = 5.6e-71, P = 5.6e-71

 Identities = 139/139 (100%), Positives = 139/139 (100%), Frame = -1

 

Query:    17 DTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLNQFGNRDDSIDTTNTTTNTTLN 76

             DTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLNQFGNRDDSIDTTNTTTNTTLN

Sbjct:   448 DTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLNQFGNRDDSIDTTNTTTNTTLN 269

 

Query:    77 GTTSKYYNDLERVSIPFGVGKRRCMAPSMADHNVLIAMANIVLNFTMKSSDPKQMPLSEE 136

             GTTSKYYNDLERVSIPFGVGKRRCMAPSMADHNVLIAMANIVLNFTMKSSDPKQMPLSEE

Sbjct:   268 GTTSKYYNDLERVSIPFGVGKRRCMAPSMADHNVLIAMANIVLNFTMKSSDPKQMPLSEE 89

 

Query:   137 EQYAITIKPKYPFKVLFEK 155

             EQYAITIKPKYPFKVLFEK

Sbjct:    88 EQYAITIKPKYPFKVLFEK 32

 

 

 

>JC2c167d08.r1 Clone JC2c167d08, reverse read, bases 38 through 462, from

            2000-01-18

        Length = 423

 

  Plus Strand HSPs:

 

 Score = 615 (221.5 bits), Expect = 7.2e-59, P = 7.2e-59

 Identities = 117/117 (100%), Positives = 117/117 (100%), Frame = +2

 

Query:    41 NPSRWLEYKKMKDLNQFGNRDDSIDTTNTTTNTTLNGTTSKYYNDLERVSIPFGVGKRRC 100

             NPSRWLEYKKMKDLNQFGNRDDSIDTTNTTTNTTLNGTTSKYYNDLERVSIPFGVGKRRC

Sbjct:     2 NPSRWLEYKKMKDLNQFGNRDDSIDTTNTTTNTTLNGTTSKYYNDLERVSIPFGVGKRRC 181

 

Query:   101 MAPSMADHNVLIAMANIVLNFTMKSSDPKQMPLSEEEQYAITIKPKYPFKVLFEKRS 157

             MAPSMADHNVLIAMANIVLNFTMKSSDPKQMPLSEEEQYAITIKPKYPFKVLFEKRS

Sbjct:   182 MAPSMADHNVLIAMANIVLNFTMKSSDPKQMPLSEEEQYAITIKPKYPFKVLFEKRS 352

 

>aa-ng8897        Contig_3681     pred cdna   pORF 9565        10869 strand f start y stop y ph 0

MVSDYKIIEELFIKNHNSFLERPITPSFSHCSDNQNGILLSNEKWVNNREMIQKAMKKGI

VKSVYELLNKQINDLIQSVKPFSESGEPLNFRLFATRFTLSTMMTYIFNEPLSYNENLEN

GTVFFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNEKEENYDLLHTFIKEVGIK

DKESVKSIVSNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELIFKELKSIVDSCGRVED

RIYLSDRSSTPYLNATIKESMRITPITPYGIPRVVGKDLIIKGHFIPKGSHVVINYRALN

HDESIYKNPNQFNPNRFLDNNIESFIPFSIGNRNCVGQQIANHELFLFVSNFILNYKITP

ITNEPIDTTENFGINIRPNSFKINVYKRN

N-term missing

Deleted one seq as an intron between GTV and FLKP

There is a frameshift there

Two other small diffs

 

>CYP508E1 seq (71+85) complete seq 38% to 508A1

          Length = 481

 

 Score = 1317 (463.6 bits), Expect = 3.0e-138, P = 3.0e-138

 Identities = 258/267 (96%), Positives = 260/267 (97%)

 

Query:   123 VFFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNEKEENYDLLHTFIKEVGIKDK 182

             + FLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNE     DLLHTFIKEVGIKDK

Sbjct:   218 ISFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNE-----DLLHTFIKEVGIKDK 272

 

Query:   183 ESVKSIV-SNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELIFKELKSIVDSCGRVEDR 241

             ESVKS+  SNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELIFKELKSIVDSCGRVEDR

Sbjct:   273 ESVKSMFHSNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELIFKELKSIVDSCGRVEDR 332

 

Query:   242 IYLSDRSSTPYLNATIKESMRITPITPYGIPRVVGKDLIIKGHFIPKGSHVVINYRALNH 301

             IYLSDRSSTPYLNATIKESMRITPITPYGIPRVVGKDLIIKGHFIPKGSHVVINYRALNH

Sbjct:   333 IYLSDRSSTPYLNATIKESMRITPITPYGIPRVVGKDLIIKGHFIPKGSHVVINYRALNH 392

 

Query:   302 DESIYKNPNQFNPNRFLDNNIESFIPFSIGNRNCVGQQIANHELFLFVSNFILNYKITPI 361

             DESIYKNPNQFNPNRFLDNNIESFIPFSIGNRNCVGQQIANHELFLFVSNFILNYKITPI

Sbjct:   393 DESIYKNPNQFNPNRFLDNNIESFIPFSIGNRNCVGQQIANHELFLFVSNFILNYKITPI 452

 

Query:   362 TNEPIDTTENFGINIRPNSFKINVYKRN 389

             TNEPIDTTENFGINIRPNSFKINVYKRN

Sbjct:   453 TNEPIDTTENFGINIRPNSFKINVYKRN 480

 

 Score = 715 (251.7 bits), Expect = 1.8e-74, P = 1.8e-74

 Identities = 156/204 (76%), Positives = 166/204 (81%)

 

Query:     1 MVSDYKIIEELFIKNHNSFLERPITPSFSHCSDNQNGILLSNEKWVNNREMIQKAMKKGI 60

             MVSDYKIIEELFIKNHNSFLERPITPSFSHCSDNQNGILLSNEKWVNNREMIQKAMKKGI

Sbjct:    75 MVSDYKIIEELFIKNHNSFLERPITPSFSHCSDNQNGILLSNEKWVNNREMIQKAMKKGI 134

 

Query:    61 VKSVYELLNKQINDLIQSVKPFSESGEPLNFRLFATRFTLSTMMTYIFNEPLSYNENLEN 120

             VKSVYELLNKQINDLIQSVKPFSESGEPLNFRLFATRFTLSTMMTYIFNEPLSYNENLEN

Sbjct:   135 VKSVYELLNKQINDLIQSVKPFSESGEPLNFRLFATRFTLSTMMTYIFNEPLSYNENLEN 194

 

Query:   121 GTVF--FLKPIYSL-----------FLK-LTDSNVPLARDYV-YKKFYEHLKTIDNEKE- 164

             GTV   FL+ I+ L           FLK +    + L    V   + Y + K  ++ K 

Sbjct:   195 GTVLYMFLENIFVLADVGHIGDYISFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTI 254

 

Query:   165 ENYDLLHTFIKEVGIKDKESVKSI 188

             +N DLLHTFIKEVGIKDKESVKS+

Sbjct:   255 DNEDLLHTFIKEVGIKDKESVKSM 278

 

>JC1c218c07.r1 Clone JC1c218c07, reverse read, bases 61 through 714, from

            2001-01-11

        Length = 652

 

  Plus Strand HSPs:

 

 Score = 311 (114.5 bits), Expect = 8.6e-79, Sum P(3) = 8.6e-79

 Identities = 65/68 (95%), Positives = 66/68 (97%), Frame = +2

 

Query:   101 NYDLLHTFIKEVGIKDKESVKSMFHSNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELI 160

             NYDLLHTFIKEVGIKDKESVKS+  SNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELI

Sbjct:   452 NYDLLHTFIKEVGIKDKESVKSIV-SNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELI 628

 

Query:   161 FKELKSIV 168

             FKELKSIV

Sbjct:   629 FKELKSIV 652

 

 Score = 307 (113.1 bits), Expect = 8.6e-79, Sum P(3) = 8.6e-79

 Identities = 59/59 (100%), Positives = 59/59 (100%), Frame = +3

 

Query:    42 LENIFVLADVGHIGDYISFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNEKEE 100

             LENIFVLADVGHIGDYISFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNEKEE

Sbjct:   201 LENIFVLADVGHIGDYISFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNEKEE 377

 

 Score = 215 (80.7 bits), Expect = 8.6e-79, Sum P(3) = 8.6e-79

 Identities = 41/41 (100%), Positives = 41/41 (100%), Frame = +2

 

Query:     1 EPLNFRLFATRFTLSTMMTYIFNEPLSYNENLENGTVLYMF 41

             EPLNFRLFATRFTLSTMMTYIFNEPLSYNENLENGTVLYMF

Sbjct:    80 EPLNFRLFATRFTLSTMMTYIFNEPLSYNENLENGTVLYMF 202

 

>JC1c218c07.r1 Clone JC1c218c07, reverse read, bases 61 through 714, from 2001-01-11 translate frame +1 translate plus frames translate all frames

TGTTAAAAGTGTTTATGAATTATTAAATAAACAAATCAATGATTTAATTCAATCAGTTAA

ACCATTTTCAGAAAGTGGTGAACCATTAAACTTTCGTTTATTTGCCACAAGATTTACATT

ATCAACTATGATGACTTATATTTTCAATGAACCATTATCATATAATGAAAATTTAGAAAA

TGGGACAGTTTTGTACATGTTTAGAAAATATTTTTGTCTTGGCTGATGTTGGTCATATTG

GTGATTATATTAGTTTCTTAAAACCAATTTATTCTTTATTTTTAAAATTAACAGATTCAA

ATGTTCCATTAGCAAGAGATTATGTATATAAAAAATTTTATGAACATTTAAAAACAATTG

ATAATGAAAAAGAAGAGTAAATATATATTTAAAAAATTTATTTTTTTATTATTATTATTC

TAATAATTTTTTTTTTTTAAAACTTTTTAGAAATTATGATTTATTACATACATTTATTAA

AGAAGTTGGAATCAAAGATAAAGAAAGTGTTAAATCAATTGTTTCAAATTCATTAGATTT

ATTAGTTGCAGGTACAGATACTGCTGCAAAAGGAATTGAATGGATTATATTAAGAATTGC

AAATAATCAAGATGTTCAAGAATTAATTTTTAAAGAATTAAAAAGTATTGTA

 

>aa-ng9107        Contig_3767     pred cdna   pORF 15219       13452 strand r start y stop y ph 0

MIILLLSIIIFILYIVKIFKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMF

KYGNVMTVYFANIKTVVISDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQY

WKNYRQILAQSFLKLRDHKGITEKISLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVI

MQLLYSHRSSYDLKGQHPIIDALKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQV

WSYSQDSIKEHREKLLINPEKIDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTE

TSSSTMSWLLLFLINNPNFQDKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRP

VGALSLPRIASEDVTCGPYTIEKGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQS

TYSFIPFGCGSRVCVGSSLARDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDY

NK

 

>CYP516A1 seq 9, 43 complete seq 39% to 10, 42; 30% to 508A1

          Length = 485 first intron joint differs

 

 Score = 2465 (867.7 bits), Expect = 6.6e-260, P = 6.6e-260

 Identities = 478/481 (99%), Positives = 478/481 (99%)

 

Query:     1 MIILLLSIIIFILYIVKIFKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMF 60

             MIILLLSIIIFILYI   FKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMF

Sbjct:     1 MIILLLSIIIFILYI---FKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMF 57

 

Query:    61 KYGNVMTVYFANIKTVVISDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQY 120

             KYGNVMTVYFANIKTVVISDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQY

Sbjct:    58 KYGNVMTVYFANIKTVVISDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQY 117

 

Query:   121 WKNYRQILAQSFLKLRDHKGITEKISLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVI 180

             WKNYRQILAQSFLKLRDHKGITEKISLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVI

Sbjct:   118 WKNYRQILAQSFLKLRDHKGITEKISLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVI 177

 

Query:   181 MQLLYSHRSSYDLKGQHPIIDALKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQV 240

             MQLLYSHRSSYDLKGQHPIIDALKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQV

Sbjct:   178 MQLLYSHRSSYDLKGQHPIIDALKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQV 237

 

Query:   241 WSYSQDSIKEHREKLLINPEKIDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTE 300

             WSYSQDSIKEHREKLLINPEKIDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTE

Sbjct:   238 WSYSQDSIKEHREKLLINPEKIDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTE 297

 

Query:   301 TSSSTMSWLLLFLINNPNFQDKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRP 360

             TSSSTMSWLLLFLINNPNFQDKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRP

Sbjct:   298 TSSSTMSWLLLFLINNPNFQDKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRP 357

 

Query:   361 VGALSLPRIASEDVTCGPYTIEKGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQS 420

             VGALSLPRIASEDVTCGPYTIEKGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQS

Sbjct:   358 VGALSLPRIASEDVTCGPYTIEKGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQS 417

 

Query:   421 TYSFIPFGCGSRVCVGSSLARDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDY 480

             TYSFIPFGCGSRVCVGSSLARDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDY

Sbjct:   418 TYSFIPFGCGSRVCVGSSLARDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDY 477

 

Query:   481 N 481

             N

Sbjct:   478 N 478

 

>aa-ng12432        Contig_5098     pred cdna   pORF 64            307 strand f start n stop y ph 2 = 519D1

NSKHKYYNDLDKISIPFSTGRRACVGGQLGEAELYIVCANLVYNFKIESWDGKKINELED

FGIIIHPSSHNLKITKRNNK

 

>CYP519D1 Seq. (14+34) complete seq 38% to seq 12 492 aa

          Length = 493 2 diffs N-term missing

 

 Score = 406 (142.9 bits), Expect = 1.0e-41, P = 1.0e-41

 Identities = 76/78 (97%), Positives = 76/78 (97%)

 

Only one seq supports ng12432 = IIAFP1D52853, three seqs support 519D1 SSI404 IIAFP1D78046 JC3b13f10.s1 so keep seq as is

 

Query:     3 KHKYYNDLDKISIPFSTGRRACVGGQLGEAELYIVCANLVYNFKIESWDGKKINELEDFG 62

             KHKYYNDLDKISIPFSTGRR CVG QLGEAELYIVCANLVYNFKIESWDGKKINELEDFG

Sbjct:   415 KHKYYNDLDKISIPFSTGRRGCVGVQLGEAELYIVCANLVYNFKIESWDGKKINELEDFG 474

 

Query:    63 IIIHPSSHNLKITKRNNK 80

             IIIHPSSHNLKITKRNNK

Sbjct:   475 IIIHPSSHNLKITKRNNK 492

 

 

>aa-ng10297        Contig_4318     pred cdna   pORF 525          2131 strand f start y stop y ph 0

MIIIVIVVFLFYFSFLNLNLNPKKKRPPSPITLPVIGNLISLLNNQPQNILFNYYKKYGK

IYQLQYGIVNTVVLSEFDILKEAFIENGEVFIERYNKITKKFKSSENIVNSNGLIWKKLQ

SISIQELSPNIKIKKYEPMIINETNKLIDSFNEHIKSNESIDPTLNIKICFLNIIISFLF

NFRYNDYKDEKVIQLVDYIHSIFRMGSHPIPQDYIPILNKFYINKTTKIHQKIFENIYEY

IENQVQKRLEILNKNNNNNNNNINECFVDLLLLKFKSNLLTWNEVIKTTTDLMIAGSDTN

SLFTIHLIIALTNRENIQNKVFNEILNFYILNENNKITFSNKSKTPYYNSVLKEVERRFT

VSPLSQPHRTNKDIILNGYFIPSGSQIIQNVYSCHLNDKDWENPFQFNPDRFLNNNQLEK

KLITFGMGPRNCLGFQFALMSIWIVNLILFKSIKFSSNKLIEEEIREGGTTLSPFPFKIN

LIKR

 

>CYP555A1 seq 76 41% to seq 4

         Length = 486 4 diffs at two locations

 

 Score = 2495 (878.3 bits), Expect = 4.4e-263, P = 4.4e-263

 Identities = 481/485 (99%), Positives = 482/485 (99%)

 

Query:     1 MIIIVIVVFLFYFSFLNLNLNPKKKRPPSPITLPVIGNLISLLNNQPQNILFNYYKKYGK 60

             MIIIVIVVFLFYFSFLNLNLNPKKKRPPSPITLPVIGNLISLLNNQPQNILFNYYKKYGK

Sbjct:     1 MIIIVIVVFLFYFSFLNLNLNPKKKRPPSPITLPVIGNLISLLNNQPQNILFNYYKKYGK 60

 

Query:    61 IYQLQYGIVNTVVLSEFDILKEAFIENGEVFIERYNKITKKFKS-SENIVNSNGLIWKKL 119

             IYQLQYGIVNTVVLSEFDILKEAFIENGEVFIERYNKITKKFK  +ENIVNSNGLIWKKL

Sbjct:    61 IYQLQYGIVNTVVLSEFDILKEAFIENGEVFIERYNKITKKFKIIAENIVNSNGLIWKKL 120

 

Query:   120 QSISIQELSPNIKIKKYEPMIINETNKLIDSFNEHIKSNESIDPTLNIKICFLNIIISFL 179

             QSISIQELSPNIKIKKYEPMIINETNKLIDSFNEHIKSNESIDPTLNIKICFLNIIISFL

Sbjct:   121 QSISIQELSPNIKIKKYEPMIINETNKLIDSFNEHIKSNESIDPTLNIKICFLNIIISFL 180

 

Query:   180 FNFRYNDYKDEKVIQLVDYIHSIFRMGSHPIPQDYIPILNKFYINKTTKIHQKIFENIYE 239

             FNFRYNDYKDEKVIQLVDYIHSIFRMGSHPIPQDYIPILNKFYINKTTKIHQKIFENIYE

Sbjct:   181 FNFRYNDYKDEKVIQLVDYIHSIFRMGSHPIPQDYIPILNKFYINKTTKIHQKIFENIYE 240

 

Query:   240 YIENQVQKRLEILNKNNNNNNNNINECFVDLLLLKFKSNLLTWNEVIKTTTDLMIAGSDT 299

             YIENQVQKRLEILNKNNNNNNNNINECFVDLLLLKFKSNLLTWNEVIKTTTDLMIAGSDT

Sbjct:   241 YIENQVQKRLEILNKNNNNNNNNINECFVDLLLLKFKSNLLTWNEVIKTTTDLMIAGSDT 300

 

Query:   300 NSLFTIHLIIALTNRENIQNKVFNEILNFYILNENNKITFSNKSKTPYYNSVLKEVERRF 359

             NSLFTIHLIIALTNRENIQNKVFNEILNFYILNENNKITFSNKSKTPYYNSVLKEVERRF

Sbjct:   301 NSLFTIHLIIALTNRENIQNKVFNEILNFYILNENNKITFSNKSKTPYYNSVLKEVERRF 360

 

Query:   360 TVSPLSQPHRTNKDIILNGYFIPSGSQIIQNVYSCHLNDKDWENPFQFNPDRFLNNNQLE 419

             TVSPLSQPHRTNKDIILNGYFIPSGS IIQNVYSCHLNDKDWENPFQFNPDRFLNNNQLE

Sbjct:   361 TVSPLSQPHRTNKDIILNGYFIPSGSHIIQNVYSCHLNDKDWENPFQFNPDRFLNNNQLE 420

 

Query:   420 KKLITFGMGPRNCLGFQFALMSIWIVNLILFKSIKFSSNKLIEEEIREGGTTLSPFPFKI 479

             KKLITFGMGPRNCLGFQFALMSIWIVNLILFKSIKFSSNKLIEEEIREGGTTLSPFPFKI

Sbjct:   421 KKLITFGMGPRNCLGFQFALMSIWIVNLILFKSIKFSSNKLIEEEIREGGTTLSPFPFKI 480

 

Query:   480 NLIKR 484

             NLIKR

Sbjct:   481 NLIKR 485

 

>aa-ng10310        Contig_4323     pred cdna   pORF 4996         3612 strand r start y stop y ph 0

MEENFFDRPSTFVPTWRYMSSNGGGIMSSNDEKWKRAKTTFLKSLKIHGKKYLIEKKSIE

FVNSIEKFSNSNQVFYPKQYSQGFTSSIFFKYMFNEDISIDNKFLKEIGTAVGMVFTKNS

HLTVFDCFGILSPFYDLFFKFRLRPIEILKKTIDKQLTNHLNSMDSKMGDHQSRDIMDDL

LIEYGSLNEISNQDRIQINQICFDVMSTDIGTVATTIDWVLLQLCNRQDLQEIICNEIQD

TIKIKRNNVINDGGGGADTDNLFINLCDKQSIPYLIAFIKETMRVFSNGWSLPKTSKHDQ

ICANYFIPKGSILFINYFSIHLNEEFFKNPREFNPARYLDDSIPIPDLHFGIGQRGCPGR

FVAMDQVFLCIANTLLKYKIKSIDGKKIDDTIQFSVYLKPKDFGILLEKRNKLFVND

 

 

>CYP517A2 SEQ 74 66% to seq 3

          Length = 511 3 aa diffs missing N-term

 

 Score = 2186 (769.5 bits), Expect = 2.4e-230, P = 2.4e-230

 Identities = 413/416 (99%), Positives = 415/416 (99%)

 

Query:     2 EENFFDRPSTFVPTWRYMSSNGGGIMSSNDEKWKRAKTTFLKSLKIHGKKYLIEKKSIEF 61

             EENFFDRPSTFVPTWRYMSSNGGGIMSSNDEKWKRAKTTFLKSLKIHGKKYLIE+KSIEF

Sbjct:    95 EENFFDRPSTFVPTWRYMSSNGGGIMSSNDEKWKRAKTTFLKSLKIHGKKYLIERKSIEF 154

 

Query:    62 VNSIEKFSNSNQVFYPKQYSQGFTSSIFFKYMFNEDISIDNKFLKEIGTAVGMVFTKNSH 121

             VNS EKFSNSNQVFYPKQYSQGFTSSIFFKYMFNEDISIDNKFLKEIGTAVGMVFTKNSH

Sbjct:   155 VNSSEKFSNSNQVFYPKQYSQGFTSSIFFKYMFNEDISIDNKFLKEIGTAVGMVFTKNSH 214

 

Query:   122 LTVFDCFGILSPFYDLFFKFRLRPIEILKKTIDKQLTNHLNSMDSKMGDHQSRDIMDDLL 181

             LTVFDCFGILSPFYDLFF+FRLRPIEILKKTIDKQLTNHLNSMDSKMGDHQSRDIMDDLL

Sbjct:   215 LTVFDCFGILSPFYDLFFRFRLRPIEILKKTIDKQLTNHLNSMDSKMGDHQSRDIMDDLL 274

 

Query:   182 IEYGSLNEISNQDRIQINQICFDVMSTDIGTVATTIDWVLLQLCNRQDLQEIICNEIQDT 241

             IEYGSLNEISNQDRIQINQICFDVMSTDIGTVATTIDWVLLQLCNRQDLQEIICNEIQDT

Sbjct:   275 IEYGSLNEISNQDRIQINQICFDVMSTDIGTVATTIDWVLLQLCNRQDLQEIICNEIQDT 334

 

Query:   242 IKIKRNNVINDGGGGADTDNLFINLCDKQSIPYLIAFIKETMRVFSNGWSLPKTSKHDQI 301

             IKIKRNNVINDGGGGADTDNLFINLCDKQSIPYLIAFIKETMRVFSNGWSLPKTSKHDQI

Sbjct:   335 IKIKRNNVINDGGGGADTDNLFINLCDKQSIPYLIAFIKETMRVFSNGWSLPKTSKHDQI 394

 

Query:   302 CANYFIPKGSILFINYFSIHLNEEFFKNPREFNPARYLDDSIPIPDLHFGIGQRGCPGRF 361

             CANYFIPKGSILFINYFSIHLNEEFFKNPREFNPARYLDDSIPIPDLHFGIGQRGCPGRF

Sbjct:   395 CANYFIPKGSILFINYFSIHLNEEFFKNPREFNPARYLDDSIPIPDLHFGIGQRGCPGRF 454

 

Query:   362 VAMDQVFLCIANTLLKYKIKSIDGKKIDDTIQFSVYLKPKDFGILLEKRNKLFVND 417

             VAMDQVFLCIANTLLKYKIKSIDGKKIDDTIQFSVYLKPKDFGILLEKRNKLFVND

Sbjct:   455 VAMDQVFLCIANTLLKYKIKSIDGKKIDDTIQFSVYLKPKDFGILLEKRNKLFVND 510

 

Frameshift gone in ng10310

 

Correct the three aa diffs and use my N-term

 

Dict-IV-V896c06.plc supports these changes

 

>aa-ng12344        Contig_5063     pred cdna   pORF 5408         4653 strand r start y stop y ph 0

MIFFTSNSNKYIEKFKSGKNSKQSKTENKPIISKLLSLYENGEISWASVIGSCIDMQASS

TDTIMFCLIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV

IKETFRKHPVTAHETSNDVEFRGYKISKGTQIIKNIWATHRNEKIFQSPNSFIPERFLEE

LPNSNLVHFGVGVRDCMGKSLAESQIFTILASFINRYEFLNPNPSIPLNDVGKFGLAFSC

PQNKIIIKKRI

 

>CYP513A2 seq 38 54% to seq 8 n-term missing

          Length = 452

 

 Score = 1273 (448.1 bits), Expect = 1.4e-133, P = 1.4e-133

 Identities = 245/248 (98%), Positives = 245/248 (98%)

 

Query:     3 FFTSNSNKYIEKFKSGKNSKQSKTENKPIISKLLSLYENGEISWASVIGSCIDMQASSTD 62

             FF  NSNKYIEKFKSGKNSKQSKTENKPIISKLLSLYENGEISWASVIGSCIDMQASSTD

Sbjct:   203 FFYINSNKYIEKFKSGKNSKQSKTENKPIISKLLSLYENGEISWASVIGSCIDMQASSTD 262

 

Query:    63 TIMFCLIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMVIK 122

             TIMFCLIEL NLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMVIK

Sbjct:   263 TIMFCLIELPNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMVIK 322

 

Query:   123 ETFRKHPVTAHETSNDVEFRGYKISKGTQIIKNIWATHRNEKIFQSPNSFIPERFLEELP 182

             ETFRKHPVTAHETSNDVEFRGYKISKGTQIIKNIWATHRNEKIFQSPNSFIPERFLEELP

Sbjct:   323 ETFRKHPVTAHETSNDVEFRGYKISKGTQIIKNIWATHRNEKIFQSPNSFIPERFLEELP 382

 

Query:   183 NSNLVHFGVGVRDCMGKSLAESQIFTILASFINRYEFLNPNPSIPLNDVGKFGLAFSCPQ 242

             NSNLVHFGVGVRDCMGKSLAESQIFTILASFINRYEFLNPNPSIPLNDVGKFGLAFSCPQ

Sbjct:   383 NSNLVHFGVGVRDCMGKSLAESQIFTILASFINRYEFLNPNPSIPLNDVGKFGLAFSCPQ 442

 

Query:   243 NKIIIKKR 250

             NKIIIKKR

Sbjct:   443 NKIIIKKR 450

 

T supported by two sequences P by only one.  T is the consensus

 

>JC1a87f04.r1 Clone JC1a87f04, reverse read, bases 46 through 667, from

            2000-04-13

        Length = 620

 

  Plus Strand HSPs:

 

 Score = 580 (209.2 bits), Expect = 2.5e-55, P = 2.5e-55

 Identities = 113/113 (100%), Positives = 113/113 (100%), Frame = +1

 

Query:     1 SNKYIEKFKSGKNSKQSKTENKPIISKLLSLYENGEISWASVIGSCIDMQASSTDTIMFC 60

             SNKYIEKFKSGKNSKQSKTENKPIISKLLSLYENGEISWASVIGSCIDMQASSTDTIMFC

Sbjct:   229 SNKYIEKFKSGKNSKQSKTENKPIISKLLSLYENGEISWASVIGSCIDMQASSTDTIMFC 408

 

Query:    61 LIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV 113

             LIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV

Sbjct:   409 LIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV 567

 

 

 

>JAX4a21c04.r1 Clone JAX4a21c04, reverse read, bases 45 through 797, from

            1998-11-11

        Length = 751

 

  Minus Strand HSPs:

 

 Score = 449 (163.1 bits), Expect = 1.6e-41, P = 1.6e-41

 Identities = 89/95 (93%), Positives = 89/95 (93%), Frame = -1

 

Query:    19 TENKPIISKLLSLYENGEISWASVIGSCIDMQASSTDTIMFCLIELTNLKNIQNKVYNEI 78

             T   PIISKLLSLYENGEISWASVIGSCIDMQASSTDTIMFCLIEL NLKNIQNKV NEI

Sbjct:   730 TRK*PIISKLLSLYENGEISWASVIGSCIDMQASSTDTIMFCLIELPNLKNIQNKVXNEI 551

 

Query:    79 KQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV 113

             KQEIK QNINNNNNEDEVLIQYNKYRNSLPYFSMV

Sbjct:   550 KQEIKXQNINNNNNEDEVLIQYNKYRNSLPYFSMV 446

 

 

 

>IIAFP2D52585

        Length = 809

 

  Plus Strand HSPs:

 

 Score = 282 (104.3 bits), Expect = 7.3e-24, P = 7.3e-24

 Identities = 54/54 (100%), Positives = 54/54 (100%), Frame = +2

 

Query:    60 CLIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV 113

             CLIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV

Sbjct:    50 CLIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV 211

 

>aa-ng10949        Contig_4574     pred cdna   pORF 2253          638 strand r start n stop y ph 2

NELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVVLSDPILIREMF

VNNGDYFLDRPKIPSIRHATHYHGIATSSGEYWLKIRDIINKAMRKTNLKLIYDSLDQQV

DNLIESMNKIESDGQVFEPRIYFKKYTMAAMYKFIFNEEINFNNEISELIGPIEQVFKDL

GSGSLFDVLLISRPLYYQWIEHTDKNYPKILNFLKKKYHQHLKTYNPEIQRDLLDLLIKE

YYSGSDDDILTIIATINDLFLAGTDTSSASLEYMVMMLVNYPEIQEKVYDEIKLTVNGRN

KVLLSDRQFTPYTVSFIKETLRYKPPSSVGVPRTTSQDIIIGDKFIPKDAQIFINYYGLS

RNQDYFENPEQFEPSRFMNPDTNIAFLPFSIGTRNCVGQNFALDEMFLAFSNIILNFKFS

SIDGKQIDETELYGVTLRCKNKFNVSIKKRI

 

>CYP508A3 seq 21+67+72+82+87 57% to 508A1 458aa

          Length = 477 missing N-term two other small diffs

 

Correct the two diffs add my N-term

 

 Score = 2319 (816.3 bits), Expect = 2.0e-244, P = 2.0e-244

 Identities = 446/451 (98%), Positives = 446/451 (98%)

 

Query:     1 NELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVVLSDPILIREMF 60

             NELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVVLSDPILIREMF

Sbjct:    29 NELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVVLSDPILIREMF 88

 

Query:    61 VNNGDYFLDRPKIPSIRHATHYHGIATSSGEYWLKIRDIINKAMRKTNLKLIYDSLDQQV 120

             VNNGDYFLDRPKIPSIRHATHYH    SSGEYWLKIRDIINKAMRKTNLKLIYDSLDQQV

Sbjct:    89 VNNGDYFLDRPKIPSIRHATHYHA---SSGEYWLKIRDIINKAMRKTNLKLIYDSLDQQV 145

 

Query:   121 DNLIESMNKIESDGQVFEPRIYFKKYTMAAMYKFIFNEEINFNNEISELIGPIEQVFKDL 180

             DNLIESMNKIESDGQVFEPRIYFKKYTMAAMYKFIFNEEINFNNEISELIGPIEQVFKDL

Sbjct:   146 DNLIESMNKIESDGQVFEPRIYFKKYTMAAMYKFIFNEEINFNNEISELIGPIEQVFKDL 205

 

Query:   181 GSGSLFDVLLISRPLYYQWIEHTDKNYPKILNFLKKKYHQHLKTYNPEIQRDLLDLLIKE 240

             GSGSLFDVLLISRPLYYQWIEHTDKNYPKILNFLKKKYHQHLKTYNPEIQRDLLDLLIKE

Sbjct:   206 GSGSLFDVLLISRPLYYQWIEHTDKNYPKILNFLKKKYHQHLKTYNPEIQRDLLDLLIKE 265

 

Query:   241 YYSGSDDDILTIIATINDLFLAGTDTSSASLEYMVMMLVNYPEIQEKVYDEIKLTVNGRN 300

             YYSGSDDDILTIIATINDLFLAG DTSSASLEYMVMMLVNYPEIQEKVYDEIKLTVNGRN

Sbjct:   266 YYSGSDDDILTIIATINDLFLAGLDTSSASLEYMVMMLVNYPEIQEKVYDEIKLTVNGRN 325

 

Query:   301 KVLLSDRQFTPYTVSFIKETLRYKPPSSVGVPRTTSQDIIIGDKFIPKDAQIFINYYGLS 360

             KVLLSDRQFTPYTVSFIKETLRYKPPSSVGVPRTTSQDIIIGDKFIPKDAQIFINYYGLS

Sbjct:   326 KVLLSDRQFTPYTVSFIKETLRYKPPSSVGVPRTTSQDIIIGDKFIPKDAQIFINYYGLS 385

 

Query:   361 RNQDYFENPEQFEPSRFMNPDTNIAFLPFSIGTRNCVGQNFALDEMFLAFSNIILNFKFS 420

             RNQDYFENPEQFEPSRFMNPDTNIAFLPFSIGTRNCVGQNFALDEMFLAFSNIILNFKFS

Sbjct:   386 RNQDYFENPEQFEPSRFMNPDTNIAFLPFSIGTRNCVGQNFALDEMFLAFSNIILNFKFS 445

 

Query:   421 SIDGKQIDETELYGVTLRCKNKFNVSIKKRI 451

             SIDGKQIDETELYGVTLRCKNKFNVSIKKRI

Sbjct:   446 SIDGKQIDETELYGVTLRCKNKFNVSIKKRI 476

 

>JC2d95b07.s1 Clone JC2d95b07, standard read, bases 124 through 395, from

            2000-01-07

        Length = 270

 

  Plus Strand HSPs:

 

 Score = 147 (56.8 bits), Expect = 4.4e-09, P = 4.4e-09

 Identities = 26/27 (96%), Positives = 27/27 (100%), Frame = +3

 

Query:     1 VNNGDYFLDRPKIPSIRHATHYHGIAT 27

             VNNGDYFLDRPKIPSIRHATHYHGIA+

Sbjct:    90 VNNGDYFLDRPKIPSIRHATHYHGIA agt 170

 

>VFJ134 (VFJ134Q) /pub/dna_csm/LIBRARY/VF/VFJ1-B/VFJ134Q.Seq.d/

        Length = 671

 

  Plus Strand HSPs:

 

 Score = 289 (106.8 bits), Expect = 1.6e-24, P = 1.6e-24

 Identities = 63/75 (84%), Positives = 66/75 (88%), Frame = +2

 

Query:    46 KT-NLKLIYDSLDQQV--YYSGSDDDILTIIATINDLFLAGTDTSSASLEYMVMMLVNYP 102

             KT N ++  D LD  +  YYSGSDDDILTIIATINDLFLAGTDTSSASLEYMVMMLVNYP

Sbjct:    41 KTYNPEIQRDLLDLLIKEYYSGSDDDILTIIATINDLFLAGTDTSSASLEYMVMMLVNYP 220

 

Query:   103 EIQEKVYDEIKLTVN 117

             EIQEKVYDEIKLTVN

Sbjct:   221 EIQEKVYDEIKLTVN 265

 

>JC2b306d10.r1 Clone JC2b306d10, reverse read, bases 37 through 603, from

            2000-03-15

        Length = 565

 

  Plus Strand HSPs:

 

 Score = 168 (64.2 bits), Expect = 1.3e-11, P = 1.3e-11

 Identities = 33/33 (100%), Positives = 33/33 (100%), Frame = +1

 

Query:    28 SSGEYWLKIRDIINKAMRKTNLKLIYDSLDQQV 60

             SSGEYWLKIRDIINKAMRKTNLKLIYDSLDQQV

Sbjc agca 22 SSGEYWLKIRDIINKAMRKTNLKLIYDSLDQQV 120

 

>aa-ng11186        Contig_4662     pred cdna   pORF 10810        9162 strand r start y stop y ph 0

MGISSIIIILFIIVLLKKLIKKEDRIHRINKNIPGPKSKLLVGNLFDLKGQVHEKLKEWY

EQYGSVYRIEFGSVSTVVLTEYATLKEAFVDNGEIFQSRFQRKSRTTCNKGLNLANSNGE

YFNHLKKTLSNEITNQKMKKNEKIIKIQVGLLSEFFNEISGGGGGGSGGSGISKEPINNI

IKMYSLNVMLSLLFNIHFPYNNNSYQDELMSTITRYFKSTGLPYPSDFIPILYPFLKNKP

KEYFEDYESVKKLITRITNEYQLKHMTEISNKSTIEEIENYQPTNILESLLKQYRLNKIP

YDGVIGCLMDLILAGSDTTGNTCLFSLVALVNNSNIQEKLFNEISNAFNDDDGDELNGAN

DISNSLLKLSYFSDRIKTPYLVAFIKEVKRYYPCAPLSVPHLLTEDCEIQGYKIAKGTQV

IQNIYSTHLSQSFCSNPLEFSPERFLDSTNEPKIITFGIGQRKCPGENIFEIEIYIFLVY

LIKKFKFSHPIDDNLQLNDRGQFGLSLQCPQLNIKVESR

 

>CYP513D1 Seq (54+55+78) complete seq 38% to seq 8

          Length = 505 one insertion

 

No 513 seq has an intron here so this is probably an insertion and not an intron

 

 Score = 2571 (905.0 bits), Expect = 3.9e-271, P = 3.9e-271

 Identities = 503/519 (96%), Positives = 504/519 (97%)

 

Query:     1 MGISSIIIILFIIVLLKKLIKKEDRIHRINKNIPGPKSKLLVGNLFDLKGQVHEKLKEWY 60

             MGISSIIIILFIIVLLKKLIKKEDRIHRINKNIPGPKSKLLVGNLFDLKGQVHEKLKEWY

Sbjct:     1 MGISSIIIILFIIVLLKKLIKKEDRIHRINKNIPGPKSKLLVGNLFDLKGQVHEKLKEWY 60

 

Query:    61 EQYGSVYRIEFGSVSTVVLTEYATLKEAFVDNGEIFQSRFQRKSRTTCNKGLNLANSNGE 120

             EQYGSVYRIEFGSVSTVVLTEYATLKEAFVDNGEIFQSRFQRKSRTTCNKGLNLANSNGE

Sbjct:    61 EQYGSVYRIEFGSVSTVVLTEYATLKEAFVDNGEIFQSRFQRKSRTTCNKGLNLANSNGE 120

 

Query:   121 YFNHLKKTLSNEITNQKMKKNEKIIKIQVGLLSEFFNEISGGGGGGSGGSGISKEPINNI 180

             YFNHLKKTLSNEITNQKMKKNEKIIKIQVGLLSEFFNEI               +PINNI

Sbjct:   121 YFNHLKKTLSNEITNQKMKKNEKIIKIQVGLLSEFFNEI---------------KPINNI 165

 

Query:   181 IKMYSLNVMLSLLFNIHFPYNNNSYQDELMSTITRYFKSTGLPYPSDFIPILYPFLKNKP 240

             IKMYSLNVMLSLLFNIHFPYNNNSYQDELMSTITRYFKSTGLPYPSDFIPILYPFLKNKP

Sbjct:   166 IKMYSLNVMLSLLFNIHFPYNNNSYQDELMSTITRYFKSTGLPYPSDFIPILYPFLKNKP 225

 

Query:   241 KEYFEDYESVKKLITRITNEYQLKHMTEISNKSTIEEIENYQPTNILESLLKQYRLNKIP 300

             KEYFEDYESVKKLITRITNEYQLKHMTEISNKSTIEEIENYQPTNILESLLKQYRLNKIP

Sbjct:   226 KEYFEDYESVKKLITRITNEYQLKHMTEISNKSTIEEIENYQPTNILESLLKQYRLNKIP 285

 

Query:   301 YDGVIGCLMDLILAGSDTTGNTCLFSLVALVNNSNIQEKLFNEISNAFNDDDGDELNGAN 360

             YDGVIGCLMDLILAGSDTTGNTCLFSLVALVNNSNIQEKLFNEISNAFNDDDGDELNGAN

Sbjct:   286 YDGVIGCLMDLILAGSDTTGNTCLFSLVALVNNSNIQEKLFNEISNAFNDDDGDELNGAN 345

 

Query:   361 DISNSLLKLSYFSDRIKTPYLVAFIKEVKRYYPCAPLSVPHLLTEDCEIQGYKIAKGTQV 420

             DISNSLLKLSYFSDRIKTPYLVAFIKEVKRYYPCAPLSVPHLLTEDCEIQGYKIAKGTQV

Sbjct:   346 DISNSLLKLSYFSDRIKTPYLVAFIKEVKRYYPCAPLSVPHLLTEDCEIQGYKIAKGTQV 405

 

Query:   421 IQNIYSTHLSQSFCSNPLEFSPERFLDSTNEPKIITFGIGQRKCPGENIFEIEIYIFLVY 480

             IQNIYSTHLSQSFCSNPLEFSPERFLDSTNEPKIITFGIGQRKCPGENIFEIEIYIFLVY

Sbjct:   406 IQNIYSTHLSQSFCSNPLEFSPERFLDSTNEPKIITFGIGQRKCPGENIFEIEIYIFLVY 465

 

Query:   481 LIKKFKFSHPIDDNLQLNDRGQFGLSLQCPQLNIKVESR 519

             LIKKFKFSHPIDDNLQLNDRGQFGLSLQCPQLNIKVESR

Sbjct:   466 LIKKFKFSHPIDDNLQLNDRGQFGLSLQCPQLNIKVESR 504

 

>aa-ng11003        Contig_4600     pred cdna   pORF 4748            1 strand r start y stop y ph 0 not a P450, zinc finger

MTNEEEQQTTNQQQQQPTPQLVINDLDSILSPIGGGGRVKYTCLDVSKRYLALGANTGSL

YFFERKSMSSIIPKNKNQDNQQQLQPQLQQPQSYLLPTVKIFTNENLSFTQILSLNDIRD

QISIIKINPCNDNLVAIATHKTIFIIEPNISIRREKEKVLVKMTDHPKDAEITTLIWSHC

GGYLFSGDDLGNLYCCSVTKARKSFFFSADLVYKCDSKLVQLDIIPSIPTITNTQLHQDI

QLTTNSNSNSNNSSLNNHHQQQQNIVHTIPITNSIPPVNSITISILASTLTKSIVVNFQI

LTATNVGKVLSIIQIGKKQRENTKQGACFHPFYKSSVYTSRPGKRLWLADPIDGRVLSTM

NFTPSSTTNEDGTTNPIPNIILKPKPLISSNSSNDANTYLTGKFPMIFSKLLPFGNYLIS

YDELSLLLIDVNEVEVLEWKLDTNQIHDLVVYHDSIYALHGQNHSISRIFTSIPLEILPT

PQELQEQSDQLQKQQQQQQQQQQQPEEIISEPQSQPQPIAIASLIEEKKDEQLQPKKSES

QENLTTLSNSDNIIGSEIDSTNTDNSTLSSSTNSLTIEGSASLPLVSESSNQQQQQQQQQ

QPSSLKSSTDQFVDDKLLDSTTTTTAVETSIQRVASKECIFSDIPLSSEPLMVSTSVKDP

TLAASTTRKVIKKKVIKTALPQDQQLNGTNNTATLSTSPPQSIVSSSTTATTATATSTSI

GAPTEVKRTVIVKKKIIKKAAALSTTTTTTPQISSPPISNPTPSPPLYVETVSSSTTTTT

TTPAEVSLSPKKEETTIAPTPTPIPTSATTVTANNIQQGIFKTLNSATSSFLEIKQFALN

KTNEIKSDITHKILGPSSGDDINSSNNTNVTLNNTTTATATATTATTITDNLNTTIVNEQ

QSQQHPIEVLEKLTKQTYESLQQYKENKTKTTSFEPMLQIWVQLFNSIEDSVIPSDMIKD

IVTSCFCFGINLKSTKECSTGGGSGSGGNGKSLTHTNKLCKCWNEQLAIEFIKEYYQFLN

TNRVYLNSNERKWDRCIEQLVEFESFTKQESKIISTIYQFIDSSDSSTCLTLLDSHCSND

YGLLFRFLNQLLELDPIESSKFFAKHFPIILPRNILNFTNSLIENDDIIINKSNDSGGGS

RSNKELGNLIKFEYLNQLMKLKPECKFNSLELLEQWFLSNLMNNKPNQNQLFEEREDMST

TSTRKFIPKLKSNLIEWKSKDTLIQIIDNVINNEYGNGNLLKSLESMSAENGFHIGLYKI

YEHYFKNVYSMKSILHNQEQLNHYSKKLLELTIFTDDKKGFSNCFENSTSIEIWSNTLEI

MKQLRIDQHENEDQSGESSITFEISELFILDLIGKAIGPLETIELVCKFSEMFEKSKIET

KLLSDWIKFGKWSLYDRPKLSYDILSLLDSHLWLKKPLTISPQFLTMIGPELDPTLPNNP

FKEFFNKLGTSSHDSGDNIDFKIPSFFEDNSRRHWGVETELSTGSCPICTLSLAENPESS

FLGSTPATIIVFPNCGHSYHHCCIEDQGCMLCFSKGLKKKKKKK

 

>aa-ng11335        Contig_4699     pred cdna   pORF 3095         3652 strand f start y stop y ph 0

MEWFTLFMINNKDCQEKAYNEIVSVMGEDCNKISYADRPKLPYLVACINECLRMRTEDPL

GIPRGAVEDIEINGYFMPKGAKVHHYLYAFGMNETVFENVNKFQPDRWLTNDQVHLKQML

NHLVPFSVGPRSCFGKNLSELEVFVVCSNILLNFELSSYNGKPVDDFEIFGIHPPEFPVK

LIKRK

 

>CYP523A1 seq 39, 73 complete 41% to seq 14 487 aa

          Length = 488 4 diffs N-term missing

 

 Score = 926 (326.0 bits), Expect = 8.1e-97, P = 8.1e-97

 Identities = 171/175 (97%), Positives = 172/175 (98%)

 

Query:    11 NKDCQEKAYNEIVSVMGEDCNKISYADRPKLPYLVACINECLRMRTEDPLGIPRGAVEDI 70

             N DCQEKAYNEIVSVMGEDCNKISYADRPKLPYLVACINECLRMRTEDPLGIPRGAVEDI

Sbjct:   313 NGDCQEKAYNEIVSVMGEDCNKISYADRPKLPYLVACINECLRMRTEDPLGIPRGAVEDI 372

 

Query:    71 EINGYFMPKGAKVHHYLYAFGMNETVFENVNKFQPDRWLTNDQVHLKQMLNHLVPFSVGP 130

             EINGYFMPKGAKVHHYLYAFGMNETVFENVNKFQPDRWLTNDQVHLKQMLNHLVPFSVGP

Sbjct:   373 EINGYFMPKGAKVHHYLYAFGMNETVFENVNKFQPDRWLTNDQVHLKQMLNHLVPFSVGP 432

 

Query:   131 RSCFGKNLSELEVFVVCSNILLNFELSSYNGKPVDDFEIFGIHPPEFPVKLIKRK 185

             RSCFGKNLSELEVFVVCSNILLNFELSSYNG PVDD +IFGIHPPEFPVKLIKRK

Sbjct:   433 RSCFGKNLSELEVFVVCSNILLNFELSSYNGNPVDDLKIFGIHPPEFPVKLIKRK 487

 

Correct the three diffs add my N-terminal

 

>CYP519H1P = old CYP523A1 seq 39, 73 complete 41% to seq 14 487 aa

MFIIYFIFLFLLIISLFIDF (0)

IKKNLKKSNNDPPGPISLPLLGNLHNLTKNPHRGLKNLSDKYGGIFRCYLGDHYSIVVSDP

VIINEIYIKKFEKVCTRPNNDTFKMFSSGFKDLAFSDNYNIWSKIRTIVNSNFTKTKLT

KTIYNYLEDQTNQLIENMGNYSKSGEPV (1)

FLSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN frameshift

FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLDHD (bad GT boundary)

agtPKDLMDMLIIDSTKGKDEDKEPIVHIGYDFLMVDQIRHLVLWN (1)

DCQEKAYNEIVSVMGEDCNKISYADRP

KLPYLVACINECLRMRTEDPLGIPRGAVEDIEINGYFMPKGAKVHHYLYAFGMNETVFEN

VNKFQPDRWLTNDQVHLKQMLNHLVPFSVGPRSCFGKNLSELEVFVVCSNILLNFELSSY

NGKPVDDFEIFGIHPPEFPVKLIKRK*

 

>IIIBP1E1290A 104132 letters

        Length = 104,133

 

  Plus Strand HSPs:

 

Score = 806 (288.8 bits), Expect = 5.6e-234, Sum P(5) = 5.6e-234

 Identities = 156/179 (87%), Positives = 164/179 (91%), Frame = +2

 

Query:    13 IISLFIDFIKKNLKKSNNDPPGPISLPLLGNLHNLTKNPHRGLKNLSDKYGGIFRCYLGD 72

             II L+   IKKNLKKSNNDPPGPISLPLLGNLHNLTKNPHRGLKNLSDKYGGIFRCYLGD

Sbjct: 66797 IIILYYI*IKKNLKKSNNDPPGPISLPLLGNLHNLTKNPHRGLKNLSDKYGGIFRCYLGD 66976

 

Query:    73 HYSIVVSDPVIINEIYIKKFEKVCTRPNNDTFKMFSSGFKDLAFSDNYNIWSKIRTIVNS 132

             HYSIVVSDPVIINEIYIKKFEKVCTRPNNDTFKMFSSGFKDLAFSDNYNIWSKIRTIVNS

Sbjct: 66977 HYSIVVSDPVIINEIYIKKFEKVCTRPNNDTFKMFSSGFKDLAFSDNYNIWSKIRTIVNS 67156

 

Query:   133 NFTKTKLTKTIYNYLEDQTNQLIENMGNYSKSGEPVF-LSTITIYMKISLNVICKLFFS 190

             NFTKTKLTKTIYNYLEDQTNQLIENMGNYSKSGEPV  +  I IY+ I + ++ +  F+

Sbjct: 67157 NFTKTKLTKTIYNYLEDQTNQLIENMGNYSKSGEPVCKIFNIYIYIYIYI*ILIQF*FN 67333

 

 Score = 266 (98.7 bits), Expect = 5.6e-234, Sum P(5) = 5.6e-234

 Identities = 53/53 (100%), Positives = 53/53 (100%), Frame = +1

 

Query:   170 LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN 222

             LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN

Sbjct: 67354 LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN 67512

 

 Score = 220 (82.5 bits), Expect = 5.6e-234, Sum P(5) = 5.6e-234

 Identities = 42/42 (100%), Positives = 42/42 (100%), Frame = +3

 

Query:   233 FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLD 274

             FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLD

Sbjct: 67512 FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLD 67637

 

 Score = 220 (82.5 bits), Expect = 5.6e-234, Sum P(5) = 5.6e-234

 Identities = 41/42 (97%), Positives = 42/42 (100%), Frame = +2

 

Query:   274 DLMDMLIIDSTKGKDEDKKPIVHIGYDFLMVDQIRHLVLWNG 315

             DLMDMLIIDSTKGKDEDK+PIVHIGYDFLMVDQIRHLVLWNG

Sbjct: 67700 DLMDMLIIDSTKGKDEDKEPIVHIGYDFLMVDQIRHLVLWNG 67825

 

Score = 926 (331.0 bits), Expect = 5.6e-234, Sum P(5) = 5.6e-234

 Identities = 171/175 (97%), Positives = 172/175 (98%), Frame = +1

 

Query:   314 NGDCQEKAYNEIVSVMGEDCNKISYADRPKLPYLVACINECLRMRTEDPLGIPRGAVEDI 373

             N DCQEKAYNEIVSVMGEDCNKISYADRPKLPYLVACINECLRMRTEDPLGIPRGAVEDI

Sbjct: 67846 NKDCQEKAYNEIVSVMGEDCNKISYADRPKLPYLVACINECLRMRTEDPLGIPRGAVEDI 68025

 

Query:   374 EINGYFMPKGAKVHHYLYAFGMNETVFENVNKFQPDRWLTNDQVHLKQMLNHLVPFSVGP 433

             EINGYFMPKGAKVHHYLYAFGMNETVFENVNKFQPDRWLTNDQVHLKQMLNHLVPFSVGP

Sbjct: 68026 EINGYFMPKGAKVHHYLYAFGMNETVFENVNKFQPDRWLTNDQVHLKQMLNHLVPFSVGP 68205

 

Query:   434 RSCFGKNLSELEVFVVCSNILLNFELSSYNGNPVDDLKIFGIHPPEFPVKLIKRK 488

             RSCFGKNLSELEVFVVCSNILLNFELSSYNG PVDD +IFGIHPPEFPVKLIKRK

Sbjct: 68206 RSCFGKNLSELEVFVVCSNILLNFELSSYNGKPVDDFEIFGIHPPEFPVKLIKRK 68370

 

 

JC3e64g10.s1

JC3e62b07.s1

JAX4a44c04.r1

JC3f107h06.s1

JC3e08f02.r1

 

These five seqs support a frameshift in the sequence

 

>JC3e08f02.r1 Clone JC3e08f02, reverse read, bases 48 through 650, from

            2002-07-16

        Length = 601

 

  Minus Strand HSPs:

 

 Score = 262 (97.3 bits), Expect = 1.3e-21, P = 1.3e-21

 Identities = 55/75 (73%), Positives = 61/75 (81%), Frame = -1

 

Query:     1 LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGFGDFVGIL 60

             LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAG    + +

Sbjct:   445 LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGNLVILWVF 266

 

Query:    61 SPLLYFTKKKYQKNS 75

               L +  ++K  K +

Sbjct:   265 YHLYFILQRKSIKKT 221

 

 Score = 220 (82.5 bits), Expect = 3.6e-17, P = 3.6e-17

 Identities = 42/42 (100%), Positives = 42/42 (100%), Frame = -3

 

Query:    53 FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLD 94

             FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLD

Sbjct:   287 FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLD 162

 

>JC3e62b07.r1 Clone JC3e62b07, reverse read, bases 335 through 598, from

            2002-09-06

        Length = 262

 

  Minus Strand HSPs:

 

 Score = 126 (49.4 bits), Expect = 7.6e-07, P = 7.6e-07

 Identities = 23/23 (100%), Positives = 23/23 (100%), Frame = -1

 

Query:    33 PVDDFEIFGIHPPEFPVKLIKRK 55

             PVDDFEIFGIHPPEFPVKLIKRK

Sbjct:   262 PVDDFEIFGIHPPEFPVKLIKRK 194

 

>JC3a191g11.r1 Clone JC3a191g11, reverse read, bases 128 through 664, from

            2001-07-06

        Length = 535

 

  Minus Strand HSPs:

 

 Score = 291 (107.5 bits), Expect = 1.2e-24, P = 1.2e-24

 Identities = 55/55 (100%), Positives = 55/55 (100%), Frame = -2

 

Query:     1 RSCFGKNLSELEVFVVCSNILLNFELSSYNGKPVDDFEIFGIHPPEFPVKLIKRK 55

             RSCFGKNLSELEVFVVCSNILLNFELSSYNGKPVDDFEIFGIHPPEFPVKLIKRK

Sbjct:   201 RSCFGKNLSELEVFVVCSNILLNFELSSYNGKPVDDFEIFGIHPPEFPVKLIKRK 37

 

 

>JC3e08f02.r1 Clone JC3e08f02, reverse read, bases 48 through 650, from

            2002-07-16

        Length = 601

 

  Minus Strand HSPs:

 

 Score = 323 (118.8 bits), Expect = 2.0e-42, Sum P(2) = 2.0e-42

 Identities = 62/71 (87%), Positives = 67/71 (94%), Frame = -1

 

Query:     1 LSTIPIYMKISLNVICTLFFSKKILQNESFDNGKMKKIPVPIQIVCKKLGPGNLVILWVF 60

             LSTI IYMKISLNVIC LFFSK+ILQNESFDNGKM++I VPIQIVCK+LG GNLVILWVF

Sbjct:   445 LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGNLVILWVF 266

 

Query:    61 YHLYFILQKKS 71

             YHLYFILQ+KS

Sbjct:   265 YHLYFILQRKS 233

 

 Score = 215 (80.7 bits), Expect = 4.4e-31, Sum P(2) = 4.4e-31

 Identities = 41/42 (97%), Positives = 42/42 (100%), Frame = -3

 

Query:    73 FGNFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLD 114

             FG+FVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLD

Sbjct:   287 FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLD 162

 

 Score = 152 (58.6 bits), Expect = 2.0e-42, Sum P(2) = 2.0e-42

 Identities = 30/33 (90%), Positives = 31/33 (93%), Frame = -2

 

Query:   114 DLMDMLIIDSTKGKDEDKKPIVHIGYDFLMVDQ 146

             DLMDMLIIDSTKGKDEDK+PIVHI  DFLMVDQ

Sbjct:    99 DLMDMLIIDSTKGKDEDKEPIVHIEIDFLMVDQ 1

 

>JC3e08f02.r1_frame-1 Clone JC3e08f02, reverse read, bases 48 through 650, from 2002-07-16, translation frame -1

FRRSN*SIN*KYGKLFKIW*TCM*NI*YIYIYIYIYINFNTILI*LLLLLLV LSTITIYM

KISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN LVILWVFYHLYFILQ

RKSIKKTVQHQQIS*VKLMMNI*KI*IMIKIILLT*IIKKKTIIKSKRFNGYVNYRFNKR

*G*R*RTYCSY*N*FFNGGS

>JC3e08f02.r1_frame-2 Clone JC3e08f02, reverse read, bases 48 through 650, from 2002-07-16, translation frame -2

LEDQTNQLIENMGNYSKSGEPVCKIFNIYIYIYIYI*ILIQF*FNYYYYY*FYPQLQYI*

KFH*M*YVNYFFQKKFYKMKVLIMGK*EELQYLFK*FVKSWEQVIW*FCGYFITFTLFYK

EKVSKKQFNINRFHR*N***TFKKFRS*LKLFY*PK**KKKQ**SPKDLMDMLIIDSTKG

KDEDKEPIVHIEIDFLMVDQ

>JC3e08f02.r1_frame-3 Clone JC3e08f02, reverse read, bases 48 through 650, from 2002-07-16, translation frame -3

*KIKLIN*LKIWEIIQNLVNLYVKYLIYIYIYIYIYKF*YNFNLIIIIIISFIHNYNIYE

NFIKCNM*IIFFKRNFTK*KF**WENEKNCSTYSNSL*RVGSR*FGDFVGILSPLLYFTK

KKYQKNSSTSTDFIGEINDEHLKNLDHD*NYFINLNNKKKNNNKVQKI*WIC*L*IQQKV

RMKIKNLLFILKLIF*WWI

 

agttLSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN

FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLD

 

>JC3e08f02.r1 Clone JC3e08f02, reverse read, bases 48 through 650, from 2002-07-16 translate frame +1 translate plus frames translate all frames

CTGATCCACCATTAAAAAATCAATTTCAATATGAACAATAGGTTCTTTATCTTCATCCTT

ACCTTTTGTTGAATCTATAATTAACATATCCATTAAATCTTTTGGACTTTATTATTGTTT

TTTTTTTTATTATTTAGGTTAATAAAATAATTTTAATCATGATCTAAATTTTTTAAATGT

TCATCATTAATTTCACCTATGAAATCTGTTGATGTTGAACTGTTTTTTTGATACTTTTTC

TTTGTAAAATAAAGTAAAGGTGATAAAATACCCACAAAATCACCAAATTACCTGCTCCCA

ACTCTTTACAAACTATTTGAATAGGTACTGCAATTCTTCTCATTTTCCCATTATCAAAAC

TTTCATTTTGTAAAATTTCTTTTGAAAAAAATAATTTACATATTACATTTAATGAAATTT

TCATATATATTGTAATTGTGGATAAAACTAATAATAATAATAATAATTAAATTAAAATTG

TATTAAAATTTATATATATATATATATATATATATATATATTAAATATTTTACATACAGG

TTCACCAGATTTTGAATAATTTCCCATATTTTCAATTAATTGATTAGTTTGATCTTCTAA

A

 

>JC3e62b07.s1 Clone JC3e62b07, standard read, bases 101 through 671, from

            2002-09-06

        Length = 569

 

  Plus Strand HSPs:

 

 Score = 248 (92.4 bits), Expect = 3.7e-72, Sum P(4) = 3.7e-72

 Identities = 50/53 (94%), Positives = 51/53 (96%), Frame = +1

 

Query:    30 LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN 82

             LSTIT+YMKISLNVICKLFFSKEILQNESFDNGKM  IAVPIQIVCKELGAGN

Sbjct:    28 LSTITLYMKISLNVICKLFFSKEILQNESFDNGKML*IAVPIQIVCKELGAGN 186

 

 Score = 232 (86.7 bits), Expect = 3.7e-72, Sum P(4) = 3.7e-72

 Identities = 43/44 (97%), Positives = 44/44 (100%), Frame = +2

 

Query:   125 PKDLMDMLIIDSTKGKDEDKKPIVHIGYDFLMVDQIRHLVLWNG 168

             PKDLMDMLIIDSTKGKDEDK+PIVHIGYDFLMVDQIRHLVLWNG

Sbjct:   368 PKDLMDMLIIDSTKGKDEDKEPIVHIGYDFLMVDQIRHLVLWNG 499

 

 Score = 222 (83.2 bits), Expect = 3.7e-72, Sum P(4) = 3.7e-72

 Identities = 43/47 (91%), Positives = 45/47 (95%), Frame = +3

 

Query:    78 LGAGNFGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLD 124

             +G+  FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLD

Sbjct:   171 VGSR*FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLD 311

 

 Score = 77 (32.2 bits), Expect = 3.7e-72, Sum P(4) = 3.7e-72

 Identities = 15/16 (93%), Positives = 15/16 (93%), Frame = +1

 

Query:   167 NGDCQEKAYNEIVSVM 182

             N DCQEKAYNEIVSVM

Sbjct:   520 NKDCQEKAYNEIVSVM 567

 

>JC3e62b07.s1 Clone JC3e62b07, standard read, bases 101 through 671, from 2002-09-06 translate frame +1 translate plus frames translate all frames

ATTTAATTATTATTATTATTATTAGTTTTATCCACAATTACTTTATATATGAAAATTTCA

TTAAATGTAATATGTAAATTATTTTTTTCAAAAGAAATTTTACAAAATGAAAGTTTTGAT

AATGGGAAAATGTTATGAATTGCAGTACCTATTCAAATAGTTTGTAAAGAGTTGGGAGCA

GGTAATTTGGTGATTTTGTGGGTATTTTATCACCTTTACTTTATTTTACAAAGAAAAAGT

ATCAAAAAAACAGTTCAACATCAACAGATTTCATAGGTGAAATTAATGATGAACATTTAA

AAAATTTAGATCATGATTAAAATTATTTTATTAACCTAAATAATAAAAAAAAAAACAATA

ATAAAGTCCAAAAGATTTAATGGATATGTTAATTATAGATTCAACAAAAGGTAAGGATGA

AGATAAAGAACCTATTGTTCATATTGGATATGATTTTTTAATGGTGGATCAGATTCGTCA

TCTGGTATTATGGAATGGTTCACACTTTTCATGATTAATAATAAAGATTGTCAAGAAAAA

GCATACAATGAAATTGTTTCAGTAATGGG

 

 

>JC3f107h06.s1 Clone JC3f107h06, standard read, bases 45 through 725, from

            2002-11-22

        Length = 679

 

  Plus Strand HSPs:

 

 Score = 266 (98.7 bits), Expect = 5.0e-33, Sum P(2) = 5.0e-33

 Identities = 53/53 (100%), Positives = 53/53 (100%), Frame = +3

 

Query:    30 LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN 82

             LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN

Sbjct:   456 LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN 614

 

 Score = 157 (60.3 bits), Expect = 1.3e-21, Sum P(2) = 1.3e-21

 Identities = 33/51 (64%), Positives = 40/51 (78%), Frame = +1

 

Query:     1 KTIYNYLEDQTNQLIENMGNYSKSGEPVF-LSTITIYMKISLNVICKLFFS 50

             KTIYNYLEDQTNQLIENMGNYSKSGEPV  +  I IY+ I + ++ +  F+

Sbjct:   283 KTIYNYLEDQTNQLIENMGNYSKSGEPVCKIFNIYIYIYIYI*ILIQF*FN 435

 

 Score = 119 (46.9 bits), Expect = 5.0e-33, Sum P(2) = 5.0e-33

 Identities = 23/27 (85%), Positives = 25/27 (92%), Frame = +2

 

Query:    78 LGAGNFGDFVGILSPLLYFTKKKYQKN 104

             +G+  FGDFVGILSPLLYFTKKKYQKN

Sbjct:   599 VGSR*FGDFVGILSPLLYFTKKKYQKN 679

 

>JC3f107h06.s1 Clone JC3f107h06, standard read, bases 45 through 725, from 2002-11-22 translate frame +1 translate plus frames translate all frames

CCTCACAAAAAACCACATAGAGGTTTTAAAAATCTTTCTGATAAATATGGTGGGATTTTT

AGATGTTATCTTGGTGATCATTACAGTATTGTAGTAAGTGATCCTGTGATAATTAATGAA

ATTTATATTAAAAAATTTGAAAAGGTTTGTACAAGACCCAATAATGATTCATTTAATTTG

TTTTCAAGTGGTTTTAAATTTTTTGCATTTTCAGATAATTATAATATTTGGAGTAAAATT

AGAACAATAGTTAATTCAAATTTTACAAAAACAAAATTAACAAAAACCATTTACAATTAT

TTAGAAGATCAAACTAATCAATTAATTGAAAATATGGGAAATTATTCAAAATCTGGTGAA

CCTGTATGTAAAATATTTAATATATATATATATATATATATATATATATAAATTTTAATA

CAATTTTAATTTAATTATTATTATTATTATTAGTTTTATCCACAATTACAATATATATGA

AAATTTCATTAAATGTAATATGTAAATTATTTTTTTCAAAAGAAATTTTACAAAATGAAA

GTTTTGATAATGGGAAAATGAGAAGAATTGCAGTACCTATTCAAATAGTTTGTAAAGAGT

TGGGAGCAGGTAATTTGGTGATTTTGTGGGTATTTTATCACCTTTACTTTATTTTACAAA

GAAAAAGTATCAAAAAAAC

 

>aa-ng9290        Contig_3853     pred cdna   pORF 172          1696 strand f start n stop y ph 0

FHKKRTNGMPPGPFPLPIIGNLHQLGKSPYKSLKSFSDKYGGLTTIFLGSVPT

Deleted some exon sequence here

INPHDYI

RRYSLNGVIDYSFSDSVEYESDTHHIVIRAAEIMEEILATGNPHDYLPFLKPFYTKKRNT

LAMAVGQVWDYCNDAITVHRKTLDHEKPRDLLNLILMEIEKSEEKQFYDDDSLSKCLTDL

IVAGHETVAITLGWMILFLSNHQDVQQKVYDELINVVGKGNLPALVHRKDTSYLNACIQE

TMRIRTAAPLALPRIASEDIKVGGYTIPKGTQVMMSVYGMASDERYWKDPHIFNPERWLS

SNHSTENGGGGGGVVGNSSQSEVFIPFGVGPRMCVGMGVAKDELYYCASQMFMNFKWSPV

NDKPIDDEGVARIALEYKEYQVVLERRK

 

>CYP516B1 seq 10, 42 complete seq 39% to 9, 43

          Length = 477

 

Score = 296 (104.2 bits), Expect = 7.7e-201, Sum P(2) = 7.7e-201

 Identities = 60/81 (74%), Positives = 66/81 (81%)

 

Query:     1 FHKKRTNGMPPGPFPLPIIGNLHQLGKSPYKSLKSFSDKYGGLTTIFLGSVPT--INPHD 58

             FHKKRTNGMPPGPFPLPIIGNLHQLGKSPYKSLKSFSDKYGGLTTIFLGSVPT  I+  +

Sbjct:    16 FHKKRTNGMPPGPFPLPIIGNLHQLGKSPYKSLKSFSDKYGGLTTIFLGSVPTVLISEPN 75

 

 Score = 1634 (575.2 bits), Expect = 7.7e-201, Sum P(2) = 7.7e-201

 Identities = 317/342 (92%), Positives = 319/342 (93%)

 

Query:    47 FLGSVPTINPHDYIRRYSLNGVIDYSFSDSVEYESDTHHIVIRAAEIMEEILATGNPHDY 106

             +  S   IN HDYIRRYSLNGVIDYSFSDSVEYESDTHHIVIRAAEIMEEILATGNPHDY

Sbjct:   151 YANSKQAIN-HDYIRRYSLNGVIDYSFSDSVEYESDTHHIVIRAAEIMEEILATGNPHDY 209

 

Query:   107 LPFLKPFYTKKRNTLAMAVGQVWDYCNDAITVHRKTLDHEKPRDLLNLILMEIEKSEEKQ 166

             LPFLKPFYTKKRNTLAMAVGQVWDYCNDAITVHRKTLDHEKPRDLL+LILMEIEKSEEKQ

Sbjct:   210 LPFLKPFYTKKRNTLAMAVGQVWDYCNDAITVHRKTLDHEKPRDLLDLILMEIEKSEEKQ 269

 

Two seqs have D here

 

Query:   167 FYDDDSLSKCLTDLIVAGHETVAITLGWMILFLSNHQDVQQKVYDELINVVGKGNLPALV 226

             FYDDDSLSKCLTDLIVAGHETVAITLGWMILFLSNHQDVQQKVYDELINVVGKGNLPALV

Sbjct:   270 FYDDDSLSKCLTDLIVAGHETVAITLGWMILFLSNHQDVQQKVYDELINVVGKGNLPALV 329

 

Query:   227 HRKDTSYLNACIQETMRIRTAAPLALPRIASEDIKVGGYTIPKGTQVMMSVYGMASDERY 286

             HRKDTSYLNACIQETMRIRTAAPLALPRIASEDIKVGGYTIPKGTQVMMSVYGMASDERY

Sbjct:   330 HRKDTSYLNACIQETMRIRTAAPLALPRIASEDIKVGGYTIPKGTQVMMSVYGMASDERY 389

 

Query:   287 WKDPHIFNPERWLSSNHSTENGGGGGGVVGNSSQSEVFIPFGVGPRMCVGMGVAKDELYY 346

             WKDPHIFNPERWLSSNHS  N               VFIPFGVGPRMCVGMGVAKDELYY

Sbjct:   390 WKDPHIFNPERWLSSNHSETN---------------VFIPFGVGPRMCVGMGVAKDELYY 434

 

Query:   347 CASQMFMNFKWSPVNDKPIDDEGVARIALEYKEYQVVLERRK 388

             CASQMFMNFKWSPVNDKPIDDEGVARIALEYKEYQVVLERRK

Sbjct:   435 CASQMFMNFKWSPVNDKPIDDEGVARIALEYKEYQVVLERRK 476

 

 

>CFF181 (CFF181Q) /pub/dna_csm/LIBRARY/CF/CFF1-D/CFF181Q.Seq.d/

        Length = 1132

 

  Plus Strand HSPs:

 

Supports insertion

 

Query:     2 KDPHIFNPERWLSSNHSTENGGGGGGVVGNSSQSEVFIPFGVGPRMCVGMGVAKDELYY 60

             KDPHIFNPERW SSNHSTENGGGGGGVVGNSSQ EVFIPFGVGPRMCVGMGVAKD L Y

Sbjct:   807 KDPHIFNPERWXSSNHSTENGGGGGGVVGNSSQXEVFIPFGVGPRMCVGMGVAKDXLXY 983

 

>SSA260 (SSA260Q) /pub/dna_csm/LIBRARY/SS/SSA2-C/SSA260Q.Seq.d/

        Length = 507

 

  Plus Strand HSPs:

 

 Score = 337 (123.7 bits), Expect = 1.7e-29, P = 1.7e-29

 Identities = 60/60 (100%), Positives = 60/60 (100%), Frame = +1

 

Query:     1 WKDPHIFNPERWLSSNHSTENGGGGGGVVGNSSQSEVFIPFGVGPRMCVGMGVAKDELYY 60

             WKDPHIFNPERWLSSNHSTENGGGGGGVVGNSSQSEVFIPFGVGPRMCVGMGVAKDELYY

Sbjct:    79 WKDPHIFNPERWLSSNHSTENGGGGGGVVGNSSQSEVFIPFGVGPRMCVGMGVAKDELYY 258

 

&&&&&&&&&&&&&&&&&&&&&&

 

>aa-ng10943        Contig_4571     pred cdna   pORF 327          2462 strand f start y stop y ph 0

MFLTSILYTIIIILIFYKGLEYLIEKRSFPLVHPIKGVMNGTKPYFIFGDLPFQRLLNLK

PKLKELGSIYFRWFFWYPIVEIKDINAIQYVYNEKSNNYSLYWNLNKSSNFILTGSEIKR

FFRIYYWAFNCKDSLQRIMPVIKSQVFDFIHSHKFQNSTLSTNDDVTN

 

IPSKFSRKYIKN

KNKRALYQVLAMKAYYGVVKQSSDDHIEESMINIIAETSYNDKEGLSLEEIKMPSYLLNA

SSIKGPMIMVENLMFQLIEKSEIESKIRKEIKLVFEKNGKDVNSFDFDDIMEMKYLEATL

DEINRLYPPFPKLMPRQTKESDRILGYHIPKGTMISCPVADILRDPSNFQDPLTFKPERQ

LIFSNPKFASPSITSIQEINGLS

SSSSNSFALHHRSLPSINNNNNNNNNNNNNNNNNNNN

NNNNNSNNNSINGNNKNNNRNCIQSFNNSALKKSFLSDSSSIIDNIVGTNRVDKLKLDSL

NENSNINNNNNKDLLNVPNIIEINKNNPISNNKYNSYNNLTIEE

RNQRIIKNLPWGIGSKKCLGKELAKLIVKTIIVILYSQYTFDKHLDENDEELCDTNNQPKIQITFNPEIKPPLLLK

SRKLFSISQPKQ

 

Deleted some seq between DDVTN and IPSK

Added seq in the region between PERF and heme

Use my seq

 

>CYP556A1 =  Seq 92

         Length = 517

 

Score = 906 (318.9 bits), Expect = 3.6e-140, Sum P(2) = 3.6e-140

 Identities = 172/184 (93%), Positives = 176/184 (95%)

 

Query:     1 MFLTSILYTIIIILIFYKGLEYLIEKRSFPLVHPIKGVMNGTKPYFIFGDLPFQRLLNLK 60

             MFLTSILYTIIIILIFYKGLEYLIEKRSFPLVHPIKGVMNGTKPYFIFGDLPFQRLLNLK

Sbjct:     1 MFLTSILYTIIIILIFYKGLEYLIEKRSFPLVHPIKGVMNGTKPYFIFGDLPFQRLLNLK 60

 

Query:    61 PKLKELGSIYFRWFFWYPIVEIKDINAIQYVYNEKSNNYSLYWNLNKSSNFILTGSEIKR 120

             PKLKELGSIYFRWFFWYPIVEIKDINAIQYVYNEKSNNYSLYWNLNKSSNFILTGSEIKR

Sbjct:    61 PKLKELGSIYFRWFFWYPIVEIKDINAIQYVYNEKSNNYSLYWNLNKSSNFILTGSEIKR 120

 

Query:   121 FFRIYYWAFNCKDSLQRIMPVIKSQVFDFIHSHKFQNSTLSTNDDVTNIPSKF-SRKYIK 179

             FFRIYYWAFNCKDSLQRIMPVIKSQVFDFIHSHKFQNSTLSTNDDVTN   K  SR Y+

Sbjct:   121 FFRIYYWAFNCKDSLQRIMPVIKSQVFDFIHSHKFQNSTLSTNDDVTNFMLKLLSRVYLG 180

 

Query:   180 NKNK 183

             + ++

Sbjct:   181 SSDE 184

 

 Score = 1097 (386.2 bits), Expect = 2.4e-160, Sum P(2) = 2.4e-160

 Identities = 239/314 (76%), Positives = 258/314 (82%)

 

Query:    70 YFRWFFWYPIVEIKD-INAIQYVYNEKSNNY---SLYWNLNKSSNFILTGSEIKRFFRIY 125

             +FR ++W      KD +  I  V   +  ++     + N   S+N  +T   +K   R+Y

Sbjct:   121 FFRIYYW--AFNCKDSLQRIMPVIKSQVFDFIHSHKFQNSTLSTNDDVTNFMLKLLSRVY 178

 

Query:   126 YW----AFNC-KDSLQRIMPVIKSQVFDFIHSHKFQNSTLSTNDDVTNIPSKFSRKYIKN 180

                   A++C K + ++     KS + DF H + F   TL        IPSKF RKYIKN

Sbjct:   179 LGSSDEAYHCFKANYKKFN---KSYL-DFFH-YLFP--TL------LKIPSKFLRKYIKN 225

 

Query:   181 KNKRALYQVLAMKAYYGVVKQSSDDHIEESMINIIAETSYNDKEGLSLEEIKMPSYLLNA 240

             KNKRALYQVLAMKAYYGVVKQSSDDHIEESMINIIAETSYNDKEGLSLEEIKMPSYLLNA

Sbjct:   226 KNKRALYQVLAMKAYYGVVKQSSDDHIEESMINIIAETSYNDKEGLSLEEIKMPSYLLNA 285

 

Query:   241 SSIKGPMIMVENLMFQLIEKSEIESKIRKEIKLVFEKNGKDVNSFDFDDIMEMKYLEATL 300

             SSIKGPMIMVENLMFQLIEKSEIESKIRKEIKLVFEKNGKDVNSFDFDDIMEMKYLEATL

Sbjct:   286 SSIKGPMIMVENLMFQLIEKSEIESKIRKEIKLVFEKNGKDVNSFDFDDIMEMKYLEATL 345

 

Query:   301 DEINRLYPPFPKLMPRQTKESDRILGYHIPKGTMISCPVADILRDPSNFQDPLTFKPERQ 360

             DEINRLYPPFPKLMPRQTKESDRILGYHIPKGTMISCPVADILRDPSNFQDPLTFKPERQ

Sbjct:   346 DEINRLYPPFPKLMPRQTKESDRILGYHIPKGTMISCPVADILRDPSNFQDPLTFKPERQ 405

 

Query:   361 LIFSNPKFASPSITSIQEINGLS 383

             LIFSNPKFASPSITSIQEINGLS

Sbjct:   406 LIFSNPKFASPSITSIQEINGLS 428

 

Score = 450 (158.4 bits), Expect = 2.4e-160, Sum P(2) = 2.4e-160

 Identities = 93/108 (86%), Positives = 97/108 (89%)

 

Query:   510 SNNKYNSYNNLTIEE------RNQRIIKNLPWGIGSKKCLGKELAKLIVKTIIVILYSQY 563

             SN K+ S +  +I+E      RNQ IIKNLPWGIGSKKCLGKELAKLIVKTIIVILYSQY

Sbjct:   409 SNPKFASPSITSIQEINGLSKRNQ-IIKNLPWGIGSKKCLGKELAKLIVKTIIVILYSQY 467

 

Query:   564 TFDKHLDENDEELCDTNNQPKIQITFNPEIKPPLLLKSRKLFSISQPKQ 612

             TFDKHLDENDEELCDTNNQPKIQITFNPEIKPPLLLKSRKLFSISQPKQ

Sbjct:   468 TFDKHLDENDEELCDTNNQPKIQITFNPEIKPPLLLKSRKLFSISQPKQ 516

 

>CYP508B2P pUC18 152517 letters translate frame +1 translate plus frames translate all frames

TTTTATTTTTTTAAACTAACTATTATTAAAAGAAATATTATCATATAAATTAACTAAAAT 134220

TTTTTTATAGAGAAAAATGATTGAAACTTTTGTTTTTATATTTATTTCAATATTAATGTT

                M  I  E  T

TCAATTTTTTATTAAATGTGTAATTATTATTGAAAATTGTTTATAAAAAAAAAAAAAAAA

AAAAAGAATTAATCAAATTTTTAATATTTTTTTTTAAATAATAAGTATATATTGTATAGA

                                             Y  I  L

CCAAAATTTAAAAATGAATTAGATGGTCCAAGTTTTCCTATACCATTCTTTGGAAATAAT

TTACAAATTGGTAAAAATAAAATTTTATATTTTCACAAATTAGAAAATTATTTTAAAAAA

GGAATTTTTAGAGTTTGGATAGGTGAAGTATTTACAGTAATAATTAGTGATCCTTTAATT

GAAAAAGAAATGAAAAAAACTATAAAAGAAAAACCTAAATATATTAAAAGAGATATTTTT

                                    K  Y

ATTTTAAATTTAAAATCAAATTTAATAAATTTGGTTTTTAATGAAAGTACAATTAATTTA

ATTCAGTCAATGAATTGTAGTATAAAAAACAATAAAGAGGTAATTTTTATAATATAATAA

TATTATTATTATTAGTTTACGGAAATTATTATTAACAACTATTTATTACAGTTTGAACCA

                                                   F  E  P

AAATTATTATGTTCAATTTTTTCATTTTCCATTATCTTTAAATTATTATTTAATATTGAT

TTAAAGAATGAAAATTATAAAGAAATAAAAAAAATAATGAAATTGGTGAATGAAATTAAT

L  K  N  E  N                          K  L  V

AAGAATAAAGAAAAGTCAAATTCTAGTGATTTAAATTCAATAACTAATTTTATTGAAAAT

CAATTTTATTACCATATATCAAATATTGATCAAAGGTTAGTTGATTTTTTTTTTTTTTAT

      Y  Y  H

AATTTAATAATAAATAATATTATTTACAATTTATACTTTAATTTAAATTAGTTTAAAAGA 135120

    *  *  *  I  I  L  F             F  N  L  NI  S  L  K  D

TTTTATGGATATTATTATTTATAAAGAAAATAGTTTTGAAACAAAAGAAAAAATTAAAAA

 F  M  D                       

ACAAGTAGTTTATAAATGTACAGACTATATTTTAAATAAATCCGAAAGTATATCAAAAGT

TATTGAAAATTTTTTTGTATTAATTGCACAAGATCAAGACTATCAATATTTAGCTTTTAA

TGAATTGAAATCCGTTATTAATACAAAACTTTTATATAATGGTGTTGGGGAAAATATTAT

AAAACTATCTGATAAATCATATACACCCATTACAAATTCTATTTGTAAAGAAGTTTTAAG

GTTAAATCCAGTGGATCAATTATCGTCACCAATTACTTGTAAAAAAGATAGTTTGGTAAA

TGGTTATTTTATTCCAAAAGATTCCCAAATAATTATAAATCATAAGTCAATGAATTTAAA

TGAAAAATATTTTAATGATCCATTTAATTTTAATCCAAAAAGATTTTTAAATTATAATAA

CCAATCAATTAATTTGTAAGTAATAAAAAAAAAGAAAAAAAAAATTATCAACCAAATACT

AACTTTTTTTTTTTCTTTTTTAAAAGCAACTCTTGTAATTTTTCAAGTGATGAGATTTAT

ATTGCAATTTCAAATATTTTATTAAATTTTAAAATAACAGCTGTTAAAAATCATTATAAT

                                       A  V  K  N

TTTAATGTATTTGATGATAATTTACAATCATCAGTTCTGATTGAAAAAAGATAAAAATAA

                                    L  I  E  K  R  *

ATTTCGTTTTTTCATTTTTTATTTTTTTTTTTTTTATTTTTATTTTTCAAAAAAACGATA