828 sequences from Dictyostelium discoideum 
in 55 contigs (by David Nelson).  These sequences completely searched for 
new hits and revised by Jinchuan Xing (March 19, 2001) and again by D. Nelson 
August 15, 2001.  Many sequences have been joined so the number of contigs has 
dropped to 55.  There are 45 full length sequences and one more that only lacks 
the N-terminal exon.
We have made best estimates of some N-terminal exons though this is difficult 
to do without EST data since these exons are very short and not well conserved.
We estimate there are 15 families represented.
Last modified August 16, 2001 and revised again on Feb. 25, 2002, May 16, 2003

45 complete (42 genes and three pseudogenes)
(1+2 = 508A1), (3+61 = 517A1), (4+23+24+83 = 508C1), (5+56 = 513C1), (6 = 515B1), (7+50 = 513A3), 
(8+53 = 513A1), (9+43 = 516A1), (10+42 = 516B1), (11 = 514A1), (12 = 519A1), (13 = 521A1),
(14+34 = 519D1), (15+77 = 508B1), (17+84 = 519E1), (18 = 522A1), (19 = CYP51), (20 = 518A1), 
(21+67+72+82+87 = 508A3), (22+45+69+86+88 = 508A2), (26+28 = 518B1), (31+32+33 = 519C1), (35+44+75 = 519B1),
(38 = 513A2P), (39+73 = 519H1P), (40 = 554A1), (41P = 513G1P), (47+68+80 = 519F1), (49 = 513B1), 
(51+52+90 = 513E1), (54+55+78 = 513D1), (57 = 513F1), (58 = 519G1), (59+81= 508D1), (62 = 525A1), 
(65 = 514A2), (66+70 = 508A4), (71+85 = 508E1), (74 = 517A2), (76 = 555A1), (79b = 515A1), (91 = 524A1), 
(92 = unnamed), 514A4 (new seq) 517A4 (new seq)

6 C-terminal pseudogene fragments (does not include 45 complete seqs above)
(25+27+29+30 = 518A2P), (37a = 513E2P), (37b = 513E3P), (60 = 508B2P), (64 = 516A2P), (79 = 515A2P)

complete except for N-term exon
(37a = 513E2P)

4 contigs that do not include a C-terminal fragment
(36 = 517A3P), (46 = 519C2P), (48 = 516B2P), (89 = 514A3P)

for intron locations mapped to an alignment see 
Intron alignment map

51     = seq 19       (1 intron)
508A1  = seq 1+2      (4 introns)
508A2  = seq 22+45+69+86+88 (3 introns)
508A3  = seq 21+67+72+82+87 (4 introns)
508A4  = seq 66+70    (3 introns)
508B1  = seq 15+77    (2 introns)
508B2P = seq 60       (2 introns and a corrupted heme region)
508C1  = seq 4+23+24+83 (4 introns)
508D1  = seq 59+81    (4 introns)
508E1  = seq 71+85    (2 introns)
513A1  = seq 8+53     (1 intron)
513A2P = seq 38       (0 introns, processed pseudogene)
513A3  = seq 7+50     (1 intron)
513B1  = seq 49       (1 intron)
513C1  = seq 5+56     (1 intron)
513D1  = seq 54+55+78 (1 intron)
513E1  = seq 51+52+90 (1 intron)
513E2P = seq 37a      (0 introns) complete except for missing N-term exon 55% to 513E1 
513E3P = seq 37b      (insertion, deletion, frameshifts) upstream of 513E2P only 429 bp between them  
513F1  = seq 57       (1 intron)
513G1P = seq 41       (0 introns, processed pseudogene)  37% to CYP513A1 
514A1  = seq 11       (3 introns)
514A2  = seq 65       (3 introns)
514A3P = seq 89       (1 intron with bad boundary)
514A4  = new seq      (3 introns) only 7aa diffs to 514A1
515A1  = seq 79b      (6 introns)
515A2P = seq 79a      (3 introns, partial sequence, exons 4-7)
515B1  = seq 6        (2 introns)
516A1  = seq 9+43     (2 introns)
516A2P = seq 64       (0 introns, partial sequence)
516B1  = seq 10+42    (2 introns)
516B2P = seq 48       (1 intron, bad boundary, partial sequence)
517A1  = seq 3+61     (2 introns)
517A2  = seq 74       (2 introns)
517A3P = seq 36       (2 introns)
517A4  = new sequence (2 introns) only 13 aa difs to 517A1 
518A1  = seq 20       (1 intron)
518A2P = seq 25+27+29+30 (0 introns, partial sequence)
518B1  = seq 26+28    (6 introns)
519A1  = seq 12       (2 introns)
519B1  = seq 35+44+75 (1 intron)
519C1  = seq 31+32+33 (2 introns)
519C2P = seq 46       (0 introns, partial sequence)
519D1  = seq 14+34    (2 introns)
519E1  = seq 17+84    (3 introns)
519F1  = seq 47+68+80 (2 introns)
519G1  = seq 58       (5 introns)
519H1P = seq 39+73    (3 introns and 50 nuc. insertion)
520A1    renamed 519F1
520B1    renamed 519G1
521A1  = seq 13       (1 intron)
522A1  = seq 18       (3 introns)
523A1    renamed 519H1P
524A1  = seq 91       (1 intron)
525A1  = seq 62       (
554A1  = seq 40       (1 intron)
555A1  = seq 76       (2 introns)
556A1  = seq 92       (3 introns)

CYP51 Seq 19 complete 26% to seq 8 40% to rice CYP51
MIGTIAVLVIAILVIFAFKKSPSNIPPIVETIPFIGCFYQFAKNPLQLVRNSYDRLGEIF
TLHLMGFKMTFVLGPEAQALFFRGTDEELSPKEAYRFVTPVFGKGVVYDSETEIMYEQLR
FVKNGLVLSQLKKAVGIIQEETEKYFETKWGDSGEIDLLYEMNKLTILTASRCLMGKSIN
KSLGQSGQLADLYHELEEGLNPISFFFPNLPLPSFKKRDAARAKVAAIFHSIIQERRRSTD
DSVDDVLYTLMNSKYKDGSVLEDEQIVGLMIGLLFAGQHTSSITLTYTIFYLLNNLEYFD
ETQKDINDIVQKENQGEINFDGLKRMNRLETVIREVLRLHPPLIFLMRKVMTPMEYKGKT
IPAGHILAVSPQVGMRLPTVYKNPDSFEPKRFDVEDKTPFSFIAFGGGKHGCPGENFGIL
QIKTIWTVLSTKYNLEVGPVPPTDFTSLVAGPKGPCMVKYSKKQK*
AU033519
AU060691
Contig7113
c-JAX4a244b11.s1 
JAX4b08b04.r1
JC1a178c02.r2
JC2b25d02.r1
JC2b119a09.s1
JC2e67h09.s1
sdic2Bf6.p1t
sdic2BF6.q1t
sdic6A53a12.q1t
sdic6A53b7.p1t
SLB124

>SLB124 (SLB124Q) /pub/dna_csm/LIBRARY/SL/SLB1-A/SLB124Q.Seq.d/
        Length = 1550

  Plus Strand HSPs:

 Score = 222 (83.2 bits), Expect = 5.6e-17, P = 5.6e-17
 Identities = 44/44 (100%), Positives = 44/44 (100%), Frame = +1

Query:     1 MIGTIAVLVIAILVIFAFKKSPSNIPPIVETIPFIGCFYQFAKN 44
             MIGTIAVLVIAILVIFAFKKSPSNIPPIVETIPFIGCFYQFAKN
Sbjct:   115 MIGTIAVLVIAILVIFAFKKSPSNIPPIVETIPFIGCFYQFAKN 246

CYP508A1 complete (old seqs. 1 and 2) 28% to seq 8 
on single contig with seq 22 and 21 CHR2.0.28372 2447-4459
MALFEIIISLFVVYIIHNA (0)
ISKYKKIHVNELCGPTPIPILGNLHQFGELPHRVLTKMTKK
YGHILRVYMADMYTVVVSDPLLIREMYVDNSDIFTDRVKKP (0)
SVEHGTFYHGTVTSYGEHW
KNNREIVGKAMRKTNLKHIYELLDKQVDVLIRSMKSIETSGKTFDTRYYITKFTMSA
MFKFLFNHDIPEDEDINKGDTQKLMGPMSEVFQNAGRGSLFDVINITQPLYLLYLEMFDQSFK
DIMKYHREKYNEHLKTFDPDVERDLLDILIKEYGTDNDDKILSILATINDFFLA (1)
GVDTSSTALESMVLMLTNYPEIQEKAFDEIKTVVNGRSKVNLSDRQSTPYLVAVIKETLRYKPMSP
FGLPRSSSKDCMIGGHFIPKNAQILINYQALGMNEEYYENPEQFDPSRFLKVESNVAFLP
FSIGIRSCV (2)
GQSFAQDELYICISNILLNFKLKSIDGKKIDETEEYGLTLKTKNRFNVTLEKRII*
AU033852
AU034252
AU034703
AU037735
AU037979
AU053778
AU060139
AU060437
AU060796
AU071927
AU074155
C24684
C25600
C89925
C90052
C91122
C92049
C92378
C94043
C94448
Contig13138 
IIAFP1D59967
IIAFP1D72549
JAX4a63h05.r1
JAX4a63h05.s1
JAX4a134e03.r1
JC2a57e02.s1
JC2b54f08.r1
JC2b54f08.s1 = 80% to 508A1 not same seq no exact match to any other seq.
TWGTLYYITKFTISAIFKFLFNHDIPQDEDINKGDALKLMGPMS*VFQNTGIGTLFDVIN
ITLPLYLLYLKMFDQSFKDIIKYHTEKYNEHLKTFYPHVQTYLLHILIK*YGTDNYNKIL 408
SILAAINDFFL 441
JC2b181d11.s1
JC2c128a08.r1   
JC2c128a08.s1 
JC2d13d01.r1 
c-JC2d95b07.s1
JC2e48a02.s1
JC2e111b07.r1
sdic2Af2.q1t
sdic2Ce4.p1t
sdic2Ce5.q1t (92%)
SLA411 N-term
SLB521
SLC329
SLE212
SLJ668
SLJ729
SSA192
SSA247
SSC117
SSC755
SSE231
SSE715
SSG143
SSG323
SSG630
SSH119
SSJ655
SSK816
AFK388
AFI677
AFO867
AFL432
AFF717

CYP508A2 Seq (22+45+69+86+88) Complete sequence from c-JC2d95b07.s1 
62% to CYP508A1
on single contig with seq 508A1 and 21 CHR2.0.28372 5025-6821
5025 MIFGIIGYLFLIYILHNA 5078 (0) 
5195 YSKYKRLNENQLPGPFPIPILGNIYQLTNL
     PHFDLTKMSEKYGKIFRIYLADLYTVIVCDPIIARELFVDKFDNFIDRPKIP 5440 (0)
5543 SVKHGTFYHGTVASMGDNW
KNNKEIVGKAMRKTNLKHIYQLLDDQVDVLIESMRTIESSGETFDPRYYLTKFTMSAMFKYIFNEDISKDEDVH
NGQLAQLMKPMQKVFKDFGTGSLFDVLEITRPLYFLYLEWFTSHYYQVINFGKMKIYKHLETYKPDVQRDLMDLL
IKEYGTETDDQILSISATVSDFFLAGVDTSATSLELIVMMLINYPEYQEK
AYNEIKSALSSNGGGGGGGLTQRNKVLLSD
RQSTPFVVSLFKETLRYKPISPFGLPRSTTSDIILNNGQ
FIPKNAQILINYHALSRNEEYFENPNQFDPTRFLNSDSNPAFMPFSIGPRNCV 6562 (2)
6663 GSNFAQDEIYIALSNMILNFKFKSIDGKPVDETQTYGLTLKPNPFKVILEKRK* 6824
c-JC2d95b07.s1 22366 letters 
CHR2.0.28372 5025-6821
Contig13205
IIAFP1D1875
IIAFP1D80337
IIAFP2D50103 
IIAFP2D52521
IIAGP1D0079
JAX4b61d06.s1
JAX4c01f11.s1
JC1a68e02.s2
JC2a57e02.r1
JC2a66h06.s1 this may be a typo check for .r1
JC2a66h07.s1 N-term
JC2a81b01.r1
JC2a162c10.r1
JC2a162d02.r1
JC2a193f03.s1
JC2b21g02.r1
JC2b76b01.r1 
JC2b76b01.s1 
JC2b322g02.r1
JC2b363f02.r1
JC2c04g01.r1 formerly seq 86
JC2c11e11.s1
JC2c11e11.r1
JC2c123h07.r1
JC2c123h07.s1
JC2c157c02.r1
JC2c157c02.s1
c-JC2d95b07.s1
JC2e48a02.r1 N-term
SLD887
VFH640
VFL109
VFL314
VFG480

CYP508A3 seq 21+67+72+82+87 complete 57% to 508A1 408aa
on single contig with seq 22 and 508A1 CHR2.0.28372 --1065
the N-terminal region is badly frameshifted but reconstructed 
based on seq 22, 45, 69 still missing 26 aa but reconstruction 
matches contig 5911 
MEFLKLILFLIIFYIIHNT (0) 
YIKFKKINKNELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVVLS
DPILIREMFVNNGDYFLDRPKIPSIRHATHYHGIA (1)
TSSGEYWLKIRDIINKAMRKTNLKLIYDSLDQQVDNLIESMNKIESDG
QVFEPRIYFKKYTMAAMYKFIFNEEINFNNEISELIGPIEQVFKDLGSGSLFDVLLISRPLYYQWIEHTDKNYPK
ILNFLKKKYHQHLKTYNPEIQRDLLDLLIKEYYSGSDDDILTIIATINDLFLA (1)
GTDTSSASLEYMVMMLVNYPEIQEK 
VYDEIKLTVNGRNKVLLSDRQFTPYTVSFIKETLRYKPPSSVGVPRTTSQDIIIGDKFIP
KDAQIFINYYGLSRNQDYFENPEQFEPSRFMNPDTNIAFLPFSIGTRNCV (2)
GQNFALDEMFLAFSNIILNFKFSSIDGKQIDETELYGVTLRCKNKFNVSIKKRI* 
chr2 28372 929-1605
Contig5911 chr 2
Contig13310 chr 2
IIAFP1D65605
JAX4a231g12.r1
JC1a148h05.s1
JC1b30e03.r1 
JC1b215d01.r1
JC2a31d03.s2
JC2a114d02.r1
JC2b191g09.r1
JC2b306d10.s1 
JC2b306d10.r1 
JC2c166b04.s1
JC2d13d01.s1 
JC2d95b07.s1 
c-JC2d95b07.s1
JC2e11e01.r1
JC2e73f10.s1
JC3a25h05.r1
IIAGP1D19240

>_4
                              NFFFFIFLLCPISPLFLYKN*LLLLFYLYYN*LLLLLLLLLLLLLLLLFFFFFFFD*YLT
                              IVKGYLLGFHLLIIY*DFFIGWYRILIKKKKKKMIKKKYFIFIKNMLKK*KRI*YKKTIC
                              LCVF*LYN*FFIINWFF**IKILIRTTD*LNKIEKKKKKKKKKKKKKKKKKKKRWHLIF*
                              D*PQDFFLKIN*FFYF*FL*FFFFYFLFFIYYIILKKKKKKIYCNKTTSKT*WSF*N*YC
                              F**FFI*SIILFVFFL*F**KKKKKKDNNNNYTFF*TQIW*NLNN*FFFFFFFFFFFFLE
                              IKSIFISILNLKK*IKMN*KDQFQFQF*EIYINLQVVYHTEI*LKYLKNMVEFIDFGLLI
                              >_5
                              FFFFYFSPLSHFPSFFI*KLIIIIILFIL*LIIIIIIIIIIIIIIIIIFFFFFF*LIFDN
                              CQGIFIGFSFVNYILRFFYWVV*NFN*KKKKKNDKKKIFYFYKKHVKEVKKNLIQKNNLF
                              MCILII*LIFYY*LVFLIDKDFNSYHRLIK*N*KKKKKKKKKKKKKKKKKKKKVAFNFLG
                              LTPRFFFENKLIFLFLIFVIFFFLFFIFYLLYNFKKKKKKNLLQ*NYK*NIMEFLKLILF
                              LIIFYIIHNTVCIFSLILIKKKKKKG****LYFFLNTDMVKFK*LIFFFFFFFFFFFFGN
                              *INFYQYIKFKKINKNELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADX
                              >_6
                              IFFFLFFSSVPFPLFFYIKINYYYYFIYIIINYYYYYYYYYYYYYYYYFFFFFFLINI*Q
                              LSRDIYWVFIC*LYIKIFLLGGIEF*LKKKKKK**KKNILFL*KTC*RSKKEFNTKKQFV
                              YVYFNYIINFLLLIGFFNR*RF*FVPQIN*IKLKKKKKKKKKKKKKKKKKKKKGGI*FFR
                              IDPKIFF*K*INFFIFNFCNFFFFIFYFLSII*F*KKKKKKFIAIKLQVKHNGVFKINIV
                              SNNFLYNP*YCLYFFFNFNKKKKKKRIIIIIILFSKHRYGKI*IINFFFFFFFFFFFFWK
                              LNQFLSVY*I*KNK*K*IKRTNSNSNFRKFISTYKWFTTQRFN*NI*KIWWNL*ILVC*X

>JC2a86a02.3396 541398 letters
        Length = 541,404

  Minus Strand HSPs:

 Score = 484 (175.4 bits), Expect = 1.3e-45, P = 1.3e-45
 Identities = 90/92 (97%), Positives = 90/92 (97%), Frame = -3

Query:     21 YIKFKKINLNELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVVWS 80
              YIKFKKIN NELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVV S
Sbjct: 192760 YIKFKKINKNELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVVLS 192581

Query:     81 DPILIREMFVNNGDYFLDRPKIPSIRHATHYH 112
              DPILIREMFVNNGDYFLDRPKIPSIRHATHYH
Sbjct: 192580 DPILIREMFVNNGDYFLDRPKIPSIRHATHYH 192485

CYP508A4 seq (66+70) complete 55% to CYP508
MIMLIKVFVLLLVVYILHNS (0)
YKKYKKLDKNELKGPTPIPVLGNLHQLSSLPHRDLSKMTKDYGDIFRVWFADL (2)
YTVVISDPVLIRKIYVENHESFRDRPKIP (0)
SMKYGTYYHGTAASMGEDWVRNRGIV SSAMRKSNIKHIYEVINNQVDVLMSTMKK
YEKRSEPFEPRYYMTKYTMAAMFKYIFNEDIGEDEDIHTGEIQKIMGPMNQVMEDF
GTGSLFDVLEISQTFYLKWLELTEKNFPLLLKFFNGRYEQHLETIKPESPRDLLDI
LINEYGTNTHDDYLNIASTVLDFFFAGTDTSSTTLEYLFLMMANYPEIQDKVHQEV
KSYLKQIGKDKVELNDRQSLPYVVAVIKETLRFKPVTPFGVPRSCVNEITIDEKYF
IPKGAQVIINYPSIFENEKYFKNANQFDPSRFLQTTTTNTASNEESSFNSNLAFIP
FSIGPRNCVGMQFAQDELFLAFANIVLNFTIKSVDGKKIDETISYGVTLKPKTRFK
VLLEKRLI
Contig11215  Chr 2
IIAFP1D53115
IIAFP1D83733 poor quality seq
IIAFP1D83741
IIAGP1D11636
JC1b242d09.s1
JC1c118b04.r1
JC1c167b01.s1
JC2b119f08.r1
JC2b119f08.s1 KYG region
JC2b220e03.r1 
VSG666
AFH195
AFK530
VFK340

>VFK340 (VFK340Q) /pub/dna_csm/LIBRARY/VF/VFK3-B/VFK340Q.Seq.d/
        Length = 681

  Plus Strand HSPs:

Supports N-term exon

Query:     1 MIMLIKVFVLLLVVYILHNSYKKYKKLDKNELKGPTPIPVLGNLHQLS 48
             MIMLIKVFVLLLVVYILHNSYKKYKKLDKNELKGPTPIPVLGNLHQLS
Sbjct:    24 MIMLIKVFVLLLVVYILHNSYKKYKKLDKNELKGPTPIPVLGNLHQLS 167

CYP508B1 Seq. 15, 77  complete 50% identical to 508A1
MLFNIFLYLFIFCIVSSG (0)
IKKYKKIHKNELSGPFPIPLLGNLHQLGKE
PHYTLTKMHNVYGEIFRLHFGDVYTVVVSDPILIREMFVDNHENFKYRPLLPTFKFGAGG
DHGLSLSNER WERTRELVQNAMKRTSIKKIYDMLDSQVYELIKSMKQYQVTGNPFEIHLY
AQRFTLSIMFKYIFNEDISYDEDITKGKIAELVQPMDQIFKSLGSGKLGDFISIAQPFYY
QWLKFSDKQFNGPFSTVKKFIYKRYLEHINTIDHDNPRDLMDLLINEFSNDKNLIPTILQ
TSLDMFLAGN (0)
DTTAASIIWFVLRMQEHPEIQLKAYNEIKDAVGDRDRVLLSDRPKTPYLN
AIIKEILRLNPVGPFGLPHRSSNDIVIGNGKYFIPKDSQILVNYRGLGFNEKYFENPSQF
DPSRFLNKNNDAYMPFGVGDRKCVGLQLAGDEQYLSFSNILLNFNLKNIGTPVSDYEEFG
LTLKPNKFKVLLESRK*
AU038895 *
AU074803 *
IIAAP1D5220 *
IIAFP1D67917 *
IICBP1D42314 
JC1b221g12.s1 *
JC2a236e12.r1 *
JC2b149h06.s1 *
JC3a263c07.s1 25685 letters
c-JC2b149h06.s1 *
JC2d41b07.s1 *
SSM113
FC-IC0188 EST
Length = 635

Query: 15236 LVTLKNIFFFF--FFFFFLVGKN 15298
             +V L NIF FF  FF +  V KN
Sbjct:     1 MVYLNNIFIFFIIFFIYSFVKKN 23

>_1
PILDYSIPFI**I***IHLFFFFYLFIYKDLWTEKKKKDNNEKKKKK**KKKKKKNIFIF
IFFFIFLFFNFSQRP*FPFFISF*FILPQVDFLKKK*KIKKKKIYLKNQDNKFFFNFFLG
LGFKKKNKKKK*KIKLKLLQY*KYPYPRIKLTIDTSE*KFKTLRILRWLKYSSNIKKIIN
*KSLDINFIN*SIIAHQEFGDVKKYFFFFFFFFFFGWKKLFLGFFFESF*DILNLNSPKK
KKKKMFINFYF*IFLFLFFFIVIFYFLFFIFLLLFFLII*KKKK*INIIKNVI*YIFIFI
YILYC*LRRMYFTFIFF*KKKNILIKKKKKKKNRLKNIRKFIKMNYQDHFQFHYLEIYIN
>_2
QFWIIVYHSFDEYDDESIFFFFFIYLFIRTYGLKKKKKIIMKKKKKNNKKKKKKKIFSFL
FFFLFFYFLIFPKDLNSLFLYLFSLFYPRSIS*KKNKK*KKKKFI*RTKIINFFLIFF*G
*GLKKKIKKKNKKLN*NFYNIKNIHTHA*N*QLTQVSRNLKH*EY*GG*NIQAISKR*SI
KKVWISILLINQL*PIKNLVTLKNIFFFFFFFFFLVGKNYFWVFSLNPFKIF*ISTHPKK
KKKKCSSIFIFKFFYFYFFLLLFFIFYFLFFYCYFF*SSKKKKNK*I**KMLFNIFLYLF
IFCIVSSGVCILHLFFFKKKKIY*LKKKKKKKID*KI*ENS*K*IIRTISNSITWKFTS
>_3
NFGL*YTIHLMNMMMNPSFFFFLFIYL*GPMD*KKKKR***KKKKKIIKKKKKKKYFHFY
FFFYFFIF*FFPKTLIPFFYIFLVYFTPGRFPKKKIKNKKKKNLFKEPR**IFF*FFFRA
RV*KKK*KKKIKN*IKTFTILKISIPTHKTNN*HK*VEI*NIKNIEVVKIFKQYQKDNQL
KKFGYQFY*LINYSPSRIW*R*KIFFFFFFFFFFWLEKIIFGFFL*ILLRYFKSQLTQKK
KKKNVHQFLFLNFFIFIFFYCYFLFFIFYFFIVIFFNHLKKKKINKYNKKCYLIYFYIYL
YFVLLAPAYVFYIYFFLKKKKYTN*KKKKKKK*IKKYKKIHKNELSGPFPIPLLGNLHQ

JC3a263c07.s1 25685 letters extends N-term
Query: 15642 IKKYKKIHKNELSGPFPIPLLGNLHQLGKEPHYTLTKMHNVYGEIFRLHFGDVYTVVVSD 15821
             I KYKKIH NEL GP PIP+LGNLHQ G+ PH  LTKM   YG I R++  D+YTVVVSD
Sbjct:    20 ISKYKKIHVNELCGPTPIPILGNLHQFGELPHRVLTKMTKKYGHILRVYMADMYTVVVSD 79

Query: 15822 PILIREMFVDNHENFKYRPLLPTFKFGAGGDHGLSLS-NERWERTRELVQNAMKRTSIKK 15998
             P+LIREM+VDN + F  R   P+ + G    HG   S  E W+  RE+V  AM++T++K 
Sbjct:    80 PLLIREMYVDNSDIFTDRVKKPSVEHGTFY-HGTVTSYGEHWKNNREIVGKAMRKTNLKH 138

Query: 15999 IYDMLDSQVYELIKSMKQYQVTGNPFEIHLYAQRFTLSIMFKYIFNEDISYDEDITKGKI 16178
             IY++LD QV  LI+SMK  + +G  F+   Y  +FT+S MFK++FN DI  DEDI KG  
Sbjct:   139 IYELLDKQVDVLIRSMKSIETSGKTFDTRYYITKFTMSAMFKFLFNHDIPEDEDINKGDT 198

Query: 16179 AELVQPMDQIFKSLGSGKLGDFISIAQPFYYQWLKFSDKQFNGPFSTVKKFIYKRYLEHI 16358
              +L+ PM ++F++ G G L D I+I QP Y  +L+  D+ F      + K+  ++Y EH+
Sbjct:   199 QKLMGPMSEVFQNAGRGSLFDVINITQPLYLLYLEMFDQSFKD----IMKYHREKYNEHL 254

Query: 16359 NTIDHDNPRDLMDLLINEFSNDKN-LIPTILQTSLDMFLAGNVCTS 16493
              T D D  RDL+D+LI E+  D +  I +IL T  D FLAG V TS
Sbjct:   255 KTFDPDVERDLLDILIKEYGTDNDDKILSILATINDFFLAG-VDTS 299
           
CYP508C1 Seq 4,23,24,83 (complete) 42% to seq 66 N-term short may be missing real 1st exon
MGPWG (0)
YIKNKRIHKNEAKGPIGFPLIGNMIQIGKTKPHIELMKLEKIYNQRILKIWLGDYYSVFLSDIDLIKDIFINKFE
NFSSRPKSPLTRLGTNDFRGINGSSGETWFKNKNIIVNAMKRANTKTIYTLLDNQVNDLIKEISKFESQNKS (0)
FNPKYYFRKFVLSTMFKYIFNEDVPYDENLENGKLSELTMEMENIFKTLKVGKLANSIEILETPYYYY
LQKTDKVFKNIKKLIIEKYKNHNLSINPEKPRDLLDILINEYGTTDDDVLNITQVTLDMFMAGTDT (1)
TANTLEWIIIKLCNSPIHQEIAYNELKKVVSSKVIIDDSIKREITLS
DRPNTPYIQAIIKETMRMHPVVVFGLPRYCENDIFIGDENYFIPKG (0)
CKVFINFHSIGYNEKYFKDPYKFEPNRFLENSNNSMDSFFPFGLGNRVCLG
RQLANDQLYLVIANLILKYKLKTIDENKINEDGIFGLTVSPNKYKINLESR*
C84082
C90646
Contig8310 (3387-1715) minus strand
IIACP2E3219
IIAEP1D1344
IIAFP1D46071
IIAFP1D67346
IIAGP1D6549
JAX4a27b02.s1 
JC1c281f06.s1 
JC2b06a12.r1 no exact match to any other seq
RYYMTKYTMAAMFKYIFNEDIGEDEDIGKIMGPMNQVMEDFGTGSLFDVLEISQTFYLKWLELTEKNFPLLLK
FFNGRYEQHLETIKPESPRDLLDILINEY
JC2b120c09.r1 
JC2b120c09.s1 Seq 24 46% to seq 21 is this from a cluster?
KTIDIENPRDLLDLLIIEYGDHSDENMILIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQD
sdic6A4b6.q1t
sdic6A87h12.p1c
sdic6B16c2.p1c possibly seq 4 with frame shift and bad seq
FNPKYYFRKFVLSTMFKYIFNEDVPYDENLENGKLSELTMEMENIFKTLKVGKLANSIEI 447
LETPYYYYLQKTDEVFKNIKKLIIEKYKNHNLSINPEKPRDLLDILINEYGTTDDDVL 621
SSC732
SSI265
SFG492

CYP508D1 Seq (59+81) 42% to seq 4  483 aa complete
MVYLNNIFIFFIIFFIYSF (0)
VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMD
FYHKMYGGIYRLWFGDYFVVSLNDPEIIREIFIKNYSNFSS (2)
RPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNAMKKSNLKQTYDNLSDSVNSLINLMKEFQSSNES (0)
FQPDMYLRKYALTTMFKYVFNETVSFENKIVQGEEAELIDNINEAFNFMTLGNAGDFFKILQPLYYQYLLYRGGCFNRIRTLIRNR
YIEHRKTIDIENPRDLLDLLIIEYGDHSDENMISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQDTVYNELKETVVGPVVTL
NDRPSTPYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFI
PKDSMVLINFYSLGRNPKDFPDPLKFDPNRFIGSTPDSFMPFGTGPRNCI (2)
GQALGMDQIYLLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR*
Contig8310  Chr 6
IIADP1D3779
IIADP2D3779 note 90% to same clone opposite read error rate is about 10%
IIAFP1D13883
IIAFP1D54156
IIAFP1D67079
IIAFP2D44284
JC1a137g04.r1
JC1a231c03.r1
JC1b152d10.r1
JC1b152d10.s1
JC1b218a10.r1
JC1c260b08.r1
AFJ424 (N-term)
CFB272

Query:   164 ESFQPDMYLRKYALTTMFKYVFNETVSFENKIVQGEEAELIDNINEAFNFMTLGNAGDFF 223
             + FQPDMYLRKYALTTMFKYVFNETVSFENKIVQGEEAELIDNINEAFNFMTLGNAGDFF
Sbjct:  3408 KKFQPDMYLRKYALTTMFKYVFNETVSFENKIVQGEEAELIDNINEAFNFMTLGNAGDFF 3229

Query:   224 KILQPLYYQYLLYRGGCFNRIRTLIRNRYIEHRKTIDIENPRDLLDLLIIEYGDHSDENM 283
             KILQPLYYQYLLYRGGCFNRIRTLIRNRYIEHRKTIDIENPRDLLDLLIIEYGDHSDENM
Sbjct:  3228 KILQPLYYQYLLYRGGCFNRIRTLIRNRYIEHRKTIDIENPRDLLDLLIIEYGDHSDENM 3049

Query:   284 ISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQDTVYNELKETVVGPVVTLNDRPST 343
             ISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQDTVYNELKETVVGPVVTLNDRPST
Sbjct:  3048 ISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQDTVYNELKETVVGPVVTLNDRPST 2869

Query:   344 PYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFIPKDSMVLINFYSLGRNPKDFPDPL 403
             PYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFIPKDSMVLINFYSLGRNPKDFPDPL
Sbjct:  2868 PYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFIPKDSMVLINFYSLGRNPKDFPDPL 2689

Query:   404 KFDPNRFIGSTPDSFMPFGTGPRNCI 429
             KFDPNRFIGSTPDSFMPFGTGPRNC+
Sbjct:  2688 KFDPNRFIGSTPDSFMPFGTGPRNCM 2611

 Score = 426 (155.0 bits), Expect = 3.8e-214, Sum P(3) = 3.8e-214
 Identities = 87/108 (80%), Positives = 91/108 (84%), Frame = -1

Query:     3 YLNNIFIFFIIFFIYSF-VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMDFYHKM 61
             + NN F + +I  I+   VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMDFYHKM
Sbjct:  4094 FYNN-F*WILIDLIFLI*VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMDFYHKM 3918

Query:    62 YGGIYRLWFGDYFVVSLNDPEIIREIFIKNYSNFSSRPFLPTITFGSF 109
             YGGIYRLWFGDYFVVSLNDPEIIREIFIKNYSNFSS  F   I F  F
Sbjct:  3917 YGGIYRLWFGDYFVVSLNDPEIIREIFIKNYSNFSSFVFKN*IIFFFF 3774

 Score = 361 (132.1 bits), Expect = 2.8e-207, Sum P(3) = 2.8e-207
 Identities = 74/93 (79%), Positives = 78/93 (83%), Frame = -3

Query:    73 YFVVSLNDPEIIREIFIKNYSNFSSRPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNA 132
             +F +  N    I  IF+     + SRPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNA
Sbjct:  3777 FFFLKNNLLIFILLIFLIFLIIYHSRPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNA 3598

Query:   133 MKKSNLKQTYDNLSDSVNSLINLMKEFQSSNES 165
             MKKSNLKQTYDNLSDSVNSLINLMKEFQSSNES
Sbjct:  3597 MKKSNLKQTYDNLSDSVNSLINLMKEFQSSNES 3499

 Score = 272 (100.8 bits), Expect = 3.8e-214, Sum P(3) = 3.8e-214
 Identities = 53/53 (100%), Positives = 53/53 (100%), Frame = -1

Query:   430 GQALGMDQIYLLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR 482
             GQALGMDQIYLLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR
Sbjct:  2534 GQALGMDQIYLLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR 2376

>Contig_0725, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion translate frame +1 translate plus frames translate all frames
TTTTTTTTTTTTAAAATTTTTTTTTAAAAAAAAAAAAAAAAAAAAAAAAGGCCCCATGGG
GCCCCTGGGGGGTTTTTTTTTTTTTTGGAAATTTGGAAATTTGGAAAAAATCCAATTTTT
TTTTTTAAAAAAATTTTTTTTTTAAAAAAAAAAAAAAAGGCCCCAAAAAAAAAAGGGGGA
AAATTTAAATTATTAAAACCCTTTTACCCCCTTTTTTAAAAAATTTTTTTTTTAAATTTT
TTTTTTAAAATTTTTTTTTTTTTTTTCCCCCCCAACCAATTAAAAAAAAAATTTTCTTTT
AAAACCCCCCCAGGGAATTTTTAATTTTTTCCCGGTTTTTTTTTTAAATTTTTTAAATCC
CTCCCTTGGGAAAAATTTAAAAACCTTTTTTTAAAAAAAAAATTTAAAACCAAAATTTTT
TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT
ATTAGTATATTAAAAATAAAAGAATTCATAAAAATGAAGCAAAAGGTCCAATTGGGTTCC
CATTAATTGGTAATATGATTCAAATTGGTAAAACAAAGCCACATATTGAATTAATGAAAC
TTGAAAAAATTTATAATCAAAGAATTTTAAAAATTTGGCTTGGAGATTATTATTCAGTTT
TCTTATCAGATATTGATTTAATAAAAGATATTTTCATAAATAAATTTGAAAATTTTTCAT
CAAGACCAAAATCTCCATTAACTAGACTTGGTACAAATGATTTTAGAGGTATTAATGGTT
CATCAGGTGAAACATGGTTTAAAAATAAAAATATTATTGTAAATGCAATGAAAAGAGCAA
ATACAAAAACGATCTATACTTTATTGGATAATCAAGTAAATGATTTAATAAAAGAAATTT
CAAAATTTGAATCACAAAATAAATCAGTATGTTTAATTAATAATACAAATATTATTAAAA
AAACTAATTTTTTTTTTTTTTTTCTTTTTTTTTTATATATAGTTTAATCCAAAATATTAT
TTTAGAAAGTTTGTTTTATCAACAATGTTTAAATATATTTTTAATGAAGATGTACCATAT
GATGAAAATTTGGAAAATGGTAAATTATCAGAATTAACAATGGAAATGGAAAATATTTTT
AAAACTTTAAAAGTTGGGAAATTAGCTAATAGTATAGAAATTTTAGAAACTCCATATTAT
TACTATTTACAAAAAACTGATAAAGTATTTAAAAATATTAAAAAATTGATTATTGAAAAG
TATAAGAATCACAATTTATCTATAAATCCTGAAAAACCAAGAGATCTTTTAGATATTTTA
ATTAATGAATATGGTACCACTGATGATGATGTTTTAAATATTACTCAAGTAACTTTGGAT
ATGTTTATGGCAGGAACAGATACAAGTAACTATACTATACTATACTAAATAAAAATTAAA
TTATATTATTTATAATTTTCTAAAATTTTCTAAAATATTCTAAAATTATAATTACAGCTG
CCAATACTTTAGAATGGATAATCATTAAACTTTGCAATAGTCCAATTCATCAAGAAATAG
CATATAATGAATTAAAAAAGGTAGTATCTTCAAAAGTGATAATTGATGATTCAATTAAAA
GGGAAATAACATTATCAGATAGACCAAATACACCATATATTCAAGCAATCATTAAAGAAA
CCATGAGAATGCATCCTGTTGTTGTATTTGGTTTACCAAGATATTGTGAAAATGATATAT
TTATTGGTGATGAAAATTATTTCATTCCAAAAGGAGTATGTATAATAATAATTATATTTT
TAAATTATGTATGATAATAATAATAATAATAATAATTTTTTTTTTTTTTTTACAGTGTAA
AGTTTTCATTAATTTTCATTCAATTGGGTATAATGAAAAATATTTTAAAGATCCATATAA
ATTTGAACCAAATAGATTTTTAGAAAATTCAAATAATTCAATGGATTCTTTTTTTCCATT
TGGTTTAGGTAATAGAGTTTGTTTGGGTAGACAATTAGCAAATGATCAACTTTATTTGGT
AATTGCAAATTTAATTTTAAAATATAAATTAAAAACAATTGATGAAAATAAAATTAATGA
AGATGGTATTTTTGGTTTAACTGTTAGTCCAAATAAATATAAAATTAATTTAGAATCAAG
ATAAATAGTACACACACCCACACACACACACATATATATATATATCTATAAATTACACAA
AAGTAAAAATTAAAAAAATAAAAAAAAAATAAAAAAATAAAAAAATACAAATGTATAATA
ATTTAATAATTTATTTAATTGATTTTTTTATATTTAAATTTTTTTTTTTTTTTTTTTACA
ATAAAATTATAAAAAAATAGATAAATAATGATTTATCTTTTTTCAAGACAAACTTTATAT 2400
TTTGCTGGTTTGAGATTTAATCCTGAAACATAATCAGTATCATCAAGTTTTTTACCATTT
TCAGAAGTAATTTTAAAATTAAGGAAAATATTTGAAAGAAGTAAATAAATTTGATCCATA 
CCTAATGCTTGACCACTAAAAAAAAATTAAATTAAAATAATATTAATATTAATTATAAAA 2580
AAAAAAAAAAAAAATTAAAAAAAAAACTTACATGCAATTTCTAGGACCAGTACCAAAAGG
CATAAAGGAATCAGGTGTAGAACCAATGAATCTATTTGGATCAAATTTAAGAGGGTCAGG
GAAATCTTTTGGATTTCTACCTAATGAATAAAAGTTTATAAGTACCATTGAATCTTTTGG
AATAAAGTGACCCTTAACGATTATATCTTGATCAGTTGTATGAGGTAAACCAAATGGAGC
AGGTGCTTTTAATCTTATGGTTTCTTTAATACATGCCATTGTATATGGTGTTGATGGTCT
ATCATTTAATGTTACCACTGGTCCAACAACAGTCTCTTTCAATTCATTATAAACAGTGTC
TTGAATTTGTTGATTATTGCATAACATTACCAAGAACCATTCTAATGAACTTGCCAATGT
ATCAACACCGGCCAAGATTACGTCAAAACAAACTTGAACGATTGAAATCATATTCTCGTC
AGAATGGTCACCATATTCAATGATTAACAAGTCTAATAAGTCTCTTGGGTTTTCAATGTC
TATGGTTTTACGATGTTCAATATATCTATTTCTTATTAATGTTCTAATTCTATTGAAACA
GCCACCTCTATATAAAAGATATTGGTAATATAATGGTTGCAAAATTTTAAAGAAATCACC
TGCATTACCAAGTGTCATAAAATTAAATGCTTCATTTATATTATCAATTAATTCAGCTTC
TTCACCTTGTACAATTTTATTTTCAAATGATACAGTTTCATTAAATACATATTTAAACAT
AGTTGTTAATGCATACTTTCTTAAATACATATCAGGTTGAAACTTTTTAAAAAAAAAAAA 3420
AAAAAAAAAAAAAAAAAAAAAAAATTTAATTAAAGAATAAAAAAAAAAGGATTTAAAATT 3480
CTTGTTGTAAATACATACACTTTCATTTGAAGATTGAAATTCTTTCATTAAATTAATTAA
AGAATTAACAGAATCACTTAAATTATCATATGTTTGTTTTAAATTTGATTTTTTCATTGC 3600
ATTTAAAAGTAAATTTCTATTTCTTTTCCAATAATCACCATTTGATCCTGAAATTCCTCT
ATAATTAAATGATCCAAATGTTATTGTTGGTAAAAATGGACGACTATGGTAAATAATTAA
AAAAATTAAAAAAATTAATAATATAAATATTAGTAAATTATTTTTTAAAAAAAAAAAAAA 3780
AAAAATATAATTTAATTTTTAAATACAAACGATGAAAAATTTGAATAATTTTTAATAAAT
ATTTCTCTAATTATTTCTGGATCATTTAAAGAAACAACAAAATAATCACCAAACCATAAT
CTATAAATTCCACCATACATTTTATGATAAAAGTCCATAATTGAATATGTGTCATTTCTT
AAACCAAATAAATTACCAATAATTGGTAAGGCTATTGGCCCTTTTAAATCATTTTCTTTA
GTGCTTTTTTTATTTTTTTTTACCTAAATTAAAAAAATTAAATCAATTAATATCCATTAA
AAATTATTATAAAATTATAAAATTAATAATATTTTCCAATGAATAAATAAAAAAAA



CYP508E1 seq (71+85) complete seq 38% to 508A1
MNIINIVIFLIIFYLFKSN (0)
YKKYIKKNKFEVNGPFPLPIIGFTSNYIKPHIKCHELVKQYGDIFRV
YLGDNKTIMVSDYKIIEELFIKNHNSFLERPITPSF
SHCSDNQNGILLSNEKWVNNREMIQKAMKKGIVKSVYELLNKQINDLIQSVKPFSESG 
EPLNFRLFATRFTLSTMMTYIFNEPLSYNENLENGTVLYMF (FRAME SHIFT)
LENIFVLADVGHIGDYISFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNEKEE (2)
NYDLLHTFIKEVGIKDKESVKSIVSNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELIFKELKSIVDSCGR
VEDRIYLSDRSSTPYLNATIKESMRITPITPYGIPRVVGKDLIIKG
HFIPKGSHVVINYRALNHDESIYKNPNQFNPNRFLDNNIESFIPFSIGNRNCVGQQIANH
ELFLFVSNFILNYKITPITNEPIDTTENFGINIRPNSFKINVYKRN*
CHR2.0.4511 
JAX4a63e05.s1
JAX4a63e05.r1
JAX4a246e10.s1
c-JAX4a246e10.r1
JC1a11b06.r1
JC1a65c04.s2
JC1a181f06.r1
JC1a195c01.s1 
JC1b01e02.s1
JC1b01e02.r1
JC1b117h11.r1 
JC1b117h11.s1 
JC1b128d04.r1 
JC1b231a04.r1 
JC1b231a04.s1 
JC1c88c05.s1  
JC1c110e07.s1
JC1c218c07.r1 
JC2d96a12.r1
sdic6A95c3.q1t

CYP513A1 Seq. 8+53 complete same as 8b 
MNYLVGLVLIFTIFYFFLQKNDKNMNSKIPGPKGIPILGNLLSMKGDLHLKLQEWYKQYG
VIYRIKMGNVETVVLTEYPIIREAFIGNSNSFVNRFQRKSRLKLNNGENLVIVNGDIHNK
LKTLVLSEMTNQRIKKYETSFIDNEIK KLFKVLDEHADTGKPIILNNHIKMFSMNIVLCF
TFGLNYSYPYDEFEKASEFIKLMVEFFNIAGQPIISDFIPSLEPFIDTSNYLNTYKRIFN
YTSDLISKFKNENEIHNNINDNNKSLADKPILSKLLQSFENGEISWDSVVSTCIDLQTAG
ADTSANTILYCLLELINNPNIQSKVYDDIKQAIIQSKENENQNDNENQEQTEEIITLSFN
KYRTLAPYLSMVVKETFRKYPSGTIGLPHVTSEDVELNGYKICAGTQIIQNIWATHRNEK
QFSEPDSFIPERFISQQQSANSNLIHFGCGVRDCIGKSLADSEIFTMLASLINRYEFTNP
NPSTPLNEIGKFGITYSCPENKIIIKKRF*
AU039107
AU039893
AU061936
C90256
C91045
Contig15093
IIAFP1D86342
IIAGP1D0885
JAX4a50g04.r1
JAX4a50g04.s1
JAX4a69d01.r1
JAX4a223f01.r1
JAX4b04a08.r1
JAX4b04a08.s1
JC2a07a03.r1
JC2a86a12.s1
JC2b18g07.r1
JC2b18g07.s1
JC2b46a06.r1 N-term seq 8b
JC2b46a06.s1 89% identical to seq 8
c-JC2e41d02.s1
SLG684
SSI504
SSJ415
SSM394
SFH684
AFK177

CYP513A2P seq 38 complete 54% to seq 513A1 probable pseudogene no ESTs
Processed pseudogene only intron in 513A1 is removed in 513A2P
MNFLIILINIIIVLTTIIFLK (frameshift)
KIIKKKNKYIPGPIGLPILGNLLSLKG (frameshift)
ISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQ 
KERNNCENVLLANGEMF (28aa deletion)
KLFKELDNLAEIGEPIILNRY (frameshift)
IKLISSNVMLTFTYGDKLLYKYNEIETFEEYLKNLSNYLKVTGRPILSDFIPFLKPF 
SKNVNIEEWDDFFYINSNKYIEKFKSGKNSKQSKTENK
PIISKLLSLYENGEISWASVIGSCIDMQASSTDTIMFCLIELTNLKNIQNKV
YNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMVIKETFRKHPVTAH
ETSNDVEFRGYKISKGTQIIKNIWATHRNEKIFQSPNSFIPERFLEELPNSNLVHFGV
GVRDCMGKSLAESQIFTILASFINRYEFLNPNPSIPLNDVGKFGLAFSCPQNKIIIKKRK*
Dict-IV-V477b10.p1c same seq as Dict-IV-V42f04.p1c including all frameshifts etc.
Dict-IV-V42f04.p1c 
Dict-IV-V831a03.q1c
Contig1006
IIAFP1D52879
IIAFP1D84459
JAX4a21c04.r1 73% identical to seq 8
JC1a87f04.r1
Contig_5063

>JC3a109h04.r1 Clone JC3a109h04, reverse read, bases 52 through 600, from
            2001-03-22
        Length = 547

  Minus Strand HSPs:

Query:     1 MNFLIILINIIIVLTTIIFLKKIIKKK 27
             MNFLIILINIIIVLTTIIFLK   KKK
Sbjct:   356 MNFLIILINIIIVLTTIIFLKNNKKKK 276 frameshift

Query:    18 IFLKKIIKKKNKYIPGPIGLPILGNLLSLKG 48
             +F  KIIKKKNKYIPGPIGLPILGNLLSLKG
Sbjct:   307 LFS*KIIKKKNKYIPGPIGLPILGNLLSLKG 215

>Contig_5063, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 10,471

  Minus Strand HSPs:

 Score = 355 (130.0 bits), Expect = 4.2e-67, Sum P(3) = 4.2e-67
 Identities = 69/70 (98%), Positives = 70/70 (100%), Frame = -3

Query:     1 ISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQKERNNCENVLLA 60
             ISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQKERNNCENVLLA
Sbjct:  5864 ISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQKERNNCENVLLA 5685

Query:    61 NGEMFKIIQR 70
             NGE+FKIIQR
Sbjct:  5684 NGELFKIIQR 5655

 Score = 294 (108.6 bits), Expect = 4.2e-67, Sum P(3) = 4.2e-67
 Identities = 57/57 (100%), Positives = 57/57 (100%), Frame = -2

Query:    92 IKLISSNVMLTFTYGDKLLYKYNEIETFEEYLKNLSNYLKVTGRPILSDFIPFLKPF 148
             IKLISSNVMLTFTYGDKLLYKYNEIETFEEYLKNLSNYLKVTGRPILSDFIPFLKPF
Sbjct:  5607 IKLISSNVMLTFTYGDKLLYKYNEIETFEEYLKNLSNYLKVTGRPILSDFIPFLKPF 5437

 Score = 106 (42.4 bits), Expect = 4.2e-67, Sum P(3) = 4.2e-67
 Identities = 21/21 (100%), Positives = 21/21 (100%), Frame = -1

Query:    71 KLFKELDNLAEIGEPIILNRY 91
             KLFKELDNLAEIGEPIILNRY
Sbjct:  5668 KLFKELDNLAEIGEPIILNRY 5606

AAAATCATTACAATCATCAATATTCACATTTTTTGAAAATGGTTTTAAAAATGGAATAAA
ATCTGATAAAATTGGTCGACCTGTAACTTTTAAATAATTTGATAAATTTTTTAAATATTC 5520
TTCAAAAGTTTCAATTTCATTATATTTATATAATAATTTATCACCATATGTAAAAGTTAG
CATGACATTCGATGAAATCAATTTTATATCTATTTAAAATTATTGGTTCACCAATTTCAG 5640
CAAGGTTATCTAATTCTTTGAATAATTTTAAACAATTCTCCATTTGCCAATAACACATTC
TCACAATTATTTCTTTCTTTTTGAAATCTATTTTCAAAAATATAAGAATTTCCAATAAAT 5780
GCCTCCTTTAAAGTAGAGGATTCTGTTAAAACTATAGTTTCTATAGAACCCATTCTAATT
CTATAAATTGGTCCATATTGTTTATAAAATTCTTGAAATGAGATTAACCTTTTAATGATA 5900

>_4
                              YH*KVNLISRIL*TIWTNL*N*NGFYRNYSFNRILYFKGGIYWKFLYF*K*ISKRKK*L*
                              ECVIGKWRIV*NYSKN*ITLLKLVNQ*F*IDIKLISSNVMLTFTYGDKLLYKYNEIETFE
                              EYLKNLSNYLKVTGRPILSDFIPFLKPFSKNVNIDDCNDF
                              >_5
                              SLKG*SHFKNFINNMDQFIELEWVL*KL*F*QNPLL*RRHLLEILIFLKIDFKKKEIIVR
                              MCYWQMENCLKLFKELDNLAEIGEPIILNRYKIDFIECHANFYIW**III*I**N*NF*R
                              IFKKFIKLFKSYRSTNFIRFYSIFKTIFKKCEY**L**FX
                              >_6
                              IIKRLISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQKERNNCE
                              NVLLANGELFKIIQRIR*PC*NW*TNNFK*I*N*FHRMSC*LLHMVINYYINIMKLKLLK
                              NI*KIYQII*KLQVDQFYQILFHF*NHFQKM*ILMIVMIX


>JC3a109h04.r1 Clone JC3a109h04, reverse read, bases 52 through 600, from
            2001-03-22
        Length = 547

  Minus Strand HSPs:

 Score = 251 (93.4 bits), Expect = 2.1e-20, P = 2.1e-20
 Identities = 50/57 (87%), Positives = 52/57 (91%), Frame = -3

Query:    19 ISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQKLFKELDNL 75
             ISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQK     +N+
Sbjct:   212 ISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQKERNNCENV 42

>JC3a109h04.r1_4 Clone JC3a109h04, reverse read, bases 52 through 600, from 2001-03-22 translate frame +1 translate plus frames translate all frames
                              DL*YLMIIKPPISF*NNYLIFLQTPNF*FNFIYLFKFLNYYFLIVILYLYFKKLIIIF*K
                              *IKNEFFNNIN*YYNSFNNNYFLKK**KKKINIYLVLLGYQF*EIYYH*KVNLISRIL*T
                              IWTNL*N*NGFYRNYSFNRILYFKGGIYWKFLYF*K*ISKRKK*L*ECVIGKWRNV*NYS
                              KK
                              >JC3a109h04.r1_5 Clone JC3a109h04, reverse read, bases 52 through 600, from 2001-03-22 translate frame +1 translate plus frames translate all frames
                              *FIILNDNKTTNIFLK*LSYFFTNPQFLI*FYLFI*IFKLLFFNCYFIFIF*KINYYLLK
                              INKK*IF**Y*LIL**F*QQLFS*KIIKKKNKYIPGPIGLPILGNLLSLKG*SHFKNFIN
                              NMDQFIELEWVL*KL*F*QNPLL*RRHLLEILIFLKIDFKKKEIIVRMCYWQMEKCLKLF
                              KEX
                              >JC3a109h04.r1_6 Clone JC3a109h04, reverse read, bases 52 through 600, from 2001-03-22 translate frame +1 translate plus frames translate all frames
                              IYNT****NHQYLFKIIILFFYKPPIFNLILFIYLNF*IIIF*LLFYIYILKN*LLSFKN
                              K*KMNFLIILINIIIVLTTIIFLKNNKKKK*IYTWSYWVTNFRKFIIIKRLISFQEFYKQ
                              YGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQKERNNCENVLLANGEMFKIIQ
                              RX

CYP513A3 Seq. (7+50)complete  49% to seq 8 only one intron
MTSLTLYLIIFSIILYLFVN (0)
RNKRKNLK IPGPNGI PIFGNLLSLSGEMHLTLQEWYKTYGSVFSIRMGNIDTVVLT
EYPTIRKAFVDNSLAFASRYQLKSRVVLTGAKDLAIQNGEIHSLLKKVVLSEMTTTKIKRMEIHIIKE
TEKILKILDKHAERGEPFIINNYLNMFSMNVILRFLLGIDYPYENVDETVGYVKSIKSFFAVAGLPIL
SDFIPIPLKKSGVFFDSYKELEIETDKLIEKFKKSRNEKIENGTYNEEEDESILSKLLKEYEHGNITW
ECVSHTCIDIISAGTDTSANTLVMALIELINNQEIQSKAFSSIRSSCLNDSNDDDDDDEIVITHSKY
RSLLPYISMIIKETFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYIIQNIFSSHRSDKI
FQSPNEFIPERFFESSQNQGLIHFGLGVRDCVGKSLAECEIFTLIATLLNRYQFINPNN
SKKLNDIGTFGLAQVCPDTNIILKKRI*
C90627
IIAFP1D53507
IIAFP1D59661
IIAFP1D61001
IIAFP1D72987
JAX4b12e11.s1 N-term
JC1a07f07.r1
JC1a25b04.r1
JC1a26e12.r1
JC1a68c10.s2
JC1a91f11.r1 N-term
JC1a105h02.s1 N-term
JC1a107g08.r1
JC1a111f06.s2
JC1a135d11.s1 mid region
JC1a150d03.r2
JC1a226f02.s1
JC1b144h08.r1
JC1b186b12.r1
JC1c102g01.r1
JC1c102g01.s1
JC1c131e03.s1
JC1c232h03.s1
JC1c235b07.s1
JC1c247d05.r1
JC1c247d05.s1
JC1c253h06.s1
JC1c279d04.r1
JC1c279d04.s1
JC2a11g11.r1 CALLED 7B
JC2b41a05.r1
JC2b257e01.s1
JC2b257e01.r1 mid region
JC2b257f06.s1
JC2c20g05.r1
JC2c20g05.s1
SSI868
AFC386
Seq 7b JC2a11g11.r1  C-term 90% to seq 7 may be a different gene
DTSANTLVMALIELINNQEIQSKALPSIRSSCL
NDSNDDDDDDEIVIAHSKYRSLLPYISMIIKESFRKHPIALLGLPHVTTEDVEIDGYKIE 281
AGPFIIPNILSSHRSDKIFQSPNEFIPEIFFGSCQNX 173
NQGLIHFGLGVRDCVGKSLAECEIFPLIATLLNKYQFINPNNS*KLNDIGTFGL 14

>JC1b186b12.r1 Clone JC1b186b12, reverse read, bases 66 through 644, from
            2000-09-15
        Length = 577

  Minus Strand HSPs:

Query:     1 MTSLTLYLIIFSIILYLFRN 20
             MTSLTLYLIIFSIILYLF N
Sbjct:   501 MTSLTLYLIIFSIILYLFVN 442

Query:    19 RNKRKNLKIPGPNGIPIFGNLLSLSG 44
             RNKRKNLKIPGPNGIPIFGNLLSLSG
Sbjct:   340 RNKRKNLKIPGPNGIPIFGNLLSLSG 263

>JC1b186b12.r1 Clone JC1b186b12, reverse read, bases 66 through 644, from 2000-09-15 translate frame +1 translate plus frames translate all frames
TTTGGTTGTAGTCATTTTCTGAAAAACTACTTTTTTCAATAATGAATGAATTTCACCATT
TTGAATAGCAAGATCTTTAGCACCAGTTAATACAACTCTACTTTTCAATTGATATCTACT
TGCAAATGCTAATGAATTATCAACAAATGCTTTTCTAATAGTTGGATATTCAGTTAAAAC
CACTGTATCAATATTACCCATTCTAATGGGAAAATACAGATCCATAAGTTTTATACCATT
CTTGTAGTGTTAAATGCATTTCACCACTTAAAGATAATAAATTACCGAAAATTGGAATTC
CATTAGGTCCTGGAATTTTTAAATTTTTTCTTTTATTTCTCTAGTTAATTAATTTAAAAT
                              R  N  K  R
AAATAAAAAAAAAATTAGTTTTAAGATAAAGATAAATCAATTTTTGAAAAAAAAAATAAA
AAAAAAAAAATAAAAAATTACATTTACAAACAAATATAAGATAATTGAAAATATAATCAA
                       N  V  F  L  Y
ATATAATGTTAAACTAGTCATTGTTTAAGATTTTGCTTTGGAAATTGAGTTTGAGGGTTT
TTTTTTTTTTTTTTTGTTTTTTTGGTTTTTTTTTTAA

CYP513B1 seq 49 complete seq 45% to seq 7, 50 only one intron
MNLLVLSVILAIIIYLIFKR (0)
NYKYSPSKINSKIPGPIGLPIFGNILSLDNKNGIHTTFQKWFKIYGPIYSVN
MGNKSAVVLTGFPIIKKAFIDNSEAFAPHYTFESRYKLNKCSDITQENGKNQSALKRIFL
SELTVTRIKKQESHIQNEIVKLMKVLDKHSEDGKPFLLNNYFSMFSINIISRFLFGID
FPYQDFEETSDLMVGIRDLLIASGEIVLSDFLPIPHSKRSKLYTSYQALVVQIETLV
KSHKYKEDDECMLSKLMIEHDKGNIPWDAVISNCNTIITAG
SDSTSSTALFFLIEMMNNPTIQTKVYNDIVVSFEQNQQADDYMNESMVILKYSKYRSLIPYLSLALKENYR
KHPAAPFGAPHETTQETVIEGYTIAKGTMIFQNIYATQRSDTFYSQPDEFIPERWNGDENSQTLIS
FGTGIRDCIGKSLAYNEIFTIIASVLNRYEFINPNPSIPFDDNGIPGLTTQCKNTVVQIKKR*
IIADP1D4990
JAX4a221a10.r1
JC1c244e11.r1
JC1c244e11.s1
CFG492 SUPPORTS FIRST INTRON BOUNDARY

>CFG492 (CFG492Q) /pub/dna_csm/LIBRARY/CF/CFG4-D/CFG492Q.Seq.d/
        Length = 1416

  Plus Strand HSPs:

 Score = 254 (94.5 bits), Expect = 1.5e-20, P = 1.5e-20
 Identities = 51/51 (100%), Positives = 51/51 (100%), Frame = +2

Query:     1 MNLLVLSVILAIIIYLIFKRNYKYSPSKINSKIPGPIGLPIFGNILSLDNK 51
             MNLLVLSVILAIIIYLIFKRNYKYSPSKINSKIPGPIGLPIFGNILSLDNK
Sbjct:    20 MNLLVLSVILAIIIYLIFKRNYKYSPSKINSKIPGPIGLPIFGNILSLDNK 172



CYP513C1 Seq 5+56 complete seq 43% to seq 7 495 aa only one intron
MNYLVLILVSLVSIYFLFIKN (0?)
QDRKKKINSKIPGPKGFLFGNLIELARTKKNLHLKY
LEWFEKYGPVFRVSIGSLETVVLTGYPILREAFIDNSDTFTSRFQRENARS
INGYKGLINSNGDYHKNLKSVILSEMTATKIKKMESHINQESKRLCELLDQHAKQGTPFTMNKYLNLF
SINIILRFLFGVNYPYTELDDGSSSIIQVIQQFLKLVSQPSITTYFPILSPFMNDRSKEF
YDIHKLLSNHIINLIERYKDSKQHQQQEQEPIDGATEPTVTILDKLLIEVENNRITQNAL
ISICIDVLIAGTDTVGQTLSFAIVALVNNAEIQEKLSRNIIDSMEKGDNHYSYSKYRNGI
PYLALVVKEVFRMYPAGILGLPHMTSEDCEIQGHKIAKGTQIIQNIYSTHRSESFWPNPN
NFIPERHIQNDVSKSVHFAVGTRNCMGMSLSEAEVHTAMAELFGNFKFTNPSNIPLNDQG
TFSVALNCPDFFVKIERRN*
C91402
CHR2.0.12287
Contig2443  Chr 6
JAX4a56e09.r1
c-JAX4a56e09.r1
JAX4a73a11.r1
JC1a97h05.r1
JC1b31b11.s1
JC1b136b11.s1
JC1c226g09.r1 
JC1c226g09.s1
sdic6A2e2.q1t 41-92 KYG region
sdic6A74a10.p1c
sdic6Fh12.q1t
sdic6Rf10.q1t
SSK171 
AFH363 N-term EST supports first intron boundary
VFB834
SFK345 N-term EST supports first intron boundary

>CYP513C1 Seq 5+56 complete seq 43% to seq 7 486 aa
          Length = 486

 Score = 215 (75.7 bits), Expect = 1.0e-20, P = 1.0e-20
 Identities = 48/58 (82%), Positives = 48/58 (82%)

Query:     1 MNYLVLILVSLVSIYFLFIKNQDRKKKINSKIPGPKGFLFGNLIELARTKKNLHLKYL 58
             MNYLVLILVSL          QDRKKKINSKIPGPKGFLFGNLIELARTKKNLHLKYL
Sbjct:     1 MNYLVLILVSL----------QDRKKKINSKIPGPKGFLFGNLIELARTKKNLHLKYL 48


CYP513D1 Seq (54+55+78) complete seq 38% to seq 8 only one intron
MGISSIIIILFIIVLLKKLIKK (0)
EDRIHRINKNIPGPKSKLLVGNLFDLKGQVHEKLKEWYEQYGSVYRIEFGSVSTVVLTEYATLKEAFV
DNGEIFQSRFQRKSRTTCNKGLNLANSNGEYFNHLKKTLSNEITNQKMKK
NEKIIKIQVGLLSEFFNEISGGGGGGSGGSGISKE
PINNIIKMYSLNVML
SLLFNIHFPYNNNSYQDELMSTITRYFKSTGLPYPSDFIPILYPFLKNK
PKEYFEDYESVKKLITRITNEYQLKHMTEISNKSTIEEIENYQPTNILESLLKQYRLNKI
PYDGVIGCLMDLILAGSDTTGNTCLFSLVALVNNSNIQEKLFNEISNAFNDDDGDEL
NGANDISNSLLKLSYFSDRIKTPYLVAFIKEVKRYYPCAPLSVPHL
LTEDCEIQGYKIAKGTQVIQNIYSTHLSQSFC
SNPLEFSPERFLDSTNEPKIITFGIGQRKCPGENIFEIEIYIFLVYLIKKFKFSHPI
DDNLQLNDRGQFGLSLQCPQLNIKVESR*
IIAFP1D25075
IIAFP1D36726
JAX4b58a08.r1
JC2a05e01.s1 
JC2c166f03.r1 
JC2e05a11.r1
JC2e05a11.s1
c-JC2e05a11.s1
SFE487

>SFE487 (SFE487Q) /pub/dna_csm/LIBRARY/SF/SFE4-D/SFE487Q.Seq.d/
        Length = 1079

  Plus Strand HSPs:

 Score = 163 (62.4 bits), Expect = 9.2e-11, P = 9.2e-11
 Identities = 39/50 (78%), Positives = 40/50 (80%), Frame = +1

Query:     1 MGISSIIIILFIIGLLKK---------KINKNIPGPKSKLLVGNLFDLKG 41
             MGISSIIIILFII LLKK         +INKNIPGPKSKLLVGNLFDLKG
Sbjct:    40 MGISSIIIILFIIVLLKKLIKKEDRIHRINKNIPGPKSKLLVGNLFDLKG 189

>JC2e05a11.r1_frame+1 Clone JC2e05a11, reverse read, bases 59 through 417, from 1999-04-07, translation frame +1
GTFFFLXNITWKFSKMGISSIIIILFIXGLLKKGXKKVFSIFFLKKNNNNNNYNKLIFFK
IK*NKIRKIEFXE*IKIFQDQKVNYWLVIYLI*KDKXMKN*RNGMXNMEXFIVLNLVVL
>JC2e05a11.r1_frame+2 Clone JC2e05a11, reverse read, bases 59 through 417, from 1999-04-07, translation frame +2
GPFFF*XISRGNFQKWEFLQLLLSYLLXDY*KRGXKRYFLFFF*KKIIIIIIIIN*FFLK
*NKIKSGR*NSQNK*KYSRTKK*IIGW*FI*FKRTSX*KIKGMV*XIWKXLSY*IW*C
>JC2e05a11.r1_frame+3 Clone JC2e05a11, reverse read, bases 59 through 417, from 1999-04-07, translation frame +3
DLFFFX*YHVEIFKNGNFFNYYYPIYYXIIKKGG*KGIFYFFFKKK*****L**INFF*N
KIK*NQEDRIXRINKNIPGPKSKLLVGNLFDLKGQVHEKLKEWYXQYGXVYRIEFGSV

>JC2e05a11.r1 Clone JC2e05a11, reverse read, bases 59 through 417, from 1999-04-07 translate frame +1 translate plus frames translate all frames
GGGACCTTTTTTTTTTTAANTAATATCACGTGGAAATTTTCAAAAATGGGAATTTCTTCA
ATTATTATTATCCTATTTATTATNGGATTATTAAAAAAGGGGGNTAAAAAGGTATTTTCT
ATTTTTTTTTTAAAAAAAAATAATAATAATAATAATTATAATAAATTAATTTTTTTTAAA
ATAAAATAAAATAAAATCAGGAAGATAGAATTCNCAGAATAAATAAAAATATTCCAGGAC
CAAAAAGTAAATTATTGGTTGGTAATTTATTTGATTTAAAAGGACAAGTNCATGAAAAAT
TAAAGGAATGGTATGANCAATATGGAAGNGTTTATCGTATTGAATTTGGTAGTGTTA

CYP514A1 Seq. 11 complete seq 35% to seq 7

MNTIFTIILTITILVLSLILK (0)
DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVF
RLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIISS
NGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKN
TPNQIRILAVKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYL
PFLFPLDIKSMSKNDIISSYFFLKDFIGIKLDAIKIKYEKENELKNETT
DETSSKLNNIPIIENYYKNYLDGSIHYDS
ILFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMN (1)
ESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEF
WGEDALEFKPERFKNQPLYQKGLFHFGA (1)
GPRGCPGGRFTESLTFTFLVIMLKNFKIVNPTDIPI
DVEGEVGLAMQCKPFDALFIKRN*
* means sequence was verified by blast with seq 11
Note 514A1 is only 7aa diffs from 514A4 so some of these genomic seqs are from
514A4
AU075025 *
C93191  *
Contig16233 Chr 6 whole gene *
IIAFP1D5994 *
IIAFP1D46081 *
IIAFP1D56636 probably same gene some errors
IIAGP1D1903  *
IIAGP1D4567 *
IIAGP1D6746 *
JAX4a85a04.r1 *
JAX4a165d01.r1 * 
JAX4a185c11.s1 *
JAX4a225h05.r1 *
JAX4a225h05.s1 *
JC1a54g04.r1  *
JC2b365b08.s1 *
JC2c130f08.r1 *
JC3c23c03.s1   FGKDYNIIS
JC3a164c10.s1  FGKDYNIIS
SSM757
SFH636
Contig_0470

>Contig_0470, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 26,456

  Plus Strand HSPs:

 Score = 1791 (635.5 bits), Expect = 3.1e-257, Sum P(4) = 3.1e-257
 Identities = 349/352 (99%), Positives = 350/352 (99%), Frame = +3

Query:    22 DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVFRLRLGSVEIV 81
             DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVFRLRLGSVEIV
Sbjct:  2865 DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVFRLRLGSVEIV 3044

Query:    82 VLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIISSNGDYHYVLRGILTSEITTRK 141
             VLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIISSNGDYHYVLRGILTSEITTRK
Sbjct:  3045 VLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIISSNGDYHYVLRGILTSEITTRK 3224

Query:   142 LNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILAVKLILNFTLGIEENDETILII 201
             LNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILAVKLILNFTLGIEENDETILII
Sbjct:  3225 LNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILAVKLILNFTLGIEENDETILII 3404

Query:   202 VEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISSYFFLKDFIGIKLDAIKIKYEK 261
             VEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISSYFFLKDFIGIKLDAIKIKYEK
Sbjct:  3405 VEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISSYFFLKDFIGIKLDAIKIKYEK 3584

Query:   262 ENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSILFSISDIIFAAVDSTSNGFSL 321
             ENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSILFSISDIIFAAVDSTSNGFSL
Sbjct:  3585 ENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSILFSISDIIFAAVDSTSNGFSL 3764

Query:   322 LIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMNESYRY 373
             LIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMN +Y Y
Sbjct:  3765 LIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMNGNYPY 3920

 Score = 410 (149.4 bits), Expect = 3.1e-257, Sum P(4) = 3.1e-257
 Identities = 77/86 (89%), Positives = 81/86 (94%), Frame = +2

Query:   359 KYPYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGE 418
             K  +++  + ESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGE
Sbjct:  3965 KINFVLI*IIESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGE 4144

Query:   419 DALEFKPERFKNQPLYQKGLFHFGAG 444
             DALEFKPERFKNQPLYQKGLFHFGAG
Sbjct:  4145 DALEFKPERFKNQPLYQKGLFHFGAG 4222

 Score = 305 (112.4 bits), Expect = 3.1e-257, Sum P(4) = 3.1e-257
 Identities = 58/65 (89%), Positives = 59/65 (90%), Frame = +3

Query:   438 LFHFGAGARACPGGRFTESLTFTFLVIMLKNFKIVNPTDIPIDVEGEVGLAMQCKPFDAL 497
             +  F  G R CPGGRFTESLTFTFLVIMLKNFKIVNPTDIPIDVEGEVGLAMQCKPFDAL
Sbjct:  4329 IIFFKTGPRGCPGGRFTESLTFTFLVIMLKNFKIVNPTDIPIDVEGEVGLAMQCKPFDAL 4508

Query:   498 FIKRN 502
             FIKRN
Sbjct:  4509 FIKRN 4523

 Score = 94 (38.1 bits), Expect = 3.1e-257, Sum P(4) = 3.1e-257
 Identities = 21/21 (100%), Positives = 21/21 (100%), Frame = +3

Query:     1 MNTIFTIILTITILVLSLILK 21
             MNTIFTIILTITILVLSLILK
Sbjct:  2724 MNTIFTIILTITILVLSLILK 2786

>Contig_0470, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion translate frame +1 translate plus frames translate all frames
ATTTTATAATTTAAATAGTTTTTTTTTTAATATAAAAACTTTAATATAAAAGATTCTCAA 2700
TTAAAATATTTTATAATATTACAATGAATACAATATTTACAATTATTTTAACAATTACAA
TATTAGTATTATCATTAATATTAAAAGTAAAATTTTAAAATATTATATATATTATTGATT
ATTAAAAATTAAACACTTACTTATACTTTTATTTTTTTAAATAGGATTTATTATTTGAAG
GTAGGATTAAAAAAATTAATAAATTAATACCTGGTCCTTCTACAATTCCAGTTTTTGGTA
ATTTATTACAAATTAACGCAAAAGATTTCCCAAAAAGTGTAAATGATTTCTATGAAAGAT
ATGGCAAAGTTTTTAGATTAAGATTGGGTAGTGTTGAAATTGTTGTTCTAACAGGGCCTG
AAGTTATTGATGAATGTTTTAATAAAAAACATAGAGAAATTTTCAAAGAAAGATATATTA
AATTCTCAAGATTTTTTGGTAAAGATTATAACATTATTTCTTCAAATGGTGATTATCATT
ATGTTCTAAGAGGAATTCTTACAAGTGAAATTACAACAAGAAAGTTAAATAATGGCAGAT
TAGAATCAAATAAATTTATTTTAGAAATGTTTAGTAATCTTTGTAAAGATAATAAAGAAA
CTCTTGTTAAAAATACACCAAATCAAATTAGGATTCTTGCTGTTAAATTAATATTAAATT
TCACATTAGGAATTGAAGAGAATGATGAAACCATTCTAATCATTGTAGAAAAGATTAAAT
GCATTTTTGAAGCTGCTGGATTATTAATTTACTCTGATTATTTACCATTTTTATTTCCAT
TAGATATAAAATCAATGTCAAAGAATGATATTATTTCTAGTTACTTTTTTTTAAAAGATT
TTATAGGTATAAAACTTGATGCTATTAAAATTAAATATGAAAAAGAAAATGAATTGAAAA
ATGAAACTACTGATGAAACATCTTCAAAACTAAACAATATTCCAATTATTGAGAATTATT
ATAAAAATTATTTAGATGGTTCAATTCATTATGATTCAATCTTGTTTTCAATTTCTGATA
TTATTTTTGCAGCAGTTGATTCAACATCCAATGGGTTTAGTCTTTTAATTGGTCAATTAA
TTAATAAACCTGAAATTCAAGATAAAATATATGAAGAAATCATGAGAAATGATGAAAATA
ATAATACAAATAATATATCATTTGCTGATCATACAAAATATCCATATATTATTTCTGTAA 3900
TGAATGGTAATTATCCATATATATATATATATTGGTATTAATATGTGTTGATATATTTTT
TCAAAAAATAAATTTTGTATTAATTTGAATAATAGAATCATATAGATATAATTCTTCAGT
ACCAATAACTGAACCGAATAAAACAACAGAAGATGTTGAAGTAAATGGATATAAAATTGC
AAAAGGTACAATGATAATTAAAAATCTTCGTGGTACACATTTATCAAAAGAGTTTTGGGG 4140
TGAAGATGCTTTAGAATTTAAACCAGAAAGATTTAAAAATCAACCTTTATATCAAAAAGG
ACTTTTTCATTTTGGTGCAGGTATGTTTTGATACCATTTTAAAAAGGAATTAAAATTTAC
CACTATTTTTTTTTTTTTTAATTTACAACTAATTTATAATAATAATAATAATAATAATAA 4320
TCATCATTATTATTTTTTTTAAAACAGGACCTAGAGGTTGTCCAGGAGGAAGATTTACAG 4380
AATCTTTGACTTTTACATTTTTGGTGATAATGTTAAAGAATTTTAAAATAGTCAATCCAA
CTGATATTCCAATTGATGTTGAAGGAGAAGTTGGTTTGGCAATGCAATGTAAACCTTTTG 4500
ATGCCTTATTCATAAAACGTAATTAATAAAAAAAATATTTATTTTTTTATTATTTTAATC
ATTGTTTTCATTGTTGCAAAAATATAATATTTAATAAATAAATTTAGATACAAATAGAAA

CYP514A2 Seq 65 similar to seq 11 500 aa 
MNLIYTIILTIIILVLIISIK (0)
DLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVATCFNDFYKQYGKVYRLRLGSVET
VVLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILHSNGDYHFLLKGVLS
SQVTVRKLNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIKKLILKVVLNFTLGI
EENDDINLSLFQNGSN
IFKAAGLFIYSDYLPFLFPLDIKSMAKSNMISSYVFVRDYLAKKLEEVKKKEYIING
DDDGGVDTSQTPLIESYYKLYLQGLIGYDSILLSIVDIIIASVDTTS
NSISFIIARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPYIISIMN (1)
ETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLIS
KEFWGEDALEFKPERFKTQTLNQKGLLHFGA (1)
GPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDRGDVGLSMQCKPYDALFIKRN*
AB050504 C-term only
IIADP1D1846   *
IIAFP1D29162  * 43/45 identities
JC1a251d12.r1 *
JC1c209h10.r1 *
JC1c220h10.s1 *
JC1c262g11.r1 *
JC1c290g01.r1 *
JC2a117h07.r1 *
Dict_IV1e09.p1c extends upstream
IICBP3D35368

>Contig_0356, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 17,230

  Minus Strand HSPs:

 Score = 1773 (629.2 bits), Expect = 3.6e-258, Sum P(4) = 3.6e-258
 Identities = 346/349 (99%), Positives = 347/349 (99%), Frame = -3

Query:    22 DLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVATCFNDFYKQYGKVYRLRLGSVETV 81
             DLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVATCFNDFYKQYGKVYRLRLGSVETV
Sbjct:  5126 DLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVATCFNDFYKQYGKVYRLRLGSVETV 4947

Query:    82 VLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILHSNGDYHFLLKGVLSSQVTVRK 141
             VLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILHSNGDYHFLLKGVLSSQVTVRK
Sbjct:  4946 VLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILHSNGDYHFLLKGVLSSQVTVRK 4767

Query:   142 LNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIKKLILKVVLNFTLGIEENDDIN 201
             LNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIKKLILKVVLNFTLGIEENDDIN
Sbjct:  4766 LNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIKKLILKVVLNFTLGIEENDDIN 4587

Query:   202 LSLFQNGSNIFKAAGLFIYSDYLPFLFPLDIKSMAKSNMISSYVFVRDYLAKKLEEVKKK 261
             LSLFQNGSNIFKAAGLFIYSDYLPFLFPLDIKSMAKSNMISSYVFVRDYLAKKLEEVKKK
Sbjct:  4586 LSLFQNGSNIFKAAGLFIYSDYLPFLFPLDIKSMAKSNMISSYVFVRDYLAKKLEEVKKK 4407

Query:   262 EYIINGDDDGGVDTSQTPLIESYYKLYLQGLIGYDSILLSIVDIIIASVDTTSNSISFII 321
             EYIINGDDDGGVDTSQTPLIESYYKLYLQGLIGYDSILLSIVDIIIASVDTTSNSISFII
Sbjct:  4406 EYIINGDDDGGVDTSQTPLIESYYKLYLQGLIGYDSILLSIVDIIIASVDTTSNSISFII 4227

Query:   322 ARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPYIISIMNETYR 370
             ARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPYIISIMN  Y+
Sbjct:  4226 ARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPYIISIMNGNYK 4080

 Score = 405 (147.6 bits), Expect = 3.6e-258, Sum P(4) = 3.6e-258
 Identities = 76/76 (100%), Positives = 76/76 (100%), Frame = -1

Query:   367 ETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLISKEFWGEDALEFKPERF 426
             ETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLISKEFWGEDALEFKPERF
Sbjct:  4018 ETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLISKEFWGEDALEFKPERF 3839

Query:   427 KTQTLNQKGLLHFGAG 442
             KTQTLNQKGLLHFGAG
Sbjct:  3838 KTQTLNQKGLLHFGAG 3791

 Score = 329 (120.9 bits), Expect = 3.6e-258, Sum P(4) = 3.6e-258
 Identities = 59/59 (100%), Positives = 59/59 (100%), Frame = -2

Query:   442 GPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDRGDVGLSMQCKPYDALFIKRN 500
             GPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDRGDVGLSMQCKPYDALFIKRN
Sbjct:  3705 GPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDRGDVGLSMQCKPYDALFIKRN 3529

 Score = 93 (37.8 bits), Expect = 3.6e-258, Sum P(4) = 3.6e-258
 Identities = 21/21 (100%), Positives = 21/21 (100%), Frame = -2

Query:     1 MNLIYTIILTIIILVLIISIK 21
             MNLIYTIILTIIILVLIISIK
Sbjct:  5256 MNLIYTIILTIIILVLIISIK 5194

CYP514A3P seq 89 67% to 514A1 possible pseudogene no ESTs
LSGIPFVEDCYESYLDGSVRYDSFLFSVSDVVFAAVDSASNGFSLLIGQLFNRPGIQDGI
YEGIMRNDEVSNADDMSFADRAECPCVFSVMD (bad intron joint)
ESYRF (frame shift)
YSSVPITEPNVATGDVEVNGYRIAEGAMIIXNLRGAHFSRGFWG
IIAFP1D7832  87% to seq 11 
IIAFP2D7832


>IIAFP2D7832
        Length = 1248

  Minus Strand HSPs:

 Score = 484 (175.4 bits), Expect = 6.1e-67, Sum P(2) = 6.1e-67
 Identities = 93/98 (94%), Positives = 94/98 (95%), Frame = -3

Query:     1 LSGIPFVEDCYESYLDGSVRYDSFLFSVSDVVFAAVDSASNGFSLLIGQLFNRPGIQDGI 60
             LSGIPFVEDCYESYLDGSVRYDSFLFSVSDVVFAAVDSASNGFSLLIGQLFNRPGIQDGI
Sbjct:   685 LSGIPFVEDCYESYLDGSVRYDSFLFSVSDVVFAAVDSASNGFSLLIGQLFNRPGIQDGI 506

Query:    61 YEGIMRNDEVSNADDMSFADRAECPCVFSVMDESYRFY 98
             YEGIMRNDEVSNADDMSFADRAECPCVFSVMD  +  Y
Sbjct:   505 YEGIMRNDEVSNADDMSFADRAECPCVFSVMDVCFSIY 392

 Score = 221 (82.9 bits), Expect = 6.1e-67, Sum P(2) = 6.1e-67
 Identities = 42/44 (95%), Positives = 42/44 (95%), Frame = -2

Query:    98 YSSVPITEPNVATGDVEVNGYRIAEGAMIIXNLRGAHFSRGFWG 141
             YSSVPITEPNVATGDVEVNGYRIAEGAMII NLRGAHFS GFWG
Sbjct:   305 YSSVPITEPNVATGDVEVNGYRIAEGAMIIXNLRGAHFSXGFWG 174

>IIAFP2D7832  translate frame +1 translate plus frames translate all frames
ACCACCCACAACAAAAAACCACAAAAACACCATCGGCCCNNCCAAATAAACNCCACCCCG
CTCACCCTCACACGCAACCCATACAACGCCCAGGTCAATTTTCTTTCNNNAAGGCAAGGG
GCGACCCACCNNATTGAGCCCCAGTCGAAGGCGCAACNAAATGAACCTCCCCCTCCCCAA
AACCCTNTNGAAAAATGNGCACCACGAAGATTNTTAATTATCATTGCACCTTCTGCAATT
CTATATCCATTCACTTCAACATCTCCTGTTGCTACATTCGGCTCAGTTATTGGTACTGAA
GAATAAATCTATATGATTCTACTATTCAAACTAACACAAAACTCATTCTTTGAAAAAACA
TACCAACACATATCAACACCAACATACATATATATATGGAGAAACACACATCCATTACAG
AAAAAACACATGGACATTCCGCACGATCAGCAAAAGACATATCATCCGCATTACTAACCT
CATCATTCCTCATGATTCCTTCATATATTCCATCCTGAATTCCAGGCCTATTAAACAACT
GACCAATCAAAAGACTAAACCCATTGGACGCTGAATCAACTGCCGCAAAAACAACATCAG
AAACCGAAAACAAGAACGAATCATAACGAACCGAACCATCCAAATAACTCTCATAACAAT
CCTCAACAAACGGAATACCGCTCAAGCCTCGAAAATGCCCCACCAACCAGGCCCCAATTC
CCAAATCCAATCTCCCTCTCCCAAGACTGAAAGCGCTAAACAACCAACCAAAGGCGCCCA
AAAACCCCAATAAAAAACACCTTTCCAAAAAAAAAACAACCTAACCCTAAGAACAAAACA
AACAACCCCATCCCCCCCTGGGAACAACACGGGAACCTTCTAAAAAACCCCAAAACGGGA
AAAAATAACAAAAAAAGGGGCAAAAAATAAAACCCACCAAGCAAAAACCAAAAAAAAATC
CCAAACCCAGCCCTCCAAAAAAAACGCCAAACCAAAAACCCCCTGCCAACAAAGGAACCA
AAAAAGGGGCCAAACAAACCCTCTTCAAAACCCAAAGGGGAAACTCAAAAACCAAAGTCA
CAAGCAAGAAACCCAAACGCGAACTGCGGCACCACCAAAAAGAAGCCTCCCACAACCCCA
AAAACAAAACCAAACATCTCAAAAAAAAGAAAGGAACCAAAACAGCCAAAATAAAACAAC
GGCGGACAACCACCCGAAAAAACACCCCAAAAACAACGGAACCCCCCC

>_4
                              LGRGRFIXLRLRLGLNXVGRPLPXXKKIDLGVVWVACEGERGGVYLXGPMVFLWFFVVGG
                              >_5
                              GEGEVHXVAPSTGAQXGGSPLAXXKEN*PGRCMGCV*G*AGWXLFGXADGVFVVFCCGWX
                              >_6
                              WGGGGSFXCAFDWGSXXWVAPCLXERKLTWALYGLRVRVSGVXFIWXGRWCFCGFLLWVV

CYP514A4 almost identical to 514A1 (7 aa diffs)
MNTIFTIILTITILVLSLILKDLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV
NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFLGKDYNIFS
SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA
VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS
YFFIKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI
LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY
PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA
LEFKPERFKNQPLNQKGLFHFGAGARACPGGRFTESFTFTFLVIMLKNFKIVNPTDIPID
VEGEVGLAMQCKPFDALFIKRN
ng2792        Contig_1093

>Contig_1093, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 13,682

  Plus Strand HSPs:

 Score = 1791 (635.5 bits), Expect = 2.2e-259, Sum P(4) = 2.2e-259
 Identities = 349/352 (99%), Positives = 350/352 (99%), Frame = +2

Query:    22 DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVFRLRLGSVEIV 81
             DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVFRLRLGSVEIV
Sbjct:  7610 DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVFRLRLGSVEIV 7789

Query:    82 VLTGPEVIDECFNKKHREIFKERYIKFSRFLGKDYNIFSSNGDYHYVLRGILTSEITTRK 141
             VLTGPEVIDECFNKKHREIFKERYIKFSRFLGKDYNIFSSNGDYHYVLRGILTSEITTRK
Sbjct:  7790 VLTGPEVIDECFNKKHREIFKERYIKFSRFLGKDYNIFSSNGDYHYVLRGILTSEITTRK 7969

Query:   142 LNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILAVKLILNFTLGIEENDETILII 201
             LNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILAVKLILNFTLGIEENDETILII
Sbjct:  7970 LNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILAVKLILNFTLGIEENDETILII 8149

Query:   202 VEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISSYFFIKDFIGIKLDAIKIKYEK 261
             VEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISSYFFIKDFIGIKLDAIKIKYEK
Sbjct:  8150 VEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISSYFFIKDFIGIKLDAIKIKYEK 8329

Query:   262 ENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSILFSISDIIFAAVDSTSNGFSL 321
             ENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSILFSISDIIFAAVDSTSNGFSL
Sbjct:  8330 ENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSILFSISDIIFAAVDSTSNGFSL 8509

Query:   322 LIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMNESYRY 373
             LIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMN +Y Y
Sbjct:  8510 LIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMNGNYPY 8665

 Score = 409 (149.0 bits), Expect = 2.2e-259, Sum P(4) = 2.2e-259
 Identities = 77/86 (89%), Positives = 81/86 (94%), Frame = +3

Query:   359 KYPYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGE 418
             K  +++  + ESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGE
Sbjct:  8712 KINFVLI*IIESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGE 8891

Query:   419 DALEFKPERFKNQPLNQKGLFHFGAG 444
             DALEFKPERFKNQPLNQKGLFHFGAG
Sbjct:  8892 DALEFKPERFKNQPLNQKGLFHFGAG 8969

 Score = 311 (114.5 bits), Expect = 2.2e-259, Sum P(4) = 2.2e-259
 Identities = 59/59 (100%), Positives = 59/59 (100%), Frame = +2

Query:   444 GARACPGGRFTESFTFTFLVIMLKNFKIVNPTDIPIDVEGEVGLAMQCKPFDALFIKRN 502
             GARACPGGRFTESFTFTFLVIMLKNFKIVNPTDIPIDVEGEVGLAMQCKPFDALFIKRN
Sbjct:  9044 GARACPGGRFTESFTFTFLVIMLKNFKIVNPTDIPIDVEGEVGLAMQCKPFDALFIKRN 9220

 Score = 94 (38.1 bits), Expect = 2.2e-259, Sum P(4) = 2.2e-259
 Identities = 21/21 (100%), Positives = 21/21 (100%), Frame = +2

Query:     1 MNTIFTIILTITILVLSLILK 21
             MNTIFTIILTITILVLSLILK
Sbjct:  7469 MNTIFTIILTITILVLSLILK 7531

CYP515A1 Seq 79b complete
MILGIILGLFIYIYLINIK (0)
FFNRAVPSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKYPHLSFKQLSDKYGKVFSLK
MGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGKYQGII (2)
FGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKKINQDNNI(0)
DMGPIFKRILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLF
EYLAKQPADFIPILKPFNNYKEIEKEYNNCLNFFQPLIDNILKNISDDDDDGN(2)
EPKCFLEYFISEIRKDTSNLIKITDLPYICFDIIVAGI(1)
VTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSIIKETLRISPP(1)
APFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDHFLSKDELNIGQ
CAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQL
NDENGHFTRSLSPFEFKSKLIIRK*
IIAFP1D72152
JC1a231d09.s1
JC1a254g04.s1
JC2a52d03.r1
JC2a204a05.s1
JC2b337a08.s1
JC2c56e03.r1
JC2c172c05.r1 
JC2c172c05.s1
JC2d33f05.s1
JC2d53c08.r1
JC2e77c08.r1
JC2e116h09.s1
c-JC2e13d10.s1
Dict-IV-V627e11.p1c
Dict-IV-V627e11.q1c
Dict-IV-V635h04.p1c

>Contig_1639, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 3429

  Plus Strand HSPs:

Query:    19 KFFNRAVPSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKYPHLSFKQLSDKYGKVFSL 78
             KFFNRAVPSSLLVGENKLKCKF SGPLILPIIGSLYKLSLKYPHLSFKQLSDKYGKVFSL
Sbjct:   620 KFFNRAVPSSLLVGENKLKCKFRSGPLILPIIGSLYKLSLKYPHLSFKQLSDKYGKVFSL 799

Query:    79 KMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGKYQGI--IFGNGSHWRKLKD 136
             KMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGKYQGI  ++   + +  LK 
Sbjct:   800 KMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGKYQGIM*VYNFKNKYLVLKK 979

Query:   137 ILS 139
             I++
Sbjct:   980 IIN 988

Query:   109 FHLPSFYYMGKYQGIIFGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKK 168
             F +   Y + KY  I FGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKK
Sbjct:   964 FSIKKNY*LIKYI*IRFGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKK 1143

Query:   169 INQDNNI 175
             INQDNNI
Sbjct:  1144 INQDNNI 1164

Query:   176 DMGPIFKRILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILK 235
             DMGPIFK+ILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILK
Sbjct:  1269 DMGPIFKKILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILK 1448

Query:   236 PFNNYKEIEKEYNNCLNFFQPLIDNILKNISDDDDDG 272
             PFNNYKEIEKEYNNCLNFFQPLIDNILKNISDDDDDG
Sbjct:  1449 PFNNYKEIEKEYNNCLNFFQPLIDNILKNISDDDDDG 1559


Query:   312 VTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSIIKETLRISP 368
             VTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSIIKETLRISP
Sbjct:  1853 VTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSIIKETLRISP 2023

Query:   370 APFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDHFLSKDELNIG 429
             APFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDHFLSKDELNIG
Sbjct:  2120 APFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDHFLSKDELNIG 2299

Query:   430 QCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTRSLSPFEFKSK 489
             QCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTRSLSPFEFKSK
Sbjct:  2300 QCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTRSLSPFEFKSK 2479

Query:   490 LIIRK 494
             LIIRK
Sbjct:  2480 LIIRK 2494


>Contig_1639, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion translate frame +1 translate plus frames translate all frames
TTTTTTAAAAAATTTTTTTAAAAAAAAGGTTTTTTTTGAAAAAAAAAAATGGGGGGAAAA
AAAATTTTTTTTTAAAAGGTTTAAAATTTTTAAAAAAAAAAAAAAAACCCAAAAAAAGGT
TTTGGGAAAAAAAAGGGAAGGAAAAGGGTTTAAACCCCCCCAAATTTTTTATTTTTTTTT
TCCAAAAAAATTTTTTTTAGGGGGGAAAAATTTTTATTTTTCCCCCCCCTTTTTTTTTTT
TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGGAAAAGGGGGGTTTTTTTTTGAAT
TAAACCCTTTTTTAAAAAGGGGAAACCCCTTTGGGCAAAAAAAAAAAATGTTTTTTTTTC
AAAAAGGAAGGGTTTTTTTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAGTTACTAATAAAATTTTAATAATTAAATTTTTAAAACTTAATTTAATTAATATTAA
AAAATGATATTAGGAATTATTTTAGGATTATTTATATATATATATTTAATAAATATAAAA
GTAAGCATTAATATAATATAATATAATATTAATAAAATTTTTAAATAAAAAATAATATTA
ATAAAATTTTTAATTTAAAAAGTTTTTTAATAGAGCAGTACCTTCATCATTATTAGTTGG
AGAAAATAAATTAAAATGTAAATTTCGAAGTGGTCCATTAATATTACCTATTATTGGTAG
CCTTTACAAATTAAGTTTAAAATATCCTCACTTATCATTCAAACAATTATCAGATAAATA
CGGAAAAGTATTTTCATTGAAAATGGGTTCAATTGATACAATTGTAATTAATGATATCAA
TTTTTTACAAAAATCATTTCGTGACAATCCAACCTTTTTTTCACAAAGATTTCATTTACC
ATCTTTTTATTACATGGGAAAATATCAAGGTATTATGTAAGTATATAACTTTAAAAATAA
ATATTTAGTATTAAAAAAAATTATTAATTAATTAAATATATATAAATTAGATTTGGTAAT
GGTTCTCATTGGAGAAAATTAAAAGATATTTTAAGTAGTTCAATTACTAAATCAAAATCT
CGTCAAATGGAAGAATTATTTTATAATGAATATTTTAAAGCTGAAGAATATTTATTAAAA 1140
AAAATAAATCAAGATAATAATATTGTAAGTTTTTATTATTTTTGAAAAAAATAAATAAAT
AAATAAATAAATATATATATATATAATTGATTATTAGTTTAATTTTTCACAAAAAAAAAA
AAAAAAAGGATATGGGTCCAATTTTTAAAAAAATTTTATTAAATATTTTATATAGATTTT
TATTTGGTGTATCATTTGAATATGATGATAATTTATTATCAAAAGAATTTTATTCATTTA
TCCAAAGCTATAATAAATTATTTGAATATTTAGCAAAACAACCAGCAGATTTTATCCCAA
TTTTAAAACCATTTAATAATTATAAAGAAATTGAAAAAGAGTATAATAATTGTTTAAATT 1500
TCTTTCAACCATTAATTGATAATATTTTAAAAAATATTAGTGATGATGATGATGATGGAA
AGTATGTATAAGTTATAATTCTAGTATGTGTTTAATTAAAATACTAAATTTTATTTTTTT
TTTTTTTTTTATTAAATAATAGTGAACCAAAATGTTTTTTAGAATATTTCATTAGTGAAA
TTCGAAAGGATACATCAAATTTAATTAAAATAACTGATCTTCCATATATATGTTTCGATA
TTATTGTTGCAGGTATAGGTAAATCTAATATTTATTTTTTCTTTTTTTCTTTTTTTTTTT 1800
TTTTTTTTTTTTTTCAAAAAATTAAAATTTATGATTCTAATTTTTTTTTAAAGTTACAAC
AAGTACAACTATGGATTGGATGTTATTATATTTAACAAATTATCCGAATATTCAAGAGAA
ATTATTTTTTGAAATAAACACACCAAACCATCCATTACATAAGGATAAATTACAATTCCC
ATATTTAAATTCAATTATTAAAGAAACTTTAAGAATTTCACCACGTAAATATTTATAATA 2040
CATTATTATAATTTATTATACATTTTGAAATAATTACTGAAAATTTTTATTTTTTTAAAA
AATCCATATATATATAGCAGCACCATTTGCATTACCACATATATGTACTGATGATATTGT
TATTGATGATATATTTATTCCAAAAAATACTCAAGTAATTCCAAACATATATGGATGTAA
TAGAAGTAATATTGAAAGCAGTGAATCTAACGTCTTCAATCCTGATCACTTTTTATCAAA
AGATGAATTAAATATTGGTCAATGCGCTTTTTCATTTGGTTCACGTCAATGTCCTGGTGC
CAATGTCGCTGATTCAATAATGTTTTTGGTATCAACTAAATTATATAAAACATTTAAATT
TGAGAGAACCACTACTCAATTAAATGATGAAAATGGTCATTTTACAAGATCATTATCCCC
ATTTGAATTCAAATCAAAATTAATTATTAGAAAATAATTTACTAATATTTTATTTAAAAA
AAAAAAAAAAAAAAAAAAAAATTGCTAAAAATACAATTAAAATAAAAGGGTTTTAAATTA
TCTATATTTGCCCCCAGATAGATATTTTATATTTTTTTAAATTATCTATTAAAGTTATAT
CTAAGATAAAATGATGTGTCGGATATCCAAAAAAAAAAAATAATAATAATAATAATAATA
ATCCGAAATAAATCGGGTTCACACAGATACTCTATTTCATTATTTGGATGAGGTTATTTG
AAATTTAAAATTAAGCACGCTATCAGTAATGTAATTGATCCTATTGGAGTTGTTGAAATG
GCAATAATTTCAATGTAGAGTAGAATTGAATGGAAACATAACAATAACTGGTTCACCTGT
ATTTGGACTATTTTCTTCAGTATTAACAATGACATATGTGTCTACTGAAAGACTTGGTAC
ATCATTACATGTTTCTTCACCGGTGATAAAAGATGGAGCAGTGTAGGTTTTTAGAATTAC
TGATGTTTGATTCAACCTCATGACCCTAACCAGTTGGAAGTGCTGTAATCAAATAGTTGC
AGGATTAAAGAATGAATATGAATCGGTTGTATCTGTAAGGGTGGGGACGAAAGGAGTCTT
ACTATCCGCATAGTAAATGAAAACTGGAATCTTTGCGAGTGGCATATCTTTTTGCTGTAC
CAAATTAACCACTGATTGTAATTGTACCGCTATTAATGATAATAACTTTATTGTTTAATT
GAATTACCTGGTGGAATTGAACCCATTTCCAAAAGCTATATTGTAAAACACCATGGTGTT
TGAAGGAATATCAGCTGATGTAATATTTAAGAGAAGTCTAGTTACATTTGGTAATTATTT
TTCATCATTGATATTATATCATCATTTTATTTAAACAAAAGTGTTTATTTTTAATTTTTA
AATTGATTT



>CYP515A1 dd_02444 chr2 genome assembly one exon differs this is probably correct
MILGIILGLFIYIYLINIKFFNRAVPSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKY
PHLSFKQLSDKYGKVFSLKMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGKY
QGIIFGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKKINQDNNI DMGPI
FKRILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILKPFNNY
KEIEKEYNNCLNFFQPLIDNILKNISDDDDDG NEPKCFLEYFISEIRKDTSNLIKITDL P
YICFDIIVAGIV TTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSII
KETLRISPPAPFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDHF
LSKDELNIGQCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTRS
LSPFEFKSKLIIRK

>c-JC2e13d10.s1_1 3245 letters
                              FRIIYIYRFNKYKSIINII*YNINKIFK*KIILIKFLI*KVF**STTFIIISWRK*IKM*
                              ISKWSINITYYW*PLQIKFKISSLIIQTIIR*IRKSIFIENGFN*YNCN**YQFFTKIIS
                              *QSTFFSQRFHLPSFYYMGKYQGIM*VYNFKNKYLVLKKIIN*LNIYKLDLVMVLIGEN*
                              KIF*VVQLLNQNLVKWKNYFIMNILKLKNIY*KK*IKIIIL*VFIIFEKNK*INK*IYIY
                              IIDY*FNFSQKKKKKGYGSNF*KNFIKYFI*ISIWCII*I***FIIKRILFIYPKL**II
                              *IFSKTTSRFYPNFKTI**L*RN*KRV**LFKFLSTIN**YFKKY*X
                              >c-JC2e13d10.s1_2 3245 letters
                              LGLFIYIDLINIKVSLI*YNIILIKFLNKK*Y**NF*FKKFFNRALPSSLLVGENKLKCK
                              FPSGPLILPIIGSLYKLSLKYPHLSFKQLSDKYGKVFSLKMGSIDTIVINDINFLQKSFR
                              DNPPFFHKDFIYHLFITWENIKVLCKYITLKINI*Y*KKLLIN*IYIN*IW*WFSLEKIK
                              RYFK*FNY*IKISSNGRIIL**IF*S*RIFIKKNKSR**YCKFLLFLKKINK*INKYIYI
                              *LIISLIFHKKKKKKDMGPIFKRILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLF
                              EYLAKQPADFIPILKPFNNYKEIEKEYNNCLNFFQPLIDNILKNISX
                              >c-JC2e13d10.s1_3 3245 letters
                              *DYLYI*I**I*KYH*YNII*Y**NF*IKNNINKIFNLKSFLIEHYLHHY*LEKIN*NVN
                              FQVVH*YYLLLVAFTN*V*NILTYHSNNYQINTEKYFH*KWVQLIQL*LMISIFYKNHFV
                              TIHLFFTKISFTIFLLHGKISRYYVSI*L*K*IFSIKKNY*LIKYI*IRFGNGSHWRKLK
                              DILSSSITKSKSRQMEELFYNEYFKAEEYLLKKINQDNNIVSFYYF*KK*INK*INIYIY
                              N*LLV*FFTKKKKKRIWVQFLKEFY*IFYIDFYLVYHLNMMIIYYQKNFIHLSKAIINYL
                              NI*QNNQQILSQF*NHLIIIKKLKKSIIIV*ISFNH*LIIF*KILV

Query:    11 IYIYLINIKFFNRAVPSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKYPHLSFKQLSD 70
             IYIY  N K+  + +PSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKYPHLSFKQLSD
Sbjct:     4 IYIYRFN-KY--KTLPSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKYPHLSFKQLSD 60

Query:    71 KYGKVFSLKMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGKYQGII-FGNGS 129
             KYGKVFSLKMGSIDTIVINDINFLQKSFRDNP FF + F    F      + +  FGNGS
Sbjct:    61 KYGKVFSLKMGSIDTIVINDINFLQKSFRDNPPFFHKDFIYHLFITWENIKVLCKFGNGS 120


CYP515A2P Seq 79 38% to 508A1 appears to be different from seq 79b
At DNA and protein seq levels missing some parts, exons 4,5,6,7
DMGPIFKRILLNILYRFLFGVSFEYDDNLLSK*FYSFIQSYNKLFEYLAKQ 45989
PADFIPILKPFNNYKEIEKEYNNCLNFFQPLIDNILKNIIDDDDDGI atgt (2) 45851
agn EPKCFLEYFISEIRKDTSNLIKITDL 45690

(1)VTTSTTMDWLLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSFIKETLRISPP(1)
APFALPHICTDDIVIDDIFIPKNTQVIPNLYGCNRSNIESS
EPNIFNPDRFLSKDESNIGQCAFSSGLRQCPGANVADSIMFLVSTKLYKTFKFERTTTQL
NDENGYLTKTLCPLEFKSKLIIRK*
JAX4a108h12.r1
JC1a254g08.r1
JC2a190g04.r1
JC2b80a12.s1
JC2b234d12.s2
c-JC2c22g11.s1 12873 letters
JC3a177b07.3764 183690 letters

>CYP515A2 dd_00373 chr2 genome assembly
MDWLLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSFIKETLRISPPAPFALPHI
CTDDIVIDDIFIPKNTQVIPNLYGCNRSNIESSEPNIFNPDRFLSKDESNIGQCAFSSGL
RQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGYLTKTLCPLEFKSKLIIRK


CYP515B1 Seq. 6 34% to seq 5
MATLLLVIIFMITFIIYKN (0)
FDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYG
KVFSMKFGSYDTVILNEPDVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFT
QTEYWKKIRGILNISLTKSKTRLLEGLFNQEYFRFDQFIKKQLKSKSDT (0)
MFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVVLSRMPS
DYIKVLRPFESHKKLKEICDRMSKFIKPRVDKKVELLDQD
NPKCFIDYLILQIRSDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYNEIC
KNTNTNTTATTTENDNESYFPKLIEKNNYPLFNACV
KETLRRSPPVPLGLPHLCSED TEIGGYLIPKGTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTV
TFSTGPRVCPGKNLSEDELFSFGTKLFK
TFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS* 
AU037975
AU071546
C91073
C92981
Contig170 chr 2
JC2a64f12.r1
SSB769
SSF410
SSH115
SSJ454 
AFI690
AFK630 
SFE728 

>IIAFP1D83520 45769 letters
        Length = 45,769

  Plus Strand HSPs:

 Score = 1725 (612.3 bits), Expect = 2.5e-257, Sum P(3) = 2.5e-257
 Identities = 334/342 (97%), Positives = 335/342 (97%), Frame = +1

Query:   152 FIKKQLKSKSDTMFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVV 211
             FI   +K K   MFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVV
Sbjct:  1243 FIFLNIKKK---MFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVV 1413

Query:   212 LSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKKVELLDQDNPKCFIDYLILQIR 271
             LSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKKVELLDQDNPKCFIDYLILQIR
Sbjct:  1414 LSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKKVELLDQDNPKCFIDYLILQIR 1593

Query:   272 SDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYNEICKNTNTNT 331
             SDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYNEICKNTNTNT
Sbjct:  1594 SDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYNEICKNTNTNT 1773

Query:   332 TATTTENDNESYFPKLIEKNNYPLFNACVKETLRRSPPVPLGLPHLCSEDTEIGGYLIPK 391
             TATTTENDNESYFPKLIEKNNYPLFNACVKETLRRSPPVPLGLPHLCSEDTEIGGYLIPK
Sbjct:  1774 TATTTENDNESYFPKLIEKNNYPLFNACVKETLRRSPPVPLGLPHLCSEDTEIGGYLIPK 1953

Query:   392 GTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTVTFSTGPRVCPGKNLSEDELFSF 451
             GTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTVTFSTGPRVCPGKNLSEDELFSF
Sbjct:  1954 GTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTVTFSTGPRVCPGKNLSEDELFSF 2133

Query:   452 GTKLFKTFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS 493
             GTKLFKTFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS
Sbjct:  2134 GTKLFKTFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS 2259

 Score = 756 (271.2 bits), Expect = 2.5e-257, Sum P(3) = 2.5e-257
 Identities = 145/155 (93%), Positives = 148/155 (95%), Frame = +1

Query:    10 FMITFIIYKNFDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGKVFSMKFG 69
             F   +  +  FDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGKVFSMKFG
Sbjct:   697 FFFFYFFF**FDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGKVFSMKFG 876

Query:    70 SYDTVILNEPDVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNIS 129
             SYDTVILNEPDVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNIS
Sbjct:   877 SYDTVILNEPDVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNIS 1056

Query:   130 LTKSKTRLLEGLFNQEYFRFDQFIKKQLKSKSDTM 164
             LTKSKTRLLEGLFNQEYFRFDQFIKKQLKSKSDT+
Sbjct:  1057 LTKSKTRLLEGLFNQEYFRFDQFIKKQLKSKSDTV 1161

 Score = 90 (36.7 bits), Expect = 2.5e-257, Sum P(3) = 2.5e-257
 Identities = 19/19 (100%), Positives = 19/19 (100%), Frame = +3

Query:     1 MATLLLVIIFMITFIIYKN 19
             MATLLLVIIFMITFIIYKN
Sbjct:   432 MATLLLVIIFMITFIIYKN 488

CYP516A1 seq 9, 43 complete seq 39% to 10, 42; 30% to 508A1
MIILLLSIIIFILYIVKI (0)
FKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMFKYGNVMTVYFANIKTVV
ISDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQYWKNYRQILAQSFLKLRDHKGITEKI
SLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVIMQLLYSHRSSYDLKGQHPIIDA
LKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQVWSYSQDSIKEHREKLLINPEK (2)
IDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTETSSSTMSWLLL
FLINNPNFQDKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRPVGALSLPRIAS
EDVTCGPYTIE KGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQSTYSFIPFGCGS
RVCVGSSLARDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDYNVKLTKI*
C90082
Contig14304 Chr 6
IIAAP1D3115
IIAAP1D3581
IIAAP1E3107 N-term
LPPLPISLPFIGNLHQLAIDPHLAIQKLMFKYGNVMTVYFANIK
IIACP2D2392 95% to IIAAP1E3107
LPPGPISLPFIGNLHQLAIDPHLAIQKLMFKYGNVMTVYFAHIK
IIADP1D2090
IIAFP1D1377
JAX4a92g02.s1
JAX4b47f08.r1 91% to IIAAP1E3107
PFIGNLHQLSIDPHLSIQKLMFKYGNVMTVYFPNIK
JC1b185h09.r1 
JC1c288g05.s1
sdic6A3g4.p1c
sdic6A46b12.p1c
sdic6A90a2.q1t
sdic6Ma11.p1ca
sdic6Td4.p1c
SSG513

>Contig_3767, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 15,392

  Minus Strand HSPs:

 Score = 1263 (449.7 bits), Expect = 4.7e-253, Sum P(3) = 4.7e-253
 Identities = 243/243 (100%), Positives = 243/243 (100%), Frame = -1

Query:    19 FKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMFKYGNVMTVYFANIKTVVI 78
             FKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMFKYGNVMTVYFANIKTVVI
Sbjct: 15077 FKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMFKYGNVMTVYFANIKTVVI 14898

Query:    79 SDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQYWKNYRQILAQSFLKLRDH 138
             SDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQYWKNYRQILAQSFLKLRDH
Sbjct: 14897 SDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQYWKNYRQILAQSFLKLRDH 14718

Query:   139 KGITEKISLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVIMQLLYSHRSSYDLKGQHP 198
             KGITEKISLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVIMQLLYSHRSSYDLKGQHP
Sbjct: 14717 KGITEKISLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVIMQLLYSHRSSYDLKGQHP 14538

Query:   199 IIDALKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQVWSYSQDSIKEHREKLLIN 258
             IIDALKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQVWSYSQDSIKEHREKLLIN
Sbjct: 14537 IIDALKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQVWSYSQDSIKEHREKLLIN 14358

Query:   259 PEK 261
             PEK
Sbjct: 14357 PEK 14349

 Score = 1178 (419.7 bits), Expect = 4.7e-253, Sum P(3) = 4.7e-253
 Identities = 226/227 (99%), Positives = 227/227 (100%), Frame = -1

Query:   261 KIDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTETSSSTMSWLLLFLINNPNFQ 320
             +IDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTETSSSTMSWLLLFLINNPNFQ
Sbjct: 14252 RIDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTETSSSTMSWLLLFLINNPNFQ 14073

Query:   321 DKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRPVGALSLPRIASEDVTCGPYT 380
             DKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRPVGALSLPRIASEDVTCGPYT
Sbjct: 14072 DKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRPVGALSLPRIASEDVTCGPYT 13893

Query:   381 IEKGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQSTYSFIPFGCGSRVCVGSSLA 440
             IEKGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQSTYSFIPFGCGSRVCVGSSLA
Sbjct: 13892 IEKGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQSTYSFIPFGCGSRVCVGSSLA 13713

Query:   441 RDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDYNVKLTKI 487
             RDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDYNVKLTKI
Sbjct: 13712 RDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDYNVKLTKI 13572

 Score = 79 (32.9 bits), Expect = 4.7e-253, Sum P(3) = 4.7e-253
 Identities = 18/18 (100%), Positives = 18/18 (100%), Frame = -3

Query:     1 MIILLLSIIIFILYIVKI 18
             MIILLLSIIIFILYIVKI
Sbjct: 15219 MIILLLSIIIFILYIVKI 15166

>JC3f71h11.s1 Clone JC3f71h11, standard read, bases 110 through 740, from
            2002-11-15
        Length = 629

  Plus Strand HSPs:

Query:     1 MIILLLSIIIFILYIFK 17
             MIILLLSIIIFILYI K
Sbjct:    62 MIILLLSIIIFILYIVK 112

Query:    16 FKNKSCCGNEIILPPGPISLPFIGNLHQLAI 46
             FKNKSCCGNEIILPPGPISLPFIGNLHQLAI
Sbjct:   204 FKNKSCCGNEIILPPGPISLPFIGNLHQLAI 296

CYP516A2P Seq 64 70% to seq 9 probable pseudogene fragment
KMAREDGTWGPXXXRGGQIKKERVRGLGRGPTVWGGPETFLPXXXXXXXXXXXXX
SVIPCGWGSRGWGGSSLAREEILVGMGNV*LNYIWESQKGKPIKEKGHLGSALQTGDYNVKVTKI
IIAAP1E3541

CYP516B1 seq 10, 42 complete seq 39% to 9, 43
MYLILSLIIFLAYVA (0)
FHKKRTNGMPPGPFPLPIIGNLHQLGKSPYKSLKSF
SDKYGGLTTIFLGSVPTVLISEPNILREIIIKNNDSIIDRYISDSGLIIG
GERNLLFSKGSFWIKYRKIFSSAMTNARKFNIASRIEQQA
ISLNNYFGTYANSKQA (0)
INPHDYIRRYSLNGVIDYSFSDSVEYESDTHHIVIRAAEIMEEILATGNPHDYLPFLKPF 868
YTKKRNTLAMAVGQVWDYCNDAITVHRKTLDHEKPRDLLDLILMEIEKSEE 1021
KQFYDDDSLSKCLTDLIVAGHETVAITLGWMILFLSNHQDVQQKVYDELINVVGKGN 1192
LPALVHRKDTSYLNACIQETMRIRTAAPLALPRIASEDIKVGG
YTIPKGTQVMMSVYGMASDERYWKDPHIFNPERWLSSNHSTENGGGGGGVVGNSSQSEV
FIPFGVGPRMCVGMGVAKDELYYCASQMFMNFKWSPVNDKPIDDEGVARIALEYKEYQVVLERRK*
C25607
Contig11546 Chr 2
JAX4a161d08.s1 N-term
c-JAX4a161d08.s1
JAX4a161d08.r1
JAX4b34h10.r1 90%
JAX4b38d02.s1 90%
JAX4d06b12.s1
JC1c262b03.s1
JC2b265h11.s1
JC2b383g02.s1
JC2b388b09.r1 
JC2d19a07.s1
JC2c86c05.r1
JC2c92b06.r1 poor quality
JC2d19a07.s1
SSA260

>Contig_3853, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 8577

  Plus Strand HSPs:

 Score = 1759 (624.3 bits), Expect = 5.2e-260, Sum P(3) = 5.2e-260
 Identities = 334/337 (99%), Positives = 335/337 (99%), Frame = +2

Query:   156 QAIN-HDYIRRYSLNGVIDYSFSDSVEYESDTHHIVIRAAEIMEEILATGNPHDYLPFLK 214
             Q IN HDYIRRYSLNGVIDYSFSDSVEYESDTHHIVIRAAEIMEEILATGNPHDYLPFLK
Sbjct:   683 Q*INPHDYIRRYSLNGVIDYSFSDSVEYESDTHHIVIRAAEIMEEILATGNPHDYLPFLK 862

Query:   215 PFYTKKRNTLAMAVGQVWDYCNDAITVHRKTLDHEKPRDLLDLILMEIEKSEEKQFYDDD 274
             PFYTKKRNTLAMAVGQVWDYCNDAITVHRKTLDHEKPRDLL+LILMEIEKSEEKQFYDDD
Sbjct:   863 PFYTKKRNTLAMAVGQVWDYCNDAITVHRKTLDHEKPRDLLNLILMEIEKSEEKQFYDDD 1042

Query:   275 SLSKCLTDLIVAGHETVAITLGWMILFLSNHQDVQQKVYDELINVVGKGNLPALVHRKDT 334
             SLSKCLTDLIVAGHETVAITLGWMILFLSNHQDVQQKVYDELINVVGKGNLPALVHRKDT
Sbjct:  1043 SLSKCLTDLIVAGHETVAITLGWMILFLSNHQDVQQKVYDELINVVGKGNLPALVHRKDT 1222

Query:   335 SYLNACIQETMRIRTAAPLALPRIASEDIKVGGYTIPKGTQVMMSVYGMASDERYWKDPH 394
             SYLNACIQETMRIRTAAPLALPRIASEDIKVGGYTIPKGTQVMMSVYGMASDERYWKDPH
Sbjct:  1223 SYLNACIQETMRIRTAAPLALPRIASEDIKVGGYTIPKGTQVMMSVYGMASDERYWKDPH 1402

Query:   395 IFNPERWLSSNHSTENGGGGGGVVGNSSQSEVFIPFGVGPRMCVGMGVAKDELYYCASQM 454
             IFNPERWLSSNHSTENGGGGGGVVGNSSQSEVFIPFGVGPRMCVGMGVAKDELYYCASQM
Sbjct:  1403 IFNPERWLSSNHSTENGGGGGGVVGNSSQSEVFIPFGVGPRMCVGMGVAKDELYYCASQM 1582

Query:   455 FMNFKWSPVNDKPIDDEGVARIALEYKEYQVVLERRK 491
             FMNFKWSPVNDKPIDDEGVARIALEYKEYQVVLERRK
Sbjct:  1583 FMNFKWSPVNDKPIDDEGVARIALEYKEYQVVLERRK 1693

 Score = 736 (264.1 bits), Expect = 5.2e-260, Sum P(3) = 5.2e-260
 Identities = 142/144 (98%), Positives = 144/144 (100%), Frame = +1

Query:    16 FHKKRTNGMPPGPFPLPIIGNLHQLGKSPYKSLKSFSDKYGGLTTIFLGSVPTVLISEPN 75
             FHKKRTNGMPPGPFPLPIIGNLHQLGKSPYKSLKSFSDKYGGLTTIFLGSVPTVLISEPN
Sbjct:   172 FHKKRTNGMPPGPFPLPIIGNLHQLGKSPYKSLKSFSDKYGGLTTIFLGSVPTVLISEPN 351

Query:    76 ILREIIIKNNDSIIDRYISDSGLIIGGERNLLFSKGSFWIKYRKIFSSAMTNARKFNIAS 135
             ILREIIIKNNDSIIDRYISDSGLIIGGERNLLFSKGSFWIKYRKIFSSAMTNARKFNIAS
Sbjct:   352 ILREIIIKNNDSIIDRYISDSGLIIGGERNLLFSKGSFWIKYRKIFSSAMTNARKFNIAS 531

Query:   136 RIEQQAISLNNYFGTYANSKQAIN 159
             RIEQQAISLNNYFGTYANSKQA++
Sbjct:   532 RIEQQAISLNNYFGTYANSKQAVS 603

 Score = 69 (29.3 bits), Expect = 5.2e-260, Sum P(3) = 5.2e-260
 Identities = 15/15 (100%), Positives = 15/15 (100%), Frame = +1

Query:     1 MYLILSLIIFLAYVA 15
             MYLILSLIIFLAYVA
Sbjct:    13 MYLILSLIIFLAYVA 57

CYP516B2P seq 48 72% to 10 no ESTs
MFLILSLIIFWAFVA bad AG boundary
FHKKRPKGLPPGPFPFPILGNLHQWGRSPFKSLKSFSAKFGGWPPIFWGG
VPPVLIGDPNFLREIFIKN
sdic6A62f2.p1c N-term
sdic6A62f3.p1c
FHKKGTQGFPPGPFPFPI

CYP517A1 Seq. 3 complete 37% to 508 483 aa
MEIINVFLFLIILFLVKDF (0)
VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELF
GGYSKKYNGVVRAWFGE RLFFFVSNYDVVKYFQKDENFHNRPSVLVPGWRYASSNGLGVM
SSSDDKWKRAKSSVSQSLRVRTTKKLMEEKAIEF
IDSLEKISNNNEI (0)
FYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAF
DCFEIFSPLYDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEI
TKEDTMQINQICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNL
NHKQNAPYIVAFIKETMRLCSNG FGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEI
FKNAKEFNPTRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKSQNGE
KIDDTIHFSVSLKAKDYGIKLEKRI*
AU071503
AU071705
C84055
Contig14028 chr 2
IIAFP2D15468
JAX4a13e01.r1  92% to seq 3
VFKYMFNQDLSVESGMSRTIGNAAEHVFGNLSKLTAFDWFEIFSPLYDWLFTRRLKGCDIARQIIS
JAX4a15h01.s1
JAX4a27h12.s1
JAX4a82d03.r1
JAX4a82d03.s1
JAX4a150b03.s1
JAX4a207g03.s1
JAX4b33d01.r2 (formerly seq 61) actual translation
KLFLFLSNYXVXXYFQKXKNFLYKPSLLXPXWXYPSXNGLXVXTSSXDQW
KKPKSXVSQSLKLHTSKKLMEKKXIEFIDSLXKISNNNEI (intron)
Only 12 aa diffs in 90 to seq 36 and 14 Xs
16 diffs with seq 3 plus 14Xs but QS matches where ** is in seq 36
and there is no 1 aa deletion as in seq 36   so this is more like seq 3
This is probably a poor seq version of seq 3
JAX4d06c02.r1
JAX4d06c03.s1
c-JAX4d09b12.s1
JC1a221g07.s1
JC2a129a01.s1
JC2a188c06.r1
JC2b115g01.r1
JC2b115g01.s1
JC2e14a08.s1
sdic6A83g9.p1c CHR 6
sdic6A76c7.q1t MEIINVFLFLIILFFGKRF N-term
SSB673 
SSC317
SSC561
SFF882
CFF730
CFI212
SFA813
Exact match to contig 1803

>Contig_1803, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 8010

  Plus Strand HSPs:

 Score = 1669 (592.6 bits), Expect = 2.7e-247, Sum P(3) = 2.7e-247
 Identities = 316/316 (100%), Positives = 316/316 (100%), Frame = +2

Query:   168 FYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPL 227
             FYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPL
Sbjct:  4304 FYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPL 4483

Query:   228 YDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQIN 287
             YDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQIN
Sbjct:  4484 YDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQIN 4663

Query:   288 QICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYI 347
             QICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYI
Sbjct:  4664 QICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYI 4843

Query:   348 VAFIKETMRLCSNGFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNP 407
             VAFIKETMRLCSNGFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNP
Sbjct:  4844 VAFIKETMRLCSNGFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNP 5023

Query:   408 TRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKSQNGEKIDDTIHFS 467
             TRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKSQNGEKIDDTIHFS
Sbjct:  5024 TRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKSQNGEKIDDTIHFS 5203

Query:   468 VSLKAKDYGIKLEKRI 483
             VSLKAKDYGIKLEKRI
Sbjct:  5204 VSLKAKDYGIKLEKRI 5251

 Score = 563 (203.2 bits), Expect = 2.7e-247, Sum P(3) = 2.7e-247
 Identities = 109/126 (86%), Positives = 114/126 (90%), Frame = +3

Query:     3 IINVFLFLIILFLVKDF-----VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEH 57
             I+  +  L+IL++ K       VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEH
Sbjct:  3723 ILENYYNLLILYIKKKKNKKKKVKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEH 3902

Query:    58 ELFGGYSKKYNGVVRAWFGERLFFFVSNYDVVKYFQKDENFHNRPSVLVPGWRYASSNGL 117
             ELFGGYSKKYNGVVRAWFGERLFFFVSNYDVVKYFQKDENFHNRPSVLVPGWRYASSNGL
Sbjct:  3903 ELFGGYSKKYNGVVRAWFGERLFFFVSNYDVVKYFQKDENFHNRPSVLVPGWRYASSNGL 4082

Query:   118 GVMSSS 123
             GVMSSS
Sbjct:  4083 GVMSSS 4100

 Score = 210 (79.0 bits), Expect = 2.7e-247, Sum P(3) = 2.7e-247
 Identities = 43/43 (100%), Positives = 43/43 (100%), Frame = +2

Query:   125 DKWKRAKSSVSQSLRVRTTKKLMEEKAIEFIDSLEKISNNNEI 167
             DKWKRAKSSVSQSLRVRTTKKLMEEKAIEFIDSLEKISNNNEI
Sbjct:  4103 DKWKRAKSSVSQSLRVRTTKKLMEEKAIEFIDSLEKISNNNEI 4231

 Score = 95 (38.5 bits), Expect = 6.8e-198, Sum P(3) = 6.8e-198
 Identities = 20/20 (100%), Positives = 20/20 (100%), Frame = +1

Query:     1 MEIINVFLFLIILFLVKDFV 20
             MEIINVFLFLIILFLVKDFV
Sbjct:  3646 MEIINVFLFLIILFLVKDFV 3705

>Contig_1803, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion translate frame +1 translate plus frames translate all frames
AAAAAAAAAATATAGCTTTAAATTTAAAATTTTATATTAAGAAAAATGGAAATAATTAAT 3660
GTTTTTTTGTTTCTAATCATTCTTTTCTTGGTAAAAGATTTTGTATGTATTTAAATTTTT 3720
ATATTTTAGAAAATTATTACAATTTATTAATTCTTTACATAAAAAAAAAAAAAAATAAAA 3780
AAAAAAAGGTTAAAAAAAATAAGAAAATTCATACAAAATCTCCAAGTGGACCAATTGCAT
TTCCTATTCTTGGAAATGTTGTTCAAATTAGATTTTGGGAATTATTCAAAATTCAAGAAC
ATGAATTATTTGGTGGTTATAGTAAAAAATATAATGGTGTTGTAAGAGCATGGTTTGGAG
AAAGATTATTTTTTTTTGTTTCAAATTATGATGTTGTAAAGTATTTTCAAAAAGATGAAA 4020
ACTTCCATAACAGACCATCAGTTTTAGTTCCAGGTTGGAGATATGCTTCAAGTAATGGGC
TAGGTGTAATGAGCTCATCTATGACAAATGGAAAAGAGCAAAATCAAGTGTTTCGCAATC 4140
ATTAAGAGTTCGTACAACTAAAAAATTAATGGAAGAAAAAGCAATTGAATTTATTGATTC
ACTTGAAAAAATTTCAAATAATAATGAAATTGTAAGTTTTTAAAAATAATAATTACAATC 4260
AAAATATTGATAACATATTAATTTATTAAATTTTTATTTTTAGTTTTATCCAAAAGGACA 4320
TATTCAAGGATATGCTTGTTCAATGTTATTCAAATATATGTTTAATCAAGATTTATCAGT
TGAAAGTGGCATGTCAAGAACTATTGGTAATGCAGTTGAACATGTTTTTGGTAATCTTTC
AAAATTAACTGCATTTGATTGTTTTGAAATTTTCTCACCACTTTATGATTGGTTCTTTAC
AAGAAGATTAAAAGGTTGCGATATCGTTAGACAAATAATCAGTAGTCAAAATGAAAATCA
TTTAAAGTCAATTGATCCAAGTAAACCAAGAGATTTAATGGATGATTTGTTAATTGAATA
TGGATTAAATGAAATCACTAAAGAAGATACAATGCAAATCAATCAAATTTGTTTTGATAT
TTTTGGACCAGCCGTTGGTACAGTTACAATCACAATGAATTGGGTAATTTTACAATTATG
TAATCGTCCAGAACTTCAAGAGATTGCATATCAAGAAATTAAAAAAGCTGTCAAAGATGA
TGAATATGTCAATTTGAATCATAAACAAAATGCCCCTTATATCGTTGCCTTTATTAAAGA
AACAATGAGACTTTGTTCAAATGGATTTGGTTTACCAAGAACTGCTAAAAATGATCAAAT
TTGTGGTGATTTTTTCATTCCAAAAGATGCTATTATTTTTATTAATTATTTAGAAATTAG
TCAAAATGAAGAAATCTTTAAAAATGCCAAAGAATTTAACCCAACTCGTTATTTAGATGA
ATCACTTCCTGTACCAAATATTCACTTTGGTGTTGGTCAAAGAGCATGTCCTGGTCGTTT
TGTCGCAATCGATAAAATGTTTCTTGGAATTTCAAATTTACTTTTAAAGTATAAATTAAA
ATCTCAAAATGGTGAAAAAATTGATGATACAATTCATTTTAGTGTTTCTTTAAAAGCAAA
AGATTATGGAATAAAATTAGAAAAGAGAATTTAGAACAATAAAAATAATTGTATTTAATT
CAAATTATTATCCCAATTCATTGATTTAAATAATTATTATTAATAAAGACTTTTTTTAAA
AAAAAAATTATTTTTTTATTTATTTATTTATTTATTTATTTATTTATTTATTTATCTATT
TATTTATTTTTTAACCCTATTTTTTTTTTTTTTTTTTATATTTTCATTTTTGCTAACAAA
TGTGTAATCTATTAAAAAGAGATAATTTGAAATTATTATTTTTAATATTTATTATAGAAA
TATATAATAATAAAAATAATAATAAAAATTTCGCACAATCATTGTTATTATTATTTCTAT
AATAAATAGGGAATTTAATTCAAATAGCCTTTGTGGATAACTAAATATGTTATGTTGAAC
TTCATTAATTTATAAAAATAAAAAAAAAAAAAAAAAAAAAAAAAAATAAAAACAAATTAA
GAATTTCCAACTTTATTAATAAAAAATATAATATCGGTAAAGTTGAAGGGGTATTTAAAA
AATTTCAACATTGCTCGACCCCTCTTCAACTTAATCGGTATTTAATTTTTTTTTTTTTAC
TCTATGTTGTTAGTGGTAAAATTAAAATAAAAATAAAAAAATAAAAATAAAAATAAAAAT
AAAAATAAAAATAAAAATAAAAACAATGGTGTTGTTTAAAGATTTAAGTTTTTTTTTGTT
TTTTTTTTTTTTTTTTAATACATATTTTTTAAAATTCTAATACAAATAAAATAAATGTCA
AATACTTCAAGTGATAATTATAAAGTTTGGTTCATAACTGGAGTTACAGGTGGAGCAGGT
AAAGCATTAGCATCTAGATTATTAGAAATTGGAGGTTATAAAGTTGCAGGTACATCCAGA
AGTAAAGAGAAATTAAATTCTTTAGATGGTAGTTTAATATCAAATAAGGATTTCCTTAGA
TTACAAGTAAATTTAATTGATGAGAAAAATGTAAAGGAATCAATTGAAGAAACCATAAAG
AAATTCGGTAAAATTGATATTATTGTAAATAATGCCGGACAATCAATATTTGGTAGTGTT
GAAGAGATTACTGATAAAGAACAAAGGTTATTAATGGATGTATTGTATTTCGGTCCTTGT
AATGTAATTCGTTCAGTTTTACCTCATTTTAGAAAGAGAAAATCCGGTTTAATCTTCAAC
ATCTCTTCAGTTATTGGTGGTGATCCTACCAATAATGTACCCTTCTTTTCTGGATATTGT
GCTGGTAAATCTGCAATTCACTCAATGACTCAATGTCTCAGGGAAGAGGTTAAACAATTT
AATATTAATGTGTGTTGCATCCTTTTAGGTTATTTAAAAACCAGTTTCAGTAGTCCATTA
ACAACTAATCAAATTAAAGACTATAATCTTAAAGAGAAAGCCAATGATATGAATAAACAA
TTCCAAAATAATACACCAGAATCACCAATTAATTTTGCTAATTTTATCATTGAAAATTCA
AAACATTCAAATATTCCCCAAACAATCCAATTTGGTAAAAATCCATACCATAATCCAAAT
AATTCAAATAATCCAAATAATAATAATAAACCATCTTATCCTCATCAAAAAGTAAGAGTT
AATCAAGATCTTCCAAAGCAATTTGATAAAGATATTGCAAATTTACCAGAATTAACTGAT
GGTAATATTCATATGCCTTATATTTTACATGAATAAATAAAATCTTATTTCATAAAGATA
TAAATTAAATAAATTGGTCCCCACAATTTCTATTTTTTTTTTTTTAATGTTTTTAATTAT
ATTCCTTTTTTATCAATTTAATGTAAAGAAACCATCATTTATTCATTTCAATGTATTATT
AAATATTTATTTAAATAGTATTATTTATTTATTTATTATTATAAATTAATTATTCTAAAT
ATAGTAAATTAGTTATTATTATATAATTATATTTCTTAAAAATAGTAATATAAAATTAAT
ATAATATAATATTAATATAAAATTATTTATTTCCAAAGATTATTATATTCAATTAAAAAA
TTAATAATCTTATTAATTTTTTAGTTAATTCTATCAACTCATTTTTAATAGTATTAATTA
AAATTTTTATTTTTATTTTTATAATATAAACAAATAAAAGAATAAAAAGATAAAATTCAA
TATTAAATTATATAAATTTCAAAGATAATAATTATAAAGGAAGACCACTTCTGTTAAAAT
AAGTAATATAAGTTATTATTTCTTTACAGTTATTATTTTTTTTTTTTTTTTTTTTTTTTT
AAAAAATTAATATTTATATAAAATATATTCCCTACTCTTTTTTCTTCAAAATAAAAAAAT
AATTACAAAAAATTAACCATAATTAAAAATAAAAGAAGGGATTTGAAATGAATGTGTCAT
ATGACGATCGATTTCTTTGGTTTTTATTTTTTATTTTTTTTATTTTTTATTTTTAATTTT
CAAATTGTTTTGGTTAATATGGTGGTGTTTTGTGAAAATTATTATTTGAAAAAAAAAAAA
TAGTCGCAACACGAAGAAAAAAAATTAAAAAAAAATTAAAAAAAAAATTAAAATTTAAAA
TTAAAAAAAAAAAATTATTAAAAAAAATTAAAAAAAATTAAAAAAAAAAAAAAATTTTGA
AAATAATTTTTTTTTAATTGCACATAAAATTTTAACCCCCCCCCCCCCAGATCCAATAAT
TTTCTTGTTTTTTAAATTTTTAAAAACAAATAATAAACAAATAATTTTTTTAAAAAAAAA
AAAAATAAAAATTTTAAAAGAGCTACTTTT


CYP517A2 SEQ 74 66% to seq 3
MRILIIIILIIIVFLVKDT (0)
IKKNKKVNSKSPCGPFAFPILGNIIQYFFYQILNIKEHSIIERYSRKYDG
ITRIWFGDIFILYVSNYEIVKCFQKEENFFDRPSTFVPTWRYMSSNGGGI
MSSNDEKWKRAKTTFLKSLKIHGKKYLIEKKSIEFVNSIEKFSNSNQV (0)
FYPKQYSQGFTSSIFFKYMFNEDISIDNKFLKEIGTAVGMVFTKNSHLT
VFDCFGILSPFYDLFFKFRLRPIEILK
KTIDKQLTNHLNSMDSKMGDHQSRDIMDDLLIEYGSLNEISNQDRIQINQ
ICFDVMSTDIGTVATTIDWVLLQLCNRQDLQEIICNEIQDTIKIKRNNVI
NDGGGGADTDNLFINLCDKQSIPYLIAFIKETMRVFSNGWSLPKTSKHDQ
ICANYFIPKGSILFINYFSIHLNEEFFKNPREFNPARYLDDSIPIPDLHF
GIGQRGCPGRFVAMDQVFLCIANTLLKYKIKSIDGKKIDDTIQFSVYLKP
KDFGILLEKRNKLFVND*
Contig16783  Chr 6
IIAFP1D11203
IIAFP1D19488
IIAFP1D53751
IIAFP1D67548 (may be a pseudo gene)
IIAFP1D76983  
sdic6A65c10.p1c 
Dict-IV-V896c06.plc
Contig_4323

CYP517A3P Seq 36 pseudogene 77% to seq 3 77% to seq 61 
MEIVNV (frameshift)
FIILIILFLVKDF (0)
VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHEL (10 aa deletion)
IVRAWIGERLFLFVSNYDVKYFQKDENFLYKPSLLVPGWRYASSNGLGVMSSSDDEWKRAKSS
VS**LRVHTSKKLMEEKAIEFIDSLEKISNNNEI (0)
FYPKGHIQGYACSMLFKYMFNQDLSVESGLSRTIGNAVEHVFGNLSKLTAFDCFE
IFSPLYDWFFTRRLKGCDIVRQIISSQNENHFKSIDPSKPRDLMDDLLIEYGLNEITKEDTMKINQICFDIFGPAIG
TVTITMNWVILQLCNRPEPQEIAYLEIKKAVKDDEYVNFNHKQNAPYIVAFIKETMRLCS
NG

JC2e10h11.s1 matches pseudogene seq
c-JC2e10h11.r2 matches pseudogene seq
JC1a286e01.r1 matches pseudogene seq
JC2d93e02.s1 matches pseudogene seq
JC2e75c06.s1 matches pseudogene seq
JC2a172b05.r1 matches pseudogene seq
JAX4a222c08.r1 matches pseudogene seq
JC2b182g05.s1 matches pseudogene seq
SFG734 seq matches pseudogene except it has the insert GNYSKKYNGV that 17A3P is missing. 4 diffs with 17A1
SFF555 seq matches pseudogene except it has the insert GNYSKKYNGV that 17A3P is missing. 4 diffs with 17A1

CYP517A4 13 aa diffs with 517A1 ng5440 exact match to Contig_2215
MEIVNVLLFLIILFLVKDFVKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELI
GNYSKKYNGVVRAWIGERLFLFVSNYDVVKYFQKDENFLYRPSLLVPGWRYASSNGLGVM
SSSDDEWKRAKSSVSQSLRVHTSKKLMEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACS
MLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCD
IVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQINQICFDIFGPAVGT
VTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSN
GFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNPTRYLDESLPVPNK
HFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKTQNGEKIDDSIQFSVSLKAKDYGIKLE
KRI

CYP518A1 Seq. 20 38% to CYP508 complete no short first exon
MSILIILIISIIFYLIFDFLYKNFSNRQFKGPLALPLVGSLLHLKDDTHL
VFQNDSKLYKDGDNNGKIIKYWFCDQLTLAIYDTNTMKEIYLKNPESLNTRVKSPSTNVIGNRFRGIVTADENYWQF
HRDILMKSFTGRKVKSLSSSIEKETIDLITYMKFIEKSGQS (0)
FSPRSNFMNFYSNIIFDYVFSRRIENIYEGVNEEQGKVLLAIRELFDYLADTLIVNYLIFTK
PFYFLYLKMFGHPADSLKKILTKYYLEHFESIDLNNARDVLDSLI
IEYRKVGGKEEQSSIIPMVNELILAGTETNSSTAEWFILTMVNNLDYQDKIYNELKSTLE
TTTAMIKLSNRNQTPLFNAALKEVLRLYPPVPFGVPRQVNQSFEINGGSLKIPKGTQIIQ
SLYSIFRDENYWDSPDQFKPERFLDQDSHSNNYFPYGIGVRNCIGMGFSQDELYISLSNL
VLNFKLLPLIENSKICDKPIFGFSFKPNEFKINLEKRNN*
FC-BG13
IIAFP1D6284
IIAFP1D35357
IIAFP1D75459 
JC2b232e02.r1
JC2c81c10.r1
JC2c81c10.s1 
VFB214
VFJ528
JC3a13b12.s1

>JC3a13b12.s1 183648 letters
        Length = 183,648

Query:     1 MSILIILIISIIFYLIFDFLYKNFSNRQFKGPLALPLVGSLLHLKDDT 48
             MSILIILIISIIFYLIFDFLYKNFSNRQFKGPLALPLVGSLLHLKDDT
Sbjct: 66633 MSILIILIISIIFYLIFDFLYKNFSNRQFKGPLALPLVGSLLHLKDDT 66490

CYP518A2P Seq 25, 27, 29, 30  62% to seq 20 pseudogene two in frame stops and a frame 
shift missing I-helix
PFYFIYSKIFGHPSDPLKNIINKYY*EHCETIYIDNPR
DVLDSIISEYRKVNGKEECS 211
XXXXXXXXXILAAVENNSATMKWFTL
(24aa gap)
LINLSHRPLTPLFNASLKEVLRLYPPV (frame shift)
SIGVSRLVIEEFTIDNGKYTFLKGAQIIQSLYSIFRDEKYWSSPNEFIP
ERFIYQNN*SNNWFPYSIGVRNCVGMGFSQDEL
YLLLTNLVLNFHILPPFENTKIDGTPIFGFSFKPKLR*
Contig6572   Chr 2
IIAFP1D12433
JAX4a48g04.s1
JAX4a81g11.r1
JAX4a81g11.s1 missing I helix exon
JC1c02f04.r2 
JC2a18h05.s1
JC2a64g05.s1 
JC2b88e02.r1
JC2b112a04.r1 90% to seq 27
JC2b155a02.r1
JC2b212a08.r1 short frag at C-term
JC2b254e11.s1 
JC2b329d01.r1 5 prime part shown below
JC2c48h11.r1
c-JC2c48h11.s1
JC2c145g10.r1 
JC2c145g10.s1
JC2e17e02.r2
JC2e17e05.r1 91% to seq 27
JC2e21b09.s1 5 prime part shown below
JC2e71b08.s1

CYP518B1 complete Seq (26+28) 50% to seq 20 482 aa
MLTNIIILIILYLFYDF (0)
CYKNFKYRNYGSPWALPVI (1)
GHFIHVINQPHLVVHNDRMKYNNGRFVNYWFGDYL
SIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWDF
HHGILSKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGEN(0)
FDTRSNFLYFYSNILFDYILGKRVENIYENELRKDRKKFMVSIQEVMDSVG
LIKFLNYLILSYPFLSIYLRYFTYTTFNLKKIL
KQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIECDKSFVAIAIELLAAGT (1)
DTNSSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATLKEVLRLIP
ATPFSVPRMSNEGFEVDGIKIPKG (0)
TYLFPSMYSIFRDEKYWGENANKFYPERFLTDSHSNNYFPYGVGKRMCLG
SNFSQHELYICLTNIVLNFKIKSIDGKPLNEIP (1)
NYGITFRPNIFEVKLENR*

Contig5329 Chr 2
Contig13654 Chr 2
IIADP2D5718
IIAGP1D2186
JAX4a94e04.r1
JAX4a94e04.s1
JAX4a215e06.r1
JC1b80a09.s1

>JPCRa04a07.p1 413590 letters
        Length = 413,591

  Minus Strand HSPs:

Query:      5 IIILIILYLF-----YDFCYKNFKYRNYGSPWALPVIGHF 39
              +I L ++Y F     Y  CYKNFKYRNYGSPWALPVIG F
Sbjct: 147997 VIPLKLIYKFFY*KNYIKCYKNFKYRNYGSPWALPVIGKF 147878

Query:      8 LIILYLFYDFCYKNFKYRNYGSPWALPVI-GHFIHVINQPHLVVHNDRMKYNNGRFVNYW 66
              L++ + F  F +  F +  Y   +    I GHFIHVINQPHLVVHNDRMKYNNGRFVNYW
Sbjct: 147891 LLVNFFFIFFFFFLFFFN*YL**YFFNFIKGHFIHVINQPHLVVHNDRMKYNNGRFVNYW 147712

Query:     67 FGDYLSIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWDFHHGIL 126
              FGDYLSIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWDFHHGIL
Sbjct: 147711 FGDYLSIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWDFHHGIL 147532

Query:    127 SKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGEN 162
              SKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGEN
Sbjct: 147531 SKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGEN 147424

Query:    161 ENFDTRSNFLYFYSNILFDYILGKRVENIYENELRKDRKKFMVSIQEVMDSVGLIKFLNY 220
              + FDTRSNFLYFYSNILFDYILGKRVENIYENELRKDRKKFMVSIQEVMDSVGLIKFLNY
Sbjct: 147324 KKFDTRSNFLYFYSNILFDYILGKRVENIYENELRKDRKKFMVSIQEVMDSVGLIKFLNY 147145

Query:    221 LILSYPFLSIYLRYFTYTTFNLKKILKQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIE 280
              LILSYPFLSIYLRYFTYTTFNLKKILKQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIE
Sbjct: 147144 LILSYPFLSIYLRYFTYTTFNLKKILKQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIE 146965

Query:    281 CDKSFVAIAIELLAAGT 297
              CDKSFVAIAIELLAAGT
Sbjct: 146964 CDKSFVAIAIELLAAGT 146914

Query:    298 DTNSSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATL 357
              DTNSSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATL
Sbjct: 146822 DTNSSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATL 146643

Query:    358 KEVLRLIPATPFSVPRMSNEGFEVDGIKIPKG 389
              KEVLRLIPATPFSVPRMSNEGFEVDGIKIPKG
Sbjct: 146642 KEVLRLIPATPFSVPRMSNEGFEVDGIKIPKG 146547

Query:    390 TYLFPSMYSIFRDEKYWGENANKFYPERFLTDSHSNNYFPYGVGKRMCLGSNFSQHELYI 449
              TYLFPSMYSIFRDEKYWGENANKFYPERFLTDSHSNNYFPYGVGKRMCLGSNFSQHELYI
Sbjct: 146458 TYLFPSMYSIFRDEKYWGENANKFYPERFLTDSHSNNYFPYGVGKRMCLGSNFSQHELYI 146279

Query:    450 CLTNIVLNFKIKSIDGKPLNEIPN 473
              CLTNIVLNFKIKSIDGKPLNEIP+
Sbjct: 146278 CLTNIVLNFKIKSIDGKPLNEIPS 146207


CYP519A1 Seq. 12 32% to 508 complete sequence
MESIINLIFYIIIFLILIDF (0)
LKKNISFRKNEPPRGGVAFPIFGDLPKLGENPHRYLTNLA
MKKGGIYSVWLGDEKVFILTDPEAVRDAWVKQFRNFSDHPKTKSVRIFSGNFNDMAFAEY
SQWKINSKWVSSAFTKTKLKTIGDLIEKESNYFIEHLKAYSNSGQP (0)
IFPKPYISKFGINVISGMMFSQVISKDESVDKGAMEKLTVPIQAVFKRLGADNLDDFISI 
LQPVFYFQNEKFKRQVQEIYDYLEG
IYNQHDTNLDTENPKDLMDLLIISTEG
KERDMIIHIGMDCLLAGSDSTSATCEWFCLFMINNP
DVQKKAYQELINALKDEDNKKFIPISKKDNCPYMLSIFKEVLRLRPVGVLGIPRVALEET
TIMGYTIPKGSQIFQNVYGMSHLFV
SDPYKFKPERWIEYK 
KQKDLLKEKEMQQLQEGADVVIDNKNNIKNNNSSNKPNSKTNSIFDD
LDKVSIPYSVGNRKCPGASLSELALFSLCSNILLNFELKSIDGKPIDDTEVYGLTIHTKVHPISLTLRP*
AU037246
AU039172
AU071956
AU072353
AU072354
AU073173
AU074914
C84123
IIAFP1D41514
JC1a150h12.r1
c-JC1b185d08.r1
JC2a50a01.r1 93% to seq 12
YSVGNRKCPGAFFSELALFSLGSNILLNFTLKSIDGKPIDDTEVYGLTIHTKVHPISLTLRP
JC2a186g04.r1
JC2a186g04.s1
JC2a236b08.s1
JC2b07f01.s1
JC2b85e11.r1 ends match Seq 12 middle does not 
JC2b306d05.s1
JC2b332h01.s1
JC2c176f07.r1
JC2d23b01.r1
JC2e69g10.s1
JC2e116g02.s1
JC2e116g02.r1
QELINALKDEDNKKFIPISKKDHCPYMGAIFKQGLLLKPLGLLRIPPVALK*TTIMGYTI
PKGSQIFQNVYGMSHLFVSDPYKFKP*RWIEYKKQK
sdic6A22g4.p1c
sdic6A45h1.p1ca
SSB654
SSC825
SSE270
SSE271
SSG312
SSM444
SSM473
AFD538
AFK771

>JC2b07f01.s1_frame-1 Clone JC2b07f01, standard read, bases 149 through 555, from 1999-06-29, translation frame -1
YFFLFXVYFDN*NFLFFFILILINQWSLL*I*YFI*LFF*F*LILKXLYFENYIL**NII
IYIILK*FFFFFKYS*KRIYXLGKMSPXEEVXHFQYLEIYQN*ERTLTDI*QTWQ*KKEE
SIQFG*EMKKFSF*L
>JC2b07f01.s1_frame-2 Clone JC2b07f01, standard read, bases 149 through 555, from 1999-06-29, translation frame -2
IFFYSRFILIIKIFYFFLY*Y*LINGVYYKFNILYNYFFNFN*F*XIYILKTIYYNKI**
YILY*NNFFFFLNTVKKEYXF*EK*APXRRXCISNIWRFTKIRREPSQIFNKLGNEKRRN
LFSLXRR*KSFHFN
>JC2b07f01.s1_frame-3 Clone JC2b07f01, standard read, bases 149 through 555, from 1999-06-29, translation frame -3
FFFIXGLF**LKFSIFFYININ*SMESIINLIFYIIIFLILIDFEXFIF*KLYIIIKYNN
IYYIKIIFFFF*IQLKKNIXFRKNEPPRGGXAFPIFGDLPKLGENPHRYLTNLAMKKGGI
YSVWXGDEKVFILT

>sdic6A22g4.p1c_frame+1 translation frame +1
ILMIKIFYFFLY*Y*LINGVYYKFNRLYNYFCNFN*FCKYLYFENYIL**NIIIYIILK*
FFFFFKYR*KRIYLLGKMSHQEEVLHFQYLEIYQN*ERTLTDI*QTWQ*KKEESIQFG*E
MKKFSF*LTQKQLEMHGLNSLEILVTIQKQKVLEYFPVILMIWPSLNILNGK*IVNGYHL
PSQRQN*KLLGD
>sdic6A22g4.p1c_frame+2 translation frame +2
F**LKFSIFFYININ*SMESIINLIGYIIIFVILIDFVNIYILKTIYYNKI**YILY*NN
FFFFLNTDKKEYIF*EK*ATKRRCCISNIWRFTKIRREPSQIFNKLGNEKRRNLFSLVRR
*KSFHFN*PRSS*RCMG*TV*KF**PSKNKKC*NIFR*F**YGLR*IFSMENK**MGIIC
LHKDKIKNYWV
>sdic6A22g4.p1c_frame+3 translation frame +3
FDD*NFLFFFILILINQWSLL*I**VI*LFL*F*LIL*IFIF*KLYIIIKYNNIYYIKII
FFFF*IQIKKNISFRKNEPPRGGVAFPIFGDLPKLGENPHRYLTNLAMKKGGIYSVWLGD
EKVFILTDPEAVRDAWVKQFRNFSDHPKTKSVRIFSGNFNDMAFAEYSQWKINSEWVSSA
FTKTKLKTIG*

>CYP519A1 dd_02971 chr 2 genome assembly one extra exon missing N-term
MLKKNISFRKNEPPRGGVAFPIFGDLPKLGENPHRYLTNLAMKKGGIYSVWLGDEKVFIL
TDPEAVRDAWVKQFRNFSDHPKTKSVRIFSGNFNDMAFAEYSQWKINSKWVSSAFTKTKL
KTIGDLIEKESNYFIEHLKAYSNSGQPIFPKPYISKFGINVISGMMFSQVISKDESVDKG
AMEKLTVPIQAVFKRLGADNLDDFISILQPVFYFQNEKFKRQVQEIYDYLEGIYNQHDTN
LDTENPKDLMDLLIISTEGKERDMIIHIGMDCLLAGSDSTSATCEWFCLFMINNPDVQKK
AYQELINALKDEDNKKFIPISKKDNCPYMLSIFKEVLRLRPVGVLGIPRVALEETTIMGY
TIPKGSQIFQNVYGMSHLFVSDPYKFKPERWIEYKKQKDLLKEKEMQQLQEGADVVIDNK
NNIKNNNSSNKPNSKTNSIFDDLDKVSIPYSVGNRKCPGASLSELALFSLCSNILLNFEL
KSIDGKPIDDTEVYGLTIHTKVHPISLTLRP


CYP519B1 Seq (35+44+75) complete seq 51% to seq 12 496 aa
MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSLNPHRSLTELAKV
YGGVYSLHIGDSKTVVITDVSAFKDVTIKQFKNFANRPQPKSIRVITNFKGLAFADYDQW
QKTRKLVSSALTKTKIKTFNNLIEKQTENLIESMNEFSNKNEL (0)
FHPRKYLTKYSLNIILS
MLFSKEIGKNESINKGTMERLTIPFNEAFKKVGKVDDFLWFLSPFFYFSNKQYKKYIFDI
YYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKETVILVGMDFLLAGSDTQKATQEW
FCLYLINNPDVQKKAYQELISVVGKDCKFVTSNHIENCPYFISIIKEVFRIRSPGPLGLP
RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLVSEPDQFKPERWINYKNQQQQKQQQQQQ
QVNNKNSIDSSESSNLEFFDDLEKVSNPFSLGPRNCVGMAIAKSSIYSVCSNILLNFELS
SINNQIIDDNEVFGVSINPKEFSIKLTKR
Contig6040
IIACP1D2091 N-term
IIAFP1D26250 
IIAFP1D53328
IIAGP1D2374
JAX4a216e09.r1 
JAX4a216e09.s1 91% to IIACP1D2091 45% to seq 12
JC1b162b04.s1
sdic6A33h1.p1c 81% TO N-TERMINAL OF SEQ (35, 44, 75) WITH NO INTRON
MNLINSILIFYFIWIVFDFIRKNRRISFNDPPSLWAFP

>JC3f115e15.s1 Clone JC3f115e15, standard read, bases 27 through 739, from
            2002-12-10
        Length = 711

  Minus Strand HSPs:

 Score = 259 (96.2 bits), Expect = 2.3e-21, P = 2.3e-21
 Identities = 48/48 (100%), Positives = 48/48 (100%), Frame = -3

Query:     1 MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSL 48
             MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSL
Sbjct:   643 MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSL 500

CYP519C1 Seq (31+32+33) complete sequence 39% to seq 12
MNILLLIFYFLVCFLIFDF (0) 
IKKNKVKKYDVPTLSYALPIIGHLYKLGVNPHRNLTKLVEKNGGIFSLWLGDIKTVIVTDP 
SINKEI MVKQFTNFSDRPRLKSFESFTGGGVNLIFIDYNEKWPVIRKIVSSSITK 
TKIISNYKEVIENQTKILINSMRTHSKINEP (0)
FKSKKYFGKFSISIVLGIMFKQDNDEKQININDNNIDNDPIT
KLTEPIQQVFLLLGTGNISDFIKILRPFFKNEYKKLNNSAS
KVFKFMEEIYDQHLLKFDKSNPRDLMDYFIEYEFTNSPNTTLEEKKISIIKGCMSFVFAGDDTVAAT
LEWVCLYLINNPAIQEKCYNELISVLGDNNNESKIKFISLKERDNCQYLINVIKE
VLRIRTPLPLSVPRIATQNCEINGFFIEKGTQILSNAFGMSHLYVDEPNVFNPDRW
INYYNQKQQQQQQQQQPQPIQNNNYFNDLDRVCLPFSTGPRNCVGISIAELNLFSVCANI
ILNFQIKSIDGMQLKDIEVSGISIHPIPFSIKLISRN
CHR2.0.36204 
Contig14835 Chr 2
IIAFP1D21425 
IIAGP1D1935
JAX4a151f11.s1 92% to seq 33
JAX4b23a04.s1 91% to seq 33
c-JAX4a75h04.s1 
JC2a39h06.s1 
JC2a233c03.r1
JC2b147a08.s1
JC2b175d02.s1
JC2b175d02.r1
JC2b233a07.r1
JC2e04d01.s1 may extend to N-terminal exon (MLVLFINYLXXEXISXDF?)
JC2e22c10.r1 
JC2e115e03.s1 92% to seq 31
JC2f02e07.s1
JC2f02e07.r1
JC2f11b07.r1

>CYP519C1 dd_00538 chr2 genome assembly missing first exon
MNIKKNKVKKYDVPTLSYALPIIGHLYKLGVNPHRNLTKLVEKNGGIFSLWLGDIKTVIV
TDPSINKEIMVKQFTNFSDRPRLKSFESFTGGGVNLIFIDYNEKWPVIRKIVSSSITKTK
IISNYKEVIENQTKILINSMRTHSKINEPFKSKKYFGKFSISIVLGIMFKQDNDEKQINI
NDNNIDNDPITKLTEPIQQVFLLLGTGNISDFIKILRPFFKNEYKKLNNSASKVFKFMEE
IYDQHLLKFDKSNPRDLMDYFIEYEFTNSPNTTLEEKKISIIKGCMSFVFAGDDTVAATL
EWVCLYLINNPAIQEKCYNELISVLGDNNNESKIKFISLKERDNCQYLINVIKEVLRIRT
PLPLSVPRIATQNCEINGFFIEKGTQILSNAFGMSHLYVDEPNVFNPDRWINYYNQKQQQ
QQQQQQPQPIQNNNYFNDLDRVCLPFSTGPRNCVGISIAELNLFSVCANIILNFQIKSID
GMQLKDIEVSGISIHPIPFSIKLISRN

CYP519C2P seq 46 and related seqs 77% to seq 32
KIKKNDIPTLPFALPIIGHLYRPGSNPHRDLTKLVEKNGGIISLCLGDIKTVIFTDPSITKEL
JC2e83h08.r1 3 diffs with JC2e99d08.s1
JC2e99d08.s1 N-term
sdic6A2d5.p1c N-term 89% to JC2e99d08.s1
sdic6B24c11.p1c
FLFVNNKIIGYNNLLQIIYSLYLKKMRGKVKKG this might be an N-terminal exon

>sdic6B24c11.p1c translate frame +1 translate plus frames translate all frames
TTTTTTGTTTGTTAACAATAAAATAATTGGATATAATAATCTTTTACAAATTATATATTC
TTTATATCTCAAAAAAATGCGAGGAAAAGTAAAAAAAGGTTGATTTTAGTTTGTCAGAAT
TGGCCTTATATTATTATTTGGTGGGTTAAAAGTATAAATTTATTTGATGAATGTTTTTAA
ATTAAATTTATATTATACAAATAATTAAAAAAAAAGAAAAAAAAAAATTAAAGATAATAA
TTTTTTTTTTTTGATTAAAAATTAAAAAACATATTCTTTTTTTATAGATAAAGAAAAATA
AAAAAAAATGATATACCAACCTTACCATTTGCTTTACCAATAATTGGTCATCTTTACAGA
CCTGGAAGTAATCCACATAGAGATTTAACAAAACTGGTAGAAAAGAATGGTGGAATTATC
TCTTTGTGTCTTGGTGATATCAAAACTGTAATTTTTACAGATCCTTCTATAACCAAAGAA
TTATTAGGCCAATTTAAGTAATCCATAATAACTAGAAATTTGAAATAAAGAAATCCCTAC
TAAAATTTCTAATTTAAAAAATCTTGGGTAAATTAATAATTATAAAAAAAATTTAAAAAT
AAAAACTAATGAACTATCACTAGGT

>Contig_2045, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion translate frame +1 translate plus frames translate all frames
ATTGGAAGCAATATCAAGAATAAATAATTTAATTACTTGTTTAAGTAATGTTGGTATCAC
TTTTCAAAATAATACTACTAGTAATAAATTAGTTAATGACTATATCTATAATAGAGATAA
AAATGGTAATCAACTTTCAATTGAAGCTTCAGTTAGTTTATTAAATGAAATTATTAAAAA
AGAGAAAGAACAACAACAAGACCAACAAAATCAACAAGAACAACAAGAGCAACAAGATGA
TGATGATGATGAATACTTTGAAGATTATGAAGATGAAGATGAAGATTATCAATATGATGA
ATATGATGAAAATGATGAAAATGATTTATATGATTGTTGAGATGTGGAACAGCACTTGAA
TAATTGTTTTTTAAAAAAAAAGTTTTATTAAAAAAAATAATAATAAAGAAAATTCTTAAA
AAACTTAGACAAAAAACTATTACAAAAGATCCTTTGAAAATTTTTTTTTTTTTAAAAAAA
AAGGTACTCGAAATGATTCATTTATATGTTGACGAACCAAATGTATTTAATCCTGATAGA
TTCACAAAATCAAAAACTACCAACAACAACTGATCAACAACAATGATCTTGATAGGGTTT
GTTTACCATTTCCAATTGGTCCTAGAAATTGTGTAGTGTTGTAGTGAGTTTTTTCACCCC
GACTGTAGGGAAAAATAGCGCTATGTGGTCAAGAAAAAAAAAAAATGTAAAATTTCCCTA
CAACATGGGTTCCACAAAAAATAAAAAAATAAATAATAAAATTGAAAGTTGACTAACCAT
CAAATAAAAAAATAAAAAAAAAGTGGTTCATTTATTTTTATTATATATTTTTGTTTTTTA
TTTATTTTTTAAATTATCACCTTTTTTATTATCTCTTTTTTTAATTATAAATTATTTTTC
AAATTAATTCAAACACTAATTCTACTCAAAAAATCAAAACATTAATTCTGCACCTAAAAT
CAGCCATGGCTGAAATTATTTTTTATTTTCTTCTTATCAATTTAGTTTATATTAATTGTC
TAAAACAATAGATAAAAATTGATTTTGGTAAATTTATCTCTTCACCATCTATATTTGATA
CAGCTCTTAACAATGGCGAAAAATTACAACTATTATTAATCGAATTAGCTGTTGGTGTAT
TTATTAAAACTCTTTTTTTATTTTTTTTTATTGGTATATTATTAATATTGAAATGATAGT
AATACAAATTAATTATTCTTTGTTTTTTTTTTTTTTCAATTTTAACATTATATAATTTGT
ATAAATTATATATTTGTTCGCACATGTTTACGAAATCTTCAGTAGAGATTTTTTTTTTTT
CAAAAATAATAAATAATTGTTTATTCATACCCCATAATATAAACAATCTGTGAGAATCCA
AGGATAAATTAAGACTGTTGATATTAGTTAAAATAAAATGTATTGTTTTAACTGTTTATA
ATATTTATTATTTATTTAATTATTTTTATTTTTTTATAAATAATTTTTTTTTTTTTTTTT
TTAATTTTCACAGTTCCCAAAACAAATCTTATAATAAATTAAAGTAAATAGTTATTAATA
GTATATTATAAATAGATTTTGATAATAAATTATAAGTTTTTTAATTTAAAGATAATATAT
TCTTTAAATCGATTTTATCATACCACCTGCCTATTTTTTTTTTTTTTTTTTTTTTTTTTT
TTTTTTTTTTTTTTGTTAACAATAAAATAATTGGATATAATAATCTTTTACAAATTATAT
ATTCTTTATATCTCAAAAAAATACAAGGAAAAGTAGAAAAAAGTTTATTTTAGTTTGTCA
GAATTGGCCTTATATTATTATTTGGTGGGTTAAAAGTATAAATTTATTTGATGAATGTTT
TTAAATTAAATTTATATTATACAAATAATTAAAAAAAAAGAAAAAAAAAAATTAAAGATA 1920
ATAATTTTTTTTTTTTGATTAAAAATTAAAAAACATATTCTTTTTTTATAGATAAAGAAA
AATAAAAAAAAATGATATACCAACCTTACCATTTGCTTTACCAATAATTGGTCATCTTTA
CAGACCTGGAAGTAATCCACATAGAGATTTAACAAAACTGGTAGAAAAGAATGGTGGAAT
TATCTCTTTGTGTCTTGGTGATATCAAAACTGTAATTTTTACAGATCCTTCTATAACCAA
AGAATTATTAGGCCAATTTAAGTAATCCATAATAACTAGAAATTTGAAATAAAGAAATCC
CTACTAAAATTTCTAATTTAAAAAATCTTGGGTAAATTAATAATTATAAAAAAAATTTAA
AAATAAAAACTAATGAACTATCACTAGGTTTCAAAAGCTCATTGGGAGGAATATTATTTG
AGAATCGAACCATTGATCTTAAGTGTTAATTTTTATAAATTTCGCCCAAAGGGGGGCTCG
AACCCCCGACCACAAGGTTAAAAGCCTTGCGCTCTACCGACTGAGCTATCTGGGCCTTGA
ATGAAAATTGCATTTTTATTACAAAAAATCGATCAATTAAAATTATTTTTCAAAAAGTTA
TAAAATAAACATTCAAAAACAAAAAAGAGTATACTCTTAAGTGCAAATTATAAGGGGGAC
TGAAACAGAGGGACAAATATTGTATTATAGGATGCAATTGCGTCTTATAATTCATTTTAT

CYP519D1 Seq. (14+34) 38% to seq 12 566 aa
MNVFVLTVFICIIYLLFDL (0)
IKKNKKLKDEPPTPKLALPLIGHLYLLGDRPNRSFLELSKRYGG
IFKIWMGEYPTVVLTDPDHVNEVWCKQFLNFTNRPHFNSLDQFSSGFRNLSFSDYPLWSE
LRKLVSSSFTKSKVKGISNLLETQTNYLINTMNNYSINNKP (0)
FNPKKYIHKLTLNVVCMIA
FSKEIKNDEDVNEGDMARLTKPKEMILKHLGSSNFCDFVPLVRPLFYLKNKRFDQTLKQV
IEYIKEIYDDHLLNLDLNSSPKDIMDLLIMSTNDSKEDIIIHTCIDLLIAGSDTVGVTFR
VFRFIYPNNPIIQEKCFNELFNAFSNSNNTDNNNNNSTITAA
IGFGDEYSSKTPFLNACIKEVLRIKPVTSLGLPRIANDDTFVNGYRIPKGT
PIIENIYGLSNSDQLIDDPTTFNPYRWLEYQKLKS
FQNDLKQQQQQQQQQQQQQQQLQLQQEQQEQEQQKINLEFNNNNNNNNNNNNNNS
NNKHKYYNDLDKISIPFSTGRRGCVGVQLGEAELYIVCANLVYNFKIESWDGKKINELEDFG
IIIHPSSHNLKITKRNNK*
AU073802
C90413
IIAAP1D4848
IIADP1D0354
IIAFP1D73623
IIAFP1D78046
IIAFP1D97427
JAX4a136d08.s1
JAX4a136e05.r1
JAX4a169f04.r1
sdic6A3c3.q1t
SSI404 (deletion of mid region)

>IIAFP1D74108
        Length = 822

  Minus Strand HSPs:

 Score = 115 (45.5 bits), Expect = 3.9e-13, Sum P(2) = 3.9e-13
 Identities = 23/24 (95%), Positives = 23/24 (95%), Frame = -3

Query:    20 IKKNKKLKDEPPTPKLALPLIGHL 43
             IKKNKKLKDEPPTPKLALPLIG L
Sbjct:   109 IKKNKKLKDEPPTPKLALPLIGIL 38

 Score = 89 (36.4 bits), Expect = 3.9e-13, Sum P(2) = 3.9e-13
 Identities = 17/20 (85%), Positives = 18/20 (90%), Frame = -1

Query:     1 MNVFVLTVFICIIYXLFDLI 20
             MNVFVLT FICIIY LFDL+
Sbjct:   351 MNVFVLTFFICIIYLLFDLV 292

>IIAFP1D78046  translate frame +1 translate plus frames translate all frames
TTTACCTATATAGNCNGCCCGTTTGAGTGAACTTCAGNGCCGAACTTTANAGGGATCCAC
CAACAACAACAATAGGATTTGGTGACGAATATTCTTGTAAAGCACCATTAATACATGCAT
GCATTAAAGAAGTTTTAAGAATTAAACCAGTTACATCATTAGGTTTACCACGTATTGCAA
ATGATGATACTTTTGTAAATGGTTATAGAATTCCAAAAGGAACTCAAATCATTGAAAATA
TTTATGGTCTTTCTATTCTGATCAATTAATAGATGATCCTACAACTTTTAATCCTTATCG
TTGGTTGGAATATCAAAAATTAAAATCATTTCAAAATGATTTAAAACAACAACAACAACA
ACAACAACAACAACAACAACAACAACAACAACTACAACTACAACAAGAACAACAAGAACA
AGAACAACAAAAAATTAATTTAGAATTTAATAATAATAATAATAATAATAATAATAATAA
TAATAATAATAGTAATAATAAACATAAATATTATAATGATTTAGATAAAATTTCAATTCC
ATTTTCAACTGGTAGAAGGGGATGTGTTGGTGTACAATTAGGTGAAGCAGAGTTATATAT
CGTTTGTGCAAATTTAGTTTATAATTTCAAAATTGAATCATGGGATGGTAAAAAAATAAA
TGAATTGGAAGATTTTGGTATTATTATTCACCCTTCTTCTCATACTTTAAAATTACAAAA
AGAAAATATTAATAANAGTANTAAAATAAAAAATAAAAANAAAAGATATTCTTTTTACCA
TATCACAATTTTTAGATATCTAAAAAAAA

>_1
                              FTYIXXPFE*TSXPNFXGIHQQQQ*DLVTNILVKHH*YMHALKKF*ELNQLHH*VYHVLQ
                              MMILL*MVIEFQKELKSLKIFMVFLF*SINR*SYNF*SLSLVGISKIKIISK*FKTTTTT
                              TTTTTTTTTTTTTTTRTTRTRTTKN*FRI******************T*IL**FR*NFNS
                              IFNW*KGMCWCTIR*SRVIYRLCKFSL*FQN*IMGW*KNK*IGRFWYYYSPFFSYFKITK
                              RKY**XX*NKK*KXKIFFLPYHNF*ISKKX
                              >_2
                              LPI*XARLSELQXRTLXGSTNNNNRIW*RIFL*STINTCMH*RSFKN*TSYIIRFTTYCK
                              **YFCKWL*NSKRNSNH*KYLWSFYSDQLIDDPTTFNPYRWLEYQKLKS FQNDLKQQQQQ
                              QQQQQQQQQQLQLQQEQQEQEQQKINLEFNNNNNNNNNNNNNNS NNKHKYYNDLDKISIP
                              FSTGRRGCVGVQLGEAELYIVCANLVYNFKIESWDGKKINELEDFGIIIHPSSHTLKLQK
                              ENINXSXKIKNKXKRYSFYHITIFRYLKKX
                              >_3
                              YLYXXPV*VNFXAELXRDPPTTTIGFGDEYSCKAPLIHACIKEVLRIKPVTSLGLPRIAN
                              DDTFVNGYRIPKGTQIIENIYGLSILIN**MILQLLILIVGWNIKN*NHFKMI*NNNNNN
                              NNNNNNNNNNYNYNKNNKNKNNKKLI*NLIIIIIIIIIIIIIIVIININIIMI*IKFQFH
                              FQLVEGDVLVYN*VKQSYISFVQI*FIISKLNHGMVKK*MNWKILVLLFTLLLIL*NYKK
                              KILIXVXK*KIKXKDILFTISQFLDI*KK
>IIAFP1D97427
        Length = 792

  Minus Strand HSPs:

Query:     1 MNVFVLTI 8
             MNVFVLT+
Sbjct:   698 MNVFVLTV 675

Query:     8 IKKNKKLKDEPPTPKLALPLIGHLYLLGD 36
             IKKNKKLKDEPPTPKLALPLIGHLYLLGD
Sbjct:   456 IKKNKKLKDEPPTPKLALPLIGHLYLLGD 370

>IIAFP1D97427  translate frame +1 translate plus frames translate all frames
TTTAACTTTTNNAAGCACTGNGGTGAATGCCCCCCGACCTTAGAGTGATCCACTNATTAA
TTAAATAATTTGTTTGAGTTTCTAATATGTTTGAAATTCCTTTAACTTTTGATTTTGTGA
AGGATGAAGAAACTAATTTTCTAAGTTCTGACCAAAGTGGATAATCACTAAATGATAAAT
TTCTAAAACCTGATGAGAATTGATCTAAACTATTAAAGTGTGGTCTATTTGTAAAATTAG
AAACTGNTTACACCATACTTCATTAACATGATCAGGATCAGTCAAAACAACTGTTGGATA
TTCTCCCATCCAAATCTTAAAAATTCCACCGTATCGTTTTGATAATTCTAAAAATGATCT
ATTTGGTCTATCACCTAATAAATATAAATGTCCTATTAATGGTAATGCTAATTTTGGTGT
TGGTGGTTCATCTTTTAATTTTTTATTTTTTTTAATCTATTTTTAAATAAATAAAATAAA
TAAATAAAATTAATAAATAAATTAAAAAAGGGTAGTGGTGGTGCGTAAAAAAAAAAAGTT
TTGAAAAAAAAAAAGATATTTTTTTTTTTTTTTTTTTTTTTTGGGAATAAAAAATATTAA
AAAAATGAAAAAATTAAATAAAATTAATAAAAGATATTTACCAAATCGAATAATANATAA
ATAATGCAAATAAAAACAGTTAAAACAAATACATTCATTTTGAGTGTAAAATATTTTTTT
TTTAAACTCTAAAACTTTTTTTTTTTTTTTTTTCCTGATTTTTTTTTCTACCGCCCCATC
TTTATAATAAAA

>IIAFP1D97427_frame-1 translation frame -1
FYYKDGAVEKKIRKKKKKKVLEFKKKIFYTQNECICFNCFYLHYLXIIRFGKYLLLILFN
FFIFLIFFIPKKKKKKKKYLFFFQNFFFLRTTTTLF*FIY*FYLFILFI*K*IKKNKKLK
DEPPTPKLALPLIGHLYLLGDRPNRSFLELSKRYGGIFKIWMGEYPTVVLTDPDHVNEVW
CXQFLILQIDHTLIV*INSHQVLEIYHLVIIHFGQNLEN*FLHPSQNQKLKEFQTY*KLK
QII*LXSGSL*GRGAFTXVLXKVK
>IIAFP1D97427_frame-2 translation frame -2
FIIKMGR*KKKSGKKKKKKF*SLKKKYFTLKMNVFVLTVFICIIYXLFDLVNIFY*FYLI
FSFF*YFLFPKKKKKKKNIFFFFKTFFFYAPPLPFFNLFINFIYLFYLFKNRLKKIKN*K
MNHQHQN*HYH**DIYIY*VIDQIDHF*NYQNDTVEFLRFGWENIQQLF*LILIMLMKYG
VXSF*FYK*TTL**FRSILIRF*KFII**LSTLVRT*KISFFILHKIKS*RNFKHIRNSN
KLFN*XVDHSKVGGHSPQCXXKL
>IIAFP1D97427_frame-3 translation frame -3
LL*RWGGRKKNQEKKKKKSFRV*KKNILHSK*MYLF*LFLFALFXYYSIW*ISFINFI*F
FHFFNIFYSQKKKKKKKISFFFSKLFFFTHHHYPFLIYLLILFIYFIYLKID*KK*KIKR
*TTNTKISITINRTFIFIR**TK*IIFRIIKTIRWNF*DLDGRISNSCFD*S*SC**SMV
*XVSNFTNRPHFNSLDQFSSGFRNLSFSDYPLWSELRKLVSSSFTKSKVKGISNILETQT
NYLINXWITLRSGGIHXSAXKS*

>CYP519E1 seq 84+17 complete 48% TO 519B1 look upstream of QPLA for more
MGIGLIILYLLIGLLAYDF (0)
TKKNKKISKNDPKQPLAIPVLGHLHLFGSQPHRSLTELAKKFGGIFTLWMGDERSMVITDPNILRELYVKNHLNFYNRASS
ESIRIYSGNLVDISFSVGESWKNRRRYVSAALTKTKVLNVITLIEEQANFLINSMQYYAKSGEP (0)
FFPHKYYNKYTMNIVMSIGFSKTISENESVEEGPISQLIIPFYNILENLGSGNLGD
YVWYTQPFFYFKNKKLEQDTKKVYTFLEEIYNEHIKNLDESNPRDLMDQLIISTG
GKEKDMVIHVST (0)  
DFLLAGSDTNASTLEWFCIFLANNPEIQKKAYEELISVVGKDCKAVTTKYRDDCPYLV
GAIKETLRMRTPAPLSLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYK
PERWVEYYKNKTPTREMEATTETKSNITTEILP
NDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGGKKIDETEV
FGITIHPKDFSIQLKKRE*
Dict-IV-V228a03.q1c
Dict-IV-V7e07.p1c
Dict-IV-V35d03.q1c
Dict-IV-V35d09.q1c
AU071937
IIBCP1D2282
SSC778
IICCP1D19875
IICBP1E43296
IICBP1E27480
JC1c295d05.s1
AU037557
IIAFP1D71814
SSD784 supports C-term 

>SSD784 (SSD784Q) /pub/dna_csm/LIBRARY/SS/SSD7-D/SSD784Q.Seq.d/
        Length = 600

  Plus Strand HSPs:

 Score = 633 (227.9 bits), Expect = 6.3e-61, P = 6.3e-61
 Identities = 120/120 (100%), Positives = 120/120 (100%), Frame = +3

Query:     1 SLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYKPERWVEYYKNKTPTREM 60
             SLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYKPERWVEYYKNKTPTREM
Sbjct:    39 SLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYKPERWVEYYKNKTPTREM 218

Query:    61 EATTETKSNITTEILPNDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGG 120
             EATTETKSNITTEILPNDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGG
Sbjct:   219 EATTETKSNITTEILPNDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGG 398

>Contig_2589, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 2691

  Minus Strand HSPs:

 Score = 93 (37.8 bits), Expect = 1.9e-10, Sum P(2) = 1.9e-10
 Identities = 19/19 (100%), Positives = 19/19 (100%), Frame = -2

Query:     1 MGIGLIILYLLIGLLAYDF 19
             MGIGLIILYLLIGLLAYDF
Sbjct:  2441 MGIGLIILYLLIGLLAYDF 2385

 Score = 85 (35.0 bits), Expect = 1.9e-10, Sum P(2) = 1.9e-10
 Identities = 16/16 (100%), Positives = 16/16 (100%), Frame = -1

Query:    20 QPLAIPVLGHLHLFGS 35
             QPLAIPVLGHLHLFGS
Sbjct:  2208 QPLAIPVLGHLHLFGS 2161

>_4
ISL*FCKF*K*LLLYYYFYF*NLYLFVFLIKIFYFIFFLFFFFNLYFLN**TKKNKKISK
NDPKQPLAIPVLGHLHLFGSQPHRSLTELAKKFGGIFTLWMGDERSMVITDPNILRELYV
>_5
*PMIL*VLKIIITLLLFLFLKSIFICFFN*NILFYFFFIFFF*SLFFKLIDKKE*KN**K
*SKATIGNSSIRTFTFIWKSTTSFFNRISKEIWWNFYIMDG**KINGHNRPKYTS*IICX
>_6
LAYDFVSFKNNYYFIIIFIFKIYIYLFF*LKYFILFFFYFFFLIFIF*INRQKRIKKLVK
MIQSNHWQFQY*DIYIYLEVNHIVL*QN*QRNLVEFLHYGWVMKDQWS*QTQIYFVNYMX

CYP519F1 Seq 47+68+80 (complete) 47% to seq 58
MEILTFIIYLITFFILFDF(phase 0)
KKKKFKKNKRYSKSPNKEANGPWSLPIIGGLHLIGDRPNRSFSELSKIYGGIYKIWLAERMLMI
VTDPEIIQDIWIKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDEWNKVRPKMTCHFTKIK
LNSTKPKQIVNDQLKKMLKIMTTHSLDSKPFNQYVYLNTYSMNIILGLMLSIELPHSNSN
DKDGQFSKVLHSIDEIFKSIGTNGPEDIFPTLLPFFKNRISTFTNHLNVIKDFIRSIYKQ
QIKTFDINIEPRNIMDCLISEYYEDDDQEDEVAKQELIIQLCIDMLVAATDTSASTLEWF
MLFMINNPNLQEDLYEEVVNVVGKDCPYVTFDDVPKLALIKACYFEILRIRPVTSLSLPR
VSMEDTTTLNDIFIPKDTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLN QFGNR
DDSIDTTNTTTNTTLNGTTS KYYNDLERVSIPFGVGKRRCMAPSMADHNVLIAMANIVLN
FTMKSSDPKQMPLSEEEQYAITIKPKYPFKVLFEKRS
Contig4769 Chr 6
IIADP1D1565 
IIAFP1D30759 
IIAFP1D53731
IIAFP1D75367
JC1a198c01.s1
JC1a278b03.s1
JC2c167d08.r1
sdic6A29c1.q1t N-term
sdic6B9c1.p1c

CYP519G1 Seq 58 complete 47% to seq 47
MNYLLIIICIIFFSLFFDF (0)
KIRKNWNLNFKRLFKKDVNGPWSLPIIGGIYLINDNPNRALTKLSKKYGGIYKIWLGESFSM (0)
VVSDPEIVNEIWVKQHDNFINRPKNITHK
MFSSNYRSLNFGDNPNWKFNRSMASSHFTKTKLLSSK
VTSVVEKKLNKLIETMEYHSINKLP (0)
FDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFKMLSTNSPQYSFPYLKF
FFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSF
ISDLQSNDIDILLQICIDIVVAGT (1)
DTVANLLQWFVLFCINY
PEIQEKLYNEIIEVVGKDCKVLKYEHISKMPYLYGCFRESLRIRPVTPLS
LPRVAKCDTYIKDDIFIPKG (0)
ATIIQNIFGMGNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYF
NDLDKISIPFGVGKRQCLSPAMAEQESLLSIATVVLNYKLKSNGQKKLNE
KEVYSITIKPQPFKLFLEKRV*
Dict-IV-V885f05.p1c
IIAAP1D3111
IIAAP1E3151 
IIAEP1D2263 
IIAFP1D59991 
JAX4a82a11.r1
JC1c13f09.s1
sdic6Ce8.q1t
Contig_0437

>Contig_0437, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 7555

  Minus Strand HSPs:


Query:    11 IFFSLFF-DF-KIRKNWNLNFKRLFKKDVNGPWSLPIIGGIYLINDNPNRALTKLSKKYG 68
             +FF + + +F KI+KNWNLNFKRLFKKDVNGPWSLPIIGGIYLINDNPNRALTKLSKKYG
Sbjct:  7114 LFFIINY*NF*KIKKNWNLNFKRLFKKDVNGPWSLPIIGGIYLINDNPNRALTKLSKKYG 6935

Query:    69 GIYKIWLGESFSMVVSDPEIVNEIWVKQHDNFI 101
             GIYKIWLGESFSMV    +I+  I +K ++N +
Sbjct:  6934 GIYKIWLGESFSMV----KII--IIIKNNNNLL 6854

Query:    82 VVSDPEIVNEIWVKQHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKT 141
             VVSDPEIVNEIWVKQHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKT
Sbjct:  6830 VVSDPEIVNEIWVKQHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKT 6651

Query:   142 KLLSSKVTSVVEKKLNKLIETMEYHSINKLPFDSY 176
             KLLSSKVTSVVEKKLNKLIETMEYHSINKLP   Y
Sbjct:  6650 KLLSSKVTSVVEKKLNKLIETMEYHSINKLPVSIY 6546

Query:   170 KLPFDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFKMLSTNSPQYSFPYL 229
             KL FDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFKMLSTNSPQYSFPYL
Sbjct:  6499 KL*FDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFKMLSTNSPQYSFPYL 6320

Query:   230 KFFFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSFISDLQSNDIDILLQI 289
             KFFFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSFISDLQSNDIDILLQI
Sbjct:  6319 KFFFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSFISDLQSNDIDILLQI 6140

Query:   290 CIDIVVAGTDTVANLLQWFVLFCI 313
             CIDIVVAGT  +  ++   ++  I
Sbjct:  6139 CIDIVVAGTGNIIIIIIIIIIIII 6068

Query:   293 IVVAGTDTVANLLQWFVLFCINYPEIQEKLYNEIIEVVGKDCKVLKYEHISKMPYLYGCF 352
             +++   DTVANLLQWFVLFCINYPEIQEKLYNEIIEVVGKDCKVLKYEHISKMPYLYGCF
Sbjct:  5967 LLLFNIDTVANLLQWFVLFCINYPEIQEKLYNEIIEVVGKDCKVLKYEHISKMPYLYGCF 5788

Query:   353 RESLRIRPVTPLSLPRVAKCDTYIKDDIFIPKGAT 387
             RESLRIRPVTPLSLPRVAKCDTYIKDDIFIPKG +
Sbjct:  5787 RESLRIRPVTPLSLPRVAKCDTYIKDDIFIPKGVS 5683

Query:   381 FIPKGATIIQNIFGMGNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYFNDLDK 440
             F  K ATIIQNIFGMGNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYFNDLDK
Sbjct:  5611 FFFKKATIIQNIFGMGNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYFNDLDK 5432

Query:   441 ISIPFGVGKRQCLSPAMAEQESLLSIATVVLNYKLKSNGQKKLNEKEVYSITIKPQPFKL 500
             ISIPFGVGKRQCLSPAMAEQESLLSIATVVLNYKLKSNGQKKLNEKEVYSITIKPQPFKL
Sbjct:  5431 ISIPFGVGKRQCLSPAMAEQESLLSIATVVLNYKLKSNGQKKLNEKEVYSITIKPQPFKL 5252

Query:   501 FLEKRV 506
             FLEKRV
Sbjct:  5251 FLEKRV 5234


MNYLLIIICIIFFSLFFDF JC1c13f09.r1 possible N-terminal of 520B1 
|| || |       | |||
MNILLLIFYFLVCFLIFDF N-term of seq 519C1
IIAFP1D75985 matches N-term seq shown above from JC1c13f09.r1

>JC3e96f03.r1 Clone JC3e96f03, reverse read, bases 60 through 572, from
            2002-09-12
        Length = 511

  Minus Strand HSPs:

 Score = 320 (117.7 bits), Expect = 2.3e-27, P = 2.3e-27
 Identities = 57/61 (93%), Positives = 59/61 (96%), Frame = -1

Query:    83 VVSDPEIVNEIWVKQHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKT 142
             VVSDP+IVNEIWVKQHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSM + HFTKT
Sbjct:   184 VVSDPKIVNEIWVKQHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMVNGHFTKT 5

Query:   143 K 143
             K
Sbjct:     4 K 2

 Score = 199 (75.1 bits), Expect = 5.1e-14, P = 5.1e-14
 Identities = 42/93 (45%), Positives = 61/93 (65%), Frame = -2

Query:    11 IFFSL-FFDFKKKKFKKNKRYSKSPNKEANGPWSLPIIGGLHLIGDRPNRSFSELSKIYG 69
             +FF + + +F K +   N  + +   K+ NGPWSLPIIGG++LI D PNR+ ++LSK YG
Sbjct:   468 LFFIINYLNF*KIRKNWNLNFKRLFKKDVNGPWSLPIIGGIYLINDNPNRALTKLSKKYG 289

Query:    70 GIYKIWLAERMLMVVSDPEIVNEIWVKQHDNFI 102
             GIYKIWL E   MV    +I+  I +K ++N +
Sbjct:   288 GIYKIWLGESFSMV----KII--IIIKNNNNLL 208

CYP521A1 Seq. 13 complete seq 36% to seq 12
MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRTDPYKTLAKASKK
TEHGILKCWNGEHLMVVVDNPSIIKQMYVNTNNFTDRPQTKVFEIISRNYKNSGFANGEK
WKHLRGLYAPSFTKIKSRPHENIILKYVNFEIKSLKNHAITNSIYNPFLIENINSFGTKV
ITEIIFGREFSENEVYSLIG PMNKLFGILDTPFPSESISFLKPFYRRSYKECDKQCEELF
KLVEKVYDDHLLNLDKDNPKDVMDVMIVETDFKEKDHVICICCDLLMGTKDTFNTIVLWF
FVLMINYQDVQLKGYQEIIK
VLECTGRDHVTIEDIDKLPYIDGIIKEISRIH PAGPLSVPRTAINDIMINGYFIPK
GCHVFQNTYGAVYNYMK
ESDEPCKMKPERWIENEKLRK
DGKLDPTNDLALISLPFSSGIRNCPGVGFAEYELFLLFSNIILNFHLSSPNNLKLNESGH
FGLTMKPFPFLVDLN*
AU071731
AU074308
C84168
C90134
Contig15846
IIAAP1D2650
IIAFP1D39484
IIAFP1D63655
IIAFP1D67462 Length = 834 
IIAFP1D78046
IIAFP1D81808
IIAFP1D85856
JC1b64g05.s1
JC1b246f07.s1
JC2d102b06.r1
SSC405
SSI130
SSI404
SSK320

>SSK320 (SSK320Q) /pub/dna_csm/LIBRARY/SS/SSK3-A/SSK320Q.Seq.d/
        Length = 625

  Plus Strand HSPs:

 Score = 242 (90.2 bits), Expect = 1.6e-19, P = 1.6e-19
 Identities = 48/48 (100%), Positives = 48/48 (100%), Frame = +1

Query:     1 MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRT 48
             MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRT
Sbjct:    16 MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRT 159

>IIAFP1D81808
        Length = 786

  Minus Strand HSPs: no first intron

 Score = 242 (90.2 bits), Expect = 1.3e-19, P = 1.3e-19
 Identities = 48/48 (100%), Positives = 48/48 (100%), Frame = -3

Query:     1 MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRT 48
             MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRT
Sbjct:   625 MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRT 482

CYP522A1 Seq. 18 29% to seq 85 489aa
MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKER
FHKSFDKFYDKYKDFYFIKFGQHDCIVLNSPKLIKQVVIEQSDSVFERFHTPSIKRYAQ
GKSILGCSPDEWKKLRSFIVISFSKNKMGQQVLDKIFHTQYLKFENHIKKLIKSNNNI (0)
VTLEPEFKRLTISIIFNFQFGTDLEFTDPLIDSLLVCTEKIIASCQKASDLMP
IFEIFTS (0)
YKDIDGVVKEMYALVKPFLEKYLKQH
DRNNPKCALDHMINCILDQDEPKLITYEHLPHFLMDMFIGGTESTARTM (0)
DWFTLMMTNRKEMQDRIRTELLDVGIRLPVLVDKQKYPLLNASIKEIHR 
LRPIQPIIASRVVNDPIVLKHECSAKGES 
YTIPVGTLIIPNAHSFNFDPQYHKDPLTFNPNRYIGDNPEILHMTFDIGIRTCPF
MSFAIDELFIIFSRLFQSFEFQPIDNTPISEEAFTINSIRPKQWSCQVIERDHK*
AU036927
AU071581
Contig14543 Chr 2 also in excel as 14583 check for typo
Contig 15712 
IIAEP1D0380
IIAFP1D28356
IIAFP1D73782
IIAFP1D77020
IIAFP1D85481
IIAGP1D3671
c-JAX4a56g08.s1
JAX4a66b12.s1
JAX4a86b01.r1
JAX4a86b01.s1
JAX4b25h02.r1
JC1a76h05.s1
JC2a62d07.r1
JC2b337b05.r1
JC2e96f05.r1
JC2e128b02.s1
SSB828
SFH475
SFH209
AFO254
AFJ405
SFK749
AFA531
AFH875
AFB727
AFO263
AFK692
SFH771
SFA776
SFD153

>Contig_2198, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 4993

  Minus Strand HSPs:

 Score = 993 (354.6 bits), Expect = 8.3e-260, Sum P(4) = 8.3e-260
 Identities = 187/187 (100%), Positives = 187/187 (100%), Frame = -3

Query:   303 DWFTLMMTNRKEMQDRIRTELLDVGIRLPVLVDKQKYPLLNASIKEIHRLRPIQPIIASR 362
             DWFTLMMTNRKEMQDRIRTELLDVGIRLPVLVDKQKYPLLNASIKEIHRLRPIQPIIASR
Sbjct:  2438 DWFTLMMTNRKEMQDRIRTELLDVGIRLPVLVDKQKYPLLNASIKEIHRLRPIQPIIASR 2259

Query:   363 VVNDPIVLKHECSAKGESYTIPVGTLIIPNAHSFNFDPQYHKDPLTFNPNRYIGDNPEIL 422
             VVNDPIVLKHECSAKGESYTIPVGTLIIPNAHSFNFDPQYHKDPLTFNPNRYIGDNPEIL
Sbjct:  2258 VVNDPIVLKHECSAKGESYTIPVGTLIIPNAHSFNFDPQYHKDPLTFNPNRYIGDNPEIL 2079

Query:   423 HMTFDIGIRTCPFMSFAIDELFIIFSRLFQSFEFQPIDNTPISEEAFTINSIRPKQWSCQ 482
             HMTFDIGIRTCPFMSFAIDELFIIFSRLFQSFEFQPIDNTPISEEAFTINSIRPKQWSCQ
Sbjct:  2078 HMTFDIGIRTCPFMSFAIDELFIIFSRLFQSFEFQPIDNTPISEEAFTINSIRPKQWSCQ 1899

Query:   483 VIERDHK 489
             VIERDHK
Sbjct:  1898 VIERDHK 1878

 Score = 877 (313.8 bits), Expect = 8.3e-260, Sum P(4) = 8.3e-260
 Identities = 168/168 (100%), Positives = 168/168 (100%), Frame = -2

Query:     1 MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKERFHKSFDKFYD 60
             MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKERFHKSFDKFYD
Sbjct:  3570 MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKERFHKSFDKFYD 3391

Query:    61 KYKDFYFIKFGQHDCIVLNSPKLIKQVVIEQSDSVFERFHTPSIKRYAQGKSILGCSPDE 120
             KYKDFYFIKFGQHDCIVLNSPKLIKQVVIEQSDSVFERFHTPSIKRYAQGKSILGCSPDE
Sbjct:  3390 KYKDFYFIKFGQHDCIVLNSPKLIKQVVIEQSDSVFERFHTPSIKRYAQGKSILGCSPDE 3211

Query:   121 WKKLRSFIVISFSKNKMGQQVLDKIFHTQYLKFENHIKKLIKSNNNIV 168
             WKKLRSFIVISFSKNKMGQQVLDKIFHTQYLKFENHIKKLIKSNNNIV
Sbjct:  3210 WKKLRSFIVISFSKNKMGQQVLDKIFHTQYLKFENHIKKLIKSNNNIV 3067

 Score = 409 (149.0 bits), Expect = 8.3e-260, Sum P(4) = 8.3e-260
 Identities = 75/81 (92%), Positives = 77/81 (95%), Frame = -2

Query:   222 FEIFTSYKDIDGVVKEMYALVKPFLEKYLKQHDRNNPKCALDHMINCILDQDEPKLITYE 281
             + +   YKDIDGVVKEMYALVKPFLEKYLKQHDRNNPKCALDHMINCILDQDEPKLITYE
Sbjct:  2754 YRLTEQYKDIDGVVKEMYALVKPFLEKYLKQHDRNNPKCALDHMINCILDQDEPKLITYE 2575

Query:   282 HLPHFLMDMFIGGTESTARTM 302
             HLPHFLMDMFIGGTESTARTM
Sbjct:  2574 HLPHFLMDMFIGGTESTARTM 2512

 Score = 303 (111.7 bits), Expect = 8.3e-260, Sum P(4) = 8.3e-260
 Identities = 60/60 (100%), Positives = 60/60 (100%), Frame = -2

Query:   168 VTLEPEFKRLTISIIFNFQFGTDLEFTDPLIDSLLVCTEKIIASCQKASDLMPIFEIFTS 227
             VTLEPEFKRLTISIIFNFQFGTDLEFTDPLIDSLLVCTEKIIASCQKASDLMPIFEIFTS
Sbjct:  2997 VTLEPEFKRLTISIIFNFQFGTDLEFTDPLIDSLLVCTEKIIASCQKASDLMPIFEIFTS 2818

>Contig_2198, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion translate frame +1 translate plus frames translate all frames
GATGATGATGAAGATGAAGATGAAGATGAAGAACATGATGAAGATGAAGAAGAGGATGAT
GATAATATATTTTATGAATCATTTATTAATGGTGATATTAAAATTTGTAAATTAATTGAA
AAATTTTGTCCTAATCAATTTAAAATAACAAAGAAATCAATTTTAAATGCAATTAAAAAT
GGAAATATTCACATTGTTAAATATTATTATAATCATATTAAAAATAATGATCAATTATTA
TTTAATAATTTTAAAAGTTTATTTTCAAAATATTTTTTATAATAAACATTTTATTTATTT
ATTAATTAATTAATTTATTTATTTAATTGTTTTTTTTTTTAAATTTTTTTTTATTTTTTT
TTATTTATTTATTCTAATAAAATACACCAATCAATATTTGGTTTAATTGTTTTTGAATTA
TTTAAAGTTGAAACAGCCATATGACCAGCAAAGAATTCTGTATTATATTCAGTACCATTA
TCATCGATTTTAACAGGTACAAAGAGGAAACCTCTTGGGACGTCATTGGTATTGACAAAG
ATCCAATCAGTGTTGAATTTTCCTCCTCCAAATTCATGTCTTTCTTTATTTTTACCCATC
CATTTACCATCATTATCAAAAACACAGAAAGCTGTAATCCAACCACTGATATAACGTGGA
CCTGAGCCTGCACCTTCTTGACTGGCAATACGATTCCACCATTCAATATCTGGTTTACCA
TTGACGGAACTAATGATTTTATCAATGATTGGTGAGAGCATTTCAGACCAATCTTTCATA
ACTGAACCATCTTTATTATCTTTGAAATCAAATTCTTTTAATTTTTCAATTCTATTCTTT
ATATCGACCCAATCATCGAGGGTACCCTCTAATGTTACCTCTGGTAAACCACACATTAAA
CACATTTTAAAGTCAAAGTAACTTTTAACGGTTGCCATTAATGCAATTGATGCTGCCATT
GTATCGTTTGGAGTTGTAGTTGAAAATGGTTTGTTTGCCCATTCTCTAATTGATGGATCT
TTGATATTCTTTTCAATTTCTTGAGTCATTCTAATGGTTAATGATTGATAATCTGCTGTC
ATCAATTTTCCACCACCCCTTACTACCAATTGTTTTTTTCCTTGGAAATCAACAATCTTA
CTTCTTAATTGTTCAGCGTTGGCATTTAAATATGACGAGAATTGTACCAAAATTGCCATC
CAAATATCATCTGGACGAATGATTAAAGAATGGTGATTTGAATACGCAATGAACGAACTT
AAAACAAATGAATTACCGCCTACACTCTTTGAAATACCTTCTTTGATACTACTTCTAACT
ACTTTTCTTTCTGGTTCATTTGGTTTACCACCATACAATTTTTGGTGTGAAAAATCTAAT
CCTTTGAATTCACTCTCTCTTACATTTGCTACTTTAAATGTTATAGACATTGTTGTTATT
GTTCTGCTGTTGTTGAAAAAATGGAATTAATAAAAAAAAAAAAAAAAAAATTTATATAGT
TAATAAAAAACAAAAAAAACAAAAAAAAAAAACCCAAAAAAAACAAAAAAAAAAACCAAA
AAAAAAAATTTAAAATTTAATTTAAAAAAATAAAATAAAAAATAATTTAATTGAAAATTA
CTTTCTCGGGTGGGAAATTCAAAGTAGAATTTCGTAAGGTCCGAGCGTTGAAAATAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAATGAAAATAATAGTTTTTTTTGGAATTTGGACTC
ATTTGTTTTTGTTAATTAAAAACATTTAATGATTTTTTTTTAATTAAAAAAATAAAAAAA
AAAAAAAATCTTTTAATTTTTTTTTACAATTGTAAAAATACCTTATCTAATTTTAATGTT
GTATAATAATTTATTTATTTATGATCTCTTTCAATTACTTGACATGACCATTGTTTTGGT
CTAATTGAATTAATTGTAAAAGCTTCTTCTGATATTGGTGTATTGTCAATTGGTTGGAAT
TCAAAAGACTGGAATAATCTAGAGAATATAATAAACAATTCATCGATAGCAAATGACATG
AATGGACATGTTCTAATACCAATATCAAATGTCATATGAAGGATTTCAGGATTATCACCA
ATGTAACGATTTGGATTGAAAGTTAATGGGTCTTTGTGATATTGTGGATCAAAGTTGAAT
GAATGTGCATTTGGTATGATTAATGTACCAACTGGAATAGTATAACTTTCACCTTTGGCA
CTACATTCATGTTTTAAAACAATTGGATCATTAACAACACGTGAAGCAATAATAGGTTGA
ATTGGTCTTAATCTATGAATTTCTTTAATTGATGCATTTAACAATGGATACTTTTGTTTA
TCAACTAAAACTGGTAATCTAATACCAACATCTAATAATTCTGTACGTATTCTATCTTGC 2400
ATCTCTTTTCTATTTGTCATCATTAATGTAAACCAGTCCTGTATTTATATAAACAACAAA
ATAATTAGTAACTATTGTTTGGGTAGCACTTTCACATTAATGTATAATTACCATTGTTCT 2520
AGCAGTTGATTCAGTACCACCTATAAACATATCCATTAAGAAATGTGGCAAATGTTCATA
TGTAATTAATTTTGGTTCATCTTGATCTAATATACAATTTATCATATGATCTAATGCACA
TTTTGGATTATTTCTGTCGTGTTGTTTTAAATATTTCTCTAAAAATGGTTTAACCAATGC 2700
ATACATTTCTTTAACTACACCATCAATATCTTTATACTGTTCGGTCAATCGATAATTTAC
AAATTAATAAATTATACATACTACACTATTAATAATAATAATAAATAAAATACATACACT 2820
TGTAAATATTTCAAATATTGGCATTAAATCTGATGCTTTTTGACAACTTGCAATTATTTT
TTCAGTACACACTAATAAAGAATCAATTAATGGATCAGTAAATTCTAAATCTGTACCAAA 2940
TTGAAAATTGAAAATAATACTAATTGTAAGTCTTTTAAATTCAGGCTCCAATGTAACCTA 3000
TACAAATAAATAAATTAATATGTGTGCAGTTGTAATTTATATAATTATTATTGATTTGAT
GGTACATACGATATTATTATTTGATTTGATTAATTTTTTAATATGATTTTCAAATTTTAA
ATATTGAGTATGAAATATTTTATCCAATACTTGTTGACCCATTTTATTTTTTGAAAATGA
AATTACAATGAATGATCTTAACTTTTTCCATTCATCAGGACTACATCCTAAAATACTTTT
ACCTTGAGCGTATCTTTTAATTGATGGTGTATGAAATCTTTCGAAAACACTATCACTTTG
TTCTATTACAACTTGTTTAATTAATTTTGGTGAATTTAAAACAATACAATCATGTTGACC
AAATTTAATGAAATAAAAATCTTTATATTTGTCATAAAATTTATCAAAACTTTTATGAAA
TCTTTCTTTATCAAATGCATATAATGCACCTAATAATGGTAAATTGAATGGACCTTGTAC
TTTTCGTAAATCATTTTTACCACCATTATTTAATAAATATTTATTTACAAAAATTACTGT
TAAAATTATTATTACTATAGTTAATATCATTGTGTGTTATTATTAATTTATTGGTTTTTA
TGAATATGAAAAAAAAATAATAGTGGTCTGTGTGTGTTTAAATATAAAAAAATTTCATTT
TTAATTGAATAAAAAATAAAAAAAAAAATAAAAAAAAAAAATAAATAATTTTAAATAAAT
TTAATTTTTTAAATTATTATTATTATTGAATTATATTTATTTATTTTGAATGTTGTAGGG
AATTTTCAATAAAATTTCACTACACAGCGTTCATCATCATCCATTGGCATATTAATAATA
TTAACTGGTAATAGTAATGGCGAATCTAATTGTTGTTGTTGTGGTGATGTTGTTGTTTTT
GTTGTTGTTGAAGAAGATGAAGAAGATGAAGAAGATGAAGATGATCTTAATTGCAGTGGT
TGTAGTTGATTTATTACATTCTTCAATTGGTTCCTCTGGAATTGCATGCTTCAGTTTCTT
GTTTAGTTAATTCTTTACTTGGTTTAATTTCTTGTAAAATTTGACTAGCCGTTGTTACAT
TATGTTGATTTAAATCTTGTGAGGAAGATAAAGAATTCTCTGCATTAACTTTTAATTGTA
AAATATCAAATGGTGATAATACTTCAGGTGCTTCATCAATCATTTTGGATGATTGTTACT
ATTATGTTGTTGTTGTTGTGATGATGATGATATTGATAATGATCAAGATGAAGCTGATTG
ATGATTTTTTCTTTCTTTACCAAGGAAATCAGTTGATCAATTCAAAGAAGAATTTAATTT
TGGCATCATATTGCACCGACTGATGGTTTGCGTTGAATAAAACTTGTTGATCTGGTGATG
ATGGTATACCCCTAATTGTTGGGGATTATCCAACTGATACACTTGTAGTAGTATTACCAC
CAATATTATTACTACTAGTATTTAATGGAGTTGGAATTGGTGTCGATGCACTACTTAATG
ATGGTGAAGATTGAGCTGATGATATCACATTATTATTATTATTATTTATTTTTTATTATT
GATAACAACTGGAGTAACACTTAGAATTGTTTTGAAAGAAACCTTCACTATCTAATTTTG
CTGCTTTTGTTCTTTCATCTTTTGAAATTAATACCCAATTAGTATTTTAATTTTGGTAAT
TCATTAATAACAAATAATATAAACTCTTCACATTTCTTTAAATTAAATTATTTTCAAATG
ATAAATATTTGAAGTTTTAGATTTCTTTAATGTTTGTTATTTGTTGTCCTTAAATTGAAA
ATATGAAAATAAAAAAAAAATGAAAAACAAAAACGAAAAAAAAAATGAGAAAAAAAAAAA
AAAAAAAAAAAAACCCTTTAAAAAACCTTAATTAAACCTATTATTCAAATAAAAAATATT
TTAATAACCTTTTTTTGTTTTTTAAAAAAAAAAAAAATTAAGGGCCAGGATTTTTTTTTT
TTTCCCTCCCCTC


>AFB727 (AFB727Q) /pub/dna_csm/LIBRARY/AF/AFB7-B/AFB727Q.Seq.d/
        Length = 1042

  Plus Strand HSPs:

 Score = 246 (91.7 bits), Expect = 3.7e-20, P = 3.7e-20
 Identities = 49/49 (100%), Positives = 49/49 (100%), Frame = +2

Query:     1 MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKE 49
             MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKE
Sbjct:    14 MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKE 160



>IIAEP1D0380
        Length = 907

  Plus Strand HSPs:

 Score = 246 (91.7 bits), Expect = 4.2e-20, P = 4.2e-20
 Identities = 49/49 (100%), Positives = 49/49 (100%), Frame = +1

Query:     1 MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKE 49
             MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKE
Sbjct:    88 MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKE 234

CYP519H1P =  old CYP523A1 seq 39, 73 complete 41% to seq 14 487 aa
MFIIYFIFLFLLIISLFIDF (0)
IKKNLKKSNNDPPGPISLPLLGNLHNLTKNPHRGLKNLSDKYGGIFRCYLGDHYSIVVSDP 
VIINEIYIKKFEKVCTRPNNDTFKMFSSGFKDLAFSDNYNIWSKIRTIVNSNFTKTKLT
KTIYNYLEDQTNQLIENMGNYSKSGEPV (1)
FLSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN frameshift
FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLDHD (bad GT boundary)
PKDLMDMLIIDSTKGKDEDKEPIVHIGYDFLMVDQIRHLVLWN (1)
DCQEKAYNEIVSVMGEDCNKISYADRP
KLPYLVACINECLRMRTEDPLGIPRGAVEDIEINGYFMPKGAKVHHYLYAFGMNETVFEN
VNKFQPDRWLTNDQVHLKQMLNHLVPFSVGPRSCFGKNLSELEVFVVCSNILLNFELSSY
NGKPVDDFEIFGIHPPEFPVKLIKRK*
Contig1006 Chr 2
IIAFP1D52879
JAX4a43c04.s1 N-TERM
JAX4a44c04.r1
JAX4a127e01.r1
JC2a56d05.r1
JC2a201c07.r1
JC2b169a08.r1 
c-JC2b169a08.r1
JC2d58c03.s1
JC2e17c09.r1 similar to seq 39 may be seq 39
Length = 501
KISLNXIWGLFFSKKILQNKIFXNXKIKKIPXPIQIFXKKXXPPNX
FXNFXXILSPLLYFTKKNYQKNXSTSTNFIXXINXXHLKNLXXXXX
PKNLMDILIINSTKGKDKNKKPIXHIXYNFLMVGSNX


>Contig_4699, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 3784

  Plus Strand HSPs:

Query:    13 IISLFIDFIKKNLKKSNNDPPGPISLPLLGNLHNLTKNPHRGLKNLSDKYGGIFRCYLGD 72
             II L+   IKKNLKKSNNDPPGPISLPLLGNLHNLTKNPHRGLKNLSDKYGGIFRCYLGD
Sbjct:  2076 IIILYYI*IKKNLKKSNNDPPGPISLPLLGNLHNLTKNPHRGLKNLSDKYGGIFRCYLGD 2255

Query:    73 HYSIVVSDPVIINEIYIKKFEKVCTRPNNDTFKMFSSGFKDLAFSDNYNIWSKIRTIVNS 132
             HYSIVVSDPVIINEIYIKKFEKVCTRPNNDTFKMFSSGFKDLAFSDNYNIWSKIRTIVNS
Sbjct:  2256 HYSIVVSDPVIINEIYIKKFEKVCTRPNNDTFKMFSSGFKDLAFSDNYNIWSKIRTIVNS 2435

Query:   133 NFTKTKLTKTIYNYLEDQTNQLIENMGNYSKSGEPVF-LSTITIYMKISLNVICKLFFS 190
             NFTKTKLTKTIYNYLEDQTNQLIENMGNYSKSGEPV  +  I IY+ I + ++ +  F+
Sbjct:  2436 NFTKTKLTKTIYNYLEDQTNQLIENMGNYSKSGEPVCKIFNIYIYIYIYI*ILIQF*FN 2612

Query:   170 LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN 222
             LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN
Sbjct:  2633 LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN 2791

Query:   218 LGAGNFGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLDHD 266
             +G+  FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLDHD
Sbjct:  2776 VGSR*FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLDHD 2922

Query:   267 PKDLMDMLIIDSTKGKDEDKEPIVHIGYDFLMVDQIRHLVLWN 309
             PKDLMDMLIIDSTKGKDEDKEPIVHIGYDFLMVDQIRHLVLWN
Sbjct:  2973 PKDLMDMLIIDSTKGKDEDKEPIVHIGYDFLMVDQIRHLVLWN 3101

Query:   310 DCQEKAYNEIVSVMGEDCNKISYADRPKLPYLVACINECLRMRTEDPLGIPRGAVEDIEI 369
             DCQEKAYNEIVSVMGEDCNKISYADRPKLPYLVACINECLRMRTEDPLGIPRGAVEDIEI
Sbjct:  3131 DCQEKAYNEIVSVMGEDCNKISYADRPKLPYLVACINECLRMRTEDPLGIPRGAVEDIEI 3310

Query:   370 NGYFMPKGAKVHHYLYAFGMNETVFENVNKFQPDRWLTNDQVHLKQMLNHLVPFSVGPRS 429
             NGYFMPKGAKVHHYLYAFGMNETVFENVNKFQPDRWLTNDQVHLKQMLNHLVPFSVGPRS
Sbjct:  3311 NGYFMPKGAKVHHYLYAFGMNETVFENVNKFQPDRWLTNDQVHLKQMLNHLVPFSVGPRS 3490

Query:   430 CFGKNLSELEVFVVCSNILLNFELSSYNGKPVDDFEIFGIHPPEFPVKLIKRK 482
             CFGKNLSELEVFVVCSNILLNFELSSYNGKPVDDFEIFGIHPPEFPVKLIKRK
Sbjct:  3491 CFGKNLSELEVFVVCSNILLNFELSSYNGKPVDDFEIFGIHPPEFPVKLIKRK 3649

>Contig_4699, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion translate frame +1 translate plus frames translate all frames
TAAAATTTTTTTTTTTTTTTTTTTTTAAAAATAAGGTTTTTTTTGGAAAAAAAAAAAAAA
AATTTTTTAAAAAAAAAAAAAAATTTTAAATTTAAAAAAATTTGAAATTTTTTTTTTTTT
TTTTTTTTTTTTTTTTTTAAAGGTTTGAAAATTCTCCATTGGGGCTCGGTTTTTTTTTAA
TTTTAATTTTAATTTTTAATAAATTAATTTTTTTTTTTTGGGTTATTTTTAACTTTTAAA
AATTGGGAAAAAAAAATATTTTGGATTTTTAAAAAAAAAATGTTTTTTTTTTTTTAACAA
TTTTATTCCTTATTAAATTTCCAAAATGGGTTTTTTTTTTTTTTTTTTTTTTAATGGAAA
AAAATTGGTTTTTGGAAAATATAGATTTACATCCATAATATCAAAAATCGATATTTTTAG
TTAAAAAAAAAAAAGTATTTTGGGGGGTTTTTTTTTTTTTTTTTTTTATTTAATTAGTGG
TCATATGTTATAAAATAAAAAAAATTATTTAAATCATATGACCACTTAAAAAAATAGTCT
AAATGTGTATTAGTGATATAAGTAGATTATTTTAAAAAGTTTCTTTTCTCGTTTATAAAT
ATGAAAATTTGATTTAATAAAAAATAAAGAAAAATAATAAATTTTTTTAGGTGGAGATTT
CGATTTTAGTTTTAAAATATATTTGGATTGCAAATAATTTTTTTGAGATTTTGGAGGGTG
GTTTAATTTTTATCTAATTGGAAAAAAAAGGTTAATTAAAATTTAATTTTTTTATTTTTT
TATTTTTTATTTTATTTTTTTTTTTATTATTTCTATTTATAATATATAATATATAAAAAA
AAAAAAAAAAAAAATTTTTTTTTATTATAATCCCTTAATCTAAAATAAAAAAATTAATAC
AAGCTCAACACCCAATTATTTAAATACCCTTGAAACTAAATATGTTTATGAAAATTTAAT
GGAATTATTTAGAGTTGTATGAAGTTGGTGGACAAATAATGAAAGAGAATGCAATGGATG
AAGCTTTTGCCAATAAAGAAGGTAATCATTCTCGATTGGCTTTGAAAACATTTGGACAAT
TGGGTCGCTATAATATTGAACACTATGAAAGATGTCAATCTACTATTTCAAGAGTTTATA
AATATGTTAATAAGGAAAATATTAATGTTTTATATTCATTTCTAATTTGTTGTGAATTAT
CAGCTTATGCATTCCATGAAAAAAGCTATGGAAGAATTTGGTGTAAAGGATCTTTTTTAT
TTCCATGTCCATTTAACTGCCGATAATCATGATGATGGACATGGTGTGAAAATGTTAAAA
AATGACCATAGAAGAGTTTAATTTAGGAATTCAATTATATTTAACTTTTTTTTTTTTCAG
ATATTTTTGATAATTAAAAAATAATTTCAATTGAAAAACAATAAGAAATTCATTTATTTT
CTTATGCTTGGATCCTAAAAAAAAAAAAAAAAAAAATTTATAAAATGTGTTACTTTTATT
TTTTTAAACTTTAAGAAAAAATTAAAAAAAGTATTTCATTTTATTTTTATCTTTACTTTT
ATTTTTTTATTTATTTTTCATCTTTTTTTTTTTTAGGATCCAAGCATAAAAATTTTTAAA
CATTTTTTTTCCAATTAAATATTAAATTTGATTTACCTTTATAGAACATTGTCTTTTGAT
AAAAATGGAGTTTATTTTTAAATCCTAATTTTTCCTAAATATTTTAAATGAATTTGTAAA
AGGTTAGTTAATATTTTAGAAAATAATATTTGGGGTGATCTATCGAATAATTTTTTTAAT
TGGATTGTGTGATGTGTGTGAGTTAGTTATATTTTATAATTCTGGGGTATTATAAGATTT
TCAATTTAGTTTTTTTTAATTTTTTTTTTTATTAAAATTTTTTTTTTGAAAATATACATA
CCGATTGAATATTAATACAATGTTTTATCATTTATTTTATTTTTTTATTTTTATTAATAA
TTTCTTTGTTTATTGATTTTGTTTGTATATAAATTTTTCAAAATATTTTTTTTTTAATAT
AAAATATAAAAAAAAAAAAAAAAAAAATTATTAACATTATTATTTTATATTATATTTAGA
TAAAAAAAAATCTAAAGAAAAGTAATAATGATCCACCTGGGCCCATTAGTTTGCCATTAT
TAGGAAATCTTCATAACCTCACAAAAAACCCACATAGAGGTCTTAAAAATCTTTCTGATA
AATATGGTGGGATTTTTAGATGTTATCTTGGTGATCATTACAGTATTGTAGTAAGTGATC 2280
CTGTGATAATTAATGAAATTTATATTAAAAAATTTGAAAAGGTTTGTACAAGACCCAATA
ATGATACATTTAAAATGTTTTCAAGTGGTTTTAAAGATCTTGCATTTTCAGATAATTATA
ATATTTGGAGTAAAATTAGAACAATAGTTAATTCAAATTTTACAAAAACAAAATTAACAA
AAACCATTTACAATTATTTAGAAGATCAAACTAATCAATTAATTGAAAATATGGGAAATT
ATTCAAAATCTGGTGAACCTGTATGTAAAATATTTAATATATATATATATATATATATAT 2580
ATATATAAATTTTAATACAATTTTAATTTAATTATTATTATTATTATTAGTTTTATCCAC 2640
AATTACAATATATATGAAAATTTCATTAAATGTAATATGTAAATTATTTTTTTCAAAAGA
AATTTTACAAAATGAAAGTTTTGATAATGGGAAAATGAGAAGAATTGCAGTACCTATTCA 2760
AATAGTTTGTAAAGAGTTGGGAGCAGGTAATTTGGTGATTTTGTGGGTATTTTATCACCT
TTACTTTATTTTACAAAGAAAAAGTATCAAAAAAACAGTTCAACATCAACAGATTTCATA
GGTGAAATTAATGATGAACATTTAAAAAATTTAGATCATGATTAAAATTATTTTATTAAC 2940
CTAAATAATAAAAAAAAAAACAATAATAAAGTCCAAAAGATTTAATGGATATGTTAATTA
TAGATTCAACAAAAGGTAAGGATGAAGATAAAGAACCTATTGTTCATATTGGATATGATT
TTTTAATGGTGGATCAGATTCGTCATCTGGTATTATGGAATGGTTCACACTTTTCATGAT 3120
TAATAATAAAGATTGTCAAGAAAAAGCATACAATGAAATTGTTTCAGTAATGGGTGAAGA
TTGTAATAAAATTAGTTATGCTGATCGTCCAAAACTTCCATATTTAGTTGCATGTATAAA
TGAATGTTTAAGAATGAGAACTGAAGATCCATTAGGTATACCAAGAGGAGCAGTTGAAGA
TATTGAAATCAATGGTTATTTCATGCCAAAAGGTGCAAAAGTTCATCATTATCTTTATGC
ATTTGGTATGAATGAAACTGTTTTTGAAAATGTAAATAAATTTCAACCAGATAGATGGTT
AACAAATGATCAAGTTCATTTAAAACAAATGTTAAATCATCTCGTTCCATTCTCAGTTGG
TCCTAGAAGTTGTTTTGGTAAAAATTTATCAGAATTAGAAGTATTTGTAGTTTGTTCAAA
TATTCTATTAAACTTTGAATTATCATCATACAATGGTAAACCGGTTGATGATTTTGAAAT
ATTTGGAATCCATCCACCTGAATTCCCAGTTAAATTAATTAAAAGAAAATAAAATATTTT
TTTTTTATTTTTTTTTTTATTATTAAATATATTTTTATTAATTTTATTATTATTATTATT
AATTTTTAAAAGTTTTAAAATCTAAATTATCCCAATTTAAATTAATACCATTTTTAATAG 3780
AATC


CYP524A1 Seq 91 complete seq only one intron 468 aa 25% to 10, 42 
34% to arabidopsis AC004077 CYP710
MKTPTKYFIIFILLAALAVF (0)
KGSLPGPSFVPPFFGMLFQLIFT
PFSFYEKQEKYGPISWTSIMNKFVLFVTDAEINRQVFKEENAKLYLSLGAKKILTEKAIPFIEG
APHRQLRKQLLPLFTIRALSSYLPIQESIVDEHIAMWIKNGKADINARNNCRDLNMAISTGVFV
GNNTPESVRDDIAKNFFVMNEGFLCLPIDLPGTTLRKAINARVRLVEIFTDIIAKSRKRMGDGE
KPQSLIDLWVEHFLNCPKEERDELSNDTIIFTLLSFMFASQDALTS
SLVWTVQLMAEHPDILAKVRAEQASLRPNNEKLDLDTMRQATYTRMVVSEILRFRPPAVMVPHE
NIEDIVIGDNVHVPKGTMILPSIWSAHFQEGGYSDPYKFDPQRFDSVRKEDVTCAKNSLVFGAG
PHFCIGKELAKNQIEVFLTKLAMSTEWTHNKTPGGDEIIFGPTIFPKDGCNITIKARN*
Contig5056 Chr 6
IIAFP1D8969
IIAFP1D16522
IIAFP1D78718
IIAFP1D75849
IIAFP1D85002 
JAX4b34b12.r1 
JC1a01g08.r1
JC1a216b01.s1
JC1a217d02.s1
JC1a219g03.r1
JC1a220f04.s1
JC1a236c06.r1
JC1a278e02.r1
JC1a297d10.s1
JC1b17h02.s1
JC1b22g10.s2
JC1b23b12.r2
JC1b23b12.s2
JC1b53h02.r1
JC1b53h02.s1
JC1b81a03.r1
JC1c81e02.r1  
JC1b107f09.r1
JC1b107f09.s1
JC1b108a09.s1
JC1b129f10.r1
JC1b131a08.s1
JC1b141e07.r1
JC1b149b11.r1
JC1b153e11.r1
JC1b153e11.s1
JC1c21e05.r1
JC1c21e05.s1  
JC1c36a06.r1
JC1c49b03.r1
JC1c123f05.r1 
JC1c123f05.s1 
JC1c134c01.r1
JC1c144a09.r1 
JC1c197d08.s1 
JC1c231h06.s1  
JC1c245b05.s1 
JC1c271e07.r1 
JC1c288d09.r1
JC1c288d09.s1
JC2e29b11.r1  
JC2e29b11.s1  
sdic6B36g12.p1c
sdic6Ca4.p1t  
VSA365
VSH354
VSH529

CYP525A1 Seq 62 33% to seq 22 no ESTs
MEDYSVQSVVSLVVFLLVL (0)
QILKYYNKTNKNNKYNLPKGPSFLKC (2)
QEELIEDTSENTVLKWFNQLNSDNYSVSFFGRPMIFTRDTTISKYILSSNNIDNY
TKPPDSSGVLIRLAQNSILMSEGDQWRYHRSIINQPFSSKNVKLMIPTIITTINKLINHL (?) 
Possible insertion of 54 NNNNNN here
TIIIDIHSYCTKLTFDIIGKLSIG
YDFNSIESSDNDNDNNDDDDISKQFDFILNEMIRPIRRFSSYLPLYNDIKLFKFLNELES
IIKGAINSRSLITDNNNNKTYKKNFLLDNLLDDNVKEKD (0)
IIGNINTFLLAGHETSANLLTFIFYLLSTHNNVQNDLYNHLIENQKKKINKDNKFTEEDE
DYQSIEFLDWVIYETLRLFPPAPMIGRTSKNDDILKSGN
NNNNNNNNISIPSETLILISV
YAIHRDPKLWKDPNIFNPYRWKNIENINNRSDFIPFSSGGRVCVGQKFSIVEARIIISKL
ILNFELSFNNLKSKPFKIYQRATLPPKYPVFLNFKKRENK*
Dict-IV-V885a03.q1c
JC1b219h01.r1 
JC2d53a04.r1
JC2d108d01.r1

>sdi45A310e06.q1c 146570 letters
        Length = 146,570

  Plus Strand HSPs:

 Score = 1155 (411.6 bits), Expect = 3.6e-243, Sum P(4) = 3.6e-243
 Identities = 219/220 (99%), Positives = 219/220 (99%), Frame = +2

Query:     1 MEDYSVQSVVSLVVFLLVL 19
             MEDYSVQSVVSLVVFLLVL
Sbjct:  6692 MEDYSVQSVVSLVVFLLVL 6748

Query:     8 SVVSLVVFLLVL-QILKYYNKTNKNNKYNLPKGPSFLKCQEELIEDTSENTVL--KWFNQ 64
             S+  L  F ++  QILKYYNKTNKNNKYNLPKGPSFLK     + +  +  +   K  + 
Sbjct:  7188 SIQFLTPFQII**QILKYYNKTNKNNKYNLPKGPSFLKWFINYLFNFYDLKLSNNKEEDN 7367

Query:    22 LKYYNKTNKNNKYNLPKGPSFLKCQEELIEDTSENTVLKWFNQLNSDNYSVSFFGRPMIF 81
             LK  N   ++N  N  K  + L  QEELIEDTSENTVLKWFNQLNSDNYSVSFFGRPMIF
Sbjct:  7335 LKLSNNKEEDNNNNNNKSNNSLS-QEELIEDTSENTVLKWFNQLNSDNYSVSFFGRPMIF 7511

Query:    82 TRDTTISKYILSSNNIDNYTKPPDSSGVLIRLAQNSILMSEGDQWRYHRSIINQPFSSKN 141
             TRDTTISKYILSSNNIDNYTKPPDSSGVLIRLAQNSILMSEGDQWRYHRSIINQPFSSKN
Sbjct:  7512 TRDTTISKYILSSNNIDNYTKPPDSSGVLIRLAQNSILMSEGDQWRYHRSIINQPFSSKN 7691

Query:   142 VKLMIPTIITTINKLINHL 160
             VKLMIPTIITTINKLINHL
Sbjct:  7692 VKLMIPTIITTINKLINHL 7748

SKNVKLMIPTIITTINKLINHLNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNTIIIDIHSYCTKLTFDIIGKLSI

Query:   158 NHLTIIIDXIHSYCTKLTFDIIGKLSIGYDFNSIESSDNDNDNNDDDDISKQFDFILNEM 217
             N+ TIIID IHSYCTKLTFDIIGKLSIGYDFNSIESSDNDNDNNDDDDISKQFDFILNEM
Sbjct:  7902 NNNTIIID-IHSYCTKLTFDIIGKLSIGYDFNSIESSDNDNDNNDDDDISKQFDFILNEM 8078

Query:   218 IRPIRRFSSYLPLYNDIKLFKFLNELESIIKGAINSRSLITDNNNNKTYKKNFLLDNLLD 277
             IRPIRRFSSYLPLYNDIKLFKFLNELESIIKGAINSRSLITDNNNNKTYKKNFLLDNLLD
Sbjct:  8079 IRPIRRFSSYLPLYNDIKLFKFLNELESIIKGAINSRSLITDNNNNKTYKKNFLLDNLLD 8258

Query:   278 DNVKEKDIIGNIN 290
             DNVKEKD+   IN
Sbjct:  8259 DNVKEKDVCNIIN 8297



Query:   285 IIGNINTFLLAGHETSANLLTFIFYLLSTHNNVQNDLYNHLIENQKKKINKDNKFTEEDE 344
             IIGNINTFLLAGHETSANLLTFIFYLLSTHNNVQNDLYNHLIENQKKKINKDNKFTEEDE
Sbjct:  8516 IIGNINTFLLAGHETSANLLTFIFYLLSTHNNVQNDLYNHLIENQKKKINKDNKFTEEDE 8695

Query:   345 DYQSIEFLDWVIYETLRLFPPAPMIGRTSKNDDILKSGNNNNNNNNNISIPSETLILISV 404
             DYQSIEFLDWVIYETLRLFPPAPMIGRTSKNDDILKSGNNNNNNNNNISIPSETLILISV
Sbjct:  8696 DYQSIEFLDWVIYETLRLFPPAPMIGRTSKNDDILKSGNNNNNNNNNISIPSETLILISV 8875

Query:   405 YAIHRDPKLWKDPNIFNPYRWKNIENINNRSDFIPFSSGGRVCVGQKFSIVEARIIISKL 464
             YAIHRDPKLWKDPNIFNPYRWKNIENINNRSDFIPFSSGGRVCVGQKFSIVEARIIISKL
Sbjct:  8876 YAIHRDPKLWKDPNIFNPYRWKNIENINNRSDFIPFSSGGRVCVGQKFSIVEARIIISKL 9055

Query:   465 ILNFELSFNNLKSKPFKIYQRATLPPKYPVFLNFKKRENK 504
             ILNFELSFNNLKSKPFKIYQRATL PKYPVFLNFKKRENK
Sbjct:  9056 ILNFELSFNNLKSKPFKIYQRATLTPKYPVFLNFKKRENK 9175


>49. Cyp3a44     mouse AB039380 Tsutomu Sakuma 10-JAN-2001
     Length = 504

 Score = 360 (126.7 bits), Expect = 2.1e-35, P = 2.1e-35
 Identities = 131/502 (26%), Positives = 249/502 (49%)

Query:     1 MEDYSVQSVVSLVVFLLVLQIL-KYYNKTNKN-NKYNLPKGPSFLKCQEELIEDTSENTV 58
             M  +S  S+ +LV+  ++L +L +Y  +T+    K  +P GP  L     ++   +  T 
Sbjct:     1 MNLFSALSLDTLVLLAIILVLLYRYGTRTHGLFKKQGIP-GPKPLPFLGTVL---NYYTG 56

Query:    59 LKWFNQLNSDNYSVS---FFGR-PMIFTRDTTISKYILSSNNIDNYTKPPDSSGVLIRLA 114
             +  F+    + Y  +   F G+ P++   D    K +L  + +  +T   +   V I   
Sbjct:    57 IWKFDMECYEKYGKTWGLFDGQTPLLVITDPETIKNVLVKDCLSVFTNRREFGPVGIM-- 114

Query:   115 QNSILMSEGDQWRYHRSIINQPFSSKNVKLMIPTIITTINKLINHLTIIIDXIHSYCTKL 174
               +I +S+ ++W+ +R++++  F+S  +K M P I    + L+ +L    +       K 
Sbjct:   115 SKAISISKDEEWKRYRALLSPTFTSGRLKEMFPVIEQYGDILVKYLRQEAEKGMPVAMK- 173

Query:   175 TFDIIGKLSIGYDFNSIESSDNDNDNNDDDDI---SKQF---DFILNEMIRPIRRFSSYL 228
               D++G  S+    ++    + D+ NN +D     +K+F   DF  + ++  +  F    
Sbjct:   174 --DVLGAYSMDVITSTSFGVNVDSLNNPEDPFVEEAKKFLRVDFF-DPLLFSVVLFPLLT 230

Query:   229 PLYNDIKLFKFLNELESIIKGAINSRSLITDNNNNKTYKKNFL-LDNLLDDNVKEKD--- 284
             P+Y  + +  F N+     K  ++ R   +  ++N+ ++ +FL L     +N K+KD   
Sbjct:   231 PVYEMLNICMFPNDSIEFFKKFVD-RMQESRLDSNQKHRVDFLQLMMNSHNNSKDKDSHK 289

Query:   285 IIGNINT------FLLAGHETSANLLTFIFYLLSTHNNVQNDLYNHLIENQKKKINKDNK 338
                N+        F+ AG+ET+++ L+F  Y L+TH ++Q  L   +     K +   NK
Sbjct:   290 AFSNMEITVQSIIFISAGYETTSSTLSFTLYCLATHPDIQKKLQAEI----DKAL--PNK 343

Query:   339 FTEEDEDYQSIEFLDWVIYETLRLFPPAPMIGRTSKNDDILKSGNNNNNNNNNISIPSET 398
              T   +    +E+LD V+ ETLRL+P    + R  K D  L          N + IP  +
Sbjct:   344 ATPTCDTVMEMEYLDMVLNETLRLYPIVTRLERVCKKDVEL----------NGVYIPKGS 393

Query:   399 LILISVYAIHRDPKLWKDPNIFNPYRWKNIENINNRSDFI--PFSSGGRVCVGQKFSIVE 456
             +++I  YA+H DP+ W DP  F P R+   EN  +   ++  PF  G R C+G +F+++ 
Sbjct:   394 MVMIPSYALHHDPQHWPDPEEFQPERFSK-ENKGSIDPYVYLPFGIGPRNCIGMRFALMN 452

Query:   457 ARIIISKLILNFELSFNNLKSKPFKIYQRATLPPKYPVFLNFKKRE 502
              ++ ++K++ NF          P K+ ++  L P+ P+ L    R+
Sbjct:   453 MKLAVTKVLQNFSFQPCQETQIPLKLSRQGILQPEKPIVLKVVPRD 498

>Ciona SEQUENCE 110, 121, 122 139 67% TO 4F3
          Length = 508

 Score = 331 (116.5 bits), Expect = 2.2e-31, P = 2.2e-31
 Identities = 132/474 (27%), Positives = 242/474 (51%)

Query:     4 YSVQSVVSLVVFLLVLQILKYYNKTNKNNKYNLPKGPSFLKWFINYLFNFYDLKLSV--L 61
             Y+  S+++L +   +  I ++Y  T K  K N P+  S   W     F    LK+S    
Sbjct:    11 YASFSLLALSILHSLFSIQEFYTVT-KYLKSNWPEPESHWLWGS---FGAVMLKMSADPS 66

Query:    62 KWFNQLNS--DNYSVSFFGR--PMIFTRDT---TISKYILSSNNIDNYTKPPDSSGVLIR 114
              +F  +N     Y   F  +  P I + +T   TI+K IL      N  K   +  +L  
Sbjct:    67 YYFTYMNDCMKKYPYGFRTQLGPFIKSLNTYHPTIAKAIL------NVPKLKRAYKLLFE 120

Query:   115 LAQNSILMSEGDQWRYHRSIINQPFSSKNVKLMIPTIITTINKLI-NHLTII-------I 166
                + +L+ E   W  HR ++   F  + +K  + T+  + + ++ N L+         +
Sbjct:   121 WLGHGLLVLEDKVWLRHRRLLTPSFHFEVLKPYVKTMNESAHVMVENWLSKTSDNKVAKV 180

Query:   167 DIHSYCTKLTFDIIGKLSIGYDFNSIESSDNDNDNNDDDDISKQFDFILNEMIRPIRRFS 226
             +I  Y + +T D   +  + Y  N        ++N+ +D ++K ++  L+E I   +R  
Sbjct:   181 EIFHYASLMTLDTTLRCLMSYQSNC------QDENSTNDYVAKIYE--LSETIVRRQRNL 232

Query:   227 SYL----PLYNDIK----LFKFLNELESIIKGAINSRSLITDNN-NNKTYKKNFLLDNLL 277
             + L    P+YN  K      K  +++    +  IN R   ++   +N+T K    LD LL
Sbjct:   233 NILNRIDPIYNVTKEGRKYLKLCDDVHKFSESVINRRKCDSEQPAHNETRKYFDFLDTLL 292

Query:   278 DDNVKEKDVCNIINYHHPIYFLYLSYSFQIIGNINTFLLAGHETSANLLTFIFYLLSTHN 337
                 ++ D               LS S +I   ++TF+  GH+T+A+ +++ FY L+ H 
Sbjct:   293 --KARDSDGKG------------LSDS-EIRAEVDTFMFEGHDTTASGISWTFYCLAMHP 337

Query:   338 NVQNDLYNHLIENQKKKINKDNKFTEEDEDYQSIEFLDWVIYETLRLFPPAPMIGRTSKN 397
               Q   +  +     +K+  D    E + D  ++  L   I E+LR +PP P+I R   N
Sbjct:   338 EHQEKCFQEI-----EKVMADRTDIEWN-DLSNLPHLTLCIKESLRQYPPVPIIFR-KLN 390

Query:   398 DDILKSGNISIPSETLILISVYAIHRDPKLWKDPNIFNPYRW--KNIENINNRSDFIPFS 455
              DI   G  +I  +T +++ +YA+H   + WKDP+IF+P R+  +N++++N+ + ++PFS
Sbjct:   391 KDIEVDGK-TIVKDTNVVLHIYALHHHEEFWKDPHIFDPSRFTQENMKSMNSYA-YVPFS 448

Query:   456 SGGRVCVGQKFSIVEARIIISKLILNFEL 484
             +G R C+GQ+F++ E +I +++++  F+L
Sbjct:   449 AGPRNCIGQRFAMNEIKIAVAQVLSKFQL 477

>Cyp4e2 AC020402 41848-44099
        Length = 526

 Score = 354 (124.6 bits), Expect = 3.3e-34, P = 3.3e-34
 Identities = 131/494 (26%), Positives = 239/494 (48%)

Query:    10 VSLVVFLLVLQILKYYNKTNKNNKYNLPKGPSFLKWFINYLFNFYDLKLSVLKWFNQLNS 69
             ++L + L+    L  + +    NK+N P+G   +        N  ++  +V  W++Q   
Sbjct:     9 LALPLLLVAYLELSTFRRRRVLNKFNGPRGLPLMGNAHQMGKNPSEILDTVFSWWHQYGK 68

Query:    70 DNYSVSFFGR-PMIFTRDTTISKYILSSNNIDNYTKPPDSSGVLIRLAQNSILMSEGDQW 128
             DN+ V + G    +    +   ++ILSS  +   TK  D   +        +L S G +W
Sbjct:    69 DNF-VFWIGTYSNVLVTSSKYLEFILSSQTL--ITKS-DIYQLTHPWLGLGLLTSTGSKW 124

Query:   129 RYHRSIINQPFSSKNVKLMIPTIITTINKLINHL-TI-----IIDIHSYCTKLTFDIIGK 182
               HR +I   F    ++     +     K I HL T+     I D       LT D+I  
Sbjct:   125 HKHRKMITPAFHFNILQDFHEVMNENSTKFIKHLKTVAAGDNIFDFQEQAHYLTLDVICD 184

Query:   183 LSIGYDFNSIESSDNDNDNNDDD---DISKQFDFIL--NEMIRPIRRFSSYLPLYN-DIK 236
              ++G   N++E+  +       D   +I+ +    L  NE++    R +   P Y+  +K
Sbjct:   185 TAMGVSINAMENRSSSIVQAFKDMCYNINMRAFHPLKRNELLY---RLAPDYPAYSRTLK 241

Query:   237 LFK-FLNELESIIKGAINSRSLITDNNNNKTYKKNFLLDNLLDDNVKEKDVCNIINYHHP 295
               + F NE+ +    A  S ++ T+  +  T KK   LD LL   +  +     +N    
Sbjct:   242 TLQDFTNEIIAKRIEAHKSGAVSTNAGDEFTRKKMAFLDTLLSSTIDGRP----LN---- 293

Query:   296 IYFLYLSYSFQIIGNINTFLLAGHETSANLLTFIFYLLSTHNNVQNDLYNHLIENQ-KKK 354
                     S ++   ++TF+  GH+T+ + ++F  YLLS H + Q  L+    E     +
Sbjct:   294 --------SKELYEEVSTFMFEGHDTTTSGVSFAVYLLSRHQDEQRKLFKEQREVMGNSE 345

Query:   355 INKDNKFTEEDEDYQSIEFLDWVIYETLRLFPPAPMIGRTSKNDDILKSGNISIPSETLI 414
             + +D  F E  +    +++LD  I E  R++P  P IGR ++ D ++  G++ +P  T +
Sbjct:   346 LGRDATFQEISQ----MKYLDLFIKEAQRVYPSVPFIGRFTEKDYVI-DGDL-VPKGTTL 399

Query:   415 LISVYAIHRDPKLWKDPNIFNPYRWKNIENINNRSDFIPFSSGGRVCVGQKFSIVEARII 474
              + +  +  + K++KDP+ F P R++ +E      +++PFS+G R C+GQKF+++E + +
Sbjct:   400 NLGLVMLGYNEKVFKDPHKFRPERFE-LEK-PGPFEYVPFSAGPRNCIGQKFALLEIKTV 457

Query:   475 ISKLILNFEL--SFNNLKSKPFKIYQRATLP 503
             +SK+I NFE+  + + L SK   I     LP
Sbjct:   458 VSKIIRNFEVLPALDELVSKDGYISTTIGLP 488

>CYP97A3
        Length = 590

 Score = 328 (115.5 bits), Expect = 4.5e-30, P = 4.5e-30
 Identities = 124/444 (27%), Positives = 212/444 (47%)

Query:   100 YSVSFFGRPMIFTRDTTISKYILSSNNIDNYTKPPDSSGVLIRLAQNSILMSEGDQWRYH 159
             + ++F  +  +   D +I+K+IL  +N   Y+K   +  +L  +    ++ ++G+ WR  
Sbjct:   143 FRLTFGPKSFLIVSDPSIAKHILK-DNAKAYSKGILAE-ILDFVMGKGLIPADGEIWRRR 200

Query:   160 RSIINQPFSSKNVKLMIPTIITTINKLINHLTII------IDIHSYCTKLTFDIIGKLSI 213
             R  I      K V  MI       ++L   L         +++ S  ++LT DIIGK   
Sbjct:   201 RRAIVPALHQKYVAAMISLFGEASDRLCQKLDAAALKGEEVEMESLFSRLTLDIIGKAVF 260

Query:   214 GYDFNSIESSDNDNDNNDDDDISKQFDFILNEMIRPIRRFSSYLPLYNDIKLFKFLNELE 273
              YDF+S+         ND   I   +  +     R +    S +P++ DI ++K ++  +
Sbjct:   261 NYDFDSL--------TNDTGVIEAVYTVLREAEDRSV----SPIPVW-DIPIWKDISPRQ 307

Query:   274 SIIKGAINSRSLITDNNNN--KTYKKNFLLDNLL--DDNVKEKDVCNIINYHHPIYFLYL 329
                +    S  LI D  ++   T K+    + L   ++ + E+D  +I+++        +
Sbjct:   308 ---RKVATSLKLINDTLDDLIATCKRMVEEEELQFHEEYMNERDP-SILHFLLASGDDVI 363

Query:   330 SYSFQIIGNINTFLLAGHETSANLLTFIFYLLSTHNNVQNDLYNHLIENQKKKINKDNKF 389
               S Q+  ++ T L+AGHETSA +LT+ FYLL+T  +V   L     + +   +  D +F
Sbjct:   364 VSSKQLRDDLMTMLIAGHETSAAVLTWTFYLLTTEPSVVAKL-----QEEVDSVIGD-RF 417

Query:   390 TEEDEDYQSIEFLDWVIYETLRLFPPAPMIGRTSKNDDILKSGNISIPSETLILISVYAI 449
                 +D + +++   V+ E+LRL+P  P++ R S ++DIL  G   I     I ISV+ +
Sbjct:   418 PTI-QDMKKLKYTTRVMNESLRLYPQPPVLIRRSIDNDIL--GEYPIKRGEDIFISVWNL 474

Query:   450 HRDPKLWKDPNIFNPYRWK----NIENINNRSDFIPFSSGGRVCVGQKFSIVEARIIISK 505
             HR P  W D   FNP RW     N    N    ++PF  G R C+G  F+  E  + I+ 
Sbjct:   475 HRSPLHWDDAEKFNPERWPLDGPNPNETNQNFSYLPFGGGPRKCIGDMFASFENVVAIAM 534

Query:   506 LILNFELSFNNLKSKPFKIYQRATLPPKYPVFLNFKKR 543
             LI  F        + P K+   AT+     + L   KR
Sbjct:   535 LIRRFNFQIAP-GAPPVKMTTGATIHTTEGLKLTVTKR 571

>31. CYP4F12 human GenEMBL AC004523  missing N-terminal
     Length = 458

 Score = 316 (111.2 bits), Expect = 8.2e-30, P = 8.2e-30
 Identities = 100/363 (27%), Positives = 176/363 (48%)

Query:   148 ILMSEGDQWRYHRSIINQPFSSKNVKLMIPTIITTINKLINHLTIIIDIHSYCTKLTFDI 207
             IL+S GD+W  HR ++   F    +K  I     + N +++    +    S C  + F+ 
Sbjct:    70 ILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSCLDM-FEH 128

Query:   208 IGKLSIGYDFNSIESSDNDNDNNDDDDISKQFDFILNEMIRPIRRFSSYLPLYNDIKLFK 267
             I  +++    +S++      D++  +  S+    IL E+   + + S ++  + D  L+ 
Sbjct:   129 ISLMTL----DSLQKCIFSFDSHCQERPSEYIATIL-ELSALVEKRSQHILQHMDF-LY- 181

Query:   268 FLNELESIIKGAINSRSLITD---NNNNKTYKKNFLLDNLLDDNVKEK--DVCNIINYHH 322
             +L+        A       TD       +T     + D+   D  K K  D  +++    
Sbjct:   182 YLSHDGRRFHRACRLVHDFTDAVIRERRRTLPTQGI-DDFFKDKAKSKTLDFIDVLLLSK 240

Query:   323 PIYFLYLSYSFQIIGNINTFLLAGHETSANLLTFIFYLLSTHNNVQNDLYNHLIENQKKK 382
                   LS    I    +TF+  GH+T+A+ L+++ Y L+ H   Q      + E  K  
Sbjct:   241 DEDGKALSDE-DIRAEADTFMFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLK-- 297

Query:   383 INKDNKFTEEDEDYQSIEFLDWVIYETLRLFPPAPMIGRTSKNDDILKSGNISIPSETLI 442
              ++D K  E D D   + FL   + E+LRL PPAP I R    D +L  G + IP     
Sbjct:   298 -DRDPKEIEWD-DLAQLPFLTMCVKESLRLHPPAPFISRCCTQDIVLPDGRV-IPKGITC 354

Query:   443 LISVYAIHRDPKLWKDPNIFNPYRWKNIENINNRSD--FIPFSSGGRVCVGQKFSIVEAR 500
             LI +  +H +P +W DP +++P+R+ + EN   RS   FIPFS+G R C+GQ F++ E +
Sbjct:   355 LIDIIGVHHNPTVWPDPEVYDPFRF-DPENSKGRSPLAFIPFSAGPRNCIGQAFAMAEMK 413

Query:   501 IIISKLILNF 510
             ++++ ++L+F
Sbjct:   414 VVLALMLLHF 423

>sdi45A310e06.q1c 146570 letters
        Length = 146,570

  Plus Strand HSPs:

 Score = 1150 (409.9 bits), Expect = 3.4e-116, P = 3.4e-116
 Identities = 218/219 (99%), Positives = 218/219 (99%), Frame = +2

Query:     1 IIGNINTFLLAGHETSANLLTFIFYLLSTHNNVQNDLYNHLIENQKKKINKDNKFTEEDE 60
             IIGNINTFLLAGHETSANLLTFIFYLLSTHNNVQNDLYNHLIENQKKKINKDNKFTEEDE
Sbjct:  8516 IIGNINTFLLAGHETSANLLTFIFYLLSTHNNVQNDLYNHLIENQKKKINKDNKFTEEDE 8695

Query:    61 DYQSIEFLDWVIYETLRLFPPAPMIGRTSKNDDILKSGNNNNNNNNNISIPSETLILISV 120
             DYQSIEFLDWVIYETLRLFPPAPMIGRTSKNDDILKSGNNNNNNNNNISIPSETLILISV
Sbjct:  8696 DYQSIEFLDWVIYETLRLFPPAPMIGRTSKNDDILKSGNNNNNNNNNISIPSETLILISV 8875

Query:   121 YAIHRDPKLWKDPNIFNPYRWKNIENINNRSDFIPFSSGGRVCVGQKFSIVEARIIISKL 180
             YAIHRDPKLWKDPNIFNPYRWKNIENINNRSDFIPFSSGGRVCVGQKFSIVEARIIISKL
Sbjct:  8876 YAIHRDPKLWKDPNIFNPYRWKNIENINNRSDFIPFSSGGRVCVGQKFSIVEARIIISKL 9055

Query:   181 ILNFELSFNNLKSKPFKIYQRATLPPKYPVFLNFKKREN 219
             ILNFELSFNNLKSKPFKIYQRATL PKYPVFLNFKKREN
Sbjct:  9056 ILNFELSFNNLKSKPFKIYQRATLTPKYPVFLNFKKREN 9172

>_1
                              R*I*YILFFNVFNIWFYIMDFN*GMNFFIFFLFFNFLYNF*PPSKLYNSKF*NIIIKQIK
                              IINIIYQKVQVS*NGL*IIYLIFMI*NYQIIKRRIIIIIIIKAIIV*VKKN**KIQVKIQ
                              Y*NGLIN*IVIIIVYHSLVDQ*YSQGIQLLVNIYYHQIILITIQNHPIHLVF*LD*LKIV
                              F*CLKVINGDIIDQSLINHSLLKMLN**YQL**QQ*IN*LII*IIIIIIIIIIIIIIIII
                              IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIITIL***IYIHIVLN*HLI*LVNYQL
                              VMILIV*NQVITIMIIMTMMIFQNSLISF*MK*LDQLEGFLPIYLYIMI*NYLNF*MN*N
                              QL*KVQLIQDH*LLIIIIIRPIKRIFY*IIYLMIMLKKKMYVILLIITIQYIFYIYLTHF
                              N**YYYYYYYYYYYYYIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIILDN
                              WKYKYIFISRS*NISKFIDIYILFIINT**CSK*FI*SFNRKSKKENK*R**IYRRG*RL
                              SIN*IFRLGNL*NIKIIPTCTNDR*NFKK**YFKEW*********YINTK*NINFNISLC
                              NS*RSKIMERSKYIQSL*MEKY*KY***I*FYTILKWW*SLCWSKI*YS*SKNYHFKINF
                              KF*III**FKIKTI*NLSKSNFNSKISSFFKF*KKRK*IKIKINHNFKIILYLFF*NK*I
                              GWCSSQTHNQKHTKKKKKKN*KIKKKKYLF*LK*LL*YFK*VKSLFIYLFIFCFI*IFLF
                              FYNFNIIWN*IIINWMSKTFTKIN*F*RII*NTIF*LMIIT*EN*FFNRIWY*IF**FFF
                              NIT*YQFL*FSTIFN*EWLNLLFKSIT*M*FFN*FWYYFIFKI*FNIK*Y*SFNRF*NIL
                              RNNWMIKTTI*TYLFYT*WY
                              >_2
                              DEFDISYFLTCSIFGFILWILTEV*IFLFFFYFLIFYTIFNPLPNYIIANFKIL**NK*K
                              **I*FTKRSKFLKMVYKLFI*FL*FKIIK**RGG*******KQ**FKSRRINRRYK*KYS
                              TKMV*SIK***L*CIILW*TNDIHKGYNY**IYIIIK*Y**LYKTTRFIWCFN*ISSK*Y
                              FNV*R*SMEIS*INH*STILF*KC*INDTNYNNNNK*IN*SFK*****************
                              ***********************************QYYNNRYTFILY*INI*YNW*IINW
                              L*F**YRIK**R*****R**YFKTV*FHFK*ND*TN*KVFFLFTFI**YKII*IFK*IRI
                              NYKRCN*FKIINY******DL*KEFFIR*FT***C*RKRCM*YY*LSPSNIFFIFILLIS
                              TNNTIIIIIIIIIIIILLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLF*II
                              GNINTFLLAGHETSANLLTFIFYLLSTHNNVQNDLYNHLIENQKKKINKDNKFTEEDEDY
                              QSIEFLDWVIYETLRLFPPAPMIGRTSKNDDILKSGNNNNNNNNNISIPSETLILISVYA
                              IHRDPKLWKDPNIFNPYRWKNIENINNRSDFIPFSSGGRVCVGQKFSIVEARIIISKLIL
                              NFELSFNNLKSKPFKIYQRATLTPKYPVFLNFKKRENK*K*K*IIILK*FYICFSKTNKL
                              DGVLHKHITKNTQKKKKKKIKKLKKKNIYFN*NNYYNILNRSKVYLFIYLFFVLFKFFYF
                              FIISILFGIKLLSIG*VKLLLK*INSKESFKIPFSN**L*PKKINFLTEFGIEFFNSFSL
                              ISHNTNFSNFPPFSIENG*ICCSNPSHKCNSLIDFGITLFSRFNLILNNTKVSTDSEIF*
                              GIIG*LKLLYKRIFFTLDG
                              >_3
                              MNLIYLIF*RVQYLVLYYGF*LRYEFFYFFFIF*FSIQFLTPFQII**QILKYYNKTNKN
                              NKYNLPKGPSFLKWFINYLFNFYDLKLSNNKEEDNNNNNNKSNNSLSQEELIEDTSENTV
                              LKWFNQLNSDNYSVSFFGRPMIFTRDTTISKYILSSNNIDNYTKPPDSSGVLIRLAQNSI
                              LMSEGDQWRYHRSIINQPFSSKNVKLMIPTIITTINKLINHLNNNNNNNNNNNNNNNNNN
                              NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN TIIIDIHSYCTKLTFDIIGKLSIG
                              YDFNSIESSDNDNDNNDDDDISKQFDFILNEMIRPIRRFSSYLPLYNDIKLFKFLNELES
                              IIKGAINSRSLITDNNNNKTYKKNFLLDNLLDDNVKEKD VCNIINYHHPIYFLYLSYSFQ
                              LIILLLLLLLLLLLLYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYFR*L
                              EI*IHFY*QVMKHQQIY*HLYFIYYQHIIMFKMIYIII**KIKKRK*IKIINLQKRMKII
                              NQLNF*IG*FMKH*DYSHLHQ**VELQKMMIF*RVVIIIIIIIIIYQYQVKH*F*YQFMQ
                              FIEIQNYGKIQIYSILIDGKILKILIIDLILYHSQVVVEFVLVKNLV*LKQELSFQN*F*
                              ILNYHLII*NQNHLKFIKEQL*LQNIQFF*ILKKEKINKNKNKS*F*NNFIFVFLKQINW
                              MVFFTNT*PKTHKKKKKKKLKN*KKKIFILIKIITIIF*IGQKFIYLFIYFLFYLNFFIF
                              L*FQYYLELNYYQLDE*NFY*NKLILKNHLKYHFLIDDYNLRKLIF*QNLVLNFLIVFL*
                              YHIIPISLIFHHFQLRMVEFVVQIHHINVIL*LILVLLYFQDLI*Y*IILKFQQILKYFE
                              E*LDD*NYYINVSFLHLMV

>CYP554A1 SEQ 40 30% to 508 469 aa complete
MDLLLFIFFLILFYYSVK (0)
YYKADNQNSLSLSGPTPVPILGNIHQVGKDAHLTIPIISKKYHGLFRMWFGGTYYVVV
SDYKLIREMYIENFENFKNRIA
TFKTMTGDDSRGIIGCNGDIWDSNKELIMKSYKKVLNKDMNDFILLKSKELFNFFEKNGIKNE
EEEDDDDDGNKSIINNTRFYFQSLTLT
VMFKMIFNENKSFQLYSDST
EFKLIFKTILNLLNSLNVYNVI
YDFLGIFQPILLKFTKILDKNSFLSKIA
TEKFNSRIKEIDFTSDDFKANDLLDSLIMTINEDEN
GLNEKQIENIKSICIDFLMAGTDTTGSTIEWIILKLVNNPEFQELIFQELK
KLNKSEITANDKINTPFLNSFIKETNRLYPIAPLSLP
KSNNNFFFRKSINEMIIGDNKYYIPANTNILMDVKGFSLDENNYKDPNEFKPDRFLNSKVSDTLNFGI
GPRNCIGQTIAMNQIHIFLSNLILNYRMFSIDCLPLPENLILSVSVRPTEYSLKLIKRV*
Dict-IV-V318a05.p1c
Dict-IV-V784b11.p1c
IIACP2E4184
IIAFP1D29966
JAX4a66c03.s1 N-TERM
JAX4a66c03.r1
JC1b85f01.s1
IICCP1D17820
IICCP2D13776

>CYP513E1 48% to 513A3
seq 90 complete 39% to seq 5+56; 42% to seq 8; 47% to seq 37 shorter match
seq 52 62% to seq 5 JOINED WITH SEQ 90
seq 51 47% to seq 8 probaly same as 52, 90
MNLYISILILIISLIIFFKN (0) 
NNRISKINSKIPGPIRLPIFGNLLQINKDPHIQFQKWYEKYG
VIYSIRLGNIETVVFTGYPIFKKAFIENSQIFAPRFQHLSRFEAN
GCKNLIGSNDEIHSTLKKLILTEITSNKIKKMENHIVL
ECENLCKQLDKHCQDGLPFSLNMYFKLFSLNIILRFLFGTINNSYQDKSNQDIVDVI
IEFLHYGGNPIMSDFIPILKPFYKQNKFFKFYPILCDHLNKLIENYKNNKQLQKQQKQQQ
NEDDDDDDDGTIIGKLLKEYHNGKISWTSVVSTCVDVFLAGVDSTSNSTIFTLIALVNNS
NCQEKLFNEIKNNLKKSDDGHDEIVIRHSLYRSSI
PYLSLVMKEVYRLYSVILIGLPHITSEDVEIEGYKIAKGTQIIQNVFSTHLCEKTFPMSKSFI
PERFIETGSNNMFGGGQTNLVHFGTGVRDCVGKSLADCEIFTVLATLINRYQFINPTLEPLNDIGSFGIAYQPPINNFIIKKRL*
JC2a63e10.r1    367-432
IIAGP1D21757
IICDP1D0343
IIAFP1D61388
JC1a74a05.s1
IICDP1E0343 NEW N-TERM
IIAFP1D96533
JAX4d09e08.r1   28-142

>JC3e44c07.r1 25202 letters
        Length = 25,202

EKLFNEIKNNLKKSDDGHDEIVIRHSLYRSSIPYLSLVMKEVYRLYSVILIGLPHITSED
VEIEGYKIAKGTQIIQNVFSTHLCEKTFPMSKSFIPERFIETGSNNMFGGGQTNLVHFGT
GVRDCVGKSLADCEIFTVLATLINRYQFINPTLEPLNDIG SFGIAYQPPINNFIIKKRL*

>_1
AGVKEMERILSIAGVKKKKLKKNRKSKKIKIKKK*KKIKVKNRITKIKKIKKKILAQTS*
LLFNIQSNNFLKIN*LNKQKKMF*RRVI**KLSYNFFFFMPFHSNKKNNKKKKKKSKKNK
FKNYLIHNYSLT*KKKKKLNYQLKFLKNNSI*GNKCYHV*NKPFYLPTIFFFFFFFFF*L
FIFFFIFYLIFFSCSFL*ILFFKLIIYYLLNLNI**KKKKNEFIYFNFNINNIINNFF*K
CNKFNIKKKKKKKKYYLLLFIVTIITIINYYFKK*RIIELVKLIQKYQDQLDYQFLEIYF
KLIRIHIFNFKNGMKSME*FIQ*D*VILKQLFSLAIQYLK
>_2
QVLKKWKEFFPSRELKKKN*KKIEKVKK*K*KKNKKK*KLKIG*QKLKK*KKKFWRKQVN
CYLIFNQITF*K*IN*INKKKCFKEE*FSENCLITFFFLCHFIQIKKIIKKKKKNLKKTN
LKII*FIIIH*HKKKKKN*TTN*NF*KIIPFKVINVIMYKINPSISPQFFFFFFFFFFDY
LFFFLFFI*FFFLVHFYKFFFLN*SYIIY*I*IYNKKKKKMNLYISILILIISLIIFFKN
VINSILKKKKKKKNIIYYYLLLLSLLLLIIILKNKE**N**N*FKNTRTN*ITNFWKFTS
N**GSTYSISKMV*KVWSNLFNKIR*Y*NSCFHWLSNI*
>_3
RC*RNGKNSFHRGS*KKKIKKK*KK*KNKNKKKIKKNKS*K*DNKN*KNKKKNFGANKLI
VI*YSIK*LFKNKLIK*TKKNVLKKSNLVKIVL*LFFFYAISFK*KK**KKKKKI*KKQI
*KLFNS*LFINIKKKKKIKLPIKIFKK*FHLR**MLSCIK*TLLSPHNFFFFFFFFFLII
YFFFYFLFNFFFLFISINSFF*INHILFIEFKYIIKKKKK*IYIFQF*Y**YH**FFLKM
**IQY*KKKKKKKILFIIIYCYYHYYY*LLF*KIKNNRISKINSKIPGPIRLPIFGNLLQ
INKDPHIQFQKWYEKYGVIYSIRLGNIETVVFTGYPIFK

  Plus Strand HSPs:

 Score = 2449 (867.1 bits), Expect = 4.8e-254, P = 4.8e-254
 Identities = 466/470 (99%), Positives = 467/470 (99%), Frame = +3

Query:     1 NNRISKINSKIPGPIRLPIFGNLLYINKDPHIQFQKWYEKYGVIYSIRLGNIETVVFTGY 60
             NNRISKINSKIPGPIRLPIFGNLL INKDPHIQFQKWYEKYGVIYSIRLGNIETVVFTGY
Sbjct:  6708 NNRISKINSKIPGPIRLPIFGNLLQINKDPHIQFQKWYEKYGVIYSIRLGNIETVVFTGY 6887

Query:    61 PIFKKAFIENSQIFAPRFQHLSRFEANGCKNLIGSNDEIHSTLKKLILTEITSNKIKKME 120
             PIFKKAFIENSQIFAPRFQHLSRFEANGCKNLIGSNDEIHSTLKKLILTEITSNKIKKME
Sbjct:  6888 PIFKKAFIENSQIFAPRFQHLSRFEANGCKNLIGSNDEIHSTLKKLILTEITSNKIKKME 7067

Query:   121 NHIVLECENLCKQLDKHCQDGLPFSLNMYFKLFSLNIILRFLFGTINNSYQDKSNQDIVD 180
             NHIVLECENLCKQLDKHCQDGLPFSLNMYFKLFSLNIILRFLFGTINNSYQDKSNQDIVD
Sbjct:  7068 NHIVLECENLCKQLDKHCQDGLPFSLNMYFKLFSLNIILRFLFGTINNSYQDKSNQDIVD 7247

Query:   181 VIIEFLHYGGNPIMSDFIPILKPFYKQNKFFKFYPILCDHLNKLIENYKNNKQLQKQQKQ 240
             VIIEFLHYGGNPIMSDFIPILKPFYKQNKFFKFYPILCDHLNKLIENYKNNKQLQKQQKQ
Sbjct:  7248 VIIEFLHYGGNPIMSDFIPILKPFYKQNKFFKFYPILCDHLNKLIENYKNNKQLQKQQKQ 7427

Query:   241 QQNEDDDDDDDGTIIGKLLKEYHNGKISWTSVVSTCVDVFLAGVDSTSNSTIFTLIALVN 300
             QQNEDDDDDDDGTIIGKLLKEYHNGKISWTSVVSTCVDVFLAGVDSTSNSTIFTLIALVN
Sbjct:  7428 QQNEDDDDDDDGTIIGKLLKEYHNGKISWTSVVSTCVDVFLAGVDSTSNSTIFTLIALVN 7607

Query:   301 NSNCQEKLFNEIKNNLKKSDDGHDEIVIRQSLYRSSIPYLSLVMKEVYRLYSVILIGLPH 360
             NSNCQEKLFNEIKNNLKKSDDGHDEIVIR SLYRSSIPYLSLVMKEVYRLYSVILIGLPH
Sbjct:  7608 NSNCQEKLFNEIKNNLKKSDDGHDEIVIRHSLYRSSIPYLSLVMKEVYRLYSVILIGLPH 7787

Query:   361 ITSEDVEIEGYKIAKGTQIIQNVFSTHLCEKTFPMSKSFIPERFIETGSNNMFGGGQTNL 420
             ITSEDVEIEGYKIAKGTQIIQNVFSTHLCEKTFPMSKSFIPERFIETGSNNMFGGGQTNL
Sbjct:  7788 ITSEDVEIEGYKIAKGTQIIQNVFSTHLCEKTFPMSKSFIPERFIETGSNNMFGGGQTNL 7967

Query:   421 VHFGTGVRDCVGKSLADCEIFTVLATLINRYQFINPTLEPLNDIGFFGIS 470
             VHFGTGVRDCVGKSLADCEIFTVLATLINRYQFINPTLEPLNDIG FGI+
Sbjct:  7968 VHFGTGVRDCVGKSLADCEIFTVLATLINRYQFINPTLEPLNDIGSFGIA 8117

Unnamed fragments

CYP513F1 Seq 57 33% to seq 8  complete gene 39% to 513E1, only one intron
MILSLLFLFVITLYFLIPSR (0)
ISKINKNIPGPIGYPIVGNLFQINKNVVKSIDGFYKEFGPVYRLRMGNIETVVLTGIDT
LEESFLFNKHSFVDRFVKKSRKINNGLDIIHSNGEYWKILKTIFQTQMTPRKIKSYQFEIQS
QVDLMAEQLYKSKNNDNIVTNINENMKFMLFNIMSILIFGKQSIYCNNTNNINNKD
DDDVDKEKKHIIFSIGRFFKTSGSLFYSDFIPILLPFDLINLSRNNFF
KDFQVLTNFVS KNVNQQLSKLNDNNNNNKEKEGEERKSIVEAYLENYLNGEIKFESVL 372
SSCTNLLLAGTDSSANTLSFLLVSLINNPEIQEKVYNEIITNLKNDEISINDRFKCPYTCAVIK 181
ETHRLYSIAPLSEPHYCSNDVEIKGFKIAKGTQIIQNIYSSSRSEQYWDKPLSFIPERF 2
IDNANIKEKNKNIVSFGLGLRGCIGKSFAEYMIFLTVVRLIKNYKFSNPSPNQPLKEIGEYGL
VMNCANYNAKIEKRK*
CHR2.0.13853 
JAX4a239b02.r1
JC2a35c06.s2
c-JC2a35c06.s2
JC2a47b09.s1
JC2d18f08.r1
IICAP1D22625
SFG519 extends past PERF

>JC3a98a01.r1 21796 letters
        Length = 21,796

  Minus Strand HSPs:

Query:     1 MILSLLFLFVITLYFLI 17
             MILSLLFLFVITLYFLI
Sbjct: 20132 MILSLLFLFVITLYFLI 20082

Query:    18 PSRISKINKNIPGPYWIPIVGNLFQINKNVVKSIDGFYKEFGPVYRLRMGNIETVVLTGI 77
             PSRISKINKNIPGP   PIVGNLFQINKNVVKSIDGFYKEFGPVYRLRMGNIETVVLTGI
Sbjct: 19997 PSRISKINKNIPGPIGYPIVGNLFQINKNVVKSIDGFYKEFGPVYRLRMGNIETVVLTGI 19818

Query:    78 DTLEESFLFNKHSFVDRFVKKSRKINNGLDIIHSNGEYWKILKTIFQTQMTPRKIKSYQF 137
             DTLEESFLFNKHSFVDRFVKKSRKINNGLDIIHSNGEYWKILKTIFQTQMTPRKIKSYQF
Sbjct: 19817 DTLEESFLFNKHSFVDRFVKKSRKINNGLDIIHSNGEYWKILKTIFQTQMTPRKIKSYQF 19638

Query:   138 EIQSQVDLMAEQLYKSKNNDNIVTNINENMKFMLFNIMSILIFGKQSIYY---------- 187
             EIQSQVDLMAEQLYKSKNNDNIVTNINENMKFMLFNIMSILIFGKQSIY           
Sbjct: 19637 EIQSQVDLMAEQLYKSKNNDNIVTNINENMKFMLFNIMSILIFGKQSIYCNNTNNINNKD 19458

Query:   188 DDDVDKEKKHIIFSIGRFFKTSGSLFYSDFIPILLPFDLINLSRNNFFKIFKVLTNFVSK 247
             DDDVDKEKKHIIFSIGRFFKTSGSLFYSDFIPILLPFDLINLSRNNFFK F+VLTNFVSK
Sbjct: 19457 DDDVDKEKKHIIFSIGRFFKTSGSLFYSDFIPILLPFDLINLSRNNFFKDFQVLTNFVSK 19278

Query:   248 NVNQQLSKLNDNNNNNKEKEGEERKSIVEAYLENYLNGEIKFESVLSSCTNLLLAGTDSS 307
             NVNQQLSKLNDNNNNNKEKEGEERKSIVEAYLENYLNGEIKFESVLSSCTNLLLAGTDSS
Sbjct: 19277 NVNQQLSKLNDNNNNNKEKEGEERKSIVEAYLENYLNGEIKFESVLSSCTNLLLAGTDSS 19098

Query:   308 ANTLSFLLVSLINNPEIQEKVYNEIITNLKNDEISINDRFKCPYTCAVIKETHRLYSIAP 367
             ANTLSFLLVSLINNPEIQEKVYNEIITNLKNDEISINDRFKCPYTCAVIKETHRLYSIAP
Sbjct: 19097 ANTLSFLLVSLINNPEIQEKVYNEIITNLKNDEISINDRFKCPYTCAVIKETHRLYSIAP 18918

Query:   368 LSEPHYCSNDVEIKGFKIAKGTQIIQNIYSSSRSEQYWDKPLSFIPERFIDNANIKEKNK 427
             LSEPHYCSNDVEIKGFKIAKGTQIIQNIYSSSRSEQYWDKPLSFIPERFIDNANIKEKNK
Sbjct: 18917 LSEPHYCSNDVEIKGFKIAKGTQIIQNIYSSSRSEQYWDKPLSFIPERFIDNANIKEKNK 18738

Query:   428 NIVSFGLGLRGCIGKSFAEYMIFLTVVRLIKNYKFSNPSPNQPLKEIGEYGLVMNCANYN 487
             NIVSFGLGLRGCIGKSFAEYMIFLTVVRLIKNYKFSNPSPNQPLKEIGEYGLVMNCANYN
Sbjct: 18737 NIVSFGLGLRGCIGKSFAEYMIFLTVVRLIKNYKFSNPSPNQPLKEIGEYGLVMNCANYN 18558

Query:   488 AKIKKK--INK--KIK 499
             AKI+K+  I K  KIK
Sbjct: 18557 AKIEKRK*IKK*NKIK 18510


>SFG519 (SFG519Q) /pub/dna_csm/LIBRARY/SF/SFG5-A/SFG519Q.Seq.d/
        Length = 1375

  Plus Strand HSPs:

 Score = 206 (77.6 bits), Expect = 2.6e-15, P = 2.6e-15
 Identities = 42/45 (93%), Positives = 42/45 (93%), Frame = +1

Query:     1 MILSLLFLFVITLYFLIPSRISKINKNIPGPYWIPIVGNLFQINK 45
             MILSLLFLFVITLYFLIPSRISKINKNIPGP   PIVGNLFQINK
Sbjct:    55 MILSLLFLFVITLYFLIPSRISKINKNIPGPIGYPIVGNLFQINK 189



>JC3f151d24.s1 Clone JC3f151d24, standard read, bases 33 through 687, from
            2002-12-06
        Length = 653

  Plus Strand HSPs:

 Score = 127 (49.8 bits), Expect = 4.5e-14, Sum P(2) = 4.5e-14
 Identities = 25/28 (89%), Positives = 25/28 (89%), Frame = +3

Query:    18 PSRISKINKNIPGPYWIPIVGNLFQINK 45
             PSRISKINKNIPGP   PIVGNLFQINK
Sbjct:   231 PSRISKINKNIPGPIGYPIVGNLFQINK 314

 Score = 79 (32.9 bits), Expect = 4.5e-14, Sum P(2) = 4.5e-14
 Identities = 17/17 (100%), Positives = 17/17 (100%), Frame = +3

Query:     1 MILSLLFLFVITLYFLI 17
             MILSLLFLFVITLYFLI
Sbjct:    96 MILSLLFLFVITLYFLI 146

Seq 37a 45% to seq 5 (513E2P) no ESTs to ends. no introns in this seq
No N-term exon exists between this seq and seq 37b 429bp above it
So it is probably a newly formed pseudogene
KNRINKINSKIPGPIGLPIIGNLHQLKNNPNK
VFKKWYKKYGPIYRIRMGNIETIILTGYPIIKKSFIDNSNVFQKRYEYRSKIQFNNCSDL
TFTNGETHSNLKKIILSEMTLTKLKKQENYIILECEKLCKHLDNHCESGLPFKFKFYGKL
FSLNIIFRFLFGLNYQYDNEKIIEIINLILELVEGAGDPIISDFIPLVKYFENINQNKFL
KIYKKLIILIENIINNLKLKFIENNYDDDNDENNITIFLKLFKEFKNGKITWDNLVGTCV
DLTVGGTDTVSDSLNFSLIALINNPNCQEKLYNEIKNELLKNNNNNNIDDLIILKHSLYR
LSIPYLSMVMKETYRNFPVALLGLPIITTQDMEINGYKISKGTQVFKNIFLSHKSKDFFK
SPNDFNPERFSDSGNNFMFGGGTTNLVQFGAGSRDCIGKNVVDCELFTVLGTLINRYQFL
NPVLSKPLNENGIIGLANQSPDNYFIIKKRK*
Dict-IV-V720e06.q1c
Dict-IV-V63d01.q1c
Contig_0472, D. discoideum, Length = 3730
JC3e44c07.r1

>JC3e44c07.r1 25202 letters
        Length = 25,202

  Plus Strand HSPs:

 Score = 817 (292.7 bits), Expect = 6.0e-81, P = 6.0e-81
 Identities = 152/152 (100%), Positives = 152/152 (100%), Frame = +3

Query:     1 KNRINKINSKIPGPIGLPIIGNLHQLKNNPNKVFKKWYKKYGPIYRIRMGNIETIILTGY 60
             KNRINKINSKIPGPIGLPIIGNLHQLKNNPNKVFKKWYKKYGPIYRIRMGNIETIILTGY
Sbjct:  2913 KNRINKINSKIPGPIGLPIIGNLHQLKNNPNKVFKKWYKKYGPIYRIRMGNIETIILTGY 3092

Query:    61 PIIKKSFIDNSNVFQKRYEYRSKIQFNNCSDLTFTNGETHSNLKKIILSEMTLTKLKKQE 120
             PIIKKSFIDNSNVFQKRYEYRSKIQFNNCSDLTFTNGETHSNLKKIILSEMTLTKLKKQE
Sbjct:  3093 PIIKKSFIDNSNVFQKRYEYRSKIQFNNCSDLTFTNGETHSNLKKIILSEMTLTKLKKQE 3272

Query:   121 NYIILECEKLCKHLDNHCESGLPFKFKFYGKL 152
             NYIILECEKLCKHLDNHCESGLPFKFKFYGKL
Sbjct:  3273 NYIILECEKLCKHLDNHCESGLPFKFKFYGKL 3368

>_1
                              QF**IMILKLMVLKLKKVLKLLKIFINHIDMKIFLNYQIILYLKEILKVVINLFMVVVIQ
                              IWYNLVLEIEIVLVNL*QPVKFLHY*QL*LIDMNFKIQNLQYL*MIKVN*VLFFIHQMLN
                              F*LKKENK*KINLNSRLVVYL*F*I*IYFFFF*KKKKKKWVKINKNIKNWFNLFNFIIII
                              KKKKKKKKVKFFHK*LKKKKKKKKKVKILIFLCY*FKKKKEINFNYYYNNFNFIYI*KSN
                              KK*FFFFFLNFYSF*QFFFFFFFFFFFFFFRKIELIKLIQKYQDQLVYQLLVIYIN*KII
                              >_2
                              SFNR**Y*N*WF*N*KRYSNY*KFLSIT*T*RFF*ITKSFYT*KKY*KW**IYLWLW*YK
                              YGTIWSWK*RLYW*IFSNQ*NFYIISNSN**I*ILKSKTFNTFK**R*IKSYSSSTKC*I
                              FN*KKKINKKLI*IVG*WFIYNFRYRFTFFFFKKKKKKNGLK*IKILKIGLIYLILL*L*
                              KKKKKKKKLNFFINN*KKKKKKKKKLKF*FFYVINLKKKKKLILIIIIIILTLFIFEKVI
                              KNNFFFFF*IFIVFNNFFFFFFFFFFFFFSEK*N**N*FKNTRTNWFTNYW*FTSIKK*
                              >_3 seq 37B C-terminal
                              VLIDNDIEIDGFKIKKGTQIIKNFYQSHRHEDFFKLPNHFIPERNIESGNKFIYGCGDTN
                              MVQFGLGNRDCIGKSLATSEIFTLLATLINRYEF*NPKPSIPLNDKGKLSLILHPPNVKF
                              LIKKRK*IKN*FK**VSGLFIILDIDLLFFFLKKKKKKMG*NK*KY*KLV*FI*FYYNYK
                              KKKKKKKS*IFS*IIKKKKKKKKKS*NFDFFMLLI*KKKRN*F*LLL**F*LYLYLKK**
                              KIIFFFFFKFL*FLTIFFFFFFFFFFFFFQ KNRINKINSKIPGPIGLPIIGNLHQLKNN

>_1
                              ENENKKKKKKKKKKFCWGKKKKVLAQKKKKK*NKKKKK*NKKKKK*KNLYRTNNQIKDNN
                              LINFICFLKKKKFRVSDFFFSLRYCSEIK**NSLRYCSEILLKIPYNNFLLFLQFISY*L
                              SICLEIKK*PNNNYNNNFLVLYILIQDLQHSCVCIH*NSYIKLLIIVYLL*C**TMIINL
                              V*KNIKLMFFF*LYVPLFIMINIAITLNLGRVWQIS**VIMKLYIS*LLCLLK*V*YSKP
                              LILKNI*FNFI*KKKIKKKKKNKKK*NPISKSIIPLYQ*IFISQIINYLLFSKKSLSGKI
                              KLKKKLIFFFFFFLKPPIVWEKRF*NCFI*IINILIY*SIDSNIDIDHFLKYNKKK*FNF
                              DFFFFFFFFFFFFFFGYGMKNLVQFIKLNLDLLKL*F*LVIQL*KDQLKIIQQFFQIDIN
                              FQVKLK*IIIQIY*YVMVNNINY*EK*FIQN*LKQKLK*LKIIFYYKLKNYVIILIKQ*M
                              MNQFYILVMVYQFMIF*NIFHYI*FFVYYLENQIENGIIDKNKNQSIVVKLLDEFNNGLL
                              NWDNIFVIFVLVVRIQVVIHLYFH*FYEQIIKVVKKNFIMKFKNH*LIIIVVRNLLIIII
                              IMKNL*IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMKNL*IIIIIIKIIIIIKIIIIII
                              IIIIKIY*LLKIVIIVHQLLTLQ*L*KKFIDYILLVY*VYQF**IMILKLMVLKLKKVLK
                              LLKIFINHIDMKIFLNYQIILYLKEILKVVINLFMVVVIQIWYNLVLEIEIVLVNL*QPV
                              KFLHY*QL*LIDMNFKIQNLQYL*MIKVN*VLFFIHQMLNF*LKKENK*KINLNSRLVVY
                              L*F*I*IYFFFF*KKKKKKWVKINKNIKNWFNLFNFIIIIKKKKKKKKVKFFHK*LKKKK
                              KKKKKVKILIFLCY*FKKKKEINFNYYYNNFNFIYI*KSNKK*FFFFFLNFYSF*QFFFF
                              FFFFFFFFFFRKIELIKLIQKYQDQLVYQLLVIYIN*KII
                              >_2
                              KMKIKKKKKKKKKSFVGEKKKKFWRKKKKKNKIKKKKNKIKKKKNKKTYIELIIK*KIII
                              **TLFVF*KKKNLGSVIFFFHYATVVKLNNKIHYATVVKFY*KFPTTIFFYFYNLLVIN*
                              VYV*K*RNNQTTIIIIIF*FYIF*FKICNIRVFVYIKIHI*NY*L*FTCFNVDKL*LSIW
                              FKKILN*CFFFNCMFLYLL*SILPSH*IWEEFGKYHDE*L*NYIFHNYFAS*NRFDILNH
                              *Y*KIYNLILYKKKK*KKKKKIKKNKTLYQNL*FPYINRFLSLKLLIIYYFQKKVLVGK*
                              N*KKN*FFFFFFF*NHQ*FGKRGFKTVLFK**IY*FIEV*IQI*ILTTF*NIIKKSNSIL
                              IFFFFFFFFFFFFFLVMV*KIWSNL*N*TWIY*NCSFNWLSNYEKIN*RLSNNFFK*ISI
                              FK*N*NE**FKFINM*W*TI*IIKKNNSFRINLNKS*NN*KLYFITS*KIM**Y**NNK*
                              *INSTSW*WYINL*FFKTFFIIYNSLFTIWKIKLKMV*LIKIKINQ*L*NY*MNLIMVY*
                              IGIIYL*SLYWWYGYK**YTYIFINSMNK*SKLSRKTL**NSKIIN******GIF*****
                              **RISK*******************************RISK******K*****K******
                              ****RFINY*K**L*FINYLPFNDYERSL*IISCWCIRFTSFNR**Y*N*WF*N*KRYSN
                              Y*KFLSIT*T*RFF*ITKSFYT*KKY*KW**IYLWLW*YKYGTIWSWK*RLYW*IFSNQ*
                              NFYIISNSN**I*ILKSKTFNTFK**R*IKSYSSSTKC*IFN*KKKINKKLI*IVG*WFI
                              YNFRYRFTFFFFKKKKKKNGLK*IKILKIGLIYLILL*L*KKKKKKKKLNFFINN*KKKK
                              KKKKKLKF*FFYVINLKKKKKLILIIIIIILTLFIFEKVIKNNFFFFF*IFIVFNNFFFF
                              FFFFFFFFFSEK*N**N*FKNTRTNWFTNYW*FTSIKK*
                              >_3
                              K*K*KKKKKKKKKVLLGKKKKSFGAKKKKKIK*KKKKIK*KKKKIKKLI*N**SNKR**F
                              NKLYLFFEKKKI*GQ*FFFFTTLL**N*IIKFTTLL**NFIENSLQQFSFISTIY*LLIE
                              YMFRNKEITKQQL***FFSFIYFDSRFATFVCLYTLKFIYKTINYSLLALMLINYDYQFG
                              LKKY*TNVFFLIVCSFIYYDQYCHHIKFGKSLANIMMSNYEIIYFIITLPLKIGLIF*TI
                              NIKKYII*FYIKKKNKKKKKK*KKIKPYIKIYNSLISIDFYLSNY*LFIIFKKKS*WENK
                              IKKKINFFFFFFFKTTNSLGKEVLKLFYLNNKYINLLKYRFKYRY*PLFEI**KKVIQF*
                              FFFFFFFFFFFFFFWL WYEKFGPIYKIKLGSIETVVLTGYPIMKRSIKDYPTIFSNRYQF
                              SSKIKMNNNSNLLICNGEQYKLLRKIIHSELT*TKVKIIENYILLQVEKLCNNIDKTIND
                              ESILHLGDGISIYDFLKHFSLYIILCLLFG KSN*KWYN**K*KSINSCEIIR*I**WFIK
                              LG*YIC DLCIGGTDTSSDTLIFSLIL*TNNQSCQEKLYNEIQKSLINNN SSEESFNNNNN
                              NEESLNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNEESLNNNNNNKNNNNNKNNNNNNN
                              NNNKDLLIIKNSNYSSSITYPSMIMKEVYRLYPVGVLGLPVLIDNDIEIDGFKIKKGTQI
                              IKNFYQSHRHEDFFKLPNHFIPERNIESGNKFIYGCGDTNMVQFGLGNRDCIGKSLATSE
                              IFTLLATLINRYEF*NPKPSIPLNDKGKLSLILHPPNVKFLIKKRK*IKN*FK**VSGLF
                              IILDIDLLFFFLKKKKKKMG*NK*KY*KLV*FI*FYYNYKKKKKKKKS*IFS*IIKKKKK
                              KKKKS*NFDFFMLLI*KKKRN*F*LLL**F*LYLYLKK**KIIFFFFFKFL*FLTIFFFF
                              FFFFFFFFFQKNRINKINSKIPGPIGLPIIGNLHQLKNN

CYP513E3P Seq 37b a pseudogene 39% to 513E2P many frameshifts, an insertion and a deletion
This seq is before seq 37a above on the same clone (gene cluster) no ESTs
aa 50 WYEKFGPIYKIKLGSIETVVLTGYPIMKRSIKDYPTIFSNRYQF
SSKIKMNNNSNLLICNGEQYKLLRKIIHSELT*TKVKIIENYILLQVEKLCNNIDKTIND
ESILHLGDGISIYDFLKHFSLYIILCLLFG (frameshift)
NQIENGIIDKNKNQSIVVKLLDEFNNGLLNWDNIFV (frameshift)
(approx. 69aa gap)
DLCIGGTDTSSDTLIFSLIL*TNNQSCQEKLYNEIQKSLINNN (73aa insertion of mostly NNNNN)
NKDLLIIKNSNYSSS
ITYPSMIMKEVYRLYPVGVLGLPVLIDNDIEIDGFKIKKGTQ
IIKNFYQSHRHEDFFKLPNHFIPERNIESGNKFIYGCGDTNMVQFGLGNRDCIGKSLATSEIFTLLATLINRYEF
*NPKPSIPLNDKGKLSLILHPPNVKFLIKKRK*
Dict-IV-V63d01.p1c
Dict-IV-V545f10.q1c
JAX4a160h07.r1 N-term
JAX4a160h07.s1 C-term
JC2a127b07.s1
JC3e44c07.r1

>JC3e44c07.r1 25202 letters translate frame +1 translate plus frames translate all frames
GAAAATGAAAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTTTTGTTGGGGAAAAAAA
AAAAAAGTTTTGGCGCAAAAAAAAAAAAAAAAATAAAATAAAAAAAAAAAAAAATAAAAT
AAAAAAAAAAAAAAATAAAAAAACTTATATAGAACTAATAATCAAATAAAAGATAATAAT
TTAATAAACTTTATTTGTTTTTTGAAAAAAAAAAAATTTAGGGTCAGTGATTTTTTTTTT
TCACTACGCTACTGTAGTGAAATTAAATAATAAAATTCACTACGCTACTGTAGTGAAATT
TTATTGAAAATTCCCTACAACAATTTTCTTTTATTTCTACAATTTATTAGTTATTAATTG
AGTATATGTTTAGAAATAAAGAAATAACCAAACAACAATTATAATAATAATTTTTTAGTT
TTATATATTTTGATTCAAGATTTGCAACATTCGTGTGTTTGTATACATTAAAATTCATAT
ATAAAACTATTAATTATAGTTTACTTGCTTTAATGTTGATAAACTATGATTATCAATTTG
GTTTAAAAAAATATTAAACTAATGTTTTTTTTTTAATTGTATGTTCCTTTATTTATTATG
ATCAATATTGCCATCACATTAAATTTGGGAAGAGTTTGGCAAATATCATGATGAGTAATT
ATGAAATTATATATTTCATAATTACTTTGCCTCTTAAAATAGGTTTGATATTCTAAACCA
TTAATATTAAAAAATATATAATTTAATTTTATATAAAAAAAAAAAATAAAAAAAAAAAAA
AAAAATAAAAAAAAATAAAACCCTATATCAAAATCTATAATTCCCTTATATCAATAGATT
TTTATCTCTCAAATTATTAATTATTTATTATTTTCAAAAAAAAGTCTTAGTGGGAAAATA
AAATTAAAAAAAAAATTAATTTTTTTTTTTTTTTTTTTTTTAAAACCACCAATAGTTTGG
GAAAAGAGGTTTTAAAACTGTTTTATTTAAATAATAAATATATTAATTTATTGAAGTATA
GATTCAAATATAGATATTGACCACTTTTTGAAATATAATAAAAAAAAGTAATTCAATTTT
GATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGTTATGGTATGAAA
AATTTGGTCCAATTTATAAAATTAAACTTGGATCTATTGAAACTGTAGTTTTAACTGGTT
ATCCAATTATGAAAAGATCAATTAAAGATTATCCAACAATTTTTTCAAATAGATATCAAT
TTTCAAGTAAAATTAAAATGAATAATAATTCAAATTTATTAATATGTAATGGTGAACAAT
ATAAATTATTAAGAAAAATAATTCATTCAGAATTAACTTAAACAAAAGTTAAAATAATTG
AAAATTATATTTTATTACAAGTTGAAAAATTATGTAATAATATTGATAAAACAATAAATG
ATGAATCAATTCTACATCTTGGTGATGGTATATCAATTTATGATTTTTTAAAACATTTTT
CATTATATATAATTCTTTGTTTACTATTTGGA AAATCAAATTGAAAATGGTATAATTGAT 1560
AAAAATAAAAATCAATCAATAGTTGTGAAATTATTAGATGAATTTAATAATGGTTTATTA
AATTGGGATAATATATTTGTGATCTTTGTATTGGTGGTACGGATACAAGTAGTGATACAC
TTATATTTTCATTAATTCTATGAACAAATAATCAAAGTTGTCAAGAAAAACTTTATAATG
AAATTCAAAAATCATTAATTAATAATAATAGTAGTGAGGAATCTTTTAATAATAATAATA
ATAATGAAGAATCTCTAAATAATAATAATAATAATAATAATAATAATAATAATAATAATA
ATAATAATAATAATAATAATAATAATAATAATAATAATAATAATAATAATGAAGAATCTC 1920
TAAATAATAATAATAATAATAAAAATAATAATAATAATAAAAATAATAATAATAATAATA 1980
ATAATAATAATAAAGATTTATTAATTATTAAAAATAGTAATTATAGTTCATCAATTACTT 2040
ACCCTTCAATGATTATGAAAGAAGTTTATAGATTATATCCTGTTGGTGTATTAGGTTTAC
CAGTTTTAATAGATAATGATATTGAAATTGATGGTTTTAAAATTAAAAAAGGTACTCAAA
TTATTAAAAATTTTTATCAATCACATAGACATGAAGATTTTTTTAAATTACCAAATCATT
TTATACCTGAAAGAAATATTGAAAGTGGTAATAAATTTATTTATGGTTGTGGTGATACAA
ATATGGTACAATTTGGTCTTGGAAATAGAGATTGTATTGGTAAATCTTTAGCAACCAGTG
AAATTTTTACATTATTAGCAACTCTAATTAATAGATATGAATTTTAAAATCCAAAACCTT
CAATACCTTTAAATGATAAAGGTAAATTAAGTCTTATTCTTCATCCACCAAATGTTAAAT
TTTTAATTAAAAAAAGAAAATAAATAAAAAATTAATTTAAATAGTAGGTTAGTGGTTTAT

>_1
                              KSN*KWYN**K*KSINSCEIIR*I**WFIKLG*YICDLCIGGTDTSSDTLIFSLIL*TNN
                              QSCQEKLYNEIQKSLINNN SSEESFNNNNNNEESLNNNNNNNNNNNNNNNNNNNNNNNNN
                              NNNNNNEESLNNNNNNKNNNNNKNNNNNNNNN NKDLLIIKNSNYSSSITYPSMIMKEVYR
                              LYPVGVLGLX
                              >_2
                              NQIENGIIDKNKNQSIVVKLLDEFNNGLLNWDNIFVIFVLVVRIQVVIHLYFH*FYEQII
                              KVVKKNFIMKFKNH*LIIIVVRNLLIIIIIMKNL*IIIIIIIIIIIIIIIIIIIIIIIII
                              IIIIIMKNL*IIIIIIKIIIIIKIIIIIIIIIIKIY*LLKIVIIVHQLLTLQ*L*KKFID
                              YILLVY*VY
                              >_3
                              IKLKMV*LIKIKINQ*L*NY*MNLIMVY*IGIIYL*SLYWWYGYK**YTYIFINSMNK*S
                              KLSRKTL**NSKIIN******GIF*******RISK*************************
                              ******RISK******K*****K**********RFINY*K**L*FINYLPFNDYERSL*I
                              ISCWCIRF

CYP513G1P seq 41P complete 37% to 513A1 no ESTs, probable pseudogene, several small deletions and 
a frameshift and in frame stop codon
MIYFIIIVLIIFLIIKNLNNRIYQI
NSKIPGPGFSLPILGNLHSMNNEPHLKIHEWYNKYGNIFRIKMA
NIETVVLTEKPLLKKAFIDNSNLFKKRYLYLNKNAGIILS
NGEVHNKIKTAVLS
TITNSKIKSIKDHHFNRELLKLIEILDEIANQGKPVQLISLIKQFTLNITCSLIF
GFSFPIKVENDGEASKLVNGVNNFLN
FSKFNVFLQYLPKIGNYLKCDPHSIFIEMVSDLVDKYIENEKKMILQILL
FQINF*NYMKKMKLLKEXXXXXXXXXXIYYFYA
DLLFTGVDLTSNTALFSLV
ELINNRNYQIKLYDQVKNNXXXXXXXXXXHGELILLNSKYNNNFTFLKSFLLETYRLHPALPLSA
FTPHYLIEDVEINHIKIAKGXXXXXXXXXXXX (frameshift)
SNKFFKSPNDFIPERFEKENDEMDLAAYGIGVRDCIGKSIAKSELFTILATL
INRYEFINPGTNGFGSYKIGLSCPDNFILIKKRNNKN*

Dict-IV-V45f06.p1c some diferences
JAX4a97h06.r1 = PGFSXPILGNXHSMNNXPHLKIHEWYNKY
JAX4a97h06.s1 N-terminal
IICBP3D35219 5 diffs with other seq
JAX4a45g03.s1 extends to mid
JC3a262e07.s1
JC3e128e02.r1
JC3a01e08.r1
JC3a276e05.r1

>JC3a276e05.r1 Clone JC3a276e05, reverse read, bases 114 through 679, from
            2002-08-26
        Length = 564

  Plus Strand HSPs:

No first intron like other 513 seqs.

Query:     1 MIYFIIIVLIIFLIIKNLNNRIYQINSKIPGPGFSLPILGNLHSMNN 47
             MIYFIIIVLIIFLIIKNLNNRIYQINSKIPGPGFSLPILGNLHSMNN
Sbjct:   233 MIYFIIIVLIIFLIIKNLNNRIYQINSKIPGPGFSLPILGNLHSMNN 373

>JC3a262e07.s1 Clone JC3a262e07, standard read, bases 48 through 644, from
            2002-08-13
        Length = 595

  Plus Strand HSPs:

 Score = 145 (56.1 bits), Expect = 3.2e-09, P = 3.2e-09
 Identities = 27/27 (100%), Positives = 27/27 (100%), Frame = +1

Query:    60 YQIKLYDQVKNNHGELILLNSKYNNNF 86
             YQIKLYDQVKNNHGELILLNSKYNNNF
Sbjct:     1 YQIKLYDQVKNNHGELILLNSKYNNNF 81

>JC3a262e07.s1_1 Clone JC3a262e07, standard read, 
                              YQIKLYDQVKNN HGELILLNSKYNNNFTFLKSFLLETYRLHPALPLSAFTPHYLIEDVEI
                              NHIKIAKGTQINFSNHQMISYQKDLKKKMMRWI*LLMELV*EIVLVNQLPNLNYLQF*QL
                              *LIVMNLLIQELMDLEVIKLAFHVLIILY*LKKEIIKIKKKKK*DK*KNQS*FNFISHVL
                              FFKKS*I*K*F*KNFTLIX
                              >JC3a262e07.s1_2 Clone JC3a262e07, standard read, 
                              IKLSYMIK*KIIMVN*FY*TQNIIIISHF*NHFY*KHIDFILLFLYLRLLLII*LKMLK*
                              IILKLQKVLK*IFQITK*FHTRKI*KRK**DGFSCLWNWCKRLYW*INCQI*IIYNFSNF
                              N*SL*IY*SRN*WIWKL*NWPFMS**FYIN*KKK**KLKKKKNKINKKINLNLTL*VMFY
                              FLKKVKFKNNFKKILH*Y
                              >JC3a262e07.s1_3 Clone JC3a262e07, standard read, 
                              SN*VI*SSEK*SW*IDFIKLKI***FHIFKIIFIRNI*TSSCSSFICVYSSLFN*RC*NK
                              SY*NCKRYSNKFFKSPNDFIPERFEKENDEMDLAAYGIGVRDCIGKSIAKSELFTILATL
                              INRYEFINPGTNGFGSYKIGLSCPDNFILIKKRNNKN*KKKKIR*IKKSILI*LYKSCFI
                              F*KKLNLKIILKKFYIN

>JC3a262e07.s1 Clone JC3a262e07, standard read, bases 48 through 644, from 2002-08-13 translate frame +1 translate plus frames translate all frames
TATCAAATTAAGTTATATGATCAAGTGAAAAATAATCATGGTGAATTGATTTTATTAAAC
TCAAAATATAATAATAATTTCACATTTTTAAAATCATTTTTATTAGAAACATATAGACTT
CATCCTGCTCTTCCTTTATCTGCGTTTACTCCTCATTATTTAATTGAAGATGTTGAAATA
AATCATATTAAAATTGCAAAAGGTACTCAAATAAATTTTTCAAATCACCAAATGATTTCA
TACCAGAAAGATTTGAAAAAGAAAATGATGAGATGGATTTAGCTGCTTATGGAATTGGTG
TAAGAGATTGTATTGGTAAATCAATTGCCAAATCTGAATTATTTACAATTTTAGCAACTT
TAATTAATCGTTATGAATTTATTAATCCAGGAACTAATGGATTTGGAAGTTATAAAATTG
GCCTTTCATGTCCTGATAATTTTATATTAATTAAAAAAAGAAATAATAAAAATTAAAAAA
AAAAAAAAATAAGATAAATAAAAAAATCAATCTTAATTTAACTTTATAAGTCATGTTTTA
TTTTTTAAAAAAAGTTAAATTTAAAAATAATTTTAAAAAAATTTTACATTAATAC


>JC3e128e02.r1 Clone JC3e128e02, reverse read, bases 115 through 726, from
            2002-10-08
        Length = 610

  Minus Strand HSPs:

 Score = 403 (146.9 bits), Expect = 1.4e-36, P = 1.4e-36
 Identities = 79/87 (90%), Positives = 83/87 (95%), Frame = -1

Query:     1 NEKKMILQILLFQINF-NYMKKMKLLKEIYYFYADLLFTGVDLTSNTALFSLVELINNRN 59
             N+KK ILQILLF+ NF NYMK+M+ LKEIYYFYADLLFTGVDLTSNTALFSLVELINNRN
Sbjct:   610 NKKKRILQILLFRTNF*NYMKRMRFLKEIYYFYADLLFTGVDLTSNTALFSLVELINNRN 431

Query:    60 YQIKLYDQVKNNHGELILLNSKYNNNF 86
             YQIKLYDQVKNNHGELILLNSKYNNNF
Sbjct:   430 YQIKLYDQVKNNHGELILLNSKYNNNF 350

>JC3a01e08.r1 Clone JC3a01e08, reverse read, bases 20 through 462, from
            2000-01-18
        Length = 441

  Minus Strand HSPs:

 Score = 243 (90.6 bits), Expect = 3.2e-30, Sum P(2) = 3.2e-30
 Identities = 48/50 (96%), Positives = 48/50 (96%), Frame = -1

Query:     1 FSKFNXFLQYLPKIGNYLKXDPHSIFIEMVSDLVDKYIENEKKMILQILL 50
             FSKFN FLQYLPKIGNYLK DPHSIFIEMVSDLVDKYIENEKKMILQILL
Sbjct:   378 FSKFNVFLQYLPKIGNYLKCDPHSIFIEMVSDLVDKYIENEKKMILQILL 229

 Score = 117 (46.2 bits), Expect = 3.2e-30, Sum P(2) = 3.2e-30
 Identities = 23/31 (74%), Positives = 24/31 (77%), Frame = -1

Query:    48 ILLGXDLTSNTALFXXXELINNRNYQIKXYD 78
             +  G DLTSNTALF   ELINNRNYQIK YD
Sbjct:   153 LFTGVDLTSNTALFSLVELINNRNYQIKLYD 61

>JC3a01e08.r1_4 Clone JC3a01e08, reverse read, 
                              IKVENDGEASKLVNGVNNFLNFSKFNVFLQYLPKIGNYLKCDPHSIFIEMVSDLVDKYIE
                              NEKKMILQILL  FQINF*NYMKKMKLLKEIYYFYA DLLFT GVDLTSNTALFSLVELINNRN
                              YQIKLYDQVKNNHGELILLNSKYNNNF
                              >JC3a01e08.r1_5 Clone JC3a01e08, reverse read, 
                              *SRK*W*S**IGKWS**FFKLFKI*CFFTIFTKNW*LFKM*ST*YFYRNGFRFGR*IY*K
                              *KKDDFTNITIPNQFLKLYEKNEITKRNLLFLCRFIIYWCRFNFKYCFI*FSGIN***KL
                              SN*VI*SSEK*SW*IDFIKLKI***FX
                              >JC3a01e08.r1_6 Clone JC3a01e08, reverse read, 
                              LK*KMMVKLVNW*MELIIF*TFQNLMFFYNIYQKLVII*NVIHIVFL*KWFQIW*INILK
                              MKKR*FYKYYYSKSIFKII*KK*NY*KKFIIFMQIYYLLV*I*LQILLYLV*WN*LIIEI
                              IKLSYMIK*KIIMVN*FY*TQNIIIIS

>CYP513A1 Seq. 8+53 complete same as 8b only one intron
MNYLVGLVLIFTIFYFFLQKNDKNMNSKIPGPKGIPILGNLLSMKGDLHLKLQEWYKQYG
VIYRIKMGNVETVVLTEYPIIREAFIGNSNSFVNRFQRKSRLKLNNGENLVIVNGDIHNK
LKTLVLSEMTNQRIKKYETSFIDNEIKKLFKVLDEHADTGKPIILNNHIKMFSMNIVLCF
TFGLNYSYPYDEFEKASEFIKLMVEFFN IAGQPIISDFIPSLEPFIDTSNYLNTYKRIFN
YTSDLITKFKNENEIHNNINDNN KSLADKPILSKLLQSFENGEISWDSVVSTCIDLQTA
GADTSANTILYCLLELINNPNIQSKVYDDIKQAIIQSKENENQNDNENQEQTEEIITLSFN
KYRTLAPYLSMVVKETFRKYPSGTIGLPHVTSEDVELNGYKICAGTQIIQNIWATHRNEK
QFSEPDSFIPERFISQQQSANSNLIHFGCGVRDCIGKSLADSEIFTMLASLINRYEFTNP
NPSTPLNEIGKFGITYSCPENKIIIKKRF*

>SLG684 (SLG684Q) /pub/dna_csm/LIBRARY/SL/SLG6-D/SLG684Q.Seq.d/
        Length = 1191

  Plus Strand HSPs:

Supports first exon boundary

Query:     1 MNYLVGLVLIFTIFYFFLQKNDKNMNSKIPGPKGIPILGNLLSMKG 46
             MNYLVGLVLIFTIFYFFLQKNDKNMNSKIPGPKGIPILGNLLSMKG
Sbjct:     3 MNYLVGLVLIFTIFYFFLQKNDKNMNSKIPGPKGIPILGNLLSMKG 140

>JAX4a45g03.s1 Clone JAX4a45g03, standard read, bases 178 through 781, from
            1998-11-20
        Length = 602

  Plus Strand HSPs:

 Score = 254 (94.5 bits), Expect = 9.1e-21, P = 9.1e-21
 Identities = 50/52 (96%), Positives = 50/52 (96%), Frame = +3

Query:   152 ANQGKPVQLISLIKQFTLNITCSLIFGFSFPIKVENDGEASKLVNGVNNFLN 203
             ANQGKP QLISLIKQFTLNITCSLIFGFSFPIK ENDGEASKLVNGVNNFLN
Sbjct:    84 ANQGKPXQLISLIKQFTLNITCSLIFGFSFPIKXENDGEASKLVNGVNNFLN 239

>_1
TNSKIKFFXXXXPFXXLWGIN*XFRGEXQIKVNQXN*LV*LNNLH*I*LAHLFLGFPFLL
KXKMMXKLVNW*MELIIF*TFQNLMXFYNIYQKLVII*NXIHIVFL*KWFQIW*INILKM
KKR*FYKYYYSKSIFKII*KK*NY*KKFIIFMQIYYLLGXDLTSNTALFKFKXN*LIIEI
IKLXYMIKXKNNHGGIGFY*T
>_2
LIPKLNFXPPPPPFXNFGXLIEXLEXXXKSR*TXSIN*FN*TIYIKYNLLTYFWVFLSY*
XRK*WXS**IGKWS**FFKLFKI*XFFTIFTKNW*LFKMXST*YFYRNGFRFGR*IY*K*
KKDDFTNITIPNQFLKLYEKNEITKRNLLFLCRFIIYWXXI*LQILLYLSLXGIN***KL
SN*XI*SRXKIIMGELDFIKX
>_3
*FQN*IFXXXXPLFXTLGX*LKX*RXXANQGKPXQLISLIKQFTLNITCSLIFGFSFPIK
XENDGEASKLVNGVNNFLN FSKFNXFLQYLPKIGNYLKXDPHSIFIEMVSDLVDKYIENE
KKMILQILL FQINF*NYMKKMKLLKEIYYFYADLLFTGXRFNFKYCFI*V*XELINNRNY
QIKXYDQGEK*SWGNWILLN

>JAX4a97h06.s1 Clone JAX4a97h06, standard read, bases 46 through 724, from
            1998-11-25
        Length = 677

  Plus Strand HSPs:

 Score = 475 (172.3 bits), Expect = 3.1e-44, P = 3.1e-44
 Identities = 92/97 (94%), Positives = 92/97 (94%), Frame = +2

Query:    26 SKIPGPGFSLPILGNLHSMNNEPHLKIHEWYNKYGNIFRIKMANIETVVLTEKPLLKKAF 85
             SKIPGPGFSLPILGNLHSMNNEP  KIH WY KYGNIFRIKMANIETVVLTEKPLLKKAF
Sbjct:   362 SKIPGPGFSLPILGNLHSMNNEPXXKIHXWYXKYGNIFRIKMANIETVVLTEKPLLKKAF 541

Query:    86 IDNSNLFKKRYLYLNKNAGIILSNGEVHNKIKTAVLS 122
             IDNSNLF KRYLYLNKNAGIILSNGEVHNKIKTAVLS
Sbjct:   542 IDNSNLFXKRYLYLNKNAGIILSNGEVHNKIKTAVLS 652



>JC3a276e05.r1 Clone JC3a276e05, reverse read, bases 114 through 679, from
            2002-08-26
        Length = 564

  Plus Strand HSPs:

 Score = 467 (169.5 bits), Expect = 2.6e-43, P = 2.6e-43
 Identities = 91/106 (85%), Positives = 95/106 (89%), Frame = +2

Query:     9 LIFTIFYFFLQKNLSN-----NSKIPGPGFSLPILGNLHSMNNEPHLKIHEWYNKYGNIF 63
             +I  +  F + KNL+N     NSKIPGPGFSLPILGNLHSMNNEPHLKIHEWYNKYGNIF
Sbjct:   245 IIIVLIIFLIIKNLNNRIYQINSKIPGPGFSLPILGNLHSMNNEPHLKIHEWYNKYGNIF 424

Query:    64 RIKMANIETVVLTEKPLLKKAFIDNSNLFKKRYLYLNKNAGIILSN 109
             RIKMANIETVVLTEKPLLKKAFIDNSNLFKKRYLYLNKNAGIILSN
Sbjct:   425 RIKMANIETVVLTEKPLLKKAFIDNSNLFKKRYLYLNKNAGIILSN 562

>JC3a276e05.r1 Clone JC3a276e05, reverse read, bases 114 through 679, from 2002-08-26 translate frame +1 translate plus frames translate all frames
TAAATAAAATTTTAAAAATTTTATATTTTTATAGATTAAAAATAAAAATCTAAAGATTAC
AAAAAAAAAAAAGAATTTATTATAAATCCACTTATAAAAAAAAAACAAAAAAAAAATGGT
TATTATCAACCAATTTCAAAAAAATATAAAACCCTCCAATAAAAACTTTTAATAATTTAT
TTATAATAATAATTTTTTTCTGTAAAAAAAAATCAAAAATATTTAAAGAGAAATGATTTA
TTTTATAATAATTGTTTTAATAATATTTTTAATAATAAAAAATTTAAATAATAGAATTTA
TCAAATAAATTCTAAAATTCCAGGACCTGGATTTAGTTTACCAATTTTGGGTAATTTACA
TTCAATGAATAATGAACCACATTTGAAAATTCATGAATGGTATAATAAATATGGAAATAT
ATTTAGAATTAAAATGGCAAATATTGAAACTGTTGTTTTAACTGAAAAACCATTATTAAA
AAAAGCATTTATTGATAATTCAAATTTATTTAAAAAAAGATATTTATACTTAAATAAAAA
TGCTGGTATAATTTTATCAAATGG

>JC3a276e05.r1_1 Clone JC3a276e05, reverse read, 
                              *IKF*KFYIFID*K*KSKDYKKKKEFIINPLIKKKQKKNGYYQPISKKYKTLQ*KLLIIY
                              L***FFSVKKNQKYLKRNDLFYNNCFNNIFNNKKFK**NLSNKF*NSRTWI*FTNFG*FT
                              FNE**TTFENS*MV**IWKYI*N*NGKY*NCCFN*KTIIKKSIY**FKFI*KKIFILK*K
                              CWYNFIKW
                              >JC3a276e05.r1_2 Clone JC3a276e05, reverse read, 
                              K*NFKNFIFL*IKNKNLKITKKKKNLL*IHL*KKNKKKMVIINQFQKNIKPSNKNF**FI
                              YNNNFFL*KKIKNI*REMIYFIIIVLIIFLIIKNLNNRIYQINSKIPGPGFSLPILGNLH
                              SMNNEPHLKIHEWYNKYGNIFRIKMANIETVVLTEKPLLKKAFIDNSNLFKKRYLYLNKN
                              AGIILSN
                              >JC3a276e05.r1_3 Clone JC3a276e05, reverse read, 
                              NKILKILYFYRLKIKI*RLQKKKRIYYKSTYKKKTKKKWLLSTNFKKI*NPPIKTFNNLF
                              IIIIFFCKKKSKIFKEK*FIL**LF**YF***KI*IIEFIK*ILKFQDLDLVYQFWVIYI
                              Q*IMNHI*KFMNGIINMEIYLELKWQILKLLF*LKNHY*KKHLLIIQIYLKKDIYT*IKM
                              LV*FYQM


CYO508B2P Seq 60 58% to seq 15 pseudogene 2 introns and a corrupted heme region
MIETFVFIFISILMFQFFIKC (0)
YILYRPKFKNELDGPSFPIPFFGNNLQIGKNKILYFHKLENYFKKGIFRVWIGE
VFTVIISDPLIEKEM KKTIKEKPKY
IKRDIFI
Missing C-helix region
LNLKSNLINLVFNESTINLIQSMNCSIKNNKE (0)
FEPKLLCSIFSFSIIFKLLFNIDLKNENYKEIKKIMKLVNEINK
NKEKSNSSDLNSITNFIENQFYY
HISNIDQRLVDFFFFYNLIINNIIYNLYFNL (frameshift)
ISLKDFMDIIIYKENSFETKEKIKKQVVYKCTDYILNK
SESISKVIENFFVLIAQDQDYQYLAFNELKSVINTKL LYNGVGENIIKLSDKSYTPITNSICK
EVLRLNPVDQLSS
PITCKKDSLVNGYFIPKDSQIIINHKSMNLNEKYFNDPFNFNPKRFLNYNNQSINF (2?)
Missing the heme binding region 17 amino acids not in genomic clone
DEIYIAISNILLNFKITAVKNHYNFNVFDDNLQSSVLIEKR*

c-13364a02f06.r1 152368 letters
13364a12c08.s1
13364a15f09.r2
13364a12g04.s1
13364a04c11.s1
AU038895.1 this called 508B1 which is right?
Contig133 Chr 6
DY3850.0.509
IIAGP1D9043
JC2d41b07.s1 this called 508B1 which is right?
sDY3850Ac7.p1c
JC1c234h04.r1
DICTY6P2_0004

>pUC18 152517 letters
        Length = 152,517

  Plus Strand HSPs:

 Score = 166 (63.5 bits), Expect = 6.6e-12, P = 6.6e-12
 Identities = 30/30 (100%), Positives = 30/30 (100%), Frame = +1

Query:      1 YILYRPKFKNELDGPSFPIPFFGNNLQIGK 30
              YILYRPKFKNELDGPSFPIPFFGNNLQIGK
Sbjct: 134386 YILYRPKFKNELDGPSFPIPFFGNNLQIGK 134475

>_1
                              GCLYDDEICDGSTNRCVTKDLIMTPLCKESEKSRNFCYVSNKCAFLNGEEYPNLNKRSCS
                              MKNCVVETIEHLKQCDRIYSLYLSCNNNNNIS*FFF*KIKNK*INK*ILLRK*T*KKKNF
                              KIFFFKLWF*ILLKKKKKN*PHIK*RFSR*ITI*PVVIKHSIEIEKKMLLGKFNQDINL*
                              K*K*NK*KK*NN*L*KFFFFFYFFKLTIIKRNIII*IN*NFFIEKND*NFCFYIYFNINV
                              SIFY*MCNYY*KLFIKKKKKKKN*SNF*YFFLNNKYILYRPKFKNELDGPSFPIPFFGNN
                              >_2
                              VACMMMKFVMVQQIDVLQKI***LLFVKKVKNLEIFVMFQINVHF*MVKNIPT*IKGHVL
                              *KIA*LKQ*NT*SNVIESIHYISHAITIIIFLNFFFKK*KINK*INKYY*ENRHKKKKIL
                              KFFFLNCGSEFY*KKKKKINHT*NKDLVDK*LFDP***NIPLKLKKKCY*ENLTKILIFK
                              NKNKINKKNKIINYKSFFFFFIFLN*LLLKEILSYKLTKIFL*RKMIETFVFIFISILMF
                              QFFIKCVIIIENCL*KKKKKKRINQIFNIFF*IISIYCIDQNLKMN*MVQVFLYHSLEI
                              >_3
                              LLV***NL*WFNK*MCYKRFNNDSSL*RK*KI*KFLLCFK*MCIFEW*RISQLK*KVMFY
                              EKLRS*NNRTPKAM**NLFIISLMQ*Q**YFLIFFLKNKK*INK*INIIKKIDIKKKKF*
                              NFFF*IVVLNFIKKKKKKLTTHKIKI**INNYLTRSNKTFH*N*KKNVIRKI*PRY*SLK
                              IKIK*IKKIK*LIIKVFFFFLFF*TNYY*KKYYHIN*LKFFYREK*LKLLFLYLFQY*CF
                              NFLLNV*LLLKIVYKKKKKKKELIKFLIFFFK**VYIV*TKI*K*IRWSKFSYTILWK*

GGTTGCTTGTATGATGATGAAATTTGTGATGGTTCAACAAATAGATGTGTTACAAAAGAT
TTAATAATGACTCCTCTTTGTAAAGAAAGTGAAAAATCTAGAAATTTTTGTTATGTTTCA
AATAAATGTGCATTTTTGAATGGTGAAGAATATCCCAACTTAAATAAAAGGTCATGTTCT
ATGAAAAATTGCGTAGTTGAAACAATAGAACACCTAAAGCAATGTGATAGAATCTATTCA
TTATATCTCTCATGCAATAACAATAATAATATTTCTTAATTTTTTTTTTAAAAAATAAAA
AATAAATAAATAAATAAATAAATATTATTAAGAAAATAGACATAAAAAAAAAAAAATTTT
AAAATTTTTTTTTTTAAATTGTGGTTCTGAATTTTATTAAAAAAAAAAAAAAAAAATTAA
CCACACATAAAATAAAGATTTAGTAGATAAATAACTATTTGACCCGTAGTAATAAAACAT
TCCATTGAAATTGAAAAAAAAATGTTATTAGGAAAATTTAACCAAGATATTAATCTTTAA
AAATAAAAATAAAATAAATAAAAAAAATAAAATAATTAATTATAAAAGTTTTTTTTTTTT
TTTTATTTTTTTAAACTAACTATTATTAAAAGAAATATTATCATATAAATTAACTAAAAT
TTTTTTATAGAGAAAAATGATTGAAACTTTTGTTTTTATATTTATTTCAATATTAATGTT
TCAATTTTTTATTAAATGTGTAATTATTATTGAAAATTGTTTATAAAAAAAAAAAAAAAA
AAAAAGAATTAATCAAATTTTTAATATTTTTTTTTAAATAATAAGTATATATTGTATAGA 134400
CCAAAATTTAAAAATGAATTAGATGGTCCAAGTTTTCCTATACCATTCTTTGGAAATAAT


>pUC18 152517 letters
        Length = 152,517

  Plus Strand HSPs:

Query:      1 MIETFVFIFISILMFQFFIKCYIL 24
              MIETFVFIFISILMFQFFIKC I+
Sbjct: 134237 MIETFVFIFISILMFQFFIKCVII 134308

Query:     17 FFIKC-YILYRPKFKNELDGPSFPIPFFGNNLQIGKNKILYFHKLENYFKKGIFRVWIGE 75
              FF+   YILYRPKFKNELDGPSFPIPFFGNNLQIGKNKILYFHKLENYFKKGIFRVWIGE
Sbjct: 134368 FFLNNKYILYRPKFKNELDGPSFPIPFFGNNLQIGKNKILYFHKLENYFKKGIFRVWIGE 134547

Query:     76 VFTVIISDPLIEKEMKKTIKEKPKYIKRDIFILNLKSNLINLVFNESTINLIQSMNCSIK 135
              VFTVIISDPLIEKEMKKTIKEKPKYIKRDIFILNLKSNLINLVFNESTINLIQSMNCSIK
Sbjct: 134548 VFTVIISDPLIEKEMKKTIKEKPKYIKRDIFILNLKSNLINLVFNESTINLIQSMNCSIK 134727

Query:    136 NNKEVIFII--YYYY-FTEIIINNYLLQFEPKLLCSIFSFSIIFKLLFNIDLKNENYKEI 192
              NNKEVIFII  YYYY FTEIIINNYLLQFEPKLLCSIFSFSIIFKLLFNIDLKNENYKEI
Sbjct: 134728 NNKEVIFII**YYYY*FTEIIINNYLLQFEPKLLCSIFSFSIIFKLLFNIDLKNENYKEI 134907

Query:    193 KKIMKLVNEINKNKEKSNSSDLNSITNFIENQFYYHISNIDQRLVDFFFFYNLIINNIIY 252
              KKIMKLVNEINKNKEKSNSSDLNSITNFIENQFYYHISNIDQRLVDFFFFYNLIINNIIY
Sbjct: 134908 KKIMKLVNEINKNKEKSNSSDLNSITNFIENQFYYHISNIDQRLVDFFFFYNLIINNIIY 135087

Query:    253 NLYFNLN 259
              NLYFNLN
Sbjct: 135088 NLYFNLN 135108

 Score = 923 (330.0 bits), Expect = 1.2e-241, Sum P(4) = 1.2e-241
 Identities = 183/205 (89%), Positives = 190/205 (92%), Frame = +2

Query:    237 VDFFFFYNLIINNIIYNLYFNLN--IILFTIYTLI-ISLKDFMDIIIYKENSFETKEKIK 293
              ++F   Y ++I   +   +F +   IILFTIYTLI ISLKDFMDIIIYKENSFETKEKIK
Sbjct: 134999 INFITIYQILIKG*LIFFFFII***IILFTIYTLI*ISLKDFMDIIIYKENSFETKEKIK 135178

Query:    294 KQVVYKCTDYILNKSESISKVIENFFVLIAQDQDYQYLAFNELKSVINTKLLYNGVGENI 353
              KQVVYKCTDYILNKSESISKVIENFFVLIAQDQDYQYLAFNELKSVINTKLLYNGVGENI
Sbjct: 135179 KQVVYKCTDYILNKSESISKVIENFFVLIAQDQDYQYLAFNELKSVINTKLLYNGVGENI 135358

Query:    354 IKLSDKSYTPITNSICKEVLRLNPVDQLSSPITCKKDSLVNGYFIPKDSQIIINHKSMNL 413
              IKLSDKSYTPITNSICKEVLRLNPVDQLSSPITCKKDSLVNGYFIPKDSQIIINHKSMNL
Sbjct: 135359 IKLSDKSYTPITNSICKEVLRLNPVDQLSSPITCKKDSLVNGYFIPKDSQIIINHKSMNL 135538

Query:    414 NEKYFNDPFNFNPKRFLNYNNQSIN 438
              NEKYFNDPFNFNPKRFLNYNNQSIN
Sbjct: 135539 NEKYFNDPFNFNPKRFLNYNNQSIN 135613

 Score = 219 (82.2 bits), Expect = 1.2e-241, Sum P(4) = 1.2e-241
 Identities = 47/63 (74%), Positives = 50/63 (79%), Frame = +1

Query:    423 NFNPKR---FLNYNNQSINF--DEIYIAISNILLNFKITAVKNHYNFNVFDDNLQSSVLI 477
              N+ P     F  + + S NF  DEIYIAISNILLNFKITAVKNHYNFNVFDDNLQSSVLI
Sbjct: 135643 NYQPNTNFFFFFFKSNSCNFSSDEIYIAISNILLNFKITAVKNHYNFNVFDDNLQSSVLI 135822

Query:    478 EKR 480
              EKR
Sbjct: 135823 EKR 135831

>pUC18 152517 letters translate frame +1 translate plus frames translate all frames
TTTTATTTTTTTAAACTAACTATTATTAAAAGAAATATTATCATATAAATTAACTAAAAT 134220
TTTTTTATAGAGAAAAATGATTGAAACTTTTGTTTTTATATTTATTTCAATATTAATGTT
                M  I  E  T
TCAATTTTTTATTAAATGTGTAATTATTATTGAAAATTGTTTATAAAAAAAAAAAAAAAA
AAAAAGAATTAATCAAATTTTTAATATTTTTTTTTAAATAATAAGTATATATTGTATAGA
                                             Y  I  L
CCAAAATTTAAAAATGAATTAGATGGTCCAAGTTTTCCTATACCATTCTTTGGAAATAAT
TTACAAATTGGTAAAAATAAAATTTTATATTTTCACAAATTAGAAAATTATTTTAAAAAA
GGAATTTTTAGAGTTTGGATAGGTGAAGTATTTACAGTAATAATTAGTGATCCTTTAATT
GAAAAAGAAATGAAAAAAACTATAAAAGAAAAACCTAAATATATTAAAAGAGATATTTTT
                                    K  Y
ATTTTAAATTTAAAATCAAATTTAATAAATTTGGTTTTTAATGAAAGTACAATTAATTTA
ATTCAGTCAATGAATTGTAGTATAAAAAACAATAAAGAGGTAATTTTTATAATATAATAA
TATTATTATTATTAGTTTACGGAAATTATTATTAACAACTATTTATTACAGTTTGAACCA
                                                   F  E  P
AAATTATTATGTTCAATTTTTTCATTTTCCATTATCTTTAAATTATTATTTAATATTGAT
TTAAAGAATGAAAATTATAAAGAAATAAAAAAAATAATGAAATTGGTGAATGAAATTAAT
L  K  N  E  N                          K  L  V
AAGAATAAAGAAAAGTCAAATTCTAGTGATTTAAATTCAATAACTAATTTTATTGAAAAT
CAATTTTATTACCATATATCAAATATTGATCAAAGGTTAGTTGATTTTTTTTTTTTTTAT
      Y  Y  H
AATTTAATAATAAATAATATTATTTACAATTTATACTTTAATTTAAATTAGTTTAAAAGA 135120
    *  *  *  I  I  L  F             F  N  L  NI  S  L  K  D
TTTTATGGATATTATTATTTATAAAGAAAATAGTTTTGAAACAAAAGAAAAAATTAAAAA
 F  M  D                        
ACAAGTAGTTTATAAATGTACAGACTATATTTTAAATAAATCCGAAAGTATATCAAAAGT
TATTGAAAATTTTTTTGTATTAATTGCACAAGATCAAGACTATCAATATTTAGCTTTTAA
TGAATTGAAATCCGTTATTAATACAAAACTTTTATATAATGGTGTTGGGGAAAATATTAT
AAAACTATCTGATAAATCATATACACCCATTACAAATTCTATTTGTAAAGAAGTTTTAAG
GTTAAATCCAGTGGATCAATTATCGTCACCAATTACTTGTAAAAAAGATAGTTTGGTAAA
TGGTTATTTTATTCCAAAAGATTCCCAAATAATTATAAATCATAAGTCAATGAATTTAAA 135540
TGAAAAATATTTTAATGATCCATTTAATTTTAATCCAAAAAGATTTTTAAATTATAATAA 135600
CCAATCAATTAATTTGTAAGTAATAAAAAAAAAGAAAAAAAAAATTATCAACCAAATACT
 Q  S  I  N
AACTTTTTTTTTTTCTTTTTTAAAAGCAACTCTTGTAATTTTTCAAGTGATGAGATTTAT
                                                D  E  I  Y
ATTGCAATTTCAAATATTTTATTAAATTTTAAAATAACAGCTGTTAAAAATCATTATAAT
                                       A  V  K  N
TTTAATGTATTTGATGATAATTTACAATCATCAGTTCTGATTGAAAAAAGATAAAAATAA
                                    L  I  E  K  R  *
ATTTCGTTTTTTCATTTTTTATTTTTTTTTTTTTTATTTTTATTTTTCAAAAAAACGATA

CYP555A1 seq 76 complete 485aa 41% to seq 4 no ESTs at N-term, no short N-term exon
MIIIVIVVFLFYFSFLNLNLNPKKKRPPSPITLPVIGNLISLLNNQPQNILFNYYKKYGKIYQLQYGIVNTVVL
SEFDILKEAFIENGEVFIERYNKITKKFKS (1)
SENIVNSNGLIWKKLQSISIQ
ELSPNIKIKKYEPMIINETNKLIDSFNEHIKSNESIDPTLNIKICFLNIIISFLFNFRYNDYKDEKVIQLVDYIHSIF
RMGSHPIPQDYIPILNKFYINKTTKIHQKIFENIYEYIENQVQKRLEILNKNNNNNNNNINECFVDLLLLKFKSNLLTWNEVIKTTTDLMIAGSDTN (0)
SLFTIHLIIALTNRENIQNKVFNEILNFYILNENNKITFSNKSKTPYYNSVLKEVERRFTVSPLSQPHRTNKDIILNG
YFIPSGSQIIQNVYSCHLNDKDWENPFQFNPDRFLNNNQLEKKLITFGMGPRNCLGFQFA 256
LMSIWIVNLILFKSIKFSSNKLIEEEIREGGTTLSPFPFKINLIKR* 394
Dict-IV-V992d07.q1c
Dict-IV-V617c06.q1c
IIABP1D3513
JC1a143d05.s1
JC1b44a01.s1
JC1c66e12.r1
IICCP1E10018
IICCP1D1715
IIAHP1D8389
Contig_4318

>Contig_4318_1 D. discoideum, sequenced by the D. 
                              *LIILFNLKLYFLSYL*FFFIIYLLLFMNNLQIKIQ*NYMSNN*NSNKKFKKVFFFFFY*
                              LKNSVKEKKKNLKKLTTK***LNNLKFFLII*KRKKTHKNTQKHQPFFFIFFFFLKRIFF
                              FWGLSFIIFFIYLIFYFSSLIIE*ILKKTKFEFVKKQ*IKIKIKIKIKKKK**KK***L*
                              *LFFYFIFLF*I*I*ILKRKDHLLQLLCQ*LGI*FLY*IINHKIFYLIIIKNMVKFINYN
                              MVL*ILLYYLNLIF*KKHSLKMVKFLLKDIIKLLKNLNLVSINIFYFILFFLLI*KLF*I
                              IAENIVNSNGLIWKKLQSIS
                              >Contig_4318_2 D. discoideum, sequenced by the D. 
                              NLLFYLI*NYIFYLICNFFLLFICYFL*IIFK*RFNKTI*VIIRTVIKNLKKFFFFFFIN
                              *KIVLRKKKKI*KN*PQNNNN*II*NFF*LFKKEKKHTKTHKNTSHFFLFFFFF*KEFFF
                              FGDYLLLFFLFI*FFISLH**LNKY*KKQSLNLLRNNK*K*K*K*KLKKKNNKKNDNNCN
                              SCFFILFFFFKFKFKS*KEKTTFSNYFASNWEFNFFIK*STTKYFI*LL*KIW*NLSITI
                              WYCKYCCII*I*YFKRSIH*KW*SFY*KI**NY*KI*IL*VSIYFILFYFFY*FKNCFEL
                              *QKIL*IQMD*YGKNYNQF
                              >Contig_4318_3 D. discoideum, sequenced by the D. 
                              TYYFI*FEIIFFILFVIFFYYLFVTFYE*SSNKDSIKLYE**LEQ**KI*KSFFFFFLLI
                              KK*C*GKKKKFKKINHKIIIIK*FKIFFDYLKKKKNTQKHTKTPAIFFYFFFFFEKNFFF
                              LGIIFYYFFYLFNFLFLFINN*INIKKNKV*IC*ETINKNKNKNKN*KKKIIKK MIIIVI
                              VVFLFYF SFLNLNLNPKKKRPPSPITLPVIGNLISLLNNQPQNILFNYYKKYGKIYQLQY
                              GIVNTVVLSEFDILKEAFIENGEVFIERYNKITKKFKSCKYQYILFYFIFFINLKIVLNY
                              SRKYCKFKWINMEKITINF

>Contig_4318, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 6911

  Plus Strand HSPs:

Query:     1 MIIIVIVVFLFYFSFLNLNLNPKKKRPPSPITLPVIGNLISLLNNQPQNILFNYYKKYGK 60
             MIIIVIVVFLFYFSFLNLNLNPKKKRPPSPITLPVIGNLISLLNNQPQNILFNYYKKYGK
Sbjct:   525 MIIIVIVVFLFYFSFLNLNLNPKKKRPPSPITLPVIGNLISLLNNQPQNILFNYYKKYGK 704

Query:    61 IYQLQYGIVNTVVLSEFDILKEAFIENGEVFIERYNKITKKFKS 104
             IYQLQYGIVNTVVLSEFDILKEAFIENGEVFIERYNKITKKFKS
Sbjct:   705 IYQLQYGIVNTVVLSEFDILKEAFIENGEVFIERYNKITKKFKS 836

Query:    91 F--IERYNKITKKF-KSSENIVNSNGLIWKKLQSISIQELSPNIKIKKYEPMIINETNKL 147
             F  I  +  I K F   +ENIVNSNGLIWKKLQSISIQELSPNIKIKKYEPMIINETNKL
Sbjct:   853 FYFILFFLLI*KLF*IIAENIVNSNGLIWKKLQSISIQELSPNIKIKKYEPMIINETNKL 1032

Query:   148 IDSFNEHIKSNESIDPTLNIKICFLNIIISFLFNFRYNDYKDEKVIQLVDYIHSIFRMGS 207
             IDSFNEHIKSNESIDPTLNIKICFLNIIISFLFNFRYNDYKDEKVIQLVDYIHSIFRMGS
Sbjct:  1033 IDSFNEHIKSNESIDPTLNIKICFLNIIISFLFNFRYNDYKDEKVIQLVDYIHSIFRMGS 1212

Query:   208 HPIPQDYIPILNKFYINKTTKIHQKIFENIYEYIENQVQKRLEILNKNNNNNNNNINECF 267
             HPIPQDYIPILNKFYINKTTKIHQKIFENIYEYIENQVQKRLEILNKNNNNNNNNINECF
Sbjct:  1213 HPIPQDYIPILNKFYINKTTKIHQKIFENIYEYIENQVQKRLEILNKNNNNNNNNINECF 1392

Query:   268 VDLLLLKFKSNLLTWNEVIKTTTDLMIAGSDTNSLFTIHLII 309
             VDLLLLKFKSNLLTWNEVIKTTTDLMIAGSDTN +F    II
Sbjct:  1393 VDLLLLKFKSNLLTWNEVIKTTTDLMIAGSDTNVIFNYF*II 1518

Query:   301 SLFTIHLIIALTNRENIQNKVFNEILNFYILNENNKITFSNKSKTPYYNSVLKEVERRFT 360
             SLFTIHLIIALTNRENIQNKVFNEILNFYILNENNKITFSNKSKTPYYNSVLKEVERRFT
Sbjct:  1577 SLFTIHLIIALTNRENIQNKVFNEILNFYILNENNKITFSNKSKTPYYNSVLKEVERRFT 1756

Query:   361 VSPLSQPHRTNKDIILNGYFIPSGSQIIQNVYSCHLNDKDWENPFQFNPDRFLNNNQLEK 420
             VSPLSQPHRTNKDIILNGYFIPSGSQIIQNVYSCHLNDKDWENPFQFNPDRFLNNNQLEK
Sbjct:  1757 VSPLSQPHRTNKDIILNGYFIPSGSQIIQNVYSCHLNDKDWENPFQFNPDRFLNNNQLEK 1936

Query:   421 KLITFGMGPRNCLGFQFALMSIWIVNLILFKSIKFSSNKLIEEEIREGGTTLSPFPFKIN 480
             KLITFGMGPRNCLGFQFALMSIWIVNLILFKSIKFSSNKLIEEEIREGGTTLSPFPFKIN
Sbjct:  1937 KLITFGMGPRNCLGFQFALMSIWIVNLILFKSIKFSSNKLIEEEIREGGTTLSPFPFKIN 2116

Query:   481 LIKR 484
             LIKR
Sbjct:  2117 LIKR 2128

>Contig_4318, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion translate frame +1 translate plus frames translate all frames
TAACTTATTATTTTATTTAATTTGAAATTATATTTTTTATCTTATTTGTAATTTTTTTTT
ATTATTTATTTGTTACTTTTTATGAATAATCTTCAAATAAAGATTCAATAAAACTATATG
AGTAATAATTAGAACAGTAATAAAAAATTTAAAAAAGTTTTTTTTTTTTTTTTTTATTAA
TTAAAAAATAGTGTTAAGGAAAAAAAAAAAAATTTAAAAAAATTAACCACAAAATAATAA
TAATTAAATAATTTAAAATTTTTTTTGATTATTTAAAAAAGAAAAAAAACACACAAAAAC
ACACAAAAACACCAGCCATTTTTTTTTATTTTTTTTTTTTTTTTGAAAAGAATTTTTTTT
TTTTGGGGATTATCTTTTATTATTTTTTTTATTTATTTAATTTTTTATTTCTCTTCATTA
ATAATTGAATAAATATTAAAAAAAACAAAGTTTGAATTTGTTAAGAAACAATAAATAAAA
ATAAAAATAAAAATAAAAATTAAAAAAAAAAAATAATAAAAAAAATGATAATAATTGTAA
TAGTTGTTTTTTTATTTTATTTTTCTTTTTTAAATTTAAATTTAAATCCTAAAAAGAAAA
GACCACCTTCTCCAATTACTTTGCCAGTAATTGGGAATTTAATTTCTTTATTAAATAATC
AACCACAAAATATTTTATTTAATTATTATAAAAAATATGGTAAAATTTATCAATTACAAT
ATGGTATTGTAAATACTGTTGTATTATCTGAATTTGATATTTTAAAAGAAGCATTCATTG 780
AAAATGGTGAAGTTTTTATTGAAAGATATAATAAAATTACTAAAAAATTTAAATCTTGTA
AGTATCAATATATTTTATTTTATTTTATTTTTTTTATTAATTTAAAAATTGTTTTGAATT 900
ATAGCAGAAAATATTGTAAATTCAAATGGATTAATATGGAAAAAATTACAATCAATTTCA
ATTCAAGAATTATCACCAAATATAAAAATTAAAAAATATGAACCAATGATAATCAATGAA
ACAAATAAATTAATTGATAGTTTCAATGAACATATAAAATCAAATGAATCAATAGACCCA
ACATTAAATATTAAAATTTGTTTTTTAAATATAATTATATCATTTTTATTTAATTTTAGA
TATAATGATTATAAAGATGAAAAAGTCATTCAACTTGTTGATTATATACATTCTATATTT
AGAATGGGGTCACATCCAATTCCACAAGATTATATTCCAATTTTAAATAAATTTTATATA
AATAAAACTACAAAAATTCATCAAAAAATATTTGAAAATATATATGAATATATAGAAAAC
CAAGTTCAAAAACGTTTAGAAATTTTAAATAAAAATAATAATAATAATAATAATAATATA
AATGAATGTTTTGTTGATTTATTATTATTAAAATTTAAATCAAATCTATTAACTTGGAAT
GAAGTTATAAAAACAACAACTGATTTAATGATTGCTGGATCTGATACAAATGTAATTTTT 1500
AATTATTTTTAAATTATATATTATTATTGGTTTTTTTACTAACTTTTTTAATATATTTAA
TTAAATTTAATTTTAGTCACTTTTTACAATTCATTTAATTATTGCATTAACAAATAGAGA
AAATATTCAAAATAAAGTATTTAATGAAATTTTAAATTTTTATATATTAAATGAAAATAA
TAAAATTACATTTTCAAATAAAAGTAAAACACCATATTATAATTCAGTTTTAAAAGAAGT
TGAAAGAAGATTCACAGTTTCCCCATTATCACAACCTCATAGAACAAATAAAGATATTAT
ATTAAATGGTTATTTCATACCAAGTGGTTCTCAAATTATTCAAAATGTTTATTCTTGTCA
TTTAAATGACAAAGATTGGGAAAACCCATTCCAATTTAATCCAGATAGATTTTTAAATAA
TAATCAATTAGAAAAGAAATTAATTACATTTGGAATGGGTCCTAGAAATTGTTTAGGTTT
TCAATTTGCATTAATGTCAATATGGATTGTAAATTTAATATTATTTAAATCAATTAAATT
CTCTTCAAATAAATTAATTGAAGAAGAAATTAGAGAAGGCGGAACAACATTATCACCATT
TCCTTTTAAAATTAATTTAATAAAAAGATAAAAAAAAATAAAAATAAATTAAAACACATT
TTCTTTTTATATATATGTTATTTTTATTTTTATATTTTTATATTTTTTTATATTTTTAAT
TATTCTTTGTTTTTTTGAAAAACAATAGTAATAATTATTAATTTATATATTAATTTAAAT

CYP556A1 Seq 92 no ESTs
MFLTSILYTIIIILIFYKGLE (0)
YLIEKRSFPLVHPIKGVMNGT
KPYFIFGDLPFQRLLNLKPKLKELGSIYFRWFFWYPIVEIKDINAIQYVYNEKSNNYSLY
WNLNKSSNFILTGSEIKRFFRIYYWAFNCKDSLQRIMPVIKSQVFDFIHSHKFQNSTLST
NDDVTN (0)
FMLKLLSRVYLGSSDEAYHCFKANYKKFNKSYLDFFHYL
FPTLLKIPSKFLRKYIKNKNKRALYQVLAMKAYYGVVKQSSDDHIEESMINIIAETSYN
DKEGLSLEEIKMPSYLLNASSIKGPMIMVENLMFQLIEKSEIESKIRKEIKLVFEKNGKDVN
SFDFDDIMEMKYLEATLDEINRLYPPFPKLMPRQTKESDRILGYHIPKGTMISCPVADILRD
PSNFQDPLTFKPERQLIFSNPKFASPSITSIQEINGLS (1)
KRNQRIIKNLPWGIGSKKCLGKELAKLIVKTIIVILYSQYTFDKHLDEND
EELCDTNNQPKIQITFNPEIKPPLLLKSRKLFSISQPKQ*
IIAGP1D5007 
IIAFP2D30522
IICAP1D27823
IICBP1D15714
IICCP1D16048
JC1c140f11.s1


>Contig_4571, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 2625

  Plus Strand HSPs:

Query:     1 MFLTSILYTIIIILIFYKGLE 21
             MFLTSILYTIIIILIFYKGLE
Sbjct:   327 MFLTSILYTIIIILIFYKGLE 389

Query:    14 LIFYKGLEYLIEKRSFPLVHPIKGVMNGTKPYFIFGDLPFQRLLNLKPKLKELGSIYFRW 73
             L FY    YLIEKRSFPLVHPIKGVMNGTKPYFIFGDLPFQRLLNLKPKLKELGSIYFRW
Sbjct:   460 LFFY--F*YLIEKRSFPLVHPIKGVMNGTKPYFIFGDLPFQRLLNLKPKLKELGSIYFRW 633

Query:    74 FFWYPIVEIKDINAIQYVYNEKSNNYSLYWNLNKSSNFILTGSEIKRFFRIYYWAFNCKD 133
             FFWYPIVEIKDINAIQYVYNEKSNNYSLYWNLNKSSNFILTGSEIKRFFRIYYWAFNCKD
Sbjct:   634 FFWYPIVEIKDINAIQYVYNEKSNNYSLYWNLNKSSNFILTGSEIKRFFRIYYWAFNCKD 813

Query:   134 SLQRIMPVIKSQVFDFIHSHKFQNSTLSTNDDVTN 168
             SLQRIMPVIKSQVFDFIHSHKFQNSTLSTNDDVTN
Sbjct:   814 SLQRIMPVIKSQVFDFIHSHKFQNSTLSTNDDVTN 918

Query:   169 FMLKLLSRVYLGSSDEAYHCFKANYKKFNKSYLDFFHYLFPTLLKIPSKFLRKYIKNKNK 228
             FMLKLLSRVYLGSSDEAYHCFKANYKKFNKSYLDFFHYLFPTLLKIPSKF RKYIKNKNK
Sbjct:   993 FMLKLLSRVYLGSSDEAYHCFKANYKKFNKSYLDFFHYLFPTLLKIPSKFSRKYIKNKNK 1172

Query:   229 RALYQVLAMKAYYGVVKQSSDDHIEESMINIIAETSYNDKEGLSLEEIKMPSYLLNASSI 288
             RALYQVLAMKAYYGVVKQSSDDHIEESMINIIAETSYNDKEGLSLEEIKMPSYLLNASSI
Sbjct:  1173 RALYQVLAMKAYYGVVKQSSDDHIEESMINIIAETSYNDKEGLSLEEIKMPSYLLNASSI 1352

Query:   289 KGPMIMVENLMFQLIEKSEIESKIRKEIKLVFEKNGKDVNSFDFDDIMEMKYLEATLDEI 348
             KGPMIMVENLMFQLIEKSEIESKIRKEIKLVFEKNGKDVNSFDFDDIMEMKYLEATLDEI
Sbjct:  1353 KGPMIMVENLMFQLIEKSEIESKIRKEIKLVFEKNGKDVNSFDFDDIMEMKYLEATLDEI 1532

Query:   349 NRLYPPFPKLMPRQTKESDRILGYHIPKGTMISCPVADILRDPSNFQDPLTFKPERQLIF 408
             NRLYPPFPKLMPRQTKESDRILGYHIPKGTMISCPVADILRDPSNFQDPLTFKPERQLIF
Sbjct:  1533 NRLYPPFPKLMPRQTKESDRILGYHIPKGTMISCPVADILRDPSNFQDPLTFKPERQLIF 1712

Query:   409 SNPKFASPSITSIQEINGLS 428
             SNPKFASPSITSIQEINGLS
Sbjct:  1713 SNPKFASPSITSIQEINGLS 1772

Query:   409 SNPKFASPSITSIQEINGLSKRNQ-IIKNLPWGIGSKKCLGKELAKLIVKTIIVILYSQY 467
             SN K+ S +  +I+E      RNQ IIKNLPWGIGSKKCLGKELAKLIVKTIIVILYSQY
Sbjct:  2151 SNNKYNSYNNLTIEE------RNQRIIKNLPWGIGSKKCLGKELAKLIVKTIIVILYSQY 2312

Query:   468 TFDKHLDENDEELCDTNNQPKIQITFNPEIKPPLLLKSRKLFSISQPKQ 516
             TFDKHLDENDEELCDTNNQPKIQITFNPEIKPPLLLKSRKLFSISQPKQ
Sbjct:  2313 TFDKHLDENDEELCDTNNQPKIQITFNPEIKPPLLLKSRKLFSISQPKQ 2459

>Contig_4571, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion translate frame +1 translate plus frames translate all frames
AAAACCCCTATTAAAATTTATTTATTATGGGCTATTAAAAAAAAATTAATTAAAAAAAAA
ATTATTTGAAATATTATTTATTTATAAAAATTTTTAAATTTTCAATATTAAAATAATATT
TAAATAATATTTGTTTTTTTTTTTTATCTATATTGTTTTTTTTTTTTTTTTTATCTATTT
TTTTTTTTTTTTTATATTATAATTTTTTAATTTATATTTAAAAGTAATAATAATAATAGT
AATAATAATAATTTTTTCTTTTTCTTTTTTTTAAAAACCCCCATATAAAATAATAAAAAT
AATAATAAATAATAGTAATTTAAGAAATGTTTTTAACAAGTATATTATATACAATTATAA
TAATTTTAATATTCTATAAAGGTTTAGAAGTAAATTATTTTTAATTAATTAAAATTCATA
TATTTTTTTTTTTTTTTTTTGAAATATAATTTCTAACATTTATTTTTTTATTTTTAGTAT
TTAATTGAAAAAAGATCATTCCCTTTAGTACACCCAATTAAAGGTGTTATGAATGGTACA
AAACCATATTTTATATTTGGGGATTTACCATTTCAAAGATTATTAAACCTTAAACCAAAA 600
TTAAAAGAACTAGGAAGTATTTATTTTAGATGGTTCTTTTGGTATCCAATAGTTGAAATT
AAAGATATAAATGCAATTCAATATGTTTACAATGAAAAGTCAAATAATTATAGTCTTTAT
TGGAATTTAAATAAATCTAGTAATTTCATTTTAACAGGTTCAGAAATTAAAAGATTCTTT
AGAATTTATTATTGGGCATTCAATTGTAAAGATTCATTACAAAGAATTATGCCAGTTATA
AAATCACAAGTTTTTGATTTCATTCATTCTCACAAATTTCAAAATTCAACATTATCAACA 900
AATGATGATGTTACAAATGTAATTTTTTTCAATTATTATTATTAATATTATTAATATTAT
TTATTAATTTTTTTTTTTTTATTTCGAAAAAGTTTATGTTAAAATTATTAAGTAGAGTTT
ATTTAGGTAGTAGTGATGAAGCATATCATTGCTTTAAAGCAAATTATAAAAAATTTAATA
AGAGTTATTTAGACTTTTTTCATTATTTATTTCCAACATTATTAAAGATACCAAGTAAAT
TCTCAAGAAAATATATAAAGAATAAGAATAAGAGAGCTTTATATCAAGTGTTAGCAATGA 1200
AAGCTTATTATGGTGTTGTTAAACAATCAAGTGATGATCATATTGAAGAATCAATGATAA
ATATAATAGCAGAGACAAGTTATAATGATAAAGAGGGGTTATCTCTTGAAGAGATAAAAA
TGCCATCCTATTTATTAAATGCATCCTCAATTAAAGGCCCAATGATAATGGTAGAGAATT
TAATGTTTCAATTGATAGAAAAATCAGAGATTGAAAGTAAAATTCGTAAAGAGATTAAAC
TTGTATTTGAAAAGAATGGTAAAGATGTAAATTCATTTGATTTTGATGATATAATGGAAA 1500
TGAAATACTTAGAAGCAACATTGGATGAAATTAATAGATTATATCCACCATTTCCAAAAT 
TAATGCCACGTCAAACTAAAGAATCTGATAGAATCTTAGGATATCATATACCAAAGGGTA
CCATGATATCATGTCCTGTCGCTGATATACTTAGAGATCCTTCCAATTTTCAAGACCCAT
TAACATTTAAACCAGAACGTCAATTAATTTTCTCAAATCCTAAATTTGCATCACCATCGA
TTACTTCAATTCAAGAAATTAATGGATTAAGTAGTTCTTCTTCAAATTCATTTGCATTAC 1800
ATCATAGATCATTACCATCAATTAACAACAACAACAACAACAACAACAACAACAACAACA
ACAACAACAACAACAACAACAACAACAACAACAACAACAGCAATAACAACAGCATCAACG
GTAACAATAAAAACAATAATAGAAATTGTATTCAATCTTTTAATAATTCAGCACTTAAAA
AATCATTTTTATCTGATAGTTCCTCAATTATTGATAATATAGTTGGTACAAATCGTGTTG
ATAAATTAAAATTAGATTCTTTAAATGAAAATAGCAATATTAATAATAATAATAATAAAG 2100
ATTTATTAAATGTACCAAATATAATTGAAATTAATAAAAATAATCCAATTTCAAATAATA
AATATAATTCTTATAATAATTTAACAATTGAAGAAAGAAATCAAAGAATTATTAAAAATT 2220
TACCATGGGGTATTGGTAGTAAAAAATGTTTAGGTAAAGAATTGGCAAAATTAATAGTAA
AAACAATAATTGTCATACTTTATTCTCAATATACTTTTGATAAACATCTCGATGAAAATG
ATGAAGAATTATGTGATACTAATAATCAACCTAAAATTCAAATAACTTTTAATCCAGAAA
TTAAACCACCTTTATTACTTAAATCAAGAAAATTATTTTCAATTTCTCAACCAAAACAAT
AAATAAATAAAAATAATAAAAATAAAAAAAAATTATTATTTCTGTAAATAATTTATAAAA
ATGTGTATATATTGTTTTTTTTTTTATTTATTTATTTAATTTTTTAATTTTTAAATTTTT
TAATTTTTTATTTAATTATTTAATTATTTAATTATTTAATTATTT

>Contig_4571, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 2625

  Plus Strand HSPs:

 Score = 1330 (473.2 bits), Expect = 1.5e-144, Sum P(2) = 1.5e-144
 Identities = 264/273 (96%), Positives = 265/273 (97%), Frame = +3

Query:     1 MLKLLSRVYLGSSDEAYHCFKANYKKFNKSYLDFFHYLRIPFPTLLKIPSKFLRKYIKNK 60
             MLKLLSRVYLGSSDEAYHCFKANYKKFNKSYLDFFHYL   FPTLLKIPSKF RKYIKNK
Sbjct:   996 MLKLLSRVYLGSSDEAYHCFKANYKKFNKSYLDFFHYL---FPTLLKIPSKFSRKYIKNK 1166

Query:    61 NKRALYQVLAMKAYYGVVKQSSDDHIEESMINIIAETSYNDKEGLSLEEIKMPSYLLNAS 120
             NKRALYQVLAMKAYYGVVKQSSDDHIEESMINIIAETSYNDKEGLSLEEIKMPSYLLNAS
Sbjct:  1167 NKRALYQVLAMKAYYGVVKQSSDDHIEESMINIIAETSYNDKEGLSLEEIKMPSYLLNAS 1346

Query:   121 SIKGPMIMVENLMFQLIEKSEIESKIRKEIKLVFEKNGKDVNSFDFDDIMEMKYLEATLD 180
             SIKGPMIMVENLMFQLIEKSEIESKIRKEIKLVFEKNGKDVNSFDFDDIMEMKYLEATLD
Sbjct:  1347 SIKGPMIMVENLMFQLIEKSEIESKIRKEIKLVFEKNGKDVNSFDFDDIMEMKYLEATLD 1526

Query:   181 EINRLYPPFPKLMPRQTKESDRILGYHIPKGTMISCPVADILRDPSNFQDPLTFKPERQL 240
             EINRLYPPFPKLMPRQTKESDRILGYHIPKGTMISCPVADILRDPSNFQDPLTFKPERQL
Sbjct:  1527 EINRLYPPFPKLMPRQTKESDRILGYHIPKGTMISCPVADILRDPSNFQDPLTFKPERQL 1706

Query:   241 IFSNPKFASPSITSIQEINGLSSSSS--IHLHY 271
             IFSNPKFASPSITSIQEINGLSSSSS    LH+
Sbjct:  1707 IFSNPKFASPSITSIQEINGLSSSSSNSFALHH 1805

 Score = 97 (39.2 bits), Expect = 1.5e-144, Sum P(2) = 1.5e-144
 Identities = 17/17 (100%), Positives = 17/17 (100%), Frame = +1

Query:   267 IHLHYIIDHYHQLTTTT 283
             IHLHYIIDHYHQLTTTT
Sbjct:  1786 IHLHYIIDHYHQLTTTT 1836

>Contig_4571, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 2625

  Plus Strand HSPs:

 Score = 1245 (443.3 bits), Expect = 4.0e-269, Sum P(4) = 4.0e-269
 Identities = 245/256 (95%), Positives = 247/256 (96%), Frame = +3

Score = 845 (302.5 bits), Expect = 4.0e-269, Sum P(4) = 4.0e-269
 Identities = 158/163 (96%), Positives = 158/163 (96%), Frame = +1

Query:     1 MFLTSILYTIIIILIFYKGLE 21
             MFLTSILYTIIIILIFYKGLE
Sbjct:   327 MFLTSILYTIIIILIFYKGLE 389

Query:    14 LIFYKGLEYLIEKRSFPLVHPIKGVMNGTKPYFIFGDLPFQRLLNLKPKLKELGSIYFRW 73
             L FY    YLIEKRSFPLVHPIKGVMNGTKPYFIFGDLPFQRLLNLKPKLKELGSIYFRW
Sbjct:   460 LFFY--F*YLIEKRSFPLVHPIKGVMNGTKPYFIFGDLPFQRLLNLKPKLKELGSIYFRW 633

Query:    74 FFWYPIVEIKDINAIQYVYNEKSNNYSLYWNLNKSSNFILTGSEIKRFFRIYYWAFNCKD 133
             FFWYPIVEIKDINAIQYVYNEKSNNYSLYWNLNKSSNFILTGSEIKRFFRIYYWAFNCKD
Sbjct:   634 FFWYPIVEIKDINAIQYVYNEKSNNYSLYWNLNKSSNFILTGSEIKRFFRIYYWAFNCKD 813

Query:   134 SLQRIMPVIKSQVFDFIHSHKFQNSTLSTNDDVTNVIFFNYYY 176
             SLQRIMPVIKSQVFDFIHSHKFQNSTLSTNDDVTNVIFFNYYY
Sbjct:   814 SLQRIMPVIKSQVFDFIHSHKFQNSTLSTNDDVTNVIFFNYYY 942

Query:   168 NVIFF-NYYYMLKLLSRVYLGSSDEAYHCFKANYKKFNKSYLDFFHYLRIPFPTLLKIPS 226
             N  FF +  +MLKLLSRVYLGSSDEAYHCFKANYKKFNKSYLDFFHYL   FPTLLKIPS
Sbjct:   966 NFFFFISKKFMLKLLSRVYLGSSDEAYHCFKANYKKFNKSYLDFFHYL---FPTLLKIPS 1136

Query:   227 KFLRKYIKNKNKRALYQVLAMKAYYGVVKQSSDDHIEESMINIIAETSYNDKEGLSLEEI 286
             KF RKYIKNKNKRALYQVLAMKAYYGVVKQSSDDHIEESMINIIAETSYNDKEGLSLEEI
Sbjct:  1137 KFSRKYIKNKNKRALYQVLAMKAYYGVVKQSSDDHIEESMINIIAETSYNDKEGLSLEEI 1316

Query:   287 KMPSYLLNASSIKGPMIMVENLMFQLIEKSEIESKIRKEIKLVFEKNGKDVNSFDFDDIM 346
             KMPSYLLNASSIKGPMIMVENLMFQLIEKSEIESKIRKEIKLVFEKNGKDVNSFDFDDIM
Sbjct:  1317 KMPSYLLNASSIKGPMIMVENLMFQLIEKSEIESKIRKEIKLVFEKNGKDVNSFDFDDIM 1496

Query:   347 EMKYLEATLDEINRLYPPFPKLMPRQTKESDRILGYHIPKGTMISCPVADILRDPSNFQD 406
             EMKYLEATLDEINRLYPPFPKLMPRQTKESDRILGYHIPKGTMISCPVADILRDPSNFQD
Sbjct:  1497 EMKYLEATLDEINRLYPPFPKLMPRQTKESDRILGYHIPKGTMISCPVADILRDPSNFQD 1676

Query:   407 PLTFKPERQLIFSNPK 422
             PLTFKPERQLIFSNPK
Sbjct:  1677 PLTFKPERQLIFSNPK 1724

Query:   420 NPKIIKNLPWGIGSKKCLGKELAKLIVKTIIVILYSQYTFDKHLDENDEELCDTNNQPKI 479
             N +IIKNLPWGIGSKKCLGKELAKLIVKTIIVILYSQYTFDKHLDENDEELCDTNNQPKI
Sbjct:  2199 NQRIIKNLPWGIGSKKCLGKELAKLIVKTIIVILYSQYTFDKHLDENDEELCDTNNQPKI 2378

Query:   480 QITFNPEIKPPLLLKSRKLFSISQPKQ 506
             QITFNPEIKPPLLLKSRKLFSISQPKQ
Sbjct:  2379 QITFNPEIKPPLLLKSRKLFSISQPKQ 2459

>Contig_4571_1 D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion translate frame +1 translate plus frames translate all frames
                              KTPIKIYLLWAIKKKLIKKKII*NIIYL*KFLNFQY*NNI*IIFVFFFYLYCFFFFFYLF
                              FFFFYIIIF*FIFKSNNNNSNNNNFFFFFFLKTPI*NNKNNNK***FKKCF*QVYYIQL*
                              *F*YSIKV*K*IIFN*LKFIYFFFFFLKYNF*HLFFYF*YLIEKRSFPLVHPIKGVMNGT
                              KPYFIFGDLPFQRLLNLKPKLKELGSIYFRWFFWYPIVEIKDINAIQYVYNEKSNNYSLY
                              WNLNKSSNFILTGSEIKRFFRIYYWAFNCKDSLQRIMPVIKSQVFDFIHSHKFQNSTLST
                              NDDVTNVIFFNYYY*YY*YYLLIFFFLFRKSLC*NY*VEFI*VVVMKHIIALKQIIKNLI
                              RVI*TFFIIYFQHY*RYQVNSQENI*RIRIRELYIKC*Q*KLIMVLLNNQVMIILKNQ**
                              I**QRQVIMIKRGYLLKR*KCHPIY*MHPQLKAQ**W*RI*CFN**KNQRLKVKFVKRLN
                              LYLKRMVKM*IHLILMI*WK*NT*KQHWMKLIDYIHHFQN*CHVKLKNLIES*DIIYQRV
                              P*YHVLSLIYLEILPIFKTH*HLNQNVN*FSQILNLHHHRLLQFKKLMD*VVLLQIHLHY
                              IIDHYHQLTTTTTTTTTTTTTTTTTTTTTTTTTAITTASTVTIKTIIEIVFNLLIIQHLK
                              NHFYLIVPQLLII*LVQIVLIN*N*IL*MKIAILIIIIIKIY*MYQI*LKLIKIIQFQII
                              NIILIII*QLKKEIKELLKIYHGVLVVKNV*VKNWQN***KQ*LSYFILNILLINISMKM
                              MKNYVILIINLKFK*LLIQKLNHLYYLNQENYFQFLNQNNK*IKIIKIKKNYYFCK*FIK
                              MCIYCFFFYLFI*FFNF*IF*FFI*LFNYLII*LF
                              >Contig_4571_2 D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion translate frame +1 translate plus frames translate all frames
                              KPLLKFIYYGLLKKN*LKKKLFEILFIYKNF*IFNIKIIFK*YLFFFFIYIVFFFFFIYF
                              FFFFIL*FFNLYLKVIIIIVIIIIFSFSFF*KPPYKIIKIIINNSNLRNVFNKYIIYNYN
                              NFNIL*RFRSKLFLIN*NSYIFFFFF*NIISNIYFFIFSI*LKKDHSL*YTQLKVL*MVQ
                              NHILYLGIYHFKDY*TLNQN*KN*EVFILDGSFGIQ*LKLKI*MQFNMFTMKSQIIIVFI
                              GI*INLVISF*QVQKLKDSLEFIIGHSIVKIHYKELCQL*NHKFLISFILTNFKIQHYQQ
                              MMMLQM*FFSIIIINIINIIY*FFFFYFEKVYVKIIK*SLFR****SISLL*SKL*KI**
                              ELFRLFSLFISNIIKDTK*ILKKIYKE*E*ESFISSVSNESLLWCC*TIK**SY*RINDK
                              YNSRDKL***RGVIS*RDKNAILFIKCILN*RPNDNGREFNVSIDRKIRD*K*NS*RD*T
                              CI*KEW*RCKFI*F**YNGNEILRSNIG*N**IISTISKINATSN*RI**NLRISYTKGY
                              HDIMSCR*YT*RSFQFSRPINI*TRTSINFLKS*ICITIDYFNSRN*WIK*FFFKFICIT
                              S*IITIN*QQQQQQQQQQQQQQQQQQQQQQQQQQ*QQHQR*Q*KQ**KLYSIF**FST*K
                              IIFI**FLNY**YSWYKSC**IKIRFFK*K*QY******RFIKCTKYN*N**K*SNFK**
                              I*FL**FNN*RKKSKNY*KFTMGYW**KMFR*RIGKINSKNNNCHTLFSIYF**TSR*K*
                              *RIM*Y**ST*NSNNF*SRN*TTFIT*IKKIIFNFSTKTINK*K**K*KKIIISVNNL*K
                              CVYIVFFFIYLFNFLIFKFFNFLFNYLII*LFNY
                              >Contig_4571_3 D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion translate frame +1 translate plus frames translate all frames
                              NPY*NLFIMGY*KKIN*KKNYLKYYLFIKIFKFSILK*YLNNICFFFLSILFFFFFLSIF
                              FFFLYYNFLIYI*K*********FFLFLFFKNPHIK**K***IIVI*E MFLTSILYTIII
                              ILIFYKGLE VNYF*LIKIHIFFFFFFEI*FLTFIFLFLVFN*KKIIPFSTPN*RCYEWYK
                              TIFYIWGFTISKIIKP*TKIKRTRKYLF*MVLLVSNS*N*RYKCNSICLQ*KVK*L*SLL
                              EFK*I**FHFNRFRN*KIL*NLLLGIQL*RFITKNYASYKITSF*FHSFSQISKFNIINK
                              **CYKCNFFQLLLLILLILFINFFFFISKK FMLKLLSRVYLGSSDEAYHCFKANYKKFNK
                              SYLDFFHYLFPTLLKIPSKFSRKYIKNKNKRALYQVLAMKAYYGVVKQSSDDHIEESMIN
                              IIAETSYNDKEGLSLEEIKMPSYLLNASSIKGPMIMVENLMFQLIEKSEIESKIRKEIKL
                              VFEKNGKDVNSFDFDDIMEMKYLEATLDEINRLYPPFPKLMPRQTKESDRILGYHIPKGT
                              MISCPVADILRDPSNFQDPLTFKPERQLIFSNPKFASPSITSIQEINGLSSSSSNSFALH
                              HRSLPSINNNNNNNNNNNNNNNNNNNNNNNNNSNNNSINGNNKNNNRNCIQSFNNSALKK
                              SFLSDSSSIIDNIVGTNRVDKLKLDSLNENSNINNNNNKDLLNVPNIIEINKNNPISNNK
                              YNSYNNLTIEERNQRIIKNLPWGIGSKKCLGKELAKLIVKTIIVILYSQYTFDKHLDEND
                              EELCDTNNQPKIQITFNPEIKPPLLLKSRKLFSISQPKQ*INKNNKNKKKLLFL*IIYKN
                              VYILFFFLFIYLIF*FLNFLIFYLII*LFNYLII


>CYP508A2 dd_00050 chr2 genome assembly 6aa diffs
MIFGIIVYLFLIYILHNAYSKYKRLNENQLPGPFPIPILGNIYQLTNLPHFDLTKMSEKY
GKIFRIYLADLYTVIVCDPIIARELFVDKFDNFIDRPKIPSVKHGTFYHGTVASMGDNWK
NNKEIVGKAMRKTNLKHIYQLLDDQVDVLIESMRTIESSGETFDPRYYLTKFTMSAMFKY
IFNEDISKDEDVHNGQLAQLMKPMQKVFKDFGTGSLFDVLEITRPLYFLYLEWFTSHYYQ
VINFGKMKIYKHLETYKPDVQRDLMDLLIKEYGTETDDQILSISATVSDFFLAGVDTSAT
SLELIVMMLINYPEYQEKAYNEIKSALSSNGGGGGGGLTQRNKVLLSDRQSTPFVVSLFK
ETLRYKPISPFGLPRSTTSDIILNNGQFIPKNAQILINYHALSRNEEYFENPNQFDPTRF
LNSDSNPAFMPFSIGPRNCVGSNFAQDEIYIALSNMILNFKFKSIDGKPVDETQTYGLTL
KPNPFKVILEKRK

Query:     1 MIFGIIVYLFLIYILHNAYSKYKRLNENQLPGPFPIPILGNIYQLTNLPHFDLTKMSEKY 60
             MIFGII YLFLIYILHNAYSKYKRLNENQLPGPFPIPILGNIYQLTNLPHFDLTKMSEKY
Sbjct:     1 MIFGIIGYLFLIYILHNAYSKYKRLNENQLPGPFPIPILGNIYQLTNLPHFDLTKMSEKY 60

Query:    61 GKIFRIYLADLYTVIVCDPIIARELFVDKFDNFIDRPKIPSVKHGTFYHGTVASMGDNWK 120
             GKIFRIYLADLYTVIVCDPIIARELFVDKFDNFIDRPKIPSV +GTFYHGTVASMGDNWK
Sbjct:    61 GKIFRIYLADLYTVIVCDPIIARELFVDKFDNFIDRPKIPSVNNGTFYHGTVASMGDNWK 120

Query:   121 NNKEIVGKAMRKTNLKHIYQLLDDQVDVLIESMRTIESSGETFDPRYYLTKFTMSAMFKY 180
              +KEIVGKAMRKTNL+HIYQLLDDQVDVLIESMRTIESSGETFDPRYYLTKFTMSAMFKY
Sbjct:   121 KHKEIVGKAMRKTNLEHIYQLLDDQVDVLIESMRTIESSGETFDPRYYLTKFTMSAMFKY 180