Cryptococcus neoformans P450s

(several serotypes)

 

D. Nelson

 

August 9, 2009

 

8 sequences + 5 duplicates

 

See the genome manuscript at Science 307, 1321-1324 (2005) [25 Feb. issue]

 

>CYP51F1 Cryptococcus neoformans var. neoformans B-3501A chromosome 1

56% to CYP51F1 Phanerochaete chrysosporium

AAEY01000001

CNBA0300"

ESTs gb|CF192489.1|CF192489,

gb|CF192488.1|CF192488, gb|CF191439.1|CF191439;

EAL23379.1"

MSAIIPQVQQLLGQVAQFFPPWFAALPTSLKVAIAVVGIPALII

GLNVFQQLCLPRRKDLPPVVFHYIPWFGSAAYYGEDPYKFLFECRDKYGDLFTFILMG

RRITVALGPKGNNLSLGGKISQVSAEEAYTHLTTPVFGKGVVYDCPNEMLMQQKKFIK

SGLTTESLQSYPPMITSECEDFFTKEVGISPQKPSATLDLLKAMSELIILTASRTLQG

KEVRESLNGQFAKYYEDLDGGFTPLNFMFPNLPLPSYKRRDEAQKAMSDFYLKIMENR

RKGESDHEHDMIENLQSCKYRNGVPLSDRDIAHIMIALLMAGQHTSSATSSWTLLHLA

DRPDVVEALYQEQKQKLGNPDGTFRDYRYEDLKELPIMDSIIRETLRMHAPIHSIYRK

VLSDIPVPPSLSAPSENGQYIIPKGHYIMAAPGVSQMDPRIWQDAKVWNPARWHDEKG

FAAAAMVQYTKAEQVDYGFGSVSKGTESPYQPFGAGRHRCVGEQFAYTQLSTIFTYVV

RNFTLKLAVPKFPETNYRTMIVQPNNPLVTFTLRNAEVKQEV

 

>CYP51F1 56% to CYP51F1 white rot

taken from the Genbank chromosome entries

AAEY01000001          945048 bp    DNA     linear   PLN 13-JUL-2004

DEFINITION  Cryptococcus neoformans var. neoformans B-3501A chromosome 1

CDS             join(94307..94462,94559..94798,94853..94930,94980..95351,

95399..95571,95626..95752,95807..95820,95867..96284,

96343..96417)

/locus_tag="CNBA0300"

/note="Match to ESTs gb|CF192489.1|CF192489,

gb|CF192488.1|CF192488, gb|CF191439.1|CF191439; Similar to

gi|7021480|gb|AAF35366.1| lanosterol 14 alpha-demethylase

[Filobasidiella neoformans], FASTA scores: opt: 3704, E():

0, (100.000% identity (100.000% similar) in 550 aa overlap

(1-550:1-550)); HMMPfam hit to p450, Cytochrome P450,

score: 185.7, E(): 9.4e-53"

/codon_start=1

/product="hypothetical protein"

/protein_id="EAL23379.1"

/db_xref="GI:50260729"

/translation="MSAIIPQVQQLLGQVAQFFPPWFAALPTSLKVAIAVVGIPALII

GLNVFQQLCLPRRKDLPPVVFHYIPWFGSAAYYGEDPYKFLFECRDKYGDLFTFILMG

RRITVALGPKGNNLSLGGKISQVSAEEAYTHLTTPVFGKGVVYDCPNEMLMQQKKFIK

SGLTTESLQSYPPMITSECEDFFTKEVGISPQKPSATLDLLKAMSELIILTASRTLQG

KEVRESLNGQFAKYYEDLDGGFTPLNFMFPNLPLPSYKRRDEAQKAMSDFYLKIMENR

RKGESDHEHDMIENLQSCKYRNGVPLSDRDIAHIMIALLMAGQHTSSATSSWTLLHLA

DRPDVVEALYQEQKQKLGNPDGTFRDYRYEDLKELPIMDSIIRETLRMHAPIHSIYRK

VLSDIPVPPSLSAPSENGQYIIPKGHYIMAAPGVSQMDPRIWQDAKVWNPARWHDEKG

FAAAAMVQYTKAEQVDYGFGSVSKGTESPYQPFGAGRHRCVGEQFAYTQLSTIFTYVV

RNFTLKLAVPKFPETNYRTMIVQPNNPLVTFTLRNAEVKQEV

 

>CYP61A1

Cryptococcus neoformans var. neoformans B-3501A chromosome 6

61% to CYP61A1   Ustilago maydis

AAEY01000030

CNBF1100"

ESTs gb|CF191501.1|CF191501,gb|CF188802.1|CF188802, gb|CF194425.1|CF194425

EAL20298.1

MESHTVLRPTAIPDLAAIKTWGLEGLTKAKFSFDTKTTTATILT

LILSLLVLEQLVYRAKKAHLPGAKWTIPVIGKFADSLNPTLANYKAQWNSGPLSAVSV

FNIFIVIGSSNEMARKILNSPNHAEPCLVASAKKVLLPENWVFLHGKVHADYRKALNV

LFTKQALSIYLPIQEKIYRSYFNKWMSDPAPAQQYMMKMRDLNMDTSLSVFIGPYLTE

AQKQEINDKYWLITISLELVNFPLAIPGTKVYNAIQARKIVMKYLSAASAASKIRMED

DDAEPECLLDHWVRAMILARRAKDDGEQTRLLSREYSDHEIAMVLLSFLFASQDAMSS

ALVYTFQLTADHPEVLEKVREEQYRVRGNDLERPLTLDMLDDMVYTRATIKEVLRFRP

PVIMVPYMTTKPFPVSPEYTAPKNSMIIPAFWNSLHDETCYPEPDRFLPERWLPQADG

SAPIADSKPQNYLVWGSGPHKCIGGQYASMHLAATLGTASVLMDWKHERTELSDEVQV

IAAIFPKDHCLLKFTPRAPPS

 

>CYP61A1

AAEY01000030          650090 bp    DNA     linear   PLN 13-JUL-2004

CYP61 sequence most like white rot CYP61

DEFINITION  Cryptococcus neoformans var. neoformans B-3501A chromosome 6

CDS             complement(join(343289..343346,343418..343761,

343818..344502,344552..344740,344796..345005,

345059..345162))

/locus_tag="CNBF1100"

/note="Match to ESTs gb|CF191501.1|CF191501,

gb|CF188802.1|CF188802, gb|CF194425.1|CF194425; HMMPfam

hit to p450, Cytochrome P450, score: 43.1, E(): 2.4e-13"

/codon_start=1

/product="hypothetical protein"

/protein_id="EAL20298.1"

/db_xref="GI:50257593"

/translation="MESHTVLRPTAIPDLAAIKTWGLEGLTKAKFSFDTKTTTATILT

LILSLLVLEQLVYRAKKAHLPGAKWTIPVIGKFADSLNPTLANYKAQWNSGPLSAVSV

FNIFIVIGSSNEMARKILNSPNHAEPCLVASAKKVLLPENWVFLHGKVHADYRKALNV

LFTKQALSIYLPIQEKIYRSYFNKWMSDPAPAQQYMMKMRDLNMDTSLSVFIGPYLTE

AQKQEINDKYWLITISLELVNFPLAIPGTKVYNAIQARKIVMKYLSAASAASKIRMED

DDAEPECLLDHWVRAMILARRAKDDGEQTRLLSREYSDHEIAMVLLSFLFASQDAMSS

ALVYTFQLTADHPEVLEKVREEQYRVRGNDLERPLTLDMLDDMVYTRATIKEVLRFRP

PVIMVPYMTTKPFPVSPEYTAPKNSMIIPAFWNSLHDETCYPEPDRFLPERWLPQADG

SAPIADSKPQNYLVWGSGPHKCIGGQYASMHLAATLGTASVLMDWKHERTELSDEVQV

IAAIFPKDHCLLKFTPRAPPS

 

>CYP5139B1 Cryptococcus neoformans var. neoformans B-3501A chromosome 6

AAEY01000032

CNBF3400

EAL20013.1

42% to CYP5139A1 Phanerochaete chrysosporium

yellow region is too long

MTMELLKVLHHGASQLFPNCIRSSPVACIVLYSFGGIAILLFTV

YLWLWPFQYAKLYFRNLPGPPSDSWFWGVVPTLIKSPPSVPHSMWTDEYGPTVRYRVA

LGAQRFLTIDPTALNYILSHADLFPKPSRVRKALSDLLGNELLTAEGHTHKKQRKALN

PSFSPAAVRGMIPVFYDKAYELKAKLLGIIEGDETEQASPTPCKEEDEVEGGKKIDVM

KYLGKTTLDVIGIVGFSYDFKALSEPRNELSEAYSKMF

QAGMDANFWDFLRGAIPLVNKLP

NKRATEIAARKAVTLRISKKIVEDKKREVMSAHSEGLEKREDIGDDLLSILIKAN

MASDVKPEQKLSDEEVLDQITTFMLAGNETSSTALTWILYSLTQHPECQKRLREEVLA

VPDDRPSLETLNNLPYMDAVIREALRLHAPAPGTMREAKEDTVIPLSMPVIGRDGKQI

DSVKINKGTMVFIPIITVNTSPAIWGPDARVFNPDRHFKTSSDSFGGANMHVPGVWGN

MLSFLGGARNCIGYKLALAEISTILFVLIRSFEFQELKSKPEVEKKASVVMRPRIKGE

ESAGLQMPLMVKPLLM

 

>CYP5139B1

AAEY01000032          550752 bp    DNA     linear   PLN 13-JUL-2004

Most like white rot fungal P450 on scaffold 7

DEFINITION  Cryptococcus neoformans var. neoformans B-3501A chromosome 6

CDS             join(207402..207588,207646..208292,208345..208404,

208451..208547,208598..208805,208860..209050,

209105..209381,209438..209519)

/locus_tag="CNBF3400"

/note="HMMPfam hit to p450, Cytochrome P450, score: 137.2,

E(): 3.7e-38"

/codon_start=1

/product="hypothetical protein"

/protein_id="EAL20013.1"

/db_xref="GI:50257304"

/translation="MTMELLKVLHHGASQLFPNCIRSSPVACIVLYSFGGIAILLFTV

YLWLWPFQYAKLYFRNLPGPPSDSWFWGVVPTLIKSPPSVPHSMWTDEYGPTVRYRVA

LGAQRFLTIDPTALNYILSHADLFPKPSRVRKALSDLLGNELLTAEGHTHKKQRKALN

PSFSPAAVRGMIPVFYDKAYELKAKLLGIIEGDETEQASPTPCKEEDEVEGGKKIDVM

KYLGKTTLDVIGIVGFSYDFKALSEPRNELSEAYSKMFQAGMDANFWDFLRGAIPLVN

KLPNKRATEIAARKAVTLRISKKIVEDKKREVMSAHSEGLEKREDIGDDLLSILIKAN

MASDVKPEQKLSDEEVLDQITTFMLAGNETSSTALTWILYSLTQHPECQKRLREEVLA

VPDDRPSLETLNNLPYMDAVIREALRLHAPAPGTMREAKEDTVIPLSMPVIGRDGKQI

DSVKINKGTMVFIPIITVNTSPAIWGPDARVFNPDRHFKTSSDSFGGANMHVPGVWGN

MLSFLGGARNCIGYKLALAEISTILFVLIRSFEFQELKSKPEVEKKASVVMRPRIKGE

ESAGLQMPLMVKPLLM

 

>CYP5139B2 Cryptococcus neoformans serotype A

CNAG_05842.1  94% to CYP5139B1

MELLKALHHETLQLFPDCIRSSSVACIVLYSLSGIAILLSTVYLWLWPFQYAKLHFRNLPGPPSDSWFWGVIPTLIKSPP

SVPHSMWTDEHGPTIRYRVALGAQRFLTIDPTALNYILSHADLFPKPSRVRKALSDLLGNGLLTAEGYTHKKQRKALNPS

FSPAAIRGMVPVFYDKAYELKAKLLSIIEGDETEQASPTPCKEEDEVEGGKKIDVMKYLGKTTLDVIGIVGFSYDFKALS

EPHNELSQAYSKMFQAGMDANFWDFLRGAIPLVNKLPNKRATEIAARKAVTLRIGKKIVEDKKREVMSAHSEGLEKREDI

GDDLLSILIKANMASDVKPEQKLSDKEVLDQITTFMLAGNETSSTALTWILYSLTQHPECQKRLREEVLAVPDDRPSLET

LNNLSYMDAVIREALRLHAPAPGTMREAKEDTIIPLSMPVTGRDGKQIDSVKINKGIMVFIPIMTVNTSPAIWGPDARVF

NPDRHLKASSNSFGGANMHVPGVWGNMLSFLGGARNCIGYKLALAEISTILFVLIRSFEFQELNSKPVVEKKASVVMRPR

IKGEESAGLQMPLMVKPLLM

 

>CYP5215A1 Cryptococcus neoformans var. neoformans B-3501A chromosome 2

AAEY01000006

CNBB0620

EAL22841.1

CF712065.1 = mRNA yellow is an insertion

32% to AAEY01000032, 32% to CYP5151A1

MLKQLSFQSPRTIEIIPLALVVLVVWLFRVFVYKPFTSPLKNVP

CPPGGTSSQGHIAEIMNLQGTEVNQWIKTYGSTFMVRGPFGVHHRIFTVDPRALSHVL

QHTNIYTKSDLLRGLVRRYMKEGLIVAEGERHKAQRKVSQKLFSMGGLKSMGQVVQDK

SNQLRDILLNLCTNPTASNPYSPVNRTLPPGSREVDVYSTASRCTFDLIGSIGVDHQF

DSIGDWEGSGGKLFHKYERMQLLCPGTTGFRMLLSLAWPLIDKIWVNNIFPLPSENTK

RVNDAMGSLEKFAKEKMIERQQELLTIDSKKGDVPDRGDLLTLMLRHNMTRKISPADK

LRDHEISGQLSTFMFAGSETTAGTISFGLCDLARHPDIQSRLRAEILECGDNLPFNQI

DELPYLDAVVKEIMRINPSLPGTVRQAQKDDIIPLAEPVILTNGKVVTDIHIRKGQL (0)

VHVPIEHLHTSEHIWGPTAKEFNPSRFLSTTQPSAFSG

RPTLASNPAATSSARRDAVPN

YVPEGPGIWPNFMTFIDGPRRCIGYKLAVMEIKTVIFTLLREFEIELMEGQHIFRWNM ()

MSNRPFVANTLRSKGSRLPLHFKLYKGEQQHEKTGGS*

 

>CYP5125A1

AAEY01000006          265665 bp    DNA     linear   PLN 13-JUL-2004

DEFINITION  Cryptococcus neoformans var. neoformans B-3501A chromosome 2

CDS join(176011..176192,176251..176557,176610..176930,

176967..177117,177172..177277,177339..177663)

/locus_tag="CNBB0620"

/note="HMMPfam hit to p450, Cytochrome P450, score: -6.0,

E(): 4.5e-11"

/codon_start=1

/product="hypothetical protein"

/protein_id="EAL22841.1"

/db_xref="GI:50260180"

/translation="MLKQLSFQSPRTIEIIPLALVVLVVWLFRVFVYKPFTSPLKNVP

CPPGGTSSQGHIAEIMNLQGTEVNQWIKTYGSTFMVRGPFGVHHRIFTVDPRALSHVL

QHTNIYTKSDLLRGLVRRYMKEGLIVAEGERHKAQRKVSQKLFSMGGLKSMGQVVQDK

SNQLRDILLNLCTNPTASNPYSPVNRTLPPGSREVDVYSTASRCTFDLIGSIGVDHQF

DSIGDWEGSGGKLFHKYERMQLLCPGTTGFRMLLSLAWPLIDKIWVNNIFPLPSENTK

RVNDAMGSLEKFAKEKMIERQQELLTIDSKKGDVPDRGDLLTLMLRHNMTRKISPADK

LRDHEISGQLSTFMFAGSETTAGTISFGLCDLARHPDIQSRLRAEILECGDNLPFNQI

DELPYLDAVVKEIMRINPSLPGTVRQAQKDDIIPLAEPVILTNGKVVTDIHIRKGQL (0)

Note this seq is short on the end, most like white rot P450s

Add this sequence (yellow too long may have to delete)

VHVPIEHLHTSEHIWGPTAKEFNPSRFLSTTQPSAFSG

RPTLASNPAATSSARRDAVPN

YVPEGPGIWPNFMTFIDGPRRCIGYKLAVMEIKTVIFTLLREFEIELMEGQHIFRWNM ()

MSNRPFVANTLRSKGSRLPLHFKLYKGEQQHEKTGGS*

 

>CYP5215A2 Cryptococcus neoformans serotype A

88% to CYP5215A1 CNAG_04029.1

MLKQLSFQPPTMIGSILLALVILVVWLLHVFVYKSFTSALKNVPCPPGGNGSQGHIGEIMNLQGIEVHQWIRTYGSTFMV

RGPFGVHHRIFTIDPRALSHVLQHTNIYTKSDLLRDLVRRYMKEGLIVAEGERHKVQRKVSQKLFSMGGLKSMGQIVQDK

SNQLRDILLNLCANPTASNPYSPVNRTLPPGSREVDVYSAASRCTFDLIGSIGVDHQFDSIADWEGSGGKLFHKYEQMQL

LCPGAMGIRMLLSLTWPLVDRIWVRIIAFHWYVLSPSENTKRVNDAMGSLEKFAKEKMIERQQELLTMDSKKGDVPDRRD

LLTLMLRHNMSRKISPADKLRDHEISGQLSTFMFAGSETTAGTISFGLYDLARHPDVQSRLRAEIFECGDNLPFDQIDEL

PYLDAVVKEIMRINPSLPGTVRQAQKDDIIPLAEPVTLTNGKVGTDIHIQKGQLVSIPVSYLILVSSLMGWINESPNCQ

 

>CYP5216A1 Cryptococcus neoformans var. neoformans B-3501A chromosome 3

34% to CYP63A4 Phanerochaete chrysosporium

AAEY01000013

CNBC2600

EAL22122.1

MKGGRFLLPALFKTLLLPPLIAALFIHHFPSLFPTFTALIYLIS

FPALYVFRSYASLVASSWKASALGAIDIPRVNGTWPLNIDILLNWAKSGTEEEVGRMM

VLMGRQYGGTYNTRVLGEDQIISSDPKVIKHVLIDDFDNFVKGQKFKERAQDFLGDGI

FNSDGDNWKFHRSFMRPFFQPKYISPLHFSTNVQNFFAKLPSYGKVFDMQAHIGQLAL

ELAIMWLCGEDMSRDTNNVAWTEEWKKAKGEISWAMTEAQKIVGKRVKIGTIWPLFEV

RHDPLERPMKVIRAFFRPIISRALNRKRQRCKLDNTEDVYMIDRLVEATDDIKLVEDQ

LINVLLASRDTLASLLTFSVYAIVLHPDIAVRLKDEVFTIANQECEVTKDTIRQLRYC

RAFINEVLRLFPPVPLNIRRTLRPSLLPTPNHLAYMPANTSIILATILMQRDPAVWGE

DAQVFNPDRWLEGGGLGKEKEGFMSWNLGPRMCLGQSFALAITHTFLVYFFRHIAHVG

TLVGHNTTVQLALDAQPPETVLPKEWMTPEGGDGRARGGRDKVWIVADVVLAIKGGLW

IRFGAEETE

 

>CYP5216A1

AAEY01000013         1122915 bp    DNA     linear   PLN 13-JUL-2004

Most like white rot P450s

DEFINITION  Cryptococcus neoformans var. neoformans B-3501A chromosome 3

CDS             complement(join(682935..682976,683028..683267,

683315..683534,683587..683638,683699..683837,

683893..683948,684005..684170,684228..684530,

684578..684721,684785..685150))

/locus_tag="CNBC2600"

/note="HMMPfam hit to p450, Cytochrome P450, score: 56.3,

E(): 5.8e-14"

/codon_start=1

/product="hypothetical protein"

/protein_id="EAL22122.1"

/db_xref="GI:50259449"

/translation="MKGGRFLLPALFKTLLLPPLIAALFIHHFPSLFPTFTALIYLIS

FPALYVFRSYASLVASSWKASALGAIDIPRVNGTWPLNIDILLNWAKSGTEEEVGRMM

VLMGRQYGGTYNTRVLGEDQIISSDPKVIKHVLIDDFDNFVKGQKFKERAQDFLGDGI

FNSDGDNWKFHRSFMRPFFQPKYISPLHFSTNVQNFFAKLPSYGKVFDMQAHIGQLAL

ELAIMWLCGEDMSRDTNNVAWTEEWKKAKGEISWAMTEAQKIVGKRVKIGTIWPLFEV

RHDPLERPMKVIRAFFRPIISRALNRKRQRCKLDNTEDVYMIDRLVEATDDIKLVEDQ

LINVLLASRDTLASLLTFSVYAIVLHPDIAVRLKDEVFTIANQECEVTKDTIRQLRYC

RAFINEVLRLFPPVPLNIRRTLRPSLLPTPNHLAYMPANTSIILATILMQRDPAVWGE

DAQVFNPDRWLEGGGLGKEKEGFMSWNLGPRMCLGQSFALAITHTFLVYFFRHIAHVG

TLVGHNTTVQLALDAQPPETVLPKEWMTPEGGDGRARGGRDKVWIVADVVLAIKGGLW

IRFGAEETE

 

>CYP5216A2 Cryptococcus neoformans serotype A

90% to CYP5216A1

MKGGRFLLPALFKTLLLPPLIAALFTHHFPSLSLSFTALIYLLSFPALYVFRSYTSLVASSWKASALGAIDIPRVRGTWP

LNIDVLLNWAKSGTADEVGRMMVLMVRQYGGTYNTRVLGEDQIISSDPRVIKHVLIDDFDNFVKGQKFKERAQDFLGDGI

FNSDGDNWKFHRSFMRPFFHPTYISPLHLATSVQNFFTRLPSYGEAFDIQARIGQLALELAMMWLCGEDMSAGASNVART

EEWKRAKGQISWAMTEAQKIVGKRVKIGTVWPLFELTHDPLERPMKVIRAFFRPIISEALNRKRQRCKLDNTEDVYMIDR

LVEATDDIKLVEDQLINVFLASRDTLASLLTFSAYAIVLHPDIAARLKDEVFTVANQESEVTKDTVRQLRYCRAFINEVL

RLFPPVPLNIRRTLRPSLLPTPNHLAYMPANTSIILATILMQRDPAVWGEDAQVFNPDRWLEGGLGKEREGFVSWNLGPR

MCLGQSLALTITHTFLVYFFRHIAHVGTLVGQNITTQLAPDAQPPETALPKEWMTLEGGDGRARGGRDRVWIVADVVLAI

KGGLWIRFGAEGTE