Gene CNB05180 details


Gene Information

Locus tag: CNB05180
Symbol:
ID: 3255898
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006684
Strand:
Start bp: 1473634
End bp: 1476575
Gene Length: 2942 bp
Protein Length: 803 aa
Translation table:
GC content: 49%
IMG OID: 638255162
Product: hypothetical protein
Protein accession: XP_569266
Protein GI: 58264220
COG category: [K] Transcription
COG ID: [COG5169] Heat shock transcription factor
TIGRFAM ID:
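
The start, end, and gene length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive; the record does not state its coordinate convention, so that is an inference from the numbers. The function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Return the length of a feature from its start and end coordinates.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record itself).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values taken from the Start bp and End bp fields of this record.
print(span_length(1473634, 1476575))  # 2942, matching the Gene Length field
```

Because the gene is marked as spliced, this genomic span presumably includes introns, which would explain why it is longer than the 2409 coding bases needed for 803 amino acids.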


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.349279
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTCTT TCTCCTACCT TCCTCCGCCT CCTCGTGAGC CCAACTTGGC TCTTTTTCAA 
CCGTCGTCCT ACCCTCCACC ATCGGCATCT CCTTTTGGCA GACCCAACCT CCCACATCAA
ACCTCGCTTC ATCATCAACA CAACAACAGC AATGGCAGCT CTTACCAACC AGATAACAGC
CCAGTTATAT TAGACGACTG GGATCCCAAA TCACCTGTCT CCCAATCTAG AACTTATGTC
TCACCCGATC CACCTCCTAG GGATGAGGAT GTCTTCCCAC CACCTTACGA GCATCACCAT
TCAAACGTCA ACCAACATGA GCAATCACAT TCACAATCAT ATCCACCAAG ACGGCCCATT
TCTGTTCCAG CAGCGTACTC GGCTTCCACA TCATCGTCAT CTATGCGCAA AGTGCTGTTC
CCGCATCATC ATGGATTAAT AGCGTCTTCT TCAAACATGC CTGCAGGGAG TGATGGTGTC
AGAGATGGAG GGTTGCATAT TAGTATAGGA AAAGGTGATA TGACGCCTGA GATGGACCCG
TTTTACAATC CTATTATGTC TCCGGTTCCT CAGCAATCGT CGAAAAAGAA GAAGGCCAGG
AAGACAGAGG GGAAACAACC GACTTTCTTG ATAAAGCTCT ATTCGTGAGT GATTAATATA
GTCATAGGCT GTCATATGGG TGACTGACGG ATTTTGCAGG TTACTGTGAG TAGTACTTGG
TAAGGCATAA GTGATGTTAG CTGACTCGAC TATCAAGGTC TCAACCCGAA TATAGTCATG
TCAGTTCCGC TTTTGCATTC AGTTTCACGG AATAAATATT GATGCTAGCT GCAGATCATC
CGATGGGACG AAACAGGTGA ATTAATAATC ATCGAAAATC CTGAAGAGCT GGCAGACAAG
ATCTTACCTG TGGTATATCG ACAAAGCAGG TTCGCAAGCT TCTCACGACA ACTCAATGTA
AGTTTCACAC CTGTTTCAGA GGGTAAGGCT GATGGAATGA CAGATTTACG GATTCAACAG
AAAGCTTAGC TTGAGGAATG TCGAAAAAGG CATCTGCGAC CCCGATGCCA GCAGTTGGTG
TAAGTAACCT TTTATCGCTG CACTGAATCA TCACTTGAAC TTATGTTCAA ACGCAGCTCA
TCCCTTCCTT CGGCGAGATT CGACCAAACA AGAGATTACC TCTTTCAAAC GTCGCGTTCC
TCCTCGCCCC TCTCAAGCCC AAAAACGTCG CATGTCGATG GGCCTTGGCA TCGGCATCAA
GCCATCTTAC GCCGGCGCCT CCAACTGCGA AGATCAAGCC TCACCCACAT CTTCCGAACG
CTCGCTTGAT TGGCAATCAC CTCCAGACCC TTATCGACAT CACCTCCTAC CGGACGTTGA
CGAGGAAGCG CCGTTTGTAT TTCCAACAAG GGATTACTTT GGTATGGCCG GCCATGCGCC
TATGGAGTAC GAAGGATGGA AACATAATGG TACAGCGGCA GGGAATTTAC AAGGTGCGGT
AGGACAAGTG GAAGAGGGAT TTTCGCCTAC GATGTCAATT CATTTTGATT ATGGGATACC
ACCTGATAGA AATAGGTTGG ATGGGTGTAT GGCGTCGAGT CAAGTCCGGC ATGGAGGAAG
CCCGAAAGGA CTTGCGATCA ATATTCCCTG CTCATCGCAC CTTCCTAACC ATCACACTCA
ACAGCAGGTT CAACCATCTC TCCCACTTTT ATGTCAACCA GCATCTTTCT CTCTACTCAC
CCAGAAATCA CCAACAAATA TTGTTCCACA GAGCGCACCA GCCAATACAG GCTCGTTTCC
CATTCCTATA CAGGTGACGC AGCAGCACAT CCGTACTCGG AGCGTACAGG GTGAACCTCC
AAGTGCTATG TTGTTCTCCC CATTCGGTGA AGAGTTGGGG GAAGTCCCTG GACCTGCTGG
ATTTTCTCAG AGTCTCAATG GTACAAGAGG TTACAGACGT CAAGCGGGAA ATATGGGCCA
GCCTGCTATT CTGGCAGCAC CGATTCTTGA TCCTTCTGAT CCATCTACCT GGGCTCGACG
CGGGTTCATA GACCTCACCA CGGCAGGCGC TTCCAACCCG CTCCCATTCA ACCCAGTGCC
AATTTCTGCA AGCCACGCAA CCTCGCCCCA CTCTTTACCT ACTAATCTGA ATTCCTTGCA
CCAACAGCAG ACTGCGTCAC CCTCGGAATT GATGGGTCAG TCAATGAGCG CTGCACTGAG
CGATGATTCG CCAACAACGG TCTCGCCAGG GATATACCAA TTGGGCTTTT CATTGCCTGC
CTATCCACCT TTGAAGAGAC ACATCTCGTC GCTTAACCCA CCTGTAAGCG CTCACCTCAA
CCCGACTGCA AACATGACCT CTAACACTGG TCTCGCAAAG TCGCCTGACA TGGCTAACGG
GAATAATGTC CGTCTTTCAG CTATCATCCA GACGAAACAA GATCGACGGC AGTCAATCAG
TGCAAGCCCA TATCCGCATT CTGCGCAGTC TCCAAGGCAA AGGCCAGGGG TGCTTAATGC
GAGTGATAAC GTGGGTGGGA ATGGGTCTTG GACGGGGATG AATGGAGGTT CGTTGCGGAT
GATAGGATGT TCGGGTCGGG GTAGTGAAGG AGGCAGTTCG GCAGTTGATG TAGGTAATGT
TGACGGTGGT GAAAGGAAGC AGAATGGGCA TTCGTCTTCT TGAGCTGGTG GAGGTTTTCA
ATGGGTTGAT GGTTTGCGTA CTGGTTGCGA GAATCTTCCC GCTACTGTAT CCATATGTTT
ATATCCTCTT CGACTTTACA TTTCACATTC CTTTTATTAC GATTTCGAAT TAACAGCCAG
AAATAATGTC GATAGCCAAT TTAGTCAACG TTTGTCAGAC AGGAAAGATG GCCGCCCGTC
ACGAGACCCT CAGAGTTGTA CATTGTGTTA GATCCCTCGT ATCTTATTGC TTTGCGTATG
TT
 
Protein sequence
MNSFSYLPPP PREPNLALFQ PSSYPPPSAS PFGRPNLPHQ TSLHHQHNNS NGSSYQPDNS 
PVILDDWDPK SPVSQSRTYV SPDPPPRDED VFPPPYEHHH SNVNQHEQSH SQSYPPRRPI
SVPAAYSAST SSSSMRKVLF PHHHGLIASS SNMPAGSDGV RDGGLHISIG KGDMTPEMDP
FYNPIMSPVP QQSSKKKKAR KTEGKQPTFL IKLYSSQPEY SHIIRWDETG ELIIIENPEE
LADKILPVVY RQSRFASFSR QLNIYGFNRK LSLRNVEKGI CDPDASSWSH PFLRRDSTKQ
EITSFKRRVP PRPSQAQKRR MSMGLGIGIK PSYAGASNCE DQASPTSSER SLDWQSPPDP
YRHHLLPDVD EEAPFVFPTR DYFGMAGHAP MEYEGWKHNG TAAGNLQGAV GQVEEGFSPT
MSIHFDYGIP PDRNRLDGCM ASSQVRHGGS PKGLAINIPC SSHLPNHHTQ QQVQPSLPLL
CQPASFSLLT QKSPTNIVPQ SAPANTGSFP IPIQVTQQHI RTRSVQGEPP SAMLFSPFGE
ELGEVPGPAG FSQSLNGTRG YRRQAGNMGQ PAILAAPILD PSDPSTWARR GFIDLTTAGA
SNPLPFNPVP ISASHATSPH SLPTNLNSLH QQQTASPSEL MGQSMSAALS DDSPTTVSPG
IYQLGFSLPA YPPLKRHISS LNPPVSAHLN PTANMTSNTG LAKSPDMANG NNVRLSAIIQ
TKQDRRQSIS ASPYPHSAQS PRQRPGVLNA SDNVGGNGSW TGMNGGSLRM IGCSGRGSEG
GSSAVDVGNV DGGERKQNGH SSS
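
Both sequences above are displayed in space-separated blocks of ten characters across multiple lines. The following is a minimal sketch of how such a block could be joined into one contiguous string and how a GC fraction could be computed from it, for comparison with the Gene Length, Protein Length, and GC content fields. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the GC calculation assumes a plain count of G and C over all characters, which may differ from how the source database computes its figure.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of characters in seq that are G or C.

    Assumes seq is a nucleotide string with whitespace already removed.
    """
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, copied from this record.
first_line = (
    "ATGAATTCTT TCTCCTACCT TCCTCCGCCT CCTCGTGAGC CCAACTTGGC TCTTTTTCAA"
)
joined = join_blocks(first_line)
print(len(joined))  # 60
```

Applying `join_blocks` to the complete gene or protein block and taking `len` of the result gives values that can be compared against the 2942 bp and 803 aa stated in the Gene Information fields.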