Gene CNA04600 details

Gene Information

Locus tag: CNA04600
Symbol:
ID: 3253314
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006670
Strand:
Start bp: 1221104
End bp: 1222867
Gene Length: 1764 bp
Protein Length: 441 aa
Translation table:
GC content: 52%
IMG OID: 638252780
Product: single-stranded DNA binding protein, putative
Protein accession: XP_566835
Protein GI: 58258845
COG category: [R] General function prediction only
COG ID: [COG0724] RNA-binding proteins (RRM domain)
TIGRFAM ID:
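
The coordinates and lengths listed above can be cross-checked with a short sketch. It assumes Start bp and End bp are 1-based, inclusive positions on the replicon (the record does not state the convention), and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1221104
end_bp = 1222867
protein_length_aa = 441

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Codons needed for the protein plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)              # 1764
print(gene_length_bp - coding_bp)  # 438
```

The 438 bp not accounted for by the coding sequence is consistent with the gene being spliced, though the record does not say how that remainder divides between introns and flanking untranslated sequence.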


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAGTAAAGTA TTATATATAA AATGGCCAAA TCCGCCAAAT CTGCTCCTGC TGCCACCGTC 
AAGGTTGACA AGAAGGACAA AAAAAGCAAG AAGGATGAGA AGCCCGTCCC TGCCCCCGCT
CCTGCCAAGG CTGCTAAGGT CGGTTTCTCC CTCTGTTGCG ATAACATACC TTTGATACTG
ACAGAGTTTA CAGAAGGATG TAAAGGAGAA GAAGGAGAAG AAATCCAAGA AGGCTAAGAC
CCCCACACCC CCTCCTGAGA GCTCTACCTC CGAGTCTGAG GACTCTGAGG AGGACTCTTC
CGACTCTGAA TCTTCTTCTT CCGAGGACGA AAAGCCTGCT GCTAAGACTG CTGCCCCTGC
CGCCAAGGTT TGTTATGCTT CTACATGTTT TTCTGAATGC CAATAACACA TTGCTTAGGC
TAAGGAGGAA GAGCCTTCTA CTGAAGAGTC CTCCGACTCT GACTCTGAGT CCGAGGAAGA
ATTCAAGACT GAGACTAAAA AAGAGGTTAA CGAAGAAGTA GGTCGCTACC AGTTTATTTA
AATAGAGAAG TTGTTGACGT TCTTACAGTC CGAATCCGAT TCTGACTCTG ACTCTGGTTC
TGACTCCAAC TCTTCTGAGG ATGAGGACGA AAAGGTGGAG GAGACCAAGG AAGAGGCTAA
GCCCCAGGCG AACGGTAAGC CAATATTATG CTATTTCGGA TTTAGACTCA GGCTAAAATT
ACCCAGGGAA CAAGCGAAAG GCCGAGGAAG AGTCTATTGC CCCTGCGAAG AAGGCCAGGG
CCGATGGCGG TGACGAGGAA GCCACTACCA ACGTTTTCGT CGGCCAGCTC TCTTGGAATG
TCGACAACGA CTGGCTCAAG TCCGAGTTCG AGTCTTGCGG TGAAGTTGTT TCTGCCCGAG
TCGTCTTCGA CCGTGACTCC CAAAAGTCTC GTGGTTTCGG CTACGTGGAA TTCGCCGACC
TTGAGTCCTC TGCCAAGGCT ATTGAGAAGG ACGGTTCTGA GATTGACGGC CGTGCCATCC
GTGTTAATTA CGCCACTCAG CGAAAGCCCA ACGAGGCCGC CGAGAAGCGT GCTAGGGTCT
TCAACGACAA GCAATCTCCT CCTGCCGAGA CTCTTTGGAT CGGTTCCCTT TCTTTCTCTG
TTACCGAGGA CCAGGTCTAC GAGGCATTCG GCCAACACGG TGACGTCCAG AGCGTTAGGC
TCCCCACCGA CAGGGACACT GGCGCCCCTA AAGGTTTCGG TTACGTCCAG TTCTCTTCCG
TTGACGACGC CTCTGCTGCT CTCAAGGCTA TGAACGGTGC CGAGATCGCT GGCCGTGCCA
TTAGGGTTGA CTTCGCTCCT CCTAAGCAGG ACAACGGTGA GAGAGGTGGT TTCGGTGGCG
GTCGTGGTGG CGGTGGCTTC GGCGGCCGTG GTGGTGGCCG AGGTGGCGGT AGAGGACGGG
GTGGTTTCGA CCGGGGTGGT AGGGGCGGTG GCCGTGGCCG TGGCGGTCCC CCTCGAGGGT
GAGTCTCTTG CTCATCTGCA TGTCCATTCT GACGCTGACT TTCCAACGTA GTGGTGCCCG
AACTGGCGGT ATTGTCAAGC CTGAGGGCCA GAAGGTTACT TTCGACTAGA CTGCAATAAT
GTAATACTTT CTTCCAGTGC TTCGAGTTTC CATTCATCTG TCAAATAGAG CTTTAATAGC
ATCTCCTATT AAAAGGATAC CCTTTCTTTG CAACTTGTTA TTCTTGGTTG AAATATTCGT
GTGTGGTATC TATTATGTGT AATG
 
Protein sequence
MAKSAKSAPA ATVKVDKKDK KSKKDEKPVP APAPAKAAKK DVKEKKEKKS KKAKTPTPPP 
ESSTSESEDS EEDSSDSESS SSEDEKPAAK TAAPAAKAKE EEPSTEESSD SDSESEEEFK
TETKKEVNEE SESDSDSDSG SDSNSSEDED EKVEETKEEA KPQANGNKRK AEEESIAPAK
KARADGGDEE ATTNVFVGQL SWNVDNDWLK SEFESCGEVV SARVVFDRDS QKSRGFGYVE
FADLESSAKA IEKDGSEIDG RAIRVNYATQ RKPNEAAEKR ARVFNDKQSP PAETLWIGSL
SFSVTEDQVY EAFGQHGDVQ SVRLPTDRDT GAPKGFGYVQ FSSVDDASAA LKAMNGAEIA
GRAIRVDFAP PKQDNGERGG FGGGRGGGGF GGRGGGRGGG RGRGGFDRGG RGGGRGRGGP
PRGGARTGGI VKPEGQKVTF D
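
The sequences above are printed in blocks of ten residues, so the whitespace has to be removed before any downstream use. A minimal sketch follows; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any tool referenced by this record.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Last line of the protein sequence above, as printed.
tail = clean_sequence("PRGGARTGGI VKPEGQKVTF D")
print(len(tail))  # 21

# Toy DNA example (not taken from the record).
print(gc_fraction(clean_sequence("GGCC AATT")))  # 0.5
```

Applying `clean_sequence` to the full gene and protein blocks should give strings whose lengths match the Gene Length and Protein Length fields, which is a quick way to confirm nothing was lost in copying.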