Gene CNC05330 details

Gene Information

Locus tag: CNC05330
Symbol:
ID: 3256204
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006685
Strand:
Start bp: 1592986
End bp: 1594360
Gene Length: 1375 bp
Protein Length: 186 aa
Translation table:
GC content: 47%
IMG OID: 638255751
Product: histone H1, putative
Protein accession: XP_569727
Protein GI: 58265142
COG category:
COG ID:
TIGRFAM ID:
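
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it reproduces the listed gene length), and that the coding sequence needs three nucleotides per residue plus one stop codon. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1592986
end_bp = 1594360
protein_length_aa = 186

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Minimum coding length: 3 nt per amino acid plus one stop codon.
min_cds_nt = (protein_length_aa + 1) * 3

# Whatever remains is non-coding within the gene span
# (for a spliced gene, plausibly introns and untranslated regions).
non_coding_nt = gene_length_bp - min_cds_nt

print(gene_length_bp, min_cds_nt, non_coding_nt)
```

Under these assumptions the computed span matches the listed gene length of 1375 bp, and well over half of it is non-coding, which fits the "Is gene spliced: Yes" flag.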


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAACTCTCAA CTTTTATTTT TTCAAGGTAA GCTGACGTCT CTTTCGTCTG CCAAAGGTTT 
ATAAATGCGC CAGACTACTT CTCCCTGTGA CTAGAAAGGC GACGCTAACA AATCCTTTCC
TTTTCATTGC CATCTCTTCC CCTATCAGCC ATTTGCCACT TCTCACAAAC AACTGTCATA
CTTGAATCAT CATGGCCCCT GTCAAGAAGA CTGCTGCTCC TCCCAGGAAG GCTACTACTC
ACCCAACTTT CCTCTCTATG ATCCAAGTAA GTTCATGTTT TGTCATTCGT GTATGGTTAA
GATTCTTTCC CACGAGATTG CGGGCATGTC TACATTTTGG CCCTTCACTG GTCCATCCTA
ATAGGAGTGG CAAAACGGCC TTTGAATGGC TGTTAATCGT CAATTGAGCC ATCTGGAGCG
ACTCTTCGTC GCATCTTTGC TCGAATAATT ATGACATTCC CATGCGGGGC TGCACAGGCG
ATCTTACCCG GTTACTTCCG CCGATGTAGA TCTCGCTTTG TTTTTGGTTA CCCAATGTAC
AATGCCAAGC GGAGGGAAAG CATCAATGAT TTATTTTTCT ATCCAGTTCA AGGGTTACTG
CTATTATGCG CAGTTCGCAT ATATGCATAC CCCACTGGCG ATAGATTTAA ATTTAAGGTG
TATAGCCATA ATCTTTATTC TGCCTGGAAA GATAAAAAAA ATAGAAGGGC TTGTGCTGAT
TACTTTCATC ACAGGAATGC ATCGCCCAGA ACAAAGGGGA TGCTCGAAAA GGTGTCTCTC
GACCTACTAT CAAGAAGTAA GTCGATCACA GAGCTTAGTG GGTGTCACGT CTGACTGGAG
GTTACAGATT CCTCGCCGAC AAGTACAAAC TCGACATGAG CTCCGCTGCC AACATCAGCA
ACTTATCGAA CGCCATCAAG CGGGGTGCTG AAAAGGGCCA GCTTACTCTT CCTAGTGGGA
TTGCTGGTCG AGTGAAGGCC GGTGCCAAAG TTAGTAAAGT TGCATTATCA GTTTCTGGAA
CATGCACTGA CTTATCTTGC AGAAGCCTGC TCTTGTTCAC AAGAAGTCGT CTGCTGGCAA
GGAGAACGTT GCTCCTAAGA AAGCTGCAAG TACCGAGACT AGAAAGCACG CAGTTAGGAA
GGGTGTTACT GCTCCCGCTG CCATCAAGGC CGTTCCCACG AAGAAGCCTG TGGTGAAGAA
GGTCGCTCCT GCTGTCAAAA AGACTTCCGC CTCTAAGAAA GTTCATACGC GTGAGTTATT
ACCGTTTGCC GAGTGTGTAT CCATGTTGAA CATTGTTATA GCCAAGGGCG CTGTTGTTGT
CCCTGAAAAG GCCGCCCCTA GGAAGAAGGC TGCTCCCAAG AAGGCAGCCG CCTAA
 
Protein sequence
MAPVKKTAAP PRKATTHPTF LSMIQECIAQ NKGDARKGVS RPTIKKFLAD KYKLDMSSAA 
NISNLSNAIK RGAEKGQLTL PSGIAGRVKA GAKKPALVHK KSSAGKENVA PKKAASTETR
KHAVRKGVTA PAAIKAVPTK KPVVKKVAPA VKKTSASKKV HTPKGAVVVP EKAAPRKKAA
PKKAAA
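
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the protein text is copied from the record above.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print a sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a nucleotide string (0.0 if empty)."""
    seq = join_blocks(dna)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Protein sequence as printed in the record.
protein_blocks = """
MAPVKKTAAP PRKATTHPTF LSMIQECIAQ NKGDARKGVS RPTIKKFLAD KYKLDMSSAA
NISNLSNAIK RGAEKGQLTL PSGIAGRVKA GAKKPALVHK KSSAGKENVA PKKAASTETR
KHAVRKGVTA PAAIKAVPTK KPVVKKVAPA VKKTSASKKV HTPKGAVVVP EKAAPRKKAA
PKKAAA
"""

protein = join_blocks(protein_blocks)
print(len(protein))  # compare with the "Protein Length" field
```

Passing the gene sequence text to `gc_fraction` in the same way gives a value that can be compared with the listed GC content.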