Gene CNA04270 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA04270 
Symbol 
ID3253370 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp1147503 
End bp1151144 
Gene Length3642 bp 
Protein Length895 aa 
Translation table 
GC content51% 
IMG OID638252747 
ProductHMG1, putative 
Protein accessionXP_566774 
Protein GI58258723 
COG category[B] Chromatin structure and dynamics 
COG ID[COG5648] Chromatin-associated proteins containing the HMG domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCACGTCCCA TACTCCGTAC CTGCGCCTGC TGACGCACTA CGCGACTGTC CCCGCCATAG 
ATCATAGATC CTGTAGCCTG TAATCTCCCC TGATCATCCA CACGGACCTC GCAATCCGCA
CCGCTCTTCC GCCACTCCAT CATTCCCGCA TCTGCATCCA CGGCGACCGC GTTGCCCCTT
CTCTCCCTCA CCGCCCCTCT TCCGCCGTAG CCAAGCGATC TAATCGCGCA TTATCTCATT
ATCAGTACCT ATCATGTGAG TCCTCTATAT ATCACTTGAC GCGCCTTCTT ACGTCGCGTC
GCGTATGGCC ATCCTATGTA TGCAGTAACG CTGATGTTTT CTTGATGTAG TGCTGAGCGC
GTTGCTGCTC CTCGATTAAT ATGCAAAGCA CCAACGCGCC TGTTAACGGT GGTGTTTCCC
AACAGTGGCA GGTAAGCCTT CCCCTTCATA CCTCGGTCTC TACTGTTGAT CTAATTTTGA
ATGTAATCGC CGACGTGCTT TTGTTTCCAT TTGGTCTCTT TATTCACTGA TTATGCCTCT
TTTTTGTCAT GGCCTAATTC ATCACCCTCT CGACGCAGTC GGCCATGCAG GGCATGTACT
CAGGTGCCCA GCAGAGGCCC ACATCCAATT CGAAGCCCTA TAACGCGCCG GATGCTAATG
GTATGCCCGC CAAGAAGGAC GATGCGAATC AATCCATGTA CTCTGTCAAC ACAGAAAATG
GCTATCCCGC GTATCAGCAA CAAGGGTATG GCACGCTTGG AGGCTTAGGA CAATATGGAT
ACAATAGTGC TGGACTGGGG TGGGTGCCAT GTGACTAAAT CTGGAAGTAA CCTTGTTGAC
ATGGACGTAC TGCAGCGGCC TGTCAGCAGC ATATGGTGGA GCAACATCAT CAACAGCTGG
TGCGACCAAT GGACGCAGAA CATCTGGCCA GAGGATGATG ACCCCGCCTC CTCACCCCTC
TTCAGTTGCA ACAAACACCT CTCCATCTGC TCGATATTAC ACTGGTCAAG AAGGTGAGGC
AGGCCGTCAT GCTTCGCCCT ATCAGCCCGC TTCCGCACCC CCTCAGATCC TCTCTCATCA
GTTCAACGCC ATGACCAATT CACCTCAAAC AGCCCAAGGT TACCCCCAAT ATCAGTACAA
TCAACAGTCG CATCAACAAA ATCCAACCCA ATGGGCGGGA TATCCTACGT ACGGTCAGTC
ACAGGCCCAA GCTCGTAATA ATATGTCCGG ACGATTGAGC ACGACACCGG CCATCCCACA
AAACAGTGCG CTGGGCGACA GTTCAGCTCG ACAGCAGCCT ACCTCGATGT ACGATTACAC
CACCCACGCC CTTCAACAAT GGCCCGCCTC TGATCTGTCA AAGGTGCCGA GCGCTCACCA
GAACTCATGG CAGAACGCCC AGTCCTTGTC AAGACTGACA CAGGCTCAAT CACCAGCCCT
TGGTCAAACG CAACCATCAT CCGCTTCTCA ACCCACTTCA TCGACATCAT CCACACAAAA
TGCGTCACCC GCTGTCCCTC AAAGCGGCTG GCAAGGATAC CCCTCTATCC CTAGCATACC
TCCGCATACA CTAGGTGGAG GACATTTGAT GAGTGGGCAA CCGTACGGCT GGGGACACAC
CCAACAATGG GGTGGTTATT ATGGCGCTCA ACAACCATCC CAGGCGCCTC AAGGGACCCC
GGCTCATCGA CCTCCGTTGA ATAACTTTCC AACTGGCAGC ACCAGCTCTG CGCCTGCAGA
GTCTCTCCCT ACTGGGCGAA AGAGGATGAG CAAGGATAAA GGCAAAACGG CCAAGGAAAA
GGAGGAAAAC AGAAGTCATT TCGAAGATTA TCATGGGTTG GGTAAGCGTT CTGCAGAAGC
CGTGGTTGAG GAGATACCTG GGGAAGATAA GAAAGAGAGC AAGAAGAAGA AGGGTAAGGA
AGAAAAGGAG AAGCCGGTTC CCAAAGCCAA GTCTCATCTG CATCCACCAA AACAAGCACC
CTCCTCGTGG CAACTCTTTT TCGCCGATGA ACTTGCCAAA GCCAAAGCAG CTGAACCCGA
GTCAAGATCC CCAGGCGGTA CTTCACATCC ACAAAAACTT AATGTCGCAC AGATAGCTAA
AGAGGCTGGC GTGGCATATG CGAACCTCAG TGAGGAAAGG AAAAAATATT ATGCTGAGCG
CGTCAAGGAG CATAGAGAGA TTTACGCGAA GGAGTTGGCG GCTTGGCAAG CAACACTAAC
TCCTGAGGAC ATTCGTGCTG AGAACGCGTT CAGAGCCCAG CAAAGGAAAG AAGGAAAGTC
GCGCAAGGGA AATATCAAAG ATCCGAACGC GCCAAAGAAA CCCCTCAGTG CCTATTTTTT
ATTTTTGAAG GCTATTAGAG AGAACAGCGA TATCAGAGCT CAAGTTTGGG GAACCGAGGC
TGAGACTACT AAGCAAAGCG TTATGGCGGC TGAGAAGTGG AGAAGTTTAA CTGATGACGA
AAAGAGAGTA AGTCGAGTCT GCATGATCAG TCGGCTTCCT ATTCTAACAG GTGCTACAGC
CTTACCTCGA ACAAGCTGAA CATGATAAGC AAACCTACGA GACTGCTCGT AAGCAGTACG
AGGAAGATTC TGCTGCTCGC GCTCGAGGTG AAGACGTTCC CATTAGGGCT GTGGAAGCAC
CCGCTTCGCC GCCTAAGCCT CCTGCATCTA TCTTGAGATC TATCCACGGA GGACAAGCAC
CGGTTACCAA GTCATCACCT TCTGTGACAG ATTCTGCACC TCATCCTGCG CCTCATTCAG
ACCTCGAACC AGGCATCACC TCTCTGGGCA AGTCACCCTC ACCCAACCCT CCCTCTGAAC
CTAGTCTTGC CCAGTTTCAC ACTTCACCTC GCTCTGTCCC TCATTATGAC CCCTCACTCG
AGGTGGATGA TTTCCAAGGA TTCTCTGATC CTCTTAACAT GGATCTGTCG GGTTTGGACG
GCATAGATGT TGGGTCTATG GAAGTAGGTG CTGGTGGCGA TAGTGAACAA CCGTGGGATG
AGCTGCAAAA GCTCATAGGC ACAGAAGATG TGTATAGTTC AGCAAAGCTT CAGCCTGCAG
CGCATATCGT ACCTGACGCT TCAACCGCAG CAAGAGTCGC CCCTGCTTCT GGATTTGAAA
GCACTGACAT TTCAGTTGCG CCATCCTATG CTGAAGCTAA GAACGAGACA CAGCAAGAGA
CTTCCTTGAA TGGGACCCCT GAGGGGAACG GGAATGAGTC AAAGGTACGC CATATTGCAG
AGGCAGCGGA GGGCGAGTTG GCTGTGCTTC CAGCTCTAGG CGAAAATACC GCCCAGAAGA
ACCAAGGGGG TTTAGAGGTC ATGGTTGGAG CAGGGGAAGC CCGAAGACAT AATAAAAACT
TGTCAACTGA ATCAGGACAG GCGGCTGCCG ACGCTCCAGT GGTTGATGGG GTGTAAAATG
GGGCTTTGTG GTTTATAATA TAATATGTAG TGAAGGTTTT GCGTGGACCA GTACGTAGAA
CAAACACATT AGCTTATTAA TCTTACTTTG AACATAGTCA TATATTCATA TCAGATTCAA
CATCCTTGCT ATACGCGAAT AGGAGCATAC ATTGATATAT TGTAGTTTTA TCTTATTAAT
CTTGCTTTCC AACACGTAGT ACATCCATCT GTATTATTGT AA
 
Protein sequence
MQSTNAPVNG GVSQQWQSAM QGMYSGAQQR PTSNSKPYNA PDANGMPAKK DDANQSMYSV 
NTENGYPAYQ QQGYGTLGGL GQYGYNSAGL GGLSAAYGGA TSSTAGATNG RRTSGQRMMT
PPPHPSSVAT NTSPSARYYT GQEGEAGRHA SPYQPASAPP QILSHQFNAM TNSPQTAQGY
PQYQYNQQSH QQNPTQWAGY PTYARQQPTS MYDYTTHALQ QWPASDLSKV PSAHQNSWQN
AQSLSRLTQA QSPALGQTQP SSASQPTSST SSTQNASPAV PQSGWQGYPS IPSIPPHTLG
GGHLMSGQPY GWGHTQQWGG YYGAQQPSQA PQGTPAHRPP LNNFPTGSTS SAPAESLPTG
RKRMSKDKGK TAKEKEENRS HFEDYHGLGK RSAEAVVEEI PGEDKKESKK KKGKEEKEKP
VPKAKSHLHP PKQAPSSWQL FFADELAKAK AAEPESRSPG GTSHPQKLNV AQIAKEAGVA
YANLSEERKK YYAERVKEHR EIYAKELAAW QATLTPEDIR AENAFRAQQR KEGKSRKGNI
KDPNAPKKPL SAYFLFLKAI RENSDIRAQV WGTEAETTKQ SVMAAEKWRS LTDDEKRPYL
EQAEHDKQTY ETARKQYEED SAARARGEDV PIRAVEAPAS PPKPPASILR SIHGGQAPVT
KSSPSVTDSA PHPAPHSDLE PGITSLGKSP SPNPPSEPSL AQFHTSPRSV PHYDPSLEVD
DFQGFSDPLN MDLSGLDGID VGSMEVGAGG DSEQPWDELQ KLIGTEDVYS SAKLQPAAHI
VPDASTAARV APASGFESTD ISVAPSYAEA KNETQQETSL NGTPEGNGNE SKVRHIAEAA
EGELAVLPAL GENTAQKNQG GLEVMVGAGE ARRHNKNLST ESGQAAADAP VVDGV