Gene CNM01890 details


Gene Information

Locus tag: CNM01890
Symbol:
ID: 3255154
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006682
Strand:
Start bp: 576289
End bp: 578078
Gene Length: 1790 bp
Protein Length: 434 aa
Translation table:
GC content: 51%
IMG OID: 638254343
Product: conserved hypothetical protein
Protein accession: XP_568292
Protein GI: 58261764
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1820] N-acetylglucosamine-6-phosphate deacetylase
TIGRFAM ID: [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase
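
The reported lengths can be cross-checked from the coordinates. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding region ends with a single stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 576289
end_bp = 578078
protein_length_aa = 434

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Three bases per residue, plus one stop codon (assumed to be present).
coding_bp = protein_length_aa * 3 + 3

# Whatever remains is non-coding: introns and/or untranslated regions.
noncoding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 1790
print(coding_bp)       # 1305
print(noncoding_bp)    # 485
```

The gap between the 1790 bp gene span and the 1305 coding bases is consistent with the record marking the gene as spliced.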


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCTTCTGGTA GAACAGATCG GCGCCTGGTC GACATTTTCA CCTTTGTTCG TTCGTCCTTT 
GTTTGGTTTC CTGGCCAGAA GTACGCACAT CGTATATATA GAAGGGTGCC CACGTCTTGA
CTAGTAGTGA TCGCAGCTGC CAGCCATGTC GGATATAGTC AGGTTCACCA ATGGCTACTT
GGCAATGCCA GACGGCACAG TATGTGCTCC ATCACATTCC ATGCGCTCTT CTTCTCACAC
TCTGTAGGCC GTCAAAGCAG ACCTTTACAT CTCATCTTCC TCTGGCAAGA TCATCTCCGG
CCAATCCTCC TTCTATTCCA ACCACTCCCC ATGTCGGACA GTCGACTTGC AAGGCAATCT
TCTCTCGCCA GGCTTGATCG ATATTCAGAT CAACGGTGCT TGGCGCGTAG ACTTTTCAGA
GCTGGATGTT CAAGCAGGGG AAGAGGGCGA GAAAAAGTAT ATCAAGGGGC TGGAGAGGGT
AGCGAGGAGG TTGGCGCAAT ATGGAACCAC AAGTTTTGTG CCGACCATCA TCACTCAGCA
TCAAGAGCTT TATAGCAAAG TGAGCCCCAA GATATTTTGT TACATGTGGC TATTGACATT
GAGTAGCTTC TTCGACTTCT TTGCCCGCGC TCCCCTCCTG GCTCCTCCCA TATCCTAGGT
TACCACGCCG AAGGCCCTTT CCTTTCCCCT ATCCGTAAAG GCGCCCATTC TTCAACCCTC
CTATTGACCG CCTCCTCCAC CTCTCCCATA TTCCCCCCTG GGGCGTCGGA TACCTCACCC
ATGAAAGCTC TCGAGATTGT CTACGGTAAA GAAGGGCTCG ATCAGCAAGG TGTAAAGATC
ATCACCTTGG CACCCGATGT AGATGGGGTT ATGGATTGTA TTGAACCGTT GGTAGAACGA
GGAGTTGTGG TTTCCGTAGG GCATAGGTGA GCCCCTATTA TCCCTTCCAG AGAAGTTTGG
TGGCAGATAA CTGACAAGGT CCGTGTAAGT GACGCTTCAT TAGAGCAAGT CGAAGAAGCG
TTTGACAAAG GCGCTCGCAT GATTACCCAT TTGTTCAAGT ACGTGTCTCC TCACACTTTT
TCCATCCCTT TCTTCCTACA CTAACGACGT CCCTCTAGTG CCATGCCACC AATCCATCAT
CGTGATCCGG GTGTAGTCGG CATGCTCGGC AACCCTAACC GTCGCCCATA CTTTGGGATC
ATCGTCGACG GCTTACATTC TCATCCCAAT ACTGTCCGAA TCGCTTATGG TGCTTGTGAA
GAGGGTTGTG TCCTTGTTTC GGATGGTGAG CACCTCATCA CCTCCTCGAT TGATCATCAT
ATGGAATTGT TGCTGATCGT TGAATGGCAT ACTTTAGCGC AAAGTATCAT GGACCCCTCG
CAGCCAGATG GAGTCATCGA CTGGCGACCA GGACTTCGGT TTAGGAAAGA AGGTCTCAAA
GTCCTTGTGG ACGGCACTTC TACCTTGGCA GGTAGCGCGG CACCCCTCGC ACCACTCGCT
CACAATCTCG CAAAATTCGC GTCCATCTCG CTCCCCATGG CTCTTGTCTG TGCGACCAAA
CACCCAGCAG AGTGTCTGGG AGGGGAAGTG GCCAAGCGCA AGGGGCAGTT GATAGAAGGG
TTTGATGCGG ATTTATGTGT GTTTGATTGG GAGGGGAATG TTAAGGACGT TTGGATTATG
GGTGAGGAAA TTTGGAAGGA TGGGGAAGGA TGGGTGGATG GTCGTGGAGA TGGGCCATAG
TAGATCAATC TTGTTGAATC TGGGGTTTCA GGTACATTTA CGAGCTTGAC
 
Protein sequence
MSDIVRFTNG YLAMPDGTAV KADLYISSSS GKIISGQSSF YSNHSPCRTV DLQGNLLSPG 
LIDIQINGAW RVDFSELDVQ AGEEGEKKYI KGLERVARRL AQYGTTSFVP TIITQHQELY
SKLLRLLCPR SPPGSSHILG YHAEGPFLSP IRKGAHSSTL LLTASSTSPI FPPGASDTSP
MKALEIVYGK EGLDQQGVKI ITLAPDVDGV MDCIEPLVER GVVVSVGHSD ASLEQVEEAF
DKGARMITHL FNAMPPIHHR DPGVVGMLGN PNRRPYFGII VDGLHSHPNT VRIAYGACEE
GCVLVSDAQS IMDPSQPDGV IDWRPGLRFR KEGLKVLVDG TSTLAGSAAP LAPLAHNLAK
FASISLPMAL VCATKHPAEC LGGEVAKRKG QLIEGFDADL CVFDWEGNVK DVWIMGEEIW
KDGEGWVDGR GDGP
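
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any length or GC calculation. The helper names below (`clean_sequence`, `gc_percent`) are hypothetical, and the input is a toy string rather than the record's sequence. This is a minimal sketch of one way to do it.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and newlines alike.
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input only, not the gene sequence from this record.
example = "ACGTACGTGG CC\n"
seq = clean_sequence(example)
print(len(seq))                   # 12
print(round(gc_percent(seq), 1))  # 66.7
```

Applied to the full gene sequence, the same two calls would give values to compare against the reported gene length and GC content.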