Gene CNM01900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNM01900 
Symbol 
ID3255173 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006682 
Strand
Start bp579104 
End bp581123 
Gene Length2020 bp 
Protein Length555 aa 
Translation table 
GC content52% 
IMG OID638254344 
Productconserved hypothetical protein 
Protein accessionXP_568477 
Protein GI58262134 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2971] Predicted N-acetylglucosamine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGCAAG AGAATGGAAG TGGTTTAGTA TCCGATCAAC CAACCGTCAC TGGCGATCAG 
GTATCAGACG TCGCTATGCC TACCAGGCTG CCCACTCCAC CAGCTTCACC CACTGTCCCA
CAGCTCATCC TCTGTGCTGA TGGAGGGGGT TCCAAGGTGT GTGTGGTGGT TAGGAGTGCG
GATGGGTTAG AGGTCAGAGG CACTGCTGGG CCCTGTAATG TGTGAGTTAT TACACTAATA
TCACTATCGC AAGAACTGAG CAACTCTCTA GTCAAAGTGT TGGGTATGCA GCGGCTACTC
AGTCACTCCT TTTGGCCACC TATCGTGCCC TTGCCCAACT ACCTCGCTCT CACATCCCAC
ACAACCTCCT CATGCCCTCG ATTGACCTTT CTGACAGCAC GCGTTCATCT CCTCAACTTC
AAGCCGTCCC CATGATACTC AAAAACCAAC TGCCCTCACC ACCTTCTTCC ATTGGTTCAA
AATCTGTGGT GATTCATAAT CTTCCATCTG TCATGAACAA CCTCACACTC GAACCGCCTT
CTGCTTCTTC TTCCTTCTCC TCTTCCTCTT CCCTCCCCCC TTCATCATCG CCTTCTACAA
CCCATTCGTC TCCCACGTCA ATCCCCTTGA CGCACCTCTC GTCATACCCG ACCTTGACGG
ACAAATCAAT ATCCACGCCA AACCCAGGCA ACCCCACCCC TCGCACGCGC TTACCGCCCC
TTAGTGTACC GATCTTTCAA TACGCGTGGC TGGCGTTGGC TGGAATATCG TGCAAGGCAG
ATGAACAAGC TTTTGCAAAG GTCGTCTGTG GAGTCTTGGG TTTAGATATG GAACGGCTCA
AAGTTACGAA TGGTAGGTTC CCATAGCGTT TCAAGATCTT TTTTATGACG CAGCTGATAC
GTGTGTGGAA AGATGTAAAC CTCTTGGCGG CACCAGCACT CGACCTTCCG GACATAGACC
ATGTAATTGC ACTTGTCGCT GGGACAGGGA CGGTAGGACG GGCGATCAAA GTTGGCGATA
AGAAGCGAGG GTTGCCTCTG GAAGATGTTG CCATGTCCCG AGGTTGGGGT TATTTGTGAG
TGATTTATTT TTTTGCTATT CTTAGTAAAA CTCAATAAAT GTGCTGACTG TGATGCAGAT
TATGCGATGA AGGATCGGCA TTTTGGATCG GTCGGTTAGC TATTAGAGCC CTTTTATCTC
TTTCCGACCG CCATGCTTCA TCGGGCATCT ATTCCTCCCC TCCACCGCCT TTCCTGCCCC
TTCACAACGA CCTCCTAGCA TACTTTGGAA CGTCCAACCC CCTCGACTTG ATCAACGTCG
CATCGCTCAC TGCGTCAGGG ATGGCAGAGC CTACCGAAAG TGTGGGCGAA GCGACGAGCC
GGAGGAACGC TTTACTAGCA GGTGCAGCGA GGGTGGTGTT CAAACACGCT TTCCCAGGGG
ATGTTAGTCC CCGCCCAGGA TTCCTTACGC CGCCACGCAG TACAGATGGA GGTGCTGATA
TGGATGAGGA TCATGAAAGC ACGTCGAGTC CTCGACAGCC GGAAGAGTTG AAGCACGATG
GTATTTTGGA TCACGCGTCC CACCTTGAAG CACTCGGTAT CGCACGTCAG GCAGCCGCGC
CGCTTATTAC GCTTACACTC TCGCTCCTTG GCGACCGCAC AATCGTCAGA CCTGAAAGGT
CAGCGTTAAC ACTTGGAGGC GGGCTGATGA TGAGCGAGGG ATACAGAGAG ATGCTCTTGG
ATGGATTGAA GAAGGAGGGA GTGAGCTTTG GACGGGTGAT GGTGGTGGGT GACGCTGCTG
GTGAAGGGGC CCAGGCTCTT GGTAGAGTTG AGTTTGAGTG AGAGATGTCT CTGAACTGGT
ATTATTTGTG TCTGCATTTG TAGTAGCTGC TCCTGGACGC TATTATAATC CATACCATAG
CGAGCTATAT ATTTCTATAT ACCATGCCTG TCAGGTTACT TACCGTCGTC ATCGTTTTGG
TCTTTGGCGC TGTCACCCAC GTTGTACGAC GAGTTTGCGT
 
Protein sequence
MLQENGSGLV SDQPTVTGDQ VSDVAMPTRL PTPPASPTVP QLILCADGGG SKVCVVVRSA 
DGLEVRGTAG PCNVQSVGYA AATQSLLLAT YRALAQLPRS HIPHNLLMPS IDLSDSTRSS
PQLQAVPMIL KNQLPSPPSS IGSKSVVIHN LPSVMNNLTL EPPSASSSFS SSSSLPPSSS
PSTTHSSPTS IPLTHLSSYP TLTDKSISTP NPGNPTPRTR LPPLSVPIFQ YAWLALAGIS
CKADEQAFAK VVCGVLGLDM ERLKVTNDVN LLAAPALDLP DIDHVIALVA GTGTVGRAIK
VGDKKRGLPL EDVAMSRGWG YLLCDEGSAF WIGRLAIRAL LSLSDRHASS GIYSSPPPPF
LPLHNDLLAY FGTSNPLDLI NVASLTASGM AEPTESVGEA TSRRNALLAG AARVVFKHAF
PGDVSPRPGF LTPPRSTDGG ADMDEDHEST SSPRQPEELK HDGILDHASH LEALGIARQA
AAPLITLTLS LLGDRTIVRP ERSALTLGGG LMMSEGYREM LLDGLKKEGV SFGRVMVVGD
AAGEGAQALG RVEFE