Gene Mmcs_4001 details


Gene Information

Locus tag: Mmcs_4001
Symbol:
ID: 4112831
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 4266757
End bp: 4267905
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 70%
IMG OID: 638033144
Product: glycogen synthase
Protein accession: YP_641162
Protein GI: 108800965
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID: [TIGR02149] glycogen synthase, Corynebacterium family
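
The Start bp, End bp, Gene Length and Protein Length values in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state this, but the assumption reproduces the listed 1149 bp) and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4266757, 4267905)
print(length_bp)                  # 1149
print(protein_length(length_bp))  # 382
```
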


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0312479
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCGGG AGTACCCACC CGAGGTCTAC GGCGGAGCAG GCGTACACGT CACCGAACTC 
GTCGCGCAGT TGCGCCGGCT GTGCGAGGTC GACGTGCACT GCATGGGAGC GCCGCGCGCC
GACGCGTTCG TCGCCGCACC GGACCCCGCC CTGGCGGGTG CCAACCCGGC GCTGTCCACG
CTCTCGGCCG ACCTGAACAT GGTCAACGCC GCCGGCGGGG CCACCGTCGT GCACTCCCAC
ACCTGGTACG CCGGGCTCGC CGGCCATCTC TCGTCGCTGC TGTACGGCAT TCCCCACGTG
CTCACCGCGC ACTCGCTGGA GCCGATGCGG CCGTGGAAAG CCGAACAGCT CGGCGGCGGC
TACCGGGTGT CGTCGTGGGT GGAGAAGACG GCGGTCGAGG CGGCCGACGC GGTGATCGCG
GTGAGCTCCG GCATGCGCGA CGACGTCCTC AAGACCTATC CCGCGTTGGA CCCCGCCCGG
GTGCACGTGG TGCGCAACGG CATCGACACC GACGTCTGGT ATCCCAGCCA GCCGCAGTCC
GGCGAGTCGG TGCTGGCCGA ACTCGGTGTG GATCCGGGCC GGCCGATCGT CGCGTTCGTC
GGCCGCATCA CCCGGCAGAA GGGGGTGGCG CACCTGGTGG CGGCCGCCCA TCACTTCGCG
CCCGAGGTGC AGTTGGTGCT GTGCGCCGGT GCACCCGACA CCCCGGAGAT CGCCGCGGAG
GTGACGTCCG CGGTGCAGCA GCTGGCCCGC GCCCGCACCG GGGTGTTCTG GGTGCGGGAG
ATGCTGCCGA TCGGCAAGAT TCGCGAAATC CTCTCGGCGG CAACTGTTTT CGTCTGCCCG
TCGGTGTATG AACCGCTGGG CATCGTCAAC CTCGAGGCGA TGGCATGCGG GACGGCGGTG
GTCGCTTCGG ACGTCGGCGG CATCCCCGAG GTCGTCGCCG ACCACCAGAC CGGGCTGCTG
GTGCACTACG ACGCTGCCGA CACCGGCTTC TTCGAGACCC GGTTGGCCGA TGCGGTCAAC
TCACTGATCG CCGAACCGCA GCGGGCGCGC GCATACGGCG CCGCAGGCCG TGAACGGTGC
ATCGCCGAAT TCTCGTGGGC GCACATCGCC GAGCAGACCA TGGAGATCTA CCGCAAGGTG
TCGGGGTAG
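
A GC content figure such as the 70% listed for this gene can be recomputed directly from the nucleotide sequence. The sketch below is one simple way to do so, assuming the sequence is pasted as shown here (blocks of ten bases separated by spaces and line breaks) and that GC content means the fraction of G and C among all bases. The name `gc_fraction` is hypothetical. Only the first line of the gene sequence is used as sample input, so the printed value describes that 60-base fragment, not the whole gene.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record (60 bases only).
first_line = "ATGACTCGGG AGTACCCACC CGAGGTCTAC GGCGGAGCAG GCGTACACGT CACCGAACTC"
print(f"{gc_fraction(first_line):.1%}")
```

Passing the complete gene sequence instead of `first_line` gives the value to compare against the GC content field.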
 
Protein sequence
MTREYPPEVY GGAGVHVTEL VAQLRRLCEV DVHCMGAPRA DAFVAAPDPA LAGANPALST 
LSADLNMVNA AGGATVVHSH TWYAGLAGHL SSLLYGIPHV LTAHSLEPMR PWKAEQLGGG
YRVSSWVEKT AVEAADAVIA VSSGMRDDVL KTYPALDPAR VHVVRNGIDT DVWYPSQPQS
GESVLAELGV DPGRPIVAFV GRITRQKGVA HLVAAAHHFA PEVQLVLCAG APDTPEIAAE
VTSAVQQLAR ARTGVFWVRE MLPIGKIREI LSAATVFVCP SVYEPLGIVN LEAMACGTAV
VASDVGGIPE VVADHQTGLL VHYDAADTGF FETRLADAVN SLIAEPQRAR AYGAAGRERC
IAEFSWAHIA EQTMEIYRKV SG
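
The protein sequence corresponds to a codon-by-codon translation of the gene sequence using translation table 11. As a hedged illustration, the sketch below builds a codon table from the standard amino-acid assignments, which table 11 shares with the standard code (table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and the code is an illustration rather than the method the database used.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGACTCGGG AGTACCCACC CGAGGTCTAC GGCGGAGCAG GCGTACACGT CACCGAACTC"
print(translate(first_line))  # MTREYPPEVYGGAGVHVTEL
```

The output matches the first 20 residues of the protein sequence, and translating the final gene line `TCGGGGTAG` gives `SG`, the last two residues before the stop codon.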