Gene EcSMS35_0095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0095 
SymbolmurG 
ID6146870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp106277 
End bp107344 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content56% 
IMG OID641614996 
Productundecaprenyldiphospho-muramoylpentapeptide beta-N- acetylglucosaminyltransferase 
Protein accessionYP_001742212 
Protein GI170683320 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase 
TIGRFAM ID[TIGR01133] undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.130586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGGTC AAGGAAAGCG ATTAATGGTG ATGGCAGGCG GAACCGGTGG ACATGTATTC 
CCGGGACTGG CGGTTGCGCA CCATCTAATG GCTCAGGGTT GGCAAGTTCG CTGGCTGGGG
ACTGCCGACC GTATGGAAGC GGACTTAGTG CCAAAACATG GCATCGAAAT TGATTTCATT
CGTATCTCTG GTCTGCGTGG AAAAGGTATA AAAGCACTGA TAGCTGCGCC GCTGCGTATC
TTCAACGCCT GGCGTCAGGC GCGGGCGATT ATGAAAGCGT ACAAACCTGA CGTGGTGCTC
GGTATGGGCG GCTATGTATC AGGTCCAGGT GGTCTGGCTG CGTGGTCGTT AGGCATTCCG
GTTGTACTTC ATGAACAAAA CGGTATTGCG GGCTTAACCA ATAAATGGCT GGCGAAGATT
GCTACCAAAG TGATGCAGGC GTTTCCAGGC GCTTTCCCTA ATGCGGAAGT GGTGGGTAAC
CCGGTGCGTA CCGATGTGCT GGCGCTGCCG TTGCCGCAGC AACGTTTGGC TGGACGTGAA
GGTCCGGTTC GTGTGTTGGT AGTGGGTGGT TCCCAGGGCG CACGCATTCT TAATCAGACA
ATGCCGCAGG TTGCTGCAAA ACTGGGTGAT TCAGTCACTA TCTGGCATCA GAGCGGCAAA
GGTTCGCAAC AATCCGTTGA ACAGGCGTAT GCCGAAGCGG GACAACCGCA GCATAAAGTG
ACGGAATTTA TTGATGATAT GGCGGCGGCG TATGCGTGGG CGGATGTCGT TGTTTGCCGC
TCCGGTGCGT TAACGGTGAG TGAAATCGCC GCGGCAGGAC TTCCGGCGTT GTTTGTGCCG
TTTCAACATA AAGACCGTCA GCAATACTGG AATGCGCTAC CGCTGGAAAA AGCGGGCGCA
GCCAAAATTA TCGAGCAGCC ACAGCTTAGC GTGGATGCTG TCGCCAACAC CCTGGCCGGG
TGGTCGCGAG AAACCTTATT AACCATGGCA GAACGCGCCC GGGCTGCATC CATTCCGGAT
GCCACCGAGC GAGTGGCAAA TGAAGTGAGC CGGGCTGCCC GGGCGTAA
 
Protein sequence
MSGQGKRLMV MAGGTGGHVF PGLAVAHHLM AQGWQVRWLG TADRMEADLV PKHGIEIDFI 
RISGLRGKGI KALIAAPLRI FNAWRQARAI MKAYKPDVVL GMGGYVSGPG GLAAWSLGIP
VVLHEQNGIA GLTNKWLAKI ATKVMQAFPG AFPNAEVVGN PVRTDVLALP LPQQRLAGRE
GPVRVLVVGG SQGARILNQT MPQVAAKLGD SVTIWHQSGK GSQQSVEQAY AEAGQPQHKV
TEFIDDMAAA YAWADVVVCR SGALTVSEIA AAGLPALFVP FQHKDRQQYW NALPLEKAGA
AKIIEQPQLS VDAVANTLAG WSRETLLTMA ERARAASIPD ATERVANEVS RAARA