Gene EcHS_A0096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0096 
SymbolmurG 
ID5590910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp101399 
End bp102466 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content56% 
IMG OID640919284 
Productundecaprenyldiphospho-muramoylpentapeptide beta-N- acetylglucosaminyltransferase 
Protein accessionYP_001456879 
Protein GI157159561 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase 
TIGRFAM ID[TIGR01133] undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value0.178087 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGTC AAGGAAAGCG ATTAATGGTG ATGGCAGGCG GAACCGGTGG ACATGTATTC 
CCGGGACTGG CGGTTGCGCA CCATCTAATG GCTCAGGGTT GGCAAGTTCG CTGGCTGGGG
ACTGCCGACC GTATGGAAGC GGACTTAGTG CCAAAACATG GCATCGAAAT TGATTTCATT
CGTATCTCTG GTCTGCGTGG AAAAGGTATA AAAGCACTGA TAGCTGCCCC GCTGCGTATC
TTCAACGCCT GGCGTCAGGC GCGGGCGATT ATGAAAGCGT ACAAACCTGA CGTGGTGCTC
GGTATGGGAG GCTACGTGTC AGGTCCAGGT GGTCTGGCCG CGTGGTCGTT AGGCATTCCG
GTTGTACTTC ATGAACAAAA CGGTATTGCG GGCTTAACCA ATAAATGGCT GGCGAAGATT
GCCACCAAAG TGATGCAGGC GTTTCCAGGT GCTTTCCCTA ATGCGGAAGT AGTGGGTAAC
CCGGTGCGTA CCGATGTGTT GGCGCTGTCG TTGCCGCAGC AACGTTTGGC TGGACGTGAA
GGTCCGGTTC GTGTGCTGGT AGTGGGTGGT TCTCAGGGCG CACGCATTCT TAACCAGACA
ATGCCGCAGG TTGCTGCGAA ACTGGGTGAT TCAGTCACTA TCTGGCATCA GAGCGGCAAA
GGTTCGCAAC AATCCGTTGA ACAGGCGTAT GCCGAAGCGG GGCAACCGCA GCATAAAGTG
ACGGAATTTA TTGATGATAT GGCGGCGGCG TATGCGTGGG CGGATGTCGT CGTTTGCCGC
TCCGGTGCGT TAACGGTGAG TGAAATCGCC GCGGCAGGAC TACCGGCGTT GTTTGTGCCG
TTTCAACATA AAGACCGCCA GCAATACTGG AATGCGCTAC CGCTGGAAAA AGCGGGCGCA
GCCAAAATTA TCGAGCAGCC ACAGCTTAGC GTGGATGCTG TCGCCAACAC CCTGGCCGGG
TGGTCGCGAG AAACCTTATT AACCATGGCA GAACGCGCCC GCGCTGCATC CATTCCGGAT
GCCACCGAGC GAGTGGCAAA TGAAGTGAGC CGGGTTGCCC GGGCGTAA
 
Protein sequence
MSGQGKRLMV MAGGTGGHVF PGLAVAHHLM AQGWQVRWLG TADRMEADLV PKHGIEIDFI 
RISGLRGKGI KALIAAPLRI FNAWRQARAI MKAYKPDVVL GMGGYVSGPG GLAAWSLGIP
VVLHEQNGIA GLTNKWLAKI ATKVMQAFPG AFPNAEVVGN PVRTDVLALS LPQQRLAGRE
GPVRVLVVGG SQGARILNQT MPQVAAKLGD SVTIWHQSGK GSQQSVEQAY AEAGQPQHKV
TEFIDDMAAA YAWADVVVCR SGALTVSEIA AAGLPALFVP FQHKDRQQYW NALPLEKAGA
AKIIEQPQLS VDAVANTLAG WSRETLLTMA ERARAASIPD ATERVANEVS RVARA