Gene EcolC_3567 details


Gene Information

Locus tag: EcolC_3567
Symbol: murG
ID: 6065888
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3898870
End bp: 3899937
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 56%
IMG OID: 641602984
Product: undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase
Protein accession: YP_001726508
Protein GI: 170021554
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase
TIGRFAM ID: [TIGR01133] undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase
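
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3898870
end_bp = 3899937
gene_length_bp = 1068
protein_length_aa = 355

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codons = span_bp // 3
residues = codons - 1

print(span_bp, codons, residues)
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.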


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.713723
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00628085
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTGGTC AAGGAAAGCG ATTAATGGTG ATGGCAGGCG GAACCGGTGG ACATGTATTC 
CCGGGACTGG CGGTTGCGCA CCATCTAATG GCTCAGGGTT GGCAAGTTCG CTGGCTGGGG
ACTGCCGACC GTATGGAAGC GGACTTAGTG CCAAAACATG GCATCGAAAT TGATTTCATT
CGTATCTCTG GTCTGCGTGG AAAAGGTATA AAAGCACTGA TAGCTGCCCC GCTGCGTATC
TTCAACGCCT GGCGTCAGGC GCGGGCGATT ATGAAAGCGT ACAAACCTGA CGTGGTGCTC
GGTATGGGAG GCTACGTGTC AGGTCCAGGT GGTCTGGCCG CGTGGTCGTT AGGCATTCCG
GTTGTACTTC ATGAACAAAA CGGTATTGCG GGCTTAACCA ATAAATGGCT GGCGAAGATT
GCCACCAAAG TGATGCAGGC GTTTCCAGGT GCTTTCCCTA ATGCGGAAGT AGTGGGTAAC
CCGGTGCGTA CCGATGTGTT GGCGCTGCCG TTGCCGCAGC AACGTTTGGC TGGACGTGAA
GGTCCGGTTC GTGTGCTGGT AGTGGGTGGT TCTCAGGGCG CACGCATTCT TAACCAGACA
ATGCCGCAGG TTGCTGCGAA ACTGGGTGAT TCAGTCACTA TCTGGCATCA GAGCGGCAAA
GGTTCGCAAC AATCCGTTGA ACAGGCGTAT GCCGAAGCGG GGCAACCGCA GCATAAAGTG
ACGGAATTTA TTGATGATAT GGCGGCGGCG TATGCGTGGG CGGATGTCGT CGTTTGCCGC
TCCGGTGCGT TAACGGTGAG TGAAATCGCC GCGGCAGGAC TACCGGCGTT GTTTGTGCCG
TTTCAACATA AAGACCGCCA GCAATACTGG AATGCGCTAC CGCTGGAAAA AGCGGGCGCA
GCCAAAATTA TCGAGCAGCC ACAGCTTAGC GTGGATGCTG TCGCCAACAC CCTGGCCGGG
TGGTCGCGAG AAACCTTATT AACCATGGCA GAACGCGCCC GCGCTGCATC CATTCCGGAT
GCCACCGAGC GAGTGGCAAA TGAAGTGAGC CGGGTTGCCC GGGCGTAA
 
Protein sequence
MSGQGKRLMV MAGGTGGHVF PGLAVAHHLM AQGWQVRWLG TADRMEADLV PKHGIEIDFI 
RISGLRGKGI KALIAAPLRI FNAWRQARAI MKAYKPDVVL GMGGYVSGPG GLAAWSLGIP
VVLHEQNGIA GLTNKWLAKI ATKVMQAFPG AFPNAEVVGN PVRTDVLALP LPQQRLAGRE
GPVRVLVVGG SQGARILNQT MPQVAAKLGD SVTIWHQSGK GSQQSVEQAY AEAGQPQHKV
TEFIDDMAAA YAWADVVVCR SGALTVSEIA AAGLPALFVP FQHKDRQQYW NALPLEKAGA
AKIIEQPQLS VDAVANTLAG WSRETLLTMA ERARAASIPD ATERVANEVS RVARA