Gene EcSMS35_0093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0093 
SymbolmurD 
ID6143343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp103720 
End bp105036 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content55% 
IMG OID641614994 
ProductUDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase 
Protein accessionYP_001742210 
Protein GI170681706 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase 
TIGRFAM ID[TIGR01087] UDP-N-acetylmuramoylalanine--D-glutamate ligase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.388971 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATT ACCAGGGTAA AAATGTCGTC ATTATCGGCC TGGGCCTCAC CGGGCTTTCT 
TGCGTGGACT TTTTCCTCGC TCGCGGTGTG ACGCCGCGCG TTATGGATAC GCGTATGACA
CCGCCTGGCC TGGATAAATT ACCCGAAGCT GTAGAACGCC ACACGGGCGG TCTGAATGAT
GAATGGCTGA TGGCGGCCGA TCTGATTGTC GCCAGTCCCG GTATTGCACT GGCGCATCCA
TCCTTAAGCG CTGCCGCTGA TGCCGGAATC GAAATCGTTG GCGATATCGA GCTGTTCTGT
CGTGAAGCAC AAGCACCGAT TGTGGCGATC ACCGGTTCTA ACGGCAAAAG CACGGTCACC
ACGCTAGTGG GTGAAATGGC GAAAGCGGCG GGGGTTAACG TTGGTGTGGG TGGCAATATT
GGCCTGCCTG CGTTGATGCT GCTGGATGAT GAGTGCGAAC TGTACGTGCT GGAGCTGTCG
AGCTTCCAGC TTGAAACCAC CTCCAGTTTA CAGGCGGTAG CAGCGACCAT TCTGAACGTA
ACTGAAGATC ATATGGATCG CTATCCGTTT GGTTTACAAC AGTATCGTGC AGCAAAACTG
CGCATTTACG AAAACGCAAA AATTTGCGTG GTTAATGCTG ATGATGCCTT AACAATGCCG
ATTCGCGGTG CGGATGAACG CTGCGTCAGC TTTGGCGTCA ACATGGGTGA CTATCACCTG
AATCATCAGC AGGGTGAAAC CTGGCTGCGG GTGAAGGGCG AGAAAGTGCT GAACGTGAAA
GAAATGAAAC TTTCCGGGCA GCATAACTAC ACCAATGCAC TGGCAGCTCT GGCACTGGCA
GATGCCGCTG GTTTGCCGCG CGCCAGCAGC CTGAAAGCGT TAACCACATT CACTGGTCTG
CCGCATCGCT TTGAAGTTGT GCTGGAGCAT AACGGTGTGC GCTGGATTAA CGATTCAAAA
GCGACCAACG TCGGCAGTAC GGAAGCGGCA CTGAATGGCC TGCACGTAGA CGGCACGCTG
CATTTGTTGC TGGGCGGCGA TGGTAAATCG GCAGATTTTA GCCCACTGGC GCGTTACCTG
AATGGCGATA ACGTACGTCT GTATTGTTTC GGTCGGGACG GCGCGCAGCT GGCGGCGCTA
CGCCCGGAAG TGGCAGAACA AACCGAAACC ATGGAACAGG CGATGCGCTT GCTGGCTCCG
CGTGTTCAGC CGGGCGATAT GGTTCTGCTC TCCCCGGCCT GTGCCAGCCT TGATCAGTTC
AAGAACTTTG AACAACGAGG CAATGAGTTT GCCCGTCTGG CGAAGGAGTT AGGTTGA
 
Protein sequence
MADYQGKNVV IIGLGLTGLS CVDFFLARGV TPRVMDTRMT PPGLDKLPEA VERHTGGLND 
EWLMAADLIV ASPGIALAHP SLSAAADAGI EIVGDIELFC REAQAPIVAI TGSNGKSTVT
TLVGEMAKAA GVNVGVGGNI GLPALMLLDD ECELYVLELS SFQLETTSSL QAVAATILNV
TEDHMDRYPF GLQQYRAAKL RIYENAKICV VNADDALTMP IRGADERCVS FGVNMGDYHL
NHQQGETWLR VKGEKVLNVK EMKLSGQHNY TNALAALALA DAAGLPRASS LKALTTFTGL
PHRFEVVLEH NGVRWINDSK ATNVGSTEAA LNGLHVDGTL HLLLGGDGKS ADFSPLARYL
NGDNVRLYCF GRDGAQLAAL RPEVAEQTET MEQAMRLLAP RVQPGDMVLL SPACASLDQF
KNFEQRGNEF ARLAKELG