Gene EcSMS35_0225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0225 
SymbolmltD 
ID6144705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp241941 
End bp243161 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content52% 
IMG OID641615124 
Productmembrane-bound lytic murein transglycosylase D 
Protein accessionYP_001742334 
Protein GI170681258 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.459174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGATG GGACGTCTAT CGCGCCAGAT GGTGACTTGT GGGCTTTCAT TGGCGACGAG 
CTAAAGATGG GAATTCCGGA AAACGACCGG ATTCGCGAAC AGAAACAGAA ATATTTACGC
AATAAGAGCT ATCTCCACGA TGTAACTTTA CGGGCAGAGC CGTATATGTA CTGGATAGCC
GGGCAAGTTA AAAAACGTAA CATGCCTATG GAACTGGTAC TACTACCCAT AGTGGAGAGC
GCTTTTGATC CTCACGCAAC GTCTGGCGCC AATGCCGCGG GCATCTGGCA GATCATTCCG
AGCACGGGGC GCAATTATGG TTTGAAACAG ACCCGCAATT ATGACGCGCG TCGCGATGTT
GTTGCTTCAA CAACTGCCGC GCTGAACATG ATGCAGCGTC TGAACAAGAT GTTTGACGGC
GACTGGCTTC TGACCGTAGC GGCTTATAAC AGCGGCGAAG GTCGGGTCAT GAAGGCAATT
AAAACGAACA AAGCGCGTGG GAAATCCACG GACTTCTGGT CGTTACCGTT GCCGCAGGAA
ACGAAGCAGT ACGTGCCTAA AATGCTGGCA TTGAGTGATA TTCTCAAAAA CAGCAAGCGT
TATGGCGTAC GTCTGCCAAC GACCGATGAA AGCCGTGCTC TGGCGCGTGT GCACCTGAGC
AGCCCGGTTG AAATGGCGAA GGTTGCAGAT ATGGCGGGGA TTTCCGTCAG CAAGCTGAAG
ACATTCAACG CTGGCGTGAA AGGCTCCACG CTGGGCGCAA GTGGTCCGCA GTACGTGATG
GTGCCAAAGA AGCATGCAGA TCAACTGCGT GAATCTCTGG CTTCAGGCGA AATTGCTGCT
GTACAGTCGA CGCTGGTTGC CGACAATACG CCGCTTAACA GCCGTGTTTA CACCGTACGC
TCTGGCGACA CGCTTTCAAG TATCGCTTCA CGTCTCGGCG TAAGCACCAA AGATTTGCAG
CAGTGGAACA AACTGCGCGG ATCTAAGCTG AAGCCAGGCC AAAGTTTGAC GATTGGTGCA
GGCAGTAGCG CACAGCGACT GGCAAACAAC AGCGATAGCA TTACGTATCG TGTGCGCAAA
GGCGATTCGC TTTCAAGCAT TGCTAAACGC CACGGCGTGA ACATCAAAGA TGTAATGCGC
TGGAACAGCG ATACTGCGAA TCTGCAACCA GGCGATAAGC TGACGTTGTT TGTGAAAAAC
AACAGCATGC CAGACTCCTG A
 
Protein sequence
MDDGTSIAPD GDLWAFIGDE LKMGIPENDR IREQKQKYLR NKSYLHDVTL RAEPYMYWIA 
GQVKKRNMPM ELVLLPIVES AFDPHATSGA NAAGIWQIIP STGRNYGLKQ TRNYDARRDV
VASTTAALNM MQRLNKMFDG DWLLTVAAYN SGEGRVMKAI KTNKARGKST DFWSLPLPQE
TKQYVPKMLA LSDILKNSKR YGVRLPTTDE SRALARVHLS SPVEMAKVAD MAGISVSKLK
TFNAGVKGST LGASGPQYVM VPKKHADQLR ESLASGEIAA VQSTLVADNT PLNSRVYTVR
SGDTLSSIAS RLGVSTKDLQ QWNKLRGSKL KPGQSLTIGA GSSAQRLANN SDSITYRVRK
GDSLSSIAKR HGVNIKDVMR WNSDTANLQP GDKLTLFVKN NSMPDS