Gene ECH74115_0223 details


Gene Information

Locus tag: ECH74115_0223
Symbol: mltD
ID: 6969108
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 236130
End bp: 237350
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 52%
IMG OID: 643384297
Product: membrane-bound lytic murein transglycosylase D
Protein accession: YP_002268814
Protein GI: 209397531
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains)
TIGRFAM ID:
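
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, but both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 236130
end_bp = 237350
gene_length_bp = 1221
protein_length_aa = 406


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# 237350 - 236130 + 1 = 1221 bp
print(span_length(start_bp, end_bp) == gene_length_bp)             # True
# 1221 / 3 = 407 codons, minus one stop codon = 406 aa
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Both checks hold for this record, so the listed coordinates, gene length, and protein length are mutually consistent under these assumptions.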


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGATG GGACGTCTAT CGCGCCAGAT GGTGACTTGT GGGCTTTCAT TGGCGACGAG 
CTAAAGATGG GAATTCCGGA AAATGACCGG ATTCGCGAAC AGAAACAGAA ATATTTACGC
AATAAGAGCT ATCTCCACGA TGTAACTTTA CGGGCAGAGC CGTATATGTA CTGGATAGCC
GGGCAAGTTA AAAAACGTAA CATGCCTATG GAACTGGTAC TACTACCCAT AGTGGAGAGC
GCTTTTGATC CTCACGCAAC GTCTGGCGCC AATGCCGCAG GCATCTGGCA GATCATTCCG
AGCACGGGGC GCAATTATGG TTTGAAACAG ACCCGCAATT ATGACGCGCG TCGCGATGTT
GTTGCTTCAA CAACTGCCGC GCTGAATATG ATGCAGCGTC TGAACAAAAT GTTTGATGGC
GACTGGCTTC TGACCGTAGC GGCTTATAAC AGCGGCGAAG GTCGGGTCAT GAAGGCAATT
AAAACGAACA AAGCGCGTGG GAAATCCACG GACTTCTGGT CGTTACCGTT GCCGCAGGAA
ACGAAGCAGT ACGTGCCTAA AATGCTGGCA TTGAGTGATA TTCTCAAAAA CAGCAAGCGT
TATGGCGTAC GTCTGCCAAC GACCGATGAA AGCCGTGCTC TGGCGCGTGT GCACCTGAGT
AGCCCGGTTG AAATGGCGAA GGTTGCAGAT ATGGCGGGGA TTTCCGTCAG CAAGCTGAAG
ACATTCAACG CTGGCGTGAA AGGCTCCACG CTGGGCGCAA GTGGTCCGCA GTACGTGATG
GTGCCAAAGA AGCATGCAGA TCAACTGCGT GAATCTCTGG CTTCAGGCGA AATTGCTGCT
GTACAGTCGA CGCTGGTTGC CGACAATACG CCGCTTAACA GCCGTGTTTA CACCGTACGC
TCTGGCGACA CGCTTTCAAG TATCGCTTCA CGTCTCGGCG TAAGCACCAA AGATTTGCAG
CAGTGGAACA AACTGCGCGG CTCTAAGCTG AAACCAGGCC AAAGTCTGAC GATTGGCGCA
GGTAGTAGCG CACAGCGGTT GGCAAACAAC AGCGATAGCA TTACGTATCG TGTGCGCAAA
GGCGATTCGC TTTCAAGCAT TGCTAAACGC CACGGCGTGA ACATCAAAGA TGTGATGCGC
TGGAACAGCG ATACTGCGAA TCTGCAACCA GGCGATAAGC TGACGTTGTT TGTGAAAAAC
AACAGCATGC CAGACTCCTG A
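
A GC content figure such as the one reported for this gene is usually the percentage of G and C bases in the sequence. The sketch below shows one way to compute it from a listing like the one above, ignoring the spaces and line breaks between blocks. The function name `gc_content` is hypothetical, and how the database itself rounds the value is not stated in the record.

```python
def gc_content(seq):
    """Percentage of G and C among the A/C/G/T characters of seq."""
    # Keep only nucleotide letters so spaces and newlines are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, with the space kept as in the listing.
fragment = "ATGGACGATG GGACGTCTAT"
print(round(gc_content(fragment), 1))
```

Passing the full 1221 bp sequence in place of `fragment` gives the value to compare against the GC content field.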
 
Protein sequence
MDDGTSIAPD GDLWAFIGDE LKMGIPENDR IREQKQKYLR NKSYLHDVTL RAEPYMYWIA 
GQVKKRNMPM ELVLLPIVES AFDPHATSGA NAAGIWQIIP STGRNYGLKQ TRNYDARRDV
VASTTAALNM MQRLNKMFDG DWLLTVAAYN SGEGRVMKAI KTNKARGKST DFWSLPLPQE
TKQYVPKMLA LSDILKNSKR YGVRLPTTDE SRALARVHLS SPVEMAKVAD MAGISVSKLK
TFNAGVKGST LGASGPQYVM VPKKHADQLR ESLASGEIAA VQSTLVADNT PLNSRVYTVR
SGDTLSSIAS RLGVSTKDLQ QWNKLRGSKL KPGQSLTIGA GSSAQRLANN SDSITYRVRK
GDSLSSIAKR HGVNIKDVMR WNSDTANLQP GDKLTLFVKN NSMPDS
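
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline. It builds the codon table by hand and relies on the fact that translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acid to each codon as the standard code, differing in the start codons it permits. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used in the listing.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGGACGATG GGACGTCTAT CGCGCCAGAT GGTGACTTGT GGGCTTTCAT TGGCGACGAG"
print(translate(first_line))  # MDDGTSIAPDGDLWAFIGDE
```

The output matches the first 20 residues of the protein sequence. The sketch does not handle ambiguous bases such as N, which do not occur in this record.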