Gene EcSMS35_3106 details


Gene Information

Locus tag: EcSMS35_3106
Symbol: mltC
ID: 6143122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3191239
End bp: 3192318
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 51%
IMG OID: 641617974
Product: murein transglycosylase C
Protein accession: YP_001745125
Protein GI: 170683301
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains)
TIGRFAM ID:
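
The coordinate and length fields above are consistent with each other, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in Protein Length; the function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 3191239
END_BP = 3192318


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1080
print(protein_length(length_bp))  # 359
```

Both results match the Gene Length (1080 bp) and Protein Length (359 aa) fields of the record.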


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.00368993
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAAAT ATCTCGCGCT GGCTTTGATT GCGCCGTTGC TCATCTCCTG TTCGACGACC 
AAAAAAGGCG ATACCTATAA CGAAGCCTGG GTCAAAGACA CCAACGGTTT TGATATTCTG
ATGGGGCAAT TTGCCCACAA TATTGAGAAC ATCTGGGGAT TCAAAGAGGT GGTGATCGCC
GGTCCTAAGG ACTACGTGAA ATACACCGAT CAATATCAGA CCCGCAGCCA CATCAACTTC
GATGACGGTA CGATTACTAT CGAAACCATC GCCGGGACAG AACCTGCCGC GCATCTGCGC
CGGGCAATTA TCAAAACGTT GCTGATGGGT GACGATCCGA GTTCGGTCGA TCTCTATTCC
GACGTTGATG ACATTACGAT TTCGAAAGAA CCTTTCCTTT ACGGTCAGGT GGTGGACAAC
ACCGGGCAGC CGATTCGCTG GGAAGGTCGC GCGAGCAACT TCGCGGATTA TCTGCTGAAA
AACCGTCTGA AAAGCCGTAG CAACGGGCTA CGAATCATCT ATAGCGTCAC CATTAACATG
GTGCCAAACC ACCTTGATAA ACGTGCGCAC AAATATCTCG GCATGGTCCG CCAGGCGTCA
CGGAAATATG GCGTTGATGA GTCGCTGATT CTGGCGATTA TGCAGACCGA GTCATCCTTT
AACCCGTATG CGGTCAGCCG TTCCGATGCG CTGGGATTAA TGCAGGTGGT ACAACATACT
GCCGGGAAAG ATGTGTTCCG CTCGCAGGGG AAATCCGGCA CGCCGAGCCG CAGTTTCTTG
TTTGATCCTG CCAGCAATAT TGATATCGGC ACCGCGTATC TGGCGATGCT GAACAATGTT
TATCTCGGCG GAATTGATAA CCCAACGTCG CGGCGTTATG CCGTCATCAC CGCCTATAAC
GGCGGTGCAG GCAGCGTGCT GCGAGTCTTT TCGAATGACA AGATTCAGGC GGCCAATATT
ATTAACACCA TGACGCCGGG CGATGTTTAT CAAACGCTGA CGACCCGCCA TCCCTCTGCG
GAATCTCGCC GTTATCTTTA TAAAGTGAAT ACCGCGCAAA AATCCTACCG CCGCCGATAA
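
The gene sequence is listed in blocks of 10 bases, 60 per line, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage as (G + C) / length. The helper names are hypothetical, and how the record's GC content value is computed or rounded is an assumption, not something the page states. Only the first line of the listing is used here as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGAAAAAAT ATCTCGCGCT GGCTTTGATT GCGCCGTTGC TCATCTCCTG TTCGACGACC"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))
```

Passing all 18 lines of the listing through `clean_sequence` should yield a 1080-base string that can be compared against the GC content field.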
 
Protein sequence
MKKYLALALI APLLISCSTT KKGDTYNEAW VKDTNGFDIL MGQFAHNIEN IWGFKEVVIA 
GPKDYVKYTD QYQTRSHINF DDGTITIETI AGTEPAAHLR RAIIKTLLMG DDPSSVDLYS
DVDDITISKE PFLYGQVVDN TGQPIRWEGR ASNFADYLLK NRLKSRSNGL RIIYSVTINM
VPNHLDKRAH KYLGMVRQAS RKYGVDESLI LAIMQTESSF NPYAVSRSDA LGLMQVVQHT
AGKDVFRSQG KSGTPSRSFL FDPASNIDIG TAYLAMLNNV YLGGIDNPTS RRYAVITAYN
GGAGSVLRVF SNDKIQAANI INTMTPGDVY QTLTTRHPSA ESRRYLYKVN TAQKSYRRR
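
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. A minimal sketch follows, assuming the gene is listed in its coding orientation. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so the standard compact codon string is used. The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino-acid string;
# translation table 11 uses the same codon-to-residue assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text):
    """Translate a nucleotide listing in frame 1 up to the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGAAAAAAT ATCTCGCGCT GGCTTTGATT GCGCCGTTGC TCATCTCCTG TTCGACGACC"
print(translate(first_line))  # MKKYLALALIAPLLISCSTT
```

The output matches the first 20 residues of the protein sequence listed above.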