Gene EcSMS35_4943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4943 
Symbolslt 
ID6144401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp5057620 
End bp5059557 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content54% 
IMG OID641619746 
Productlytic murein transglycosylase 
Protein accessionYP_001746850 
Protein GI170679874 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.996707 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAAAAG CCAAACACGT TACCTGGCGG CTGTTGGCTG TCGGTGTCTG TCTGCTGACG 
GTCAGCAGCG TGGCGCGAGC CGACTCACTG GATGAGCAGC GTAGCCGTTA CGCGCAAATT
AAGCAGGCCT GGGATAATCG ACAAATGGAT GTGGTCGAAC AAATGATGCC TGGATTGAAG
GATTATCCAC TTTATCCCTA TCTGGAATAC CGCCAGATCA CCGATGACCT GATGAATCAA
CCGGCGGTGA CGGTCACTAA CTTTGTTCGC GCTAACCCCA CGCTTCCTCC CGCTCGCACG
CTGCAATCTC GTTTCGTCAA TGAACTGGCG CGGCGTGAAG ACTGGCGTGG CTTGTTAGCC
TTTAGCCCGG AAAAACCCGG AACTACCGAA GCGCAATGTA ATTACTACTA TGCGAAATGG
AACACCGGGC AGAGTGAAGA AGCCTGGCAA GGGGCGAAAG AGCTGTGGCT AACCGGCAAG
AGCCAGCCTA ACGCCTGTGA CAAGTTATTT AGCGTCTGGC GTGCGTCAGG TAAACAAGAT
CCGCTGGCGT ATTTAGAGCG TATCCGTCTG GCGATGAAAG CGGGTAACAC CGGCCTGGTA
ACAGTGCTGG CAGGGCAGAT GCCTGCCGAT TACCAGACTA TCGCTTCGGC AATCATTTCA
CTGGCGAACA ACCCTAATAC GGTACTGACC TTCGCGCGTA CAACCGGCGC GACCGATTTT
ACCCGTCAAA TGGCGGCGGT GGCGTTTGCC AGCGTGGCGC GGCAGGATGC TGAAAATGCA
CGGCTGATGA TTCCATCGCT TGCTCAGGCA CAGCAGCTTA ATGAAGATCA GATTCAGGAA
CTGCGCGATA TCGTCGCCTG GCGTTTGATG GGCAACGATG TCACCGACGA ACAGGCGAAA
TGGCGTGATG ACGCCATTAT GCGTTCGCAA TCTACTTCGC TTATTGAACG CCGTGTGCGA
ATGGCGCTTG GCACCGGCGA TCGTCGCGGC CTGAATACCT GGCTGGCGCG TCTGCCGATG
GAGGCGAAAG AGAAAGATGA ATGGCGTTAC TGGCAGGCGG ATTTACTGCT TGAACGCGGA
CGTGAAGCTG AAGCAAAAGA GATTTTGCAT CAACTCATGC AACAGCGTGG TTTCTACCCG
ATGGTTGCTG CACAACGCAT CGGCGAAGAG TATGAGCTGA AGATTGATAA AGCGCCGCAG
AATGTTGACA GCGCCCTGAC TCAGGGGCCG GAGATGGCGC GCGTGCGCGA GTTGATGTAC
TGGAATCTCG ATAACACCGC GCGTAGCGAG TGGGCCAATC TGGTGAAGAG CAAGTCAAAA
ACAGAGCAGG CGCAACTGGC GCGGTATGCT TTCAATAACC AATGGTGGGA TCTCAGCGTT
CAGGCAACGA TCGCCGGGAA GCTGTGGGAT CATCTGGAAG AGAGATTCCC GCTGGCTTAC
AACGATCTTT TCAAACGTTA CACCAGTGGG AAGGAGATCC CGCAAAGCTA TGCGATGGCG
ATTGCCCGTC AGGAGAGTGC CTGGAATCCG AAAGTGAAAT CGCCGGTAGG AGCCAGCGGC
CTGATGCAGA TTATGCCGGG TACAGCGACC CATACGGTGA AGATGTTCTC TATTCCCGGT
TATAGCAGCC CTGGGCAATT GCTGGATCCG GAAACAAACA TCAACATTGG CACCAGTTAC
CTGCAATATG TTTATCAGCA GTTTGGCAAT AACCGTATTT TCTCCTCAGC AGCTTATAAC
GCCGGACCAG GGCGGGTGCG AACCTGGCTT GGCAACAGCG CAGGGCGTAT CGACGCAGTG
GCATTTGTCG AGAGTATTCC ATTCTCCGAG ACGCGCGGTT ATGTGAAAAA CGTGCTGGCT
TATGACGCTT ACTACCGCTA TTTCATGGGG GATAAACCGA CGTTGATGAG CGCCACGGAA
TGGGGACGTC GTTACTGA
 
Protein sequence
MEKAKHVTWR LLAVGVCLLT VSSVARADSL DEQRSRYAQI KQAWDNRQMD VVEQMMPGLK 
DYPLYPYLEY RQITDDLMNQ PAVTVTNFVR ANPTLPPART LQSRFVNELA RREDWRGLLA
FSPEKPGTTE AQCNYYYAKW NTGQSEEAWQ GAKELWLTGK SQPNACDKLF SVWRASGKQD
PLAYLERIRL AMKAGNTGLV TVLAGQMPAD YQTIASAIIS LANNPNTVLT FARTTGATDF
TRQMAAVAFA SVARQDAENA RLMIPSLAQA QQLNEDQIQE LRDIVAWRLM GNDVTDEQAK
WRDDAIMRSQ STSLIERRVR MALGTGDRRG LNTWLARLPM EAKEKDEWRY WQADLLLERG
REAEAKEILH QLMQQRGFYP MVAAQRIGEE YELKIDKAPQ NVDSALTQGP EMARVRELMY
WNLDNTARSE WANLVKSKSK TEQAQLARYA FNNQWWDLSV QATIAGKLWD HLEERFPLAY
NDLFKRYTSG KEIPQSYAMA IARQESAWNP KVKSPVGASG LMQIMPGTAT HTVKMFSIPG
YSSPGQLLDP ETNINIGTSY LQYVYQQFGN NRIFSSAAYN AGPGRVRTWL GNSAGRIDAV
AFVESIPFSE TRGYVKNVLA YDAYYRYFMG DKPTLMSATE WGRRY