Gene EcSMS35_0090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0090 
SymbolmurE 
ID6145782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp99799 
End bp101286 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content57% 
IMG OID641614991 
ProductUDP-N-acetylmuramoylalanyl-D-glutamate--2, 6-diaminopimelate ligase 
Protein accessionYP_001742207 
Protein GI170682313 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0769] UDP-N-acetylmuramyl tripeptide synthase 
TIGRFAM ID[TIGR01085] UDP-N-acetylmuramyl-tripeptide synthetase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.764756 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGATC GTAATTTGCG CGACCTTCTT GCTCCGTGGG TGCCAGACGC ACCTTCGCGA 
GCACTGCGAG AGATGACACT CGACAGCCGT GTGGCTGCGG CGGGCGATCT CTTTGTAGCT
GTAGTAGGTC ATCAGGCGGA CGGGCGTCGA TATATCCCGC AGGCGATAGC GCAAGGTGTG
GCTGCCATTA TTGCAGAGGC GAAAGATGAG GCGACCGACG GTGAAATCCG TGAAATGCAC
GGCGTACCGG TCATCTATCT CAGCCAGCTC AACGAGCGTT TATCTGCACT GGCGGGCCGC
TTTTACCATG AACCCTCTGA TAATTTACGT CTTGTAGGCG TAACGGGCAC CAACGGCAAA
ACCACGACGA CCCAGCTGCT GGCACAGTGG AGCCAACTGC TTGGCGAAAC CAGCGCGGTA
ATGGGCACCG TTGGTAACGG CCTGCTGGGG AAAGTGATCC CGACAGAAAA TACCACCGGT
TCGGCAGTCG ATGTTCAGCA TGAGCTGGCG GGGCTGGTGG ATCAGGGCGC GACTTTTTGC
GCAATGGAAG TCTCTTCCCA CGGGCTGGTA CAGCATCGCG TGGCGGCGCT GAAATTTGCC
GCATCGGTCT TTACCAACTT AAGCCGCGAT CACCTTGATT ATCATGGTGA TATGGAACAC
TACGAAGCCG CGAAATGGCT GCTTTATTCT GAACATCGTT GCGGTCAGGC GATTATTAAC
GCCGACGATG AAGTGGGCCG CCGCTGGCTG GCAAAACTGC CGGACGCGGT TGCGGTATCA
ATGGAAGATC ATATCAATCT GAACTGTCAC GGACGCTGGT TGAAAGCGAC CGAAGTGAAC
TATCACGACA GCGGTGCGAC GATTCGCTTT AGCTCAAGCT GGGGCGATGG CGAAATTGAA
AGCCATCTGA TGGGCGCTTT TAACGTCAGC AACCTGCTGC TTGCGCTGGC GACACTGCTG
GCACTCGGCT ACCCGCTCGC TGATCTGTTA AAAACCGCCG CGCGTCTGCA ACCGGTTTGC
GGACGTATGG AAGTGTTCAC TGCGCCAGGC AAACCGACGG TGGTGGTGGA TTACGCGCAT
ACGCCGGATG CACTGGAAAA AGCCTTACAG GCGGCGCGCC TGCACTGTGC GGGCAAGCTG
TGGTGCGTCT TTGGCTGTGG TGGCGATCGT GATAAAGGCA AACGTCCACT GATGGGCGCA
ATTGCTGAAG AGTTTGCTGA CGTGGCGGTG GTAACGGACG ATAACCCGCG TACCGAAGAA
CCGCGTGCCA TCATCAACGA TATTCTGGCG GGAATGTTAG ATGCCGGACA TGCCAAAGTG
ATGGAAGGCC GTGCTGAAGC GGTGACTTGC GCCGTTATGC AGGCTAAAGA GAATGATGTG
GTACTGGTCG CGGGCAAAGG TCATGAGGAT TACCAGATTG TTGGCAATCA GCGTCTGGAC
TACTCCGATC GCGTCACGGT GGCGCGTCTG CTGGGGGTGA TTGCATGA
 
Protein sequence
MADRNLRDLL APWVPDAPSR ALREMTLDSR VAAAGDLFVA VVGHQADGRR YIPQAIAQGV 
AAIIAEAKDE ATDGEIREMH GVPVIYLSQL NERLSALAGR FYHEPSDNLR LVGVTGTNGK
TTTTQLLAQW SQLLGETSAV MGTVGNGLLG KVIPTENTTG SAVDVQHELA GLVDQGATFC
AMEVSSHGLV QHRVAALKFA ASVFTNLSRD HLDYHGDMEH YEAAKWLLYS EHRCGQAIIN
ADDEVGRRWL AKLPDAVAVS MEDHINLNCH GRWLKATEVN YHDSGATIRF SSSWGDGEIE
SHLMGAFNVS NLLLALATLL ALGYPLADLL KTAARLQPVC GRMEVFTAPG KPTVVVDYAH
TPDALEKALQ AARLHCAGKL WCVFGCGGDR DKGKRPLMGA IAEEFADVAV VTDDNPRTEE
PRAIINDILA GMLDAGHAKV MEGRAEAVTC AVMQAKENDV VLVAGKGHED YQIVGNQRLD
YSDRVTVARL LGVIA