Gene EcolC_3572 details


Gene Information

Locus tag: EcolC_3572
Symbol: murE
ID: 6064791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3904928
End bp: 3906415
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 57%
IMG OID: 641602989
Product: UDP-N-acetylmuramoylalanyl-D-glutamate--2,6-diaminopimelate ligase
Protein accession: YP_001726513
Protein GI: 170021559
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0769] UDP-N-acetylmuramyl tripeptide synthase
TIGRFAM ID: [TIGR01085] UDP-N-acetylmuramyl-tripeptide synthetase
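
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with each other. A minimal sketch of that arithmetic follows, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3904928, 3906415)
print(length, protein_length_aa(length))  # prints "1488 495"
```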


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0030111
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGCAGATC GTAATTTGCG CGACCTTCTT GCTCCGTGGG TGCCAGACGC ACCTTCGCGA 
GCACTGCGAG AGATGACACT CGACAGCCGT GTGGCTGCGG CGGGCGATCT CTTTGTAGCT
GTAGTAGGTC ATCAGGCGGA CGGGCGTCGA TATATCCCGC AGGCGATAGC GCAAGGTGTG
GCTGCCATTA TTGCAGAGGC GAAAGATGAG GCGACCGATG GTGAAATCCG TGAAATGCAC
GGTGTACCGG TCATCTATCT CAGCCAGCTC AACGAGCGTT TATCTGCACT GGCGGGCCGC
TTTTACCATG AACCCTCTGA CAATTTACGT CTCGTGGGCG TAACGGGCAC CAACGGCAAA
ACCACGACTA CCCAGCTGTT GGCGCAGTGG AGCCAACTGC TTGGCGAAAC CAGCGCGGTA
ATGGGCACCG TTGGTAACGG CCTGCTGGGG AAAGTGATCC CGACAGAAAA TACAACCGGT
TCGGCAGTCG ATGTTCAGCA TGAGCTGGCG GGGCTGGTGG ATCAGGGCGC GACGTTTTGC
GCAATGGAAG TTTCCTCCCA CGGGCTGGTA CAGCACCGTG TGGCGGCATT GAAATTTGCG
GCGTCGGTCT TTACCAACTT AAGCCGCGAT CACCTTGATT ATCATGGTGA TATGGAACAC
TACGAAGCCG CGAAATGGCT GCTTTATTCT GAGCATCATT GCGGTCAGGC GATTATTAAC
GCCGACGATG AAGTGGGCCG CCGCTGGCTG GCAAAACTGC CGGACGCGGT TGCGGTATCA
ATGGAAGATC ATATTAATCC GAACTGTCAC GGACGCTGGT TAAAAGCGAT CGACGTGAAC
TATCACGACA GCGGTGCGAC GATTCGCTTT AGCTCAAGTT GGGGCGATGG CGAAATTGAA
AGCCATCTGA TGGGCGCTTT TAACGTCAGC AACCTGCTGC TCGCGCTGGC GACACTGTTG
GCACTCGGCT ATCCATTGGC TGATCTGTTG AAAACCGCCG CGCGTCTGCA ACCGGTTTGC
GGACGTATGG AAGTGTTCAC TGCGCCAGGC AAACCGACGG TGGTGGTGGA TTACGCGCAT
ACGCCGGATG CGCTGGAAAA AGCCTTACAG GCGGCGCGGC TGCACTGTGC GGGCAAGCTG
TGGTGCGTCT TTGGCTGTGG TGGCGATCGC GATAAAGGTA AGCGTCCACT GATGGGCGCA
ATTGCCGAAG AGTTTGCTGA CGTGGCGGTG GTGACGGACG ATAACCCGCG TACCGAAGAA
CCGCGTGCCA TCATCAACGA TATTCTGGCG GGAATGTTAG ATGCCGGACA TGCCAAAGTG
ATGGAAGGCC GTGCTGAAGC GGTGACTTGC GCCGTTATGC AGGCTAAAGA GAATGATGTG
GTACTGGTCG CGGGCAAAGG CCATGAGGAT TACCAGATTG TTGGCAATCA GCGCCTGGAC
TACTCCGATC GCGTCACGGT GGCGCGTCTG CTGGGGGTGA TTGCATGA
 
Protein sequence
MADRNLRDLL APWVPDAPSR ALREMTLDSR VAAAGDLFVA VVGHQADGRR YIPQAIAQGV 
AAIIAEAKDE ATDGEIREMH GVPVIYLSQL NERLSALAGR FYHEPSDNLR LVGVTGTNGK
TTTTQLLAQW SQLLGETSAV MGTVGNGLLG KVIPTENTTG SAVDVQHELA GLVDQGATFC
AMEVSSHGLV QHRVAALKFA ASVFTNLSRD HLDYHGDMEH YEAAKWLLYS EHHCGQAIIN
ADDEVGRRWL AKLPDAVAVS MEDHINPNCH GRWLKAIDVN YHDSGATIRF SSSWGDGEIE
SHLMGAFNVS NLLLALATLL ALGYPLADLL KTAARLQPVC GRMEVFTAPG KPTVVVDYAH
TPDALEKALQ AARLHCAGKL WCVFGCGGDR DKGKRPLMGA IAEEFADVAV VTDDNPRTEE
PRAIINDILA GMLDAGHAKV MEGRAEAVTC AVMQAKENDV VLVAGKGHED YQIVGNQRLD
YSDRVTVARL LGVIA
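
The gene sequence begins with GTG while the protein sequence begins with M, which is what translation table 11 gives for an alternative initiator codon. The sketch below shows how the two blocks relate, using a hand-built codon table rather than any particular library. The helper names (`clean`, `translate_cds`, `gc_percent`) are hypothetical, and the start-codon set is an assumption limited to the three most common bacterial initiators, not the full table 11 list.

```python
# Standard amino-acid assignments in NCBI base order T, C, A, G.
# Table 11 uses the same assignments; it differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only the common bacterial initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initiator codon as M and stopping at '*'."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon: end of the protein
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate_cds("GTGGCAGATC GTAATTTGCG CGACCTTCTT"))  # prints "MADRNLRDLL"
```

Passing the full gene sequence block to `translate_cds` should reproduce the protein sequence block, and `gc_percent` on the same input can be compared against the GC content field.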