Gene EcolC_3778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3778 
Symbol 
ID6066641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4133309 
End bp4134682 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content55% 
IMG OID641603191 
ProductUDP-N-acetylmuramate 
Protein accessionYP_001726710 
Protein GI170021756 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0773] UDP-N-acetylmuramate-alanine ligase 
TIGRFAM ID[TIGR01081] UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCATTC ATATTTTAGG AATTTGTGGC ACGTTTATGG GCGGTCTGGC GATGCTGGCG 
CGCCAGTTAG GCCATGAAGT AACGGGTTCG GACGCCAATG TGTATCCGCC GATGAGCACC
TTACTTGAGA AGCAAGGCAT TGAACTGATT CAGGGTTACG ATGCCAGCCA GCTCGATCCG
CAGCCGGATC TGGTGATTAT TGGCAACGCC ATGACCCGTG GAAATCCGTG TGTGGAAGCG
GTACTGGAAA AAAACATCCC TTATATGTCA GGTCCACAGT GGCTGCACGA TTTTGTGCTG
CGCGACCGCT GGGTGCTGGC CGTTGCCGGT ACACATGGCA AAACCACCAC CGCGGGAATG
GCGACCTGGA TTCTGGAACA GTGCGGTTAC AAACCGGGAT TTGTGATCGG CGGTGTGCCG
GGGAACTTTG AGGTTTCGGC GCGTCTGGGC GAAAGCAACT TCTTTGTTAT CGAAGCGGAT
GAGTATGACT GCGCCTTCTT CGACAAACGC TCTAAATTTG TTCATTACTG CCCGCGTACG
CTGATCCTCA ACAACCTTGA GTTCGATCAC GCCGATATCT TTGACGACCT GAAAGCGATC
CAGAAACAGT TCCACCATCT GGTGCGTATC GTTCCGGGGC AGGGCCGTAT TATCTGGCCA
GAAAACGACA TCAACCTGAA ACAGACCATG GCGATGGGCT GCTGGAGCGA GCAGGAGCTG
GTGGGCGAGC AAGGTCACTG GCAGGCGAAA AAGCTGACCA CCGATGCTTC CGAATGGGAA
GTCTTGCTGG ATGGCGAAAA AGTGGGCGAA GTGAAATGGT CGCTGGTAGG CGAACATAAT
ATGCACAATG GCCTGATGGC GATTGCGGCG GCTCGCCATG TTGGTGTAGC GCCGGCAGAT
GCCGCTAACG CGCTGGGTTC GTTTATTAAC GCTCGCCGCC GTCTGGAGTT GCGTGGTGAA
GCGAATGGCG TAACGGTATA TGACGATTTT GCCCATCACC CGACGGCGAT TCTGGCAACG
CTTGCGGCGC TGCGTGGCAA AGTTGGCGGT ACGGCGCGCA TTATTGCTGT GCTGGAACCG
CGCTCGAATA CCATGAAAAT GGGGATCTGC AAAGACGATC TGGCACCTTC ATTAGGTCGT
GCCGATGAAG TCTTCCTGCT GCAACCAGCG CATATTCCGT GGCAGGTGGC AGAAGTGGCA
GAAGCCTGCG TTCAGCCTGC ACACTGGAGT GGCGATGTGG ATACGCTGGC AGATATGGTG
GTGAAAACCG CTCAGCCTGG CGACCATATT CTGGTGATGA GCAACGGCGG TTTTGGTGGG
ATCCATCAGA AACTGCTGGA TGGTCTGGCG AAGAAGGCGG AAGCTGCGCA GTAA
 
Protein sequence
MLIHILGICG TFMGGLAMLA RQLGHEVTGS DANVYPPMST LLEKQGIELI QGYDASQLDP 
QPDLVIIGNA MTRGNPCVEA VLEKNIPYMS GPQWLHDFVL RDRWVLAVAG THGKTTTAGM
ATWILEQCGY KPGFVIGGVP GNFEVSARLG ESNFFVIEAD EYDCAFFDKR SKFVHYCPRT
LILNNLEFDH ADIFDDLKAI QKQFHHLVRI VPGQGRIIWP ENDINLKQTM AMGCWSEQEL
VGEQGHWQAK KLTTDASEWE VLLDGEKVGE VKWSLVGEHN MHNGLMAIAA ARHVGVAPAD
AANALGSFIN ARRRLELRGE ANGVTVYDDF AHHPTAILAT LAALRGKVGG TARIIAVLEP
RSNTMKMGIC KDDLAPSLGR ADEVFLLQPA HIPWQVAEVA EACVQPAHWS GDVDTLADMV
VKTAQPGDHI LVMSNGGFGG IHQKLLDGLA KKAEAAQ