Gene EcolC_4048 details


Gene Information

Locus tag: EcolC_4048
Symbol: murB
ID: 6065008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4459244
End bp: 4460272
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 46%
IMG OID: 641603467
Product: UDP-N-acetylenolpyruvoylglucosamine reductase
Protein accession: YP_001726974
Protein GI: 170022020
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0812] UDP-N-acetylmuramate dehydrogenase
TIGRFAM ID: [TIGR00179] UDP-N-acetylenolpyruvoylglucosamine reductase
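
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration that assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; both assumptions fit the listed values but are not stated by the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4459244, 4460272)
print(length, protein_length(length))  # 1029 342
```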


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0452163
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00307055
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACCACT CCTTAAAACC CTGGAACACA TTTGGCATTG ATCATAATGC TCAGCACATT 
GTATGTGCCG AAGACGAACA ACAATTACTC AATGCCTGGC AGCATGCAAC CGCAGAAGGA
CAACCCGTTC TTATTCTGGG TGAAGGAAGT AATGTACTTT TTCTGGAAGA CTATCGCGGC
ACGGTGATCA TCAACCGGAT CAAAGGTATC GAAATTCATG ATGAACCTGA TGCGTGGTAT
TTACATGTAG GAGCCGGAGA AAACTGGCAT CGCCTGGTAA AATACACTTT GCAGGAAGGT
ATGCCTGGTC TGGAAAATCT GGCATTAATT CCTGGTTGTG TCGGCTCATC ACCTATCCAG
AATATTGGTG CTTATGGCGT AGAATTACAG CGAGTTTGCG CTTATGTTGA TTGTGTTGAA
CTGGCGACAG GCAAGCAAGT GCGCTTAACT GCCAAAGAGT GCCGTTTTGG CTATCGCGAC
AGTATTTTTA AACATGAATA CCAGGATCGC TTCGCCATTG TAGCCGTAGG TCTGCGTCTG
CCAAAAGAGT GGCAACCTGT ACTAACGTAT GGTGACTTAA CTCGTCTGGA TCCTACAACA
GTAACGCCAC AGCAAGTATT TAATGCGGTG TGTCATATGC GCACCACCAA ACTCCCTGAT
CCAAAAGTGA ATGGCAATGC CGGTAGTTTC TTCAAAAACC CTGTTGTATC TGCCGAAACG
GCTGAAGCAT TACTGTCACA ATTTCCAACA GCACCAAATT ACCCCCAGGC GGATGGTTCA
GTAAAACTGG CAGCAGGTTG GCTTATTGAT CAGTGCCAGC TAAAAGGGAT GCAAATGGGT
GGGGCTGCGG TGCACCGTCA ACAGGCGTTA GTTCTCATTA ATGAAGACAA TGCAAAAAGC
GAAGATGTGG TGCAACTGGC ACACCATGTA AGACAAAAAG TGGGTGAAAA ATTTAATGTC
TGGCTTGAGC CTGAAGTCCG CTTTATTGGT GCATCAGGTG AAGTGAGCGC AGTGGAGACA
ATTTCATGA
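
The gene sequence above is laid out in blocks of ten bases, so it needs to be joined before any analysis. The sketch below shows one way to strip the layout whitespace and compute a GC fraction that can be compared with the GC content field of the record. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGAACCACT CCTTAAAACC CTGGAACACA TTTGGCATTG ATCATAATGC TCAGCACATT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence text instead of first_line gives the value for the whole gene.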
 
Protein sequence
MNHSLKPWNT FGIDHNAQHI VCAEDEQQLL NAWQHATAEG QPVLILGEGS NVLFLEDYRG 
TVIINRIKGI EIHDEPDAWY LHVGAGENWH RLVKYTLQEG MPGLENLALI PGCVGSSPIQ
NIGAYGVELQ RVCAYVDCVE LATGKQVRLT AKECRFGYRD SIFKHEYQDR FAIVAVGLRL
PKEWQPVLTY GDLTRLDPTT VTPQQVFNAV CHMRTTKLPD PKVNGNAGSF FKNPVVSAET
AEALLSQFPT APNYPQADGS VKLAAGWLID QCQLKGMQMG GAAVHRQQAL VLINEDNAKS
EDVVQLAHHV RQKVGEKFNV WLEPEVRFIG ASGEVSAVET IS
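
The protein sequence above can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code; table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG. The names translate and CODON_TABLE are hypothetical, and the code does not handle ambiguous bases or alternative start codons.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# The first ten codons of the gene give the first ten residues of the protein.
print(translate("ATGAACCACT CCTTAAAACC CTGGAACACA"))  # MNHSLKPWNT
```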