Gene ECH74115_5437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5437 
SymbolmurB 
ID6972277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5084187 
End bp5085215 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content47% 
IMG OID643389087 
ProductUDP-N-acetylenolpyruvoylglucosamine reductase 
Protein accessionYP_002273492 
Protein GI209396487 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0812] UDP-N-acetylmuramate dehydrogenase 
TIGRFAM ID[TIGR00179] UDP-N-acetylenolpyruvoylglucosamine reductase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00119493 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.00675039 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACCACT CCTTAAAACC CTGGAACACA TTTGGCATTG ATCATAATGC TCAGCACATT 
GTATGTGCCG AAGACGAACA ACAACTACTC AATGCCTGGC AGCATGCAAC CGCAGAAGGG
CAACCCGTTC TTATTCTGGG TGAAGGAAGT AATGTACTTT TTCTGGAAGA CTATCGCGGC
ACGGTGATCA TCAACCGGAT CAAAGGTATC GAAATTCATG ATGAACCTGA TGCGTGGTAT
TTACATGTAG GAGCCGGAGA AAACTGGCAT CGCCTGGTAA AATACACTTT GCAGGAAGGT
ATGCCTGGTC TGGAAAATCT GGCATTAATT CCTGGTTGTG TCGGCTCATC ACCTATCCAG
AATATTGGTG CTTATGGCGT AGAATTACAG CGAGTTTGCG CTTATGTTGA TTGTGTTGAA
CTGGCGACAG GCAAGCAAGT GCGCTTAACT GCCAAAGAGT GCCGTTTTGG CTATCGCGAC
AGTATTTTTA AACATGAATA CCAGGACCGC TTCGCCATTG TAGCCGTAGG TCTGCGTCTG
CCAAAAGAGT GGCAACCTGT ACTAACGTAT GGTGACTTAA CTCGTCTGGA TCCTACAACA
GTAACGCCAC AGCAAGTATT TGATGCGGTG TGTCATATGC GCACCACCAA ACTCCCTGAT
CCAAAAGTGA ATGGCAATGC CGGTAGTTTC TTCAAAAACC CTGTTGTATC TGCCGAAACG
GCTAAAGCAT TACTGTCACA ATTTCCAACA GCACCAAATT ACCCCCAGGC GGATGGTTCA
GTAAAACTGG CAGCAGGTTG GCTTATCGAT CAGTGCCAGC TAAAAGGGAT GCAAATGGGT
GGGGTTGCGG TGCACCGTCA ACAGGCGTTA GTTCTCATTA ATGAAGACAA TGCAAAAAGC
GAAGATGTGG TGCAACTGGC ACACCATGTA AGACAAAAAG TGGGTGAAAA ATTTAATGTC
TGGCTTGAGC CTGAAGTCCG CTTTATTGGT GCATCAGGTG AAGTGAGCGC AGTGGAGACA
ATTTCATGA
 
Protein sequence
MNHSLKPWNT FGIDHNAQHI VCAEDEQQLL NAWQHATAEG QPVLILGEGS NVLFLEDYRG 
TVIINRIKGI EIHDEPDAWY LHVGAGENWH RLVKYTLQEG MPGLENLALI PGCVGSSPIQ
NIGAYGVELQ RVCAYVDCVE LATGKQVRLT AKECRFGYRD SIFKHEYQDR FAIVAVGLRL
PKEWQPVLTY GDLTRLDPTT VTPQQVFDAV CHMRTTKLPD PKVNGNAGSF FKNPVVSAET
AKALLSQFPT APNYPQADGS VKLAAGWLID QCQLKGMQMG GVAVHRQQAL VLINEDNAKS
EDVVQLAHHV RQKVGEKFNV WLEPEVRFIG ASGEVSAVET IS