Gene EcSMS35_4420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4420 
SymbolmurB 
ID6143730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4517629 
End bp4518657 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content46% 
IMG OID641619240 
ProductUDP-N-acetylenolpyruvoylglucosamine reductase 
Protein accessionYP_001746360 
Protein GI170684112 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0812] UDP-N-acetylmuramate dehydrogenase 
TIGRFAM ID[TIGR00179] UDP-N-acetylenolpyruvoylglucosamine reductase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000273523 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.000726399 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACCACT CCTTAAAACC CTGGAACACA TTTGGCATTG ATCATAATGC TCAGCACATT 
GTATGTGCCG AAGACGAACA ACAACTACTC AATGCCTGGC AGCATGCAAC CGCAAAAGGA
CAATCCGTTC TTATTCTGGG TGAAGGAAGT AATGTACTTT TTCTGGAAGA CTATCGCGGT
ACGGTGATCA TCAACCGGAT CAAAGGTATC GAAATTCATG ATGAACCTGA TGCGTGGTAT
TTACATGTAG GAGCCGGAGA AAACTGGCAT CGCCTGGTAA AATACACTTT GCAGGAAGGT
ATGCCTGGTC TGGAAAATCT GGCATTAATT CCTGGTTGTG TCGGCTCATC ACCTATCCAG
AATATTGGTG CTTATGGCGT AGAATTACAG CGAGTTTGCG CTTATGTTGA CTGTGTTGAA
CTGGCGACAG GCAAGCAAGT GCGCTTAACT GCCAAAGAGT GCCGTTTTGG CTATCGCGAC
AGTATTTTTA AACATGAATA CCAGGACCGC TTCGCCATTG TAGCCGTAGG TCTGCGTCTG
CCAAAAGAGT GGCAACCTGT ACTAACGTAT GGTGACTTAA CTCGTCTGGA TCCTACAACC
GTAACGCCAC AGCAAGTATT TAATGCGGTA TGTCATATGC GCACCACCAA ACTCCCTGAT
CCAAAAGTGA ATGGCAATGC CGGTAGTTTC TTCAAAAACC CTGTTGTATC TGCCGAAACG
GCTAAAGCAT TACTGGCACA ATTTCCAACA GCACCAAATT ATCCCCAGGC GGATGGTTCA
GTAAAACTGG CAGCAGGTTG GCTTATCGAT CAGTGCCAGC TAAAAGGGAT GCAAATGGGT
GGGGCTGCGG TGCACCGTCA ACAGGCGTTA GTCCTCATTA ATGAAGACAA TGCAAAAAGC
GAAGATGTGG TGCAACTGGC ACATCATGTA AGACAAAAAG TGGGTGAAAA ATTTAATGTC
TGGCTTGAGC CTGAAGTTCG CTTTATTGGT GCATCAGGTG AAGTTAGCGC AGTGGAGACG
ATTTCATGA
 
Protein sequence
MNHSLKPWNT FGIDHNAQHI VCAEDEQQLL NAWQHATAKG QSVLILGEGS NVLFLEDYRG 
TVIINRIKGI EIHDEPDAWY LHVGAGENWH RLVKYTLQEG MPGLENLALI PGCVGSSPIQ
NIGAYGVELQ RVCAYVDCVE LATGKQVRLT AKECRFGYRD SIFKHEYQDR FAIVAVGLRL
PKEWQPVLTY GDLTRLDPTT VTPQQVFNAV CHMRTTKLPD PKVNGNAGSF FKNPVVSAET
AKALLAQFPT APNYPQADGS VKLAAGWLID QCQLKGMQMG GAAVHRQQAL VLINEDNAKS
EDVVQLAHHV RQKVGEKFNV WLEPEVRFIG ASGEVSAVET IS