Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4420 |
Symbol | murB |
ID | 6143730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4517629 |
End bp | 4518657 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641619240 |
Product | UDP-N-acetylenolpyruvoylglucosamine reductase |
Protein accession | YP_001746360 |
Protein GI | 170684112 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0812] UDP-N-acetylmuramate dehydrogenase |
TIGRFAM ID | [TIGR00179] UDP-N-acetylenolpyruvoylglucosamine reductase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000273523 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.000726399 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACCACT CCTTAAAACC CTGGAACACA TTTGGCATTG ATCATAATGC TCAGCACATT GTATGTGCCG AAGACGAACA ACAACTACTC AATGCCTGGC AGCATGCAAC CGCAAAAGGA CAATCCGTTC TTATTCTGGG TGAAGGAAGT AATGTACTTT TTCTGGAAGA CTATCGCGGT ACGGTGATCA TCAACCGGAT CAAAGGTATC GAAATTCATG ATGAACCTGA TGCGTGGTAT TTACATGTAG GAGCCGGAGA AAACTGGCAT CGCCTGGTAA AATACACTTT GCAGGAAGGT ATGCCTGGTC TGGAAAATCT GGCATTAATT CCTGGTTGTG TCGGCTCATC ACCTATCCAG AATATTGGTG CTTATGGCGT AGAATTACAG CGAGTTTGCG CTTATGTTGA CTGTGTTGAA CTGGCGACAG GCAAGCAAGT GCGCTTAACT GCCAAAGAGT GCCGTTTTGG CTATCGCGAC AGTATTTTTA AACATGAATA CCAGGACCGC TTCGCCATTG TAGCCGTAGG TCTGCGTCTG CCAAAAGAGT GGCAACCTGT ACTAACGTAT GGTGACTTAA CTCGTCTGGA TCCTACAACC GTAACGCCAC AGCAAGTATT TAATGCGGTA TGTCATATGC GCACCACCAA ACTCCCTGAT CCAAAAGTGA ATGGCAATGC CGGTAGTTTC TTCAAAAACC CTGTTGTATC TGCCGAAACG GCTAAAGCAT TACTGGCACA ATTTCCAACA GCACCAAATT ATCCCCAGGC GGATGGTTCA GTAAAACTGG CAGCAGGTTG GCTTATCGAT CAGTGCCAGC TAAAAGGGAT GCAAATGGGT GGGGCTGCGG TGCACCGTCA ACAGGCGTTA GTCCTCATTA ATGAAGACAA TGCAAAAAGC GAAGATGTGG TGCAACTGGC ACATCATGTA AGACAAAAAG TGGGTGAAAA ATTTAATGTC TGGCTTGAGC CTGAAGTTCG CTTTATTGGT GCATCAGGTG AAGTTAGCGC AGTGGAGACG ATTTCATGA
|
Protein sequence | MNHSLKPWNT FGIDHNAQHI VCAEDEQQLL NAWQHATAKG QSVLILGEGS NVLFLEDYRG TVIINRIKGI EIHDEPDAWY LHVGAGENWH RLVKYTLQEG MPGLENLALI PGCVGSSPIQ NIGAYGVELQ RVCAYVDCVE LATGKQVRLT AKECRFGYRD SIFKHEYQDR FAIVAVGLRL PKEWQPVLTY GDLTRLDPTT VTPQQVFNAV CHMRTTKLPD PKVNGNAGSF FKNPVVSAET AKALLAQFPT APNYPQADGS VKLAAGWLID QCQLKGMQMG GAAVHRQQAL VLINEDNAKS EDVVQLAHHV RQKVGEKFNV WLEPEVRFIG ASGEVSAVET IS
|
| |