Gene Information |
Locus tag | EcHS_A0094 |
Symbol | murD |
ID | 5594240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 98842 |
End bp | 100158 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640919282 |
Product | UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase |
Protein accession | YP_001456877 |
Protein GI | 157159559 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase |
TIGRFAM ID | [TIGR01087] UDP-N-acetylmuramoylalanine--D-glutamate ligase |
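The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 98842
end_bp = 100158
protein_length_aa = 438

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bp each).
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and contributes no residue.
expected_protein_length = codons - 1

print(gene_length_bp)           # 1317
print(remainder)                # 0
print(expected_protein_length)  # 438
```

Under these assumptions the computed values agree with the listed Gene Length (1317 bp) and Protein Length (438 aa).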
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 0.684668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATT ATCAGGGTAA AAATGTCGTC ATTATCGGCC TGGGCCTCAC CGGGCTTTCC TGCGTGGACT TTTTCCTCGC TCGCGGTGTG ACGCCGCGCG TTATGGATAC GCGTATGACA CCGCCTGGCC TGGATAAATT ACCCGAAGCC GTAGAACGCC ACACGGGCAG TCTGAATGAT GAATGGCTGA TGGCGGCAGA TCTGATTGTC GCCAGTCCCG GTATTGCACT GGCGCATCCA TCCTTAAGCG CTGCCGCTGA TGCCGGAATC GAAATCGTTG GCGATATCGA GCTGTTCTGT CGCGAAGCAC AAGCACCGAT TGTGGCGATT ACCGGTTCTA ACGGCAAAAG CACGGTCACC ACGCTAGTGG GTGAAATGGC GAAAGCGGCG GGGGTTAACG TTGGTGTGGG TGGCAATATT GGCCTGCCTG CGTTGATGCT ACTGGATGAT GAGTGTGAAC TGTACGTGCT GGAACTGTCG AGCTTCCAGC TGGAAACCAC CTCCAGCTTA CAGGCGGTAG CAGCGACCAT TCTGAACGTG ACTGAAGATC ATATGGATCG CTATCCGTTT GGTTTACAAC AGTATCGTGC AGCAAAACTG CGCATTTACG AAAACGCGAA AGTTTGCGTG GTTAATGCTG ATGATGCCTT AACAATGCCG ATTCGCGGTG CGGATGAACG CTGCGTCAGC TTTGGCGTCA ACATGGGTGA CTATCACCTG AATCATCAGC AGGGCGAAAC CTGGCTGCGG GTTAAAGGCG AGAAAGTGCT GAATGTGAAA GAGATGAAAC TTTCCGGGCA GCATAACTAC ACCAATGCGC TGGCGGCGCT GGCGCTGGCA GATGCTGCAG GGTTACCGCG TGCCAGCAGC CTGAAAGCGT TAACCACATT CACTGGTCTG CCGCATCGCT TTGAAGTTGT GCTGGAGCAT AACGGCGTAC GTTGGATTAA CGATTCGAAA GCGACCAACG TCGGCAGTAC GGAAGCGGCG CTGAATGGCC TGCACGTAGA CGGCACACTG CATTTGTTGC TGGGTGGCGA TGGTAAATCG GCGGACTTTA GCCCACTGGC GCGTTACCTG AATGGCGATA ACGTACGTCT GTATTGTTTC GGTCGTGACG GCGCGCAGCT GGCGGCGCTA CGCCCGGAAG TGGCAGAACA AACCGAAACT ATGGAACAGG CGATGCGCTT GCTGGCTCCG CGTGTTCAGC CGGGCGATAT GGTTCTGCTC TCCCCAGCCT GTGCCAGCCT TGATCAGTTC AAGAACTTTG AACAACGAGG CAATGAGTTT GCCCGTCTGG CGAAGGAGTT AGGTTGA
|
Protein sequence | MADYQGKNVV IIGLGLTGLS CVDFFLARGV TPRVMDTRMT PPGLDKLPEA VERHTGSLND EWLMAADLIV ASPGIALAHP SLSAAADAGI EIVGDIELFC REAQAPIVAI TGSNGKSTVT TLVGEMAKAA GVNVGVGGNI GLPALMLLDD ECELYVLELS SFQLETTSSL QAVAATILNV TEDHMDRYPF GLQQYRAAKL RIYENAKVCV VNADDALTMP IRGADERCVS FGVNMGDYHL NHQQGETWLR VKGEKVLNVK EMKLSGQHNY TNALAALALA DAAGLPRASS LKALTTFTGL PHRFEVVLEH NGVRWINDSK ATNVGSTEAA LNGLHVDGTL HLLLGGDGKS ADFSPLARYL NGDNVRLYCF GRDGAQLAAL RPEVAEQTET MEQAMRLLAP RVQPGDMVLL SPACASLDQF KNFEQRGNEF ARLAKELG
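The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to join those blocks, compute GC content, and translate the nucleotide sequence. It is an illustration rather than the method used by the database: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs only in the permitted start codons). Only the first two blocks of the gene sequence are used as input here.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are those shared by
# translation tables 1 and 11. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(blocked: str) -> str:
    """Join the space-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First two blocks of the gene sequence from this record.
prefix = clean("ATGGCTGATT ATCAGGGTAA")
print(translate(prefix))    # MADYQG
print(gc_fraction(prefix))  # 0.4
```

The translated prefix matches the start of the listed protein sequence (MADYQG...). Passing the full gene sequence to the same functions would allow the reported GC content and protein sequence to be checked in the same way.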
|
| |