Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1796 |
Symbol | |
ID | 6146138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1816025 |
End bp | 1816990 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616673 |
Product | mandelate racemase/muconate lactonizing enzyme family protein |
Protein accession | YP_001743851 |
Protein GI | 170681912 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.671548 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAACCG TTAAGGTATT TGAGGAGGCC TGGCCCTTAC ATACCCCGTT TGTGATTGCT CGGGGAAGTC GCAGTGAAGC GCGCGTGGTG GTGGTTGAAC TGGAAGAAGA AGGTATTAAA GGCACCGGTG AATGCACGCC GTATCCGCGT TATGGAGAAA GTGATGCCTC GGTAATGGCG CAAATTATGA GCGTCGTGCC GCAATTAGAG AAAGGGCTGA CACGGGAGGA GTTGCAAAAA ATCCTCCCTG CCGGCGCAGC ACGTAATGCG CTGGATTGTG CATTGTGGGA TCTGGCGGCG CGAAAACAGC AGCAATCGCT GGCTGCTTTG ATCGGCATAA CGCTTCCCGA GACAGTCACC ACGGCACAGA CGATAGTCAT CGGTACGCCT GATCAGATGG CCAATAGTGC ATCAACACTC TGGCAGGCAG GCGCGAAATT ACTGAAAGTG AAGCTGGATA ACCATCTTAT CAGTGAGCGG ATGGTGGCAA TTCGCACCGC TGTGCCCGAT GCGACGCTGA TCGTTGATGC AAATGAATCC TGGCGTGCAG AAGGGTTGGC GGCGCGTTGC CAGCTATTGG CAGATTTAGG CGTTGCGATG CTTGAACAAC CGCTTTCTGC GCAGGACGAT GCGGCGCTGG AGAATTTTAT TCATCCGTTA CCGATTTGTG CTGATGAAAG TTGTCATACT CGTAGCAATC TGAAGGCGCT GAAAGGGCGC TATGAGATGG TTAACATTAA GCTTGATAAA ACCGGTGGTC TGACGGAAGC GCTGGCGCTG GCGACTGAAG CGCGTGCACA AGGTTTTCGT CTGATGCTGG GCTGCATGTT GTGTACCTCT CGTGCCATTA GCGCCGCTTT ACCGCTGGTG CCGCAGGTCA GTTTCGCCGA TCTTGACGGA CCGACCTGGC TGGCGGTAGA TGTGGAACCG GCGCTTCAGT TCACGACGGG CGAATTGCAT CTTTAG
|
Protein sequence | MRTVKVFEEA WPLHTPFVIA RGSRSEARVV VVELEEEGIK GTGECTPYPR YGESDASVMA QIMSVVPQLE KGLTREELQK ILPAGAARNA LDCALWDLAA RKQQQSLAAL IGITLPETVT TAQTIVIGTP DQMANSASTL WQAGAKLLKV KLDNHLISER MVAIRTAVPD ATLIVDANES WRAEGLAARC QLLADLGVAM LEQPLSAQDD AALENFIHPL PICADESCHT RSNLKALKGR YEMVNIKLDK TGGLTEALAL ATEARAQGFR LMLGCMLCTS RAISAALPLV PQVSFADLDG PTWLAVDVEP ALQFTTGELH L
|
| |