Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1559 |
Symbol | |
ID | 6269755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1421446 |
End bp | 1422411 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641725652 |
Product | mandelate racemase/muconate lactonizing enzyme family protein |
Protein accession | YP_001880158 |
Protein GI | 187730278 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAACCG TTAAGGTATT TGAGGAAGCC TGGCCCTTAC ATACCCCGTT TGTGATTGCC CGGGGAAGTC GCAGTGAAGC GCGCGTGGTG GTGGTTGAAC TGGAAGAAGA GGGTATTAAA GGCACCGGCG AATGCACGCC GTATCCGCGT TATGGAGAAA GTGATGCCTC GGTAATGGCG CAAATTATGA GCGTCGTGTC GCAATTAGAG AAAGGGCTGA CACGGGAGGA GTTGCAAAAA ATTCTCCCTG CCGGCGCAGC ACGTAATGCG CTGGATTGTG CATTGTGGGA TCTGGCCGCG CGAAAACAGC AGCAATCGCT GGCTGATTTG ATCGGCATAA CGCTTCCCGA GACAGTTATC ACTGCACAGA CGGTAGTCAT CGGTACGCCT GATCAGATGG CCAATAGTGC ATCAACACTC TGGCAGGCAG GCGCGAAATT ACTGAAAGTG AAGCTGGATA ACCATCTTAT TAGTGAGCGG ATGGTGGCAA TTCGCACAGC TGTGCCCGAT GCGACGCTGA TCGTTGATGC AAATGAATCC TGGCGTGCAG AAGGGTTGGC GGCGCGTTGC CAGCTATTGG CGGATTTAGG CGTTGCGATG CTTGAACAAC CCCTTCCTGC GCAGGACGAT GCGGCGCTGG AGAATTTTAT TCATCCGTTG CCGATTTGTG CTGATGAGAG TTGTCATACA CGTAGCAATC TGAAGGCGCT GAAAGGGCGC TATGAGATGG TTAACATTAA GCTCGATAAA ACCGGTGGTC TGACGGAAGC GCTGGCGCTG GCGACTGAAG CGCGTGCACA AGGTTTCAGT CTGATGCTGG GCTGCATGTT GTGTACCTCT CGTGCCATTA GCGCCGCTTT ACCGCTGGTG CCGCAGGTCA GTTTCGCCGA TCTTGACGGA CCGACCTGGC TGGCGGTAGA TGTGGAACCG GCGCTTCAGT TCACGACGGG CGAATTGCAT CTTTAG
|
Protein sequence | MRTVKVFEEA WPLHTPFVIA RGSRSEARVV VVELEEEGIK GTGECTPYPR YGESDASVMA QIMSVVSQLE KGLTREELQK ILPAGAARNA LDCALWDLAA RKQQQSLADL IGITLPETVI TAQTVVIGTP DQMANSASTL WQAGAKLLKV KLDNHLISER MVAIRTAVPD ATLIVDANES WRAEGLAARC QLLADLGVAM LEQPLPAQDD AALENFIHPL PICADESCHT RSNLKALKGR YEMVNIKLDK TGGLTEALAL ATEARAQGFS LMLGCMLCTS RAISAALPLV PQVSFADLDG PTWLAVDVEP ALQFTTGELH L
|
| |