Gene Information |
Locus tag | SbBS512_E3086 |
Symbol | |
ID | 6271471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 2880752 |
End bp | 2882092 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641727013 |
Product | glucarate dehydratase |
Protein accession | YP_001881472 |
Protein GI | 187730357 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR03247] glucarate dehydratase |
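
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2880752, 2882092)
print(length)                  # 1341
print(protein_length(length))  # 446
```

Under those assumptions the result matches the recorded Gene Length (1341 bp) and Protein Length (446 aa).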
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0151 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCTC AATTTACGAC GCCTGTTGTT ACTGAAATGC AGGTCATCCC GGTGGCGGGT CATGACAGTA TGCTGATGAA TCTGAGTGGT GCACACGCAC CGTTCTTTAC GCGTAATATT ATGATTATCA AAGATAATTC TGGTCACACT GGCGTAGGGG AAATTCCCGG CGGCGAGAAA ATCCGTAAAA CGCTGGAAGA TGCGATTCCG CTGGTGGTAG GTAAAACGCT GGGTGAATAC AAAAACGTTC TGACGCTGGT GCGTAATACT TTTGCCGATC GTGATGCTGG TGGGCGCGGT TTGCAGACAT TTGATTTGCG TACCACTATT CATGTAGTTA CCGGGATAGA AGCGGCAATG CTGGATCTGC TGGGGCAGCA TCTGGGGGTA AACGTGGCAT CCCTGCTGGG CGATGGTCAA CAGCGTAGCG AAGTCGAAAT GCTCGGTTAT CTGTTCTTCG TCGGTGATCG CAAAGCCACA CCGCTGCCGT ATCAAAGCCA GCCGGATGAC TCATGCGACT GGTATCGCCT GCGTCATGAA GAAGCGATGA CGCCGGATGC GGTGGTGCGC CTGGCGGAAG CGGCGTATGA AAAATATGGC TTCAACGATT TCAAACTGAA AGGCGGTGTA CTGGCCGGGG AAGAAGAGGC CGAGTCTATT GTGGCACTGG CGCAACGCTT CCCGCAGGCG CGTATTACGC TCGATCCTAA CGGTGCCTGG TCGCTGAACG AAGCGATTAA AATTGGTAAA TACCTGAAAG GGTCGCTGGC TTATGCAGAA GATCCGTGTG GTGCGGAGCA AGGTTTCTCC GGGCGTGAAG TGATGGCAGA GTTCCGTCGC GCAACTGGCC TGCCGACCGC AACCAATATG ATCGCCACCG ACTGGCGGCA GATGGGCCAT ACGCTCTCCC TGCAATCCGT TGATATCCCG CTGGCGGATC CGCATTTCTG GACGATGCAA GGTTCGGTAC GTGTGGCGCA AATGTGCCAT GAATTTGGCC TGACCTGGGG TTCACACTCA AACAACCACT TCGATATTTC CCTGGCGATG TTTACCCATG TTGCCGCCGC TGCACCGGGT AAAATCACCG CGATTGATAC GCACTGGATT TGGCAGGAAG GCAATCAGCG CTTGACCAAA GAACCGTTTG AGATCAAAGG CGGTCTGGTT CAGGTGCCGG AAAAACCGGG GCTGGGTGTA GAAATCGATA TGGATCAAGT GATGAAAGCC CATGAGCTGT ATCAGAAACA CGGGCTTTGC GCGCGTGACG ATGCGATGGG AATGCAGTAT CTGATTCCTG TCTGGACGTT CGATAACAAG CGCCCGTGCA TGGTGCGTTA A
|
Protein sequence | MSSQFTTPVV TEMQVIPVAG HDSMLMNLSG AHAPFFTRNI MIIKDNSGHT GVGEIPGGEK IRKTLEDAIP LVVGKTLGEY KNVLTLVRNT FADRDAGGRG LQTFDLRTTI HVVTGIEAAM LDLLGQHLGV NVASLLGDGQ QRSEVEMLGY LFFVGDRKAT PLPYQSQPDD SCDWYRLRHE EAMTPDAVVR LAEAAYEKYG FNDFKLKGGV LAGEEEAESI VALAQRFPQA RITLDPNGAW SLNEAIKIGK YLKGSLAYAE DPCGAEQGFS GREVMAEFRR ATGLPTATNM IATDWRQMGH TLSLQSVDIP LADPHFWTMQ GSVRVAQMCH EFGLTWGSHS NNHFDISLAM FTHVAAAAPG KITAIDTHWI WQEGNQRLTK EPFEIKGGLV QVPEKPGLGV EIDMDQVMKA HELYQKHGLC ARDDAMGMQY LIPVWTFDNK RPCMVR
|
| |
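
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to recompute two of the recorded properties: the GC content and the translation. It is an illustration only, with hypothetical helper names (`clean_sequence`, `gc_percent`, `translate`). It assumes that translation table 11 assigns the same amino acids to codons as the standard code, and it does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that case does not arise here).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, copied from the record above.
fragment = clean_sequence("ATGAGTTCTC AATTTACGAC GCCTGTTGTT")
print(translate(fragment))  # MSSQFTTPVV
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and then `gc_percent` or `translate` would allow the recorded GC content and protein sequence to be checked in the same way.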