Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4816 |
Symbol | |
ID | 6270136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4487172 |
End bp | 4488191 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641728558 |
Product | oxidoreductase, zinc-binding dehydrogenase family |
Protein accession | YP_001882952 |
Protein GI | 187733393 |
COG category | [R] General function prediction only |
COG ID | [COG1064] Zn-dependent alcohol dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.140966 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATGA TAAAAAGCTA TGCCGCAAAA GAAGCGGGCG GCGAACTGGA AGTTTATGAG TACGATCCCG GTGAGCTGAG GCCACAAGAT GTTGAAGTGC AGGTGGATTA TTGCGGGATC TGCCATTCCG ATCTATCGAT GATCGACAAT GAATGGGGAT TTTCACAATA TCCGCTGGTT GCCGGGCATG AGGTGATTGG GCGCGTGGTG GCACTCGGGA GCGCCGCGCA GGATAAAGGT TTGCAGGTCG GTCAGCGTGT CGGGATTGGC TGGACGGCGC GTAGCTGTGG TCACTGCGAC GCCTGTATTA GCGGTAATCA GATCAACTGC GAGCAAGGTG CGGTGCCGAC GATTATGAAT CGCGGTGGCT TTGCCGAGAA GTTGCGTGCG GACTGGCAAT GGGTGATTCC ACTGCCAGAA AATATTGATA TCGAGTCCGC CGGGCCGCTG TTGTGCGGCG GTATCACGGT CTTTAAACCA CTGTTGATGC ACCATATCAC TGCTACCAGC CGCGTTGGGG TAATTGGTAT TGGCGGGCTG GGGCATATCG CTATAAAACT TCTGCACGCA ATGGGATGCG AGGTGACGGC CTTTAGTTCT AATCCGGCGA AAGAGCAGGA AGTACTGGCG ATGGGTGCCG ATAAAGTGGT GAATAGCCGC GATCCGCAGG CACTGAAAGC CCTGTCGGGG CAGTTTGATC TCATTATCAA TACTGTGAAC GTCAGCCTCG ACTGGCAGCC TTATTTTGAG GCGCTGACCT ATGGCGGTAA TTTCCATACG GTCGGTGCGG TTCTCACGCC GCTGTCTGTT CCGGCCTTTA CGTTAATTGC GGGCGACCGC AGCATCTCTG GTTCTGCTAC CGGCACGCCT TATGAGCTGC GAAAGCTGAT GCGCTTTGCC GCCCGTAGCA AGGTTGCGCC GACCACCGAA CTGTTCCCGA TGTCGAAAAT TAACGATGCC ATCAAGCATG TGCGCGACGG TAAGGCGCGT TACCGCGTGG TTCTGAAAGC CGACTTCTGA
|
Protein sequence | MSMIKSYAAK EAGGELEVYE YDPGELRPQD VEVQVDYCGI CHSDLSMIDN EWGFSQYPLV AGHEVIGRVV ALGSAAQDKG LQVGQRVGIG WTARSCGHCD ACISGNQINC EQGAVPTIMN RGGFAEKLRA DWQWVIPLPE NIDIESAGPL LCGGITVFKP LLMHHITATS RVGVIGIGGL GHIAIKLLHA MGCEVTAFSS NPAKEQEVLA MGADKVVNSR DPQALKALSG QFDLIINTVN VSLDWQPYFE ALTYGGNFHT VGAVLTPLSV PAFTLIAGDR SISGSATGTP YELRKLMRFA ARSKVAPTTE LFPMSKINDA IKHVRDGKAR YRVVLKADF
|
| |