Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E3430 |
Symbol | |
ID | 6271739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3190567 |
End bp | 3191607 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641727316 |
Product | aldo-keto reductase |
Protein accession | YP_001881765 |
Protein GI | 187732579 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCTGGT TAGCGAATCC CGAACGTTAC GGGCAGATGC AGTACCGCTA TTGCGGAAAA AGTGGTTTAC GCCTGCCCGC GTTATCGCTC GGTTTATGGC ACAATTTCGG TCACGTTAAC GCGCTGGAAT CACAGCGTGC AATCCTGCGT AAAGCGTTTG ATTTAGGCAT TACGCACTTT GATTTAGCCA ACAATTACGG GCCGCCTCCA GGAAGCGCAG AAGAGAACTT TGGTCGCCTG CTGCGGGAGG ATTTTGCCGC TTATCGCGAT GAACTGATTA TCTCTACCAA GGCTGGCTAC GATATGTGGC CCGGCCCTTA CGGCTCTGGC GGTTCACGTA AATACCTGCT CGCCAGCCTC GACCAAAGCC TGAAGCGTAT GGGGCTTGAG TATGTCGATA TCTTTTACTC TCATCGCGTC GATGAAAATA CGCCGATGGA AGAAACCGCC TCTGCGCTGG CTCATGCGGT ACAAAGCGGT AAGGCGCTGT ATGCCGGGAT CTCCTCTTAC TCGCCAGAGC GGACGCAAAA AATGGTCGAG TTGCTGCACG AGTGGAAAAT TCCGCTTTTA ATTCATCAAC CTTCGTACAA TTTACTGAAC CGCTGGGTGG ATAAAAGCGG CCTGCTGGAT ACCCTGCAAA ATAACGGCGT GGGCTGTATT GCCTTTACTC CTCTGGCTCA GGGATTGCTG ACCGGAAAAT ATCTCAACGG CATACCGCAA GATTCACGGA TGCATCGTGA AGGGAATAAA GTTCGTGGTC TGACGCCGAA AATGCTTACC GAAGCCAACC TCAACAGCCT GCGCTTATTG AATGAAATGG CACAGCAGCG TGGACAATCA ATGGCGCAAA TGGCGTTAAG CTGGTTGCTG AAAAATGATC GCGTGACGTC GGTATTGATT GGTGTCAGCC GCGCGGAGCA ACTAGAGGAG AACGTGCAGG CGCTGAATAA TCTGACATTT AGCACCGAGG AGCTGGCGCA GATTGATCAG CATATCGCCG ATGGCGAGCT GAATCTGTGG CAGGCGTCTT CCGATAAATG A
|
Protein sequence | MVWLANPERY GQMQYRYCGK SGLRLPALSL GLWHNFGHVN ALESQRAILR KAFDLGITHF DLANNYGPPP GSAEENFGRL LREDFAAYRD ELIISTKAGY DMWPGPYGSG GSRKYLLASL DQSLKRMGLE YVDIFYSHRV DENTPMEETA SALAHAVQSG KALYAGISSY SPERTQKMVE LLHEWKIPLL IHQPSYNLLN RWVDKSGLLD TLQNNGVGCI AFTPLAQGLL TGKYLNGIPQ DSRMHREGNK VRGLTPKMLT EANLNSLRLL NEMAQQRGQS MAQMALSWLL KNDRVTSVLI GVSRAEQLEE NVQALNNLTF STEELAQIDQ HIADGELNLW QASSDK
|
| |