Gene Information |
Locus tag | EcSMS35_3287 |
Symbol | |
ID | 6143573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3363225 |
End bp | 3364265 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618117 |
Product | aldo-keto reductase |
Protein accession | YP_001745267 |
Protein GI | 170682381 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
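The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values listed, but the record does not state them. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the Gene Information table.
length = gene_length_bp(3363225, 3364265)
print(length, protein_length_aa(length))
```

With the values from this record, the two results agree with the Gene Length (1041 bp) and Protein Length (346 aa) rows.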
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCTGGT TAGCGAATCC CGAACGTTAC GGGCAGATGC AGTACCGCTA TTGCGGAAAA AGCGGTTTAC GCCTGCCCGC GTTATCGCTC GGTTTATGGC ACAATTTCGG TCACGTTAAC GCGCTGGAAT CACAGCGTGC GATCCTGCGT AAAGCGTTTG ATTTGGGCAT TACGCACTTT GATTTAGCCA ACAATTACGG GCCACCTCCA GGAAGCGCAG AAGAGAACTT TGGTCGCCTG CTGCGGGAGG ATTTTGCCGC TTATCGCGAT GAACTGATTA TCTCTACCAA GGCTGGCTAC GATATGTGGC CCGGCCCTTA CGGCTCTGGC GGTTCACGTA AATACCTGCT CGCCAGCCTC GACCAAAGCC TGAAGCGGAT GGGGCTTGAG TATGTCGATA TCTTTTACTC TCATCGCGTC GATGAAAATA CGCCGATGGA AGAAACCGCC TCTGCGCTGG CTCATGCGGT ACAAAGCGGT AAGGCGCTGT ATGTCGGGAT CTCCTCTTAC TCGCCAGAGC GGACGCAAAA AATGGTCGAG TTACTGCGCG AGTGGAAAAT TCCGCTGTTA ATTCATCAAC CTTCGTACAA TTTACTGAAC CGCTGGGTGG ATAAAAGCGG CCTGCTGGAT ACCCTGCAAA ATAACGGCGT GGGCTGCATT GCCTTTACTC CTCTGGCTCA GGGATTGCTG ACCGGAAAAT ATCTCAACGG CATTCCGCAA GATTCACGGA TGCATCGTGA AGGGAATAAA GTTCGTGGTC TGACGCCGAA AATGCTCACC GAAGCCAACC TCAACAGCCT GCGGTTATTG AATGAAATGG CACAGCAGCG TGGACAATCA ATGGCGCAAA TGGCGTTAAG CTGGTTGCTG AAAGATGATC GCGTGACGTC GGTATTGATT GGTGCCAGCC GCGCGGAGCA ACTTGAGGAG AACGTGCAGG CGCTGAATAA TCTGACATTT AGCACCGAGG AGCTGGCGCA GATTGATCAG CATATCGCCG ATGGCGAGCT GAATCTGTGG CAGGCGTCTT CCGATAAATG A
|
Protein sequence | MVWLANPERY GQMQYRYCGK SGLRLPALSL GLWHNFGHVN ALESQRAILR KAFDLGITHF DLANNYGPPP GSAEENFGRL LREDFAAYRD ELIISTKAGY DMWPGPYGSG GSRKYLLASL DQSLKRMGLE YVDIFYSHRV DENTPMEETA SALAHAVQSG KALYVGISSY SPERTQKMVE LLREWKIPLL IHQPSYNLLN RWVDKSGLLD TLQNNGVGCI AFTPLAQGLL TGKYLNGIPQ DSRMHREGNK VRGLTPKMLT EANLNSLRLL NEMAQQRGQS MAQMALSWLL KDDRVTSVLI GASRAEQLEE NVQALNNLTF STEELAQIDQ HIADGELNLW QASSDK
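The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the pipeline that produced this record: it builds the codon map from the compact NCBI amino-acid string, relying on the fact that translation table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons. The `translate` helper and the constant names are hypothetical, and only the first 30 nucleotides of the gene are used as input.

```python
from itertools import product

# Codon order T, C, A, G at each position, as in the NCBI genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-nt block spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-nt blocks of the gene sequence listed above.
prefix = "ATGGTCTGGT TAGCGAATCC CGAACGTTAC"
print(translate(prefix))  # MVWLANPERY
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function should yield the complete protein, with translation ending at the terminal TGA stop codon.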