Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2982 |
Symbol | tas |
ID | 6144810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3060229 |
End bp | 3061269 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617851 |
Product | putative aldo-keto reductase |
Protein accession | YP_001745003 |
Protein GI | 170681174 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0288207 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0122492 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATATC ACCGTATACC CCACAGTTCG CTGGAAGTCA GCACGCTGGG GCTTGGCACG ATGACGTTTG GTGAACAGAA CAGCGAAGCC GACGCCCACG CACAACTCGA CTATGCCGTC GCTCAGGGCA TTAACCTTAT CGACGTTGCC GAAATGTACC CAGTACCTCC GCGCCCCGAA ACGCAAGGGT TAACCGAAAC CTACGTCGGC AACTGGCTGG CGAAACATGG CAGCCGCGAA AAGTTAATTA TCGCCTCCAA AGTGAGCGGA CCGTCGCGCA ATAATGACAA GGGCATCCGC CCGGATCAGG CGCTGGATCG GAAGAATATC CGCGAAGCGT TGCATGACAG CCTCAAGCGT CTGCAGACTG ATTACCTCGA TCTTTATCAG GTGCACTGGC CGCAGCGCCC AACCAACTGC TTTGGCAAAC TCGGTTATAG CTGGACGGAT TCTGCGCCTG CAGTTTCGCT GCTGGATACG CTGGACGCAC TGGCAGAGTA CCAACGCGCG GGAAAAATTC GTTATATCGG CGTGTCGAAC GAAACTGCAT TTGGCGTAAT GCGCTACCTG CATCTGGCAG ACAAACACGA TCTGCCGCGT ATTGTCACCA TTCAGAACCC TTACAGTCTG TTAAACCGCA GTTTTGAAGT AGGTCTGGCA GAAGTCAGCC AGTATGAAGG GGTCGAACTG CTGGCCTATT CGTGCCTGGG TTTCGGCACG CTGACCGGGA AATATCTCAA CGGTGCAAAA CCCGCTGGCG CACGTAATAC GCTCTTTAGT CGCTTCACAC GCTATAGCGG TGAGCAAACG CAAAAAGCCG TCGCGGCGTA TGTTGATATA GCCAGACGTC ATGGCCTGGA TCCTGCACAG ATGGCGCTCG CTTTTGTACG CCGTCAACCG TTTGTTGCCA GCACTCTGCT GGGCGCAACC ACGATGGAGC AGCTGAAAAC TAACGTCGAA AGTTTGCATC TGGAGTTAAG CGAAGACGTA TTAGCTGAAA TTGAAGCGGT ACATCAGGTT TATACTTATC CGGCACCATA A
|
Protein sequence | MQYHRIPHSS LEVSTLGLGT MTFGEQNSEA DAHAQLDYAV AQGINLIDVA EMYPVPPRPE TQGLTETYVG NWLAKHGSRE KLIIASKVSG PSRNNDKGIR PDQALDRKNI REALHDSLKR LQTDYLDLYQ VHWPQRPTNC FGKLGYSWTD SAPAVSLLDT LDALAEYQRA GKIRYIGVSN ETAFGVMRYL HLADKHDLPR IVTIQNPYSL LNRSFEVGLA EVSQYEGVEL LAYSCLGFGT LTGKYLNGAK PAGARNTLFS RFTRYSGEQT QKAVAAYVDI ARRHGLDPAQ MALAFVRRQP FVASTLLGAT TMEQLKTNVE SLHLELSEDV LAEIEAVHQV YTYPAP
|
| |