Gene Information |
Locus tag | EcE24377A_3154 |
Symbol | tas |
ID | 5589254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 3168289 |
End bp | 3169329 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640926796 |
Product | putative aldo-keto reductase |
Protein accession | YP_001464169 |
Protein GI | 157154848 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
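The coordinate and length fields above agree with one another. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the "Start bp" and "End bp" fields of this record.
length = gene_length_bp(3168289, 3169329)
print(length, protein_length_aa(length))  # prints: 1041 346
```

Both results match the "Gene Length" and "Protein Length" fields of the record.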
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000682822 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATATC ACCGTATACC CCACAGTTCG CTGGAAGTCA GCACGCTGGG GCTTGGCACG ATGACGTTTG GTGAACAGAA CAGCGAAGCC GACGCCCACG CACAACTCGA CTATGCCGTC GCTCAGGGCA TTAACCTTAT CGACGTTGCC GAAATGTACC CAGTACCTCC GCGCCCCGAA ACGCAAGGGT TAACCGAAAC CTACGTCGGC AACTGGCTGG CGAAACATGG CAGCCGCGAA AAGTTAATTA TCGCCTCCAA AGTGAGCGGA CCGTCGCGCA ATAATGACAA GGGCATCCGC CCGGATCAGG CGCTGGACCG GAAGAATATC CGCGAAGCGT TGCATGACAG CCTCAAGCGC CTACAGACTG ATTACCTCGA TCTTTATCAG GTGCACTGGC CGCAGCGCCC AACCAACTGC TTTGGCAAAC TCGGTTATAG CTGGACGGAT TCTGCGCCTG CAGTTTCGCT GCTGGATACG CTGGACGCAC TGGCAGAGTA CCAACGCGCG GGAAAAATTC GTTATATCGG CGTGTCGAAC GAAACTGCAT TTGGCGTAAT GCGCTACCTG CATCTGGCGG ACAAACACGA TCTGCCGCGT ATTGTCACCA TTCAGAACCC TTACAGTCTG TTAAACCGCA GTTTTGAAGT AGGTCTGGCA GAAGTCAGCC AGTATGAAGG GGTCGAACTG CTGGCCTATT CGTGCCTGGG TTTCGGCACG CTGACCGGGA AATATCTCAA TGGTGCAAAA CCCACTGGCG CACGTAATAC GCTCTTTAGT CGGTTCACCC GCTATAGCGG TGAGCAAACG CAAAAAGCCG TCGCGGCGTA TGTTGATATT GCCAGACGTC ATGGACTGGA TCCTGCTCAG ATGGCGCTCG CGTTTGTACG CCGTCAACCG TTTGTTGCCA GCACTCTGCT GGGCGCAACC ACGATGGAGC AGCTGAAAAC TAACGTCGAA AGTTTGCATC TGGAGTTAAG CGAAGACGTA TTAGCTGAAA TTGAAGCGGT GCATCAGGTT TATACTTATC CGGCACCATA A
|
Protein sequence | MQYHRIPHSS LEVSTLGLGT MTFGEQNSEA DAHAQLDYAV AQGINLIDVA EMYPVPPRPE TQGLTETYVG NWLAKHGSRE KLIIASKVSG PSRNNDKGIR PDQALDRKNI REALHDSLKR LQTDYLDLYQ VHWPQRPTNC FGKLGYSWTD SAPAVSLLDT LDALAEYQRA GKIRYIGVSN ETAFGVMRYL HLADKHDLPR IVTIQNPYSL LNRSFEVGLA EVSQYEGVEL LAYSCLGFGT LTGKYLNGAK PTGARNTLFS RFTRYSGEQT QKAVAAYVDI ARRHGLDPAQ MALAFVRRQP FVASTLLGAT TMEQLKTNVE SLHLELSEDV LAEIEAVHQV YTYPAP
|
| |
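The sequences above are printed in blocks of ten characters separated by spaces. The following is a minimal sketch of how such a string might be cleaned up before checking its length, GC fraction, and stop codon. The helper names are hypothetical, and only the first three blocks and the last two blocks of the gene sequence are used for illustration; the full string would be pasted in the same way. The stop codon set is the one used by translation table 11.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# First three blocks and last two blocks of the gene sequence in this record.
head = clean_sequence("ATGCAATATC ACCGTATACC CCACAGTTCG")
tail = clean_sequence("CGGCACCATA A")

print(len(head), round(gc_fraction(head), 3))  # prints: 30 0.467
print(ends_with_stop(tail))                    # prints: True
```

The GC fraction printed here covers only the first 30 bases, so it is not expected to equal the 54% reported for the whole gene.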