Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2981 |
Symbol | tas |
ID | 5595491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2989917 |
End bp | 2990957 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640922100 |
Product | putative aldo-keto reductase |
Protein accession | YP_001459605 |
Protein GI | 157162287 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0000187356 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATATC ACCGTATACC CCACAGTTCG CTGGAAGTCA GCACGCTGGG GCTTGGCACG ATGACGTTTG GTGAACAGAA CAGCGAAGCC GACGCCCACG CACAACTCGA CTATGCCGTT GCTCAGGGCA TTAACCTTAT CGACGTTGCC GAAATGTACC CAGTACCTCC GCGCCCCGAA ACGCAAGGGT TAACCGAAAC CTACGTCGGC AACTGGCTGG CGAAACATGG CAGCCGCGAA AAGTTAATTA TCGCCTCCAA AGTGAGCGGA CCGTCGCGCA ATAATGACAA GGGCATCCGC CCGGATCAGG CGCTGGATCG GAAGAATATC CGCGAAGCGC TGCATGACAG CCTCAAGCGC CTACAGACTG ATTACCTCGA TCTTTATCAG GTGCACTGGC CGCAGCGCCC AACCAACTGC TTTGGCAAAC TCGGTTATAG CTGGACGGAT TCTGCGCCTG CAGTTTCGCT GCTGGATACG CTGGACGCAC TGGCAGAGTA CCAACGCGCG GGAAAAATTC GTTATATCGG CGTGTCGAAC GAAACTGCAT TTGGCGTAAT GCGCTACCTG CATCTGGCGG ACAAACACGA TCTGCCGCGT ATTGTCACCA TTCAGAACCC CTACAGTCTG TTAAACCGCA GTTTTGAAGT AGGTCTGGCA GAAGTCAGCC AGTATGAAGG GGTCGAACTG CTGGCCTATT CGTGCCTGGG TTTCGGTACG CTGACCGGGA AATATCTCAA TGGTGCAAAA CCCGCTGGCG CACGTAATAC GCTCTTTAGT CGGTTCACCC GCTATAGCGG TGAGCAAACG CAAAAAGCCG TCGCGGCGTA TGTTGATATT GCCAGACGTC ATGGCCTGGA CCCTGCTCAA ATGGCGCTCG CATTTGTACG CCGTCAACCT TTTGTTGCCA GCACTCTGCT GGGCGCAACC ACGATGGAGC AGTTGAAAAC TAACGTCGAA AGTTTGCATC TGGAGTTAAG CGAAGACGTA TTAGCTGAAA TTGAAGCTGT GCATCAGGTT TATACTTATC CGGCACCATA A
|
Protein sequence | MQYHRIPHSS LEVSTLGLGT MTFGEQNSEA DAHAQLDYAV AQGINLIDVA EMYPVPPRPE TQGLTETYVG NWLAKHGSRE KLIIASKVSG PSRNNDKGIR PDQALDRKNI REALHDSLKR LQTDYLDLYQ VHWPQRPTNC FGKLGYSWTD SAPAVSLLDT LDALAEYQRA GKIRYIGVSN ETAFGVMRYL HLADKHDLPR IVTIQNPYSL LNRSFEVGLA EVSQYEGVEL LAYSCLGFGT LTGKYLNGAK PAGARNTLFS RFTRYSGEQT QKAVAAYVDI ARRHGLDPAQ MALAFVRRQP FVASTLLGAT TMEQLKTNVE SLHLELSEDV LAEIEAVHQV YTYPAP
|
| |