Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4101 |
Symbol | tas |
ID | 6967342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3798394 |
End bp | 3799434 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643387858 |
Product | putative aldo-keto reductase |
Protein accession | YP_002272298 |
Protein GI | 209397783 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000408733 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.00103859 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAATATC ACCGTATACC CCACAGTTCG CTGGAAGTCA GCACGCTGGG GCTTGGCACG ATGACGTTTG GTGAACAGAA CAGCGAAGCC GACGCCCACG CACAACTCGA CTATGCCGTC GCTCAGGGCA TTAACCTTAT CGACGTTGCC GAAATGTACC CAGTACCTCC GCGCCCCGAA ACGCAAGGGT TAACCGAAAC CTACGTCGGC AACTGGCTGG CGAAACATGG CAGCCGCGAA AAGTTAATTA TCGCCTCCAA AGTGAGCGGA CCGTCGCGCA ATAATGACAA GGGCATCCGC CCGGATCAGG CGCTGGATCG GAAGAATATC CGCGAAGCGC TGCATGACAG CCTCAAGCGT CTGCAGACTG ATTACCTCGA TCTTTATCAG GTGCACTGGC CGCAGCGCCC AACCAACTGC TTCGGCAAAC TCGGTTATAG CTGGACAGAT TCTGCGCCTG CGGTTTCGCT GCTGGATACG CTGGACGCAC TGGCAGAGTA CCAACGAGCG GGAAAAATTC GTTATATCGG CGTGTCGAAC GAAACTGCAT TTGGCGTAAT GCGCTACCTG CATCTGGCGG ACAAACACGA TCTGCCGCGT ATTGTCACCA TTCAGAACCC TTACAGTCTG TTAAACCGCA GTTTTGAAGT AGGTCTGGCA GAAGTCAGCC AGTATGAAGG GGTCGAACTG CTGGCCTATT CGTGCCTGGG TTTCGGCACG CTGACCGGGA AATATCTCAA TGGTGCAAAA CCCGCTGGCG CACGTAATAC GCTCTTTAGT CGGTTCACCC GCTATAGCGG TGAGCAAACG CAAAAAGCCG TCGCGGCGTA TGTTGATATC GCCAGACGTC ATGGCCTAGC CCCTGCTCAG ATGGCGCTCG CGTTTGTACG CCGTCAACCG TTTGTTGCCA GCACTCTGCT GGGCGCAACC ACGATGGATC AGCTGAAAAC TAACATCGAA AGTTTGCATC TGGAGTTAAG CGAAGACGTA TTAGCTGAAA TTGAAGCGGT GCATCAGGTT TATACTTATC CGGCACCATA A
|
Protein sequence | MQYHRIPHSS LEVSTLGLGT MTFGEQNSEA DAHAQLDYAV AQGINLIDVA EMYPVPPRPE TQGLTETYVG NWLAKHGSRE KLIIASKVSG PSRNNDKGIR PDQALDRKNI REALHDSLKR LQTDYLDLYQ VHWPQRPTNC FGKLGYSWTD SAPAVSLLDT LDALAEYQRA GKIRYIGVSN ETAFGVMRYL HLADKHDLPR IVTIQNPYSL LNRSFEVGLA EVSQYEGVEL LAYSCLGFGT LTGKYLNGAK PAGARNTLFS RFTRYSGEQT QKAVAAYVDI ARRHGLAPAQ MALAFVRRQP FVASTLLGAT TMDQLKTNIE SLHLELSEDV LAEIEAVHQV YTYPAP
|
| |