Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3686 |
Symbol | tktB |
ID | 6970304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3399877 |
End bp | 3401880 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643387480 |
Product | transketolase |
Protein accession | YP_002271933 |
Protein GI | 209398617 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0021] Transketolase |
TIGRFAM ID | [TIGR00232] transketolase, bacterial and yeast |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.709027 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 90 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGAA AAGACCTTGC CAATGCGATT CGCGCACTCA GTATGGATGC GGTACAAAAA GCCAATTCTG GTCATCCCGG CGCGCCGATG GGCATGGCTG ATATTGCCGA AGTGCTGTGG AACGATTTTC TTAAACATAA CCCTACCGAC CCAACCTGGT ATGATCGCGA CCGCTTTATT CTTTCCAACG GTCACGCGTC GATGCTGCTC TACAGTTTGC TGCATCTGAC CGGTTACGAC CTGCCGCTGG AAGAGCTGAA GAACTTTCGT CAACTGCATT CGAAAACCCC TGGCCACCCG GAAATCGGCT ATACGCCCGG AGTTGAAACC ACCACCGGTC CTCTTGGACA AGGTTTGGCG AACGCCGTCG GGCTGGCGAT AGCGGAACGT ACACTGGCGG CGCAGTTTAA CCAGCCGGAT CATGAAATTG TCGATCACTT CACCTATGTG TTTATGGGCG ACGGCTGCCT GATGGAAGGT ATTTCCCACG AAGTCTGTTC GCTGGCGGGC ACGCTGGGAC TGGGCAAGCT GATTGGTTTT TACGATCACA ACGGTATTTC GATTGATGGA GAAACCGAAG GCTGGTTTAC CGACGATACG GCAAAACGTT TTGAAGCCTA TCACTGGCAT GTGATCCATG AAATCGACGG TCACGATCCG CAGGCGGTGA AGGAAGCAAT CCTTGAAGCG CAAAGCGTGA AAGATAAGCC GTCGCTGATT ATCTGCCGTA CGGTGATTGG CTTTGGTTCG CCGAATAAAG CAGGTAAGGA AGAGGCGCAC GGCGCACCGC TGGGGGAAGA AGAAGTGGCG CTGGTACGGC AAAAACTGGG CTGGCACCAT CCGCCATTTG AGATCCCTAA AGATATTTAT CACGCTTGGG ATGCCCGCGA AAAAGGCGAA AAAGCGCAGC AGAGCTGGAA TGAGAAGTTT GCCGCCTATA AAAAGGCTCA TCCGCAACTG GCAGAAGAGT TTACCCGTCG GATGAGCGGT GGTTTACCGA AGGACTGGGA GAAAACGACT CAGAAATATA TCAATGAGTT GCAGGCGAAT CCGGCGAAAA TCGCTACCCG TAAGGCTTCG CAAAATACGC TTAACGCTTA CGGGCCGATG CTACCGGAGC TGCTCGGCGG TTCGGCGGAT CTGGCTCCCA GCAACCTGAC CATCTGGAAA GGTTCTGTTT CGCTGAAGGA AGATCCGGCG GGCAACTACA TTCACTACGG GGTGCGTGAA TTTGGCATGA CCGCTATCGC CAACGGCATT GCGCACCACG GCGGCTTTGT GCCGTATACC GCAACGTTCC TGATGTTTGT TGAATACGCC CGTAACGCCG CGCGGATGGC GGCACTGATG AAAGCGCGGC AGATTATGGT TTATACCCAC GACTCAATTG GTCTGGGCGA AGATGGTCCG ACGCACCAGG CTGTTGAGCA ACTGGCCAGC CTGCGCTTAA CGCCAAATTT CAGCACCTGG CGACCGTGCG ATCAGGTGGA AGCGGCGGTG GGCTGGAAGC TGGCGGTTGA GCGCCACAAC GGACCGACGG CACTGATTCT CTCCAGGCAG AATCTGGCCC AGGTGGAACG TACGCCGGAT CAGGTTAAAG AGATTGCTCG TGGCGGTTAT GTGCTGAAAG ACAGCGGCGG TAAGCCAGAT ATTATTTTGA TTGCCACCGG TTCAGAGATG GAAATCACCC TGCAAGCGGC GGAGAAATTA GCGGGAGAAG GTCGCAATGT TCGCGTGGTT TCCCTGCCCT CGACCGATAT TTTCGACGCC CAGGATGAGG AATATCGGGA GTCGGTGTTG CCTTCTAACG TTGCGGCTCG CGTGGCGGTG GAAGCAGGTA TTGCCGATTA CTGGTACAAG TATGTTGGTC TGAAAGGGGC AATTGTCGGG ATGACGGGTT ATGGGGAATC TGCTCCGGCG GATAAGCTGT TCCCGTTCTT TGGCTTTACC GCCGAGAATA TTGTGGCAAA AGCGCATAAG GTGCTGGGAG TAAAAGGTGC CTGA
|
Protein sequence | MSRKDLANAI RALSMDAVQK ANSGHPGAPM GMADIAEVLW NDFLKHNPTD PTWYDRDRFI LSNGHASMLL YSLLHLTGYD LPLEELKNFR QLHSKTPGHP EIGYTPGVET TTGPLGQGLA NAVGLAIAER TLAAQFNQPD HEIVDHFTYV FMGDGCLMEG ISHEVCSLAG TLGLGKLIGF YDHNGISIDG ETEGWFTDDT AKRFEAYHWH VIHEIDGHDP QAVKEAILEA QSVKDKPSLI ICRTVIGFGS PNKAGKEEAH GAPLGEEEVA LVRQKLGWHH PPFEIPKDIY HAWDAREKGE KAQQSWNEKF AAYKKAHPQL AEEFTRRMSG GLPKDWEKTT QKYINELQAN PAKIATRKAS QNTLNAYGPM LPELLGGSAD LAPSNLTIWK GSVSLKEDPA GNYIHYGVRE FGMTAIANGI AHHGGFVPYT ATFLMFVEYA RNAARMAALM KARQIMVYTH DSIGLGEDGP THQAVEQLAS LRLTPNFSTW RPCDQVEAAV GWKLAVERHN GPTALILSRQ NLAQVERTPD QVKEIARGGY VLKDSGGKPD IILIATGSEM EITLQAAEKL AGEGRNVRVV SLPSTDIFDA QDEEYRESVL PSNVAARVAV EAGIADYWYK YVGLKGAIVG MTGYGESAPA DKLFPFFGFT AENIVAKAHK VLGVKGA
|
| |