Gene ECH74115_3686 details


Gene Information

Locus tag: ECH74115_3686
Symbol: tktB
ID: 6970304
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3399877
End bp: 3401880
Gene Length: 2004 bp
Protein Length: 667 aa
Translation table: 11
GC content: 54%
IMG OID: 643387480
Product: transketolase
Protein accession: YP_002271933
Protein GI: 209398617
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0021] Transketolase
TIGRFAM ID: [TIGR00232] transketolase, bacterial and yeast
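
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes that the start and end coordinates are 1-based and inclusive and that the reported gene length includes the stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 3399877
end_bp = 3401880

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the reported Gene Length (2004 bp) and Protein Length (667 aa).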


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.709027
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 90
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCGAA AAGACCTTGC CAATGCGATT CGCGCACTCA GTATGGATGC GGTACAAAAA 
GCCAATTCTG GTCATCCCGG CGCGCCGATG GGCATGGCTG ATATTGCCGA AGTGCTGTGG
AACGATTTTC TTAAACATAA CCCTACCGAC CCAACCTGGT ATGATCGCGA CCGCTTTATT
CTTTCCAACG GTCACGCGTC GATGCTGCTC TACAGTTTGC TGCATCTGAC CGGTTACGAC
CTGCCGCTGG AAGAGCTGAA GAACTTTCGT CAACTGCATT CGAAAACCCC TGGCCACCCG
GAAATCGGCT ATACGCCCGG AGTTGAAACC ACCACCGGTC CTCTTGGACA AGGTTTGGCG
AACGCCGTCG GGCTGGCGAT AGCGGAACGT ACACTGGCGG CGCAGTTTAA CCAGCCGGAT
CATGAAATTG TCGATCACTT CACCTATGTG TTTATGGGCG ACGGCTGCCT GATGGAAGGT
ATTTCCCACG AAGTCTGTTC GCTGGCGGGC ACGCTGGGAC TGGGCAAGCT GATTGGTTTT
TACGATCACA ACGGTATTTC GATTGATGGA GAAACCGAAG GCTGGTTTAC CGACGATACG
GCAAAACGTT TTGAAGCCTA TCACTGGCAT GTGATCCATG AAATCGACGG TCACGATCCG
CAGGCGGTGA AGGAAGCAAT CCTTGAAGCG CAAAGCGTGA AAGATAAGCC GTCGCTGATT
ATCTGCCGTA CGGTGATTGG CTTTGGTTCG CCGAATAAAG CAGGTAAGGA AGAGGCGCAC
GGCGCACCGC TGGGGGAAGA AGAAGTGGCG CTGGTACGGC AAAAACTGGG CTGGCACCAT
CCGCCATTTG AGATCCCTAA AGATATTTAT CACGCTTGGG ATGCCCGCGA AAAAGGCGAA
AAAGCGCAGC AGAGCTGGAA TGAGAAGTTT GCCGCCTATA AAAAGGCTCA TCCGCAACTG
GCAGAAGAGT TTACCCGTCG GATGAGCGGT GGTTTACCGA AGGACTGGGA GAAAACGACT
CAGAAATATA TCAATGAGTT GCAGGCGAAT CCGGCGAAAA TCGCTACCCG TAAGGCTTCG
CAAAATACGC TTAACGCTTA CGGGCCGATG CTACCGGAGC TGCTCGGCGG TTCGGCGGAT
CTGGCTCCCA GCAACCTGAC CATCTGGAAA GGTTCTGTTT CGCTGAAGGA AGATCCGGCG
GGCAACTACA TTCACTACGG GGTGCGTGAA TTTGGCATGA CCGCTATCGC CAACGGCATT
GCGCACCACG GCGGCTTTGT GCCGTATACC GCAACGTTCC TGATGTTTGT TGAATACGCC
CGTAACGCCG CGCGGATGGC GGCACTGATG AAAGCGCGGC AGATTATGGT TTATACCCAC
GACTCAATTG GTCTGGGCGA AGATGGTCCG ACGCACCAGG CTGTTGAGCA ACTGGCCAGC
CTGCGCTTAA CGCCAAATTT CAGCACCTGG CGACCGTGCG ATCAGGTGGA AGCGGCGGTG
GGCTGGAAGC TGGCGGTTGA GCGCCACAAC GGACCGACGG CACTGATTCT CTCCAGGCAG
AATCTGGCCC AGGTGGAACG TACGCCGGAT CAGGTTAAAG AGATTGCTCG TGGCGGTTAT
GTGCTGAAAG ACAGCGGCGG TAAGCCAGAT ATTATTTTGA TTGCCACCGG TTCAGAGATG
GAAATCACCC TGCAAGCGGC GGAGAAATTA GCGGGAGAAG GTCGCAATGT TCGCGTGGTT
TCCCTGCCCT CGACCGATAT TTTCGACGCC CAGGATGAGG AATATCGGGA GTCGGTGTTG
CCTTCTAACG TTGCGGCTCG CGTGGCGGTG GAAGCAGGTA TTGCCGATTA CTGGTACAAG
TATGTTGGTC TGAAAGGGGC AATTGTCGGG ATGACGGGTT ATGGGGAATC TGCTCCGGCG
GATAAGCTGT TCCCGTTCTT TGGCTTTACC GCCGAGAATA TTGTGGCAAA AGCGCATAAG
GTGCTGGGAG TAAAAGGTGC CTGA
 
Protein sequence
MSRKDLANAI RALSMDAVQK ANSGHPGAPM GMADIAEVLW NDFLKHNPTD PTWYDRDRFI 
LSNGHASMLL YSLLHLTGYD LPLEELKNFR QLHSKTPGHP EIGYTPGVET TTGPLGQGLA
NAVGLAIAER TLAAQFNQPD HEIVDHFTYV FMGDGCLMEG ISHEVCSLAG TLGLGKLIGF
YDHNGISIDG ETEGWFTDDT AKRFEAYHWH VIHEIDGHDP QAVKEAILEA QSVKDKPSLI
ICRTVIGFGS PNKAGKEEAH GAPLGEEEVA LVRQKLGWHH PPFEIPKDIY HAWDAREKGE
KAQQSWNEKF AAYKKAHPQL AEEFTRRMSG GLPKDWEKTT QKYINELQAN PAKIATRKAS
QNTLNAYGPM LPELLGGSAD LAPSNLTIWK GSVSLKEDPA GNYIHYGVRE FGMTAIANGI
AHHGGFVPYT ATFLMFVEYA RNAARMAALM KARQIMVYTH DSIGLGEDGP THQAVEQLAS
LRLTPNFSTW RPCDQVEAAV GWKLAVERHN GPTALILSRQ NLAQVERTPD QVKEIARGGY
VLKDSGGKPD IILIATGSEM EITLQAAEKL AGEGRNVRVV SLPSTDIFDA QDEEYRESVL
PSNVAARVAV EAGIADYWYK YVGLKGAIVG MTGYGESAPA DKLFPFFGFT AENIVAKAHK
VLGVKGA
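
The correspondence between the gene sequence and the protein sequence above can be spot-checked by translating a few codons. This is a minimal sketch, not the method used to produce this record: the `translate` helper and the constant names are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares for all non-start codons.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout.
BASES = "TCAG"
# Amino-acid assignments for the 64 codons in that order ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in this record
    can be pasted in directly.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence above.
first_block = translate("ATGTCCCGAA AAGACCTTGC CAATGCGATT")
last_block = translate("GTGCTGGGAG TAAAAGGTGC CTGA")
print(first_block, last_block)
```

The two fragments translate to the first ten and last seven residues of the listed protein sequence, which suggests the two sequences are in the same reading frame from start to stop.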