Gene EcSMS35_2611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2611 
SymboltktB 
ID6142658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2663937 
End bp2665940 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content54% 
IMG OID641617482 
Producttransketolase 
Protein accessionYP_001744647 
Protein GI170679675 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGAA AAGACCTTGC CAATGCGATT CGCGCACTCA GTATGGATGC GGTACAAAAA 
GCCAATTCTG GTCATCCCGG CGCGCCGATG GGCATGGCTG ATATTGCCGA AGTGCTGTGG
AACGATTTTC TTAAACATAA TCCTACCGAC CCAACCTGGT ATGATCGCGA CCGCTTTATT
CTTTCCAACG GTCACGCGTC GATGCTGCTC TACAGTTTGC TGCATCTGAC CGGTTACGAC
CTGCCGCTGG AAGAACTGAA GAACTTTCGT CAGTTGCATT CGAAAACCCC TGGTCACCCG
GAAATCGGCT ATACGCCTGG TGTTGAAACC ACCACCGGTC CACTTGGACA AGGTTTGGCG
AACGCCGTCG GGCTGGCGAT AGCAGAGCGT ACGCTGGCGG CGCAGTTTAA CCAGCCAGAT
CATGAAATTG TCGATCACTT CACCTATGTG TTTATGGGCG ACGGCTGCCT GATGGAAGGT
ATTTCCCACG AAGTCTGTTC GCTGGCGGGC ACGCTGGGAC TGGGCAAGCT GATTGGTTTT
TACGATCACA ACGGTATTTC CATCGACGGT GAAACCGAAG GCTGGTTTAC CGACGATACA
GCAAAACGTT TTGAAGCCTA TCACTGGCAT GTGATCCACG AGATCGACGG CCACGATCCG
CAGGCGGTGA AGGAAGCGAT CCTTGAAGCG CAAAGCGTGA AAGATAAGCC GTCGCTGATT
ATCTGCCGTA CGGTGATTGG CTTTGGTTCG CCGAATAAAG CAGGTAAGGA AGAGGCGCAC
GGCGCACCAC TGGGGGAAGA AGAAGTGGCG CTGGCACGGC AAAAACTGGG CTGGCACCAT
CCGCCATTTG AGATCCCTAA AGAGATTTAT CACGCCTGGG ATGCCCGCGA AAAAGGCGAA
AAAGCGCAGC AGAGCTGGAA TGAGAAGTTT GCCGCCTATA AAAAGGCTCA TCCGCAACTG
GCAGAAGAGT TTACCCGTCG GATGAGCGGT GGTTTACCGA AGGACTGGGA GAAAACGACT
CAGAAATATA TCAATGAGTT GCAGGCGAAT CCGGCGAAAA TCGCTACCCG TAAGGCTTCG
CAAAATACGC TTAACGCTTA CGGGCCTATG CTACCGGAGC TGCTCGGCGG TTCGGCGGAT
CTGGCTCCCA GCAACCTGAC CATCTGGAAA GGTTCTGTTT CGCTGAAGGA AGATCCGGCG
GGCAACTACA TTCACTACGG GGTGCGTGAA TTTGGCATGA CCGCTATCGC CAACGGCATT
GCGCACCACG GCGGCTTTGT GCCGTATACC GCGACGTTCC TGATGTTTGT TGAATACGCC
CGTAACGCCG CGCGGATGGC GGCACTGATG AAAGCGCGGC AGATTATGGT TTATACCCAC
GACTCAATTG GCCTGGGCGA AGATGGTCCT ACGCACCAGG CTGTTGAGCA ACTGGCCAGT
CTGCGCTTAA CGCCAAATTT CAGCACCTGG CGACCGTGCG ATCAGGTGGA AGCAGCGGTG
GGCTGGAAGC TGGCGGTTGA GCGCCACAAC GGACCGACGG CACTGATCCT CTCCAGGCAG
AATCTGGCCC AGGTGGAACG TACGCCGGAT CAGGTTAAAG AGATTGCTCG TGGTGGTTAT
GTGTTGAAAG ACAGCGGCGG TAAGCCAGAT ATTATTTTGA TTGCCACCGG TTCAGAGATG
GAAATCACCC TGCAAGCGGC AGAGAAATTA GCGGGAGAAG GTCGCAATGT ACGCGTAGTT
TCCCTGCCCT CGACCGATAT TTTCGACGCC CAGGATGAGG AATATCGGGA GTCGGTGTTG
CCTTCTAACG TTGCGGCTCG CGTAGCGGTG GAAGCAGGCA TCGCCGATTA CTGGTACAAG
TATGTTGGTC TGAAAGGGGC AATTGTCGGG ATGACGGGTT ACGGGGAATC TGCTCCGGCG
GATAAGCTGT TCCCGTTCTT TGGCTTTACC GCCGAGAATA TTGTGGCAAA AGCGCATAAG
GTGCTGGGAG TGAAAGGTGC CTGA
 
Protein sequence
MSRKDLANAI RALSMDAVQK ANSGHPGAPM GMADIAEVLW NDFLKHNPTD PTWYDRDRFI 
LSNGHASMLL YSLLHLTGYD LPLEELKNFR QLHSKTPGHP EIGYTPGVET TTGPLGQGLA
NAVGLAIAER TLAAQFNQPD HEIVDHFTYV FMGDGCLMEG ISHEVCSLAG TLGLGKLIGF
YDHNGISIDG ETEGWFTDDT AKRFEAYHWH VIHEIDGHDP QAVKEAILEA QSVKDKPSLI
ICRTVIGFGS PNKAGKEEAH GAPLGEEEVA LARQKLGWHH PPFEIPKEIY HAWDAREKGE
KAQQSWNEKF AAYKKAHPQL AEEFTRRMSG GLPKDWEKTT QKYINELQAN PAKIATRKAS
QNTLNAYGPM LPELLGGSAD LAPSNLTIWK GSVSLKEDPA GNYIHYGVRE FGMTAIANGI
AHHGGFVPYT ATFLMFVEYA RNAARMAALM KARQIMVYTH DSIGLGEDGP THQAVEQLAS
LRLTPNFSTW RPCDQVEAAV GWKLAVERHN GPTALILSRQ NLAQVERTPD QVKEIARGGY
VLKDSGGKPD IILIATGSEM EITLQAAEKL AGEGRNVRVV SLPSTDIFDA QDEEYRESVL
PSNVAARVAV EAGIADYWYK YVGLKGAIVG MTGYGESAPA DKLFPFFGFT AENIVAKAHK
VLGVKGA