Gene EcHS_A2594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2594 
SymboltktB 
ID5595512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2601543 
End bp2603546 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content55% 
IMG OID640921715 
Producttransketolase 
Protein accessionYP_001459242 
Protein GI157161924 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCGAA AAGACCTTGC CAATGCGATT CGCGCACTCA GTATGGATGC GGTACAAAAA 
GCCAACTCTG GTCATCCCGG CGCGCCGATG GGCATGGCTG ATATTGCCGA AGTGCTGTGG
AACGATTTTC TTAAACATAA CCCTACCGAC CCAACCTGGT ATGATCGCGA CCGCTTTATT
CTTTCCAACG GTCACGCGTC GATGCTGCTC TACAGTTTGC TACATCTGAC CGGTTACGAC
CTGCCGCTGG AAGAACTGAA GAACTTCCGT CAGTTGCATT CGAAAACCCC AGGCCACCCG
GAGATTGGCT ATACGCCAGG CGTTGAAACC ACCACCGGCC CGCTTGGACA AGGTTTGGCG
AACGCCGTCG GGCTGGCGAT AGCAGAGCGT ACACTGGCGG CGCAGTTTAA CCAGCCAGAC
CATGAGATCG TCGATCACTT CACCTATGTG TTTATGGGCG ACGGCTGCCT GATGGAAGGT
ATTTCCCACG AAGTCTGTTC GCTGGCAGGC ACGCTGGGAC TGGGCAAGCT GATTGGTTTT
TACGATCACA ACGGTATTTC CATCGACGGT GAAACAGAAG GCTGGTTTAC CGACGATACG
GCAAAACGTT TTGAAGCCTA TCACTGGCAT GTGATCCATG AAATCGACGG TCACGATCCG
CAGGCGGTGA AGGAAGCGAT CCTTGAAGCG CAAAGCGTGA AAGATAAGCC GTCGCTGATT
ATCTGCCGTA CGGTGATTGG CTTTGGTTCG CCGAATAAAG CAGGTAAGGA AGAGGCGCAC
GGCGCACCAC TGGGGGAAGA AGAAGTGGCG CTGGCACGGC AAAAACTGGG CTGGCACCAT
CCGCCATTTG AGATCCCTAA AGAGATTTAT CACGCCTGGG ATGCCCGTGA AAAAGGCGAA
AAAGCGCAGC AGAGCTGGAA TGAGAAGTTT GCCGCCTATA AAAAGGCTCA TCCGCAACTG
GCAGAAGAGT TTACCCGACG GATGAGCGGT GGTTTACCGA AGGACTGGGA GAAAACGACT
CAGAAATATA TCAATGAGTT ACAGGCAAAT CCGGCGAAAA TCGCTACCCG TAAGGCTTCG
CAAAATACGC TTAACGCTTA CGGGCCGATG CTGCCTGAGT TGCTCGGCGG TTCGGCGGAT
CTGGCTCCCA GCAACCTGAC CATCTGGAAA GGTTCTGTTT CGCTGAAGGA AGATCCAGCG
GGCAACTACA TTCACTACGG GGTGCGTGAA TTTGGCATGA CCGCTATCGC CAACGGCATC
GAGCACCACG GCGGCTTTGT GCCGTATACC GCGACGTTCC TGATGTTTGT TGAATACGCC
CGTAACGCCG CGCGGATGGC GGCACTGATG AAAGCGCGGC AGATTATGGT TTATACCCAC
GACTCAATTG GCCTGGGCGA AGATGGTCCG ACGCACCAGG CTGTTGAGCA ACTGGCCAGC
CTGCGCTTAA CGCCAAATTT CAGCACCTGG CGACCGTGCG ATCAGGTGGA AGCGGCGGTG
GGCTGGAAGC TGGCGGTTGA GCGCCACAAC GGACCGACGG CACTGATCCT CTCAAGGCAG
AATCTGGCCC AGGTGGAACG TACGCCGGAT CAGGTTAAAG AGATTGCTCG TGGCGGTTAT
GTGCTGAAAG ACAGCGGCGG TAAGCCAGAT ATTATTCTGA TTGCCACCGG TTCAGAGATG
GAAATTACCC TGCAAGCGGC AGAGAAATTA GCAGGAGAAG GTCGCAATGT ACGCGTAGTT
TCCCTGCCCT CGACCGATAT TTTCGACGCC CAGGATGAGG AATATCGGGA GTCGGTGTTG
CCTTCTAACG TTGCGGCTCG CGTGGCGGTG GAAGCAGGTA TTGCCGATTA CTGGTACAAG
TATGTTGGTC TGAAAGGGGC AATTGTCGGG ATGACGGGTT ACGGGGAATC TGCTCCGGCG
GATAAGCTGT TCCCGTTCTT TGGCTTTACC GCCGAGAATA TTGTGGCAAA AGCGCATAAG
GTGCTGGGAG TGAAAGGTGC CTGA
 
Protein sequence
MSRKDLANAI RALSMDAVQK ANSGHPGAPM GMADIAEVLW NDFLKHNPTD PTWYDRDRFI 
LSNGHASMLL YSLLHLTGYD LPLEELKNFR QLHSKTPGHP EIGYTPGVET TTGPLGQGLA
NAVGLAIAER TLAAQFNQPD HEIVDHFTYV FMGDGCLMEG ISHEVCSLAG TLGLGKLIGF
YDHNGISIDG ETEGWFTDDT AKRFEAYHWH VIHEIDGHDP QAVKEAILEA QSVKDKPSLI
ICRTVIGFGS PNKAGKEEAH GAPLGEEEVA LARQKLGWHH PPFEIPKEIY HAWDAREKGE
KAQQSWNEKF AAYKKAHPQL AEEFTRRMSG GLPKDWEKTT QKYINELQAN PAKIATRKAS
QNTLNAYGPM LPELLGGSAD LAPSNLTIWK GSVSLKEDPA GNYIHYGVRE FGMTAIANGI
EHHGGFVPYT ATFLMFVEYA RNAARMAALM KARQIMVYTH DSIGLGEDGP THQAVEQLAS
LRLTPNFSTW RPCDQVEAAV GWKLAVERHN GPTALILSRQ NLAQVERTPD QVKEIARGGY
VLKDSGGKPD IILIATGSEM EITLQAAEKL AGEGRNVRVV SLPSTDIFDA QDEEYRESVL
PSNVAARVAV EAGIADYWYK YVGLKGAIVG MTGYGESAPA DKLFPFFGFT AENIVAKAHK
VLGVKGA