Gene ECH74115_4237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4237 
SymboltktA 
ID6971447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3924652 
End bp3926643 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content56% 
IMG OID643387975 
Producttransketolase 
Protein accessionYP_002272414 
Protein GI209400304 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCAC GTAAAGAGCT TGCCAATGCT ATTCGTGCGC TGAGCATGGA CGCAGTACAG 
AAAGCCAAAT CCGGTCACCC GGGTGCCCCT ATGGGTATGG CTGACATTGC CGAAGTCCTG
TGGCGTGATT TCCTGAAACA TAACCCGCAG AATCCGTCCT GGGCTGACCG TGACCGCTTC
GTGCTGTCCA ACGGCCACGG CTCCATGCTG ATCTACAGCC TGCTGCACCT CACCGGTTAC
GATCTGCCGA TGGAAGAACT GAAAAACTTC CGTCAGCTGC ACTCTAAAAC TCCGGGCCAC
CCGGAAGTGG GTTACACCGC TGGTGTGGAA ACCACCACCG GTCCGCTGGG GCAGGGTATT
GCCAACGCAG TAGGTATGGC GATTGCAGAA AAAACGCTGG CGGCGCAGTT TAACCGTCCG
GGCCACGACA TTGTCGACCA CTACACCTAC GCCTTCATGG GCGACGGCTG CATGATGGAA
GGCATCTCCC ACGAAGTTTG CTCTCTGGCG GGTACGCTGA AGCTGGGTAA ACTGATTGCG
TTCTACGATG ACAACGGTAT TTCTATCGAT GGTCACGTTG AAGGCTGGTT CACCGACGAC
ACCGCAATGC GTTTCGAAGC TTACGGCTGG CACGTTATTC GCGACATCGA CGGTCATGAC
GCAGCATCCA TCAAACGCGC AGTAGAAGAA GCGCGCGCAG TGACTGACAA ACCGTCCCTG
CTGATGTGCA AAACCATCAT CGGTTTCGGT TCCCCGAACA AAGCCGGTAC TCACGACTCC
CACGGTGCGC CGCTGGGTGA TGCAGAAATC GCTCTGACCC GCGAACAGCT GGGCTGGAAA
TACGCACCGT TCGAAATCCC GTCTGAAATC TATGCGCAGT GGGATGCGAA AGAAGCAGGC
CAGGCGAAAG AATCTGCATG GAATGAGAAG TTTGCGGCTT ACGCGAAAGC TTATCCGCAG
GAAGCGGCTG AATTTACCCG CCGTATGAAA GGCGAAATGC CGTCTGACTT CGACGCCAAA
GCGAAAGAGT TTATCGCTAA ACTGCAGGCT AATCCGGCGA AAATCGCCAG CCGTAAAGCG
TCTCAGAATG CTATCGAAGC GTTCGGCCCG CTGTTGCCGG AATTCCTCGG CGGCTCCGCT
GACCTGGCAC CGTCTAACCT GACCCTGTGG TCTGGTTCTA AAGCAATCAA CGAAGATGCT
GCGGGTAACT ACATCCACTA CGGTGTTCGC GAGTTCGGTA TGACCGCGAT TGCTAACGGT
ATCTCCCTGC ACGGTGGCTT CCTGCCGTAC ACCTCCACCT TCCTGATGTT TGTCGAATAC
GCACGTAACG CCGTACGTAT GGCTGCGCTG ATGAAACAGC GTCAGGTTAT GGTTTACACC
CACGACTCCA TCGGTCTGGG CGAAGATGGC CCGACTCACC AGCCGGTTGA GCAGGTTGCT
TCTCTGCGCG TGACCCCGAA CATGTCTACA TGGCGTCCGT GTGACCAGGT TGAATCCGCG
GTCGCGTGGA AATACGGTGT TGAGCGTCAG GACGGTCCGA CCGCGCTTAT CCTCTCCCGT
CAGAACCTGG CGCAGCAGGA ACGTACTGAA GAGCAACTGG CAAACATCGC GCGCGGTGGT
TATGTGCTGA AAGACTGTGC TGGTCAGCCG GAACTGATCT TCATCGCGAC CGGTTCGGAA
GTTGAACTGG CTGTTGCCGC CTACGAAAAA CTGACTGCCG AAGGCGTGAA GGCGCGCGTG
GTTTCCATGC CGTCTACCGA TGCATTCGAC AAGCAGGATG CGGCTTACCG TGAATCCGTA
CTGCCGAAAG CGGTTACTGC ACGCGTTGCG GTAGAAGCGG GTATTGCTGA CTACTGGTAC
AAGTATGTTG GCCTGAATGG CGCTATCATC GGTATGACCA CCTTCGGTGA GTCAGCTCCG
GCAGAGTTGC TGTTCGAAGA GTTCGGCTTC ACCGTCGACA ACGTCGTTGC GAAAGCAAAA
GAACTGCTGT AA
 
Protein sequence
MSSRKELANA IRALSMDAVQ KAKSGHPGAP MGMADIAEVL WRDFLKHNPQ NPSWADRDRF 
VLSNGHGSML IYSLLHLTGY DLPMEELKNF RQLHSKTPGH PEVGYTAGVE TTTGPLGQGI
ANAVGMAIAE KTLAAQFNRP GHDIVDHYTY AFMGDGCMME GISHEVCSLA GTLKLGKLIA
FYDDNGISID GHVEGWFTDD TAMRFEAYGW HVIRDIDGHD AASIKRAVEE ARAVTDKPSL
LMCKTIIGFG SPNKAGTHDS HGAPLGDAEI ALTREQLGWK YAPFEIPSEI YAQWDAKEAG
QAKESAWNEK FAAYAKAYPQ EAAEFTRRMK GEMPSDFDAK AKEFIAKLQA NPAKIASRKA
SQNAIEAFGP LLPEFLGGSA DLAPSNLTLW SGSKAINEDA AGNYIHYGVR EFGMTAIANG
ISLHGGFLPY TSTFLMFVEY ARNAVRMAAL MKQRQVMVYT HDSIGLGEDG PTHQPVEQVA
SLRVTPNMST WRPCDQVESA VAWKYGVERQ DGPTALILSR QNLAQQERTE EQLANIARGG
YVLKDCAGQP ELIFIATGSE VELAVAAYEK LTAEGVKARV VSMPSTDAFD KQDAAYRESV
LPKAVTARVA VEAGIADYWY KYVGLNGAII GMTTFGESAP AELLFEEFGF TVDNVVAKAK
ELL