Gene EcHS_A3093 details


Gene Information

Locus tag: EcHS_A3093
Symbol: tkt
ID: 5593908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3107271
End bp: 3109262
Gene Length: 1992 bp
Protein Length: 663 aa
Translation table: 11
GC content: 56%
IMG OID: 640922212
Product: transketolase
Protein accession: YP_001459712
Protein GI: 157162394
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0021] Transketolase
TIGRFAM ID: [TIGR00232] transketolase, bacterial and yeast
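
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both are assumptions that the listed values happen to be consistent with. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3107271, 3109262)
print(length)                     # 1992
print(protein_length_aa(length))  # 663
```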


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCTCAC GTAAAGAGCT TGCCAATGCT ATTCGTGCGC TGAGCATGGA CGCAGTACAG 
AAAGCCAAAT CCGGTCACCC GGGTGCCCCT ATGGGTATGG CTGACATTGC CGAAGTCCTG
TGGCGTGATT TCCTGAAACA CAACCCGCAG AATCCGTCCT GGGCTGACCG TGACCGCTTC
GTGCTGTCCA ACGGCCACGG CTCCATGCTG ATCTACAGCC TGCTGCACCT CACCGGTTAC
GATCTGCCGA TGGAAGAACT GAAAAACTTC CGTCAGCTGC ACTCTAAAAC TCCGGGCCAC
CCGGAAGTAG GTTATACCGC TGGTGTGGAA ACCACCACCG GTCCGCTGGG TCAGGGTATT
GCCAACGCAG TCGGTATGGC GATTGCAGAA AAAACGCTGG CGGCGCAGTT TAACCGTCCA
GGTCACGACA TTGTCGACCA CTACACCTAC GCCTTCATGG GCGACGGCTG CATGATGGAA
GGCATCTCCC ACGAAGTTTG CTCTCTGGCG GGTACGCTGA AGCTGGGTAA ACTGATTGCG
TTCTACGATG ACAACGGTAT CTCAATCGAT GGTCACGTTG AAGGCTGGTT CACTGACGAC
ACCGCAATGC GTTTCGAAGC TTACGGCTGG CACGTTATTC GCGACATCGA CGGTCATGAC
GCGGCATCCA TCAAACGCGC AGTAGAAGAA GCGCGCGCAG TGACTGACAA ACCGTCCCTG
CTGATGTGCA AAACCATCAT CGGTTTCGGT TCCCCGAACA AAGCCGGTAC CCACGACTCC
CACGGTGCGC CGCTGGGCGA CGCTGAAATT GCCCTGACCC GCGAACAGCT GGGCTGGAAA
TACGCGCCGT TCGAAATCCC GTCTGAAATC TATGCTCAGT GGGATGCGAA AGAAGCAGGC
CAGGCGAAAG AATCTGCATG GAATGAGAAG TTTGCGGCTT ACGCGAAAGC TTATCCGCAG
GAAGCGGCTG AATTTACCCG CCGTATGAAA GGCGAAATGC CGTCTGACTT CGACGCCAAA
GCGAAAGAGT TTATCGCTAA ACTGCAGGCT AATCCGGCGA AAATCGCCAG CCGTAAAGCG
TCGCAGAATG CTATCGAAGC GTTCGGCCCG CTGTTGCCTG AATTCCTCGG CGGCTCTGCT
GACCTGGCAC CGTCTAACCT GACCCTGTGG TCTGGTTCTA AAGCAATCAA CGAAGATGCT
GCAGGTAACT ACATCCACTA CGGTGTTCGC GAGTTCGGTA TGACCGCGAT TGCTAACGGT
ATCTCCCTGC ACGGTGGTTT CCTGCCGTAC ACCTCCACCT TCCTGATGTT CGTGGAATAC
GCACGTAACG CCGTACGTAT GGCTGCGCTG ATGAAACAGC GTCAGGTGAT GGTTTACACC
CACGACTCCA TCGGTCTGGG CGAAGATGGC CCGACTCACC AGCCGGTTGA GCAGGTCGCT
TCTCTGCGCG TGACCCCGAA CATGTCTACA TGGCGTCCGT GTGACCAGGT TGAATCCGCG
GTCGCGTGGA AATACGGCGT TGAGCGTCAG GACGGCCCGA CTGCGCTTAT CCTCTCCCGT
CAGAACCTGG CGCAGCAGGA ACGAACTGAA GAGCAACTGG CAAACATCGC GCGCGGTGGT
TATGTGCTGA AAGACTGCGC CGGTCAGCCG GAACTGATTT TCATCGCTAC CGGTTCAGAA
GTTGAACTGG CTGTTGCTGC CTACGAAAAA CTGACTGCCG AAGGCGTGAA AGCGCGCGTG
GTGTCCATGC CGTCTACCGA CGCATTTGAC AAGCAGGATG CTGCTTACCG TGAATCCGTA
CTGCCGAAAG CGGTTACTGC ACGCGTTGCT GTAGAAGCGG GTATTGCTGA CTACTGGTAC
AAGTATGTTG GCCTGAACGG TGCTATCGTC GGTATGACCA CCTTCGGTGA ATCTGCTCCG
GCAGAGCTGC TGTTTGAAGA GTTCGGCTTC ACTGTTGATA ACGTTGTTGC GAAAGCAAAA
GAACTGCTGT AA
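
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before it can be measured. A minimal sketch, using hypothetical helper names, that joins the blocks and computes a GC fraction; it is demonstrated on the first line of the listing only. Applied to the whole listing, the joined length should come to the 1992 bp stated in the record, and the GC fraction can be compared with the listed 56%.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


first_line = "ATGTCCTCAC GTAAAGAGCT TGCCAATGCT ATTCGTGCGC TGAGCATGGA CGCAGTACAG"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```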
 
Protein sequence
MSSRKELANA IRALSMDAVQ KAKSGHPGAP MGMADIAEVL WRDFLKHNPQ NPSWADRDRF 
VLSNGHGSML IYSLLHLTGY DLPMEELKNF RQLHSKTPGH PEVGYTAGVE TTTGPLGQGI
ANAVGMAIAE KTLAAQFNRP GHDIVDHYTY AFMGDGCMME GISHEVCSLA GTLKLGKLIA
FYDDNGISID GHVEGWFTDD TAMRFEAYGW HVIRDIDGHD AASIKRAVEE ARAVTDKPSL
LMCKTIIGFG SPNKAGTHDS HGAPLGDAEI ALTREQLGWK YAPFEIPSEI YAQWDAKEAG
QAKESAWNEK FAAYAKAYPQ EAAEFTRRMK GEMPSDFDAK AKEFIAKLQA NPAKIASRKA
SQNAIEAFGP LLPEFLGGSA DLAPSNLTLW SGSKAINEDA AGNYIHYGVR EFGMTAIANG
ISLHGGFLPY TSTFLMFVEY ARNAVRMAAL MKQRQVMVYT HDSIGLGEDG PTHQPVEQVA
SLRVTPNMST WRPCDQVESA VAWKYGVERQ DGPTALILSR QNLAQQERTE EQLANIARGG
YVLKDCAGQP ELIFIATGSE VELAVAAYEK LTAEGVKARV VSMPSTDAFD KQDAAYRESV
LPKAVTARVA VEAGIADYWY KYVGLNGAIV GMTTFGESAP AELLFEEFGF TVDNVVAKAK
ELL
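
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the NCBI base order TCAG; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation, which does not matter here because this gene starts with ATG. The `translate` helper is hypothetical, stops at the first stop codon, and does not handle ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a blocked DNA listing codon by codon, stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence gives the first 20 residues of the protein.
print(translate("ATGTCCTCAC GTAAAGAGCT TGCCAATGCT ATTCGTGCGC TGAGCATGGA CGCAGTACAG"))
# MSSRKELANAIRALSMDAVQ

# Last line of the gene sequence gives the final residues, then the TAA stop.
print(translate("GAACTGCTGT AA"))  # ELL
```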