Gene Information |
Locus tag | ECH74115_4237 |
Symbol | tktA |
ID | 6971447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3924652 |
End bp | 3926643 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643387975 |
Product | transketolase |
Protein accession | YP_002272414 |
Protein GI | 209400304 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0021] Transketolase |
TIGRFAM ID | [TIGR00232] transketolase, bacterial and yeast |
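The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the CDS is unspliced and ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3924652
END_BP = 3926643
REPORTED_GENE_LENGTH_BP = 1992
REPORTED_PROTEIN_LENGTH_AA = 663


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == REPORTED_GENE_LENGTH_BP)        # True
print(computed_protein_length == REPORTED_PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the reported values are mutually consistent: 3926643 - 3924652 + 1 = 1992 bp, and 1992 / 3 - 1 = 663 aa.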
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTCAC GTAAAGAGCT TGCCAATGCT ATTCGTGCGC TGAGCATGGA CGCAGTACAG AAAGCCAAAT CCGGTCACCC GGGTGCCCCT ATGGGTATGG CTGACATTGC CGAAGTCCTG TGGCGTGATT TCCTGAAACA TAACCCGCAG AATCCGTCCT GGGCTGACCG TGACCGCTTC GTGCTGTCCA ACGGCCACGG CTCCATGCTG ATCTACAGCC TGCTGCACCT CACCGGTTAC GATCTGCCGA TGGAAGAACT GAAAAACTTC CGTCAGCTGC ACTCTAAAAC TCCGGGCCAC CCGGAAGTGG GTTACACCGC TGGTGTGGAA ACCACCACCG GTCCGCTGGG GCAGGGTATT GCCAACGCAG TAGGTATGGC GATTGCAGAA AAAACGCTGG CGGCGCAGTT TAACCGTCCG GGCCACGACA TTGTCGACCA CTACACCTAC GCCTTCATGG GCGACGGCTG CATGATGGAA GGCATCTCCC ACGAAGTTTG CTCTCTGGCG GGTACGCTGA AGCTGGGTAA ACTGATTGCG TTCTACGATG ACAACGGTAT TTCTATCGAT GGTCACGTTG AAGGCTGGTT CACCGACGAC ACCGCAATGC GTTTCGAAGC TTACGGCTGG CACGTTATTC GCGACATCGA CGGTCATGAC GCAGCATCCA TCAAACGCGC AGTAGAAGAA GCGCGCGCAG TGACTGACAA ACCGTCCCTG CTGATGTGCA AAACCATCAT CGGTTTCGGT TCCCCGAACA AAGCCGGTAC TCACGACTCC CACGGTGCGC CGCTGGGTGA TGCAGAAATC GCTCTGACCC GCGAACAGCT GGGCTGGAAA TACGCACCGT TCGAAATCCC GTCTGAAATC TATGCGCAGT GGGATGCGAA AGAAGCAGGC CAGGCGAAAG AATCTGCATG GAATGAGAAG TTTGCGGCTT ACGCGAAAGC TTATCCGCAG GAAGCGGCTG AATTTACCCG CCGTATGAAA GGCGAAATGC CGTCTGACTT CGACGCCAAA GCGAAAGAGT TTATCGCTAA ACTGCAGGCT AATCCGGCGA AAATCGCCAG CCGTAAAGCG TCTCAGAATG CTATCGAAGC GTTCGGCCCG CTGTTGCCGG AATTCCTCGG CGGCTCCGCT GACCTGGCAC CGTCTAACCT GACCCTGTGG TCTGGTTCTA AAGCAATCAA CGAAGATGCT GCGGGTAACT ACATCCACTA CGGTGTTCGC GAGTTCGGTA TGACCGCGAT TGCTAACGGT ATCTCCCTGC ACGGTGGCTT CCTGCCGTAC ACCTCCACCT TCCTGATGTT TGTCGAATAC GCACGTAACG CCGTACGTAT GGCTGCGCTG ATGAAACAGC GTCAGGTTAT GGTTTACACC CACGACTCCA TCGGTCTGGG CGAAGATGGC CCGACTCACC AGCCGGTTGA GCAGGTTGCT TCTCTGCGCG TGACCCCGAA CATGTCTACA TGGCGTCCGT GTGACCAGGT TGAATCCGCG GTCGCGTGGA AATACGGTGT TGAGCGTCAG GACGGTCCGA CCGCGCTTAT CCTCTCCCGT CAGAACCTGG CGCAGCAGGA ACGTACTGAA GAGCAACTGG CAAACATCGC GCGCGGTGGT TATGTGCTGA AAGACTGTGC TGGTCAGCCG GAACTGATCT TCATCGCGAC CGGTTCGGAA GTTGAACTGG CTGTTGCCGC CTACGAAAAA CTGACTGCCG AAGGCGTGAA GGCGCGCGTG GTTTCCATGC CGTCTACCGA TGCATTCGAC AAGCAGGATG CGGCTTACCG TGAATCCGTA 
CTGCCGAAAG CGGTTACTGC ACGCGTTGCG GTAGAAGCGG GTATTGCTGA CTACTGGTAC AAGTATGTTG GCCTGAATGG CGCTATCATC GGTATGACCA CCTTCGGTGA GTCAGCTCCG GCAGAGTTGC TGTTCGAAGA GTTCGGCTTC ACCGTCGACA ACGTCGTTGC GAAAGCAAAA GAACTGCTGT AA
|
Protein sequence | MSSRKELANA IRALSMDAVQ KAKSGHPGAP MGMADIAEVL WRDFLKHNPQ NPSWADRDRF VLSNGHGSML IYSLLHLTGY DLPMEELKNF RQLHSKTPGH PEVGYTAGVE TTTGPLGQGI ANAVGMAIAE KTLAAQFNRP GHDIVDHYTY AFMGDGCMME GISHEVCSLA GTLKLGKLIA FYDDNGISID GHVEGWFTDD TAMRFEAYGW HVIRDIDGHD AASIKRAVEE ARAVTDKPSL LMCKTIIGFG SPNKAGTHDS HGAPLGDAEI ALTREQLGWK YAPFEIPSEI YAQWDAKEAG QAKESAWNEK FAAYAKAYPQ EAAEFTRRMK GEMPSDFDAK AKEFIAKLQA NPAKIASRKA SQNAIEAFGP LLPEFLGGSA DLAPSNLTLW SGSKAINEDA AGNYIHYGVR EFGMTAIANG ISLHGGFLPY TSTFLMFVEY ARNAVRMAAL MKQRQVMVYT HDSIGLGEDG PTHQPVEQVA SLRVTPNMST WRPCDQVESA VAWKYGVERQ DGPTALILSR QNLAQQERTE EQLANIARGG YVLKDCAGQP ELIFIATGSE VELAVAAYEK LTAEGVKARV VSMPSTDAFD KQDAAYRESV LPKAVTARVA VEAGIADYWY KYVGLNGAII GMTTFGESAP AELLFEEFGF TVDNVVAKAK ELL
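The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons by hand. The sketch below is a minimal example, not the pipeline used to produce this record. It assumes that the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand of the replicon), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The `translate` helper is hypothetical; it strips the space-separated 10-base blocks and stops at the first stop codon.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is removed first; any character other than A, C, G, or T
    raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First three and last two blocks of the gene sequence listed above.
first_blocks = "ATGTCCTCAC GTAAAGAGCT TGCCAATGCT"
last_blocks = "GAACTGCTGT AA"

print(translate(first_blocks))  # MSSRKELANA
print(translate(last_blocks))   # ELL
```

The two outputs match the first block (`MSSRKELANA`) and the final three residues (`ELL`) of the protein sequence in this record.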