Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3077 |
Symbol | tktA |
ID | 6144686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3165813 |
End bp | 3167804 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641617945 |
Product | transketolase |
Protein accession | YP_001745096 |
Protein GI | 170681853 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0021] Transketolase |
TIGRFAM ID | [TIGR00232] transketolase, bacterial and yeast |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.00342437 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCTCAC GTAAAGAGCT TGCCAATGCT ATTCGTGCGC TGAGCATGGA CGCAGTACAG AAAGCCAAAT CCGGTCACCC GGGTGCCCCT ATGGGTATGG CTGACATTGC CGAAGTCCTG TGGCGTGATT TCCTGAAACA CAACCCGCAG AATCCGTCCT GGGCTGACCG TGACCGCTTC GTGCTGTCCA ACGGCCACGG CTCCATGCTG ATCTACAGCC TGCTGCACCT CACCGGTTAC GATCTGCCGA TGGAAGAACT GAAAAACTTC CGTCAGCTGC ACTCTAAAAC TCCGGGCCAC CCGGAAGTGG GTTACACCGC TGGTGTGGAA ACCACTACAG GTCCGCTGGG TCAGGGTATT GCCAACGCAG TCGGTATGGC GATTGCAGAA AAAACGCTGG CGGCGCAGTT TAACCGTCCG GGCCACGACA TCGTCGACCA CTACACCTAC GCCTTCATGG GCGACGGCTG CATGATGGAA GGCATCTCCC ACGAAGTTTG TTCTCTGGCG GGTACGCTGA AGCTGGGTAA ACTGATTGCG TTCTACGATG ACAACGGTAT CTCTATCGAT GGTCACGTTG AAGGCTGGTT CACCGACGAC ACCGCAATGC GTTTCGAAGC TTACGGCTGG CACGTTATTC GCGACATCGA CGGTCATGAC GCGGCATCCA TCAAACGCGC AGTAGAAGAA GCGCGCGCAG TGACTGACAA ACCGTCCCTG CTGATGTGCA AAACCATCAT CGGTTTCGGT TCCCCGAACA AAGCCGGTAC GCATGACTCC CACGGTGCGC CGCTGGGCGA TGCTGAAATC GCCCTGACCC GCGAACAGCT GGGCTGGAAA TATGCACCGT TCGAAATCCC GTCTGAAATC TATGCTCAGT GGGATGCGAA AGAAGCAGGC CAGGCGAAAG AATCCGCATG GAACGAGAAA TTCGCTGCTT ACGCGAAAGC TTATCCGCAG GAAGCCGCTG AATTTACCCG CCGTATGAAA GGCGAAATGC CGTCTGACTT CGACGCCAAA GCGAAAGAAT TCATCGCTAA ACTGCAGGCT AATCCGGCGA AAATCGCCAG TCGTAAAGCG TCTCAGAATG CGATTGAAGC GTTCGGTCCG CTGTTGCCGG AATTCCTCGG CGGCTCTGCT GACCTGGCAC CGTCTAACCT GACCCTGTGG TCTGGTTCTA AAGCAATCAA CGAAGATGCT GCGGGTAACT ACATCCACTA CGGTGTTCGC GAGTTCGGTA TGACCGCGAT TGCTAACGGT ATCTCCCTGC ACGGTGGTTT CCTGCCGTAC ACCTCCACCT TCCTGATGTT CGTGGAATAC GCACGTAACG CCGTACGTAT GGCTGCGCTG ATGAAACAGC GTCAGGTGAT GGTTTACACC CACGACTCCA TCGGTCTGGG CGAAGATGGC CCGACTCACC AGCCGGTTGA GCAGGTCGCT TCACTGCGCG TGACCCCGAA CATGTCTACA TGGCGTCCAT GTGACCAGGT TGAATCTGCA GTGGCATGGA AATACGGCGT TGAGCGTCAG GACGGCCCGA CCGCGCTGAT CCTCTCCCGT CAGAACCTGG CGCAGCAGGA ACGTACTGAA GAGCAGCTGG CAAACATCGC GCGCGGTGGT TATGTGCTGA AAGACTGTGC CGGTCAGCCG GAACTGATCT TCATCGCGAC CGGTTCGGAA GTTGAACTGG CTGTTGCCGC CTACGAAAAA CTGACAGCCG AAGGCGTGAA GGCGCGCGTG GTTTCCATGC CGTCTACCGA CGCGTTCGAC AAGCAGGATG CGGCTTACCG TGAATCCGTA CTGCCGAAAG CGGTTACTGC ACGCGTTGCG GTAGAAGCGG GTATTGCTGA CTACTGGTAC AAGTATGTTG GCCTGAACGG CGCTATCGTC GGCATGACCA CCTTCGGTGA ATCTGCTCCG GCAGAACTGC TGTTTGAAGA GTTCGGCTTC ACCGTTGACA ACGTTGTTGC GAAAGCCAAA GCGCTGCTGT AA
|
Protein sequence | MSSRKELANA IRALSMDAVQ KAKSGHPGAP MGMADIAEVL WRDFLKHNPQ NPSWADRDRF VLSNGHGSML IYSLLHLTGY DLPMEELKNF RQLHSKTPGH PEVGYTAGVE TTTGPLGQGI ANAVGMAIAE KTLAAQFNRP GHDIVDHYTY AFMGDGCMME GISHEVCSLA GTLKLGKLIA FYDDNGISID GHVEGWFTDD TAMRFEAYGW HVIRDIDGHD AASIKRAVEE ARAVTDKPSL LMCKTIIGFG SPNKAGTHDS HGAPLGDAEI ALTREQLGWK YAPFEIPSEI YAQWDAKEAG QAKESAWNEK FAAYAKAYPQ EAAEFTRRMK GEMPSDFDAK AKEFIAKLQA NPAKIASRKA SQNAIEAFGP LLPEFLGGSA DLAPSNLTLW SGSKAINEDA AGNYIHYGVR EFGMTAIANG ISLHGGFLPY TSTFLMFVEY ARNAVRMAAL MKQRQVMVYT HDSIGLGEDG PTHQPVEQVA SLRVTPNMST WRPCDQVESA VAWKYGVERQ DGPTALILSR QNLAQQERTE EQLANIARGG YVLKDCAGQP ELIFIATGSE VELAVAAYEK LTAEGVKARV VSMPSTDAFD KQDAAYRESV LPKAVTARVA VEAGIADYWY KYVGLNGAIV GMTTFGESAP AELLFEEFGF TVDNVVAKAK ALL
|
| |