Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0776 |
Symbol | |
ID | 6066655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 831109 |
End bp | 833100 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641600180 |
Product | transketolase |
Protein accession | YP_001723775 |
Protein GI | 170018821 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0021] Transketolase |
TIGRFAM ID | [TIGR00232] transketolase, bacterial and yeast |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0260346 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTCAC GTAAAGAGCT TGCCAATGCT ATTCGTGCGC TGAGCATGGA CGCAGTACAG AAAGCCAAAT CCGGTCACCC GGGTGCCCCT ATGGGTATGG CTGACATTGC CGAAGTCCTG TGGCGTGATT TCCTGAAACA CAACCCGCAG AATCCGTCCT GGGCTGACCG TGACCGCTTC GTGCTGTCCA ACGGCCACGG CTCCATGCTG ATCTACAGCC TGCTGCACCT CACCGGTTAC GATCTGCCGA TGGAAGAACT GAAAAACTTC CGTCAGCTGC ACTCTAAAAC TCCGGGCCAC CCGGAAGTAG GTTATACCGC TGGTGTGGAA ACCACCACCG GTCCGCTGGG TCAGGGTATT GCCAACGCAG TCGGTATGGC GATTGCAGAA AAAACGCTGG CGGCGCAGTT TAACCGTCCA GGTCACGACA TTGTCGACCA CTACACCTAC GCCTTCATGG GCGACGGCTG CATGATGGAA GGCATCTCCC ACGAAGTTTG CTCTCTGGCG GGTACGCTGA AGCTGGGTAA ACTGATTGCG TTCTACGATG ACAACGGTAT CTCAATCGAT GGTCACGTTG AAGGCTGGTT CACTGACGAC ACCGCAATGC GTTTCGAAGC TTACGGCTGG CACGTTATTC GCGACATCGA CGGTCATGAC GCGGCATCCA TCAAACGCGC AGTAGAAGAA GCGCGCGCAG TGACTGACAA ACCGTCCCTG CTGATGTGCA AAACCATCAT CGGTTTCGGT TCCCCGAACA AAGCCGGTAC CCACGACTCC CACGGTGCGC CGCTGGGCGA CGCTGAAATT GCCCTGACCC GCGAACAGCT GGGCTGGAAA TACGCGCCGT TCGAAATCCC GTCTGAAATC TATGCTCAGT GGGATGCGAA AGAAGCAGGC CAGGCGAAAG AATCTGCATG GAATGAGAAG TTTGCGGCTT ACGCGAAAGC TTATCCGCAG GAAGCGGCTG AATTTACCCG CCGTATGAAA GGCGAAATGC CGTCTGACTT CGACGCCAAA GCGAAAGAGT TTATCGCTAA ACTGCAGGCT AATCCGGCGA AAATCGCCAG CCGTAAAGCG TCGCAGAATG CTATCGAAGC GTTCGGCCCG CTGTTGCCTG AATTCCTCGG CGGCTCTGCT GACCTGGCAC CGTCTAACCT GACCCTGTGG TCTGGTTCTA AAGCAATCAA CGAAGATGCT GCAGGTAACT ACATCCACTA CGGTGTTCGC GAGTTCGGTA TGACCGCGAT TGCTAACGGT ATCTCCCTGC ACGGTGGTTT CCTGCCGTAC ACCTCCACCT TCCTGATGTT CGTGGAATAC GCACGTAACG CCGTACGTAT GGCTGCGCTG ATGAAACAGC GTCAGGTGAT GGTTTACACC CACGACTCCA TCGGTCTGGG CGAAGATGGC CCGACTCACC AGCCGGTTGA GCAGGTCGCT TCTCTGCGCG TGACCCCGAA CATGTCTACA TGGCGTCCGT GTGACCAGGT TGAATCCGCG GTCGCGTGGA AATACGGCGT TGAGCGTCAG GACGGCCCGA CTGCGCTTAT CCTCTCCCGT CAGAACCTGG CGCAGCAGGA ACGAACTGAA GAGCAACTGG CAAACATCGC GCGCGGTGGT TATGTGCTGA AAGACTGCGC CGGTCAGCCG GAACTGATTT TCATCGCTAC CGGTTCAGAA GTTGAACTGG CTGTTGCTGC CTACGAAAAA CTGACTGCCG AAGGCGTGAA AGCGCGCGTG GTGTCCATGC CGTCTACCGA CGCATTTGAC AAGCAGGATG CTGCTTACCG TGAATCCGTA CTGCCGAAAG CGGTTACTGC ACGCGTTGCT GTAGAAGCGG GTATTGCTGA CTACTGGTAC AAGTATGTTG GCCTGAACGG TGCTATCGTC GGTATGACCA CCTTCGGTGA ATCTGCTCCG GCAGAGCTGC TGTTTGAAGA GTTCGGCTTC ACTGTTGATA ACGTTGTTGC GAAAGCAAAA GAACTGCTGT AA
|
Protein sequence | MSSRKELANA IRALSMDAVQ KAKSGHPGAP MGMADIAEVL WRDFLKHNPQ NPSWADRDRF VLSNGHGSML IYSLLHLTGY DLPMEELKNF RQLHSKTPGH PEVGYTAGVE TTTGPLGQGI ANAVGMAIAE KTLAAQFNRP GHDIVDHYTY AFMGDGCMME GISHEVCSLA GTLKLGKLIA FYDDNGISID GHVEGWFTDD TAMRFEAYGW HVIRDIDGHD AASIKRAVEE ARAVTDKPSL LMCKTIIGFG SPNKAGTHDS HGAPLGDAEI ALTREQLGWK YAPFEIPSEI YAQWDAKEAG QAKESAWNEK FAAYAKAYPQ EAAEFTRRMK GEMPSDFDAK AKEFIAKLQA NPAKIASRKA SQNAIEAFGP LLPEFLGGSA DLAPSNLTLW SGSKAINEDA AGNYIHYGVR EFGMTAIANG ISLHGGFLPY TSTFLMFVEY ARNAVRMAAL MKQRQVMVYT HDSIGLGEDG PTHQPVEQVA SLRVTPNMST WRPCDQVESA VAWKYGVERQ DGPTALILSR QNLAQQERTE EQLANIARGG YVLKDCAGQP ELIFIATGSE VELAVAAYEK LTAEGVKARV VSMPSTDAFD KQDAAYRESV LPKAVTARVA VEAGIADYWY KYVGLNGAIV GMTTFGESAP AELLFEEFGF TVDNVVAKAK ELL
|
| |