Gene Information |
Locus tag | Cthe_0138 |
Symbol | pgk |
ID | 4808696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 175233 |
End bp | 176426 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105549 |
Product | phosphoglycerate kinase |
Protein accession | YP_001036572 |
Protein GI | 125972662 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0126] 3-phosphoglycerate kinase |
TIGRFAM ID | |
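
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon. The helper names span_length and coding_codons are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information rows of this record.
start_bp = 175233
end_bp = 176426
gene_length_bp = 1194
protein_length_aa = 397


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count, assuming a complete CDS with one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Both computed values agree with the listed fields under these assumptions.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under these assumptions, 176426 - 175233 + 1 = 1194 bp, and 1194 / 3 - 1 = 397 aa, which matches the Gene Length and Protein Length rows.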
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0810442 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCAATGA TGAACAAGAA AACCATTGAG GATATTGACG TAAAAGGTAA AAAGGTTATA GTCAGGGTTG ATTTTAATGT GCCTTTGGAC GAAAACAGAA AGATTACCGA TGACAAGAGA ATAGTTGGTG CGCTTCCAAC CATCAAGTAT CTGGTTGAAC ACGGTGCAAA AACTATATTG GTTTCACACC TGGGAAGACC AAAAGAAGGT TTTGAGGAAA AGTACAGCAT GGCTCCGACT GCCGTAAGAT TGGGAGAACT TCTTGGAAAA GAAGTAATAA TGGCGAAAGA TGTTATTGGT CCTGATGCAA AGGCAAAAGC AGCGGCATTA AAAGAAGGAG AAGTTTTAAT GCTCGAAAAT GTCAGATTCC ACAAGGAAGA GACTAAAAAC GATCCTGCTT TTGCAAAAGA ATTGGCAAGC ATGGCTGAAA TTTATGTAAA TGATGCGTTC GGAACTGCCC ACAGAGCCCA CGCTTCAACG GCAGGTCTTG CCGACTATCT GCCGGCGGTA TGCGGATACC TCATCCAAAA GGAAATTGAG GTTATGGGCA AGGCTCTGTC AAATCCTGAA AGACCGTTTG TGGCAATATT GGGCGGTGCC AAGGTTTCAG ACAAAATAGG CGTTATTGAG AACCTTATTG ACAAAGTTGA CACTCTTATA ATCGGCGGAG GTATGGCATA TACCTTCTTT AAAGCAAAGG GATACAGCAT AGGAACATCC CTGTGTGAAG ATGATAAAGT TGAGCTTGCA AAGAGCCTTA TGGATAAAGC AGAGAAAAAG GGAGTAAAAC TCTTGCTTCC TGTGGACAAT GTTGTAGGCA AAGAGTTCAG CAATGATACA GAACGCAAGA CTGTTCCGTC TGACCAGATT CCGGACGGAT GGATGGGTAT GGACATCGGT GAGGAAACAA TCAAGTTGTA TTCTGAGGCA ATCAAGAATG CCAAAACCGT GGTATGGAAC GGTCCTATGG GCGTATTTGA GTTCTCCAAC TTTGCCAACG GTACAAGGGA AGTTGCAAGA GCGGTTGCCG AGTCCGGAGC AATTTCAATA ATCGGAGGCG GAGATTCTGC CGCAGCTATA GAACAGCTTG GTTTTGCCGA TAAGATTACC CACATTTCAA CCGGAGGCGG CGCGTCTTTG GAGTTTCTTG AAGGAAAAGT ATTGCCGGGA ATTGATGTAT TAATGGATAA ATAA
Protein sequence | MAMMNKKTIE DIDVKGKKVI VRVDFNVPLD ENRKITDDKR IVGALPTIKY LVEHGAKTIL VSHLGRPKEG FEEKYSMAPT AVRLGELLGK EVIMAKDVIG PDAKAKAAAL KEGEVLMLEN VRFHKEETKN DPAFAKELAS MAEIYVNDAF GTAHRAHAST AGLADYLPAV CGYLIQKEIE VMGKALSNPE RPFVAILGGA KVSDKIGVIE NLIDKVDTLI IGGGMAYTFF KAKGYSIGTS LCEDDKVELA KSLMDKAEKK GVKLLLPVDN VVGKEFSNDT ERKTVPSDQI PDGWMGMDIG EETIKLYSEA IKNAKTVVWN GPMGVFEFSN FANGTREVAR AVAESGAISI IGGGDSAAAI EQLGFADKIT HISTGGGASL EFLEGKVLPG IDVLMDK
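
The sequences above are displayed in space-separated groups of 10 characters. A minimal sketch of how such a display string might be cleaned before analysis follows. The helper names ungroup and gc_fraction are hypothetical, and only the first three and last two groups of the gene sequence are used here to keep the example short, so the printed GC fraction describes that 30-base prefix rather than the whole gene.

```python
def ungroup(displayed: str) -> str:
    """Remove the spacing used for display and normalise to upper case."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three and last two 10-character groups of the gene sequence in this record.
head = ungroup("ATGGCAATGA TGAACAAGAA AACCATTGAG")
tail = ungroup("TAATGGATAA ATAA")

print(len(head))               # number of bases in the prefix
print(head[:3], tail[-3:])     # first codon and final codon of the record
print(round(gc_fraction(head), 3))
```

Passing the full gene sequence through ungroup and gc_fraction gives a value that can be compared with the GC content row of the record. The prefix begins with ATG and the sequence ends with TAA, a stop codon in translation table 11.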