Gene Information |
Locus tag | Cthe_0137 |
Symbol | |
ID | 4808695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 173889 |
End bp | 174899 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105548 |
Product | glyceraldehyde-3-phosphate dehydrogenase |
Protein accession | YP_001036571 |
Protein GI | 125972661 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase |
TIGRFAM ID | [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I |
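
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are chosen for this example and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 173889
end_bp = 174899
gene_length_bp = 1011
protein_length_aa = 336

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should divide evenly into codons of three bases.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and does not code for a residue.
coding_codons = codons - 1

print(span, codons, coding_codons)  # 1011 337 336
```

Under these assumptions the 1011 bp span gives 337 codons, and dropping the stop codon leaves the 336 residues reported as the protein length.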
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.289587 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTAA AAATTGGTAT CAACGGTTTT GGACGTATCG GTCGTCTTGT GTTCAGGGCC AGTCTCAACA ACCCGAACGT TGAGGTTGTA GGTATAAACG ACCCATTTAT TGACCTTGAA TACATGCAAT ATATGTTAAA GTATGATACA GTACATGGTC AGTTCAAAGG TGAAATTTCA CAAGACAACG GAAAACTCGT TGTAAACGGC AGAAAAATAA GTGTTTATGG TTTCACTGAT CCTGCTGAGA TTCCGTGGAG CGAATGCGGT GCGGAATACA TCGTTGAATC AACGGGTGTA TTCACAACTA CTGAAAAGGC TTCAGCTCAC TTCAAGGGCG GTGCAAAGAA GGTAGTTATC AGTGCTCCTT CGGCAGATGC TCCGATGTTT GTTATGGGTG TAAACCATGA TAAATACACA AAGGATATGA ACGTTGTATC TAACGCTTCA TGTACAACAA ACTGCCTTGC TCCTCTGGCT AAAGTTATAC ATGAAAACTT CGGAATCGTA GAAGGCTTGA TGACTACTGT ACACGCTACA ACTGCAACTC AAAAGACTGT TGACGGTCCT TCAAAGAAAG ACTGGAGAGG CGGACGTGCA GCAGCAGGCA ACATCATTCC TTCATCAACA GGAGCTGCAA AGGCAGTTGG AAAGGTTATC CCTGAATTGA ACGGTAAGTT GACAGGTATG GCTTTCAGAG TTCCGACTCT TGACGTATCT GTTGTTGACT TGACTTGCCG TCTTGAAAAA CCGGCTACTT ATGATGAAAT CAAAGCCGCT GTTAAGAAAG CTTCAGAAAA TGAACTTAAG GGTATTTTGG GATACACTGA GGATGCGGTT GTTTCTTCAG ACTTCATCGG CGATCCCCGC ACTTCAATTT TCGATGCTGA AGCAGGTATT TCTCTCAACA GCAACTTTGT AAAACTTGTT GCCTGGTACG ACAATGAATG GGGTTATTCA AACAAAGTTG TTGATTTGAT AGTTCATATG GCTTCGGTTG ATGCAAAATA A
|
Protein sequence | MAVKIGINGF GRIGRLVFRA SLNNPNVEVV GINDPFIDLE YMQYMLKYDT VHGQFKGEIS QDNGKLVVNG RKISVYGFTD PAEIPWSECG AEYIVESTGV FTTTEKASAH FKGGAKKVVI SAPSADAPMF VMGVNHDKYT KDMNVVSNAS CTTNCLAPLA KVIHENFGIV EGLMTTVHAT TATQKTVDGP SKKDWRGGRA AAGNIIPSST GAAKAVGKVI PELNGKLTGM AFRVPTLDVS VVDLTCRLEK PATYDEIKAA VKKASENELK GILGYTEDAV VSSDFIGDPR TSIFDAEAGI SLNSNFVKLV AWYDNEWGYS NKVVDLIVHM ASVDAK
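
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: the helper names `clean`, `translate`, and `gc_content` are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model).

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 21 bases and last 21 bases of the gene sequence above.
print(translate("ATGGCAGTAA AAATTGGTAT C"))   # MAVKIGI
print(translate("GCTTCGGTTG ATGCAAAATA A"))   # ASVDAK
```

The two fragments reproduce the first seven and last six residues of the listed protein. Passing the full gene sequence to `translate` and `gc_content` would allow the whole protein and the reported GC content to be compared in the same way.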