Gene Cthe_0137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0137 
Symbol 
ID4808695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp173889 
End bp174899 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content43% 
IMG OID640105548 
Productglyceraldehyde-3-phosphate dehydrogenase 
Protein accessionYP_001036571 
Protein GI125972661 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.289587 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGTAA AAATTGGTAT CAACGGTTTT GGACGTATCG GTCGTCTTGT GTTCAGGGCC 
AGTCTCAACA ACCCGAACGT TGAGGTTGTA GGTATAAACG ACCCATTTAT TGACCTTGAA
TACATGCAAT ATATGTTAAA GTATGATACA GTACATGGTC AGTTCAAAGG TGAAATTTCA
CAAGACAACG GAAAACTCGT TGTAAACGGC AGAAAAATAA GTGTTTATGG TTTCACTGAT
CCTGCTGAGA TTCCGTGGAG CGAATGCGGT GCGGAATACA TCGTTGAATC AACGGGTGTA
TTCACAACTA CTGAAAAGGC TTCAGCTCAC TTCAAGGGCG GTGCAAAGAA GGTAGTTATC
AGTGCTCCTT CGGCAGATGC TCCGATGTTT GTTATGGGTG TAAACCATGA TAAATACACA
AAGGATATGA ACGTTGTATC TAACGCTTCA TGTACAACAA ACTGCCTTGC TCCTCTGGCT
AAAGTTATAC ATGAAAACTT CGGAATCGTA GAAGGCTTGA TGACTACTGT ACACGCTACA
ACTGCAACTC AAAAGACTGT TGACGGTCCT TCAAAGAAAG ACTGGAGAGG CGGACGTGCA
GCAGCAGGCA ACATCATTCC TTCATCAACA GGAGCTGCAA AGGCAGTTGG AAAGGTTATC
CCTGAATTGA ACGGTAAGTT GACAGGTATG GCTTTCAGAG TTCCGACTCT TGACGTATCT
GTTGTTGACT TGACTTGCCG TCTTGAAAAA CCGGCTACTT ATGATGAAAT CAAAGCCGCT
GTTAAGAAAG CTTCAGAAAA TGAACTTAAG GGTATTTTGG GATACACTGA GGATGCGGTT
GTTTCTTCAG ACTTCATCGG CGATCCCCGC ACTTCAATTT TCGATGCTGA AGCAGGTATT
TCTCTCAACA GCAACTTTGT AAAACTTGTT GCCTGGTACG ACAATGAATG GGGTTATTCA
AACAAAGTTG TTGATTTGAT AGTTCATATG GCTTCGGTTG ATGCAAAATA A
 
Protein sequence
MAVKIGINGF GRIGRLVFRA SLNNPNVEVV GINDPFIDLE YMQYMLKYDT VHGQFKGEIS 
QDNGKLVVNG RKISVYGFTD PAEIPWSECG AEYIVESTGV FTTTEKASAH FKGGAKKVVI
SAPSADAPMF VMGVNHDKYT KDMNVVSNAS CTTNCLAPLA KVIHENFGIV EGLMTTVHAT
TATQKTVDGP SKKDWRGGRA AAGNIIPSST GAAKAVGKVI PELNGKLTGM AFRVPTLDVS
VVDLTCRLEK PATYDEIKAA VKKASENELK GILGYTEDAV VSSDFIGDPR TSIFDAEAGI
SLNSNFVKLV AWYDNEWGYS NKVVDLIVHM ASVDAK