Gene Cthe_0140 details

Gene Information

Locus tag: Cthe_0140
Symbol:
ID: 4808698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 177383
End bp: 178918
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 43%
IMG OID: 640105551
Product: phosphoglyceromutase
Protein accession: YP_001036574
Protein GI: 125972664
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0696] Phosphoglyceromutase
TIGRFAM ID: [TIGR01307] 2,3-bisphosphoglycerate-independent phosphoglycerate mutase
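
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS length counts a terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Sketch: consistency check for the coordinate and length fields of this record.
# Assumptions (not stated in the record): coordinates are 1-based and inclusive,
# and the CDS length includes one terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(177383, 178918)
print(length, protein_length(length))  # prints: 1536 511
```

Under these assumptions the computed values agree with the Gene Length (1536 bp) and Protein Length (511 aa) fields.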


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0674113
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGACA AACTTGTGAT GTTGATTATT TTGGATGGTT ATGGTATTAA TCCGAGAAAA 
GAGGGAAATG CCATAGAGGC TGCAAATAAA CCGAATATTG ACAGGTTTAT GAGGGAATAT
CCCAATACAA TAGTTCGTAC CAGCGGTATG GATGTGGGAC TTCCCGACGG TCAGATGGGC
AATTCCGAAG TAGGGCATAC CAATATTGGT GCAGGAAGAA TTGTATATCA GGAACTTACA
AGGATAACCA AATCCATTCA GGACGGAGAC TTTTTTGAGA AGAAAGAATT TTTGGATGCT
GCCGAAAACT GCAGAAAGCA CAATTCAAAG CTGCATCTCT TCGGACTTCT TTCTGACGGC
GGCGTGCACA GCCACAATAC CCACCTGTAT GGCCTCTTGG AGTTTGCAAA AAGGCAGAAT
TTGAAAGATG TGTATGTTCA TTGCTTCTTT GACGGCAGGG ACGTTCCGCC GGACAGTGCA
ATGGGCTATG TGGAAGAGCT TGAAAATAAA ATCAGGGAAA TCGGCGTAGG TGAAATTGCC
ACAGTTATGG GAAGATACTA CGCCATGGAC CGTGACAACA GATGGGAGAG AGTAAAACTT
GCCTATGATG CCATGGTTCT TGGAAGAGGA AATCAAGCCC AAAGTGCAAA AGAGGCGGTT
GCAGAATCTT ATAAAAGACA AGAGTTCGAT GAATTTGTAA AACCCACAGT TATAATGAAA
AACGGCTCTC CGGTGGCTAC TGTCGGGGAA AACGACTCTA TAATATTCTT TAACTTCAGG
CCTGACAGAG CCAGAGAAAT TACCAGGGCT TTCACGGAGG TCAATTTTTC AGGTTTTGAA
AGGGAAAAAG GATATTTTCC GGTGTTCTTT GTCTGCATGA CCCAGTATGA CAAAACTTTT
GAAAACGTTG TTGTGGCATT TAAGCCTGAA AGCCTTGAGA ATACCTTTGG AGAGTATATC
AGCAAGAAAG GGCTGAGACA GCTTAGAATT GCCGAAACGG AAAAATATGC CCATGTAACC
TTCTTCTTTA ACGGAGGTGT TGAGGCGGTA TACGAAGGAG AAGACAGGAT ATTGATAAAT
TCTCCGAAAG TTGCAACATA TGATTTGAAG CCTGAAATGA GTGCCTACGA GGTAACTGAC
AAGGTGCTTG AGTGCATAAA TAAAAAGGAA TATGATGTAA TAATATTAAA TTATGCAAAT
CCCGACATGG TGGGGCATAC CGGAGTGTTT GAGGCGGCAA AGGCTGCAAT TGAAGCCATT
GACGAATGTT TGGGCAAGGT TGTTCCCGCA GTGCTTGAGC AAAACGGAGT GGTATTGATA
ACCGCGGATC ACGGAAATTC CGAGCAGATG ATAGATTATG AAACCGGAGG ACCTTTCACG
GCACATACAA CAAATCCTGT TCCTCTCATT GTCATTGGCC TTGGAGATGT CAAGCTCAGA
GAAGGAAGGC TTGCGGACCT TGCGCCGACA ATGCTTGATA TTTTAGGATT TGAGAAGCCT
AAGGAAATGA CAGGGGAATC GTTGATTGTA AAATAA
 
Protein sequence
MKDKLVMLII LDGYGINPRK EGNAIEAANK PNIDRFMREY PNTIVRTSGM DVGLPDGQMG 
NSEVGHTNIG AGRIVYQELT RITKSIQDGD FFEKKEFLDA AENCRKHNSK LHLFGLLSDG
GVHSHNTHLY GLLEFAKRQN LKDVYVHCFF DGRDVPPDSA MGYVEELENK IREIGVGEIA
TVMGRYYAMD RDNRWERVKL AYDAMVLGRG NQAQSAKEAV AESYKRQEFD EFVKPTVIMK
NGSPVATVGE NDSIIFFNFR PDRAREITRA FTEVNFSGFE REKGYFPVFF VCMTQYDKTF
ENVVVAFKPE SLENTFGEYI SKKGLRQLRI AETEKYAHVT FFFNGGVEAV YEGEDRILIN
SPKVATYDLK PEMSAYEVTD KVLECINKKE YDVIILNYAN PDMVGHTGVF EAAKAAIEAI
DECLGKVVPA VLEQNGVVLI TADHGNSEQM IDYETGGPFT AHTTNPVPLI VIGLGDVKLR
EGRLADLAPT MLDILGFEKP KEMTGESLIV K
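
The gene and protein sequences above are related through translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in its set of permitted start codons. The following minimal sketch shows how the blocked sequence text could be cleaned, translated, and checked for GC content. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the sketch assumes the sequence is given in coding orientation and contains only A, C, G, and T.

```python
# Sketch: translate a blocked DNA sequence and compute its GC fraction.
# Assumptions: the sequence is in coding orientation, contains only A/C/G/T,
# and starts with ATG (so alternative table-11 start codons need no special case).

BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycle T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from blocked sequence text."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGAAAGACA AACTTGTGAT GTTGATTATT"
print(translate(first_blocks))  # prints: MKDKLVMLII
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.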