Gene Cthe_0677 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0677 
Symbol 
ID4810295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp832702 
End bp833868 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content44% 
IMG OID640106094 
Productphosphopentomutase 
Protein accessionYP_001037105 
Protein GI125973195 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1015] Phosphopentomutase 
TIGRFAM ID[TIGR01696] phosphopentomutase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAG CTATAATAAT CGTTTTGGAC AGTGTTGGCA TGGGAGAGCT TCCCGATGCG 
GCAAAATACG GTGACGAAGG CAGCAACACA TTAGGAAATA TTGCAAAGAA TTTACCTGAT
TTTAGTCTGC CAAATCTTGA GTCTTTGGGA TTGGGAAATA TTGACGGTAT GACAGGCTAT
GAGCCTTCAA AAAATCCTTT AGGCTCTTAC GGAAGAATGG CGGAAAAATC CGCGGGCAAG
GACACAACAA CAGGTCATTG GGAGATTGCC GGCCTGATAT TGGATAAGCC TTTTCCGGTA
TATCCAAACG GATTTCCCGA AGATATAATA AAAAGATTCG AAGACAGTAT AGGAACAAAG
ACATTGGGAA ATGTTCCGGC ATCGGGGACA GAGATAATCA AGCTGTTAGG AGATGAGCAT
GTAAAGACAG GCTATCCAAT CGTGTACACA TCGGCCGACA GTGTGTTTCA AATAGCAGCC
CATGAGAATG TAATACCCGT GGAGAGGCTC TATGACATGT GCCGGACGGC ACGAAACATT
CTTACCGGAG AACATGCGGT CGGACGGGTA ATTGCAAGGC CTTTCATCGG CGAGTCGGGA
AACTACAAAA GAACCGACAG AAGGAAAGAT TTTTCTCTTG CTCCTGTAGG AAAAACACTT
TTGGACTATG CAGTTGAAAA TGGTTACAAA GTCAAGGCAG TCGGAAAGAT TGAGGATATA
TTTGGCGGAA GAGGTATTAC CGAGTCAGTC CACATTCACG ACAACATGGA TGGAGTGGAC
AGGACCCTTG AGTATATGAG GGATGATTTT GAAGGTATTC TTTTTACAAA TCTTGTGGAC
TTTGACATGC TTTACGGGCA TCGCAACGAT ATTGCCGGTT ATGCCAATGC TTTGAAAGAG
TTTGACCGAA GGATTCCGGA AATATTGGCA AATTTGCGGG AAGATGACCT TCTTGTTATA
ACTGCAGATC ACGGCTGTGA CCCATCCACG GAAAGTACCG ATCATTCAAG AGAATATGTG
CCTTTACTTG TATACGGAAA GAAGTTTAAA AGCAATGTAA ACTTAGGTAC GAGAAGCACC
TTTGCGGATG TTGCAAAAAC TGTGGCCCAC TATCTTGGAA TCAGCAGCAA TTTAGAGGGA
GAAAGCTTTC TTGGAAGCAT ACTGTAA
 
Protein sequence
MKRAIIIVLD SVGMGELPDA AKYGDEGSNT LGNIAKNLPD FSLPNLESLG LGNIDGMTGY 
EPSKNPLGSY GRMAEKSAGK DTTTGHWEIA GLILDKPFPV YPNGFPEDII KRFEDSIGTK
TLGNVPASGT EIIKLLGDEH VKTGYPIVYT SADSVFQIAA HENVIPVERL YDMCRTARNI
LTGEHAVGRV IARPFIGESG NYKRTDRRKD FSLAPVGKTL LDYAVENGYK VKAVGKIEDI
FGGRGITESV HIHDNMDGVD RTLEYMRDDF EGILFTNLVD FDMLYGHRND IAGYANALKE
FDRRIPEILA NLREDDLLVI TADHGCDPST ESTDHSREYV PLLVYGKKFK SNVNLGTRST
FADVAKTVAH YLGISSNLEG ESFLGSIL