Gene Cthe_0349 details

Gene Information

Locus tag: Cthe_0349
Symbol:
ID: 4808498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 438913
End bp: 439842
Gene Length: 930 bp
Protein Length: 309 aa
Translation table: 11
GC content: 42%
IMG OID: 640105763
Product: fructose-bisphosphate aldolase
Protein accession: YP_001036780
Protein GI: 125972870
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0191] Fructose/tagatose bisphosphate aldolase
TIGRFAM ID: [TIGR01859] fructose-1,6-bisphosphate aldolase, class II, various bacterial and amitochondriate protist
            [TIGR00167] ketose-bisphosphate aldolases
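
The length fields in this record agree with its coordinates if the start and end positions are read as 1-based and inclusive, and if the coding sequence ends in a single stop codon that is not counted in the protein. Both are assumptions made here for illustration, not statements from the record. A minimal sketch of the check:

```python
# Coordinates copied from the record.
start_bp = 438913
end_bp = 439842

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 930 309
```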


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 1.91041e-05
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCATTAG TTACCAGTAC TGAAATGTTT AAAAAAGCCT ACGAAGGCAA ATACGCTATA 
GGTGCTTTCA ATGTTAACAA TATGGAAATT ATTCAAGGTA TTACCGAAGC GGCGAAGGAA
GTTAATGCAC CGTTGATTCT TCAAGTTTCC GCAGGTGCAA GAAAGTATGC AAATCATACA
TACCTGGTAA AACTTGTTGA AGCCGCTGTT GAAGAGACGG GACTTCCTAT CTGCCTGCAT
CTTGACCATG GTGACAGCTT CGAGCTTTGT AAATCATGTA TTGACGGCGG ATTTACATCC
GTTATGATTG ACGGTTCCCA TCTTCCTTTT GAAGAGAACA TAAAACTTAC AAAGCAGGTT
GTTGACTACG CTCACTCAAA AGGTGTTGTT GTTGAAGGAG AGCTGGGAAG ACTTGCAGGT
ATAGAAGACG ATGTTAATGT GTCTGAGGCT GACGCTGCAT TTACCGACCC TGACCAGGCT
GAAGAGTTTG TAAAGAGAAC AGGTGTTGAT TCCCTGGCCA TAGCAATTGG TACCAGCCAT
GGTGCTTACA AATTTAAGGG AGAAGCAAAA TTAAGATTTG ATATATTGGA AGAGATTGAA
AAGAGACTTC CGGGATTCCC GATTGTTTTG CACGGTGCTT CCTCAGTTAT ACCCGAATAC
GTTGATATGA TTAACAAATA TGGCGGAGAT ATGCCCGGAG CGAAGGGTGT ACCGGAAGAC
ATGCTCAGAA AGGCTGCTTC CATGGCTGTC TGCAAGATAA ACATAGACTC AGACTTAAGA
CTTGCCATGA CAGCGACAAT CAGGAAGTAC TTTGCTGAAA ATCCGTCACA CTTTGACCCA
AGACAGTACT TAGGTCCCGC AAGAAATGCA ATTAAAGAGC TTGTTAAACA CAAAATTGTT
AATGTTCTTG GATGCGACGG AAAAGCTTAA
 
Protein sequence
MPLVTSTEMF KKAYEGKYAI GAFNVNNMEI IQGITEAAKE VNAPLILQVS AGARKYANHT 
YLVKLVEAAV EETGLPICLH LDHGDSFELC KSCIDGGFTS VMIDGSHLPF EENIKLTKQV
VDYAHSKGVV VEGELGRLAG IEDDVNVSEA DAAFTDPDQA EEFVKRTGVD SLAIAIGTSH
GAYKFKGEAK LRFDILEEIE KRLPGFPIVL HGASSVIPEY VDMINKYGGD MPGAKGVPED
MLRKAASMAV CKINIDSDLR LAMTATIRKY FAENPSHFDP RQYLGPARNA IKELVKHKIV
NVLGCDGKA
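
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: the `translate` helper and the variable names are hypothetical, and it assumes the gene sequence is given in coding orientation starting at the first base. It uses the standard amino-acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons).

```python
from itertools import product

# Amino acids in NCBI's TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 30 bases of the gene sequence, copied from the record.
first_block = "ATGCCATTAG TTACCAGTAC TGAAATGTTT"
last_block = "AATGTTCTTG GATGCGACGG AAAAGCTTAA"

print(translate(first_block))  # MPLVTSTEMF
print(translate(last_block))   # NVLGCDGKA
```

Both outputs match the start and end of the protein sequence listed above. Passing the full 930-base gene sequence to `translate` would allow the same comparison over all 309 residues.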