Gene Cthe_0344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0344 
Symbol 
ID4808493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp432885 
End bp434057 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content43% 
IMG OID640105758 
Productmalate dehydrogenase 
Protein accessionYP_001036775 
Protein GI125972865 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.318957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTACA GAAAAGAATC ACTAAGGCTT CACGGTGAGT GGAAGGGTAA AATTGAGGTT 
ATACACAAGG TACCTGTTTC AACCAAGGAA GAGTTGTCGC TTGCTTATAC ACCGGGTGTT
GCAGAACCAT GTCTTGCAAT TCAGAAAGAT GTTAATCTTT CTTATGAATA TACAAGACGT
TGGAACCTGG TAGCGGTTAT TACCGACGGT ACGGCGGTTT TAGGGCTCGG AGACATAGGA
CCTGAAGCCG GAATGCCTGT TATGGAAGGT AAATGCGTAC TCTTCAAGAA GTTTGGTGAT
GTGGACGCAT TTCCGCTCTG TATCAAATCA AAAGACGTAG ATGAAATTGT AAAGACAATC
AAGCTCATCT CCGGAAGCTT TGGCGGTATA AACCTCGAAG ATATATCCGC TCCGAGATGC
TTTGAAATAG AAAGAAGACT CAAAGAGGAA TGTGACATTC CAATATTCCA TGATGACCAG
CACGGTACAG CCGTTGTTAC TGTTGCAGCA ATGATCAATG CATTAAAGCT TGTCAACAAG
AAAATCGAGG ATATAGAAGT TGTTGTAAAC GGTTCAGGTG CTGCCGGCAT AGCTGTAACA
AGACTGCTCA TGAGTATGGG GCTTAAGAAA GTTATCCTTT GCGATACCAA AGGTGCAATT
TATGATGGAA GAGACAACTT AAACAGTGAA AAAGCCCTGA TTGCTAAAAT CTCGAACCTC
GAGAAAAAGA AAGGTACTCT TGAAGATGTA ATCAAGGGAG CTGACGTATT CATCGGTCTT
TCCGTTCCAG GAACAGTTAC AAAGGATATG GTAAAATCCA TGGCAAAGGA TCCGATTATC
TTTGCTATGG CAAATCCTAC TCCTGAAATA ATGCCTGATG AAGCAAAAGA AGCAGGAGCA
AAGGTAGTGG GTACCGGAAG ATCCGACTTC CCGAACCAGA TAAACAACGT TCTTGCGTTC
CCCGGAATAT TCAGAGGTGC GCTTGATGTA AGAGCAAGAG ATATCAATGA TGAAATGAAG
ATAGCCGCTG CAAAAGCAAT AGCTTCTCTG GTAAGCGATG AAGAGCTCAA TCCTGACTTC
ATTCTTCCGC TCCCATTTGA CCCAAGAGTC GGAAAAACAG TTGCTGCAGC AGTTGCTGAA
GCAGCAAGAA AAACCGGAGT TGCAAGAATA TAA
 
Protein sequence
MDYRKESLRL HGEWKGKIEV IHKVPVSTKE ELSLAYTPGV AEPCLAIQKD VNLSYEYTRR 
WNLVAVITDG TAVLGLGDIG PEAGMPVMEG KCVLFKKFGD VDAFPLCIKS KDVDEIVKTI
KLISGSFGGI NLEDISAPRC FEIERRLKEE CDIPIFHDDQ HGTAVVTVAA MINALKLVNK
KIEDIEVVVN GSGAAGIAVT RLLMSMGLKK VILCDTKGAI YDGRDNLNSE KALIAKISNL
EKKKGTLEDV IKGADVFIGL SVPGTVTKDM VKSMAKDPII FAMANPTPEI MPDEAKEAGA
KVVGTGRSDF PNQINNVLAF PGIFRGALDV RARDINDEMK IAAAKAIASL VSDEELNPDF
ILPLPFDPRV GKTVAAAVAE AARKTGVARI