Gene Cthe_3035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3035 
Symbol 
ID4811107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3558127 
End bp3559302 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content43% 
IMG OID640108456 
ProductD-3-phosphoglycerate dehydrogenase 
Protein accessionYP_001039424 
Protein GI125975514 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases 
TIGRFAM ID[TIGR01327] D-3-phosphoglycerate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.334583 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATACTA TTCAAACACT TAACAAGATT TCACTAAAAG GCTTGGAGCT TTTTCCAAGA 
GATTCCTATG AAATAGCCAC AGAAATATCC AATCCCGACG CAATTCTTGT AAGGAGCTAT
GACATGCTCA GCATGGAGCT TCCGAAGAAT CTTAAGGCCA TTGCCAGAGC CGGGGCCGGA
GTCAACAACA TCCCTGTTGA AAAGTGCACC GAAAGAGGAA TTGTGGTTTT CAATACACCG
GGCGCAAATG CAAATGCCGT AAAAGAACTT GTTCTCGCAT CACTGTTTAT GTCATCCCGC
AAAATATACA AAGGTATTTC CTGGGTTCAG TCCCTCAAAG GCAAAGGAAA TGAAGTGGCT
GAATTGGTGG AAAAGTACAA ATCCCAGTTC GCCGGACCCG AAATAAAAGG GAAAAAACTT
GGGGTTATCG GTCTTGGCGC CATAGGTGGA TTGGTTGCCA ACGATGCTGT TGCTTTGGGT
ATGGAAGTAA TCGGTTATGA CCCGTTTATT TCCATAGACT CCGCCTGGGA GCTTTCAAGC
TCGGTAGAAA AAGCAGTAAG TCTTGACTAT CTGCTTTCCA CCTGTGACTA CATAACCATA
CACGTGCCTT TCAATCCTAA AACCAAAGGT ATGATAAACA AAGAGAAATT TGAGATAATG
AAAAAAGGTG TGAGGCTTTT GAACTTTGCA AGAGGCGGAC TTGTAGTCAA CAAGGACCTC
CTTGAAGCAA TAGAAAACGG CACTGTTGCC TGCTATGTCA CCGACTTCCC TGAAGACGAA
CTGCTTGGCA ACGACAATAT TATTACTTTG CCCCATCTCG GCGCTTCAAC ACCGGAATCC
GAGGAAAACT GCGCCGTAAT GGCGGCAAGC CAGCTTCGTG ATTTCCTTGA ATACGGCAAC
ATCAAAAACT CCGTAAACTT CCCCAACTGT GAACTTCCCT ACACAGGAAA CGTCAGAGTA
ATTGTCGCCC ATGACAACAT ACCCAACATG TTTGGCCAAA TTACTTCTCT TATAGCCCGC
AACGGAATCA ATATCGGGGA TATGATAAGC AAACACAAGG ATAAAATCGG ATACACAATT
TTGAATGTCG AAAGAGAAAT TTCCGATGAA ATTGTAGAGA ACATAAGAGC AATAGAAGGA
GTAAGAATGG TGAGAGTAAT TAACAAGACC AAATAA
 
Protein sequence
MYTIQTLNKI SLKGLELFPR DSYEIATEIS NPDAILVRSY DMLSMELPKN LKAIARAGAG 
VNNIPVEKCT ERGIVVFNTP GANANAVKEL VLASLFMSSR KIYKGISWVQ SLKGKGNEVA
ELVEKYKSQF AGPEIKGKKL GVIGLGAIGG LVANDAVALG MEVIGYDPFI SIDSAWELSS
SVEKAVSLDY LLSTCDYITI HVPFNPKTKG MINKEKFEIM KKGVRLLNFA RGGLVVNKDL
LEAIENGTVA CYVTDFPEDE LLGNDNIITL PHLGASTPES EENCAVMAAS QLRDFLEYGN
IKNSVNFPNC ELPYTGNVRV IVAHDNIPNM FGQITSLIAR NGINIGDMIS KHKDKIGYTI
LNVEREISDE IVENIRAIEG VRMVRVINKT K