Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3035 |
Symbol | |
ID | 4811107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3558127 |
End bp | 3559302 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640108456 |
Product | D-3-phosphoglycerate dehydrogenase |
Protein accession | YP_001039424 |
Protein GI | 125975514 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases |
TIGRFAM ID | [TIGR01327] D-3-phosphoglycerate dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.334583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATACTA TTCAAACACT TAACAAGATT TCACTAAAAG GCTTGGAGCT TTTTCCAAGA GATTCCTATG AAATAGCCAC AGAAATATCC AATCCCGACG CAATTCTTGT AAGGAGCTAT GACATGCTCA GCATGGAGCT TCCGAAGAAT CTTAAGGCCA TTGCCAGAGC CGGGGCCGGA GTCAACAACA TCCCTGTTGA AAAGTGCACC GAAAGAGGAA TTGTGGTTTT CAATACACCG GGCGCAAATG CAAATGCCGT AAAAGAACTT GTTCTCGCAT CACTGTTTAT GTCATCCCGC AAAATATACA AAGGTATTTC CTGGGTTCAG TCCCTCAAAG GCAAAGGAAA TGAAGTGGCT GAATTGGTGG AAAAGTACAA ATCCCAGTTC GCCGGACCCG AAATAAAAGG GAAAAAACTT GGGGTTATCG GTCTTGGCGC CATAGGTGGA TTGGTTGCCA ACGATGCTGT TGCTTTGGGT ATGGAAGTAA TCGGTTATGA CCCGTTTATT TCCATAGACT CCGCCTGGGA GCTTTCAAGC TCGGTAGAAA AAGCAGTAAG TCTTGACTAT CTGCTTTCCA CCTGTGACTA CATAACCATA CACGTGCCTT TCAATCCTAA AACCAAAGGT ATGATAAACA AAGAGAAATT TGAGATAATG AAAAAAGGTG TGAGGCTTTT GAACTTTGCA AGAGGCGGAC TTGTAGTCAA CAAGGACCTC CTTGAAGCAA TAGAAAACGG CACTGTTGCC TGCTATGTCA CCGACTTCCC TGAAGACGAA CTGCTTGGCA ACGACAATAT TATTACTTTG CCCCATCTCG GCGCTTCAAC ACCGGAATCC GAGGAAAACT GCGCCGTAAT GGCGGCAAGC CAGCTTCGTG ATTTCCTTGA ATACGGCAAC ATCAAAAACT CCGTAAACTT CCCCAACTGT GAACTTCCCT ACACAGGAAA CGTCAGAGTA ATTGTCGCCC ATGACAACAT ACCCAACATG TTTGGCCAAA TTACTTCTCT TATAGCCCGC AACGGAATCA ATATCGGGGA TATGATAAGC AAACACAAGG ATAAAATCGG ATACACAATT TTGAATGTCG AAAGAGAAAT TTCCGATGAA ATTGTAGAGA ACATAAGAGC AATAGAAGGA GTAAGAATGG TGAGAGTAAT TAACAAGACC AAATAA
|
Protein sequence | MYTIQTLNKI SLKGLELFPR DSYEIATEIS NPDAILVRSY DMLSMELPKN LKAIARAGAG VNNIPVEKCT ERGIVVFNTP GANANAVKEL VLASLFMSSR KIYKGISWVQ SLKGKGNEVA ELVEKYKSQF AGPEIKGKKL GVIGLGAIGG LVANDAVALG MEVIGYDPFI SIDSAWELSS SVEKAVSLDY LLSTCDYITI HVPFNPKTKG MINKEKFEIM KKGVRLLNFA RGGLVVNKDL LEAIENGTVA CYVTDFPEDE LLGNDNIITL PHLGASTPES EENCAVMAAS QLRDFLEYGN IKNSVNFPNC ELPYTGNVRV IVAHDNIPNM FGQITSLIAR NGINIGDMIS KHKDKIGYTI LNVEREISDE IVENIRAIEG VRMVRVINKT K
|
| |