Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0388 |
Symbol | |
ID | 4808465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 482647 |
End bp | 483681 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640105802 |
Product | alcohol dehydrogenase GroES-like protein |
Protein accession | YP_001036819 |
Protein GI | 125972909 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00875101 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTACAAT CGGTTATGGT ATCTCCTGGA AAGATTGAGT TCCATGAGGT GGAAAAACCG GAACTAAAAC CTGGACAAGT ACTAATAAAG ATTATGAGAA TTGGCATATG TGGTTCGGAT ATTCATGTAA ATCATGGAAA ACATCCTTTC ACAAAGTATC CGGTAACTCA AGGACATGAA GTAAGTGGAA AAATCGTCGA AGTTGCAGAA GATGTGGAGC ATCTGAAAGT TGGCCAGAAA GTGACTATAG AGCCTCAGGT TGTATGTGGT AAATGTCATC CGTGCCGTAC AGGAAAATAT AACCTCTGTG AGGAACTAAA AGTTATGGGA TTCCAGACCG TTGGCGCAGG CAGCGAATAT TTTGCTGTTG ATGCGAAGAA TGTAACGACT GTCCCTGACC ATCTTTCTTA TGACGAGGCC GCGATGATTG AGCCTTTGGC TGTTACCGTT CATGCGGCAA ACAGAGTCGG TGATGTGAAG GATAAGGACA TTGTAGTAAT AGGTGCGGGT CCTATAGGAA TTTTGCTGGT TCAGACCCTG AAAGCAAAAG GTGCCCGCAA GGTAATGGTT ACGGATGTAA GCGATTATCG TTTGGAATTG GCGCTCAAGT GTGGTGCTGA TTTTGCAGTC AATACCAAAA AAGAAGATTT CGGAGAAGCG ATGCTTCGCT GTTTTGGACC TGATAAAGCA GATGTTATAT ATGATTGTGC GGGCAACAAT ACAACAATGG AACAAGCCAT AAAACATTCC CGCAAGGGAA GCATAATTGT TCTGGTTGCA GTATTTGAAG GTATGGCAAC AGTCGATCTT GCAACTCTTA ACGATAAGGA ACTGGATTTG AATACGACAA TGATGTATCG GCATGAGGAC TATGTGGAAG CCATTGAGCT TGTTGAAGCG GGTAAAGTAA AATTAACCCC TCTGATGAGC AAGCATTTTG CTTTTAGAGA TTGGGAAAAA GCCTATGAGT ATATTGACAA TAATCGTGAA ATTACTATGA AAGTCCTCAT TGATGTTAAT AACGATGATG ATTAA
|
Protein sequence | MLQSVMVSPG KIEFHEVEKP ELKPGQVLIK IMRIGICGSD IHVNHGKHPF TKYPVTQGHE VSGKIVEVAE DVEHLKVGQK VTIEPQVVCG KCHPCRTGKY NLCEELKVMG FQTVGAGSEY FAVDAKNVTT VPDHLSYDEA AMIEPLAVTV HAANRVGDVK DKDIVVIGAG PIGILLVQTL KAKGARKVMV TDVSDYRLEL ALKCGADFAV NTKKEDFGEA MLRCFGPDKA DVIYDCAGNN TTMEQAIKHS RKGSIIVLVA VFEGMATVDL ATLNDKELDL NTTMMYRHED YVEAIELVEA GKVKLTPLMS KHFAFRDWEK AYEYIDNNRE ITMKVLIDVN NDDD
|
| |