Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0854 |
Symbol | |
ID | 4810472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1029680 |
End bp | 1030576 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640106270 |
Product | shikimate dehydrogenase |
Protein accession | YP_001037281 |
Protein GI | 125973371 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0169] Shikimate 5-dehydrogenase |
TIGRFAM ID | [TIGR00507] shikimate 5-dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATATCG ATGTCAGGGT AACGGGAAAG ACAAAATTGT TGGGACTTGT AGGAAATCCT GTTGAGCATT CCATATCACC CCAACTTCAC AATACGTTAA GTTCGCTGCT GGGACTGGAT ATTGTTTACA TACCTCTGGC AGTTGGAAAA GAAGACCTTG AGACAGTGGT AAAAGCTCTC AAAGCTTTGG ATTTTATAGG TTTTAACGTT ACAATTCCTT ACAAAAGGGA CATAATGAAA TATCTTGACG AGAATTCAAA AGAAGCAATA CTCATGGGAG CGGTAAACAC GGTAAAAAAG ATTGACGGTC GTCTGTACGG TTATAATACC GATGCGGAAG GTTTTTTAAG ATCTTTTAAG GAAGAAGCCG GTGTGGGCTT TAAAGGCAAA AAGGTTGTAC TTATAGGAGC CGGAGGAGTG GCCCGGGCAA TTGCCGTTAA AATTGCCTCT GAAGATGCGG AAAAAATCAG TGTTGTAAAC CGTACCGTCG AGAAATCCGT TGAACTTGCC GAGGTTGTAA ATGAAAATAT AAAAGAAATA GTCCAGGTGT ACAATTTTGA AGATAAAACT TTCAGAATGG CTTTTGAGGA AAGTGATATT ATTATTAATA CAACTTCTGT AGGTATGTAT CCTAAAACCC GGGAAACCCC CGTTAAATTC ACTGAGTGCT TCAACAAAAA TCAAATAGTG TATGATGTGA TCTACAATCC TGAAAAAACA AAATTTCTCA ATGATGCTGA AAAAAGAGGG GCAAAAATCA TAAATGGCCT TGGAATGCTT TTCTATCAAG GGATAAGCGC CTATGAAATC TGGACGGGTG TAAAGTTTAC CGAGGATAAA CTTAAAAGTG TTTATGAATC TTTCAAAAAA TATTTGCAGC TTAAATCAGC CAAATAA
|
Protein sequence | MDIDVRVTGK TKLLGLVGNP VEHSISPQLH NTLSSLLGLD IVYIPLAVGK EDLETVVKAL KALDFIGFNV TIPYKRDIMK YLDENSKEAI LMGAVNTVKK IDGRLYGYNT DAEGFLRSFK EEAGVGFKGK KVVLIGAGGV ARAIAVKIAS EDAEKISVVN RTVEKSVELA EVVNENIKEI VQVYNFEDKT FRMAFEESDI IINTTSVGMY PKTRETPVKF TECFNKNQIV YDVIYNPEKT KFLNDAEKRG AKIINGLGML FYQGISAYEI WTGVKFTEDK LKSVYESFKK YLQLKSAK
|
| |