Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2646 |
Symbol | |
ID | 4808957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3127922 |
End bp | 3129145 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640108059 |
Product | hypothetical protein |
Protein accession | YP_001039038 |
Protein GI | 125975128 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATGA AAAAAGTTTT CTTTGTGACT TACGGCGGAG GCCATGTAAG AAGCGTAATT CCGGTTATTA AAGAATTAAA ATCAAGGGGC CATAAAGTCT CTGTTCTCGG ATTAACAAGC AGCGTTAATG ATTTAAAAAA AGAAGAGATT GAATTTAAGG GCATCAGGGA TTATTTGAAT TTGTTCAAAG ATGAAGAAGC ACAAAAGATT TTAAAATACG GAGATATGTT TATTGATGAA CATTTTGATG CCGGTTCAGG CCTGGATAAA TTTGAAATCA AAGTGTATTT GGGAATGAAT CTATGGGATT TGTCCCTTCA GCTTAAAAGT TTTGAAGAAG CATTAAAACT TTTCAGAGAG CGCGGCAGAA GCTGTTTTTT CCCCATAAAT TTAATGGAAA GGATATTAAG CTTTGAAAAA CCGGACGTAA TTGTGGTTAC CAGCGGGAAA AGAGCTGAAA AAGCTGCAGC CTTCAGCGCC AATAAAATGG ATGTAAAAGT GGTACGTATA GTTGACCTTC TGGGAGAAAA TTTGAAAATT CCATACAAAG CAACGGTTTG TGTGTTAAAC GATTATGCCA AAGCAAACAT ACTTTCCTGC AATGAAAACC TGAATGAACG GGACGTAGTC GTCACAGGGC AGCCAAATAT TGAACCGACT TACACCGAAA AGCATTTTGA GGATTTTATA AAGAGGTACA ATCTTGATAA ATTCGACAAG GTTATTTCTT TTTTCTCCCA GCCCAATATA GCTTACAGAG AGGATATCCT GGTCGAATTT ATTAAGCTTA TGCAAAAAAG ACCAAACTTC ATGGGTATAT GGAAAACCCA TCCCAACGAG CAAATGGACC TATATACCGG GTATTTGAAT ACATTGCCGC AAAATTTATT GATTGTAAAA GAAGAGGATA CCAATTTGAT TTTAAGTAAG TCCAATTTGG TAATTACTTT TTACTCTACA GTCGGATTAC AGGCCATAGC CGCAGACAAA CCTCTGATAA CAGTCAATTT TTCAAAAAAT GCACATCCGG TGGAATATGA CAAGCTGGGC TGCGCCCTTC CTGTCAAAAA TACCGAAGAA TTTGAAAATG CCATAAATCT TTTGCTTGAA AGCAGCAATT CAGATGCCCG TAATTTACAT GCCCGCCTCA GGGAGGCAAG GAAAAAACTC ATGCCCCCTG CCGGGGCGGC CCAAAATATA GCCAATGTTA TCGAATACTC ATAA
|
Protein sequence | MKMKKVFFVT YGGGHVRSVI PVIKELKSRG HKVSVLGLTS SVNDLKKEEI EFKGIRDYLN LFKDEEAQKI LKYGDMFIDE HFDAGSGLDK FEIKVYLGMN LWDLSLQLKS FEEALKLFRE RGRSCFFPIN LMERILSFEK PDVIVVTSGK RAEKAAAFSA NKMDVKVVRI VDLLGENLKI PYKATVCVLN DYAKANILSC NENLNERDVV VTGQPNIEPT YTEKHFEDFI KRYNLDKFDK VISFFSQPNI AYREDILVEF IKLMQKRPNF MGIWKTHPNE QMDLYTGYLN TLPQNLLIVK EEDTNLILSK SNLVITFYST VGLQAIAADK PLITVNFSKN AHPVEYDKLG CALPVKNTEE FENAINLLLE SSNSDARNLH ARLREARKKL MPPAGAAQNI ANVIEYS
|
| |