Gene Information |
Locus tag | Cthe_3132 |
Symbol | |
ID | 4809695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3702140 |
End bp | 3703375 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640108565 |
Product | cellulosome enzyme, dockerin type I |
Protein accession | YP_001039520 |
Protein GI | 125975610 |
COG category | |
COG ID | |
TIGRFAM ID | |
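The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. Both assumptions agree with the values listed above.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop_codon=True):
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3702140, 3703375)
print(length, protein_length_aa(length))  # 1236 411
```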
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTCA AGCTAAAGGG CACATTTTTA CTTTTGATTT TGATTGCAGC GCTCTTGTTT AATACGGTCT GTTCAAGTGC TGCAGAGAGC GTACTGCAAG ACAGGACCAT TGATGACATA GTGAAGAGAT ATCAAAACAA TCCGTTTCGT ATTAATGTGT CCGTTTCTGA CATATATGAA ATTGAGCCCA AAGCTGAGCC TCCTTATGTT GCAGGAAAAT TAAAAAGTGA CTATTTAAAG GAAGCATTAA ATTGTGTAAA TTTTATGCGT TATCTGGTTG GATTGCCGAA TGATCTTGTT CTGGACGACA ATTATAATAA TTATGCCCAG CATGGAACAG TATTACTGGC AAGATTAAGA GGTATAGCTC ATTATCCTCA AAAACCCGGT GATATGCCTG ATGAATTTTA CAATCTTGCA TATAAAGGAA CTTCAAGTTC CAGTATTGCA TACGGTTTTT CATCTTTAAT GGATAGCATT ATGGCATTTA TGAAAGATAA TAATAGTGAA TTGAATCTTA GTACAGTGGG TCACCGCAGA TGGCTGCTAA ATCCAGGTAT GGAGAAAACA GGCTTTGGAC AGTGCGGGCG TTACTATTGC ACATATATAT TGGATTCAGT TATGGGTGCT TCCGTTAAAT TTGACTTTAT AGCGTGGCCT GCAAGAAATT ATATGCCTGT GGAATATTTT AATGATGCAA GCGTACCTTG GTCGGTTAAT CTTGGAAGCG ATTATTTTTC GCCTTCCCTT AATGAAGTCG AAGTGACTCT TAAGAGAAGA AGTGACAACA AAGTATGGAT TTTTAATAAA GACAATATTG AGGAATATGG GCTTTTTAAT GTGAACAATG ATTATTATGG AATGACAAAA TGTATTATTT TCAGGCCAAA GGGTATCGGT AGCTATAATA AGAATGATGT ATTTGATGTA AACATCAAAG GAATAAGGCT TTCTACCGAC GGACCGACTG AAATTAATTA CACGGTGAGA TTTTTCAGTT TGAAAGATGC TATTGCTGAA AGGGAAAGAA ATTTTACATA CGGGGATTTA AACGGTGACG GAAGGGTAAA TTCGACGGAC TTGGCAGTAA TGAAAAGGTA TCTTTTAAAA CAAGTACAAA TTTCAGATAT CAGACCTGCA GATTTAAATG GTGACGGAAA AGCAAATTCC ACTGATTACC AGTTACTTAA ACGGTATATT TTAAAAACGA TAGATATATT TCCTGTTGAA AAATAG
|
Protein sequence | MKVKLKGTFL LLILIAALLF NTVCSSAAES VLQDRTIDDI VKRYQNNPFR INVSVSDIYE IEPKAEPPYV AGKLKSDYLK EALNCVNFMR YLVGLPNDLV LDDNYNNYAQ HGTVLLARLR GIAHYPQKPG DMPDEFYNLA YKGTSSSSIA YGFSSLMDSI MAFMKDNNSE LNLSTVGHRR WLLNPGMEKT GFGQCGRYYC TYILDSVMGA SVKFDFIAWP ARNYMPVEYF NDASVPWSVN LGSDYFSPSL NEVEVTLKRR SDNKVWIFNK DNIEEYGLFN VNNDYYGMTK CIIFRPKGIG SYNKNDVFDV NIKGIRLSTD GPTEINYTVR FFSLKDAIAE RERNFTYGDL NGDGRVNSTD LAVMKRYLLK QVQISDIRPA DLNGDGKANS TDYQLLKRYI LKTIDIFPVE K
|
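The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned and checked against the record follows. The helper names are hypothetical and not part of any database API. Translation table 11 assigns codons to amino acids in the same way as the standard genetic code (the two differ in the start codons they allow), so the standard codon assignments are used here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the block spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence field, as printed in the record.
prefix = clean_sequence("ATGAAAGTCA AGCTAAAGGG C")
print(translate(prefix))  # MKVKLKG
```

Applied to the full gene sequence field, `len(clean_sequence(...))` can be compared with the listed gene length, `gc_percent` with the listed GC content, and `translate` with the protein sequence field.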