Gene Information |
Locus tag | Cthe_3012 |
Symbol | |
ID | 4811160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3535136 |
End bp | 3537028 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640108433 |
Product | carbohydrate-binding family 6 protein |
Protein accession | YP_001039401 |
Protein GI | 125975491 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5520] O-Glycosyl hydrolase |
TIGRFAM ID | |
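The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon occupies 3 bp but encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(3535136, 3537028)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (1893 bp) and Protein Length (630 aa) rows.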
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTAAGT TAAGAAAATT GCTGCTATTT TCTACTGTTT TGTTTGTCGT TTTTACTCAA CTATTTGGCT TTATAATCAC GGTAGATGCG GCAGAAACGG CAACAATCAA CTTGTCGGCG GAAAAACAGG TAATCCGCGG ATTTGGAGGA ATGAACCATC CCGTTTGGAT TTCCGACCTG ACGCCGCAGC AAAGGGATAC GGCTTTTGGT AATGGAGAGG GACAGCTGGG CTTTACTATT TTGAGAATTC ATGTCGATGA GAACAGAAAC AATTGGTCAA AAGAAGTGGC AACCGCCAGA AGAGCTATTG AGCTTGGAGC AATAGTTTTT GCTTCTCCGT GGAATCCTCC AAGTAATATG GTGGAGACCT TCACTCGTAA CGGTGTGCCA AATCAAAAGA GACTCAGATA TGATAAATAC GGAGATTATG TACAGCATCT TAACGACTTT GTTGCGTACA TGAAAAGTAA CGGAGTGGAT TTGTATGCCA TTTCAGTTCA GAACGAACCA GACTATGCCC ATGAGTGGAC ATGGTGGACT CCTCAGGAAA TGCTTCGCTT TATGAGGGAC TATGCCGGCC AAATCAACTG CAGGGTTATG GCGCCGGAGT CATTCCAATA TCTGAAAAAT ATGTCCGACC CGATTTTGAA TGACCCTCAG GCTCTTGCGA ATTTGGATAT ACTTGGTGCC CACTTTTACG GCACTACTGT AAACAATATG CCCTATCCTT TGTTTGAGCA AAAAGGAGCG GGAAAAGAGC TGTGGATGAC AGAGGTTTAT GTTCCAAACA GCGACAGCAA TTCGGCAGAC CGCTGGCCTG AGGCACTAGA GGTTGCGCAT AATATGCACA ATGCTTTGGT AGAGGGAAAT TTCCAGGCAT ATGTTTGGTG GTATATCCGC AGGTCATACG GACCTATGAA AGAAGACGGT ACTATAAGCA AGCGCGGATA TATGATGGCA CATTACTCAA AGTTTGTCCG CCCGGGATAT GTAAGGGTTG ATGCAACGAA AAATCCTACA TACAATGTAT ATTTATCTGC TTACAAAAAC AAAAAAGATA ACAGCGTTGT GGCAGTGGTT ATAAATAAAA GTACCGAGGC GAAGACAATT AATATATCCG TTCCGGGAAC AAGTATCAGA AAGTGGGAAA GATATGTTAC TACAGGGTCA AAAAATCTTA GGAAAGAATC AGACATAAAT GCAAGTGGAA CCACTTTCCA GGTTACTTTG GAGCCTCAAA GCGTTACAAC TTTTGTAGGC GGTGGATCCA GTGAACCGCA AATACCGGTT GAAAGAAATG CTTTCTCAAA GATAGAATGC GAAGAATATA ACGCTACCAA TTCTTCCACT GTACAAGTAG TGGGTACCGG CACAGGAAGC GGTCTCGGAT ATATCGAAAA CGGCAACTAT TTTGCTTACA AAAATATTAA TTTCGGTAAC GGTGCAAATT CATTCAAAAT CAGGGCTGCA ACTACCGGTA CTCCAAAAAT AGAAATCCGA CTGGGCAGTC CGACAGGCAC TCTTGCAGGT ACATTGCAAG TGGCTGCAAC CGGAGGCTTT AATGCCTATG AGGAGCAGAG CTGCAGTATT AATAAAATTA CGGGTGTCCA GGACGTCTAT TTGGTATTCG GAGGAGCTGT AAATGTTGAC TGGTTTACCT TTGAGTCAAA ACAGGAGCCG ACTTTCAAGT ACGGCGACCT CAACGGTGAC GGCAATGTTA ACTCCACTGA TTCCACGCTT ATGTCAAGAT ATCTTTTAGG TATAATCACC ACTTTGCCGG CCGGTGAAAA GGCTGCGGAT 
TTGAATGGGG ACGGAAAGGT AAATTCTACA GACTACAATA TTTTAAAGAG ATATTTGCTT AAATATATTG ATAAATTTCC TGTAGAATCA TAA
|
Protein sequence | MRKLRKLLLF STVLFVVFTQ LFGFIITVDA AETATINLSA EKQVIRGFGG MNHPVWISDL TPQQRDTAFG NGEGQLGFTI LRIHVDENRN NWSKEVATAR RAIELGAIVF ASPWNPPSNM VETFTRNGVP NQKRLRYDKY GDYVQHLNDF VAYMKSNGVD LYAISVQNEP DYAHEWTWWT PQEMLRFMRD YAGQINCRVM APESFQYLKN MSDPILNDPQ ALANLDILGA HFYGTTVNNM PYPLFEQKGA GKELWMTEVY VPNSDSNSAD RWPEALEVAH NMHNALVEGN FQAYVWWYIR RSYGPMKEDG TISKRGYMMA HYSKFVRPGY VRVDATKNPT YNVYLSAYKN KKDNSVVAVV INKSTEAKTI NISVPGTSIR KWERYVTTGS KNLRKESDIN ASGTTFQVTL EPQSVTTFVG GGSSEPQIPV ERNAFSKIEC EEYNATNSST VQVVGTGTGS GLGYIENGNY FAYKNINFGN GANSFKIRAA TTGTPKIEIR LGSPTGTLAG TLQVAATGGF NAYEEQSCSI NKITGVQDVY LVFGGAVNVD WFTFESKQEP TFKYGDLNGD GNVNSTDSTL MSRYLLGIIT TLPAGEKAAD LNGDGKVNST DYNILKRYLL KYIDKFPVES
|
| |
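As a consistency check between the two sequence rows, the sketch below translates the first and last blocks of the gene sequence and compares them with the ends of the protein sequence. It is a minimal illustration, not the database's own method: the codon table is built from the standard NCBI amino-acid string, which translation table 11 shares with table 1 for internal codons (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same amino-acid assignments as the standard code for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block separators used in this record)
    is ignored; a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from this record.
head = translate("ATGCGTAAGT TAAGAAAATT GCTGCTATTT")

# Final 93 bases of the gene sequence, including the TAA stop codon.
tail = translate(
    "TTGAATGGGG ACGGAAAGGT AAATTCTACA GACTACAATA TTTTAAAGAG "
    "ATATTTGCTT AAATATATTG ATAAATTTCC TGTAGAATCA TAA"
)

print(head)
print(tail)
```

The two printed strings can be compared with the first block (MRKLRKLLLF) and the last three blocks (LNGDGKVNST DYNILKRYLL KYIDKFPVES) of the protein sequence row.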