Gene Information |
Locus tag | Cthe_2196 |
Symbol | |
ID | 4811061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2621203 |
End bp | 2622804 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640107602 |
Product | carbohydrate-binding family 6 protein |
Protein accession | YP_001038591 |
Protein GI | 125974681 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3507] Beta-xylosidase |
TIGRFAM ID | |
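
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(2621203, 2622804)
print(gene_len, protein_len)  # 1602 533
```

Under these assumptions the result agrees with the Gene Length (1602 bp) and Protein Length (533 aa) fields listed for Cthe_2196.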
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAGG TTTTATTGTC TCTGTTGATA AGTACTTTGA TTATAACTTT TTATATACCG TCATGTTTTG CGGACAATCC GATAGTACAA ACAATCTACA CTGCTGACCC TGCTCCGATG GTATATAACG GGGTATGCTA CGTGTATACC ACCCATGATG AGGATGTTCT TATTGATAAC TTCTTTACCA TGAATGACTG GAGATGCTAC TCCACGACAG ACATGGCAAA CTGGACCGAT CATGGAACAG TGTTGTCCTA CACTGACTTC AGTTGGTCAA GCGGTAAAGC ATGGGCGGGT CAGTGCGTGG AAAGAAACGG CAAATTCTAT TTTTACGTTC CTCTGGCAAA GAAAGGCGGA GGAGAGGCGA TTGGAGTTGC AGTATCGGAC AGTCCGACGG GTCCGTTTAA AGATGCCTTG GGGAAACCTT TGATAGACCG CGGGGGCTGG GGTGAGATAG ACCCCACCGT GTTTATCGAT GATGACGGGC AGGCGTACCT TTACTGGGGA AACCCTGATC TTTACTATGT GAAACTGAAT CCTGACATGA TTTCCTATTC GGGCGGCATT GTCAAAGTAC CTCTTACCAC AGCAGGATTT GGACAGCGAA GCAAAAACGA CAGACCGACT TCCTATGAAG AAGGTCCGTG GTTTTACAAG CGTAACAATT TATATTATAT GGTGTTTGCA GCAGGTCCGA TACCCGAACA TATTGCATAT TCAACGAGTA CGAGTCCCAC CGGACCGTGG ACGTATCGCG GCGTAATAAT GCCGACCCAG GGAGGCAGTT TTACCAATCA TCCCGGAATA ATTGATTATA AAGGGAATTC CTACTTCTTC TACCATAATG CCGCTTTACC GGGGGGAAGC GGCTACCACC GTTCTGTTTG CGTGGAACAG TTTCAATATA ATCCCGACGG AACAATTCCA AGGATTAATA TGACCAAAGA AGGGCCCCCG CAGATAGGCA CTTTGAATCC ATATGTAAGA ACCGAAGCTG AAACCATTTG CTGGAGCTCA GGTATCGAGA CGGAAAAATG CAGTGAAGGC GGAATGAATG TAGGCTTTAT TGAAAACGGG GATTACATAA AGGTTAAAGG TGTGAATTTC GGAACCGGTG CGGCGTCCTT TGAGGCAAGA GTGGCATCGG CAACCAACGG CGGAAACATA GAAATTCGGC TTGACAGCCC AACGGGAAAA TTAGTGGGAA CGTGTACCGT TACAGGAACC GGAGGATGGC AGACCTGGAC TACCAAATCT TGTCCGGTTT CCGGTGCCGA GGGAGTACAC GACTTATACT TTGTTTTCAA GGGTGGCAGC GGTTATTTGT TCAATATAGA CTGGTGGAAG TTCACTCCGG CAAATCCGGA TCCAACGCCA ACACCGATGC CGGATAAACG TTTGGGTGAT TTGAATAATG ACGGAAAAGT AAACTCGACA GACTTTCAGC TGTTAAAAAT GCATGTACTC CGTCAAGAAC TTCCGGCAGG AACGGACCTT TCAAATGCGG ATGTAAACAG AGACGGAAAA GTGGATTCCA GCGACTGTAC TTTGTTAAAA AGATATATAC TGCGTGTTAT ATCGGATTTT CCTCAAAATT AA
|
Protein sequence | MRKVLLSLLI STLIITFYIP SCFADNPIVQ TIYTADPAPM VYNGVCYVYT THDEDVLIDN FFTMNDWRCY STTDMANWTD HGTVLSYTDF SWSSGKAWAG QCVERNGKFY FYVPLAKKGG GEAIGVAVSD SPTGPFKDAL GKPLIDRGGW GEIDPTVFID DDGQAYLYWG NPDLYYVKLN PDMISYSGGI VKVPLTTAGF GQRSKNDRPT SYEEGPWFYK RNNLYYMVFA AGPIPEHIAY STSTSPTGPW TYRGVIMPTQ GGSFTNHPGI IDYKGNSYFF YHNAALPGGS GYHRSVCVEQ FQYNPDGTIP RINMTKEGPP QIGTLNPYVR TEAETICWSS GIETEKCSEG GMNVGFIENG DYIKVKGVNF GTGAASFEAR VASATNGGNI EIRLDSPTGK LVGTCTVTGT GGWQTWTTKS CPVSGAEGVH DLYFVFKGGS GYLFNIDWWK FTPANPDPTP TPMPDKRLGD LNNDGKVNST DFQLLKMHVL RQELPAGTDL SNADVNRDGK VDSSDCTLLK RYILRVISDF PQN
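
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before further use. The following is a minimal sketch, not the database's own tooling: it strips the spacing, computes a GC fraction, and translates nucleotides in frame. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three and last two blocks of the gene sequence from this record.
print(translate("ATGAGAAAGG TTTTATTGTC TCTGTTGATA"))  # MRKVLLSLLI
print(translate("CCTCAAAATT AA"))                     # PQN*
```

The two translated fragments match the first block (MRKVLLSLLI) and the final residues (PQN) of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content field.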
|