Gene Cthe_2196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2196 
Symbol 
ID4811061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2621203 
End bp2622804 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content46% 
IMG OID640107602 
Productcarbohydrate-binding family 6 protein 
Protein accessionYP_001038591 
Protein GI125974681 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAGG TTTTATTGTC TCTGTTGATA AGTACTTTGA TTATAACTTT TTATATACCG 
TCATGTTTTG CGGACAATCC GATAGTACAA ACAATCTACA CTGCTGACCC TGCTCCGATG
GTATATAACG GGGTATGCTA CGTGTATACC ACCCATGATG AGGATGTTCT TATTGATAAC
TTCTTTACCA TGAATGACTG GAGATGCTAC TCCACGACAG ACATGGCAAA CTGGACCGAT
CATGGAACAG TGTTGTCCTA CACTGACTTC AGTTGGTCAA GCGGTAAAGC ATGGGCGGGT
CAGTGCGTGG AAAGAAACGG CAAATTCTAT TTTTACGTTC CTCTGGCAAA GAAAGGCGGA
GGAGAGGCGA TTGGAGTTGC AGTATCGGAC AGTCCGACGG GTCCGTTTAA AGATGCCTTG
GGGAAACCTT TGATAGACCG CGGGGGCTGG GGTGAGATAG ACCCCACCGT GTTTATCGAT
GATGACGGGC AGGCGTACCT TTACTGGGGA AACCCTGATC TTTACTATGT GAAACTGAAT
CCTGACATGA TTTCCTATTC GGGCGGCATT GTCAAAGTAC CTCTTACCAC AGCAGGATTT
GGACAGCGAA GCAAAAACGA CAGACCGACT TCCTATGAAG AAGGTCCGTG GTTTTACAAG
CGTAACAATT TATATTATAT GGTGTTTGCA GCAGGTCCGA TACCCGAACA TATTGCATAT
TCAACGAGTA CGAGTCCCAC CGGACCGTGG ACGTATCGCG GCGTAATAAT GCCGACCCAG
GGAGGCAGTT TTACCAATCA TCCCGGAATA ATTGATTATA AAGGGAATTC CTACTTCTTC
TACCATAATG CCGCTTTACC GGGGGGAAGC GGCTACCACC GTTCTGTTTG CGTGGAACAG
TTTCAATATA ATCCCGACGG AACAATTCCA AGGATTAATA TGACCAAAGA AGGGCCCCCG
CAGATAGGCA CTTTGAATCC ATATGTAAGA ACCGAAGCTG AAACCATTTG CTGGAGCTCA
GGTATCGAGA CGGAAAAATG CAGTGAAGGC GGAATGAATG TAGGCTTTAT TGAAAACGGG
GATTACATAA AGGTTAAAGG TGTGAATTTC GGAACCGGTG CGGCGTCCTT TGAGGCAAGA
GTGGCATCGG CAACCAACGG CGGAAACATA GAAATTCGGC TTGACAGCCC AACGGGAAAA
TTAGTGGGAA CGTGTACCGT TACAGGAACC GGAGGATGGC AGACCTGGAC TACCAAATCT
TGTCCGGTTT CCGGTGCCGA GGGAGTACAC GACTTATACT TTGTTTTCAA GGGTGGCAGC
GGTTATTTGT TCAATATAGA CTGGTGGAAG TTCACTCCGG CAAATCCGGA TCCAACGCCA
ACACCGATGC CGGATAAACG TTTGGGTGAT TTGAATAATG ACGGAAAAGT AAACTCGACA
GACTTTCAGC TGTTAAAAAT GCATGTACTC CGTCAAGAAC TTCCGGCAGG AACGGACCTT
TCAAATGCGG ATGTAAACAG AGACGGAAAA GTGGATTCCA GCGACTGTAC TTTGTTAAAA
AGATATATAC TGCGTGTTAT ATCGGATTTT CCTCAAAATT AA
 
Protein sequence
MRKVLLSLLI STLIITFYIP SCFADNPIVQ TIYTADPAPM VYNGVCYVYT THDEDVLIDN 
FFTMNDWRCY STTDMANWTD HGTVLSYTDF SWSSGKAWAG QCVERNGKFY FYVPLAKKGG
GEAIGVAVSD SPTGPFKDAL GKPLIDRGGW GEIDPTVFID DDGQAYLYWG NPDLYYVKLN
PDMISYSGGI VKVPLTTAGF GQRSKNDRPT SYEEGPWFYK RNNLYYMVFA AGPIPEHIAY
STSTSPTGPW TYRGVIMPTQ GGSFTNHPGI IDYKGNSYFF YHNAALPGGS GYHRSVCVEQ
FQYNPDGTIP RINMTKEGPP QIGTLNPYVR TEAETICWSS GIETEKCSEG GMNVGFIENG
DYIKVKGVNF GTGAASFEAR VASATNGGNI EIRLDSPTGK LVGTCTVTGT GGWQTWTTKS
CPVSGAEGVH DLYFVFKGGS GYLFNIDWWK FTPANPDPTP TPMPDKRLGD LNNDGKVNST
DFQLLKMHVL RQELPAGTDL SNADVNRDGK VDSSDCTLLK RYILRVISDF PQN