Gene Cthe_1271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1271 
Symbol 
ID4809776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1545824 
End bp1547863 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content43% 
IMG OID640106694 
Productcarbohydrate-binding family 6 protein 
Protein accessionYP_001037696 
Protein GI125973786 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAAA AACTGAAAAA AATAATTAAG TTATGCAGCA TGGTATTCGT TATCGGCATC 
TTAACATTAT TACTGCCTGA AAAAGGGGCT GCCGATTATC CGATTTTCTC ACAGCGTTTT
ACTGCAGACC CAGCCGCAGT CGTGTACAAC GGAAGACTTT ATATTTATTG TTCCCATGAC
TCGGATGCCA CACCCGGCCA GTCCACTTAC AACATCCCGG ATATAACCTG TATATCCACA
GATGATTTGA AAAACTGGAC CGACCATGGA GAAGTGTTTA ATGCAAAAAG AGATTCCAGG
TGGGCTTCGG TGTCCTGGGC ACCGTCAATT GTCTATCGCA ACAACAAGTT TTATTTGTAT
TACGGTAACG GAGGAAACGG AATCGGAGTT GCTGTAAGCG ACAGTCCGAC AGGACCTTTC
AAAGATCCTC TTCCGGGTCC GTTGGTAAGC TGGAATACCC CCGGAGTACA GCCTGCACAA
AATATGTGGC TGTTTGACCC CGGTGTTTTT GTTGATGATG ACGGACAGGC TTATATGTAC
TTTGGTGGAA ACGGACAAAA CAATATCAGA GTTATTAAGT TGGGCAATGA TATGATAAGC
ACGGTAGGTT CAGCTATGAC GATGTCTGCT CCGAGATTTT TCGAGGCAGC ATATATGCAT
AAGTATAATG GAAAATATTA TTTTTCCTAT GCCAGCGATT TCTCTCAAGG TGCATCTAAG
ATTGAATATA TGATGAGCGA CAAGCCTACC ACAGGTTTTC AATATAAAGG GGTAATATTG
CCGCAGCCAC CCGACAACTA TAGTAATAAT AACCATCATG CTATCGTTGA GTATAAAGGC
AATTGGTATG TTGTGTATCA CAACAGGACT GTGGCAAAAC AGCGTGGACT GGACCCGGTA
TATCAGAGAA ATGTTTGTAT TGATCAGATG TTTTATAACG CCGACGGTAC CATAAAGCAG
GTTGTTCCAA CGGTAGATGG CTTGAAACAG CTCAAATATG TAGATCCTTA CACAAAAAAT
TTAGCTGTAA CCATGCATAA GGAATCCGGT ATTGAGACTG AAGAATGCAG TGAAGGCGGA
CGTAATGTGG CATTTATTGA AAACGGAGAT TGGATTCAAG TCAAAGGAGT TGATTTCGGA
AATGTAGGCC CAACCAGTTT TGAAGCCAGA GTCGCCAGTG CTACAAACGG CGGAAACATT
GAAATAAGAC TGGATAGTCC TACAGGAACT CTGATTGGCA CATGCAAAGT TGAAGGAACC
GGAGACTGGC AGAAATGGGT AACCAAAACC TGTTCCGTCA GCAAGGTAAC CGGTGTTCAC
GATTTGTTCT TCAGGTTTAC GGGGGGAAGC GGTTATCTGT TTAATTTCAG CTGGTGGAAA
TTCAACTCAG ACGCAACCCC AACCCCAACC CCTCCCCCCC AACCTTCAAC TGTACCTGTA
ACTGAAAGAA GTGCTTTTTC GAAAATAGAA GTGGAAGATT TCAATGACAT TAAGTCTTCC
ACAATACAAA AAATCGGCAC TCCCAACGGC GGAAGCGGCA TAGGATATAT TGAAAACGGA
GACTGGCTGG CGTATAAAAA TATTGACTTC GGAAACGGTG CGACCACCTT TAAGGCATTG
GTTGCAAGTA CCCTTTCTCC AAATATTGAA CTTCGATTAG ACAGCCCGAC GGGAACACTT
ATAGGAACAT TGAAAGTAGC AGCGACCGGC GGTTTTAACG CATACGAAGA ACAAAGCTGC
AACATCAGCA AAGTTACAGG AAAACATGAC TTGTATCTTG TATTTTCCGG TGCCGTAAAT
ATTGACTGGT TTACTTTTGG CGGCAGTAGT GGCATAATTA AACGAGGCGA TACAAACAGC
GACGGAAAAA TCAACTCGAC AGACGTCACA GCACTCAAGA GACATTTGCT CAGAGTAACC
CAGCTTACAG GCGATAATCT TGCCAACGCA GATGTCAACG GGGACGGAAA TGTTAACTCT
ACAGATCTTC TTCTGCTTAA ACGATATATA TTAGGGGAAA TAGAAAATTT CCCGATATAA
 
Protein sequence
MPKKLKKIIK LCSMVFVIGI LTLLLPEKGA ADYPIFSQRF TADPAAVVYN GRLYIYCSHD 
SDATPGQSTY NIPDITCIST DDLKNWTDHG EVFNAKRDSR WASVSWAPSI VYRNNKFYLY
YGNGGNGIGV AVSDSPTGPF KDPLPGPLVS WNTPGVQPAQ NMWLFDPGVF VDDDGQAYMY
FGGNGQNNIR VIKLGNDMIS TVGSAMTMSA PRFFEAAYMH KYNGKYYFSY ASDFSQGASK
IEYMMSDKPT TGFQYKGVIL PQPPDNYSNN NHHAIVEYKG NWYVVYHNRT VAKQRGLDPV
YQRNVCIDQM FYNADGTIKQ VVPTVDGLKQ LKYVDPYTKN LAVTMHKESG IETEECSEGG
RNVAFIENGD WIQVKGVDFG NVGPTSFEAR VASATNGGNI EIRLDSPTGT LIGTCKVEGT
GDWQKWVTKT CSVSKVTGVH DLFFRFTGGS GYLFNFSWWK FNSDATPTPT PPPQPSTVPV
TERSAFSKIE VEDFNDIKSS TIQKIGTPNG GSGIGYIENG DWLAYKNIDF GNGATTFKAL
VASTLSPNIE LRLDSPTGTL IGTLKVAATG GFNAYEEQSC NISKVTGKHD LYLVFSGAVN
IDWFTFGGSS GIIKRGDTNS DGKINSTDVT ALKRHLLRVT QLTGDNLANA DVNGDGNVNS
TDLLLLKRYI LGEIENFPI