Gene Cthe_0267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0267 
Symbol 
ID4808550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp328214 
End bp330229 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content44% 
IMG OID640105679 
Producttype 3a, cellulose-binding 
Protein accessionYP_001036699 
Protein GI125972789 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACATT ACACGGGAAT CATTTTGAAA CTGGAAAGTG ACAGGGCAAT CGTATTGACT 
GACGGGCTTG ATTTTATGGA ACTGAAGCTC AAACCCGGCA TGCAACGCGG CCAGCATGTT
ATTTTCGATG AATCCGACTT GTATTCAGCC GGTCTCATTA CGAGGTATAA AAGTATCATT
ATGCCTTTTT CTGCGTTTGC CGCTGCCGCT GCAGTTTTTC TTGTAATACT TTTCAGTCTG
AGATTCGTTT CAATTTCCCA GGAATATGCT TATATAGATG TTGATATAAA TCCAAGCATT
GGGCTTGTAA TTGATAAAAA GGAAAAGGTT ATAGACGCAA AACCTTTAAA TAATGACGCA
AAACCGATTC TCGATGAAGC GGCCCCAAAA GACATGCCGT TATACGACGC TTTATCGAAA
ATACTGGATA TTTCAAAGAA AAACGGATAT ATAAATTCCG CAGACAATAT TGTTCTGTTC
TCCGCATCAA TAAACTCCGG CCGGAACAAT GTTTCCGAAA GTGACAAAGG CATTCAGGAA
ATAATATCCA CCCTCAAAGA CGTTGCAAAG GATGCAGGAG TCAAATTTGA AATTATACCG
TCTACCGAAG AGGACAGACA AAAAGCTTTA GACCAAAATC TTTCCATGGG AAGGTATGCT
ATATATGTAA AAGCTGTGGA AGAAGGTGTC AATCTTAACC TGGAAGATGC CCGAAACTTA
AGCGTCTCCG AGATTCTTGG CAAAGTAAAC ATTGGCAAAT TCGCCATCTC CGATACTCCG
GAAGACTCCG GAATTATGCC TGCGATTTCG GTTCCGGCTG AACCTGTGCC AAGTGTTACA
CCGGCTTATA CCGCTGTACC GGAAAAAACA GAAGCGCAAC CTGTTGATAT ACCAAAATCT
TCACCAACAC CGGCAAGTTT TACGGCACAT GTTCCCACAC CGCCAAAAAC TCCGTCAATC
CCGCATACAT CCGGGCCTGC CATCGTGCAC ACTCCGGCTG CCGATAAAAC CACCCCGACT
TTCACGGGGT CATCCACACC TGTACCAACC AATGTAGTAG CAATTGCATC CACACCTGTA
CCTGTATCTA CGCCCAAGCC TGTATCAACA CCTGCATATT CATCTACTCC CACACCCGAA
TCTACACCTG TACCTGTGTC CACACCCAAG CCAGCATCAA CACCTACACC TGCATCTACA
CCCAAGCCTG TATCGACACC CACACATGTA TCTACACCCA AGCCTATATC AACACCTACA
TCCACACCTA GACCTGCGTC CACACCTAAG CCTACATCAA CGCCTACACC GGAATCTACG
CCCAAGCCTA CGTCAACACC TGCACCCGTA TCCACACCTA CATCGACACC AATACCTACA
TATACATCCA CTCCTGCATC CACACCAATA CCTGCATATA CATCCACACC TACGTCCATA
CCAACGCTTA CACCTGCAAC ATCACCTGCG CCCACATCAT CGCCGACACC GATTCCATCG
CCAGCGCCTA CGGAAACTGA CTTGTTAACA AAGATTGAGC TTCAGGCATA CAATCACATC
AGAACCTCCG AAACCAAAGA GCTTCAACCC AGAATCAAGC TGATAAATAC CGGAAATACG
CCAATAACGC TCTCTGAGGT AAAAATCAGG TATTATTATA CAAAGGATCA AGTAATCAAT
GAGATTTACA CATGCGACTG GTCAAACATT ACTTCATCCA AAATAACAGG AACTGTGGTT
CAAATGTCAA ACCCAAAACC TAATGCCGAC AGCTACGTTG AAATAGGTTT CACTAACAGT
GCCGGTGTTC TAAATCCCGG TGAGTACGTT GAAATAATCA GCAGAATCGG AAACAGTTAT
GCTTTAAGTC TGGCAACTCC GCCTTATTCA GAATGGAATT ATATGTACGA CCAAAATTCC
GACTACTCCT TCAACAACAG TTCTTCAGAT TTTGTAGTCT GGGACAAGAT TACAGTGTAT
ATATCAGGAA CTCTATATTG GGGAATTGAA CCTTAA
 
Protein sequence
MSHYTGIILK LESDRAIVLT DGLDFMELKL KPGMQRGQHV IFDESDLYSA GLITRYKSII 
MPFSAFAAAA AVFLVILFSL RFVSISQEYA YIDVDINPSI GLVIDKKEKV IDAKPLNNDA
KPILDEAAPK DMPLYDALSK ILDISKKNGY INSADNIVLF SASINSGRNN VSESDKGIQE
IISTLKDVAK DAGVKFEIIP STEEDRQKAL DQNLSMGRYA IYVKAVEEGV NLNLEDARNL
SVSEILGKVN IGKFAISDTP EDSGIMPAIS VPAEPVPSVT PAYTAVPEKT EAQPVDIPKS
SPTPASFTAH VPTPPKTPSI PHTSGPAIVH TPAADKTTPT FTGSSTPVPT NVVAIASTPV
PVSTPKPVST PAYSSTPTPE STPVPVSTPK PASTPTPAST PKPVSTPTHV STPKPISTPT
STPRPASTPK PTSTPTPEST PKPTSTPAPV STPTSTPIPT YTSTPASTPI PAYTSTPTSI
PTLTPATSPA PTSSPTPIPS PAPTETDLLT KIELQAYNHI RTSETKELQP RIKLINTGNT
PITLSEVKIR YYYTKDQVIN EIYTCDWSNI TSSKITGTVV QMSNPKPNAD SYVEIGFTNS
AGVLNPGEYV EIISRIGNSY ALSLATPPYS EWNYMYDQNS DYSFNNSSSD FVVWDKITVY
ISGTLYWGIE P