Gene Cthe_1256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1256 
Symbol 
ID4809761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1522938 
End bp1525205 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content43% 
IMG OID640106679 
Productglycoside hydrolase family protein 
Protein accessionYP_001037681 
Protein GI125973771 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.179739 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGTAG ATATCAAGAA AATAATAAAG CAGATGACTT TGGAAGAAAA AGCAGGGTTG 
TGCTCGGGAC TGGATTTTTG GCATACCAAG CCTGTTGAGA GACTGGGCAT TCCTTCAATA
ATGATGACTG ACGGACCTCA TGGACTGAGA AAGCAGAGGG AAGATGCAGA GATTGCGGAC
ATCAACAACA GCGTTCCAGC AACCTGTTTT CCGTCTGCAG CAGGTTTGGC ATGTTCCTGG
GACAGAGAAC TGGTTGAGAG AGTAGGTGCA GCACTAGGAG AAGAATGTCA GGCGGAAAAT
GTCTCAATAC TGCTTGGACC AGGTGCAAAT ATAAAGCGTT CACCTTTGTG TGGAAGAAAT
TTTGAATATT TTTCCGAAGA CCCTTATCTT TCGTCAGAGC TGGCGGCAAG CCATATAAAA
GGAGTTCAAA GTCAGGGAGT GGGTGCATGT CTTAAACATT TTGCCGCAAA CAACCAGGAA
CACCGGAGAA TGACCGTTGA TACCATTGTA GATGAAAGAA CGTTGAGGGA AATATATTTT
GCAAGCTTTG AGAATGCTGT AAAAAAAGCA CGGCCTTGGG TGGTTATGTG TGCATATAAC
AAGCTCAACG GTGAATATTG TTCGGAGAAC AGATATCTTT TGACGGAAGT TTTAAAGAAT
GAATGGATGC ATGACGGCTT TGTGGTATCC GACTGGGGTG CGGTAAATGA CAGGGTCAGC
GGCCTGGATG CAGGTCTTGA CCTGGAAATG CCCACCAGTC ATGGTATTAC GGATAAAAAG
ATAGTTGAAG CCGTAAAAAG CGGAAAGCTG TCTGAAAATA TTTTAAACAG AGCTGTGGAA
AGAATTTTGA AAGTAATTTT TATGGCACTG GAAAACAAAA AAGAAAACGC GCAGTATGAC
AAAGATGCTC ATCACAGACT GGCAAGGCAG GCTGCGGCCG AATCGATGGT TCTTCTTAAA
AACGAGGACG ATGTGCTTCC TTTAAAAAAG AGCGGAACCA TAGCTTTGAT AGGAGCTTTT
GTGAAAAAAC CAAGATACCA GGGTTCGGGC AGTTCTCATA TTACCCCGAC AAGACTTGAT
GATATTTATG AAGAGATAAA AAAGGCCGGA GGCGACAAAG TAAACCTTGT ATATTCGGAA
GGATACAGGC TTGAAAATGA CGGTATTGAT GAGGAATTGA TAAACGAAGC TAAAAAGGCG
GCATCAAGCT CGGATGTTGC GGTAGTATTT GCAGGGCTTC CGGATGAATA TGAATCTGAA
GGATTTGACA GAACTCACAT GAGTATTCCG GAAAATCAAA ACAGGCTGAT AGAAGCGGTG
GCCGAAGTCC AGAGTAATAT TGTTGTGGTA TTGCTTAACG GCTCACCGGT TGAAATGCCG
TGGATTGACA AGGTAAAATC CGTGCTTGAA GCTTATCTTG GAGGCCAGGC GCTGGGAGGC
GCGCTGGCGG ATGTGCTATT CGGTGAAGTC AATCCGTCGG GAAAACTTGC GGAGACCTTC
CCGGTGAAAT TAAGCCATAA TCCGTCCTAT TTGAATTTTC CCGGAGAGGA TGACCGAGTG
GAGTATAAAG AAGGGTTGTT TGTCGGATAC AGATATTATG ATACAAAGGG AATTGAGCCA
TTGTTCCCCT TTGGTCACGG ACTTAGCTAT ACCAAATTTG AATACAGTGA TATATCAGTC
GATAAAAAAG ATGTTTCGGA CAATAGCATC ATAAATGTCA GCGTTAAAGT CAAAAATGTT
GGAAAAATGG CAGGAAAAGA AATTGTGCAG CTGTATGTAA AAGATGTGAA AAGCAGCGTC
AGAAGACCTG AGAAAGAGCT TAAAGGATTT GAAAAGGTCT TCCTTAATCC GGGAGAAGAA
AAGACGGTTA CATTTACTTT GGACAAAAGG GCTTTTGCAT ATTACAATAC TCAGATTAAG
GACTGGCATG TTGAAAGCGG AGAGTTTCTG ATATTAATAG GAAGGTCCTC CAGGGACATA
GTTTTAAAAG AATCAGTGAG AGTAAATTCA ACGGTGAAGA TAAGAAAAAG ATTCACAGTG
AATTCAGCGG TTGAAGATGT AATGTCCGAT TCTTCGGCTG CGGCCGTTTT AGGGCCTGTA
CTAAAAGAGA TAACCGATGC ACTGCAGATT GATATGGACA ATGCTCATGA CATGATGGCG
GCCAATATAA AGAATATGCC TTTGCGCTCA CTTGTCGGTT ACTCTCAGGG AAGGTTAAGC
GAAGAAATGC TGGAGGAACT GGTTGACAAA ATAAACAACG TGGAATAA
 
Protein sequence
MAVDIKKIIK QMTLEEKAGL CSGLDFWHTK PVERLGIPSI MMTDGPHGLR KQREDAEIAD 
INNSVPATCF PSAAGLACSW DRELVERVGA ALGEECQAEN VSILLGPGAN IKRSPLCGRN
FEYFSEDPYL SSELAASHIK GVQSQGVGAC LKHFAANNQE HRRMTVDTIV DERTLREIYF
ASFENAVKKA RPWVVMCAYN KLNGEYCSEN RYLLTEVLKN EWMHDGFVVS DWGAVNDRVS
GLDAGLDLEM PTSHGITDKK IVEAVKSGKL SENILNRAVE RILKVIFMAL ENKKENAQYD
KDAHHRLARQ AAAESMVLLK NEDDVLPLKK SGTIALIGAF VKKPRYQGSG SSHITPTRLD
DIYEEIKKAG GDKVNLVYSE GYRLENDGID EELINEAKKA ASSSDVAVVF AGLPDEYESE
GFDRTHMSIP ENQNRLIEAV AEVQSNIVVV LLNGSPVEMP WIDKVKSVLE AYLGGQALGG
ALADVLFGEV NPSGKLAETF PVKLSHNPSY LNFPGEDDRV EYKEGLFVGY RYYDTKGIEP
LFPFGHGLSY TKFEYSDISV DKKDVSDNSI INVSVKVKNV GKMAGKEIVQ LYVKDVKSSV
RRPEKELKGF EKVFLNPGEE KTVTFTLDKR AFAYYNTQIK DWHVESGEFL ILIGRSSRDI
VLKESVRVNS TVKIRKRFTV NSAVEDVMSD SSAAAVLGPV LKEITDALQI DMDNAHDMMA
ANIKNMPLRS LVGYSQGRLS EEMLEELVDK INNVE