Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1256 |
Symbol | |
ID | 4809761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1522938 |
End bp | 1525205 |
Gene Length | 2268 bp |
Protein Length | 755 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106679 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001037681 |
Protein GI | 125973771 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.179739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGTAG ATATCAAGAA AATAATAAAG CAGATGACTT TGGAAGAAAA AGCAGGGTTG TGCTCGGGAC TGGATTTTTG GCATACCAAG CCTGTTGAGA GACTGGGCAT TCCTTCAATA ATGATGACTG ACGGACCTCA TGGACTGAGA AAGCAGAGGG AAGATGCAGA GATTGCGGAC ATCAACAACA GCGTTCCAGC AACCTGTTTT CCGTCTGCAG CAGGTTTGGC ATGTTCCTGG GACAGAGAAC TGGTTGAGAG AGTAGGTGCA GCACTAGGAG AAGAATGTCA GGCGGAAAAT GTCTCAATAC TGCTTGGACC AGGTGCAAAT ATAAAGCGTT CACCTTTGTG TGGAAGAAAT TTTGAATATT TTTCCGAAGA CCCTTATCTT TCGTCAGAGC TGGCGGCAAG CCATATAAAA GGAGTTCAAA GTCAGGGAGT GGGTGCATGT CTTAAACATT TTGCCGCAAA CAACCAGGAA CACCGGAGAA TGACCGTTGA TACCATTGTA GATGAAAGAA CGTTGAGGGA AATATATTTT GCAAGCTTTG AGAATGCTGT AAAAAAAGCA CGGCCTTGGG TGGTTATGTG TGCATATAAC AAGCTCAACG GTGAATATTG TTCGGAGAAC AGATATCTTT TGACGGAAGT TTTAAAGAAT GAATGGATGC ATGACGGCTT TGTGGTATCC GACTGGGGTG CGGTAAATGA CAGGGTCAGC GGCCTGGATG CAGGTCTTGA CCTGGAAATG CCCACCAGTC ATGGTATTAC GGATAAAAAG ATAGTTGAAG CCGTAAAAAG CGGAAAGCTG TCTGAAAATA TTTTAAACAG AGCTGTGGAA AGAATTTTGA AAGTAATTTT TATGGCACTG GAAAACAAAA AAGAAAACGC GCAGTATGAC AAAGATGCTC ATCACAGACT GGCAAGGCAG GCTGCGGCCG AATCGATGGT TCTTCTTAAA AACGAGGACG ATGTGCTTCC TTTAAAAAAG AGCGGAACCA TAGCTTTGAT AGGAGCTTTT GTGAAAAAAC CAAGATACCA GGGTTCGGGC AGTTCTCATA TTACCCCGAC AAGACTTGAT GATATTTATG AAGAGATAAA AAAGGCCGGA GGCGACAAAG TAAACCTTGT ATATTCGGAA GGATACAGGC TTGAAAATGA CGGTATTGAT GAGGAATTGA TAAACGAAGC TAAAAAGGCG GCATCAAGCT CGGATGTTGC GGTAGTATTT GCAGGGCTTC CGGATGAATA TGAATCTGAA GGATTTGACA GAACTCACAT GAGTATTCCG GAAAATCAAA ACAGGCTGAT AGAAGCGGTG GCCGAAGTCC AGAGTAATAT TGTTGTGGTA TTGCTTAACG GCTCACCGGT TGAAATGCCG TGGATTGACA AGGTAAAATC CGTGCTTGAA GCTTATCTTG GAGGCCAGGC GCTGGGAGGC GCGCTGGCGG ATGTGCTATT CGGTGAAGTC AATCCGTCGG GAAAACTTGC GGAGACCTTC CCGGTGAAAT TAAGCCATAA TCCGTCCTAT TTGAATTTTC CCGGAGAGGA TGACCGAGTG GAGTATAAAG AAGGGTTGTT TGTCGGATAC AGATATTATG ATACAAAGGG AATTGAGCCA TTGTTCCCCT TTGGTCACGG ACTTAGCTAT ACCAAATTTG AATACAGTGA TATATCAGTC GATAAAAAAG ATGTTTCGGA CAATAGCATC ATAAATGTCA GCGTTAAAGT CAAAAATGTT GGAAAAATGG CAGGAAAAGA AATTGTGCAG CTGTATGTAA AAGATGTGAA AAGCAGCGTC AGAAGACCTG AGAAAGAGCT TAAAGGATTT GAAAAGGTCT TCCTTAATCC GGGAGAAGAA AAGACGGTTA CATTTACTTT GGACAAAAGG GCTTTTGCAT ATTACAATAC TCAGATTAAG GACTGGCATG TTGAAAGCGG AGAGTTTCTG ATATTAATAG GAAGGTCCTC CAGGGACATA GTTTTAAAAG AATCAGTGAG AGTAAATTCA ACGGTGAAGA TAAGAAAAAG ATTCACAGTG AATTCAGCGG TTGAAGATGT AATGTCCGAT TCTTCGGCTG CGGCCGTTTT AGGGCCTGTA CTAAAAGAGA TAACCGATGC ACTGCAGATT GATATGGACA ATGCTCATGA CATGATGGCG GCCAATATAA AGAATATGCC TTTGCGCTCA CTTGTCGGTT ACTCTCAGGG AAGGTTAAGC GAAGAAATGC TGGAGGAACT GGTTGACAAA ATAAACAACG TGGAATAA
|
Protein sequence | MAVDIKKIIK QMTLEEKAGL CSGLDFWHTK PVERLGIPSI MMTDGPHGLR KQREDAEIAD INNSVPATCF PSAAGLACSW DRELVERVGA ALGEECQAEN VSILLGPGAN IKRSPLCGRN FEYFSEDPYL SSELAASHIK GVQSQGVGAC LKHFAANNQE HRRMTVDTIV DERTLREIYF ASFENAVKKA RPWVVMCAYN KLNGEYCSEN RYLLTEVLKN EWMHDGFVVS DWGAVNDRVS GLDAGLDLEM PTSHGITDKK IVEAVKSGKL SENILNRAVE RILKVIFMAL ENKKENAQYD KDAHHRLARQ AAAESMVLLK NEDDVLPLKK SGTIALIGAF VKKPRYQGSG SSHITPTRLD DIYEEIKKAG GDKVNLVYSE GYRLENDGID EELINEAKKA ASSSDVAVVF AGLPDEYESE GFDRTHMSIP ENQNRLIEAV AEVQSNIVVV LLNGSPVEMP WIDKVKSVLE AYLGGQALGG ALADVLFGEV NPSGKLAETF PVKLSHNPSY LNFPGEDDRV EYKEGLFVGY RYYDTKGIEP LFPFGHGLSY TKFEYSDISV DKKDVSDNSI INVSVKVKNV GKMAGKEIVQ LYVKDVKSSV RRPEKELKGF EKVFLNPGEE KTVTFTLDKR AFAYYNTQIK DWHVESGEFL ILIGRSSRDI VLKESVRVNS TVKIRKRFTV NSAVEDVMSD SSAAAVLGPV LKEITDALQI DMDNAHDMMA ANIKNMPLRS LVGYSQGRLS EEMLEELVDK INNVE
|
| |