Gene Information |
Locus tag | Cthe_0661 |
Symbol | |
ID | 4808191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 815804 |
End bp | 817519 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640106076 |
Product | Ricin B lectin |
Protein accession | YP_001037089 |
Protein GI | 125973179 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3507] Beta-xylosidase |
TIGRFAM ID | |
| |
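The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it matches the listed 1716 bp) and that the final codon of the unspliced CDS is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(815804, 817519)
print(length, protein_length_aa(length))  # prints: 1716 571
```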
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTAAAAA AGAATGTCGG ATTGAGGTTT CTTTCAATCC TGATTCTTAT GGCACTTCTC ATTGGAAATG TCCAAAGTTT TAATGTGGCG GCGGCAGAAG GGGTTATAGT CAACGGAACT CAGTTTAAAG ACACATCGGG AAATGTGATA CATGCCCATG GGGGAGGCAT GTTAAAGCAT GGTGACTATT ATTACTGGTA CGGTGAATAC CGGGACGACT CCAACTTGTT TTTGGGTGTA AGTTGCTACA GGTCAAAAGA TCTTGTAAAC TGGGAATACA GAGGAGAAGT GCTGAGCCGA AATTCCGCTC CTGAACTGAA TCACTGCAAT ATTGAAAGAC CGAAAGTCAT GTACAACGCA TCAACCGGTG AATTTGTCAT GTGGATGCAC TGGGAGAACG GCATAAACTA CGGTCAGGCA AGAGCAGCTG TTGCGTATTC CAAAACGCCC GACGGCAAAT TCACATACAT TCGAAGCTTT CGTCCCATGC AGGATACCGG CGTTATGGAT CATGGCCTTC CGGGATATAT GTCAAGGGAC TGCAATGTAT TTGTGGACAC TGACGGCAAG GGATATTTTA TATCCGCAGC CAATGAGAAC ATGGACCTGC ACCTTTATGA GCTGACACCT GACTATAAAA ATATTGCATC CCTTAAGGCA AAGCTGTTTG TCGGACAGCA GAGGGAAGCA CCATGCCTTA TAAAGAGAAA CGGCTACTAT TACCTTATTA CTTCCGGTTG TACAGGTTGG AACCCGAATC AGGCTAAATA CGCATATTCC AAAGATTTGG CCAGTGGCTG GTCCCAGCTT TACAATCTTG GTAATTCAAC CACCTACAGG TCACAGCCGA CTTTTATCAT TCCCGTTCAG GGAAGCTCGG GAACCAGTTA TCTTTATATG GGTGACCGTT GGGCCGGTGC CTGGGGAGGA AAGGTTAATG ACTCCCAATA TGTATGGCTT CCCTTAAACT TCATATCCGA TACAACACTT GAACTGCCCT ATTATGACTC TGTAAAGATT GATGCTTCTT CAGGAATAAT TTCCGAGTAC ATACCGGACA CTACACGCTA CAAGCTGGTA AACAAAAACA GCGGAAAAGT CCTGGATGTT CTTGACGGTT CTGTCGATAA TGCAGCCCAG ATAGTCCAAT GGACCGATAA CGGGTCTTTG AGTCAACAGT GGTACCTTGT GGACGTGGGC GGTGGTTATA AAAAGATTGT AAATGTAAAG AGCGGAAGAG CCTTGGATGT AAAAGACGAA TCCAAGGAAG ACGGTGGAGT ATTAATACAA TATACCAGCA ACGGCGGATA TAATCAGCAC TGGAAATTCA CAGACATAGG TGACGGGTAT TACAAGATTT CCAGCCGCCA CTGCGGAAAA CTTATAGATG TGCGAAAATG GTCAACGGAA GACGGCGGAA TAATTCAGCA GTGGTCCGAT GCCGGAGGAA CAAATCAGCA TTGGAAGCTG GTGCTTGTAT CAAGTCCCGA GCCTTCACCA TCACCTTCTC CCCAAGTGGT TAAAGGAGAT GTAAACGGCG ACTTGAAAGT AAATTCAACG GATTTTTCCA TGTTAAGAAG ATATTTACTT AAAACCATTG ACAATTTTCC GACAGAAAAC GGAAAACAGG CTGCCGATTT GAACGGAGAC GGCAGAATAA ACTCTTCGGA TCTTACAATG CTGAAAAGAT ACTTGCTTAT GGAAGTGGAT TTGTAA
|
Protein sequence | MVKKNVGLRF LSILILMALL IGNVQSFNVA AAEGVIVNGT QFKDTSGNVI HAHGGGMLKH GDYYYWYGEY RDDSNLFLGV SCYRSKDLVN WEYRGEVLSR NSAPELNHCN IERPKVMYNA STGEFVMWMH WENGINYGQA RAAVAYSKTP DGKFTYIRSF RPMQDTGVMD HGLPGYMSRD CNVFVDTDGK GYFISAANEN MDLHLYELTP DYKNIASLKA KLFVGQQREA PCLIKRNGYY YLITSGCTGW NPNQAKYAYS KDLASGWSQL YNLGNSTTYR SQPTFIIPVQ GSSGTSYLYM GDRWAGAWGG KVNDSQYVWL PLNFISDTTL ELPYYDSVKI DASSGIISEY IPDTTRYKLV NKNSGKVLDV LDGSVDNAAQ IVQWTDNGSL SQQWYLVDVG GGYKKIVNVK SGRALDVKDE SKEDGGVLIQ YTSNGGYNQH WKFTDIGDGY YKISSRHCGK LIDVRKWSTE DGGIIQQWSD AGGTNQHWKL VLVSSPEPSP SPSPQVVKGD VNGDLKVNST DFSMLRRYLL KTIDNFPTEN GKQAADLNGD GRINSSDLTM LKRYLLMEVD L
|
| |
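The sequences above are printed in space-separated blocks of ten, and the record lists translation table 11, which uses the same amino acid assignments as the standard code but accepts additional start codons such as GTG. The following is a minimal sketch, not the database's own method, of how one might strip the block spacing, compute GC content, and translate the gene so that a leading GTG is read as methionine. The names `clean`, `gc_fraction` and `translate` are hypothetical helpers; only the first 30 bases of the gene sequence are used as example input.

```python
from itertools import product

# Codon table with bases ordered T, C, A, G (same residues as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # any valid start codon initiates with Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_blocks = "GTGGTAAAAA AGAATGTCGG ATTGAGGTTT"
print(translate(first_blocks))  # prints: MVKKNVGLRF
```

The ten residues produced match the first block of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.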