Gene Cthe_0661 details

Gene Information

Locus tag: Cthe_0661
Symbol:
ID: 4808191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 815804
End bp: 817519
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 45%
IMG OID: 640106076
Product: Ricin B lectin
Protein accession: YP_001037089
Protein GI: 125973179
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3507] Beta-xylosidase
TIGRFAM ID:
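
The start, end, gene length and protein length listed above are mutually consistent, which a little arithmetic confirms. A minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the gene length includes one stop codon (the record states neither, and the function names are illustrative only):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(815804, 817519)
print(length)                     # 1716
print(protein_length_aa(length))  # 571
```

Both results match the Gene Length and Protein Length fields of this record.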


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTAAAAA AGAATGTCGG ATTGAGGTTT CTTTCAATCC TGATTCTTAT GGCACTTCTC 
ATTGGAAATG TCCAAAGTTT TAATGTGGCG GCGGCAGAAG GGGTTATAGT CAACGGAACT
CAGTTTAAAG ACACATCGGG AAATGTGATA CATGCCCATG GGGGAGGCAT GTTAAAGCAT
GGTGACTATT ATTACTGGTA CGGTGAATAC CGGGACGACT CCAACTTGTT TTTGGGTGTA
AGTTGCTACA GGTCAAAAGA TCTTGTAAAC TGGGAATACA GAGGAGAAGT GCTGAGCCGA
AATTCCGCTC CTGAACTGAA TCACTGCAAT ATTGAAAGAC CGAAAGTCAT GTACAACGCA
TCAACCGGTG AATTTGTCAT GTGGATGCAC TGGGAGAACG GCATAAACTA CGGTCAGGCA
AGAGCAGCTG TTGCGTATTC CAAAACGCCC GACGGCAAAT TCACATACAT TCGAAGCTTT
CGTCCCATGC AGGATACCGG CGTTATGGAT CATGGCCTTC CGGGATATAT GTCAAGGGAC
TGCAATGTAT TTGTGGACAC TGACGGCAAG GGATATTTTA TATCCGCAGC CAATGAGAAC
ATGGACCTGC ACCTTTATGA GCTGACACCT GACTATAAAA ATATTGCATC CCTTAAGGCA
AAGCTGTTTG TCGGACAGCA GAGGGAAGCA CCATGCCTTA TAAAGAGAAA CGGCTACTAT
TACCTTATTA CTTCCGGTTG TACAGGTTGG AACCCGAATC AGGCTAAATA CGCATATTCC
AAAGATTTGG CCAGTGGCTG GTCCCAGCTT TACAATCTTG GTAATTCAAC CACCTACAGG
TCACAGCCGA CTTTTATCAT TCCCGTTCAG GGAAGCTCGG GAACCAGTTA TCTTTATATG
GGTGACCGTT GGGCCGGTGC CTGGGGAGGA AAGGTTAATG ACTCCCAATA TGTATGGCTT
CCCTTAAACT TCATATCCGA TACAACACTT GAACTGCCCT ATTATGACTC TGTAAAGATT
GATGCTTCTT CAGGAATAAT TTCCGAGTAC ATACCGGACA CTACACGCTA CAAGCTGGTA
AACAAAAACA GCGGAAAAGT CCTGGATGTT CTTGACGGTT CTGTCGATAA TGCAGCCCAG
ATAGTCCAAT GGACCGATAA CGGGTCTTTG AGTCAACAGT GGTACCTTGT GGACGTGGGC
GGTGGTTATA AAAAGATTGT AAATGTAAAG AGCGGAAGAG CCTTGGATGT AAAAGACGAA
TCCAAGGAAG ACGGTGGAGT ATTAATACAA TATACCAGCA ACGGCGGATA TAATCAGCAC
TGGAAATTCA CAGACATAGG TGACGGGTAT TACAAGATTT CCAGCCGCCA CTGCGGAAAA
CTTATAGATG TGCGAAAATG GTCAACGGAA GACGGCGGAA TAATTCAGCA GTGGTCCGAT
GCCGGAGGAA CAAATCAGCA TTGGAAGCTG GTGCTTGTAT CAAGTCCCGA GCCTTCACCA
TCACCTTCTC CCCAAGTGGT TAAAGGAGAT GTAAACGGCG ACTTGAAAGT AAATTCAACG
GATTTTTCCA TGTTAAGAAG ATATTTACTT AAAACCATTG ACAATTTTCC GACAGAAAAC
GGAAAACAGG CTGCCGATTT GAACGGAGAC GGCAGAATAA ACTCTTCGGA TCTTACAATG
CTGAAAAGAT ACTTGCTTAT GGAAGTGGAT TTGTAA
 
Protein sequence
MVKKNVGLRF LSILILMALL IGNVQSFNVA AAEGVIVNGT QFKDTSGNVI HAHGGGMLKH 
GDYYYWYGEY RDDSNLFLGV SCYRSKDLVN WEYRGEVLSR NSAPELNHCN IERPKVMYNA
STGEFVMWMH WENGINYGQA RAAVAYSKTP DGKFTYIRSF RPMQDTGVMD HGLPGYMSRD
CNVFVDTDGK GYFISAANEN MDLHLYELTP DYKNIASLKA KLFVGQQREA PCLIKRNGYY
YLITSGCTGW NPNQAKYAYS KDLASGWSQL YNLGNSTTYR SQPTFIIPVQ GSSGTSYLYM
GDRWAGAWGG KVNDSQYVWL PLNFISDTTL ELPYYDSVKI DASSGIISEY IPDTTRYKLV
NKNSGKVLDV LDGSVDNAAQ IVQWTDNGSL SQQWYLVDVG GGYKKIVNVK SGRALDVKDE
SKEDGGVLIQ YTSNGGYNQH WKFTDIGDGY YKISSRHCGK LIDVRKWSTE DGGIIQQWSD
AGGTNQHWKL VLVSSPEPSP SPSPQVVKGD VNGDLKVNST DFSMLRRYLL KTIDNFPTEN
GKQAADLNGD GRINSSDLTM LKRYLLMEVD L
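
The sequences above are printed in blocks of ten characters, sixty per line. A minimal sketch in Python for joining such blocks back into one string and computing its length and GC content; the helper names are hypothetical, and the input shown is only the first two blocks of the gene sequence:

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


fragment = join_blocks("GTGGTAAAAA AGAATGTCGG")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 40.0
```

Applied to the full gene sequence, `join_blocks` should yield a string of 1716 bases, and `gc_percent` can be compared with the GC content field; how the database rounds that figure is not stated in the record.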