Gene Cthe_2193 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2193 
Symbol 
ID4811058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2613733 
End bp2616579 
Gene Length2847 bp 
Protein Length948 aa 
Translation table11 
GC content48% 
IMG OID640107599 
Productcarbohydrate-binding family 6 protein 
Protein accessionYP_001038588 
Protein GI125974678 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.43679 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGGCAA GTATTAAAAC AAGTATTAAA ATTAGAACAG TTGCATTTGT ATCAATTATT 
GCAATTGCTT TAAGTATTCT AAGTTTTATT CCAAACCGGG CATATGCAAG CCCGCAACGT
GGCCGGCCGC GTCTTAATGC AGCCAGAACC ACGTTTGTAG GAGACAATGG GCAGCCGTTA
CGCGGTCCAT ACACTTCCAC GGAATGGACG GCGGCAGCTC CTTATGACCA GATTGCGAGA
GTTAAGGAAC TGGGATTTAA TGCAGTACAC CTCTACGCAG AATGCTTTGA CCCCAGATAT
CCTGCTCCCG GAAGCAAAGC TCCGGGATAC GCGGTTAATG AAATTGACAA GATAGTGGAA
AGGACTCGTG AACTTGGTCT TTATCTGGTA ATAACCATAG GCAACGGTGC CAATAACGGA
AATCATAACG CGCAATGGGC AAGGGATTTC TGGAAGTTCT ATGCACCGCG TTATGCAAAA
GAAACACATG TATTGTATGA AATACACAAT GAGCCTGTGG CATGGGGACC TCCATATTCT
TCTTCAACGG CCAATCCTCC CGGTGCAGTT GATATGGAAA TTGATGTTTA CAGGATAATC
CGTACTTATG CACCGGAAAC ACCGGTATTA CTTTTCTCCT ATGCAGTATT TGGAGGCAAA
GGGGGAGCGG CCGAAGCCCT GAAAGACATA CGCGCTTTCA ACAAGGCGGT TTTTGGCAAT
GAAAACGCAG TGTGGACTAA CGAAGCTGTG GCGTTTCACG GATATGCAGG TTGGCAGGAA
ACCACCATTG CGGTGGAGGA ACTTTTAAAG GCCGGTTATC CCTGCTTTAT GACTGAGTAT
GCCGGAGGTG CCTGGGGCAG CGGCATGGGA GGATTGGATG TTGAACTTAC CTACGAGCTG
GAACGCCTGG GAGTCTCCTG GCTTACTTTC CAGTACATTC CGCCAACCGG TGTGTCTGAT
GATGTTACAA AGCCGGAATA TTTCTCAGCA TTGGTGGAAA ATTCCGGTCT TTCTTGGACT
CCCGATTACG GGAACTGGCC GGCGGCCCGC GGTGTATACG GCAACGGTGG TCTGGCAAGG
GAAACTGCGA CGTGGATTAA CAACTTCTTA ACCGGTACAA CCCGTATCGA AGCAGAAGAC
TTCGATTGGG GCGGAAACGG GGTTTCGTAT TATGACACGG ATTCGGTGAA TGTTGGAGGA
CAATACCGCC CGGATGAAGG AGTGGATATT GAGAAAACTT CAGACACAGG CGGCGGTTAC
AATGTCGGAT GGATTTCGGA AGGAGAATGG CTTGAATATA CCATAAGAGT TCGGAATCCC
GGATACTATA ACTTGTCGCT CCGTGTGGCA GGCATCAGCG GCAGCAGAGT ACAGGTGAGT
TTCGGAAACC AGGACAAGAC CGGAGTTTGG GAACTGCCTG CTACCGGAGG TTTTCAGACT
TGGACTACAG CCACAAGGCA GGTGTTTCTT GGAGCCGGCC TGCAAAAATT ACGTATCAAT
GCTTTGTCCG GAGGGTTCAA TTTGAATTGG ATTGAACTTT CTCCGATATC AACAGGAACC
ATTCCCGACG GAACATATAA GTTTTTGAAC CGCGCAAATG GAAAGACATT GCAGGAAGTA
ACCGGCAACA ACAGCATAAT AACCGCCGAT TACAAAGGAA TCACGGAACA GCACTGGAAG
ATTCAGCACA TTGGCGGCGG CCAATACAGA ATTTCATCCG CAGGCAGAGG CTGGAACTGG
AACTGGTGGA TGGGTTTTGG AACTGTCGGA TGGTGGGGAA CAGGCTCCAG TACGTGTTTT
ATTATCAGTC CTACGGGTGA CGGTTACTAC AGAATCGTAC TTGTCGGTGA CGGTACAAAC
CTGCAAATAT CCTCAGGTGA TCCGAGCAAG ATAGAGGGAA AGGCTTTTCA TGGTGGAGCC
AATCAGCAGT GGGCAATACT TCCGGTTTCC GCTCCCGCGT TTCCGACAGG GCTAAGTGCG
GTACTTGATT CTTCCGGCAA TACGGCCAAT TTGACATGGA ATGCCGCTCC GGGTGCGAAC
TCTTACAATG TTAAACGTTC CACCAAAAGC GGTGGTCCGT ATACAACTAT TGCCACCAAT
ATCACATCGA CAAACTATAC CGACACCGGT GTGGCAACGG GTACTAAATA CTATTATGTG
GTAAGTGCGG TAAGCAATGG AGTGGAAACC CTCAACAGTG CGGAAGCGAT ACTGCAATAT
CCTAAACTTA CGGGTACCGT TATTGGAACC CAAGGTTCGT GGAATAACAT TGGGAACACA
ATTCACAAAG CTTTTGACGG TGACCTGAAC ACGTTTTTTG ACGGTCCTAC AGCAAACGGC
TGCTGGCTGG GACTGGATTT TGGGGAAGGT GTGAGGAATG TCATTACACA AATTAAATTC
TGCCCGCGTT CCGGCTATGA ACAGCGCATG ATAGGGGGAA TTTTTCAGGG GGCAAATAAA
GAAGATTTCA GCGATGCAGT GACGCTGTTT ACCATTACCT CACTACCAGG CTCCGGTACG
TTAACTTCGG TGGATGTAGA CAATCCAACC GGCTTCCGCT ATGTCCGCTA TTTGTCCCCG
GACGGCAGTA ATGGAAATAT TGCAGAGCTG CAGTTTTTCG GTACACCGGC CGGTGAGGAG
AATGATGATG TGCATTTGGG CGATATAAAC GATGACGGAA ATATAAACTC AACAGACCTT
CAGATGCTAA AAAGGCATTT GCTCCGCAGT ATCCGGCTTA CGGAAAAACA GCTTTTAAAT
GCGGATACAA ACAGAGACGG CAGAGTGGAT TCCACCGACC TTGCTTTATT AAAAAGATAT
ATACTCCGTG TCATAACTAC TTTATAA
 
Protein sequence
MGASIKTSIK IRTVAFVSII AIALSILSFI PNRAYASPQR GRPRLNAART TFVGDNGQPL 
RGPYTSTEWT AAAPYDQIAR VKELGFNAVH LYAECFDPRY PAPGSKAPGY AVNEIDKIVE
RTRELGLYLV ITIGNGANNG NHNAQWARDF WKFYAPRYAK ETHVLYEIHN EPVAWGPPYS
SSTANPPGAV DMEIDVYRII RTYAPETPVL LFSYAVFGGK GGAAEALKDI RAFNKAVFGN
ENAVWTNEAV AFHGYAGWQE TTIAVEELLK AGYPCFMTEY AGGAWGSGMG GLDVELTYEL
ERLGVSWLTF QYIPPTGVSD DVTKPEYFSA LVENSGLSWT PDYGNWPAAR GVYGNGGLAR
ETATWINNFL TGTTRIEAED FDWGGNGVSY YDTDSVNVGG QYRPDEGVDI EKTSDTGGGY
NVGWISEGEW LEYTIRVRNP GYYNLSLRVA GISGSRVQVS FGNQDKTGVW ELPATGGFQT
WTTATRQVFL GAGLQKLRIN ALSGGFNLNW IELSPISTGT IPDGTYKFLN RANGKTLQEV
TGNNSIITAD YKGITEQHWK IQHIGGGQYR ISSAGRGWNW NWWMGFGTVG WWGTGSSTCF
IISPTGDGYY RIVLVGDGTN LQISSGDPSK IEGKAFHGGA NQQWAILPVS APAFPTGLSA
VLDSSGNTAN LTWNAAPGAN SYNVKRSTKS GGPYTTIATN ITSTNYTDTG VATGTKYYYV
VSAVSNGVET LNSAEAILQY PKLTGTVIGT QGSWNNIGNT IHKAFDGDLN TFFDGPTANG
CWLGLDFGEG VRNVITQIKF CPRSGYEQRM IGGIFQGANK EDFSDAVTLF TITSLPGSGT
LTSVDVDNPT GFRYVRYLSP DGSNGNIAEL QFFGTPAGEE NDDVHLGDIN DDGNINSTDL
QMLKRHLLRS IRLTEKQLLN ADTNRDGRVD STDLALLKRY ILRVITTL