Gene Cthe_2809 details


Gene Information

Locus tag: Cthe_2809
Symbol:
ID: 4809646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3310693
End bp: 3314658
Gene Length: 3966 bp
Protein Length: 1321 aa
Translation table: 11
GC content: 42%
IMG OID: 640108229
Product: glycoside hydrolase family protein
Protein accession: YP_001039201
Protein GI: 125975291
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2273] Beta-glucanase/Beta-glucan synthetase
TIGRFAM ID:
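
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, though the reported values are consistent with them. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3310693
end_bp = 3314658
protein_length_aa = 1321

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue
# plus one terminal stop codon.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)            # 3966
print(gene_length_bp % 3)        # 0, so the length is a whole number of codons
print(expected_protein_length)   # 1321
```

Under these assumptions the computed values agree with the reported Gene Length (3966 bp) and Protein Length (1321 aa).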


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.149168
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATAAAA GATTATTGTC GTCAGTACTG ATAATTATGC TGTTATTATC AGCCTGGTCG 
CCAATATCCG TACAAGCTTC TGATGGAATC AATGACATTA GAGGTCATTG GGCTGAAGAA
GACTTGAACA AATGGATGGA AAAAGGTATT TTGGTGGGCT ACCAGGATGG GACGATAAGG
CCCGATAATA ATATCACAAG AGCCGAATTT GTCACATTAA TTAACAAGGT TTTCGGGCTT
TATGAATTAA GCCGGGAGCA ATTCGCAGAT GTTGAAGACT CAAAATGGTA TTCCCGTGAA
ATATTAAAAG CCAGGGCTGC GGGATATATT GCAGGTTATG GAAGCAATGT TTTCAAACCT
GACAATTATA TTACAAGACA AGAAGCCGTT GTTATAATCG CGAAAGTTTT TGAACTTCAA
AGCGGCAGCA ATTATACAAG CAAGTTTAAA GATGGAAGTC TGGTAAAGGA ATACGCAAAA
GATTCCGTTA GCGCGTTGGT TGAAAAAGGC TACATAGCAG GTTATGAAGA TGGCACTTTC
AGGCCGGACA ACTACATTAC CCGTGCAGAA ACAATAAAAA TTCTGAATAA AATTATTCCT
TCCTTGTATA ACGAGAAAGG AGATTATAAA AATGAAGAAG TAGCCGGAAA CGCTCTGATT
AACACCGAAG GAGTTATTTT AAAAGATACC GTAATAAACG GGGATTTGTA TCTTGCTCAG
GGAATTCAGA ACGGCGATGT TACCCTTGAC GGTGTGAATG TAAAAGGAAC GGTTTTCGTA
AATGGTGGAG GAAGCGACAG CATACATTTT ATAAATACGA AAATAAACAG GGTTGTTGTC
AATAAAACAG GAGTTAGAAT TGTAACTTCC GGCAATACCT CGGTTGAAAG TGTTGTCGTT
AAATCCGGTG CAAAACTTGA AGAAAAAGAA TTGACGGGCG ACGGCTTTAA AAACGTTACA
GTCGATTCTC AACTTTCAGC CGGCAATGAA ATAATATTTG TCGGGGATTT TGAACAGGTC
GATGTTCTGG CGGATGATGC CTTGCTGGAA ACCAAAGAGG CAAAAATGAA ACTGAGAATA
TTCGGCCAAA GGATTAAAGT AAATGGAAAG GCAATAGAAA AATCATCAAA GAACTATATT
GTAAACGGGG AACTTATATC AACTGAGGAA GAACCCGGTC CTTCCGACGC ACCCGGTGCG
GAAGACGATC AAAATTCAGG TAGTCCGGGC TCATCGACTA ATCCTGCACC AACCAAGAAT
CCGAATGAAG AGTGGCGTCT GGTTTGGAGC GATGAGTTTA ACGGTTCTGA AATAAATATG
GCTAATTGGA GCTATGACGA CCCGACCAAC GGAAGATGGA ACGGGGAAGT ACAATCCTAC
ACACAAAACA ATGCCTATAT CAAAGACGGC GCGTTGGTTA TTGAAGCAAG AAAAGAAGAC
ATTACGGAAC CAAGCGGTGA GACTTATCAT TATACATCGT CAAAGCTGAT TACCAAAGGC
AAAAAGTCAT GGAAGTACGG AAAATTTGAA ATAAGGGCAA AAATGCCACA GGGACAAGGT
ATATGGCCTG CAATCTGGAT GATGCCGGAA GACGAACCCT TCTACGGAAC ATGGCCAAAG
TGCGGCGAAA TAGATATTAT GGAGCTTTTG GGCCACGAGC CTGATAAAAT TTATGGAACG
ATCCATTTTG GAGAGCCTCA TAAAGAATCC CAGGGAACGT ATACCTTGCC GGAAGGCCAG
ACTTTTGCTG ATGATTTCCA CGTTTATTCG ATTGAATGGG AACCGGGAGA AATACGCTGG
TATATAGACG GCAAGCTGTA TCATGTCGCT AATGACTGGT ACTCGAGGGA CCCGTACCTT
GCCGATGACT ACACTTATCC CGCACCTTTT GACCAGAATT TCTTCTTGAT TCTCAATATA
TCCGTTGGTG GCGGCTGGCC GGGATATCCT GACGAAACGA CAGTTTTCCC GCAGCAAATG
GTTGTGGACT ATGTGAGAGT ATATCAAAAA GATAAATATC CTCACAGGGA AAAACCGGCA
AAGGAAGAAG TGAAGCCAAG AGAGCCTCTT GAGGACGGCA ATTATATCTA TAACGGCGGT
TTTGATGTGG ATGATTCTGC AGCAGTTGGT GTGGACGGTG TTCCCTATAC GTCTTACTGG
ACATTCTTAA CAGCATCCGG TGGAGCTGCG ACAGTCAATG TAGAGGAAGG TGTTATGCAC
GTACAGATAG AAAACGGAGG GACAACCGAC TACGGCGTAC AATTGCTTCA AGCTCCGATT
CATCTTGAAA AAGGCGCAAA ATATAAAGCA TCTTTTGACA TGAAAGCTGA AAATCCAAGG
CAGGTAAAAC TGAAAATAGG CGGAGACGGC GACAGGGGAT GGAAAGATTA TGCGGCTATT
CCACCGTTTA CGGTCTCAAC AGAGATGACC AACTATGAGT TTGAGTTTAC TATGAAAGAT
GATACCGATG TTAAGGCACG GTTTGAGTTT AATATGGGTT TGGACGATAA TGATGTCTGG
ATTGACAATG TTAAACTGAT TAAAACAGAA GATGCGCCGG TTATAGATCC TTCCGAAATA
GCAAGACCTC CGCTTCTTTC CGGCAACTAT ATATACAACG GTACCTTTGA CCAAGGTCCG
AACAGAATGG GATTCTGGAA TTTTGTTGTG GATAGCACTG CAAAGGCTAC ATACTATATT
GGAAGCGATG TTAATGAGCG CAGGTTTGAA ACAAGAATAG AAAAAGGCGG AACATCGAGG
GGAGCCATAA GATTGGTTCA GCCGGGAATT AACATTGAAA ACGGCAAAAC ATACAAGGTT
AGCTTCGAAG CCAGTGCGGC AAATACAAGA ACTATTGAGG TGGAAATTGC AAGCAATCTT
CACAACAGCA GCATTTTTGC GACAACTTTT GAAATAAGCA AAGAGAGCAA GATATACGAA
TTTGAGTTTA CAATGGACAA AGATTCGGAC AAGAACGGAG AACTTAGGTT CAATCTGGGC
GGAAGCAACG TGAACGTCTA TATTGATAAT GTCGTTATGA AAAGAGTAAG TACCGATGAA
GTTGAAGGAA ACCTGATTTT AAACGGCGTA TTTAACGGCC TGGCAGGCTG GGGATATGGA
GCGTATGAAC CTGGATCGGC AGATTTTGAA AGTCATGAGG AACAATTTAG GGCAATTATT
AGCTCTGTCG GTAATGAAGG TTGGAATGTA CAGTTGTATC AGGATAATGT TCCGCTGGAA
CAAGGGCAAA CCTACGAAGT TTCTTTTGAT GCAAAATCAA CGATTGACAG AAAGATAATT
GTTCAGCTGC AAAGGAACGG TACTTCGGAT AATAATTGGG ACTCCTATTT CTATCAAGAA
GTTGAACTTA CTAATGAACT TAAAACATTC AAATATGAAT TTACAATGAG TAAACCTACA
GATTCGGCGT CAAGATTTAA TTTTGCTTTG GGTAATACTG AAAACAAAAC TTATGCTCCT
CATGAAATAA TAATTGACAA TGTTGTAGTA AGAAAAGTTG CGACTCCTTC TGCGCTGATA
TTGAACGGAA CCTTTGACGA TGGAATGGAT CATTGGCTGC TATACTGGGG AGACGGTGAA
GGCAATTGCG ATGTAACTGA CGGAGAGCTT GAAATTAACA TTACCAAGGT AGGTACCGCG
GATTACATGC CGCAGATTAA ACAGGAAAAC ATAGCGTTGC AAGAGGGTGT GACGTATACT
TTGTCTCTTA AAGCGAGAGC GCTTGAGGCA AGAAGTATTA AAGTGGACAT ATTGGATTCT
TCTTATAACT GGTATGGCGG AACTATTTTC GATTTAACAA CGGAAGATGC CGTATACACG
TTTACATTTA CCCAAAGCAA GTCGATAAAT AACGGTGTCT TAACTATAAA TTTAGGTACC
ATAGAAGGTA AGACATCCGC CGCAACTACT GTCTATCTTG ATGATATTTT GCTGGAACAA
CAGTAA
 
Protein sequence
MYKRLLSSVL IIMLLLSAWS PISVQASDGI NDIRGHWAEE DLNKWMEKGI LVGYQDGTIR 
PDNNITRAEF VTLINKVFGL YELSREQFAD VEDSKWYSRE ILKARAAGYI AGYGSNVFKP
DNYITRQEAV VIIAKVFELQ SGSNYTSKFK DGSLVKEYAK DSVSALVEKG YIAGYEDGTF
RPDNYITRAE TIKILNKIIP SLYNEKGDYK NEEVAGNALI NTEGVILKDT VINGDLYLAQ
GIQNGDVTLD GVNVKGTVFV NGGGSDSIHF INTKINRVVV NKTGVRIVTS GNTSVESVVV
KSGAKLEEKE LTGDGFKNVT VDSQLSAGNE IIFVGDFEQV DVLADDALLE TKEAKMKLRI
FGQRIKVNGK AIEKSSKNYI VNGELISTEE EPGPSDAPGA EDDQNSGSPG SSTNPAPTKN
PNEEWRLVWS DEFNGSEINM ANWSYDDPTN GRWNGEVQSY TQNNAYIKDG ALVIEARKED
ITEPSGETYH YTSSKLITKG KKSWKYGKFE IRAKMPQGQG IWPAIWMMPE DEPFYGTWPK
CGEIDIMELL GHEPDKIYGT IHFGEPHKES QGTYTLPEGQ TFADDFHVYS IEWEPGEIRW
YIDGKLYHVA NDWYSRDPYL ADDYTYPAPF DQNFFLILNI SVGGGWPGYP DETTVFPQQM
VVDYVRVYQK DKYPHREKPA KEEVKPREPL EDGNYIYNGG FDVDDSAAVG VDGVPYTSYW
TFLTASGGAA TVNVEEGVMH VQIENGGTTD YGVQLLQAPI HLEKGAKYKA SFDMKAENPR
QVKLKIGGDG DRGWKDYAAI PPFTVSTEMT NYEFEFTMKD DTDVKARFEF NMGLDDNDVW
IDNVKLIKTE DAPVIDPSEI ARPPLLSGNY IYNGTFDQGP NRMGFWNFVV DSTAKATYYI
GSDVNERRFE TRIEKGGTSR GAIRLVQPGI NIENGKTYKV SFEASAANTR TIEVEIASNL
HNSSIFATTF EISKESKIYE FEFTMDKDSD KNGELRFNLG GSNVNVYIDN VVMKRVSTDE
VEGNLILNGV FNGLAGWGYG AYEPGSADFE SHEEQFRAII SSVGNEGWNV QLYQDNVPLE
QGQTYEVSFD AKSTIDRKII VQLQRNGTSD NNWDSYFYQE VELTNELKTF KYEFTMSKPT
DSASRFNFAL GNTENKTYAP HEIIIDNVVV RKVATPSALI LNGTFDDGMD HWLLYWGDGE
GNCDVTDGEL EINITKVGTA DYMPQIKQEN IALQEGVTYT LSLKARALEA RSIKVDILDS
SYNWYGGTIF DLTTEDAVYT FTFTQSKSIN NGVLTINLGT IEGKTSAATT VYLDDILLEQ
Q
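
The protein sequence above should follow from the gene sequence by codon-by-codon translation. Below is a minimal sketch of that check, not the database's own method. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for sense codons (table 11 differs mainly in which start codons it permits). The function name `translate` and the constants are hypothetical, introduced here for illustration.

```python
from itertools import product

# Standard sense-codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGTATAAAA GATTATTGTC GTCAGTACTG"
print(translate(first_blocks))  # MYKRLLSSVL

# Final six bases of the gene sequence: one glutamine codon, then a stop.
print(translate("CAGTAA"))      # Q
```

The two fragments translate to the first ten residues and the last residue of the listed protein. Passing the full gene sequence to the same function would give the complete comparison.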