Gene Cthe_2760 details


Gene Information

Locus tag: Cthe_2760
Symbol:
ID: 4810263
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3255101
End bp: 3257986
Gene Length: 2886 bp
Protein Length: 961 aa
Translation table: 11
GC content: 42%
IMG OID: 640108180
Product: glycoside hydrolase family protein
Protein accession: YP_001039152
Protein GI: 125975242
COG category:
COG ID:
TIGRFAM ID:
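
The coordinate and length fields in this table are related by simple arithmetic, and a short check makes that relationship explicit. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the last codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3255101
end_bp = 3257986
reported_gene_length = 2886      # bp
reported_protein_length = 961    # aa

# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is expected to be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2886 0 961
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.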


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.26535
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGATT CACCACAAAA GAGGATTTTG AAACGAAAAA ACAGGTGTAA AGGGCTTATA 
ACGGCTGTCA TCATTGCAGC TCAGTTACTA ACGTCGAGCA TATTATTTGC AGAAGCACCA
CCTGCGACTT TTACACCTGC TGAAAACTGG GAGGATTATG ATTACTTCAA CTTTGCCGAA
GCCCTGCAGA AGTCGCTGTA TTTTTATGAC GCTCAGAAAT GCGGAGTTGA GGCAGGCTAC
GACCACGGTG GAAGGCTTGA GTGGAGAGGG GCGTGTCATG AAGTCGATGA AAGAATTCCT
ATTTCCAACA CATCTCTTTC TGAAGCTTTT CTGGCAAAAT ACAGACATAT AATCGACCCT
GACGGAGACG GTACTGTGGA TGTACACGGA GGCTTCCATG ATGCAGGTGA CCATGTCCGT
TTCGGTCTTC CCCAAAGCTA TACGGCAGGT ACTTTGGGCT GGGGATTCTA TGAATTCAGA
GAATCCTTCA GGGCAATCGG GGAAGAAGAG CACATGATTG AGATTTTAAG ATACTTTACG
GATACTTTTT TACGCTGTTC ATTTATGGAT GAGGAAGGCA ATATTGTTGC TTTTTGCTAC
ATGGTTGGTG AAGGAGACGA AGACCATTGC TATTGGGGAC CTCCGGAGTT ATACCCTGAA
GAATACTTAA GAAGCAGACC GGCGGATTTT GCAACTTTTG ATGATCCCGG AAGCGATGTT
TGTGCAAGCA CGGCTGCAGC ACTCTGCACA TCATATTTGA ATTTTAAGGA TGAAGACCCT
GAATACGCTG AAAAATGCCT TACCGTTGCC AAGGCACTGT ATGATTTTGC AGTTAAGTAC
AGAGGATTGC ATAAAGGCGA CGGTTACTAC ACCTCGGATT ATGACGAAGA TGAATTGGCA
TGGGCTGCTG TCTGGCTTTA TGAGTGCACC GGAGACATGA AATACATAAA TGACATAGTA
GCTGTTGATG AAACCGGCAA TTATGTGGGA TATATGAAAA GGATAATACC CGACACCTTT
AAACAGAACG TTTGGTATAA TTCATGGGTT CACTGTTGGG ATGCCGTATG GGGAGGAACT
TTCATAAAAC TGAACGAGCT GTTCCCTGAA AATGAATTGT TTGACTTTAT TGCAAGATGG
AATGTTGAGT ATTTGTCAGG CGGAAAATGC CCACACGAGG ATCCTAATGA TCATAATTAT
TGTAAACCAT CTCCTGCCGG CTACACAATG ATAAACGGCT GGGGTTCTGC GCGTTACAAT
GCCGCTGCGC AATTATGCGC TCTTGTATAC ATGAAAAACA ATCCGGACAG GACAGATTTC
GGCGAGTGGG CAAAAAGTCA AATGGAATAC CTTATGGGAA GAAATCCTAT GGGTTATTCG
TACATAGTCG GTTACGGATA TGAAAAAGGC TTGCCTTTTG CAAAGCATCC GCACCACAGA
GCGGCTCACG GCTCAAAAAC AAACAGCATG AACGATCCTG AGGAGCATCG CCATATATTG
TGGGGGGCTC TTGTGGGAGG ACCCGATTTG AATGATTATC ACATTGACTC AACTACCGAG
TACGCTTATA ATGAGGTGGC AGTTGACTAT AACGCTGCCT TTGTAGGCGC GCTGGCAGGA
CTGTATAAAT ATTATGGACA GGGACATGAA CCTATTCCGA ATTTCCCGCC GCTAGAGCCG
GAAACCGACG ATTATTTCTG CGAAGCAAAA ATTGTCCGTG AAACTAAAGA CAGTACACAA
GTTCTTTTAA GAATTCATAA TGAATCGACC CGGCCTCCTC ATTATGAAAC AGGAATGATG
GCAAGATACT TCTTCAATAT AAGCGAGCTT ATTGAAAACG GTCAAAGTAT AGATGATGTA
ATATTTACTA TTGAATATGA TGAACAGATT TCCATGCAGC AGGAACCGGT TGTATACAGA
GGACCTTTTA AATGGGATGA TGCAGGAACA TACTATTTTG AATTTGACTG GAGCGGAAGA
AAAATTTACG GAGACAGGGA GCTTCAGATT TCCTTCAGAG TTAAACAGGA TTCGAATTAC
ATGACCCATT GGGACTCCAG TAATGACTAT AGCAGGCAAG GCCTTACAAA TGAATATGCA
ATATCCAAGA ATGTGCCCGT ATATCTGAAT GGTGTAAAAG TTTACGGTGA AGAACCGCCA
AAGCTTTCTC CGACTCCGAC TCCAACAATT GATCCCAGCC AAACTCCGGA TGCTAATGCT
TCAATCAGTG TATCATACAA GTGCGGAGTT AAGGATGGTA CGAAAAATAC TATAAGAGCT
ACAATAAATA TTAAGAATAC CGGAACCACT CCTGTGAATT TATCGGATAT CAAGGTTCGA
TACTGGTTTA CAAGTGACGG AAACGAACAG AATAACTTTG TGTGCGATTA TGCGGCTTTT
GGAACGGACA AAGTAAAAGG TATTGTGAAA AAGATAGAAA ACTCTGTCCC TGGTGCTGAT
ACGTATTGTG AAATCTCATT TACTGAGGAT GCGGGTAGGC TTGCACCCGG AGGAAGCACA
GGAACAATAC CTTTCAGAAT TGAGGGTGCG GCAGAGTATG ACCAGACAGA TGATTATTCC
TATAATTCTG AAATGTCAGA TGATTTTGGG GATAACACCA AGATTACTGC TTATATAAAA
GATAAACTCA AATATGGAGT TGAGCCTGTT ACAATAATTG ATATTACATT GGGTGACCTG
AACTATGACG GTAAAGTTAA CTCTACAGAC TATTTAGTTT TGAAAAGGTA TTTGCTTGGA
ACAATTGACA AAGAATCAGA TCCTAACTTC CTGAAAGCCG CAGATCTTAA CAGGGATGGA
CGTGTTAATT CGACAGACAT GTCGTTAATG AAACGTTATC TTCTTGGCAT AATAACGTCT
TTTTAG
 
Protein sequence
MTDSPQKRIL KRKNRCKGLI TAVIIAAQLL TSSILFAEAP PATFTPAENW EDYDYFNFAE 
ALQKSLYFYD AQKCGVEAGY DHGGRLEWRG ACHEVDERIP ISNTSLSEAF LAKYRHIIDP
DGDGTVDVHG GFHDAGDHVR FGLPQSYTAG TLGWGFYEFR ESFRAIGEEE HMIEILRYFT
DTFLRCSFMD EEGNIVAFCY MVGEGDEDHC YWGPPELYPE EYLRSRPADF ATFDDPGSDV
CASTAAALCT SYLNFKDEDP EYAEKCLTVA KALYDFAVKY RGLHKGDGYY TSDYDEDELA
WAAVWLYECT GDMKYINDIV AVDETGNYVG YMKRIIPDTF KQNVWYNSWV HCWDAVWGGT
FIKLNELFPE NELFDFIARW NVEYLSGGKC PHEDPNDHNY CKPSPAGYTM INGWGSARYN
AAAQLCALVY MKNNPDRTDF GEWAKSQMEY LMGRNPMGYS YIVGYGYEKG LPFAKHPHHR
AAHGSKTNSM NDPEEHRHIL WGALVGGPDL NDYHIDSTTE YAYNEVAVDY NAAFVGALAG
LYKYYGQGHE PIPNFPPLEP ETDDYFCEAK IVRETKDSTQ VLLRIHNEST RPPHYETGMM
ARYFFNISEL IENGQSIDDV IFTIEYDEQI SMQQEPVVYR GPFKWDDAGT YYFEFDWSGR
KIYGDRELQI SFRVKQDSNY MTHWDSSNDY SRQGLTNEYA ISKNVPVYLN GVKVYGEEPP
KLSPTPTPTI DPSQTPDANA SISVSYKCGV KDGTKNTIRA TINIKNTGTT PVNLSDIKVR
YWFTSDGNEQ NNFVCDYAAF GTDKVKGIVK KIENSVPGAD TYCEISFTED AGRLAPGGST
GTIPFRIEGA AEYDQTDDYS YNSEMSDDFG DNTKITAYIK DKLKYGVEPV TIIDITLGDL
NYDGKVNSTD YLVLKRYLLG TIDKESDPNF LKAADLNRDG RVNSTDMSLM KRYLLGIITS
F
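
The gene and protein sequences above can be cross-checked against each other, and against the GC content field, with a few lines of code. The following is a minimal sketch and not the method used to build this record. The helper names `clean`, `gc_fraction` and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which this sketch does not model), and it assumes the input contains only A, C, G and T once whitespace is removed.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on any codon containing a character other than A, C, G, T.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence, first three blocks only.
first_blocks = "ATGACTGATT CACCACAAAA GAGGATTTTG"
print(translate(first_blocks))  # prints: MTDSPQKRIL
```

The ten residues printed match the start of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison for the whole protein and for the reported GC content.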