Gene Cthe_0625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0625 
Symbol 
ID4808227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp770401 
End bp772533 
Gene Length2133 bp 
Protein Length710 aa 
Translation table11 
GC content44% 
IMG OID640106039 
Productglycoside hydrolase family protein 
Protein accessionYP_001037053 
Protein GI125973143 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAAAA AGACATTATG CTTTGTACTG ACTTTGGCTA TGCTGACGGC ATTTATTCTT 
CCTCAGGGGA TTGTGTCCGC AGCAGGAAGC TATAACTATG CGGAAGCACT TCAGAAAGCC
ATTTACTTTT ATGAGTGTCA GCAGGCCGGC CCTCTACCTG AATGGAACCG CGTTGAGTGG
CGTGGCGACG CAACAATGAA TGATGAGGTA CTTGGTGGAT GGTATGACGC AGGTGACCAT
GTCAAGTTTA ATCTGCCTAT GGCGTATTCG GCGGCAATGC TTGGCTGGGC TCTTTATGAG
TATGGCGATG ACATTGAGGC ATCGGGGCAG AGACTTCATC TTGAAAGGAA CCTTGCCTTT
GCCCTTGACT ATCTTGTTGC CTGCGACAGA GGTGACAGTG TCGTTTATCA GATAGGTGAC
GGTGCCGCTG ACCATAAATG GTGGGGTTCT GCGGAAGTTA TTGAAAAAGA AATGACAAGA
CCTTACTTTG TAGGAAAGGG ATCCGCCGTT GTAGGTCAGA TGGCTGCAGC TTTGGCTGTA
GGTTCCATAG TTCTTAAAAA TGATACATAC CTCAGATATG CGAAGAAGTA TTTCGAACTT
GCAGATGCAA CAAGAAGTGA CAGCACTTAT ACTGCTGCAA ATGGTTTCTA CAGTTCCCAC
AGCGGATTCT GGGATGAGCT GTTGTGGGCT TCCACTTGGC TCTATCTTGC AACAGGTGAT
AGAAATTATC TTGATAAAGC TGAGTCCTAT ATTCCAAAAT TAAACCGTCA GAATCAGACC
ACAGATATAG AATATCAGTG GGCACATTGC TGGGATGACT GCCACTATGG AGCAATGATC
TTGCTTGCAA GAGCTACAGG TAAAGAAGAG TATCACAAAT TTGCACAAAT GCATCTGGAT
TGGTGGACAC CTCAAGGTTA TAACGGAAAG AGAGTTGCAT ATACTCCCGG CGGACTTGCG
CATCTTGATA CCTGGGGACC GTTGAGATAT GCTACAACTG AAGCATTCCT CGCTTTTGTA
TATGCCGATT CAATAAATGA CCCGGCTCTC AAGCAAAAAT ATTATAATTT TGCGAAAAGC
CAGATTGACT ATGCATTGGG TTCAAATCCT GACAACAGAA GCTATGTAGT CGGATTTGGA
AACAATCCGC CACAGCGTCC TCACCACAGA ACCGCTCATG GAACTTGGTT GGATAAAAGA
GATATTCCGG AAAAGCACAG ACATGTACTT TACGGTGCTC TGGTCGGAGG ACCCGGAAGA
GATGACAGTT ATGAAGACAA TATAGAGGAT TATGTAAAAA ATGAAGTTGC CTGCGACTAC
AATGCAGGTT TTGTAGGCGC GCTCTGCAGA TTGACTGCTG AATACGGCGG AACTCCTCTT
GCGAACTTCC CGCCACCGGA ACAAAGAGAT GATGAGTTCT TCGTAGAAGC GGCTATAAAT
CAGGCAAGTG ATCATTTCAC TGAAATAAAA GCATTGCTCA ACAACCGTTC ATCCTGGCCG
GCAAGACTTA TTAAGGACCT TTCATACAAC TATTATATGG ATTTGACTGA AGTTTTTGAG
GCAGGTTACA GTGTTGACGA TATTAAAGTA ACAATAGGCT ATTGCGAAAG CGGTATGGAT
GTCGAGATTT CGCCGATTAC TCATTTGTAT GACAATATTT ATTACATAAA AATATCATAT
ATCGACGGAA CCAATATTTG TCCGATAGGT CAGGAACAGT ATGCCGCTGA GCTTCAGTTC
CGTATTGCGG CACCTCAAGG TACTAAATTC TGGGATCCGA CAAATGACTT CTCATATCAG
GGACTTACCA GAGAGTTGGC AAAGACAAAA TATATGCCCG TTTTTGACGG AGCAACAAAA
ATCTTTGGAG AAGTTCCAGG CGGCTTTGAA CCGGTTCCTT CACCTTCGCC GACTCCTGCT
CAATATAAAG TCGGTGACTT AAACGGTGAC GGAGTGGTTA ATTCAACTGA CAGTGTAATA
TTGAAAAGAC ATATAATTAA ATTTTCTGAA ATAACAGATC CAGTTAAATT GAAAGCTGCT
GATCTTAACG GAGATGGCAA TATAAACTCC AGCGATGTTT CATTAATGAA GAGATATCTG
CTCCGTATAA TAGATAAATT TCCGGTAGAA TAG
 
Protein sequence
MVKKTLCFVL TLAMLTAFIL PQGIVSAAGS YNYAEALQKA IYFYECQQAG PLPEWNRVEW 
RGDATMNDEV LGGWYDAGDH VKFNLPMAYS AAMLGWALYE YGDDIEASGQ RLHLERNLAF
ALDYLVACDR GDSVVYQIGD GAADHKWWGS AEVIEKEMTR PYFVGKGSAV VGQMAAALAV
GSIVLKNDTY LRYAKKYFEL ADATRSDSTY TAANGFYSSH SGFWDELLWA STWLYLATGD
RNYLDKAESY IPKLNRQNQT TDIEYQWAHC WDDCHYGAMI LLARATGKEE YHKFAQMHLD
WWTPQGYNGK RVAYTPGGLA HLDTWGPLRY ATTEAFLAFV YADSINDPAL KQKYYNFAKS
QIDYALGSNP DNRSYVVGFG NNPPQRPHHR TAHGTWLDKR DIPEKHRHVL YGALVGGPGR
DDSYEDNIED YVKNEVACDY NAGFVGALCR LTAEYGGTPL ANFPPPEQRD DEFFVEAAIN
QASDHFTEIK ALLNNRSSWP ARLIKDLSYN YYMDLTEVFE AGYSVDDIKV TIGYCESGMD
VEISPITHLY DNIYYIKISY IDGTNICPIG QEQYAAELQF RIAAPQGTKF WDPTNDFSYQ
GLTRELAKTK YMPVFDGATK IFGEVPGGFE PVPSPSPTPA QYKVGDLNGD GVVNSTDSVI
LKRHIIKFSE ITDPVKLKAA DLNGDGNINS SDVSLMKRYL LRIIDKFPVE