Gene Cthe_0405 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0405 
Symbol 
ID4808408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp505876 
End bp507456 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content43% 
IMG OID640105819 
Productglycoside hydrolase family protein 
Protein accessionYP_001036836 
Protein GI125972926 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000478388 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGAAAGG TTAAGGCCTT GTTGTTGGGA TTGATTGTAT TGGCTGTAGC TTTGTTACCT 
ACAGTGTCCT TTAAGTCACC GACTGTTGCG GCCGATCCGA ACAATGACGA CTGGCTGCAT
GTTGAAGGTA ACAAAATAGT GGACATGTAC GGTAATCAGG TCTGGCTGAC CGGCTGCAAC
TGGTTTGGAT TCAATACCGG TACCAATGTG TTTGACGGAG TATGGAGCTG CAATATGAGA
GAAGCCCTCA AGGGTATGGC GGACAGAGGA ATAAATTTTT TGAGAATACC TATTTCAACA
GAATTGCTGT ATCAATGGTC TCAAGGAATA TATCCCAAAG CAAATGTTAA TGATTTTGTA
AATCCGGAGC TGAAAGGAAA GAACAGCCTT GAGCTTTTTG ACTTTGCCGT TCAGTGCTGC
AAAGAATTCG GAATAAAGAT AATGGTGGAT ATACACAGTC CGGCAACAGA TGCCATGGGG
CATATGTATC CTTTATGGTA TGACGGTCAA TTTACAACAG AGATATGGAT TTCAACTTTG
GAGTGGTTGA CGGAAAGATA TAAAAATGAT GACACAATTC TTGCACTGGA CCTTAAAAAT
GAGCCTCACG GCACCCCGGG CAGCGAATTA ATGGCCAAAT GGGATGGTTC CACGGATTTG
AACAACTGGA AGCATGCTGC TGAAACATGC GCAAAGAGAA TCCTTGCAAT AAATCCGAAT
ATTCTTATTG TGGTAGAAGG AGTGGAAGTT TATCCAAAGC CTGGCTATGA TTATACCGCA
GTGGACGAAT GGGGAAAAGA GAGTAAATAT TTCTATAACT GGTGGGGAGG AAATTTAAGA
GGAGTCAGGG ATTATCCCAT TGACCTTGGC AAGCATCAGA AGCAGCTTGT ATACTCACCT
CACGATTACG GTCCCCTCGT ACATAAACAA CCTTGGTTCT ATGAAGGCTT TAACAAAGAA
ACTTTGTATA ATGATTGCTG GAGAGATAAC TGGGCATACA TACACGAGGA AAACATCGCT
CCTCTGATAG TGGGTGAATG GGGAGGTTTC ATGGACCGCG GAGACAACGA GAAATGGATG
AAAGCGCTGA GAGATTATAT GATTGAGAAT AAAATATCCC ACACTTTTTG GTGCTATAAT
GCAAATTCCG GTGATACCGG AGGACTTGTA TACTATGATT TTATTACCTG GGACGAAGAA
AAATATGCTC TTCTGAAGCC TGCATTATGG CAGACAGAGG ACGGAAAGTT TATAGGCCTT
GACCATCAGA TACCTCTTGG TTCAAATGGA ATTACCGTAA CTGAATATTA TGGCGGCTAT
ATTCCGGAAC CGTCACCGAC TGCTACTGTT CCAGACGTAC CGACACCGTC GCATTCTTTC
GAAATAGAGA AGGGGGATGT AAACGGTGAC GGTAATGTTA ATTCAACAGA TGTTGTATGG
CTTAGGAGAT TTTTGCTAAA ATTGGTCGAG GATTTTCCTG TACCTTCCGG AAAACAGGCG
GCGGATATGA ATGATGACGG GAATATCAAT TCTACCGATA TGATAGCCTT AAAGAGGAAA
GTGCTTAAAA TACCAATATA A
 
Protein sequence
MRKVKALLLG LIVLAVALLP TVSFKSPTVA ADPNNDDWLH VEGNKIVDMY GNQVWLTGCN 
WFGFNTGTNV FDGVWSCNMR EALKGMADRG INFLRIPIST ELLYQWSQGI YPKANVNDFV
NPELKGKNSL ELFDFAVQCC KEFGIKIMVD IHSPATDAMG HMYPLWYDGQ FTTEIWISTL
EWLTERYKND DTILALDLKN EPHGTPGSEL MAKWDGSTDL NNWKHAAETC AKRILAINPN
ILIVVEGVEV YPKPGYDYTA VDEWGKESKY FYNWWGGNLR GVRDYPIDLG KHQKQLVYSP
HDYGPLVHKQ PWFYEGFNKE TLYNDCWRDN WAYIHEENIA PLIVGEWGGF MDRGDNEKWM
KALRDYMIEN KISHTFWCYN ANSGDTGGLV YYDFITWDEE KYALLKPALW QTEDGKFIGL
DHQIPLGSNG ITVTEYYGGY IPEPSPTATV PDVPTPSHSF EIEKGDVNGD GNVNSTDVVW
LRRFLLKLVE DFPVPSGKQA ADMNDDGNIN STDMIALKRK VLKIPI