Gene Cthe_1428 details


Gene Information

Locus tag: Cthe_1428
Symbol:
ID: 4810578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1748095
End bp: 1749423
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 42%
IMG OID: 640106851
Product: glycoside hydrolase family protein
Protein accession: YP_001037852
Protein GI: 125973942
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID:
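
The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is a minimal illustration and not part of the record. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1748095
END_BP = 1749423


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1329
print(protein_length(length_bp))  # 442
```

Both results agree with the Gene Length and Protein Length fields of the record.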


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.574986
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTACTT TTAAGCTTAA TGATGAATTC ATGTTTGGAA CCGCTACCGC AAGTACTCAG 
ATTGAGGGTG GGGATACGGG AAATACATGG TATAAGTGGT GCCAGGAAGG ACGTATCAAG
GACTCCAGCA GCTGTATCAC TGCATGTGAC CATTGGAACA GGGTGGAGGA GGATACGGAG
CTGTTGAAGA ACTTGGGAGT TCAAACCCAC AGAATGAGTC TTGAGTGGAG CAGAATAGAG
CCTTCCAGGG GCAAATTTTC CGATGACGCA ATGAAACATT ACAGAGATGA GATTAAGCTT
TTGGTGGAGA ACAACATAAA GCCTCTGGTT ACGCTTCATC ACTTTTCCGA GCCCATTTGG
TTTCATGAAA TGGGGGGATG GAAAAAAACG GGCAATGCAG ATATTTTTAT AGAATATGTG
AAGTATGTGG TTGAAAATTT GGGTGACCTT GTAAGCGACT GGGTAACCTT TAACGAGCCC
AATGTCTATG TTGATTTTGG TTATGTAATC GGCATTTTCC CTCCGGGGGA AAGAAGCCTG
TCTGAAGGGT TAAAGGTTAC GGCAGAGCTT ATAAACACCC ATGTAAAACT ATACCGGCTG
ATACATAGGA TAAGAAGAGA GCGCAAATTT GCAGGCAGGA CAATGGTAGG AACGGCAATG
CACCTTCGCA TCTTTGACGG GATAAGTTCT ACCGGAAAAA TGATAGCCAA AGTTGTAGAT
TATCTGTTTA ACGAAATGTT TATGGAAGGC ATGACGACAG GGCACATGAT GTTTCCTCTT
TCCAAAAAGG GTTCAAGCCA TAAAAAAGGC AGGTATGCGG ATTTTTTGGG AATTAATTAT
TATACAAGAA ATATTGTTGA GTTCGTATTT GACCCGTCCC TTTATTTTCA CGAGCTTGTA
TGTGACAAGG ATTTGACCAA ATCGGACCTC GGGTGGGACA TATATCCGGA AGGCATATAC
AAAGTATGCA AGAGGTACTA TAAGAAATAT AAACTTCCCA TTTATATAAC CGAAAACGGA
ATAAGCGATA AAAATGACAC CAAACGGCCG AGCTTTATTG CCAGCCATCT TGCTTATATT
GCAAAAGCCA TAAAAGAAGG GATTCCGATA GAACGGTATT ATTACTGGAC GCTGATGGAT
AACTTCGAAT GGCTTGAAGG TGAGTCAACG GATTTCGGCC TTTACGACTG CAATTTCCGC
ACGCAGGAGA GGATACCGAG AAAAAGCGTC CGGCTTTATG AGCAAATATG CAGAAGAAAA
GAATTAACCG CGGAGATGAT TGAGGATTTT AAGAAGTACA GCGGGATTAC TATAGAAACA
ATCCGGTGA
 
Protein sequence
MVTFKLNDEF MFGTATASTQ IEGGDTGNTW YKWCQEGRIK DSSSCITACD HWNRVEEDTE 
LLKNLGVQTH RMSLEWSRIE PSRGKFSDDA MKHYRDEIKL LVENNIKPLV TLHHFSEPIW
FHEMGGWKKT GNADIFIEYV KYVVENLGDL VSDWVTFNEP NVYVDFGYVI GIFPPGERSL
SEGLKVTAEL INTHVKLYRL IHRIRRERKF AGRTMVGTAM HLRIFDGISS TGKMIAKVVD
YLFNEMFMEG MTTGHMMFPL SKKGSSHKKG RYADFLGINY YTRNIVEFVF DPSLYFHELV
CDKDLTKSDL GWDIYPEGIY KVCKRYYKKY KLPIYITENG ISDKNDTKRP SFIASHLAYI
AKAIKEGIPI ERYYYWTLMD NFEWLEGEST DFGLYDCNFR TQERIPRKSV RLYEQICRRK
ELTAEMIEDF KKYSGITIET IR
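
The relationship between the two sequences above can be illustrated with a short translation sketch. It is a hedged example and not the database's own method. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares for elongation (table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard codon assignments, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTTACTT TTAAGCTTAA TGATGAATTC ATGTTTGGAA CCGCTACCGC AAGTACTCAG"
print(translate(first_line))   # MVTFKLNDEFMFGTATASTQ

# Last line of the gene sequence: two codons followed by the TGA stop.
print(translate("ATCCGGTGA"))  # IR
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final two residues.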