Gene Cthe_2807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2807 
Symbol 
ID4809644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3308398 
End bp3309429 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content39% 
IMG OID640108227 
Productglycoside hydrolase family protein 
Protein accessionYP_001039199 
Protein GI125975289 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000151011 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAGTT TTAAAGCAGG TATAAATTTA GGCGGATGGA TATCACAATA TCAAGTTTTC 
AGCAAAGAGC ATTTCGATAC ATTCATTACG GAGAAGGACA TTGAAACTAT TGCAGAAGCA
GGGTTTGACC ATGTCAGACT GCCTTTTGAT TATCCAATTA TCGAGTCTGA TGACAATGTG
GGAGAATATA AAGAAGATGG GCTTTCTTAT ATTGACCGGT GCCTTGAGTG GTGTAAAAAA
TACAATTTGG GGCTTGTGTT GGATATGCAT CACGCTCCCG GGTACCGCTT TCAAGATTTT
AAGACAAGCA CCTTGTTTGA AGATCCGAAC CAGCAAAAGA GATTTGTTGA CATATGGAGA
TTTTTAGCCA AGCGTTACAT AAATGAACGG GAACATATTG CCTTTGAACT GTTAAATGAA
GTTGTTGAGC CTGACAGTAC CCGCTGGAAC AAGTTGATGC TTGAGTGTGT AAAAGCAATC
AGGGAAATTG ATTCCACCAG GTGGCTTTAC ATTGGGGGCA ATAACTATAA CAGTCCTGAT
GAGCTTAAAA ACCTTGCAGA TATTGATGAT GATTACATAG TTTACAATTT CCATTTTTAC
AATCCTTTTT TCTTTACGCA TCAGAAAGCC CACTGGTCGG AAAGTGCCAT GGCGTACAAC
AGGACTGTAA AATATCCGGG ACAATATGAG GGAATTGAAG AGTTTGTGAA AAATAATCCT
AAGTACAGTT TTATGATGGA ATTGAATAAC CTGAAGCTGA ATAAAGAGCT TTTGCGCAAA
GATTTAAAAC CAGCAATTGA GTTCAGGGAA AAGAAAAAAT GCAAACTATA TTGCGGGGAG
TTTGGCGTAA TTGCCATTGC TGACCTGGAG TCCAGGATAA AATGGCATGA AGATTATATA
AGTCTTCTAG AGGAGTATGA TATCGGCGGC GCGGTGTGGA ACTACAAAAA AATGGATTTT
GAAATTTATA ATGAGGATAG AAAACCTGTC TCGCAAGAAT TGGTAAATAT ACTGGCGAGA
AGAAAAACTT GA
 
Protein sequence
MVSFKAGINL GGWISQYQVF SKEHFDTFIT EKDIETIAEA GFDHVRLPFD YPIIESDDNV 
GEYKEDGLSY IDRCLEWCKK YNLGLVLDMH HAPGYRFQDF KTSTLFEDPN QQKRFVDIWR
FLAKRYINER EHIAFELLNE VVEPDSTRWN KLMLECVKAI REIDSTRWLY IGGNNYNSPD
ELKNLADIDD DYIVYNFHFY NPFFFTHQKA HWSESAMAYN RTVKYPGQYE GIEEFVKNNP
KYSFMMELNN LKLNKELLRK DLKPAIEFRE KKKCKLYCGE FGVIAIADLE SRIKWHEDYI
SLLEEYDIGG AVWNYKKMDF EIYNEDRKPV SQELVNILAR RKT