Gene Cthe_0322 details

Gene Information

Locus tag: Cthe_0322
Symbol:
ID: 4808540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 405713
End bp: 407047
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 44%
IMG OID: 640105733
Product: glycoside hydrolase family protein
Protein accession: YP_001036753
Protein GI: 125972843
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
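
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(405713, 407047)
print(length)                     # 1335
print(protein_length_aa(length))  # 444
```

Both values agree with the Gene Length and Protein Length fields of this record.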


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00637507
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA GAGTTCATAT TCTAATAATG ACCTTAATGC TAATTGTGGC TTGTGTCGGA 
TGCAGCGGCA GTTCCGATAA TCCGCCGCCT GCCAACAGTT TGCATTCGGC ATCAACCATA
CCGATATCAG CTATGCCTTC TTTTTCGACT CCTTCACCTA CTGTTGAAGA TACTGAAAAT
CAACAATCGA AAACAGTGAA TTACTTGATA AATTCCATGA CCTTGGAGGA AAAGATCGGA
CAGATTTTCA TTGTTGCCTT TAGAAAAGGC AAATCTTCAC GCCCGTTAAA AGTATTGGAC
AATTCTACAA AACTGAAGAT TCAGAACTTT AATCCCGGTG GTATAATACT TTTCAGCGAA
AACATAGACA CCATACCGCA GACACAAAAA CTCATTCGTG ATATGCAAGA GGCAAGCAAA
ATTCCCATGT TCATCGCTGT TGACGAAGAA GGCGGGCGAA TAGCAAGAAT CGGAAACAAT
CCGAAAATGC ATTCCACAAA AATACCCTCC GCCCAGACAA TAGGACTTGC CGATGACCCC
GAACTTGCGT ATGAGGCAGG CAGGATACTG GGTGCGGAGC TGTCTGCCCT TGGCTTTAAC
ATGAATTTCG CACCGGTGGC CGATGTAAAT ACAAATCCGG ACAATCCTGT TATCGGAGAC
AGATCTTTTG GTTCCGACCC CTATAAAGTC GGCTTAATGG TACAGGCGAT GTCCAAAGGT
ATGCAGGAGC AAAATGTCTG CACCGTGCTA AAGCACTTTC CCGGTCATGG CGACACCTCC
TACGACTCGC ATCTTGGTCA GGTGGTAATA AATCACGACA TTGAAAGGCT TCGCCAAATA
GAACTGACAC CTTTTAAAAT GGGCATCAAG GCCGGCGCCG ACGGCGTAAT GACGGCTCAT
ATCATAATGC CCAACATTAC CGGCAGCAAT CTGCCCGCAA CTCTGTCTGA AGAAATCCTC
AGCGGGCTTT TGAGAAATGA GTTAAAACAC GAAAAGCTCA TTATAACCGA TGCAATGGAA
ATGAAGGCAA TCAGCAACTA CTGGTCTTCT TCCAAGGCTG CGGTCATGGC ATTTAAAGCA
GGAGCAGACA TCATACTCAT GCCCGAATCA TTTGAAGAAG CCTATAACGG TATTCTCAAA
GCCGTAAAAG ATGGTGAAAT AACGGAAGAA AGGCTAAATC AGTCCCTGCA AAGAATTCTC
GCTCTAAAAT TTGAAAGAAA CATACTTGCA AATAAAGAAA GCTCCGTCGA CCCTGAAAAA
GTATTGGGCA GACAAGAGCA TACTGACATA GTAGTGAAAA TCATGCAAAA GGCAGAAGAG
CAAAATATCC CTTAA
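
The gene sequence above is printed in blocks of ten bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence listing, as printed in the record.
first_line = "ATGAAAAAAA GAGTTCATAT TCTAATAATG ACCTTAATGC TAATTGTGGC TTGTGTCGGA"
seq = join_blocks(first_line)
print(len(seq))
print(f"{gc_fraction(seq):.0%}")
```

Running the same functions on the full listing gives a value to compare against the GC content field.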
 
Protein sequence
MKKRVHILIM TLMLIVACVG CSGSSDNPPP ANSLHSASTI PISAMPSFST PSPTVEDTEN 
QQSKTVNYLI NSMTLEEKIG QIFIVAFRKG KSSRPLKVLD NSTKLKIQNF NPGGIILFSE
NIDTIPQTQK LIRDMQEASK IPMFIAVDEE GGRIARIGNN PKMHSTKIPS AQTIGLADDP
ELAYEAGRIL GAELSALGFN MNFAPVADVN TNPDNPVIGD RSFGSDPYKV GLMVQAMSKG
MQEQNVCTVL KHFPGHGDTS YDSHLGQVVI NHDIERLRQI ELTPFKMGIK AGADGVMTAH
IIMPNITGSN LPATLSEEIL SGLLRNELKH EKLIITDAME MKAISNYWSS SKAAVMAFKA
GADIILMPES FEEAYNGILK AVKDGEITEE RLNQSLQRIL ALKFERNILA NKESSVDPEK
VLGRQEHTDI VVKIMQKAEE QNIP
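
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the amino acid assignments of translation table 11 match the standard genetic code (table 11 differs mainly in which codons may serve as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAAAAAAA GAGTTCATAT T"))  # MKKRVHI
print(translate("CAAAATATCC CTTAA"))         # QNIP
```

The two outputs match the first seven and last four residues of the protein sequence listed above.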