Gene Cthe_0032 details

Gene Information

Locus tag: Cthe_0032
Symbol:
ID: 4808797
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 38313
End bp: 40085
Gene Length: 1773 bp
Protein Length: 590 aa
Translation table: 11
GC content: 41%
IMG OID: 640105441
Product: glycoside hydrolase family protein
Protein accession: YP_001036466
Protein GI: 125972556
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4124] Beta-mannanase
TIGRFAM ID:
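
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 38313
END_BP = 40085
GENE_LENGTH_BP = 1773
PROTEIN_LENGTH_AA = 590


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # 40085 - 38313 + 1 = 1773 bp
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # 1773 / 3 = 591 codons, minus the stop codon = 590 aa
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks print True, matching the reported gene length of 1773 bp and protein length of 590 aa.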


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00146021
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGAAAA TTTTTGGCAG GACACTGAGT CTGCTGGTAA CATTTGCAAT GGTGTTTTCC 
GTTCTTTTAG TCATGCCCGT TTCAACTTAT GCTGCATATT CCCTTCCTGT GGACGTTGAA
GCAGAAGATT GCACTCTTGG CAACGGTGCC GTTGTTACCA CCAATGTATA CGGAACTCAA
TATCCCGGAT ATTCCGGCGA CGGATTCGTA TGGGTGGCCA ACTCGGGAAC GATAACATTG
GAAGTCACCA TTCCTGAAAA CGGTATGTAT GAGCTTTCCA CAAGATGCTG GATGTATCTT
GGCAAAGAAG ATGAGACCAG AATGCAGGTT ATAAGCATCA ACGGAAAATC ACACAGCAAC
TATTTTATTC CAAACAAAGG CCAATGGATT GATTACAGTT TCGGATTCTT CTATCTTGAG
GCCGGTAAAG CAACTATTGA GATAGGTTCC TCCGGAAGCT GGGGCTTTAT ACTGTACGAC
AAAATATACT TTGACCATGC TGACATGCCC GATCATATAA TTGACCCGAC TCCGTGTGAT
CCAAATGCAA CTCCTGAAAC AAGAGCTCTT ATGAAATACC TTACCAGCGT GTACGGAAAA
TATGTTATTT CCGGCCAGCA GGAGATTTAC GGAAACGGAA ACGACGGCAA TTATGAACTT
GAATTCGATT ATATTTATGA GAAGACAGGC AAATATCCTG CAATCAGAGG CTTTGATTTC
ATGAACTACA ATCCTCTGTA CGGATGGGAA GACGGTACAA CGGCACGTAT AATCGACTGG
GTAAAAAATC GCGGCGGTAT TGCAACAGCA TGCTGGCATA TAAATATTCC CAGGGATTTT
GCAAGTTATA AACTCGGTGA GCCGGTGGAT TGGACAAACT GTACATACAA ACCGACAAGC
AGCTTTAATA CCGCAAACTG CCTTGATGAA ACAACAAAAG AACATGCTTA CCTGATGATG
GCAATTGAAG ACCTTGCAGA GCAGCTTTTA ATTCTTCAGG AGCAAAACAT TCCTATACTT
TTCCGTCCGT TCCATGAAGC TGAAGGCTAC AACAACACCG ACGGCTCCGG CGCATGGTTC
TGGTGGGGTT CTGCAGGTGC TGAAGTTTAC AAGGAACTCT GGAAACTTCT TTATAAAACT
CTTACCGAAA AATACGGCAT TCATAATTTG ATATGGGAAG TAAACCTTTA TACATATGCC
AATTCTTATG AATGGTATCC CGGCGATGAG TATGTGGACA TTATCGGATA CGACAAATAT
GAAGGTTCAC CCAATACCTG GGGCACAAGC GCCGCATCAT CATTATTCCT TACACTTGTA
AATTACACAA ACGACACAAA GATGGTTGCA TTGACTGAAA ATGACGTTAT TCCCGATATT
CAAAATATAG TTAATGAGGA AGCCTGGTGG CTGTATTTCT GCCCATGGTA CGGTGATTTC
CTTATGAGTC CCAGATACAA CGACCCCGTA CTTTTGAACA CTATCTACAA CAGTGAATAT
GTAATCACCT TGGATGAACT TCCGGAAAAC CTTTATGAAT ATGATGGTGA AATACCGGAT
ATCAACTACG GCGATTTGAA CAATGACGGA AATATAAACT CAACCGATTA TATGATACTG
AAGAAATATA TTTTAAAAGT TCTTGAAAGA ATGAATGTCC CTGAAAAAGC AGCAGATTTA
AACGGTGACG GTTCAATCAA TTCAACCGAT TTGACAATAT TAAAAAGATT TATAATGAAA
GCAATTACAA AATTTCCCGT TACACAAAAG TAA
 
Protein sequence
MGKIFGRTLS LLVTFAMVFS VLLVMPVSTY AAYSLPVDVE AEDCTLGNGA VVTTNVYGTQ 
YPGYSGDGFV WVANSGTITL EVTIPENGMY ELSTRCWMYL GKEDETRMQV ISINGKSHSN
YFIPNKGQWI DYSFGFFYLE AGKATIEIGS SGSWGFILYD KIYFDHADMP DHIIDPTPCD
PNATPETRAL MKYLTSVYGK YVISGQQEIY GNGNDGNYEL EFDYIYEKTG KYPAIRGFDF
MNYNPLYGWE DGTTARIIDW VKNRGGIATA CWHINIPRDF ASYKLGEPVD WTNCTYKPTS
SFNTANCLDE TTKEHAYLMM AIEDLAEQLL ILQEQNIPIL FRPFHEAEGY NNTDGSGAWF
WWGSAGAEVY KELWKLLYKT LTEKYGIHNL IWEVNLYTYA NSYEWYPGDE YVDIIGYDKY
EGSPNTWGTS AASSLFLTLV NYTNDTKMVA LTENDVIPDI QNIVNEEAWW LYFCPWYGDF
LMSPRYNDPV LLNTIYNSEY VITLDELPEN LYEYDGEIPD INYGDLNNDG NINSTDYMIL
KKYILKVLER MNVPEKAADL NGDGSINSTD LTILKRFIMK AITKFPVTQK
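
The protein sequence corresponds to the gene sequence translated codon by codon. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the sequence is given 5' to 3' on the coding strand, strips the space-separated blocks of ten used above, and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are those of the
# standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First and last rows of the gene sequence from this record.
    print(translate("ATGGGGAAAA TTTTTGGCAG GACACTGAGT"))      # MGKIFGRTLS
    print(translate("GCAATTACAA AATTTCCCGT TACACAAAAG TAA"))  # AITKFPVTQK
```

The two printed fragments match the first and last ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.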