Gene Cthe_2811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2811 
Symbol 
ID4809648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3316953 
End bp3318728 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content44% 
IMG OID640108231 
Productglycoside hydrolase family protein 
Protein accessionYP_001039203 
Protein GI125975293 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4124] Beta-mannanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAACA AAACTTCGAT AAAAGTTTTC CTGGCATTGA TGCTTGCGGT TCTTATGCCG 
GTGGTTTCAT GCATCAATGT GTCAAATGCA GTGCTTTCTG ACGGGGATAA GTATGAGTTT
GAAGACGGTA TCCATAAAGG TGCCCAGATT TATACCGACT ATGTCGGCCA AAATGAATAC
GGCGAGGTAT TCGACCTTAC CGGAAGCACA TGCAGCTTTA TTGCACAAAA GGGTACAAGC
ACTTCGGTGA ATGTTGAGGT TGACAAGGAA GGACTTTATG AAATATTCAT CTGCTATGTG
CAGCCTTACG ACAAAAACAA AAAAGTGCAG TATCTTAATG TAAACGGTGT CAATCAGGGA
GAAATAAGCT TTCCGTTTAC TCTTAAATGG AGAGAGATTT CAGCAGGAAT TGTAAAGCTT
AATGCAGGTA TCAACAATAT TGAACTTGAA AGCTACTGGG GTTATACCTA TTTTGATTAT
CTGATTGTCA AACCTGCGGA TGAAAGCATT GTAGAACTTA AGGTTCCGAA AAAATTGGTA
AATCCCAATG CCACAAAAGA AGCTAAAGCC CTTATGAGTT ATCTGGTTGA TATTTACGGT
AAACACATCC TTTCGGGTCA GCAGGAGATC TGCGGTTCCC ACAACTATCC GGGATCTGAG
GCGGAGTTTA CATACATACA GGAAAAGACC GGGAAACTGC CTGCGATAAG AGGTTTTGAC
TTTATGAACT ACAGAGGGAA CGGTTTGATG TGGGACGACC AGTGTGCCGA GCGTGTTATC
GAATGGTACA AGGAAAAAGG CGGTATACCG ACGGTTTGCT GGCACTGGTT CTCGCCCGGT
GATATTGGAA AAAAAGCGGA CAACAGTTTC TATACAGAAA GTACGACTTT CAGCATATCA
AGGGCTTTGA CTCCCGGAAC GGAGGAAAAT ATTGCACTGC TTAACGATAT CGACACCATG
GCCAGAAAGC TCAAGCAGGT TCAGGATGCC GGAGTTCCGG TACTGTTCAG ACCGCTCCAT
GAGGCGGAAG GTGGATGGTT CTGGTGGGGA GCCGAAGGTC CGGAGCCTTG TGTCAGACTG
TACAGGCTGC TCTATGACAA ATTTACCAAT GAATATGGTT TGAACAATCT TATCTGGGTT
TGGACTTCAT ATGATTATGA AACCTCGGCT GCATGGTATC CGGGCGATGA TGTGGTGGAT
ATCATCGGTT ACGACAAATA TAATGCAAAA GACGGAAAAC CCAATGGAAG TGCGATTTCA
TCCACATTCT ACAATCTGGT GAAACTTACT AATGGCAAAA AGTTGGTTGC GATGACTGAA
AATGATACAA TTCCGAGAGT TTCAAACCTT GTAAATGAAA AAGCAGGATG GCTTTATTTC
TGTCCGTGGT ACGGCTGGTG GCTGACAAGC GAACAGAACA ATCCTGTGGA TTGGCTTGTT
GAAATGTATC AGAGCGATTA CTGCATAACT TTGGACGAGC TTCCTGACTT AAAGAATTAT
CCTATATCGG ATTATGAGGA TTCCAATCCG GATCCGTCTC CGACACCTAC ACAGCCGCCG
AAAATTACCT ATGGAGATTT GAACGGAGAC GGCAAAGTCA ATTCAACGGA CCTGACAATT
ATGAAAAGAT ATATTCTCAA AAACTTTGAT AAGCTAGCCG TCCCTGAAGA AGCGGCTGAC
CTGAACGGGG ACGGAAGGAT AAACTCAACG GACCTTTCGA TACTACACAG GTATCTGCTT
CGCATAATAA CGAGTTTTCC CGTTGAACAA CAGTAG
 
Protein sequence
MENKTSIKVF LALMLAVLMP VVSCINVSNA VLSDGDKYEF EDGIHKGAQI YTDYVGQNEY 
GEVFDLTGST CSFIAQKGTS TSVNVEVDKE GLYEIFICYV QPYDKNKKVQ YLNVNGVNQG
EISFPFTLKW REISAGIVKL NAGINNIELE SYWGYTYFDY LIVKPADESI VELKVPKKLV
NPNATKEAKA LMSYLVDIYG KHILSGQQEI CGSHNYPGSE AEFTYIQEKT GKLPAIRGFD
FMNYRGNGLM WDDQCAERVI EWYKEKGGIP TVCWHWFSPG DIGKKADNSF YTESTTFSIS
RALTPGTEEN IALLNDIDTM ARKLKQVQDA GVPVLFRPLH EAEGGWFWWG AEGPEPCVRL
YRLLYDKFTN EYGLNNLIWV WTSYDYETSA AWYPGDDVVD IIGYDKYNAK DGKPNGSAIS
STFYNLVKLT NGKKLVAMTE NDTIPRVSNL VNEKAGWLYF CPWYGWWLTS EQNNPVDWLV
EMYQSDYCIT LDELPDLKNY PISDYEDSNP DPSPTPTQPP KITYGDLNGD GKVNSTDLTI
MKRYILKNFD KLAVPEEAAD LNGDGRINST DLSILHRYLL RIITSFPVEQ Q