Gene Ccel_2337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2337 
Symbol 
ID7311012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2749414 
End bp2751033 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content40% 
IMG OID643609265 
Productglycoside hydrolase family 5 
Protein accessionYP_002506653 
Protein GI220929744 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA CTATTAGTGT ATTTCTTGCA TTAATAATGC TTTTGACATT ATTAATACCA 
TCAGTTACAA AGGTTTCAGC AGCTGAGCCC GGTGTAGCAG AATCCGGTGA TGACTGGCTG
CATGTTGAAG GAACAAATAT TGTAGACAAA TATGGCAACA AGGTATGGAT TACAGGTGCC
AACTGGTTTG GTTTCAATTG CCGCGAAAGA ATGCTTTTGG ATTCATATCA CAGTAATATT
GTTGCCGATA TCGAAATTGT TGCCGACAAA GGAATTAACG TTGTCAGAAT GCCGATTGCA
ACAGACCTGC TATATGCGTG GAGTAAAGGC GAATATCCTG CTTCTACGGA TACAAGCTAC
AACAATGCTG ATCTTACAGG CTTGAATAGC TTTGAATTAT TCAATTTTAT GCTGGATAAC
TTTAAAAGGG TTGGTATCAA GGTTATTCTT GACGTACATA GTGCTGAGAC CGACAATATG
GGACATACCT ACCCGTTATG GTATAACGGC ACCATAACAG AGGAAGTCTT CAAAGAAGCC
TGGGTTTGGG TTGCTAACCA CTATAAAAAC GATGATACTA TTATTGGTTT TGATTTGAAA
AATGAGCCCC ACACAAATAC AGGTACTTTA AAAATGAAAT CCCAAAGTGC TATCTGGGAT
GACTCCACAC ATGCAAACAA TTGGAAAAGA GTAGCACAAG AAACTGCCCT TGCTATAATG
AAGGTTCATC CTAATGCATT AATTTTTGTT GAAGGCGTTG AAATGTACCC TAAAGATGGT
TTATGGAATG ATGAATCCTT TGATACAAGT CCATGGACAG GCACCAATGA TTATTATGGA
AACTGGTGGG GCGGCAACCT TAGGGGTGTA AAGGATTATC CAATTAATCT GGGAGCATAT
CAGAAGCAGC TTGTGTATTC ACCTCATGAT TACGGCCCTA TGGTTTTCGA GCAGGAGTGG
TTCAAGGGTA GTTTCCCAAC TTGTGATGAT GCTACAGCAA AGAAAATACT TTATGAACAG
TGTTGGAGGG ACAATTGGGC TTATATAATG GAAAACGGAA CAAGCCCACT GCTTATAGGT
GAATGGGGAG GCCTTACTGA AGGAGAAGAC AAGCTTCTGG AGGCCAATAA GAAATATCTC
AGAAGTATGA GAGATTACAT TTTAGAAAAC AAATACCAGC TTCATCACAC TTTCTGGTGT
ATAAATATTG ACTCTGCGGA TACAGGAGGA CTTCTGACAC GTGGTGAGGG AACTGCTTTC
CCGGGTGGAA GGGACCTTAA ATGGAATGAC AATAAATATG ATAACTATTT ATACCCTGTG
CTATGGAAAA ACAGCGAAGG AAAATTTATC GGCTTGGATC ATAAAATTCC TCTTGGAAAA
AACGGTGTGT TACTGGGCAG TCCCGATGAT GAGCCAACTA TAAACTATGG AGATATTAAC
AAAGACGGAC AAATTGATGC TCTTGACGTT ATTGCATTGA AGTCATATAT TCTAGGCATA
AACCAGAATA TAGACACACA GGCAGCTGAC CTTAACAAGG ACAGCTCAAT AGATGCGTTG
GATATGCAGA TTTTGAAAAG GTATCTATTG GGTCAGGTGA CTCAACTGCC GTTAGGTTAA
 
Protein sequence
MKKTISVFLA LIMLLTLLIP SVTKVSAAEP GVAESGDDWL HVEGTNIVDK YGNKVWITGA 
NWFGFNCRER MLLDSYHSNI VADIEIVADK GINVVRMPIA TDLLYAWSKG EYPASTDTSY
NNADLTGLNS FELFNFMLDN FKRVGIKVIL DVHSAETDNM GHTYPLWYNG TITEEVFKEA
WVWVANHYKN DDTIIGFDLK NEPHTNTGTL KMKSQSAIWD DSTHANNWKR VAQETALAIM
KVHPNALIFV EGVEMYPKDG LWNDESFDTS PWTGTNDYYG NWWGGNLRGV KDYPINLGAY
QKQLVYSPHD YGPMVFEQEW FKGSFPTCDD ATAKKILYEQ CWRDNWAYIM ENGTSPLLIG
EWGGLTEGED KLLEANKKYL RSMRDYILEN KYQLHHTFWC INIDSADTGG LLTRGEGTAF
PGGRDLKWND NKYDNYLYPV LWKNSEGKFI GLDHKIPLGK NGVLLGSPDD EPTINYGDIN
KDGQIDALDV IALKSYILGI NQNIDTQAAD LNKDSSIDAL DMQILKRYLL GQVTQLPLG