Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2337 |
Symbol | |
ID | 7311012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 2749414 |
End bp | 2751033 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643609265 |
Product | glycoside hydrolase family 5 |
Protein accession | YP_002506653 |
Protein GI | 220929744 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CTATTAGTGT ATTTCTTGCA TTAATAATGC TTTTGACATT ATTAATACCA TCAGTTACAA AGGTTTCAGC AGCTGAGCCC GGTGTAGCAG AATCCGGTGA TGACTGGCTG CATGTTGAAG GAACAAATAT TGTAGACAAA TATGGCAACA AGGTATGGAT TACAGGTGCC AACTGGTTTG GTTTCAATTG CCGCGAAAGA ATGCTTTTGG ATTCATATCA CAGTAATATT GTTGCCGATA TCGAAATTGT TGCCGACAAA GGAATTAACG TTGTCAGAAT GCCGATTGCA ACAGACCTGC TATATGCGTG GAGTAAAGGC GAATATCCTG CTTCTACGGA TACAAGCTAC AACAATGCTG ATCTTACAGG CTTGAATAGC TTTGAATTAT TCAATTTTAT GCTGGATAAC TTTAAAAGGG TTGGTATCAA GGTTATTCTT GACGTACATA GTGCTGAGAC CGACAATATG GGACATACCT ACCCGTTATG GTATAACGGC ACCATAACAG AGGAAGTCTT CAAAGAAGCC TGGGTTTGGG TTGCTAACCA CTATAAAAAC GATGATACTA TTATTGGTTT TGATTTGAAA AATGAGCCCC ACACAAATAC AGGTACTTTA AAAATGAAAT CCCAAAGTGC TATCTGGGAT GACTCCACAC ATGCAAACAA TTGGAAAAGA GTAGCACAAG AAACTGCCCT TGCTATAATG AAGGTTCATC CTAATGCATT AATTTTTGTT GAAGGCGTTG AAATGTACCC TAAAGATGGT TTATGGAATG ATGAATCCTT TGATACAAGT CCATGGACAG GCACCAATGA TTATTATGGA AACTGGTGGG GCGGCAACCT TAGGGGTGTA AAGGATTATC CAATTAATCT GGGAGCATAT CAGAAGCAGC TTGTGTATTC ACCTCATGAT TACGGCCCTA TGGTTTTCGA GCAGGAGTGG TTCAAGGGTA GTTTCCCAAC TTGTGATGAT GCTACAGCAA AGAAAATACT TTATGAACAG TGTTGGAGGG ACAATTGGGC TTATATAATG GAAAACGGAA CAAGCCCACT GCTTATAGGT GAATGGGGAG GCCTTACTGA AGGAGAAGAC AAGCTTCTGG AGGCCAATAA GAAATATCTC AGAAGTATGA GAGATTACAT TTTAGAAAAC AAATACCAGC TTCATCACAC TTTCTGGTGT ATAAATATTG ACTCTGCGGA TACAGGAGGA CTTCTGACAC GTGGTGAGGG AACTGCTTTC CCGGGTGGAA GGGACCTTAA ATGGAATGAC AATAAATATG ATAACTATTT ATACCCTGTG CTATGGAAAA ACAGCGAAGG AAAATTTATC GGCTTGGATC ATAAAATTCC TCTTGGAAAA AACGGTGTGT TACTGGGCAG TCCCGATGAT GAGCCAACTA TAAACTATGG AGATATTAAC AAAGACGGAC AAATTGATGC TCTTGACGTT ATTGCATTGA AGTCATATAT TCTAGGCATA AACCAGAATA TAGACACACA GGCAGCTGAC CTTAACAAGG ACAGCTCAAT AGATGCGTTG GATATGCAGA TTTTGAAAAG GTATCTATTG GGTCAGGTGA CTCAACTGCC GTTAGGTTAA
|
Protein sequence | MKKTISVFLA LIMLLTLLIP SVTKVSAAEP GVAESGDDWL HVEGTNIVDK YGNKVWITGA NWFGFNCRER MLLDSYHSNI VADIEIVADK GINVVRMPIA TDLLYAWSKG EYPASTDTSY NNADLTGLNS FELFNFMLDN FKRVGIKVIL DVHSAETDNM GHTYPLWYNG TITEEVFKEA WVWVANHYKN DDTIIGFDLK NEPHTNTGTL KMKSQSAIWD DSTHANNWKR VAQETALAIM KVHPNALIFV EGVEMYPKDG LWNDESFDTS PWTGTNDYYG NWWGGNLRGV KDYPINLGAY QKQLVYSPHD YGPMVFEQEW FKGSFPTCDD ATAKKILYEQ CWRDNWAYIM ENGTSPLLIG EWGGLTEGED KLLEANKKYL RSMRDYILEN KYQLHHTFWC INIDSADTGG LLTRGEGTAF PGGRDLKWND NKYDNYLYPV LWKNSEGKFI GLDHKIPLGK NGVLLGSPDD EPTINYGDIN KDGQIDALDV IALKSYILGI NQNIDTQAAD LNKDSSIDAL DMQILKRYLL GQVTQLPLG
|
| |