Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0840 |
Symbol | |
ID | 7309687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 956466 |
End bp | 958220 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643607780 |
Product | glycoside hydrolase family 5 |
Protein accession | YP_002505196 |
Protein GI | 220928287 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA TATTAGCACT GATTATATCG TGCAGTATTA TTATGAGCTT TTTACCTATG TCTGTCTATG GTGCTATCAA TTCTCAGGAT ATGGTAAAAA AGATGGGAAT AGGAATGAAC CTTGGAAATA CGTTTGACGC ACCAACGGAA GGTTCTTGGT CAAAGGCAGC TCAGGAGTAC TATTTTGATG ATTTTAAACA GGCCGGTTTC AAGCATGTTA GAATTCCAAT CCGTTGGGAT CAGCATACAC TTGCAAACAG TCCGTATACT GTTGACAGCA ACTTCTTAAA CAGAATTGAA ACTGTAATAG ACTGGTCACT TTCCCGTGGA TTTGTGACGG TAATTAATTC TCATCATGAT ACTTGGCTTA TGGATAACTA TAGTCAGAAT ATCGGACGGT TTGAAAAAAT ATGGGAACAA ATTGCACAAC GTTTTAAAGG CAAGTCTGAA AATCTGGTTT TTGAAATATT AAATGAGCCA CATGGTAATA TAACAGATAG CCAGATAAAC GATATGAACA AAAGAATTTT AAATATAATA AGAAAGACTA ATCCGACTAG AAATGTAATT ATTGGCGCAG GTTACTGGAA TAGCTATAAT TCATTAAGTC AGTTGGAAAT TCCAAACGAC CCAAATCTTA TTGCAACTTT TCACTACTAT GACCCATACT CTTTCACTCA CCAGTGGCAG GGTACATGGG GAACCAAAAA TGATATGGAT GCCATAGCAA TGGTTTTTAA CCATGTTAAA AAATGGTCAG ATAAAAATAA CATACCTGTA TATCTTGGGG AATATGGTGT AATGGGACAT TCCGATAGAA CATCTGCTGT AAAATGGTTT GATTTTGTAA GTGATCAGGC TATATCACAT GGATTTTCTT GTGGAGCTTG GGATAATGGA GTATTTGGCT CTGTTGACAA TGATATGGCA TTTTATAATA GAGATACCAG ACAGTTTGAT AAGGAAATCT TAAATGCAAT ATTAACTACT GGGACAACCT ATGATTGGAC TCCTCCAACT GAAACAAATC CAGATCCTCC TCGTACACCT GCAACACCAG CCTACGGGGA ACAGCTTATT GAAGATTTCG AAGGTGCTAT GCAGTGGGCT GCATATTCCG GTGTTGATGC TACTGCATCA TGTAAGATTT CTAGTGGCAA ATCAAATAAC GGTTTGGAAA TAACATATGC AGGTTCATCA AATGGGTACT GGGGTGTTGT AGACAATGAA CATAGAAATC AAGATTGGGA GAAATGGCAA AAAATATCCT TTGATATAAA ATCCTCAAAC ACAAATGAAG TTAGACTACT AATAGCTGAA CAAAGTAAGA TTGAAGGAGA AGATGGAGAA CACTGGACTT ATGTCATAAA ACCTAGTACC TCTTGGACTA CGATTGAGAT TCCATTTTCA TCTTTTACAA AAAGAATGGA TTATCAGCCA CCTGCACAAG ATGGTAGTGA AACTTTTGAC TTATACAAAG TAGGTTCCTT GCATTTTATG TATAGTAACA GTAATTCAGG TACTTTAAAT ATTGATAATA TTAAGTTGAT TGGCTTACCA GAAGAACAAA TTGGAGGAAA AATTGGAGAT GTTAATGAAG ATGGCAATAT TGATGCAATA GATTTTGCAT TGTTAAAAAA ATACTTGCTA GACTCATCTA TTAGCATAAA TAAAGTGAAT GCAGACATTA ATTTAGATGG AGATATCAAT GCCATTGACT TTGCAAAGTT GAAAATGATG TTACTTGGAG ACTAA
|
Protein sequence | MKKILALIIS CSIIMSFLPM SVYGAINSQD MVKKMGIGMN LGNTFDAPTE GSWSKAAQEY YFDDFKQAGF KHVRIPIRWD QHTLANSPYT VDSNFLNRIE TVIDWSLSRG FVTVINSHHD TWLMDNYSQN IGRFEKIWEQ IAQRFKGKSE NLVFEILNEP HGNITDSQIN DMNKRILNII RKTNPTRNVI IGAGYWNSYN SLSQLEIPND PNLIATFHYY DPYSFTHQWQ GTWGTKNDMD AIAMVFNHVK KWSDKNNIPV YLGEYGVMGH SDRTSAVKWF DFVSDQAISH GFSCGAWDNG VFGSVDNDMA FYNRDTRQFD KEILNAILTT GTTYDWTPPT ETNPDPPRTP ATPAYGEQLI EDFEGAMQWA AYSGVDATAS CKISSGKSNN GLEITYAGSS NGYWGVVDNE HRNQDWEKWQ KISFDIKSSN TNEVRLLIAE QSKIEGEDGE HWTYVIKPST SWTTIEIPFS SFTKRMDYQP PAQDGSETFD LYKVGSLHFM YSNSNSGTLN IDNIKLIGLP EEQIGGKIGD VNEDGNIDAI DFALLKKYLL DSSISINKVN ADINLDGDIN AIDFAKLKMM LLGD
|
| |