Gene Ccel_0840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0840 
Symbol 
ID7309687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp956466 
End bp958220 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content36% 
IMG OID643607780 
Productglycoside hydrolase family 5 
Protein accessionYP_002505196 
Protein GI220928287 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA TATTAGCACT GATTATATCG TGCAGTATTA TTATGAGCTT TTTACCTATG 
TCTGTCTATG GTGCTATCAA TTCTCAGGAT ATGGTAAAAA AGATGGGAAT AGGAATGAAC
CTTGGAAATA CGTTTGACGC ACCAACGGAA GGTTCTTGGT CAAAGGCAGC TCAGGAGTAC
TATTTTGATG ATTTTAAACA GGCCGGTTTC AAGCATGTTA GAATTCCAAT CCGTTGGGAT
CAGCATACAC TTGCAAACAG TCCGTATACT GTTGACAGCA ACTTCTTAAA CAGAATTGAA
ACTGTAATAG ACTGGTCACT TTCCCGTGGA TTTGTGACGG TAATTAATTC TCATCATGAT
ACTTGGCTTA TGGATAACTA TAGTCAGAAT ATCGGACGGT TTGAAAAAAT ATGGGAACAA
ATTGCACAAC GTTTTAAAGG CAAGTCTGAA AATCTGGTTT TTGAAATATT AAATGAGCCA
CATGGTAATA TAACAGATAG CCAGATAAAC GATATGAACA AAAGAATTTT AAATATAATA
AGAAAGACTA ATCCGACTAG AAATGTAATT ATTGGCGCAG GTTACTGGAA TAGCTATAAT
TCATTAAGTC AGTTGGAAAT TCCAAACGAC CCAAATCTTA TTGCAACTTT TCACTACTAT
GACCCATACT CTTTCACTCA CCAGTGGCAG GGTACATGGG GAACCAAAAA TGATATGGAT
GCCATAGCAA TGGTTTTTAA CCATGTTAAA AAATGGTCAG ATAAAAATAA CATACCTGTA
TATCTTGGGG AATATGGTGT AATGGGACAT TCCGATAGAA CATCTGCTGT AAAATGGTTT
GATTTTGTAA GTGATCAGGC TATATCACAT GGATTTTCTT GTGGAGCTTG GGATAATGGA
GTATTTGGCT CTGTTGACAA TGATATGGCA TTTTATAATA GAGATACCAG ACAGTTTGAT
AAGGAAATCT TAAATGCAAT ATTAACTACT GGGACAACCT ATGATTGGAC TCCTCCAACT
GAAACAAATC CAGATCCTCC TCGTACACCT GCAACACCAG CCTACGGGGA ACAGCTTATT
GAAGATTTCG AAGGTGCTAT GCAGTGGGCT GCATATTCCG GTGTTGATGC TACTGCATCA
TGTAAGATTT CTAGTGGCAA ATCAAATAAC GGTTTGGAAA TAACATATGC AGGTTCATCA
AATGGGTACT GGGGTGTTGT AGACAATGAA CATAGAAATC AAGATTGGGA GAAATGGCAA
AAAATATCCT TTGATATAAA ATCCTCAAAC ACAAATGAAG TTAGACTACT AATAGCTGAA
CAAAGTAAGA TTGAAGGAGA AGATGGAGAA CACTGGACTT ATGTCATAAA ACCTAGTACC
TCTTGGACTA CGATTGAGAT TCCATTTTCA TCTTTTACAA AAAGAATGGA TTATCAGCCA
CCTGCACAAG ATGGTAGTGA AACTTTTGAC TTATACAAAG TAGGTTCCTT GCATTTTATG
TATAGTAACA GTAATTCAGG TACTTTAAAT ATTGATAATA TTAAGTTGAT TGGCTTACCA
GAAGAACAAA TTGGAGGAAA AATTGGAGAT GTTAATGAAG ATGGCAATAT TGATGCAATA
GATTTTGCAT TGTTAAAAAA ATACTTGCTA GACTCATCTA TTAGCATAAA TAAAGTGAAT
GCAGACATTA ATTTAGATGG AGATATCAAT GCCATTGACT TTGCAAAGTT GAAAATGATG
TTACTTGGAG ACTAA
 
Protein sequence
MKKILALIIS CSIIMSFLPM SVYGAINSQD MVKKMGIGMN LGNTFDAPTE GSWSKAAQEY 
YFDDFKQAGF KHVRIPIRWD QHTLANSPYT VDSNFLNRIE TVIDWSLSRG FVTVINSHHD
TWLMDNYSQN IGRFEKIWEQ IAQRFKGKSE NLVFEILNEP HGNITDSQIN DMNKRILNII
RKTNPTRNVI IGAGYWNSYN SLSQLEIPND PNLIATFHYY DPYSFTHQWQ GTWGTKNDMD
AIAMVFNHVK KWSDKNNIPV YLGEYGVMGH SDRTSAVKWF DFVSDQAISH GFSCGAWDNG
VFGSVDNDMA FYNRDTRQFD KEILNAILTT GTTYDWTPPT ETNPDPPRTP ATPAYGEQLI
EDFEGAMQWA AYSGVDATAS CKISSGKSNN GLEITYAGSS NGYWGVVDNE HRNQDWEKWQ
KISFDIKSSN TNEVRLLIAE QSKIEGEDGE HWTYVIKPST SWTTIEIPFS SFTKRMDYQP
PAQDGSETFD LYKVGSLHFM YSNSNSGTLN IDNIKLIGLP EEQIGGKIGD VNEDGNIDAI
DFALLKKYLL DSSISINKVN ADINLDGDIN AIDFAKLKMM LLGD