Gene Ccel_0740 details


Gene Information

Locus tag: Ccel_0740
Symbol:
ID: 7309593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 862504
End bp: 864108
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 40%
IMG OID: 643607678
Product: glycoside hydrolase family 5
Protein accession: YP_002505098
Protein GI: 220928189
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
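
The coordinate and length fields above are consistent with one another, which a short Python check can illustrate. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(862504, 864108)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under those assumptions the results agree with the Gene Length (1605 bp) and Protein Length (534 aa) fields.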


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.209978
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA GAATAGTTTC AATGCTTATG GCATTGGCGA TAACTTCAAC TATGATATTG 
TCAAGCCATG GTCTTACTGC TGCAGCAGCA GTAGATACAA ATAATGATGA CTGGCTTCAT
TGTGTAGGTG ACAAGATATA TGATATGAAC GGGCGTGAAG TTTGGCTTAC TGGTGCAAAC
TGGTTTGGTT TTAACTGCAG CGAGAATGTA TTCCATGGTG CATGGTACGA TGTTAAGAAT
ATATTGACAA GCGTGGCAGA CAGAGGAATA GGGTTACTGA GAGTACCAAT TTCAACCGAG
CTTTTATACA GCTGGATGAC TGGTAAACCA AACAAGGTTT CCAGTGTTAC AGCAAGTAAC
AATCCGCCGT ATACGGTAGT AAACCCTGAT TTTTATGATC CTGCAACTGA TGGACCTAAA
AACAGTATGG AAATATTCGA TATAATAATG AAGTATTGCA AAGAGTTGGG AATCAAGGTA
ATGATTGATG TTCACAGCCC TGATGCCAAT AACTCCGGTC ACATGTATCC GTTATGGTAT
GGTCTTGAAA CGACAACAGC AGGTATGATT ACTACAGACA AGTGGATAGA CACATTGACA
TGGCTTGCAG GCAAATATAA GAACGATGAT ACAATACTTG CCATAGACCT GAAAAATGAG
CCTCATGGAA AAAGAGGATA TACTAATGCG GCACCAACTG ATATGGCAAA ATGGGATAAC
ACTACAGATG AAAATAACTG GAAATATGCA GCTGAAAGGT GTTCAAAAGA AATATTGGCT
GTAAATCCAA AATTGTTAAT AATGATTGAA GGTATTGAGC AATACCCAAA AACTGAAAAG
GGCTATACAT TCGACACTCC TGATGTCTGG GGTGCTTCCG GTGACGCTGC TCCATGGCAT
GGCGGATGGT GGGGCGGAAA TCTGAGAGGT GTGAAAGATT ACCCTATTGA TCTAGGCCCA
TTAAACAGTC AGATAGTATA TTCACCACAT GATTACGGTC CTTCTGTATA TAATCAATCA
TGGTTTGACA AGGATTTCAC AACACAAACC CTTCTTGATG ATTACTGGTA TGACACATGG
GCATACATAG ACGATCAAAA AATTGCTCCT CTTCTTATAG GTGAATGGGG AGGATTCATG
GATGGTGCAA AGAACCAGAA GTGGATGACC TTATTAAGAG ATTACATGAT TAAGAACCGT
ATCAACCATA CCTTCTGGTG CTTAAATCCT AACTCAGGTG ATACAGGCGG ATTGATAGGA
AACGACTGGT CAACATGGGA TGAAGAAAAG TATGGATTAC TTAAGCCTGC ATTGTGGCAG
TCAGGCGGTA AGTTTATCGG ACTTGACCAT CAGATTCCTC TCGGTAAAAA CGGAATGTCT
CTAGGAGAGT ACTACGGAGA TGGACCAATT ATCGTAGACC CTGAGCCGAT TGTCGGAGAT
GTAAACGATG ATGGAAATGT TGATGCATTG GATTTAGCTG TAATGAAGCA GTATATCCTA
GGTGCAAACC CCAAAATAAG CTTAACGAAT TCAGATATGA ATTCAGACGG AGACATAAAT
GCCCTTGACT TTGCAATGTT AAAGGCAAAG GTTCTTGGTA AATAA
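
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. As a minimal sketch (the helper names are hypothetical), the Python below cleans a block-formatted sequence and computes its GC fraction. Only the first line of the sequence is used here, so the result describes that 60 bp fragment and not the whole gene; pasting in all lines of the sequence would give the gene-wide value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGAAAAAAA GAATAGTTTC AATGCTTATG GCATTGGCGA TAACTTCAAC TATGATATTG"
fragment = clean_sequence(first_line)
print(len(fragment), round(gc_fraction(fragment), 2))
```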
 
Protein sequence
MKKRIVSMLM ALAITSTMIL SSHGLTAAAA VDTNNDDWLH CVGDKIYDMN GREVWLTGAN 
WFGFNCSENV FHGAWYDVKN ILTSVADRGI GLLRVPISTE LLYSWMTGKP NKVSSVTASN
NPPYTVVNPD FYDPATDGPK NSMEIFDIIM KYCKELGIKV MIDVHSPDAN NSGHMYPLWY
GLETTTAGMI TTDKWIDTLT WLAGKYKNDD TILAIDLKNE PHGKRGYTNA APTDMAKWDN
TTDENNWKYA AERCSKEILA VNPKLLIMIE GIEQYPKTEK GYTFDTPDVW GASGDAAPWH
GGWWGGNLRG VKDYPIDLGP LNSQIVYSPH DYGPSVYNQS WFDKDFTTQT LLDDYWYDTW
AYIDDQKIAP LLIGEWGGFM DGAKNQKWMT LLRDYMIKNR INHTFWCLNP NSGDTGGLIG
NDWSTWDEEK YGLLKPALWQ SGGKFIGLDH QIPLGKNGMS LGEYYGDGPI IVDPEPIVGD
VNDDGNVDAL DLAVMKQYIL GANPKISLTN SDMNSDGDIN ALDFAMLKAK VLGK
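
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon table from the amino-acid string that NCBI lists for translation table 11, which for elongation codons is the same as the standard code, and it stops at the first stop codon. The function name `translate` is hypothetical. Alternative start codons, which table 11 allows, are not handled, since this gene begins with ATG.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAAAAAAA GAATAGTTTC AATGCTTATG"))
print(translate("GTTCTTGGTA AATAA"))
```

The two fragments correspond to the first ten and last four residues of the protein sequence listed above (MKKRIVSMLM and VLGK).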