Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0740 |
Symbol | |
ID | 7309593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 862504 |
End bp | 864108 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643607678 |
Product | glycoside hydrolase family 5 |
Protein accession | YP_002505098 |
Protein GI | 220928189 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.209978 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA GAATAGTTTC AATGCTTATG GCATTGGCGA TAACTTCAAC TATGATATTG TCAAGCCATG GTCTTACTGC TGCAGCAGCA GTAGATACAA ATAATGATGA CTGGCTTCAT TGTGTAGGTG ACAAGATATA TGATATGAAC GGGCGTGAAG TTTGGCTTAC TGGTGCAAAC TGGTTTGGTT TTAACTGCAG CGAGAATGTA TTCCATGGTG CATGGTACGA TGTTAAGAAT ATATTGACAA GCGTGGCAGA CAGAGGAATA GGGTTACTGA GAGTACCAAT TTCAACCGAG CTTTTATACA GCTGGATGAC TGGTAAACCA AACAAGGTTT CCAGTGTTAC AGCAAGTAAC AATCCGCCGT ATACGGTAGT AAACCCTGAT TTTTATGATC CTGCAACTGA TGGACCTAAA AACAGTATGG AAATATTCGA TATAATAATG AAGTATTGCA AAGAGTTGGG AATCAAGGTA ATGATTGATG TTCACAGCCC TGATGCCAAT AACTCCGGTC ACATGTATCC GTTATGGTAT GGTCTTGAAA CGACAACAGC AGGTATGATT ACTACAGACA AGTGGATAGA CACATTGACA TGGCTTGCAG GCAAATATAA GAACGATGAT ACAATACTTG CCATAGACCT GAAAAATGAG CCTCATGGAA AAAGAGGATA TACTAATGCG GCACCAACTG ATATGGCAAA ATGGGATAAC ACTACAGATG AAAATAACTG GAAATATGCA GCTGAAAGGT GTTCAAAAGA AATATTGGCT GTAAATCCAA AATTGTTAAT AATGATTGAA GGTATTGAGC AATACCCAAA AACTGAAAAG GGCTATACAT TCGACACTCC TGATGTCTGG GGTGCTTCCG GTGACGCTGC TCCATGGCAT GGCGGATGGT GGGGCGGAAA TCTGAGAGGT GTGAAAGATT ACCCTATTGA TCTAGGCCCA TTAAACAGTC AGATAGTATA TTCACCACAT GATTACGGTC CTTCTGTATA TAATCAATCA TGGTTTGACA AGGATTTCAC AACACAAACC CTTCTTGATG ATTACTGGTA TGACACATGG GCATACATAG ACGATCAAAA AATTGCTCCT CTTCTTATAG GTGAATGGGG AGGATTCATG GATGGTGCAA AGAACCAGAA GTGGATGACC TTATTAAGAG ATTACATGAT TAAGAACCGT ATCAACCATA CCTTCTGGTG CTTAAATCCT AACTCAGGTG ATACAGGCGG ATTGATAGGA AACGACTGGT CAACATGGGA TGAAGAAAAG TATGGATTAC TTAAGCCTGC ATTGTGGCAG TCAGGCGGTA AGTTTATCGG ACTTGACCAT CAGATTCCTC TCGGTAAAAA CGGAATGTCT CTAGGAGAGT ACTACGGAGA TGGACCAATT ATCGTAGACC CTGAGCCGAT TGTCGGAGAT GTAAACGATG ATGGAAATGT TGATGCATTG GATTTAGCTG TAATGAAGCA GTATATCCTA GGTGCAAACC CCAAAATAAG CTTAACGAAT TCAGATATGA ATTCAGACGG AGACATAAAT GCCCTTGACT TTGCAATGTT AAAGGCAAAG GTTCTTGGTA AATAA
|
Protein sequence | MKKRIVSMLM ALAITSTMIL SSHGLTAAAA VDTNNDDWLH CVGDKIYDMN GREVWLTGAN WFGFNCSENV FHGAWYDVKN ILTSVADRGI GLLRVPISTE LLYSWMTGKP NKVSSVTASN NPPYTVVNPD FYDPATDGPK NSMEIFDIIM KYCKELGIKV MIDVHSPDAN NSGHMYPLWY GLETTTAGMI TTDKWIDTLT WLAGKYKNDD TILAIDLKNE PHGKRGYTNA APTDMAKWDN TTDENNWKYA AERCSKEILA VNPKLLIMIE GIEQYPKTEK GYTFDTPDVW GASGDAAPWH GGWWGGNLRG VKDYPIDLGP LNSQIVYSPH DYGPSVYNQS WFDKDFTTQT LLDDYWYDTW AYIDDQKIAP LLIGEWGGFM DGAKNQKWMT LLRDYMIKNR INHTFWCLNP NSGDTGGLIG NDWSTWDEEK YGLLKPALWQ SGGKFIGLDH QIPLGKNGMS LGEYYGDGPI IVDPEPIVGD VNDDGNVDAL DLAVMKQYIL GANPKISLTN SDMNSDGDIN ALDFAMLKAK VLGK
|
| |