Gene Information |
Locus tag | Ccel_1298 |
Symbol | |
ID | 7310086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1602832 |
End bp | 1604265 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643608218 |
Product | glycoside hydrolase family 8 |
Protein accession | YP_002505633 |
Protein GI | 220928724 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
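The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon. Neither convention is stated in the record itself, and the function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 1602832
END_BP = 1604265
GENE_LENGTH_BP = 1434
PROTEIN_LENGTH_AA = 477


def span_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed values: the span works out to 1434 bp, and 1434 / 3 - 1 gives 477 residues.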
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000471564 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGT TTTTTGCTTG CTTGCTTGTT TTGGCTATAA TTGTTACGAT TGTACCAATA AATATAGCAA GTGCAGAGAC AGGGGGATAT TACGCCACCG GGAACTACAG GAACGTCTTC GTTGAAACAG GAAGGACAGA GGCTCAGGTA CAGGATAAAA TGAATACTAT GTTTGACAAG TTCTTCAAAG GAGACACAAA CAATCAGAGG CTATTTTATG AGACCGGAAC CGACGAGGCA TACATAAGAG ATACCGGAAA CGGTGATGTA CGTTCTGAAG GTATGTCCTA CGGTATGATG GTTTGTGTGC AAATGGACAA AAAGAAGGAA TTTGATATGC TGTGGAAATG GGCTAGAAAC CACATGTATC AAACCTCTGG TCAATTTAAG GGGTTTTTTG CATGGCAGTG TAACTATGAT GGTGGTATAA TAGACAGCAC TCCTGCTTCT GACGGTGATG AATATTTTGC AATGGCTTTG CTCTTTGCTG CACGCAGATG GGGAAATGGA ACAGGTATAT ACAACTATGA AGCAGAAGCA CAGACTATAT TGGATGCAAT GCTTCACCAG TCAGACGATG GTGTTGGATA TAATATGATC AATAAAAACG CAAATCAAGT TGTTTTCTGC CCAAGTGCAG GAAATTATGA TTTTACAGAT CCGTCCTATC ATCTCCCGGC ATTTTATGAA TTATGGGCAA TGTGGGGACC TGAAAGAGAC AGAGCTACTT GGAGTAAAGT AGCTGCTACT AGCAGGGAAT TCTTAAAAAA GTCTACTCAT CCGACAACAG GACTTAACCC AGACTATGCA AATTTCGATG GTTCAGCAAA AGAAGTTTCA TGGAGTTCAG GTCATGGTGA TTTCAGATTT GACGCATGGA GAGTTATTCA AAATTCTTGT GTAGACTATG CTTGGTGGCA AAAAGACAGC TGGCCTGCAA CTACATTTGC ACCTAAAATT CAGGCTTTCT TTAAAAACCA AGGGTTATCC ACCTACGGAA ACCAGTATAC ATTATCCGGT TCAAAGCTGA GCAGTGATCA TTCTCCGGGA TTGGTAGCTA TGAATGCCGT ATCAGCTCTG GCATCAGATG CAACAACTGC TAAGCCTTTT GTTGATGAAT TATGGAATAC TGCTGTTCCA TCAGGTCAGT ACCGTTATTA TGACGGAATG CTTTATATGC TTGGAATGCT TAACGTAAGC GGAAACTTCA AGATATGGGG TGCACCTACA GAGCCTTCGG TAACACGTGG TGATATTAAC GATGACGGCA CTATAGATTC TGTTGACTTT GCATTGCTGA AGTCTTACCT GCTGGGTAGG ACAACTACAT TGCCTAATAT GAAAGCAGCA GACTTAAATG GCGATGGTGT AGTGGATGCA ATGGATTGGG CTGTACTTAG GCAGTATCTT TTAGGTATAA TAAAAACTCT ATAA
|
Protein sequence | MKKFFACLLV LAIIVTIVPI NIASAETGGY YATGNYRNVF VETGRTEAQV QDKMNTMFDK FFKGDTNNQR LFYETGTDEA YIRDTGNGDV RSEGMSYGMM VCVQMDKKKE FDMLWKWARN HMYQTSGQFK GFFAWQCNYD GGIIDSTPAS DGDEYFAMAL LFAARRWGNG TGIYNYEAEA QTILDAMLHQ SDDGVGYNMI NKNANQVVFC PSAGNYDFTD PSYHLPAFYE LWAMWGPERD RATWSKVAAT SREFLKKSTH PTTGLNPDYA NFDGSAKEVS WSSGHGDFRF DAWRVIQNSC VDYAWWQKDS WPATTFAPKI QAFFKNQGLS TYGNQYTLSG SKLSSDHSPG LVAMNAVSAL ASDATTAKPF VDELWNTAVP SGQYRYYDGM LYMLGMLNVS GNFKIWGAPT EPSVTRGDIN DDGTIDSVDF ALLKSYLLGR TTTLPNMKAA DLNGDGVVDA MDWAVLRQYL LGIIKTL
|
| |
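The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned, scored for GC fraction, and translated is given below. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in which codons may initiate translation, and that is not modelled here. Only the first three blocks of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons enumerated in TCAG order (standard assignments,
# shared by translation table 11). Alternative start codons are NOT handled.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the Gene sequence field above.
    prefix = clean("ATGAAAAAGT TTTTTGCTTG CTTGCTTGTT")
    print(translate(prefix))
    print(gc_fraction(prefix))
```

The translated prefix can be compared by eye with the first block of the Protein sequence field. Running the same functions on the full gene sequence would be the natural way to check the listed GC content and protein length.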