Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0649 |
Symbol | |
ID | 7309514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 750568 |
End bp | 752166 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643607590 |
Product | cellulosome protein dockerin type I |
Protein accession | YP_002505010 |
Protein GI | 220928101 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5520] O-Glycosyl hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TCATTCGCTT ATTAGGTCTA ACTATGGTTT TGATGCTTGT ATTTACAATG GTATTACCAT TAAATCTTTA TGCAGCATCA ACTGTTACCG TGGATTGGGG TACCAATTAT CAAACAATTG ATGGTTTTGG TGTTTCAGAA GCTTTTCATC AGTCAAATAA TATTGCTTTA TTAGGAGATA CCAAGAAAAA GGAAATTTAT GACTTACTAT TTTCAACTAC AAAGGGGGCA GGGTTTTCAA TATTCCGTTC TATACTTGGA GACGGAGGAA CATGGGGGAA TGCAACTGAC GGACCAAATA AGACAATGCA GCCTTCTGAG ACAACTTGGG ACTGGAAAGA ATCAAATGAT GACCAGATAT CTATGATTAG AGAGATACAG TCCGGCTACG GAATCAATAA AATTCTTTAC ACTGTATGGA GTCCGCCTGC ATGGATGAAA TCAAACGGGT CAACTTCAAG AGGATATCTA AAGACCGATA AATATCAAGC ATATGCAACA TATTTAGCAG AGCATATAAA AAACTACAAA TCAAAATTTG GAATTGATAT TACTCATATA GGGATTTCAA ATGAGCCTAA CCTTGAAACA GACTATTCTT CATGTACATG GACAGCAGCT CAATTCAAAA CCTTTATGAA GGATTATCTG GTACCAACTT TTGATAAAGA AGGTATTACT GCAAAAGTTA TTATGGGAGA ACCAATGTCA TGTACCGAAT CATTTGCAAT TGACTGTTTG AATGATGCCA CAGCATTGAC AAGAACAGAT ATTGTAGGTT GTCACAATTA TGGATCATCA TACACAACTT TTCCAACCAC TAAGGCAAAG GGAAAAGGAA TATGGCAGAC AGAAATATCA GACATGAATG GAAACGATAC TACAATAACT GATGGTTTAA AGTGGTCAAA ACAAATCTTT GATTTTATGA CAATAACTCA GGGAAATGCA TGGAATTACT GGTGGGGTGC GTGCTATAAA ACATATAATG GAGAAGGTCT CATACAAATG GACATGAATT CAAAGACCTA TAAAGTTGCT AAAAGACTCT ATACTGTTGG ACAATATTCA AGATTTATCA GACCGGGATG GCAGAGATTC GCTGCTACTT CGAACCCTGT GTCCAATGTA TATGTTACCG CATATAAGGA TCCCGCTACA GGAAAATTTG CAATTGTTGC TATGAATGAC GGTTATACAA ATCAATCAAT TACATATACA TTGAAAGGAT TTACTCCTGA CTCGGTTACT CCATACACAA CTTCATCAAC CCAAGATTTG GCTGAAGGTA CAAAAATAAC TGTAAGCGGA GGTAGCTTTA CAGCTAATCT GGCAGCAAAT TCTATAACAA CATTTGTTGG CGGAAGTGAT GTAAATCCCG GTATCTATGG TGATGTCAAC GGCGACAAAG TTGTTGATGC CATTGACTTT GCACTTTACA AGCAGTATCT CATAAAGCAG ATTAGCACCT TCCCGTCACC TGACGGAATG AAGCTTGCTG ATGTAAACGG TGATAACAGT GTTGATGCAA TTGATTTTGC ATTAATCAAG AAATACTTGC TTGGTTCAAT AACTAAACTT CCGGTTTAA
|
Protein sequence | MKKIIRLLGL TMVLMLVFTM VLPLNLYAAS TVTVDWGTNY QTIDGFGVSE AFHQSNNIAL LGDTKKKEIY DLLFSTTKGA GFSIFRSILG DGGTWGNATD GPNKTMQPSE TTWDWKESND DQISMIREIQ SGYGINKILY TVWSPPAWMK SNGSTSRGYL KTDKYQAYAT YLAEHIKNYK SKFGIDITHI GISNEPNLET DYSSCTWTAA QFKTFMKDYL VPTFDKEGIT AKVIMGEPMS CTESFAIDCL NDATALTRTD IVGCHNYGSS YTTFPTTKAK GKGIWQTEIS DMNGNDTTIT DGLKWSKQIF DFMTITQGNA WNYWWGACYK TYNGEGLIQM DMNSKTYKVA KRLYTVGQYS RFIRPGWQRF AATSNPVSNV YVTAYKDPAT GKFAIVAMND GYTNQSITYT LKGFTPDSVT PYTTSSTQDL AEGTKITVSG GSFTANLAAN SITTFVGGSD VNPGIYGDVN GDKVVDAIDF ALYKQYLIKQ ISTFPSPDGM KLADVNGDNS VDAIDFALIK KYLLGSITKL PV
|
| |