Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0417 |
Symbol | |
ID | 7309299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 475628 |
End bp | 478174 |
Gene Length | 2547 bp |
Protein Length | 848 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643607349 |
Product | cellulosome protein dockerin type I |
Protein accession | YP_002504781 |
Protein GI | 220927872 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000135998 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTACT TTAAAAAGGT ATGCATTACT CTGGCTGCTT TTCTTATCAG TACCGGTTTT GCTGTCGGTA TAGGGACGCA ATCTGTAAAC GCAGCTTCAA CAACTATTCC ATGCAGACAG CTTGAACAGC TGGATAGGGG AATTGTAGCT GTTAATCAGG GAAATGGGAA GGTTTATGTA AGCTGGCGTC TTCTTGGAAC AGAACCCAGT AATACTGCAT TCAATCTTTA TCGAAAAACT GCCGGTGGCA CTGAAGTTAA ACTCAATAGT TCACCCATTA CATCCTGTAC AAATTATGTT GATAATGGTG TTGACACAAC AAAGGATAAC ACTTATACAG TAAGAACCGT TTTAAATGGA ATTGAGAATA ATGAAAGTGA ACAATACACT TTGGCTGCAA ATACTTCTGT CAGACAATAT ATCCCTATAA AATTAAAACC TCTTCCCACC GGATATTACA CAATGCATGT AAATGTGGGT GACCTTGATG GGGACGGAAA ATATGACTAT ATAGTAAAGC GTATGAATGA TGACAGGTCT CCTGTGCAGG TAGAGGCCTA TAAATCAGAC GGAACATTTC TATGGCGTAT TGATTTAGGG CCAAATATCG AAACATATAA CACAGCTATG ACTTCACCTT TGGTTGTATC GGATTTAAAC AGTGACGGAA AGGCGGAAGT TCTTATAAAA ACAGGTGAAA GTACCCGGTT TGGTGACGGT ACCTTAATCG GCGACACAAA TAATGACGGA ATAACTAACT ATTGTAATAC ATCTTCAACT AGTTACCAGG TTCTTTCAGG CCCCGAATTT ATTTCTGTAA TTGATGGTAT GACAGGTAAA GAGCTAAGTA GGGCTGATTT TATTGCAAGA GGACAAGTTA CGGACTGGGG AGATAACTAT GGGAACCGTG CAAGCTTTAT TTTTATGACA GTTGCATATC TGGACGGTAT TCACCCAAGT GTAGTAATGT CAAGAGGCCC CGGAAATGTT ATGAAAGTTG AAGCATGGGA TTTCAAAGAT GGAAAGCTAA GTCAACGGTG GAAATGGGAT GCTAGAAATC AAGTATTACC TTCTGGGAAG AATTTCCCTG ATTTTCATGC AATCCGTGCC GTAGATGTGG ATAAGGATGG AAAAGATGAA ATTTCATGGG GAGGCTCCAT GCTCAATGAC GACGGTAAAT TACTGTACGC TACGGAGCTG ACACATGGAG ATCGGTTTGT AATAGGGGAT ATTGATCCCG ACCGGGATGG TCTTGAATGT TACGCAATAC AGCAGAACAA CCCGAGTCTT CTGGGTGCGG CCTTATATGA TGCAGGTAAT GGTACAATGA TTAAAAAAAT GTATATGAGT GCAGTTGGCG ATGTAGGAAG AGGGGATTGT GCTGATATTG ATCCTAATTA CAGGGGGATG GAATGTTGGT CTACACTTGA GAATTTATAC AACTGCAAGG GAGGGGTTAT CGGTTCTGAA AAGTCTTTTC CGTTCTTGAG TATATGGTGG GATGGAGATT TGCTCAGAGA ATTTTTCATA GGTGTTGACT CAAATGGTTT TAATGCGGCC ATCAACAAAT GGAATTATAC TACAAAAACC AGCAACCGGT TATATTCTGT TTATCAGGAG GGGGTCAAGT CAACATATGC CGGGAGACCG CCTTTCTATG GTGACATTAT GGGCGACTGG CGTGAGGAAG TTATTCTTGA AACTACTGAT AATACGGAGC TTCGTATATA CACTACAAAC ATTCCCACCA ACTATAGGAT ATATACCCTG ATGCATAACC CGGCATACAG AAATTCCGTA GATGTTAAGG GATATCTGTC TTCTGTTTAT CCTGATTATT ATCTGGGTGA AGGTATGTCA ACGCCACCGA CCCCCAATAT TTATACAGGG GATGGAGAAT TAATAAAATC ACTAAGGGTA AATGATTCCA ACAATGCAGC TGACTGGTGC ATACAATCAA ATTTACAGGT TGGTGATACA GTTTACGGAG ACAGAACATA TAAATATACA AAAATTCCTC AAAGTCTTAC AGGTACAGAA TGGATAAGGA CTGCTTGTGA TTCTAAGAGC TATTTAGATG AGGAAGCGAA TTTTACGGCA AAAAAAGACA TATCGGTTTA TATTGGTTTG GATTCAAGAA TTACCAGTAT TCCGGCATGG CTGAGCGATT GGACGAAAAC AGGTGAAACA TTAAGTGATG ATAGCTCAAT TACCTTCAAC CTATACAAGA AAGATTTTAC TTCCGGCTCT GTTGTGAGGC TTGGAACTAA TGGGGGTTCT TCCAGTTTTG TGAATTATAC GGTTATTGTT AAGCCAAACA CGGCACCTGC TTTTCTTTAC GGTGATGTAA ATGGAGACGG TATTGTGGAT GTTCTGGATT ACAGTACAAT GCGAAGTTAT CTTCTGCAGA TAACGAACTC AATGCCATCT GCATATTGGC AGATGGCCGG AGATTTGAAT TCCGACGGGG CTATCGACAG TATGGATTAT TTATACCTGA AAATGTACTT GTTGGGGACA ATTAACTCTC TACCAGTTTC TCCATAA
|
Protein sequence | MKYFKKVCIT LAAFLISTGF AVGIGTQSVN AASTTIPCRQ LEQLDRGIVA VNQGNGKVYV SWRLLGTEPS NTAFNLYRKT AGGTEVKLNS SPITSCTNYV DNGVDTTKDN TYTVRTVLNG IENNESEQYT LAANTSVRQY IPIKLKPLPT GYYTMHVNVG DLDGDGKYDY IVKRMNDDRS PVQVEAYKSD GTFLWRIDLG PNIETYNTAM TSPLVVSDLN SDGKAEVLIK TGESTRFGDG TLIGDTNNDG ITNYCNTSST SYQVLSGPEF ISVIDGMTGK ELSRADFIAR GQVTDWGDNY GNRASFIFMT VAYLDGIHPS VVMSRGPGNV MKVEAWDFKD GKLSQRWKWD ARNQVLPSGK NFPDFHAIRA VDVDKDGKDE ISWGGSMLND DGKLLYATEL THGDRFVIGD IDPDRDGLEC YAIQQNNPSL LGAALYDAGN GTMIKKMYMS AVGDVGRGDC ADIDPNYRGM ECWSTLENLY NCKGGVIGSE KSFPFLSIWW DGDLLREFFI GVDSNGFNAA INKWNYTTKT SNRLYSVYQE GVKSTYAGRP PFYGDIMGDW REEVILETTD NTELRIYTTN IPTNYRIYTL MHNPAYRNSV DVKGYLSSVY PDYYLGEGMS TPPTPNIYTG DGELIKSLRV NDSNNAADWC IQSNLQVGDT VYGDRTYKYT KIPQSLTGTE WIRTACDSKS YLDEEANFTA KKDISVYIGL DSRITSIPAW LSDWTKTGET LSDDSSITFN LYKKDFTSGS VVRLGTNGGS SSFVNYTVIV KPNTAPAFLY GDVNGDGIVD VLDYSTMRSY LLQITNSMPS AYWQMAGDLN SDGAIDSMDY LYLKMYLLGT INSLPVSP
|
| |