Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2442 |
Symbol | |
ID | 7312363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2943945 |
End bp | 2946557 |
Gene Length | 2613 bp |
Protein Length | 870 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643609372 |
Product | cellulosome protein dockerin type I |
Protein accession | YP_002506751 |
Protein GI | 220929842 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000150479 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA GGCTAAAGAA AGCAAGTATA GTACTGGCTC TTGCAGTTCT AGTCCAGTGT ATGATGTTCA ATCTTGGGTT TGAAACAATT CAAGCCGGAG CGGTTGGTAT TATTAATAAA GATTACTGGA ATTACAGGAA TATAGGTAAT GCAAATGAAT ACTCTACCGC TGATGTTTTT ACGGATAAAG TAATTGTTGC AGGCAGCGGG TCTGGAGTGT TTAATACTGA GGACTCCTTC ACTTATTCAT ACATACCTGT AAATGGTGAC TGCACAATTC AGGCAAGAAT CGTCTCAGAA AGTAGTACGG ATGCTTTGGC TAAAGCCGGA CTGATGATAA GAGAGAGTCT GGATACCGAT AGTAAGAATG CATTTATTGC ATTATCTAAA TCAAATCAGA TCCAATACCA GTACAGGGCT ATGACAGGAG GTGCCACCGC ATCTGATGCC AGTATCTCCG GCAATGCCCC GGTTTATTTG AAGTTAACAA GGGTGGGAGA CAGTTTTGAA GCATTTATGT CAACAGATGG GACCAATTGG ACAAAAACAG GCAATACTCA GACAATAGCT ATGGGTTCAA AAGTATATTT AGGTTTTGCT TCAACGTCAA CAGACCCCAA CAAGCTGTGT ACGGCCAGGT TTGAAAATAT CGATATTGAA TATACAGATA ATACCCCACC GCTGGCACCG ACCAATCTAA GGGTGGTTTA TGAATCCCAG CCTAGTTGCC AACTAGCTTG GGATGAAGCC TCTGATGATT CAGGAGCAGT GTTGTACGAG ATTTACTCAA ATGGTAGTCT AAAACGAATT ACACACGATT GTAAATCCAT CTGCCCGAAT ATTGATTTTA AAAATACCGT TGATCTCTGT GTGGTTGCAG TCGATGCCAA GGGAAATAGG TCGCCGGAAA ATAGCACTAT AAAAATAGTG TCTCAGAATG CCTTAATATC ATCGGCTGAT GTTACTAATA TCCGATTGAA TTCCATTGGT TTAGAACGTA TGAATACTAA ACGCCAGCAG CAAAACAAGC CATTGGTGGA AGCTGATCCT GTTCAAGTGG GAGAGGAAAT TATGACAGAC AGTACTCCTA ACAATGTTAT TGTTCAGGGA AATTCGGTAG ATTTGAATAC AATATACGCC GAGTCGCTTC CTTCTTCTGT GGACAACAGT ACACTGCAGT GTTTCCCCAG AATAGACAAT CAGATATACG GCGACTGCGT TATCTGGTCT ACTGGGTATT ATACGATGAC ACATATGGTA GGGCTCGCCA AGGTAAATGC GGGAGGACAA TGGGATGCCA AAAACGATAC TACAGGAAGC AAGGTCTTTT CTCCTAAGTT TGCTTTTAGC GTAGGTAATG CACCTAGTAG CACCGGACTT ATGACCGGAG TATATAAAAC GTATTTGGAT TCAGGCTGTG CAACATTGGC TGATGCACCG TATATAAATG ATGGTGTGGA CGGATTCAAA CTGAGTACCG ATTTAGATTC ATGGGAAAAT GCAATAAACT ACAGAATGGA TAAGTACGGA TATATAGATG CAAATGAAAA CAGTTTAGAG CGTATAAAAC AGCTACTGAA CAATGGCTAT GTGATGTCCT TTGATACAGG TACATACAAT TTCATGCAAT ATCCGAATGT AGTTTTGGAT AACCCTGACC CAGCGGTAAA TGATGATTAC GCCGTTGGAA AGCACTTCTT TTATATGGTT GACGGAGTTA ACTCGGGACA TCAAATGACC CTTGTGGGGT ACGATGACAA TATTTGGTGG GGTGATGTAA ATGGTGATGG AATCCCTCAG CCTGAAGAGA AAGGTCTTTT TAAAATAGCC AACAGTTGGG GATCAAATTA CGGTTATGAT GGCTTTGTGT ATGTTAACTA TGACAGTATA TATAGAAATT CTCAATTCAG CCAGTTTAAC AGCTCCACAA GAATGCCGAT TTTTAAGGAT GTATTGGAGT GGATGACTCC ACGAAGAGAT TACGTACCCC AGCTTATTGC GGAGTTTACG GTGAGCCATG CAAAAGCAGA TCAGCTTAGA ATTGCCGTTG GTTATTCTGA TATGGACAAA AACATGCCTG AAGCATATTT CTTTCCCGGT AGCTTAAATT ATCTCAGTCA TACAGAGCCT TTCGACTTTA ATGTCGATGG CACTGCCTGT GATGGCAACT TTGCGGTTGA CATGACGGAC TTCATCACGA AGTTCAATTT GGACAAGAGT AAAAGATACA AGTGGTATTT AATGGTGGGA GACAATGAGG AAGATGGCTC TCCTGTCACA TTAAAAAGCT TCAGGGTTCA CGACAAAATT AATAATAAAT ATTCAACTTA CAGAGGTCCT GAACTTCAGA ATGACGGGGA CAACAGCTAT GTCAGTGTAG ATTACAGCTG GGCACTTGTA GGGGATGTAG ACGGAAACGG TATTATTGAT GATGCTGATC AATTATTAAT TGTAGATTAT AGTCTAGGCT ATATTAATGA TTTCCCAGTT GAAGATGACA TGTGGGCTGC AGATGTTAAT GGCGACGGTA TCATAAATAT GATTGATTCT GCTTTCATTA GAAAGTATAT CCTTGGGCAG ATAAATATTT TTCCTAAGCA GCAGCTAAAT TAA
|
Protein sequence | MKKRLKKASI VLALAVLVQC MMFNLGFETI QAGAVGIINK DYWNYRNIGN ANEYSTADVF TDKVIVAGSG SGVFNTEDSF TYSYIPVNGD CTIQARIVSE SSTDALAKAG LMIRESLDTD SKNAFIALSK SNQIQYQYRA MTGGATASDA SISGNAPVYL KLTRVGDSFE AFMSTDGTNW TKTGNTQTIA MGSKVYLGFA STSTDPNKLC TARFENIDIE YTDNTPPLAP TNLRVVYESQ PSCQLAWDEA SDDSGAVLYE IYSNGSLKRI THDCKSICPN IDFKNTVDLC VVAVDAKGNR SPENSTIKIV SQNALISSAD VTNIRLNSIG LERMNTKRQQ QNKPLVEADP VQVGEEIMTD STPNNVIVQG NSVDLNTIYA ESLPSSVDNS TLQCFPRIDN QIYGDCVIWS TGYYTMTHMV GLAKVNAGGQ WDAKNDTTGS KVFSPKFAFS VGNAPSSTGL MTGVYKTYLD SGCATLADAP YINDGVDGFK LSTDLDSWEN AINYRMDKYG YIDANENSLE RIKQLLNNGY VMSFDTGTYN FMQYPNVVLD NPDPAVNDDY AVGKHFFYMV DGVNSGHQMT LVGYDDNIWW GDVNGDGIPQ PEEKGLFKIA NSWGSNYGYD GFVYVNYDSI YRNSQFSQFN SSTRMPIFKD VLEWMTPRRD YVPQLIAEFT VSHAKADQLR IAVGYSDMDK NMPEAYFFPG SLNYLSHTEP FDFNVDGTAC DGNFAVDMTD FITKFNLDKS KRYKWYLMVG DNEEDGSPVT LKSFRVHDKI NNKYSTYRGP ELQNDGDNSY VSVDYSWALV GDVDGNGIID DADQLLIVDY SLGYINDFPV EDDMWAADVN GDGIINMIDS AFIRKYILGQ INIFPKQQLN
|
| |