Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1241 |
Symbol | |
ID | 7310037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1531759 |
End bp | 1535253 |
Gene Length | 3495 bp |
Protein Length | 1164 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643608162 |
Product | Carbohydrate binding family 6 |
Protein accession | YP_002505577 |
Protein GI | 220928668 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATTA ATAAGAAAAT CATCAGGAAA ATCGGTGCAA CTGTATTAGC GGGGGTACTG ATTGTAAGCG GGAATTTACC TTTTAACGGC TACAAATCAC TGGCTGCTTC TCAAGACACT GTTGAAGCTG GGGGTATTGC TACTCAGGAT AATGATTTGA AATTATTCTA CAACAGCATG GCAGGAAATA ACTTTTCAGG TAACCCGTAC GACCTTAACG AATCATTTTA CAAGGCGTTA CCCCTTGGAA ACGGTCGTAT CGGTGCAATG GTTTACGGGA ACTATCCTGA TGAGAGGATT GATTTGAATG AAGCTACCTT TTGGAGCAGC GGACCTGGCA ATAATAATAG AGCTGGAGCC GCAAACTTTT TAAAAACCGC ACAGGATCAA TTGTTCGCCG GACAATATAA AACCGGAAGT GCCACTATTG CCAATAATAT GATCGGTGGT GGGGAAGCCA AGTATCAATC AATTGGGGAC TTGAAACTTT CATTTGGCCA TAGTTCGGTT TCTAATTACT CAAGACAGTT GGATATGAAT ACCGGAGTTG TTTCAAGTGA TTATACATAT AACGGAAAAA AATATCACCG TGAGTCTTTT GTGAGTTATC CGGACCAGGT TATGGTAACA AAAATTACTT GTAGTTCACC TGGGTCAATA TCACTTACTG CCGGGTACGA ATCTTCACTT ACCGGTCAGT ATACTGTTTC TACCTCGGGT AATGACACCT TAGTAATGAA TGGACACGGT GATTCTGACA ATGGCATTTC ATATGCTGTA TGGTTTTCAA CTAGATCAAA AATTATTAAT TCAAATGGTT CAGTCTCAGC CAACAACAAT CAAATATCCG TTTCAAATGC AGATTCGGTG GTTATCCTGA CCTCTATCCG TACAAATTTT GTAAATTATA AAACCTGTAA TGGAGATGAA AAGGGAAAAG CCACTACGGA TATTGCTAAT GCTTCGGCAA AATCATATGA TACCTTATAC AATAACCATG TTACAGATTA TCAGAACTTA TTCAAAAGAG TGGATGTTGA CCTTGGCGGC AGCGGAAGCG AAAACGGTAA ACCTATGGGA CAACGTATAT CTGAATTTGG TACAACTAAT GACCCCAAAT TGGCAAAAGT GCTTTTCCAA TACGGAAGAT ATTTAATGAT TAGTGCTTCA CGTGATTCCC AGCCTATGAA TCTTCAGGGA ATCTGGAATA AATTCCGTAA TCCTGCATGG GGTTGCAAAA TGACGACTAA TATAAACTAC GAAATGAATT ACTGGCCTGC CTTTACCACA AATCTGGCCG AGTGCTTTGA GCCATTTGTA AAAAAGGCAA AAGAACTTCA GGCCCCCGGT AATGAAACTG CCCGTGTCCA TTATAACATA TCAAACGGAT GGGTTTTGCA CCATAACACC GACCTATGGA ACAGAACGGC TCCTATAGAC GGTGATTGGG GGTTCTGGCC AACAGGTGCC GGCTGGGTAT CCAATATGCT GTTTGATGCG TATAGCTTTA ATCAGGATAC AGTGTATTTG AACGAAATCT ATCCCGTAAT AAAGGGAGCA GCTGATTTCT TGCAAACCCT GATGCAGTCT AAAAGTATAA ATGGACAAAA TTATCAGGTT ATCTGTCCGA GTACTTCACC TGAACTTACA CCACCCGGTA CCAGCGGAGG ACAGGGTGCA TATAACAGTT ATGGAGTAAC AATGGACAAC GGAATCAGCC GTGAACTTTT CAAAGACGTA ATCCAAGCCT CCAAAATCCT TAACATTGAT TCTTCTTTTC GTTCTACTCT CGCATCCAAG GTATCCCAAA TTAAACCAAA TACTGTTGGC AGCTGGGGTC AACTGCAGGA ATGGGCTTAT GACTGGGACA GCCAATCTGA GAAAAACCGC CACATTTCCT TCGCATATGA CCTATTCCCG GGATTGGAAA TAAACAAACG GAATACACCT GCTATTGCCA GTGCAGTAAG TAAGTCACTA AATACACGTG GAGATGTTGG AACAGGTTGG TCCGAAGCTT GGAAATTAAA TTGCTGGGCC AGGCTGGAAG ACGGAGCCCA TTCATACAAT CTTGTCAAAC TGCTGATTAC ACCTGTTAGT AAAGACGGAC GACTCTACGA TAACTTATGG GACGCACATC CTCCTTTCCA GATAGACGGC AACTTCGGAT TTACCTCAGG AATAGCAGAA ATGCTTTTGC AAAGTCATAA TAACGAAATT CAACTTTTAC CGGCTTTACC AAGCCAATGG TCAACCGGGC ATGCAAATGG TCTCTGTGCC CGTGGAAATT TCACTGTTAC AAAAATGAAT TGGGCAAATG GTGTATTAAC CGATGCTACT ATCAAATCAA ATTCAGGCAA TGTTTGTAAT GTCCGTTACG GCAACAAAAC CATAAGCTTT CCGACAAAGA AAGGATACAC CTATCAGTTG AATGGTTCAT TACAGTTGGT AGAACCCGGT ACTACATTGA CAAATGTGGC TTTAAATAAG ACTGCCACTG CTTCAGGGAC AAATTTAGGT GAAGAGGCAG GAAAAGCTAT AGATGGCTCA ACTACTTCCA AGTGGTGTCA TGATAACGGC ATGAGAGGTG AATGGCTGCA GGTTGATCTT GGTGCAAAGT ATGATATCAG CCGTTGGGTT GTGAAACATG CCGGAGTAGC AGAAACAATC AGATTTAATA CTAGAGATTT TACCCTGCAA AAGAGTGATG ACGGAACTAA CTGGGCCGAT GTGGATGAGG TGTATGGAAA TCAACAAAAC ATAACCGACA GAAATGTTCC GACCTTTAAT GCAAGATATG TACGCCTATA TATCAATACA GCCACTCAGG ATAATTCCGG TGGTGCCAGA ATTAGTGAAC TTGAACTCTG GGGCAAACCC AGCGTTGATA TTCCAAAATC TGCGTTTTCA CAGATAGAAG CAGAAGAGTT CAACAGTCAA TACGGAGTAC AAGCTGAAAC CTGTAGTGAA GGTGGACAGG ATGTTGCATT TATTGAAAAT GAAGACTTTG CTGCTTATAG TAATGTTGAT TTCGGAGAAG GTGCTAAAAG CTTTCAGGCA AGGGTATCAA GTGCCACTAG TGGAGGAAAC ATTGAAATAA GACTTGACAG TATTGACGGC CCATTAGTAG GAACTTGTCC CGTTACAGGA ACAGGTGATT GGCAGAATTG GTCTGATGTA ACCTGCAATG TCAGCGGAGC AAGCGGTAAA CATGATTTAT ACCTAAAATT TACTGGGGAC AGCGGATACC TGTTCAACCT TAACTGGTTC AAATTCTCTA ATGCACCCAT TGTAACTGGA AAACCGGGTG ACATAAATAA CGACGGACAA ATAGATGCAA TAGATTTGCT ATTGTTGAAG AAATATCTTT TGGGATTAGA AACAATAGAA AATACTAAAT TAGCTGATCT GGATGCCAAT GGTGAGATAA ATGCAATTGA TTTCTCACTG CTCAAGCAAT ATTTGCTGGG AAATATAAGC GTATTTCCGG GTTAA
|
Protein sequence | MNINKKIIRK IGATVLAGVL IVSGNLPFNG YKSLAASQDT VEAGGIATQD NDLKLFYNSM AGNNFSGNPY DLNESFYKAL PLGNGRIGAM VYGNYPDERI DLNEATFWSS GPGNNNRAGA ANFLKTAQDQ LFAGQYKTGS ATIANNMIGG GEAKYQSIGD LKLSFGHSSV SNYSRQLDMN TGVVSSDYTY NGKKYHRESF VSYPDQVMVT KITCSSPGSI SLTAGYESSL TGQYTVSTSG NDTLVMNGHG DSDNGISYAV WFSTRSKIIN SNGSVSANNN QISVSNADSV VILTSIRTNF VNYKTCNGDE KGKATTDIAN ASAKSYDTLY NNHVTDYQNL FKRVDVDLGG SGSENGKPMG QRISEFGTTN DPKLAKVLFQ YGRYLMISAS RDSQPMNLQG IWNKFRNPAW GCKMTTNINY EMNYWPAFTT NLAECFEPFV KKAKELQAPG NETARVHYNI SNGWVLHHNT DLWNRTAPID GDWGFWPTGA GWVSNMLFDA YSFNQDTVYL NEIYPVIKGA ADFLQTLMQS KSINGQNYQV ICPSTSPELT PPGTSGGQGA YNSYGVTMDN GISRELFKDV IQASKILNID SSFRSTLASK VSQIKPNTVG SWGQLQEWAY DWDSQSEKNR HISFAYDLFP GLEINKRNTP AIASAVSKSL NTRGDVGTGW SEAWKLNCWA RLEDGAHSYN LVKLLITPVS KDGRLYDNLW DAHPPFQIDG NFGFTSGIAE MLLQSHNNEI QLLPALPSQW STGHANGLCA RGNFTVTKMN WANGVLTDAT IKSNSGNVCN VRYGNKTISF PTKKGYTYQL NGSLQLVEPG TTLTNVALNK TATASGTNLG EEAGKAIDGS TTSKWCHDNG MRGEWLQVDL GAKYDISRWV VKHAGVAETI RFNTRDFTLQ KSDDGTNWAD VDEVYGNQQN ITDRNVPTFN ARYVRLYINT ATQDNSGGAR ISELELWGKP SVDIPKSAFS QIEAEEFNSQ YGVQAETCSE GGQDVAFIEN EDFAAYSNVD FGEGAKSFQA RVSSATSGGN IEIRLDSIDG PLVGTCPVTG TGDWQNWSDV TCNVSGASGK HDLYLKFTGD SGYLFNLNWF KFSNAPIVTG KPGDINNDGQ IDAIDLLLLK KYLLGLETIE NTKLADLDAN GEINAIDFSL LKQYLLGNIS VFPG
|
| |