Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1239 |
Symbol | |
ID | 7312202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1526092 |
End bp | 1529139 |
Gene Length | 3048 bp |
Protein Length | 1015 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643608160 |
Product | Carbohydrate binding family 6 |
Protein accession | YP_002505575 |
Protein GI | 220928666 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000298242 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGAGAA AGAAAATTTT ATGTATATTT CTTGTGACTG TATTAATGCT AACTATATTA CCAATACCTC AACAGACGGT AATGGCTGAT ACAGGGGTAC TTAAGGAACT TAAAGGAACT GACATCTACA ACGGATTAAG GGGACTGAAT TTTAACGAGG GTTGGAAATT CAATAAGGGG GATGTAAGCA ACGGCCAGAG TACCGGGTAT AATGACAGCG GCTGGTCAGG TGTTACACTG CCACATGACT GGAGTATTTA TAACACTTTT AATAAATCCT CAGCAGCGGG TGCAGGAGGA GGTTATCTGG ATGGAGGAAT CGGATGGTAC AGGAAAACCT TTACCGTACC TTCGGATTAT ACAGGAAAGA AGGTATTCAT TGAATTCGAC GGAGCGTACA TGAACAGCCA GGTATGGATT AACGGTACTT TGCTGGGAAC CCGTCCATAT GGCTATTCAT CCTTTGAATA TGATTTAACT CCATACCTCA ATATAGGTGG AAGTAATGTT ATTGCAGTAA GGCTCAACAA TAACCAACCC ACTAGCCGCT GGTATTCAGG AAGCGGTATT TATCGGAATG TATGGCTGAC AGTATTAGAT CCTGTGCATG TGACATACTG CGGAATGTTT GTAACAACAC CCACGGTCAG CAGCAGTTCG GCCACCGCAA ACGTCAGCAC AAAGGTATTG AATCAGGGAA GTACTGCAAA ATCGGTATCA TTAAAAACTA CAATTACCGA TGCGGAGGGT AATGTCGTAG CCACAAATAC TTCTTCAGTG GCTAGTATAG CTGGCAGTGG TAGCAATACC TTCAGTCAGA ATCTGACAGT ATCAAACCCG CATTTATGGT CTCCCGCTTC ACCATACCTA TATGCTGTTC AAACCCAGGT TATTGTGGAC GGAAATGTAA CAGATACATA TAGTTCTACA TTGGGAATCA GATATTTCAG CTTTAGCTCC ACATCAGGCT TTTCACTTAA TGGTGTAAAT ATGAAGATAA ATGGAGTGTG TCTGCATCAT GATTTGGGTT CTCTTGGTGC GGCTGTTAAC TATCGTGCAA TTGAAAGAGA ACTTCAGATT ATGAAGGATA TGGGATGTAA TGCAATTCGT ACTTCACATA ACCCGCCAGA CCCGCAAATG CTGGAAATAT GTGACAGATT AGGTTTGATG GTTATGGATG AAGCATTTGA CTGCTGGGAA ACAGGAAAAA ATTCTAATGA CTACCATCTG TACTTTAACA ATTGGGCACA GACCGATCTT CAAGCAATGG TTACAAGGGA TCGTAATCAT CCGTCAATTA TCATGTACAG CATAGGCAAT GAGATTCCTT CACCAAGTGT AGCTACAGCG ACAAAGCTTA AGAATTGGGT AAAAGATGTT GATAACACAC GACCAGTCAC ATTGGGAACT TTCGCCGTAA GCATGGGTGA TGCAACTCCG CAGGCGGTCG CAAGTGTTCT TGATTTAGTG GGCTATAACT ATTTTCCATA CATGTACGAC GGAGGACATA ACAATCATCC GGAGTGGAAG ATGTTTGGAA GTGAAACGAG TTCAGCGGTC AGAAGCCGCG GGGTTTATAA AACACCCACC AATAAAAACA TCCTGACTGA CAGTGATAAC CAGTGTTCCT CTTATGATAA CAGTGTGGTC AGCTGGGGCA ACAGTGCTGA ATCTTCCTAT AATGAAATTA ATAAAAGAAA TTACATGGCG GGTGAATTCA TATGGACAGG ATTTGACTAT ATCGGTGAAC CTACACCTTA CGAATGGCCG GCAAAAAGTT CGTATTTTGG AATAGTGGAT ACATGCGGGT TCCCGAAAGA TATCTACTAT TTCTATCAAA GTAAGTGGAG TACTAAACCG ATGGTTCACA TACTCCCACA TTGGAACTGG TCTACGGGTA CTAATGTAGA GGTATGGGCA TACAGTAACT GCGATACTGT AGAACTATTC CTTAATGGGA AGTCACTTGG CTCGAAAAGC GTTGGAACAG CAGGACATCT TTCATGGAGT GTTCCATGGT CTTCAGGGAC ATTGCGGGCA AAAGGTACAA AGGGCGGCAC TGTGGTATAC GATGAAGTCG TCACAGCAGG TACTCCTTCA AAAGTCCTGT TGAAGCCTGA CAGAACTTCT GTTAAAGCAG ACGGTAAGGA TTTGATATAT ATCGAAACAG ATATTGCAGA CGGCAACAAT GTGACTGTCC CCACAGCTGA CAATACAGTG AATTTCTCCA TATCAGGTCC CGGTGTAATT GTGGGAGTTG ATAATGGAAA TCCAATAAGT ACGGAAGCAT ATAAAGGCAG CAGTCGTAAG GCTTTCAACG GTAAGTGTCT GGTAATTGTC CAGCCTACCA AAGTTAATGG AACAATTGTA GTAACGGCAA GCTCCAACGG ACTATCCTCT GGAAGTGTTT CCATCGCTTC AACAGGAGGA GCAGAAGCAC CCATCGTATC TGCCTATAAA AAGATTGAGG CCGAAAACTA CGATAATCAG TCTGGAATCC AGACAGAAGC CTGTTCGGAA GGCGGACAGG ATGTAGGATT TATTGAAAAC GGGGACTACA CTGTTTACAA CAATGTGGAT TTCGGTAGCG GTGCCGAGAG CTTTACGGCA AGAGCAGCAA GTGCTACTAG CGGAGGAAAT ATTGAGATCA GACTTGACAG TCCTAACGGG ACTTTAATTG GAACTTGTCC AGTTGCAGGA ACAGGTGATT GGCAGACTTA TGCTGATGTA AATTGCAGTG TCAGCGAGGT AAGCGGAAAA CACGACCTGT ATCTAAAATT CACCGGGGAT AGCGGATATT TATTTAATAT TAACTGGTTT ACATTTACTG CACGGAATTC AGCAAAATTG GGAGATTTGA ATTCCGATGA CCAAATAGAC GCGATGGATT TCCAGTTGAT GAAAAAGTAT CTGCTGGGAC TGGGAGAAAT TAAAGATACA AAGCTGGCCG ATTTGGATGC AAGTGGTACT ATTGATGTTT TAGATCTGAT GCTGCTCAAA CAGTACCTGC TAGGCACTAT AACTTCTTTT CCGGGGCAGG GTACTTAA
|
Protein sequence | MLRKKILCIF LVTVLMLTIL PIPQQTVMAD TGVLKELKGT DIYNGLRGLN FNEGWKFNKG DVSNGQSTGY NDSGWSGVTL PHDWSIYNTF NKSSAAGAGG GYLDGGIGWY RKTFTVPSDY TGKKVFIEFD GAYMNSQVWI NGTLLGTRPY GYSSFEYDLT PYLNIGGSNV IAVRLNNNQP TSRWYSGSGI YRNVWLTVLD PVHVTYCGMF VTTPTVSSSS ATANVSTKVL NQGSTAKSVS LKTTITDAEG NVVATNTSSV ASIAGSGSNT FSQNLTVSNP HLWSPASPYL YAVQTQVIVD GNVTDTYSST LGIRYFSFSS TSGFSLNGVN MKINGVCLHH DLGSLGAAVN YRAIERELQI MKDMGCNAIR TSHNPPDPQM LEICDRLGLM VMDEAFDCWE TGKNSNDYHL YFNNWAQTDL QAMVTRDRNH PSIIMYSIGN EIPSPSVATA TKLKNWVKDV DNTRPVTLGT FAVSMGDATP QAVASVLDLV GYNYFPYMYD GGHNNHPEWK MFGSETSSAV RSRGVYKTPT NKNILTDSDN QCSSYDNSVV SWGNSAESSY NEINKRNYMA GEFIWTGFDY IGEPTPYEWP AKSSYFGIVD TCGFPKDIYY FYQSKWSTKP MVHILPHWNW STGTNVEVWA YSNCDTVELF LNGKSLGSKS VGTAGHLSWS VPWSSGTLRA KGTKGGTVVY DEVVTAGTPS KVLLKPDRTS VKADGKDLIY IETDIADGNN VTVPTADNTV NFSISGPGVI VGVDNGNPIS TEAYKGSSRK AFNGKCLVIV QPTKVNGTIV VTASSNGLSS GSVSIASTGG AEAPIVSAYK KIEAENYDNQ SGIQTEACSE GGQDVGFIEN GDYTVYNNVD FGSGAESFTA RAASATSGGN IEIRLDSPNG TLIGTCPVAG TGDWQTYADV NCSVSEVSGK HDLYLKFTGD SGYLFNINWF TFTARNSAKL GDLNSDDQID AMDFQLMKKY LLGLGEIKDT KLADLDASGT IDVLDLMLLK QYLLGTITSF PGQGT
|
| |