Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0992 |
Symbol | |
ID | 7309822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1222003 |
End bp | 1225173 |
Gene Length | 3171 bp |
Protein Length | 1056 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643607919 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_002505334 |
Protein GI | 220928425 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAAAA GGTCTTTCCC GCTGGACGGA GATTGGTATG TAGCTCTGGA TGAAAAGGAG ATCGGAATAA AAGAGAAATG GTTTGAAAGT ACATTTACAG AAAAGGTTAA ATTACCCGGT ACCCTTGATG AAAATGGCAT TGGCAGTCTG GTAACAGGTA CTGACACCTT TAGATTGAAC CGTATACGCA AATATGTAGG GGCAGCATGG TATAACAGAT TGGTAGTGCT ACCTGACGAC ATAGTAGAAA AGCATGTCAC ATTGTTTTTG GAAAGATGCA TGTGGAAAAC AGAACTTTGG ATAAATGGCA ATTATGCCGG CTCTCAAAAT AGCTTAAGCA CCCCTCACAA GTATAAGCTT GACGGAATGC TTAAAGCGGG TGAAAACCTT ATCAGCATCA AGGTGGATAA TTCACCTATC TATAATCTGG GAGTCATGAG CCACGGATAT TCGGAAGAGG TACAAACCGT TTGGAACGGC ATAGTGGGAA GAATAGAACT GGATATTAGG GATAAAGTTT ATGTGGAGCG AGCAAATGTG TATTCTGATA TGGAAAGCAG TAAGCTATCA ACGCGCCTGT TGCTGGTTAA TACTGCTTCA AAAGCAGTCG AAGGCTTGAT TAGTCTTTCG GTCCGTCCTC AAGATGGAGA AAATGTAATT ATAAGGGATT CATATTATTT TAGTATAGGA GCAAATTCAT CAGTTAACGT TGAGTTGACT CAGGCGTATG AGGACAAGCT AGGCTTGTGG GATGAGTTTA ATCAAAATTT GTATAGAATG GATATAAATT TGGAATGCCA GTGTGAAAAA GAGAATGAAA TGTATTCTGA TTCAAGCCAA ATACTTTTCG GTATTCGGAG CTTTAAGGCA GAAGGTCCTG TGTTTAGGCT TAACGGCAAA AAAATATTCC TAAGGGGGAC TCATGATGCC GGTAATTTTC CGATTACAGG GTATCCGTCA ATGGAGGTTG AAGACTGGAA ACGTATATAT GCAATAGCAA AATCCTATGG AATCAACCAC TTCAGATTTC ACTCATGGTG TCCCGGCGAA GCTGCATTTG CTGCCGCTGA TGAAGAGGGA ATGATTTTAC AGGCAGAATT GCCCTTATTT GGTTTTACTG CTCCGCCGCT GGGACAGGAT GAACCTAGAG ACAGCTTCCT GAGAGAAGAG CTGCTTAGAA TTCTGGAAGA GTATGGGAAC CACCCGTCAT TTTGCATGAT GTGCATGGGA AACGAACTTC GCGGAGACTA TGAGATGCTC TCTGAATTTG TTGAAATGGG CAGGAAAACG GACGGAAGGC ACCTTTATTC GTCAGCCGCA AACAATGCGG CAGAACCAGG ACTTGGGATC AAACCTAATA AAGGTGATCA GTACTATGTT GCTCATGAGG CAAGGATAAA CGGTGAAAGG GTTAATAGAC GCTGTGAGAA TGTTTTCAAT AATGAAAGGC CTGAAACTAT ATCCGACTAC AGTGAGACAC TGAAAGGAAT AGAGGTTCCG ACCATATCTC ATGAAGTAGG CCAGTGGGAG GTTTATCCCG ACTTTGAAGA AATAAAAAAG TACACTGGTG TTTTAAAGGC AAAAAATTTT GAGGAGTTTA AAGGATCGGC AGCGGCAAAA CAGGTTTTGA ACAAAAATAA GGATTTTGTA ATGGCTTCGG GCAAATTGGC GGTTTTATTG TACAGGGAAG AAATTGAACG CTCGCTGAGA ACTGCGGATT ATGGAGGTTT TCAGCTTCTT GACATGCATG ATTTCCCTGG GCAGGGTACT GCCCTTGTGG GATGGCTGGA TGCCTTCTGG GATTCCAAAG GTCTGGTGGA GCCCCGGGAA TTCAGAAACT GGTGCAACCA CTCTGTGCTA CTGGCAAGAA TGGAAAAACG TGTGTGGTTA AATAGTGAGG TATTCAGTGC TGAGATCAAT TTTGCCAATT ACAGTCAAAA TGATTATTCA CAGCTTAATA TCAGGTGGGA ATTAACCCGA AGTGACGGTA CGATGTATTC CTGCGGTAAA TTTAATAACA TCTTCATTCC CCAGGGCAAC CTATCTTATA TTGGAGTGGC AGCAGCTGAG CTCAATAAGG TAGAAACTGC TGAAAAACTT ATACTTACAA TTGTTACGGA CGGCATTTGT ATAACAAATC AATGGGATAT CTGGGTATAT CCCGAAAAGC TAGCTGTTGA AATGCCGGAA GACATATATG TTGCCGAAAG CTGGGATGAC GGAGTGTCCG AAACCTTGGG CAATGGGGGG ACAGTGCTTA TGTTTACTGG AGCAGTAAGA AATTCGGAGG CAATGTGTTT TACGACTCCA TTTTGGAATA CCCAGATGTT TGAAAACCAG AGGAAAACCA TGGGTATTCT TTGTAACCCT AATCATCCTG CTCTTCTGGA CTTCCCCACG GAGTACCATA CAAACTGGCA ATGGTGGGAG CTGCTTGCCG ATTCAAAGTG TATAGGCATA AATGAACTCC CGGCAGGGTT TGAGCCAATT GTCTCTGCAA TAGATCACCC TGTCAGAAAT AACAGGCTAG GAGTTATTTT TGAAGCAAGG GTTCTCAGCG GAAAACTTCT TATATGTAGC CTCGATTTGA ACGGCGATTT AGGCGGCCTT CCGGCAGCAA GACAGTTAAA GTACAGTATT CTGAATTATA TCCAGAGTGA AAGGTTTGAG CCTGAATTTT ATGTGGATGA GAGCATTATG GCAAATATGT TGTTAAAAAA CACTGCATCC AATTTAAAAT TACTGACAAA AAATATTAAG GCATCAAATA CAAAATACTC AAGTAAAGTG GAATATATTC TTGATACTGA TACGTCAAAT TTTTGGGTCA CAATGGGAGG ACAATACCCG TATGTCATAG ATATTGAACT TACGGAAAGT ACAGCCATAA AAGGTTTAAC ATATTGGCCG AGACAGGATG GTATGACGAT AGGACTTATA TCCAGATATG AAATATATAT ATCCGGTGAT CCGGTGCAAT ACGGTAAGCC TGTCGCATCA GGAAGCTTTA AAAACACACT TGAAAAACAG GAGATAATTC TGGATTGGAT TGATGACGGA TTTAATGTTA CCAGGAGCAA GACAGGGAAA TATATTAGAT TTGTTGCTGT TAAAGGGTTT AACAATGACC GGGAAGCTGC CATTGGAAGT CTGGACATAA TAACCGTATA A
|
Protein sequence | MDKRSFPLDG DWYVALDEKE IGIKEKWFES TFTEKVKLPG TLDENGIGSL VTGTDTFRLN RIRKYVGAAW YNRLVVLPDD IVEKHVTLFL ERCMWKTELW INGNYAGSQN SLSTPHKYKL DGMLKAGENL ISIKVDNSPI YNLGVMSHGY SEEVQTVWNG IVGRIELDIR DKVYVERANV YSDMESSKLS TRLLLVNTAS KAVEGLISLS VRPQDGENVI IRDSYYFSIG ANSSVNVELT QAYEDKLGLW DEFNQNLYRM DINLECQCEK ENEMYSDSSQ ILFGIRSFKA EGPVFRLNGK KIFLRGTHDA GNFPITGYPS MEVEDWKRIY AIAKSYGINH FRFHSWCPGE AAFAAADEEG MILQAELPLF GFTAPPLGQD EPRDSFLREE LLRILEEYGN HPSFCMMCMG NELRGDYEML SEFVEMGRKT DGRHLYSSAA NNAAEPGLGI KPNKGDQYYV AHEARINGER VNRRCENVFN NERPETISDY SETLKGIEVP TISHEVGQWE VYPDFEEIKK YTGVLKAKNF EEFKGSAAAK QVLNKNKDFV MASGKLAVLL YREEIERSLR TADYGGFQLL DMHDFPGQGT ALVGWLDAFW DSKGLVEPRE FRNWCNHSVL LARMEKRVWL NSEVFSAEIN FANYSQNDYS QLNIRWELTR SDGTMYSCGK FNNIFIPQGN LSYIGVAAAE LNKVETAEKL ILTIVTDGIC ITNQWDIWVY PEKLAVEMPE DIYVAESWDD GVSETLGNGG TVLMFTGAVR NSEAMCFTTP FWNTQMFENQ RKTMGILCNP NHPALLDFPT EYHTNWQWWE LLADSKCIGI NELPAGFEPI VSAIDHPVRN NRLGVIFEAR VLSGKLLICS LDLNGDLGGL PAARQLKYSI LNYIQSERFE PEFYVDESIM ANMLLKNTAS NLKLLTKNIK ASNTKYSSKV EYILDTDTSN FWVTMGGQYP YVIDIELTES TAIKGLTYWP RQDGMTIGLI SRYEIYISGD PVQYGKPVAS GSFKNTLEKQ EIILDWIDDG FNVTRSKTGK YIRFVAVKGF NNDREAAIGS LDIITV
|
| |