Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1223 |
Symbol | |
ID | 7310020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1497206 |
End bp | 1498234 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643608144 |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_002505559 |
Protein GI | 220928650 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.999664 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA AGTTAATTGC ACTTCTGATG TGTTTTTCAT TGGTAATAGC GGCAGGATGT GGAGCTTCCG ATACCAATTC ATCAGATTCC GAATCGTCTG AGTCTGCCCA ATCCACATCA GCTAATGATT CAGGCAGCAA CAAGTTGATT ACAATAGGCT TCTCACAGGT TGGTGCTGAA AGTGACTGGC GGGTAGCTAA CACTGCATCA ATGAAATCCG CATTATCCGA AAAAAATGGC TTCAAATTGA TTTTTGCTGA TGCTCAGCAG AAGCAGGAGA ACCAGATTAA AGCAGTAAGG GATTTTATAT CACAGGATGT TGACGTTATA GCTATAGCAC CTGTTACAGA AACGGGCTGG GAAACGGTAT TGGGTGAAGC AAAGGATGCA GATATACCGG TAATTATTGT TGATAGAATG ATAAAGGTTT CGGATGATTC ACTCTTTAGC TGCTGGGTTG GTTCAGACTT CCAGAAGGAA GGTGTTAACG CAGCTGAATG GTTAGTTAAC TATATGAAAG AGAAAGGCAA GACCGATAAA CAAAATGTTG TAGTTCTTCA GGGAACAATA GGATCATCTG CTGAAATAGG CCGTACAAAG GGTTTTGGTG ATACTATAAA GAAATATGAT AACTTTAATA TACTGGCACA ACAGACTGGA GAGTTTACTC AGGCAAAAGG CCAGGAAGTA ATGGAATCCT TCTTAAAACA GTACAATGAC ATCGATGTAG TTATAGCACA GAACGATAAT ATGGCCTTCG GAGCTATTGA TGCTTTAAAA GCAGCAGGTA AGGCTCCTGG AAAAGATGTA ACAATTGTAT CCTTTGACGC AGTTAAGGCT GCATTCAAAT CAATGATAGC AGGGGATATG AATGTATCGG TAGAATGTAA TCCTTTACAC GGGCCTAGAG TAGCTGAACT GGCTAAAAAA CTCATGAACG ATGAAAAAGT TGAAAAGATA CAGTATGTTG ATGAAAAAGT ATATCCGGCT GAAATAGCTG AAAAAGAACT CCCGAATCGC CAATATTAA
|
Protein sequence | MKKKLIALLM CFSLVIAAGC GASDTNSSDS ESSESAQSTS ANDSGSNKLI TIGFSQVGAE SDWRVANTAS MKSALSEKNG FKLIFADAQQ KQENQIKAVR DFISQDVDVI AIAPVTETGW ETVLGEAKDA DIPVIIVDRM IKVSDDSLFS CWVGSDFQKE GVNAAEWLVN YMKEKGKTDK QNVVVLQGTI GSSAEIGRTK GFGDTIKKYD NFNILAQQTG EFTQAKGQEV MESFLKQYND IDVVIAQNDN MAFGAIDALK AAGKAPGKDV TIVSFDAVKA AFKSMIAGDM NVSVECNPLH GPRVAELAKK LMNDEKVEKI QYVDEKVYPA EIAEKELPNR QY
|
| |