Gene Information |
Locus tag | Ccel_0996 |
Symbol | |
ID | 7309826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1236408 |
End bp | 1237775 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643607923 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002505338 |
Protein GI | 220928429 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
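The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1   # inclusive range, hence the +1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1    # the last codon is the stop codon
    return gene_len, protein_len


# Values taken from the record: Start bp 1236408, End bp 1237775.
gene_len, protein_len = feature_lengths(1236408, 1237775)
print(gene_len, protein_len)  # 1368 455
```

The result matches the reported Gene Length (1368 bp) and Protein Length (455 aa).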
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAGC GTTGTAAAAT ATATTTATTG ATTTTTATGC TGTCTATAAC TGCAGCCTGT AGTGACTCTA CAGGCTTGTA CAAAAATGCT GGGACATCGG AAGCTTTTAC ACCGACGAAT AAAGTGAAAA TCCTCTTTCT CTCCATAAGT GACAGCGATG TAAGGGAAAA TATCAGAGAG CACTATATTA AAGCAAATCT GGAAAAAGAA ATGCCCGATT TGGAAGTGGA ATTTGATTTA GGCGGAGGGG GACAGGATTA CGCCAACAAG TTGAAGGTGT ATAACTCATC AGGCAACATG CCAGATGTAT GGTTTTCGGA GCAAAACCTG TCTACCGTTG TTATTGATTC CGGCAATGCA CTTGATTTGG CACCATATAT ATATAAATGG GAATTTGATA AAAAGTTTAC CGAAAAGGAA TTTATAATGC CGGACAAAAA CGGCAGGATT TACTCTGTTC AATCAGGCAA CGACATGTTT ACCACTCCAA GGCTGTGGTA TCACAAAAAT ATATTCAAAA GATATGGTAT ACAGGTTCCT AATACTTATG ACGAACTGCT GAAAGTATGT GAACTGTTAA AGGTAAAAGG ACTAGTTCCT ATTTCAATTT CGGGCAGGGA TGGATGGACA CTTAATCTTC ATCTACTTCA GACAATGATT ATGGCTGAAG AACCGCAGGT TGCACTTGAT ATGCTTAATG GGAATACTGA TTTTACCAAC CCAGTAATAC AGAGGGCTCT GGAACGTATA AGGAAGTTGG TTCAGGTGGG AGCATTTCCT TATAATACTG CAAATCTTGA CTATGAGCCG GCAATAAACA TGTACACCTC AGGAAGGGCT GCAATGCTGT CGGCATTTTC ATGGGAACTG CCGAGGCTGG AAAAGTTGGC CCCTGATACT GATTTCATGC CATGGCCGTC TGTAAGAGGG GGGACTGACA ATACTAAGGC TGTACAATAC TGGGGTGCTC CCTTAAGCGG CTATATGGTA TGGGCAAAAA CAAAAAATCC TGAGGCAGCC GCACGTTTTG CAATGTACTG TGCTACCCAG GATGCATTAT ATTATAATAT TGAGAGTAAA GCTCCAACAA TGCTTAACAC CGGAGTAAGA ATAGAGAATA CCTCTGCCTT GGCAAAAAAG GATCTGAAAC AGCTTGATAT GGCTGAAATA AAGATACCCT CCATTTGGTC GGCTGCCTTT AATACCAGAA CCTCTGCAGA AATTTCCGTT CAAAACAGAA AGCTTCTGAC AGGGAGGTAT TCTCCCGATG AATATATCAA GGCAATAAAT CCTCTGTGGA TTGAAAACGC GAAGGAAATA AGGGAGAGAA GCTATGATAT ACACTATAAA TTTCATCCTC AACCCTGA
|
Protein sequence | MAKRCKIYLL IFMLSITAAC SDSTGLYKNA GTSEAFTPTN KVKILFLSIS DSDVRENIRE HYIKANLEKE MPDLEVEFDL GGGGQDYANK LKVYNSSGNM PDVWFSEQNL STVVIDSGNA LDLAPYIYKW EFDKKFTEKE FIMPDKNGRI YSVQSGNDMF TTPRLWYHKN IFKRYGIQVP NTYDELLKVC ELLKVKGLVP ISISGRDGWT LNLHLLQTMI MAEEPQVALD MLNGNTDFTN PVIQRALERI RKLVQVGAFP YNTANLDYEP AINMYTSGRA AMLSAFSWEL PRLEKLAPDT DFMPWPSVRG GTDNTKAVQY WGAPLSGYMV WAKTKNPEAA ARFAMYCATQ DALYYNIESK APTMLNTGVR IENTSALAKK DLKQLDMAEI KIPSIWSAAF NTRTSAEISV QNRKLLTGRY SPDEYIKAIN PLWIENAKEI RERSYDIHYK FHPQP
|
| |
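As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the sketch below translates DNA written in the record's space-separated blocks of ten bases. It is not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it does not handle alternative start codons such as GTG or TTG, which is sufficient here because this gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of input.

    Trailing bases that do not form a full codon are ignored.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two and last two blocks of the gene sequence in this record.
print(translate("ATGGCTAAGC GTTGTAAAAT"))  # MAKRCK
print(translate("TTTCATCCTC AACCCTGA"))    # FHPQP
```

The two fragments reproduce the start (MAKRCK) and end (FHPQP) of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein sequence and the reported GC content to be checked in the same way.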