Gene Information |
Locus tag | Ccel_2997 |
Symbol | |
ID | 7311607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3547922 |
End bp | 3549310 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643609901 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002507271 |
Protein GI | 220930362 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
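The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3547922
end_bp = 3549310

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# One codon (3 bp) per residue, minus the terminal stop codon.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1389 462
```

Under these assumptions the result matches the reported Gene Length (1389 bp) and Protein Length (462 aa).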
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000366828 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAAAA ATAGGATTTT ACCATTAGTT CTGAGTGTGG CAATGATAGC TATAGCCTTT ACTGCATGCG GATCAAAGGA GAAGCCTAAT GACACAGATT TAACCTCGTC AGCGTCATCA ACCGCAGCAG GTACAACTTC AAAAGCAAAG GATATTAAAG GAGAAATTCT GTTCTATAAT ACCAGAACTG ATATGGAACT GGACAGTTAT GACAAAAACT GGGAATATTA TATTGGAGAA TTTAATAAAA ATTATCCTAA TATAGAAGTA AATATTGAGA CATCAAAGGA TTATGAAGGT GATCTTGCAA TCCGAATGAA TTCAAACGAA TACGGTGATG TTTTATTCAT GTCTGCAAAA ATGAAGGATT CGGATCTTCC TAGCTTTTTC ATACCGTTAG GAAAAAAAGC AGATTTGGAA AAGAAGTATG ATTTTGTTCA GGACAGATAT GTAGGTGAGG ATATATACGG AATTCCACCT AACGGAAACG GACAGGGTAT AGTATATAAC AAGGCTGTAT TTGCAAAGGC TGGTATTACT TCTTTGCCGA AGTCTGAGGA TGAATTCCTT TCTGATTTGA AACTTATTAA AGAAAAAACA GACGCTATTC CTTTGTATAC AAACTACAAG GACAGCTGGG CCCTCAATGC ATGGGAAGGA TACATAGATA GCGTGTCAGG CAGTGATACA TACACTAATC AGGTAATGCT TCACGAGGAT GATCCGTTTG CACCCGGGAA ACCTCACTAC ATAGTTTACA AGTATTTATT TGATGTTGTA AGTCAGAAAT TGGTTGAGGA CGATCCGATG ACTACTGACT GGGAAAGTTC AAAGCAACTA CTGGCAGATG CTAAAATTGC AACTATGCCT TTAGGCTCAT GGGCAATTCC ACAGATTAAA TCAAAAGCAA AAAATCCGGA TGACGTGTCA TACATGCCAT TCCCATATAA TATTGACGGA AAAATGTATA CACAGGCTGC AGGCGACTAC AAGCTCTGCA TAAGTAAGAA CAGTAAGAAT ATAGAAGCAG CAAAGGCATT TTTATGGTGG TTCCTTGATG AATCAAACTA TGCTCAGAAT GAAGGGCTTA TACCATCACT TAAAGGCTCC GCATATCCTG ATACATTAAA AAACTTCCAA GATATGGGAG TAACACTCCT TATTGATAAG GGAGCTATCA ATGATGTAAC CAAAAACGAA AACGGATGGC TGGATGCTAT AGATAAAGAG TCAGAAGTAG GCTTGTGGAA TGAAAACTTT AAGAAGGATA TTGTAGATAC AGCACTAGGT AATAAAAAAG GTACATATAA TGATGTTATG AATGGCCTTA ACAAAAAATG GGCTGATACC AGAGCAAAGC TTATAAAAGA AGGAACTATA GGTAAGTAA
|
Protein sequence | MIKNRILPLV LSVAMIAIAF TACGSKEKPN DTDLTSSASS TAAGTTSKAK DIKGEILFYN TRTDMELDSY DKNWEYYIGE FNKNYPNIEV NIETSKDYEG DLAIRMNSNE YGDVLFMSAK MKDSDLPSFF IPLGKKADLE KKYDFVQDRY VGEDIYGIPP NGNGQGIVYN KAVFAKAGIT SLPKSEDEFL SDLKLIKEKT DAIPLYTNYK DSWALNAWEG YIDSVSGSDT YTNQVMLHED DPFAPGKPHY IVYKYLFDVV SQKLVEDDPM TTDWESSKQL LADAKIATMP LGSWAIPQIK SKAKNPDDVS YMPFPYNIDG KMYTQAAGDY KLCISKNSKN IEAAKAFLWW FLDESNYAQN EGLIPSLKGS AYPDTLKNFQ DMGVTLLIDK GAINDVTKNE NGWLDAIDKE SEVGLWNENF KKDIVDTALG NKKGTYNDVM NGLNKKWADT RAKLIKEGTI GK
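As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the following sketch translates a nucleotide string and computes its GC fraction. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG), and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code; alternative start codons are not handled. The function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11,
# listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
prefix = "ATGATTAAAA ATAGGATTTT ACCATTAGTT"
suffix = "GGTAAGTAA"

print(translate(prefix))  # MIKNRILPLV
print(translate(suffix))  # GK
```

The two printed fragments correspond to the first ten and last two residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the reported GC content.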
|
| |