Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0145 |
Symbol | |
ID | 7312064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 164518 |
End bp | 166110 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643607074 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002504513 |
Protein GI | 220927604 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGC TAAGTACCAG AATTGTAAGC CTGGCACTGT TAACATCAAT GAGCGTCACT CTTTTTGCTG GGTGCGGTTC AAACGGCGGA ACAGACGCAG ATAGTTCAAA GAGTGCTTCA TCTTCAGGTG CAGCAAAATC GGGCCCAGCG GTAGAATTAA CCGTAGAAGT TTTTGACAGA GCTACACCAG GATATAAGGC TGATGATAAT TTCCAGATAA AATGGATTCA AGAGAACTTT GGTAAGCCAA ACAATATTAA TGTTAAGTTT GTACCTGTTT TAAGACAACA GGAAGTAGAA AAGCTGAACG TTCTTATGGC ATCAAACCAA GCACCGGATA TCAGCTTTAC TTATAATGAC GGTATTATTT ACAACTATGT TAAGAGCGGA GGACTTACTG ATTTAGGAGA CCTTTTAACA AAGAATGCTT CCAATCTTAC AAAATATCTT GGGCAAACTT TACTTGATTA TGGCAAATTT GACGGAAAGC AGCTTGCAGT TCCTGCAAAG AGAGTTATAG AAGGCTGCTT CTCGGCATAT ATCCGTAAAG ACTGGCTGGA TGCAGTCGGA ATGTCAGTTC CTACTACAAC TGATGAATGG TATCAGGTAA TGAAGGCATT TAAGGAAAAG GATCCCGGAA AACTTGGAGA CAAGAACTAT CCTTTTAGTA CATTTGTTGA TCCAAACAAT ATAAACTGGA CTACATCAAT GTTGTTAGAG TCCTTCAAAC AGCCTATTTC AGAAGAACAA AGAATGACAT TGCCAAACTG GGTAATTCCT GGATTTAAAG ATGGTATGAA ATTCTTGAAT AAGCTATACA ACGAGGGTAT ATTGAATCCT CAGTTCGCCT TGGATAAAGA CGGAAAGCAG TATGAAAAAG ATGTATCACA GGGTAGAATT GGTTTCATGA TACATAATTA TGACTTCCCC ATCAGAGTTA CTCCCGGATT ATTATCTGAG TTGAAAAAGC AGGTTCCTGG AGCAGATATG GTTCCATGTG ATCCGTTCAC AAACTCGGAT GGTAAACATC CAAAAATGAA ATATAATCCT AATGGATTGT ATATAATAGT TCCAAAAGTA AGCAAACATG CGGAAGAAGC CGTTAAGTAC CTTGAATGGC AATCAAAACC AGAAGTAATT AAGTTCCTGC AAAATGGTAT TAAAGGTGAC CAGTATACTG ATGAAGTTGA CGGTATCCCG GCTAACTTTA TACAGAATGA TCAGCTTTCT GATGACAAGA AAGCTAACTT CACTGATTTG GCATTGATTG TTAACGGTAA AGAATTCGGA GATCCTGCAA AGAACATTCA GGCGGCATCT TTCGGATACC CCGGATTTGA AGATACATTT AAGAAGGCAT ATGACATTTC TCTTACAGAT GCAAACTATA TTCCACACTT TGATACTGTT ATAGAAGCAC AGGCAAAATA TCAGAAGGCT CTTTCCGACA AAGAAGCAGA AATATTTGTT AAGAGTATCA CTTGCAAACC TGCTGATTTC GATAAGACTT ATGACAAACT TGTTGCTGAG TACATGAAGT CAGGCGGACA GGAAATTGTT GATGAGAAAT TAGCTGCATT GAAGAAGAAA TAA
|
Protein sequence | MKKLSTRIVS LALLTSMSVT LFAGCGSNGG TDADSSKSAS SSGAAKSGPA VELTVEVFDR ATPGYKADDN FQIKWIQENF GKPNNINVKF VPVLRQQEVE KLNVLMASNQ APDISFTYND GIIYNYVKSG GLTDLGDLLT KNASNLTKYL GQTLLDYGKF DGKQLAVPAK RVIEGCFSAY IRKDWLDAVG MSVPTTTDEW YQVMKAFKEK DPGKLGDKNY PFSTFVDPNN INWTTSMLLE SFKQPISEEQ RMTLPNWVIP GFKDGMKFLN KLYNEGILNP QFALDKDGKQ YEKDVSQGRI GFMIHNYDFP IRVTPGLLSE LKKQVPGADM VPCDPFTNSD GKHPKMKYNP NGLYIIVPKV SKHAEEAVKY LEWQSKPEVI KFLQNGIKGD QYTDEVDGIP ANFIQNDQLS DDKKANFTDL ALIVNGKEFG DPAKNIQAAS FGYPGFEDTF KKAYDISLTD ANYIPHFDTV IEAQAKYQKA LSDKEAEIFV KSITCKPADF DKTYDKLVAE YMKSGGQEIV DEKLAALKKK
|
| |