Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0200 |
Symbol | |
ID | 7309104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 224941 |
End bp | 226266 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643607129 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002504567 |
Protein GI | 220927658 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAG TGTCAGTGAC AAAAAGAGGA GTAGCACTTA TACTGGCTGG TGCATTAACC GTTGGAATGG CGGCTTGTGG CAAGAATACA ACCAATAATA ATGCTGCGGG CACAACTAAT AATGCAGGTG GAAACAAGGC TAAAAATGTT GAAATTAAAT TCAGTCATAT TTGGGGATCA GCAGCAGACC CCTTTACACC AGCTGCTAAG AAAGTTATTG AGGACTACCA GACTGCAAAT CCAAATGTTA AGATCGCTGT CGATACAAAC GAAAATGAAG CATATAAGAC AAAAATCAAA GCAATGGCAG CAGCTAATGA ACTGCCTGAT ATTTTTTCCA CATGGGGCGG CGGATTCTCA CAGCCATTCA TTCAGTCAAA ATCAGTAGTA CAACTTGATC AATATCTGAC GGATGATATA AAGAACAAGC TTGTTAACGG TGCATTGACC AATGTGACTT ATGACGGGAA GGTGTATGGA TTACCCTTCT TCTTGTCAGT AGGAGCAATG TTTGTCAATA CAGAGCTTTT TGATAAAAAT GGAGTCAAGG TTCCTACTAC ATGGGAAGAG CTTCTTACCG CAGTAAAAAC CTTTAAAGCT AAAGGAATAA CACCTATGGC TGTATCAGGT AAGGACAAAT GGACAATAGC AATGTACTTT GACGTAATGG CACTAAGAGC TGCAGGCCCT GAAAAGGTAA CCAAAACTCT TACAAAGCAA GGTTCATTCA AGGACCCAGA ATTCCTCAAT GCTGCTAACA GATTTAAAGA GTTAGTTGAT GCAGGAGCAT TCTCAAAGGG TGCTGCAGGT GTTTCAAATG ATGAAGCTGA AGTACCATTT TTTGAAGGAA AAATTCCAAT GATGTTCAAA GGTAGCTGGA CAGCAGGAAA AGCAGGTTCA AAGGATTCCA AGGTTGCAGG AAAGGTCAAG GCAATATCCT TCCCATCAAT ACCTGGCGGT CTGGGAAATC CAAAGCAGTT TACAGGCGGA GCTGTTGATG CTGTAATGGT AAGCGAAAAT TCTAAGAACA AGGAAGAAGC AATTAAATTC CAGATATATT TCGCTGAAAA TATGGCTAAA GAATCATACT TGTCCGGTGC ATCAATGCCT GCATGGAAAA CAGATGTTGA CGAAAGCAAG GTTAATCCTT CGCTTGTTGA CGTTGTTAAT CTGACTAAGG ACGCTGAATC ATATACAATC TGGTGGGATA CACTTCTTGC CGGCAAAGAT ACAGAAACTT ATCTCAATGC TTTGCAGGAA TTATTTATGG GTACAAAAAC ACCTCAACAG TTTGTTAACA GTTTGCAGAC AATTTATGGT AAGTAA
|
Protein sequence | MNKVSVTKRG VALILAGALT VGMAACGKNT TNNNAAGTTN NAGGNKAKNV EIKFSHIWGS AADPFTPAAK KVIEDYQTAN PNVKIAVDTN ENEAYKTKIK AMAAANELPD IFSTWGGGFS QPFIQSKSVV QLDQYLTDDI KNKLVNGALT NVTYDGKVYG LPFFLSVGAM FVNTELFDKN GVKVPTTWEE LLTAVKTFKA KGITPMAVSG KDKWTIAMYF DVMALRAAGP EKVTKTLTKQ GSFKDPEFLN AANRFKELVD AGAFSKGAAG VSNDEAEVPF FEGKIPMMFK GSWTAGKAGS KDSKVAGKVK AISFPSIPGG LGNPKQFTGG AVDAVMVSEN SKNKEEAIKF QIYFAENMAK ESYLSGASMP AWKTDVDESK VNPSLVDVVN LTKDAESYTI WWDTLLAGKD TETYLNALQE LFMGTKTPQQ FVNSLQTIYG K
|
| |