Gene Information |
Locus tag | Ccel_1025 |
Symbol | |
ID | 7312175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1277219 |
End bp | 1278553 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643607952 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002505367 |
Protein GI | 220928458 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000399953 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATGC ACAAGATTTT AGTATCTACC ATACTATCTG TAGCGATGTT GTTATCTATA ACTGCATGTG GATCTAATGA TGCGGGATCT GGTGATAAGG CACAGAGTAC GGGTACTACA TCTGCTGATA CAAATAAAGA AAATAAAGGT ACTACTAAAT TAACAATGTG GCATATCCAG ACACAAGAAA ATGTTGCTGA TATTATTGAT GCTTCAATGG ATAGATTTGC AAAAGACAAT CCAGGTTTTG AAATGGAAGC TGTCCCAATG CAAAATGACC CATATAAGAC CAAACTTATC ACTGCAATGA GTGCAAATGA ACTCCCTGAC GTCTTTATTC ACTGGACAGG AGGTCCGATG ATCTCCTACA TTGATTCAGG TGCAGTATAT GATATAACTG AATATATGAA TAAAGATAAT TACAAGGATA AATTCCTGGA TGCTTCTATA CAGCAAGCTA CTTATAAAGA TTCCATTTGG GCTGTTCCTG TAGAAAATGT ATCTCCTGCC CTGATGTTCT ACAACAAAAA GCTGTTTGCT GATAATGGCA TTGAAGTTCC AAAAACTCTT GACGAATTTG AAAAAGTAAG TGACACTTTT GTACAAAAGG GTATCATACC TGTTTCACTT GCTAACAAGA CAAAATGGCC TGGATCAATT GTATATGGAT ATGTTCTGGA TAGAATAGCT GGTCCAAATG CATTTGCAGA TGCAAATAAC GGCGTAACTC AATTTGATAC ACCAGACTTT TTGGCTGCCG CTACAAAAAT ACAAACTTGG GCTAAAAAAG GTTATTTCGG GAAAGACTTC AACGGAATGG ACTATGATGC CGGTCAAGAC AGACAATTGT TCTACAATGG TAAAGCAGCC ATGTACATAA TGGGAGGCTG GTTCCTGTTT ACTGCAAAGG GAGAAAATCC AGAATTTGTT GAAAATGTAG GAGTAATGCA GTTCCCAGCA AACCCTGACA GCAAGGGTAA GGCATCCGAA TATATCGGTA CAATGGGTGA CAACTTCTAT TCTGTATCCA AGAATTCCAA GAACCCAGAG ATGGCATTCA AAGCTATTAC ATATATGCTT GATGATCAGG CTGTTAAGGA AAGGATTGAA TCAGGAAAAA TTCCTCCTCT TAAAGGCATA ACATTGACTG ATGAAATCAG TAAAATTGTT TCTGATAGTG CATCTTCAGC TACAAGCATG CAGCTCTGGC TTGATCAGTA TCTTGCACCG AACGATGCAG AATTACATAA AGATCAATTA CAAAAATTAT TGGCTGGTTC GATCACACCT GAACAATATA ATAAAAGTAT GACGGATGGC GTAAAGAACA AATAA
|
Protein sequence | MKMHKILVST ILSVAMLLSI TACGSNDAGS GDKAQSTGTT SADTNKENKG TTKLTMWHIQ TQENVADIID ASMDRFAKDN PGFEMEAVPM QNDPYKTKLI TAMSANELPD VFIHWTGGPM ISYIDSGAVY DITEYMNKDN YKDKFLDASI QQATYKDSIW AVPVENVSPA LMFYNKKLFA DNGIEVPKTL DEFEKVSDTF VQKGIIPVSL ANKTKWPGSI VYGYVLDRIA GPNAFADANN GVTQFDTPDF LAAATKIQTW AKKGYFGKDF NGMDYDAGQD RQLFYNGKAA MYIMGGWFLF TAKGENPEFV ENVGVMQFPA NPDSKGKASE YIGTMGDNFY SVSKNSKNPE MAFKAITYML DDQAVKERIE SGKIPPLKGI TLTDEISKIV SDSASSATSM QLWLDQYLAP NDAELHKDQL QKLLAGSITP EQYNKSMTDG VKNK
|
| |
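
The length fields in this record can be cross-checked against its coordinates. The sketch below is a minimal illustration and not part of the database itself. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the protein length excludes the terminal stop codon of an unspliced CDS. The function names are hypothetical. The GC helper simply counts G and C among the A/C/G/T characters and ignores the spaces that separate the 10-base blocks in the Gene sequence field.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C among A/C/G/T characters; whitespace and other symbols are skipped."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(1277219, 1278553)   # Start bp, End bp from the record
    print(length)                            # 1335, matching Gene Length
    print(protein_length(length))            # 444, matching Protein Length
```

Passing the Gene sequence string to `gc_percent` gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.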