Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0547 |
Symbol | |
ID | 4808296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 668249 |
End bp | 669205 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640105961 |
Product | periplasmic solute binding protein |
Protein accession | YP_001036976 |
Protein GI | 125973066 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.624279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAGA ATACAAAAAA GGTGGCTTTT CTGGTTGCAT GTCTGGTATT TCTCATTACG GGCTGTTTTG GCAACTCAAA GGTGGGAGAG CGAAAGGACA GTGATAAAAT CAATGTATAT GCCAGTTTTT ATACCATGTA TGATTTTGCA AATAAAATAG GCGGCGACAG GATTAACCTT GTCAATATTA TGCCTGCGGG AACGGAACCT CATCACTGGG AACCGTCACC GAAAGATATT CTTGAGATTG AGAAAGCTGA CGTGTTTATA TATAACGGAG CGGGTATGGA GCCTTGGGTG GAAAAGGTTC TGGGTTCTAT AACTAACAAA GAACTTGTGG TGGTTGAAAC ATCAAAAAAT ATAGAACTTC TTCGTGGCGG CCACTATCAT GATGAAGAAA ATGCGGAGGA AGAATCCGAC GGTCATGATG ACCTTGCATA CGACCCTCAT GTGTGGTTAA GTCCCAAAAA TGCCAAAAAG CAAATGGAAG CAATAAAAGA TGCTTTTGTA AAAGCGGATC CTGAAAACAA AGAATATTAT GAAAAAAATT TTGAAGACAA TGTCAAAAAG CTTGACGATC TTGACAGAGA ATACAGAGAA GAACTGGCGA AGTATACAAA AAAGGATATT GTTGTGGGAC ATCAGGCTTA CGGATATTTA TGCAAGGAAT ACGGACTAAG ACAGTATGCC ATTGAAGGCC TTAATGCGGA ATCTGAGCCG ACGCCGGCCA GAATGGCTGA AATAGTAAAG TTTATTAAAG AAAATGATAT AAAGGTCATA TTTTCGGAAA AACTGCTAAG CCAGAAAGTA CCGAATGCTA TTGCCGCTGA GACAGGAGTT AGGGTGGAAT TTTTAAATCC CCTTGGCGGA CTTACACAGG AAGAGATTGA TTCGGGGAAA GAATATTTTA CAGTAATGAG GGAAAATCTG GAGGCGTTAA AAAAGGCTTT TGAATAA
|
Protein sequence | MRKNTKKVAF LVACLVFLIT GCFGNSKVGE RKDSDKINVY ASFYTMYDFA NKIGGDRINL VNIMPAGTEP HHWEPSPKDI LEIEKADVFI YNGAGMEPWV EKVLGSITNK ELVVVETSKN IELLRGGHYH DEENAEEESD GHDDLAYDPH VWLSPKNAKK QMEAIKDAFV KADPENKEYY EKNFEDNVKK LDDLDREYRE ELAKYTKKDI VVGHQAYGYL CKEYGLRQYA IEGLNAESEP TPARMAEIVK FIKENDIKVI FSEKLLSQKV PNAIAAETGV RVEFLNPLGG LTQEEIDSGK EYFTVMRENL EALKKAFE
|
| |