Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0747 |
Symbol | |
ID | 4810365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 913436 |
End bp | 914500 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106164 |
Product | extracellular solute-binding protein |
Protein accession | YP_001037175 |
Protein GI | 125973265 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.389252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TTGCGTTGCT ATTGGCTGTG TTGATGTTTG TATTTACAGG CTGTGCCGGC AATGTTAAAA ATGACCCCAG CAAACCTCTT GCGGGTACGA CTATTTATGT CTATAACTGG GGAGATTATA TAGCTGAAGA TACTATTGAA CGTTTTACCA AGGAAACGGG CATTAAGGTT ATTTATGAAA CTTTTGATTC CAATGAAACC ATGTATGCTA AATATAAATC CGGTGCGGTA AATTACGATG TTTTGATTCC GTCCGATTAT ATGATTGAGA AATTGATTGC GGAAAATGAA CTGTTGCCTT TGAATTTTGA CAATATTCCC AATGCAAAAT ATATTGACGA ATCATTCAGA AATTTGGGTT ATGACCCGGA AAACAAGTAT TCTGTGCCAT ATTTCTGGGG AACTCTCGGA ATTCTTTACA ACAAAAAAAT GGTGCAGGAG GAAGTTGACT CCTGGGATAT TCTGTGGGAC AGCAAGTACA AGGGTCAAAT TATAATGATG GATTCGGTAA GAGACACTTT TGCTGTTGCC CTCAAAAGGT TGGGATATTC CCTGAACAGC ACTGACAAAG CGCAGGTGGA TGAAGCAGTG GCAAGCCTGA TAGAGCAGGC TCCGTTGGTT CAGGCTTACC TGATGGACCA GGTAAAGGAC AAGATGATAG GGGAAGAGGC GGCTTTGGCG GTAATTTATT CCGGAGAAGC TGTTTATACT TCTGAATACA ACGAGAATTT GGAATATGCA GTTCCAAAGG AAGGTACAAA CTTCTTTGTT GACGCCATGG TTATACCAAA GACTTGCCAA AACAAAGAAG CAGCAGAAGC TTTTATCAAT TTTATGAATG ATCCTCAGAT TGCTTACAAC AATACCAAGT ATGTTGGATA TTCCACACCG CATACGGAAG CCAGAGACAT GTTGGATGAA GAAATAAAAA ACAATCCTGC GGCTTACCCT CCACAGGAAA TTATAGACAA GTGTGAAGTG TTTGTGGATC TTGGACCGGA GATGACGGTT TACTACAATG ACAAATGGAA TGAATTGAAA GCATCGTTGC GTTAA
|
Protein sequence | MKKIALLLAV LMFVFTGCAG NVKNDPSKPL AGTTIYVYNW GDYIAEDTIE RFTKETGIKV IYETFDSNET MYAKYKSGAV NYDVLIPSDY MIEKLIAENE LLPLNFDNIP NAKYIDESFR NLGYDPENKY SVPYFWGTLG ILYNKKMVQE EVDSWDILWD SKYKGQIIMM DSVRDTFAVA LKRLGYSLNS TDKAQVDEAV ASLIEQAPLV QAYLMDQVKD KMIGEEAALA VIYSGEAVYT SEYNENLEYA VPKEGTNFFV DAMVIPKTCQ NKEAAEAFIN FMNDPQIAYN NTKYVGYSTP HTEARDMLDE EIKNNPAAYP PQEIIDKCEV FVDLGPEMTV YYNDKWNELK ASLR
|
| |