Gene Information |
Locus tag | Cthe_1458 |
Symbol | |
ID | 4810608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1777408 |
End bp | 1778217 |
Gene Length | 810 bp |
Protein Length | 269 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106879 |
Product | extracellular solute-binding protein |
Protein accession | YP_001037880 |
Protein GI | 125973970 |
COG category | [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000125438 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAAAAAT TAAAAATGAA AAAATTAATT TCATTGCTAT TGACACTGAC CATGATTCTT GCATTAACTG CCTGTGGCGC AAAAGAAACC GGTAAATCAC AGGATGAAAG TGATTTGAAT TACGTTAAAA GTAAAGGCAA ACTTATCATT GGCTACACAA ATTATGCTCC TATGAATTAT ACAGATGAAA ACGGCGTTTT TACCGGCTTT GACACAGAGT TGGCAATTCT AACGTGTGAG AAACTGGGAG TTGAACCTGA ATTTGTTGAA ATTAACTGGG ATACCAAAGA AGTGGAACTG AATGCAAAAT CCATTGACTG TATTTGGAAT GGATTGACAA TTGATGACGA TCGCAAGGCG AAAATGGAAA TTACACAGCC TTATGTGAAG AACGCTCAGG TGGTTTTGGT GAAGGAAGGT ACTGAGTATA ACGGAACGGA ATCCTTGATA GGAAAGACCG TGGTGGCCGA GCAGAGTTCT GCCGGAGAAA AAACCATCAT GGCTGATGAG AATTTAAAGA AGGCAAACTA TGTTCCAAAG ACTTTGCAGA CAGACTGTCT GATGGAATTG AAATCCGGTA CCGCCGACGC AGCCGTATTG GATTTGACTT TGGCAAAAAC AATGACCGGT GAAGGCACCA GCTATGAGGA TATCGTGATT GTAGATTACC TGGCAGAAGA AAATTATGGG GTTGCTTTCC GCAAGGGTTC TGATATTTGT GCGGAAGTCA ACAAGATTTT CGACGATTTT TTGGCGGACG GTACAATGGC TGCATTGGCT GAAAAATACG GACTGGAGCT GGCTGAATAA
Protein sequence | MKKLKMKKLI SLLLTLTMIL ALTACGAKET GKSQDESDLN YVKSKGKLII GYTNYAPMNY TDENGVFTGF DTELAILTCE KLGVEPEFVE INWDTKEVEL NAKSIDCIWN GLTIDDDRKA KMEITQPYVK NAQVVLVKEG TEYNGTESLI GKTVVAEQSS AGEKTIMADE NLKKANYVPK TLQTDCLMEL KSGTADAAVL DLTLAKTMTG EGTSYEDIVI VDYLAEENYG VAFRKGSDIC AEVNKIFDDF LADGTMAALA EKYGLELAE
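The coordinate, length, and sequence fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database: the function names `span_length` and `translate` are hypothetical, and only the first 30 nucleotides and first 10 residues are copied from the record. It assumes the gene sequence is displayed 5' to 3' in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), and it relies on translation table 11 sharing its codon assignments with the standard code.

```python
# Values copied from the record above.
START_BP, END_BP = 1777408, 1778217
GENE_LENGTH_BP = 810
PROTEIN_LENGTH_AA = 269

# First 30 nt of the gene sequence and first 10 residues of the protein.
GENE_PREFIX = "ATGAAAAAATTAAAAATGAAAAAATTAATT"
PROTEIN_PREFIX = "MKKLKMKKLI"

# Codon assignments in NCBI order (T, C, A, G). Table 11 uses the same
# assignments as the standard code; it differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def span_length(start, end):
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def translate(dna):
    """Translate complete codons of a coding-orientation DNA string."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # Inclusive coordinates should reproduce the stated gene length.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # One codon is the stop codon, so protein length is codons minus one.
    print(GENE_LENGTH_BP // 3 - 1 == PROTEIN_LENGTH_AA)
    # The translated prefix should match the start of the protein sequence.
    print(translate(GENE_PREFIX) == PROTEIN_PREFIX)
```

To check the whole record, the spaces in the displayed sequences would need to be stripped first, and the trailing `*` that `translate` emits for the stop codon dropped before comparing.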