Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1570 |
Symbol | |
ID | 4810077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1898462 |
End bp | 1899523 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106988 |
Product | extracellular solute-binding protein |
Protein accession | YP_001037989 |
Protein GI | 125974079 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGTT TTATAAAATT GATCAGTGCA GTCGCTCTGT TTTCCATTTT TATAGGCTTG TTTTCCGGAT GTGGCAGGAC GAATACGGGA GGTACCCAGG ACAACGGGAA AACGGGGACA TCTTCAGCAG AATACAAATA CGGAAAAATT GATATTCCCG GTAAAGACGG TTCACTTTGC GGTGCACCTA TCTATATTGC ATATGAAAAG GGCTTTTTCA AGCAAGAGGG CTTTGATGTC AACCTTATAT CGGCAGATAC CGAAACCCGT AAAATCGGTT TGAACAACGG TACCATCCCG ATTGTCAACG GGGACTTTCA GTTCTTTCCC TCTATTGAAA ACAATGTGAA GGTAAAGGTG GTGGACGGTC TCCACTATGG ATGCATAAAA CTGATTGTTC CGAAGGATTC ACCGATTCAA GGAGTTCAAG ACCTTAGGGG AAAAAAGATC AGCGTTGATG AAATAGGCGG CACTCCCCAT CAGGTAGCAT CGGTGTGGCT GGAGAAAAAC GGAATTTCCG CAAAGCAGGA GGACGGAGAA GTTACGTTCC TTCCCTTCTC CGACGGAAAT CTGGCAGTGG AAGCCCTGAG AAAAGGAGAG GTTGATGTTG CGGCACTGTG GGATCCCTTC GGCTCTGTTC AGGAAAAAAC GGGAAATTAC CGTGTAATTC TTGATATTTC CAAGGATGAA CCTTTTGCCG GAAAATACTG CTGTTTCCTT TATGCTTCGG AAAAGCTGCT TGACGAAAAA CCAGAACAGG TTGCCGCATT GCTGCGTGCA TATAGGGCAG CCCAAAACTG GATTTCGGAA AACCCGGAAG AAGCCGTCGA CATTATAATA AATGGTAAAT ACGCGCAGAT TGAAGACAGA GAATTAGCCA TTAAGCTTAT CAAGAGCTAT CAATACCCTT CTTATGCCGA ACGGGAAAAA AATAAAACAC AGGTTCGAGA CAATGTTTAC TATTTTGCCG AACAATTGAA CCAAATTGGA TATTTAAAGA CGGATCCTGA TACTTTCACA AAGAATGCTT ATGTCGAGGT CGACATCAAC CTGGGTTTAT AA
|
Protein sequence | MKGFIKLISA VALFSIFIGL FSGCGRTNTG GTQDNGKTGT SSAEYKYGKI DIPGKDGSLC GAPIYIAYEK GFFKQEGFDV NLISADTETR KIGLNNGTIP IVNGDFQFFP SIENNVKVKV VDGLHYGCIK LIVPKDSPIQ GVQDLRGKKI SVDEIGGTPH QVASVWLEKN GISAKQEDGE VTFLPFSDGN LAVEALRKGE VDVAALWDPF GSVQEKTGNY RVILDISKDE PFAGKYCCFL YASEKLLDEK PEQVAALLRA YRAAQNWISE NPEEAVDIII NGKYAQIEDR ELAIKLIKSY QYPSYAEREK NKTQVRDNVY YFAEQLNQIG YLKTDPDTFT KNAYVEVDIN LGL
|
| |