Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2278 |
Symbol | |
ID | 4809867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2709129 |
End bp | 2709992 |
Gene Length | 864 bp |
Protein Length | 287 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107684 |
Product | extracellular solute-binding protein |
Protein accession | YP_001038673 |
Protein GI | 125974763 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000861711 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA GTTTTATTTT TACCGTTATA ATGCTGATGC TTTTTAGTCT TATCCTCGCA GGTTGCAATT CGGACAGCAA AGTTTCAGGC GCCAAACGTG CCAAAGTTAT TGACATTCAG TTAACCCAGG AGGAATATGC TTTTGGAGTT GATAAAAACC AACCGGAACT GCTTGCAAAG GTAAATGAGT TTATTTCGCA AATCAAGTCC GATGGAACCC TTCAAAAAAT AAGCGACAAA TATTTCAGTG ACGGGGAACC TGAACCTGTT ATATCTGCAG TGGAAGACAG CAGCAAAGAC CAGCTTGTAG TTGCCACAAA TGCGGCGTTT GAGCCTTTTG AATATACTCT CGGAGGCAGC GAGTATTACG GTATTGATAT GGAAATTGCC GCACTTCTTG CAGAATATCT CGGAAAAGAG CTTGTAATCA AAAACATGGA CTTTGATGCC GTTTGCCTTT CCGTAGGCCA GGGAAAAGCC GACATTGCAA TGTCCGGTCT TACGGTAAAA GAGGACAGGA AAGAGTATGT TACATTCTCG GACACATATT ATAATGCTTC CCAAAAACTT ATTGTCAGAG AAGACGACAA TACATTTGAC AACTGTAAAA CCGCTGCTGA TGTGGAAGCA ATTCTTTCAA GCTTAAGCAG TGATACAAAA ATCGGTGTTC AGACAGGTAC AACCGGCCAA TTCTATGTTG AAGGCGATGA AGACTGGGGT TTTGAGGGCT ATAATGTTAA ATGCGTAGGT TACAAGTCAG GCTCTCTTGC CGTTCAGGAC ATGCTCAACG GCAACATTGA CTTTGTCATT ATTGATGAGG CTCCTGCCGA CAGCATAGTA AGAGCAATTA ATGAGCTTAA CTAA
|
Protein sequence | MKKSFIFTVI MLMLFSLILA GCNSDSKVSG AKRAKVIDIQ LTQEEYAFGV DKNQPELLAK VNEFISQIKS DGTLQKISDK YFSDGEPEPV ISAVEDSSKD QLVVATNAAF EPFEYTLGGS EYYGIDMEIA ALLAEYLGKE LVIKNMDFDA VCLSVGQGKA DIAMSGLTVK EDRKEYVTFS DTYYNASQKL IVREDDNTFD NCKTAADVEA ILSSLSSDTK IGVQTGTTGQ FYVEGDEDWG FEGYNVKCVG YKSGSLAVQD MLNGNIDFVI IDEAPADSIV RAINELN
|
| |