Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1304 |
Symbol | |
ID | 4809556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1583135 |
End bp | 1584508 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106727 |
Product | PhoH-like protein |
Protein accession | YP_001037729 |
Protein GI | 125973819 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG1875] Predicted ATPase related to phosphate starvation-inducible protein PhoH |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAAAA CCTTCGTTCT TGACACAAAT GTATTGCTTC AAACTCCGTT TGCATTGTAT TCCTTCGGAG ACAATGATGT TGTGATACCT GAAGTTGTGC TGGAGGAACT GGATAAATTT AAGAAGGACA ATACGGAATT GGGTGCAAAT GCACGGCATG CGGCAAGAAT TTTGGACGAA TTGAGGACAA AGGGGAATTT AAAAGAAGGT GTAAAGCTTG AAAACGGAGG GATGTTGAGG GTTGAATCTA ACTATAGCAG GGTAAAGCTT CCGGAAAGCT GGGATGATTC AAACAATGAC AATATAATAC TTAAAATTTG CAAGGGCCTG CTCGAAAAGG GCGAAAATGT TTTCCTGGTT ACAAAGGACA TATTTGAAAG AATAAAGGCA AATATTCTGG ATATTGAAGC CCAGGACTTT TTTGCGGAGC AGGTCCCTGT ATTTGAAGAA CAGTATAAGG GAAGAATTGA GGTATATACA ACAGAAGAGA AACTAAATGA GTTTTACAGT AAAGGCTTTT TGGACGAGGA AGACGTTTTT TTGTTCAACC CCGATTCAAC CAGGAAGAAG CCTGCCTTGA TTGTGAATCA GTTTCTTTTG ATACAGTCGT ATATAAATAA AAAACATACT GCTTTAGGAA GGTTTGACGG TCAAAAAATT GTACCTCTCA AGTTTCTCAA TGTTCGTCCT TTTGGAGTCA CTCCCAGAAA TACAGGACAG AAATTTATGC AGGAAGCTTT GATGATGGAT GCTGACAGGG CGCCTCTTGT GATAATTAAA GGGCCGGCGG GAACGGCAAA AACCTTTTAC TCTCTGGCAG TGGGTCTTCA TAAATTATTG GATGATCCGA ACAGACTGTA CAGGAAGATC CTTGTGTGCA GGCCAAATGT AAAACTGGAT GAGGACATTG GATTTCTTCC GGGAACAGAA CAGGAAAAAA TCGCTCCGTT TTTGAGACCT GTTATAGACA ATTTGGAAAT ACTGATTGAC AACGATGACA ATGAGAGATA TTCCAGCGAA AAAGAACTTA AGGACAAAAT TGACGAATTG TTTGACAGGA AAATTATTAA TACCGAGGCC ATTGCTTTCA TAAGAGGAAG ATCCATAACT AAGCAGTGGG TAATAATAGA TGAAGCTCAA AATCTTACCC CAAAACAAGT AAAAGGTATT ATAACAAGAG CCGGCAAGGG AACCAAAATA ATACTTATTG GGGACCCGGA ACAAATAGAT CATCCGTTTC TTGATATCAG GACAAACGGA CTGTGTTATG CATCTGAAAG GATGAAAGGA AGCAGTCTGT GTTTCCAGGT TACTTTGTAT GACGAGGAAT GTGAACGTTC CGAACTTGCA TATGAAGGCG CGAAAAGAAT GTGA
|
Protein sequence | MKKTFVLDTN VLLQTPFALY SFGDNDVVIP EVVLEELDKF KKDNTELGAN ARHAARILDE LRTKGNLKEG VKLENGGMLR VESNYSRVKL PESWDDSNND NIILKICKGL LEKGENVFLV TKDIFERIKA NILDIEAQDF FAEQVPVFEE QYKGRIEVYT TEEKLNEFYS KGFLDEEDVF LFNPDSTRKK PALIVNQFLL IQSYINKKHT ALGRFDGQKI VPLKFLNVRP FGVTPRNTGQ KFMQEALMMD ADRAPLVIIK GPAGTAKTFY SLAVGLHKLL DDPNRLYRKI LVCRPNVKLD EDIGFLPGTE QEKIAPFLRP VIDNLEILID NDDNERYSSE KELKDKIDEL FDRKIINTEA IAFIRGRSIT KQWVIIDEAQ NLTPKQVKGI ITRAGKGTKI ILIGDPEQID HPFLDIRTNG LCYASERMKG SSLCFQVTLY DEECERSELA YEGAKRM
|
| |