Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_18910 |
Symbol | |
ID | 7312706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2021197 |
End bp | 2022192 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643612338 |
Product | KpsF/GutQ family protein |
Protein accession | YP_002509634 |
Protein GI | 220932726 |
COG category | [K] Transcription [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.413862 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATGAAC CGGTAAACCT CGATGAAAAG ATGATAATTG ACTGTCTTCA GGAAGCCCGT AAAGTCCTTG AGATAGAGGC CTATTCGGTT TTAAAACTCA AAGACAGTAT CGGTAGTGAA TTTGCTGATA TTGTCAGGGT TATTCTGGAG AGCAAGGGTC GGGTTATTTT TACCGGTATC GGAAAATCCG GCCTTATCGG ACAGAAACTG GCCGCTACCT TTTCCAGTAC CGGGACACCT GCTTTTTTTG TACATGCCGG TGAGGCCCTG CATGGTGACC TGGGAATGGT AACCGGAGAT GATATAATAA TTGCCATTTC CAACAGCGGG GAGACGGAAG AGGTTTTAAG TCTTGTGCCC TCCATCAGGA GGATCGGAGC CTTTTTGATA GCTGTTACCG GGAATAGGTC TTCTACTCTG GCCCGTTATG CCAACAATCA CTTATTAGTC AATATTGAGG AAGAGGCCTG TCCCCATGGC CTGGCCCCGA CAGCCAGTAC TACGGCTACT CTAGCCCTGG GTGATGCCCT GGCTATTGCT TTATCAAAGC TAAAGGGTTT TACCCCCGAG GATTTTGCCC TCTTTCACCC CGGTGGAAGC CTGGGAAGGA AGTTATTGAC AAAGGTAGAA GATGTCCTCC AGGTTAGAAA ACAAAACCCG GTTGTTCAGT CCGGGACAAG TGTCAAAGAA GCCCTCTTTA CCATGACTGC CAGTAAAATG GGTTCTACTT CAGTAGTGGA TGAAAGGGGG CGGCTGGTCG GGATAATCAC TGATGGAGAT ATCAGGCGCC TTTTAGAGGA GTCGACCGAC TTTCTCCAGA AACCGGTATT AGAGGTAATG ACAAAAGACC CTATTACCAT TGAAAAAGAC CGGCTGGCCG CTGAAGCCCT GAAAATTATG GAGGATAAGG AAGTTAATGA CCTGCCGGTA GTCGAAGATG GGAAGCCAGT GGGAATGCTT AACTTCCAGG ACCTGTTAAG GGCCCGGGTC TTTTAG
|
Protein sequence | MNEPVNLDEK MIIDCLQEAR KVLEIEAYSV LKLKDSIGSE FADIVRVILE SKGRVIFTGI GKSGLIGQKL AATFSSTGTP AFFVHAGEAL HGDLGMVTGD DIIIAISNSG ETEEVLSLVP SIRRIGAFLI AVTGNRSSTL ARYANNHLLV NIEEEACPHG LAPTASTTAT LALGDALAIA LSKLKGFTPE DFALFHPGGS LGRKLLTKVE DVLQVRKQNP VVQSGTSVKE ALFTMTASKM GSTSVVDERG RLVGIITDGD IRRLLEESTD FLQKPVLEVM TKDPITIEKD RLAAEALKIM EDKEVNDLPV VEDGKPVGML NFQDLLRARV F
|
| |