Gene Hore_18910 details

Gene Information

Locus tag: Hore_18910
Symbol:
ID: 7312706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2021197
End bp: 2022192
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 48%
IMG OID: 643612338
Product: KpsF/GutQ family protein
Protein accession: YP_002509634
Protein GI: 220932726
COG category: [K] Transcription
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0794] Predicted sugar phosphate isomerase involved in capsule formation
        [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains
TIGRFAM ID: [TIGR00393] KpsF/GutQ family protein
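
The coordinate and length fields in this record are consistent with each other. The short Python sketch below reproduces the arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 2021197
end_bp = 2022192

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue, minus the terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 996 331
```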


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value: 0.413862
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAATGAAC CGGTAAACCT CGATGAAAAG ATGATAATTG ACTGTCTTCA GGAAGCCCGT 
AAAGTCCTTG AGATAGAGGC CTATTCGGTT TTAAAACTCA AAGACAGTAT CGGTAGTGAA
TTTGCTGATA TTGTCAGGGT TATTCTGGAG AGCAAGGGTC GGGTTATTTT TACCGGTATC
GGAAAATCCG GCCTTATCGG ACAGAAACTG GCCGCTACCT TTTCCAGTAC CGGGACACCT
GCTTTTTTTG TACATGCCGG TGAGGCCCTG CATGGTGACC TGGGAATGGT AACCGGAGAT
GATATAATAA TTGCCATTTC CAACAGCGGG GAGACGGAAG AGGTTTTAAG TCTTGTGCCC
TCCATCAGGA GGATCGGAGC CTTTTTGATA GCTGTTACCG GGAATAGGTC TTCTACTCTG
GCCCGTTATG CCAACAATCA CTTATTAGTC AATATTGAGG AAGAGGCCTG TCCCCATGGC
CTGGCCCCGA CAGCCAGTAC TACGGCTACT CTAGCCCTGG GTGATGCCCT GGCTATTGCT
TTATCAAAGC TAAAGGGTTT TACCCCCGAG GATTTTGCCC TCTTTCACCC CGGTGGAAGC
CTGGGAAGGA AGTTATTGAC AAAGGTAGAA GATGTCCTCC AGGTTAGAAA ACAAAACCCG
GTTGTTCAGT CCGGGACAAG TGTCAAAGAA GCCCTCTTTA CCATGACTGC CAGTAAAATG
GGTTCTACTT CAGTAGTGGA TGAAAGGGGG CGGCTGGTCG GGATAATCAC TGATGGAGAT
ATCAGGCGCC TTTTAGAGGA GTCGACCGAC TTTCTCCAGA AACCGGTATT AGAGGTAATG
ACAAAAGACC CTATTACCAT TGAAAAAGAC CGGCTGGCCG CTGAAGCCCT GAAAATTATG
GAGGATAAGG AAGTTAATGA CCTGCCGGTA GTCGAAGATG GGAAGCCAGT GGGAATGCTT
AACTTCCAGG ACCTGTTAAG GGCCCGGGTC TTTTAG
 
Protein sequence
MNEPVNLDEK MIIDCLQEAR KVLEIEAYSV LKLKDSIGSE FADIVRVILE SKGRVIFTGI 
GKSGLIGQKL AATFSSTGTP AFFVHAGEAL HGDLGMVTGD DIIIAISNSG ETEEVLSLVP
SIRRIGAFLI AVTGNRSSTL ARYANNHLLV NIEEEACPHG LAPTASTTAT LALGDALAIA
LSKLKGFTPE DFALFHPGGS LGRKLLTKVE DVLQVRKQNP VVQSGTSVKE ALFTMTASKM
GSTSVVDERG RLVGIITDGD IRRLLEESTD FLQKPVLEVM TKDPITIEKD RLAAEALKIM
EDKEVNDLPV VEDGKPVGML NFQDLLRARV F
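
The gene sequence begins with GTG rather than ATG, yet the protein begins with M. That is expected under translation table 11, where certain alternative start codons are read as methionine at the initiation position. The sketch below is a minimal illustration in plain Python, not the tool used to produce this record. It translates only the first and last lines of the gene sequence as listed, and it treats only ATG, GTG and TTG as start codons, which is a simplifying assumption; the function and variable names are hypothetical.

```python
# Amino-acid assignments in TCAG codon order. Translation table 11 shares
# these assignments with the standard code; it differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Assumption: a simplified subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        # An alternative start codon in the first position is read as M.
        if i == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_block = "GTGAATGAAC CGGTAAACCT CGATGAAAAG ATGATAATTG ACTGTCTTCA GGAAGCCCGT"
last_block = "AACTTCCAGG ACCTGTTAAG GGCCCGGGTC TTTTAG"

print(translate(first_block))  # MNEPVNLDEKMIIDCLQEAR
print(translate(last_block))   # NFQDLLRARVF
```

Both outputs match the corresponding start and end of the protein sequence listed above. Note that translate() applies the start-codon rule to the first codon of whatever string it is given, so a fragment from the middle of the gene that happens to begin with GTG or TTG would be translated incorrectly.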