Gene Hore_10100 details

Gene Information

Locus tag: Hore_10100
Symbol:
ID: 7314598
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1099304
End bp: 1100953
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 39%
IMG OID: 643611449
Product: Dak phosphatase
Protein accession: YP_002508761
Protein GI: 220931853
COG category: [R] General function prediction only
COG ID: [COG1461] Predicted kinase related to dihydroxyacetone kinase
TIGRFAM ID: [TIGR03599] DAK2 domain fusion protein YloV
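
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the stop codon is counted in the gene length but not in the protein length. The function names are hypothetical and not part of any database API.

```python
# Hypothetical helpers for sanity-checking the record's length fields.
# Assumption: coordinates are 1-based and inclusive on the replicon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of the feature in base pairs for inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of amino acids, assuming one terminal stop codon
    that is part of the CDS but is not translated."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1099304, 1100953)
print(length)                     # 1650
print(protein_length_aa(length))  # 549
```

Both results agree with the Gene Length and Protein Length fields of this record.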


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000509589
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGGTTA ATGAGATTTC CCACGGTTCA TTAAGTAATG TAGACGGAAA AAAATTTAGA 
GAAATGTTGT ACACAGCACT GGCCTGGTTA AAAGAACAGC AATCCTTTAT AGATTCCCTC
AATGTATTCC CGGTGCCTGA TGGTGATACC GGTACCAATA TGTATCTAAC TTTTCTGGAG
GCAATTAAAG AAGTAAAAAA GATAGAGACA AATAATGTTT CTGAAATTAC TTCTGCTATG
GCCAAAGGAG CCTTAATGGG AGCCCGGGGC AATTCAGGTG TAATTTTATC TCAGCTATTG
AGGGGGTTTT CCCAGGCCAA TGAGGCCAAT AGCAGCTTGA CAGCAACCCA TCTGGTGAAA
GCCCTGAGGA AAGCCTCAGA TGTTGCCTAT CAGGGTGTTT TAAAACCTGT TGAAGGTACT
ATTTTGACTG TTTCCAGAAA GGCAGCTGAA GGTGCTGAAT TAGCCCTTGA AAATAATCTC
GATATTAACG GTATTATGGA AAACACGGTT GCTGCTGCCC GTGATGCTTT AAATAAAACC
CCTGAACAGT TGCCGATTCT AAAAGAGGCC GGGGTAGTAG ACGCCGGAGG ACAGGGTTAT
TTAATTATCC TAGAAGGACT TCTTAAAGGG TTAAACTCAG AGTATGTCCC CCAGGGAGAC
CTGGAGGTTG TAAAGCCTTC AAAAAAAGAA CAACAAATTG CTGAAGACAT AAAATATGCC
TATTGTACCC AGGCTTTAAT TAACCTTCAA AAAGATACTA CAAAATCAAT AGAAGAAATA
AGGAATGACT TACAACATTA CGGTGATTCC CTGATGGTGG TTGGTTCAGA TAGAACCGTA
AAAATCCATA TTCATACCAA CCATCCGGGA ATTATTTTAG AGTATGGTTT AAAACTGGGC
TCCCTCATTG ATATTAATAT AGATAATATG AAAATCCAGA GTGCCGAGAA AGTTCGCCAG
ACTGAAGAAC AGCAAAGGCA GGAATTCATG CCAGCTAAAA AGAAGGGCAT TATTGCTGTT
GGTAAGGGTG ATGGTATTAA GGAAATATTT AAAGATCTGG GAATAGATGT TGTTATTGAC
GGTGGCCAGT CTATGAATCC CAGCACCAAT GATTTTCTGG AAGCAATTAA TAACTTGAAT
TCTTCAGAGA TAATAATCCT ACCCAACAAC AAAAATATAA TTTCAGCCGC AGAACAGGCA
GCTTCTTTAA GTGATAAAGA TGTGGTTGTG ATTCCGACGA AAACCATACC TCAGGCTGTT
AGCAGTATGA TGGTTTTTAA TGATGAGGCT GATTTAGATG AGTTAAAAGA AGCTATGGAA
ATGGAAACAG AAAATGTTAC CACCCTCGAA ATTACCAGAG CTGTAAAATC ATCAAAAGTA
AATGGTCATA ATATTTCAAC CGGTGATGTA ATAGGTCTTG AAAATGGTCA AATAGAAGCA
GTTGGTAAGG AATATCAGGA CGTGATTGTT GAACTTCTTA AAAAAGTATG TAGTGGTGAT
GAATTTATTA CAATTTTCTA TGGAGAGGAA ATTAACGAGG ATGAAGCCGG TGACCTGGTT
GATACCCTGG AAGAAGAATT TAATTTTGAA GATATAGAGC TATACAGAGG AGGGCAACCA
CTGTATCCGT ATATCATATC AGTAGAATAG
 
Protein sequence
MPVNEISHGS LSNVDGKKFR EMLYTALAWL KEQQSFIDSL NVFPVPDGDT GTNMYLTFLE 
AIKEVKKIET NNVSEITSAM AKGALMGARG NSGVILSQLL RGFSQANEAN SSLTATHLVK
ALRKASDVAY QGVLKPVEGT ILTVSRKAAE GAELALENNL DINGIMENTV AAARDALNKT
PEQLPILKEA GVVDAGGQGY LIILEGLLKG LNSEYVPQGD LEVVKPSKKE QQIAEDIKYA
YCTQALINLQ KDTTKSIEEI RNDLQHYGDS LMVVGSDRTV KIHIHTNHPG IILEYGLKLG
SLIDINIDNM KIQSAEKVRQ TEEQQRQEFM PAKKKGIIAV GKGDGIKEIF KDLGIDVVID
GGQSMNPSTN DFLEAINNLN SSEIIILPNN KNIISAAEQA ASLSDKDVVV IPTKTIPQAV
SSMMVFNDEA DLDELKEAME METENVTTLE ITRAVKSSKV NGHNISTGDV IGLENGQIEA
VGKEYQDVIV ELLKKVCSGD EFITIFYGEE INEDEAGDLV DTLEEEFNFE DIELYRGGQP
LYPYIISVE