Gene Hore_16050 details

Gene Information

Locus tag: Hore_16050
Symbol:
ID: 7312641
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1720418
End bp: 1721386
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 44%
IMG OID: 643612052
Product: glucokinase
Protein accession: YP_002509349
Protein GI: 220932441
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID: [TIGR00744] ROK family protein (putative glucokinase)
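
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the protein length excludes the stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 1720418
end_bp = 1721386
reported_gene_length = 969      # bp
reported_protein_length = 322   # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Number of codons in the CDS minus one for the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # prints: 969 322
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.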


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGAAT ACTATGTAGG TGTTGATTTA GGAGGTACCA AAATACTAAC TGCTCTGGCT 
GATGCCAGAG GAAAGATTGT TGCGAAAAAG AAGCTACCGA CAGAAGCCCG TAAAGGGGAA
GAAAAGGTAA TACAAAATAT TGTGTCATCA ATAGATGCTG TTCTTCAGGA GAAAGGACTA
TCAAGGGAAG ATGTCATTAC TCTGGGTGTC GGAAGCCCCG GGCCCTTAAA TACACAGGAG
GGTATTATAT ACCTGGCCCC TAATCTGGGA TGGAGGAATG TACATATTAA AGATATCCTT
GAGGAGGAAA CAGGTATTCC GGTAATCCTG GAAAATGACG CCAATGCAGC GGCCCTCGGA
GAAAAATGGT TTGGGGCCGG CCAGGATGTT GACAACTTAA TATATATTAC TGTCAGTACC
GGTATCGGAG GCGGAATTAT TATTAATAAG AAAATTTTCC ATGGTATCAA TGATGGAGCC
GGTGAGGTTG GACATATGGT TATAGAGCCA GGTGGACCTG TCTGTGGTTG TGGTAACAGG
GGTTGTTTTG AGGCCGTTGC TTCCGGGACT GCCATTAATA AAATGGGCCG GGAGGCTGTA
AAAGAAAATA AAGCTACCCT GTTAATGGAA TTATCAGGAG GAGATCCCGA GAAAATTGAC
GGAAGTTTAA TTGCCAGAGC TGCCAGGCAG GGAGATGAAG TAGCCAGGAA AATATGGGAT
AAGGCCGGTT ATTATCTGGG GATTGGACTT GCCAACCTTT TAAATATTTT TAACCCGGAA
ATGATAATTC TGGGTGGTGG TGTCATGAAT GCTGGTGATT TAATAATGGA ACCAATGAAA
AAAAGCTTAA AAGATCATGC TTTAGAATCA GCCTTTAATT CAGTTGAGAT ACGCCAGGCT
GAGCTGGGCA ATGATACTGG AGTAATCGGG GCAGTTGCAG TAGCCATGGG GGACAGGTTA
TTAGAATGA
 
Protein sequence
MKEYYVGVDL GGTKILTALA DARGKIVAKK KLPTEARKGE EKVIQNIVSS IDAVLQEKGL 
SREDVITLGV GSPGPLNTQE GIIYLAPNLG WRNVHIKDIL EEETGIPVIL ENDANAAALG
EKWFGAGQDV DNLIYITVST GIGGGIIINK KIFHGINDGA GEVGHMVIEP GGPVCGCGNR
GCFEAVASGT AINKMGREAV KENKATLLME LSGGDPEKID GSLIARAARQ GDEVARKIWD
KAGYYLGIGL ANLLNIFNPE MIILGGGVMN AGDLIMEPMK KSLKDHALES AFNSVEIRQA
ELGNDTGVIG AVAVAMGDRL LE
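
The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs only in which codons may act as start codons; this gene begins with ATG, so that does not matter here). The helper names `clean`, `translate`, and `gc_content` are hypothetical, and only the first and last rows of the gene sequence are used as input.

```python
# Codon table in the usual NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna):
    """Fraction of G and C bases in a DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence listed in this record.
first_row = "ATGAAAGAAT ACTATGTAGG TGTTGATTTA GGAGGTACCA AAATACTAAC TGCTCTGGCT"
last_row = "TTAGAATGA"

print(translate(clean(first_row)))  # prints: MKEYYVGVDLGGTKILTALA
print(translate(clean(last_row)))   # prints: LE
```

The two outputs match the first 20 and last 2 residues of the protein sequence. Passing the full cleaned gene sequence to `translate` and `gc_content` would allow a comparison against the complete protein sequence and the reported GC content.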