Gene Hore_20660 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_20660 
Symbol 
ID7314390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2234999 
End bp2236213 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content44% 
IMG OID643612510 
Productgalactokinase 
Protein accessionYP_002509806 
Protein GI220932898 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones69 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGCG TAGATGAATT ATGGGAGTTC TTTGAGAATA GATATGGTGA TAATGGCTTG 
AAAAAGGGGT CCTATTCAGC CCCCGGGCGG GTTAATTTAA TCGGAGAGCA TACTGACTAT
AATGATGGTT TTGTCCTGCC AATGGCCATT GAAAAAAATG TCACCATGCT GGGTCAGTTA
AGGCATGACC GGAAAATAAA AGTCTATTCT CTTGACTACG ATACTGAGTT GTGCTTTAAC
CTGGATAAAC TTGAAAAAGA TGAGGAACAC ACCTGGGTTA ATTATGTGAT GGGGGTTGCC
GATGAAATTG AGAAAAAGGG TCATAAACTA AAGGGGATGA ACCTGGTATT TACCGGTAAT
GTGCCCCAGG GTTCAGGGTT AAGCTCTTCA GCTGCCCTGG AAGTAGTTAC AGCCATGACC
ATGGCTGATT TAAATGAACT GGATATAGAC CCGGTGGAGA TGGCTTTGCT CTGCCAGGCT
GCAGAAAACA ATTTTGTCGG TGTGGCCTGT GGGATTATGG ATCAATATAT TTCCCGCCTT
GGCCACAGGG ACCATGCCCT CCTGATAGAT TGCCGAACCA ATGAATATGA ACTAATTCCC
TTTAAAGATA AAAGGTATCG GATAGTAATC TGTAACTCAA AAGCCCGGAG GGGGCTAGTT
GATTCTGAAT ATAATACCCG AAGGTCAGAG TGTAACCAGG CTGTTGCCTT TTTTAATGAG
AAGCTGGGCC GTAACATAAC TGCCCTGAGG GATGTAAAGC TCAATGAAGT AGGACAATAC
AGGGGAGAAC TCTCTGATTC AGTATATCGC AGGGCTCATC ATGTAGTCAG TGAGAATGAA
AGGGTTCTGG CCAGTGTCGA GGCCCTAAAA AATAATGATT TTGAGAAATT TGGACAGCTA
ATGATTGAAT CACATCAAAG CCTCAGGGAC GATTATGAGG TTAGCTGCCG GGAACTGGAT
TGCCTGGTAG ACGTGGCCCT TAAACAGGAA GGGGTTCTGG GAGCCAGGAT GACCGGGGCC
GGTTTTGGTG GTTGTACGGT TAATCTTGTT GACATCAATT ATGTTGAAGT TTTTATAAAA
GGGATTAAAG AAGGTTATAA ACGGGAAACC GGGATAGAGC CTGAAATCTA TGTAAGTCGA
CCGGCAGAAG GGGCAAGAAG ATTGGAGGTT GATAGAGATG GAGGAAACCC TGGTAAAGAA
GGAAAAAATA GATAA
 
Protein sequence
MKSVDELWEF FENRYGDNGL KKGSYSAPGR VNLIGEHTDY NDGFVLPMAI EKNVTMLGQL 
RHDRKIKVYS LDYDTELCFN LDKLEKDEEH TWVNYVMGVA DEIEKKGHKL KGMNLVFTGN
VPQGSGLSSS AALEVVTAMT MADLNELDID PVEMALLCQA AENNFVGVAC GIMDQYISRL
GHRDHALLID CRTNEYELIP FKDKRYRIVI CNSKARRGLV DSEYNTRRSE CNQAVAFFNE
KLGRNITALR DVKLNEVGQY RGELSDSVYR RAHHVVSENE RVLASVEALK NNDFEKFGQL
MIESHQSLRD DYEVSCRELD CLVDVALKQE GVLGARMTGA GFGGCTVNLV DINYVEVFIK
GIKEGYKRET GIEPEIYVSR PAEGARRLEV DRDGGNPGKE GKNR