Gene Hlac_1458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1458 
Symbol 
ID7400285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1465442 
End bp1466443 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content68% 
IMG OID643708519 
Productdihydroxyacetone kinase subunit DhaK 
Protein accessionYP_002566116 
Protein GI222479879 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2376] Dihydroxyacetone kinase 
TIGRFAM ID[TIGR02363] dihydroxyacetone kinase, DhaK subunit 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGC TGATCAACGA TCCGGACGAC GTCGTCGACG AGATGCTCGA CGGGATGACC 
GCGGCGTACC CGGACCGGCT CCGACGGCTC CCCGACACGC AGGTCGTCGT TCGGAACGAC
GCACCGGTAG CGGGGAAGGT CGCGCTCGTG ACGGGCGGCG GGAGCGGGCA CGAGCCGACC
CACGCGGGCT ACATCGGCGA CGGCATGCTC GACGGGGCGG CCGCGGGCGA CGTGTTCTCC
TCGCCGACCG CCGACGAGTT CGAGGAACTG ATCGAAGCCT GCGACGCGGG CGACGGCATC
CTCGCGATCA TCAAGAACTA CGAGGGCGAC GTGATGAACT TCGAGACCGC TATCGAACTC
GCCGAGATGG AGGGTGTCGA GGTCGAGAGC GTCGTGGTCG ACGACGACGT GGCCGTCGAG
GACTCGCTGT ACACCTCCGG CCGGCGCGGC GTCTGCGGGA CGATCCTCGT CCACAAGGCC
GCCGGCGCGA AGGCCGCACA GGGCGCCGAT CTCTCGGAGG TCAAGCGCGT CGCGGAGAAG
GTAGTCGACA ACGTCGGGAC CATGGGCACC GCGCTCACCT CGTGTGTCAC CCCCGAGAAA
GGCGAGCCTA CCTTCGATCT GGGCGACGAC GAGATCGAAC TGGGGATCGG GATCCACGGC
GAGCCCGGCA CCGAGCGCAC GGAGATGATG AGCGCCGACG AGATCACCGA CGCGCTGACC
GAGGCCATCC TCGACGACCT CGATCTCGGC GCCGGACAGG AGGTGCTCAC GGTCGTCAAC
GGGATGGGCG GGACCCCGCA GATGGAGCTG TTCGTCGTCA ATCGTCGGCT CCAGGAGCTG
CTGGGTGAGC GCGAGGTGGA GACCTGGGAT TCGTGGGTCG GCGACTACAT GACCTCGCTC
GATATGGCGG GCGCGTCGAT CACCGTCTGC GCCGTCGACG ACGAGCTGAA GGAGCTGCTC
GGCGCGCCGG CCGACACCCC CGCGCTCTCC CGGATCCAAT GA
 
Protein sequence
MKKLINDPDD VVDEMLDGMT AAYPDRLRRL PDTQVVVRND APVAGKVALV TGGGSGHEPT 
HAGYIGDGML DGAAAGDVFS SPTADEFEEL IEACDAGDGI LAIIKNYEGD VMNFETAIEL
AEMEGVEVES VVVDDDVAVE DSLYTSGRRG VCGTILVHKA AGAKAAQGAD LSEVKRVAEK
VVDNVGTMGT ALTSCVTPEK GEPTFDLGDD EIELGIGIHG EPGTERTEMM SADEITDALT
EAILDDLDLG AGQEVLTVVN GMGGTPQMEL FVVNRRLQEL LGEREVETWD SWVGDYMTSL
DMAGASITVC AVDDELKELL GAPADTPALS RIQ