Gene Hlac_0335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0335 
Symbol 
ID7399725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp358436 
End bp359299 
Gene Length864 bp 
Protein Length287 aa 
Translation table11 
GC content69% 
IMG OID643707397 
ProductHAD superfamily (subfamily IA) hydrolase, TIGR01548 
Protein accessionYP_002565009 
Protein GI222478772 
COG category[R] General function prediction only 
COG ID[COG0546] Predicted phosphatases 
TIGRFAM ID[TIGR01548] haloacid dehalogenase superfamily, subfamily IA hydrolase, TIGR01548
[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0232711 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGTCG ACGCAGTCGT CCTCGACATC GACGGGGTGT TGGTCGACGT AGCGGACTCC 
TACCGGCGGG CGATCATCGA GTCCGTCGAA CGGGTGTGTG ACAAGACGAT CGATCGCGAC
GCGATTCAGT TGTTCAAGGA CGCCGGTGGG TTCAACAACG ACTGGGAGCT GACTGACGCG
GCCGCCCTCT ACGTGCTGGC CCGCCGGGAG GGGCTCGGGA TGGATGTCGA GACCTTCACC
GGTCGGATCG CCGAGGGCGG GGGCGGGCTT GACGCTGCGA AGGAGGTCGT CGGCGACCTC
CCCCGAGTCG CGCAGGCGCG GGTCCGCGAC CAGTGGGATA CCGAGCGACT CCGCGAGACG
TTTCAGGCGC TGTACCTCGG CGAGGAGCTG TACCGCGAGC TGGAGGGCGG CGCCCCCCCG
CTGTCGGCGC CGGGGTACAT CCACGACGAG CCGACCTTGG TGGAGCCGGA GACGATCGCT
GACCTGACCG ACCGGTTCGA CGTTGGCGTG TTGACGGGGC GGCCCGCCGC CGAGGCGGAG
ATCGCCTTGG AGCGCGTCGG CCTCGACGTG CCCGAAGATC GGCGGTTCAC GATGGACGAC
TGGGAGGAGG GGAAGCCGCA TCCGCGGGCG CTGGTGACTC TCGCGGAGCG GTTCGACGCC
GACCGGATCG CCTTCATCGG GGACACCCTC GACGACGTGC GAACCGCGCG CAACGCCGAC
GATGAAGACG CGAGCCGCGT CTACTACGGG ATCGGCGTCC TCACCGGCGG CCTGACCGGT
GACGAGGGGC GCCGGAAGTT CGCCGAAAAC GGAGCCGACG CGGTCGTCGA GGACGTGAAC
GAACTCGTCG AGCTGTTGGA GTAA
 
Protein sequence
MQVDAVVLDI DGVLVDVADS YRRAIIESVE RVCDKTIDRD AIQLFKDAGG FNNDWELTDA 
AALYVLARRE GLGMDVETFT GRIAEGGGGL DAAKEVVGDL PRVAQARVRD QWDTERLRET
FQALYLGEEL YRELEGGAPP LSAPGYIHDE PTLVEPETIA DLTDRFDVGV LTGRPAAEAE
IALERVGLDV PEDRRFTMDD WEEGKPHPRA LVTLAERFDA DRIAFIGDTL DDVRTARNAD
DEDASRVYYG IGVLTGGLTG DEGRRKFAEN GADAVVEDVN ELVELLE