Gene Hlac_0425 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0425 
Symbol 
ID7401043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp442997 
End bp444424 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content69% 
IMG OID643707490 
Productmetal dependent phosphohydrolase 
Protein accessionYP_002565098 
Protein GI222478861 
COG category[R] General function prediction only 
COG ID[COG1078] HD superfamily phosphohydrolases 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.410662 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.214403 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACCA CCCAGATCAA AGATCCCGTC CACGGCTACG TCGAGCTCCC GGACGCGCTC 
GTCGAGGGCG TCGTCGACAC CCGGCCGTTC CAGCGGCTGC GGTACGTCCG CCAGCTCTCG
GCGACGCACC TCGTGTACCC GGGCGCGAAT CACACCCGGT TCGAGCACTC GCTGGGCGTC
TACCACCTCG GCCGAACCGT CTTCGAGAAC CTCCGACAGC AGTCGTACTT CGCACGGGAG
GCGACCGTCG ACGAACTGGA AGAGATCCAG CGCACCTTAG AGTGCGCCTG CCTGCTCCAC
GACGTGGGCC ATCCGCCCTT CTCGCACCTC TCCGAGGGGT TCCTCGACGA GGGGGTACTC
CGGGAGCGCG TCGCGGAGAC GGGCTTAGTC GACGCCTTCG ACGCGGCCGG CGTCGGCGGC
GCCCCGCTCC GCTCGGCGAA CCCGCACGAG CTACTCGGCT GCGTGATTAT CGTCGAGGAG
TACGGCGACG CGCTCCGGGC GTTCGATGTC GACCCCTTCG AGGTGTGCGC GTACGTGCTC
GGCTACAGCC TCGCGTACGA GCGCGGCGAA CCGTGGCAGT ACGGGGTCGG CGCCCAGCTG
CTCCACTCAC CCATCGACGT GGACCGGCTC GACTACATCA CTCGGGACAA CTACATGACC
GGTGCCGGCG TGTTGAGCTT CGACGTCGAC CGTATGGTCG ACGCCTACAC CGCTCACCCC
GAGGAGGGCC TGGCGCTCAC CGAGAAGGCG CTCTCGACCA TCGGCAACTA CCTCGAAGGG
CGGATCGCGC TGTACATGTG GGTCACCCAG CACCACAAGT CGGTGTACGC GAACCGGCTC
CTCCAGGCGA TGCTCGGCGA ATACGCCGCC GAGACCGGCG AGAGCCCGGT TACGGTCAAC
GGCGTGCTCT CCCGAGAGCT CGACGACAAT GCGGTGCTTG AGCGCCTCCG GATCGCCGCC
CGCGATCGCC CCGATTCGAC GCTGGCGTCG ATGTACGATC GCTTCCGGGG GCGGCGCTTC
CCGGCCACCT GCTGGAAACA CCGGATCGCG CTCGCCGACC GGGTGGGCCG AGACCTCGAC
GGCGACCTCG GCGGGGACGG CGGCGAAGCC CTCGACGAGT TCACGGCGTG GCTCACCGAG
GGCGACGATC GGCTGGAACG ACTCCTCGCC GACGCCCTCG ACGTGCCGGT CCACGAGGTG
TGGATCGACC GGTCGTACGT GCCGGCCTAC GACCCCGACG AACTGGAGGA CATCCCCATC
GCGTACGGCG GGACGACGCG GTCCGTCGGC GATTGGGGGC TGTACGGCGA CCGCGCGTTC
GACGTGCCGA TCCCCTTCGT GTTCGTCCCC GACGGGACGA AGCGGCGGGC GATCCGCGTG
CTCACGGAGG CGTTCGAGCG GGAGGTCGGG GAGACGAAGC AAGCTTGA
 
Protein sequence
MPTTQIKDPV HGYVELPDAL VEGVVDTRPF QRLRYVRQLS ATHLVYPGAN HTRFEHSLGV 
YHLGRTVFEN LRQQSYFARE ATVDELEEIQ RTLECACLLH DVGHPPFSHL SEGFLDEGVL
RERVAETGLV DAFDAAGVGG APLRSANPHE LLGCVIIVEE YGDALRAFDV DPFEVCAYVL
GYSLAYERGE PWQYGVGAQL LHSPIDVDRL DYITRDNYMT GAGVLSFDVD RMVDAYTAHP
EEGLALTEKA LSTIGNYLEG RIALYMWVTQ HHKSVYANRL LQAMLGEYAA ETGESPVTVN
GVLSRELDDN AVLERLRIAA RDRPDSTLAS MYDRFRGRRF PATCWKHRIA LADRVGRDLD
GDLGGDGGEA LDEFTAWLTE GDDRLERLLA DALDVPVHEV WIDRSYVPAY DPDELEDIPI
AYGGTTRSVG DWGLYGDRAF DVPIPFVFVP DGTKRRAIRV LTEAFEREVG ETKQA