Gene Hlac_1569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1569 
Symbol 
ID7401502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1587293 
End bp1588537 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content70% 
IMG OID643708636 
Productmetal dependent phosphohydrolase 
Protein accessionYP_002566226 
Protein GI222479989 
COG category[R] General function prediction only 
COG ID[COG1078] HD superfamily phosphohydrolases 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCACGG TCAAGGACAC CGTCCACGAC CACATCGAGA TCGACGGTGT CGCGGCGGAC 
CTCCTCGACA CCCCCGCAGT CCAGCGGCTC AGACACGTCA AACAGCTCGG CACGGTCCAG
CTCGTCTACC CCTCCGCGAA CCACACCCGC TTCGAGCACT CGCTCGGCGT CTACCACCTC
GCCAGCCGCG CGCTCGGCCA CCTCGGGATT GGGGGAAAGC GCGCAGACCG GATCGAAGCC
GCGGCCATGC TCCACGACGT GGGTCACGGC CCGTTCAGCC ACAATCTGGA GTCGCTCACC
CACCGCCGCA CGGGGAAGTA CCACGACGAC GTCGACGAGG TGCTCGCGAC CGGCGCGGTC
GGCGAGGTGC TCCGCGATCA CGACCTCGAC CCGGAGAAGA TCGCCGGGCT CGTCGCCGGC
GAGGGACCGT ACGGCGGGCT CGTCTCGGGC GAGCTCGACG TTGATCGCAT GGACTACCTC
GTGCGCGACG CCTACCACAC CGGGGTGCCG TACGGCACCA TCGACACCGA GCGGTTCGTC
CGGGAGCTGA CGTTCGTCGA CGTGGGCACC GGCACCAACG AACTCGTCTT GGACGAGGGG
AACGTCCAGA CGGCCGAGAG CCTCCTTCTG GCGCGCGCAC TGATGAACCC GGTCGTGTAC
ACCCACCACG TCGCGCGCAT TTCGAAGGCA ATGCTTCGGC GGGCGGCGAG CGACTTACTC
GACGCGACCA CGACGACCCC GGCCCAGCTT CGCCGGATGG ACGACCACGA CTTCCTCGCG
GCGATCCGAA GCTGCTCGGA GACCGCCGAG CTCTCCCGGC GGTACGACGA GCGCGACCTG
TACAAGCGGG CGGTGTGGGC CGAGTACGAC GACGTGGCCG AGCGTGTCCA TGAGGCCGAC
CACGACACTG AGAGTGCGCT GGAACGCGAG ATCGCCGAGG AGGCGGGCGT CGCCCGTCAG
CACGTGATCC TCGATGTCCC CCCGGAGCCG TCGATGCGGG AGTCGACAGC GCGGGTCACC
GTCAACGGCG AGGTGCGTCG GTTAGAGCGG CAGTCACCCC TCGTCTCCAC GCTCCGGACC
GCCCAGCGCA ACCAGTGGCG CCTCGGTGTC TACGCCCCTC ACCCCGCGAC CGATCGCGTC
GGCCGCGCCG CCGCCGACGT GCTCGGACTC GACCCCGACG GGCTCGTCGC GGAGGTGCGC
GGCGCGATGC CGACGACGCT CGACGAGTTC CGAGACGGGG CGTGA
 
Protein sequence
MITVKDTVHD HIEIDGVAAD LLDTPAVQRL RHVKQLGTVQ LVYPSANHTR FEHSLGVYHL 
ASRALGHLGI GGKRADRIEA AAMLHDVGHG PFSHNLESLT HRRTGKYHDD VDEVLATGAV
GEVLRDHDLD PEKIAGLVAG EGPYGGLVSG ELDVDRMDYL VRDAYHTGVP YGTIDTERFV
RELTFVDVGT GTNELVLDEG NVQTAESLLL ARALMNPVVY THHVARISKA MLRRAASDLL
DATTTTPAQL RRMDDHDFLA AIRSCSETAE LSRRYDERDL YKRAVWAEYD DVAERVHEAD
HDTESALERE IAEEAGVARQ HVILDVPPEP SMRESTARVT VNGEVRRLER QSPLVSTLRT
AQRNQWRLGV YAPHPATDRV GRAAADVLGL DPDGLVAEVR GAMPTTLDEF RDGA