Gene Hlac_2552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2552 
Symbol 
ID7399777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2528677 
End bp2530098 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content69% 
IMG OID643709624 
Productphosphoesterase DHHA1 
Protein accessionYP_002567194 
Protein GI222480957 
COG category[L] Replication, recombination and repair 
COG ID[COG0608] Single-stranded DNA-specific exonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.131524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGTCG CCGATCCTCC CGTGCCGGAG CTCACTGCGC GCGCGAACGC CTGCGCCGGC 
CGGCTCCGCG CAGCGGACCG CGTGCTACTC GCGTCCCACA TCGACGCTGA TGGGCTTACC
AGCGGCGCAG TCGCCTCGAC GGCGCTCGCG CGGGCCGGGA TCGACCACGA GGTCGTCTTC
GAAAAGCAGC TCGACGCCGA ATCGATCGCG GGGATCGCCG CGCGTGAGTT CGACGTAGTG
CTCTTTACCG ACTTCGGCTC CGGACAGCTC GACGTGATCG TTGGCCACGA GGATCGCGGC
GACTTCGTCC CCGTGATCGC CGACCACCAT CAGCCAGCCG ACCGTGACAC GGATTTCCAC
CTCAACCCCC TCCTTGAGGG GCTAAACGGC GCGAGCGAGC TGTCGGGCGC GGGCGCGAGC
TACGTGCTTG CCCGCGCGTT GGAGGGCCCC GACGGCGACA ACCGTGACCT CGCCGCACTG
GCGGTGGTGG GCGCGGTCGG CGACATGCAG GACTCCGACG GCGGGCTCAT CGGCGCCAAC
GAGGCGATCG TCGCCGACGG CGTTGACGCC GGCGTCATCG AGACACGGAC GGACTTGGAC
CTGTACGGCA GGCAGACGCG CCCGCTCCCG AAACTTCTTG AGTACGCCTC TGACGTGAAG
ATCCCGGGTG TCTCGAACGA CGAGGCGGGT GCGATCTCCT TCCTCACCGA CCTCGACATC
GAGGTGAAAC GCGACGGAGA GTGGCGGCGT TGGGTCGACC TCGATGCAGG GGAGCGACAG
ATTCTCGCGT CCGGACTGAT GCGCCGCGCC GTCGCCTCCG GCGTCCCCGC CGACCGGATC
GAGGCGCTCG TCGGTACCTC CTACACCCTC GTCGACGAGG AGCCCGGCAC AGAGTTGCGG
GACGTAAGCG AGTTTTCCAC GCTCCTCAAC GCCACCGCCC GGTATGAACG GGCCGACGTG
GGACTCGCGG TGTGTCTCGG CGACCGGGGT GACGCACTCT CCGAGGCGCG CCGGCTCCTC
CGGAACCACA GAAAGAACCT CTCTGAGGGG CTTCAGTGGG TGAAAAGCGA GGGCGTCACT
GAGGAGCAGC ACCTCCAGTG GTTCGACGCT GGCTCGCGGA TCCGCGAGAC GATCGTCGGG
ATCGTCGCGG GGATGGCGAT CGGCTCGCCG GCGCTCGATC GCTCGAAGCC CGTAATCGCG
TTCGCCGAGG AGAGCGCCGA GGAGCTGAAG GTGTCCTCGC GGGGATCACA CTCCCTGGTT
CGGCAGGGAC TCGACCTCTC CGCAGTGATG CGAGAGGCGA GTCAGGCCGT CGGCGGCGAC
GGCGGCGGTC ACGACGTGGC CGCGGGCGCG ACGATCCCGA TCGGCGAGCG CGATGCGTTC
GTCGCCGAGG CGGACCGGCT GATCGGCGAA CAGCTCTCGT AG
 
Protein sequence
MAVADPPVPE LTARANACAG RLRAADRVLL ASHIDADGLT SGAVASTALA RAGIDHEVVF 
EKQLDAESIA GIAAREFDVV LFTDFGSGQL DVIVGHEDRG DFVPVIADHH QPADRDTDFH
LNPLLEGLNG ASELSGAGAS YVLARALEGP DGDNRDLAAL AVVGAVGDMQ DSDGGLIGAN
EAIVADGVDA GVIETRTDLD LYGRQTRPLP KLLEYASDVK IPGVSNDEAG AISFLTDLDI
EVKRDGEWRR WVDLDAGERQ ILASGLMRRA VASGVPADRI EALVGTSYTL VDEEPGTELR
DVSEFSTLLN ATARYERADV GLAVCLGDRG DALSEARRLL RNHRKNLSEG LQWVKSEGVT
EEQHLQWFDA GSRIRETIVG IVAGMAIGSP ALDRSKPVIA FAEESAEELK VSSRGSHSLV
RQGLDLSAVM REASQAVGGD GGGHDVAAGA TIPIGERDAF VAEADRLIGE QLS