Gene Hlac_2084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2084 
Symbol 
ID7400604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2070733 
End bp2073171 
Gene Length2439 bp 
Protein Length812 aa 
Translation table11 
GC content69% 
IMG OID643709155 
Productheavy metal translocating P-type ATPase 
Protein accessionYP_002566732 
Protein GI222480495 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2217] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
[TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase
[TIGR01525] heavy metal translocating P-type ATPase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.155382 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGACG GCTCGAACGG CAGCGAAGAG ACGGCCGGCA GCGAGACGAC CGGCGGGGCG 
GCCGGCGCAG ACGGGAGGCG ACGAGAGCTG ACCGTCAGCC TCGCGGTCCC CGAGATGGAC
TGCCCCTCTT GTGCCGGCAA GGTCGACAAC GCGATTGGGC GTCTCGACGG CGTGACCAAC
GCCGCGCTCA ATCCGACCGC CGGGACGGCG ACGGTGACCT ACGACCCCGA TGTCATCGAT
GAAGACGACG TGGTCGCAGC CATCGAGGGC GCCGGCTACG AGGTCACGGG CGGGCGGTCG
GGCAGCGACG AGAGCGGCGA CGACACCGAG GGAGGCACCG ACGCCGACAC GGAATCCGGC
GGGGTCGCCG TCGCGCCCCC CGCCGAGGTC TGGACGACGG CGCGCGCGAA GAAGACGTGG
CTTGGCGCGG GACTCGTAAC GCTCGGACTT CTGTTCGAGT TCATTCTGAC CGCACAGAAC
CCCGAAGTAG CGAGTGTGCT CGGCGTCCCG TTCACGCTCG CCGACGCTCT GTTCCTCGGC
GCGGTCGCGG CGAGCGGGCT CCCCGTCATC CGCGGGGGGT ACTACTCGGC GCGGAACCGG
AGCCTCGACA TCGACCTGCT GATGGGAACG GCGATCATCG CGGCGACGGG GATCGGCTAC
TTCGTCGAGG CCGCCACGCT GGCGGTGCTT TTCAGCATCG CCGAGCTGTT GGAAGATTAC
GCGATGGACC GGGCTCGCGA CTCCCTGCGC GAGCTAATGG AGCTGTCGCC GGACGAGGCC
ACCGTGAAAC GCGAGAATGC GGAGACGACG ATCCCGGCCG ATGACGTAGC GGTCGGCGAG
ACCGTGATCG TCCGTCCGGG CGAGAAGGTC CCGCTCGACG GAACGGTTAT CGAGGGTGAG
AGCGCGGTCG ACGAGTCGCC GATAACCGGC GAGAGCGTGC CCGTCGACAA GGTTTCCGGA
GACGAAGTGT TCGCGGGCGC GATCAACGAG GAGGGGTATT TGGAGATCGA AGTAACCTCG
ACTGCAGGCG ACTCGACGCT CGCGCGCGTC ATCGAGATGG TGCAGGGTGC ACAGGCCAAG
AAGACCGATA CCGAGCGGTT CGTCGACCGA TTCGCTGGGT ACTACACGCC GGTTGTGGTC
GTGCTAGCGG TCCTGACGGC CGCGATCCCG CCGCTGGTCA TCGCGGAGCC GATCGCCGTC
GAGGTCGCCG GCTACGGGAT CACGTTCGCC GCGGACTGGG GGACGTGGTT CGTCCGCGGG
CTCACGCTGC TGGTGATCGC GTGTCCCTGC GCGTTCGTCA TCTCGACGCC CGTCTCGGTC
GTCTCCGGGA TCACGAGCGC GGCGAAAAAC GGCGTCCTGA TCAAGGGCGG GAACCACCTC
GAAGCGATGG GTGAGGTCGA CGCCGTCGCG CTCGACAAGA CCGGCACGCT CACCAAGGGC
AAACTCACGG TCACCGACCT CGTACCGCTC GGCGACGCCG ACGAGGCGAC GCTGCTACGG
CGGGCCGCGG CGCTGGAGCG GCGGAGCGAG CACCCGATCG CGAGTGCGAT CCTCGACCGC
GCCGACCGGA CAGGCGTGAC CGACCACCCC GAGCCGGCGG CGTTCGAGAG CCTGACGGGG
AAGGGGATCC GCGCCGAGAT AGACGGCGAG ACGTACTACG CCGGGAAGCC CGCGCTGTTC
GAGGACCTCG GGTTCGACCT GTCGCGGGCA CGGGCGGAGA CCGACGGCGG GGTCGTCACC
GAGGGCGACG ACGCCGACCC GCCGGCGGGC GACATCGGTC CCAGAGAGTT CGCCGAGGGG
ACCCTCGCCG CGCTGGAACG GGAGGGGAAA ACGGTGGTGC TCGTCGGGAC GGCGACACAG
CTAACGGGAG CCATTGCCAT CGCCGACGAG GTTCGACGCG ACTCGAAGCG GGCGGTCGAG
CGCCTTCGGG AACTGGGCGT GAAGCGCGTC GTGATGCTGA CCGGCGACAA CGAGGGGACC
GCGCGGGCGA TCGCCGAGCA GACCGGGGTC GACGAGTACC GCGCCGAGCT GCTGCCCGAA
GAGAAGGTCG AGGCGGTCCG AGCGCTGCAG GCCGAATACG GCGATGTGGC GATGGTCGGC
GACGGGATCA ACGACGCGCC GGCGCTGGCC GCGGCAGAGG TCGGCGTCGC GATGGGCGCC
GCGGGAACGG ACACGGCCTT GGAGACCGCG GACATCGCAC TGATGGGCGA CGACGTGGCG
AAGCTCCCGT ACCTGTACGC GCTCTCGCAC ACCGCCAACG GCGTCATCCG TCAGAACGTC
TGGGCGAGCC TCGGCGTGAA GGCCCTGCTC GCGCTGGGCG TGCCGCTGGG ACTGGTGAGC
GTCGCGGTCG CGGTCGTCGT CGGCGACATG GGGATGAGCC TCGGTGTCAC CGGCAACGCG
ATGCGCCTCT CCGGTATTGA GCCGGAGTCC TTCGAGTAG
 
Protein sequence
MRDGSNGSEE TAGSETTGGA AGADGRRREL TVSLAVPEMD CPSCAGKVDN AIGRLDGVTN 
AALNPTAGTA TVTYDPDVID EDDVVAAIEG AGYEVTGGRS GSDESGDDTE GGTDADTESG
GVAVAPPAEV WTTARAKKTW LGAGLVTLGL LFEFILTAQN PEVASVLGVP FTLADALFLG
AVAASGLPVI RGGYYSARNR SLDIDLLMGT AIIAATGIGY FVEAATLAVL FSIAELLEDY
AMDRARDSLR ELMELSPDEA TVKRENAETT IPADDVAVGE TVIVRPGEKV PLDGTVIEGE
SAVDESPITG ESVPVDKVSG DEVFAGAINE EGYLEIEVTS TAGDSTLARV IEMVQGAQAK
KTDTERFVDR FAGYYTPVVV VLAVLTAAIP PLVIAEPIAV EVAGYGITFA ADWGTWFVRG
LTLLVIACPC AFVISTPVSV VSGITSAAKN GVLIKGGNHL EAMGEVDAVA LDKTGTLTKG
KLTVTDLVPL GDADEATLLR RAAALERRSE HPIASAILDR ADRTGVTDHP EPAAFESLTG
KGIRAEIDGE TYYAGKPALF EDLGFDLSRA RAETDGGVVT EGDDADPPAG DIGPREFAEG
TLAALEREGK TVVLVGTATQ LTGAIAIADE VRRDSKRAVE RLRELGVKRV VMLTGDNEGT
ARAIAEQTGV DEYRAELLPE EKVEAVRALQ AEYGDVAMVG DGINDAPALA AAEVGVAMGA
AGTDTALETA DIALMGDDVA KLPYLYALSH TANGVIRQNV WASLGVKALL ALGVPLGLVS
VAVAVVVGDM GMSLGVTGNA MRLSGIEPES FE