Gene Hlac_3445 details


Gene Information

Locus tag: Hlac_3445
Symbol:
ID: 7402291
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012030
Strand:
Start bp: 195373
End bp: 197295
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 11
GC content: 64%
IMG OID: 643709986
Product: arsenite-activated ATPase ArsA
Protein accession: YP_002567552
Protein GI: 222481316
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
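
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Hlac_3445.
length_bp = gene_length(195373, 197295)
print(length_bp, protein_length(length_bp))
```

With the values listed for this gene, the two results agree with the Gene Length and Protein Length fields.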


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACACGA ATACGATACC TCGAGACCTC GTCGAACCGA GCGATGACGA CACCGAGTTC 
GTCTTCTTCA GCGGGAAAGG CGGCGTCGGC AAGAGTACCG TCAGCTGCGC GACTGCGACG
TGGCTCGCCG ACAACGATTA CGAGACGCTG CTCGTGACGA CCGACCCCGC ACCGAACCTC
TCGGATATCT TCGACCAGGT AATCGGCCAC GAGGTGACCG AAATCGAGGG AATCGAGAAC
CTCTCCGCGA TCGAGATCGA CCCGGACACG GCCGCCGAGG AGTACCGACA GGAGACCATC
GAACCGATGC GCCAACTGCT CGGCGACGAC GAGATCGAAA CCGTCGAAGA GCAGCTTAAC
AGCCCCTGCG TCGAGGAGAT CGCGGCCTTC GACAACTTCG TCGACTTCAT GGACAGTCCG
GAGTACGACG TGGTGGTCTT CGATACCGCC CCGACCGGCC ACACCATCCG TCTGATGGAG
TTGCCCTCCG ACTGGAACGC CGAACTCGAG AAGGGCGGCT CGACGTGCAT CGGCCCCGCC
GCCTCGATGG AGGACAAGAA GGTCCAGTAC GAGCGCGCAA TCGACACGCT CCAGGACACC
GAGCAGACGA CGTTTGCGTT CGTCGGCAAG CCCGAGGACT CCTCGATCGA CGAGGTCGAA
CGGAGCGCGG GCGACCTCGC CGAACTCGGC ATCGAATCGC AATTGCTGAT ACTCAACGGC
TACCTGCCCG AGTCGGTGTG TGAAGACCCC TTCTTCGAGG GGAAACGCGA GGACGAACAG
GCCGTCATCG AGCGCGCCCG CGAAGAGTTC GACGCCGATG CGACCGGGAC GTACCCGCTC
CAGCCCGGCG AGATCACCGG GCTCGACCTG CTGTCCGACG TCGCCGGCGT CCTCTACGAC
GGCGCGGAAG CGACCGTCGA CGTCGGCTCG GCAACGGATA TCGAGACCGA CCAGTCGGTC
GACGTCGAGG CGCTGGCCGA TCCGGCATCG GTCGCCGACC GAGTGACGCC GAGCGACGAC
GAGACGCGGT ATCTGTTCTT CACCGGGAAA GGCGGCGTCG GCAAGAGCAC CATCGCCGCG
GCCTCGGCGA CGAAGCTCGC GGAGGCGGGC TACGAGACGC TCGTCGTGAC GACGGACCCG
GCGGCCCACC TCGAGGACAT CTTCGGCGAG CCGGTCGGCC ACGACCCGAC GTCGGTGAGT
CAGGCGAACC TCGACGCGGC CCGGATCGAC CAGGAAAAGG CTCTCGAGGA GTACCGCACG
CAGGTCCTCG ATCACGTCAC CGAGATGTAC GAGGACAAGG AGGACACGGA GATCGACGTC
GAGGCCGCCA TCGCGAACGT CGAGGAGGAG TTGGAGTCGC CGTGTGCCGA GGAGATGGCG
GCCCTCGAGA AGTTCGTCAG CTACTTCCAG CAAGACGGCT ACGACGTGGT GGTCTTCGAC
ACCGCGCCGA CCGGCCACAC GCTCCGCTTG CTGGAGCTAC CCTCCGACTG GAAGGGATTC
ATGGACCTGG GCTCGCTGAC CAAGGGAGCG GCTCCCGCGA AAGGCGACCA GTACGACGAG
GTCATCGAGA CGATGCAGGA CCCCGAGCGG AGCTCGTTCG CGTTCGTCAT GTATCCGGAG
TACACGCCGA TGATGGAAGC GTACCGGGCA GCCGAGGACC TCAACGACCA GGTCGGTATC
GAGACGGCGT TCGTCGTCGC GAACTACCTG CTGCCCGAAG AGTACGGCGA CAACGCCTTC
TTCGCGAATC GCCGGGCGCA ACAGGAGAAA TATCTCGGCG AGATCAAGGA CCGCTTCGAG
ACGCCTTTGA TGTGCGCGCC CCTGCGCCGT GACGAACCGA TCGGACTCGA AGAGCTGAGC
GCCTTCGGCG ACGAGATCAC GGGTCTGTCC GAGATCAGCA AGGAAGAGGT GACCATCCAA
TGA
 
Protein sequence
MDTNTIPRDL VEPSDDDTEF VFFSGKGGVG KSTVSCATAT WLADNDYETL LVTTDPAPNL 
SDIFDQVIGH EVTEIEGIEN LSAIEIDPDT AAEEYRQETI EPMRQLLGDD EIETVEEQLN
SPCVEEIAAF DNFVDFMDSP EYDVVVFDTA PTGHTIRLME LPSDWNAELE KGGSTCIGPA
ASMEDKKVQY ERAIDTLQDT EQTTFAFVGK PEDSSIDEVE RSAGDLAELG IESQLLILNG
YLPESVCEDP FFEGKREDEQ AVIERAREEF DADATGTYPL QPGEITGLDL LSDVAGVLYD
GAEATVDVGS ATDIETDQSV DVEALADPAS VADRVTPSDD ETRYLFFTGK GGVGKSTIAA
ASATKLAEAG YETLVVTTDP AAHLEDIFGE PVGHDPTSVS QANLDAARID QEKALEEYRT
QVLDHVTEMY EDKEDTEIDV EAAIANVEEE LESPCAEEMA ALEKFVSYFQ QDGYDVVVFD
TAPTGHTLRL LELPSDWKGF MDLGSLTKGA APAKGDQYDE VIETMQDPER SSFAFVMYPE
YTPMMEAYRA AEDLNDQVGI ETAFVVANYL LPEEYGDNAF FANRRAQQEK YLGEIKDRFE
TPLMCAPLRR DEPIGLEELS AFGDEITGLS EISKEEVTIQ
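
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, assuming the text has been pasted into a Python string. The helper names are hypothetical, and the GC calculation is a plain count of G and C bases, which may differ in rounding from how the database derived its reported GC content.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


def ends_with_stop(dna: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(dna) % 3 == 0 and dna[-3:] in stops


# Tiny illustrative input, not the full gene sequence.
example = join_blocks("ATGGACACGA ATACG\nTGA")
print(example, round(gc_percent(example), 1), ends_with_stop(example))
```

Applied to the full gene sequence, `join_blocks` should yield a string whose length matches the Gene Length field, which is a quick way to detect a truncated copy.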