Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_3445 |
Symbol | |
ID | 7402291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012030 |
Strand | + |
Start bp | 195373 |
End bp | 197295 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643709986 |
Product | arsenite-activated ATPase ArsA |
Protein accession | YP_002567552 |
Protein GI | 222481316 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0003] Oxyanion-translocating ATPase |
TIGRFAM ID | [TIGR00345] arsenite-activated ATPase (arsA) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACACGA ATACGATACC TCGAGACCTC GTCGAACCGA GCGATGACGA CACCGAGTTC GTCTTCTTCA GCGGGAAAGG CGGCGTCGGC AAGAGTACCG TCAGCTGCGC GACTGCGACG TGGCTCGCCG ACAACGATTA CGAGACGCTG CTCGTGACGA CCGACCCCGC ACCGAACCTC TCGGATATCT TCGACCAGGT AATCGGCCAC GAGGTGACCG AAATCGAGGG AATCGAGAAC CTCTCCGCGA TCGAGATCGA CCCGGACACG GCCGCCGAGG AGTACCGACA GGAGACCATC GAACCGATGC GCCAACTGCT CGGCGACGAC GAGATCGAAA CCGTCGAAGA GCAGCTTAAC AGCCCCTGCG TCGAGGAGAT CGCGGCCTTC GACAACTTCG TCGACTTCAT GGACAGTCCG GAGTACGACG TGGTGGTCTT CGATACCGCC CCGACCGGCC ACACCATCCG TCTGATGGAG TTGCCCTCCG ACTGGAACGC CGAACTCGAG AAGGGCGGCT CGACGTGCAT CGGCCCCGCC GCCTCGATGG AGGACAAGAA GGTCCAGTAC GAGCGCGCAA TCGACACGCT CCAGGACACC GAGCAGACGA CGTTTGCGTT CGTCGGCAAG CCCGAGGACT CCTCGATCGA CGAGGTCGAA CGGAGCGCGG GCGACCTCGC CGAACTCGGC ATCGAATCGC AATTGCTGAT ACTCAACGGC TACCTGCCCG AGTCGGTGTG TGAAGACCCC TTCTTCGAGG GGAAACGCGA GGACGAACAG GCCGTCATCG AGCGCGCCCG CGAAGAGTTC GACGCCGATG CGACCGGGAC GTACCCGCTC CAGCCCGGCG AGATCACCGG GCTCGACCTG CTGTCCGACG TCGCCGGCGT CCTCTACGAC GGCGCGGAAG CGACCGTCGA CGTCGGCTCG GCAACGGATA TCGAGACCGA CCAGTCGGTC GACGTCGAGG CGCTGGCCGA TCCGGCATCG GTCGCCGACC GAGTGACGCC GAGCGACGAC GAGACGCGGT ATCTGTTCTT CACCGGGAAA GGCGGCGTCG GCAAGAGCAC CATCGCCGCG GCCTCGGCGA CGAAGCTCGC GGAGGCGGGC TACGAGACGC TCGTCGTGAC GACGGACCCG GCGGCCCACC TCGAGGACAT CTTCGGCGAG CCGGTCGGCC ACGACCCGAC GTCGGTGAGT CAGGCGAACC TCGACGCGGC CCGGATCGAC CAGGAAAAGG CTCTCGAGGA GTACCGCACG CAGGTCCTCG ATCACGTCAC CGAGATGTAC GAGGACAAGG AGGACACGGA GATCGACGTC GAGGCCGCCA TCGCGAACGT CGAGGAGGAG TTGGAGTCGC CGTGTGCCGA GGAGATGGCG GCCCTCGAGA AGTTCGTCAG CTACTTCCAG CAAGACGGCT ACGACGTGGT GGTCTTCGAC ACCGCGCCGA CCGGCCACAC GCTCCGCTTG CTGGAGCTAC CCTCCGACTG GAAGGGATTC ATGGACCTGG GCTCGCTGAC CAAGGGAGCG GCTCCCGCGA AAGGCGACCA GTACGACGAG GTCATCGAGA CGATGCAGGA CCCCGAGCGG AGCTCGTTCG CGTTCGTCAT GTATCCGGAG TACACGCCGA TGATGGAAGC GTACCGGGCA GCCGAGGACC TCAACGACCA GGTCGGTATC GAGACGGCGT TCGTCGTCGC GAACTACCTG CTGCCCGAAG AGTACGGCGA CAACGCCTTC TTCGCGAATC GCCGGGCGCA ACAGGAGAAA TATCTCGGCG AGATCAAGGA CCGCTTCGAG ACGCCTTTGA TGTGCGCGCC CCTGCGCCGT GACGAACCGA TCGGACTCGA AGAGCTGAGC GCCTTCGGCG ACGAGATCAC GGGTCTGTCC GAGATCAGCA AGGAAGAGGT GACCATCCAA TGA
|
Protein sequence | MDTNTIPRDL VEPSDDDTEF VFFSGKGGVG KSTVSCATAT WLADNDYETL LVTTDPAPNL SDIFDQVIGH EVTEIEGIEN LSAIEIDPDT AAEEYRQETI EPMRQLLGDD EIETVEEQLN SPCVEEIAAF DNFVDFMDSP EYDVVVFDTA PTGHTIRLME LPSDWNAELE KGGSTCIGPA ASMEDKKVQY ERAIDTLQDT EQTTFAFVGK PEDSSIDEVE RSAGDLAELG IESQLLILNG YLPESVCEDP FFEGKREDEQ AVIERAREEF DADATGTYPL QPGEITGLDL LSDVAGVLYD GAEATVDVGS ATDIETDQSV DVEALADPAS VADRVTPSDD ETRYLFFTGK GGVGKSTIAA ASATKLAEAG YETLVVTTDP AAHLEDIFGE PVGHDPTSVS QANLDAARID QEKALEEYRT QVLDHVTEMY EDKEDTEIDV EAAIANVEEE LESPCAEEMA ALEKFVSYFQ QDGYDVVVFD TAPTGHTLRL LELPSDWKGF MDLGSLTKGA APAKGDQYDE VIETMQDPER SSFAFVMYPE YTPMMEAYRA AEDLNDQVGI ETAFVVANYL LPEEYGDNAF FANRRAQQEK YLGEIKDRFE TPLMCAPLRR DEPIGLEELS AFGDEITGLS EISKEEVTIQ
|
| |