Gene Information |
Locus tag | Hlac_2712 |
Symbol | |
ID | 7401323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 2703484 |
End bp | 2704509 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643709787 |
Product | arsenite-activated ATPase ArsA |
Protein accession | YP_002567353 |
Protein GI | 222481116 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0003] Oxyanion-translocating ATPase |
TIGRFAM ID | [TIGR00345] arsenite-activated ATPase (arsA) |
|
|
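The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the Gene Length counts the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Hlac_2712.
start_bp, end_bp = 2703484, 2704509
gene_len = gene_length_bp(start_bp, end_bp)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)
```

Under these assumptions the computed values agree with the Gene Length (1026 bp) and Protein Length (341 aa) fields in the record.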
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACGGT TCGTCTTCTT CGGCGGGAAG GGTGGCGTCG GCAAGACCAC CGTCTCCTGT GCGTACGCCT CTCGCTGTGC GAACGACGGG GTGCGGACGC TGGTCGTCTC GACGGACCCC GCACACTCGG TGTCGGACGT GTTCGACCAG TCGTTCGGCG ATGAGCCGGC GCCCGTCGAC GGGATCGAGG GACTCGACGC GATGGAGATC GACCCCGAAG ACGAGATGCA GCGGCACCTC CAGGAGATCC GCGAGGCGCT CTCCGAGCAG GTGTCGGCGG CGATGGTCTC GGAGATCAAC CGCCAACTGG AGATGTCGCA CGGCACGCCG GGCGCGTACG AGGCTGCGCT CTTCGACGCG TTCGTGAGCG TGATGCGCGA GGAGGGTGAG TCGTACGATC GGATCGTCTT CGACACCGCG CCGACCGGGT CGACGCTGCG GCTCTTGGGG CTCCCCGAGT TCCTCGGCGA CTGGATCGAC CGGCTGCTGT ACAAGCGCAA GCAGTCGATC GACCTGTTCG AGAAGGCCGC TATCGGCGAC ATGGAACCCC GGCGGTTGAT GGACGGCGAC CCCGTCTTAG AGCGGCTCCA GCGCCGCAAG GAGTTCTTCG AGTTCGCGGG CGACACCATG CGAGACGAGG CCGCCTTCTT CCTCGTGTTG AACCCCGACC AGCTCTCGGT CAACGAGACA GGACGGGCGA TCGAGGGGTT CGCCGAGCGC GACTTGCGCG TCCGTGGGCT CGTCGCGAAC AAGCTTACCC CGGAGCCCGA CGACGACGAG GAGGGACGCG GAGCCACCTA CCTCCGCGAG AAGGTCGCGA CCGAGCGCGA CCGGCTCCGG CAGGTCCGAG AGGAGTTCGA GCCCCCGCTC GTCGCCGAGA TTGAGTCGCG GACGCGGGAA GTCCGCGGCG ACGTGCTCGC GGAGGTGGCG GCCGCGCTCG ACATCGAGAC GGCGAGCGAC GTGAGCGGGG AGGACGACGA CCGCACCCGA AGTGACGACG GTGGCCCCGT CCGCGCCGAT CGGTAA
Protein sequence | MERFVFFGGK GGVGKTTVSC AYASRCANDG VRTLVVSTDP AHSVSDVFDQ SFGDEPAPVD GIEGLDAMEI DPEDEMQRHL QEIREALSEQ VSAAMVSEIN RQLEMSHGTP GAYEAALFDA FVSVMREEGE SYDRIVFDTA PTGSTLRLLG LPEFLGDWID RLLYKRKQSI DLFEKAAIGD MEPRRLMDGD PVLERLQRRK EFFEFAGDTM RDEAAFFLVL NPDQLSVNET GRAIEGFAER DLRVRGLVAN KLTPEPDDDE EGRGATYLRE KVATERDRLR QVREEFEPPL VAEIESRTRE VRGDVLAEVA AALDIETASD VSGEDDDRTR SDDGGPVRAD R
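The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def strip_blocks(blocked_seq: str) -> str:
    """Remove the whitespace used to display a sequence in blocks of 10."""
    return "".join(blocked_seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, copied from the record.
excerpt = "ATGGAACGGT TCGTCTTCTT"
clean = strip_blocks(excerpt)
print(len(clean), gc_percent(clean))
```

Applying the same two functions to the complete gene sequence would give a value to compare with the 68% reported in the record.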