Gene Information |
Locus tag | Hmuk_3342 |
Symbol | |
ID | 8409420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013201 |
Strand | - |
Start bp | 150702 |
End bp | 152633 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645018269 |
Product | arsenite-activated ATPase ArsA |
Protein accession | YP_003175790 |
Protein GI | 257373016 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0003] Oxyanion-translocating ATPase |
TIGRFAM ID | [TIGR00345] arsenite-activated ATPase (arsA) |
|
|
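The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. Both are assumptions about how this record was generated, not something the record states. The function name `feature_lengths` is hypothetical.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a gene span that
    includes the stop codon (assumptions, not stated in the record).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp rows of this record.
print(feature_lengths(150702, 152633))  # (1932, 643)
```

Under these assumptions the result agrees with the Gene Length (1932 bp) and Protein Length (643 aa) rows.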
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGA CGCCCCCCGA CGTACGAGCG GTCGTAGAGC CGACCAGCAA GGAGACCGAA TTCGTCTTTT TCAGCGGCAA AGGCGGTGTC GGTAAGAGTA CTGTGAGCTG CGCGACGGCG ACTTGGCTCG CGGACAACGA CTACGAGACG CTGCTGGTGA CGACCGATCC TGCACCGAAC CTCTCGGATA TCTTCGGCCA GGAGATCGGT CACGATGTCA CTGCTATCGA CGACATCGAG AACCTCTCGG CCATCGAGAT CGACCCAGAC ACGGCGGCCG AGGAGTATCG ACAGGAGACG ATTGAACCGA TGCAGCAATT GCTCGACGAC GAGCAACTCG AAACCGTCGA GGAGCAACTC AACAGCCCGT GTGTCGAAGA GATCGCCGCC TTCGACAACT TCGTAGACTT CATGGACTGC CCCGAGTACG ATGTGGTCGT CTTCGACACG GCGCCGACCG GTCACACGAT CCGGTTGATG GAGCTGCCCT CCGACTGGAA CGCTGAACTG GAGAAAGGCG GCTCGACCTG TATTGGGCCG GCAGCCTCGA TGGAAGAGCG CAAACAGGAC TACGAGCGTG CCATCGACAC GCTCCAAGAC GGCGAGCGTA CGTCCTTCGC GTTCGTCGGC AAGCCCGAGG ACTCCTCAAT CGACGAAATC GAGCGCAGTG CAAGAGACCT CGGCGAGCTT GGGATCGAGT CCCAACTGCT GATCATCAAC GGCTATCTCC CCGAGCCGGT GTGTGAGGAT CCGTTCTTCC AGGGGAAACG CGCAGACGAG CAGGCAGTTA TCGAGCGCGC ACGGACGGAG TTCGACGCCG ACGCGATGGC GACGTACCCA CTCCAGCCGG GCGAAATCGC AGGGCTCGAT CTACTCGCAG ATGTCGGTGG CGTACTGTAC GACGGCGACG AGGCGACCGT CGACGTTGGG ACGGCGACCG ATGTAGACGC AGAGACTGCG GTCGACTTCG AGTCGATGGC TGACTCCGAG GCGGTCGCCG ACCAGCTCCA ACCGGGCGAT GAGACGCGGT ATCTTTTCTT CACTGGGAAG GGCGGCGTCG GCAAGAGTAC GATTGCCTCG ACGGCGGCGA CGAAACTCGC CGAAGCTGGC CACGAAACGC TCGTCGTCAC AACTGACCCG GCCGCCCATC TTGAGGATAT CTTCGGCGAA CCGGTCGGCC ACGAGCCGAC ATCGGTCGGT CAAGCAAACC TCGACGCGGC ACGGATTGAC CAGGAGAAAG CGCTCGAAGA GTACCGCACG CAGGTCCTCG ACCACGTCAC CGAGATGTAC GAAGACAAGG AGGACACACA GATTGACGTG GACGCGGCGA TTGCGAACGT CGAAGAGGAA CTGGAGTCTC CGTGTGCCGA AGAGATGGCC GCGCTCGAGA AGTTCGTGAG CTACTTCGAC GAAGACGGCT ACGATGTCGT CGTCTTTGAC ACCGCTCCCA CGGGACACAC GCTCCGACTG CTCGAACTCC CCTCCGACTG GAAGGGCTTC ATGGATTTGG GTTCGCTGAC AAAGGGTGCC GCGCCCGCGA AAGGCGACCA GTACGACGAG GTCATCGAGA CGATGAAGGA CCCCGAACGG AGTACGTTCG CGTTCGTGAT GTACCCTGAG TACACCCCAA TGATGGAAGC CTACCGGGCC GCCGCAGACC TCAAAGATCA AGTCGGGATC GAGACTTCGC TGGTGGTCAC AAACTACCTG CTCCCCGAGG AATACGGTGA CAACGCCTTC TTCGAGAATC GCCGCGCTCA GCAGGCGGAG TACCTCGGCA AGATCAACGA TCGGTTCGAC 
GTGCCGATGA TGCTCGCCCC GCTACGTCAG GACGAGCCGA TTGGACTGGA CGAACTACGC GCGTTCGGTG AAGAGATCAC CGGACTGGAC GCACTCACTA CGGAGACGGA ACAGGAGGTG ACGGTGTCGT GA
|
Protein sequence | MSTTPPDVRA VVEPTSKETE FVFFSGKGGV GKSTVSCATA TWLADNDYET LLVTTDPAPN LSDIFGQEIG HDVTAIDDIE NLSAIEIDPD TAAEEYRQET IEPMQQLLDD EQLETVEEQL NSPCVEEIAA FDNFVDFMDC PEYDVVVFDT APTGHTIRLM ELPSDWNAEL EKGGSTCIGP AASMEERKQD YERAIDTLQD GERTSFAFVG KPEDSSIDEI ERSARDLGEL GIESQLLIIN GYLPEPVCED PFFQGKRADE QAVIERARTE FDADAMATYP LQPGEIAGLD LLADVGGVLY DGDEATVDVG TATDVDAETA VDFESMADSE AVADQLQPGD ETRYLFFTGK GGVGKSTIAS TAATKLAEAG HETLVVTTDP AAHLEDIFGE PVGHEPTSVG QANLDAARID QEKALEEYRT QVLDHVTEMY EDKEDTQIDV DAAIANVEEE LESPCAEEMA ALEKFVSYFD EDGYDVVVFD TAPTGHTLRL LELPSDWKGF MDLGSLTKGA APAKGDQYDE VIETMKDPER STFAFVMYPE YTPMMEAYRA AADLKDQVGI ETSLVVTNYL LPEEYGDNAF FENRRAQQAE YLGKINDRFD VPMMLAPLRQ DEPIGLDELR AFGEEITGLD ALTTETEQEV TVS
|
| |
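The gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly and compared with the protein sequence. The sketch below is a minimal pure-Python translation, not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Spaces (as used in the 10-base blocks of this record) are ignored.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the Gene sequence row.
print(translate("ATGAGCACGA CGCCCCCCGA CGTACGAGCG"))  # MSTTPPDVRA
# Last 12 bases of the Gene sequence row, ending in the TGA stop codon.
print(translate("ACGGTGTCGT GA"))  # TVS
```

The two outputs match the first ten and last three residues of the Protein sequence row. Passing the full gene sequence to `translate` gives a complete comparison.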