Gene Information |
Locus tag | Htur_2621 |
Symbol | |
ID | 8743234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 2691360 |
End bp | 2692607 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646513210 |
Product | arsenite-activated ATPase ArsA |
Protein accession | YP_003404171 |
Protein GI | 284165892 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0003] Oxyanion-translocating ATPase |
TIGRFAM ID | [TIGR00345] arsenite-activated ATPase (arsA) |
|
|
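The length fields in the Gene Information table can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed 1248 bp) and that the last codon of an unspliced CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Htur_2621.
start_bp, end_bp = 2691360, 2692607
length_bp = gene_length(start_bp, end_bp)

print(length_bp)                  # 1248
print(protein_length(length_bp))  # 415
```

Both results match the Gene Length (1248 bp) and Protein Length (415 aa) fields above.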
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGAA TCGACGTCGA GCGGGTCGAC GAGGAGGCCG AGACGGCGGA CGAGACCGAC GACGCCCATA CCATCGAGGT GACGCCGACG GACTCTCTCG AGGACGACGA ACGGGAAACC ATTGACGTGG AGCCGTCCGA CGAGCCCGTC GATGGCCCGG ACTACGTGCT CTACGGCGGG AAGGGCGGCG TCGGCAAGAC GACGATGGCC GCCGCGACCG CGTTAGACAG CGCCCGCGGC GGCACGTCGA CGCTGGTCGT CTCGACGGAC CCGGCTCACT CGCTGTCGGA CACCTTCGAG ACCGACGTGC CGGCCGAACC GGGCCGGATC CGCGACGATA TCCCCCTCTA CGCGGCCGAG ATCGACCCCG AGTCCGCGAT GGAGGCGGGC GAGGTCGCCT TCCCCGGCGC CGGTGGTCCC GACGATGCGG CGAACGCGGA CGACGGAACC GCGGGGCCGT TCGGCGGCGG AGCAGACAGC GGCGCGGGCC CCTTCGGCGG GAGCGACGGC GGCGCGGGCG AGATGGGCGG CATGGGCGGG CTCGGCGACC TTCTCGGCGG TGGGGACGGA TCGCCGATGG AGGCGCTGTT CGGCGGCGCG ATGCCCGGCG CCGACGAGGC CGCCGCGATG CAACTGCTGC TCGAGTACAT GGACGACCCC CGGTTCGAGC GCGTCGTCAT CGACACCGCC CCGACGGGCC ACACCCTCCG GCTGCTGAAG CTCCCGGAAC TGATGGACAC CATGATGGGT CGGATGATGA AGGTCCGCCA GCGCATTAGC GGCATGCTCG AGGGGATGAA GGGGATGTTC CCCGGTCAGG AGGCGCCCGA GGAGGACGAC CTCGAGGACC TGGACGAACT CCGGGAGCGC ATCGAGCGCC TGCGGGCGGC CCTGCAGGAC CCCGCGCGGA CTGACTTCCG AATCGTCATG GTCCCCGAGG AGATGAGCGT CTTCGAATCC AAACGCTTGC GCCAGCAACT CGAGGAGTTC CAGATTCCGG TCGGCACGGT CGTCGTCAAC CGCGTCATGG AGCCCCTCTC GGACGTCACC GACGACGTTC GGGGCGAGTT CCTCCAGCCG AATCTGGACG ACTGCGAATT CTGCCAACGG CGGTGGGACG TCCAGCAGGG CGCCCTCGCC GAGGCCCAGG AACTGTTCCG CGGGACCGAG GTGCGACGCG TCCCGCTGTT CGCCGACGAA GTCCGCGGCG AGGGGATGCT CGAGGTCGTC GCGGCCTGTC TGCGGTGA
|
Protein sequence | MSGIDVERVD EEAETADETD DAHTIEVTPT DSLEDDERET IDVEPSDEPV DGPDYVLYGG KGGVGKTTMA AATALDSARG GTSTLVVSTD PAHSLSDTFE TDVPAEPGRI RDDIPLYAAE IDPESAMEAG EVAFPGAGGP DDAANADDGT AGPFGGGADS GAGPFGGSDG GAGEMGGMGG LGDLLGGGDG SPMEALFGGA MPGADEAAAM QLLLEYMDDP RFERVVIDTA PTGHTLRLLK LPELMDTMMG RMMKVRQRIS GMLEGMKGMF PGQEAPEEDD LEDLDELRER IERLRAALQD PARTDFRIVM VPEEMSVFES KRLRQQLEEF QIPVGTVVVN RVMEPLSDVT DDVRGEFLQP NLDDCEFCQR RWDVQQGALA EAQELFRGTE VRRVPLFADE VRGEGMLEVV AACLR
|
| |
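The sequences above are printed in space-separated blocks of ten characters, so the spaces need to be stripped before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation, shown on the first two blocks of the gene sequence only. The helper names are hypothetical, and the 70% figure in the record is not recomputed here.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence from the record.
prefix = clean_sequence("ATGAGCGGAA TCGACGTCGA")

print(len(prefix))               # 20
print(prefix.startswith("ATG"))  # True
print(gc_fraction(prefix))       # 0.55
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field.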