Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2710 |
Symbol | |
ID | 4269501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 3075534 |
End bp | 3077249 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638127471 |
Product | arsenite-activated ATPase ArsA |
Protein accession | YP_743540 |
Protein GI | 114321857 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0003] Oxyanion-translocating ATPase |
TIGRFAM ID | [TIGR00345] arsenite-activated ATPase (arsA) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATTTC TGGAAAACGC CACGCGCAAT CTCTTTTTCA CCGGAAAGGG CGGTGTCGGT AAGACCTCCC TCGCCTGCGC CACGGCGCTG GCCCTAGCGG AGCGGGGCAA GCGTGTCCTG CTGGTCTCCA CCGACCCGGC CTCCAACATC GACGAGGTGC TGGAGACCGA CCTGACCGGT ACTCCGCGCC CGGTCAATGG CGTGGACAAC CTCCATGCAC TCAACATCGA TCCGGAAAAG GCTGCCGAGG AGTACCGCGA GCGCGTCGTC GGTCCTTACC GTGGTCAGCT CCCGGACGCC ATCGTGCGGA GCATGGAAGA GCAGCTTTCT GGTGCCTGCA CCGTCGAGAT CGCCGCCTTC GACGCCTTCG CCGGCCTGCT GGGCGACCCC CGCGCCGCCG AGGGCTACGA CCACCTGGTG TTCGACACCG CCCCCACCGG TCACACCCTG CGGCTGCTCT CCCTGCCCAG CGCTTGGAGC GGGTACATCG AGACCAACAC CTCCGGCACC TCCTGCCTGG GCCCGCTGGA AGGGCTGTCC GCCCAGAAGG ACGTCTATGC CGGCGCGGTG GAGGCGCTGG CGGAGGCCGA CCGCACCACC CTGGTGCTGG TCAGCCGGCC GGAGGGGGCC GCGCTGGACG AGGCCGCCCG CACCAGCGAA GAACTGCGCG ACCTCGGTGT GAAAAACCAG CATCTGGTGG TCAACGGCGT CTTTCGCGCC ACCGACGCCG ACGACCCGGT GGCCCGGGCC CTGGAGGCGC GGGGCCAGCG CGCCCTCGAG GCCATGCCCG CCGGGCTGGC GGAGCTGCCG CGCAGCGAGC GGCCACTGCG GGCGCACGCG CCCATGGGCC TGGACGGCCT GCGCATCCTG CTGGGCGAGC AGGCCCCCGA CATTCCCGAA CAGCCGGCGG AGGACGCACC CGAGGGCGAG TCCTTCGAGC AACTCATTGA CAGCCTGGAG CGCGACGGCC GCGGTGCGGT CATGACCCTG GGCAAGGGCG GGGTGGGCAA GACTACCCTG GCGGCCCGGA TCGCCGTGGC GCTCGCCTCC CGCGGGCACA GTGTTCACCT GACCACCACC GACCCGGCGG CCCACGTGGC CGCTGCCGTG GGCGGCGAGC TGCCCACCGG GCTGACGGTC GGCCGGGTGG ACCCCAAGGC CGAGACCGAG CGCTACCGCG AGCACGTCAT GGCCACCGCC GGCGCCGACA TGGACGAGGA GGGGCGCAAG CTGCTGGAGG AGGACCTGCG TTCGCCCTGT ACCGAGGAGA TCGCCGTCTT CCAGGCCTTC GCCCGTACCG TGGCCCGGGC CGAGGACGAG ATCGTGGTGC TGGACACCGC CCCCACCGGC CACACCATCC TGCTGCTGGA TGCCGCCCAG GCCTATCACC GCGAGCTGGG CCGGCAGAGC CAGGAGGTGG CGCCGGAGGT GGAGCAGTTG CTGCCCCGGC TGCGTGACCC CCACTACACC CACATGTTGA TCTGCACCCT GCCGGAGGCC ACGCCCGTGC ACGAGGCGGC AGCTCTGCAG GCGGACCTGC GCCGGGCGGA GATCGAGCCG GCGGCCTGGA TTGTCAACCA GTCGCTGACC CCGCTGGCGG TGACCGACCC GGTGCTGCGC GCCCGCCAGG CGCAGGAGGC CCGCTGGCTG CGCGAGATCG TGAGCGAACA TCACAGCCGA CTGATTATTG AACCCTGGAG CGAAGACTAC TCATGA
|
Protein sequence | MQFLENATRN LFFTGKGGVG KTSLACATAL ALAERGKRVL LVSTDPASNI DEVLETDLTG TPRPVNGVDN LHALNIDPEK AAEEYRERVV GPYRGQLPDA IVRSMEEQLS GACTVEIAAF DAFAGLLGDP RAAEGYDHLV FDTAPTGHTL RLLSLPSAWS GYIETNTSGT SCLGPLEGLS AQKDVYAGAV EALAEADRTT LVLVSRPEGA ALDEAARTSE ELRDLGVKNQ HLVVNGVFRA TDADDPVARA LEARGQRALE AMPAGLAELP RSERPLRAHA PMGLDGLRIL LGEQAPDIPE QPAEDAPEGE SFEQLIDSLE RDGRGAVMTL GKGGVGKTTL AARIAVALAS RGHSVHLTTT DPAAHVAAAV GGELPTGLTV GRVDPKAETE RYREHVMATA GADMDEEGRK LLEEDLRSPC TEEIAVFQAF ARTVARAEDE IVVLDTAPTG HTILLLDAAQ AYHRELGRQS QEVAPEVEQL LPRLRDPHYT HMLICTLPEA TPVHEAAALQ ADLRRAEIEP AAWIVNQSLT PLAVTDPVLR ARQAQEARWL REIVSEHHSR LIIEPWSEDY S
|
| |