Gene Mlg_2710 details

Gene Information

Locus tag: Mlg_2710
Symbol:
ID: 4269501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 3075534
End bp: 3077249
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 71%
IMG OID: 638127471
Product: arsenite-activated ATPase ArsA
Protein accession: YP_743540
Protein GI: 114321857
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
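
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 3075534, 3077249
length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # 1716 571
```

Both results agree with the Gene Length and Protein Length fields listed in the record.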


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATTTC TGGAAAACGC CACGCGCAAT CTCTTTTTCA CCGGAAAGGG CGGTGTCGGT 
AAGACCTCCC TCGCCTGCGC CACGGCGCTG GCCCTAGCGG AGCGGGGCAA GCGTGTCCTG
CTGGTCTCCA CCGACCCGGC CTCCAACATC GACGAGGTGC TGGAGACCGA CCTGACCGGT
ACTCCGCGCC CGGTCAATGG CGTGGACAAC CTCCATGCAC TCAACATCGA TCCGGAAAAG
GCTGCCGAGG AGTACCGCGA GCGCGTCGTC GGTCCTTACC GTGGTCAGCT CCCGGACGCC
ATCGTGCGGA GCATGGAAGA GCAGCTTTCT GGTGCCTGCA CCGTCGAGAT CGCCGCCTTC
GACGCCTTCG CCGGCCTGCT GGGCGACCCC CGCGCCGCCG AGGGCTACGA CCACCTGGTG
TTCGACACCG CCCCCACCGG TCACACCCTG CGGCTGCTCT CCCTGCCCAG CGCTTGGAGC
GGGTACATCG AGACCAACAC CTCCGGCACC TCCTGCCTGG GCCCGCTGGA AGGGCTGTCC
GCCCAGAAGG ACGTCTATGC CGGCGCGGTG GAGGCGCTGG CGGAGGCCGA CCGCACCACC
CTGGTGCTGG TCAGCCGGCC GGAGGGGGCC GCGCTGGACG AGGCCGCCCG CACCAGCGAA
GAACTGCGCG ACCTCGGTGT GAAAAACCAG CATCTGGTGG TCAACGGCGT CTTTCGCGCC
ACCGACGCCG ACGACCCGGT GGCCCGGGCC CTGGAGGCGC GGGGCCAGCG CGCCCTCGAG
GCCATGCCCG CCGGGCTGGC GGAGCTGCCG CGCAGCGAGC GGCCACTGCG GGCGCACGCG
CCCATGGGCC TGGACGGCCT GCGCATCCTG CTGGGCGAGC AGGCCCCCGA CATTCCCGAA
CAGCCGGCGG AGGACGCACC CGAGGGCGAG TCCTTCGAGC AACTCATTGA CAGCCTGGAG
CGCGACGGCC GCGGTGCGGT CATGACCCTG GGCAAGGGCG GGGTGGGCAA GACTACCCTG
GCGGCCCGGA TCGCCGTGGC GCTCGCCTCC CGCGGGCACA GTGTTCACCT GACCACCACC
GACCCGGCGG CCCACGTGGC CGCTGCCGTG GGCGGCGAGC TGCCCACCGG GCTGACGGTC
GGCCGGGTGG ACCCCAAGGC CGAGACCGAG CGCTACCGCG AGCACGTCAT GGCCACCGCC
GGCGCCGACA TGGACGAGGA GGGGCGCAAG CTGCTGGAGG AGGACCTGCG TTCGCCCTGT
ACCGAGGAGA TCGCCGTCTT CCAGGCCTTC GCCCGTACCG TGGCCCGGGC CGAGGACGAG
ATCGTGGTGC TGGACACCGC CCCCACCGGC CACACCATCC TGCTGCTGGA TGCCGCCCAG
GCCTATCACC GCGAGCTGGG CCGGCAGAGC CAGGAGGTGG CGCCGGAGGT GGAGCAGTTG
CTGCCCCGGC TGCGTGACCC CCACTACACC CACATGTTGA TCTGCACCCT GCCGGAGGCC
ACGCCCGTGC ACGAGGCGGC AGCTCTGCAG GCGGACCTGC GCCGGGCGGA GATCGAGCCG
GCGGCCTGGA TTGTCAACCA GTCGCTGACC CCGCTGGCGG TGACCGACCC GGTGCTGCGC
GCCCGCCAGG CGCAGGAGGC CCGCTGGCTG CGCGAGATCG TGAGCGAACA TCACAGCCGA
CTGATTATTG AACCCTGGAG CGAAGACTAC TCATGA
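
The gene sequence is printed in blocks of ten bases, so it has to be stripped of whitespace before its length or GC content can be computed. A minimal sketch of that step follows, assuming the sequence contains only the letters A, C, G and T; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the sequence is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, as printed in the record.
first_line = "ATGCAATTTC TGGAAAACGC CACGCGCAAT CTCTTTTTCA CCGGAAAGGG CGGTGTCGGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 1))
```

Passing the full sequence text through the same two functions gives values that can be compared with the Gene Length and GC content fields.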
 
Protein sequence
MQFLENATRN LFFTGKGGVG KTSLACATAL ALAERGKRVL LVSTDPASNI DEVLETDLTG 
TPRPVNGVDN LHALNIDPEK AAEEYRERVV GPYRGQLPDA IVRSMEEQLS GACTVEIAAF
DAFAGLLGDP RAAEGYDHLV FDTAPTGHTL RLLSLPSAWS GYIETNTSGT SCLGPLEGLS
AQKDVYAGAV EALAEADRTT LVLVSRPEGA ALDEAARTSE ELRDLGVKNQ HLVVNGVFRA
TDADDPVARA LEARGQRALE AMPAGLAELP RSERPLRAHA PMGLDGLRIL LGEQAPDIPE
QPAEDAPEGE SFEQLIDSLE RDGRGAVMTL GKGGVGKTTL AARIAVALAS RGHSVHLTTT
DPAAHVAAAV GGELPTGLTV GRVDPKAETE RYREHVMATA GADMDEEGRK LLEEDLRSPC
TEEIAVFQAF ARTVARAEDE IVVLDTAPTG HTILLLDAAQ AYHRELGRQS QEVAPEVEQL
LPRLRDPHYT HMLICTLPEA TPVHEAAALQ ADLRRAEIEP AAWIVNQSLT PLAVTDPVLR
ARQAQEARWL REIVSEHHSR LIIEPWSEDY S
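
The protein sequence can be checked against the gene sequence by translating codons in frame. The sketch below builds a codon table from the amino acid assignments of the standard genetic code, which translation table 11 shares for all 64 codons (the two tables differ in which codons may act as start codons). The `translate` function is a hypothetical helper, it stops at the first stop codon, and it assumes the input contains only A, C, G and T, so an ambiguous base would raise a `KeyError`.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 36 bases of the gene sequence above.
print(translate("ATGCAATTTC TGGAAAACGC CACGCGCAAT"))            # MQFLENATRN
print(translate("CTGATTATTG AACCCTGGAG CGAAGACTAC TCATGA"))     # LIIEPWSEDYS
```

The two fragments reproduce the first ten and last eleven residues of the protein sequence, and the final codon TGA is the stop codon.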