Gene Paes_1353 details

Gene Information

Locus tag: Paes_1353
Symbol:
ID: 6460289
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 1472209
End bp: 1473402
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 47%
IMG OID: 642725337
Product: arsenite-activated ATPase ArsA
Protein accession: YP_002016022
Protein GI: 194334162
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
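
The length fields above can be cross-checked against the coordinates. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive (the record does not state the convention) and that the unspliced CDS ends in a single stop codon. The function names gene_length and protein_length are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the Start bp and End bp fields of this record.
length = gene_length(1472209, 1473402)
print(length, protein_length(length))
```

With the values in this record, the sketch gives 1194 bp and 397 aa, which agrees with the Gene Length and Protein Length fields.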


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.284232
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00308524
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGCGCAATA TCATTTTTAC GGGTAAGGGA GGCGTTGGAA AAACCTCTGT TGCAGCCGCA 
ACAGCACTGA AAGCTGCTGA CATGGGTTAT AAAACCCTGA TAATGTCTAC TGATCCCGCT
CACAGTCTCG GTGATTCACT TGATGTGCAG CTTGGCCCTT CCCCTGTCAA GGTTGCTGAA
AATCTCTGGG GTCAGGAAGT CAGTGTTTTC GGTGATCTGA ACCTGAACTG GGATGTTGTT
CGGGAACACT TTGCTCAGCT AATGGAATCA AGAGGTGTAG AGGGTATTTA TGCTGAAGAG
ATGGGTGTCC TTCCTGGTAT GGAAGAGCTT TTCTCTCTCT CCTACATCAA ACGTTATAAC
GAAGAAGAGT CCGATTACGA CCTGCTTGTC GTTGACTGTG CTCCTACCGG CGAAACGCTT
CGTCTCCTTT CACTTCCCGA GACATTCGGC TGGTTTATCA AGCTGATCCG CAACGTTGAG
AAATATATGG TCAAGCCAAT GATCAGGCCG CTCTCCAAAA AGGTCAAGAA AATTGACTCG
ATGGTCGCAC CTGAAGAGGT TTACGAGAAA GTCGACAATC TGTTCGCTTC AACAGAAGGC
ATCATCGAGC TGCTTGCCGA CGGTTCAAAA TCGACCGTTC GTCTTGTTAT GAACCCTGAA
AAGATGGTTA TCAAAGAGTC CATGAGGGCG CTGACCTATC TCAACCTCTA TGGCATCACT
GTTGACAGCA TCACTATCAA CAGAGTCATG CCTGCTCATA CCGAGGATCC TTATTTCAAG
AAATGGAGAG ATATTCAGCA GAATTATATC AAACAGATTG AAGGTTCATT CGCGCCTATT
CCGATCGGCC AGGTTCCTTT GTTTGATCAG GAGGTCGTCG GTCTTGACAT GCTTCGTCAG
GTTGGTGAGA AAGTCTATGC CGAAAAGAAT CCTGTCGATA TTTTCTTCAA GGAAGACCCG
ATTGCTATTG AGAAGGTCAA CGATGGTCAC TACAAGGTTC GAGTGAAATT GCCATTTATG
GAAACTATGG GACAGGAGCC CAAGATCCTT AAGCTCGGCG ATGATCTTAC CATCAGAATT
GGCGATTATC AGAAGGTTGT CGCTCTGCCG ATCTTTATTG CCGGACTTGA ATCTTCCGGA
GCGAGCTTTG ACAACGGCTG GCTCAGCATC GACTTTACCA GGGACGGCGA GTAA
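
The GC content field can be recomputed from the gene sequence. The following is a minimal Python sketch under the assumption that GC content is the percentage of G and C bases over the full gene sequence. The record does not say how its value was derived or rounded, and the function name gc_content is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Demonstration on the first 10-base block of the gene sequence only.
print(gc_content("ATGCGCAATA"))
```

Passing the whole gene sequence, with or without the spaces and line breaks, gives a figure that can be compared with the GC content field.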
 
Protein sequence
MRNIIFTGKG GVGKTSVAAA TALKAADMGY KTLIMSTDPA HSLGDSLDVQ LGPSPVKVAE 
NLWGQEVSVF GDLNLNWDVV REHFAQLMES RGVEGIYAEE MGVLPGMEEL FSLSYIKRYN
EEESDYDLLV VDCAPTGETL RLLSLPETFG WFIKLIRNVE KYMVKPMIRP LSKKVKKIDS
MVAPEEVYEK VDNLFASTEG IIELLADGSK STVRLVMNPE KMVIKESMRA LTYLNLYGIT
VDSITINRVM PAHTEDPYFK KWRDIQQNYI KQIEGSFAPI PIGQVPLFDQ EVVGLDMLRQ
VGEKVYAEKN PVDIFFKEDP IAIEKVNDGH YKVRVKLPFM ETMGQEPKIL KLGDDLTIRI
GDYQKVVALP IFIAGLESSG ASFDNGWLSI DFTRDGE
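
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch, not the database's own method. Translation table 11 assigns the same residues to codons as the standard genetic code and differs in which start codons it permits, so the sketch uses the standard assignments. It does not convert alternative start codons to M, which is sufficient here because the gene begins with ATG. The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCGCAATA TCATTTTTAC GGGTAAGGGA"))  # prints MRNIIFTGKG
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence allows the same comparison over all 397 residues.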