Gene Paes_0222 details

Gene Information

Locus tag: Paes_0222
Symbol:
ID: 6459087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 232597
End bp: 233784
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 50%
IMG OID: 642724213
Product: arsenite-activated ATPase ArsA
Protein accession: YP_002014926
Protein GI: 194333066
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
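
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon not counted in the protein length. The function names are illustrative and are not part of the source record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 232597
END_BP = 233784
GENE_LENGTH_BP = 1188
PROTEIN_LENGTH_AA = 395


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons should hold if the record is internally consistent.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions, 233784 - 232597 + 1 gives 1188 bp, and 1188 / 3 - 1 gives 395 residues, matching the listed Gene Length and Protein Length.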


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.267897
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTTGA TTCTGATGAC GGGGAAAGGC GGTGTGGGAA AAACGTCTAT GGCTGCGGCA 
ACCGGTCTTC GTTGTGCTGA GTTGGGATAC AAGACTCTGG TATTAAGCAC TGACCCTGCC
CATTCCCTTG CTGATAGTTT TGCTGTTGCC CTTGGTCATG AACCTCGTAA GATCTGTGAA
AATTTATGGG GTGCGGAGCT TGATGTTCTT GAGGAACTGG AGCAGAACTG GGGCTCTGTC
AAACGCTATA TTTCCGAGGT TCTTCAGGCC AGAGGGCTGG AGGGCGTTCA GGCGGAGGAG
CTTGCTATTC TTCCGGGCTC CGATGAGATT TTCGGGCTTG TGCGTGTGTT CCGTCATTAC
AAGGAGGGGG AGTATGATGT GCTGATTATC GATTCGGCAC CGACCGGAAC GGCATTGAGG
CTGCTGAGTA TTCCTGAAGT TGGCGGATGG TACATGCGCC GGCTCTATAA ACCGTTCGAG
AAAGTGGCGA TGACGCTTCG TCCGTTAGTA GAGCCTATTT TCAGGCCTCT TGCAGGATTT
TCTCTTCCTG ATAAGGAGAT GATGGATGTT CCTTATGAGT TTTATCAGAA AATAGAGAAG
CTGGGGGAAA TTCTCAAGGA TAATACCGTA ACCACCGTGC GACTTGTAAC CAACCCCGAG
AGAATGGTCA TCAACGAGTC GCTGCGGGCT CACGCCTATT TGAGTCTGTA CGATATTTCG
ACGGACCTTA TCATTGCCAA CAGAATTATT CCTGATGAGG TATCTGATCC GTATTTTCAG
TACTGGAAGG AAAATCAGCG TCTTTACCGG GGGGAAATTC ATGATAACTT CAGTCCTCTT
CCTGTTAAGG AGGTTCCTCT CTACTCACGT GAGATCTGCG GACTTGAGAC TCTTGAAAAG
CTCAGCAGGC TGTTGTATGC CGATGAGGAT CCCTCAAAGG TCTACTATAA GGAGACGACT
TTCAAGGTCA ACCAGGTGAA AAACGGATAT CAGCTCGAAC TGTTTCTCCC CGGCATTCAG
AAGGACCAGG TGCAGATCAG CAAGAAGGGC GATGAGCTTA ACGTTCGCAT CGGCAATCAT
CGGAGAAACA TCGTTCTGCC TCAGGCTCTT GCAGCCCTTA AAACTGCAGG AGCGGAGATG
GACGGAGAGC ATTTGAGAAT CAAGTTCGTT GCCCAGGCGG GGCGGTAA
 
Protein sequence
MRLILMTGKG GVGKTSMAAA TGLRCAELGY KTLVLSTDPA HSLADSFAVA LGHEPRKICE 
NLWGAELDVL EELEQNWGSV KRYISEVLQA RGLEGVQAEE LAILPGSDEI FGLVRVFRHY
KEGEYDVLII DSAPTGTALR LLSIPEVGGW YMRRLYKPFE KVAMTLRPLV EPIFRPLAGF
SLPDKEMMDV PYEFYQKIEK LGEILKDNTV TTVRLVTNPE RMVINESLRA HAYLSLYDIS
TDLIIANRII PDEVSDPYFQ YWKENQRLYR GEIHDNFSPL PVKEVPLYSR EICGLETLEK
LSRLLYADED PSKVYYKETT FKVNQVKNGY QLELFLPGIQ KDQVQISKKG DELNVRIGNH
RRNIVLPQAL AALKTAGAEM DGEHLRIKFV AQAGR
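
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be normalized before any downstream use. The following is a minimal sketch of that cleanup, with a GC-percentage helper that could be compared against the listed GC content. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used here for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from this record, as displayed.
first_line = "ATGCGTTTGA TTCTGATGAC GGGGAAAGGC GGTGTGGGAA AAACGTCTAT GGCTGCGGCA"

cleaned = clean_sequence(first_line)
print(len(cleaned))            # number of bases on this line after cleanup
print(cleaned.startswith("ATG"))  # whether the line begins with an ATG start codon
print(round(gc_percent(cleaned), 1))
```

To check the whole gene, paste all lines of the gene sequence into one string and pass it through the same two functions. The cleaned length should then equal the listed Gene Length if the record is consistent.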