Gene Paes_1962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1962 
Symbol 
ID6459943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp2143808 
End bp2145025 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content48% 
IMG OID642725947 
Productarsenite-activated ATPase ArsA 
Protein accessionYP_002016621 
Protein GI194334761 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000471213 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATTT TAACTTTTAC AGGTAAAGGC GGGGTTGGAA AAACCAGTGT TTCAGCAGCG 
ACAGCTGTCC GGCTATCCCA GCTTGGGTAC CGTACTCTGG TACTGTCGAC GGATCCTGCC
CACAGTCTCT CGGATTCTTT TAATCTTTCT CTGGGGGCTG AACCTACCAA GATCAAGGAG
AATTTGCATG CTATTGAAGT CAATCCCTAT GTTGATTTAA AAGAAAACTG GCAGGCTGTC
CAGAAATATT ATACCAGAGT GTTTGCGGCT CAGGGCGTTT CGGGCGTTGT TGCTGATGAA
ATGACGATTC TTCCCGGTAT GGAAGAACTT TTTTCTCTGC TGAGAATTAA ACGATATAAG
TCAGCCGGTC TCTACGATGC GCTTGTTCTC GATACTGCTC CAACCGGTGA AACACTGCGA
TTGCTCTCCC TTCCGGACAC CTTGTCCTGG GGAATGAAAG CGGTGAAAAA TGTCAATAAG
TACATTATGA AGCCGCTCAG CAAGCCGCTT TCTAAAATGT CTGACAAGAT TGCCTACTAT
ATTCCTCCGG AAGATGCTAT CGATTCAGTC GATCAGGTTT TCGACGAACT TGAGGATATC
CGCGATATTC TTACCGATAA TCTCAATTCG ACGGTCAGGC TCGTGATGAA CGCCGAAAAA
ATGTCCATCA AGGAGACTAT GCGCGCGCTG ACCTATCTGA ACCTCTACGG TTTCAATGTC
GATATGGTGC TGGTCAATAA AATGCTCGAT ACCCAGGAAG ACAGCGGTTA TCTTGAAAAG
TGGAAGAGCA TTCAGCAGAA ATATCTTGGA GAGATTGAAG AAGGGTTCGC GCCGCTTCCT
GTCAAGAAGC TGAAGATGTA TGATCAGGAA ATTGTCGGCC TGGAGGCTCT TGAACGCTTC
GCGAAGGATA TGTATGGTGA CGATGATCCG TCTGAAGTCG TCTACGACGA GCCGCCGATC
AAGTTCGAGC GCAGCGGTGA TATTTATGAG GTGCAGTTGA AGCTCATGTT TGCCAACCCT
GTCGATATCG ATGTCTGGGT AACGGGGGAT GAACTGTTTG TGCAGATTGG AAGCCAGCGC
AAGATTATCA CGCTGCCGAT CAGTCTCACC GGTCTCGATC CGGGCGATGC GGTATTCAAG
GACAAATGGC TGCATATTCC CTTTGACCTG AACCGTCAGG GGCAGCATCA GAACAGGAAA
GAGTATAACA AAGTGTAG
 
Protein sequence
MRILTFTGKG GVGKTSVSAA TAVRLSQLGY RTLVLSTDPA HSLSDSFNLS LGAEPTKIKE 
NLHAIEVNPY VDLKENWQAV QKYYTRVFAA QGVSGVVADE MTILPGMEEL FSLLRIKRYK
SAGLYDALVL DTAPTGETLR LLSLPDTLSW GMKAVKNVNK YIMKPLSKPL SKMSDKIAYY
IPPEDAIDSV DQVFDELEDI RDILTDNLNS TVRLVMNAEK MSIKETMRAL TYLNLYGFNV
DMVLVNKMLD TQEDSGYLEK WKSIQQKYLG EIEEGFAPLP VKKLKMYDQE IVGLEALERF
AKDMYGDDDP SEVVYDEPPI KFERSGDIYE VQLKLMFANP VDIDVWVTGD ELFVQIGSQR
KIITLPISLT GLDPGDAVFK DKWLHIPFDL NRQGQHQNRK EYNKV