Gene Paes_1958 details


Gene Information

Locus tag: Paes_1958
Symbol:
ID: 6459957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 2139769
End bp: 2140923
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 47%
IMG OID: 642725943
Product: arsenite-activated ATPase ArsA
Protein accession: YP_002016617
Protein GI: 194334757
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
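
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `span_length` and `coding_codons` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2139769
END_BP = 2140923
GENE_LENGTH_BP = 1155
PROTEIN_LENGTH_AA = 384


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 1155
    print(coding_codons(GENE_LENGTH_BP))   # 384
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.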


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.367574
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAATTA TCCTTTATCT GGGTAAAGGT GGAGTCGGCA AAACGACAGT TTCGGCTTCA 
ACAGCAACAG CGATTGCCCG TAGCGGCAAG CGGGTACTTA TTATGAGTAC GGATGTCGCT
CATAGTCTTG CCGATGCCCT TGGTGTCGAG TTGAGCGCGA CACCCGTTGA GGTTGAAAAC
AACCTTTTCG CCATGGAAGT TAATGTTCTG GCCGAAATCA GAGAGAATTG GACGGAACTC
TATTCTTATT TCTCTTCGAT TCTGATGAAT GACGGTGCCA ACGAGGTCGT TGCCGAGGAG
CTGGCTGTCG TTCCCGGCAT GGAGGAGATG ATCAGTTTGC GCTATATCTG GAAGGCTGCC
AAGTCCGGAT TGTATGATGC CATTGTTGTT GACGCCGCAC CTACCGGTGA GACGATGCGT
TTGCTTGGTA TGCCTGAATC GTATGGCTGG TACTCGGAAA AAATTGGCGG CTGGCACTCC
AAGGCGATCG GTTTTGCTGC TCCGCTTCTG AACCGGTTTA TGCCCAAGAA AAATATTTTC
AAGCTGATGC CTGAGGTGAA CGATCATATG AAGGAGCTGC ACGGCATGCT TCAGGATAAG
TCGGTTACCA CATTCAGGGT CGTTGTCAAT CCTGAAAATA TGGTGATTAA AGAGGCGCTA
CGTGTGCAGA CCTACCTTAA TCTTTTCGGC TATAAGCTCG ATGCGGTCAT TGTCAACAAG
ATTCTGCCGG CAAGTTCGTC GGATGACTAT CTCAACAGTC TTATCGCTCT GCAGCAGAAG
TATCTCAAGG TTATCGACGA CTGTTTCTAC CCGATTCCTA TTTTCAAGGC ATCTCAGGCT
ACCCGCGAAG TGATCAAAAC TGATCAGCTC TATGCACTGA GCCAGCAGAT GTTCGATGGG
CACAATCCTA TCGAAGTGCT TTATGCGGAT GATAAAACGC AGTCGATTGA AAAGATCGAT
GGCAAGTATG TGTTGAAGCT GCACATGCCA AACGTTGAAA TTACGAAGCT CAATGTCAAT
ATCAAGGGTG ACGAGCTTCT GGTTGATATC AACAACTTCA GAAAGAGCAT TGTTCTTCCC
AATATTCTTG TCGGAAGAAA AACAGAAGGT GCTGATTTCG AGGGAGGACA TCTTAACATT
ACTTTCGCGA ATTGA
 
Protein sequence
MRIILYLGKG GVGKTTVSAS TATAIARSGK RVLIMSTDVA HSLADALGVE LSATPVEVEN 
NLFAMEVNVL AEIRENWTEL YSYFSSILMN DGANEVVAEE LAVVPGMEEM ISLRYIWKAA
KSGLYDAIVV DAAPTGETMR LLGMPESYGW YSEKIGGWHS KAIGFAAPLL NRFMPKKNIF
KLMPEVNDHM KELHGMLQDK SVTTFRVVVN PENMVIKEAL RVQTYLNLFG YKLDAVIVNK
ILPASSSDDY LNSLIALQQK YLKVIDDCFY PIPIFKASQA TREVIKTDQL YALSQQMFDG
HNPIEVLYAD DKTQSIEKID GKYVLKLHMP NVEITKLNVN IKGDELLVDI NNFRKSIVLP
NILVGRKTEG ADFEGGHLNI TFAN
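
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before any computation. The sketch below shows one possible way to strip the spacing, compute GC content, and translate a CDS. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons); the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same amino-acid assignments). '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the page layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; a trailing partial codon is ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    first_blocks = "ATGAGAATTA TCCTTTATCT GGGTAAAGGT"  # start of the gene
    last_blocks = "ACTTTCGCGA ATTGA"                    # end of the gene
    print(translate(first_blocks))  # MRIILYLGKG
    print(translate(last_blocks))   # TFAN*
```

The two fragments translate to the first and last residues of the protein sequence listed above, followed by the stop codon. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be compared in the same way.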