Gene Pnap_3731 details

Gene Information

Locus tag: Pnap_3731
Symbol:
ID: 4689565
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 3972724
End bp: 3974526
Gene Length: 1803 bp
Protein Length: 600 aa
Translation table: 11
GC content: 70%
IMG OID: 639836749
Product: arsenite-activated ATPase ArsA
Protein accession: YP_983948
Protein GI: 121606619
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
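
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption that matches the reported 1803 bp) and that the last codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3972724, 3974526)
print(length, protein_length(length))  # prints: 1803 600
```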


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.759868
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0489134
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAACG CTACAAATTC CACCCCCGGC TTTCTCCTTG CGCCCACGCG CTTTGTGTTT 
TTCACCGGCA AGGGCGGCGT CGGCAAGACC TCGCTGTCCA CCGCCACGGC GATTGCGCTG
GCCGACGCGG GCCGGCGCGT GCTGCTGGTC AGCACCGACG CGGCGTCGAA CCTCGATGAA
ATGCTGGGCA TGCCGATGTC CAACCAGCCC GCGCAAGTGC CCGGCGTGCC GCGCCTGCGC
ATGCTCAACA TCGACCCGGA CGCCGCCGCC GAGGCCTACC GGCTGCGCGT GCTGGAGCAA
CTGGGGCTGG ACGCCAGCGA CGACGAGCGC AAGACCGTGC GCGAGCAGTT GTCAGGCGCC
TGCACCACCG AGATCGCCGC GTTCGACGAA TTCGCCGCGC TGCTGGCCGG CGAAGGCGAA
GGCGATGGCG CGGGCAGCGG CTACGACCAC GTCATCTTCG ACACCGCGCC GACCGGCCAC
ACGCTGCGCC TGCTGAGCCT GCCCAAGGCC TGGAGCGGCT TTCTGGCCGG CAACGACCGG
GGCGCGTCGT GCCTGGGTCC GCATTCGGGC CTGAAGATGC AGGAAGCGCG CTTCAACGCC
GCGCTGGCCG CGCTGAGCGA CGCAAAGCTG ACCACCGTCG TCCTCGTCAC CCGGCCCGAC
CCGCGCCCGA TGCAGGAGGC CGCGCGCACC GCCGAAGAGC TGCGCACGCT GGGTCTTTCC
AACCAGCGGC TGGTGATCAA CGGCGTGTTC CATGCCAGCC GCCCCGACGA CCCGACCGCC
CGCGCCCTCG AAGCGCTGGG CCTGCAGGCC ATCGCGCAGA TGCCCGATGC GCTGGCCGGC
CTGCCGCGCG ACGAGGTGCC GCTGCGCGCC TTCGACACCG TGGGCTTGAG CGCCCTGCGC
GCGCTGCTGG GTGGCGGAGC GATTCCGGCA AGCGCGCCGG TTGCGCTGGC CGCCGAGTTG
CCGGCCGAGC CCTTGAGCCG GCTGGCCGAC GAACTGGCCG CGATGGGCCA CGGGCTGATC
ATGGTCATGG GCAAGGGCGG CGTCGGCAAG ACCACGATTG CCAGCGCGCT GGCCGTGGGG
CTGGTGCAGC GCGGCCACAG CGTTCACCTG ACGACCACCG ACCCGGCCGC GCATGTGGCC
GAAACCCTGA ACGGCAGCCT GCCGAACCTG AAGGTCGGCC GCATCGACCC CAGGGCGGAA
ACCGAAGCCT ACATTGCCAA GATCATGGCG ACGCGCGGCA AGGCGCTCGA CGAACAGGGA
CGGGCGCTGC TGCTCGAAGA CCTGCAATCG CCCTGCACCG AAGAAGTCGC CGTGTTCCAC
GCCTTCAGCC GCGTGGTGAA CGAGGCGCGC AGCGCTTTCG TGGTGCTCGA TACCGCGCCC
ACCGGCCACA GCCTGCTGCT GATGGACGCC ACCGGCGCCT ACCACCGGCA GATGCTGCAG
CAGTATGAAA GCAGCTCCAA CGCCATGCAC CTCATCACGC CGCTGATGCG GCTGCAGGAT
GCGTCGATGA CGCATGTCAT TCTCGTCACG CTGCCCGAAG TCACGCCCGT CAGCCAGGCA
GCCGCGCTAC AGGACGATTT GCGCCGCGCG AAGATCGAGC CCTGGGCCTG GGTCATCAAC
AAAAGCATCG CCGCCACCGG CACGAACGAC CCGCTGCTAA AGGCCCGGCT GGCCGGCGAG
CACCGGCAGG CCGCGCGCAT TGCTGGCGGA CTGGCGCAGC GAACCTTCGT GCTGCCGTGG
CTGCCTGAGT CGCCGGTGGG GGTGCGGGCG CTGGAGGCAT TGGCGGCATT CAATCCTGTT
TAG
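
The GC content reported for this gene is the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows; `gc_fraction` is a hypothetical helper, and it simply skips the spaces and line breaks used in the 10-base blocks above. How the database rounds the value is not stated in this record.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# First two 10-base blocks of the gene sequence: 7 of 20 bases are G or C.
print(gc_fraction("ATGAACAACG CTACAAATTC"))  # prints: 0.35
```

Pasting the full gene sequence into the call gives the value for the whole gene.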
 
Protein sequence
MNNATNSTPG FLLAPTRFVF FTGKGGVGKT SLSTATAIAL ADAGRRVLLV STDAASNLDE 
MLGMPMSNQP AQVPGVPRLR MLNIDPDAAA EAYRLRVLEQ LGLDASDDER KTVREQLSGA
CTTEIAAFDE FAALLAGEGE GDGAGSGYDH VIFDTAPTGH TLRLLSLPKA WSGFLAGNDR
GASCLGPHSG LKMQEARFNA ALAALSDAKL TTVVLVTRPD PRPMQEAART AEELRTLGLS
NQRLVINGVF HASRPDDPTA RALEALGLQA IAQMPDALAG LPRDEVPLRA FDTVGLSALR
ALLGGGAIPA SAPVALAAEL PAEPLSRLAD ELAAMGHGLI MVMGKGGVGK TTIASALAVG
LVQRGHSVHL TTTDPAAHVA ETLNGSLPNL KVGRIDPRAE TEAYIAKIMA TRGKALDEQG
RALLLEDLQS PCTEEVAVFH AFSRVVNEAR SAFVVLDTAP TGHSLLLMDA TGAYHRQMLQ
QYESSSNAMH LITPLMRLQD ASMTHVILVT LPEVTPVSQA AALQDDLRRA KIEPWAWVIN
KSIAATGTND PLLKARLAGE HRQAARIAGG LAQRTFVLPW LPESPVGVRA LEALAAFNPV