Gene RPB_3871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3871 
Symbol 
ID3911675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4424838 
End bp4427564 
Gene Length2727 bp 
Protein Length908 aa 
Translation table11 
GC content69% 
IMG OID637885772 
Productheavy metal translocating P-type ATPase 
Protein accessionYP_487475 
Protein GI86750979 
COG category[P] Inorganic ion transport and metabolism
[S] Function unknown 
COG ID[COG2217] Cation transport ATPase
[COG3350] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
[TIGR01511] copper-(or silver)-translocating P-type ATPase
[TIGR01525] heavy metal translocating P-type ATPase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGACA CAGAGCACAC CGGGGGCAGC ACGGCTGTCA AGGATGCGGG TTGCGGATGC 
TCGGCGGAGG CAGCTGTTCC TGCGCCGCCT GCGGCGTCGT CCTGTTGCGG CCGCCACGCG
AGCGATCAGC CGATCTCACC TGCGTCCGCC AAGGCGATCG ATCCCGTCTG CGGCATGACC
GTCGACCCGG CGTCCAGCAA GCATCGCTTC GACCACGCCG GCACCACCTA TCACTTCTGC
TGCGCCGGAT GCCGCACCAA ATTCGCGGCC GACCCGGAAG GCATTCTGGC CAAAGCCGCC
AGGCCGACTG CGGCGCCCAA GCCCGCAGCG CAATTGCATC AACTGACCGA CTTCGCCGCA
CCGTCGTCCT GCTGCGGCGG GCATGATCAT GCCGCACATC ACCACGATCA CGGCGCGACT
GCAGCAGCCG ACGGCAAGGT GATCGATCCG GTCTGCGGCA TGAAAGTGGA CCCGGCGACC
ACGCCGCATC GGTTCGATTA CCAGGGCCAG ACCTATTTCT TCTGCGCGGC GAGCTGCCGC
GGCAAATTCG CCGCCGATCC CGTCTCCTAT CTCGACAAGT CGAAAGCGAA GCCGGCGCCG
GTGGTGCCGG AGGGCACGAT CTACACCTGC CCGATGGATC CGCAGATCCG TCAGGTCGGC
CCGGGAAGCT GCCCGATCTG CGGCATGGCG CTCGAACCCG AGCTGGTGTC GCTCGACGCG
CCGCCGAATG CCGAACTGAT CGACATGACC CGGCGTTTCT GGATCGGCCT CGCGCTGGCG
CTGCCGGCGG TCGTGCTGGA AATGGGCGGC CACCTCGTTG GTGGTCACGG CTTGATCGAT
CCTGCGCTGT CGAATTGGAT CCAGCTCGCC TGCGCGACGC CGGTCGTGCT GTGGGCCGGC
TGGCCGTTCT TCGTCCGCGG CTGGCAGTCG CTGGTCACGC GCAACCTCAA CATGTTCACG
CTGGTCGCGA TGGGCACCGG CGTCGCCTAT GTCTACAGCC TGGTCGCGAC GCTGGCGCCG
CAGCTGTTCC CACCGGCCTT CCAGAGCCAT GGCGGCAGCG TGCCGGTGTA TTTCGAGGCC
GCAGCGGTGA TCACCGTGCT GGTGCTGCTC GGCCAGGTGC TCGAACTGCG CGCCCGCGAG
GCGACCTCCG GCGCGATCAA GGCACTGCTC ACCCTCGCGC CGAAATCCGC ACGGCGAATC
GCGGCGGACG GGACCGACCA CGAGGTCGAG ATCGACAGCC TCGCGGTCGG CGACAAGCTC
CGCGTCCGCC CCGGCGAAAA GGTGCCGGTC GACGGCATCA TCCTCGAGGG ACGCTCGACG
CTCGACGAAT CGCTGGTGAC CGGCGAATCG ATGCCGGTGA CGCGCGAGGC CGGCGGCAAG
GTCGTCGCCG GAACGCTCAA CCAGGCCGGC GGCTTCGTGA TGCGCGCCGA ACAGGTCGGC
CGCGACACCG TGCTGTCGCA GATCGTGCAG ATGGTGGCGC AGGCGCAGCG ATCGCGCGCG
CCGATCCAGC GCGTCGCCGA CCTCGTCGCG GGCTGGTTCG TGCCGGCCGT GGTGCTGGCC
GCGCTGGTCG CGTTCGCCGC CTGGGCGACC TTCGGCCCCG AGCCGCGGCT GACCTTCGCG
CTGGTCGCCG CGGTCAGCGT GCTGATCATC GCCTGCCCGT GCGCGCTCGG TCTCGCCACG
CCGATGTCGA TCATGGTCGG CGTCGGCCGC GGCGCGCAGG CCGGGGTGCT GATCCGCAAC
GCCGAAGCGC TGGAGCGGAT GGAGAAGGTC GACACGCTGG TGATCGACAA GACCGGCACG
CTGACCGAAG GCAAGCCGAA GGTGGTGGCG ATCGCCACCG CGAGCGGCTT CGACGAGGCC
GAATTGCTGC GGCTCGCGGC CGGCGTCGAA CGCGCCAGCG AACACCCGCT CGCGCACGCC
ATCGTCACCG CCGCCAGTGA TCGCAGTCTC GACCTCGCGC CGGTCGACGG GTTCGAGGCG
CCGACCGGCA AGGGCGCGAC CGGCCGGGTC GCCGGCCGTT CGGTCGTGAT CGGCAACGTC
GACTATCTCG CTTCGCTCGG GATCGACACG GCCCCGCTCG CCGACATAGC GGAGCATCAC
CGCGCCGACG GAGCCACCGT GGTCAGCGTC GGCATCGACG GGCGGTTCGC CGGATTGATC
GCGATTGCCG ATCCGGTGAA AGCATCGACG CCGGACGCGT TGCGCGCACT CGCCGCCGAA
GGCCTCCGGG TGATCATGCT GACCGGCGAC AACCGAACCA CGGCGCAGGC CGTCGCCCGA
AAACTCGGCA TCGCCGATGT CGAAGCCGAG GTCCTGCCCG ATCAGAAGAG CGCGGTGGTC
GAAAAGCTGC GCAAGCAGGG CCGCATCGTC GCGATGGCCG GCGACGGCGT CAACGACGCT
CCGGCATTGG CCGCCGCCGA TGTCGGCATC GCGATGGGAA CCGGAACCGA CGTGGCGATG
GAGAGCGCGG GCATCACGCT GCTGAAGGGC GACCTTGGCG GCATCGTTCG CGCGCGAAAA
CTGTCCCAGG CGACGATGCG CAATATCCGG CAGAATCTGT TCTTCGCCTT CATCTACAAT
TCGGCTGGAA TCCCGATCGC CGCCGGCATT CTGTATCCGA GCTTCGGCCT GCTGCTGTCG
CCGATCATCG CCGCAGCGGC AATGTCGCTG TCGTCGGTCA GCGTGATCGG CAATGCCCTG
CGGCTGCGCG CCACCTCGCT GGATTGA
 
Protein sequence
MHDTEHTGGS TAVKDAGCGC SAEAAVPAPP AASSCCGRHA SDQPISPASA KAIDPVCGMT 
VDPASSKHRF DHAGTTYHFC CAGCRTKFAA DPEGILAKAA RPTAAPKPAA QLHQLTDFAA
PSSCCGGHDH AAHHHDHGAT AAADGKVIDP VCGMKVDPAT TPHRFDYQGQ TYFFCAASCR
GKFAADPVSY LDKSKAKPAP VVPEGTIYTC PMDPQIRQVG PGSCPICGMA LEPELVSLDA
PPNAELIDMT RRFWIGLALA LPAVVLEMGG HLVGGHGLID PALSNWIQLA CATPVVLWAG
WPFFVRGWQS LVTRNLNMFT LVAMGTGVAY VYSLVATLAP QLFPPAFQSH GGSVPVYFEA
AAVITVLVLL GQVLELRARE ATSGAIKALL TLAPKSARRI AADGTDHEVE IDSLAVGDKL
RVRPGEKVPV DGIILEGRST LDESLVTGES MPVTREAGGK VVAGTLNQAG GFVMRAEQVG
RDTVLSQIVQ MVAQAQRSRA PIQRVADLVA GWFVPAVVLA ALVAFAAWAT FGPEPRLTFA
LVAAVSVLII ACPCALGLAT PMSIMVGVGR GAQAGVLIRN AEALERMEKV DTLVIDKTGT
LTEGKPKVVA IATASGFDEA ELLRLAAGVE RASEHPLAHA IVTAASDRSL DLAPVDGFEA
PTGKGATGRV AGRSVVIGNV DYLASLGIDT APLADIAEHH RADGATVVSV GIDGRFAGLI
AIADPVKAST PDALRALAAE GLRVIMLTGD NRTTAQAVAR KLGIADVEAE VLPDQKSAVV
EKLRKQGRIV AMAGDGVNDA PALAAADVGI AMGTGTDVAM ESAGITLLKG DLGGIVRARK
LSQATMRNIR QNLFFAFIYN SAGIPIAAGI LYPSFGLLLS PIIAAAAMSL SSVSVIGNAL
RLRATSLD