Gene RPD_3643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3643 
Symbol 
ID4024157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4064637 
End bp4066817 
Gene Length2181 bp 
Protein Length726 aa 
Translation table11 
GC content66% 
IMG OID637963847 
Productheavy metal translocating P-type ATPase 
Protein accessionYP_570767 
Protein GI91978108 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2217] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
[TIGR01511] copper-(or silver)-translocating P-type ATPase
[TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase
[TIGR01525] heavy metal translocating P-type ATPase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.545825 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGAAG CCGTAGCTGC CGATCAGACC AGGATGCGCG TCGAAGGCAT GGACTGCGCG 
TCCTGCGCGG TGAAGATCGA GAACGCGCTG CGCCGCGTGC CGGGAGTAAC CGACGTTGCC
GTCTCGGTGG CGGCCGGAAA CGTGACCGTC AGACACGATG GCGCCGATTT CGGCACATTG
GCCGCCCGGA TCACCGCATT GGGCTACAAG GTCACCCCTG CCGACGAGAA GGGTATCGAT
GCGCCTCACG GCCATGGCGG CGCATCTCAC GACCATGACG ACGGCCACTC GCACGGTCAC
GACCACGGTC CCACCGACGG GTCATGGTGG CGGACATCCA AAGGCATCCT GACAATCGCT
TCCGGAACGG CGCTAGGAGC GGCGTTTTTG ATCGGGAAGA TCGCGCCGGC CACGGAAAAG
TGGGCGTTTC TCGTCGCGAT GCTGGTCGGG TTGATCCCGA TCGGTCGCCG CGCTTTCTCC
GCCGCGATAT CGGGTACGCC GTTCTCGATC GAGATGCTGA TGACGATCGC GGCGATCGGC
GCCGTCTTCA TCGGCGCCAC GGAAGAGGCC GCCGCCGTCG TATTCCTGTT CCTCATCGGC
GAACTGCTCG AAGGAGTCGC CGCAAGCAGG GCGCGCGCAA GCATTCAGGA TTTGACCAAG
CTCGTCCCGA AGACAGCGAG ATTGGAGGAG AACGGACAGG TCCGCGAGGT GCAGGCGGAC
ACGCTGGTGG TAGGATCGAT GATCCAAGTT AGACCAGGCG ACCGGATTCC GGCTGACGGC
GTCATCGTCT CCGGAGAGAG CTCCGTCGAC GAGGCCCCCG TGACCGGCGA AAGCACGCCC
GTCCGGAAGG GCCGCGACGA GAGCCTGTTC GCGGGCACCA TCAACGGCGA CGGGTTGCTC
AGTATACGCG TGACCGCTGC CGCTGCGGAC AACACGATCG CGCGGGTGGT CCGGCTAGTC
GAGGAGGCCC AAGAATCCAA GGCGCCCACG GAACGTTTCA TCGACCGCTT TTCCCGGTAC
TACACGCCCG GCGTCGTGGT GGTCGCATTC CTGGTAGCCG TCGTCCCGCC GCTGCTGTTC
GGCGGAATCT GGAGCGAATG GGTCTACAAG GGGCTGGCGA TCCTTTTGAT CGGATGTCCG
TGCGCCCTGG TGATCTCGAC GCCCGCGGCG ATCGCCGCCA GCCTCTCCGC CGGTGCACGC
CGTGGACTGC TGCTCAAGGG CGGCGTCGTC CTCGAGCAGA TGGGCAAGAT CACGCTCGCG
TGCTTCGACA AGACCGGGAC GCTGACGGCA GGTAAACCGG TCGTGACCGA CGTGCTGTCG
TTCGGCGCTG CGGAGAACGA GGTGCTGCGC CTCGCGGCGG CGCTGGAGAC GGGGTCCAGC
CATCCGCTGG CGATCGCGAT CCTCGCCGAA GCCTCGAAGC GGGGCATCGT GCTGCCGTCC
ACATCGGGAT CGCAGGCGTT CGGAGGCAAG GGCATCAAGG CGACCGTGGA TGGCCAGCAG
ATCTTCCTAG GGTCGCCGAA GGCGGCCGAG GAAATCGGTG TTCTGGATCT CGAGCATCAA
GGCCGTGTCG CGGCTCTCAA CGACGAGGGC AAGACCGTTT CGATCTTGAC GGTCGGAACG
ACGATGGCGG GAGCGATCGC CATGCGCGAC GAGCCCCGCC CCGACGCCGC AAAGGGGCTT
AAGCTGCTGA CCGACGCCGG AATCCGGACG GTCATGCTGA CCGGAGACAA CCGCCGGACC
GCCACGGCGA TCGGCAAGTC GCTCGGGATC GAAGTGCAGG CCGGGCTCCT GCCGCAGGAC
AAGCAGCGGA TCGTCGCGGA TTTTCAAGCA CAGGGTTTCA CCGTTGCGAA GATCGGCGAC
GGCATCAACG ACGCGCCGGC GCTCGCCGCC GCCGATGTCG GGATCGCGAT GGGCGGCGGC
ACCGACGTCG CCCTGGAGAC CGCCGACGCC GCCGTTCTTC ACGGCAGGGT CGCCGATGTG
GCCGCGATGG TCGACCTCTC GAAACGCACG ATGCTCAACA TCAAGCAGAA CATCACGGTC
GCGCTCGGCC TAAAGGCGGT CTTCCTCGTC ACTACGGTCA TCGGCCTCAC CGGCCTGTGG
CCGGCGATCC TCGCCGACAC CGGTGCCACG GTGCTCGTCA CGCTGAACGC CCTCCGGCTG
CTGAAGCCGT CGAAAATCTA G
 
Protein sequence
MPEAVAADQT RMRVEGMDCA SCAVKIENAL RRVPGVTDVA VSVAAGNVTV RHDGADFGTL 
AARITALGYK VTPADEKGID APHGHGGASH DHDDGHSHGH DHGPTDGSWW RTSKGILTIA
SGTALGAAFL IGKIAPATEK WAFLVAMLVG LIPIGRRAFS AAISGTPFSI EMLMTIAAIG
AVFIGATEEA AAVVFLFLIG ELLEGVAASR ARASIQDLTK LVPKTARLEE NGQVREVQAD
TLVVGSMIQV RPGDRIPADG VIVSGESSVD EAPVTGESTP VRKGRDESLF AGTINGDGLL
SIRVTAAAAD NTIARVVRLV EEAQESKAPT ERFIDRFSRY YTPGVVVVAF LVAVVPPLLF
GGIWSEWVYK GLAILLIGCP CALVISTPAA IAASLSAGAR RGLLLKGGVV LEQMGKITLA
CFDKTGTLTA GKPVVTDVLS FGAAENEVLR LAAALETGSS HPLAIAILAE ASKRGIVLPS
TSGSQAFGGK GIKATVDGQQ IFLGSPKAAE EIGVLDLEHQ GRVAALNDEG KTVSILTVGT
TMAGAIAMRD EPRPDAAKGL KLLTDAGIRT VMLTGDNRRT ATAIGKSLGI EVQAGLLPQD
KQRIVADFQA QGFTVAKIGD GINDAPALAA ADVGIAMGGG TDVALETADA AVLHGRVADV
AAMVDLSKRT MLNIKQNITV ALGLKAVFLV TTVIGLTGLW PAILADTGAT VLVTLNALRL
LKPSKI