Gene RPB_2239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2239 
Symbol 
ID3909022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2563693 
End bp2566839 
Gene Length3147 bp 
Protein Length1048 aa 
Translation table11 
GC content64% 
IMG OID637884134 
Producthydrophobe/amphiphile efflux-1 HAE1 
Protein accessionYP_485855 
Protein GI86749359 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
[TIGR01131] ATP synthase subunit 6 (eukaryotes),also subunit A (prokaryotes) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGTT TTTTCATCGA ACGGCCGGTG TTCGCATGGG TGATTGCGAT CACCATCATG 
CTCGGCGGGC TGCTTGCGCT GCAGACGCTG CCGATCGCGC AATATCCCCA GATCGCCCCG
ACCACGGTGC GCATCACCGG GACCTATGCC GGCGCCGACG CGCAGACGGT GGAGAATTCG
GTCACCAAGG TGATCGAGCA GGGCATGACC GGGATCGACC ATCTCGACTA CATGACCTCG
ACCTCGACCT CCACCGGGCA GTCGCAGATC ACGCTGACCT TCACCGCCGC GGCCGATCCG
GACGTGGCGC AGATGCAGGT GCAGAACAAG CTCCAATTGG TGACCGCGCT TCTGCCGCAG
ATCGTCCAGA ACACCGGTCT CTCGGTCACG AAGTCCTCCA CCGGCTTCCT GATGGTGATC
GCCTTCGTTT CGACCGACGG CAAGCTGACA TCGATCGATC TCGCCGACTA CGTCAACAGC
ACGCTCAACG ATACGCTGAA GCGGATCGAA GGCGTCGGCG ACACCCAGTT GTTCGGCTCC
GGCTATGCGA TGCGGATCTG GCTCGATCCG GACAAGCTGG CGAAATACGC GCTGATGCCG
AGCAACGTCG CGACTGCAGT CGAGGCTCAG AACACCCAGG TCTCGGCCGG CCAGCTCGGC
GGTCTTCCAT CCCGCAAGGG GCAGCAGCTC AACGCCACGG TCACCGCGCG AAGCCGGCTA
CAGACCCCCG AGCAGTTTCG CAACATCATC CTGAAGAGCA CGACCGACGG CTCACTGGTG
CGCCTCAACG ACGTCGCCAC CGTTGAATTG GGCGCCGAGA GCTACACGAC GACGGCCGCC
TTCGACGGCA TGCCGGCCGC CGGTCTCGGC GTGAATCTCG CGACCGGCGC CAACGCCCTC
GATACCGCGA AGAACGTCCA GTCGACGATC TCCCGGCTGT CGGCGACGTT CCCGCAGGGC
GTCTCGGTGG TCTATCCCTA CGACACCACG CCGTTCGTTC GACTGTCGAT CGAGGAGGTC
GTGAAGACGC TGATCGAGGC GATCGCGCTG GTGTTCGTCG TGATGTTCGT GTTCCTGCAG
AACATCCGTG CCACTCTGAT TCCGACCATC GCCGTCCCTG TCGTGCTGCT CGGGACGGTC
GGTGTGCTGT CGGCGTTCGG CTATTCGATC AACACGTTGA CGATGTTCGC GATGGTGTTG
GCGATCGGGC TGCTGGTCGA CGACGCCATC GTCGTTGTCG AGAACGTCGA GCGGGTGATG
CAGGAAGAGG GGCTGTCGCC CAAGGAGGCG ACCCGCAAGT CGATGGACGA GATCACCGGC
GCGCTGGTCG GAATTGCCAC GGTGCTGTCG GCGGTGTTCA TCCCGATGGC GTTCTTCGGC
GGCTCTGTCG GTGTGATCTA CCGGCAATTC TCGGTGACCA TCGTCACGGC GATGGTGCTG
TCGGTGATCA CCGCCCTGGT CCTGACGCCG GCGCTGTGCG CGACGATCCT GCGACCGCCG
CAGGCCCATG CGACCGGCAA AGGGCTGTTC GGCTGGTTCA ACCGCACGTT CGATCGCAGC
GCGCAGGCCT ATCGGAATGG CGCGCAGGGC GTGATCGCGC GATCCTTGCG GTTCGGATTG
CTGTTCGTGG CGATCTCGGT AGGCGTGGGA CTGATGTTCA TGCGCATTCC GAGCTCGTTC
CTGCCGCAGG AGGATCAGGG CGTCTTGATC ACCAGCGTGC AACTGCCGGT CGGCGCCACG
CAGGACCGCA CGTTGCGCGC GCTCGAACAG GTTCGCGAAT ACTACGCGAC AAAGGAAAAG
GACGCCGTCG ACAGCGTCTT CTATACGGCG GGCTTCGGCT TCTCCGGCCA GGGCCAGAAT
ATCGGCATCG CCTTCGTCAA GTTGAAACCT TTCGATCAGC GAAAGTCGGC GGCCTTGAGT
GCGCAGGCGG TGGCCGGTCG CGCGATGATG GCGTTTCGGC AGATCAAGGA CGGCATGGTG
TTCGCGTTGG CGCCGCCGGC GATCCAGGGC ATGGGCAACT CGAACGGATT CGATTTCTAC
CTGCAGGACG TCAACGGCGC CGGCCATGCG AAACTGATCG AAGCTCGCAA CCAGTTGCTG
GGCGCAGCGT CCCAGAGCAA GCAACTGGCG AATACACGGC CCAATGGGCA GGAGGACGAG
CCGCAATTCT CGGTCACGAT CGACCAGGAG AAGGCGAGTG CGCTCGGTGT CGGCCTCGCG
GATATCAACA CCACGCTGTC GACGGCATGG GGCAGCGACT ACGTCAACGA TTTCATCGAC
CGCGGCCGCG TCAAGAAGGT CTATCTGCAA TCTGACAGGA ATTTCCGGAT GCAGCCGGAT
GACATCGGCC GCTGGTACGT GCGCAACTCG TCCGGCGTGA TGGTGCCGTT CTCGGCCTTC
GCATCCGGTC AGTGGAGCTT CGGATCGCCG CGGCTGGAAC GCTACAATGG ATCGGCGGCG
GTCGAGATTC AGGGCGAGGC TGCCGCAGGC GTCAGCTCCG GAACGGCGAT GGACGAGATC
GACACGCTGG TCAAGCAACT GCCCTCCGGC TTCGGCCACC AATGGACCGG CCTGTCGGCG
CAGGAACGGT TGTCGGGCAG TCAGGCGACG TCGCTCTATG CGATCTCGAC ATTGGTGGTG
TTTCTTTGCC TCGCCGCGCT GTACGAGAGC TGGTCGATCC CGCTCGGCGT GATGTTGGCG
GTGCCGATCG GCATCTTCGG CGCGTTGCTG GCGGCGCTGC TGTTCGGCCA GACCAACGAC
GTCTATTTCA AGGTCGGTCT GTTGACCACG ATCGGCCTGG CGGCGAAGAA CGCGATCCTG
ATCGTCGAGT TCGCGATCGA GCGGCAGGCT GCGGGCCAGC CTTTGGTCGA GGCGACGCTG
GAAGCGGCGC GACAGCGGCT GCGGCCGATT CTGATGACGT CGTTCGCCTT CATCCTCGGC
GTCACGCCGC TCGCGATCGC GTCGGGTGCG GGCTCTGGCG CCCAGAATTC CATCGGCATC
GGTGTGATGG GAGGCATGAT CGCGGCCACC GTGCTCGGCA TCTTCTTCGT GCCTTTGCTG
TATGTCGGCG TGCGCCGGCT GTTCGATCGC AAGTCGGCGA CGGACGAGAC TTCCAAAAGC
GATCCGGCGA AGGGGACGAC GGGATGA
 
Protein sequence
MSRFFIERPV FAWVIAITIM LGGLLALQTL PIAQYPQIAP TTVRITGTYA GADAQTVENS 
VTKVIEQGMT GIDHLDYMTS TSTSTGQSQI TLTFTAAADP DVAQMQVQNK LQLVTALLPQ
IVQNTGLSVT KSSTGFLMVI AFVSTDGKLT SIDLADYVNS TLNDTLKRIE GVGDTQLFGS
GYAMRIWLDP DKLAKYALMP SNVATAVEAQ NTQVSAGQLG GLPSRKGQQL NATVTARSRL
QTPEQFRNII LKSTTDGSLV RLNDVATVEL GAESYTTTAA FDGMPAAGLG VNLATGANAL
DTAKNVQSTI SRLSATFPQG VSVVYPYDTT PFVRLSIEEV VKTLIEAIAL VFVVMFVFLQ
NIRATLIPTI AVPVVLLGTV GVLSAFGYSI NTLTMFAMVL AIGLLVDDAI VVVENVERVM
QEEGLSPKEA TRKSMDEITG ALVGIATVLS AVFIPMAFFG GSVGVIYRQF SVTIVTAMVL
SVITALVLTP ALCATILRPP QAHATGKGLF GWFNRTFDRS AQAYRNGAQG VIARSLRFGL
LFVAISVGVG LMFMRIPSSF LPQEDQGVLI TSVQLPVGAT QDRTLRALEQ VREYYATKEK
DAVDSVFYTA GFGFSGQGQN IGIAFVKLKP FDQRKSAALS AQAVAGRAMM AFRQIKDGMV
FALAPPAIQG MGNSNGFDFY LQDVNGAGHA KLIEARNQLL GAASQSKQLA NTRPNGQEDE
PQFSVTIDQE KASALGVGLA DINTTLSTAW GSDYVNDFID RGRVKKVYLQ SDRNFRMQPD
DIGRWYVRNS SGVMVPFSAF ASGQWSFGSP RLERYNGSAA VEIQGEAAAG VSSGTAMDEI
DTLVKQLPSG FGHQWTGLSA QERLSGSQAT SLYAISTLVV FLCLAALYES WSIPLGVMLA
VPIGIFGALL AALLFGQTND VYFKVGLLTT IGLAAKNAIL IVEFAIERQA AGQPLVEATL
EAARQRLRPI LMTSFAFILG VTPLAIASGA GSGAQNSIGI GVMGGMIAAT VLGIFFVPLL
YVGVRRLFDR KSATDETSKS DPAKGTTG