Gene RPD_3780 details


Gene Information

Locus tag: RPD_3780
Symbol:
ID: 4024296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4217749
End bp: 4220913
Gene Length: 3165 bp
Protein Length: 1054 aa
Translation table: 11
GC content: 63%
IMG OID: 637963984
Product: hydrophobe/amphiphile efflux-1 HAE1
Protein accession: YP_570902
Protein GI: 91978243
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
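
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is assumed to be a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4217749, 4220913)
print(length, protein_length(length))  # 3165 1054
```

Under those assumptions the computed values agree with the listed gene length (3165 bp) and protein length (1054 aa).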


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0221065
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTCAT TCTTTATCGA CAGGCCGGTC TTCGCCTGGG TCATCGCGCT GTTCATCTGT 
TTGATCGGCG CGATTTCGAT TCCGTTCCTG CCGGTCGCGC AGTATCCGAT CATCGCGCCG
CCGTCGATTT CGGTGTCGAC GCAGTATCCC GGCGCATCGC CCGAGAACCT GTACAACAGC
GTCACCCGGC TGATCGAAGA AGAGCTCAAC GGCGCCAATG GCATTCTGAA CTTCGAATCG
ACCAGCGACT CGCTCGGTCA GGTCGAGATC ATCGCCAATT TCGTGCCCGG CACCGACACC
GGCATGGCGT CGGTCGACGT GCAGAACCGA ATCAAGCGCG TCGAGGCGCG GTTGCCGCGG
GCGGTGCTGC AGCAGGGCAT CCTGGTCGAG GAAGCCTCGA GCGCGGTGCT GCAGATCATC
ACGCTGAGTT CGACCGACGG CTCGCTCGAC GAAGTAGGCC TCGGCGACTT CATGATCCGC
AACGTGCTGG GCGAAGTTCG CCGCATCCCG GGCGTCGGCC GCGCCACGTT GTACTCGACG
GAACGCGCGA TGCGGATCTG GATCGATCCG GACAAGCTGG TCGGCTACAA CCTGACCGCC
GACGACGTCA CCAAGGCGAT CCAGGCGCAG AACGCCCAGG TCGCCTCGGG AAGCATCGGC
GCCGAACCCA GCAGCAAGGG CCAGAAGATT TCCGCGCTCG TGCTGGTCAA GGGCCAGCTC
ACATCGCCGG ATGAGTTCGG CGCGATCGTG CTGCGCGCCA ATCCCGATGG CTCGACTGTG
CGGCTGCGCG ACGTCGGCCG GGTCGAGGTC GGCGGCTTCA GCTATCAGTT TAACACCCGC
CTCAACGGCA AGGCGACGGC AGGCCTGTCG GTATTGTTAG CGCCGACTGG CAATGCGCTC
GCCACCGCCA GCGCGGTCGA AGAAAAGATG AAGGAGCTGT CGCGCTTCTT CCCGGCGAAC
ATCAGCTACC AGATTCCCTA CAACATCACC CCGGTGGTCG AAGCCTCGAT CACCAAAGTG
CTGTACACGC TGATCGAAGC GGTGGTGCTG GTTTTCATCG TGATGTTCAT ATTCCTGCAG
AACATCCGTT ACACAATCAT TCCGACCATT GTGGTGCCTG TGGCGCTGCT GGGCACCTGC
CTGTCGCTGC TGCTGTTCGG CTATTCGATC AACATGCTGA CGATGTTCGG CATGGTGCTG
GCGATCGGCA TTCTGGTCGA CGATGCGATC GTCGTGGTCG AAAACGTCGA ACGCATCATG
GCCGAGGAGG GGCTGCCGCC GAAGGAAGCG ACCCGCAAGG CGATGACGCA GATCACCAGC
GCGATCATCG GCATCACGCT GGTGCTGATC GCGGTGTTCG TGCCGATGGC GTTCTTTCCC
GGCTCGGTCG GCATCATCTA TCGGCAGTTC TCGGTGACGA TGGTGTCGGC GATCGCGTTT
TCGGCGTTGC TGGCGCTATC GCTGACGCCG GCGCTGTGCG CTACGCTGCT GAAGCCGGTG
GTGAAAGGCC ATGCCCACGC TGAACGCGGC TTCTTCGGCC GGTTCAATCG CATTCTCGAC
GGCACGCGCG AGCGTTATTC CAGTATCGTC CGGTGGAATC TCGGGCGAAC CGGCCGGCTG
ATGGTCATCT ACGCGGTGCT CGTCGGCGTT CTTGGATGGG CGCTCGTCAA GATGCCCGGC
GGCTTTTTGC CGGTGGACGA CCAGGGCTTC GTCACCGTCG ATCTGCAGAC GCCGTCGGAC
TCGTCTTACA ACCGCACCTA CGATGTGGTG AAGCAGGTCG AGGAGTATCT GCTCAAGCGC
GACGGCGTCG ACAACGTCAC CTTCCTCACC GGCTTCAGCT TTCTCGGCCA GGGCATGAAC
GCGGCGCAAG CCTTCGTCAC CCTCAAGCAC TGGTCGGACC GCGGCGCAAA AGACAGCGCT
TCTGCGATCG TCGACGACGC CAACAAATCG CTCAGCTCGA TTCGCGACGC CCGTATCGCC
GCGCAACAGC CGCCGCCGGT CGACAACCTC GGCAACTCGT CGGGCTTCAG CTTCCGCCTG
CAGGATCGTG GCCAGAAGGG CAACGCCGCG TTGGTACGGG CCAGCGAACA GCTCGTCGCC
GCCGCCAACA AGAGCCCGAT TCTGCACAAG GTCTACGTCG AAGGCCTGCC CCCGGCGCCG
GTCGTAAATC TGATGATCGA CCGTGAGAAG GCCGGCGCCT TCGGCGTCAC CTTCGAAGAC
ATCAACAACA CCATCTCGAC CAATCTCGGC TCGGCCTACA TCAACGACTT CCCCAATCGC
GGCCGTATGC AACGCGTCAT CGTGCAGGCC GATATCTCGG ACCGCATGAA GGCCGAGGAA
ATCCTCGCTT ATTCGGTGAA GAACAGCCGC GGTCAGCTCG TGCCGCTGTC GTCCTTTGCG
ACGATCGAGT GGTCGAAGGG GCCGACGCAG ATCGTCGGCT TCAATTACTA TCCAGCCGTT
CGCATCAGCG GCGAGGCGCG GCCCGGCTAC ACCTCGGGCG ATGCGATCGG TGAGATGGAG
CGGCTCGCCG GGCAGCTGCC GCGCGGCTTC GGTTACGATT GGACCGGCCA GTCGCTACAG
GAGAAGCTGT CGGGTTCGCA GGCGCCGTTC ATCCTCCTGC TGTCGGCGCT GATGGTGTTC
CTGGTGCTCG CCGCGCTGTA CGAAAGCTGG ACGATTCCGC TGACGGTGCT GCTTGCCGTG
CCGCTCGGCA TCACCGGATC GGTTATCGCC GCGACGATAC GTTCTCTACC GAACGACGTG
TATTTCACCG TCGGGCTGAT CACGATCATC GGATTGGCCG CCAAGGACGC CATTCTGATC
GTCGAATTCG CCAAGGACCT GCGTAAGGAA GGCAAGACGC TGCGTGAGGC GACGATGGAG
GCCTGTCATC TGCGATTTCG GCCGATCCTG ATGACCGGGC TCGCGTTCGT CAGCGGCGTT
CTGCCGATGG CGATCGCCAC TGGCGCTGGC GGCAAGAGCC AGCAGGCGCT CGGTTCCGGC
GTGATGGGCG GCATGATCGC CGTGGTGGTG CTGGCGCTGT TGATGGTTCC GGTATTCTTC
GTCGTGGTGC AGCGGCTGTT CGCCGGCGAC CGGTCGGACG ATCCGGTCAA AGCCGGCGTG
AAGCACGAGG TGCCAGCGGA GATTCAGCCA AGAGATGCCC GCTAA
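
The gene sequence above is laid out in blocks of 10 bases, so it needs to be joined before any analysis such as a GC-content check. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked, multi-line sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence above.
blocks = "ATGCCCTCAT TCTTTATCGA"
seq = clean_sequence(blocks)
print(len(seq), gc_fraction(seq))  # 20 0.4
```

Applying the same two functions to the full sequence text would give a figure to compare against the GC content listed in the Gene Information section.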
 
Protein sequence
MPSFFIDRPV FAWVIALFIC LIGAISIPFL PVAQYPIIAP PSISVSTQYP GASPENLYNS 
VTRLIEEELN GANGILNFES TSDSLGQVEI IANFVPGTDT GMASVDVQNR IKRVEARLPR
AVLQQGILVE EASSAVLQII TLSSTDGSLD EVGLGDFMIR NVLGEVRRIP GVGRATLYST
ERAMRIWIDP DKLVGYNLTA DDVTKAIQAQ NAQVASGSIG AEPSSKGQKI SALVLVKGQL
TSPDEFGAIV LRANPDGSTV RLRDVGRVEV GGFSYQFNTR LNGKATAGLS VLLAPTGNAL
ATASAVEEKM KELSRFFPAN ISYQIPYNIT PVVEASITKV LYTLIEAVVL VFIVMFIFLQ
NIRYTIIPTI VVPVALLGTC LSLLLFGYSI NMLTMFGMVL AIGILVDDAI VVVENVERIM
AEEGLPPKEA TRKAMTQITS AIIGITLVLI AVFVPMAFFP GSVGIIYRQF SVTMVSAIAF
SALLALSLTP ALCATLLKPV VKGHAHAERG FFGRFNRILD GTRERYSSIV RWNLGRTGRL
MVIYAVLVGV LGWALVKMPG GFLPVDDQGF VTVDLQTPSD SSYNRTYDVV KQVEEYLLKR
DGVDNVTFLT GFSFLGQGMN AAQAFVTLKH WSDRGAKDSA SAIVDDANKS LSSIRDARIA
AQQPPPVDNL GNSSGFSFRL QDRGQKGNAA LVRASEQLVA AANKSPILHK VYVEGLPPAP
VVNLMIDREK AGAFGVTFED INNTISTNLG SAYINDFPNR GRMQRVIVQA DISDRMKAEE
ILAYSVKNSR GQLVPLSSFA TIEWSKGPTQ IVGFNYYPAV RISGEARPGY TSGDAIGEME
RLAGQLPRGF GYDWTGQSLQ EKLSGSQAPF ILLLSALMVF LVLAALYESW TIPLTVLLAV
PLGITGSVIA ATIRSLPNDV YFTVGLITII GLAAKDAILI VEFAKDLRKE GKTLREATME
ACHLRFRPIL MTGLAFVSGV LPMAIATGAG GKSQQALGSG VMGGMIAVVV LALLMVPVFF
VVVQRLFAGD RSDDPVKAGV KHEVPAEIQP RDAR
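
The protein sequence can be related back to the gene sequence by translating codons. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code for elongation codons. It is an illustration only: it does not handle alternative start codons or ambiguous bases, and the `translate` helper is a hypothetical name, not part of any tool behind this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGCCCTCAT TCTTTATCGA CAGGCCGGTC"))  # MPSFFIDRPV
```

The printed residues match the first block of the protein sequence, and the final codons of the gene (`GCC CGC TAA`) translate to `AR` followed by a stop, matching the end of the protein.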