Gene RPD_3330 details


Gene Information

Locus tag: RPD_3330
Symbol:
ID: 4023840
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3689464
End bp: 3692592
Gene Length: 3129 bp
Protein Length: 1042 aa
Translation table: 11
GC content: 60%
IMG OID: 637963534
Product: hydrophobe/amphiphile efflux-1 HAE1
Protein accession: YP_570455
Protein GI: 91977796
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
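
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 3689464
end_bp = 3692592

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 3129
print(protein_length)   # 1042
```

Under these assumptions the computed values agree with the 3129 bp gene length and 1042 aa protein length listed in the record.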


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.66718
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTCGTT TCTTCATCGA CCGCCCGATC TTCGCGTGGG CCATTTCGCT CTTCATCATG 
CTGGCCGGCG GCATCTCATT GGTGTCCCTC CCCATATCGC AATATCCAGA CGTCGCCCCC
GTGACGGTCT CGATCACGGC GAACTATCCC GGCGCGACAC CTGAGCGGCT CTACGACGGC
GTCACACGCA TCATCGAGGA GGAGCTCAAT GGCATCCCGG GGATGATGTA TTTCGAGTCG
ACCAACGACG CCGGCGGCGC GGCCCAGATC ACCGTGTCGT TCGGCCCCGG CTATGATCCG
TCAAAAGCAA CGATCGCGGT TCAGAATCGA ATCAAGCGTA TCGAAGCGCG CCTTCCACGC
GCCGTTGTGC AGCAGGGCGT GCTTGTCGAG GAGGCGAGCA CTGCGTTTCT GATGTTCGTC
ACACTCAGCG CGACCGGCGC CGGCACAACC GAGATCGAAC TCGGCGACAT CGCCGCCCGC
CGCGTCACCG GAGAGCTGCG GCGCGTCCCC GGCGTCGGCC GCGCGACGCT CTATTCGAGC
GAGAAAGCGA TGCGCATCTG GATTGATCCC GACAAGCTCA TCGGCCTCAA TCTGACCGCG
GGCGACGTGA CCTCGGCGGT TGAAGGCCAG AACGCTCAGA TTGCGTCCGG CATGATCGGG
GCACAGCCTG CCAAAAAAGG GCAGCGCATT GCAGCGAACG TCCTGGTCAA GGGCCAACTC
ACTACGGTGA AGGAGTTCGA AGAAATCGTC CTGCGCGCCA ACCCGGACGG ATCTATCGTT
CGTCTTCGCG ACGTCGCCGA GGTCGAATTG GGAGGGCAGA GTTACCTGCA GCAGACCCGT
CAGGATGGTA AACCCTCCGC CGGCATCGGC ATCCAGCTTG CACCGGGCGC CAACGCTCTC
GCGACCGCGA CCGGCGTCCG CGCCAAAATC GCAGAGCTTC AAAAAACCCT GACCAACGTC
AAGCTGCAGG TGCCCTACGA CACCACTCCG TTCGTCCAGG TCTCGATCAA GCAGGTGTTG
ATGACCTTGC TCGAGGCGAT GGTTCTGGTC TTCGCGGTGA TGTTCCTGTT TCTCCAGAAC
ATCCGGTACA CGATCATTCC CACCATCGTC GTTCCGATCG CGTTGCTCGG CACCTGCGCG
GTGATGTTGA TGCTCGGATT CTCCATCAAT GTTCTGACGA TGTTCGGAAT GGTCCTGGCG
ATCGGCATCC TCGTCGACGA CGCCATCGTC GTCGTCGAGA ACGTCGAACG CATCATGAAG
GATGAAGGTC TTCCCCCCAG GGAGGCGACA TTCAAAGCCA TGAGCCAGAT CTCCGGAGCA
GTCGTCGGCA TCACCGTTGT TCTGATTTCC GTTTTCGTGC CGCTCGCCTT CTTCCCAGGC
TCGGTCGGCG TGATCTACCG ACAATTCTCG GTAGCGATGG CGACCTCGAT CGCTTTCTCG
GCGTTTCTCG CTCTGTCGCT GACGCCTGCG CTTTGCGCGA CGCTGCTGAA GCCGGTCGAC
AAGGCGCATG GCCACAGCCA GCGCGGATTC TTCGGGTTGT TCAACAGATT CTTCGATGCG
ACCAGCCGGC GTTACGTCGG AGGAGTCAGC AGCGTCGTGA GGCGGCCGGT TCGTTCGCTC
CTCGTCTACA GCGTTCTCAT CGCAGCGATG GTTTTTGGCT TCAACCGTCT GCCATCCGGA
TTCCTGCCGG GAGAAGACCA AGGCTACCTG ATCGTCGATG TTCAAACACC GCCGGAATCT
TCGACGGAAC GAACTCTCGA TATCATCAAG CAGATCGAGG CTCATTTTTC GTCCGAGCGC
GCCGTAGACA GCTACACGAC CGTCGGAGGG TACGGCTTTT CCGGCCAAGG ACAGAACACG
GCAATTGCCT TCATCAACTT GAAGGATTGG TCGGAGCGCG GGGCAAATGA CAGCGCGCAG
TCGATCGGCG ATCGCGCGAA TGCGTTTCTG AGCACACTCC CCGACGCTAT CGCAATTTCG
CTCGCGCCGC CGCCGATCGA GTCGCTCGGC AATTCGGCCG GCTTCACCTT CCGTCTCCAG
GACAAGGAGC AAAAGGGGTA TGCCGCTCTC GCTGCGGCAC GAGATCAGCT GCTGAATGCG
GCGACACAGA GCCCGGTCCT GCAAGGCGTC TATGTCGAGG GTTTGCCGAC CGCGCCCCAG
ATCGAGATGC TGATCGACCG CGAGAAGGCC AATGCACTCG GCGTCACCTT CGCAGCGATC
AACCAGGCGC TATCGACCAG CTTGGGATCG ACCTACGTCA ACGATTTCCC GAACAACGCA
CGCATGCAGC GCGTGATCGT GCAAGCCGAT GCAAATCGGC GGATGACGGC CGAGGACATT
TTGCAACTGT CGGTGAGGAA CAGCAAGAAT CAGATGGTTC CGCTGCAATC GGTGGCACAG
GTGAAATGGT CGATGGGACC GTCACAAGTC GTCGGCTTCA ACGGGTTTCC AAGTATCAAA
TTCAGCGGCA GCGCGGCGCC AGGATACGCG AGCGGCGACG CGATGGCCGA GATGGAGCGC
CTTGCCGCGG AGCTTCCGAG CGGCTTCGAC TACGCATGGT CGGGCCAATC ATTGCAGGAA
AAGCTGTCGG GCAGCCAAGC CATCTACCTG CTCGTGCTAT CGCTGCTGTG CGTCTTCCTG
TGCCTGGCGG CGCTTTACGA AAGCTGGTCG ATTCCGTTCG CCGTGCTGCT CGTGGTGCCG
ACCGGCGTGA TCGGATCGGT GTTCGCGATG CTGCTGCGCG ATATGCCGAA TGACATCTAC
TTCAAGGTCG GCCTGATTAC CGTGATCGGG CTGTCGGCGA AGAACGCGAT CCTGATTATT
GAGATCGCAA AGGACCTCGT CGCCCAGGGC GTAGCCTTCG GCGAGGCGGC GATCGAAGCC
TGCCGACGTC GCTTCCGTCC CATCTTGATG ACATCGCTCG CATTCATTCT GGGTGTTTTG
CCGCTCGCGA TTGCAACGGG CGCAGGCTCC AATAGTCAGC GCGCCATCGG CACCGGCGTT
TTCGGCGGCA TGTTGACGGC GACGGCGCTC GCGATCTTCT TCACCCCTGT CTTGTACGTC
CTGATCACCA GCACATTCGG GAAGCGCAAA GGCAAAAGCA GTGGTGGTGA GACGCCAGTT
CCTGAGTAA
 
Protein sequence
MTRFFIDRPI FAWAISLFIM LAGGISLVSL PISQYPDVAP VTVSITANYP GATPERLYDG 
VTRIIEEELN GIPGMMYFES TNDAGGAAQI TVSFGPGYDP SKATIAVQNR IKRIEARLPR
AVVQQGVLVE EASTAFLMFV TLSATGAGTT EIELGDIAAR RVTGELRRVP GVGRATLYSS
EKAMRIWIDP DKLIGLNLTA GDVTSAVEGQ NAQIASGMIG AQPAKKGQRI AANVLVKGQL
TTVKEFEEIV LRANPDGSIV RLRDVAEVEL GGQSYLQQTR QDGKPSAGIG IQLAPGANAL
ATATGVRAKI AELQKTLTNV KLQVPYDTTP FVQVSIKQVL MTLLEAMVLV FAVMFLFLQN
IRYTIIPTIV VPIALLGTCA VMLMLGFSIN VLTMFGMVLA IGILVDDAIV VVENVERIMK
DEGLPPREAT FKAMSQISGA VVGITVVLIS VFVPLAFFPG SVGVIYRQFS VAMATSIAFS
AFLALSLTPA LCATLLKPVD KAHGHSQRGF FGLFNRFFDA TSRRYVGGVS SVVRRPVRSL
LVYSVLIAAM VFGFNRLPSG FLPGEDQGYL IVDVQTPPES STERTLDIIK QIEAHFSSER
AVDSYTTVGG YGFSGQGQNT AIAFINLKDW SERGANDSAQ SIGDRANAFL STLPDAIAIS
LAPPPIESLG NSAGFTFRLQ DKEQKGYAAL AAARDQLLNA ATQSPVLQGV YVEGLPTAPQ
IEMLIDREKA NALGVTFAAI NQALSTSLGS TYVNDFPNNA RMQRVIVQAD ANRRMTAEDI
LQLSVRNSKN QMVPLQSVAQ VKWSMGPSQV VGFNGFPSIK FSGSAAPGYA SGDAMAEMER
LAAELPSGFD YAWSGQSLQE KLSGSQAIYL LVLSLLCVFL CLAALYESWS IPFAVLLVVP
TGVIGSVFAM LLRDMPNDIY FKVGLITVIG LSAKNAILII EIAKDLVAQG VAFGEAAIEA
CRRRFRPILM TSLAFILGVL PLAIATGAGS NSQRAIGTGV FGGMLTATAL AIFFTPVLYV
LITSTFGKRK GKSSGGETPV PE
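
The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11, GTG is one of the permitted start codons and is read as methionine when it initiates translation; elsewhere it encodes valine. The sketch below illustrates this on the first 30 bases and the last 9 bases of the gene sequence. The helper `translate_cds` and the constant names are hypothetical, and the codon table is built from the standard NCBI amino-acid ordering (bases in T, C, A, G order), which is assumed here rather than taken from this record.

```python
from itertools import product

# Amino acids for table 11 in NCBI order (codons enumerated over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by table 11; each is read as M at position 0.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, spaces kept as in the record.
print(translate_cds("GTGACTCGTT TCTTCATCGA CCGCCCGATC"))  # MTRFFIDRPI
# Last 9 bases of the gene sequence, ending in the TAA stop codon.
print(translate_cds("CCTGAGTAA"))  # PE
```

The two outputs match the first ten residues and the last two residues of the protein sequence above.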