Gene RPB_4026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4026 
Symbol 
ID3911833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4593515 
End bp4596676 
Gene Length3162 bp 
Protein Length1053 aa 
Translation table11 
GC content65% 
IMG OID637885930 
Producthydrophobe/amphiphile efflux-1 HAE1 
Protein accessionYP_487630 
Protein GI86751134 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.698151 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCAT TTTTTATCGA CAGGCCGATT TTCGCCTGGG TCGTCGCGCT GTTCATCTGC 
CTTCTCGGCG CGATCTCGAT TCCGTTTCTG CCGATCGCGC AATATCCGAT CATCGCGCCG
CCGTCGATCT CGGTGTCGAC GCAATATCCC GGCGCCTCGC CGGAGAATTT GTACAACAGC
GTCACCCGGC TGATCGAGGA AGAGCTCAAC GGCGCCAACG GCATTCTCAA TTTCGAATCG
ACCTCCGACT CGCTCGGTCA GGTCGAAATC ATCGCCAATT TCGAGCCCGG CACCGACACC
GAAATGGCGT CGGTCAACGT GCAGAACCGG ATCAAACGTA TTGAGGCGCG GCTGCCGCGC
GCCGTTTTGC AGCAGGGCAT CCTGGTCGAG GAAGCCTCCA GCGCGGTGCT GCAGATCATC
ACGCTGAGTT CGACCGACGG CTCGCTCGAC GAGGTCGGCC TCGGCGACTT CATGATCCGC
AACGTGCTCG GCGAAATCCG CCGCATTCCC GGCGTCGGCC GCGCCACGCT GTATTCGACC
GAACGCGCGC TGCGGATCTG GATCGATCCC GACAAGCTGG TCGGCTACAA TCTCACCGCC
GACGAGGTCA CCAAGGCGAT CGAGGCGCAG AACGCCCAGG TCGCGTCCGG CGCCGTCGGC
GCCGAGCCGA GCGCCAAGGG GCAGAAGACC TCGGTGCTGG TGCTGGTGAA GGGGCAGCTC
TCCTCCGCCG ACGAGTTCGG CGCGGTGGTG CTGCGCGCCA ATCCCGATGG TTCGACCGTT
CGGCTGCGCG ACGTCGCGCG GATCGAGATC GGCGGCTTCA GCTACCAGTT CAACACCCGC
CTCAACGGCA AGCCCACCGC CGGCCTCTCG GTGCTGATGT CGCCGACCGG CAACGCGCTG
GCGACCGCGA GCGCGGTCGA AGCCAAGATG AAGGAGCTGT CGCGCTTCTT CCCGGCGAAC
ATCAGCTACC AGATCCCGTA CAACATCACG CCGGTGGTCG AGGCCTCGAT CACCAAAGTG
GTGCACACGC TGCTCGAGGC GGTGGCGCTG GTGTTCGTGG TGATGTTCCT GTTCCTGCAG
AACATCCGCT ACACCATCAT TCCGACCATC GTGGTGCCGG TGGCGCTGCT CGGCACCTGC
GTGTCGCTGC TGCTGTTCGG CTATTCGATC AACATGCTGA CGATGTTCGG CATGGTGCTG
GCGATCGGCA TCCTGGTCGA CGACGCCATC GTCGTGGTCG AAAACGTCGA ACGCATCATG
GCCGAGGAGG GGCTGCCGCC GAAGGAGGCC ACCCGCAAGG CGATGACGCA GATCACCAGC
GCCATCATCG GCATCACGCT GGTGCTGATC GCGGTGTTCG TGCCGATGGC GTTCTTCCCC
GGCTCGGTCG GCATCATCTA TCGGCAGTTC TCGGTGACCA TGGTGTCGGC GATCGCGTTC
TCGGCGCTGA TGGCGTTGAC GCTGACGCCG GCGCTGTGCG CCACGCTGCT GAAGCCGGTC
GTCAAGGGCC ACGCCCATGC CGAACGCGGC TTCTTCGGCC GGTTCAACCG CATTCTCGAC
GGCACCCGCG AGCGCTATTC GCGGATCGTG CGGTGGAATC TCGGGCGAAC GGGCCGGCTG
ATGATTCTCT ACGCCGTGCT GCTCGGCGTG CTCGGCTGGG GACTGGTCAG GATGCCGGGC
GGCTTCCTGC CGATCGACGA CCAGGGCTTC ATCACCGTCG ACCTGCAGAC GCCGTCGGAT
TCGTCCTACA ACCGGACCTT CGAGGTGATC AAGAAGGTCG AGGATTATCT GCTGAAGCGC
GACGGCGTCG ACAACGTCAC CTTCCTCACC GGCTTCAGCT TCCTCGGCCA GGGCATGAAC
GCGGCGCAGG CCTTCGTCAC GCTCAAGGAC TGGTCCGAGC GCGACGGCAA ATCCTCCGCG
GCAGCGATCG TCGACGACGC CAACAAGGCG CTCGGGTCGG TGCGCGATGC GCGCATCGCG
GCGCAGCAGC CGCCGCCGAT CGACAATCTC GGCAACTCCT CGGGGTTCAG CTTCCGCCTG
CAGGACCGCG GCCAGAAGGG CTACCCCGCG TTGATCCAGG CCAGTCAGCA ACTGGTGGCG
GCGGCCAATG CCAGCCCGAT CCTCGAAAAC GTCTATGTCG AAGGCCTCCC GCCGGCGCCG
GTCATCAATC TGATGATCGA CCGCGAGAAG GCCGGCGCCT TCGGCGTCAC CTTCCAGGAC
ATCAACAATA CGATCTCGAC CAATCTCGGC TCGGCCTATG TCAACGACTT CCCGAACCGC
GGGCGGATGC AGCGCGTGAT CGTGCAGGCC GACATTCCCG ACCGGATGAA GGCCGACGAC
ATCCTGACCT ATTCGGTGAA GAACAGCCGC GGCCAGCTCG TGCCGCTGTC GTCCTTCGCG
ACAATCGAGT GGTCGAAGGG GCCGACCCAG GTGGTCGGCT TCAACTACTA TCCGGCGATC
CGCGTGAGCG GTCAGGCCCG TGCCGGCTAC ACCTCGGGCG ATGCGATCGC CGAGATGGAG
CGGCTCGCCG CGCAATTGCC CCGCGGTTTC GGTTACGACT GGACCGGCCA GTCGCTGCAG
GAGAAGCTGT CGGGCTCGCA GGCGCCGTTC CTGCTGGCGC TGTCGGCGCT GGTGGTGTTC
CTCGTGCTCG CCGCGCTTTA CGAGAGCTGG ACGATTCCGG TCGCCGTGCT GCTCACGGTG
CCGCTCGGAA TCATCGGGGC GGTCGTCGCG GCGACGACGC GCTCGCTGCC GAACGACGTG
TACTTCACGG TCGGCCTGAT CACGATCATC GGGCTCGCGG CCAAGGACGC GATTCTGATC
GTCGAATTCG CCAAGGATCT GCGCAAGGAA GGCAAGTCGC TGCGCGAAGC GACGCTGGAA
GCCTGCCACC TGCGGTTCCG CCCGATCGTG ATGACCGGCC TCGCCTTCTG CAGCGGCGTG
CTACCGATGG CGATTGCAAC GGGCGCCGGC GCCAAGAGCC AGCAGGCGCT CGGCACCAGC
GTGATGGGCG GCATGATCGC GGTGGTGGTG CTGGCGCTGC TGATGGTTCC GGTGTTCTTC
GTCGTGGTGC AGCGGCTGTT CGCCGGCGAT CGGTCCGACG ATCCGGTCAG GGCGCCCCAC
GAGGTCCCGA TGCACCCGCA ACTGCCGAAA GAATCCGCCT AG
 
Protein sequence
MPAFFIDRPI FAWVVALFIC LLGAISIPFL PIAQYPIIAP PSISVSTQYP GASPENLYNS 
VTRLIEEELN GANGILNFES TSDSLGQVEI IANFEPGTDT EMASVNVQNR IKRIEARLPR
AVLQQGILVE EASSAVLQII TLSSTDGSLD EVGLGDFMIR NVLGEIRRIP GVGRATLYST
ERALRIWIDP DKLVGYNLTA DEVTKAIEAQ NAQVASGAVG AEPSAKGQKT SVLVLVKGQL
SSADEFGAVV LRANPDGSTV RLRDVARIEI GGFSYQFNTR LNGKPTAGLS VLMSPTGNAL
ATASAVEAKM KELSRFFPAN ISYQIPYNIT PVVEASITKV VHTLLEAVAL VFVVMFLFLQ
NIRYTIIPTI VVPVALLGTC VSLLLFGYSI NMLTMFGMVL AIGILVDDAI VVVENVERIM
AEEGLPPKEA TRKAMTQITS AIIGITLVLI AVFVPMAFFP GSVGIIYRQF SVTMVSAIAF
SALMALTLTP ALCATLLKPV VKGHAHAERG FFGRFNRILD GTRERYSRIV RWNLGRTGRL
MILYAVLLGV LGWGLVRMPG GFLPIDDQGF ITVDLQTPSD SSYNRTFEVI KKVEDYLLKR
DGVDNVTFLT GFSFLGQGMN AAQAFVTLKD WSERDGKSSA AAIVDDANKA LGSVRDARIA
AQQPPPIDNL GNSSGFSFRL QDRGQKGYPA LIQASQQLVA AANASPILEN VYVEGLPPAP
VINLMIDREK AGAFGVTFQD INNTISTNLG SAYVNDFPNR GRMQRVIVQA DIPDRMKADD
ILTYSVKNSR GQLVPLSSFA TIEWSKGPTQ VVGFNYYPAI RVSGQARAGY TSGDAIAEME
RLAAQLPRGF GYDWTGQSLQ EKLSGSQAPF LLALSALVVF LVLAALYESW TIPVAVLLTV
PLGIIGAVVA ATTRSLPNDV YFTVGLITII GLAAKDAILI VEFAKDLRKE GKSLREATLE
ACHLRFRPIV MTGLAFCSGV LPMAIATGAG AKSQQALGTS VMGGMIAVVV LALLMVPVFF
VVVQRLFAGD RSDDPVRAPH EVPMHPQLPK ESA