Gene RPB_3303 details


Gene Information

Locus tag: RPB_3303
Symbol:
ID: 3911104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3775085
End bp: 3778321
Gene Length: 3237 bp
Protein Length: 1078 aa
Translation table: 11
GC content: 65%
IMG OID: 637885205
Product: hydrophobe/amphiphile efflux-1 HAE1
Protein accession: YP_486910
Protein GI: 86750414
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
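
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 3775085
END_BP = 3778321
GENE_LENGTH_BP = 3237
PROTEIN_LENGTH_AA = 1078


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_residues(gene_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))    # 3237
print(coding_residues(GENE_LENGTH_BP))  # 1078
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.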


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.110746
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.948069
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTTTT CCAAGTTCTT CATCGATCGC CCGATCTTCG CCGGCGTCTT GTCGGCCCTG 
ATCTTCCTCG CCGGCCTGCT GTCGCTGCCG GTGCTGCCGA TCTCCGAATA TCCCGAAGTC
GCGCCGCCCA CCATCGTGGT GACCGCCAAC TATCCGGGTG CCAATCCCAA GGTGATCGCC
GAAACGGTGT CGACGCCGAT CGAGGAGCAG ATCAACGGCG TCGAGAACAT GCTTTACATG
AGCAGCCAGG CGACCACCGA CGGCCGGATG ACGCTCAACG TCACGTTCCG CCTCGGCACC
GATCCTGACA AGGCGCAGCA GCTGGTGCAG AACCGCGTCT CGCAGGCCGA GCCGCGGTTG
CCGGCGGAAG TCCGCGCGCT CGGCGTCACC ACCATCAAGA GCGCGCCCGA TTTGACCATG
GTCGTGCATC TGATGTCACC GAACGACCGC TACGACATGA CTTACTTGCG CAACTACGCG
GTGCTCAACG TCAAGGACCG GCTGGCGCGG ATCGAGGGCG TCGGCCAGGT GCAGCTGTTC
GGCTCCGGCG ACTACTCGAT GCGGGTCTGG CTCGATCCGC AGAAGGTCGC CGAGCGCGGG
CTGTCGCCGA GCGACGTCGT GGCCGAGATC CGCGCCCAGA ACGTGCAGGC GGCGGCCGGG
CAGGTCGGCA GCTCGCCGAG CCAGCCGGGG CTCGATCTGC AGTTGTCGAT CAACGCCCAG
GGGCGGCTGC AGACCGAAGA GGAGTTCGGC GACATCATCG TCAAGAACGG CGACAACGGC
GAAGTGGTAC GGCTGCGCGA CGTCGCGCGG ATCGAACTCG GCGCCGCCGA CTACGCGCTG
CGTTCGCTGC TCAACAACAA ATCCGCGGTC GCGATTCCGA TCTTCCAGTC GCCGGGCTCG
AACGCGATCC AGATTTCCGA CAATGTCCGC GCGACGATGA AGGAACTGCA GCAGAACATG
CCGGACGGCG TCGCCTACGA CATCGTCTAC GATCCGACGC AATTCGTCCG CGCCTCGATC
GAGGCGGTGA TCCACACGCT GCTCGAAGCC ATCGCGCTGG TGGTGCTGGT GGTGATCCTG
TTCCTGCAGA CCTGGCGCGC CTCGATCATT CCGCTGATCG CGGTCCCGGT CTCCGTGATC
GGCACCTTCG CGGTGATGCA CGTGTTCGGA TTCTCGATCA ATGCGCTGAC CCTGTTCGGG
CTGGTGCTGG CGATCGGCAT CGTGGTCGAC GACGCCATCG TCGTGGTCGA GAATGTCGAA
CGCAACATCG AGCGCGGGCT GACCCCGAAG GACGCCGCCT ATCAGGCGAT GCGCGAAGTC
ACGGGGCCGA TCATCGCGAT CGCGCTGGTG CTGGTCGCGG TGTTCGTGCC GCTGGCCTTC
ATCACCGGCC TGACCGGGCA GTTCTACAAG CAGTTTGCCC TGACGGTGGC GATCTCGACG
GTGATCTCGG CGTTCAACTC GCTGACGCTG TCGCCGGCGC TCGCCGCGCT GCTGCTGAAG
GGCCACGACG CGCCGAAGGA TGTGCTGACG CGGTTCATGG ACAGGACGCT CGGCTGGTTC
TTCGTCCGCT TCAACCGGTT TTTCGCGCGG AGTTCGGAGG CCTATGGCGG TGGCGTCAGG
CGGGTGATCT CGCGTCGCGC GATCGGCATG GCGGTGTATC TGCTGCTGGT CGGCGTCACC
GGGTATCTGT TCCACGCGGT GCCGGGCGGC TTCGTGCCGG GCCAGGACAA GCAATATCTG
GTCGGCTTCG CGCAATTGCC GGACGGCGCC ACGCTCGACC GCACCGAAGA GGTGATCCGG
CGGATGGGCG AGATCGCCCA GCAGGAGCCG GGGGTCGAGA ACTCGATCCA GTTTCCCGGC
CTGTCGATCA ACGGCTTCAC CAACTCGTCG AACTCCGGCA TCGTCTTCAT CGGCCTGAAG
GATTTTGCGG ACCGCAAGGA CAAGTCGCTC AGCGGCAACG CGATCGCATT GTCGCTGAAC
AAGAAGTTCG CCGGCATCCA GGACGCCTTC ATCGCCATGT TCCCACCGCC GCCGGTCGCC
GGTCTCGGCA CCATCGGCGG CTTCAAACTG CAGATCGAGG ACCGTGCCGG TCTCGGCTAC
GAGGCGCTGA ACGACGCCAC CCAGGCGTTC GTCAAGAAGG CCTCGGCGCA GAAGGAGCTG
GCCGGGCTGT TCACCAGCTA CCAGATCAAC GTGCCGCAGC TCTATGCCGA CGTCGATCGC
ACCAAGGCGC GGCAGCTCAA CGTGCCGGTG ACATCGGTGT TCGACACCAT GCAGATCTAT
CTCGGCTCGC TCTACGTCAA CGACTTCAAC AAATTCGGCC GCACCTATTC GGTGCGGGTG
CAGGCCGACG CCAAATATCG GGCGCGCGCC GACGACATCG GCCAGCTCAA GGTGCGCTCC
GACAGCAACG AGATGATTCC GCTGTCGACG CTGCTGCGCG TCAAGGAGAG CGCCGGGCCG
GAGCGGGCGA TGCGCTACAA CGGCTTTCTC ACCGCCGATC TCAACGGCGG CGCGGCGCCG
GGCTATTCGA CCGGGCAGGC GCAGGACGCG ATCGCGCGCG TCGCCAAGGA GACGTTCCCG
AAAGGCATCT CCTATGAATG GACCGAGCTG ACCTATCAGG AGATCATCGC CGGAAACTCG
TCGCTGATCG TGTTTCCGGT GGCGCTGTTG CTGGTGTTCC TCGTGCTCGC CGCGCAATAC
GAGAGCCTGA CGCTGCCGCT GTCGATCATC ATGATCGTGC CGATGGGGCT GCTGGCCGCG
ATGTTCGGCG TCTGGATCAG CAAGGGCGAC AACAACGTCT TCACCCAGAT CGGGTTGATC
GTGCTGGTCG GACTGTCGGC CAAGAACGCG ATCCTGATCG TGGAGTTCGC GCGCGAACTG
GAATTCGCCG GGCGAACGCC GGTGCGGGCG GCGATCGAGG CCAGCCGGCT GCGGCTGCGT
CCGATCCTGA TGACGTCGAT GGCGTTCATC ATGGGCGTCG TGCCGCTGGT GACGTCGTCG
GGCGCGGGCT CCGAGATGCG GCACGCGATG GGCGTCGCGG TGTTCGCAGG CATGATCGGT
GTCACCGCCT TCGGCATCTT CTTCACGCCG ATGTTCTACG TGGCTCTGCG CGCGCTGGCC
GGCAACCGCC CGCTGACGCA GCACGATGAG GCGACCATCG ATCCCGTCGC GCCGAGCGCC
CAGGGATCGC GCGACTCCGC GCCCGGCGCG CATCCGGCGC CGGTCAAGCC TTCCTGA
 
Protein sequence
MNFSKFFIDR PIFAGVLSAL IFLAGLLSLP VLPISEYPEV APPTIVVTAN YPGANPKVIA 
ETVSTPIEEQ INGVENMLYM SSQATTDGRM TLNVTFRLGT DPDKAQQLVQ NRVSQAEPRL
PAEVRALGVT TIKSAPDLTM VVHLMSPNDR YDMTYLRNYA VLNVKDRLAR IEGVGQVQLF
GSGDYSMRVW LDPQKVAERG LSPSDVVAEI RAQNVQAAAG QVGSSPSQPG LDLQLSINAQ
GRLQTEEEFG DIIVKNGDNG EVVRLRDVAR IELGAADYAL RSLLNNKSAV AIPIFQSPGS
NAIQISDNVR ATMKELQQNM PDGVAYDIVY DPTQFVRASI EAVIHTLLEA IALVVLVVIL
FLQTWRASII PLIAVPVSVI GTFAVMHVFG FSINALTLFG LVLAIGIVVD DAIVVVENVE
RNIERGLTPK DAAYQAMREV TGPIIAIALV LVAVFVPLAF ITGLTGQFYK QFALTVAIST
VISAFNSLTL SPALAALLLK GHDAPKDVLT RFMDRTLGWF FVRFNRFFAR SSEAYGGGVR
RVISRRAIGM AVYLLLVGVT GYLFHAVPGG FVPGQDKQYL VGFAQLPDGA TLDRTEEVIR
RMGEIAQQEP GVENSIQFPG LSINGFTNSS NSGIVFIGLK DFADRKDKSL SGNAIALSLN
KKFAGIQDAF IAMFPPPPVA GLGTIGGFKL QIEDRAGLGY EALNDATQAF VKKASAQKEL
AGLFTSYQIN VPQLYADVDR TKARQLNVPV TSVFDTMQIY LGSLYVNDFN KFGRTYSVRV
QADAKYRARA DDIGQLKVRS DSNEMIPLST LLRVKESAGP ERAMRYNGFL TADLNGGAAP
GYSTGQAQDA IARVAKETFP KGISYEWTEL TYQEIIAGNS SLIVFPVALL LVFLVLAAQY
ESLTLPLSII MIVPMGLLAA MFGVWISKGD NNVFTQIGLI VLVGLSAKNA ILIVEFAREL
EFAGRTPVRA AIEASRLRLR PILMTSMAFI MGVVPLVTSS GAGSEMRHAM GVAVFAGMIG
VTAFGIFFTP MFYVALRALA GNRPLTQHDE ATIDPVAPSA QGSRDSAPGA HPAPVKPS
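
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the method used to produce this record: the helper names are hypothetical, the codon table is the standard one (translation table 11 shares its codon-to-amino-acid assignments and differs in which start codons are permitted), and alternative start codons are not handled. The example input is the first line of the gene sequence above.

```python
# Codons enumerated in the conventional T, C, A, G order, paired with the
# standard amino acid string ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [x + y + z for x in BASES for y in BASES for z in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def strip_layout(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = strip_layout(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = strip_layout(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGAATTTTT CCAAGTTCTT CATCGATCGC CCGATCTTCG CCGGCGTCTT GTCGGCCCTG"
print(translate(first_line))  # MNFSKFFIDRPIFAGVLSAL
```

The output matches the first 20 residues of the protein sequence. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field.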