Gene Rsph17029_4045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_4045 
Symbol 
ID4898677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1192354 
End bp1195473 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table11 
GC content68% 
IMG OID640114649 
Producthydrophobe/amphiphile efflux-1 (HAE1) family protein 
Protein accessionYP_001045895 
Protein GI126464782 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACGTT TCTTCATCGA CCGCCCGGTC TTCGCCTGGG TGATCGCCAT CGTCATCATG 
GCGCTCGGGG TCCTCTCGAT CCTGCGCCTG CCCGTATCGC AGTATCCCTC GATCGCGCCG
CCTGCGGTGG TGATCGGCGC GACCTATCCC GGCGCCTCGG CCGAGACCGT GACCGACACC
GTGACCCAGG TGATCGAGCG CGAGATGACC GGCCTCGACG GGCTGCGCTA CATCTCCTCC
TCCTCGACCT CGACCGGGGT CGCGCAGATC ACCCTGACCT TCGAACTCGG CACCGATCCC
GACATCGCGC AGGTCCAGGT GCAGAACAAG CTGAGCCAGG CGCAGGCGCT CCTGCCGCAG
GCGGTGGTGC GGCAGGGCGT GACGGTGCGG AAATCCTCGG CGGGCTTTCT CATGGTCATC
GCCATGACCT CCGACAATGG CGCCTATTCC GCCTCGGAGC TGGCGGATTA CATGTCCTCG
AACATGGTCG AGAGCATCAG CCGCCTGCCG GGCGTGGGCT CGGTGCAGGT CTTCGGCTCG
CAGCATGCGA TGCGGATCTG GCTCGATCCG AACAAGCTCA ACGCCTATGA CCTCTCGCCC
TCAGACGTGA CGGCGGCGGT TTCGGCGCAG AACGCGCAGA TCTCGGCAGG CTCGTTCGGC
GCGCGCCCCG CCCCCGAGGG CCAGCAGCTT CAGGCCACGA TCACCGCCCA GTCGCTTCTG
CAGACGCCCG AGGATTTCGA GCAGATCGTG CTCCGGGCCG AGACCGACGG AGGGCTCGTC
CTGCTGCGCG ACGTGGCGCG GGTCGAGCTC GGCGCGCAGA GCTACGAGAT CGACGGACGC
TACAACCGCA GCCCCGCCTC GGGCATGGCG ATCCAGCTCG CCTCGGGCGC CAATGCGCTC
GACACGGCCG ATGTGGTGCG CGCGAAGGTG GACGAGCTCG CGGCCTTCTT CCCCGAGGGC
ATGACCTACG AGGTGCCCTA CGACACCACC CCCTTCGTGC GCATCTCGAT CGAGGAGGTG
GTCAAGACAC TGATCGAGGC CATCGTCCTC GTCGTGCTGG TGATGTTGCT GTTCCTCCAG
AACATCCGCG CCACGCTCAT TCCCACGCTC GCGGTGCCGG TCGTGCTTCT GGGCACCTTC
GGCGTGATGG CGGCGCTCGG CTTCTCGATC AACACGCTCA CCATGCTCGC CATGGTGCTG
GCCATCGGCC TTCTGGTCGA TGACGCCATC GTGGTGGTCG AGAACGTCGA ACGGATCATG
GAGCAGGAGG GGCTCGATCC GGTGGCAGCC ACCCGCAAGA GCATGGACGA GATCTCGGGC
GCGCTCGTGG CCATCGCCAT GGTGCTCTCG GCGGTGTTCG TGCCGATGGC CTTCTTCGGC
GGCTCGACGG GCGAGATCTA CAAGCAGTTC TCCATCACCA TCGTGTCGGC CATGGCGCTC
TCGGTGCTGG TTGCGCTGAC GCTGACGCCT GCGCTCTGCG CCACGATCCT CAAGCGGGGT
CACCATTCCG CCCGGCGCGG CCCGGTGGGC TGGTTCAACC GCGGCTTCGA CGCGACCACC
CGTGGCTATG GCGGGCTCGT GGGCCGCGTG GTGCGCCGCC CGGTGCTGAT GATGGTGCTG
TTCGGCGTGA TCGTGGCGGG CATGGTGACC CTCTTTCAGC GCACCCCCAC GGCCTTCCTG
CCCGACGAGG ATCAGGGCGT GCTCATCACC CTGATCCAGA CCCCTTCGGG CGCCACGGCC
GAGCGCACGC TCGCCGCGAT CAAGGAGGTC GAGAATTACT GGCTCGAGAA GGAGGGCGAG
AACGTCACCG CCGTCTTCGG CGTGAACGGC TTCTCCTTTG CGGGGCAGGG GCAGAACATG
GGCATCGTCT TCGTGCGGCT GAAGCCGTGG GAAGAGCGTC TGCGCCCCGA CCAGAGCGTG
GCCGCCATGG CCGGGCGGGC CTTCCCGACC TTCATGGGGA TGCGCGATGC GATGGTCTTT
CCCATCGTGC CGCCGCCGGT GCTGGAGCTC GGCAACTCGA ACGGCTTCAC GCTCTTCCTG
CAGGCGCGTC AGGGCCAGAG CCACGAGGAA CTGCTCGACG CGCGCAACAT GCTGCTGGGC
CTTGCCTCGC AGAGCCCGCT GCTCGCCTCG GTGCGCCCGA ACGGGGTCGA GGATGCCTCG
CAGTTCCGCC TCGACATCGA CTGGCGCAAG GCGGGTGCGG TGGGCGTGAC CGCGGCACAG
GTCGGCGATT TCCTCAACAC CGCCTGGGCC GGGTCCTACG TCAACGACTT CCTCGACCAG
GGTCGGGTGA AGCGGGTCTA TGTGCAGGGC GAGCCCTGGG CGCGGACCGA TCCGGACGAT
CTCGACCTCT GGCGCATCCC GAACAAGGAC GGCGAGTTCG TGGCGCTCTC GACCTTCGCC
GACCAGACCT GGTTCTACGG GCCCCAGCAG GTCGCGCGCT ACAATGCCGT CCGCTCGATG
GAGATCCAGG GACAGCCGGT GCCCGGCATC TCCTCGGGCG AGGCCATGGC CGAAATGGAG
CGGCTGGCCG CCCAGCTTCC GCCGGGCTTC GCGCTCGAAT GGACGGGCCT CTCGCTCGAA
GAGCGCGAGG CGGGGGATCA GGCGAGCCTT CTGTTCCTCT TGTCGGTGGG CGCGGTCTTC
CTCTGCCTCG CCGCGCTCTA CGAGAGCTGG TCGGTGCCGA TCGCGGTCTT GCTCGCCATG
CCGGTGGGCA TCCTCGGCGC GCTGCTCGGC GCCTGGGCGG GCCATCAGGC GAACGGCGTC
TATTTCCAGG TGGGGCTTCT CACCGTGGTG GGTCTGACCG GCAAGAACGG GATCATGATC
GTCGAATTCG CCCGCAGCCG GCTTGCGCAG GGCGAACCCC TGCTCGAGGC GATCCGGCAT
GCGGCGGTGC TCCGGTTCCG CCCGATCCTG ATGACCTCGC TCGCCTTTTC GCTGGGGGTG
GTGCCGCTCG TGCTCTCGAC CGGCGCGGGG GCAGGGGCGC GGCGCGCGAT CGGCGACGGC
ATCCTCGGCG GCACGATCAC CGGCACGCTT CTGGGGATCG TTTTCGTACC TGTGTTCTTT
GTTCTGGTGA ACAGCCTCTT CCGTCGCCGC AAGACCGCTC CTGCGGCCGC GACGGCCTGA
 
Protein sequence
MGRFFIDRPV FAWVIAIVIM ALGVLSILRL PVSQYPSIAP PAVVIGATYP GASAETVTDT 
VTQVIEREMT GLDGLRYISS SSTSTGVAQI TLTFELGTDP DIAQVQVQNK LSQAQALLPQ
AVVRQGVTVR KSSAGFLMVI AMTSDNGAYS ASELADYMSS NMVESISRLP GVGSVQVFGS
QHAMRIWLDP NKLNAYDLSP SDVTAAVSAQ NAQISAGSFG ARPAPEGQQL QATITAQSLL
QTPEDFEQIV LRAETDGGLV LLRDVARVEL GAQSYEIDGR YNRSPASGMA IQLASGANAL
DTADVVRAKV DELAAFFPEG MTYEVPYDTT PFVRISIEEV VKTLIEAIVL VVLVMLLFLQ
NIRATLIPTL AVPVVLLGTF GVMAALGFSI NTLTMLAMVL AIGLLVDDAI VVVENVERIM
EQEGLDPVAA TRKSMDEISG ALVAIAMVLS AVFVPMAFFG GSTGEIYKQF SITIVSAMAL
SVLVALTLTP ALCATILKRG HHSARRGPVG WFNRGFDATT RGYGGLVGRV VRRPVLMMVL
FGVIVAGMVT LFQRTPTAFL PDEDQGVLIT LIQTPSGATA ERTLAAIKEV ENYWLEKEGE
NVTAVFGVNG FSFAGQGQNM GIVFVRLKPW EERLRPDQSV AAMAGRAFPT FMGMRDAMVF
PIVPPPVLEL GNSNGFTLFL QARQGQSHEE LLDARNMLLG LASQSPLLAS VRPNGVEDAS
QFRLDIDWRK AGAVGVTAAQ VGDFLNTAWA GSYVNDFLDQ GRVKRVYVQG EPWARTDPDD
LDLWRIPNKD GEFVALSTFA DQTWFYGPQQ VARYNAVRSM EIQGQPVPGI SSGEAMAEME
RLAAQLPPGF ALEWTGLSLE EREAGDQASL LFLLSVGAVF LCLAALYESW SVPIAVLLAM
PVGILGALLG AWAGHQANGV YFQVGLLTVV GLTGKNGIMI VEFARSRLAQ GEPLLEAIRH
AAVLRFRPIL MTSLAFSLGV VPLVLSTGAG AGARRAIGDG ILGGTITGTL LGIVFVPVFF
VLVNSLFRRR KTAPAAATA