Gene Rleg_3352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3352 
Symbol 
ID8014234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3359198 
End bp3362398 
Gene Length3201 bp 
Protein Length1066 aa 
Translation table11 
GC content62% 
IMG OID644825911 
Producttransporter, hydrophobe/amphiphile efflux-1 (HAE1) family 
Protein accessionYP_002977138 
Protein GI241206042 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.176726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00729316 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACATAT CCAGATTCTT TGTCGACCGC CCGGTCTTTG CCGGTGTTCT TTCGGTCCTC 
ATCCTGGTCG CCGGCCTGAT CGGCCTGCGC GCGCTGCCGA TTTCCGAATA TCCGGAGGTC
GTGCCGCCGT CGATCGTCGT GCGCGCCACC TATCCCGGCG CCAACCCAAG CGTCATCGCC
GAAACGGTGG CAACGCCGCT CGAAGAGCAG ATCAACGGCG TCGAAGGCAT GCTCTATATG
GCTAGCCAGG CGACATCGGA CGGCGTGCTC AACGTCACCG TCACCTTCAA GCTCGGCACC
GACCCTGATA AGGCGCAGCA GCTCGTGCAG AACCGCGTTT CGCAGGCCGA ACCGCGCCTG
CCGGCGGAAG TCCGTTCGCT CGGCATCACC ACAGTCAAGA GTTCGCCCAA CTTCATTATG
GTCGTCAACC TCGTCTCTGA CGGGAACAGT CACGACATCA CCTATCTCCG CAACTACGCG
ACCTTGAACG TCAAGGATCG GCTCGCCCGC ATTGCAGGCG TCGGTCAGGT GCAGGTCTTC
GGCGCCGGCG ACTATTCCAT GCGTGTCTGG ATCGACCCGC AGAAGGCCGC CGAGCACAAT
CTTGCCGCCA GCGACATCAG CAGCGCGATC AGCTCCCAGA ACATCCAGGC CGCCGCCGGT
ATCATCGGCG CATCGCCGAG CCAACCGGGT GTGGACCTGC AGCTCAACGT CAATGCCCAG
GGCCGCCTGC GCACGCCTGA GGAATTTGGC AACATCATCG TCAAGACGGG CGCCAGTGGC
GAGATCACCC GCCTTCGCGA TGTCGCCCGC ATCGAGCTCG GTGCTGCGGA CTACACCCTG
CGTTCGCTGC TTGACGGCAA GCCGGCCGTC GCAGTTGCTG TGCTGCAGGC GCCGGGTTCG
AACGCGATCG AGATCGCGGA CAATGTGAAC GCGACCATGG ATCAGTTGCA GCTCGCCATG
CCTGAGGGCG TCAAGTACGA GATCGTCTAC GATACGACGA AATTCGTGCG TGCCTCGATC
GAGAAGGTCA TCGACACGCT GCTCGAGGCC ATTGCGCTGG TCGTCCTCGT CGTCATCCTG
TTCCTGCAGA CGTGGCGCGC CTCGATCATC CCGCTGATCG CGGTTCCGGT ATCGATCATC
GGCACCTTCG CGGTGATGTA TGTCTTCGGC TTCTCGATCA ACGCGCTCAG TCTGTTCGGC
CTGGTGCTTG CGATCGGTAT CGTCGTCGAC GACGCGATCG TGGTGGTCGA AAACGTCGAG
CGCAATATCG AGCATGGCCT GTCGCCGCGG GCTGCCACCT ACAAGGCAAT GAAGGAAGTC
TCCGGTCCGA TCGTCGCGAT CGCGCTGGTC CTCGTCGCGG TCTTCGTGCC GCTTGCCTTC
ATCTCCGGCC TGTCGGGTCA GTTTTACCGC CAGTTCGCGC TGACGATCGC AATCTCGACC
GTCATCTCGG CCTTCAACTC ACTCACCCTG TCTCCGGCAC TGGCAGCCCT TCTCCTGAAG
GGCCATGATC AGCCGAAGGA TTGGCTGACG CGGTTCATGG ACGCCATCTT CGGCTGGTTC
TTCCGCGGCT TCAACCGTGT CTTCGGCGCG GGCTCGAATG CCTACGGCAA GGGCGTGGGC
GGGCTGTTGT CGCGCAAGAG CATCGTCATG GTGATCTATC TGGCACTGGT CGGTGCGACC
TACAGTCTCT TCAGTACTGT TCCCGGCGGC TTCGTGCCAT CGCAGGACAA GCAGTATCTG
ATCGGCTTCG CCCAGCTGCC GGATGCCGCA AGCCTCGACC GCACGGAAGA CGTCATCAAA
CGGATGACCG ACATCGCGCT GGCGCAGCCG GGCGTTGCCA ATGCGATCGC CTTCCCGGGC
CTGTCGATCA ATGGCTTCAC CAACTCCTCG AATGCAGGCA TCGTCTTCGT GACGCTGAAG
GACTTCGAGG AGCGCAAGAC GCCTGATCTC TCGGGCGGCG CAATCGCCAT GGCGCTGAAC
CAGAAGTTCG GCGTCATCCA GGATGCCTTC ATCGCCATGT TCCCGCCCCC GCCGGTCAAT
GGTCTCGGCA CGACCGGCGG CTTCAAGCTG CAGATCGAGG ATCGTGCCGG CCTCGGCAAC
CAGGCGCTCG ACGAAGCCAC GAAGGCCGTG CTTGCAAAGG CCTACCAGAC GCCTGAACTC
GCCGGGCTGT TCTCCAGCTT CCAGATCAAC GTGCCGCAGC TCTATGCCGA TCTCGACCGT
GCCAAGGCCG AGCAGCTCGG GGTTTCCGTC ACCGACGTCT TCCAGACGCT GCAGATCTAT
CTCGGTTCGC TCTATGTCAA CGACTTCAAC GCCTTCGGCC GCACCTACAG CGTCCGTGTG
CAGGCCGATG CGAAATTCCG CGCCCAGCCG GAAGATATCG GCCAGTTGAA GGTCCGTTCG
GCATCGGGTG AGATGATCCC ACTTTCGGCC CTTCTGAAGG TGGAGCCGAG CACCGGTCCG
GAACGCGCGA ACCGCTACAA CGGCTTCCTT GCCGCCGATA TCAACGGCGG TCCGGCACCG
GGCTTTTCGT CCGGCCAGGC GCAGGCGGCA ATCGAGAAGA TCCTTCACGA GACCCTGCCT
GCGGGCATCG ACTTCGAATG GACGGATCTG ACCTATCAGC AGATCCTGGC CGGCAATTCG
AGCATCGTCG TCTTCCCCCT GGCACTGTTG CTCGTCTTCC TCGTGTTGGC CGCCCAGTAT
GAAAGCCTGA CGCTGCCGCT TGCGATCATC ATGATCGTGC CGATGGGCGT GCTGGCCGCG
CTGACCGGCG TCTGGCTCAC CGGTGGAGAC AACAACATCT TCACCCAGAT CGGTCTTGTG
GTGCTTGTCG GTCTATCGGC GAAGAACGCG ATCCTGATCG TGGAATTCGC CCGCGAACTG
GAGTTCGAGG GAAGAACACC GCGGGAGGCC GCAATCGAGG CCAGCCGCCT TCGCCTTCGC
CCGATCCTCA TGACCTCCCT TGCCTTCATC ATGGGTGTCG TGCCGCTCGT CGTCTCTACA
GGCGCCGGCG CGGAAATGCG CGCGGCCATG GGTGTCGCGG TCTTCTCCGG CATGATCGGC
GTGACCTTCT TCGGCATCTT CATGACGCCG GTGTTCTACG TGCTGCTACG GCGGCTGACG
GGTAACCGTC CGCTCGTCCA GCACAAGCCG GACGAACACA AGGAAGAAGA GGCGGAGGTC
ATCCGGCTCG CGGCGGAATA A
 
Protein sequence
MNISRFFVDR PVFAGVLSVL ILVAGLIGLR ALPISEYPEV VPPSIVVRAT YPGANPSVIA 
ETVATPLEEQ INGVEGMLYM ASQATSDGVL NVTVTFKLGT DPDKAQQLVQ NRVSQAEPRL
PAEVRSLGIT TVKSSPNFIM VVNLVSDGNS HDITYLRNYA TLNVKDRLAR IAGVGQVQVF
GAGDYSMRVW IDPQKAAEHN LAASDISSAI SSQNIQAAAG IIGASPSQPG VDLQLNVNAQ
GRLRTPEEFG NIIVKTGASG EITRLRDVAR IELGAADYTL RSLLDGKPAV AVAVLQAPGS
NAIEIADNVN ATMDQLQLAM PEGVKYEIVY DTTKFVRASI EKVIDTLLEA IALVVLVVIL
FLQTWRASII PLIAVPVSII GTFAVMYVFG FSINALSLFG LVLAIGIVVD DAIVVVENVE
RNIEHGLSPR AATYKAMKEV SGPIVAIALV LVAVFVPLAF ISGLSGQFYR QFALTIAIST
VISAFNSLTL SPALAALLLK GHDQPKDWLT RFMDAIFGWF FRGFNRVFGA GSNAYGKGVG
GLLSRKSIVM VIYLALVGAT YSLFSTVPGG FVPSQDKQYL IGFAQLPDAA SLDRTEDVIK
RMTDIALAQP GVANAIAFPG LSINGFTNSS NAGIVFVTLK DFEERKTPDL SGGAIAMALN
QKFGVIQDAF IAMFPPPPVN GLGTTGGFKL QIEDRAGLGN QALDEATKAV LAKAYQTPEL
AGLFSSFQIN VPQLYADLDR AKAEQLGVSV TDVFQTLQIY LGSLYVNDFN AFGRTYSVRV
QADAKFRAQP EDIGQLKVRS ASGEMIPLSA LLKVEPSTGP ERANRYNGFL AADINGGPAP
GFSSGQAQAA IEKILHETLP AGIDFEWTDL TYQQILAGNS SIVVFPLALL LVFLVLAAQY
ESLTLPLAII MIVPMGVLAA LTGVWLTGGD NNIFTQIGLV VLVGLSAKNA ILIVEFAREL
EFEGRTPREA AIEASRLRLR PILMTSLAFI MGVVPLVVST GAGAEMRAAM GVAVFSGMIG
VTFFGIFMTP VFYVLLRRLT GNRPLVQHKP DEHKEEEAEV IRLAAE