Gene Rleg_3809 details


Gene Information

Locus tag: Rleg_3809
Symbol:
ID: 8014633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3867743
End bp: 3870913
Gene Length: 3171 bp
Protein Length: 1056 aa
Translation table: 11
GC content: 61%
IMG OID: 644826372
Product: transporter, hydrophobe/amphiphile efflux-1 (HAE1) family
Protein accession: YP_002977591
Protein GI: 241206495
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
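
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the terminal stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 3867743
end_bp = 3870913
gene_length_bp = 3171
protein_length_aa = 1056


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one, assuming the CDS ends with a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value with the value reported in the record.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under these assumptions both comparisons hold: the span covers 3171 bp, which is 1057 codons, or 1056 amino acids plus one stop codon.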


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0639553
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTCGC GTTTCTTCAT CGAGCGACCG ATACTCGCCA ATGTTCTGGC GTTGGTTTTC 
GTGCTCATCG GCGCGGTGGC GCTGTTCCAG CTGCCTGTGG CGCAATATCC GAATGTCGTT
CCGCCAACGG TGCAGGTGAC GACGCGCTTC CCGGGCGCCA GCGCACAGAC CCTCGTCGAT
ACCGTCGCGC TGCCGATCGA ACAACAGGTC AACGGCGTGC AGGACATGCT CTATATGCAG
TCGACGAGCG CAAGCGACGG CACCTATTCC CTGACCGTGA CCTTTGCCAT CGGAACGGAT
CCGGACCAGG CCCAGGTTCT GGTGCAAAAC CGCGTCGCCA TAGCAATGTC ATCGCTTCCC
GAAGCGGTGC AGCTTCAGGG CGTGACGACG CAAAAGAAGT CGACGGCGAT CCTCGGCTTC
GTCAGCCTGA CCTCGCCCGA CAGCCGCTAT GACAGCCTGT TCTTGTCGAA CTACGCCGTC
ATCAATCTTC AGAATGAACT CGCCCGCCTG CCGGGTGTCG GCAATGTCAC CGTGTTCGGG
GCCGGCCAAT ATGCCATGCG CATCTGGATG GATCCGAACC TGCTGCAGGC GCGCGGCCTG
ACGCCGCAGG ATGTGGTCAG CGTCGTGCAG CAGCAAAGCC AGGAGGTGGC CGCCGGCCAG
ATCGGCATTC CGCCGGTACC GAAGGGACAG GTCTTCCAAT ACACGCTGAA CGTCAATGGC
CGATTGAACG AGGCGGCCGA CTACGAAAAC ATCGTCGTCA AGGTCGAGAG CGGACAGGGC
GGCCGTGTTA CCCGGATCCG CGATATCGGC CGCGTCGAAC TCGGCGCCCA GACTTACCGT
CAGTCCTTCA TGCAGAACGG CCGGCCGGCT GCCGGCATCG GCATTTTTCA GCTGCCGGAG
GCAAATGCCA TCGCGGTGGC GCAGGTGGTG AACACGAAAA TGCAGGAGCT CTCGAAGAGC
TTCCCGCCGG GCCTCGAATA TCACGTGCCG TTCGACACGA CAAAATTCGT CGAGGCCTCC
ATCGACGAGG TTTACGTCAC CTTGATCGAG GCCGGCATTC TCGTTCTCAT CGTCATTCTC
GTGTTCCTGC AGGATTGGCG GGCGATGCTC GTGCCGGCGA CCACCGTTCC GGTCACCATC
ATCGGCGCCT TCGCGGCCAT GGCAGCATTG GGTTTCACCG TCAACCTGTC GACGCTGTTT
GCCATCGTTC TCGCCATCGG TATCGTCGTT GATGATGCCA TCGTCATTGT CGAAGGGGTC
GCCCGACACA TCGAGGCCGG CATGTCCGGC CGAAAGGCGG CCGAAAAGGC GATGGAGGAG
CTGCTTGGCC CGGTCATCGG CATTACGCTG GTGCTGATGG CCGTTTTCAT TCCGGCCGCC
TTCCTGCCCG GCCTGACCGG ACAGCTCTAC CGGCAGTTCG CCCTCGTCAT CGCCGCCACG
GCACTGATCA GCGCCATCAA TGCCGTCACG CTGAAACCGA CGCAATGTGC CCTTTGGCTG
CGGCCACCAG TCCCGCCAGA GAAACGCAAC ATCTTCTATC GCGGCTTCAA CAAGGTCTAC
GACAGGGGTG AACGCAGCTA TGCCGGGCTG ATCGGATCGA TGACCCGCCA CAGCGGTATC
ATGGTCATAG CAGCACTTGC GCTGATCGGT GTCGCCGTCT GGGGGCTCGC CCGTCTGCCG
ACTGCGTTTC TGCCCATCGA GGATCAAGGC TATGTGCTGA TCAGTGCGCA GCTGCCCGAC
GGCGCATCGA AGGAGCGCAC CGACGCGGTG ATGGAAGAGG TCGGCAAGAT CGCCGAAGCC
ACGCCTGGCG TCGATCAGGT GCTGACCATC AGCGGCATCT CCGTTCTCGA CAACAATGCC
AGTCTGCAGA ATGCCGGCGT CGCCTATGTC GTGCTGAAGG ATTGGGACGA GCGCGGCAAG
GAAAAGGGGC AGGATCTGCT GTCGATCTAC CAGCATTTGA ACGGCACGCT GCAAAGCGTG
CTCGCCGCCA AGACGCTGGT GGTCGTGCCA CCTCCGATCC AGGGCGTCGG CAATGCCAGT
GGCTTTACCA TGCAGGTCGA GATCAGGAAC GGCATTTCCG ACTACCCGCT GCTGCAATCG
CTTGCCGACA CGATCGTCAA GAACGGCGGT GCTCAGTCAT CCCTGCAAAG GCTGAGCACG
CCTTTTCGCT CGAACGTGCC GCAACTCGCA GTCTCTGTCG ACCGGATCAA GGCGGAGACG
CTAGGGGTTA CCGTGGGCCA GGTTTTCTCG GCTCTCTCCA GTTACGTCGG ATCGAGCTAT
GTCACCCAGT TCAACAAATT TGGCCGGACC TTCCAGGTCT ATGCGCAAGC CGCTTCCGAT
TTCCGGGTCA GCGCCGAAGA CATCCGCAAT CTGAAGGTCA AGGCCGGTGA CGGCACGATG
GTGCCGCTCG GCACGGTCGT CGATGTCACC GCGACACAAG GCCCATCTCT GATCAGCCTC
TACAATCTCT ACCCGTCTGC AACGATCGTT GGCGGGCCTG CCGCAGGCTT CAGTTCCGGC
CAGTCGCTCG ACGTCATGGA GCAGATTGCC GATCAGACAC TGCCACCGGG CACAGGCTTC
GAATGGACGG CGCTGTCCTA TCAGGAGAAG GCCGTGGGCG GGCAGATCTA TTTCATCTTC
GCGCTCGCCA TGCTCCTCGT CTATTTCGTG CTCGCAGGCC AGTATGAAAG CTGGATCTTG
CCGCTGGCAG TCCTTCTCGC CGTGCCGCTC GCCCTCCTTG GCACAGTGGC AGCGCTTATG
GCAGCCGGCG TTGCCAACAA CCTTTATACG CAGATCGGCC TTATCCTGCT GATCGCGCTT
GCATCGAAGA ATGCCATCCT CATCGTCGAA TATGCTAGGG AAAGGCGGGC GGAAGGCATG
GAAATACTGG ATGCGGCCGT CGAGGCCGCC CGCTTGCGTT TCCGGCCAAT TTTGATGACG
TCCTTCGCCT TTATCCTCGG CGTCCTGCCG CTCGTGCTGG CGACTGGCGC CGGCGCATCC
GCCCGCAAAT CTATCGGCAT ATCGGTCTTC AGCGGCATGA TCGCCTCGAC CTGCCTCGCC
GTGCTCTTTG TCCCGTCCTT CTACGTCCTG TTGCAGCGCC TCGAGGAATA TTGGAAAGGG
CGCACAAAGA CCGCAGGCCT GGCGGAAACG GAGATATCGA AGGTTCAGTA G
 
Protein sequence
MISRFFIERP ILANVLALVF VLIGAVALFQ LPVAQYPNVV PPTVQVTTRF PGASAQTLVD 
TVALPIEQQV NGVQDMLYMQ STSASDGTYS LTVTFAIGTD PDQAQVLVQN RVAIAMSSLP
EAVQLQGVTT QKKSTAILGF VSLTSPDSRY DSLFLSNYAV INLQNELARL PGVGNVTVFG
AGQYAMRIWM DPNLLQARGL TPQDVVSVVQ QQSQEVAAGQ IGIPPVPKGQ VFQYTLNVNG
RLNEAADYEN IVVKVESGQG GRVTRIRDIG RVELGAQTYR QSFMQNGRPA AGIGIFQLPE
ANAIAVAQVV NTKMQELSKS FPPGLEYHVP FDTTKFVEAS IDEVYVTLIE AGILVLIVIL
VFLQDWRAML VPATTVPVTI IGAFAAMAAL GFTVNLSTLF AIVLAIGIVV DDAIVIVEGV
ARHIEAGMSG RKAAEKAMEE LLGPVIGITL VLMAVFIPAA FLPGLTGQLY RQFALVIAAT
ALISAINAVT LKPTQCALWL RPPVPPEKRN IFYRGFNKVY DRGERSYAGL IGSMTRHSGI
MVIAALALIG VAVWGLARLP TAFLPIEDQG YVLISAQLPD GASKERTDAV MEEVGKIAEA
TPGVDQVLTI SGISVLDNNA SLQNAGVAYV VLKDWDERGK EKGQDLLSIY QHLNGTLQSV
LAAKTLVVVP PPIQGVGNAS GFTMQVEIRN GISDYPLLQS LADTIVKNGG AQSSLQRLST
PFRSNVPQLA VSVDRIKAET LGVTVGQVFS ALSSYVGSSY VTQFNKFGRT FQVYAQAASD
FRVSAEDIRN LKVKAGDGTM VPLGTVVDVT ATQGPSLISL YNLYPSATIV GGPAAGFSSG
QSLDVMEQIA DQTLPPGTGF EWTALSYQEK AVGGQIYFIF ALAMLLVYFV LAGQYESWIL
PLAVLLAVPL ALLGTVAALM AAGVANNLYT QIGLILLIAL ASKNAILIVE YARERRAEGM
EILDAAVEAA RLRFRPILMT SFAFILGVLP LVLATGAGAS ARKSIGISVF SGMIASTCLA
VLFVPSFYVL LQRLEEYWKG RTKTAGLAET EISKVQ
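
The gene and protein sequences can be related by translating the nucleotide sequence codon by codon. The sketch below is a minimal pure-Python illustration, not the method used to produce this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and it translates only the first and last lines of the gene sequence as printed in the Sequence section. The helper names `clean` and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering; amino-acid assignments of
# the standard code, assumed here to apply to translation table 11 as well.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGATCTCGC GTTTCTTCAT CGAGCGACCG ATACTCGCCA ATGTTCTGGC GTTGGTTTTC"
last_line = "CGCACAAAGA CCGCAGGCCT GGCGGAAACG GAGATATCGA AGGTTCAGTA G"

print(translate(first_line))  # MISRFFIERPILANVLALVF
print(translate(last_line))   # RTKTAGLAETEISKVQ*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final 16 residues followed by the stop codon TAG. The last line can be translated on its own because the 52 preceding lines hold 60 bases each, so it begins in frame.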