Gene Rleg_5659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5659 
Symbol 
ID8016885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp236797 
End bp239865 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content57% 
IMG OID644827813 
Productacriflavin resistance protein 
Protein accessionYP_002979013 
Protein GI241518385 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.891494 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.720908 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTTT CCTCCTGGTC CATTCGCAAT CCCGTTCCGC CAATTCTGCT TTTCATTTTG 
TTGACGGCCT GCGGGCTATG GGCCTTCAAC CGTCTCGATA TCCAAAATTT CCCCGACATG
GACCTTCCAA CGATCGAGAT CAGCGCGTCG CTCGATGGAG CAGCCGCATC TCAGCTTGAA
ACGGAGGTCG CGCGCAAGAT CGAGGATGAG CTGACGGGCC TCACCAAGCT GGATTCCGTT
ACCACGACGA TAACGGACGG TTCGGTCAGC ATCTCGGTGG CATTCGAAGT CGGAAAGGAC
ACCCAAGAAG CCCTCGATGA GGTTAAAAGT GCTGTCGACC AGGCGCAGGA TGAGCTGCCG
GAAGAGATGA ATGCGCCGAC CGTTACGAAA CAGTCGCTAA ACTCTTCGCC CTTGATCACC
TACGTCGTGC GTTCGGATAA GCTGGATTCG GCTGAACTTT CTTGGTTCAT CGATAACGAC
ATGTCACGTG CGCTGATGGC GGTGGACGGT GTTGGCGAAG TCGGCCGCCT GGGAGGGGTC
GACCGGGAAA TCAGGGTGGA GCTGAACGCA AACCTGCTCG ACGGCTTGGG ATTGACCGTA
AACGACGTCA ACAGCCAGGT TGAAGCGGTA CAATCCGACC TGTCGGGTGG CAAAGGTAGA
ATTGGCGGCG AAAGCCAGTC CGTGCGTACA TTGGCTGCCG TAAAGAGCGC CGAGCAACTT
GGTGCGATGG CGATCCCCTT GCCCGAAGGC AGCTGGGTTG GCTTGGATGA ACTCGGAACG
GTGACCGATT CACACAGCGA TCTAACCTCG CTCGCTTATC TGAATGGCAA GCCGGTGATC
GCCGCTCAGA TCAAACGGTC GAAAGGTTAT TCGGATACGG CGGTCACTGA CAAAGTCCGT
GACGCCATGA AGGTATTCGC CGCGGCCCAT CCCGAAGTGA CGATTGAAGA GGCTTACAAC
ACCATCGTGC CGACCGAGCA GAACTATGAA TCGTCGATGC ATATGGTCTA CGAGGGGGCG
TTGATCGCTG TCTTCGTTGT CTGGTTGTTT CTTCGCGATT GGCGGGCGAC GCTTCTGGCT
GCGGTGGCAT TGCCCCTATC GATCATTCCC ACGTTTCTCG TGATGTATAT GCTCGGCTAT
TCGCTGAACA CCATCACGCT GTTGGCAATC TCGTTGGTGG TCGGCATTCT CGTAGATGAC
GCGATCGTCG AGATCGAAAA TATCGAGCGA CACCTCAACA TGGGCAAAAA CCCATTCGAC
GCGGCGATGG AAGCGGCAGA CGAAATCGGT ATGGCGGTCA TCGCCACCAC GTTTACACTG
GTTGCCGTGT TCCTACCCAC GGCATTCATG GGTGGAATTC CCGGCATCAT CTTCAAACAG
TTCGGCATCA CGGCTTCCGT TGCCGTTCTG ACTTCTCTCT TGGTCGCACG GTTGGTAACG
CCTATGATGG CGGCTTATAT GATGAAAGCC AGCGGCAAGG CGCACGAAGA AGACGGTCGC
ATCATGAGAT GCTATCTTTG GCTCGTGAAA GGCGCACTCC GCCGCCGCTG GATACCAATA
CTCGCAACAG TCGGGTTTCT CGCATTCACG GCACTGCTTC TATCTCATCT GTCCACGGGC
TTTTTCCCGG CATCCGACGA TGGGCAGACC CAAGTTAGCC TAACAACGCC ATATGGATCG
ACGATCGAAG CGACCGATGA GGCCGCGCGC AAGGCATCGG CGATCATTGC TGGCGTTGAC
CATGTCACGT CGGTCTTTCA GGCGACGGGC ACGGCCTCCA CGGGAGGGAT GAACGGCACG
TCAAATGCGA GCACCAACAG CGCGACGCTC GTCGTCAATC TCACGCCGAT CGATGACAGA
GATGTCAAGC AGTCACAAAT CGAAGCCGAT TTGCGCAAGG CATTGGAACA GCTGCCCGGC
GTTCGCCTGG AAATCGGTTC CGGCGGCAAT GGCACGCAAC TCACCCTGAC GCTGGCAGGC
GACAATTCCG AGCTTCTTGA AAAGGCCGCG GCAAACCTCG AGGCCGATCT GCGTACTCTT
TCGGGCATCG GTAACGTTAC GTCATCGGCT GCCATGCAAA CGCCTGAAGT CACGATCAAA
CCTGACTTGG CGGAAGCAGC ATCCCTGGGT GTCACGTCCA AGGCCATCGC CGAGGCTATT
CGCGTTGCAA CGGCCGGTGC CTACGACACC GCTTTGTCCA AGCTCAACCT GCCGGAGCGC
CAGGTCGCCA TTCGGGTCAT GCTCGATACG GCAAATCGAC AGTCGCTCGA CGCCATATCG
CTCATTCCTG TCGAGGGAAA AGAGGGTAAT GTCGCGCTGG GTGCCATCGC CGATATCTCT
CTTGGATCCA GCCCAAGCCA GATAGATCGG CTGGACCGGT CACGAAACGT CTCTTTGACA
GTCGAATTGA ACGGCAGAAA CCTTTCCGAC GTGACGGCAG AGGCCGCGCG ATTGCCCAGC
TATCAGAACC TGCCACAGGG TGTGAAATTC GTTGAACAGG GCGAGCTCAA GCGTCAGAGC
GAGCTGTTCA CCAGCTTTGG TACGTCGATG GCGATCGGCA TCTTCTGCAT TTACGCCGTT
CTGGTCCTTC TCTTCCATGA TTTCCTTCAG CCGGTGACAA TCTTGATGGC ACTGCCGCTG
GCACTTGGCG GCGCCCTCTT GCCACTCGTG TTGACCGGCA CCAGCTTCTC CATGCCAGCG
GTTATCGGCC TTCTTCTCCT GATGGGTATC GTATCGAAAA ACTCCATTCT CCTTGTCGAG
TACGCAATCG AGGCGCGCCG TGCCGGAATG TCTCGTTACG ACGCCCTCGT GGACGCCTGC
CACAAGCGAG CGCGACCCAT CATCATGACA ACCATTGCCA TGGCCGGCGG CATGCTTCCG
GCAGCACTCA GCCTTGTTTC CGGTGACCCA AGCTTCCGCC AACCCATGGG TATCGTCGTG
ATCGGAGGAC TGATCACCTC GACATTCCTG AGCCTTCTTG TCATCCCCGT CGTCTTTACA
TTTCTGGATG ATGTTCTGAA CTGGCTGAAA ACGAGGCTCC AAGAAGATAA CAGGCACGCC
GCCGAATAG
 
Protein sequence
MNFSSWSIRN PVPPILLFIL LTACGLWAFN RLDIQNFPDM DLPTIEISAS LDGAAASQLE 
TEVARKIEDE LTGLTKLDSV TTTITDGSVS ISVAFEVGKD TQEALDEVKS AVDQAQDELP
EEMNAPTVTK QSLNSSPLIT YVVRSDKLDS AELSWFIDND MSRALMAVDG VGEVGRLGGV
DREIRVELNA NLLDGLGLTV NDVNSQVEAV QSDLSGGKGR IGGESQSVRT LAAVKSAEQL
GAMAIPLPEG SWVGLDELGT VTDSHSDLTS LAYLNGKPVI AAQIKRSKGY SDTAVTDKVR
DAMKVFAAAH PEVTIEEAYN TIVPTEQNYE SSMHMVYEGA LIAVFVVWLF LRDWRATLLA
AVALPLSIIP TFLVMYMLGY SLNTITLLAI SLVVGILVDD AIVEIENIER HLNMGKNPFD
AAMEAADEIG MAVIATTFTL VAVFLPTAFM GGIPGIIFKQ FGITASVAVL TSLLVARLVT
PMMAAYMMKA SGKAHEEDGR IMRCYLWLVK GALRRRWIPI LATVGFLAFT ALLLSHLSTG
FFPASDDGQT QVSLTTPYGS TIEATDEAAR KASAIIAGVD HVTSVFQATG TASTGGMNGT
SNASTNSATL VVNLTPIDDR DVKQSQIEAD LRKALEQLPG VRLEIGSGGN GTQLTLTLAG
DNSELLEKAA ANLEADLRTL SGIGNVTSSA AMQTPEVTIK PDLAEAASLG VTSKAIAEAI
RVATAGAYDT ALSKLNLPER QVAIRVMLDT ANRQSLDAIS LIPVEGKEGN VALGAIADIS
LGSSPSQIDR LDRSRNVSLT VELNGRNLSD VTAEAARLPS YQNLPQGVKF VEQGELKRQS
ELFTSFGTSM AIGIFCIYAV LVLLFHDFLQ PVTILMALPL ALGGALLPLV LTGTSFSMPA
VIGLLLLMGI VSKNSILLVE YAIEARRAGM SRYDALVDAC HKRARPIIMT TIAMAGGMLP
AALSLVSGDP SFRQPMGIVV IGGLITSTFL SLLVIPVVFT FLDDVLNWLK TRLQEDNRHA
AE