Gene Rleg_5089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5089 
Symbol 
ID8007682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp476867 
End bp479938 
Gene Length3072 bp 
Protein Length1023 aa 
Translation table11 
GC content62% 
IMG OID644822004 
Productacriflavin resistance protein 
Protein accessionYP_002973264 
Protein GI241113429 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0880645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0178395 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGTCCT TCAACCTTTC CGACTGGGCG CTCGAACACC GTTCGCTCGT CTGGTACTTC 
ATGATCGTCT TCATTCTCGC AGGCGCCTTC TCCTACGTGA AGCTCGGCCG TGAGGAAGAC
CCGAACTTCA CGATCAAGAC GATGGTGATC ACCGCCCAGT GGCCCGGCGC GTCCGCCGAA
GAGGTGACGC GGCAGGTCAC CGACAGGATC GAGAAGAAGC TCCAGGAACT GGAATCGCTT
GATTACACCA AGAGCGAGAC CGTTGCCGGC CAGACAACCG TCTTCGTCGA GCTGCTGCCG
ACGACCAAAG CAAAGGATGT CGCTCCCACC TGGCTGCGCA TCCGCAACAT GATCGCCGAC
ATCAAGGGCG ATTTCCCGAC CGGCGTCGTC GGTCCCTTCT TCAACGATCG CTTCGGCGAC
GTCTTCGGAA ACATCTACGC CTTCACCAGC GACGGCCTTA CCCAGCGGCA GCTTCGCGAT
CTCGTGGAAA ACGCCCGCTC CGAGGTCCTG ACCGTGCCCA ACGTCGGCAA GGTCGATGTG
GTCGGTGCCC AGGATGAGGC GATCTATCTC GAATTCTCCA CACGCCAGAT CGCAGCTCTC
GGGATCGACC AGCAGGCGGT CATCCAGACC CTGCAGGCGC AGAATGCCGT CACGCAGTCC
GGCTTCGTCG ACGCCGGGCC GGAGCGCATC GCATTGAGGG TGAGTGGACA GTTCACCTCC
GAGGCCAGTC TGAGATCGAT CAATCTTCGG ATAAACGACC GCTTCTTTCC GCTGACCGAC
GTCGCCACGA TCAAACGCGG CTACGTGGAT CCGCCGTCGG CGCTGTTCCG GTTCAACGGC
GAGCCTGCGA TCGGGCTTGC GATCGGCATG AAGCAGGGCG CCAATCTTCT GGAGTTCGGC
GAGGGGCTCG ACGCGCAGAT GAAACGCGTC GTCGCCGATC TCCCGATCGG CGTGGACGTC
CACCGTGTCT CCGATCAGCC GGCCGTCGTC GACGAAGCGG TGTCCGGATT TACCCGCGCG
CTCTTCGAAG CGATTGCCAT CGTCCTCATC ATCAGCTTCA TCAGTCTCGG TCTTCGCGCC
GGCATGGTGG TGGCGATCTC AATTCCTCTC GTCCTGGCGA TCACCTTCGT GGTGATGGAA
TATTCCGGCA TTTCGCTTCA GCGCATCTCG CTCGGCGCGC TGATCATCGC GCTCGGCCTG
CTTGTCGACG ATGCCATGAT CGCCGTCGAG ATGATGGTGG CCCGCCTGGA GGCGGGCGAC
GATATCAGGA GGGCGGCCAC CCATGTTTAC ACGTCGACCG CCTTTCCGAT GCTGACGGGG
ACGCTCGTCA CCGTGGCGGG CTTCATTCCG ATCGGTCTTA ACGACAGCGC GGCGGGCGAA
TTCACCTTCA CGCTTTTCGT CGTCATCGCG GTTTCGCTGA TCGTTTCCTG GGTAGTGGCG
GTGCTGTTTA CGCCGCTTCT CGGCGTCACG ATCCTGCCGA AGACCATGAA ATCGCATTAT
GAGAAGAAGG GCCGCTTTGC CTCCATCTTC TCCTGGCTTC TGGGGCTTGC GATGCGCTGG
CGCTGGGTCA CCATCATTCT GACGGTCGGC GTTTTCGGGC TTTCGATTGG CGGCATGGGG
CTGGTGCAGC AGCAGTTCTT TCCCAATTCG GACAGGCCGG AGCTCATCAT CGACTGGAAC
CTCCCCCACA ACAGTTCGAT CGCCGAAACC AACAGGCAGA TGGCGAGATT CGAAAAAGAG
ATGCTGGCCG ACAACAAGGA TATCGATCAT TGGACGACCT ATGTCGGCCA AGGGGCGCCG
CGCTTCATCC TGTCCTTCGA CGTGCAGACG CCCAACGTGT CGTTCGGGCA GACGATCATC
GTCACCAAGG GGCTCGACGT GCGTGACAAG GTGCGGACGG AACTGCAGGG CTATCTGACG
AAAACCTTCG CCGGCACCGA CGCCTTCGTG AAGCTTCTCG ACATCGGTCC GCCGGTCGGC
AAGCCGGTCC AGTACCGGAT CAGCGGCCCC GATATTCAAA AGGTCCGCGA TCTCTCCCAG
CAATTTGCCG GCGTCATGGG ATCACACCCG CTTCTGACGA ACATGGTGCT GGACTGGAAC
GAGCCCTCTC GCGTGGTGAA GATCGATGTG CTCCAGGATA AGGCGCGCCA GCTCGGCGTA
TCCTCAGAAG ACATTGCTAC CGCCCTCAAC GGCATCGTCG AAGGCTCGAC GGCCACCCAG
GTTCGAGACG GCATCTACCT GGTCAACGTT ATCGGTCGCG CCAGGGCATC CGAGCGCGAT
TCCATCCAGA CGCTGCAGAA CCTGCAGCTC TCCACCTCCA ACGGCAAGGT CGTGCCGCTC
TCCGCCGTAG CCAATTTCCG CTACGAGCTC GAGCAGCCGA CAATCTGGCG TCGCGACCGG
CAACCTACAA TAACGGTCAA GGCTGCCGTC GTCGGTCCGA CGCAACCGGC CACGATCGTC
GACCAGCTCA CACCGAAGGT CGAAGACTTC CAGAAAGGCC TTCCGGTGGG ATACAAGGTG
GAAGTGGGCG GCGCGGTCGA ATCCAGTGCC GATGCCCAGG GGCCGATCGC GGCTGTGGCT
CCGCTGATGC TGTTTGCGAT GGCGACGATC CTGATGATCC AGCTGCAAAG CTTCAGCCGG
CTTTTCCTCG TGTTCGCGGT TGCGCCGACC GCCCTGATCG GCGTCGTCGC GGCGCTGCTT
CTCAGCAACG CCCCCATGGG CTTCGTCGCG ATCCTGGGCG TGCTCGCGCT CATCGGCATC
CTGATCCGCA ACTCCGTCAT CCTGGTCGTC CAGATCGAGC ATTTGCGCAG CGAAGGAATG
GCGCCTTGGC AGGCGGTTGT CGAAGCCACC GAACACCGCA TGCGGCCGAT CATGCTGACG
GCCGCTGCAG CCACGCTGGC GCTGATCCCG ATCTCGCGCG AGATCTTCTG GGGACCGATG
GCCTACGCCA TGATGGGCGG CATCGTCGTC GGAACCGCGC TCACCCTGTT GTTCCTGCCT
GCGCTCTACG TCGCGTGGTT CAGGATCCCG AGGGATGAGC GTGTTCAGGC CGAAGCCGCG
GCAAAGGCAT GA
 
Protein sequence
MKSFNLSDWA LEHRSLVWYF MIVFILAGAF SYVKLGREED PNFTIKTMVI TAQWPGASAE 
EVTRQVTDRI EKKLQELESL DYTKSETVAG QTTVFVELLP TTKAKDVAPT WLRIRNMIAD
IKGDFPTGVV GPFFNDRFGD VFGNIYAFTS DGLTQRQLRD LVENARSEVL TVPNVGKVDV
VGAQDEAIYL EFSTRQIAAL GIDQQAVIQT LQAQNAVTQS GFVDAGPERI ALRVSGQFTS
EASLRSINLR INDRFFPLTD VATIKRGYVD PPSALFRFNG EPAIGLAIGM KQGANLLEFG
EGLDAQMKRV VADLPIGVDV HRVSDQPAVV DEAVSGFTRA LFEAIAIVLI ISFISLGLRA
GMVVAISIPL VLAITFVVME YSGISLQRIS LGALIIALGL LVDDAMIAVE MMVARLEAGD
DIRRAATHVY TSTAFPMLTG TLVTVAGFIP IGLNDSAAGE FTFTLFVVIA VSLIVSWVVA
VLFTPLLGVT ILPKTMKSHY EKKGRFASIF SWLLGLAMRW RWVTIILTVG VFGLSIGGMG
LVQQQFFPNS DRPELIIDWN LPHNSSIAET NRQMARFEKE MLADNKDIDH WTTYVGQGAP
RFILSFDVQT PNVSFGQTII VTKGLDVRDK VRTELQGYLT KTFAGTDAFV KLLDIGPPVG
KPVQYRISGP DIQKVRDLSQ QFAGVMGSHP LLTNMVLDWN EPSRVVKIDV LQDKARQLGV
SSEDIATALN GIVEGSTATQ VRDGIYLVNV IGRARASERD SIQTLQNLQL STSNGKVVPL
SAVANFRYEL EQPTIWRRDR QPTITVKAAV VGPTQPATIV DQLTPKVEDF QKGLPVGYKV
EVGGAVESSA DAQGPIAAVA PLMLFAMATI LMIQLQSFSR LFLVFAVAPT ALIGVVAALL
LSNAPMGFVA ILGVLALIGI LIRNSVILVV QIEHLRSEGM APWQAVVEAT EHRMRPIMLT
AAAATLALIP ISREIFWGPM AYAMMGGIVV GTALTLLFLP ALYVAWFRIP RDERVQAEAA
AKA