Gene Rleg_4140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4140 
Symbol 
ID8014934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4224005 
End bp4225543 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content64% 
IMG OID644826710 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002977920 
Protein GI241206824 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.347022 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.239769 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATGC AACTCGCGCC TGCGCCGCTC GTGACGGATC CTCGTCGTCG GCTTATCCTC 
TTCTTCTTCC TGATGACCGC CATGTTCATG GCGACGCTTG ATAATCAGAT CGTCTCCACG
GCGCTGCCGA CGATCGTCGG CGAATTCGGC CATCTCGAGC GCTTCGGCTG GATCGGCTCG
GCCTATCTCC TGTCGCTGAG CGCCGTCATG CCGGTCTACG GCAAGCTCGG CGACCTGTTC
GGCCGAAAAT ACGTGATGAT GACGGCGATC ATGATCTTCA CCGTCGGATC GACAGTCTGC
GGCCTTGCGG TCTCGATGAA TACGCTGATC GCCGCCCGCG TGCTGCAGGG TCTTGGCGGC
GGCGGCATCA TGGTGTCGAT CTTCGCCGTC AACGCCGACC TGTTCGAGCC GCGCGAGCGG
GCGCGCTACC AAAGCTATTC CAGCCTTGTG CTGATGGCAT CGGGCGCGAT CGGCCCGGTG
CTCGGCGGTA CGATGAGCGA TCTCTTCGGC TGGCGCTCGA TCTTCCTCGT CAACGTGCCG
ATCGGCTTCA TCGTGCTCAC CGGCCTTGCC TTCATGCTGC CGTACCGCAA ACCGCATCGT
CGCCCCAAGA TCGATTATGC CGGTGCGCTC CTGCTTGCCA TGACGACGAC AAGCATCGTG
CTTGCCACCG ACAGCAGCGA ATTGTTCGGC GCATTGATCT CGCCGGAGAG TATCGGCATC
GTCGCCTTCG GCGTCGTCTG CGCCGTCACC TGGGTGTTCG TCGAGCGCCG CGCGCCGGAA
CCGATCGTTC CCCTGCAGCT GTTTCGCAAT TCGACCTTCA GCCTGCTCCT GGTGATCTCG
ATCATGGGCG GCGCCATCGC CATCGGCATG GTCAATTATC TCGCCCTCTT TCTGCAGACA
ACGACCGGCC TTTCGCCGTC TGCCGCCGGC CTGCTCTTCA TCCTTCTGAC CGGCGGCCTC
GTCTGCGGGT CGCTTTCCGC AGGCCGCATC ATCTCGAAGA CGGGGCGCTA CAAGCCCTTC
GCCATCGCCA GCCTCACCTG CAGCGCCATC GCCTTTGCGC TGATGTCGCA GATCCACGCC
GGAACGCCGA TCGCCTTCAT CGGCGCGGTC ATGATGCTGC ACGGCATCGG CATCGGCCTT
GCCCAGCAGG TTCCCGTCAT CGGCGTACAG AATGCAGCAC CCGCCCGCGA CGTCGGCGCC
GCCACCGGCT CGGTGACGCT GTCGCGCATG GGCGGCGCCT CGATCGCCAT TTCCATCTAT
GGCGCCATCA TCGCCTCTGA GCTCGGCAAG GTCGGCGTCT CCATTCCTGG CGTCGCCGAT
ATCAAGCAGC TGACGCCGAA AATGATGGCC GCCCTTCCCG AAGCGAGCCG CCAAGCGGTC
GCCGATACCT ATGCCGCCGC ATTCTCGCCG CTCTTCATGA CCTCCTGCGC CATTGCGCTG
ATCGGCCTTG CCGCCGCCAT CATGCTGAAA CCCGTGCAAC TGCCCCGTGC CGGCGAGACG
ATAAAGCCGC AACCGGCGAC GGCGGAAGCT GCCGAATAG
 
Protein sequence
MDMQLAPAPL VTDPRRRLIL FFFLMTAMFM ATLDNQIVST ALPTIVGEFG HLERFGWIGS 
AYLLSLSAVM PVYGKLGDLF GRKYVMMTAI MIFTVGSTVC GLAVSMNTLI AARVLQGLGG
GGIMVSIFAV NADLFEPRER ARYQSYSSLV LMASGAIGPV LGGTMSDLFG WRSIFLVNVP
IGFIVLTGLA FMLPYRKPHR RPKIDYAGAL LLAMTTTSIV LATDSSELFG ALISPESIGI
VAFGVVCAVT WVFVERRAPE PIVPLQLFRN STFSLLLVIS IMGGAIAIGM VNYLALFLQT
TTGLSPSAAG LLFILLTGGL VCGSLSAGRI ISKTGRYKPF AIASLTCSAI AFALMSQIHA
GTPIAFIGAV MMLHGIGIGL AQQVPVIGVQ NAAPARDVGA ATGSVTLSRM GGASIAISIY
GAIIASELGK VGVSIPGVAD IKQLTPKMMA ALPEASRQAV ADTYAAAFSP LFMTSCAIAL
IGLAAAIMLK PVQLPRAGET IKPQPATAEA AE