Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4140 |
Symbol | |
ID | 8014934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4224005 |
End bp | 4225543 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644826710 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002977920 |
Protein GI | 241206824 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.347022 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.239769 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATGC AACTCGCGCC TGCGCCGCTC GTGACGGATC CTCGTCGTCG GCTTATCCTC TTCTTCTTCC TGATGACCGC CATGTTCATG GCGACGCTTG ATAATCAGAT CGTCTCCACG GCGCTGCCGA CGATCGTCGG CGAATTCGGC CATCTCGAGC GCTTCGGCTG GATCGGCTCG GCCTATCTCC TGTCGCTGAG CGCCGTCATG CCGGTCTACG GCAAGCTCGG CGACCTGTTC GGCCGAAAAT ACGTGATGAT GACGGCGATC ATGATCTTCA CCGTCGGATC GACAGTCTGC GGCCTTGCGG TCTCGATGAA TACGCTGATC GCCGCCCGCG TGCTGCAGGG TCTTGGCGGC GGCGGCATCA TGGTGTCGAT CTTCGCCGTC AACGCCGACC TGTTCGAGCC GCGCGAGCGG GCGCGCTACC AAAGCTATTC CAGCCTTGTG CTGATGGCAT CGGGCGCGAT CGGCCCGGTG CTCGGCGGTA CGATGAGCGA TCTCTTCGGC TGGCGCTCGA TCTTCCTCGT CAACGTGCCG ATCGGCTTCA TCGTGCTCAC CGGCCTTGCC TTCATGCTGC CGTACCGCAA ACCGCATCGT CGCCCCAAGA TCGATTATGC CGGTGCGCTC CTGCTTGCCA TGACGACGAC AAGCATCGTG CTTGCCACCG ACAGCAGCGA ATTGTTCGGC GCATTGATCT CGCCGGAGAG TATCGGCATC GTCGCCTTCG GCGTCGTCTG CGCCGTCACC TGGGTGTTCG TCGAGCGCCG CGCGCCGGAA CCGATCGTTC CCCTGCAGCT GTTTCGCAAT TCGACCTTCA GCCTGCTCCT GGTGATCTCG ATCATGGGCG GCGCCATCGC CATCGGCATG GTCAATTATC TCGCCCTCTT TCTGCAGACA ACGACCGGCC TTTCGCCGTC TGCCGCCGGC CTGCTCTTCA TCCTTCTGAC CGGCGGCCTC GTCTGCGGGT CGCTTTCCGC AGGCCGCATC ATCTCGAAGA CGGGGCGCTA CAAGCCCTTC GCCATCGCCA GCCTCACCTG CAGCGCCATC GCCTTTGCGC TGATGTCGCA GATCCACGCC GGAACGCCGA TCGCCTTCAT CGGCGCGGTC ATGATGCTGC ACGGCATCGG CATCGGCCTT GCCCAGCAGG TTCCCGTCAT CGGCGTACAG AATGCAGCAC CCGCCCGCGA CGTCGGCGCC GCCACCGGCT CGGTGACGCT GTCGCGCATG GGCGGCGCCT CGATCGCCAT TTCCATCTAT GGCGCCATCA TCGCCTCTGA GCTCGGCAAG GTCGGCGTCT CCATTCCTGG CGTCGCCGAT ATCAAGCAGC TGACGCCGAA AATGATGGCC GCCCTTCCCG AAGCGAGCCG CCAAGCGGTC GCCGATACCT ATGCCGCCGC ATTCTCGCCG CTCTTCATGA CCTCCTGCGC CATTGCGCTG ATCGGCCTTG CCGCCGCCAT CATGCTGAAA CCCGTGCAAC TGCCCCGTGC CGGCGAGACG ATAAAGCCGC AACCGGCGAC GGCGGAAGCT GCCGAATAG
|
Protein sequence | MDMQLAPAPL VTDPRRRLIL FFFLMTAMFM ATLDNQIVST ALPTIVGEFG HLERFGWIGS AYLLSLSAVM PVYGKLGDLF GRKYVMMTAI MIFTVGSTVC GLAVSMNTLI AARVLQGLGG GGIMVSIFAV NADLFEPRER ARYQSYSSLV LMASGAIGPV LGGTMSDLFG WRSIFLVNVP IGFIVLTGLA FMLPYRKPHR RPKIDYAGAL LLAMTTTSIV LATDSSELFG ALISPESIGI VAFGVVCAVT WVFVERRAPE PIVPLQLFRN STFSLLLVIS IMGGAIAIGM VNYLALFLQT TTGLSPSAAG LLFILLTGGL VCGSLSAGRI ISKTGRYKPF AIASLTCSAI AFALMSQIHA GTPIAFIGAV MMLHGIGIGL AQQVPVIGVQ NAAPARDVGA ATGSVTLSRM GGASIAISIY GAIIASELGK VGVSIPGVAD IKQLTPKMMA ALPEASRQAV ADTYAAAFSP LFMTSCAIAL IGLAAAIMLK PVQLPRAGET IKPQPATAEA AE
|
| |