Gene Information |
Locus tag | Rleg2_2166 |
Symbol | |
ID | 6980905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2221725 |
End bp | 2223170 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643396887 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002281675 |
Protein GI | 209549758 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
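The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single untranslated stop codon. The function names `coordinate_span` and `protein_length_from_cds` are hypothetical helpers, not part of any database API.

```python
def coordinate_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information table.
START_BP = 2221725
END_BP = 2223170

span = coordinate_span(START_BP, END_BP)
residues = protein_length_from_cds(span)
print(f"span = {span} bp, protein = {residues} aa")
```

Under these assumptions the computed span agrees with the listed gene length of 1446 bp, and the derived residue count agrees with the listed protein length of 481 aa.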
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.773581 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACGG CAGAGACGAG CGAGAATGGG CAGGAGGCGG CAAGCGGCCG CCAGGCGAAG ATCGTGGCGC TGGTCGTTGC CGTTTCCTTC TTCATGCAGA TCCTCGACGG CACGATCGTC GCCACCTCGC TGCCGCAGAT GGCGGCGAGC TTTGACGTGC AGCCGGTGTC GATGAGCATC GGCATTACCG TCTACATGCT GACGATGGCC GCTTTCATTC CGCTCTCGGG CTGGCTCGGC GACAGGTTCG GTGCGCGCCG TATCTTCCTG ATGTCGATCG CCGTCTTTAC CGCCGCCTCG CTATTCTGCG GCCTGTCCGG CAGCCTCACG GAATTTGTGC TTTGGCGCGC CGTCCAAGGC GCGGGCAGCG CGCTGATGAC GCCGGTCGGG CGGATCATCG TGCTGAAGAA CGCCCGCAAA TCCGAACTCG TCCAGGCGAT CGCGCTGATC ACCTGGCCGG CGCTGACCGC ACCGGTGATC GGCCCGGTGC TCGGCGGCTT CATCACGACC TATGCCAGCT GGCACTGGAA CTTCCTCATA AACATCCCGA TCGGCATTCT CGGCATGGCG CTGGTGCTGC GCTTCGTGCC GGAGCAGCGC GAGACAGATC CCGGCCGGTT GGATTTCGCC GGCTTCGTTC TGAGTGCTGC GGGGCTGACC TTCCTGCTCG CCGCCCTGGA GCTTTCGGTC AAATGGGACG GCGGGCTTTT GCCGGTCATA TCCATGCTTG CAGCCGGCAT CGTGCTGTCT GTGATGGCGA CCCGGCATTT CCTCGCTGTC GACAATCCGC TGCTCGACCT TTCGGCCTTC CGCGTCCAGA CCTTCTCGAT GTCGACGCTG TCGGCGGGCA CGGCCTGCCG GGTGGCGATC AATGCGACGC CGTTCCTGCT GCCGCTGCTG TTCCAGCTCG GTTTCGGGCT GAGCTCGATT GCCGCCGGCA CCTATCTGCT AGTCTACTTC CTCGGCAATC TCGGCATGAA GACGGTGACG ACGCCGCTCT TGCGCTTCTT CGGCTTCCGC ATCGTGCTTG TGGTCAACGG GCTGATTGCG GCGCTTTCAA TCGCCGCCTG CGGTTTCCTC ACGCCGGATA CGCCGCAGTT TTTCATCCAT GCGCTGCTTT TTCTCGCCGG CCTGTCGCGG TCGATGGAAT TCACCGCGCT GAACACACTC GCCTTCGCCG ATATCGGTCC GGCGCAGCGA AGCTCGGCCT CGACGCTGTC GAGCATGCTG CAGCAGGTGT CGATGCTGCT CGGCGTCGCC GTTGCGGCGG CCGTGCTGAA TATCGCTTCG GCTCTCAGGG GCGCCGACAA TCCCGTTCTT GCCGATTTCC GCTGGGCTTT CGTCGTGGTC GGCGCGATCG GCGTCATGTC GTCGCTGCGC TTCCTGCAAT TGCCGGCGGA GGCAGGCGCC GAAGTATCCG GCCATCGGAA ATTCCAGAAA AATTAG
|
Protein sequence | MTTAETSENG QEAASGRQAK IVALVVAVSF FMQILDGTIV ATSLPQMAAS FDVQPVSMSI GITVYMLTMA AFIPLSGWLG DRFGARRIFL MSIAVFTAAS LFCGLSGSLT EFVLWRAVQG AGSALMTPVG RIIVLKNARK SELVQAIALI TWPALTAPVI GPVLGGFITT YASWHWNFLI NIPIGILGMA LVLRFVPEQR ETDPGRLDFA GFVLSAAGLT FLLAALELSV KWDGGLLPVI SMLAAGIVLS VMATRHFLAV DNPLLDLSAF RVQTFSMSTL SAGTACRVAI NATPFLLPLL FQLGFGLSSI AAGTYLLVYF LGNLGMKTVT TPLLRFFGFR IVLVVNGLIA ALSIAACGFL TPDTPQFFIH ALLFLAGLSR SMEFTALNTL AFADIGPAQR SSASTLSSML QQVSMLLGVA VAAAVLNIAS ALRGADNPVL ADFRWAFVVV GAIGVMSSLR FLQLPAEAGA EVSGHRKFQK N
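The sequences above are shown in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned and sanity-checked follows. The helper names are hypothetical, the start-codon check only accepts ATG (translation table 11 also permits alternative starts), and only the first three blocks of the gene sequence are used here for brevity.

```python
def join_blocks(spaced: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(spaced.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length is a multiple of 3, starts with ATG, ends with a stop codon.

    Assumption: TAA, TAG and TGA are the stop codons (as in translation table 11).
    """
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in ("TAA", "TAG", "TGA")
    )


# First three blocks of the gene sequence, copied from the record.
prefix = join_blocks("ATGACCACGG CAGAGACGAG CGAGAATGGG")
print(len(prefix), gc_fraction(prefix))

# A toy CDS (hypothetical, not from the record) to exercise the check.
print(looks_like_complete_cds("ATGAAATAG"))
```

Pasting the full gene sequence into `join_blocks` would allow the same checks to be run against the listed gene length and GC content.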
|
| |