Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2151 |
Symbol | |
ID | 8013167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2141583 |
End bp | 2142863 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644824737 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002975967 |
Protein GI | 241204871 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00710] drug resistance transporter, Bcr/CflA subfamily [TIGR00880] Multidrug resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCGC CGCATAAACC GCACCAGGAA AACCAGGGCT CCCGCCGCAT CGGCATGGGC TTTGGCGAAT TCGTCGTCAC CATCGCCATC ATGACCGCCA GCGTCGCTAT GGCGATCGAC AGCATGCTAC CGGCATTGCC CAATATCGGC CATTCCCTCG GCGTCACCAA TACCAACGAC GCGCAGCTGA TCATCGGCGT GTTCTTCTTA GGCTTCGGCG TCTCGCAGAT CTTCTTCGGC AGCCTTTCGG ATACGTTCGG CCGCCGCAAC ATCCTGCTCG GCGGCTTGGC CTGCTATATT GTCGGAATGT TTGCCGCCGC AGCGACCGGC AGCTTCGAAA TGCTGCTCGT CATGCGTTTT GTCCAGGGTA TTGGCGGTGC GGCCGTGCGT ATCACCACCA TGGCCATGGT GCGCGACTGC TTCGGTGGCC GCGAAATGGC CCGAGTCATG TCCTATGTGA TGATCGTCTT CATGATCGTG CCGATCGTTG CCCCATCCGT CGGCCAACTC ATTATCCTCT ATGCCAACTG GCACTGGATC TTCATCCTGC TCGGCATCAT CGCCACCATT CTGTTCGTCT GGGCTCTCCT GCGGCTGAAG GAATCGCTGC CGCCGGAAGA GCGGCTGCCG CTGTCGGTCG CCTCTGTCGT GGATGGCTTC AAGACCGTGC TCACCAACCG CATCACCTGC GGCTACATGA TCGGCCTCAC CATGTTCACC GGCGTCATCA GCGCCTATGT GATCTCGGTG CAGCAGGTCT TCGGCGAGGT CTACGGCCTT GGCGACTGGC TGCCGATCGC TTTTGCGGCC ACCGCTGGCG GCATCGCCGT CGCCAATTTC GCCAACGGCT TCTTCGTCCG CAAATTCGGC ATGCGCCGCA TCTCGCACGC CGCCCTGCTG ATTTTCACGG CGCTATCGGC TGTCGGCTTT TTTTATTCGC TGGCCGGCAA GCCCGATTTC GCCATTGCCT ACGGCATCTT CACCATCGTG CTGATGATGT TTGCCCTGAT CGCCACCAAT TTCACCGCCA TCTGCCTCGA GCCAATGGGT AATCTCGCCG GCACGGCGAC CGCCATCACC GGCTTCGTTT CGACGACGGC CGGCGCCATC CTCGGCGGCC TGGTCGGCCA GATGTTCAAC GGCACGGTGC AGCCGCTCTT CGGCGGCTTC GCGCTCTTCG GCGCGGTGAC GATCGCAGCC ACCCTTTGGG CCGAAAACGG CAAGCTCTTC ACCCATCCAG GCGACAGCCC GCAGCTCGAT CCGGGTGCGG CGCATTTCTG A
|
Protein sequence | MTPPHKPHQE NQGSRRIGMG FGEFVVTIAI MTASVAMAID SMLPALPNIG HSLGVTNTND AQLIIGVFFL GFGVSQIFFG SLSDTFGRRN ILLGGLACYI VGMFAAAATG SFEMLLVMRF VQGIGGAAVR ITTMAMVRDC FGGREMARVM SYVMIVFMIV PIVAPSVGQL IILYANWHWI FILLGIIATI LFVWALLRLK ESLPPEERLP LSVASVVDGF KTVLTNRITC GYMIGLTMFT GVISAYVISV QQVFGEVYGL GDWLPIAFAA TAGGIAVANF ANGFFVRKFG MRRISHAALL IFTALSAVGF FYSLAGKPDF AIAYGIFTIV LMMFALIATN FTAICLEPMG NLAGTATAIT GFVSTTAGAI LGGLVGQMFN GTVQPLFGGF ALFGAVTIAA TLWAENGKLF THPGDSPQLD PGAAHF
|
| |