Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2319 |
Symbol | |
ID | 6981058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 2376872 |
End bp | 2378062 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643397032 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002281820 |
Protein GI | 209549903 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0805208 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.902577 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATCA ACTACGACAT GAGAATTGAA GCCAAGACGA CGCCGTGGAG TGCGGTCATC TGCATGATGC TGACCTCCTT CGTGCTCGTG GCGTCGGAAT TCATGCCGGT GAGCCTGCTG ACCCCGATCG CCGACGAACT GGCGAGCACC GCAGGCCAGG CGGGTCAGGC GATCTCGATT TCGGGTTTCT TCGCCGTCAT CACCGCCTTG TTCAGCAATG TGCTGTTTGC GCGCTTCGAC CGGCGCCGGG TGATCCTCTG CTACACCGTC GTGCTGGTGG CGTCCGGCCT TGCGGTCACC TTTGCCCCCA ACTACCTTAT CTTCATGGTC GGCCGCGCGC TCATCGGTGT GTCGATCGGC GGCTACTGGT CCTTGGCAAC GGCGATCATC GCCCGCATTG CCTCCGGTCC CGACGTGCCC AAGGCGCTGG CCATGCTTCA AGGCGGCAGC GCGCTCGCCG CCGTGATCGC CGCCCCGCTT GGAAGTTTCC TCGGCAGCCT TGTCGGATGG CGCGGCGCCT TCTTCATCGT CGTGCCGATC GGCATTGTCG CCTTCATCTG GCAAGCCATC GCCCTGCCCC GGATGCCGGG CGGCCAAAGC GGATCGCTCG GCAGGACGTT CCGCCTGATG GGCAACCGCA CCTTCGCCCT TGGCATGACG GCGATGATCC TGTTTTTCAT GGGGCAATTT GCGCTCTCGA CCTATCTGAG GCCGTTTCTC GAAGACATCA CCCATCTCGG CGTCAACGCG CTCTCGCTGG TGCTGCTCGG GATCGGCCTT GCCGGCCTCG CCGGAACCTC GCTGATCCCC TCCATGCTGC GCGCGCATAT GGCCCACGTG CTGATCGGGT TTCCGGCGGT GCTGGTGACC GTGGCCTTGG CGCTCGTCGG CCTCGGCCCC GTGGCCTTCG CGACGGCCGG CCTGCTGCTT TTCTGGGGCT TGCTGACGAC GCCGGTGCCG GCGGCATGGA CGACCTGGAT GACGCGGACG GTCCCGCACC ATCTGGAGGA AGCCGGCGCC TGGTTTGTCG CGCTTATTCA GTTTGCGATC ACTTCAGGGG CATTCGCCGG CGGTCTGTTG TTCGATCATA TCGGCTGGTG GAGCCCGTTC GTATTGAGTG CGGTGACTAT GCTGGGTTCG GCGGTGACTG CGGTTGGCGT GACGCGGGCA TCCAAGAGAG CCTCATCCTG A
|
Protein sequence | MDINYDMRIE AKTTPWSAVI CMMLTSFVLV ASEFMPVSLL TPIADELAST AGQAGQAISI SGFFAVITAL FSNVLFARFD RRRVILCYTV VLVASGLAVT FAPNYLIFMV GRALIGVSIG GYWSLATAII ARIASGPDVP KALAMLQGGS ALAAVIAAPL GSFLGSLVGW RGAFFIVVPI GIVAFIWQAI ALPRMPGGQS GSLGRTFRLM GNRTFALGMT AMILFFMGQF ALSTYLRPFL EDITHLGVNA LSLVLLGIGL AGLAGTSLIP SMLRAHMAHV LIGFPAVLVT VALALVGLGP VAFATAGLLL FWGLLTTPVP AAWTTWMTRT VPHHLEEAGA WFVALIQFAI TSGAFAGGLL FDHIGWWSPF VLSAVTMLGS AVTAVGVTRA SKRASS
|
| |