Gene Rleg2_2319 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2319 
Symbol 
ID6981058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2376872 
End bp2378062 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content64% 
IMG OID643397032 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002281820 
Protein GI209549903 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0805208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.902577 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATCA ACTACGACAT GAGAATTGAA GCCAAGACGA CGCCGTGGAG TGCGGTCATC 
TGCATGATGC TGACCTCCTT CGTGCTCGTG GCGTCGGAAT TCATGCCGGT GAGCCTGCTG
ACCCCGATCG CCGACGAACT GGCGAGCACC GCAGGCCAGG CGGGTCAGGC GATCTCGATT
TCGGGTTTCT TCGCCGTCAT CACCGCCTTG TTCAGCAATG TGCTGTTTGC GCGCTTCGAC
CGGCGCCGGG TGATCCTCTG CTACACCGTC GTGCTGGTGG CGTCCGGCCT TGCGGTCACC
TTTGCCCCCA ACTACCTTAT CTTCATGGTC GGCCGCGCGC TCATCGGTGT GTCGATCGGC
GGCTACTGGT CCTTGGCAAC GGCGATCATC GCCCGCATTG CCTCCGGTCC CGACGTGCCC
AAGGCGCTGG CCATGCTTCA AGGCGGCAGC GCGCTCGCCG CCGTGATCGC CGCCCCGCTT
GGAAGTTTCC TCGGCAGCCT TGTCGGATGG CGCGGCGCCT TCTTCATCGT CGTGCCGATC
GGCATTGTCG CCTTCATCTG GCAAGCCATC GCCCTGCCCC GGATGCCGGG CGGCCAAAGC
GGATCGCTCG GCAGGACGTT CCGCCTGATG GGCAACCGCA CCTTCGCCCT TGGCATGACG
GCGATGATCC TGTTTTTCAT GGGGCAATTT GCGCTCTCGA CCTATCTGAG GCCGTTTCTC
GAAGACATCA CCCATCTCGG CGTCAACGCG CTCTCGCTGG TGCTGCTCGG GATCGGCCTT
GCCGGCCTCG CCGGAACCTC GCTGATCCCC TCCATGCTGC GCGCGCATAT GGCCCACGTG
CTGATCGGGT TTCCGGCGGT GCTGGTGACC GTGGCCTTGG CGCTCGTCGG CCTCGGCCCC
GTGGCCTTCG CGACGGCCGG CCTGCTGCTT TTCTGGGGCT TGCTGACGAC GCCGGTGCCG
GCGGCATGGA CGACCTGGAT GACGCGGACG GTCCCGCACC ATCTGGAGGA AGCCGGCGCC
TGGTTTGTCG CGCTTATTCA GTTTGCGATC ACTTCAGGGG CATTCGCCGG CGGTCTGTTG
TTCGATCATA TCGGCTGGTG GAGCCCGTTC GTATTGAGTG CGGTGACTAT GCTGGGTTCG
GCGGTGACTG CGGTTGGCGT GACGCGGGCA TCCAAGAGAG CCTCATCCTG A
 
Protein sequence
MDINYDMRIE AKTTPWSAVI CMMLTSFVLV ASEFMPVSLL TPIADELAST AGQAGQAISI 
SGFFAVITAL FSNVLFARFD RRRVILCYTV VLVASGLAVT FAPNYLIFMV GRALIGVSIG
GYWSLATAII ARIASGPDVP KALAMLQGGS ALAAVIAAPL GSFLGSLVGW RGAFFIVVPI
GIVAFIWQAI ALPRMPGGQS GSLGRTFRLM GNRTFALGMT AMILFFMGQF ALSTYLRPFL
EDITHLGVNA LSLVLLGIGL AGLAGTSLIP SMLRAHMAHV LIGFPAVLVT VALALVGLGP
VAFATAGLLL FWGLLTTPVP AAWTTWMTRT VPHHLEEAGA WFVALIQFAI TSGAFAGGLL
FDHIGWWSPF VLSAVTMLGS AVTAVGVTRA SKRASS