Gene Rleg2_1780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1780 
Symbol 
ID6980517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1823868 
End bp1825097 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content65% 
IMG OID643396502 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002281292 
Protein GI209549375 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.378935 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0668238 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTCA TGCATCAGGA CTCGGCCGAC CCGGTAGCCG CAAAGCGCAA CTCCTGGGTG 
CTCACCGTTG CGCAGGCCTT CGGGGGCGCC AATGCTCCAA TCATCGTCTC GCTCGGCGGC
CTGGTCGGGC AGCATCTGTC GACCGATCCG GATCTCGTCA CGCTTCCCGT CAGCCTGCTC
AGCCTTGGGC TGGCACTCGG GACTCTGCCT GCCGCCTGGG TGATGCGTCG GTTCGGACGC
AAGCCCGGAT ATCTGCTGGG CTCGGTGACC GGCATGGTTT CGGGCCTGAT CGCTGCACTG
GGGATCGTGC TCTCCAGCTT CCTGGTTTTC TGCCTCGGCA CCTGCCTCGC CGGTTTCTAT
TCCTCCTATG TGCAGAGCTA CCGCTTCGCC GCGACCGACA ACACCACTGC AGCGCAGAGC
CATAAAGCGA TCGTCCGCGT CATGGTCGGC GGCCTGATCG CCGCGATCAT TGGCCCGCAG
CTCGTCATCT GGACGCGCGA CGCTTTGCCG GGGACACCCT TCGCCGGGAG CTTCCTCAGC
CAGGCCGTTC TCGCCGCCCT GGCGTTTCCG GTGCTGCTCA TGCTTCGCAC ATCGACGCCG
CCGACGGCTC ACGCGTCGGA AAGCGCCCTG GAGCGGCCCC TTGCCCAGAT TCTGACATCG
CCGCGCTATC TGCTCGCCAT CGCAACCGGT GTCGTGTCCT ACGGGCTGAT GACCTTCGTG
ATGACTGCGT CGCCGATCGC GATGGTCGGG CATGGTCACT CGATCGACCA GGCGGCATTG
GGCATCCAAT GGCATATTCT CGCCATGTAT GCGCCGAGCT TCGTCACCGG CCGCCTGATG
GTGCGTTTCG GCAAGGAACG GGTCGCGGCC GTCGGTCTGC TCCTCATCGG CTGCTCGGCG
GCCGTCGCGC TCTCCGGCTT CGACATCTCC CATTTCTGGC TCTCGCTGGT TCTGCTCGGG
ATCGGTTGGA ACTTCGGCTT CATCGGAGCA ACCGCCATGG TGGCCGACTG CCATACGCCG
GCCGAACGCA GCAAGGTACA GGGGGCAAAC GACTTCGTGG TCTTCGGTAC GGTCGCCTGC
GCGTCCTTCT CCGCCGGGTC GCTTCTCCAC AGCTCCGGCT GGGAAACGAT CAACTGGATC
GTGCTTCCGG CAGTCGCCCT GGTGCTGGTT CCCTTGGTCT GGCGGGCGGC GCGGCCCGGC
GATCACTCGG GAAGTCCGGC CTTGCGGTAG
 
Protein sequence
MNVMHQDSAD PVAAKRNSWV LTVAQAFGGA NAPIIVSLGG LVGQHLSTDP DLVTLPVSLL 
SLGLALGTLP AAWVMRRFGR KPGYLLGSVT GMVSGLIAAL GIVLSSFLVF CLGTCLAGFY
SSYVQSYRFA ATDNTTAAQS HKAIVRVMVG GLIAAIIGPQ LVIWTRDALP GTPFAGSFLS
QAVLAALAFP VLLMLRTSTP PTAHASESAL ERPLAQILTS PRYLLAIATG VVSYGLMTFV
MTASPIAMVG HGHSIDQAAL GIQWHILAMY APSFVTGRLM VRFGKERVAA VGLLLIGCSA
AVALSGFDIS HFWLSLVLLG IGWNFGFIGA TAMVADCHTP AERSKVQGAN DFVVFGTVAC
ASFSAGSLLH SSGWETINWI VLPAVALVLV PLVWRAARPG DHSGSPALR