Gene Rleg_1859 details

Gene Information

Locus tag: Rleg_1859
Symbol:
ID: 8012913
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1843584
End bp: 1844849
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 64%
IMG OID: 644824449
Product: major facilitator superfamily MFS_1
Protein accession: YP_002975681
Protein GI: 241204585
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
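
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1843584, 1844849)
print(length)                     # 1266
print(protein_length_aa(length))  # 421
```

Both values agree with the Gene Length and Protein Length fields of this record.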


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0875597
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.419035
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACG AACTGCGTCC GATCGATGCT GCCGCAAGCG CCGTGTCGCA GAAACTTGAC 
TGGCGGCTGA TGCTGCCGGT TTTCATCATC GTCAGCCTCG ATGCCGCCAG CAGCGGCGCC
ATCCTGCCGT TCCTGCCATT CTACCTCCGG AATCTCGGGG CGTCGCCGCT CGTTCTCGGG
CTTGTTCTCG GTGCGGAAGC GCTCAGCCAG TTCGTTGCCG CGCCCTGGCT AGGTCAACTT
TCCGACCGTT GCGGACGCAA GAGGGTTTTG CTCGCCAGCC AGGCCGGAGC GTTGATCAGC
CTTTTGCTGC TGGCGCTTGC CAACAGCGTC GTCTTCGTGC TGCTGGCGCG GATCCTGCTC
GGCCTGACCG CAGCCAATTT CTCGGCCGCG GCAGCCTATG CTGCCGACAA CAGCAGCGCC
ACCACCCGGC GCCAGGCCAT CGGCATTCTG AGTGCGGGCC TCGGCCTTGG CGGAATGATC
GGACCGAGCC TCTCCGGATA CCTTGCCGAC ACGTCTCTGA CGGCGCCGAT CTGGGTCGCG
CTTGCCCTGT CGGCGACCAG CATGCTGGTG ACCGGGCTTT GGCTGAAAGG CGCCGATGCG
CCCGGCCGGT TCGGCAACGA CAGCGAAGCG GACGAGACGG TCGGCGAGAA GGTTTCCTTC
CGAACCCTGC TCGCCTCGCC GGTCATCCGC GTCCTCGTCG CCGTTCTTCT CTGCCACTAT
TTCTCATACG GGATGTTCAG TTCGCAGCTC GCCGTTTTTC TGGCGGATAC ATTCACCTGG
AATGAACATG CGTTCGGTCC GAAGGAGCTG GGTTACCTCC TGAGCGCCGA CGGTGCGATC
AACGTCCTGG TCCAGCTTTT CCTGCTGAGA TGGCTCGGCG GCACCTTCTC TGAGCGAGGC
CTGATCGTCC TGGTCTTCAC CATTCTCTCA ATTGGTTATG TCACGGCTGG CCTCGCCACC
GACATCGTTA CCCTCGCCTT CGCCGTCCTT TGCATCAGCA CGGGCGTGGC ATTGGCGCGG
CCGACATTCG TTGCAGCACT CTCCGTGCAT GTGCCGCAGC AACGCCAGGG CATCGTCATG
GGAGCAACGC AGTCGCTCGT CGCCGTCACC GACATCGTCA CGCCGGTCCT TGCCGGCGTC
ATTCTCGGGC AGAGCTTGTA TGGCGCATGG ATCGGCGCCG TGGTGGCGAT CGCACTGGTC
GGAGCCGTCA TCGCCCGCAG CCGGCTGCCC GCAATCGATC CGGAGACGAG TGCTACCGGC
GGCTGA
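
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. A minimal sketch of how the length and GC content might be recomputed from the displayed text follows; the helper names are hypothetical, and only the first displayed row is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed row of the gene sequence (six blocks of 10 bases).
first_row = "ATGAGCGACG AACTGCGTCC GATCGATGCT GCCGCAAGCG CCGTGTCGCA GAAACTTGAC "
seq = clean_sequence(first_row)
print(len(seq))           # 60
print(gc_percent(seq))
```

Running the same two functions over all displayed rows joined together would give the values to compare against the Gene Length and GC content fields.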
 
Protein sequence
MSDELRPIDA AASAVSQKLD WRLMLPVFII VSLDAASSGA ILPFLPFYLR NLGASPLVLG 
LVLGAEALSQ FVAAPWLGQL SDRCGRKRVL LASQAGALIS LLLLALANSV VFVLLARILL
GLTAANFSAA AAYAADNSSA TTRRQAIGIL SAGLGLGGMI GPSLSGYLAD TSLTAPIWVA
LALSATSMLV TGLWLKGADA PGRFGNDSEA DETVGEKVSF RTLLASPVIR VLVAVLLCHY
FSYGMFSSQL AVFLADTFTW NEHAFGPKEL GYLLSADGAI NVLVQLFLLR WLGGTFSERG
LIVLVFTILS IGYVTAGLAT DIVTLAFAVL CISTGVALAR PTFVAALSVH VPQQRQGIVM
GATQSLVAVT DIVTPVLAGV ILGQSLYGAW IGAVVAIALV GAVIARSRLP AIDPETSATG
G
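
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a simplified illustration, not the database's own method: it builds the codon-to-amino-acid mapping shared by the standard code and translation table 11 (the two differ in which start codons are permitted, which this sketch does not model), and it stops at the first stop codon. All names are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a cleaned, in-frame DNA sequence up to the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First five codons and last four codons of the gene sequence above.
print(translate("ATGAGCGACGAACTG"))  # MSDEL
print(translate("ACCGGCGGCTGA"))     # TGG
```

The two outputs match the first five and last three residues of the protein sequence listed above, and the final TGA codon is the stop that is excluded from the 421 aa count.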