Gene Rleg_6050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6050 
Symbol 
ID8016312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012852 
Strand
Start bp82003 
End bp83262 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content60% 
IMG OID644827358 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002978558 
Protein GI241258674 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCATGT TTGAAAAGCG GATGACTGCG ACAGGTGGGT CCGATTCTGG CGACGTCGAC 
TCTGGGTATG CCTGGTGGCG GCTGGTACTC ACGTTGATCC TCGGCACCGC TGCCTGCGTC
GGCAATTGGT CCGTCGTGGT GCTTCTTCCA ACCCTGCAAG TCGAGTTCGA CACGGTGCGC
GGCGGTGCCT CGCTTCCCTA TACCTGCACG ATGCTTGGCT TTGCCTTTGG CGGGGTTGTA
ATGGGGCGTC TGGCCGATCG GGTCGGCATC GTCGTGCCAG TGCTGATCGG CGCGTTCCTG
CTTTGCGTTG GCTACATCCT GGCTGCCCTT ACCACCAATA TTTGGCAGTT CGCAGCCTTC
TCGCTGGTAA TCGGCCTTGG ATCAGCTGCC GGCTTCGCGC CGTTGATATC CGACCTTTCT
CTTTGGTTTA GCAGGCACCG GGCGCTCGCA GTCGCCTTTG CGGCCTCGGG CAGCTATCTT
TCCGGGGCGG TTTGGCCGAT GGCTATCGAG CATTTCCAGA CGACCCAGGG CTGGCGTGCA
ACCCATGTCG GAATCGGCAT TTTCATCCCA CTGGTAATGG TGCCCATCGG TCTGCTGTTG
AAGCGGCGTC TACAGACCGT AACCTATGTC CAGGCCGAGG CGGTAACCGA AGCAGCACGC
AACGAACTCG GCCTAAGCCC GAACGCGTTG CAGGTCGTCC TTGTCGTCGC GGGCTTTGCG
TGCTGCATGG CAATGTCAAT GCCGCAGGTC CATATCGTCG CATATTGCGG GGATCTCGGC
TACGGTGTGG CGGTGGGCAC GCAGATCATC GCCTTGATGC TTGGACTGGG TGTCGTCAGC
CGCTTGGCGT CCGGGGCGGT CGCCGATCGG ATCGGTGCAG GGCCGATGTT GATCCTCGGT
TCCTCGATGC AGGCGGCAGC GCTGCTGCTA TATCTGTATT TCAACAGCAA GTCGTCGCTC
TATGTGATCT CCGGCCTGTT CGGGCTATTT CAGGGTGGTA TAGTTCCGAT GTATGCCGTG
ATCATTCGGA AATATCTGCC GCCACGTGAG GCGGGTATCC GCATCAGCCT GGTGTTAATG
GCGACCGTAC TTGGCATGGC TTGTGGGGGC TTGGCCGCTG GTTATATTTT CGACGCCACC
GGCTCTTACC GCCTGGCTTT CCTGCATGGC TTCCTTTGGA ACTGCGTCAA CCTTGCTTTG
GTGAGCTGGT TGATCCTATG GCCCAGGCTG CGGCGGAGGC AGCAGGCGTT GGCGACGTGA
 
Protein sequence
MAMFEKRMTA TGGSDSGDVD SGYAWWRLVL TLILGTAACV GNWSVVVLLP TLQVEFDTVR 
GGASLPYTCT MLGFAFGGVV MGRLADRVGI VVPVLIGAFL LCVGYILAAL TTNIWQFAAF
SLVIGLGSAA GFAPLISDLS LWFSRHRALA VAFAASGSYL SGAVWPMAIE HFQTTQGWRA
THVGIGIFIP LVMVPIGLLL KRRLQTVTYV QAEAVTEAAR NELGLSPNAL QVVLVVAGFA
CCMAMSMPQV HIVAYCGDLG YGVAVGTQII ALMLGLGVVS RLASGAVADR IGAGPMLILG
SSMQAAALLL YLYFNSKSSL YVISGLFGLF QGGIVPMYAV IIRKYLPPRE AGIRISLVLM
ATVLGMACGG LAAGYIFDAT GSYRLAFLHG FLWNCVNLAL VSWLILWPRL RRRQQALAT