Gene Rleg2_1776 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1776 
Symbol 
ID6980513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1819303 
End bp1820472 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content64% 
IMG OID643396498 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002281288 
Protein GI209549371 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.909315 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0859175 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCG AAAACATCCT ATCACCGAGG GAGGCCCGCA TTCCATGGGC GGCGATGGCA 
GGGATCACCG CGACCGTTAC GATATTCGCC GTGGCGCAGG GGCTGACCTA TCCGCTGCTG
AGTTTCATCC TGGAGCGGCA GGGAACGACA TCCGGCCTGA TCGGCCTGTC GGCGGCGATG
ACGCCGCTGG GCTTCATTGT ATCGGCGCCC TTCATTCCGG CGCTCTCACA GTACCTCGGC
GGAGCGCGGC TGGCCATTCT GTGTTCGATC CTGGCCGCCC TCACCCTGAT GGCGATTGGC
TGGATGCAGA ATGTTTGGGC CTGGATGACG CTGCGATTTC TGCTCGGCGT CTTCGCCAAT
CCGCTCTATG TGATCAGCGA GACCTGGCTG ATCACGATCA CGCCGGCGCC ACGGCGCGGC
CGCATTATGG GCCTTTATTC GTCGATCGTT TCGGCTGGCT TCGCCATCGG CCCGCTGTCG
CTCGGCTTCA CCGGCACGCA GGGCTGGCCG CCCTTCGTGA TCGGTATCGC GGCCTTCCTC
GCCTGTGGCT TCATCGTGCT GGCGGTCGCA CCACGCGTGC CAAAGATCTC AAGCGAAGGC
GAAGCAACAT CGCTCGGCGG CTTCTTCGCA CTGGCGCCGC TGCTATTGTT TGCTGTTTTC
ACGGCTGCCG CCTTCGAGCA GATCCTGCTT TCCCTATTCG CGGTCTATGG CGCGGCGCAT
GGCAGCGCCG AGGGACGGAT CGCTTCGCTC ATCACCTGTT TCATCGCCGG CAATGCCGTG
ATGCAGATCC TGCTCGGCCG CGTGGCCGAA CGGCTTGGCT CGACAGGGAC CATGTCATTT
TGCGTCCTGG CTTGCCTTGC CGGCTGCCTG CTGCTGCCGC TGGTCTTCAG CGCATGGCTG
ATCTGGCCAC TGGTTTTCGT CTGGGGCGGC GTCTCGTTCG GGATCTACAC CATGTCGCTG
ATCCAGCTCG GCGAGCGTTT CACCGGCCAG ACCCTGATCG CCGGCAACGC GGCCTTCGCC
TTGGCGTGGG GCATCGGCGG GATGGCGGGC TCGCCCGCGG CAGGATTGGC GATGCAGCTG
ATCGGACACC AAGGACTGCC GATGTCACTT GGCCTGCTCA GCTGTGTCCT GGCGGTGTTT
CTGATGGCGG GGAGACGACG GGGCGGGTGA
 
Protein sequence
MSAENILSPR EARIPWAAMA GITATVTIFA VAQGLTYPLL SFILERQGTT SGLIGLSAAM 
TPLGFIVSAP FIPALSQYLG GARLAILCSI LAALTLMAIG WMQNVWAWMT LRFLLGVFAN
PLYVISETWL ITITPAPRRG RIMGLYSSIV SAGFAIGPLS LGFTGTQGWP PFVIGIAAFL
ACGFIVLAVA PRVPKISSEG EATSLGGFFA LAPLLLFAVF TAAAFEQILL SLFAVYGAAH
GSAEGRIASL ITCFIAGNAV MQILLGRVAE RLGSTGTMSF CVLACLAGCL LLPLVFSAWL
IWPLVFVWGG VSFGIYTMSL IQLGERFTGQ TLIAGNAAFA LAWGIGGMAG SPAAGLAMQL
IGHQGLPMSL GLLSCVLAVF LMAGRRRGG