Gene Rleg_2985 details


Gene Information

Locus tag: Rleg_2985
Symbol:
ID: 8013904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 2982023
End bp: 2983189
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 65%
IMG OID: 644825555
Product: major facilitator superfamily MFS_1
Protein accession: YP_002976783
Protein GI: 241205687
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
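
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2982023, 2983189)
print(length)                     # 1167
print(protein_length_aa(length))  # 388
```

Under these assumptions the computed values agree with the listed gene length (1167 bp) and protein length (388 aa).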


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0895325
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCTCG CCCTCCTCGT TCTTGCCTTG AGCTCATTTG CGATAGGCAC CACTGAATTC 
GTCATCATGG GTCTGTTGCC GGAGGTCGCC GCCGATCTCT CGGTCAGCAT CCCGCAGGCC
GGATGGCTGG TGACCGGTTA TGCCCTGGCG GTCGCGATCG GCGCCCCTGT GATGGCGATT
TCGACCGCGA AGTTGAAGCG CCGTACCGCC CTGATTGCGC TGATGGCCTT CTTCATCGCC
GGCAACCTGC TGTGCGCTCT GGCGAGCGAC TACTGGGTGC TGATGATCGC CCGTGTCGTG
ACAGCACTCT GCCACGGCGC CTTCTTCGGC ATCGGCTCGG TGGTCGCCGC CGGCCTCGTC
GCCGAAGACC GCAAGGCCCG AGCCGTCGCG CTGATGTTCA CTGGCCTGAC GCTCGCCAAC
GTTCTCGGCG TGCCGCTCGG CACCGCGATC GGTCAGGCCT ATGGCTGGCG CGCCACCTTC
GGCGTCGTCA CCGTCATCGG TATCTTCACC ATATCAGGCC TGATCGCCAT CCTGCCCAGG
GACAAGCAGC AAGAAAACGG CAGCATCCTG CGCGAGATTG CGGCACTCAG GAATGGCGGT
CTGTGGCTAG CACTCTCCAC CACCGTCTTC TTCGCCGCCT CTATGTTCAC CCTCTTCACC
TATATCGCGC CGCTGCTGCG CGACGTCACC GGCGTTTCGC CGGAAGGCGT CACCTGGACG
CTGTTCCTGA TCGGCCTCGG GCTGACCATC GGCAACCTCG TCGGCGGCAA GCTTGCCGAT
TGGCGGCTCG GCGCGACGCT AGCCGGGGTC TTTGCCGCGA TCGCCATCAC TTCGATCGCC
TTCAGCTATA CGAGCCGCTT CTTCATCCCG GCTGAAATCA CCCTCTTCCT CTGGGCGATG
GCAAGCTTTG CCGCCGTACC GGCGCTGCAA GTCGGCGTCG TCGGCTTCGG CAAGGACGCC
CCGAACCTCG TCTCGACGAT CAACATCGGC GCCTTCAACA CCGGCAATGC GCTCGGCGCA
TGGGTGGGTG GCTTGGTCAT CGACGCCGGC TTCGATCTGA CCCGCGTTCC GCTCGCCGCG
GCCTTGATGG CCCTGATCGG CCTCGGGGCG ACGGCGCTCA CCTATCTCTC CGCCAGGGGC
CGGGCTGCCC TCGCCCCTGC CGAGTGA
 
Protein sequence
MPLALLVLAL SSFAIGTTEF VIMGLLPEVA ADLSVSIPQA GWLVTGYALA VAIGAPVMAI 
STAKLKRRTA LIALMAFFIA GNLLCALASD YWVLMIARVV TALCHGAFFG IGSVVAAGLV
AEDRKARAVA LMFTGLTLAN VLGVPLGTAI GQAYGWRATF GVVTVIGIFT ISGLIAILPR
DKQQENGSIL REIAALRNGG LWLALSTTVF FAASMFTLFT YIAPLLRDVT GVSPEGVTWT
LFLIGLGLTI GNLVGGKLAD WRLGATLAGV FAAIAITSIA FSYTSRFFIP AEITLFLWAM
ASFAAVPALQ VGVVGFGKDA PNLVSTINIG AFNTGNALGA WVGGLVIDAG FDLTRVPLAA
ALMALIGLGA TALTYLSARG RAALAPAE