Gene Rleg2_1573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1573 
Symbol 
ID6980309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1598354 
End bp1599544 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content63% 
IMG OID643396298 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002281089 
Protein GI209549172 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.120591 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCGGA CCTACGAGTG GACCATGCAA CGTCATTTCC TTGTCCTGTT TTGTCTCGCG 
CTGACCCAGA TCACCGGCTG GGGCGTGGTC GGCGTCCTGC CCGTGATTGC GACGCCGGTG
GCGGCGGAAT TCAAAACCTC ACTGCCGTCG GTCTTTTTGG GAACCTCTGT GATGTTTGTC
GCAATGGGGC TTGCCGCACC CTGGGCCGGC CGCGCCTTTC GCAGGTTCGG AACGCGCCAG
GTCATGGCGG CGGGAGCGGG CCTGATCGGC CTGGCTCTGT GCCTGCTCGC GCTGTCTCCG
AACCTGCCTG TCTTCTGGGT CGGGTGGGCC CTGACCGGGC TGGCAGGCGC GATGTTCCTG
ACGACCTCGG CTTATGCCTA TGTCGCGGAA TATGCAGAGG ATCGGGCGCG CAGCCTGATT
GGAACATTGA TGCTGGTGAC CGGTCTGGCA GGCAGCGTCT TCTGGCCCAT AACTGCCTTT
CTCGACCATC TTGTGGGGTG GCGGCAGGTT TTCGTCGTCT ATGCCGGTGT CATGGTTTTC
ATCATCTGCC CTTTGGTCCG GTTCGGTCTG CCCGTCACCG GGGCGGCTGC CGCAGCGACA
GCTCATCGGC GGCGCGGGCG ATGGGAACCT GTCTTGATCC TGCTGGTCGC CGCCATCGCT
TTGAACAGCT TTGTCACCTT CGGCGTCGAG GCAGTGGGAA TCAAGCTCCT GCAGTCGATG
GGGATGGATC TTGCCGGCGC CGTCGCAATC GCCTCGCTCC TTGGGGTCTT CAAGGTCGGC
GGGCGCGTGA TCGACCTTCT CGGCGGCCAA AAATGGGACG GATTGTCTAC CGCAATCGTC
TCGGGGGCGA TGATCCCGAT GGGACTGGCT ACGATCTGGA TCGGCGGCGC CGGTATTCTA
TCTGTGGGTG GCTACCTCGT CCTGTTCGGG GTCGGGAGCG GTGCCTTTGC CGTCGCACGC
GCCACGATGC CGCTCGTTTT CTTTGAGAAG GCCGATTATA CCGCCGCGAT GGCAACCATC
GCCCTGCCGA TGAACCTGAT CAACGCACTC GCCCCGCCCG GCATAGCGGC GCTGATGGCC
GGTATCGGGG CGCAGGCGAC TTTCGCGGTT CTGGGCGGGC TGAGCATGGC GGCTTTGGCG
GTTCTGTTGC CGCTGAACGG CATGAAGGCG CGGCCAACAC TCGCCAAATG A
 
Protein sequence
MLRTYEWTMQ RHFLVLFCLA LTQITGWGVV GVLPVIATPV AAEFKTSLPS VFLGTSVMFV 
AMGLAAPWAG RAFRRFGTRQ VMAAGAGLIG LALCLLALSP NLPVFWVGWA LTGLAGAMFL
TTSAYAYVAE YAEDRARSLI GTLMLVTGLA GSVFWPITAF LDHLVGWRQV FVVYAGVMVF
IICPLVRFGL PVTGAAAAAT AHRRRGRWEP VLILLVAAIA LNSFVTFGVE AVGIKLLQSM
GMDLAGAVAI ASLLGVFKVG GRVIDLLGGQ KWDGLSTAIV SGAMIPMGLA TIWIGGAGIL
SVGGYLVLFG VGSGAFAVAR ATMPLVFFEK ADYTAAMATI ALPMNLINAL APPGIAALMA
GIGAQATFAV LGGLSMAALA VLLPLNGMKA RPTLAK