Gene Rleg2_2187 details

Gene Information

Locus tag: Rleg2_2187
Symbol:
ID: 6980926
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 2242485
End bp: 2243735
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 64%
IMG OID: 643396906
Product: major facilitator superfamily MFS_1
Protein accession: YP_002281694
Protein GI: 209549777
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00891] putative sialic acid transporter
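
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2242485, 2243735)
print(length)                  # 1251
print(protein_length(length))  # 416
```

Both values match the Gene Length and Protein Length fields reported in the record.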


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0329536
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.242906
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCTT TGGAAAGCCT GCGCCGGCTG ACGCCGCAGC AGCGCAACAC CGTCATCGCC 
AGCTATCTCG GCTGGACGCT CGATGCCTTC GATTTTTTCA TTCTCGTCTT CGTTCTCAAA
TATATCGCCG AGGAATTCCA CACCGACGTT CCCGCCGTCT CGGTGGCGAT CTTCCTGACG
CTCGCCATGC GGGCGCTGGG CGCGCTGATC TTCGGGTTGG CGGCCGACCG CTATGGCCGG
CGCATCACGC TGATGGCCGA CGTGCTGCTC TATTCGCTGT TCGAATTCCT GACAGGTTTC
TCCACCGGCC TCACCATGTT CCTGGTGCTC CGGGCGCTCT ACGGCATCGC CATGGGCGGC
GAATGGGGCG TCGGCGCCTC GCTGGTCATG GAGACGGTGC CGGAGGAAAG CCGCGGCATC
GTCTCAGGCA TCCTGCAGGC GGGTTATCCC TCGGGCTATC TGATCGCCTC GGTCGTGTTC
TTCCTGCTCT TTCCCGTCAT CGGCTGGCGC GGCATGTTCT TCATCGGCGC GGCCCCGGCG
CTGCTGGTGC TCTATATCCG GCGGAACGTC GAGGAGAGCC CCGCCTTTCT GAGACGGCAG
GCCGAGGGGC GCCGGCCGTT CCTGACGGTG CTGCGCGAAA ATATTCCGCT GTTCATCTGG
GCGGTGCTCT TGATGACGGC GTTCAATTTC TTCAGTCACG GCACGCAGGA TATCTACCCC
ACCTTCCTCG AAACTCAGCG TAACTATTCG AGCTATACGG TCGGCGCCAT CGCCATCGTC
TACAATATCG GGGCGATCTG CGGCGGGCTG TTCTTCGGGG CTCTGTCGCA GCGGATCGGC
CGCAAGCGGG CGATTGTGAC GGCCGCACTG ATCGCCGTGC CCGTGGCACC TCTCTGGGCC
TATTCGCCGG GGCCGGTGCT GCTCGCCATC GGTGCCTTTC TCATGCAGTT CTTCGTCCAG
GGCGCCTGGG GCATCGTGCC GGTGCATCTG AACGAGTTGT CGCCGGACGA AGTGCGCGGC
ACCTTTCCCG GCTTCGCTTA TCAACTCGGC AACCTGCTGG CCTCTGGCAA CGCCACGCTG
CAGGCGGGGC TCGCCGCCCG CTGGAACGGC GATTACGCCT CAGCGCTGCT GATCGTCGCG
GCGGTGGTGG CGCTCGTCGT CGCCCTGCTC GCCGGCTTCG GCTACGAGAA GAAGGATGTT
CGCTTCGGCA TGGAGGAAGC CGAGGAGCCG CATGGCGCGA TGCGAATCTA G
 
Protein sequence
MSALESLRRL TPQQRNTVIA SYLGWTLDAF DFFILVFVLK YIAEEFHTDV PAVSVAIFLT 
LAMRALGALI FGLAADRYGR RITLMADVLL YSLFEFLTGF STGLTMFLVL RALYGIAMGG
EWGVGASLVM ETVPEESRGI VSGILQAGYP SGYLIASVVF FLLFPVIGWR GMFFIGAAPA
LLVLYIRRNV EESPAFLRRQ AEGRRPFLTV LRENIPLFIW AVLLMTAFNF FSHGTQDIYP
TFLETQRNYS SYTVGAIAIV YNIGAICGGL FFGALSQRIG RKRAIVTAAL IAVPVAPLWA
YSPGPVLLAI GAFLMQFFVQ GAWGIVPVHL NELSPDEVRG TFPGFAYQLG NLLASGNATL
QAGLAARWNG DYASALLIVA AVVALVVALL AGFGYEKKDV RFGMEEAEEP HGAMRI
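
The sequences above are laid out in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned up and how the GC content field could be rechecked. The helpers `clean_sequence`, `gc_content`, and `ends_with_stop` are hypothetical names introduced here; the stop codons TAA, TAG, and TGA are the ones used by translation table 11.

```python
def clean_sequence(text: str) -> str:
    """Join block-formatted sequence text into one uppercase string."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in {"TAA", "TAG", "TGA"}


# First line of the gene sequence from this record, as an example input.
first_line = "ATGTCCGCTT TGGAAAGCCT GCGCCGGCTG ACGCCGCAGC AGCGCAACAC CGTCATCGCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_content(seq), 2))
```

Passing the full gene sequence through `clean_sequence` and `gc_content` should give a value close to the reported 64%, and `ends_with_stop` can confirm that the sequence ends in a stop codon (here `TAG`).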