Gene Rleg_2477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2477 
Symbol 
ID8013452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2476691 
End bp2477941 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content64% 
IMG OID644825058 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002976288 
Protein GI241205192 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00891] putative sialic acid transporter 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.655762 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCTT TGGAAAGCCT GCGCCGGCTG ACGCCGCAGC AGCGCAACAC CGTCATCGCC 
AGCTATCTCG GCTGGACGCT CGATGCCTTC GATTTCTTCA TTCTCGTCTT CGTTCTCAAA
TATATCGCCG AGGAATTCCA CACCGACGTT CCTGCCGTCT CGGTTGCGAT CTTCCTGACG
CTAGCCATGC GGGCGCTCGG CGCGCTGATC TTCGGGCTGG CGGCCGACCG TTACGGGCGG
CGCATCACGC TGATGGCCGA CGTGCTGCTT TATTCGCTGT TCGAATTCCT GACCGGCTTC
TCCACGGGGC TCACGATGTT TCTCGTGCTC CGAGCGCTCT ACGGCATCGC CATGGGTGGT
GAATGGGGCG TCGGCGCCTC GCTTGTTATG GAGACGGTGC CGGAGGAAAG CCGCGGTATC
GTTTCCGGCA TCCTGCAGGC CGGCTATCCC TCGGGCTATC TGATCGCCTC GATCGCGTTC
TTCCTGCTCT TCCCGGTCAT CGGCTGGCGT GGCATGTTTT TTGTTGGCGC GGTGCCGGCG
CTGCTGGTGC TCTACATCAG ACGCAATGTC GAGGAGAGCC CAGCCTTCCT GAAGCGGAAG
GCCGAGGGGC GCCGGCCGTT CCTGACGGTC TTGCGCGAAA ACATTCCGCT GTTCATCTGG
GCGGTGCTCC TGATGACGGC CTTCAACTTC TTCAGCCACG GCACGCAGGA TATCTACCCG
ACCTTCCTCG AGACCCAGCG CAACTATTCG AGCTATACGG TCGGCGCGAT CGCCATCGTC
TACAATATCG GGGCGATCTG CGGCGGGCTG TTTTTCGGGG CACTGTCGCA GCGGATCGGC
CGGAAGAAGG CCATCGTCAT CGCCGCACTG ATCGCCGTGC CCGTCGCGCC GCTCTGGGCC
TATGCGCCGG GGCCGGTGCT GCTCGCCATC GGCGCTTTCC TGATGCAGTT CTTCGTCCAG
GGCGCTTGGG GCATCGTGCC GGTGCATCTG AACGAACTGT CACCCGACGA GGTGCGCGGC
ACCTTTCCTG GCTTCGCCTA CCAGCTCGGC AACCTGCTGG CCTCTGGCAA TGCTACGCTG
CAGGCGGGGC TGGCCGCCCG CTGGGACGGC GACTATGCCT ATGCGCTCCT GATCGTTGCG
GCCGTGGTGG CGCTCATCGT CGCGGCCCTT GCCGGCTTCG GCTACGAGAA GAAGGATGTC
CGCTTCGGCA CGGAGGAGGC CGAGGAACCG CATGGCGCGA TGCGAATCTA G
 
Protein sequence
MSALESLRRL TPQQRNTVIA SYLGWTLDAF DFFILVFVLK YIAEEFHTDV PAVSVAIFLT 
LAMRALGALI FGLAADRYGR RITLMADVLL YSLFEFLTGF STGLTMFLVL RALYGIAMGG
EWGVGASLVM ETVPEESRGI VSGILQAGYP SGYLIASIAF FLLFPVIGWR GMFFVGAVPA
LLVLYIRRNV EESPAFLKRK AEGRRPFLTV LRENIPLFIW AVLLMTAFNF FSHGTQDIYP
TFLETQRNYS SYTVGAIAIV YNIGAICGGL FFGALSQRIG RKKAIVIAAL IAVPVAPLWA
YAPGPVLLAI GAFLMQFFVQ GAWGIVPVHL NELSPDEVRG TFPGFAYQLG NLLASGNATL
QAGLAARWDG DYAYALLIVA AVVALIVAAL AGFGYEKKDV RFGTEEAEEP HGAMRI