Gene Rleg2_3914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3914 
Symbol 
ID6982678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4059839 
End bp4061227 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content62% 
IMG OID643398637 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002283402 
Protein GI209551485 
COG category[R] General function prediction only 
COG ID[COG2270] Permeases of the major facilitator superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.133818 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAATC GCATTGACTG GACAGGAACA CAGCCGCCGA AAGCCACGGA GAAGGGCATC 
TGGGGGTGGA TGTTCTTCGA TTGGGCAGCC CAGCCCTTCT TTACCGTGGT CACGACCTTC
ATCTTCGGTC CCTATTTCGT TTCCCGTCTG ACCGATGACC CGGTTTCCGC GCAGACGACG
TGGAGCAACA TGGCGACGAT CTCCTCGGTG ATCATCGCCC TGCTTTCGCC CATCCTCGGC
TCGATCGCCG ACCAGTCGGG CGCGCGCAAA CCCTGGATCG GCTTTTTTGC GATCATCAAG
ATCGTCAGCC TGTTCTGCCT GTGGTTCGCA GCCCCCGGCT CGCCTGTTCT TTATCCGGTC
ATTTTCATGA TCCTTGCCTC GATCTCGGCC GAGTTTTCGA TCGTCTTCAA CGATTCGATG
ATGCCGCGGC TGGTCGCCAA ACACGAGGTC GGCAAGCTCT CCAACACAGC CTGGGGGCTC
GGTTATCTCG GCGGCATCAT CGTGCTCATT GCCGTCGTGA CGCTTTTGGC GGCGAGCCCG
GAGAGCGGCA AGACCATCCT CGGCCTCGAT CCGCTCTTCG GTCTCGATCC TCGGACCGGC
CAGGATGCAC GCATCACCGG GCCGATCTCG GCCGTCTGGT ATCTGATCTT CATCCTGCCG
ATGTTCTTCT TCACGCCGGA TGTCGACAGA GGTCTTCCGT TCGGCACCGC CGTCCGTGCC
GGCCTGCGGG AAGTGAAAAA CACGCTTGGC GAACTCAAGG AACGCCGCGG CATCCTGAGA
TTTCTCATCG CCCGGATGAT CTATCAGGAC GGCGTCAACG GCCTGCTGAT CCTCGGCGGT
ATCTTCGCCG CCGGCATGTT CGGCTGGGCA ACGATCGAGA TCGGCATTTA CGGCATCATC
CTGAATGTGG TCGCGATCTT CGGCTGCCTG ATCGCCGGCC GCATCGACAA GGGTGTCGGG
TCGAAGGTGA CCGTCGTCAT CAGCCTCACC ATGCTGCTTC TCGCCACCAT CGGCATCATC
TCGACAGGGC CGGGTTACAC CCTGTTCGGC CTGCTGCCGC TGCCGACGGC GGATTCTGGC
GGCCTCTTCG GCACCGCGGC GGAAAAAGCC TATATCCTCT ACGGCCTGCT GATCGGGTTC
GCCTTCGGGC CGGTGCAGGC CTCGTCGCGC TCCTATCTCG CCCGCAGCGT CAGCCCTGAG
GAAGCCGGCC GCTACTTCGG CATCTACGCG CTTTCGGGGC GCGCCACGAG TTTCATGGCG
ACGCTCCTTT TCTCGCTCAT GACCTATATG AGCGGGTCAC CGCGGCTTGG AATGGCCACA
CTGATCCTGT TTCTCGCCGG TGGCCTGGTG CTGCTCGTCC GCACGCCCTA TCCGGCCGAT
CGCGCGTAG
 
Protein sequence
MLNRIDWTGT QPPKATEKGI WGWMFFDWAA QPFFTVVTTF IFGPYFVSRL TDDPVSAQTT 
WSNMATISSV IIALLSPILG SIADQSGARK PWIGFFAIIK IVSLFCLWFA APGSPVLYPV
IFMILASISA EFSIVFNDSM MPRLVAKHEV GKLSNTAWGL GYLGGIIVLI AVVTLLAASP
ESGKTILGLD PLFGLDPRTG QDARITGPIS AVWYLIFILP MFFFTPDVDR GLPFGTAVRA
GLREVKNTLG ELKERRGILR FLIARMIYQD GVNGLLILGG IFAAGMFGWA TIEIGIYGII
LNVVAIFGCL IAGRIDKGVG SKVTVVISLT MLLLATIGII STGPGYTLFG LLPLPTADSG
GLFGTAAEKA YILYGLLIGF AFGPVQASSR SYLARSVSPE EAGRYFGIYA LSGRATSFMA
TLLFSLMTYM SGSPRLGMAT LILFLAGGLV LLVRTPYPAD RA