Gene Rleg_4238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4238 
Symbol 
ID8015021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4337631 
End bp4339019 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content61% 
IMG OID644826808 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002978017 
Protein GI241206921 
COG category[R] General function prediction only 
COG ID[COG2270] Permeases of the major facilitator superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.239966 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAATC GCATTGACTG GACAGGAACG CAACCGCCGA AGGCCACGGA GAAGGGCATC 
TGGGGGTGGA TGTTCTTCGA CTGGGCAGCA CAGCCCTTCT TTACCGTGGT CACAACCTTT
ATCTTCGGCC CCTATTTCGT TTCCCGCCTG ACCGATGACC CGGTTTCCGC GCAGACGACG
TGGAGCAACA TGGCGACGAT CTCCTCTGTG ATCATCGCCC TGCTCTCACC CGTTCTCGGT
TCGATCGCCG ACCAGTCCGG CGCACGCAAA CCTTGGATCG GCTTCTTCGC GATCATCAAG
ATCGCCAGCC TCTCCTGCCT GTGGTTTGCC GCACCCGGTT CGCCTATTGT CTATCCCGTC
ATTTTCATGA TCCTCGCCTC GATCTCGGCC GAGTTTTCGA TCGTCTTCAA TGATTCGATG
ATGCCGCGCC TGGTCAGCAA GCACGAAGTC GGCAAGCTTT CCAACACCGC CTGGGGGCTC
GGTTACCTCG GCGGCATCAT TGTGCTCATT GCCGTCGTGA CGCTTTTGGC GGCGAGCCCC
GAGACCGGCA AGACCATCCT CGGTCTCGAT CCGCTATTCG GCCTCGATCC TCAGACCGGT
CAGGATGCAC GCATCACCGG GCCGATCTCG GCCGTCTGGT ATCTGATCTT CATCCTGCCG
ATGTTCTTCT TTACGCCGGA TGTCGGCAGG GGTCTTCCCT TCGGCACCGC CGTCCGCTCC
GGCTTGCGGG AACTCAGAAA CACGCTTGGC GAACTCAGAG AACGCCGCGG CATCCTGACA
TTCCTCATCG CCCGCATGAT TTATCAGGAC GGCGTCAACG GCCTGCTGAT CCTTGGCGGT
ATCTTCGCGG CCGGCATGTT CGGCTGGGCG ACGATCGAGA TCGGTATCTA CGGCATCATC
CTGAATGTTG TCGCGATCTT CGGCTGCCTG ATCGCCGGCC GCGTCGACAA GAGCGTCGGT
TCGAAGGTGA CCGTCGTCAT CAGCCTCACC ATGCTGCTTC TCGCCACCAT CGGCATCATC
TCGACAGGAC CGGGTTACAC CCTATTCGGA CTGATGCCAC TGCCGACGGC CGATTCCGGC
GGCCTTTTCG GTACTGCCGC GGAAAAGGCC TATATCCTCT ATGGTTTGCT GATCGGGCTC
GCCTTCGGGC CGGTGCAGGC CTCGTCGCGC TCCTATCTCG CCCGCAGCGT CAGCCCGGAG
GAAGCCGGCC GCTACTTCGG CATCTACGCG CTTTCGGGCC GCGCCACCAG TTTCATGGCG
ACGCTGCTCT TCTCTCTGGT GACTTATATG AGCGGATCAC CGCGGCTCGG GATGGCAACG
CTGATCCTCT TTCTTGCCGG CGGACTGGTG CTCTTGTTCC GTACACCCTA TCCGGCCGCC
CGGGCATAG
 
Protein sequence
MLNRIDWTGT QPPKATEKGI WGWMFFDWAA QPFFTVVTTF IFGPYFVSRL TDDPVSAQTT 
WSNMATISSV IIALLSPVLG SIADQSGARK PWIGFFAIIK IASLSCLWFA APGSPIVYPV
IFMILASISA EFSIVFNDSM MPRLVSKHEV GKLSNTAWGL GYLGGIIVLI AVVTLLAASP
ETGKTILGLD PLFGLDPQTG QDARITGPIS AVWYLIFILP MFFFTPDVGR GLPFGTAVRS
GLRELRNTLG ELRERRGILT FLIARMIYQD GVNGLLILGG IFAAGMFGWA TIEIGIYGII
LNVVAIFGCL IAGRVDKSVG SKVTVVISLT MLLLATIGII STGPGYTLFG LMPLPTADSG
GLFGTAAEKA YILYGLLIGL AFGPVQASSR SYLARSVSPE EAGRYFGIYA LSGRATSFMA
TLLFSLVTYM SGSPRLGMAT LILFLAGGLV LLFRTPYPAA RA