Gene Rleg_4026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4026 
Symbol 
ID8014832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4102882 
End bp4104132 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content63% 
IMG OID644826595 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002977806 
Protein GI241206710 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.557727 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTTTT CGCCCACCGG AGACCGTTTT GCCGCGTTCC GGCACTCGTC CTATACGCGG 
TTCTTCTTCG CGCGCTTCCT GCTTTCCTTC TCGCAGCAGA TCGTCAGCGT CGCCGTCGGC
TGGCAGATGT ACGACCAGAC GGGCAGCGCG ATCTATCTCG GTTTGATTGG TCTCGTGCAG
TTCCTGCCGT CGCTGCTGCT CATCCTCGTC ACCGGTTCGG TAGCCGATCG GTACAATCGC
CGGGCGATCG CCGCCCTCTG CTCGCTGGTG AGCGCGCTCT GTACGCTGGC ACTGCTGGTT
ATGACTTTAA TGGGAAGCTT TACGCCGCTG CCTGTCTTCG CGGTGCTTTT GATCTTCGGC
ATCGAGCGCG CCTTCATGTC GCCGGCGGTA CAGTCGCTGG CGCCCAATCT GGTGCCGGAG
GAGGCACTCT CCAATGCGAT CGCCTGGAAT TCGTCGTCCT GGCAGCTCGC GGCAATCACC
GGACCGGTGC TCGGTGGCCT GCTCTATGGT GTCAGCGCGC CGACTGCCTA TACGGTGGCG
GTGATCTTTT CGGTGCTCGG TGCGGCCCTT CTCTACATGA TCCCGAAACC GGTGCAGAAG
ACGACCGGCG AGACCAAGAG CTGGGCGATG ATCCTCGGCG GCTTCAGTTT CATCCGTGCC
GAAAAGGTGG TGCTCGGGGC GATCTCGCTC GATCTGTTCG CCGTGCTGCT CGGCGGGGCC
ACGGCGCTGA TGCCGATTTT TGCGCGCGAT ATCCTCACCC TCGGTCCCTG GGGCCTCGGA
CTGCTGCGCG CCGCACCCGG ACTTGGCGCC ATCGTCATGG CGATCTTCCT GGCCGCCTAT
CCGCTCAGAC ATCGCGCCGG CATCTACATG TTTATCGGCG TCGCCCTGTT CGGCGTCGGA
ACGATCATCT TCGGCATCTC GACCAACACC GAGGTCTCGA TCGCGGCGCT AGCGCTAATG
GGGGCGGCTG ACATGGTATC GGTCTATGTG CGCGAGAGCC TGATTGCGCT CTGGACGCCG
GATCAGCTGC GCGGCCGCGT CAATGCGGTC AACATGGTCT TCGTCGGCGC TTCGAACGAG
CTTGGGGAAT TCAGGGCGGG CACGATGGCG GCGCTCTTCG GCGCTGTGCC GGCGGTCGTC
ATCGGCGGAA TCGGGACGCT TGTCGTGGCG GCGATCTGGG CGTCGAGTTT CCCCAAACTG
CGCGGGATCG ATACGCTCGA CGCGCCCAGC GCATCGTCGA AATCGATTTA A
 
Protein sequence
MSFSPTGDRF AAFRHSSYTR FFFARFLLSF SQQIVSVAVG WQMYDQTGSA IYLGLIGLVQ 
FLPSLLLILV TGSVADRYNR RAIAALCSLV SALCTLALLV MTLMGSFTPL PVFAVLLIFG
IERAFMSPAV QSLAPNLVPE EALSNAIAWN SSSWQLAAIT GPVLGGLLYG VSAPTAYTVA
VIFSVLGAAL LYMIPKPVQK TTGETKSWAM ILGGFSFIRA EKVVLGAISL DLFAVLLGGA
TALMPIFARD ILTLGPWGLG LLRAAPGLGA IVMAIFLAAY PLRHRAGIYM FIGVALFGVG
TIIFGISTNT EVSIAALALM GAADMVSVYV RESLIALWTP DQLRGRVNAV NMVFVGASNE
LGEFRAGTMA ALFGAVPAVV IGGIGTLVVA AIWASSFPKL RGIDTLDAPS ASSKSI