Gene Rleg_1098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1098 
Symbol 
ID8012221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1080251 
End bp1081477 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content64% 
IMG OID644823681 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002974932 
Protein GI241203836 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.150369 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.279355 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGAGA TTGCCGCCAA TATTTCCATC GATAGCCTGG ATGAAGAGCT GCCGTCGGCA 
ATGACCGTGG CGCTGGTCCA GTTGGCGCTC GCCTGCGGCG GCTTCGGCAT CGGCACCGGC
GAATTCGCAA TCATGGGATT GCTGCCCAAT GTCGCCGAAA CCTTCTCGGT CACCACGCCG
CAAGCCGGCT ACGTCATCAG CGCCTATGCG CTCGGAGTCG TCGTCGGCGC ACCTGTTATT
GCCGTGCTCG CTGCGAAGAT GGCGCGCCGC ACGCTGCTTT TAATGCTGAT GCTGATCTTT
GCCGCCGGCA ATATATCAAG TGCCATGGCG CCGACCTTCG AAAGCTTCAC GCTGCTGCGC
TTTGTCAGCG GCCTGCCGCA TGGCGCTTAT TTCGGCGTCG CGGCCCTTGT CGCCGCCTCA
ATGGTGCCGG TGCATCGCCG CGCCCGCGCC GTCGGCCGCG TCATGCTCGG CCTGACCGTT
GCGACACTTC TCGGCACGCC GTTGACGACC TTCTTCGGTC AGTCGCTCGA CTGGCAGGTG
GCGTTTTTCT CGGTCGGCGT GCTCGGCCTG CTAACGGTGG CGCTGATCTG GTTCTACGTT
CCAAAGGACA GGGTTTCCGA GGAGGCGGGC TTCCTGCGCG AACTCGGCGC TTTCCGCCGG
CCGCAGGTGT GGCTGACGCT CGGCATCGCC GCCGTCGGTT ACGGCGGCAT GTTCGCGATG
TTCAGCTATA TCGCCTCGAC AACGACTGAG GTGGCGATGC TGCCGGAGAC GGCCGTTCCG
ATCATGCTGG TCCTCTTCGG TGTCGGCATG AATGCGGGCA ATTTCATCGG TTCGTGGCTC
GCCGACAAAT CGCTGCTCGG CACGATCGGC GGGTCGCTGA TCTATAATGT CGTCGTGCTG
ACCACTTTCT CGCTGACCGC CGCCAATCCC TATATGCTCG GCCTTTGCGT CTTCCTCGTC
GGCTGCGGTT TTGCTGCTGG ACCGGCGCTG CAGACGCGGC TGATGGATGT CGCCGCCGAC
GCGCAGACGC TTGCGGCCGC TTCCAACCAT TCCGCCTTCA ACATCGCCAA TGCGATCGGC
GCCTGGCTCG GCGGCCTCGT CATCGCCGGG GGTTATGGTT TTGCGGCGAC CGGCTATGTC
GGTGCAGCAC TATCCTTCCT CGGCCTCTTC GTCTTTGCAG CCTCGCTACG CCTTGAGCGC
CGCGACCGGA GCACGCAGGC CGTCTGA
 
Protein sequence
MSEIAANISI DSLDEELPSA MTVALVQLAL ACGGFGIGTG EFAIMGLLPN VAETFSVTTP 
QAGYVISAYA LGVVVGAPVI AVLAAKMARR TLLLMLMLIF AAGNISSAMA PTFESFTLLR
FVSGLPHGAY FGVAALVAAS MVPVHRRARA VGRVMLGLTV ATLLGTPLTT FFGQSLDWQV
AFFSVGVLGL LTVALIWFYV PKDRVSEEAG FLRELGAFRR PQVWLTLGIA AVGYGGMFAM
FSYIASTTTE VAMLPETAVP IMLVLFGVGM NAGNFIGSWL ADKSLLGTIG GSLIYNVVVL
TTFSLTAANP YMLGLCVFLV GCGFAAGPAL QTRLMDVAAD AQTLAAASNH SAFNIANAIG
AWLGGLVIAG GYGFAATGYV GAALSFLGLF VFAASLRLER RDRSTQAV