Gene Rleg_4854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4854 
Symbol 
ID8007242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp232142 
End bp233329 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content62% 
IMG OID644821784 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002973044 
Protein GI241113209 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.011677 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.12131 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTCG AGTTGACATT GAATGAGACC TCACTGACGG ACGACACGGC GACCTCGTGG 
TCTGCTGTCA TCTGTCTCAC GCTTCTTACT TTCCTTCTGG TGGGGTTGGA GTTTCTCCCT
GTGAGCCTTC TGACCCCGAT CGCCAGGGAC CTCTCGGTGT CAGAGGGGCA GGCCGGATTG
GCGATCACCG TGTCGGGGGT CTTCGCGGTT GTCACCAGCC TTTTCGGCAA CGCCTTCCTG
GCGAAGATCG ACCGGAAGTC TGTCTTCTTG CTTTATACCG CGGTTCTTGT TGTATCGAGC
TTGGCGGTTG CCCTCGCACC CAACTTCCTT GTCTTTCTCG TCGGGCGGTC CCTGGTCGGC
GTGTCGATCG GCGGCTTCTG GTCGCTGTCG ACGGCTATTC TGGCACGCCT GACGTCAGAC
CGTGACCTGC CCAAGGCCAT TGCGCTTCTT CAAGGTGGCA CCGCATTCGC CCTCGTCCTT
GCCGCGCCGC TCGGCAGTTT TCTTGGCGGG TTGATCGGAT GGCGCGGAAC CTTCTTCATC
ACGGTACCGA TTGGATTTGC CGCGCTCGTC TGGCAACTGG TCGTCCTGCC GAGGATGCCG
GCGACATCAA CCGTCTCGGT GGCCAGAATA TTCGGGCTGC TGCGCAATCG CACATTCGCG
ATTGGAATGG CGGCGACCGC TCTCGCCTTC ATCGGCCAGA ACGCGCTGTC TATTTATCTT
CGTCCGTTCC TCGAAGGCGT CACAGGACTG GAATTGGATG TTCTGTCCAT GGTGCTTCTC
GGCCTCGGCG TCGGCGGACT GGCTGGAACC TCCGTCATTG GCTTCGCCGC CCGGCGCCAC
CTCCTCTCCG TTCTCGTAGG CCTGCCGGCT GCTCTTTCGG TCCTTGCCCT GCTGCTGATC
GCTCTCGGGC CGTTCGCGGC GGTTACCGCA TCCCTGCTTG TCATGTGGGG ATTTTTCTCG
ACGCCGATTC CGGTCGCCTG GAACACCTGG ATGGCCGCTA TCGTCCCCGG TGAGTTGGAA
GCGGCGGGTG GGCTGCAGGT GGCGCTGATC CAACTTGCCA TTGCCGGCGG CGCTTTCGCT
GGCGGCATGC TGTTCGACAC CGTGGGATGG TGGAGCACCT TCCTTCTGGC TGCCTGCCTT
CTTGCCGGTT CGGCAGTCCT TGCCGCGCTC GCCGGTCGCC GTTCCTGA
 
Protein sequence
MSVELTLNET SLTDDTATSW SAVICLTLLT FLLVGLEFLP VSLLTPIARD LSVSEGQAGL 
AITVSGVFAV VTSLFGNAFL AKIDRKSVFL LYTAVLVVSS LAVALAPNFL VFLVGRSLVG
VSIGGFWSLS TAILARLTSD RDLPKAIALL QGGTAFALVL AAPLGSFLGG LIGWRGTFFI
TVPIGFAALV WQLVVLPRMP ATSTVSVARI FGLLRNRTFA IGMAATALAF IGQNALSIYL
RPFLEGVTGL ELDVLSMVLL GLGVGGLAGT SVIGFAARRH LLSVLVGLPA ALSVLALLLI
ALGPFAAVTA SLLVMWGFFS TPIPVAWNTW MAAIVPGELE AAGGLQVALI QLAIAGGAFA
GGMLFDTVGW WSTFLLAACL LAGSAVLAAL AGRRS