Gene Rleg_5075 details


Gene Information

Locus tag: Rleg_5075
Symbol:
ID: 8007668
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 458813
End bp: 460174
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 62%
IMG OID: 644821990
Product: major facilitator superfamily MFS_1
Protein accession: YP_002973250
Protein GI: 241113415
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
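
The coordinate and length fields in this record agree with one another, and the arithmetic can be sketched in a few lines of Python. This is a minimal illustration rather than anything the database itself runs: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record.
length = gene_length(458813, 460174)
print(length)                  # 1362
print(protein_length(length))  # 453
```

Both results match the Gene Length (1362 bp) and Protein Length (453 aa) fields.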


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.550975
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.411558
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTCAA TCGAACAAAA CTCCAGGGCT TACGCACAGT TGAAGCCGGC CTCTCTCGCC 
ATTCTCTGCC TCGGCGTGAT CGTGGCGCAG GTCGATACAT CGGTCGTTAA TCTGGCGGTT
CAGCCAATCG GACTCGATCT CAAAGCCTCC GTCACCGAGC TGCAATGGGT CGTCGACGCC
TACAATCTCG TTTACGCAGC ACTTCTCATC AGCGGCGGCT TGTTCGCCGA TCTTTACGGC
CGACGTCTGA TGTTCGTGAT CGGCTGCGCT GTCTTTGCGC TGGCGAGCCT TGGCTGCGCA
TTCGCCTCGA CCATCGCGAT CCTCATCGCC GCGAGAGCCT TGACCGGCTT CGGCTCCGCC
CTCCTGCTGC CCGCTTCGCT TTCGCTGATC CGGGTCATCT ACCGGGATGA GAAGGTCCGT
GCCCGAGCGC TCGGGATCTG GGCCGGCTGC AATGGAATGT CGCTTGCGAT CGGCCCAAGC
CTCGGCGGCT TCCTCATCCG TGATTTCGGC TGGCGATCCG TCTTCTTCGT CGTCATCCCC
ATCGCGCTGA TCGCGGCGGC GGCGGCACGG TTTTTCGTTC CGGAAAGCGC CGATCGGCAA
GGCCGGTCCT TCGACATGCC GGGTCAGTTG CTGGGGATAG CATCCCTCAC CGTTCTTACC
CTCACGGCGA TCGAATCATT GCATCTGCCG CCTTTGTGGA CGGCCCTCCT GGCCATCGCC
GGCGCCCTGC TTCTGCTGCT CTTCATCATC GTCGAAAAGC GCCTGGAGCA GACGGCTCTG
GTTCCGATCT CGATGTTTTC GGGCAGGCAG TTTCGCGGTG CGATGGCCGG AACGGCGGCA
ATGACGTTCG GCATGTATGG CACGCTCTTT CTCTTTCCAC TCGCCTCCCT CAGCCTTCGA
CGCCTGGCCT CGGTGGAGGT CGGGCTGTCT CTGCTGCCAA TGGCCATCAG CTTCATCGCG
ATCTCACCCT TCTCCGGTTC GATCTCGGAG CGCCTGGGGA AGAAACGCAC CATATCGGCG
GGACTGGCGC TGATGGGTTT GGGCAACCTT CTGCTCGGCT CATCCTTTCT GGCCGATTGG
TTCATTGCCG AAGAGGTCGG ATTGTTGCTG ACCGGGGTCG GGATGGGCAT GGCGACGGGG
CCTCTGACGG CGGTCGCGGT TTCAACCGTG GCGGCCGATC GCGCCGGCAC GGCGAGCGCC
CTGATCAATG TTGCCCGCAT GGTCGGGGCG ACGATAGGCG TCGCCTTACT GGGAGCGATC
TTCGCCTTTT TGGGAGAAGC AGAAACGGCC TTCATCGTCG CGATGTCGGT CGGCGGCAGC
ACGCAACTGC TTGGAAGCCT CGCTGCCTGG CGCTTGCTCT GA
 
Protein sequence
MNSIEQNSRA YAQLKPASLA ILCLGVIVAQ VDTSVVNLAV QPIGLDLKAS VTELQWVVDA 
YNLVYAALLI SGGLFADLYG RRLMFVIGCA VFALASLGCA FASTIAILIA ARALTGFGSA
LLLPASLSLI RVIYRDEKVR ARALGIWAGC NGMSLAIGPS LGGFLIRDFG WRSVFFVVIP
IALIAAAAAR FFVPESADRQ GRSFDMPGQL LGIASLTVLT LTAIESLHLP PLWTALLAIA
GALLLLLFII VEKRLEQTAL VPISMFSGRQ FRGAMAGTAA MTFGMYGTLF LFPLASLSLR
RLASVEVGLS LLPMAISFIA ISPFSGSISE RLGKKRTISA GLALMGLGNL LLGSSFLADW
FIAEEVGLLL TGVGMGMATG PLTAVAVSTV AADRAGTASA LINVARMVGA TIGVALLGAI
FAFLGEAETA FIVAMSVGGS TQLLGSLAAW RLL
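
The protein sequence is the translation of the gene sequence under translation table 11. The Python sketch below shows how the two relate. It is an illustration under stated assumptions, not the pipeline that produced this record: the names `CODON_TABLE` and `translate` are hypothetical, and it relies on the fact that table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in which codons may act as start codons).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGAATTCAA TCGAACAAAA CTCCAGGGCT"))  # MNSIEQNSRA
```

Passing the full gene sequence to `translate` in the same way should give the complete protein; the final codon, TGA, is a stop codon and contributes no residue.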