Gene Rleg2_2685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2685 
Symbol 
ID6981429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2725778 
End bp2726998 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content66% 
IMG OID643397398 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002282182 
Protein GI209550265 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.188987 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATAT CCCAAGGCCC GCGGAGCGTT CTTCGCCACC CCGGCTATCT GAACTTCGCT 
GCCTCCCGCG TCTTTTCCTC GCTTGCCTTC CAGTCCATCG GCATCGCCAT GGGCTGGATG
ATCTATGATC AGACGCACAG CGCCTTTGCG CTCGGCCTCG TCGGCCTCTG CCAGTTCCTG
CCGATGGCGG TGCTGACCTT TGTCGTCGGC CATGTCGCCG ACCGGTTCGA CCGGCGGCGT
ATCGGCCTCA TCTGCCAGCT GATCGAGGCG GTGACGGCGC TGGTGCTGGC GGTTGCCACC
TGGCAGCAAT GGCTGACGCC TTCAGGCATC CTTGTCGCCG TCACGGTGCT TGGCGCCGTT
GTCGCCTTCG AGCGGCCGAC CATGGCGGCT CTGCTGCCGA ACATCGTGCC GGCTTCGATG
CTGCAGAAGG CGGTCGCCAC CTCCACCTCG CTGATGCAGA CGGCGCTGAT CATCGGCCCC
TCGCTCGGCG GCTTGCTCTA CGGCCTCCAC CCGGTGGCGC CCTTTGCCGT GTCGGCGCTG
CTCTTTGCCG TTGCAAGCTT CAACGTCATC TCGATCCGCA TGCAGTGGGC GCCTTCCAAA
CGCGAGCCGG TGACGCTCGC CTCGGTCTTT GCCGGCGTCT CCTTCATCCG CAGCCGGCCG
GTGATGCTCG GCACGATCTC GCTCGATCTC TTCGCGGTGC TGCTCGGCGG CGCCACGGCA
CTGCTGCCGA TGTTTGCCCG CGATATCCTG CATGCCGGTC CCTGGGAGCT CGGCCTTTTG
CGCGCCGCAC CGGCGATCGG CGCGCTTGCC ATGTCGATCG TGCTCGCCCG CCGGCCGCTC
GAGAGCAATG TCGGGCGCAA GATGCTTGCT GCCGTCGCCG TGTTCGGCCT CGCCACCATC
GTCTTTTCGC TGTCCACCAA CATCACGCTT TCGGTCGCTG CCCTGCTTGT TGTCGGCGCG
TCGGATACGG TCAGCGTCGT CGTGCGCAGT TCGCTGGTGC AGCTTCTGAC GCCGGATGAG
ATGCGCGGCC GCGTCAGCGC GGTCAACTCG CTGTTCATCG GCACCTCCAA CCAGCTCGGC
GAATTCGAAT CCGGCATGAT GGCGGCAGCC CTCGGGCCGG TCGCCACCGG CATCGTCGGC
GGGCTCGGCA CGATCATCGT CGTGCTCCTG TGGATGAGGC TCTTCCCCGA TCTTACCAAG
GTCAAGACGC TGCAGGGCTG A
 
Protein sequence
MDISQGPRSV LRHPGYLNFA ASRVFSSLAF QSIGIAMGWM IYDQTHSAFA LGLVGLCQFL 
PMAVLTFVVG HVADRFDRRR IGLICQLIEA VTALVLAVAT WQQWLTPSGI LVAVTVLGAV
VAFERPTMAA LLPNIVPASM LQKAVATSTS LMQTALIIGP SLGGLLYGLH PVAPFAVSAL
LFAVASFNVI SIRMQWAPSK REPVTLASVF AGVSFIRSRP VMLGTISLDL FAVLLGGATA
LLPMFARDIL HAGPWELGLL RAAPAIGALA MSIVLARRPL ESNVGRKMLA AVAVFGLATI
VFSLSTNITL SVAALLVVGA SDTVSVVVRS SLVQLLTPDE MRGRVSAVNS LFIGTSNQLG
EFESGMMAAA LGPVATGIVG GLGTIIVVLL WMRLFPDLTK VKTLQG