Gene Rleg_2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2004 
Symbol 
ID8013039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1999307 
End bp2000737 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content64% 
IMG OID644824591 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002975822 
Protein GI241204726 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0685608 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGAAAT CGATCATTGC AGAGCGAAGC GAGGCCATCG CCGAACAGTC ACCCTCCGTA 
CGATGGGCGC TTGCCAGTCT TTCCCTCTCG ATGTTGTTGC CGTCACTCGG CACCAGCATC
GCCAATGTCG GCCTGCCGAG CTTGGCGCAA GCGTTCAATG CGTCCTTCCA GGATGTGCAG
TGGATCGTTC TCGCCTATCT CCTTGCCATC ACCACCCTGG TCGTCAGCGT CGGACGGCTC
GGCGACATCA CCGGCCGACG CCGGCTGCTG CTCATCGGCA TCCTGCTCTT CACGCTGGCC
TCGATCCTCT GCGGCGTCGC GCCGACTTTA TGGCTGATGA TTGCCGCGCG CGCCGTGCAG
GGGCTGGGTG CGGCCATCAT GATGGCGCTC ACCATGGCCT TTGTCGGCGA AACGGTGCCG
AAGGAAAAAA CCGGGAGCGT CATGGGGCTG CTCGGAACGA TGTCGGCGAT CGGCACCGCT
CTTGGGCCTT CGCTCGGCGG CCTGCTGATC GCCGGGCTCG GCTGGCCGGC AATCTTTCTC
GTCAACGTGC CGCTCGGCGT TCTGACCTTC GTTCTCGCCT ATCGCCATTT GCCCGCCGAT
ATCAGCAAGG GGAAGACGGA TCGGAAGGGC TTCGACTTGG CGGGCACACT GCTGCTTGCG
CTGACGCTTT CGGCCTATGC GCTGGCGATG ACGATCGGGC ACGGCCGCTT TGGGCCGCTG
AACGTCGCTC TGCTCTCGGC CGCCGTCTTC GGCACCGGTT TGTTCGTCTT AGCCGAGGCG
AGAGCGGCAT CGCCGCTGAT CCAGTTGACG GAATTTCGCA ATCGCGTCCT CAGCGCCAGC
CTGGCGATGA ACGCGCTGGT TTCGACTGTG ATGATGGCGA CGCTGGTGGT GGCGCCATTC
TACCTTTCCC ATGCGCTCGG GCTCAATCAA GCTCTCGTTG GCATCGTCAT GTCGATCGGG
CCCGTCATCT CCATCTTGAG CGGGGTCCCG GCTGGCCGCC TGGTCGACCG TCTGGACACG
TCCTTCGTGG TTGTCGCAGG GCTCGTTGCC ATGGCGGCAG CCTCCGTCGC CATGGCCGTG
CTGCCCGGGA TATCAGGCTA CATCGCCGCC ATCGCCATGC TGACACCCGG TTATCAGCTG
TTCCAGGCGG CCAACAACAC TGCCGTCATG GCGGATGTCC GTCCCGACCA GCGAGGCGTC
ATCTCCGGCA TGCTCAACCT GTCGCGCAAT CTCGGGCTGA TTACCGGCGC ATCCGTGATG
GGCGCCGTCT TCGCGCTCTC TTCGGGAGCA CCCGATATCA CAGCGGCGCA TCCCGAGGCC
GTCGCCTCAG GTATGCGGGT CACCTTCGCC GTTGCGGCAG CACTGATTAT CGTCGCGCTC
GCCATTGCGG TCGGGACTTA CCGCCGCCGC CGAGCGCTCG GGGAGAGTTG A
 
Protein sequence
MVKSIIAERS EAIAEQSPSV RWALASLSLS MLLPSLGTSI ANVGLPSLAQ AFNASFQDVQ 
WIVLAYLLAI TTLVVSVGRL GDITGRRRLL LIGILLFTLA SILCGVAPTL WLMIAARAVQ
GLGAAIMMAL TMAFVGETVP KEKTGSVMGL LGTMSAIGTA LGPSLGGLLI AGLGWPAIFL
VNVPLGVLTF VLAYRHLPAD ISKGKTDRKG FDLAGTLLLA LTLSAYALAM TIGHGRFGPL
NVALLSAAVF GTGLFVLAEA RAASPLIQLT EFRNRVLSAS LAMNALVSTV MMATLVVAPF
YLSHALGLNQ ALVGIVMSIG PVISILSGVP AGRLVDRLDT SFVVVAGLVA MAAASVAMAV
LPGISGYIAA IAMLTPGYQL FQAANNTAVM ADVRPDQRGV ISGMLNLSRN LGLITGASVM
GAVFALSSGA PDITAAHPEA VASGMRVTFA VAAALIIVAL AIAVGTYRRR RALGES