Gene Rleg2_0840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0840 
Symbol 
ID6979558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp859456 
End bp860697 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content65% 
IMG OID643395551 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002280360 
Protein GI209548443 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATA CGGAATTGGA CGCGGCGAGC GTGAGCGCAG GCTCGATCTA CTGGAAGCGC 
AACCTGGCGA TCTCGCTGAT CGGCTCCTTC ACCACCATCG TGGCGATGAC GCTGCTGCTG
CCTTTCCTGC CGCTTTATGT CGAGGAGCTC GGCGTCAGCG ATCATGCGGC GATTGTGCAA
TGGTCTGGCA TCGCCTATGG CGCCACCTTT CTGGCTGCGG CTCTGGTCGC GCCGCTCTGG
GGGCGGCTCG GCGATATTTA CGGGCGCAAG CTGATGCTGG TGCGCGCGAG CCTCGGCATG
ACGCTGGCGA TCTCGTTGAT GGGCATGGCC GGCAATATCT GGCAGCTGGT GGCGCTGCGC
CTCTTCGTCG GGCTTGCCGG CGGTTATTCC TCCGGCTCGA TGGTGCTGGT GGCGACGCAG
ACGCCGAAAG ACCGCTCGGC CTGGGCGCTC GGCCTCCTCT CCTCCGGTAT CATGGCCGGC
AATCTCGTCG GGCCGCTGAT CGGCGGAGCG CTGCCGCCGC TGATCGGCAT CCGAGGCACC
TTTCTCGCCG CCGGGGCGAT GATCTTCCTC GCCTTTCTCG CCACGACCTT TTTGATCAAA
GAGGAGAAGT CGCCGGCCCG CAAACAGGCG GCCAAGGCGA GCGGCGGCTG GAAATCCATC
GCCGACAAAC GGCCTGTCAT CGCCATGCTG GCGACCGGCA TGCTGTTGAT GTTCGCCAAT
ATGTCGATCG AGCCGATCAT CACCGTCTAT GTCGCGCAGA TCGTGCCGGC CGCCGATGCG
GTGACGATAA TCTCCGGCAT CGTCATGTCG GCGGCCGCCC TCGGCAGCAT TCTTTCGGCC
TCATGGCTCG GCAAGCTTGC CGACAGGATC GGTCATTGGC CGGTGATTTC AGGCGCGCTC
GCCGTCGCCG GGCTGCTGCT GATCCCGCAG GCCTTCGTCA CCAGCGCCTG GCAGCTGATC
ATCCTGCGCT TCCTGATGGG CGCGGCGCTC GGCGGACTGC TGCCCTGCAT CGCCGCTGTC
ATCCGTCACA GCGTGCCGGA CAGTGCGGCC GGCAGCATTC TCGGGTTTTC CATTTCCTCG
CAATATGTCG GCCAGGTGGC CGGCCCGATC CTCGGCGGCT TCGTCGGCGG GCATATCGGC
ATGCGGGCGG TTTTCCTCGG CACCTCGGTG CTGCTTGTCG CCGGTGCTGC CTATGCCTGG
CTGGTGAGGC CAACAGATAA GCAGACATCC CACCTGGATT AG
 
Protein sequence
MTDTELDAAS VSAGSIYWKR NLAISLIGSF TTIVAMTLLL PFLPLYVEEL GVSDHAAIVQ 
WSGIAYGATF LAAALVAPLW GRLGDIYGRK LMLVRASLGM TLAISLMGMA GNIWQLVALR
LFVGLAGGYS SGSMVLVATQ TPKDRSAWAL GLLSSGIMAG NLVGPLIGGA LPPLIGIRGT
FLAAGAMIFL AFLATTFLIK EEKSPARKQA AKASGGWKSI ADKRPVIAML ATGMLLMFAN
MSIEPIITVY VAQIVPAADA VTIISGIVMS AAALGSILSA SWLGKLADRI GHWPVISGAL
AVAGLLLIPQ AFVTSAWQLI ILRFLMGAAL GGLLPCIAAV IRHSVPDSAA GSILGFSISS
QYVGQVAGPI LGGFVGGHIG MRAVFLGTSV LLVAGAAYAW LVRPTDKQTS HLD