Gene Rleg2_0949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0949 
Symbol 
ID6979667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp971851 
End bp973068 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content66% 
IMG OID643395660 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002280469 
Protein GI209548552 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAGA TCGCCATCAA TGATTCCATC GGAAGCCAGC CGGATGACGA GCTGCCGTCG 
GCAACGACCG TCGCCCTGGT TCAGCTGGCG CTCGCCTGCG GCGGCTTCGG CATCGGCACC
GGCGAATTCG CGATCATGGG GCTGCTGCCG AATGTCGCCG ACACCTTCTC GGTGACGACG
CCGCAGGCCG GCTACGTCAT CAGCGCCTAT GCGCTCGGCG TCGTCATCGG CGCGCCGGTT
ATCGCCGTGC TCGCCGCGAA AATGGCGCGC CGCACGCTGT TGCTGACACT GATGCTGATC
TTTGCCGCCG GCAATATCTT CAGCGCCATG GCGCCGACCT TCGAAACCTT CACGCTGCTG
CGCTTCGTCA GCGGCCTGCC GCATGGCGCC TATTTCGGCG TCGCGGCGCT GGTCGCCGCC
TCGATGGTGC CGGTGCATCG CCGCGCGCGG GCCGTCGGCC GCGTCATGCT CGGCCTGACC
GTCGCGACGC TTCTCGGCAC GCCCTTGACG ACATTCTTCG GCCAGTCGCT CGACTGGCAG
GTCGCATTTT TCTCCGTCGG CGTGCTCGGC CTGCTGACGG TTGTGCTGAT CTGGTTCTAC
GTTCCCCAGG ACAGGGTTTC CAAAGAGGCA AGCTTCCTGC GCGAACTCGG CGCCTTCCGC
CGGCCGCAGG TGTGGTTGAC GCTCGGCATC GCCGCCGTCG GCTACGGCGG CATGTTTGCG
ATGTTCAGCT ATATCGCCTC GACGACGACC GAAGTCGCAT TGCTGCCGGA AACGGCCGTT
CCGATCATGC TGGTGCTCTT CGGCGTCGGC ATGAATGCCG GCAATTTCAT CGGCTCGTGG
CTGGCCGACA AATCGCTGCT CGGCACGATC GGCGGCTCGC TCGTCTACAA TATCGTCGTG
CTGACCACCT TCTCGCTGAC CGCTGCCAAC CCCTATATGC TAGGCCTCTC GGTCTTCCTG
GTCGGCTGCG GTTTTGCCGC CGGCCCGGCG CTGCAGACCC GGCTGATGGA TGTCGCCGCC
GATGCGCAGA CGCTTGCCGC CGCCTCCAAC CATTCCGCCT TCAACATCGC CAATGCGATC
GGCGCCTGGC TCGGCGGCCT CGTCATCGCC TGGGGTTACG GTTTCGCCGC CACCGGTTAT
GTCGGCGCAG CACTTTCCTT CCTCGGCCTG TTCGTCTTCG CCGCCTCCGC ACGGCTGGAG
CGCCGCGCCG GCGCATAA
 
Protein sequence
MSEIAINDSI GSQPDDELPS ATTVALVQLA LACGGFGIGT GEFAIMGLLP NVADTFSVTT 
PQAGYVISAY ALGVVIGAPV IAVLAAKMAR RTLLLTLMLI FAAGNIFSAM APTFETFTLL
RFVSGLPHGA YFGVAALVAA SMVPVHRRAR AVGRVMLGLT VATLLGTPLT TFFGQSLDWQ
VAFFSVGVLG LLTVVLIWFY VPQDRVSKEA SFLRELGAFR RPQVWLTLGI AAVGYGGMFA
MFSYIASTTT EVALLPETAV PIMLVLFGVG MNAGNFIGSW LADKSLLGTI GGSLVYNIVV
LTTFSLTAAN PYMLGLSVFL VGCGFAAGPA LQTRLMDVAA DAQTLAAASN HSAFNIANAI
GAWLGGLVIA WGYGFAATGY VGAALSFLGL FVFAASARLE RRAGA