Gene Rleg2_1810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1810 
Symbol 
ID6980548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1859508 
End bp1860983 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content66% 
IMG OID643396532 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002281321 
Protein GI209549404 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.174923 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00284389 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTTGCCG GCGGCTCCCA TTCCGGAGCG AGAAAAACGG AGTCTACGGT GAAACCGATC 
ATTGCAGGGC CCGGCGAGGC CGCCGCCGAA CGCCCCTATT CCGCCCGCTG GGCGCTCGCC
AGCCTTTCCC TCTCGATGCT GCTGCCATCG CTCGGCACCA GCATCGCTAA TGTCGGCCTG
CCGAGCCTGG CGCGGCGGTT CGATGCCGCC TTCCAGGATG TCCAGTGGAT CGTGCTCGCC
TATCTCCTTG CCATTACCAC TTTGATCGTC AGCGTCGGGC GGCTCGGCGA TGTCACCGGC
CGGCGACGGC TGCTGCGCAT CGGTATCCTG CTCTTCACAC TGGCCTCGGT CCTCTGCGGC
CTTGCGCCGA CGCTGTGGCT GATGATTGCG GCGCGCGCGC TGCAAGGTCT GGGCGCGGCG
ATCATGATGG CGCTGACCAT GGCCTTTGTC GGTGAAACGG TGCCGAAGGC GAGGATCGGC
AGCGCCATGG GGCTGCTCGG AACGATGTCG GCGATCGGCA CCGCTCTTGG GCCCCCACTC
GGCGGTCTGC TGATCGCCTA TTTCGGCTGG CCGGCGATCT TCCTCGTCAA CGTGCCGCTC
GGCCTGGCGA CCTTCGTTCT CGCCTTTCGC TGTCTGCCGG ACGATATCGG CGGGAGGAAG
AAGGATCGGG CCGGCTTCGA CAGAGTGGGC ACGCTGCTGC TTGCCCTGAC GCTGTCGGCC
TATGCGCTGG CGATGACGAT CGGGCATGGC AGCTTTGGCT TGCTGAACTT CGCCCTGCTG
CTCATGGCCG GTTTCTGCGC CGGTCTCTTC GTTTTCACCG AGGCGAGAAC GGCATCGCCG
TTGATCCAGC TGGCGGTGTT CCGCGACCGC GTGCGCACCG CCAGCCTGGC GATGAACGGG
CTCGTCTCGA CCGTGATGAT GGCGACGCTG GTGGTCGCGC CCTTCTATCT CTCCCGTGCG
CTCGGGCTCA ACGAGGCGCT GGTTGGCGCC GTCATGTCGA TCGGTCCGGT GATCTCCATC
CTCAGCGGTG TGCCGGCCGG CCGCCTCGTC GATCGTCTGG ACGCGCCGTT GGTGGTTGCC
GCAGGGCTCG TCACCATGGC CGCCGGTTCC ATCGCTCTCG CCGTGCTGCC CGGAATTGCC
GGCTATATCG CCGGCATCGC CCTGCTGACG CCTGGCTATC AGCTGTTCCA GGCAGCCAAC
AACACAGCCG TCATGGGAGA TGTGCACCCC GACCAGCGCG GCGTCATTTC CGGCATGCTC
AACCTGTCGC GCAATCTCGG GCTGATTACC GGCGCATCCG TCATGGGAGC CGTGTTCGCG
CACGGGTCGG GGAGCTCAGA GATCGCCGCG GCACGTCCCG AGGCCGTTCA TTCAGGCATG
CAGATCACCT TCGGCGTGGC AGCAGCATTG ATCGCCGCCG CGCTGACCAT CGCGGCCGGG
ACCTACCGGT GCCGAACCAG TTCCGAAGGG ACATGA
 
Protein sequence
MFAGGSHSGA RKTESTVKPI IAGPGEAAAE RPYSARWALA SLSLSMLLPS LGTSIANVGL 
PSLARRFDAA FQDVQWIVLA YLLAITTLIV SVGRLGDVTG RRRLLRIGIL LFTLASVLCG
LAPTLWLMIA ARALQGLGAA IMMALTMAFV GETVPKARIG SAMGLLGTMS AIGTALGPPL
GGLLIAYFGW PAIFLVNVPL GLATFVLAFR CLPDDIGGRK KDRAGFDRVG TLLLALTLSA
YALAMTIGHG SFGLLNFALL LMAGFCAGLF VFTEARTASP LIQLAVFRDR VRTASLAMNG
LVSTVMMATL VVAPFYLSRA LGLNEALVGA VMSIGPVISI LSGVPAGRLV DRLDAPLVVA
AGLVTMAAGS IALAVLPGIA GYIAGIALLT PGYQLFQAAN NTAVMGDVHP DQRGVISGML
NLSRNLGLIT GASVMGAVFA HGSGSSEIAA ARPEAVHSGM QITFGVAAAL IAAALTIAAG
TYRCRTSSEG T