Gene Rleg_1600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1600 
Symbol 
ID8012675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1591350 
End bp1592579 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content58% 
IMG OID644824186 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002975427 
Protein GI241204331 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATT CAGTTGCAGC ACAGAAAGTC GATGGCGGCG AGTTTGCCGT TTTGTGCATC 
GGAGGGTTTC TTCTTTCGAT TGCCTATGGC GTGACCTTTC TGATCCCGGT GCTGGTTGGC
CAGCGTGGTG GCGACGAGGC GCTGGCCGGC CTGATCATTT CGGCTGCGAC TGTCAGCACC
GTCATTCTCG TGATCCTGTC CGGCCACATC GCGGATGCCA TTGGCTCGGC GCGGGCGGTG
GCTGTTTCGG GATTGTTTCT TGCCGCGTCA GCGCTGGGAT TCGCCATGGT TCCCTCAGCT
GGGCTGAGCC TGATGGTGGT CGGTTTCATC CTCGGCATAG GCTGGGGCAC CTTCTATGCG
CTCGGCCCTA TTCTGGTCGC CGCGATCGTC GAACCCGAAC ATCGGATCCG GTTTTTTGCC
CTGCTTTCGG GATCGATGAT GTCGGGCATC GGAGCCGGAC CGATCATTGG CCGCATTGCT
ACCAGCTGGT CGATGCCGAT CGAGGCGGCC TTCGCATTTG CGTTCCTTGC CAGTCTCGCC
GGCGGTGCGC TTTACTTTTT GCTCCATATC CGGCTGACCA ATGCCGGCAA GATTTTGCCC
CATGTTAACA AGATCTCATT TGGCTCAGCA CGCGAGGTGA TCGGCTCGCG GGCCATCTAT
TCCATCGTCA TGGTCGGCAT CGGCGGCGCG ATTTTCGGCG GGCTGTCCAG CTTTCAGACC
AGCTACGCCA AGGCGCACGG ATTCGACTAC TCCCTGTTCT TCATCGGCTT CACGTCCGCC
GCGATTCTGA GCAGGCTGTT CGTGGCGGGA TATGTGGTCA AGAAAGACCC ACTCTATTCG
CTTGTGGTCC TGACAAGTCT GACGCTGGCA TCGATCGTGC TGTTCCTGAT GCTGACATCA
AATCAGTTTG CCTATCTGGG AGGCGCGGCG ATGCTGGGAG TGGGCTATGG CTTGACTTAT
TCCGTCATCA ACGGCCTGGC GGCCAATGAG GCTCCCGCCG GCCTTATGCC GCAATCGCTG
CTGTTATTTT CGCTCGCCTA TTCCATCGGC GTCTTCGGCT TTCCACTGAT CGCCGGCAAT
CTGATCGTTT CCTCCGGCGT GCAGACCATG CTGTACGTCG TGCTTCTGCT TGCCGTCCTG
AATTTTGCAA TCGTCCTGTT TCGTGTTGCC CACCGTGCGA CGCAAGAACG AAACAAAGCA
TCCGTAGGAG ACAATACAAG CACTCTATGA
 
Protein sequence
MSDSVAAQKV DGGEFAVLCI GGFLLSIAYG VTFLIPVLVG QRGGDEALAG LIISAATVST 
VILVILSGHI ADAIGSARAV AVSGLFLAAS ALGFAMVPSA GLSLMVVGFI LGIGWGTFYA
LGPILVAAIV EPEHRIRFFA LLSGSMMSGI GAGPIIGRIA TSWSMPIEAA FAFAFLASLA
GGALYFLLHI RLTNAGKILP HVNKISFGSA REVIGSRAIY SIVMVGIGGA IFGGLSSFQT
SYAKAHGFDY SLFFIGFTSA AILSRLFVAG YVVKKDPLYS LVVLTSLTLA SIVLFLMLTS
NQFAYLGGAA MLGVGYGLTY SVINGLAANE APAGLMPQSL LLFSLAYSIG VFGFPLIAGN
LIVSSGVQTM LYVVLLLAVL NFAIVLFRVA HRATQERNKA SVGDNTSTL