Gene Rleg2_1174 details


Gene Information

Locus tag: Rleg2_1174
Symbol:
ID: 6979894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1184693
End bp: 1185922
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 61%
IMG OID: 643395887
Product: major facilitator superfamily MFS_1
Protein accession: YP_002280694
Protein GI: 209548777
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
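
The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but that reading reproduces the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(1184693, 1185922)
print(gene_len)                  # 1230
print(protein_length(gene_len))  # 409
```

Both results match the Gene Length and Protein Length fields of this record.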


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.148034
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0846365
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAATC AGGAGGGAAC AGCTGCAGCA GTCATGCCGG CGACATACCG CCGTATTCCG 
GCCGGCATTT GGGCGCTCGG TTTCGTCTCG ATGCTGATGG ACATCTCTTC CGAGATGATC
CATGCGCTCC TCCCGGTTTA CATGGTCTCG GTGCTTGGCA TCTCCATGTT CGCGGTCGGC
GTCATCGAAG GCATTGCCGA GGCAACGGCA TCGATAACCA AGGTATTCTC GGGGGCTTTG
AGCGACTGGC TCGGTAGACG CAAGTTTCTC GCAGCACTAG GTTATGGCCT TGCCGCGGTC
ACCAAGCCGA TCTTTCCACT CGCCTCTTCT CTCGACTGGC TTATTGCGGC ACGATTTGTC
GACCGCGTCG GCAAAGGGAT CCGCGGTGCG CCGCGGGATG CACTTGTTGC TGACATCGCT
CCTCCTGAAC TGCGCGGAGC GAGCTTCGGA CTGCGCCAGT CGCTCGACAC TGTGGGCGCC
TTTGTCGGCC CCCTCCTGGC GATCGGTCTG ATGTGGCTGA CAGCGGATCA TTTCCAAAGG
GTGTTGTGGA TTGCGGTCCT TCCCGCCTTC CTGTCTGTCG GTGTGCTGCT GTTCGTCGTC
AAGGAGCCCG AGCGACCGCG GGAGTTTCGC CACGTGCGCA TGCCGCTCCA CAGGGATGAA
CTGGGCCGTC TCGGTAGATC CTATTGGTGG GTCGTGGCGG TCGCCGCAGC ATTTACGCTA
GCCCGTTTCA GCGACGCGTT CCTCATCCTG AAGGCACAGT CGATCGGCCT GCCGATAGCC
TTGGTGCCGC TTGCGCTGGT CCTTATGAGT CTAGCTTATT CGCTCTCGGC TTATCCCGCC
GGCATGCTCT CAGACAAAAT GGATCGGTTC ACCATTCTTG CTATCGGTCT CGTGTTGCTT
GTCTGCGCCG ATCTTACCCT GGCGTTCGCA CAGAGTGTCA TCGGCGCCGG ACTCGGTGTC
CTCCTCTGGG GTCTGCACAT GGGGTTCACG CAGGGGCTGC TGACGAAGGT GATTGCCGAT
ACATCGCCTG CTGAACTGCG TGGCACAGCC TTCGGCATGT TCAATCTGAT CACCGGGCTG
GCTCTGCTGC TTGCCAGTGT CATCGCGGGC ACGCTTTGGG ACCTCGCCGG GCCGCGAGGA
ACCTTCCTCG CCGGCGCCGG GTTCGCAATG CTGACTATGA TCGGGCTGCT CGTCGTCCGC
ACGCGACTCT CTACACAGGC CGGTGCCTGA
 
Protein sequence
MANQEGTAAA VMPATYRRIP AGIWALGFVS MLMDISSEMI HALLPVYMVS VLGISMFAVG 
VIEGIAEATA SITKVFSGAL SDWLGRRKFL AALGYGLAAV TKPIFPLASS LDWLIAARFV
DRVGKGIRGA PRDALVADIA PPELRGASFG LRQSLDTVGA FVGPLLAIGL MWLTADHFQR
VLWIAVLPAF LSVGVLLFVV KEPERPREFR HVRMPLHRDE LGRLGRSYWW VVAVAAAFTL
ARFSDAFLIL KAQSIGLPIA LVPLALVLMS LAYSLSAYPA GMLSDKMDRF TILAIGLVLL
VCADLTLAFA QSVIGAGLGV LLWGLHMGFT QGLLTKVIAD TSPAELRGTA FGMFNLITGL
ALLLASVIAG TLWDLAGPRG TFLAGAGFAM LTMIGLLVVR TRLSTQAGA
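
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own tooling: it strips the block spacing, computes a GC fraction, and translates a coding sequence. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard table (the two differ only in permitted start codons). All function names are hypothetical.

```python
from itertools import product

# Codon-to-residue assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
RESIDUES = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), RESIDUES)
}


def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = clean_sequence(
    "ATGGCCAATC AGGAGGGAAC AGCTGCAGCA GTCATGCCGG CGACATACCG CCGTATTCCG"
)
last_line = clean_sequence("ACGCGACTCT CTACACAGGC CGGTGCCTGA")
print(translate(first_line))  # MANQEGTAAAVMPATYRRIP
print(translate(last_line))   # TRLSTQAGA
```

The two printed fragments agree with the start and end of the protein sequence above. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.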