Gene Rleg2_4358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4358 
Symbol 
ID6983132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4523687 
End bp4524925 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content64% 
IMG OID643399086 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002283842 
Protein GI209551925 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00900983 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTGCCGT CAGCGCATGC CGCAAAGAGC GTGATCGCAG AAGAAAAGCA CTACCTGACG 
CGTGGCACCG GCGCCTATCG CCGCGCCAGC CTGGCGCTTT TCCTCTCCGG CTTTTCCACC
TTCTCGCTGC TCTATTGCGT CCAGCCGCTG CTGCCGATCT TCTCACAGGA ATTCTCCGTC
AGCCCGGCCG AAAGCTCGCT GTCCCTCTCG CTTTCCACCG GTTTTCTGGC AGTCGCCATT
GTCTGCGCCG CGGCCGTTTC GGAGGGTCTC GGCCGCCGCA GCCTGATGGC GCTGTCGCTG
GTCAGCGCCG CCTTGCTGAC GATCGCCACC GCCTTTGCCC CGACCTGGCA TCTGCTACTC
GTCATCCGCG CCCTGCAGGG CCTCGTTCTC GGCGGCGTGC CCGCCGTCGC CATGGCCTAT
CTAGCTGAGG AAATCGATCC GCGCGGCCTC GGCGCCACCA TGGGCCTTTA TGTCGGCGGC
ACGGCCTTCG GCGGCATGTC CGGCCGCGTG CTGACGGGCA TCTTCGCCGA ATATCTCACA
TGGCGGCCGG CGCTCTTCCT CATTGGCGCC ATCGGCCTTG CCGCCGCAAT CGGTTTCATC
GCCCTGCTGC CGCCGTCGAA GAATTTCGTC CGACGGCCGG GATTCGATCC GCGCTTTCAC
GCAAAGGCCT GGCTCAGCCA TCTCGAAAAT CCGGCGCTGC CCTTCATCTT CGCCATCGCC
TTCCTGGCGA TGGGCTCCTT CGTAACGATC TACAATTATG CCGGCTTCCG CTTGGTGGCG
CCGCCCTATG GCCTCAACCA GACCGAACTC GGCCTGATCT TTACCGTCTA TCTCTTCGGC
ATCGGCGCCT CTTCGATCGG CGGCCTCATC GGAGACCGGA TCGGGCATTT TCGCGTGCTG
CTCTTTGGTC TGGCGCTGAC CGCCGCCGGC AGCGCGCTGA CGCTCTTTGC CGCGCTGCCA
TCCATCATCC TCGGTATAAC AGTGCTGACG ACCGGCTTCT TCATGAGCCA TTCCATCGCC
AGCGGCCTCG TCGGCAAACT GGCGCGGGGC ACCAAGGGCC ATGCCTCGTC GCTTTATATG
CTCGCCTATT ACGTCGGCTC CAGCCTCATG GGCTCGGCCG GCGGCTGGTT CTTCGCGATG
GAAGGCTGGA CCGCCGTCGT CCTTTTCACG CTGGCCATGC TGGCGCTGGC CTTTATCTCC
GCTTGTGTGG CGCAGCACTT CGCACGGAGA AAAGCATGA
 
Protein sequence
MLPSAHAAKS VIAEEKHYLT RGTGAYRRAS LALFLSGFST FSLLYCVQPL LPIFSQEFSV 
SPAESSLSLS LSTGFLAVAI VCAAAVSEGL GRRSLMALSL VSAALLTIAT AFAPTWHLLL
VIRALQGLVL GGVPAVAMAY LAEEIDPRGL GATMGLYVGG TAFGGMSGRV LTGIFAEYLT
WRPALFLIGA IGLAAAIGFI ALLPPSKNFV RRPGFDPRFH AKAWLSHLEN PALPFIFAIA
FLAMGSFVTI YNYAGFRLVA PPYGLNQTEL GLIFTVYLFG IGASSIGGLI GDRIGHFRVL
LFGLALTAAG SALTLFAALP SIILGITVLT TGFFMSHSIA SGLVGKLARG TKGHASSLYM
LAYYVGSSLM GSAGGWFFAM EGWTAVVLFT LAMLALAFIS ACVAQHFARR KA