Gene Rleg_5688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5688 
Symbol 
ID8016651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp270041 
End bp271297 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content60% 
IMG OID644827841 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002979041 
Protein GI241518413 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0510347 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.192354 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGATC CCGATCGGAT TGGTGAATTC TGGCGGAAAG CCGCAGGGCT TCGTGCAATC 
CTGCATGTCA TGGCGAGCAT CTGCATCGTC GCGACTGGCA ATTCTCTTCT GACGACCACG
GTTTCTCTGC ACCTCAGTGA CCCCGCAATC GATCCCCACA TCGTCCAGTT GTTGCTGACG
GCGTTTCCCG TAGGCTTCCT TGCCGGCTGC CTCTCAGCTC GTGTCATGGT CGTCCGCTTG
GGACACGAGC GGGCTTTCCT GGCCGTCGCA TTGCTCGCTG CTTTCGGCGC CTGCGGCTAC
ATGCTGACGC AAGCCGCTCC GGTCTGGTTC TGCCTGCGTC TGATAAACGG TTTTTCCATC
GCAACGCTGT TCGTCGTGTC CGAAAGCTGG ATCAATCTCT ACGCTGACCA GAAGAACCGC
GGAGCTTATT TCTCGCTCTA TATGCTGATG ACGTCGCTGG CGACCCTGTT TGCGCAATTG
CTTGTCGAAG CGGCCGGAGC GGACTCTCCC CATCTCTTTC AGATCGTGCT CGGCGTCATC
CTTCTTGGAC TGATCTACGC CCGCTTTATC GGTGGACCCT GGCCCACCTT GCGCCTGCCG
CTGGCAGTAG CGGTCGAGGC CGGCAACGCC CACTCCGGGC ATCGCTATGG CATCTGGCGG
CTCGTCGCTC TCGCACCGGT GGCTGTCGTC TGCGTCTTTC AAGCGGGCAT GACGAATATG
AATGTCTATA CGATGACGCC GATCTATGCG GAGCGGGTGC ACCTCGACGC GGCGGTGGCG
GTGACACTGG TAACCGCTTT CAGCCTGGGC GGCATGCTCG CTCAGGCCCC GGTCGGATGG
TTGTCGGATC GTATGGACCG GCGCGTTCTA CTTCTCGTTC AGGGATTGGC GGGAGCAGGA
CTGTGCGCAG CAATCGCCTG GCCCGGAAGC TATCCGCAGA TGCTTCTCTA CGGTCTGTTT
TTCGCCTATG GCGCAATTGC GCTGACGATC TATCCGGTCG GTATCGCTTA CGCTAACTCA
CAGCTCGATA GCCGCCATAT GGTCTCGGCA TCGGGTAGCC TGCTGCTCCT TTATTCGATC
GGCAACATCA TGACACCTGG GCTCGCCGCC CAGCTGATGG AGCTGTTTGC ACCGCAGGCA
CTCTTCCTTC TGCTCGGGAG CGGCGCGTTC CTGGTTGCTG TCGCTGCCTG CTTCAATCTC
TTTCGCCGTC CGATCGGCGC CACCAAACCT TGCCTTGTTT CGGGAGGAAG CGAATGA
 
Protein sequence
MRDPDRIGEF WRKAAGLRAI LHVMASICIV ATGNSLLTTT VSLHLSDPAI DPHIVQLLLT 
AFPVGFLAGC LSARVMVVRL GHERAFLAVA LLAAFGACGY MLTQAAPVWF CLRLINGFSI
ATLFVVSESW INLYADQKNR GAYFSLYMLM TSLATLFAQL LVEAAGADSP HLFQIVLGVI
LLGLIYARFI GGPWPTLRLP LAVAVEAGNA HSGHRYGIWR LVALAPVAVV CVFQAGMTNM
NVYTMTPIYA ERVHLDAAVA VTLVTAFSLG GMLAQAPVGW LSDRMDRRVL LLVQGLAGAG
LCAAIAWPGS YPQMLLYGLF FAYGAIALTI YPVGIAYANS QLDSRHMVSA SGSLLLLYSI
GNIMTPGLAA QLMELFAPQA LFLLLGSGAF LVAVAACFNL FRRPIGATKP CLVSGGSE