Gene Rleg_5071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5071 
Symbol 
ID8007664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp454239 
End bp455696 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content62% 
IMG OID644821986 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002973246 
Protein GI241113411 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.706796 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.279172 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGACA ACACAATTAC GCTGGCCGCA TCCGCGACCG CAGATGGGCT TGAGGCCCGC 
GATCCGAGCC CGGCATCCGA CAGCGGTCGC GCCGCAGCGC TGGCTGCCCG TTTCGAAGCC
ATTCCCTTTA CACCCTGGCA CCGCCGGGCG CGGATCGTCA TGGGGAGCGC CACGTTTCTC
GATGCCTTCG ATGCCCTTTC GCTGGCATTC GTCCTTCCGA TCCTGATCAA GCTATGGGAG
CTTTCTCCGG CCCAGATCGG CTGGATGATC GCCGCCAGCT ACATCGGCCA ATTGCTCGGC
GCCCTGCTTT TCAGCCGTCT TGCCGAAAGT TTCGGCCGCG TCCCCATGGC GGCAGCGGCC
ACAGCGCTGA TGTCGGTCAT GGGCCTCGCC TGCGCGCTCA CCGGCAATTT TCAGATGCTT
TTCCTGTGCC GGCTGATCCA GGGTATTGGC GTCGGAGGAG AAATGCCGGT CGCCGCCACC
TATATCAGCG AACTTTTGCG GGCGAAGGGA CGCGGCCGCT ATTTCATGCT GTATGAGATG
ATCTTTCCGG TCGGGCTGAT GATCACCGGC CAGGTTGGGA CCTTGCTCGT GCCGATCCTC
GGATGGACAT CGCTGTTCTT CATCGGCGGC ATTCCCGGTC TCGTCATCGC CTATCTGCTT
TATCGGCTGC CGGAATCGCC GCGCTGGTTG ATCGGCCAGC AACGCCTGGA CGAAGCGGAA
GCGATCATCC TTCAGGCTGA GGCGAGCGCA AGAAAGGCAA ACCTCGACTA TCAACAGGAG
CCGCAACCGG CGCCTGAAGC GCCGGCTGCA GCGCCCGCCC CCGCACAGGT CGCCCGGCAG
CCGAGAAGCC GCTGGAGCGA GCTGCTGTCT CCACGCTTCC GCGCCAGGAC GTTGATCGCC
TGGGTCCTTT GGGCGAGTTC GTTCTTCGTC GCCAACAGCC TCAACAACTG GATGCCAACA
CTCTATCACA CCGTCTACAA GCTCGAGCTC GGAAGCGCTC TCCGGGCGGC GTCGATGACC
AATGTCGCCC AGGTCGCCAT CCTTCTGGTT TGCGCTTTCT GCATCGACCG CATCGGTCGA
AGAACTTGGG CCATTGTATC CTTCCTGGTC GGCGCGGCAC TGTTGACCGC TTTGGCTGCC
GGTGGCGCAG GCCAGCTCTG GTCGTTGATC GTGCTGGCAA CGCTTGCCTA TGGCGTTGTC
GGGTCGATCA ATGCGGTCTT GTATCTGTAC ACGCCGGAGA TCTATCCGAC GCGGATGCGC
GCTATCGGAA CGGGTCTTGT GACATCATGG CTGCGCATAG CCTCGGCCGT CGGGCCGACC
ACGGTCGGTT ATATGATGGG GACGCAGGGC ATCAATTCGG TGTTCATGAT GTTCGCGATC
GTCGCCGCCA TCGGTGCGGT TGCGGCTATC GGAATGATCG AGACCGGCGG ACGTCGACTG
GAAGAAACCT CATCATAA
 
Protein sequence
MHDNTITLAA SATADGLEAR DPSPASDSGR AAALAARFEA IPFTPWHRRA RIVMGSATFL 
DAFDALSLAF VLPILIKLWE LSPAQIGWMI AASYIGQLLG ALLFSRLAES FGRVPMAAAA
TALMSVMGLA CALTGNFQML FLCRLIQGIG VGGEMPVAAT YISELLRAKG RGRYFMLYEM
IFPVGLMITG QVGTLLVPIL GWTSLFFIGG IPGLVIAYLL YRLPESPRWL IGQQRLDEAE
AIILQAEASA RKANLDYQQE PQPAPEAPAA APAPAQVARQ PRSRWSELLS PRFRARTLIA
WVLWASSFFV ANSLNNWMPT LYHTVYKLEL GSALRAASMT NVAQVAILLV CAFCIDRIGR
RTWAIVSFLV GAALLTALAA GGAGQLWSLI VLATLAYGVV GSINAVLYLY TPEIYPTRMR
AIGTGLVTSW LRIASAVGPT TVGYMMGTQG INSVFMMFAI VAAIGAVAAI GMIETGGRRL
EETSS