Gene Rleg2_6117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6117 
Symbol 
ID6983190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011370 
Strand
Start bp44137 
End bp45795 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content50% 
IMG OID643399139 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002283895 
Protein GI209551979 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.154468 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.620567 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACAAG TAGACATACT GCCAAAAACA GAGGGGATCT CCTCAAACGA GCGACGCGTC 
ATTGTTGCTG CATCACTCGG AACAGTTTTT GAATTCTACG ACTTTTTTCT CATTGGATTG
TTAGCTAATG AAATTTCGAA AGCATTTTTT TCCGGCGTAA ACCCAACAGC TGGTTTCATC
TTTACGCTTC TCGGCTTTGC GGCAGGCTTT TTGTTAAGGC CGTTCGGGGC GATCGTGTTT
GGTCGCCTTG GTGACATGGC AGGGAGAAAA TATACGTTTC TGGTGACGAT ATTGCTGATG
GGCATATCGA CTTTCACAAT AGGTCTACTA CCGGCCTATT CTACGATAGG CCTTGCGGCA
CCTCTTGGGT TTGTGGCGAT GCGGATGCTG CAAGGCCTCG CTCTTGGTGG AGAGTTCGGC
GGTGCGCTAA TCTATGTTGC CGAACACGCG CCTGCGAACC GAAGGGCGGC CTGGACGGCC
TGGGTGATAT TGACGGCCGC GCTTGGATTT CTCTTGGCAG TCGCTGTCAT CATTCCTCTA
AGGCTGGCAA TTGGCGCTGA TGCCTTCTCT CTTTGGGGAT GGCGCGCCCC CTTTCTTGTT
TCAATCCTAC TGCTCGGAGT TTCTTTGTGG ATTCGATTGA AATTGGACGA AACTCCCGAG
TTCATAAGGA TGAAGGCGGA GGGAAAGGCA TCTAAAGCCC CAATCTCGGA AACGCTTGGA
ACGTGGAAAA ACCTCCGCCT TGTGCTAATC GCTGCGCTCT GCATCGTTCC GGGGCAGGCG
GTTGTATGGT ACACTGGCCA ATTCTACTCG TTGTTCTTTT TAACTAAAGT GTTGCGGATC
GAAAATCTGA CAGCAAATTT TCTGCTGATC GCTGCGACGA TCATCACGGC CCCTCTTTAC
GTTGTCTTTG GTGCGTTGTC TGACCGTATC GGTCGCCGGC CAGTTTACGT GGCTGGTTTC
CTGCTTGCAG CTGTATTTAC GGTCCCCCTT TTCAAAGCTC TTACGCACTA CGGCAACCCG
ACACTCGAAC AAGCGCAAAT TAATGCGCCC ATCACAATTG TATCAGGGAG TGACGCTTGT
TCAGTACAAT TCAATCCGCT GGGCACCGCA AAACCAATCA CATCTTGCGA CATCGTGGTC
GACGCGATCG CAAAACTTGG TCTGAATTAC AATAGTGCAC ACTCAGCAGA GTCAGCGACT
ACAATCGTGA AGATCGGCGA CCGTGAGGTT CCTGGATACT CCGCCGATAC ATCCGACGTT
TCCGTTAAAA AAACACGGTT TGAATCGGAA CTGAAGACAG CATTGACCGA TATGGGCTAT
CCCTTAGGAG AAGCCGCCCA TGAAGACATC AATCAAACTA TGATCGTCGT TCTATTGTCC
ATCCTTTTAT GCTTTGGAAC GATGACGTTC GCGCCTTCGA CAACTGCTCT ACTCGAAATG
TTCCCTTCGC GGATACGGTA TACTGCCATG TCATTTCCCT ATCACCTAAG TGCAGCGTGG
TTTGGTGGGT TCCTACCCGC GACAGCGTTT GCCATCGTTG CGTCCACCGG CAACATTTAT
TCTGGGCTTT ATTATCCGGC GTGCATCGCT GCAGCTTGTA TAGTCTTGAG CACTATCTTT
GCGAACGAGA CAAAAGGAGC GGATCTCTCC GGAGATTGA
 
Protein sequence
MVQVDILPKT EGISSNERRV IVAASLGTVF EFYDFFLIGL LANEISKAFF SGVNPTAGFI 
FTLLGFAAGF LLRPFGAIVF GRLGDMAGRK YTFLVTILLM GISTFTIGLL PAYSTIGLAA
PLGFVAMRML QGLALGGEFG GALIYVAEHA PANRRAAWTA WVILTAALGF LLAVAVIIPL
RLAIGADAFS LWGWRAPFLV SILLLGVSLW IRLKLDETPE FIRMKAEGKA SKAPISETLG
TWKNLRLVLI AALCIVPGQA VVWYTGQFYS LFFLTKVLRI ENLTANFLLI AATIITAPLY
VVFGALSDRI GRRPVYVAGF LLAAVFTVPL FKALTHYGNP TLEQAQINAP ITIVSGSDAC
SVQFNPLGTA KPITSCDIVV DAIAKLGLNY NSAHSAESAT TIVKIGDREV PGYSADTSDV
SVKKTRFESE LKTALTDMGY PLGEAAHEDI NQTMIVVLLS ILLCFGTMTF APSTTALLEM
FPSRIRYTAM SFPYHLSAAW FGGFLPATAF AIVASTGNIY SGLYYPACIA AACIVLSTIF
ANETKGADLS GD