Gene Rleg2_6133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6133 
Symbol 
ID6983206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011370 
Strand
Start bp66767 
End bp68425 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content59% 
IMG OID643399152 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002283908 
Protein GI209551992 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.376404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0270764 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACATG TCGAAACCAT GACTCAGGCT CAGGGGATAT CCCGCAGAGA CAGGAAGGTT 
ATCCTTGCAG CTTCTCTGGG GACGGTTTTT GAGTTTTACG ACTTCTTTCT AATCGGACTT
GTCGCCACCG AAATCGCCAA GGCGTTTTTC TCGGGCGTCA ATCCGACAGC GGGCTTCATC
TTCACCCTCT TGGGTTTCGC CGCTGGCTTC ATGCTGAGGC CATTCGGCGC GATTGTGTTC
GGACGTCTCG GCGACCTGGT GGGCCGGAAG TACACGTTCC TCGTCACGAT CGTTCTCATG
GGCGGCTCGA CGTTCCTGAT CGGGCTTCTG CCGGCTTACG CGACGATCGG GGTGGCGGCG
CCAATCGCAT TCGTCGCCAT GAGAATGCTT CAGGGCCTGG CGCTCGGAGG CGAGTTCGGG
GGCGCCATGG TGTACGTGGC GGAACATGCT CCTTCGGATA GACGTGCGAC CTATACTGCC
TGGATCATCA TGACGGCGGC GATCGGCTTC CTGCTCGCGG TAGCGGTAAT CATCCCTCTC
CGCTTGGCTT TGGGAGCGGA CGCGTTCGCA CTCTGGGGAT GGCGCGTTCC GTTCATTATC
TCGATCGTTC TGCTGGGCGT GTCCCTGTGG ATCAGACTTA GGCTCGACGA ATCGCCCGAG
TTCAAGCGGA TGAAGGCGGA GGGCAAGGCT TCGAAGTCTC CTCTGGCGGA GACCTTCGGA
ACCTGGAGAT ACGTCAAGGT CATCATTGTC GCGGCCCTCT GCATCCTGCC GGCTCAGGCA
GTGATCTGGT ATACGGGACA ATTCTACACG CTGTTCTTCC TTACCAAGGT CCTCAAGGTT
GAGAACCTTT CCGCAAACAT GATGCTCATC ATCGCCACCG TGTTAACCGC GCCCCTATAC
GTCGTTTTCG GAAAACTCTC CGATAGGATT GGACGTAAAC CTGTTTACAT CGCGGGTTAC
CTCCTCGCAG CTCTGGTAAC CATCCCGACA TTCCACGGAC TGACGCACTT TGCCAATCCT
GCATTGGAAC GTGCGCAGGC GAACACTCCG ATCACGATTG TTGCTGATCC CAATGACTGC
TCGTTCCAGT TCAATCCCCT CGGGACGTCG AAATTCACTA CCTCATGCGA CGTTGGTATC
AACGCTGTCG CGAACCTCGG CTTGAACTAT CAAAGCCAGG ACGCCGCCGC GGGGACGGTT
GCATCGGTTA AGGTGGGAGA CCGCGTCATC GCGAGCTACG CCGCCGATGC TGCGGATGCG
GCTTCTCAGA AGACGAGATT GGAAGCGGAA CTGAAGCAGG CCCTGGCAGA GGCTGGGTAC
CCGGTTGGAA GCGCCGACCC CGAAAGTGTG AACAGCCCTG CGATCATAGC GTTGCTTTGC
GTGCTTCTGG CGCTCGGCGC CATGGTTTTC GCGCCGACGA CGACCTCGCT ACTTGAGATG
TTCCCTTCCC GGATTCGGTA TACGGCGATG TCCTTCCCCT ACCATCTCAG CGCGGCGTGG
TTCGGCGGCT TCCTGCCAGC AACGGCGTTT GCGATCGTCG CTGCCACCGG CAACGTGTAC
TCGGGGCTTT ATTACCCGGT TAGCATCGCG GCGGCCTGCA TGGTCTTGAG CCTGCTCTTC
GCACGGGAGA CGCGCGGGAC GGACATCTCC AAGGGCTGA
 
Protein sequence
MTHVETMTQA QGISRRDRKV ILAASLGTVF EFYDFFLIGL VATEIAKAFF SGVNPTAGFI 
FTLLGFAAGF MLRPFGAIVF GRLGDLVGRK YTFLVTIVLM GGSTFLIGLL PAYATIGVAA
PIAFVAMRML QGLALGGEFG GAMVYVAEHA PSDRRATYTA WIIMTAAIGF LLAVAVIIPL
RLALGADAFA LWGWRVPFII SIVLLGVSLW IRLRLDESPE FKRMKAEGKA SKSPLAETFG
TWRYVKVIIV AALCILPAQA VIWYTGQFYT LFFLTKVLKV ENLSANMMLI IATVLTAPLY
VVFGKLSDRI GRKPVYIAGY LLAALVTIPT FHGLTHFANP ALERAQANTP ITIVADPNDC
SFQFNPLGTS KFTTSCDVGI NAVANLGLNY QSQDAAAGTV ASVKVGDRVI ASYAADAADA
ASQKTRLEAE LKQALAEAGY PVGSADPESV NSPAIIALLC VLLALGAMVF APTTTSLLEM
FPSRIRYTAM SFPYHLSAAW FGGFLPATAF AIVAATGNVY SGLYYPVSIA AACMVLSLLF
ARETRGTDIS KG