Gene Rleg_1035 details

Gene Information

Locus tag: Rleg_1035
Symbol:
ID: 8012165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1013808
End bp: 1015208
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 67%
IMG OID: 644823618
Product: major facilitator superfamily MFS_1
Protein accession: YP_002974869
Protein GI: 241203773
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCTTT CCAAATCGCG AAGCGGAGCG GGGCTGGCAT TGCTGCTGCT CTGCGCCGCC 
AATTTCCTCG ACGCCATGGA CGTCTCCACC ATCGGCGTTG CGCTCCCCGC CATCCAGGCC
GAACTCGGCA TGGAGGCGAC CTCGTTGCAA TGGGCCGTCA GCGCCTATGT GCTCGGCTAT
GGCGGCTTCC TGCTGCTCGG CGGCCGGGTC GCCGATGTCT TCGGCCATCG CCGCGTCTTT
CTCTGGTCGC TGGCGATCTT TGCCGCCGCC AGCATCGCCG GCGGCTTCGT CAACAGCGGC
CCGACGCTGA TCGCCGCCCG ATTGGTCAAG GGCATTGCCG CCGCCTTCAC CGCGCCCGCG
GCGCTTGCGC TGCTGCTCTC CGTCTTCGGC GAGGGAACGG CACGGGCAAA GGCGCTCGGG
GTCTTCTCCT CCACCGGCGC CGCCGGCTTC GTGCTCGGCA TGGTGCTCGG TGGTGCTGCG
ACGATTATCA GCTGGCGGGC GACATTGGTC ATGGGCGCGC CCGTCGCCAT CCTGACGCTT
TTTGTTGCGC CACTGGTGCT GCCCGCCGAC CCGAAAAGGA CGGGACCGCG GCCGGCCTTC
GACTGGGCCG GCGCGCTGAC GATCACCCCT GGGCTGCTGC TCTTCGTCTT CGGCATCACC
AATGCTGCGG CCGCCGGCTG GCAGGCTTTT GCGACCTGGG GTTCGCTCGT CGCGTCGCTG
GCGCTGATCC TGCTCTTCCT CGTGGTCGAA GCGCGCCATG CCGATCCAAT GGTGCCGCTC
GGCATGTTCC GGCGGGCCAA GCTCCGGCAT GCCAATGCGA TTGCCGCCCT CTTCCAGGGC
GCCTATGTCG GCTTCCAGTT CCTGGCGACG CTCTATTATC AGAACGTCCT CGGCTGGTCG
GCCTTCACCA CCGGCTTCTG TTTTGCGCTT GGCGGTGTCT TCGTGATGTT CCTGGCGCCC
CGCTTTGCGA CGCTTGCGCA AAATCGCGGC GCCACCGGGC TGATGGCAGT GGGGGTCGGT
CTGCAGGCGT TCAGCTACAT CTTCTGGGTG ACCGCACTTG GACATGTCGA TCCGATCCTG
CTCGTGCTGT TCTCGCAGAT CCCGCTCGGC CTCGGTTACG CCCTGACCTA CCCCTCGGTG
CAGGTTGCGG CCCTTTCCGA TGTGGAGGAT GACAAGGCGG GACTCGCCTC CGGACTGCTC
TTTGCCTCAT TCCAGATCGG CGGCGGCATC GTGCTTGCGG CCGCCTCGGC AGTCTTCGGC
GCCGCACCGC ATTTCGGCTG GAACCCCTAT GTCGCCGGCA TCGCCTTCGT GGCGCTGCTT
GCGGTGGCGA TTACCCTGCT TGCCGCTGCC GGTCCGCGGA CATCGGCCGC CCGGAGACCC
AGCTATCAGG CGGCGGAATA A
 
Protein sequence
MTLSKSRSGA GLALLLLCAA NFLDAMDVST IGVALPAIQA ELGMEATSLQ WAVSAYVLGY 
GGFLLLGGRV ADVFGHRRVF LWSLAIFAAA SIAGGFVNSG PTLIAARLVK GIAAAFTAPA
ALALLLSVFG EGTARAKALG VFSSTGAAGF VLGMVLGGAA TIISWRATLV MGAPVAILTL
FVAPLVLPAD PKRTGPRPAF DWAGALTITP GLLLFVFGIT NAAAAGWQAF ATWGSLVASL
ALILLFLVVE ARHADPMVPL GMFRRAKLRH ANAIAALFQG AYVGFQFLAT LYYQNVLGWS
AFTTGFCFAL GGVFVMFLAP RFATLAQNRG ATGLMAVGVG LQAFSYIFWV TALGHVDPIL
LVLFSQIPLG LGYALTYPSV QVAALSDVED DKAGLASGLL FASFQIGGGI VLAAASAVFG
AAPHFGWNPY VAGIAFVALL AVAITLLAAA GPRTSAARRP SYQAAE
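
Consistency check (illustrative sketch)

The coordinates, gene length, and protein length listed above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original record. The helper names (clean_sequence, gc_fraction, span_length, expected_protein_length) are hypothetical. It assumes the start and end coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in 10-character blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Coordinates taken from the Gene Information section above.
gene_length = span_length(1013808, 1015208)
protein_length = expected_protein_length(gene_length)

# First line of the gene sequence, used here only to show the cleaning step.
first_line = "ATGACTCTTT CCAAATCGCG AAGCGGAGCG GGGCTGGCAT TGCTGCTGCT CTGCGCCGCC"
first_60 = clean_sequence(first_line)
```

With the values from this record, gene_length evaluates to 1401 and protein_length to 466, matching the listed Gene Length and Protein Length. Passing the full gene sequence through clean_sequence and gc_fraction would give a figure to compare against the listed GC content.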