Gene Rleg_1064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1064 
Symbol 
ID8012193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1037742 
End bp1038968 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content66% 
IMG OID644823647 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002974898 
Protein GI241203802 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.391824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAAT CCTCCCGCTT CCGCTCGGCG CAGACGGTCA CCGTCCTCGC CGTTACCCAG 
CTGATCGGCT GGGGCACGAC GTTCGACATG CTCGGGGTCA TGGGCCGTGT CGTCGCGCCG
GATCTCGGCC TGGCGAACGA AGTGGCGTTT GCCGGCCTGA CGATCATGAT GGTGGTCAGC
GCCATCGTCG GTCCGGCGAC CGGCCGATGG CTCGGCCGCT ATGGTGCTGC CCGTGTGCTT
TCGGCCGCCT CGCTGACCTT TGCGCTCGGG CTGCTTCTGC TTGCCGCCGC AAACGGCATC
GTGCTCTATG CCAGCGCCTG GGTCATCATC GGCATCGGCG GCGCATTTGG CCTCTCGGCG
CCGGCCTATA CCGCCGTCGT CGAGCGCGAA GGAGCAAACG GCAAACGCGT CATCGCCATC
CTGATGCTGT TCACCGGGCT TTCGAGCGCC ATCTTCTGGC CGATCCTCAG CCTGCTCAAC
GAGGCGGTCG GCTGGCGCCT CACCTTCCTG GTCTGTGCGG CGCTGCAATT CTTCGTCTGT
CTGCCGCTGC ATCTCTTAGG CCTGCCGAAG CCGATCGCAA CACATGTCGA AGGCGGCACA
GCCGAAATCG CTCCGGTGCC GCTGTCGAAA GCCAAGCAGC GAAAAGCCTT CCTGCTGATC
GCCGCGGCGA CGACGATCTC GACCTTCGTC ACCTTCGGAA TCTCGCCATC ACTGCTCGAA
ATCTTCCGCC AGTCCGGCGC CTCGCCGGCC TTTGCGCTGC AGCTCGGCTC GGCACGCGGC
GTCCTCGGCA TCTCTGCACG TTTCCTCGAC ATGCTGCTCG GCCGGCACGG CAACCCCATG
CTCAGCGCGG TCATGGGCAT CAGCCTGATG ATGATCAGTT TCCTGATGAT GCTGGTTGCC
AGCCCGTCGA CGCCGCTGCT TGTCACCTTC GTCCTGTTTT ACGGGTTCGG CACCGGGGTC
ATGACCGTCG CCCGCGCGCT GCTACCGCTG GCGCTGTTCT CACCGCGCGA ATTCGGACTG
CAATCGGCCC GGCTGTCGCT GCCGCAGAAC CTCGCCAACG CCATCGCCCC CGTCATCTTC
ACCGCCATCC TCGATCGCGC CGGCACCGGC CCGGCGCTCG CCGCCTGCGC CGTTCTCGCG
GCCTTGTCGC TGGCCTTCGT GCTGATGCTG ATGGCGCTGG TGCGCGGTGC CCGCGCATCA
GAGTCAGCCA TTCTTAATGT CTCTTGA
 
Protein sequence
MPKSSRFRSA QTVTVLAVTQ LIGWGTTFDM LGVMGRVVAP DLGLANEVAF AGLTIMMVVS 
AIVGPATGRW LGRYGAARVL SAASLTFALG LLLLAAANGI VLYASAWVII GIGGAFGLSA
PAYTAVVERE GANGKRVIAI LMLFTGLSSA IFWPILSLLN EAVGWRLTFL VCAALQFFVC
LPLHLLGLPK PIATHVEGGT AEIAPVPLSK AKQRKAFLLI AAATTISTFV TFGISPSLLE
IFRQSGASPA FALQLGSARG VLGISARFLD MLLGRHGNPM LSAVMGISLM MISFLMMLVA
SPSTPLLVTF VLFYGFGTGV MTVARALLPL ALFSPREFGL QSARLSLPQN LANAIAPVIF
TAILDRAGTG PALAACAVLA ALSLAFVLML MALVRGARAS ESAILNVS