Gene Rleg_6620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6620 
Symbol 
ID8022870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp49640 
End bp50839 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content61% 
IMG OID644833489 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002984623 
Protein GI241666539 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.397819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.21382 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGAC TTTTCCCTGA CGTCTTCCGC AATCCGGCGA TCCGCGCCAG CATGATCGCC 
ATTTTCATCT TCGGCATGGC GGGAGCGATG ACCGCACCCT ACCGTTCGAT CATCGGCATC
CGCGAATTGG GGCTGAGCGA CGGCCTCTAT TCCTTCCTGA GTTTCGTCTC GGCGGCGGTG
AATGTCGTCA TCAGCATCCT GCTCGGCAAT CTCGCCGACC GGCTTGGTGA ATACCGTTCG
ACGATGATCG GCGCCTGCCT GTTCGGCATC GTCGGCTACG GCGTGGTCTA CGCCTTTCCC
AGCGCCGCCG TCTTCGTCAT CAGCGGGTTG CTGCCCCTGC CGATCTACGG GGCGCTGAAC
TCGCTGCTGT TTGCCAATGC GCGCGCGGCT ATGCACGGCA TGAACCGAAG CGACATGGTG
ACGGCCAACT CCGGCGTGCG CGCCATGATC TCGCTGTCGT GGGTACTGAT CCCAGGGATA
ACTGGCCTGC TGCTGTCTGG CGCATCGAGC ATGCTGCCGG CCTACCTCTT TGCCAGCATC
TCATGTCTGT TGTGCCAGGG GATCATCCTC TTCGCCTTGC CGAAGCGAGC GGCAACGGAA
ATGGCAGCAG TTCATCATCT CACTTACCTC GGCGCGCTTG GCCAAGTGGT TTCTCCGCGG
ATTTCGGCGC ATATTTGCGG GGTCGCGCTG ATCACCAGTA CGCTGCATCT GAATGACGCC
CTGCTGCCAT TGATCGCCAC TGGTGCTGCG CATGGCAAGC TGAGCGACGT CGGCATTCTC
GTCGGCATCG TCGCATTGCT GGAAGTCGTC TTCATCATCG TCTGGTCGCG GATCGCGCGG
AAGACAGGAC AGATGACGGC GCTTGGCGCC GGTACCATCA TCTATGCCGT CTTCCTCAGT
CTGCTTGGCT TTGCCTCCGA GCCGTGGCAC CTCTATGCGC TCACCTTGCT TGCCGGCATC
GGAGCGTCGG CGATCATCAC CATTCCGATC ACCTATCTGC AGGATCTGAT CGCCGACCGG
CCGGGCCTCG GCAGCGCACT GATCTCCGTC AATATCTTTG CCAGTGCCGG GATCGGCGCG
CTGGTCTTTG CCGCCGGCAC CTATGTGACC GGCTATTCGG GAACCGCAAT CCTCAGCGCT
GTCACCGGAT TGGCGGGGAT AGCGATCATC GGCCTCCTGC GTAGAGGCAA AGCCCGCTAG
 
Protein sequence
MSRLFPDVFR NPAIRASMIA IFIFGMAGAM TAPYRSIIGI RELGLSDGLY SFLSFVSAAV 
NVVISILLGN LADRLGEYRS TMIGACLFGI VGYGVVYAFP SAAVFVISGL LPLPIYGALN
SLLFANARAA MHGMNRSDMV TANSGVRAMI SLSWVLIPGI TGLLLSGASS MLPAYLFASI
SCLLCQGIIL FALPKRAATE MAAVHHLTYL GALGQVVSPR ISAHICGVAL ITSTLHLNDA
LLPLIATGAA HGKLSDVGIL VGIVALLEVV FIIVWSRIAR KTGQMTALGA GTIIYAVFLS
LLGFASEPWH LYALTLLAGI GASAIITIPI TYLQDLIADR PGLGSALISV NIFASAGIGA
LVFAAGTYVT GYSGTAILSA VTGLAGIAII GLLRRGKAR