Gene Rleg_4629 details


Gene Information

Locus tag: Rleg_4629
Symbol:
ID: 8015375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4751875
End bp: 4753374
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 63%
IMG OID: 644827204
Product: major facilitator superfamily MFS_1
Protein accession: YP_002978404
Protein GI: 241207308
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
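
The gene and protein lengths listed above follow from the start and end coordinates. A minimal Python sketch of that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein (the function names are illustrative and not part of the database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4751875, 4753374)
print(length, protein_length(length))  # prints "1500 499"
```

Both values agree with the Gene Length and Protein Length fields of this record.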


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.458536
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.277274
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGACGT CAGCGCATGC AATGAAGACC GCCGAGACAG CCGCAATCCC AGGGCAGAAG 
CAATACCTGA CCCGCGGAAC CGGCGCTTAT CGCCGCGCCA GCCTGGCACT CTTCCTGTCT
GGCTTCTCCA CCTTCTCGCT GCTCTATTGT GTGCAGCCGC TGCTGCCGAT CTTTTCGCAG
CAATTCTCCG TCAGTCCGGC CGAAAGCTCG CTGTCGCTCT CGCTCTCCAC CGGCTTCCTG
GCGGTCGCCA TCGTCTGCGC CGCTGCCGTC TCGGAAGGCC TTGGCCGCCG CAGCCTGATG
TCGATATCGC TGGTCGGCGC GGCATTGCTG ACCATCGCCA CCGCCTTTGC CCCGAACTGG
CACCTTCTGC TCGTCATCCG CGCCCTGCAG GGCCTCGTTC TCGGCGGCGT GCCTGCCGTT
GCCATGGCCT ATCTCGCCGA GGAAATCGAC CCGCGCGGAC TTGGCGCCAC CATGGGCCTC
TATGTCGGCG GCACGGCCTT CGGCGGTATG TCCGGCCGCG TGCTGACCGG CATCTTTGCC
GAATATCTCA CCTGGCGTCC GGCGCTTTTT CTCATCGGCG CCATCGGTCT TGCCCCCGCG
ATCGGCTTCA TCGCCCTCCT GCCGCCGTCA CGCAATTTCG TCCGCCGGCC GGGCTTCGAT
CCGCGCTTTC ATGCAAAAGC CTGGCTCGGC CATCTCAGCA ATCCGGCGTT GCCCTTCATC
TTCGCCATCG CCTTCCTGGC AATGGGCTCC TTCGTGACGA TCTACAATTA TGCCGGTTTC
CGCCTTGTGG CGCCGCCCTA CGGCCTCAAC CAGACCGAAC TCGGCCTGAT CTTCACCGTC
TATCTCTTCG GCATCGGCGC CTCCTCGATC GGCGGCCTGC TCGGGGACAG GATCGGGCAC
TTTTCCGTGC TTCTCTTCGG TCTGGCACTC ACCGCCGCCG GCAGCGCGTT GACGCTCTTT
GCCTCGCTCC CGGTTATCAT CCTCGGTATA ATTGTGCTCA CGACCGGCTT CTTCATGAGC
CATTCGATCG CCAGCGGCCT TGTCGGCAAG CTGGCGCATG GCACCAAGGG CCATGCCTCG
TCGCTCTATA TGCTCGCCTA TTATGTCGGC TCCAGCCTCA TGGGTTCGGC GGGCGGCTGG
TTCTTCGCGG TTGAAGGCTG GGTCGCTGTC GTTATCTTCA CGCTAGCCAT GCTGGGGCTG
GCCTCTCCGC CTGTTTTGCC CAGCAATTCG CGAGGAGAAA AGCATGATCC GCATAGACCG
TCTCGACCAT CTCGTGCTGA CCGTCGACGA TATCGCCATC AGCTGCGATT TTTATTCCCG
CATCCTCGGC ATGTCGGTCG AAACCTTTGC GGAGAGCCGC AAGGCGCTGA AATTCGGCGG
GCAGAAGATC AACCTGCACC AGGCCGGCCG CGAATTCGAT CCCAAGGCGC GACATCCCAC
ACCCGGCTCC GGCGACCTCT GCTTCATCGC CGAGACACCG CTTGCCGATG TCATCGCTGA
 
Protein sequence
MPTSAHAMKT AETAAIPGQK QYLTRGTGAY RRASLALFLS GFSTFSLLYC VQPLLPIFSQ 
QFSVSPAESS LSLSLSTGFL AVAIVCAAAV SEGLGRRSLM SISLVGAALL TIATAFAPNW
HLLLVIRALQ GLVLGGVPAV AMAYLAEEID PRGLGATMGL YVGGTAFGGM SGRVLTGIFA
EYLTWRPALF LIGAIGLAPA IGFIALLPPS RNFVRRPGFD PRFHAKAWLG HLSNPALPFI
FAIAFLAMGS FVTIYNYAGF RLVAPPYGLN QTELGLIFTV YLFGIGASSI GGLLGDRIGH
FSVLLFGLAL TAAGSALTLF ASLPVIILGI IVLTTGFFMS HSIASGLVGK LAHGTKGHAS
SLYMLAYYVG SSLMGSAGGW FFAVEGWVAV VIFTLAMLGL ASPPVLPSNS RGEKHDPHRP
SRPSRADRRR YRHQLRFLFP HPRHVGRNLC GEPQGAEIRR AEDQPAPGRP RIRSQGATSH
TRLRRPLLHR RDTACRCHR
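
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a hedged illustration in plain Python, not the tool the database itself uses. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ only in permitted start codons). The helper names `clean`, `translate` and `gc_fraction` are assumptions made for this example.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; trailing bases beyond a full codon are ignored."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 60 bases of the gene sequence shown above.
first = "ATGCCGACGT CAGCGCATGC AATGAAGACC"
last = "ACCCGGCTCC GGCGACCTCT GCTTCATCGC CGAGACACCG CTTGCCGATG TCATCGCTGA"

print(translate(first))             # MPTSAHAMKT
print(translate(last).rstrip("*"))  # TRLRRPLLHRRDTACRCHR
```

Passing the full 1500 bp gene sequence to `translate` and stripping the final `*` should give the 499-residue protein listed above, and `gc_fraction` on the same input can be compared against the GC content field.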