Gene Rleg_4226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4226 
Symbol 
ID8015903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4320206 
End bp4322092 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content62% 
IMG OID644826796 
ProductGeneral substrate transporter 
Protein accessionYP_002978005 
Protein GI241206909 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.162126 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAATG TCGCAAGCAT CGACGGCGCA AAGGCCGGTC CGATGACCGG TGAGGAGAAG 
AAGGTCATCT TCGCCTCTTC GCTCGGCACC GTTTTCGAAT GGTACGATTT CTATCTCTAT
GGTTCGCTCG CCACCTATAT CGGCGCGACC TATTTCACCC AATATCCCGA GGCAACGCGT
AACATCTTCA CGTTGCTCGC CTTTGCCGCC GGCTTCCTGG TGCGCCCCTT CGGCGCGCTG
GTGTTCGGCC GTCTCGGCGA TCTCGTCGGC CGTAAATACA CCTTCCTGAT GACGATCATG
ATCATGGGTC TGTCGACCTT CCTCGTCGGC ATCCTGCCGG GTGCCGCCAC GATCGGTATC
GCAGCCCCGA TCATCCTGAT CGCGCTCCGT CTGCTCCAGG GTCTGGCGCT GGGCGGTGAA
TATGGCGGCG CGGCCACCTA TGTCGCCGAA CATGCGCCGA ACGGGCGCCG CGGCTACTTC
ACCTCGTGGA TCCAGACGAC GGCAACGCTC GGCCTGTTCC TGTCGCTGAT CGTCATCGTC
CTGGTTCAAT ATCTGATGGG TGCGGCTCAG TTTGCCGCCT GGGGCTGGCG CATTCCGTTC
CTGGTCTCGG TCGTCCTGCT CGGCATTTCC GTCTGGATCC GCCTGAGGAT GAACGAATCG
CCGGCGTTCC AGCGGATGAA GGCAGAAGGC AAAGGCTCCA AGGCGCCGCT GACCGAGGCC
TTCGGGACGT GGAAAAATGC CAAGATCGCG ATCATCGCGC TGCTCGGCGC CACCATGGGC
CAGGCGGTCG TCTGGTACGG CGGCCAGTTC TATGCGCTGT TCTTCCTGCA GAACGTGCTG
AAGGTGGACC TGTTTTCGGC CAATGTCATG GTGGCCATCG CACTTCTCCT CGGCACGCCC
TTCTTCGTCA TCTTCGGCGG TCTCTCCGAC AAGATTGGCC GCAAGCCGAT CATCATGGCA
GGCCTTTTCA TTGCGGCGGT GACCTATAAT CCGCTGTTCA AGGCGATGAC CTGGACGGCG
AACCCGGCGC TTGCCGAAGC GCAGGCTTCG ATTCGGGCAA CGGTGACGGC CGATCCGGCT
GATTGCAGGT TCCAGTTCAA CCCGACCGGG ACGACGAAGT TCACCAGTTC CTGCGACGTG
GCAACGGCGT TCCTGACCAG GAACTCGGTG CCTTACGACG TTGTGCCCGG TACCGCCGGA
CAGCCGGCAA CGGTGAAGGT CGGCAACGCG ACGATCCCAA GCTTCGACGT CGTCGCTGCC
GGCGACAAGG CGAAGGGGAT GACCGCCGCC TTCGAAAAGA GCGTCAACAT CGCGCTCCAC
GATGCCGGCT ATCCGCTGAA CCGCGGCGCC GTCAAGGTGC CGGATGCCAA GCTCGACGCC
TTCATCGCAG CCAATCCCGA GCTGTCGCTC AACGCCGATG CCGTGCGCGC CGGCGAGAAG
GAAACCGTGC CTGCGGCCAA GCTGGTCGAG ACCAAGCTGC TGACCGCGGA TGAGGCCAAT
GGCGTCACCG ACATGACGGT CTACAATATC GCCAATGGCG GCACCTTCGC CATGGTCGCC
GATCCGGCTC GCGTTAACTG GATCGGCACG ATCGCCGTGC TGTTCGTCCT TGTCTTCTAT
GTGACGATGG TCTACGGCCC GATCGCCGCT CTGCTGGTCG AGCTTTTCCC GACCCGCATC
CGCTATACCG GCATGTCGCT GCCCTATCAC ATTGGCAACG GCTGGTTCGG TGGCCTGCTT
CCGGCGACGG CCTTCGCGAT GAGCGCTGCC GCGGGCGATA TCTACTACGG TCTCTGGTAC
CCGATCGTCT TTGCGACGAT CACGCTGGTG ATCGGCTTGA TCTTCCTGCC GGAAACGAAG
AACAGGGATA TCCACGCCAT GGATTGA
 
Protein sequence
MANVASIDGA KAGPMTGEEK KVIFASSLGT VFEWYDFYLY GSLATYIGAT YFTQYPEATR 
NIFTLLAFAA GFLVRPFGAL VFGRLGDLVG RKYTFLMTIM IMGLSTFLVG ILPGAATIGI
AAPIILIALR LLQGLALGGE YGGAATYVAE HAPNGRRGYF TSWIQTTATL GLFLSLIVIV
LVQYLMGAAQ FAAWGWRIPF LVSVVLLGIS VWIRLRMNES PAFQRMKAEG KGSKAPLTEA
FGTWKNAKIA IIALLGATMG QAVVWYGGQF YALFFLQNVL KVDLFSANVM VAIALLLGTP
FFVIFGGLSD KIGRKPIIMA GLFIAAVTYN PLFKAMTWTA NPALAEAQAS IRATVTADPA
DCRFQFNPTG TTKFTSSCDV ATAFLTRNSV PYDVVPGTAG QPATVKVGNA TIPSFDVVAA
GDKAKGMTAA FEKSVNIALH DAGYPLNRGA VKVPDAKLDA FIAANPELSL NADAVRAGEK
ETVPAAKLVE TKLLTADEAN GVTDMTVYNI ANGGTFAMVA DPARVNWIGT IAVLFVLVFY
VTMVYGPIAA LLVELFPTRI RYTGMSLPYH IGNGWFGGLL PATAFAMSAA AGDIYYGLWY
PIVFATITLV IGLIFLPETK NRDIHAMD