Gene Rleg2_5984 details


Gene Information

Locus tag: Rleg2_5984
Symbol:
ID: 6977370
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011366
Strand:
Start bp: 401233
End bp: 402564
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 65%
IMG OID: 643393436
Product: General substrate transporter
Protein accession: YP_002278254
Protein GI: 209546364
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
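
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 401233
end_bp = 402564
gene_length_bp = 1332
protein_length_aa = 443

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
# Assumption: the last codon is a stop codon and contributes no residue.
codons, remainder = divmod(gene_length_bp, 3)
residues = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("residues match protein length:", residues == protein_length_aa)
```

Under these assumptions all three checks hold for this record: the span is 1332 bp, which gives 444 codons, or 443 residues plus one stop codon.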


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACAA CATCGCATTA CGGACCGTCG TCATCGTCGC TGGAACGCGA CGCGCGGCGT 
ATTCACGACG ACAAACCGGT TTCGCCAGGC AGCATCGCCA TCGGCGTGGT CATCGGCCGC
ATGTCGGAAT TCTTCGATTT CTTCGTCTAC GGCCTCGCCT CCGTCCTAGT CTTTCCACAG
CTGGTCTTCC CCTTCGCGCC CGACCGGCTG ACGGGCACGC TCTATTCCTT CGCGATCTTC
TCGCTCGCCT TCCTCGCCCG CCCGGTCGGC TCCGTCGTCT TCATGACGAT CGACCGCATG
TATGGGCGCG GCACCAAGCT GACGATCGCG CTCTTCCTGC TCGGCGGCTC GACGGCCTCG
ATCGCCTTTC TGCCGGGTTA CGAGGAGATC GGCGTCTGGT CGATTGCGCT GCTGGCGCTC
TTCCGTCTCG GCCAGGGTTT TGCGCTTGGC GGCGCCTGGG ACGGGCTTGC CTCGCTGCTG
GCGCTCAATG CGCCGGCCAA TCACCGCGGC TGGTATGCGA TGATCCCGCA GCTCGGCGCG
CCGATCGGTT TTGCGCTGGC AAGCACGCTG TTCGGTTATT TCGTCGCCAA TCTTTCCAGT
GAGGATTTTC TCTCATGGGG TTGGCGTTAC CCGTTCTTCG TCGCCTTCGC GATCAACGTC
GTGGCGCTGT TTGCGCGTCT GCGCCTGGTC ATGACCAAGG AATTCGGCAC GCTGCTCGAA
CAGCACGAGC TGGAAGCCGC ACCGATCCTC GACGTGCTGC GCGTTCACGG CCGCGACATT
CTGATCGGCG CCTTTGTGCC GCTCGCCAGT TTCGCCATGT TCCACCTCGT CACCATCTTC
CCGCTCGGCT GGATGAGCCT TTACGGCAAC CAGCCGATCG GCGCCTTCAT GGTGGTGCAG
GTGGTCGGCG CCATGGTCGG CATCGTCGCC ATCGTCGCCT CGGGCCTGAT CGCCGACCGC
ATCGGCCGGC GCGCCCAGCT TGCCATCTGC GCCGTCATCA TCGCCGTCTT CAGCTTCGTC
GGCCCAATCC TGATCGCATC GGGCAACAGC GGCCATGACG CCTTCGTCAT CGTCGGCTTC
GGCGTGCTCG GCCTGTCCTT CGGCCAGGCG ACCGGCTCGA TCTCGTCGCG CTTCGGCCGC
GGCTATCGCT ACACCGGTGC CGCCTTCACC TCGGACCTCG CCTGGCTGAT CGGTGCCGGC
TTCGCGCCGC TGGTGGCGCT CAGCCTCTCC AGCCGCTTCG GCCTGACCTT CGTCGGCTAC
TACCTGCTCT CCGGCGCCAT CTGCACACTG GCCGCGCTCG CCTTCAGCAG GGCGCTGGAA
CAGCGCGAAT AG
 
Protein sequence
MATTSHYGPS SSSLERDARR IHDDKPVSPG SIAIGVVIGR MSEFFDFFVY GLASVLVFPQ 
LVFPFAPDRL TGTLYSFAIF SLAFLARPVG SVVFMTIDRM YGRGTKLTIA LFLLGGSTAS
IAFLPGYEEI GVWSIALLAL FRLGQGFALG GAWDGLASLL ALNAPANHRG WYAMIPQLGA
PIGFALASTL FGYFVANLSS EDFLSWGWRY PFFVAFAINV VALFARLRLV MTKEFGTLLE
QHELEAAPIL DVLRVHGRDI LIGAFVPLAS FAMFHLVTIF PLGWMSLYGN QPIGAFMVVQ
VVGAMVGIVA IVASGLIADR IGRRAQLAIC AVIIAVFSFV GPILIASGNS GHDAFVIVGF
GVLGLSFGQA TGSISSRFGR GYRYTGAAFT SDLAWLIGAG FAPLVALSLS SRFGLTFVGY
YLLSGAICTL AALAFSRALE QRE
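
Both sequences are displayed in blocks of ten characters separated by spaces, so they need to be cleaned before any downstream use. The sketch below shows one way to do that and to compute length and GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGGCAACAA CATCGCATTA CGGACCGTCG TCATCGTCGC TGGAACGCGA CGCGCGGCGT"

fragment = clean_sequence(first_line)
print(len(fragment))                    # six blocks of ten bases
print(fragment.startswith("ATG"))       # the CDS begins with a start codon
print(round(gc_percent(fragment), 1))   # GC percentage of this fragment only
```

Applying the same two functions to the complete gene sequence should give a length of 1332 and a GC percentage close to the 65% reported in the record, assuming that figure was computed over the same region.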