Gene Rleg2_5852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5852 
Symbol 
ID6977241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp264608 
End bp265867 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content59% 
IMG OID643393307 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002278125 
Protein GI209546235 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC TAATAATCTC GACGCTCTTT GCTTCGATGA TGGCGGGTAC GGCCTTTGCC 
GATACGACGC TGAAGCTTGT CGAAGTCATC ACCAGCCCGG AGCGCACCGA AACGCTGAAA
TCGATCGTCG GCAAGTTCGA GGCGGCCAAT CCCGGCACCA AGGTCGACAT CATCTCGCTG
CCCTGGAACG AAGCCTTCCA GAAGTTCGCG ACCATGGTAT CGGCCGGCGA CGTGCCCGAT
GTGATGGAGA TGCCCGATAC CTGGCTGTCG CTCTATGCCA ATAACGGCAT GCTCGAAAGC
CTCGAGCCCT ATCTCGCAAA GTGGGAGCAC ACCAAGGAGC TGACGCCGCG CGCGCTCGAA
CTCGGCCGCG ACGTCAAGAA CACCGCCTAC ATGCTGCCCT ACGGCTTCTA TTTGAGGGCG
ATGTTCTACA ACAAGAAGCT GCTTTCGGAA GCCGGTGTCG CAGCGCCGCC GAAGACGCTG
GAGGAATTCA CCGCCGCTTC GGAAAAGATC TCCAAACTGC AGGGCAAATA CGGTTACTGC
ATGCGCGGCG GCGCGGGCGG CCTCAACGGC TGGATGATCT TCGCCGCCTC GATGGCCGGC
TCGAACAAAT ACTTCAACGA AGACGGCACC TCGACGATGA ACAGCCCGGG CTGGGCCAAG
GGCATCGAAT GGATGGTCGA TCTCTACAAG AAGGGTTATG CGCCGAAGGA CAGCGTCAAC
TGGGGCTTCA ACGAAGTCGT CGCCGGCTTC TATTCCGGCA CCTGCGCTTT CCTCGACCAG
GATCCGGATG CGCTGATCGC CATTGCCGAA CGCATGAAAA AGGAAGATTT CGGCGTCATG
CCGCTGCCGA AAGGCCCGGA TGGCAAGTCC TTCCCGACGA TCGGCTATGG CGGCTGGTCG
ATGTTTGCGA CCAGCGGCAA CAAGGATCTC TCGTGGAAGC TGATCGCCAC CCTCGAAGGG
CCGGAAGGCA ATATCGAGTG GAACAAGCGC ATCGGCGCCC TGCCGGCCTA TACGGCGGCC
GAGAAGGATC CCTTCTATGC CGGTGACCAG TTCAAGGGCT GGTTCGAGGA ACTAGCCGAC
CCGAACACGG TGCCGACTGT CATGCCGACC TACCTCGAGG AATTTGCCTT CTTCAAGGAT
TCGCTGGCGA TCAAGACCTC GCAGCAGGCC TTGCTCGGCG ATATCTCGGC AAAGGATCTG
GCCGACCAGT GGGCGGACTA TCTGACCAAG GCGCAGCAGA AGTTTCTGAG CAAGAAGTAA
 
Protein sequence
MKKLIISTLF ASMMAGTAFA DTTLKLVEVI TSPERTETLK SIVGKFEAAN PGTKVDIISL 
PWNEAFQKFA TMVSAGDVPD VMEMPDTWLS LYANNGMLES LEPYLAKWEH TKELTPRALE
LGRDVKNTAY MLPYGFYLRA MFYNKKLLSE AGVAAPPKTL EEFTAASEKI SKLQGKYGYC
MRGGAGGLNG WMIFAASMAG SNKYFNEDGT STMNSPGWAK GIEWMVDLYK KGYAPKDSVN
WGFNEVVAGF YSGTCAFLDQ DPDALIAIAE RMKKEDFGVM PLPKGPDGKS FPTIGYGGWS
MFATSGNKDL SWKLIATLEG PEGNIEWNKR IGALPAYTAA EKDPFYAGDQ FKGWFEELAD
PNTVPTVMPT YLEEFAFFKD SLAIKTSQQA LLGDISAKDL ADQWADYLTK AQQKFLSKK