Gene Rleg_6765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6765 
Symbol 
ID8022695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp201781 
End bp203040 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content59% 
IMG OID644833632 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002984766 
Protein GI241666682 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.979933 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC TAATAATCTC GACGATCTTT GCGTCGATGA TGGCGGGCAC GGCCTTTGCC 
GATACGACGC TCAAGCTTGT CGAAGTCATC ACCAGCCCGG AGCGCACCGA AACGCTGAAA
TCGATCGTCG GCAAGTTCGA AGCCGCCAAT CCCGGCACCA AAGTCGACAT CATCTCGTTG
CCCTGGAACG AGGCGTTCCA GAAGTTCGCG ACTATGGTGT CTGCCGGCGA CGTGCCCGAT
GTGATGGAGA TGCCCGACAC CTGGCTGTCG CTTTATGCCA ATAACGGCAT GCTCGAGAGC
TTAGAGCCCT ACCTTGCAAA GTGGGAGCAC ACCAAGGAGC TGACGCCGCG CGCACTCGAA
CTCGGCCGCG ACGTCAAGAA CACCGCCTAT ATGCTGCCCT ACGGATTCTA TCTCAGGGCG
ATGTTCTACA ACAAGAAGCT GCTTGCCGAA GCCGGCGTCG CCGCGCCGCC GAAGACGATG
GAAGAATTCA CCGCCGCTTC GGAAAAGGTT TCCAAACTGT CCGGCAAATA CGGCTACTGC
ATGCGCGGAG GAGCGGGCGG CCTCAACGGC TGGATGATCT TCGCCGCCTC GATGGCCGGC
TCGAACAAGT ATTTCAACGA CGACGGCACC TCGACGATGA ACAGCCCCGG CTGGGCCAAG
GGCATCGAAT GGATGGTCGA TCTCTACAAG AAGGGCTATG CGCCGAAGGA CAGCGTCAAC
TGGGGCTTCA ACGAGGTCGT CGCCGGCTTC TATTCTGGCA CCTGCGCATT CCTCGATCAG
GATCCGGATG CGTTGATCGC CATTGCCGAA CGCATGAAGA AGGAGGATTT CGGCGTCATG
CCGCTGCCGA AGGGGCCTGA CGGCAAGTCC TTCCCGACGA TCGGCTATGG CGGCTGGTCG
ATGTTCACGA CCAGCGGCAA CAAGGATCTC TCCTGGAAGC TGATCGCCAC GCTCGAAGGG
CCGGAAGGCA ATATCGAGTG GAACAAGCGC ATCGGCGCCC TGCCGGCCTA TACGGCGGCC
GAGAAGGATC CCTTCTACGC CGGTGACCAG TTCAAGGGCT GGTTCGAGGA ACTGGCCGAC
CCGAACACGG TACCAACAGT CATGCCGACC TATCTCGAGG AATTCGCCTT CTTCAAGGAT
TCGCTAGCGA TCAAGACCTC GCAACAGGCG ATGCTCGGCG ATATCTCGGC GAAGGATCTG
GCCGACCAGT GGGCGGAATA CCTCACCAAG GCGCAGCAGA AGTTCCTCGC CAAGAAATAA
 
Protein sequence
MKKLIISTIF ASMMAGTAFA DTTLKLVEVI TSPERTETLK SIVGKFEAAN PGTKVDIISL 
PWNEAFQKFA TMVSAGDVPD VMEMPDTWLS LYANNGMLES LEPYLAKWEH TKELTPRALE
LGRDVKNTAY MLPYGFYLRA MFYNKKLLAE AGVAAPPKTM EEFTAASEKV SKLSGKYGYC
MRGGAGGLNG WMIFAASMAG SNKYFNDDGT STMNSPGWAK GIEWMVDLYK KGYAPKDSVN
WGFNEVVAGF YSGTCAFLDQ DPDALIAIAE RMKKEDFGVM PLPKGPDGKS FPTIGYGGWS
MFTTSGNKDL SWKLIATLEG PEGNIEWNKR IGALPAYTAA EKDPFYAGDQ FKGWFEELAD
PNTVPTVMPT YLEEFAFFKD SLAIKTSQQA MLGDISAKDL ADQWAEYLTK AQQKFLAKK