Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5852 |
Symbol | |
ID | 6977241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | - |
Start bp | 264608 |
End bp | 265867 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643393307 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002278125 |
Protein GI | 209546235 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC TAATAATCTC GACGCTCTTT GCTTCGATGA TGGCGGGTAC GGCCTTTGCC GATACGACGC TGAAGCTTGT CGAAGTCATC ACCAGCCCGG AGCGCACCGA AACGCTGAAA TCGATCGTCG GCAAGTTCGA GGCGGCCAAT CCCGGCACCA AGGTCGACAT CATCTCGCTG CCCTGGAACG AAGCCTTCCA GAAGTTCGCG ACCATGGTAT CGGCCGGCGA CGTGCCCGAT GTGATGGAGA TGCCCGATAC CTGGCTGTCG CTCTATGCCA ATAACGGCAT GCTCGAAAGC CTCGAGCCCT ATCTCGCAAA GTGGGAGCAC ACCAAGGAGC TGACGCCGCG CGCGCTCGAA CTCGGCCGCG ACGTCAAGAA CACCGCCTAC ATGCTGCCCT ACGGCTTCTA TTTGAGGGCG ATGTTCTACA ACAAGAAGCT GCTTTCGGAA GCCGGTGTCG CAGCGCCGCC GAAGACGCTG GAGGAATTCA CCGCCGCTTC GGAAAAGATC TCCAAACTGC AGGGCAAATA CGGTTACTGC ATGCGCGGCG GCGCGGGCGG CCTCAACGGC TGGATGATCT TCGCCGCCTC GATGGCCGGC TCGAACAAAT ACTTCAACGA AGACGGCACC TCGACGATGA ACAGCCCGGG CTGGGCCAAG GGCATCGAAT GGATGGTCGA TCTCTACAAG AAGGGTTATG CGCCGAAGGA CAGCGTCAAC TGGGGCTTCA ACGAAGTCGT CGCCGGCTTC TATTCCGGCA CCTGCGCTTT CCTCGACCAG GATCCGGATG CGCTGATCGC CATTGCCGAA CGCATGAAAA AGGAAGATTT CGGCGTCATG CCGCTGCCGA AAGGCCCGGA TGGCAAGTCC TTCCCGACGA TCGGCTATGG CGGCTGGTCG ATGTTTGCGA CCAGCGGCAA CAAGGATCTC TCGTGGAAGC TGATCGCCAC CCTCGAAGGG CCGGAAGGCA ATATCGAGTG GAACAAGCGC ATCGGCGCCC TGCCGGCCTA TACGGCGGCC GAGAAGGATC CCTTCTATGC CGGTGACCAG TTCAAGGGCT GGTTCGAGGA ACTAGCCGAC CCGAACACGG TGCCGACTGT CATGCCGACC TACCTCGAGG AATTTGCCTT CTTCAAGGAT TCGCTGGCGA TCAAGACCTC GCAGCAGGCC TTGCTCGGCG ATATCTCGGC AAAGGATCTG GCCGACCAGT GGGCGGACTA TCTGACCAAG GCGCAGCAGA AGTTTCTGAG CAAGAAGTAA
|
Protein sequence | MKKLIISTLF ASMMAGTAFA DTTLKLVEVI TSPERTETLK SIVGKFEAAN PGTKVDIISL PWNEAFQKFA TMVSAGDVPD VMEMPDTWLS LYANNGMLES LEPYLAKWEH TKELTPRALE LGRDVKNTAY MLPYGFYLRA MFYNKKLLSE AGVAAPPKTL EEFTAASEKI SKLQGKYGYC MRGGAGGLNG WMIFAASMAG SNKYFNEDGT STMNSPGWAK GIEWMVDLYK KGYAPKDSVN WGFNEVVAGF YSGTCAFLDQ DPDALIAIAE RMKKEDFGVM PLPKGPDGKS FPTIGYGGWS MFATSGNKDL SWKLIATLEG PEGNIEWNKR IGALPAYTAA EKDPFYAGDQ FKGWFEELAD PNTVPTVMPT YLEEFAFFKD SLAIKTSQQA LLGDISAKDL ADQWADYLTK AQQKFLSKK
|
| |