Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6765 |
Symbol | |
ID | 8022695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | - |
Start bp | 201781 |
End bp | 203040 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644833632 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002984766 |
Protein GI | 241666682 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.979933 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC TAATAATCTC GACGATCTTT GCGTCGATGA TGGCGGGCAC GGCCTTTGCC GATACGACGC TCAAGCTTGT CGAAGTCATC ACCAGCCCGG AGCGCACCGA AACGCTGAAA TCGATCGTCG GCAAGTTCGA AGCCGCCAAT CCCGGCACCA AAGTCGACAT CATCTCGTTG CCCTGGAACG AGGCGTTCCA GAAGTTCGCG ACTATGGTGT CTGCCGGCGA CGTGCCCGAT GTGATGGAGA TGCCCGACAC CTGGCTGTCG CTTTATGCCA ATAACGGCAT GCTCGAGAGC TTAGAGCCCT ACCTTGCAAA GTGGGAGCAC ACCAAGGAGC TGACGCCGCG CGCACTCGAA CTCGGCCGCG ACGTCAAGAA CACCGCCTAT ATGCTGCCCT ACGGATTCTA TCTCAGGGCG ATGTTCTACA ACAAGAAGCT GCTTGCCGAA GCCGGCGTCG CCGCGCCGCC GAAGACGATG GAAGAATTCA CCGCCGCTTC GGAAAAGGTT TCCAAACTGT CCGGCAAATA CGGCTACTGC ATGCGCGGAG GAGCGGGCGG CCTCAACGGC TGGATGATCT TCGCCGCCTC GATGGCCGGC TCGAACAAGT ATTTCAACGA CGACGGCACC TCGACGATGA ACAGCCCCGG CTGGGCCAAG GGCATCGAAT GGATGGTCGA TCTCTACAAG AAGGGCTATG CGCCGAAGGA CAGCGTCAAC TGGGGCTTCA ACGAGGTCGT CGCCGGCTTC TATTCTGGCA CCTGCGCATT CCTCGATCAG GATCCGGATG CGTTGATCGC CATTGCCGAA CGCATGAAGA AGGAGGATTT CGGCGTCATG CCGCTGCCGA AGGGGCCTGA CGGCAAGTCC TTCCCGACGA TCGGCTATGG CGGCTGGTCG ATGTTCACGA CCAGCGGCAA CAAGGATCTC TCCTGGAAGC TGATCGCCAC GCTCGAAGGG CCGGAAGGCA ATATCGAGTG GAACAAGCGC ATCGGCGCCC TGCCGGCCTA TACGGCGGCC GAGAAGGATC CCTTCTACGC CGGTGACCAG TTCAAGGGCT GGTTCGAGGA ACTGGCCGAC CCGAACACGG TACCAACAGT CATGCCGACC TATCTCGAGG AATTCGCCTT CTTCAAGGAT TCGCTAGCGA TCAAGACCTC GCAACAGGCG ATGCTCGGCG ATATCTCGGC GAAGGATCTG GCCGACCAGT GGGCGGAATA CCTCACCAAG GCGCAGCAGA AGTTCCTCGC CAAGAAATAA
|
Protein sequence | MKKLIISTIF ASMMAGTAFA DTTLKLVEVI TSPERTETLK SIVGKFEAAN PGTKVDIISL PWNEAFQKFA TMVSAGDVPD VMEMPDTWLS LYANNGMLES LEPYLAKWEH TKELTPRALE LGRDVKNTAY MLPYGFYLRA MFYNKKLLAE AGVAAPPKTM EEFTAASEKV SKLSGKYGYC MRGGAGGLNG WMIFAASMAG SNKYFNDDGT STMNSPGWAK GIEWMVDLYK KGYAPKDSVN WGFNEVVAGF YSGTCAFLDQ DPDALIAIAE RMKKEDFGVM PLPKGPDGKS FPTIGYGGWS MFTTSGNKDL SWKLIATLEG PEGNIEWNKR IGALPAYTAA EKDPFYAGDQ FKGWFEELAD PNTVPTVMPT YLEEFAFFKD SLAIKTSQQA MLGDISAKDL ADQWAEYLTK AQQKFLAKK
|
| |