Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4771 |
Symbol | |
ID | 6977865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 402637 |
End bp | 403797 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643393935 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002278753 |
Protein GI | 209546835 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAGG ACTACAAGGA TCATCTGCCT ATTACCCCTG AAGGCTTCAT GGACGAGTTC ATGCGCCTGA AGCGCGGCTC CGTCAGCCGC CGCCACTTCC TCGGCGTCAC CGGCCTCGGG CTTGCGACCG CCGTGTTGTC GCGTTTCCCC GGTGCGCTGT CGACGCCCGC CTACGCCGAG GACCTCGGAA CCCAGATGTC GATCGCCACC TGGCCGAATT ACCACGACCC TGCGACCTTC GAGAATTTCA AAGCGGCGAC CGGCGTTGCC GTCGAGGTCA ACGTCTTCGG CTCCAACGAA GAAATGCTGG CCAAGCTCCA AGCGGGCGCC TCCGGCTGGT CGCTCTTCGT GCCGACCAAC TACACGATCT CCACCCACCA CAAGCTCGGC CTGATCGACG AACTCGATCT CTCCAAGATC CCGAATTTCA GCCAGGCGAC GGAAAATCCG CGCTTCACCA AGGAAGGCAT GATCGACGGC AAGACCTACG CCGTGCCGAA GAACTGGGGT ACGACCGGGT TCTCGGTCAA CACCGCAAAG ATCAAGACCA AGCTTTCGAG CTGGAAGGAC TTCTTTGACA TCGCCCAGAC GGAAGCCGAC GGCCGCGCCA TGGTGCATGA CTATCAGTTG ACGACGATCG GCAGCGCGCT GGTTTCGCTC GGCTACGATT TCAATTCGAT CAAGGCGGAC GAACTCGCAA AGGCCGAGGA ACTGCTGATC AAAGTCAAAC CGCACCTTTT CGCCATCAAC AGCGACTACC AACCGGCCAT GCGCGCCACC GACGCCTGGC TCACCATGTG CTGGACCAAC GACGGAGCGC AGCTCAACCG TGACGTCCCC GAGATCGCCT ATGTGCTTGG CACCGACGGC GGCGAGATCT GGACCGACTA TTACGCCATT CCGAAGGACG CGCCGAACAA GGCGGCGGGT TACGCGCTGC TCAACTACCT CATGGATCCG GCCAATGCCG TCAAGGAGCA CGTCGCCAAT GGCGCACCGA CGACCGACAG CCGGGTCATC GCACTGCTGC CGAAGGAGAT CACAGCGAAC AAGATCGTCT ATCCGGACGA GGCGTCATTG ACGCCGCTGG AATTCGGCGC GGCGGTGACG TTGACCGACC CCGGCCGGGC AGAACTGATG GCGCGTTTCA AGTCGGCTTG A
|
Protein sequence | MSKDYKDHLP ITPEGFMDEF MRLKRGSVSR RHFLGVTGLG LATAVLSRFP GALSTPAYAE DLGTQMSIAT WPNYHDPATF ENFKAATGVA VEVNVFGSNE EMLAKLQAGA SGWSLFVPTN YTISTHHKLG LIDELDLSKI PNFSQATENP RFTKEGMIDG KTYAVPKNWG TTGFSVNTAK IKTKLSSWKD FFDIAQTEAD GRAMVHDYQL TTIGSALVSL GYDFNSIKAD ELAKAEELLI KVKPHLFAIN SDYQPAMRAT DAWLTMCWTN DGAQLNRDVP EIAYVLGTDG GEIWTDYYAI PKDAPNKAAG YALLNYLMDP ANAVKEHVAN GAPTTDSRVI ALLPKEITAN KIVYPDEASL TPLEFGAAVT LTDPGRAELM ARFKSA
|
| |