Gene Information |
Locus tag | Rleg2_5419 |
Symbol | |
ID | 6978513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1061701 |
End bp | 1062528 |
Gene Length | 828 bp |
Protein Length | 275 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643394521 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_002279339 |
Protein GI | 209547421 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
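
The length fields above follow from the coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa).

    Assumes 1-based inclusive coordinates and a CDS whose last codon
    is a stop codon, so it contributes no amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
gene_len, protein_len = cds_lengths(1061701, 1062528)
print(gene_len, protein_len)  # 828 275
```

Under these assumptions the result agrees with the Gene Length (828 bp) and Protein Length (275 aa) rows.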
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00240079 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATCCTT TTCAATCATT TGCGGCCACC CTTGCCCCAT TGCGGGCCGG TATTCTGACC TGCCTTCTTG CGATGGGGGC AGGCTCTGCG ACCGCGGCAG ACAATCCCTA TAACCTGATC GAGGCCGGGG TGATCAGCGT CGGAACGATG GGTGATTCCA AGCCCTATAC CTTTGCGACG GCGGACGGTC AGTTCACCGG TTTCGATATC GAGCTGTTTC TCAACGTCGT CTCCCGCCTC GGCTTTGCGA AAGACAAGGT GACGTTCACG GGTCAGGAAT TTTCAGCCCT CCTGCCGTCG GTGGCAAACG AACGCTTCGA CGTTGCCGTC GCCGCGATCG GCACCACCGA AGCCCGCAAG AAGACCGTCG ATTTTTCCGA CGGCTATCTT GCCGGTTATC TCTCGGTTTT GACCGCGGAT GCCGGCATCA AGGATGCTGA CGGCCTCAAG GGCAAACGTC TCGGCGTCGT GCAGGGGACC CTGCAGGAAG TCTATGCGGC CAAGAATTTC GGCGGGACCG ATCTGGTGAA ATTCCCCGAC AACAATTCCG CCGTCGCCGC CCTCAACAAC GGAACGGTCG ATGCGCATTT CCTCGATTAC GAGGCTGCCA AACAATATGG CGAGCGTTAT CCCGCACTGA AGGTTGCGGT CAACATCCCG TCCTTCGATG CGCCGGCGGG CTTCGTGATC CGCAAGGGAA ACGATGCCTT CCGCACGGCG CTGAACGGCG CCCTTCACGA CGCGATGCAG GACGGCACCT GGAAGACCCT CTACGAAAAA TGGTTCCCAG GCTCGCCGAT GCCGGACCAG TATCTTCCCA AGAAGTGA
|
Protein sequence | MNPFQSFAAT LAPLRAGILT CLLAMGAGSA TAADNPYNLI EAGVISVGTM GDSKPYTFAT ADGQFTGFDI ELFLNVVSRL GFAKDKVTFT GQEFSALLPS VANERFDVAV AAIGTTEARK KTVDFSDGYL AGYLSVLTAD AGIKDADGLK GKRLGVVQGT LQEVYAAKNF GGTDLVKFPD NNSAVAALNN GTVDAHFLDY EAAKQYGERY PALKVAVNIP SFDAPAGFVI RKGNDAFRTA LNGALHDAMQ DGTWKTLYEK WFPGSPMPDQ YLPKK
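
The gene and protein sequences above can be checked against each other by translation. The following is a minimal sketch in plain Python, not the database's own tooling. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons), and that the spaces in the sequence are display grouping only. The names `translate_cds` and `gc_content` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base display grouping and normalise case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence above.
print(translate_cds("ATGAATCCTT TTCAATCATT TGCGGCCACC"))  # MNPFQSFAAT
```

Passing the full gene sequence to `translate_cds` should give a string comparable with the Protein sequence row, and `gc_content` gives a value comparable with the GC content row.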
|
| |