Gene Information |
Locus tag | Smed_2315 |
Symbol | |
ID | 5323176 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2394167 |
End bp | 2395534 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640791253 |
Product | extracellular solute-binding protein |
Protein accession | YP_001327982 |
Protein GI | 150397515 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
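The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(2394167, 2395534)
print(length, protein_length_aa(length))  # 1368 455
```

Under those assumptions the computed values agree with the Gene Length (1368 bp) and Protein Length (455 aa) rows.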

Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.271919 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAATA GATCCAGCCT TGTTGCGGCA TCCGGCAAGT CGCGCCGGGA ATTTCTCCGC AACGGCGCAA CCTTCGCCGC CGCCGGACTG GCCGGCGGCC TGAGCGGCTT TCCGTTCATC AACCGGCTGC CGGTGCGTGC CCAGGACGCC CCGTTGAAGT TCTGGCAATT TTACGCGCCC GGCGGCCAGG TGAAGCCGCA GGTCGAATGG TTCGAGAAAA CCGTCGCCGA TTGGAACGCA ACGCACGACC AGAAGGTCGA GCTCGAATTC ATCCCGAACA AGGAATACAT CAACGGTCCG AAGCTCGCGA CCGCCTTCGC CTCCGGTGAC GGGCCGGACA TCTTCATCAT CTCGCCCGGC GACTTCCTGC GCTATTACAA TGGCGGGGTT CTGCAAGATC TGACGCCCTA TATCGACGAG AAGGCCCGGG CCGATTTCCC GGAAAGCGTG CTTGCGAACC GTATGGTCGA CGGCAAGATC TTCGGCCTTC CGATGGAAGT CGAGCCGATG GCGATGTTCT ACTCCATCAA GGCCTTTGAG GATGCCGGCC TCAATGAGAA TGACGTGCCG AAGACCTGGG ACGAACTGTT GGAACTGGGC AAGAAACTGA CCACGCCGGA ACGTTATGGC CTGCTGTTCC AGACCGCGCC GGGCTATTAC CAGAACTTCA CCTGGTATCC CTTCCTCTGG CAGGGCGGCG GCGAATTCCA GAACGCCGAG GGAAAAAGCG CGTTCGACTC GCCCGCGACC GTGCAGGCGC TGAAGCTCTG GCAGGATGCC GTGAATTCGG GCGCCGCACC CCGGCAGGTA CTCGGCAACG GCGCCAACGA CAGTGTCGCC AATCTCGCCT CCGGCTATTG CGCGATCCAG AATGTCGGTA TCTGGGCCAT TTCCCAATTG AAGAACAACG CCAAGGACTT CCCCTACGGC GTGTTCCGCT TGCCGACGCC GGCGAATGGC AAGTACGTCA CAGTCGGCGG CGGATGGGCT TTCGTCGCCA ATTCCAAAGG CAAGAACCCG GAAGCTGCCG GGCAGTTCTG CGCCTGGGCG CTGGCATCGA TGGATCAAGG CTCGATCGAT CGTGTCGCGA GCTGGTGCAC CGAAGCGAAA TCCGACATGC CCCCGCGCGA CAGCGCTCTG AAAGCACGTG AAGCGGCATT CAGCGAAGGC ATAATCGGCC AGTTCGCCAA AGAGATTCAC CCGGGTACGC GCGCCGAGCC GCGGGTGCCG CCGGAAGTCT ACAAGATCAT CTCGGACGCC GTACAACAGG CCATGCTCGG TGGCGCCGAC CCGCAGGCGA CCGCGACCAC GGCCTCGCAG CGGCTCGACG CCTACCTGGC CTCCTATTCC GGCGCGCCGA TTCTTTAA
Protein sequence | MKNRSSLVAA SGKSRREFLR NGATFAAAGL AGGLSGFPFI NRLPVRAQDA PLKFWQFYAP GGQVKPQVEW FEKTVADWNA THDQKVELEF IPNKEYINGP KLATAFASGD GPDIFIISPG DFLRYYNGGV LQDLTPYIDE KARADFPESV LANRMVDGKI FGLPMEVEPM AMFYSIKAFE DAGLNENDVP KTWDELLELG KKLTTPERYG LLFQTAPGYY QNFTWYPFLW QGGGEFQNAE GKSAFDSPAT VQALKLWQDA VNSGAAPRQV LGNGANDSVA NLASGYCAIQ NVGIWAISQL KNNAKDFPYG VFRLPTPANG KYVTVGGGWA FVANSKGKNP EAAGQFCAWA LASMDQGSID RVASWCTEAK SDMPPRDSAL KAREAAFSEG IIGQFAKEIH PGTRAEPRVP PEVYKIISDA VQQAMLGGAD PQATATTASQ RLDAYLASYS GAPIL
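The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This is an illustrative sketch only, not the method the database used: it assumes the gene sequence is listed in coding orientation (it begins with ATG even though the strand is "-"), it uses the NCBI translation table 11 codon assignments, and it does not handle alternative start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between the 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 21 bases of the gene sequence in this record.
first_codons = "ATGAAGAATA GATCCAGCCT T"
print(translate(first_codons))  # MKNRSSL
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence if the assumptions hold, and `gc_fraction` on the same input can be compared with the reported GC content.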