Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4159 |
Symbol | |
ID | 5319208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 632630 |
End bp | 633895 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640775964 |
Product | extracellular solute-binding protein |
Protein accession | YP_001312897 |
Protein GI | 150376301 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.119299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.873951 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGCCCC GCATTTGGGA GGAATGCAAA ATGTTCACGA AACTTATGGC GGCAACGGCC CTCATATCGG CCAGCATGAT CTCGGTCGCC TCGGCCGAGA CGATCAGCAT GTGGGTGCGC TCGGGTATAG GCGACTCGTT CAAGGAAGTC GTGAAGGCCT ACAACACAGC GCACGAGAAC AAGGTGGAAC TCACCGAGGT GCCCTTTGCC GAGCTCGTGC AGAAATATGC GACGGCGATC GCCGGCGGGC AGGCACCCGA CGCGCTGTCC CTCGACCTGA TCTACACGCC GGCCTTTGCA GCTGCCGGCC AGCTGGAAGA CCTCACCGAC TGGGCGAAGG CGCTACCCTA TTTCAACTCG CTGTCGCCGT CGCACGTCAA GCTCGGCACC TATGAGGACA AGATCTACGG CCTGCCGCTG ACGGTCGAGA CCTCAATTTT CGCCTGGAAC AAGGACCTCT ACAAGAAGGC CGGCCTCGAT CCCGAAAAAG CGCCGGCCAC CTGGGAGGAG ATCACTGCCA ACGCCGAGAA AATCCGTGGG CTCGGCGGCG ATACATACGG CTTCTATTTC TCCGGCGGCG GCTGCGGCGG CTGCATGATA TTCACCTTCA CGCCCCTGAC CTGGGGTGCG GGTGCGGATA TCCTTTCGGC CGACGGCAAG ACGGCGACGC TCGACACGCC CCCGATGCGC AAGGCCGTCG ACATCTACCG CAACATGATC GCGAAGGATC TGGTGCCGGC GGGTGCTGCA AGCGACAACG GGGTGAATTT CCTGAGCTTC ACCAATGGCA AGATCGGCCA GCAAAGCCTC GGCGCCTTCG CGATCGGCAC ACTGGTGACG CAGCATCCGG AGATTGATTT CGGCGTGACG CTGATCCCGG GCGTCGACGG CAAGCCGTCC TCCTTTGCCG GCGGGGACAA TTTCGTCGTC ACCAAGGGCA CGCCTAAGCT CGCGGACGTC AAGGAATTCC TTGAATACAC CTATTCGCCC GAAGGCCAGA AGATCATGGC GAAGTATGGC AGCCTGCCGA CCCGCGGCGA CATCGCCAAT GAGGTGCTCG AGGGCCTTGA CCCGCGCCTG AAGGTCGGTC TCGACGCGAT CGCCGTCGCC AAGACGCCCT ACACGCTGCA GTTCAACGAT CTGATCAACA GCGCAAACGG CCCATGGGCG ACCTTCACCA ATGCCGCGAT CTACGGCGAC GACGTCGACG GTGCCTTCGC GGATGCGCAA GCAGAGATGC AGTCGATCAT CGACGCGGGG CAGTAA
|
Protein sequence | MWPRIWEECK MFTKLMAATA LISASMISVA SAETISMWVR SGIGDSFKEV VKAYNTAHEN KVELTEVPFA ELVQKYATAI AGGQAPDALS LDLIYTPAFA AAGQLEDLTD WAKALPYFNS LSPSHVKLGT YEDKIYGLPL TVETSIFAWN KDLYKKAGLD PEKAPATWEE ITANAEKIRG LGGDTYGFYF SGGGCGGCMI FTFTPLTWGA GADILSADGK TATLDTPPMR KAVDIYRNMI AKDLVPAGAA SDNGVNFLSF TNGKIGQQSL GAFAIGTLVT QHPEIDFGVT LIPGVDGKPS SFAGGDNFVV TKGTPKLADV KEFLEYTYSP EGQKIMAKYG SLPTRGDIAN EVLEGLDPRL KVGLDAIAVA KTPYTLQFND LINSANGPWA TFTNAAIYGD DVDGAFADAQ AEMQSIIDAG Q
|
| |