Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4916 |
Symbol | |
ID | 5319128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1426382 |
End bp | 1427614 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640776700 |
Product | extracellular solute-binding protein |
Protein accession | YP_001313632 |
Protein GI | 150377036 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.06645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.512511 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTATCC GAAAATATGC AATGCTGGGC GCATTCGCGC TCGCCAGCGT TTCCTTGTCA ACCTTTGGCG CAAGCGCCGA GGACGTAACG ATCAGAGTCT GGTCGCTTGA TCGAGACATT CAGCCGGCAC CGAACTTGAT AAAGGATTTC AACGAGCTGA ACTCCGGTAT TAAAGTCGAG TATCGGCAGA TCCAGTTCGA TGACGTGGTC AGCGAAGCCA TGCGGGCTTA TTCGACCGGA CAAGCGCCGG ACATCATCGC CGTCGACAAT CCCGAACACG CGCTCTTTGC GTCTCGCGGC GCGTTTCTCG ATCTGAGCGA CATGATCGCG AAGTCCTCCG TCGTAAAGCC GGAGAACTAT TTCAAGGGGC CGCTCGCCTC CGTGACCTGG GACGGGAAAT ATTACGGTAT CCCCAAGGCG ACCAACACGA TCGCCCTTTA TTACAACAAG GACATGTTCA AGGCGAAAGG TCTGGACCCG AACAAGCCTC CGCAAACCTG GGACGAACTG GTCGAGGCGG CACGCAAGCT GAGCGACCCC GCTCAGAACG TCTACGGCAT CACCTTTTCG GCAAAGGCCA ACGAGGAGGG GACATTCCAG TTTCTGCCCT GGGCGCAGAT GGCCGGGGGT GGTTACGACA ACATCAATTC CGAAGGCGCG GTCAAGGCGC TCGACGTCTG GAAGACGATC ATCGACGAGA AGCTTGCCTC GCCGGACACG CTGACACGCA GCCAATGGGA CGCCACCGGT ACCTTCAATT CCGGCAACGC GGCCATGGCG ATTTCCGGCC CCTGGGAACT CGACCGCATG CTCGAGGAAG CGAAGTTCGA TTGGGGCGTC GCGCTGATGC CCGTGCCGCA GGCCGGCGCT GAACGCTCTT CGGCCATGGG CGATTTCAAT TGGGCGATCT TTGCGAACAC CGAGCATCCG GAAGAGGCCT TCAAGGTCCT CGAATATTTC GTTTCCCAGG ACGACCGGAT GTTCAAGGAC TTCGGACAAC TGCCGCCTCG TTCGGACATC GCCATTCCGG CGACGGGCGA GCCGAAAAAG GACGCGGCTC TCCAGGTCTT CGTGGAGCAG CTGAAATATG CCAAGCCGCG TGGCCCCCAT CCCGCATGGC CCAAGATCTC CAAGGCGATC CAGGACGCCA TCCAGGCGGC GCTTACCGGG CAGATGAGCT CGAAGGAAGC GCTCGACCAG GCGGCGGAGA AGATCAAGGC GGTTCTTGGC TGA
|
Protein sequence | MGIRKYAMLG AFALASVSLS TFGASAEDVT IRVWSLDRDI QPAPNLIKDF NELNSGIKVE YRQIQFDDVV SEAMRAYSTG QAPDIIAVDN PEHALFASRG AFLDLSDMIA KSSVVKPENY FKGPLASVTW DGKYYGIPKA TNTIALYYNK DMFKAKGLDP NKPPQTWDEL VEAARKLSDP AQNVYGITFS AKANEEGTFQ FLPWAQMAGG GYDNINSEGA VKALDVWKTI IDEKLASPDT LTRSQWDATG TFNSGNAAMA ISGPWELDRM LEEAKFDWGV ALMPVPQAGA ERSSAMGDFN WAIFANTEHP EEAFKVLEYF VSQDDRMFKD FGQLPPRSDI AIPATGEPKK DAALQVFVEQ LKYAKPRGPH PAWPKISKAI QDAIQAALTG QMSSKEALDQ AAEKIKAVLG
|
| |