Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_1751 |
Symbol | |
ID | 5322609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 1831471 |
End bp | 1833087 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640790689 |
Product | extracellular solute-binding protein |
Protein accession | YP_001327421 |
Protein GI | 150396954 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.150343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0954392 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAACT TGTTACTGGC CGGCGTCTGC GCCGCCGCAC TGATGGGAAA TCCCGCATTC GCCGACGACA TCAAGCAGGG TGGCGAAATG ACCGTCACCT ATAAGGACGA TGTTTCGACA CTCGATCCGG CGATCGGCTA CGACTGGCAG AACTGGTCGA TGATCAAGTC GCTGTTCGAC GGCCTGATGG ATTATGTCCC GGGCACGACC GAGTTGCGTC CCGATCTTGC CGAAGCCTAT GAAATCTCCG GGGACGGCAA AATCTTCACG TTCAAGCTGC GCCAGGGCGT CAAGTTTCAC AATGGTCGTG AGCTGACTGC CGAGGACGTG AAATATTCGA TTGAGCGCGT GGTGAATCCG ACGACCCAGA GCCCGGGTGC CGGGTTCTTC TCATCGATCA AAGGCTTCGA AGATGTCTCG GCCGGAAAGG GGGGTGATCT GTCCGGCATC GCCGTGCAGG ATCCGCACAC AATCAGGTTC GAACTGAGCC GGCCGGACGC CACCTTCCTC CACGTCATGG CGCTCAACTT TGCCCATGTC GTGCCAAAGG AAGAGGTCGA GAAACACGGT GCGGATTTCG GGAAAAATCC CGTCGGTTCC GGAGCGTTCA AGCTTGCCGA GTGGACGCTT GGGCAACGCC TGGTGTTCGA ACGCTTCGCC GACTACTGGA ACGAAGGGCT TCCGAAGCTT GACCGCATTA CCTTCGAGGT CGGCCAGGAG CCCGTTGTCG CGCTCCTTCG CCTGCAGAAC GGCGAAATCG ACGTGCCCGG AGACGGCATT CCGCCGGCGA AGTTCGTCGA GGTGACCAAA GATCCTAATT TCAAGGAGCT GATCATTCAG GGCGGTCAGT TGCACACCGG CTATGTGACG ATGAACGTCA AGATGGCCCC CTTCGACAAG GTCGAGGTGC GCAAGGCTGT GAACATGGCC ATCAACAAGG ATCGCATCCT GCGCATCATC AACGGTCGCG CAGTCGCCGC CAATCAGCCG CTGCCGCCCT CGATGCCGGG ATATGCCAAG GATTATAAAG GATATGCCTA TGATCCCGAG GGCGCCAAGA AGCTGCTCGA ACAGGCCGGC CTGGGCGACG GGTTCTCGAC CGAACTCTAT GTCATGAACA CCGACCCTCA GCCGCGTATC GCCCAGGCCA TCCAGCAGGA CCTGAAGGCG ATCGGCATCA CGGCATCGAT AAAGTCGCTG GCACAGGCCA ATGTCATCGC GGCGGGCGGC GAGGAGAACC AGGCGCCGAT GGTCTGGTCG GGCGGCATGG CATGGATTGC CGACTTCCCG GATCCCTCGA ATTTCTACGG CCCCATTCTG GGGTGCGGCG GTGCCGTGCC GGGAGGCTGG AACTGGTCCT GGTACTGCAA TGAGGAGCTC GACAAGAAGG CAGCCGAAGC CGATGCCATC GTAGACCCGG CAAAGGCCGC CGAGCGCGAG GCCATGTGGC GCGACATCTA TGTGAAGATC ATGGAGGACG CACCCTGGGC ACCGATCTTC AACGAGGAGC GCTTTACCAT TCGCTCGGAG CGTATCGGCG GCGACGACAA GCTGTTCGTC GATCCGGTCC ACATTCCCGT TCACTACGAT CAGGTATATG CAAAAGATGT GCAGTAA
|
Protein sequence | MRNLLLAGVC AAALMGNPAF ADDIKQGGEM TVTYKDDVST LDPAIGYDWQ NWSMIKSLFD GLMDYVPGTT ELRPDLAEAY EISGDGKIFT FKLRQGVKFH NGRELTAEDV KYSIERVVNP TTQSPGAGFF SSIKGFEDVS AGKGGDLSGI AVQDPHTIRF ELSRPDATFL HVMALNFAHV VPKEEVEKHG ADFGKNPVGS GAFKLAEWTL GQRLVFERFA DYWNEGLPKL DRITFEVGQE PVVALLRLQN GEIDVPGDGI PPAKFVEVTK DPNFKELIIQ GGQLHTGYVT MNVKMAPFDK VEVRKAVNMA INKDRILRII NGRAVAANQP LPPSMPGYAK DYKGYAYDPE GAKKLLEQAG LGDGFSTELY VMNTDPQPRI AQAIQQDLKA IGITASIKSL AQANVIAAGG EENQAPMVWS GGMAWIADFP DPSNFYGPIL GCGGAVPGGW NWSWYCNEEL DKKAAEADAI VDPAKAAERE AMWRDIYVKI MEDAPWAPIF NEERFTIRSE RIGGDDKLFV DPVHIPVHYD QVYAKDVQ
|
| |