Gene Information |
Locus tag | Smed_2857 |
Symbol | |
ID | 5323727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2983926 |
End bp | 2985554 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640791802 |
Product | extracellular solute-binding protein |
Protein accession | YP_001328522 |
Protein GI | 150398055 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.791512 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.700488 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCTCA TCAACAGGCG CGGCGCGCTG GGGCTCATCG GCGCAACTGC GGGCAGCATG ATCCTGCCGC GCTTCGCCGT GGGTCAGGGC ACGCGCCCAT CCGTCACCAT CGCCGTGCAG AAGATCACCA TCAACAACAC GCTCGACGTC TGGAACGAGC AGTCGAATGT CGGCGAGCGT GTGTTCTTCC CCAACCTCTG GGAAGGCCTG ATCCTGCGCA ACTGGATGGG CGATCAGGGT CCGGTTCCCG GCCTTGCGAC GGAATGGAAG CGCATTGACG ACAAAACGCT CGAGCTGAAG CTGCGCCAGG GCGTAAAGTT CCATAATGGC GACGAACTCA CGGCCGACGA CGTCGTCTTC AGCTTCTCGG CGCAGCGCGT GTTCGGGGAC ACCCAGCCTG CCGGCGGCAA GACGGTCTTC GAGGATGAGC ACAAGCCGGC GACTGCGAAG GAACTGCCTG CGGTCGTGCC GGGTACCGGT CGCCGCCTGT GGCCGGCGCT GGCCGGTGTC GAGGCAGTCG ACAAGTATAC CGTGCGTTTC CACAACGCGA CGCCCGACGT CACCATCGAG GGCCGCCTTT ATGCATTCGG CAGCCAGATC GCAAATCGTC GCGCCTGGGA TGAGGCGCCG ACCTACATGG ATTGGGCACG CAAGCCGATC ACCACCGGGC CCTACAGGGT CGGCGAACAT AAGCCGGACG TGTCCCTGAC GCTGGTTGCC TTCGACGACT ATTGGGGAGG CCGCCCGCCA CTCGAGCAAA TCCGTTTCGT TGAGGTTCCG GAGGTCTCGT CTCGCGTGAA CGGGCTCCTC TCCGGCGAAT ATGATTTCGC CTGCGACCTG CCGCCGGACC AGATCGCTGC CGTTCAGGCC GCACCGGGCT ACGAGGTTCA GGGCTCGACG ATCCACAACC ACCGCATCTC GGTCTTCAAT GTCCAGAACC CGACCCTTCA GGATCCGCTC GTCCGCCGCG CCATGACCCA TTCGGTCGAT CGCCAGGCGA TCGTCGATGC GCTCTGGGCC GGACAGACGA CCGTCCCCGC TGGCCTGCAA TTTCCGTTCT ACGGGGATAT GTTCGTCGAA GGCTGGGCGG TCCCTGAATA CGATCCGCAA CTGGCCAAGG ACCTCTTGAA GCAGGCCAAT TACAAAGGGG ATGCGATCCC GTTCCGCCTT CTCAACAACT ACTATACGAA CCAGACCGCA AACGGCCAGA TCATGGTCGA GATGTGGAAG CAGGTCGGGC TGAACGTCGA GATTGAAATG AAGGAGAACT GGGGCCAGAT CCACGACCCG TCGGGCGTCA AGGGCGTGCG CGACTGGTCA GCCGGTGCAG CCTTCAGCGA CCCCGTCTCC TCGATCGTTG CCCAGTTCGG ACCCAATGGC GAGGTCCAGC AGAAGAAGGA CTGGTCGAAC GCCGAGGCCA ACCAGATGTC CCAGATCCTC GAAACGGAAA CCGATCAGGC AAAGCGCAAG AAGGCATTTG CCCGCATGCT CGAAATCTGC GAGCGCGAAG ACCCGGTCTA TCAGGTGCTT CATCAGAATG CGGTCTTCAC CGGAATGAAA TCCTCTTTGA AGTGGAAGGC GGCACCCGCC TTCGCCATGG ATTTCCGCGC CGCCAACTGG TCCAGCTGA
|
Protein sequence | MLLINRRGAL GLIGATAGSM ILPRFAVGQG TRPSVTIAVQ KITINNTLDV WNEQSNVGER VFFPNLWEGL ILRNWMGDQG PVPGLATEWK RIDDKTLELK LRQGVKFHNG DELTADDVVF SFSAQRVFGD TQPAGGKTVF EDEHKPATAK ELPAVVPGTG RRLWPALAGV EAVDKYTVRF HNATPDVTIE GRLYAFGSQI ANRRAWDEAP TYMDWARKPI TTGPYRVGEH KPDVSLTLVA FDDYWGGRPP LEQIRFVEVP EVSSRVNGLL SGEYDFACDL PPDQIAAVQA APGYEVQGST IHNHRISVFN VQNPTLQDPL VRRAMTHSVD RQAIVDALWA GQTTVPAGLQ FPFYGDMFVE GWAVPEYDPQ LAKDLLKQAN YKGDAIPFRL LNNYYTNQTA NGQIMVEMWK QVGLNVEIEM KENWGQIHDP SGVKGVRDWS AGAAFSDPVS SIVAQFGPNG EVQQKKDWSN AEANQMSQIL ETETDQAKRK KAFARMLEIC EREDPVYQVL HQNAVFTGMK SSLKWKAAPA FAMDFRAANW SS
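The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the database: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes a single terminal stop codon (the gene sequence above ends in TGA). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 2983926
END_BP = 2985554
GENE_LENGTH_BP = 1629
PROTEIN_LENGTH_AA = 542


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record under the stated assumptions.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

The strand field does not affect these checks, since the span length is the same on either strand.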
|
| |