Gene Information |
Locus tag | Smed_4979 |
Symbol | |
ID | 5318792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1492278 |
End bp | 1493597 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640776761 |
Product | extracellular solute-binding protein |
Protein accession | YP_001313693 |
Protein GI | 150377097 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
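The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Smed_4979).
length = gene_length(1492278, 1493597)
print(length, protein_length(length))  # 1320 439
```

Both results agree with the Gene Length and Protein Length fields of the record.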
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.336131 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.060129 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGGC TGCTTTCCGG CGTATCGGCC GGTGTAATCA TGCTTGTCTG CGCTATGGGG GCGGCAAGTG CCGCTGACCT GCCGGGCAAG TTCGAGGGTG TTACCATCGA CGCCAAGCTG ATTGGCGGCC AGCAATATGA AAAGCTCTAC GAGCGCATCG GCGAATGGGA GAAGGCGACC GGCGCCAAGG TCAACATTCT GTCGAAGAAG AACCACTTCG AACTCGACAA GGAGATCAAA TCCGACATCG CCACCGGCGG CATCACCTGG TGCATCGGTT CGAATCATTC CTCATTCGCA CCGCAATATC CGGATATCTA CGCCGATCTT TTCGGCCTCG TCCCGTCCGA GGAGGTCGCC AGGTTCGTGC CGGCCGTGAT CGATGCCTCG ACGCTCGAGG GCAAGCTTGT CATGCTGCCG CGGGCGCAAT TCGACGTCTC GGCACTCTAT TACCAGAAGA GCCTTTATCA GGACGAGGCG AAGAAGGCCG AGTTCAAGGC CAAGTACGGC TATGATCTCG CACCGCCCGA CACCTGGGCT CAGGTGAGCG ATCAGGCGGA ATTCTTCGCC GCGCCGCCGA ACTTCTACGG CACGCAGTTC GCCGGCAAGG AGGAAGCGAT CAACGGCCGC TTCTACGAGA TGCTGGTCGC CGAGGGTGGA GAATATCTCG ACAAGGACGG CCGACCGGCG TTCAATTCGG AAGCGGGCGT GCGCGCCCTC GATTGGTTCG TCAAGCTCTA CAAAAGCAAA GCGGTTCCGC CGGGCACCAC CAACTATCTC TGGGACGATC TCGGCCAGGG CTTCGCTTCG GGCTCGATCG CGGTCAATCT CGACTGGCCT GGCTGGGCGT CCTTCTTCAA CGATCCGAAA TCCTCGAAGG TTGCCGGCAA TGTCGGGGTG AAGGTTCAGC CGGCCGGGTC GTCCGGCAAG CGCACCGGCT GGTCCGGGCA TCACGGCTTT TCGGTAACGG AGTCCTGCGG GAGCAAGGAG GCAGCCGCCT CGCTCGTCTG GTGGCTGACC AACGAAGACA GCCAGATGCT CGAATCCGCA GCCGGCCCTC TTCCGACCCG CAGCGCCGTA TGGGATCACA ACATCAAGGC CGCCGAGGGC GACGCCTACA AGACCGAGGT GCTGCAGGCC TTCCAGGAGG CAGCAAAGCA CGCCTTCCCG GTTCCGCAGA CGGCCGAGTG GATCGAGATA TCCAATGCCG TCTATCCGGA GCTTCAGGCG GCCATCCTCG GCGACAAGAC GTCGAAGGAA GCGCTCGATG CTGCCGCCGA AAAGGCGACC GGCATCCTCG AAGACGCCGG CAAGCTCTAG
|
Protein sequence | MNRLLSGVSA GVIMLVCAMG AASAADLPGK FEGVTIDAKL IGGQQYEKLY ERIGEWEKAT GAKVNILSKK NHFELDKEIK SDIATGGITW CIGSNHSSFA PQYPDIYADL FGLVPSEEVA RFVPAVIDAS TLEGKLVMLP RAQFDVSALY YQKSLYQDEA KKAEFKAKYG YDLAPPDTWA QVSDQAEFFA APPNFYGTQF AGKEEAINGR FYEMLVAEGG EYLDKDGRPA FNSEAGVRAL DWFVKLYKSK AVPPGTTNYL WDDLGQGFAS GSIAVNLDWP GWASFFNDPK SSKVAGNVGV KVQPAGSSGK RTGWSGHHGF SVTESCGSKE AAASLVWWLT NEDSQMLESA AGPLPTRSAV WDHNIKAAEG DAYKTEVLQA FQEAAKHAFP VPQTAEWIEI SNAVYPELQA AILGDKTSKE ALDAAAEKAT GILEDAGKL
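The sequence fields can be checked against each other with a short script. This is a sketch under stated assumptions, not the database's own method: it assumes the Gene sequence field is already in coding orientation (it starts with ATG and ends with TAG, even though the gene lies on the minus strand), and it uses the codon assignments of NCBI translation table 11, which are the same as the standard code for elongation codons. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon assignments for translation table 11, laid out in NCBI's
# conventional order with bases sorted T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate complete codons in frame 1; unknown codons become 'X'."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        residues.append(aa)
    return "".join(residues)


# First 20 bases of the Gene sequence field above.
print(translate("ATGAACAGGC TGCTTTCCGG"))  # MNRLLS
```

Passing the full Gene sequence value to `translate` and comparing the result with the Protein sequence value (after `clean`) is one way to confirm that the two fields agree. `gc_percent` on the same input can be compared with the GC content field.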