Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4685 |
Symbol | |
ID | 5319327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1200706 |
End bp | 1202304 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640776483 |
Product | extracellular solute-binding protein |
Protein accession | YP_001313415 |
Protein GI | 150376819 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.192999 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.292705 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTAC TCAGACATAA TCTCGCCGGC GCGGCGCTCA TCTGTTCGCT GCTCATGGGG GCAAGCCCCG CGCTCGCCCA GGCGGTACTT CACCGCGGCA ATGCCGGCGA GCCGCAGACG CTCGACCAGG CTCACACCTC CATCAATATC GAGGAGTTCA TTCTCAAGGA CCTCTATGAA GGCCTGACCA TCTATGATGC CGCGGGAAAG ATCGTACCGG GTGCCGCCGA AACCTGGGAG CTTTCGGATG ATGGTACCGT CTATACCTTC AAACTGCGTG CCGATGCCAA GTGGTCTGAC GGCTCACCGG TGACGGCAGA AGATTTCGCC TTCTCCCTCC GTCGGGTGGA AGATCCGAAG ACGGCGGCCG AGTACGCCAA TATCCTGTTC CCGATCAAGA ACGCGGAAAA GGTCAACAAG GGCGAACTGC CGGTCGACCA GCTCGGCGTG AAGGCCGTCG ACGAGAAGAC GCTCGAAGTC ACCCTCGAAC GTCCGACGCC TTTCTTCCTG GAACTGCTCG CACATCAGAC GGCTCTTCCG GTCAGCAAGG CCAACGTCGA GAAGAACGGT GCCGACTTCG TGAAGCCGGG CGTGATGGTT TCGAACGGGG CCTTCAAACT GGCGTCACAT GTGCCGAACG ACAGTCTGAC CGTGGAGAAG AACACGAACT ACTGGGATGC CGCCAACGTC AAGCTCGACA AGGTGATCTT CTATCCGATC GATGATCAGG CCGCCTCGGT GCGCCGTTTC GAAGCGAAGG AAATGGACCT CGCCTATAAC TTCTCGGCCG ACCAGATCGA CCGCCTGCGT AAATCCTATG GTGAACAGGT GCACGTTTCT CCGACGCTTG CGACCTACTA CTACGCTTTC GACACGCGCC AGGAGCCCTA CAACGATGTC CGGGTCCGCC GGGCACTCTC TATGGCGGTT GACCGCGACT TCCTTGCCAA GGAAATCTAC AGCGGCTCGC AGCTGCCGTC CTATTCGATG GTGCCGCCAG GCATCGAGAG CTACGGAGAT CCCGCCAAGG CCGATTTCGG GGACATGTCG CAACTCGACC GCGAGGACAA GGCGATCGAG TTGATGAAGG AAGCCGGCTA CGGCGAAGGC GGCAAGCCGC TCAACATCGA AATCCGCTAC AACACCAACC CCAACCATGA GCGTGTCGCG ACCGCGGTTG CCGACATGTG GAAGAACACC TTCGGTGCCA AGGTCTCGCT GGTGAATCTC GATGTGTCGT CCCACTACGC CTATCTGCAG GAAGGCGGCA AGTTCAACGT CGCGCGCGCA GGCTGGGTCG CCGATTACGC CGATGCCGAG AACTTTCTGG CGCTGAGCCT CAGCACCAAC AAGACGTTCA ATTACGGCCA CTTCGAAAAT GCGGAATTCG ACGCGTTGAT GAAGAAATCC TATGAAGAAC AGGATCCTGC AGCACGTTCG AAAATCCTGC ATGAGGCCGA AACGCTGCTG ATGAAGGAAC AGCCGATAGC GCCCCTCCTG ACCCAGGCCG ACCTCTGGCT CGTTTCGGAA CGGGTCCAGG GTTGGGTGGA CAATGCGCCG AACGCTCACC TGAGCAAGTT CCTGAGCATC GCCGAGTAA
|
Protein sequence | MALLRHNLAG AALICSLLMG ASPALAQAVL HRGNAGEPQT LDQAHTSINI EEFILKDLYE GLTIYDAAGK IVPGAAETWE LSDDGTVYTF KLRADAKWSD GSPVTAEDFA FSLRRVEDPK TAAEYANILF PIKNAEKVNK GELPVDQLGV KAVDEKTLEV TLERPTPFFL ELLAHQTALP VSKANVEKNG ADFVKPGVMV SNGAFKLASH VPNDSLTVEK NTNYWDAANV KLDKVIFYPI DDQAASVRRF EAKEMDLAYN FSADQIDRLR KSYGEQVHVS PTLATYYYAF DTRQEPYNDV RVRRALSMAV DRDFLAKEIY SGSQLPSYSM VPPGIESYGD PAKADFGDMS QLDREDKAIE LMKEAGYGEG GKPLNIEIRY NTNPNHERVA TAVADMWKNT FGAKVSLVNL DVSSHYAYLQ EGGKFNVARA GWVADYADAE NFLALSLSTN KTFNYGHFEN AEFDALMKKS YEEQDPAARS KILHEAETLL MKEQPIAPLL TQADLWLVSE RVQGWVDNAP NAHLSKFLSI AE
|
| |