Gene Information |
Locus tag | Smed_5594 |
Symbol | |
ID | 5319896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 560644 |
End bp | 562251 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640777339 |
Product | extracellular solute-binding protein |
Protein accession | YP_001314271 |
Protein GI | 150377676 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
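The coordinates, gene length, and protein length in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(560644, 562251)
print(length)                  # 1608
print(protein_length(length))  # 535
```

Both values agree with the Gene Length and Protein Length rows of the record.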
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.459863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.392289 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGACA TCACCAATTG GACCAGATCT GACGACGCTA TGATCGAAAC CGCCATTCGT CGCGGAGCGA CGCGCCGCGA ACTCCTGCAG ATGATGCTGG CCGGCGGTGC TGCCCTCTCT GCCGGCAGTC TGATGCTCGG CCGAGCCGGC AATGCGGTTG CCGCAACGCC GGTGGCCGGC GGTACGCTCA AAGCGGCCGG CTGGTCGGCT TCCACCGCCG ACACGCTGGA CCCGGCCAAG GCATCGCTCT CCACCGACTA TGTCCGCTGC TGCTCCTTCT ACAACCGACT CACCTTCCTC GATAAAGGCG GCACGCCGCA GATGGAACTG GCCGAAGCTA TCGAGACCAA GGATGCCAAG ACCTGGACCG TCAAGCTGAG GAAGGGCGTT ACCTTCCATG ATGGCAAGCC GCTGACAGCC GACGACGTGA TCTTCTCGCT GAAGCGCCAC CTCGACCCGG CGGTCGGTTC AAAGGTCGCC AAGATCGCCG CGCAGATGAC GAGCTTCAAG GCGGTGGACA AGCAGACGGT CGAGATCACG CTCGCGAGCC CGAACGCCGA CCTGCCGACG ATCCTCTCGA TGCACCACTT CATGATCGTC GCCGACGGCA CCACCGACTT CTCGAAAGGG AACGGCACCG GCGCTTTTGT GAGAGAAGTC TTCGAGCCCG GCGTGCGCTC CGTCGGCCTC AAGAACAAGA ACTACTGGAA GTCCGGACCG AACGTCGATT CCTTCGAGTA CTTCGCCATC AGCGACGACA GTTCCCGGGT GAATGCGCTC TTGTCCGGCG ACATCCACCT TGCCGCCACG ATCAATCCGC GCTCCATGCG CCTGGTCGAG AGCCAAGGCG ACGGCTTCGT CCTCTCGAAG ACGACCTCCG GCAACTACAC CAACCTCAAC ATGCGGCTGG ACATGGAACC CGGCAGCAAG CGCGACTTCG TCGAAGGCAT GAAGTACCTC GTCAACCGCG AGCAGATCGT CAAGTCGGCG CTTCGGGGTC TAGGCGAGGT CGGCAACGAC CAGCCGGTGT CCCCGGCCAA CTTCTATCAC AATCCCGACC TGAGGCCGCA CGCCTTCGAC CCCGAGAAGG CGAAGTTCCA CTTCGAGAAG GCCGGCATGC TAGGCCAGTC GATCCCGGTG GTTGCCTCCG ATGCAGCGAA CTCCGCGATC GATATGGCAA TGATCATTCA GGCCTCCGCT GCCGAAATCG GATTGAAGCT CGATGTGCAG CGCGTTCCCG CAGACGGTTA CTGGGATAAT TACTGGCTGA AGGCGCCGAT CCACTTCGGC AACATCAACC CGCGCCCCAC GCCGGATATC CTGTTCTCTC TGCTCTACTC CTCGGAGGCT CCGTGGAACG AGAGCCAATA CAAGTCGGAG AAATTCGATA AGATGCTGAT CGAAGCGCGT GGCTCGCTCG ACCAGGAGAA GCGCAAGGCG ATCTACAATG AGATGCAGGT GATGGTCGCC AGTGAAGCCG GCACCATCAT TCCGGCCTAT ATCTCCAACG TCGACGCGAT CACCGCCAAG CTCAAGGGCC TGGAAGCCAA TCCGCTTGGC GGGCAGATGG GTTATGCTTT TGCGGAATAT GTCTGGCTCG AGGCCTGA
|
Protein sequence | MNDITNWTRS DDAMIETAIR RGATRRELLQ MMLAGGAALS AGSLMLGRAG NAVAATPVAG GTLKAAGWSA STADTLDPAK ASLSTDYVRC CSFYNRLTFL DKGGTPQMEL AEAIETKDAK TWTVKLRKGV TFHDGKPLTA DDVIFSLKRH LDPAVGSKVA KIAAQMTSFK AVDKQTVEIT LASPNADLPT ILSMHHFMIV ADGTTDFSKG NGTGAFVREV FEPGVRSVGL KNKNYWKSGP NVDSFEYFAI SDDSSRVNAL LSGDIHLAAT INPRSMRLVE SQGDGFVLSK TTSGNYTNLN MRLDMEPGSK RDFVEGMKYL VNREQIVKSA LRGLGEVGND QPVSPANFYH NPDLRPHAFD PEKAKFHFEK AGMLGQSIPV VASDAANSAI DMAMIIQASA AEIGLKLDVQ RVPADGYWDN YWLKAPIHFG NINPRPTPDI LFSLLYSSEA PWNESQYKSE KFDKMLIEAR GSLDQEKRKA IYNEMQVMVA SEAGTIIPAY ISNVDAITAK LKGLEANPLG GQMGYAFAEY VWLEA
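The sequences in this record are displayed in space-separated blocks of ten characters, so the spaces have to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first three blocks of the gene sequence rather than on the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, as displayed in the record.
fragment = clean_sequence("ATGAATGACA TCACCAATTG GACCAGATCT")
print(len(fragment))          # 30
print(gc_fraction(fragment))  # 0.4
```

Run on the full gene sequence, the cleaned length can be compared against the 1608 bp reported in the record, and the GC fraction against the reported 61%.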