Gene Information |
Locus tag | Smed_2178 |
Symbol | |
ID | 5323038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 2250833 |
End bp | 2252422 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640791116 |
Product | extracellular solute-binding protein |
Protein accession | YP_001327846 |
Protein GI | 150397379 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
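The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon while the protein length does not. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2250833
END_BP = 2252422
PROTEIN_LENGTH_AA = 529


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1590
print(protein_length(length_bp))  # 529
```

Under these assumptions both results agree with the Gene Length (1590 bp) and Protein Length (529 aa) fields.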
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.246301 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.618007 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATT ACAAAGACTA CTTGGCTCGT CAGGTCATGC TCGGCAAGAT GAAGCGGCGC GAGTTTCTCG GACGCGCAGC CGCTCTCGGC ATTGCCGCGT CGAGCGCAAA TATGCTCTTC GCGTCCAGCG CCGCAGCGCA GGAGCCAAAA CGCGGCGGTC ACCTTAAGCT CGGCCTAGAA GGGGCTGCTG CAACGGACTC GCGGGACCCC GCAAAGGCCC TGTCGCAATT CATGTTCGTC GTCGGCCGCA ACTGGGGCGA CATGCTGGTC GAGAGCCATC CGACGACCGG CGAGCCGGTG CCGGCACTTG CGGAATCCTG GGAACCCTCC GCCGATGCGT CCACCTGGAC CTTTACGATC CGCAAGGGCG TGAAATTCCA TGACGGCAAG GAACTGACGA TCGACGACGT CATCAAGACT CTGCAGCGAC ACACGGATGA AAAATCCGAG TCCGGCGCGC TCGGCGTGAT GAAATCCATC AAGGAAATCA AGGCCGATGG CGATAAGCTC GTCCTGGTGC TGACGGAAGG CAATGCGGAC CTGCCGCTGC TTTTGACCGA CTACCATTTG ATCATCCAGC CGAACGGCGG CACCGACAAT CCCGATGCGA TGATCGGTAC CGGTCCCTAC AAGGTCGCAA GCTTCGAGCC GGGCATACGC GCCACGTTCG AGAAGAACCC GGACGACTGG CGCACCGACC GCGGCTTCGT CGATTCCATC GAATTGATCG CCATGAACGA CGCGACGGCG CGTGTCGCGG CGCTTTCCTC GGGCCAGGTC CACTTCATCA ACCGCGTCGA TCCTAAAACC GTCAATCTGC TGAAGAAGGC GCCGACTGTT GAAATCCTCA ACACGTCCGG CCGCGGCCAC TACGTGTTCA TCATGCATTG CAACACGGCG CCCTTCGACA ACAACGATCT GCGCATGGCG CTGAAATATG CCATGGACCG CGAGACCCTC GTAGAGCGCA TCCTCGGCGG CTACGGCAAG ATCGGGAACG ACTTCCCGAT CAACGACACC TATGCGCTTT TCCCGGAAGG GATAGAGCAA CGCACTTACG ACCCCGACAA GGCCGCATTC CACTACAAGA AATCCGGCCA TAGCGGTCCG GTGCTGCTGC GCACCTCCGA CGTCGCCTTC CCGAACGCGG TCGACGCCGC GGTTCTCTAC CAGGCGAGCG CCAGGAAGGC CGGCATCGAG ATCGAGGTCA AGCGCGAGCC CGGCGACGGC TACTGGTCCA ATGTCTGGAA CGTCCAGCCT TTCTCGACAT CTTATTGGGG CGGACGCCCG ACCCAGGATC AGATGTACTC CACCGCCTAT CTCTCGACGG CCGACTGGAA CGACACCCGT TTCAAGCGTC CGGATTTCGA CAAGATCCTG CTTGAGGCGC GTTCCGAGCT GGACGAAGCC AAGCGCAAGG ACATGTACCG CACCATGGCG ATGATGGTGC GCGACGAGGG CGGCCTGATC CTGCCCATGT TCAACGACTT CGTGAACGCG GCCGGCAAGA CGGTGAAGGG CTATGTCCAC GACATCGGCA ACGACATGTC CAACGGCTAT GTCGCGACCA GGGTATGGCT GGACGCCTGA
|
Protein sequence | MSDYKDYLAR QVMLGKMKRR EFLGRAAALG IAASSANMLF ASSAAAQEPK RGGHLKLGLE GAAATDSRDP AKALSQFMFV VGRNWGDMLV ESHPTTGEPV PALAESWEPS ADASTWTFTI RKGVKFHDGK ELTIDDVIKT LQRHTDEKSE SGALGVMKSI KEIKADGDKL VLVLTEGNAD LPLLLTDYHL IIQPNGGTDN PDAMIGTGPY KVASFEPGIR ATFEKNPDDW RTDRGFVDSI ELIAMNDATA RVAALSSGQV HFINRVDPKT VNLLKKAPTV EILNTSGRGH YVFIMHCNTA PFDNNDLRMA LKYAMDRETL VERILGGYGK IGNDFPINDT YALFPEGIEQ RTYDPDKAAF HYKKSGHSGP VLLRTSDVAF PNAVDAAVLY QASARKAGIE IEVKREPGDG YWSNVWNVQP FSTSYWGGRP TQDQMYSTAY LSTADWNDTR FKRPDFDKIL LEARSELDEA KRKDMYRTMA MMVRDEGGLI LPMFNDFVNA AGKTVKGYVH DIGNDMSNGY VATRVWLDA
|
| |
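As a rough consistency check between the two sequences above, the gene sequence can be translated codon by codon and compared with the protein sequence. The sketch below is a minimal, dependency-free translator, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `BASES`, `AMINO_ACIDS`, and `translate` are illustrative.

```python
# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Raises ValueError on characters other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":  # stop codon: end of the protein
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence in this record.
GENE_START = "ATGAGCGATT ACAAAGACTA CTTGGCTCGT"
GENE_END = "CTGGACGCCTGA"

print(translate(GENE_START))  # MSDYKDYLAR
print(translate(GENE_END))    # LDA
```

The two fragments reproduce the first ten residues (MSDYKDYLAR) and the last three residues (LDA) of the protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over all 529 residues.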