Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5619 |
Symbol | |
ID | 5319921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 585732 |
End bp | 587312 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640777362 |
Product | extracellular solute-binding protein |
Protein accession | YP_001314294 |
Protein GI | 150377699 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR02294] nickel ABC transporter, periplasmic nickel-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.4089 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGTGA ACAGGCGTAC GTTCCTGCAG GGCGCATTCG GCGCCGTGGG ATTGGTCATG GCGCAGGGCG CCTTGTCCAA GCTTGTATAC GCGCAAGGGG CAAGCGGCAC GCTGCGGGTC GCTATCGCAA AGCCCGCAGG TAACCTCGAT CCGCAAAGCC ACTACGCAAT CTGGGCGATA CAGGACCTGA TGTTCGAACC GCTGGTCAAA TACGGCCGGG GAGGCCAGAT CGAACCTTGT CTCGCGACCG ACTGGAAGAT CGAGGGCGGT GGCAAGACGC TACATCTCAC CTTGCGACAG GGTGTTACCT TCCAGGACGG AACCAAGTTC GATGCCGCCG CGTGCAAGTG GAATCTCGAG CGGTGGATGG GGCTCGACCA GTTCAGTTGG ATGAACTGCT CGAAGTATTT CGAGTCGCTC GAAGTCGTTG ACGACTACCA CATCACCCTC CACTTCAACG AGCCCGTGCT GGCGCTGATG CAAGAGCTTT CCTACACCCG GCCGCCACGC TTCCTCAGCC CGATGTCCGT TGGAGCCGAT GGCAAGTTCA AAGAGCCGGT CGGTACGGGC CCTTGGCGCC AAGTCAAGGC GGATGACACC GAAAGCGCAT TCGAGCGCTA CGACGGCTAT TGGGGTGACA AACCATCATA CGAGCGTCTT GAGGCGAAGG TTATTCCCGA CCCGCGCTCG CGGGTCGCGG CACTGCGCAG CGGCGAGATC GATCTGGTCG GCGGCTTCTG GATTGCGCCC TTGACCCCGG AAGAGGCCAA GCAACTCGAG GCGGCCGCCG TCAACGTCGT CGTCGATCCG GGCAATGTTA CACTGGTGAT GGCGTTCAAT CCCGATCGCG CCGCGGCACT CAAGGATTCG CAGGTACGCA AGGCGATCAG TATCGGCATC GATCGTGCGG CAATCTCTCA GGTGCTCTAC CATGGCTATG CCAAGCCTGC GGGTAACTTG TTCTCAAGCG CTTTGCCTTA TGCCGGCAAG CAGCATGGCG CTCCCGTCCG CGACGCGGCG GCCGCGTCCG CGCTGCTGGA GAAGGCCGGC TGGACTGGTG GTCCTATTCG ATCCAAGGAT GGCAAGCCGC TGACGCTCGA GATGGTCGTC AGTCCGGACG CAGTGCCGGG GTCACGGATC ATCGCCGAAG TCATCCAGTC CGAGATGAAG GAGATCGGCA TCGACCTGGT GATCCGCTCG GTCGACCATG CTTCCAAGCA CACCGACATG CTGGAACAGA AGTACGACCT CGGCTTCTTC CTGACCTACG GCGCGCCTTA TGACCCGTTT GGCTCGATCG TCGGGCTGTG CCTGTCGACT TTCAAGAATG ATGTCGAGGG CAAGCTGGTT ACCGATCCGG TTAACCTCGA TCCGTTGATC AATGCGGCCA CGGCCGCAAC CGGAGACCAG ATCGAGCCGA CCATTCAGAA GGTCTACGAC TGGCTGCGCG ACAACGACGC GATTGCGCCG CTGGTCTACG TACCAAGCAT CTGGGCGCAT TCCAACCGGG TACAGGGCTT CACCAGTCCC GTCACCGAAT ACGACATGCC ATACGAAAAC ATCGTTTTGG CCGCCGAGTA G
|
Protein sequence | MTVNRRTFLQ GAFGAVGLVM AQGALSKLVY AQGASGTLRV AIAKPAGNLD PQSHYAIWAI QDLMFEPLVK YGRGGQIEPC LATDWKIEGG GKTLHLTLRQ GVTFQDGTKF DAAACKWNLE RWMGLDQFSW MNCSKYFESL EVVDDYHITL HFNEPVLALM QELSYTRPPR FLSPMSVGAD GKFKEPVGTG PWRQVKADDT ESAFERYDGY WGDKPSYERL EAKVIPDPRS RVAALRSGEI DLVGGFWIAP LTPEEAKQLE AAAVNVVVDP GNVTLVMAFN PDRAAALKDS QVRKAISIGI DRAAISQVLY HGYAKPAGNL FSSALPYAGK QHGAPVRDAA AASALLEKAG WTGGPIRSKD GKPLTLEMVV SPDAVPGSRI IAEVIQSEMK EIGIDLVIRS VDHASKHTDM LEQKYDLGFF LTYGAPYDPF GSIVGLCLST FKNDVEGKLV TDPVNLDPLI NAATAATGDQ IEPTIQKVYD WLRDNDAIAP LVYVPSIWAH SNRVQGFTSP VTEYDMPYEN IVLAAE
|
| |